KR20230123452A - 수송단백질 신규 변이체 및 이를 이용한 l-방향족 아미노산생산 방법 - Google Patents
수송단백질 신규 변이체 및 이를 이용한 l-방향족 아미노산생산 방법 Download PDFInfo
- Publication number
- KR20230123452A KR20230123452A KR1020230098039A KR20230098039A KR20230123452A KR 20230123452 A KR20230123452 A KR 20230123452A KR 1020230098039 A KR1020230098039 A KR 1020230098039A KR 20230098039 A KR20230098039 A KR 20230098039A KR 20230123452 A KR20230123452 A KR 20230123452A
- Authority
- KR
- South Korea
- Prior art keywords
- leu
- gly
- ala
- val
- ile
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims description 18
- 108010078791 Carrier Proteins Proteins 0.000 claims abstract description 83
- 150000001413 amino acids Chemical class 0.000 claims abstract description 50
- 238000004519 manufacturing process Methods 0.000 claims abstract description 33
- 229930024421 Adenine Natural products 0.000 claims abstract description 23
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 claims abstract description 23
- 229960000643 adenine Drugs 0.000 claims abstract description 23
- 102000014914 Carrier Proteins Human genes 0.000 claims description 76
- 229940024606 amino acid Drugs 0.000 claims description 60
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 52
- 229960005190 phenylalanine Drugs 0.000 claims description 28
- 108091033319 polynucleotide Proteins 0.000 claims description 16
- 102000040430 polynucleotide Human genes 0.000 claims description 16
- 239000002157 polynucleotide Substances 0.000 claims description 16
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 claims description 9
- 241000588722 Escherichia Species 0.000 claims description 8
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 claims description 8
- 238000012258 culturing Methods 0.000 claims description 6
- 108020004705 Codon Proteins 0.000 claims description 5
- 125000000430 tryptophan group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C2=C([H])C([H])=C([H])C([H])=C12 0.000 claims description 3
- IFGCUJZIWBUILZ-UHFFFAOYSA-N sodium 2-[[2-[[hydroxy-(3,4,5-trihydroxy-6-methyloxan-2-yl)oxyphosphoryl]amino]-4-methylpentanoyl]amino]-3-(1H-indol-3-yl)propanoic acid Chemical compound [Na+].C=1NC2=CC=CC=C2C=1CC(C(O)=O)NC(=O)C(CC(C)C)NP(O)(=O)OC1OC(C)C(O)C(O)C1O IFGCUJZIWBUILZ-UHFFFAOYSA-N 0.000 claims description 2
- FVTCRASFADXXNN-SCRDCRAPSA-N flavin mononucleotide Chemical compound OP(=O)(O)OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O FVTCRASFADXXNN-SCRDCRAPSA-N 0.000 abstract description 23
- 244000005700 microbiome Species 0.000 abstract description 7
- 230000004952 protein activity Effects 0.000 abstract description 2
- 102100039338 Aminomethyltransferase, mitochondrial Human genes 0.000 abstract 1
- 108050001492 Ammonium transporters Proteins 0.000 abstract 1
- 101710125177 FAD transporter Proteins 0.000 abstract 1
- 125000003275 alpha amino acid group Chemical group 0.000 abstract 1
- 108020004414 DNA Proteins 0.000 description 57
- 235000001014 amino acid Nutrition 0.000 description 56
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 48
- 108090000623 proteins and genes Proteins 0.000 description 41
- 239000002609 medium Substances 0.000 description 35
- 239000013598 vector Substances 0.000 description 34
- 229960004799 tryptophan Drugs 0.000 description 29
- 210000004027 cell Anatomy 0.000 description 25
- 239000012634 fragment Substances 0.000 description 25
- 241000588724 Escherichia coli Species 0.000 description 23
- VWWQXMAJTJZDQX-UYBVJOGSSA-N flavin adenine dinucleotide Chemical compound C1=NC2=C(N)N=CN=C2N1[C@@H]([C@H](O)[C@@H]1O)O[C@@H]1CO[P@](O)(=O)O[P@@](O)(=O)OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C2=NC(=O)NC(=O)C2=NC2=C1C=C(C)C(C)=C2 VWWQXMAJTJZDQX-UYBVJOGSSA-N 0.000 description 22
- 235000019162 flavin adenine dinucleotide Nutrition 0.000 description 22
- 239000011714 flavin adenine dinucleotide Substances 0.000 description 22
- FVTCRASFADXXNN-UHFFFAOYSA-N flavin mononucleotide Natural products OP(=O)(O)OCC(O)C(O)C(O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O FVTCRASFADXXNN-UHFFFAOYSA-N 0.000 description 22
- 229940013640 flavin mononucleotide Drugs 0.000 description 22
- 239000011768 flavin mononucleotide Substances 0.000 description 22
- 229940093632 flavin-adenine dinucleotide Drugs 0.000 description 22
- 235000019231 riboflavin-5'-phosphate Nutrition 0.000 description 22
- QGZKDVFQNNGYKY-UHFFFAOYSA-O Ammonium Chemical compound [NH4+] QGZKDVFQNNGYKY-UHFFFAOYSA-O 0.000 description 20
- 239000013612 plasmid Substances 0.000 description 20
- 108010037850 glycylvaline Proteins 0.000 description 18
- 239000007787 solid Substances 0.000 description 18
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Natural products NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 16
- 108010050848 glycylleucine Proteins 0.000 description 16
- 229930027917 kanamycin Natural products 0.000 description 14
- 229960000318 kanamycin Drugs 0.000 description 14
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 14
- 229930182823 kanamycin A Natural products 0.000 description 14
- 102220257821 rs770511115 Human genes 0.000 description 14
- 230000035772 mutation Effects 0.000 description 13
- 241000880493 Leptailurus serval Species 0.000 description 12
- 239000002773 nucleotide Substances 0.000 description 12
- 125000003729 nucleotide group Chemical group 0.000 description 12
- 101100054918 Escherichia coli (strain K12) adeQ gene Proteins 0.000 description 11
- 108010047495 alanylglycine Proteins 0.000 description 11
- 101150001904 amtB gene Proteins 0.000 description 11
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 10
- 239000001963 growth medium Substances 0.000 description 10
- 239000007788 liquid Substances 0.000 description 10
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 9
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 9
- 238000012408 PCR amplification Methods 0.000 description 9
- 238000000137 annealing Methods 0.000 description 9
- 238000004925 denaturation Methods 0.000 description 9
- 230000036425 denaturation Effects 0.000 description 9
- 230000000694 effects Effects 0.000 description 9
- 238000006116 polymerization reaction Methods 0.000 description 9
- 235000018102 proteins Nutrition 0.000 description 9
- 102000004169 proteins and genes Human genes 0.000 description 9
- 230000010076 replication Effects 0.000 description 9
- 230000009466 transformation Effects 0.000 description 9
- 238000011144 upstream manufacturing Methods 0.000 description 9
- 239000004471 Glycine Substances 0.000 description 8
- 108020005004 Guide RNA Proteins 0.000 description 8
- 238000010367 cloning Methods 0.000 description 8
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 8
- 108010051242 phenylalanylserine Proteins 0.000 description 8
- 239000000126 substance Substances 0.000 description 8
- 108010061238 threonyl-glycine Proteins 0.000 description 8
- 108010080629 tryptophan-leucine Proteins 0.000 description 8
- 229960000723 ampicillin Drugs 0.000 description 7
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 7
- -1 aromatic amino acids Chemical class 0.000 description 7
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 6
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 6
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 6
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 6
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 6
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 6
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 6
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 6
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 6
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 6
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 6
- 238000010276 construction Methods 0.000 description 6
- 230000014509 gene expression Effects 0.000 description 6
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 6
- 108010057821 leucylproline Proteins 0.000 description 6
- 108010005942 methionylglycine Proteins 0.000 description 6
- 101150035667 yeeO gene Proteins 0.000 description 6
- 101100323184 Aquifex aeolicus (strain VF5) amt gene Proteins 0.000 description 5
- 101100162669 Bacillus subtilis (strain 168) nrgA gene Proteins 0.000 description 5
- 101100431983 Escherichia coli (strain K12) yeeO gene Proteins 0.000 description 5
- CKLJMWTZIZZHCS-REOHCLBHSA-N aspartic acid group Chemical group N[C@@H](CC(=O)O)C(=O)O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 5
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 5
- 239000003550 marker Substances 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- 230000035897 transcription Effects 0.000 description 5
- 238000013518 transcription Methods 0.000 description 5
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 4
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 4
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 4
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 4
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 4
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 4
- VZCYOOQTPOCHFL-OWOJBTEDSA-N Fumaric acid Chemical compound OC(=O)\C=C\C(O)=O VZCYOOQTPOCHFL-OWOJBTEDSA-N 0.000 description 4
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 4
- XZRZILPOZBVTDB-GJZGRUSLSA-N Gly-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)CN)C(O)=O)=CNC2=C1 XZRZILPOZBVTDB-GJZGRUSLSA-N 0.000 description 4
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 4
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 4
- BKPPWVSPSIUXHZ-OSUNSFLBSA-N Ile-Met-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N BKPPWVSPSIUXHZ-OSUNSFLBSA-N 0.000 description 4
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 4
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 4
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 4
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 4
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 4
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 4
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 4
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 4
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 4
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 4
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 4
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 4
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 4
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 4
- NMOIRIIIUVELLY-WDSOQIARSA-N Trp-Val-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)C(C)C)=CNC2=C1 NMOIRIIIUVELLY-WDSOQIARSA-N 0.000 description 4
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 4
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 4
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 4
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 4
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 4
- 235000003704 aspartic acid Nutrition 0.000 description 4
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 4
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 4
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 4
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 230000001965 increasing effect Effects 0.000 description 4
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 4
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 4
- 229920001184 polypeptide Polymers 0.000 description 4
- 108090000765 processed proteins & peptides Proteins 0.000 description 4
- 102000004196 processed proteins & peptides Human genes 0.000 description 4
- 230000008685 targeting Effects 0.000 description 4
- 229960004441 tyrosine Drugs 0.000 description 4
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- 241000660147 Escherichia coli str. K-12 substr. MG1655 Species 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 108091092584 GDNA Proteins 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 3
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 3
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 3
- KWYUFKZDYYNOTN-UHFFFAOYSA-M Potassium hydroxide Chemical compound [OH-].[K+] KWYUFKZDYYNOTN-UHFFFAOYSA-M 0.000 description 3
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 3
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 229940041514 candida albicans extract Drugs 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 3
- 239000008103 glucose Substances 0.000 description 3
- 230000006872 improvement Effects 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 231100000350 mutagenesis Toxicity 0.000 description 3
- 229910052757 nitrogen Inorganic materials 0.000 description 3
- 238000001556 precipitation Methods 0.000 description 3
- 210000001236 prokaryotic cell Anatomy 0.000 description 3
- 238000001338 self-assembly Methods 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 239000012138 yeast extract Substances 0.000 description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- PJNSIUPOXFBHDM-GUBZILKMSA-N Ala-Arg-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O PJNSIUPOXFBHDM-GUBZILKMSA-N 0.000 description 2
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 2
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 2
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 2
- NFDVJAKFMXHJEQ-HERUPUMHSA-N Ala-Asp-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NFDVJAKFMXHJEQ-HERUPUMHSA-N 0.000 description 2
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 2
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 2
- RUQBGIMJQUWXPP-CYDGBPFRSA-N Ala-Leu-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O RUQBGIMJQUWXPP-CYDGBPFRSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 2
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 2
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 2
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 description 2
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 2
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 2
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 2
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 2
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 2
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 2
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 2
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 2
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 2
- NLXLAEXVIDQMFP-UHFFFAOYSA-N Ammonia chloride Chemical compound [NH4+].[Cl-] NLXLAEXVIDQMFP-UHFFFAOYSA-N 0.000 description 2
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 2
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 2
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 2
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 2
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 2
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 2
- IRRMIGDCPOPZJW-ULQDDVLXSA-N Arg-His-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IRRMIGDCPOPZJW-ULQDDVLXSA-N 0.000 description 2
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 2
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 2
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 2
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 2
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 2
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 2
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 2
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 2
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 2
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 2
- BIGRHVNFFJTHEB-UBHSHLNASA-N Asn-Trp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O BIGRHVNFFJTHEB-UBHSHLNASA-N 0.000 description 2
- CXBOKJPLEYUPGB-FXQIFTODSA-N Asp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N CXBOKJPLEYUPGB-FXQIFTODSA-N 0.000 description 2
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 2
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 2
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 2
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 2
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 2
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- BUIYOWKUSCTBRE-CIUDSAMLSA-N Cys-Arg-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O BUIYOWKUSCTBRE-CIUDSAMLSA-N 0.000 description 2
- XGIAHEUULGOZHH-GUBZILKMSA-N Cys-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N XGIAHEUULGOZHH-GUBZILKMSA-N 0.000 description 2
- UPURLDIGQGTUPJ-ZKWXMUAHSA-N Cys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N UPURLDIGQGTUPJ-ZKWXMUAHSA-N 0.000 description 2
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 2
- YKKHFPGOZXQAGK-QWRGUYRKSA-N Cys-Gly-Tyr Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YKKHFPGOZXQAGK-QWRGUYRKSA-N 0.000 description 2
- QCUJUETWTSWPNZ-NAKRPEOUSA-N Cys-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CS)N QCUJUETWTSWPNZ-NAKRPEOUSA-N 0.000 description 2
- HJXSYJVCMUOUNY-SRVKXCTJSA-N Cys-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N HJXSYJVCMUOUNY-SRVKXCTJSA-N 0.000 description 2
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 2
- XBWGJWXGUNSZAT-CIUDSAMLSA-N Gln-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N XBWGJWXGUNSZAT-CIUDSAMLSA-N 0.000 description 2
- DFRYZTUPVZNRLG-KKUMJFAQSA-N Gln-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DFRYZTUPVZNRLG-KKUMJFAQSA-N 0.000 description 2
- AKDOUBMVLRCHBD-SIUGBPQLSA-N Gln-Tyr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AKDOUBMVLRCHBD-SIUGBPQLSA-N 0.000 description 2
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 2
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 2
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 2
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 2
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- HOIPREWORBVRLD-XIRDDKMYSA-N Glu-Met-Trp Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O HOIPREWORBVRLD-XIRDDKMYSA-N 0.000 description 2
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 2
- GQGAFTPXAPKSCF-WHFBIAKZSA-N Gly-Ala-Cys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O GQGAFTPXAPKSCF-WHFBIAKZSA-N 0.000 description 2
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 2
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 2
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 2
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 2
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 2
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 2
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 2
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 2
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 2
- QGDOOCIPHSSADO-STQMWFEESA-N Gly-Met-Phe Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGDOOCIPHSSADO-STQMWFEESA-N 0.000 description 2
- UWQDKRIZSROAKS-FJXKBIBVSA-N Gly-Met-Thr Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWQDKRIZSROAKS-FJXKBIBVSA-N 0.000 description 2
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 2
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 2
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 2
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 2
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 2
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 2
- VHHYJBSXXMPQGZ-AVGNSLFASA-N His-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N VHHYJBSXXMPQGZ-AVGNSLFASA-N 0.000 description 2
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 2
- WYSJPCTWSBJFCO-AVGNSLFASA-N His-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N WYSJPCTWSBJFCO-AVGNSLFASA-N 0.000 description 2
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 2
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 2
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 2
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 2
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 2
- VZIFYHYNQDIPLI-HJWJTTGWSA-N Ile-Arg-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N VZIFYHYNQDIPLI-HJWJTTGWSA-N 0.000 description 2
- QYOGJYIRKACXEP-SLBDDTMCSA-N Ile-Asn-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N QYOGJYIRKACXEP-SLBDDTMCSA-N 0.000 description 2
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 2
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 2
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 2
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 2
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 2
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 2
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 2
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 2
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 2
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 2
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 2
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 2
- RENBRDSDKPSRIH-HJWJTTGWSA-N Ile-Phe-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O RENBRDSDKPSRIH-HJWJTTGWSA-N 0.000 description 2
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 2
- JDCQDJVYUXNCGF-SPOWBLRKSA-N Ile-Ser-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JDCQDJVYUXNCGF-SPOWBLRKSA-N 0.000 description 2
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 2
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 2
- RWHRUZORDWZESH-ZQINRCPSSA-N Ile-Trp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RWHRUZORDWZESH-ZQINRCPSSA-N 0.000 description 2
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 2
- SWNRZNLXMXRCJC-VKOGCVSHSA-N Ile-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 SWNRZNLXMXRCJC-VKOGCVSHSA-N 0.000 description 2
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 2
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 2
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 2
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 2
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 2
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 2
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 2
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 2
- TVEOVCYCYGKVPP-HSCHXYMDSA-N Leu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N TVEOVCYCYGKVPP-HSCHXYMDSA-N 0.000 description 2
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 2
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 2
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 2
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 2
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 2
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 2
- WXDRGWBQZIMJDE-ULQDDVLXSA-N Leu-Phe-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O WXDRGWBQZIMJDE-ULQDDVLXSA-N 0.000 description 2
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 2
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 2
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 2
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 2
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 2
- HOMFINRJHIIZNJ-HOCLYGCPSA-N Leu-Trp-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O HOMFINRJHIIZNJ-HOCLYGCPSA-N 0.000 description 2
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 2
- QQXJROOJCMIHIV-AVGNSLFASA-N Leu-Val-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O QQXJROOJCMIHIV-AVGNSLFASA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 2
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 2
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 2
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 2
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 2
- ZAENPHCEQXALHO-GUBZILKMSA-N Lys-Cys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZAENPHCEQXALHO-GUBZILKMSA-N 0.000 description 2
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 2
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 2
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 2
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 2
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 2
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 2
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 2
- FRWZTWWOORIIBA-FXQIFTODSA-N Met-Asn-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FRWZTWWOORIIBA-FXQIFTODSA-N 0.000 description 2
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 2
- ZEVPMOHYCQFWSE-NAKRPEOUSA-N Met-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N ZEVPMOHYCQFWSE-NAKRPEOUSA-N 0.000 description 2
- GETCJHFFECHWHI-QXEWZRGKSA-N Met-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCSC)N GETCJHFFECHWHI-QXEWZRGKSA-N 0.000 description 2
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 description 2
- OIFHHODAXVWKJN-ULQDDVLXSA-N Met-Phe-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 OIFHHODAXVWKJN-ULQDDVLXSA-N 0.000 description 2
- SOAYQFDWEIWPPR-IHRRRGAJSA-N Met-Ser-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SOAYQFDWEIWPPR-IHRRRGAJSA-N 0.000 description 2
- WXJLBSXNUHIGSS-OSUNSFLBSA-N Met-Thr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WXJLBSXNUHIGSS-OSUNSFLBSA-N 0.000 description 2
- KLGIQJRMFHIGCQ-ZFWWWQNUSA-N Met-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(=O)NCC(O)=O)=CNC2=C1 KLGIQJRMFHIGCQ-ZFWWWQNUSA-N 0.000 description 2
- UXJHNUBJSQQIOC-SZMVWBNQSA-N Met-Trp-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O UXJHNUBJSQQIOC-SZMVWBNQSA-N 0.000 description 2
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- MDHZEOMXGNBSIL-DLOVCJGASA-N Phe-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MDHZEOMXGNBSIL-DLOVCJGASA-N 0.000 description 2
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 2
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 2
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 2
- LNIIRLODKOWQIY-IHRRRGAJSA-N Phe-Asn-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LNIIRLODKOWQIY-IHRRRGAJSA-N 0.000 description 2
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 2
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 2
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 2
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 2
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 2
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 2
- FINLZXKJWTYYLC-ACRUOGEOSA-N Phe-His-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FINLZXKJWTYYLC-ACRUOGEOSA-N 0.000 description 2
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 2
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 2
- KLXQWABNAWDRAY-ACRUOGEOSA-N Phe-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 KLXQWABNAWDRAY-ACRUOGEOSA-N 0.000 description 2
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 2
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 2
- FENSZYFJQOFSQR-FIRPJDEBSA-N Phe-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FENSZYFJQOFSQR-FIRPJDEBSA-N 0.000 description 2
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 2
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 2
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 2
- KCIKTPHTEYBXMG-BVSLBCMMSA-N Phe-Trp-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCIKTPHTEYBXMG-BVSLBCMMSA-N 0.000 description 2
- ZVJGAXNBBKPYOE-HKUYNNGSSA-N Phe-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 ZVJGAXNBBKPYOE-HKUYNNGSSA-N 0.000 description 2
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 2
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 2
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- 239000002202 Polyethylene glycol Substances 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 2
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 2
- WECYCNFPGZLOOU-FXQIFTODSA-N Pro-Asn-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O WECYCNFPGZLOOU-FXQIFTODSA-N 0.000 description 2
- QXNSKJLSLYCTMT-FXQIFTODSA-N Pro-Cys-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O QXNSKJLSLYCTMT-FXQIFTODSA-N 0.000 description 2
- CMOIIANLNNYUTP-SRVKXCTJSA-N Pro-Gln-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CMOIIANLNNYUTP-SRVKXCTJSA-N 0.000 description 2
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 2
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 2
- ZTMLZUNPFDGPKY-VKOGCVSHSA-N Pro-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ZTMLZUNPFDGPKY-VKOGCVSHSA-N 0.000 description 2
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 2
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 2
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 2
- AUYKOPJPKUCYHE-SRVKXCTJSA-N Pro-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 AUYKOPJPKUCYHE-SRVKXCTJSA-N 0.000 description 2
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 2
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 2
- 241000588733 Pseudescherichia vulneris Species 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 2
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 2
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 2
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 2
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 2
- TVPQRPNBYCRRLL-IHRRRGAJSA-N Ser-Phe-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O TVPQRPNBYCRRLL-IHRRRGAJSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 2
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- VOGXLRKCWFLJBY-HSHDSVGOSA-N Thr-Arg-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VOGXLRKCWFLJBY-HSHDSVGOSA-N 0.000 description 2
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 2
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 2
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 2
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 2
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 2
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 2
- GUHLYMZJVXUIPO-RCWTZXSCSA-N Thr-Met-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GUHLYMZJVXUIPO-RCWTZXSCSA-N 0.000 description 2
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 2
- NBIIPOKZPUGATB-BWBBJGPYSA-N Thr-Ser-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O NBIIPOKZPUGATB-BWBBJGPYSA-N 0.000 description 2
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 2
- 108091023040 Transcription factor Proteins 0.000 description 2
- 102000040945 Transcription factor Human genes 0.000 description 2
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 2
- UKWSFUSPGPBJGU-VFAJRCTISA-N Trp-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O UKWSFUSPGPBJGU-VFAJRCTISA-N 0.000 description 2
- KWTRGSQOQHZKIA-PMVMPFDFSA-N Trp-Lys-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CCCCN)C(O)=O)C1=CC=C(O)C=C1 KWTRGSQOQHZKIA-PMVMPFDFSA-N 0.000 description 2
- ACGIVBXINJFALS-HKUYNNGSSA-N Trp-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N ACGIVBXINJFALS-HKUYNNGSSA-N 0.000 description 2
- MXKUGFHWYYKVDV-SZMVWBNQSA-N Trp-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(C)C)C(O)=O MXKUGFHWYYKVDV-SZMVWBNQSA-N 0.000 description 2
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 2
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 2
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 2
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 2
- FJBCEFPCVPHPPM-STECZYCISA-N Tyr-Ile-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O FJBCEFPCVPHPPM-STECZYCISA-N 0.000 description 2
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 2
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 2
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 2
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 2
- AKRHKDCELJLTMD-BVSLBCMMSA-N Tyr-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N AKRHKDCELJLTMD-BVSLBCMMSA-N 0.000 description 2
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- JFAWZADYPRMRCO-UBHSHLNASA-N Val-Ala-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JFAWZADYPRMRCO-UBHSHLNASA-N 0.000 description 2
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 2
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 2
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 2
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 2
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 2
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 2
- ICFRWCLVYFKHJV-FXQIFTODSA-N Val-Cys-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N ICFRWCLVYFKHJV-FXQIFTODSA-N 0.000 description 2
- FBVUOEYVGNMRMD-NAKRPEOUSA-N Val-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N FBVUOEYVGNMRMD-NAKRPEOUSA-N 0.000 description 2
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 2
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 2
- OXGVAUFVTOPFFA-XPUUQOCRSA-N Val-Gly-Cys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OXGVAUFVTOPFFA-XPUUQOCRSA-N 0.000 description 2
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 2
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 2
- DLMNFMXSNGTSNJ-PYJNHQTQSA-N Val-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N DLMNFMXSNGTSNJ-PYJNHQTQSA-N 0.000 description 2
- ZTKGDWOUYRRAOQ-ULQDDVLXSA-N Val-His-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N ZTKGDWOUYRRAOQ-ULQDDVLXSA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- PYXQBKJPHNCTNW-CYDGBPFRSA-N Val-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N PYXQBKJPHNCTNW-CYDGBPFRSA-N 0.000 description 2
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 2
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 2
- MLADEWAIYAPAAU-IHRRRGAJSA-N Val-Lys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MLADEWAIYAPAAU-IHRRRGAJSA-N 0.000 description 2
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 2
- JMCOXFSCTGKLLB-FKBYEOEOSA-N Val-Phe-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JMCOXFSCTGKLLB-FKBYEOEOSA-N 0.000 description 2
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 2
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- SDHZOOIGIUEPDY-JYJNAYRXSA-N Val-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 SDHZOOIGIUEPDY-JYJNAYRXSA-N 0.000 description 2
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 2
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 2
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 2
- RFZFBOQPPFCOKG-BZSNNMDCSA-N Val-Trp-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N RFZFBOQPPFCOKG-BZSNNMDCSA-N 0.000 description 2
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 2
- WHNSHJJNWNSTSU-BZSNNMDCSA-N Val-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 WHNSHJJNWNSTSU-BZSNNMDCSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 108010081404 acein-2 Proteins 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 108010011559 alanylphenylalanine Proteins 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010036533 arginylvaline Proteins 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 2
- 239000001506 calcium phosphate Substances 0.000 description 2
- 229910000389 calcium phosphate Inorganic materials 0.000 description 2
- 235000011010 calcium phosphates Nutrition 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 238000012136 culture method Methods 0.000 description 2
- 108010004073 cysteinylcysteine Proteins 0.000 description 2
- 235000014113 dietary fatty acids Nutrition 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 229930195729 fatty acid Natural products 0.000 description 2
- 239000000194 fatty acid Substances 0.000 description 2
- 150000004665 fatty acids Chemical class 0.000 description 2
- 239000001530 fumaric acid Substances 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- IPCSVZSSVZVIGE-UHFFFAOYSA-N hexadecanoic acid Chemical compound CCCCCCCCCCCCCCCC(O)=O IPCSVZSSVZVIGE-UHFFFAOYSA-N 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 230000001976 improved effect Effects 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 2
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 108010085203 methionylmethionine Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 150000007523 nucleic acids Chemical group 0.000 description 2
- 239000001301 oxygen Substances 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- DPJRMOMPQZCRJU-UHFFFAOYSA-M thiamine hydrochloride Chemical compound Cl.[Cl-].CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N DPJRMOMPQZCRJU-UHFFFAOYSA-M 0.000 description 2
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 2
- VZCYOOQTPOCHFL-UHFFFAOYSA-N trans-butenedioic acid Natural products OC(=O)C=CC(O)=O VZCYOOQTPOCHFL-UHFFFAOYSA-N 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 2
- 108010029384 tryptophyl-histidine Proteins 0.000 description 2
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 2
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- PAWQVTBBRAZDMG-UHFFFAOYSA-N 2-(3-bromo-2-fluorophenyl)acetic acid Chemical compound OC(=O)CC1=CC=CC(Br)=C1F PAWQVTBBRAZDMG-UHFFFAOYSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- ATRRKUHOCOJYRX-UHFFFAOYSA-N Ammonium bicarbonate Chemical compound [NH4+].OC([O-])=O ATRRKUHOCOJYRX-UHFFFAOYSA-N 0.000 description 1
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 1
- 239000004254 Ammonium phosphate Substances 0.000 description 1
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 1
- 241000588732 Atlantibacter hermannii Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 241000193388 Bacillus thuringiensis Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 239000002028 Biomass Substances 0.000 description 1
- 108091033409 CRISPR Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 108091006146 Channels Proteins 0.000 description 1
- 102000034573 Channels Human genes 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N DMSO Substances CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 241001240954 Escherichia albertii Species 0.000 description 1
- 241001522878 Escherichia coli B Species 0.000 description 1
- 241000672609 Escherichia coli BL21 Species 0.000 description 1
- 241001302584 Escherichia coli str. K-12 substr. W3110 Species 0.000 description 1
- 241000588720 Escherichia fergusonii Species 0.000 description 1
- 229930091371 Fructose Natural products 0.000 description 1
- 239000005715 Fructose Substances 0.000 description 1
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 101000600434 Homo sapiens Putative uncharacterized protein encoded by MIR7-3HG Proteins 0.000 description 1
- 101100321817 Human parvovirus B19 (strain HV) 7.5K gene Proteins 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- WHXSMMKQMYFTQS-UHFFFAOYSA-N Lithium Chemical compound [Li] WHXSMMKQMYFTQS-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- FWAHLGXNBLWIKB-NAKRPEOUSA-N Met-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCSC FWAHLGXNBLWIKB-NAKRPEOUSA-N 0.000 description 1
- OBPCXINRFKHSRY-SDDRHHMPSA-N Met-Met-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N OBPCXINRFKHSRY-SDDRHHMPSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 235000021314 Palmitic acid Nutrition 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 1
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 102100037401 Putative uncharacterized protein encoded by MIR7-3HG Human genes 0.000 description 1
- 101100221606 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) COS7 gene Proteins 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- 241000607720 Serratia Species 0.000 description 1
- 241000588717 Shimwellia blattae Species 0.000 description 1
- 235000019764 Soybean Meal Nutrition 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 235000021355 Stearic acid Nutrition 0.000 description 1
- 241000193996 Streptococcus pyogenes Species 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- 235000019486 Sunflower oil Nutrition 0.000 description 1
- ASJDFGOPDCVXTG-KATARQTJSA-N Thr-Cys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ASJDFGOPDCVXTG-KATARQTJSA-N 0.000 description 1
- TZQWJCGVCIJDMU-HEIBUPTGSA-N Thr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N)O TZQWJCGVCIJDMU-HEIBUPTGSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- IVBJBFSWJDNQFW-XIRDDKMYSA-N Trp-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IVBJBFSWJDNQFW-XIRDDKMYSA-N 0.000 description 1
- WMIUTJPFHMMUGY-ZFWWWQNUSA-N Trp-Pro-Gly Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)NCC(=O)O WMIUTJPFHMMUGY-ZFWWWQNUSA-N 0.000 description 1
- YOTRXXBHTZHKLU-BVSLBCMMSA-N Tyr-Trp-Met Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCSC)C(O)=O)C1=CC=C(O)C=C1 YOTRXXBHTZHKLU-BVSLBCMMSA-N 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- WHVSJHJTMUHYBT-SRVKXCTJSA-N Val-Met-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N WHVSJHJTMUHYBT-SRVKXCTJSA-N 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 229910021529 ammonia Inorganic materials 0.000 description 1
- 239000001099 ammonium carbonate Substances 0.000 description 1
- 235000012501 ammonium carbonate Nutrition 0.000 description 1
- 235000019270 ammonium chloride Nutrition 0.000 description 1
- 239000000908 ammonium hydroxide Substances 0.000 description 1
- 229910000148 ammonium phosphate Inorganic materials 0.000 description 1
- 235000019289 ammonium phosphates Nutrition 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 238000012870 ammonium sulfate precipitation Methods 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 239000002518 antifoaming agent Substances 0.000 description 1
- 229940097012 bacillus thuringiensis Drugs 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 1
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 239000004359 castor oil Substances 0.000 description 1
- 235000019438 castor oil Nutrition 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 235000010980 cellulose Nutrition 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000003240 coconut oil Substances 0.000 description 1
- 235000019864 coconut oil Nutrition 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- ARUVKPQLZAKDPS-UHFFFAOYSA-L copper(II) sulfate Chemical compound [Cu+2].[O-][S+2]([O-])([O-])[O-] ARUVKPQLZAKDPS-UHFFFAOYSA-L 0.000 description 1
- 229910000366 copper(II) sulfate Inorganic materials 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 239000013601 cosmid vector Substances 0.000 description 1
- 238000002425 crystallisation Methods 0.000 description 1
- 230000008025 crystallization Effects 0.000 description 1
- MNNHAPBLZZVQHP-UHFFFAOYSA-N diammonium hydrogen phosphate Chemical compound [NH4+].[NH4+].OP([O-])([O-])=O MNNHAPBLZZVQHP-UHFFFAOYSA-N 0.000 description 1
- ZPWVASYFFYYZEW-UHFFFAOYSA-L dipotassium hydrogen phosphate Chemical compound [K+].[K+].OP([O-])([O-])=O ZPWVASYFFYYZEW-UHFFFAOYSA-L 0.000 description 1
- 238000007599 discharging Methods 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000003797 essential amino acid Substances 0.000 description 1
- 235000020776 essential amino acid Nutrition 0.000 description 1
- 238000001704 evaporation Methods 0.000 description 1
- 230000008020 evaporation Effects 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000005194 fractionation Methods 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 125000000404 glutamine group Chemical group N[C@@H](CCC(N)=O)C(=O)* 0.000 description 1
- ZEMPKEQAKRGZGQ-XOQCFJPHSA-N glycerol triricinoleate Natural products CCCCCC[C@@H](O)CC=CCCCCCCCC(=O)OC[C@@H](COC(=O)CCCCCCCC=CC[C@@H](O)CCCCCC)OC(=O)CCCCCCCC=CC[C@H](O)CCCCCC ZEMPKEQAKRGZGQ-XOQCFJPHSA-N 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 239000003630 growth substance Substances 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 150000002484 inorganic compounds Chemical class 0.000 description 1
- 229910010272 inorganic material Inorganic materials 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 229910000358 iron sulfate Inorganic materials 0.000 description 1
- BAUYGSIQEAFULO-UHFFFAOYSA-L iron(2+) sulfate (anhydrous) Chemical compound [Fe+2].[O-]S([O-])(=O)=O BAUYGSIQEAFULO-UHFFFAOYSA-L 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 229910052744 lithium Inorganic materials 0.000 description 1
- 230000033001 locomotion Effects 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 238000009629 microbiological culture Methods 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- WQEPLUUGTLDZJY-UHFFFAOYSA-N n-Pentadecanoic acid Natural products CCCCCCCCCCCCCCC(O)=O WQEPLUUGTLDZJY-UHFFFAOYSA-N 0.000 description 1
- QIQXTHQIDYTFRH-UHFFFAOYSA-N octadecanoic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O QIQXTHQIDYTFRH-UHFFFAOYSA-N 0.000 description 1
- OQCDKBAXFALNLD-UHFFFAOYSA-N octadecanoic acid Natural products CCCCCCCC(C)CCCCCCCCC(O)=O OQCDKBAXFALNLD-UHFFFAOYSA-N 0.000 description 1
- 235000014593 oils and fats Nutrition 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 230000000149 penetrating effect Effects 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 1
- PJNZPQUBCPKICU-UHFFFAOYSA-N phosphoric acid;potassium Chemical compound [K].OP(O)(O)=O PJNZPQUBCPKICU-UHFFFAOYSA-N 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 239000011574 phosphorus Substances 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920001522 polyglycol ester Polymers 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 239000013587 production medium Substances 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000004455 soybean meal Substances 0.000 description 1
- 239000003549 soybean oil Substances 0.000 description 1
- 235000012424 soybean oil Nutrition 0.000 description 1
- 238000005507 spraying Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008117 stearic acid Substances 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 239000002600 sunflower oil Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 239000011573 trace mineral Substances 0.000 description 1
- 235000013619 trace mineral Nutrition 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/24—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Enterobacteriaceae (F), e.g. Citrobacter, Serratia, Proteus, Providencia, Morganella, Yersinia
- C07K14/245—Escherichia (G)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P13/00—Preparation of nitrogen-containing organic compounds
- C12P13/04—Alpha- or beta- amino acids
- C12P13/22—Tryptophan; Tyrosine; Phenylalanine; 3,4-Dihydroxyphenylalanine
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P13/00—Preparation of nitrogen-containing organic compounds
- C12P13/04—Alpha- or beta- amino acids
- C12P13/22—Tryptophan; Tyrosine; Phenylalanine; 3,4-Dihydroxyphenylalanine
- C12P13/222—Phenylalanine
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P13/00—Preparation of nitrogen-containing organic compounds
- C12P13/04—Alpha- or beta- amino acids
- C12P13/22—Tryptophan; Tyrosine; Phenylalanine; 3,4-Dihydroxyphenylalanine
- C12P13/225—Tyrosine; 3,4-Dihydroxyphenylalanine
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P13/00—Preparation of nitrogen-containing organic compounds
- C12P13/04—Alpha- or beta- amino acids
- C12P13/22—Tryptophan; Tyrosine; Phenylalanine; 3,4-Dihydroxyphenylalanine
- C12P13/227—Tryptophan
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12R—INDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
- C12R2001/00—Microorganisms ; Processes using microorganisms
- C12R2001/01—Bacteria or Actinomycetales ; using bacteria or Actinomycetales
- C12R2001/185—Escherichia
- C12R2001/19—Escherichia coli
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
본 발명은 수송단백질 신규 변이체 및 이를 이용한 L-방향족 아미노산 생산 방법에 관한 것으로, 상기 수송단백질 변이체는 암모늄 수송단백질, 아데닌 수송단백질 또는 FMN/FAD 수송단백질을 구성하는 아미노산 서열 중 하나 이상의 아미노산이 치환됨으로써 단백질 활성이 변화되어, 이를 포함하는 재조합 미생물은 L-방향족 아미노산을 효율적으로 생산할 수 있다.
Description
본 발명은 수송단백질 신규 변이체 및 이를 이용한 L-방향족 아미노산 생산 방법에 관한 것이다.
아미노산은 곁사슬의 성질에 따라 소수성, 친수성, 염기성 및 산성 아미노산으로 구분되며, 이들 중 벤젠 고리를 갖는 아미노산을 방향족 아미노산이라 한다. 방향족 아미노산에는 페닐알라닌, 티로신 및 트립토판이 있으며, 페닐알라닌과 트립토판은 생체 내에서 합성되지 않는 필수 아미노산으로 전세계적으로 연간 3000억 달러 규모의 시장을 형성하고 있는 고부가가치 산업에 해당한다.
방향족 아미노산의 생산은 자연상태에서 수득된 야생형 균주나 이의 아미노산 생산능이 향상되도록 변형된 변이주를 이용할 수 있다. 최근에는 방향족 아미노산의 생산 효율을 개선시키기 위해 L-아미노산과 같은 유용물질 생산에 많이 이용되는 대장균, 코리네박테리움 등의 미생물을 대상으로 유전자 재조합 기술을 적용하여 우수한 L-방향족 아미노산 생산능을 갖는 다양한 재조합 균주 또는 변이주 및 이를 이용한 L-방향족 아미노산 생산 방법이 개발되고 있다. 특히 L-방향족 아미노산의 생합성 경로에 관여하는 효소, 전사인자, 수송 단백질 등의 유전자를 대상으로 하거나, 또는 이들의 발현을 조절하는 프로모터에 변이를 유도하여 해당 아미노산의 생산량을 증대시키려는 시도가 있었다. 그러나 L-방향족 아미노산 생산에 직간접적으로 연관된 효소, 전사인자, 수송 단백질 등 단백질의 종류가 수십여 종에 이르기 때문에 이러한 단백질의 활성 변화에 따른 L-방향족 아미노산 생산능 증가 여부에 관해 여전히 많은 연구가 필요한 실정이다.
본 발명은 신규한 수송단백질 변이체를 제공하는 것을 목적으로 한다.
또한, 본 발명은 상기 변이체를 암호화하는 폴리뉴클레오티드를 제공한다.
또한, 본 발명은 상기 변이체 또는 폴리뉴클레오티드를 포함하는 형질전환체를 제공한다.
또한, 본 발명은 상기 형질전환체를 이용한 L-방향족 아미노산의 생산 방법을 제공하는 것을 목적으로 한다.
본 발명의 일 양상은 서열번호 3의 아미노산 서열에서 363번째 글리신이 아스파르트산으로 치환된, 서열번호 1의 아미노산 서열로 이루어진 암모늄 수송단백질 변이체; 서열번호 7의 아미노산 서열에서 136번째 트립토판이 종결 코돈으로 치환된, 서열번호 5의 아미노산 서열로 이루어진 아데닌 수송단백질 변이체; 및 서열번호 11의 아미노산 서열에서 272번째 글리신이 글루타민으로 치환된, 서열번호 9의 아미노산 서열로 이루어진 FMN/FAD 수송단백질 변이체로 이루어진 군에서 선택된 변이체를 제공한다.
본 발명에서 사용된 ”수송단백질(transporter)”은 세포막을 통해 이온, 화학물질, 단백질 등 다양한 물질의 이동을 담당하는 단백질의 총칭으로, 수송단백질은 수송되는 물질에 대한 특이성을 가지고 있다. 이러한 수송단백질은 크게 막상에 내부와 외부를 관통하는 구멍을 형성하여 물질의 이동 통로가 되는 통로 단백질과 단백질의 구조적 변형을 통해 운반 물질의 결합부위를 막을 가로질러 이동시키는 수송체와 같은 운반체 단백질로 구분된다.
본 발명에서 사용된 "암모H 수송단백질”, "아데닌 수송단백질” 및 " FMN/FAD 수송단백질”은 모두 운반체 단백질에 해당되며, 각각 암모늄, 아데닌 및 FMN(flavin mononucleotide)/FAD(flavin adenine dinucleotide)만을 특이적으로 외부로 배출시키는 특성을 가진다.
상기 수송단백질은 각 수송단백질을 암호화하는 유전자 또는 이와 실질적 동일성을 가지는 서열일 수 있다. 여기서 “실질적 동일성”이란 각각의 유전자 서열, 즉 염기서열 또는 뉴클레오티드 서열과 임의의 다른 뉴클레오티드 서열을 최대한 대응되도록 정렬하여 분석하였을 때 상기 임의의 다른 뉴클레오티드 서열이 각각의 뉴클레오티드 서열과 70% 이상, 80% 이상, 90% 이상 또는 98% 이상의 서열 상동성을 가지는 것을 의미한다.
본 발명에서의 암모늄 수송단백질은 amtB 유전자에 의해 암호화된 것으로, 서열번호 3의 아미노산 서열을 포함한다.
본 발명의 일 구체예에 따르면, 상기 서열번호 3의 아미노산 서열은 야생형 대장균에서 유래한 것일 수 있다.
본 발명에서의 아데닌 수송단백질은 yicO 유전자에 의해 암호화된 것으로, 서열번호 7의 아미노산 서열을 포함한다.
본 발명의 일 구체예에 따르면, 상기 서열번호 7의 아미노산 서열은 야생형 대장균에서 유래한 것일 수 있다.
본 발명에서의 FMN/FAD 수송단백질은 yeeO 유전자에 의해 암호화된 것으로, 서열번호 11의 아미노산 서열을 포함한다.
본 발명의 일 구체예에 따르면, 상기 서열번호 11의 아미노산 서열은 야생형 대장균에서 유래한 것일 수 있다.
본 발명에서 사용된 “변이체”는 특정 유전자의 아미노산 서열 중 N-말단, C-말단 및/또는 내부에서 하나 이상의 아미노산이 보존적 치환(conservative substitution) 및/또는 변형(modification)되어 상기 변이체의 변이 전 아미노산 서열과 상이하나 기능(functions) 또는 특성(properties)이 유지되는 폴리펩티드를 의미한다. 여기서 “보존적 치환”이란 하나의 아미노산을 구조적 및/또는 화학적 성질이 유사한 다른 아미노산으로 치환시키는 것을 의미하며, 단백질 또는 폴리펩티드의 활성에 거의 영향을 미치지 않거나, 또는 전혀 영향을 미치지 않을 수 있다. 또한, 변이체는 N-말단 리더 서열 또는 막전이 도메인(transmembrane domain)과 같은 하나 이상의 부분이 제거되거나, 또는 성숙 단백질(mature protein)의 N- 및/또는 C-말단으로부터 일부분이 제거된 것을 포함한다. 이러한 변이체는 그 능력이 변이 전 폴리펩티드에 비하여 증가되거나, 변하지 않거나, 또는 감소될 수 있다. 본 발명에서는 변이체가 변이형, 변형, 변이형 폴리펩티드, 변이된 단백질, 변이 등과 혼용될 수 있다.
본 발명에서의 변이체는 서열번호 3의 아미노산 서열에서 363번째 위치한 아미노산인 글리신이 아스파르트산으로 치환된 암모늄 수송단백질 변이체로, 서열번호 1의 아미노산 서열로 이루어진 것일 수 있다.
또한, 본 발명에서의 변이체는 서열번호 7의 아미노산 서열에서 136번째 위치한 아미노산인 트립토판이 종결 코돈으로 치환된 아데닌 수송단백질 변이체로, 서열번호 5의 아미노산 서열로 이루어진 것일 수 있다.
또한, 본 발명에서의 변이체는 서열번호 11의 아미노산 서열에서 272번째 위치한 아미노산인 글리신이 글루타민으로 치환된 FMN/FAD 수송단백질 변이체로, 서열번호 9의 아미노산 서열로 이루어진 것일 수 있다.
본 발명의 다른 양상은 상기 암모늄 수송단백질 변이체, 아데닌 수송단백질 변이체 또는 FMN/FAD 수송단백질 변이체를 암호화하는 폴리뉴클레오티드를 제공한다.
본 발명에서 사용된 “폴리뉴클레오티드(polynucleotide)”는 뉴클레오티드 단위체(monomer)가 공유결합에 의해 길게 사슬모양으로 이어진 뉴클레오티드의 중합체(polymer)로 일정한 길이 이상의 DNA 또는 RNA 가닥으로서, 보다 구체적으로는 상기 변이체를 암호화하는 폴리뉴클레오티드 단편을 의미한다.
상기 폴리뉴클레오티드는 서열번호 1, 5 또는 9의 아미노산 서열을 암호화하는 염기서열을 포함할 수 있다.
본 발명의 일 구체예에 따르면, 상기 폴리뉴클레오티드는 서열번호 2, 6 또는 10으로 표시되는 염기서열을 포함하는 것일 수 있다.
보다 구체적으로, 상기 암모늄 수송단백질 변이체를 암호화하는 폴리뉴클레오티드는 서열번호 2로 표시되는 염기서열을 포함하고, 상기 아데닌 수송단백질 변이체를 암호화하는 폴리뉴클레오티드는 서열번호 6으로 표시되는 염기서열을 포함하고, 상기 FMN/FAD 수송단백질 변이체를 암호화하는 폴리뉴클레오티드는 서열번호 10으로 표시되는 염기서열을 포함한다.
본 발명의 다른 양상은 상기 암모늄 수송단백질 변이체, 아데닌 수송단백질 변이체 또는 FMN/FAD 수송단백질 변이체를 암호화하는 폴리뉴클레오티드를 포함하는 벡터를 제공한다.
또한, 본 발명의 다른 양상은 상기 암모늄 수송단백질 변이체, 아데닌 수송단백질 변이체 또는 FMN/FAD 수송단백질 변이체, 또는 이의 폴리뉴클레오티드를 포함하는 형질전환체를 제공한다.
본 발명에서 사용된 “벡터(vector)”는 숙주세포에 목적 유전자를 전달하여 발현시키기 위한 수단으로 사용되는 모든 유형의 핵산 서열 운반 구조체를 의미한다. 상기 벡터는 특별한 언급이 없는 한, 담지된 핵산 서열이 숙주세포 유전체 내 삽입되어 발현되도록 하는 것 및/또는 독자적으로 발현되도록 하는 것을 의미할 수 있다. 이러한 벡터는 유전자 삽입물이 발현되도록 작동가능하게 연결된 필수적인 조절요소를 포함하며, “작동가능하게 연결된(operably linked)”이란 목적 유전자와 이의 조절 서열이 서로 기능적으로 결합되어 유전자 발현을 가능케 하는 방식으로 연결된 것을 의미하고, “조절요소”는 전사를 수행하기 위한 프로모터, 전사를 조절하기 위한 임의의 오퍼레이터 서열, 적합한 mRNA 리보좀 결합 부위를 암호화하는 서열, 및 전사 및 해독의 종결을 조절하는 서열을 포함한다.
본 발명에서 사용되는 벡터는 숙주세포 내에서 복제 가능한 것이라면 특별히 한정되지 않으며, 당업계에 알려진 임의의 벡터를 이용할 수 있다. 상기 벡터의 일례로는 천연 상태이거나 재조합된 상태의 플라스미드, 코스미드, 바이러스 및 박테리오파지를 들 수 있다. 예를 들면, 파지 벡터 또는 코스미드 벡터로는 pWE15, M13, λMBL3, λMBL4, λIXII, λASHII, λAPII, λt10, λt11, Charon4A, Charon21A 등이 있으며, 플라스미드 벡터로는 pBR계, pUC계, pBluescriptII계, pGEM계, pTZ계, pCL계 및 pET계 등이 있으나, 이에 한정되는 것은 않는다.
상기 벡터는 전형적으로 클로닝을 위한 벡터 또는 발현을 위한 벡터로서 구축될 수 있다. 발현을 위한 벡터는 당업계에서 식물, 동물 또는 미생물에서 외래의 유전자 또는 단백질을 발현하는데 사용되는 통상의 것을 사용할 수 있으며, 당업계에 공지된 다양한 방법을 통해 구축될 수 있다.
재조합 벡터는 원핵세포 또는 진핵세포를 숙주로 하여 구축될 수 있다. 예를 들어, 사용되는 벡터가 발현 벡터이고 원핵세포를 숙주로 하는 경우에는, 전사를 진행시킬 수 있는 강력한 프로모터 (예컨대, pLλ프로모터, CMV 프로모터, trp 프로모터, lac 프로모터, tac 프로모터, T7 프로모터), 해독의 개시를 위한 라이보좀 결합 자리 및 전사/해독 종결 서열을 포함하는 것이 일반적이다. 진핵세포를 숙주로 하는 경우에는, 벡터에 포함되는 진핵세포에서 작동하는 복제원점은 f1 복제원점, SV40 복제원점, pMB1 복제원점, 아데노 복제원점, AAV 복제원점 및 BBV 복제원점 등을 포함하나, 이에 한정되는 것은 아니다. 또한, 포유동물 세포의 게놈으로부터 유래된 프로모터 (예컨대, 메탈로 티오닌 프로모터) 또는 포유동물 바이러스로부터 유래된 프로모터 (예컨대, 아데노 바이러스 후기 프로모터, 백시니아 바이러스 7.5K 프로모터, SV40 프로모터, 사이토 메갈로 바이러스 프로모터, HSV의 tk 프로모터)가 이용될 수 있으며, 전사 종결 서열로서 폴리아데닐화 서열을 일반적으로 가진다.
상기 재조합 벡터는 선택 마커(selection marker)를 포함할 수 있는데, 상기 선택 마커는 벡터로 형질전환된 형질전환체 (숙주세포)를 선별하기 위한 것으로 상기 선택 마커가 처리된 배지에서 선택 마커를 발현하는 세포만 생존할 수 있기 때문에, 형질전환된 세포의 선별이 가능하다. 상기 선택 마커는 대표적인 예로 카나마이신, 스트렙토마이신, 클로람페니콜 등이 있으나, 이에 한정되는 것은 아니다.
재조합 벡터를 숙주세포에 삽입함으로써 형질전환체를 만들 수 있으며, 상기 형질전환체는 재조합 벡터를 적절한 숙주세포에 도입시킴으로써 얻어진 것일 수 있다. 숙주세포는 상기 발현벡터를 안정되면서 연속적으로 클로닝 또는 발현시킬 수 있는 세포로서 당업계에 공지된 어떠한 숙주세포도 이용할 수 있다.
재조합 미생물을 제작하기 위하여 원핵세포에 형질전환시키는 경우에는 숙주세포로서 E. coli JM109, E. coli BL21, E. coli RR1, E. coli LE392, E. coli B, E. coli X 1776, E. coli W3110, E. coli XL1-Blue와 같은 대장균 속 균주, 바실러스 서브틸리스, 바실러스 츄린겐시스와 같은 바실러스 속 균주, 살모넬라 티피무리움, 세라티아 마르세슨스 및 슈도모나스 종과 같은 다양한 장내균과 균주 등이 이용되는 것일 수 있으나, 이에 한정되는 것은 아니다.
재조합 미생물을 제작하기 위하여 진핵세포에 형질전환을 하는 경우에는 숙주세포로서 효모 (예컨대, 사카로마이세스 세레비지에), 곤충 세포, 식물 세포 및 동물 세포, 예를 들어, Sp2/0, CHO K1, CHO DG44, PER.C6, W138, BHK, COS7, 293, HepG2, Huh7, 3T3, RIN, MDCK 세포주 등이 이용될 수 있으나, 이에 한정되는 것은 아니다.
본 발명에서 사용된 “형질전환(transformation)”은 외부 DNA를 숙주세포 내로 도입하여 인위적으로 유전적인 변화를 일으키는 현상을 의미하며, “형질전환체(transformat)”는 외부 DNA가 도입되어 목적 유전자의 발현을 안정적으로 유지하는 숙주세포를 의미한다.
상기 형질전환은 숙주세포에 따라 적합한 벡터 도입 기술이 선택되어 목적 유전자 또는 이를 포함하는 재조합 벡터를 숙주세포 내에서 발현시킬 수 있다. 예를 들면, 벡터 도입은 전기천공법(electroporation), 열 충격(heat-shock), 인산칼슘(CaPO4) 침전, 염화칼슘(CaCl2) 침전, 미세주입법(microinjection), 폴리에틸렌글리콜(PEG)법, DEAE-덱스트란법, 양이온 리포좀법, 초산 리튬-DMSO법, 또는 이들의 조합에 의해 수행될 수 있으나, 이에 한정되는 것은 아니다. 형질전환된 유전자는 숙주세포 내에서 발현될 수 있으면 숙주세포의 염색체 내 삽입 또는 염색체 외에 위치하고 있는 것이든 제한하지 않고 포함될 수 있다.
상기 형질전환체는 생체내 또는 시험관내에서 본 발명에 따른 재조합 벡터로 형질감염, 형질전환, 또는 감염된 세포를 포함하며, 재조합 숙주세포, 재조합 세포 또는 재조합 미생물과 동일한 용어로 사용될 수 있다.
본 발명의 일 구체예에 따르면, 상기 형질전환체는 에스케리치아(Escherichia) 속 균주인 것일 수 있다.
상기 에스케리치아 속 균주로는 에스케리치아 콜라이(Escherichia coli), 에스케리치아 알베르티(Escherichia albertii), 에스케리치아 블라태(Escherichia blattae), 에스케리치아 퍼구소니(Escherichia fergusonii), 에스케리치아 헤르만니(Escherichia hermannii), 에스케리치아 불네리스(Escherichia vulneris) 등이 있으나, 이에 한정되는 것은 아니다.
본 발명에서의 형질전환체는 전술한 수송단백질 변이체 또는 이를 암호화하는 폴리뉴클레오티드를 포함하거나, 또는 이를 포함하는 벡터를 포함하는 균주, 상기 수송단백질 변이체 또는 폴리뉴클레오티드를 발현하는 균주, 또는 상기 수송단백질 변이체에 대한 활성을 가지는 균주일 수 있으나, 이에 한정되는 것은 아니다.
본 발명의 일 구체예에 따르면, 상기 형질전환체는 L-방향족 아미노산 생산능을 가지는 것일 수 있다.
상기 형질전환체는 자연적으로 L-방향족 아미노산 생산능을 가지고 있거나, 또는 인위적으로 L-방향족 아미노산 생산능이 부여된 것일 수 있다.
본 발명의 일 구체예에 따르면, 상기 형질전환체는 암모늄 수송단백질 변이체, 아데닌 수송단백질 변이체 또는 FMN/FAD 수송단백질의 활성이 변화되어 L-방향족 아미노산 생산능이 향상된 것일 수 있다.
본 발명에서 사용된 “생산능이 향상된”은 모균주에 비해 L-방향족 아미노산의 생산성이 증가된 것을 의미한다. 상기 모균주는 변이의 대상이 되는 야생형 또는 변이주를 의미하며, 직접 변이의 대상이 되거나 재조합된 벡터 등으로 형질전환되는 대상을 포함한다. 본 발명에 있어서, 모균주는 야생형 에스케리치아 속 균주 또는 야생형으로부터 변이된 에스케리치아 속 균주일 수 있다.
본 발명에 따른 형질전환체는 암모늄 수송단백질 변이체, 아데닌 수송단백질 변이체 또는 FMN/FAD 수송단백질 변이체가 도입됨으로써 각 수송단백질의 활성이 변화하여 모균주에 비해 증가된 L-방향족 아미노산 생산능을 나타내며, 특히 모균주에 비해 L-방향족 아미노산 생산량이 10% 이상, 구체적으로는 10 내지 80% (바람직하게는 15 내지 60%) 증가되어 균주 배양액 1 ℓ 당 3.5 ~ 20 g의 L-방향족 아미노산을 생산할 수 있다.
본 발명의 다른 양상은 상기 형질전환체를 배지에서 배양하는 단계; 및 상기 형질전환체 또는 형질전환체가 배양된 배지로부터 L-방향족 아미노산을 회수하는 단계를 포함하는 L-방향족 아미노산의 생산 방법을 제공한다.
상기 배양은 당업계에 알려진 적절한 배지와 배양 조건에 따라 이루어질 수 있으며, 통상의 기술자라면 배지 및 배양 조건을 용이하게 조정하여 사용할 수 있다. 구체적으로, 상기 배지는 액체 배지일 수 있으나, 이에 한정되는 것은 아니다. 배양 방법은 예를 들면, 회분식 배양(batch culture), 연속식 배양(continuous culture), 유가식 배양(fed-batch culture) 또는 이들의 조합 배양을 포함할 수 있으나, 이에 한정되는 것은 아니다.
본 발명의 일 구체예에 따르면, 상기 배지는 적절한 방식으로 특정 균주의 요건을 충족해야 하며, 통상의 기술자에 의해 적절하게 변형될 수 있다. 에스케리치아 속 균주에 대한 배양 배지는 공지된 문헌 (Manual of Methods for General Bacteriology. American Society for Bacteriology. Washington D.C., USA, 1981)을 참조할 수 있으나, 이에 한정되는 것은 아니다.
본 발명의 일 구체예에 따르면, 배지에 다양한 탄소원, 질소원 및 미량원소 성분을 포함할 수 있다. 사용될 수 있는 탄소원으로는 글루코스, 수크로스, 락토스, 프락토스, 말토스, 전분, 셀룰로스와 같은 당 및 탄수화물, 대두유, 해바라기유, 피마자유, 코코넛유 등과 같은 오일 및 지방, 팔미트산, 스테아린산, 리놀레산과 같은 지방산, 글리세롤, 에탄올과 같은 알코올, 아세트산과 같은 유기산이 포함된다. 이들 물질은 개별적으로 또는 혼합물로서 사용될 수 있으나, 이에 한정되는 것은 아니다. 사용될 수 있는 질소원으로는 펩톤, 효모 추출물, 육즙, 맥아 추출물, 옥수수 침지액, 대두밀 및 요소 또는 무기 화합물, 예를 들면 황산 암모늄, 염화암모늄, 인산암모늄, 탄산암모늄 및 질산암모늄이 포함될 수 있다. 질소원 또한 개별적으로 또는 혼합물로서 사용할 수 있으나 이에 한정되는 것은 아니다. 사용될 수 있는 인의 공급원으로는 인산이수소칼륨 또는 인산수소이칼륨 또는 상응하는 나트륨-함유 염이 포함될 수 있으며, 이에 한정되는 것은 아니다. 또한, 배양 배지는 성장에 필요한 황산마그네슘 또는 황산철과 같은 금속염을 함유할 수 있으며, 이에 한정되는 것은 아니다. 그 외에, 아미노산 및 비타민과 같은 필수 성장 물질이 포함될 수 있다. 또한 배양 배지에 적절한 전구체들이 사용될 수 있다. 상기 배지 또는 개별 성분은 배양과정에서 배양액에 적절한 방식에 의해 회분식으로 또는 연속식으로 첨가될 수 있으나, 이에 한정되는 것은 아니다.
본 발명의 일 구체예에 따르면, 배양 중에 수산화암모늄, 수산화칼륨, 암모니아, 인산 및 황산과 같은 화합물을 미생물 배양액에 적절한 방식으로 첨가하여 배양액의 pH를 조정할 수 있다. 또한, 배양 중에 지방산 폴리글리콜 에스테르와 같은 소포제를 사용하여 기포 생성을 억제할 수 있다. 추가적으로, 배양액의 호기 상태를 유지하기 위하여, 배양액 내로 산소 또는 산소-함유 기체 (예, 공기)를 주입할 수 있다. 배양액의 온도는 통상 20℃ 내지 45℃, 예를 들면 25℃ 내지 40℃일 수 있다. 배양기간은 유용물질이 원하는 생산량으로 수득될 때까지 계속될 수 있으며, 예를 들면 10 내지 160 시간일 수 있다.
본 발명의 일 구체예에 따르면, 상기 배양된 형질전환체 또는 형질전환체가 배양된 배지에서 L-방향족 아미노산을 회수하는 단계는 배양 방법에 따라 당해 분야에 공지된 적합한 방법을 이용하여 배지로부터 생산된 L-방향족 아미노산을 수집 또는 회수할 수 있다. 예를 들면 원심분리, 여과, 추출, 분무, 건조, 증발, 침전, 결정화, 전기영동, 분별용해 (예를 들면, 암모늄 설페이트 침전), 크로마토그래피 (예를 들면, 이온 교환, 친화성, 소수성 및 크기배제) 등의 방법을 사용할 수 있으나, 이에 한정되는 것은 않는다.
본 발명의 일 구체예에 따르면, 상기 L-방향족 아미노산을 회수하는 단계는 배양 배지를 저속 원심분리하여 바이오매스를 제거하고 얻어진 상등액을 이온교환 크로마토그래피를 통하여 분리할 수 있다.
본 발명의 일 구체예에 따르면, 상기 L-방향족 아미노산을 회수하는 단계는 L-방향족 아미노산을 정제하는 공정을 포함할 수 있다.
본 발명의 일 구체예에 따르면, 상기 L-방향족 아미노산은 L-트립토판, L-페닐알라닌 및 L-티로신으로 이루어진 군에서 선택된 1종 이상인 것일 수 있다.
본 발명에 따른 수송단백질 변이체는 암모늄 수송단백질, 아데닌 수송단백질 또는 FMN/FAD 수송단백질을 구성하는 아미노산 서열 중 하나 이상의 아미노산이 치환됨으로써 단백질 활성이 변화되어, 이를 포함하는 재조합 미생물은 L-방향족 아미노산을 효율적으로 생산할 수 있다.
도 1은 본 발명의 일 실시예에 따른 플라스미드 pDSG의 구조를 나타낸 것이다.
도 2는 본 발명의 일 실시예에 따른 플라스미드 pDS9의 구조를 나타낸 것이다.
도 2는 본 발명의 일 실시예에 따른 플라스미드 pDS9의 구조를 나타낸 것이다.
이하, 본 발명을 보다 상세하게 설명한다. 그러나, 이러한 설명은 본 발명의 이해를 돕기 위하여 예시적으로 제시된 것일 뿐, 본 발명의 범위가 이러한 예시적인 설명에 의하여 제한되는 것은 아니다.
실시예 1. 암모늄 수송단백질 변이체를 발현하는 균주 제작
암모늄 수송단백질의 아미노산 서열 (서열번호 3) 중 363번째 위치한 글리신이 아스파르트산으로 치환된 변이체가 L-방향족 아미노산의 생산에 미치는 영향을 확인하기 위해 상기 암모늄 수송단백질 변이체를 발현하는 벡터 및 상기 벡터가 도입된 균주를 제작하였다. 균주 내 암모늄 수송단백질 변이체의 유전자 삽입을 위해 플라스미드 pDSG 및 pDS9를 사용하여 다음과 같이 제작하였다.
여기서 플라스미드 pDSG는 대장균에서만 작용하는 복제원점을 가지며, 암피실린 내성 유전자 및 가이드 RNA(gRNA) 발현 기전을 가진다. 플라스미드 pDS9는 대장균에서만 작용하는 복제원점을 가지며, 카나마이신 내성 유전자, λ Red 유전자 (exo, bet 및 gam) 및 Streptococcus pyogenes 유래 CAS9 발현 기전을 가진다.
1-1. 형질전환용 벡터 pDSG-amtB(Gly363Asp) 제작
대장균(Escherichia coli) MG1655 (KCTC14419BP) gDNA를 주형으로 프라이머 7 및 프라이머 9의 프라이머 쌍과 프라이머 8 및 프라이머 10의 프라이머 쌍을 이용하여 암모늄 수송단백질을 암호화하는 대장균 amtB 유전자의 363번 아미노산 변이의 upstream 단편과, 프라이머 11 및 프라이머 13의 프라이머 쌍과 프라이머 12 및 프라이머 14의 프라이머 쌍을 이용하여 대장균 amtB 유전자의 363번 아미노산 변이의 downstream 단편을 각각 PCR 수행을 통해 수득하였다. 이때 각 upstream과 downstream 단편에는 amtB 유전자의 363번째 아미노산 잔기인 글리신(Gly)을 아스파르트산(Asp)으로 변경하는 서열을 포함시켰다. 여기서 중합효소는 Takara PrimeSTAR Max DNA polymerase를 사용하였으며, PCR 증폭 조건은 95℃에서 10초 변성, 57℃ 15초 어닐링 및 72℃ 10초 중합을 30회 반복하여 수행하였다.
플라스미드 pDSG를 주형으로 프라이머 3 및 프라이머 5의 프라이머 쌍, 프라이머 4 및 프라이머 6의 프라이머 쌍, 프라이머 15 및 프라이머 1, 및 프라이머 16 및 프라이머 2의 프라이머 쌍을 각각 이용하여 4개의 pDSG 유전자 단편을 PCR 수행을 통해 수득하였다. 이때 각 유전자 단편에는 amtB 유전자의 363번째 Gly을 타겟하는 gRNA 서열을 포함시켰다. gRNA는 변이를 유발하고자 하는 서열의 NGG 앞 20 mer로 선택하였다. 여기서 중합효소는 Takara PrimeSTAR Max DNA polymerase를 사용하였으며, PCR 증폭 조건은 95℃에서 10초 변성, 57℃ 15초 어닐링 및 72℃ 15초 중합을 30회 반복하여 수행하였다.
수득된 amtB 유전자의 363번 아미노산의 upstream과 downstream, 그리고 4개의 pDSG 유전자 단편들을 self-assembly cloning 방법 (BioTechniques 51:55-56 (July 2011))을 이용하여 클로닝하여 재조합 플라스미드를 획득하였으며, 이를 pDSG-amtB(Gly363Asp)로 명명하였다.
1-2. 암모늄 수송단백질 변이체 amtB(Gly363Asp)가 도입된 L-트립토판 또는 L-페닐알라닌 생산 균주 제작
L-트립토판 생산 균주 및 L-페닐알라닌 생산 균주를 제작하기 위해 모균주로서 대장균 KFCC11660P 및 KCCM10016을 이용하였다.
KFCC11660P 균주 또는 KCCM10016 균주에 플라스미드 pDS9을 1차 형질전환하여 LB-Km (LB 액체배지 25 g/L 및 카나마이신 50 mg/L 함유) 고체배지에서 배양한 후 카나마이신 내성을 가지는 콜로니를 선별하였다. 선별된 콜로니에 pDSG-amtB(Gly363Asp) 플라스미드를 2차 형질전환하여 LB-Amp&Km (LB 액체배지 25 g/L, 암피실린 100 mg/L 및 카나마이신 50 mg/L 함유) 고체배지에서 배양하여 암피실린 및 카나마이신 내성을 가지는 콜로니를 선별한 후 프라이머 17 및 프라이머 18의 프라이머 쌍을 이용하여 PCR 수행을 통해 유전자 단편을 수득하였다. 중합효소는 Takara PrimeSTAR Max DNA polymerase를 사용하였으며, PCR 증폭 조건은 95℃에서 10초 변성, 57℃ 10초 어닐링 및 72℃ 15초 중합을 30회 반복하여 수행하였다. 마크로젠에 위탁하여 프라이머 17 및 프라이머 18의 프라이머 쌍을 이용하여 수득된 유전자 단편의 서열을 확인하였다.
선별된 2차 형질전환체는 LB 액체배지에서 7번 계대배양하여 LB 고체배지에서 콜로니를 선별하였다. 각 콜로니를 LB, LB-Amp 및 LB-Km 고체배지에서 각각 선별적으로 배양하였다. LB 고체배지에서 생장하면서 LB-Amp 및 LB-Km 고체배지에서 생육하지 않는 콜로니를 선별하였다. 이와 같은 방법으로 제작된 균주를 각각 KFCC11660P_pDSG-amtB(Gly363Asp) 및 KCCM10016_pDSG-amtB(Gly363Asp)로 명명하였다
실시예 1에서 사용된 프라이머 서열은 하기 표 1과 같다.
프라이머 명칭 | 서열번호 | 프라이머 서열 (5’-3’) |
프라이머 1 | 13 | CAATTTTATTATAGTAATTGACTATTATAC |
프라이머 2 | 14 | CCACGCCGCCCAATTTTATTATAGTAATTGACTATTATAC |
프라이머 3 | 15 | GGCGGCGTGGGCTTCGCTGAGTTTTAGAGCTAGAAATAGC |
프라이머 4 | 16 | GCTTCGCTGAGTTTTAGAGCTAGAAATAGC |
프라이머 5 | 17 | GAGCCTGTCGGCCTACCTGCT |
프라이머 6 | 18 | CGGCCGGCATGAGCCTGTCG |
프라이머 7 | 19 | ATGCCGGCCGTCGGTTGGTTTGGCTTTAAC |
프라이머 8 | 20 | TCGGTTGGTTTGGCTTTAAC |
프라이머 9 | 21 | CAGCGAGCTGGCGGCAAAAA |
프라이머 10 | 22 | CCACGCCGCCCAGCGAGCTGGCGGCAAAAA |
프라이머 11 | 23 | GGCGGCGTGGACTTCGCTGAAGGTGTGACG |
프라이머 12 | 24 | ACTTCGCTGAAGGTGTGACG |
프라이머 13 | 25 | GTGAGTTTTATACCCAAGAC |
프라이머 14 | 26 | TTTGTGCGCGGTGAGTTTTA |
프라이머 15 | 27 | CGCGCACAAAGAGCTCCTGAAAATCTCGATAAC |
프라이머 16 | 28 | GAGCTCCTGAAAATCTCGATAAC |
프라이머 17 | 29 | TGGCACCGTGGTGCACATTA |
프라이머 18 | 30 | AGCCGCACGGCATCGGTTTT |
실험예 1. 암모늄 수송단백질 변이체가 도입된 변이주의 L-방향족 아미노산 생산능 평가
모균주 (KFCC11660P 및 KCCM10016)와 암모늄 수송단백질 변이체가 도입된 변이주 (KFCC11660P_amtB(Gly363Asp) 및 KCCM10016_amtB(Gly363Asp))의 L-트립토판 또는 L-페닐알라닌 생산능을 비교하였다.
하기 표 2의 트립토판 생산용 배지 또는 페닐알라닌 생산용 배지 10 mL가 담긴 100 mL 플라스크에 각 균주 (모균주 또는 변이주)를 부피 기준으로 1%씩 접종하여 37℃, 200 rpm의 조건으로 72시간 진탕 배양하였다. 배양 종료 후 HPLC (Agilent)를 사용하여 배지 내 L-트립토판 또는 L-페닐알라닌의 농도를 측정하였고, 그 결과를 각각 하기 표 3 및 4에 나타내었다.
트립토판 생산용 배지 | 페닐알라닌 생산용 배지 | ||
조성 | 함량 | 조성 | 함량 |
Glucose | 80.0 g/L | Glucose | 80.0 g/L |
(NH4)2SO4 | 20.0 g/L | (NH4)2SO4 | 20.0 g/L |
K2HPO4 | 0.8 g/L | K2HPO4 | 1.0 g/L |
K2SO4 | 0.4 g/L | KH2PO4 | 1.0 g/L |
MgCl2 | 0.8 g/L | K2SO4 | 0.4 g/L |
Fumaric acid | 1.0 g/L | MgCl2 | 1.0 g/L |
Yeast extract | 1.0 g/L | Fumaric acid | 0.5 g/L |
(NH4)6Mo7O24 | 0.12 ppm | Yeast extract | 1.0 g/L |
H3BO3 | 0.01 ppm | Glutamic acid | 0.5 g/L |
CuSO4 | 0.01 ppm | CaCl2 | 5.00 ppm |
MnCl2 | 2.00 ppm | MnCl2 | 2.00 ppm |
ZnSO4 | 0.01 ppm | ZnSO4 | 1.00 ppm |
CoCl2 | 0.10 ppm | CoCl2 | 0.10 ppm |
FeCl2 | 10.00 ppm | FeCl2 | 10.00 ppm |
Thiamine_HCl | 20.00 ppm | Thiamine_HCl | 20.00 ppm |
L-Tyrosine | 200.00 ppm | L-Tyrosine | 200.00 ppm |
L-phenylalanine | 300.00 ppm | CaCO3 | 3% |
CaCO3 | 3% | - | - |
pH 7.0 with NaOH (33%) | pH 7.0 with NaOH (33%) |
균주 | L-트립토판 농도 (g/L) |
L-트립토판 농도 증가율 (%) |
KFCC11660P | 4.12 | - |
KFCC11660P_amtB(Gly363Asp) | 4.89 | 18.7 |
균주 | L-페닐알라닌 농도 (g/L) |
L- 페닐알라닌 농도 증가율 (%) |
KCCM10016 | 3.40 | - |
KCCM10016_amtB(Gly363Asp) | 4.52 | 32.9 |
상기 표 3 및 4에 나타낸 바와 같이, 암모늄 수송단백질 변이체가 도입된 변이주 KFCC11660P_amtB(Gly363Asp) 및 KCCM10016_amtB(Gly363Asp)는 모균주 KFCC11660P 및 KCCM10016에 비해 L-트립토판 및 L-페닐알라닌 생산량이 각각 약 19% 및 33% 향상된 것으로 확인되었다.
실시예 2. 아데닌 수송단백질 변이체를 발현하는 균주 제작
아데닌 수송단백질의 아미노산 서열 (서열번호 7) 중 136번째 위치한 트립토판이 종결 코돈으로 치환된 변이체가 L-방향족 아미노산의 생산에 미치는 영향을 확인하기 위해 상기 아데닌 수송단백질 변이체를 발현하는 벡터 및 상기 벡터가 도입된 균주를 제작하였다. 균주 내 아데닌 수송단백질 변이체의 유전자 삽입을 위해 실시예 1의 플라스미드 pDSG 및 pDS9를 사용하여 다음과 같이 제작하였다.
2-1. 형질전환용 벡터 pDSG-yicO(Trp136Stop) 제작
대장균(Escherichia coli) MG1655 (KCTC14419BP) gDNA를 주형으로 프라이머 7 및 프라이머 9의 프라이머 쌍과 프라이머 8 및 프라이머 10의 프라이머 쌍을 이용하여 아데닌 수송단백질을 암호화하는 대장균 yicO 유전자의 136번 아미노산 변이의 upstream 단편과, 프라이머 11 및 프라이머 13의 프라이머 쌍과 프라이머 12 및 프라이머 14의 프라이머 쌍을 이용하여 대장균 yicO 유전자의 136번 아미노산 변이의 downstream 단편을 각각 PCR 수행을 통해 수득하였다. 이때 각 upstream과 downstream 단편에는 yicO 유전자의 136번째 아미노산 잔기인 트립토판(Trp)을 종결 코돈(Stop)으로 변경하는 서열을 포함시켰다. 여기서 중합효소는 Takara PrimeSTAR Max DNA polymerase를 사용하였으며, PCR 증폭 조건은 95℃에서 10초 변성, 57℃ 15초 어닐링 및 72℃ 10초 중합을 30회 반복하여 수행하였다.
플라스미드 pDSG를 주형으로 프라이머 3 및 프라이머 5의 프라이머 쌍, 프라이머 4 및 프라이머 6의 프라이머 쌍, 프라이머 15 및 프라이머 1, 및 프라이머 16 및 프라이머 2의 프라이머 쌍을 각각 이용하여 4개의 pDSG 유전자 단편을 PCR 수행을 통해 수득하였다. 이때 각 유전자 단편에는 yicO 유전자의 136번째 Trp을 타겟하는 gRNA 서열을 포함시켰다. gRNA는 변이를 유발하고자 하는 서열의 NGG 앞 20 mer로 선택하였다. 여기서 중합효소는 Takara PrimeSTAR Max DNA polymerase를 사용하였으며, PCR 증폭 조건은 95℃에서 10초 변성, 57℃ 15초 어닐링 및 72℃ 15초 중합을 30회 반복하여 수행하였다.
수득된 yicO 유전자의 136번 아미노산의 upstream과 downstream, 그리고 4개의 pDSG 유전자 단편들을 self-assembly cloning 방법 (BioTechniques 51:55-56 (July 2011))을 이용하여 클로닝하여 재조합 플라스미드를 획득하였으며, 이를 pDSG-yicO(Trp136Stop)로 명명하였다.
2-2. 아데닌 수송단백질 변이체 yicO(Trp136Stop)가 도입된 L-트립토판 또는 L-페닐알라닌 생산 균주 제작
L-트립토판 생산 균주 및 L-페닐알라닌 생산 균주를 제작하기 위해 모균주로서 대장균 KFCC11660P 및 KCCM10016을 이용하였다.
KFCC11660P 균주 또는 KCCM10016 균주에 플라스미드 pDS9을 1차 형질전환하여 LB-Km (LB 액체배지 25 g/L 및 카나마이신 50 mg/L 함유) 고체배지에서 배양한 후 카나마이신 내성을 가지는 콜로니를 선별하였다. 선별된 콜로니에 pDSG-yicO(Trp136Stop) 플라스미드를 2차 형질전환하여 LB-Amp&Km (LB 액체배지 25 g/L, 암피실린 100 mg/L 및 카나마이신 50 mg/L 함유) 고체배지에서 배양하여 암피실린 및 카나마이신 내성을 가지는 콜로니를 선별한 후 프라이머 17 및 프라이머 18의 프라이머 쌍을 이용하여 PCR 수행을 통해 유전자 단편을 수득하였다. 중합효소는 Takara PrimeSTAR Max DNA polymerase를 사용하였으며, PCR 증폭 조건은 95℃에서 10초 변성, 57℃ 10초 어닐링 및 72℃ 15초 중합을 30회 반복하여 수행하였다. 마크로젠에 위탁하여 프라이머 17 및 프라이머 18의 프라이머 쌍을 이용하여 수득된 유전자 단편의 서열을 확인하였다.
선별된 2차 형질전환체는 LB 액체배지에서 7번 계대배양하여 LB 고체배지에서 콜로니를 선별하였다. 각 콜로니를 LB, LB-Amp 및 LB-Km 고체배지에서 각각 선별적으로 배양하였다. LB 고체배지에서 생장하면서 LB-Amp 및 LB-Km 고체배지에서 생육하지 않는 콜로니를 선별하였다. 이와 같은 방법으로 제작된 균주를 각각 KFCC11660P_yicO(Trp136Stop) 및 KCCM10016_yicO(Trp136Stop)로 명명하였다
실시예 2에서 사용된 프라이머 서열은 하기 표 5와 같다.
프라이머 명칭 | 서열번호 | 프라이머 서열 (5’-3’) |
프라이머 1 | 13 | CAATTTTATTATAGTAATTGACTATTATAC |
프라이머 2 | 31 | CGTGACGTTCCAATTTTATTATAGTAATTGACTATTATAC |
프라이머 3 | 32 | GAACGTCACGACCTTCGTCAGTTTTAGAGCTAGAAATAGC |
프라이머 4 | 33 | ACCTTCGTCAGTTTTAGAGCTAGAAATAGC |
프라이머 5 | 17 | GAGCCTGTCGGCCTACCTGCT |
프라이머 6 | 18 | CGGCCGGCATGAGCCTGTCG |
프라이머 7 | 34 | ATGCCGGCCGCGGCCGGGTAGATACCCAGAG |
프라이머 8 | 35 | CGGCCGGGTAGATACCCAGAG |
프라이머 9 | 36 | TCTGCTGTTCGTTGACAACA |
프라이머 10 | 37 | GTCGTGACGTTCTGCTGTTC |
프라이머 11 | 38 | ACGTCACGACTTTCGTCACGGAATTTCTCA |
프라이머 12 | 39 | TTTCGTCACGGAATTTCTCA |
프라이머 13 | 40 | TCCGTTCGCTAAGGGCGGTA |
프라이머 14 | 41 | ACCTGATGTGTCCGTTCGCT |
프라이머 15 | 42 | CACATCAGGTGAGCTCCTGAAAATCTCGATAAC |
프라이머 16 | 28 | GAGCTCCTGAAAATCTCGATAAC |
프라이머 17 | 43 | TCCGTTCGCTAAGGGCGGTA |
프라이머 18 | 44 | ACCTGATGTGTCCGTTCGCT |
실험예 2. 아데닌 수송단백질 변이체가 도입된 변이주의 L-방향족 아미노산 생산능 평가
모균주 (KFCC11660P 및 KCCM10016)와 아데닌 수송단백질 변이체가 도입된 변이주 (KFCC11660P_yicO(Trp136Stop) 및 KCCM10016_yicO(Trp136Stop))의 L-트립토판 또는 L-페닐알라닌 생산능을 비교하였다. 실험예 1과 동일한 방법으로 L-트립토판 및 L-페닐알라닌의 농도를 측정하였고, 그 결과를 각각 하기 표 6 및 7에 나타내었다.
균주 | L-트립토판 농도 (g/L) |
L-트립토판 농도 증가율 (%) |
KFCC11660P | 4.15 | - |
KFCC11660P_yicO(Trp136Stop) | 5.22 | 25.7 |
균주 | L-페닐알라닌 농도 (g/L) |
L- 페닐알라닌 농도 증가율 (%) |
KCCM10016 | 3.23 | - |
KCCM10016_yicO(Trp136Stop) | 3.85 | 19.1 |
상기 표 6 및 7에 나타낸 바와 같이, 아데닌 수송단백질 변이체가 도입된 변이주 KFCC11660P_yicO(Trp136Stop) 및 KCCM10016_yicO(Trp136Stop)는 모균주 KFCC11660P 및 KCCM10016에 비해 L-트립토판 및 L-페닐알라닌 생산량이 각각 약 26% 및 19% 향상된 것으로 확인되었다.
실시예 3. FMN/FAD 수송단백질 변이체를 발현하는 균주 제작
FMN/FAD 수송단백질의 아미노산 서열 (서열번호 11) 중 272번째 위치한 글리신이 글루타민으로 치환된 변이체가 L-방향족 아미노산의 생산에 미치는 영향을 확인하기 위해 상기 FMN/FAD 수송단백질 변이체를 발현하는 벡터 및 상기 벡터가 도입된 균주를 제작하였다. 균주 내 FMN/FAD 수송단백질 변이체의 유전자 삽입을 위해 실시예 1과 동일한 방법으로 플라스미드 pDSG 및 pDS9를 사용하여 다음과 같이 제작하였다.
3-1. 형질전환용 벡터 pDSG-yeeO(Gly272Glu) 제작
대장균(Escherichia coli) MG1655 (KCTC14419BP) gDNA를 주형으로 프라이머 7 및 프라이머 9의 프라이머 쌍과 프라이머 8 및 프라이머 10의 프라이머 쌍을 이용하여 FMN/FAD 수송단백질을 암호화하는 대장균 yeeO 유전자의 272번 아미노산 변이의 upstream 단편과, 프라이머 11 및 프라이머 13의 프라이머 쌍과 프라이머 12 및 프라이머 14의 프라이머 쌍을 이용하여 대장균 yeeO 유전자의 272번 아미노산 변이의 downstream 단편을 각각 PCR 수행을 통해 수득하였다. 이때 각 upstream과 downstream 단편에는 yeeO 유전자의 272번째 아미노산 잔기인 글리신(Gly)을 글루타민(Glu)으로 변경하는 서열을 포함시켰다. 여기서 중합효소는 Takara PrimeSTAR Max DNA polymerase를 사용하였으며, PCR 증폭 조건은 95℃에서 10초 변성, 57℃ 15초 어닐링 및 72℃ 10초 중합을 30회 반복하여 수행하였다.
플라스미드 pDSG를 주형으로 프라이머 3 및 프라이머 5의 프라이머 쌍, 프라이머 4 및 프라이머 6의 프라이머 쌍, 프라이머 15 및 프라이머 1, 및 프라이머 16 및 프라이머 2의 프라이머 쌍을 각각 이용하여 4개의 pDSG 유전자 단편을 PCR 수행을 통해 수득하였다. 이때 각 유전자 단편에는 yeeO 유전자의 272번째 Gly을 타겟하는 gRNA 서열을 포함시켰다. gRNA는 변이를 유발하고자 하는 서열의 NGG 앞 20 mer로 선택하였다. 여기서 중합효소는 Takara PrimeSTAR Max DNA polymerase를 사용하였으며, PCR 증폭 조건은 95℃에서 10초 변성, 57℃ 15초 어닐링 및 72℃ 15초 중합을 30회 반복하여 수행하였다.
수득된 yeeO 유전자의 272번 아미노산의 upstream과 downstream, 그리고 4개의 pDSG 유전자 단편들을 self-assembly cloning 방법 (BioTechniques 51:55-56 (July 2011))을 이용하여 클로닝하여 재조합 플라스미드를 획득하였으며, 이를 pDSG-yeeO(Gly272Glu)로 명명하였다.
3-2. FMN/FAD 수송단백질 변이체 yeeO(Gly272Glu)가 도입된 L-트립토판 또는 L-페닐알라닌 생산 균주 제작
L-트립토판 생산 균주 및 L-페닐알라닌 생산 균주를 제작하기 위해 모균주로서 대장균 KFCC11660P 및 KCCM10016을 이용하였다.
KFCC11660P 균주 또는 KCCM10016 균주에 플라스미드 pDS9을 1차 형질전환하여 LB-Km (LB 액체배지 25 g/L 및 카나마이신 50 mg/L 함유) 고체배지에서 배양한 후 카나마이신 내성을 가지는 콜로니를 선별하였다. 선별된 콜로니에 pDSG-yeeO(Gly272Glu) 플라스미드를 2차 형질전환하여 LB-Amp&Km (LB 액체배지 25 g/L, 암피실린 100 mg/L 및 카나마이신 50 mg/L 함유) 고체배지에서 배양하여 암피실린 및 카나마이신 내성을 가지는 콜로니를 선별한 후 프라이머 17 및 프라이머 18의 프라이머 쌍을 이용하여 PCR 수행을 통해 유전자 단편을 수득하였다. 중합효소는 Takara PrimeSTAR Max DNA polymerase를 사용하였으며, PCR 증폭 조건은 95℃에서 10초 변성, 57℃ 10초 어닐링 및 72℃ 15초 중합을 30회 반복하여 수행하였다. 마크로젠에 위탁하여 프라이머 17 및 프라이머 18의 프라이머 쌍을 이용하여 수득된 유전자 단편의 서열을 확인하였다.
선별된 2차 형질전환체는 LB 액체배지에서 7번 계대배양하여 LB 고체배지에서 콜로니를 선별하였다. 각 콜로니를 LB, LB-Amp 및 LB-Km 고체배지에서 각각 선별적으로 배양하였다. LB 고체배지에서 생장하면서 LB-Amp 및 LB-Km 고체배지에서 생육하지 않는 콜로니를 선별하였다. 이와 같은 방법으로 제작된 균주를 각각 KFCC11660P_yeeO(Gly272Glu) 및 KCCM10016_yeeO(Gly272Glu)로 명명하였다
실시예 3에서 사용된 프라이머 서열은 하기 표 8과 같다.
프라이머 명칭 | 서열번호 | 프라이머 서열 (5’-3’) |
프라이머 1 | 13 | CAATTTTATTATAGTAATTGACTATTATAC |
프라이머 2 | 45 | AGGAGAAAAGCAATTTTATTATAGTAATTGACTATTATAC |
프라이머 3 | 46 | CTTTTCTCCTGGCCGGGACTGTTTTAGAGCTAGAAATAGC |
프라이머 4 | 47 | GGCCGGGACTGTTTTAGAGCTAGAAATAGC |
프라이머 5 | 17 | GAGCCTGTCGGCCTACCTGCT |
프라이머 6 | 18 | CGGCCGGCATGAGCCTGTCG |
프라이머 7 | 48 | ATGCCGGCCGTTTAGTCTCGGTAAGCGGGA |
프라이머 8 | 49 | TTTAGTCTCGGTAAGCGGGA |
프라이머 9 | 50 | GAAAAGGCCGTAAATCAATATGCCG |
프라이머 10 | 51 | CCGGCCAGGAGAAAAGGCCGTAAAT |
프라이머 11 | 52 | TCCTGGCCGGAACTGGGATTTGTCGGGGCA |
프라이머 12 | 53 | AACTGGGATTTGTCGGGGCA |
프라이머 13 | 54 | GCAGAGCCGAGCGCACTTCC |
프라이머 14 | 55 | GATCGTAGAAGCAGAGCCGA |
프라이머 15 | 56 | TTCTACGATCGAGCTCCTGAAAATCTCGATAAC |
프라이머 16 | 28 | GAGCTCCTGAAAATCTCGATAAC |
프라이머 17 | 57 | CTTTTCTGGTCAGCTGGCTG |
프라이머 18 | 58 | TTAGCCAGGCGATGGCCGTT |
실험예 3. FMN/FAD 수송단백질 변이체가 도입된 변이주의 L-방향족 아미노산 생산능 평가
모균주 (KFCC11660P 및 KCCM10016)와 FMN/FAD 수송단백질 변이체가 도입된 변이주 (KFCC11660P_yeeO(Gly272Glu) 및 KCCM10016_yeeO(Gly272Glu))의 L-트립토판 또는 L-페닐알라닌 생산능을 비교하였다. 실험예 1과 동일한 방법으로 L-트립토판 및 L-페닐알라닌의 농도를 측정하였고, 그 결과를 각각 하기 표 9 및 10에 나타내었다.
균주 | L-트립토판 농도 (g/L) |
L-트립토판 농도 증가율 (%) |
KFCC11660P | 4.15 | - |
KFCC11660P_yeeO(Gly272Glu) | 5.04 | 21.4 |
균주 | L-페닐알라닌 농도 (g/L) |
L- 페닐알라닌 농도 증가율 (%) |
KCCM10016 | 3.41 | - |
KCCM10016_yeeO(Gly272Glu) | 4.87 | 42.8 |
상기 표 9 및 10에 나타낸 바와 같이, FMN/FAD 수송단백질 변이체가 도입된 변이주 KFCC11660P_yeeO(Gly272Glu) 및 KCCM10016_yeeO(Gly272Glu)는 모균주 KFCC11660P 및 KCCM10016에 비해 L-트립토판 및 L-페닐알라닌 생산량이 각각 약 21% 및 43% 향상된 것으로 확인되었다.
이제까지 본 발명에 대하여 그 바람직한 실시예들을 중심으로 살펴보았다. 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자는 본 발명이 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 변형된 형태로 구현될 수 있음을 이해할 수 있을 것이다. 그러므로 개시된 실시예들은 한정적인 관점이 아니라 설명적인 관점에서 고려되어야 한다. 본 발명의 범위는 전술한 설명이 아니라 특허청구범위에 나타나 있으며, 그와 동등한 범위 내에 있는 모든 차이점은 본 발명에 포함된 것으로 해석되어야 할 것이다.
<110> DAESANG CORPORATION
<120> Novel variant of transporter and method for preparing L-aromatic
amino acid using the same
<130> BPN221015D1
<160> 60
<170> KoPatentIn 3.0
<210> 1
<211> 428
<212> PRT
<213> Artificial Sequence
<220>
<223> amtB (G363D) variant
<400> 1
Met Lys Ile Ala Thr Ile Lys Thr Gly Leu Ala Ser Leu Ala Met Leu
1 5 10 15
Pro Gly Leu Val Met Ala Ala Pro Ala Val Ala Asp Lys Ala Asp Asn
20 25 30
Ala Phe Met Met Ile Cys Thr Ala Leu Val Leu Phe Met Thr Ile Pro
35 40 45
Gly Ile Ala Leu Phe Tyr Gly Gly Leu Ile Arg Gly Lys Asn Val Leu
50 55 60
Ser Met Leu Thr Gln Val Thr Val Thr Phe Ala Leu Val Cys Ile Leu
65 70 75 80
Trp Val Val Tyr Gly Tyr Ser Leu Ala Phe Gly Glu Gly Asn Asn Phe
85 90 95
Phe Gly Asn Ile Asn Trp Leu Met Leu Lys Asn Ile Glu Leu Thr Ala
100 105 110
Val Met Gly Ser Ile Tyr Gln Tyr Ile His Val Ala Phe Gln Gly Ser
115 120 125
Phe Ala Cys Ile Thr Val Gly Leu Ile Val Gly Ala Leu Ala Glu Arg
130 135 140
Ile Arg Phe Ser Ala Val Leu Ile Phe Val Val Val Trp Leu Thr Leu
145 150 155 160
Ser Tyr Ile Pro Ile Ala His Met Val Trp Gly Gly Gly Leu Leu Ala
165 170 175
Ser His Gly Ala Leu Asp Phe Ala Gly Gly Thr Val Val His Ile Asn
180 185 190
Ala Ala Ile Ala Gly Leu Val Gly Ala Tyr Leu Ile Gly Lys Arg Val
195 200 205
Gly Phe Gly Lys Glu Ala Phe Lys Pro His Asn Leu Pro Met Val Phe
210 215 220
Thr Gly Thr Ala Ile Leu Tyr Ile Gly Trp Phe Gly Phe Asn Ala Gly
225 230 235 240
Ser Ala Gly Thr Ala Asn Glu Ile Ala Ala Leu Ala Phe Val Asn Thr
245 250 255
Val Val Ala Thr Ala Ala Ala Ile Leu Gly Trp Ile Phe Gly Glu Trp
260 265 270
Ala Leu Arg Gly Lys Pro Ser Leu Leu Gly Ala Cys Ser Gly Ala Ile
275 280 285
Ala Gly Leu Val Gly Val Thr Pro Ala Cys Gly Tyr Ile Gly Val Gly
290 295 300
Gly Ala Leu Ile Ile Gly Val Val Ala Gly Leu Ala Gly Leu Trp Gly
305 310 315 320
Val Thr Met Leu Lys Arg Leu Leu Arg Val Asp Asp Pro Cys Asp Val
325 330 335
Phe Gly Val His Gly Val Cys Gly Ile Val Gly Cys Ile Met Thr Gly
340 345 350
Ile Phe Ala Ala Ser Ser Leu Gly Gly Val Asp Phe Ala Glu Gly Val
355 360 365
Thr Met Gly His Gln Leu Leu Val Gln Leu Glu Ser Ile Ala Ile Thr
370 375 380
Ile Val Trp Ser Gly Val Val Ala Phe Ile Gly Tyr Lys Leu Ala Asp
385 390 395 400
Leu Thr Val Gly Leu Arg Val Pro Glu Glu Gln Glu Arg Glu Gly Leu
405 410 415
Asp Val Asn Ser His Gly Glu Asn Ala Tyr Asn Ala
420 425
<210> 2
<211> 1287
<212> DNA
<213> Artificial Sequence
<220>
<223> amtB (G363D) variant
<400> 2
atgaagatag cgacgataaa aactgggctt gcttcactgg cgatgcttcc gggactggta 60
atggctgcac ctgcggtggc cgataaagcc gacaatgcgt ttatgatgat ttgtactgcg 120
ctggtgctgt ttatgactat tccggggatt gccctgtttt acggtgggtt gattcgcggc 180
aaaaacgtgc tgtcgatgct gacgcaggtg acggtgacat ttgcactggt ctgtattctc 240
tgggtggttt acggttactc gctggcgttt ggtgagggca acaacttctt cggcaacatt 300
aactggttga tgctgaaaaa catcgaactg acggcggtga tgggcagcat ttatcagtat 360
atccacgtgg cgtttcaggg atcgtttgcc tgcattaccg tcggcttgat agttggggcg 420
ctggcggaac gaatccgctt ctcagctgtg ttgattttcg tggtggtatg gctgacgctc 480
tcttacattc cgattgcgca tatggtgtgg ggcggtggtt tgctggcttc tcacggtgcg 540
ctggatttcg cgggtggcac cgtggtgcac attaacgccg caatcgccgg tctggtgggc 600
gcgtatctga taggaaaacg cgtgggcttc ggtaaagagg cgtttaaacc gcacaacctg 660
ccgatggtct tcaccgggac tgccattctc tatatcggtt ggtttggctt taacgccggg 720
tcagcgggca cggcgaatga aatcgcggca ctggcatttg tgaatactgt ggtcgcaacg 780
gcggcggcaa ttcttggctg gatcttcggt gaatgggcgc tgcgtggtaa gccttcactg 840
ctgggggcgt gttctggcgc gattgccggt ctggtcggcg tgacgccagc ctgcggctac 900
attggggttg gcggcgcgtt gattatcggc gtggtagctg gtctggcggg cttgtggggc 960
gttaccatgc tcaaacgctt gctgcgggtg gatgatccct gcgatgtctt cggtgtgcac 1020
ggcgtttgtg gcattgtcgg ctgtatcatg accgggattt ttgccgccag ctcgctgggc 1080
ggcgtggact tcgctgaagg tgtgacgatg ggccatcagt tgctggtaca gctggaaagc 1140
atcgccatta cgatcgtctg gtccggtgtt gtggcattta tcggctacaa attggcggat 1200
ctgacggttg gtctgcgtgt accggaagag caggagcgag aagggctgga tgtcaacagc 1260
cacggcgaga atgcctataa cgcgtaa 1287
<210> 3
<211> 428
<212> PRT
<213> Artificial Sequence
<220>
<223> amtB
<400> 3
Met Lys Ile Ala Thr Ile Lys Thr Gly Leu Ala Ser Leu Ala Met Leu
1 5 10 15
Pro Gly Leu Val Met Ala Ala Pro Ala Val Ala Asp Lys Ala Asp Asn
20 25 30
Ala Phe Met Met Ile Cys Thr Ala Leu Val Leu Phe Met Thr Ile Pro
35 40 45
Gly Ile Ala Leu Phe Tyr Gly Gly Leu Ile Arg Gly Lys Asn Val Leu
50 55 60
Ser Met Leu Thr Gln Val Thr Val Thr Phe Ala Leu Val Cys Ile Leu
65 70 75 80
Trp Val Val Tyr Gly Tyr Ser Leu Ala Phe Gly Glu Gly Asn Asn Phe
85 90 95
Phe Gly Asn Ile Asn Trp Leu Met Leu Lys Asn Ile Glu Leu Thr Ala
100 105 110
Val Met Gly Ser Ile Tyr Gln Tyr Ile His Val Ala Phe Gln Gly Ser
115 120 125
Phe Ala Cys Ile Thr Val Gly Leu Ile Val Gly Ala Leu Ala Glu Arg
130 135 140
Ile Arg Phe Ser Ala Val Leu Ile Phe Val Val Val Trp Leu Thr Leu
145 150 155 160
Ser Tyr Ile Pro Ile Ala His Met Val Trp Gly Gly Gly Leu Leu Ala
165 170 175
Ser His Gly Ala Leu Asp Phe Ala Gly Gly Thr Val Val His Ile Asn
180 185 190
Ala Ala Ile Ala Gly Leu Val Gly Ala Tyr Leu Ile Gly Lys Arg Val
195 200 205
Gly Phe Gly Lys Glu Ala Phe Lys Pro His Asn Leu Pro Met Val Phe
210 215 220
Thr Gly Thr Ala Ile Leu Tyr Ile Gly Trp Phe Gly Phe Asn Ala Gly
225 230 235 240
Ser Ala Gly Thr Ala Asn Glu Ile Ala Ala Leu Ala Phe Val Asn Thr
245 250 255
Val Val Ala Thr Ala Ala Ala Ile Leu Gly Trp Ile Phe Gly Glu Trp
260 265 270
Ala Leu Arg Gly Lys Pro Ser Leu Leu Gly Ala Cys Ser Gly Ala Ile
275 280 285
Ala Gly Leu Val Gly Val Thr Pro Ala Cys Gly Tyr Ile Gly Val Gly
290 295 300
Gly Ala Leu Ile Ile Gly Val Val Ala Gly Leu Ala Gly Leu Trp Gly
305 310 315 320
Val Thr Met Leu Lys Arg Leu Leu Arg Val Asp Asp Pro Cys Asp Val
325 330 335
Phe Gly Val His Gly Val Cys Gly Ile Val Gly Cys Ile Met Thr Gly
340 345 350
Ile Phe Ala Ala Ser Ser Leu Gly Gly Val Gly Phe Ala Glu Gly Val
355 360 365
Thr Met Gly His Gln Leu Leu Val Gln Leu Glu Ser Ile Ala Ile Thr
370 375 380
Ile Val Trp Ser Gly Val Val Ala Phe Ile Gly Tyr Lys Leu Ala Asp
385 390 395 400
Leu Thr Val Gly Leu Arg Val Pro Glu Glu Gln Glu Arg Glu Gly Leu
405 410 415
Asp Val Asn Ser His Gly Glu Asn Ala Tyr Asn Ala
420 425
<210> 4
<211> 1287
<212> DNA
<213> Artificial Sequence
<220>
<223> amtB
<400> 4
atgaagatag cgacgataaa aactgggctt gcttcactgg cgatgcttcc gggactggta 60
atggctgcac ctgcggtggc cgataaagcc gacaatgcgt ttatgatgat ttgtactgcg 120
ctggtgctgt ttatgactat tccggggatt gccctgtttt acggtgggtt gattcgcggc 180
aaaaacgtgc tgtcgatgct gacgcaggtg acggtgacat ttgcactggt ctgtattctc 240
tgggtggttt acggttactc gctggcgttt ggtgagggca acaacttctt cggcaacatt 300
aactggttga tgctgaaaaa catcgaactg acggcggtga tgggcagcat ttatcagtat 360
atccacgtgg cgtttcaggg atcgtttgcc tgcattaccg tcggcttgat agttggggcg 420
ctggcggaac gaatccgctt ctcagctgtg ttgattttcg tggtggtatg gctgacgctc 480
tcttacattc cgattgcgca tatggtgtgg ggcggtggtt tgctggcttc tcacggtgcg 540
ctggatttcg cgggtggcac cgtggtgcac attaacgccg caatcgccgg tctggtgggc 600
gcgtatctga taggaaaacg cgtgggcttc ggtaaagagg cgtttaaacc gcacaacctg 660
ccgatggtct tcaccgggac tgccattctc tatatcggtt ggtttggctt taacgccggg 720
tcagcgggca cggcgaatga aatcgcggca ctggcatttg tgaatactgt ggtcgcaacg 780
gcggcggcaa ttcttggctg gatcttcggt gaatgggcgc tgcgtggtaa gccttcactg 840
ctgggggcgt gttctggcgc gattgccggt ctggtcggcg tgacgccagc ctgcggctac 900
attggggttg gcggcgcgtt gattatcggc gtggtagctg gtctggcggg cttgtggggc 960
gttaccatgc tcaaacgctt gctgcgggtg gatgatccct gcgatgtctt cggtgtgcac 1020
ggcgtttgtg gcattgtcgg ctgtatcatg accgggattt ttgccgccag ctcgctgggc 1080
ggcgtgggct tcgctgaagg tgtgacgatg ggccatcagt tgctggtaca gctggaaagc 1140
atcgccatta cgatcgtctg gtccggtgtt gtggcattta tcggctacaa attggcggat 1200
ctgacggttg gtctgcgtgt accggaagag caggagcgag aagggctgga tgtcaacagc 1260
cacggcgaga atgcctataa cgcgtaa 1287
<210> 5
<211> 444
<212> PRT
<213> Artificial Sequence
<220>
<223> yicO (TRP136Stop) variant
<400> 5
Met Asn Asn Asp Asn Thr Asp Tyr Val Ser Asn Glu Ser Gly Thr Leu
1 5 10 15
Ser Arg Leu Phe Lys Leu Pro Gln His Gly Thr Thr Val Arg Thr Glu
20 25 30
Leu Ile Ala Gly Met Thr Thr Phe Leu Thr Met Val Tyr Ile Val Phe
35 40 45
Val Asn Pro Gln Ile Leu Gly Ala Ala Gln Met Asp Pro Lys Val Val
50 55 60
Phe Val Thr Thr Cys Leu Ile Ala Gly Ile Gly Ser Ile Ala Met Gly
65 70 75 80
Ile Phe Ala Asn Leu Pro Val Ala Leu Ala Pro Ala Met Gly Leu Asn
85 90 95
Ala Phe Phe Ala Phe Val Val Val Gly Ala Met Gly Ile Ser Trp Gln
100 105 110
Thr Gly Met Gly Ala Ile Phe Trp Gly Ala Val Gly Leu Phe Leu Leu
115 120 125
Thr Leu Phe Arg Ile Arg Tyr *** Met Ile Ser Asn Ile Pro Leu Ser
130 135 140
Leu Arg Ile Gly Ile Thr Ser Gly Ile Gly Leu Phe Ile Ala Leu Met
145 150 155 160
Gly Leu Lys Asn Thr Gly Val Ile Val Ala Asn Lys Asp Thr Leu Val
165 170 175
Met Ile Gly Asp Leu Ser Ser His Gly Val Leu Leu Gly Ile Leu Gly
180 185 190
Phe Phe Ile Ile Thr Val Leu Ser Ser Arg His Phe His Ala Ala Val
195 200 205
Leu Val Ser Ile Val Val Thr Ser Cys Cys Gly Leu Phe Phe Gly Asp
210 215 220
Val His Phe Ser Gly Val Tyr Ser Ile Pro Pro Asp Ile Ser Gly Val
225 230 235 240
Ile Gly Glu Val Asp Leu Ser Gly Ala Leu Thr Leu Glu Leu Ala Gly
245 250 255
Ile Ile Phe Ser Phe Met Leu Ile Asn Leu Phe Asp Ser Ser Gly Thr
260 265 270
Leu Ile Gly Val Thr Asp Lys Ala Gly Leu Ile Asp Gly Asn Gly Lys
275 280 285
Phe Pro Asn Met Asn Lys Ala Leu Tyr Val Asp Ser Val Ser Ser Val
290 295 300
Ala Gly Ala Phe Ile Gly Thr Ser Ser Val Thr Ala Tyr Ile Glu Ser
305 310 315 320
Thr Ser Gly Val Ala Val Gly Gly Arg Thr Gly Leu Thr Ala Val Val
325 330 335
Val Gly Val Met Phe Leu Leu Val Met Phe Phe Ser Pro Leu Val Ala
340 345 350
Ile Val Pro Pro Tyr Ala Thr Ala Gly Ala Leu Ile Phe Val Gly Val
355 360 365
Leu Met Thr Ser Ser Leu Ala Arg Val Asn Trp Asp Asp Phe Thr Glu
370 375 380
Ser Val Pro Ala Phe Ile Thr Thr Val Met Met Pro Phe Thr Phe Ser
385 390 395 400
Ile Thr Glu Gly Ile Ala Leu Gly Phe Met Ser Tyr Cys Ile Met Lys
405 410 415
Val Cys Thr Gly Arg Trp Arg Asp Leu Asn Leu Cys Val Val Val Val
420 425 430
Ala Ala Leu Phe Ala Leu Lys Ile Ile Leu Val Asp
435 440
<210> 6
<211> 1335
<212> DNA
<213> Artificial Sequence
<220>
<223> yicO (TRP136Stop) variant
<400> 6
atgaataatg acaataccga ttacgtgagt aatgaatcag ggacgctttc gcgattattt 60
aaactacctc agcatgggac caccgtccgc acagaattga ttgcggggat gaccactttt 120
ttaaccatgg tgtacatcgt ttttgtgaac ccgcaaatcc tcggcgcggc acaaatggac 180
ccgaaagtgg tgtttgttac cacctgtttg attgccggta tcggcagtat tgcgatgggg 240
atatttgcta acttacccgt ggcgctggct ccggcaatgg ggctgaacgc cttctttgcc 300
ttcgtggtcg tgggggcgat gggcatctcc tggcagaccg ggatgggcgc aatattctgg 360
ggcgcagttg gactattttt gctcacgctg tttcgtatcc ggtactgaat gatctccaac 420
attcccttaa gtttacgtat tggtatcacc agcggaattg gattatttat cgccttaatg 480
ggattaaaaa atactggcgt tattgtcgcc aataaagaca cgctggtgat gattggcgat 540
ttaagttctc acggcgtgtt gttaggtatt ttagggtttt ttattataac cgtgttgtca 600
tcacgtcatt ttcatgccgc ggtgctggtt tctattgtgg tgacgtcttg ctgtggatta 660
tttttcggtg atgttcattt tagcggcgtc tattccattc cgcctgatat tagcggcgtc 720
attggtgaag tagatttgag cggcgcgtta acacttgaac tcgccggtat cattttctcc 780
tttatgctga tcaacctatt tgattcatca ggaacattaa ttggtgtaac tgataaagcg 840
ggcttaatag atggtaacgg taaattcccc aatatgaata aggcgctgta tgttgatagc 900
gtcagttcgg tggcgggtgc gtttatcggc acctcgtctg ttactgccta tattgaaagt 960
acttctggtg tggcagtcgg tggccgcacg gggctgactg cggttgtggt tggcgttatg 1020
ttcctgttgg ttatgttctt ctcaccgctg gtggcgatag ttcctcctta cgcaaccgcc 1080
ggagcgttaa tctttgttgg cgtgctgatg acttcgagcc tggcgcgcgt taactgggat 1140
gattttaccg aatcggtgcc tgcgtttatt accacggtga tgatgccctt tactttctcg 1200
atcaccgaag ggattgcact cggctttatg tcgtactgca tcatgaaagt atgcaccggg 1260
cgctggcgcg atctgaacct gtgtgtggtg gtggtcgcag ctctgtttgc actgaagatt 1320
attctggtgg attag 1335
<210> 7
<211> 444
<212> PRT
<213> Artificial Sequence
<220>
<223> yicO
<400> 7
Met Asn Asn Asp Asn Thr Asp Tyr Val Ser Asn Glu Ser Gly Thr Leu
1 5 10 15
Ser Arg Leu Phe Lys Leu Pro Gln His Gly Thr Thr Val Arg Thr Glu
20 25 30
Leu Ile Ala Gly Met Thr Thr Phe Leu Thr Met Val Tyr Ile Val Phe
35 40 45
Val Asn Pro Gln Ile Leu Gly Ala Ala Gln Met Asp Pro Lys Val Val
50 55 60
Phe Val Thr Thr Cys Leu Ile Ala Gly Ile Gly Ser Ile Ala Met Gly
65 70 75 80
Ile Phe Ala Asn Leu Pro Val Ala Leu Ala Pro Ala Met Gly Leu Asn
85 90 95
Ala Phe Phe Ala Phe Val Val Val Gly Ala Met Gly Ile Ser Trp Gln
100 105 110
Thr Gly Met Gly Ala Ile Phe Trp Gly Ala Val Gly Leu Phe Leu Leu
115 120 125
Thr Leu Phe Arg Ile Arg Tyr Trp Met Ile Ser Asn Ile Pro Leu Ser
130 135 140
Leu Arg Ile Gly Ile Thr Ser Gly Ile Gly Leu Phe Ile Ala Leu Met
145 150 155 160
Gly Leu Lys Asn Thr Gly Val Ile Val Ala Asn Lys Asp Thr Leu Val
165 170 175
Met Ile Gly Asp Leu Ser Ser His Gly Val Leu Leu Gly Ile Leu Gly
180 185 190
Phe Phe Ile Ile Thr Val Leu Ser Ser Arg His Phe His Ala Ala Val
195 200 205
Leu Val Ser Ile Val Val Thr Ser Cys Cys Gly Leu Phe Phe Gly Asp
210 215 220
Val His Phe Ser Gly Val Tyr Ser Ile Pro Pro Asp Ile Ser Gly Val
225 230 235 240
Ile Gly Glu Val Asp Leu Ser Gly Ala Leu Thr Leu Glu Leu Ala Gly
245 250 255
Ile Ile Phe Ser Phe Met Leu Ile Asn Leu Phe Asp Ser Ser Gly Thr
260 265 270
Leu Ile Gly Val Thr Asp Lys Ala Gly Leu Ile Asp Gly Asn Gly Lys
275 280 285
Phe Pro Asn Met Asn Lys Ala Leu Tyr Val Asp Ser Val Ser Ser Val
290 295 300
Ala Gly Ala Phe Ile Gly Thr Ser Ser Val Thr Ala Tyr Ile Glu Ser
305 310 315 320
Thr Ser Gly Val Ala Val Gly Gly Arg Thr Gly Leu Thr Ala Val Val
325 330 335
Val Gly Val Met Phe Leu Leu Val Met Phe Phe Ser Pro Leu Val Ala
340 345 350
Ile Val Pro Pro Tyr Ala Thr Ala Gly Ala Leu Ile Phe Val Gly Val
355 360 365
Leu Met Thr Ser Ser Leu Ala Arg Val Asn Trp Asp Asp Phe Thr Glu
370 375 380
Ser Val Pro Ala Phe Ile Thr Thr Val Met Met Pro Phe Thr Phe Ser
385 390 395 400
Ile Thr Glu Gly Ile Ala Leu Gly Phe Met Ser Tyr Cys Ile Met Lys
405 410 415
Val Cys Thr Gly Arg Trp Arg Asp Leu Asn Leu Cys Val Val Val Val
420 425 430
Ala Ala Leu Phe Ala Leu Lys Ile Ile Leu Val Asp
435 440
<210> 8
<211> 1335
<212> DNA
<213> Artificial Sequence
<220>
<223> yicO
<400> 8
atgaataatg acaataccga ttacgtgagt aatgaatcag ggacgctttc gcgattattt 60
aaactacctc agcatgggac caccgtccgc acagaattga ttgcggggat gaccactttt 120
ttaaccatgg tgtacatcgt ttttgtgaac ccgcaaatcc tcggcgcggc acaaatggac 180
ccgaaagtgg tgtttgttac cacctgtttg attgccggta tcggcagtat tgcgatgggg 240
atatttgcta acttacccgt ggcgctggct ccggcaatgg ggctgaacgc cttctttgcc 300
ttcgtggtcg tgggggcgat gggcatctcc tggcagaccg ggatgggcgc aatattctgg 360
ggcgcagttg gactattttt gctcacgctg tttcgtatcc ggtactggat gatctccaac 420
attcccttaa gtttacgtat tggtatcacc agcggaattg gattatttat cgccttaatg 480
ggattaaaaa atactggcgt tattgtcgcc aataaagaca cgctggtgat gattggcgat 540
ttaagttctc acggcgtgtt gttaggtatt ttagggtttt ttattataac cgtgttgtca 600
tcacgtcatt ttcatgccgc ggtgctggtt tctattgtgg tgacgtcttg ctgtggatta 660
tttttcggtg atgttcattt tagcggcgtc tattccattc cgcctgatat tagcggcgtc 720
attggtgaag tagatttgag cggcgcgtta acacttgaac tcgccggtat cattttctcc 780
tttatgctga tcaacctatt tgattcatca ggaacattaa ttggtgtaac tgataaagcg 840
ggcttaatag atggtaacgg taaattcccc aatatgaata aggcgctgta tgttgatagc 900
gtcagttcgg tggcgggtgc gtttatcggc acctcgtctg ttactgccta tattgaaagt 960
acttctggtg tggcagtcgg tggccgcacg gggctgactg cggttgtggt tggcgttatg 1020
ttcctgttgg ttatgttctt ctcaccgctg gtggcgatag ttcctcctta cgcaaccgcc 1080
ggagcgttaa tctttgttgg cgtgctgatg acttcgagcc tggcgcgcgt taactgggat 1140
gattttaccg aatcggtgcc tgcgtttatt accacggtga tgatgccctt tactttctcg 1200
atcaccgaag ggattgcact cggctttatg tcgtactgca tcatgaaagt atgcaccggg 1260
cgctggcgcg atctgaacct gtgtgtggtg gtggtcgcag ctctgtttgc actgaagatt 1320
attctggtgg attag 1335
<210> 9
<211> 547
<212> PRT
<213> Artificial Sequence
<220>
<223> yeeO (G272E) variant
<400> 9
Met Leu Arg His Ile Leu Thr Ala Lys Asn Leu Leu Ser Asn Pro Ile
1 5 10 15
Phe Lys Phe Pro Asn Cys Leu Pro Phe Leu Ser Thr Val Cys Cys Ile
20 25 30
Cys Arg Gln Phe Val Gly Glu Asn Leu Cys Ser Phe Ala Asp Ser Pro
35 40 45
Ser Leu Phe Glu Met Trp Phe His Phe Leu Gln Leu Arg Ser Ala Leu
50 55 60
Asn Ile Ser Ser Ala Leu Arg Gln Val Val His Gly Thr Arg Trp His
65 70 75 80
Ala Lys Arg Lys Ser Tyr Lys Val Leu Phe Trp Arg Glu Ile Thr Pro
85 90 95
Leu Ala Val Pro Ile Phe Met Glu Asn Ala Cys Val Leu Leu Met Gly
100 105 110
Val Leu Ser Thr Phe Leu Val Ser Trp Leu Gly Lys Asp Ala Met Ala
115 120 125
Gly Val Gly Leu Ala Asp Ser Phe Asn Met Val Ile Met Ala Phe Phe
130 135 140
Ala Ala Ile Asp Leu Gly Thr Thr Val Val Val Ala Phe Ser Leu Gly
145 150 155 160
Lys Arg Asp Arg Arg Arg Ala Arg Val Ala Thr Arg Gln Ser Leu Val
165 170 175
Ile Met Thr Leu Phe Ala Val Leu Leu Ala Thr Leu Ile His His Phe
180 185 190
Gly Glu Gln Ile Ile Asp Phe Val Ala Gly Asp Ala Thr Thr Glu Val
195 200 205
Lys Ala Leu Ala Leu Thr Tyr Leu Glu Leu Thr Val Leu Ser Tyr Pro
210 215 220
Ala Ala Ala Ile Thr Leu Ile Gly Ser Gly Ala Leu Arg Gly Ala Gly
225 230 235 240
Asn Thr Lys Ile Pro Leu Leu Ile Asn Gly Ser Leu Asn Ile Leu Asn
245 250 255
Ile Ile Ile Ser Gly Ile Leu Ile Tyr Gly Leu Phe Ser Trp Pro Glu
260 265 270
Leu Gly Phe Val Gly Ala Gly Leu Gly Leu Thr Ile Ser Arg Tyr Ile
275 280 285
Gly Ala Val Ala Ile Leu Trp Val Leu Ala Ile Gly Phe Asn Pro Ala
290 295 300
Leu Arg Ile Ser Leu Lys Ser Tyr Phe Lys Pro Leu Asn Phe Ser Ile
305 310 315 320
Ile Trp Glu Val Met Gly Ile Gly Ile Pro Ala Ser Val Glu Ser Val
325 330 335
Leu Phe Thr Ser Gly Arg Leu Leu Thr Gln Met Phe Val Ala Gly Met
340 345 350
Gly Thr Ser Val Ile Ala Gly Asn Phe Ile Ala Phe Ser Ile Ala Ala
355 360 365
Leu Ile Asn Leu Pro Gly Ser Ala Leu Gly Ser Ala Ser Thr Ile Ile
370 375 380
Thr Gly Arg Arg Leu Gly Val Gly Gln Ile Ala Gln Ala Glu Ile Gln
385 390 395 400
Leu Arg His Val Phe Trp Leu Ser Thr Leu Gly Leu Thr Ala Ile Ala
405 410 415
Trp Leu Thr Ala Pro Phe Ala Gly Val Met Ala Ser Phe Tyr Thr Gln
420 425 430
Asp Pro Gln Val Lys His Val Val Val Ile Leu Ile Trp Leu Asn Ala
435 440 445
Leu Phe Met Pro Ile Trp Ser Ala Ser Trp Val Leu Pro Ala Gly Phe
450 455 460
Lys Gly Ala Arg Asp Ala Arg Tyr Ala Met Trp Val Ser Met Leu Ser
465 470 475 480
Met Trp Gly Cys Arg Val Val Val Gly Tyr Val Leu Gly Ile Met Leu
485 490 495
Gly Trp Gly Val Val Gly Val Trp Met Gly Met Phe Ala Asp Trp Ala
500 505 510
Val Arg Ala Val Leu Phe Tyr Trp Arg Met Val Thr Gly Arg Trp Leu
515 520 525
Trp Lys Tyr Pro Arg Pro Glu Pro Gln Lys Cys Glu Lys Lys Pro Val
530 535 540
Val Ser Glu
545
<210> 10
<211> 1644
<212> DNA
<213> Artificial Sequence
<220>
<223> yeeO (G272E) variant
<400> 10
ttgttgaggc acatcttaac ggcgaaaaat cttttgtcaa acccgatttt taaattcccc 60
aactgtttgc cgtttctatc aacagtttgt tgcatttgca gacaatttgt tggcgaaaat 120
ctttgcagct ttgctgattc tccctcatta tttgaaatgt ggtttcactt tctgcaatta 180
aggtcggctt tgaatatctc ctctgcttta cgccaggttg ttcacggcac tcgctggcac 240
gctaaacgca agagctacaa agtgttgttc tggcgcgaga taaccccgct tgctgttcct 300
atcttcatgg agaatgcctg tgtcctgttg atgggggttc tgagcacttt tctggtcagc 360
tggctgggaa aagatgcgat ggccggcgtg ggattggcgg acagcttcaa tatggtcatt 420
atggcttttt ttgctgctat cgatcttggt actactgtcg ttgtggcatt tagtctcggt 480
aagcgggatc gacgacgagc gagggtggcg acgcggcagt cattggtgat catgacgttg 540
tttgccgtac tgttggcaac gcttattcat cattttggcg aacaaattat tgatttcgtc 600
gcgggtgatg ccacgacaga agttaaagca ctggcgttga cttatctgga gctgacggta 660
ctcagttatc cagcagctgc catcactctt attggtagcg gggcacttcg tggtgcaggg 720
aatacgaaaa taccgctatt gattaacggt agcctgaata ttcttaatat tattattagc 780
ggcatattga tttacggcct tttctcctgg ccggaactgg gatttgtcgg ggcagggctg 840
ggtttaacca tttctcgtta tattggcgca gttgcaattt tgtgggtgct ggcgattggt 900
tttaatcctg cgctaaggat ttcgttaaag agctatttta aaccgctgaa ttttagcatt 960
atctgggaag tcatggggat tggtattccc gcgagtgtcg aatcagtgtt atttaccagt 1020
ggtcggttat taacccaaat gttcgttgcc gggatgggga ccagtgttat tgccggaaat 1080
tttatcgcgt tttcaattgc ggctcttatc aacttacccg gaagtgcgct cggctctgct 1140
tctacgatca ttacaggccg aaggttgggg gtagggcaga tagcgcaagc agagattcag 1200
ttgcggcatg tgttctggct ttccactctt ggattaacgg ccatcgcctg gctaacggct 1260
ccctttgccg gggttatggc atcgttttac acccaggatc cacaggttaa acatgtcgtt 1320
gtgattctga tttggctaaa tgctttattt atgcctattt ggtccgcctc atgggtgcta 1380
cccgctggat ttaaaggtgc tcgtgatgcc cgttacgcca tgtgggtttc gatgttgagc 1440
atgtggggtt gtcgggttgt agtcggttat gtgctgggaa tcatgcttgg ctggggtgtg 1500
gttggtgtct ggatgggaat gtttgccgac tgggctgtgc gggccgtgct gttttactgg 1560
cgaatggtta ctggacgttg gctatggaaa taccctcgac ccgagccgca aaagtgtgaa 1620
aaaaagccag ttgtgtcgga ataa 1644
<210> 11
<211> 547
<212> PRT
<213> Artificial Sequence
<220>
<223> yeeO
<400> 11
Met Leu Arg His Ile Leu Thr Ala Lys Asn Leu Leu Ser Asn Pro Ile
1 5 10 15
Phe Lys Phe Pro Asn Cys Leu Pro Phe Leu Ser Thr Val Cys Cys Ile
20 25 30
Cys Arg Gln Phe Val Gly Glu Asn Leu Cys Ser Phe Ala Asp Ser Pro
35 40 45
Ser Leu Phe Glu Met Trp Phe His Phe Leu Gln Leu Arg Ser Ala Leu
50 55 60
Asn Ile Ser Ser Ala Leu Arg Gln Val Val His Gly Thr Arg Trp His
65 70 75 80
Ala Lys Arg Lys Ser Tyr Lys Val Leu Phe Trp Arg Glu Ile Thr Pro
85 90 95
Leu Ala Val Pro Ile Phe Met Glu Asn Ala Cys Val Leu Leu Met Gly
100 105 110
Val Leu Ser Thr Phe Leu Val Ser Trp Leu Gly Lys Asp Ala Met Ala
115 120 125
Gly Val Gly Leu Ala Asp Ser Phe Asn Met Val Ile Met Ala Phe Phe
130 135 140
Ala Ala Ile Asp Leu Gly Thr Thr Val Val Val Ala Phe Ser Leu Gly
145 150 155 160
Lys Arg Asp Arg Arg Arg Ala Arg Val Ala Thr Arg Gln Ser Leu Val
165 170 175
Ile Met Thr Leu Phe Ala Val Leu Leu Ala Thr Leu Ile His His Phe
180 185 190
Gly Glu Gln Ile Ile Asp Phe Val Ala Gly Asp Ala Thr Thr Glu Val
195 200 205
Lys Ala Leu Ala Leu Thr Tyr Leu Glu Leu Thr Val Leu Ser Tyr Pro
210 215 220
Ala Ala Ala Ile Thr Leu Ile Gly Ser Gly Ala Leu Arg Gly Ala Gly
225 230 235 240
Asn Thr Lys Ile Pro Leu Leu Ile Asn Gly Ser Leu Asn Ile Leu Asn
245 250 255
Ile Ile Ile Ser Gly Ile Leu Ile Tyr Gly Leu Phe Ser Trp Pro Gly
260 265 270
Leu Gly Phe Val Gly Ala Gly Leu Gly Leu Thr Ile Ser Arg Tyr Ile
275 280 285
Gly Ala Val Ala Ile Leu Trp Val Leu Ala Ile Gly Phe Asn Pro Ala
290 295 300
Leu Arg Ile Ser Leu Lys Ser Tyr Phe Lys Pro Leu Asn Phe Ser Ile
305 310 315 320
Ile Trp Glu Val Met Gly Ile Gly Ile Pro Ala Ser Val Glu Ser Val
325 330 335
Leu Phe Thr Ser Gly Arg Leu Leu Thr Gln Met Phe Val Ala Gly Met
340 345 350
Gly Thr Ser Val Ile Ala Gly Asn Phe Ile Ala Phe Ser Ile Ala Ala
355 360 365
Leu Ile Asn Leu Pro Gly Ser Ala Leu Gly Ser Ala Ser Thr Ile Ile
370 375 380
Thr Gly Arg Arg Leu Gly Val Gly Gln Ile Ala Gln Ala Glu Ile Gln
385 390 395 400
Leu Arg His Val Phe Trp Leu Ser Thr Leu Gly Leu Thr Ala Ile Ala
405 410 415
Trp Leu Thr Ala Pro Phe Ala Gly Val Met Ala Ser Phe Tyr Thr Gln
420 425 430
Asp Pro Gln Val Lys His Val Val Val Ile Leu Ile Trp Leu Asn Ala
435 440 445
Leu Phe Met Pro Ile Trp Ser Ala Ser Trp Val Leu Pro Ala Gly Phe
450 455 460
Lys Gly Ala Arg Asp Ala Arg Tyr Ala Met Trp Val Ser Met Leu Ser
465 470 475 480
Met Trp Gly Cys Arg Val Val Val Gly Tyr Val Leu Gly Ile Met Leu
485 490 495
Gly Trp Gly Val Val Gly Val Trp Met Gly Met Phe Ala Asp Trp Ala
500 505 510
Val Arg Ala Val Leu Phe Tyr Trp Arg Met Val Thr Gly Arg Trp Leu
515 520 525
Trp Lys Tyr Pro Arg Pro Glu Pro Gln Lys Cys Glu Lys Lys Pro Val
530 535 540
Val Ser Glu
545
<210> 12
<211> 1644
<212> DNA
<213> Artificial Sequence
<220>
<223> yeeO
<400> 12
ttgttgaggc acatcttaac ggcgaaaaat cttttgtcaa acccgatttt taaattcccc 60
aactgtttgc cgtttctatc aacagtttgt tgcatttgca gacaatttgt tggcgaaaat 120
ctttgcagct ttgctgattc tccctcatta tttgaaatgt ggtttcactt tctgcaatta 180
aggtcggctt tgaatatctc ctctgcttta cgccaggttg ttcacggcac tcgctggcac 240
gctaaacgca agagctacaa agtgttgttc tggcgcgaga taaccccgct tgctgttcct 300
atcttcatgg agaatgcctg tgtcctgttg atgggggttc tgagcacttt tctggtcagc 360
tggctgggaa aagatgcgat ggccggcgtg ggattggcgg acagcttcaa tatggtcatt 420
atggcttttt ttgctgctat cgatcttggt actactgtcg ttgtggcatt tagtctcggt 480
aagcgggatc gacgacgagc gagggtggcg acgcggcagt cattggtgat catgacgttg 540
tttgccgtac tgttggcaac gcttattcat cattttggcg aacaaattat tgatttcgtc 600
gcgggtgatg ccacgacaga agttaaagca ctggcgttga cttatctgga gctgacggta 660
ctcagttatc cagcagctgc catcactctt attggtagcg gggcacttcg tggtgcaggg 720
aatacgaaaa taccgctatt gattaacggt agcctgaata ttcttaatat tattattagc 780
ggcatattga tttacggcct tttctcctgg ccgggactgg gatttgtcgg ggcagggctg 840
ggtttaacca tttctcgtta tattggcgca gttgcaattt tgtgggtgct ggcgattggt 900
tttaatcctg cgctaaggat ttcgttaaag agctatttta aaccgctgaa ttttagcatt 960
atctgggaag tcatggggat tggtattccc gcgagtgtcg aatcagtgtt atttaccagt 1020
ggtcggttat taacccaaat gttcgttgcc gggatgggga ccagtgttat tgccggaaat 1080
tttatcgcgt tttcaattgc ggctcttatc aacttacccg gaagtgcgct cggctctgct 1140
tctacgatca ttacaggccg aaggttgggg gtagggcaga tagcgcaagc agagattcag 1200
ttgcggcatg tgttctggct ttccactctt ggattaacgg ccatcgcctg gctaacggct 1260
ccctttgccg gggttatggc atcgttttac acccaggatc cacaggttaa acatgtcgtt 1320
gtgattctga tttggctaaa tgctttattt atgcctattt ggtccgcctc atgggtgcta 1380
cccgctggat ttaaaggtgc tcgtgatgcc cgttacgcca tgtgggtttc gatgttgagc 1440
atgtggggtt gtcgggttgt agtcggttat gtgctgggaa tcatgcttgg ctggggtgtg 1500
gttggtgtct ggatgggaat gtttgccgac tgggctgtgc gggccgtgct gttttactgg 1560
cgaatggtta ctggacgttg gctatggaaa taccctcgac ccgagccgca aaagtgtgaa 1620
aaaaagccag ttgtgtcgga ataa 1644
<210> 13
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 1
<400> 13
caattttatt atagtaattg actattatac 30
<210> 14
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 2
<400> 14
ccacgccgcc caattttatt atagtaattg actattatac 40
<210> 15
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 3
<400> 15
ggcggcgtgg gcttcgctga gttttagagc tagaaatagc 40
<210> 16
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 4
<400> 16
gcttcgctga gttttagagc tagaaatagc 30
<210> 17
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 5
<400> 17
gagcctgtcg gcctacctgc t 21
<210> 18
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 6
<400> 18
cggccggcat gagcctgtcg 20
<210> 19
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 7
<400> 19
atgccggccg tcggttggtt tggctttaac 30
<210> 20
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 8
<400> 20
tcggttggtt tggctttaac 20
<210> 21
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 9
<400> 21
cagcgagctg gcggcaaaaa 20
<210> 22
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 10
<400> 22
ccacgccgcc cagcgagctg gcggcaaaaa 30
<210> 23
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 11
<400> 23
ggcggcgtgg acttcgctga aggtgtgacg 30
<210> 24
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 12
<400> 24
acttcgctga aggtgtgacg 20
<210> 25
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 13
<400> 25
gtgagtttta tacccaagac 20
<210> 26
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 14
<400> 26
tttgtgcgcg gtgagtttta 20
<210> 27
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 15
<400> 27
cgcgcacaaa gagctcctga aaatctcgat aac 33
<210> 28
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 16
<400> 28
gagctcctga aaatctcgat aac 23
<210> 29
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 17
<400> 29
tggcaccgtg gtgcacatta 20
<210> 30
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 18
<400> 30
agccgcacgg catcggtttt 20
<210> 31
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 2
<400> 31
cgtgacgttc caattttatt atagtaattg actattatac 40
<210> 32
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 3
<400> 32
gaacgtcacg accttcgtca gttttagagc tagaaatagc 40
<210> 33
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 4
<400> 33
accttcgtca gttttagagc tagaaatagc 30
<210> 34
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 7
<400> 34
atgccggccg cggccgggta gatacccaga g 31
<210> 35
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 8
<400> 35
cggccgggta gatacccaga g 21
<210> 36
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 9
<400> 36
tctgctgttc gttgacaaca 20
<210> 37
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 10
<400> 37
gtcgtgacgt tctgctgttc 20
<210> 38
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 11
<400> 38
acgtcacgac tttcgtcacg gaatttctca 30
<210> 39
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 12
<400> 39
tttcgtcacg gaatttctca 20
<210> 40
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 13
<400> 40
tccgttcgct aagggcggta 20
<210> 41
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 14
<400> 41
acctgatgtg tccgttcgct 20
<210> 42
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 15
<400> 42
cacatcaggt gagctcctga aaatctcgat aac 33
<210> 43
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 17
<400> 43
tccgttcgct aagggcggta 20
<210> 44
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 18
<400> 44
acctgatgtg tccgttcgct 20
<210> 45
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 2
<400> 45
aggagaaaag caattttatt atagtaattg actattatac 40
<210> 46
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 3
<400> 46
cttttctcct ggccgggact gttttagagc tagaaatagc 40
<210> 47
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 4
<400> 47
ggccgggact gttttagagc tagaaatagc 30
<210> 48
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 7
<400> 48
atgccggccg tttagtctcg gtaagcggga 30
<210> 49
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 8
<400> 49
tttagtctcg gtaagcggga 20
<210> 50
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 9
<400> 50
gaaaaggccg taaatcaata tgccg 25
<210> 51
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 10
<400> 51
ccggccagga gaaaaggccg taaat 25
<210> 52
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 11
<400> 52
tcctggccgg aactgggatt tgtcggggca 30
<210> 53
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 12
<400> 53
aactgggatt tgtcggggca 20
<210> 54
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 13
<400> 54
gcagagccga gcgcacttcc 20
<210> 55
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 14
<400> 55
gatcgtagaa gcagagccga 20
<210> 56
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 15
<400> 56
ttctacgatc gagctcctga aaatctcgat aac 33
<210> 57
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 17
<400> 57
cttttctggt cagctggctg 20
<210> 58
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer 18
<400> 58
ttagccaggc gatggccgtt 20
<210> 59
<211> 3234
<212> DNA
<213> Artificial Sequence
<220>
<223> pDSG
<400> 59
tttccatagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg 60
gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg 120
ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag 180
cgtggcgctt tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc 240
caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa 300
ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg 360
taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc 420
taactacggc tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac 480
cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg 540
tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt 600
gatcttttct acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt 660
catgagatta tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa 720
atcaatctaa agtatatatg agtaaacttg gtctgacagt taaaagattg acagtataat 780
agtcaattac tataataaaa ttgatgcgca ggtgaaacac ctgctgcagt tttagagcta 840
gaaatagcaa gttaaaataa ggctagtccg ttatcaactt gaaaaagtgg caccgagtcg 900
gtgctttttt tcgttttccg ggacgccctc gcggacgtgc tcatagtcca cgacgcccgt 960
gattttgtag ccctggccga cggccagcag gtaggccgac aggctcatgc cggccggagc 1020
tcctgaaaat ctcgataact caaaaaatac gcccggtagt gatcttattt cattatggtg 1080
aaagttggaa cctcttagga tcctctagat ttaagaagga gatatacata aaactccttc 1140
tgagctagtt ctctagcatt ctattatttt gattcgacac cttaataata gcagaaggag 1200
tttttacctg tcaaagaacc atcaaaccct tgatacacaa ggctttgacc taattttgaa 1260
aaatgatgtt gtttctatat agtatcaaga taagaaagaa aaggattttt cgctacgctc 1320
aaatcctttc ccgtcacggg cttctcaggg cgttttatgg cgggtctgct atgtggtgct 1380
atctgacttt ttgctgttca gcagttcctg ccctctgatt ttccagtctg accacttcgg 1440
attatcccgt gacaggtcat tcagactggc taatgcaccc agtaaggcag cggtatcatc 1500
aacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt 1560
atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta 1620
aagtatatat gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg aggcacctat 1680
ctcagcgatc tgtctatttc gttcatccat agttgcctga ctccccgtcg tgtagataac 1740
tacgatacgg gagggcttac catctggccc cagtgctgca atgataccgc gagacccacg 1800
ctcaccggct ccagatttat cagcaataaa ccagccagcc ggaagggccg agcgcagaag 1860
tggtcctgca actttatccg cctccatcca gtctattaat tgttgccggg aagctagagt 1920
aagtagttcg ccagttaata gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt 1980
gtcacgctcg tcgtttggta tggcttcatt cagctccggt tcccaacgat caaggcgagt 2040
tacatgatcc cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt 2100
cagaagtaag ttggccgcag tgttatcact catggttatg gcagcactgc ataattctct 2160
tactgtcatg ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt 2220
ctgagaatag tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac gggataatac 2280
cgcgccacat agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa 2340
actctcaagg atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacccaa 2400
ctgatcttca gcatctttta ctttcaccag cgtttctggg tgagcaaaaa caggaaggca 2460
aaatgccgca aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct 2520
ttttcaatat tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga 2580
atgtatttag aaaaataaac aaataggggt tccgcgcaca tttccccgaa aagtgccacc 2640
aaaaaaacac aaaagaccac attttttaat gtggtcttta ttcttcaact aaagcaccca 2700
ttagttcaac aaacgaaaat tggataaagt gggatatttt taaaatatat atttatgtta 2760
cagtaagctg cctcgcgcgt ttcggtgatg acggtgaaaa cctctgacac atgcagctcc 2820
cggagacggt cacagcttgt ctgtaagcgg atgccgggag cagacaagcc cgtcagggcg 2880
cgtcagcggg tgttggcggg tgtcggggcg cagccatgac ccagtcacgt agcgatagcg 2940
gagtgtatac tggcttaact atgcggcatc agagcagatt gtactgagag tgcaccatat 3000
gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac cgcatcaggc gctcttccgc 3060
ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca 3120
ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg 3180
agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtt 3234
<210> 60
<211> 9656
<212> DNA
<213> Artificial Sequence
<220>
<223> pDS9
<400> 60
aaagccatga caaaaacgcg taacaaaagt gtctataatc acggcagaaa agtccacatt 60
gattatttgc acggcgtcac actttgctat gccatagcat ttttatccat aagattagcg 120
gatcctacct gacgcttttt atcgcaactc tctactgttt ctccataccc gtttttttgg 180
gaattcgagc tctaaggagg ttataaaaaa tggatattaa tactgaaact gagatcaagc 240
aaaagcattc actaaccccc tttcctgttt tcctaatcag cccggcattt cgcgggcgat 300
attttcacag ctatttcagg agttcagcca tgaacgctta ttacattcag gatcgtcttg 360
aggctcagag ctgggcgcgt cactaccagc agctcgcccg tgaagagaaa gaggcagaac 420
tggcagacga catggaaaaa ggcctgcccc agcacctgtt tgaatcgcta tgcatcgatc 480
atttgcaacg ccacggggcc agcaaaaaat ccattacccg tgcgtttgat gacgatgttg 540
agtttcagga gcgcatggca gaacacatcc ggtacatggt tgaaaccatt gctcaccacc 600
aggttgatat tgattcagag gtataaaacg aatgagtact gcactcgcaa cgctggctgg 660
gaagctggct gaacgtgtcg gcatggattc tgtcgaccca caggaactga tcaccactct 720
tcgccagacg gcatttaaag gtgatgccag cgatgcgcag ttcatcgcat tactgatcgt 780
tgccaaccag tacggcctta atccgtggac gaaagaaatt tacgcctttc ctgataagca 840
gaatggcatc gttccggtgg tgggcgttga tggctggtcc cgcatcatca atgaaaacca 900
gcagtttgat ggcatggact ttgagcagga caatgaatcc tgtacatgcc ggatttaccg 960
caaggaccgt aatcatccga tctgcgttac cgaatggatg gatgaatgcc gccgcgaacc 1020
attcaaaact cgcgaaggca gagaaatcac ggggccgtgg cagtcgcatc ccaaacggat 1080
gttacgtcat aaagccatga ttcagtgtgc ccgtctggcc ttcggatttg ctggtatcta 1140
tgacaaggat gaagccgagc gcattgtcga aaatactgca tacactgcag aacgtcagcc 1200
ggaacgcgac atcactccgg ttaacgatga aaccatgcag gagattaaca ctctgctgat 1260
cgccctggat aaaacatggg atgacgactt attgccgctc tgttcccaga tatttcgccg 1320
cgacattcgt gcatcgtcag aactgacaca ggccgaagca gtaaaagctc ttggattcct 1380
gaaacagaaa gccgcagagc agaaggtggc agcatgacac cggacattat cctgcagcgt 1440
accgggatcg atgtgagagc tgtcgaacag ggggatgatg cgtggcacaa attacggctc 1500
ggcgtcatca ccgcttcaga agttcacaac gtgatagcaa aaccccgctc cggaaagaag 1560
tggcctgaca tgaaaatgtc ctacttccac accctgcttg ctgaggtttg caccggtgtg 1620
gctccggaag ttaacgctaa agcactggcc tggggaaaac agtacgagaa cgacgccaga 1680
accctgtttg aattcacttc cggcgtgaat gttactgaat ccccgatcat ctatcgcgac 1740
gaaagtatgc gtaccgcctg ctctcccgat ggtttatgca gtgacggcaa cggccttgaa 1800
ctgaaatgcc cgtttacctc ccgggatttc atgaagttcc ggctcggtgg tttcgaggcc 1860
ataaagtcag cttacatggc ccaggtgcag tacagcatgt gggtgacgcg aaaaaatgcc 1920
tggtactttg ccaactatga cccgcgtatg aagcgtgaag gcctgcatta tgtcgtgatt 1980
gagcgggatg aaaagtacat ggcgagtttt gacgagatcg tgccggagtt catcgaaaaa 2040
atggacgagg cactggctga aattggtttt gtatttgggg agcaatggcg atgagatcta 2100
aagaggagaa aggatctatg gataagaaat actcaatagg cttagatatc ggcacaaata 2160
gcgtcggatg ggcggtgatc actgatgaat ataaggttcc gtctaaaaag ttcaaggttc 2220
tgggaaatac agaccgccac agtatcaaaa aaaatcttat aggggctctt ttatttgaca 2280
gtggagagac agcggaagcg actcgtctca aacggacagc tcgtagaagg tatacacgtc 2340
ggaagaatcg tatttgttat ctacaggaga ttttttcaaa tgagatggcg aaagtagatg 2400
atagtttctt tcatcgactt gaagagtctt ttttggtgga agaagacaag aagcatgaac 2460
gtcatcctat ttttggaaat atagtagatg aagttgctta tcatgagaaa tatccaacta 2520
tctatcatct gcgaaaaaaa ttggtagatt ctactgataa agcggatttg cgcttaatct 2580
atttggcctt agcgcatatg attaagtttc gtggtcattt tttgattgag ggagatttaa 2640
atcctgataa tagtgatgtg gacaaactat ttatccagtt ggtacaaacc tacaatcaat 2700
tatttgaaga aaaccctatt aacgcaagtg gagtagatgc taaagcgatt ctttctgcac 2760
gattgagtaa atcaagacga ttagaaaatc tcattgctca gctccccggt gagaagaaaa 2820
atggcttatt tgggaatctc attgctttgt cattgggttt gacccctaat tttaaatcaa 2880
attttgattt ggcagaagat gctaaattac agctttcaaa agatacttac gatgatgatt 2940
tagataattt attggcgcaa attggagatc aatatgctga tttgtttttg gcagctaaga 3000
atttatcaga tgctatttta ctttcagata tcctaagagt aaatactgaa ataactaagg 3060
ctcccctatc agcttcaatg attaaacgct acgatgaaca tcatcaagac ttgactcttt 3120
taaaagcttt agttcgacaa caacttccag aaaagtataa agaaatcttt tttgatcaat 3180
caaaaaacgg atatgcaggt tatattgatg ggggagctag ccaagaagaa ttttataaat 3240
ttatcaaacc aattttagaa aaaatggatg gtactgagga attattggtg aaactaaatc 3300
gtgaagattt gctgcgcaag caacggacct ttgacaacgg ctctattccc catcaaattc 3360
acttgggtga gctgcatgct attttgagaa gacaagaaga cttttatcca tttttaaaag 3420
acaatcgtga gaagattgaa aaaatcttga cttttcgaat tccttattat gttggtccat 3480
tggcgcgtgg caatagtcgt tttgcatgga tgactcggaa gtctgaagaa acaattaccc 3540
catggaattt tgaagaagtt gtcgataaag gtgcttcagc tcaatcattt attgaacgca 3600
tgacaaactt tgataaaaat cttccaaatg aaaaagtact accaaaacat agtttgcttt 3660
atgagtattt tacggtttat aacgaattga caaaggtcaa atatgttact gaaggaatgc 3720
gaaaaccagc atttctttca ggtgaacaga agaaagccat tgttgattta ctcttcaaaa 3780
caaatcgaaa agtaaccgtt aagcaattaa aagaagatta tttcaaaaaa atagaatgtt 3840
ttgatagtgt tgaaatttca ggagttgaag atagatttaa tgcttcatta ggtacctacc 3900
atgatttgct aaaaattatt aaagataaag attttttgga taatgaagaa aatgaagata 3960
tcttagagga tattgtttta acattgacct tatttgaaga tagggagatg attgaggaaa 4020
gacttaaaac atatgctcac ctctttgatg ataaggtgat gaaacagctt aaacgtcgcc 4080
gttatactgg ttggggacgt ttgtctcgaa aattgattaa tggtattagg gataagcaat 4140
ctggcaaaac aatattagat tttttgaaat cagatggttt tgccaatcgc aattttatgc 4200
agctgatcca tgatgatagt ttgacattta aagaagacat tcaaaaagca caagtgtctg 4260
gacaaggcga tagtttacat gaacatattg caaatttagc tggtagccct gctattaaaa 4320
aaggtatttt acagactgta aaagttgttg atgaattggt caaagtaatg gggcggcata 4380
agccagaaaa tatcgttatt gaaatggcac gtgaaaatca gacaactcaa aagggccaga 4440
aaaattcgcg agagcgtatg aaacgaatcg aagaaggtat caaagaatta ggaagtcaga 4500
ttcttaaaga gcatcctgtt gaaaatactc aattgcaaaa tgaaaagctc tatctctatt 4560
atctccaaaa tggaagagac atgtatgtgg accaagaatt agatattaat cgtttaagtg 4620
attatgatgt cgatcacatt gttccacaaa gtttccttaa agacgattca atagacaata 4680
aggtcttaac gcgttctgat aaaaatcgtg gtaaatcgga taacgttcca agtgaagaag 4740
tagtcaaaaa gatgaaaaac tattggagac aacttctaaa cgccaagtta atcactcaac 4800
gtaagtttga taatttaacg aaagctgaac gtggaggttt gagtgaactt gataaagctg 4860
gttttatcaa acgccaattg gttgaaactc gccaaatcac taagcatgtg gcacaaattt 4920
tggatagtcg catgaatact aaatacgatg aaaatgataa acttattcga gaggttaaag 4980
tgattacctt aaaatctaaa ttagtttctg acttccgaaa agatttccaa ttctataaag 5040
tacgtgagat taacaattac catcatgccc atgatgcgta tctaaatgcc gtcgttggaa 5100
ctgctttgat taagaaatat ccaaaacttg aatcggagtt tgtctatggt gattataaag 5160
tttatgatgt tcgtaaaatg attgctaagt ctgagcaaga aataggcaaa gcaaccgcaa 5220
aatatttctt ttactctaat atcatgaact tcttcaaaac agaaattaca cttgcaaatg 5280
gagagattcg caaacgccct ctaatcgaaa ctaatgggga aactggagaa attgtctggg 5340
ataaagggcg agattttgcc acagtgcgca aagtattgtc catgccccaa gtcaatattg 5400
tcaagaaaac agaagtacag acaggcggat tctccaagga gtcaatttta ccaaaaagaa 5460
attcggacaa gcttattgct cgtaaaaaag actgggatcc aaaaaaatat ggtggttttg 5520
atagtccaac ggtagcttat tcagtcctag tggttgctaa ggtggaaaaa gggaaatcga 5580
agaagttaaa atccgttaaa gagttactag ggatcacaat tatggaaaga agttcctttg 5640
aaaaaaatcc gattgacttt ttagaagcta aaggatataa ggaagttaaa aaagacttaa 5700
tcattaaact acctaaatat agtctttttg agttagaaaa cggtcgtaaa cggatgctgg 5760
ctagtgccgg agaattacaa aaaggaaatg agctggctct gccaagcaaa tatgtgaatt 5820
ttttatattt agctagtcat tatgaaaagt tgaagggtag tccagaagat aacgaacaaa 5880
aacaattgtt tgtggagcag cataagcatt atttagatga gattattgag caaatcagtg 5940
aattttctaa gcgtgttatt ttagcagatg ccaatttaga taaagttctt agtgcatata 6000
acaaacatag agacaaacca atacgtgaac aagcagaaaa tattattcat ttatttacgt 6060
tgacgaatct tggagctccc gctgctttta aatattttga tacaacaatt gatcgtaaac 6120
gatatacgtc tacaaaagaa gttttagatg ccactcttat ccatcaatcc atcactggtc 6180
tttatgaaac acgcattgat ttgagtcagc taggaggtga ctaacgcatc ctcacgataa 6240
tatccgggta ggcgcaatca ctttcgtcta ctccgttaca aagcgaggct gggtatttcc 6300
cggcctttct gttatccgaa atccactgaa agcacagcgg ctggctgagg agataaataa 6360
taaacgaggg gctgtatgca caaagcatct tctgttgagt taagaacgag tatcgagatg 6420
gcacatagcc ttgctcaaat tggaatcagg tttgtgccaa taccagtaga aacagacgaa 6480
gaatccatgg gtatggacag ttttcccttt gatatgtaac ggtgaacagt tgttctactt 6540
ttgtttgtta gtcttgatgc ttcactgata gatacaagag ccaaatctag taatatttta 6600
tctgattaat aagatgatct tcttgagatc gttttggtct gcgcgtaatc tcttgctctg 6660
aaaacgaaaa aaccgccttg cagggcggtt tttcgaaggt tctctgagct accaactctt 6720
tgaaccgagg taactggctt ggaggagcgc agtcaccaaa acttgtcctt tcagtttagc 6780
cttaaccggc gcatgacttc aagactaact cctctaaatc aattaccagt ggctgctgcc 6840
agtggtgctt ttgcatgtct ttccgggttg gactcaagac gatagttacc ggataaggcg 6900
cagcggtcgg actgaacggg gggttcgtgc atacagtcca gcttggagcg aactgcctac 6960
ccggaactga gtgtcaggcg tggaatgaga caaacgcggc cataacagcg gaatgacacc 7020
ggtaaaccga aaggcaggaa caggagagcg cacgagggag ccgccagggg gaaacgcctg 7080
gtatctttat agtcctgtcg ggtttcgcca ccactgattt gagcgtcaga tttcgtgatg 7140
cttgtcaggg gggcggagcc tatggaaaaa cggctttgcc gcggccccgc tgaatattcc 7200
ttttgtctcc gaccatcagg cacctgagtc gctgtctttt tcgtgacatt cagttcgctg 7260
cgctcacggc tctggcagtg aatgggggta aatggcacta caggcgcctt ttatggattc 7320
atgcaaggaa actacccata atacaagaaa agatgtggtc tttattcttc aactaaagca 7380
cccattagtt caacaaacga aaattggata aagtgggata tttttaaaat atatatttat 7440
gttacctggc agagcctgta ctttttacag tcggttttct aatgtcacta acctgccccg 7500
ttagtcgcca tttgcatcga tttattatga caacttgacg gctacatcat tcactttttc 7560
ttcacaaccg gcacggaact cgctcgggct ggccccggtg cattttttaa atacccgcga 7620
gaaatagagt tgatcgtcaa aaccaacatt gcgaccgacg gtggcgatag gcatccgggt 7680
ggtgctcaaa agcagcttcg cctggctgat acgttggtcc tcgcgccagc ttaagacgct 7740
aatccctaac tgctggcgga aaagatgtga cagacgcgac ggcgacaagc aaacatgctg 7800
tgcgacgctg gcgatatcaa aattgctgtc tgccaggtga tcgctgatgt actgacaagc 7860
ctcgcgtacc cgattatcca tcggtggatg gagcgactcg ttaatcgctt ccatgcgccg 7920
cagtaacaat tgctcaagca gatttatcgc cagcagctcc gaatagcgcc cttccccttg 7980
cccggcgtta atgatttgcc caaacaggtc gctgaaatgc ggctggtgcg cttcatccgg 8040
gcgaaagaac cccgtattgg caaatattga cggccagtta agccattcat gccagtaggc 8100
gcgcggacga aagtaaaccc actggtgata ccattcgcga gcctccggat gacgaccgta 8160
gtgatgaatc tctcctggcg ggaacagcaa aatatcaccc ggtcggcaaa caaattctcg 8220
tccctgattt ttcaccaccc cctgaccgcg aatggtgaga ttgagaatat aacctttcat 8280
tcccagcggt cggtcgataa aaaaatcgag ataaccgttg gcctcaatcg gcgttaaacc 8340
cgccaccaga tgggcattaa acgagtatcc cggcagcagg ggatcatttt gcgcttcagc 8400
catacttttc atactcccgc cattcagaga agaaaccaat tgtccatatt gcatcagaca 8460
ttgccgtcac tgcgtctttt actggctctt ctcgctaacc aaaccggtaa ccccgcttat 8520
taaaagcatt ctgtaacaaa gcgggaccag taatagaaag tgagggagcc acggttgatg 8580
agagctttgt tgtaggtgga ccagttggtg attttgaact tttgctttgc cacggaacgg 8640
tctgcgttgt cgggaagatg cgtgatctga tccttcaact cagcaaaagt tcgatttatt 8700
caacaaagcc acgttgtgtc tcaaaatctc tgatgttaca ttgcacaaga taaaaatata 8760
tcatcatgaa caataaaact gtctgcttac ataaacagta atacaagggg tgttatgagc 8820
catattcaac gggaaacgtc ttgctcgagg ccgcgattaa attccaacat ggatgctgat 8880
ttatatgggt ataaatgggc tcgcgataat gtcgggcaat caggtgcgac aatctatcga 8940
ttgtatggga agcccgatgc gccagagttg tttctgaaac atggcaaagg tagcgttgcc 9000
aatgatgtta cagatgagat ggtcagacta aactggctga cggaatttat gcctcttccg 9060
accatcaagc attttatccg tactcctgat gatgcatggt tactcaccac tgcgatcccc 9120
gggaaaacag cattccaggt attagaagaa tatcctgatt caggtgaaaa tattgttgat 9180
gcgctggcag tgttcctgcg ccggttgcat tcgattcctg tttgtaattg tccttttaac 9240
agcgatcgcg tatttcgtct cgctcaggcg caatcacgaa tgaataacgg tttggttgat 9300
gcgagtgatt ttgatgacga gcgtaatggc tggcctgttg aacaagtctg gaaagaaatg 9360
cataagcttt tgccattctc accggattca gtcgtcactc atggtgattt ctcacttgat 9420
aaccttattt ttgacgaggg gaaattaata ggttgtattg atgttggacg agtcggaatc 9480
gcagaccgat accaggatct tgccatccta tggaactgcc tcggtgagtt ttctccttca 9540
ttacagaaac ggctttttca aaaatatggt attgataatc ctgatatgaa taaattgcag 9600
tttcatttga tgctcgatga gtttttctaa tcagaattgg ttaattggtt gtaaca 9656
Claims (7)
- 서열번호 7의 아미노산 서열에서 136번째 트립토판이 종결 코돈으로 치환된, 서열번호 5의 아미노산 서열로 이루어진 아데닌 수송단백질 변이체.
- 청구항 1의 변이체를 암호화하는 폴리뉴클레오티드.
- 청구항 1의 변이체 또는 청구항 2의 폴리뉴클레오티드를 포함하는 형질전환체.
- 청구항 3에 있어서,
상기 형질전환체는 에스케리치아(Escherichia) 속 균주인 것인 형질전환체. - 청구항 3에 있어서,
상기 형질전환체는 L-방향족 아미노산 생산능을 가지는 것인 형질전환체. - 청구항 3의 형질전환체를 배지에서 배양하는 단계; 및
상기 형질전환체 또는 형질전환체가 배양된 배지로부터 L-방향족 아미노산을 회수하는 단계를 포함하는 L-방향족 아미노산의 생산 방법. - 청구항 6에 있어서,
상기 L-방향족 아미노산은 L-트립토판, L-페닐알라닌 및 L-티로신으로 이루어진 군에서 선택된 1종 이상인 것인 방법.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020230098039A KR20230123452A (ko) | 2022-02-16 | 2023-07-27 | 수송단백질 신규 변이체 및 이를 이용한 l-방향족 아미노산생산 방법 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020220020012A KR20230123544A (ko) | 2022-02-16 | 2022-02-16 | 수송단백질 신규 변이체 및 이를 이용한 l-방향족 아미노산 생산 방법 |
KR1020230098039A KR20230123452A (ko) | 2022-02-16 | 2023-07-27 | 수송단백질 신규 변이체 및 이를 이용한 l-방향족 아미노산생산 방법 |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020220020012A Division KR20230123544A (ko) | 2022-02-16 | 2022-02-16 | 수송단백질 신규 변이체 및 이를 이용한 l-방향족 아미노산 생산 방법 |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20230123452A true KR20230123452A (ko) | 2023-08-23 |
Family
ID=87578518
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020220020012A KR20230123544A (ko) | 2022-02-16 | 2022-02-16 | 수송단백질 신규 변이체 및 이를 이용한 l-방향족 아미노산 생산 방법 |
KR1020230098040A KR20230123453A (ko) | 2022-02-16 | 2023-07-27 | 수송단백질 신규 변이체 및 이를 이용한 l-방향족 아미노산생산 방법 |
KR1020230098039A KR20230123452A (ko) | 2022-02-16 | 2023-07-27 | 수송단백질 신규 변이체 및 이를 이용한 l-방향족 아미노산생산 방법 |
Family Applications Before (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020220020012A KR20230123544A (ko) | 2022-02-16 | 2022-02-16 | 수송단백질 신규 변이체 및 이를 이용한 l-방향족 아미노산 생산 방법 |
KR1020230098040A KR20230123453A (ko) | 2022-02-16 | 2023-07-27 | 수송단백질 신규 변이체 및 이를 이용한 l-방향족 아미노산생산 방법 |
Country Status (3)
Country | Link |
---|---|
KR (3) | KR20230123544A (ko) |
CN (3) | CN117043178A (ko) |
WO (1) | WO2023158175A1 (ko) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102251946B1 (ko) * | 2019-10-31 | 2021-05-17 | 대상 주식회사 | yeeO 유전자 불활성에 의해 방향족 아미노산 생산능력이 향상된 균주 |
CN117486984B (zh) * | 2023-11-07 | 2024-06-21 | 苏州华赛生物工程技术有限公司 | 转运蛋白KefG在提高L-肌肽产量中的应用 |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101830002B1 (ko) | 2016-10-11 | 2018-02-19 | 대상 주식회사 | 부기질의 공급 강화를 통한 l-트립토판이 과발현되는 균주 및 이를 이용하는 l-트립토판의 제조 방법 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5662167B2 (ja) * | 2009-02-09 | 2015-01-28 | 協和発酵バイオ株式会社 | L−アミノ酸の製造法 |
CN108949647B (zh) * | 2018-04-19 | 2020-12-29 | 江南大学 | 一种工程菌及其在生产l-酪氨酸中的应用 |
KR102251946B1 (ko) * | 2019-10-31 | 2021-05-17 | 대상 주식회사 | yeeO 유전자 불활성에 의해 방향족 아미노산 생산능력이 향상된 균주 |
-
2022
- 2022-02-16 KR KR1020220020012A patent/KR20230123544A/ko not_active Application Discontinuation
-
2023
- 2023-02-14 CN CN202380008895.0A patent/CN117043178A/zh active Pending
- 2023-02-14 CN CN202311729862.3A patent/CN117843736A/zh active Pending
- 2023-02-14 WO PCT/KR2023/002096 patent/WO2023158175A1/ko active Application Filing
- 2023-02-14 CN CN202311729811.0A patent/CN117964724A/zh active Pending
- 2023-07-27 KR KR1020230098040A patent/KR20230123453A/ko not_active Application Discontinuation
- 2023-07-27 KR KR1020230098039A patent/KR20230123452A/ko not_active Application Discontinuation
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101830002B1 (ko) | 2016-10-11 | 2018-02-19 | 대상 주식회사 | 부기질의 공급 강화를 통한 l-트립토판이 과발현되는 균주 및 이를 이용하는 l-트립토판의 제조 방법 |
Also Published As
Publication number | Publication date |
---|---|
CN117843736A (zh) | 2024-04-09 |
CN117043178A (zh) | 2023-11-10 |
KR20230123453A (ko) | 2023-08-23 |
WO2023158175A1 (ko) | 2023-08-24 |
CN117964724A (zh) | 2024-05-03 |
KR20230123544A (ko) | 2023-08-24 |
WO2023158175A9 (ko) | 2023-09-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102421920B1 (ko) | Atp 신타아제 신규 변이체 및 이를 이용한 l-방향족 아미노산 생산 방법 | |
KR20230123452A (ko) | 수송단백질 신규 변이체 및 이를 이용한 l-방향족 아미노산생산 방법 | |
AU2012267271B2 (en) | dsRNA endoribonucleases | |
DK3027733T3 (en) | Preparation of 3-Hydroxypropionic Acid in Recombinant Yeast Expressing an Insect Aspartate-1 Decarboxylase | |
KR102096604B1 (ko) | 신규한 crispr 연관 단백질 및 이의 용도 | |
KR20200023455A (ko) | T7 rna 폴리머라제 변이체 | |
CN110540989A (zh) | 基于pcr技术克隆已知区域旁邻的未知dna序列的引物及方法 | |
JP4663631B2 (ja) | 放線菌由来のampデアミナーゼ及びその利用 | |
KR102421911B1 (ko) | 징크 바인딩 디하이드로게나제 신규 변이체 및 이를 이용한 l-방향족 아미노산 생산 방법 | |
CN101302531A (zh) | 一种在大肠杆菌-链霉菌-假单胞菌之间穿梭表达的bac载体及其构建方法 | |
CN109666657B (zh) | 提升耐热性的葡萄糖氧化酶 | |
KR20230123454A (ko) | 스트레스 단백질 신규 변이체 및 이를 이용한 l-방향족아미노산 생산 방법 | |
KR20230127949A (ko) | 시그마 38 신규 변이체 및 이를 이용한 l-방향족 아미노산 생산 방법 | |
KR20230134060A (ko) | 아세토하이드록시산 신타아제 신규 변이체 및 이를 이용한 l-발린 생산 방법 | |
CN109593695B (zh) | 一种在枯草芽孢杆菌芽孢表面展示葡萄糖氧化酶的方法与应用 | |
PT698106E (pt) | Marcador genetico | |
AU2017329752B2 (en) | Method for selective carbon source-independent expression of protein-encoding sequences in a filamentous fungus cell | |
KR101683302B1 (ko) | 박테리아 세포 내 로커스 증폭 방법 | |
KR102077931B1 (ko) | 산화적 스트레스에 대한 저항성 및 내성을 증가시키는 바실러스 종 pamc22784 유래 펩타이드 | |
CN113950532A (zh) | 哺乳动物表达载体 | |
US20100304461A1 (en) | Portable, Temperature and Chemically Inducible Expression Vector for High Cell Density Expression of Heterologous Genes in Escherichia Coli | |
CN116904484A (zh) | 可提高核苷产量的dna片段、菌株和生产方法 | |
CN110777156A (zh) | 一种iapv感染性克隆的构建方法与应用 | |
PL180091B1 (pl) | Szczepionka, zwlaszcza do podawania doustnego lub donosowego PL PL | |
CN116144564A (zh) | 一种谷氨酸生产菌株的构建方法及其在生产谷氨酸中的用途 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A107 | Divisional application of patent | ||
E902 | Notification of reason for refusal |