CN101193911B - 半胱氨酸减少的疏水蛋白融合蛋白、其生产和用途 - Google Patents
半胱氨酸减少的疏水蛋白融合蛋白、其生产和用途 Download PDFInfo
- Publication number
- CN101193911B CN101193911B CN2006800205772A CN200680020577A CN101193911B CN 101193911 B CN101193911 B CN 101193911B CN 2006800205772 A CN2006800205772 A CN 2006800205772A CN 200680020577 A CN200680020577 A CN 200680020577A CN 101193911 B CN101193911 B CN 101193911B
- Authority
- CN
- China
- Prior art keywords
- ala
- gly
- val
- leu
- glu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 101710091977 Hydrophobin Proteins 0.000 title claims description 76
- 238000004519 manufacturing process Methods 0.000 title abstract description 11
- 235000018417 cysteine Nutrition 0.000 title description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 title description 2
- 108020001507 fusion proteins Proteins 0.000 title description 2
- 102000037865 fusion proteins Human genes 0.000 title description 2
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 79
- 229920001184 polypeptide Polymers 0.000 claims abstract description 35
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 35
- 108090000623 proteins and genes Proteins 0.000 claims description 57
- 102000004169 proteins and genes Human genes 0.000 claims description 36
- 238000000034 method Methods 0.000 claims description 29
- 150000001413 amino acids Chemical class 0.000 claims description 28
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 claims description 27
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims description 23
- 241000894006 Bacteria Species 0.000 claims description 18
- 150000007523 nucleic acids Chemical class 0.000 claims description 18
- 108020004707 nucleic acids Proteins 0.000 claims description 16
- 102000039446 nucleic acids Human genes 0.000 claims description 16
- 239000011521 glass Substances 0.000 claims description 14
- 229930182817 methionine Natural products 0.000 claims description 14
- 101100387133 Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) dewA gene Proteins 0.000 claims description 13
- 239000004471 Glycine Substances 0.000 claims description 10
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 claims description 10
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 claims description 9
- -1 Serine Chemical compound 0.000 claims description 9
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 9
- 239000004473 Threonine Substances 0.000 claims description 9
- 230000000968 intestinal effect Effects 0.000 claims description 6
- 101100215658 Aspergillus parasiticus (strain ATCC 56775 / NRRL 5862 / SRRC 143 / SU-1) aflY gene Proteins 0.000 claims description 3
- 101100232322 Bradyrhizobium diazoefficiens (strain JCM 10833 / BCRC 13528 / IAM 13628 / NBRC 14792 / USDA 110) hypA2 gene Proteins 0.000 claims description 3
- 101100059802 Chlamydophila caviae (strain ATCC VR-813 / DSM 19441 / GPIC) groEL1 gene Proteins 0.000 claims description 3
- 101100365080 Clostridium perfringens (strain 13 / Type A) scpB gene Proteins 0.000 claims description 3
- 101100232321 Synechocystis sp. (strain PCC 6803 / Kazusa) hypA1 gene Proteins 0.000 claims description 3
- 101150077981 groEL gene Proteins 0.000 claims description 3
- 101150006844 groES gene Proteins 0.000 claims description 3
- 101150081485 hypA gene Proteins 0.000 claims description 3
- 101150019385 hypB gene Proteins 0.000 claims description 3
- 238000000926 separation method Methods 0.000 claims description 3
- 239000011248 coating agent Substances 0.000 claims 1
- 238000000576 coating method Methods 0.000 claims 1
- 108010050848 glycylleucine Proteins 0.000 description 52
- 108020004414 DNA Proteins 0.000 description 36
- 235000018102 proteins Nutrition 0.000 description 35
- 241000282326 Felis catus Species 0.000 description 33
- 230000004927 fusion Effects 0.000 description 30
- 108010005233 alanylglutamic acid Proteins 0.000 description 28
- 229940024606 amino acid Drugs 0.000 description 26
- 235000001014 amino acid Nutrition 0.000 description 26
- 108010038633 aspartylglutamate Proteins 0.000 description 24
- 108010049041 glutamylalanine Proteins 0.000 description 18
- 108010061238 threonyl-glycine Proteins 0.000 description 18
- 230000014509 gene expression Effects 0.000 description 17
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 16
- 108010016616 cysteinylglycine Proteins 0.000 description 16
- 210000004027 cell Anatomy 0.000 description 15
- 239000013612 plasmid Substances 0.000 description 15
- 108010038320 lysylphenylalanine Proteins 0.000 description 14
- 108010073969 valyllysine Proteins 0.000 description 14
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 14
- 230000001276 controlling effect Effects 0.000 description 13
- 108010047857 aspartylglycine Proteins 0.000 description 12
- 108010004073 cysteinylcysteine Proteins 0.000 description 12
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 11
- 239000012634 fragment Substances 0.000 description 11
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 10
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 10
- 241000233866 Fungi Species 0.000 description 10
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 10
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 10
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 10
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 10
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 10
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 10
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 10
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 10
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 10
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 10
- 239000004141 Sodium laurylsulphate Substances 0.000 description 10
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 10
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 10
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 10
- 108010047495 alanylglycine Proteins 0.000 description 10
- 108010087924 alanylproline Proteins 0.000 description 10
- 244000005700 microbiome Species 0.000 description 10
- 235000019333 sodium laurylsulphate Nutrition 0.000 description 10
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 10
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 9
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 9
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 8
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 8
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 8
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 8
- VRZDJJWOFXMFRO-ZFWWWQNUSA-N Arg-Gly-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VRZDJJWOFXMFRO-ZFWWWQNUSA-N 0.000 description 8
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 8
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 description 8
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 8
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 8
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 8
- YVHGKXAOSVBGJV-CIUDSAMLSA-N Asp-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N YVHGKXAOSVBGJV-CIUDSAMLSA-N 0.000 description 8
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 8
- GMXSSZUVDNPRMA-FXQIFTODSA-N Cys-Arg-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GMXSSZUVDNPRMA-FXQIFTODSA-N 0.000 description 8
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 8
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 8
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 8
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 8
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 8
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 8
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 8
- RXESHTOTINOODU-JYJNAYRXSA-N Glu-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RXESHTOTINOODU-JYJNAYRXSA-N 0.000 description 8
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 8
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 8
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 8
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 8
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 8
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 8
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 8
- JSQIXEHORHLQEE-MEYUZBJRSA-N His-Phe-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JSQIXEHORHLQEE-MEYUZBJRSA-N 0.000 description 8
- CISBRYJZMFWOHJ-JBDRJPRFSA-N Ile-Ala-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N CISBRYJZMFWOHJ-JBDRJPRFSA-N 0.000 description 8
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 8
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 8
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 8
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 8
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 8
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 8
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 8
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 8
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 8
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 8
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 8
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 8
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 8
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 8
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 8
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 8
- FVKRQMQQFGBXHV-QXEWZRGKSA-N Met-Asp-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FVKRQMQQFGBXHV-QXEWZRGKSA-N 0.000 description 8
- FWTBMGAKKPSTBT-GUBZILKMSA-N Met-Gln-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FWTBMGAKKPSTBT-GUBZILKMSA-N 0.000 description 8
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 8
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 8
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 8
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 8
- 108091034117 Oligonucleotide Proteins 0.000 description 8
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 8
- DXWNFNOPBYAFRM-IHRRRGAJSA-N Phe-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N DXWNFNOPBYAFRM-IHRRRGAJSA-N 0.000 description 8
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 8
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 8
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 8
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 8
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 8
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 8
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 8
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 8
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 8
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 8
- 108010068380 arginylarginine Proteins 0.000 description 8
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 8
- 108010093581 aspartyl-proline Proteins 0.000 description 8
- 108010068265 aspartyltyrosine Proteins 0.000 description 8
- 239000012153 distilled water Substances 0.000 description 8
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 8
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 8
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 8
- 108010089804 glycyl-threonine Proteins 0.000 description 8
- 108010064235 lysylglycine Proteins 0.000 description 8
- 239000002773 nucleotide Substances 0.000 description 8
- 125000003729 nucleotide group Chemical group 0.000 description 8
- 230000001105 regulatory effect Effects 0.000 description 8
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 8
- 108010000998 wheylin-2 peptide Proteins 0.000 description 8
- 230000008859 change Effects 0.000 description 7
- 239000000243 solution Substances 0.000 description 7
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 6
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 6
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 6
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 6
- DVKQPQKQDHHFTE-ZLUOBGJFSA-N Cys-Cys-Asn Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N)C(=O)N DVKQPQKQDHHFTE-ZLUOBGJFSA-N 0.000 description 6
- QADHATDBZXHRCA-ACZMJKKPSA-N Cys-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N QADHATDBZXHRCA-ACZMJKKPSA-N 0.000 description 6
- GDNWBSFSHJVXKL-GUBZILKMSA-N Cys-Lys-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O GDNWBSFSHJVXKL-GUBZILKMSA-N 0.000 description 6
- LTXLIIZACMCQTO-GUBZILKMSA-N Gln-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LTXLIIZACMCQTO-GUBZILKMSA-N 0.000 description 6
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 6
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 6
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 6
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 6
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 6
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 6
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 6
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 6
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 6
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 6
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 6
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 6
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 6
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 6
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 6
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 6
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 6
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 6
- 108010044940 alanylglutamine Proteins 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- 230000002349 favourable effect Effects 0.000 description 6
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 6
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 6
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 6
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 6
- 230000006801 homologous recombination Effects 0.000 description 6
- 238000002744 homologous recombination Methods 0.000 description 6
- 239000007788 liquid Substances 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- 108010015796 prolylisoleucine Proteins 0.000 description 6
- 108010026333 seryl-proline Proteins 0.000 description 6
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 5
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 5
- 108091028043 Nucleic acid sequence Proteins 0.000 description 5
- 241000589516 Pseudomonas Species 0.000 description 5
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 5
- 229910052799 carbon Inorganic materials 0.000 description 5
- 238000004587 chromatography analysis Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 108091008146 restriction endonucleases Proteins 0.000 description 5
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 4
- IGXNPQWXIRIGBF-KEOOTSPTSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IGXNPQWXIRIGBF-KEOOTSPTSA-N 0.000 description 4
- RCQRKPUXJAGEEC-ZLUOBGJFSA-N Ala-Cys-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O RCQRKPUXJAGEEC-ZLUOBGJFSA-N 0.000 description 4
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 4
- 239000004475 Arginine Substances 0.000 description 4
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 4
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 4
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 4
- OSZBYGVKAFZWKC-FXQIFTODSA-N Asn-Pro-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O OSZBYGVKAFZWKC-FXQIFTODSA-N 0.000 description 4
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 4
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 4
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 4
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 4
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 4
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 4
- 241000351920 Aspergillus nidulans Species 0.000 description 4
- 241000193830 Bacillus <bacterium> Species 0.000 description 4
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 4
- YQEHNIKPAOPBNH-DCAQKATOSA-N Cys-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N YQEHNIKPAOPBNH-DCAQKATOSA-N 0.000 description 4
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 4
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 4
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 4
- ATTWDCRXQNKRII-GUBZILKMSA-N Gln-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ATTWDCRXQNKRII-GUBZILKMSA-N 0.000 description 4
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 4
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 4
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 4
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 4
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 4
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 4
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 4
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 4
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 4
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 4
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 4
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 4
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 4
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 4
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 4
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 4
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 4
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 4
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 4
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 4
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 4
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 4
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 4
- YFGWNAROEYWGNL-GUBZILKMSA-N Lys-Gln-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YFGWNAROEYWGNL-GUBZILKMSA-N 0.000 description 4
- ZRACLHJYVRBJFC-ULQDDVLXSA-N Met-Lys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZRACLHJYVRBJFC-ULQDDVLXSA-N 0.000 description 4
- XUYPXLNMDZIRQH-LURJTMIESA-N N-acetyl-L-methionine Chemical compound CSCC[C@@H](C(O)=O)NC(C)=O XUYPXLNMDZIRQH-LURJTMIESA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 4
- UEEVBGHEGJMDDV-AVGNSLFASA-N Phe-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEEVBGHEGJMDDV-AVGNSLFASA-N 0.000 description 4
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 4
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 4
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 4
- SNNSYBWPPVAXQW-ZLUOBGJFSA-N Ser-Cys-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O SNNSYBWPPVAXQW-ZLUOBGJFSA-N 0.000 description 4
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 4
- 239000004809 Teflon Substances 0.000 description 4
- 229920006362 Teflon® Polymers 0.000 description 4
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 4
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 4
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 4
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 4
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 4
- 238000000137 annealing Methods 0.000 description 4
- 230000000845 anti-microbial effect Effects 0.000 description 4
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 4
- 230000004087 circulation Effects 0.000 description 4
- 238000005520 cutting process Methods 0.000 description 4
- 108010054812 diprotin A Proteins 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 238000000855 fermentation Methods 0.000 description 4
- 230000004151 fermentation Effects 0.000 description 4
- 239000000499 gel Substances 0.000 description 4
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 4
- 230000002209 hydrophobic effect Effects 0.000 description 4
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 4
- 235000015097 nutrients Nutrition 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 4
- 229920001817 Agar Polymers 0.000 description 3
- 229920000936 Agarose Polymers 0.000 description 3
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 3
- 241000194107 Bacillus megaterium Species 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- 125000000174 L-prolyl group Chemical group [H]N1C([H])([H])C([H])([H])C([H])([H])[C@@]1([H])C(*)=O 0.000 description 3
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 3
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 3
- 108091005804 Peptidases Proteins 0.000 description 3
- 102000035195 Peptidases Human genes 0.000 description 3
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 3
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 3
- 239000006035 Tryptophane Substances 0.000 description 3
- 239000008272 agar Substances 0.000 description 3
- 230000000844 anti-bacterial effect Effects 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 235000003704 aspartic acid Nutrition 0.000 description 3
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 238000013016 damping Methods 0.000 description 3
- 239000005357 flat glass Substances 0.000 description 3
- 239000012530 fluid Substances 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 229960002989 glutamic acid Drugs 0.000 description 3
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 3
- 229960000310 isoleucine Drugs 0.000 description 3
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 3
- 235000013372 meat Nutrition 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 3
- 238000001556 precipitation Methods 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 239000012064 sodium phosphate buffer Substances 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 230000009182 swimming Effects 0.000 description 3
- BFKJFAAPBSQJPD-UHFFFAOYSA-N tetrafluoroethene Chemical group FC(F)=C(F)F BFKJFAAPBSQJPD-UHFFFAOYSA-N 0.000 description 3
- 229960004799 tryptophan Drugs 0.000 description 3
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- 241000589158 Agrobacterium Species 0.000 description 2
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 2
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 2
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 2
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 2
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 2
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 2
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 2
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 2
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 2
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 2
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 2
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 2
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 2
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 2
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 2
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 2
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 2
- 241000235349 Ascomycota Species 0.000 description 2
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 2
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 2
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 2
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 2
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 2
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 2
- LZLCLRQMUQWUHJ-GUBZILKMSA-N Asn-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N LZLCLRQMUQWUHJ-GUBZILKMSA-N 0.000 description 2
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 2
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 2
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 2
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 2
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 2
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 2
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 2
- SMZCLQGDQMGESY-ACZMJKKPSA-N Asp-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N SMZCLQGDQMGESY-ACZMJKKPSA-N 0.000 description 2
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 2
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 2
- KPNUCOPMVSGRCR-DCAQKATOSA-N Asp-His-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KPNUCOPMVSGRCR-DCAQKATOSA-N 0.000 description 2
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 2
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 2
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 2
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 2
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 2
- 241000228245 Aspergillus niger Species 0.000 description 2
- 241000221198 Basidiomycota Species 0.000 description 2
- 241001453380 Burkholderia Species 0.000 description 2
- 241000222122 Candida albicans Species 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 244000251987 Coprinus macrorhizus Species 0.000 description 2
- 235000001673 Coprinus macrorhizus Nutrition 0.000 description 2
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 2
- DCJNIJAWIRPPBB-CIUDSAMLSA-N Cys-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N DCJNIJAWIRPPBB-CIUDSAMLSA-N 0.000 description 2
- CPTUXCUWQIBZIF-ZLUOBGJFSA-N Cys-Asn-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CPTUXCUWQIBZIF-ZLUOBGJFSA-N 0.000 description 2
- GGIHYKLJUIZYGH-ZLUOBGJFSA-N Cys-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N)C(=O)O GGIHYKLJUIZYGH-ZLUOBGJFSA-N 0.000 description 2
- HIPHJNWPLMUBQQ-ACZMJKKPSA-N Cys-Cys-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O HIPHJNWPLMUBQQ-ACZMJKKPSA-N 0.000 description 2
- ISWAQPWFWKGCAL-ACZMJKKPSA-N Cys-Cys-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISWAQPWFWKGCAL-ACZMJKKPSA-N 0.000 description 2
- KOHBWQDSVCARMI-BWBBJGPYSA-N Cys-Cys-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KOHBWQDSVCARMI-BWBBJGPYSA-N 0.000 description 2
- LHMSYHSAAJOEBL-CIUDSAMLSA-N Cys-Lys-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O LHMSYHSAAJOEBL-CIUDSAMLSA-N 0.000 description 2
- WZJLBUPPZRZNTO-CIUDSAMLSA-N Cys-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N WZJLBUPPZRZNTO-CIUDSAMLSA-N 0.000 description 2
- IQXSTXKVEMRMMB-XAVMHZPKSA-N Cys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N)O IQXSTXKVEMRMMB-XAVMHZPKSA-N 0.000 description 2
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 2
- 241000588921 Enterobacteriaceae Species 0.000 description 2
- 241000588722 Escherichia Species 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- 241000223221 Fusarium oxysporum Species 0.000 description 2
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 2
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 2
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 2
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 2
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 2
- VNCLJDOTEPPBBD-GUBZILKMSA-N Gln-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VNCLJDOTEPPBBD-GUBZILKMSA-N 0.000 description 2
- ZDJZEGYVKANKED-NRPADANISA-N Gln-Cys-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O ZDJZEGYVKANKED-NRPADANISA-N 0.000 description 2
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 2
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 2
- DBNLXHGDGBUCDV-KKUMJFAQSA-N Gln-Phe-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DBNLXHGDGBUCDV-KKUMJFAQSA-N 0.000 description 2
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 2
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 2
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 2
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 2
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 2
- BKRQSECBKKCCKW-HVTMNAMFSA-N Glu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N BKRQSECBKKCCKW-HVTMNAMFSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- GMAGZGCAYLQBKF-NHCYSSNCSA-N Glu-Met-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GMAGZGCAYLQBKF-NHCYSSNCSA-N 0.000 description 2
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 2
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 2
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 2
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 2
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 2
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 2
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 2
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 2
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 2
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 2
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 2
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 2
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 2
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- LBCAQRFTWMMWRR-CIUDSAMLSA-N His-Cys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O LBCAQRFTWMMWRR-CIUDSAMLSA-N 0.000 description 2
- UQTKYYNHMVAOAA-HJPIBITLSA-N His-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N UQTKYYNHMVAOAA-HJPIBITLSA-N 0.000 description 2
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 2
- DQZCEKQPSOBNMJ-NKIYYHGXSA-N His-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DQZCEKQPSOBNMJ-NKIYYHGXSA-N 0.000 description 2
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 2
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 2
- QRTVJGKXFSYJGW-KBIXCLLPSA-N Ile-Glu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N QRTVJGKXFSYJGW-KBIXCLLPSA-N 0.000 description 2
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 2
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 2
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 2
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 2
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 2
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 2
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 2
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 2
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- 241000235058 Komagataella pastoris Species 0.000 description 2
- 241000186809 Kurthia Species 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- 241000186660 Lactobacillus Species 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 2
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 2
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 2
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 2
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 2
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 2
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 2
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 2
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 description 2
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 2
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 2
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 description 2
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 2
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 2
- CRIODIGWCUPXKU-AVGNSLFASA-N Lys-Pro-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O CRIODIGWCUPXKU-AVGNSLFASA-N 0.000 description 2
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 2
- OBVHKUFUDCPZDW-JYJNAYRXSA-N Met-Arg-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OBVHKUFUDCPZDW-JYJNAYRXSA-N 0.000 description 2
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 2
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 2
- 241000191938 Micrococcus luteus Species 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 108010047562 NGR peptide Proteins 0.000 description 2
- 101100386053 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cys-3 gene Proteins 0.000 description 2
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 2
- 241001655308 Nocardiaceae Species 0.000 description 2
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 2
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 2
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 2
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 2
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 2
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 2
- GQLOZEMWEBDEAY-NAKRPEOUSA-N Pro-Cys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GQLOZEMWEBDEAY-NAKRPEOUSA-N 0.000 description 2
- OZAPWFHRPINHND-GUBZILKMSA-N Pro-Cys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OZAPWFHRPINHND-GUBZILKMSA-N 0.000 description 2
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 2
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 2
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 2
- BFXZQMWKTYWGCF-PYJNHQTQSA-N Pro-His-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BFXZQMWKTYWGCF-PYJNHQTQSA-N 0.000 description 2
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 2
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 2
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 2
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 2
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 2
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 2
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 2
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 2
- 241000947836 Pseudomonadaceae Species 0.000 description 2
- 241001633102 Rhizobiaceae Species 0.000 description 2
- 241000607142 Salmonella Species 0.000 description 2
- 241000222481 Schizophyllum commune Species 0.000 description 2
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 2
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 2
- MAWSJXHRLWVJEZ-ACZMJKKPSA-N Ser-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N MAWSJXHRLWVJEZ-ACZMJKKPSA-N 0.000 description 2
- UICKAKRRRBTILH-GUBZILKMSA-N Ser-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N UICKAKRRRBTILH-GUBZILKMSA-N 0.000 description 2
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 2
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 2
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 2
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 2
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 2
- 241000187432 Streptomyces coelicolor Species 0.000 description 2
- 241000204060 Streptomycetaceae Species 0.000 description 2
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 2
- VASYSJHSMSBTDU-LKXGYXEUSA-N Thr-Asn-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O VASYSJHSMSBTDU-LKXGYXEUSA-N 0.000 description 2
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 2
- ODSAPYVQSLDRSR-LKXGYXEUSA-N Thr-Cys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O ODSAPYVQSLDRSR-LKXGYXEUSA-N 0.000 description 2
- UZJDBCHMIQXLOQ-HEIBUPTGSA-N Thr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O UZJDBCHMIQXLOQ-HEIBUPTGSA-N 0.000 description 2
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 2
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 2
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 2
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 2
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 2
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 2
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 2
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 2
- OHDXOXIZXSFCDN-RCWTZXSCSA-N Thr-Met-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OHDXOXIZXSFCDN-RCWTZXSCSA-N 0.000 description 2
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 2
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 2
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 2
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 2
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 2
- TZQWJCGVCIJDMU-HEIBUPTGSA-N Thr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N)O TZQWJCGVCIJDMU-HEIBUPTGSA-N 0.000 description 2
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 2
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 2
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 2
- 241000499912 Trichoderma reesei Species 0.000 description 2
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- ZQGPWORGSNRQLN-NHCYSSNCSA-N Val-Asp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZQGPWORGSNRQLN-NHCYSSNCSA-N 0.000 description 2
- ICFRWCLVYFKHJV-FXQIFTODSA-N Val-Cys-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N ICFRWCLVYFKHJV-FXQIFTODSA-N 0.000 description 2
- XIFAHCUNWWKUDE-DCAQKATOSA-N Val-Cys-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XIFAHCUNWWKUDE-DCAQKATOSA-N 0.000 description 2
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 2
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 2
- OJOMXGVLFKYDKP-QXEWZRGKSA-N Val-Met-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OJOMXGVLFKYDKP-QXEWZRGKSA-N 0.000 description 2
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 2
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 2
- 238000007605 air drying Methods 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- KLOHDWPABZXLGI-YWUHCJSESA-M ampicillin sodium Chemical compound [Na+].C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C([O-])=O)(C)C)=CC=CC=C1 KLOHDWPABZXLGI-YWUHCJSESA-M 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010066988 asparaginyl-alanyl-glycyl-alanine Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 229940095731 candida albicans Drugs 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 239000013599 cloning vector Substances 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 238000000502 dialysis Methods 0.000 description 2
- 108010009297 diglycyl-histidine Proteins 0.000 description 2
- 239000012636 effector Substances 0.000 description 2
- 238000013467 fragmentation Methods 0.000 description 2
- 238000006062 fragmentation reaction Methods 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 150000002460 imidazoles Chemical class 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 108010009932 leucyl-alanyl-glycyl-valine Proteins 0.000 description 2
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 101150001852 mrdB gene Proteins 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 101150042737 rodA gene Proteins 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 239000001632 sodium acetate Substances 0.000 description 2
- 229960004249 sodium acetate Drugs 0.000 description 2
- 235000017281 sodium acetate Nutrition 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- 102000001049 Amyloid Human genes 0.000 description 1
- 108010094108 Amyloid Proteins 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 240000006439 Aspergillus oryzae Species 0.000 description 1
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 241000206594 Carnobacterium Species 0.000 description 1
- 208000036086 Chromosome Duplication Diseases 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 241000186031 Corynebacteriaceae Species 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- 239000004278 EU approved seasoning Substances 0.000 description 1
- 101000925662 Enterobacteria phage PRD1 Endolysin Proteins 0.000 description 1
- 108010013369 Enteropeptidase Proteins 0.000 description 1
- 102100029727 Enteropeptidase Human genes 0.000 description 1
- 101100502848 Escherichia coli (strain K12) fkpB gene Proteins 0.000 description 1
- 101100287343 Escherichia coli (strain K12) ispH gene Proteins 0.000 description 1
- 108010074860 Factor Xa Proteins 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 1
- 102100031181 Glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 1
- 241001074968 Halobacteria Species 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- MFGOTAHWOBKNNU-XMHGGMMESA-N Isodigeranyl Chemical group CC(C)=CCC\C(C)=C\CC(C)(C=C)CCC=C(C)C MFGOTAHWOBKNNU-XMHGGMMESA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 241001074893 Methanococci Species 0.000 description 1
- 241000205276 Methanosarcina Species 0.000 description 1
- 101100412856 Mus musculus Rhod gene Proteins 0.000 description 1
- 241000187654 Nocardia Species 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 241000320412 Ogataea angusta Species 0.000 description 1
- 241000520272 Pantoea Species 0.000 description 1
- 101100502849 Pseudomonas fluorescens yaaD gene Proteins 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 241000316848 Rhodococcus <scale insect> Species 0.000 description 1
- 241000190932 Rhodopseudomonas Species 0.000 description 1
- 101100434411 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) ADH1 gene Proteins 0.000 description 1
- 108010034546 Serratia marcescens nuclease Proteins 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 241001655322 Streptomycetales Species 0.000 description 1
- 101100242191 Tetraodon nigroviridis rho gene Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 101150102866 adc1 gene Proteins 0.000 description 1
- 230000006229 amino acid addition Effects 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000004927 clay Substances 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 239000000287 crude extract Substances 0.000 description 1
- 238000002425 crystallisation Methods 0.000 description 1
- 230000008025 crystallization Effects 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 238000012407 engineering method Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 235000011194 food seasoning agent Nutrition 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 244000053095 fungal pathogen Species 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 230000005661 hydrophobic surface Effects 0.000 description 1
- 210000003000 inclusion body Anatomy 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 238000009776 industrial production Methods 0.000 description 1
- 230000008595 infiltration Effects 0.000 description 1
- 238000001764 infiltration Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- RBTARNINKXHZNM-UHFFFAOYSA-K iron trichloride Chemical compound Cl[Fe](Cl)Cl RBTARNINKXHZNM-UHFFFAOYSA-K 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- LFEUVBZXUFMACD-UHFFFAOYSA-H lead(2+);trioxido(oxo)-$l^{5}-arsane Chemical compound [Pb+2].[Pb+2].[Pb+2].[O-][As]([O-])([O-])=O.[O-][As]([O-])([O-])=O LFEUVBZXUFMACD-UHFFFAOYSA-H 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 159000000003 magnesium salts Chemical class 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 150000002696 manganese Chemical class 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 239000010445 mica Substances 0.000 description 1
- 229910052618 mica group Inorganic materials 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 229910052759 nickel Inorganic materials 0.000 description 1
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 1
- 125000001477 organic nitrogen group Chemical group 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 238000002161 passivation Methods 0.000 description 1
- 101150054557 pdxS gene Proteins 0.000 description 1
- 101150111339 pdxT gene Proteins 0.000 description 1
- 239000002304 perfume Substances 0.000 description 1
- 239000000049 pigment Substances 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 239000003910 polypeptide antibiotic agent Substances 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 201000005484 prostate carcinoma in situ Diseases 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 239000000018 receptor agonist Substances 0.000 description 1
- 229940044601 receptor agonist Drugs 0.000 description 1
- 239000002464 receptor antagonist Substances 0.000 description 1
- 229940044551 receptor antagonist Drugs 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000003014 reinforcing effect Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000010008 shearing Methods 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- WROMPOXWARCANT-UHFFFAOYSA-N tfa trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F.OC(=O)C(F)(F)F WROMPOXWARCANT-UHFFFAOYSA-N 0.000 description 1
- 239000011573 trace mineral Substances 0.000 description 1
- 235000013619 trace mineral Nutrition 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 238000002604 ultrasonography Methods 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/37—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biochemistry (AREA)
- Genetics & Genomics (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Gastroenterology & Hepatology (AREA)
- Mycology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Surface Treatment Of Glass (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
本发明涉及通用结构式(I)的多肽Xn-C1-X1-50-C2-X0-5-C3-Xp-C4-X1-100-C5-X1-50-C6-X0-5-C7-X1-50-C8-Xm(I),及其生产和用途。
Description
技术领域
本发明涉及新型疏水蛋白融合蛋白、其生产和用途。
背景技术
疏水蛋白是约100个氨基酸的小蛋白质,其为丝状真菌的特征蛋白质并且不存在于其它生物体中。最近,在天蓝色链霉菌(Streptomyces coelicolor)中发现疏水蛋白类蛋白质,称为“Chaplins”,而且其同样具有高表面活性的性质。Chaplins可以在水-空气界面组装形成淀粉样原纤维(Classen等人2003 Genes Dev 1714-1726;Elliot等人2003,Genes Dev.17,1727-1740)。
疏水蛋白以不溶于水的形式分布于多种真菌结构,例如气生菌丝、孢子、子实体的表面。疏水蛋白的基因从子囊菌(ascomycetes)、半知菌(deuteromycetes)和担子菌(basidiomycetes)中分离而来。某些真菌含有多于一个的疏水蛋白基因,例如普通裂褶菌(Schizophyllum commune)、灰盖鬼伞(Coprinus cinereus)、构巢曲霉(Aspergillus nidulans)。明显地,多种疏水蛋白涉及真菌发育的不同阶段。推测所述疏水蛋白负责不同的功能(vanWetter等人,2000,Mol.Mierobiol.,36,201-210;Kershaw等人1998,Fungal Genet.Biol,1998,23,18-33)。
除了为产生气生菌丝减少水的表面张力以外,疏水蛋白的生物学功能还有使孢子疏水化(Wsten等人1999,Curr.Biol.,19,1985-88;Bell等人1992,Genes Dev.,6,2382-2394)。此外,疏水蛋白被用于内衬地衣子实体中的气体通道,并且作为真菌病原体辨别植物表面系统的成分(Lugones等人1999,Mycol.Res.,103,635-640;Hamer和Talbot,1998,Curr.Opinion Microbiol.,卷1,693-697)。
互补实验证明单个类型中的疏水蛋白在某种程度上可以在功能上相互取代。
先前公开的疏水蛋白只能以中等产量制备,并且用常用的蛋白质化学纯化和分离方法纯化。目前在遗传方法的辅助下供应更大量的疏水蛋白的尝试尚未成功。
发明主题
本发明的主题是提供新的疏水蛋白及其生产方法,所述方法使能够经济地生产疏水蛋白并在多种技术领域使用。
发明详述
本发明涉及通用结构式(I)的多肽
Xn-C1-X1-50-C2-X0-5-C3-Xp-C4-X1-100-C5-X1-50-C6-X0-5-C7-X1-50-C8-Xm (I)
其中X可以是任何20种天然存在的氨基酸(苯丙氨酸Phe、亮氨酸Leu、丝氨酸Ser、酪氨酸Tyr、半胱氨酸Cys、色氨酸Trp、脯氨酸Pro、组氨酸His、谷氨酰胺Gln、精氨酸Arg、异亮氨酸Ile、甲硫氨酸Met、苏氨酸Thr、天冬酰胺Asn、赖氨酸Lys、缬氨酸Val、丙氨酸Ala、天冬氨酸Asp、谷氨酸Glu、甘氨酸Gly),且在X的指数表示氨基酸数,指数n和m是0-500之间,优选在15-300之间的数,p是1-250之间,优选在1-100之间的数,并且C是半胱氨酸、丙氨酸、丝氨酸、甘氨酸、甲硫氨酸或苏氨酸,其中至少4个由C指定的残基为半胱氨酸,附带条件是缩写为Xn 或Xm或Xp的肽序列的至少一个是长度至少为20个氨基酸的非天然连接于疏水蛋白的肽序列,该多肽在包被玻璃表面之后使接触角改变至少20°。
C1到C8指定的氨基酸优选为半胱氨酸,但是它们也可能被其它相似的大氨基酸(优选丙氨酸、丝氨酸、苏氨酸、甲硫氨酸或甘氨酸)所代替。然而,C1到C8位置的至少4个、优选至少5个、尤其优选至少6个且尤其至少7个应该包含半胱氨酸。本发明蛋白质中的半胱氨酸可以是还原形 式或彼此形成二硫键。尤其优选形成分子内C-C桥,特别是具有至少1个、优选2个、尤其优选3个和特别尤其优选4个分子内二硫键的那些。当如上所述用相似的大氨基酸代替半胱氨酸时,那些彼此之间形成分子内二硫键的C位置的配对最好也被代替。
如果在以X指定的位置也使用半胱氨酸、丝氨酸、丙氨酸、甘氨酸、甲硫氨酸或苏氨酸,可以相应改变通式中单个半胱氨酸位点的数量。
尤其有利的多肽是那些具有通式(II)的多肽
Xn-C1-X3-25-C2-X0-2-C3-X5-50-C4-X2-35-C5-X2-15-C6-X0-2-C7-X3-35-C8-Xm (II)
其中X可以是任何20种天然存在的氨基酸(苯丙氨酸Phe、亮氨酸Leu、丝氨酸Ser、酪氨酸Tyr、半胱氨酸Cys、色氨酸Trp、脯氨酸Pro、组氨酸His、谷氨酰胺Gln、精氨酸Arg、异亮氨酸Ile、甲硫氨酸Met、苏氨酸Thr、天冬酰胺Asn、赖氨酸Lys、缬氨酸Val、丙氨酸Ala、天冬氨酸Asp、谷氨酸Glu、甘氨酸Gly),在X的指数表示氨基酸数,指数n和m是2-300之间的数,并且C是半胱氨酸、丙氨酸、丝氨酸、甘氨酸、甲硫氨酸或苏氨酸,其中至少4个由C指定的残基为半胱氨酸,附带条件是缩写为Xn或Xm的肽序列的至少一个是长度至少为35个氨基酸的非天然连接疏水蛋白的肽序列,该多肽在包被玻璃表面之后使接触角改变至少20°。
特别尤其有利的是那些具有通式(III)的多肽
Xn-C1-X5-9-C2-C3-X11-39-C4-X2-23-C5-X5-9-C6-C7-X6-18-C8-Xm (III)
其中X可以是任何20种天然存在的氨基酸(苯丙氨酸Phe、亮氨酸Leu、丝氨酸Ser、酪氨酸Tyr、半胱氨酸Cys、色氨酸Trp、脯氨酸Pro、组氨酸His、谷氨酰胺Gln、精氨酸Arg、异亮氨酸Ile、甲硫氨酸Met、苏氨酸Thr、天冬酰胺Asn、赖氨酸Lys、缬氨酸Val、丙氨酸Ala、天冬氨酸Asp、谷氨酸Glu、甘氨酸Gly),在X的指数表示氨基酸数,指数n和m是0-200之间的数,并且C是半胱氨酸、丙氨酸、丝氨酸、甘氨酸、甲硫氨酸或苏氨酸,其中至少6个由C指定的残基为半胱氨酸,附带条件是缩写为Xn或Xm的肽序列的至少一个是长度至少为40个氨基酸的非天然连接疏水蛋白的肽序列,该多肽在包被玻璃表面之后使接触角改变至少20°。
所述本发明的优选实施方案是具有通用结构式(I)、(II)或(III)的多肽,该结构式包含至少一种I类疏水蛋白,优选至少一个dewA、rodA、hypA、hypB、sc3、basf1、basf2疏水蛋白,或其部分或衍生物。下文的序列表对所述疏水蛋白进行了结构表征。也可能将多个(优选2个或3个)相同或不同结构的疏水蛋白彼此连接,并且连接于并非天然连接疏水蛋白的相应适宜多肽序列上。
本发明尤其优选的实施方案是具有SEQ ID NO:20、22、24所示的多肽序列的新蛋白质,及其编码核酸序列,具体是SEQ ID NO:19、21、23中定义的序列。尤其优选的实施方案还包括由SEQ ID NO:20、22或24所示多肽序列起始,通过取代、插入或缺失至少1个至多10个、优选5个、更优选所有氨基酸的5%而产生的蛋白质,所述蛋白质仍然具有起始蛋白质至少50%的生物特性。此处蛋白质的生物学特性指上述接触角的改变,如实施例10中所述。
本发明的蛋白质中至少一个缩写为Xn或Xm或Xp位置的多肽序列不是天然连接于疏水蛋白的,该多肽序列包含至少20个、优选至少35个、尤其优选至少50个、和尤其至少100个氨基酸(也指下文中的融合配偶体)。这是为了说明所述蛋白质可能由自然条件下并不以这种形式共同存在的疏水蛋白部分和融合配偶体部分组成。
融合配偶体部分可能选自多种蛋白质。对于多数融合配偶体,其也可能连接于一个疏水蛋白部分,例如连接于所述疏水蛋白部分的氨基端(Xn)和羧基端(Xm)或中间(Xp)。然而,也可能例如两个融合配偶体连接于本发明蛋白质的单个位置(Xn或Xm)。
尤其优选的融合配偶体部分是能够使本发明的蛋白质包被玻璃表面的多肽序列,并且其引起蛋白质处理的玻璃表面对去污剂处理具有抗性,这在实验部分(实施例10)中有详细描述(例如1%SDS在80℃处理10分钟)。
尤其合适的融合配偶体部分是在微生物、特别是大肠杆菌(E.coli)或枯草芽孢杆菌(Bacillus subtilis)中天然存在的多肽。此类融合配偶体的实例是序列yaad(SEQ ID NO:15和16)、yaae(SEQ ID NO:17和18)和硫氧还蛋 白。同样高度适用的是所述序列的片段或衍生物,其只含有所述序列的部分、优选10-90%、尤其优选25-75%。此处优选C末端缺失体,例如仅由N-末端前75个氨基酸组成的yaad片段,或其中与所述序列相比有单个氨基酸或核苷酸改变的序列。例如可以在yaad和yaae序列的C-末端附加另外的氨基酸,特别是2个另外的氨基酸(优选精氨酸、丝氨酸)。也可能优选与天然存在的序列相比,插入到yaae序列中另外的氨基酸,例如SEQ IDNO:17和18中的第2号氨基酸(甘氨酸)。
其它融合配偶体、尤其是那些在通式(I)的Xp位置的融合配偶体的实例是酶活性结构域、抗微生物结构域、作为受体兴奋剂/拮抗剂的多肽序列、色素、调味剂和芳香剂、金属结合结构域。还可能产生与多种活性化合物和效应子共价结合的特异性偶联位点。例如,在该环中插入另外的赖氨酸,以便在技术人员所公知的异双功能(heterobifunctional)连接的帮助下,通过伯氨基基团(primary amino group)使活性化合物和效应子特异的连接于疏水蛋白分子主链。
此外,也可能在两个融合配偶体的连结点处插入另外的氨基酸,导致在核酸水平上新产生或钝化限制性内切酶识别位点。
本发明蛋白质的多肽序列还可能通过例如糖基化、乙酰化或通过其它与例如戊二醛化学交联的方法加以修饰。
本发明蛋白质的一个特性是当用所述蛋白质包被表面时改变表面的性质。实验上是通过在以蛋白质包被表面之前和之后测量水滴接触角,并确定两次测量的差别来确定所述表面性质的改变。
用于测量接触角的精确实验条件在实施例10中的实验部分说明。在这些条件下,本发明的蛋白质具有使接触角至少增大20°,优选25°,尤其优选30°的性质。
目前已知的疏水蛋白中,疏水蛋白部分的极性和非极性氨基酸的位置是保守的,导致特征性的疏水性图谱。根据生物物理学性质和疏水性的差异将目前已知的疏水蛋白分为两类,I类和II类(Wessels等人1994,Ann.Rev.Phytopathol.,32,413-437)。
I类疏水蛋白组装的膜高度不可溶(即使是升高温度在1%的十二烷基硫酸钠(SDS)中也是如此),并且只能通过浓缩的三氟乙酸(TFA)或甲酸而再次解离。与之相反,II类疏水蛋白的组装形式较不稳定。它们通过60%浓度的乙醇或1%SDS(在室温)即可再次解离。这种对于溶剂和去污剂的高稳定性是疏水蛋白的特殊性质,并且其将本发明的多肽包被与表面上多种蛋白质形成的“非特异性”蛋白质包被区分开来。
氨基酸序列的比较揭示,在II类疏水蛋白中C3和C4半胱氨酸之间区域的长度明显小于I类疏水蛋白。
此外,II类疏水蛋白比I类疏水蛋白具有更多带电荷的氨基酸。
本发明还涉及生产本发明蛋白质的方法。这些多肽可以通过肽合成的成熟技术化学制备,例如通过Merrifield固相合成。
然而,尤其有用的是基因工程方法,其中两个分别编码融合配偶体和疏水蛋白部分的核酸序列(优选DNA序列)组合,从而通过组合核酸序列的基因表达在宿主生物体中产生所希望的蛋白质。
此处用于所述构建方法的合适的宿主生物体(生产生物体)可以是原核生物(包括古细菌(Archaea))或真核生物,尤其是细菌包括嗜盐菌(halobacteria)和甲烷球菌(methanococci),真菌,昆虫细胞,植物细胞和哺乳动物细胞,尤其优选大肠杆菌、枯草芽孢杆菌、巨大芽孢杆菌(Bacillusmegaterium)、米曲霉(Aspergillus oryzea)、构巢曲霉(Aspergillus nidulans)、黑曲霉(Aspergillus niger)、巴斯德毕赤酵母(Pichia pastoris)、假单胞菌属物种(Pseudomonas spec.)、乳酸杆菌(lactobacilli)、多形汉逊酵母(Hansenulapolymorpha)、里氏木霉(Trichoderma reesei)、SF9(或相关细胞)等等。
本发明还涉及表达构建体的用途,所述表达构建体包含在调控核酸序列的遗传控制下编码本发明的多肽的核酸序列,并且本发明也涉及包含至少一个所述表达构建体的载体。
本发明的构建体优选包含特定编码序列的5’上游启动子和3’下游终止序列,适用的情况下还包含各自与所述编码序列有效连接的其他常用调控元件。
“有效连接”指启动子、编码序列、终止子和适用情况下的其他调控元件的有序排列,从而使每个调控元件能够实现其表达编码序列所需的功能。
有效连接序列的实例为寻靶序列以及增强子、多腺苷酸化信号等等。其它调控元件包含可选标记、扩增信号、复制起点等等。合适的调控元件在例如Goeddel,基因表达技术:酶学方法185,学术出版社,San Diego,CA(1990)中描述。
除这些调控序列以外,可能在实际结构基因的上游仍存在对这些序列的天然调控,且适用的情况下可对之进行遗传修饰,从而通过关闭天然调控而增强基因表达。
优选的核酸构建体最好还包含一个或多个与启动子功能性连接的上述增强子序列,以增强核酸序列的表达。其他有利的序列,例如其他的调控元件或终止子也可插入到DNA序列的3’末端。
在构建体中可能具有本发明的核酸的一个或多个拷贝。如果为筛选所述构建体所需,构建体也可能包含其他标记,例如抗生素抗性或营养缺陷型补偿基因。
例如,对本发明方法有利的调控序列存在于启动子如coS、tac、trp、tet、trp-tet、lpp、lac、lpp-lac、lacIq-T7、T5、T3、gal、trc、ara、rhaP(rhaPBAD)SP6、λ-PR启动子或λ-P启动子中,这些启动子最好在革兰氏阴性菌中使用。其他有利的调控序列存在于例如革兰氏阳性启动子amy和SP02,和酵母或真菌启动子ADC1、MFα、AC、P-60、CYC1、GAPDH、TEF、rp28、ADH。
也可以使用人工启动子进行调控。
为了在宿主生物体中表达,最好将核酸构建体插入例如能够使基因在宿主细胞中最优表达的载体(例如质粒或噬菌体)。除质粒和噬菌体以外,载体也指技术人员所公知的任何其它载体,即例如病毒(如SV40、CMV、杆状病毒和腺病毒),转座子,IS元件,噬粒,粘粒,和线性或环状DNA,以及农杆菌系统。
这些载体可以在宿主生物体中自主复制或随染色体复制。这些载体构成本发明的其他实施方案。合适质粒的实例是大肠杆菌中的pLG338、pACYC184、pBR322、pUC18、pUC19、pKC30、pRep4、pHS1、pKK223-3、pDHE19.2、pHS2、pPLc236、pMBL24、pLG200、pUR290、pIN-III”3-B1、tgt11或pBdCl,链霉菌(Streptomyces)的pIJ101、pIJ364、pIJ702或pIJ361,杆菌的pUB110、pC194或pBD214,棒状杆菌(Corynebacterium)的pSA77或pAJ667,真菌的pALS1、pIL2或pBB116,酵母中的2α、pAG-1、YEp6、YEp13或pEMBLYe23,或植物中的pLGV23、pGHlac+、pBIN19、pAK2004或pDH51。所述质粒是可能质粒的一小部分选择。其它质粒是技术人员所公知的,并且能够在例如《克隆载体)》(Eds.Pouwels P.H.等人Elsevier,Amsterdam-New York-Oxford,1985,ISBN 0 444 904018)中找到。
为了表达其它存在的基因,核酸构建体也最好包含3’-末端和/或5’-末端调控序列来增强表达,根据宿主生物和基因或所选基因对所述序列加以选择,以达到最佳表达。
这些调控序列旨在使基因和蛋白质特异性表达。这意味着取决于宿主生物体,例如基因只在诱导后表达或过表达,或者立即表达和/或过表达。
在这点上,优选引入的调控序列或因子具有正面影响,并且增强其被引入的基因的表达。因此,使用强转录信号(例如启动子和/或增强子)在转录水平增强调控元件是有利的。然而,除此之外,也可能通过例如改良mRNA稳定性来增强翻译。
在载体的另一实施方案中,包含本发明核酸构建体或本发明核酸的载体也可能以线性DNA的形式被有利地引入微生物中,并通过异源或同源重组的方式整合到宿主生物体的基因组中。此线性DNA可能由线性化载体(例如质粒)组成,或仅由本发明的核酸构建体或核酸组成。
为实现生物中异源基因的最佳表达,最好根据生物中特定的密码子使用来修饰核酸序列。通过计算机分析所研究生物的其他已知基因,可以容易的确定密码子的使用。
将合适的启动子与合适的编码核苷酸序列和终止信号或多聚腺苷酸信 号融合来制备表达盒。为此使用的常规重组和克隆技术见于例如T.Maniatis,E.F.Fritsch和J.Sambrook,分子克隆:实验室操作手册,ColdSpring Harbor Laboratory,Cold Spring Harbor,NY(1989)和T.J.Silhavy,M.L.Berman和L.W.Enquist,基因融合实验,Cold SpringHarbor Laboratory,Cold Spring Harbor,NY(1984)以及在Ausubel,F.M.等人,现代分子生物学实验方法,Greene Publishing Assoc.和WileyInterscience(1987)中的描述。
为了在合适宿主生物体中表达,重组核酸构建体或基因构建体最好插入宿主特异的载体,该载体能够使基因在宿主中最优表达。载体是技术人员所熟知的,并且能够在例如“克隆载体”(Pouwels P.H.等人,编辑,Elsevier,Amsterdam-New York-Oxford,1985)中找到。
可以利用本发明的载体制备例如由至少一个本发明的载体转化的重组微生物,并且其可用于生产本发明所用多肽。最好将本发明的上述重组构建体引入合适的宿主系统并表达之。在此情况下,优选使用技术人员熟知的常用克隆和转染方法,例如共沉淀、原生质体融合、电穿孔法、逆转录病毒转染等等,以引起所述核酸在特定表达系统中表达。合适的系统在例如现代分子生物学实验方法,F.Ausubel等人编辑,Wiley Interscience,New York 1997,或Sambrook等人,分子克隆:实验室操作手册,第二版,Cold Spring Harbor Laboratory,Cold Spring Harbor Laboratory Press,Cold Spring Harbor,NY,1989中描述。
也可以根据本发明制备同源重组微生物。为此,可制备包含至少一部分本发明的基因或编码序列的载体,适用的情况下,所述编码序列中引入至少一个氨基酸缺失、氨基酸添加或氨基酸取代,从而修饰(例如功能性破坏)本发明的序列(敲除载体)。引入的序列可为例如相关微生物的同源物,或由哺乳动物、酵母或昆虫来源衍生而来。或者,可设计用于同源重组的载体,从而使内源性基因在同源重组时发生突变或其他改变,但仍然编码功能性蛋白质(例如改变上游调控区域由此改变内源性蛋白质的表达)。本发明所用的基因的改变的部分位于同源重组载体中。适合同源重组的载体 的构建在例如Thomas,K.R.和Capecchi,M.R.(1987)Cell 51:503中描述。
适合本发明的核酸或核酸构建体的重组宿主生物体原则上是任何原核或真核生物体。最好使用微生物例如细菌、真菌或酵母作为宿主生物体。革兰氏阳性或革兰氏阴性细菌,优选肠杆菌科(Enterobacteriaceae)、假单胞菌科(Pseudomonadaceae)、根瘤菌科(Rhizobiaceae)、链霉菌科(Streptomycetaceae)或诺卡氏菌科(Nocardiaceae)的细菌,特别优选使用的细菌为埃希氏菌属(Escherichia)、假单胞菌属、链酶菌属、诺卡氏菌属、伯克霍尔德菌属(Burkholderia)、沙门氏菌属(Salmonella)、农杆菌属或红球菌属(Rhodococcus)的细菌。
依据宿主生物体,按技术人员公知的方法生长或培养本发明方法中使用的生物体。微生物通常生长在液体培养基中,其中包含碳源(通常以糖的形式)、氮源(通常以有机氮源的形式例如酵母提取物,或例如硫酸铵等盐类)、微量元素(例如铁盐、锰盐和镁盐)、以及适用情况下的维生素,生长温度在0℃到100℃之间,优选10℃到60℃,同时供以氧气。此种情况培养液pH值可以维持或不维持在固定值,即在培养过程中可以调节或不调节。可以进行分批发酵、半分批发酵或连续培养。可以在发酵最初开始时引入营养素,或以半连续或连续方式进行随后加料。可以通过实施例中所描述的方法从生物体中分离酶,或者在反应中使用酶粗提物。
可以用重组方法制备本发明的多肽或其功能生物活性片段,通过培养产生多肽的微生物,适用的情况下诱导表达多肽,而且从培养液中分离所述多肽。如果需要,也可以通过此种方法在工业上生产多肽。可以通过已知方法培养和发酵重组微生物。例如细菌可以在20-40℃,pH6-9条件下,在TB培养基或LB培养基中繁殖。合适的培养条件例如在T.Maniatis,E.F.Fritsch和J.Sambrook,分子克隆:实验室操作手册,Cold Spring HarborLaboratory,Cold Spring Harbor,NY(1989)中详细描述。
如果本发明使用的多肽并非分泌至培养基中,那么就要破坏细胞,通过已知的多肽分离方法从溶菌产物中获得产品。正如所预期的,通过高频超声、高压(例如在弗氏压碎器中)、渗透、使用去污剂、水解酶或有机溶 剂、匀浆等手段,或通过所列出的方法中的多个的组合来破坏细胞。
多肽可以用已知的层析方法纯化,例如分子筛层析(凝胶过滤法),例如Q琼脂糖层析,离子交换层析和疏水层析,以及使用其它常用方法,例如超滤、结晶、盐析、透析和非变性凝胶电泳。合适的方法在例如Cooper,F.G.,Biochemische Arbeitsmethoden,[original title:The tools ofbiochemistry],Verlag Walter de Gruyter,Berlin,New York or in Scopes,R.,蛋白质纯化,Springer Verlag,New York,Heidelberg,Berlin中描述。
最好通过使用载体系统或寡核苷酸分离重组蛋白质,所述寡核苷酸以特定核苷酸序列延伸cDNA,从而编码例如有利于纯化的改变的多肽或融合蛋白质。此类合适的修饰的实例是作为锚起作用的“标签”,例如六组氨酸锚的修饰、或能够作为抗原被抗体识别的表位(例如在Harlow,E.和Lane,D.,1988,抗体:实验室操作手册,Cold Spring Harbor(N.Y.)Press中描述)。其它合适的标签是例如HA、钙调节蛋白-BD、GST、MBD;几丁质-BD、链霉亲和素-BD-avi-标签、Flag-标签、T7等等。这些锚可以用于将蛋白质附着在固体载体上,例如聚合体基质,例如其可填充到层析柱中,或用于微孔滴定板或其它载体。相应的纯化方法可以从商品化的亲和标签供应商那里获得。
在显微镜下可以观察到许多疏水蛋白包被的真菌表面(孢子、子实体、菌丝),其特征结构称为“小棒(rodlets)”。类似的厚度约为10nm的小棒也可以在疏水蛋白包被的亲水表面检测到(例如玻璃、云母等)(Wsten等人,1993,Plant Cell,5,1567-1574)。
由于包被表面的疏水蛋白的特殊性质(例如抗去污剂,例如1%浓度的SDS溶液),这些蛋白质具有用于许多工业应用的极大潜能。多种专利文件涉及此类应用的实例,此处引用其中关于疏水蛋白的应用。
(申请)号 优先权 申请人
WO 03/10331 | 07.23.2001 | Nanosystems B.V.申请 |
WO 04/00880 | 06.21.2002 | Nanosystems B.V.申请 |
WO 03/84508 | 04.04.2002 | Nanosystems B.V.申请 |
WO 00/40968 | 01.05.1999 | Unilever N.V.(Hindustan Lever Ltd.) |
EP-B 1 252 516 | 02.04.2000 | Nanosystems B.V. Stichting voor de Technische Wetenschappen申请 |
EP-B 1257 571 | 02.04.2000 | Nanosystems B.V.申请 |
WO 01/57066 | 02.04.2000 | Nanosystems B.V.申请 |
WO 03/10331 | 07.23.2001 | Nanosystems B.V.申请 |
WO 03/53383 | 12.14.2001 | L’Oreal |
疏水蛋白(尤其是I类疏水蛋白)在工业上的利用至今还不成功,这是由于缺乏有效的生产和纯化的方法。先前描述的从天然来源(孢子、真菌菌丝等等)提取的方法只能产生毫克级的材料量(例如WO 96/41882)。
通过重组生产多种生物生产者的途径同样被证明非常复杂,并且不是很令人满意。
本发明的疏水蛋白,其融合形式(即与融合配偶体部分一起)以及分离的形式两者都具有所希望的疏水蛋白性质。因此本发明的蛋白质既可能直接用作融合蛋白,又可以在剪切后除去融合配偶体而用作“纯化的”疏水蛋白。
当需要移除融合配偶体时,推荐在融合蛋白中疏水蛋白部分和融合配偶体部分之间掺入潜在的切割位点(蛋白酶的特定识别位点)。尤其合适的切割位点包括通过生物信息学工具能够确定的那些否则将既不存在于疏水蛋白部分也不存在于融合配偶体部分中的肽序列。尤其有用的是例如在甲硫氨酸的BrCN切割或蛋白酶介导的Xa因子切割、肠激酶切割、凝血酶、TEV切割(烟草蚀纹病毒蛋白酶)。
实施例
实施例1
克隆yaad-His6/yaaE-His6的准备工作
用寡核苷酸Hal570和Hal571(Hal 572/Hal 573)进行多聚酶链式反应。所使用的模板DNA是枯草芽孢杆菌基因组DNA。获得的PCR片段包含枯草芽孢杆菌yaaD/yaaE基因的编码序列,和在其末端的NcoI和BglII限制性切割位点。纯化PCR片段,并用限制性内切酶NcoI和BgHI切割。此DNA片段用作插入片段克隆到Qiagen pQE60载体中,该载体预先用限制性内切酶NcoI和BglII线性化。这样获得的载体pQE60YAAD#2/pQE60YaaE#5可以分别用于表达由YAAD::HIS6和YAAE::HIS6组成的蛋白质。
Hal570:gcgcgcccatggctcaaacaggtactga
Hal571:gcagatctccagccgcgttcttgcatac
Hal572:ggccatgggattaacaataggtgtactagg
Hal573:gcagatcttacaagtgccttttgcttatattcc
实施例2a
yaad-疏水蛋白DewA-His6的克隆
用寡核苷酸KaM416和KaM417进行多聚酶链式反应。所使用的模板DNA是构巢霉菌基因组DNA。获得的PCR片段包含疏水蛋白基因dewA的编码序列,和N-末端因子Xa蛋白酶切割位点。纯化PCR片段,并用限制性内切酶BamHI切割。此DNA片段用作插入片段克隆到载体pQE60YAAD#2中,该载体预先用限制性内切酶BglII线性化。
这样获得的载体#508用于表达由YAAD::Xa::dewA::HIS6组成的融合蛋白质。
KaM416:
GCAGCCCATCAGGGATCCCTCAGCCTTGGTACCAGCGC
KaM417:
CCCGTAGCTAGTGGATCCATTGAAGGCCGCATGAAGTTCTCCGT
CTCCGC
实施例2b
具有抗菌性质的嵌合体融合蛋白质的制备
为了引入具有抗菌性质的肽序列,遵循以下克隆策略:
从我们的Yaad-DewA表达质粒“pQE60 Yaad dewA His”开始,通过半胱氨酸3和4之间的融合PCR将抗菌肽序列插入。
图2
1a)PCR区域:YaaD-dewA到半胱氨酸3,包括T7 novispirin突出端
模板:pQE60 YaaD dewA 6His
引物:引物1
(AATTAACCATGGCTCAAACA)20-mer
引物2
(GCCATATTTTTTAATAATATGAATAATTTTACGGGTAATACGACGC
AGGTTTTTGCAGCAAGCGATCGAGCCGA)74-mer
PCR条件:55℃,1118bp
1b)PCR区域:novispirin突出端到6His/终止
模板:pQE60 YaaD dewA 6His
引物:引物3
(ATATTATTAAAAAATATGGCAACTCCCCCGCTGAGACCAA)40-mer
引物4
(CTAATTAAGCTTAGTGATGGT)21-mer
PCR条件:54℃,306bp
2)退火PCR
将PCR 1a和1b产物50pmol组合,用Pfu-聚合酶进行退火PCR(95℃1分钟,72℃5分钟-10个循环)
随后加入外部引物(outside primers),进行通常的35个循环的PCR。
引物:引物1
(AATTAACCATGGCTCAAACA)20-mer
引物4
(CTAATTAAGCTTAGTGATGGT)21-mer
PCR条件:53℃,1404bp
3)连接
载体:pQE60 YaaD dewA 6His
用NcoI/KpnI消化
制备凝胶,分离3428bp的条带
插入片段:退火PCR的产物
用NcoI/KpnI消化
制备凝胶,分离1350bp的条带
4)转化XL10/TG10化学感受态细胞(chemocomp.cells)
图3
pQE60 YaaD dewA Cys3 G10 novispirin
1a)PCR区域:YaaD-dewA到Cys3,包括G10 novispirin突出端
模板:pQE60 YaaD dewA 6His
引物:引物1
(AATTAACCATGGCTCAAACA)20-mer
引物5
(GCCATATTTTTTAATAATATGAATGCCTTTACGAATAATACGACGC
AGGTTTTTGCAGCAAGCGATCGAGCCGA)74-mer
PCR条件:55℃,1118bp
1b)PCR区域:novispirin突出端到6His/终止
图4
模板:pQE60 YaaD dewA 6His
引物:引物6
(ATATTATTAAAAAATATGGCAACTCCCCCGCTGAGACCAA)40-mer
引物4
(CTAATTAAGCTTAGTGATGGT)21-mer
PCR条件:54℃,306bp
2)退火PCR
将PCR 1a和1b产物50pmol组合,用Pfu-聚合酶进行退火PCR(95℃1分钟,72℃5分钟-10个循环)
随后加入外部引物,进行普通的35个循环的PCR。
引物:引物1
(AATTAACCATGGCTCAAACA)20-mer
引物4
(CTAATTAAGCTTAGTGATGGT)21-mer
PCR条件:53℃,1404bp
3)连接
载体:pQE60 YaaD dewA 6His
用NcoI/KpnI消化
制备凝胶,分离3428bp的条带
插入片段:退火PCR的产物
用NcoI/KpnI消化
制备凝胶,分离1350bp的条带
4)转化XL10/TG10化学感受态细胞
图5
蛋白质的纯化和评估与实施例8-10类似。
可以测定抗菌性质:
抗菌作用的测定在6-孔微量滴定板中进行,该滴定板填充相应生长所需的琼脂(LB、YM)。然后将平板培养过夜。
在测定中使用的微生物如下:
枯草芽孢杆菌
巨大芽胞杆菌
大肠杆菌XL1 Blue MR
藤黄微球菌(Micrococcus luteus)
泛菌(Pantoea)
库特菌(Kurthia gibs.)
假单胞菌属物种
肉杆菌(Carnobacterium)
白色念珠菌(Candida albicans)
尖孢镰刀菌(Fusarium oxysporum)
除肉杆菌和假单胞菌以外的所有微生物有机体(MO)都在20ml YPD培养基中培养过夜(30℃,200rpm);肉杆菌在TSB中培养,假单胞菌在CASO中培养。将20μl特定细菌或真菌的悬浮液应用于微孔滴定板的每个孔,并轻轻铺平。使悬浮液在某种程度上浸入琼脂。然后将疏水蛋白溶液(同样20μl)应用于孔的中央;在30℃培养箱中培养琼脂平板。在过夜培养后已经可以观测到最初结果:通过应用点周围没有生长物来揭示抗菌作用,而缺乏抗菌作用的孔被完全长满。
实施例3
yaad疏水蛋白RodA-His6的克隆
质粒#513的克隆类似于质粒#508,使用寡核苷酸KaM434和KaM435。
KaM434:GCTAAGCGGATCCATTGAAGGCCGCATGAAGTTCTCCATTGCTGC
KaM435:CCAATGGGGATCCGAGGATGGAGCCAAGGG
实施例4
yaad疏水蛋白BASF1-His6的克隆
质粒#507的克隆类似于质粒#508,使用寡核苷酸KaM417和KaM418。
使用的模板DNA是人工合成的DNA序列,疏水蛋白BASF1(见附件)。
KaM417:CCCGTAGCTAGTGGATCCATTGAAGGCCGCATGAAGTTCTCCGTCTCCGC
KaM418:CTGCCATTCAGGGGATCCCATATGGAGGAGGGAGACAG
实施例5
yaad疏水蛋白BASF2-His6的克隆
质粒#506的克隆类似于质粒#508,使用寡核苷酸KaM417和KaM418。使用的模板DNA是人工合成的DNA序列,疏水蛋白BASF2(见附件)。
KaM417:CCCGTAGCTAGTGGATCCATTGAAGGCCGCATGAAGTTCTCCGTCTCCGC
KaM418:CTGCCATTCAGGGGATCCCATATGGAGGAGGGAGACAG
实施例6
yaad疏水蛋白SC3-His6的克隆
质粒#526的克隆类似于质粒#508,使用寡核苷酸KaM464和KaM465。使用的模板DNA是Schyzophyllum commune cDNA(见附件)。
KaM464:CGTTAAGGATCCGAGGATGTTGATGGGGGTGC
KaM465:GCTAACAGATCTATGTTCGCCCGTCTCCCCGTCGT
实施例7
yaad疏水蛋白DewA-His6重组大肠杆菌菌株的发酵
将含有表达yaad疏水蛋白DewA-His6的大肠杆菌菌株的3ml LB液体培养基接种于15ml Greiner管中。于37℃在振荡器上以200rpm培养8小时。将1ml此预培养液各自接种于2个1升有塞子锥形瓶中的250mlLB培养基中(+100μg/ml氨苄青霉素),在37℃以180rpm振荡培养9小时。
将0.5升的预培养液(相对于水测量OD600nm 1∶10)接种于13.5升的LB培养基(+100μg/ml氨苄青霉素)中,置于20升的发酵罐。在OD600nm约3.5时添加140ml的100mM IPTG。3小时以后,将发酵罐冷却到10℃,并通过离心除去发酵液。细胞沉淀用于进一步的纯化。
实施例8
重组疏水蛋白融合蛋白的纯化
(带有C-末端His6标签的疏水蛋白融合蛋白的纯化)
100g的细胞沉淀(100-500mg的疏水蛋白)用50mM pH7.5的磷酸钠缓冲液补足到总体积为200ml,并将沉淀重悬。悬浮液用T25型高速分散器Ultraturrax(Janke和Kunkel;IKA-Labortechnik)处理10分钟,随后为了降解核酸,再与500单位benzonase核酸酶(Merck,Darmstadt;order No.1.01697.0001)在室温培育1小时。在破坏细胞之前使用玻璃萃取柱(P1)进行过滤。为了破坏细胞和剪断剩余的基因组DNA,用匀浆器在1500巴进行2次处理(Microfluidizer M-110EH;Microfluidics Corp.)。将匀浆液离心(Sorvall RC-5B,GSA转子,250ml离心筒,60分钟,4℃,12000rpm,23000g),上清置于冰上,用100ml pH7.5的磷酸钠缓冲液重悬沉淀。重复3次离心和重悬,在第3次重复时磷酸钠缓冲液中包含1%SDS。重悬以后,将溶液搅拌1小时,然后进行最终离心(Sorvall RC-5B,GSA转子,250ml离心筒,60分钟,4℃,12000rpm,23000g)。根据SDS-PAGE分析,疏 水蛋白存在于最终离心的上清液部分(图1)。实验显示疏水蛋白可以以包涵体的形式存在于相应大肠杆菌中。50ml含疏水蛋白的上清液应用于以50mM pH8.0的Tris-Cl缓冲液平衡的50ml镍-琼脂糖高性能17-5268-02柱(Amersham)。用50mM pH8.0的Tris-Cl缓冲液洗柱,随后用包含200mM咪唑的50mM pH8.0的Tris-Cl缓冲液洗脱疏水蛋白。为了去除咪唑,溶液用50mM pH8.0的Tris-Cl缓冲液透析。
图1描述本发明疏水蛋白的纯化
1泳道:应用于镍-琼脂糖柱的溶液(1∶10稀释)
2泳道:流出液=清洗步骤的洗脱液
3-5泳道:洗脱级分的OD 280峰
图1中本发明的疏水蛋白分子量约为53kD。一些较小条带代表所述疏水蛋白的降解产物。
实施例9
具有疏水蛋白表面的包被/评估
疏水蛋白或疏水蛋白融合蛋白的包被性质优选以玻璃和聚四氟乙烯(Teflon)分别作为亲水和疏水表面的模型进行评估。
标准包被实验
玻璃:
-疏水蛋白浓度:1-100μg/mL
-玻璃片在50mM pH4醋酸钠+0.1%吐温20中培育过夜(温度80℃)
-包被以后,在蒸馏水中清洗
-然后于80℃在1%SDS中培育10分钟
-在蒸馏水中清洗
聚四氟乙烯(Teflon):
-浓度:1-100μg/mL
-聚四氟乙烯(Teflon)平板在10mM pH8的Tris中培育过夜(温度80℃)
-包被以后,在蒸馏水中清洗
-然后于80℃在1%吐温20中培育10分钟
-在蒸馏水中清洗
-然后于80℃在1%SDS中培育10分钟
-在蒸馏水中清洗
样品在空气中干燥,测定5μl水滴的接触角(单位为度),得到如下值,例如:
根据实施例8以yaad-DewA融合蛋白进行实验(对照:无蛋白质;yaad-dewA-his6:100μg/ml融合蛋白配偶体):
80℃,1%SDS处理之后 | ||
聚四氟乙烯Teflon | 玻璃 | |
对照 | 96.8 | 30 |
yaad | 97.4 | 38.7 |
100μg/ml | 77.7 | 76.8 |
50μg/ml | 85.9 | 77.9 |
10μg/ml | 83.5 | 74.5 |
5μg/ml | 104 | 70.3 |
1μg/ml | 104.9 | 73 |
实施例10
具有疏水蛋白表面的包被/评估
玻璃(窗玻璃,Süddeutsche Glas,Mannheim,Germany):
-疏水蛋白浓度:100μg/mL
-玻璃片在50mM pH4的醋酸钠+0.1%吐温20中培育过夜(温度80℃)
-包被以后,在蒸馏水中清洗
-然后于80℃在1%SDS的蒸馏水溶液中培育10分钟
-在蒸馏水中清洗
样品在空气中干燥,且测定5μl水滴的接触角(单位为度)。
接触角的测量用Dataphysics接触角系统OCA 15+仪器,应用软件为SCA 20.2.0.(2002年11月)。根据制造商的说明书进行测量。
未处理的玻璃接触角为30±5°,以根据实施例8(yaad-dewA-his6)的功能性疏水蛋白包被的玻璃接触角为75±5°。
序列表中DNA和多肽序列的序列命名
dewA DNA和多肽序列 | SEQ ID NO:1 |
dewA多肽序列 | SEQ ID NO:2 |
rodA DNA和多肽序列 | SEQ ID NO:3 |
rodA多肽序列 | SEQ ID NO:4 |
hypA DNA和多肽序列 | SEQ ID NO:5 |
hypA多肽序列 | SEQ ID NO:6 |
hypB DNA和多肽序列 | SEQ ID NO:7 |
hypB多肽序列 | SEQ ID NO:8 |
sc3 DNA和多肽序列 | SEQ ID NO:9 |
sc3多肽序列 | SEQ ID NO:10 |
basf1 DNA和多肽序列 | SEQ ID NO:11 |
basf1多肽序列 | SEQ ID NO:12 |
basf2 DNA和多肽序列 | SEQ ID NO:13 |
basf2多肽序列 | SEQ ID NO:14 |
yaadDNA和多肽序列 | SEQ ID NO:15 |
yaad多肽序列 | SEQ ID NO:16 |
yaaeDNA和多肽序列 | SEQ ID NO:17 |
yaae多肽序列 | SEQ ID NO:18 |
yaad-Xa-dewA-his DNA和多肽序列 | SEQ ID NO:19 |
yaad-Xa-dewA-his多肽序列 | SEQ ID NO:20 |
yaad-Xa-rodA-his DNA和多肽序列 | SEQ ID NO:21 |
yaad-Xa-rodA-his多肽序列 | SEQ ID NO:22 |
yaad-Xa-basf1-his DNA和多肽序列 | SEQ ID NO:23 |
yaad-Xa-basf1-his多肽序列 | SEQ ID NO:24 |
序列表
<110>巴斯福股份公司
<120>新型半胱氨酸减少的疏水蛋白融合蛋白、其生产和用途
<130>AE 20040837
<160>24
<170>PatentIn version 3.1
<210>1
<211>405
<212>DNA
<213>basf-dewA
<220>
<221>CDS
<222>(1)..(405)
<223>
<400>1
atg cgc ttc atc gtc tct ctc ctc gcc ttc act gcc gcg gcc acc gcg 48
Met Arg Phe Ile Val Ser Leu Leu Ala Phe Thr Ala Ala Ala Thr Ala
1 5 10 15
acc gcc ctc ccg gcc tct gcc gca aag aac gcg aag ctg gcc acc tcg 96
Thr Ala Leu Pro Ala Ser Ala Ala Lys Asn Ala Lys Leu Ala Thr Ser
20 25 30
gcg gcc ttc gcc aag cag gct gaa ggc acc acc tgc aat gtc ggc tcg 144
Ala Ala Phe Ala Lys Gln Ala Glu Gly Thr Thr Cys Asn Val Gly Ser
35 40 45
atc gct tgc tgc aac tcc ccc gct gag acc aac aac gac agt ctg ttg 192
Ile Ala Cys Cys Asn Ser Pro Ala Glu Thr Asn Asn Asp Ser Leu Leu
50 55 60
agc ggt ctg ctc ggt gct ggc ctt ctc aac ggg ctc tcg ggc aac act 240
Ser Gly Leu Leu Gly Ala Gly Leu Leu Asn Gly Leu Ser Gly Asn Thr
65 70 75 80
ggc agc gcc tgc gcc aag gcg agc ttg att gac cag ctg ggt ctg ctc 288
Gly Ser Ala Cys Ala Lys Ala Ser Leu Ile Asp Gln Leu Gly Leu Leu
85 90 95
gct ctc gtc gac cac act gag gaa ggc ccc gtc tgc aag aac atc gtc 336
Ala Leu Val Asp His Thr Glu Glu Gly Pro Val Cys Lys Asn Ile Val
100 105 110
gct tgc tgc cct gag gga acc acc aac tgt gtt gcc gtc gac aac gct 384
Ala Cys Cys Pro Glu Gly Thr Thr Asn Cys Val Ala Val Asp Asn Ala
115 120 125
ggc gct ggt acc aag gct gag 405
Gly Ala Gly Thr Lys Ala Glu
130 135
<210>2
<211>135
<212>PRT
<213>basf-dewA
<400>2
Met Arg Phe Ile Val Ser Leu Leu Ala Phe Thr Ala Ala Ala Thr Ala
1 5 10 15
Thr Ala Leu Pro Ala Ser Ala Ala Lys Asn Ala Lys Leu Ala Thr Ser
20 25 30
Ala Ala Phe Ala Lys Gln Ala Glu Gly Thr Thr Cys Asn Val Gly Ser
35 40 45
Ile Ala Cys Cys Asn Ser Pro Ala Glu Thr Asn Asn Asp Ser Leu Leu
50 55 60
Ser Gly Leu Leu Gly Ala Gly Leu Leu Asn Gly Leu Ser Gly Asn Thr
65 70 75 80
Gly Ser Ala Cys Ala Lys Ala Ser Leu Ile Asp Gln Leu Gly Leu Leu
85 90 95
Ala Leu Val Asp His Thr Glu Glu Gly Pro Val Cys Lys Asn Ile Val
100 105 110
Ala Cys Cys Pro Glu Gly Thr Thr Asn Cys Val Ala Val Asp Asn Ala
115 120 125
Gly Ala Gly Thr Lys Ala Glu
130 135
<210>3
<211>471
<212>DNA
<213>basf-rodA
<220>
<221>CDS
<222>(1)..(471)
<223>
<400>3
atg aag ttc tcc att gct gcc gct gtc gtt gct ttc gcc gcc tcc gtc 48
Met Lys Phe Ser Ile Ala Ala Ala Val Val Ala Phe Ala Ala Ser Val
1 5 10 15
gcg gcc ctc cct cct gcc cat gat tcc cag ttc gct ggc aat ggt gtt 96
Ala Ala Leu Pro Pro Ala His Asp Ser Gln Phe Ala Gly Asn Gly Val
20 25 30
ggc aac aag ggc aac agc aac gtc aag ttc cct gtc ccc gaa aac gtg 144
Gly Asn Lys Gly Asn Ser Asn Val Lys Phe Pro Val Pro Glu Asn Val
35 40 45
acc gtc aag cag gcc tcc gac aag tgc ggt gac cag gcc cag ctc tct 192
Thr Val Lys Gln Ala Ser Asp Lys Cys Gly Asp Gln Ala Gln Leu Ser
50 55 60
tgc tgc aac aag gcc acg tac gcc ggt gac acc aca acc gtt gat gag 240
Cys Cys Asn Lys Ala Thr Tyr Ala Gly Asp Thr Thr Thr Val Asp Glu
65 70 75 80
ggt ctt ctg tct ggt gcc ctc agc ggc ctc atc ggc gcc ggg tct ggt 288
Gly Leu Leu Ser Gly Ala Leu Ser Gly Leu Ile Gly Ala Gly Ser Gly
85 90 95
gcc gaa ggt ctt ggt ctc ttc gat cag tgc tcc aag ctt gat gtt gct 336
Ala Glu Gly Leu Gly Leu Phe Asp Gln Cys Ser Lys Leu Asp Val Ala
100 105 110
gtc ctc att ggc atc caa gat ctt gtc aac cag aag tgc aag caa aac 384
Val Leu Ile Gly Ile Gln Asp Leu Val Asn Gln Lys Cys Lys Gln Asn
115 120 125
att gcc tgc tgc cag aac tcc ccc tcc agc gcg gat ggc aac ctt att 432
Ile Ala Cys Cys Gln Asn Ser Pro Ser Ser Ala Asp Gly Asn Leu Ile
130 135 140
ggt gtc ggt ctc cct tgc gtt gcc ctt ggc tcc atc ctc 471
Gly Val Gly Leu Pro Cys Val Ala Leu Gly Ser Ile Leu
145 150 155
<210>4
<211>157
<212>PRT
<213>basf-rodA
<400>4
Met Lys Phe Ser Ile Ala Ala Ala Val Val Ala Phe Ala Ala Ser Val
1 5 10 15
Ala Ala Leu Pro Pro Ala His Asp Ser Gln Phe Ala Gly Asn Gly Val
20 25 30
Gly Asn Lys Gly Asn Ser Asn Val Lys Phe Pro Val Pro Glu Asn Val
35 40 45
Thr Val Lys Gln Ala Ser Asp Lys Cys Gly Asp Gln Ala Gln Leu Ser
50 55 60
Cys Cys Asn Lys Ala Thr Tyr Ala Gly Asp Thr Thr Thr Val Asp Glu
65 70 75 80
Gly Leu Leu Ser Gly Ala Leu Ser Gly Leu Ile Gly Ala Gly Ser Gly
85 90 95
Ala Glu Gly Leu Gly Leu Phe Asp Gln Cys Ser Lys Leu Asp Val Ala
100 105 110
Val Leu Ile Gly Ile Gln Asp Leu Val Asn Gln Lys Cys Lys Gln Asn
115 120 125
Ile Ala Cys Cys Gln Asn Ser Pro Ser Ser Ala Asp Gly Asn Leu Ile
130 135 140
Gly Val Gly Leu Pro Cys Val Ala Leu Gly Ser Ile Leu
145 150 155
<210>5
<211>336
<212>DNA
<213>basf-HypA
<220>
<221>CDS
<222>(1)..(336)
<223>
<400>5
atg atc tct cgc gtc ctt gtc gct gct ctc gtc gct ctc ccc gct ctt 48
Met Ile Ser Arg Val Leu Val Ala Ala Leu Val Ala Leu Pro Ala Leu
1 5 10 15
gtt act gca act cct gct ccc gga aag cct aaa gcc agc agt cag tgc 96
Val Thr Ala Thr Pro Ala Pro Gly Lys Pro Lys Ala Ser Ser Gln Cys
20 25 30
gac gtc ggt gaa atc cat tgc tgt gac act cag cag act ccc gac cac 144
Asp Val Gly Glu Ile His Cys Cys Asp Thr Gln Gln Thr Pro Asp His
35 40 45
acc agc gcc gcc gcg tct ggt ttg ctt ggt gtt ccc atc aac ctt ggt 192
Thr Ser Ala Ala Ala Ser Gly Leu Leu Gly Val Pro Ile Asn Leu Gly
50 55 60
gct ttc ctc ggt ttc gac tgt acc ccc att tcc gtc ctt ggc gtc ggt 240
Ala Phe Leu Gly Phe Asp Cys Thr Pro Ile Ser Val Leu Gly Val Gly
65 70 75 80
ggc aac aac tgt gct gct cag cct gtc tgc tgc aca gga aat caa ttc 288
Gly Asn Asn Cys Ala Ala Gln Pro Val Cys Cys Thr Gly Asn Gln Phe
85 90 95
acc gca ttg att aac gct ctt gac tgc tct cct gtc aat gtc aac ctc 336
Thr Ala Leu Ile Asn Ala Leu Asp Cys Ser Pro Val Asn Val Asn Leu
100 105 110
<210>6
<211>112
<212>PRT
<213>basf-HypA
<400>6
Met Ile Ser Arg Val Leu Val Ala Ala Leu Val Ala Leu Pro Ala Leu
1 5 10 15
Val Thr Ala Thr Pro Ala Pro Gly Lys Pro Lys Ala Ser Ser Gln Cys
20 25 30
Asp Val Gly Glu Ile His Cys Cys Asp Thr Gln Gln Thr Pro Asp His
35 40 45
Thr Ser Ala Ala Ala Ser Gly Leu Leu Gly Val Pro Ile Asn Leu Gly
50 55 60
Ala Phe Leu Gly Phe Asp Cys Thr Pro Ile Ser Val Leu Gly Val Gly
65 70 75 80
Gly Asn Asn Cys Ala Ala Gln Pro Val Cys Cys Thr Gly Asn Gln Phe
85 90 95
Thr Ala Leu Ile Asn Ala Leu Asp Cys Ser Pro Val Asn Val Asn Leu
100 105 110
<210>7
<211>357
<212>DNA
<213>basf-HypB
<220>
<221>CDS
<222>(1)..(357)
<223>
<400>7
atg gtc agc acg ttc atc act gtc gca aag acc ctt ctc gtc gcg ctc 48
Met Val Ser Thr Phe Ile Thr Val Ala Lys Thr Leu Leu Val Ala Leu
1 5 10 15
ctc ttc gtc aat atc aat atc gtc gtt ggt act gca act acc ggc aag 96
Leu Phe Val Asn Ile Asn Ile Val Val Gly Thr Ala Thr Thr Gly Lys
20 25 30
cat tgt agc acc ggt cct atc gag tgc tgc aag cag gtc atg gat tct 144
His Cys Ser Thr Gly Pro Ile Glu Cys Cys Lys Gln Val Met Asp Ser
35 40 45
aag agc cct cag gct acg gag ctt ctt acg aag aat ggc ctt ggc ctg 192
Lys Ser Pro Gln Ala Thr Glu Leu Leu Thr Lys Asn Gly Leu Gly Leu
50 55 60
ggt gtc ctt gct ggc gtg aag ggt ctt gtt ggc gcg aat tgc agc cct 240
Gly Val Leu Ala Gly Val Lys Gly Leu Val Gly Ala Asn Cys Ser Pro
65 70 75 80
atc acg gca att ggt att ggc tcc ggc agc caa tgc tct ggc cag acc 288
Ile Thr Ala Ile Gly Ile Gly Ser Gly Ser Gln Cys Ser Gly Gln Thr
85 90 95
gtt tgc tgc cag aat aat aat ttc aac ggt gtt gtc gct att ggt tgc 336
Val Cys Cys Gln Asn Asn Asn Phe Asn Gly Val Val Ala Ile Gly Cys
100 105 110
act ccc att aat gcc aat gtg 357
Thr Pro Ile Asn Ala Asn Val
115
<210>8
<211>119
<212>PRT
<213>basf-HypB
<400>8
Met Val Ser Thr Phe Ile Thr Val Ala Lys Thr Leu Leu Val Ala Leu
1 5 10 15
Leu Phe Val Asn Ile Asn Ile Val Val Gly Thr Ala Thr Thr Gly Lys
20 25 30
His Cys Ser Thr Gly Pro Ile Glu Cys Cys Lys Gln Val Met Asp Ser
35 40 45
Lys Ser Pro Gln Ala Thr Glu Leu Leu Thr Lys Asn Gly Leu Gly Leu
50 55 60
Gly Val Leu Ala Gly Val Lys Gly Leu Val Gly Ala Asn Cys Ser Pro
65 70 75 80
Ile Thr Ala Ile Gly Ile Gly Ser Gly Ser Gln Cys Ser Gly Gln Thr
85 90 95
Val Cys Cys Gln Asn Asn Asn Phe Asn Gly Val Val Ala Ile Gly Cys
100 105 110
Thr Pro Ile Asn Ala Asn Val
115
<210>9
<211>408
<212>DNA
<213>basf-sc3
<220>
<221>CDS
<222>(1)..(408)
<223>
<400>9
atg ttc gcc cgt ctc ccc gtc gtg ttc ctc tac gcc ttc gtc gcg ttc 48
Met Phe Ala Arg Leu Pro Val Val Phe Leu Tyr Ala Phe Val Ala Phe
1 5 10 15
ggc gcc ctc gtc gct gcc ctc cca ggt ggc cac ccg ggc acg acc acg 96
Gly Ala Leu Val Ala Ala Leu Pro Gly Gly His Pro Gly Thr Thr Thr
20 25 30
ccg ccg gtt acg acg acg gtg acg gtg acc acg ccg ccc tcg acg acg 144
Pro Pro Val Thr Thr Thr Val Thr Val Thr Thr Pro Pro Ser Thr Thr
35 40 45
acc atc gcc gcc ggt ggc acg tgt act acg ggg tcg ctc tct tgc tgc 192
Thr Ile Ala Ala Gly Gly Thr Cys Thr Thr Gly Ser Leu Ser Cys Cys
50 55 60
aac cag gtt caa tcg gcg agc agc agc cct gtt acc gcc ctc ctc ggc 240
Asn Gln Val Gln Ser Ala Ser Ser Ser Pro Val Thr Ala Leu Leu Gly
65 70 75 80
ctg ctc ggc att gtc ctc agc gac ctc aac gtt ctc gtt ggc atc agc 288
Leu Leu Gly Ile Val Leu Ser Asp Leu Asn Val Leu Val Gly Ile Ser
85 90 95
tgc tct ccc ctc act gtc atc ggt gtc gga ggc agc ggc tgt tcg gcg 336
Cys Ser Pro Leu Thr Val Ile Gly Val Gly Gly Ser Gly Cys Ser Ala
100 105 110
cag acc gtc tgc tgc gaa aac acc caa ttc aac ggg ctg atc aac atc 384
Gln Thr Val Cys Cys Glu Asn Thr Gln Phe Asn Gly Leu Ile Asn Ile
115 120 125
ggt tgc acc ccc atc aac atc ctc 408
Gly Cys Thr Pro Ile Asn Ile Leu
130 135
<210>10
<211>136
<212>PRT
<213>basf-sc3
<400>10
Met Phe Ala Arg Leu Pro Val Val Phe Leu Tyr Ala Phe Val Ala Phe
1 5 10 15
Gly Ala Leu Val Ala Ala Leu Pro Gly Gly His Pro Gly Thr Thr Thr
20 25 30
Pro Pro Val Thr Thr Thr Val Thr Val Thr Thr Pro Pro Ser Thr Thr
35 40 45
Thr Ile Ala Ala Gly Gly Thr Cys Thr Thr Gly Ser Leu Ser Cys Cys
50 55 60
Asn Gln Val Gln Ser Ala Ser Ser Ser Pro Val Thr Ala Leu Leu Gly
65 70 75 80
Leu Leu Gly Ile Val Leu Ser Asp Leu Asn Val Leu Val Gly Ile Ser
85 90 95
Cys Ser Pro Leu Thr Val Ile Gly Val Gly Gly Ser Gly Cys Ser Ala
100 105 110
Gln Thr Val Cys Cys Glu Asn Thr Gln Phe Asn Gly Leu Ile Asn Ile
115 120 125
Gly Cys Thr Pro Ile Asn Ile Leu
130 135
<210>11
<211>483
<212>DNA
<213>basf-BASF1
<220>
<221>CDS
<222>(1)..(483)
<223>
<400>11
atg aag ttc tcc gtc tcc gcc gcc gtc ctc gcc ttc gcc gcc tcc gtc 48
Met Lys Phe Ser Val Ser Ala Ala Val Leu Ala Phe Ala Ala Ser Val
1 5 10 15
gcc gcc ctc cct cag cac gac tcc gcc gcc ggc aac ggc aac ggc gtc 96
Ala Ala Leu Pro Gln His Asp Ser Ala Ala Gly Asn Gly Asn Gly Val
20 25 30
ggc aac aag ttc cct gtc cct gac gac gtc acc gtc aag cag gcc acc 144
Gly Asn Lys Phe Pro Val Pro Asp Asp Val Thr Val Lys Gln Ala Thr
35 40 45
gac aag tgc ggc gac cag gcc cag ctc tcc tgc tgc aac aag gcc acc 192
Asp Lys Cys Gly Asp Gln Ala Gln Leu Ser Cys Cys Asn Lys Ala Thr
50 55 60
tac gcc ggc gac gtc ctc acc gac atc gac gag ggc atc ctc gcc ggc 240
Tyr Ala Gly Asp Val Leu Thr Asp Ile Asp Glu Gly Ile Leu Ala Gly
65 70 75 80
ctc ctc aag aac ctc atc ggc ggc ggc tcc ggc tcc gag ggc ctc ggc 288
Leu Leu Lys Asn Leu Ile Gly Gly Gly Ser Gly Ser Glu Gly Leu Gly
85 90 95
ctc ttc gac cag tgc gtc aag ctc gac ctc cag atc tcc gtc atc ggc 336
Leu Phe Asp Gln Cys Val Lys Leu Asp Leu Gln Ile Ser Val Ile Gly
100 105 110
atc cct atc cag gac ctc ctc aac cag gtc aac aag cag tgc aag cag 384
Ile Pro Ile Gln Asp Leu Leu Asn Gln Val Asn Lys Gln Cys Lys Gln
115 120 125
aac atc gcc tgc tgc cag aac tcc cct tcc gac gcc acc ggc tcc ctc 432
Asn Ile Ala Cys Cys Gln Asn Ser Pro Ser Asp Ala Thr Gly Ser Leu
130 135 140
gtc aac ctc ggc ctc ggc aac cct tgc atc cct gtc tcc ctc ctc cat 480
Val Asn Leu Gly Leu Gly Asn Pro Cys Ile Pro Val Ser Leu Leu His
145 150 155 160
atg 483
Met
<210>12
<211>161
<212>PRT
<213>basf-BASF1
<400>12
Met Lys Phe Ser Val Ser Ala Ala Val Leu Ala Phe Ala Ala Ser Val
1 5 10 15
Ala Ala Leu Pro Gln His Asp Ser Ala Ala Gly Asn Gly Asn Gly Val
20 25 30
Gly Asn Lys Phe Pro Val Pro Asp Asp Val Thr Val Lys Gln Ala Thr
35 40 45
Asp Lys Cys Gly Asp Gln Ala Gln Leu Ser Cys Cys Asn Lys Ala Thr
50 55 60
Tyr Ala Gly Asp Val Leu Thr Asp Ile Asp Glu Gly Ile Leu Ala Gly
65 70 75 80
Leu Leu Lys Asn Leu Ile Gly Gly Gly Ser Gly Ser Glu Gly Leu Gly
85 90 95
Leu Phe Asp Gln Cys Val Lys Leu Asp Leu Gln Ile Ser Val Ile Gly
100 105 110
Ile Pro Ile Gln Asp Leu Leu Asn Gln Val Asn Lys Gln Cys Lys Gln
115 120 125
Asn Ile Ala Cys Cys Gln Asn Ser Pro Ser Asp Ala Thr Gly Ser Leu
130 135 140
Val Asn Leu Gly Leu Gly Asn Pro Cys Ile Pro Val Ser Leu Leu His
145 150 155 160
Met
<210>13
<211>465
<212>DNA
<213>basf-BASF2
<220>
<221>CDS
<222>(1)..(465)
<223>
<400>13
atg aag ttc tcc gtc tcc gcc gcc gtc ctc gcc ttc gcc gcc tcc gtc 48
Met Lys Phe Ser Val Ser Ala Ala Val Leu Ala Phe Ala Ala Ser Val
1 5 10 15
gcc gcc ctc cct cag cac gac tcc gcc gcc ggc aac ggc aac ggc gtc 96
Ala Ala Leu Pro Gln His Asp Ser Ala Ala Gly Asn Gly Asn Gly Val
20 25 30
ggc aac aag ttc cct gtc cct gac gac gtc acc gtc aag cag gcc acc 144
Gly Asn Lys Phe Pro Val Pro Asp Asp Val Thr Val Lys Gln Ala Thr
35 40 45
gac aag tgc ggc gac cag gcc cag ctc tcc tgc tgc aac aag gcc acc 192
Asp Lys Cys Gly Asp Gln Ala Gln Leu Ser Cys Cys Asn Lys Ala Thr
50 55 60
tac gcc ggc gac gtc acc gac atc gac gag ggc atc ctc gcc ggc ctc 240
Tyr Ala Gly Asp Val Thr Asp Ile Asp Glu Gly Ile Leu Ala Gly Leu
65 70 75 80
ctc aag aac ctc atc ggc ggc ggc tcc ggc tcc gag ggc ctc ggc ctc 288
Leu Lys Asn Leu Ile Gly Gly Gly Ser Gly Ser Glu Gly Leu Gly Leu
85 90 95
ttc gac cag tgc gtc aag ctc gac ctc cag atc tcc gtc atc ggc atc 336
Phe Asp Gln Cys Val Lys Leu Asp Leu Gln Ile Ser Val Ile Gly Ile
100 105 110
cct atc cag gac ctc ctc aac cag cag tgc aag cag aac atc gcc tgc 384
Pro Ile Gln Asp Leu Leu Asn Gln Gln Cys Lys Gln Asn Ile Ala Cys
115 120 125
tgc cag aac tcc cct tcc gac gcc acc ggc tcc ctc gtc aac ctc ggc 432
Cys Gln Asn Ser Pro Ser Asp Ala Thr Gly Ser Leu Val Asn Leu Gly
130 135 140
aac cct tgc atc cct gtc tcc ctc ctc cat atg 465
Asn Pro Cys Ile Pro Val Ser Leu Leu His Met
145 150 155
<210>14
<211>155
<212>PRT
<213>basf-BASF2
<400>14
Met Lys Phe Ser Val Ser Ala Ala Val Leu Ala Phe Ala Ala Ser Val
1 5 10 15
Ala Ala Leu Pro Gln His Asp Ser Ala Ala Gly Asn Gly Asn Gly Val
20 25 30
Gly Asn Lys Phe Pro Val Pro Asp Asp Val Thr Val Lys Gln Ala Thr
35 40 45
Asp Lys Cys Gly Asp Gln Ala Gln Leu Ser Cys Cys Asn Lys Ala Thr
50 55 60
Tyr Ala Gly Asp Val Thr Asp Ile Asp Glu Gly Ile Leu Ala Gly Leu
65 70 75 80
Leu Lys Asn Leu Ile Gly Gly Gly Ser Gly Ser Glu Gly Leu Gly Leu
85 90 95
Phe Asp Gln Cys Val Lys Leu Asp Leu Gln Ile Ser Val Ile Gly Ile
100 105 110
Pro Ile Gln Asp Leu Leu Asn Gln Gln Cys Lys Gln Asn Ile Ala Cys
115 120 125
Cys Gln Asn Ser Pro Ser Asp Ala Thr Gly Ser Leu Val Asn Leu Gly
130 135 140
Asn Pro Cys Ile Pro Val Ser Leu Leu His Met
145 150 155
<210>15
<211>882
<212>DNA
<213>basf-yaad
<220>
<221>CDS
<222>(1)..(882)
<223>
<400>15
atg gct caa aca ggt act gaa cgt gta aaa cgc gga atg gca gaa atg 48
Met Ala Gln Thr Gly Thr Glu Arg Val Lys Arg Gly Met Ala Glu Met
1 5 10 15
caa aaa ggc ggc gtc atc atg gac gtc atc aat gcg gaa caa gcg aaa 96
Gln Lys Gly Gly Val Ile Met Asp Val Ile Asn Ala Glu Gln Ala Lys
20 25 30
atc gct gaa gaa gct gga gct gtc gct gta atg gcg cta gaa cgt gtg 144
Ile Ala Glu Glu Ala Gly Ala Val Ala Val Met Ala Leu Glu Arg Val
35 40 45
cca gca gat att cgc gcg gct gga gga gtt gcc cgt atg gct gac cct 192
Pro Ala Asp Ile Arg Ala Ala Gly Gly Val Ala Arg Met Ala Asp Pro
50 55 60
aca atc gtg gaa gaa gta atg aat gca gta tct atc ccg gta atg gca 240
Thr Ile Val Glu Glu Val Met Asn Ala Val Ser Ile Pro Val Met Ala
65 70 75 80
aaa gcg cgt atc gga cat att gtt gaa gcg cgt gtg ctt gaa gct atg 288
Lys Ala Arg Ile Gly His Ile Val Glu Ala Arg Val Leu Glu Ala Met
85 90 95
ggt gtt gac tat att gat gaa agt gaa gtt ctg acg ccg gct gac gaa 336
Gly Val Asp Tyr Ile Asp Glu Ser Glu Val Leu Thr Pro Ala Asp Glu
100 105 110
gaa ttt cat tta aat aaa aat gaa tac aca gtt cct ttt gtc tgt ggc 384
Glu Phe His Leu Asn Lys Asn Glu Tyr Thr Val Pro Phe Val Cys Gly
115 120 125
tgc cgt gat ctt ggt gaa gca aca cgc cgt att gcg gaa ggt gct tct 432
Cys Arg Asp Leu Gly Glu Ala Thr Arg Arg Ile Ala Glu Gly Ala Ser
130 135 140
atg ctt cgc aca aaa ggt gag cct gga aca ggt aat att gtt gag gct 480
Met Leu Arg Thr Lys Gly Glu Pro Gly Thr Gly Asn Ile Val Glu Ala
145 150 155 160
gtt cgc cat atg cgt aaa gtt aac gct caa gtg cgc aaa gta gtt gcg 528
Val Arg His Met Arg Lys Val Asn Ala Gln Val Arg Lys Val Val Ala
165 170 175
atg agt gag gat gag cta atg aca gaa gcg aaa aac cta ggt gct cct 576
Met Ser Glu Asp Glu Leu Met Thr Glu Ala Lys Asn Leu Gly Ala Pro
180 185 190
tac gag ctt ctt ctt caa att aaa aaa gac ggc aag ctt cct gtc gtt 624
Tyr Glu Leu Leu Leu Gln Ile Lys Lys Asp Gly Lys Leu Pro Val Val
195 200 205
aac ttt gcc gct ggc ggc gta gca act cca gct gat gct gct ctc atg 672
Asn Phe Ala Ala Gly Gly Val Ala Thr Pro Ala Asp Ala Ala Leu Met
210 215 220
atg cag ctt ggt gct gac gga gta ttt gtt ggt tct ggt att ttt aaa 720
Met Gln Leu Gly Ala Asp Gly Val Phe Val Gly Ser Gly Ile Phe Lys
225 230 235 240
tca gac aac cct gct aaa ttt gcg aaa gca att gtg gaa gca aca act 768
Ser Asp Asn Pro Ala Lys Phe Ala Lys Ala Ile Val Glu Ala Thr Thr
245 250 255
cac ttt act gat tac aaa tta atc gct gag ttg tca aaa gag ctt ggt 816
His Phe Thr Asp Tyr Lys Leu Ile Ala Glu Leu Ser Lys Glu Leu Gly
260 265 270
act gca atg aaa ggg att gaa atc tca aac tta ctt cca gaa cag cgt 864
Thr Ala Met Lys Gly Ile Glu Ile Ser Asn Leu Leu Pro Glu Gln Arg
275 280 285
atg caa gaa cgc ggc tgg 882
Met Gln Glu Arg Gly Trp
290
<210>16
<211>294
<212>PRT
<213>basf-yaad
<400>16
Met Ala Gln Thr Gly Thr Glu Arg Val Lys Arg Gly Met Ala Glu Met
1 5 10 15
Gln Lys Gly Gly Val Ile Met Asp Val Ile Asn Ala Glu Gln Ala Lys
20 25 30
Ile Ala Glu Glu Ala Gly Ala Val Ala Val Met Ala Leu Glu Arg Val
35 40 45
Pro Ala Asp Ile Arg Ala Ala Gly Gly Val Ala Arg Met Ala Asp Pro
50 55 60
Thr Ile Val Glu Glu Val Met Asn Ala Val Ser Ile Pro Val Met Ala
65 70 75 80
Lys Ala Arg Ile Gly His Ile Val Glu Ala Arg Val Leu Glu Ala Met
85 90 95
Gly Val Asp Tyr Ile Asp Glu Ser Glu Val Leu Thr Pro Ala Asp Glu
100 105 110
Glu Phe His Leu Asn Lys Asn Glu Tyr Thr Val Pro Phe Val Cys Gly
115 120 125
Cys Arg Asp Leu Gly Glu Ala Thr Arg Arg Ile Ala Glu Gly Ala Ser
130 135 140
Met Leu Arg Thr Lys Gly Glu Pro Gly Thr Gly Asn Ile Val Glu Ala
145 150 155 160
Val Arg His Met Arg Lys Val Asn Ala Gln Val Arg Lys Val Val Ala
165 170 175
Met Ser Glu Asp Glu Leu Met Thr Glu Ala Lys Asn Leu Gly Ala Pro
180 185 190
Tyr Glu Leu Leu Leu Gln Ile Lys Lys Asp Gly Lys Leu Pro Val Val
195 200 205
Asn Phe Ala Ala Gly Gly Val Ala Thr Pro Ala Asp Ala Ala Leu Met
210 215 220
Met Gln Leu Gly Ala Asp Gly Val Phe Val Gly Ser Gly Ile Phe Lys
225 230 235 240
Ser Asp Asn Pro Ala Lys Phe Ala Lys Ala Ile Val Glu Ala Thr Thr
245 250 255
His Phe Thr Asp Tyr Lys Leu Ile Ala Glu Leu Ser Lys Glu Leu Gly
260 265 270
Thr Ala Met Lys Gly Ile Glu Ile Ser Asn Leu Leu Pro Glu Gln Arg
275 280 285
Met Gln Glu Arg Gly Trp
290
<210>17
<211>591
<212>DNA
<213>basf-yaae
<220>
<221>CDS
<222>(1)..(591)
<223>
<400>17
atg gga tta aca ata ggt gta cta gga ctt caa gga gca gtt aga gag 48
Met Gly Leu Thr Ile Gly Val Leu Gly Leu Gln Gly Ala Val Arg Glu
1 5 10 15
cac atc cat gcg att gaa gca tgc ggc gcg gct ggt ctt gtc gta aaa 96
His Ile His Ala Ile Glu Ala Cys Gly Ala Ala Gly Leu Val Val Lys
20 25 30
cgt ccg gag cag ctg aac gaa gtt gac ggg ttg att ttg ccg ggc ggt 144
Arg Pro Glu Gln Leu Asn Glu Val Asp Gly Leu Ile Leu Pro Gly Gly
35 40 45
gag agc acg acg atg cgc cgt ttg atc gat acg tat caa ttc atg gag 192
Glu Ser Thr Thr Met Arg Arg Leu Ile Asp Thr Tyr Gln Phe Met Glu
50 55 60
ccg ctt cgt gaa ttc gct gct cag ggc aaa ccg atg ttt gga aca tgt 240
Pro Leu Arg Glu Phe Ala Ala Gln Gly Lys Pro Met Phe Gly Thr Cys
65 70 75 80
gcc gga tta att ata tta gca aaa gaa att gcc ggt tca gat aat cct 288
Ala Gly Leu Ile Ile Leu Ala Lys Glu Ile Ala Gly Ser Asp Asn Pro
85 90 95
cat tta ggt ctt ctg aat gtg gtt gta gaa cgt aat tca ttt ggc cgg 336
His Leu Gly Leu Leu Asn Val Val Val Glu Arg Asn Ser Phe Gly Arg
100 105 110
cag gtt gac agc ttt gaa gct gat tta aca att aaa ggc ttg gac gag 384
Gln Val Asp Ser Phe Glu Ala Asp Leu Thr Ile Lys Gly Leu Asp Glu
115 120 125
cct ttt act ggg gta ttc atc cgt gct ccg cat att tta gaa gct ggt 432
Pro Phe Thr Gly Val Phe Ile Arg Ala Pro His Ile Leu Glu Ala Gly
130 135 140
gaa aat gtt gaa gtt cta tcg gag cat aat ggt cgt att gta gcc gcg 480
Glu Asn Val Glu Val Leu Ser Glu His Asn Gly Arg Ile Val Ala Ala
145 150 155 160
aaa cag ggg caa ttc ctt ggc tgc tca ttc cat ccg gag ctg aca gaa 528
Lys Gln Gly Gln Phe Leu Gly Cys Ser Phe His Pro Glu Leu Thr Glu
165 170 175
gat cac cga gtg acg cag ctg ttt gtt gaa atg gtt gag gaa tat aag 576
Asp His Arg Val Thr Gln Leu Phe Val Glu Met Val Glu Glu Tyr Lys
180 185 190
caa aag gca ctt gta 591
Gln Lys Ala Leu Val
195
<210>18
<211>197
<212>PRT
<213>basf-yaae
<400>18
Met Gly Leu Thr Ile Gly Val Leu Gly Leu Gln Gly Ala Val Arg Glu
1 5 10 15
His Ile His Ala Ile Glu Ala Cys Gly Ala Ala Gly Leu Val Val Lys
20 25 30
Arg Pro Glu Gln Leu Asn Glu Val Asp Gly Leu Ile Leu Pro Gly Gly
35 40 45
Glu Ser Thr Thr Met Arg Arg Leu Ile Asp Thr Tyr Gln Phe Met Glu
50 55 60
Pro Leu Arg Glu Phe Ala Ala Gln Gly Lys Pro Met Phe Gly Thr Cys
65 70 75 80
Ala Gly Leu Ile Ile Leu Ala Lys GluIle Ala Gly Ser Asp Asn Pro
85 90 95
His Leu Gly Leu Leu Asn Val Val Val Glu Arg Asn Ser Phe Gly Arg
100 105 110
Gln Val Asp Ser Phe Glu Ala Asp Leu Thr Ile Lys Gly Leu Asp Glu
115 120 125
Pro Phe Thr Gly Val Phe Ile Arg Ala Pro His Ile Leu Glu Ala Gly
130 135 140
Glu Asn Val Glu Val Leu Ser Glu His Asn Gly Arg Ile Val Ala Ala
145 150 155 160
Lys Gln Gly Gln Phe Leu Gly Cys Ser Phe His Pro Glu Leu Thr Glu
165 170 175
Asp His Arg Val Thr Gln Leu Phe Val Glu Met Val Glu Glu Tyr Lys
180 185 190
Gln Lys Ala Leu Val
195
<210>19
<211>1329
<212>DNA
<213>basf-yaad-Xa-dewA-his
<220>
<221>CDS
<222>(1)..(1329)
<223>
<400>19
atg gct caa aca ggt act gaa cgt gta aaa cgc gga atg gca gaa atg 48
Met Ala Gln Thr Gly Thr Glu Arg Val Lys Arg Gly Met Ala Glu Met
1 5 10 15
caa aaa ggc ggc gtc atc atg gac gtc atc aat gcg gaa caa gcg aaa 96
Gln Lys Gly Gly Val Ile Met Asp Val Ile Asn Ala Glu Gln Ala Lys
20 25 30
atc gct gaa gaa gct gga gct gtc gct gta atg gcg cta gaa cgt gtg 144
Ile Ala Glu Glu Ala Gly Ala Val Ala Val Met Ala Leu Glu Arg Val
35 40 45
cca gca gat att cgc gcg gct gga gga gtt gcc cgt atg gct gac cct 192
Pro Ala Asp Ile Arg Ala Ala Gly Gly Val Ala Arg Met Ala Asp Pro
50 55 60
aca atc gtg gaa gaa gta atg aat gca gta tct atc ccg gta atg gca 240
Thr Ile Val Glu Glu Val Met Asn Ala Val Ser Ile Pro Val Met Ala
65 70 75 80
aaa gcg cgt atc gga cat att gtt gaa gcg cgt gtg ctt gaa gct atg 288
Lys Ala Arg Ile Gly His Ile Val Glu Ala Arg Val Leu Glu Ala Met
85 90 95
ggt gtt gac tat att gat gaa agt gaa gtt ctg acg ccg gct gac gaa 336
Gly Val Asp Tyr Ile Asp Glu Ser Glu Val Leu Thr Pro Ala Asp Glu
100 105 110
gaa ttt cat tta aat aaa aat gaa tac aca gtt cct ttt gtc tgt ggc 384
Glu Phe His Leu Asn Lys Asn Glu Tyr Thr Val Pro Phe Val Cys Gly
115 120 125
tgc cgt gat ctt ggt gaa gca aca cgc cgt att gcg gaa ggt gct tct 432
Cys Arg Asp Leu Gly Glu Ala Thr Arg Arg Ile Ala Glu Gly Ala Ser
130 135 140
atg ctt cgc aca aaa ggt gag cct gga aca ggt aat att gtt gag gct 480
Met Leu Arg Thr Lys Gly Glu Pro Gly Thr Gly Asn Ile Val Glu Ala
145 150 155 160
gtt cgc cat atg cgt aaa gtt aac gct caa gtg cgc aaa gta gtt gcg 528
Val Arg His Met Arg Lys Val Asn Ala Gln Val Arg Lys Val Val Ala
165 170 175
atg agt gag gat gag cta atg aca gaa gcg aaa aac cta ggt gct cct 576
Met Ser Glu Asp Glu Leu Met Thr Glu Ala Lys Asn Leu Gly Ala Pro
180 185 190
tac gag ctt ctt ctt caa att aaa aaa gac ggc aag ctt cct gtc gtt 624
Tyr Glu Leu Leu Leu Gln Ile Lys Lys Asp Gly Lys Leu Pro Val Val
195 200 205
aac ttt gcc gct ggc ggc gta gca act cca gct gat gct gct ctc atg 672
Asn Phe Ala Ala Gly Gly Val Ala Thr Pro Ala Asp Ala Ala Leu Met
210 215 220
atg cag ctt ggt gct gac gga gta ttt gtt ggt tct ggt att ttt aaa 720
Met Gln Leu Gly Ala Asp Gly Val Phe Val Gly Ser Gly Ile Phe Lys
225 230 235 240
tca gac aac cct gct aaa ttt gcg aaa gca att gtg gaa gca aca act 768
Ser Asp Asn Pro Ala Lys Phe Ala Lys Ala Ile Val Glu Ala Thr Thr
245 250 255
cac ttt act gat tac aaa tta atc gct gag ttg tca aaa gag ctt ggt 816
His Phe Thr Asp Tyr Lys Leu Ile Ala Glu Leu Ser Lys Glu Leu Gly
260 265 270
act gca atg aaa ggg att gaa atc tca aac tta ctt cca gaa cag cgt 864
Thr Ala Met Lys Gly Ile Glu Ile Ser Asn Leu Leu Pro Glu Gln Arg
275 280 285
atg caa gaa cgc ggc tgg aga tcc att gaa ggc cgc atg cgc ttc atc 912
Met Gln Glu Arg Gly Trp Arg Ser Ile Glu Gly Arg Met Arg Phe Ile
290 295 300
gtc tct ctc ctc gcc ttc act gcc gcg gcc acc gcg acc gcc ctc ccg 960
Val Ser Leu Leu Ala Phe Thr Ala Ala Ala Thr Ala Thr Ala Leu Pro
305 310 315 320
gcc tct gcc gca aag aac gcg aag ctg gcc acc tcg gcg gcc ttc gcc 1008
Ala Ser Ala Ala Lys Asn Ala Lys Leu Ala Thr Ser Ala Ala Phe Ala
325 330 335
aag cag gct gaa ggc acc acc tgc aat gtc ggc tcg atc gct tgc tgc 1056
Lys Gln Ala Glu Gly Thr Thr Cys Asn Val Gly Ser Ile Ala Cys Cys
340 345 350
aac tcc ccc gct gag acc aac aac gac agt ctg ttg agc ggt ctg ctc 1104
Asn Ser Pro Ala Glu Thr Asn Asn Asp Ser Leu Leu Ser Gly Leu Leu
355 360 365
ggt gct ggc ctt ctc aac ggg ctc tcg ggc aac act ggc agc gcc tgc 1152
Gly Ala Gly Leu Leu Asn Gly Leu Ser Gly Asn Thr Gly Ser Ala Cys
370 375 380
gcc aag gcg agc ttg att gac cag ctg ggt ctg ctc gct ctc gtc gac 1200
Ala Lys Ala Ser Leu Ile Asp Gln Leu Gly Leu Leu Ala Leu Val Asp
385 390 395 400
cac act gag gaa ggc ccc gtc tgc aag aac atc gtc gct tgc tgc cct 1248
His Thr Glu Glu Gly Pro Val Cys Lys Asn Ile Val Ala Cys Cys Pro
405 410 415
gag gga acc acc aac tgt gtt gcc gtc gac aac gct ggc gct ggt acc 1296
Glu Gly Thr Thr Asn Cys Val Ala Val Asp Asn Ala Gly Ala Gly Thr
420 425 430
aag gct gag gga tct cat cac cat cac cat cac 1329
Lys Ala Glu Gly Ser His His His His His His
435 440
<210>20
<211>443
<212>PRT
<213>basf-yaad-Xa-dewA-his
<400>20
Met Ala Gln Thr Gly Thr Glu Arg Val Lys Arg Gly Met Ala Glu Met
1 5 10 15
Gln Lys Gly Gly Val Ile Met Asp Val Ile Asn Ala Glu Gln Ala Lys
20 25 30
Ile Ala Glu Glu Ala Gly Ala Val Ala Val Met Ala Leu Glu Arg Val
35 40 45
Pro Ala Asp Ile Arg Ala Ala Gly Gly Val Ala Arg Met Ala Asp Pro
50 55 60
Thr Ile Val Glu Glu Val Met Asn Ala Val Ser Ile Pro Val Met Ala
65 70 75 80
Lys Ala Arg Ile Gly His Ile Val Glu Ala Arg Val Leu Glu Ala Met
85 90 95
Gly Val Asp Tyr Ile Asp Glu Ser Glu Val Leu Thr Pro Ala Asp Glu
100 105 110
Glu Phe His Leu Asn Lys Asn Glu Tyr Thr Val Pro Phe Val Cys Gly
115 120 125
Cys Arg Asp Leu Gly Glu Ala Thr Arg Arg Ile Ala Glu Gly Ala Ser
130 135 140
Met Leu Arg Thr Lys Gly Glu Pro Gly Thr Gly Asn Ile Val Glu Ala
145 150 155 160
Val Arg His Met Arg Lys Val Asn Ala Gln Val Arg Lys Val Val Ala
165 170 175
Met Ser Glu Asp Glu Leu Met Thr Glu Ala Lys Asn Leu Gly Ala Pro
180 185 190
Tyr Glu Leu Leu Leu Gln Ile Lys Lys Asp Gly Lys Leu Pro Val Val
195 200 205
Asn Phe Ala Ala Gly Gly Val Ala Thr Pro Ala Asp Ala Ala Leu Met
210 215 220
Met Gln Leu Gly Ala Asp Gly Val Phe Val Gly Ser Gly Ile Phe Lys
225 230 235 240
Ser Asp Asn Pro Ala Lys Phe Ala Lys Ala Ile Val Glu Ala Thr Thr
245 250 255
His Phe Thr Asp Tyr Lys Leu Ile Ala Glu Leu Ser Lys Glu Leu Gly
260 265 270
Thr Ala Met Lys Gly Ile Glu Ile Ser Asn Leu Leu Pro Glu Gln Arg
275 280 285
Met Gln Glu Arg Gly Trp Arg Ser Ile Glu Gly Arg Met Arg Phe Ile
290 295 300
Val Ser Leu Leu Ala Phe Thr Ala Ala Ala Thr Ala Thr Ala Leu Pro
305 310 315 320
Ala Ser Ala Ala Lys Asn Ala Lys Leu Ala Thr Ser Ala Ala Phe Ala
325 330 335
Lys Gln Ala Glu Gly Thr Thr Cys Asn Val Gly Ser Ile Ala Cys Cys
340 345 350
Asn Ser Pro Ala Glu Thr Asn Asn Asp Ser Leu Leu Ser Gly Leu Leu
355 360 365
Gly Ala Gly Leu Leu Asn Gly Leu Ser Gly Asn Thr Gly Ser Ala Cys
370 375 380
Ala Lys Ala Ser LeuIle Asp Gln Leu Gly Leu Leu Ala Leu Val Asp
385 390 395 400
His Thr Glu Glu Gly Pro Val Cys Lys Asn Ile Val Ala Cys Cys Pro
405 410 415
Glu Gly Thr Thr Asn Cys Val Ala Val Asp Asn Ala Gly Ala Gly Thr
420 425 430
Lys Ala Glu Gly Ser His His His His His His
435 440
<210>21
<211>1395
<212>DNA
<213>basf-yaad-Xa-rodA-his
<220>
<221>CDS
<222>(1)..(1395)
<223>
<400>21
atg gct caa aca ggt act gaa cgt gta aaa cgc gga atg gca gaa atg 48
Met Ala Gln Thr Gly Thr Glu Arg Val Lys Arg Gly Met Ala Glu Met
1 5 10 15
caa aaa ggc ggc gtc atc atg gac gtc atc aat gcg gaa caa gcg aaa 96
Gln Lys Gly Gly Val Ile Met Asp Val Ile Asn Ala Glu Gln Ala Lys
20 25 30
atc gct gaa gaa gct gga gct gtc gct gta atg gcg cta gaa cgt gtg 144
Ile Ala Glu Glu Ala Gly Ala Val Ala Val Met Ala Leu Glu Arg Val
35 40 45
cca gca gat att cgc gcg gct gga gga gtt gcc cgt atg gct gac cct 192
Pro Ala Asp Ile Arg Ala Ala Gly Gly Val Ala Arg Met Ala Asp Pro
50 55 60
aca atc gtg gaa gaa gta atg aat gca gta tct atc ccg gta atg gca 240
Thr Ile Val Glu Glu Val Met Asn Ala Val Ser Ile Pro Val Met Ala
65 70 75 80
aaa gcg cgt atc gga cat att gtt gaa gcg cgt gtg ctt gaa gct atg 288
Lys Ala Arg Ile Gly His Ile Val Glu Ala Arg Val Leu Glu Ala Met
85 90 95
ggt gtt gac tat att gat gaa agt gaa gtt ctg acg ccg gct gac gaa 336
Gly Val Asp Tyr Ile Asp Glu Ser Glu Val Leu Thr Pro Ala Asp Glu
100 105 110
gaa ttt cat tta aat aaa aat gaa tac aca gtt cct ttt gtc tgt ggc 384
Glu Phe His Leu Asn Lys Asn Glu Tyr Thr Val Pro Phe Val Cys Gly
115 120 125
tgc cgt gat ctt ggt gaa gca aca cgc cgt att gcg gaa ggt gct tct 432
Cys Arg Asp Leu Gly Glu Ala Thr Arg Arg Ile Ala Glu Gly Ala Ser
130 135 140
atg ctt cgc aca aaa ggt gag cct gga aca ggt aat att gtt gag gct 480
Met Leu Arg Thr Lys Gly Glu Pro Gly Thr Gly Asn Ile Val Glu Ala
145 150 155 160
gtt cgc cat atg cgt aaa gtt aac gct caa gtg cgc aaa gta gtt gcg 528
Val Arg His Met Arg Lys Val Asn Ala Gln Val Arg Lys Val Val Ala
165 170 175
atg agt gag gat gag cta atg aca gaa gcg aaa aac cta ggt gct cct 576
Met Ser Glu Asp Glu Leu Met Thr Glu Ala Lys Asn Leu Gly Ala Pro
180 185 190
tac gag ctt ctt ctt caa att aaa aaa gac ggc aag ctt cct gtc gtt 624
Tyr Glu Leu Leu Leu Gln Ile Lys Lys Asp Gly Lys Leu Pro Val Val
195 200 205
aac ttt gcc gct ggc ggc gta gca act cca gct gat gct gct ctc atg 672
Asn Phe Ala Ala Gly Gly Val Ala Thr Pro Ala Asp Ala Ala Leu Met
210 215 220
atg cag ctt ggt gct gac gga gta ttt gtt ggt tct ggt att ttt aaa 720
Met Gln Leu Gly Ala Asp Gly Val Phe Val Gly Ser Gly Ile Phe Lys
225 230 235 240
tca gac aac cct gct aaa ttt gcg aaa gca att gtg gaa gca aca act 768
Ser Asp Asn Pro Ala Lys Phe Ala Lys Ala Ile Val Glu Ala Thr Thr
245 250 255
cac ttt act gat tac aaa tta atc gct gag ttg tca aaa gag ctt ggt 816
His Phe Thr Asp Tyr Lys LeuIle Ala Glu Leu Ser Lys Glu Leu Gly
260 265 270
act gca atg aaa ggg att gaa atc tca aac tta ctt cca gaa cag cgt 864
Thr Ala Met Lys Gly Ile Glu Ile Ser Asn Leu Leu Pro Glu Gln Arg
275 280 285
atg caa gaa cgc ggc tgg aga tct att gaa ggc cgc atg aag ttc tcc 912
Met Gln Glu Arg Gly Trp Arg Ser Ile Glu Gly Arg Met Lys Phe Ser
290 295 300
att gct gcc gct gtc gtt gct ttc gcc gcc tcc gtc gcg gcc ctc cct 960
Ile Ala Ala Ala Val Val Ala Phe Ala Ala Ser Val Ala Ala Leu Pro
305 310 315 320
cct gcc cat gat tcc cag ttc gct ggc aat ggt gtt ggc aac aag ggc 1008
Pro Ala His Asp Ser Gln Phe Ala Gly Asn Gly Val Gly Asn Lys Gly
325 330 335
aac agc aac gtc aag ttc cct gtc ccc gaa aac gtg acc gtc aag cag 1056
Asn Ser Asn Val Lys Phe Pro Val Pro Glu Asn Val Thr Val Lys Gln
340 345 350
gcc tcc gac aag tgc ggt gac cag gcc cag ctc tct tgc tgc aac aag 1104
Ala Ser Asp Lys Cys Gly Asp Gln Ala Gln Leu Ser Cys Cys Asn Lys
355 360 365
gcc acg tac gcc ggt gac acc aca acc gtt gat gag ggt ctt ctg tct 1152
Ala Thr Tyr Ala Gly Asp Thr Thr Thr Val Asp Glu Gly Leu Leu Ser
370 375 380
ggt gcc ctc agc ggc ctc atc ggc gcc ggg tct ggt gcc gaa ggt ctt 1200
Gly Ala Leu Ser Gly Leu Ile Gly Ala Gly Ser Gly Ala Glu Gly Leu
385 390 395 400
ggt ctc ttc gat cag tgc tcc aag ctt gat gtt gct gtc ctc att ggc 1248
Gly Leu Phe Asp Gln Cys Ser Lys Leu Asp Val Ala Val Leu Ile Gly
405 410 415
atc caa gat ctt gtc aac cag aag tgc aag caa aac att gcc tgc tgc 1296
Ile Gln Asp Leu Val Asn Gln Lys Cys Lys Gln Asn Ile Ala Cys Cys
420 425 430
cag aac tcc ccc tcc agc gcg gat ggc aac ctt att ggt gtc ggt ctc 1344
Gln Asn Ser Pro Ser Ser Ala Asp Gly Asn Leu Ile Gly Val Gly Leu
435 440 445
cct tgc gtt gcc ctt ggc tcc atc ctc gga tct cat cac cat cac cat 1392
Pro Cys Val Ala Leu Gly Ser Ile Leu Gly Ser His His His His His
450 455 460
cac 1395
His
465
<210>22
<211>465
<212>PRT
<213>basf-yaad-Xa-rodA-his
<400>22
Met Ala Gln Thr Gly Thr Glu Arg Val Lys Arg Gly Met Ala Glu Met
1 5 10 15
Gln Lys Gly Gly Val Ile Met Asp Val Ile Asn Ala Glu Gln Ala Lys
20 25 30
Ile Ala Glu Glu Ala Gly Ala Val Ala Val Met Ala Leu Glu Arg Val
35 40 45
Pro Ala Asp Ile Arg Ala Ala Gly Gly Val Ala Arg Met Ala Asp Pro
50 55 60
Thr Ile Val Glu Glu Val Met Asn Ala Val Ser Ile Pro Val Met Ala
65 70 75 80
Lys Ala Arg Ile Gly His Ile Val Glu Ala Arg Val Leu Glu Ala Met
85 90 95
Gly Val Asp Tyr Ile Asp Glu Ser Glu Val Leu Thr Pro Ala Asp Glu
100 105 110
Glu Phe His Leu Asn Lys Asn Glu Tyr Thr Val Pro Phe Val Cys Gly
115 120 125
Cys Arg Asp Leu Gly Glu Ala Thr Arg Arg Ile Ala Glu Gly Ala Ser
130 135 140
Met Leu Arg Thr Lys Gly Glu Pro Gly Thr Gly Asn Ile Val Glu Ala
145 150 155 160
Val Arg His Met Arg Lys Val Asn Ala Gln Val Arg Lys Val Val Ala
165 170 175
Met Ser Glu Asp Glu Leu Met Thr Glu Ala Lys Asn Leu Gly Ala Pro
180 185 190
Tyr Glu Leu Leu Leu Gln Ile Lys Lys Asp Gly Lys Leu Pro Val Val
195 200 205
Asn Phe Ala Ala Gly Gly Val Ala Thr Pro Ala Asp Ala Ala Leu Met
210 215 220
Met Gln Leu Gly Ala Asp Gly Val Phe Val Gly Ser Gly Ile Phe Lys
225 230 235 240
Ser Asp Asn Pro Ala Lys Phe Ala Lys Ala Ile Val Glu Ala Thr Thr
245 250 255
His Phe Thr Asp Tyr Lys Leu Ile Ala Glu Leu Ser Lys Glu Leu Gly
260 265 270
Thr Ala Met Lys Gly Ile Glu Ile Ser Asn Leu Leu Pro Glu Gln Arg
275 280 285
Met Gln Glu Arg Gly Trp Arg Ser Ile Glu Gly Arg Met Lys Phe Ser
290 295 300
Ile Ala Ala Ala Val Val Ala Phe Ala Ala Ser Val Ala Ala Leu Pro
305 310 315 320
Pro Ala His Asp Ser Gln Phe Ala Gly Asn Gly Val Gly Asn Lys Gly
325 330 335
Asn Ser Asn Val Lys Phe Pro Val Pro Glu Asn Val Thr Val Lys Gln
340 345 350
Ala Ser Asp Lys Cys Gly Asp Gln Ala Gln Leu Ser Cys Cys Asn Lys
355 360 365
Ala Thr Tyr Ala Gly Asp Thr Thr Thr Val Asp Glu Gly Leu Leu Ser
370 375 380
Gly Ala Leu Ser Gly Leu Ile Gly Ala Gly Ser Gly Ala Glu Gly Leu
385 390 395 400
Gly Leu Phe Asp Gln Cys Ser Lys Leu Asp Val Ala Val Leu Ile Gly
405 410 415
Ile Gln Asp Leu Val Asn Gln Lys Cys Lys Gln Asn Ile Ala Cys Cys
420 425 430
Gln Asn Ser Pro Ser Ser Ala Asp Gly Asn Leu Ile Gly Val Gly Leu
435 440 445
Pro Cys Val Ala Leu Gly Ser Ile Leu Gly Ser His His His His His
450 455 460
His
465
<210>23
<211>1407
<212>DNA
<213>basf-yaad-Xa-BASF1-his
<220>
<221>CDS
<222>(1)..(1407)
<223>
<400>23
atg gct caa aca ggt act gaa cgt gta aaa cgc gga atg gca gaa atg 48
Met Ala Gln Thr Gly Thr Glu Arg Val Lys Arg Gly Met Ala Glu Met
1 5 10 15
caa aaa ggc ggc gtc atc atg gac gtc atc aat gcg gaa caa gcg aaa 96
Gln Lys Gly Gly Val Ile Met Asp Val Ile Asn Ala Glu Gln Ala Lys
20 25 30
atc gct gaa gaa gct gga gct gtc gct gta atg gcg cta gaa cgt gtg 144
Ile Ala Glu Glu Ala Gly Ala Val Ala Val Met Ala Leu Glu Arg Val
35 40 45
cca gca gat att cgc gcg gct gga gga gtt gcc cgt atg gct gac cct 192
Pro Ala Asp Ile Arg Ala Ala Gly Gly Val Ala Arg Met Ala Asp Pro
50 55 60
aca atc gtg gaa gaa gta atg aat gca gta tct atc ccg gta atg gca 240
Thr Ile Val Glu Glu Val Met Asn Ala Val Ser Ile Pro Val Met Ala
65 70 75 80
aaa gcg cgt atc gga cat att gtt gaa gcg cgt gtg ctt gaa gct atg 288
Lys Ala Arg Ile Gly His Ile Val Glu Ala Arg Val Leu Glu Ala Met
85 90 95
ggt gtt gac tat att gat gaa agt gaa gtt ctg acg ccg gct gac gaa 336
Gly Val Asp Tyr Ile Asp Glu Ser Glu Val Leu Thr Pro Ala Asp Glu
100 105 110
gaa ttt cat tta aat aaa aat gaa tac aca gtt cct ttt gtc tgt ggc 384
Glu Phe His Leu Asn Lys Asn Glu Tyr Thr Val Pro Phe Val Cys Gly
115 120 125
tgc cgt gat ctt ggt gaa gca aca cgc cgt att gcg gaa ggt gct tct 432
Cys Arg Asp Leu Gly Glu Ala Thr Arg Arg Ile Ala Glu Gly Ala Ser
130 135 140
atg ctt cgc aca aaa ggt gag cct gga aca ggt aat att gtt gag gct 480
Met Leu Arg Thr Lys Gly Glu Pro Gly Thr Gly Asn Ile Val Glu Ala
145 150 155 160
gtt cgc cat atg cgt aaa gtt aac gct caa gtg cgc aaa gta gtt gcg 528
Val Arg His Met Arg Lys Val Asn Ala Gln Val Arg Lys Val Val Ala
165 170 175
atg agt gag gat gag cta atg aca gaa gcg aaa aac cta ggt gct cct 576
Met Ser Glu Asp Glu Leu Met Thr Glu Ala Lys Asn Leu Gly Ala Pro
180 185 190
tac gag ctt ctt ctt caa att aaa aaa gac ggc aag ctt cct gtc gtt 624
Tyr Glu Leu Leu Leu Gln Ile Lys Lys Asp Gly Lys Leu Pro Val Val
195 200 205
aac ttt gcc gct ggc ggc gta gca act cca gct gat gct gct ctc atg 672
Asn Phe Ala Ala Gly Gly Val Ala Thr Pro Ala Asp Ala Ala Leu Met
210 215 220
atg cag ctt ggt gct gac gga gta ttt gtt ggt tct ggt att ttt aaa 720
Met Gln Leu Gly Ala Asp Gly Val Phe Val Gly Ser Gly Ile Phe Lys
225 230 235 240
tca gac aac cct gct aaa ttt gcg aaa gca att gtg gaa gca aca act 768
Ser Asp Asn Pro Ala Lys Phe Ala Lys Ala Ile Val Glu Ala Thr Thr
245 250 255
cac ttt act gat tac aaa tta atc gct gag ttg tca aaa gag ctt ggt 816
His Phe Thr Asp Tyr Lys Leu Ile Ala Glu Leu Ser Lys Glu Leu Gly
260 265 270
act gca atg aaa ggg att gaa atc tca aac tta ctt cca gaa cag cgt 864
Thr Ala Met Lys Gly Ile Glu Ile Ser Asn Leu Leu Pro Glu Gln Arg
275 280 285
atg caa gaa cgc ggc tgg aga tct att gaa ggc cgc atg aag ttc tcc 912
Met Gln Glu Arg Gly Trp Arg Ser Ile Glu Gly Arg Met Lys Phe Ser
290 295 300
gtc tcc gcc gcc gtc ctc gcc ttc gcc gcc tcc gtc gcc gcc ctc cct 960
Val Ser Ala Ala Val Leu Ala Phe Ala Ala Ser Val Ala Ala Leu Pro
305 310 315 320
cag cac gac tcc gcc gcc ggc aac ggc aac ggc gtc ggc aac aag ttc 1008
Gln His Asp Ser Ala Ala Gly Asn Gly Asn Gly Val Gly Asn Lys Phe
325 330 335
cct gtc cct gac gac gtc acc gtc aag cag gcc acc gac aag tgc ggc 1056
Pro Val Pro Asp Asp Val Thr Val Lys Gln Ala Thr Asp Lys Cys Gly
340 345 350
gac cag gcc cag ctc tcc tgc tgc aac aag gcc acc tac gcc ggc gac 1104
Asp Gln Ala Gln Leu Ser Cys Cys Asn Lys Ala Thr Tyr Ala Gly Asp
355 360 365
gtc ctc acc gac atc gac gag ggc atc ctc gcc ggc ctc ctc aag aac 1152
Val Leu Thr Asp Ile Asp Glu Gly Ile Leu Ala Gly Leu Leu Lys Asn
370 375 380
ctc atc ggc ggc ggc tcc ggc tcc gag ggc ctc ggc ctc ttc gac cag 1200
Leu Ile Gly Gly Gly Ser Gly Ser Glu Gly Leu Gly Leu Phe Asp Gln
385 390 395 400
tgc gtc aag ctc gac ctc cag atc tcc gtc atc ggc atc cct atc cag 1248
Cys Val Lys Leu Asp Leu Gln Ile Ser Val Ile Gly Ile Pro Ile Gln
405 410 415
gac ctc ctc aac cag gtc aac aag cag tgc aag cag aac atc gcc tgc 1296
Asp Leu Leu Asn Gln Val Asn Lys Gln Cys Lys Gln Asn Ile Ala Cys
420 425 430
tgc cag aac tcc cct tcc gac gcc acc ggc tcc ctc gtc aac ctc ggc 1344
Cys Gln Asn Ser Pro Ser Asp Ala Thr Gly Ser Leu Val Asn Leu Gly
435 440 445
ctc ggc aac cct tgc atc cct gtc tcc ctc ctc cat atg gga tct cat 1392
Leu Gly Asn Pro Cys Ile Pro Val Ser Leu Leu His Met Gly Ser His
450 455 460
cac cat cac cat cac 1407
His His His His His
465
<210>24
<211>469
<212>PRT
<213>basf-yaad-Xa-BASF1-his
<400>24
Met Ala Gln Thr Gly Thr Glu Arg Val Lys Arg Gly Met Ala Glu Met
1 5 10 15
Gln Lys Gly Gly Val Ile Met Asp Val Ile Asn Ala Glu Gln Ala Lys
20 25 30
Ile Ala Glu Glu Ala Gly Ala Val Ala Val Met Ala Leu Glu Arg Val
35 40 45
Pro Ala Asp Ile Arg Ala Ala Gly Gly Val Ala Arg Met Ala Asp Pro
50 55 60
Thr Ile Val Glu Glu Val Met Asn Ala Val Ser Ile Pro Val Met Ala
65 70 75 80
Lys Ala Arg Ile Gly His Ile Val Glu Ala Arg Val Leu Glu Ala Met
85 90 95
Gly Val Asp Tyr Ile Asp Glu Ser Glu Val Leu Thr Pro Ala Asp Glu
100 105 110
Glu Phe His Leu Asn Lys Asn Glu Tyr Thr Val Pro Phe Val Cys Gly
115 120 125
Cys Arg Asp Leu Gly Glu Ala Thr Arg Arg Ile Ala Glu Gly Ala Ser
130 135 140
Met Leu Arg Thr Lys Gly Glu Pro Gly Thr Gly Asn Ile Val Glu Ala
145 150 155 160
Val Arg His Met Arg Lys Val Asn Ala Gln Val Arg Lys Val Val Ala
165 170 175
Met Ser Glu Asp Glu Leu Met Thr Glu Ala Lys Asn Leu Gly Ala Pro
180 185 190
Tyr Glu Leu Leu Leu Gln Ile Lys Lys Asp Gly Lys Leu Pro Val Val
195 200 205
Asn Phe Ala Ala Gly Gly Val Ala Thr Pro Ala Asp Ala Ala Leu Met
210 215 220
Met Gln Leu Gly Ala Asp Gly Val Phe Val Gly Ser Gly Ile Phe Lys
225 230 235 240
Ser Asp Asn Pro Ala Lys Phe Ala Lys Ala Ile Val Glu Ala Thr Thr
245 250 255
His Phe Thr Asp Tyr Lys Leu Ile Ala Glu Leu Ser Lys Glu Leu Gly
260 265 270
Thr Ala Met Lys Gly Ile Glu Ile Ser Asn Leu Leu Pro Glu Gln Arg
275 280 285
Met Gln Glu Arg Gly Trp Arg Ser Ile Glu Gly Arg Met Lys Phe Ser
290 295 300
Val Ser Ala Ala Val Leu Ala Phe Ala Ala Ser Val Ala Ala Leu Pro
305 310 315 320
Gln His Asp Ser Ala Ala Gly Asn Gly Asn Gly Val Gly Asn Lys Phe
325 330 335
Pro Val Pro Asp Asp Val Thr Val Lys Gln Ala Thr Asp Lys Cys Gly
340 345 350
Asp Gln Ala Gln Leu Ser Cys Cys Asn Lys Ala Thr Tyr Ala Gly Asp
355 360 365
Val Leu Thr AspIle Asp Glu Gly Ile Leu Ala Gly Leu Leu Lys Asn
370 375 380
Leu Ile Gly Gly Gly Ser Gly Ser Glu Gly Leu Gly Leu Phe Asp Gln
385 390 395 400
Cys Val Lys Leu Asp Leu Gln Ile Ser Val Ile Gly Ile Pro Ile Gln
405 410 415
Asp Leu Leu Asn Gln Val Asn Lys Gln Cys Lys Gln Asn Ile Ala Cys
420 425 430
Cys Gln Asn Ser Pro Ser Asp Ala Thr Gly Ser Leu Val Asn Leu Gly
435 440 445
Leu Gly Asn Pro Cys Ile Pro Val Ser Leu Leu His Met Gly Ser His
450 455 460
His His His His His
465
Claims (11)
1.通用结构式(I)的多肽
Xn-C1-X1-50-C2-X0-5-C3-Xp-C4-X1-100-C5-X1-50-C6-X0-5-C7-X1-50-C8-Xm
(I)
其中X可以是20种天然存在的氨基酸中的任何一种,
n和m是0-500之间的数,p是1-250之间的数,C是半胱氨酸、丙氨酸、丝氨酸、甘氨酸、甲硫氨酸或苏氨酸,有至少4个由C指定的残基为半胱氨酸,且其中C3为半胱氨酸和C4为半胱氨酸,
附带条件是至少一个缩写为Xn或Xm或Xp的肽序列是长度至少为20个氨基酸的非天然连接至疏水蛋白的肽序列,该多肽在包被玻璃表面之后使接触角改变至少20°。
2.根据权利要求1的多肽,其中结构式(I)包含I类疏水蛋白。
3.根据权利要求1的多肽,其中结构式(I)包含选自dewA、rodA、sc3、hypA、hypB、basf1、basf2组成的组的疏水蛋白。
4.根据权利要求1的多肽,其中结构式(I)包含选自SEQ ID NO:2、4、6、8、10、12、14的序列。
5.根据权利要求1的多肽,其中Xn或Xm包含选自SEQ ID NO:16、18的多肽序列。
6.根据权利要求1的多肽,其中Xn或Xm是(His)4-10序列。
7.根据权利要求1的多肽,其中结构式(I)包含选自SEQ ID NO:20、22、24组成的组的多肽。
8.编码权利要求1-7中任一项定义的多肽的核酸。
9.产生权利要求1-7中任一项定义的多肽的方法,该方法是通过在宿主生物体中表达权利要求8中定义的核酸,并且分离由此获得的蛋白质,如果合适则在纯化之后分离。
10.根据权利要求9的方法,其中使用的宿主生物体是大肠杆菌。
11.权利要求1-7中任一项的多肽用于表面包被的用途。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE102005027139A DE102005027139A1 (de) | 2005-06-10 | 2005-06-10 | Neue Cystein-verarmte Hydrophobinfusionsproteine, deren Herstellung und Verwendung |
DE102005027139.1 | 2005-06-10 | ||
PCT/EP2006/063066 WO2006131564A2 (de) | 2005-06-10 | 2006-06-09 | Neue cystein-verarmte hydrophobinfusionsproteine, deren herstellung und verwendung |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101193911A CN101193911A (zh) | 2008-06-04 |
CN101193911B true CN101193911B (zh) | 2013-06-12 |
Family
ID=37498802
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2006800205772A Expired - Fee Related CN101193911B (zh) | 2005-06-10 | 2006-06-09 | 半胱氨酸减少的疏水蛋白融合蛋白、其生产和用途 |
Country Status (7)
Country | Link |
---|---|
US (1) | US7910699B2 (zh) |
EP (1) | EP1893639A2 (zh) |
JP (1) | JP5302676B2 (zh) |
CN (1) | CN101193911B (zh) |
CA (1) | CA2611234A1 (zh) |
DE (1) | DE102005027139A1 (zh) |
WO (1) | WO2006131564A2 (zh) |
Families Citing this family (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1848734A2 (de) * | 2005-02-07 | 2007-10-31 | Basf Aktiengesellschaft | Verfahren zum beschichten von oberflächen mit hydrophobinen |
EP1848733B1 (de) | 2005-02-07 | 2017-06-21 | Basf Se | Neue hydrophobinfusionsproteine, deren herstellung und verwendung |
EP1866150B1 (de) | 2005-03-31 | 2016-10-19 | Basf Se | Metallische substrate mit polypeptiden als haftvermittler |
BRPI0608678A2 (pt) | 2005-04-01 | 2010-11-30 | Basf Ag | uso de uma hidrofobina, processo para perfurar um furo de sondagem para desenvolver depósitos subterráneos, lama de perfuração, e, processo para produzir uma lama de perfuração |
JP2008534554A (ja) | 2005-04-01 | 2008-08-28 | ビーエーエスエフ ソシエタス・ヨーロピア | 乳化破壊剤としての蛋白質の使用 |
DE102005027139A1 (de) | 2005-06-10 | 2006-12-28 | Basf Ag | Neue Cystein-verarmte Hydrophobinfusionsproteine, deren Herstellung und Verwendung |
DE102005029704A1 (de) * | 2005-06-24 | 2007-01-11 | Basf Ag | Verwendung von Hydrophobin-Polypeptiden sowie Konjugaten aus Hydrophobin-Polypeptiden mit Wirk-oder Effektstoffen und ihre Herstellung sowie deren Einsatz in der Kosmetik |
DE102005033002A1 (de) * | 2005-07-14 | 2007-01-18 | Basf Ag | Wässrige Monomeremulsionen enthaltend Hydrophobin |
DE102005048720A1 (de) | 2005-10-12 | 2007-04-19 | Basf Ag | Verwendung von Proteinen als Antischaum-Komponente in Kraftstoffen |
EP2054687B1 (de) * | 2006-08-15 | 2011-10-19 | Basf Se | Verfahren zur herstellung von trockenen freifliessenden hydrophobinzubereitungen |
WO2008107439A1 (de) * | 2007-03-06 | 2008-09-12 | Basf Se | Mit hydrophobinen modifizierte offenzellige schaumstoffe |
US20100166627A1 (en) * | 2007-05-24 | 2010-07-01 | Basf Se | Use of hydrophobins as additives in the crystallization of solids |
WO2009037061A2 (de) * | 2007-09-13 | 2009-03-26 | Basf Se | Verwendung von hydrophobin-polypeptiden als penetrationsverstärker |
EP2042155A1 (de) * | 2007-09-28 | 2009-04-01 | Basf Se | Verfahren zum Entfernen von wasserunlöslichen Substanzen von Substratoberflächen |
WO2010072665A1 (de) | 2008-12-23 | 2010-07-01 | Basf Se | Modifizierung von nano- oder mesofasern oder textilen flächengebilden hergestellt mittels elektrospinnen mit amphiphilen proteinen |
EP2395969A2 (de) | 2009-02-10 | 2011-12-21 | Basf Se | Verwendung von hydrophobin als spreitmittel |
WO2010102934A1 (de) | 2009-03-09 | 2010-09-16 | Basf Se | Verwendung einer mischung aus wasserloslichen polymeren und hydrophobinen zum verdicken wässriger phasen |
EP2462166A2 (en) | 2009-08-03 | 2012-06-13 | Basf Se | Process for deposition of thin layers of metal oxides |
EP2371844A1 (en) | 2010-03-25 | 2011-10-05 | B.R.A.I.N. Biotechnology Research and Information Network AG | Chimeric surface active proteins |
AU2011234510A1 (en) | 2010-03-31 | 2012-10-25 | Basf Se | Coated stents and process for coating with protein |
WO2012004255A1 (de) | 2010-07-07 | 2012-01-12 | Basf Se | Zusammensetzung enthaltend ein hydrophobin und verfahren zum reinigen von hydrophoben oberflächen |
WO2012049250A2 (de) | 2010-10-13 | 2012-04-19 | Basf Se | Verfahren zum immobilisieren kationischer wirkstoffe auf oberflächen |
CA2832975A1 (en) | 2011-04-15 | 2012-10-18 | Danisco Us Inc. | Methods of purifying hydrophobin |
US20130219559A1 (en) | 2012-02-22 | 2013-08-22 | Kimmo Koivu | Method for hydrophobin production in plants and methods to produce hydrophobin multimers in plants and microbes |
BR112014021521B1 (pt) | 2012-04-05 | 2022-05-10 | Basf Plant Science Company Gmbh | Método para aumentar a resistência à infecção por phakopsora em uma planta de soja transgênica, método para a produção de uma planta de soja transgênica, método para a produção de um produto e método para criar uma planta transgênica resistente a fungos |
AU2013266516A1 (en) | 2012-05-21 | 2014-10-09 | Danisco Us Inc. | Trichoderma hydrophobin production |
US9982229B2 (en) * | 2013-12-19 | 2018-05-29 | Danisco Us Inc. | Use of hydrophobins to increase gas transfer in aerobic fermentation processes |
WO2016071136A1 (en) | 2014-11-05 | 2016-05-12 | Basf Se | A method of preparing an agrochemical composition with reduced toxicity by milling a premix of a pesticide and a hydrophobin |
CN108004254B (zh) * | 2017-12-13 | 2020-04-17 | 天津大学 | 疏水蛋白mHGFI基因及表达的蛋白及应用 |
CN108060169B (zh) * | 2017-12-13 | 2020-04-17 | 天津大学 | 疏水蛋白mHFBI基因及表达的蛋白及应用 |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101115768A (zh) * | 2005-02-07 | 2008-01-30 | 巴斯福股份公司 | 新的疏水蛋白融合蛋白产品、其制备及用途 |
Family Cites Families (56)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB195876A (en) | 1922-04-25 | 1923-04-12 | Sharples Specialty Co | Process for resolving water-in-oil emulsions |
US2399161A (en) * | 1942-06-30 | 1946-04-30 | Claude R Wickard | Process for producing glues and adhesives from keratin protein materials |
GB1278924A (en) * | 1970-02-06 | 1972-06-21 | Ici Ltd | Improvements in synthetic film materials |
DE2609104A1 (de) * | 1976-03-05 | 1977-09-15 | Basf Ag | Verfahren zur herstellung von styrol-suspensionspolymerisaten |
DE2638839A1 (de) * | 1976-08-28 | 1978-03-02 | Basf Ag | Verfahren zur herstellung von styrol-suspensionspolymerisaten |
JPS60206893A (ja) | 1984-03-31 | 1985-10-18 | Yoshinari Shimada | 油中水滴型乳状燃料油の製造方法 |
US5049504A (en) * | 1986-11-24 | 1991-09-17 | Genex Corporation | Bioadhesive coding sequences |
JPS6323670A (ja) * | 1986-04-25 | 1988-01-30 | バイオ−ポリマ−ズ インコ−ポレ−テツド | 接着・被覆組成物とその使用方法 |
IT1196484B (it) | 1986-07-11 | 1988-11-16 | Sclavo Spa | Vettore ad espressione e secrezione in lieviti,utile per la preparazione di proteine eterologhe |
GB8918185D0 (en) | 1989-08-09 | 1989-09-20 | Secr Defence | Ochrobactrum surfactant |
DE4024871A1 (de) | 1990-08-06 | 1992-02-13 | Basf Ag | Perlfoermige antistatische expandierbare styrolpolymerisate |
DE4220225A1 (de) * | 1992-06-20 | 1993-12-23 | Basf Ag | Verfahren zur Herstellung von perlförmigen expandierbaren Styrolpolymerisaten |
WO1994009094A1 (en) | 1992-10-09 | 1994-04-28 | Won Jae Yim | A process for preparing emulsified fuel oil |
FR2701490B1 (fr) | 1993-02-16 | 1995-04-14 | Inst Francais Du Petrole | Procédé de production d'un mout de xanthane ayant une propriété améliorée, composition obtenue et application de la composition dans une boue de forage de puits. |
JP3508157B2 (ja) | 1993-05-19 | 2004-03-22 | 旭硝子株式会社 | 分裂酵母接合フェロモン前駆体遺伝子 |
ATE197608T1 (de) | 1993-12-10 | 2000-12-15 | Korea Inst Sci & Tech | Signalsequenzen für die secretion heterologer proteine von hefe |
JP3537487B2 (ja) | 1994-04-27 | 2004-06-14 | 株式会社海洋バイオテクノロジー研究所 | ムラサキイガイ接着蛋白質遺伝子 |
IL110938A (en) * | 1994-09-12 | 2001-01-28 | Haber Meir | Adhesive proteins isolated from mature macro and microalgae |
DE69636536T2 (de) | 1995-02-03 | 2007-10-25 | Asahi Glass Co., Ltd. | Sekretionssignal-Gen und dieses enthaltender Expressionsvektor |
JPH08266281A (ja) | 1995-03-31 | 1996-10-15 | Kaiyo Bio Technol Kenkyusho:Kk | イガイ接着蛋白質遺伝子 |
WO1996041882A1 (en) | 1995-06-12 | 1996-12-27 | Proefstation Voor De Champignoncultuur | Hydrophobins from edible fungi, genes, nucleotide sequences and dna-fragments encoding for said hydrophobins, and expression thereof |
US6093222A (en) | 1996-04-04 | 2000-07-25 | Ck Witco Corporation | Diesel fuel antifoam composition |
EP1223219A3 (en) | 1997-10-31 | 2002-10-23 | Asahi Glass Company Ltd. | Inducible promoter and secretion signal for use in schizosaccharomyces pombe, expression vector containing them and their use |
US6572845B2 (en) | 1998-10-16 | 2003-06-03 | Burt D. Ensley | Recombinant hair treatment compositions |
AU778477B2 (en) | 1999-03-25 | 2004-12-09 | Valtion Teknillinen Tutkimuskeskus | Process for partitioning of proteins |
DE19942539A1 (de) | 1999-09-07 | 2001-03-08 | Cognis Deutschland Gmbh | Waschmittel |
DE19956802A1 (de) * | 1999-11-25 | 2001-06-13 | Cognis Deutschland Gmbh | Waschmitteltabletten |
GB0002663D0 (en) * | 2000-02-04 | 2000-03-29 | Biomade B V | Method of stabalizing a hydrophobin-containing solution and a method of coating a surface with a hydrophobin |
GB0002660D0 (en) | 2000-02-04 | 2000-03-29 | Biomade B V | Method of stabilizing a hydrophobin-containing solution and a method of coatinga surface with a hydrophobin |
FR2805278B1 (fr) | 2000-02-17 | 2006-09-29 | Chalen Papier Europ Service | Compositions utiles comme surfactants et leurs utilisations |
AU2001292574A1 (en) * | 2000-09-06 | 2002-03-22 | Zymogenetics Inc. | Human phermone polypeptide |
GB0030038D0 (en) | 2000-12-08 | 2001-01-24 | Univ Warwick | Yeast-based assay |
DE10061280A1 (de) | 2000-12-08 | 2002-06-13 | Novaprot Gmbh | Reinigungswirksame, grenzflächenaktive Kombination aus nachwachsenden Rohstoffen mit hoher Fettlösekraft |
EP1430090A4 (en) | 2001-08-31 | 2006-10-11 | Keratec Ltd | PREPARATION OF FILM, FILM, FIBER, FOAM AND ADHESIVE BIOPOLYMER MATERIALS FROM SOLUBLE S-SULFONATED KERATINE DERIVATIVES |
AT410669B (de) | 2001-10-11 | 2003-06-25 | Bcd Rohstoffe F Bauchemie Hand | Sprühgetrocknete dispersionen, verfahren zu ihrer herstellung und deren anwendung |
FR2833490B1 (fr) | 2001-12-14 | 2004-12-10 | Oreal | Utilisition cosmetique d'au moins une hydrophobine pour le traitement des matieres keratiniques et compositions mises en oeuvre |
AU2003216019A1 (en) | 2002-03-26 | 2003-10-08 | Magnus Qvist | Method for attaching two surfaces to each other using a bioadhesive polyphenolic protein and periodate ions. |
WO2004000880A1 (en) | 2002-06-21 | 2003-12-31 | Applied Nanosystems B.V. | Method of binding a compound to a surface |
DE10342794A1 (de) | 2003-09-16 | 2005-04-21 | Basf Ag | Sekretion von Proteinen aus Hefen |
CN1909979A (zh) | 2004-01-16 | 2007-02-07 | 应用超微系统股份有限公司 | 在低温下用疏水蛋白涂覆物体的方法 |
NZ551366A (en) | 2004-05-24 | 2009-01-31 | Basf Ag | Keratin-binding polypeptides |
US7241734B2 (en) * | 2004-08-18 | 2007-07-10 | E. I. Du Pont De Nemours And Company | Thermophilic hydrophobin proteins and applications for surface modification |
EP1848734A2 (de) | 2005-02-07 | 2007-10-31 | Basf Aktiengesellschaft | Verfahren zum beschichten von oberflächen mit hydrophobinen |
EP1848733B1 (de) * | 2005-02-07 | 2017-06-21 | Basf Se | Neue hydrophobinfusionsproteine, deren herstellung und verwendung |
US20090305930A1 (en) | 2005-03-30 | 2009-12-10 | Basf Aktiengesellschaft | Use of hydrophobin for hard surface soil-repellent treatment |
EP1866106B1 (de) | 2005-03-30 | 2010-05-26 | Basf Se | Verwendung von hydrophobinen zur oberflächenbehandlung von gehärteten mineralischen baustoffen, naturstein, kunststein und keramiken |
EP1866150B1 (de) | 2005-03-31 | 2016-10-19 | Basf Se | Metallische substrate mit polypeptiden als haftvermittler |
BRPI0608678A2 (pt) | 2005-04-01 | 2010-11-30 | Basf Ag | uso de uma hidrofobina, processo para perfurar um furo de sondagem para desenvolver depósitos subterráneos, lama de perfuração, e, processo para produzir uma lama de perfuração |
EP1868698A1 (de) | 2005-04-01 | 2007-12-26 | Basf Aktiengesellschaft | Verwendung von proteinen als demulgatoren |
JP2008534554A (ja) | 2005-04-01 | 2008-08-28 | ビーエーエスエフ ソシエタス・ヨーロピア | 乳化破壊剤としての蛋白質の使用 |
DE102005027039A1 (de) | 2005-06-10 | 2006-12-21 | Basf Ag | Hydrophobin als Beschichtungsmittel für expandierbare oder expandierte, thermoplastische Polymerpartikel |
DE102005027139A1 (de) | 2005-06-10 | 2006-12-28 | Basf Ag | Neue Cystein-verarmte Hydrophobinfusionsproteine, deren Herstellung und Verwendung |
DE102005029704A1 (de) | 2005-06-24 | 2007-01-11 | Basf Ag | Verwendung von Hydrophobin-Polypeptiden sowie Konjugaten aus Hydrophobin-Polypeptiden mit Wirk-oder Effektstoffen und ihre Herstellung sowie deren Einsatz in der Kosmetik |
DE102005033002A1 (de) | 2005-07-14 | 2007-01-18 | Basf Ag | Wässrige Monomeremulsionen enthaltend Hydrophobin |
CN101233220B (zh) | 2005-08-01 | 2012-11-21 | 巴斯福股份公司 | 表面活性非酶促蛋白质用于织物洗涤的应用 |
DE102005048720A1 (de) | 2005-10-12 | 2007-04-19 | Basf Ag | Verwendung von Proteinen als Antischaum-Komponente in Kraftstoffen |
-
2005
- 2005-06-10 DE DE102005027139A patent/DE102005027139A1/de not_active Withdrawn
-
2006
- 2006-06-09 CN CN2006800205772A patent/CN101193911B/zh not_active Expired - Fee Related
- 2006-06-09 JP JP2008515224A patent/JP5302676B2/ja not_active Expired - Fee Related
- 2006-06-09 CA CA002611234A patent/CA2611234A1/en not_active Abandoned
- 2006-06-09 WO PCT/EP2006/063066 patent/WO2006131564A2/de not_active Application Discontinuation
- 2006-06-09 US US11/921,905 patent/US7910699B2/en not_active Expired - Fee Related
- 2006-06-09 EP EP06763632A patent/EP1893639A2/de not_active Withdrawn
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101115768A (zh) * | 2005-02-07 | 2008-01-30 | 巴斯福股份公司 | 新的疏水蛋白融合蛋白产品、其制备及用途 |
Non-Patent Citations (2)
Title |
---|
CORVIS, Y.等.Preparing catalytic surfaces for sensing applications by lmmobilizing enzymes via hydrophobin layers.《Analytical Chemistry》.2005,第77卷(第6期),第1622-1630页. * |
SCHOLTMEIJER, K等.Surface modifications created by using engineered hydrophobins.《Applied and Environmental Microbiology》.2002,第68卷(第3期),第1367-1373页. * |
Also Published As
Publication number | Publication date |
---|---|
JP5302676B2 (ja) | 2013-10-02 |
CA2611234A1 (en) | 2006-12-14 |
CN101193911A (zh) | 2008-06-04 |
US7910699B2 (en) | 2011-03-22 |
JP2008545435A (ja) | 2008-12-18 |
DE102005027139A1 (de) | 2006-12-28 |
WO2006131564A2 (de) | 2006-12-14 |
WO2006131564A3 (de) | 2007-06-07 |
US20090136996A1 (en) | 2009-05-28 |
EP1893639A2 (de) | 2008-03-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101193911B (zh) | 半胱氨酸减少的疏水蛋白融合蛋白、其生产和用途 | |
US7892788B2 (en) | Hydrophobin fusion products, production and use thereof | |
CN101326271B (zh) | 蛋白质作为燃料中消泡剂成分的用途 | |
CN101208361B (zh) | 含有疏水蛋白的水单体乳剂 | |
CN101228249B (zh) | 含疏水蛋白的钻井液 | |
US8859106B2 (en) | Use of polypeptides in the form of adhesive agents | |
KR101250105B1 (ko) | 경화 광물 건축 재료, 천연석, 인조석 및 세라믹의 표면처리를 위한 하이드로포빈의 용도 | |
CN101175540B (zh) | 疏水蛋白作为相稳定剂的用途 | |
CN101189290A (zh) | 疏水蛋白用作可发或膨胀型热塑性聚合体颗粒的包被剂 | |
AU2006254154B2 (en) | Method for reducing the evaporation rate of liquids | |
CN101151148B (zh) | 多肽作为增附剂的用途 | |
CN101115768B (zh) | 新的疏水蛋白融合蛋白产品、其制备及用途 | |
CN101151359B (zh) | 疏水蛋白在硬表面防污处理中的用途 | |
CN101151106A (zh) | 疏水蛋白在处理固化矿物建材、天然石材、铸石及陶瓷表面中的用途 | |
CN101180431A (zh) | 纤维基质表面的涂布 | |
DE102005036341A1 (de) | Verwendung von Hydrophobinen zur schmutzabweisenden Behandlung von harten Oberflächen |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20130612 Termination date: 20150609 |
|
EXPY | Termination of patent right or utility model |