KR20090088856A - Enzyme systems for saccharification of plant cell wall polysaccharides
- Publication number
- KR20090088856A, KR1020097007532A, KR20097007532A
- Authority
- KR
- South Korea
- Prior art keywords
- ala
- gly
- ser
- leu
- asn
Links
- 102000004190 Enzymes Human genes 0.000 title abstract description 74
- 108090000790 Enzymes Proteins 0.000 title abstract description 74
- 210000002421 cell wall Anatomy 0.000 title abstract description 23
- 229920001282 polysaccharide Polymers 0.000 title description 14
- 239000005017 polysaccharide Substances 0.000 title description 12
- 150000004676 glycans Chemical class 0.000 title 1
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 claims abstract description 118
- 238000000034 method Methods 0.000 claims abstract description 38
- 108090000623 proteins and genes Proteins 0.000 claims description 82
- 102000004169 proteins and genes Human genes 0.000 claims description 53
- 108010059892 Cellulase Proteins 0.000 claims description 45
- 150000001875 compounds Chemical class 0.000 claims description 43
- 229940106157 cellulase Drugs 0.000 claims description 42
- 244000005700 microbiome Species 0.000 claims description 35
- 239000012978 lignocellulosic material Substances 0.000 claims description 33
- 150000001720 carbohydrates Chemical class 0.000 claims description 30
- 238000004519 manufacturing process Methods 0.000 claims description 26
- 235000000346 sugar Nutrition 0.000 claims description 21
- 241001170685 Saccharophagus degradans 2-40 Species 0.000 claims description 18
- 150000008163 sugars Chemical class 0.000 claims description 17
- 230000013595 glycosylation Effects 0.000 claims description 14
- 238000006206 glycosylation reaction Methods 0.000 claims description 14
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 11
- 241000588724 Escherichia coli Species 0.000 claims description 9
- 239000000872 buffer Substances 0.000 claims description 9
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 8
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 claims description 7
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 claims description 6
- 241000894006 Bacteria Species 0.000 claims description 6
- 229920001184 polypeptide Polymers 0.000 claims description 5
- 101100016206 Cellvibrio japonicus (strain Ueda107) celC gene Proteins 0.000 claims description 4
- 101100172078 Phanerodontia chrysosporium Eg5A gene Proteins 0.000 claims description 4
- 101100284150 Salipaludibacillus agaradhaerens cel5A gene Proteins 0.000 claims description 4
- 239000000693 micelle Substances 0.000 claims description 4
- 239000003973 paint Substances 0.000 claims description 4
- 239000006072 paste Substances 0.000 claims description 4
- 108010088751 Albumins Proteins 0.000 claims description 3
- 102000009027 Albumins Human genes 0.000 claims description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 3
- 229960002685 biotin Drugs 0.000 claims description 3
- 235000020958 biotin Nutrition 0.000 claims description 3
- 239000011616 biotin Substances 0.000 claims description 3
- 239000002738 chelating agent Substances 0.000 claims description 3
- 239000003599 detergent Substances 0.000 claims description 3
- 229910001410 inorganic ion Inorganic materials 0.000 claims description 3
- 229910021645 metal ion Inorganic materials 0.000 claims description 3
- 108091033319 polynucleotide Proteins 0.000 claims description 3
- 102000040430 polynucleotide Human genes 0.000 claims description 3
- 239000002157 polynucleotide Substances 0.000 claims description 3
- 230000001105 regulatory effect Effects 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 2
- 229920002678 cellulose Polymers 0.000 abstract description 65
- 239000001913 cellulose Substances 0.000 abstract description 65
- 230000003413 degradative effect Effects 0.000 abstract description 3
- 229940088598 enzyme Drugs 0.000 description 73
- 241001670248 Saccharophagus degradans Species 0.000 description 69
- 235000018102 proteins Nutrition 0.000 description 48
- 230000000694 effects Effects 0.000 description 46
- PCDUALPXEOKZPE-DXCABUDRSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O PCDUALPXEOKZPE-DXCABUDRSA-N 0.000 description 43
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 43
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 39
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 39
- 108010079364 N-glycylalanine Proteins 0.000 description 29
- 241000196324 Embryophyta Species 0.000 description 28
- 101710121765 Endo-1,4-beta-xylanase Proteins 0.000 description 28
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 28
- 108020004414 DNA Proteins 0.000 description 27
- 108010047495 alanylglycine Proteins 0.000 description 27
- 108010061238 threonyl-glycine Proteins 0.000 description 27
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 25
- 108010050848 glycylleucine Proteins 0.000 description 25
- 108010037850 glycylvaline Proteins 0.000 description 25
- 239000000758 substrate Substances 0.000 description 25
- 108010078144 glutaminyl-glycine Proteins 0.000 description 24
- 108010047857 aspartylglycine Proteins 0.000 description 23
- 241000880493 Leptailurus serval Species 0.000 description 21
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 21
- 108010005233 alanylglutamic acid Proteins 0.000 description 21
- 235000014633 carbohydrates Nutrition 0.000 description 21
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 21
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 20
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 20
- 238000004458 analytical method Methods 0.000 description 20
- 210000004027 cell Anatomy 0.000 description 20
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 19
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 19
- 238000003556 assay Methods 0.000 description 19
- 108010049041 glutamylalanine Proteins 0.000 description 18
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 17
- 108010038633 aspartylglutamate Proteins 0.000 description 17
- 108010034529 leucyl-lysine Proteins 0.000 description 17
- 108010044940 alanylglutamine Proteins 0.000 description 16
- 108010087924 alanylproline Proteins 0.000 description 16
- 108010077245 asparaginyl-proline Proteins 0.000 description 16
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 16
- 230000015556 catabolic process Effects 0.000 description 16
- 230000006870 function Effects 0.000 description 16
- 108010089804 glycyl-threonine Proteins 0.000 description 16
- 108010084389 glycyltryptophan Proteins 0.000 description 16
- 108010057821 leucylproline Proteins 0.000 description 16
- 108010031719 prolyl-serine Proteins 0.000 description 16
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 15
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 15
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 15
- 108010073969 valyllysine Proteins 0.000 description 15
- 150000001413 amino acids Chemical class 0.000 description 14
- 238000006243 chemical reaction Methods 0.000 description 14
- 238000006731 degradation reaction Methods 0.000 description 14
- 108010036413 histidylglycine Proteins 0.000 description 14
- 108010084185 Cellulases Proteins 0.000 description 13
- 102000005575 Cellulases Human genes 0.000 description 13
- 108010017391 lysylvaline Proteins 0.000 description 13
- 108010038745 tryptophylglycine Proteins 0.000 description 13
- 229920001221 xylan Polymers 0.000 description 13
- 150000004823 xylans Chemical class 0.000 description 13
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 12
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 12
- 108700026244 Open Reading Frames Proteins 0.000 description 12
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 12
- 239000000499 gel Substances 0.000 description 12
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 12
- 108010070643 prolylglutamic acid Proteins 0.000 description 12
- 108010080629 tryptophan-leucine Proteins 0.000 description 12
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 11
- 150000004804 polysaccharides Chemical class 0.000 description 11
- 108010048818 seryl-histidine Proteins 0.000 description 11
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 10
- 108010068265 aspartyltyrosine Proteins 0.000 description 10
- 108010081551 glycylphenylalanine Proteins 0.000 description 10
- 108010026333 seryl-proline Proteins 0.000 description 10
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 9
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 9
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 9
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 9
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 9
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 9
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 9
- 108010041407 alanylaspartic acid Proteins 0.000 description 9
- 108010093581 aspartyl-proline Proteins 0.000 description 9
- 238000010367 cloning Methods 0.000 description 9
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 9
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 9
- 239000002609 medium Substances 0.000 description 9
- 229920000642 polymer Polymers 0.000 description 9
- 230000008569 process Effects 0.000 description 9
- 108010015796 prolylisoleucine Proteins 0.000 description 9
- 108010053725 prolylvaline Proteins 0.000 description 9
- 239000006228 supernatant Substances 0.000 description 9
- 108010051110 tyrosyl-lysine Proteins 0.000 description 9
- 108010003137 tyrosyltyrosine Proteins 0.000 description 9
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 8
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 8
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 8
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 8
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 8
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 8
- 229920002488 Hemicellulose Polymers 0.000 description 8
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 8
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 8
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 8
- 108010066427 N-valyltryptophan Proteins 0.000 description 8
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 8
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 8
- 235000001014 amino acid Nutrition 0.000 description 8
- 108010013835 arginine glutamate Proteins 0.000 description 8
- 108010092854 aspartyllysine Proteins 0.000 description 8
- 239000004202 carbamide Substances 0.000 description 8
- 230000002255 enzymatic effect Effects 0.000 description 8
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 8
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 8
- 108010015792 glycyllysine Proteins 0.000 description 8
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 8
- 239000006166 lysate Substances 0.000 description 8
- 108010054155 lysyllysine Proteins 0.000 description 8
- 239000002773 nucleotide Substances 0.000 description 8
- 125000003729 nucleotide group Chemical group 0.000 description 8
- 108010012581 phenylalanylglutamate Proteins 0.000 description 8
- 108010051242 phenylalanylserine Proteins 0.000 description 8
- 239000000047 product Substances 0.000 description 8
- 239000013598 vector Substances 0.000 description 8
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 7
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 7
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 7
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 7
- 229920002134 Carboxymethyl cellulose Polymers 0.000 description 7
- 229920001503 Glucan Polymers 0.000 description 7
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 7
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 7
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 7
- 229920001410 Microfiber Polymers 0.000 description 7
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 7
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 7
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 7
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 7
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 7
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 7
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 7
- 230000027455 binding Effects 0.000 description 7
- 108010087823 glycyltyrosine Proteins 0.000 description 7
- 108010092114 histidylphenylalanine Proteins 0.000 description 7
- 239000007788 liquid Substances 0.000 description 7
- 108010003700 lysyl aspartic acid Proteins 0.000 description 7
- 239000003658 microfiber Substances 0.000 description 7
- 108010029020 prolylglycine Proteins 0.000 description 7
- DBTMGCOVALSLOR-DEVYUCJPSA-N (2s,3r,4s,5r,6r)-4-[(2s,3r,4s,5r,6r)-3,5-dihydroxy-6-(hydroxymethyl)-4-[(2s,3r,4s,5s,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxyoxan-2-yl]oxy-6-(hydroxymethyl)oxane-2,3,5-triol Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](CO)O[C@H](O)[C@@H]2O)O)O[C@H](CO)[C@H]1O DBTMGCOVALSLOR-DEVYUCJPSA-N 0.000 description 6
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 6
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 6
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 6
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 6
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 6
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 6
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 6
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 6
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 6
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 6
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 6
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 6
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 6
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 6
- PYFHPYDQHCEVIT-KBPBESRZSA-N Gly-Trp-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O PYFHPYDQHCEVIT-KBPBESRZSA-N 0.000 description 6
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 6
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 6
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 6
- 229920001543 Laminarin Polymers 0.000 description 6
- 239000005717 Laminarin Substances 0.000 description 6
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 6
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 6
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 6
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 6
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 6
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 6
- 108010059820 Polygalacturonase Proteins 0.000 description 6
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 6
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 6
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 6
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 6
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 6
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 6
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 6
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 6
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 6
- 239000007983 Tris buffer Substances 0.000 description 6
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 6
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 6
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 6
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 6
- 108010008355 arginyl-glutamine Proteins 0.000 description 6
- 108010060035 arginylproline Proteins 0.000 description 6
- 239000001768 carboxy methyl cellulose Substances 0.000 description 6
- 235000010948 carboxy methyl cellulose Nutrition 0.000 description 6
- 239000008112 carboxymethyl-cellulose Substances 0.000 description 6
- 230000001461 cytolytic effect Effects 0.000 description 6
- 230000000593 degrading effect Effects 0.000 description 6
- 108010093305 exopolygalacturonase Proteins 0.000 description 6
- 239000008103 glucose Substances 0.000 description 6
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 6
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 6
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 6
- 108010010147 glycylglutamine Proteins 0.000 description 6
- 108010025306 histidylleucine Proteins 0.000 description 6
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 239000011159 matrix material Substances 0.000 description 6
- 108010034507 methionyltryptophan Proteins 0.000 description 6
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 6
- 108010079317 prolyl-tyrosine Proteins 0.000 description 6
- 108010090894 prolylleucine Proteins 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- 239000000523 sample Substances 0.000 description 6
- 238000012163 sequencing technique Methods 0.000 description 6
- 235000002639 sodium chloride Nutrition 0.000 description 6
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 6
- 108010009962 valyltyrosine Proteins 0.000 description 6
- 238000001262 western blot Methods 0.000 description 6
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 5
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 5
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 5
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 5
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 5
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 5
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 5
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 5
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 5
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 5
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 5
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 5
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 5
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 5
- XHTUGJCAEYOZOR-UBHSHLNASA-N Asn-Ser-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XHTUGJCAEYOZOR-UBHSHLNASA-N 0.000 description 5
- UPAGTDJAORYMEC-VHWLVUOQSA-N Asn-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)N)N UPAGTDJAORYMEC-VHWLVUOQSA-N 0.000 description 5
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 5
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 5
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 5
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 5
- GUBGYTABKSRVRQ-CUHNMECISA-N D-Cellobiose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-CUHNMECISA-N 0.000 description 5
- SRBFZHDQGSBBOR-IOVATXLUSA-N D-xylopyranose Chemical group O[C@@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-IOVATXLUSA-N 0.000 description 5
- 241000588698 Erwinia Species 0.000 description 5
- 241000588722 Escherichia Species 0.000 description 5
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 5
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 5
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 5
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 5
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 5
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 5
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 5
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 5
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 5
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 5
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 5
- JKSMZVCGQWVTBW-STQMWFEESA-N Gly-Trp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O JKSMZVCGQWVTBW-STQMWFEESA-N 0.000 description 5
- 241000209219 Hordeum Species 0.000 description 5
- 235000007340 Hordeum vulgare Nutrition 0.000 description 5
- 102000004157 Hydrolases Human genes 0.000 description 5
- 108090000604 Hydrolases Proteins 0.000 description 5
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 5
- 108010065920 Insulin Lispro Proteins 0.000 description 5
- 241000588748 Klebsiella Species 0.000 description 5
- 241000588749 Klebsiella oxytoca Species 0.000 description 5
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 5
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 5
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 5
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 5
- 239000007990 PIPES buffer Substances 0.000 description 5
- NEHSHYOUIWBYSA-DCPHZVHLSA-N Phe-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=CC=C3)N NEHSHYOUIWBYSA-DCPHZVHLSA-N 0.000 description 5
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 5
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 5
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 5
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 5
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 5
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 5
- BDWDMRSGCXEDMR-WFBYXXMGSA-N Trp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BDWDMRSGCXEDMR-WFBYXXMGSA-N 0.000 description 5
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 5
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 5
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 5
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 5
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 5
- 241000589634 Xanthomonas Species 0.000 description 5
- 241000588901 Zymomonas Species 0.000 description 5
- 108010045649 agarase Proteins 0.000 description 5
- 108010070944 alanylhistidine Proteins 0.000 description 5
- 235000010443 alginic acid Nutrition 0.000 description 5
- 229920000615 alginic acid Polymers 0.000 description 5
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 5
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 5
- 108010016616 cysteinylglycine Proteins 0.000 description 5
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 5
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 5
- 108010077515 glycylproline Proteins 0.000 description 5
- 108010040030 histidinoalanine Proteins 0.000 description 5
- 230000001965 increasing effect Effects 0.000 description 5
- 108010078274 isoleucylvaline Proteins 0.000 description 5
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 5
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 5
- 229920005610 lignin Polymers 0.000 description 5
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 5
- 108010005942 methionylglycine Proteins 0.000 description 5
- 235000010987 pectin Nutrition 0.000 description 5
- 239000001814 pectin Substances 0.000 description 5
- 229920001277 pectin Polymers 0.000 description 5
- 239000008188 pellet Substances 0.000 description 5
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- LWFUFLREGJMOIZ-UHFFFAOYSA-N 3,5-dinitrosalicylic acid Chemical compound OC(=O)C1=CC([N+]([O-])=O)=CC([N+]([O-])=O)=C1O LWFUFLREGJMOIZ-UHFFFAOYSA-N 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 4
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 4
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 4
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 4
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 4
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 4
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 4
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 4
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 4
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 4
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 4
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 4
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 4
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 4
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 4
- 108010076441 Ala-His-His Proteins 0.000 description 4
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 4
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 4
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 4
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 4
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 4
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 4
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 4
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 4
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 4
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 4
- FANGHKQYFPYDNB-UBHSHLNASA-N Asn-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N FANGHKQYFPYDNB-UBHSHLNASA-N 0.000 description 4
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 4
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 description 4
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 4
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 4
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 4
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 4
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 4
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 4
- WQSXAPPYLGNMQL-IHRRRGAJSA-N Asp-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N WQSXAPPYLGNMQL-IHRRRGAJSA-N 0.000 description 4
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 4
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 4
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 4
- 108010008885 Cellulose 1,4-beta-Cellobiosidase Proteins 0.000 description 4
- 108010090461 DFG peptide Proteins 0.000 description 4
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 4
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 4
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 4
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 4
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 4
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 4
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 4
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 4
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 4
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 4
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 4
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 4
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 4
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 4
- JPWIMMUNWUKOAD-STQMWFEESA-N Gly-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN JPWIMMUNWUKOAD-STQMWFEESA-N 0.000 description 4
- JLJLBWDKDRYOPA-RYUDHWBXSA-N Gly-Gln-Tyr Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JLJLBWDKDRYOPA-RYUDHWBXSA-N 0.000 description 4
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 4
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 4
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 4
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 4
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 4
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 4
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 4
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 4
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 4
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 4
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 4
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 4
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 4
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 4
- REJKOQYVFDEZHA-SLBDDTMCSA-N Ile-Asp-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N REJKOQYVFDEZHA-SLBDDTMCSA-N 0.000 description 4
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 4
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 4
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 4
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 4
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 4
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 4
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 4
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 4
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 4
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 4
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 4
- ZASPELYMPSACER-HOCLYGCPSA-N Lys-Gly-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZASPELYMPSACER-HOCLYGCPSA-N 0.000 description 4
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 4
- DGNZGCQSVGGYJS-BQBZGAKWSA-N Met-Gly-Asp Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O DGNZGCQSVGGYJS-BQBZGAKWSA-N 0.000 description 4
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 4
- VWFHWJGVLVZVIS-QXEWZRGKSA-N Met-Val-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O VWFHWJGVLVZVIS-QXEWZRGKSA-N 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 4
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 4
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 4
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 4
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 4
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 4
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 4
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 4
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 4
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 4
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 4
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 4
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 4
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 4
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 4
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 4
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 4
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 4
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 4
- IJKNKFJZOJCKRR-GBALPHGKSA-N Thr-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 IJKNKFJZOJCKRR-GBALPHGKSA-N 0.000 description 4
- RNFZZCMCRDFNAE-WFBYXXMGSA-N Trp-Asn-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O RNFZZCMCRDFNAE-WFBYXXMGSA-N 0.000 description 4
- MHNHRNHJMXAVHZ-AAEUAGOBSA-N Trp-Asn-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N MHNHRNHJMXAVHZ-AAEUAGOBSA-N 0.000 description 4
- HQJOVVWAPQPYDS-ZFWWWQNUSA-N Trp-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQJOVVWAPQPYDS-ZFWWWQNUSA-N 0.000 description 4
- NXQAOORHSYJRGH-AAEUAGOBSA-N Trp-Gly-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 NXQAOORHSYJRGH-AAEUAGOBSA-N 0.000 description 4
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 4
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 4
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 4
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 4
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 4
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 4
- JVYIGCARISMLMV-HOCLYGCPSA-N Val-Gly-Trp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JVYIGCARISMLMV-HOCLYGCPSA-N 0.000 description 4
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 4
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 4
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 4
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 4
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 4
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 4
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 4
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 4
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 4
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 4
- 108010070783 alanyltyrosine Proteins 0.000 description 4
- 235000013405 beer Nutrition 0.000 description 4
- 108010047754 beta-Glucosidase Proteins 0.000 description 4
- 102000006995 beta-Glucosidase Human genes 0.000 description 4
- 210000004899 c-terminal region Anatomy 0.000 description 4
- 229910052799 carbon Inorganic materials 0.000 description 4
- 239000012228 culture supernatant Substances 0.000 description 4
- 230000029087 digestion Effects 0.000 description 4
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 4
- 235000013305 food Nutrition 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 4
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 4
- 108010028295 histidylhistidine Proteins 0.000 description 4
- 108010085325 histidylproline Proteins 0.000 description 4
- 230000003993 interaction Effects 0.000 description 4
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 4
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 238000004949 mass spectrometry Methods 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 108010024607 phenylalanylalanine Proteins 0.000 description 4
- 239000013612 plasmid Substances 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 4
- 108010077112 prolyl-proline Proteins 0.000 description 4
- 238000011084 recovery Methods 0.000 description 4
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 108010084932 tryptophyl-proline Proteins 0.000 description 4
- 108010044292 tryptophyltyrosine Proteins 0.000 description 4
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 4
- 108010020532 tyrosyl-proline Proteins 0.000 description 4
- FYGDTMLNYKFZSV-URKRLVJHSA-N (2s,3r,4s,5s,6r)-2-[(2r,4r,5r,6s)-4,5-dihydroxy-2-(hydroxymethyl)-6-[(2r,4r,5r,6s)-4,5,6-trihydroxy-2-(hydroxymethyl)oxan-3-yl]oxyoxan-3-yl]oxy-6-(hydroxymethyl)oxane-3,4,5-triol Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1OC1[C@@H](CO)O[C@@H](OC2[C@H](O[C@H](O)[C@H](O)[C@H]2O)CO)[C@H](O)[C@H]1O FYGDTMLNYKFZSV-URKRLVJHSA-N 0.000 description 3
- IHPYMWDTONKSCO-UHFFFAOYSA-N 2,2'-piperazine-1,4-diylbisethanesulfonic acid Chemical compound OS(=O)(=O)CCN1CCN(CCS(O)(=O)=O)CC1 IHPYMWDTONKSCO-UHFFFAOYSA-N 0.000 description 3
- DQVAZKGVGKHQDS-UHFFFAOYSA-N 2-[[1-[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]pyrrolidine-2-carbonyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(O)=O DQVAZKGVGKHQDS-UHFFFAOYSA-N 0.000 description 3
- FHVDTGUDJYJELY-UHFFFAOYSA-N 6-{[2-carboxy-4,5-dihydroxy-6-(phosphanyloxy)oxan-3-yl]oxy}-4,5-dihydroxy-3-phosphanyloxane-2-carboxylic acid Chemical compound O1C(C(O)=O)C(P)C(O)C(O)C1OC1C(C(O)=O)OC(OP)C(O)C1O FHVDTGUDJYJELY-UHFFFAOYSA-N 0.000 description 3
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- 229920001817 Agar Polymers 0.000 description 3
- 229920000936 Agarose Polymers 0.000 description 3
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 3
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 3
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 3
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 3
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 3
- MBWYUTNBYSSUIQ-HERUPUMHSA-N Ala-Asn-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N MBWYUTNBYSSUIQ-HERUPUMHSA-N 0.000 description 3
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 3
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 3
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 3
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 3
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 3
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 3
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 3
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 3
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 3
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 3
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 3
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 3
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 3
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 3
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 3
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 3
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 3
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 3
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 3
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 3
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 3
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 3
- UCDOXFBTMLKASE-HERUPUMHSA-N Ala-Ser-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N UCDOXFBTMLKASE-HERUPUMHSA-N 0.000 description 3
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 3
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 3
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 3
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 3
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 3
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 3
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 3
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 3
- FBLMOFHNVQBKRR-IHRRRGAJSA-N Arg-Asp-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FBLMOFHNVQBKRR-IHRRRGAJSA-N 0.000 description 3
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 3
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 3
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 3
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 3
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 3
- ZUVMUOOHJYNJPP-XIRDDKMYSA-N Arg-Trp-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZUVMUOOHJYNJPP-XIRDDKMYSA-N 0.000 description 3
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 3
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 3
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 3
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 3
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 3
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 3
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 3
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 3
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 3
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 3
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 3
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 3
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 3
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 3
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 3
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 3
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 3
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 3
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 3
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 3
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 3
- JPPLRQVZMZFOSX-UWJYBYFXSA-N Asn-Tyr-Ala Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 JPPLRQVZMZFOSX-UWJYBYFXSA-N 0.000 description 3
- SKQTXVZTCGSRJS-SRVKXCTJSA-N Asn-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O SKQTXVZTCGSRJS-SRVKXCTJSA-N 0.000 description 3
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 3
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 3
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 3
- QHAJMRDEWNAIBQ-FXQIFTODSA-N Asp-Arg-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O QHAJMRDEWNAIBQ-FXQIFTODSA-N 0.000 description 3
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 3
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 3
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 3
- ZSJFGGSPCCHMNE-LAEOZQHASA-N Asp-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N ZSJFGGSPCCHMNE-LAEOZQHASA-N 0.000 description 3
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 3
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 3
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 3
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 3
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 3
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 3
- HSGOFISJLFDMBJ-CIUDSAMLSA-N Asp-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N HSGOFISJLFDMBJ-CIUDSAMLSA-N 0.000 description 3
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 3
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 3
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 3
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 3
- 229920002498 Beta-glucan Polymers 0.000 description 3
- 229920003043 Cellulose fiber Polymers 0.000 description 3
- 229920002101 Chitin Polymers 0.000 description 3
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 3
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 3
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 3
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 3
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 3
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 3
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 3
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 3
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 3
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 3
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 3
- CMBXOSFZCFGDLE-IHRRRGAJSA-N Gln-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O CMBXOSFZCFGDLE-IHRRRGAJSA-N 0.000 description 3
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 3
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 3
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 3
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 3
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 3
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 3
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 3
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 3
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 3
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 3
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 3
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 3
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 3
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 3
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 3
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 3
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 3
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 3
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 3
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 3
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 3
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 3
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 3
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 3
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 3
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 3
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 3
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 3
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 3
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 3
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 3
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 3
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 3
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 3
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 3
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 3
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 3
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 3
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 3
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 3
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 3
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 3
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 3
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 3
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 3
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 3
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 3
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 3
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 3
- NIOPEYHPOBWLQO-KBPBESRZSA-N Gly-Trp-Glu Chemical compound NCC(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOPEYHPOBWLQO-KBPBESRZSA-N 0.000 description 3
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 3
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 3
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 3
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 3
- UZZXGLOJRZKYEL-DJFWLOJKSA-N His-Asn-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UZZXGLOJRZKYEL-DJFWLOJKSA-N 0.000 description 3
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 3
- VTMLJMNQHKBPON-QWRGUYRKSA-N His-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 VTMLJMNQHKBPON-QWRGUYRKSA-N 0.000 description 3
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 3
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 3
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 3
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 3
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 3
- QYOGJYIRKACXEP-SLBDDTMCSA-N Ile-Asn-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N QYOGJYIRKACXEP-SLBDDTMCSA-N 0.000 description 3
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 3
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 3
- OVPYIUNCVSOVNF-KQXIARHKSA-N Ile-Gln-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N OVPYIUNCVSOVNF-KQXIARHKSA-N 0.000 description 3
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 3
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 3
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 3
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 3
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 3
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 3
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 3
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 3
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 3
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 3
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 3
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 3
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 3
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 3
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 3
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 3
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 3
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 3
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 3
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 3
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 3
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 3
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 3
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 3
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 3
- TVEOVCYCYGKVPP-HSCHXYMDSA-N Leu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N TVEOVCYCYGKVPP-HSCHXYMDSA-N 0.000 description 3
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 3
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 3
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 3
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 3
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 3
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 3
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 3
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 3
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 3
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 3
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 3
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 3
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 3
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 3
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 3
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 3
- PGLGNCVOWIORQE-SRVKXCTJSA-N Lys-His-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O PGLGNCVOWIORQE-SRVKXCTJSA-N 0.000 description 3
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 3
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 3
- JQSIGLHQNSZZRL-KKUMJFAQSA-N Lys-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N JQSIGLHQNSZZRL-KKUMJFAQSA-N 0.000 description 3
- SPNKGZFASINBMR-IHRRRGAJSA-N Lys-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N SPNKGZFASINBMR-IHRRRGAJSA-N 0.000 description 3
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 3
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 3
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 3
- OHMKUHXCDSCOMT-QXEWZRGKSA-N Met-Asn-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHMKUHXCDSCOMT-QXEWZRGKSA-N 0.000 description 3
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 3
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 3
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- 108010033276 Peptide Fragments Proteins 0.000 description 3
- 102000007079 Peptide Fragments Human genes 0.000 description 3
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 3
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 3
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 3
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 3
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 3
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 3
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 3
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 3
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 3
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 3
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 3
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 3
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 3
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 3
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 3
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 3
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 3
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 3
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 3
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 3
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 3
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 3
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 3
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 3
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 3
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 3
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 3
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 3
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 3
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 3
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 3
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 3
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 3
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 3
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 3
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 3
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 3
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 3
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 3
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 3
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 3
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 3
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 3
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 3
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 3
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 3
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 3
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 3
- LHHDBONOFZDWMW-AAEUAGOBSA-N Trp-Asp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LHHDBONOFZDWMW-AAEUAGOBSA-N 0.000 description 3
- GWQUSADRQCTMHN-NWLDYVSISA-N Trp-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GWQUSADRQCTMHN-NWLDYVSISA-N 0.000 description 3
- FEZASNVQLJQBHW-CABZTGNLSA-N Trp-Gly-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O)=CNC2=C1 FEZASNVQLJQBHW-CABZTGNLSA-N 0.000 description 3
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 3
- OGXQLUCMJZSJPW-LYSGOOTNSA-N Trp-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O OGXQLUCMJZSJPW-LYSGOOTNSA-N 0.000 description 3
- YRXXUYPYPHRJPB-RXVVDRJESA-N Trp-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N YRXXUYPYPHRJPB-RXVVDRJESA-N 0.000 description 3
- UPOGHWJJZAZNSW-XIRDDKMYSA-N Trp-His-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O UPOGHWJJZAZNSW-XIRDDKMYSA-N 0.000 description 3
- ACGIVBXINJFALS-HKUYNNGSSA-N Trp-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N ACGIVBXINJFALS-HKUYNNGSSA-N 0.000 description 3
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 3
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 3
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 3
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 3
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 3
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 3
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 3
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 3
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 3
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 3
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 3
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 3
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 3
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 3
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 3
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 3
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 3
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 3
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 3
- MHAHQDBEIDPFQS-NHCYSSNCSA-N Val-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C MHAHQDBEIDPFQS-NHCYSSNCSA-N 0.000 description 3
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 3
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 3
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 3
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 3
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 3
- SDHZOOIGIUEPDY-JYJNAYRXSA-N Val-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 SDHZOOIGIUEPDY-JYJNAYRXSA-N 0.000 description 3
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 3
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 3
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 3
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 238000005903 acid hydrolysis reaction Methods 0.000 description 3
- 239000008272 agar Substances 0.000 description 3
- 229940072056 alginate Drugs 0.000 description 3
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 3
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 3
- 108010062796 arginyllysine Proteins 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 108010089934 carbohydrase Proteins 0.000 description 3
- 239000003054 catalyst Substances 0.000 description 3
- 230000003197 catalytic effect Effects 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 108010069495 cysteinyltyrosine Proteins 0.000 description 3
- 238000000354 decomposition reaction Methods 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 238000001952 enzyme assay Methods 0.000 description 3
- 239000004744 fabric Substances 0.000 description 3
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 3
- 238000011331 genomic analysis Methods 0.000 description 3
- 125000003147 glycosyl group Chemical group 0.000 description 3
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 3
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 3
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 3
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 3
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 3
- 239000001257 hydrogen Substances 0.000 description 3
- 229910052739 hydrogen Inorganic materials 0.000 description 3
- 230000007062 hydrolysis Effects 0.000 description 3
- 238000006460 hydrolysis reaction Methods 0.000 description 3
- 230000006698 induction Effects 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 150000002500 ions Chemical class 0.000 description 3
- 108010053037 kyotorphin Proteins 0.000 description 3
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 3
- 108010000761 leucylarginine Proteins 0.000 description 3
- 108010056582 methionylglutamic acid Proteins 0.000 description 3
- 230000000813 microbial effect Effects 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 108010004131 poly(beta-D-mannuronate) lyase Proteins 0.000 description 3
- 238000000575 proteomic method Methods 0.000 description 3
- 239000012557 regeneration buffer Substances 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 238000004007 reversed phase HPLC Methods 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 3
- 108010071207 serylmethionine Proteins 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 230000005945 translocation Effects 0.000 description 3
- 108700004896 tripeptide FEG Proteins 0.000 description 3
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 3
- 108010029384 tryptophyl-histidine Proteins 0.000 description 3
- 108010027345 wheylin-1 peptide Proteins 0.000 description 3
- 235000014101 wine Nutrition 0.000 description 3
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 2
- BTJIUGUIPKRLHP-UHFFFAOYSA-N 4-nitrophenol Chemical compound OC1=CC=C([N+]([O-])=O)C=C1 BTJIUGUIPKRLHP-UHFFFAOYSA-N 0.000 description 2
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 2
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 2
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 2
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 2
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 2
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 2
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 2
- HGRBNYQIMKTUNT-XVYDVKMFSA-N Ala-Asn-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HGRBNYQIMKTUNT-XVYDVKMFSA-N 0.000 description 2
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 2
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 2
- NJIFPLAJSVUQOZ-JBDRJPRFSA-N Ala-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C)N NJIFPLAJSVUQOZ-JBDRJPRFSA-N 0.000 description 2
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 2
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 2
- CRWFEKLFPVRPBV-CIUDSAMLSA-N Ala-Gln-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CRWFEKLFPVRPBV-CIUDSAMLSA-N 0.000 description 2
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 2
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 2
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 2
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 2
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 2
- SIGTYDNEPYEXGK-ZANVPECISA-N Ala-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 SIGTYDNEPYEXGK-ZANVPECISA-N 0.000 description 2
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- IVKWMMGFLAMMKJ-XVYDVKMFSA-N Ala-His-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IVKWMMGFLAMMKJ-XVYDVKMFSA-N 0.000 description 2
- FDAZDMAFZYTHGS-XVYDVKMFSA-N Ala-His-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FDAZDMAFZYTHGS-XVYDVKMFSA-N 0.000 description 2
- ATAKEVCGTRZKLI-UWJYBYFXSA-N Ala-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 ATAKEVCGTRZKLI-UWJYBYFXSA-N 0.000 description 2
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 2
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 2
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 2
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 2
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 2
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 2
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 2
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 2
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 2
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 2
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 2
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 2
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 2
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 2
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 2
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 2
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 2
- OMCKWYSDUQBYCN-FXQIFTODSA-N Ala-Ser-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O OMCKWYSDUQBYCN-FXQIFTODSA-N 0.000 description 2
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 2
- LTTLSZVJTDSACD-OWLDWWDNSA-N Ala-Thr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LTTLSZVJTDSACD-OWLDWWDNSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 2
- VYMJAWXRWHJIMS-LKTVYLICSA-N Ala-Tyr-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VYMJAWXRWHJIMS-LKTVYLICSA-N 0.000 description 2
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 2
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 2
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- XKHLBBQNPSOGPI-GUBZILKMSA-N Ala-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N XKHLBBQNPSOGPI-GUBZILKMSA-N 0.000 description 2
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 2
- SSQHYGLFYWZWDV-UVBJJODRSA-N Ala-Val-Trp Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O SSQHYGLFYWZWDV-UVBJJODRSA-N 0.000 description 2
- 108700028369 Alleles Proteins 0.000 description 2
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 2
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 2
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 2
- OLDOLPWZEMHNIA-PJODQICGSA-N Arg-Ala-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OLDOLPWZEMHNIA-PJODQICGSA-N 0.000 description 2
- KGSJCPBERYUXCN-BPNCWPANSA-N Arg-Ala-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KGSJCPBERYUXCN-BPNCWPANSA-N 0.000 description 2
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 2
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 2
- OTUQSEPIIVBYEM-IHRRRGAJSA-N Arg-Asn-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OTUQSEPIIVBYEM-IHRRRGAJSA-N 0.000 description 2
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 2
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 2
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 2
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 2
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 2
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 2
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 2
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 2
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 2
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 2
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 2
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 2
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- NMTANZXPDAHUKU-ULQDDVLXSA-N Arg-Tyr-Lys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 NMTANZXPDAHUKU-ULQDDVLXSA-N 0.000 description 2
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 2
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 2
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 2
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 2
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 2
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 2
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 2
- QQEWINYJRFBLNN-DLOVCJGASA-N Asn-Ala-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QQEWINYJRFBLNN-DLOVCJGASA-N 0.000 description 2
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 2
- AKEBUSZTMQLNIX-UWJYBYFXSA-N Asn-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N AKEBUSZTMQLNIX-UWJYBYFXSA-N 0.000 description 2
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 2
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 2
- RCENDENBBJFJHZ-ACZMJKKPSA-N Asn-Asn-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCENDENBBJFJHZ-ACZMJKKPSA-N 0.000 description 2
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 2
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 2
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 2
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 2
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 2
- PAXHINASXXXILC-SRVKXCTJSA-N Asn-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)O PAXHINASXXXILC-SRVKXCTJSA-N 0.000 description 2
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 2
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 2
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 2
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 2
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 2
- GYOHQKJEQQJBOY-QEJZJMRPSA-N Asn-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N GYOHQKJEQQJBOY-QEJZJMRPSA-N 0.000 description 2
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 description 2
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 2
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 2
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 2
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 2
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 2
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 2
- YGHCVNQOZZMHRZ-DJFWLOJKSA-N Asn-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N YGHCVNQOZZMHRZ-DJFWLOJKSA-N 0.000 description 2
- AITGTTNYKAWKDR-CIUDSAMLSA-N Asn-His-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O AITGTTNYKAWKDR-CIUDSAMLSA-N 0.000 description 2
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 2
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 2
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 2
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 2
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 2
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 2
- UYRPHDGXHKBZHJ-CIUDSAMLSA-N Asn-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N UYRPHDGXHKBZHJ-CIUDSAMLSA-N 0.000 description 2
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 2
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 2
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 2
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 2
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 2
- KYQJHBWHRASMKG-ZLUOBGJFSA-N Asn-Ser-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O KYQJHBWHRASMKG-ZLUOBGJFSA-N 0.000 description 2
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 2
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 2
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 2
- IPPFAOCLQSGHJV-WFBYXXMGSA-N Asn-Trp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O IPPFAOCLQSGHJV-WFBYXXMGSA-N 0.000 description 2
- BIGRHVNFFJTHEB-UBHSHLNASA-N Asn-Trp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O BIGRHVNFFJTHEB-UBHSHLNASA-N 0.000 description 2
- TZQWZQSMHDVLQL-QEJZJMRPSA-N Asn-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N TZQWZQSMHDVLQL-QEJZJMRPSA-N 0.000 description 2
- RDLYUKRPEJERMM-XIRDDKMYSA-N Asn-Trp-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O RDLYUKRPEJERMM-XIRDDKMYSA-N 0.000 description 2
- ATHZHGQSAIJHQU-XIRDDKMYSA-N Asn-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ATHZHGQSAIJHQU-XIRDDKMYSA-N 0.000 description 2
- MLJZMGIXXMTEPO-UBHSHLNASA-N Asn-Trp-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O MLJZMGIXXMTEPO-UBHSHLNASA-N 0.000 description 2
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 2
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 2
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 2
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 2
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 2
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 2
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 2
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 2
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 2
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 2
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 2
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 2
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 2
- ZRAOLTNMSCSCLN-ZLUOBGJFSA-N Asp-Cys-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)O ZRAOLTNMSCSCLN-ZLUOBGJFSA-N 0.000 description 2
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 2
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 2
- BIVYLQMZPHDUIH-WHFBIAKZSA-N Asp-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)O BIVYLQMZPHDUIH-WHFBIAKZSA-N 0.000 description 2
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 2
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 2
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 2
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 2
- VWWAFGHMPWBKEP-GMOBBJLQSA-N Asp-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)O)N VWWAFGHMPWBKEP-GMOBBJLQSA-N 0.000 description 2
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 2
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 2
- NONWUQAWAANERO-BZSNNMDCSA-N Asp-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 NONWUQAWAANERO-BZSNNMDCSA-N 0.000 description 2
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 2
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 2
- QTIZKMMLNUMHHU-DCAQKATOSA-N Asp-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QTIZKMMLNUMHHU-DCAQKATOSA-N 0.000 description 2
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 2
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 2
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 2
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 2
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 2
- KNOGLZBISUBTFW-QRTARXTBSA-N Asp-Trp-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O KNOGLZBISUBTFW-QRTARXTBSA-N 0.000 description 2
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 2
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 2
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 2
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 2
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 2
- 101710104295 Beta-1,4-xylanase Proteins 0.000 description 2
- 102100032487 Beta-mannosidase Human genes 0.000 description 2
- 108010022172 Chitinases Proteins 0.000 description 2
- 102000012286 Chitinases Human genes 0.000 description 2
- KLLFLHBKSJAUMZ-ACZMJKKPSA-N Cys-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N KLLFLHBKSJAUMZ-ACZMJKKPSA-N 0.000 description 2
- NDUSUIGBMZCOIL-ZKWXMUAHSA-N Cys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N NDUSUIGBMZCOIL-ZKWXMUAHSA-N 0.000 description 2
- BIVLWXQGXJLGKG-BIIVOSGPSA-N Cys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)C(=O)O BIVLWXQGXJLGKG-BIIVOSGPSA-N 0.000 description 2
- UXUSHQYYQCZWET-WDSKDSINSA-N Cys-Glu-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O UXUSHQYYQCZWET-WDSKDSINSA-N 0.000 description 2
- DZSICRGTVPDCRN-YUMQZZPRSA-N Cys-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N DZSICRGTVPDCRN-YUMQZZPRSA-N 0.000 description 2
- ZMWOJVAXTOUHAP-ZKWXMUAHSA-N Cys-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N ZMWOJVAXTOUHAP-ZKWXMUAHSA-N 0.000 description 2
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 2
- CWHKESLHINPNBX-XIRDDKMYSA-N Cys-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CS)CCCCN)C(O)=O)=CNC2=C1 CWHKESLHINPNBX-XIRDDKMYSA-N 0.000 description 2
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 2
- YWEHYKGJWHPGPY-XGEHTFHBSA-N Cys-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N)O YWEHYKGJWHPGPY-XGEHTFHBSA-N 0.000 description 2
- VIOQRFNAZDMVLO-NRPADANISA-N Cys-Val-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIOQRFNAZDMVLO-NRPADANISA-N 0.000 description 2
- 101710112457 Exoglucanase Proteins 0.000 description 2
- 101710098246 Exoglucanase 2 Proteins 0.000 description 2
- 229920000926 Galactomannan Polymers 0.000 description 2
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 2
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 2
- DHNWZLGBTPUTQQ-QEJZJMRPSA-N Gln-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N DHNWZLGBTPUTQQ-QEJZJMRPSA-N 0.000 description 2
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 2
- NPTGGVQJYRSMCM-GLLZPBPUSA-N Gln-Gln-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPTGGVQJYRSMCM-GLLZPBPUSA-N 0.000 description 2
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 2
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 2
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 2
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 2
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 2
- NPMFDZGLKBNFOO-SRVKXCTJSA-N Gln-Pro-His Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NPMFDZGLKBNFOO-SRVKXCTJSA-N 0.000 description 2
- MFHVAWMMKZBSRQ-ACZMJKKPSA-N Gln-Ser-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N MFHVAWMMKZBSRQ-ACZMJKKPSA-N 0.000 description 2
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 2
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 2
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 2
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 2
- SJMJMEWQMBJYPR-DZKIICNBSA-N Gln-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N SJMJMEWQMBJYPR-DZKIICNBSA-N 0.000 description 2
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 2
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 2
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 2
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 2
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 2
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 2
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 2
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 2
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 2
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 2
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 2
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 2
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 2
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 2
- QLPYYTDOUQNJGQ-AVGNSLFASA-N Glu-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N QLPYYTDOUQNJGQ-AVGNSLFASA-N 0.000 description 2
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 2
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- JJSVALISDCNFCU-SZMVWBNQSA-N Glu-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JJSVALISDCNFCU-SZMVWBNQSA-N 0.000 description 2
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 2
- OFIHURVSQXAZIR-SZMVWBNQSA-N Glu-Lys-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OFIHURVSQXAZIR-SZMVWBNQSA-N 0.000 description 2
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 2
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 2
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 2
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 2
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 2
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 2
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 2
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 2
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 2
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 2
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 2
- GWKBAXRZPLSWJS-QEJZJMRPSA-N Glu-Trp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GWKBAXRZPLSWJS-QEJZJMRPSA-N 0.000 description 2
- HGJREIGJLUQBTJ-SZMVWBNQSA-N Glu-Trp-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O HGJREIGJLUQBTJ-SZMVWBNQSA-N 0.000 description 2
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 2
- QGAJQIGFFIQJJK-IHRRRGAJSA-N Glu-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QGAJQIGFFIQJJK-IHRRRGAJSA-N 0.000 description 2
- KXRORHJIRAOQPG-SOUVJXGZSA-N Glu-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KXRORHJIRAOQPG-SOUVJXGZSA-N 0.000 description 2
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 2
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 2
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 2
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 2
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 2
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 2
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 2
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 2
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 2
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 2
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 2
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 2
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 2
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 2
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 2
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 2
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 2
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 2
- XLFHCWHXKSFVIB-BQBZGAKWSA-N Gly-Gln-Gln Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLFHCWHXKSFVIB-BQBZGAKWSA-N 0.000 description 2
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 2
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 2
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 2
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 2
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 2
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 2
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 2
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 2
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 2
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 2
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 2
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 2
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 2
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 2
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 2
- OMOZPGCHVWOXHN-BQBZGAKWSA-N Gly-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)CN OMOZPGCHVWOXHN-BQBZGAKWSA-N 0.000 description 2
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 2
- IMRNSEPSPFQNHF-STQMWFEESA-N Gly-Ser-Trp Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O IMRNSEPSPFQNHF-STQMWFEESA-N 0.000 description 2
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 2
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 2
- ONSARSFSJHTMFJ-STQMWFEESA-N Gly-Trp-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ONSARSFSJHTMFJ-STQMWFEESA-N 0.000 description 2
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 2
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 2
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 2
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 2
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 2
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 2
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 2
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 2
- 102000005744 Glycoside Hydrolases Human genes 0.000 description 2
- 108010031186 Glycoside Hydrolases Proteins 0.000 description 2
- MJNWEIMBXKKCSF-XVYDVKMFSA-N His-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N MJNWEIMBXKKCSF-XVYDVKMFSA-N 0.000 description 2
- CHZRWFUGWRTUOD-IUCAKERBSA-N His-Gly-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N CHZRWFUGWRTUOD-IUCAKERBSA-N 0.000 description 2
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 2
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 2
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 2
- WHKLDLQHSYAVGU-ACRUOGEOSA-N His-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WHKLDLQHSYAVGU-ACRUOGEOSA-N 0.000 description 2
- MKWFGXSFLYNTKC-XIRDDKMYSA-N His-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N MKWFGXSFLYNTKC-XIRDDKMYSA-N 0.000 description 2
- LNVILFYCPVOHPV-IHPCNDPISA-N His-Trp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O LNVILFYCPVOHPV-IHPCNDPISA-N 0.000 description 2
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 2
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 2
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 2
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 2
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 2
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 2
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 2
- ATXGFMOBVKSOMK-PEDHHIEDSA-N Ile-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N ATXGFMOBVKSOMK-PEDHHIEDSA-N 0.000 description 2
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 2
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 2
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 2
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 2
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 2
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 2
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 2
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 2
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 2
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 2
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 2
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 2
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 2
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 2
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 2
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 2
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 2
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 2
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 2
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 2
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 2
- JNDYZNJRRNFYIR-VGDYDELISA-N Ile-His-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N JNDYZNJRRNFYIR-VGDYDELISA-N 0.000 description 2
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 2
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 2
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 2
- SNHYFFQZRFIRHO-CYDGBPFRSA-N Ile-Met-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N SNHYFFQZRFIRHO-CYDGBPFRSA-N 0.000 description 2
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 2
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 2
- FHPZJWJWTWZKNA-LLLHUVSDSA-N Ile-Phe-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N FHPZJWJWTWZKNA-LLLHUVSDSA-N 0.000 description 2
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 2
- FBGXMKUWQFPHFB-JBDRJPRFSA-N Ile-Ser-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N FBGXMKUWQFPHFB-JBDRJPRFSA-N 0.000 description 2
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 2
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 2
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 2
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 2
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 2
- BQIIHAGJIYOQBP-YFYLHZKVSA-N Ile-Trp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N BQIIHAGJIYOQBP-YFYLHZKVSA-N 0.000 description 2
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- USTCFDAQCLDPBD-XIRDDKMYSA-N Leu-Asn-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N USTCFDAQCLDPBD-XIRDDKMYSA-N 0.000 description 2
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 2
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 2
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 2
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 2
- DKEZVKFLETVJFY-CIUDSAMLSA-N Leu-Cys-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DKEZVKFLETVJFY-CIUDSAMLSA-N 0.000 description 2
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 2
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 2
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 2
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 2
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 2
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 2
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 2
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 2
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 2
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 description 2
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 2
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 2
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 2
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 2
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 2
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 2
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 2
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 2
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 2
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 2
- URHJPNHRQMQGOZ-RHYQMDGZSA-N Leu-Thr-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O URHJPNHRQMQGOZ-RHYQMDGZSA-N 0.000 description 2
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 2
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 2
- YWFZWQKWNDOWPA-XIRDDKMYSA-N Leu-Trp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O YWFZWQKWNDOWPA-XIRDDKMYSA-N 0.000 description 2
- HQBOMRTVKVKFMN-WDSOQIARSA-N Leu-Trp-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O HQBOMRTVKVKFMN-WDSOQIARSA-N 0.000 description 2
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 2
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 2
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 2
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 2
- 239000006142 Luria-Bertani Agar Substances 0.000 description 2
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 2
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 2
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 2
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 2
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 2
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 2
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 2
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 2
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 2
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 2
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 2
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 2
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 2
- ZMMDPRTXLAEMOD-BZSNNMDCSA-N Lys-His-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZMMDPRTXLAEMOD-BZSNNMDCSA-N 0.000 description 2
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 2
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 2
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 2
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 2
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 2
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 2
- ZCWWVXAXWUAEPZ-SRVKXCTJSA-N Lys-Met-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZCWWVXAXWUAEPZ-SRVKXCTJSA-N 0.000 description 2
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 2
- CRIODIGWCUPXKU-AVGNSLFASA-N Lys-Pro-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O CRIODIGWCUPXKU-AVGNSLFASA-N 0.000 description 2
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 2
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 2
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 2
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 2
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 2
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 2
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 2
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 2
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 2
- 229920000057 Mannan Polymers 0.000 description 2
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 2
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 2
- JYCQGAGDJQYEDB-GUBZILKMSA-N Met-Gln-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O JYCQGAGDJQYEDB-GUBZILKMSA-N 0.000 description 2
- PHWSCIFNNLLUFJ-NHCYSSNCSA-N Met-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N PHWSCIFNNLLUFJ-NHCYSSNCSA-N 0.000 description 2
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 2
- FGAMAYQCWQCUNF-DCAQKATOSA-N Met-His-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FGAMAYQCWQCUNF-DCAQKATOSA-N 0.000 description 2
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 2
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 2
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 2
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 2
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 2
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 2
- WYNIRYZIFZGWQD-BPUTZDHNSA-N Met-Trp-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WYNIRYZIFZGWQD-BPUTZDHNSA-N 0.000 description 2
- MUDYEFAKNSTFAI-JYJNAYRXSA-N Met-Tyr-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O MUDYEFAKNSTFAI-JYJNAYRXSA-N 0.000 description 2
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 2
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 2
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 2
- 108010065395 Neuropep-1 Proteins 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- 108010002747 Pfu DNA polymerase Proteins 0.000 description 2
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 2
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 2
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 2
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 2
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 2
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 2
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 2
- MRNRMSDVVSKPGM-AVGNSLFASA-N Phe-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRNRMSDVVSKPGM-AVGNSLFASA-N 0.000 description 2
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 2
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 2
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 2
- UEEVBGHEGJMDDV-AVGNSLFASA-N Phe-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEEVBGHEGJMDDV-AVGNSLFASA-N 0.000 description 2
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 2
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 2
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 2
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 2
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 2
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 2
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 2
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 2
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 2
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 2
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 2
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 2
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 2
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 2
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 2
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 2
- AOKZOUGUMLBPSS-PMVMPFDFSA-N Phe-Trp-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O AOKZOUGUMLBPSS-PMVMPFDFSA-N 0.000 description 2
- NHHZWPNMYQUNEH-ACRUOGEOSA-N Phe-Tyr-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N NHHZWPNMYQUNEH-ACRUOGEOSA-N 0.000 description 2
- KIQUCMUULDXTAZ-HJOGWXRNSA-N Phe-Tyr-Tyr Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O KIQUCMUULDXTAZ-HJOGWXRNSA-N 0.000 description 2
- ZOGICTVLQDWPER-UFYCRDLUSA-N Phe-Tyr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O ZOGICTVLQDWPER-UFYCRDLUSA-N 0.000 description 2
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 2
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 2
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 2
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 2
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 2
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 2
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 2
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 2
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 2
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 2
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 2
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 2
- AJCRQOHDLCBHFA-SRVKXCTJSA-N Pro-His-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AJCRQOHDLCBHFA-SRVKXCTJSA-N 0.000 description 2
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 2
- RYJRPPUATSKNAY-STECZYCISA-N Pro-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@@H]2CCCN2 RYJRPPUATSKNAY-STECZYCISA-N 0.000 description 2
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 2
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 2
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 2
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 2
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 2
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 2
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 2
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 2
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 2
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 2
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 2
- CNUIHOAISPKQPY-HSHDSVGOSA-N Pro-Thr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CNUIHOAISPKQPY-HSHDSVGOSA-N 0.000 description 2
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 2
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 2
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 2
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 229920001218 Pullulan Polymers 0.000 description 2
- 239000004373 Pullulan Substances 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 2
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 2
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 2
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 2
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 2
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 2
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 2
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 2
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 2
- KCFKKAQKRZBWJB-ZLUOBGJFSA-N Ser-Cys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O KCFKKAQKRZBWJB-ZLUOBGJFSA-N 0.000 description 2
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 2
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 2
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 2
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 2
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 2
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 2
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 2
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 2
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 2
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 2
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 2
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 2
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 2
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 2
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 2
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 2
- FZEUTKVQGMVGHW-AVGNSLFASA-N Ser-Phe-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZEUTKVQGMVGHW-AVGNSLFASA-N 0.000 description 2
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 2
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 2
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 2
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 2
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 2
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 2
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 2
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 2
- FZNNGIHSIPKFRE-QEJZJMRPSA-N Ser-Trp-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZNNGIHSIPKFRE-QEJZJMRPSA-N 0.000 description 2
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 2
- VAIWUNAAPZZGRI-IHPCNDPISA-N Ser-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N VAIWUNAAPZZGRI-IHPCNDPISA-N 0.000 description 2
- YXEYTHXDRDAIOJ-CWRNSKLLSA-N Ser-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N)C(=O)O YXEYTHXDRDAIOJ-CWRNSKLLSA-N 0.000 description 2
- NERYDXBVARJIQS-JYBASQMISA-N Ser-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N)O NERYDXBVARJIQS-JYBASQMISA-N 0.000 description 2
- HAUVENOGHPECML-BPUTZDHNSA-N Ser-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 HAUVENOGHPECML-BPUTZDHNSA-N 0.000 description 2
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 2
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 2
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 2
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- 229920002472 Starch Polymers 0.000 description 2
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 2
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 2
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 2
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 2
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 2
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 2
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 2
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 2
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 2
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 2
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 2
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 2
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 2
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 2
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 2
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 2
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 2
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 2
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 2
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 2
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 2
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 2
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 2
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 2
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 description 2
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 2
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 2
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 2
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 2
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 2
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 2
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 2
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 2
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 2
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 2
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 2
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 2
- ZEJBJDHSQPOVJV-UAXMHLISSA-N Thr-Trp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZEJBJDHSQPOVJV-UAXMHLISSA-N 0.000 description 2
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 2
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 2
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 2
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 2
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 2
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 2
- OETOOJXFNSEYHQ-WFBYXXMGSA-N Trp-Ala-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 OETOOJXFNSEYHQ-WFBYXXMGSA-N 0.000 description 2
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 2
- DXDMNBJJEXYMLA-UBHSHLNASA-N Trp-Asn-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 DXDMNBJJEXYMLA-UBHSHLNASA-N 0.000 description 2
- UKINEYBQXPMOJO-UBHSHLNASA-N Trp-Asn-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N UKINEYBQXPMOJO-UBHSHLNASA-N 0.000 description 2
- PXQPYPMSLBQHJJ-WFBYXXMGSA-N Trp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PXQPYPMSLBQHJJ-WFBYXXMGSA-N 0.000 description 2
- SSNGFWKILJLTQM-QEJZJMRPSA-N Trp-Gln-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SSNGFWKILJLTQM-QEJZJMRPSA-N 0.000 description 2
- BEWOXKJJMBKRQL-AAEUAGOBSA-N Trp-Gly-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N BEWOXKJJMBKRQL-AAEUAGOBSA-N 0.000 description 2
- WSGPBCAGEGHKQJ-BBRMVZONSA-N Trp-Gly-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WSGPBCAGEGHKQJ-BBRMVZONSA-N 0.000 description 2
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 2
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 2
- VPRHDRKAPYZMHL-SZMVWBNQSA-N Trp-Leu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 VPRHDRKAPYZMHL-SZMVWBNQSA-N 0.000 description 2
- WLQRIHCMPFHGKP-PMVMPFDFSA-N Trp-Leu-Phe Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CC(C)C)C(O)=O)C1=CC=CC=C1 WLQRIHCMPFHGKP-PMVMPFDFSA-N 0.000 description 2
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 2
- WKQNLTQSCYXKQK-VFAJRCTISA-N Trp-Lys-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WKQNLTQSCYXKQK-VFAJRCTISA-N 0.000 description 2
- SUEGAFMNTXXNLR-WFBYXXMGSA-N Trp-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O SUEGAFMNTXXNLR-WFBYXXMGSA-N 0.000 description 2
- KBKTUNYBNJWFRL-UBHSHLNASA-N Trp-Ser-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 KBKTUNYBNJWFRL-UBHSHLNASA-N 0.000 description 2
- YCQXZDHDSUHUSG-FJHTZYQYSA-N Trp-Thr-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 YCQXZDHDSUHUSG-FJHTZYQYSA-N 0.000 description 2
- JTMZSIRTZKLBOA-NWLDYVSISA-N Trp-Thr-Gln Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O JTMZSIRTZKLBOA-NWLDYVSISA-N 0.000 description 2
- RPTAWXPQXXCUGL-OYDLWJJNSA-N Trp-Trp-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O RPTAWXPQXXCUGL-OYDLWJJNSA-N 0.000 description 2
- 102000004142 Trypsin Human genes 0.000 description 2
- 108090000631 Trypsin Proteins 0.000 description 2
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 2
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 2
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 2
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 2
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 2
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 2
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 2
- PEVVXUGSAKEPEN-AVGNSLFASA-N Tyr-Asn-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PEVVXUGSAKEPEN-AVGNSLFASA-N 0.000 description 2
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 2
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 2
- NGALWFGCOMHUSN-AVGNSLFASA-N Tyr-Gln-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NGALWFGCOMHUSN-AVGNSLFASA-N 0.000 description 2
- UXUFNBVCPAWACG-SIUGBPQLSA-N Tyr-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N UXUFNBVCPAWACG-SIUGBPQLSA-N 0.000 description 2
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 2
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 2
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 2
- IJUTXXAXQODRMW-KBPBESRZSA-N Tyr-Gly-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O IJUTXXAXQODRMW-KBPBESRZSA-N 0.000 description 2
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 2
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 2
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 2
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 2
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 2
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 2
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 2
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 2
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 2
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 2
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 2
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 2
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 2
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 2
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 2
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 2
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 2
- XCCTYIAWTASOJW-XVFCMESISA-N Uridine-5'-Diphosphate Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(=O)OP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 XCCTYIAWTASOJW-XVFCMESISA-N 0.000 description 2
- 108010064997 VPY tripeptide Proteins 0.000 description 2
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 2
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 2
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 2
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 2
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 2
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 2
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 2
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 2
- NYTKXWLZSNRILS-IFFSRLJSSA-N Val-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N)O NYTKXWLZSNRILS-IFFSRLJSSA-N 0.000 description 2
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 2
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 2
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 2
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 2
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 2
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 2
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 2
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 2
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 2
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 2
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 2
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 2
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 2
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 2
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 2
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 2
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 2
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 2
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 2
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 2
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 2
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 2
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 2
- AYHNXCJKBLYVOA-KSZLIROESA-N Val-Trp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N AYHNXCJKBLYVOA-KSZLIROESA-N 0.000 description 2
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 2
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 2
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 2
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 2
- 229920002000 Xyloglucan Polymers 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- 239000011543 agarose gel Substances 0.000 description 2
- 108010011559 alanylphenylalanine Proteins 0.000 description 2
- 239000000783 alginic acid Substances 0.000 description 2
- 229960001126 alginic acid Drugs 0.000 description 2
- 150000004781 alginic acids Chemical class 0.000 description 2
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 2
- AEMOLEFTQBMNLQ-BKBMJHBISA-N alpha-D-galacturonic acid Chemical compound O[C@H]1O[C@H](C(O)=O)[C@H](O)[C@H](O)[C@H]1O AEMOLEFTQBMNLQ-BKBMJHBISA-N 0.000 description 2
- arabinan Polymers 0.000 description 2
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 2
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 108010055059 beta-Mannosidase Proteins 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- IQFVPQOLBLOTPF-HKXUKFGYSA-L congo red Chemical compound [Na+].[Na+].C1=CC=CC2=C(N)C(/N=N/C3=CC=C(C=C3)C3=CC=C(C=C3)/N=N/C3=C(C4=CC=CC=C4C(=C3)S([O-])(=O)=O)N)=CC(S([O-])(=O)=O)=C21 IQFVPQOLBLOTPF-HKXUKFGYSA-L 0.000 description 2
- 239000000356 contaminant Substances 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 239000000539 dimer Substances 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 230000007071 enzymatic hydrolysis Effects 0.000 description 2
- 238000006047 enzymatic hydrolysis reaction Methods 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 108010038658 exo-1,4-beta-D-xylosidase Proteins 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 239000000835 fiber Substances 0.000 description 2
- 238000010230 functional analysis Methods 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 229930182830 galactose Natural products 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 2
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 2
- 229930182470 glycoside Natural products 0.000 description 2
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 2
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 2
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 2
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 239000012139 lysis buffer Substances 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 230000004060 metabolic process Effects 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 108010085203 methionylmethionine Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 238000000386 microscopy Methods 0.000 description 2
- 150000002772 monosaccharides Chemical class 0.000 description 2
- 229910001453 nickel ion Inorganic materials 0.000 description 2
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 239000010318 polygalacturonic acid Substances 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 2
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 2
- 235000019423 pullulan Nutrition 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 108010038196 saccharide-binding proteins Proteins 0.000 description 2
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 2
- 235000019698 starch Nutrition 0.000 description 2
- 239000008107 starch Substances 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 239000004753 textile Substances 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 229950003937 tolonium Drugs 0.000 description 2
- HNONEKILPDHFOL-UHFFFAOYSA-M tolonium chloride Chemical compound [Cl-].C1=C(C)C(N)=CC2=[S+]C3=CC(N(C)C)=CC=C3N=C21 HNONEKILPDHFOL-UHFFFAOYSA-M 0.000 description 2
- 239000012588 trypsin Substances 0.000 description 2
- 108010014563 tryptophyl-cysteinyl-serine Proteins 0.000 description 2
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 2
- 238000000108 ultra-filtration Methods 0.000 description 2
- 238000002604 ultrasonography Methods 0.000 description 2
- 241000556533 uncultured marine bacterium Species 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 108010003885 valyl-prolyl-glycyl-glycine Proteins 0.000 description 2
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 2
- 125000000969 xylosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)CO1)* 0.000 description 2
- FDKWRPBBCBCIGA-REOHCLBHSA-N (2r)-2-azaniumyl-3-λ¹-selanylpropanoate Chemical compound [Se]C[C@H](N)C(O)=O FDKWRPBBCBCIGA-REOHCLBHSA-N 0.000 description 1
- OMDQUFIYNPYJFM-XKDAHURESA-N (2r,3r,4s,5r,6s)-2-(hydroxymethyl)-6-[[(2r,3s,4r,5s,6r)-4,5,6-trihydroxy-3-[(2s,3s,4s,5s,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxyoxan-2-yl]methoxy]oxane-3,4,5-triol Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1OC[C@@H]1[C@@H](O[C@H]2[C@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)[C@H](O)[C@H](O)[C@H](O)O1 OMDQUFIYNPYJFM-XKDAHURESA-N 0.000 description 1
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 1
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 1
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 1
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 1
- SEFVRKXJJPMVHQ-YUMQZZPRSA-N (2s)-2-[[2-[[(2s)-2-[(2-aminoacetyl)amino]-5-(diaminomethylideneamino)pentanoyl]amino]acetyl]amino]butanedioic acid Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O SEFVRKXJJPMVHQ-YUMQZZPRSA-N 0.000 description 1
- 108020004465 16S ribosomal RNA Proteins 0.000 description 1
- VGONTNSXDCQUGY-RRKCRQDMSA-N 2'-deoxyinosine Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(N=CNC2=O)=C2N=C1 VGONTNSXDCQUGY-RRKCRQDMSA-N 0.000 description 1
- MXHRCPNRJAMMIM-SHYZEUOFSA-N 2'-deoxyuridine Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 MXHRCPNRJAMMIM-SHYZEUOFSA-N 0.000 description 1
- UZDMJOILBYFRMP-UHFFFAOYSA-N 2-[2-[2-[(2-amino-3-methylpentanoyl)amino]propanoylamino]propanoylamino]-3-methylpentanoic acid Chemical compound CCC(C)C(N)C(=O)NC(C)C(=O)NC(C)C(=O)NC(C(O)=O)C(C)CC UZDMJOILBYFRMP-UHFFFAOYSA-N 0.000 description 1
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 1
- WDMUXYQIMRDWRC-UHFFFAOYSA-N 2-hydroxy-3,4-dinitrobenzoic acid Chemical compound OC(=O)C1=CC=C([N+]([O-])=O)C([N+]([O-])=O)=C1O WDMUXYQIMRDWRC-UHFFFAOYSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- GQUAIKWWRYLALG-UHFFFAOYSA-N 4-formamido-n-[5-[[5-[3-[(9-methoxy-5,11-dimethyl-6h-pyrido[4,3-b]carbazol-1-yl)amino]propylcarbamoyl]-1-methylpyrrol-3-yl]carbamoyl]-1-methylpyrrol-3-yl]-1-methylpyrrole-2-carboxamide Chemical compound C=12C(C)=C3C4=CC(OC)=CC=C4NC3=C(C)C2=CC=NC=1NCCCNC(=O)C(N(C=1)C)=CC=1NC(=O)C(N(C=1)C)=CC=1NC(=O)C1=CC(NC=O)=CN1C GQUAIKWWRYLALG-UHFFFAOYSA-N 0.000 description 1
- DUYYBTBDYZXISX-UKKRHICBSA-N 4-nitrophenyl-ara Chemical compound O[C@@H]1[C@@H](O)[C@H](CO)O[C@H]1OC1=CC=C([N+]([O-])=O)C=C1 DUYYBTBDYZXISX-UKKRHICBSA-N 0.000 description 1
- 229940117976 5-hydroxylysine Drugs 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- ZFXQNADNEBRERM-BJDJZHNGSA-N Ala-Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ZFXQNADNEBRERM-BJDJZHNGSA-N 0.000 description 1
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- YWWATNIVMOCSAV-UBHSHLNASA-N Ala-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YWWATNIVMOCSAV-UBHSHLNASA-N 0.000 description 1
- DWINFPQUSSHSFS-UVBJJODRSA-N Ala-Arg-Trp Chemical compound N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O DWINFPQUSSHSFS-UVBJJODRSA-N 0.000 description 1
- PJNSIUPOXFBHDM-GUBZILKMSA-N Ala-Arg-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O PJNSIUPOXFBHDM-GUBZILKMSA-N 0.000 description 1
- YBPLKDWJFYCZSV-ZLUOBGJFSA-N Ala-Asn-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N YBPLKDWJFYCZSV-ZLUOBGJFSA-N 0.000 description 1
- MCGGOMKMWPPXKQ-JBDRJPRFSA-N Ala-Asn-Gln-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MCGGOMKMWPPXKQ-JBDRJPRFSA-N 0.000 description 1
- SHYYAQLDNVHPFT-DLOVCJGASA-N Ala-Asn-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SHYYAQLDNVHPFT-DLOVCJGASA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- WQVYAWIMAWTGMW-ZLUOBGJFSA-N Ala-Asp-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WQVYAWIMAWTGMW-ZLUOBGJFSA-N 0.000 description 1
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 1
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 1
- WJRXVTCKASUIFF-FXQIFTODSA-N Ala-Cys-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WJRXVTCKASUIFF-FXQIFTODSA-N 0.000 description 1
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 1
- FRFDXQWNDZMREB-ACZMJKKPSA-N Ala-Cys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRFDXQWNDZMREB-ACZMJKKPSA-N 0.000 description 1
- IYCZBJXFSZSHPN-DLOVCJGASA-N Ala-Cys-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IYCZBJXFSZSHPN-DLOVCJGASA-N 0.000 description 1
- UQJUGHFKNKGHFQ-VZFHVOOUSA-N Ala-Cys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UQJUGHFKNKGHFQ-VZFHVOOUSA-N 0.000 description 1
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 1
- CVHJIWVKTFNGHT-ACZMJKKPSA-N Ala-Gln-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N CVHJIWVKTFNGHT-ACZMJKKPSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- PWYFCPCBOYMOGB-LKTVYLICSA-N Ala-Gln-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N PWYFCPCBOYMOGB-LKTVYLICSA-N 0.000 description 1
- ZDYNWWQXFRUOEO-XDTLVQLUSA-N Ala-Gln-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDYNWWQXFRUOEO-XDTLVQLUSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- GRPHQEMIFDPKOE-HGNGGELXSA-N Ala-His-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GRPHQEMIFDPKOE-HGNGGELXSA-N 0.000 description 1
- OKEWAFFWMHBGPT-XPUUQOCRSA-N Ala-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 OKEWAFFWMHBGPT-XPUUQOCRSA-N 0.000 description 1
- HUUOZYZWNCXTFK-INTQDDNPSA-N Ala-His-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N HUUOZYZWNCXTFK-INTQDDNPSA-N 0.000 description 1
- NJWJSLCQEDMGNC-MBLNEYKQSA-N Ala-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N)O NJWJSLCQEDMGNC-MBLNEYKQSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- QJABSQFUHKHTNP-SYWGBEHUSA-N Ala-Ile-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QJABSQFUHKHTNP-SYWGBEHUSA-N 0.000 description 1
- GHBSKQGCIYSCNS-NAKRPEOUSA-N Ala-Leu-Asp-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GHBSKQGCIYSCNS-NAKRPEOUSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- FCXAUASCMJOFEY-NDKCEZKHSA-N Ala-Leu-Thr-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O FCXAUASCMJOFEY-NDKCEZKHSA-N 0.000 description 1
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 1
- IAUSCRHURCZUJP-CIUDSAMLSA-N Ala-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CS)C(O)=O IAUSCRHURCZUJP-CIUDSAMLSA-N 0.000 description 1
- OQWQTGBOFPJOIF-DLOVCJGASA-N Ala-Lys-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N OQWQTGBOFPJOIF-DLOVCJGASA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- OARAZORWIMYUPO-FXQIFTODSA-N Ala-Met-Cys Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CS)C(O)=O OARAZORWIMYUPO-FXQIFTODSA-N 0.000 description 1
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 1
- IHRGVZXPTIQNIP-NAKRPEOUSA-N Ala-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C)N IHRGVZXPTIQNIP-NAKRPEOUSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 1
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 1
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 1
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 1
- XAXHGSOBFPIRFG-LSJOCFKGSA-N Ala-Pro-His Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XAXHGSOBFPIRFG-LSJOCFKGSA-N 0.000 description 1
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- JNLDTVRGXMSYJC-UVBJJODRSA-N Ala-Pro-Trp Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JNLDTVRGXMSYJC-UVBJJODRSA-N 0.000 description 1
- CQJHFKKGZXKZBC-BPNCWPANSA-N Ala-Pro-Tyr Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CQJHFKKGZXKZBC-BPNCWPANSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 1
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- IEAUDUOCWNPZBR-LKTVYLICSA-N Ala-Trp-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IEAUDUOCWNPZBR-LKTVYLICSA-N 0.000 description 1
- PXAFZDXYEIIUTF-LKTVYLICSA-N Ala-Trp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXAFZDXYEIIUTF-LKTVYLICSA-N 0.000 description 1
- SFPRJVVDZNLUTG-OWLDWWDNSA-N Ala-Trp-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFPRJVVDZNLUTG-OWLDWWDNSA-N 0.000 description 1
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 1
- DEAGTWNKODHUIY-MRFFXTKBSA-N Ala-Tyr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DEAGTWNKODHUIY-MRFFXTKBSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- XIHHLOVIGNFBCR-HVTMNAMFSA-N Ala-Val-Asp-His Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 XIHHLOVIGNFBCR-HVTMNAMFSA-N 0.000 description 1
- OIRCZHKOHJUHAC-SIUGBPQLSA-N Ala-Val-Asp-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OIRCZHKOHJUHAC-SIUGBPQLSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- 244000153158 Ammi visnaga Species 0.000 description 1
- 235000010585 Ammi visnaga Nutrition 0.000 description 1
- 102100029464 Aquaporin-9 Human genes 0.000 description 1
- OOBVTWHLKYJFJH-FXQIFTODSA-N Arg-Ala-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O OOBVTWHLKYJFJH-FXQIFTODSA-N 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 1
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 1
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- WESHVRNMNFMVBE-FXQIFTODSA-N Arg-Asn-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N WESHVRNMNFMVBE-FXQIFTODSA-N 0.000 description 1
- BAVDUESNGSMLPI-CIUDSAMLSA-N Arg-Asn-Gly-Ser Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BAVDUESNGSMLPI-CIUDSAMLSA-N 0.000 description 1
- MAISCYVJLBBRNU-DCAQKATOSA-N Arg-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N MAISCYVJLBBRNU-DCAQKATOSA-N 0.000 description 1
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 1
- RCAUJZASOAFTAJ-FXQIFTODSA-N Arg-Asp-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N RCAUJZASOAFTAJ-FXQIFTODSA-N 0.000 description 1
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 1
- TTXYKSADPSNOIF-IHRRRGAJSA-N Arg-Asp-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O TTXYKSADPSNOIF-IHRRRGAJSA-N 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 1
- ASQYTJJWAMDISW-BPUTZDHNSA-N Arg-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N ASQYTJJWAMDISW-BPUTZDHNSA-N 0.000 description 1
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 1
- RYRQZJVFDVWURI-SRVKXCTJSA-N Arg-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N RYRQZJVFDVWURI-SRVKXCTJSA-N 0.000 description 1
- LLZXKVAAEWBUPB-KKUMJFAQSA-N Arg-Gln-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLZXKVAAEWBUPB-KKUMJFAQSA-N 0.000 description 1
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 1
- BQBPFMNVOWDLHO-XIRDDKMYSA-N Arg-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N BQBPFMNVOWDLHO-XIRDDKMYSA-N 0.000 description 1
- MTANSHNQTWPZKP-KKUMJFAQSA-N Arg-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O MTANSHNQTWPZKP-KKUMJFAQSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- HPSVTWMFWCHKFN-GARJFASQSA-N Arg-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O HPSVTWMFWCHKFN-GARJFASQSA-N 0.000 description 1
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- MSILNNHVVMMTHZ-UWVGGRQHSA-N Arg-His-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 MSILNNHVVMMTHZ-UWVGGRQHSA-N 0.000 description 1
- PCQXGEUALSFGIA-WDSOQIARSA-N Arg-His-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PCQXGEUALSFGIA-WDSOQIARSA-N 0.000 description 1
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 1
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 1
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- JEXPNDORFYHJTM-IHRRRGAJSA-N Arg-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCN=C(N)N JEXPNDORFYHJTM-IHRRRGAJSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 1
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 1
- RTDZQOFEGPWSJD-AVGNSLFASA-N Arg-Leu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O RTDZQOFEGPWSJD-AVGNSLFASA-N 0.000 description 1
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- QBQVKUNBCAFXSV-ULQDDVLXSA-N Arg-Lys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QBQVKUNBCAFXSV-ULQDDVLXSA-N 0.000 description 1
- AFNHFVVOJZBIJD-GUBZILKMSA-N Arg-Met-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O AFNHFVVOJZBIJD-GUBZILKMSA-N 0.000 description 1
- VVJTWSRNMJNDPN-IUCAKERBSA-N Arg-Met-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O VVJTWSRNMJNDPN-IUCAKERBSA-N 0.000 description 1
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 1
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 1
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 1
- KZXPVYVSHUJCEO-ULQDDVLXSA-N Arg-Phe-Lys Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 KZXPVYVSHUJCEO-ULQDDVLXSA-N 0.000 description 1
- RATVAFHGEFAWDH-JYJNAYRXSA-N Arg-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCN=C(N)N)N RATVAFHGEFAWDH-JYJNAYRXSA-N 0.000 description 1
- XFXZKCRBBOVJKS-BVSLBCMMSA-N Arg-Phe-Trp Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 XFXZKCRBBOVJKS-BVSLBCMMSA-N 0.000 description 1
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- JJIBHAOBNIFUEL-SRVKXCTJSA-N Arg-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCN=C(N)N)N JJIBHAOBNIFUEL-SRVKXCTJSA-N 0.000 description 1
- OWSMKCJUBAPHED-JYJNAYRXSA-N Arg-Pro-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OWSMKCJUBAPHED-JYJNAYRXSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 1
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 1
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 1
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 1
- WTFIFQWLQXZLIZ-UMPQAUOISA-N Arg-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O WTFIFQWLQXZLIZ-UMPQAUOISA-N 0.000 description 1
- BXLDDWZOTGGNOJ-SZMVWBNQSA-N Arg-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCN=C(N)N)N BXLDDWZOTGGNOJ-SZMVWBNQSA-N 0.000 description 1
- LOVIQNMIPQVIGT-BVSLBCMMSA-N Arg-Trp-Phe Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)C1=CC=CC=C1 LOVIQNMIPQVIGT-BVSLBCMMSA-N 0.000 description 1
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 1
- CTAPSNCVKPOOSM-KKUMJFAQSA-N Arg-Tyr-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CTAPSNCVKPOOSM-KKUMJFAQSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 1
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- WHLDJYNHXOMGMU-JYJNAYRXSA-N Arg-Val-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WHLDJYNHXOMGMU-JYJNAYRXSA-N 0.000 description 1
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 1
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 1
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 1
- GOVUDFOGXOONFT-VEVYYDQMSA-N Asn-Arg-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GOVUDFOGXOONFT-VEVYYDQMSA-N 0.000 description 1
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 1
- HAJWYALLJIATCX-FXQIFTODSA-N Asn-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N HAJWYALLJIATCX-FXQIFTODSA-N 0.000 description 1
- YNSCBOUZTAGIGO-ZLUOBGJFSA-N Asn-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N YNSCBOUZTAGIGO-ZLUOBGJFSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 1
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 1
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 1
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 1
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 1
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 1
- BGINHSZTXRJIPP-FXQIFTODSA-N Asn-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BGINHSZTXRJIPP-FXQIFTODSA-N 0.000 description 1
- XXAOXVBAWLMTDR-ZLUOBGJFSA-N Asn-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N XXAOXVBAWLMTDR-ZLUOBGJFSA-N 0.000 description 1
- WQSCVMQDZYTFQU-FXQIFTODSA-N Asn-Cys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WQSCVMQDZYTFQU-FXQIFTODSA-N 0.000 description 1
- NKTLGLBAGUJEGA-BIIVOSGPSA-N Asn-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N)C(=O)O NKTLGLBAGUJEGA-BIIVOSGPSA-N 0.000 description 1
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 1
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 1
- HJRBIWRXULGMOA-ACZMJKKPSA-N Asn-Gln-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJRBIWRXULGMOA-ACZMJKKPSA-N 0.000 description 1
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 1
- QPTAGIPWARILES-AVGNSLFASA-N Asn-Gln-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QPTAGIPWARILES-AVGNSLFASA-N 0.000 description 1
- KWQPAXYXVMHJJR-AVGNSLFASA-N Asn-Gln-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KWQPAXYXVMHJJR-AVGNSLFASA-N 0.000 description 1
- IIFDPDVJAHQFSR-WHFBIAKZSA-N Asn-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O IIFDPDVJAHQFSR-WHFBIAKZSA-N 0.000 description 1
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 1
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- PLVAAIPKSGUXDV-WHFBIAKZSA-N Asn-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)N PLVAAIPKSGUXDV-WHFBIAKZSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- GURLOFOJBHRPJN-AAEUAGOBSA-N Asn-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GURLOFOJBHRPJN-AAEUAGOBSA-N 0.000 description 1
- RAKKBBHMTJSXOY-XVYDVKMFSA-N Asn-His-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O RAKKBBHMTJSXOY-XVYDVKMFSA-N 0.000 description 1
- VXLBDJWTONZHJN-YUMQZZPRSA-N Asn-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N VXLBDJWTONZHJN-YUMQZZPRSA-N 0.000 description 1
- SGAUXNZEFIEAAI-GARJFASQSA-N Asn-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)C(=O)O SGAUXNZEFIEAAI-GARJFASQSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- KMCRKVOLRCOMBG-DJFWLOJKSA-N Asn-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KMCRKVOLRCOMBG-DJFWLOJKSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 1
- LVHMEJJWEXBMKK-GMOBBJLQSA-N Asn-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N LVHMEJJWEXBMKK-GMOBBJLQSA-N 0.000 description 1
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 1
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 1
- HXWUJJADFMXNKA-BQBZGAKWSA-N Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O HXWUJJADFMXNKA-BQBZGAKWSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- NUCUBYIUPVYGPP-XIRDDKMYSA-N Asn-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(N)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O NUCUBYIUPVYGPP-XIRDDKMYSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- LZLCLRQMUQWUHJ-GUBZILKMSA-N Asn-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N LZLCLRQMUQWUHJ-GUBZILKMSA-N 0.000 description 1
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- QDXQWFBLUVTOFL-FXQIFTODSA-N Asn-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)N)N QDXQWFBLUVTOFL-FXQIFTODSA-N 0.000 description 1
- VOKWBBBXJONREA-DCAQKATOSA-N Asn-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N VOKWBBBXJONREA-DCAQKATOSA-N 0.000 description 1
- PBFXCUOEGVJTMV-QXEWZRGKSA-N Asn-Met-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O PBFXCUOEGVJTMV-QXEWZRGKSA-N 0.000 description 1
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 description 1
- BKZFBJYIVSBXCO-KKUMJFAQSA-N Asn-Phe-His Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O BKZFBJYIVSBXCO-KKUMJFAQSA-N 0.000 description 1
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 1
- FTNRWCPWDWRPAV-BZSNNMDCSA-N Asn-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTNRWCPWDWRPAV-BZSNNMDCSA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 1
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 1
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- YHXNKGKUDJCAHB-PBCZWWQYSA-N Asn-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YHXNKGKUDJCAHB-PBCZWWQYSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 1
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 1
- DAYDURRBMDCCFL-AAEUAGOBSA-N Asn-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N DAYDURRBMDCCFL-AAEUAGOBSA-N 0.000 description 1
- LUJQEUOZJUWRRX-BPUTZDHNSA-N Asn-Trp-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(O)=O LUJQEUOZJUWRRX-BPUTZDHNSA-N 0.000 description 1
- CPYHLXSGDBDULY-IHPCNDPISA-N Asn-Trp-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CPYHLXSGDBDULY-IHPCNDPISA-N 0.000 description 1
- LGCVSPFCFXWUEY-IHPCNDPISA-N Asn-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N LGCVSPFCFXWUEY-IHPCNDPISA-N 0.000 description 1
- YQPSDMUGFKJZHR-QRTARXTBSA-N Asn-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)N)N YQPSDMUGFKJZHR-QRTARXTBSA-N 0.000 description 1
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 1
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 1
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 1
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 1
- LTDGPJKGJDIBQD-LAEOZQHASA-N Asn-Val-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LTDGPJKGJDIBQD-LAEOZQHASA-N 0.000 description 1
- XZFONYMRYTVLPL-NHCYSSNCSA-N Asn-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N XZFONYMRYTVLPL-NHCYSSNCSA-N 0.000 description 1
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- GBAWQWASNGUNQF-ZLUOBGJFSA-N Asp-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N GBAWQWASNGUNQF-ZLUOBGJFSA-N 0.000 description 1
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 1
- DBWYWXNMZZYIRY-LPEHRKFASA-N Asp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O DBWYWXNMZZYIRY-LPEHRKFASA-N 0.000 description 1
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 1
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 1
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 1
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- ZCKYZTGLXIEOKS-CIUDSAMLSA-N Asp-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N ZCKYZTGLXIEOKS-CIUDSAMLSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 1
- MJKBOVWWADWLHV-ZLUOBGJFSA-N Asp-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)O MJKBOVWWADWLHV-ZLUOBGJFSA-N 0.000 description 1
- KIJLEFNHWSXHRU-NUMRIWBASA-N Asp-Gln-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KIJLEFNHWSXHRU-NUMRIWBASA-N 0.000 description 1
- JRBVWZLHBGYZNY-QEJZJMRPSA-N Asp-Gln-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRBVWZLHBGYZNY-QEJZJMRPSA-N 0.000 description 1
- UFAQGGZUXVLONR-AVGNSLFASA-N Asp-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)O UFAQGGZUXVLONR-AVGNSLFASA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 1
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 1
- ODNWIBOCFGMRTP-SRVKXCTJSA-N Asp-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CN=CN1 ODNWIBOCFGMRTP-SRVKXCTJSA-N 0.000 description 1
- RWHHSFSWKFBTCF-KKUMJFAQSA-N Asp-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N RWHHSFSWKFBTCF-KKUMJFAQSA-N 0.000 description 1
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- LKVKODXGSAFOFY-VEVYYDQMSA-N Asp-Met-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKVKODXGSAFOFY-VEVYYDQMSA-N 0.000 description 1
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- KRQFMDNIUOVRIF-KKUMJFAQSA-N Asp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N KRQFMDNIUOVRIF-KKUMJFAQSA-N 0.000 description 1
- LIQNMKIBMPEOOP-IHRRRGAJSA-N Asp-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)O)N LIQNMKIBMPEOOP-IHRRRGAJSA-N 0.000 description 1
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 1
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 1
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- LGGHQRZIJSYRHA-GUBZILKMSA-N Asp-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N LGGHQRZIJSYRHA-GUBZILKMSA-N 0.000 description 1
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 1
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 1
- SXLCDCZHNCLFGZ-BPUTZDHNSA-N Asp-Pro-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SXLCDCZHNCLFGZ-BPUTZDHNSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- NBKLEMWHDLAUEM-CIUDSAMLSA-N Asp-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N NBKLEMWHDLAUEM-CIUDSAMLSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- OZBXOELNJBSJOA-UBHSHLNASA-N Asp-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OZBXOELNJBSJOA-UBHSHLNASA-N 0.000 description 1
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 1
- QOCFFCUFZGDHTP-NUMRIWBASA-N Asp-Thr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOCFFCUFZGDHTP-NUMRIWBASA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- XOASPVGNFAMYBD-WFBYXXMGSA-N Asp-Trp-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O XOASPVGNFAMYBD-WFBYXXMGSA-N 0.000 description 1
- YODBPLSWNJMZOJ-BPUTZDHNSA-N Asp-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N YODBPLSWNJMZOJ-BPUTZDHNSA-N 0.000 description 1
- DKQCWCQRAMAFLN-UBHSHLNASA-N Asp-Trp-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O DKQCWCQRAMAFLN-UBHSHLNASA-N 0.000 description 1
- RMFITHMDQGFSDC-UBHSHLNASA-N Asp-Trp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N RMFITHMDQGFSDC-UBHSHLNASA-N 0.000 description 1
- IHZFGJLKDYINPV-XIRDDKMYSA-N Asp-Trp-His Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC(O)=O)N)C(O)=O)C1=CN=CN1 IHZFGJLKDYINPV-XIRDDKMYSA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- FIRWLDUOFOULCA-XIRDDKMYSA-N Asp-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N FIRWLDUOFOULCA-XIRDDKMYSA-N 0.000 description 1
- GWOVSEVNXNVMMY-BPUTZDHNSA-N Asp-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N GWOVSEVNXNVMMY-BPUTZDHNSA-N 0.000 description 1
- ZVYYMCXVPZEAPU-CWRNSKLLSA-N Asp-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZVYYMCXVPZEAPU-CWRNSKLLSA-N 0.000 description 1
- HCOQNGIHSXICCB-IHRRRGAJSA-N Asp-Tyr-Arg Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)O HCOQNGIHSXICCB-IHRRRGAJSA-N 0.000 description 1
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 1
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 1
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 1
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- 241000589152 Azotobacter chroococcum Species 0.000 description 1
- 238000000035 BCA protein assay Methods 0.000 description 1
- 101100354312 Bacillus subtilis (strain 168) licC gene Proteins 0.000 description 1
- 239000002028 Biomass Substances 0.000 description 1
- 241001474374 Blennius Species 0.000 description 1
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 102000005701 Calcium-Binding Proteins Human genes 0.000 description 1
- 108010045403 Calcium-Binding Proteins Proteins 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 229920002299 Cellodextrin Polymers 0.000 description 1
- 101710095524 Cellodextrinase Proteins 0.000 description 1
- 241000186320 Cellulomonas fimi Species 0.000 description 1
- 108010054033 Chitin deacetylase Proteins 0.000 description 1
- 241000193403 Clostridium Species 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- MIKUYHXYGGJMLM-UUOKFMHZSA-N Crotonoside Chemical compound C1=NC2=C(N)NC(=O)N=C2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O MIKUYHXYGGJMLM-UUOKFMHZSA-N 0.000 description 1
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 1
- 241000195493 Cryptophyta Species 0.000 description 1
- PLBJMUUEGBBHRH-ZLUOBGJFSA-N Cys-Ala-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLBJMUUEGBBHRH-ZLUOBGJFSA-N 0.000 description 1
- GRNOCLDFUNCIDW-ACZMJKKPSA-N Cys-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N GRNOCLDFUNCIDW-ACZMJKKPSA-N 0.000 description 1
- XEEIQMGZRFFSRD-XVYDVKMFSA-N Cys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N XEEIQMGZRFFSRD-XVYDVKMFSA-N 0.000 description 1
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 1
- OJQJUQUBJGTCRY-WFBYXXMGSA-N Cys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N OJQJUQUBJGTCRY-WFBYXXMGSA-N 0.000 description 1
- CLDCTNHPILWQCW-CIUDSAMLSA-N Cys-Arg-Glu Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N CLDCTNHPILWQCW-CIUDSAMLSA-N 0.000 description 1
- QDFBJJABJKOLTD-FXQIFTODSA-N Cys-Asn-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QDFBJJABJKOLTD-FXQIFTODSA-N 0.000 description 1
- KIQKJXYVGSYDFS-ZLUOBGJFSA-N Cys-Asn-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KIQKJXYVGSYDFS-ZLUOBGJFSA-N 0.000 description 1
- CPTUXCUWQIBZIF-ZLUOBGJFSA-N Cys-Asn-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CPTUXCUWQIBZIF-ZLUOBGJFSA-N 0.000 description 1
- KIHRUISMQZVCNO-ZLUOBGJFSA-N Cys-Asp-Asp Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KIHRUISMQZVCNO-ZLUOBGJFSA-N 0.000 description 1
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 1
- WKELHWMCIXSVDT-UBHSHLNASA-N Cys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N WKELHWMCIXSVDT-UBHSHLNASA-N 0.000 description 1
- LMXOUGMSGHFLRX-CIUDSAMLSA-N Cys-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N LMXOUGMSGHFLRX-CIUDSAMLSA-N 0.000 description 1
- DZIGZIIJIGGANI-FXQIFTODSA-N Cys-Glu-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DZIGZIIJIGGANI-FXQIFTODSA-N 0.000 description 1
- ZEXHDOQQYZKOIB-ACZMJKKPSA-N Cys-Glu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZEXHDOQQYZKOIB-ACZMJKKPSA-N 0.000 description 1
- AOZBJZBKFHOYHL-AVGNSLFASA-N Cys-Glu-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O AOZBJZBKFHOYHL-AVGNSLFASA-N 0.000 description 1
- BSFFNUBDVYTDMV-WHFBIAKZSA-N Cys-Gly-Asn Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BSFFNUBDVYTDMV-WHFBIAKZSA-N 0.000 description 1
- KCSDYJSCUWLILX-BJDJZHNGSA-N Cys-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N KCSDYJSCUWLILX-BJDJZHNGSA-N 0.000 description 1
- SSNJZBGOMNLSLA-CIUDSAMLSA-N Cys-Leu-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O SSNJZBGOMNLSLA-CIUDSAMLSA-N 0.000 description 1
- HKALUUKHYNEDRS-GUBZILKMSA-N Cys-Leu-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HKALUUKHYNEDRS-GUBZILKMSA-N 0.000 description 1
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 1
- IDFVDSBJNMPBSX-SRVKXCTJSA-N Cys-Lys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O IDFVDSBJNMPBSX-SRVKXCTJSA-N 0.000 description 1
- PGBLJHDDKCVSTC-CIUDSAMLSA-N Cys-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O PGBLJHDDKCVSTC-CIUDSAMLSA-N 0.000 description 1
- UIKLEGZPIOXFHJ-DLOVCJGASA-N Cys-Phe-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O UIKLEGZPIOXFHJ-DLOVCJGASA-N 0.000 description 1
- WTEACWBAULENKE-SRVKXCTJSA-N Cys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N WTEACWBAULENKE-SRVKXCTJSA-N 0.000 description 1
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 1
- CAXGCBSRJLADPD-FXQIFTODSA-N Cys-Pro-Asn Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O CAXGCBSRJLADPD-FXQIFTODSA-N 0.000 description 1
- RAGIABZNLPZBGS-FXQIFTODSA-N Cys-Pro-Cys Chemical compound N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O RAGIABZNLPZBGS-FXQIFTODSA-N 0.000 description 1
- TXCCRYAZQBUCOV-CIUDSAMLSA-N Cys-Pro-Gln Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O TXCCRYAZQBUCOV-CIUDSAMLSA-N 0.000 description 1
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 1
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 1
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 1
- NRVQLLDIJJEIIZ-VZFHVOOUSA-N Cys-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N)O NRVQLLDIJJEIIZ-VZFHVOOUSA-N 0.000 description 1
- SPJRFUJMDJGDRO-UBHSHLNASA-N Cys-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CS)N)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 SPJRFUJMDJGDRO-UBHSHLNASA-N 0.000 description 1
- MJOYUXLETJMQGG-IHRRRGAJSA-N Cys-Tyr-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MJOYUXLETJMQGG-IHRRRGAJSA-N 0.000 description 1
- VXDXZGYXHIADHF-YJRXYDGGSA-N Cys-Tyr-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VXDXZGYXHIADHF-YJRXYDGGSA-N 0.000 description 1
- YQEHNIKPAOPBNH-DCAQKATOSA-N Cys-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N YQEHNIKPAOPBNH-DCAQKATOSA-N 0.000 description 1
- ATFSDBMHRCDLBV-BPUTZDHNSA-N Cys-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N ATFSDBMHRCDLBV-BPUTZDHNSA-N 0.000 description 1
- FDKWRPBBCBCIGA-UWTATZPHSA-N D-Selenocysteine Natural products [Se]C[C@@H](N)C(O)=O FDKWRPBBCBCIGA-UWTATZPHSA-N 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 108010001817 Endo-1,4-beta Xylanases Proteins 0.000 description 1
- 244000148064 Enicostema verticillatum Species 0.000 description 1
- 101100410352 Escherichia coli (strain K12) chbC gene Proteins 0.000 description 1
- 101100129092 Escherichia coli hic gene Proteins 0.000 description 1
- 108090000371 Esterases Proteins 0.000 description 1
- 230000005526 G1 to G0 transition Effects 0.000 description 1
- 108010046649 GDNP peptide Proteins 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 1
- YNNXQZDEOCYJJL-CIUDSAMLSA-N Gln-Arg-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N YNNXQZDEOCYJJL-CIUDSAMLSA-N 0.000 description 1
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 1
- MQANCSUBSBJNLU-KKUMJFAQSA-N Gln-Arg-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQANCSUBSBJNLU-KKUMJFAQSA-N 0.000 description 1
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 1
- RRYLMJWPWBJFPZ-ACZMJKKPSA-N Gln-Asn-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RRYLMJWPWBJFPZ-ACZMJKKPSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- KWLMLNHADZIJIS-CIUDSAMLSA-N Gln-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N KWLMLNHADZIJIS-CIUDSAMLSA-N 0.000 description 1
- CKNUKHBRCSMKMO-XHNCKOQMSA-N Gln-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O CKNUKHBRCSMKMO-XHNCKOQMSA-N 0.000 description 1
- DXMPMSWUZVNBSG-QEJZJMRPSA-N Gln-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N DXMPMSWUZVNBSG-QEJZJMRPSA-N 0.000 description 1
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 1
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 1
- UZMWDBOHAOSCCH-ACZMJKKPSA-N Gln-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O UZMWDBOHAOSCCH-ACZMJKKPSA-N 0.000 description 1
- COYGBRTZEVWZBW-XKBZYTNZSA-N Gln-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O COYGBRTZEVWZBW-XKBZYTNZSA-N 0.000 description 1
- ZDJZEGYVKANKED-NRPADANISA-N Gln-Cys-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O ZDJZEGYVKANKED-NRPADANISA-N 0.000 description 1
- LOJYQMFIIJVETK-WDSKDSINSA-N Gln-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LOJYQMFIIJVETK-WDSKDSINSA-N 0.000 description 1
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 1
- IVCOYUURLWQDJQ-LPEHRKFASA-N Gln-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O IVCOYUURLWQDJQ-LPEHRKFASA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 1
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 1
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 1
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 1
- PODFFOWWLUPNMN-DCAQKATOSA-N Gln-His-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PODFFOWWLUPNMN-DCAQKATOSA-N 0.000 description 1
- GLEGHWQNGPMKHO-DCAQKATOSA-N Gln-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GLEGHWQNGPMKHO-DCAQKATOSA-N 0.000 description 1
- XQEAVUJIRZRLQQ-SZMVWBNQSA-N Gln-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CCC(=O)N)N XQEAVUJIRZRLQQ-SZMVWBNQSA-N 0.000 description 1
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- GQZDDFRXSDGUNG-YVNDNENWSA-N Gln-Ile-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O GQZDDFRXSDGUNG-YVNDNENWSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- FYAULIGIFPPOAA-ZPFDUUQYSA-N Gln-Ile-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O FYAULIGIFPPOAA-ZPFDUUQYSA-N 0.000 description 1
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 1
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- ILKYYKRAULNYMS-JYJNAYRXSA-N Gln-Lys-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ILKYYKRAULNYMS-JYJNAYRXSA-N 0.000 description 1
- QMVCEWKHIUHTSD-GUBZILKMSA-N Gln-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QMVCEWKHIUHTSD-GUBZILKMSA-N 0.000 description 1
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 1
- ROHVCXBMIAAASL-HJGDQZAQSA-N Gln-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)N)N)O ROHVCXBMIAAASL-HJGDQZAQSA-N 0.000 description 1
- RWCBJYUPAUTWJD-NHCYSSNCSA-N Gln-Met-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O RWCBJYUPAUTWJD-NHCYSSNCSA-N 0.000 description 1
- SWDSRANUCKNBLA-AVGNSLFASA-N Gln-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SWDSRANUCKNBLA-AVGNSLFASA-N 0.000 description 1
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 1
- FTTHLXOMDMLKKW-FHWLQOOXSA-N Gln-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTTHLXOMDMLKKW-FHWLQOOXSA-N 0.000 description 1
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 1
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 1
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 1
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- QENSHQJGWGRPQS-QEJZJMRPSA-N Gln-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)N)C(O)=O)=CNC2=C1 QENSHQJGWGRPQS-QEJZJMRPSA-N 0.000 description 1
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 1
- YRHZWVKUFWCEPW-GLLZPBPUSA-N Gln-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YRHZWVKUFWCEPW-GLLZPBPUSA-N 0.000 description 1
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- XMWNHGKDDIFXQJ-NWLDYVSISA-N Gln-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XMWNHGKDDIFXQJ-NWLDYVSISA-N 0.000 description 1
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- CMFBOXUBWMZZMD-BPUTZDHNSA-N Gln-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N CMFBOXUBWMZZMD-BPUTZDHNSA-N 0.000 description 1
- AKDOUBMVLRCHBD-SIUGBPQLSA-N Gln-Tyr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AKDOUBMVLRCHBD-SIUGBPQLSA-N 0.000 description 1
- JTWZNMUVQWWGOX-SOUVJXGZSA-N Gln-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JTWZNMUVQWWGOX-SOUVJXGZSA-N 0.000 description 1
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 1
- UQKVUFGUSVYJMQ-IRIUXVKKSA-N Gln-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N)O UQKVUFGUSVYJMQ-IRIUXVKKSA-N 0.000 description 1
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 1
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 1
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 1
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- BUVMZWZNWMKASN-QEJZJMRPSA-N Glu-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 BUVMZWZNWMKASN-QEJZJMRPSA-N 0.000 description 1
- ZJICFHQSPWFBKP-AVGNSLFASA-N Glu-Asn-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZJICFHQSPWFBKP-AVGNSLFASA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- PKYAVRMYTBBRLS-FXQIFTODSA-N Glu-Cys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O PKYAVRMYTBBRLS-FXQIFTODSA-N 0.000 description 1
- FKGNJUCQKXQNRA-NRPADANISA-N Glu-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O FKGNJUCQKXQNRA-NRPADANISA-N 0.000 description 1
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 1
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- CJWANNXUTOATSJ-DCAQKATOSA-N Glu-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N CJWANNXUTOATSJ-DCAQKATOSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 1
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 1
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 1
- HOIPREWORBVRLD-XIRDDKMYSA-N Glu-Met-Trp Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O HOIPREWORBVRLD-XIRDDKMYSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 1
- CBOVGULVQSVMPT-CIUDSAMLSA-N Glu-Pro-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O CBOVGULVQSVMPT-CIUDSAMLSA-N 0.000 description 1
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 1
- PAZQYODKOZHXGA-SRVKXCTJSA-N Glu-Pro-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O PAZQYODKOZHXGA-SRVKXCTJSA-N 0.000 description 1
- LPHGXOWFAXFCPX-KKUMJFAQSA-N Glu-Pro-Phe Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O LPHGXOWFAXFCPX-KKUMJFAQSA-N 0.000 description 1
- ARIORLIIMJACKZ-KKUMJFAQSA-N Glu-Pro-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ARIORLIIMJACKZ-KKUMJFAQSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 1
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- ZQNCUVODKOBSSO-XEGUGMAKSA-N Glu-Trp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZQNCUVODKOBSSO-XEGUGMAKSA-N 0.000 description 1
- RZMXBFUSQNLEQF-QEJZJMRPSA-N Glu-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RZMXBFUSQNLEQF-QEJZJMRPSA-N 0.000 description 1
- JVZLZVJTIXVIHK-SXNHZJKMSA-N Glu-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N JVZLZVJTIXVIHK-SXNHZJKMSA-N 0.000 description 1
- DXMOIVCNJIJQSC-QEJZJMRPSA-N Glu-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N DXMOIVCNJIJQSC-QEJZJMRPSA-N 0.000 description 1
- SFKMXFWWDUGXRT-NWLDYVSISA-N Glu-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N)O SFKMXFWWDUGXRT-NWLDYVSISA-N 0.000 description 1
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 1
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 1
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 1
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 1
- QLNKFGTZOBVMCS-JBACZVJFSA-N Glu-Tyr-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QLNKFGTZOBVMCS-JBACZVJFSA-N 0.000 description 1
- LSYFGBRDBIQYAQ-FHWLQOOXSA-N Glu-Tyr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LSYFGBRDBIQYAQ-FHWLQOOXSA-N 0.000 description 1
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- 102000004366 Glucosidases Human genes 0.000 description 1
- 108010056771 Glucosidases Proteins 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- XZRZILPOZBVTDB-GJZGRUSLSA-N Gly-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)CN)C(O)=O)=CNC2=C1 XZRZILPOZBVTDB-GJZGRUSLSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 1
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 1
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 1
- DUYYPIRFTLOAJQ-YUMQZZPRSA-N Gly-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN DUYYPIRFTLOAJQ-YUMQZZPRSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 1
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 1
- MQVNVZUEPUIAFA-WDSKDSINSA-N Gly-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN MQVNVZUEPUIAFA-WDSKDSINSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- LJXWZPHEMJSNRC-KBPBESRZSA-N Gly-Gln-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LJXWZPHEMJSNRC-KBPBESRZSA-N 0.000 description 1
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- JUBDONGMHASUCN-IUCAKERBSA-N Gly-Glu-His Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O JUBDONGMHASUCN-IUCAKERBSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 1
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 1
- HPAIKDPJURGQLN-KBPBESRZSA-N Gly-His-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 HPAIKDPJURGQLN-KBPBESRZSA-N 0.000 description 1
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 1
- KGVHCTWYMPWEGN-FSPLSTOPSA-N Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CN KGVHCTWYMPWEGN-FSPLSTOPSA-N 0.000 description 1
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 1
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- YKJUITHASJAGHO-HOTGVXAUSA-N Gly-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN YKJUITHASJAGHO-HOTGVXAUSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- IFHJOBKVXBESRE-YUMQZZPRSA-N Gly-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN IFHJOBKVXBESRE-YUMQZZPRSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 1
- MDKCBHZLQJZOCJ-STQMWFEESA-N Gly-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)CN MDKCBHZLQJZOCJ-STQMWFEESA-N 0.000 description 1
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 1
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- QVDGHDFFYHKJPN-QWRGUYRKSA-N Gly-Phe-Cys Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O QVDGHDFFYHKJPN-QWRGUYRKSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- ZZJVYSAQQMDIRD-UWVGGRQHSA-N Gly-Pro-His Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ZZJVYSAQQMDIRD-UWVGGRQHSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- BXDLTKLPPKBVEL-FJXKBIBVSA-N Gly-Thr-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O BXDLTKLPPKBVEL-FJXKBIBVSA-N 0.000 description 1
- WSWWTQYHFCBKBT-DVJZZOLTSA-N Gly-Thr-Trp Chemical compound C[C@@H](O)[C@H](NC(=O)CN)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O WSWWTQYHFCBKBT-DVJZZOLTSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- FBUMPXILDTWCJW-UHFFFAOYSA-N Gly-Trp-Ala-Pro Natural products C=1NC2=CC=CC=C2C=1CC(NC(=O)CN)C(=O)NC(C)C(=O)N1CCCC1C(O)=O FBUMPXILDTWCJW-UHFFFAOYSA-N 0.000 description 1
- SFOXOSKVTLDEDM-HOTGVXAUSA-N Gly-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CN)=CNC2=C1 SFOXOSKVTLDEDM-HOTGVXAUSA-N 0.000 description 1
- LKJCZEPXHOIAIW-HOTGVXAUSA-N Gly-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN LKJCZEPXHOIAIW-HOTGVXAUSA-N 0.000 description 1
- PASHZZBXZYEXFE-LSDHHAIUSA-N Gly-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN)C(=O)O PASHZZBXZYEXFE-LSDHHAIUSA-N 0.000 description 1
- MREVELMMFOLESM-HOCLYGCPSA-N Gly-Trp-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O MREVELMMFOLESM-HOCLYGCPSA-N 0.000 description 1
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 1
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- 102000051366 Glycosyltransferases Human genes 0.000 description 1
- 108700023372 Glycosyltransferases Proteins 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 1
- VCDNHBNNPCDBKV-DLOVCJGASA-N His-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VCDNHBNNPCDBKV-DLOVCJGASA-N 0.000 description 1
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 1
- PDSUIXMZYNURGI-AVGNSLFASA-N His-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 PDSUIXMZYNURGI-AVGNSLFASA-N 0.000 description 1
- YPLYIXGKCRQZGW-SRVKXCTJSA-N His-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YPLYIXGKCRQZGW-SRVKXCTJSA-N 0.000 description 1
- PROLDOGUBQJNPG-RWMBFGLXSA-N His-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O PROLDOGUBQJNPG-RWMBFGLXSA-N 0.000 description 1
- QZAFGJNKLMNDEM-DCAQKATOSA-N His-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 QZAFGJNKLMNDEM-DCAQKATOSA-N 0.000 description 1
- AVQOSMRPITVTRB-CIUDSAMLSA-N His-Asn-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AVQOSMRPITVTRB-CIUDSAMLSA-N 0.000 description 1
- FPNWKONEZAVQJF-GUBZILKMSA-N His-Asn-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FPNWKONEZAVQJF-GUBZILKMSA-N 0.000 description 1
- MDBYBTWRMOAJAY-NHCYSSNCSA-N His-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MDBYBTWRMOAJAY-NHCYSSNCSA-N 0.000 description 1
- WZOGEMJIZBNFBK-CIUDSAMLSA-N His-Asp-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WZOGEMJIZBNFBK-CIUDSAMLSA-N 0.000 description 1
- AAXMRLWFJFDYQO-GUBZILKMSA-N His-Asp-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O AAXMRLWFJFDYQO-GUBZILKMSA-N 0.000 description 1
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 1
- FYVHHKMHFPMBBG-GUBZILKMSA-N His-Gln-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FYVHHKMHFPMBBG-GUBZILKMSA-N 0.000 description 1
- HVCRQRQPIIRNLY-IUCAKERBSA-N His-Gln-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N HVCRQRQPIIRNLY-IUCAKERBSA-N 0.000 description 1
- FLYSHWAAHYNKRT-JYJNAYRXSA-N His-Gln-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FLYSHWAAHYNKRT-JYJNAYRXSA-N 0.000 description 1
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 1
- ZYDYEPDFFVCUBI-SRVKXCTJSA-N His-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZYDYEPDFFVCUBI-SRVKXCTJSA-N 0.000 description 1
- OSZUPUINVNPCOE-SDDRHHMPSA-N His-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OSZUPUINVNPCOE-SDDRHHMPSA-N 0.000 description 1
- PQKCQZHAGILVIM-NKIYYHGXSA-N His-Glu-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O PQKCQZHAGILVIM-NKIYYHGXSA-N 0.000 description 1
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 1
- PMWSGVRIMIFXQH-KKUMJFAQSA-N His-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 PMWSGVRIMIFXQH-KKUMJFAQSA-N 0.000 description 1
- JBSLJUPMTYLLFH-MELADBBJSA-N His-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O JBSLJUPMTYLLFH-MELADBBJSA-N 0.000 description 1
- QPSCMXDWVKWVOW-BZSNNMDCSA-N His-His-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QPSCMXDWVKWVOW-BZSNNMDCSA-N 0.000 description 1
- VTZYMXGGXOFBMX-DJFWLOJKSA-N His-Ile-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O VTZYMXGGXOFBMX-DJFWLOJKSA-N 0.000 description 1
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 1
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 1
- WZBLRQQCDYYRTD-SIXJUCDHSA-N His-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N WZBLRQQCDYYRTD-SIXJUCDHSA-N 0.000 description 1
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 1
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 1
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 1
- GJMHMDKCJPQJOI-IHRRRGAJSA-N His-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 GJMHMDKCJPQJOI-IHRRRGAJSA-N 0.000 description 1
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 1
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 1
- CMMBEMZGNGYJRJ-IHRRRGAJSA-N His-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N CMMBEMZGNGYJRJ-IHRRRGAJSA-N 0.000 description 1
- SAPLASXFNUYUFE-CQDKDKBSSA-N His-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N SAPLASXFNUYUFE-CQDKDKBSSA-N 0.000 description 1
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 1
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 1
- ABCCKUZDWMERKT-AVGNSLFASA-N His-Pro-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O ABCCKUZDWMERKT-AVGNSLFASA-N 0.000 description 1
- FHKZHRMERJUXRJ-DCAQKATOSA-N His-Ser-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 FHKZHRMERJUXRJ-DCAQKATOSA-N 0.000 description 1
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 1
- WKEABZIITNXXQZ-CIUDSAMLSA-N His-Ser-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N WKEABZIITNXXQZ-CIUDSAMLSA-N 0.000 description 1
- PLCAEMGSYOYIPP-GUBZILKMSA-N His-Ser-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 PLCAEMGSYOYIPP-GUBZILKMSA-N 0.000 description 1
- CUEQQFOGARVNHU-VGDYDELISA-N His-Ser-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUEQQFOGARVNHU-VGDYDELISA-N 0.000 description 1
- RXKFKJVJVHLRIE-XIRDDKMYSA-N His-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC3=CN=CN3)N RXKFKJVJVHLRIE-XIRDDKMYSA-N 0.000 description 1
- FFKJUTZARGRVTH-KKUMJFAQSA-N His-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FFKJUTZARGRVTH-KKUMJFAQSA-N 0.000 description 1
- IXQGOKWTQPCIQM-YJRXYDGGSA-N His-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O IXQGOKWTQPCIQM-YJRXYDGGSA-N 0.000 description 1
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 1
- JVEKQAYXFGIISZ-HOCLYGCPSA-N His-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JVEKQAYXFGIISZ-HOCLYGCPSA-N 0.000 description 1
- FOCSWPCHUDVNLP-PMVMPFDFSA-N His-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC4=CN=CN4)N FOCSWPCHUDVNLP-PMVMPFDFSA-N 0.000 description 1
- DLTCGJZBNFOWFL-LKTVYLICSA-N His-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N DLTCGJZBNFOWFL-LKTVYLICSA-N 0.000 description 1
- SWBUZLFWGJETAO-KKUMJFAQSA-N His-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O SWBUZLFWGJETAO-KKUMJFAQSA-N 0.000 description 1
- UWNUQPZUSRFIIN-JUKXBJQTSA-N His-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N UWNUQPZUSRFIIN-JUKXBJQTSA-N 0.000 description 1
- DAKSMIWQZPHRIB-BZSNNMDCSA-N His-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DAKSMIWQZPHRIB-BZSNNMDCSA-N 0.000 description 1
- CSTDQOOBZBAJKE-BWAGICSOSA-N His-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N)O CSTDQOOBZBAJKE-BWAGICSOSA-N 0.000 description 1
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 1
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 1
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 1
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 1
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 1
- 108091006054 His-tagged proteins Proteins 0.000 description 1
- 101000771413 Homo sapiens Aquaporin-9 Proteins 0.000 description 1
- 101000921370 Homo sapiens Elongation of very long chain fatty acids protein 1 Proteins 0.000 description 1
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 1
- CISBRYJZMFWOHJ-JBDRJPRFSA-N Ile-Ala-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N CISBRYJZMFWOHJ-JBDRJPRFSA-N 0.000 description 1
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 1
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- WKXVAXOSIPTXEC-HAFWLYHUSA-N Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O WKXVAXOSIPTXEC-HAFWLYHUSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 1
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- JQLFYZMEXFNRFS-DJFWLOJKSA-N Ile-Asp-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N JQLFYZMEXFNRFS-DJFWLOJKSA-N 0.000 description 1
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 1
- PFTFEWHJSAXGED-ZKWXMUAHSA-N Ile-Cys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N PFTFEWHJSAXGED-ZKWXMUAHSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- ZGGWRNBSBOHIGH-HVTMNAMFSA-N Ile-Gln-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZGGWRNBSBOHIGH-HVTMNAMFSA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- WNQKUUQIVDDAFA-ZPFDUUQYSA-N Ile-Gln-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N WNQKUUQIVDDAFA-ZPFDUUQYSA-N 0.000 description 1
- BALLIXFZYSECCF-QEWYBTABSA-N Ile-Gln-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N BALLIXFZYSECCF-QEWYBTABSA-N 0.000 description 1
- HTDRTKMNJRRYOJ-SIUGBPQLSA-N Ile-Gln-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HTDRTKMNJRRYOJ-SIUGBPQLSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 1
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- LEHPJMKVGFPSSP-ZQINRCPSSA-N Ile-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 LEHPJMKVGFPSSP-ZQINRCPSSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- OEQKGSPBDVKYOC-ZKWXMUAHSA-N Ile-Gly-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OEQKGSPBDVKYOC-ZKWXMUAHSA-N 0.000 description 1
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- LWWILHPVAKKLQS-QXEWZRGKSA-N Ile-Gly-Met Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N LWWILHPVAKKLQS-QXEWZRGKSA-N 0.000 description 1
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- QNBYCZTZNOVDMI-HGNGGELXSA-N Ile-His Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QNBYCZTZNOVDMI-HGNGGELXSA-N 0.000 description 1
- ZXIGYKICRDFISM-DJFWLOJKSA-N Ile-His-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZXIGYKICRDFISM-DJFWLOJKSA-N 0.000 description 1
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 1
- CCYGNFBYUNHFSC-MGHWNKPDSA-N Ile-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CCYGNFBYUNHFSC-MGHWNKPDSA-N 0.000 description 1
- LNJLOZYNZFGJMM-DEQVHRJGSA-N Ile-His-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N LNJLOZYNZFGJMM-DEQVHRJGSA-N 0.000 description 1
- GTSAALPQZASLPW-KJYZGMDISA-N Ile-His-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N GTSAALPQZASLPW-KJYZGMDISA-N 0.000 description 1
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 1
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- PKGGWLOLRLOPGK-XUXIUFHCSA-N Ile-Leu-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PKGGWLOLRLOPGK-XUXIUFHCSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- XDUVMJCBYUKNFJ-MXAVVETBSA-N Ile-Lys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N XDUVMJCBYUKNFJ-MXAVVETBSA-N 0.000 description 1
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 1
- FJWALBCCVIHZBS-QXEWZRGKSA-N Ile-Met-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N FJWALBCCVIHZBS-QXEWZRGKSA-N 0.000 description 1
- ZUPJCJINYQISSN-XUXIUFHCSA-N Ile-Met-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUPJCJINYQISSN-XUXIUFHCSA-N 0.000 description 1
- KTTMFLSBTNBAHL-MXAVVETBSA-N Ile-Phe-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N KTTMFLSBTNBAHL-MXAVVETBSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 1
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- XHBYEMIUENPZLY-GMOBBJLQSA-N Ile-Pro-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O XHBYEMIUENPZLY-GMOBBJLQSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- BJECXJHLUJXPJQ-PYJNHQTQSA-N Ile-Pro-His Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N BJECXJHLUJXPJQ-PYJNHQTQSA-N 0.000 description 1
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 1
- AKQFLPNANHNTLP-VKOGCVSHSA-N Ile-Pro-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N AKQFLPNANHNTLP-VKOGCVSHSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- JDCQDJVYUXNCGF-SPOWBLRKSA-N Ile-Ser-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JDCQDJVYUXNCGF-SPOWBLRKSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- GMUYXHHJAGQHGB-TUBUOCAGSA-N Ile-Thr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMUYXHHJAGQHGB-TUBUOCAGSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 1
- BZUOLKFQVVBTJY-SLBDDTMCSA-N Ile-Trp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BZUOLKFQVVBTJY-SLBDDTMCSA-N 0.000 description 1
- HZVRQFKRALAMQS-SLBDDTMCSA-N Ile-Trp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZVRQFKRALAMQS-SLBDDTMCSA-N 0.000 description 1
- WKSHBPRUIRGWRZ-KCTSRDHCSA-N Ile-Trp-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N WKSHBPRUIRGWRZ-KCTSRDHCSA-N 0.000 description 1
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 1
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- JSLIXOUMAOUGBN-JUKXBJQTSA-N Ile-Tyr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JSLIXOUMAOUGBN-JUKXBJQTSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- IPFKIGNDTUOFAF-CYDGBPFRSA-N Ile-Val-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IPFKIGNDTUOFAF-CYDGBPFRSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- ZFOMKMMPBOQKMC-KXUCPTDWSA-N L-pyrrolysine Chemical compound C[C@@H]1CC=N[C@H]1C(=O)NCCCC[C@H]([NH3+])C([O-])=O ZFOMKMMPBOQKMC-KXUCPTDWSA-N 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 1
- DUBAVOVZNZKEQQ-AVGNSLFASA-N Leu-Arg-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCN=C(N)N DUBAVOVZNZKEQQ-AVGNSLFASA-N 0.000 description 1
- BAJIJEGGUYXZGC-CIUDSAMLSA-N Leu-Asn-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BAJIJEGGUYXZGC-CIUDSAMLSA-N 0.000 description 1
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 1
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- YORLGJINWYYIMX-KKUMJFAQSA-N Leu-Cys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YORLGJINWYYIMX-KKUMJFAQSA-N 0.000 description 1
- PIHFVNPEAHFNLN-KKUMJFAQSA-N Leu-Cys-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N PIHFVNPEAHFNLN-KKUMJFAQSA-N 0.000 description 1
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 1
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- XQXGNBFMAXWIGI-MXAVVETBSA-N Leu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 XQXGNBFMAXWIGI-MXAVVETBSA-N 0.000 description 1
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- ZALAVHVPPOHAOL-XUXIUFHCSA-N Leu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N ZALAVHVPPOHAOL-XUXIUFHCSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- MAXILRZVORNXBE-PMVMPFDFSA-N Leu-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MAXILRZVORNXBE-PMVMPFDFSA-N 0.000 description 1
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 1
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 1
- IDGRADDMTTWOQC-WDSOQIARSA-N Leu-Trp-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IDGRADDMTTWOQC-WDSOQIARSA-N 0.000 description 1
- LSLUTXRANSUGFY-XIRDDKMYSA-N Leu-Trp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O LSLUTXRANSUGFY-XIRDDKMYSA-N 0.000 description 1
- HOMFINRJHIIZNJ-HOCLYGCPSA-N Leu-Trp-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O HOMFINRJHIIZNJ-HOCLYGCPSA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- UFPLDOKWDNTTRP-ULQDDVLXSA-N Leu-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=C(O)C=C1 UFPLDOKWDNTTRP-ULQDDVLXSA-N 0.000 description 1
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- 229920000433 Lyocell Polymers 0.000 description 1
- BTSXLXFPMZXVPR-DLOVCJGASA-N Lys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BTSXLXFPMZXVPR-DLOVCJGASA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 1
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- NLOZZWJNIKKYSC-WDSOQIARSA-N Lys-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 NLOZZWJNIKKYSC-WDSOQIARSA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- MKBIVWXCFINCLE-SRVKXCTJSA-N Lys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N MKBIVWXCFINCLE-SRVKXCTJSA-N 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- HGZHSNBZDOLMLH-DCAQKATOSA-N Lys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N HGZHSNBZDOLMLH-DCAQKATOSA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- SSJBMGCZZXCGJJ-DCAQKATOSA-N Lys-Asp-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O SSJBMGCZZXCGJJ-DCAQKATOSA-N 0.000 description 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- OPTCSTACHGNULU-DCAQKATOSA-N Lys-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN OPTCSTACHGNULU-DCAQKATOSA-N 0.000 description 1
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 1
- GGNOBVSOZPHLCE-GUBZILKMSA-N Lys-Gln-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GGNOBVSOZPHLCE-GUBZILKMSA-N 0.000 description 1
- LXNPMPIQDNSMTA-AVGNSLFASA-N Lys-Gln-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 LXNPMPIQDNSMTA-AVGNSLFASA-N 0.000 description 1
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 1
- VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 1
- PRCHKVGXZVTALR-KKUMJFAQSA-N Lys-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCCN)N PRCHKVGXZVTALR-KKUMJFAQSA-N 0.000 description 1
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- URBJRJKWSUFCKS-AVGNSLFASA-N Lys-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N URBJRJKWSUFCKS-AVGNSLFASA-N 0.000 description 1
- GOVDTWNJCBRRBJ-DCAQKATOSA-N Lys-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GOVDTWNJCBRRBJ-DCAQKATOSA-N 0.000 description 1
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 1
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 1
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- SVSQSPICRKBMSZ-SRVKXCTJSA-N Lys-Pro-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O SVSQSPICRKBMSZ-SRVKXCTJSA-N 0.000 description 1
- MSSABBQOBUZFKZ-IHRRRGAJSA-N Lys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O MSSABBQOBUZFKZ-IHRRRGAJSA-N 0.000 description 1
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- DYJOORGDQIGZAS-DCAQKATOSA-N Lys-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N DYJOORGDQIGZAS-DCAQKATOSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 1
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- SEZADXQOJJTXPG-VFAJRCTISA-N Lys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N)O SEZADXQOJJTXPG-VFAJRCTISA-N 0.000 description 1
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 1
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 1
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 1
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- 101000763602 Manilkara zapota Thaumatin-like protein 1 Proteins 0.000 description 1
- 101000763586 Manilkara zapota Thaumatin-like protein 1a Proteins 0.000 description 1
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 1
- LMKSBGIUPVRHEH-FXQIFTODSA-N Met-Ala-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(N)=O LMKSBGIUPVRHEH-FXQIFTODSA-N 0.000 description 1
- QRHWTCJBCLGYRB-FXQIFTODSA-N Met-Ala-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O QRHWTCJBCLGYRB-FXQIFTODSA-N 0.000 description 1
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 1
- ZEDVFJPQNNBMST-CYDGBPFRSA-N Met-Arg-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZEDVFJPQNNBMST-CYDGBPFRSA-N 0.000 description 1
- NKDSBBBPGIVWEI-RCWTZXSCSA-N Met-Arg-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NKDSBBBPGIVWEI-RCWTZXSCSA-N 0.000 description 1
- SBSIKVMCCJUCBZ-GUBZILKMSA-N Met-Asn-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N SBSIKVMCCJUCBZ-GUBZILKMSA-N 0.000 description 1
- BXNZDLVLGYYFIB-FXQIFTODSA-N Met-Asn-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BXNZDLVLGYYFIB-FXQIFTODSA-N 0.000 description 1
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 1
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- DBOMZJOESVYERT-GUBZILKMSA-N Met-Asn-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N DBOMZJOESVYERT-GUBZILKMSA-N 0.000 description 1
- UAPZLLPGGOOCRO-IHRRRGAJSA-N Met-Asn-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N UAPZLLPGGOOCRO-IHRRRGAJSA-N 0.000 description 1
- HKRYNJSKVLZIFP-IHRRRGAJSA-N Met-Asn-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HKRYNJSKVLZIFP-IHRRRGAJSA-N 0.000 description 1
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 1
- XMMWDTUFTZMQFD-GMOBBJLQSA-N Met-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC XMMWDTUFTZMQFD-GMOBBJLQSA-N 0.000 description 1
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 1
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 1
- MCNGIXXCMJAURZ-VEVYYDQMSA-N Met-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)N)O MCNGIXXCMJAURZ-VEVYYDQMSA-N 0.000 description 1
- IZLCDZDNZFEDHB-DCAQKATOSA-N Met-Cys-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N IZLCDZDNZFEDHB-DCAQKATOSA-N 0.000 description 1
- HLYIDXAXQIJYIG-CIUDSAMLSA-N Met-Gln-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HLYIDXAXQIJYIG-CIUDSAMLSA-N 0.000 description 1
- RCMDUFDXDYTXOK-CIUDSAMLSA-N Met-Gln-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(O)=O RCMDUFDXDYTXOK-CIUDSAMLSA-N 0.000 description 1
- UOENBSHXYCHSAU-YUMQZZPRSA-N Met-Gln-Gly Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UOENBSHXYCHSAU-YUMQZZPRSA-N 0.000 description 1
- CRGKLOXHKICQOL-GARJFASQSA-N Met-Gln-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N CRGKLOXHKICQOL-GARJFASQSA-N 0.000 description 1
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 1
- YORIKIDJCPKBON-YUMQZZPRSA-N Met-Glu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YORIKIDJCPKBON-YUMQZZPRSA-N 0.000 description 1
- WWWGMQHQSAUXBU-BQBZGAKWSA-N Met-Gly-Asn Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O WWWGMQHQSAUXBU-BQBZGAKWSA-N 0.000 description 1
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 1
- SLQDSYZHHOKQSR-QXEWZRGKSA-N Met-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCSC SLQDSYZHHOKQSR-QXEWZRGKSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 1
- UZVKFARGHHMQGX-IUCAKERBSA-N Met-Gly-Met Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCSC UZVKFARGHHMQGX-IUCAKERBSA-N 0.000 description 1
- MHQXIBRPDKXDGZ-ZFWWWQNUSA-N Met-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 MHQXIBRPDKXDGZ-ZFWWWQNUSA-N 0.000 description 1
- BCRQJDMZQUHQSV-STQMWFEESA-N Met-Gly-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BCRQJDMZQUHQSV-STQMWFEESA-N 0.000 description 1
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 1
- XMQZLGBUJMMODC-AVGNSLFASA-N Met-His-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O XMQZLGBUJMMODC-AVGNSLFASA-N 0.000 description 1
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 1
- DJBCKVNHEIJLQA-GMOBBJLQSA-N Met-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCSC)N DJBCKVNHEIJLQA-GMOBBJLQSA-N 0.000 description 1
- GETCJHFFECHWHI-QXEWZRGKSA-N Met-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCSC)N GETCJHFFECHWHI-QXEWZRGKSA-N 0.000 description 1
- RRIHXWPHQSXHAQ-XUXIUFHCSA-N Met-Ile-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O RRIHXWPHQSXHAQ-XUXIUFHCSA-N 0.000 description 1
- FWAHLGXNBLWIKB-NAKRPEOUSA-N Met-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCSC FWAHLGXNBLWIKB-NAKRPEOUSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- LBNFTWKGISQVEE-AVGNSLFASA-N Met-Leu-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCSC LBNFTWKGISQVEE-AVGNSLFASA-N 0.000 description 1
- JCMMNFZUKMMECJ-DCAQKATOSA-N Met-Lys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JCMMNFZUKMMECJ-DCAQKATOSA-N 0.000 description 1
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 description 1
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 1
- ZRACLHJYVRBJFC-ULQDDVLXSA-N Met-Lys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZRACLHJYVRBJFC-ULQDDVLXSA-N 0.000 description 1
- IRVONVRHHJXWTK-RWMBFGLXSA-N Met-Lys-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N IRVONVRHHJXWTK-RWMBFGLXSA-N 0.000 description 1
- CGUYGMFQZCYJSG-DCAQKATOSA-N Met-Lys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CGUYGMFQZCYJSG-DCAQKATOSA-N 0.000 description 1
- UDOYVQQKQHZYMB-DCAQKATOSA-N Met-Met-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDOYVQQKQHZYMB-DCAQKATOSA-N 0.000 description 1
- QTMIXEQWGNIPBL-JYJNAYRXSA-N Met-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N QTMIXEQWGNIPBL-JYJNAYRXSA-N 0.000 description 1
- FBLBCGLSRXBANI-KKUMJFAQSA-N Met-Phe-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FBLBCGLSRXBANI-KKUMJFAQSA-N 0.000 description 1
- OIFHHODAXVWKJN-ULQDDVLXSA-N Met-Phe-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 OIFHHODAXVWKJN-ULQDDVLXSA-N 0.000 description 1
- HUURTRNKPBHHKZ-JYJNAYRXSA-N Met-Phe-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 HUURTRNKPBHHKZ-JYJNAYRXSA-N 0.000 description 1
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 1
- WYDFQSJOARJAMM-GUBZILKMSA-N Met-Pro-Asp Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WYDFQSJOARJAMM-GUBZILKMSA-N 0.000 description 1
- QEDGNYFHLXXIDC-DCAQKATOSA-N Met-Pro-Gln Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O QEDGNYFHLXXIDC-DCAQKATOSA-N 0.000 description 1
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 1
- XIGAHPDZLAYQOS-SRVKXCTJSA-N Met-Pro-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 XIGAHPDZLAYQOS-SRVKXCTJSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- SOAYQFDWEIWPPR-IHRRRGAJSA-N Met-Ser-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SOAYQFDWEIWPPR-IHRRRGAJSA-N 0.000 description 1
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- WXJLBSXNUHIGSS-OSUNSFLBSA-N Met-Thr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WXJLBSXNUHIGSS-OSUNSFLBSA-N 0.000 description 1
- YIGCDRZMZNDENK-UNQGMJICSA-N Met-Thr-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YIGCDRZMZNDENK-UNQGMJICSA-N 0.000 description 1
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 1
- ZBLSZPYQQRIHQU-RCWTZXSCSA-N Met-Thr-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ZBLSZPYQQRIHQU-RCWTZXSCSA-N 0.000 description 1
- RKRFGIBULDYDPF-XIRDDKMYSA-N Met-Trp-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKRFGIBULDYDPF-XIRDDKMYSA-N 0.000 description 1
- XTSBLBXAUIBMLW-KKUMJFAQSA-N Met-Tyr-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N XTSBLBXAUIBMLW-KKUMJFAQSA-N 0.000 description 1
- WOGNGBROIHHFAO-JYJNAYRXSA-N Met-Tyr-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCSC)C(=O)O)N WOGNGBROIHHFAO-JYJNAYRXSA-N 0.000 description 1
- ALTHVGNGGZZSAC-SRVKXCTJSA-N Met-Val-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N ALTHVGNGGZZSAC-SRVKXCTJSA-N 0.000 description 1
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 1
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 1
- 241000212300 Microbulbifer hydrolyticus Species 0.000 description 1
- 206010027626 Milia Diseases 0.000 description 1
- 101000966653 Musa acuminata Glucan endo-1,3-beta-glucosidase Proteins 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- CTQNGGLPUBDAKN-UHFFFAOYSA-N O-Xylene Chemical compound CC1=CC=CC=C1C CTQNGGLPUBDAKN-UHFFFAOYSA-N 0.000 description 1
- BZQFBWGGLXLEPQ-UHFFFAOYSA-N O-phosphoryl-L-serine Natural products OC(=O)C(N)COP(O)(O)=O BZQFBWGGLXLEPQ-UHFFFAOYSA-N 0.000 description 1
- 239000002033 PVDF binder Substances 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- JNRFYJZCMHHGMH-UBHSHLNASA-N Phe-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JNRFYJZCMHHGMH-UBHSHLNASA-N 0.000 description 1
- DPUOLKQSMYLRDR-UBHSHLNASA-N Phe-Arg-Ala Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 DPUOLKQSMYLRDR-UBHSHLNASA-N 0.000 description 1
- LZDIENNKWVXJMX-JYJNAYRXSA-N Phe-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CC=CC=C1 LZDIENNKWVXJMX-JYJNAYRXSA-N 0.000 description 1
- VHWOBXIWBDWZHK-IHRRRGAJSA-N Phe-Arg-Asp Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHWOBXIWBDWZHK-IHRRRGAJSA-N 0.000 description 1
- IWRZUGHCHFZYQZ-UFYCRDLUSA-N Phe-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 IWRZUGHCHFZYQZ-UFYCRDLUSA-N 0.000 description 1
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 1
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 1
- LNIIRLODKOWQIY-IHRRRGAJSA-N Phe-Asn-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LNIIRLODKOWQIY-IHRRRGAJSA-N 0.000 description 1
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 1
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 1
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- VUYCNYVLKACHPA-KKUMJFAQSA-N Phe-Asp-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VUYCNYVLKACHPA-KKUMJFAQSA-N 0.000 description 1
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 1
- KOUUGTKGEQZRHV-KKUMJFAQSA-N Phe-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KOUUGTKGEQZRHV-KKUMJFAQSA-N 0.000 description 1
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- AKJAKCBHLJGRBU-JYJNAYRXSA-N Phe-Glu-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AKJAKCBHLJGRBU-JYJNAYRXSA-N 0.000 description 1
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- OYQBFWWQSVIHBN-FHWLQOOXSA-N Phe-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OYQBFWWQSVIHBN-FHWLQOOXSA-N 0.000 description 1
- XEXSSIBQYNKFBX-KBPBESRZSA-N Phe-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=CC=C1 XEXSSIBQYNKFBX-KBPBESRZSA-N 0.000 description 1
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 1
- VJLLEKDQJSMHRU-STQMWFEESA-N Phe-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O VJLLEKDQJSMHRU-STQMWFEESA-N 0.000 description 1
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 1
- NRKNYPRRWXVELC-NQCBNZPSSA-N Phe-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=CC=C3)N NRKNYPRRWXVELC-NQCBNZPSSA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 1
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 1
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 1
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 1
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 1
- PTLMYJOMJLTMCB-KKUMJFAQSA-N Phe-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N PTLMYJOMJLTMCB-KKUMJFAQSA-N 0.000 description 1
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 1
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 1
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 1
- YMTMNYNEZDAGMW-RNXOBYDBSA-N Phe-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N YMTMNYNEZDAGMW-RNXOBYDBSA-N 0.000 description 1
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 1
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 1
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- ILGCZYGFYQLSDZ-KKUMJFAQSA-N Phe-Ser-His Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ILGCZYGFYQLSDZ-KKUMJFAQSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 1
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 1
- CXMSESHALPOLRE-MEYUZBJRSA-N Phe-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O CXMSESHALPOLRE-MEYUZBJRSA-N 0.000 description 1
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- ABEFOXGAIIJDCL-SFJXLCSZSA-N Phe-Thr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 ABEFOXGAIIJDCL-SFJXLCSZSA-N 0.000 description 1
- KCIKTPHTEYBXMG-BVSLBCMMSA-N Phe-Trp-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCIKTPHTEYBXMG-BVSLBCMMSA-N 0.000 description 1
- JLDZQPPLTJTJLE-IHPCNDPISA-N Phe-Trp-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JLDZQPPLTJTJLE-IHPCNDPISA-N 0.000 description 1
- WDOCBGZHAQQIBL-IHPCNDPISA-N Phe-Trp-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 WDOCBGZHAQQIBL-IHPCNDPISA-N 0.000 description 1
- QTDBZORPVYTRJU-KKXDTOCCSA-N Phe-Tyr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O QTDBZORPVYTRJU-KKXDTOCCSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- QUUCAHIYARMNBL-FHWLQOOXSA-N Phe-Tyr-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N QUUCAHIYARMNBL-FHWLQOOXSA-N 0.000 description 1
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 1
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 1
- IPVPGAADZXRZSH-RNXOBYDBSA-N Phe-Tyr-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O IPVPGAADZXRZSH-RNXOBYDBSA-N 0.000 description 1
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 1
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 1
- FXEKNHAJIMHRFJ-ULQDDVLXSA-N Phe-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N FXEKNHAJIMHRFJ-ULQDDVLXSA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- DRVIASBABBMZTF-GUBZILKMSA-N Pro-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1 DRVIASBABBMZTF-GUBZILKMSA-N 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- ONPFOYPPPOHMNH-UVBJJODRSA-N Pro-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ONPFOYPPPOHMNH-UVBJJODRSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- HPXVFFIIGOAQRV-DCAQKATOSA-N Pro-Arg-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O HPXVFFIIGOAQRV-DCAQKATOSA-N 0.000 description 1
- QBFONMUYNSNKIX-AVGNSLFASA-N Pro-Arg-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QBFONMUYNSNKIX-AVGNSLFASA-N 0.000 description 1
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 1
- KDIIENQUNVNWHR-JYJNAYRXSA-N Pro-Arg-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KDIIENQUNVNWHR-JYJNAYRXSA-N 0.000 description 1
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 1
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 1
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 1
- RETPETNFPLNLRV-JYJNAYRXSA-N Pro-Asn-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O RETPETNFPLNLRV-JYJNAYRXSA-N 0.000 description 1
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 1
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 1
- ZYBUKTMPPFQSHL-JYJNAYRXSA-N Pro-Asp-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZYBUKTMPPFQSHL-JYJNAYRXSA-N 0.000 description 1
- LUGOKRWYNMDGTD-FXQIFTODSA-N Pro-Cys-Asn Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O LUGOKRWYNMDGTD-FXQIFTODSA-N 0.000 description 1
- LCWXSALTPTZKNM-CIUDSAMLSA-N Pro-Cys-Glu Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O LCWXSALTPTZKNM-CIUDSAMLSA-N 0.000 description 1
- GQLOZEMWEBDEAY-NAKRPEOUSA-N Pro-Cys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GQLOZEMWEBDEAY-NAKRPEOUSA-N 0.000 description 1
- NOXSEHJOXCWRHK-DCAQKATOSA-N Pro-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 NOXSEHJOXCWRHK-DCAQKATOSA-N 0.000 description 1
- OZAPWFHRPINHND-GUBZILKMSA-N Pro-Cys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OZAPWFHRPINHND-GUBZILKMSA-N 0.000 description 1
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 1
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 1
- CMOIIANLNNYUTP-SRVKXCTJSA-N Pro-Gln-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CMOIIANLNNYUTP-SRVKXCTJSA-N 0.000 description 1
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 1
- DRIJZWBRGMJCDD-DCAQKATOSA-N Pro-Gln-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O DRIJZWBRGMJCDD-DCAQKATOSA-N 0.000 description 1
- KTFZQPLSPLWLKN-KKUMJFAQSA-N Pro-Gln-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KTFZQPLSPLWLKN-KKUMJFAQSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 1
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- FFSLAIOXRMOFIZ-GJZGRUSLSA-N Pro-Gly-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)CNC(=O)[C@@H]1CCCN1 FFSLAIOXRMOFIZ-GJZGRUSLSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- FDINZVJXLPILKV-DCAQKATOSA-N Pro-His-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O FDINZVJXLPILKV-DCAQKATOSA-N 0.000 description 1
- JRQCDSNPRNGWRG-AVGNSLFASA-N Pro-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2 JRQCDSNPRNGWRG-AVGNSLFASA-N 0.000 description 1
- BODDREDDDRZUCF-QTKMDUPCSA-N Pro-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2)O BODDREDDDRZUCF-QTKMDUPCSA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- FJLODLCIOJUDRG-PYJNHQTQSA-N Pro-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FJLODLCIOJUDRG-PYJNHQTQSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- SRBFGSGDNNQABI-FHWLQOOXSA-N Pro-Leu-Trp Chemical compound N([C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C(=O)[C@@H]1CCCN1 SRBFGSGDNNQABI-FHWLQOOXSA-N 0.000 description 1
- INDVYIOKMXFQFM-SRVKXCTJSA-N Pro-Lys-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O INDVYIOKMXFQFM-SRVKXCTJSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- ZJXXCGZFYQQETF-CYDGBPFRSA-N Pro-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 ZJXXCGZFYQQETF-CYDGBPFRSA-N 0.000 description 1
- AUYKOPJPKUCYHE-SRVKXCTJSA-N Pro-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 AUYKOPJPKUCYHE-SRVKXCTJSA-N 0.000 description 1
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 1
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- RSTWKJFWBKFOFC-JYJNAYRXSA-N Pro-Trp-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O RSTWKJFWBKFOFC-JYJNAYRXSA-N 0.000 description 1
- GNFHQWNCSSPOBT-ULQDDVLXSA-N Pro-Trp-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)N)C(=O)O GNFHQWNCSSPOBT-ULQDDVLXSA-N 0.000 description 1
- VGFFUEVZKRNRHT-ULQDDVLXSA-N Pro-Trp-Glu Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)O)C(=O)O VGFFUEVZKRNRHT-ULQDDVLXSA-N 0.000 description 1
- VVEQUISRWJDGMX-VKOGCVSHSA-N Pro-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 VVEQUISRWJDGMX-VKOGCVSHSA-N 0.000 description 1
- HOJUNFDJDAPVBI-BZSNNMDCSA-N Pro-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 HOJUNFDJDAPVBI-BZSNNMDCSA-N 0.000 description 1
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 1
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 1
- BXHRXLMCYSZSIY-STECZYCISA-N Pro-Tyr-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O BXHRXLMCYSZSIY-STECZYCISA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 1
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 241000192142 Proteobacteria Species 0.000 description 1
- 101100163901 Rattus norvegicus Asic2 gene Proteins 0.000 description 1
- 241000282849 Ruminantia Species 0.000 description 1
- 241000193448 Ruminiclostridium thermocellum Species 0.000 description 1
- 238000010847 SEQUEST Methods 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- BKOKTRCZXRIQPX-ZLUOBGJFSA-N Ser-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N BKOKTRCZXRIQPX-ZLUOBGJFSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 1
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 1
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 1
- RFBKULCUBJAQFT-BIIVOSGPSA-N Ser-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CO)N)C(=O)O RFBKULCUBJAQFT-BIIVOSGPSA-N 0.000 description 1
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 1
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 1
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- GRSLLFZTTLBOQX-CIUDSAMLSA-N Ser-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N GRSLLFZTTLBOQX-CIUDSAMLSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- BRIZMMZEYSAKJX-QEJZJMRPSA-N Ser-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N BRIZMMZEYSAKJX-QEJZJMRPSA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 1
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 1
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 1
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 1
- WEQAYODCJHZSJZ-KKUMJFAQSA-N Ser-His-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 WEQAYODCJHZSJZ-KKUMJFAQSA-N 0.000 description 1
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 1
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 1
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- AMRRYKHCILPAKD-FXQIFTODSA-N Ser-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N AMRRYKHCILPAKD-FXQIFTODSA-N 0.000 description 1
- KJKQUQXDEKMPDK-FXQIFTODSA-N Ser-Met-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O KJKQUQXDEKMPDK-FXQIFTODSA-N 0.000 description 1
- HEYZPTCCEIWHRO-IHRRRGAJSA-N Ser-Met-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HEYZPTCCEIWHRO-IHRRRGAJSA-N 0.000 description 1
- HJAXVYLCKDPPDF-SRVKXCTJSA-N Ser-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N HJAXVYLCKDPPDF-SRVKXCTJSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- TVPQRPNBYCRRLL-IHRRRGAJSA-N Ser-Phe-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O TVPQRPNBYCRRLL-IHRRRGAJSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- XQAPEISNMXNKGE-FXQIFTODSA-N Ser-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CS)C(=O)O XQAPEISNMXNKGE-FXQIFTODSA-N 0.000 description 1
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- AXKJPUBALUNJEO-UBHSHLNASA-N Ser-Trp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O AXKJPUBALUNJEO-UBHSHLNASA-N 0.000 description 1
- UQGAAZXSCGWMFU-UBHSHLNASA-N Ser-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N UQGAAZXSCGWMFU-UBHSHLNASA-N 0.000 description 1
- SDFUZKIAHWRUCS-QEJZJMRPSA-N Ser-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N SDFUZKIAHWRUCS-QEJZJMRPSA-N 0.000 description 1
- ATEQEHCGZKBEMU-GQGQLFGLSA-N Ser-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N ATEQEHCGZKBEMU-GQGQLFGLSA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 1
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- LSHUNRICNSEEAN-BPUTZDHNSA-N Ser-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N LSHUNRICNSEEAN-BPUTZDHNSA-N 0.000 description 1
- BLRPTPMANUNPDV-UHFFFAOYSA-N Silane Chemical compound [SiH4] BLRPTPMANUNPDV-UHFFFAOYSA-N 0.000 description 1
- 241001506240 Spartina alterniflora var. glabra Species 0.000 description 1
- 101150006914 TRP1 gene Proteins 0.000 description 1
- 241000203780 Thermobifida fusca Species 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 1
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- XVNZSJIKGJLQLH-RCWTZXSCSA-N Thr-Arg-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N)O XVNZSJIKGJLQLH-RCWTZXSCSA-N 0.000 description 1
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 1
- VASYSJHSMSBTDU-LKXGYXEUSA-N Thr-Asn-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O VASYSJHSMSBTDU-LKXGYXEUSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- NOWXWJLVGTVJKM-PBCZWWQYSA-N Thr-Asp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O NOWXWJLVGTVJKM-PBCZWWQYSA-N 0.000 description 1
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- APIQKJYZDWVOCE-VEVYYDQMSA-N Thr-Asp-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O APIQKJYZDWVOCE-VEVYYDQMSA-N 0.000 description 1
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 1
- VUKVQVNKIIZBPO-HOUAVDHOSA-N Thr-Asp-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VUKVQVNKIIZBPO-HOUAVDHOSA-N 0.000 description 1
- QWMPARMKIDVBLV-VZFHVOOUSA-N Thr-Cys-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O QWMPARMKIDVBLV-VZFHVOOUSA-N 0.000 description 1
- ODSAPYVQSLDRSR-LKXGYXEUSA-N Thr-Cys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O ODSAPYVQSLDRSR-LKXGYXEUSA-N 0.000 description 1
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- GUZGCDIZVGODML-NKIYYHGXSA-N Thr-Gln-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O GUZGCDIZVGODML-NKIYYHGXSA-N 0.000 description 1
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- VULNJDORNLBPNG-SWRJLBSHSA-N Thr-Glu-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VULNJDORNLBPNG-SWRJLBSHSA-N 0.000 description 1
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- YSXYEJWDHBCTDJ-DVJZZOLTSA-N Thr-Gly-Trp Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O YSXYEJWDHBCTDJ-DVJZZOLTSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- XSTGOZBBXFKGHA-YJRXYDGGSA-N Thr-His-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O XSTGOZBBXFKGHA-YJRXYDGGSA-N 0.000 description 1
- WBCCCPZIJIJTSD-TUBUOCAGSA-N Thr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H]([C@@H](C)O)N WBCCCPZIJIJTSD-TUBUOCAGSA-N 0.000 description 1
- FKIGTIXHSRNKJU-IXOXFDKPSA-N Thr-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CN=CN1 FKIGTIXHSRNKJU-IXOXFDKPSA-N 0.000 description 1
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 1
- NCGUQWSJUKYCIT-SZZJOZGLSA-N Thr-His-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NCGUQWSJUKYCIT-SZZJOZGLSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- ISLDRLHVPXABBC-IEGACIPQSA-N Thr-Leu-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISLDRLHVPXABBC-IEGACIPQSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 1
- FDQXPJCLVPFKJW-KJEVXHAQSA-N Thr-Met-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O FDQXPJCLVPFKJW-KJEVXHAQSA-N 0.000 description 1
- GUHLYMZJVXUIPO-RCWTZXSCSA-N Thr-Met-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GUHLYMZJVXUIPO-RCWTZXSCSA-N 0.000 description 1
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 1
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 1
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- UGFSAPWZBROURT-IXOXFDKPSA-N Thr-Phe-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N)O UGFSAPWZBROURT-IXOXFDKPSA-N 0.000 description 1
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 1
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- TZQWJCGVCIJDMU-HEIBUPTGSA-N Thr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N)O TZQWJCGVCIJDMU-HEIBUPTGSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- FBQHKSPOIAFUEI-OWLDWWDNSA-N Thr-Trp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O FBQHKSPOIAFUEI-OWLDWWDNSA-N 0.000 description 1
- XGUAUKUYQHBUNY-SWRJLBSHSA-N Thr-Trp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XGUAUKUYQHBUNY-SWRJLBSHSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 1
- NJGMALCNYAMYCB-JRQIVUDYSA-N Thr-Tyr-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJGMALCNYAMYCB-JRQIVUDYSA-N 0.000 description 1
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- CURFABYITJVKEW-QTKMDUPCSA-N Thr-Val-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O CURFABYITJVKEW-QTKMDUPCSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- WFZYXGSAPWKTHR-XEGUGMAKSA-N Trp-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WFZYXGSAPWKTHR-XEGUGMAKSA-N 0.000 description 1
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 1
- CXUFDWZBHKUGKK-CABZTGNLSA-N Trp-Ala-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O)=CNC2=C1 CXUFDWZBHKUGKK-CABZTGNLSA-N 0.000 description 1
- MJBBMTOGSOSAKJ-HJXMPXNTSA-N Trp-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MJBBMTOGSOSAKJ-HJXMPXNTSA-N 0.000 description 1
- VZBWRZGNEPBRDE-HZUKXOBISA-N Trp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N VZBWRZGNEPBRDE-HZUKXOBISA-N 0.000 description 1
- BIJDDZBDSJLWJY-PJODQICGSA-N Trp-Ala-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O BIJDDZBDSJLWJY-PJODQICGSA-N 0.000 description 1
- HYNAKPYFEYJMAS-XIRDDKMYSA-N Trp-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HYNAKPYFEYJMAS-XIRDDKMYSA-N 0.000 description 1
- LAIUAVGWZYTBKN-VHWLVUOQSA-N Trp-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O LAIUAVGWZYTBKN-VHWLVUOQSA-N 0.000 description 1
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 1
- QNTBGBCOEYNAPV-CWRNSKLLSA-N Trp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O QNTBGBCOEYNAPV-CWRNSKLLSA-N 0.000 description 1
- BXKWZPXTTSCOMX-AQZXSJQPSA-N Trp-Asn-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXKWZPXTTSCOMX-AQZXSJQPSA-N 0.000 description 1
- IQGJAHMZWBTRIF-UBHSHLNASA-N Trp-Asp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IQGJAHMZWBTRIF-UBHSHLNASA-N 0.000 description 1
- VEYXZZGMIBKXCN-UBHSHLNASA-N Trp-Asp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VEYXZZGMIBKXCN-UBHSHLNASA-N 0.000 description 1
- VKMOGXREKGVZAF-QEJZJMRPSA-N Trp-Asp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VKMOGXREKGVZAF-QEJZJMRPSA-N 0.000 description 1
- FKAPNDWDLDWZNF-QEJZJMRPSA-N Trp-Asp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FKAPNDWDLDWZNF-QEJZJMRPSA-N 0.000 description 1
- LTLBNCDNXQCOLB-UBHSHLNASA-N Trp-Asp-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 LTLBNCDNXQCOLB-UBHSHLNASA-N 0.000 description 1
- OFCKFBGRYHOKFP-IHPCNDPISA-N Trp-Asp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N OFCKFBGRYHOKFP-IHPCNDPISA-N 0.000 description 1
- RERIQEJUYCLJQI-QRTARXTBSA-N Trp-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERIQEJUYCLJQI-QRTARXTBSA-N 0.000 description 1
- HDQJVXVRGJUDML-UBHSHLNASA-N Trp-Cys-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HDQJVXVRGJUDML-UBHSHLNASA-N 0.000 description 1
- JZHJLBPBQKPTNX-UBHSHLNASA-N Trp-Cys-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 JZHJLBPBQKPTNX-UBHSHLNASA-N 0.000 description 1
- CZSMNLQMRWPGQF-XEGUGMAKSA-N Trp-Gln-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CZSMNLQMRWPGQF-XEGUGMAKSA-N 0.000 description 1
- OBWQLWYNNZPWGX-QEJZJMRPSA-N Trp-Gln-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OBWQLWYNNZPWGX-QEJZJMRPSA-N 0.000 description 1
- DQDXHYIEITXNJY-BPUTZDHNSA-N Trp-Gln-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N DQDXHYIEITXNJY-BPUTZDHNSA-N 0.000 description 1
- WPSYJHFHZYJXMW-JSGCOSHPSA-N Trp-Gln-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O WPSYJHFHZYJXMW-JSGCOSHPSA-N 0.000 description 1
- MDDYTWOFHZFABW-SZMVWBNQSA-N Trp-Gln-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 MDDYTWOFHZFABW-SZMVWBNQSA-N 0.000 description 1
- ZJKZLNAECPIUTL-JBACZVJFSA-N Trp-Gln-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 ZJKZLNAECPIUTL-JBACZVJFSA-N 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- UDCHKDYNMRJYMI-QEJZJMRPSA-N Trp-Glu-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UDCHKDYNMRJYMI-QEJZJMRPSA-N 0.000 description 1
- FNOQJVHFVLVMOS-AAEUAGOBSA-N Trp-Gly-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N FNOQJVHFVLVMOS-AAEUAGOBSA-N 0.000 description 1
- MHCLIYHJRXZBGJ-AAEUAGOBSA-N Trp-Gly-Cys Chemical compound N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)NCC(=O)N[C@@H](CS)C(=O)O MHCLIYHJRXZBGJ-AAEUAGOBSA-N 0.000 description 1
- SVGAWGVHFIYAEE-JSGCOSHPSA-N Trp-Gly-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 SVGAWGVHFIYAEE-JSGCOSHPSA-N 0.000 description 1
- WVHUFSCKCBQKJW-HKUYNNGSSA-N Trp-Gly-Tyr Chemical compound C([C@H](NC(=O)CNC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 WVHUFSCKCBQKJW-HKUYNNGSSA-N 0.000 description 1
- HNIWONZFMIPCCT-SIXJUCDHSA-N Trp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N HNIWONZFMIPCCT-SIXJUCDHSA-N 0.000 description 1
- HLDFBNPSURDYEN-VHWLVUOQSA-N Trp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HLDFBNPSURDYEN-VHWLVUOQSA-N 0.000 description 1
- LFMMXTLRXKBPMC-FDARSICLSA-N Trp-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N LFMMXTLRXKBPMC-FDARSICLSA-N 0.000 description 1
- BYSKNUASOAGJSS-NQCBNZPSSA-N Trp-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N BYSKNUASOAGJSS-NQCBNZPSSA-N 0.000 description 1
- YDTKYBHPRULROG-LTHWPDAASA-N Trp-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YDTKYBHPRULROG-LTHWPDAASA-N 0.000 description 1
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 1
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 1
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 1
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 1
- UKWSFUSPGPBJGU-VFAJRCTISA-N Trp-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O UKWSFUSPGPBJGU-VFAJRCTISA-N 0.000 description 1
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 1
- YTYHAYZPOARHAP-HOCLYGCPSA-N Trp-Lys-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N YTYHAYZPOARHAP-HOCLYGCPSA-N 0.000 description 1
- RCMHSGRBJCMFLR-BPUTZDHNSA-N Trp-Met-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 RCMHSGRBJCMFLR-BPUTZDHNSA-N 0.000 description 1
- VOCHZIJXPRBVSI-XIRDDKMYSA-N Trp-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N VOCHZIJXPRBVSI-XIRDDKMYSA-N 0.000 description 1
- HJWLQSFTGDQSRX-BPUTZDHNSA-N Trp-Met-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HJWLQSFTGDQSRX-BPUTZDHNSA-N 0.000 description 1
- MICFJCRQBFSKPA-UMPQAUOISA-N Trp-Met-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 MICFJCRQBFSKPA-UMPQAUOISA-N 0.000 description 1
- LVTKHGUGBGNBPL-UHFFFAOYSA-N Trp-P-1 Chemical compound N1C2=CC=CC=C2C2=C1C(C)=C(N)N=C2C LVTKHGUGBGNBPL-UHFFFAOYSA-N 0.000 description 1
- PWPJLBWYRTVYQS-PMVMPFDFSA-N Trp-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PWPJLBWYRTVYQS-PMVMPFDFSA-N 0.000 description 1
- JZSLIZLZGWOJBJ-PMVMPFDFSA-N Trp-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N JZSLIZLZGWOJBJ-PMVMPFDFSA-N 0.000 description 1
- BOMYCJXTWRMKJA-RNXOBYDBSA-N Trp-Phe-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N BOMYCJXTWRMKJA-RNXOBYDBSA-N 0.000 description 1
- CSOBBJWWODOYGW-ILWGZMRPSA-N Trp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)C(=O)O CSOBBJWWODOYGW-ILWGZMRPSA-N 0.000 description 1
- KOVPHHXMHLFWPL-BPUTZDHNSA-N Trp-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CC(=O)N)C(=O)O KOVPHHXMHLFWPL-BPUTZDHNSA-N 0.000 description 1
- OJKVFAWXPGCJMF-BPUTZDHNSA-N Trp-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CO)C(=O)O OJKVFAWXPGCJMF-BPUTZDHNSA-N 0.000 description 1
- KXFYAQUYJKOQMI-QEJZJMRPSA-N Trp-Ser-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 KXFYAQUYJKOQMI-QEJZJMRPSA-N 0.000 description 1
- VDCGPCSLAJAKBB-XIRDDKMYSA-N Trp-Ser-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N VDCGPCSLAJAKBB-XIRDDKMYSA-N 0.000 description 1
- UMIACFRBELJMGT-GQGQLFGLSA-N Trp-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UMIACFRBELJMGT-GQGQLFGLSA-N 0.000 description 1
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 description 1
- HIZDHWHVOLUGOX-BPUTZDHNSA-N Trp-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O HIZDHWHVOLUGOX-BPUTZDHNSA-N 0.000 description 1
- HTGJDTPQYFMKNC-VFAJRCTISA-N Trp-Thr-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 HTGJDTPQYFMKNC-VFAJRCTISA-N 0.000 description 1
- JKLJVFCPCWMNMZ-UMPQAUOISA-N Trp-Thr-Met Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCSC)C(O)=O)[C@@H](C)O)=CNC2=C1 JKLJVFCPCWMNMZ-UMPQAUOISA-N 0.000 description 1
- UPUNWAXSLPBMRK-XTWBLICNSA-N Trp-Thr-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UPUNWAXSLPBMRK-XTWBLICNSA-N 0.000 description 1
- YMNSKLWJSOANFS-OYDLWJJNSA-N Trp-Trp-Met Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(O)=O YMNSKLWJSOANFS-OYDLWJJNSA-N 0.000 description 1
- RQKMZXSRILVOQZ-GMVOTWDCSA-N Trp-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N RQKMZXSRILVOQZ-GMVOTWDCSA-N 0.000 description 1
- PKZIWSHDJYIPRH-JBACZVJFSA-N Trp-Tyr-Gln Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKZIWSHDJYIPRH-JBACZVJFSA-N 0.000 description 1
- ZPZNQAZHMCLTOA-PXDAIIFMSA-N Trp-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 ZPZNQAZHMCLTOA-PXDAIIFMSA-N 0.000 description 1
- SSSDKJMQMZTMJP-BVSLBCMMSA-N Trp-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 SSSDKJMQMZTMJP-BVSLBCMMSA-N 0.000 description 1
- UOXPLPBMEPLZBW-WDSOQIARSA-N Trp-Val-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 UOXPLPBMEPLZBW-WDSOQIARSA-N 0.000 description 1
- IEESWNWYUOETOT-BVSLBCMMSA-N Trp-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1ccccc1)C(O)=O IEESWNWYUOETOT-BVSLBCMMSA-N 0.000 description 1
- RWTFCAMQLFNPTK-UMPQAUOISA-N Trp-Val-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 RWTFCAMQLFNPTK-UMPQAUOISA-N 0.000 description 1
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 1
- SDNVRAKIJVKAGS-LKTVYLICSA-N Tyr-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N SDNVRAKIJVKAGS-LKTVYLICSA-N 0.000 description 1
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 1
- XGEUYEOEZYFHRL-KKXDTOCCSA-N Tyr-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XGEUYEOEZYFHRL-KKXDTOCCSA-N 0.000 description 1
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 1
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 1
- KSVMDJJCYKIXTK-IGNZVWTISA-N Tyr-Ala-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KSVMDJJCYKIXTK-IGNZVWTISA-N 0.000 description 1
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 1
- WTXQBCCKXIKKHB-JYJNAYRXSA-N Tyr-Arg-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WTXQBCCKXIKKHB-JYJNAYRXSA-N 0.000 description 1
- SEFNTZYRPGBDCY-IHRRRGAJSA-N Tyr-Arg-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O SEFNTZYRPGBDCY-IHRRRGAJSA-N 0.000 description 1
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 1
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 1
- DYEGCOJHFNJBKB-UFYCRDLUSA-N Tyr-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 DYEGCOJHFNJBKB-UFYCRDLUSA-N 0.000 description 1
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 1
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 1
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 1
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 1
- JRXKIVGWMMIIOF-YDHLFZDLSA-N Tyr-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JRXKIVGWMMIIOF-YDHLFZDLSA-N 0.000 description 1
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- JFDGVHXRCKEBAU-KKUMJFAQSA-N Tyr-Asp-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JFDGVHXRCKEBAU-KKUMJFAQSA-N 0.000 description 1
- WPVGRKLNHJJCEN-BZSNNMDCSA-N Tyr-Asp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WPVGRKLNHJJCEN-BZSNNMDCSA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- SMLCYZYQFRTLCO-UWJYBYFXSA-N Tyr-Cys-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O SMLCYZYQFRTLCO-UWJYBYFXSA-N 0.000 description 1
- CRHFOYCJGVJPLE-AVGNSLFASA-N Tyr-Gln-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CRHFOYCJGVJPLE-AVGNSLFASA-N 0.000 description 1
- RIJPHPUJRLEOAK-JYJNAYRXSA-N Tyr-Gln-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O RIJPHPUJRLEOAK-JYJNAYRXSA-N 0.000 description 1
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 1
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 1
- MPKPIWFFDWVJGC-IRIUXVKKSA-N Tyr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O MPKPIWFFDWVJGC-IRIUXVKKSA-N 0.000 description 1
- LMLBOGIOLHZXOT-JYJNAYRXSA-N Tyr-Glu-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O LMLBOGIOLHZXOT-JYJNAYRXSA-N 0.000 description 1
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 1
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 1
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 1
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- YYZPVPJCOGGQPC-JYJNAYRXSA-N Tyr-His-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYZPVPJCOGGQPC-JYJNAYRXSA-N 0.000 description 1
- WPXKRJVHBXYLDT-JUKXBJQTSA-N Tyr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N WPXKRJVHBXYLDT-JUKXBJQTSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 1
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 1
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 1
- AZZLDIDWPZLCCW-ZEWNOJEFSA-N Tyr-Ile-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O AZZLDIDWPZLCCW-ZEWNOJEFSA-N 0.000 description 1
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 1
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 1
- CWVHKVVKAQIJKY-ACRUOGEOSA-N Tyr-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N CWVHKVVKAQIJKY-ACRUOGEOSA-N 0.000 description 1
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 1
- KHUVIWRRFMPVHD-JYJNAYRXSA-N Tyr-Met-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O KHUVIWRRFMPVHD-JYJNAYRXSA-N 0.000 description 1
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 1
- FDKDGFGTHGJKNV-FHWLQOOXSA-N Tyr-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FDKDGFGTHGJKNV-FHWLQOOXSA-N 0.000 description 1
- LRHBBGDMBLFYGL-FHWLQOOXSA-N Tyr-Phe-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LRHBBGDMBLFYGL-FHWLQOOXSA-N 0.000 description 1
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 1
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 1
- PLXQRTXVLZUNMU-RNXOBYDBSA-N Tyr-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC4=CC=C(C=C4)O)N PLXQRTXVLZUNMU-RNXOBYDBSA-N 0.000 description 1
- FGVFBDZSGQTYQX-UFYCRDLUSA-N Tyr-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O FGVFBDZSGQTYQX-UFYCRDLUSA-N 0.000 description 1
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 1
- PYJKETPLFITNKS-IHRRRGAJSA-N Tyr-Pro-Asn Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O PYJKETPLFITNKS-IHRRRGAJSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- RCMWNNJFKNDKQR-UFYCRDLUSA-N Tyr-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 RCMWNNJFKNDKQR-UFYCRDLUSA-N 0.000 description 1
- VXFXIBCCVLJCJT-JYJNAYRXSA-N Tyr-Pro-Pro Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(O)=O VXFXIBCCVLJCJT-JYJNAYRXSA-N 0.000 description 1
- VPEFOFYNHBWFNQ-UFYCRDLUSA-N Tyr-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 VPEFOFYNHBWFNQ-UFYCRDLUSA-N 0.000 description 1
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 1
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- PLVVHGFEMSDRET-IHPCNDPISA-N Tyr-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC3=CC=C(C=C3)O)N PLVVHGFEMSDRET-IHPCNDPISA-N 0.000 description 1
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 1
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- NAHUCETZGZZSEX-IHPCNDPISA-N Tyr-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NAHUCETZGZZSEX-IHPCNDPISA-N 0.000 description 1
- HMPMGPISLMLHSI-JBACZVJFSA-N Tyr-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N HMPMGPISLMLHSI-JBACZVJFSA-N 0.000 description 1
- RZAGEHHVNYESNR-RNXOBYDBSA-N Tyr-Trp-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RZAGEHHVNYESNR-RNXOBYDBSA-N 0.000 description 1
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 1
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 1
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 1
- TYGHOWWWMTWVKM-HJOGWXRNSA-N Tyr-Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 TYGHOWWWMTWVKM-HJOGWXRNSA-N 0.000 description 1
- KSGKJSFPWSMJHK-JNPHEJMOSA-N Tyr-Tyr-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSGKJSFPWSMJHK-JNPHEJMOSA-N 0.000 description 1
- FZADUTOCSFDBRV-RNXOBYDBSA-N Tyr-Tyr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 FZADUTOCSFDBRV-RNXOBYDBSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- VKYDVKAKGDNZED-STECZYCISA-N Tyr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N VKYDVKAKGDNZED-STECZYCISA-N 0.000 description 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 1
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 1
- CCEVJBJLPRNAFH-BVSLBCMMSA-N Tyr-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N CCEVJBJLPRNAFH-BVSLBCMMSA-N 0.000 description 1
- 102000003425 Tyrosinase Human genes 0.000 description 1
- 108060008724 Tyrosinase Proteins 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- WOCYUGQDXPTQPY-FXQIFTODSA-N Val-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N WOCYUGQDXPTQPY-FXQIFTODSA-N 0.000 description 1
- REJBPZVUHYNMEN-LSJOCFKGSA-N Val-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N REJBPZVUHYNMEN-LSJOCFKGSA-N 0.000 description 1
- SMKXLHVZIFKQRB-GUBZILKMSA-N Val-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N SMKXLHVZIFKQRB-GUBZILKMSA-N 0.000 description 1
- VJOWWOGRNXRQMF-UVBJJODRSA-N Val-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 VJOWWOGRNXRQMF-UVBJJODRSA-N 0.000 description 1
- WGHVMKFREWGCGR-SRVKXCTJSA-N Val-Arg-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WGHVMKFREWGCGR-SRVKXCTJSA-N 0.000 description 1
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 1
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 description 1
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 1
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 1
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 1
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 1
- BWVHQINTNLVWGZ-ZKWXMUAHSA-N Val-Cys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BWVHQINTNLVWGZ-ZKWXMUAHSA-N 0.000 description 1
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- JXGWQYWDUOWQHA-DZKIICNBSA-N Val-Gln-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N JXGWQYWDUOWQHA-DZKIICNBSA-N 0.000 description 1
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- RKIGNDAHUOOIMJ-BQFCYCMXSA-N Val-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 RKIGNDAHUOOIMJ-BQFCYCMXSA-N 0.000 description 1
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 1
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 1
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 1
- DHINLYMWMXQGMQ-IHRRRGAJSA-N Val-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 DHINLYMWMXQGMQ-IHRRRGAJSA-N 0.000 description 1
- ZTKGDWOUYRRAOQ-ULQDDVLXSA-N Val-His-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N ZTKGDWOUYRRAOQ-ULQDDVLXSA-N 0.000 description 1
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 1
- JPPXDMBGXJBTIB-ULQDDVLXSA-N Val-His-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N JPPXDMBGXJBTIB-ULQDDVLXSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 1
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- IEBGHUMBJXIXHM-AVGNSLFASA-N Val-Lys-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N IEBGHUMBJXIXHM-AVGNSLFASA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 1
- OJPRSVJGNCAKQX-SRVKXCTJSA-N Val-Met-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OJPRSVJGNCAKQX-SRVKXCTJSA-N 0.000 description 1
- SVFRYKBZHUGKLP-QXEWZRGKSA-N Val-Met-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVFRYKBZHUGKLP-QXEWZRGKSA-N 0.000 description 1
- OJOMXGVLFKYDKP-QXEWZRGKSA-N Val-Met-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OJOMXGVLFKYDKP-QXEWZRGKSA-N 0.000 description 1
- WHVSJHJTMUHYBT-SRVKXCTJSA-N Val-Met-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N WHVSJHJTMUHYBT-SRVKXCTJSA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- GIAZPLMMQOERPN-YUMQZZPRSA-N Val-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GIAZPLMMQOERPN-YUMQZZPRSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- VSCIANXXVZOYOC-AVGNSLFASA-N Val-Pro-His Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VSCIANXXVZOYOC-AVGNSLFASA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- IRAUYEAFPFPVND-UVBJJODRSA-N Val-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 IRAUYEAFPFPVND-UVBJJODRSA-N 0.000 description 1
- SUGRIIAOLCDLBD-ZOBUZTSGSA-N Val-Trp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SUGRIIAOLCDLBD-ZOBUZTSGSA-N 0.000 description 1
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 1
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 1
- GTACFKZDQFTVAI-STECZYCISA-N Val-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 GTACFKZDQFTVAI-STECZYCISA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- ODUHAIXFXFACDY-SRVKXCTJSA-N Val-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C ODUHAIXFXFACDY-SRVKXCTJSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 108010027199 Xylosidases Proteins 0.000 description 1
- MFGSTSNUMVXOHJ-UHFFFAOYSA-M [I+].CC([O-])=O Chemical compound [I+].CC([O-])=O MFGSTSNUMVXOHJ-UHFFFAOYSA-M 0.000 description 1
- 239000003463 adsorbent Substances 0.000 description 1
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 1
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 210000003484 anatomy Anatomy 0.000 description 1
- 108010076955 arabinogalactan endo-1,4-beta-galactosidase Proteins 0.000 description 1
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 1
- 125000000089 arabinosyl group Chemical group C1([C@@H](O)[C@H](O)[C@H](O)CO1)* 0.000 description 1
- 108010089975 arginyl-glycyl-aspartyl-serine Proteins 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 108010027234 aspartyl-glycyl-glutamyl-alanine Proteins 0.000 description 1
- 235000015173 baked goods and baking mixes Nutrition 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 230000001588 bifunctional effect Effects 0.000 description 1
- 238000010504 bond cleavage reaction Methods 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 230000004106 carbohydrate catabolism Effects 0.000 description 1
- 238000004177 carbon cycle Methods 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 101150072516 cbhA gene Proteins 0.000 description 1
- 101150008389 cbhB gene Proteins 0.000 description 1
- 101150080131 celB gene Proteins 0.000 description 1
- 230000021164 cell adhesion Effects 0.000 description 1
- 230000006037 cell lysis Effects 0.000 description 1
- 230000008618 cell wall macromolecule catabolic process Effects 0.000 description 1
- 108091008394 cellulose binding proteins Proteins 0.000 description 1
- 230000008166 cellulose biosynthesis Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 210000004081 cilia Anatomy 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 239000007857 degradation product Substances 0.000 description 1
- 239000001064 degrader Substances 0.000 description 1
- YSMODUONRAFBET-UHFFFAOYSA-N delta-DL-hydroxylysine Natural products NCC(O)CCC(N)C(O)=O YSMODUONRAFBET-UHFFFAOYSA-N 0.000 description 1
- 101150095218 der gene Proteins 0.000 description 1
- VGONTNSXDCQUGY-UHFFFAOYSA-N desoxyinosine Natural products C1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 VGONTNSXDCQUGY-UHFFFAOYSA-N 0.000 description 1
- MXHRCPNRJAMMIM-UHFFFAOYSA-N desoxyuridine Natural products C1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 MXHRCPNRJAMMIM-UHFFFAOYSA-N 0.000 description 1
- 229950006137 dexfosfoserine Drugs 0.000 description 1
- 229940079919 digestives enzyme preparation Drugs 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000000132 electrospray ionisation Methods 0.000 description 1
- 108010091371 endoglucanase 1 Proteins 0.000 description 1
- 108010091384 endoglucanase 2 Proteins 0.000 description 1
- 108010092450 endoglucanase Z Proteins 0.000 description 1
- 101150022470 engB gene Proteins 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000009088 enzymatic function Effects 0.000 description 1
- 238000011067 equilibration Methods 0.000 description 1
- YSMODUONRAFBET-UHNVWZDZSA-N erythro-5-hydroxy-L-lysine Chemical compound NC[C@H](O)CC[C@H](N)C(O)=O YSMODUONRAFBET-UHNVWZDZSA-N 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 238000005562 fading Methods 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 230000003619 fibrillary effect Effects 0.000 description 1
- 210000003495 flagella Anatomy 0.000 description 1
- 239000000796 flavoring agent Substances 0.000 description 1
- 235000019634 flavors Nutrition 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 235000020400 fruit nectar Nutrition 0.000 description 1
- 235000013572 fruit purees Nutrition 0.000 description 1
- 235000011389 fruit/vegetable juice Nutrition 0.000 description 1
- 235000012055 fruits and vegetables Nutrition 0.000 description 1
- 230000008571 general function Effects 0.000 description 1
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 108010026195 glycanase Proteins 0.000 description 1
- 229940047135 glycate Drugs 0.000 description 1
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 1
- 108010008671 glycyl-tryptophyl-methionine Proteins 0.000 description 1
- 108010084760 glycyl-tyrosyl-glycyl-aspartate Proteins 0.000 description 1
- 239000001307 helium Substances 0.000 description 1
- 229910052734 helium Inorganic materials 0.000 description 1
- SWQJXJOGLNCZEY-UHFFFAOYSA-N helium atom Chemical compound [He] SWQJXJOGLNCZEY-UHFFFAOYSA-N 0.000 description 1
- 239000008241 heterogeneous mixture Substances 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 238000005040 ion trap Methods 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 101150066555 lacZ gene Proteins 0.000 description 1
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 230000010534 mechanism of action Effects 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010022588 methionyl-lysyl-proline Proteins 0.000 description 1
- 108010090114 methionyl-tyrosyl-lysine Proteins 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- OYKBQNOPCSXWBL-SNAWJCMRSA-N n-hydroxy-3-[(e)-3-(hydroxyamino)-3-oxoprop-1-enyl]benzamide Chemical compound ONC(=O)\C=C\C1=CC=CC(C(=O)NO)=C1 OYKBQNOPCSXWBL-SNAWJCMRSA-N 0.000 description 1
- 229910052759 nickel Inorganic materials 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 235000008390 olive oil Nutrition 0.000 description 1
- 239000004006 olive oil Substances 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 1
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 150000004713 phosphodiesters Chemical group 0.000 description 1
- 230000000865 phosphorylative effect Effects 0.000 description 1
- BZQFBWGGLXLEPQ-REOHCLBHSA-N phosphoserine Chemical compound OC(=O)[C@@H](N)COP(O)(O)=O BZQFBWGGLXLEPQ-REOHCLBHSA-N 0.000 description 1
- DCWXELXMIBXGTH-UHFFFAOYSA-N phosphotyrosine Chemical compound OC(=O)C(N)CC1=CC=C(OP(O)(O)=O)C=C1 DCWXELXMIBXGTH-UHFFFAOYSA-N 0.000 description 1
- 238000013492 plasmid preparation Methods 0.000 description 1
- 231100000614 poison Toxicity 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 229920002704 polyhistidine Polymers 0.000 description 1
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 229920002477 rna polymer Polymers 0.000 description 1
- 108010029895 rubimetide Proteins 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 239000012723 sample buffer Substances 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- ZKZBPNGNEQAJSX-UHFFFAOYSA-N selenocysteine Natural products [SeH]CC(N)C(O)=O ZKZBPNGNEQAJSX-UHFFFAOYSA-N 0.000 description 1
- 229940055619 selenocysteine Drugs 0.000 description 1
- 235000016491 selenocysteine Nutrition 0.000 description 1
- 230000001953 sensory effect Effects 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 229910000077 silane Inorganic materials 0.000 description 1
- 239000002002 slurry Substances 0.000 description 1
- 239000000779 smoke Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 238000013179 statistical model Methods 0.000 description 1
- 230000003335 steric effect Effects 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 239000011550 stock solution Substances 0.000 description 1
- 239000012536 storage buffer Substances 0.000 description 1
- 238000012916 structural analysis Methods 0.000 description 1
- 239000012134 supernatant fraction Substances 0.000 description 1
- 239000013595 supernatant sample Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000004885 tandem mass spectrometry Methods 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 239000003440 toxic substance Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 210000005239 tubule Anatomy 0.000 description 1
- 108010029599 tyrosyl-glutamyl-tryptophan Proteins 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 108010077037 tyrosyl-tyrosyl-phenylalanine Proteins 0.000 description 1
- 150000003672 ureas Chemical class 0.000 description 1
- 238000011514 vinification Methods 0.000 description 1
- 230000001018 virulence Effects 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
- 239000003643 water by type Substances 0.000 description 1
- 239000008096 xylene Substances 0.000 description 1
- 101150085982 xyn11A gene Proteins 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
- C12N9/2405—Glucanases
- C12N9/2434—Glucanases acting on beta-1,4-glucosidic bonds
- C12N9/2437—Cellulases (3.2.1.4; 3.2.1.74; 3.2.1.91; 3.2.1.150)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/02—Preparation of oxygen-containing organic compounds containing a hydroxy group
- C12P7/04—Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
- C12P7/06—Ethanol, i.e. non-beverage
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N1/00—Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
- C12N1/20—Bacteria; Culture media therefor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/14—Preparation of compounds containing saccharide radicals produced by the action of a carbohydrase (EC 3.2.x), e.g. by alpha-amylase, e.g. by cellulase, hemicellulase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y302/00—Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
- C12Y302/01—Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
- C12Y302/01004—Cellulase (3.2.1.4), i.e. endo-1,4-beta-glucanase
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02E—REDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
- Y02E50/00—Technologies for the production of fuel of non-fossil origin
- Y02E50/10—Biofuels, e.g. bio-diesel
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Virology (AREA)
- Tropical Medicine & Parasitology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Immobilizing And Processing Of Enzymes And Microorganisms (AREA)
Abstract
Description
[Cross-Reference to Related Applications]
This application is a continuation-in-part of United States Patent Application No. 11/121,154 (filed May 4, 2005) and claims priority to United States Provisional Application No. 60/567,971 (filed May 4, 2004), the contents of which are incorporated herein by reference in their entirety.
[Statement Regarding Federally Sponsored Research or Development]
This invention was made with government support under Contract Number SA7528051 E awarded by the National Oceanic and Atmospheric Administration (NOAA) and Contract Number DEB0109869 awarded by the National Science Foundation (NSF). The government has certain rights in the invention.
[Sequence Listing]
This application contains a lengthy sequence listing submitted on three CD-Rs in lieu of a printed paper copy, which is incorporated herein by reference in its entirety. The CD-Rs, recorded on September 14, 2005 in related United States Application No. 11/121,154 (filed May 4, 2005), are labeled "CRF", "Copy 1" and "Copy 2", respectively, and each contains only one identical 828 KB file (18172121.APP).
[Technical Field]
The present invention relates generally to degradative enzymes and systems. In particular, the present invention relates to plant cell wall degrading enzymes and related proteins found in Microbulbifer degradans, to systems comprising such enzymes and/or proteins, and to methods of using such systems to obtain ethanol.
Cellulases and related enzymes have been used in the food, beer, wine, animal feed, textile production and laundering, pulp and paper, and agricultural industries. Various such uses are described in M. K. Bhat, "Cellulases and related enzymes in biotechnology" (Biotechnology Advances 18 (2000) 355-383), the contents of which are incorporated herein by reference in their entirety.
The plant cell wall is composed of a heterogeneous mixture of complex polysaccharides that interact through covalent and non-covalent means. The complex polysaccharides of higher plant cell walls include, for example, cellulose (β-1,4 glucan), which generally makes up 35-50% of the carbon found in cell wall components. Cellulose polymers self-associate through hydrogen bonding, van der Waals interactions, and hydrophobic interactions to form semi-crystalline cellulose microfibrils. These microfibrils also include non-crystalline regions, generally known as amorphous cellulose. The cellulose microfibrils are embedded in a matrix formed of hemicelluloses (including, for example, xylans, arabinans, and mannans), pectins (for example, galacturonans and galactans), and various other β-1,3 and β-1,4 glucans. These matrix polymers are often substituted with, for example, arabinose, galactose, and/or xylose residues to yield highly complex arabinoxylans, arabinogalactans, galactomannans, and xyloglucans. The hemicellulose matrix is, in turn, surrounded by polyphenolic lignin.
The complexity of the matrix makes it difficult for microorganisms to degrade, because the lignin and hemicellulose components must be broken down before enzymes can act on the core cellulose microfibrils. Ordinarily, a consortium of different microorganisms is required to degrade cell wall polymers and release the constituent monosaccharides. For saccharification of plant cell walls, the lignin must be permeabilized and the hemicellulose removed so that cellulose-degrading enzymes can act on their substrate. For industrial saccharification of cell walls, large amounts of primarily fungal cellulases are added to processed feedstock that has been treated with dilute sulfuric acid at high temperature and pressure to permeabilize the lignin and partially saccharify the hemicellulose components.
Saccharophagus degradans strain 2-40 (referred to herein as "S. degradans 2-40" or "2-40") is a representative of an emerging group of marine bacteria that degrade complex polysaccharides (CP). S. degradans has been deposited at the American Type Culture Collection under accession number ATCC 43961. S. degradans 2-40, previously known as, and used herein synonymously with, Microbulbifer degradans strain 2-40 ("M. degradans 2-40"), is a marine γ-proteobacterium that was isolated from decaying Spartina alterniflora, a salt marsh cord grass in the Chesapeake Bay watershed. Consistent with its isolation from decaying plant matter, S. degradans strain 2-40 can degrade many complex polysaccharides, including cellulose, pectin, xylan, and chitin, which are common components of the cell walls of higher plants. S. degradans strain 2-40 can also degrade protein, starch, pullulan, and alginic acid, as well as algal cell wall components such as agar, agarose, and laminarin. In addition to degrading this plethora of polymers, S. degradans strain 2-40 can utilize each of the polysaccharides as its sole carbon source. S. degradans strain 2-40 is therefore not only an excellent model of microbial degradation of insoluble complex polysaccharides (ICPs) but can also serve as a paradigm for the complete metabolism of these ICPs. ICPs are polymerized saccharides that are used for form and structure in animals and plants. They are insoluble in water and are therefore difficult to alter chemically.
Microbulbifer degradans strain 2-40 requires at least 1% sea salts for growth and tolerates salt concentrations as high as 10%. It is a highly pleomorphic, Gram-negative bacterium that is aerobic, generally rod-shaped, and motile by means of a single polar flagellum. Previous work has determined that 2-40 can degrade at least 10 different carbohydrate polymers (CP), including agar, chitin, alginic acid, carboxymethylcellulose (CMC), β-glucan, laminarin, pectin, pullulan, starch and xylan (Ensor, Stotz et al., 1999). It has also been shown to synthesize a true tyrosinase (Kelley, Coyne et al., 1990). 16S rDNA analysis shows that 2-40 is a member of the gamma subclass of the phylum Proteobacteria, related to Microbulbifer hydrolyticus (Gonzalez and Weiner 2000) and to Teredinibacter sp. (Distel, Morrill et al., 2002), cellulolytic nitrogen-fixing bacteria that are symbionts of shipworms.
The agarase, chitinase and alginase systems have been generally characterized. Zymogram activity gels indicate that all three systems consist of multiple depolymerases, and multiple lines of evidence suggest that at least some of these depolymerases are attached to the cell surface (Stotz 1994; Whitehead 1997; Chakravorty 1998). Activity assays reveal that the majority of 2-40 enzyme activity resides in the cell fraction during logarithmic growth on CP, whereas in later growth phases the bulk of the activity is found in the supernatant and cell-bound activity decreases dramatically (Stotz 1994). Growth on CP is also accompanied by dramatic changes in cell morphology. Glucose-grown cultures of 2-40 are relatively uniform in cell size and shape, with generally smooth and featureless cell surfaces. When grown on agarose, alginate, or chitin, however, 2-40 cells exhibit novel surface structures and features.
These exo- and extracellular structures (ES) include small protuberances, larger bleb-like structures that appear to be released from the cell, fine fimbriae or pili, and a network of fibril-like appendages that may be tubules of some kind. Immunoelectron microscopy has shown that agarases, alginases and/or chitinases are localized to at least some types of 2-40 ES. The surface topology and the pattern of immunolocalization of 2-40 enzymes to surface protuberances are very similar to what has been observed in cellulolytic members of the genus Clostridium.
The oldest methods studied for converting lignocellulosic material to sugars are based on acid hydrolysis (see, e.g., the review by Grethlein, Chemical Breakdown Of Cellulosic Materials, J. APPL. CHEM. BIOTECHNOL. 28:296-308 (1978)). This process can involve the use of concentrated or dilute acids. For example, U.S. Pat. Nos. 5,221,537 and 5,536,325, incorporated herein by reference in their entirety, describe a two-step process for the acid hydrolysis of lignocellulosic material to glucose. These processes have many disadvantages, including, for example, the need to recover the acid, the specialized materials of construction required, the need to minimize water in the system, and the high production of degradation products that can inhibit fermentation to ethanol.
To overcome the problems of the acid hydrolysis process, cellulose conversion processes using enzymatic hydrolysis have been developed. See, for example, U.S. Pat. No. 5,916,780, incorporated herein by reference in its entirety, which describes enzymatic hydrolysis combined with a pretreatment step that disrupts the integrity of the fiber structure and makes the cellulose more accessible to attack by cellulase enzymes in the treatment step.
U.S. Pat. No. 6,333,181, incorporated herein by reference in its entirety, describes the production of ethanol from lignocellulosic material by treating a mixture of lignocellulose, cellulase, and an ethanologenic microorganism with ultrasound.
There is a need to identify enzyme systems that use cellulose as a substrate, to express the genes encoding the proteins using suitable vectors, to identify and isolate the amino acid products (enzymes and non-enzymatic products), and to use these products, as well as organisms containing these genes, for purposes such as the production and uses of ethanol described in the Bhat reference. There is also a need, in the art of using lignocellulosic materials for ethanol production, to develop more effective treatment methods that give higher yields of ethanol.
One aspect of the present invention relates to systems of plant wall active carbohydrases and related proteins.
A further aspect of the invention relates to a method for degrading substrates that comprise cellulose. The method involves contacting the cellulose-containing substrate with one or more compounds obtained from Saccharophagus degradans strain 2-40.
A further aspect of the invention relates to groups of enzymes that catalyze reactions involving cellulose.
Another aspect of the invention relates to polynucleotides that encode polypeptides having cellulose-degrading or cellulose-binding activity.
A further aspect of the invention relates to chimeric genes and vectors comprising genes that encode polypeptides having cellulose depolymerase activity.
A further aspect of the invention relates to a method of identifying a nucleotide sequence from S. degradans that encodes a polypeptide having either of the following activities: cellulose depolymerase activity or cellulose binding. An S. degradans genomic library is constructed in E. coli and screened for the desired activity. Transformed E. coli cells having the specific activity are generated and isolated.
Another aspect of the invention relates to a method of producing ethanol from lignocellulosic material, comprising treating the lignocellulosic material with an effective saccharifying amount of one or more of the compounds listed in FIGS. 4-11 to obtain saccharides, and converting the saccharides to produce ethanol. The conversion of the sugars to ethanol, and its recovery, can be carried out by any established method known to those skilled in the art, without limitation. Examples include the use of ethanologenic microorganisms such as Zymomonas, Erwinia, Klebsiella, Xanthomonas, and Escherichia, in particular Escherichia coli KO11 and Klebsiella oxytoca P2.
A further aspect of the invention relates to a method of producing ethanol from lignocellulosic material, comprising contacting the lignocellulosic material with a microorganism that expresses an effective saccharifying amount of one or more of the compounds listed in FIGS. 4-11 to obtain saccharides, and converting the saccharides to produce ethanol.
A further aspect of the invention relates to a method of producing ethanol from lignocellulosic material, comprising contacting the lignocellulosic material with an ethanologenic microorganism that expresses an effective saccharifying amount of one or more of the compounds listed in FIGS. 4-11, thereby producing ethanol. Such an ethanologenic microorganism expresses an effective amount of one or more of the compounds listed in FIGS. 4-11 to saccharify the lignocellulosic material, and an effective amount of one or more enzymes or enzyme systems that subsequently catalyze (individually or in concert) the conversion of the saccharides to ethanol.
A further aspect of the invention relates to the use of the cellulose-degrading compositions in food, beer, wine, animal feed, textile production and laundering, the pulp and paper industry, and agricultural industries.
The present invention is advantageous in that saccharification of plant cell walls, and ethanol production processes that include saccharification, can be achieved without permeabilizing the lignin and/or without removing or partially saccharifying the hemicellulose or hemicellulose components before the cellulose-degrading enzymes can act on their substrate. The invention also enables saccharification, and ethanol production processes that include saccharification, without, or with reduced, high pressure, high temperature, fungal cellulases, and acid (e.g., sulfuric acid) in the saccharification process.
Other aspects, features, and advantages of the invention will become apparent from the following detailed description, taken in conjunction with the accompanying drawings, which are a part of the invention and which illustrate, by way of example, the principles of the invention.
FIG. 1A shows the chemical formula of cellulose.
FIG. 1B depicts the physical structure of cellulose.
FIG. 2A depicts the degradation of cellulose fibers.
FIG. 2B shows a chemical representation of the degradation of cellulose to cellobiose and glucose.
FIG. 3 shows SDS-PAGE and zymogram analysis of 2-40 culture supernatants.
FIG. 4 lists the predicted cellulases of S. degradans 2-40 (the sequences from FIGS. 4-10 are set out in the appendix as SEQ ID NOs: 1-214, respectively, in order of appearance).
FIG. 5 lists the predicted xylanases, xylosidases and related accessory enzymes of M. degradans 2-40.
FIG. 6 lists the predicted pectinases and related accessory enzymes of S. degradans 2-40.
FIG. 7 lists the arabinanases and arabinogalactanases of S. degradans 2-40.
FIG. 8 lists the mannanases of S. degradans 2-40.
FIG. 9 lists the laminarinases of S. degradans 2-40.
FIG. 10 lists selected carbohydrate-binding module proteins of S. degradans 2-40.
FIG. 11 lists recombinant proteins of S. degradans 2-40 and a comparison of their predicted and observed molecular weights.
Analysis of the genome sequence of S. degradans 2-40 reveals an abundance of genes encoding enzymes that are predicted to degrade plant-derived carbohydrates. To date, 2-40 is the only sequenced marine bacterium with apparently complete cellulase and xylanase systems, as well as a number of other systems containing plant wall active carbohydrases.
It therefore appears that 2-40 can play an important role in the marine carbon cycle, functioning as a "super-degrader" that mediates the breakdown of CP from various algal, plant, and invertebrate sources. Its remarkable enzymatic diversity, novel surface features (ES), and the apparent localization of carbohydrases to ES make S. degradans 2-40 an intriguing organism in which to study the cell biology of CP metabolism and surface enzyme attachment.
It has now been found that 2-40 has a complete complement of suitably positioned enzymes for degrading plant cell walls. This can be established by the following approaches: a) annotation and genomic analysis of the 2-40 plant-wall active enzyme systems, b) identification of enzymes and other proteins that contain domains or motifs that may be involved in surface enzyme display, c) development of testable models based on the identified protein motifs, and d) cloning and expression of selected proteins for the production of antibody probes, to allow testing of the proposed models of surface enzyme display using immunoelectron microscopy.
These efforts have been greatly facilitated by the recent sequencing of 2-40, which allows a strategy in which genes encoding proteins of potential relevance to surface attachment are identified on the basis of sequence homology to modules or domains known to function in surface attachment and/or adhesion.
Enzymatic and non-enzymatic ORFs with compelling sequence elements are identified using BLAST and other amino acid sequence alignment and analysis tools. Genes of interest can be cloned into E. coli, expressed with in-frame polyhistidine affinity tag fusions, and purified by nickel ion chromatography, thereby providing a means of identifying and producing recombinant 2-40 proteins for study and for antibody probe production.
The genome sequence of 2-40 was recently obtained in collaboration with the Department of Energy's Joint Genome Institute (JGI). The finished draft sequence, dated January 19, 2005, comprises 5.1 Mbp contained in a single contiguous sequence. Automated annotation of the open reading frames (ORFs) was performed by the computational genomics division of Oak Ridge National Laboratory (ORNL), and the annotated sequence is available on the World Wide Web (http://genome.ornl.gov/microbial/mdeg).
The initial genome annotation revealed a variety of carbohydrases, including a number of agarases, alginases and chitinases. Remarkably, the genome also contains an abundance of enzymes with predicted roles in the degradation of plant cell wall polymers, including a number of ORFs with homology to cellulases, xylanases, pectinases, and other glucanases and glucosidases. In all, over 180 open reading frames with a probable role in carbohydrate catabolism have been identified in the draft genome.
To begin defining the 2-40 cellulase, xylanase and pectinase systems, genes were initially classified by BLAST homology as belonging to one of those systems. Ambiguous ORFs were tentatively assigned to the class of their best hit. Other tools used to refine this tentative classification include Pfam (Protein families database of alignments and HMMs; http://www.sanger.ac.uk/Software/Pfam/) and SMART (Simple Modular Architecture Research Tool; http://smart.embl-heidelberg.de/), which use hidden Markov models (statistical models of sequence consensus homology) and multiple alignments to identify discrete modular domains within a protein sequence. These analyses were relatively successful; however, many ORFs remained difficult to classify on the basis of sequence homology alone.
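The best-hit assignment described above can be illustrated with a minimal Python sketch. It is not the procedure actually used for the 2-40 annotation: the input format (tuples of ORF identifier, enzyme system, and E-value), the function name `classify_orfs`, and the `tentative_factor` threshold for flagging an ambiguous ORF are all assumptions made for illustration.

```python
from collections import defaultdict

def classify_orfs(hits, tentative_factor=1e3):
    """Assign each ORF to the enzyme system of its best (lowest E-value) hit.

    hits: iterable of (orf_id, system, e_value) tuples (hypothetical format).
    An assignment is flagged tentative when a hit from a different system
    has an E-value within `tentative_factor` of the best hit (an assumed,
    arbitrary threshold).
    Returns {orf_id: (system, is_tentative)}.
    """
    by_orf = defaultdict(list)
    for orf_id, system, e_value in hits:
        by_orf[orf_id].append((e_value, system))

    assignments = {}
    for orf_id, scored in by_orf.items():
        scored.sort()  # lowest E-value first
        best_e, best_system = scored[0]
        # E-values of competing hits that point to a different system
        rivals = [e for e, s in scored[1:] if s != best_system]
        tentative = bool(rivals) and rivals[0] <= best_e * tentative_factor
        assignments[orf_id] = (best_system, tentative)
    return assignments

# Made-up example hits, not data from the 2-40 genome
example_hits = [
    ("orf1", "cellulase", 1e-80),
    ("orf1", "xylanase", 1e-10),
    ("orf2", "xylanase", 1e-30),
    ("orf2", "cellulase", 1e-29),
]
example_assignments = classify_orfs(example_hits)
```

In this made-up input, `orf1` has one dominant hit, whereas `orf2` has two nearly equal hits from different systems and is therefore flagged as tentative, which is the kind of ORF that would be passed on to domain-level tools such as Pfam or SMART.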
Enzymes have traditionally been classified by substrate specificity and reaction products. Before the genomic era, function was regarded as the most amenable (and perhaps the most useful) basis for comparing enzymes, and assays for various enzymatic activities have long been well developed, resulting in the familiar EC classification scheme. Cellulases and other O-glycosyl hydrolases, which act on the glycosidic bond between two carbohydrate moieties (or between a carbohydrate and a non-carbohydrate moiety, as occurs in nitrophenol-glycoside derivatives), are designated EC 3.2.1.-, with the final number indicating the exact type of bond cleaved. Under this scheme, an endo-acting cellulase (1,4-β-endoglucanase) is designated EC 3.2.1.4.
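As a small illustration of the EC designations mentioned above, the following Python sketch parses an EC number into its four fields and checks whether it falls under the O-glycosyl hydrolase group EC 3.2.1.-. The names `ECNumber`, `parse_ec`, and `is_o_glycosyl_hydrolase` are hypothetical helpers introduced here, not part of any existing library.

```python
from typing import NamedTuple, Optional

class ECNumber(NamedTuple):
    enzyme_class: Optional[int]
    subclass: Optional[int]
    sub_subclass: Optional[int]
    serial: Optional[int]  # None when written as "-" (unspecified)

def parse_ec(text: str) -> ECNumber:
    """Parse strings such as 'EC 3.2.1.4', 'EC 3.2.1.4.' or 'EC 3.2.1.-'."""
    body = text.strip()
    if body.upper().startswith("EC"):
        body = body[2:]
    # Drop a trailing sentence period, then split into the four fields
    fields = body.strip().rstrip(".").split(".")
    if len(fields) != 4:
        raise ValueError(f"expected four EC fields, got {text!r}")
    values = [None if f.strip() == "-" else int(f) for f in fields]
    return ECNumber(*values)

def is_o_glycosyl_hydrolase(ec: ECNumber) -> bool:
    """True when the first three fields are 3.2.1, i.e. EC 3.2.1.-"""
    return ec[:3] == (3, 2, 1)

endoglucanase = parse_ec("EC 3.2.1.4.")
```

Here the final field (4 for the endo-acting cellulase) is the only part that distinguishes the type of bond cleaved; nothing in the number encodes sequence or structural relatedness.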
With the ease of determining the nucleotide sequences of cloned genes and the advent of widespread genome sequencing projects, the greatly increased amount of sequence data has facilitated the comparison and analysis of related genes and proteins on an unprecedented scale. This is particularly true of carbohydrases; it has become clear that classifying such enzymes by reaction specificity, as in the E.C. nomenclature scheme, is limited by its inability to convey sequence similarity. In addition, a growing number of carbohydrases have been crystallized and their 3-D structures solved.
One of the major revelations of carbohydrase sequence and structure analysis is that there are discrete families of enzymes with related sequences, which contain conserved three-dimensional folds that can be predicted from their amino acid sequences. It has also been shown that enzymes with the same three-dimensional fold exhibit the same stereospecificity of hydrolysis, even when they catalyze different reactions (Henrissat, Teeri et al. 1998; Coutinho and Henrissat 1999).
These findings form the basis of a sequence-based classification of carbohydrase modules, which is available in the form of an internet database, the Carbohydrate-Active enZYme server (CAZy), at http://afmb.cnrs-mrs.fr/CAZY/index.html (Coutinho and Henrissat 1999; Coutinho and Henrissat 1999).
CAZy defines four major classes of carbohydrases, based on the type of reaction catalyzed: glycosyl hydrolases (GHs), glycosyltransferases (GTs), polysaccharide lyases (PLs), and carbohydrate esterases (CEs). GHs cleave glycosidic bonds through hydrolysis. This class includes many familiar polysaccharidases, such as cellulases, xylanases, and agarases. GTs generally function in polysaccharide synthesis, catalyzing the formation of new glycosidic bonds through the transfer of a sugar molecule from an activated carrier molecule, such as uridine diphosphate (UDP), to an acceptor molecule. While GTs often function in biosynthesis, there are examples in which the mechanism is exploited for bond cleavage, as occurs in the phosphorolytic cleavage of cellobiose and cellodextrins (Lou, Dawson et al. 1996). PLs use a β-elimination mechanism to mediate bond cleavage and are commonly involved in alginate and pectin depolymerization. CEs generally act as deacetylases on O- or N-substituted polysaccharides. Common examples include xylan and chitin deacetylases. Sequence-based families are designated by number within each class, as in GH5: glycosyl hydrolase family 5. Members of GH5 hydrolyze β-1,4 bonds in a retaining fashion, using a double-displacement mechanism that results in retention of the original bond stereospecificity. Retention or inversion of the anomeric configuration is a general characteristic of a given GH family (Henrissat and Bairoch 1993; Coutinho and Henrissat 1999). Many examples of endocellulases, xylanases and mannanases belonging to GH5 have been reported, illustrating the variety of substrate specificities possible within a GH family. GH5 is also dominated by endohydrolases, which cleave the chains of their respective substrates at random locations internal to the polymer chains. While this is true of GH5, the generalization does not hold for many other GH families. In addition to carbohydrases, the CAZy server defines numerous families of Carbohydrate Binding Modules (CBM). As with the catalytic modules, CBM families are designated on the basis of amino acid sequence similarity and conserved three-dimensional folds.
The CAZyme structural families have been incorporated into a new classification and nomenclature scheme developed by Bernard Henrissat and colleagues (Henrissat, Teeri et al. 1998). Traditional gene/protein nomenclature assigns an acronym indicating general function and order of discovery; under that scheme, the cellulase genes of an organism are named celA, celB, and so on, regardless of their actual mechanism of action on cellulose. Some researchers have attempted to convey more information by naming cellulases as endoglucanases (engA, engB) or cellobiohydrolases (cbhA, cbhB), but this requires determination of function in vivo and still fails to convey the relatedness of protein sequence and structure. The CAZyme nomenclature retains the familiar acronym to indicate the functional system to which a gene belongs and incorporates the family number designation. The capital letter after the family number indicates the order of report within a given organism's system. An example is provided by CenA and CenB, two endoglucanases of Cellulomonas fimi. Under the old nomenclature, nothing could be inferred from the names except the order of discovery. Naming them Cel6A and Cel9A, respectively, makes it immediately clear that these two cellulases are unrelated in sequence and so belong to different GH families (where Cel stands for cellulase and 9 for glycosyl hydrolase family 9). This scheme does not distinguish between endo- and exo-activity, but these designations are not absolute and can be included in the discussion of an enzyme when relevant (e.g., cellobiohydrolase Cel6A, endoxylanase Xyn10B). The catalytic module takes precedence in naming carbohydrases: although many (or most) carbohydrases contain at least one CBM, they are named for the enzymatic module. If more than one catalytic domain is present, the domains are named in order from the N-terminus to the C-terminus; for example, cel9A-cel48A contains a GH9 at the amino terminus and a GH48 at the carboxy terminus, and both domains act on cellulose. There are, however, many examples of CBM modules occurring on proteins with no predicted carbohydrase module. In the absence of some other predicted functional domain (such as a protease), these proteins are named for the CBM module family. If multiple CBM families are present, the naming again runs from the amino to the carboxy terminus, e.g., cbm2D-cbm10A (Henrissat, Teeri et al. 1998). This nomenclature has been widely accepted and will be used in naming all 2-40 plant-wall active carbohydrases and related proteins considered as part of this study.
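The naming convention described above (acronym, family number, order letter, with multi-module proteins joined N-terminus to C-terminus by hyphens) can be illustrated with a small parser. This is only a sketch: the function name `parse_cazyme_name`, the `Module` type and the regular expression are illustrative assumptions, not part of the CAZy scheme or of any CAZy software.

```python
import re
from typing import List, NamedTuple


class Module(NamedTuple):
    acronym: str   # functional system, e.g. "cel", "xyn", "cbm"
    family: int    # CAZy family number, e.g. 9 for GH9
    order: str     # capital letter: order of report within the organism


# Hypothetical pattern: letters, then digits, then one capital letter.
_MODULE_RE = re.compile(r"^([A-Za-z]+)(\d+)([A-Z])$")


def parse_cazyme_name(name: str) -> List[Module]:
    """Split a name such as 'cel9A-cel48A' into modules, N- to C-terminus."""
    modules = []
    for part in name.split("-"):
        match = _MODULE_RE.match(part)
        if match is None:
            # Old-style names such as 'celA' carry no family number.
            raise ValueError(f"not a CAZyme-style module name: {part!r}")
        acronym, family, order = match.groups()
        modules.append(Module(acronym, int(family), order))
    return modules


if __name__ == "__main__":
    for example in ("Cel6A", "cel9A-cel48A", "cbm2D-cbm10A"):
        print(example, parse_cazyme_name(example))
```

An old-style name such as celA is rejected because it carries no family number, which mirrors the point that nothing beyond order of discovery can be inferred from such names.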
The cell walls of higher plants are composed of a variety of carbohydrate polymer (CP) components. These CPs interact through covalent and non-covalent means to form a rigid cell wall, providing the structural integrity plants require to withstand turgor pressure. The major CP found in plants is cellulose, which forms the structural backbone of the cell wall (see FIG. 1A). During cellulose biosynthesis, chains of poly-β-1,4-D-glucose self-associate through hydrogen bonding and hydrophobic interactions to form cellulose microfibrils, which further self-associate to form larger fibers. Cellulose microfibrils are somewhat irregular and contain regions of varying crystallinity. The degree of crystallinity of cellulose fibers depends on how tightly ordered the hydrogen bonding is between their component cellulose chains. Regions with less-ordered bonding, and therefore more accessible glucose chains, are referred to as amorphous regions (FIG. 1B). The relative crystallinity and fibril diameter are characteristic of the biological source of the cellulose (Beguin and Aubert 1994; Tomme, Warren et al. 1995; Lynd, Weimer et al. 2002). The irregularity of cellulose fibers results in a wide variety of altered bond angles and steric effects that hinder enzymatic access and subsequent degradation.
The general model for cellulose depolymerization to glucose involves a minimum of three distinct enzymatic activities (see FIGS. 2A and 2B). Endoglucanases cleave cellulose chains internally, generating shorter chains and increasing the number of accessible ends, which are acted upon by exoglucanases. These exoglucanases are specific for either reducing ends or non-reducing ends and frequently liberate cellobiose, the dimer of cellulose (cellobiohydrolases). The released cellobiose is cleaved to glucose by cellobiases (β-1,4-glucosidases). In many systems an additional type of enzyme is present: cellodextrinases are β-1,4-glucosidases that cleave glucose monomers from cellulose oligomers, but not from cellobiose. Because of the variable crystallinity and structural complexity of cellulose, and the enzymatic activities required for its degradation, organisms with "complete" cellulase systems synthesize a variety of endo- and/or exo-acting β-1,4-glucanases.
For example, Cellulomonas fimi and Thermomonospora fusca have each been shown to synthesize six cellulases, while Clostridium thermocellum has been shown to have 15 or more (Tomme, Warren et al. 1995). Presumably, variations in the shape of the substrate-binding pockets and/or active sites of these numerous cellulases facilitate complete cellulose degradation (Warren 1996). Organisms with complete cellulase systems are believed to be capable of efficiently using plant biomass as a carbon and energy source while mediating cellulose degradation. The ecological and evolutionary role of incomplete cellulase systems is less clear, although many of these are believed to function as members of consortia (such as ruminal communities) that can collectively achieve total or near-total cellulose hydrolysis (Ljungdahl and Eriksson 1985; Tomme, Warren et al. 1995).
In the plant cell wall, microfibrils of cellulose are embedded in a matrix of hemicelluloses (including xylans, arabinans and mannans), pectins (galacturonans and galactans), and various β-1,3 and β-1,4 glucans. These matrix polymers are often substituted with arabinose, galactose and/or xylose residues, yielding arabinoxylans, galactomannans and xyloglucans, to name a few (Tomme, Warren et al. 1995; Warren 1996; Kosugi, Murashima et al. 2002; Lynd, Weimer et al. 2002). The sheer number and complexity of the different glycosyl bonds presented by these non-cellulosic CPs requires specific enzyme systems that often rival cellulase systems in enzyme count and complexity. Because of this heterogeneity, plant cell wall degradation often requires consortia of microorganisms (Ljungdahl and Eriksson 1985; Tomme, Warren et al. 1995).
Objective: S. degradans and M. degradans synthesize complete multi-enzyme systems that degrade the major structural polymers of plant cell walls. The aims are A) to define the cellulase and xylanase systems by determining the activity of genes whose function cannot be predicted by sequence homology; and B) to identify and annotate in the genome, by sequence homology, the other plant-degrading enzyme systems (e.g., pectinases and laminarinases).
All patent publications, patents, and patent applications are hereby expressly incorporated by reference in their entirety, to the same extent as if each individual patent publication, patent or patent application were specifically and individually indicated to be incorporated by reference in its entirety.
The following examples are illustrative and are not intended to limit the scope of the invention in any way.
Experimental Results
I: Genomic, Proteomic and Functional Analysis of 2-40 Plant-Wall Active Enzymes
From the ORNL annotation, it is clear that the 2-40 genome contains many enzymes with predicted activity against plant cell wall polymers. This is particularly surprising because 2-40 is an estuarine bacterium with several complex enzyme systems that degrade common marine polysaccharides such as agar, alginate, and chitin. Defining multienzyme systems based on automated annotation is complicated by the presence of poorly conserved domains and/or novel combinations of domains, and there are many examples of both among the plant-wall active enzymes of 2-40. Accordingly, the ORNL annotations of carbohydrase ORFs were reviewed manually with emphasis on modular composition, and the ORFs were then assigned to general groups based on the substrate they appeared likely to act on (i.e., cellulose or xylan degradation). This genomic sequence analysis yielded a pool of about 25 potential cellulases, 11 xylanases and 17 pectinases.
When sequence homology is well conserved, highly accurate predictions of function are possible. Therefore, to verify the presence of functioning cellulase and xylanase systems in M. degradans, zymograms and enzyme activity assays were performed as described below. In addition, Mass Spectrometry-based proteomics was used in an attempt to identify enzymes from 2-40 culture supernatants.
Next, more refined genomic analyses were used to predict function where possible and to identify the ORFs that require functional characterization to determine their role, if any, in the cellulase and xylanase systems. ORFs belonging to other plant wall-active enzyme systems were tentatively classified based on sequence analysis and the functional predictions of B. Henrissat.
To gain insight into the induction and expression of the 2-40 cellulases and xylanases, specific activity was measured in Avicel- and xylan-grown cells and supernatants by the dinitrosalicylic acid reducing-sugar assay (DNSA assay), as reviewed in the Experimental Protocols section at the end of this proposal. To investigate possible co-induction of activity by these two substrates, which occur together in the plant cell wall, xylanase activity was measured in Avicel-grown cultures and vice versa.
Growth on either Avicel or xylan yielded enzymatic activity against both substrates, indicating co-induction of the cellulase and xylanase systems. As with other 2-40 carbohydrase systems, the highest levels of activity are induced by the homologous substrate. The results also reveal some important differences in the expression of the two systems. During growth on Avicel, cellulase activity is cell-associated in early growth and accumulates significantly in the supernatant in later phases. The cell and supernatant fractions exhibit low levels of xylanase activity that are nearly equal throughout all growth phases. In contrast, xylan-grown cultures exhibit the majority of both xylanase and cellulase activity in the cell fraction throughout the growth cycle. Cellulase activity does not accumulate in the supernatant; xylanase activity accumulates modestly but still remains below the cell-associated activity.
Enzyme activity gels (zymograms) of Avicel- and xylan-grown cell pellets and culture supernatants were analyzed to visualize and identify the expressed cellulases and xylanases. The zymograms revealed five xylanolytic bands in xylan-grown supernatants (FIG. 3), four of which corresponded well with the calculated MWs of predicted xylanases (xyl/arb43G-xyn10D: 129.6 kDa, xyn10E: 75.2 kDa, xyn10C: 42.3 kDa, and xyn11A: 30.4 kDa; see FIG. 2). Avicel-grown cultures showed eight active bands in CMC zymograms, with MWs ranging from 30 to 150 kDa. CMC is generally a suitable substrate for endocellulase activity. These zymograms clearly demonstrate that 2-40 synthesizes a number of endocellulases of varying size during growth on Avicel, indicating a functional multienzyme cellulase system. Together, the CMC and xylan zymograms confirm the results of the genomic analysis and the inducible expression of multienzyme cellulase and xylanase systems in M. degradans 2-40.
To identify individual cellulases and xylanases produced during growth on CP, culture supernatants were subjected to proteomic analysis using reverse-phase high-performance liquid chromatography (RP-HPLC) coupled with tandem Mass Spectrometry (MS/MS). The resolving power gained by separating peptides on an RP-HPLC column prior to electrospray ionization and MS/MS analysis allows the identification of large numbers of proteins from complex samples (Smith, Loo et al. 1990; Shevchenko, Wilm et al. 1996; Jonsson, Aissouni et al. 2001). These analyses identified with confidence more than 100 different non-enzymatic proteins and a number of carbohydrases, including xylanase, two xylosidases, cellulase, and two cellodextrinases. Agarase was identified during additional analysis of agarose-grown supernatants.
Gel-slice digestion, extraction, and MS/MS analysis performed at the Stanford University mass spectrometry facility identified two annotated cellulases in Avicel-grown supernatant samples. One, designated cel5H, has a predicted MW of 67 kDa and was identified from a band with an apparent MW of 75 kDa. The other, cel9B, has a predicted MW of 89 kDa but an apparent MW of 120 kDa. The discrepancy between the predicted and apparent MW of cel9B is consistent with similar cases in which some 2-40 proteins cloned and expressed in E. coli show apparent MWs 30-40% greater than their predicted MWs.
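The size of the MW discrepancy can be checked with a short calculation. The sketch below uses only the predicted and apparent MWs quoted above; the helper name `percent_excess` is an illustrative assumption.

```python
def percent_excess(predicted_kda: float, apparent_kda: float) -> float:
    """Percentage by which the apparent MW exceeds the predicted MW."""
    return (apparent_kda - predicted_kda) / predicted_kda * 100.0


# Predicted and apparent MWs (kDa) as quoted in the text.
observations = [("cel5H", 67.0, 75.0), ("cel9B", 89.0, 120.0)]

for name, predicted, apparent in observations:
    excess = percent_excess(predicted, apparent)
    print(f"{name}: apparent MW is {excess:.1f}% above predicted")
```

For cel9B the excess is about 35%, which falls within the 30-40% range mentioned for other 2-40 proteins expressed in E. coli; for cel5H it is about 12%.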
Amino acid translations of all gene models in the 2-40 draft genome were analyzed on the CAZy ModO (Carbohydrate-Active enZyme Modular Organization) server at AFMB-CNRS. This analysis identified all gene models containing a catalytic module (GH, GT, PL, or CE) and/or a CBM. In all, the genome contains 222 gene models with CAZy domains, most of which have a modular architecture. Of these, 117 contain a GH module, 39 a GT, 29 a PL, and 17 a CE. Many of them carry one or more CBMs from various families. There are also 20 proteins that contain a CBM but no predicted carbohydrase domain.
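As a consistency check on these counts, the per-class numbers can be tallied against the stated total. The sketch assumes that each gene model is counted in exactly one class (GH, GT, PL, CE, or CBM only); the text does not state this explicitly, but the counts add up to 222 under that assumption.

```python
# Gene-model counts as stated in the text; the one-class-per-gene
# reading is an assumption made for this check.
counts = {"GH": 117, "GT": 39, "PL": 29, "CE": 17, "CBM only": 20}
stated_total = 222

catalytic_total = sum(n for cls, n in counts.items() if cls != "CBM only")
total = sum(counts.values())

print(f"gene models with a catalytic module: {catalytic_total}")
print(f"all CAZy gene models: {total} (stated: {stated_total})")
```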
Detailed comparison of the 2-40 module sequences with those in the ModO database allowed specific predictions of function for modules in which the active-site sequence is highly conserved. For example, Cel9B (from the gel-slice MS/MS) contains a GH9 module that is predicted to function as an endocellulase, together with CBM2 and CBM10 modules.
When the catalytic module sequence is less conserved, only the general mechanism can be predicted. This is the case for gly5M, which contains a GH5 predicted to be either a 1,3 or a 1,4 glucanase; because sequence analysis cannot determine which, it was given the acronym designation "gly" for glycanase.
The results of this detailed evaluation and analysis were used to assign genes to the cellulase, xylanase, pectinase, laminarinase, arabinanase and mannanase systems. The corresponding accessory enzymes were also assigned to each system; that is, cellobiases belong to the cellulase system and xylosidases belong to the xylanase system. Genes with less-conserved GH modules that have the greatest likelihood of functioning as cellulases, xylanases or accessory enzymes were identified and designated as requiring demonstration of function.
The results of the ORNL annotation, the subsequent annotation analysis, the proteomic (mass spectrometry) analysis, the CAZyme modular analysis and the functional predictions are incorporated into FIGS. 4-11, which contain tables summarizing the predicted plant-wall active carbohydrases and selected CBM-only genes of 2-40.
The genes selected for cloning and functional analysis include the carbohydrases gly3C, gly5K, gly5M, gly9C, and gly43M. Because the active site of gly5L is highly homologous to that of gly5K, its activity will be inferred from the results obtained with gly5K. Four of the 20 "CBM-only" proteins, cbm2A, cbm2B, cbm2C and cbm2D-cbm10A, are included in the activity assays to investigate their predicted lack of enzymatic function. These four contain CBM2 modules, which are predicted to bind crystalline cellulose. This predicted affinity is the reason for their inclusion in the activity assays, since proteins that bind cellulose may well contain cellulase or xylanase modules that are not detected by sequence analysis. For the CBM-only proteins, a lack of detected enzyme activity will confirm the absence of a catalytic domain (CD).
To define the complete cellulase and xylanase systems of M. degradans, those enzymes that belong to these systems but cannot be assigned with confidence based on sequence homology will be expressed, purified and assayed for activity as described in the Experimental Protocols. To date, gly3C, gly5K, gly5M, gly9C and gly43M, along with cbm2A, cbm2B, cbm2C and cbm2D-cbm10A, have been cloned into expression strains as pETBlue2 (Novagen) constructs. This vector places expression under the control of an inducible T7 lac promoter and adds a C-terminal 6xHistidine tag, allowing purification of the recombinant protein by nickel ion affinity. Successful cloning and expression of these proteins was confirmed by Western blot using α-HisTag® monoclonal antibody (Novagen). All expressed proteins had apparent MWs close to or greater than their predicted MWs (Table 8), except Cbm2D-Cbm10A, which appears to be unstable: two separate attempts to clone and express this protein yielded HisTag®-containing bands that ran near the dye front in Western blots, suggesting proteolytic degradation of the gene product. An additional enzyme, Cel5A, was cloned and expressed for use as an endocellulase positive control in the activity assays. Cel5A has a predicted MW of 129 kDa, contains two GH5 modules, and is highly active in HE-cellulose zymograms.
The main criteria for assigning function will be the substrate acted upon and the type of activity detected. As such, the various enzyme activity assays will focus on providing a qualitative demonstration of function rather than rigorous quantification of relative activity levels. The assays required are dictated by the substrate being tested and are described in more detail in the Experimental Protocols. For cellulose, it is important to distinguish between β-1,4-endoglucanase (endocellulase), β-1,4-exoglucanase (cellobiohydrolase), and β-1,4-glucosidase (cellobiase) activities. This was done using zymograms for endocellulase, the DNSA reducing-sugar assay for cellobiohydrolase, and p-nitrophenol-β-1,4-cellobioside (pnp-cellobiose) for cellobiase activity. The combined results from all three assays will allow function to be defined as follows: a positive zymogram indicates endocellulase activity; a negative zymogram combined with a positive DNSA assay and a negative pnp-cellobiose assay indicates an exocellulase; and a positive pnp-cellobiose result, taken together with the DNSA assay and a negative zymogram, will mean that the enzyme is a cellobiase.
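The three-assay decision scheme described above can be written out as a small function. This is a sketch under stated assumptions: the function name and the returned labels are illustrative, a positive zymogram is treated as decisive on its own, and the DNSA result is not used in the cellobiase rule because the text does not say unambiguously whether it is expected to be positive or negative in that case.

```python
def classify_cellulase(zymogram: bool, dnsa: bool, pnp_cellobiose: bool) -> str:
    """Assign a qualitative function from three assay outcomes.

    zymogram        -- clearing band on an HE-cellulose/CMC activity gel
    dnsa            -- reducing sugar released in the DNSA assay
    pnp_cellobiose  -- p-nitrophenol released from pnp-cellobioside
    """
    if zymogram:
        # Positive zymogram: endo-acting enzyme.
        return "endocellulase"
    if pnp_cellobiose:
        # Negative zymogram, positive pnp-cellobiose: cellobiase.
        # Assumption: the DNSA result is not decisive for this call.
        return "cellobiase"
    if dnsa:
        # Negative zymogram, positive DNSA, negative pnp-cellobiose.
        return "exocellulase (cellobiohydrolase)"
    return "no cellulolytic activity detected"


if __name__ == "__main__":
    print(classify_cellulase(zymogram=False, dnsa=True, pnp_cellobiose=False))
```

The last outcome corresponds to the expectation for the CBM-only proteins, where a lack of detected activity would support the absence of a catalytic domain.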
Xylanase (β-1,4-xylanase), laminarinase (β-1,3-glucanase), and mixed glucanase (β-1,3(4)-glucanase) activities will be determined by xylan, laminarin and barley glucan zymograms, respectively. Unlike cellulose, there are no reports of "xylobiohydrolases" or any other exo-acting enzymes that specifically cleave dimers from these substrates. Zymograms will therefore suffice to demonstrate depolymerase (endo) activity, and pnp-derivatives will detect monosaccharide (exo) cleavage. The pnp-derivatives used in this study will include pnp-α-L-arabinofuranoside, -α-L-arabinopyranoside, -β-L-arabinopyranoside, -β-D-cellobioside, -α-D-xylopyranoside and -β-D-xylopyranoside. These substrates were chosen based on the possible activities of the domains in question. This assay will allow determination of function for any α- and β-arabinosidases, β-cellobiases, β-xylosidases, bifunctional α-arabinosidase/β-xylosidases, and α-xylosidases, which cleave α-linked xylose from xyloglucans. The pnp-derivative assays will be run in 96-well microtiter plates using a standard curve of p-nitrophenol concentrations, as described in the Experimental Protocols.
The combination of assays for β-1,4-, β-1,3-, and β-1,3(4)-glucanase activities, for β-1,4-xylanase, and for the various exo-glycosidase activities should resolve the function of the ambiguous carbohydrases. Proteins with demonstrated activity will be assigned to the appropriate enzyme system.
Experimental Protocols
Zymograms
All activity gels were prepared as standard SDS-PAGE gels with the appropriate CP substrate incorporated directly into the separating gel. Zymograms were cast at 8% polyacrylamide, with the substrate dissolved in dH2O and/or gel buffer solution to give a final concentration of 0.1% (HE-cellulose), 0.15% (barley β-glucan), or 0.2% (xylan). Gels were run under discontinuous conditions according to the procedure of Laemmli (1970), except that samples were treated for 8 minutes at 95°C in sample buffer containing final concentrations of 100 mM dithiothreitol (DTT) and 2% SDS. After electrophoresis, gels were incubated for 1 hour at room temperature in 80 ml of renaturing buffer: 20 mM PIPES buffer, pH 6.8, containing 2.5% Triton X-100, 2 mM DTT and 2.5 mM CaCl2. Calcium was included to assist the refolding of strongly calcium-binding domains such as the tsp3s of Lam16A.
After the 1-hour equilibration, the gels were placed in a fresh 80 ml portion of renaturing buffer and held overnight at 4°C with gentle rocking. The next morning the gels were equilibrated in 80 ml of 20 mM PIPES, pH 6.8, for 1 hour at room temperature, transferred to a clean container, covered with a minimal amount of PIPES buffer and incubated at 37°C for 4 hours. Following incubation, gels were stained for 30 minutes with a solution of 0.25% Congo red in dH2O (HE-cellulose, β-glucan and xylan) or 0.01% Toluidine blue in 7% acetic acid. Gels were destained with 1 M NaCl for Congo red and with dH2O for Toluidine blue until clear bands were visible against the stained background.
Nelson-Somogyi Reducing-Sugar Assay
Purified proteins were assayed for activity using a modification of the Nelson-Somogyi reducing-sugar method adapted for 96-well microtiter plates, with 50 μl reaction volumes (Green, Clausen et al. 1989). Test substrates included Avicel, CMC, phosphoric acid swollen cellulose (PASC), barley glucan, laminarin, and xylan, dissolved at 1% (barley glucan and laminarin at 0.5%) in 20 mM PIPES, pH 6.8. Barley glucan, laminarin and xylan assays were incubated for 2 hours at 37°C; Avicel, CMC and PASC assays were incubated for 36 hours at 37°C. Samples were assayed in triplicate, corrected for blank values, and levels were estimated from a standard curve. Protein concentrations of the enzyme assay samples were determined in triplicate using the Pierce BCA protein assay according to the manufacturer's instructions. Enzyme activity was calculated with one unit (U) defined as 1 μM of reducing sugar released per minute, and reported as specific activity in U/mg protein.
Exoglycosidase Activity Assay: pnp-Derivatives
Purified proteins were assayed for activity against pNp derivatives of α-L-arabinofuranoside, -α-L-arabinopyranoside, -β-L-arabinopyranoside, -β-D-cellobioside, -α-D-glucopyranoside, -β-D-glucopyranoside, -α-D-xylopyranoside and -β-D-xylopyranoside. 25 μl of enzyme solution was added to 125 μl of 5 mM substrate solution in 20 mM PIPES, pH 6.8, incubated for 30 minutes at 37°C, and the A405 was measured. After correcting for blank reactions, readings were compared to a p-nitrophenol standard curve and reported as specific activity in U/mg protein, with one unit (U) defined as 1 μmol p-Np/min.
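The conversion from a blank-corrected A405 reading to specific activity can be sketched as follows. The unit definition (1 μmol p-Np/min) and the 30-minute incubation come from the protocol; the function names, the linear least-squares fit, and all numerical values for the standards, readings and protein amount are hypothetical and serve only to illustrate the arithmetic.

```python
from typing import Sequence, Tuple


def fit_standard_curve(umol: Sequence[float],
                       a405: Sequence[float]) -> Tuple[float, float]:
    """Least-squares line A405 = slope * umol + intercept."""
    n = len(umol)
    mean_x = sum(umol) / n
    mean_y = sum(a405) / n
    sxx = sum((x - mean_x) ** 2 for x in umol)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(umol, a405))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def specific_activity(a405_sample: float, a405_blank: float,
                      slope: float, intercept: float,
                      minutes: float, protein_mg: float) -> float:
    """Specific activity in U/mg, with 1 U = 1 umol p-nitrophenol per minute."""
    released_umol = ((a405_sample - a405_blank) - intercept) / slope
    return released_umol / minutes / protein_mg


# Hypothetical p-nitrophenol standards (umol per well) and readings.
standards_umol = [0.00, 0.02, 0.04, 0.08]
standards_a405 = [0.00, 0.20, 0.40, 0.80]
slope, intercept = fit_standard_curve(standards_umol, standards_a405)

# Hypothetical sample: 30-minute incubation, 0.002 mg protein in the well.
activity = specific_activity(0.65, 0.05, slope, intercept,
                             minutes=30.0, protein_mg=0.002)
print(f"specific activity: {activity:.2f} U/mg")
```

The same structure applies to the reducing-sugar assay if the standard curve is built from a reducing-sugar standard instead of p-nitrophenol.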
Mass Spectrometry and Proteomic Analysis
Stationary-phase supernatants from Avicel-, CMC-, and xylan-grown cultures were concentrated 25-fold by centrifugal ultrafiltration using Microcon or Centricon devices (Millipore). Sample protein concentrations were measured by the BCA protein assay. Samples were exchanged into 100 mM Tris buffer, pH 8.5, containing 8 M urea and 10 mM DTT. Samples were incubated at 37°C for 2 hours with agitation to denature the proteins and reduce disulfide bonds. After reduction, 1 M iodoacetate was added to a final concentration of 50 mM and the reaction was incubated at 25°C for 30 minutes in the dark. This step alkylates the reduced cysteine residues, thereby preventing re-formation of disulfide bonds. The samples were then exchanged into 50 mM Tris, 1 mM CaCl2, pH 8.5 using Microcon devices. The denatured, reduced, and alkylated samples were digested into peptide fragments using proteomics-grade trypsin at a 1:50 enzyme (trypsin) to substrate (supernatant) ratio. A typical digestion reaction had a total volume of about 150 μl. Digestions were incubated overnight at 37°C, stopped by the addition of 99% formic acid to a final concentration of ~1%, and analyzed by RP-HPLC-MS/MS at the UMCP College of Life Sciences CORE mass spectrometry facility.
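The tryptic digestion can be mimicked in silico to predict which peptides a given protein should yield. The sketch below assumes the commonly used rule that trypsin cleaves after lysine (K) or arginine (R) unless the next residue is proline (P), and it ignores missed cleavages; neither assumption is stated in the specification. The example input is the first 20 residues of SEQ ID NO: 1 written in one-letter code.

```python
def tryptic_peptides(protein):
    """Cleave after K or R unless followed by P (no missed cleavages)."""
    peptides, start = [], 0
    for i, residue in enumerate(protein):
        next_residue = protein[i + 1] if i + 1 < len(protein) else ""
        if residue in "KR" and next_residue != "P":
            peptides.append(protein[start:i + 1])
            start = i + 1
    if start < len(protein):
        # Keep any C-terminal peptide that does not end in K or R.
        peptides.append(protein[start:])
    return peptides

# First 20 residues of SEQ ID NO: 1 (Met Thr Ile Lys Arg Trp ... Ala Lys).
n_terminus = "MTIKRWPFDRKGPPKKPNAK"
peptides = tryptic_peptides(n_terminus)
```

Predicted peptide lists of this kind are what search packages compare against the observed fragment masses.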
Peptide fragments were loaded onto a Waters 2960 HPLC fitted with a 12 cm microbore column containing C18 as the adsorbent and eluted with a linear gradient of increasing acetonitrile (CH3CN) concentration into an electrospray ionization device. The electrospray device ionized the peptides and injected them into a Finnigan LCQ tandem mass spectrometer. Automated operating software controlled the solvent gradient and continuously scanned the eluted peptides. The program identified each of the three most abundant ion species in a survey scan, isolated each of them in the ion trap of the mass spectrometer, and fragmented them by inducing collisions with helium molecules. The resulting sub-fragment masses were recorded for further analysis by peptide analysis packages such as SEQUEST and MASCOT. After the three sub-scan and collision cycles were completed, the MS performed another survey scan, and the cycle was repeated until the end of the run, typically about 3 hours. The raw MS reads were used by the analysis software to generate peptide fragment sequences, which were compared to the amino acid sequence translations of all gene models in the 2-40 draft genome. Peptide identity matches were evaluated using the accepted thresholds of statistical significance specific to each program.
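The selection of the three most abundant ions from each survey scan can be sketched as below. This is an illustration of the selection logic only, not the instrument's operating software; the peak list and the function name are hypothetical.

```python
def top_n_precursors(survey_scan, n=3):
    """Return the n most intense (m/z, intensity) peaks from a survey scan."""
    return sorted(survey_scan, key=lambda peak: peak[1], reverse=True)[:n]

# Hypothetical survey scan: (m/z, intensity) pairs.
survey = [
    (445.2, 1.2e5),
    (512.8, 9.0e5),
    (623.3, 3.4e5),
    (701.9, 7.7e5),
    (850.4, 5.0e4),
]
selected = top_n_precursors(survey)
selected_mz = [mz for mz, _ in selected]
```

Each selected ion would then be isolated and fragmented in turn before the next survey scan begins.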
Cloning and Expression of 2-40 Proteins in E. coli
The basic cloning and expression system used pETBlue2 (Novagen) as the vector, E. coli DH5α (Invitrogen) as the cloning strain, and E. coli BL-21(DE3) Tuner® cells (Novagen) as the protein expression strain. This system allows the cloning of toxic or otherwise difficult genes because the vector places expression under the control of a T7 lac promoter, which is lacking in the cloning strain DH5α, thereby abolishing even low-level expression during plasmid screening and propagation. After blue/white screening, plasmids were purified from DH5α and transformed into the expression host (Tuner). The Tuner strain has the T7 lac promoter, which allows IPTG-inducible expression of the vector-encoded protein, and lacks the Lon and Omp proteases.
The nucleotide sequences of the gene models were obtained from the DOE JGI's Microbulbifer degradans genome web server and entered into the PrimerQuest™ design tool provided on the Integrated DNA Technologies web page at http://biotools.idtdna.com/Primerquest/. The design parameters were Optimum Tm 60°C, Optimum Primer Size 20 nt, and Optimum GC% = 50, and the product size ranges were chosen so that the primers were selected within the first and last 100 nucleotides of each ORF, in order to clone as much of the gene as reasonably possible. The cloning and expression vector, pETBlue2, provides a C-terminal 6xHistidine fusion as well as the start and stop codons for protein expression. Thus, careful attention to the frame of the vector and insert sequences is required when adding 5' restriction sites to the PCR primers. The resulting "tailed primers" were 26 to 30 nt long, and their sequences were verified by "virtual cloning" analysis using the PDRAW software package (http://www.acaclone.com). This program allows vector and insert DNA sequences to be cut with standard restriction enzymes and ligated together. The amino acid translations of the resulting sequences were examined to detect any frame shifts introduced by errors in primer design. After this verification, the primers were purchased from Invitrogen (Frederick, MD).
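The frame check performed on the virtually cloned constructs can be approximated with a short script. The sketch below uses the standard genetic code and flags a construct as out of frame if its length is not a multiple of three or if a stop codon appears before the end. The construct sequences are invented for illustration and are not the pETBlue2 sequence or any sequence from the listing; the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate complete codons from position 0; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def in_frame(construct):
    """True if the ORF length is a multiple of 3 and the only stop is the last codon."""
    protein = translate(construct)
    return (
        len(construct) % 3 == 0
        and protein.endswith("*")
        and "*" not in protein[:-1]
    )

# Hypothetical insert fused to a 6xHis tag and a stop codon.
good = "ATGACCATTAAA" + "CATCAC" * 3 + "TAA"
# Same construct with one extra base at the junction (a primer-design error).
shifted = "ATGACCATTAAA" + "G" + "CATCAC" * 3 + "TAA"
```

Here `translate(good)` ends in six histidines followed by the stop, while the shifted construct loses the tag, which is the kind of error the virtual cloning step is meant to catch.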
PCR reactions contained 10 pmol each of forward and reverse primers, 1 μl of 10 mM dNTPs, 1.5 μl of 100 mM MgCl2, and 1 μl of Proof Pro® Pfu polymerase in a 50 μl reaction, with 0.5 μl of 2-40 genomic DNA as the template. PCR conditions used standard parameters for tailed primers and Pfu DNA polymerase. PCR products were cleaned up with the QIAGEN QIAquick PCR Cleanup kit and viewed in 0.8% agarose gels. Following cleanup and confirmation of size, the PCR products and pETBlue2 were digested with the appropriate restriction enzymes, typically AscI and ClaI, at 37°C for 1 to 4 hours, cleaned up using the QIAquick kit, and visualized in agarose gels. Clean digestions were ligated using T4 DNA ligase for at least 2 hours in the dark at room temperature. The ligations were then transformed into E. coli DH5α by electroporation. Transformants were incubated for one hour at 37°C in non-selective medium and then plated onto LB agar containing ampicillin and X-gal. Because pETBlue2 carries an Ampr gene and inserts are cloned into the lacZ ORF, white colonies contain the insert sequence. White colonies were picked with toothpicks and patched onto a fresh LB/Amp/X-gal plate, and three of the patched colonies were used to inoculate 3 ml overnight broths. Plasmids were prepared from the broths corresponding to patched colonies that remained white after overnight outgrowth. These plasmid preps were then singly digested with an appropriate restriction enzyme and visualized by agarose electrophoresis for size confirmation.
The plasmids were then heat-shock transformed into the Tuner® strain, which carries a chromosomal chloramphenicol resistance gene (Cmr). Transformants were incubated for 1 hour at 37°C in non-selective rescue medium, plated on LB agar with Amp and Cm (Tuner medium), and incubated overnight at 37°C. Any colonies thus selected should contain the vector and insert. This was confirmed by patching three colonies onto a Tuner medium plate and inoculating corresponding 3 ml overnight broths. The next morning the broths were used to inoculate 25 ml broths, which were grown to an OD600 of around 0.6 (2-3 hours). At this point a 1 ml aliquot was removed from the culture, pelleted, and resuspended in 1/10 volume of 1X SDS-PAGE treatment buffer. This pre-induction sample was frozen at -20°C for later use in Western blots. The remaining broth was then amended to 1 mM IPTG and incubated for 4 hours at 37°C. Induced pellet samples were collected at timed intervals. These samples and the pre-induction control were run in a standard SDS-PAGE gel and electroblotted onto PVDF membrane. The membranes were then processed as Western blots using a 1/5000 dilution of monoclonal mouse α-HisTag® primary antibody followed by HRP-conjugated goat α-mouse IgG secondary antibody. Bands were visualized colorimetrically using BioRad's Opti-4CN substrate kit. The presence of His-tagged bands in the induced samples, but not in the uninduced controls, confirmed successful expression, and comparison of the bands from the timed samples was used to optimize induction parameters for later, larger-scale purifications.
Preparation and Purification of Recombinant Proteins
Expression strains were grown to an OD600 of 0.6 to 0.8 in 500 ml or 1 liter broths of Tuner medium. At this point a non-induced sample was collected and the remaining culture was induced by the addition of 100 mM IPTG to a final concentration of 1 mM. Induction was carried out for 4 hours at 37°C or for 16 hours at 25°C. Culture pellets were harvested and frozen overnight at -20°C for storage and to aid cell lysis. The pellets were then thawed on ice for 10 minutes, transferred to pre-weighed Falcon tubes, and weighed. The cells were then rocked for 1 hour at 25°C in 4 ml of lysis buffer (8 M urea, 100 mM NaH2PO4, 25 mM Tris, pH 8.0) per gram of wet pellet weight. The lysates were centrifuged for 30 minutes at 15,000 g to pellet cell debris. The cleared lysate (supernatant) was pipetted into a clean Falcon tube, where 1 ml of QIAGEN 50% Nickel-NTA resin was added for each 4 ml of cleared lysate. This mixture was gently agitated for 1 hour at room temperature to facilitate binding between the Ni2+ ions on the resin and the His tags of the recombinant protein. After binding, the slurry was loaded into a disposable mini column, and the flow-through (depleted lysate) was collected and saved for later evaluation. The resin was washed twice with lysis buffer adjusted to pH 7.0; the volume of each wash was equal to the original volume of cleared lysate. The flow-through of these two washes was also saved for later analysis in Western blots to evaluate purification efficiency.
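The volume ratios in this procedure lend themselves to a small planning helper. The sketch below computes the IPTG stock volume needed to reach 1 mM from a 100 mM stock, plus the lysis buffer and resin volumes from the wet pellet weight. It assumes, for simplicity, that the cleared lysate volume roughly equals the lysis buffer volume; that approximation and the function names are not from the specification.

```python
def iptg_stock_volume_ml(culture_ml, stock_mM=100.0, final_mM=1.0):
    """Stock volume to add so that the mixed culture reaches final_mM."""
    return culture_ml * final_mM / (stock_mM - final_mM)

def lysis_plan(wet_pellet_g):
    """Return (lysis buffer ml, Ni-NTA resin slurry ml) for a wet pellet."""
    lysis_buffer_ml = 4.0 * wet_pellet_g   # 4 ml lysis buffer per gram of pellet
    resin_ml = lysis_buffer_ml / 4.0       # 1 ml resin per 4 ml lysate (assumed ~ buffer volume)
    return lysis_buffer_ml, resin_ml

iptg_ml = iptg_stock_volume_ml(495.0)      # about 5 ml for a culture of roughly 500 ml
buffer_ml, resin_ml = lysis_plan(2.5)      # hypothetical 2.5 g wet pellet
```

Each of the two pH 7.0 washes would then use the same volume as the cleared lysate.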
At this point the columns contain relatively purified recombinant proteins immobilized by the His tags at their C-termini. This is an ideal situation for refolding, so the column is moved to a 4°C room and a series of renaturation buffers with decreasing urea concentrations is passed through the column. The renaturation buffers contain varying amounts of urea in 25 mM Tris pH 7.4, 500 mM NaCl, and 20% glycerol. This buffer is prepared as stock solutions containing 6 M, 4 M, 2 M, and 1 M urea. Aliquots of these can easily be mixed to obtain 5 M and 3 M urea concentrations, thus providing a descending series of urea concentrations in 1 M steps. One volume (the original lysate volume) of 6 M buffer is passed through the column, followed by one volume of 5 M buffer, and so on down to the 1 M buffer, which is repeated once to ensure equilibration of the column at 1 M urea. At this point the refolded proteins are eluted in 8 fractions of 1/10 the original volume using 1 M urea, 25 mM Tris pH 7.4, 500 mM NaCl, and 20% glycerol containing 250 mM imidazole. The imidazole disrupts the nickel ion-His tag interaction, thereby releasing the protein from the column.
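The descending urea series can be written out explicitly. The sketch below lists the seven washes (6, 5, 4, 3, 2, 1, and a repeated 1 M step) and, for the 5 M and 3 M steps, the equal-volume mix of the neighboring stocks. The function names and the 10 ml lysate volume are hypothetical.

```python
STOCKS_M = (6, 4, 2, 1)            # urea stocks prepared in renaturation buffer
SERIES_M = (6, 5, 4, 3, 2, 1, 1)   # wash order; the 1 M wash is repeated once

def wash_recipe(target_m, volume_ml):
    """Map stock molarity -> ml needed to make volume_ml at target_m urea."""
    if target_m in STOCKS_M:
        return {target_m: volume_ml}
    # 5 M and 3 M are mixed from the stocks 1 M above and 1 M below.
    high, low = target_m + 1, target_m - 1
    high_ml = volume_ml * (target_m - low) / (high - low)
    return {high: high_ml, low: volume_ml - high_ml}

def refolding_schedule(lysate_ml):
    """One column wash per step, each equal to the original lysate volume."""
    return [(m, wash_recipe(m, lysate_ml)) for m in SERIES_M]

schedule = refolding_schedule(10.0)
```

For a 10 ml lysate, the 5 M step comes out as 5 ml of the 6 M stock plus 5 ml of the 4 M stock.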
Western blots were used to assess the amount of His-tagged protein in the eluted fractions, the two washes, and the depleted lysate. If recombinant protein was abundant in the depleted lysate and/or the washes, it was possible to repeat the process and "scavenge" more protein. Elution fractions containing the protein of interest were pooled, then concentrated and exchanged into storage buffer (20 mM Tris pH 7.4, 10 mM NaCl, 10% glycerol) using Centricon centrifugal ultrafiltration devices (Millipore). The enzyme preparations were then aliquoted and frozen at -80°C for use in activity assays.
In various embodiments of the invention, the cellulose-degrading enzymes of the invention, and related proteins and systems containing them, for example systems comprising one or more enzymes or cellulose-binding proteins, have many uses. Many possible uses of the cellulases of the present invention are the same as those described for other cellulases in the article "Cellulases and related enzymes in biotechnology" by M. K. Bhat (Biotechnology Advances 18 (2000)), the contents of which are incorporated herein by reference in their entirety. For example, the cellulases and systems of the present invention can be used in the food, beer, wine, animal feed, textile production and laundering, pulp and paper, and agricultural industries.
In one embodiment, these systems can be used to degrade cellulose to produce short chain peptides for use in medicine.
In other embodiments, these systems are used to break down cellulose in the extraction and/or clarification of fruit and vegetable juices; in the production and preservation of fruit nectars and purees; in altering the texture, flavor, and other sensory properties of food; in the extraction of olive oil; in improving the quality of bakery products; in brewing beer and making wine; in preparing monogastric and ruminant feeds; in textile and laundry technologies, including "fading" denim material, defibrillation of lyocell, washing garments, and the like; in preparing paper and pulp products; and in agricultural uses.
In some embodiments of the invention, cellulose can be used to absorb environmental pollutants and waste debris. The cellulose can then be degraded by the cellulose degradation systems of the present invention. Bacteria that can metabolize environmental pollutants and can degrade cellulose may be used in bioreactors that degrade toxic materials. Such a bioreactor would be advantageous because there would be no need to add additional nutrients to maintain the bacteria; they would use cellulose as a carbon source.
In some embodiments of the invention, the cellulose-degrading enzyme systems can be supplied in dry form, in buffers, or as pastes, paints, micelles, and the like. The cellulose-degrading enzyme systems can also comprise additional components such as metal ions, chelators, detergents, organic ions, inorganic ions, and additional proteins such as biotin and albumin.
In some embodiments of the invention, the cellulose degradation systems of the present invention can be applied directly to the cellulose material. For example, a system containing one, some, or all of the compounds listed in FIGS. 4-11 can be applied directly to a plant or other cellulose-containing item such that the system degrades the plant or other cellulose-containing item. As another example, 2-40 can be grown on the plant or other cellulose-containing item, which allows 2-40 to produce the compounds listed in FIGS. 4-11 and thereby degrade the cellulose-containing item as 2-40 grows. An advantage of using 2-40 or the systems of the present invention is that the degradation of the cellulose-containing plant or item can be carried out in a marine environment, for example under water.
It is one aspect of the present invention to provide a nucleotide sequence having a homology selected from 100%, 99%, 98%, 97%, 96%, 95%, 90%, 85%, 80%, or 75% to any of the sequences of the compounds listed in FIGS. 4-11.
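One way to relate a candidate sequence to the listed thresholds is to compute percent identity over a pairwise alignment. The sketch below assumes that "homology" is measured as percent identity over pre-aligned sequences of equal length, with "-" marking gaps; the specification does not define the measure, and the sequences and function names here are invented for illustration.

```python
TIERS = (100, 99, 98, 97, 96, 95, 90, 85, 80, 75)   # thresholds listed in the text

def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns in which both sequences carry the same base."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(a == b and a != "-" for a, b in zip(aligned_a, aligned_b))
    return 100.0 * matches / len(aligned_a)

def highest_tier(identity):
    """Highest listed threshold met by the identity, or None if below 75%."""
    return max((t for t in TIERS if identity >= t), default=None)

query = "ATGACCATTAAA"      # hypothetical reference fragment
variant = "ATGACGATTAAA"    # one substitution in 12 positions
identity = percent_identity(query, variant)
tier = highest_tier(identity)
```

In this example 11 of 12 positions match, so the variant meets the 90% tier but not the 95% tier.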
The present invention also covers the replacement of between 1 and 20 nucleotides of any of the sequences of the compounds listed in FIGS. 4-11 with non-natural or non-standard nucleotides, for example phosphorothioate, deoxyinosine, deoxyuridine, isocytosine, isoguanosine, and ribonucleic acids including 2-O-methyl, and the replacement of the phosphodiester backbone with, for example, alkyl chains, aryl groups, and protein nucleic acid (PNA).
It is another aspect of some embodiments of the present invention to provide a nucleotide sequence that hybridizes to any one of the sequences of the compounds listed in FIGS. 4-11 under stringency conditions of 1 x SSC, 2 x SSC, 3 x SSC, 4 x SSC, 5 x SSC, 6 x SSC, 7 x SSC, 8 x SSC, 9 x SSC, or 10 x SSC.
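For orientation, the salt content at each listed SSC strength can be tabulated. The sketch below uses the conventional composition of 1 x SSC (150 mM NaCl and 15 mM trisodium citrate, that is, a 20 x stock of 3 M NaCl and 0.3 M sodium citrate); the specification itself does not give the recipe, so this composition is an assumption taken from standard laboratory practice, and the function name is hypothetical.

```python
NACL_MM_PER_X = 150.0      # mM NaCl in 1 x SSC (conventional recipe)
CITRATE_MM_PER_X = 15.0    # mM trisodium citrate in 1 x SSC

def ssc_composition(fold):
    """Return NaCl, citrate, and total Na+ (all in mM) for an n x SSC buffer."""
    return {
        "NaCl_mM": fold * NACL_MM_PER_X,
        "citrate_mM": fold * CITRATE_MM_PER_X,
        # Each trisodium citrate contributes three sodium ions.
        "Na_mM": fold * (NACL_MM_PER_X + 3 * CITRATE_MM_PER_X),
    }

table = {fold: ssc_composition(fold) for fold in range(1, 11)}
```

Lower SSC strengths mean less salt and therefore more stringent hybridization, so 1 x SSC is the most stringent of the listed conditions.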
The scope of the present invention covers natural and non-natural alleles of any one of the sequences of the compounds listed in FIGS. 4-11. In some embodiments of the invention, alleles of any one of the sequences of the compounds listed in FIGS. 4-11 can comprise the replacement of one, two, three, four, or five naturally occurring amino acids with similarly charged, shaped, sized, or situated amino acids (conservative substitutions). The present invention also covers non-natural or non-standard amino acids, for example selenocysteine, pyrrolysine, 4-hydroxyproline, 5-hydroxylysine, phosphoserine, phosphotyrosine, and the D-isomers of the 20 standard amino acids.
Some embodiments of the present invention relate to a method for producing ethanol from lignocellulosic material, comprising treating the lignocellulosic material with an effective saccharifying amount of one or more of the compounds listed in FIGS. 4-11, preferably the cellulase cel5A listed in FIG. 4, to obtain saccharides, and converting the saccharides to produce ethanol. The treatment can be carried out in a marine environment, such as under water. The one or more compounds listed in FIGS. 4-11 can be present in dry form, in a buffer, or in the form of a paste, paint, or micelle.
The conversion of the sugars to ethanol and its recovery can be accomplished by, but are not limited to, any of the well-established methods known to those skilled in the art, for example through the use of an ethanologenic microorganism such as Zymomonas, Erwinia, Klebsiella, Xanthomonas, or Escherichia, preferably Escherichia coli K011 or Klebsiella oxytoca P2.
In a further aspect of the present invention, the lignocellulosic material is treated with an effective saccharifying amount of all of the compounds listed in FIGS. 4-11.
In a further aspect of the present invention, one or more of the compounds listed in FIGS. 4-11 are derived from Microbulbifer degradans 2-40.
In a further aspect of the present invention, the one or more compounds listed in FIGS. 4-11 are in a system consisting essentially of one or more of the compounds listed in FIGS. 4-11, or in a system that further comprises metal ions, chelators, detergents, organic ions, inorganic ions, or one or more additional proteins such as biotin and/or albumin.
Some embodiments of the present invention relate to ethanol produced by treating lignocellulosic material with an effective saccharifying amount of one or more of the compounds listed in FIGS. 4-11 to obtain saccharides, and converting the saccharides to produce ethanol. The conversion of the sugars to ethanol and its recovery can be accomplished by, but are not limited to, any of the well-established methods known to those skilled in the art, for example through the use of an ethanologenic microorganism such as Zymomonas, Erwinia, Klebsiella, Xanthomonas, or Escherichia, preferably Escherichia coli K011 or Klebsiella oxytoca P2.
A further embodiment of the present invention relates to a method for producing ethanol from lignocellulosic material, comprising contacting the lignocellulosic material with a microorganism expressing an effective saccharifying amount of one or more of the compounds listed in FIGS. 4-11, preferably the cellulase cel5A listed in FIG. 4, to obtain saccharides, and converting the saccharides to produce ethanol. The contacting can be carried out in a marine environment, such as under water. The one or more compounds listed in FIGS. 4-11 can be present in dry form, in a buffer, or in the form of a paste, paint, or micelle. The microorganism can be Microbulbifer degradans 2-40 or a recombinant microorganism comprising a chimeric gene that includes at least one polynucleotide encoding a polypeptide comprising the amino acid sequence of at least one of the compounds listed in FIGS. 4-11, wherein the gene is operably linked to regulatory sequences that allow expression of the amino acid sequence by the microorganism. The recombinant microorganism can be a bacterium or a yeast, such as Escherichia coli. In some aspects of the invention, the recombinant microorganism is an ethanologenic microorganism, such as a microorganism from the species Zymomonas, Erwinia, Klebsiella, Xanthomonas, or Escherichia, preferably Escherichia coli K011 or Klebsiella oxytoca P2.
Some aspects of the present invention relate to ethanol produced by contacting lignocellulosic material with a microorganism expressing an effective saccharifying amount of one or more of the compounds listed in FIGS. 4-11 to obtain saccharides, and converting the saccharides to produce ethanol.
A further aspect of the present invention relates to a method for producing ethanol from lignocellulosic material, comprising contacting the lignocellulosic material with an ethanologenic microorganism expressing an effective saccharifying amount of one or more of the compounds listed in FIGS. 4-11 to produce ethanol. The ethanologenic microorganism expresses an effective amount of one or more of the compounds listed in FIGS. 4-11 to saccharify the lignocellulosic material, and an effective amount of one or more enzymes or enzyme systems that (individually or cooperatively) catalyze the subsequent conversion of the saccharides (for example, sugars such as xylose and/or glucose) to ethanol. The one or more enzymes or enzyme systems of the ethanologenic organism can be expressed naturally or by, but not limited to, any method known to those skilled in the art. For example, release of the one or more enzymes or enzyme systems can be obtained through the use of ultrasound. In some aspects of the invention, the ethanologenic microorganism is transformed so that it is able to express one or more of the compounds listed in FIGS. 4-11. In some aspects of the invention, the ethanologenic microorganism is a microorganism from the species Zymomonas, Erwinia, Klebsiella, Xanthomonas, or Escherichia, preferably Escherichia coli K011 or Klebsiella oxytoca P2.
A further embodiment of the present invention relates to the use of one or more of the compounds listed in FIGS. 4-11 for the production of ethanol from lignocellulosic material. Another embodiment relates to the use of a microorganism or an ethanologenic microorganism that expresses an effective saccharifying amount of one or more of the compounds listed in FIGS. 4-11 for the production of ethanol from lignocellulosic material.
While the invention has been described in detail above using specific embodiments, it should be understood that the detailed description and examples are intended to illustrate the structural and functional principles of the invention and are not intended to limit its scope. On the contrary, the invention is intended to encompass all modifications, alterations, and substitutions within the spirit and scope of the appended claims.
Cited References
SEQUENCE LISTING
<110> TAYLOR, LARRY EDMUND
WEINER, RONALD M.
HUTCHESON, STEVEN WAYNE
EKBORG, NATHAN A.
HOWARD, MICHAEL
<120> ENZYME SYSTEMS FOR SACCHARIFICATION OF PLANT CELL WALL POLYSACCHARIDES
<130> 108172-00124
<140> 11/519,104
<141> 2006-09-12
<150> 11/121,154
<151> 2005-05-04
<150> 60/567,971
<151> 2004-05-04
<160> 214
<170> PatentIn version 3.3
<210> 1
<211> 1167
<212> PRT
<213> Microbulbifer degradans
<400> 1
Met Thr Ile Lys Arg Trp Pro Phe Asp Arg Lys Gly Pro Pro Lys Lys
1 5 10 15
Pro Asn Ala Lys Lys Leu Leu Ala Ser Leu Ala Ala Ala Leu Ser Leu
20 25 30
Thr Ala Met Gln Ser Thr Ala Ala Val Glu Pro Leu Gln Thr Ser Gly
35 40 45
Asn Gln Ile Leu Val Gly Asn Gln Ala Lys Ala Leu Gly Gly His Ser
50 55 60
Leu Phe Trp His Asn Val Pro Ala Ala Gly Ser Leu Tyr Asn Ala Asp
65 70 75 80
Thr Val Ser Arg Leu Lys Asn Asp Trp Asn Ser Lys Val Ile Arg Ala
85 90 95
Ala Ile Gly Val Glu Val Pro Phe Asn Ser Glu Asn Thr Tyr Ile Gly
100 105 110
Asn Lys Gly Ser Ser Leu Ala Ala Ile Asp Arg Val Val Asn Ala Ala
115 120 125
Val Ala Asn Asp Met Tyr Val Ile Ile Asp Phe His Thr His His Ala
130 135 140
Asp Gln Val Glu Asn Val Ala His Asp Phe Phe Asn Glu Val Ser Ser
145 150 155 160
Arg Tyr Gly His Leu Asn Asn Val Ile Tyr Glu Val Phe Asn Glu Pro
165 170 175
Glu Trp Cys Gly Glu His Gly Arg Trp Ala Ser Thr Ile Lys Pro Tyr
180 185 190
Ala Glu Arg Val Ile Gln Thr Ile Arg Asn Asn Asp Pro Asp Asn Leu
195 200 205
Val Ile Val Gly Thr Thr Cys Phe Ser Gln Asp Val Asp Val Ala Ala
210 215 220
Ala Asp Pro Ile Asn Asp Val Asn Val Ala Tyr Thr Leu His Phe Tyr
225 230 235 240
Ala Ala Thr Pro Ala His Gln Gln Pro Leu Arg Asp Lys Ala Gln Thr
245 250 255
Ala Leu Asp Arg Gly Ala Pro Leu Phe Val Thr Glu Trp Gly Thr Thr
260 265 270
Thr Phe Thr Gly Asp Gly Phe Val Asp Glu Ala Gln Thr Arg Thr Trp
275 280 285
Ile Asn Trp Leu Asn Glu Arg Gly Ile Ser His Val Asn Trp Ser Ala
290 295 300
Ser Thr Gln Pro Glu Ser Ser Ala Ile Trp Asn Gly Asp Met Thr Tyr
305 310 315 320
Lys His Ser Gly Leu Leu Val Gly Glu Leu Val Gln Gln Thr Asn Gly
325 330 335
Thr Thr Thr Pro Pro Thr Gly Glu Ile Ser Gly Pro Cys Asp Leu His
340 345 350
Phe Val Pro Ala Lys Ala Glu Ala Glu Ser Phe Cys Thr Ala Lys Gly
355 360 365
Ile Gln Phe Glu Thr Thr Thr Asp Thr Gly Gly Gly Gln Asn Met Gly
370 375 380
Trp Leu Asp Ala Gly Asp Trp Val Thr Phe Asp Val Asp Val Pro Ala
385 390 395 400
Ser Gly Gln Tyr Leu Ile Asp Tyr Arg Val Ala Ser Glu Leu Gly Asp
405 410 415
Gly Arg Phe Arg Thr Glu Ala Ala Asn Gly Thr Ala Leu Gly Thr Ile
420 425 430
Ser Val Pro Asn Thr Gly Gly Trp Gln Asn Trp Gln Thr His Thr His
435 440 445
Thr Val Gln Leu Ser Gln Gly Thr Gln Thr Val Lys Leu Val Ala Glu
450 455 460
Thr Gly Gly Trp Asn Leu Asn Trp Phe Glu Val Arg Ala Gly Glu Val
465 470 475 480
Cys Glu Gly Ala Asp Cys Pro Cys Glu Gly Ala Glu Cys Pro Cys Pro
485 490 495
Asp Cys Asn Gly Thr Pro Val Lys Phe Glu Ala Glu Thr Phe Val Ala
500 505 510
Met Gln Gly Val Gln Leu Glu Asn Thr Ser Asp Val Gly Gly Gly Gln
515 520 525
Asn Val Gly Tyr Ile Asp Ser Gly Asp Trp Ile Thr Tyr Asn Gly Ala
530 535 540
Leu Pro Ala Ser Ala Asp Asn Arg Tyr Val Val Ser Tyr Arg Val Ala
545 550 555 560
Arg Gln Pro Ser Gly Asn Ala Lys Phe Lys Ile Glu Gln Pro Gly Gly
565 570 575
Ala Ala Val Tyr Gly Glu Ile Ser Val Pro Ser Thr Gly Gly Trp Gln
580 585 590
Thr Trp Thr Thr Ile Ser His Thr Ile Thr Ile Pro Ala Asn Ala Asn
595 600 605
Gly Phe Ala Leu Ala Ala Ile Asp Gly Gly Trp Asn Ile Asn Trp Ile
610 615 620
Glu Ile Lys Pro Ala Thr Thr Gln Pro Pro Glu Pro Ile Asn Pro Leu
625 630 635 640
Lys Leu Gln Ala Glu Asp Tyr Ile Asn Phe Asn Asp Thr Thr Pro Gly
645 650 655
Asn Glu Gly Gly Ala His Arg Ser Asp Asp Val Asp Ile Gln Ala Thr
660 665 670
Thr Asp Thr Gly Gly Gly Phe Asn Val Gly Trp Val Asp Ala Gly Glu
675 680 685
Trp Leu Glu Tyr Glu Phe Phe Leu Glu Ser Pro Asp Phe Tyr Ala Ala
690 695 700
Asp Val Arg Val Ala Ser Asp Gln Thr Gly Gly Ala Leu Gln Leu Gln
705 710 715 720
Ile Asp Gly Gln Asn Val Gly Gln Ala Ile Thr Val Gly Asn Thr Gly
725 730 735
Gly Trp Gln Ala Trp Thr Thr Lys Asn Thr Leu Ile Gly Asp Leu Ser
740 745 750
Ala Gly Thr His Thr Leu Arg Val Tyr Ala Gln Ser Gly Pro Leu Asn
755 760 765
Leu Asn Trp Val Glu Leu Lys Arg Thr Thr Pro Ala Pro Ala Thr Ser
770 775 780
Cys Phe Asn Ile Ala Glu Asp Arg Leu Asn Val His Leu Asp Ala His
785 790 795 800
Cys Thr Ala Gly Ser Asn Leu Gln Tyr Asn Trp Asp Phe Gly Asp Gly
805 810 815
Asn Ser Ala Thr Gly Val Ala Thr Ser His Ser Tyr Tyr Thr Ser Gly
820 825 830
Thr Tyr Thr Ile Thr Leu Thr Val Ser Asp Thr Arg Thr Thr Asp Thr
835 840 845
Ser Ser Gln Gln Val Thr Val Asp Phe Ser Ala Pro Ala Gly Pro Val
850 855 860
Asp Phe Tyr Gly Glu Leu Met Val Asn Gly Asn Arg Ile His Gly Glu
865 870 875 880
Lys Thr Gly Glu Pro Ala Gln Val Arg Gly Met Ser Phe Phe Trp Ser
885 890 895
Asn Thr Gly Trp Gly Gln Glu Lys Trp Trp Asn Ala Ser Thr Val Asp
900 905 910
Arg Met Val Asp Glu Phe Lys Val Glu Leu Val Arg Gly Ala Met Gly
915 920 925
Thr Asp Glu Gly Gly Gly Tyr Leu His Asp Ala Ser Asn Lys Ala Arg
930 935 940
Leu Gln Ala Val Val Glu Gln Ala Ile Ala Arg Asn Val Tyr Val Ile
945 950 955 960
Ile Asp Trp His Thr His His Ala Glu Asp Asn Ile Ala Glu Ala Ile
965 970 975
Thr Phe Phe Ser Glu Met Ala Gln Leu Tyr Gly His His Asp Asn Val
980 985 990
Ile Phe Glu Ile Tyr Asn Glu Pro Leu Asn Thr Thr Ser Trp Gly Thr
995 1000 1005
Ile Lys His Tyr Ala Glu Gln Val Ile Pro Ala Ile Arg Ala His Ser
1010 1015 1020
Asp Asn Leu Ile Val Val Gly Thr Arg Thr Trp Ser Gln Asn Val Asp
1025 1030 1035 1040
Glu Ala Ala Phe Asp Lys Ile Asn Asp Ser Asn Thr Ala Tyr Ala Leu
1045 1050 1055
His Phe Tyr Val Gly Ser His Gly Asn His Val Arg Asn Leu Ala Gln
1060 1065 1070
Thr Ala Leu Asn Asn Gly Ala Ala Ile Phe Ala Ser Glu Trp Gly Ile
1075 1080 1085
Trp Pro Asn Asn Asn Tyr Asp Gly Met Asn Ala Asp Asp Trp Met Asn
1090 1095 1100
Phe Leu Asp Gln Asn Lys Ile Ser Trp Ala Asn Trp Ala Ile Ser Asp
1105 1110 1115 1120
Lys Val Asp Pro Asn Thr Gly Gln Leu Glu Pro Pro Ser Met Phe Asn
1125 1130 1135
Pro Asp Gly Ser Leu Ser Ser Asn Gly Gln Tyr Val Val Asn Lys Leu
1140 1145 1150
Asn Glu Tyr Ala Ala Gln Ala Pro Trp Arg Glu Ala Ile Ala Asn
1155 1160 1165
<210> 2
<211> 3504
<212> DNA
<213> Microbulbifer degradans
<400> 2
atgacaatta aacgttggcc gttcgaccga aaaggcccac ctaaaaaacc taacgctaaa 60
aaattactcg caagcttagc ggctgcacta agcttaaccg ccatgcaaag cactgcagcg 120
gtagagccat tacaaaccag cggcaatcaa attcttgttg gcaaccaagc caaagccctt 180
ggcggccaca gcttgttttg gcataacgtg ccggcagcag gcagcttata caatgcagat 240
acagtaagca ggcttaagaa tgattggaac tccaaggtta ttcgggccgc aattggggtt 300
gaagtacctt tcaattcaga aaacacctac ataggcaata agggcagctc gctggccgca 360
atagaccgcg tagttaatgc cgctgttgcc aacgatatgt atgtgattat cgattttcat 420
actcaccatg cagatcaagt agaaaacgtt gcccacgact ttttcaacga agtttctagc 480
cgttacggtc atttaaacaa tgttatttat gaagtattta acgagccaga atggtgtggc 540
gagcacggtc ggtgggcatc taccattaag ccctacgccg agcgcgttat ccaaaccatt 600
cgcaacaatg acccagacaa cctagtaata gtaggcacta cctgtttctc gcaagatgta 660
gatgtagccg cagccgaccc cattaacgat gtaaacgtgg cctatacgct acacttttac 720
gcagccaccc ctgcccacca gcaacccttg cgcgacaagg cccaaaccgc gctcgaccgc 780
ggcgcgccac tatttgtaac cgaatggggt acaaccacat ttacaggtga tggttttgta 840
gatgaggcgc aaacgcgcac atggattaac tggttaaacg aacgcggtat tagccacgtt 900
aactggtcgg cgtctaccca gccagaaagc tcagctatat ggaatggcga catgacctac 960
aagcattcgg gcttattggt tggcgaactg gtgcaacaaa caaatggcac aaccacgcca 1020
ccaaccggtg aaataagtgg cccgtgcgat ttacattttg tacctgccaa agccgaggct 1080
gaaagcttct gtaccgccaa aggcattcaa tttgaaacca ccaccgacac gggcggcggc 1140
caaaacatgg gctggctaga tgccggcgac tgggtaactt ttgatgtaga tgtacctgct 1200
agcggccaat atttaataga ttaccgcgta gcatcagagc taggtgatgg tcggttccgc 1260
accgaagccg ccaacggcac tgcccttggc acaatatctg tacccaatac cggcggctgg 1320
cagaattggc aaacgcacac acacacagtg caactctcgc aaggcacaca aaccgttaaa 1380
ctagttgccg aaactggtgg ctggaactta aattggtttg aagtgcgcgc aggtgaggtg 1440
tgcgaaggcg ctgactgccc atgtgaagga gccgaatgcc cttgcccaga ttgcaacggc 1500
acaccggtta agtttgaggc agaaacgttt gtggctatgc aaggcgtgca gctagaaaac 1560
acatccgatg tgggcggcgg ccaaaacgtt ggctacattg atagcggcga ctggataact 1620
tacaacgggg ccttgcccgc aagtgcagac aaccgctatg tagtgtctta tagagtagcg 1680
cgtcaaccta gcggcaatgc caaatttaaa atagaacagc caggtggagc agcggtatat 1740
ggcgaaattt cggtgcccag caccggcggc tggcaaacat ggacaaccat tagccacacc 1800
ataacaattc ccgctaacgc aaacggcttt gcactagcag caatagatgg cggttggaat 1860
ataaactgga tagaaataaa accggcgacc actcaaccac ccgagccaat caacccgtta 1920
aaacttcaag ctgaagatta catcaacttt aacgacacca cccccggtaa cgaaggcggt 1980
gcacacagaa gcgatgatgt agatattcaa gcaactaccg ataccggtgg cggttttaat 2040
gttggctggg tagacgctgg cgaatggcta gagtatgagt tctttttaga gtctcctgat 2100
ttttatgcag ctgatgtacg ggttgcttca gaccaaactg gcggcgcact gcaactacaa 2160
atagatggcc aaaacgttgg ccaagccatt accgttggca acaccggtgg ctggcaagcg 2220
tggacaacca aaaacacact cattggcgac ctaagtgcag gcacccacac gttgcgtgta 2280
tacgcgcaaa gcggcccatt aaatttaaac tgggtagagc taaagcgtac aacgcccgca 2340
ccagccactt cgtgttttaa tattgccgaa gaccgcttaa acgttcacct agatgcgcac 2400
tgtactgcag gcagcaacct gcaatacaat tgggattttg gtgacggcaa cagcgcaacc 2460
ggcgtagcca ctagccacag ctactacact agcggcactt acaccattac cttaaccgtt 2520
agtgataccc gcaccacaga cacctctagc caacaggtaa cggtagattt ttctgcccct 2580
gcaggccctg tggattttta cggcgaacta atggtgaatg gcaaccgcat tcacggcgaa 2640
aaaaccggcg aacccgcaca agtacgcggc atgagctttt tttggagcaa caccggttgg 2700
ggccaagaaa aatggtggaa cgccagcacc gtggaccgca tggttgatga gttcaaagta 2760
gaacttgtgc gcggcgcaat gggcactgat gaaggcggcg gttatttaca cgacgcgtct 2820
aataaggctc gcttacaagc agttgttgaa caagccattg cacgcaatgt gtatgtaatt 2880
atcgactggc acacccacca tgccgaagat aacattgccg aagccattac attctttagc 2940
gaaatggcgc agctttatgg ccaccacgac aacgtgattt tcgagattta caacgagcca 3000
ttaaacacca caagctgggg cactattaag cactacgctg aacaagttat tcctgctatt 3060
cgcgctcatt ccgataattt aattgttgtg ggcacgcgca cctggtcgca aaacgtagac 3120
gaagccgcgt tcgataaaat taacgacagc aacaccgcct acgccctgca cttttatgtt 3180
ggctcgcacg gcaaccacgt tcgcaaccta gcacaaaccg cactaaacaa cggcgcggct 3240
atttttgcta gcgaatgggg aatttggcca aacaacaact acgatggcat gaacgccgac 3300
gattggatga actttttaga ccaaaacaaa atatcttggg ctaactgggc catatccgac 3360
aaagtagacc ccaacacagg ccaactagaa ccacccagca tgttcaaccc agacggcagc 3420
ctaagcagta atggtcaata tgtagtgaac aaactaaatg aatacgcagc acaagcaccg 3480
tggagggagg caatcgctaa ttga 3504
<210> 3
<211> 133
<212> PRT
<213> Microbulbifer degradans
<400> 3
Met Val Val Ser Leu Ala Asp Asn Ser Ala Gly Ala Ile Ser Cys Trp
1 5 10 15
His Ala Lys Ala Ser Pro Pro Glu Glu Leu Glu Glu Leu Leu Asp Glu
20 25 30
Glu Leu Glu Leu Asp Glu Leu Glu Leu Glu Glu Glu Leu Glu Glu Leu
35 40 45
Val Glu Glu Leu Leu Glu Glu Leu Leu Asp Glu Leu Leu Leu Asp Glu
50 55 60
Leu Glu Asp Glu Pro Leu Ala Ala Pro Pro Ser Leu Pro Pro Pro Gln
65 70 75 80
Ala Val Ser Pro Ala Lys Gln Leu Ile Ser Ser Ala Asp Phe Lys Lys
85 90 95
Val Ser Phe Arg Val Gln Leu Asn Ala Val Arg Val Lys Ser Lys Arg
100 105 110
Asn Ile Asn His Ser Arg Ile Phe Leu Phe Trp Leu Phe Ser His Phe
115 120 125
Arg Ser Arg Arg Cys
130
<210> 4
<211> 566
<212> PRT
<213> Microbulbifer degradans
<400> 4
Met Phe Leu Leu Asp Phe Thr Arg Thr Ala Phe Ser Cys Thr Arg Lys
1 5 10 15
Leu Thr Phe Leu Lys Ser Ala Leu Leu Ile Ser Cys Phe Ala Gly Leu
20 25 30
Thr Ala Cys Gly Gly Gly Ser Asp Gly Gly Ala Ala Ser Gly Ser Ser
35 40 45
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
50 55 60
Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
65 70 75 80
Ser Ser Ser Ser Asn Ser Ser Ser Ser Ser Ser Gly Gly Asp Ala Leu
85 90 95
Ala Cys Gln His Glu Met Ala Pro Ala Leu Leu Ser Ala Ser Asp Thr
100 105 110
Thr Met Val Gln Ala Glu Tyr Tyr Asp Thr Cys Ala Ser Ser Ala Leu
115 120 125
Asp Asn Thr Thr Gly Asn Ser Gly Gly Glu Leu Arg Thr Asp Asp Val
130 135 140
Asp Ile Val Ala Ile Ala Asp Gly Tyr Ala Ile Thr Asp Met Gln Ser
145 150 155 160
Gly Glu Tyr Val Glu Tyr Ser Leu Thr Val Gln Thr Ser Gly Leu Phe
165 170 175
Asp Ile Ser Phe Ala Val Gln Pro His Ala Ala Asn Thr Ala Gly Leu
180 185 190
Ala Leu Ser Val Asp Gly Ala Val Leu Gly Thr Val Asp Ile Ala Ala
195 200 205
Asn Asp Ser Thr Ala Phe Gly Glu Tyr Thr Leu Asn Gly Val Tyr Ile
210 215 220
Ser Asp Gly Ala Gln Val Ile Arg Val Thr Met Ala Gly Glu Gly Ala
225 230 235 240
Ala Ile Gly Leu Asp Ser Ile Ala Phe Asn Tyr Thr Asp Asn Thr Val
245 250 255
Tyr Thr Pro Glu Asn Ala Val Leu Gly Met Gly Ile Gly Ile Asn Leu
260 265 270
Gly Asn Thr Leu Asp Ala Phe Pro Asn Glu Gly Asp Trp Ala Pro Ala
275 280 285
Ala Gln Glu Tyr Tyr Phe Lys Ala Tyr Lys Asp Ala Gly Phe Arg His
290 295 300
Val Arg Ile Pro Ala Thr Trp Asp Asp His Thr Ala Asp Thr Ala Pro
305 310 315 320
Tyr Ala Val Asn Ala Ala Arg Met Asp Arg Thr Glu Gln Ile Val Asp
325 330 335
Trp Ala Leu Ala Gln Gly Tyr Phe Val Ile Leu Asn Ala His His Glu
340 345 350
His Trp Leu Lys Glu Asn Tyr Gly Asn Gln Thr Tyr Arg Asp Arg Phe
355 360 365
Asp Ala Ile Trp Gln Gln Ile Ala Glu Arg Phe Lys Asn Lys Ser Ala
370 375 380
Arg Leu Met Phe Glu Ile Leu Asn Glu Pro Asn Gly Met Thr Val Ala
385 390 395 400
Asp Val Asp Asp Leu Asn Pro Arg Ile Leu Asp Ile Ile Arg Glu Thr
405 410 415
Asn Pro Thr Arg Leu Val Val Phe Ser Gly Asn Gly Tyr Thr Pro Val
420 425 430
Asp Ala Leu Leu Ala Ala Ala Ile Pro Asn Asp Asp Tyr Leu Ile Gly
435 440 445
Asn Phe His Ser Tyr Asp Pro Trp Gln Phe Gly Gly Gln Cys Val Arg
450 455 460
Ser Trp Gly Thr Glu Gln Asp Tyr Thr Asp Leu Glu Asn Ile Tyr Lys
465 470 475 480
Arg Ala Asn Thr Trp Ser Glu Gln His Asp Ile Pro Val Met Val Asn
485 490 495
Glu Phe Gly Ala Ala His Tyr Asp Phe Thr Ala Pro Gln Asn Val Cys
500 505 510
Asn Gln Gln Ala Arg Leu Ala Tyr Leu Gly Ala His Ala Thr Phe Ala
515 520 525
Ile Gln Tyr Gly Phe Gly Ala Ser Val Trp Asp Asp Gly Gly Ser Phe
530 535 540
Glu Val Tyr Lys Arg Gly Glu Asn Ser Trp Arg Glu Ala Lys Asp Val
545 550 555 560
Leu Val Ala Pro Asn Pro
565
<210> 5
<211> 1701
<212> DNA
<213> Microbulbifer degradans
<400> 5
atgtttcttt tagactttac ccgcactgcg tttagctgta cacgaaagct tacctttttg 60
aaatccgcgc tacttataag ctgctttgcc gggcttactg cctgtggtgg cgggagtgat 120
ggcggtgctg caagtggctc atcctctagc tcgtctagca gcagttcgtc tagtagctct 180
tcgagcagtt cttcaactag ttcctcaagc tcctcttcaa gctctagttc gtccagttcc 240
agctcttcgt ctaatagttc ctctagctcc tctggtggcg atgctttagc gtgccagcat 300
gaaatggcac cagcgctatt atctgcaagt gatactacca tggtgcaagc ggagtattac 360
gatacctgtg cttcttcggc attagataac accactggta acagtggcgg tgagttgcga 420
actgacgatg tagatatagt ggccattgcg gacggctatg ctattacgga tatgcagtca 480
ggcgagtacg tagaatattc actaacagtg caaacttccg gtttgtttga cattagtttt 540
gcggtacagc cgcacgcagc taatactgcc ggtttggcgc tgagtgtaga tggcgcagtg 600
ttaggcacag ttgatattgc cgctaatgac agcaccgcat ttggcgaata tacgcttaac 660
ggcgtgtaca taagcgatgg cgcgcaagta ataagggtaa ccatggccgg cgaaggcgct 720
gctattgggt tagattccat tgcctttaat tacaccgata ataccgttta caccccagaa 780
aacgccgtgt tgggtatggg aataggtatt aacctaggca ataccttaga tgccttcccc 840
aacgaaggtg actgggcacc ggctgcgcag gaatactatt ttaaagccta caaggatgca 900
ggtttccgcc atgtacgcat cccagcaact tgggatgatc acacggctga tacagccccc 960
tacgctgtaa atgcagcacg tatggatcgc actgagcaga ttgtagattg ggccttggcg 1020
cagggctatt tcgtaattct taatgcccac cacgaacact ggctaaaaga aaactacggc 1080
aatcaaacat accgcgatcg ctttgatgca atttggcagc aaattgccga acgctttaag 1140
aataagtcgg ctcgcttaat gtttgagata ctcaatgagc caaacggcat gacagtggcc 1200
gatgtggatg acctcaaccc acgtattctc gatattattc gcgaaaccaa tcccacgcga 1260
ttggtagtgt tctctggtaa tgggtatacc cctgtggatg ccttacttgc ggctgcaatc 1320
cctaatgatg attaccttat tggtaacttt cactcctacg acccttggca gtttggcggt 1380
cagtgcgtac gatcgtgggg tacagagcaa gattacaccg acctagagaa catatataag 1440
cgcgcaaata cttggtctga gcagcacgac atacccgtta tggtgaacga atttggcgct 1500
gcccattacg attttactgc accgcagaat gtatgtaacc agcaggctcg tttggcttat 1560
ttaggtgccc atgccacatt tgctattcag tacggctttg gcgcaagtgt atgggacgac 1620
ggtggatcat ttgaggtgta caagcgcggt gaaaatagct ggcgcgaagc taaagatgta 1680
ttagtggcgc caaacccgta g 1701
<210> 6
<211> 451
<212> PRT
<213> Microbulbifer degradans
<400> 6
Met Arg Ile Ile Thr Ala Phe Ala Val Met Leu Leu Cys Ile Thr Gly
1 5 10 15
Cys Ser Gly Ser Gly Ala Ser Asp Ser Pro Gln Ala Ser Asn Ser Ser
20 25 30
Ser Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
35 40 45
Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser
50 55 60
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Gly Glu Ala Leu
65 70 75 80
Tyr Pro Ser Tyr Asn Thr Asn Pro Pro Ala Pro Asp Met Thr Gly Met
85 90 95
Thr Ser Thr Ala Thr Gln Leu Ala Asp Arg Ile Thr Val Gly Trp Asn
100 105 110
Ile Gly Asn Thr Leu Glu Ala Ile Gly Gly Glu Thr Asn Trp Gly Asn
115 120 125
Pro Leu Val Thr Asn Glu Leu Ile Gln Ala Val Lys Ala Ser Gly Phe
130 135 140
Asp Ser Ile Arg Ile Pro Ala Ala Trp Asp Gln Tyr Ala Asn Gln Glu
145 150 155 160
Thr Ala Ala Ile Asp Ile Asn Trp Leu Asn Arg Val Lys Gln Val Val
165 170 175
Gln Tyr Ser Ile Asp Asn Asp Met Val Val Val Leu Asn Ile His Trp
180 185 190
Asp Gly Gly Trp Leu Glu Arg Asn Val Glu Pro Ser Glu Gln Val Ala
195 200 205
Val Asn Ala Lys Gln Lys Ala Tyr Trp Glu Gln Ile Ala Thr His Leu
210 215 220
Arg Asp Phe Asp Glu Arg Leu Ile Phe Ala Ser Ala Asn Glu Pro His
225 230 235 240
Val Glu Thr Glu Ala Gln Met Ala Val Leu Asn Val Tyr His Gln Thr
245 250 255
Phe Val Asp Thr Val Arg Ala Thr Gly Gly Lys Asn Ala Tyr Arg Val
260 265 270
Leu Val Leu Gln Gly Pro Lys Thr Asp Ile Glu Thr Thr Ser Leu Leu
275 280 285
Trp Thr Gln Met Pro Gln Asp Ser Ala Val Asn Lys Leu Met Ala Glu
290 295 300
Leu His Phe Tyr Thr Pro Tyr Asn Phe Thr Leu Met Asn Val Asp Glu
305 310 315 320
Ser Trp Gly Asn Gln Phe Tyr Tyr Trp Gly Glu Gly Asn His Ser Thr
325 330 335
Thr Asp Thr Gly Arg Asn Pro Thr Trp Gly Glu Glu Ala Thr Val Asp
340 345 350
Ser Leu Leu Ala Ile Thr Lys Gln Gln Phe Val Asp Gln Gly Ile Pro
355 360 365
Val Ile Ile Gly Glu Tyr Gly Ala Gln Arg Arg Asp Asn Leu Thr Gly
370 375 380
Asp Glu Leu Ala Leu His Leu Gln Ser Arg Asn Tyr Tyr Leu Lys Tyr
385 390 395 400
Val Thr Gln Lys Cys Val Glu Leu Gly Leu Lys Pro Phe Tyr Trp Asp
405 410 415
Thr Gly Gly Leu Asp Asn Asn Gln Ser Gly Leu Phe Asn Arg Ser Thr
420 425 430
Tyr Gln Val Phe Asp Gln Asn Ala Leu Asp Ala Ile Met Glu Gly Ala
435 440 445
Arg Gly Glu
450
<210> 7
<211> 1356
<212> DNA
<213> Microbulbifer degradans
<400> 7
atgagaataa taacggcgtt tgcagttatg ctgctatgca taacaggctg tagcggatcg 60
ggcgcgagtg atagcccgca agcatccaat tcgtcttcgg gcagttcttc tagctctagc 120
agttcgtcaa gttcgagcag ttcctctagt tcgtcgtcta gctcttcaac aagctctagc 180
agctcatcta gctccagctc atcatcaagc tctagcagtt cttcgggcgg cgaagcgctt 240
tacccaagct acaatacaaa cccgccagcg ccagatatga ccggcatgac aagtactgcc 300
acacaactag cagatcgtat aaccgtgggc tggaatattg gtaacacgct agaggcaata 360
ggcggcgaaa ccaactgggg taacccgctg gttactaacg aattaattca agcggtaaaa 420
gccagtggct ttgattccat tcgtataccc gccgcgtggg atcaatacgc caaccaagaa 480
acggccgcaa tagatataaa ctggctaaac cgcgttaaac aagttgtgca atacagcata 540
gataacgaca tggtggtagt gctaaacatc cactgggatg gcggttggct agagcgcaat 600
gtagagccca gcgagcaagt agcagtaaat gcaaaacaaa aagcctattg ggaacaaatt 660
gccactcacc tgcgcgactt tgacgagcgc ctaatatttg ccagcgccaa cgaaccccat 720
gtagaaaccg aagcacaaat ggccgtacta aacgtatacc atcaaacgtt tgtagataca 780
gtgcgtgcaa ctggcggtaa aaatgcttac cgcgtactgg tattgcaggg gccaaaaaca 840
gatatagaaa ccacctcgct attgtggacc caaatgccgc aagatagcgc cgtaaataaa 900
cttatggcag agctacactt ctataccccg tacaacttta cgttaatgaa tgtagatgaa 960
agctggggca accagttcta ctactggggc gaaggtaatc attccactac cgacacaggc 1020
cgcaacccaa cctggggcga agaagcaaca gtagattcac tgctggcaat taccaaacaa 1080
cagtttgtgg accaaggtat acccgtaatt attggcgaat acggtgcaca acgccgcgat 1140
aaccttaccg gcgatgaatt ggccctgcac ttacaatcgc gcaactacta cttaaaatac 1200
gttactcaaa aatgtgtaga gctaggctta aaaccttttt attgggatac cggcggctta 1260
gacaacaatc aatctggcct gtttaatcgc agtacctacc aagtatttga tcaaaatgcc 1320
ctagatgcca ttatggaagg ggccagaggg gaataa 1356
<210> 8
<211> 621
<212> PRT
<213> Microbulbifer degradans
<400> 8
Met Leu Lys His Gln Phe Ser Lys Ala Leu Arg Ala Leu Gly Phe Gly
1 5 10 15
Gly Ala Val Phe Ala Ala Ser Leu Met Ala Ser Gln Ala Ser Ala Leu
20 25 30
Glu Cys Glu His Ser Ile Ser Asn Asp Trp Gly Ala Gly Phe Thr Gly
35 40 45
Ala Met Lys Val Thr Asn Asn Asp Ser Ser Pro Ile Thr Gly Trp Arg
50 55 60
Val Glu Trp Ala Tyr Ser Gly Asn Val Asn Ile Val Asn Ser Trp Asn
65 70 75 80
Ala Ser Val Thr Lys Gly Ser Asn Tyr Val Ala Val Asp Ala Gly Trp
85 90 95
Asn Gly Asn Leu Gln Pro Ser Gln Ser Thr Glu Phe Gly Leu Gln Gly
100 105 110
Asp Gly Ala Asp Arg Asn Val Thr Ile Ile Ser Cys Val Ala Glu Gly
115 120 125
Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
130 135 140
Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr
145 150 155 160
Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Thr Ser Ser
165 170 175
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Gly Asn Cys Val
180 185 190
Ala Met Cys Asn Trp Tyr Gly Glu Asn Arg Pro Val Cys Ala Asn Gln
195 200 205
Asn Thr Gly Trp Gly Trp Glu Asn Asn Gln Ser Cys Ile Gly Ala Asn
210 215 220
Thr Cys Asn Asp Gln Trp Gly Asp Gly Gly Val Val Ser Ser Cys Gly
225 230 235 240
Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser
245 250 255
Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Ser
260 265 270
Ser Ser Ser Ser Ser Gly Gly Leu Ser Ala Val Glu Phe Ser Gln Gln
275 280 285
Met Gly Leu Gly Trp Asn Leu Gly Asn Ser Leu Glu Ala Ile Gly Gly
290 295 300
Glu Thr Ala Trp Gly Asn Pro Met Val Thr Gln Gln Leu Ile Asn Ser
305 310 315 320
Ile Lys Ala Ala Gly Phe Asp Thr Ile Arg Ile Pro Val Ala Trp Ser
325 330 335
Gln Phe Ser Asp Glu Ala Asn Phe Val Ile Asn Ser Asn Trp Ile Ala
340 345 350
Arg Val Glu Glu Val Val Asn Tyr Ala Leu Ser Ala Asp Met Tyr Val
355 360 365
Val Met Asn Gln His Trp Asp Gly Gly Trp Met Gln Pro Thr Tyr Ala
370 375 380
Gln Gln Glu Tyr Val Asn Asn Arg Leu Gln Ile Met Trp Thr Gln Ile
385 390 395 400
Ala Asn His Phe Lys Asp Tyr Asp Ser Arg Leu Leu Phe Ala Gly Thr
405 410 415
Asn Glu Val Met Val Glu Gly Asp Tyr Gly Thr Pro Thr Phe Glu Tyr
420 425 430
Tyr Thr Val Gln Asn Ser Phe Asn Gln Thr Phe Val Asp Ala Val Arg
435 440 445
Ala Thr Gly Gly Ala Asn Ala Ser Arg Tyr Leu Val Val Gln Gly Phe
450 455 460
Asn Thr Asn Ile Asp His Thr Val Asn Phe Ala Val Val Pro Thr Asp
465 470 475 480
Pro Ala Thr Asn Arg Leu Met Met Glu Val His Tyr Tyr Asp Pro Tyr
485 490 495
Asn Phe Thr Leu Asn Thr Asn Ser Asn Ile Thr Gln Trp Gly Val Ile
500 505 510
Ala Thr Asp Pro Ser Val Thr Glu Thr Trp Ala Asn Glu Ser Tyr Val
515 520 525
Asp Ala Thr Phe Gln Lys Met Lys Thr Asn Phe Val Asp Gln Gly Ile
530 535 540
Ala Val Ile Leu Gly Glu Tyr Gly Val Val Ser Arg Ala Asn Val Ala
545 550 555 560
Gly His Glu Thr Tyr Arg Glu Tyr Trp Asn Gln Tyr Ile Thr Gln Ser
565 570 575
Ala Val Asp His Gly Met Val Pro Ile Tyr Trp Asp Asn Gly Tyr Ser
580 585 590
Gly Asp Gly Gly Met Ala Leu Phe Asp Arg Ala Ser Gly Asn Gln Leu
595 600 605
Tyr Pro Asn Ile Ile Asn Ala Ile Ile Asn Ala Gly Asn
610 615 620
<210> 9
<211> 85
<212> PRT
<213> Microbulbifer degradans
<400> 9
Met Leu Glu Glu Glu Leu Glu Val Glu Leu Glu Glu Glu Leu Val Glu
1 5 10 15
Glu Leu Glu Leu Leu Asp Glu Glu Val Leu Glu Leu Asp Glu Glu Leu
20 25 30
Leu Glu Glu Leu Glu Glu Leu Asp Glu Leu Glu Asp Asp Pro Pro Ser
35 40 45
Ala Thr Gln Leu Ile Met Val Thr Phe Leu Ser Ala Pro Ser Pro Cys
50 55 60
Lys Pro Asn Ser Val Asp Trp Leu Gly Cys Lys Leu Pro Phe His Pro
65 70 75 80
Ala Ser Thr Ala Thr
85
<210> 10
<211> 1866
<212> DNA
<213> Microbulbifer degradans
<400> 10
atgttgaaac atcaattcag caaagcgctg cgtgcgctag gctttggtgg ggctgtgttt 60
gcggcatcgc taatggctag ccaagcaagt gcccttgagt gtgagcattc aatcagtaat 120
gattggggcg ccggctttac cggtgcaatg aaagttacca ataatgactc tagccccatt 180
accggttggc gggtcgaatg ggcgtatagc ggcaatgtaa atattgttaa ttcgtggaac 240
gcctcagtaa caaaaggcag taattatgtt gccgtagatg ccggatggaa tggtaattta 300
cagccgagcc aatctaccga atttggctta cagggtgatg gcgccgatag aaatgtaacc 360
attattagtt gtgttgccga aggcggatca tcttctagtt catcaagttc ttccagctcc 420
tcaagtagtt cttcatctag ctcaagtact tcttcatcga gtagctcaag ttcctcgacg 480
agctcttctt ctagttcgac ttctagctct tcttcaagca cctcttctag ctcatcgtcc 540
agttcatcaa gctcttcttc gggcggcaac tgtgttgcaa tgtgtaattg gtacggtgaa 600
aaccgccctg tttgtgccaa tcaaaatact ggttgggggt gggaaaacaa ccaaagctgt 660
ataggtgcaa acacctgtaa cgatcaatgg ggcgacgggg gcgtggtgtc cagctgtggt 720
acgtctagct cttcatccag ttcttcgtcc agttcgtcta ccagttcatc ctcgtcttct 780
agctcgagca ccagctctac aagcagctca tcaagctcta gttcgtcgtc tggtgggtta 840
agcgcggtag agttttcgca gcaaatgggc ttggggtgga atcttggaaa ctccctagaa 900
gcgattggtg gcgaaaccgc gtggggcaac ccaatggtta cgcagcaatt aattaactcc 960
ataaaagctg ctgggttcga cactattcgc attccggttg cgtggagcca attctcggac 1020
gaagctaatt ttgttatcaa tagcaattgg attgcacgcg tagaagaagt agtgaactac 1080
gcattgagcg ccgatatgta cgtggtaatg aaccaacatt gggacggcgg ttggatgcag 1140
cccacatatg cacagcaaga atatgttaac aatcgcttgc aaattatgtg gacgcaaata 1200
gctaatcact ttaaagatta cgatagtcgc ttactgtttg caggcaccaa cgaagtgatg 1260
gtggaaggcg attacggtac gcccaccttc gaatactaca cagtacaaaa tagctttaac 1320
caaacgtttg tggatgctgt acgtgcaacc ggtggcgcta atgctagccg ttacttagtg 1380
gtacaggggt ttaataccaa catagatcac acggtgaact tcgcggtagt gccaaccgac 1440
ccggcaacaa acaggttaat gatggaagta cactattacg acccctataa ctttacgtta 1500
aataccaaca gcaacattac tcagtggggc gtaattgcaa ctgaccctag cgttaccgaa 1560
acatgggcga atgaatctta tgtggatgcg actttccaaa aaatgaaaac taacttcgtt 1620
gatcaaggta tagcggtaat tttaggtgag tacggggttg tatcgcgcgc gaatgtggcc 1680
gggcacgaaa cttaccgaga gtattggaac caatacatta ctcaatctgc ggtagatcat 1740
ggaatggtgc ctatttattg ggataacggt tattccggtg atggtggtat ggcattgttt 1800
gatcgcgcca gtggcaatca actttacccc aatattatta acgcaattat caatgccggt 1860
aactaa 1866
<210> 11
<211> 673
<212> PRT
<213> Microbulbifer degradans
<400> 11
Met Leu Ile Gly Thr Val Thr Ala Ser Ala Leu Val Gly Arg Gly Arg
1 5 10 15
Gly Thr Pro Lys Lys Ile Ile Asn Lys Gly Ser Ile Met Trp Gln Ile
20 25 30
Asn Lys Ser Ala Leu Ala Ala Val Val Leu Val Cys Ser Ser Ser Ser
35 40 45
Phe Ala Gln Ser Ala Cys Asp Thr Gln Arg Ile Glu Ala Glu Asn Tyr
50 55 60
Val Ala Met Ser Gly Ile Gln Thr Glu Ser Thr Ala Asp Thr Gly Gly
65 70 75 80
Gly Leu Asn Val Gly Trp Ile Asp Ala Gly Asp Trp Leu Ser Tyr Gln
85 90 95
Val Asn Leu Pro Ala Ala Gly Gln Tyr Glu Val Arg Tyr Arg Val Ala
100 105 110
Ser Arg Asn Gly Gly Gly Val Leu Arg Leu Glu Gly Asn Ala Gly Gln
115 120 125
Thr Leu Tyr Gly Thr Met Asn Val Pro Asn Thr Gly Gly Trp Gln Asn
130 135 140
Trp Gln Thr Leu Ser His Ser Val Thr Leu Ala Ala Gly Glu Gln Ser
145 150 155 160
Ile Gly Ile Gly Val Pro Ser Gly Gly Phe Asn Ile Asn Trp Leu Glu
165 170 175
Phe Val Pro Leu Asp Cys Ser Gly Pro Ile Asp Pro Pro Ile Asn Pro
180 185 190
Pro Ser Asn Cys Ala Ser Ile Val Phe Glu Ala Glu Asn Tyr Asp Gln
195 200 205
Met Ser Gly Ile Arg Thr Gln Thr Thr Ser Asp Thr Gly Gly Gly Leu
210 215 220
Asn Val Gly Trp Ile Asp Ala Gly Asp Trp Leu Ser Tyr Ala Thr Val
225 230 235 240
Asn Ile Pro Ser Thr Gln Val Tyr Asn Phe Glu Tyr Arg Val Ala Ser
245 250 255
Pro Asn Gly Gly Ser Phe Asn Leu Gln Gly Ser Ala Gly Ala Glu Asn
260 265 270
Phe Asp Thr Ala Thr Leu Pro Asn Thr Gly Gly Trp Gln Asn Trp Thr
275 280 285
Thr Val Thr Gly Ser Ala Leu Leu Pro Ala Gly Asn Val Asn Phe Gly
290 295 300
Ile Ser Ala Ile Thr Gly Gly Trp Asn Ile Asn Trp Phe Lys Ala Thr
305 310 315 320
Pro Glu Ser Cys Asp Asp Ile Asn Pro Pro Ser Thr Gly Ile Thr Ala
325 330 335
Lys Gln Ala Ala Ala Ala Met Gly Lys Gly Phe Asn Leu Gly Gln Met
340 345 350
Phe Glu Ser Thr Gln His Pro Arg Thr Phe Asn Ala Ala Lys Ser Lys
355 360 365
Ile Asp Ala Tyr Tyr Asn Met Gly Tyr Arg Asn Val Arg Ile Pro Ile
370 375 380
Thr Trp Thr Glu Ala Val Gly Gly Asn Arg Leu Val Ala Asp Ala Asn
385 390 395 400
Val Gly Ala Val Asn Arg Asn His Ser Arg Leu Ala Val Ile Thr Gln
405 410 415
Val Val Asp Tyr Ala Leu Ser Leu Pro Gly Met Tyr Val Val Ile Asn
420 425 430
Ala His His Glu Gly Gly Leu Lys Thr Asn Asn Arg Trp Trp Val Leu
435 440 445
Glu Thr Leu Trp Ala Asp Ile Ala Asp Ile Phe Lys Asp Arg Asp His
450 455 460
Arg Leu Leu Phe Glu Ile Leu Asn Glu Pro His Leu Ser Asp Ala Asn
465 470 475 480
Lys Ser Pro Met Pro Pro Ala Asn Leu Arg Phe Met Thr Gly Lys Ala
485 490 495
Tyr Asn Lys Ile Arg Ala Ile Asp Ala Gln Arg Ile Val Ile Ile Gly
500 505 510
Gly Asn Gln Trp Phe Gly Ala Gly Glu Met Ala Asn Val Trp Pro Asn
515 520 525
Leu Asn Asp Val Gly Gly Gly Ser Asp Ala Tyr Val Met Ala Thr Phe
530 535 540
His His Tyr Asp Pro Trp Ser Phe Ser Gly Asp Asn Gln Gly Asp Tyr
545 550 555 560
Ala Asp Ala Trp Thr Leu Ser Asn Val Gly Asn Pro Met Asp Ile Met
565 570 575
Gln Ser Trp Ala Asn Gly Val Gly Gln Gly Met Pro Val Tyr Ile Gly
580 585 590
Glu Trp Gly Val Gly Trp Gly Ser Arg Tyr Ser Ala Met Gln Cys Asn
595 600 605
Asn Ile Arg Tyr Trp Tyr Gln Leu Phe Asp Ala Ser Tyr Ala Ser Ala
610 615 620
Lys Gly Gln Pro Thr Ala Val Trp Asp Asp Gly Gly Trp Phe Lys Ile
625 630 635 640
Phe Asp His Gly Thr Asn Ser Phe Asn Asn Asn Leu Ala Gln Cys Ile
645 650 655
Gly Gly Asn Cys Ala Trp Asp Gly Ala Asp Arg Phe Asn Ser Gly Cys
660 665 670
Asn
<210> 12
<211> 2022
<212> DNA
<213> Microbulbifer degradans
<400> 12
atgttgattg gtactgttac ggcttcagca ctggttggtc gaggccgtgg cacccctaaa 60
aaaataatca acaagggttc tattatgtgg caaatcaaca aatcggcttt agcggccgtg 120
gtattagtgt gttcctcatc tagctttgcg caatctgcat gtgacactca acgcattgaa 180
gccgaaaatt acgtggcaat gagtggtatt caaaccgaaa gcacggcaga cactggtggc 240
ggtttaaatg tgggctggat agacgccggc gactggctta gttaccaagt taacctacct 300
gctgcagggc agtacgaggt gcgctatcgc gttgccagta gaaatggcgg cggtgtactt 360
cggttagagg gcaatgccgg tcaaaccttg tatggaacta tgaatgtacc caacacgggt 420
ggctggcaaa attggcaaac cctttctcat tcagtgacat tagcggcagg agagcagtct 480
attggtattg gtgtgccaag cggcgggttt aatattaatt ggctggagtt cgtaccttta 540
gattgcagtg ggccaatcga cccgcccatt aacccacctt cgaactgcgc gagcattgta 600
ttcgaggccg aaaattacga tcaaatgagc ggcattagaa cgcaaaccac aagtgatacc 660
ggaggcggct taaatgtggg gtggatagat gctggcgact ggcttagcta tgccactgtg 720
aatatcccca gcacgcaggt gtacaatttt gaataccgtg tggctagccc taatggcggc 780
agttttaatt tgcagggttc ggctggcgca gagaattttg ataccgctac tttgcccaat 840
acgggtggtt ggcaaaattg gacaacggta acaggctcgg cgcttttacc tgctggcaat 900
gtgaatttcg gtattagtgc gattactggt ggctggaata taaactggtt taaagctaca 960
ccagagagct gtgatgatat aaaccctcca agtaccggta ttactgctaa gcaagcagcg 1020
gcagccatgg gcaaggggtt taatttgggg caaatgttcg aaagtacgca acacccaaga 1080
acatttaatg ctgcaaaaag taaaatagat gcttactaca atatgggcta cagaaatgtg 1140
cgcatcccta ttacttggac tgaagccgta ggcggaaaca ggcttgttgc agatgcaaat 1200
gtaggcgcag tcaatcgcaa ccactctcgc ttagctgtaa ttactcaagt agtagattac 1260
gcgctttcgc tacccggcat gtacgtggtt attaatgcgc atcacgaagg tggattaaaa 1320
accaataatc gctggtgggt gttagaaact ctgtgggcag atattgccga tatatttaaa 1380
gacagagatc accgtttgct atttgaaata ttaaacgagc cacacctaag cgatgccaat 1440
aagtcgccta tgccccccgc caatttgcgt tttatgacgg gcaaagccta taacaaaatt 1500
cgcgcgatag atgcgcagcg aatcgttatt attggtggca accagtggtt tggtgcaggt 1560
gaaatggcaa acgtatggcc aaaccttaat gatgttggcg gcggttccga tgcatatgta 1620
atggctactt ttcaccatta cgacccgtgg tcgtttagtg gcgataacca aggcgattac 1680
gccgatgctt ggacgctatc taacgtgggt aacccaatgg atataatgca aagctgggca 1740
aacggcgtag gccaaggtat gcctgtgtat attggcgagt ggggcgtagg ttggggcagc 1800
cgctacagcg ccatgcagtg caataatatt cgctattggt accagctgtt cgacgcgagc 1860
tatgcctcgg caaaaggcca gcctacggca gtgtgggatg acggcggttg gtttaaaata 1920
ttcgaccacg gtaccaacag cttcaataat aatttagccc aatgtattgg tggaaactgc 1980
gcttgggatg gcgccgatag atttaattct ggctgtaatt aa 2022
<210> 13
<211> 365
<212> PRT
<213> Microbulbifer degradans
<400> 13
Met Arg Thr Thr Lys Phe Leu Ala Leu Ala Leu Cys Leu Leu Ala Ser
1 5 10 15
Ala Ser Ala Leu Ser Ala Asn Asn Ser Ala Pro Ser Asn Asp Trp Trp
20 25 30
Asp Ile Pro Tyr Pro Ser Gln Phe Asp Val Lys Ser Leu Lys Thr Gln
35 40 45
Ser Phe Ile Ser Val Lys Gly Asn Lys Phe Ile Asp Asp Lys Gly Lys
50 55 60
Thr Phe Thr Phe Arg Gly Val Asn Ile Ala Asp Thr Gly Lys Leu Leu
65 70 75 80
Ser Gln Asn Gln Trp Gln Lys Ser Leu Phe Glu Glu Leu Ala Asn Asn
85 90 95
Trp Gly Val Asn Thr Ile Arg Leu Pro Ile His Pro Val Ser Trp Arg
100 105 110
Lys Leu Gly Pro Asp Val Tyr Leu Gly His Ile Asp Glu Ala Val Arg
115 120 125
Trp Ala Asn Asp Leu Gly Ile Tyr Leu Ile Leu Asp Trp His Ser Ile
130 135 140
Gly Tyr Leu Pro Thr Glu Gln Tyr Gln His Pro Met Tyr Asp Thr Thr
145 150 155 160
Ile Lys Glu Thr Arg Asp Phe Trp Arg Arg Ile Thr Phe Arg Tyr Lys
165 170 175
Asn Val Pro Thr Val Ala Val Tyr Glu Leu Phe Asn Glu Pro Thr Thr
180 185 190
Met Gly Asn Thr Leu Gly Glu Arg Asn Trp Ala Glu Trp Lys Thr Leu
195 200 205
Asn Glu Ser Leu Ile Asp Met Ile Tyr Ala Ser Asp Lys Thr Val Ile
210 215 220
Pro Leu Val Ala Gly Phe Asn Trp Ala Tyr Asp Leu Ser Pro Ile Lys
225 230 235 240
Lys Ala Pro Ile Glu Arg Glu Gly Ile Ala Tyr Ala Ala His Pro Tyr
245 250 255
Pro Gln Lys Ala Lys Pro Glu Val Lys Asn Asp Lys Asn Phe Phe Lys
260 265 270
Leu Trp Asp Glu Lys Trp Gly Phe Ala Ala Asp Thr Tyr Pro Val Ile
275 280 285
Ala Thr Glu Leu Gly Trp Val Gln Pro Asp Gly Tyr Gly Ala His Ile
290 295 300
Pro Val Lys Asp Asp Gly Ser Tyr Gly Pro Arg Ile Val Lys Tyr Met
305 310 315 320
Gln Lys Lys Gly Val Ser Tyr Thr Val Trp Val Phe Asp Pro Asp Trp
325 330 335
Ser Pro Thr Met Ile Asn Asp Trp Asp Phe Thr Pro Ser Glu Gln Gly
340 345 350
Ala Phe Phe Lys Gln Val Met Leu Glu Ala Lys Lys Arg
355 360 365
<210> 14
<211> 1098
<212> DNA
<213> Microbulbifer degradans
<400> 14
atgcgcacaa ccaaatttct tgcgcttgca ctctgcttgc tggcctcagc cagtgcactg 60
agtgcaaata acagcgcccc atcaaacgac tggtgggata taccctaccc gagccaattc 120
gatgtaaaaa gccttaaaac gcaaagtttt atatcggtaa aaggtaacaa gttcattgat 180
gataagggca aaaccttcac ttttagaggg gtaaacattg ccgatacagg taagctactt 240
agccaaaatc aatggcaaaa atcgctgttt gaagagctgg ctaataactg gggggtaaat 300
actattcgcc tgcctattca ccctgtaagt tggcgtaaac ttgggccaga cgtttattta 360
ggccacatcg atgaggcggt acgctgggcg aatgatttag gtatttacct tattcttgat 420
tggcactcca ttggctattt gcccaccgag caataccaac accccatgta cgacaccacc 480
attaaagaaa cccgcgactt ttggcgcaga attacgttcc gctacaaaaa cgtgcccacc 540
gtagcggtat acgaattatt taatgagcca accaccatgg gtaacaccct aggcgaacgc 600
aactgggccg agtggaaaac cttaaatgaa agcctaattg atatgatata tgccagtgac 660
aaaaccgtca ttccgctggt tgcaggcttc aactgggcct atgatttatc gccaatcaaa 720
aaggcaccta tcgagcgtga aggcattgct tacgccgcac acccctaccc gcaaaaggcg 780
aaaccagagg ttaagaacga taaaaacttc ttcaaactgt gggacgaaaa gtggggcttt 840
gctgcagaca cctaccctgt aatagcaaca gagctaggct gggtacaacc cgatggttat 900
ggtgcccaca tacccgttaa agacgacggc agttacggcc cccgcatagt gaagtatatg 960
cagaaaaaag gcgtttctta cacggtatgg gtattcgacc ccgactggag cccaacaatg 1020
attaacgact gggattttac ccccagcgag caaggcgcgt tttttaaaca ggttatgcta 1080
gaagctaaaa aacgctaa 1098
<210> 15
<211> 638
<212> PRT
<213> Microbulbifer degradans
<400> 15
Met Thr Phe Thr Arg Met Lys Ser Ser His Gln Gly Ala Cys Arg Pro
1 5 10 15
Arg Ser Ser Thr Leu Gln Arg Leu Ile Ala Ser Ser Leu Thr Thr Ala
20 25 30
Cys Leu Leu Ala Ala Ser Thr Phe Ala Asp Val Ala Pro Leu Thr Val
35 40 45
Asp Gly Asn Arg Ile Leu Ser Gly Gly Gln Glu Ala Ser Phe Ala Gly
50 55 60
Asn Ser Leu Phe Trp Ser Asn Asn Tyr Trp Gly Gly Glu Lys Tyr Tyr
65 70 75 80
Thr Ala Glu Thr Val Asn Trp Leu Lys Gln Asp Trp Gly Ala Thr Leu
85 90 95
Val Arg Ala Ala Met Gly Val Glu Asp Asn Gly Gly Tyr Leu Asp Asp
100 105 110
Lys Glu Gly Asn Lys Gln Lys Val Lys Thr Val Val Asp Ala Ala Ile
115 120 125
Ala Asn Asp Met Tyr Val Ile Ile Asp Trp His Ser His His Ala Glu
130 135 140
Asp His Lys Ser Glu Ala Ile Ala Phe Phe Glu Asp Met Ala Arg Thr
145 150 155 160
Tyr Gly Asn Lys Lys His Val Ile Tyr Glu Ile Tyr Asn Glu Pro Leu
165 170 175
Gln Ile Ser Trp Ser Asn Thr Ile Lys Pro Tyr Ala Glu Asp Val Ile
180 185 190
Arg Ala Ile Arg Ala Ile Asp Pro Asp Asn Leu Ile Ile Val Gly Thr
195 200 205
Pro Thr Trp Ser Gln Asp Val Asp Val Ala Ser Gln Asp Pro Ile Thr
210 215 220
Gly Tyr Ala Asn Ile Ala Tyr Thr Leu His Phe Tyr Ala Gly Thr His
225 230 235 240
Lys Gln Ser Leu Arg Asp Lys Ala Gln Thr Ala Leu Asn Asn Gly Ile
245 250 255
Ala Leu Phe Ala Thr Glu Trp Gly Thr Val Asn Ala Asn Gly Asp Gly
260 265 270
Ala Val Asn Thr Thr Glu Thr Asp Lys Trp Met Thr Phe Phe Lys Thr
275 280 285
Asn His Ile Ser His Ala Asn Trp Ala Leu Asn Asp Lys Ser Glu Gly
290 295 300
Ala Ser Ala Leu Asn Pro Gly Ala Ser Pro Asn Gly Asn Trp Ser Asn
305 310 315 320
Ala Asp Leu Thr Thr Ser Gly Lys Tyr Val Lys Asn Ile Ile Lys Asn
325 330 335
Trp Asn Asp Gly Thr Pro Gly Gly Ser Ser Ser Ser Ser Ser Gly Gly
340 345 350
Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Asn Ser Ser Ser Gly
355 360 365
Ala Gly Lys Val Asn Leu Pro Ala Arg Ile Glu Ala Glu Asn Tyr Asn
370 375 380
Ser Ala Pro Val Glu Thr Thr Ala Gly Asn Ser Gly Gly Ser Val Ser
385 390 395 400
Gln Cys Thr Tyr Arg Gly Leu Asn Val Asp Val Gln Asp Ala Ser Glu
405 410 415
Gly Thr Cys Asn Ile Gly Trp Thr Ala Ala Gly Glu Lys Val Thr Tyr
420 425 430
Asn Ile Gly Thr Ala Asn Asn Thr Tyr Asn Ile Ala Leu Arg Thr Ala
435 440 445
Ser Leu Asp Ala Gly Lys Arg Val Ser Val Tyr Val Gly Asn Thr Leu
450 455 460
Ala Asp Thr Ile Ser Thr Gln Gly Gly Gly Trp Gln Asn Trp Lys Thr
465 470 475 480
Gln Thr Ile Pro Asn Val Tyr Ile Pro Ser Asn Ser Val Ile Thr Val
485 490 495
Glu Phe Tyr Asp Gly Arg Thr Asn Leu Asn Tyr Leu Asn Ile Ser Ala
500 505 510
Ala Ser Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser
515 520 525
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Gly Gly Ser Cys
530 535 540
Ser Ser Tyr Ile Asp Ile Pro Trp Asn Thr Arg Thr Glu Val Thr Leu
545 550 555 560
Thr Ser Gly Ala Cys Val Arg Phe Asn Gln Asn Leu Ser Gly Lys Thr
565 570 575
Leu Gln Val Trp Asp Ser Asp Ala Asn Ser Ser Cys Asp Phe Arg Gly
580 585 590
Thr Val Thr Thr Val Gly Gly Thr Gly Ser Leu Asn Val Ser Ser Asn
595 600 605
Tyr Val Ser Ser Lys Ser Leu Thr Gly Thr Lys Leu Thr Phe Asn Ser
610 615 620
Ala Ser Asn Asn Asn Cys Lys Tyr Val Lys Val Arg Ala Tyr
625 630 635
<210> 16
<211> 1917
<212> DNA
<213> Microbulbifer degradans
<400> 16
atgactttca caagaatgaa atcatcacac caaggcgcgt gtcgaccaag gtcttccacc 60
ctacagcgac taatcgcctc atcacttacc accgcatgtt tgctagcagc gtctactttt 120
gccgacgtag cgccgttaac cgtagatggc aaccgcattc tcagcggtgg ccaagaggct 180
agctttgccg gtaacagttt gttttggagc aacaattatt ggggcggtga gaaatactac 240
acagccgaaa ctgttaactg gttaaaacaa gactggggcg caacactagt gcgcgcggcc 300
atgggtgtag aagataacgg cggctaccta gatgacaaag aaggcaacaa acaaaaggta 360
aaaaccgttg tagatgctgc tattgccaac gacatgtatg taattatcga ttggcacagc 420
caccacgccg aagaccacaa aagtgaagcc attgcttttt ttgaggatat ggcgcgcacc 480
tacggcaata aaaaacacgt tatttacgaa atttataacg agcctttaca aatttcgtgg 540
agcaacacaa ttaaacccta cgccgaagat gtaattagag ctattcgcgc gatagacccc 600
gacaacttaa ttattgttgg tacgccaacg tggtcgcaag atgtagacgt agcatcgcaa 660
gaccccatta ccggctacgc caatattgcc tacacattgc acttttacgc aggcacccac 720
aaacaatctt tacgagacaa agcgcaaacc gcacttaaca acggcatagc gcttttcgca 780
acagagtggg gaacagtaaa tgcaaacggt gatggcgctg taaacaccac cgaaacagac 840
aagtggatga cgttctttaa aaccaaccac ataagccacg caaactgggc gctaaacgac 900
aaatcagaag gcgcttctgc attaaacccc ggagccagcc ccaatggcaa ctggagcaac 960
gccgacttaa ccacatcggg taagtacgta aaaaacatta tcaaaaactg gaacgacggc 1020
acgccgggag gcagctcttc aagctcgtcc ggcggctcaa ccagttcctc ctcaagctca 1080
tctagctcta attccagctc tggtgctggc aaagtaaatt tacccgcacg cattgaagcc 1140
gaaaactata acagtgcacc ggtagaaaca actgcaggca atagtggcgg cagcgtttca 1200
caatgtacat acagagggct aaatgtagac gtacaagacg caagcgaagg cacttgtaat 1260
attggctgga cagcagcagg cgaaaaagtt acctacaaca taggcacagc aaataatact 1320
tacaatattg cacttcgcac cgcatcgctt gatgcaggca agcgcgtatc ggtatatgta 1380
ggcaacaccc tcgccgacac aataagcacc caaggtggcg gctggcaaaa ttggaagacg 1440
caaaccatcc ccaatgtata tattccatca aactcagtta ttaccgtgga attctacgat 1500
ggccgcacca accttaacta cttaaacatt agtgcagctt cggggtcttc ctcttcaagc 1560
tcctcatcta gctcgtcaac gtctagctct tcttcgagct catcttctag ctcttcaggt 1620
ggtggcagtt gtagcagcta tatagatata ccttggaata ctcgcaccga agttacccta 1680
acaagtggcg cctgcgttcg ctttaaccaa aacctttcgg gcaaaaccct acaagtgtgg 1740
gatagcgatg caaactcatc gtgcgatttc cggggcacag ttacaacagt aggcggcact 1800
ggcagtttaa atgtaagcag caactatgtt tcgtctaaga gcctaacagg aaccaaactt 1860
acatttaatt cagcaagtaa taacaattgt aagtacgtta aagttcgtgc ttattag 1917
<210> 17
<211> 630
<212> PRT
<213> Microbulbifer degradans
<400> 17
Met Lys Ser Ala Thr Thr Asn Gln Ser Arg Ala Arg Ser Ser Ala Phe
1 5 10 15
Lys Asn Met Leu Ala Ala Ser Leu Ala Gly Leu Gly Leu Leu Ser Ala
20 25 30
Ser Ala Phe Ala Asp Val Ala Pro Leu Thr Val Asp Gly Asn Lys Ile
35 40 45
Leu Ser Gly Gly Gln Gln Ala Ser Phe Ala Gly Asn Ser Leu Phe Trp
50 55 60
Ser Asn Asn Gly Trp Gly Gly Glu Lys Tyr Tyr Thr Ala Gly Thr Val
65 70 75 80
Glu Trp Leu Lys Gln Asp Trp Gly Ser Asn Leu Val Arg Ala Ala Met
85 90 95
Gly Val Asp Glu Asn Gly Gly Tyr Leu Glu Asp Pro Ala Gly Asn Lys
100 105 110
Ala Lys Val Thr Thr Val Val Asp Ala Ala Ile Ala Asn Asp Met Tyr
115 120 125
Val Ile Ile Asp Trp His Ser His His Ala Glu Asp Tyr Gln Asn Gln
130 135 140
Ala Ile Ser Phe Phe Gln Asp Met Ala Arg Thr Tyr Gly Asn Asn Asn
145 150 155 160
Asn Val Ile Tyr Glu Ile Tyr Asn Glu Pro Leu Gln Val Ser Trp Ser
165 170 175
Gly Thr Ile Lys Pro Tyr Ala Glu Ala Val Ile Gly Ala Ile Arg Ala
180 185 190
Ile Asp Pro Asp Asn Leu Ile Ile Val Gly Thr Pro Thr Trp Ser Gln
195 200 205
Asp Val Asp Val Ala Ser Arg Asp Pro Ile Thr Gln Tyr Ser Asn Ile
210 215 220
Ala Tyr Thr Ile His Phe Tyr Ala Gly Thr His Lys Gln Ser Leu Arg
225 230 235 240
Asp Lys Ala Gln Thr Ala Leu Asn Asn Gly Ile Ala Leu Phe Ala Thr
245 250 255
Glu Trp Gly Thr Val Asn Ala Asn Gly Asp Gly Gly Val Asp Ala Ala
260 265 270
Glu Thr Asp Arg Trp Met Gln Phe Phe Lys Ala Asn His Ile Ser His
275 280 285
Ala Asn Trp Ala Leu Asn Asp Lys Ala Glu Gly Ser Ser Ala Leu Lys
290 295 300
Pro Gly Ser Asn Ala Asn Gly Gly Trp Ser Asn Ser Asp Leu Thr Ala
305 310 315 320
Ser Gly Thr Tyr Val Lys Asn Leu Ile Lys Thr Trp Asn Asp Gly Ser
325 330 335
Pro Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser
340 345 350
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly
355 360 365
Gly Thr Asn Leu Pro Ala Arg Ile Glu Ala Glu Asn Tyr Asp Ser Ala
370 375 380
Pro Val Glu Thr Thr Ala Gly Asn Ser Gly Ser Pro Thr Asn Cys Ser
385 390 395 400
Tyr Lys Gly Met Gly Val Asp Val Glu Asn Ser Thr Glu Gly Ala Cys
405 410 415
Asn Ile Gly Trp Thr Ala Ala Gly Glu Lys Val Thr Tyr Asn Ile Gly
420 425 430
Asn Ala Asp Gly Thr Tyr Asp Ile Ala Leu Arg Val Ala Ser Met Asp
435 440 445
Ala Gly Lys Arg Ile Ser Val His Val Asn Asn Ser Leu Ala Asp Thr
450 455 460
Val Thr Thr Gln Gly Gly Gly Trp Gln Ala Trp Thr Thr Glu Thr Ile
465 470 475 480
Ser Asn Val Tyr Ile Pro Ser Asn Ser Val Ile Thr Val Glu Phe Tyr
485 490 495
Asp Ser Gly Ser Asn Leu Asn Phe Leu Asn Ile Thr Glu Ser Ser Gly
500 505 510
Thr Glu Pro Pro Val Glu Pro Pro Val Glu Pro Pro Val Glu Pro Pro
515 520 525
Val Asp Asn Gly Asn Phe Pro Cys Asn Asp Gly Asn Ser Thr Leu Ala
530 535 540
Asn Asn Gly Ala Ser Ile Asn Leu Asn Gln Gly Ala Cys Val Lys Tyr
545 550 555 560
Asn His Gly Trp Gly Asp Ile Arg Leu Gly Thr Trp Ser Gly Asn Gly
565 570 575
Thr Ile Arg Tyr Asp Val Leu Asp Cys Asn Asn Asn Val Met Ser Asp
580 585 590
Ile Ala Gln Lys Leu Asn Asp Phe Thr Ala Val Asp Thr Ala Thr Met
595 600 605
Asn Cys Ala His Tyr Ile Tyr Val Lys Gln Ala Pro Ser Ser Tyr Thr
610 615 620
Leu Gln Phe Gly Ser Trp
625 630
<210> 18
<211> 1893
<212> DNA
<213> Microbulbifer degradans
<400> 18
atgaaatcag caaccacaaa tcaatcgagg gcacgcagta gcgcctttaa aaatatgttg 60
gcggcatcgc tcgcaggttt agggctacta tcagcttctg catttgccga tgtagccccg 120
ctaaccgtag acggcaataa aattcttagc ggtggccagc aagccagttt tgccggtaat 180
agcttatttt ggtctaacaa tggctggggc ggtgagaagt attacacggc cggtaccgtt 240
gaatggctaa agcaagactg gggcagtaat ttagttcgcg ccgcaatggg tgtcgatgaa 300
aacggcggct acttagaaga cccagcagga aacaaagcga aagtaacaac cgttgtagat 360
gcagccatcg ctaacgatat gtatgtaatt atcgattggc acagccacca cgccgaagac 420
taccaaaacc aagccattag ctttttccaa gatatggctc gcacctacgg taacaacaac 480
aacgttatat acgaaattta taacgagcca ttacaggttt cttggagcgg caccatcaag 540
ccttacgcag aagcggtaat tggcgcaatt cgcgcaatcg acccagataa ccttattatt 600
gtgggcacgc ctacttggtc gcaggatgta gacgtagcct cgcgcgaccc catcacgcag 660
tacagcaaca ttgcctacac tattcacttt tatgcgggca cccacaaaca atccctacgc 720
gataaagcac aaaccgcatt aaataatggt attgctttgt ttgctaccga atggggtaca 780
gtaaatgcca acggtgacgg cggtgtagac gcagccgaaa ctgatcgttg gatgcagttt 840
tttaaagcga atcatataag ccatgccaac tgggccttaa acgataaagc cgaaggctct 900
tctgcattaa agcctggctc taacgcaaac ggcggctgga gcaattccga cttaaccgcc 960
tctggtacct atgttaaaaa cttaattaaa acatggaacg acggctcacc gagcagcagc 1020
tcatctagca gcaccagttc ttcttcaagc agctcctcgt ctagtagctc atcatctagc 1080
agctcttcat ctagtagttc tggcggtacc aatttacccg cgcgcattga agcagaaaac 1140
tacgatagcg caccggtaga aaccactgca ggtaatagcg gctcacccac caattgttcg 1200
tataaaggta tgggcgtaga tgtagaaaac tctactgaag gtgcttgtaa tattggctgg 1260
actgcggcag gcgaaaaagt aacttacaac attggcaatg ccgatggcac ttacgatatt 1320
gcattgcgcg tagcctctat ggatgcgggc aaacgtatct ctgtgcatgt aaacaacagc 1380
ctagcagata ccgtaaccac acaaggtggc ggctggcagg catggactac cgaaaccatt 1440
tctaacgtgt atatcccatc aaactcggta attaccgttg agttttacga tagtggctct 1500
aacctaaact ttttaaacat taccgaaagc tcgggtaccg aaccacctgt agaaccaccc 1560
gttgagccgc cagtagaacc acccgtagac aacggtaact tcccatgtaa cgacggtaac 1620
tctacgcttg ccaacaacgg cgcctccatt aaccttaacc aaggagcgtg tgttaaatac 1680
aatcacggct ggggcgatat tcgtttaggc acctggagcg gcaacggtac cattcgatac 1740
gacgtactag actgcaataa caacgtaatg agtgatattg cacaaaaact taatgacttt 1800
actgctgtag acaccgcaac aatgaactgc gcacactaca tttatgtaaa acaagcccct 1860
agcagctaca ccctgcaatt tggtagctgg tag 1893
<210> 19
<211> 725
<212> PRT
<213> Microbulbifer degradans
<400> 19
Met Lys Ile Asn Thr Leu Phe Thr Pro Leu Arg Thr Val Gly Ala Ala
1 5 10 15
Val Ala Ile Ala Leu Ser Pro Val Ala Phe Ala Asp Val Thr Cys Glu
20 25 30
Val Thr Asn Phe Asn Gln Trp Asn Ser Gly Tyr Gln Ala Asp Val Arg
35 40 45
Val Thr Asn Ser Gly Ser Ala Val Ser Gly Trp Thr Val Asn Leu Asn
50 55 60
Phe Ala Ser Ala Pro Gln Met Thr Asn Gly Trp Asn Ala Ala Leu Ser
65 70 75 80
Thr Ser Gly Asn Thr Ile Ser Ala Ser Asn Ile Ser Trp Asn Gly Asn
85 90 95
Leu Gly Asn Gly Gln Ser Thr Ser Phe Gly Phe Gln Gly Asn Ser Asn
100 105 110
Gly Asn Leu Ala Thr Pro Thr Cys Val Gly Ser Gly Thr Gly Ser Ser
115 120 125
Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Thr
130 135 140
Ser Ser Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Gly
145 150 155 160
Gly Glu Cys Val Glu Met Cys Lys Trp Tyr Gln Asp Ala Pro Arg Pro
165 170 175
Leu Cys Asn Asn Gln Asp Ser Gly Trp Gly Trp Glu Asn Asn Gln Ser
180 185 190
Cys Ile Gly Arg Thr Thr Cys Asn Ser Gln Ser Gly Asn Gly Gly Val
195 200 205
Ile Asn Ser Cys Pro Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr
210 215 220
Ser Ser Thr Ser Ser Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Thr
225 230 235 240
Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Thr Ser Ser Thr Ser Ser
245 250 255
Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Ser Ser Gly Gly Gly Val
260 265 270
Phe Arg Val Asp Ala Thr Gly Asn Ile Thr Lys Asn Gly Glu Val Leu
275 280 285
Pro Val Arg Cys Gly Asn Trp Phe Gly Leu Glu Gly Gln His Glu Pro
290 295 300
Ser Asp Ala Gln Asn Asn Pro Gly Gly Ala Pro Leu Glu Leu Tyr Val
305 310 315 320
Gly Asn Met Trp Trp Val Asp Ser Gly Arg Thr Ile Gln Gln Thr Met
325 330 335
Ser Glu Ile Thr Ala Gln Gly Ile Asn Met Val Arg Leu Pro Ile Ala
340 345 350
Pro Gln Thr Leu Asn Pro Asn Asp Pro Gln Gly Val Gly Asp Val Arg
355 360 365
Asn Gly Gly Val Leu Lys Asn His Glu Ser Val Gln Gln Thr Asn Ala
370 375 380
Arg Gln Ala Leu Glu Asp Phe Ile Val Gln Ala Asn Glu Asn Asp Ile
385 390 395 400
Gln Val Leu Ile Asp Ile His Ser Cys Ser Asn Tyr Val Gly Trp Arg
405 410 415
Ala Gly Arg Leu Asp Ala Glu Pro Pro Tyr Val Asp Ala Thr Arg Val
420 425 430
Gly Tyr Asp Phe Thr Arg Glu Asp Tyr Ser Cys Gly Thr Asn Val Gly
435 440 445
Pro Gly Val Thr Val His Glu Tyr Asn Glu Glu Ile Trp Leu Asn Asn
450 455 460
Leu Arg Glu Ile Ala Gly Leu Ser Glu Ser Leu Gly Val Asp Asn Ile
465 470 475 480
Ile Gly Ile Asp Ile Phe Asn Glu Pro Trp Asp Tyr Thr Trp Glu Glu
485 490 495
Trp Lys Ala Leu Ser Glu Ser Ala Tyr Gln Ala Ile Ser Glu Val Asn
500 505 510
Pro Asp Ile Leu Ile Phe Val Glu Gly Val Ala Gly Gly Thr Gly Ala
515 520 525
Gly Val Asp Val Pro His Gly Asp Glu Ser Ser Asn Pro Asn Trp Gly
530 535 540
Glu Asn Phe Tyr Pro Ala Gln Thr Ala Pro Leu Asn Ile Pro Lys Asp
545 550 555 560
Arg Leu Val Ile Ser Pro His Thr Tyr Gly Pro Ser Val Phe Val Gln
565 570 575
Arg Gln Phe Met Asp Pro Asn Asp Pro Glu Cys Val Gly Leu Glu Gly
580 585 590
Asp Glu Ala Ala Glu Ala Gly Cys Gln Ile Val Ile Asp Tyr Ala Thr
595 600 605
Leu Ala Ala Gly Trp Asp Glu His Phe Gly Phe Leu Arg Glu Gln Gly
610 615 620
Phe Ala Met Val Val Gly Glu Phe Gly Gly Asn Met Asp Trp Pro Asn
625 630 635 640
Gly Thr Arg Gln Ala Glu Lys Asp Met Trp Ser His Ile Thr Pro Gly
645 650 655
Ile Asp Arg Gln Trp Gln Glu Ala Phe Val Asp Tyr Met Val Glu Lys
660 665 670
Asn Ile Gln Ala Cys Tyr Trp Ser Ile Asn Pro Glu Ser Gly Asp Thr
675 680 685
Gly Gly Trp Tyr Gly His Glu Tyr Asp Pro Val Ser Asn Asp Ala Gly
690 695 700
Trp Gly Arg Trp Leu Asp Phe Asp Ser Arg Lys Thr Asn Leu Leu Lys
705 710 715 720
Glu Leu Trp Gly Ile
725
<210> 20
<211> 71
<212> PRT
<213> Microbulbifer degradans
<400> 20
Met Leu Glu Val Glu Leu Leu Leu Val Glu Leu Val Glu Leu Asp Glu
1 5 10 15
Val Leu Glu Val Leu Leu Val Glu Leu Asp Glu Val Leu Glu Val Leu
20 25 30
Asp Glu Leu Val Asp Glu Val Leu Glu Glu Leu Glu Glu Leu Glu Glu
35 40 45
Leu Gly Gln Leu Leu Ile Thr Pro Pro Leu Pro Asp Trp Leu Leu Gln
50 55 60
Val Val Arg Pro Ile Gln Leu
65 70
<210> 21
<211> 2178
<212> DNA
<213> Microbulbifer degradans
<400> 21
atgaaaatca acactctctt tacgcctttg cgtactgtgg gtgctgcagt tgcgatagct 60
ttatcgcctg tagcctttgc agacgtaacg tgcgaagtaa cgaactttaa ccagtggaat 120
agtggctacc aagccgatgt tcgtgttaca aacagcggta gcgctgttag tggctggacc 180
gtaaatttaa attttgcctc agccccgcaa atgacaaatg gctggaacgc agctttgagt 240
actagcggca atacaattag tgcatctaat attagttgga atggcaattt gggtaatggt 300
cagtccacca gctttggttt tcagggcaat tcaaatggta acttggcaac gccaacgtgt 360
gtaggtagcg gtacggggtc ttctagcagc tcttcatcca gctctacttc tagcacaagc 420
tcatcatcta caagttcttc tagcacgtct tctactagct ctagcagttc atcctctggt 480
ggtgaatgtg tagaaatgtg taagtggtat caagatgcac cgcgcccatt atgtaataat 540
caagacagtg gttggggttg ggaaaacaat caaagctgta ttggtcgcac tacttgtaac 600
agccaatctg gcaatggtgg tgtaattaat agttgcccaa gttcttcaag ttcttcaagt 660
tcttctagca cttcgtctac cagctcatct agtacttcaa gtacttcatc gagctcaaca 720
agtagtactt caagcacttc atcaagttcc acaagctcta ctagcagcag ctcaacctct 780
agcactagct cgtcgtcttc aagtggtggt ggagtattcc gcgtagatgc taccggtaat 840
attactaaaa atggtgaagt actgcctgtt cgttgtggta actggtttgg tctagagggc 900
cagcacgagc cttcagatgc gcaaaataac ccaggcggtg cgccgcttga attatatgtt 960
ggcaacatgt ggtgggtaga tagtggccgc actattcagc aaaccatgag cgaaattacc 1020
gcccaaggta tcaacatggt tcgcttgcct attgcaccgc aaacattaaa ccctaacgac 1080
cctcaaggtg tgggtgatgt gcgcaacggc ggcgtgctta aaaatcacga atctgtgcag 1140
caaaccaatg cacgtcaagc gttagaagac ttcattgttc aagctaacga aaatgacatt 1200
caagtgctaa ttgatattca ctcttgtagt aactacgtgg gttggcgtgc aggccgttta 1260
gatgcagagc ctccttatgt ggatgcaacg cgagtgggtt atgactttac ccgtgaagat 1320
tattcttgtg gcaccaatgt gggcccaggt gtaactgtgc acgagtacaa cgaggaaatt 1380
tggttaaaca acttgcgtga gattgctggt ttatctgaat ccttgggcgt tgataatatt 1440
atcggtatcg atatttttaa cgaaccatgg gattacactt gggaagagtg gaaagcactt 1500
tctgaaagcg cttatcaagc cattagcgaa gttaacccag atattctaat ctttgttgag 1560
ggtgttgcag gcggcacggg tgctggtgtt gatgtgccac atggagacga gtcttctaac 1620
cctaactggg gcgaaaactt ttatcctgcg caaactgctc cgcttaatat tccaaaagat 1680
cgtctagtta tttcaccgca tacctatggc ccatctgtat ttgttcagcg tcaatttatg 1740
gacccgaatg atccagagtg tgttggttta gaaggtgatg aggcggctga agctggctgt 1800
caaattgtta tcgattatgc aaccttagca gctggttggg atgagcattt cggcttctta 1860
cgtgagcaag gctttgccat ggtagtgggt gagtttggtg gcaacatgga ttggccaaat 1920
ggcacgcgcc aagcagaaaa agatatgtgg agccacatca cccctggaat cgacagacag 1980
tggcaagaag cgtttgttga ctacatggtt gagaaaaaca tccaagcttg ttactggtca 2040
attaacccag agtctggcga cactggcggt tggtatggtc acgagtacga ccctgtttct 2100
aacgatgcag gttgggggcg ttggttagac ttcgattctc gcaaaactaa cttacttaaa 2160
gagctttggg gtatttaa 2178
<210> 22
<211> 610
<212> PRT
<213> Microbulbifer degradans
<400> 22
Met Met Tyr Thr Asn Leu Phe Asn Leu Lys Lys His Leu Phe Gln Thr
1 5 10 15
Ser Leu Lys Leu Leu Ala Cys Ala Thr Leu Ile Gly Gly Thr Leu Asn
20 25 30
Ala Ala Ala Asp Val Pro Ala Met Ser Val Gln Gly Asn Lys Val Leu
35 40 45
Val Gly Gly Glu Val Lys Ser Leu Gly Gly Met Ser Tyr Phe Trp Ser
50 55 60
Asn Asn Gly Trp Gly Gly Glu Lys Tyr Tyr Asn Ala Ser Thr Val Ser
65 70 75 80
Tyr Phe Lys Gln Asp Trp Lys Ala Ser Ile Val Arg Ala Ala Met Gly
85 90 95
Val Glu Asp Ala Gly Gly Tyr Phe Asp Asp Pro Gln Gly Ser Lys Gln
100 105 110
Lys Val Arg Thr Ile Val Asp Ala Ala Ile Ala Asn Asp Met Tyr Val
115 120 125
Ile Ile Asp Trp His Ser His Tyr Ala Asn Thr His Asp Trp Ala Ala
130 135 140
Ala Val Gln Phe Phe Gln Glu Met Ala Arg Asp Tyr Gly Gln Tyr Asn
145 150 155 160
Asn Val Ile Tyr Glu Val Tyr Asn Glu Pro Leu Asp Ile Pro Trp Gly
165 170 175
His Ile Lys Ser Tyr Ala Glu Thr Val Ile Asp Ala Ile Arg Ala Ile
180 185 190
Asp Pro Asp Asn Val Ile Val Val Gly Thr Pro Arg Trp Ser Gln Gly
195 200 205
Val Lys Glu Ala Ser Trp Asp Pro Ile Asn Arg Asn Asn Ile Ala Tyr
210 215 220
Thr Leu His Phe Tyr Ser Gly Ser His Gly Gln Trp Leu Arg Asn Asp
225 230 235 240
Ala Ala Glu Ala Met Ser Asn Gly Ile Ala Leu Phe Val Thr Glu Trp
245 250 255
Gly Ser Val Asn Ala Asn Gly Asp Gly Ala Val Asn Glu Gly Glu Thr
260 265 270
Ala Ala Trp Met Asn Phe Met Arg Asp Asn Gly Ile His His Ala Asn
275 280 285
Trp Ser Val Asn Asp Lys Ala Glu Gly Ala Ser Ala Leu Asn Pro Gly
290 295 300
Ala Ser Ala Thr Gly Gly Trp Gly Asp Gly Asp Leu Thr Trp Ser Gly
305 310 315 320
His Val Val Arg Gly Tyr Leu Arg Asp Trp Asn Gln Ile Gly Ser Gly
325 330 335
Asn Gly Asn Gly Asn Gly Thr Gly Cys Thr Glu Val Ser Leu Pro Gly
340 345 350
Thr Ile Glu Ala Glu Ala Tyr Cys Ala Met Asp Gly Ile Gln Thr Glu
355 360 365
Asn Thr Asn Asp Thr Asn Gly Gly Ser Asn Val Gly Tyr Ile Asp Ala
370 375 380
Gly Asp Trp Met Ser Tyr Ser Val Asn Val Ala Asn Ala Gly Thr Tyr
385 390 395 400
Thr Val Ser Tyr Arg Val Ala Ser Leu Gly Gly Gly Gly Val Leu Ser
405 410 415
Ile Glu Asn Ala Gly Gly Ser Pro Val Tyr Gly Thr Leu Asn Val Pro
420 425 430
Gln Thr Gly Gly Trp Gln Glu Trp Thr Thr Val Ser His Asp Ile Ser
435 440 445
Leu Gln Ala Gly Gln Gln Asn Ile Gly Ile Ala Ala Ile Glu Gly Gly
450 455 460
Phe Asn Ile Asn Trp Ile Ala Leu Thr Pro Ala Gly Thr Asn Pro Asn
465 470 475 480
Pro Val Gln Ser Ile Thr Leu Gln Ala Glu Asp Tyr Ser Phe Met Ser
485 490 495
Gly Val Gln Val Glu Asn Thr Ser Asp Asn Gly Gly Gly Met Asn Val
500 505 510
Gly Trp Leu Asp Ala Gly Asp Trp Leu Ala Tyr His Gly Val Asn Ile
515 520 525
Pro Thr Ser Gly Gln Tyr Thr Ile Thr Tyr Arg Val Ala Ser Gln Ser
530 535 540
Gly Gly Gly Ser Leu Gln Leu Glu Gln Ala Gly Gly Gly Val Val Tyr
545 550 555 560
Gly Asn Leu Asn Val Pro Ser Thr Gly Gly Trp Gln Asn Trp Val Asp
565 570 575
Val Ser His Thr Val Thr Leu Asn Ala Gly Val Gln Asp Phe Gly Leu
580 585 590
Gly Ile Thr Ser Gly Gly Phe Asn Ile Asn Trp Ile Lys Val Glu Ala
595 600 605
Ile His
610
<210> 23
<211> 1833
<212> DNA
<213> Microbulbifer degradans
<400> 23
atgatgtaca caaacctctt taatttaaaa aagcacctct ttcaaacctc acttaaacta 60
ctggcctgcg ccacattaat tggcggcacc ctaaacgcag ccgctgacgt gccagcaatg 120
tccgtacaag gcaataaagt actggtgggc ggtgaagtta aaagccttgg aggtatgagc 180
tatttttggt ctaacaacgg ctggggcggc gagaaatact acaacgcttc taccgttagt 240
tacttcaagc aagactggaa ggcatccatt gttcgagctg caatgggggt agaagatgcc 300
ggcggctact tcgatgaccc gcagggctct aagcaaaaag ttcgtacaat agtagatgcc 360
gccattgcga atgatatgta cgtcattatc gattggcact cacattacgc caacacccac 420
gactgggcag ccgctgtgca atttttccaa gaaatggcac gtgactatgg ccaatacaat 480
aatgtgattt atgaggtata caacgaacca ctggatatcc cttggggcca cataaaaagc 540
tacgccgaaa cggtaattga tgccattcgc gcaattgacc cagataacgt gatcgtagta 600
ggcactcctc gctggtcgca gggggtaaaa gaagcgtcat gggacccaat caaccgcaat 660
aatattgcct acacgctgca cttctattca ggtagtcatg gccaatggct gcgcaacgac 720
gcagcagaag ctatgagtaa tggtattgcc ttgtttgtta ctgaatgggg cagcgtaaat 780
gccaatggcg atggcgcagt caacgaaggc gaaaccgcag cgtggatgaa cttcatgcgc 840
gataacggta tccatcacgc aaactggtct gtaaacgaca aagcagaggg tgcatctgca 900
cttaaccctg gcgccagtgc cacaggtggt tggggcgacg gcgatttgac ttggtctggc 960
catgttgtgc gcggctacct gcgcgactgg aaccaaattg gttctggcaa tggtaacggc 1020
aacggcacag gctgcaccga ggttagccta ccaggcacga tagaagcgga agcctactgc 1080
gcaatggatg gtatccaaac cgaaaacacc aacgacacca acggcggcag taacgtgggc 1140
tacatagatg ctggcgactg gatgagctac agcgtaaacg ttgctaacgc aggcacttat 1200
accgtgtctt accgcgtggc tagccttggc ggcggcggtg ttctaagcat tgaaaatgcc 1260
ggcggctcgc ccgtttatgg cacgctgaat gtaccgcaaa ctggcggctg gcaagaatgg 1320
accactgtat ctcacgatat tagcttgcaa gccggccaac aaaacattgg catagcggca 1380
atagaaggtg gttttaacat caactggata gccctaaccc ctgctggcac caaccccaac 1440
ccagtgcaaa gtattacctt acaagcagaa gactactcct ttatgagtgg cgtgcaggta 1500
gaaaatacta gcgacaatgg cggcggtatg aacgtaggct ggttagatgc tggcgactgg 1560
cttgcctacc acggcgtaaa cattccaacc tctggccaat acaccataac ttaccgagta 1620
gccagccaaa gcggtggtgg aagcctgcag ctagaacaag caggtggcgg cgttgtttac 1680
ggtaacctga acgtaccaag cactggcggc tggcagaact gggtagacgt aagccatacc 1740
gttaccctta acgctggtgt acaagatttt gggttaggta ttactagtgg tggcttcaat 1800
attaactgga taaaagtcga ggcaattcac taa 1833
<210> 24
<211> 791
<212> PRT
<213> Microbulbifer degradans
<400> 24
Met Leu Ala Ser Asn Lys Asn Ser Lys Leu Ala Asn Ser Glu Gln His
1 5 10 15
Arg Pro Tyr Lys Thr Arg Thr Ala Arg Trp Leu Thr Gly Ser Gly Val
20 25 30
Ile Ala Ser Ser Leu Leu Phe Ser Ala Gln Ser Phe Ala Ala Gln Cys
35 40 45
Glu Tyr Ile Ile Ser Asn Glu Trp Asn Ser Gly Phe Thr Gly Ala Val
50 55 60
Arg Ile Thr Asn Asn Gly Thr Thr Pro Ile Asn Gly Trp Asp Val Ser
65 70 75 80
Trp Gln Tyr Ala Gly Asp Ala Val Thr Ser Ser Trp Asn Ala Asn Val
85 90 95
Ser Gly Ser Asn Pro Val Ser Ala Thr Pro Leu Ser Trp Asn Ala Asn
100 105 110
Ile Gln Pro Gly Gln Ser Val Glu Phe Gly Phe Gln Gly Ser Lys Ala
115 120 125
Gly Ser Asn Ala Glu Ile Pro Thr Val Thr Gly Ala Val Cys Asp Ser
130 135 140
Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
145 150 155 160
Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser
165 170 175
Thr Ser Ser Ser Ser Ser Ser Ser Gly Ser Ser Gly Thr Gly Gly Ile
180 185 190
Ala Cys Thr Val Gly Asn Ala Asn Ile Trp Gly Ser Gly Tyr Gln Leu
195 200 205
Asp Met Gln Val Val Asn Asn Gly Thr Ala Ala Val Ser Ser Trp Asp
210 215 220
Val Thr Met Ala Phe Gly Glu Ala Pro Gln Arg Thr Gly Gly Trp Asn
225 230 235 240
Ala Asn Phe Val Glu Ser Gly Asn Thr Ile Val Ala Ser Asn Ile Ser
245 250 255
Trp Asn Gly Asn Leu Ala Pro Gly Gln Ser Ala Ser Phe Gly Ile Gln
260 265 270
Gly Asn His Asp Gly Ser Phe Gly Gly Val Thr Cys Asn Gly Ala Ser
275 280 285
Ser Ser Gly Ser Ser Ser Ser Gly Ser Ser Thr Ser Ser Ser Ser Ser
290 295 300
Ser Ser Ser Ser Ser Gly Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser
305 310 315 320
Ser Thr Gly Ser Thr Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Thr
325 330 335
Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Ser
340 345 350
Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
355 360 365
Ser Ser Ser Thr Ser Gly Ser Gly Ala Gly Phe Asp Asn Pro Phe Ile
370 375 380
Gly Gly Lys Trp Tyr Val Asp Pro Val Trp Ser Ala Lys Ala Ala Ala
385 390 395 400
Glu Pro Asn Gly Ser Leu Ile Ala Asn Tyr Asn Thr Ala Val Trp Met
405 410 415
Asp Arg Ile Gly Ala Ile Glu Gly Pro Glu Asp Gly Asp Gly Met Gly
420 425 430
Leu Glu Glu His Leu Asp Glu Ala Leu Ala Gln Gly Ala Asp Ile Phe
435 440 445
Met Phe Val Val Tyr Asp Leu Pro Asn Arg Asp Cys Ala Ala Leu Ala
450 455 460
Ser Ser Gly Glu Leu Leu Ile Ala Glu Asn Gly Phe Glu Arg Tyr Gln
465 470 475 480
Asn Glu Tyr Ile Gly Pro Ile Val Asp Ile Leu Ser Lys Pro Ala Tyr
485 490 495
Ser Ser Leu Arg Ile Ile Ala Ile Ile Glu Val Asp Ser Leu Pro Asn
500 505 510
Leu Val Thr Asn Leu Asn Ile Gln Lys Cys Val Glu Ala Asn Gly Pro
515 520 525
Gly Gly Tyr Val Asp Gly Ile Gln His Ala Leu Asn Glu Leu Asn Thr
530 535 540
Leu Asp Asn Val Tyr Pro Tyr Val Asp Ile Ala His Ser Gly Trp Leu
545 550 555 560
Gly Trp Ser Asp Asn Phe Ala Gly Ala Thr Lys Leu Ile Gly Asp Ala
565 570 575
Ile Lys Gly Thr Asn Lys Gly Val Asn Ser Ile Ala Gly Phe Val Ser
580 585 590
Asn Ser Ser Asn Tyr Thr Pro Val Thr Glu Pro Tyr Leu Pro Asn Pro
595 600 605
Thr Leu Gln Ile Gly Ser Asn Gln Val Arg Ser Ala Asp Phe Tyr Glu
610 615 620
Trp Thr Met Tyr Phe Glu Glu Leu Ser Phe Val Gln Asp Trp Arg Gln
625 630 635 640
Ala Met Ile Gln Gln Gly Phe Pro Glu Ser Ile Gly Met Leu Ile Asp
645 650 655
Thr Ala Arg Asn Gly Trp Gly Gly Pro Asp Arg Pro Thr Gly Glu Ser
660 665 670
Thr Ser Thr Asp Leu Asn Thr Tyr Val Asn Glu Ser Arg Ile Asp Arg
675 680 685
Arg Gln His Arg Gly Asn Trp Cys Asn Gln Pro Gly Gly Val Gly Phe
690 695 700
Arg Pro Gln Ala Ala Pro Glu Pro Gly Val Asp Ala Tyr Val Trp Val
705 710 715 720
Lys Pro Gln Gly Glu Ser Asp Gly Ile Ser Asp Pro Asn Phe Pro Ile
725 730 735
Asp Pro Asn Asp Pro Ala Lys Gln His Asp Pro Met Cys Asp Pro Asn
740 745 750
Ala Pro Asn Arg Asp Asn Asn Ala Val Gly Thr Gly Ala Leu Asp Asn
755 760 765
Ala Pro His Ala Gly Arg Trp Phe Pro Glu Ala Phe Gln Ile Leu Ile
770 775 780
Glu Asn Ala Tyr Pro Pro Leu
785 790
<210> 25
<211> 65
<212> PRT
<213> Microbulbifer degradans
<400> 25
Met Pro Leu Glu Pro Asp Asp Glu Leu Asp Glu Glu Val Leu Glu Glu
1 5 10 15
Leu Asp Glu Glu Leu Leu Val Leu Leu Glu Leu Leu Glu Glu Leu Asp
20 25 30
Glu Leu Asp Asp Glu Leu Glu Leu Glu Glu Leu Glu Pro Leu Ser His
35 40 45
Thr Ala Pro Val Thr Val Gly Ile Ser Ala Leu Glu Pro Ala Leu Leu
50 55 60
Pro
65
<210> 26
<211> 112
<212> PRT
<213> Microbulbifer degradans
<400> 26
Met Asn Gly Leu Ser Lys Pro Ala Pro Leu Pro Glu Val Glu Glu Leu
1 5 10 15
Glu Leu Glu Glu Asp Glu Leu Glu Leu Asp Asp Val Leu Glu Leu Glu
20 25 30
Leu Glu Leu Glu Leu Val Glu Leu Val Leu Glu Leu Glu Leu Glu Glu
35 40 45
Val Leu Glu Val Glu Glu Leu Leu Glu Leu Glu Val Glu Pro Val Glu
50 55 60
Glu Leu Glu Leu Glu Leu Glu Val Leu Glu Pro Asp Glu Leu Asp Glu
65 70 75 80
Leu Leu Asp Glu Leu Val Leu Glu Pro Glu Leu Asp Glu Pro Glu Leu
85 90 95
Glu Ala Pro Leu Gln Val Thr Pro Pro Lys Glu Pro Ser Trp Phe Pro
100 105 110
<210> 27
<211> 2376
<212> DNA
<213> Microbulbifer degradans
<400> 27
atgttggctt ctaataaaaa tagtaagctg gcaaactctg agcaacaccg cccttataaa 60
acccgcacag cgcgctggtt aaccgggtct ggggttattg cttcaagttt gcttttttct 120
gcgcagagtt ttgcggcgca atgtgaatac atcattagca atgaatggaa cagcggcttt 180
actggcgcag ttcgcattac taataatggc actactccca tcaatggctg ggatgttagc 240
tggcagtatg ccggcgatgc agtcaccagc agctggaacg cgaatgtttc tggctcgaac 300
cccgtttctg ctacaccatt aagctggaat gccaacattc aacccggtca aagcgttgag 360
tttggttttc agggcagcaa agccggctcc aatgcagaaa ttccaaccgt taccggcgcg 420
gtatgtgata gcggctctag ctcttccagc tccagctcat catctagttc atcaagctct 480
tctagtagct caagcagcac tagcagctcc tcgtccagct cttcaagcac ctcttcgtct 540
agctcatcat ctggctccag tggcacaggt ggtattgcgt gtactgtagg caatgcgaat 600
atttggggct cgggctacca gctggacatg caagttgtta acaacggcac cgctgcagta 660
agcagttggg acgtaaccat ggcattcggc gaggcaccac agcgcaccgg tggctggaac 720
gcaaactttg tagagtcagg caataccatt gttgcgagca acattagctg gaacggcaac 780
ctcgcaccgg ggcaatcagc ttcgtttggt attcaaggga accacgacgg ctcttttggc 840
ggcgtaacct gtaacggcgc ttcaagctct ggctcgtcta gttctggctc tagcaccagc 900
tcatcaagta gctcatccag ttcgtctggc tctagcactt ctagctctag ctcaagctcc 960
tctactggtt ctacctctag ctctagtagc tcttcaactt ctagcacttc ttcaagttct 1020
agctctagca ccagctccac gagttctagc tccagttcta gctcgagtac atcgtctagt 1080
tccagctcat cttcctcaag ctctagctct tctacttcag gcagtggcgc aggttttgac 1140
aacccgttca ttggcggcaa gtggtatgta gacccagtat ggtcagcaaa agctgcagca 1200
gagccaaacg gttcacttat tgccaactac aacacggcag tttggatgga tcgcattggt 1260
gcgattgaag gcccagaaga tggcgatggt atgggcttag aagaacactt agatgaagct 1320
ttagcacaag gtgcagacat ctttatgttc gtggtatacg acctaccaaa ccgcgactgt 1380
gcagctttgg cctcaagtgg cgaactactc attgccgaga acggttttga gcgctatcaa 1440
aatgagtaca ttggcccaat cgtagatata ctcagcaagc ccgcgtattc tagcttgcgt 1500
attatcgcga ttattgaagt ggattctcta cccaacctcg ttaccaacct caacattcaa 1560
aaatgtgttg aagcgaatgg cccgggtggg tacgtagacg gtatccaaca tgcacttaac 1620
gagctaaaca cgcttgataa tgtgtaccca tacgtcgata ttgctcactc aggctggcta 1680
ggctggagcg acaacttcgc cggcgccacc aagcttattg gtgatgcaat taaaggcaca 1740
aacaaaggtg taaacagtat tgcaggcttt gtaagtaact cttctaacta cacacctgtg 1800
actgaaccat acctacctaa ccctaccttg caaattggta gcaaccaagt tcgatctgcg 1860
gatttctacg agtggaccat gtacttcgaa gaacttagct ttgtacaaga ttggcgccaa 1920
gccatgattc agcaaggctt cccagaatca attggtatgc ttattgatac cgcacgtaat 1980
ggctggggtg gacctgaccg tccaactggt gagtctacat ctaccgacct caacacctat 2040
gtgaatgaat cgcgtataga ccgccgtcag catcgcggaa actggtgtaa ccagcccggt 2100
ggtgttggct tccgtccgca agcggcacca gaaccaggtg tagacgctta cgtttgggtt 2160
aagccacaag gtgagtcgga tggtattagt gatcctaact tccctatcga ccctaacgac 2220
ccagctaaac agcacgaccc aatgtgtgat ccaaacgcac ctaaccgcga taacaatgcg 2280
gttggcacag gcgcgctaga taacgctcca catgctggtc gctggttccc agaagcattc 2340
caaatactta tagaaaacgc ctacccaccg ctatag 2376
<210> 28
<211> 578
<212> PRT
<213> Microbulbifer degradans
<400> 28
Met Asn Lys Val Lys Val Leu Ala Leu Cys Ala Ser Val Ala Val Met
1 5 10 15
Ile Gly Cys Ser Asp Ala Asp Thr Lys Leu Ala Asn Ser Ala Lys Ala
20 25 30
Glu Val Gly Phe Thr Lys Val Asn Gln Leu Gly Tyr Leu Pro Ala Ala
35 40 45
Lys Lys Leu Ala Val Val Pro Ala Val Ala Ala Ala Lys Phe Asp Ile
50 55 60
Ile Asp Val Thr Ser Gly Lys Val Ala Phe Thr Gly Ser Leu Ser Asp
65 70 75 80
Val Lys Ser Trp Ser Ala Met Gly Asp Glu Ser Phe Lys Leu Ala Asp
85 90 95
Phe Ser Ala Leu Gln Ala Glu Gly Ser Tyr Arg Leu Val Val Gln Gly
100 105 110
Val Ser Asp Ser Tyr Thr Phe Asp Ile Ser Pro Ser Val Tyr Ser Gln
115 120 125
Ala His Asp Gly Ala Leu Lys Ala Tyr Tyr Tyr Asn Arg Ala Ser Thr
130 135 140
Glu Leu Thr Glu Gln Tyr Ala Gly Val Tyr Ala Arg Pro Ala Gly His
145 150 155 160
Pro Asp Thr Asp Val Arg Ile Phe Asp Asn Ala Ala Ser Ala Ala Arg
165 170 175
Pro Ala Asp Thr Ser Phe Ala Ala Pro Lys Gly Trp Tyr Asp Ala Gly
180 185 190
Asp Tyr Gly Lys Tyr Ile Val Asn Ser Gly Ile Ser Thr Tyr Thr Leu
195 200 205
Met Ala Ala Tyr Glu His Phe Pro Ser Phe Tyr Lys Gln Arg Asp Ile
210 215 220
Asp Ile Pro Glu Ser Gly Asp Ala Val Pro Asp Ile Leu Asp Glu Val
225 230 235 240
Met Trp Asn Leu Glu Trp Met Gln Val Met Gln Asp Pro Asn Asp Gly
245 250 255
Gly Val Tyr His Lys Leu Thr Thr Leu Asn Phe Ser Gly Ala Val Met
260 265 270
Pro His Glu Ala Thr Ala Gln Arg Tyr Phe Ile Lys Lys Ser Thr Ala
275 280 285
Ala Thr Leu Asp Phe Ala Ala Val Met Ala Thr Ala Ser Arg Val Tyr
290 295 300
Ala Pro Phe Glu Gly Ala Phe Pro Gly Lys Ser Ala Ala Tyr Arg Gln
305 310 315 320
Ala Ala Ile Ala Ala Trp Glu Trp Ala Gln Ala Asn Pro Ser Glu Thr
325 330 335
Tyr Ser Gln Thr Pro Leu Ser Lys Val Gln Thr Gly Ala Tyr Gly Asp
340 345 350
Lys Lys Leu Asn Asp Glu Phe Ala Trp Ala Ala Ala Glu Leu Phe Ile
355 360 365
Leu Thr Gly Glu Gln Lys Tyr Trp Gln Ala Phe Asn Lys Gln Lys Val
370 375 380
Gln Ala Gly Glu Ser Ser Trp Ala Asn Val Ala Gly Leu Gly Phe Ile
385 390 395 400
Ser Leu Ala Asn Asn Ala Arg Ser Leu Leu Asn Glu Ala Gln Tyr Lys
405 410 415
Thr Val Thr Asp Ser Ile Val Arg Ala Ala Asp Ser Leu Leu Val Thr
420 425 430
Tyr Lys Glu Asn Ala Tyr Gln Val Pro Ile Gly Asn Lys Asp Phe Phe
435 440 445
Trp Gly Gly Asn Ser Gly Thr Leu Asn Arg Ala Trp Val Leu Leu Glu
450 455 460
Ala Asn Lys Ile Lys Pro Gln Gln Glu Tyr Ile Asp Ala Ala Leu Ala
465 470 475 480
Ala Val Asp Tyr Ile Tyr Gly Arg Asn Pro Thr Asn Tyr Ser Phe Val
485 490 495
Thr Gly Phe Gly Asp Asn Pro Ala Val Gly Ile His His Arg Pro Ser
500 505 510
Tyr Ala Asp Gly Ile Lys Ala Pro Val Pro Gly Trp Leu Ala Gly Gly
515 520 525
Ala His Asn Gly Lys Gln Asp Gly Cys Glu Tyr Pro Ser Asp Ala Pro
530 535 540
Ala Lys Ser Tyr Leu Asp Asp Trp Cys Ser Tyr Ser Thr Asn Glu Ile
545 550 555 560
Ala Ile Asn Trp Asn Ala Pro Leu Val Tyr Ile Leu Ala Ala Val Asn
565 570 575
Asn Leu
<210> 29
<211> 1737
<212> DNA
<213> Microbulbifer degradans
<400> 29
atgaacaaag ttaaagtttt agcgctgtgt gccagtgtgg ctgtaatgat aggttgcagt 60
gatgccgaca ctaaattagc taactcggcc aaggccgagg tgggctttac caaagtgaat 120
cagctgggtt atttgcccgc ggccaaaaag ctggcggtgg tacccgccgt tgcagctgca 180
aaattcgaca taatcgatgt aactagcggt aaagtagcgt ttacggggag tttaagcgac 240
gtaaaaagct ggagcgcgat gggggacgaa tctttcaagt tggcagactt tagcgccctg 300
caagccgaag ggagttaccg cttagttgtt cagggtgtga gtgattctta caccttcgat 360
attagcccaa gtgtatatag ccaagcgcac gatggagccc ttaaagccta ttactataat 420
cgagcgagca cagagttaac agaacagtac gccggggtgt atgcgcgacc tgcggggcac 480
ccagataccg acgtacgcat attcgataac gccgcctcag ccgcgcgccc agcagataca 540
agctttgctg caccaaaggg ttggtacgat gctggcgatt acggcaagta cattgttaac 600
agtggtattt ccacttacac cctaatggct gcgtacgagc atttcccgtc gttttacaag 660
caacgcgata tagatattcc cgaatctggc gatgccgtac cggatattct cgacgaggta 720
atgtggaacc ttgaatggat gcaggtcatg caagacccga acgacggcgg tgtgtaccac 780
aagcttacca ccctgaattt ttctggcgca gtcatgccgc acgaagcgac tgcgcagcgc 840
tattttatta aaaaatctac cgctgcaacg ctagattttg ccgcggttat ggccactgca 900
agccgagtat acgcaccgtt cgaaggtgct tttcctggta aatcagctgc ttatcgacag 960
gcggccattg ctgcgtggga gtgggcacaa gcaaacccta gtgagacata ttcgcagaca 1020
ccgctgagca aagttcaaac cggcgcctat ggtgataaaa agttaaacga tgaatttgcg 1080
tgggcggccg cagagttgtt tatattgacc ggcgagcaaa aatactggca ggcgtttaac 1140
aagcaaaaag tgcaggcggg tgagtctagc tgggcgaatg ttgcggggtt ggggtttatt 1200
tccttggcca ataatgcgcg cagcctgtta aacgaagctc aatacaaaac cgttaccgat 1260
tcaattgttc gcgctgcaga tagcttgctt gttacttaca aagagaatgc ctaccaagta 1320
cccattggca acaaagattt tttctggggt ggcaattccg gcacgttaaa tcgcgcttgg 1380
gttttgcttg aggccaataa aattaaaccg cagcaagaat acatcgatgc tgcacttgcc 1440
gcggtggatt atatttatgg tcgcaaccct accaactact cttttgtcac tgggtttggc 1500
gataaccctg cggtgggtat ccatcatcgt ccatcctatg ccgatggcat taaagcccct 1560
gtgcctggtt ggcttgcggg cggtgcgcac aatggcaagc aagatggttg tgagtaccct 1620
tccgatgcac cggcaaaatc ctatctagac gactggtgca gttactccac caacgaaatt 1680
gctattaatt ggaatgcgcc gttagtttac atactggctg cggtaaataa tttgtag 1737
<210> 30
<211> 867
<212> PRT
<213> Microbulbifer degradans
<400> 30
Met Asn Leu Thr Ser Ile Met Phe Glu Gln Ser Val Lys Lys Val Ala
1 5 10 15
Lys Ser Ala Ile Ala Val Ala Val Ala Ser Ala Val Thr Leu Ser Ala
20 25 30
Ala Gln Ala Glu Val Gly Asn Pro Arg Val Asn Gln Val Gly Tyr Ile
35 40 45
Pro Asn Gly Ala Lys Val Ala Ser Tyr Val Ala Pro Ser Asn Thr Ala
50 55 60
Gln Thr Trp Gln Leu Leu Arg Asn Gly Ser Val Val Ala Ser Gly Thr
65 70 75 80
Thr Thr Pro Lys Gly Thr Asp Ala Ala Ser Gly Asp Asn Ile His His
85 90 95
Ile Asp Phe Ser Ala Val Ser Ala Thr Gly Glu Gly Phe Ser Leu Leu
100 105 110
Val Gly Gly Asp Glu Ser Tyr Pro Phe Glu Ile Ser Ala Asp Ala Phe
115 120 125
Thr Pro Val Leu Tyr Asp Ser Ile Arg Tyr Phe Tyr His Asn Arg Ser
130 135 140
Gly Ile Ala Ile Glu Thr Gln Tyr Thr Gly Gly Gly Asn Gly Ser Tyr
145 150 155 160
Ala Ala Asn Ala Gln Trp Ala Arg Pro Ala Gly His Ile Asn Gln Asn
165 170 175
Ala Asn Gln Gly Asp Asn Ala Val Pro Cys Trp Ser Gly Ser Gly Cys
180 185 190
Asn Tyr Ala Leu Asp Val Thr Lys Gly Trp Tyr Asp Ala Gly Asp His
195 200 205
Gly Lys Tyr Val Val Asn Gly Gly Ile Ser Val Trp Lys Leu Leu Asn
210 215 220
Met Tyr Glu Arg Ala Leu His Ile Ser Gly Ser Gln Asn Lys Tyr Ala
225 230 235 240
Asp Gly Thr Leu Asn Ile Pro Glu Ser Gly Asn Gly Val Ala Asp Ile
245 250 255
Leu Asp Glu Ala Arg Trp Gln Met Glu Phe Leu Leu Ala Met Gln Val
260 265 270
Pro Glu Gly Glu Ala Lys Ala Gly Met Val His His Lys Met His Asp
275 280 285
Val Gly Trp Thr Gly Leu Pro Leu Ala Pro His Glu Asp Asn Arg Glu
290 295 300
Arg Ala Leu Val Pro Pro Ser Val Thr Ala Thr Leu Asn Val Ala Ala
305 310 315 320
Thr Gly Ala Gln Cys Ala Arg Leu Phe Asp Glu Ile Asp Ala Ser Phe
325 330 335
Ala Ala Ser Cys Leu Thr Ala Ala Glu Arg Ala Trp Asp Ala Ala Leu
340 345 350
Gln Asn Pro Asn Asp Val Tyr Thr Gly Gly Tyr Asp Asn Gly Gly Gly
355 360 365
Gly Tyr Gly Asp Glu Val Ala Asp Asp Glu Phe Phe Trp Ala Ala Ala
370 375 380
Glu Leu Tyr Ile Thr Thr Gly Asp Ser Lys Tyr Leu Ser Thr Ile Asn
385 390 395 400
Asn Tyr Asn Val Thr Arg Ile Asp Trp Gly Trp Pro Asp Thr Glu Leu
405 410 415
Pro Ala Leu Met Ser Leu Ala Val Val Pro Ala Asn His Thr Ala Asn
420 425 430
Leu Arg Ala Thr Ala Arg Ala Lys Ile Val Glu Ile Ala Asp Thr His
435 440 445
Val Ala Thr Ser Asn Ala Ala Gly Tyr Leu Thr Pro Ser Ser Ala Leu
450 455 460
Asp Tyr Tyr Trp Gly Ser Asn Asn Gly Val Ala Asn Lys Ile Ala Leu
465 470 475 480
Leu Gly Leu Ala Tyr Asp Phe Thr Gly Asp Asp Val Tyr Ala Lys Thr
485 490 495
Val Ser Lys Ala Val Asn Tyr Leu Phe Gly Asn Asn Thr Leu Ser Phe
500 505 510
Ser Tyr Ile Ser Gly His Gly Glu Asn Ala Leu Gln Gln Pro His His
515 520 525
Arg Phe Trp Ala Gly Ala Leu Asn Gly Ser Tyr Pro Trp Leu Pro Pro
530 535 540
Gly Ala Leu Ser Gly Gly Pro Asn Ala Gly Leu Glu Asp Gly Val Ala
545 550 555 560
Ala Ala Ala Leu Ser Ala Cys Val Ser Thr Pro Ala Lys Cys Tyr Met
565 570 575
Asp Asp Ile Glu Ser Trp Ser Thr Asn Glu Ile Thr Ile Asn Trp Asn
580 585 590
Gly Ala Leu Val Trp Ala Met Ala Phe Tyr Asp Asp Tyr Ala Asp Ser
595 600 605
Gly Ser Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
610 615 620
Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser
625 630 635 640
Ser Ser Ser Ser Gly Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser
645 650 655
Ser Ser Ser Ser Ser Ser Ser Gly Gly Glu Cys Val Glu Met Cys Lys
660 665 670
Trp Tyr Gln Asp Ala Pro Arg Pro Leu Cys Asn Asn Gln Asn Ser Gly
675 680 685
Trp Gly Trp Glu Asn Gln Gln Ser Cys Ile Gly Arg Thr Thr Cys Glu
690 695 700
Ser Gln Ser Gly Asn Gly Gly Val Ile Asn Ser Cys Gly Thr Ser Ser
705 710 715 720
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
725 730 735
Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser
740 745 750
Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Gly Gly
755 760 765
Val Ala Gly Val Ala Cys Ala Val Thr Lys Met Asn His Trp Gly Ser
770 775 780
Gly Tyr Gln Leu Asp Val Thr Val Ser Asn Asn Gly Ala Ala Ala Val
785 790 795 800
Ser Gly Trp Ser Ile Glu Leu Asp Phe Gly Glu Ser Pro Gln Leu Thr
805 810 815
Gly Ser Trp Asn Ala Ala Val Ser Ala Ser Gly Asn Thr Val Ser Ala
820 825 830
Thr Asn Ile Ser Trp Asn Gly Asn Leu Ser Ala Gly Gln Ser Thr Ser
835 840 845
Phe Gly Met Gln Gly Asn Ser Asp Gly Ser Leu Ser Thr Pro Ser Cys
850 855 860
Leu Val Lys
865
<210> 31
<211> 49
<212> PRT
<213> Microbulbifer degradans
<400> 31
Met Glu Glu Leu Leu Glu Glu Leu Glu Pro Leu Asp Glu Glu Leu Leu
1 5 10 15
Leu Glu Asp Asp Glu Leu Glu Val Glu Leu Glu Glu Leu Asp Glu Glu
20 25 30
Leu Asp Glu Leu Glu Glu Leu Asp Glu Leu Glu Pro Leu Pro Glu Ser
35 40 45
Ala
<210> 32
<211> 2604
<212> DNA
<213> Microbulbifer degradans
<400> 32
atgaatctta cttcaatcat gtttgaacaa tcagtaaaaa aagtcgctaa gtcagccatt 60
gccgtggcag ttgcttcggc ggttacctta agtgcggcgc aggccgaggt gggtaaccca 120
cgtgttaacc aagtaggcta tatacccaat ggtgccaaag ttgccagtta tgttgcgcca 180
tcaaatacgg cacaaacgtg gcagttactg cgtaatggca gtgtggttgc aagtggcact 240
acaaccccaa agggtacaga tgcagcctcg ggtgacaata ttcaccatat cgatttttct 300
gcggtgagtg caaccggcga aggttttagt ttgcttgtgg gcggcgatga aagttacccc 360
tttgaaattt ctgccgacgc atttacaccg gttttatacg attccatccg ttacttttat 420
cacaaccgtt cgggtatcgc gattgaaacg cagtacaccg gtggcggtaa cggtagctac 480
gcggcgaatg ctcagtgggc taggcccgca ggtcacatta atcaaaatgc taaccaaggc 540
gataatgcgg tgccgtgttg gtcgggcagt ggttgcaact acgccttaga cgtaactaaa 600
ggttggtacg atgccggtga ccacggtaaa tatgttgtaa acggtggcat ttccgtatgg 660
aagctattaa acatgtacga gcgtgccttg cacattagtg gcagccaaaa taaatacgcc 720
gacggtacat taaatattcc tgaaagcggc aatggcgtgg cggatatttt ggatgaagct 780
cgctggcaaa tggagttttt attagccatg caagtgccag agggcgaagc gaaagctggc 840
atggtgcacc acaaaatgca cgatgtgggt tggacaggct tgccactagc accccatgaa 900
gataatcgcg agcgcgcgct tgtgccgcct tcggttactg caacccttaa cgttgcggcc 960
acaggcgcgc agtgtgcgcg tttatttgac gaaatagatg cgagttttgc agcaagttgt 1020
ttaactgccg cagagcgcgc atgggatgca gccctgcaaa accctaacga tgtttacact 1080
ggcggctacg ataatggcgg cggtggttac ggcgatgaag tggcggacga cgagtttttc 1140
tgggctgctg ctgagttata cattaccact ggcgatagca aatatctttc aaccattaac 1200
aactacaatg taacgcgcat tgattggggc tggccagata ccgagttgcc tgcgttgatg 1260
tcgttagcgg ttgtgcctgc taatcacacc gcaaatttgc gtgcgactgc tcgtgcaaaa 1320
attgtagaaa ttgcagatac ccatgtcgct accagtaatg ctgccggcta tttaacacca 1380
tcgtccgcgc tggattacta ctggggttct aacaatggcg tagccaataa aattgcgtta 1440
cttggtttgg catacgattt tactggcgat gacgtttacg cgaaaacggt gtcgaaagca 1500
gttaactatt tatttggtaa taatacctta tcgttttctt atatttctgg gcatggcgaa 1560
aatgctttgc aacagccgca tcaccgcttt tgggctgggg cattaaatgg aagttaccca 1620
tggttgccgc ctggtgcgct ttctggtggc cctaacgcag ggttagaaga tggcgttgcc 1680
gccgccgcgc taagtgcttg tgtttcaacg cctgccaaat gctatatgga tgatattgaa 1740
tcttggtcga ccaacgaaat tactattaac tggaatggtg cattggtttg ggcaatggcg 1800
ttttatgatg actacgccga ttcgggtagc ggttctagct cgtcaagttc ttctagctca 1860
tctagctctt cgtcaagttc ttccagttcg acttctagct cgtcgtcttc tagtagtagc 1920
tcttcgtcga gcggctcgag ttcttctagc agctcttcca catccagttc cagctcttcg 1980
agttcatcgg gtggggagtg tgtagaaatg tgtaagtggt atcaagatgc accgcgccct 2040
ctatgcaata accaaaacag cggttgggga tgggagaacc agcagagttg tattggtaga 2100
acaacttgcg aaagtcaaag tggcaatggt ggagtgatta attcgtgcgg cacgtctagc 2160
tcgagctctt catctagctc tagcagtagc tcttcgagtt catccagctc ttctagcagt 2220
tcttccacat caagctcgtc gagtagttcg tcttctagct cttctagttc gacttcaagt 2280
tcttcgtcga gcagttcagg gggcgttgca ggtgtggctt gtgcggtaac caaaatgaac 2340
cattggggca gcggatatca attagatgta acagtttcta ataatggtgc tgcagcggta 2400
agtggttgga gtattgaact cgattttggt gaatcgccac agcttactgg tagttggaat 2460
gctgctgtat cggcatctgg taatactgta tcggctacta acattagttg gaacggtaat 2520
ttaagcgctg ggcaatctac ctcttttggt atgcagggta attcagatgg ttcgctgagc 2580
acgccaagct gtttagttaa gtaa 2604
<210> 33
<211> 1072
<212> PRT
<213> Microbulbifer degradans
<400> 33
Met Lys Asn Thr Leu Ser Phe Lys Thr Ser Leu Leu Ala Gly Leu Val
1 5 10 15
Ala Ser Ser Leu Leu Val Ala Ala Cys Gln Gly Val Lys Gln Gln Thr
20 25 30
Glu Ala Thr Gln Thr Lys His Asn Ile Thr Leu Trp Pro Gln Ala Ser
35 40 45
Ser Pro Val Ile Lys Ser Pro Asp Tyr Glu Ala Glu Val Glu Ala Lys
50 55 60
Val Glu Ala Leu Leu Gly Gln Met Thr Leu Glu Gln Lys Val Gly Gln
65 70 75 80
Ile Leu Gln Pro Glu Ile Gln Ser Ile Lys Pro His Glu Val Lys Glu
85 90 95
Tyr His Ile Gly Ser Val Leu Asn Gly Gly Gly Ser Met Pro Asn Arg
100 105 110
Ile Glu Asn Ala Pro Pro Ile Glu Trp Val Lys Leu Ala Asp Ala Phe
115 120 125
Tyr Asp Ala Ser Met Asp Asp Ser Asp Gly Gly Ile Ala Ile Pro Ile
130 135 140
Ile Trp Gly Thr Asp Ala Val His Gly His Gly Asn Val Thr Gly Ala
145 150 155 160
Thr Ile Phe Pro His Asn Ile Gly Leu Gly Ala Ala Arg Asn Pro Ala
165 170 175
Leu Ile Glu Lys Ile Gly Glu Ile Thr Ala Lys Glu Val Arg Ala Thr
180 185 190
Gly Ile Glu Trp Ile Phe Gly Pro Thr Leu Ala Val Ala Gln Asn Asp
195 200 205
Leu Trp Gly Arg Thr Tyr Glu Ser Tyr Ser Glu Asp Pro Ala Ile Val
210 215 220
Ala Asp Tyr Ala Ser Ala Met Val Val Gly Met Gln Gly Lys Val Asp
225 230 235 240
Asp Ser Asp Phe Leu Ser Thr Asn Arg Val Val Ala Thr Ala Lys His
245 250 255
Phe Leu Ala Asp Gly Gly Thr Leu Gly Gly Asn Asp Gln Gly Asp Ala
260 265 270
Arg Ile Ser Glu Glu Glu Leu Val Gln Ile His Asn Ala Gly Tyr Val
275 280 285
Pro Ala Ile Glu Ser Gly Val Gln Thr Val Met Ala Ser Phe Ser Leu
290 295 300
Trp Asn Gly Val Lys Met His Gly Asn Asn Tyr Leu Leu Thr Gln Ala
305 310 315 320
Leu Lys Glu Arg Met Gly Phe Asp Gly Phe Ile Val Gly Asp Trp Asn
325 330 335
Gly His Gly Gln Val Pro Gly Cys Thr Asn Glu Ser Cys Pro Gln Ser
340 345 350
Leu Asn Ala Gly Leu Asp Met Tyr Met Val Pro Tyr Asp Trp Lys Lys
355 360 365
Leu Tyr Arg Asn Leu Ile Ser Gln Val Gln Ser Gly Glu Ile Ala Pro
370 375 380
Ser Arg Leu Asp Asp Ala Val Arg Arg Ile Leu Arg Val Lys Ile Arg
385 390 395 400
Ala Asn Leu Trp Ala Ala Lys Pro Ser Glu Arg Ile Asn Leu Ala Thr
405 410 415
Ile Asp Glu Val Val Gly His Ala Asn His Arg Glu Val Ala Arg Gln
420 425 430
Ala Val Arg Glu Ser Leu Val Leu Leu Lys Asn Lys Asn Ser Val Leu
435 440 445
Pro Ile Ala Ala Asn Lys Thr Val Leu Val Ala Gly Asp Gly Ala Asp
450 455 460
Asn Ile Gly Lys Gln Ser Gly Gly Trp Ser Val Ser Trp Gln Gly Thr
465 470 475 480
Gly Asn Thr Asn Ala Ser Phe Pro Gly Gly Thr Ser Ile Tyr Lys Gly
485 490 495
Ile Ala Asp Ala Val Thr Gln Gly Gly Gly Lys Ala Thr Leu Ser Val
500 505 510
Asp Gly Ser Tyr Lys Thr Lys Pro Asp Val Ala Ile Val Val Ile Gly
515 520 525
Glu Asp Pro Tyr Ala Glu Gly Gln Gly Asp Arg Asn Ser Leu Glu Phe
530 535 540
Glu Pro Val Asn Lys Lys Ser Leu Glu Leu Leu Lys Lys Leu Lys Ala
545 550 555 560
Asp Gly Ile Pro Val Val Thr Val Phe Ile Ser Gly Arg Pro Met Trp
565 570 575
Ala Asn Pro Glu Ile Asn Ala Ser Asp Ala Phe Val Ala Ala Trp Leu
580 585 590
Pro Gly Ser Glu Gly Gln Gly Val Ala Asp Val Leu Ile Gly Asn Ala
595 600 605
Asn Gly Lys Pro Arg Phe Asp Phe Lys Gly Thr Leu Ser Phe Ser Trp
610 615 620
Pro Lys Leu Pro Thr Gln Gly Leu Leu Asn Pro Thr His Pro Asn Tyr
625 630 635 640
Asp Pro Leu Phe Lys Leu Gly Tyr Gly Leu Thr Tyr Ala Ser Ser Glu
645 650 655
Thr Gly Pro Glu Gln Leu Ala Glu Asp Val Glu Gly Val Asp Lys Gly
660 665 670
Ser Thr Gly Asp Ile Asn Phe Tyr Val Gly Arg Thr Leu Glu Pro Trp
675 680 685
Glu Val Phe Val Arg Thr Pro Glu Ser Ser Gln Arg Leu Ser Gly Pro
690 695 700
Phe Ala Asp Leu Gly Asn Ala Ser Val Arg Thr Ser Asp Met Gln Val
705 710 715 720
Gln Glu Asp Ala Leu Thr Phe Thr Trp Gly Gly Ser Trp Met Ser Ile
725 730 735
Leu Gly Ile Glu Gly Gly Arg Gly Tyr Asp Leu Ser Ser Gln Tyr Lys
740 745 750
Glu Gly Gly Val Ile Ser Phe Asn Phe Asn Ser Ile Asp Met Ala Lys
755 760 765
Gly Asp Leu Lys Val Gln Met Ala Cys Gly Glu Gly Cys Thr Arg Glu
770 775 780
Val Asp Ile Thr Thr Ile Ala Arg Asp Leu Glu Gly Lys Gly Trp Gln
785 790 795 800
Ser Leu Thr Val Pro Leu Ala Cys Phe Ala His Glu Gly Asp Asp Phe
805 810 815
Thr His Ile Thr Ala Pro Phe Asn Leu Phe Ala Gly Gly Lys Gly Gln
820 825 830
Val Ala Val Ala Asn Ile Arg Ile Leu Arg Ala Gly Thr Gln Thr Val
835 840 845
Pro Cys Val Leu Pro Lys Asp Val Ser Val Thr Pro Glu Pro Leu Asn
850 855 860
Ala Ser Trp Ala Ile Asp Trp Trp Met Pro Arg His Lys Glu Lys Leu
865 870 875 880
Ala Arg Ile Gln Gln Gly Asn Val Asp Leu Leu Met Ile Gly Asp Ser
885 890 895
Ile Thr His Gly Trp Glu Asp Ala Gly Lys Asp Val Trp Ala Gln Tyr
900 905 910
Tyr Ala His Arg Asn Ala Val Asp Leu Gly Phe Ser Gly Asp Arg Thr
915 920 925
Glu Asn Val Leu Trp Arg Leu Gln His Gly Glu Ala Asp Gly Ile Lys
930 935 940
Pro Lys Val Ala Val Val Met Ile Gly Thr Asn Asn Ala Gly His Arg
945 950 955 960
His Glu Pro Ser His Tyr Thr Ala Lys Gly Val Ala Ala Val Val Ala
965 970 975
Glu Leu Gln Lys Arg Leu Pro Glu Thr Lys Ile Leu Leu Leu Gly Ile
980 985 990
Phe Pro Arg Gly Glu Thr Ser Glu Asp Pro Leu Arg Val Leu Asn Ala
995 1000 1005
Lys Thr Asn Thr Leu Leu Ala Lys Met Ala Asp Gly Glu Lys Val Val
1010 1015 1020
Tyr Leu Asn Ile Asn Lys Thr Phe Leu Asp Glu Asn Gly Val Leu Pro
1025 1030 1035 1040
Lys Asp Ile Met Pro Asp Leu Leu His Pro Asn Glu Lys Gly Tyr Ala
1045 1050 1055
Leu Trp Ala Lys Ala Met Glu Pro Thr Leu Lys Lys Met Leu Gly Glu
1060 1065 1070
<210> 34
<211> 3219
<212> DNA
<213> Microbulbifer degradans
<400> 34
atgaaaaata ctttatcctt taaaacatcc ttgcttgcgg gcttggtggc atccagttta 60
ctggttgcgg cctgtcaggg tgttaaacag caaacggaag ctactcagac aaagcacaat 120
attaccttat ggccgcaggc gtctagccct gtaataaagt cgccagatta cgaagcggaa 180
gtggaagcca aggtagaagc gttgttagga caaatgacgc tagagcaaaa agtagggcaa 240
atcctacagc cagaaattca atctattaag ccgcatgaag taaaagaata ccacattggc 300
tctgtactaa atggtggtgg ctctatgcct aaccgcatag aaaatgcgcc gcccattgaa 360
tgggtaaaat tggccgatgc cttttacgat gcctctatgg acgattctga cggtggaatc 420
gcaattccca ttatttgggg taccgatgcc gtacacggtc acggcaatgt aactggcgca 480
accatattcc cgcataacat aggccttggt gctgcacgca acccagcgct tatcgaaaaa 540
attggcgaaa taacggcaaa agaagtacgc gcaaccggca ttgaatggat atttggccca 600
actttggccg tagcgcaaaa cgatttatgg ggccgcactt acgaaagcta ctcggaagac 660
ccagccatag tggccgacta cgccagtgcc atggtggtag gtatgcaggg caaagtggac 720
gacagcgatt ttctgtccac taatcgcgta gttgccacag caaagcactt tttagctgac 780
ggcggtacct taggaggcaa cgatcaaggt gatgcgcgca taagcgaaga agagttggtg 840
caaattcata atgcgggcta tgtgcctgcc attgaatcgg gcgtgcaaac ggttatggcc 900
agtttctctt tgtggaatgg cgtaaaaatg catggtaaca actacctact tacccaagca 960
cttaaagagc gtatggggtt tgatggtttt atagtagggg attggaatgg ccacgggcag 1020
gtacctgggt gcaccaacga atcttgccct caatcgctaa acgccggttt agatatgtac 1080
atggtgcctt acgattggaa aaaactgtac agaaacttaa ttagccaagt gcaatcgggt 1140
gaaattgccc caagccgttt agatgacgct gtacgccgta ttcttcgggt aaaaattcgc 1200
gctaatttgt gggctgcgaa accttcagag cgaattaatc tagccactat tgacgaggtg 1260
gttggccacg caaaccaccg tgaggtagcg cggcaggcgg tgcgagaaag tttagtattg 1320
ttaaaaaata aaaatagcgt actgcctatt gctgccaata aaaccgtgct ggttgcaggt 1380
gacggcgccg ataatattgg caaacaatct ggcggttgga gtgtaagctg gcagggcact 1440
ggtaacacca atgcatcctt ccccggtggt acatctattt ataaaggtat tgccgatgca 1500
gtcactcagg gcggcggtaa agctacgctt tctgtggatg gcagctacaa aactaaaccc 1560
gatgttgcca ttgtggtaat aggcgaagac ccttacgccg aaggccaagg cgaccgcaat 1620
agtttagagt tcgagccggt gaataaaaaa tcgcttgagc tattaaaaaa attaaaagca 1680
gatggcatac ccgttgtaac agtatttatt tctggccgac ctatgtgggc taacccagaa 1740
attaacgcgt ctgatgcatt tgttgccgcg tggttacctg gctctgaagg gcagggcgta 1800
gcagatgtac ttataggcaa cgccaacggc aagcctcgtt ttgatttcaa gggcaccttg 1860
tcgttctctt ggcctaagct gccgacccaa ggcttgctca acccaacgca ccccaactac 1920
gacccgttat ttaaattggg atacgggcta acttatgcct cgagtgaaac tggcccagag 1980
caattggcgg aagatgttga aggtgtagat aaaggctcaa ccggcgacat taatttttat 2040
gttggccgca cattagagcc gtgggaagtg tttgttcgaa ctcctgaaag ttcgcagcgt 2100
ttaagtggcc catttgcaga cttaggcaat gccagtgtgc gtaccagtga tatgcaggta 2160
caagaagatg cccttacttt tacttggggc ggtagctgga tgtctattct gggaatagaa 2220
ggagggcgcg gttacgacct ttcttcgcaa tataaagaag gcggagtaat aagctttaac 2280
ttcaattcaa tagatatggc taaaggcgat ttaaaagtac aaatggcctg tggtgaaggt 2340
tgcacgcgtg aagtagatat cacaactatc gcacgcgact tggaaggcaa aggctggcag 2400
tcgttaacag tgcccttagc gtgctttgca cacgaaggcg acgatttcac ccatattact 2460
gcgccgttta acttatttgc cggtggaaaa ggtcaagttg ctgtagccaa cattcgcata 2520
ctgcgcgccg gtacacaaac cgtgccgtgt gtattgccta aagatgtttc cgtaacgcca 2580
gagccgctga atgctagctg ggcgatagat tggtggatgc cgcgccacaa agaaaaactg 2640
gcgcgtatcc agcaaggtaa tgtggattta ctaatgattg gcgattccat tacccacggc 2700
tgggaagatg caggtaaaga cgtgtgggcg caatattacg cgcaccgcaa tgcagtggac 2760
ttaggcttta gtggcgaccg aaccgaaaac gtattgtggc gcttacagca cggcgaagca 2820
gacggtatta agcctaaagt ggcagtggtt atgattggta ccaacaatgc cggccatcgt 2880
cacgagcctt cgcactacac agccaagggt gttgcggctg tcgttgctga attgcaaaaa 2940
cgattgcctg aaacaaagat attattactg ggtatattcc ctcgcggcga aaccagtgaa 3000
gaccctttgc gggtattaaa tgccaaaacc aatactcttt tggcgaaaat ggccgacgga 3060
gagaaggtgg tgtatttgaa tatcaataaa acgtttttag atgaaaacgg cgtattgcct 3120
aaagatataa tgcccgacct attgcacccc aatgaaaagg ggtacgcatt gtgggcgaaa 3180
gcgatggaac ccacccttaa aaaaatgctg ggcgaatag 3219
<210> 35
<211> 862
<212> PRT
<213> Microbulbifer degradans
<400> 35
Met Leu Lys Lys Ile Asn Lys Lys Gly Leu Ala Leu Ser Leu Ala Ile
1 5 10 15
Ala Ala Met Leu Ser Gly Cys Asn Glu Gly Asp Ser Asn Lys Thr Lys
20 25 30
Pro Ser Ala Glu Thr Leu Ser Ala Thr Gln Ala Ser Asn Thr Val Ala
35 40 45
Asn Pro Ser Ile Trp Pro Lys Val Thr Ser Lys Val Ala Lys Asp Ala
50 55 60
Lys Met Glu Ala Asp Ile Ser Ala Ile Leu Ser Gly Met Thr Leu Glu
65 70 75 80
Gln Lys Val Ala Gln Met Ile Gln Pro Glu Ile Arg Ala Phe Ser Lys
85 90 95
Glu Asp Met Lys Lys Tyr Gly Phe Gly Ser Tyr Leu Asn Gly Gly Gly
100 105 110
Ala Phe Pro Asn Asp Asn Lys His Ser Thr Met Ala Asp Trp Val Ala
115 120 125
Leu Ala Asp Asp Met Tyr Glu Ala Ser Ile Asp Asp Ser Ile Asp Gly
130 135 140
Ser Thr Ile Pro Thr Met Trp Gly Thr Asp Ala Val His Gly His Asn
145 150 155 160
Asn Val Val Lys Ala Thr Ile Phe Pro His Asn Ile Gly Leu Gly Ala
165 170 175
Met His Asn Pro Lys Leu Met Gln Gln Ile Gly Ala Ala Thr Ala Lys
180 185 190
Val Val Gln Val Thr Gly Ile Asp Trp Val Phe Ala Pro Thr Val Ala
195 200 205
Val Val Arg Asp Asp Arg Trp Gly Arg Thr Tyr Glu Gly Tyr Ser Glu
210 215 220
Asp Pro Ala Ile Val Lys Glu Tyr Ala Arg Ala Met Val Ile Gly Met
225 230 235 240
Gln Gly Glu Ala Asn Ser Glu Ala Phe Met Gly Asp Gly Thr Val Ile
245 250 255
Ala Thr Ala Lys His Phe Leu Gly Asp Gly Gly Thr Asp Lys Gly Asp
260 265 270
Asp Gln Gly Asn Asn Leu Ser Thr Glu Gln Glu Leu Ile Asp Ile His
275 280 285
Ala Gln Gly Tyr Ile Ser Ala Ile Glu Glu Gly Val Gln Thr Ile Met
290 295 300
Ala Ser Phe Asn Ser Trp Asn Gly Glu Lys Met His Gly Asn Lys Ser
305 310 315 320
Leu Leu Thr Asp Val Leu Lys Lys Gln Met Gly Phe Asp Gly Leu Val
325 330 335
Val Gly Asp Trp Asp Gly His Gly Gln Val Lys Gly Cys Ser Asn Ala
340 345 350
Ser Cys Ala Gln Ala Ile Asn Ala Gly Val Asp Ile Ile Met Val Pro
355 360 365
Asn Glu Trp Lys Pro Met Phe Glu Asn Thr Val Ala Gln Val Lys Ser
370 375 380
Gly Glu Ile Ser Glu Ala Arg Ile Asn Asp Ala Val Thr Arg Ile Leu
385 390 395 400
Arg Val Lys Met Arg Ala Gly Ile Phe Asp Gly Val Lys Pro Ser Asp
405 410 415
Arg Ala Phe Ala Ala Glu Glu Lys Tyr Leu Gly Ser Ala Glu Asn Arg
420 425 430
Ala Ile Ala Arg Gln Ala Val Arg Glu Ser Leu Val Leu Leu Lys Asn
435 440 445
Gln Asn Lys Leu Leu Pro Leu Asp Arg Lys Met Asn Val Leu Met Ala
450 455 460
Gly Ser Gly Ala Asp Asn Ile Gly Lys Gln Ser Gly Gly Trp Thr Leu
465 470 475 480
Ser Trp Gln Gly Thr Gly Asn Val Asn Ser Asp Phe Pro Gly Ala Thr
485 490 495
Ser Ile Tyr Asp Gly Val Asn Gln Val Val Ser Ser Ala Gly Gly Lys
500 505 510
Val Glu Leu Ser Glu Asn Gly Asn Tyr Gln Ala Lys Pro Asp Val Ala
515 520 525
Ile Val Val Phe Gly Glu Asn Pro Tyr Ala Glu Gly Val Gly Asp Ile
530 535 540
Glu Gly Ile Glu Tyr Gln Leu Asn Asn Lys Arg Asp Ile Asn Leu Leu
545 550 555 560
Gln Lys Leu Lys Ala Asp Gly Ile Pro Val Val Ser Val Phe Leu Thr
565 570 575
Gly Arg Pro Leu Trp Val Asn Lys Glu Leu Asn Ala Ser Asp Ala Phe
580 585 590
Val Ala Ala Trp Leu Pro Gly Ser Glu Gly Val Gly Val Ser Asp Val
595 600 605
Leu Phe Lys Lys Ala Asp Gly Ser Ile Asn Tyr Asp Phe Lys Gly Lys
610 615 620
Leu Thr Tyr Ser Trp Pro Lys Tyr Asp Asp Gln Val Val Ile Asn Lys
625 630 635 640
Gly Asp Lys Asp Tyr Ala Pro Leu Tyr Pro Tyr Gly Tyr Gly Leu Thr
645 650 655
Tyr Ser Asp Val Asp Thr Gln Gly Asp Asp Leu Pro Glu Glu Thr Lys
660 665 670
Val Lys Ile Gly Arg Ala Asp Asp Glu Pro Met Ala Ile Phe Asp Ser
675 680 685
Leu Pro Gln Ser Asp Leu Gly Phe Phe Leu Gly Asp Lys Ala Asn Trp
690 695 700
Val Val Pro Ile Ala Thr Ser Val Val Thr Thr His Asn Ser Asp Asn
705 710 715 720
Leu Thr Met Arg Thr Tyr Asn Trp Lys Val Gln Glu Asp Ala Arg Gln
725 730 735
Leu Ile Trp Lys Gly Asp Ser Lys Ala Asn Ala Phe Phe Ala Trp Pro
740 745 750
Asp Pro His Asn Met Gln Gly Met Leu Glu His Lys Ala Ala Tyr Ser
755 760 765
Phe Ser Ile Lys Val Asp Lys Ala Pro Ala Gly Asp Leu Thr Leu Gly
770 775 780
Ile His Cys Met Glu Glu Cys Gly Lys Lys Leu Val Leu Asn Glu Ala
785 790 795 800
Leu Ser Lys Ile Pro Ala Gly Glu Trp Gly Glu Leu Thr Ile Asp Leu
805 810 815
Ala Cys Ile Ala Asp Ala Glu Ala Leu Ala Glu Val Arg Ser Pro Phe
820 825 830
Met Leu Ser Thr Asp Ala Pro Ala Ser Ile Val Phe Gly Asp Val Lys
835 840 845
Leu Val Pro Gly Gly Ala Asp Ser Ala Ala Ile Lys Cys Asp
850 855 860
<210> 36
<211> 2540
<212> DNA
<213> Microbulbifer degradans
<400> 36
atgctcaaaa agataaacaa gaaaggtctt gctttaagct tagcaattgc agcaatgcta 60
agcggctgca acgaaggcga cagcaacaaa accaaaccaa gtgcggaaac cctctccgct 120
actcaagcca gtaacactgt agccaacccc agcatttggc ccaaggtaac tagcaaggtt 180
gccaaagacg ccaaaatgga agcagatata agcgcaatac tcagcggtat gacccttgag 240
caaaaagtag cccaaatgat ccaacccgaa attcgtgcct tcagcaaaga agacatgaaa 300
aagtatggtt ttggctccta ccttaacggt ggcggcgcat tccctaacga caacaaacat 360
tccaccatgg ccgactgggt tgccctagcc gacgacatgt atgaagcctc tatagacgac 420
agcatagacg gcagcactat tccaaccatg tggggtaccg atgcagtaca cggccacaac 480
aacgtggtta aagcgactat tttcccacac aacattggcc ttggcgccat gcataacccc 540
aagctcatgc agcaaatagg cgctgccacg gctaaagtgg tacaagttac tggtatcgac 600
tgggtatttg cgcccactgt tgcggtagtg cgcgacgacc gctggggccg tacttacgag 660
ggctactctg aagaccccgc catagtaaaa gaatacgctc gcgccatggt tattggcatg 720
cagggcgaag ccaatagcga agcgtttatg ggtgacggca ctgttatagc caccgccaaa 780
cactttttgg gcgatggcgg caccgacaaa ggcgacgacc aaggcaacaa cttatccacc 840
gaacaagaat taattgatat tcacgcccaa ggctatataa gcgccattga agaaggtgtg 900
caaactatca tggcatcttt caatagctgg aatggcgaaa agatgcacgg caataaatct 960
ctgcttaccg atgtccttaa aaagcaaatg ggctttgacg gtttggtggt tggcgattgg 1020
gatggccacg gccaagtaaa aggttgctct aatgcaagct gtgcccaagc catcaacgcc 1080
ggtgtcgata tcatcatggt acccaatgag tggaaaccca tgttcgaaaa caccgttgca 1140
caagttaaaa gcggcgaaat ctctgaagcg cgaattaacg atgcagttac ccgtatttta 1200
cgtgtaaaaa tgcgcgctgg tattttcgac ggtgttaaac catcggatcg cgccttcgca 1260
gcagaagaaa aatacctagg ctctgccgaa aaccgcgcta tcgctcgtca agctgtacgc 1320
gaatcgttag tgttgcttaa aaaccaaaac aaactgctgc cattagaccg caaaatgaac 1380
gttttaatgg cgggttctgg cgcagacaac atcggcaagc aaagtggtgg ttggacatta 1440
agctggcagg gtactggcaa cgtgaacagc gacttccctg gcgcaacatc tatttacgac 1500
ggcgttaacc aagtagtgag cagcgctggc ggtaaagtag agctaagcga aaacggcaac 1560
taccaagcca aaccagatgt agcgattgta gtatttggtg aaaaccctta cgcagaaggc 1620
gtaggcgata ttgaaggtat tgaataccaa ctaaacaata agcgcgatat caatttgtta 1680
caaaaactca aagccgatgg cattcctgtt gtatcggtat tcttaaccgg tcgtccactt 1740
tgggtaaaca aagagcttaa tgcctccgat gcttttgttg cagcttggct gccaggctct 1800
gaaggtgtag gcgtttctga tgtgctattc aaaaaagccg acggtagtat taactacgac 1860
tttaaaggca agctaactta ctcttggcca aagtatgatg accaagtagt aataaacaaa 1920
ggcgacaaag attacgcccc gctttaccct tatggttacg gcttaaccta cagcgatgtt 1980
gacacccaag gtgacgactt acctgaagaa accaaagtta aaattggccg cgctgacgac 2040
gagccaatgg ccatcttcga cagcctaccc caaagcgacc tcggcttctt ccttggcgac 2100
aaagccaact gggtagtacc tattgcaaca agtgtagtta caacgcacaa cagcgataac 2160
ctaaccatgc gcacctacaa ctggaaagta caagaagatg ctcgccagtt aatttggaaa 2220
ggcgacagca aagccaatgc cttctttgca tggccagacc cacacaatat gcaaggcatg 2280
ttagaacaca aagcggctta cagctttagc attaaagtag ataaagcacc cgctggcgac 2340
ctaacactag gcatacactg catggaagaa tgcggtaaaa aacttgtgct taacgaagcg 2400
cttagcaaaa ttcctgctgg tgagtgggga gagctaacaa tagatctagc ttgcatagca 2460
gatgccgaag ccttggccga agttcgctca cccttcatgc taagcaccga tgcacccgca 2520
tctatcgtgt ttggcgatgt 2540
<210> 37
<211> 862
<212> PRT
<213> Microbulbifer degradans
<400> 37
Met Leu Lys Lys Ile Asn Lys Lys Gly Leu Ala Leu Ser Leu Ala Ile
1 5 10 15
Ala Ala Met Leu Ser Gly Cys Asn Glu Gly Asp Ser Asn Lys Thr Lys
20 25 30
Pro Ser Ala Glu Thr Leu Ser Ala Thr Gln Ala Ser Asn Thr Val Ala
35 40 45
Asn Pro Ser Ile Trp Pro Lys Val Thr Ser Lys Val Ala Lys Asp Ala
50 55 60
Lys Met Glu Ala Asp Ile Ser Ala Ile Leu Ser Gly Met Thr Leu Glu
65 70 75 80
Gln Lys Val Ala Gln Met Ile Gln Pro Glu Ile Arg Ala Phe Ser Lys
85 90 95
Glu Asp Met Lys Lys Tyr Gly Phe Gly Ser Tyr Leu Asn Gly Gly Gly
100 105 110
Ala Phe Pro Asn Asp Asn Lys His Ser Thr Met Ala Asp Trp Val Ala
115 120 125
Leu Ala Asp Asp Met Tyr Glu Ala Ser Ile Asp Asp Ser Ile Asp Gly
130 135 140
Ser Thr Ile Pro Thr Met Trp Gly Thr Asp Ala Val His Gly His Asn
145 150 155 160
Asn Val Val Lys Ala Thr Ile Phe Pro His Asn Ile Gly Leu Gly Ala
165 170 175
Met His Asn Pro Lys Leu Met Gln Gln Ile Gly Ala Ala Thr Ala Lys
180 185 190
Val Val Gln Val Thr Gly Ile Asp Trp Val Phe Ala Pro Thr Val Ala
195 200 205
Val Val Arg Asp Asp Arg Trp Gly Arg Thr Tyr Glu Gly Tyr Ser Glu
210 215 220
Asp Pro Ala Ile Val Lys Glu Tyr Ala Arg Ala Met Val Ile Gly Met
225 230 235 240
Gln Gly Glu Ala Asn Ser Glu Ala Phe Met Gly Asp Gly Thr Val Ile
245 250 255
Ala Thr Ala Lys His Phe Leu Gly Asp Gly Gly Thr Asp Lys Gly Asp
260 265 270
Asp Gln Gly Asn Asn Leu Ser Thr Glu Gln Glu Leu Ile Asp Ile His
275 280 285
Ala Gln Gly Tyr Ile Ser Ala Ile Glu Glu Gly Val Gln Thr Ile Met
290 295 300
Ala Ser Phe Asn Ser Trp Asn Gly Glu Lys Met His Gly Asn Lys Ser
305 310 315 320
Leu Leu Thr Asp Val Leu Lys Lys Gln Met Gly Phe Asp Gly Leu Val
325 330 335
Val Gly Asp Trp Asp Gly His Gly Gln Val Lys Gly Cys Ser Asn Ala
340 345 350
Ser Cys Ala Gln Ala Ile Asn Ala Gly Val Asp Ile Ile Met Val Pro
355 360 365
Asn Glu Trp Lys Pro Met Phe Glu Asn Thr Val Ala Gln Val Lys Ser
370 375 380
Gly Glu Ile Ser Glu Ala Arg Ile Asn Asp Ala Val Thr Arg Ile Leu
385 390 395 400
Arg Val Lys Met Arg Ala Gly Ile Phe Asp Gly Val Lys Pro Ser Asp
405 410 415
Arg Ala Phe Ala Ala Glu Glu Lys Tyr Leu Gly Ser Ala Glu Asn Arg
420 425 430
Ala Ile Ala Arg Gln Ala Val Arg Glu Ser Leu Val Leu Leu Lys Asn
435 440 445
Gln Asn Lys Leu Leu Pro Leu Asp Arg Lys Met Asn Val Leu Met Ala
450 455 460
Gly Ser Gly Ala Asp Asn Ile Gly Lys Gln Ser Gly Gly Trp Thr Leu
465 470 475 480
Ser Trp Gln Gly Thr Gly Asn Val Asn Ser Asp Phe Pro Gly Ala Thr
485 490 495
Ser Ile Tyr Asp Gly Val Asn Gln Val Val Ser Ser Ala Gly Gly Lys
500 505 510
Val Glu Leu Ser Glu Asn Gly Asn Tyr Gln Ala Lys Pro Asp Val Ala
515 520 525
Ile Val Val Phe Gly Glu Asn Pro Tyr Ala Glu Gly Val Gly Asp Ile
530 535 540
Glu Gly Ile Glu Tyr Gln Leu Asn Asn Lys Arg Asp Ile Asn Leu Leu
545 550 555 560
Gln Lys Leu Lys Ala Asp Gly Ile Pro Val Val Ser Val Phe Leu Thr
565 570 575
Gly Arg Pro Leu Trp Val Asn Lys Glu Leu Asn Ala Ser Asp Ala Phe
580 585 590
Val Ala Ala Trp Leu Pro Gly Ser Glu Gly Val Gly Val Ser Asp Val
595 600 605
Leu Phe Lys Lys Ala Asp Gly Ser Ile Asn Tyr Asp Phe Lys Gly Lys
610 615 620
Leu Thr Tyr Ser Trp Pro Lys Tyr Asp Asp Gln Val Val Ile Asn Lys
625 630 635 640
Gly Asp Lys Asp Tyr Ala Pro Leu Tyr Pro Tyr Gly Tyr Gly Leu Thr
645 650 655
Tyr Ser Asp Val Asp Thr Gln Gly Asp Asp Leu Pro Glu Glu Thr Lys
660 665 670
Val Lys Ile Gly Arg Ala Asp Asp Glu Pro Met Ala Ile Phe Asp Ser
675 680 685
Leu Pro Gln Ser Asp Leu Gly Phe Phe Leu Gly Asp Lys Ala Asn Trp
690 695 700
Val Val Pro Ile Ala Thr Ser Val Val Thr Thr His Asn Ser Asp Asn
705 710 715 720
Leu Thr Met Arg Thr Tyr Asn Trp Lys Val Gln Glu Asp Ala Arg Gln
725 730 735
Leu Ile Trp Lys Gly Asp Ser Lys Ala Asn Ala Phe Phe Ala Trp Pro
740 745 750
Asp Pro His Asn Met Gln Gly Met Leu Glu His Lys Ala Ala Tyr Ser
755 760 765
Phe Ser Ile Lys Val Asp Lys Ala Pro Ala Gly Asp Leu Thr Leu Gly
770 775 780
Ile His Cys Met Glu Glu Cys Gly Lys Lys Leu Val Leu Asn Glu Ala
785 790 795 800
Leu Ser Lys Ile Pro Ala Gly Glu Trp Gly Glu Leu Thr Ile Asp Leu
805 810 815
Ala Cys Ile Ala Asp Ala Glu Ala Leu Ala Glu Val Arg Ser Pro Phe
820 825 830
Met Leu Ser Thr Asp Ala Pro Ala Ser Ile Val Phe Gly Asp Val Lys
835 840 845
Leu Val Pro Gly Gly Ala Asp Ser Ala Ala Ile Lys Cys Asp
850 855 860
<210> 38
<211> 2589
<212> DNA
<213> Microbulbifer degradans
<400> 38
atgctcaaaa agataaacaa gaaaggtctt gctttaagct tagcaattgc agcaatgcta 60
agcggctgca acgaaggcga cagcaacaaa accaaaccaa gtgcggaaac cctctccgct 120
actcaagcca gtaacactgt agccaacccc agcatttggc ccaaggtaac tagcaaggtt 180
gccaaagacg ccaaaatgga agcagatata agcgcaatac tcagcggtat gacccttgag 240
caaaaagtag cccaaatgat ccaacccgaa attcgtgcct tcagcaaaga agacatgaaa 300
aagtatggtt ttggctccta ccttaacggt ggcggcgcat tccctaacga caacaaacat 360
tccaccatgg ccgactgggt tgccctagcc gacgacatgt atgaagcctc tatagacgac 420
agcatagacg gcagcactat tccaaccatg tggggtaccg atgcagtaca cggccacaac 480
aacgtggtta aagcgactat tttcccacac aacattggcc ttggcgccat gcataacccc 540
aagctcatgc agcaaatagg cgctgccacg gctaaagtgg tacaagttac tggtatcgac 600
tgggtatttg cgcccactgt tgcggtagtg cgcgacgacc gctggggccg tacttacgag 660
ggctactctg aagaccccgc catagtaaaa gaatacgctc gcgccatggt tattggcatg 720
cagggcgaag ccaatagcga agcgtttatg ggtgacggca ctgttatagc caccgccaaa 780
cactttttgg gcgatggcgg caccgacaaa ggcgacgacc aaggcaacaa cttatccacc 840
gaacaagaat taattgatat tcacgcccaa ggctatataa gcgccattga agaaggtgtg 900
caaactatca tggcatcttt caatagctgg aatggcgaaa agatgcacgg caataaatct 960
ctgcttaccg atgtccttaa aaagcaaatg ggctttgacg gtttggtggt tggcgattgg 1020
gatggccacg gccaagtaaa aggttgctct aatgcaagct gtgcccaagc catcaacgcc 1080
ggtgtcgata tcatcatggt acccaatgag tggaaaccca tgttcgaaaa caccgttgca 1140
caagttaaaa gcggcgaaat ctctgaagcg cgaattaacg atgcagttac ccgtatttta 1200
cgtgtaaaaa tgcgcgctgg tattttcgac ggtgttaaac catcggatcg cgccttcgca 1260
gcagaagaaa aatacctagg ctctgccgaa aaccgcgcta tcgctcgtca agctgtacgc 1320
gaatcgttag tgttgcttaa aaaccaaaac aaactgctgc cattagaccg caaaatgaac 1380
gttttaatgg cgggttctgg cgcagacaac atcggcaagc aaagtggtgg ttggacatta 1440
agctggcagg gtactggcaa cgtgaacagc gacttccctg gcgcaacatc tatttacgac 1500
ggcgttaacc aagtagtgag cagcgctggc ggtaaagtag agctaagcga aaacggcaac 1560
taccaagcca aaccagatgt agcgattgta gtatttggtg aaaaccctta cgcagaaggc 1620
gtaggcgata ttgaaggtat tgaataccaa ctaaacaata agcgcgatat caatttgtta 1680
caaaaactca aagccgatgg cattcctgtt gtatcggtat tcttaaccgg tcgtccactt 1740
tgggtaaaca aagagcttaa tgcctccgat gcttttgttg cagcttggct gccaggctct 1800
gaaggtgtag gcgtttctga tgtgctattc aaaaaagccg acggtagtat taactacgac 1860
tttaaaggca agctaactta ctcttggcca aagtatgatg accaagtagt aataaacaaa 1920
ggcgacaaag attacgcccc gctttaccct tatggttacg gcttaaccta cagcgatgtt 1980
gacacccaag gtgacgactt acctgaagaa accaaagtta aaattggccg cgctgacgac 2040
gagccaatgg ccatcttcga cagcctaccc caaagcgacc tcggcttctt ccttggcgac 2100
aaagccaact gggtagtacc tattgcaaca agtgtagtta caacgcacaa cagcgataac 2160
ctaaccatgc gcacctacaa ctggaaagta caagaagatg ctcgccagtt aatttggaaa 2220
ggcgacagca aagccaatgc cttctttgca tggccagacc cacacaatat gcaaggcatg 2280
ttagaacaca aagcggctta cagctttagc attaaagtag ataaagcacc cgctggcgac 2340
ctaacactag gcatacactg catggaagaa tgcggtaaaa aacttgtgct taacgaagcg 2400
cttagcaaaa ttcctgctgg tgagtgggga gagctaacaa tagatctagc ttgcatagca 2460
gatgccgaag ccttggccga agttcgctca cccttcatgc taagcaccga tgcacccgca 2520
tctatcgtgt ttggcgatgt gaagttagta cctggcggtg cagatagcgc agctattaag 2580
tgtgactaa 2589
<210> 39
<211> 461
<212> PRT
<213> Microbulbifer degradans
<400> 39
Met Lys Thr Phe Asn Pro Asp Phe Val Trp Gly Ala Ala Ser Ser Ala
1 5 10 15
Tyr Gln Val Glu Gly Ala Thr Thr Thr Asp Gly Arg Gly Pro Ser Ile
20 25 30
Trp Asp Ala Phe Ser Ser Ile Pro Gly Lys Thr Tyr His Asn Gln Asn
35 40 45
Ala Asp Ile Ala Cys Asp His Tyr Asn Arg Trp Gln Glu Asp Val Ala
50 55 60
Ile Met Lys Glu Met Gly Leu Lys Ala Tyr Arg Phe Ser Ile Ser Trp
65 70 75 80
Ser Arg Ile Phe Pro Thr Gly Arg Gly Glu Val Asn Glu Lys Gly Val
85 90 95
Ala Phe Tyr Asn Asn Leu Ile Asp Glu Leu Ile Lys Asn Asp Ile Thr
100 105 110
Pro Trp Val Thr Leu Phe His Trp Asp Phe Pro Leu Ala Leu Gln Met
115 120 125
Glu Met Asp Gly Leu Leu Asn Pro Ala Ile Ala Asp Glu Phe Ala Asn
130 135 140
Tyr Ala Lys Leu Cys Phe Ala Arg Phe Gly Asp Arg Val Thr His Trp
145 150 155 160
Ile Thr Leu Asn Glu Pro Trp Cys Ser Ala Met Leu Gly His Gly Met
165 170 175
Gly Ser Lys Ala Pro Gly Arg Val Ser Lys Asp Glu Pro Tyr Ile Ala
180 185 190
Ala His Asn Leu Leu Arg Ala His Gly Lys Met Val Asp Ile Tyr Arg
195 200 205
Arg Glu Phe Gln Pro Thr Gln Lys Gly Met Ile Gly Ile Ala Asn Asn
210 215 220
Cys Asp Trp Arg Glu Pro Lys Thr Asp Ser Glu Leu Asp Lys Lys Ala
225 230 235 240
Ala Glu Arg Ala Leu Glu Phe Phe Val Ser Trp Phe Ala Asp Pro Ile
245 250 255
Tyr Leu Gly Asp Tyr Pro Ala Ser Met Arg Glu Arg Leu Gly Glu Arg
260 265 270
Leu Pro Thr Phe Ser Asp Glu Asp Ile Ala Leu Ile Lys Asn Ser Ser
275 280 285
Asp Phe Phe Gly Leu Asn His Tyr Thr Thr Met Leu Ala Glu Gln Thr
290 295 300
His Glu Gly Asp Val Val Glu Asp Thr Ile Arg Gly Asn Gly Gly Ile
305 310 315 320
Ser Glu Asp Gln Met Val Thr Leu Ser Lys Asp Pro Ser Trp Glu Gln
325 330 335
Thr Asp Met Glu Trp Ser Ile Val Pro Trp Gly Cys Lys Lys Leu Leu
340 345 350
Ile Trp Leu Ser Glu Arg Tyr Asn Tyr Pro Asp Ile Tyr Ile Thr Glu
355 360 365
Asn Gly Cys Ala Leu Pro Asp Glu Asp Asp Val Asn Ile Ala Ile Asn
370 375 380
Asp Thr Arg Arg Val Asp Phe Tyr Arg Gly Tyr Ile Asp Ala Cys His
385 390 395 400
Gln Ala Ile Glu Ala Gly Val Lys Leu Lys Gly Tyr Phe Ala Trp Thr
405 410 415
Leu Met Asp Asn Tyr Glu Trp Glu Glu Gly Tyr Thr Lys Arg Phe Gly
420 425 430
Leu Asn His Val Asp Phe Thr Thr Gly Lys Arg Thr Pro Lys Gln Ser
435 440 445
Ala Ile Trp Tyr Ser Thr Leu Ile Lys Asp Gly Gly Phe
450 455 460
<210> 40
<211> 1386
<212> DNA
<213> Microbulbifer degradans
<400> 40
atgaaaacct ttaacccaga tttcgtatgg ggagcagcca gttccgccta tcaggtagaa 60
ggcgccacca ccaccgatgg cagaggcccc agtatttggg atgcgttcag ttccattccc 120
ggtaaaacct accacaacca aaacgccgac atagcctgcg accactacaa ccgctggcaa 180
gaagacgtgg ccataatgaa agagatgggg ctaaaggctt accgcttttc tatttcttgg 240
tcgcgcatat tccctactgg gcgcggcgaa gttaacgaaa aaggcgtagc cttttacaac 300
aaccttatcg acgaattaat aaaaaacgac attacccctt gggtaaccct atttcactgg 360
gactttcctc tggcactgca aatggaaatg gacggcctac ttaaccccgc catcgccgac 420
gaattcgcca actacgccaa gctgtgtttc gcgcgctttg gcgaccgcgt tacccactgg 480
attaccctaa acgaaccttg gtgcagtgcc atgcttggcc acggcatggg cagcaaagcc 540
cctggccgcg tatctaagga tgaaccctat atagccgccc acaacttgct gcgtgcacac 600
ggcaaaatgg tagatattta ccggcgcgaa tttcagccca cacaaaaagg catgataggc 660
atagccaaca attgcgactg gcgcgaaccc aaaaccgatt ctgaattaga taaaaaagca 720
gccgagcgcg ccctagaatt ttttgtaagc tggtttgccg accccattta tttgggcgac 780
tacccagcca gcatgcgcga gcgcttgggt gagcgtttac ccacctttag cgacgaagac 840
attgcgctaa taaaaaactc tagcgacttt tttggtttga atcactacac caccatgctt 900
gccgaacaaa cccacgaagg tgacgttgtt gaagatacta ttcgcggcaa cggcggcata 960
tcggaagacc aaatggtcac cctctccaaa gacccaagct gggaacaaac cgacatggag 1020
tggagcattg tgccctgggg ctgtaaaaaa ttattaatct ggttaagcga gcgctacaac 1080
taccccgaca tttacattac cgaaaacggc tgcgccctac ccgacgaaga cgacgtaaac 1140
atagccatta acgatacacg ccgcgtagat ttttaccgcg gttatatcga tgcgtgtcac 1200
caagcaatag aggccggcgt aaaactaaaa ggctattttg catggacact tatggataac 1260
tacgaatggg aagaaggcta caccaaacgc tttggcttaa accatgtaga tttcaccaca 1320
ggcaaacgca cacctaaaca gtctgcaatt tggtatagca cgttaattaa agatggtggg 1380
ttctag 1386
<210> 41
<211> 444
<212> PRT
<213> Microbulbifer degradans
<400> 41
Met Asn Arg Leu Thr Leu Pro Pro Ser Ser Arg Leu Arg Ser Lys Glu
1 5 10 15
Phe Thr Phe Gly Val Ala Thr Ser Ser Tyr Gln Ile Glu Gly Gly Ile
20 25 30
Asp Ser Arg Leu Pro Cys Asn Trp Asp Thr Phe Cys Glu Gln Pro Asn
35 40 45
Thr Ile Ile Asp Asn Thr Asn Gly Ala Ile Ala Cys Asp His Ile Asn
50 55 60
Arg Trp Gln Asp Asp Ile Glu Leu Ile Ala Asn Leu Gly Val Asp Ala
65 70 75 80
Tyr Arg Phe Ser Ile Ala Trp Gly Arg Val Ile Asn Leu Asp Gly Ser
85 90 95
Leu Asn Asn Glu Gly Val Thr Phe Tyr Lys Asn Ile Leu Thr Lys Leu
100 105 110
Arg Glu Lys Asn Leu Lys Ala Tyr Ile Thr Leu Tyr His Trp Asp Leu
115 120 125
Pro Gln His Leu Glu Asp Ala Gly Gly Trp Leu Asn Arg Asp Thr Ala
130 135 140
Tyr Lys Phe Arg Asp Tyr Val Asn Leu Ile Thr Gln Ala Leu Asp Asp
145 150 155 160
Asp Val Phe Cys Tyr Thr Thr Leu Asn Glu Pro Phe Cys Ser Ala Tyr
165 170 175
Leu Gly Tyr Glu Ile Gly Val His Ala Pro Gly Ile Lys Asp Leu Ala
180 185 190
Ser Gly Arg Lys Ala Ala His His Leu Leu Leu Ala His Gly Leu Ala
195 200 205
Met Gln Val Leu Arg Lys Asn Cys Pro Asn Ser Leu Ser Gly Ile Val
210 215 220
Leu Asn Met Ser Pro Cys Tyr Ala Gly Ser Asn Ala Gln Ala Asp Ile
225 230 235 240
Asp Ala Ala Lys Arg Ala Asp Asp Leu Leu Phe Gln Trp Tyr Ala Gln
245 250 255
Pro Leu Leu Thr Gly Cys Tyr Pro Asp Ala Ile Asn Ser Leu Pro Asp
260 265 270
Asn Ala Lys Pro Pro Ile Cys Glu Gly Asp Met Ala Leu Ile Ser Gln
275 280 285
Pro Leu Asp Tyr Leu Gly Leu Asn Tyr Tyr Thr Arg Ala Val Phe Phe
290 295 300
Ala Asp Gly Asn Gly Gly Phe Thr Glu Gln Val Pro Glu Gly Val Glu
305 310 315 320
Leu Thr Asp Met Gly Trp Glu Val Tyr Pro Gln Gly Leu Thr Asp Leu
325 330 335
Leu Ile Asp Leu Asn Gln Arg Tyr Thr Leu Pro Pro Leu Leu Ile Thr
340 345 350
Glu Asn Gly Ala Ala Met Val Asp Glu Leu Val Asn Gly Glu Val Asn
355 360 365
Asp Ile Ala Arg Ile Asn Tyr Phe Gln Thr His Leu Gln Ala Val His
370 375 380
Asn Ala Ile Glu Gln Gly Val Asp Val Arg Gly Tyr Phe Ala Trp Ser
385 390 395 400
Leu Met Asp Asn Phe Glu Trp Ala Leu Gly Tyr Ser Lys Arg Phe Gly
405 410 415
Ile Thr Tyr Val Asp Tyr Gln Thr Gln Lys Arg Thr Leu Lys Ala Ser
420 425 430
Gly His Ala Phe Ala Glu Phe Val Ser Ser Arg Ser
435 440
<210> 42
<211> 1335
<212> DNA
<213> Microbulbifer degradans
<400> 42
atgaatagac ttacactacc gccttcttct cgtttgcgca gcaaagagtt tacctttggt 60
gttgcaacgt cgtcttacca aattgaaggc ggcatagatt ctcgcctgcc ctgtaattgg 120
gatacgttct gtgagcagcc caataccatt attgataaca ccaacggcgc cattgcttgc 180
gaccacataa atagatggca agacgatata gaacttattg ccaacctagg ggtagatgcc 240
taccgctttt ctattgcgtg gggccgtgtt attaatttag acggcagcct caataatgaa 300
ggcgttacat tttacaaaaa tattttaact aagcttcgcg aaaagaattt aaaagcttat 360
ataacgctat accactggga cttgccacaa catttagaag atgctggcgg ctggcttaac 420
cgcgataccg cctacaagtt tcgcgactat gtaaacctta taacccaagc gcttgatgac 480
gatgtatttt gctacacaac gttaaacgag cccttttgca gtgcctacct tggctatgaa 540
attggtgtac acgcaccggg tataaaagac ttagccagtg ggcgcaaagc cgcacaccat 600
ttattacttg cccatggctt agctatgcaa gtgctgcgaa aaaactgccc caatagttta 660
agcggcatag tgttaaacat gagcccttgt tacgccggca gcaacgcaca agcagatata 720
gatgcagcaa aacgcgcgga cgatttatta tttcagtggt atgcacaacc gctacttact 780
ggctgctacc ctgatgcaat aaacagcctg ccagacaatg ccaaaccacc tatttgtgaa 840
ggcgacatgg cgttaataag ccaaccttta gattatttag gccttaacta ctatacccgc 900
gcagtatttt ttgccgacgg taatggcggt tttaccgaac aagtacctga gggtgtagag 960
ctaaccgata tgggctggga agtttacccg caaggcttaa ccgatttact aatagaccta 1020
aaccaacgct ataccctacc cccgttactt attaccgaaa acggcgcagc aatggtggac 1080
gaacttgtta acggcgaagt taacgatatt gcccgaataa attattttca aacccattta 1140
caagcggtac acaacgccat tgaacaaggt gttgatgtac gcggttattt tgcttggagc 1200
ctaatggata attttgagtg ggcactgggt tacagcaaac gattcggtat tacctatgta 1260
gattaccaaa cacaaaagcg aacgctaaaa gccagcggcc acgcatttgc tgagtttgtc 1320
tcgagtagga gctaa 1335
<210> 43
<211> 866
<212> PRT
<213> Microbulbifer degradans
<400> 43
Met Leu Leu Ser Leu Lys Asn Thr Gln Leu Lys Arg Ser Met Asn Met
1 5 10 15
Asn Leu Lys His Leu Phe Leu Val Ala Leu Ala Leu Asn Ile Ala Ala
20 25 30
Cys Asn Val Lys Glu Pro Ala Ala Thr Asn Asp Asn His Ile Ser Tyr
35 40 45
Gln Ala Ala Arg Glu Ala Arg Leu Ala Lys Val Glu Ala Glu Val Glu
50 55 60
Arg Leu Leu Pro Leu Leu Thr Leu Glu Glu Lys Ala Ser Leu Val His
65 70 75 80
Ala Asn Ser Lys Phe Ser Ile Ala Ser Ile Glu Arg Leu Gly Ile His
85 90 95
Glu Met Trp Met Ser Asp Gly Pro His Gly Val Arg Tyr Gln Ile Glu
100 105 110
Arg His Gly Trp Ala Pro Ala Gly Trp Thr Asp Asp Asn Ser Thr Tyr
115 120 125
Leu Pro Pro Leu Thr Thr Val Ala Ala Ser Trp Asn Pro Glu Ile Ala
130 135 140
Ala Leu His Gly Asp Val Leu Gly Ala Glu Ala Arg His Arg Arg Lys
145 150 155 160
Asp Val Ile Leu Gly Pro Gly Val Asn Leu Ala Arg Leu Pro Leu Tyr
165 170 175
Gly Arg Asn Phe Glu Tyr Met Gly Glu Asp Pro Phe Leu Ala Ser Arg
180 185 190
Leu Ala Val Ala Glu Ile Lys Ala Ile Gln Glu Asn Asp Val Ala Ala
195 200 205
Cys Ile Lys His Phe Ala Leu Asn Asn Gln Glu Leu Asn Arg Thr Gly
210 215 220
Val Asn Ala Lys Pro Asp Glu Arg Thr Leu Arg Glu Val Tyr Leu Pro
225 230 235 240
Ala Phe Glu Ala Ala Val Lys Glu Ala Gly Val His Thr Ile Met Gly
245 250 255
Ala Tyr Asn Glu Phe Arg Gly Thr Asn Ala Asn Gln Ser Lys His Leu
260 265 270
Val Met Asp Ile Leu Lys Gly Glu Trp Gly Tyr Lys Gly Val Leu Leu
275 280 285
Thr Asp Trp Asn Val Asp Ile Asn Thr Tyr Asp Ala Ala Val Asn Gly
290 295 300
Leu Asp Ile Glu Met Gly Thr Asn Val Asp Ser Tyr Asp Asp Tyr Met
305 310 315 320
Leu Ala Gln Pro Met Ile Asp Met Ile Lys Ala Gly Ser Ile Pro Glu
325 330 335
Ser Val Leu Asp Asp Lys Val Arg Arg Ile Leu Arg Val Gln Leu Ser
340 345 350
Ile Gly Met Met Asp Lys Tyr Arg Leu Ser Gly Glu Arg Asn Thr Ala
355 360 365
Lys His His Glu Ala Ala Arg Lys Ile Ala Ser Glu Gly Ile Val Leu
370 375 380
Leu Lys Asn Glu Asn Ile Leu Pro Leu Asn Lys Asn Lys Ile Lys Asn
385 390 395 400
Val Leu Val Leu Gly Pro Asn Ala Asp Lys Val His Gly Leu Gly Gly
405 410 415
Gly Ser Ser Glu Val Pro Ala Leu Tyr Glu Ile Thr Pro Leu Gln Gly
420 425 430
Leu Lys Gln Lys Leu Gly Asp Asn Val Asn Ile Thr Val Met Arg Ala
435 440 445
Arg Tyr Asp Gly Val Leu Met Pro Ile Ala Ser Asp Tyr Val Thr Ser
450 455 460
Arg His Trp Thr Gly Thr Pro Ala Trp Asn Met Val Arg Tyr Ser Asp
465 470 475 480
Ala Ala Arg Thr Gln Ala Ile Gly Asp Ser Ala Ile Val Asp Ser Ala
485 490 495
Tyr Ser Ser Pro Ala Gly Thr Thr Lys Glu Tyr Val Thr Met Thr Ala
500 505 510
Thr Ile Lys Pro Leu Lys Ser Gly Glu His Thr Leu Lys Thr Ser Val
515 520 525
Met Gly Asp Phe Glu Leu Lys Ile Asn Gly Lys Thr Thr Val Lys His
530 535 540
Ser Ser Thr Ser Gly Asp Val Val Thr Gln Lys Ile Ala Leu Asn Gly
545 550 555 560
Gly Glu Thr Tyr Ser Phe Glu Ile Leu Tyr Ser Gly Asn Lys Asn Phe
565 570 575
Thr Leu Gly Trp Asp Ala Pro Gly Asp Leu Phe Thr Ala Glu Lys Glu
580 585 590
Tyr Ile Ala Ala Ala Lys Lys Ala Asp Val Val Phe Tyr Phe Gly Gly
595 600 605
Leu Thr His Gly Asp Asp Arg Glu Ala Ile Asp Arg Pro His Met Lys
610 615 620
Leu Pro Asn His Gln Asp Pro Val Ile Ser Lys Val Leu Ala Ala Asn
625 630 635 640
Pro Asn Thr Val Val Phe Leu Ile Ala Gly Ser Ala Val Glu Met Pro
645 650 655
Trp Ala Asp Lys Ala Lys Ala Ile Val Trp Gly Trp Tyr Gly Gly Met
660 665 670
Glu Ala Gly Asn Ala Tyr Ala Asp Met Leu Phe Gly Asp Thr Asn Pro
675 680 685
Ser Gly Lys Met Pro Ile Thr Leu Pro Lys Ala Leu Glu Asp Thr Ala
690 695 700
Pro Ile Ala Leu Asn Asp Tyr Asn Pro Val Glu Ser Leu Tyr Thr Glu
705 710 715 720
Gly Val Phe Ile Gly Tyr Arg Trp Phe Glu Lys Gln Asn Ile Glu Pro
725 730 735
Leu Phe Pro Phe Gly His Gly Leu Ser Tyr Thr Gln Phe Lys Tyr Asn
740 745 750
Asn Ile Lys Leu Ser Ser Ala Asn Ile Lys Gly Asp Gln Thr Val Thr
755 760 765
Val Ser Ala Thr Ile Thr Asn Thr Gly Lys Val Ala Gly Ala Glu Val
770 775 780
Val Gln Leu Tyr Leu His Asp Glu Gln Ala Ser Val Glu Arg Pro Ala
785 790 795 800
Lys Glu Leu Lys Gly Phe Gln Lys Val Phe Leu Lys Pro Gly Glu Ser
805 810 815
Lys Ala Val Asn Ile Thr Leu Asn Lys Arg Ala Leu Ser Phe Trp Asp
820 825 830
Glu Asn Ser Asn Asp Trp Leu Ala Glu Thr Gly Lys Phe Asn Val Leu
835 840 845
Leu Gly Ala Ser Val Ser Asp Ile Arg Leu Gln Thr Ser Phe Gln Tyr
850 855 860
Gln Gln
865
<210> 44
<211> 2601
<212> DNA
<213> Microbulbifer degradans
<400> 44
atgctgctaa gcttaaaaaa cactcaactc aaaagaagta tgaacatgaa ccttaaacac 60
ctctttctgg ttgctttggc gctaaatatt gctgcgtgca atgtaaaaga gcccgcggcg 120
acaaatgata accacattag ctaccaagcc gctcgcgaag cgcgcttggc aaaagttgaa 180
gccgaagttg aacgcctgct gccactatta acactagaag aaaaagcctc tttggttcat 240
gcgaacagca aattctctat cgcctctatc gagcggctag gcattcacga aatgtggatg 300
tctgatggcc cccacggcgt gcgctatcaa atcgaacgcc acggctgggc accagcaggc 360
tggacagatg acaactccac ttacttacca ccgcttacta ccgtagccgc cagctggaac 420
cccgaaatag ctgcccttca cggcgatgta ctcggcgcag aagctcgcca ccgccgtaaa 480
gatgtaatat taggcccagg cgtaaactta gctcgcctgc cactttatgg tcgtaacttt 540
gaatatatgg gtgaagaccc cttcttggca tcacgtcttg ctgtggcaga aattaaagcc 600
attcaagaaa atgacgtggc cgcctgtatc aaacatttcg cgcttaacaa tcaagagctg 660
aatcgcaccg gcgtaaacgc caaacccgat gaacgcacat tacgcgaagt gtatttaccc 720
gccttcgaag ccgccgttaa agaagcgggc gtgcacacca taatgggggc ctacaatgaa 780
tttcgcggta ccaacgccaa ccaaagcaaa catttagtaa tggatattct aaaaggcgaa 840
tggggctaca aaggcgtgtt actcacagac tggaacgtag atatcaacac ttacgatgcc 900
gctgttaacg gcctcgatat cgaaatgggt acaaatgtag atagctacga cgactacatg 960
cttgcccaac caatgatcga catgattaaa gcgggcagca ttccagagtc agtacttgat 1020
gataaagttc gtcgcatact gcgcgtgcaa ctcagcatag gcatgatgga caaataccgc 1080
ttatctggtg agcgcaatac tgccaagcat cacgaagctg cacgcaaaat tgcatctgaa 1140
ggtattgtgc tactaaaaaa tgaaaacatt ctgccgctaa ataaaaacaa aattaaaaac 1200
gtattggtgc ttggccccaa cgcagacaaa gtgcacggtt taggcggtgg ctcgtcagaa 1260
gtgccagcac tttatgaaat aaccccgtta caagggttaa aacagaagct gggagataat 1320
gtaaacatta ccgttatgcg cgcacgctat gacggtgtgt taatgcctat cgccagtgat 1380
tatgttactt ctcgtcactg gaccggcaca cctgcatgga acatggtgcg ttactcggat 1440
gctgcgcgca cccaagctat tggcgactcc gccattgttg attcggctta ttcttcgcct 1500
gcaggcacga ctaaagaata cgtcaccatg accgccacaa ttaaaccgtt aaaatcgggc 1560
gagcacacac tcaaaacatc ggtgatgggc gatttcgaat taaaaattaa cggtaaaacc 1620
acagtaaaac atagcagcac tagcggcgat gtagtaaccc aaaaaatcgc cctcaacggc 1680
ggtgaaacat acagcttcga aattttatac agcggcaata aaaactttac cttgggctgg 1740
gatgcaccgg gagatttatt taccgcagaa aaagaataca tagccgccgc gaaaaaagcg 1800
gatgtagtgt tttactttgg cggcctaacc cacggcgacg accgcgaagc aattgaccgc 1860
cctcacatga agctgcctaa ccatcaagac ccagttatta gcaaagtatt agctgcaaac 1920
ccgaacacgg ttgtattttt aattgcaggc tctgctgtag aaatgccgtg ggccgataaa 1980
gctaaagcta ttgtgtgggg ctggtatggc ggtatggagg ccggtaacgc ctacgccgat 2040
atgctatttg gcgataccaa ccccagcggc aaaatgccaa taactttacc aaaggcactg 2100
gaagatactg ctccaatcgc actgaatgat tacaaccctg ttgaatcact ctacaccgag 2160
ggcgtgttta ttggttaccg ctggttcgaa aaacaaaaca tcgagccgct attcccgttc 2220
ggtcatggtt tgtcttatac ccagtttaag tacaacaata taaagctctc tagcgcgaac 2280
attaaaggcg accaaaccgt caccgtaagc gcaaccatta ccaatactgg caaagtggcc 2340
ggcgctgaag ttgtacaact gtatttgcat gacgagcaag caagcgtaga acgcccagca 2400
aaagaactta aaggtttcca aaaagtgttt ttaaagccgg gtgaaagcaa agcggtaaat 2460
attacgctta ataaacgcgc cctttcattt tgggatgaaa acagcaacga ctggcttgca 2520
gaaacaggta aatttaatgt gctattgggc gcatcagtaa gcgatatacg cttacaaact 2580
agcttccaat accagcagta a 2601
<210> 45
<211> 811
<212> PRT
<213> Microbulbifer degradans
<400> 45
Met Lys Phe Gly His Phe Asp Asp Lys Ala Arg Glu Tyr Val Ile Thr
1 5 10 15
Asp Pro Lys Thr Pro Tyr Pro Trp Ile Asn Tyr Leu Gly Asn Glu Asp
20 25 30
Phe Phe Ser Leu Val Ser Asn Thr Gly Gly Gly Tyr Ser Phe Tyr Lys
35 40 45
Asp Ala Lys Phe Arg Arg Leu Thr Arg Tyr Arg Tyr Asn Asn Val Pro
50 55 60
Val Asp Asn Gly Gly Lys Tyr Phe Tyr Ile Asn Asp Ser Gly Asp Val
65 70 75 80
Trp Ser Pro Gly Trp Lys Pro Val Lys Ala Glu Leu Asp Ala Tyr Ser
85 90 95
Cys Ala His Gly Leu Ser Tyr Thr Arg Ile Thr Gly Glu Arg Asn Gly
100 105 110
Ile Gln Ala Glu Val Leu Ser Phe Ile Pro Leu Gly Thr Trp Ala Glu
115 120 125
Ile Gln Lys Val Ser Leu Lys Asn Thr Ser Gly Ala Thr Lys Lys Phe
130 135 140
Lys Leu Phe Ser Phe Ala Glu Trp Cys Leu Trp Asn Ala Glu Asp Asp
145 150 155 160
Met Thr Asn Phe Gln Arg Asn Phe Ser Thr Gly Glu Val Glu Val Glu
165 170 175
Asp Ser Val Ile Tyr His Lys Thr Glu Phe Lys Glu Arg Arg Asn His
180 185 190
Tyr Ala Phe Tyr Ser Val Asn Ala Pro Ile Gln Gly Phe Asp Thr Asp
195 200 205
Arg Asp Lys Trp Lys Gly Leu Tyr Asn Asp Phe Asp Lys Pro Asp Ala
210 215 220
Val Phe Glu Gly Glu Pro Arg Asn Ser Glu Ala His Gly Trp Ser Pro
225 230 235 240
Ile Ala Ser His Tyr Leu Glu Val Glu Leu Ala Pro Gly Glu Ser Lys
245 250 255
Asp Leu Ile Phe Val Leu Gly Tyr Ile Glu Val Ala Pro Glu Asn Lys
260 265 270
Trp Glu Ser Lys Gly Val Ile Asn Lys Ser Pro Ala Lys Glu Leu Ile
275 280 285
Ala Arg Phe Asp Ser Val Glu Lys Val Asp Ala Glu Leu Thr Lys Leu
290 295 300
Ala Asp Tyr Trp Ala Asn Leu Leu Ser Thr Tyr Ser Val Glu Ser Gly
305 310 315 320
Asp Glu Lys Leu Asp Arg Met Val Asn Ile Trp Asn Gln Tyr Gln Cys
325 330 335
Met Val Thr Phe Asn Met Ser Arg Ser Ala Ser Phe Phe Glu Ser Gly
340 345 350
Ile Gly Arg Gly Met Gly Phe Arg Asp Ser Asn Gln Asp Leu Ile Gly
355 360 365
Phe Val His Gln Val Pro Glu Arg Ala Arg Glu Arg Ile Ile Asp Ile
370 375 380
Ala Ser Thr Gln Phe Glu Asp Gly Ser Ala Tyr His Gln Tyr Gln Pro
385 390 395 400
Leu Thr Lys Arg Gly Asn Asn Ala Ile Gly Gly Asn Phe Asn Asp Asp
405 410 415
Pro Leu Trp Leu Ile Leu Ser Thr Thr Asp Tyr Ile Lys Glu Thr Gly
420 425 430
Asp Phe Ser Ile Leu Glu Glu Gln Val Pro Tyr Asp Asn Asp Ala Ser
435 440 445
Lys Ala Thr Ser His Phe Glu His Leu Lys Arg Ser Phe Tyr His Thr
450 455 460
Val Asn Asn Leu Gly Pro His Gly Leu Pro Leu Ile Gly Arg Ala Asp
465 470 475 480
Trp Asn Asp Cys Leu Asn Leu Asn Cys Phe Ser Glu Asp Pro Asn Glu
485 490 495
Ser Phe Gln Thr Thr Gly Asn Lys Thr Gly Arg Thr Ala Glu Ser Leu
500 505 510
Met Ile Ala Gly Leu Phe Val Leu Tyr Gly Asn Glu Phe Val Lys Leu
515 520 525
Cys Arg Glu Ile Gly Gln Asp Gly Glu Ala Ala Glu Ala Gln Ala His
530 535 540
Ile Asp Gln Met Val Glu Ala Val Lys Lys His Gly Trp Asp Gly Glu
545 550 555 560
Trp Phe Leu Arg Ala Tyr Asp Tyr Tyr Gly Lys Lys Val Gly Ser Lys
565 570 575
Glu Asn Glu Glu Gly Lys Ile Phe Ile Glu Ser Gln Gly Phe Cys Gly
580 585 590
Met Ala Gly Ile Gly Leu Glu Asp Gly Leu Val Glu Lys Ser Met Asp
595 600 605
Ser Val Lys Glu Trp Leu Asp Cys Asp Tyr Gly Ile Val Leu Gln Gln
610 615 620
Pro Ala Phe Thr Lys Tyr Tyr Ile Glu Tyr Gly Glu Ile Ser Thr Tyr
625 630 635 640
Pro Ala Gly Tyr Lys Glu Asn Ala Gly Ile Phe Cys His Asn Asn Pro
645 650 655
Trp Ile Met Ile Thr Glu Thr Leu Leu Gly Arg Gly Asp Lys Ala Phe
660 665 670
Glu Tyr Tyr Arg Lys Ile Ala Pro Ala Tyr Leu Glu Glu Ile Ser Asp
675 680 685
Leu His Lys Val Glu Pro Tyr Ala Tyr Cys Gln Met Ile Ala Gly Lys
690 695 700
Asp Ala Tyr Leu Pro Gly Glu Gly Lys Asn Ser Trp Leu Thr Gly Thr
705 710 715 720
Ala Ser Trp Asn Phe Ala Ala Ile Thr Gln Tyr Ile Leu Gly Val Lys
725 730 735
Pro Asp Tyr Ser Gly Leu Ala Ile Asn Pro Cys Ile Pro Ser Ser Trp
740 745 750
Asp Gly Phe Lys Val Thr Arg Lys Tyr Arg Gly Ala Thr Tyr Asn Ile
755 760 765
Ile Val Thr Asn Pro Thr His Val Ser Lys Gly Val Lys Ser Leu Thr
770 775 780
Leu Asn Gly Asn Ala Ile Asp Gly Tyr Ile Val Pro Pro Gln Gln Ala
785 790 795 800
Gly Thr Val Cys Asn Val Glu Val Thr Leu Gly
805 810
<210> 46
<211> 2436
<212> DNA
<213> Microbulbifer degradans
<400> 46
atgaaatttg ggcactttga cgacaaagca cgcgagtatg taattaccga cccgaaaact 60
ccctacccgt ggataaacta cttaggcaac gaagacttct tcagcctagt atctaacact 120
gggggtggct acagttttta caaagatgca aagttccgtc gtttaacacg ctatagatac 180
aacaacgtac ccgtagacaa cggcggtaaa tatttttaca tcaatgatag tggcgatgta 240
tggagccccg gttggaagcc ggtaaaagca gagctagacg catacagctg cgctcacggc 300
cttagctaca cccgcattac cggcgaaaga aacggcattc aagcggaagt acttagcttt 360
atccctctcg gcacttgggc cgaaattcaa aaagttagcc ttaagaatac ctctggcgct 420
accaaaaaat ttaaactgtt ttctttcgcc gaatggtgcc tatggaacgc agaagatgac 480
atgaccaact tccaacgcaa cttctccacc ggtgaagtag aggtggaaga ctctgttatt 540
tatcacaaga cagaatttaa agagcgccgc aatcattacg cattctactc tgtaaacgca 600
ccaattcagg gcttcgacac cgacagagac aaatggaaag gcttgtacaa cgattttgat 660
aaacccgatg ccgtttttga aggcgagcct cgcaactccg aagcgcacgg ctggtcgcca 720
attgcatctc actatctaga agtggagctc gcaccaggcg aaagcaaaga cttaattttt 780
gtgcttggct atatagaagt tgccccagaa aacaaatggg aatcaaaggg cgttatcaac 840
aagtctccag ccaaagaact tattgcgcgt ttcgatagcg tagaaaaagt agatgccgag 900
ttaaccaagc tagccgatta ttgggcaaat ttgctttcta cttacagcgt agaaagtggc 960
gacgaaaagc tagaccgcat ggtaaatatt tggaaccaat accagtgtat ggtgacattt 1020
aatatgagtc gctctgcgtc tttcttcgaa tctggcattg gccgtggtat gggcttccgc 1080
gattccaatc aggatttgat aggctttgta caccaagtac ccgagcgcgc ccgcgaacgc 1140
ataattgata ttgcttctac tcagtttgaa gacggttcgg cctaccacca gtatcagcct 1200
ttaaccaaac gcggcaacaa cgcaattggc ggcaacttta acgatgaccc tctttggcta 1260
atcctttcta ccaccgatta cataaaagag actggcgatt tctctatttt agaagagcaa 1320
gtgccttacg ataatgatgc gagcaaagcc acaagtcatt ttgaacattt aaagcgctcg 1380
ttttatcaca cggttaataa tttaggccca catggcttgc cacttattgg tcgcgccgac 1440
tggaacgact gcctaaacct aaactgcttt agtgaagacc ctaacgaatc attccaaacc 1500
acgggcaaca aaaccggcag aacggctgag tcgttaatga ttgcaggttt atttgtttta 1560
tacggcaacg agtttgtaaa actgtgccgt gaaataggcc aagacggaga agcggcagaa 1620
gcccaagccc atattgacca aatggtagaa gctgtgaaaa agcacggctg ggatggcgag 1680
tggtttttgc gtgcttacga ctactacggt aaaaaagtag gcagtaaaga aaacgaagaa 1740
ggcaaaatat ttatcgaatc gcaaggtttc tgcggcatgg caggaatcgg cctagaagac 1800
ggccttgtcg aaaaatcgat ggattctgtt aaagaatggt tagattgcga ttacggtatt 1860
gtgttgcagc aaccggcgtt taccaagtac tacatagagt atggtgaaat ctccacctac 1920
cctgctggct acaaagagaa cgcaggtatc ttctgccaca acaacccgtg gattatgatc 1980
accgaaactt tgcttggccg cggtgacaaa gcctttgaat actaccgcaa aattgcacct 2040
gcatacctag aggaaattag cgatcttcac aaagtagagc cttacgccta ctgccagatg 2100
attgcaggta aagatgccta cttacctggc gagggtaaaa actcatggct aacagggacc 2160
gcttcgtgga acttcgctgc aattactcag tacattttag gcgtaaaacc agactatagc 2220
ggtttagcaa ttaacccttg cataccgtct agctgggatg gctttaaagt tacccgtaag 2280
tatcgcggcg caacctataa catcatcgta accaacccaa cccatgtaag caaaggcgta 2340
aaatcgctca ccctaaatgg caacgctatt gatggctaca tagtgccacc gcaacaagct 2400
ggcaccgtat gtaacgtaga agttacattg ggctaa 2436
<210> 47
<211> 788
<212> PRT
<213> Microbulbifer degradans
<400> 47
Met Leu Lys Ala Ile Asn Asn Gly Glu Arg Tyr Gln Leu Thr Ser Pro
1 5 10 15
Thr Ala Met Pro Gln Ser Ala Ser Phe Leu Trp Asn Lys Lys Met Met
20 25 30
Ile Gln Val Asn Cys Arg Gly Tyr Ala Val Ala Gln Phe Met Gln Pro
35 40 45
Glu Pro Ala Lys Tyr Ala Tyr Ala Pro Asn Leu Glu Ala Lys Thr Phe
50 55 60
Met Gln Pro Glu Gln Pro Tyr Tyr Ala His His Pro Gly Arg Phe Phe
65 70 75 80
Tyr Ile Lys Asp Glu Glu Thr Gly Glu Ile Phe Ser Ala Pro Tyr Glu
85 90 95
Pro Val Arg Ser Gln Leu Asn Asn Phe Ser Phe Asn Ala Gly Lys Ser
100 105 110
Asp Ile Ser Trp His Ile Ala Ala Leu Gly Ile Glu Val Glu Leu Cys
115 120 125
Leu Ser Leu Pro Val Asp Asp Val Val Glu Leu Trp Glu Leu Lys Ile
130 135 140
Lys Asn Gly Gly Ala Gln Pro Arg Lys Leu Ser Ile Tyr Pro Tyr Phe
145 150 155 160
Pro Val Gly Tyr Met Ser Trp Met Asn Gln Ser Gly Asp Tyr Ser Gln
165 170 175
Thr Ala Gly Gly Ile Ile Ala Ser Cys Val Thr Pro Tyr Gln Lys Val
180 185 190
Ala Asp Tyr Phe Lys Asn Lys Asp Phe Lys Asp Lys Thr Phe Phe Leu
195 200 205
His Glu Thr Ala Pro Ala Ala Trp Glu Val Asn Gln Lys Asn Phe Glu
210 215 220
Gly Glu Gly Gly Leu His Asn Pro Asn Ala Ile Gln Gln Glu Thr Leu
225 230 235 240
Gly Cys Gly Asn Ala Leu Tyr Glu Thr Pro Thr Ala Val Leu Gln Tyr
245 250 255
Arg Arg Glu Leu Ala Ala Gln Glu Gln Gln Thr Phe Arg Phe Ile Phe
260 265 270
Gly Pro Ala Phe Asp Glu Ser Glu Ala Ile Ala Leu Arg Asn Lys Tyr
275 280 285
Leu Ser Ala Glu Gly Phe Ala Lys Ala Lys Ser Glu Tyr Gln Thr Tyr
290 295 300
Ile Thr Ser Gly Lys Gly Cys Leu Gln Ile Asn Thr Pro Asp Pro Glu
305 310 315 320
Leu Asn Asn Phe Val Asn His Trp Leu Pro Arg Gln Val Phe Tyr His
325 330 335
Gly Asp Val Asn Arg Leu Thr Thr Asp Pro Gln Thr Arg Asn Tyr Ile
340 345 350
Gln Asp Asn Met Gly Met Ser Tyr Ile Lys Pro Asn Ile Thr Arg Gln
355 360 365
Ala Phe Leu His Ala Leu Ser Gln Gln Glu Glu Ser Gly Ala Met Pro
370 375 380
Asp Gly Ile Leu Leu Leu Glu Gly Ala Glu Leu Lys Tyr Ile Asn Gln
385 390 395 400
Ile Pro His Thr Asp His Cys Val Trp Leu Pro Val Cys Met Gln Ala
405 410 415
Tyr Leu Asp Glu Thr Asn Asp Tyr Ala Leu Leu Asp Glu Ile Val Pro
420 425 430
Tyr Ala Ser Gly Glu Lys Arg Glu Thr Val Glu Gln His Met His His
435 440 445
Ala Met Arg Trp Leu Leu Gln Ala Arg Asp Glu Arg Gly Leu Ser Phe
450 455 460
Ile Ala Gln Gly Asp Trp Cys Asp Pro Met Asn Met Val Gly Tyr Lys
465 470 475 480
Gly Lys Gly Val Ser Gly Trp Leu Ser Val Ala Thr Ala Tyr Ala Leu
485 490 495
Asn Leu Trp Ala Asp Val Cys Glu Gln Arg Gln Gln Asn Ser Cys Ala
500 505 510
Asn Glu Phe Arg Gln Gly Ala Lys Asp Ile Asn Ala Ala Val Asn Lys
515 520 525
His Ile Trp Asp Gly Glu Trp Phe Gly Arg Gly Ile Thr Asp Asp Gly
530 535 540
Val Leu Phe Gly Thr Ser Lys Asp Lys Glu Gly Arg Ile Phe Leu Asn
545 550 555 560
Pro Gln Ser Trp Ala Ile Leu Gly Gly Ala Ala Asp Glu Gln Lys Ile
565 570 575
Pro Cys Leu Leu Asp Ala Val Glu Gln Gln Leu Glu Thr Pro Tyr Gly
580 585 590
Val Met Met Leu Ala Pro Ala Phe Thr Ala Met Arg Asp Asp Val Gly
595 600 605
Arg Val Thr Gln Lys Phe Pro Gly Ser Ala Glu Asn Gly Ser Val Tyr
610 615 620
Asn His Ala Ala Val Phe Tyr Ile Phe Ser Leu Leu Ser Ile Gly Glu
625 630 635 640
Ser Glu Arg Ala Tyr Lys Leu Leu Arg Gln Met Leu Pro Gly Pro Asp
645 650 655
Glu Ala Asp Leu Leu Gln Arg Gly Gln Leu Pro Val Phe Ile Pro Asn
660 665 670
Tyr Tyr Arg Gly Ala Tyr Tyr Gln His Pro Arg Thr Ala Gly Arg Ser
675 680 685
Ser Gln Leu Phe Asn Thr Gly Thr Val Ser Trp Val Tyr Arg Cys Leu
690 695 700
Ile Glu Gly Val Phe Gly Leu Lys Gly Ser Pro Gln Gly Leu Val Val
705 710 715 720
Gln Pro Gln Leu Pro Val Ala Trp Gln Thr Ala Glu Ala Val Arg Glu
725 730 735
Phe Arg Gly Ala Thr Phe Asn Val Ser Tyr Arg Lys Ser Ser Asp Ile
740 745 750
Lys Glu Met Glu Ile Gln Leu Asn Glu Ser Val Ile Ser Gly Asn Thr
755 760 765
Ile Ser Asp Ile Thr Ala Gly Ala Thr Tyr Gln Leu Thr Val Leu Leu
770 775 780
Pro Ala Thr His
785
<210> 48
<211> 2367
<212> DNA
<213> Microbulbifer degradans
<400> 48
atgttaaaag ccattaacaa cggcgaacgc tatcaactca ctagccctac cgctatgccg 60
caaagcgcat cgtttttatg gaataaaaaa atgatgatac aagtaaattg ccgcggctac 120
gccgttgcgc aatttatgca gccagaacca gccaaatacg cttacgcacc caatctggaa 180
gcaaaaacat ttatgcaacc agagcaaccc tattacgcgc atcaccccgg gcgctttttc 240
tatataaaag atgaagagac aggcgagatt ttttcggcac cctacgagcc tgtgcgcagc 300
cagctgaaca actttagctt taacgcaggc aagagcgata taagctggca tattgccgct 360
ttaggcattg aagtagagct atgtcttagc ctgccggtgg acgatgtagt agaattgtgg 420
gaactaaaaa taaaaaacgg cggcgcgcaa cctcgtaaac tcagtattta cccgtacttt 480
cctgtgggtt acatgtcgtg gatgaatcaa tctggtgact acagccaaac cgccggcggc 540
attattgcca gctgcgtaac gccttatcaa aaagtcgccg actactttaa gaataaagac 600
tttaaagata aaacgttctt tcttcacgaa accgccccag cagcatggga agtaaaccag 660
aaaaacttcg aaggcgaagg cgggttgcac aaccccaacg ccatacaaca agaaacgctg 720
ggctgcggca acgcattgta cgaaacgccc acagcggtat tgcaataccg ccgcgaactt 780
gcagcgcaag agcagcaaac ctttcgcttt atttttggcc cagcatttga cgagagcgaa 840
gccattgcac tgcgcaataa gtatttatct gccgaaggtt ttgccaaagc aaaaagcgaa 900
taccaaacct atataacgag cggcaaaggc tgcttgcaaa ttaacacccc agacccagaa 960
ctaaacaact ttgtaaacca ctggctaccg cgccaagtgt tttatcacgg cgatgtaaac 1020
cggttaacca ccgacccgca aacgcgcaat tatattcaag acaatatggg catgagctac 1080
attaagccca acattacgcg gcaggcgttt ttacatgcct taagccagca ggaagaaagc 1140
ggtgcaatgc ccgacggcat tttattgctt gaaggcgccg agcttaaata cataaaccaa 1200
ataccccata ccgatcactg cgtttggctg ccggtgtgta tgcaagccta tttggatgaa 1260
accaatgact acgccctatt agacgaaata gtaccctatg cgagtggcga gaagcgcgaa 1320
actgttgagc aacatatgca tcacgctatg cgctggcttt tgcaagcacg cgacgaacgc 1380
ggcctaagct ttatcgcaca gggcgactgg tgcgacccca tgaacatggt gggctacaag 1440
ggcaaagggg tatccggctg gctttcagtc gctaccgctt atgcattaaa cctgtgggca 1500
gatgtttgcg aacaacggca gcaaaacagt tgcgccaacg aatttagaca gggcgctaaa 1560
gatataaacg cggcggtaaa caagcatatt tgggatggcg aatggtttgg ccgcggcatt 1620
acagatgacg gcgtactgtt tggcaccagc aaagataaag aaggcagaat ttttctaaac 1680
ccacaaagct gggcaatact tggcggcgcc gccgacgaac aaaaaatccc atgcctgcta 1740
gacgcagtag agcaacaact ggaaacccct tacggcgtaa tgatgctggc ccccgcgttt 1800
accgccatgc gcgatgacgt aggccgagtt acccaaaaat tcccaggctc tgcagaaaac 1860
ggctctgttt ataatcacgc ggcggtgttt tatatattta gcttgttatc cattggcgag 1920
agcgaacgcg catataaact gctacgccaa atgctgcctg ggccagatga agccgatctt 1980
ttacagcgcg gccaactgcc agtattcata cctaactatt atcgcggcgc atactaccag 2040
cacccccgca ccgccggtcg ctctagccag ctctttaata cgggtacagt ctcgtgggtt 2100
taccgctgct taattgaagg ggtattcggc ttgaaaggct cgccacaagg cttagttgta 2160
caaccgcaac tgcctgtcgc ctggcaaaca gcagaagccg ttagggaatt tagaggcgca 2220
acgtttaacg tgagctaccg caaaagcagc gatataaaag aaatggaaat acagctaaat 2280
gaatcggtaa taagtggcaa caccatctcc gacatcaccg ccggcgcgac ctatcaatta 2340
accgttctat tacctgccac acactaa 2367
<210> 49
<211> 574
<212> PRT
<213> Microbulbifer degradans
<400> 49
Met Lys Lys Leu Ile Lys Pro Thr Leu Ser Trp Val Ala Gly Val Ala
1 5 10 15
Leu Ser Leu Gly Ile Ala Gln Gly Ala Gly Ala Gln Asn Val Gln Phe
20 25 30
Val Gly Asn Ile Thr Thr Asn Gly Ser Val Arg Asn Asp Phe Met Asp
35 40 45
Tyr Trp Asp Gln Ile Thr Pro Glu Asn Glu Gly Lys Trp Gly Ser Val
50 55 60
Glu Arg Ser Arg Asp Asn Tyr Ser Trp Ser Gly Gln Asp Ala Ala Tyr
65 70 75 80
Asn Phe Ala Arg Ala Asn Gly Ile Pro Phe Lys Ala His Thr Leu Val
85 90 95
Trp Gly Ser Gln Tyr Pro Ser Trp Ile Asn Asn Leu Ser Asn Ala Glu
100 105 110
Lys Ala Ala Glu Ile Glu Glu Trp Ile Arg Asp Tyr Cys Asn Arg Tyr
115 120 125
Pro Ala Thr Asp Ile Ile Asp Val Val Asn Glu Ala Thr Pro Gly His
130 135 140
Ala Pro Ala Asn Tyr Ala Arg Asp Ala Phe Gly Asp Asn Trp Ile Ile
145 150 155 160
Lys Ser Phe Gln Leu Ala Arg Gln Tyr Cys Pro Asn Ala Thr Leu Val
165 170 175
Leu Asn Asp Tyr Asn Val Leu Ile Trp Asn Thr Asn Asp Phe Ile Ala
180 185 190
Met Ala Gln Pro Val Ile Asn Ala Gly Val Val Asp Ala Leu Gly Leu
195 200 205
Gln Ala His Gly Leu Glu Ser Leu Ser Ala Ser Gln Leu Lys Ser Thr
210 215 220
Leu Asp Arg Ile Ala Asn Leu Gly Leu Pro Ile Tyr Ile Ser Glu Tyr
225 230 235 240
Asp Val Arg Ser Thr Asn Asp Gln Glu Gln Leu Arg Ile Met Arg Asp
245 250 255
Gln Phe Pro Val Phe Tyr Asn His Pro Ser Val Arg Gly Ile Thr Leu
260 265 270
Trp Gly Tyr Met Val Gly Ala Thr Trp Arg Glu Gly Thr Gly Leu Ile
275 280 285
Arg Ala Asp Gly Ser His Arg Pro Ala Met Thr Trp Leu Met Asn Tyr
290 295 300
Leu Glu Asn Asn Arg Gly Gly Ser Thr Ser Ser Ser Ser Ser Ser Ser
305 310 315 320
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Ser Ser Ser Gly
325 330 335
Gly Pro Ser Ser Leu Thr Val Glu Leu Glu Ser Leu Ser Asp Ser Ser
340 345 350
Asn Phe Ser Pro Phe Ser Val Gln Ser Asp Ser Ser Ala Ala Gly Gly
355 360 365
Gln Tyr Val Val Trp Pro Asn Asn Gly Asn Gln Ile Val Ser Ser Pro
370 375 380
Ser Asp Ser Ala Ser Gly Gln Ile Gln Val His Phe Thr Leu Ser Gln
385 390 395 400
Ser Ala Asp Val Gln Phe Gln Ile Arg Ala Asp Leu Ala Asn Gly Asn
405 410 415
Asp Asp Ser Phe Tyr Tyr Lys Leu Asp Ser Gly Ser Trp Asn Thr Gln
420 425 430
Asn Asn Ala Ser Thr Ser Gly Trp Gly Thr Leu Thr Pro Ala Thr Phe
435 440 445
Ser Asn Val Ser Thr Gly Ser His Thr Leu His Ile Leu Arg Arg Glu
450 455 460
Asp Gly Ala Lys Leu Asp Lys Val Thr Leu Asn Ala Ser Val Gly Gln
465 470 475 480
Val Ser Ala Ser Thr Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
485 490 495
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Ala Ala Val Ala
500 505 510
Ser Cys Asp Gly Val Asn Glu Tyr Pro Ser Trp Thr Ala Lys Asp Trp
515 520 525
Ser Gly Gly Asp Tyr Asn His Ala Asn Ser Gly Asp Tyr Met Ser Tyr
530 535 540
Gln Gly Val Leu Tyr Arg Ala Asn Trp Tyr Thr Ala Thr Val Pro Gly
545 550 555 560
Ser Asp Ser Ser Trp Thr Arg Val Gly Asp Cys Asn Phe Val
565 570
<210> 50
<211> 1725
<212> DNA
<213> Microbulbifer degradans
<400> 50
atgaagaagt taattaagcc tacgctatcg tgggttgcag gagttgcttt gtcgctgggt 60
attgcccagg gagctggtgc tcaaaatgtg cagtttgttg gtaatatcac taccaatggt 120
agtgtgcgca acgacttcat ggactactgg gatcagatta ccccagagaa tgaaggcaag 180
tggggctcgg tggagcgcag tcgcgacaac tattcatgga gcggccaaga tgccgcctac 240
aattttgccc gtgccaacgg catcccattt aaagcacata ctttagtatg gggcagtcaa 300
tatcccagct ggataaacaa tttaagtaac gcggaaaaag ccgctgagat tgaagagtgg 360
attcgcgatt actgtaaccg ttacccagcc accgatatta tcgatgttgt caatgaagca 420
acgccgggcc acgcgccagc aaattatgct cgcgatgcat ttggcgacaa ctggataatc 480
aagtccttcc agctggcacg tcagtactgc cccaatgcca cgttagtgtt gaacgactac 540
aacgtactta tttggaacac caatgatttt atagcgatgg cccagccggt aattaacgcc 600
ggagtagtag atgctttggg tttgcaggcc cacggtctgg agagcctttc tgcgtcgcaa 660
ttaaaatcga ctctggatcg tatcgccaat ttgggtttgc caatttatat ctctgaatac 720
gatgttcgca gcaccaatga tcaggagcag ctgcgtatta tgcgtgatca attccctgta 780
ttttacaacc acccaagtgt acgtggcata actttgtggg gttatatggt gggggccacc 840
tggcgagaag gcacaggttt gattcgtgct gatggctccc atcgtccagc gatgacctgg 900
ttgatgaact atctggagaa caatcgtggc ggctcaacct cttcaagtag ttcatcctcc 960
tctagcagtt cgtcttccag tagttcttct tcgggaagtt cctctggtgg cccaagtagt 1020
ttgacggtag agctagaatc tttgtcggat agcagtaact tttcgccatt ctcggtacag 1080
agtgacagca gcgcagcggg cggccagtac gtggtatggc ctaacaacgg caatcagatt 1140
gtaagctcac cctccgatag cgccagcggg caaattcagg tgcactttac cctgtcgcaa 1200
tcggcggatg tgcaatttca gattcgtgca gacctagcta acggcaatga cgactctttt 1260
tattacaagc tggactcagg ctcttggaat actcagaaca acgcttccac gtctggttgg 1320
ggcaccttaa ccccagcaac tttctctaat gtatccacag gatcccatac cttacacatt 1380
ctccgcagag aagatggggc gaaactcgat aaggtaactc tgaatgcttc agttggtcag 1440
gtttccgcta gtacaggcag tagctccagc tcttccagca gctccagttc atccagcagt 1500
tctagttctt caagcagcag cggcgcggca gtcgcaagtt gtgacggtgt taatgaatac 1560
cccagctgga cagcaaaaga ttggtctggg ggtgactata accacgccaa tagcggtgac 1620
tacatgagct atcagggtgt tctatatcga gcaaactggt acaccgcaac tgttcctgga 1680
agtgattctt cctggactcg agttggcgat tgcaattttg tgtaa 1725
<210> 51
<211> 619
<212> PRT
<213> Microbulbifer degradans
<400> 51
Met Ile Lys Leu Arg Gln Ser Ile His Gly Ala Leu Ala Arg Thr Val
1 5 10 15
Gly Ile Ile Ser Ile Ser Thr Gly Leu Val Leu Ala Ala Gln Thr Ala
20 25 30
Ser Ala Ala Cys Glu Tyr Thr Val Thr Asn Ser Trp Gly Ser Gly Phe
35 40 45
Thr Ala Ser Ile Arg Ile Thr Asn Asp Thr Gly Ser Ala Val Asn Gly
50 55 60
Trp Ala Val Asn Trp Gln Tyr Ala Asn Gly Asn Arg Val Thr Asn Ser
65 70 75 80
Trp Asn Ala Thr Leu Ser Gly Asn Asn Pro Tyr Ser Ala Ser Asn Ile
85 90 95
Gly Trp Asn Gly Gly Ile Gln Pro Gly Gln Ser Val Glu Phe Gly Phe
100 105 110
Gln Gly Thr Ala Asn Gly Ala Ala Glu Thr Pro Ala Val Thr Gly Ala
115 120 125
Val Cys Ala Thr Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
130 135 140
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
145 150 155 160
Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly
165 170 175
Ala Asn Cys Val Glu Met Cys Lys Trp Tyr Gln Asp Ala Pro Arg Pro
180 185 190
Leu Cys Asn Asn Gln Asn Ser Gly Trp Gly Trp Glu Asn Asn Gln Ser
195 200 205
Cys Ile Gly Arg Ala Thr Cys Glu Ser Gln Pro Ser Asn Ala Gly Gly
210 215 220
Val Val Asn Ser Cys Pro Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
225 230 235 240
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
245 250 255
Ser Thr Ser Ser Ser Ser Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser
260 265 270
Ser Ser Ser Ser Gly Ser Ala Ala Asn Leu Tyr Thr Leu Ala Asp Phe
275 280 285
Pro Ile Gly Val Ala Val Thr Ala Gly Asn Glu Ser Arg Ser Phe Leu
290 295 300
Ser Ile Ala Ala Lys Glu Ala Thr Val Lys Lys His Phe Asp Gln Ile
305 310 315 320
Thr Ala Gly Asn Ile Met Lys Met Ser Tyr Leu His Pro Ser Glu Asn
325 330 335
Ser Tyr Thr Phe Ser Gln Ala Asp Ala Met Val Asn Trp Ala Asn Ser
340 345 350
Asn Gly Val Ser Val His Gly His Thr Phe Ile Trp His Ser Asp Tyr
355 360 365
Gln Val Pro Asn Trp Met Asn Asn Tyr Ser Gly Asn Phe Ala Ser Met
370 375 380
Met Asp Thr His Val Thr Thr Ile Ala Asp His Phe Glu Gly Arg Val
385 390 395 400
Val Ser Trp Asp Val Val Asn Glu Ala Ile Asp Glu Ser Gln Ser Ser
405 410 415
Cys Tyr Arg Asn Ser Leu Phe Tyr Gln Arg Leu Gly Lys Ala Tyr Ile
420 425 430
Ala Asn Ala Phe Arg Ala Ala Arg Ala Ala Asp Pro Ser Val Glu Leu
435 440 445
Tyr Tyr Asn Asp Tyr Asp Thr Glu Gly Gly Asn Ala Asn Lys Leu Asn
450 455 460
Cys Leu Leu Gln Leu Val Asp Asp Leu Gln Ala Asn Asn Val Pro Ile
465 470 475 480
Asp Gly Val Gly Phe Gln Met His Val Gln Ile Asp Trp Pro Ser Thr
485 490 495
Ser Asn Ile Ala Ala Ala Phe Gln Ala Ile Val Asp Arg Gly Leu Lys
500 505 510
Val Lys Ile Thr Glu Leu Asp Val Pro Ile Asn Asn Pro Tyr Gly Ser
515 520 525
Gly Ser Phe Pro Gln Tyr Ser Thr Tyr Thr Ser Gln Ala Ala Ala Leu
530 535 540
Gln Lys Ala Arg Tyr Lys Ser Ile Val Lys Thr Tyr Leu Thr Val Val
545 550 555 560
Pro Ala His Leu Arg Gly Gly Leu Thr Val Trp Gly Ile Trp Asp Gly
565 570 575
Asp Ser Trp Leu Leu Asp Phe Asp Asn Arg Gln Gly Ala Asp Asp Trp
580 585 590
Pro Leu Leu Phe Ser Gly Pro Ala Asn Gly Pro Tyr Val Glu Lys Glu
595 600 605
Ala Phe Tyr Gly Val Ala Glu Ala Leu Thr Glu
610 615
<210> 52
<211> 50
<212> PRT
<213> Microbulbifer degradans
<400> 52
Met Leu Asp Glu Leu Leu Asp Glu Leu Glu Leu Leu Leu Glu Leu Glu
1 5 10 15
Leu Leu Asp Glu Glu Leu Glu Leu Glu Leu Glu Glu Leu Asp Pro Val
20 25 30
Ala His Thr Ala Pro Val Thr Ala Gly Val Ser Ala Ala Pro Leu Ala
35 40 45
Val Pro
50
<210> 53
<211> 103
<212> PRT
<213> Microbulbifer degradans
<400> 53
Met Gly Lys Ser Ala Lys Val Tyr Lys Phe Ala Ala Glu Pro Leu Glu
1 5 10 15
Leu Glu Leu Leu Glu Glu Val Glu Leu Val Leu Asp Glu Leu Glu Leu
20 25 30
Val Asp Glu Leu Glu Leu Leu Asp Glu Leu Glu Leu Leu Asp Glu Leu
35 40 45
Glu Leu Glu Glu Leu Glu Glu Leu Leu Glu Leu Asp Gly Gln Leu Phe
50 55 60
Thr Thr Pro Pro Ala Leu Asp Gly Cys Asp Ser His Val Ala Arg Pro
65 70 75 80
Ile Gln Leu Trp Leu Phe Ser Gln Pro Gln Pro Leu Phe Trp Leu Leu
85 90 95
His Lys Gly Arg Gly Ala Ser
100
<210> 54
<211> 1860
<212> DNA
<213> Microbulbifer degradans
<400> 54
atgatcaagc tacgtcaatc tatccacggc gccttggcgc gtaccgtggg cataataagt 60
ataagcaccg gacttgtact cgcagcgcaa actgcaagtg cagcctgtga atacaccgta 120
accaattcgt ggggttcggg ttttaccgcg agtattcgca taacaaacga taccggtagc 180
gcagtaaacg gttgggcggt taactggcaa tacgctaatg gcaaccgtgt aacaaattca 240
tggaacgcta cgctgtctgg caataaccct tatagcgcca gcaatattgg ttggaacggc 300
ggtattcaac ctgggcagtc ggtggaattt ggttttcaag gcacggctaa tggcgcggca 360
gaaacaccag cggtaacggg ggctgtatgt gctacagggt ctagctcttc cagctcaagc 420
tctagttctt cgtctagcag ctctagttct agcagcagtt cgagttcatc gagtagctcg 480
tctagcactt caagctcgtc atctagcagc tcttccagtt catcgggcgc aaactgtgta 540
gaaatgtgta agtggtatca agatgcgcct cgccctttat gcaataacca aaatagtggt 600
tggggttggg aaaacaacca aagttgtatc ggccgagcaa cgtgcgaatc gcaaccgtct 660
aatgcgggtg gggtggtaaa tagttgtccg tctagttcta gcagctcttc aagttcctct 720
agttcgagct cgtctagcag ttcaagctcg tctagcagtt caagctcgtc tactagttct 780
agttcatcaa gcacaagttc tacttcttca agtagttcta gctctagcgg ttctgctgca 840
aacttatata ccttggcaga tttccccatt ggcgttgctg taactgcggg taatgagagc 900
cgtagctttt tatctattgc tgcgaaagag gcaactgtta aaaaacactt cgaccaaatt 960
acagccggta acattatgaa gatgagttac ttgcacccat ccgaaaatag ctacaccttt 1020
agtcaagcgg atgccatggt taactgggca aatagcaacg gcgtaagtgt gcacggccat 1080
acttttattt ggcattccga ttaccaagta ccaaattgga tgaataatta cagcggtaat 1140
tttgcgtcta tgatggatac ccacgtaacc actattgccg atcattttga aggccgagta 1200
gtaagctggg atgtggtaaa cgaagctatc gatgagagcc aatctagttg ttatcgcaac 1260
tctttgtttt accagcgttt aggtaaagct tatattgcca atgcgttccg cgcggcccga 1320
gcagcagacc ctagcgtaga gttgtattac aacgattacg ataccgaagg tggcaatgcc 1380
aataagttaa attgcttgtt gcaattagtc gatgacttgc aagcgaacaa tgtgcctatc 1440
gatggtgtgg gctttcaaat gcacgtgcaa attgattggc ccagcaccag caatattgct 1500
gcggctttcc aagctattgt ggatcgcggc ttaaaggtaa aaattactga gctggatgtg 1560
cctattaata acccttatgg cagtggttca ttcccgcaat attcaactta cacgtcacaa 1620
gccgctgcgt tgcaaaaggc gcgttataaa tccattgtaa aaacctactt gactgttgtg 1680
ccagcgcatt tgcgcggggg cttaaccgta tggggtatat gggatggtga tagctggttg 1740
ttagattttg ataatcgtca aggcgctgat gattggccgc tattatttag tggcccagct 1800
aatggcccct atgtagaaaa agaagcattc tatggcgtgg cagaggcgct tacagaatag 1860
<210> 55
<211> 470
<212> PRT
<213> Microbulbifer degradans
<220>
<221> MOD_RES
<222> (301)..(457)
<223> Variable amino acid
<400> 55
Met Asn Cys Thr Arg Arg Asn Ile Val Lys Ala Gly Leu Leu Gly Ser
1 5 10 15
Ala Phe Val Ala Leu Pro Ala Val Ala Arg Ala Leu Pro Gly Leu Ala
20 25 30
Thr Lys Phe Arg Asp Gln Phe Tyr Val Gly Thr Ala Val Ser Ala Arg
35 40 45
Ser Leu Asn Thr Pro Ser Gly Ala Phe Ala Ala Thr Val Ala His Gln
50 55 60
Phe Asn Ala Leu Thr Ala Glu Asn Ala Met Lys Pro Ala Leu Leu Gln
65 70 75 80
Pro Gln Met Gly Glu Trp Arg Trp Gln Asp Ala Asp Ala Ile Val Arg
85 90 95
Phe Ala Glu Gln His Gln Met Leu Met His Gly His Thr Leu Val Trp
100 105 110
His Ser Gln Thr Pro Asp Trp Phe Phe Gln Asn Lys Gln Gly Glu Pro
115 120 125
Ala Asp Lys Ala Thr Leu Tyr Arg Arg Gln Glu Glu Tyr Ile Asn Ala
130 135 140
Val Val Gly Arg Tyr Lys Gly Arg Val His Ser Trp Asp Val Val Asn
145 150 155 160
Glu Ala Glu Asp Glu Gly Lys Gly Trp Arg Lys Ser His Trp Tyr Asn
165 170 175
Ile Cys Gly Pro Glu Phe Met Glu Arg Ala Phe Arg Leu Ala His Ala
180 185 190
Ala Asp Pro Lys Ala His Leu Cys Tyr Asn Asp Tyr Asn Met His Leu
195 200 205
Pro Gln Lys Arg Glu Phe Leu Val Lys Leu Phe Lys Asp Tyr Ile Lys
210 215 220
Arg Gly Val Pro Ile His Gly Val Gly Leu Gln Gly His Val Gly Leu
225 230 235 240
Asp Tyr Pro Ser Leu Asp Glu Leu Glu Lys Thr Ile Val Ala Met Ala
245 250 255
Asp Leu Gly Leu Lys Val His Ile Thr Glu Leu Asp Val Asp Val Leu
260 265 270
Pro Ala Pro Trp Gln Leu Ala Ser Ala Asp Ile Ser Thr Lys Phe Glu
275 280 285
Tyr Asp Lys Ser Leu Asn Pro Tyr Val Asp Gly Leu Xaa Xaa Xaa Xaa
290 295 300
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
305 310 315 320
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
325 330 335
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
340 345 350
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
355 360 365
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
370 375 380
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
385 390 395 400
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
405 410 415
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
420 425 430
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
435 440 445
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Met Tyr Lys Met Arg Leu
450 455 460
Pro Ala Lys Ile Leu Ala
465 470
<210> 56
<211> 1413
<212> DNA
<213> Microbulbifer degradans
<220>
<221> modified_base
<222> (902)..(1369)
<223> a, c, g, or t
<400> 56
gtgaattgta cgcgtaggaa tatagtaaaa gcaggccttc ttggctcggc attcgtcgcc 60
ctgcctgccg tggcgcgcgc gctgcctgga ttggccacga aatttcgcga tcagttttac 120
gtgggcactg cggttagtgc gcgctcactt aatacgccca gcggcgcgtt tgcagccact 180
gtcgcgcatc aattcaatgc actaaccgct gaaaacgcca tgaagcccgc cttacttcaa 240
ccacaaatgg gggagtggcg ctggcaggat gccgatgcca ttgtgagatt tgccgagcag 300
catcagatgc taatgcatgg tcacaccctt gtgtggcatt cgcaaacgcc agattggttc 360
ttccaaaaca agcagggcga accggcagac aaagcaaccc tataccgcag gcaagaggag 420
tatatcaatg ccgtagttgg gcgctataaa gggcgggtac actcgtggga tgtggtgaat 480
gaagcagaag atgagggtaa aggctggcgc aagagccact ggtataacat ttgtgggcca 540
gagtttatgg aacgagcctt tcgcttagct cacgcagcgg acccaaaagc acacttatgt 600
tacaacgatt acaatatgca cttgccgcaa aagcgcgaat ttttggttaa gttattcaaa 660
gactacatta agcgcggcgt gcctattcac ggcgtagggt tgcaggggca tgtgggctta 720
gactacccct cgctggacga gttggaaaaa accatcgtgg ccatggccga tttaggtcta 780
aaagtacaca ttacagaatt ggatgtagat gtattacccg cgccatggca actagctagc 840
gcagatataa gtactaaatt cgagtacgac aaaagcttaa acccgtacgt tgatggtttg 900
cnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 960
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1020
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1140
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1200
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1260
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1320
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnt gactatgtac 1380
aaaatgcgct tgcccgcgaa aatattagct taa 1413
<210> 57
<211> 1186
<212> PRT
<213> Microbulbifer degradans
<400> 57
Met Leu Arg Ser Thr Gln Ser Thr Pro Ile Val Lys Arg Lys Ile Ser
1 5 10 15
Ala Tyr Val Gly Trp Gly Leu Cys Val Leu Leu Ser Val Cys Thr Ala
20 25 30
Ser Ile Ser Trp Ala Gly Asn Pro Ile Val Ser His Val Tyr Thr Ala
35 40 45
Asp Pro Ala Ala Arg Val Ile Asn Gly Arg Ala Tyr Val Met Val Thr
50 55 60
His Asp Gln Asp Asn Gln Asn Asp Tyr Gly Gly Leu Ile Asp Tyr Tyr
65 70 75 80
Leu Phe Ser Ser Asp Asp Met Val Asn Trp Gln Asp His Gly Ile Val
85 90 95
Trp Asn Ser Arg Thr Asp Ser Ser Trp Ala Ser Leu Ala Tyr Ala Pro
100 105 110
Asp Phe Ile Glu Arg Asn Gly Lys Tyr Tyr Leu Tyr Phe Pro Asn Gly
115 120 125
Ala Asn Ser Ile Gly Val Ala Val Ala Asp Ser Pro Glu Gly Pro Tyr
130 135 140
Thr Asp Pro Leu Gly Arg Pro Leu Val Asp Arg Asn Thr Pro Asn Ala
145 150 155 160
Asn Val Asp Trp Leu Phe Asp Pro Gly Val Phe Ile Asp Asp Asp Gly
165 170 175
Gln Ala Phe Leu Tyr Phe Gly Gly Gly Ala Asp Gly Thr Ala Arg Val
180 185 190
Ile Arg Leu Asn Asn Asp Met Ile Ser Thr Ser Gly Ala Ala Ile Ser
195 200 205
Ile Asp Val Pro Asn Phe Phe Glu Ala Leu Tyr Met His Lys Arg Asn
210 215 220
Gly Ile Tyr Tyr Leu Ser Tyr Ser Thr Asn Pro Ser Ala Gly Met Ser
225 230 235 240
Ile Asp Tyr Met Thr Ser Asn Asn Pro Thr Ser Gly Phe Thr His Arg
245 250 255
Gly Thr Ile Leu Pro Asn Pro Trp Glu Asn Asn Ser Asn Asn Asn His
260 265 270
Gln Ser Ile Ile Glu Phe Asn Asn Glu Trp Tyr Ile Phe Tyr His Asn
275 280 285
Arg Ala Val Ala Asn Thr Arg Gly Asp Ser Thr Phe Ser Arg Ser Ile
290 295 300
Asn Val Asp Arg Leu Tyr Tyr Asn Ser Asp Gly Ser Ile Arg Glu Val
305 310 315 320
Asn Ala Ser Ser Ile Gly Val Pro Ala Val Arg Asn Val Asn Ala Phe
325 330 335
Ser Ile Asn Gln Ala Glu Thr Phe Asp Gln Glu Gly Gly Ile Glu Thr
340 345 350
Glu Pro Ser Ser Glu Gly Thr Leu Asn Ile Gln Met Gly Pro Gly Asp
355 360 365
Trp Val Lys Val Ala Asn Val Asp Phe Gly Asn Gly Ala Thr Gln Phe
370 375 380
Asn Ala Arg Val Ala Ser Ala Ile Asp Asn Ser Lys Leu Glu Ile Ile
385 390 395 400
Leu Gly Ser Leu Ser Asn Thr Pro His Ala Ser Leu Glu Ile Thr Asn
405 410 415
Thr Gly Gly Trp Gln Asn Trp Gln Thr Gln Ser Thr Ser Phe Asn Ala
420 425 430
Ile Thr Gly Val His Asp Val Tyr Leu Arg Gly Thr Ser Gly His Asn
435 440 445
Leu Asn Trp Phe Glu Phe Glu Gly Glu Asn Asn Gly Gly Ser Ser Gln
450 455 460
Leu Thr Val Glu Leu Glu Asp Leu Ala Ser Gln Ser Leu Phe Ala Pro
465 470 475 480
Leu Ser Val Arg Ser Asp Asn Met Ala Asn Asn Gly Ala Tyr Ile Glu
485 490 495
Trp Ser Asn Asp Gly Ser Asn Gln Ile Leu Ser Val Ala Ser Glu Gln
500 505 510
Ser Gln Gly Gln Ile Ser Val Pro Phe Thr Leu Ser Gln Ala Ser Asp
515 520 525
Val Glu Phe Asn Val Arg Val Asn Leu Ala Asn Gly Asn Asp Asp Ser
530 535 540
Phe Tyr Tyr Lys Leu Asn Ser Asn Ser Trp Gln Thr Phe Asn Asn Gln
545 550 555 560
Ala Thr Thr Gly Trp Gln Val Leu Thr Pro Asn Thr Phe Thr Gly Leu
565 570 575
Ser Pro Gly Asn His Ile Leu Thr Leu Leu Arg Arg Glu Asp Gly Ala
580 585 590
Lys Leu Asp Thr Leu Thr Leu Val Ala Ser Ala Gly Ser Ile Gln Thr
595 600 605
Asn Asn Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Thr Thr Ser
610 615 620
Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Gly Ala Ala Pro Thr Gly
625 630 635 640
Asn Val Thr Tyr Ser Ile Asn Val Thr Asn Asp Trp Gln Ser Gly Tyr
645 650 655
Cys Ala Glu Leu Thr Val Thr Asn Asn Thr Asn Asn Ala Leu Gln Trp
660 665 670
Gln Ala Ser Val Ser Met Ser Asp Ser Val Asp Ser Met Trp Asn Ala
675 680 685
Ser Trp Ser Gln Ser Gly Asn Ile Leu Asn Val Ser Gly Val Glu Trp
690 695 700
Asn Asn Thr Leu Gln Ala Gly Gln Ser Gln Ser Gly Ile Gly Phe Cys
705 710 715 720
Ala Thr Arg Ala Ser Ser Ser Ser Ser Ser Ser Thr Thr Ser Ser Thr
725 730 735
Ser Gly Ser Thr Ser Ser Ser Ser Ser Ser Gly Gly Tyr Thr Val Pro
740 745 750
Ser Asn Asn Phe Ala Val Asn Gly Gly Val Glu Asn Asn Leu Gln Ser
755 760 765
Trp Gly Ala Thr Ala Gly Ser Val Thr Arg Ser Thr Glu Gln Arg Tyr
770 775 780
Ser Gly Asn Ala Ser Ala Arg Ile Thr Asn Arg Ala Glu Asn Trp His
785 790 795 800
Gly Leu Thr Phe Ser Val Gly Glu Leu Thr Gln Gly Asn Leu Tyr Glu
805 810 815
Val Ala Val Trp Val Lys Leu Ala Ala Gly Ser Ala Asp Thr Pro Ile
820 825 830
Thr Leu Thr Ala Lys Arg Gln Asn Asp Ser Asp Asp Ser Thr Tyr Asn
835 840 845
Glu Tyr Thr Gly Ile Val Thr Thr Ile Ala Asn Asp Ser Glu Trp Val
850 855 860
Leu Leu His Gly Gln Tyr Thr Gln Thr Gly Thr Ala Phe Glu His Phe
865 870 875 880
Ile Ile Glu Ser Glu Ser Asp Ser Val Ser Phe Tyr Ala Asp Glu Phe
885 890 895
Ser Ile Gly Gly Glu Val Thr Pro Lys Asn Glu Val Gly Phe Phe Val
900 905 910
Gly Asn Ile Thr Thr Asn Gly Asn Val Arg Asn Asp Phe Thr Gln Tyr
915 920 925
Trp Asp Gln Leu Thr Pro Glu Asn Glu Gly Lys Trp Gly Ser Val Glu
930 935 940
Arg Thr Arg Asp Val Tyr Asp Trp Ser Gly Leu Asp Arg Ala Tyr Asn
945 950 955 960
Tyr Ala Lys Gln Asn Asn Ile Pro Phe Lys Gln His Thr Met Val Trp
965 970 975
Gly Ser Gln Gln Pro Asn Trp Ile Asp Ser Leu Ser Pro Ala Glu Gln
980 985 990
Ala Ala Glu Ile Glu Glu Trp Ile Arg Asp Tyr Cys Ala Arg Tyr Pro
995 1000 1005
Asp Thr Glu Met Ile Asp Val Val Asn Glu Ala Thr Leu Gly His Ala
1010 1015 1020
Pro Ala Asn Tyr Ala Ala Ser Ala Phe Gly Asn Asn Trp Ile Ile Arg
1025 1030 1035 1040
Ser Phe Glu Leu Thr Arg Gln Tyr Cys Pro Asn Ser Ile Leu Ile Leu
1045 1050 1055
Asn Asp Tyr Asn Val Leu Ser Trp Asn Thr Gln Glu Phe Ile Gln Met
1060 1065 1070
Ala Thr Pro Ala Val Asn Ala Gly Val Val Asp Ala Ile Gly Leu Gln
1075 1080 1085
Ala His Gly Leu Ala Asp Trp Ser Leu Ser Asp Leu Glu Thr Lys Leu
1090 1095 1100
Asn Gln Val Ala Ala Leu Gly Leu Pro Ile Tyr Ile Ser Glu Tyr Asp
1105 1110 1115 1120
Ile Glu Lys Thr Asn Asp Gln Glu Gln Leu Arg Val Met Gln Thr Gln
1125 1130 1135
Phe Pro Leu Phe Tyr Asn His Pro Ser Val Lys Gly Ile Thr Ile Trp
1140 1145 1150
Gly Tyr Val Val Gly Ala Thr Trp Arg Asp Gly Thr Gly Leu Leu His
1155 1160 1165
Ser Asn Gly Thr Pro Arg Pro Ala Leu Thr Trp Leu Met Asp Tyr Leu
1170 1175 1180
Asn Arg
1185
<210> 58
<211> 3561
<212> DNA
<213> Microbulbifer degradans
<400> 58
atgttgcgaa gcacccaatc aacacccatt gttaagcgaa agatttctgc ctatgtaggt 60
tggggtctgt gcgtgttact tagcgtctgc acggcctcga tctcttgggc aggtaaccct 120
attgtgtctc atgtatatac cgcagaccct gctgcacggg taataaacgg aagagcctat 180
gtaatggtta cccacgatca ggataaccaa aatgattacg gtggtttgat tgattactac 240
ctgttctcat cggacgatat ggttaattgg caagatcacg gtattgtgtg gaattctcga 300
acagacagta gttgggccag tcttgcttac gccccagatt ttatcgagcg caatggaaag 360
tactacctgt actttcccaa cggcgcaaac tctattggtg tcgctgtggc cgatagccct 420
gagggcccct atactgatcc actcggtagg ccgctggttg accgcaatac ccccaatgcc 480
aatgttgact ggctgttcga tcccggtgta tttattgatg acgacggaca agcctttttg 540
tactttggtg gaggcgctga tggaaccgcg cgcgttattc gtttaaataa cgacatgata 600
agtaccagtg gtgcagccat aagtattgac gtacctaact tctttgaagc gctatacatg 660
cataagcgca acggcattta ctacttatcc tactcgacca accccagcgc ggggatgagc 720
atagattaca tgacgagtaa taaccctacc tcagggttca cccatcgcgg caccattttg 780
cccaaccctt gggaaaataa ttccaataac aaccaccagt caattattga atttaataac 840
gaatggtaca ttttttacca caatagagct gtcgcaaata cgcggggcga tagtaccttt 900
tcccgctcta ttaacgtgga tcgtctttac tacaattccg acggcagtat tcgagaagta 960
aatgccagtt caataggtgt acccgcggta cgtaatgtta atgctttttc cataaaccaa 1020
gcagaaacat tcgatcaaga aggtggcata gaaactgagc cgtcttctga aggtaccttg 1080
aatattcaga tgggcccagg agattgggta aaagttgcta acgtcgattt tggtaacggc 1140
gccacacaat ttaacgctcg agttgctagc gcaatcgata attcaaagct ggaaattatt 1200
ttaggcagtc tcagtaatac cccgcatgcc tcgctcgaaa ttaccaacac aggcgggtgg 1260
caaaattggc aaacacaaag cacaagtttt aatgcaataa ctggtgttca cgatgtatac 1320
ctgcgcggta cttctgggca caacctaaat tggtttgaat ttgaaggcga aaataatgga 1380
ggaagcagtc agctaacggt tgagttggaa gacttggctt cgcaatctct ttttgctccc 1440
cttagcgtac gctccgataa catggctaat aacggcgctt acattgaatg gagtaatgat 1500
gggagcaatc agattctcag tgtggccagc gagcaatcgc aaggccaaat cagtgtccca 1560
tttactctat cgcaagcttc cgatgtcgaa tttaacgtac gcgtgaatct tgctaatggc 1620
aatgatgatt cgttttatta caagctaaac agtaatagct ggcagacttt taataatcaa 1680
gctaccactg gttggcaggt gctcacgccc aacaccttca ctggtcttag ccctgggaat 1740
cacattctta ctctacttcg gcgtgaagat ggcgccaaat tagataccct cacgttggta 1800
gcctccgcgg gcagtattca aaccaataac agctcatcaa gttctacctc cagcagtagt 1860
tcaacgacta gctcaagttc aaccagctcg agtagttcct ctggcgccgc gccaactggt 1920
aacgttactt actctataaa tgttactaac gactggcaaa gtggttattg tgcggagctt 1980
accgttacca ataacacgaa caacgctctg cagtggcaag ctagtgtttc tatgagcgat 2040
agtgtcgaca gtatgtggaa tgctagctgg tcgcagagcg ggaacatact taacgtaagc 2100
ggggtagagt ggaataatac gttgcaagca gggcaaagcc agagtggcat aggattttgt 2160
gctacacgtg ccagctcgtc ttcctccagc tctacaacaa gttctacttc cggttccaca 2220
tcaagctcta gttcatcggg aggctatacc gttccgagta ataatttcgc agtcaatggt 2280
ggtgtagaaa acaacctgca gagctggggc gcaacggcgg gttcagtaac acgttctact 2340
gaacaacgtt atagcggaaa cgcaagcgcg cgtataacaa atcgagcaga aaactggcac 2400
ggtttgacgt tcagtgttgg tgagcttacg caaggcaacc tgtacgaagt tgcggtgtgg 2460
gtaaaacttg cggcaggcag tgcggacaca cctattacgc ttaccgccaa acgacaaaat 2520
gatagcgacg attccactta taacgaatat accggcatag tcacgaccat tgctaacgat 2580
tctgaatggg tgctgctgca cgggcaatac actcaaactg gcacagcgtt tgagcatttt 2640
attatcgagt cagaaagcga tagcgtaagt ttttatgccg atgagttttc tattggtgga 2700
gaggtcacgc ccaaaaacga agtgggattt tttgtgggta acattaccac taatggcaat 2760
gtgcgcaatg attttactca gtactgggat caactaacac cagaaaatga aggaaagtgg 2820
ggttcggtag aacgcactcg tgatgtgtat gattggagtg gactagacag agcctataac 2880
tacgccaaac aaaataatat tccgtttaaa cagcatacta tggtgtgggg tagccaacag 2940
cccaactgga ttgattcgct cagcccagca gaacaggctg cagagataga ggagtggata 3000
agagattatt gtgcgcgcta tcctgatact gaaatgattg atgtggtaaa cgaagcaacg 3060
ctgggccatg ctcctgctaa ctacgcggcg agtgcgtttg gcaataattg gatcattcgt 3120
tcgttcgagc ttactcgtca atattgtcct aacagcattt taatattgaa cgattacaat 3180
gttttaagtt ggaacactca agagtttatc cagatggcta ctccggctgt caatgcaggc 3240
gttgtagatg caattggatt acaagcacac ggcttagcgg attggtcttt aagtgattta 3300
gaaaccaaac taaaccaggt tgcggcattg ggtttaccca tttatatatc cgaatacgat 3360
atagaaaaaa ctaacgacca agaacagctg cgcgtaatgc aaactcagtt cccgctgttt 3420
tataaccatc catcggtgaa aggcattact atttgggggt atgttgttgg ggctacttgg 3480
cgcgatggga cgggattgtt gcacagtaac ggaacaccca gaccggcact tacttggtta 3540
atggattact tgaatagata g 3561
<210> 59
<211> 670
<212> PRT
<213> Microbulbifer degradans
<400> 59
Met Val Ile Ile Thr Met Lys Ala Gly Leu Leu Leu Arg Ile Leu Leu
1 5 10 15
Thr Val Leu Ala Leu Asn Met Leu Ala Ala Cys Gly Gly Ser Ser Ser
20 25 30
Asn Thr Lys Glu Pro Val Thr Gln Pro Glu Pro Glu Pro Glu Gln Gln
35 40 45
Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu
50 55 60
Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Met Glu Pro Gln
65 70 75 80
Pro Glu Pro Gln Ala Pro Pro Ala Gly Gly Val Ser Ile Ile Asp Thr
85 90 95
Asn Pro Asn Asn Ala Ser Phe Trp Ala Gly Ser Asn Asn Gly Asp Val
100 105 110
Gly Ser Arg Ala Val Ile Asp Val Asp His Pro Glu Phe Ser Gln Ala
115 120 125
Thr Arg Ile Thr Val Ser Asn Pro Ala Ser Asp Tyr Trp Asn Gly Gln
130 135 140
Leu Ser Phe Pro Leu Asn Ala Ser Val Ala Ala Gly Asp Val Val Leu
145 150 155 160
Val Arg Leu Tyr Met Arg Ser Val Glu Asn Thr Tyr Glu Ser Gly Ala
165 170 175
Ser Phe Thr Thr Val Phe Ile Glu Asp Asn Ile Asp Phe Thr Lys Phe
180 185 190
Leu Asn Arg Glu Ile Thr Ala Ala Gln Asp Trp Val Glu Tyr Tyr Leu
195 200 205
Pro Ala Glu Ile Thr Asp Asn His Ala Thr Gly Glu Val Gly Leu Arg
210 215 220
Ile Gly Phe Gly Ala Gly Pro Arg Ala Gln Val Phe Asp Ile Gly Gly
225 230 235 240
Val Glu Leu Leu His Tyr Thr Asn Thr Asp Ile Ser Ala Met Pro Ser
245 250 255
Thr Arg Pro Ser Tyr Glu Gly Arg Glu Pro Asp Ala Ala Trp Arg Thr
260 265 270
Ala Ala Ala Glu Arg Ile Glu Gln His Arg Lys Gly Asp Phe Glu Leu
275 280 285
Thr Val Val Asp Asp Gly Asn Pro Ile Ala Asn Ala Thr Ile Asp Val
290 295 300
Asp Phe Gln Lys His Ala Tyr His Phe Gly Ser Val Thr Val Gly His
305 310 315 320
Leu Leu Met Gly Thr Ser Glu Asp Ser Ala Ile Tyr Arg Glu Lys Val
325 330 335
Leu Glu Leu Phe Asn Gln Ser Gly Pro Glu Asn Asp Leu Lys Trp Gly
340 345 350
Pro Trp Glu Gly Glu Trp Gly Asn Asn Phe Asn Gln Thr Gln Thr Leu
355 360 365
Asn Gly Leu Gln Trp Leu Arg Asp Asn Gly Leu Tyr Thr Arg Gly His
370 375 380
Val Met Val Trp Pro Ser Lys Arg Asn Leu Pro Asn Leu Met Gln Gln
385 390 395 400
Tyr Leu Pro Glu Gly Asp Pro Ala Ser Ala Asn Pro Glu Ala Lys Gln
405 410 415
Val Val Leu Asp His Ile Asp Asp Ile Ala Thr Ala Thr Ala Asn Tyr
420 425 430
Leu Asp Glu Trp Asp Val Leu Asn Glu Pro Tyr Asp Asn His Tyr Leu
435 440 445
Met Asp Ala Phe Gly Asp Ser Val Met Val Asp Trp Phe Asn Arg Ala
450 455 460
Arg Thr Asn Leu Pro Ala His Gly Leu Tyr Ile Asn Asp Tyr Ser Ile
465 470 475 480
Leu Ser Ala Gly Gly Arg Asn Phe Ala His Gln Glu His Tyr Thr Asn
485 490 495
Thr Ile Gln Tyr Leu Val Asp Asn Asn Ala Pro Ile Thr Gly Ile Gly
500 505 510
Leu Gln Ser His Phe Gly Asp Ser Pro Thr Ala Ile Thr Arg Ile Tyr
515 520 525
Glu Ile Ile Asp Gln Tyr Ser Thr Ala Phe Pro Gln Leu Asp Ile Arg
530 535 540
Ala Thr Glu Phe Asp Val Ser Thr Thr Asp Glu Asp Leu Gln Ala Asp
545 550 555 560
Phe Thr Arg Asp Phe Leu Thr Ile Phe Phe Ser His Pro Lys Thr Val
565 570 575
Gly Val Gln Leu Trp Gly Phe Trp Ala Asn Ala His Trp Tyr Pro Asn
580 585 590
Ala Ala Leu Tyr Asp Ala Asp Trp Arg Glu Lys Pro Asn Ala Leu Ala
595 600 605
Trp Lys Glu Gln Ile Phe Asn Glu Trp Trp Asn Asp Phe Asp Gly Thr
610 615 620
Thr Asn Ala Gln Gly Lys Phe Asp Glu Arg Gly Phe Tyr Gly Asp Tyr
625 630 635 640
Gln Val Thr Val Thr Val Gly Glu Glu Gln Gln Ile Phe Thr Phe Ser
645 650 655
Leu Val Lys Gly Gly Glu Gln Asn Phe Ser Phe Glu Trp Gln
660 665 670
<210> 60
<211> 2013
<212> DNA
<213> Microbulbifer degradans
<400> 60
atggtcataa taactatgaa agccggttta cttctacgca tcctattaac tgtactcgcg 60
ctcaatatgc ttgccgcatg tggcggtagt tctagcaata ccaaagaacc cgttacccag 120
ccggaaccag agccagagca gcagccagaa ccagaaccag agccagagcc agagccagag 180
ccagaaccag agccagaacc agagccagaa ccagagccag agcctgaaat ggaaccgcag 240
ccagagccac aagcgccgcc tgcaggtggt gtatctatca ttgataccaa ccccaacaat 300
gcatcgtttt gggcaggctc aaacaatggt gatgtgggca gtagggctgt tatagatgtc 360
gatcaccccg aatttagcca agcgacgcgc ataaccgtaa gcaaccccgc tagcgactat 420
tggaatggtc agctctcctt cccgcttaat gcgagtgtgg cggcggggga tgtagtatta 480
gtgcgtttgt acatgcgctc ggtggagaat acttacgaat cgggtgctag ttttactacc 540
gtatttattg aagacaacat cgactttact aaatttttaa accgcgaaat aaccgccgcg 600
caagattggg tagagtatta cctacccgca gaaattaccg ataaccatgc aaccggtgaa 660
gtgggcttgc gcattggctt tggcgctggc cctagggcgc aggtgtttga tattggcggt 720
gtagagctat tgcattacac caatactgat ataagcgcta tgcctagtac acgcccaagt 780
tacgaaggcc gcgagccaga tgccgcatgg cgtacagcgg cggcagagcg aattgagcag 840
caccgcaaag gcgactttga gctaacagta gtggacgatg gcaaccctat cgccaatgcc 900
accatagatg tagattttca aaaacacgcc tatcattttg gctcggtaac tgttggccat 960
ctattgatgg gcaccagtga agatagcgcc atttaccgcg aaaaagtgct cgagctattt 1020
aaccaaagtg gcccagaaaa cgatttaaag tggggcccat gggaaggcga gtggggcaac 1080
aattttaacc aaactcaaac cctaaacggc ttgcagtggc tgcgcgataa cggcctgtac 1140
acacgtggcc atgtaatggt ttggccttct aagcgcaact tgccaaactt aatgcagcaa 1200
tatttaccag aaggcgaccc cgccagcgcc aacccagaag caaaacaagt ggtgctggat 1260
cacatcgatg atatagcaac cgcaacagct aattatttag atgagtggga tgtactaaac 1320
gagccttacg acaaccacta tttaatggat gcctttggcg atagtgtaat ggtggattgg 1380
tttaatcgcg cgcgtactaa cctgcctgcg cacggtttgt acataaacga ttacagtatt 1440
ttatctgcgg gcgggcgcaa ttttgctcac caagaacact acaccaacac gattcaatat 1500
ttggtcgata acaacgcacc catcaccggt ataggtttgc aaagtcactt tggcgactcg 1560
cctacagcca ttacgcgtat ttacgaaatt attgatcaat acagtaccgc gtttccgcag 1620
ttagatattc gcgcaacgga atttgacgta agtacaacag atgaagacct gcaggcagat 1680
tttacccgcg acttcttaac gatattcttt agccacccta aaacagtggg tgtgcagttg 1740
tggggttttt gggcaaatgc acattggtac cctaatgcag cgctttatga tgccgattgg 1800
cgagaaaagc ccaatgcact agcttggaaa gagcaaattt ttaacgagtg gtggaacgac 1860
tttgacggca cgaccaacgc acagggtaaa tttgatgaac gcggttttta cggcgattac 1920
caagtaactg taaccgtagg tgaagagcag caaattttta cctttagcct agttaaaggc 1980
ggcgaacaaa actttagttt tgagtggcaa tag 2013
<210> 61
<211> 275
<212> PRT
<213> Microbulbifer degradans
<400> 61
Met Asn Ile Lys Thr Phe Phe Pro Ala Leu Ile Ala Ser Val Phe Leu
1 5 10 15
Leu Ile Asn Ala Ser Thr Gly Tyr Ala Ala Ser Ile Thr Lys Thr Leu
20 25 30
Cys Asn Pro Ala Asp Ser Asp Asn Gly Tyr Gly Ala Gly Thr Phe Asn
35 40 45
Gly Lys Phe Tyr Ser Trp Phe Glu Leu Ser Gln Glu Asp Ile Thr Asp
50 55 60
Cys Asp Thr Lys Ile Gly Phe Tyr Asn Glu Thr Asn Arg His Phe Arg
65 70 75 80
Val Glu Trp Asn Val Ala Gln Ser Trp Gly Glu Asp Ala Ile Gly Gly
85 90 95
Met Gly Trp Ser Ser Gly Ser Arg Asp Arg Lys Ile Gly Tyr Asn Val
100 105 110
Gly Gln Leu Thr Thr Asn Ser Ser Ile Gln Lys Ala Leu Val Ala Met
115 120 125
Tyr Gly Trp Ser Cys Ser Thr Ser Gly Gly Asn Gln Ile Ser Gln Glu
130 135 140
Tyr Tyr Val Val Asp Thr Trp Asp Gly Gly Lys Phe Val Pro Trp Asp
145 150 155 160
Glu Asn Ala Asn Asn Gly Asn Gly Ala Pro Ala Gln Ser Val Gly Thr
165 170 175
Val Ser Ala Asn Gly Ala Thr Tyr Asp Val Tyr Lys Val Arg Arg Asn
180 185 190
Gly Ala Gln Tyr Cys Phe Asn Gly Ser Ser Arg Ser Phe Asp Gln Phe
195 200 205
Trp Ser Val Arg Arg Thr Pro Arg Ala Ile Asn Gly Asn Arg Asn Met
210 215 220
Asp Phe Arg Pro His Ala Asn Arg Trp Asp Asn Ser Asp Leu Gly Phe
225 230 235 240
Lys Val Asp Gly Leu Ser Ser Gly Tyr Gln Ile Leu Ala Val Glu Ile
245 250 255
Phe Gly Asp Ala Asn Leu Arg His Lys Gly Ala Ala Asp Ile Thr Leu
260 265 270
Trp Pro Arg
275
<210> 62
<211> 828
<212> DNA
<213> Microbulbifer degradans
<400> 62
atgaacataa aaacattctt ccccgcactt attgcaagtg tatttttatt aattaacgcc 60
agcactggct atgcagcaag cattaccaaa acgctttgca acccagccga ttccgataac 120
ggctacggtg caggaacctt caatggcaaa ttttattctt ggtttgagtt aagccaagaa 180
gacattaccg attgcgatac aaaaattggt ttttacaacg aaaccaatcg acactttagg 240
gtggagtgga atgttgctca atcttgggga gaagatgcaa ttggtggaat gggttggagc 300
tctggctcga gagatagaaa aataggttac aacgttggcc aacttacaac taattcttct 360
attcaaaaag cattggttgc tatgtatggc tggtcttgct ctaccagtgg tggcaaccaa 420
atatcacaag aatattatgt agtggataca tgggacggcg gcaagtttgt gccttgggat 480
gaaaacgcaa ataatggcaa cggtgctcca gcacagagtg taggaacagt tagcgctaat 540
ggtgcaacat acgatgttta taaggttcgc cgcaacggtg cgcaatattg ttttaatggc 600
agcagccgct cgtttgatca gttttggagt gtgcgtagaa cgcctagagc gattaacggc 660
aaccgtaata tggattttcg cccgcacgcc aaccgctggg acaacagtga cctaggtttt 720
aaagttgacg ggttaagcag cggttaccaa attttagcgg ttgaaatatt tggtgatgcg 780
aacctaagac ataaaggtgc agcagatatt actttatggc cacgctaa 828
<210> 63
<211> 767
<212> PRT
<213> Microbulbifer degradans
<400> 63
Met Lys Ser Ile Asn Val Cys Gly Arg Arg Leu Lys Gln Ala Leu Ala
1 5 10 15
Ala Ile Ala Thr Ala Ala Ala Thr Leu Trp Phe Thr Pro Val Asp Ala
20 25 30
Gln Thr Leu Thr Ser Asn Gln Thr Gly Thr His Gly Gly Tyr Tyr Tyr
35 40 45
Ser Phe Trp Thr Asp Ser Ala Gly Thr Val Ser Met Thr Leu Gly Asn
50 55 60
Gly Gly Asn Tyr Ser Ser Ser Trp Ser Asn Thr Gly Asn Trp Val Gly
65 70 75 80
Gly Lys Gly Trp Gln Thr Gly Gly Arg Lys Thr Val Asn Tyr Ser Gly
85 90 95
Thr Phe Asn Pro Ser Gly Asn Gly Tyr Leu Thr Leu Tyr Gly Trp Thr
100 105 110
Gln Asn Pro Leu Ile Glu Tyr Tyr Ile Ile Glu Ser Trp Gly Thr Tyr
115 120 125
Arg Pro Gly Glu Ser Gly Thr Tyr Tyr Gly Thr Val Asn Thr Asp Gly
130 135 140
Gly Thr Tyr Asp Ile Tyr Arg Thr Gln Arg Val Asn Gln Pro Ser Ile
145 150 155 160
Glu Gly Thr Ala Thr Phe Tyr Gln Tyr Trp Ser Val Arg Gln Gln Lys
165 170 175
Arg Val Gly Gly Thr Ile Thr Thr Gly Asn His Phe Asp Ala Trp Ala
180 185 190
Ser His Gly Leu Asn Leu Gly Thr His Asn Tyr Met Val Met Ala Thr
195 200 205
Glu Gly Tyr Gln Ser Ser Gly Asn Ser Asn Ile Thr Val Ser Glu Gly
210 215 220
Ser Gly Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Thr Gly Gly Pro
225 230 235 240
Ser Gly Thr Asn Ile Val Val Arg Ala Gln Gly Val Ser Gly Gln Glu
245 250 255
His Ile Asn Leu Ile Ile Gly Gly Asn Val Val Ala Asp Trp Thr Leu
260 265 270
Ser Thr Ser Met Gln Asp Tyr Thr Tyr Thr Gly Asn Ala Ala Gly Asp
275 280 285
Leu Gln Val Glu Tyr Asp Asn Asp Ala Ser Gly Arg Asp Val Glu Leu
290 295 300
Asp Tyr Val Tyr Val Asn Gly Glu Ile Arg Gln Ala Glu Asp Met Glu
305 310 315 320
Tyr Asn Thr Ala Thr Tyr Ser Gly Glu Cys Gly Gly Gly Ser Tyr Ser
325 330 335
Gln Thr Met His Cys Ser Gly Val Ile Gly Phe Gly Asp Thr Ser Asp
340 345 350
Cys Phe Ser Gly Asn Cys Asn Gly Ala Ser Ser Thr Ser Ser Ser Ser
355 360 365
Ser Ser Ser Ser Thr Ser Ser Ser Thr Ser Ser Gly Gly Asn Asn Asn
370 375 380
Ser Gly Ile Thr Val Arg Ala Arg Gly Thr Asn Gly Asp Glu His Ile
385 390 395 400
Asn Leu Ile Val Gly Gly Asn Ile Val Gly Asn Trp Thr Leu Thr Thr
405 410 415
Ser Asn Gln Asn Tyr Val Tyr Asn Gly Asn Ala Ser Gly Asp Val Glu
420 425 430
Val Gln Phe Asp Asn Asp Ala Asn Gly Arg Asp Val Ile Leu Asp Tyr
435 440 445
Val Ile Val Asn Gly Glu Thr Arg Gln Ala Glu Asp Met Glu Tyr Asn
450 455 460
Thr Ala Thr Tyr Ser Gly Ser Cys Gly Gly Gly Ser Tyr Ser Glu Thr
465 470 475 480
Met His Cys Ser Gly Glu Ile Gly Phe Gly His Thr Asp Asp Cys Phe
485 490 495
Ser Gly Asn Cys Thr Ser Ser Ser Gly Thr Thr Gly Ser Ser Gly Gly
500 505 510
Thr Ser Ser Asn Asn Gly Thr Ser Ser Cys Asn Gly Tyr Val Gly Ile
515 520 525
Thr Phe Asp Asp Gly Pro Gly Asn Asn Thr Ala Thr Leu Ile Asn Leu
530 535 540
Leu Gln Gln Asn Asn Leu Thr Pro Val Thr Trp Phe Asn Thr Gly Gln
545 550 555 560
Asn Ile Ala Ala Asn Thr Gly Gln Phe Ala Gln Gln Lys Ser Val Gly
565 570 575
Glu Ile Gln Asn His Ser Tyr Thr His Ser His Met Leu Asn Trp Ser
580 585 590
Tyr Gln Gln Val Arg Asp Glu Leu Ala Ser Thr Asn Gln Ala Ile Val
595 600 605
Asn Ala Gly Gly Ala Thr Pro Thr Leu Phe Arg Pro Pro Tyr Gly Glu
610 615 620
Thr Asn Ser Thr Ile Asn Gln Ala Ala Gln Asp Leu Gly Leu Arg Val
625 630 635 640
Ile Thr Trp Asp Val Asp Ser Arg Asp Trp Asp Gly Ala Ser Ala Ser
645 650 655
Ala Ile Ala Asn Ser Ala Asn Gln Leu Gln Asn Gly Gln Val Ile Leu
660 665 670
Met His Asp Ala Ser Tyr Asn Asn Thr Asn Gly Ala Ile Ser Gln Phe
675 680 685
Ala Ala Asn Leu Arg Ala Arg Gly Leu Cys Ala Gly Lys Ile Asp Pro
690 695 700
Ser Thr Gly Arg Ala Val Ala Pro Ser Thr Asn Thr Gly Gly Asn Thr
705 710 715 720
Gly Ser Asn Thr Gly Asn Gly Gly Asn Gly Gly Met Cys Asn Trp Tyr
725 730 735
Gly Thr Ser Ile Pro Leu Cys Gln Thr Thr Asn Asp Gly Trp Gly Trp
740 745 750
Glu Asn Ser Gln Ser Cys Val Ser Gln Asn Thr Cys Asn Ser Gln
755 760 765
<210> 64
<211> 2304
<212> DNA
<213> Microbulbifer degradans
<400> 64
atgaagtcaa tcaatgtatg cggcagacgc ctcaagcaag ccctcgcagc aatagcaacc 60
gctgcagcaa ctctctggtt tacgccagtg gatgcacaaa ccttaacctc aaaccaaact 120
ggtactcatg gtggttacta ctattccttc tggaccgaca gtgctggcac tgtttctatg 180
acactcggca atggcggcaa ttacagttca tcgtggagca ataccggtaa ctgggtggga 240
ggtaaaggct ggcaaacggg gggacgcaaa accgtaaact attccggtac gtttaacccc 300
tcgggcaatg gttatttaac cctctacggt tggacccaaa acccactcat tgaatactac 360
atcattgaaa gctggggcac ctatcgccca ggtgaaagcg gaacctacta cggcaccgtc 420
aacaccgatg gcggcactta cgatatttat cgcacccaac gcgttaacca accgtcaatt 480
gaaggcactg caacgtttta tcagtactgg agtgttaggc aacaaaaacg cgtaggcggc 540
accataacaa ccggcaacca ttttgatgcg tgggcgagtc atggccttaa cctaggcaca 600
cacaattaca tggtaatggc caccgaaggt tatcaaagta gcggcaactc caatattacc 660
gttagcgaag gcagcggttc gagcagtact agttcgagta gctctagcac cggtggccca 720
agtggtacca atattgttgt gcgcgcacaa ggtgtaagcg gccaagaaca tatcaattta 780
attattggcg gtaacgtagt ggcagactgg acgctttcaa ccagcatgca agattacacc 840
tacaccggta atgccgcagg cgacctgcaa gtagaatacg acaacgatgc tagtggtcgc 900
gatgtagagc tagactatgt gtatgtgaat ggcgaaattc gtcaagcaga agacatggaa 960
tacaacaccg caacttacag tggtgaatgt ggtggcggtt cctattcgca aaccatgcac 1020
tgcagcggtg taattggctt tggcgatacc agtgattgtt ttagcggcaa ctgtaatggt 1080
gcatcttcta caagttctag ttcgtctagt agctcaacca gctctagcac aagctctggc 1140
ggtaacaata acagcggcat tactgttcgc gcacgcggta ccaatggcga tgaacatatc 1200
aaccttattg ttggcggcaa tatagtaggc aattggacgc tcaccaccag caaccaaaat 1260
tatgtttaca acggcaatgc atctggtgat gtagaagtac aattcgacaa cgatgccaac 1320
ggtcgcgatg ttattctcga ttacgtaata gtaaatggcg aaactcgcca agcggaagat 1380
atggaataca acacggcgac ctacagcggt tcctgtggtg gtggctccta ttcggaaaca 1440
atgcactgca gcggcgaaat tggttttggt cacaccgacg attgctttag tggaaattgc 1500
actagcagca gcggcacaac cggtagctct ggaggaacat caagcaataa cggtacaagt 1560
agctgtaacg gttatgtagg tattaccttc gatgatggcc caggcaataa caccgctaca 1620
ttaataaact tactacaaca aaataactta accccagtaa cttggtttaa cacaggccaa 1680
aatattgctg ccaatacagg tcagtttgcc cagcaaaaaa gtgttggtga aattcaaaac 1740
cacagctaca cccattccca tatgcttaat tggagctatc aacaagttcg cgacgaactc 1800
gccagcacca atcaagctat tgtgaatgct gggggcgcaa cgccaactct attccgtccg 1860
ccttatggcg aaacaaactc caccattaat caagcggcac aagatttagg cctgcgcgta 1920
ataacctggg atgtagattc gcgcgattgg gatggcgcaa gcgcttcagc tattgccaac 1980
tcggctaatc agttgcaaaa cggccaagta attttgatgc acgatgccag ctacaacaat 2040
accaacggag ccatatcaca atttgcagcc aatctaagag caagagggct atgtgcaggt 2100
aaaatagacc caagcactgg ccgcgcagtt gcaccaagca caaataccgg cggcaacact 2160
ggcagcaata caggaaatgg cggtaatggc ggcatgtgta actggtacgg caccagcatt 2220
ccattatgcc aaactaccaa cgacggttgg ggctgggaaa actcacaaag ctgcgtttcg 2280
caaaatacct gtaactcaca ataa 2304
<210> 65
<211> 360
<212> PRT
<213> Microbulbifer degradans
<400> 65
Met Cys Leu Lys Ile Asn Arg Cys Trp Val Phe Val Trp Leu Cys Ile
1 5 10 15
Cys Ala Thr Thr Ala His Ser Glu Thr Tyr Val Pro Ala Asp Asn Asp
20 25 30
Gln Tyr Leu Tyr Thr Gly Arg Ile Asp Phe Ser Asp Ile Lys Ala Pro
35 40 45
Ser Leu Ser Trp Pro Gly Thr Ser Ile Lys Ala Asn Phe Thr Gly Glu
50 55 60
His Leu Glu Val Val Leu Asp Asp Gln Asn Gly Lys Asn Phe Phe Asn
65 70 75 80
Val Ile Ile Asp Gly Asn Asp Arg Phe Pro Tyr Val Leu Glu Ala Lys
85 90 95
Gln Gly Glu His Arg Tyr Leu Ile Ser Ser Ala Leu Ser Lys Gly Lys
100 105 110
His Ser Val Glu Ile Tyr Lys Arg Thr Glu Gly Glu Glu Gly Ala Thr
115 120 125
Leu Phe Lys Gly Leu Trp Leu Ala Asp Asp Ser Tyr Leu Leu Lys Pro
130 135 140
Pro Lys Arg Pro Lys Arg Arg Ile Glu Ile Tyr Gly Asp Ser Ile Thr
145 150 155 160
Ser Gly Met Gly Asn Glu Gly Ala Asp Asn Gly Ala Asp His Leu Gly
165 170 175
Ser Glu Lys Asn Asn Tyr Leu Ala Tyr Gly Ala Ile Thr Ala Arg Asn
180 185 190
Leu Asn Ala Glu Leu His Thr Ile Ser Gln Ser Gly Ile Gly Val Met
195 200 205
Val Ser Trp Phe Pro Phe Ile Met Pro Gln Phe Tyr Asn Gln Leu Ser
210 215 220
Ala Val Gly Asn Asn Asp Ser Ile Trp Asp Phe Lys Gln Trp Thr Pro
225 230 235 240
His Val Val Val Ile Asn Leu Met Gln Asn Asp Ser Trp Leu Ile Asp
245 250 255
Arg Glu Lys Arg Leu Thr Pro Ile Pro Ala Asp Ala Gln Arg Ile Ala
260 265 270
His Tyr Gln Ala Phe Val Gln Ser Ile Arg Ala Glu Tyr Pro Lys Ala
275 280 285
Gln Ile Ile Cys Ala Leu Gly Ser Met Asp Ala Thr Ala Asn Glu Lys
290 295 300
Trp Pro Asn Tyr Val Arg Glu Ala Val Lys Asn Met Gln Asp Asn Gly
305 310 315 320
Asp Asn Lys Ile Asp Thr Ile Phe Phe Glu Tyr Ile Gly Tyr Gly Gln
325 330 335
His Pro Arg Val Ala Gln His Asn Ala Asn Ala Asp Lys Leu Thr Lys
340 345 350
Phe Ile Lys Lys Lys Met Lys Trp
355 360
<210> 66
<211> 1083
<212> DNA
<213> Microbulbifer degradans
<400> 66
atgtgcctaa aaataaaccg gtgctgggtg tttgtttggt tgtgtatttg cgcaactact 60
gcccatagtg aaacctacgt acccgcagat aacgaccaat acctttatac cggccgtata 120
gattttagcg atataaaagc accctcgcta agctggcccg gcacaagtat aaaagccaac 180
tttaccggcg aacatttaga ggtagtgtta gacgatcaaa acggtaagaa tttttttaat 240
gtgattatcg acggtaacga tcgatttcct tatgtgctag aagctaaaca aggtgagcat 300
cgatatttaa tttcttctgc gctaagcaag ggcaagcaca gcgtagaaat ttataaacgt 360
acagaaggcg aagagggcgc aacgctattt aaagggcttt ggttagccga tgatagttat 420
ttattaaaac cccctaaacg cccaaaacgc agaatagaaa tttatggtga ctcaattaca 480
agcggtatgg gtaacgaagg cgcagataac ggcgccgacc atttgggctc cgaaaaaaat 540
aattaccttg cctatggggc tattaccgca cgcaatttaa acgccgagct acataccatt 600
tcgcaaagcg gtattggggt aatggtaagt tggtttccgt ttattatgcc gcagttttac 660
aaccagctaa gtgctgttgg taataatgat tccatatggg actttaaaca atggacgccc 720
catgtagttg taataaacct aatgcaaaac gatagctggc taatagatag agaaaagcgc 780
cttacgccaa ttcctgcaga tgcacaacgc atagcccatt atcaagcgtt tgtgcaaagc 840
attcgtgccg aataccccaa ggcgcaaata atatgcgcac tgggcagtat ggatgcaacc 900
gcaaacgaaa aatggccaaa ctacgtgcgc gaagctgtaa aaaatatgca agataatggc 960
gataataaaa tcgatactat tttctttgaa tacatcggct acggccaaca cccgcgcgta 1020
gcgcaacaca atgcgaatgc agataagtta actaaattta ttaagaaaaa aatgaaatgg 1080
tag 1083
<210> 67
<211> 973
<212> PRT
<213> Microbulbifer degradans
<400> 67
Met Asn Tyr Tyr Leu Asn Lys Lys Arg Leu Gly Gln Leu Leu Thr Gly
1 5 10 15
Ala Ala Ile Ile Pro Val Leu Tyr Ala Cys Gly Ser Gln Glu Lys Asn
20 25 30
Val Glu Pro Ala Thr Val Asn Trp His Lys Thr Ser Asp Gly Val Val
35 40 45
Val Ser Leu Gln Asp Ser Glu Ala Lys Lys Val Arg Leu Gln Val Ile
50 55 60
Asn Asp Arg Ile Val Arg Val Thr Ala Thr Pro Gln Gln Asp Phe Asn
65 70 75 80
Asn Leu Pro Asn Thr Leu Met Val Val Ala Lys Pro Glu Gln Thr Ala
85 90 95
Phe Glu Val Lys Gln Asn Asp Ala Ser Val Val Leu Ser Thr Ala Asp
100 105 110
Leu Ser Ala Glu Val Ser Leu Val Thr Gly Val Val Ser Phe Lys Asp
115 120 125
Glu His Gly Lys Val Leu Thr Thr Glu Val Asp Arg Gly Asn Phe Gly
130 135 140
Ala Val Thr Arg Asp Pro Gly Val Val Asp Ala Asp Ser Phe Ala Ile
145 150 155 160
Arg Gln Gln Phe Thr Ser Asp Glu Asn Glu Gly Tyr Tyr Gly Leu Gly
165 170 175
Gln Gln Gln Asp Gly Glu Val Asn Tyr Ala Gly Asp Asn Val Glu Leu
180 185 190
Thr Thr Tyr Asn Leu Glu Ile Ser Ile Pro Tyr Val Val Ser Ser Lys
195 200 205
Asp Tyr Ala Leu Leu Trp Asn Asn Thr Ser Ile Ser Arg Leu Gly Asp
210 215 220
Pro Asn Pro Pro Glu Pro Leu Lys Glu Gly Phe Lys Leu Phe Asp Ala
225 230 235 240
Asn Gly Asn Pro Gly Gly Leu Thr Ala Arg Tyr Phe Asp Gly Asp Lys
245 250 255
Leu Leu Leu Glu Arg Val Glu Ala Asp Leu Asp Tyr Gln Phe Leu Ala
260 265 270
Gln Gly Ser Asn Arg Thr Thr Pro Met Pro Asp Glu Thr Ala Asp Ala
275 280 285
Lys Asn Leu Arg Ile Glu Trp Glu Gly Ser Ile Glu Ser Asp Thr Asn
290 295 300
Gly Val His Glu Leu Lys Met Tyr Ser Ser Gly Tyr Ala Lys Leu Tyr
305 310 315 320
Leu Asn Gly Glu Leu Val Leu Asp Arg Trp Arg Met Asn Trp Asn Pro
325 330 335
Trp Tyr His Asn Thr Lys Leu Glu Met Gln Ala Gly Lys Lys Val Ala
340 345 350
Leu Lys Leu Asp Trp Gln Val Asp Gly Gly Tyr Met Arg Ile Lys Gln
355 360 365
His Lys Pro Leu Pro Val Ala Glu Gln Gly Arg Leu Ser Ile Ala Ser
370 375 380
Asp Thr Ala Lys Ala Ile Asp Tyr Tyr Phe Val Val Gly Asp Asn Lys
385 390 395 400
Asp Glu Leu Val Ser Gly Tyr Arg Thr Leu Thr Gly Lys Ala Val Met
405 410 415
Leu Pro Lys Trp Val Phe Gly Phe Trp Gln Ser Arg Glu Arg Tyr Lys
420 425 430
Thr Gln Asp Glu Ile Ile Asp Ala Leu Gln Glu Tyr Arg Asp Arg Lys
435 440 445
Ile Pro Ile Asp Asn Ile Val Leu Asp Trp Ser Tyr Trp Pro Gln Asp
450 455 460
Ala Trp Gly Ser His Asp Phe Asp Glu Gln Phe Phe Pro Asp Pro Ser
465 470 475 480
Ala Leu Val Asp Lys Val His Glu Leu Asn Gly Asn Ile Met Ile Ser
485 490 495
Val Trp Pro Lys Phe Tyr Pro Thr Thr Asp Asn Tyr Lys Ala Leu Asn
500 505 510
Ala Lys Gly Cys Met Phe Asn Lys Asn Ile Glu Gln Lys Asn Leu Asp
515 520 525
Trp Ile Gly Glu Gly Tyr Leu Asn Gly Phe Tyr Asp Ala Tyr Asn Pro
530 535 540
Glu Cys Arg Glu Met Phe Trp Ala Gln Ile Arg Asp Lys Ile Asn Val
545 550 555 560
His Gly Phe Asp Ala Trp Trp Leu Asp Ala Val Glu Pro Asp Ile His
565 570 575
Ser Asn Leu Ser Phe Glu His Arg Lys Asp Leu Met Thr Pro Asn Ala
580 585 590
Leu Gly Thr Gly Ala Glu Val Phe Asn Ala Tyr Ala Leu Pro His Ala
595 600 605
Glu Thr Val Tyr Gln Gly Glu Arg Arg Asp Asp Gly Asp Lys Arg Ala
610 615 620
Phe Ile Leu Thr Arg Ser Gly Phe Ala Gly Ile Gln Arg Thr Gly Ser
625 630 635 640
Ala Ile Trp Ser Gly Asp Val Val Ser Arg Trp Ser Asp Leu Lys Glu
645 650 655
Gln Ile Ala Ala Gly Val Gly Val Gly Ile Ser Gly Met Pro Tyr Trp
660 665 670
Thr Phe Asp Ile Gly Gly Phe Thr Pro Glu Asp Arg Tyr Arg Tyr Ser
675 680 685
Ala Lys Gly Ser Val Gly His Phe Ser Met Met Asn Glu Ser Glu Val
690 695 700
Pro Glu Trp Gln Glu Ile Asn Leu Arg Trp Phe Gln Phe Gly Thr Phe
705 710 715 720
Val Pro Leu Phe Arg Ser His Gly Gln Asn Pro Tyr Arg Glu Ile Tyr
725 730 735
Asn Ile Ala Asp Lys Gly Thr Glu Val Tyr Asp Ser Met Val Trp Tyr
740 745 750
Thr Lys Thr Arg Tyr Arg Leu Met Pro Tyr Ile Tyr Ser Leu Val Gly
755 760 765
Asp Ala His His Lys Asp Gly Thr Phe Met Arg Ala Leu Val Met Asp
770 775 780
Phe Pro Ser Asp Leu Asn Val Arg Asp Ile Asn Asp Gln Tyr Met Phe
785 790 795 800
Gly Pro Ala Leu Leu Val Asn Pro Val Ser Glu Phe Lys Ala Arg Ser
805 810 815
Arg Asp Val Tyr Leu Pro Ala Gly Ala Asp Trp Tyr Asp Phe Tyr Thr
820 825 830
Gly Val Lys His Thr Gly Gly Lys Thr Ile Lys Ala Asp Ala Pro Leu
835 840 845
Ala Lys Met Pro Ile Phe Val Lys Ala Gly Ser Ile Ile Pro Thr Gly
850 855 860
Val Glu Ile Gln His Val Tyr Asp Lys Pro Asp Ala Pro Tyr Thr Leu
865 870 875 880
Asn Val Tyr Thr Gly Ala Asn Gly Ser Phe Glu Ile Tyr Glu Asp Asp
885 890 895
Gly Lys Thr Tyr Ala Tyr Glu Gln Gly Ala Trp Ala Arg Ile Pro Val
900 905 910
Ser Tyr Asn Asp Lys Thr Gly Glu Leu Thr Ile Gly Asp Arg Val Gly
915 920 925
Ser Phe Glu Gly Met Thr Lys Glu Arg Glu Phe Arg Val Arg Trp Ile
930 935 940
Ser Ala Lys Arg Asp Asp Ala Ala Asn Phe Asp Thr Gly Val Ala Lys
945 950 955 960
Ala Val Thr Tyr Thr Gly Lys Ala Ile Thr Ile Lys Arg
965 970
<210> 68
<211> 2922
<212> DNA
<213> Microbulbifer degradans
<400> 68
gtgaattatt atttaaacaa aaagcgactg gggcaattgc tcaccggcgc ggccattatt 60
cccgtgctat atgcatgtgg ctcacaggaa aaaaacgtag agcctgcaac ggttaattgg 120
cataaaacaa gcgacggcgt cgttgtaagc ttgcaagata gcgaagcaaa aaaagtgcgc 180
ttgcaagtca ttaacgatcg gatagtacgt gttaccgcta cgccacagca ggatttcaac 240
aacctgccaa atacgcttat ggtggtggcc aagcccgagc aaacggcgtt tgaagttaaa 300
caaaacgatg catctgttgt gttatcaacg gcagatctat ctgccgaagt gtcattagta 360
actggtgttg taagttttaa agatgagcac ggcaaggtgc ttacaacaga agttgatcgc 420
ggcaattttg gggcggtaac ccgcgaccca ggtgtggtgg acgccgattc atttgctatt 480
cgccaacagt ttacaagcga cgaaaatgaa ggctactacg gtttaggtca gcagcaggat 540
ggcgaagtaa actacgctgg cgataacgta gagttaacaa cttacaactt agaaatttct 600
ataccttatg ttgtatcaag caaagattac gcgctgctat ggaacaatac ctcaatttct 660
cgtttgggcg accccaatcc acccgagcca ctaaaagagg gctttaaact ctttgacgct 720
aatggtaacc ccggcgggct aaccgcacgt tattttgatg gcgataaatt actgctcgag 780
cgtgtagagg ccgatttaga ttatcaattt ttagcgcaag gtagtaatcg cactacgccc 840
atgcctgatg aaaccgctga tgcaaaaaat ctgcgtattg aatgggaagg tagtatcgaa 900
tccgatacca acggtgtgca cgagttaaaa atgtattcca gtggctacgc taaattgtat 960
ttgaatggcg agttagtgtt agatcgctgg cgtatgaact ggaacccttg gtatcacaac 1020
accaagttag aaatgcaggc cggtaaaaaa gttgcattaa agttagattg gcaagtagat 1080
ggtggttata tgcgcataaa acagcataaa ccactgccgg tagcagagca gggacgtttg 1140
tctattgctt ccgataccgc gaaagccatt gattactact ttgtagttgg cgataacaag 1200
gatgagttgg tgtctggcta ccgtacgctc acaggtaaag cagtgatgct acctaagtgg 1260
gtgtttggtt tttggcaaag ccgcgagcgc tataaaacac aagatgaaat tatcgacgcc 1320
ttgcaagaat accgcgatcg taaaattcct atcgataaca ttgtattaga ttggagttat 1380
tggcctcagg atgcatgggg tagtcatgat ttcgacgagc aatttttccc cgacccatct 1440
gcactagtag ataaagtaca cgagctaaac ggcaatatta tgatttccgt atggcctaag 1500
ttttacccta caaccgacaa ctacaaagcg ctaaacgcta aaggttgtat gtttaataaa 1560
aacatcgagc agaaaaacct cgattggatt ggcgagggtt acctaaatgg cttttacgat 1620
gcctataacc cagagtgccg tgaaatgttt tgggcgcaaa ttcgcgataa gatcaatgtg 1680
cacggtttcg atgcttggtg gttagatgcg gtagagccag atatccattc caacctttct 1740
tttgagcacc gcaaagattt aatgacaccc aatgcactcg gcaccggtgc cgaagtgttt 1800
aacgcttacg ctttgccgca cgcagaaact gtttaccaag gcgagcgtag agatgacggt 1860
gacaagcgcg catttattct aacgcgttct gggtttgccg gtattcagcg caccggttcg 1920
gctatttgga gtggcgatgt ggtatcgcgc tggtccgact taaaagaaca aattgcagca 1980
ggtgtgggcg tgggcatttc tggtatgccg tattggacgt tcgatatcgg tggctttact 2040
ccagaagatc gctaccgtta tagcgccaaa ggttctgttg gtcatttctc tatgatgaac 2100
gaatcggaag tgcctgaatg gcaagaaatc aatctgcgtt ggttccaatt tggtaccttt 2160
gtgccgctgt ttaggtccca cggccaaaac ccatatcgcg aaatatataa catcgccgat 2220
aaaggcaccg aggtatacga cagcatggtg tggtacacca aaactcgcta tcgcttaatg 2280
ccttatattt attcgttagt tggcgatgct caccacaaag acggcacctt tatgcgcgct 2340
ctggtgatgg atttccctag cgaccttaat gtgcgcgata ttaacgacca gtatatgttt 2400
ggccccgcgc tactcgtaaa ccctgtgtcg gaatttaaag cgcgttcacg ggatgtgtat 2460
ctacctgcgg gcgcagattg gtacgatttc tatacaggtg tgaagcacac aggtggtaaa 2520
accattaagg ccgatgcacc gcttgccaaa atgcctattt ttgttaaggc cggctctatt 2580
attccaacag gtgtagaaat ccagcatgtg tacgataagc ccgatgctcc ttacaccctt 2640
aacgtgtata ccggtgcgaa tggcagcttc gaaatttatg aagatgacgg caaaacctac 2700
gcttacgagc aaggggcttg ggcgcgcatt cccgtttcgt acaacgataa aaccggtgag 2760
ctaaccattg gcgatcgcgt aggtagcttt gagggaatga ccaaagagcg cgaattccgc 2820
gtgcgctgga tatctgccaa gcgagacgat gccgccaatt tcgatacagg tgtggccaaa 2880
gccgttacct atacgggtaa ggcaataacc attaagcgct aa 2922
<210> 69
<211> 893
<212> PRT
<213> Microbulbifer degradans
<400> 69
Met Asn Lys His Phe Leu Val Gly Val Ile Thr Leu Gly Val Ile Leu
1 5 10 15
Gln Gly Leu Thr Ala Cys Ser Lys Ser Ala Ala Pro Asn Ala Asn Gln
20 25 30
Pro Gln Asp Thr Ala Ala Ser Thr Ala Thr Tyr Pro Phe Arg Asp Ala
35 40 45
Ser Leu Ser Val Asp Ala Arg Val Asp Asp Leu Val Ser Arg Leu Thr
50 55 60
Thr Thr Glu Lys Ile Ala Gln Met Phe Asn Asp Thr Pro Ala Ile Glu
65 70 75 80
Arg Leu Gly Ile Pro Ala Tyr Asn Trp Trp Asn Glu Ser Leu His Gly
85 90 95
Val Ala Arg Ala Gly Lys Ala Thr Val Tyr Pro Gln Ala Ile Gly Leu
100 105 110
Ala Ser Thr Phe Asp Glu Asp Leu Met Leu Arg Val Ala Thr Ser Ile
115 120 125
Ser Asp Glu Gly Arg Ala Lys Tyr His Asp Phe Leu Ser Lys Asp Val
130 135 140
Arg Thr Ile Tyr Gly Gly Leu Thr Phe Trp Ser Pro Asn Ile Asn Ile
145 150 155 160
Phe Arg Asp Pro Arg Trp Gly Arg Gly Gln Glu Thr Tyr Gly Glu Asp
165 170 175
Pro Phe Leu Thr Gly Arg Met Ala Ile Asn Phe Val Lys Gly Ile Gln
180 185 190
Gly Glu Asn Asp Asn Ser Asp Tyr Leu Lys Ala Val Ala Thr Ile Lys
195 200 205
His Tyr Ala Val His Ser Gly Pro Glu Lys Thr Arg His Ser Asp Asp
210 215 220
Tyr His Pro Thr Arg Lys Asp Leu Phe Glu Thr Tyr Leu Pro Ala Phe
225 230 235 240
Arg Met Ala Ile Ala Glu Thr Asn Val Gln Ser Leu Met Cys Ala Tyr
245 250 255
Asn Arg Val Asp Gly Ala Pro Ala Cys Gly Asn Asn Glu Leu Met Gln
260 265 270
Glu Ile Leu Arg Gly Asp Met Gly Phe Asn Gly Tyr Val Val Ser Asp
275 280 285
Cys Gly Ala Ile Ala Asp Phe Tyr Glu Ser Arg Ser His His Val Val
290 295 300
Asp Ser Pro Ala Glu Ala Ala Ala Trp Ala Val Lys Ser Gly Thr Asp
305 310 315 320
Leu Asn Cys Gly Asp Ser His Gly Asn Thr Tyr Thr Asn Leu His Tyr
325 330 335
Ala Leu Gln Gln Gly Leu Ile Thr Glu Asp Tyr Ile Asp Ile Ala Val
340 345 350
Lys Arg Leu Phe Lys Ala Arg Ile Lys Leu Gly Met Phe Asp Glu Gln
355 360 365
Asp Arg Val Pro Tyr Ser Glu Ile Gly Met Asp Val Val Gly Ser Pro
370 375 380
Lys His Leu Ala Leu Thr Gln Glu Ala Ala Glu Lys Ser Ile Val Leu
385 390 395 400
Leu Lys Asn Asn Gly Val Leu Pro Leu Lys Ala Gly Val Lys Val Ala
405 410 415
Val Ile Gly Pro Asn Ala Val Asp Glu Asp Val Leu Val Gly Asn Tyr
420 425 430
His Gly Val Pro Val Lys Pro Val Leu Pro Leu Glu Gly Ile Val Asn
435 440 445
Arg Val Gly Glu Ala Asn Val Phe Tyr Ala Pro Gly Ser Ala Gln Ile
450 455 460
Ala Asp Ile Tyr Ser His Tyr Glu Pro Ile Ser Ala Glu Asn Phe Tyr
465 470 475 480
His Lys Asp Ala Asn Gly Asn Leu Ala Ala Gly Leu Lys Ala Glu Tyr
485 490 495
Tyr Ala Asp Tyr Tyr Asn Ala Ala Glu Ile Asn Asp Asp Thr Phe Ser
500 505 510
Ala Thr Pro Ala Leu Asn Arg Ile Asp Ala Asp Ile Asn Phe Ser Trp
515 520 525
Pro Val Ser Pro Ile Asp Asn Ser Leu Asp Asp Glu Phe Ser Ala Val
530 535 540
Trp Thr Gly Ile Leu Lys Pro Lys Lys Ser Gly Ser Tyr Arg Phe Ser
545 550 555 560
Gly Thr Val Ala Leu Ala Ile Asn Gly Lys Pro Val Asn Gly Ala Val
565 570 575
Asn Leu Lys Ala Gly Glu Ser Tyr Asn Ile Lys Ala Ile Phe Gly Val
580 585 590
Gln Lys Trp Trp Pro Val Asn Ala Ile His Pro Tyr Gly Lys Leu Thr
595 600 605
Trp Leu Asp Glu Ser Arg Asp Leu Glu Glu Glu Ala Leu Ala Ala Ala
610 615 620
Arg Lys Ala Asp Val Ile Ile Phe Met Gly Gly Ile Asp Ala His Leu
625 630 635 640
Glu Gly Glu Glu Met Pro Leu Glu Leu Asp Gly Phe Thr His Gly Asp
645 650 655
Arg Thr His Ile Asn Leu Pro Lys Val Gln Thr Asn Leu Leu Lys Gln
660 665 670
Leu Lys Ala Thr Gly Lys Pro Val Val Met Val Asn Phe Ser Gly Ser
675 680 685
Ala Met Ala Leu Asn Trp Glu Ser Glu Lys Leu Asp Ala Ile Leu Gln
690 695 700
Ala Phe Tyr Pro Gly Glu Ala Thr Gly Thr Ala Leu Ala Asn Ile Leu
705 710 715 720
Trp Gly Asp Val Ser Pro Ser Gly Arg Leu Pro Val Thr Phe Tyr Lys
725 730 735
Gly Val Asp Asp Leu Pro Ala Phe Asn Asp Tyr His Met Glu Asn Arg
740 745 750
Thr Tyr Lys Phe Tyr Arg Gly Glu Pro Leu Tyr Ala Phe Gly His Gly
755 760 765
Leu Gly Tyr Val Asp Phe Ala Tyr Asn Asn Leu Val Val Ala Asn Thr
770 775 780
Ala Glu Ala Gly Lys Ala Leu Pro Ile Ala Val Ser Val Thr Asn Thr
785 790 795 800
Gly Lys Met Gln Ala Glu Asp Val Ala Gln Val Tyr Ile Ser Leu Leu
805 810 815
Asp Ala Pro Ala Asn Thr Pro Ile Arg Asp Leu Lys Ala Phe Lys Arg
820 825 830
Thr Lys Leu Ala Ala Gly Glu Ser Thr Glu Leu Glu Phe Asn Leu Pro
835 840 845
Ala Arg Val Leu Thr Tyr Ile Asp Asp Asn Gly Lys Thr Gln Thr Tyr
850 855 860
Thr Gly Arg Val Glu Val Thr Val Gly Ser Gly Gln Lys Gly Tyr Val
865 870 875 880
Lys Glu Asn Ala Ile Ala Val Ala Thr Ile Asn Val Gln
885 890
<210> 70
<211> 2682
<212> DNA
<213> Microbulbifer degradans
<400> 70
atgaataaac actttttagt aggtgtaatt acgttagggg taattctgca ggggctaact 60
gcatgtagca aaagcgctgc acctaatgcc aatcaaccgc aagataccgc agctagtacg 120
gctacctacc cgtttaggga tgcaagctta agtgtagatg cccgcgtaga cgacttggta 180
tcgcgtttaa ccacaaccga aaaaattgcc caaatgttta acgatacgcc cgcaatcgag 240
cgattgggta ttcccgccta caattggtgg aacgaatcgt tgcacggtgt ggcccgtgcg 300
ggtaaagcaa cggtataccc gcaggcaata ggcttagcgt ctacatttga tgaagactta 360
atgttgcgcg tggctacttc tatttctgat gaggggcgcg ctaagtatca cgacttccta 420
tcgaaagacg tgcgcaccat atacggcggg cttacctttt ggtcgccaaa tattaatatc 480
ttccgcgacc cgcgttgggg cagggggcaa gaaacctacg gtgaagaccc gttcttaacg 540
gggcgtatgg ccattaattt tgttaagggt attcaaggcg aaaacgacaa cagcgattac 600
ctaaaagccg tagcgacaat taagcactat gccgtacaca gcggccccga aaaaacgcgt 660
cattcggatg actaccatcc aacccgtaaa gatttattcg aaacctattt gcctgcattt 720
cgcatggcaa tagcagagac taacgtgcaa tcgttaatgt gtgcctacaa ccgtgtagat 780
ggggcacctg cctgtggcaa taatgaatta atgcaagaaa ttttgcgtgg cgatatgggc 840
tttaacggtt atgtcgtgtc tgactgtggc gccattgccg atttttacga gagtagatcg 900
caccacgtgg ttgactcacc tgcagaggct gcagcgtggg ccgttaaatc gggtaccgat 960
ttaaactgtg gcgattcaca tggcaatacc tacaccaacc tgcattacgc gttacagcaa 1020
ggtttaatta cagaagatta tattgatata gcggtaaagc gtttgtttaa agcgcgtatt 1080
aagcttggca tgtttgacga gcaagaccgc gtgccttaca gcgaaattgg tatggatgtt 1140
gtaggttcac ctaagcacct agcgctaacc caagaagcgg cagaaaaatc tattgtgctg 1200
ctaaaaaaca atggtgtatt gccattaaaa gcaggggtaa aggtagccgt aatagggcca 1260
aatgcagttg atgaagatgt attggtaggc aactaccacg gcgtaccagt gaaacctgtg 1320
ttgccgctag aggggattgt taatcgtgtt ggcgaggcca acgtatttta tgccccaggc 1380
agtgcacaaa tagccgatat atacagccac tacgaaccga taagtgcaga aaatttttat 1440
cataaagatg caaatggtaa tttagctgca ggcttaaaag cagagtatta cgccgattat 1500
tacaacgcag ctgaaattaa cgacgatacc tttagcgcaa ccccagcgtt aaatagaatt 1560
gatgcagata ttaatttctc ttggcctgta tcgcctattg ataattcgtt agatgatgaa 1620
tttagtgcag tatggacagg catacttaaa ccgaaaaagt cgggtagcta ccgtttctcg 1680
ggcacggttg cattagccat taacggcaaa cctgttaatg gggctgttaa cctaaaggca 1740
ggtgaaagct ataacataaa agctattttt ggcgtgcaaa aatggtggcc cgttaatgca 1800
atacacccgt acggaaaact tacttggcta gatgagtcgc gcgatttaga agaagaggca 1860
ttagctgctg cccgaaaagc cgatgtgatt atttttatgg gcggtataga tgcgcacctt 1920
gaaggcgaag aaatgccgct agagctagat ggctttactc acggtgatcg tacgcacatt 1980
aatttaccta aagtacaaac caatttgctt aaacaattaa aagcaacggg taaacctgtt 2040
gtaatggtta actttagtgg tagtgccatg gctttaaatt gggaaagcga aaagctagac 2100
gcaatactgc aagcgtttta cccaggtgaa gcaaccggta cagcgttagc taatattttg 2160
tggggcgatg taagcccgag tggccgctta cctgtaacct tttacaaagg cgtagacgat 2220
ctaccagcat ttaatgatta ccacatggaa aaccgcacct ataaatttta ccgcggtgag 2280
cctttgtatg catttggcca cggtttaggt tacgttgatt ttgcttataa caatttagtc 2340
gtagcaaata ctgcagaagc gggcaaagcg ctacctatag ctgtaagcgt aaccaatacc 2400
ggtaaaatgc aagcagaaga cgttgcccaa gtttatataa gtttgctaga tgcccccgca 2460
aacacgccca tccgcgattt aaaagcgttt aaacgtacca agcttgcggc aggcgaaagc 2520
accgagcttg aatttaactt gccggcgaga gtgcttacct atatagacga taatggtaaa 2580
acccaaacct atactggcag ggtagaagtt actgttggct ctgggcaaaa gggatacgta 2640
aaagaaaatg cgatagctgt agcgactatt aacgttcagt ag 2682
<210> 71
<211> 317
<212> PRT
<213> Microbulbifer degradans
<400> 71
Met Tyr Thr Tyr Val Ser Ala Ile Ala Leu Phe Ile Phe Ser Ile Ala
1 5 10 15
Ser Ser Cys Cys Val Ala Gln Asn Pro Leu Asp Phe Gly Ser Asn Ile
20 25 30
Lys Thr Ala Asp Pro Ser Gly His Ile Trp Ala Asp Gly Arg Met Tyr
35 40 45
Leu Tyr Thr Ser His Asp Gln Glu Cys Gln Glu Asp Phe Tyr Met Lys
50 55 60
Asp Trp His Thr Phe Ser Ser Ser Asp Leu Ile Asn Trp Thr Ala His
65 70 75 80
Gly Pro Ser Leu Ser Val Ala Asp Ile Thr Trp Ala Asp Asn Tyr Ala
85 90 95
Trp Ala Pro Asp Ala Ala Tyr Lys Asn Gly Lys Tyr Tyr Leu Phe Phe
100 105 110
Pro Ala Gly Thr Gly Val Lys Asp Arg Val Asn Pro Glu Lys Ser Thr
115 120 125
Lys Trp Met Gly Ile Gly Val Ala Val Ser Asp Ser Pro Thr Gly Pro
130 135 140
Phe Lys Asp Ala Ile Gly Ala Pro Leu Trp Thr Asp Pro Tyr Ala Asn
145 150 155 160
Asp Pro Ser Ile Phe Ile Asp Asp Asp Gly Lys Gly Tyr Leu Tyr Phe
165 170 175
His Gly Lys Gly Ala Asp Tyr Leu Val Ala Glu Met Ala Asp Asp Leu
180 185 190
Leu Ser Val Lys Gly Glu Phe His Lys Met Asp Met Gly Gly Tyr Glu
195 200 205
Pro Lys Met Glu Gly Pro Trp Val Phe Lys Arg Glu Gly Met Tyr Tyr
210 215 220
Phe Thr Met Pro Glu Asn Asn Arg Ser Leu Ala Tyr Tyr Met Ala Lys
225 230 235 240
Ser Pro Phe Gly Pro Trp Glu Tyr Lys Gly Ile Phe Met Gln Glu Glu
245 250 255
Gly Gly Asn Asn His His Ser Ile Val Gln Phe Lys Gly Lys Trp Ile
260 265 270
Leu Phe Tyr His Arg Trp Leu Met Gly Glu Gly Glu Cys Lys Lys Lys
275 280 285
Gln Arg His Thr Ala Ala Glu Tyr Leu His Phe Asn Ala Asp Gly Thr
290 295 300
Ile Lys Glu Val Lys Arg Thr Arg Glu Gly Leu Thr Lys
305 310 315
<210> 72
<211> 954
<212> DNA
<213> Microbulbifer degradans
<400> 72
atgtacacat atgtatccgc catagcacta tttatatttt caattgcctc gtcgtgttgt 60
gttgcccaaa acccgctcga ctttggcagt aatattaaaa ccgcagatcc gtctggccat 120
atatgggctg atggcagaat gtacctttac acctcgcacg accaagaatg ccaagaagat 180
ttttatatga aggattggca taccttttcg tccagcgact taataaattg gactgcccac 240
ggcccaagtt tatctgtagc ggatattacg tgggcagata actacgcatg ggcgcccgac 300
gcggcctata aaaatgggaa gtactatttg ttctttccgg cgggaaccgg tgttaaagat 360
agagtaaacc ccgaaaaaag cactaagtgg atgggcattg gtgttgcagt aagcgatagc 420
cctacaggcc cctttaaaga tgcgattggc gcccccttgt ggaccgaccc ctatgccaac 480
gacccaagta tttttataga tgatgacggc aagggctact tatattttca cggtaaaggt 540
gcagactacc tagtagccga aatggcagac gatttactga gtgtaaaagg tgagtttcac 600
aaaatggata tgggcggtta cgagccaaaa atggagggcc cttgggtttt taagcgcgag 660
ggaatgtatt actttaccat gccagaaaac aatcgttcac ttgcttacta tatggcgaaa 720
tctccctttg ggccgtggga atacaagggc atttttatgc aagaagaagg cggtaacaac 780
caccattcta ttgtgcaatt taaaggcaag tggatattgt tttatcaccg ctggttaatg 840
ggcgaaggcg agtgtaaaaa gaagcaacgc cacaccgcag cggaatacct tcactttaat 900
gccgacggca caattaaaga agtaaaaaga acgcgcgagg ggttaactaa gtag 954
<210> 73
<211> 577
<212> PRT
<213> Microbulbifer degradans
<400> 73
Met Lys Ile Lys Cys Leu Leu Leu Ala Val Tyr Ala Gly Leu Leu Ala
1 5 10 15
Ala Cys Ala Leu Asp Ala Pro Leu Lys Thr Ser Ser Lys Pro Leu Ala
20 25 30
His Phe Ser Trp Phe Glu Tyr Gln Gly Asn Asp Glu Ile Phe Lys Ala
35 40 45
Pro Leu Ala Ser Asn Gln Tyr Gln Asn Pro Ile Leu Ala Gly Tyr His
50 55 60
Pro Asp Pro Ser Ile Val Arg Val Gly Glu Asp Tyr Tyr Leu Val Asn
65 70 75 80
Ser Thr Phe Gly Phe Tyr Pro Gly Ile Pro Val Phe His Ser Arg Asp
85 90 95
Leu Val Asn Trp Thr Gln Leu Gly Asn Ala Ile His Arg Pro Glu Gln
100 105 110
Leu Ser Phe Asp Gly Ile His Leu Gly Tyr Asn Gly Val Tyr Ala Pro
115 120 125
Ala Ile Glu Tyr Arg Asp Gly Thr Phe Tyr Val Ile Asn Thr Cys Val
130 135 140
Ala Cys Gly Gly Asn Phe Ile Val Thr Ala Thr Asn Pro Ala Gly Pro
145 150 155 160
Trp Ser Asp Pro Ile Trp Leu Pro Glu Val Ile Gly Ile Asp Pro Ser
165 170 175
Leu Phe Phe Asp Glu Asp Gly Lys Thr Tyr Ile Val His His Arg Asn
180 185 190
Pro Pro Val Gln Lys Tyr Pro Ala His Thr Ala Leu Trp Val Met Glu
195 200 205
Val Asp Ser Lys Thr Phe Ala Pro Val Ser Asp Asp Val Met Leu Val
210 215 220
Asp Gly Gly Asp Glu Ala Pro Trp His Thr Glu Tyr Ile Glu Gly Pro
225 230 235 240
His Ile Tyr Lys Ile Asp Gly Thr Tyr Tyr Leu Tyr Ala Pro Gly Gly
245 250 255
Gly Thr Gly Tyr Phe His Gly Gln Leu Val Tyr Arg Ser Asp Asn Val
260 265 270
Phe Gly Pro Tyr Glu Ala Asn Pro Asn Asn Pro Val Leu Thr Gln Val
275 280 285
Gly Leu Pro Asp Asp Arg Glu His Pro Val Thr Ala Thr Gly His Ala
290 295 300
Asp Leu Phe Gln Asp Thr Asn Gly Asp Trp Trp Thr Val Phe Leu Gly
305 310 315 320
Thr Arg Val Tyr Asp Leu Ala Lys Pro Pro Gln Asp Pro Gly Asn Phe
325 330 335
Ala Thr Gly Arg Glu Thr Phe Met Leu Pro Val Thr Trp Gln Asn Gly
340 345 350
Trp Pro His Val Leu Glu Lys Gly Glu Ala Val Pro Tyr Arg Val Thr
355 360 365
Lys Pro Lys Leu Pro Ala Gly Lys Pro Ala Pro Arg Ala Met Thr Gly
370 375 380
Asn Phe Thr Val Arg Glu Glu Phe Thr Asn Ala Ser Leu Ala Pro His
385 390 395 400
Trp Leu Phe Val Arg Thr Pro Arg Ser Lys Trp Trp Gln Thr Gly Asn
405 410 415
Gly Glu Leu Ile Leu Glu Ala Arg Ala Asp Thr Ile Gly Ala Val Asn
420 425 430
Gln Pro Ser Phe Ile Gly Arg Arg Leu Ala His Met Thr Ala Ser Phe
435 440 445
Ala Thr Gln Leu Thr Phe Asn Pro His Thr Val Gly Asp Glu Ala Gly
450 455 460
Leu Leu Ala Val Gln Asn Asp Glu His Phe Tyr Ala Phe Gly Leu Gly
465 470 475 480
Leu Asn Ser Lys Gly Gln Thr Val Leu Arg Val Arg Lys Lys Ala Gly
485 490 495
Lys Asn Glu Ser Ile Arg Gly Asp Thr Val Ala Glu Gln Val Val Lys
500 505 510
Leu Lys His Gly His Pro Ile Tyr Leu Arg Val Asn Ile Gly Lys Ala
515 520 525
Glu Leu Asn Phe Ala Tyr Ser Thr Asn Gly Lys Arg Tyr Thr Thr Leu
530 535 540
Leu Asn Gln Ala Asp Ala Asn Leu Leu Thr Thr Ala Lys Ala Gly Gly
545 550 555 560
Phe Thr Gly Ala Val Val Gly Met Tyr Ala Glu Ser Thr Ala Gln Gln
565 570 575
Asn
<210> 74
<211> 1734
<212> DNA
<213> Microbulbifer degradans
<400> 74
atgaaaatta agtgcttact ccttgctgtt tacgcgggtc tacttgcggc ttgcgcgctg 60
gacgcgccgc tcaaaacctc aagtaaaccg ctagcgcatt tttcgtggtt tgaatatcaa 120
ggtaacgacg agatatttaa ggctccactc gcctcaaatc aataccaaaa ccccatactc 180
gccggctacc acccagaccc aagtattgtg cgagtaggcg aagattatta tttggtgaac 240
tccacctttg gcttctaccc tggcattcca gtatttcaca gccgtgactt agtgaattgg 300
acccaactgg gtaacgctat tcaccgccca gagcaacttt catttgatgg tattcactta 360
ggctacaacg gcgtttatgc accggcaatc gaataccgcg acgggacctt ttacgtaata 420
aatacctgcg tagcctgcgg aggaaatttt atcgttaccg ccaccaatcc cgcgggcccc 480
tggtcagacc caatatggct accagaggta attggcatag acccctcgct atttttcgac 540
gaggacggca aaacctatat cgtgcatcat cgtaatccac ctgtgcagaa ataccctgcc 600
cacacagccc tgtgggtaat ggaagttgac tccaaaacat ttgcgccggt atctgacgat 660
gtaatgcttg tggacggtgg cgacgaagcg ccatggcaca cagaatatat tgaagggccg 720
catatatata aaattgatgg cacctactac ctctatgccc ctggtggcgg cacgggatac 780
ttccacggcc aattggtgta tagatctgac aatgtatttg gaccctacga agccaacccc 840
aataaccctg tgttgactca agttggttta cccgacgaca gagaacaccc tgtaacggca 900
acgggtcatg cagatttatt tcaagatacc aacggcgact ggtggacggt atttctgggt 960
actcgcgttt acgatttagc taagccacca caagaccccg gcaattttgc caccggacgc 1020
gaaacattta tgttgccagt aacatggcaa aacggctggc cacacgtgct cgaaaaaggc 1080
gaggctgtgc cctaccgagt aaccaaaccc aaattacctg caggcaaacc cgccccgcgc 1140
gcaatgactg gaaactttac tgtgcgcgag gaatttacca acgcttcgct tgccccccac 1200
tggctatttg ttcgcacacc gcgttccaaa tggtggcaaa caggtaatgg cgaacttatt 1260
ttagaagcgc gcgccgatac cattggggca gttaaccagc cgtcgtttat tggccgacgg 1320
ctcgctcata tgacggcctc cttcgccacc caactaacct ttaacccaca caccgttggc 1380
gacgaagcag ggttactcgc cgtacaaaac gacgaacact tttacgcctt tggcctaggg 1440
ttaaacagta aagggcaaac cgttttgcgc gtgcgtaaaa aagcgggtaa aaatgaatcg 1500
ataaggggag atacggttgc cgagcaggtt gttaagctta agcacggcca ccctatttac 1560
ctgcgtgtaa atataggtaa agccgaatta aatttcgcgt atagcaccaa cggcaaacgc 1620
tacaccacct tgttgaacca agccgatgcc aacctactta ccacagctaa agcgggcggg 1680
tttactggcg cagtagtggg tatgtacgcc gaatccaccg cacaacaaaa ctaa 1734
<210> 75
<211> 566
<212> PRT
<213> Microbulbifer degradans
<400> 75
Met Arg Leu Leu Pro Ile Leu Leu Val Ser Leu Leu Pro Leu Leu Ser
1 5 10 15
Ser Cys Thr Ser Ala Ile Asn Gly Gln Gln Asn Ser Gln Thr Ser Pro
20 25 30
Val Phe Asp Trp Phe Glu Tyr Ala Gly Ser Asp Ala Leu Tyr Asn Thr
35 40 45
Val Ala Pro Ser Lys Asn Ala Tyr Thr Asn Pro Val Ile Lys Gly Phe
50 55 60
Tyr Pro Asp Pro Ser Ile Val Arg Val Gly Ala Asp Tyr Tyr Leu Val
65 70 75 80
Asn Ser Ser Phe Gly Tyr Phe Pro Gly Val Pro Ile Phe His Ser Thr
85 90 95
Asp Leu Val Asn Trp Val Gln Ile Gly Asn Ile Leu Glu Arg Pro Ser
100 105 110
Gln Leu Gln Ile Pro Ser Gly Met Gly Val Ser Arg Gly Ile Phe Ala
115 120 125
Pro Thr Leu Arg His His Asn Gly Ile Phe Tyr Met Ile Thr Thr Met
130 135 140
Val Asp Gly Gly Gly Asn Phe Ile Val Thr Ala Lys Asn Pro Ala Gly
145 150 155 160
Pro Trp Ser Asp Pro Val Trp Leu Pro Glu Val Gly Gly Ile Asp Pro
165 170 175
Asp Leu Phe Phe Asp Asp Asn Gly Lys Ala Tyr Ile Leu Asn Asn Asp
180 185 190
Ala Pro Ile Gly Glu Pro Leu Tyr Asp Gly His Arg Ala Ile Trp Ile
195 200 205
Arg Glu Phe Asp Leu Ala Thr Leu Lys Thr Val Gly Asp Ala Lys Leu
210 215 220
Ile Val Asn Gly Gly Val Asp Ile Thr Thr Lys Pro Val Trp Ile Glu
225 230 235 240
Gly Pro His Leu Phe Lys Asn Lys Gly Ala Tyr Tyr Leu Ile Asn Ala
245 250 255
Glu Gly Gly Thr Ser Val Asn His Ser Gln Val Val Phe Lys Ala Gln
260 265 270
Ser Pro Trp Gly Pro Tyr Ile Pro Trp Glu Asn Asn Pro Ile Leu Thr
275 280 285
Gln Arg His Leu Pro Ala Asp Arg Ala Asn Pro Val Thr Ser Val Gly
290 295 300
His Val Asp Leu Val Gln Thr Gln His Gly Asp Trp Trp Ala Val Phe
305 310 315 320
Leu Gly Cys Arg Pro Tyr Lys Asp Asn Tyr Tyr Asn Thr Gly Arg Glu
325 330 335
Thr Phe Leu Leu Pro Val Asp Trp Ser Gly Glu Tyr Pro Val Ile Leu
340 345 350
Arg Gly Asp Ala Glu Val Pro Tyr His His Gln Arg Pro Gln Leu Gly
355 360 365
Ala Ser Gln Gln Pro Ala Ile Ala Leu Ser Gly Asn Phe Ile Glu Arg
370 375 380
Asp Glu Phe Asp Ser Ala Leu Lys Leu Tyr Trp Arg Lys Val Arg Thr
385 390 395 400
Pro Thr Asn Asn Phe Thr Asp Leu Thr Ser Gln Lys Gly Lys Leu Val
405 410 415
Leu Thr Ala Asn Asn Thr Asp Leu Ser Asp Phe Gly Ser Pro Ala Phe
420 425 430
Ile Ala Arg Ala Gln Gln His Leu Thr Gly Ser Ala Thr Thr Lys Leu
435 440 445
Val Tyr Thr Pro Pro His Val Gly Asp Lys Ala Gly Ile Ala Ala Phe
450 455 460
Gln Asn Asp Glu Tyr Phe Tyr Ala Leu Thr Val Thr Lys Asn Asn Ser
465 470 475 480
Gly Leu Ala Ile Gln Leu Glu Lys Gln Leu Gly Lys Asn Lys Glu Ile
485 490 495
Val Ala Gln Tyr Pro Leu Gln Glu Lys Thr Leu Arg Asn Gly Leu Tyr
500 505 510
Leu Lys Ile Glu Phe Asn Asn Asp Lys Tyr Asp Phe Ser Tyr Ser Thr
515 520 525
Asn Asn Thr Lys Trp Gln Ser Val Gly Glu Thr Gln Asp Gly Thr Ile
530 535 540
Leu Ser Thr Gln Ser Ala Gly Gly Phe Val Gly Ala Thr Leu Gly Ile
545 550 555 560
Phe Ala Tyr Thr Ala His
565
<210> 76
<211> 1701
<212> DNA
<213> Microbulbifer degradans
<400> 76
atgcgacttt tacctatctt actcgttagc ttacttccac tgctctcaag ctgcacaagc 60
gccataaacg ggcaacaaaa tagccaaacc tcgcctgtat ttgattggtt tgaatacgcg 120
ggaagcgatg ctttatacaa cacggttgcg ccaagtaaaa atgcctatac caacccagta 180
ataaaagggt tttatccaga tccaagcatt gtaagagtgg gagcagatta ctacctcgtg 240
aactcttcat ttggctactt ccctggcgtg ccgatatttc atagcacaga tttagtgaat 300
tgggttcaaa taggtaatat tctcgagcgc ccatcacaat tacaaatacc cagcggcatg 360
ggtgtgtcgc gaggtatatt cgccccaaca ctgcgccacc acaacggtat tttttacatg 420
attactacaa tggtagacgg tggcggcaat tttattgtta ctgcaaaaaa ccccgcaggc 480
ccttggtcgg acccagtatg gttacctgaa gtgggcggta tagacccaga tttatttttt 540
gatgacaacg gcaaagccta catacttaac aacgacgccc ccattggcga gccgctttac 600
gatggccacc gagccatttg gattcgcgaa ttcgacttag ccacattaaa aaccgttggc 660
gacgccaagt taatagtaaa cggcggtgta gatataacta ccaaacccgt ttggatagaa 720
ggcccacacc ttttcaaaaa taaaggcgct tactatttaa ttaatgcaga aggtggcacc 780
agcgtgaatc acagccaagt tgtatttaaa gcgcaaagcc cttgggggcc gtatattcct 840
tgggaaaaca atccaatttt aacacagcgc catttaccgg ctgatcgcgc caaccccgtc 900
acatccgttg gccatgtcga tttagtacaa actcaacatg gcgactggtg ggcggtattt 960
ttaggctgca ggccctataa agataactac tacaataccg gccgcgaaac atttttatta 1020
ccggtagatt ggtctggcga ataccccgtc attcttcgcg gcgatgccga ggtgccctat 1080
catcaccaac gcccccaatt gggagcatcc caacaaccag ccattgccct tagcggtaac 1140
tttattgagc gcgatgaatt tgactcagca cttaaacttt attggcgcaa ggttcgcacc 1200
cccacaaaca actttacaga tttaacctct caaaaaggca agcttgtttt aactgcaaac 1260
aatacagatt taagcgactt tggatcacca gcatttattg cgcgcgcaca gcagcaccta 1320
acaggcagcg caacaaccaa actggtttac acacccccac acgtgggcga caaagcgggt 1380
attgctgcct ttcaaaacga tgagtatttt tatgcgctta ccgttacaaa aaataatagc 1440
ggccttgcca tacaactaga aaaacaactt ggcaagaaca aagaaattgt tgcgcaatat 1500
ccactacaag aaaaaacgct tcgcaatggc ttatatttga aaatagaatt taataacgac 1560
aaatatgatt tcagctattc cacaaataac accaagtggc aatcggtagg cgaaacacaa 1620
gatggaacta tattaagcac gcaaagtgca ggcgggtttg taggtgccac gctaggtata 1680
tttgcatata ccgcgcacta a 1701
<210> 77
<211> 319
<212> PRT
<213> Microbulbifer degradans
<400> 77
Met Ser Met Phe Asn Lys Lys Thr Leu Ala Ala Gly Ile Val Ala Ala
1 5 10 15
Cys Leu Thr Asn Val Ser Ala Ser Tyr Ala Ala Asn Pro Ala Ile Thr
20 25 30
Asp Thr His Thr Ala Asp Pro Ala Ala Leu Val His Gly Asp Thr Val
35 40 45
Tyr Leu Tyr Val Gly Asn Asp Glu Ala Lys Asp Asn Arg Val Phe Tyr
50 55 60
Asp Leu Lys Lys Trp Leu Val Tyr Ser Ser Lys Asp Met Val Asn Trp
65 70 75 80
Thr Asn His Gly Ser Pro Leu Ala Ala Thr Asp Phe Lys Trp Ala Ser
85 90 95
Gly Asp Ala Trp Ala Ala His Thr Val Glu Lys Asp Gly Lys Phe Tyr
100 105 110
Trp Tyr Thr Thr Val Arg His Ala Thr Ile Asn Gly Phe Ala Ile Gly
115 120 125
Val Ala Val Ser Asp Ser Pro Thr Gly Pro Phe Lys Asp Ala Leu Gly
130 135 140
Lys Ala Leu Ile Ser Asn Asp Met Thr Thr Asp Thr Asp Ile Asp Trp
145 150 155 160
Asp Asp Ile Asp Pro Ala Val Phe Ile Asp Asp Asp Gly Gln Ala Tyr
165 170 175
Ile Phe Trp Gly Asn Thr Lys Pro Arg Trp Ala Lys Leu Lys Pro Asn
180 185 190
Met Ile Glu Leu Asp Gly Pro Ile His Ala Ile Asp Ile Pro His Phe
195 200 205
Thr Glu Ala Leu Tyr Val His Lys His Gly Glu Tyr Tyr Tyr Leu Ser
210 215 220
Tyr Ala Thr Gly Phe Pro Glu Lys Thr Ala Tyr Ala Met Ser Lys Ser
225 230 235 240
Ile Glu Gly Pro Trp Glu Tyr Lys Gly Ile Leu Asn Glu Leu Ala Gly
245 250 255
Asn Ser Asn Thr Asn His Gln Ser Val Ile Asp Phe Lys Gly Lys Ser
260 265 270
Tyr Phe Ile Tyr His Asn Gly Gly Leu Gly Gln Asp Gly Gly Ser Phe
275 280 285
Arg Arg Ser Val Cys Ile Asp Tyr Leu Asn Tyr Asn Ala Asp Gly Thr
290 295 300
Ile Lys Arg Ile Val Met Thr Ser Glu Gly Val Asp Pro Val Lys
305 310 315
<210> 78
<211> 960
<212> DNA
<213> Microbulbifer degradans
<400> 78
gtgagtatgt ttaataaaaa aacactagca gccggtattg tagctgcatg tttaactaac 60
gtaagtgcaa gctatgctgc caaccccgca attaccgata ctcacacggc cgatcccgct 120
gcgttagtgc acggcgatac cgtttatttg tacgtgggta acgatgaagc gaaggataac 180
cgcgtatttt acgatcttaa aaaatggttg gtgtattcat caaaagatat ggtgaactgg 240
accaatcacg gttcgccgtt agctgcaacg gattttaagt gggccagcgg cgatgcgtgg 300
gcggcgcaca cggtagaaaa agatggcaag ttttattggt ataccacggt gcgtcacgca 360
accattaatg gttttgccat tggcgttgca gtaagtgata gccctacagg gccattcaaa 420
gatgctttgg gtaaagcact aataagtaat gacatgacca ccgataccga tattgattgg 480
gacgatatag acccagcagt atttattgac gacgatggcc aagcgtatat tttttggggc 540
aacaccaaac cgcgctgggc caagttaaaa cccaatatga ttgaactaga tggacctatt 600
cacgcaatcg atattccaca ctttaccgaa gcgctatacg tgcacaaaca cggtgaatat 660
tactacttaa gctatgcgac aggctttcca gaaaaaacag cttacgctat gagcaaatct 720
atagaagggc cgtgggaata caaaggcatt cttaatgaat tggctggtaa ctcaaatact 780
aatcaccaat ctgtcatcga ttttaagggc aagtcatact ttatttatca caatggtggc 840
ttgggtcaag atggcggtag cttccgtcgc agtgtatgta tcgattattt gaactacaac 900
gcggatggta ctatcaagcg aattgtaatg acatcagaag gtgtagaccc agttaaataa 960
<210> 79
<211> 385
<212> PRT
<213> Microbulbifer degradans
<400> 79
Met Pro Glu His Thr Arg Lys Arg Leu Leu Ser Thr Leu Gly Leu Ala
1 5 10 15
Leu Ser Gly Thr Ala Ile Thr Leu Thr Leu Val Gly Cys Gly Lys Asp
20 25 30
Asn Pro Ala Thr Gln Thr Glu Gly Ser His Ser Ala Gly His Thr Glu
35 40 45
Val Ala Ala Glu Gln Thr His Asp Ile Gly Gly Pro Gly Pro Glu Gly
50 55 60
Lys Pro Ile Asn Asp Pro Leu Val Thr His Ile Tyr Thr Ala Asp Pro
65 70 75 80
Ser Ala His Val Phe Asp Gly Lys Leu Tyr Ile Tyr Pro Ser His Asp
85 90 95
Val Glu Ala Gly Ile Pro Gln Asn Asp Asn Gly Asp His Phe Asp Met
100 105 110
Arg Asp Tyr His Val Leu Ser Met Glu Glu Pro Gly Gly Lys Val Thr
115 120 125
Asp His Gly Val Ala Leu Ala Arg Glu Asp Val Ala Trp Ala Gly Arg
130 135 140
Gln Leu Trp Ala Pro Asp Ala Ala Glu Lys Asp Gly Thr Tyr Tyr Leu
145 150 155 160
Tyr Phe Pro Met Lys Asp Lys Asp Asp Ile Phe Arg Ile Gly Val Ala
165 170 175
Ser Gly Ser Thr Pro Tyr Gly Pro Phe Lys Ala Glu Pro Glu Pro Met
180 185 190
Pro Gly Ser Tyr Ser Ile Asp Pro Ser Val Phe Gln Asp Gly Asp Asp
195 200 205
Tyr Tyr Met Tyr Ile Gly Gly Ile Trp Gly Gly Gln Leu Gln Arg Trp
210 215 220
Thr Thr Gly Glu Tyr Asn Pro Glu Asp Val Tyr Pro Ala Asp Asp Glu
225 230 235 240
Pro Ala Leu Leu Pro Lys Met Ala Lys Leu Ser Ala Asp Met Lys Ser
245 250 255
Phe Ala Glu Pro Leu Arg Asp Ile Gln Ile Leu Asp Glu Asn Gly Glu
260 265 270
Leu Ile Lys Ala Gly Asp Asn Asp Arg Arg Phe Phe Glu Ala Ala Trp
275 280 285
Val His Lys Tyr Asn Gly Lys Tyr Tyr Leu Ser Tyr Ser Thr Gly Asp
290 295 300
Thr His Tyr Ile Val Tyr Ala Ile Gly Asp Asn Pro Tyr Gly Pro Phe
305 310 315 320
Thr Tyr Gln Gly Val Val Leu Asn Pro Val Ile Gly Trp Thr Asn His
325 330 335
His Ser Ile Ala Glu Phe Lys Gly Lys Trp Tyr Leu Phe Tyr His Asp
340 345 350
Ser Ser Leu Ser Gly Gly Val Thr His Leu Arg Ser Val Lys Met Thr
355 360 365
Glu Leu Thr His Asn Pro Asp Gly Thr Ile Gln Thr Ile Asn Ala Tyr
370 375 380
Lys
385
<210> 80
<211> 1158
<212> DNA
<213> Microbulbifer degradans
<400> 80
atgccagaac atacgcgtaa gcgcttacta tcaaccttag gcctagcttt atcgggcaca 60
gctataaccc taacgcttgt ggggtgcggt aaagacaacc ccgcaactca aacagaaggc 120
agccacagcg ctggccatac agaagttgcc gcagaacaaa cacacgacat aggcggccca 180
ggccctgagg gcaagccaat taacgacccg cttgttaccc acatatacac cgcagaccct 240
tctgcccatg tgtttgacgg caaactttat atttacccat cgcacgatgt ggaagcgggt 300
attccgcaaa acgataacgg cgatcacttc gatatgcgcg attatcacgt gctttccatg 360
gaagagcctg gtggcaaagt caccgatcac ggcgtagccc ttgcgcgcga agatgtagct 420
tgggctggtc gccaactgtg ggcgcccgat gcggctgaaa aagacggcac ttactacctg 480
tatttcccca tgaaagataa ggatgacatc ttccgcattg gtgtcgccag tggcagtacc 540
ccttatggcc catttaaagc cgagccagag ccaatgcccg gcagctatag catagaccca 600
agcgtatttc aggatggcga cgactactac atgtacatag gtggtatttg gggcggccag 660
ttgcagcgtt ggacaaccgg tgagtacaac ccagaagatg tatacccagc ggatgacgag 720
cctgcgctat tacctaaaat ggccaagcta agtgcagata tgaaaagctt tgccgagcca 780
ttaagagaca ttcaaatttt ggatgaaaat ggcgagctaa ttaaagctgg cgataacgac 840
cgacgtttct tcgaagccgc gtgggtacac aaatataacg gcaagtatta cttgagctat 900
tcaaccggtg acacccacta tattgtgtat gccattggcg ataacccata cggcccgttt 960
acttaccagg gtgtagtgct caaccccgtt attggttgga ctaaccatca ctcaattgct 1020
gaatttaaag gtaagtggta tttgttctac cacgatagtt cgctttccgg tggtgtaaca 1080
catttgcgca gcgtgaaaat gacagagcta actcacaacc cagatggcac tatccaaacc 1140
attaatgcct ataagtaa 1158
<210> 81
<211> 738
<212> PRT
<213> Microbulbifer degradans
<400> 81
Met Ala Thr Leu Gly Val Asn Ala Ala Lys Phe Ala Met Phe Ala Ala
1 5 10 15
Ile Cys Leu Gln Phe Ser Val Ala Glu Ala Ala Lys Ser Arg Asp Gly
20 25 30
Tyr Gly Leu Trp Leu Asp Tyr Gln Pro Ile Thr Asn Thr Arg Glu Arg
35 40 45
Glu Gly Tyr Ile Lys Ala Leu Ser Pro Trp Gln Val Glu Gly Glu Ala
50 55 60
Ala Thr Ala Asp Phe Ile Arg Gln Glu Leu Thr Ala Ala Leu Gly Ala
65 70 75 80
Met Leu Gly Val Glu Ala Gly Pro Val Gly Asp Tyr Thr His Asn Ser
85 90 95
Leu Ala His Pro Val Ala Arg Leu Leu Val Ala Thr Pro Glu Glu Ser
100 105 110
Ala Val Ile Arg Ser Leu Ala Leu Gly Asp Ala Leu Thr Arg Val Gly
115 120 125
Gln Glu Gly Tyr Leu Ile Lys Thr Thr Arg Tyr Arg Asp Lys Pro Ile
130 135 140
Thr Ile Val Thr Ala Asn Thr His Ala Gly Leu Leu Tyr Gly Thr Phe
145 150 155 160
Lys Leu Leu Gln Leu Leu Gln Thr Gly Gln Ala Val Ser Asn Leu Ala
165 170 175
Ile Glu Ser Ala Pro Ala Thr Lys Leu Arg Val Leu Asn His Trp Asp
180 185 190
Asn Leu Asp Arg Tyr Val Glu Arg Gly Tyr Ala Gly Glu Ser Ile Trp
195 200 205
Asn Trp His Lys Leu Pro His Tyr Lys Ser Gln Arg Tyr Tyr Asp Tyr
210 215 220
Ala Arg Ala Asn Ala Ser Ile Gly Ile Asn Gly Val Val Leu Asn Asn
225 230 235 240
Val Asn Ala Asp Pro Leu Ile Leu Thr Pro Gln Tyr Leu Val Lys Val
245 250 255
Lys Ala Leu Ala Asp Ile Phe Arg Pro Tyr Gly Ile Lys Val Tyr Leu
260 265 270
Ser Val Lys Phe Ser Ser Pro Asn Leu Ile Gly Gly Leu Pro Thr Ser
275 280 285
Asp Pro Leu Asp Lys Asn Val Gln Ala Trp Trp Gln Ala Lys Ala Asn
290 295 300
Glu Ile Tyr Ser Leu Ile Pro Asp Phe Gly Gly Phe Leu Val Lys Ala
305 310 315 320
Asn Ser Glu Gly Gln Pro Gly Pro Gly Asp Phe Gly Arg Ser His Ala
325 330 335
Gln Gly Ala Asn Met Leu Ala Asp Ala Leu Ala Pro His Gly Gly Asn
340 345 350
Val Met Trp Arg Ala Phe Val Tyr Asn Val Glu Ala Asn Val Glu Arg
355 360 365
Ser Lys Gln Ala Tyr Asn Glu Phe Lys Pro Leu Asp Gly Thr Phe Arg
370 375 380
Gln Asn Val Leu Val Gln Val Lys Asn Gly Pro Ile Asp Phe Gln Pro
385 390 395 400
Arg Glu Pro Phe Ser Pro Leu Phe Gly Ala Met Pro Lys Thr Pro Leu
405 410 415
Met Met Glu Phe Gln Ile Thr Gln Glu Tyr Leu Gly Phe Ser Thr His
420 425 430
Leu Val Tyr Leu Gly Pro Leu Tyr Glu Glu Val Leu Lys Ala Asp Thr
435 440 445
Tyr Ala Lys Gly Ala Gly Ser Thr Val Ala Lys Val Val Asp Gly Ser
450 455 460
Leu Tyr Gly His Gly Ile Thr Gly Met Ala Gly Val Ala Asn Ile Gly
465 470 475 480
Ser Asp Arg Asn Trp Thr Gly His Ile Phe Gly Gln Ala Asn Trp Tyr
485 490 495
Val Phe Gly Gln Leu Ala Trp Asn Pro Glu Val Ser Thr Lys Gln Ile
500 505 510
Ala Asp Asp Trp Ile Arg Met Thr Leu Thr Arg Asp Asp Lys Ala Val
515 520 525
Asn Thr Ile Arg Ala Met Met Met Ala Ser Arg Glu Thr Ala Val Asn
530 535 540
Tyr Met Thr Pro Leu Gly Leu His His Ile Met Gly Trp Gly His His
545 550 555 560
Tyr Gly Pro Ala Pro Trp Ile Gly Glu Gln Lys Pro Asp Trp Met Arg
565 570 575
Glu Asp Trp Thr Ser Val Tyr Tyr His Ser Ala Asn Ala Thr Gly Leu
580 585 590
Gly Lys Asp Arg Thr Ala Ser Gly Ser Asn Val Ile Ala Gln Tyr His
595 600 605
Ala Pro Leu Arg Gln Ala Tyr Ser Asp Pro Lys Thr Thr Pro Thr Glu
610 615 620
Leu Leu Leu Trp Phe His His Leu Pro Trp His Tyr Glu Leu Ala Asn
625 630 635 640
Gly Asn Ser Leu Trp His Glu Leu Val Ala Arg Tyr Tyr Leu Gly Ala
645 650 655
Gln Ala Val Ala Glu Met Ala Lys Thr Trp Asp Gly Leu Glu Ala Asn
660 665 670
Ile Pro Pro Gln Leu Phe Lys Gln Val Gln Met Ala Leu Ala Ile Gln
675 680 685
Thr Gln Glu Ala Ala Trp Trp Arg Asp Ala Cys Val Leu Tyr Phe Gln
690 695 700
Ser Tyr Ser Lys Gln Ser Leu Pro Glu Gly Phe Ala Lys Pro Lys His
705 710 715 720
Ser Leu Glu Tyr Tyr Lys Gly Leu Ser Phe Pro His Ala Pro Gly Asp
725 730 735
Gly Arg
<210> 82
<211> 2217
<212> DNA
<213> Microbulbifer degradans
<400> 82
atggctactt tgggggtaaa tgccgctaag tttgccatgt ttgcagctat ttgcttgcag 60
tttagtgtcg ccgaagcggc taaaagccgc gatggatatg ggctgtggtt agattaccag 120
ccaattacca atacccgcga acgcgagggc tatataaaag cattaagccc atggcaggta 180
gaaggcgaag ctgcaactgc cgattttatt cggcaagagc ttactgcagc gttgggcgct 240
atgcttggcg ttgaggctgg tccagtgggt gattacaccc ataactccct cgctcaccct 300
gtggcgcggc tattggttgc aactccagaa gaaagcgctg ttattcgctc tttggcttta 360
ggcgatgctt taactcgagt agggcaagag gggtacctta ttaaaaccac gcgttaccgt 420
gacaagccta tcaccattgt taccgcgaac acgcatgcag gcctgctgta tggcacattc 480
aaactactgc agctgctgca aacagggcag gccgtttcta atttagctat tgagtccgcc 540
ccagcaacca aactgcgtgt gcttaaccac tgggataacc tcgatcgcta tgtggagcgc 600
ggctatgccg gtgagtctat ttggaactgg cacaagctgc cgcactacaa atcgcagcgc 660
tactacgatt acgctcgcgc taacgcgtcc attggtatta acggtgtggt actaaacaat 720
gttaacgccg accccttaat tcttaccccg cagtaccttg taaaagtaaa agcactggca 780
gatattttta ggccctacgg cattaaagtt tatctttcgg tgaagtttag ctcgccgaat 840
cttattggcg ggctgccaac atccgacccg ttagataaaa atgtgcaagc ttggtggcaa 900
gcgaaagcga atgaaattta ctcgctcatt cccgactttg gtggcttttt agtaaaagcg 960
aattcggaag ggcagcccgg cccaggggac tttggccgca gccatgcaca aggggcaaat 1020
atgttggccg atgcactggc accccatggc ggcaatgtaa tgtggcgcgc gtttgtatat 1080
aacgtagaag ccaatgtgga gcgatccaag caggcataca acgaatttaa gccattagac 1140
ggtaccttta ggcaaaacgt attggtgcaa gtaaaaaatg ggccaattga ttttcagcca 1200
cgtgaaccgt ttagcccgct gtttggtgct atgcccaaaa cgccgttaat gatggagttt 1260
caaattactc aggagtactt ggggtttagt actcaccttg tttacttggg gccgctgtac 1320
gaagaagtac ttaaggccga tacctatgcg aagggggcag ggtctactgt tgcgaaggtg 1380
gtcgatggct cgctctacgg gcacggtata acgggtatgg ctggggtagc taatattggc 1440
agcgatcgca attggaccgg ccatattttc ggccaagcca actggtatgt atttggccaa 1500
ttggcgtgga accccgaggt aagcactaag caaatagccg atgattggat tcgcatgaca 1560
ctcacccgcg acgataaagc ggtaaacacc attcgcgcaa tgatgatggc cagccgcgaa 1620
acggcggtta actacatgac gcccctgggg ctgcatcaca ttatggggtg ggggcaccac 1680
tacggcccag cgccgtggat aggcgagcaa aaacccgatt ggatgcgtga agattggaca 1740
tctgtttact atcatagcgc aaacgccaca gggctaggca aagatagaac agcttctggc 1800
agcaatgtca tagcgcaata ccacgcccct ttacggcagg cctatagcga cccgaaaacc 1860
acgcccaccg agttgctatt gtggtttcat catttgcctt ggcattatga attagcgaat 1920
ggcaatagcc tgtggcatga actggtagcg cgttactatt taggcgcgca ggctgtggca 1980
gaaatggcca aaacgtggga tggcctagaa gctaatatcc ccccgcagct attcaaacaa 2040
gtacaaatgg cgctggctat tcaaacccaa gaagccgcgt ggtggcgcga tgcctgcgtg 2100
ctgtattttc aaagctattc taagcagtcg ctacccgagg gctttgcaaa acctaagcac 2160
tcgctcgaat actataaagg gttaagcttc ccgcatgcgc cgggtgacgg gcgttaa 2217
<210> 83
<211> 1316
<212> PRT
<213> Microbulbifer degradans
<400> 83
Met Arg Asn Lys Leu Gly Ser Met Leu Lys Met Ser Ala Ala Ile Gly
1 5 10 15
Gly Leu Val Ala Ala Gly Ser Ala Val Ala Gly Pro Val Gly Phe Ala
20 25 30
Ser Leu Asn Gly Gly Thr Thr Gly Gly Ala Gly Gly Gln Val Val Tyr
35 40 45
Ala Ser Thr Gly Ala Glu Ile Asn Gln Ala Met Cys Asn Arg Ala Ser
50 55 60
Asp Asp Thr Pro Leu Ile Ile Tyr Val Thr Gly Thr Ile Asn His Gly
65 70 75 80
Asn Thr Ala Lys Tyr Ser Gly Ser Cys Asp Thr Thr Ala Asp Glu Ile
85 90 95
Gln Phe Lys Gly Val Lys Asn Ile Ser Leu Ile Gly Thr Gly Ser Gly
100 105 110
Ala Val Phe Asp Gln Ile Gly Ile His Leu Arg Asp Thr Ser Asn Ile
115 120 125
Ile Leu Gln Asn Leu His Ile Lys Asn Val Lys Lys Ser Gly Ser Pro
130 135 140
Thr Ser Asn Gly Gly Asp Ala Ile Gly Met Glu Ser Gly Val Tyr Asn
145 150 155 160
Val Trp Val Asp His Cys Glu Leu Glu Ala Ser Gly Gly Glu Ser Asp
165 170 175
Gly Tyr Asp Ser Leu Leu Asp Met Lys Ala Thr Thr Gln Tyr Val Thr
180 185 190
Val Ser Tyr Thr Tyr Tyr His Asp Ser Gly Arg Gly Gly Leu Met Gly
195 200 205
Ser Ser Asp Ser Asp Asp Thr Asn Thr Phe Val Thr Phe His His Asn
210 215 220
Tyr Tyr Glu Asn Met Asp Ser Arg Leu Pro Leu Leu Arg His Gly Thr
225 230 235 240
Ala His Ala Phe Asn Asn Tyr Tyr Asn Gly Ile Ala Lys Ser Gly Met
245 250 255
Asn Pro Arg Ile Gly Gly Gln Ile Lys Ala Glu Asn Asn Tyr Phe Glu
260 265 270
Asn Ala His Asn Pro Ile Gly Thr Phe Tyr Thr Asp Asp Met Gly Tyr
275 280 285
Trp Asp Leu Arg Gly Asn Ile Phe Gly Ser Asn Val Thr Trp Ala Ser
290 295 300
Ala Asp Asp Glu Thr Pro Ala Gly Pro Asn Pro Thr Ser Thr Thr Ser
305 310 315 320
Ile His Ile Ser Tyr Pro Tyr Asp Leu Asp Asp Ala Ala Cys Val Pro
325 330 335
Asp Ile Val Lys Ser Thr Ala Gly Val Gly Thr Gly Leu Ala Val Ser
340 345 350
Asp Gly Ser Cys Thr Ile Thr Thr Pro Pro Ser Thr Ser Ser Ser Ser
355 360 365
Ser Ser Ser Ser Ser Thr Ser Ser Thr Gly Ser Ser Ser Ser Ser Ser
370 375 380
Ser Ser Ser Ser Ser Ser Ser Ser Ser Asn Gly Gly Ser Leu Val Leu
385 390 395 400
Gly Asn Asn Leu Ser Ile Gly Ala Gly Ser Asp Gly Ser Ser Lys Gly
405 410 415
Ala Gly Ser Tyr Gly Asn Val Arg Asp Gly Asp Val Ser Ser Tyr Trp
420 425 430
Ala Pro Ser Gly Ser Thr Gly Arg Val Ser Ile Lys Trp Ser Gly Ser
435 440 445
Gln Thr Val Asn Ala Ile Val Ile Lys Glu Ala Ala Gly Tyr Glu Gly
450 455 460
Asn Ile Ser Gly Trp Gln Val Thr Asp Asn Asp Thr Gly Ala Val Leu
465 470 475 480
Ala Ala Gly Ser Ser Val Gly Thr Ile Thr Phe Asp Ala Val Thr Thr
485 490 495
Ser Lys Ile Asn Phe Glu Ile Thr Ser Ser Asn Gly Thr Pro Thr Val
500 505 510
Ala Glu Phe Glu Thr Tyr Asn Ala Thr Gly Ser Ser Ser Ser Ser Ser
515 520 525
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
530 535 540
Ser Ser Ser Ser Ser Ser Ser Thr Gly Gly Thr Ala Thr Leu Ser Thr
545 550 555 560
Thr Val Ser Gly Asp Gln Val Thr Leu Asn Trp Ser Val Asn Asn Ala
565 570 575
Thr Val Thr Gly Gln Gln Ile Tyr Arg Asp Val Asp Ser Asp Pro Ala
580 585 590
Gly Arg Val Arg Ile Ala Ser Gly Val Thr Gly Asn Thr Tyr Thr Asp
595 600 605
Thr Gly Leu Ala Asn Gly Thr Tyr Tyr Tyr Trp Val Lys Val Thr Asp
610 615 620
Ser Asn Ser Ala Thr Ile Asn Ser Asn Tyr Ser Glu Ala Gln Val Asn
625 630 635 640
Val Tyr Thr Thr Ser Thr Thr Thr Phe Glu Glu Asp Ala Gly Tyr Cys
645 650 655
Ser Val Asp Gly Ser Val Asp Ser Asn Asn Ser Gly Phe Ala Gly Ser
660 665 670
Gly Phe Ala Asn Thr Asp Asn Ala Ser Gly Asn Gly Val Asn Tyr Ala
675 680 685
Val Ser Val Pro Val Ala Gly Val Tyr Thr Leu Gln Val Arg Phe Ala
690 695 700
Asn Gly Ser Ser Ala Arg Pro Ala Asp Val Leu Val Asn Tyr Gly Asn
705 710 715 720
Ala Gly Val Phe Asp Leu Pro Ser Thr Gly Ser Trp Thr Ser Trp Ser
725 730 735
Asn Ser Asn Glu Ile Ser Val Asn Leu Val Ala Gly Asn Asn Ile Ile
740 745 750
Arg Leu Glu Ala Thr Thr Ser Gly Gly Leu Ala Asn Ile Asp Ser Leu
755 760 765
Ser Val Thr Gly Val Glu Pro Ser Ala Gly Asp Cys Asn Gly Ser Val
770 775 780
Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Thr Ser
785 790 795 800
Ser Ser Ser Thr Ser Ser Gly Gly Ser Ser Thr Ser Ser Ser Ser Thr
805 810 815
Ser Ser Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Thr Ser Ser Ser
820 825 830
Ser Thr Ser Ser Thr Ser Ser Ser Ser Gly Gly Gly Thr Ala Ser Cys
835 840 845
Glu Gln Leu Ile Asn Asp Pro Ser Val Asn Trp Asp Glu Ser Ala Leu
850 855 860
Ala Ser Glu Gln Glu Ile Val Ala Cys Leu Ala Gln Ser Leu Gly Ser
865 870 875 880
Pro Val Gly Phe Gly Glu Gly Thr Thr Gly Gly Tyr Asp Pro Ser Gly
885 890 895
Gly Ser Asn Leu Val Val Ile Lys Lys Asn Ile Gly Ile Ser Val Glu
900 905 910
Gln Gln Ile Leu Asp Ala Ile Ser Thr Glu Asn His Asn Trp Ile Val
915 920 925
Phe Asp Lys Asp Asp Phe Ala Ala Arg Thr Ala Val Ala Met Tyr Arg
930 935 940
Leu Asp Cys Asp Asn Ala Asp Val Arg Ser Ala Leu Gly Gly Ala Ser
945 950 955 960
Ala Ala Gln Cys Arg Asp His Ile Ala Trp Cys Ser Ala Asn Gly Ile
965 970 975
Ser Asp Glu His Asp Cys Glu Asn Glu Phe Phe Asn Asn Arg Leu Asn
980 985 990
Asp Ser Asp Leu Pro Ile Arg Asn Gln Met Ile Gln Ser Asn Thr Thr
995 1000 1005
Ile Asp Gly Arg Gly Ala Asn Ala Tyr Phe Phe Phe Asn Gly Phe Ser
1010 1015 1020
Ile Gly Lys Asp Ser Ser Gly Ala Ser Leu Tyr Ala Ala Gln Asn Val
1025 1030 1035 1040
Ile Val Thr Asn Asn Glu Phe Ile Gly Ala Gly His Thr Glu Asp His
1045 1050 1055
Asp Leu Asp Pro Asp Met Ile Arg Ser Thr Gly Glu Ser Asn Lys Ile
1060 1065 1070
Trp Ile His Gln Asn Thr Phe Asp His Thr Gly Asp Ser Ala Phe Asp
1075 1080 1085
Val Lys Val Gly Ala Tyr Asp Ile Thr Ile Ser Phe Asn Lys Leu Val
1090 1095 1100
Asn Val Lys Arg Ala Ala Leu His Gly Ser Ser Asp Ser Arg Ala Ile
1105 1110 1115 1120
Asn Ser Gln Ile Thr Thr Thr Met His Asn Asn Leu Phe Tyr Thr Ser
1125 1130 1135
Asp Asp Gln Tyr Ala Leu Ser Thr Tyr Asp Thr Leu Arg Arg Val Pro
1140 1145 1150
Leu Met Arg Arg Gly Gln Ser His Met Phe Asn Asn Val Phe Tyr Gly
1155 1160 1165
Tyr Arg Lys Asp Ile Leu Ser Val Arg Val Gly Gly Arg Ile Ala Phe
1170 1175 1180
Glu Asp Asn Ile Ile Leu Asn Lys Glu Ser Ser Ser Thr Pro Gly Asp
1185 1190 1195 1200
Gly Leu Lys Lys Gly Asp Asp Met Glu Tyr Tyr Val Glu Thr Leu Leu
1205 1210 1215
Arg Asp Phe Arg Glu Gly Gly Leu Glu Ile Ser Gly Ser Tyr Val Ser
1220 1225 1230
Phe Ala Asp Ser Ala Cys Asn Ser Tyr Gly Ala Ser Gly Asp Leu Thr
1235 1240 1245
Ala Ser His Gly Ala Thr Pro Asp Met Phe Asp Asp Tyr Ser Ser Ala
1250 1255 1260
Ser Lys Asn Thr Ile Ser Ala Asn Arg Phe Val Ala Gly Asp Asp Leu
1265 1270 1275 1280
Thr Asp Tyr Val Phe Ala Thr Ala Gly Lys Gly Gly Lys Ala Pro Tyr
1285 1290 1295
Val Ser Thr Phe Thr Ala Gly Gln Asn Ser Leu Ile Ser Gln Ala Asn
1300 1305 1310
Pro Val Cys Gln
1315
<210> 84
<211> 93
<212> PRT
<213> Microbulbifer degradans
<400> 84
Met Pro Pro Val Glu Leu Leu Asp Glu Leu Leu Asp Glu Glu Leu Asp
1 5 10 15
Glu Leu Leu Asp Glu Glu Leu Glu Glu Leu Glu Leu Leu Glu Glu Leu
20 25 30
Glu Pro Val Ala Leu Tyr Val Ser Asn Ser Ala Thr Val Gly Val Pro
35 40 45
Leu Glu Glu Val Ile Ser Lys Leu Ile Leu Leu Val Val Thr Ala Ser
50 55 60
Asn Val Ile Val Pro Thr Leu Glu Pro Ala Ala Asn Thr Ala Pro Val
65 70 75 80
Ser Leu Ser Val Thr Cys Gln Pro Leu Ile Leu Pro Ser
85 90
<210> 85
<211> 3951
<212> DNA
<213> Microbulbifer degradans
<400> 85
atgagaaata aattaggctc aatgttaaaa atgagcgcag ccattggcgg tttagttgca 60
gcgggttccg ctgttgcagg cccagttggt ttcgcaagtt taaacggcgg cactaccggc 120
ggcgcgggcg ggcaagttgt atatgctagc accggtgctg aaattaacca ggctatgtgt 180
aatcgcgcaa gcgacgatac accgctaatt atttatgtga cgggtaccat taaccacggt 240
aacaccgcca agtattctgg tagctgcgat accactgcag atgaaattca gtttaaaggt 300
gtaaaaaata tatcgttgat aggaacgggc agcggtgctg tgttcgatca aatcggtatt 360
cacctacgcg atacctcgaa tattattttg caaaatttgc atattaaaaa cgttaagaag 420
tctggttcgc ctacttcgaa tggcggtgac gctattggta tggaatctgg cgtatacaat 480
gtgtgggtag accactgtga gctagaagct tcaggcggtg aaagtgatgg atatgattca 540
ttgctagata tgaaagccac cacgcagtat gtaacggttt cttacactta ctatcacgat 600
tctggtcgcg gtggtttaat ggggtctagt gatagcgacg ataccaatac cttcgtcacc 660
ttccaccaca actactacga aaatatggat tcgcgcttgc cgttactgcg tcacggtaca 720
gctcatgcat ttaacaacta ctataatggt attgctaaat ctggcatgaa cccacgtata 780
ggtgggcaaa taaaagcgga aaacaattac ttcgaaaatg cgcacaaccc aattggtact 840
ttttatacag acgatatggg ttactgggac ttacgcggca atatatttgg cagtaacgta 900
acgtgggcgt ctgcggatga tgaaacccct gcaggcccga acccaacatc cactacgtct 960
attcatattt cttaccccta tgatctagat gacgctgctt gtgtgcctga tattgtaaaa 1020
tccacagcag gtgtgggtac tggcctagcg gtttcagacg gaagctgcac cataacaacg 1080
ccaccttcaa cgagttcgtc tagctccagt tctagctcaa cctcgtcgac tggttcgagt 1140
tcgtcttcaa gctcttcctc ttcaagcagc tctagctcca atggcggcag cttagtatta 1200
ggtaacaacc tttcaattgg tgctggctct gatggtagta gcaagggagc aggttcgtac 1260
ggcaatgtgc gcgatggcga tgtaagtagc tattgggcgc cgagtggcag tactggtcgt 1320
gtttcaatta aatggagcgg cagccaaact gttaacgcta ttgttattaa agaagcggca 1380
ggctatgaag gtaatattag tggttggcaa gtaactgata acgataccgg tgcagtattg 1440
gctgctggct caagcgtagg cacaattacg tttgatgcgg taacgactag caagatcaat 1500
ttcgaaatta cttcttctaa cggtacacca acggtagcgg aattcgaaac atataatgct 1560
acaggttcta gctcttcaag cagctcgagt tcttctagct cttcatcaag tagctcgtct 1620
agttcttcat cgagcagttc gtctagtagt tctacaggcg gcaccgctac tttaagtaca 1680
acggtttctg gcgatcaagt aacgttgaat tggagcgtaa ataatgcaac cgtaactggt 1740
cagcaaattt atcgcgatgt ggattcagac ccagctggcc gtgtgcgcat tgcatccggt 1800
gtaactggaa atacttacac agataccggt ttggctaacg gaacttatta ctactgggta 1860
aaagtaaccg attcaaactc agctacaatt aattccaact actcagaagc gcaagtgaat 1920
gtttatacaa catctactac aacgtttgaa gaggatgcgg gttattgctc ggtagacggt 1980
tcagtagata gcaataacag tggctttgct ggcagtggtt ttgctaatac cgataatgct 2040
tcgggtaatg gcgtaaacta cgcggtaagc gtacccgttg ctggtgtgta cacgctgcaa 2100
gtgcgttttg ctaatggctc aagtgcacgt cctgctgatg tgttagtgaa ctatggtaac 2160
gccggtgtat ttgatctgcc tagcacaggt tcttggacca gctggagcaa ctcaaacgaa 2220
attagcgtta acttagttgc tggcaataat attattcgtt tagaggctac cacgtctggc 2280
ggcttggcga atattgatag cctatctgta acgggtgtag agccttctgc aggtgactgt 2340
aacggtagtg ttggttctag cagttctagt tcttccagct cttctacttc tagcaccagt 2400
tcatctagca ctagctctgg tggcagctct actagctcaa gctcaacgtc tagctcatct 2460
acaagctcta catctagtag ttcaaccagc tctagcagca cgtcttctac ctcaagctct 2520
tcgggcggtg gtacggcaag ttgtgagcag ttgattaacg atccaagtgt taactgggat 2580
gagtctgcac tggcttcaga gcaagaaatt gtagcctgtt tggctcagtc tctaggtagc 2640
cctgttggct ttggggaagg tactaccggt ggttacgatc caagtggcgg cagcaacctt 2700
gttgttatta aaaagaacat aggtatttct gttgagcaac aaattttgga tgctataagc 2760
accgaaaacc acaactggat tgtgttcgac aaagatgatt ttgctgcgcg cactgcggta 2820
gcgatgtatc gcttagattg tgacaatgcc gatgtgcgtt cagcattggg tggcgcaagt 2880
gctgcacaat gtcgcgatca tatagcttgg tgttctgcta atggtatttc tgacgagcat 2940
gactgtgaaa atgaattctt taacaaccgt ttaaatgatt cagatttgcc aatccgcaat 3000
caaatgattc agtcaaacac taccattgat ggtcgtggtg caaacgcata cttcttcttt 3060
aatggtttct ccattggtaa agatagcagt ggtgcaagct tgtacgcagc gcaaaatgtg 3120
attgtaacga ataacgagtt tattggtgcc ggtcacactg aagatcacga tctagaccca 3180
gatatgattc gatctactgg cgaatcgaac aaaatttgga ttcaccaaaa cacgttcgac 3240
catactggtg attctgcgtt tgacgtaaag gtgggtgctt acgatataac aatatcattc 3300
aataagttgg tgaacgtgaa gcgtgctgcg ctacatggtt caagtgatag ccgagcaatt 3360
aactcgcaaa tcacaaccac tatgcacaac aacctgttct atacttcaga tgatcaatac 3420
gcgctaagta cctacgacac tttgcgtcgt gtaccgctaa tgcgtcgcgg tcaatcacac 3480
atgtttaaca acgttttcta cggttaccgt aaagatattc taagcgtgcg tgttggcggt 3540
cgtatcgcct ttgaagataa cattattttg aataaagaaa gcagctctac cccaggtgat 3600
ggcctgaaga aaggcgacga catggaatac tatgttgaaa ccttgttgcg cgacttccgt 3660
gagggtgggt tagaaattag cggtagctat gtatcgtttg cagatagcgc ttgtaattcc 3720
tatggcgcat cgggtgactt aaccgcatcg catggtgcta cgccagatat gtttgatgat 3780
tacagctctg catctaaaaa tactatatca gccaatcgct ttgttgctgg cgatgactta 3840
actgactatg tatttgctac tgcaggtaag ggcggtaaag cgccttatgt ttccaccttt 3900
actgctgggc aaaatagcct tatttcacag gctaacccag tttgtcagta g 3951
<210> 86
<211> 427
<212> PRT
<213> Microbulbifer degradans
<400> 86
Met Asn Lys Asn Asn Val Ile Ala Tyr Leu Leu Ile Ser Thr Phe Leu
1 5 10 15
Leu Phe Ser Ala Thr Val Phe Ala Val Lys Pro Ser Asn Ala Glu Thr
20 25 30
Arg Tyr Ser Ala Met Gly Ala Asp Thr Pro Ala Gly Leu Gly Gly Thr
35 40 45
Leu Pro Asp Gly Gln Ser Arg Ile Val Arg Val Thr Asn Leu Asn Ala
50 55 60
Ser Gly Glu Gly Ser Leu Ala Trp Ala Leu Gly Leu Ala Arg Pro Arg
65 70 75 80
Val Val Val Phe Glu Val Gly Gly Val Ile Asp Leu Ala Gly Gln Ser
85 90 95
Ile Thr Val Thr Gln Pro Phe Leu Thr Val Ala Gly Gln Ser Ala Pro
100 105 110
Ala Pro Gly Ile Thr Leu Ile Arg Gly Gly Leu Asn Ile Arg Thr His
115 120 125
Asp Val Arg Val Gln His Ile Arg Val Arg Pro Gly Asp Asn Leu Gln
130 135 140
Pro Lys Arg Ser Gly Trp Glu Ser Asp Gly Ile Ser Val Ala Gly Glu
145 150 155 160
Asn Ala Lys Asp Val His Ile Asp His Val Ser Val Ser Trp Ala Val
165 170 175
Asp Glu Asn Leu Ser Ala Ser Gly Asn Arg Tyr Lys Gly Tyr Gly Gln
180 185 190
Thr Ala Glu Arg Val Thr Phe Ser Asn Asn Leu Ile Ala Glu Ala Leu
195 200 205
Asp Tyr Ala Ser His Lys Lys Gly Lys His Ser Lys Gly Leu Leu Val
210 215 220
His Asp Tyr Val Arg Asp Val Ala Val Val Arg Asn Leu Phe Val Ser
225 230 235 240
Asn Asp Arg Arg Asn Pro Tyr Phe Lys Ala His Thr Ile Gly Phe Val
245 250 255
Ala Asn Asn Ile Ile Tyr Asn Ala Gly Asn Ala Ala Ile Gln Val Asn
260 265 270
Tyr Ile Glu Arg Glu Trp Glu Gly Gln Ser Thr Gly Pro Ala Asn Ala
275 280 285
Arg Val Ala Val Val Asn Asn Gln Leu Val Tyr Gly Arg Asp Thr Tyr
290 295 300
Ser Asp Leu Ala Leu Val Ser Val Arg Gly Asp Ala Tyr Leu Thr Gly
305 310 315 320
Asn Ser Val Thr Asn Leu Met Gly Glu Pro Met Pro Ile Thr Glu Gly
325 330 335
Ala Val Asn Ser Leu Ala Ser Ala Pro Ser Trp Leu Thr Gly Tyr Glu
340 345 350
Leu Trp Asp Ala Asp Glu Met Arg Glu Leu Leu Val Ala Ser Val Gly
355 360 365
Ala Thr Pro Trp Ala Arg Asp Ala Ile Asp Thr Arg Ile Ile Asn Gly
370 375 380
Val Ala Thr Gly Lys Ala Arg Ile Ile Asp Ser Gln Gln Asp Val Gly
385 390 395 400
Gly Tyr Pro Ser Tyr Lys Gln Thr Asn Lys Lys Phe Asp Ile Pro Asp
405 410 415
Asp Lys Ile Ala Glu Trp Leu Leu Gly Tyr Leu
420 425
<210> 87
<211> 1284
<212> DNA
<213> Microbulbifer degradans
<400> 87
atgaataaaa ataatgtaat tgcttatctg ctaatttcaa cttttctatt attttctgcg 60
actgtgttcg cggttaaacc cagcaacgct gaaacccgat attccgcaat gggtgcagat 120
accccagcag gtttgggagg cactttgcca gatggtcagt ctcgtatcgt tagggtgact 180
aatttaaatg caagtgggga gggctcgctc gcatgggcgc tgggtttagc tcgaccacgc 240
gtagtggtgt tcgaagttgg cggtgttata gatcttgctg ggcaaagtat taccgtcacg 300
cagccattcc ttactgttgc cggtcagtcg gcgccagcac cgggtattac attaattcgc 360
ggcggtttaa atatacgaac ccacgatgta agagtgcagc atattcgcgt gcggccggga 420
gataacttac aaccaaagcg ctccggctgg gaaagtgacg gtatatctgt ggccggtgaa 480
aatgccaaag atgtacatat agatcatgta tcggtaagtt gggcggtaga tgaaaacctc 540
tccgcttcgg ggaatcgtta caaaggttac ggtcaaaccg ctgagcgggt aacgtttagt 600
aataatctca ttgccgaagc gttagattat gccagccata aaaaaggcaa acactctaag 660
ggattattgg tacacgatta tgtgcgagat gttgccgtag ttagaaattt gtttgtgtct 720
aatgatcgtc gcaacccgta ctttaaagcg cacaccatag gttttgtagc aaataatatt 780
atttacaatg cgggtaatgc cgctatacag gttaactata ttgagcgtga gtgggagggc 840
cagagtacag gcccagctaa tgctagagta gcggtggtaa ataaccagtt agtttacggc 900
cgcgatacat actcagactt ggcgctagtg tctgtgcgtg gggatgctta tttgacgggt 960
aatagcgtta caaatttaat gggcgagccc atgcctatta cggaaggggc ggttaattct 1020
ttagcctctg caccttcatg gttaacaggt tatgagttgt gggatgctga cgagatgcgt 1080
gagctgctag tagccagtgt tggtgcaaca ccctgggcca gggatgcgat agatacccga 1140
ataattaatg gggtggcaac ggggaaggcg cgaataatag atagccagca agatgtgggt 1200
ggctacccga gctataagca aacaaataaa aaatttgata taccagacga caaaattgcc 1260
gaatggttac tgggttacct gtaa 1284
<210> 88
<211> 769
<212> PRT
<213> Microbulbifer degradans
<400> 88
Met Arg Asn Thr Lys His Leu Leu Asn Ser Gly Ala Val Leu Leu Ala
1 5 10 15
Ser Ser Ile Ala Thr Ala Ala Met Ala Gly Pro Val Gly Phe Ala Ser
20 25 30
Leu Asn Gly Gly Thr Thr Gly Gly Gln Gly Gly Gln Val Val Tyr Ala
35 40 45
Asn Thr Gly Thr Gln Ile Asn Glu Ala Met Cys Asn Arg Pro Ser His
50 55 60
Asp Thr Pro Leu Ile Ile Tyr Val Ser Gly Thr Ile Asn His Gly Asn
65 70 75 80
Thr Glu Lys Val Ser Gly Asn Cys Asp Thr Thr Gly Asp Glu Ile Gln
85 90 95
Phe Lys Lys Val Lys Asn Leu Ser Leu Ile Gly Thr Gly Asn Gly Ala
100 105 110
Val Phe Asp Gln Ile Gly Ile His Leu Arg Glu Thr Ser Asn Ile Ile
115 120 125
Leu Gln Asn Leu His Ile Lys Asn Val Lys Lys Ser Gly Ser Pro Thr
130 135 140
Ser Asn Gly Gly Asp Ala Ile Gly Met Glu Ser Gly Val Tyr Asn Val
145 150 155 160
Trp Val Asp His Cys Glu Leu Glu Ala Ser Gly Gly Glu Lys Asp Gly
165 170 175
Tyr Asp Ser Leu Leu Asp Met Lys Ala Thr Thr Gln Tyr Val Thr Val
180 185 190
Ser Tyr Thr Tyr Tyr His Asp Ser Gly Arg Gly Gly Leu Met Gly Ser
195 200 205
Ser Asp Ser Asp Asp Thr Asn Thr Tyr Val Thr Phe His His Asn Tyr
210 215 220
Tyr Lys Asn Met Asp Ser Arg Leu Pro Leu Leu Arg His Gly Thr Ala
225 230 235 240
His Ala Phe Asn Asn Tyr Tyr Asp Gly Ile Thr Lys Ser Gly Met Asn
245 250 255
Pro Arg Ile Gly Gly Gln Ile Lys Ala Glu Asn Asn Tyr Phe Glu Asn
260 265 270
Ala His Asn Pro Ile Gly Thr Phe Tyr Thr Asn Asp Met Gly Tyr Trp
275 280 285
Asp Leu Ser Gly Asn Ile Phe Gly Asn Asn Val Thr Trp Ala Ser Ala
290 295 300
Asp Asp Glu Thr Pro Ala Gly Pro Asn Pro Gln Ser Thr Thr Ser Ile
305 310 315 320
His Ile Ser Tyr Pro Tyr Ser Leu Asp Asp Ala Thr Cys Val Pro Lys
325 330 335
Ile Val Lys Ala Thr Ala Gly Val Gly Asn Gly Leu Ala Val Ser Thr
340 345 350
Gly Gly Ser Asn Cys Gly Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser
355 360 365
Ser Ser Ser Thr Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Ser Ser
370 375 380
Ser Asn Ser Ser Ser Gly Gly Ser Gly Val Asn Leu Ser Ile Gly Ala
385 390 395 400
Gly Ser Asp Gly Ser Ser Lys Gly Ala Gly Ser Tyr Gly Asp Val Arg
405 410 415
Asp Gly Asn Met Ser Thr Tyr Trp Ala Pro Ser Gly Ser Thr Gly Arg
420 425 430
Val Ser Ile Lys Trp Ser Ser Ala Thr Thr Val Ser Ser Ile Val Ile
435 440 445
Lys Glu Ala Ala Gly Phe Glu Gly Asn Ile Thr Gly Trp Gln Val Val
450 455 460
Asn Asn Glu Asn Gly Ala Val Leu Lys Ser Gly Ser Asn Ala Gly Val
465 470 475 480
Ile Ser Phe Ser Pro Val Ser Thr Thr Lys Leu Asn Phe Glu Ile Thr
485 490 495
Ser Ser Asn Gly Met Pro Thr Val Ala Glu Phe Glu Thr Tyr Ser Gly
500 505 510
Thr Val Gly Gly Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
515 520 525
Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Gly Gly Ser Ala
530 535 540
Asn Leu Gly Thr Ser Val Ser Gly Asp Gln Val Ser Leu Asn Trp Ser
545 550 555 560
Thr Ser Asn Ile Asp Val Gly Ser Gln Gln Val Tyr Arg Asp Thr Asp
565 570 575
Ser Asn Pro Ser Gly Arg Val Arg Ile Ser Ala Gly Val Ser Gly Asn
580 585 590
Ser Tyr Thr Asp Tyr Gly Leu Ala Ser Gly Thr Tyr Tyr Tyr Trp Ile
595 600 605
Lys Ile Thr Asp Gln Asn Gly Val Val Tyr Asn Thr Asn Ala Ala Glu
610 615 620
Ala Val Val Gly Ser Gln Ala Pro Thr Thr Phe Thr Ala Gln Glu Ser
625 630 635 640
Ala Gly Phe Cys Ser Val Asn Gly Ser Val Asp Ser Asn Asn Ala Gly
645 650 655
Tyr Thr Gly Asp Gly Phe Val Asn Thr Asp Asn Ala Ser Gly Asn Ala
660 665 670
Ala Val Tyr Ala Phe Asn Ala Pro Ser Ala Gly Met Tyr Ser Leu Gln
675 680 685
Val Arg Tyr Ala Asn Gly Ser Ser Ala Arg Pro Gly Asp Val Leu Val
690 695 700
Asn Ala Gly Asn Ile Gly Thr Phe Asp Phe Ser Ser Thr Gly Ser Trp
705 710 715 720
Thr Ser Trp Ala Asn Ser Asn Glu Leu Ser Ala Tyr Phe Ser Ala Gly
725 730 735
Asn Asn Thr Val Arg Ile Gln Ala Thr Asn Ser Gly Gly Leu Pro Asn
740 745 750
Leu Asp Ser Val Ser Val Thr Gly Asn Ala Pro Ala Ala Gly Asn Cys
755 760 765
Asn
<210> 89
<211> 2310
<212> DNA
<213> Microbulbifer degradans
<400> 89
atgagaaaca ctaaacacct actcaatagc ggtgccgtat tgctagctag cagtattgca 60
acagctgcga tggcggggcc tgtgggcttt gcatcgctta atggtggcac aacgggcggt 120
caaggcggtc aagttgtata cgccaacacc ggtacacaaa ttaacgaagc catgtgtaac 180
cgcccatctc acgatacgcc gttaattatt tatgtatccg gtaccattaa ccacggcaac 240
accgaaaagg tgtcgggtaa ttgcgataca accggcgacg agattcagtt taaaaaagtt 300
aaaaacctat cgttaattgg tactggtaac ggtgcggtgt ttgatcaaat aggtattcat 360
ttacgcgaaa cctccaatat tattctgcaa aaccttcata ttaaaaatgt taaaaaatcg 420
ggttctccaa cctccaatgg cggcgatgca attggaatgg agtctggcgt atacaatgtg 480
tgggtggatc actgtgagct agaagcatcc ggtggtgaaa aagatggtta cgattcattg 540
ctagatatga aagcaaccac gcagtatgta accgtttctt acacctatta ccacgattca 600
ggccgcggtg gtttaatggg gtcgagcgat agtgacgata ccaacaccta cgtgactttc 660
caccacaatt actacaaaaa tatggattca cgcttaccgc ttttacgcca cggtactgcg 720
catgccttta acaactatta cgatggcatt accaaatctg gtatgaaccc ccgtataggc 780
ggtcaaataa aagcagaaaa taactatttc gaaaacgcac acaacccaat aggtacgttt 840
tacacaaacg atatgggtta ctgggactta agcggcaata tatttggcaa caacgtaacg 900
tgggcgtctg cggatgatga aacccctgca gggccgaatc cacaatccac aacgtccatt 960
catatttctt acccctacag cttggatgac gcaacgtgcg tgccgaagat tgtaaaagct 1020
actgcgggtg tgggtaacgg tttggctgtg tctaccggtg gtagcaattg cggtacttct 1080
agctcttcgt ctagcagctc ttcgtccagc tctacctcgt ctaccagttc tacatctagc 1140
agttcttcat catccaatag ttcttctggt ggctcaggtg taaacctttc aattggcgca 1200
ggttctgatg gtagcagtaa aggtgcgggc tcttatggcg atgtgcgcga tggcaatatg 1260
agcacctact gggcaccgag tggcagcact ggtcgcgtat ctattaagtg gagttcggca 1320
acgacggtaa gtagcattgt tattaaagaa gcggctggct ttgaaggtaa cattactggc 1380
tggcaagttg taaacaacga gaatggcgca gtattaaaaa gcggctctaa cgctggcgta 1440
atttcttttt ctccggtttc tactacgaag ttaaatttcg aaattacctc ttcgaacggc 1500
atgcccacgg ttgcagaatt tgaaacctat agcggcacgg ttggtggtac ttcgtcatcc 1560
agttcttcaa gtagttcgtc aagcagttct tcaagcagct caagttcaac gagcagctct 1620
ggtggttccg cgaatctagg tacctcggta agtggcgatc aggtttcact taattggtct 1680
acctcaaaca ttgatgtggg ttcgcagcaa gtttatcgcg ataccgattc aaacccatct 1740
ggtcgtgtgc gtatctctgc aggcgtttct ggtaattcgt ataccgatta cggcttagct 1800
agcggtactt attactactg gataaaaatt accgatcaaa acggtgttgt ttacaacaca 1860
aatgccgcag aagcggttgt aggcagccaa gcgccaacga cgtttactgc gcaagagtct 1920
gcgggtttct gctctgttaa cggttctgtt gattccaata acgctggcta cactggcgat 1980
ggttttgtaa ataccgataa cgccagtggc aatgcagcag tttatgcctt caatgcacct 2040
agtgcgggta tgtatagctt gcaggttcgc tacgcaaacg gttctagtgc acgcccaggt 2100
gatgtgctgg tcaacgctgg caatattgga acatttgatt tttccagtac cggttcttgg 2160
acatcttggg caaacagtaa tgagttaagt gcgtacttct ctgcgggtaa caatactgtt 2220
cgtattcaag ctactaactc tggcggctta cctaacttag atagtgtttc tgtaacgggt 2280
aatgcaccag cggcaggcaa ctgtaattaa 2310
<210> 90
<211> 594
<212> PRT
<213> Microbulbifer degradans
<400> 90
Met Lys Ile Phe Lys Leu Leu Leu Met Phe Val Leu Ala His Asn Leu
1 5 10 15
Val Ala Cys Gly Gly Ser Asn Asp Gly Gly Glu Ile Glu Leu Asn Phe
20 25 30
Gly Glu Glu Asn Thr Pro Glu Pro Glu Thr Glu Pro Glu Ala Glu Pro
35 40 45
Glu Gly Glu Pro Glu Gly Glu Pro Glu Gly Glu Pro Glu Gly Glu Pro
50 55 60
Glu Gly Glu Pro Glu Gly Glu Thr Ala Asp Ala Thr Ala Asp Ala Gly
65 70 75 80
Phe Ala Gly His Asn Phe Asn Leu Thr Gly Gly Glu Gly Gly Thr Ala
85 90 95
Tyr Thr Val Asn Asn Gly Lys Asp Leu Gln Thr Val Leu Asp Asn Ala
100 105 110
Lys Ser Ser Asn Ser Pro Val Ile Ile Tyr Val Asp Gly Thr Ile Asn
115 120 125
Ser Phe Asn Ser Ala Asn Gly Asn Gln Pro Ile Gln Ile Lys Asp Met
130 135 140
Asp Asn Val Ser Ile Ile Gly Tyr Gly Ala Glu Ala Thr Phe Asp Gly
145 150 155 160
Val Gly Ile Ala Ile Arg Arg Ala Asn Asn Ile Ile Ile Arg Asn Leu
165 170 175
Thr Phe Lys Ser Val Leu Thr Glu Gly Lys Asp Ala Ile Ser Ile Glu
180 185 190
Gly Asp Asp Asp Gly Ser Thr Thr Ser Asn Ile Trp Val Asp His Asn
195 200 205
Glu Phe Tyr Ser Ala Pro Thr Ala Asp Lys Asp Phe Tyr Asp Gly Leu
210 215 220
Ile Asp Ser Lys Ser Gly Ala Ser Asn Ile Thr Ile Ser Tyr Asn Tyr
225 230 235 240
Leu His Asp His Trp Lys Ala Ser Leu His Gly His Thr Glu Asn Asp
245 250 255
Glu Gly Ala His Asn Thr Asp Arg Lys Ile Thr Phe His His Asn Arg
260 265 270
Phe Glu Asn Ile Glu Ser Arg Leu Pro Leu Phe Arg Arg Gly Val Gly
275 280 285
His Leu Tyr Asn Asn Tyr Tyr Lys Asp Val Gly Ser Thr Ala Ile Asn
290 295 300
Ser Arg Ile Gly Ala Glu Leu Leu Ile Glu Asn Asn Val Phe Glu Asp
305 310 315 320
Ser Gln Asn Pro Ile Val Ser Phe Tyr Ser Asp Val Ile Gly Tyr Trp
325 330 335
Asn Thr Ser Gly Asn Leu Phe Thr Asn Val Thr Trp Thr Thr Pro Gly
340 345 350
Thr Gly Glu Val Ser Ala Gly Ala Thr Gln Thr Pro Thr Ser Asp Tyr
355 360 365
Val Val Pro Tyr Ser Tyr Thr Leu Met Pro Ala Ala Asp Val Lys Ala
370 375 380
His Val Ile Ala Ser Ala Gly Val Gly Lys Ile Asp Gln Thr Gly Leu
385 390 395 400
Thr Ile Pro Asp Pro Val Thr Pro Glu Gly Asp Leu Gly Glu Pro Glu
405 410 415
Ala Pro Val Gln Gly Asp Val Ser Leu Pro Tyr Thr Glu Asn Phe Ala
420 425 430
Ala Thr Asp Ala Ala Asn Phe Phe Ser Ala Ala Tyr Arg Asp Ile Thr
435 440 445
Gly Ser Ala Gly Thr Ser Thr Pro Met Tyr His Arg Val Thr Gly Thr
450 455 460
Val Glu Ile Asn Ala Gln Gln Leu Asp Met Thr Gly Ala Arg Val Ser
465 470 475 480
Ile Gly Asn Thr Thr Pro Ser Val Ser Thr Thr Gly Ala Asp Thr Thr
485 490 495
Thr Thr Gly Val Leu Asp Leu Ser Ala Pro Tyr Thr Val Ser Phe Lys
500 505 510
Val Val Ser Val Gly Gly Thr Leu Thr Lys Lys Phe Gln Ile Tyr Val
515 520 525
Asp Asn Asn Thr Ser Ala Ser Gly Asp Ser Ile His Gly Gly Ser Ser
530 535 540
Arg Phe Tyr Ser Glu Thr Leu Asp Ser Leu Val Ala Gly Gln Thr Tyr
545 550 555 560
Thr Val Thr Gly Phe Thr Ala Thr Asn Ser Ser Phe Ile Thr Leu Arg
565 570 575
Thr Glu Ser Ser Gly Gln Ile Val Leu Asp Asp Leu Ser Ile Gln Ala
580 585 590
Ala Glu
<210> 91
<211> 1785
<212> DNA
<213> Microbulbifer degradans
<400> 91
atgaaaattt ttaaattgtt attaatgttt gtactcgccc acaacttagt tgcttgtggt 60
ggcagtaacg acggtggtga aattgaatta aactttggcg aagaaaacac accagagcca 120
gaaaccgaac cagaagctga gcctgaagga gaaccagagg gcgagccgga gggagaacct 180
gaaggagagc ctgaagggga accagagggc gaaacagccg acgcaaccgc agatgctggc 240
ttcgccggcc acaattttaa tcttaccggt ggcgaaggcg gcacagccta taccgttaat 300
aacggcaaag atttgcaaac agttttagac aacgccaaat cgagtaattc accggtcatt 360
atttacgtag acggcaccat aaattcgttt aactctgcca acggcaacca gcctattcaa 420
attaaagata tggataacgt atctataatt ggttacggcg ccgaagcaac atttgacggt 480
gttggtatag caatacgccg cgccaacaac attattattc gcaaccttac ttttaaaagc 540
gtccttaccg aaggtaaaga tgcaattagt atagaaggtg atgacgacgg cagcaccacg 600
tcaaacattt gggttgatca caacgaattc tacagcgccc caacggcaga caaagatttt 660
tacgacggtt taatcgatag taaaagcggc gcgagcaaca ttactatttc ttacaactac 720
ctgcacgacc attggaaagc atcgttacac ggccataccg aaaatgacga aggtgcacac 780
aacaccgacc gcaaaattac tttccaccac aaccgttttg agaatattga atcgcgttta 840
ccgctgttcc gtcgcggtgt aggccatttg tacaataact actacaaaga cgtaggctca 900
acggctatca actcacgtat tggtgccgag ttattaattg agaataacgt ttttgaagat 960
tcacaaaacc cgattgtctc tttttactct gacgtaattg ggtactggaa cacctcaggc 1020
aacctcttca ccaatgtaac ttggacaacc ccaggtactg gcgaagtatc tgcaggcgca 1080
acacaaacgc caacctcaga ttacgtagtg ccatacagct acacgcttat gccggcagcc 1140
gatgtaaaag cccacgtcat tgcgagtgca ggcgttggca aaatagacca gacagggctt 1200
accattccag accccgttac ccctgaaggc gacctaggtg aaccagaagc cccagtgcaa 1260
ggtgatgtaa gcctacctta cactgaaaat tttgccgcca ctgacgccgc caatttcttt 1320
agcgccgcgt accgcgatat tactggctct gctggcacca gcacacccat gtaccaccgc 1380
gtaaccggca cggtggaaat taacgcacag caattggata tgactggcgc acgcgtatca 1440
attggcaaca caacgccaag tgtaagcaca accggtgcag acaccactac aacgggcgta 1500
ttagatttaa gcgcgcccta caccgtaagc tttaaagtgg taagcgtagg cggcacccta 1560
actaagaaat ttcaaatata tgtagacaac aatacctctg ccagcggcga ctctattcac 1620
ggcggctcat cgcgctttta cagtgaaact ttagactcgc tagttgcagg ccaaacctac 1680
acagtaaccg gctttaccgc caccaacagc tctttcataa cattacgtac cgaaagtagc 1740
ggccaaattg tattagatga cctaagtatt caagccgcag aataa 1785
<210> 92
<211> 425
<212> PRT
<213> Microbulbifer degradans
<400> 92
Met Phe Lys Tyr Ala Leu Tyr Val Val Ala Leu Val Ala Gly Val Val
1 5 10 15
Val Ser Leu Ala Ala Cys Ser Lys Arg Ala Thr Gln Gln Val Glu Thr
20 25 30
Glu Phe Tyr Glu Ile Asn Glu Arg Gly Gly Asp Asp Gly Arg Leu Leu
35 40 45
Arg Val Val Asn Leu Asn Asn Gln Gly Val Gly Ser Leu Arg Trp Ala
50 55 60
Leu Ala Gln Thr Gly Ala Arg Lys Ile Ile Phe Asp Val Gly Gly Val
65 70 75 80
Ile Asp Leu Glu Glu Lys Ser Leu Lys Ile Arg Glu Ala His Val Thr
85 90 95
Ile Ala Gly Glu Thr Ala Pro Ser Pro Gly Ile Thr Leu Ile Lys Gly
100 105 110
Gly Leu Arg Ile Glu Thr His Asn Val Lys Val Ser His Leu Met Ile
115 120 125
Arg Pro Gly Asp Ala Gly Tyr Ser Lys Gly Gln Gly Trp Lys Pro Asp
130 135 140
Gly Ile Thr Ile Tyr Gly Ser Lys Ala Arg His Val Val Ile Asp His
145 150 155 160
Cys Ser Val Thr Trp Ala Val Asp Glu Asn Ile Ala Val Ser Gly Pro
165 170 175
Ala Asp Lys Gly Ala Glu Ala Thr Ala Gly Lys Val Leu Ile Arg Asn
180 185 190
Ser Ile Ile Ala Glu Ala Leu Ser Asn Ala Ser His Pro Glu Gly Glu
195 200 205
His Ser Lys Gly Ile Leu Ile His Asn Asn Val Gln His Val Ser Leu
210 215 220
Val Asn Asn Leu Leu Ala His Asn Arg Arg Arg Asn Pro Tyr Phe Lys
225 230 235 240
Ala Gly Thr Thr Gly Ile Val Ile Gly Asn Ile Ile Tyr Asn Pro Gly
245 250 255
Lys Arg Ala Ile His Met Ser Ser Gly Arg Ala Asp Ala Ser Leu Pro
260 265 270
Thr Leu Ser Ile Thr Gly Asn Leu Phe Ile Pro Ala Ala Asn Thr Ser
275 280 285
Pro Asn Leu Ser Leu Ile Ser Asn Tyr Gly Lys Ile Tyr Ser Ser Gly
290 295 300
Asn Leu Val Gln Gly Glu Ser Arg Pro Ile Thr Asp Gly Lys Ser Ile
305 310 315 320
Ser Leu Thr Ala Pro Pro Leu Gln Gln Val Gly Ile Asn Leu Thr Asp
325 330 335
Thr Gly Thr Gln Asn Asp Phe Cys Gln Thr Leu Ser Asn Ala Gly Ala
340 345 350
Arg Pro Trp Asp Pro Asp Pro Ile Asp Ile Arg Ile Lys Thr Gln Leu
355 360 365
Leu Ala Gly Glu Gly Arg Ile Ile Asp Ser Gln Ser Glu Val Gly Gly
370 375 380
Tyr Pro Ile His Asn Val Asn Asn Lys Glu Thr Ala Glu Ala Gly Ser
385 390 395 400
Thr Gln Gly Asp Gly Met Leu Gln Phe Asp Thr Glu Leu Leu Arg Lys
405 410 415
Ile Pro Asn Leu Cys Ser Gly Met Met
420 425
<210> 93
<211> 1278
<212> DNA
<213> Microbulbifer degradans
<400> 93
atgtttaagt atgcgctata tgttgtggcc ttggtggctg gcgtagtggt tagcttagca 60
gcctgcagca agagagctac gcagcaagta gagactgaat tctacgagat taatgagcgg 120
ggcggagatg atggtcgcct gctacgcgtt gtgaatttaa ataatcaggg ggttggctct 180
ttgcgctggg cgttggcgca gacaggtgct agaaaaataa ttttcgatgt agggggggtg 240
atagatctag aagaaaaatc gctcaagatt cgtgaagccc atgtgacaat cgctggagag 300
acggccccat caccgggtat cacccttatc aagggcggac taagaataga aacccacaat 360
gtcaaagttt cgcaccttat gattaggcct ggtgacgcag ggtactccaa aggtcaaggt 420
tggaaacccg acggcataac tatatatggc agcaaagcga ggcatgttgt tattgatcat 480
tgctcggtta catgggctgt cgacgaaaat atcgcagtat ctggcccagc agataagggg 540
gcagaggcta ccgcgggtaa ggttcttatt cgcaattcaa ttattgccga agcgctaagc 600
aatgcatccc acccagaggg ggagcattct aagggcatac tcatacacaa caatgtgcag 660
catgtaagct tggttaacaa tttgttggct cacaataggc gaagaaaccc ttattttaag 720
gcgggtacaa cgggaattgt aattggcaat ataatttata acccagggaa acgtgctatt 780
catatgtctt cgggccgtgc tgatgctagc ctgccaacac tgtcaattac agggaatttg 840
tttattccag cagctaatac ttcccccaac cttagtttga taagcaacta tggaaaaatt 900
tattcgagtg gaaacttggt gcagggggag agcaggccga taactgatgg taaaagtatt 960
tcattgactg cgccgccgct acagcaagta ggcataaatt taaccgatac aggtacacaa 1020
aacgactttt gccaaaccct aagcaacgca ggtgcccgcc cgtgggaccc cgacccgatt 1080
gatattagaa taaaaacaca acttttggct ggcgaaggac gaattattga tagccagagc 1140
gaggttggag gctacccaat acataacgtc aataacaaag aaaccgctga agcaggctct 1200
acgcaggggg atggtatgtt gcagtttgat actgagttat tgagaaaaat accaaacctg 1260
tgtagtggca tgatgtaa 1278
<210> 94
<211> 772
<212> PRT
<213> Microbulbifer degradans
<400> 94
Met Arg Asp Ile Thr Met Lys Asn Asn Lys Phe Arg Ser Ser Phe Thr
1 5 10 15
Leu Lys Lys Leu Thr Pro Phe Phe Val Ala Gly Thr Met Leu Gly Gly
20 25 30
Ser Asn Ala Trp Ala Gly Cys Asp Tyr Thr Val Thr Asn Gln Trp Gly
35 40 45
Ser Gly Phe Thr Gly Asn Val Arg Ile Thr Asn Ser Gly Asn Thr Pro
50 55 60
Thr Asn Gly Trp Ala Val Asn Trp Gln Tyr Ala Gly Asp Asn Arg Ile
65 70 75 80
Ser Asn Ser Trp Gly Ala Gln Leu Ser Gly Ser Asn Pro Tyr Ser Ala
85 90 95
Thr Ala Glu Ser Trp Asn Ala Val Ile Gln Pro Ser Gln Ser Ile Glu
100 105 110
Ile Gly Phe Gln Gly Thr Gly Asp Gly Asn Glu Ile Pro Thr Ile Asn
115 120 125
Gly Asp Val Cys Gln Thr Ser Ser Gly Ser Thr Ser Ser Ser Ser Ser
130 135 140
Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser
145 150 155 160
Ser Asn Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
165 170 175
Ser Gly Ser Thr Thr Gly Tyr Ile His Ile Glu Glu Asn Glu Leu Gly
180 185 190
Phe Cys Tyr Val Gln Gly Ser Ile Asp Ser Asn Asn Gly Gly Phe Thr
195 200 205
Gly Thr Gly Phe Ala Asn Thr Asp Asn Val Asn Gly Ser Gln Ile Asn
210 215 220
Trp Lys Val Asn Val Asp Phe Asp Gly Tyr Tyr Ala Leu Glu Trp Arg
225 230 235 240
Tyr Ala Asn Gly Ser Gly Thr Ala Arg Thr Ala Ser Val Ser Ala Asn
245 250 255
Gly Ala Gln Ser Glu Ile Ser Phe Pro Thr Thr Gly Ser Trp Asp Ser
260 265 270
Trp Leu Leu Asp Ser Thr Thr Leu Phe Leu Lys Ala Gly Val Asn Asp
275 280 285
Val Ile Leu Ser Ala Asn Thr Ser Ser Gly Leu Ala Asn Ile Asp Ser
290 295 300
Leu Thr Val His Gly Asp Gly Val Ala Ala Ala Asp Cys Asn Thr Asp
305 310 315 320
Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr
325 330 335
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser
340 345 350
Gly Gly Pro Gln Ile Leu Lys Ala Phe Pro Thr Ala Glu Gly Tyr Gly
355 360 365
Lys Ile Thr Ala Gly Gly Arg Gly Gly Asp Val Tyr Ile Val Thr Asn
370 375 380
Leu Asn Asp Ser Gly Ala Gly Ser Leu Arg Gln Ala Val Glu Ala Ser
385 390 395 400
Gly Pro Arg Thr Val Val Phe Glu Val Ser Gly Thr Ile Thr Leu Asn
405 410 415
Lys Pro Leu Thr Ile Lys Asn Asn Asn Ile Thr Ile Ala Gly Gln Thr
420 425 430
Ala Pro Gly Asp Gly Ile Thr Leu Arg Lys His Asn Phe Ser Ile Gln
435 440 445
Ala Asp Asp Val Ile Val Arg Tyr Ile Arg Val Arg Phe Gly Asp Glu
450 455 460
Thr Leu Thr Asp Ser Asp Ala Ile Ser Met Arg Tyr Gln Lys Asn Ile
465 470 475 480
Ile Leu Asp His Val Ser Ala Ser Trp Gly Asp Asp Glu Thr Leu Ser
485 490 495
Leu Tyr His Gly Glu Asn Ile Thr Val Gln Trp Ser Met Ile Thr Glu
500 505 510
Thr Leu Asn Arg Gly Gly Glu His Ala Phe Ala Ala Ile Trp Gly Ser
515 520 525
Pro Phe Ser Thr Phe His His Asn Leu Ile Ala His Asn Val Ala Arg
530 535 540
Asn Val Arg Phe Ala Ser Gly Ser Gly Tyr Thr Asp Tyr Arg Asn Asn
545 550 555 560
Val Val Tyr Asn Trp Gly Tyr Ser Ser Thr His Gly Gly Glu Ala Gln
565 570 575
Gln Val Gly Asn Ala Asn Phe Asn Phe Thr Thr Val Asn Met Val Gly
580 585 590
Asn Tyr Tyr Lys Pro Gly Pro Arg Thr Glu Ser Gly Val Arg Ser Arg
595 600 605
Leu Leu Thr Pro Asn Thr Arg Asn Gly Asp Ala Asp Leu Gly Ser Phe
610 615 620
Tyr Val Ser Gly Asn His Met Val Gly Ser Pro Asn Val Thr Ala Asp
625 630 635 640
Asn Ser Ile Gly Val Ser Asn Lys Asn Ala Leu Ile Ser Ser Pro Trp
645 650 655
Asn Ser Met Lys Ile Glu Gly Glu Gln Thr Ala Glu Gln Ala Tyr Glu
660 665 670
Ser Val Leu Ala Tyr Ala Gly Ala Ser Lys Val Arg Asp Ser Val Asp
675 680 685
Thr Arg Ile Ile Glu Glu Val Arg Thr Gly Thr Ala Thr Tyr Gly Gly
690 695 700
Asn Gly Ile Ile Glu Ser Gln Asn Glu Val Gly Gly Trp Pro Gln Leu
705 710 715 720
Arg Ser Glu Thr Pro Pro Gln Asp Ser Asp Arg Asp Gly Met Pro Asp
725 730 735
Asp Trp Glu Arg Ala Asn Asn Leu Asn Pro Phe Asn Ala Ala Asp Arg
740 745 750
Asn Thr Lys Asp Ser Ile Gly Tyr Thr Met Leu Glu Arg Tyr Ile Asn
755 760 765
Gly Leu Val Asp
770
<210> 95
<211> 2319
<212> DNA
<213> Microbulbifer degradans
<400> 95
atgagagata tcacgatgaa gaataataaa ttcaggtcgt cttttacatt aaaaaaactc 60
acaccgtttt ttgttgcggg caccatgctt ggcggttcca acgcctgggc tggctgcgac 120
tatacggtca ctaatcagtg gggctcaggc tttactggca acgttcgtat aactaatagc 180
ggtaacacgc caacaaatgg ttgggctgtt aactggcagt acgctggcga taatcgtatt 240
agtaatagct ggggagcaca gctttcaggg tcgaacccat actctgccac ggcagaaagc 300
tggaatgctg ttattcagcc tagtcagtcc atagaaattg ggtttcaagg taccggcgac 360
ggaaatgaaa taccaactat aaatggcgat gtttgccaga ctagcagcgg aagtacttca 420
tccagctcat cttcaagtac gtcttctagc agctcttcaa gctcgtccac tagcagctct 480
tcaaacagct cttctagctc tagctcgtcc agctcgtcta gctcctcctc tggctctaca 540
actggatata ttcatataga agagaatgaa cttggttttt gttatgtaca aggttccatt 600
gactccaaca acggtggctt taccggcaca ggctttgcca ataccgataa cgttaatggc 660
tcacagatta actggaaagt aaatgtcgac tttgatggat attatgcgct cgaatggcgc 720
tatgcgaatg gctccggcac cgcgcgcact gcaagcgtta gcgctaatgg agcacaaagc 780
gaaatttcct tccctacaac aggttcgtgg gatagctggt tattagacag cactacccta 840
tttttaaaag ctggcgtaaa cgacgtaata ttgagcgcaa atacaagcag tggcctagcg 900
aacatagatt cacttacagt gcacggtgat ggcgtagctg cggcagactg taatactgat 960
ggaagctcaa gcagcagctc tagttcaagc tctagttcca gctcaacttc tagtagctcc 1020
tcaagttcca gctcgtctag cacgtccagc tcttctggtg gcccgcaaat attaaaagca 1080
ttccccaccg cagaaggcta cggaaaaata accgcaggtg gtcgtggtgg cgatgtctat 1140
atagttacaa acctgaatga ctcaggcgcg ggtagtttgc gtcaggccgt agaggcatct 1200
ggccctagaa ccgttgtgtt cgaagtgtct ggaaccatca ctctaaataa accactcaca 1260
atcaaaaata ataacatcac aatagcagga caaactgcac caggcgatgg cattacactt 1320
agaaagcaca acttttctat ccaagctgat gatgtcatcg tacgttacat acgtgttcgc 1380
tttggtgatg aaaccctaac cgattctgat gcgatttcca tgaggtacca aaaaaatatt 1440
attttggatc atgtgagtgc tagctgggga gatgatgaaa ccttatctct ttatcacggc 1500
gaaaatatca ctgtgcaatg gagcatgatt acagagaccc tcaatcgtgg cggcgaacat 1560
gcattcgcag ctatatgggg ttcgcctttt agtaccttcc accacaattt aattgctcac 1620
aatgttgcga gaaacgttcg ctttgcgtcg ggttccggtt atacggatta tcgtaacaat 1680
gtcgtatata actggggcta tagcagcaca cacggaggcg aagctcaaca agttggcaac 1740
gctaatttta atttcaccac cgtcaatatg gtcggcaact attacaaacc tgggccgaga 1800
actgaatctg gcgttcgtag tcgactactt acacctaaca cgcgtaacgg cgatgcggac 1860
ttaggtagtt tttacgtttc tggtaaccac atggttggca gcccaaatgt aactgcagac 1920
aactcgattg gcgtatcgaa taaaaatgcc ttaataagta gcccttggaa ttcaatgaaa 1980
atagaaggcg aacaaacagc tgagcaagca tatgagtcag ttcttgctta cgcaggtgca 2040
tctaaagtac gcgactcggt agatactcgt attattgaag aagtacgtac aggcacagct 2100
acttatggtg gaaacggcat aattgaatcg cagaatgaag tgggtggttg gccacaactt 2160
agaagtgaaa cgcccccgca agacagtgat cgcgacggaa tgccagatga ctgggaacgc 2220
gcgaacaacc taaatccatt caacgcagcc gatagaaaca ctaaagacag tattggctac 2280
acaatgttag agcgatatat taacgggctt gttgattaa 2319
<210> 96
<211> 511
<212> PRT
<213> Microbulbifer degradans
<400> 96
Met Leu Arg Ile Pro Lys Ala Trp Leu Ala Leu Pro Leu Val Leu Gly
1 5 10 15
Ser Thr Asn Leu Tyr Ala Gln Val Thr Cys Ser Ile Ser Asn Thr Asn
20 25 30
Val Trp Asn Asn Gly Tyr Thr Val Asn Val Asn Val Thr Asn Thr Gly
35 40 45
Ser Ser Gln Val Gly Ser Trp Gln Val Pro Ile Asn Phe Ser Glu Pro
50 55 60
Pro Gln Val Ser Ser Gly Trp Asn Ala Ile Leu Ser Thr Asn Gly Asn
65 70 75 80
Thr Val Thr Ala Gly Asn Ile Gly Trp Asn Gly Asn Leu Asn Pro Gly
85 90 95
Gln Ser Ala Ser Phe Gly Phe Gln Gly Gly His Asp Gly Ser Phe Val
100 105 110
Glu Pro Thr Cys Ser Gly Gly Gly Ser Ser Thr Ser Ser Ser Ser Ser
115 120 125
Ser Ser Ser Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Thr Ser Ser
130 135 140
Ser Ser Ser Ser Ser Ser Gly Gly Ser Glu Leu Leu Ile Gln Glu Asn
145 150 155 160
Ala Ser Gly Phe Cys Arg Val Asp Gly Ser Ile Asp Asn Asn Asn Ser
165 170 175
Gly Tyr Thr Gly Ser Gly Phe Ala Asn Thr Glu Asn Gln Asn Gly Ser
180 185 190
Ala Val Glu Tyr Ala Leu Asn Val Pro Ser Asn Gly Asn Tyr Leu Leu
195 200 205
Asp Ala Arg Tyr Ala Ser Ala Thr Thr Arg Ser Ala Ser Val Val Val
210 215 220
Asn Gly Ser Ser Val Gly Ser Phe Ser Phe Pro Ser Thr Gly Ser Trp
225 230 235 240
Thr Ser Trp Thr Val Asp Ser Ala Asn Val Pro Leu Lys Gly Gly Asn
245 250 255
Asn Ile Val Arg Ile Val Ala Thr Asn Ser Ser Gly Leu Pro Asn Ile
260 265 270
Asp Ser Leu Lys Val Ile Gly Thr Asn Pro Ser Ala Gly Ser Cys Ser
275 280 285
Ser Asn Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
290 295 300
Ser Asn Ser Gly Gly Lys Gly Ser Ser Cys Arg Ser Thr Gly Ser Gln
305 310 315 320
Ser Val Ser Ser Thr Ile Lys Val Thr Ser Gly Thr Phe Asp Gly Asn
325 330 335
Cys Lys Thr Tyr Asn Pro Thr Ser Ala Leu Gly Asp Gly Ser Gln Ser
340 345 350
Glu Ser Gln Lys Pro Ala Phe Arg Val Glu Asn Gly Ala Thr Leu Lys
355 360 365
Asn Val Ile Leu Gly Asn Asn Gly Val Asp Gly Ile His Val Tyr Asn
370 375 380
Gly Gly Thr Leu Asp Asn Ile Arg Trp Thr Asn Val Gly Glu Asp Ala
385 390 395 400
Met Thr Val Lys Ser Glu Gly Asn Val Thr Val Ser Asn Ile Glu Gly
405 410 415
Tyr Asp Gly Ser Asp Lys Phe Ile Gln Val Asn Ala Val Thr Asn Leu
420 425 430
Lys Val Ser Asn Cys Ile Val Asp Lys Met Gly Lys Phe Leu Arg Gln
435 440 445
Asn Gly Gly Lys Thr Phe Ala Met Ser Val Thr Val Asp Asn Cys Asp
450 455 460
Ile Ser Asn Met Gly Glu Gly Val Phe Arg Ser Asp Ser Pro Asn Ala
465 470 475 480
Thr Ala Arg Ile Thr Asn Ser Arg Leu Lys Asn Ala Gly Asp Ile Cys
485 490 495
Ile Gly Lys Trp Lys Ser Cys Thr Ser Ser Asn Ile Thr Ser Phe
500 505 510
<210> 97
<211> 1536
<212> DNA
<213> Microbulbifer degradans
<400> 97
atgttgcgaa tccccaaggc ttggctggca cttccacttg tactgggaag taccaatcta 60
tacgctcaag taacttgcag tatctctaac accaatgttt ggaataacgg atacaccgtt 120
aatgttaatg taaccaacac aggctcttca caggttggtt cttggcaggt tcctattaat 180
ttttctgagc cacctcaagt aagcagcggc tggaatgcaa tattaagcac aaacggaaac 240
accgtaactg ccggcaatat tggttggaat ggtaatttaa atcccggcca aagcgcctcc 300
tttggttttc aaggtggcca cgatggcagc tttgtggagc ccacctgctc gggcggaggc 360
tctagcacta gctcaagcag ctctagtagt tctagctcaa caagttctac cagttcttca 420
tccacaagtt caagtagctc ttctagctcc ggcggctctg aacttttaat ccaagaaaat 480
gcatccggct tctgccgtgt ggacggatcg atagataaca ataactcagg ctataccggt 540
agtggctttg ccaacaccga gaaccaaaac ggttccgcag ttgaatacgc acttaacgtt 600
ccctctaatg ggaattatct cctcgacgct cgatatgcaa gcgctactac acgatcggct 660
agcgtggtag ttaatggatc ttcagtaggc agctttagtt ttccatctac gggttcgtgg 720
acaagctgga cagttgactc cgccaacgtt ccgttaaaag gcgggaataa tattgttcga 780
attgttgcaa ctaacagcag cggattacct aatattgatt cattaaaggt aataggcacc 840
aacccgtcag ccggcagttg ttcaagcaac tcgtcatcca ctagttcatc gtctagctca 900
agttcatcaa gcagtaactc cggtggcaaa ggctctagct gccgttctac aggcagtcaa 960
tctgtttcct ctactattaa agttactagc gggactttcg atgggaactg taaaacgtat 1020
aaccctacaa gtgcccttgg cgatggcagt caatcagaaa gccagaaacc ggcattccga 1080
gtggagaacg gcgcaacact caaaaacgtg attctaggca acaatggcgt agacggtatt 1140
catgtttata acggcggcac cttggataac atccgctgga ccaatgtggg tgaagatgca 1200
atgaccgtta aatctgaagg aaacgttacc gtttcaaata ttgagggtta tgacggttca 1260
gataaattta tacaagtaaa cgcagttacc aacctaaagg tttctaattg cattgtagat 1320
aaaatgggta aatttttacg tcagaatggc ggtaaaactt tcgctatgtc tgtaaccgta 1380
gataattgtg atatctcaaa tatgggtgaa ggtgttttcc gctcagacag cccaaatgca 1440
acagcgagaa tcacaaatag ccgattaaaa aatgcaggcg acatttgtat tggtaagtgg 1500
aaaagctgca catcttccaa cattaccagc ttctaa 1536
<210> 98
<211> 455
<212> PRT
<213> Microbulbifer degradans
<400> 98
Met Ile Met Met Arg Asn Lys Ile Leu Leu Ala Leu Val Leu Cys Gly
1 5 10 15
Ala Ser Ala Ser Ala Phe Ala Ala Ser Asn Arg Pro Ser Gly Tyr Thr
20 25 30
Thr Ile Cys Lys Thr Asp Gln Thr Cys Ser Val Ser Ser Ser Thr Asn
35 40 45
Val Ala Phe Gly Ala Ala Gly Lys Phe Val Tyr Lys Val Ile Asn Gly
50 55 60
Thr Phe Thr Cys Asn Thr Ser Thr Phe Gly Ser Asp Pro Asn Pro Ala
65 70 75 80
Lys Ser Val Lys Glu Cys Ser Val Pro Thr Asn Gly Ser Ser Ser Thr
85 90 95
Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly
100 105 110
Ser Ser Ser Ser Cys Gly Thr Gly Gly Gly Ala Thr Val Cys Leu Ser
115 120 125
Ala Gly Gly Gly Ser Asn Asp Ile Asp Leu Thr Trp Thr Val Ser Gly
130 135 140
Ser Ile Ser Ser Ala Gln Val Tyr Arg Asp Thr Asp Ser Asn Pro Ser
145 150 155 160
Gly Arg Thr Arg Ile Ala Gln Leu Gly Gly Asp Ala Arg Ser Tyr Ser
165 170 175
Asp Thr Asn Val Ser Ala Gly Lys Gln Tyr Tyr Tyr Trp Ile Lys Phe
180 185 190
Gly Ala Asn Gly Ser Asn Tyr Asn Ser Asn Ala Ala Ser Ala Thr Tyr
195 200 205
Ser Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
210 215 220
Ser Gly Gly Ser Ala Glu Cys Lys Ala Gly Ala Thr Ile Ser Gly Lys
225 230 235 240
Thr Val Asp Cys Gly Gly Lys Glu Ile Gly Leu Ser Cys Ser Gly Asp
245 250 255
Ser Glu Thr Gln Pro Pro Val Leu Thr Leu Lys Asn Ala Thr Ile Lys
260 265 270
Asn Leu Val Ile Ser Ala Lys Gly Gly Ser Asp Gly Ile His Cys Thr
275 280 285
Gly Asn Cys Thr Met Glu Asn Val Val Trp Lys Asp Ile Cys Glu Asp
290 295 300
Ala Ala Thr Asn Lys Thr Asp Gly Ile Thr Met Thr Ile Ile Gly Gly
305 310 315 320
Ser Ala Tyr Asn Ser Thr Ser Gly Tyr Gly Gly Lys Pro Asp Lys Val
325 330 335
Phe Gln His Asn Ser Lys Asn Ser Thr Thr Val Ile Lys Gly Gly Phe
340 345 350
Thr Leu Thr Gly Glu His Gly Lys Leu Trp Arg Ser Cys Gly Asn Cys
355 360 365
Thr Asn Asn Gly Gly Pro Arg Asn Val Thr Ile Asp Asn Val Lys Val
370 375 380
Asp Ala Lys Ile Gly Ser Ile Val Gly Val Asn Arg Asn Tyr Gly Asp
385 390 395 400
Lys Ala Thr Ile Lys Asn Leu Lys Ile Lys Asp Tyr Lys Ser Gly Ser
405 410 415
Pro Lys Val Cys Glu Glu Tyr Lys Gly Val Gln Lys Gly Ser Gly Glu
420 425 430
Ser Ser Lys Tyr Gly Glu Tyr Trp Asp Thr Ala Asn Cys Asp Val Ser
435 440 445
Lys Ser Asp Val Ser Ala Leu
450 455
<210> 99
<211> 60
<212> PRT
<213> Microbulbifer degradans
<400> 99
Met Pro His Glu Leu Leu Glu Pro Asp Glu Leu Leu Glu Leu Glu Leu
1 5 10 15
Glu Glu Leu Leu Val Leu Leu Val Leu Glu Glu Pro Leu Val Gly Thr
20 25 30
Glu His Ser Phe Thr Asp Leu Ala Gly Leu Gly Ser Leu Pro Lys Val
35 40 45
Asp Val Leu Gln Val Lys Val Pro Leu Ile Thr Leu
50 55 60
<210> 100
<211> 1368
<212> DNA
<213> Microbulbifer degradans
<400> 100
gtgataatga tgcgtaataa aatcctattg gcgcttgtat tgtgtggagc ttctgcctct 60
gcctttgcgg ctagtaatcg tcctagtggt tacacaacta tctgtaaaac cgatcaaact 120
tgttctgtaa gctcgtctac caacgttgcg ttcggcgctg ctggtaagtt tgtttacaaa 180
gtaattaacg gtacctttac ttgtaataca tctacttttg gcagcgatcc taaccctgct 240
aaatctgtaa aagaatgttc tgtacctact aatggttctt ctagcactag cagcaccagt 300
agttcttcta gctctagttc aagtagctca tcgggttcaa gcagctcatg tggcactggt 360
ggcggcgcaa ctgtatgttt aagtgcaggt ggcggcagta acgatatcga tttaacttgg 420
acagtatctg gttctatttc tagcgctcag gtttaccgcg acacagattc taaccctagt 480
ggtcgcacac gtattgctca attaggtggc gatgcaagaa gctatagcga tacgaatgtt 540
agtgctggta agcagtacta ctactggatt aagtttggcg ccaacggctc taactacaat 600
tcgaatgcgg cttctgctac ctatagtggt tcaagtagct catcgagttc ttcaagctct 660
tctagttcct catctggcgg ctcggctgaa tgtaaagctg gcgctactat ttctggtaaa 720
accgtagatt gcggtggtaa agaaattggc ttgtcgtgct cgggtgatag tgaaactcaa 780
ccaccagtat taacgcttaa aaatgccacc attaaaaact tggtaatttc tgctaaaggt 840
gggtccgacg gtattcactg tactggcaac tgcaccatgg aaaatgttgt ttggaaagat 900
atctgtgaag atgctgctac caacaaaacc gacggtatta ccatgaccat tattggtggt 960
agtgcgtata actctacaag cggttacggc ggcaagccag ataaggtttt ccaacataac 1020
tctaaaaaca gtactactgt aattaaaggc ggctttacat taacaggtga gcacggcaaa 1080
ttgtggcgtt catgtggtaa ctgtactaat aacggcggcc cacgtaatgt gactatcgac 1140
aacgttaaag tagacgcgaa aataggcagt attgttggcg ttaaccgcaa ctatggcgat 1200
aaggcaacaa tcaaaaactt aaagattaaa gactacaaat ctggtagccc caaagtgtgt 1260
gaagaataca agggtgtaca aaagggtagt ggcgagtctt ctaagtatgg cgaatactgg 1320
gatactgcaa actgcgatgt aagtaaatca gatgtgtctg ctctttaa 1368
<210> 101
<211> 424
<212> PRT
<213> Microbulbifer degradans
<400> 101
Met Phe Asn Lys Ile Leu Val Ala Val Gly Leu Leu Ala Ala Ser Leu
1 5 10 15
Ser Val His Ala Ala Thr Asn Arg Pro Ser Gly Tyr Thr Thr Ile Cys
20 25 30
Lys Val Gly Glu Thr Cys Ser Val Ser Gln Ser Thr Asn Val Ala Phe
35 40 45
Gly Ala Ser Gly Gln Phe Val Tyr Lys Val Leu Asn Gly Ser Phe Ser
50 55 60
Cys Ser Val Ser Thr Phe Gly Ser Asp Pro Ile Pro Ser Lys Ser Val
65 70 75 80
Lys Glu Cys Ser Ile Pro Ser Asn Gly Ser Ser Ser Ser Gly Ser Ser
85 90 95
Ser Ser Ser Ser Ser Ser Ser Ser Gly Ser Ser Ser Gly Gly Gly Cys
100 105 110
Gly Ser Gly Gly Gly Ser Thr Val Cys Leu Ser Ala Ser Gly Ser Ser
115 120 125
Asn Gly Ile Asn Leu Ser Trp Ser Val Ser Gly Ser Ile Ser Ser Val
130 135 140
Gln Leu Tyr Arg Asp Thr Asp Ser Asn Pro Ser Gly Arg Thr Arg Ile
145 150 155 160
Ala Ser Val Ser Ser Ser Thr Thr Ser Phe Ser Asp Thr Gly Ala Ala
165 170 175
Ser Gly Thr Thr Tyr Tyr Tyr Trp Val Lys Tyr Tyr Val Asn Gly Thr
180 185 190
Ala Tyr Asn Ser Gly Val Ala Ser Ala Val Arg Gly Ser Ser Ser Ser
195 200 205
Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Gly Gly Lys Gly
210 215 220
Ser Ser Cys Ser Ser Thr Gly Ser Gln Ser Val Ser Ser Thr Ile Lys
225 230 235 240
Val Thr Ser Gly Thr Tyr Asp Gly Gly Cys Lys Thr Phe Asn Pro Thr
245 250 255
Ser Ala Leu Gly Asp Gly Ser Gln Ser Glu Ser Gln Lys Pro Ala Phe
260 265 270
Arg Val Glu Asn Gly Ala Thr Leu Lys Asn Val Ile Ile Gly Asn Asn
275 280 285
Gly Val Asp Gly Ile His Val Tyr Asn Gly Gly Thr Leu Asn Asn Ile
290 295 300
Leu Trp Thr Asn Val Gly Glu Asp Ala Met Thr Val Lys Ser Glu Gly
305 310 315 320
Asn Val Thr Val Thr Asn Val Glu Gly Tyr Asp Gly Glu Asp Lys Phe
325 330 335
Ile Gln Val Asn Ala Val Thr Asn Leu Lys Val Ser Asn Cys Ile Val
340 345 350
Asn Lys Met Gly Lys Phe Leu Arg Gln Asn Gly Gly Lys Thr Phe Ala
355 360 365
Met Ser Val Ser Val Asp Asn Cys Asp Ile Ser Asn Met Gly Glu Gly
370 375 380
Ile Phe Arg Ser Asp Ser Pro Asn Ala Thr Ala Val Ile Thr Asn Ser
385 390 395 400
Arg Leu Arg Asn Ala Gly Asp Ile Cys Ile Gly Ala Trp Lys Ser Cys
405 410 415
Lys Ser Ser Asn Ile Ser Ser Phe
420
<210> 102
<211> 1275
<212> DNA
<213> Microbulbifer degradans
<400> 102
atgtttaaca agatactcgt tgcagtagga ttacttgcgg ctagcctttc tgtgcacgcc 60
gcaacaaacc gcccaagtgg ttatacaaca atttgtaagg ttggtgaaac atgctcggta 120
agtcagtcta cgaatgtagc ctttggcgcg tctgggcagt ttgtgtataa agtattaaac 180
ggtagctttt cttgtagtgt ttctacgttt ggtagtgacc ctattccttc taaatctgta 240
aaagaatgtt caatcccatc aaacggctct agctcttctg gctcgtcttc atcttcgtct 300
agcagctctt ccggtagctc ttctggtggt ggctgtggca gcggtggtgg ttctacggtg 360
tgcttatcgg cctcgggttc tagcaatggt atcaatttaa gttggtctgt atctggttct 420
atatcttccg tgcagcttta tcgcgatacc gattcaaacc caagcggtcg cacgcgtatt 480
gctagtgtat ctagctctac tactagcttt agtgataccg gcgcggcatc gggcaccact 540
tattactact gggttaaata ttatgtaaat ggtactgctt acaactcggg tgttgcttct 600
gcggtgcgcg gttcttctag ctctagtagt tcaagttctt ccagcacttc tagcagttct 660
ggtggaaaag gttctagttg tagctctact ggtagccaat ctgtgtcttc tactattaag 720
gtaaccagcg gtacttacga tggtggttgt aaaacattta accctaccag tgctttgggt 780
gatggtagcc aatctgaaag ccaaaaacct gctttccgtg tagaaaacgg tgcaacgtta 840
aagaatgtaa ttattggcaa taacggtgtg gatggtattc acgtttacaa cggcggtacg 900
ttaaataata ttctttggac taacgtaggt gaagatgcca tgaccgttaa gtctgaaggt 960
aacgtgacgg taaccaatgt tgaaggctat gacggcgaag ataagtttat tcaggtaaac 1020
gcagtgacta acttaaaagt ttctaactgt attgtgaata aaatgggtaa gtttcttcgt 1080
cagaatggtg gtaaaacatt tgccatgtcg gtaagtgtag ataactgcga tatatctaat 1140
atgggtgaag gtatcttccg ttcagacagc ccgaacgcta cagcggttat tactaacagc 1200
cgtttacgca acgctgggga tatttgtatt ggggcttgga aaagttgtaa atcttccaat 1260
atcagcagct tttaa 1275
<210> 103
<211> 392
<212> PRT
<213> Microbulbifer degradans
<400> 103
Met Lys Lys Leu Ile Leu Met Val Ala Leu Leu Ala Phe Ser Val Ser
1 5 10 15
Ser Phe Ala Ala Leu Ser Ser Gly Arg Tyr Ile Ile Val Ser Lys Leu
20 25 30
Asn Gly Asn Ala Leu Asp Val Asp Ser Phe Ser Thr Ala Asp Gly Ala
35 40 45
Asn Val Met Gln Trp Phe Ala Leu Gly Gly Val Asn Gln Gln Phe Asp
50 55 60
Val Ala Val Leu Ser Asp Gly Ser Tyr Ser Ile Arg Pro Val His Ser
65 70 75 80
Gly Lys Ser Leu Asp Val Tyr Ala Trp Asn Ala Asp Asp Gly Ala Glu
85 90 95
Leu Arg Gln Trp Ala Tyr Thr Gly Ala Asp Asn Gln Arg Trp Tyr Ile
100 105 110
Asp Asn Gln Ser Gly Asp Tyr Tyr Ser Ile Thr Ser Lys Phe Ser Gly
115 120 125
Arg Ala Leu Asp Val Trp Gly Met Ser Met Tyr Thr Gly Ala Asp Val
130 135 140
Arg Leu Tyr Ser Tyr Trp Gly Gly Ala Gly Gln Leu Trp Thr Phe Gln
145 150 155 160
Lys Val Gly Ser Ser Ser Glu Cys Tyr Ala Gly Ala Thr Leu Thr Asn
165 170 175
Arg Phe Val Asp Cys Gly Gly Lys Thr Ile Gly Leu Ser Cys Val Gly
180 185 190
Asp Ser Glu Thr Gln Gly Ala Val Leu Thr Leu Lys Asn Ser Ser Ile
195 200 205
Arg Asn Val Lys Leu Ala Ala Asn Gly Gly Ala Asp Gly Ile His Cys
210 215 220
Thr Ser Gly Asn Cys Thr Leu Ala Asp Val Val Trp Asn Asp Ile Cys
225 230 235 240
Glu Asp Ala Ala Thr Asn Lys Ser Glu Gly Gly Thr Leu Thr Ile Val
245 250 255
Gly Gly Ser Ala Tyr Asn Ser Thr Gly Gly Tyr Gly Gly Thr Pro Asp
260 265 270
Lys Ile Phe Gln His Asn Ser Lys Asn Ser Thr Thr Ile Val Ala Gly
275 280 285
Gly Phe Thr Ala Tyr Gly Thr His Gly Lys Leu Trp Arg Ser Cys Gly
290 295 300
Asn Cys Thr Asn Asn Gly Gly Pro Arg Asn Leu Leu Val Tyr Ser Val
305 310 315 320
Asn Ile Asp Ala Ser Ile Gly Ala Ile Ala Gly Val Asn Arg Asn Tyr
325 330 335
Gly Asp Arg Ala Thr Ile Arg Asp Leu Lys Ile Lys Asn Tyr Ser Ser
340 345 350
Gly Ser Pro His Val Cys Asp Glu Tyr Gln Gly Val Gln Lys Gly Asn
355 360 365
Ser Ser Thr Lys Tyr Gly Glu Tyr Trp Asn Thr Ala Ser Cys Asp Val
370 375 380
Ser Arg Ser Asp Val Ser Gly Leu
385 390
<210> 104
<211> 1179
<212> DNA
<213> Microbulbifer degradans
<400> 104
atgaaaaaac ttatccttat ggtggcgctg ttggctttta gtgttagttc ttttgctgca 60
ctgtcttcag gccgctacat tattgtttct aaacttaatg gcaacgcgtt agatgtagat 120
agctttagca ccgcagatgg cgccaatgtt atgcagtggt ttgctttggg tggtgtgaac 180
cagcagtttg acgtggcagt gcttagcgat ggcagttact ccatacgacc agtgcacagc 240
ggtaagtcat tagatgtata tgcgtggaac gcagacgatg gtgcggaact tcgtcagtgg 300
gcatacacag gcgcagataa ccaacgttgg tatatcgata atcaaagtgg cgattactat 360
tcaattacgt ctaaatttag cgggcgcgca ttggatgtat ggggtatgag tatgtacacc 420
ggcgcagatg tccgccttta ttcatattgg ggcggcgcgg ggcagctgtg gaccttccaa 480
aaggtaggta gctcaagtga gtgttacgca ggtgctacgt taacaaaccg ctttgtggat 540
tgtggcggca aaacaatagg ccttagttgt gtaggcgata gtgaaactca aggcgcggtg 600
ctaaccctta aaaactcgtc cattcgcaat gttaagttgg ctgcaaacgg tggtgcggat 660
ggcattcact gcactagtgg caactgcaca ttagccgacg ttgtttggaa cgatatttgt 720
gaagatgctg ccacgaataa gtctgaaggt ggcaccctga ctattgtggg tggttcggcg 780
tataactcta ctggcgggta tggtggtaca ccggataaaa tttttcagca caactcgaaa 840
aacagcacaa caattgttgc cggcggcttc actgcatatg gtacccacgg taagttgtgg 900
cgctcgtgtg gtaactgtac aaacaacggc ggtccgcgta atttactggt ttatagcgtg 960
aatattgacg caagtattgg cgcaattgct ggtgttaacc gcaattacgg cgatagagcg 1020
accattcgcg acctaaaaat aaagaattat tcttctggca gcccgcatgt gtgtgacgaa 1080
tatcaaggcg tacagaaggg caattcttct acaaaatatg gcgagtactg gaataccgca 1140
agttgtgatg tttcgcggtc agatgtaagt gggctttaa 1179
<210> 105
<211> 733
<212> PRT
<213> Microbulbifer degradans
<400> 105
Met Phe Arg Tyr Ile Leu Thr Ala Phe Ala Leu Val Ala Ala Ala Ser
1 5 10 15
Cys Ala Gln Ala Ala Thr Asn Arg Pro Ser Gly Tyr Thr Thr Ile Cys
20 25 30
Lys Thr Asn Gln Thr Cys Ser Val Ser Ser Pro Thr Asn Val Ala Phe
35 40 45
Gly Ala Ser Gly Lys Phe Thr Phe Lys Val Leu Asn Gly Ser Phe Val
50 55 60
Cys Ser Val Ala Thr Phe Gly Ser Asp Pro Asn Pro Ala Lys Ser Ala
65 70 75 80
Lys Glu Cys Ser Ile Pro Ser Asp Gly Ser Ser Ser Thr Ser Ser Thr
85 90 95
Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser
100 105 110
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
115 120 125
Ser Ser Ser Ser Ser Gly Ser Ser Gln Ala Gly Cys Gly Ser Gly Gly
130 135 140
Gly Ala Thr Val Cys Leu Ser Ala Thr Asp Thr Ala Ser Ala Ile Asn
145 150 155 160
Leu Asn Trp Thr Val Ser Gly Ser Leu Ser Ser Val Gln Val Tyr Arg
165 170 175
Asp Thr Asp Pro Asn Pro Ser Gly Arg Thr Arg Leu Thr Ser Leu Ser
180 185 190
Pro Ser Val Thr Ser Tyr Thr Asp Asn Asn Ala Gln Ala Gly Thr Thr
195 200 205
Tyr Tyr Tyr Trp Ile Lys Phe Gly Ala Asn Gly Ser Asn Tyr Asn Ser
210 215 220
Gly Ala Ala Ser Ala Val Ile Ala Asn Thr Gly Asn Asp Asp Glu Gly
225 230 235 240
Cys Gly Ser Asp Val Cys Leu Thr Ala Thr Ala Asn Ile Gly Ser Ile
245 250 255
Gly Leu Ser Trp Gly Ser Ser Ala Ala Leu Thr Ser Val Gln Ile Tyr
260 265 270
Arg Asp Thr Asp Ser Asn Pro Ser Gly Arg Thr Arg Ile Ala Ser Leu
275 280 285
Ser Thr Ser Ala Thr Ser Phe Thr Asp Ser Thr Thr Ala Val Gly Thr
290 295 300
Thr Tyr Tyr Tyr Trp Val Lys Tyr Gly Leu Asn Gly Ser Gln Leu Asn
305 310 315 320
Ser Asn Val Ala Ser Ala Thr Ala Leu Gln Asn Asn Thr Gly Asn Ala
325 330 335
Ser Cys Pro Gly Glu Thr Ser Gly Glu Thr Ala Ala Thr Val Tyr Tyr
340 345 350
Val Thr Pro Asn Gly Ser Ala Ser Ala Ser Gly Asn Ser Phe Ala Ser
355 360 365
Ala Met Asp Ile Asp Thr Ala Leu Ser Ile Val Gly Ala Gly Gln Met
370 375 380
Ile Leu Met Gln Pro Gly Thr Tyr Thr Val Ala Tyr Ser Ala Gly Asn
385 390 395 400
Lys Asn Thr Lys Val Leu Ser Arg Ser Gly Ala Ala Gly Ala Pro Ile
405 410 415
Lys Met Val Ala Ala Asn Cys Gly Arg Ala Val Phe Asp Phe Ser Phe
420 425 430
Pro Glu Arg Glu Trp Val Gln Asp Ser Tyr Gly Phe Phe Leu Thr Gly
435 440 445
Asp Tyr Trp Tyr Phe Lys Gly Ile Glu Ile Thr Arg Ala Gly Tyr Gln
450 455 460
Gly Val Tyr Val Thr Gly Ala His Asn Thr Phe Glu Asn Cys Ala Phe
465 470 475 480
Tyr Tyr Asn Arg Asn Thr Gly Leu Glu Ile Asn Lys Gly Gly Ser Tyr
485 490 495
Thr Thr Val Ile Asn Ser Asp Ala Tyr Arg Asn Tyr Asp Pro Lys Lys
500 505 510
Asn Gly Ser Met Ala Asp Gly Phe Gly Pro Lys Gln Thr Gln Gly Pro
515 520 525
Gly Asn Lys Phe Ile Gly Cys Arg Ala Trp Glu Asn Ser Asp Asp Gly
530 535 540
Phe Asp Leu Tyr Asp Ser Pro Glu Glu Val Thr Ile Glu Asn Ser Trp
545 550 555 560
Ala Phe Arg Asn Gly Val Asp Val Trp Gly Tyr Gly Gly Phe Ala Gly
565 570 575
Asn Gly Asn Gly Phe Lys Leu Gly Gly Asn His Val Ala Ala Asn Asn
580 585 590
Arg Ile Thr Asn Ser Val Ala Phe Gly Asn Pro Val Lys Gly Phe Asp
595 600 605
Gln Asn Asn Asn Ala Gly Gly Ile Thr Val Leu Asn Cys Thr Ala Tyr
610 615 620
Ala Asn Gly Thr Asn Tyr Gly Phe Gly Asn Asn Leu Asn Ser Gly Glu
625 630 635 640
Gln His Tyr Phe Arg Asn Asn Val Ser Val Ser Gly Ala Val Asn Ile
645 650 655
Ser Asn Ala Asp Asn Lys Tyr Asn Ser Trp Asn Gly Gly Val Thr Ala
660 665 670
Ser Thr Ala Asp Phe Glu Asn Val Asp Leu Ser Lys Ala Thr Ala Ala
675 680 685
Arg Asn Ile Asp Gly Ser Leu Pro Asn Asn Gly Leu Phe Arg Leu Lys
690 695 700
Ser Gly Ser Asp Leu Ile Asp Ala Gly Val Glu Val Gly Leu Pro Ser
705 710 715 720
Asn Gly Ser Ala Pro Asp Met Gly Ala Phe Glu Ala Asn
725 730
<210> 106
<211> 2202
<212> DNA
<213> Microbulbifer degradans
<400> 106
atgtttcgat acatccttac tgctttcgca ttggtggcag cggcttcctg cgcgcaagca 60
gctaccaatc gccctagcgg ttacaccacc atatgtaaaa ccaatcaaac ctgctctgtt 120
tcaagcccta ccaatgtggc gtttggcgca tcgggtaaat ttacctttaa agtgcttaat 180
ggctcttttg tttgtagcgt agccactttt ggctccgacc ctaacccggc taaaagtgct 240
aaagagtgct ctattccgtc ggatggctct tccagcacct ctagtacttc gagcacatcg 300
tctagttcta gtagttcatc tagcagcaca agttcaagca gcagctctag cagttcttct 360
agctcttcgt cttccagtag ctcaagttct agttcaagtg gctcttcgca agctggctgc 420
ggtagtggcg ggggagcaac ggtttgttta tcggcaaccg atacagctag tgctatcaat 480
ttaaattgga cagtgagtgg ctcgctgtcg agcgtgcagg tgtatcgcga taccgatccc 540
aacccaagtg gacgtacgcg gttaacgtcg ttaagccctt cggtaacaag ctacaccgac 600
aacaacgcac aggccggtac aacctactat tactggatta aattcggcgc aaacggcagc 660
aactataatt ccggtgctgc gtcggctgta atagccaata ccggtaacga tgatgaaggt 720
tgtggtagtg atgtgtgctt gacggcaacc gctaatattg gctctatagg cttaagctgg 780
ggctcttctg ccgctttaac cagcgtacaa atttatcgcg atacagattc aaacccaagt 840
gggcgtacac gtattgcatc gcttagcact tctgcaacca gctttaccga ttcaaccact 900
gcagtaggca caacgtatta ttactgggtt aaatacggct taaacggcag ccagctaaac 960
tctaatgttg catctgccac tgctttgcaa aataacactg gcaacgcaag ttgccccggt 1020
gaaacaagtg gcgaaactgc agcaaccgtg tattacgtaa caccaaacgg ttcggctagt 1080
gcaagcggca atagctttgc atctgcaatg gatatagaca cagcactttc aattgtaggc 1140
gcggggcaaa tgatattaat gcagcctggt acctacaccg ttgcctatag tgcgggtaat 1200
aaaaatacca aagtgctttc gcgctcgggt gccgctggtg cacctatcaa aatggtggca 1260
gccaattgcg gtcgtgcagt gtttgatttt tcgttcccag aacgtgagtg ggtgcaagat 1320
tcttacggct ttttcttaac tggcgattac tggtatttta aaggaataga aattacccgt 1380
gcaggctacc aaggtgtgta tgtaacgggt gcgcacaaca catttgaaaa ttgcgccttt 1440
tattacaacc gcaatacagg tttagaaatt aacaaaggcg ggtcttacac caccgtcatt 1500
aattcagatg cctatcgcaa ttacgatccc aagaaaaacg gcagcatggc cgatggcttt 1560
ggccctaaac aaacccaagg cccaggcaat aaatttattg gttgccgcgc gtgggaaaac 1620
tccgacgatg gatttgacct gtacgatagc ccagaagaag taactattga aaatagctgg 1680
gcatttcgca acggtgtaga tgtatggggt tacggtggtt ttgcgggtaa tggcaacggc 1740
tttaaattgg gcggtaacca cgtggctgca aacaatcgta ttaccaactc ggttgcgttc 1800
ggcaaccccg taaaaggttt tgatcaaaac aataatgccg gcggtattac agtgcttaat 1860
tgcacagcct acgccaacgg cactaactac ggctttggca acaacttaaa ctcgggtgag 1920
caacactact tccgcaataa tgtttctgta tctggcgctg tgaatattag caatgccgac 1980
aacaaataca attcgtggaa cggcggagta acagcatcca cggcagattt tgaaaacgta 2040
gatttatcca aagccaccgc tgcacgtaac atagatggca gcctgccaaa caacggccta 2100
ttccgcttaa aaagcggcag cgatttaata gacgccggtg tagaggtagg tttaccaagt 2160
aacggcagcg cgcccgatat gggagcgttc gaagctaact ag 2202
<210> 107
<211> 700
<212> PRT
<213> Microbulbifer degradans
<400> 107
Met Lys Asn Val Phe Asn Thr Gln Lys Thr Lys Arg His Cys Asn Tyr
1 5 10 15
Ala Tyr Asn Ala Lys Ala Ala Lys Pro Phe Ser Gln Lys Ala Leu Val
20 25 30
Gln Lys Cys Ala Ala Ala Ala Leu Ser Val Gly Leu Leu Gly Ala Val
35 40 45
Gly Asn Ala Tyr Ala Ile Ser Cys Ser Ala Thr Ala Asp Thr Trp Gly
50 55 60
Gly Gly Tyr Val Leu Asn Val Thr Val Thr Asn Asp Thr Asn Asn Ala
65 70 75 80
Ile Ser Asn Trp Ala Leu Ala Leu Asn Tyr Asp Gln Ala Ala Ala Ile
85 90 95
Thr Asn Ser Trp Asn Ala Ser Val Ser Ala Asn Gly Asn Val Val Asn
100 105 110
Ala Thr Asn Ile Gly Trp Asn Gly Asn Leu Ala Ala Gly Gln Ser Thr
115 120 125
Ser Phe Gly Leu Gln Gly Thr Tyr Thr Gly Asn Phe Ser Leu Pro Val
130 135 140
Cys Val Gly Gln Gly Gln Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser
145 150 155 160
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
165 170 175
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
180 185 190
Ser Ser Ser Ser Thr Ser Ser Thr Gly Gly Ser Ser Glu Leu Thr Ile
195 200 205
Gln Glu Asp Asn Ser Gly Phe Cys Gly Val Asp Gly Ser Ile Asp Ser
210 215 220
Asn Asn Ser Gly Phe Thr Gly Ser Gly Phe Ala Asn Thr Asp Asn Ala
225 230 235 240
Thr Gly Lys Ser Val Asp Trp Ser Val Ser Val Pro Tyr Ser Gly Asn
245 250 255
Tyr Leu Leu Glu Trp Arg Tyr Ala Asn Gly Ser Gly Asn Asn Arg Ala
260 265 270
Gly Ala Ile Glu Val Asn Gly Asn Ala Arg Gly Asn Gln Ser Phe Pro
275 280 285
Thr Thr Gly Ala Trp Thr Ser Trp Thr Thr Ala Ser Ala Asn Val Ser
290 295 300
Leu Asp Ala Gly Thr Asn Leu Ile Ser Leu Val Ala Ser Thr Gly Glu
305 310 315 320
Gly Leu Gly Asn Ile Asp Ser Leu Thr Val Ile Gly Asn Asp Ile Gln
325 330 335
Thr Gly Ala Cys Asp Ser Thr Gly Ser Ser Ser Ser Ser Ser Ser Ser
340 345 350
Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser
355 360 365
Ser Ser Ser Ser Gly Ala Pro Met Leu Pro Gln Ala Gly Asn Pro Ile
370 375 380
Asn Gly Lys Phe Gly Lys Tyr Lys Ser Trp Gln Lys Gly Ser Leu Ser
385 390 395 400
Ala Asp Lys Gln Phe Ala Asp Ile Leu Leu Ser His Gln Tyr Thr Asn
405 410 415
Gly Gly Phe Pro Lys Asn Gln Ala Tyr Asp Ser Met Gly Ser Gly Gly
420 425 430
Asn Ser Ala Gly Thr Ile Asp Asn Asp Ala Thr Thr Thr Glu Leu Leu
435 440 445
Phe Leu Ala Asp Val Tyr Gln Arg Thr Gly Glu Thr Lys Tyr Arg Asp
450 455 460
Gly Ala Arg Lys Ala Leu Asp Phe Leu Leu Asp Met Gln Tyr Ser Ser
465 470 475 480
Gly Gly Trp Pro Gln Tyr Tyr Pro Val Arg Ser Gly Tyr Tyr Glu His
485 490 495
Val Thr Phe Asn Asp Asp Ala Met Ala Arg Val Leu Ile Val Leu Asp
500 505 510
Lys Ala Lys Gln Gly Val Ala Pro Leu Asn Gly Asp Leu Leu Thr Ser
515 520 525
Asn Gln Arg Ala Arg Leu Ser Ser Ala Val Asn Lys Gly Val Asp Tyr
530 535 540
Ile Leu Lys Ser Gln Trp Arg Gln Asn Gly Thr Leu Thr Val Trp Cys
545 550 555 560
Ala Gln His Gly Lys Asp Asp Tyr Leu Pro Lys Lys Ala Arg Ala Tyr
565 570 575
Glu Leu Glu Ser Leu Ser Gly Ser Glu Ser Val Leu Val Val Ala Phe
580 585 590
Leu Met Ser Gln Pro Gln Thr Pro Glu Ile Lys Thr Ala Val Lys Ala
595 600 605
Ala Ile Asn Trp Phe Arg Ser Pro Asn Thr Tyr Leu Ala Gly Tyr Thr
610 615 620
Tyr Asp Ser Ser Arg Lys Gly Asp Gly Asn Ser Pro Ile Val Ala Lys
625 630 635 640
Ser Gly Ser Lys Met Trp Tyr Arg Phe Tyr Asp Leu Asn Thr Asn Arg
645 650 655
Gly Phe Phe Ser Asp Arg Asp Ser Arg Lys Val Tyr Asp Ile Leu Asp
660 665 670
Ile Ser Thr Glu Arg Lys Asp Gly Tyr Arg Trp Gly Gly Asp Tyr Gly
675 680 685
Ser Gly Ile Ile Ser Tyr Ala Glu Ser Val Gly Tyr
690 695 700
<210> 108
<211> 72
<212> PRT
<213> Microbulbifer degradans
<400> 108
Met Val Asn Ser Leu Leu Pro Pro Val Glu Leu Val Asp Glu Asp Asp
1 5 10 15
Glu Leu Glu Asp Glu Glu Leu Leu Glu Leu Leu Glu Glu Leu Leu Glu
20 25 30
Glu Leu Glu Asp Glu Leu Leu Asp Glu Leu Glu Glu Leu Asp Glu Leu
35 40 45
Leu Leu Val Leu Leu Leu Glu Glu Glu Leu Trp Pro Trp Pro Thr Gln
50 55 60
Thr Gly Lys Leu Lys Leu Pro Val
65 70
<210> 109
<211> 2103
<212> DNA
<213> Microbulbifer degradans
<400> 109
atgaaaaacg tatttaatac acaaaaaacc aagcggcatt gcaactatgc ctataacgcg 60
aaagctgcca aacccttcag ccaaaaagca ctggtacaaa agtgtgcggc tgcggcattg 120
tctgttggct tactgggggc cgtgggtaat gcgtatgcaa tatcctgttc ggcaactgct 180
gatacctggg gtggtggcta tgtgctaaat gtgaccgtta ctaacgacac aaataatgca 240
attagcaatt gggcgctagc tttaaattac gatcaagctg cagccataac taattcgtgg 300
aacgcgagtg tgtctgcaaa tggcaatgtg gttaatgcta ccaacattgg ttggaatggc 360
aatttagcgg ctggccaaag tacaagtttt ggtttgcaag gcacctatac cggcaacttt 420
agtttacctg tctgtgttgg ccagggccaa agctcttcct ctagtagcag tactagtagt 480
agttcgtcca gctcttcgag ttcatccagc agttcatctt ccagctcttc aagtagttct 540
tcaagcagct ctagtagttc ttcatcttct agctcatcgt cttcgtccac cagttcaact 600
ggtggaagta gtgagttaac cattcaagaa gataatagcg gcttttgcgg ggttgatggt 660
tcaatcgatt ccaataattc aggctttacc ggaagcggct tcgccaatac cgataacgcg 720
acaggcaaaa gcgtagattg gagtgtgagc gttccttata gtggcaacta tttgcttgaa 780
tggcgttacg caaatggttc cggtaataac cgtgctggcg cgattgaagt aaatggtaac 840
gctcgtggta atcaaagctt tcctactaca ggggcctgga ctagctggac aacggcaagt 900
gctaacgtaa gtttggatgc aggtacaaac ttaattagct tggttgcatc tacgggcgaa 960
ggcttaggga atattgattc acttactgtc attggtaacg atattcagac cggcgcttgt 1020
gattctacag ggtctagctc ttccagtagc tctagttcct ccagtacttc tagctcaagt 1080
tccagctcca gttcaagtac cagtagttct agcagcggtg cgcccatgtt acctcaagct 1140
gggaacccca ttaatggcaa gtttggcaaa tataaatctt ggcaaaaagg aagcttgtct 1200
gccgataagc aatttgcaga catccttcta tcgcaccaat ataccaatgg cggatttccc 1260
aaaaaccaag cctacgacag tatgggtagc ggtggtaaca gtgcgggcac aatcgacaac 1320
gatgccacaa caacagagtt gttattctta gctgatgtgt accagcgtac tggtgaaacc 1380
aaataccgag acggtgcgcg taaagcgtta gatttccttt tggatatgca gtattcatcg 1440
ggcggctggc cgcaatacta ccctgtgcgc agtggctact acgagcatgt aacatttaac 1500
gacgatgcaa tggcgcgagt gctaattgtt ttagataaag cgaaacaagg tgtggcgccg 1560
ctaaatggcg atctattaac atctaaccag cgtgcgcgtt taagcagtgc ggttaataag 1620
ggcgtggatt acattcttaa atcgcagtgg cgtcaaaacg gaaccttaac tgtttggtgt 1680
gcgcaacatg gtaaagatga ttatctacct aaaaaggcgc gtgcttacga gctggaatcg 1740
ttgagtggta gcgaatcggt attggtagtt gcattcctta tgtctcaacc tcaaacccct 1800
gaaattaaaa cggcggttaa ggctgctatc aattggttta gaagccccaa tacttactta 1860
gctggttaca cttacgattc atctagaaaa ggcgacggca acagccccat cgtcgcgaaa 1920
agcggtagta aaatgtggta tcgcttctac gacctaaata ctaaccgtgg cttcttcagt 1980
gatagagata gcagaaaagt ttacgatatt ttagatattt ctacagagcg taaagatggc 2040
tatcgttggg gcggtgatta tggctccggc atcattagtt acgcggaaag cgttggttac 2100
taa 2103
<210> 110
<211> 96
<212> PRT
<213> Microbulbifer degradans
<400> 110
Met Ser Val Pro Ala Leu Pro Pro Glu Glu Leu Leu Glu Leu Glu Asp
1 5 10 15
Glu Leu Leu Glu Leu Glu Asp Glu Leu Leu Glu Leu Asp Glu Leu Leu
20 25 30
Glu Leu Asp Glu Leu Leu Glu Leu Val Glu Leu Leu Glu Leu Glu Glu
35 40 45
Leu Val Glu Ala Phe Ile Ser Leu Ala Gly Pro Glu Leu Pro Pro Pro
50 55 60
Gln Ala Leu Asn Ser Val Ala Ala Ser Ala Ser Val Ile Arg Arg Arg
65 70 75 80
Lys Phe Ile Ile Val Ala Leu Tyr Val Phe Ile Leu Arg Ala His Lys
85 90 95
<210> 111
<211> 574
<212> PRT
<213> Microbulbifer degradans
<400> 111
Met Asn Phe Leu Arg Leu Ile Thr Leu Ala Leu Ala Ala Thr Leu Leu
1 5 10 15
Ser Ala Cys Gly Gly Gly Ser Ser Gly Pro Ala Lys Glu Ile Asn Ala
20 25 30
Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser
35 40 45
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
50 55 60
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Gly Ser Ala Gly
65 70 75 80
Thr Leu Thr Ile Gln Glu Ser Glu Lys Gly Phe Cys Thr Val Asn Gly
85 90 95
Glu Ile Val Asn Asn His Glu Gly Tyr Ser Gly Thr Gly Phe Val Asp
100 105 110
Thr Ala Asn Ala Glu Gly Ala Ser Ile Thr Trp Lys Val Asp Val Asp
115 120 125
Gly Gly Asn Tyr Asp Val Ser Val Arg Phe Ala Asn Gly Ser Thr Ala
130 135 140
Arg Gly Ala Thr Leu Ser Ser Asn Glu Ile Asn Thr Thr Tyr Gly Phe
145 150 155 160
Ala Thr Thr Gly Asp Trp Ala Thr Trp Ala Asp Glu Thr His Thr Val
165 170 175
Ser Leu Ala Ala Gly Glu Asn Thr Ile Gln Leu Ser Ala Leu Thr Ala
180 185 190
Gly Gly Leu Pro Asn Val Asp Ala Ile Thr Ile Ala Gly Ala Gly Val
195 200 205
Leu Ala Ala Asp Cys Ala Thr Glu His Thr Gly Pro Met Leu Ser Gln
210 215 220
Thr Gly Asn Pro Ile Tyr Thr Glu Leu Asn Asn Tyr Lys Ser Trp Leu
225 230 235 240
Thr Gly Ser Gly Thr Thr Ala Ala Lys Leu Ala Ala Asp Lys Thr Ile
245 250 255
Ala Asp Asn Met Ile Thr Trp Gln Met Pro His Gly Gly Phe Tyr Lys
260 265 270
Tyr Gly Val Ser Lys Tyr Ser Ser Ala Trp Asn Gly Ser Asp Ala Arg
275 280 285
Ser Gly Trp Thr Gly Ala Asn Gly Val Glu Leu Gly Thr Ile Asp Asn
290 295 300
Asp Ala Thr Val Ser Glu Leu Leu Phe Leu Ala Asp Val Tyr Lys Arg
305 310 315 320
Ser Gly Glu Thr Lys Tyr Arg Asp Ala Ala Arg Ser Ala Leu Glu Phe
325 330 335
Leu Leu Thr Met Gln Tyr Ser Thr Gly Gly Trp Pro Gln Val Tyr Pro
340 345 350
Ala Arg Thr Gly Thr Ser Tyr Ser Asn His Val Thr Phe Asn Asp Asn
355 360 365
Ala Met Ala Arg Val Leu Ile Leu Leu Asp Lys Ala Ala Arg Leu Glu
370 375 380
Ala Pro Leu Asp Gly Asp Ile Phe Thr Thr Asp Gln His Thr Arg Ile
385 390 395 400
Thr Thr Ala Ile Asn Gly Gly Ile Asp Phe Ile Leu Asn Ala Gln Ile
405 410 415
Val Gln Gly Asp Val Lys Thr Val Trp Cys Ala Gln His Asp Pro Tyr
420 425 430
Thr Tyr Glu Ala Lys Ala Ala Arg Ser Tyr Glu Leu Ala Ser Lys Ser
435 440 445
Gly Lys Glu Ser Val Leu Val Val Ala Phe Leu Met Thr Arg Pro Gln
450 455 460
Ser Glu Ala Ile Glu Asn Ala Val Lys Ala Ala Leu Ala Trp Tyr Arg
465 470 475 480
Asn Pro Asn Val Gln Val Ala Asn Thr Glu Tyr Val Lys Arg Thr Asn
485 490 495
Asn Asp Asp Asn Tyr Asn Pro Ile Gln Thr Lys Ala Gly Ser Thr Met
500 505 510
Trp Tyr Arg Phe Tyr Asp Leu Asp Gln Asp Val Gly Phe Phe Ser Gly
515 520 525
Arg Ser Ala Ser Asp Asn Pro Ala Gly Asn Gly Lys Gln Tyr Asp Ile
530 535 540
Met Leu Ile Glu Pro Glu Arg Arg Tyr Gly Tyr Glu Trp Gly Gly Asn
545 550 555 560
Tyr Gly Lys Lys Ile Ile Asp Tyr Ala Asn Ser Val Gly Tyr
565 570
<210> 112
<211> 1725
<212> DNA
<213> Microbulbifer degradans
<400> 112
atgaattttc tacgccttat tacactcgca ctcgccgcaa cactattaag tgcctgcggt 60
ggaggtagtt ctggcccagc caaagagata aacgcttcca ccagttcctc tagctctagc 120
agctccacga gttctagcag ctcatccagc tctagtagtt cgtctagctc aagcagctcg 180
tcttccagtt ccagcagctc atcctctagc tctagcagct cttctggtgg tagcgccggc 240
acactcacta ttcaggaaag cgaaaaaggc ttttgcactg ttaacggtga gattgtgaat 300
aatcacgagg ggtatagcgg tacaggtttt gtagacaccg ccaatgccga aggcgccagc 360
attacttgga aggtagatgt cgatggcggc aattatgatg taagtgtgcg attcgcaaac 420
ggttccactg cacgcggcgc aacgcttagc agtaacgaaa taaataccac ctatggtttt 480
gctaccactg gcgactgggc cacatgggca gacgaaaccc acaccgtttc gttagccgca 540
ggcgaaaaca ccatccagct aagtgcactt accgcgggtg gcttacccaa tgtagacgct 600
attactattg caggtgcagg tgtactcgca gcagactgcg ccacagagca tactgggcca 660
atgctttcgc aaacaggcaa ccctatttat accgagttaa ataattacaa gtcctggcta 720
acgggtagcg gcacaacagc agccaaatta gccgcagata aaacaattgc cgacaatatg 780
attacctggc aaatgcctca cggtggtttt tataaatacg gcgtatctaa atacagctcg 840
gcttggaacg gtagcgatgc ccgctctggc tggactgggg ccaacggcgt tgagcttggc 900
acaattgata atgatgcaac cgttagcgaa ttattatttt tagcggacgt atataaacgc 960
agcggtgaaa ctaaatatag agatgccgca agaagcgcgt tggaattttt acttaccatg 1020
caatattcca ctggcggttg gccacaggtt taccctgcgc gcactggcac cagttactcc 1080
aatcacgtta cgtttaacga taacgccatg gctcgtgtac ttattttatt ggataaagcc 1140
gcgcgattag aagcaccact cgatggcgac atttttacca cagaccagca cacgcgtatt 1200
actaccgcaa taaatggcgg catcgatttt attttgaatg cgcaaatagt acagggcgac 1260
gtgaaaaccg tttggtgtgc gcaacacgac ccttatacct acgaggcaaa agcagctcgc 1320
tcttatgagt tggcctctaa aagcggtaaa gaatctgtat tggttgtagc gtttttaatg 1380
acacgcccgc aaagcgaagc catagaaaat gccgtgaaag cagcccttgc ttggtaccgc 1440
aacccaaatg ttcaagtcgc caacaccgag tatgtaaaac gcacaaataa cgatgacaac 1500
tacaacccga tacaaacgaa agcaggtagc actatgtggt accgctttta cgatttagac 1560
caagacgttg gattctttag cggccgctct gcaagtgaca acccagcagg taacggtaag 1620
caatacgaca ttatgcttat tgaacccgag cgcaggtatg gctatgaatg gggtggcaat 1680
tacggcaaaa aaataatcga ttacgctaat tcggtagggt attaa 1725
<210> 113
<211> 463
<212> PRT
<213> Microbulbifer degradans
<400> 113
Met Tyr Lys Ile Ser Arg Arg Thr Thr Leu Lys Gly Leu Gly Leu Thr
1 5 10 15
Cys Leu Ala Gly Cys Thr Thr Ser Leu Pro Thr Leu Glu Gln Asp Pro
20 25 30
Trp Ala Phe Ala Gln Asn Ile Ala Asp Asn Thr Thr Ile Pro Thr Phe
35 40 45
Pro Asn Lys Glu Phe Asn Leu Leu Glu Phe Gly Gly Lys Glu Gly Ser
50 55 60
Asp Asn Thr Leu Ala Phe Lys Lys Ala Ile Ala Ala Cys Ser Lys Ala
65 70 75 80
Gly Gly Gly Lys Val Val Val Pro Ala Gly Arg Phe Glu Thr Gly Ala
85 90 95
Ile His Leu Glu Ser Asn Val Asn Leu His Ile Ser Glu Gly Ala Thr
100 105 110
Ile Ala Phe Phe Thr Asp Pro Lys Tyr Tyr Leu Pro Ala Val Phe Thr
115 120 125
Arg Trp Glu Gly Met Glu Cys Met Gly Tyr Ser Pro Leu Ile Tyr Ala
130 135 140
Tyr Gly Lys Thr Asn Ile Ala Ile Thr Gly Lys Gly Thr Leu Asp Gly
145 150 155 160
Gln Ala Asp Pro Thr His Trp Trp Ala Trp Lys Gly Asn Lys Glu Trp
165 170 175
Gly Val Glu Gly Tyr Pro Ser Gln Lys Glu Ser Arg Asn Gln Leu Phe
180 185 190
Ala Gln Ala Glu Ala Gly Asp Pro Val Arg Glu Arg Val Tyr Ala Asp
195 200 205
Gly His Tyr Leu Arg Pro Ser Phe Val Gln Pro Tyr Lys Cys Glu Asn
210 215 220
Val Leu Ile Glu Asp Ile Thr Ile Ile Asn Ala Pro Phe Trp Leu Leu
225 230 235 240
His Pro Thr Leu Ser Gln Asn Val Thr Val Arg Gly Val His Leu Glu
245 250 255
Ser Leu Gly Pro Asn Ser Asp Gly Cys Asp Pro Glu Ser Cys Lys Asn
260 265 270
Val Val Ile Glu Asn Cys Phe Phe Asn Thr Gly Asp Asp Cys Ile Ala
275 280 285
Ile Lys Ser Gly Arg Asn Asn Asp Gly Arg Arg Leu Ala Thr Pro Thr
290 295 300
Glu Asn Val Ile Ile Arg Asn Cys Lys Met Glu Ala Gly His Gly Gly
305 310 315 320
Val Val Ile Gly Ser Glu Ile Ser Gly Gly Val Arg Asn Val Phe Ala
325 330 335
Glu Asn Asn Val Met Ser Ser Pro Asp Leu Glu Lys Gly Ile Arg Ile
340 345 350
Lys Thr Asn Ser Val Arg Gly Gly Leu Leu Glu Asn Ile Tyr Val Arg
355 360 365
Asn Cys Thr Ile Gly Glu Val Gln Gln Ala Ile Val Ile Asn Phe Gln
370 375 380
Tyr Glu Glu Gly Asp Ala Gly Lys Phe Asp Pro Thr Val Arg Asn Val
385 390 395 400
Glu Ile Arg Asn Leu Val Cys Gln His Ala Leu Gln Val Phe Asn Ile
405 410 415
Arg Gly Phe Glu Arg Ala Pro Ile Gln Asn Phe Arg Ile Ile Asp Ser
420 425 430
Thr Phe Val Arg Gly Asp Asn Pro Gly Val Ile Glu His Thr Thr Gly
435 440 445
Leu Val Ile Asp Asn Val Gln Val Asn Gly Lys Ala Phe Asn Ile
450 455 460
<210> 114
<211> 1392
<212> DNA
<213> Microbulbifer degradans
<400> 114
atgtataaaa tttcacgccg cacaacactc aaaggcttag gcctaacttg cctagccggc 60
tgcaccacca gcctacccac actagagcaa gacccatggg cttttgcaca aaacatagcg 120
gacaacacca ccatccccac attcccaaac aaagaattta atttactcga attcggcggc 180
aaagaaggga gcgacaacac cctcgccttc aaaaaagcga ttgcagcatg cagcaaagca 240
ggtggcggca aggtggtagt acccgcagga cgatttgaga caggcgccat ccacttagag 300
tcgaacgtta accttcatat tagcgaaggc gctaccatcg ccttttttac cgaccccaaa 360
tattacctgc ctgcggtttt cactcgctgg gaaggcatgg agtgcatggg ctactcaccc 420
cttatatacg cctacggcaa aaccaacata gccattaccg gtaaaggcac cctcgacggt 480
caagccgacc caacgcactg gtgggcatgg aaaggcaaca aagaatgggg cgtagagggc 540
tacccaagcc aaaaggaaag ccgcaaccaa ctatttgccc aagcagaagc tggcgacccc 600
gttagagagc gcgtgtatgc agacggccac tacctgcgcc cctcgtttgt gcaaccctac 660
aagtgcgaaa acgtgctgat agaagacata actattatca acgctccctt ctggttgcta 720
caccccaccc tttcacaaaa cgtcactgta cgcggtgttc acctagaaag cctaggcccc 780
aactcggatg gctgcgatcc tgaaagctgt aagaatgtag ttatcgaaaa ctgctttttt 840
aataccggtg acgactgtat cgctattaaa tctggccgca acaacgatgg ccgcaggctt 900
gccacaccta ccgagaacgt gattattcgc aactgtaaaa tggaagcggg tcacggtggc 960
gtagttatag gctcagaaat ttctggcggc gtgcgcaatg tgtttgccga aaataacgta 1020
atgagcagcc ccgatttaga gaaaggcatt cgcattaaaa ccaactctgt gcgcggcgga 1080
ctgctagaga acatctatgt gcgcaactgc accataggcg aagtacaaca agccattgtt 1140
attaacttcc aatacgaaga aggcgatgcg ggtaaatttg accccaccgt gcgcaatgta 1200
gaaatacgca atttggtctg ccagcacgcc ttacaagtgt ttaacatccg cggttttgag 1260
cgcgccccca ttcaaaactt taggataatc gacagcacct ttgtgcgtgg tgacaaccca 1320
ggcgtaattg aacataccac agggttagtt atcgacaacg tccaagtcaa cggcaaagcg 1380
tttaacatct ag 1392
<210> 115
<211> 1084
<212> PRT
<213> Microbulbifer degradans
<400> 115
Met Leu Asp Met Thr Lys Arg Thr Leu Ser Ala Leu Leu Ala Leu Cys
1 5 10 15
Ala Thr Leu Thr Ala Cys Gly Gly Gly Asp Ile Thr Ser Gly Gly Asp
20 25 30
Ala Ile Pro Ala Val Asn Gln Pro Ala Pro Val Gln Glu Pro Glu Pro
35 40 45
Glu Pro Glu Pro Gln Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro
50 55 60
Glu Pro Glu Gly Ala Trp Thr Cys Pro Glu Thr Gly Phe Tyr Phe Cys
65 70 75 80
Asp Asp Phe Glu Asp Gly Thr Phe Asp Asp Lys Trp Asp Asp Leu Ile
85 90 95
Ala Thr Tyr Asp Leu Pro Ser Pro Gly Val Phe Asp Ile Leu Asp Glu
100 105 110
Ala Ser Gly Lys Ser Leu Arg Phe Thr Ala Gly Thr Arg Gly Gly Asp
115 120 125
Leu Ala Asp Gly Glu Leu Ile Val Val Lys Asp Thr Ala Phe Glu Asn
130 135 140
Val Thr Asn Ala Asp Tyr Ser Leu Glu Tyr Arg Ile Arg Pro Arg Asn
145 150 155 160
Asn Gly Asn Thr Gly Asn Lys Tyr Leu His Ala Met Ser Arg Tyr Glu
165 170 175
Gly Pro Lys Glu Tyr Tyr Phe Gly Gly Leu Ser Met Gln Gly Ser Thr
180 185 190
Ala Ser Thr Gln Val Glu Ala Gly Phe Val Leu Pro Glu Asn Thr Thr
195 200 205
Ser Ile Ser Asn Arg Leu Val Gln Ala Lys Tyr Pro Leu Glu Leu Gly
210 215 220
Thr Thr Gly Met Ser Asp Gly Tyr Trp Tyr Glu Val Arg Phe Asp Met
225 230 235 240
Ile Gly Asn Thr Gly Thr Ile Tyr Leu Asp Gly Glu Pro Gln Gly Ser
245 250 255
Phe Thr Asp Ala Asp Gly Leu Tyr Pro Leu Thr Gly Lys Ile Gly Phe
260 265 270
Met Thr Tyr Asn Arg Ser Phe Glu Ile Asp Trp Val Arg Val Gly Asp
275 280 285
Pro Ala Ile Lys Pro Val Gln Leu Ser Leu Asp Tyr Ala Ser Pro Leu
290 295 300
Trp Glu Ala Ala Ala Asp Gln Asp Pro Leu Asn Val Thr Val Thr Ala
305 310 315 320
Ile Gln Ser Asp Gly Val Thr Ala Asp Thr Phe Thr Ala Val Ser Ser
325 330 335
Asp Thr Asn Val Val Thr Thr Ser Ile Ala Asn Asn Val Val Thr Ile
340 345 350
Thr Pro Val Ala Gln Gly Ser Ala Thr Val Thr Phe Thr Ala Gly Ser
355 360 365
Asp Ala Asn Arg Val Lys Thr Ile Asp Val Glu Ile Ala Arg Ala Phe
370 375 380
Val Met Ser Thr Thr Asp Tyr Gly Asp Ile Ala Ser Lys Val Thr Pro
385 390 395 400
Thr Val Gly Met Thr Asp Ala Asn Pro Asp Ala His Leu Ser Ile Thr
405 410 415
Phe Asp Ser Ala Pro Thr Leu Ser Gly Val Gly Ser Ile Arg Ile Tyr
420 425 430
Asn Ala Ala Asp Asp Ser Glu Val Asp Val Ile Arg Leu Thr Asp Glu
435 440 445
Ser Asp Ala Leu Gly Tyr Ala Gly Gln Ala Asn Lys Arg Glu Leu Asn
450 455 460
Thr Thr Pro Val Tyr Leu Asp Gly Asn Thr Leu His Val Ser Pro His
465 470 475 480
Ser Asn Ala Leu Ala Tyr Gly Gln Asp Tyr Tyr Val Ala Ile Gly Asp
485 490 495
Asn Val Leu Thr Gly Ala Thr Leu Asn Thr Ile Ala Phe Asp Gly Leu
500 505 510
Gly Lys Asn Ala Gly Trp Thr Phe Ser Thr Lys Ala Ser Ala Pro Thr
515 520 525
Gly Asn Thr Val Thr Val Asp Asp Asp Ala Ser Ala Asp Phe Ser Thr
530 535 540
Val Gln Gly Ala Leu Asn Tyr Ala Met Ala Asn Thr Thr Asp Asp Ser
545 550 555 560
Ile Thr Ile Asn Ile Ala Asn Gly Asn Tyr Tyr Glu Pro Leu Tyr Leu
565 570 575
Ala Glu Arg Asn Asn Val Thr Leu Lys Gly Glu Ser Arg Asp Gly Val
580 585 590
Val Ile His Tyr Asn Asn His Glu Ala Met Asn Gly Gly Ser Thr Gly
595 600 605
Arg Ala Asn Phe Tyr Val Ala Asn Ser Asp Met Leu Thr Leu Glu Thr
610 615 620
Leu Thr Leu Lys Asn Gly His Gln Arg Thr Gly Gly Gly Asp Gln Ala
625 630 635 640
Glu Thr Ile Tyr Phe Asn Ser Ser Ser Asn Thr Asp Arg Leu Ile Ala
645 650 655
Lys Gly Ala Ala Phe Ile Ser Glu Gln Asp Thr Leu Leu Leu Lys Gly
660 665 670
Tyr Asn Trp Phe Tyr Asn Ser Leu Val Val Gly Asn Val Asp Phe Ile
675 680 685
Trp Gly Tyr Ser Ala Val Thr Leu Phe Glu Glu Thr Glu Ile Arg Ser
690 695 700
Ile Ala Asp Ser Lys Pro Gly Ala Gly Asp Ser Gly Gly Tyr Ile Leu
705 710 715 720
Gln Ala Arg Thr Pro Leu Glu Thr Asp Leu Gly Phe Val Phe Leu Asn
725 730 735
Ser Glu Leu Thr Lys Ala Thr Gly Val Asn Gly Asn Glu Ile Gly Asp
740 745 750
Gly Lys Thr Tyr Leu Ala Arg Ser Gly Gly Ser Thr Gly Tyr Phe Asp
755 760 765
Asn Ile Ser Phe Ile Asn Thr Lys Met Gly Ser His Ile Ala Asp Ile
770 775 780
Gly Phe Ala Tyr Ala Asp Ile Asn Gly Gln Pro Ala Pro Asn Pro Ala
785 790 795 800
Val Ala Thr Ala Asp Ala Gly Trp Arg Glu Phe Gly Ser Met Asp Ser
805 810 815
Ala Gly Thr Ala Leu Asp Val Ser Ala Arg Cys Gly Asp Ser Gly Ser
820 825 830
Cys Ile Gln Leu Thr Gln Ala Gln Val Asp Ala Gln Tyr Cys Asn Arg
835 840 845
Ala Gln Ile Phe Ala Ser Trp Asn Asp Trp Thr Gly Trp Asp Pro Leu
850 855 860
Pro Glu Asp Thr Ser Asp Asp Ala Cys Ala Asp Pro Val Ile Pro Gly
865 870 875 880
Ala Val Thr Trp Thr Gly Ile Ala Met Ser Leu Gly Gly Ser Thr Thr
885 890 895
Ser Val Ser Gly Asn Ile Thr Glu Gln Thr Asp Ser Asn Ile Thr Phe
900 905 910
Thr Ala Asp Gly Gly Lys Phe Glu Ser Ser Lys Leu Ser Thr Tyr Phe
915 920 925
Ala Tyr Gln Glu Leu Thr Gly Asp Phe Val Ile Ser Ala Lys Ala Lys
930 935 940
Thr Ile Gly Leu Leu Arg Glu Asn Gly Ser Tyr Gln Phe Pro Thr Gly
945 950 955 960
Ile Leu Met Cys Val Cys Asp Ala Ala Ala Ala Thr Thr Gly Leu Met
965 970 975
Gly His Ala Ser Leu Asn Asp Ile Thr Val Asp Thr Thr Val Asn Leu
980 985 990
Val Ala Thr Tyr Gly His Ile Gln Thr Thr Ala Gly Ser Trp Asn Lys
995 1000 1005
Thr Gly Thr Thr Asp Val Thr Ala Gly Asp Asn Leu Tyr Ile Gln Leu
1010 1015 1020
Glu Arg Ala Gly Asn Ser Tyr Thr Ala Arg Tyr Ser Thr Asp Gly Gly
1025 1030 1035 1040
Ala Thr Tyr Ser Asn Ile Gly Gly Ser Ser Phe Thr Asp Thr Leu Pro
1045 1050 1055
Asp Thr Leu Lys Val Gly Phe Phe Ala Thr Pro Asn Asn Thr Gly Glu
1060 1065 1070
Gln Thr Phe Val Tyr Glu Asp Ile Gln Ile Thr Gln
1075 1080
<210> 116
<211> 3255
<212> DNA
<213> Microbulbifer degradans
<400> 116
atgctcgata tgacaaaacg aactctatct gcgttgttag ccctgtgcgc aacattaacg 60
gcttgcggtg gtggcgatat aaccagcggc ggcgatgcta taccggcagt aaaccaaccc 120
gccccagtac aagagcctga acctgaacct gaaccacaac cggaacccga acccgaaccc 180
gagcccgagc cagaaccaga gggcgcgtgg acctgcccag aaacaggctt ctacttctgt 240
gacgactttg aagacggcac gtttgatgac aagtgggacg atctcattgc cacatacgac 300
ctaccaagcc ctggtgtatt cgacatatta gacgaagcaa gcggcaaatc tttgcgcttt 360
acagcaggca cccgtggcgg tgacttagca gatggcgaac ttattgttgt aaaagataca 420
gcattcgaaa atgtaaccaa cgcagattac tccttagagt accgtattcg cccgcgcaac 480
aacggcaaca caggcaacaa gtacctgcac gctatgtcgc gctacgaagg ccctaaagaa 540
tattactttg gcggtttaag catgcaaggc tctactgcaa gtacgcaagt agaagcaggt 600
ttcgtattgc cagaaaacac cactagcatt agcaaccgct tggtgcaggc caagtacccg 660
ttagagctag gtacaacagg catgagcgac ggctactggt acgaagtacg cttcgatatg 720
ataggcaata caggcaccat ttacctagat ggcgaaccac aaggcagctt taccgatgcc 780
gatggccttt acccattaac aggtaaaatt ggctttatga cttacaaccg ctcattcgaa 840
attgattggg tgcgagtagg cgacccagct attaagcctg tacaactttc actggattac 900
gccagcccgc tatgggaagc agcggcagac caagacccgc taaacgtaac agttactgcc 960
atacaaagcg atggcgtaac agcagatacc tttaccgcag ttagcagcga taccaatgta 1020
gtaaccacaa gtattgcaaa taacgtagta accattaccc ctgtagctca aggtagtgcc 1080
accgtgacct ttaccgctgg ttcagatgct aatcgcgtta aaacaattga tgtagaaatt 1140
gcacgcgcgt ttgttatgtc tactaccgac tacggcgata tagcttctaa ggtaacacca 1200
actgttggta tgactgacgc caacccagac gcacatttaa gcattacatt cgatagcgca 1260
cctaccctaa gcggtgttgg ctctatacgt atatacaatg cagcagacga tagcgaagta 1320
gatgttattc gccttaccga cgaaagtgat gcattgggtt acgccggcca agccaacaag 1380
cgtgaattaa ataccacacc ggtttacttg gatggcaaca ccctacacgt tagcccacac 1440
agtaacgcac ttgcctacgg ccaagactac tacgttgcca ttggcgataa cgtacttacc 1500
ggcgcaacac taaacaccat tgcgtttgat ggtttaggta aaaacgcggg ttggactttc 1560
tctaccaaag cctctgcccc taccggcaac accgtaactg tagacgacga tgcaagtgca 1620
gatttcagca cagtacaagg tgcgttgaac tatgctatgg caaataccac ggacgattca 1680
atcaccatta acattgctaa cggcaactac tacgagccgc tatatctagc agagcgcaac 1740
aacgtaacgc taaaaggtga aagccgcgac ggcgttgtta ttcattacaa caaccacgaa 1800
gccatgaacg gtggcagcac tggccgcgca aacttctatg ttgccaactc agacatgcta 1860
accctagaaa cgctaaccct taaaaacggt catcagcgca ctggtggtgg cgaccaagca 1920
gaaactatct acttcaatag cagcagcaat accgatcgct taattgccaa aggcgctgct 1980
tttattagtg aacaagatac gctgttactt aaaggctaca actggttcta caactcgctt 2040
gtggtaggta acgtagactt tatttggggc tacagcgcag taaccttgtt tgaagaaaca 2100
gaaattcgat ctattgccga ctctaaacca ggtgcgggcg actcgggtgg ctatattctg 2160
caagcgcgta cgccactaga aacagacctt ggctttgttt tcttaaatag cgaattaaca 2220
aaagctaccg gcgtaaacgg taacgaaatt ggcgatggca aaacctacct tgcgcgcagc 2280
ggcggcagca cgggttactt cgataatatt tcgtttatta acaccaaaat gggtagccat 2340
attgccgaca taggcttcgc ctacgccgac attaacggtc aacctgcccc taacccagcg 2400
gtagctactg ctgacgcagg ctggcgtgaa tttggcagca tggattctgc aggcacggct 2460
ctagatgtat ctgcacgctg cggtgatagc ggcagctgta tccaacttac gcaagcacaa 2520
gtagatgcgc agtactgtaa ccgcgcgcaa atttttgcta gctggaacga ttggacaggc 2580
tgggacccgt tgccagaaga tacctctgac gatgcctgtg ctgaccccgt tatacctggt 2640
gcagtaacgt ggactggcat tgcaatgagc cttgggggtt ctacaacatc tgtttccggc 2700
aacattaccg agcaaacaga cagcaatatt acattcactg cagacggcgg taagtttgaa 2760
tcgagcaaac tttcaactta cttcgcttat caagaattga ctggcgactt tgtaattagc 2820
gccaaagcta aaaccattgg cttactgcgc gaaaacggca gctaccagtt ccctacaggc 2880
atattgatgt gtgtttgcga tgcggcagcg gcaacaactg gcttaatggg ccacgccagc 2940
ctcaatgaca ttacagttga tactactgtg aatttagttg ccacctacgg ccacattcaa 3000
accacagctg gtagctggaa taaaactgga acgactgacg taaccgctgg cgacaacctg 3060
tatatacagc tagagcgcgc aggtaatagt tataccgcac gctactcgac tgatggcggt 3120
gccacctata gcaacattgg tggcagctca tttacagaca cccttccaga cacacttaaa 3180
gtgggtttct tcgctacgcc taacaacacc ggtgagcaaa ctttcgttta cgaagatata 3240
caaatcactc agtaa 3255
<210> 117
<211> 391
<212> PRT
<213> Microbulbifer degradans
<400> 117
Met Gly Met Gly Thr Lys Ile Asn Phe Leu Leu Leu Gly Phe Ile Leu
1 5 10 15
Ser Ala Cys Ser Leu Ser Gly Cys Ala Asp Lys Ile Lys Arg Asn Thr
20 25 30
Pro Leu Thr Glu Thr Ala Leu Pro Ser Lys Lys Ile Leu Tyr Val Gln
35 40 45
Thr Glu Val Cys Asp Pro Pro Ser Gln Leu Thr Gly Ser Cys Tyr Asn
50 55 60
Ser Leu Gln Arg Ala Ile Asp Val Ala His Thr Val Pro Ser Ala Thr
65 70 75 80
His Val Thr Ile Glu Met Ala Ala Gly His Tyr His Glu Arg Ile Val
85 90 95
Leu Ser Arg Gly Asn Ile Asp Ile Val Gly Ala Gly Lys Asn Lys Thr
100 105 110
Tyr Val Gln Tyr Asn Leu Asn Ala Glu Gln Gly Lys Ala Tyr His Arg
115 120 125
Asp Gly Trp Gly Thr Pro Gly Ser Ala Thr Phe Thr Ile Asn Ala Ser
130 135 140
Glu Val Asn Val Ser Asp Leu Thr Ile Glu Asn Thr Phe Asp Phe Leu
145 150 155 160
Arg Asn Asp Ser Lys Asp Lys Thr Asp Pro Ser Lys Val Arg Ala Ser
165 170 175
Gln Gly Val Ala Leu Leu Leu Asp Glu His Ser Asp Lys Val Ala Leu
180 185 190
Tyr Arg Val Gly Leu Tyr Gly Tyr Gln Asp Thr Leu Phe Ala Asn Gly
195 200 205
Lys Arg Ala Phe Ile Tyr Gln Ser Asp Ile Ala Gly Asn Val Asp Phe
210 215 220
Ile Phe Gly Ala Gly Gln Val Val Ile Glu Asn Ser Arg Val Ile Ser
225 230 235 240
Arg Pro Arg Gly Lys Ala Ile Ala Ser Asn Glu Ile Ala Gly Tyr Ile
245 250 255
Thr Ala Pro Ser Thr Asn Ile Thr Asp Ala Phe Gly Leu Val Phe Ile
260 265 270
Asn Ser Arg Leu Glu Arg Glu Gln Gly Val Ala Asp Ala Ser Val Thr
275 280 285
Leu Gly Arg Pro Trp His Pro Thr Thr Asn Phe Ser Asp Gly Arg Tyr
290 295 300
Ala Asp Pro Asn Ala Ile Gly His Ala Leu Phe Phe Asn Cys Phe Met
305 310 315 320
Asp Ala His Ile His Pro Ala Arg Trp Ser Ser Met Lys Gly Thr Ala
325 330 335
Lys Asp Gly Ser Lys Thr Leu Val Phe Thr Pro Glu Gln Ser Arg Phe
340 345 350
Phe Glu Val Gln Ser Phe Gly Pro Ser Gly Asn Asp Glu Val Thr Thr
355 360 365
Ser Tyr His Ser Leu Ser Ala Asp Ser Leu Arg Glu Gln Ala Leu Gly
370 375 380
Asp Trp Asn Val Ser Ile Asn
385 390
<210> 118
<211> 1176
<212> DNA
<213> Microbulbifer degradans
<400> 118
gtgggtatgg gtacaaaaat taattttttg ctattaggtt ttattttgtc tgcatgttca 60
ttaagtggct gcgccgacaa aataaagcgc aatacgcctt taaccgaaac ggcattgccg 120
agtaaaaaaa tactctatgt acaaaccgaa gtatgtgacc cgccttcgca attaacgggt 180
agttgctaca acagcttgca gcgagctatt gacgtggcgc acacagtgcc gtctgctacc 240
catgtaacca ttgaaatggc tgcggggcat taccacgaac gcattgtgct tagccgtggc 300
aatatcgata ttgttggtgc aggtaaaaat aaaacctatg ttcaatacaa cctgaatgcc 360
gagcagggta aagcttatca ccgcgacggt tggggtactc ctggctcagc tacatttacc 420
attaatgcca gtgaagtaaa tgttagcgat ttaactatcg aaaatacttt cgacttttta 480
agaaatgatt caaaagataa aaccgaccct tcaaaagtga gggcatcgca aggcgttgca 540
ttattattgg atgaacacag cgataaggtt gcgctgtatc gagtaggcct atacggatac 600
caagacaccc tatttgcaaa tggaaagcgt gcatttatct accaatcaga tattgcaggc 660
aatgttgatt ttatttttgg cgctggccaa gtggttatag aaaatagtcg tgttatttct 720
aggccgcgcg gcaaagccat tgcttccaat gaaattgccg gctatatcac agcgccatcc 780
accaatatta cggacgcctt tggtctggtt tttattaata gtcgattaga acgtgagcaa 840
ggcgtggcag atgcgtcggt caccttgggt cgcccttggc accctacaac caatttcagc 900
gatggccgat atgccgaccc aaacgcgatt ggccatgcgc tattttttaa ctgctttatg 960
gatgcgcata ttcaccccgc gagatggtct agcatgaaag gcaccgctaa agacggcagt 1020
aaaacgctag tgtttacgcc cgagcaatcg cgtttttttg aagttcagtc ctttggcccc 1080
agcggcaacg atgaagtaac cacctcgtat cattcgttaa gcgccgactc attacgcgaa 1140
caagcgctcg gcgattggaa tgtatcaatt aactaa 1176
<210> 119
<211> 914
<212> PRT
<213> Microbulbifer degradans
<400> 119
Met Ser Ala Leu Thr Arg Pro Lys Phe Gly Ala His Thr Lys Leu Phe
1 5 10 15
His Ala Ile Lys His Ala Leu Thr Pro Val Ile Phe Leu Gly Ala Ala
20 25 30
Ala Phe Pro Leu Ala Ala His Ser Gln Tyr Asn Met Glu Asn Leu Asp
35 40 45
Arg Gly Leu Val Ala Ile Asp Arg Lys Asp Gly Ser Val Leu Val Ser
50 55 60
Trp Arg Trp Leu Gly Gln Glu Pro Asp Asn Thr Ser Phe Asn Val Tyr
65 70 75 80
Arg Asn Gly Thr Leu Leu Thr Ser Ser Pro Leu Thr Asn Lys Thr Asn
85 90 95
Phe Val Asp Thr Ser Gly Asn Pro Asn Ala Asn Tyr Ala Val Glu Ala
100 105 110
Ile Val Asn Gly Ala Ser Gln Ser Leu Ala Thr Thr His Val Trp Ser
115 120 125
Asp Ile Tyr Arg Thr Ile Pro Leu Gln Arg Pro Pro Gly Gly Thr Thr
130 135 140
Pro Asp Gly Val Ala Tyr Thr Tyr Ser Pro Asn Asp Ile Ser Ala Ala
145 150 155 160
Asp Leu Asp Gly Asp Gly Gly Tyr Glu Leu Ile Val Lys Trp Asp Pro
165 170 175
Ser Asn Ala Lys Asp Asn Ser Gln Ser Gly Tyr Thr Gly Asn Val Tyr
180 185 190
Leu Asp Ala Tyr Glu Ile Ser Gly Glu Phe Met Trp Arg Ile Asp Leu
195 200 205
Gly Arg Asn Ile Arg Ala Gly Ala His Tyr Thr Gln Phe Leu Ala Phe
210 215 220
Asp Phe Asp Ser Asp Gly Lys Ala Glu Val Ala Val Lys Thr Ala Asp
225 230 235 240
Ala Thr Lys Asp Ser Gln Gly Val Val Ile Gly Asp Ser Asn Ala Asp
245 250 255
Tyr Arg Asn Ser Ala Gly Tyr Val Leu Ser Gly Pro Glu Tyr Leu Thr
260 265 270
Met Phe Glu Gly Gln Thr Gly Arg Ala Leu Asn Thr Val Asn Tyr Val
275 280 285
Pro Ala Arg Gly Ser Val Ser Ser Trp Gly Asp Asn Tyr Gly Asn Arg
290 295 300
Val Asp Arg Phe Leu Gly Gly Val Ala Tyr Leu Asp Gly Gln Asn Pro
305 310 315 320
Ser Leu Ile Met Ser Arg Gly Tyr Tyr Thr Arg Thr Val Val Ala Ala
325 330 335
Trp Asp Trp Arg Asn Gly Gln Leu Ser Gln Arg Trp Val Phe Asp Ser
340 345 350
Asn Thr Ser Gly Asn Ser Ser Tyr Ala Gly Gln Gly Ala His Ser Leu
355 360 365
Thr Ile Gly Asp Val Asp Ala Asp Gly Lys Gln Glu Ile Val Phe Gly
370 375 380
Ala Met Thr Ile Asp Asp Asn Gly Thr Gly Leu Asn Asn Thr Arg Leu
385 390 395 400
Gly His Gly Asp Ala Leu His Leu Ser Asp Met Asp Pro Ser Asn Pro
405 410 415
Gly Leu Glu Val Phe Met Val His Glu Cys Pro Ser Cys Tyr Gly Glu
420 425 430
His Gly Ile Glu Met His Asp Ala Ala Thr Gly Gln Ile Leu Trp Ser
435 440 445
His Pro Gly Asp Tyr Ile Asp Ile Gly Arg Gly Val Ala Met Asp Ile
450 455 460
Asp Pro Arg Tyr Ala Gly Tyr Glu Ala Trp Ala Ser Arg Gly Gly Leu
465 470 475 480
Tyr Ser Ala Lys Gly Glu Thr Ile Ser Ser Thr Arg Pro Ser Gln Ile
485 490 495
Asn Phe Ala Ala Trp Trp Asp Gly Asp Leu Leu Arg Glu Ile Leu Asp
500 505 510
Asn Asn Tyr Ile Asn Lys Trp Asn Tyr Thr Ala Ser Ser Thr Thr Arg
515 520 525
Leu Leu Ser Ala Gly Asn Tyr Gly Ala Ala Ser Asn Asn Gly Thr Lys
530 535 540
Ala Thr Pro Gly Leu Ser Ala Asp Ile Leu Gly Asp Trp Arg Glu Glu
545 550 555 560
Val Val Trp Arg Asn Ser Asn Asn Gln Glu Leu Met Val Phe Thr Thr
565 570 575
Pro His Glu Ser Glu Tyr Arg Leu Arg Thr Leu Met His Asp Pro Gln
580 585 590
Tyr Arg Thr Ala Ile Ala Trp Gln Asn Val Gly Tyr Asn Gln Pro Pro
595 600 605
His Pro Ser Tyr Phe Leu Gly Ala Gly Met Thr Thr Pro Asn Gln Pro
610 615 620
His Ile Thr Ile Val Gly Glu Gly Thr Val Gln Pro Pro Ala Pro Thr
625 630 635 640
Gly Asp Ala Ile Gln Glu Asn Ala Thr Gly Phe Cys Gly Tyr Glu Gly
645 650 655
Thr Ile Asp Ser Asn His Ser Gly Tyr Thr Gly Ala Gly Phe Thr Asn
660 665 670
Thr Thr Asn Ala Thr Gly Ala Gly Ile Asn Trp Asn Leu His Ala Ser
675 680 685
Thr Ala Gly Thr Tyr Arg Leu Ser Met Arg Tyr Ala Asn Gly Ser Thr
690 695 700
Ala Arg Gly Ala Val Leu Asn Val Glu Thr Thr Gly Asn Ser Tyr Pro
705 710 715 720
Met Ala Phe Ala Pro Thr Ser Thr Trp Thr Asn Trp Gln Glu Glu Tyr
725 730 735
Val Asp Ala His Leu Asn Ala Gly Tyr Asn Ser Ile Arg Leu Glu Ala
740 745 750
Asn Gln Ala Ala Gly Leu Pro Asn Leu Asp Ala Ile Tyr Leu Ala Asp
755 760 765
Gly Leu Thr Ala Ala Ala Cys Gly Gln Thr Ser Ser Ser Ser Ser Ser
770 775 780
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
785 790 795 800
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Gly Thr Thr Ala
805 810 815
Gly Leu Ala Cys Ala Asn Gly Ser Thr Asp Thr Trp Gly Thr Gly Phe
820 825 830
Val Leu Asn Gly Phe Gln Val Glu Asn Glu Gly Gln Gln Ala Thr Asn
835 840 845
Asn Trp Gln Val Thr Leu Gln Phe Asp Gln Pro Val Asn Ile Thr Asn
850 855 860
Ala Trp Gly Val Asn Val Glu Thr Thr Gly Thr Thr Val Val Ala Thr
865 870 875 880
Ser Val Gly Tyr Asn Ser His Leu Asn Pro Gly Gln Ser Ala Ser Phe
885 890 895
Gly Met Gln Gly Thr Ser Ala Thr Ala Val Ser Asn Pro Leu Cys Ser
900 905 910
Ala Gln
<210> 120
<211> 82
<212> PRT
<213> Microbulbifer degradans
<400> 120
Met Ala Cys Trp Pro Ser Phe Ser Thr Trp Lys Pro Phe Asn Thr Asn
1 5 10 15
Pro Val Pro His Val Ser Val Leu Pro Leu Ala Gln Ala Arg Pro Ala
20 25 30
Val Val Pro Pro Asp Glu Leu Leu Leu Glu Leu Leu Leu Glu Leu Leu
35 40 45
Leu Glu Leu Glu Glu Leu Leu Glu Glu Leu Glu Glu Leu Leu Glu Asp
50 55 60
Glu Leu Leu Leu Asp Val Trp Pro His Ala Ala Ala Val Arg Pro Ser
65 70 75 80
Ala Arg
<210> 121
<211> 2745
<212> DNA
<213> Microbulbifer degradans
<400> 121
atgtcggcac tcactcgtcc caaatttggt gcacacacca aactgttcca cgctataaag 60
catgcgttaa cccccgttat atttttaggt gctgctgctt tcccacttgc cgctcacagc 120
caatacaaca tggaaaacct cgaccgcggg ttggtggcca tagatcgcaa agacggcagc 180
gtattagtaa gctggcgctg gttggggcaa gagccagata acaccagctt caacgtatat 240
agaaacggca cactattaac cagctccccc cttaccaata aaacaaactt tgtagatacc 300
agcggcaacc caaacgctaa ttacgcggta gaagccatag taaacggcgc gagccaatca 360
ctagccacca cacatgtatg gagcgatata taccgcacta ttccgctgca aagaccacca 420
ggtggcacca cccccgacgg agtggcatac acctatagcc ccaacgatat aagcgcagcc 480
gatttagacg gcgatggcgg ttacgagcta atcgtaaaat gggacccctc caacgcaaaa 540
gacaactcgc aaagtggcta caccggcaat gtgtatctcg acgcttacga aatatctggc 600
gagtttatgt ggcgcataga cctaggccgc aatattcgag caggcgccca ctacacgcaa 660
tttttagcat tcgattttga tagcgatggc aaagcagaag ttgcagttaa aaccgcagac 720
gccaccaaag acagccaagg agtagtaata ggagacagca atgccgatta ccgcaacagt 780
gcaggctatg ttttatctgg ccccgaatac ctcaccatgt ttgaaggcca aaccggtaga 840
gcgctaaata ccgttaacta cgtacccgcg cgcggcagtg tttctagctg gggcgacaac 900
tacggcaacc gtgttgatag atttttaggt ggcgtagcct acttagatgg ccaaaaccca 960
agtttgatta tgtcgcgcgg gtactacacc cgcaccgtgg tagcagcatg ggactggcgc 1020
aatgggcagc ttagccaacg ctgggtattt gattccaaca ccagtggcaa cagcagctat 1080
gcaggccaag gcgcgcacag ccttaccatt ggcgatgtag atgcagacgg aaaacaagaa 1140
attgtatttg gcgccatgac catagatgac aacggcaccg gcttgaacaa tacccgccta 1200
ggtcacggcg atgcactgca cctatccgac atggacccaa gcaacccagg cttagaagtg 1260
tttatggtgc acgaatgccc ctcttgctat ggcgagcacg gaatagaaat gcacgatgcc 1320
gccaccggcc aaatactatg gagccaccca ggcgactaca tagacatagg ccgaggtgta 1380
gccatggata tagacccacg ttatgccggc tacgaagctt gggcttcgcg cggtggttta 1440
tacagtgcaa aaggcgaaac aatatcgagc acgcgcccgt cgcaaataaa ctttgccgct 1500
tggtgggatg gcgacttact gcgcgaaata ctcgataaca attacataaa caaatggaac 1560
tacaccgcca gctctaccac gcgcttgcta agcgcaggta actacggtgc agcatcgaac 1620
aacggcacca aggctacacc agggctttcg gccgatattc ttggtgactg gcgcgaagaa 1680
gtggtatggc gcaacagcaa caaccaagaa cttatggtgt ttaccacccc acacgaaagt 1740
gaataccgct taagaaccct aatgcacgac ccgcaatacc gcacagccat agcttggcaa 1800
aacgtgggct acaaccaacc acctcacccc tcttactttt tgggtgcggg tatgactacc 1860
cccaaccaac cacacataac catagttggc gaaggcacag tgcagcctcc agcccctaca 1920
ggcgacgcaa tacaagaaaa cgccaccggc ttttgcggct acgaaggcac tatagatagc 1980
aaccactctg gctatacagg cgcgggcttt actaacacta ccaacgcaac aggtgcaggc 2040
attaactgga acctacatgc gtccaccgcc ggtacatacc gcctaagcat gcgttacgcg 2100
aatggcagta cggcacgagg tgctgtgcta aacgtagaaa caaccggtaa tagctacccc 2160
atggcatttg caccaacaag cacatggaca aactggcaag aagaatatgt agacgcccac 2220
ctaaacgccg gctacaacag cattcggctt gaagcaaacc aagcggcagg tttgcccaac 2280
ctcgatgcca tctacctagc cgatggtctt acggcggcag cgtgcggcca aacatctagc 2340
agcagctcgt cctccagtag ctcctctagt tcttccagca gctcttcaag ctctagtagc 2400
agttccagta gtagctcgag cagcagctca tccggcggta caacagcagg cctagcttgc 2460
gccaacggca gtacagatac atggggaacg gggtttgtgt taaacggctt ccaagtagaa 2520
aacgaaggcc agcaagccac caataactgg caagtgacac ttcaattcga ccagcccgta 2580
aacataacca acgcatgggg tgtaaacgta gaaacaacag gcacaaccgt tgtagcaaca 2640
agcgtaggct acaacagcca cctaaacccg gggcaaagtg ctagctttgg aatgcaagga 2700
acatcggcaa cggcggtaag caacccgcta tgcagtgccc agtaa 2745
<210> 122
<211> 789
<212> PRT
<213> Microbulbifer degradans
<400> 122
Met Arg Ser Leu Ala Pro Ile Lys Ile Arg Glu Lys Ile Arg Glu Thr
1 5 10 15
Leu Met Phe Asn Ile Arg Ala Trp Gln Leu Asp Leu Pro Leu Ala Leu
20 25 30
Leu Ala Phe Ser Ser Thr Ser Tyr Ala Ile Asp Asn Gly Thr Tyr Thr
35 40 45
Ile Gln Ser Lys His Ser Gly Lys Val Val Glu Val Ala Ala Gly Ser
50 55 60
Val Asp Asp Ala Ala Asn Val Ala Gln Trp Pro Ser Asn Gly His Pro
65 70 75 80
Thr Gln Gln Trp Ile Ile Thr Gln Ile Ser Gly Asp Asp Tyr Ser Val
85 90 95
Ile Asn Val Asn Ser Gly Lys Ala Met Glu Val Tyr Asp Phe Gly Thr
100 105 110
Thr Asp Gly Gly Asn Ile Val Gln Tyr Pro Tyr Trp Gly Gly Ala Pro
115 120 125
Gln Leu Trp Thr Ile Thr Asp Gln Gly Gly Tyr Tyr Ser Leu Ile Asn
130 135 140
Lys His Ser Gly Lys Ala Leu Asp Leu Leu Asn Trp Asp Thr Thr Asp
145 150 155 160
Gly Ala Asn Ile Gly Gln Trp Ala Trp Trp Gly Gly Asp Ala Gln Leu
165 170 175
Trp Ala Leu Asn Thr Val Gln Pro Ser Thr Val Thr Phe Thr Leu Glu
180 185 190
Glu Asn Gln Ala Gly Phe Cys Ser Val Asp Gly Ser Ile Asp Ser Asn
195 200 205
His Thr Gly Tyr Thr Gly Ser Gly Phe Ala Asn Thr Thr Asn Ala Asn
210 215 220
Gly Gln Gly Val Asn Trp Ser Val Asn Val Ala Thr Ala Gly Thr Tyr
225 230 235 240
Thr Phe Thr Trp Arg Tyr Ala Gly Thr Ser Asn Arg Pro Ala Asn Leu
245 250 255
Leu Ile Asp Gly Ser Thr Gln Val Ser Gly Ile Ala Leu Asn Ser Thr
260 265 270
Gly Ala Trp Ala Thr Trp Ala Asn Ser Ala Glu Ile Ser Val Trp Leu
275 280 285
Asp Thr Gly Val His Ser Leu Arg Leu Gln Ala Thr Thr Ser Ala Gly
290 295 300
Leu Pro Asn Ile Asp Ser Leu Ser Ile Thr Gly Gln Ser Ala Ala Ala
305 310 315 320
Gly Asn Cys Ser Gly Ala Ile Glu Pro Ile Thr Phe Ala Thr Pro Ser
325 330 335
Phe Thr Asn Ile Ala Val His Asp Pro Ser Val Ile Glu Ala Asn His
340 345 350
Gln Tyr Tyr Val Phe Gly Ser His Leu Ser Val Ala Lys Thr Pro Asp
355 360 365
Leu Lys Asn Trp Ser Arg Val Ala Asp Gly Val Thr Thr Asn Asn Pro
370 375 380
Leu Phe Asn Asp Val Thr Ser Glu Leu Ala Glu Ala Leu Ala Trp Ala
385 390 395 400
Glu Thr Thr Thr Leu Trp Ala Pro Asp Val Thr Tyr Val Asn Gly Arg
405 410 415
Tyr Leu Met Tyr Tyr Asn Ala Cys Arg Gly Asp Ser Pro Leu Ser Ala
420 425 430
Met Gly Ile Ala Ser Ser Asn Asn Ile Glu Gly Pro Tyr Thr Asn Asp
435 440 445
Gly Ile Phe Leu Lys Ser Gly Met Trp Gly Gln Thr Ser Glu Asp Gly
450 455 460
Thr Val Tyr Asp Ala Thr Val His Pro Asn Ala Val Asp Pro Val Ile
465 470 475 480
Phe Ser Asp Ala Asn Asn Arg Met Trp Met Thr Tyr Gly Ser Tyr Ser
485 490 495
Gly Gly Ile Phe Ile Met Glu Leu Asn Pro Ser Thr Gly Phe Pro Tyr
500 505 510
Ala Gly Gln Gly Tyr Gly Lys His Leu Met Gly Gly Asn His Ala Arg
515 520 525
Ile Glu Gly Ala Tyr Thr Ile Tyr Ser Pro Glu Thr Gly Tyr Tyr Tyr
530 535 540
Met Tyr Val Ser Tyr Gly Gly Leu Gly Ala Asp Gly Gly Tyr Asn Val
545 550 555 560
Arg Val Ala Arg Ala Thr Ser Pro Asp Gly Pro Tyr Tyr Asp Ala Asn
565 570 575
Gly Thr Asn Met Ala Asn Val Lys Ser Asn Pro Ser Leu Pro Leu Phe
580 585 590
Asp Asp Ala Ser Ile Ala Pro His Gly Val Lys Leu Met Gly Asn His
595 600 605
Val Phe Ser Gly Thr Asn Asn Val Leu Gly Tyr Val Ser Pro Gly His
610 615 620
Asn Ser Ala Tyr Arg Asp Ala Thr Thr Gly Gln Thr Phe Leu Leu Phe
625 630 635 640
His Thr Arg Phe Pro Gly Arg Gly Glu Glu His Glu Val Arg Val His
645 650 655
Glu Val Phe Tyr Asn Asp Ala Gly Trp Pro Val Ile Ala Pro Leu Arg
660 665 670
Tyr Ala Gln Lys Val Asp Ala Asn Asn Pro Asn Arg Ser Ala Ser Glu
675 680 685
Leu Asn Ala Val Tyr Ala Ser Glu Leu Pro Gly Ser Tyr Gln Leu Ile
690 695 700
Asn His Gly Lys Asp Ile Ser Ala Thr Ile Lys Asn Ser Val Asn Ile
705 710 715 720
Thr Leu Asn Ser Asn Gly Ser Ile Ser Gly Glu Leu Ser Gly Ser Trp
725 730 735
Thr Tyr Asn Ala Asn Thr Arg Asn Thr Val Ile Thr Val Ala Gly Val
740 745 750
Ala Tyr Arg Gly Val Val Ser Arg Gln Trp Asn Gln Ala Arg Asn Arg
755 760 765
Phe Glu Val Thr Phe Ser Ala Leu Ser Ala Asp Gly Thr Ala Ile Trp
770 775 780
Gly Val Asn Ser Asp
785
<210> 123
<211> 2370
<212> DNA
<213> Microbulbifer degradans
<400> 123
gtgcgcagtc tagcccccat aaaaataaga gagaaaataa gagagacact catgtttaat 60
atacgcgctt ggcaacttga cttgcccttg gcgttattgg cgttttcgtc tacaagttac 120
gctatcgata acggcactta cacaattcaa tctaaacaca gtggcaaggt tgtagaagtg 180
gccgcaggca gtgtagatga tgctgcaaat gtggcccaat ggcccagtaa tggccatcct 240
acccagcagt ggataattac ccaaattagc ggcgatgatt actcggtaat aaatgtaaat 300
agcggcaaag ctatggaggt atacgacttc ggcaccacgg acggcggcaa catagtgcaa 360
tacccctact ggggaggcgc cccccagttg tggacaatta cagatcaagg cggttattac 420
agcttaataa acaaacacag tggtaaagct ttagatttgt taaattggga taccacagac 480
ggcgccaata taggccagtg ggcttggtgg ggcggcgatg cgcaactgtg ggcactgaac 540
acagtgcaac ccagcacagt caccttcaca cttgaagaaa accaagcggg tttttgcagc 600
gtagatggca gcatagatag caatcatacg ggctataccg gcagcgggtt tgccaataca 660
accaatgcaa atggccaagg agttaattgg tcggtaaatg tagccacagc tggtacctat 720
acgtttacat ggcgctatgc gggcactagc aaccgcccag ccaacttgct aatagatggc 780
agcacacagg tttcgggcat tgctttaaat tcaaccggcg catgggcaac ttgggctaat 840
agtgcagaaa taagtgtttg gcttgacaca ggtgtgcact cacttcggct gcaagccaca 900
accagcgcgg gcttacctaa tatagattcg cttagcataa ccggccaaag cgcagcagcg 960
ggcaactgca gcggcgcaat tgaacctatc acttttgcca caccaagctt taccaatata 1020
gcggtgcacg acccctcggt aatagaagca aaccatcaat actacgtatt tggctcccac 1080
ctttctgtgg ctaaaacgcc cgacctaaaa aactggtcgc gcgtggcgga tggtgtaacc 1140
accaataacc cattgtttaa cgatgtaacc agcgaacttg cagaagcatt agcttgggca 1200
gaaaccacta ccctgtgggc gccagatgtt acctatgtaa atggtcggta tttgatgtat 1260
tacaacgcct gccgtggcga ctcaccactg tcggctatgg gtattgcttc ttcgaacaac 1320
atagaaggcc cttacactaa cgatggtata ttccttaaat ctggaatgtg gggccaaacc 1380
agcgaagatg gcactgtgta cgacgcaacc gtgcacccca atgctgtgga ccccgttatt 1440
tttagcgacg caaataatcg catgtggatg acctacggtt cgtattcggg tggtattttt 1500
attatggagt taaacccatc cacggggttc ccttacgcgg ggcaaggtta tggtaaacat 1560
ttaatgggtg gcaaccacgc gcgcattgaa ggcgcttaca ccatctacag cccagaaacg 1620
ggctactact atatgtatgt aagctacggt ggcctaggcg cagatggcgg ctataacgtt 1680
cgtgtggccc gagcaactag cccagacggc ccctactatg atgccaacgg caccaatatg 1740
gccaacgtaa aaagcaaccc aagcttgcca ctgttcgacg acgccagcat agcaccccac 1800
ggtgtaaaac ttatgggtaa ccacgtgttt agcggcacta acaatgtact tggttacgta 1860
tcaccagggc acaactctgc ataccgtgac gccactaccg gccaaacatt tttactattc 1920
cacacacgct tccctgggcg cggcgaagag catgaagtgc gagtgcatga agtgttctac 1980
aacgatgcag gctggccggt aatagcacca ttgcgctatg cccaaaaagt agatgccaac 2040
aacccaaata gaagtgcgag cgagctaaat gcagtgtacg caagcgaact gccaggtagc 2100
tatcagctaa ttaaccacgg caaagacata agcgcgacaa ttaaaaattc cgttaacatc 2160
acgctaaaca gcaacggcag tatctctggc gagctatctg gcagctggac atacaacgcc 2220
aacacccgca ataccgtaat caccgttgcc ggtgtggcct accgcggcgt ggtatctcgc 2280
caatggaacc aagcacgcaa ccgcttcgaa gtcaccttca gcgccctatc tgcagacggc 2340
acagcaatat ggggggtgaa cagcgactaa 2370
<210> 124
<211> 362
<212> PRT
<213> Microbulbifer degradans
<400> 124
Met Ser Ser Phe Ile Met Asp Lys Ser Gln Leu Gln Ser Gly Phe Ala
1 5 10 15
Phe Lys Thr Ser Gly Phe Asn Val Leu Ile Val Val Thr Phe Leu Ala
20 25 30
Leu Leu Ala Ala Leu Val Gly Cys Ser Ser Ala Lys Leu Ala Pro Val
35 40 45
Ala Ser Pro Ser Leu Pro Gln Pro Leu Val Ala Gln Arg Ala Asp Pro
50 55 60
Trp Val His Lys His Ser Asp Gly Tyr Tyr Tyr Phe Ile Ala Thr Val
65 70 75 80
Pro Ala Tyr Asp Arg Leu Glu Met Arg Arg Ala Thr Thr Ile Ala Gly
85 90 95
Leu Arg Ser Ala Pro Ala Val Val Val Trp Gln Arg Asn Thr Ile Gly
100 105 110
Gly Met Ser Ala Asn Ile Trp Ala Pro Glu Leu His Phe Ile Asp Gly
115 120 125
Lys Trp Tyr Ile Tyr Val Ala Ala Ala Thr Asp His Asn Lys Pro Trp
130 135 140
Thr Ile Arg Met His Thr Leu Ser Asn Ala Ser Ala Asn Pro Met Gln
145 150 155 160
Gly Glu Trp Gln Glu Glu Gly Arg Phe His Thr Pro Leu Asp Thr Phe
165 170 175
Ser Leu Asp Ala Thr Thr Phe Glu His Arg Gly Lys Arg Tyr Leu Val
180 185 190
Trp Ala Gln Gln Asn Glu Ala Arg Thr Tyr Asn Ser Ala Leu Leu Ile
195 200 205
Ala Gln Met Asp Ser Pro Thr Ser Ile Thr Gly Pro Ile Val Thr Leu
210 215 220
Ser Glu Pro Thr Leu Pro Trp Glu Ile Ile Gly His Lys Val Asn Glu
225 230 235 240
Gly Ala Ala Val Ile Lys His Gly Lys Arg Ile Phe Ile Ser Tyr Ser
245 250 255
Ala Ser Ala Thr Asp His Asn Tyr Ala Met Gly Leu Leu Trp Ala Asp
260 265 270
Glu Asn Ala Asp Leu Leu Asp Ala Ala Ser Trp Thr Lys Ser Pro Glu
275 280 285
Pro Val Phe Tyr Ser Asn Glu Gln Leu Lys Arg Phe Gly Pro Gly His
290 295 300
Asn Cys Phe Val Lys Ala Glu Asp Gly Val Thr Asp Leu Met Val Tyr
305 310 315 320
His Ala Arg Asp Tyr Lys Glu Ile Asp Gly Glu Pro Leu Arg Asp Pro
325 330 335
Asn Arg His Thr Arg Val Arg Lys Val Tyr Trp Asp Glu Gln Gly Met
340 345 350
Pro Asp Phe Arg Gln His Glu Ala Asp Leu
355 360
<210> 125
<211> 1089
<212> DNA
<213> Microbulbifer degradans
<400> 125
atgagtagct tcataatgga taaaagccag ctacaaagtg gttttgcgtt taaaacaagt 60
ggttttaacg tgctgattgt tgtcacattt ttggcattgc ttgcagcgct tgttggctgc 120
agcagcgcca agctcgcacc cgtcgcctct cctagtttac cgcagccatt agtggcccaa 180
cgggcagacc cttgggtgca caagcacagc gatggttatt actactttat agcaacggta 240
ccagcatacg accgcttaga aatgcgtagg gccacaacca tagcaggctt acgtagcgcg 300
cccgctgtag tggtatggca gcgcaatact attggaggta tgagcgcgaa tatttgggcg 360
cccgagctgc attttattga tggtaaatgg tacatctatg tagcggctgc caccgatcac 420
aacaagccgt ggacaattcg tatgcacacg ctttccaatg catcggccaa ccctatgcaa 480
ggtgagtggc aagaagaggg gcgctttcat acaccgctag atactttctc gctagatgcc 540
acaacctttg agcacagggg taaacgctat ttagtatggg cgcaacagaa tgaagcccgt 600
acttataact cggcgttact tatagcgcaa atggatagcc ctacaagtat tactggcccc 660
attgttacct taagtgaacc gacattaccg tgggaaatta ttggccataa ggttaatgag 720
ggtgcggcag taattaaaca cggtaagcgt atttttataa gttattccgc cagtgcgacc 780
gatcataact atgcgatggg tttgttatgg gcagacgaaa acgctgattt gctcgacgca 840
gcaagctgga ccaagtcacc cgagcctgta ttttactcaa acgaacaatt aaagcgcttt 900
ggccctggcc ataattgttt tgttaaagct gaagatggtg ttaccgattt aatggtgtac 960
cacgcgcgtg attataaaga gatagatggt gagccattgc gagacccaaa ccgccacacg 1020
cgggtgcgca aagtgtattg ggatgaacag ggcatgccgg attttcgtca acatgaagca 1080
gacctatag 1089
<210> 126
<211> 314
<212> PRT
<213> Microbulbifer degradans
<400> 126
Met Lys Lys Leu Ser Pro Leu Ile Glu Gln Arg Ala Asp Pro Tyr Ile
1 5 10 15
Tyr Lys His Thr Asp Gly Tyr Tyr Tyr Phe Thr Ala Ser Val Pro Ala
20 25 30
Tyr Asp Gly Ile Glu Leu Arg Arg Ala Lys Thr Ile Gln Ala Leu Ala
35 40 45
Thr Ala Glu Thr Val Met Val Trp Arg Lys Pro Ser Glu Gly Asp Tyr
50 55 60
Ser Glu Leu Ile Trp Ala Pro Glu Ile His Phe Asn Met Gly Ala Trp
65 70 75 80
Tyr Val Tyr Phe Ala Ala Ala Pro Ser Arg Glu Ile Lys Phe Asp Leu
85 90 95
Phe Gln His Arg Met Tyr Ala Ile Ser Cys Ser Asp Ala Asn Pro Leu
100 105 110
Thr Gly Glu Trp Ile Phe Glu Gly Lys Ile Asp Ser Gly Ile Asp Ala
115 120 125
Phe Cys Leu Asp Ala Thr Thr Phe Thr His Ser Asn Glu Leu Tyr Tyr
130 135 140
Val Trp Ala Gln Lys Glu Leu Asp Val Arg Gly Asn Ser Asn Leu Met
145 150 155 160
Ile Ala Lys Met Glu Thr Pro Thr Lys Leu Ala Thr Lys Pro Val Arg
165 170 175
Leu Ser Lys Pro Glu Tyr Asp Trp Glu Ile Gln Gly Phe Trp Val Asn
180 185 190
Glu Gly Pro Ser Ile Val Lys His Gly Ser Arg Ile Phe Ile Ser Tyr
195 200 205
Ser Gly Ser Ala Thr Asp Glu Arg Tyr Ala Met Gly Ile Leu Trp Ala
210 215 220
Glu Gln Ser Ala Asp Leu Leu Asp Pro Ala Ser Trp Thr Lys Ser Val
225 230 235 240
Glu Pro Val Leu Val Ser Glu Pro Ser Glu Lys Val Phe Gly Pro Gly
245 250 255
His Asn Ser Phe Thr Val Asp Glu Glu Gly Asn Asp Met Leu Val Tyr
260 265 270
His Ala Arg Asn Tyr Thr Glu Ile Glu Gly Asp Pro Leu Trp Asp Pro
275 280 285
Asn Arg His Thr Tyr Val Lys Lys Leu Arg Trp Asp Glu Thr Gly Met
290 295 300
Pro Ile Phe Gly Ser Pro Ala Phe Glu Glu
305 310
<210> 127
<211> 945
<212> DNA
<213> Microbulbifer degradans
<400> 127
gtgaaaaaat tatcgcctct tatagagcaa cgtgcagacc cttatattta taaacacacc 60
gacggctact attattttac ggcttcggtg cccgcctatg atggcataga actgcgtcgc 120
gcaaaaacta tacaagcgtt agccaccgca gaaaccgtta tggtgtggcg caagccaagc 180
gaaggtgatt atagtgagct tatttgggcg ccagaaatac actttaatat gggggcttgg 240
tatgtatatt ttgctgcggc tccatcacgt gaaattaagt tcgatttatt tcaacaccgc 300
atgtatgcca ttagctgtag cgatgccaac ccgctaacag gtgaatggat atttgaaggt 360
aaaatagata gcggcataga tgcattctgt ttagatgcca ccacctttac tcacagcaat 420
gagctctact atgtttgggc gcaaaaagaa ttagatgttc gcggcaactc taatttgatg 480
atcgcaaaaa tggaaacgcc caccaagctt gccaccaagc ccgtgcgttt atctaaaccc 540
gaatacgact gggagattca gggtttttgg gttaacgaag gcccatccat tgttaagcac 600
ggctcacgta tttttatttc ttattctggc tctgctaccg atgagcgcta cgcaatgggt 660
attttgtggg cagaacaaag cgcagactta ctagacccag caagttggac caagtcggta 720
gagcctgtat tagtttctga accctctgaa aaagtatttg gcccaggcca caatagtttt 780
actgtggatg aagagggtaa cgatatgttg gtgtatcatg ctcgcaatta taccgaaatt 840
gaaggcgacc cgctgtggga cccaaatcgt catacttacg ttaaaaaatt gcgctgggat 900
gaaacaggca tgcctatttt tggcagccct gcgtttgaag agtag 945
<210> 128
<211> 350
<212> PRT
<213> Microbulbifer degradans
<400> 128
Asn Gly Tyr Tyr Ala Val Leu Asn Lys His Ser Gly Lys Ala Leu Asp
1 5 10 15
Leu Tyr Gly Phe Asp Thr Ser Asn Gly Ala Asn Ile Ala Gln Trp Ala
20 25 30
Phe Trp Gly Gly Asp Pro Gln Gln Trp Gln Phe Thr Lys Ile Ala Asn
35 40 45
Val Gly Ala Pro Pro Val Asp Thr Ser Thr Thr Asn Gly Ala Thr Asn
50 55 60
His Trp Ser Leu Thr Gly Asn Leu Val Thr His Asp Pro Thr Met Ala
65 70 75 80
Tyr Glu Asn Gly Ser Trp Trp Leu Tyr Gln Thr Gly Glu Gly Ile Tyr
85 90 95
Gly Lys Tyr Ser Ala Asn Gly Leu Ala Trp Asp Gly Leu Pro Ser Val
100 105 110
Phe Pro Asn Gly Leu Ser Trp Trp Lys Thr Tyr Val Pro Gly Gln Ser
115 120 125
Asn Asn Asp Val Trp Ala Pro Asp Val Arg Thr Tyr Asn Gly Arg Val
130 135 140
Tyr Leu Tyr Tyr Ser Ile Ser Thr Phe Gly Ser Arg Val Ser Ala Ile
145 150 155 160
Gly Leu Ala Ser Ala Ser Ser Leu Ala Ala Ser Asp Trp Gln Asp His
165 170 175
Gly Leu Val Ile Asn Thr Thr Ser Ser Ser Asp Trp Asn Ala Ile Asp
180 185 190
Pro Asp Leu Val Val Asp Glu His Gly Asn Pro Trp Leu Thr Met Gly
195 200 205
Ser Trp Asn Ser Gly Ile Lys Val Met Arg Leu Asn Pro Ile Thr Met
210 215 220
Lys Pro Ile Gly Thr Leu Tyr Ser Ile Ala Gln Lys Gly Gly Gly Ile
225 230 235 240
Glu Ala Pro Ser Ile Val Tyr Arg Arg Gly Tyr Tyr Tyr Leu Phe Val
245 250 255
Ser Ile Gly Lys Cys Cys Ala Gly Val Asp Ser Thr Tyr Gln Ile Ala
260 265 270
Tyr Gly Arg Ser Thr Ser Ile Thr Gly Pro Tyr Leu Asp Lys Asn Gly
275 280 285
Asn Asp Met Met Ser Gly Gly Gly Ser Ile Leu Asp Ala Gly Asn Asn
290 295 300
Val Trp Val Gly Pro Gly Gly Gln Asp Ile Ile Asn Thr Asp Val Ile
305 310 315 320
Val Arg His Ala Tyr Asp Ala Thr Asp Ala Gly Thr Pro Lys Met Ile
325 330 335
Ile Ser Thr Leu Asn Trp Asp Ala Asn Gly Trp Pro Lys Tyr
340 345 350
<210> 129
<211> 1053
<212> DNA
<213> Microbulbifer degradans
<400> 129
aatggctatt atgccgtgct aaataaacac agcggcaaag cgttagattt gtatggtttt 60
gatacgtcta acggcgcgaa tattgcgcaa tgggcctttt ggggcgggga cccgcagcag 120
tggcaattta ccaaaatcgc caatgtaggt gcgccgccag tagatacatc taccaccaac 180
ggtgcaacca accactggtc cttaaccggt aatctagtga ctcacgaccc cacaatggcc 240
tacgaaaacg gctcatggtg gttgtatcaa accggcgagg gaatttacgg taagtattca 300
gccaatggtt tggcgtggga tggcttacct tctgtgtttc ccaatggttt aagttggtgg 360
aagacctatg tacccggcca gtcgaacaac gatgtatggg cgcctgatgt acgcacttat 420
aatgggcggg tttatttgta ctattccatc tctacttttg gctcgcgtgt atctgccatt 480
ggtttggcgt cggcatcgag tttggctgcg agtgattggc aggaccacgg cttagtaatt 540
aataccacct catctagcga ttggaatgcg atcgacccag atttagtggt cgatgagcat 600
ggcaaccctt ggttaacaat gggaagttgg aacagcggta ttaaagtgat gcgcttgaac 660
cccattacca tgaagccaat tggcacactt tattctattg cgcaaaaggg cggcggtatt 720
gaagcgcctt ctattgtgta tcgccgtggg tattactatt tatttgtttc tatcggcaaa 780
tgctgtgcgg gcgtagatag cacctatcaa attgcttacg ggcgctctac aagtattacc 840
ggcccttatt tggataagaa cggcaacgat atgatgagtg gtggtggcag tattttagat 900
gcgggcaaca acgtgtgggt tggccctggt gggcaagata ttattaacac cgatgtcatt 960
gtgcgccacg cgtacgatgc cacagatgca ggcacaccta agatgattat tagtaccttg 1020
aattgggatg ctaatggatg gccgaaatac tag 1053
<210> 130
<211> 346
<212> PRT
<213> Microbulbifer degradans
<400> 130
Met Leu Asn Lys Asn Lys Arg Pro Ile Thr Phe Ala Leu Val Val Ser
1 5 10 15
Leu Leu Ala Leu Leu Ala Leu Ala Gly Cys Ser Glu Ala Lys Gln Val
20 25 30
Ser Ile His Asp Pro Val Met Ile Lys Glu Gly Asp Thr Tyr Tyr Leu
35 40 45
Phe Ser Thr Gly Pro Gly Ile Thr Met Tyr Ser Ser Ser Asp Met Lys
50 55 60
Asn Trp Arg Arg Glu Gly Glu Val Phe Asn Gln Ala Pro Ser Trp Ala
65 70 75 80
Ser Asn Ala Val Pro Tyr Phe Lys Gly His Leu Trp Ala Pro Asp Ile
85 90 95
Ile Glu Lys Asp Gly Leu Phe Tyr Leu Tyr Tyr Ser Val Ser Ala Phe
100 105 110
Gly Lys Asn Thr Ser Gly Ile Gly Val Thr Val Ser Pro Thr Leu Asn
115 120 125
Pro Arg Ala Pro Asn Tyr Gly Trp Gln Asp Lys Gly Met Val Leu Arg
130 135 140
Ser Val Pro Glu Arg Asp Glu Trp Asn Ala Ile Asp Pro Asn Ile Val
145 150 155 160
Val Asp Asn Asn Gly Thr Ala Trp Met Ala Phe Gly Ser Phe Trp Gln
165 170 175
Ser Leu Lys Met Val Ala Leu Asp Ser Ser Trp Thr Lys Ile Ala Glu
180 185 190
Pro Gln Gln Trp His Thr Ile Ala Ala Leu Pro Lys Gly Ser Met Pro
195 200 205
Thr Gly Asp Ala Val Lys Asp Gly Glu Ile Glu Ala Pro Phe Ile Phe
210 215 220
Lys Lys Asn Asp Asp Tyr Phe Leu Phe Val Ser Trp Gly Lys Cys Cys
225 230 235 240
Arg Lys Asp Glu Ser Thr Tyr Arg Leu Ala Met Gly Arg Ser Lys Asn
245 250 255
Thr Thr Gly Pro Phe Leu Asp Lys Asn Gly Lys Asp Leu Ala Gln Gly
260 265 270
Gly Gly Thr Leu Leu Ile Ser Gly Asn Lys Asn Trp Pro Gly Leu Gly
275 280 285
His Asn Ser Ala Tyr Thr Phe Asp Gly Lys Asp Trp Leu Val Leu His
290 295 300
Ala Tyr Glu Ser Ala Asp Asn Gly Leu Gln Lys Leu Lys Ile Leu Glu
305 310 315 320
Ile Asn Trp Asp Lys Asp Gly Trp Pro Thr Val Asp Thr Lys Glu Leu
325 330 335
Asp Glu Phe Val Ser Ile Glu Leu Thr Gln
340 345
<210> 131
<211> 1041
<212> DNA
<213> Microbulbifer degradans
<400> 131
atgcttaaca aaaacaaacg cccaattaca ttcgctttag tcgttagcct cttagccctg 60
cttgcccttg caggctgcag cgaggcaaaa caagtaagca tccacgaccc agtaatgatt 120
aaagaaggtg acacctacta cttgtttagc actggccccg gcataacaat gtatagctct 180
agcgatatga aaaactggcg ccgcgaaggc gaagtattta atcaagcccc tagttgggcc 240
tccaacgccg taccctattt taaaggccac ctgtgggcac ccgacatcat tgaaaaagat 300
ggtctgtttt acctctacta ttctgtgtct gcttttggaa agaacacatc cggcattggc 360
gttaccgtat cgcccacgct taacccacgc gcgcccaatt acggttggca agataaaggc 420
atggtattgc gcagcgtgcc tgagcgcgac gagtggaacg ctatcgaccc caatattgtg 480
gtagataaca acggcaccgc atggatggct tttggctcct tttggcaaag cttaaaaatg 540
gtggcactag acagcagctg gacaaaaata gctgagcctc aacagtggca taccatagca 600
gccttaccca aaggcagtat gcccacaggc gacgcagtaa aggacggcga aatagaagct 660
ccttttattt ttaaaaagaa cgacgattac tttttgtttg taagttgggg taaatgctgc 720
cgcaaagatg aaagcaccta ccgcctagca atgggccgca gcaaaaatac taccggtcca 780
ttcttagata aaaacggcaa agacctcgcc caaggtggtg gcaccctatt aataagtggc 840
aacaaaaact ggcccggctt aggccacaac agcgcctaca ccttcgacgg caaagattgg 900
cttgtgctac acgcctatga atctgcagat aacggtttac aaaaactaaa aatattagaa 960
ataaactggg ataaagacgg ctggccaact gtagatacca aagaactgga tgagtttgtt 1020
agtattgaat taactcaata a 1041
<210> 132
<211> 665
<212> PRT
<213> Microbulbifer degradans
<400> 132
Met Asp Asn Ile Met Lys Met Ile Ile Lys Leu Ala Leu Ala Val Thr
1 5 10 15
Leu Ala Val Trp Val Ala Gly Cys Thr Asn Gln Ala Gly Leu Asn Ala
20 25 30
Glu Asn Lys Asn Ile Glu Arg Gln Thr Ile Asn Ser Pro Asp Lys Ser
35 40 45
Leu Lys Val Arg Leu Ser Leu Asp Glu Ser Gly Lys Val Phe Tyr Ser
50 55 60
Ile Ser Arg Asn Gly Glu Gln Val Met Leu Pro Ser Gln Leu Gly Val
65 70 75 80
Glu Leu Asn Ser Gln Ala Phe Thr Asp Gly Leu Thr Ile Thr Asp Val
85 90 95
Asp Ala Gly Lys Val Asn Asp Ser Tyr Thr Leu Leu His Gly Lys Gln
100 105 110
Arg Asp Ile Thr Tyr Asn Ala Asn Glu Lys Ile Tyr Ser Leu Lys Asn
115 120 125
Lys Gln Gly Asp Lys Leu Ile Ile Ala Phe Arg Val Ser Asn Asp Gly
130 135 140
Val Ala Phe Gln Tyr Arg Phe Pro Asn Thr Ala Lys Gln Leu Leu Ala
145 150 155 160
Val Lys Lys Glu Ile Thr Ser Phe Ala Phe Glu His Thr Thr Lys Ala
165 170 175
Trp Leu Gln Pro Ile Ala Val Ala Gln Thr Gly Trp Ala Asn Thr Asn
180 185 190
Pro Ser Tyr Glu Glu His Tyr Gln Met Asn Ile Pro Val Asp Thr Val
195 200 205
Ser Pro Ser Pro Ala Gly Trp Val Phe Pro Ala Leu Phe Lys Ala Asn
210 215 220
Lys His Trp Leu Leu Ile Thr Glu Ala Gly Met Asn Gly Asp Tyr His
225 230 235 240
Ala Ser Arg Leu His Ala Glu Ser Pro Asn Gly Glu Tyr Ser Leu Gly
245 250 255
Ile Pro Met Ala Ala Glu Val Phe Glu Gln Asp Gly Asn Lys Gly Ala
260 265 270
Leu Leu Ala Gln Ser Asn Thr Ala Phe His Ser Pro Trp Arg Val Ile
275 280 285
Leu Val Gly Gly Leu Asp Thr Ile Ile Ala Ser Thr Leu Gly Thr Asp
290 295 300
Leu Ala Asp Pro Ala Ile Ala Lys Met Asp Phe Val Lys Pro Gly Thr
305 310 315 320
Ala Ser Trp Ser Trp Ala Leu Leu Lys Asp Glu Ser Val Asn Tyr Glu
325 330 335
Thr Ser Leu Glu Phe Ile Asp Tyr Ala Ala Glu Met Gly Trp Asp Tyr
340 345 350
Thr Leu Val Asp Ala Asp Trp Asp Arg Arg Ile Gly Tyr Glu Arg Thr
355 360 365
Ala Gln Leu Ala Ala Tyr Ala Gln Ser Lys Asn Val Gly Leu Leu Val
370 375 380
Trp Tyr Asn Ser Ser Gly Asp Trp Asn Thr Thr Glu Tyr Ser Pro Lys
385 390 395 400
Ser Ala Leu Leu Asp Arg Asp Lys Arg Arg Ala Glu Phe Ala Arg Leu
405 410 415
Gln Asn Met Gly Val Lys Gly Val Lys Ile Asp Phe Phe Pro Gly Asp
420 425 430
Gly Lys Ser Val Met Ala Tyr Tyr Asn Asp Leu Ala Lys Asp Ala Ala
435 440 445
Asp Tyr Asn Leu Leu Val Asn Tyr His Gly Ser Ser Leu Pro Arg Gly
450 455 460
Leu His Arg Thr Tyr Pro Asn Ile Met Thr Met Glu Ser Val His Gly
465 470 475 480
Phe Glu Met Ile Thr Phe Met Gln Pro Ser Ala Asp Lys Ala Ala Thr
485 490 495
His Met Ala Ile Leu Pro Phe Thr Arg Asn Ala Phe Asp Pro Met Asp
500 505 510
Phe Thr Pro Thr Thr Phe Ser Asp Ile Pro Asn Ile Glu Arg Arg Thr
515 520 525
Ser Asn Gly Phe Glu Leu Ala Leu Pro Val Leu Phe Leu Ser Gly Leu
530 535 540
Gln His Ile Ala Glu Thr Ala Gln Gly Met Ala Thr Asn Ala Pro Asp
545 550 555 560
Tyr Val Lys Ala Tyr Met Arg Asp Ile Pro Val Leu Trp Asp Glu Ser
565 570 575
Lys Leu Ile Asp Gly Met Pro Gly Glu His Val Val Ile Ala Arg Lys
580 585 590
His Gly Glu Arg Trp Phe Val Ala Gly Ile Asn Ala Thr Asn Glu Ala
595 600 605
Ile Asn Leu Glu Met Asn Phe Asp Phe Ala Leu Gly Lys Gln Gly Thr
610 615 620
Leu Ile Thr Asp Ser Asn Ile Asn Thr Lys Gly Val Glu Ser Phe Thr
625 630 635 640
Ser His Thr Ile Thr Ala Thr Lys Asn Asn Ala Leu Thr Val Lys Ala
645 650 655
Asn Gly Gly Phe Val Ile Val Phe Asn
660 665
<210> 133
<211> 619
<212> PRT
<213> Microbulbifer degradans
<400> 133
Met Ala Ala Gly Gln Ile Ile Ser Leu Glu Val Lys Val Lys Lys Ile
1 5 10 15
Glu Glu Ile Met Lys His Thr Ala Arg Thr Ile Ala Leu Gly Ala Thr
20 25 30
Gly Ala Ala Leu Leu Thr Gly Leu Ile Ala Cys Asn Gly Thr Asn Val
35 40 45
Asn Thr Asn Gly Asp Thr Gln Gln Ala Ser Ile Lys Lys Ala Pro Glu
50 55 60
Gly Met Phe Ala Asn Pro Leu Phe Ala Asn Gly Ala Asp Pro Trp Leu
65 70 75 80
Glu Tyr Tyr Asp Gly Asn Tyr Tyr Leu Thr Thr Thr Thr Trp Thr Ser
85 90 95
Gln Leu Val Met Arg Lys Ser Pro Thr Leu Asp Gly Leu Ser Thr Ala
100 105 110
Leu Pro Val Asn Val Trp Ser Asp Ser Asp Leu Thr Arg Cys Cys Asn
115 120 125
Phe Trp Ala Phe Glu Phe His Arg Leu Asn Gly Pro Asn Gly Trp Arg
130 135 140
Trp Tyr Leu Met Tyr Thr Ser Gly Gln His Gly Thr Leu Asp His Gln
145 150 155 160
His Leu Ser Val Leu Glu Ser Val Gly Asp Asp Pro Met Gly Pro Tyr
165 170 175
Thr Tyr Lys Gly Glu Met Met Pro Asn Thr Trp Asn Ile Asp Gly Ser
180 185 190
Tyr Leu Glu His Asn Gly Gln Leu Tyr Leu Leu Trp Ser Glu Trp Val
195 200 205
Gly Asp Glu Gln Gln Asn Phe Ile Ser Lys Met Thr Thr Pro Trp Ser
210 215 220
Ile Glu Gly Pro Arg Ala Leu Leu Thr Arg Pro Glu Ala Glu Trp Glu
225 230 235 240
Lys Ser Gly Arg Lys Val Asn Glu Gly Pro Glu Ile Leu Lys Lys Asp
245 250 255
Gly Arg Thr Phe Leu Ile Tyr Ser Ala Ser Tyr Cys Asp Thr Pro Asp
260 265 270
Tyr Lys Leu Ala Met Lys Glu Leu Thr Gly Asp Asp Pro Met Asn Ser
275 280 285
Glu His Trp Thr Lys Tyr Asp Lys Pro Val Phe Glu Arg Gly Asn Gly
290 295 300
Val Phe Ala Pro Gly His Asn Gly Phe Phe Lys Ser Pro Asp Gly Thr
305 310 315 320
Glu Asp Trp Ile Val Tyr His Gly Asn Ser Lys Glu Glu His Gly Cys
325 330 335
Gly Ala Thr Arg Ser Val Arg Ala Gln Lys Phe Thr Trp Asn Thr Asp
340 345 350
Gly Thr Pro Asn Phe Gly Glu Pro Ile Pro Glu Gly Gln Phe Leu Pro
355 360 365
Leu Pro Ser Gly Glu Asn Gly Pro Leu Val Thr Ala Leu Gln Gly Ala
370 375 380
Arg Ile Gln Leu Arg Asn Gly Glu Ser Cys Leu Leu Ala Glu Gly Lys
385 390 395 400
Glu Leu Lys Gln Gly Ser Cys Gln Ala Glu Ala Ser Leu Trp Val Met
405 410 415
Asp Asn Thr Ala Asp Asn His Tyr Arg Phe Gly Asn Val Ala Ser Asn
420 425 430
Leu Phe Leu Thr Ala Asp Glu Gly Leu Ser Gln Ser Ala Trp Val Asn
435 440 445
Thr Ala Ser Gln Arg Trp Ala Leu Asn Ala Gly Glu Gly Asn Phe Val
450 455 460
Ala Phe Thr Asn Lys Tyr Thr Gly Asp Ala Leu Met Gln Asn Asn Trp
465 470 475 480
Gln Ile Leu Pro Val Gly Lys Val Ala Ile Ser Ser Ile Gln Ser Gly
485 490 495
Arg Val Leu Gln Ala Cys Asp Lys Asn Ser Ala Asn Val Asn Gln Gly
500 505 510
Gly Trp Gln Gly Arg Ala Cys Gln Ala Trp Gln Phe Asn Pro Ala Ser
515 520 525
Glu Gly His Val Gln Ile Lys Thr Gly Asn Gln Cys Leu Thr Val Glu
530 535 540
Asn Lys Ser Ile Val Pro Gly Thr Asn Val Ile Ala Gly Glu Cys Glu
545 550 555 560
Ser Thr Ser Ser Gln Trp Leu Tyr Gln Leu Asp Lys Glu Gly Arg Ala
565 570 575
Thr Phe Thr Asn Arg Glu Ser Lys Gln Arg Leu Asp Leu Ala Asn Cys
580 585 590
Gly Leu Ala Asp Gly Thr Asn Phe Ala Gln Ala Pro Ala Leu Asp Thr
595 600 605
Ile Cys Gln Ala Phe Gln Val Arg Tyr Leu Pro
610 615
<210> 134
<211> 1860
<212> DNA
<213> Microbulbifer degradans
<400> 134
gtggccgcag gccaaataat ttcactggag gttaaagtga aaaagataga agaaataatg 60
aaacacacag cgcgcactat agcgctaggt gcaacaggtg ccgccttgct aacggggtta 120
attgcctgta acggtaccaa tgtgaataca aacggggata cccaacaagc aagcattaaa 180
aaagcgccag aaggcatgtt tgccaacccg ttgttcgcca atggcgcaga cccttggtta 240
gagtattacg atggcaatta ctacctcact accaccacat ggacatcgca acttgttatg 300
cgtaaatcac ccacgttgga tggtttgtct actgcgctgc cggtgaatgt atggtccgat 360
tcagatttaa cccgctgctg taacttttgg gcgtttgaat tccatcgctt aaacggccct 420
aacggctggc gttggtattt aatgtacacc tcgggccagc acggcacttt agatcaccaa 480
cacttaagcg tattagaaag tgtgggtgac gaccctatgg ggccatacac ctacaaaggt 540
gaaatgatgc ccaatacatg gaatatagac ggcagttatt tagagcataa tggccaatta 600
tatttgttgt ggtctgaatg ggtaggtgac gagcagcaaa actttatatc taaaatgacc 660
accccatgga gcattgaagg cccgcgagca ctgttaactc ggccggaagc agagtgggaa 720
aaaagcggtc gcaaagttaa cgaaggccca gagattctaa aaaaagatgg tcgtaccttt 780
ttgatttact cggcgagcta ttgcgatacg ccagattata aactggcaat gaaagagcta 840
acgggcgacg acccaatgaa ctccgagcac tggactaaat acgataagcc cgtgtttgaa 900
agagggaacg gtgtgtttgc cccaggccac aatggtttct tcaaatcgcc cgatggcaca 960
gaagattgga ttgtgtacca cggaaattcg aaagaagagc acggctgcgg tgcaacgcgc 1020
tctgtgcgcg cacagaaatt tacttggaac acagacggca ccccaaattt tggtgagcca 1080
ataccagaag gccaattttt gcccttacct tctggcgaaa acggtccttt agtgactgca 1140
ctgcaaggtg cacgtattca gttgcgcaat ggcgaaagct gtttattagc cgaaggtaaa 1200
gagctgaagc agggcagttg ccaagcagaa gccagtttat gggtaatgga taacacggca 1260
gataatcact accgctttgg caatgtagcc agcaatttat ttttaacggc agacgaaggc 1320
cttagtcaga gtgcctgggt taatactgca agccagcgct gggcacttaa tgcaggcgaa 1380
ggtaactttg ttgcgtttac aaacaagtac accggcgatg cgcttatgca aaataattgg 1440
caaatactac cggttggcaa agtggcaatt agcagcattc aaagtggccg cgtattacag 1500
gcgtgcgata aaaacagcgc gaatgtaaac caaggcggct ggcagggtag ggcttgtcaa 1560
gcatggcaat ttaacccagc aagtgaaggc catgtgcaaa ttaaaacggg caaccaatgt 1620
ttaaccgttg agaataaatc catagtgcct ggcaccaatg ttattgccgg cgagtgtgaa 1680
tcaactagca gccaatggct ttatcaatta gataaagaag gccgcgcaac cttcacaaac 1740
cgcgaaagca aacagcgttt agatttagca aactgcggcc tagccgacgg caccaacttc 1800
gcacaagccc ctgcgttaga tactatttgc caagcattcc aagtgcgtta tttaccgtaa 1860
<210> 135
<211> 534
<212> PRT
<213> Microbulbifer degradans
<400> 135
Met Asn Pro Ala Ser Thr Leu Ser Val Lys Thr Thr Asn Lys Thr Thr
1 5 10 15
Asn His Leu Lys Lys Val Ala Leu Thr Val Ala Ala Ile Val Ala Pro
20 25 30
Leu Thr Ser Trp Ala Asp Val Lys Val Ser Leu Asn Pro Gln Asn Thr
35 40 45
Gly Glu Thr Ile Ser Lys Tyr Ile Tyr Gly Gln Phe Ala Glu His Leu
50 55 60
Gly Ser Gly Ile Tyr Gly Gly Ile Trp Val Gly Glu Asp Ser Pro Ile
65 70 75 80
Pro Asn Lys Asn Gly Phe Arg Asn Asp Val Ile Lys Ala Leu Gln Glu
85 90 95
Leu Gln Val Pro Val Ile Arg Trp Pro Gly Gly Cys Phe Ala Asp Glu
100 105 110
Tyr Arg Trp Arg Asp Gly Ile Gly Pro Arg Glu Gln Arg Pro Ile Arg
115 120 125
Val Asn Thr His Trp Gly Gly Val Glu Glu Pro Asn Thr Phe Gly Thr
130 135 140
His Glu Phe Phe Glu Leu Val Glu Leu Leu Asn Thr Glu Ala Tyr Val
145 150 155 160
Ala Gly Asn Leu Gly Thr Gly Ser Pro Gln Glu Met Ala Glu Trp Leu
165 170 175
Glu Tyr Ile Val Ser Asn Ser Asn Ser Thr Val Val Ala Glu Arg Lys
180 185 190
Lys Asn Gly Arg Glu Glu Pro Trp Glu Val Ala Phe Trp Gly Val Gly
195 200 205
Asn Glu Ser Trp Gly Cys Gly Gly Asn Leu Thr Pro Glu Tyr Tyr Thr
210 215 220
Asn Leu Tyr Arg His Phe Ser Thr Phe Val Lys Ala Thr Gly Ala Lys
225 230 235 240
Arg Pro Lys Leu Val Ala Ser Gly Ser Tyr Asp Asp Asp Glu Thr Trp
245 250 255
Thr Thr Pro Leu Ser Lys Leu Lys Asn Asn Ile Asp Gly Ile Ser His
260 265 270
His Tyr Tyr Thr Leu Pro Thr Ser Asp Trp Ser Ile Lys Gly Ala Ala
275 280 285
Thr Gly Phe Asp Glu Lys Glu Trp Ile Leu Thr Leu Glu Arg Thr Leu
290 295 300
Lys Ile Asp Ser Tyr Leu Ala Thr Gln Thr Gly Ile Leu Lys Lys Asn
305 310 315 320
Asn Pro Glu Gly Asn Ile Gly Leu Tyr Leu Asp Glu Trp Gly Thr Trp
325 330 335
Tyr Asp Ala Glu Pro Gly Thr Asn Pro Gly Phe Leu Tyr Gln Gln Asn
340 345 350
Thr Val Arg Asp Ala Ile Val Ala Ala Val Asn Leu Asn Ile Phe His
355 360 365
Asn Tyr Ala Asp Arg Leu His Met Ala Asn Ile Ala Gln Met Val Asn
370 375 380
Val Leu Gln Ala Met Ile Leu Thr Asp Asn Glu Lys Met Leu Leu Thr
385 390 395 400
Pro Thr Tyr His Val Phe Lys Met Tyr Ile Pro Phe Gln Asp Ala Thr
405 410 415
His Ile Pro Leu Asp Ile Lys Gly Gln Arg Asp Tyr Ser Ala His Lys
420 425 430
Thr Thr Val Pro Gly Phe Ser Ala Ser Ala Ala Lys Thr Pro Asn Gly
435 440 445
Asn Ile Val Val Ser Leu Val Asn Leu Asn Pro Asn Glu Ala Glu Glu
450 455 460
Val Ser Ile Ala Leu Gln Gly Ile Lys Val Lys Thr Ile Thr Gly Glu
465 470 475 480
Leu Leu Thr Ser Gln Lys Met Asp Ala His Asn Thr Phe Asp Lys Pro
485 490 495
Asn Asn Val Gln Pro Arg Ala Leu Asn Gln Ser Asp Tyr Ser Ile Ser
500 505 510
Lys Asn Gly Lys Thr Leu Thr Val Lys Leu Pro Ala Lys Ala Val Val
515 520 525
Val Leu Gln Leu Asn Lys
530
<210> 136
<211> 1605
<212> DNA
<213> Microbulbifer degradans
<400> 136
atgaacccag cttcaaccct aagtgtgaaa acaactaata aaacaacaaa ccacctcaaa 60
aaagttgccc ttaccgtagc cgctatagtt gccccgctaa caagctgggc cgatgtgaaa 120
gttagcctaa acccacaaaa cacaggcgaa accataagta aatatatcta cggccaattc 180
gcagagcacc ttggcagcgg catatacggc ggcatatggg tgggcgaaga ctccccaata 240
cccaacaaaa acggctttcg taacgatgta atcaaagccc tgcaagagct acaagtacct 300
gttataagat ggcccggtgg ctgctttgcc gacgaatatc gctggcgtga tggcattggc 360
ccacgtgagc aacgccctat ccgcgtaaat acccactggg gcggtgtgga agaacccaat 420
acctttggta ctcacgaatt cttcgaatta gttgagctac ttaataccga agcctatgtg 480
gcaggtaacc taggcactgg ctcaccacaa gaaatggccg aatggctgga atatattgtt 540
tccaactcca actctactgt agtggcagag cgcaaaaaaa atggccgcga agagccttgg 600
gaagtggcct tttggggtgt gggtaatgaa tcttggggtt gcggtggcaa cttaacgccc 660
gagtactaca ccaatttata tcgtcatttt tccacttttg ttaaagccac tggcgctaag 720
cgcccgaagt tagttgcaag cggctcgtat gatgatgacg aaacttggac aacgccacta 780
agtaagctca aaaataatat agatggtata agccaccact actacacctt acccaccagc 840
gactggagca taaaaggtgc ggctacaggc tttgatgaaa aagaatggat tcttaccctt 900
gagcgcacat tgaaaataga cagctacctt gcaactcaaa cgggtattct taaaaagaat 960
aaccccgaag gcaatatagg gttgtattta gatgaatggg gtacgtggta cgatgcagag 1020
cccggtacaa accccggctt tttataccaa caaaataccg tgcgcgacgc catagtagca 1080
gcagtaaact taaatatatt ccacaactat gccgaccgct tacacatggc gaacatcgcc 1140
caaatggtaa acgtactaca ggcaatgata ctgaccgaca acgaaaaaat gttgctcaca 1200
cccacgtatc acgtttttaa gatgtacata ccatttcaag atgctaccca tataccgtta 1260
gatataaaag gccagcgcga ctacagcgca cataaaacta ccgtaccagg gttctcggcc 1320
tcggcagcta aaaccccaaa tggcaatatt gtagtttcac ttgttaacct taacccaaac 1380
gaagcggaag aagtgagcat tgcactacaa ggcatcaagg taaaaaccat tactggtgaa 1440
ttacttacca gccaaaaaat ggatgcgcat aacaccttcg acaagccaaa caatgtgcag 1500
ccacgcgcac ttaaccaaag cgactacagc attagcaaaa acggcaaaac ccttaccgtt 1560
aagctacccg ctaaagccgt agttgtttta cagcttaata aataa 1605
<210> 137
<211> 397
<212> PRT
<213> Microbulbifer degradans
<400> 137
Met Leu Leu Ala Gln Leu Pro Val Lys Lys Tyr Phe Val Leu Leu Ala
1 5 10 15
Ile Phe Ser Phe Met Leu Gly Cys Asn Ser Ala Gly Val Gln Gln Ser
20 25 30
Ala Lys Ser Ile Gln Val Ala Gly Thr His Ser Lys Pro Ala Arg Phe
35 40 45
Phe Ala Gly Ala Asp Leu Ser Tyr Val Asn Glu Met Glu Asp Cys Gly
50 55 60
Ala Thr Tyr Arg Val Asn Gly Val Thr Thr Asp Pro Tyr Gln Ala Phe
65 70 75 80
Ala Asp Ala Gly Ala Asn Leu Val Arg Val Arg Leu Trp His Asn Pro
85 90 95
Thr Trp Thr Glu Tyr Ser Asp Phe Ala Asp Val Lys Lys Thr Ile Arg
100 105 110
Lys Ala Lys Gln Asn Asn Gln Thr Val Leu Leu Asp Phe His Tyr Ser
115 120 125
Asp Thr Trp Ala Asp Pro Glu Lys Gln Phe Val Pro Ala Ala Trp Glu
130 135 140
His Met Val Asp Asp Thr Pro Ala Leu Ala Gln Ala Leu Ala Gln Tyr
145 150 155 160
Thr Thr Asp Val Leu Glu Lys Leu Gln Ala Glu Asn Leu Leu Pro Asp
165 170 175
Met Val Gln Val Gly Asn Glu Thr Asn Ala Glu Val Leu Gln Leu Glu
180 185 190
Ala His Met Lys His Gly Glu Ile Asp Trp Gln Arg Asn Ala Ala Leu
195 200 205
Leu Asn Ser Gly Leu Ala Ala Val Ala Glu Phe Asn Gln Asn Asn Asn
210 215 220
Thr Tyr Ile Glu Arg Val Leu His Ile Ala Gln Pro Glu Asn Ala Leu
225 230 235 240
Trp Trp Phe Asp Asp Ala Ala Gln Ala Gly Ile Thr Asp Phe Glu Ile
245 250 255
Ile Gly Leu Ser Tyr Tyr Ala Lys Trp Ser Thr Tyr Lys Leu Asp Ser
260 265 270
Ile Gly Glu Ala Ile Arg Ala Leu Arg Thr Ala Phe Asn Lys Asp Val
275 280 285
Leu Val Val Glu Thr Ser Tyr Pro Trp Thr Met Gln Asn Phe Asp Gln
290 295 300
Ala Asn Asn Val Leu Asp Ala Thr Ser Leu Gln Gln Gly Tyr Pro Ala
305 310 315 320
Thr Ala Glu Gly Gln Lys Lys Tyr Met Met Asp Leu Ala Lys Gln Ile
325 330 335
Met Tyr Ala Gly Gly Ile Gly Ile Ala Tyr Trp Glu Pro Ala Trp Val
340 345 350
Ser Thr Pro Cys Lys Thr Leu Trp Gly Thr Gly Ser His Trp Glu Asn
355 360 365
Ala Val Phe Phe Asp Ser Gly Asn Asn Asn Glu Ala Leu Pro Ala Leu
370 375 380
Ser Phe Tyr Thr Asp Ile Met Ala Leu Phe Lys Gln Asp
385 390 395
<210> 138
<211> 1194
<212> DNA
<213> Microbulbifer degradans
<400> 138
atgttattag ctcaattgcc agttaaaaaa tattttgtct tattagctat tttctcgttt 60
atgctggggt gtaatagtgc tggcgtacaa caaagtgcta aatcaattca ggttgctggc 120
acgcacagta aacccgctcg tttttttgct ggtgccgacc tttcttacgt aaacgaaatg 180
gaagattgcg gagcaacata ccgcgtaaac ggtgtaacta ccgaccctta ccaagccttt 240
gccgatgccg gcgcaaattt agtgcgcgtg cgcttatggc acaaccctac ttggacagaa 300
tattccgact ttgccgacgt taaaaaaact atccgcaaag ccaaacaaaa taatcaaacg 360
gtattgttag attttcatta ttcagatacc tgggccgacc cagaaaaaca atttgttcca 420
gccgcttggg aacatatggt ggatgacacc ccagcactag cgcaagcctt agcgcaatac 480
acaaccgatg tattagaaaa gctgcaagca gaaaacctat tgccagatat ggtgcaagta 540
ggtaacgaaa caaacgcaga agtcttacag ctagaagcgc acatgaaaca cggcgaaata 600
gattggcagc gcaatgcagc gctactaaac agtgggttag cagccgttgc tgaatttaac 660
caaaacaaca acacctatat tgaacgcgta ttacatatcg cccagccaga aaatgctttg 720
tggtggtttg acgatgccgc gcaggctggc ataaccgatt ttgaaattat aggtcttagc 780
tactatgcca aatggtcaac gtataaatta gattccatcg gcgaagctat acgcgccttg 840
cgaaccgcat tcaataaaga tgtgttggtg gtagaaacct catacccctg gactatgcaa 900
aatttcgatc aagccaataa cgtgctcgat gctaccagct tgcagcaggg ctaccctgca 960
acggccgaag gccaaaaaaa atacatgatg gatttagcta aacaaattat gtacgccggt 1020
ggaattggta ttgcctactg ggaaccagct tgggtaagca ccccttgcaa aactctatgg 1080
ggtacaggtt ctcactggga aaatgccgtg ttttttgact ctggcaacaa caacgaagcg 1140
ctacccgcgc ttagtttcta cacagacata atggctcttt ttaagcaaga ttaa 1194
<210> 139
<211> 420
<212> PRT
<213> Microbulbifer degradans
<400> 139
Met Leu Gln Ile Leu Lys Asp His Gly Met Asp Ser Ile Arg Leu Arg
1 5 10 15
Val Trp Val Asn Pro Ala Gly Gly Trp Tyr Ser Ser Ile Asn Asp Val
20 25 30
Ile Glu Lys Ala Gln Arg Ala Lys Ala Ala Gly Met Arg Ile Met Ile
35 40 45
Asp Phe His Tyr Ser Asp Ser Trp Ala Asp Pro Gly Lys Gln Tyr Lys
50 55 60
Pro Ala Ala Trp Thr Asn Tyr Thr Leu Asp Gly Leu Met Ser Ala Val
65 70 75 80
Trp Trp His Thr Tyr Asp Ser Leu Val Ala Leu Lys Asn Ala Gly Ile
85 90 95
Thr Pro Glu Trp Val Gln Val Gly Asn Glu Thr Asn Asn Gly Met Leu
100 105 110
Trp Glu Glu Gly Arg Ala Ser Ala Asn Met Gln Asn Tyr Ala Trp Leu
115 120 125
Val Asn Ser Gly Tyr Asp Ala Val Lys Glu Val Phe Pro Asn Thr Lys
130 135 140
Ala Val Val His Leu Ala Asn Cys His Asp Asn Ala Asn Phe Arg Trp
145 150 155 160
Ile Phe Asp Gly Leu Gln Ala Asn Gly Gly Lys Trp Asp Val Ile Gly
165 170 175
Ala Ser Ile Tyr Pro Thr Asn Ala Ser Gly Tyr Ser Trp Ser Gln Ala
180 185 190
Asn Ser Leu Cys Glu Ala Asn Leu Asn Asp Met Gln Ser Arg Tyr Gly
195 200 205
Ser Glu Val Leu Ile Ala Glu Val Gly Ala Pro Trp Asp His Pro Glu
210 215 220
Ala Lys Ala Ile Val Ser Asp Val Ile Ala Lys Ala Gln Asn Ala Gly
225 230 235 240
Ala Thr Gly Val Phe Tyr Trp Glu Pro Gln Ala Ser Asn Trp Gln Gly
245 250 255
Tyr Thr Leu Gly Ala Trp Asn Pro Asn Thr Met Arg Pro Thr Glu Ala
260 265 270
Leu Asp Ala Phe Ile Asp Gly Ser Ser Asn Val Thr Thr Ala Arg Leu
275 280 285
Gln Ser Arg Asn Ser Asn Arg Cys Ile Asp Val Asn Gly Arg Ser Thr
290 295 300
Ala Asp Gly Ala Asp Ile Ile Gln Trp Ser Cys His Ser Asn Ala Asn
305 310 315 320
Gln Gln Trp Thr Phe Glu Asp Met Gly Asn Asn Tyr Val Arg Leu Arg
325 330 335
Val Gly His Ser Asn Lys Cys Leu Asp Val Leu Gly Ala Gly Thr Ala
340 345 350
Asp Gly Asp Asn Val Val Gln Trp Ala Cys His Asn Asn Ala Asn Gln
355 360 365
Gln Trp Leu Lys Glu Asp Met Gly Asp Gly Tyr Phe Arg Leu Lys Ser
370 375 380
Arg Ala Ser Gly Lys Cys Val Asp Val Asn Ala Gly Gly Ala Asn Asn
385 390 395 400
Gly Asp Ser Ile Ile Gln Trp Ser Cys His Thr Gly Trp Asn Gln Gln
405 410 415
Trp Met Val Tyr
420
<210> 140
<211> 1263
<212> DNA
<213> Microbulbifer degradans
<400> 140
gtgctgcaaa ttctaaaaga tcacggtatg gattccatcc gtctgcgcgt gtgggtaaac 60
cccgccggtg gatggtatag cagcattaat gacgtaatag aaaaagctca gcgcgccaaa 120
gctgcgggca tgcgtattat gatcgatttt cactacagcg actcttgggc tgacccaggc 180
aagcaataca agcccgccgc gtggaccaac tataccttag acggtttaat gtctgcggtg 240
tggtggcaca cctacgattc cctcgtggcc ctaaagaatg cgggtattac ccctgaatgg 300
gtgcaagtgg gcaacgaaac aaacaacggt atgttatggg aagaggggcg cgcatccgcc 360
aatatgcaaa actatgcgtg gttggtgaat agtggctacg atgccgttaa agaagtgttc 420
cctaatacca aggcagtggt gcacttggca aactgccacg acaacgcaaa cttccgctgg 480
atatttgacg gcttacaagc caatggtggt aagtgggatg taataggtgc ctctatttac 540
cctaccaacg caagcggtta tagctggagc caagccaaca gtttgtgcga ggcaaactta 600
aacgatatgc aatcgcgcta tgggtccgag gtgctaattg ccgaggttgg tgcgccgtgg 660
gatcacccag aagcgaaagc aatcgtgagc gatgtaattg ctaaggcgca aaacgccggt 720
gcaacagggg tattttattg ggagccgcag gcatcaaact ggcagggcta cacgctaggt 780
gcatggaacc caaacaccat gcgccccacc gaagcattag acgcgtttat tgacggcagc 840
tcgaatgtga caaccgcgcg tttgcaatcg cgcaatagca accgctgtat agatgttaat 900
ggccgcagta cagcagatgg tgccgatatc attcagtgga gttgccacag caacgccaac 960
cagcaatgga cttttgaaga tatgggcaat aactacgtgc gattgcgcgt gggccacagt 1020
aataagtgct tagatgtact gggtgcaggc actgccgatg gcgataacgt agtgcagtgg 1080
gcatgccaca ataacgccaa tcagcaatgg ctaaaagaag acatgggcga tggctacttc 1140
cgcttaaaat ctcgcgccag cggtaaatgc gtagatgtaa acgcaggcgg tgctaacaac 1200
ggtgattcta ttattcaatg gagttgccac actggttgga accagcaatg gatggtttat 1260
tag 1263
<210> 141
<211> 589
<212> PRT
<213> Microbulbifer degradans
<400> 141
Asn Asp Leu Ser Tyr Val Asn Glu Met Glu Asp Cys Gly Ala Val Tyr
1 5 10 15
Lys Asp Ala Gly Ser Val Val Asp Pro Tyr Glu Val Ile Ala Asn His
20 25 30
Gly Gly Asn Leu Val Arg Val Arg His Trp Asn Asp Pro Tyr Trp Gln
35 40 45
Ala Leu Ile Thr Gln Pro Glu Ser Val Ala Ala Asn Trp Lys Ala Asn
50 55 60
Tyr Ser Gly Leu Glu Asp Val Thr Glu Thr Ile Arg Arg Ser Lys Ala
65 70 75 80
Ala Gly Met Glu Val Leu Leu Asp Phe His Phe Ser Asp Ile Trp Ala
85 90 95
Asp Pro Gly Arg Gln Thr Thr Pro Arg Ala Trp Glu Asn Asp Phe Gly
100 105 110
Asp Glu Asp Ala Met Ala Ala His Ile Tyr Asp Tyr Val Thr Ser Val
115 120 125
Leu Thr Gly Leu Asn Asp Glu Gly Leu Met Pro Glu Leu Ile Gln Ile
130 135 140
Gly Asn Glu Ser Asn Ser Gly Met Met Thr Thr Gln Asn Leu Ile Ile
145 150 155 160
Glu Met Asn Asp Ala Gly Thr Gly Leu Asn Val Ser Lys Gly Gly Gln
165 170 175
Thr Asn Tyr Ser Asp Gln Tyr Val Ala Arg Met Tyr Asn Ser Ala Ile
180 185 190
Ser Ala Val Arg Asp Ile Ser Glu Gly Met Thr Asn Ala Pro Arg Ile
195 200 205
Ala Ile His Val Ala Gly Ala Asp Lys Ala Val Ala Phe Phe Asp Lys
210 215 220
Leu Lys Ser Ile Gly Val Thr Asp Ile Asp Ile Ala Gly Phe Ser Phe
225 230 235 240
Tyr Tyr Gly Trp Glu Gln Ala Pro Ile Glu Asp Val Ala Ser Met Ile
245 250 255
Ala Thr Leu Lys Glu Arg His Pro Asn Leu Asp Pro Leu Met Leu Glu
260 265 270
Thr Gly Tyr Leu Trp Asp Glu Glu Asn Ile Asp Ser Leu Gly Asn Ile
275 280 285
Ile Gly Ile Ala Asp Pro Ala Tyr Leu Pro Val Ser Lys Gln Asn Gln
290 295 300
Leu Lys Tyr Leu Thr Asp Leu Ser Gln Ala Val Ala Asp Ala Gly Gly
305 310 315 320
Ile Gly Val Val Phe Trp Glu Pro Ser Trp Val Ser Thr Glu Cys Arg
325 330 335
Thr Pro Trp Gly Gln Gly Ser Ser His Glu His Val Ala Tyr Phe Asp
340 345 350
His Arg Asp Gly Leu Asn Phe His Ile Gly Gly Gln Trp Met Glu Val
355 360 365
Thr Lys Leu Ser Glu Thr Pro Glu Ala Gly Leu Ala Thr Thr Phe Arg
370 375 380
Val Asp Met Thr Gly Gln Asp Thr Ser Ala Gly Val Phe Ile Arg Gly
385 390 395 400
Ala Phe Thr Glu Asp Thr Leu Gln Pro Met Leu Tyr Glu Gly Glu Asn
405 410 415
Ile Tyr Ser Tyr Thr Thr His Ile Gln Ala Ala Gln Ser Gly Ser Tyr
420 425 430
His Tyr Ala Ile Gly Leu Lys Asn Gly Thr Arg Glu Thr Val Pro Ser
435 440 445
Glu Cys Ala Asn Pro Glu Asp Thr Leu Asn Arg Leu Tyr Thr Val Gly
450 455 460
Glu Asn Gly Glu Gln Leu Val Thr Ala Val Trp Ala Ser Cys Asp Val
465 470 475 480
Phe Asp Pro Gln Ala Ala Gly Pro Thr Thr Leu Thr Leu Asn Val Asp
485 490 495
Met Thr Gly Val Asp Val Ser Gly Gly Val Tyr Val Ala Gly Asp Leu
500 505 510
Asn Ala Trp Thr Ile Thr Glu Leu Thr Gln Val Gly Ala Ser Ala Ile
515 520 525
Tyr Thr Ile Ser Tyr Asp Leu Ala Val Gly Ala Glu Gly Gly Tyr Tyr
530 535 540
Phe Leu Asn Gly Ser Asp Trp Gly Asp Arg Glu Thr Ile Pro Glu Glu
545 550 555 560
Cys Val Gly Tyr Tyr Asp Ala Asp Arg Gly Phe Leu Val Glu Glu Gln
565 570 575
Ser Pro Gln Val Leu Asp Leu Val Trp Ser Ser Cys Gln
580 585
<210> 142
<211> 1770
<212> DNA
<213> Microbulbifer degradans
<400> 142
aacgaccttt cctacgtaaa cgaaatggaa gactgcggcg cagtgtataa agatgcaggc 60
agtgtagtag acccctacga agtgatcgcc aatcacggcg gcaaccttgt gcgcgtgcgt 120
cactggaacg acccctactg gcaggcgctt attacccagc cagaatctgt agcggcaaat 180
tggaaagcta attacagtgg gcttgaagat gtaacagaaa caattcgacg atcaaaagcc 240
gccggtatgg aagtgctgct cgactttcat ttctcagata tttgggcaga ccccggcagg 300
caaacaacgc cgcgcgcatg ggaaaatgac tttggcgatg aagacgccat ggctgcgcat 360
atatacgatt acgtaacatc agtactcaca gggttaaacg acgaaggttt aatgcccgag 420
cttattcaaa ttggtaacga atctaactcc ggcatgatga cgactcaaaa cctcattata 480
gaaatgaatg atgcgggtac ggggttaaat gtgagtaaag gcgggcaaac aaattactca 540
gatcaatatg ttgcgcgtat gtataactct gcgatttctg ctgtgcgcga tattagtgaa 600
gggatgacca acgctccgcg tattgctatc cacgtggcag gtgcagataa agccgtcgca 660
ttttttgata agttaaaaag catcggagta acggatattg atatcgcggg cttctcgttt 720
tattacggtt gggagcaagc accaatagaa gacgttgcaa gcatgattgc aaccttaaag 780
gaacgtcacc ctaatttaga cccgttaatg cttgaaacag gctacctgtg ggatgaagaa 840
aacatcgata gcttaggcaa tattattggc attgccgacc ccgcatattt acctgtgagc 900
aaacagaacc aacttaaata tcttaccgat ttatcgcaag cggttgctga tgctggtggt 960
attggagtgg tgttttggga gccatcttgg gtatcaaccg aatgtcgcac accttggggg 1020
cagggctcat ctcacgagca tgttgcttac ttcgatcacc gcgacggctt aaactttcat 1080
attggtggcc aatggatgga ggttaccaag ttaagcgaaa ccccagaagc ggggctagct 1140
actaccttta gagtggatat gactggccaa gacaccagcg caggggtatt tattcgcggg 1200
gcgtttaccg aagacacatt gcagcccatg ctatatgaag gcgaaaacat ttatagttac 1260
accacgcata tccaagcagc gcaaagcgga agctaccact atgcgattgg cttaaaaaat 1320
ggtacgcgcg aaacggttcc tagcgaatgt gcaaacccag aagatacgtt aaatcgttta 1380
tatacggtgg gcgaaaatgg ggagcagtta gttaccgcag tttgggcaag ctgtgatgtt 1440
ttcgatccgc aagcggctgg gccaacaacc ttaacgctta atgtagatat gactggtgta 1500
gatgtaagtg gtggtgtgta tgttgcaggc gacttaaatg cttggacaat caccgagctt 1560
acacaagttg gcgctagcgc aatttatacc attagttacg atttagccgt aggtgcagaa 1620
ggtggctact acttcttgaa tggcagcgat tggggcgata gggaaacaat accagaagaa 1680
tgtgtgggct actatgatgc agaccgcggc tttttggtgg aagagcaaag cccacaggta 1740
ttggatttag tgtggagtag ttgtcaataa 1770
<210> 143
<211> 339
<212> PRT
<213> Microbulbifer degradans
<400> 143
Met Glu Arg Thr Gln Thr Ala Met Ser Asp Thr Pro Val Asn Pro Asn
1 5 10 15
Ala Asn Thr Thr Thr Lys Ala Val Tyr Thr Tyr Leu Lys Gln Gln Trp
20 25 30
Gly Ser Lys Met Leu Thr Gly Gln Met Asp Leu Thr Trp Lys Asp Ser
35 40 45
Ile Asp Glu Tyr Gln Arg Val Ile Asn Asp Thr Gly Lys Ala Pro Ala
50 55 60
Ile Met Gly Tyr Asp Tyr Met Asn Tyr Gly Ile Glu Ser Ser Phe Ile
65 70 75 80
Ser Gly Leu Glu Gln Thr Glu Glu Ala Ile Val His Trp Gln Arg Gly
85 90 95
Gly Leu Val Thr Phe Ala Trp His Trp Arg Asp Pro Asn Val Ser Gly
100 105 110
Ser Asn Ile Gly Glu Phe Tyr Thr Ala Asp Thr Ser Phe Gln Ile Pro
115 120 125
Ile Ala Asn Gly Gln Leu Asp Glu Ser Ser Gln Ser Phe Ile Asn Met
130 135 140
Gln Ala Asp Ile Asp Met Ile Ala Ala Glu Leu Gln Lys Leu Glu Asp
145 150 155 160
Ala Gly Ala Val Val Leu Trp Arg Pro Leu His Glu Ala Ser Gly Gly
165 170 175
Trp Phe Trp Trp Gly Arg Thr Arg Thr Asp Ser Val Ser Ala Ala Tyr
180 185 190
Ala Gln Val Leu Leu Trp Arg His Met Tyr Thr Arg Leu Thr Asp His
195 200 205
His Gly Leu Asp Asn Leu Leu Trp Val Trp Asn Gly Gln Asn Ser Ala
210 215 220
Trp Tyr Pro Gly Asp Glu Tyr Ala Asp Ile Val Ser His Asp Ile Tyr
225 230 235 240
Asp Gly Ala Lys Asn Tyr Glu Ser Gln Leu Ala Val Tyr Asn Asp Thr
245 250 255
Lys Asn Thr Pro Met Gln Thr Lys Met Val Ala Leu Ser Glu Asn Ser
260 265 270
Asn Ile Pro Asp Pro Asp Ala Met Gln Ala Asp Gly Ala Trp Trp Leu
275 280 285
Trp Phe Met Val Trp Asn Asp Ser Asp Thr Ala Glu Gly Val Thr His
290 295 300
Glu Asn Asn Phe Trp Thr Gly Glu Tyr Tyr Asn Ser Asn Ala His Lys
305 310 315 320
Gln His Val Tyr Asn His Glu Leu Val Ile Thr Leu Asp Glu Leu Pro
325 330 335
Ser Phe Asp
<210> 144
<211> 1020
<212> DNA
<213> Microbulbifer degradans
<400> 144
atggagcgca ctcaaacggc catgagcgat acgccggtaa accctaacgc caataccacc 60
accaaggctg tttacactta ccttaagcag caatggggca gcaaaatgct aaccgggcag 120
atggatttga cctggaagga cagcattgat gagtaccagc gcgtaattaa cgataccggc 180
aaagcccccg caattatggg ctacgactat atgaattacg gtattgagag cagttttatt 240
agtggccttg agcaaacaga agaagctatt gtccactggc agcgcggcgg cttagtgact 300
tttgcttggc actggcgaga cccgaacgta agcggtagta acattggcga gttttatacg 360
gctgatacga gctttcaaat cccaattgcc aatggtcaac tggatgaaag cagtcagagc 420
tttataaata tgcaggccga tatcgatatg atagcggcag agctgcaaaa gctagaagat 480
gccggcgctg tggtgctgtg gcgccccttg cacgaagctt ctggcggttg gttctggtgg 540
ggccgtacgc gtaccgattc ggtatctgcc gcttatgcac aagtgctgtt gtggcgccac 600
atgtacaccc gccttactga tcatcacggt ttagataatt tactttgggt gtggaacggg 660
caaaacagcg cttggtaccc aggcgatgaa tatgccgata ttgtaagtca cgacatttac 720
gacggcgcga aaaattacga gtcgcaatta gccgtatata acgatacgaa aaacaccccc 780
atgcaaacca aaatggtggc gctaagcgag aacagtaata ttcccgaccc agatgccatg 840
caggccgatg gcgcttggtg gttgtggttt atggtgtgga acgactcgga taccgcggaa 900
ggcgtaaccc acgaaaacaa cttttggaca ggtgaatatt acaactctaa cgcccataag 960
cagcatgttt acaatcatga gctggttatt acgctggatg agttacctag ctttgactag 1020
<210> 145
<211> 561
<212> PRT
<213> Microbulbifer degradans
<400> 145
Met Phe Asn Lys Ile Ser Thr Pro Ala Leu Ser Gly Trp Ser Lys Ala
1 5 10 15
Ala Arg Tyr Leu Cys His Thr Ala Thr Gly Ala Leu Met Leu Ala Ala
20 25 30
Ser Thr Val Asn Ala Gly Phe Ser Val Ser Gly Thr Gln Leu Leu Asp
35 40 45
Asp Asn Gly Gln Ala Phe Ile Met Arg Gly Val Asn His Pro His Ala
50 55 60
Trp Tyr Ala Asn Gln Thr Ser Ser Phe Ala Asp Ile Ala Ser Val Gly
65 70 75 80
Ala Asn Thr Val Arg Val Val Leu Ser Asp Gly Gln Gln Trp Thr Arg
85 90 95
Asn Ser Ala Ser Asp Val Ala Asn Val Ile Ser Leu Cys Lys Ala Asn
100 105 110
Lys Leu Val Cys Val Leu Glu Val His Asp Val Thr Gly Ser Gly Glu
115 120 125
Ala Ser Ala Ala Gly Thr Leu Ala Asn Ala Ala Gln Tyr Trp Val Asp
130 135 140
Ile Ala Asn Val Leu Lys Gly Gln Glu Asp Tyr Val Ile Ile Asn Ile
145 150 155 160
Ala Asn Glu Pro Phe Gly Asn Asn Val Pro Ala Ser Asn Trp Ile Asn
165 170 175
Gln His Lys Ala Ala Ile Gln Thr Leu Arg Ala Ala Gly Leu Thr His
180 185 190
Thr Leu Met Ile Asp Ala Ala Asn Trp Gly Gln Asp Trp Gln Gln Val
195 200 205
Met Leu Asn Asn Ala Ser Glu Val Ala Gln Ala Asp Ser Leu Ser Asn
210 215 220
Thr Met Phe Ser Val His Met Tyr Gln Val Tyr Asn Asn Leu Ser Thr
225 230 235 240
Val Glu Asn Tyr Val Ser Thr Phe Leu Ser Ser His Asn Leu Pro Leu
245 250 255
Ile Val Gly Glu Phe Gly Ala Asp His Gln Gly Glu Glu Val Asp Glu
260 265 270
Asp Ala Ile Leu Ser Val Ala Glu Gln Tyr Gly Ile Gly Tyr Leu Gly
275 280 285
Trp Ser Trp Ser Gly Asn Gly Ser Cys Cys Gly Thr Leu Asp Ile Thr
290 295 300
Asn Asn Phe Asn Val Asn Ser Leu Thr Ser Trp Gly Asn Arg Leu Ile
305 310 315 320
Asn Gly Thr Asn Gly Ile Lys Ala Thr Ser Val Ile Ala Ser Val Tyr
325 330 335
Gly Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser
340 345 350
Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Gly
355 360 365
Ser Ser Gly Gly Ala Gln Gln Cys Asn Trp Tyr Gly Ser Val Tyr Pro
370 375 380
Leu Cys Asn Asn Gln Ala Ser Gly Trp Gly Trp Glu Asn Gln Gln Ser
385 390 395 400
Cys Ile Gly Arg Thr Thr Cys Glu Ser Gln Ser Gly Asn Gly Gly Val
405 410 415
Ile Gly Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
420 425 430
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
435 440 445
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Gly Ser
450 455 460
Gly Ala Thr Cys Glu His Ile Ile Thr Asn Ser Trp Asn Ser Gly Phe
465 470 475 480
Gln Gly Ala Val Arg Ile Thr Asn Asn Gly Ser Ser Ala Ile Asn Gly
485 490 495
Trp Gln Val Ser Trp Ser Tyr Ser Asp Gly Thr Thr Ile Gly Ser Val
500 505 510
Trp Asn Ala Asn Gln Ser Gly Ser Asn Pro Tyr Thr Ala Ser Asn Leu
515 520 525
Gly Trp Asn Ala Thr Val Asn Pro Gly Gln Ser Val Glu Phe Gly Phe
530 535 540
Thr Ala Asn Gly Gly Gly Ala Ala Ser Ala Val Thr Gly Ser Val Cys
545 550 555 560
Asn
<210> 146
<211> 61
<212> PRT
<213> Microbulbifer degradans
<400> 146
Met Cys Ser Gln Val Ala Pro Leu Pro Pro Asp Glu Leu Leu Glu Glu
1 5 10 15
Glu Glu Leu Glu Asp Asp Glu Leu Leu Leu Asp Glu Leu Glu Leu Leu
20 25 30
Leu Asp Glu Leu Leu Glu Glu Leu Leu Asp Glu Leu Leu Glu Glu Leu
35 40 45
Leu Asp Glu Asp Pro Pro Ile Thr Pro Pro Leu Pro Asp
50 55 60
<210> 147
<211> 1686
<212> DNA
<213> Microbulbifer degradans
<400> 147
atgtttaaca agatatctac acccgcgtta agcggttgga gcaaggctgc gcgctattta 60
tgccatacgg ctactggcgc attaatgttg gctgcaagta cggtaaatgc ggggtttagc 120
gtttcgggta cgcagctgtt agatgataac ggccaagcgt ttattatgcg cggtgttaat 180
catccgcatg cgtggtacgc caatcaaacc agctcgtttg ctgatatagc ttcagtaggt 240
gctaatacgg ttcgagtggt actaagcgat ggccagcaat ggacgcgcaa cagcgcatct 300
gatgtggcga atgttatctc gctatgtaaa gcaaacaagt tagtgtgtgt tcttgaggta 360
cacgatgtaa ccggttcagg cgaggccagt gcggcgggta ccttggcaaa tgcggcccag 420
tattgggtag atattgccaa cgtgcttaag gggcaagaag attacgtaat aatcaatatt 480
gccaacgaac cgttcggcaa caatgtgcct gccagtaatt ggattaacca gcacaaagcg 540
gctattcaaa cgctgcgagc ggccggttta acccacacgc taatgataga tgcggcaaac 600
tgggggcaag attggcaaca agtaatgcta aacaatgctt cagaggtggc acaggccgat 660
agcctaagta acaccatgtt tagcgtgcat atgtatcagg tttacaacaa cttaagcacc 720
gtagaaaact atgtgtctac gtttttaagt agccataact tgccgttaat agtgggggag 780
tttggtgcag atcaccaagg tgaagaggtt gatgaagatg caattttatc tgtggcagag 840
cagtatggca ttggctactt aggctggagt tggtcgggca atggcagttg ttgcggcacc 900
ttagatataa ccaacaactt taacgttaac agtttaacca gttggggtaa ccgcctaata 960
aacggtacca atggtattaa agcaacctcg gtaattgcat ctgtatacgg tggctcgtcg 1020
agcagttcta gttcatctag tagctcttca actagctcta gcagttccag ttcttctacc 1080
agttcatcca gcagttcgtc cggttctagt ggtggtgcgc aacagtgtaa ttggtacggc 1140
tcggtttacc cattgtgtaa taatcaagcc agtggttggg gctgggaaaa tcagcaaagc 1200
tgtattggcc gtactacctg cgaaagtcag tctggtaatg gcggagttat tggcgggtct 1260
tcgtcgagca gttcttctag tagttcatca agcagttctt ctagcagctc atccagtagc 1320
agttctagct catctagcag cagctcgtcg tcctcaagtt cttcttcttc tagtagttca 1380
tcgggcggca gtggtgcaac ttgcgaacac ataattacaa atagttggaa cagcggtttt 1440
caaggggcgg tgcgcattac caataatggc agcagtgcca ttaacggctg gcaagtaagt 1500
tggagctaca gtgatggcac aaccattggc agtgtatgga atgccaatca aagcggtagc 1560
aacccttaca cagccagcaa tttagggtgg aacgccacgg ttaaccctgg gcagtcggta 1620
gagtttggtt ttactgccaa cggcggcggt gctgcatcag cagtgacggg gtcggtttgt 1680
aactaa 1686
<210> 148
<211> 514
<212> PRT
<213> Microbulbifer degradans
<400> 148
Met Asn Asn Glu Glu Val Asn Met Arg Ile Phe Ser Ile Ala Leu Ala
1 5 10 15
Leu Cys Ala Val Leu Cys Ala Gly Gln Ser Leu Ala Gly Leu Ser Ile
20 25 30
Gln Gly Thr Arg Leu Val Asp Gly Asn Gly Ser Thr Val Val Leu Arg
35 40 45
Gly Val Asn His Pro His Ala Trp Tyr Ala Gly Glu Thr Ala Ala Ala
50 55 60
Ile Pro Lys Ile Ala Ala Thr Gly Ala Asn Ser Val Arg Val Val Met
65 70 75 80
Ala Met Gly Thr Lys Trp Ser Arg Thr Ser Ala Ala Glu Ile Gln Thr
85 90 95
Ile Ile Asp Leu Cys Lys Gln Asn Asn Met Ile Ala Val Leu Glu Phe
100 105 110
His Asp Gly Thr Gly Trp Gly Glu Glu Ser Gly Thr Ala His Ile Ser
115 120 125
Asp Ile Ala Asp Tyr Trp Val Ser Ser Asp Val Met Ala Val Val Lys
130 135 140
Gly Glu Glu Asp Tyr Val Ile Ile Asn Ile Ala Asn Glu Pro Phe Gly
145 150 155 160
Asn Gly Val Ser Ala Ser Thr Tyr Thr Asn Asp Thr Ile Ala Ala Ile
165 170 175
Gln Lys Leu Arg Asn Ala Gly Tyr Thr His Thr Leu Met Val Asp Ala
180 185 190
Ala Asn Trp Gly Gln Asp Trp Gln Asn Leu Met Arg Asp Asn Ala Gln
195 200 205
Thr Ile Phe Asn Gly Asp Ser Leu Asn Asn Thr Met Phe Ser Val His
210 215 220
Met Tyr Gln Val Tyr Asn Thr Ser Ala Lys Val Gln Ser Tyr Met Gln
225 230 235 240
Ala Phe Ser Asp Lys Gly Leu Ala Leu Val Val Gly Glu Phe Ala Ala
245 250 255
Asp His Phe Ser Glu Asp Val Ala Glu Ala Ala Ile Met Gln Tyr Ala
260 265 270
Glu Gln Phe Gly Phe Gly Tyr Met Gly Trp Ser Trp Thr Gly Asn Ser
275 280 285
Ser Asp Leu Ala Ser Leu Asp Ile Val Lys Ser Phe Ser Asp Asn Thr
290 295 300
Tyr Thr Thr Trp Gly Asn Arg Leu Ile Asn Gly Ser Asn Gly Ile Ala
305 310 315 320
Thr Thr Ser Arg Ile Ala Ser Val Tyr Thr Gly Thr Ser Ser Gly Gly
325 330 335
Ser Ser Ser Ser Ser Ser Ser Ser Asn Ser Ser Ser Ser Gly Gly Thr
340 345 350
Ser Ala Cys Ser Thr Gly Gly Ser Cys Asp Trp Ser Gly Thr Ser Phe
355 360 365
Pro Leu Cys Glu Asn Gly Asn Thr Asn Asp Trp Gly Tyr Glu Asn Gly
370 375 380
Gln Ser Cys Val Gly Val Gly Leu Cys Asp Ser Asn Pro Asp Ala Thr
385 390 395 400
Thr His Cys Gly Thr Ser Ser Gly Gly Gly Ser Thr Ser Cys Gly Thr
405 410 415
Thr Ser Asp Gly His Pro Val Cys Cys Asp Ala Ala Ser Asp Pro Asp
420 425 430
Gly Asp Gly Trp Gly Trp Glu Asn Glu Ala Ser Cys Val Val Glu Ser
435 440 445
Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser
450 455 460
Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Gly Ser Ser Ala Thr Cys
465 470 475 480
Asn Trp Tyr Gly Thr Gln Tyr Pro Met Cys Thr Ser Thr Ser Ser Gly
485 490 495
Trp Gly Trp Glu Asn Asn Gln Ser Cys Ile Ser Pro Ser Thr Cys Ser
500 505 510
Gly Gln
<210> 149
<211> 52
<212> PRT
<213> Microbulbifer degradans
<400> 149
Met Glu His Ala Asp Val Pro Pro Leu Leu Leu Glu Leu Asp Asp Asp
1 5 10 15
Glu Glu Leu Glu Leu Pro Pro Leu Asp Val Pro Val Tyr Thr Leu Ala
20 25 30
Ile Arg Glu Val Val Ala Ile Pro Leu Leu Pro Phe Ile Lys Arg Leu
35 40 45
Pro Gln Val Val
50
<210> 150
<211> 1545
<212> DNA
<213> Microbulbifer degradans
<400> 150
atgaataacg aggaagtcaa catgagaata ttttctatag cgcttgcgct atgcgccgtg 60
ctttgcgcgg gtcaatcttt agctgggtta tctattcagg gtacgcgact tgtggatggc 120
aatggcagca cggttgtgct gcgcggcgtg aaccatcccc atgcgtggta tgccggtgaa 180
acggcggcgg ctatccctaa aattgctgca acaggtgcta actcggtacg ggtagtgatg 240
gctatgggca ctaagtggag ccgcaccagt gccgcagaga ttcaaaccat tattgacctg 300
tgcaaacaaa acaacatgat tgctgtactg gaatttcacg atggcactgg ctggggggaa 360
gagagcggta ctgcgcatat atctgatatt gccgattact gggtgagcag tgatgtaatg 420
gcggtagtta agggtgagga agactacgta attataaata ttgccaacga accctttggc 480
aacggtgtat cggcaagcac ctacaccaac gacacgattg cggctattca aaaactgcgc 540
aacgccggtt atacccacac ccttatggtt gatgcggcaa attgggggca agattggcag 600
aacctaatgc gtgataacgc acaaaccatt tttaatggcg actcacttaa caacactatg 660
tttagcgtgc atatgtatca ggtgtataac acctcggcaa aagtgcaaag ctacatgcaa 720
gcctttagcg acaaaggttt agcgttagtt gtaggcgaat tcgcagcaga tcattttagt 780
gaagatgttg cagaagcagc cattatgcaa tacgccgagc aattcggttt tggttacatg 840
ggttggagtt ggacgggtaa tagctcggat ttagcatcgc tagacatagt aaaaagcttt 900
agcgataaca cctatacaac ctggggcaac cgcttaataa acggcagcaa cggtattgcg 960
accacttcgc gtatagccag tgtgtacaca ggaacgtcta gcggcggcag ctctagttct 1020
tcgtcatcgt ctaattcaag tagcagtggc ggtacatcag cgtgttccac tggcggaagt 1080
tgtgattgga gtggtacaag ctttccattg tgtgaaaacg gtaatactaa cgactggggt 1140
tacgagaatg gccaaagttg tgtaggtgta ggtttatgcg attctaaccc cgatgctacc 1200
actcactgcg gtacttcttc tggcggcggt tctacaagtt gcggcactac cagcgacggc 1260
caccctgttt gctgtgatgc agcctcagac ccagatggcg atggctgggg ttgggagaac 1320
gaagccagtt gtgtcgttga aagcggtagc tcttcgagtt catcaagttc tagctctacg 1380
tctagttcaa gttcgtcgag ctcttcaagc agctcaagtg gcggtagcag cgccacgtgt 1440
aattggtatg gcactcaata cccaatgtgt acttctacca gtagcggctg ggggtgggag 1500
aataaccaaa gttgtatttc gccttctacg tgtagtgggc aatga 1545
<210> 151
<211> 457
<212> PRT
<213> Microbulbifer degradans
<400> 151
Met Val Asn Tyr Leu Lys Arg Ala Ala Leu Cys Met Leu Ala Leu Val
1 5 10 15
Ala Val Ala Cys Ala Lys Ser Gln Pro Glu Thr Lys Val Glu Glu Leu
20 25 30
Pro Ala Thr Lys Thr Ala Ala Asn Ala Gly Val Ala Ala Thr Glu Phe
35 40 45
Val Gln Val Asn Gly Gly Arg Phe Thr Leu Arg Gly Gln Asp Tyr Ala
50 55 60
Tyr Ile Gly Thr Asn Met Trp Phe Ala Ala Tyr Ile Gly Ser Thr Asn
65 70 75 80
Pro Glu Tyr Gly Asp Arg Glu Arg Leu Ile Lys Glu Leu Asp Leu Leu
85 90 95
Lys Ser Leu Gly Val Thr Asn Leu Arg Ile Leu Gly Ala Ser Glu Lys
100 105 110
Ser Pro Leu Arg Asp Ser Met Lys Pro Ala Ile Ser Glu Arg Gly Glu
115 120 125
Ile Asn Gln His Asp Ile Leu Glu Gly Leu Asp Phe Ala Leu Ala Glu
130 135 140
Met Ala Lys Arg Asp Met Lys Ala Val Ile Phe Leu Asn Asn Phe Trp
145 150 155 160
Glu Trp Ser Gly Gly Met Ala Thr Tyr Leu Ser Trp Val Asn Gly Gly
165 170 175
Glu Ile Val Asp Met Ala Asp Pro Thr Lys Pro Trp Pro Ala Phe Ala
180 185 190
Leu Phe Ser Ala Gly Phe Tyr Ser Asn Glu Glu Ala Lys Gln Leu Phe
195 200 205
Asn Asn Tyr Leu Thr Lys Val Val Ser Arg Arg Asn Thr Ile Thr Gly
210 215 220
Glu Leu Tyr Ala Asn Asp Pro Thr Ile Met Ser Trp Gln Leu Ala Asn
225 230 235 240
Glu Pro Arg Pro Gly Asn Gly Asp Val Ser Lys Ser Asn Leu Pro Ala
245 250 255
Tyr Tyr Asp Trp Ile Ser Lys Thr Thr Gln Leu Ile Lys Ser Ile Ala
260 265 270
Pro Lys Gln Leu Val Ser Ile Gly Ser Glu Gly Thr Met Gly Cys Leu
275 280 285
Glu Leu Asp Glu Cys Val Ile Thr Ala His Lys Glu Thr Gly Ile Asp
290 295 300
Tyr Val Thr Phe His Met Trp Leu Lys Asn Trp Gly Trp Phe Asp Val
305 310 315 320
Gln Asn Ala Glu Gln Thr Tyr Asp Ser Ala Val Ala Thr Ala Asp Lys
325 330 335
Tyr Ile Asp His His Ile Lys Leu Ala Asn Glu Leu Asn Met Pro Val
340 345 350
Val Leu Glu Glu Phe Gly Met Glu Arg Asp Gly Gly Glu Phe Ser Pro
355 360 365
Glu Ser Ala Val Thr Tyr Arg Asp Lys Phe Tyr Ala Tyr Val Phe Asp
370 375 380
Arg Gln Ile Lys Ser Ile Arg Ser Gly Gly Pro Phe Val Gly Ser Asn
385 390 395 400
Phe Trp Ala Trp Gly Gly Tyr Gly Lys Ala Met His Asp Asp Ala Val
405 410 415
Trp Arg Lys Gly Asp Lys Thr Phe Val Gly Asp Pro Pro Gln Glu Pro
420 425 430
Gln Gly Leu Asn Ala Val Phe Ala Ser Asp Thr Ser Thr Leu Glu Val
435 440 445
Leu Lys Gln Ala Ala Asp Ala Ile Arg
450 455
<210> 152
<211> 1374
<212> DNA
<213> Microbulbifer degradans
<400> 152
atggtgaact acttaaagcg cgctgccctg tgcatgcttg cgctcgttgc tgtggcttgc 60
gccaaatcgc agccagaaac aaaagttgaa gagctgcctg ccaccaaaac tgcggcaaat 120
gccggtgtgg cagcgacaga gtttgtgcaa gtaaatggtg gccgctttac attgcgcggg 180
caggactacg cctacatagg cactaatatg tggtttgccg cgtatattgg ttctaccaac 240
cctgaatacg gcgatcgcga gcgcctaatt aaagaactcg atttattaaa atccttgggt 300
gtgaccaacc tgcgtatttt aggagcatcc gaaaaatctc cattgcgcga ttccatgaaa 360
ccggcaatta gtgagcgcgg tgaaattaac cagcacgata ttttagaggg cttagatttt 420
gcgttagctg aaatggctaa gcgcgatatg aaagccgtga tatttctgaa taacttttgg 480
gaatggtccg gcggtatggc gacttaccta agctgggtta acggtggcga aattgttgat 540
atggcagacc ccaccaaacc atggccagct tttgcacttt tttcagcggg gttttactct 600
aatgaagagg cgaaacaact attcaataat taccttacaa aagttgtaag ccgccgcaat 660
accattaccg gcgagttgta cgctaacgac cccactatta tgtcttggca gctcgctaac 720
gagccgcgcc caggtaacgg cgatgtgagc aagtctaact tacccgccta ttacgattgg 780
ataagtaaaa ccactcagct aattaaaagc attgccccta agcagttggt gtcgattggt 840
agcgaaggta ccatgggctg cttagagttg gatgagtgcg ttattaccgc ccacaaagaa 900
acaggtatcg actacgttac attccatatg tggcttaaaa actggggctg gttcgatgtg 960
caaaacgcag agcaaaccta cgacagcgca gttgctactg cagataaata tatcgatcac 1020
cacattaaac tggctaacga gttaaatatg ccggtggtgc tggaagagtt tggtatggag 1080
cgcgacggag gtgaattctc cccagaaagc gcggtaacct atcgcgataa attctatgcc 1140
tatgtgttcg atcgccaaat taaaagtatt cgctctggcg ggccgtttgt agggtccaac 1200
ttttgggcgt ggggcggtta tggcaaagcc atgcacgatg atgctgtatg gcgcaaaggc 1260
gataaaacct ttgttggtga cccgccgcag gagccgcaag gtttaaatgc ggtatttgct 1320
tccgatactt caacgctgga agtgttgaag caagcggcgg acgctattcg gtaa 1374
<210> 153
<211> 850
<212> PRT
<213> Microbulbifer degradans
<400> 153
Met Asn Thr Lys Ile Lys Leu Leu Ser Phe Leu Ala Ser Ala Met Leu
1 5 10 15
Leu Gln Ala Cys Gly Gly Asp Met Leu Gly Thr Ser Asp Ser Glu Asp
20 25 30
Tyr Lys Leu Ile Pro Glu Glu Val Thr Glu Asp Pro Thr Lys Ala Arg
35 40 45
Pro Ser Glu Asn Ala Pro Val Leu Lys Thr Ser Gly Thr Thr Ile Gln
50 55 60
Leu Pro Asp Gly Thr Pro Val Leu Leu Arg Gly Ile Asn Leu Gln Phe
65 70 75 80
Gly Asp Asn Pro Ile Glu Gln Ile Asp Gly Ile Gln Ala Ile Arg Glu
85 90 95
Thr Gly Ser Asn Val Val Arg Ile Gln Leu Leu Ala Asp Thr Ser Thr
100 105 110
Ala Asn Leu Glu Ala Val Leu Asn Lys Val Val Glu His Asn Leu Ile
115 120 125
Ala Val Leu Ser Leu Tyr Asp Glu Ala Leu His Cys Lys Glu Asp Asp
130 135 140
Glu Ala Phe Thr Asp Ala Val Lys Gln Val Trp Leu Thr Asp Phe Leu
145 150 155 160
Pro Ile Ile Ala Gln Asp Arg Tyr Gln Ala Asn Ile Met Ile Asn Ile
165 170 175
Ala Ser Gly Trp Gly Pro Glu Ala Ile Phe Asp Gly Tyr Ser Val Gly
180 185 190
Tyr Lys Thr Tyr Thr Asp Asn Tyr Lys Ala Ala Ile Arg Gln Phe Arg
195 200 205
Lys Ala Gly Phe Asn Val Pro Leu Val Ile Asp Ala Pro Gly Cys Gly
210 215 220
Ala Asp Phe Asn Ala Phe Leu Ser Asn Arg Gly Lys Glu Leu Met Ala
225 230 235 240
Ala Asp Asp Lys Asp Asn Leu Val Leu Ser Val His Gly Tyr Gly Ser
245 250 255
Gln Trp Asn Thr Ala Thr Lys Val Thr Asp Ala Ile Ser Gln Leu Ala
260 265 270
Ala Gln Asn Ile Pro Val Leu Met Ser Glu Phe Gly Gly Ser Gly Val
275 280 285
Gly Glu Lys Pro Val Lys His Met Ala Ile Leu Asp Lys Gly Ala Gly
290 295 300
Asp Tyr Ala Ala Glu Ile Ile Leu Pro Trp Ala Ser Ala Thr Asp Lys
305 310 315 320
Val Ala Met Asn Val Pro Phe Ser Ala Pro Ile Asn Leu Thr Asn Thr
325 330 335
Asp Val Ser Phe Asp Val Lys Leu Asp Glu Ala Tyr Val Thr Asp Gly
340 345 350
Gln Met Gly Val Val Met Tyr Leu Arg Asp Val Asn Gly Glu Tyr Ala
355 360 365
Asn Leu Ala Trp His Ser Ala Ser Glu Phe Pro Ala Gly Glu Trp Ala
370 375 380
Thr Lys Ser Tyr Ala Ile Gln Asn Asn Ala Ser Phe Gly Trp Ala Ser
385 390 395 400
Glu Ala Phe Asp Ile Thr Ala Val Ala Lys Val Gly Val Glu Leu Val
405 410 415
Ala Asn Gly Lys Leu Ala Glu Val Ala Gly Ser Val Val Ile Asp Asn
420 425 430
Leu Arg Val Ala Glu Gly Ser Gly Ala Val Glu Leu Tyr Ser Gln Ser
435 440 445
Phe Asp Asp Asp Ile Ala Gly Trp Gly Val Pro Trp Thr Gly Thr Val
450 455 460
Ala Ala His Ala Asp Gly Ala Leu Ser Leu Thr His Asp Asn Gly Glu
465 470 475 480
Ile Ile Ala Gln Leu Asp Gly Leu Gly Gly Val Val Asp Phe Ala Gln
485 490 495
Pro Val Val Ile Ser Gly Arg Phe Phe Val Pro Ala Asp Tyr Ala Gly
500 505 510
Ser Trp Met Tyr Ala Lys Phe Phe Asn Asn Gly Glu Ala Trp Thr Glu
515 520 525
Val Gly Ile Thr Gly Leu Thr Pro Gly Glu Trp Thr Glu Ile Ser Val
530 535 540
Glu Thr Glu Phe Pro Ala Ala Ala Thr Ser Val Gly Ile Gln Ile Gly
545 550 555 560
Asn Ile Gly Ile Ala Asp Gly Ala Thr Ile Thr Asp Ser Thr Glu Pro
565 570 575
Phe Leu Leu Asp Asp Phe Ala Ile Ser Gly Val Ala Ala Asn Asp Ser
580 585 590
Phe Glu Leu Gly Thr Gln Tyr Met Ala Ser Phe Asp Glu Ser Glu Asp
595 600 605
Gly Trp Ala Tyr Leu Ser Trp Gly Ala Ser Ala Thr Val Glu Ala Ile
610 615 620
Asp Gly Ala Leu Asn Phe Tyr Pro Asn Ala Asn Asp Ala Val Arg Ile
625 630 635 640
Val Leu Tyr Lys Thr Asp Leu Ser Ala Ile Asp Asp Leu Asp Leu Gln
645 650 655
Asp Pro Phe Thr Ile Lys Thr Arg Val Leu Ile Pro Asp Ser Tyr Thr
660 665 670
Gly Gln Ala Phe Glu Tyr Gln Leu Phe Leu Gln Asp Ala Asn Trp Gln
675 680 685
Asn His Phe Ala Ala Lys Ile Trp Asn Gln Asp Glu Leu Ile Pro Gly
690 695 700
Glu Trp Met Asp Leu Val Val Glu Val Glu Phe Pro Ala Glu Phe Asp
705 710 715 720
Arg Ala Gly Ile Pro Gln Tyr Leu Gly Phe Asp Leu Ser Ser Glu Val
725 730 735
Ala Leu Pro Gln Asp Pro Ile Leu Ile Asp Glu Ile Val Phe Glu Gly
740 745 750
Met Val Pro Val Glu Lys Glu Val Val Ile Ile Asp Gln Val Asp Phe
755 760 765
Phe Tyr Thr Asn His Phe Thr Asp Phe Ala Ile Asp Tyr Ile Glu Gly
770 775 780
Glu Ile Leu Glu Asp Asp Ile Leu Glu Leu Ala Tyr Ile His Gln Arg
785 790 795 800
Ser Glu Pro Phe Ser Trp Ile Ala Trp Ser Trp Tyr Gly Asn Asp Ile
805 810 815
Glu Asn Ser Asp Trp Asp Met Thr Thr Ile Val Gly Asp Ala Thr Ala
820 825 830
Leu Thr Glu Arg Gly Glu Asp Ile Val Asn Gly Lys Gly Gly Ile Ala
835 840 845
Gly Asn
850
<210> 154
<211> 2553
<212> DNA
<213> Microbulbifer degradans
<400> 154
atgaacacaa aaataaaatt actttcattt cttgcgagtg caatgctgtt gcaggcatgt 60
ggtggtgaca tgctgggcac ctctgacagc gaagactata aattgattcc agaagaagta 120
acggaagatc caactaaagc taggccgtct gaaaatgccc ccgtgttaaa aacgagtggt 180
actactatac agctaccaga tggtacacca gtgctgctgc gaggtattaa ccttcaattt 240
ggcgacaatc ctattgagca aatagatggc atacaggcta ttcgcgaaac aggttcaaac 300
gttgtgcgaa ttcaattgtt agcagataca tctacagcaa atttagaggc ggtgctaaat 360
aaagttgttg agcataattt aatagctgta cttagtctct acgacgaagc tttacattgt 420
aaagaagatg atgaggcatt tacggatgca gttaagcaag tatggcttac cgattttctt 480
ccaattattg ctcaagatcg ttatcaagca aatattatga ttaacatcgc aagtggctgg 540
gggcctgaag ccatttttga tgggtatagt gtaggttaca aaacgtatac cgataactat 600
aaagcagcta ttcgtcagtt ccgaaaggct gggtttaatg tgcctttggt aattgatgcc 660
cctggatgtg gtgctgactt taacgctttt ttaagtaatc gcggtaaaga gcttatggct 720
gcggatgata aagacaatct tgtgttgtcc gtgcacggtt atggttcgca gtggaacacg 780
gctactaaag ttacggatgc aattagccag cttgctgcgc aaaatatccc tgtgctaatg 840
agcgagtttg gtggctctgg tgtcggtgaa aagcctgtta aacatatggc gatccttgat 900
aaaggtgcag gagattacgc cgcagaaatt attctgccat gggcgagcgc tacagataaa 960
gtggctatga atgtgccatt ctcagcacct atcaatctga caaacactga tgtgagcttt 1020
gacgttaagc tggatgaagc atatgtcacg gatggtcaaa tgggggttgt aatgtacctc 1080
cgagatgtaa atggagaata tgcaaactta gcttggcact ctgcctctga atttccggct 1140
ggtgaatggg ctactaaatc ttatgcgatt cagaataatg catctttcgg ctgggcaagt 1200
gaggcgtttg atattactgc cgtcgctaaa gttggtgttg agttggttgc aaatggcaag 1260
ctcgccgagg ttgctggcag cgttgttatc gacaaccttc gagtggctga aggtagtggt 1320
gcagtcgagt tatacagtca aagctttgat gatgatattg caggttgggg cgtgccttgg 1380
actggcacag ttgctgcgca cgctgacggt gctttatcgc taacccatga caatggagaa 1440
attattgccc agctggacgg gcttggtggt gtagttgatt ttgcccagcc ggttgttatt 1500
agtggcaggt tttttgtgcc tgcggattat gctggttcat ggatgtacgc taaatttttc 1560
aataacggcg aggcatggac agaggttgga attactgggc ttactcctgg agagtggaca 1620
gaaatttccg tcgaaaccga atttcctgca gctgctacca gtgtaggcat tcagattggc 1680
aacataggta ttgctgatgg ggcaaccatt acagattcca cagagccttt tttgctggac 1740
gactttgcta taagtggcgt agcagctaat gactcgttcg aacttggtac gcagtatatg 1800
gcgtcgttcg atgaatctga agatggctgg gcttacttga gttggggggc gtctgctact 1860
gtagaggcca ttgatggcgc tttgaatttc tatccaaatg ccaatgatgc ggtgagaata 1920
gtactttata aaaccgactt aagtgctata gacgacttgg atttgcagga cccgttcacc 1980
attaaaacgc gagtattgat tccagatagc tacacaggac aggcgttcga gtatcagctg 2040
tttctacaag atgctaactg gcaaaatcat tttgcagcga aaatttggaa ccaagatgag 2100
cttatccctg gtgagtggat ggacttggtg gttgaggttg agtttcctgc tgaattcgat 2160
cgggctggaa ttcctcaata cctcggtttt gacctgtcct ccgaagttgc tttaccacaa 2220
gacccaatac taattgatga aatagttttt gaaggcatgg ttccagtaga aaaagaagtt 2280
gtgatcattg atcaggtaga tttcttctac actaaccact ttacagattt tgctatcgac 2340
tatattgagg gtgaaatatt agaggacgac attctggagc ttgcttatat tcatcagcgc 2400
agcgagccat tctcttggat agcttggtct tggtacggaa atgatatcga aaattctgac 2460
tgggatatga ccaccatagt tggcgacgca acagccttga ctgaacgtgg tgaagatatc 2520
gtaaatggta aaggtgggat tgcaggtaac tag 2553
<210> 155
<211> 408
<212> PRT
<213> Microbulbifer degradans
<400> 155
Met Gln Lys Ile Thr Arg Leu Ala Ala Leu Val Val Ala Gly Leu Cys
1 5 10 15
Leu Leu Gly Thr Gln Ala Gln Ala Lys Lys Phe Glu His Leu Ala Gln
20 25 30
Thr Pro Pro Met Gly Trp Asn Ser Trp Asn Asn Phe Gly Cys Asp Val
35 40 45
Asp Glu Lys Leu Ile Lys Glu Thr Ala Asp Tyr Met Val Ser Ser Gly
50 55 60
Met Lys Asp Ala Gly Tyr Glu Tyr Val Asn Ile Asp Asp Cys Trp His
65 70 75 80
Gly Glu Arg Asp Ala Asn Gly Phe Ile Gln Ala Asp Pro Glu Arg Phe
85 90 95
Pro Ser Gly Ile Lys Ala Leu Ala Asp Tyr Val His Ser Lys Gly Leu
100 105 110
Lys Phe Gly Ile Tyr Ser Asp Ala Gly Trp Thr Thr Cys Gly Gly Lys
115 120 125
Pro Gly Ser Arg Gly Tyr Glu Phe Gln Asp Ala Gln Met Tyr Ala Lys
130 135 140
Trp Gly Val Asp Tyr Leu Lys Tyr Asp Trp Cys Ala Thr Asp Gly Leu
145 150 155 160
Lys Ala Glu Gly Ala Tyr Gln Thr Met Arg Glu Ala Ile His Lys Ala
165 170 175
Gly Arg Pro Met Val Phe Ser Ile Cys Glu Trp Gly Asp Asn Gln Pro
180 185 190
Trp Glu Trp Ala Lys Pro Ile Gly His Leu Trp Arg Thr Thr Gly Asp
195 200 205
Ile Tyr Asn Cys Phe Asp Cys Glu Tyr Asp His Gly Thr Trp Ser Ser
210 215 220
Trp Gly Val Leu Gln Ile Leu Asp Met Gln Asp Asp Leu Arg Gln Tyr
225 230 235 240
Ala Gly Pro Gly His Trp Asn Asp Pro Asp Met Met Glu Val Gly Asn
245 250 255
Gly Met Thr Glu Ala Glu Asp Arg Ser His Phe Ser Met Trp Ala Met
260 265 270
Leu Ala Ala Pro Leu Ile Ala Gly Asn Asp Ile Arg Asn Met Ser Glu
275 280 285
Ala Thr Arg Lys Ile Leu Thr Asn Lys Ala Val Ile Ala Val Asp Gln
290 295 300
Asp Glu Leu Gly Val Gln Gly Phe Lys Tyr Ser Ser Lys Asn Gly Val
305 310 315 320
Glu Val Trp Phe Lys Pro Leu Ala Asn Asp Glu Trp Ala Met Ala Val
325 330 335
Leu Asn Arg Asn Lys Gly Glu Val Lys Phe Glu Phe Lys Trp Arg Asn
340 345 350
Glu Val Val Lys Asp Glu Leu Thr His Arg Thr Ile Thr Phe Asn Glu
355 360 365
Gln Lys Phe Asp Trp Gln Asp Leu Trp Asn Lys Ser Asn Lys Gly His
370 375 380
Thr Lys Lys Phe Leu Lys Thr Lys Ile Ala Gly His Asp Thr Leu Met
385 390 395 400
Phe Arg Leu Thr Pro Ala Lys Asn
405
<210> 156
<211> 1227
<212> DNA
<213> Microbulbifer degradans
<400> 156
atgcaaaaaa ttacacgttt agcggcgtta gtcgtcgctg gcttgtgcct gctaggtacg 60
caggcgcagg cgaaaaagtt tgaacactta gcccaaaccc cgcctatggg gtggaatagc 120
tggaacaatt ttggctgtga tgtagatgaa aagctcatca aggaaacggc tgattatatg 180
gtttcttccg gcatgaaaga cgcgggatat gagtatgtaa acatagacga ttgctggcat 240
ggtgagcgcg atgctaatgg ttttattcag gcagaccccg agcggtttcc ttccggtatt 300
aaagcgcttg ccgactatgt tcattcgaag gggctgaagt ttggtattta ttctgacgcg 360
ggttggacca cctgcggcgg taaaccgggc agtcgtgggt acgagtttca agatgcacaa 420
atgtatgcga aatggggcgt ggattatctt aaatacgatt ggtgcgctac agatggctta 480
aaggccgaag gcgcatatca aacaatgcgt gaagccattc ataaagccgg tagaccaatg 540
gtatttagca tttgcgagtg gggcgacaac cagccttggg agtgggccaa acctattggg 600
catctatggc gtaccacggg tgatatctac aactgtttcg attgcgaata tgaccacggt 660
acttggtctt cctggggtgt attgcaaatt ttggatatgc aagacgacct aaggcagtac 720
gctgggccag gccattggaa cgaccccgat atgatggagg tgggcaatgg catgactgaa 780
gctgaagacc gttcgcattt ttcaatgtgg gctatgttgg cagcgcccct tattgctggt 840
aacgatatac gcaatatgtc ggaagctacc agaaaaattc tcaccaataa ggccgtgata 900
gcagtcgatc aggacgagct aggcgtacag ggctttaaat attccagtaa aaatggcgta 960
gaggtttggt ttaaaccgtt ggcgaatgat gagtgggcaa tggctgtact caaccgaaat 1020
aaaggtgaag ttaaattcga gtttaagtgg cgtaatgaag tggttaaaga tgagctgacg 1080
catcgcacta ttacgttcaa cgagcaaaaa tttgactggc aagatttgtg gaataaatcc 1140
aacaagggcc ataccaaaaa attcctaaaa acaaaaatag ctgggcacga tacgctgatg 1200
tttaggctaa cgccggctaa aaactag 1227
<210> 157
<211> 1733
<212> PRT
<213> Microbulbifer degradans
<400> 157
Met Ser Cys Thr Pro Cys Ile Arg Ser Lys Ala Ser Ser Leu Asn Phe
1 5 10 15
Asp Asn Lys Leu Ile Lys Ser Glu Phe Ile Met Ser Asn Gln Lys Arg
20 25 30
Leu Ser Ala Gly Leu Ile Ala Lys Leu Ala Leu Ala Ser Gly Leu Thr
35 40 45
Leu Val Ala Thr Ala Ala Thr His Ala Asp Val Thr Asn Pro Val Ile
50 55 60
Gly Asn Leu Ile Phe Glu Glu Asn Phe Asn Thr Leu Asn Ser Asp Arg
65 70 75 80
Trp Asn Val Ile Glu Gly Asp Gly Cys Gln Tyr Gly Pro Asn Leu Cys
85 90 95
Gly Trp Gly Asn Gln Glu Leu Gln Tyr Tyr Gln Asp Ser Asn Val Thr
100 105 110
Ile Glu Asp Val Pro Gly Glu Pro Gly Asn Lys Ala Val Val Phe Glu
115 120 125
Ala Arg Asn Glu Thr Val Asn Gly Ser Ala Phe Thr Ser Gly Lys Ile
130 135 140
Asp Ser Glu Asp Gly Ile Ala Ile Lys Tyr Gly Met Ile Glu Phe Arg
145 150 155 160
Leu Arg Val Pro Asn Met Gly Val Gly Tyr Trp Pro Ala Val Trp Met
165 170 175
Leu Gly Thr Ser Thr Glu Ser Trp Pro Ser Lys Gly Glu Ile Asp Met
180 185 190
Met Glu Met Gly His Arg Ala Gln Gly Met Ala Asp Ala Gly His Pro
195 200 205
Gly Thr Asn Leu Asn Asn Tyr Thr Ala Ala Asn Ala Ile Phe Tyr Ala
210 215 220
Glu Ala Ala Cys Val Pro Glu Asn Pro Thr Cys Ala Ala Met Thr Ala
225 230 235 240
Trp Gln Thr Asp Asn Ala Tyr Val Ser Gln Thr Ser Met Gly Glu Arg
245 250 255
Phe Val Ile Tyr Arg Thr Tyr Trp Thr Asp Thr Gln Leu Arg Phe Thr
260 265 270
Val Ile Asp Asn Gly Val Glu Tyr Asp Met Tyr Asp Asp Pro Ile Thr
275 280 285
Ile Gly Glu Glu Ala Thr Glu Leu Gln Gln Pro Phe Tyr Leu Leu Ala
290 295 300
Asn Leu Ala Val Gly Gly Asn Phe Thr Asp Ala Ser Thr Pro Ala Glu
305 310 315 320
Val Thr Ala Gln Leu Pro Gly Lys Met Tyr Leu Asp Tyr Ile Arg Val
325 330 335
Tyr Gln Leu Asp Gly Met Gly Glu Ile Phe Glu Gly Ser Ile Ala Gln
340 345 350
Lys Glu Tyr Gly Thr Phe Gly Val Phe Thr Asp Asp Thr Pro Thr Ser
355 360 365
Asn Lys Leu Val Ala Gly Asp Thr Ser Gln Ile Tyr Ile Trp Asn Gln
370 375 380
Asn Ser Leu Ser Glu Gly Thr Leu Ala Pro Ala Glu Gly Asp Asn Val
385 390 395 400
Ile Ala Trp Asn Tyr Thr Ala Pro Glu Trp Phe Gly Ala Gly Ile Gln
405 410 415
Ala Val His Ala Arg Asp Met Ser Asn Phe Glu Asn Gly Glu Phe Lys
420 425 430
Phe Lys Ile Lys Ile Pro Ala Asn Val Ser Phe Lys Val Gly Phe Ala
435 440 445
Asp Thr Tyr Thr Asn Glu Asn Trp Leu Thr Phe Pro Ala Asn Gln Thr
450 455 460
Thr Tyr Gly Leu Val Arg Asn Gly Glu Trp Ala Glu Ala Thr Ile Pro
465 470 475 480
Val Ala Asp Leu Arg Gly Ser Leu Ile Ala Leu Gln Ser Met Ala Gly
485 490 495
Met Phe Tyr Ile Ala Ser Val Asp Gly Gln Ile Pro Thr Ser Asn Phe
500 505 510
Glu Phe Ala Ile Asp Asp Val Arg Trp Glu Gly Gly Gly Ala Gly Pro
515 520 525
Val Asp Ser Asp Gly Asp Gly Val Ala Asp Glu Leu Asp Gln Cys Pro
530 535 540
Asn Thr Pro Ala Gly Thr Ala Val Asp Ser Val Gly Cys Gln Ile Gly
545 550 555 560
Leu Pro Gln Pro Val Ala Val Thr Val Glu Ala Glu Asp Tyr Glu Ala
565 570 575
Tyr Tyr Asp Thr Thr Ser Gly Asn Ser Gly Asn Ala Tyr Arg Ser Asp
580 585 590
Asp Val Asp Ile Glu Ala Ala Ser Glu Gly Gly Phe Asn Val Gly Trp
595 600 605
Thr Asp Ala Gly Glu Trp Met Asp Tyr Thr Leu Asn Leu Ala Ala Gly
610 615 620
Thr Tyr Asp Val Thr Ala Arg Val Ala Ser Asn Thr Asp Thr Gly Val
625 630 635 640
Tyr Ser Val Ser Leu Asp Gly Thr Thr Ile Gly Ser Asn Gly Val Ala
645 650 655
Thr Gly Gly Trp Gln Asn Trp Val Thr Gln Val Val Gly Gln Ile Thr
660 665 670
Val Asn Gly Gly Gln Gln Thr Leu Arg Ile Ser Thr Asp Val Ala Gly
675 680 685
Tyr Asn Ile Asn Trp Val His Phe Glu Pro Val Pro Asp Ala Asp Asn
690 695 700
Asp Gly Val Pro Asp Ser Gln Asp Asn Cys Pro Asn Thr Pro Ala Gly
705 710 715 720
Thr Glu Val Asp Ala Asn Gly Cys Ala Ile Val Val Asp Pro Val Pro
725 730 735
Val His Ile Glu Ala Glu Asp Tyr Ala Ala Tyr His Asp Leu Ser Ala
740 745 750
Gly Asn Asn Gly Gly Gln Tyr Arg Ser Asp Asp Val Asp Ile Glu Ala
755 760 765
Ala Ser Glu Gly Gly Phe Asn Val Gly Trp Thr Asp Thr Gly Glu Trp
770 775 780
Leu Glu Tyr Ser Val Glu Leu Ile Glu Gly Val Tyr Asp Leu Thr Ala
785 790 795 800
Arg Val Ala Ser Leu Ser Gly Asn Gly Ala Tyr Ser Val Ser Ile Ser
805 810 815
Gly Gln Ala Val Gly Gly Ser Asn Ala Val Ala Thr Gly Gly Trp Gln
820 825 830
Asn Trp Glu Thr Gln His Val Ala Arg Phe Val Ala Gly Thr Gly Thr
835 840 845
Tyr Thr Ile Arg Val Asn Ala Asp Ala Gly Gly Phe Asn Leu Asn Trp
850 855 860
Leu His Leu Glu Pro Val Asn Asp Pro Asp Ser Asp Asn Asp Gly Val
865 870 875 880
Pro Asp Ser Gln Asp Asn Cys Ala Asn Thr Pro Ala Gly Thr Glu Val
885 890 895
Asp Ala Ser Gly Cys Pro Val Val Val Gln Pro Phe Gly Val Thr Gln
900 905 910
Ser Asp Ser Asn Ser Ala Gln Phe Tyr Val Asn Gly Ala Asp Trp Ala
915 920 925
Val Leu His Tyr Ser Val Asn Gly Gly Gly Gln Val Asn Val Ser Met
930 935 940
Ser Leu Glu Asn Gly Lys His Val Tyr Thr Val Pro Asp Leu Ala Pro
945 950 955 960
Gly Asp Ser Ile Ser Tyr Phe Val Thr Tyr Trp Asp Pro Glu Leu Gly
965 970 975
Gly Ala Arg Asp Ser Glu Thr Val Ser Tyr Ser Val Val Ala Ala Gly
980 985 990
Ser Asp Ser Asp Gly Asp Gly Val Gly Asp Ser Ala Asp Gln Cys Pro
995 1000 1005
Asn Thr Pro Ala Gly Thr Ala Val Asp Ser Val Gly Cys Pro Val Thr
1010 1015 1020
Gln Pro Pro Ser Asp Ser Asp Asn Asp Gly Val Thr Asp Ala Asn Asp
1025 1030 1035 1040
Gln Cys Pro Asn Thr Pro Ala Gly Thr Ser Val Asp Ser Val Gly Cys
1045 1050 1055
Pro Val Val Gln Pro Pro Ser Asp Ser Asp Asn Asp Gly Val Asp Asp
1060 1065 1070
Ser Ser Asp Gln Cys Pro Asn Thr Pro Ala Gly Thr Ser Val Asn Ala
1075 1080 1085
Val Gly Cys Pro Val Thr Gln Thr Asn Ile Val Pro Leu Tyr Asn Ala
1090 1095 1100
Ser Thr Asn Leu Glu Gly Ala Ile Ser Phe Asp Arg Gly Asp Ala Leu
1105 1110 1115 1120
Val Thr Arg Ile Ser Asp Arg Gly Arg Asp Arg His Ala Lys Glu Asn
1125 1130 1135
His Phe Gln Ala Tyr Asp His Tyr Leu Thr Phe Tyr Trp Glu Asp Arg
1140 1145 1150
Thr Ile Ala Ile Glu Ile Val Asp Tyr Val Ala Lys Gly Gly Ser Ser
1155 1160 1165
Ile Arg Met Asn Ile Val Ser Gln Met Arg Leu Asp Asp Thr Glu Ala
1170 1175 1180
Glu Asn Arg Trp Phe Tyr Ile Gly Asn Asn Thr Leu Ala Glu Tyr Cys
1185 1190 1195 1200
Gly Asn Gly Val Met Asn Glu Val Asp His Thr His Tyr Trp Lys Glu
1205 1210 1215
Ser Ser Phe Asn Cys Arg Glu Gly Arg Pro Ile Gln Ile Gly Asp Lys
1220 1225 1230
Met Glu Phe Glu Ile Ser Gln Phe Leu Asp Pro Ala Leu Leu Pro Arg
1235 1240 1245
Gly Arg Ser Asn Tyr Tyr Gly Thr Thr Tyr Leu Tyr Ile Val Gly Glu
1250 1255 1260
Gly Leu Val Pro Trp Asp Val Thr Asp Lys Val Ala Phe Gln Gly Gly
1265 1270 1275 1280
Asn Arg Leu Gln Arg Asp Ser Ile Pro Val Pro Glu His Ala Arg Leu
1285 1290 1295
Gly Gly Asp Thr Thr Leu His Val Gln Met Thr Ala Glu Pro Asp Gly
1300 1305 1310
His Phe Gln Gln Met Ala Thr Asn Leu Gly Phe Asp Asn Gly Gln Pro
1315 1320 1325
Phe Val Leu Gly Arg Arg Val His His Thr Ser Tyr Val Asp Gly Thr
1330 1335 1340
His Asp Glu Ser Ala Glu Asn Gly Val Phe Asp Gly Met Pro Gly Lys
1345 1350 1355 1360
Ala Gly Pro His Tyr Val Asn Asp Arg Cys Ser Asp Cys His Glu Arg
1365 1370 1375
Asn Gly Arg Ala Pro Val Val Gly Ile Gly Glu Pro Leu Asp Arg Trp
1380 1385 1390
Val Phe Lys Val Gly Asp Ala Asn Gly Asn Pro His Pro Asp Met Gly
1395 1400 1405
Arg Val Leu Gln Pro Glu Ala Asn Asn Gly Ala Ala Ser Glu Gly Thr
1410 1415 1420
Pro Thr Ile Ala Phe Phe Ser Glu Glu Asn Gly Leu Arg Lys Pro Asn
1425 1430 1435 1440
Tyr Ala Phe Ser Gly Ile Thr Pro Asp Thr Phe Ser Ala Arg Ile Ala
1445 1450 1455
Pro Gln Leu Asn Gly Ile Gly Leu Leu Glu Ala Ile Pro Glu Ser Ala
1460 1465 1470
Ile Leu Ala Gln Glu Asp Val Asn Asp Ala Asn Gly Asp Gly Ile Ser
1475 1480 1485
Gly Lys Ala Gln Arg Ser Ile Asp Pro Val Thr Gly Glu Thr Arg Leu
1490 1495 1500
Gly Arg Phe Gly Tyr Lys Ala Ala Thr Ser Ser Val Lys His Gln Val
1505 1510 1515 1520
Ala Ala Ala Phe Asn Thr Asp Ile Gly Val Arg Thr Ser Val Met Pro
1525 1530 1535
Asn Pro Asp Cys Gly Ser Asn Gln Asn Asp Cys Gly Pro Ser Gly Ala
1540 1545 1550
Glu Leu Ala Asp Glu His Leu Asp Asn Leu Val Lys Tyr Val Ser Leu
1555 1560 1565
Leu Gly Val Arg Pro Gln Arg Asp Tyr Asn Asp Ala Gln Val Leu Gln
1570 1575 1580
Gly Lys Gln Val Phe Asn Asp Ala Gly Cys Val Ser Cys His Thr Asp
1585 1590 1595 1600
Thr Tyr Gln Thr Ser Gln Tyr His Pro Leu Ala Glu Leu Arg Ser Gln
1605 1610 1615
Thr Ile His Pro Tyr Thr Asp Leu Leu Leu His Asp Met Gly Pro Gly
1620 1625 1630
Leu Ala Asp Asn Leu Gly Glu Gly Asp Ala Thr Gly Ala Glu Trp Arg
1635 1640 1645
Thr Ala Pro Leu Trp Gly Leu Gly Leu Ser Ala Cys Val Thr Gly Gly
1650 1655 1660
Val Glu Gly Gly Arg Gly Trp Asp Asp Phe Gly Leu Asp Gly Tyr Glu
1665 1670 1675 1680
Thr Cys Thr Pro His His Ser Tyr Leu His Asp Gly Arg Ala Arg Thr
1685 1690 1695
Ile Glu Glu Ala Ile Leu Trp His Gly Gly Glu Gly Glu Asn Ser Lys
1700 1705 1710
Gln Ala Tyr Gln Asn Leu Ser Asn Ser Glu Arg Asp Ala Leu Leu Ala
1715 1720 1725
Phe Leu Asn Ser Leu
1730
<210> 158
<211> 5202
<212> DNA
<213> Microbulbifer degradans
<400> 158
gtgagttgta caccctgtat tagatcaaaa gcttcatcac ttaacttcga taataaactt 60
ataaaaagtg agttcatcat gagtaaccaa aagcgattgt ccgctgggtt aattgcaaag 120
ttagcgttgg cttcaggcct tacgctagtc gccactgcgg ccacccacgc cgacgttacc 180
aatcctgtta ttggtaattt aatctttgaa gagaatttca ataccctcaa cagcgatcgt 240
tggaatgtga ttgagggtga tggctgccaa tacggcccaa acctttgcgg ttggggcaac 300
caagaattgc agtattacca agatagcaat gtaaccatcg aagatgtgcc cggagaaccg 360
ggcaacaaag cggttgtttt tgaagcgcgc aacgaaactg taaatggcag tgcgtttaca 420
tccggcaaaa ttgattcgga agacggaata gccattaagt acggcatgat tgagttccgt 480
ttgcgagtac caaatatggg cgttggctat tggcctgcgg tttggatgct tggcacaagc 540
acagaaagct ggccgagcaa aggcgaaata gacatgatgg aaatgggcca tcgtgcccaa 600
ggcatggccg atgcgggcca cccaggcaca aacctaaaca actacaccgc ggccaatgca 660
attttttatg cagaagcagc ctgtgtaccc gaaaacccca cctgtgcagc catgaccgca 720
tggcaaacag ataacgccta tgtttctcaa acgtctatgg gcgagcgatt tgttatttac 780
cgcacttatt ggaccgacac ccagttgcgt tttaccgtaa ttgataatgg cgttgagtac 840
gatatgtacg atgatccaat tactatcggc gaagaggcta ctgagctgca acagccgttt 900
tatttgttgg caaacttagc agttggcggc aattttaccg atgcgtctac acctgccgaa 960
gttaccgcac agctgcccgg taaaatgtat ttagactata ttcgtgtgta ccaattagac 1020
ggtatgggcg aaatattcga aggctcaatt gcccaaaaag aatacggcac atttggtgtg 1080
tttaccgatg acacgcccac cagcaataag ctggtagccg gtgatacgtc acaaatttat 1140
atttggaatc aaaactccct aagtgaagga acgcttgcgc ccgccgaagg cgacaatgta 1200
atagcctgga actacaccgc gccagagtgg ttcggtgctg gcattcaagc tgtgcacgcg 1260
cgcgatatga gtaattttga gaatggcgag tttaaattca aaataaaaat acccgctaac 1320
gtctcattta aagttggttt tgcagatact tacaccaacg aaaactggct aactttccct 1380
gctaatcaaa caacctacgg tttggtgcgc aatggcgaat gggctgaggc gacaattcca 1440
gtagcggact tgcgcggcag tttaattgca ttgcagtcta tggctggtat gttttatatc 1500
gccagtgttg acggccaaat tccaacgtct aattttgaat ttgccatcga cgatgtgcgc 1560
tgggaaggcg gtggtgcagg gcctgtagac agcgacggcg acggcgtagc tgatgaactt 1620
gatcagtgcc ccaacacacc agccggtact gcagtagata gcgttggctg ccaaataggt 1680
ttgccacagc ctgtggctgt aacggtagaa gcggaagact acgaagctta ctacgataca 1740
accagtggca actctggcaa tgcgtatcgc agcgatgatg tagatataga agctgcaagc 1800
gaaggtggtt ttaacgtcgg ttggactgat gctggcgagt ggatggatta cactttaaac 1860
ctcgctgcgg gcacctacga cgttacagct cgtgttgctt caaataccga tactggtgtg 1920
tacagtgtta gtttagatgg caccacaatc ggctctaacg gagttgcaac tggcggttgg 1980
caaaactggg ttactcaagt tgtcggccaa attacggtaa atggcggtca acaaactctg 2040
cgcattagta ctgatgtggc tggatataac attaactggg tgcattttga gcccgtgcca 2100
gacgcagata acgatggcgt acctgacagc caagataact gccccaatac accagcgggt 2160
acagaggtgg atgctaatgg ctgtgcgatt gtcgtagacc cagtacccgt gcacatagaa 2220
gccgaagatt acgcggccta tcacgaccta tctgcaggta acaatggcgg tcaataccgc 2280
tctgatgatg tggatattga agccgctagc gaaggtggat tcaacgtagg ttggacagat 2340
actggcgagt ggttggaata ctcggttgaa ttaattgaag gtgtttatga cctaactgca 2400
cgcgtggcat cgttaagcgg caatggcgct tacagcgttt ctatttctgg tcaagctgtg 2460
ggtgggtcta atgctgtagc gactggtggt tggcaaaatt gggaaaccca gcacgtagcg 2520
cgttttgttg caggcacggg tacctacact attcgtgtaa atgcagatgc cggcggcttt 2580
aaccttaatt ggttgcactt agagccagta aacgaccccg atagcgataa cgacggtgtg 2640
ccagacagcc aagacaactg cgccaatacc cctgccggca ccgaagtgga cgccagcggt 2700
tgcccggtag ttgtacagcc atttggtgtg actcagtcag attccaacag cgcgcaattt 2760
tatgtaaatg gcgccgactg ggcagtgttg cattacagtg ttaatggcgg tgggcaagtg 2820
aatgtgtcca tgagtttgga aaacggtaag catgtatata ccgtgccaga cttggcccca 2880
ggtgactcca ttagttactt tgttacctat tgggacccag aactaggcgg tgcacgcgac 2940
agtgaaacgg taagctacag cgtagttgct gcgggtagcg atagtgatgg cgacggcgta 3000
ggtgatagtg ctgaccaatg cccaaataca cccgccggca cagctgtcga ttctgttggt 3060
tgcccagtaa cacagccacc atccgatagt gacaacgatg gcgttacgga tgctaatgat 3120
cagtgcccaa atacacctgc aggtacatcg gttgattccg ttggctgccc agtggttcag 3180
ccaccatcag atagcgataa cgacggtgtg gacgattcaa gtgatcaatg tccaaatacc 3240
cctgcgggta ccagtgtaaa tgcagtgggt tgccctgtaa cgcaaactaa tattgtgcct 3300
ttatataatg ccagcacaaa cttagaaggc gctatttctt ttgatcgcgg tgatgcgctt 3360
gttacgcgaa tttcagatcg cggtcgcgac cgtcacgcaa aagaaaacca cttccaagct 3420
tacgatcact accttacttt ctattgggaa gatcgcacta ttgctattga aatagtggat 3480
tacgttgcga aaggtggcag cagcattcgc atgaatatcg taagtcaaat gcgtttggat 3540
gacaccgaag cagaaaaccg ctggttttat attggtaaca acacgctcgc tgaatactgt 3600
ggcaatggcg tgatgaacga agtggatcac acgcactact ggaaggaatc tagctttaac 3660
tgtcgtgagg gtcgtcccat tcaaattggc gataaaatgg agtttgaaat cagccaattt 3720
ttagaccccg cgctattacc tcgcggtcgc tctaattatt acggcaccac ttacctttac 3780
attgtgggtg agggcttagt gccttgggat gttaccgata aagttgcttt ccaaggcggc 3840
aaccgtttgc agcgagattc cattcctgtg cccgagcacg cgcgtttagg tggcgataca 3900
actttgcatg tgcaaatgac tgccgagcca gatggccact tccagcaaat ggcaaccaac 3960
ctgggttttg ataatggcca gccgtttgta ctcggtcgcc gtgtgcacca cacatcttac 4020
gtagatggta cgcacgatga gagcgcagaa aacggcgtat ttgatggtat gccaggtaaa 4080
gcaggcccgc actatgtgaa tgaccgctgc tcggattgtc acgagcgcaa tggtcgtgca 4140
ccggttgtgg gtattggtga accgttagac cgttgggtgt ttaaagtggg cgacgcaaat 4200
ggtaacccgc atccagatat ggggcgcgta ttgcagccag aagcgaacaa cggtgccgct 4260
agcgagggca cgccaaccat cgcattcttt agcgaagaaa acggtttgcg caaaccaaac 4320
tatgcattta gcggcataac gccagacact ttctctgcgc gtattgcccc acagctaaat 4380
ggcattggtt tgctcgaagc gattccagaa agtgcaattt tagcgcaaga agatgtgaac 4440
gatgcgaacg gtgatggtat ctccggtaaa gcgcagcgct ctatcgaccc tgtaaccggt 4500
gaaacccgct taggtcgctt tggctacaaa gcagcaacca gcagtgtaaa gcatcaagtg 4560
gctgcggcgt ttaatacgga tataggtgta cgtacctctg taatgcccaa cccagattgc 4620
ggctcgaatc aaaacgattg tggcccaagt ggtgcggaac tagcagatga gcatttagat 4680
aaccttgtta agtatgtttc tttactgggt gttcgcccac agcgagacta caacgatgcg 4740
caagtactac agggtaagca agtgtttaac gatgctggct gtgtgagttg tcataccgac 4800
acttaccaaa catcgcaata tcacccgctt gcagagttac gcagccaaac tattcacccc 4860
tacacagatc tcttgctgca cgatatgggc ccaggtttgg ccgataactt aggtgaaggc 4920
gatgcaacgg gtgccgaatg gcgtaccgcg ccgctttggg gcttaggttt gtctgcttgt 4980
gttacgggtg gtgtagaagg tggtcgcggc tgggatgatt ttggcttaga tggctacgaa 5040
acctgtactc cacatcacag ctacttacac gatggtcgcg cacgcaccat tgaagaagct 5100
attttgtggc acggtggcga aggtgaaaac tcgaagcaag cgtaccaaaa cttaagcaat 5160
agcgagcgcg atgctctact cgcattctta aattccctct aa 5202
<210> 159
<211> 1441
<212> PRT
<213> Microbulbifer degradans
<400> 159
Met Thr Thr Arg Asn Thr Phe Lys Leu Ser Lys Leu Leu Leu Ser Leu
1 5 10 15
Gly Val Leu Thr Ala Ser Ser Gln Leu Val Ala Gln Asp Leu Ile Trp
20 25 30
Ser Asp Glu Phe Glu Gly Asp Lys Ile Asp Arg Ser Ile Trp Ser Tyr
35 40 45
Asn Val Gly Gly Ser Gly Asn Gly Asn Gly Glu Leu Gln Tyr Tyr Thr
50 55 60
Ala Asn Ala Thr Asn Ser Arg Ile Glu Asp Gly Asn Leu Val Ile Glu
65 70 75 80
Ala Arg Arg Glu Glu Met Glu Gly Lys Gln Phe Thr Ser Ala Arg Leu
85 90 95
His Thr Asn Gly Arg Phe Ser Phe Lys Tyr Gly Thr Leu Glu Ala Arg
100 105 110
Ile Lys Leu Pro Lys Leu Asp Asp Gly Leu Trp Pro Ala Phe Trp Thr
115 120 125
Leu Gly Asp Asn Phe Gly Val Asp Gly Trp Pro Lys Ser Gly Glu Ile
130 135 140
Asp Ile Leu Glu Ala Gly Tyr Lys Ala Ala Arg Glu Ala Gly Thr Thr
145 150 155 160
Asn Thr Ala Val Ser Gly Ala Val His Trp Trp His Glu Ser Gly Asp
165 170 175
Trp Ser Asp Trp Leu Gln Ala Asp Ala His Ala Glu Thr His Val Thr
180 185 190
Thr Pro Met Asn Glu Ala Tyr His Thr Tyr Arg Leu Glu Trp Thr Pro
195 200 205
Ser Glu Leu Ser Ile Ser Val Asn Asp Asn Thr Tyr Phe Thr Met Asp
210 215 220
Ile Thr Asp Pro Asn Met Ser Glu Phe His Gly Ala Gln His Leu Leu
225 230 235 240
Leu Asn Leu Ala Val Gly Gly Trp Asn Phe Val Glu Ile Glu Asp Pro
245 250 255
Ala Leu Ile Thr Ala Asp Phe Pro Ala Gln Met Leu Val Asp Tyr Val
260 265 270
Arg Leu Tyr Ser Asn Glu Phe Thr Glu Val Phe Asp Ala Asn Glu Asn
275 280 285
Leu Pro Thr Gly Asp Phe Gly Val Met Thr Asp Leu Thr Pro Val Phe
290 295 300
Asn Glu Leu Asn Trp Gly Asp Arg Met His Leu Tyr Ile Trp Asn Asn
305 310 315 320
Met Glu Val Ala Gly Ile Asp Pro Tyr Glu Gly Thr Ser Val Leu Ala
325 330 335
Tyr Asp Val Ala Pro Asp Ala Trp Trp Gly Met Gly Leu Leu His Lys
340 345 350
Asp Tyr Asn Met Arg Asn Tyr Lys His Gly Tyr Leu His Phe Met Met
355 360 365
Lys Thr Thr Ala Thr Ser Asp Ile Ser Ile Asn Met Ala Ser Thr Ser
370 375 380
Gly Gly Glu Gly Ala Val Val Leu Ala Asn Gly Gly Glu Glu Tyr Gly
385 390 395 400
Leu Glu Arg Asp Gly Glu Trp His Glu Val Asn Ile Pro Leu Ala Lys
405 410 415
Phe Gly Ala Met Asp Phe Glu Thr Ile Lys Thr Phe Phe Ser Met Ser
420 425 430
Gly Pro Gly Gln Ala Glu Ala Phe Gln Ile Ala Val Asp Asp Ile Tyr
435 440 445
Leu Lys Ser Ser Ile Ala Leu Pro Arg Pro Glu Phe Gly Ser Phe Gly
450 455 460
Ile Tyr Thr Glu Ser Pro Glu Asn Met Ser Ala Gly Asn Phe Gly Phe
465 470 475 480
Gly Val Glu Gly Asp Leu Phe Leu Trp Ala Asp Thr Leu Glu Leu Leu
485 490 495
Pro Gly Glu Val Val Glu Gly Asn Ala Ser Leu His Leu Lys Ser Thr
500 505 510
Gly Gln Gly Trp Phe Gly Met Gly Leu Thr Ala Arg Glu Gly Phe Asn
515 520 525
Leu Ser Ala Phe Asp Asn Ala Asp Ala Lys Leu His Leu Ser Met Lys
530 535 540
Thr Thr Asp Gln Thr Asp Phe Gln Val Gly Ile Lys Ser Gly Ser Val
545 550 555 560
Asn Asp Ile Gly Gln Val Trp Ile Lys Phe Thr Pro Gly Asn Asp Pro
565 570 575
Tyr Gly Phe Ala Arg Asp Gly Gln Trp His Asp Leu Val Ile Pro Met
580 585 590
Ser Asp Ile Ala Gln Asp Leu Asp Met Phe Asp Val Arg Gln Val Phe
595 600 605
Gln Leu Leu Gly Phe Gly Glu Ile Glu Ser Leu Ala Ile Asp Asn Ile
610 615 620
Tyr Ile Ser Gly Gly Gly Ala Ser Thr Pro Glu Ile Val Asn Pro Ser
625 630 635 640
Glu Pro Val Asn Arg Ala Pro Met Ala Ala Ile Lys Pro Ser Val Asn
645 650 655
Gly Gly Pro Ala Thr Leu Ser Val Asp Phe Asp Ala Ser Gln Ser Gly
660 665 670
Asp Val Asn Gly Asp Ala Leu Thr Tyr Thr Trp Asp Phe Gly Asp Gly
675 680 685
Thr Thr Ala Thr Gly Val Gln Val Ser His Asp Phe Glu Lys Glu Gly
690 695 700
Asn Tyr Arg Val Ser Leu Ile Val Ser Asp Gly Gln Ala Thr Asp Glu
705 710 715 720
Thr Ala Ala Ile Ile Thr Val Asp Asp Asn Tyr Gly Leu Ser Arg Ser
725 730 735
Asp Lys Arg Gly Leu Gly Phe Gly His His Ser Val Ala Asp Phe Glu
740 745 750
Ala Ile Ser Gln Gly Ile Ser Trp Trp Tyr Asn Trp Ser Ile Lys Pro
755 760 765
Asp Ser Leu Ile Gln Asp Val Tyr Gln Asn Tyr Gly Val Glu Phe Val
770 775 780
Pro Met Ala Trp Asn Gly Gly Phe Asp Asp Gln Ala Met Arg Asp Tyr
785 790 795 800
Ile Asn Ala His Arg Asp Asp Val Lys Tyr Ile Leu Ala Phe Asn Glu
805 810 815
Pro Asn Phe Leu Glu Gln Ala Asn Met Thr Pro Ser Gln Ala Ala Ala
820 825 830
Glu Trp Pro Arg Leu Glu Ala Ile Ala Asp Glu Phe Gly Leu Lys Ile
835 840 845
Val Ser Val Ala Met Asn Phe Cys Gly Asn Cys Val Thr Glu Asn Gly
850 855 860
Thr Thr Tyr Tyr Asp Pro Ile Asp Tyr Phe Asp Asp Phe Phe Glu Val
865 870 875 880
Cys Pro Asp Cys Arg Val Asp Ala Leu Ser Ile His Ala Tyr Met Gly
885 890 895
Gly Val Gly Gly Ile Glu Trp Tyr Ile Asp Leu Phe Ala Lys Tyr Asn
900 905 910
Leu Pro Ile Trp Met Thr Glu Phe Ser Ala Trp Asp Asp Thr Thr Thr
915 920 925
Glu Glu Glu Gln Ile Phe Phe Met Thr Gln Ala Val Asp Tyr Met Glu
930 935 940
Gln His Glu Asn Val Glu Arg Tyr Ala Trp Phe Thr Gly Arg Arg Asn
945 950 955 960
Gly His Pro Tyr Asn Gly Leu Phe Asp Tyr Arg Gln Ser Gly Val Leu
965 970 975
Thr Glu Leu Gly Ser Ala Tyr Ile Asn Met Pro Val His Gly Glu Asp
980 985 990
Ser Ile His Thr Leu Pro Lys His Ile Glu Ala Glu Ala Tyr Ala Tyr
995 1000 1005
Gln Ser Gly Gly Arg Val Ala Pro Thr Thr Asp Ala Asn Gly Phe Leu
1010 1015 1020
Gln Met Gly Glu Asn Ala Ala Gly Ser Trp Val Glu Tyr Asn Val Ile
1025 1030 1035 1040
Asn Pro Ala Thr Arg Ser Tyr Asn Val Ala Val Arg Val Asn Ser Glu
1045 1050 1055
Thr Gly Gly Thr Ile Thr Val Leu Val Asp Gly Val Glu Lys Gly Gln
1060 1065 1070
Ile Pro Val Pro Ala Ala Ala Asp Ala Gln Ala Trp Ala Thr Val Asp
1075 1080 1085
Thr Asp Leu Glu Ile Thr Ala Gly Glu His Asp Val Arg Leu Val Phe
1090 1095 1100
Gly Ala Ser Val Asn Leu Asn Trp Leu Asn Ile Gly Asp Ala Leu Ser
1105 1110 1115 1120
Pro Glu Leu Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu
1125 1130 1135
Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu
1140 1145 1150
Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu
1155 1160 1165
Pro Glu Pro Glu Pro Glu Pro Glu Val Asp Pro Thr Ala Pro Leu Ala
1170 1175 1180
Cys Thr Ala Thr Ala Ser Ser Gln Glu Gly Ala Leu Ala Pro Ser Phe
1185 1190 1195 1200
Val Cys Asp Gly Asp Ala Gly Thr Arg Trp Ser Ser Glu Trp Asn Asp
1205 1210 1215
Asn Glu Thr Ile Thr Leu Asp Leu Gly Asp Val Tyr Glu Ile Ser Ser
1220 1225 1230
Ile Asn Leu Thr Trp Glu Asn Ala Tyr Gly Ser Lys Tyr Asp Ile Leu
1235 1240 1245
Val Ser Asp Asp Asn Val His Trp Thr Leu Ala Tyr Ser Glu Gln Ala
1250 1255 1260
Gly Asp Gly Gly Ile Asp Asn Leu Ala Val Thr Ala Thr Gly Gln Tyr
1265 1270 1275 1280
Val Arg Leu Leu Gly Leu Glu Arg Gly Ile Gly Tyr Gly Phe Ser Leu
1285 1290 1295
Phe Ser Phe Asp Val Tyr Gly Ala Leu Lys Pro Val Val Val Val Pro
1300 1305 1310
Glu Gly Asp Leu Ala Leu Gly Lys Pro Thr Ser Ala Ser Ser Ser Glu
1315 1320 1325
Trp Ala Glu Pro Thr Ala Ala Asn Ala Thr Asp Gly Asn Thr Gly Thr
1330 1335 1340
Arg Trp Ala Ser Ala Trp Thr Asp Ser Glu Trp Ile Ser Val Asp Met
1345 1350 1355 1360
Gly Glu Val Tyr Pro Val Gly Arg Val Val Leu Asn Trp Glu Gly Ala
1365 1370 1375
Tyr Gly Lys Gly Tyr Asn Ile Gln Thr Ser Asn Asn Gly Ala Asp Trp
1380 1385 1390
Thr Thr Val Tyr Thr Glu Thr Ala Gly Asp Gly Gly Thr Asp Glu Ile
1395 1400 1405
Ile Leu Pro Phe Ala Ser Gly Arg Tyr Ile Arg Val Leu Gly Thr Glu
1410 1415 1420
Arg Gly Thr Gly Tyr Gly Tyr Ser Leu Trp Asp Phe Glu Ile Tyr Ala
1425 1430 1435 1440
Asp
<210> 160
<211> 4326
<212> DNA
<213> Microbulbifer degradans
<400> 160
atgacaacgc gaaatacttt caagctgagt aagctcctgc tttcgctagg tgtacttacc 60
gcttcttcgc aactcgtcgc ccaagaccta atttggtccg acgaattcga aggcgataaa 120
attgaccgca gcatatggtc ttacaatgtg ggtggtagtg gtaacggcaa tggtgagttg 180
cagtactaca ccgccaatgc caccaattca cgcattgaag acggcaacct cgttattgag 240
gcgcgccgcg aagagatgga aggtaagcag tttacttctg cacgtctcca tacaaatggt 300
cgcttttctt ttaaatacgg taccttagag gcaagaatta aattgcctaa gttagacgat 360
ggcctgtggc cagcgttttg gactttgggt gataactttg gtgtagacgg ctggccaaaa 420
tctggtgaga tcgatatttt agaagccggc tacaaggctg cgcgcgaagc gggcaccacc 480
aatactgcgg tgtctggtgc tgtgcactgg tggcacgaaa gtggcgattg gagtgattgg 540
ttgcaggccg atgctcatgc cgaaacgcat gttacaacgc ctatgaacga ggcgtatcac 600
acctatcgtt tagagtggac tcctagcgag ctatctattt ctgtaaacga caatacctat 660
ttcaccatgg atattaccga cccaaacatg tcggaatttc acggtgctca gcatttgctg 720
cttaacttgg ctgtaggtgg ttggaacttt gtagaaattg aagacccagc gcttattact 780
gcagatttcc ctgcacaaat gcttgtggac tacgtgcgct tatacagcaa cgaattcacc 840
gaagtgtttg acgcaaacga aaacttacca accggtgact tcggcgtaat gaccgatttg 900
acccccgtgt tcaatgaatt gaactggggt gatcgtatgc acctgtacat ttggaacaat 960
atggaagtag cgggcataga cccgtacgaa ggcacttctg tattggctta tgatgtggcg 1020
ccagatgcgt ggtggggcat gggtcttttg cataaagact acaacatgcg caactacaaa 1080
cacggctacc tgcacttcat gatgaagaca accgctacct ccgatatcag cattaacatg 1140
gccagtacct ctggcggtga aggtgctgtt gtattggcaa acggcggtga agagtacggt 1200
ttagagcgcg acggcgaatg gcatgaggtg aatattcctc tggctaaatt cggcgcgatg 1260
gatttcgaaa ccatcaaaac tttcttcagc atgtctggcc caggccaagc agaagcattc 1320
caaattgctg tggacgacat ttatctaaaa agcagtattg ctttacctcg cccagagttt 1380
ggcagcttcg gtatttacac cgaaagccca gaaaatatga gcgcaggtaa ctttggtttt 1440
ggtgtagaag gtgatttatt tctttgggct gataccttag agctattgcc aggcgaagtg 1500
gttgaaggca atgcatcact gcatttgaaa tcaaccggtc aaggttggtt cggtatgggc 1560
ttaacggctc gcgaaggctt taacctttct gcgtttgata acgccgatgc gaaattacac 1620
ctctcaatga aaaccacaga tcaaactgat ttccaagtgg gcattaagag cggcagcgta 1680
aacgacatcg gtcaagtgtg gattaagttt acccctggta acgacccata cggtttcgca 1740
cgcgacggtc agtggcacga cttagttata ccgatgtctg atatcgcaca agatttagat 1800
atgttcgatg tgcgtcaagt atttcaattg ttaggttttg gcgaaataga aagcctcgca 1860
attgacaata tttatattag cggtggtggt gcatctacac cggaaattgt taacccaagt 1920
gaacctgtaa accgcgcacc gatggctgca attaagcctt ctgtaaatgg cggccctgca 1980
acattgagtg ttgatttcga cgcgagccaa tctggcgacg taaacggcga tgcgcttacc 2040
tatacctggg actttggtga cggtactact gcaaccggcg tacaagtaag tcacgacttt 2100
gaaaaagaag gtaactaccg cgttagctta attgtaagcg acggccaagc gacagacgaa 2160
actgcagcca ttattactgt tgacgataac tacggtttat cgcgcagcga taagcgcgga 2220
cttggttttg gtcaccactc agttgccgac ttcgaagcta tttcgcaggg tatttcttgg 2280
tggtacaact ggtctattaa gccagattca ctgattcaag atgtttacca aaactacggc 2340
gtagagtttg tacctatggc gtggaacggt ggttttgacg atcaagcaat gcgcgattac 2400
atcaatgcgc accgcgacga cgttaaatac attcttgcgt tcaacgagcc taacttctta 2460
gagcaagcca acatgacgcc ttcgcaggcg gcagccgagt ggccacgttt agaagccata 2520
gccgatgagt ttggtttgaa aattgtttcc gttgctatga acttctgtgg caactgtgta 2580
acagaaaatg gcactaccta ttacgacccc attgattact ttgacgattt cttcgaagtg 2640
tgccctgact gccgtgtaga tgcgctttct atccatgctt acatgggtgg tgttggcggc 2700
atcgagtggt atatcgatct gtttgcaaaa tacaaccttc caatttggat gaccgagttt 2760
tctgcatggg acgataccac cacagaagaa gagcaaatct tctttatgac ccaagcagta 2820
gactacatgg agcaacacga aaatgtagaa cgctatgcgt ggttcactgg tcgtcgcaat 2880
ggtcacccgt acaacggttt gtttgattac cgccagtcgg gtgtacttac cgagctaggc 2940
agcgcgtaca tcaacatgcc tgtgcacggt gaagacagca ttcacacttt accaaagcat 3000
attgaagcag aagcttacgc ttaccaaagc ggtggacgtg tagcacctac tactgatgca 3060
aatggcttct tgcaaatggg cgagaacgct gctggctctt gggttgaata caacgtaata 3120
aacccagcga ctcgcagtta caacgtagcg gtacgtgtaa acagcgaaac cggcggcacc 3180
attaccgtat tggtagacgg tgttgagaaa ggccaaattc ctgtacctgc agcagctgat 3240
gcgcaagctt gggcaactgt ggataccgat ttggaaatta ctgcaggtga gcacgatgtg 3300
cgcttggtat ttggtgcaag tgttaacctt aactggttaa acattggcga tgcgttaagc 3360
ccagaactag aaccagaacc agaaccagaa ccagaacctg aaccagagcc tgaaccagaa 3420
cctgaaccag agccagaacc agaaccagaa ccagaaccag aaccagagcc tgagccagag 3480
ccagagccag agccagagcc agagcctgag cctgagccag agccagaagt agacccaact 3540
gcaccattag cgtgtaccgc tacagcctca tcgcaagaag gcgcgttagc accaagcttt 3600
gtgtgtgatg gtgatgccgg cacccgttgg tcatccgagt ggaacgataa cgaaaccatt 3660
acgttagacc taggtgatgt ttatgaaata tctagcatta acctaacgtg ggaaaatgcc 3720
tacggaagca agtacgacat attggtttcg gacgataacg tacactggac gcttgcctac 3780
agcgaacaag ctggcgatgg cggtatagat aacttagcag taacggctac tggccaatac 3840
gtgcgcctgc ttggtttaga gcgcggcatt ggttacggct tctctctgtt tagtttcgat 3900
gtttacggtg cactaaaacc tgtagtggtt gtaccagaag gtgatttggc gttaggtaag 3960
cctacgagtg cttcttcatc ggaatgggca gagccaaccg ccgctaatgc aaccgatggc 4020
aacaccggca cgcgttgggc cagtgcttgg acagacagtg agtggatttc tgtagatatg 4080
ggtgaagtgt atcctgttgg tcgagtggta cttaactggg aaggtgcata cggtaaaggc 4140
tataacattc aaacctcaaa caatggggct gattggacta ccgtttacac cgaaaccgca 4200
ggcgatggcg gtacagatga aattattctg ccattcgcgt ctggccgcta tattcgtgta 4260
ctaggtaccg agcgtggcac aggttacggt tactcactgt gggattttga aatctacgca 4320
gattaa 4326
<210> 161
<211> 1207
<212> PRT
<213> Microbulbifer degradans
<400> 161
Met Gln Pro Ser Ala Tyr Gly Leu Tyr Cys Ala Ala Cys Tyr Ser Gln
1 5 10 15
Lys Ser Lys Cys Glu Phe Ile Met Ile Lys Asn Asn Leu Ile Lys Tyr
20 25 30
Ala Val Arg Cys Ala Ala Leu Gly Ala Leu Ser Thr Ser Ala Val His
35 40 45
Val Ser Ala Thr Asp Tyr Asn Asp Pro Tyr Trp Thr Asn Pro Glu Val
50 55 60
Asn Ser Asp Phe Val Asp Asn Phe Asn Gln Thr Gln Ile Asp Arg Ser
65 70 75 80
Lys Trp Leu Val Glu Thr Asn Ile Phe Val Asn Gly Glu Asp Ile Asp
85 90 95
Tyr Gln Asp Val Glu Tyr Pro Gln Ala Asp Trp Thr Ile Gly Val Gly
100 105 110
Gln Asp Asp Pro Ala Ala Ile Asp Gly Lys Ala Leu Ile Leu Lys Ala
115 120 125
Arg Tyr Met Asp Gly Glu Val Gln Asp Tyr Tyr Gly Gly Asp Pro Ala
130 135 140
Asn Pro Gly Lys Pro Leu Phe Ile Arg Ser Gly Arg Ile Glu Ser Gln
145 150 155 160
Ile Thr Asp Asp Thr Thr Phe Thr Tyr Gly Lys Phe Glu Ala Arg Leu
165 170 175
Lys Met Pro Pro Ala Arg Asn Gly Glu Phe Pro Ala Trp Trp Leu Leu
180 185 190
Gly Asn Phe Pro Asp Val Gly Trp Thr Ala Cys Gln Glu Leu Asp Ile
195 200 205
Met Glu Phe Thr Gly Asn Asn Gly Met Asn Ile Pro Gln Thr Tyr Trp
210 215 220
Thr Ala Pro Tyr Ala Val His Gly Gly Thr Thr Val Gly Leu Ala Gly
225 230 235 240
Leu Gly Ile Thr Asn Pro Gln Glu Gln Tyr Val Thr Tyr Gly Ile Ile
245 250 255
Lys Thr Pro Ser Lys Val Glu Trp Tyr Ile Asn Gly Val Leu Thr Asn
260 265 270
Thr Phe Ser Arg Asp Asn Gln Gly Asp Asp Gln Pro Trp Pro Tyr Val
275 280 285
Thr Pro Met Arg Met Ile Leu Asn His Ala Ile Thr His Val Glu Trp
290 295 300
Pro Asp Val Gly Asn Tyr Asn Ala Tyr Ser Pro Asp Ala Asn Arg Pro
305 310 315 320
Ser Ala Thr Gly Trp Ser Tyr Val Asp Asn Asn Gly Val Thr Gln Tyr
325 330 335
Glu Tyr Ile Asp Val Asp Ala Met Asn Ala Asn Ile Gly Arg Ala Gly
340 345 350
Thr Asp Phe Val Val Asp Tyr Val Ala His Trp Pro Leu Pro Thr Ser
355 360 365
Asp Ala Glu His Lys Tyr Val Asp Asp Ser Lys Ser Ser Phe Phe Arg
370 375 380
Glu Ser Gly Asn Thr Lys Gly Phe Tyr Asn Leu Lys Gly Trp Leu Ala
385 390 395 400
Pro Val Ser Val Thr Ala Asp Gly Phe Asp Gln Pro Gly Trp Asp Thr
405 410 415
Asp Val Arg Asp Asn Gly Ala Asp Asn Ala Ala Asp Gly Tyr Val Gly
420 425 430
Ser Lys Trp Ala Thr Pro Asn Asp Asp Gly Ile His Trp Val Glu Val
435 440 445
Asp Tyr Gly Gln Asp Lys Gln Ile Asn Tyr Leu Trp Leu Glu Trp Ala
450 455 460
Trp Asn Leu Pro Ala Asp Tyr Asp Ile Tyr Gly Lys Ser Ser Thr Gly
465 470 475 480
Ala Trp Glu Phe Ile Thr Ser Ser Gln Gln Glu Val Ala Thr Trp Ala
485 490 495
Thr His Val Phe Asp Val Asn Arg Thr Tyr Arg Tyr Leu Lys Leu Val
500 505 510
Thr Lys Gly Arg Ile Asp Lys Ser Ser Pro Ile Trp Leu Leu Glu Leu
515 520 525
Met Ala Phe Glu Asp Val Pro Asn Met Tyr Pro Lys Pro Ala Gly Thr
530 535 540
Thr Met Pro Asn Arg Ser Val Asn Val Leu Asn Asn Gly Asn Phe Ser
545 550 555 560
Gln Gly Leu Asp Ser Trp Gly Thr Glu Ala Phe Asp Gly Ala Asn Pro
565 570 575
Thr Tyr Asn Val Gln Asn Gly Ala Ala Val Ile Ala Leu Thr Asn Asp
580 585 590
Gly Gly Leu Ser Gly Ser Val Gln Leu His Thr Ser Gly Phe Gly Leu
595 600 605
Lys Arg Asp Tyr Arg Tyr His Ile Ser Phe Asp Ala Arg Ala Asp Val
610 615 620
Ala Arg Asn Leu Met Val Arg Leu Ala Glu Asn Asn Leu Asn Pro Ser
625 630 635 640
Ala Ala Gly Thr Tyr His Val Glu Thr Val Ala Val Gly Thr Ser Phe
645 650 655
Asn Thr Tyr Ser Phe Thr Tyr Asp Tyr Thr Gly Gln Gly Glu Pro Ala
660 665 670
Arg Leu Ala Phe Leu Leu Gly Gly Met Gly Thr Ala Thr Thr Tyr Ile
675 680 685
Asp Asn Val Val Ile Arg Glu Gly Asp Phe Ile Gly Ser Gly Glu Pro
690 695 700
Leu Val Ala Ala Ile Ser Ser Thr Asn Phe Val Ser Ala Ser Asn Gly
705 710 715 720
Trp Glu Thr Glu Trp Trp Gly Ala Pro Ala Arg Ala Val Asp Gly Asn
725 730 735
Leu Gly Asn Lys Ala Ser Gly Asn Asp Gly Glu Ala Glu Gly Met Asp
740 745 750
Leu Asp Ile Thr Val Arg Ile Asp Glu His Tyr Asp Val Arg Ala Val
755 760 765
Leu Val Ala Gly Asp Asn Ser Pro Glu Arg Ser Leu Asp Gln Phe Arg
770 775 780
Val Glu Phe Gly Asn Gly Thr Pro Leu Met Gly Trp Thr Asp Ser Thr
785 790 795 800
Thr Glu Gly Val Tyr Glu Glu Phe Ala Asn Phe Asn Asn Thr Pro Ala
805 810 815
Ala Gly Glu Arg Asp Phe Lys Phe Phe Phe Arg Pro Pro Ala Gly Gln
820 825 830
Leu Val Glu Val Ala Asp Val Gln Leu Leu Ala Val Asp Leu Met Pro
835 840 845
His Arg Ile Ser Ala Ile Ala Leu Asp Glu Gly Gly Thr Ile Ser Pro
850 855 860
Ala Gly Thr Thr Arg Tyr Ser Arg Asn Asn Thr Asp Asp Ala Thr Tyr
865 870 875 880
Thr Phe Thr Ala Pro Gln Gly Gln Ser Val Ser Asn Val Ile Val Asp
885 890 895
Gly Val Ser Met Gly Pro Leu Asn Ser Tyr Thr Phe Thr Asp Ile Asn
900 905 910
Ala Asp His Thr Leu Ala Val Ser Phe Gly Gly Glu Val Glu Gln Pro
915 920 925
Glu Thr Gly Asp Ala Ile Asn Tyr Ala Pro Glu Ala Tyr Ala Thr Ala
930 935 940
Ser Ser Glu Leu Gln Thr Ala Ala Leu Ala Asn Asp Gly Asp Ala Gly
945 950 955 960
Ser Arg Trp Glu Ser Glu His Gly Ala Gly Pro Ser Trp Leu Ala Leu
965 970 975
Glu Leu Asn Asp Thr Val Arg Val Ser Lys Val Val Ile Asp Trp Glu
980 985 990
Ala Ala Asn Ala Gly Thr Tyr Glu Ile Gln Gly Ser Leu Asn Gly Ile
995 1000 1005
Asp Trp Asn Thr Leu Glu Val Val Ser Gly Gly Ala Phe Gly Asn Arg
1010 1015 1020
Thr Asp Thr Val Leu Leu Asp Gly Ser Ala Ser Asn Ser Val Asn His
1025 1030 1035 1040
Ile Arg Ile Tyr Gly Val Glu Arg Ser Ala Gly Asn Asn Trp Gly Tyr
1045 1050 1055
Ser Ile Phe Asn Val Glu Val Trp Gly Glu Ala Gly Asp Thr Ser Met
1060 1065 1070
Glu Glu Glu Val Glu Ile Glu Pro Val Ser Ala Ala Ala Ser Ser Asp
1075 1080 1085
Phe Gln Ala Ala Ala Asn Ala Ile Asp Ala Asp Ala Gly Ser Arg Trp
1090 1095 1100
Glu Ser Ala His Ala Asp Ala Thr Ala His Leu Thr Leu Asp Leu Ala
1105 1110 1115 1120
Ser Thr Tyr Lys Leu Val Ser Val Ala Ile Asp Trp Glu Ala Ala Asn
1125 1130 1135
Ala Gly Ala Tyr Thr Leu Glu Gly Ser Ser Asp Gly Val Asn Trp Thr
1140 1145 1150
Thr Ile Ala Ser Phe Thr Gly Gly Ala Phe Gly Asn Arg Thr Asp Thr
1155 1160 1165
Leu Thr Val Ser Gly Ser His Arg Phe Val Arg Ile Asn Cys Thr Glu
1170 1175 1180
Lys Ser Ala Gly Asn Asn Trp Gly Tyr Ser Ile Tyr Asp Val Arg Leu
1185 1190 1195 1200
Thr Ala Leu Leu Pro Thr Pro
1205
<210> 162
<211> 3624
<212> DNA
<213> Microbulbifer degradans
<400> 162
atgcaaccgt ctgcgtacgg tttgtattgt gctgcgtgtt attcacaaaa atcaaagtgt 60
gagtttatca tgattaaaaa taacctaatt aaatatgccg ttagatgtgc agccttaggc 120
gcactctcta caagtgcagt acatgttagt gctacagatt acaacgaccc atattggaca 180
aaccccgaag tgaattctga ctttgtggat aatttcaacc aaacccaaat tgaccgaagt 240
aagtggttgg ttgaaacaaa tatttttgta aatggcgaag atatcgatta tcaagatgtg 300
gagtacccac aagccgattg gacaattggc gtagggcagg acgaccccgc cgcaatagac 360
ggcaaggcgc ttatattaaa agcgcgctac atggatggag aagtgcaaga ttactacggc 420
ggtgacccgg caaacccagg caagccattg tttattcgtt cggggcgtat tgagtcgcaa 480
ataaccgacg atacaacttt tacctatggg aaattcgaag cccgattaaa aatgccgcct 540
gcacgcaacg gcgaattccc cgcgtggtgg ctattaggta actttcctga tgtgggttgg 600
accgcttgcc aagagttaga cattatggaa tttaccggca acaacggtat gaatattcca 660
caaacctatt ggaccgcccc ctatgctgta cacggtggta caacagtcgg tttagcaggc 720
cttggtatta ctaatccgca agagcagtat gttacttacg gtattattaa aacgccgtca 780
aaagtagagt ggtatatcaa cggtgtgctc accaatactt tttctcgcga caaccaaggt 840
gacgatcaac cttggccgta cgtcaccccc atgcgcatga ttttaaacca cgcaattact 900
catgtggaat ggccggatgt aggtaactac aacgcttatt ccccagatgc caacaggccc 960
agcgccacag gttggagcta tgtggataac aatggcgtaa cccaatacga atatattgat 1020
gtagatgcaa tgaacgcaaa tatcggtcgg gcgggtaccg attttgttgt ggattatgtt 1080
gcccattggc cgctacctac tagcgatgcc gaacacaaat atgtagacga tagtaaatcc 1140
agctttttcc gcgaatcggg taacactaaa ggcttttaca atttaaaagg ctggttagcg 1200
ccagtgtcag tgactgctga tggatttgat caaccaggtt gggataccga tgtgcgcgac 1260
aatggcgccg ataatgcagc tgatggctat gtaggctcta agtgggcaac acccaacgac 1320
gacggtattc attgggttga agtggattac ggccaagaca agcaaattaa ttacttgtgg 1380
ttagagtggg cgtggaacct acctgccgat tacgatatat acggtaaaag cagtactggt 1440
gcttgggaat ttatcacctc gtcacaacaa gaagtggcca cgtgggctac gcacgttttt 1500
gatgtaaatc gcacttaccg ttacctaaag ctcgtaacaa aaggccgcat agataaatct 1560
agccctattt ggttgttaga gcttatggca tttgaagatg tgccaaatat gtaccctaaa 1620
cctgcgggta ccacaatgcc taaccgatcc gttaacgtgc taaataatgg caatttttcg 1680
caagggttag atagctgggg aaccgaggcc tttgatggcg caaacccaac ttacaacgtg 1740
caaaatggtg cggcagtcat tgcgcttaca aacgatggtg gcttatcggg ctctgtgcaa 1800
ttgcacactt caggttttgg tttaaagcgt gattatcgct atcacataag tttcgatgct 1860
cgcgccgatg ttgcacgaaa tttaatggtg cgcttagcag aaaacaacct taacccaagc 1920
gcagctggca cctaccatgt agaaaccgtt gctgtgggga caagctttaa cacctatagt 1980
tttacttacg attacacggg gcaaggtgaa cctgctcgac tcgcattttt actgggcgga 2040
atgggtactg cgaccaccta tatcgataat gtggttattc gtgaaggtga ctttattggc 2100
agtggtgaac cgcttgtagc ggctattagc tctaccaact ttgttagcgc cagtaatggc 2160
tgggaaaccg aatggtgggg tgcgcctgct cgtgcggtag atggaaactt aggtaataaa 2220
gcgagcggta acgatggtga agccgaaggc atggatcttg atattaccgt gcgtattgac 2280
gagcattacg acgttagagc ggtattggtt gcgggcgaca attcacctga gcgaagtttg 2340
gatcagttcc gcgttgagtt tggcaatggc acacctttaa tgggctggac cgacagcaca 2400
acagagggtg tatacgaaga gttcgcaaac tttaataaca cgcctgctgc tggcgagcgc 2460
gattttaaat tcttcttccg cccacctgca gggcaactgg tagaagtggc cgatgtgcaa 2520
ttgctagcgg tagatttaat gccccataga atttcagcca tagcgttaga cgagggcggc 2580
actatttcgc cagcaggcac tacgcgctac agcagaaaca acaccgatga tgcaacctac 2640
acgtttaccg ccccacaagg gcaatcggta agtaatgtga ttgtggatgg cgtgtcgatg 2700
ggaccgttaa atagttatac ttttaccgat atcaatgctg atcacacctt agcggtaagc 2760
tttggtggtg aagttgagca accagaaact ggcgatgcta tcaactatgc gcccgaggct 2820
tacgcaactg cgagttccga gttacaaacg gcggcgttag ccaacgacgg cgatgcaggt 2880
tcgcgctggg agagtgagca cggtgcaggc ccttcgtggt tagcgctaga gttaaatgac 2940
acagtgcgcg tatccaaagt ggttattgat tgggaagcgg caaacgctgg cacctatgaa 3000
attcaaggct cgctaaacgg tatagattgg aacaccttag aagtcgttag tggtggagcc 3060
tttggtaatc gtacggatac ggttttacta gatggctctg ctagtaattc tgtcaaccat 3120
attcgtatat acggtgtaga acgcagtgct ggcaacaact gggggtattc aatctttaac 3180
gttgaagtat ggggcgaagc aggtgatacc tcgatggagg aggaagtaga aattgagcct 3240
gtatccgcag cagctagcag tgattttcaa gcggcagcga atgcaataga tgccgatgct 3300
ggtagtcgct gggaaagcgc gcacgcagat gctaccgcac acttaacatt ggatttagca 3360
tcaacctata aactcgtgtc agtcgctata gattgggaag ctgccaatgc tggagcttac 3420
accctagaag gttctagcga tggtgtaaat tggacaacca ttgcaagctt caccggcggt 3480
gctttcggca atagaaccga tactttaacg gtgagtggca gccatagatt tgtacgcatt 3540
aactgtacag aaaaaagtgc aggcaacaat tggggctatt ccatttatga cgtacgtcta 3600
actgcattat tacctacgcc ttaa 3624
<210> 163
<211> 331
<212> PRT
<213> Microbulbifer degradans
<220>
<221> MOD_RES
<222> (313)..(329)
<223> Variable amino acid
<400> 163
Met Lys Lys Val Phe Ser Leu Ile Ile Gly Leu Ser Leu Ser Leu Pro
1 5 10 15
Val Trp Ala Gly Trp Glu Arg Gln Trp Ile Asp Thr Phe Asp Gly Asp
20 25 30
Ser Val Asp Trp Thr Asn Trp Thr Ala Gln Val Glu Ala Asn Tyr Asn
35 40 45
Asn Glu Val Gln Cys Tyr Thr Ala Asp Glu Thr Ser Ala Asn Lys Asn
50 55 60
Tyr Asp Val Ser Asp Gly Thr Leu Lys Ile Ile Ala Arg Lys Gln Ser
65 70 75 80
Val Glu Cys Ala Gly Leu Gly Gly Gln Asn Lys Thr Trp Thr Ser Gly
85 90 95
Arg Leu Asn Ser Lys Asp Lys Gln Glu Phe Leu Tyr Gly Arg Ile Glu
100 105 110
Ser Arg Ile Arg Phe His Asn Leu Glu Gly Gly Thr Trp Pro Ala Phe
115 120 125
Trp Met Leu Glu Asn Arg Ile Ser Glu Thr Pro Arg Lys Tyr Asp Asp
130 135 140
Asp Tyr Glu Gln Trp Pro Asn Pro Gly Ala Gly Glu Ile Asp Val Trp
145 150 155 160
Glu Trp Phe Ser Asn Gly Pro Asn Ile Tyr Ile Ile Asn Phe Phe Asn
165 170 175
Ala Asn Asn Cys Gly Asp Arg His Asp Tyr Val Tyr Pro Asn Gly Gly
180 185 190
Ser Asp Val Leu Asn Trp His Asn Tyr Ala Met Glu Trp Asp Ala Asn
195 200 205
Asn Ile Ser Phe Phe Ile Asp Gly Ser Leu Ile Thr Ser Phe Asp Val
210 215 220
Ser Ser Cys Pro Gln Tyr Lys Glu Lys Met Phe Val Leu Leu Asn Leu
225 230 235 240
Ala Val Gly Gly Asn Leu Gly Gly Thr Ile Asp Pro Asn Leu Ser Leu
245 250 255
Ala Thr Leu Glu Val Asp Tyr Val Gly Tyr Cys Thr Ala Thr Asn Ala
260 265 270
Asn Asp Tyr Ala Ser Cys Asp Glu Thr Thr Pro Ala Ala Leu Gln Gly
275 280 285
Glu Pro Ser Thr Ser Trp Val Leu Ser Thr Val Asn Ala Thr Ala Thr
290 295 300
Leu Ala Asn Ala Ala Gln Asn Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
305 310 315 320
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys Leu
325 330
<210> 164
<211> 707
<212> PRT
<213> Microbulbifer degradans
<220>
<221> MOD_RES
<222> (14)..(30)
<223> Variable amino acid
<400> 164
Met Leu Pro Pro Arg Leu Pro Met Gln Arg Lys Ile Ala Xaa Xaa Xaa
1 5 10 15
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Tyr
20 25 30
Asn Asn Glu Val Gln Cys Tyr Thr Ala Asp Glu Thr Ser Ala Asn Lys
35 40 45
Asn Tyr Asp Val Ser Asp Gly Thr Leu Lys Ile Ile Ala Arg Lys Gln
50 55 60
Ser Val Glu Cys Ala Gly Leu Gly Gly Gln Asn Lys Thr Trp Thr Ser
65 70 75 80
Gly Arg Leu Asn Ser Lys Asp Lys Gln Glu Phe Leu Tyr Gly Arg Ile
85 90 95
Glu Ser Arg Ile Arg Phe His Asn Leu Glu Gly Gly Thr Trp Pro Ala
100 105 110
Phe Trp Met Leu Glu Asn Arg Ile Ser Glu Thr Pro Arg Lys Tyr Asp
115 120 125
Asp Asp Tyr Glu Gln Trp Pro Asn Pro Gly Ala Gly Glu Ile Asp Val
130 135 140
Trp Glu Trp Phe Ser Asn Gly Pro Asn Ile Tyr Ile Ile Asn Phe Phe
145 150 155 160
Asn Ala Asn Asn Cys Gly Asp Arg His Asp Tyr Val Tyr Pro Asn Gly
165 170 175
Gly Ser Asp Val Leu Asn Trp His Asn Tyr Ala Met Glu Trp Asp Ala
180 185 190
Asn Asn Ile Ser Phe Phe Ile Asp Gly Ser Leu Ile Thr Ser Phe Asp
195 200 205
Val Ser Ser Cys Pro Gln Tyr Lys Glu Lys Met Phe Val Leu Leu Asn
210 215 220
Leu Ala Val Gly Gly Asn Leu Gly Gly Thr Ile Asp Pro Asn Leu Ser
225 230 235 240
Leu Ala Thr Leu Glu Val Asp Tyr Val Gly Tyr Cys Thr Ala Thr Asn
245 250 255
Ala Asn Asp Tyr Ala Ser Cys Asp Glu Thr Thr Pro Ala Ala Leu Gln
260 265 270
Gly Glu Pro Ser Thr Ser Trp Val Leu Ser Thr Val Asn Ala Thr Ala
275 280 285
Thr Leu Ala Asn Ala Ala Gln Asn Ser Gly Ala Ile Ala Phe Glu Thr
290 295 300
Ser Ala Phe Gly Glu Asn Ala Ser Asp Pro Met Ile Tyr Gln Ala Gly
305 310 315 320
Gln Asn Ile Thr Ala Gly Lys Glu Tyr Thr Leu Ala Phe Asp Val Arg
325 330 335
Ser Asn Thr Ala Gly Arg Ala Phe Arg Ala Phe Val Ala Ala Ser Ala
340 345 350
Glu Ala Ser Gln Gly Ile Leu Asp Gln Glu Val Val Val Ala Asn Ala
355 360 365
Asp Ala Trp Gln Phe Val Ser Leu Asn Phe Thr Ser Glu Gln Thr Phe
370 375 380
Asn Asp Ala Val Ile Ala Phe Gln Ser Gly Thr Ala Ala Leu Gly Asn
385 390 395 400
Gly Glu Leu Leu Phe Lys Asn Ile Val Leu Thr Glu His Ser Thr Leu
405 410 415
Tyr Ser Thr Thr Val Val Thr Asn Ala Gly Gly Ser Ile Asn Pro Pro
420 425 430
Gly Pro Thr Val Lys Thr Val Ser Gly Tyr Ser Ser His Phe Thr Ile
435 440 445
Thr Ala Asn Val Thr His Ala Ile Gly Asp Val Phe Ile Asp Gly Val
450 455 460
Ser Val Gly Pro Val Ala Glu Tyr Thr Phe Asp Asn Ile Gln Gly Asp
465 470 475 480
His Glu Ile Glu Val Gln Phe Val Ala Leu Pro Pro Pro Glu Gly Thr
485 490 495
Glu Ile Leu Thr Pro Val Ala Ala Thr Ala Ser Ser Ser Leu Ser Val
500 505 510
Ala Ser Arg Ala Ile Asp Gly Asp Tyr Ala Thr Arg Trp Glu Ser Arg
515 520 525
His Gly Ile Glu Ala Thr Trp Leu Glu Leu Asp Leu Gly Thr Ala Val
530 535 540
Asp Leu His Ser Ile Leu Ile Asn Trp Glu Ala Ala Asn Ala Lys Ala
545 550 555 560
Tyr Thr Leu Glu Gly Ser Asn Asp Gly Glu Asn Trp Thr Val Leu Ala
565 570 575
Ser Val Thr Asn Gly Thr Phe Gly Asp Arg Ala Asp Glu Leu Ser Leu
580 585 590
Thr Gly Ser Tyr Ser His Leu Arg Ile Asn Ala Thr Glu Arg Ser Ala
595 600 605
Gly Asn Asp Trp Gly Tyr Ser Ile Trp Asp Ile Val Leu Tyr Ala Tyr
610 615 620
Thr Gln Asp Gln Asn Ala Ser Ser Ser Ser Ser Ser Ser Ser Asn Ser
625 630 635 640
Thr Thr Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser
645 650 655
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Gly Ser Glu
660 665 670
Pro Ala Ser Ser Gly Ser Gly Arg Met Asp Gly Val Val Phe Leu Phe
675 680 685
Leu Met Val Leu Phe Gly Leu Val Thr Gln Arg Val Val Leu Ala Lys
690 695 700
Leu Val Ser
705
<210> 165
<211> 2124
<212> DNA
<213> Microbulbifer degradans
<220>
<221> modified_base
<222> (40)..(89)
<223> a, c, g, or t
<400> 165
atgctaccgc cacgcttgcc aatgcagcgc aaaatagcgn nnnnnnnnnn nnnnnnnnnn 60
nnnnnnnnnn nnnnnnnnnn nnnnnnnnna aattataaca acgaagtgca gtgttatacc 120
gcagatgaaa catccgctaa taaaaactac gacgtatctg acggtacgct aaaaataata 180
gcaagaaagc aatccgtcga gtgtgcaggc ctaggtggcc aaaacaaaac atggacatca 240
gggcgattaa acagcaaaga taaacaagag ttcttgtatg gccgtataga atcccgcatt 300
cgcttccaca acctagaggg cggcacgtgg ccggcatttt ggatgctcga aaaccgcata 360
tccgaaaccc cgcgtaaata cgatgatgat tacgagcaat ggcccaaccc cggtgccggt 420
gaaatagatg tgtgggagtg gttttcgaac ggccctaata tttacatcat aaacttcttt 480
aatgctaata actgcggcga taggcacgat tacgtttacc caaatggcgg cagtgatgta 540
ctaaattggc ataactatgc catggagtgg gatgcaaata acattagctt ttttatagat 600
ggcagcctga ttacctcgtt tgatgtgtct tcgtgcccgc agtacaaaga gaaaatgttt 660
gtactactca acttggccgt tggcgggaat ctaggtggta ctatcgaccc caatttatcg 720
ctcgcaacat tagaagtgga ttacgtaggc tactgcacag ctaccaatgc taacgactac 780
gcaagctgtg atgaaacaac acccgctgca ttgcaaggtg agcccagcac atcgtgggtg 840
ttaagcacag taaatgctac cgccacgctt gccaatgcag cgcaaaatag cggagccatt 900
gcgtttgaaa caagcgcttt cggcgaaaat gcatcagacc ctatgattta ccaagcaggg 960
caaaacatca cagccggtaa agaatatacc ttagcgtttg atgtgcgatc caatactgcc 1020
ggccgcgcct ttcgtgcatt tgtggcggct agtgccgaag cctcgcaggg tatattagat 1080
caagaagttg tggttgcaaa tgcagacgca tggcaattcg taagtttaaa cttcacgtca 1140
gaacaaacct ttaacgatgc ggttatagcg tttcaatccg gcactgcagc actgggtaac 1200
ggtgagttgt tgtttaaaaa tatagtgctt accgagcatt ccaccttgta ctcaacaacg 1260
gttgttacca atgctggtgg ttcaataaac cctcctgggc ctacagtaaa aaccgtaagc 1320
ggttacagta gccactttac aattactgca aatgtaaccc atgctattgg cgatgtgttt 1380
atcgatggcg taagcgttgg ccctgttgcc gaatatacgt tcgacaatat acaaggcgac 1440
cacgaaatag aagtgcagtt tgttgcactg ccgccccccg aaggtaccga gatacttacc 1500
cctgtagccg ccactgcgag cagttcactt agtgttgcga gcagagcaat cgatggcgac 1560
tacgcaaccc gctgggaatc gcgccacggt atagaggcta cgtggctaga gttagattta 1620
ggcacagccg tagatttgca ctccatactc ataaactggg aagccgcaaa cgcaaaagcg 1680
tacaccctag aaggctctaa cgatggcgaa aattggacag tactcgcaag cgtaacgaat 1740
ggcacctttg gcgaccgcgc cgatgagcta tcactcactg gcagctactc acacctgcga 1800
ataaacgcaa ccgaaagaag cgcgggtaat gattggggct attccatttg ggatatagtg 1860
ctatacgcat acacgcaaga tcaaaacgca agtagcagct cgtctagttc ttcaaacagt 1920
acaacctcta gttcttctag ctcatccact tcttcgagta gctcttcctc aagctcttct 1980
agttcatctt caagttctag ctctggcggc agtgaacctg cttcgagcgg tagcggcaga 2040
atggatggtg ttgtttttct tttcttgatg gtgttgtttg gtttggttac acaacgcgtc 2100
gtgttagcta aattagttag ttaa 2124
<210> 166
<211> 573
<212> PRT
<213> Microbulbifer degradans
<400> 166
Met Glu Ala Thr Met Ile Lys Thr Cys Lys Thr Thr Pro Ala Lys Trp
1 5 10 15
Ala Ala Ala Leu Ser Leu Gly Cys Ala Leu Leu Leu Pro Ser Gly Val
20 25 30
Asn Ala Ala Thr Phe Gln Ala Glu Asp Tyr Ser Ala Phe Tyr Asp Thr
35 40 45
Thr Ala Gly Asn Thr Gly Gly Ala Tyr Arg Asn Asp Asn Val Asp Ile
50 55 60
Glu Ala Thr Asn Asp Asn Gly Gly Gly Tyr Asn Val Gly Trp Ile Asp
65 70 75 80
Ala Asn Glu Trp Leu Val Tyr Pro Gly Val Asn Ile Thr Thr Thr Gly
85 90 95
Asp Tyr Val Ile Asn Val Arg Val Ala Ser Ala Ser Gly Gly Ala Leu
100 105 110
Ser Val Asp Phe Asn Ala Gly Ser Ile Pro Leu Gly Gln Phe Asp Ile
115 120 125
Pro Asn Thr Gly Gly Trp Gln Asn Trp Val Thr Val Ser Lys Thr Val
130 135 140
Thr Leu Thr Ala Gly Thr Tyr Asp Met Gly Val Phe Ala Ser Thr Gly
145 150 155 160
Gly Trp Asn Phe Asn Trp Ile Glu Val Thr Pro Val Asn Asn Gly Gly
165 170 175
Gly Asn Gly Gly Gly Ser Ala Thr Phe Phe Gln Ala Glu Asp Tyr Ser
180 185 190
Asn Tyr Ser Asp Thr Thr Pro Glu Asn Ile Gly Gly Ala Tyr Arg Asn
195 200 205
Asp Gly Val Asp Ile Glu Thr Thr Ser Asp Ala Asn Gly Gly His Asn
210 215 220
Ile Gly Trp Met Glu Asn Gly Glu Trp Leu Ala Tyr Glu Gly Leu Ser
225 230 235 240
Ile Pro Ser Asn Gly Asn Tyr Ile Ile Lys Ala Arg Val Ala Ser Pro
245 250 255
Asn Gly Gly Ala Leu Ser Phe Asp Leu Asn Gly Gly Gly Gln Val Leu
260 265 270
Gly Thr Met Asn Ile Pro Ala Thr Gly Gly Trp Gln Asn Trp Gln Thr
275 280 285
Val Glu Leu Ser Thr Asn Ile Asn Ala Gly Thr Tyr Thr Leu Gly Ala
290 295 300
Phe Val Ser Thr Ser Gly Phe Asn Leu Asn Trp Ile Glu Val Val Ala
305 310 315 320
Gly Gly Asp Gly Gly Asn Asn Gly Gly Gly Gly Gly Asn Ile Thr Trp
325 330 335
Arg Asp Glu Phe Asp Thr Ile Asn Arg Asp Val Trp Asn Phe Glu Thr
340 345 350
Gly Gly Gly Gly Trp Gly Asn Asn Glu Leu Gln Tyr Tyr Thr Asp Gly
355 360 365
Gln Asn Ala Ser Ile Gln Phe Asp Pro Gln Ala Gly Ser Asn Val Leu
370 375 380
Val Ile Glu Ala Arg Lys Glu Thr Gly Gly Gln Cys Trp Trp Gly Gly
385 390 395 400
Asn Cys Gly Tyr Thr Ser Ser Arg Met Asn Thr Arg Phe Lys Lys Ser
405 410 415
Phe Gln Tyr Gly Arg Ile Glu Ala Arg Met Lys Leu Pro Arg Thr Gln
420 425 430
Gly Ile Trp Pro Ala Phe Trp Met Leu Gly Asp Asn Phe Asn Asn Val
435 440 445
Gly Trp Pro Gln Gly Gly Glu Leu Asp Ile Met Glu His Val Gly Thr
450 455 460
Asn Asn Ile Thr Ser Gly Ala Leu His Gly Pro Gly Tyr Ser Gly Asn
465 470 475 480
Thr Pro Ile Thr Gly His Leu Glu His Gly Ala Ser Ile Asp Ser Gly
485 490 495
Tyr Arg Val Tyr Ala Val Glu Trp Asp Thr Asn Gly Ile Arg Trp Phe
500 505 510
Val Asp Gly Thr Asn Phe Tyr Ser Val Glu Lys Trp Gln Val Gln Gln
515 520 525
Tyr Gly Glu Trp Val Tyr Asp Gln Pro Phe Trp Ile Leu Leu Asn Leu
530 535 540
Ala Val Gly Gly Asn Trp Pro Gly Asp Pro Asp His Ala Asn Phe Thr
545 550 555 560
Thr Gln Arg Phe Tyr Ile Asp Tyr Val Arg Val Ile Gln
565 570
<210> 167
<211> 1722
<212> DNA
<213> Microbulbifer degradans
<400> 167
atggaggcta caatgataaa aacatgcaaa accacaccag ccaagtgggc agcagccctt 60
agcttaggct gcgccctact tttaccctca ggcgtcaatg ctgccacctt ccaagcagaa 120
gattacagcg ccttctacga caccaccgcg ggtaacactg gcggcgccta ccgcaatgat 180
aacgtcgata tagaagccac caatgataac ggcggcggct ataacgtggg ctggatagat 240
gccaacgaat ggctggtata cccaggcgta aatatcacca ccaccggcga ctatgtaatt 300
aatgtacgcg tggcgagcgc aagcggcggt gcactttctg tcgactttaa tgcaggctct 360
attccactgg gtcagttcga tattcccaac actggcggtt ggcaaaactg ggtaaccgtt 420
tcaaaaacag ttactttaac tgccggcacc tacgatatgg gcgtattcgc ctctaccggc 480
ggctggaact ttaactggat agaagtcaca cccgttaata acggtggtgg caatggcggc 540
ggttcagcca cgttctttca agcggaagac tacagcaact actccgacac cacacccgaa 600
aacattggcg gcgcctatcg caatgacggt gtagatatag aaaccacaag cgatgccaac 660
ggcggccaca atataggctg gatggaaaac ggcgaatggc ttgcctacga aggtttatct 720
attcccagta acggcaacta catcattaaa gcgcgtgtag ctagccccaa cggcggcgca 780
ctatcgtttg atttaaacgg cggcggccaa gtgcttggca caatgaacat acccgctact 840
ggcggttggc aaaattggca aaccgtagaa ttgagcacca acattaatgc cggaacctat 900
acgctcggcg cctttgttag tactagtggc tttaacctaa attggataga agtcgttgcc 960
ggtggcgacg gcggcaataa tggtggcggt ggtggcaaca tcacttggcg cgatgaattc 1020
gacacaatca accgcgatgt ttggaatttt gaaaccggtg gcggcggctg gggtaacaac 1080
gaattgcaat actacaccga cggccaaaac gcttccattc agttcgaccc gcaagccggt 1140
agtaatgtgt tagttatcga agcccgcaaa gaaaccggtg gccaatgttg gtggggcggc 1200
aattgcggtt acacctctag ccgcatgaac acgcggttta aaaaatcgtt ccaatacggc 1260
cgtatagaag cacgtatgaa gctgccgcgt acccaaggta tttggccagc gttttggatg 1320
ctaggtgaca actttaataa tgtaggctgg cctcaaggtg gcgaactaga cattatggag 1380
cacgtaggca ccaacaatat cacatcgggt gcattgcacg gcccaggcta cagcggtaat 1440
acgcccatta ccggccattt ggagcacggt gcaagcattg attctggcta ccgtgtttac 1500
gcagtagaat gggataccaa tggtattcgc tggtttgttg acggcaccaa cttctacagc 1560
gtagaaaaat ggcaagtgca acagtacggc gaatgggttt acgaccaacc tttctggata 1620
ctgctaaacc tagcggtggg aggcaactgg cccggcgacc cagaccacgc caactttact 1680
acccaacgtt tttacattga ctatgtgcgg gtgattcagt aa 1722
<210> 168
<211> 742
<212> PRT
<213> Microbulbifer degradans
<400> 168
Met Lys Lys Leu Lys Leu Leu Glu Leu Ser Leu Val Val Leu Val Ala
1 5 10 15
Leu Gly Leu Ala Ser Cys Gly Gly Ser Asp Asp Lys Lys Thr Pro Asp
20 25 30
Pro Val Glu Glu Pro Val Gly Glu Pro Glu Ser Glu Pro Glu Ser Glu
35 40 45
Pro Glu Ser Glu Pro Glu Ser Glu Pro Glu Pro Glu Asn Ala Trp Gln
50 55 60
Leu Leu Trp Glu Asp Asn Phe Asp Ser Glu Ile Ser Ala Ser Asn Trp
65 70 75 80
Gly Phe Glu Val Asn Cys Thr Gly Gly Gly Asn Asn Glu Lys Gln Cys
85 90 95
Tyr Thr Asp Arg Ala Asp Asn Ala Tyr Val Asp Glu Ala Gly Ile Leu
100 105 110
His Ile Val Ala Lys Glu Glu Ala Phe Ser Gly Pro Ala Ile Gln Asp
115 120 125
Asp Asp Pro Asn Tyr Asn Pro Asp Asp Thr Ser Ala Ala Arg Asn Tyr
130 135 140
Thr Ser Ala Arg Leu Arg Thr Leu Asp Lys Phe Asp Phe Lys Tyr Gly
145 150 155 160
Arg Val Glu Ile Arg Ala Gln Ile Pro Gly Gly Gln Gly Ser Trp Pro
165 170 175
Ala Leu Trp Met Leu Pro Ser Asp Lys Val Tyr Gly Gly Trp Pro Ala
180 185 190
Ser Gly Glu Ile Asp Ile Met Glu Ala Val Asn Leu Asp Thr Asp Ala
195 200 205
Ala Asn Ala Val His Gly Thr Leu His Tyr Gly Leu Gln Trp Pro Gln
210 215 220
Trp Ser Thr Ile Gly Ala Ser Tyr Glu Thr Asn Asp Asp Phe Thr Gly
225 230 235 240
Glu Phe His Thr Tyr Ala Ile Glu Trp Glu Ala Asp Gln Ile Arg Trp
245 250 255
Phe Val Asp Gly Val His Thr Gln Thr Gln Val Ser Asp Asn Trp Tyr
260 265 270
Asn Phe Val Trp Gly Gly Gln Glu Ser Gly Phe Ala Val Ala Asn Pro
275 280 285
Arg Ala Pro Tyr Asp Gln Glu Phe His Leu Ile Met Asn Val Ala Ile
290 295 300
Gly Gly Asn Trp Pro Gly Asp Pro Asp Thr Gly Trp Ala Ser Asp Arg
305 310 315 320
Glu Met Leu Val Asp Tyr Val Arg Val Tyr Gln Cys Asp Ser Glu Ala
325 330 335
Asp Asp Gly Thr Gly Cys Ala Asn Ala Thr Asp Ala Val Asp Ile Ala
340 345 350
Ile Ser Pro Thr Ala Asp Val Gly Ala Pro Ser Gln Val Glu Tyr Asp
355 360 365
Leu Tyr Ser Asp Gly Leu Gln Thr Pro Ala Phe Thr Asn Ala Gly Val
370 375 380
Thr Leu Ala Ala Asn Val Trp Gln Glu Thr Glu Gly Asn Val Val Thr
385 390 395 400
Thr Thr Gly Asn Leu Gly Asp Asp His Gly Asp Ala Trp Glu Ile Thr
405 410 415
Phe Asn Gly Thr Gly Asn Val Ser Ile Ala Ile Ala Glu Gln Asp Gly
420 425 430
Val Glu Ser Tyr Val Arg Leu Asn Gly Gly Thr Ala Trp Ser Asn Tyr
435 440 445
Gly Val Leu Ala Phe Asp Met Tyr Val Asp Ser Val Asp Ala Glu Thr
450 455 460
Gly Phe Val Val Lys Met Asp Ser Val Tyr Pro Asn Val Gly Ala Val
465 470 475 480
Asp Ile Ala Thr Pro Ala Ala Gly Glu Trp Thr Arg Val His Val Lys
485 490 495
Val Ala Asp Ile Leu Ala Asn Pro Ile Ala Gly Gly Gly Gly Leu Asn
500 505 510
Val Ala Gln Ala Val Asn Leu Phe Val Leu Glu Pro Thr Gly Thr Lys
515 520 525
Thr Ala His Val Tyr Val Asp Asn Ile Ser Ile Ser Cys Ala Tyr Asn
530 535 540
Ser Thr Ala Lys Ser Trp Gln Gly Asp Lys Thr Cys Asp Val Ala Ala
545 550 555 560
Val Pro Thr Ala Asp Thr Ser Gly Ile Asp Leu Ser Gly Asn Glu Leu
565 570 575
Val Ile Phe Asp Glu Gln Glu Pro Asp Phe Trp Gly Phe Gly Gln Phe
580 585 590
Ser Ala Gly Gly Ala Glu Val Ala Met Ser Phe Leu Ala Asp Pro Glu
595 600 605
Asp Ala Ser His Gly Asn Val Leu Gly Phe Ser Tyr Pro Asp Ser Gln
610 615 620
Asn Val Ala Tyr Leu Gln Ser Ala Thr Pro Leu Asn Leu Ser Asp Trp
625 630 635 640
Ala Gly Gly Thr Ile Gln Phe Asp Met Tyr Val Ile Ser Glu Pro Ala
645 650 655
Asn Val Asn Trp Met Met Lys Val Asp Cys Val His Pro Cys Ser Ser
660 665 670
Gly Asp Ile Pro Leu Thr Thr Asn Ile Asp Gly Val Val Pro Ala Val
675 680 685
Gly Val Trp Gln Thr Tyr Arg Phe Asn Leu Asp Ala Leu Val Ala Gly
690 695 700
Asn Pro Gly Leu Asp Leu Thr Lys Val Asp Thr Pro Leu Val Ile Phe
705 710 715 720
Pro Ala Trp Asp Asn Gln Thr Gly Ala Asn Phe Arg Ile Asp Asn Ile
725 730 735
Lys Phe Val Arg Ala Asn
740
<210> 169
<211> 2229
<212> DNA
<213> Microbulbifer degradans
<400> 169
atgaaaaaac taaagcttct agaactttcg ctggtggtgt tggtagccct agggctagcg 60
agctgcggcg gttctgacga caaaaaaacg ccagacccag tagaagaacc cgtgggtgaa 120
ccagaatcag agcctgaatc agagccagaa tctgaacccg aatctgagcc agaaccagaa 180
aacgcatggc aattgctgtg ggaagataat ttcgacagcg aaataagcgc atctaactgg 240
ggcttcgaag ttaactgcac cggcgggggt aacaacgaga aacagtgtta taccgataga 300
gccgacaacg cttatgtaga tgaggcaggc atactgcaca tagttgccaa agaggaagca 360
tttagcggac ccgctattca ggatgatgac cccaactaca accccgatga cacatcggct 420
gcgcgcaatt atacctctgc gcgtttgcgt acattagata agtttgactt caaatatggt 480
cgagtcgaaa ttcgtgcgca aataccaggt gggcagggct catggcctgc attgtggatg 540
ttgccaagtg acaaagtata tggtggttgg ccagccagtg gcgaaatcga cattatggaa 600
gcggttaatt tagatactga tgcggccaat gccgtgcatg gtactttgca ctacggttta 660
cagtggccac aatggtcgac gataggcgct tcttacgaaa caaatgatga tttcaccggt 720
gagttccaca cttatgccat tgagtgggaa gcagaccaaa ttcgctggtt tgtggatggc 780
gtacacactc aaacacaggt gtccgataac tggtacaact ttgtatgggg cgggcaagag 840
tccggctttg ctgtggctaa cccgcgtgca ccttacgatc aagagtttca tttaatcatg 900
aacgtcgcta taggcggcaa ctggccgggt gacccggaca cgggttgggc atctgatcgc 960
gaaatgcttg tagattatgt gcgtgtgtac cagtgtgatt ctgaagccga tgatggaaca 1020
ggctgtgcaa acgcaaccga cgccgttgat attgctattt caccgaccgc agatgtaggt 1080
gcgccatcac aagtagaata cgacctatat agtgacggtt tacaaacgcc tgcatttacc 1140
aacgccggcg taactcttgc tgcaaatgtt tggcaagaaa cagaaggcaa cgtagtaact 1200
accactggca acttgggcga cgaccatggc gatgcttggg aaattacctt taatggtact 1260
ggtaatgtat ctattgctat tgccgagcaa gacggcgtgg aaagctatgt gcgattaaat 1320
ggcggaacag cttggagtaa ctacggcgta ctagcgtttg atatgtatgt agatagcgtg 1380
gatgccgaaa ctggctttgt tgttaaaatg gacagtgttt accccaatgt aggtgctgta 1440
gatattgcta cacctgccgc aggtgagtgg acgcgagtgc acgtaaaagt ggctgatatt 1500
ttagctaacc caattgcagg tggcggtggc cttaacgttg cgcaagcggt taacttgttt 1560
gtacttgagc caacaggtac taaaacagca catgtatatg tagataatat ttctattagc 1620
tgcgcatata actctactgc aaaatcttgg cagggtgata aaacctgtga tgttgcagcg 1680
gtgcctactg ccgatacctc aggtatagat ttaagtggta acgagcttgt gatttttgat 1740
gagcaagagc cggatttttg gggttttggg cagttttctg ctggtggtgc tgaagtagca 1800
atgagcttct tagctgaccc agaagatgcg agtcacggta atgtgttggg ctttagctac 1860
ccagatagcc aaaacgttgc ctacttacaa tctgctacac cgcttaattt aagcgattgg 1920
gctgggggta ctatccaatt cgacatgtac gttattagcg agcctgctaa tgtaaattgg 1980
atgatgaaag tggattgtgt acacccttgt tcatctggcg atataccgct caccaccaat 2040
attgatggtg tagtgcctgc tgtgggtgta tggcaaacct atcgctttaa cttagatgca 2100
ttagttgctg gcaacccagg gttagatttg accaaagtgg atacgccgct agtaatattc 2160
cctgcatggg ataaccaaac cggtgctaac ttccgtatag ataacattaa atttgttcgg 2220
gctaactag 2229
<210> 170
<211> 881
<212> PRT
<213> Microbulbifer degradans
<400> 170
Met Gly Leu Thr Met Val Lys Asn Lys Leu Tyr Leu Ala Leu Ser Ile
1 5 10 15
Ala Ala Ala Thr Ser Leu Thr Ala Cys Gly Gly Gly Gly Asp Ala Ala
20 25 30
Asp Asp Thr Val Lys Arg Asn Val Phe Ala Val Gly Asp Val Phe Lys
35 40 45
Thr Gln Glu Asp Gly Asp Ala Val Glu Ala Asp Val Ser Glu Asn Asp
50 55 60
Phe Gly Arg Gly Leu Thr Phe Ala Leu Glu Ser Gly Ser Thr Thr Ala
65 70 75 80
Asn Gly Glu Leu Val Phe Asn Ala Asp Gly Ser Phe Thr Tyr Thr Pro
85 90 95
Asn Ala Asp Phe Ser Gly Lys Asp Ser Phe Thr Tyr Val Ala Thr His
100 105 110
Thr Ala Ser Gly Asp Thr Ala Ser Ala Leu Val Thr Ile Asn Val Ile
115 120 125
Ser Asp Phe Glu Thr Ile Glu Glu Ser Gly Trp Thr Leu Thr Trp Ser
130 135 140
Asp Glu Phe Asp Ser Leu Asp Ser Met Ala Trp Asp Ala Met Asn Ala
145 150 155 160
Ser Ala Ser Glu Gly Val Leu Ser Val Ser Ala Val Glu Gly Gln Thr
165 170 175
Ser Tyr Val Lys Ser Thr Ala Ala Leu Gly Gln Ala Gly Arg Ile Glu
180 185 190
Ala Ser Ile Gln Leu Pro Asp Gly Lys Ser Leu Tyr Ser Gly Phe Gly
195 200 205
Leu Met Pro Met Ala Asp Met Phe Asp Gly Lys Asn Ala Leu Met Ala
210 215 220
Ile Glu Ser Ala Asn Asn Lys Ala Thr Ala Gly Gly His Trp Gly Ile
225 230 235 240
Gly Leu Val Asn Gly Val Glu Ile Asn Glu Pro Thr Asn Ala Val Val
245 250 255
Arg Ala Glu Phe His Thr Tyr Ala Ile Glu Trp Asn Glu Ser Leu Ile
260 265 270
Arg Trp Tyr Ile Asp Asp Ile His Ile His Thr Val Asp Thr Leu Asn
275 280 285
Thr Trp Ser Tyr Asn Leu Ser Gly Asp Thr Val Val Ala Asp Thr Thr
290 295 300
Thr Lys Pro Phe Ser Gln Asp Leu Gln Ile Met Met Glu Leu Thr Ala
305 310 315 320
Ala Ser Ser Gly Leu Pro Asn Ala Met Leu Val Asp Phe Val Lys Val
325 330 335
Tyr Glu Cys Asp Thr Ser Val Thr Asp Gln Ile Glu Ser Cys Ala Phe
340 345 350
Ala Ala Asp Glu Asn Val Asp Lys Leu Ala Ser Asn Arg Ile Glu Ser
355 360 365
Val Gly Glu Ile Val Thr Pro Leu Phe Thr Asp Glu Leu Thr Ser Leu
370 375 380
Ser Trp His Tyr Ser Asp Ala Glu Glu Ala Val Thr Phe Val Thr Glu
385 390 395 400
Gln Glu Ser Ala Val Pro Thr His Arg Gly Ile Val Ser Gln Pro Asp
405 410 415
Val Met Glu Pro Val Ala Glu Val Ala Pro Val Asp Pro Val Glu Glu
420 425 430
Gly Glu Glu Gly Tyr Glu Glu Tyr Gln Ala Tyr Leu Ala Tyr Val Asp
435 440 445
Tyr Leu Asn Tyr Leu Glu Thr Leu Ala Phe Leu Asp Thr Val Asp Pro
450 455 460
Ser Arg Glu Arg Gly Ala Val Ile Arg Tyr Gln Ser Asp Ala Ser Thr
465 470 475 480
Trp Ser Asn Phe Ser Leu Asn Thr Pro Ser Leu Gly Leu Val Gly Lys
485 490 495
Asp Ser Ala Leu Gln Phe Asp Met Tyr Ile Asp Ser Ala Ser Thr Thr
500 505 510
Thr Glu Thr Ile Glu Ile Arg Met Glu Thr Gly Trp Pro Phe Leu Gly
515 520 525
Thr Val Leu Leu Asn Val Ala Asp Leu Gln Leu Asp Thr Trp Val Thr
530 535 540
Tyr Asn Ile Pro Val Ser Asp Phe Leu Ala Asn Pro Phe Ile Thr Pro
545 550 555 560
Asp Trp Ala Val Gly Gln Asp Trp Phe Leu Gly Gly Asn Gly Val Glu
565 570 575
Gly Gln Pro Leu Tyr Leu Asp Thr Asn Ser Ile Thr Lys Ala Ile Val
580 585 590
Val Gln Leu Ala Ala Pro Gly His Leu Val Phe Asp Asn Val Ala Ile
595 600 605
Thr Cys Val Ser Asn Glu Ser Cys Phe Gln Gly Pro Leu Ala Lys Gln
610 615 620
Pro Ile Val Lys Ala Gly Pro Ala Pro Ile Ile Tyr Glu Ala Glu Ala
625 630 635 640
Tyr Thr Ala Val Thr Gly Glu Val Gln Thr Glu Asp Thr Gln Asp Ala
645 650 655
Gly Gly Gly Gln Asn Val Gly Tyr Ile Asp Ala Gly Glu Ala Leu Glu
660 665 670
Tyr Thr Ile Val Ala Pro Ile Asp Gly Thr Tyr Lys Phe Gln Tyr Arg
675 680 685
Leu Ala Ser Gly Leu Glu Ser Ala Ser Glu Phe Asp Val Ser Ile Asp
690 695 700
Asp Met Leu Ile Asp Gly Gln Ser Leu Pro Gly Thr Gly Gly Trp Gln
705 710 715 720
Val Trp Thr Thr Leu Glu Ser Gly Glu Phe Asp Leu Thr Ala Gly Glu
725 730 735
His Ala Ile Val Phe Asn Phe Ala Gly Gly Met Asn Phe Asn Trp Phe
740 745 750
Ala Ile Val Pro Pro Pro Ile Ala Ile Phe Ile Glu Ala Glu Asp Tyr
755 760 765
Ser Ser Met Ala Gly Val Gln Leu Glu Asp Thr Ala Asp Glu Gly Gly
770 775 780
Gly Gln Asn Val Gly Tyr Ile Asp Ala Gly Asp Phe Leu Gln Tyr Asn
785 790 795 800
Val Glu Val Pro Ala Asp Gly Thr Tyr Phe Ile Glu Leu Arg Val Ala
805 810 815
Ser Ser Gly Gly Ser Asp Gly Phe Thr Ile Thr Ser Asn Gly Ile Thr
820 825 830
Thr Ser Thr Ile Pro Val Ala Asp Thr Gly Gly Trp Gln Asn Trp Thr
835 840 845
Thr Gln Thr Val Glu Met Gln Leu Ser Ala Gly Gln Gln Thr Leu Arg
850 855 860
Phe Asp Phe Ile Gly Gly Ala Ile Asn Phe Asn Trp Ile Asn Val Thr
865 870 875 880
Asn
<210> 171
<211> 2646
<212> DNA
<213> Microbulbifer degradans
<400> 171
atggggttaa caatggttaa gaataaattg tatctagcgc tatctatagc ggctgctaca 60
agcctaacgg catgtggtgg tggcggggat gcggctgatg acactgttaa acgcaacgta 120
tttgccgtag gcgatgtgtt taaaacgcaa gaagatggcg atgcggttga ggcggatgta 180
agtgaaaacg attttggcag aggcctcacg ttcgcattag aaagcggtag caccacggct 240
aatggcgagc tggtattcaa tgccgatggc tcttttacgt atacacccaa tgcggacttc 300
tctggtaaag actcctttac ctatgtggcc acccacaccg catcgggcga tacagccagt 360
gccttggtga ctattaacgt aataagtgac tttgaaacca tcgaggaatc tgggtggacg 420
ctaacgtggt ccgacgaatt cgatagttta gatagcatgg cgtgggatgc gatgaacgca 480
tctgcttctg aaggtgtgtt gtcggtaagc gctgtagaag ggcaaacatc ttacgttaaa 540
agcaccgctg cactggggca ggctgggcgt attgaggcga gcattcagtt gcccgatggt 600
aaaagcttgt actctggctt tggcttaatg ccaatggccg acatgttcga tggtaaaaat 660
gcattaatgg ccatcgagag cgcgaacaat aaggcaactg ctggtggtca ttggggtatt 720
ggattagtaa atggtgttga aattaatgaa cccactaacg cagtagtacg tgcggaattt 780
catacctatg ccatagagtg gaatgagagt ctgattcgct ggtatataga tgatatacat 840
attcataccg tagataccct taatacctgg tcatacaatt taagcggcga taccgtggtt 900
gccgatacta ccactaagcc gtttagccaa gacctgcaaa taatgatgga gttaacggca 960
gcaagttcag gtttgccaaa tgcaatgctg gtagattttg taaaggttta tgagtgtgac 1020
acgtctgtta ccgatcaaat agaaagctgc gcattcgcgg ctgacgaaaa tgtggataag 1080
cttgcttcca accgtattga gtcggttggt gaaattgtta ccccattgtt cacagatgag 1140
cttacatcgt taagctggca ttactctgat gccgaagaag ctgttacttt tgttacagag 1200
caagaaagcg ctgtgcctac tcatagaggt attgtttctc agcccgacgt tatggagcct 1260
gtagcagaag ttgcaccagt ggaccctgta gaagaaggtg aagagggcta cgaggaatat 1320
caagcttact tagcttatgt ggactacctg aactatttag aaactctcgc atttttagat 1380
acagtggacc caagccgcga acgtggcgct gttattcggt atcaatcgga tgcaagcaca 1440
tggtctaatt ttagtttgaa tactccgagc ttaggcttag ttggaaaaga ttccgcgctg 1500
cagtttgata tgtatatcga tagcgcatct acaactaccg aaaccatcga aatacgcatg 1560
gaaaccggtt ggccattcct tggcactgtt ctgctaaacg ttgcagactt gcagctagat 1620
acttgggtga cgtacaacat tcccgttagc gactttttag ccaatccatt tattacaccc 1680
gattgggcag tgggccaaga ctggttcttg ggcggtaacg gtgttgaagg gcagcctctc 1740
tatttagata ccaattcaat tactaaagcc attgtggttc agctagcagc accagggcat 1800
ttagttttcg ataacgttgc aattacttgt gtttcgaatg aaagctgctt ccaggggcca 1860
ttagccaaac agcccatcgt aaaggccggt cctgctccca ttatttacga agcagaagct 1920
tacacggcgg taacgggtga agtgcaaacg gaagataccc aagacgcggg cggcggccaa 1980
aacgttggct atatcgacgc aggtgaagca ttggaataca ccattgttgc acctatcgac 2040
ggtacctata aattccaata tcgcttggca agtggcttag aaagcgcttc tgagtttgat 2100
gtttctatcg acgatatgct tatcgatggc caaagcctac ctggtactgg tggctggcaa 2160
gtgtggacca cattggagtc tggtgagttt gatctaactg ctggtgaaca cgctattgtg 2220
tttaattttg ctggcggtat gaactttaac tggtttgcaa ttgttccgcc ccctattgcg 2280
atttttatcg aagctgaaga ttactcgtca atggctggtg ttcagcttga agacaccgca 2340
gatgaaggcg gcggtcaaaa tgttgggtat atcgacgcag gtgatttcct tcaatacaac 2400
gtcgaagtgc cggctgatgg tacctacttc attgaattgc gtgtagccag cagtggtggt 2460
agcgatggct ttaccattac ttccaatggc attactacaa gcactattcc tgtagcggat 2520
accggtggtt ggcaaaattg gactactcaa actgtagaga tgcaattgtc tgctggtcaa 2580
caaacacttc gtttcgactt tatcggtggg gccattaact ttaactggat caatgtcacc 2640
aactaa 2646
<210> 172
<211> 1186
<212> PRT
<213> Microbulbifer degradans
<400> 172
Met Ile Leu Arg Leu Ile Arg Ala Ala Ile Tyr Ile Val Ala Phe Val
1 5 10 15
Ser Leu Ile Ala Cys Gly Gly Ser Ser Thr Ser Lys Pro Ala Gln Pro
20 25 30
Asp Ile Pro Asp Thr Asp Leu Pro Asp Thr Gly Thr Pro Glu Pro Glu
35 40 45
Asn Ser Asp Pro Val Val Gln Ser Thr Pro Ala Thr Ala Ile Ala Ala
50 55 60
Gly Asn Val Tyr Ser Tyr Gln Ile Met Ala Ser Asp Ala Asp Glu Ser
65 70 75 80
Asp Val Ile Ser Tyr Ala Ala Val Thr Leu Pro Ser Trp Leu Ala Phe
85 90 95
Asp Pro Asp Thr Gly Ile Leu Thr Gly Thr Pro Gln Gln Ala His Ala
100 105 110
Gly Asn Ala Glu Val Val Leu Ser Tyr Ser Asp Gly Asn Val Thr Leu
115 120 125
Gln Gln Gln Phe Thr Ile Ala Val Ser Ala Ser Tyr Thr Glu Glu Pro
130 135 140
Pro Thr Pro Met Ser Arg Pro Thr Val Asn Ala Ala Asp Thr Ser Thr
145 150 155 160
Tyr Thr Ile Thr Ala Tyr Gly Ala Gly Ser Ile Ala Asp Ala Ile Asn
165 170 175
Pro Ala Ser Tyr Gly Cys Val Tyr Asp Tyr Gly Asn Trp Ile Tyr Asn
180 185 190
Ala Gly Val Val Glu Pro Gly Val Ser Gly Cys Asp Pro Ile Gly Ala
195 200 205
Pro Thr Tyr Arg Thr Pro Gln Val Val Gly Glu Ala Ala Ser Val Pro
210 215 220
Thr Pro Thr His Lys Trp Trp Gly Ser Val Ser Phe Leu Gly Glu Met
225 230 235 240
Lys Ile Gly Asp Pro Ala Gly Ala Gly Tyr Ile Thr Pro Asp Pro Ile
245 250 255
Thr Ala Arg Ile Ser Asn Lys Gly Val Arg Ile Met Gly Ile Pro Asn
260 265 270
Gly Leu Gly Ala Gln Gly Asn Gln Phe Ile Tyr Ser Val Pro Asp Pro
275 280 285
Phe Ser Glu Val Phe Asp Gly Ile Ala Val Ala Asn Ser Glu Tyr Ala
290 295 300
Asn Leu Glu Ala Tyr Leu Lys Ser Tyr Ser Asp Gly Thr Ala Thr Val
305 310 315 320
Gln Trp Gln Ser Gly Asn Leu Pro Val Met Gln Ala Thr Phe Val His
325 330 335
Gly Ser Pro Tyr Val Phe Phe Lys Ala Tyr Arg Gly Asn Met Val Leu
340 345 350
Arg Thr Lys Ala Ala Asp Gly Gly Glu Lys Gly Thr Phe Tyr Asn Glu
355 360 365
Asn Asn Ser Leu Gly Ile Trp Thr Ser Val Ala Gly Asn Lys Asn Asp
370 375 380
Phe Leu Ile Thr Gly Glu Gly Glu Thr Val Phe Asn Asn Ile Glu Thr
385 390 395 400
Asp Thr Ile Thr Leu Thr Asn Ala Ala Asn Glu Phe Thr Leu Thr Leu
405 410 415
Leu Pro Thr Ala Gly Ala Gly Thr Pro Ser Ser Thr Val Ile Gln Ala
420 425 430
Phe Glu Asp Ser Ala Arg Ala Val Val Ala Lys Val Asp Ile Gln Tyr
435 440 445
Ser Val Asp Arg Thr Asn Asn Met Val Thr Val Thr His Thr Tyr Lys
450 455 460
Asn Glu Ser Asn Thr Pro Val Gln Thr Leu Ala Gly Leu Leu Pro Met
465 470 475 480
His Trp Lys Tyr Ser Asp Thr Ala Leu Ser Gly Tyr Lys Thr Arg Ser
485 490 495
Ala Arg Gly Met Val Gln Phe Ala His Ile Asp Ser Phe Ser Tyr Thr
500 505 510
Ile Pro Tyr Val Gly Val Leu Pro Tyr Leu Pro Ser Ser Val Gly Asp
515 520 525
Phe Asp Ser Ser Val Leu Ala Gly Leu Val Gln Ala Phe Val Ala Glu
530 535 540
Gly Pro Glu Asn Trp Asn Pro His Thr Asp Thr Tyr Trp Ser Gly Lys
545 550 555 560
Ala Phe Asn Lys Val Ala Glu Leu Ser Ala Ile Ala Arg Ser Val Gly
565 570 575
Met Thr Ser Glu Ala Asp Thr Leu Leu Asn Trp Leu Lys Ala Glu Leu
580 585 590
Gln Asp Trp Phe Ser Ala Asn Thr Asn Gly Ser Leu Asp Glu Lys Lys
595 600 605
Tyr Phe Val Tyr Asp Ala Glu Trp Asn Thr Leu Leu Gly Leu Glu Glu
610 615 620
Ser Phe Ala Ala His Gln Gln Leu Asn Asp His His Phe His Tyr Gly
625 630 635 640
Tyr Phe Val Arg Ala Ala Ala Glu Ile Cys Arg Val Asp Ala Ser Trp
645 650 655
Cys Gly Ala Asp Gln Tyr Gly Pro Met Val Glu Leu Leu Ile Arg Asp
660 665 670
Tyr Ala Gly Ala Lys Asp Asp Thr Met Phe Pro Tyr Val Arg Asn Phe
675 680 685
Asp Pro Ala Asn Gly Phe Ser Trp Ala Ser Gly Ser Ala Asn Phe Val
690 695 700
Leu Gly Asn Asn Asn Glu Ser Thr Ser Glu Ala Ala Asn Ala Tyr Gly
705 710 715 720
Ala Ile Ile Leu Tyr Gly Leu Ile Thr Gly Asp Asn Glu Leu Val Glu
725 730 735
Arg Gly Met Tyr Leu His Ala Ser Ser Ser Val Ala Tyr Trp Glu Tyr
740 745 750
Trp Asn Asn Ile Asp Arg Tyr Leu Gly Ala Asp Ala Asp Arg Asp Asn
755 760 765
Phe Pro Ser Gly Tyr Asp Lys Leu Thr Thr Ser Ile Ile Trp Gly His
770 775 780
Gly Gly Val Phe Ser Thr Trp Phe Ser Gly Ala Tyr Ala His Ile Leu
785 790 795 800
Gly Ile Gln Gly Leu Pro Thr Asn Pro Leu Ile Phe His Val Gly Leu
805 810 815
His Pro Glu Tyr Met Glu Asp Tyr Val Ala Leu Gly Leu Ser Glu Ser
820 825 830
Ser Asn Asn Lys Pro Ser Gly Leu Ile Asp Asp Gln Trp Arg Asp Ile
835 840 845
Trp Trp Asn Leu Trp Ala Leu Thr Asp Ala Glu Ala Ala Ile Ala Asp
850 855 860
Tyr Asn Thr Val Gly Ser Asn Tyr Ala Pro Glu Phe Gly Glu Thr Lys
865 870 875 880
Ala His Thr Tyr His Trp Leu His Thr Trp Asn Ala Leu Gly His Leu
885 890 895
Lys Thr Gly Thr Gly Glu Leu Thr Val Asn Asp Pro Ala Ala Leu Val
900 905 910
Phe Glu Lys Asp Gly Ile Lys Thr Tyr Val Ala Tyr Asn Phe Ser Gly
915 920 925
Thr Pro Lys Thr Ile Leu Ala Ser Asp Gly Phe Glu Phe Ile Ala Gln
930 935 940
Pro Asn Asp Phe Thr Val Val Thr Thr Ala Asp Asn Asn Pro Asp Asp
945 950 955 960
Thr Gln Pro Pro Thr Leu Pro Ala Asn Leu Gln Ala Leu Asn Leu Thr
965 970 975
Gln Thr Ser Leu Asp Val Lys Trp Asp Ala Ser Thr Asp Asn Tyr Arg
980 985 990
Met Ala Gly Tyr Val Val Gln Val Leu Gln Ala Asp Thr Leu Ile Glu
995 1000 1005
Glu Thr Ser Ser Ile Ala Ser Ile Ala Ser Phe Asn Asn Leu Thr Ala
1010 1015 1020
Ser Thr Ser Tyr Thr Ile Gln Val Lys Ala Lys Asp Arg Ser Gly Asn
1025 1030 1035 1040
Glu Thr Ala Trp Val Ser Ile Thr Val Thr Thr Pro Ser Glu Thr Asp
1045 1050 1055
Asp Leu Leu Pro Thr Leu Asp Gly Gly Val Tyr Ser Ala Asn Val Gly
1060 1065 1070
Pro Asn Ser Ala Asp Leu Ser Trp Ala Ala Ala Thr Asp Asp Arg Gly
1075 1080 1085
Ile Ala Ser Tyr Thr Ile Glu Val Gln Val Gly Gly Ala Val Phe Val
1090 1095 1100
Thr Glu Thr Val Phe Asp Thr Ser Tyr Ala Leu Ser Gly Leu Thr Glu
1105 1110 1115 1120
Ala Thr Glu Tyr Asn Val Ala Val Tyr Ala Thr Asp Thr Gly Gly Gln
1125 1130 1135
Gln Ser Ala Thr Ile Ser Gly Ile Val Asn Thr Thr Ser Asn Pro Phe
1140 1145 1150
Gly Ser Gly Cys Glu Leu Ile Cys Ala Ser Ala Thr Ser Ser Ser Ser
1155 1160 1165
Val Thr Phe Thr Val His Gln Ala Gly Ala Val Asp Ile His Tyr Leu
1170 1175 1180
Val Asn
1185
<210> 173
<211> 3558
<212> DNA
<213> Microbulbifer degradans
<400> 173
atgatactgc gactgatacg tgccgcaatt tacattgtgg cgtttgtttc tcttatcgcc 60
tgtgggggct caagcacatc caagcccgcg caaccggata ttcccgatac ggatctgcct 120
gacacgggaa cacccgaacc cgaaaacagc gatccagttg ttcaatctac gccggctaca 180
gccattgcgg cgggcaacgt gtatagctat caaataatgg ctagcgacgc agacgaatcg 240
gatgtcatta gctatgcggc ggttactttg cctagctggc tggcgttcga cccagatacc 300
ggaatactta ccggcacgcc ccagcaagct catgctggta atgctgaagt ggtgcttagc 360
tatagtgacg gcaacgtaac cctgcagcag caatttacaa ttgcagttag cgcgagctac 420
accgaagagc caccaacacc gatgagtcgc ccaaccgtaa atgcggcgga taccagtact 480
tacactatta ctgcctacgg cgcgggcagt attgcggatg cgattaaccc tgctagctat 540
ggctgtgtgt acgattacgg taattggatt tataacgccg gcgttgtgga gccaggagta 600
agcggctgtg accctatagg cgcaccaacc taccgtacac cgcaggtggt tggggaagcg 660
gcaagcgtgc caaccccaac gcataaatgg tggggctcgg tttctttttt aggtgaaatg 720
aaaataggtg acccggcagg cgcaggttac ataacgccag acccaataac tgcacgtatc 780
tcgaataaag gcgtgcgtat tatgggcata cccaacggct tgggcgcgca aggcaaccag 840
tttatttact ctgtacccga cccttttagc gaggtattcg atggaatagc ggtagcgaat 900
agcgaatatg ccaacctaga agcgtattta aaatcctaca gcgatggcac cgctaccgtg 960
caatggcaga gtggcaacct gccggttatg caggccacat ttgtacacgg ctccccctat 1020
gtgtttttta aagcctatcg cggcaacatg gtgttgcgca ccaaagctgc agacggcggt 1080
gaaaaaggca cgttttataa tgaaaataat agtttaggta tttggaccag tgtagcgggt 1140
aacaaaaatg actttttaat taccggtgaa ggcgaaacgg tttttaacaa tatcgaaacc 1200
gataccatta cgttaaccaa tgcagcgaac gaatttacct taacgttatt acctacagcg 1260
ggcgccggta caccttcaag tactgttatt caggcgtttg aagatagcgc ccgtgcagtg 1320
gttgccaaag tagatattca atactcggta gaccgtacca ataacatggt aacggttact 1380
catacctata aaaatgaaag caatacgcca gtgcaaaccc tcgcggggtt actgcccatg 1440
cattggaagt attccgatac agcattaagc ggctacaaaa cccgaagtgc gcgcgggatg 1500
gtgcaatttg cccatattga ttcgtttagc tacaccattc cttatgtggg agtattaccg 1560
tatttaccgt catcggtagg ggatttcgat tcctctgtgc tcgcaggctt agtgcaggcg 1620
tttgtagctg aagggccaga aaattggaac ccgcacaccg atacctattg gtcgggcaaa 1680
gcgtttaata aagttgccga gttatcggca atagcgcgct cggtaggtat gacaagcgaa 1740
gcagatacgc tgttaaattg gcttaaagcc gaactgcaag attggtttag cgccaacacc 1800
aacggcagtt tggatgagaa aaaatacttt gtgtacgatg ccgagtggaa taccttgctt 1860
ggcctagaag aatcttttgc tgcacaccaa caactaaacg atcaccactt tcactacggc 1920
tattttgtac gcgccgctgc agaaatatgc cgagtagatg caagctggtg tggcgcagac 1980
cagtatggtc ccatggtgga actgcttatt cgcgattacg ccggtgccaa agacgatact 2040
atgttccctt acgtgcgtaa cttcgacccc gccaatggct tttcttgggc atcgggtagt 2100
gccaactttg tgctgggtaa caacaacgaa tccacatccg aagcggccaa tgcctatggt 2160
gcaattattc tgtacgggtt aataactggc gataatgagt tagtggagcg cggtatgtat 2220
ttgcacgcgt cttcgtcggt ggcctactgg gaatattgga acaacatcga ccgctacttg 2280
ggggccgatg ccgaccgcga taattttcca tcgggttacg acaaactaac tacctctatt 2340
atttgggggc atggcggcgt gttctctacc tggttcagtg gcgcttacgc tcatattctt 2400
ggtattcaag ggctgcctac caacccgctt atttttcatg tgggcttaca ccccgagtat 2460
atggaagact atgtggcact gggtttaagt gaatcgagca acaataaacc ttcggggtta 2520
atcgacgatc agtggcgcga tatttggtgg aacctatggg ccttgactga tgcagaagcg 2580
gccatcgccg attacaacac ggtaggcagt aactatgcgc cagagtttgg tgaaaccaag 2640
gcgcacacct accactggtt gcacacgtgg aatgcgctcg gccatttaaa aaccggcact 2700
ggcgagctaa cggtaaacga ccccgctgca ttagtgtttg aaaaggacgg aataaaaacc 2760
tacgtagcgt ataactttag cggtacacca aaaactattt tggctagcga tggctttgag 2820
tttattgcgc agcccaatga ttttaccgtg gtgaccaccg ccgataacaa ccccgacgac 2880
acgcaaccgc ctacactacc ggcaaacttg caagcgctta accttaccca aacgagcttg 2940
gatgttaaat gggatgcatc taccgataac taccgtatgg caggttatgt agtgcaggta 3000
ttacaagccg atacattaat agaagaaacg tcttctatag ccagcatcgc ctcgtttaat 3060
aacttaacgg ccagcacaag ttacaccatt caagttaaag ctaaggatcg ctctggcaat 3120
gaaaccgcat gggtaagcat tacggtaacc acgcccagcg aaaccgacga tttactacca 3180
acacttgatg gcggtgtata tagcgcaaac gtggggccta actctgcgga tttaagctgg 3240
gcggcagcaa cagatgatcg cggtatagca agttacacca tagaggtgca ggttggcggc 3300
gcagtatttg taactgaaac agtgtttgat acttcgtatg cgcttagcgg attgaccgaa 3360
gcaaccgaat acaatgtggc ggtttatgct acagataccg gcggtcaaca atctgccact 3420
attagcggta ttgttaatac caccagcaat ccatttggca gcggttgcga attgatttgt 3480
gccagcgcta cctctagctc atctgttacg tttactgtgc atcaagcggg cgcggtagat 3540
attcactact tagtgaac 3558
<210> 174
<211> 371
<212> PRT
<213> Microbulbifer degradans
<400> 174
Met Gly Ser Ser Phe Leu Thr Cys Tyr Lys Pro Ser Lys Glu Val Ile
1 5 10 15
Thr Val Lys Arg Ala Ala Ile Asn Leu Val Leu Ile Pro Gly Leu Ala
20 25 30
Val Ser Met Gly Thr Leu Ser Ser Ser Ala Phe Ala Gln Ser Thr Cys
35 40 45
Ser Val Asp Tyr Arg Val Glu Ser Asp Trp Gly Ala Gly Ala Thr His
50 55 60
Lys Val Leu Val Thr Asn Thr Gly Ala Pro Ile Asn Gly Trp Gln Met
65 70 75 80
Ser Trp Thr Phe Ser Gly Asn Glu Gln Ile Thr Asn Leu Trp Asn Gly
85 90 95
Met Phe Thr Gln Ser Ser Gln Ser Val Ile Val Asp Ser Leu Ser Trp
100 105 110
Asn Ala Gln Leu Asn Thr Gly Ala Thr Ala Glu Val Gly Phe Asn Ile
115 120 125
Asn Ser Pro Ser Gly Gln Leu Pro Asp Val Tyr Leu Asn Gly Val Asn
130 135 140
Cys Ser Asn Pro Gly Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu
145 150 155 160
Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu
165 170 175
Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu
180 185 190
Pro Glu Pro Glu Pro Glu Leu Ala Tyr Ser Leu Asp Ala Thr Ala Ser
195 200 205
Phe Leu Asn Phe Val Ser Ser Lys Lys Thr His Val Leu Glu Thr His
210 215 220
Arg Phe Asp Val Leu Ser Gly Gly Ile Ser Thr Ala Gly Glu Ala Gln
225 230 235 240
Leu Val Ile Asp Leu Asn Ser Val Asn Thr Gly Ile Asp Val Arg Asn
245 250 255
Gly Arg Met Arg Asp Tyr Leu Phe Glu Thr Ala Thr Tyr Ser Val Ala
260 265 270
Thr Val Thr Val Pro Val Asp Leu Ala Ala Val Ala Gly Leu Ala Val
275 280 285
Gly Glu Asp Met Leu Val Asp Val Ser Ala Thr Leu Asp Leu His Gly
290 295 300
Val Pro Gly Val Ile Asp Thr Gln Leu Asn Val Gln Arg Leu Ser Ala
305 310 315 320
Thr Arg Ile Met Val Gln Asn Gln Ser Pro Leu Leu Ile Lys Ala Ala
325 330 335
Asp Tyr Ser Leu Glu Ala Gly Ile Glu Thr Leu Arg Asn Leu Ala Ser
340 345 350
Leu Asn Val Ile Ser Thr Thr Val Pro Val Asp Phe Val Leu Phe Tyr
355 360 365
Glu Ala Pro
370
<210> 175
<211> 1116
<212> DNA
<213> Microbulbifer degradans
<400> 175
atgggaagca gttttctcac ctgttataaa cctagcaaag aggtaataac cgtgaagcga 60
gcagcaataa acttggtgct gatacccggg ctagccgtaa gtatggggac gttgtcgtct 120
tctgcttttg cacaaagcac ttgtagtgtg gactatcgag tagagtccga ctggggtgca 180
ggggctacgc acaaagtatt agttactaac actggagccc ccataaacgg gtggcaaatg 240
tcatggactt tttctggcaa cgagcaaatc actaaccttt ggaatggcat gttcacccaa 300
agctctcaaa gtgtgattgt tgatagtctg tcgtggaatg cccagctaaa taccggtgcc 360
accgcagaag tggggtttaa tattaactct ccttcagggc agttgccgga tgtgtatttg 420
aacggcgtaa attgtagtaa ccctggtgag ccagagccag agccagagcc agagccagag 480
ccagagccag agccagagcc agagccagaa cctgagcctg agccagaacc tgagccagaa 540
cctgagccag aacctgagcc agaacctgag ccagagccag agccagagcc agaattggcc 600
tattcattag atgcgactgc atctttctta aactttgtta gctcaaagaa aactcacgtt 660
cttgaaacac accgttttga cgtgctttct ggcggtatta gtactgcggg tgaggcacaa 720
cttgttatcg atcttaacag tgtgaacaca ggtatagacg tgcgcaacgg acgtatgcgc 780
gattacctgt ttgagactgc aacctattct gtagcaaccg tgactgtgcc tgtagatcta 840
gccgcagtag caggtttagc cgtgggcgaa gatatgctgg tggatgtatc cgcaactttg 900
gatcttcacg gtgtgccagg cgttatcgat acgcagctaa atgtacagcg cttatcggca 960
actcgcatta tggttcagaa ccaatctcca ctattaataa aggcagccga ttattcgctt 1020
gaggcgggta tcgagacatt gcgtaatctc gcgagcctaa atgttattag caccacggtg 1080
ccagtggatt ttgttttgtt ctacgaagcg ccttaa 1116
<210> 176
<211> 1042
<212> PRT
<213> Microbulbifer degradans
<400> 176
Met Lys Lys Ile Phe Lys Ile Ser Ala Leu Ser Leu Gly Phe Ser Ile
1 5 10 15
Ala Gly Ala Ala Ser Ala Ala Asp Leu Cys Asn Val Thr Tyr Glu Ala
20 25 30
Val Asn Ser Trp Gly Ser Gly Ala Gln Gln Ala Val Thr Val Val Asn
35 40 45
Asn Gly Pro Ala Leu Asn Ala Trp Gln Leu Ser Trp Thr Phe Asn Gly
50 55 60
Ser Glu Asn Ile Asp Asn Leu Trp Asp Gly Val Leu Ser Gln Thr Gly
65 70 75 80
Ala Asn Val Thr Val Asn Asn Ala Gly Tyr Asn Gly Ser Val Gly Thr
85 90 95
Gly Gly Gln Phe Ser Phe Gly Phe Thr Val Ser Gly Trp Ser Glu Asn
100 105 110
Phe Pro Thr Glu Phe Tyr Leu Asn Gly Glu Ala Cys Ser Gly Ala Val
115 120 125
Asp Pro Asn Pro Asn Pro Asn Pro Glu Pro Thr Asp Gly Ala Val Trp
130 135 140
Glu Leu Asn Ser Ala Asp Ser Val Phe Ser Phe Val Thr Val Lys Lys
145 150 155 160
Glu His Val Ala Glu Val Gln Thr Phe Thr Ala Tyr Asn Ala Thr Val
165 170 175
Asp Ser Asp Gly Val Ala Thr Leu Ala Ile Asp Leu Asn Ser Ala Glu
180 185 190
Thr Asn Ile Asp Ile Arg Asn Glu Arg Phe Arg Asn Val Leu Phe Glu
195 200 205
Thr Ala Phe Leu Pro Thr Leu Tyr Tyr Ser Val Gln Leu Asp Met Ala
210 215 220
Ser Leu Ser Ala Leu Ala Val Gly Asp Ala Gln Thr Gln Thr Leu Gly
225 230 235 240
Gly Thr Leu Thr Leu His Gly Val Gln Ala Val Val Glu Ala Glu Val
245 250 255
Leu Val Val Lys Thr Ser Ala Thr Asp Leu Thr Val Ser Thr Ser Lys
260 265 270
Pro Ile Leu Ile Lys Ala Ala Asp Phe Asp Leu Val Ser Gly Val Glu
275 280 285
Ser Leu Arg Ala Leu Ala Ser Leu Ser Ser Ile Gly Gln Thr Val Pro
290 295 300
Val Tyr Phe Arg Leu Asp Phe Asp Ala Ala Asp Pro Gln Val Thr Asn
305 310 315 320
Ala Val Ala Val Pro Ala Thr Pro Ala Ala Pro Thr Ser Leu Thr Ala
325 330 335
Asp Phe Thr Glu Ser Ser Gly Ile Ala Ala Leu Asn Trp Asn Asp Ala
340 345 350
Ser Asn Asn Glu Thr Glu Phe Leu Val Arg Arg Arg Glu Ala Ser Thr
355 360 365
Gly Tyr Trp Ser Arg Leu Thr Glu Val Asn Ala Asn Ser Thr Leu Leu
370 375 380
Asp Asp Leu Leu Leu Glu Glu Asp Thr Tyr Asp Tyr Lys Val Ile Ala
385 390 395 400
Leu Asn Asn Gly Val Pro Ser Ala Pro Ser Pro Val Ala Thr Val Val
405 410 415
Ala Thr Thr Asn Pro Asn Pro Glu Pro Leu Thr Gly Glu Glu His Tyr
420 425 430
Gln Ala Lys Cys Ala Ser Cys His Gly Asp Asp Ala Ser Gly Gly Val
435 440 445
Val Gly Val Ala Leu Asn Thr Glu Arg Asp Leu Thr Val Met Leu Asn
450 455 460
Thr Ile Val Thr Arg Met Pro Pro Gly Glu Ala Asp Asn Cys Asp Gln
465 470 475 480
Glu Cys Ala Glu Ala Ile Gly Gly Tyr Ile Gln Thr Thr Phe Trp Asn
485 490 495
Gly Gly Glu Pro Glu Pro Glu Leu Ala Cys Asp Thr Val Thr Tyr Gly
500 505 510
Ala Arg Gln Leu Lys Leu Leu Thr Lys Ala Glu Tyr Gln Arg Ser Val
515 520 525
Glu Asp Leu Val Gly Ile Asp Tyr Asn Val Ala Ser Gly Leu Ala Glu
530 535 540
Asp Asn Ile Ile Gly Tyr Phe Val Asn Asn Thr Thr Lys Val Val Val
545 550 555 560
Pro Thr Val Tyr Asp Gln Tyr Leu Thr Val Ala Glu Glu Ile Ala Gln
565 570 575
Trp Ser Ala Asp Arg Asn Phe Ala Gly Ala Leu Thr Cys Gly Thr Asn
580 585 590
Phe Asn Gln Thr Cys Ala Asn Gln Phe Val Asn Asn Phe Ala Pro Lys
595 600 605
Val Phe Arg Arg Ala Leu Ser Ser Asp Glu Ala Ala Ala Tyr Leu Ala
610 615 620
Ile Ala Asn Gly Ser Ala Thr Asn Gly Asp Val Lys Ala Gly Ile Gln
625 630 635 640
Leu Ala Met Glu Gly Leu Phe Ser Ser Pro Gln Phe Val Tyr Arg His
645 650 655
Glu Leu Gly Glu Ala Asn Pro Asn Asn Asn Ala Ile Asp Ser Asp Ala
660 665 670
Phe Glu Leu Thr Ser Tyr Glu Met Ala Thr Trp Leu Ser Tyr Thr Tyr
675 680 685
Ala Gly Thr Thr Pro Asp Ala Ile Ala Met Gln Lys Ala Ala Asn Asn
690 695 700
Gln Leu Arg Thr Asp Ala Glu Ile Arg Ala Glu Ala Gln Arg Leu Leu
705 710 715 720
Glu Gly Ala Gly Ala Lys Gln Lys Met Gly Asp Phe Val Ala Ser Trp
725 730 735
Leu Gly Thr Asp His Ile Ala Asn Ala Pro Lys Asp Ala Ser Val Trp
740 745 750
Pro Gly Phe Asp Ala Leu Ile Pro His Leu Gln Thr Glu Ile Arg Glu
755 760 765
Met Phe Ser Tyr Val Met Leu Glu Pro Thr Glu Ser Phe Ala Ser Val
770 775 780
Tyr Asn Ala Asn Tyr Thr Phe Val Asn Gly Pro Leu Ala Gln His Tyr
785 790 795 800
Gly Ile Asn Gly Val Ser Gly Asn Glu Phe Gln Lys Val Thr Thr Thr
805 810 815
Asp Arg Gly Gly Ile Leu Ala Asn Gly Ala Phe Met Ala Arg Trp Gly
820 825 830
Glu Ser Val Glu Ser Ser Pro Ile Arg Arg Ser Val Arg Val Arg Arg
835 840 845
Arg Met Leu Cys Gln Asp Gln Pro Asp Pro Pro Gly Asn Val Asn Ile
850 855 860
Gly Arg Glu Asn Ala Ala Asp Glu Phe His Glu Ala Leu Ala Asp Pro
865 870 875 880
Thr Thr Thr Asn Arg Glu Arg Tyr Glu Leu Leu Thr Ser Gly Glu Thr
885 890 895
Cys Ala Thr Cys His Gln Glu Trp Ile Asn Pro Leu Gly Phe Gly Met
900 905 910
Glu Asp Phe Thr Ala Val Gly Thr Arg Arg Val Thr Asp Leu Asn Gly
915 920 925
Asn Thr Ile Asp Ala Ser Gly Gln Leu Tyr Ala Pro Glu Asn Leu Asn
930 935 940
Asp Lys Asp Val Phe Ile Asn Phe Asn Gly Thr Gln Gly Leu Gly Ala
945 950 955 960
Leu Leu Thr Thr Leu Pro Ser Ala Gln Ser Cys Leu Pro Gln Asn Leu
965 970 975
Phe Arg Tyr Ser Val Gly Val Gly Val Glu Gly Leu Asp Asp Asn Pro
980 985 990
Glu Gly Asn Glu Leu Val Pro Ala Glu Arg Asp Gly Tyr Ala Cys Glu
995 1000 1005
Val Lys Asn Leu Thr Ser Thr Met Leu Glu Gln Ser Pro Arg Ala Met
1010 1015 1020
Leu Glu Gly Met Gly Ser Met Gln Ala Val Arg Tyr Arg Lys Ala Trp
1025 1030 1035 1040
Ala Arg
<210> 177
<211> 3129
<212> DNA
<213> Microbulbifer degradans
<400> 177
atgaaaaaaa tattcaagat ttcagcgctc tcgctaggtt tctcaattgc tggcgctgca 60
tctgcagctg atttgtgcaa tgtaacctac gaagcggtaa atagctgggg ctccggtgcc 120
cagcaagcgg taaccgttgt aaataacggc cctgctttaa atgcctggca gttaagctgg 180
acatttaacg gcagcgaaaa tatagacaac ctatgggatg gcgtgctttc gcaaactggt 240
gcgaatgtta ccgttaacaa tgctggctat aacggcagtg ttggtactgg cggtcaattt 300
agcttcggtt ttaccgtaag cggttggagc gaaaactttc ccacagagtt ttacctaaat 360
ggtgaagcgt gtagtggagc tgttgaccct aatcccaatc ctaacccaga gccaactgat 420
ggcgctgtgt gggagttaaa ctcagccgat tcagtgttta gctttgtaac tgttaaaaaa 480
gagcacgttg ccgaagttca aacgtttact gcctacaacg caacagttga tagcgatggc 540
gttgcaacgc ttgctataga cttaaacagt gcagaaacga acattgatat acgcaacgag 600
cgcttccgca atgtattgtt tgaaacagct ttccttccta ctctgtacta cagtgttcag 660
ctagatatgg cttcactgtc tgcattggcg gttggcgatg ctcaaacgca aacgcttggt 720
ggcacgctaa cactacacgg tgtacaagca gtggttgagg ctgaagtgtt ggttgttaaa 780
actagcgcta ccgacttaac cgtatcaacg tctaagccta ttttaattaa ggctgccgat 840
tttgatttgg taagcggtgt agagtctttg cgagccttgg ctagtctttc aagtattggt 900
cagacggtac ctgtttactt ccgtttagac ttcgatgctg cagacccgca agtaactaat 960
gcggttgctg ttcctgctac tcctgcggcg ccaacgtctc taactgcaga cttcacagaa 1020
agcagtggta ttgccgcgtt gaactggaat gatgcaagca ataacgaaac cgagtttttg 1080
gttcgtcgtc gtgaagcgtc aaccggttac tggtcaagac tgaccgaagt taacgcgaac 1140
agtactttgt tagatgatct tctgttagaa gaagacacct acgactacaa agtgattgca 1200
ttgaacaacg gtgtgccttc tgcgccatcg ccagttgcaa ctgttgtagc tactaccaac 1260
cctaacccag aaccattgac tggcgaagag cactatcaag ctaagtgcgc aagctgtcac 1320
ggtgatgacg ctagcggtgg tgtagttggt gttgctttga atactgagcg cgatttaacc 1380
gttatgctga acaccattgt tactcgcatg cctccaggcg aagcggataa ctgtgatcaa 1440
gagtgtgcgg aagcaattgg tggttatatt caaactactt tctggaacgg cggtgagcca 1500
gagccagaat tagcttgtga cactgttact tacggtgctc gtcagttgaa gctacttacc 1560
aaagctgagt atcaacgttc tgttgaagac ttagttggta ttgattacaa cgtagcaagc 1620
ggcttagcag aagataacat tattggttac ttcgttaata acaccaccaa agtggttgtg 1680
ccaacggttt acgatcagta tctcacagta gctgaagaaa tcgcgcagtg gtctgctgac 1740
agaaactttg cgggtgcatt aacttgtggc actaacttta accaaacctg tgcgaatcag 1800
tttgttaaca acttcgctcc aaaagtattc cgacgcgcgc tttctagcga tgaagctgcg 1860
gcttacttgg cgattgctaa cggcagcgct accaacggtg atgttaaagc gggtattcaa 1920
cttgcgatgg aaggcttgtt ctcttctccg caatttgtat atagacacga gcttggtgaa 1980
gccaatccaa ataacaatgc tatcgactct gacgcgtttg aattaacgtc ttatgaaatg 2040
gctacttggt tgtcttacac ttatgccggt accacaccag atgcaattgc tatgcaaaaa 2100
gctgcaaaca atcagttgcg cacagacgcg gaaattcgcg ccgaagctca acgtttgtta 2160
gaaggtgctg gtgcgaagca aaaaatgggt gacttcgttg ctagctggtt aggtactgac 2220
cacattgcaa acgcacctaa agatgcttct gtatggcctg gttttgatgc gttaattccg 2280
catttacaaa ctgaaatacg cgaaatgttc tcgtacgtaa tgttagagcc aactgaaagc 2340
tttgcttcgg tttacaacgc taactatacc ttcgtgaacg gaccattggc gcaacactac 2400
ggcattaacg gtgtaagcgg taatgaattc caaaaagtaa caactaccga tcgcggtggc 2460
attttggcga acggtgcgtt tatggcgcgc tggggtgaaa gtgtagagtc ttcaccaatt 2520
cgtcgttctg tgcgcgtacg ccgtcgtatg ctttgtcagg atcaaccaga tccccctggc 2580
aacgtaaaca tcggtcgtga aaacgctgca gacgaattcc acgaggcgtt ggcagatcct 2640
actacgacta accgtgagcg ttatgagctg ttaacatcag gtgaaacttg tgctacttgt 2700
caccaagagt ggattaaccc tctcggcttt ggtatggaag atttcactgc ggttggtacg 2760
cgtcgtgtga cggatctaaa tggcaatact attgatgcaa gcggtcaatt gtacgcgcca 2820
gaaaacctca acgataaaga cgtctttatt aactttaacg gtactcaagg tttaggtgcg 2880
ttactgacta ccttgccaag tgcgcagtct tgtttgccac aaaacttgtt ccgttattcc 2940
gtgggcgttg gtgtagaagg gttggatgat aatccagaag gcaacgagct agttcctgca 3000
gaacgggatg gctatgcctg tgaagttaaa aacctgacga gcactatgtt agagcaaagt 3060
ccgcgagcaa tgctggaagg catgggttcc atgcaagcgg tacgttaccg caaagcgtgg 3120
gcgcgttaa 3129
<210> 178
<211> 933
<212> PRT
<213> Microbulbifer degradans
<400> 178
Met Tyr Pro Leu Pro Phe Asn Ser Lys Ile Val Ile Ser Leu Gly Ala
1 5 10 15
Met Thr Leu Ala Leu Ala Thr Gln Gln Ala Gln Ala Leu Ser Cys Thr
20 25 30
Val Ser Ala Asp Ser Trp Asn Ser Gly Tyr Thr Ala Asn Val Thr Val
35 40 45
Val Asn Asp Ser Ser Tyr Ser Ile Asn Ser Trp Asp Val Thr Leu Gly
50 55 60
Phe Asn Gln Pro Pro Ser Val Ser Ala Gly Trp Asn Ala Asn Val Ser
65 70 75 80
Thr Val Gly Thr Thr Val Met Ala Ser Asn Val Gly Tyr Asn Gly Asn
85 90 95
Leu Ser Pro Gly Gln Ser Thr Ser Phe Gly Phe Gln Gly Ala His Asn
100 105 110
Gly Asn Phe Glu Leu Pro Ser Cys Ala Gly Gly Met Thr Ser Ser Ser
115 120 125
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
130 135 140
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
145 150 155 160
Ser Ser Ser Asn Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
165 170 175
Thr Ser Ser Ser Ser Gly Ser Gly Gly Ala Asn Thr Val Thr Ile Glu
180 185 190
Leu Glu Asn Leu Ser Gly Gln Asn Gly Phe Ser Pro Phe Ser Val Gln
195 200 205
Asn Asp Ser Ser Ala Ser Gly Gly Gln Tyr Ile Val Trp Pro Asn Asn
210 215 220
Gly Asp Gln Leu Leu Ser Gly Ala Ser Asp Gly Gln Ser Gly Thr Leu
225 230 235 240
Ala Val Ser Phe Glu Leu Ser Gln Thr Ala Asn Val Ser Phe Asp Ile
245 250 255
Arg Ala Ser Leu Ala Asn Gly Asn Asp Asp Ser Phe Tyr Tyr Lys Leu
260 265 270
Asp Ser Gly Val Trp Ser Thr Gln Asn Asn Thr Ser Thr Ser Gly Phe
275 280 285
Glu Thr Leu Ser Pro Thr Thr Phe Asn Gly Val Ser Ala Gly Val His
290 295 300
Thr Leu Tyr Ile Gln Arg Arg Glu Asp Gly Ala Lys Leu Asp Ser Leu
305 310 315 320
Ser Leu Thr Ala Ser Val Gly Asn Ile Ile Ser Ser His Ala Asn Ser
325 330 335
Ser Ser Ser Ser Ser Ser Ser Gly Ser Ser Ser Ser Ser Ser Ser Ser
340 345 350
Ser Ser Ser Gly Gly Asn Glu Leu Val Ile Ala Ile Asn Ala Gly Gly
355 360 365
Gly Ala Thr Ser Leu Asp Gly Val Asn Phe Val Ala Asp Val His Ser
370 375 380
Leu Gly Gly Ser Thr Gly Ser Thr Thr Asp Ser Ile Ala Gly Ala Thr
385 390 395 400
Ser Ser Thr Leu Tyr Gln Thr Glu Arg Tyr Gly Ser Tyr Ser Tyr Ala
405 410 415
Val Pro Val Thr Asn Ala Thr Tyr Ser Leu Lys Leu His Phe Ala Glu
420 425 430
Ile Tyr His Thr Glu Ala Gly Ala Arg Ser Phe Asn Leu Ala Val Glu
435 440 445
Gly Gln Gln Glu Met Ala Ser Val Asp Leu Tyr Ser Leu Ser Gly His
450 455 460
Asp Gly Ala Tyr Thr Tyr Glu Val Asn Asp Phe Pro Val Asp Asp Gly
465 470 475 480
Glu Val Thr Ile Ser Leu Glu Ser Leu Thr Asp Asn Gly Thr Leu Ser
485 490 495
Gly Phe Glu Leu Ser Ser Ser Asp Gly Gly Glu Tyr Val Glu Pro Thr
500 505 510
Pro Ile Pro Thr Pro Glu Pro Gly Glu Gly Gly Arg Tyr Arg Val Ile
515 520 525
His Thr Thr Asp Met Gly Ala Asp Pro Asp Asp Glu Gln Ser Leu Val
530 535 540
Arg Gln Leu Val Met Ala Asn Glu Tyr Asp Leu Glu Gly Ile Ile Thr
545 550 555 560
Thr Thr Gly Cys Trp Lys Lys Ser Thr Ser Asn Thr Ala Tyr Val Asp
565 570 575
Arg Ile Leu Asn Ala Tyr Ser Gln Ala Tyr Pro Asn Leu Ser Lys His
580 585 590
Ala Glu Gly Phe Pro Thr Pro Ala Tyr Leu Asp Ser Ile Asn Val Met
595 600 605
Gly Gln Arg Gly Tyr Gly Met Gly Asp Val Gly Ser Gly Lys Asp Ser
610 615 620
Ala Gly Ser Asn Leu Ile Ile Ala Ala Val Asp Lys Asp Asp Pro Arg
625 630 635 640
Pro Val Trp Ala Thr Cys Trp Gly Gly Cys Asn Thr Ile Ala Gln Ala
645 650 655
Val Trp Lys Val Gln Asn Thr Arg Ser Gln Ala Gln Leu Asp Ala Phe
660 665 670
Ile Ser Lys Leu Arg Val Tyr Asp Ile Leu Gly Gln Asp Asn Ala Gly
675 680 685
Thr Trp Leu Ala Lys Asn Phe Pro Asn Leu Ile Tyr Ile Arg Ala Arg
690 695 700
Ser Val Tyr Ser Trp Gln Pro Ser Asp Ser Tyr Leu Asp Asn His Ile
705 710 715 720
Gln Ser His Gly Ala Leu Gly Ala Val Tyr Pro Asn Arg Arg Tyr Ala
725 730 735
Thr Glu Gly Asp Thr Pro Ala Phe Leu His Met Ala Asn Pro Gly Leu
740 745 750
Asn Asp Pro Ser Val Val Ser Met Gly Gly Trp Gly Gly Arg Phe Pro
755 760 765
Ser Lys Gln Ala Gly Val Arg Gly Met Ser Cys Met Ser Gly Glu Asp
770 775 780
Ala Val Tyr Asp Thr Tyr Tyr Met Tyr Thr Glu Asn Gly Glu Ser Ile
785 790 795 800
Lys Arg Trp Ser Thr Ala Ile His Asn Asp Phe Gln Ala Arg Met Asp
805 810 815
Trp Ala Ile Glu Ser Asn Tyr Ser Ala Ala Asn His His Pro Val Pro
820 825 830
Val Val Asn Asn Asp Ala Asn Glu Ala Val Met Tyr Leu Asn Ala Ser
835 840 845
Ala Gly Ser Thr Val Ser Leu Asp Ala Ser Gly Ser Ser Asp Pro Asp
850 855 860
Gly Asp Ser Leu Asn Tyr Ser Trp Ser His Tyr Gly Glu Ala Asp Ser
865 870 875 880
Tyr Ser Gly Ser Val Ser Ile Ser Asn Ser Ser Ser Ala Ser Ala Asn
885 890 895
Val Gln Ile Pro Ser Asn Ala Gly Gly Lys Asp Ile His Ile Leu Leu
900 905 910
Thr Leu Arg Asp Asn Gly Ser Pro Asn Leu Tyr Ala Tyr Arg Arg Val
915 920 925
Val Ile Asn Val Gln
930
<210> 179
<211> 108
<212> PRT
<213> Microbulbifer degradans
<400> 179
Met Tyr Trp Pro Pro Leu Ala Leu Leu Ser Phe Cys Thr Leu Asn Gly
1 5 10 15
Glu Lys Pro Phe Trp Pro Leu Lys Phe Ser Ser Ser Ile Val Thr Val
20 25 30
Leu Ala Pro Pro Leu Pro Glu Glu Glu Leu Val Leu Glu Leu Asp Glu
35 40 45
Leu Leu Leu Glu Leu Glu Glu Leu Glu Glu Leu Glu Glu Leu Glu Glu
50 55 60
Leu Glu Glu Leu Glu Glu Leu Glu Glu Leu Glu Glu Leu Glu Glu Leu
65 70 75 80
Glu Glu Leu Glu Glu Leu Glu Glu Leu Glu Glu Asp Asp Glu Val Ile
85 90 95
Pro Pro Ala Gln Glu Gly Asn Ser Lys Leu Pro Leu
100 105
<210> 180
<211> 2802
<212> DNA
<213> Microbulbifer degradans
<400> 180
atgtatccct tgccatttaa ttcaaagata gttatttcgc ttggtgcaat gacgctcgcg 60
ctggcgacgc agcaagctca ggcgcttagt tgtacggttt cggccgacag ctggaatagc 120
ggttatacgg ccaatgtaac agttgtcaac gatagtagtt acagtattaa tagttgggac 180
gttacgcttg gttttaatca gccgccaagc gtgagcgccg gttggaatgc gaatgtgtct 240
actgttggca ccacagtcat ggccagcaac gtgggttaca atggaaactt gtcgcccggg 300
cagtctacat cctttggttt tcagggggct cacaacggca attttgaatt gccttcttgc 360
gctggaggta tgacttcatc atcctcttct agttcttcta gttcttctag ttcttctagt 420
tcttctagtt cttctagttc ttctagttct tctagttctt ctagttcttc tagttcttct 480
agttcttcta attcttctag ctcgagcagc agctcgtcga gttctagtac cagctcttct 540
tccggcagcg ggggcgctaa tacagtgact attgagctag agaatttgag cggccagaat 600
ggcttttcac catttagcgt gcagaatgac agtagtgcca gtggcggcca gtacatcgtt 660
tggcccaata atggtgacca actgctaagc ggagcctccg atggccaatc tggtactttg 720
gcagtgtcgt ttgaattgtc gcaaacggca aatgtgtcct tcgatataag agcgagcttg 780
gcaaatggca atgatgattc gttttattac aaacttgact ctggtgtttg gagtacccag 840
aacaacactt ctaccagcgg ttttgaaact ctctcgccaa ctacctttaa cggagtgtct 900
gctggtgttc atactcttta tattcagcgc cgagaagatg gagctaagct cgatagtctg 960
agcctgactg cctctgtggg caatattatc agcagccatg ctaatagcag ttctagttct 1020
tcttccagcg gctcaagttc gtctagctct tcttccagtt caagtggtgg taatgagcta 1080
gtaattgcta taaacgctgg gggcggtgcc acatctctag atggtgtgaa ttttgtagct 1140
gatgttcatt cattgggcgg ctccaccggt tccaccaccg acagcatcgc aggcgctact 1200
agcagtactt tgtatcaaac agagcgctac ggcagctaca gctacgctgt tcccgtaact 1260
aatgccactt attcgttaaa gctgcatttt gctgagattt accatactga agcgggggcc 1320
cgctcgttta atcttgctgt ggaaggtcaa caggagatgg ctagtgtaga tctgtattca 1380
ctgtcagggc acgatggtgc ttatacctac gaggtgaatg atttcccggt ggatgatggc 1440
gaggttacga tttcgcttga atcacttacc gataacggca cgcttagtgg tttcgagctt 1500
tcctcctctg acggtggtga atatgtggaa ccaacaccta tacctacgcc agagccgggt 1560
gaaggcggtc gttaccgagt gattcacacc acggatatgg gggcagaccc agacgatgag 1620
cagtcgttgg ttcgccagtt ggtaatggcc aacgagtatg acctagaggg cattattacg 1680
acaacaggtt gttggaaaaa atccaccagt aacacggcct atgttgacag gatcctcaac 1740
gcctacagcc aggcatatcc caatttgagc aagcatgcgg aaggattccc aactccggcg 1800
tatctggatt ccattaatgt gatggggcag cgtggctacg gtatgggtga tgtcggctct 1860
ggcaaagact ctgccgggtc caacctgatt attgctgcag tggataaaga tgatccacgt 1920
cctgtatggg caacttgttg gggtggatgt aatactattg ctcaggcagt gtggaaagtg 1980
caaaacacgc gctcgcaagc gcaattggat gcttttatca gcaagttacg tgtatacgat 2040
attctcgggc aagacaatgc gggcacttgg ctggctaaaa actttccaaa ccttatctac 2100
attcgcgcgc gatctgttta cagctggcaa ccctctgata gttatctgga taatcacatt 2160
cagagtcacg gtgcactggg tgctgtatac ccaaatcgtc gatatgcaac agagggggat 2220
acaccagcat ttctgcatat ggctaatccg ggattgaacg acccatcagt ggtttccatg 2280
gggggctggg gtgggcgttt ccctagcaaa caagctggtg tacgtggtat gtcgtgcatg 2340
agtggtgaag atgccgtcta cgatacctac tacatgtata ctgagaatgg cgaatccatt 2400
aagcggtgga gcactgcaat tcacaacgac tttcaggcac gtatggattg ggccattgag 2460
agtaactatt ctgcagcgaa ccaccaccca gtacccgttg ttaataatga tgctaacgag 2520
gcagtaatgt atctcaatgc gtctgcgggt tcgacagttt cgctggatgc tagcggctcg 2580
agtgacccgg atggagatag ccttaattat tcgtggtctc actacggcga agcggattct 2640
tacagtggtt ccgtgagcat tagtaatagt agttcagcca gcgccaatgt tcagattccg 2700
tcgaatgctg gtgggaagga tatccacatt ttgctaaccc tgcgtgataa cggttcccct 2760
aacctctatg cctaccgccg cgtggttatt aacgtgcaat aa 2802
<210> 181
<211> 584
<212> PRT
<213> Microbulbifer degradans
<400> 181
Met Arg Cys Val Leu Val Lys Asn Tyr Gln Ile Ile Lys Leu Lys Asn
1 5 10 15
Tyr Ser His His Val Cys Trp Arg Lys Phe Met Ser Thr Gln Gly Lys
20 25 30
Pro Cys Leu Lys Lys Leu Trp Pro Ala Leu Ala Leu Ala Ala Ala Val
35 40 45
Ala Ser Pro Asn Ala Phe Ser Ala Cys Glu Tyr Val Val Ser Asn Gln
50 55 60
Trp Asp Ser Gly Tyr Ser Ala Thr Ile Lys Ile His Asn Asp Thr Asn
65 70 75 80
Ser Thr Ile Asn Gly Trp Asn Val Asn Trp Gln Tyr Ser Gly Asp Asn
85 90 95
Arg Val Thr Asn Leu Trp Asn Ala Ala Tyr Val Gly Ser Asn Pro Tyr
100 105 110
Ser Ala Ser Asn Leu Ser Trp Asn Ser Thr Val Gly Ala Gly Gln Thr
115 120 125
Ile Glu Phe Gly Phe Gln Gly Ala Lys Asn Gly Gly Ser Ala Glu Val
130 135 140
Pro Val Val Thr Gly Asp Val Cys Ser Gly Gly Gly Ser Ser Ser Gly
145 150 155 160
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Gly Ser Ser Thr Ser
165 170 175
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Gly Gly Ser Thr Ser
180 185 190
Ser Ser Ser Ser Ser Ser Ser Thr Gly Gly Asn Gly Gly Ala Gln Gln
195 200 205
Cys Asn Trp Tyr Gly Glu Ile Arg Pro Leu Cys Asn Asn Gln Asp Ser
210 215 220
Gly Trp Gly Trp Glu Asn Gln Gln Ser Cys Ile Gly Arg Asp Thr Cys
225 230 235 240
Ser Asn Gln Ser Gly Asn Gly Gly Ile Ile Gly Gly Ser Ser Ser Ser
245 250 255
Ser Ser Ser Ser Ser Thr Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser
260 265 270
Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser
275 280 285
Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
290 295 300
Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
305 310 315 320
Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
325 330 335
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
340 345 350
Ser Gly Gly Gly Val Ile Phe Ser Ala Asp Phe Glu Ser Asp Ser Asp
355 360 365
Ser Thr Gln Pro Ala Gly Trp Asp Asn Phe Ile Gly Trp Arg Gln Asn
370 375 380
Gly Pro Asn Pro Asn Gly Ser Thr Phe Ala Leu Val Glu Ser Gly Arg
385 390 395 400
Ala His Ser Gly Asn Asn Ala Val His Phe Ser Gly Gly Ala Ser Pro
405 410 415
Ala Met Ile Ala Arg Ala Leu Pro Ala Asp Leu Asp Thr Leu Tyr Val
420 425 430
Arg Ala Trp Val Tyr Met Thr Arg Gln Leu Gly Met Asn Pro Gly Asp
435 440 445
Asn His Glu Thr Leu Ile Ala Leu Arg Gly Ser Ala Gly Asn Ala Asn
450 455 460
Asn Glu Val Arg Phe Gly Glu Ile Lys Gly Val Ile Gly Thr Asn Glu
465 470 475 480
Val Pro Ser Asp Asn Ile Ala Pro Thr Met Ala Ser Trp Gly Gly Gly
485 490 495
Pro Ala Tyr Ala Ser Asp Thr Trp His Cys Ile Glu Val Ala Phe Leu
500 505 510
Ser Gln Pro Ala Tyr Asp Thr Val Asn Ala Trp Val Asn Asp Ala Leu
515 520 525
Val His Thr Ile Asp Glu Gly Ser Asp Trp Asn Asn Gly Ala Leu Glu
530 535 540
Ala Asp Trp Leu Ser Asn Lys Tyr Val Glu Leu Ala Phe Gly Trp His
545 550 555 560
Ser Phe Ser Gly Asn Asp Val Asp Val Trp Met Asp Asp Ile Val Val
565 570 575
Ser Thr Thr Pro Ile Gly Cys Asp
580
<210> 182
<211> 64
<212> PRT
<213> Microbulbifer degradans
<400> 182
Met Glu Pro Pro Val Glu Leu Leu Glu Glu Leu Leu Glu Glu Leu Leu
1 5 10 15
Val Glu Leu Pro Val Glu Leu Leu Asp Glu Glu Leu Asp Asp Glu Pro
20 25 30
Glu Glu Leu Pro Pro Pro Leu His Thr Ser Pro Val Thr Thr Gly Thr
35 40 45
Ser Ala Leu Pro Pro Phe Leu Ala Pro Trp Lys Pro Asn Ser Ile Val
50 55 60
<210> 183
<211> 1755
<212> DNA
<213> Microbulbifer degradans
<400> 183
gtgcggtgtg tattagtaaa aaattatcaa ataataaaat taaaaaatta tagccatcat 60
gtttgctgga gaaagtttat gtcaacccaa ggtaaacctt gtttaaaaaa attgtggccg 120
gcgttagcgc ttgctgctgc tgtagctagt ccgaacgcat tttctgcgtg tgagtatgtt 180
gtatccaatc aatgggattc agggtattcg gcgactatta aaattcataa cgatacaaat 240
tcaaccatta atggttggaa tgtgaattgg caatatagcg gcgataaccg cgtaaccaat 300
ttatggaatg cggcgtatgt aggtagtaac ccatactctg caagtaacct ttcgtggaat 360
agcaccgttg gtgccggtca aactatcgag tttggtttcc agggggccaa aaacggcggc 420
agtgccgaag tgcccgttgt tacgggtgat gtgtgtagtg gtggtggcag ttcttctggt 480
tcatcatcta gctcttcatc gagtagctct acaggtagtt ctacaagcag ttcttcaagt 540
agctcttcaa gtagttccac aggcggttcc acaagcagtt catcaagctc gtcttctacc 600
ggtggcaatg gtggtgcaca gcaatgcaat tggtacggtg aaattcgccc actgtgtaac 660
aatcaagata gcggctgggg ttgggaaaat cagcaaagct gtatcggtcg cgacacctgt 720
tccaaccaaa gtggcaatgg cggtataatt ggcggtagtt cttctagctc gtccagttct 780
tcaaccggtt catcgagcag ctcttcgagt tcttcgtcaa gctcatccag ctcttctagt 840
tcaacgtcaa gctcatccag ctcttctagt tcaacgtctt caagttcttc gtcgagttca 900
tcatctagct cttcaagttc tacaagtagc tcgtcgtctt cgtctagctc atcgagtagt 960
tcttcaagca gttcgacgtc tagcagtagt tcttcatctt cgagctctag cagttcatct 1020
agctcttcaa gcagttcttc tagtagctct tcttcaagtg gtggcggtgt tatttttagc 1080
gcagacttcg aaagcgatag cgatagtaca cagccagcag gttgggataa ttttattggc 1140
tggcgtcaaa acgggccaaa cccaaatggt tcgacatttg cgttggtgga atcaggtcgc 1200
gcacacagtg gtaacaacgc tgtgcacttc agtggtggcg caagcccagc tatgattgct 1260
cgcgctttgc ctgcagattt agatacttta tatgtgcgtg cttgggtgta tatgactcgc 1320
caattgggta tgaaccccgg tgataaccat gaaacgctca tcgccttgcg tggctctgct 1380
ggcaatgcaa acaacgaagt gcgttttggt gaaattaaag gggtaattgg tactaacgaa 1440
gtaccgagcg ataacattgc acctactatg gcaagctggg gtggtggccc tgcctatgca 1500
tcagatactt ggcattgtat cgaagtggca tttttatcgc agccagcgta cgacactgta 1560
aatgcatggg ttaacgacgc gcttgtacat actattgatg aaggctctga ctggaacaat 1620
ggtgccttag aggcggattg gttaagtaat aaatacgtag agctagcctt cggctggcac 1680
agctttagcg gcaacgatgt agatgtatgg atggatgata tcgttgtatc cacaacccca 1740
atcggttgcg actaa 1755
<210> 184
<211> 788
<212> PRT
<213> Microbulbifer degradans
<400> 184
Met Phe Thr Arg Ile Thr Phe Met Leu Ile Asn Lys Ala Arg Gly Arg
1 5 10 15
Leu Ala Ala Ala Met Leu Ala Thr Ala Ala Ser Ala Trp Gly Ser Thr
20 25 30
Ala Leu Ala Glu Cys Ser Tyr Gln Val Thr Asn Asn Trp Gly Ser Gly
35 40 45
Phe Thr Ala Ala Ile Arg Ile Thr Asn Thr Gln Thr Ser Ser Ile Asn
50 55 60
Asp Trp Gln Val Ser Trp Thr Tyr Glu Asn Asn Thr Leu Val Asn Ala
65 70 75 80
Trp Asn Ala Asn Val Ser Gly Asn Tyr Thr Ala Thr Asn Met Gly Trp
85 90 95
Asn Gly Ser Leu Thr Ser Gly Gln Ser Val Glu Phe Gly Leu Gln Gly
100 105 110
Thr Thr Thr Gly Gly Val Val Glu Ile Pro Leu Leu Thr Gly Asn Val
115 120 125
Cys Asn Ser Asn Thr Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ala
130 135 140
Ser Ser Ser Ser Ser Ser Ser Gly Ala Ala Ala Val Asn Leu Ala Gly
145 150 155 160
Ile Ala Asn Ala Ser Thr Ser Tyr Val Ser Ala Trp Glu Thr Leu Ala
165 170 175
Ala Val Asn Asp Gly Asn Thr Pro Ala Asn Ser Asn Asp Lys Ser Asn
180 185 190
Gly Ala Tyr Gly Asn Trp Asn Asn Pro Asn Ser Ile Gln Trp Val Gln
195 200 205
Tyr Asp Trp Pro Gln Asn Tyr Thr Leu Ser Ser Thr Gln Ile Tyr Trp
210 215 220
Phe Asp Asp Asn Gly Gly Val Leu Val Pro Asp Val Ala Tyr Ile Glu
225 230 235 240
Tyr Trp Ser Asn Gly Thr Trp Val Lys Val Gly Asp Val Pro Arg Gln
245 250 255
Glu Asn Thr Phe Asn Thr Leu Asn Leu Asn Asn Ile Val Thr Asn Arg
260 265 270
Leu Arg Val Ser Ile Ser Asn Thr Leu Gln Ser Thr Gly Ile Leu Glu
275 280 285
Trp Arg Val Glu Gly Thr Glu Pro Ser Gly Ser Ser Ser Ser Ser Ser
290 295 300
Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Gly Gly Pro Glu
305 310 315 320
Gln Cys Asp Ala Tyr Val Trp Pro Glu Tyr Glu Pro Asn Leu Asn Tyr
325 330 335
Asp Phe Arg Gln Asp Tyr Ala Asp Ile Asn Pro Glu Glu Phe Glu Val
340 345 350
Phe Leu Gly Cys Asp Pro Ser Gln Val Ala Gly Val Lys Thr Ser Gly
355 360 365
Trp Tyr Ala Phe Ile Trp Gly His Asn Arg Asn Pro Ala Ile Thr Asp
370 375 380
Glu Asp Ile Asp Arg Val Leu Ala Asn Leu Asn Glu Asp Met Ala Tyr
385 390 395 400
Ala Arg Gly Glu Met Gly Trp Pro Pro Asp Lys Leu Pro Gln Glu Gly
405 410 415
Tyr Tyr Ser Asn Val Tyr Leu Tyr Gly Ser Gly Leu Cys Thr Asp Asn
420 425 430
Ala Ala Asn Thr Glu Arg Gly Gly Trp Gln Ser Ser Ile Ala Gly Tyr
435 440 445
Pro Met Val Leu Leu Ser Tyr Tyr Pro Val Ile Thr Pro Ser Glu Arg
450 455 460
Gly Gly Ile Thr His Glu Ala Ile His Thr Ile Met Ala Ser Met Gly
465 470 475 480
Asn Lys Ala Ala Trp Phe Asn Glu Gly Gly Asn Thr Trp Leu Gln Met
485 490 495
Asn Met Glu Ala Ser Arg Val Gly Asp Tyr Gly Val Gly Phe Leu Asp
500 505 510
Gly Ala Pro Phe Leu Ala Pro His Met Pro Ile Glu Asn Tyr Ser Gly
515 520 525
Trp Leu Gln Asp Gly Ser Phe Gly Gly Pro Asn Ala Glu Gly Val His
530 535 540
Arg Glu Leu Asn Gly Gln Gln Ile Ala Thr Trp Arg Asp Tyr Leu Gly
545 550 555 560
Gly His Gln Tyr Asn Ser Val Phe Ser His Phe Leu Ala Gln Tyr Val
565 570 575
Ser Ser Gly Ala Asn Ala Trp Ile Trp Lys Asn Gly Pro Tyr Asn His
580 585 590
Ile Leu Ala Ser Leu Ala Ala Gly Leu Gly Asp Asp Gln Thr Arg His
595 600 605
Leu Ile Met Gln Tyr Arg Ala Arg Gln Ala Met Val Asp Phe Gly Pro
610 615 620
Trp Thr Asn Gly Phe Lys Gln Pro Ile Asn Asn Asn Trp Gln Arg Thr
625 630 635 640
Ile Gly Ala Glu Glu Thr Ala Ala Gly Lys Trp Met Glu Pro Glu Pro
645 650 655
His Gln Leu Thr Phe Tyr Ala Ala Thr Ser Gln Glu Gly Asn Thr Leu
660 665 670
Ile Pro Ala Gln Asn Thr Leu Pro Gly Trp Ser Gly Ala Asn Gln Ile
675 680 685
Pro Leu Gln Val Thr Gly Asn Lys Val Arg Val Asp Phe Glu Pro Phe
690 695 700
Gly Asn Asn Met Arg Leu Gln Leu Ala Tyr Arg Ala Gln Asp Gly Ser
705 710 715 720
Ala Val Tyr Ser Gln Pro Ile Glu Ser Gly Glu Ala Cys Leu Thr Leu
725 730 735
Glu Lys Thr Pro Lys Asn Gly Val Val Val Ala Ile Val Ser Asn Thr
740 745 750
Asp Tyr Thr Tyr Ala Gly Asp Glu Thr Arg Lys Gln Lys Tyr Asp Tyr
755 760 765
Arg Val His Ile Gln Glu Gly Val Ser Gly Thr Ala Ser Leu Tyr Ser
770 775 780
Lys His Tyr Glu
785
<210> 185
<211> 2367
<212> DNA
<213> Microbulbifer degradans
<400> 185
atgtttacga ggattacctt tatgttaatt aacaaagcga gaggtcgctt agctgcagca 60
atgttagcca cagcagcctc agcttggggt agcacggctc tagccgagtg cagctatcaa 120
gtcacaaata attggggcag cggatttact gccgccattc gcatcactaa tacccaaacg 180
agtagtatta atgactggca agttagctgg acgtacgaaa ataatacact tgtaaatgct 240
tggaacgcga atgttagcgg caattacacc gctacaaaca tgggctggaa tggctcactt 300
acctcgggcc aatctgtcga gtttggcctg caaggcacca caactggcgg tgtggttgaa 360
atcccccttt taaccggtaa tgtgtgcaac tcaaatacaa gtagtagctc ttctagtagt 420
acaagttcag cctcaagttc aagttcgagt tccggcgctg ccgcagtaaa tctcgctggt 480
atagccaatg cgagcacttc ctatgtttct gcatgggaaa cattggcagc agtgaatgat 540
ggtaacaccc cggccaattc caacgacaag tccaacggag cttacggcaa ctggaacaat 600
ccaaattcga tacagtgggt acagtacgat tggccacaaa actatacgtt atcctcaacg 660
caaatatatt ggtttgatga taatggcggc gtactggtac ccgatgttgc ttacattgaa 720
tattggagca atggcacatg ggtaaaagta ggtgatgtac cgcggcaaga aaatacattt 780
aacacgctaa atttaaacaa tattgttaca aaccgcctac gtgtttctat aagcaacaca 840
ttgcaatcta caggcatact ggagtggcgc gtagagggaa cagagccaag cggcagctca 900
tctagcagca gctcaagctc atcatccagc tcaacgtcta gtagtagtgg cggaccagaa 960
cagtgtgatg cgtatgtgtg gcccgaatac gagccgaatt taaattacga ctttagacag 1020
gactacgcag atataaaccc cgaagaattt gaggtatttt taggttgcga cccaagccaa 1080
gttgcaggtg ttaaaacttc tggctggtac gcgtttattt gggggcataa tcgcaacccc 1140
gcaattaccg atgaagatat agatagagta ttggccaatt taaatgaaga tatggcatac 1200
gctcgcggcg aaatgggctg gccacccgat aagcttcctc aggaaggcta ctacagcaat 1260
gtgtatttgt acggctctgg tttatgtacc gataacgcag ccaataccga gcgaggtggc 1320
tggcaaagta gcattgccgg ctaccccatg gtactgcttt cgtattaccc agtaataaca 1380
cctagcgagc gcggtggtat aacccacgaa gctatccaca ctattatggc ttcgatggga 1440
aacaaagctg catggtttaa cgaaggcggc aacacttggc tacaaatgaa tatggaggca 1500
tctcgcgttg gggattatgg cgttggcttt ttagatggtg cccccttttt agcgccgcac 1560
atgccaatcg aaaattacag tggttggcta caagatggct ccttcggtgg gccgaatgct 1620
gaaggcgtac accgggaatt aaatggtcag caaatagcaa cttggcgaga ttacttaggc 1680
ggccatcaat acaactctgt gttttctcac tttttagcgc aatatgtttc tagcggtgca 1740
aatgcgtgga tatggaaaaa cgggccatat aaccatattc tcgcctcgct agcagcgggc 1800
ttgggcgacg accaaactcg ccacttaatt atgcaatacc gcgcacggca agcaatggtt 1860
gattttggcc cttggacgaa tggatttaaa caaccaatca ataacaattg gcaacgcacc 1920
attggtgcag aagaaaccgc tgctggcaaa tggatggagc ctgaacccca tcaattaacg 1980
ttttatgctg caacaagcca ggaaggtaat actttgatac cagcacaaaa tacattgccg 2040
ggctggtctg gtgccaatca aataccatta caagtaaccg gcaacaaagt gcgtgttgat 2100
tttgaaccat tcggcaacaa tatgcgcttg caattagcct atcgcgcaca agacggttca 2160
gcggtataca gccagcctat agaaagtggc gaagcgtgct taacactaga aaaaacaccc 2220
aaaaatggcg tagtggtggc aattgtatcc aacaccgatt acacctacgc aggggatgaa 2280
actcgcaaac aaaaatacga ttacagagtg catattcaag aaggcgtaag cggtacggca 2340
tcactttata gcaagcacta tgaatag 2367
<210> 186
<211> 787
<212> PRT
<213> Microbulbifer degradans
<400> 186
Met Leu Arg Lys Leu Asn Phe Ile Leu Ala Pro Leu Gly Met Val Leu
1 5 10 15
Gly Val Asn Thr Tyr Ala Asp Val Ser Cys Ala Val Ser Gly Ser Val
20 25 30
Trp Asn Asn Gly Tyr Val Ala Asn Val Ala Val Thr Asn Ala Gly Glu
35 40 45
Ser Leu Gln Glu Gly Trp Ser Val Ala Leu Leu Phe Asp Asn Thr Pro
50 55 60
Thr Ile Asn Asn Ser Trp Ser Ala Glu Leu Ala Val Asp Gly Asn Val
65 70 75 80
Leu Thr Ala Ser Asn Val Ala Trp Asn Ala Asn Leu Ala Ala Gly Gln
85 90 95
Ser Ala His Phe Gly Phe Val Gly Ser Tyr Ser Gly Glu Phe Glu Leu
100 105 110
Pro Glu Cys Phe Val Gly Ala Leu Asp Asp Asp Thr Ala Leu Glu Asp
115 120 125
Tyr Leu Lys Gln Gly Val His Asn His Leu Asn Leu Arg Ile Ala Asn
130 135 140
Ser Ala Ser Gly Ser Ser Ser Ser Ser Ser Ser Ser Gly Ala Ser Ser
145 150 155 160
Ser Ser Ser Ser Ser Ser Ala Ser Ser Ala Ser Ser Thr Ser Ser Gly
165 170 175
Ser Pro Val Ala Glu Pro Asp Thr Gln Pro Asp Asp Ile Thr Asn Thr
180 185 190
Gln Glu Asp Gly Val Asp Glu Ala Asp Ile Ile Glu Ser Asp Gly Asn
195 200 205
His Phe Phe Val Val Arg Ser Ser Leu Tyr Phe Thr Gly Thr Ser Ser
210 215 220
Ser Thr Ser Gly Ser Ser Ser Ser Ser Ala Ser Ser Ser Gly Ser Gly
225 230 235 240
Gly Tyr Gly Asp Tyr Thr Gln Gly Val Leu Ile Glu Ala Tyr Ser Lys
245 250 255
Asp Ser Ala Ala Gly Thr Thr Gln His Val Gly Gln Val Thr Leu Pro
260 265 270
Tyr Glu Glu Asn Tyr Leu His Val Ser Gly Ala Tyr Phe Arg Lys Thr
275 280 285
Pro Gln Gly Asn Arg Leu Ala Ile Val Ser Asn Thr Arg Glu Ser Gln
290 295 300
Pro Tyr Tyr Trp Ser Tyr Gly Tyr Gly Tyr Tyr Pro Tyr Tyr Gln Ile
305 310 315 320
Ser Asn Lys Val Ala Ile Ser Val Ala Asn Ile Asp Thr Pro Glu Leu
325 330 335
Met Asp Glu Ala Glu Arg Ile Thr Phe Asp Gly Asp Leu Val Ser Ser
340 345 350
Arg Arg Ile Gly Ser Lys Leu Tyr Ile Ala Ser Arg Tyr Ser Pro Asn
355 360 365
Leu Asn Arg Leu Gly Phe Asp Phe Ser Ser Ser Ala Ser Asn Glu Glu
370 375 380
Asn Ala Gln Arg Leu Asp Glu Ala Pro Leu Ala Asp Leu Leu Pro His
385 390 395 400
Val Tyr Asp Glu Asn Gly Val Gly Thr Pro Leu Val Thr Ala Ala Asp
405 410 415
Cys Thr Val Pro Glu Trp Pro Glu Ser Val Asp Val Tyr Ala Gly Ser
420 425 430
Leu Leu Val Ile Thr Gln Ile Asp Leu Asp Asn Asn Leu Ala Ile Ser
435 440 445
Ser Arg Cys Leu Pro Ala Ser Ala Thr Glu Ile Tyr Ser Ser Ala Asp
450 455 460
Ala Ile Tyr Ala Phe Ser Asn Ala Phe Tyr Ala Gly Val Lys Val His
465 470 475 480
Lys Phe Thr Leu Lys Asn Asp Ala Gln Glu Ser Glu Ile Ser Tyr Lys
485 490 495
Gly Ser Val Lys Leu Pro Gly Phe Ile Ala Cys Ser Asn Gln Ser Tyr
500 505 510
Cys Phe Gly Glu Gln Asp Gly Ala Leu Arg Val Leu Tyr Arg Val Ser
515 520 525
Asn Tyr Ser Asp Ala Thr Pro Tyr Arg Ile Ala Val Ile Lys Gln Ser
530 535 540
Pro Ser Ser Ala Glu Gly Leu Asp Ile Ile Ala Thr Leu Pro Asn Ala
545 550 555 560
Glu Arg Pro Ala Ser Ile Gly Lys Pro Gly Glu Ile Val Tyr Ala Met
565 570 575
Arg Ser Phe Gly Asp His Ala Tyr Val Val Thr Phe Asp Arg Ile Asp
580 585 590
Pro Leu Tyr Ala Ile Asp Phe Ser Asn Pro Ala Asp Pro Tyr Ile Ala
595 600 605
Gly Glu Leu Glu Val Thr Gly Val Ser Asp Tyr Leu His Pro Val Gly
610 615 620
Glu Asn Leu Leu Leu Gly Val Gly Arg Asp Ala Ile Phe Asp Glu Ala
625 630 635 640
Arg Asn Leu Thr Trp Phe Gln Gly Val Lys Val Glu Leu Phe Asp Val
645 650 655
Ser Asp Pro Thr Asn Leu Arg Ser Leu Gly Ser Glu Val Ile Gly Lys
660 665 670
Arg Glu Ser Ser Thr Thr Leu Ser Phe Asp Ala Arg Gly Ile Ala Phe
675 680 685
Ser His Asp Glu Asn Gly Ile Arg Phe Ala Ile Pro Val Lys Met His
690 695 700
Asp Lys Pro Ile Gly Pro Ala Asn Thr Asp Ala Leu Asn Gln Tyr Tyr
705 710 715 720
Ser Trp Thr His Thr Gly Leu Tyr Val Tyr Gln Ile Glu Thr Gln Pro
725 730 735
Asn Thr Asp Ala Ser Leu Ser Arg Leu Gly Ile Leu Lys Thr Asp Ile
740 745 750
Gly Glu Ala Ser Val Arg Tyr Asp Arg Gly Val Ile Asp Gly Asp Ala
755 760 765
Val Tyr Tyr Leu His Asn Tyr His Met Phe Ser Ser Leu Ile Asn Tyr
770 775 780
Leu Ala Gln
785
<210> 187
<211> 2364
<212> DNA
<213> Microbulbifer degradans
<400> 187
atgttacgga aattaaattt tattttagcg ccgcttggta tggtgttagg cgttaacacc 60
tatgcggatg tgtcgtgcgc tgtttctggc agcgtgtgga acaatggcta tgtagctaat 120
gttgcagtta ctaacgcggg tgaatctctg caagagggtt ggagtgttgc tttgttgttc 180
gataacactc caactataaa caatagttgg agtgcggagc ttgctgtaga cggtaatgtg 240
ctaaccgcga gtaatgtggc ttggaacgct aatttggcag cggggcaatc tgctcatttt 300
ggttttgttg gctcatattc aggtgagttt gaattgcctg aatgtttcgt tggcgcacta 360
gatgatgaca ctgcactcga agattattta aaacagggcg tgcataatca tttaaattta 420
cgcattgcaa attcggcgtc tggttcatct agctcttcga gctcttccgg tgcgtccagt 480
tcttccagtt catccagtgc ttctagcgca tcgagtacat cttctggttc gcccgtggcc 540
gagcccgata ctcagcctga tgatattacc aatacccaag aagatggcgt ggatgaggcg 600
gatataattg aatccgacgg taaccacttc tttgtggtgc gttcttcttt atattttacc 660
ggcactagct catcaacctc tggctctagt tctagttcgg cttcatctag cggttcaggt 720
gggtatggcg attacacaca gggagtgttg attgaagcct acagtaaaga ttcagcagcc 780
ggaacaactc aacatgtggg gcaggtgacg ttgccctatg aagaaaatta tcttcacgtt 840
tctggtgctt actttcgcaa aacaccgcaa ggcaatagat tagctattgt aagtaacacc 900
agagaatcac agccttatta ttggagctat gggtatggtt attaccccta ttatcaaatt 960
tctaataagg tagctatttc tgttgctaat attgataccc ctgaattaat ggacgaggca 1020
gagcgaatta cttttgatgg tgatttagta tcgagcagac gtattggtag caaattatat 1080
attgccagcc gttatagccc gaacctaaat cgtttagggt ttgatttttc tagcagcgcg 1140
agcaatgagg aaaatgcaca gcgtttagat gaagcgcccc ttgcggattt attgccacat 1200
gtttacgacg aaaatggcgt aggtacgcct ttggttactg ctgctgactg caccgtaccg 1260
gagtggccag aaagcgtgga tgtatatgcc gggtcactgt tggttattac tcaaattgac 1320
ttggataata acttagcaat tagctcgcgt tgtttacctg cttcggcaac agaaatatac 1380
agttctgcgg atgcaattta tgctttctcc aatgcatttt atgctggggt aaaagtgcat 1440
aaattcacat taaaaaatga tgcccaagaa agtgaaatta gctataaggg tagtgttaag 1500
ttgccgggtt ttatagcttg cagtaatcaa tcttattgct ttggcgagca agacggggca 1560
ttgcgcgttt tgtatagagt gtccaattac agtgatgcta ctccctatcg tattgcggta 1620
attaagcaat cgccaagctc tgcagaaggt ttagacatta tagctacctt gccaaatgct 1680
gaacgcccag cctctattgg taagccgggc gagattgttt atgccatgcg tagttttggt 1740
gatcacgctt acgtcgttac tttcgataga attgacccgt tatatgctat tgatttctct 1800
aacccagcag acccttacat tgccggtgag cttgaggtta ctggtgtgtc tgattactta 1860
catccagtgg gggagaattt attgttgggc gtgggccgag atgcaatatt cgacgaagcg 1920
cgcaatttaa cttggtttca gggtgttaaa gtggagctgt ttgatgtgag tgaccctaca 1980
aacctgcgca gcttaggcag tgaagtaata ggtaagcgcg agtcgtcaac gacacttagt 2040
tttgatgccc gtggtattgc atttagccat gatgaaaacg gtattcgttt tgctattcct 2100
gtgaaaatgc atgataagcc aattggaccg gcaaataccg atgcgttaaa ccaatattat 2160
agttggaccc atacggggtt gtatgtttac caaatagaaa cgcagccaaa caccgatgct 2220
agtttgtctc gcttaggtat actaaaaact gatatcggcg aggccagtgt gcgctatgat 2280
cgtggcgtaa ttgatggtga tgctgtttac tatttgcata attaccacat gttttcgagc 2340
ttaatcaatt atttggcgca atag 2364
<210> 188
<211> 681
<212> PRT
<213> Microbulbifer degradans
<400> 188
Met Arg Ile Ser Thr Ala Ile Thr Gln Phe Leu Ile Val Ile Ala Thr
1 5 10 15
Leu Ile Leu Ala Ala Cys Ser Gly Ser Gly Gly Gly Ala Asn Ser Gly
20 25 30
Gly Ser Asn Pro Gly Ala Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
35 40 45
Ser Asn Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Gly Gly
50 55 60
Ser Ser Gly Ala Ala Val Ser Leu Pro Ser Thr Ile Ala Ala Met Asp
65 70 75 80
Phe Val Ala Ala Tyr Asp Thr Asp Asn Ala Asn Ser Gly Asp Cys Gly
85 90 95
Asn Gly Pro Val Asp Met Gln Thr Ser Thr Asp Thr Gln Gly Ala Ala
100 105 110
Cys Thr Val Gly Trp Thr Lys Ala Gly Glu Trp Leu Ala Tyr Asp Val
115 120 125
Ser Val Ala Val Thr Gln Lys Met Asp Ile Val Phe Arg Val Ala Thr
130 135 140
Asn Gln Asn Ser Arg Ala Leu Lys Val Gln Leu Asn Asn Lys Thr Leu
145 150 155 160
Gly Val Leu Asn Val Ser Gly Thr Ala Phe Asp Glu Trp Gln Thr Val
165 170 175
Ser Leu Lys Asp Ile Glu Ile Pro Ala Gly Thr His Gln Leu Lys Leu
180 185 190
Val Trp Met Thr Gly Ala Ile Asn Val Asn Thr Leu Ser Phe Thr Ala
195 200 205
His Ser Ala Tyr Gly Gln Thr Thr Asn Leu Trp Gly Ser Asn Gly Ala
210 215 220
Glu His Asp Pro Ser Gly Val Leu Ser Asp Trp Ser Tyr Ala Gly Tyr
225 230 235 240
His Trp Gly Glu Glu Glu Pro Pro Val Lys Ser Pro Thr Ile Asn Val
245 250 255
Val Thr Asp His Gly Ala Ile Ala Asp Asp Asp Ser Asp Asp Ser Ala
260 265 270
Ala Phe Ile Ala Ala Leu Thr Ala Ala Asn Asn Gly Asp Val Val Tyr
275 280 285
Val Pro Glu Gly Arg Phe Ile Leu Thr Gln Val Leu Ser Ile Pro Asn
290 295 300
Gly Val Val Leu Gln Gly Ala Gly Ser Glu Leu Thr Thr Leu Tyr Ile
305 310 315 320
Pro Thr Asn Leu Asn Glu Ala Thr Gly Ile Asp Pro Ser Phe Thr Gly
325 330 335
Gly Phe Ile Glu Met Lys Gly Ser Ser Ser Asp Gly Ser Lys Leu Ser
340 345 350
Thr Ile Thr Ala Ala Ala Ala Arg Gly Ser Asn Gln Ile Thr Val Glu
355 360 365
Ser Val Ala Gly Leu Thr Val Gly Asp Trp Val Gln Ile Gln Gln Thr
370 375 380
Asp Val Asn Gly Asn Phe Leu Leu Glu Ala Leu Tyr Ala Gly Tyr Thr
385 390 395 400
Ala Gly Leu Ser Asp Tyr Ser Asn Leu Val Asn Asp Ala Glu Leu Glu
405 410 415
Phe Phe Ser Arg Ile Thr Asn Ile Ser Gly Asn Thr Leu Thr Leu Glu
420 425 430
Arg Pro Leu Pro Ile Thr Val Ser Pro Asn Tyr Met Ala Glu Leu His
435 440 445
Ser Val Ser Val Ala Tyr Gly Glu Val Gly Phe Glu Gly Met Thr Leu
450 455 460
Glu Phe Pro Glu Thr Thr Tyr Pro Gly His Phe Asn Glu Leu Gly Tyr
465 470 475 480
Asn Gly Ile Asp Met Arg Ala Gln His Ser Trp Val Arg Asp Val Val
485 490 495
Val Lys Asn Ile Asp Tyr Gly Ile Asn Leu Arg Gly Ser His Phe Val
500 505 510
Ser Ile Leu Asp Phe Thr Val Ile Asn Thr Asn Asp Arg Ser Gly His
515 520 525
His Gly Ile Ser Val Gly Ser Ser Thr Asp Cys Leu Ile Arg Gly Phe
530 535 540
Asp Ile Gln Ala Glu Leu Val His Asp Leu Thr Val Glu Trp Tyr Ala
545 550 555 560
Tyr Gly Asn Val Phe Thr Gln Gly Arg Gly Lys Asn Ile Thr Leu Asp
565 570 575
His His Arg Ala Ala Ala Tyr Ala Asn Leu Phe Thr Gln Ile Asp Leu
580 585 590
Gly Glu Ala Thr Arg Ala Trp Lys Thr Gly Gly Arg Ser Asp Arg Gly
595 600 605
Tyr Lys Thr Ala Val Tyr Ser Thr Phe Trp Asn Met Thr Ala Glu Gln
610 615 620
Ala Ile Asp Trp Pro Ala Asn Asp Phe Gly Pro Arg Met Val Phe Met
625 630 635 640
Gly Leu Thr Met Asp Gly Ser His Ser Ser Val Leu Asp Trp Val Val
645 650 655
Glu Asp Ile Ser Ala Asp Asp Ile Tyr Pro Pro Asn Leu Trp Leu Ser
660 665 670
Gln Arg Glu Lys Arg Leu Gly Arg Gln
675 680
<210> 189
<211> 66
<212> PRT
<213> Microbulbifer degradans
<400> 189
Met Leu Gly Ser Asp Thr Ala Ala Pro Asp Glu Pro Pro Leu Glu Asp
1 5 10 15
Asp Glu Leu Val Leu Leu Glu Leu Leu Leu Asp Glu Glu Asp Glu Glu
20 25 30
Leu Glu Leu Glu Leu Ala Pro Gly Leu Glu Pro Pro Glu Phe Ala Pro
35 40 45
Pro Pro Glu Pro Leu Gln Ala Ala Arg Ile Ser Val Ala Ile Thr Ile
50 55 60
Arg Asn
65
<210> 190
<211> 2046
<212> DNA
<213> Microbulbifer degradans
<400> 190
atgcgtattt ctactgctat cactcaattc ctaattgtaa ttgctacact tattctagct 60
gcctgcagtg gctcgggcgg aggggcgaat tctggtggct ctaatcctgg tgctagttct 120
agctccagtt cttcgtcttc ttcatctaat agcagttcga gtagcactag ttcgtcgtcc 180
tctagtggtg gttcatccgg tgccgcggta tcgctcccaa gcactattgc cgcaatggat 240
tttgtcgcag cttacgatac agacaacgca aattcaggtg attgtggtaa tggccctgtc 300
gatatgcaaa cgtcaaccga tacgcaaggt gcggcctgca cggtaggttg gacaaaagct 360
ggcgagtggt tggcttatga tgtgagtgtt gctgttaccc aaaaaatgga tattgtgttt 420
cgtgttgcta ctaaccaaaa ctcacgtgcc ttaaaagtac agctaaataa caaaacattg 480
ggtgtgttaa atgtttctgg tacagcgttt gatgaatggc aaactgtatc gctaaaagac 540
atagaaattc ctgcaggcac gcatcaatta aaactcgtgt ggatgaccgg ggcaataaac 600
gtcaataccc ttagttttac cgcgcacagc gcctacggcc aaactaccaa tttgtggggc 660
agtaacggtg ccgaacacga cccgagcggc gtgttatctg attggtctta tgctggttat 720
cactggggtg aagaagagcc gccggtaaaa tcgccaacca ttaatgtggt taccgatcac 780
ggtgcgatag ccgatgacga tagcgacgat agcgctgcgt ttattgctgc attaaccgca 840
gcaaataatg gtgatgtagt ttacgtaccc gaggggcgtt ttattcttac gcaggtgcta 900
agcataccga atggtgttgt gttgcagggc gcaggcagcg aattaactac tttatatatt 960
cccaccaact taaatgaagc aacgggtata gatccatctt ttactggtgg gtttattgaa 1020
atgaaaggtt cgtcgagtga cggcagtaaa ctttctacta ttactgcagc ggcggcacga 1080
ggcagtaatc aaataacagt agaaagtgtt gcaggtttaa ccgtaggcga ttgggtgcaa 1140
atacaacaaa ccgatgtgaa cggtaacttt ttattagagg ccttgtatgc agggtacacc 1200
gcaggcttaa gtgattacag taatttggtt aacgatgcag agttagagtt tttcagccgc 1260
ataacaaata ttagtggcaa cacacttacc ctagagcgac cgctacccat taccgtaagc 1320
cctaactata tggcggaatt acacagcgta tcggttgcct acggcgaggt gggtttcgaa 1380
ggaatgactt tagagtttcc agaaacaacc tacccaggtc acttcaatga gttagggtac 1440
aacggcatag atatgcgcgc tcagcactct tgggtgcgag atgttgtagt aaaaaatata 1500
gattatggta ttaatttaag aggctcgcac tttgtaagca ttttagattt tactgtaatt 1560
aacacaaatg atcgcagtgg ccaccacggg ataagtgtgg ggtcgtccac cgattgttta 1620
attcgcggtt tcgatattca agccgaacta gtccacgatt taactgttga atggtatgcg 1680
tatggcaatg tatttacgca agggcgaggc aaaaatatta ccctcgatca tcaccgcgca 1740
gcagcctacg ccaacttatt tacccaaatt gatttaggtg aagcaacacg cgcatggaaa 1800
acaggtggcc gaagcgatcg cggatacaaa accgcggtat acagtacgtt ttggaacatg 1860
accgccgaac aggcgatcga ttggcccgcc aatgatttcg gcccacgtat ggtgtttatg 1920
gggttaacca tggacggcag tcacagctct gttttagatt gggtggtaga agatatttct 1980
gcagatgata tatatccacc aaacctttgg ttgtcgcaaa gagaaaaaag attggggcgg 2040
cagtag 2046
<210> 191
<211> 654
<212> PRT
<213> Microbulbifer degradans
<400> 191
Met Arg Phe Gln Tyr Ala Leu Cys Thr Asp Ser Gly Gly Thr Ile Lys
1 5 10 15
Asn Asn Gln Ser Trp Val Ser Ile Met Leu Arg Ile His Leu Ala Met
20 25 30
Pro Asn Met Ala Phe Ser Leu Leu Gln His Tyr Leu Leu Asn Thr Phe
35 40 45
Ser Gly Val Ser Ile Trp Val Leu Cys Gly Leu Leu Met Gly Ala Tyr
50 55 60
Val Asn Ala Ala Thr Gln Glu Ile Asn Gly Asn Gln Leu Gln Val Thr
65 70 75 80
Cys Gln Ser Glu Thr Phe Cys Asp Ile His Tyr Arg Leu Asn Asn Gly
85 90 95
Arg Glu Leu Asn Ile Ala Met Ala Ser Leu Gly Asn Gly Glu Tyr Ala
100 105 110
His Thr Ile Ser Asn Leu Ile Ser Gly Asp Val Ile Val Tyr Tyr Leu
115 120 125
Thr Tyr Gln Gln Asn Gly Leu Ala Tyr Asp Ser Glu Arg Leu Thr His
130 135 140
Ile Tyr Gly Gly Ala Ser Ser Ser Ala Gly Ala Asp Pro Tyr Leu Gly
145 150 155 160
Phe Arg Ala Pro Ile Pro Gly Thr Ile Leu Ala Val Asn Tyr Asp Thr
165 170 175
Gly Gly Glu Gly Ile Ala Phe His Asp Ala Thr Ile Gly Asn Ser Gly
180 185 190
Gly Gln Tyr Arg Asp Glu Ser Val Asp Ile Glu Ser Ala Ser Ile Gly
195 200 205
Gly Phe Asn Val Gly Trp Val Ala Ser Gly Glu Trp Leu Gln Tyr Ser
210 215 220
Val Asp Val Ala Arg Ala Gly Ser Phe Asp Ile Val Ala Gln Ile Ala
225 230 235 240
Ser Pro Asn Ser Gly Gly Ser Leu His Tyr Glu Ile Thr Gly Ala Thr
245 250 255
Asn Val Gln Ser Asp Ala Val Gln Phe Ala Asn Thr Gly Gly Trp Gln
260 265 270
Val Trp Ala His Thr Ala Ala Ala Lys Val Gln Leu Asn Ala Gly Ala
275 280 285
His Thr Ile Arg Leu Val Phe Glu Ser Ala Gly Phe Asn Ile His Ser
290 295 300
Leu Leu Val Ser Glu Ala Ser Ser Ser Phe Gly Lys Val Glu Ala Glu
305 310 315 320
Ser Phe Asp Ser Met Ser Gly Val Val Ser Glu Ala Thr Ser Val Gly
325 330 335
Tyr Phe Asp Arg Gly Asp Trp Met Lys Tyr Ser Ala Val Ser Phe Gly
340 345 350
Asn Leu Ala Lys Ser Ile Thr Leu Ser Val Ala Gly Glu Tyr Asn Asn
355 360 365
Gly Ile Ala Glu Leu Arg Leu Asp Ser Val Ser Gly Pro Val Ile Gly
370 375 380
Thr Tyr Ser Met Glu Ser Thr Gly Gly Trp Ala Ser Phe Lys Pro Lys
385 390 395 400
Ser Phe Asn Ile Val His Thr Leu Gly Val His Asp Leu Tyr Ile Val
405 410 415
Gly Lys Asn Gly Ser Gly Ile Phe Asn Leu Asp Tyr Phe Gln Leu Ser
420 425 430
Ala Asp Val Val Glu Glu Thr Asn Pro Ala Thr Ala Val Lys Ala Met
435 440 445
Ser Leu Asn Ile Tyr Gly Trp Ala Thr Met Pro Gln Asn Ala Asp Lys
450 455 460
Tyr Ala Ala Leu Ile Arg Ser Arg Gly Val Asp Val Val Gly Ile Gln
465 470 475 480
Glu Gly Val Glu Asp Trp Leu Ile Gly Pro Gly Phe Pro Thr Asn Tyr
485 490 495
Ser Lys Ala Asp Ala Leu Gly Ala Ala Leu Gly Ala Cys Trp Gln Gln
500 505 510
Arg Tyr Gln Ile Phe Ile Asn Ile Cys Glu Gly Asn Ser Phe Val Ser
515 520 525
Asn Arg Arg Phe Asp Met Thr Asp Gly Pro Asn Ala Thr Arg Thr Gly
530 535 540
Glu Ser Ala Arg Ile Asn Lys Asn Gly Phe Glu Tyr Ala Val Leu Thr
545 550 555 560
Val His Trp Asp His Gln Ser Gly Ala Ala Lys Val Ala Asn Ala His
565 570 575
Glu Thr Ala Ala Glu Val Asn Tyr Tyr Gly Ala Leu Pro Thr Val Val
580 585 590
Val Gly Asp Phe Asn Thr Gly Cys Thr Ser His Glu Val Asn Thr Leu
595 600 605
Met His Glu Ala Gly Met Val Leu Ile Gly Asn Ala Gly Ile Asp Cys
610 615 620
Ile Leu Ala Lys Arg Phe Asn Gly Thr Ala Gln Thr Phe Asp Ala Ala
625 630 635 640
Pro Ser Asp His Pro Gly Leu Asp Ala Ser Leu Ser Thr Asn
645 650
<210> 192
<211> 1965
<212> DNA
<213> Microbulbifer degradans
<400> 192
gtgcgttttc agtatgcgtt gtgcacagac agtggcggta caattaaaaa taatcaaagt 60
tgggtatcta ttatgttgcg aattcatcta gccatgccaa atatggcctt ttcactatta 120
cagcactatc ttttaaacac attttctggg gtatccatat gggtactttg tggtttattg 180
atgggggcat acgttaatgc cgccacacaa gagatcaacg gcaaccagtt acaagtgaca 240
tgccaaagcg aaaccttttg cgatattcat tatcgtttga ataatggcag agaattaaat 300
atagcgatgg cgtcgttagg taatggtgag tacgcacata cgatttctaa tcttatttct 360
ggcgatgtga ttgtttacta ccttacgtac caacaaaacg gtttggccta tgactctgag 420
cgcctgacgc acatatatgg gggggctagc tccagcgctg gcgcagatcc ctatttgggg 480
tttagagcac ctattccagg gactattctc gctgtgaact acgatactgg aggcgaaggc 540
atcgcttttc atgatgccac aataggaaac tctggtggtc aataccgcga cgaaagtgtg 600
gatatagaga gcgcttcgat tggtggcttt aatgtggggt gggttgcttc cggcgagtgg 660
ctgcaatact ccgtggatgt ggcgagagct ggcagtttcg atattgttgc gcaaatcgcc 720
tctcctaatt ctgggggcag ccttcattat gaaattacag gggcaaccaa tgttcaatct 780
gacgcggtgc aatttgcgaa tacaggtggt tggcaggttt gggcacacac agcggctgcg 840
aaggtacagc tgaatgcagg tgctcatacc attcggttag tgtttgaatc agcagggttt 900
aatattcatt ctctgctggt aagcgaagcg agctctagtt tcgggaaagt tgaggcagaa 960
agttttgatt ccatgagtgg cgtagtgtcg gaggcgacaa gcgtaggcta ttttgatcgc 1020
ggtgattgga tgaaatacag cgcggtcagc tttggtaatc ttgccaagag cataacgtta 1080
tccgtcgctg gtgaatacaa taatggtata gcagaactta gattagacag tgtcagtggg 1140
ccggtgattg gcacctactc aatggaatcg acaggtggtt gggctagttt taagccaaag 1200
agttttaata tcgttcatac cttgggagtg catgatcttt atattgttgg taaaaatggc 1260
agtggcattt ttaatttaga ctattttcaa ttatctgcag atgtcgtaga agagaccaac 1320
cctgctacag cggtaaaagc gatgtcgctg aatatatatg ggtgggcaac aatgcctcaa 1380
aatgctgata agtatgcagc gcttattcgt tctcggggtg ttgatgttgt gggtattcaa 1440
gagggggtag aagattggct tattgggccg ggttttccca caaattactc caaagcagat 1500
gctcttggtg ctgccttagg agcttgttgg cagcagcgct atcaaatttt tattaatatt 1560
tgtgaaggaa atagcttcgt gtcgaatcgt cgattcgaca tgaccgatgg gcctaatgca 1620
acgcgcacag gcgagtctgc acgaataaac aaaaatgggt tcgaatacgc agtcttaacg 1680
gtgcactggg atcaccaaag tggcgcagct aaagttgcta atgcacatga aacagccgcc 1740
gaggttaatt actacggtgc attgcctacc gttgttgtgg gcgatttcaa cactggctgt 1800
acaagccatg aagtaaatac acttatgcat gaagctggaa tggttttgat tggtaacgcc 1860
ggcattgatt gcattttagc taagcgattc aatggtacgg cacaaacatt tgatgctgca 1920
ccatctgatc accctggctt ggatgcgtca ctcagtacca actag 1965
<210> 193
<211> 550
<212> PRT
<213> Microbulbifer degradans
<400> 193
Met Lys Ser Cys Cys Ile Lys Phe Phe Thr Thr Leu Cys Thr Ala Val
1 5 10 15
Tyr Val Leu Gly Cys Gly Ala Leu Ala His Ala Gln Thr Gly Pro Ala
20 25 30
Gly Tyr Asn Tyr Ala Ala Ala Glu Asn Glu Thr Val Tyr Leu Asn Gly
35 40 45
Thr Thr Asn Val Ala Tyr Gly Ala Asn Gly Ser Phe Tyr Tyr Ala Tyr
50 55 60
Asn Gln Thr Gly Ser Val Asn Cys Ser Asn Gln Thr Phe Gly Asp Pro
65 70 75 80
Ile Phe Gly Val Arg Lys Ala Cys Tyr Thr Gln Gln Val Ala Ser Asn
85 90 95
Asn Pro Pro Ser Val Ser Phe Ala Ser Pro Thr Gly Asn Leu Thr Val
100 105 110
Asp Glu Gly Tyr Ala Leu Ser Val Thr Val Asn Ala Ser Asp Ser Asp
115 120 125
Gly Ser Ile Ala Ser Val Glu Leu Phe Ile Asn Asn Gln Leu Val Arg
130 135 140
Gln Glu Leu Tyr Ala Pro Tyr Glu Trp Gly Ala Ala Ala Glu Pro Asp
145 150 155 160
Glu Leu Asn Gly Leu Pro Val Gly Thr His Thr Ile Lys Ala Val Ala
165 170 175
Thr Asp Asn Asp Gly Asp Thr Lys Gln Ala Ser Phe Lys Leu Thr Val
180 185 190
Arg Gly Ala Ala Val Asp Val Pro Gly Leu Val Gln Ala Glu Asp Tyr
195 200 205
Thr Gly Phe Tyr Asp Thr Thr Asn Gly Asn Thr Gly Gly Ala Tyr Arg
210 215 220
Asn Asp Asn Val Asp Val Glu Thr Thr Ser Asp Ser Asn Gly Gly Tyr
225 230 235 240
Asp Val Gly Trp Phe Ala Ala Asn Glu Trp Leu Glu Tyr Pro Ile Asn
245 250 255
Val Thr Glu Ala Gly Asn Tyr Val Leu Glu Ala Arg Val Ala Ser Ala
260 265 270
Val Gly Gly Gly Met Phe Thr Ala Glu Ile Asn Gly Asn Asn Ser Ser
275 280 285
Thr Phe Ser Ile Gly Asn Thr Gly Gly Trp Gln Asn Trp Gln Thr Leu
290 295 300
Asn Asn Asn Ile Gly Asn Leu Ser Thr Gly Lys Lys Thr Leu Arg Ile
305 310 315 320
Gln Ala Gln Ser Gly Asn Phe Asn Leu Asn Trp Leu Arg Leu Lys Arg
325 330 335
Ala Thr Thr Ser Val Cys Thr Leu Asn Thr Pro Ala Glu Asn Ile Pro
340 345 350
Thr Pro Phe Asn Leu Phe Thr Val Ile Asp Thr Asp Leu Asn Arg Tyr
355 360 365
Glu Phe Cys Lys Ala Ser Lys Trp Phe Glu Glu Ser Asn Gly Lys Gln
370 375 380
Val Phe Lys Leu Phe Thr Gly Asp Asn Leu Ala Asp Asn Val Pro Gly
385 390 395 400
Ala Arg Val His Ala Arg Thr Glu Ala Gly Gln Gly Leu Lys Phe Lys
405 410 415
Ala Gly Ser Thr Trp His Thr Phe Glu Ala Arg Met Lys Pro Ser Lys
420 425 430
Lys Leu Asp Tyr Thr Tyr Thr Ile Ala Gln Leu Phe Ala Gly Cys Cys
435 440 445
Gly Pro Gln Leu Arg Ile Glu Val Lys Ser Asn Gly Arg Ile His Met
450 455 460
Gly Ser Arg Gly Asn Gly Asn Ile Arg Ile Ser Asp Asp Gln Asp Tyr
465 470 475 480
Ala Asn Gly Ser Arg Ser Phe Lys Ile Lys Ile Arg Thr Asn Gly Asp
485 490 495
Gln Phe Glu Val Tyr Phe Asn Ser Ser Lys Lys Phe Ser Gly Arg Thr
500 505 510
Asp Glu Ala Lys Asn Gly Asn Thr Ser Ala Leu Tyr His Phe Arg Trp
515 520 525
Gly Val Tyr Ser Asn Glu Val Met Ser Glu Asp Leu Ser Asn Thr Val
530 535 540
Thr Glu Ile Ile Arg Asn
545 550
<210> 194
<211> 1653
<212> DNA
<213> Microbulbifer degradans
<400> 194
atgaaatcat gttgtatcaa gttttttacc acactctgta ctgctgttta tgtactgggt 60
tgtggcgcgt tagcgcatgc gcaaactggc ccggcgggtt ataactacgc tgctgccgaa 120
aacgaaacgg tgtatttaaa cggaaccacc aacgttgcct acggcgcgaa tggctcgttt 180
tattacgctt acaatcaaac cggttcggtt aattgctcta atcaaacctt tggcgaccct 240
atttttgggg tgcgcaaagc gtgttacacc cagcaagtgg cgagcaataa cccgccgtcg 300
gtttcgtttg ctagcccaac gggcaaccta acagtcgacg aaggttatgc tctttctgta 360
accgtgaacg ccagcgatag cgacggcagc attgccagtg tagagctgtt tattaacaac 420
caattagtac gacaagagct ttacgcacct tacgagtggg gcgcagctgc tgagccagac 480
gaattaaatg gcctacccgt tggtacacac acaataaaag cggtagccac cgataacgac 540
ggcgacacca aacaagctag ctttaagcta acggtacgcg gtgcggcggt agatgttcct 600
ggcttggtac aagcggagga ttacacaggc ttttacgaca caaccaatgg caacactggt 660
ggtgcgtatc gcaacgacaa tgtagacgta gaaaccacaa gcgatagtaa cggtggttac 720
gacgtaggtt ggtttgcggc aaacgaatgg ctggaatacc ccattaacgt taccgaagcg 780
ggcaattatg tattagaagc acgcgtcgca tctgctgtag gcggtggtat gtttaccgca 840
gaaataaatg gcaacaacag cagtacattt agcataggca ataccggcgg ctggcaaaat 900
tggcaaaccc tgaataacaa tattggcaat ttaagcaccg gtaaaaaaac gttacgcatt 960
caagcgcaaa gcggaaactt taatttaaac tggctgcgcc tcaagcgtgc cacaacctct 1020
gtgtgcacac ttaatactcc tgccgaaaac attccaacgc catttaattt atttaccgtt 1080
atcgatacag atttaaaccg ctacgaattc tgtaaagcgt ctaaatggtt tgaggaatct 1140
aacggtaaac aagtgtttaa attatttacc ggcgacaacc tagcagataa cgtacctggc 1200
gcccgcgtac atgctcgcac agaagctggc caagggctta agtttaaagc aggctccaca 1260
tggcacacct ttgaagccag aatgaaaccc agcaaaaagt tagactacac ttacaccatt 1320
gctcaattgt ttgccggctg ttgcgggccg cagttgcgca ttgaagtaaa atctaacgga 1380
cgcatccaca tggggtcgcg cggtaacggc aatattcgta ttagtgacga ccaagattac 1440
gccaacggct ctagatcgtt caaaattaaa attcgcacca atggcgatca gttcgaagtg 1500
tattttaaca gcagcaaaaa gttcagcggc cgcacagacg aagccaagaa cggcaatacc 1560
agtgcgcttt accacttccg ctggggtgta tattccaacg aagtaatgag cgaagattta 1620
tctaacactg ttacagaaat tattcgaaat taa 1653
<210> 195
<211> 500
<212> PRT
<213> Microbulbifer degradans
<400> 195
Met Phe Ala Ser Asn Leu Glu Asp Pro Thr Tyr Tyr Ile Lys Tyr Lys
1 5 10 15
Gly Val Tyr Met Asn Thr Asn Arg Lys Gln Val Asn Gln Asn Leu Ile
20 25 30
Ala Leu Leu Gly Leu Met Leu Leu Met Leu Phe Thr Pro His Gly Tyr
35 40 45
Ala Gln Asp Gln Cys Asn Thr Thr Leu Glu Cys Lys Ile Leu His Gly
50 55 60
Asp Thr Ala Thr Asp Cys Lys Asn Ser Arg Ser Asp Asn Ser Ile Cys
65 70 75 80
Met Cys Gly Ser Thr Glu Cys Ala Val Asp Asn Pro Gln Pro Glu Pro
85 90 95
Ser Asn Ser Ala Ala Val Pro Gly Leu Ile Gln Ala Glu Asp Phe Thr
100 105 110
Asp Tyr Tyr Asp Val Thr Ala Ala Asn His Gly Gly Ala Tyr Arg Ser
115 120 125
Thr Gly Val Asp Ile Gln Val Thr Thr Asp Thr Asn Gly Gly Tyr Asn
130 135 140
Val Gly Trp Ile Ala Ala Asn Glu Trp Leu Glu Tyr Asn Ile Asn Val
145 150 155 160
Leu Gln Ala Gly Asn Tyr Thr Ala Asn Ile Arg Val Ala Ser Asn Asn
165 170 175
Gly Val Gly Met Tyr Ser Leu Ala Val Asp Gly Val Thr Val Ser Gly
180 185 190
Thr Asn Thr Val Asn Gly Thr Gly Gly Trp Gln Val Trp Ile Thr Gln
195 200 205
Thr Ala Asn Leu Gly Tyr Leu Thr Gln Gly Glu His Thr Leu Arg Ile
210 215 220
Ala Val Gln Ala Gly Asn Phe Asn Ile Asn Trp Leu Glu Leu Leu Leu
225 230 235 240
Ala Gly Thr Gln Gln Pro Asp Met Leu Gly Val Phe Asn Lys Ser Arg
245 250 255
Asp Leu Leu Leu Ala Asn Phe Asp Ser Arg Pro Asp Pro Asp Asp Ile
260 265 270
His Ser Val Ala Ala Leu Ala Thr Met Leu Lys Asp Ser Arg Phe Ser
275 280 285
Asn Val Gln Tyr His Ala Val Ser Gly Ala Tyr Gly Ile Gln Gly Gly
290 295 300
Asp Tyr Ile Glu Ala Thr His Leu Phe Asn Leu Ala Phe Gly Ala Gly
305 310 315 320
Asn Trp Ser Asn Ala His Thr Asn Arg Asp Val Ala Leu Thr Thr Val
325 330 335
Tyr Asn Lys Val Ala Ala Thr Leu Thr Asn Gly Gly Asp Ile Trp Val
340 345 350
Gln Glu Ala Gly Gln Ser Asp Phe Ser Ala Asp Leu Val Arg Arg Ile
355 360 365
Lys Gln Gln Leu Pro Ala Ile Asn Thr Gln Thr Arg Ile His Ile Val
370 375 380
Gln His Ser Asn Trp Asn Gln Asp Lys Thr Thr Pro Ala Asp Leu Thr
385 390 395 400
Tyr Val Lys Asn Gln Thr Asp Tyr Lys Lys Ile Ala Asp Gly Asn Ser
405 410 415
Thr Gly Asn Gly Thr Pro Gly Phe Asn Ser Ser Ser Ser Ala Asn Trp
420 425 430
Asn Arg Ala Leu Asn His Ala Gln Val Gly Ala Ile Trp Gln Glu Ala
435 440 445
Lys Arg Ile Ala Asp Asn Ala Ile Ala Asn His Gln Gly Trp Gln Asn
450 455 460
Pro Asn Ile Arg Asp Gly Gly Met Asp Phe Ser Asp Ala Val Glu Asp
465 470 475 480
Cys Trp Ile Phe Gly Phe Asn Ser Leu Thr Asn Ile Asn Ser Phe Phe
485 490 495
Asp Glu Phe Leu
500
<210> 196
<211> 1503
<212> DNA
<213> Microbulbifer degradans
<400> 196
atgtttgcta gcaacctcga agaccctact tactatataa aatataaagg tgtttacatg 60
aatactaata ggaaacaagt caaccaaaac ctcatcgcgc tattgggctt aatgttgcta 120
atgctattta cacctcatgg ctatgcgcaa gaccaatgca acaccacgct tgagtgcaaa 180
atactccacg gcgatacagc cacggattgt aaaaatagtc gctcagataa cagtatttgc 240
atgtgtggaa gtactgaatg tgcagtagat aatccacaac ccgaacctag caattccgcc 300
gcagtaccgg gcctaataca agcagaagac tttaccgatt actacgatgt aacggcggct 360
aatcacggcg gcgcttaccg ctccacaggc gtagacattc aagttaccac agacaccaat 420
ggcgggtaca acgtaggctg gatagccgct aatgaatggc tagagtacaa cataaacgta 480
ctgcaagctg gtaactacac cgcgaatatt cgtgttgcat ctaataatgg tgttggcatg 540
tatagcttgg cggtagatgg tgtaacggta agtggcacca acaccgttaa tggcacgggt 600
ggctggcaag tttggattac ccaaaccgca aaccttggct acttaaccca aggcgagcat 660
acgttgcgta tagcagtaca ggctggcaat tttaatatta actggttaga gctgctgcta 720
gcgggtactc agcaaccaga tatgcttggt gtatttaata aaagccgcga cctgttgtta 780
gcaaatttcg atagccgccc agacccagac gatattcatt cagtagcggc cctggcaaca 840
atgctaaaag attcccgttt tagcaatgtg caataccacg cagtatcggg tgcttacggt 900
atacaaggtg gtgattacat tgaagctaca catttgttta acttagcctt tggtgccggt 960
aattggtcca acgcgcacac caatagagat gtggcgctta caactgttta caacaaagtt 1020
gccgcaacgt taaccaatgg tggtgatatt tgggttcaag aggccggtca atccgacttt 1080
agcgccgatt tggtacgcag aatcaaacaa cagttacccg ctattaatac gcaaacgcga 1140
attcatatag tgcagcatag taactggaac caagacaaaa ccactccagc cgatttaacc 1200
tatgtaaaaa accaaaccga ctacaaaaaa attgctgatg gcaattccac aggcaacggc 1260
acgcctgggt ttaactcatc cagcagcgcc aactggaacc gcgcactaaa ccacgcgcaa 1320
gtaggcgcaa tttggcaaga agcaaaacgc atagccgaca atgccatcgc caaccaccaa 1380
ggctggcaaa accccaatat tcgagacggc ggtatggact tctcggatgc agtcgaagac 1440
tgctggatat tcggcttcaa ctcactcaca aatataaaca gcttttttga tgagttttta 1500
tag 1503
<210> 197
<211> 473
<212> PRT
<213> Microbulbifer degradans
<400> 197
Met Gln Leu Ile Gln Leu Leu Lys Thr Met Gly Ile Ser Arg Leu Cys
1 5 10 15
Ile Phe Leu Phe Gly Ala Val Leu Ala Ser Thr Met Leu Ala Gly Cys
20 25 30
Gly Gly Ser Ser Ser Ser Glu Lys Ser Ser Gly Gln Val Val Thr Glu
35 40 45
Pro Glu Ser Glu Pro Glu Ser Glu Pro Glu Ser Glu Pro Glu Ser Glu
50 55 60
Pro Glu Ser Glu Pro Glu Ser Glu Pro Glu Ser Glu Pro Asp Pro Ala
65 70 75 80
Pro Asp Thr Ala Gln Asp Met Arg Ser Glu Lys Arg Gly Leu Ala Tyr
85 90 95
Gly Tyr His Ser Glu Asn Asp Leu Lys Ala Met Gln Gly Lys Val Lys
100 105 110
Trp Trp Tyr Asn Trp Asp Thr Gln Ala Asp Ala Asn Val Lys Glu Asn
115 120 125
Tyr Ala Ser Tyr Gly Tyr Asp Phe Val Pro Met Ala Trp Asp Glu Asn
130 135 140
Phe Asn Glu Glu Ala Leu Arg Ser Phe Leu Asp Asn His Pro Asp Val
145 150 155 160
Lys Tyr Leu Leu Gly Trp Asn Glu Pro Asn Phe Met Glu Gln Ala Asn
165 170 175
Leu Thr Pro Ala Glu Ala Ala Ala His Trp Pro Val Leu Glu Ala Ile
180 185 190
Ala Gln Asp Tyr Asn Leu Lys Leu Val Ala Pro Ala Val Asn Tyr Ser
195 200 205
Pro Gly Asn Val Asp Ile Pro Gly Thr Asp Asp Asp Tyr Asp Pro Trp
210 215 220
Leu Tyr Leu Asp Ala Phe Phe Glu Ala Cys Glu Gly Cys Gln Val Asp
225 230 235 240
Tyr Ile Ala Val His Cys Tyr Met Lys Tyr Glu Ser Ala Phe Ser Trp
245 250 255
Tyr Val Gly Glu Phe Glu Arg Tyr Asn Lys Pro Ile Trp Val Thr Glu
260 265 270
Trp Ala Gly Trp Asp Asp Gly Gly Pro Ala Asn Met Gly Glu Gln Met
275 280 285
Asn Phe Leu Ser Asp Thr Val Arg Trp Met Glu Ser Asn Asp Asn Ile
290 295 300
Tyr Arg Tyr Ser Trp Phe Leu Gly Arg Ser Ser Glu Gly Tyr Asp Gln
305 310 315 320
Phe Pro Tyr Leu Asp Val Leu Leu Ala Asp Gly Glu Leu Thr Pro Leu
325 330 335
Gly Ser Val Tyr Thr Ser Ile Pro Ser Asn Asp Phe Arg Tyr Lys Ile
340 345 350
Pro Ala Arg Ile Glu Ala Glu Gly Ala His Ser Leu Thr Gly Phe Lys
355 360 365
His Leu Ala Thr Thr Asp Thr Thr Gly Leu Ala Lys Leu Ile Ala Ala
370 375 380
Ser Asn Glu Val Ala Glu Tyr Lys Leu Asn Val Glu Glu Gly Gly Asp
385 390 395 400
Tyr Thr Leu Ala Leu Arg Leu Ala Ser Ser Ala Asn Ser Asp Ile Ala
405 410 415
Ile Arg Val Asp Gly Leu Leu Val Tyr Thr Phe Glu Asp Ile Asn Thr
420 425 430
Gly Gly Val Glu Ala Trp Met Thr Phe Ser Ser Thr Pro Ile Ser Leu
435 440 445
Thr Ala Gly Asp His Ile Leu Arg Val Glu Ser Lys Ser Ser Arg Phe
450 455 460
Gly Phe Asn Trp Leu Glu Leu Thr Asn
465 470
<210> 198
<211> 1422
<212> DNA
<213> Microbulbifer degradans
<400> 198
atgcagttaa ttcagctact aaagacgatg ggtattagtc ggctatgtat ttttttattt 60
ggtgccgtac tcgctagcac aatgctagct ggttgcggtg gttcttcaag ttcagaaaag 120
tcctcaggcc aagtggtgac tgagcccgaa tctgaaccag agtcagaacc agagtcagaa 180
ccagagtcgg agccagaatc tgaaccagag tcggagccag aatccgagcc ggacccagcg 240
cccgacacag cgcaagatat gcgcagcgaa aaacgcggct tagcctatgg ctaccacagt 300
gaaaacgacc tcaaagctat gcagggtaaa gtgaagtggt ggtacaactg ggatacccaa 360
gcggacgcga acgtaaaaga gaattacgcc agttatggct acgattttgt tcctatggcg 420
tgggacgaaa actttaacga agaagcgctg cgcagctttt tagataatca ccccgatgtg 480
aaatacctgc tgggctggaa tgaaccaaac ttcatggaac aggccaacct cacaccagcg 540
gaagccgcag cccattggcc cgtgctagag gccatcgcgc aggattacaa cctaaaatta 600
gtcgcccctg cggtgaacta cagccccggt aatgttgata ttccaggtac cgatgatgat 660
tacgaccctt ggctatacct cgatgccttc tttgaagcgt gtgaaggttg ccaagtagat 720
tacattgccg tgcattgcta tatgaaatac gaaagcgctt tcagttggta tgtgggtgaa 780
tttgagcgct acaacaaacc tatttgggta accgagtggg cagggtggga cgatggcggc 840
ccagcgaata tgggtgagca aatgaacttc ttgtccgata ctgtgcgctg gatggagagc 900
aacgataata tatatcgcta ttcttggttt ttggggcgca gtagtgaagg ctacgatcag 960
ttcccatacc tagatgtttt actggccgat ggcgaactaa caccgttggg tagtgtgtat 1020
acctctattc cgtccaacga ttttcgctac aaaatacccg cgcgtattga ggccgaaggc 1080
gcccatagct taacgggctt caaacactta gccacaaccg atactacagg tttagctaag 1140
ttaatagccg cgtctaacga agtagcagag tacaaattaa acgtggaaga ggggggcgat 1200
tacaccttag ctttacgttt ggcttcatct gcaaacagtg atattgccat ccgtgtggat 1260
ggcttattgg tgtacacctt cgaagatatt aataccggcg gtgttgaggc gtggatgacc 1320
tttagctcaa cccctattag tttaaccgcg ggtgatcaca tattacgtgt agagtctaaa 1380
tcgtcgcgtt ttggctttaa ttggttagag cttactaatt ag 1422
<210> 199
<211> 465
<212> PRT
<213> Microbulbifer degradans
<400> 199
Met Lys Leu Leu Ser Ile Thr His Thr Leu Lys Arg Ala Ile Ala Ser
1 5 10 15
Ala Val Phe Val Ala Ser Ala Ala Thr Ala Ser Ile Ala Asn Ala Val
20 25 30
Thr Val Asp Ile Leu Val Leu Tyr Asp Asn Tyr Ser Ala Asn Tyr Phe
35 40 45
Gly Gly Asp Pro Gln Thr Ala Met Asn Gly Trp Ala Asn Asp Met Asn
50 55 60
Ser Ala Leu Lys Ala Ser Gln Ile Asp Met Lys Phe Arg Ile Val Gly
65 70 75 80
Val Arg His His Glu Glu Asp Gly Ala Gly Met Gly Asp Val Leu Gly
85 90 95
Asn Leu Arg Val Asp Gly Gly Ala Ile Ala Leu Arg Asp Gln Leu Gly
100 105 110
Ala Asp Met Val Ser Gln Leu His Glu Lys Gly Ala Cys Gly Val Gly
115 120 125
Tyr Val Ala Val Asp Lys Asn Tyr Thr Trp Asn Val Thr His Pro Gly
130 135 140
Cys Gly Pro Met Val Met Leu His Glu Phe Gly His Asn Met Gly Val
145 150 155 160
Thr His Ser Arg Lys Gln Gly Asp Gln Gly Gly Thr Arg Tyr Arg Tyr
165 170 175
Gly Val Gly Tyr Gly Val Gln Asp Val Phe Val Asp Ile Met Ala Tyr
180 185 190
Glu Gly Val Phe Asn Thr Ser Arg Val Asn Val Phe Ser Asn Pro Asn
195 200 205
Leu Asn Cys Arg Gly Leu Pro Cys Gly Lys Pro Val Gly Asp Ser Glu
210 215 220
Glu Ala His Ala Ser Leu Ala Ile His Asn Val Arg Asn Glu Leu Ala
225 230 235 240
Asn Phe Arg Asn Thr Val Asn Ser Gly Gly Pro Val Arg Leu Phe Glu
245 250 255
His Cys Tyr Tyr Thr Gly Tyr Thr Val Gly Leu Gly Glu Gly Ser Tyr
260 265 270
Arg Leu Ala Asp Leu Met Asn Arg Gly Leu Val Asn Asp Asp Leu Ser
275 280 285
Ser Leu Gln Val Asp Ala Gly Tyr Arg Val Glu Met Phe Gln His Asp
290 295 300
Asn Phe Thr Gly Asn Val Val Thr Arg Thr Gly Ser Asp Asp Cys Leu
305 310 315 320
Val Asp Glu Gly Met Asn Asp Asp Ile Ser Ser Leu Arg Ile Thr Arg
325 330 335
Val Ser Gly Gly Phe Ser Gln Thr Ile Gln Ala Glu Asn Phe Phe Ala
340 345 350
Asn Asn Gly Val Gln Leu Glu Asn Thr Thr Asp Ser Gly Gly Gly Gln
355 360 365
Asn Val Gly Trp Ile Asp Ala Asn Asp Trp Met Ala Phe Ser Asn Ile
370 375 380
Thr Ile Pro Thr Thr Gly Asn Tyr Arg Ile Glu Tyr Arg Val Ala Gly
385 390 395 400
Phe Gly Gly Thr Leu Ser Leu Asp Leu Asn Gly Gly Ala Ile Val Leu
405 410 415
Gly Gln Ile Asn Leu Pro Asn Thr Asn Gly Trp Gln Asn Trp Gln Thr
420 425 430
Ala Ser His Thr Val His Ile Asn Ala Gly Thr Tyr Asn Phe Gly Ile
435 440 445
Phe Ala Asn Ala Pro Gly Trp Asn Ile Asn Trp Phe Arg Ile Val Gln
450 455 460
Leu
465
<210> 200
<211> 1398
<212> DNA
<213> Microbulbifer degradans
<400> 200
atgaaattac tctcaatcac tcacacacta aagcgggcaa ttgcgagcgc agtttttgtc 60
gcgtccgctg ctaccgcctc tatagccaat gcagtaacgg tagatattct tgtgctttac 120
gataattatt cggcaaatta ttttggcggc gacccgcaga cggccatgaa cggttgggct 180
aacgatatga actctgcgtt aaaagccagc caaattgata tgaagttccg cattgttggt 240
gtgcgtcatc acgaagaaga tggcgcgggc atgggcgacg tacttggtaa tttgcgcgtt 300
gacggtggcg caatagccct gcgcgatcaa cttggtgcag atatggtgtc acaactccac 360
gaaaagggtg cctgtggcgt ggggtatgtt gccgtagata aaaattacac atggaacgtt 420
actcacccag gctgcgggcc aatggtaatg ctgcatgaat ttggccataa catgggggtt 480
actcactcgc gtaaacaggg tgatcaaggt ggtacccgct atcgctatgg cgtaggttat 540
ggtgtgcagg atgttttcgt cgatatcatg gcttacgaag gcgtttttaa caccagccgc 600
gttaatgtat tttctaaccc taacctcaat tgcagaggcc ttccgtgtgg taaacccgtt 660
ggcgattccg aagaagccca cgcttcactc gctattcaca atgtgcgtaa cgaacttgcc 720
aattttcgca ataccgtaaa ttctggtgga ccggttcgat tgtttgaaca ctgttattac 780
acgggttata cagttggctt aggcgaaggt agctatcgct tagccgattt gatgaaccgc 840
ggtttggtaa acgacgattt gtcatcgttg caagttgatg caggctatag agtggaaatg 900
tttcaacacg acaactttac cggcaatgtt gtaacgcgca ccggcagcga cgattgttta 960
gtcgatgaag gcatgaacga cgatataagt tcactgcgta taacacgagt aagtggcggg 1020
tttagtcaaa ctatccaagc agagaatttt ttcgctaaca acggcgtgca actagagaat 1080
acaaccgata gcggtggcgg ccaaaacgta ggttggattg atgctaacga ttggatggct 1140
ttcagcaaca ttaccattcc aaccaccggt aactaccgta tcgaataccg cgttgcaggt 1200
tttggtggca cgctttcgct cgacttaaat ggcggcgcta tagtgctagg ccaaattaat 1260
ttacccaata ccaatggctg gcagaattgg caaaccgcat cgcacactgt acacattaat 1320
gctggcactt ataatttcgg tatttttgct aacgcacccg gctggaatat caattggttc 1380
cgtatagtac agctttaa 1398
<210> 201
<211> 1024
<212> PRT
<213> Microbulbifer degradans
<400> 201
Met Lys Lys Pro Pro Ser Arg Tyr Ser Thr Leu Ala Ile Thr Leu Cys
1 5 10 15
Met Ser Leu Ser Gln Ala Ala Ile Ala Lys Asp Ile Tyr Val Ala Pro
20 25 30
Thr Gly Asp Asp Ala Gly Ala Gly Ser Phe Ser Ser Pro Tyr Gln Thr
35 40 45
Leu Ala Lys Ala Ala Gln Thr Ala Gln Ala Gly Asp Val Val Tyr Leu
50 55 60
Arg Glu Gly Thr Tyr Gln Glu Thr Leu Arg Pro Ala Asn Ser Gly Thr
65 70 75 80
Ala Ser His Pro Ile Val Phe Gln Ser Tyr Gln Asn Glu Lys Val Ile
85 90 95
Ile Ser Ala Met Glu Ala Leu Ser Gly Trp Gln Gln Asp Thr Ser Asn
100 105 110
Ile Tyr Lys Thr Thr Val Asn Trp Asp Leu Gly Gln Glu Asn Phe Val
115 120 125
Met His Lys Ser Thr Ala Leu Asp Leu Ala Arg Trp Pro Asn Asn Thr
130 135 140
Asp Ala Asp Pro Phe Thr Leu Asn Ser Lys Arg Asn Thr Gly Gly Ser
145 150 155 160
Gly Pro Glu Val Gly Gln Gly Ala Tyr Ile Glu Tyr Ala Ala Gly Leu
165 170 175
Pro Asn Ile Asn Trp Thr Gly Gly Thr Val Phe Tyr Tyr Gly Asp Lys
180 185 190
Pro Gly Gly Gly Trp Leu Ala Trp Arg Glu Thr Ile Val Ser His Thr
195 200 205
Gln Thr Arg Ile Asn Ile Asp Leu Ser Lys Lys Asn Pro Ala Trp Val
210 215 220
Arg Thr Ala His Asp Pro Ala Ser Gly Gly Glu Phe Tyr Leu Met Gly
225 230 235 240
Val Lys Gly Ala Leu Asp Tyr Gln Asn Glu Trp Tyr Phe Asp Ser Asn
245 250 255
Thr Arg Glu Leu Phe Val Gln Leu Pro Asn Gly Ala Arg Pro Gln Asn
260 265 270
Gly Asp Ile Gln Phe Arg Lys Arg Leu Gln Thr Ile Asn Leu Ala Asn
275 280 285
Arg Ser His Ile His Ile Lys Asn Ile Ala Val Phe Gly Gly Ala Ile
290 295 300
Glu Ile Thr Asn Asn Ala Asn Ser Asn Leu Leu Ser Gly Val Ser Ser
305 310 315 320
Phe Tyr Gly Asn Ala Thr Leu Gly Val His Thr Gly Phe Ser Ala Pro
325 330 335
Ser Tyr Ser Val Lys Ile Gln Gly Ser Asp Asn Arg Ile Glu Asn Ser
340 345 350
Glu Ile Ala Tyr Gly Ser Gly Thr Gly Ile Tyr Asp Ser Gly Thr Arg
355 360 365
Ser Gln Ile Val Asn Asn Tyr Ile His Asp Phe Asn Thr Leu Gly Asp
370 375 380
Tyr Asn Ala Pro Val Asn Ala Arg Gly Gly Ser Asn Thr Leu Val Lys
385 390 395 400
Asn Asn Arg Ile Ser Arg Gly Gly Arg Asp Thr Ile Gln Ala Phe Asn
405 410 415
Arg Asp Ser Glu Trp Ser Tyr Asn Asp Val Ser His Ser Asn Leu Ile
420 425 430
Ala Asp Asp Cys Gly Leu Phe Tyr Thr Val Gly Gly Pro His Asn Val
435 440 445
Glu Ile His His Asn Trp Phe His Asp Ala Tyr Ser His Gly Asn Lys
450 455 460
Asn Lys Ala Ala Gly Ile Tyr Leu Asp Asn Asp Ala Arg Gly Phe Lys
465 470 475 480
Val His His Asn Val Val Trp Asn Thr Glu Trp Thr Gly Ile Gln Ile
485 490 495
Asn Trp Asn Gly Thr Asp Ile Asp Val Phe Asn Asn Thr Leu Trp Asn
500 505 510
Asn Ser Ala Ala Met Gly Ala Trp His Lys Ala Gly Thr Ala Phe Ser
515 520 525
Asp Val Arg Val Trp Asn Asn Leu Ser Asn Ser Asn Lys Trp Glu Glu
530 535 540
Gln Ala Asn Lys Gln Asn Asn Leu Thr Gly Thr Gly Asp Pro Phe Val
545 550 555 560
Asn Ser Gln Ala Gly Asp Phe Arg Leu Lys Ala Asn Thr Ala Pro Ile
565 570 575
Asp Tyr Gly Arg Thr Ile Ala Gly Thr Thr Glu Gly His Ser Gly Ala
580 585 590
Asn Pro Asp Ala Gly Ala Tyr Glu Tyr Gly Ala Thr Ala Trp Lys Ala
595 600 605
Gly Val Thr Trp Asp Ile Thr Lys Gly Ala Ala Asn Arg Cys Tyr Gln
610 615 620
Leu Pro Gly Glu Val Cys Phe Asp Asp Thr Asp Thr Gly Gly Asp Ile
625 630 635 640
Gly Glu Leu Pro Gly Lys Val Glu Ala Glu Asn Tyr Ala His Tyr Tyr
645 650 655
Asp Thr Thr Pro Gly Asn Ile Gly Gly Ala Tyr Arg Asn Gln Asp Val
660 665 670
Asp Ile Gln Pro Thr Thr Asp Thr Leu Gly Gly Tyr Asn Val Gly Trp
675 680 685
Ile Asn Ala Gly Glu Trp Leu Glu Tyr Asp Ile Asp Val Thr Gln Ala
690 695 700
Gly Arg Tyr Asp Ala Glu Leu Arg Val Ala Ser Lys Leu Gly Ala Gly
705 710 715 720
Gln Val Ala Ile Ala Ile Asp Gly Val Ala Arg Gly Glu Ala Leu Thr
725 730 735
Ile Gln Ser Thr Gly Asp Trp Gln Asn Trp Ala Thr Leu Thr Thr Gln
740 745 750
Leu Gly Tyr Leu Glu Ala Gly Leu His Thr Leu Arg Val Ser Ala Met
755 760 765
Ser Gly Gly Phe Asn Leu Asn Trp Tyr Asn Phe Thr Lys Gln Val Ser
770 775 780
Phe Gly Glu Thr Ala Val Gly Phe Thr Asn Thr Pro Pro Thr Arg Ile
785 790 795 800
Pro Ala Thr Arg His Gln Ser Phe Ser Val Asp Tyr Val Ala Asn Glu
805 810 815
Pro Arg Glu Leu Phe Leu Leu Phe Phe Asn Pro Asp Trp Ser Trp Val
820 825 830
Ala Ser Thr Lys Thr Thr Val Glu Pro Gly Lys Ala Ser Thr Thr Leu
835 840 845
Gln Leu Asn Leu Pro Phe Val Pro Thr Ala Val Gln Ala Phe Asn Val
850 855 860
Lys Leu Glu Asn Arg Pro Ile Gly Ala Asn Trp Asp Asn Pro Asn Asn
865 870 875 880
Val Glu Ala His Ala Ala Val Thr Thr Gln Ala Ala Pro Leu Gln His
885 890 895
Asn Gly Gly Phe Glu Asn Gly Gly Thr Ser Gly Trp Asn Gly Tyr Gly
900 905 910
Ser His Ala Leu Ser Thr Glu Ala His Ser Gly Ser Tyr Ala Gly Lys
915 920 925
Val Thr Gly Gly Pro Ser Ala Phe Ser Gln Thr Ile Pro Asn Leu Thr
930 935 940
Pro Asn Thr Thr Tyr Ser Leu Ser Ala Phe Val Lys Ala Arg Ala Gly
945 950 955 960
His Thr Gly Phe Leu Gly Val Lys Glu Tyr Gly Gly Gln Glu Thr Ser
965 970 975
Leu Ile Val Asn Ser Thr His Tyr Gln Lys Lys Thr Ile Thr Phe Thr
980 985 990
Thr Gly Pro Asn Ala Ser Ser Ala Lys Ile Tyr Leu Tyr Val Arg Asp
995 1000 1005
Asn Asn His Val Ala Phe Ile Asp Glu Leu Val Ile Val Lys Val
1010 1015 1020
Tyr
<210> 202
<211> 3075
<212> DNA
<213> Microbulbifer degradans
<400> 202
atgaaaaaac cacctagccg ttacagtacg ctggccataa ccctgtgtat gagtttatcg 60
caggccgcta tagcaaagga tatctacgtc gcccctacag gagatgacgc tggagccgga 120
tcgtttagca gcccctatca aacgctggcg aaagcggcac aaaccgcaca agctggtgat 180
gtcgtttacc tgcgcgaagg cacctatcag gaaacgcttc gcccagcgaa ttcaggcaca 240
gccagtcacc ccattgtgtt tcaatcctat cagaatgaaa aagtgatcat cagtgcaatg 300
gaagccttaa gcggctggca gcaagacacc tcaaatattt ataaaaccac ggtgaactgg 360
gatctaggcc aagaaaattt tgtgatgcat aaatctacag cactggattt agcccgctgg 420
ccaaacaata ccgatgcaga ccccttcacg ttaaattcta aaagaaacac agggggaagt 480
ggccccgagg taggccaagg tgcttatata gaatacgcag caggattgcc caatattaat 540
tggactgggg gtaccgtatt ttattatggc gataaacccg gtggcggttg gctagcgtgg 600
cgcgagacga ttgtaagcca cactcaaacc cgtataaata tagacctttc taaaaagaat 660
cccgcgtggg tacgcaccgc acacgacccg gcaagcggcg gagaattcta tttaatgggc 720
gtaaaaggtg ctttagatta tcagaacgaa tggtacttcg attcaaatac gcgagagctt 780
tttgtccaac tgcccaacgg cgcgcgccca caaaatggcg atattcaatt tagaaaacgc 840
ctgcagacca ttaacttggc aaatagaagc cacatacaca ttaaaaacat tgctgttttt 900
ggtggcgcta tagaaataac caataacgct aattctaacc tgctttctgg ggtatccagt 960
ttttatggca atgctacgct cggtgtacac accggatttt cggcgccaag ctacagcgta 1020
aaaattcaag gcagcgataa ccgtatagaa aacagtgaaa ttgcctacgg ctccggtact 1080
gggatatacg acagcggcac ccgcagccaa attgtgaata actacattca cgattttaat 1140
acccttggcg actacaacgc gccggttaac gcgcgcggcg gaagtaatac gttagtaaaa 1200
aataatcgaa tctcccgcgg aggccgcgac accatacaag cttttaatcg agacagcgaa 1260
tggtcgtata acgatgtttc ccacagtaat ttaatagccg atgactgcgg cttgttttac 1320
accgttggcg gaccacacaa tgtggaaatc catcacaact ggtttcacga cgcatattca 1380
cacggcaaca aaaacaaagc cgctggtatt tatttagata acgatgcccg aggttttaaa 1440
gtacaccaca acgtggtgtg gaacaccgag tggacaggca ttcaaataaa ttggaacggc 1500
acggacatag acgtatttaa caacacatta tggaacaaca gtgccgctat gggagcatgg 1560
cacaaagccg ggactgcgtt ttcggatgtt cgagtttgga acaacctctc caacagtaat 1620
aagtgggagg agcaagccaa taagcaaaac aaccttacgg gaacaggcga tccgtttgtg 1680
aacagccaag ccggagattt tagattaaaa gccaatacgg cccctatcga ctatggccgc 1740
actatagcgg gtacaaccga ggggcacagt ggcgccaacc cagatgctgg tgcgtatgaa 1800
tacggcgcaa ccgcttggaa agcaggcgtc acatgggata taaccaaagg cgcagccaac 1860
cgctgctatc aacttcccgg tgaggtttgc tttgatgaca ccgataccgg cggagacatt 1920
ggtgagctac ccggtaaagt agaagccgaa aactacgccc actactacga caccaccccc 1980
ggtaatattg gcggcgccta ccgcaaccaa gatgtagata ttcagcccac caccgataca 2040
ctaggtggct acaacgtagg ctggattaac gctggcgagt ggttagaata cgacatcgat 2100
gtcactcagg cggggcgcta tgatgccgag cttagagtag cctctaaact gggtgcgggc 2160
caagtagcca tagccataga tggggtcgct agaggcgaag cgttaacgat tcagtcgacg 2220
ggcgattggc aaaattgggc aaccctaaca actcaactag ggtatttaga agcagggcta 2280
catacgctgc gggtatccgc tatgagcggt ggcttcaatc taaattggta taacttcact 2340
aaacaagtaa gctttgggga aaccgccgta ggcttcacca acactcctcc tacacgtata 2400
ccagctacaa ggcatcagag cttttctgtg gattatgttg ccaacgagcc gcgagaactg 2460
ttcttattat ttttcaaccc cgactggagt tgggtagcct cgacaaaaac aaccgtagag 2520
cccggtaaag cttctaccac gttgcaatta aaccttccct ttgtgccaac cgctgtgcaa 2580
gcatttaacg ttaaattaga aaatcgcccc ataggcgcca actgggataa cccgaacaat 2640
gtcgaagcgc acgctgccgt taccacgcaa gcagcaccgc tacaacataa cggtggcttc 2700
gaaaacggtg gcacctcggg ttggaatgga tacggtagcc atgcacttag cacagaagcc 2760
cacagcggca gctatgcagg caaagtaacc ggtggcccct ccgcgttttc gcaaactatt 2820
cccaacctta ccccaaacac cacctacagc ctctcggctt ttgttaaagc acgcgctggt 2880
cacacggggt ttttaggcgt taaagaatat ggaggccaag aaacgagcct gatagtaaac 2940
agcacccact atcaaaagaa aaccatcacc ttcaccaccg gccccaacgc cagctcggca 3000
aaaatttatt tatacgttag ggataacaac catgtggcgt ttattgatga gttagtcatt 3060
gtgaaagtgt actag 3075
<210> 203
<211> 912
<212> PRT
<213> Microbulbifer degradans
<400> 203
Met Lys Lys Ile Leu Phe Leu Ala Leu Gly Leu Leu Ile Ser Gln Ala
1 5 10 15
Ser Phe Ala Gln Gln Thr Ile Ser Ser Leu Ala Glu Phe Ile Ala Leu
20 25 30
Gln Asp Gly Ser Asn Gln Asn Ile Lys Met Ala Pro Gly Thr Tyr His
35 40 45
Ile Ser Ser Ser Ser Lys Ser Leu Phe Pro Gly Gly Asp Trp Arg Ala
50 55 60
Asn Val Glu Gly Asn Trp Pro Gly Leu Phe Lys Phe Ser Gly Asn Asn
65 70 75 80
Asn Thr Phe Asp Leu Thr Gly Val Thr Phe Thr Phe Asp Ser Thr Ile
85 90 95
Leu Leu Glu Met Pro Asn Leu Val His Ala Asn Leu Met Glu Phe Gly
100 105 110
Gly Ser Gly Asn Val Trp Lys Gly Leu Asn Ile Gln Glu Lys Pro Asn
115 120 125
Ser Lys Gly Glu Tyr Gly Ala Phe Ile His Thr Ser Gly Gly Thr Ile
130 135 140
Ala Val Phe Thr Gly Asp Asn His Lys Val Ser Asp Phe Thr Leu Lys
145 150 155 160
Thr Arg Phe Ser Arg Pro Tyr Gly Leu Gly Ser Leu Tyr Gly Lys Thr
165 170 175
Gly Asn Ser Ser Ser Thr Leu Pro Gly Val Arg Leu Ser Lys Lys Thr
180 185 190
Ala Met Phe Leu Ile Ser Leu Asp Asp Ser Tyr Phe Glu Asn Val Leu
195 200 205
Ile Asp His Ser Gly Phe Gly His Thr Leu Ala Phe Asn Gly Val Asp
210 215 220
Asn Val Val Phe Asn Thr Val Glu Ile Ile Ala Glu Ser Arg Ser Thr
225 230 235 240
Asp Asp Leu Tyr Ala Asn Gly Ile Gly Gly Thr Asp Arg Asn Gly Val
245 250 255
Pro Phe Asn Val Leu Phe Asn Gly Asp Glu Leu Ile Gly Thr Asn Phe
260 265 270
Ala Asp Ala Asp Tyr Phe Leu Asn Leu Phe Asp Thr Asp Asn Phe Asn
275 280 285
Gln Cys Gln Asn Met Ser Gly Gly Val Gln Tyr Ser Pro Ile Arg Lys
290 295 300
Gly Tyr Gln Tyr Ser Leu Thr Glu Asp Ser Phe Arg Gly Tyr Tyr Ser
305 310 315 320
Ala Gly Ser Leu Gly Asn Ile Glu Ile Tyr Asn Ala Thr Val Thr Gly
325 330 335
Ser Arg Ala Gly Val Val Met Glu Tyr Ala Ser Glu Gly Met Ile Val
340 345 350
Asp Gly Met Thr Val Arg Gly Ile Ala Gly His Gly Val Pro Ala Cys
355 360 365
Asp Gly Ala Trp Asn Ser Ala Asn Gly Gly Glu Gly Asp Ala Ser Ala
370 375 380
Tyr Gly Pro Pro Ser Asn Ser Val Leu Lys Arg Ala Lys Ala Asp Ala
385 390 395 400
Ala Tyr Ser Thr Val Leu Glu Ile Pro Asn Phe Val Asp Asn Val Thr
405 410 415
Ala Asp Ile Glu Val Leu Asp Pro Leu Asn Gly Tyr Asn Arg Pro Ser
420 425 430
Ala Ser Asn Ala Leu Ala Leu Ile Lys Gly Asp Asp His Asp Ile Arg
435 440 445
Leu Trp Lys Arg Asp Asn Gln Ala Leu Thr Arg Asp Leu Val Val Lys
450 455 460
Val Ser Asp Ala Asp Asn Leu Leu Leu Cys Asn Met Thr Lys Gln Gly
465 470 475 480
Val Thr Val Ala Ser Ser Val Thr Asn Ser Thr Ile Tyr Ser Val Gly
485 490 495
Ser Ile Ser Asn Ser Ser Gly Ser Ser Asn Thr Val Val Lys Leu Ser
500 505 510
Ser Ala Ala Asp Glu Pro Ala Ile Cys Lys Ala Leu Glu Thr Asn Glu
515 520 525
Val Ile Thr Asp Cys Gly Asp Phe Asp Ala Phe Ala Gly Ile Gln Ala
530 535 540
Glu Ala Tyr Cys Asp Met Ala Gly Val Glu Ile Glu Ser Ser Ser Asp
545 550 555 560
Asp Gly Gly Glu Gln Val Gly Tyr Ile Asn Asn Asn Glu Trp Ile Ala
565 570 575
Phe Asn Asp Val Asp Phe Gly Asn Gly Ala Ser Gly Phe Glu Ala Arg
580 585 590
Val Ser Ser Ala Thr Ser Gly Gly Asn Ile Glu Leu Arg Leu Asp Ser
595 600 605
Gln Thr Gly Ala Leu Ile Gly Thr Cys Ser Val Asn Gly Thr Gly Ser
610 615 620
Trp Thr Thr Tyr Glu Thr Val Ser Cys Asp Ile Gly Gly Val Ser Gly
625 630 635 640
Val Gln Asp Leu Tyr Leu Val Phe Thr Gly Asn Ser Gly Tyr Leu Met
645 650 655
Asn Val Asn Trp Phe Asn Phe Thr Gln Ala Ala Thr Asn Cys Ser Leu
660 665 670
Pro Trp Ser Asp Ser Asp Phe Ser Val Glu Lys Glu Ile Val Asn Tyr
675 680 685
Ser Ser Gly Ala Ile Asp Ile Ser Cys Ala Ser Asn Val Glu Ile Ser
690 695 700
Met Asn Leu Glu Gly Val Gly Ala Met Glu Asp Ala Asp Tyr Leu Asn
705 710 715 720
Val Tyr Tyr Arg Val Asp Gly Gly Ala Gln Gln Val Ile Ser Glu Asn
725 730 735
Val Asn Ala Phe Ser Glu Lys Thr Val Ser Val Ser Gly Ile Asn Gly
740 745 750
Ser Ser Leu Glu Ile Ile Ala Asn Val Tyr Thr Ser Tyr Gly Ala Glu
755 760 765
Ile Tyr Thr Ile Ser Asp Met Ser Val Lys Ala Asp Ser Gln Thr Ala
770 775 780
Tyr Ser Leu Glu Val Val Ser Val Leu Ala Ser Ala Asp Asp Gly Asn
785 790 795 800
Val Pro Ala Asn Thr Arg Asp Gly Asp Leu Gly Thr Arg Trp Ser Ala
805 810 815
Asn Gly Asp Ser Gln Trp Ile Thr Tyr Asp Leu Gly Ser Ser Lys Thr
820 825 830
Val Thr Asp Val Gly Ile Ala Phe Phe Arg Gly Asp Gln Arg Thr Ala
835 840 845
Phe Ile Ser Ile Glu Thr Ser Thr Asn Asn Ser Thr Trp Gln Thr Val
850 855 860
Tyr Ser Asp Glu Gln Ser Ser Ser Thr Thr Glu Ile Gln Asn Phe Asp
865 870 875 880
Val Thr Asp Ser Asn Ala Arg Tyr Val Arg Ile Ile Gly Tyr Gly Asn
885 890 895
Ser Val Asn Asn Trp Asn Ser Phe Thr Glu Val Glu Ile Ile Gly Arg
900 905 910
<210> 204
<211> 2739
<212> DNA
<213> Microbulbifer degradans
<400> 204
atgaaaaaaa tcctgttttt ggccttaggc cttttaataa gtcaggcttc gtttgcccag 60
cagacgattt caagccttgc cgagtttatt gctcttcaag atggcagcaa tcaaaacata 120
aaaatggccc ccggcaccta tcatatatca tcatcgtcta aatcactgtt ccccggtggg 180
gattggcggg cgaacgtcga aggtaattgg ccgggcctgt ttaagtttag tggcaacaat 240
aatacgtttg acctgaccgg cgttaccttc accttcgact ccacgatctt gttagaaatg 300
cctaatcttg ttcacgcaaa cctcatggag tttgggggtt ccggtaacgt ttggaagggg 360
ctgaatattc aggagaagcc caacagtaag ggcgagtacg gcgcttttat tcatacctct 420
ggcggcacca tcgctgtatt cactggggat aatcacaaag tttctgattt tactttaaag 480
acacgttttt ctcggcccta tggtttaggt tctctctatg gaaaaacggg taactcgtct 540
agcacgctgc ccggcgttcg cctgtccaag aaaacggcca tgtttcttat tagtttagat 600
gatagttact ttgaaaatgt tctgattgat cacagtggct ttggtcatac gcttgccttt 660
aacggtgtcg ataacgtggt attcaatacc gttgaaatca tcgcagaatc tcgctctacc 720
gatgatttgt atgctaacgg cattggcgga acagaccgca atggtgtgcc ttttaacgtc 780
ctctttaatg gcgatgaact cataggaacg aattttgccg atgccgatta ctttcttaac 840
ctgtttgata cggataactt caatcagtgt caaaacatga gcggtggcgt gcaatattcg 900
cccatccgaa aaggttacca atatagcctg actgaagatt cgtttcgcgg gtactacagt 960
gctggttccc ttggcaatat cgagatttat aacgccaccg ttacgggttc aagagcgggc 1020
gtcgtcatgg aatatgcgag cgagggtatg attgtagacg gtatgacggt tagaggtatt 1080
gccgggcatg gcgtgccagc gtgcgacggc gcgtggaatt cagctaacgg tggtgaaggc 1140
gatgcttccg cctatggtcc tccatcaaac agtgtactta agcgtgcgaa agccgatgcc 1200
gcctattcca ctgttttaga aatccccaat tttgtcgata acgtgacggc cgatattgaa 1260
gtgttggacc cgttaaacgg ttacaaccgt ccttcggcat caaacgcttt ggcgcttatt 1320
aaaggtgatg atcacgatat tcgactttgg aaacgtgata accaagcctt aacccgtgac 1380
ttggtggtta aggtgagtga tgcggataat ctattgctct gtaacatgac aaaacaaggc 1440
gtgactgtcg ccagcagcgt gaccaactcc accatttatt ctgttggtag catcagtaat 1500
tcaagtggca gttctaacac ggtggtaaaa ctgtcaagcg cagctgacga gccagcgata 1560
tgtaaagctc ttgagactaa cgaagttata acggattgcg gtgattttga tgcattcgcc 1620
ggtatccaag ccgaagccta ctgcgatatg gctggcgttg aaattgaatc ctccagcgat 1680
gacggtggcg aacaggttgg ttacattaac aacaacgaat ggattgcatt caacgatgtt 1740
gattttggca atggcgcaag cggttttgaa gcgcgcgtta gtagtgctac ctccggtggc 1800
aatattgagc ttcgtttaga cagccaaaca ggcgcgttaa ttggcacgtg ctctgttaat 1860
ggaacgggca gttggacgac ctatgaaacg gtttcttgcg atatcggtgg cgtcagcggc 1920
gtgcaggatt tgtacctcgt gtttaccgga aatagtggct atttaatgaa tgttaattgg 1980
tttaacttca cgcaagctgc aacgaattgt tctctgcctt ggagtgacag tgacttcagc 2040
gttgaaaaag aaatagttaa ctacagttcc ggtgctattg atatttcatg tgcttccaac 2100
gttgaaattt ccatgaatct agaaggtgta ggcgccatgg aagatgccga ttacttgaac 2160
gtttattacc gcgttgatgg cggtgcccaa caagtgatat ctgaaaatgt gaatgcgttt 2220
tctgaaaaaa ccgtatcggt atcaggtatc aatggcagta gcttagaaat tatcgccaat 2280
gtttatacca gctatggcgc agagatttat acgatttctg atatgagtgt taaagctgat 2340
tcgcaaacag cttattctct agaggttgtt tctgtcttag ccagtgcgga cgatggcaat 2400
gtgcctgcca atacccgtga tggcgacctg ggtacacgct ggtcggctaa tggtgattcg 2460
cagtggatta cctatgatct tggcagcagt aaaactgtga ccgatgtggg catcgccttt 2520
tttagaggcg accaacgcac ggcatttatc agcatcgaaa cctcaaccaa taatagtacc 2580
tggcaaaccg tttattctga tgagcagtca agcagcacaa cagagattca aaattttgat 2640
gtgaccgata gcaatgctcg ctatgtaaga attattggct acggaaatag cgtaaataac 2700
tggaacagtt ttacggaggt cgagattatt ggtcgttaa 2739
<210> 205
<211> 541
<212> PRT
<213> Microbulbifer degradans
<400> 205
Leu Ala Leu Ser Ala Ser Leu Thr Gln Ala Ala Thr Ile Ser Asn Ser
1 5 10 15
Gly Phe Glu Ser Gly Phe Asp Gly Trp Thr Asp Thr Asp Pro Ser Ala
20 25 30
Leu Ser Ser Asp Ala Asn Asn Gly Ser Arg Ser Ala Lys Ile Thr Gly
35 40 45
Ser Ala Gly Arg Val Asp Gln Asp Val Ala Val Thr Pro Asn Thr Asn
50 55 60
Tyr Gln Leu Thr Ala Tyr Val Leu Gly Ser Gly Arg Val Gly Val Asn
65 70 75 80
Thr Gly Thr Ala Val Tyr Asp Glu Ala Val Asn Thr Ser Ser Trp Ser
85 90 95
Lys Val Thr Val Asn Phe Asn Ser Gly Ser Ala Asn Ser Val Glu Val
100 105 110
Phe Gly Lys Tyr Asn Ser Gly Thr Gly Arg Phe Asp Asp Phe Ser Leu
115 120 125
Val Glu Thr Gly Thr Pro Thr Pro Thr Pro Thr Pro Thr Pro Thr Pro
130 135 140
Thr Pro Thr Pro Ala Gly Cys Asn Ser Leu Asn Thr Ile Asp Ile Ser
145 150 155 160
Ser Ala Thr Asp Asp Gly Ser His Asp Gly His Gly Pro His Leu Ala
165 170 175
Val Asp Gly Asp Leu Ser Ala Asp Ser Arg Trp Ser Ser Lys Gly Asp
180 185 190
Gly Lys Ala Ile Thr Leu Asp Leu Gly Ala Glu Ala Thr Val Arg Gln
195 200 205
Leu Lys Thr Ala Trp Tyr Lys Gly Asp Ser Arg Thr Ala Tyr Phe Asp
210 215 220
Val Glu Thr Ser Thr Asp Lys Ser Asn Trp Ser Thr Ala Leu Ser Asn
225 230 235 240
Val Gln Ser Gln Gly Ser Thr Gly Leu Lys Ser Asn Ser Ile Asp Asp
245 250 255
Val Thr Ala Arg Tyr Val Arg Ile Val Gly His Gly Asn Ser Ser Asn
260 265 270
Thr Trp Asn Ser Leu Ile Glu Ala Gln Val Leu Gly Cys Ala Gly Thr
275 280 285
Val Thr Pro Thr Pro Thr Pro Thr Pro Thr Pro Thr Pro Thr Pro Thr
290 295 300
Pro Thr Pro Thr Pro Thr Pro Ser Gly Ser Lys Ile Pro Glu Ser Ile
305 310 315 320
Thr Asn Ser Asp Val Trp Asp Leu Glu Gly Glu Asn Pro His Pro Leu
325 330 335
Val Asp Pro Tyr Thr Leu Glu Phe Val Pro Leu Glu Ala Arg Val Thr
340 345 350
Thr Pro Asn Gly Asn Gly Trp Arg His Glu Tyr Lys Ile Ala Ser Ser
355 360 365
Glu Arg Thr Ala Met Thr Ala Thr Tyr Glu Asp Phe Ser Ala Thr Ile
370 375 380
Lys Val Asp Leu Ser Thr Gly Gly Lys Thr Ile Val Ala Gln His His
385 390 395 400
Ala Gly Asp Thr Gly Thr Ile Met Lys Leu Tyr Val Ser Asp Thr Ser
405 410 415
Glu Ser Gly Phe Phe Asp Ser Val Ala Ala Asn Gly Ile Phe Asp Val
420 425 430
Tyr Val Arg Ile Arg Asn Thr Ser Gly Val Glu Glu Lys Lys Pro Leu
435 440 445
Gly Thr Ile Arg Ser Gly Asp Ser Phe Ser Phe His Val Leu Asn Asn
450 455 460
Tyr Gly Val Val Lys Val Ser Ala Phe Gly Lys Asn Leu Glu Thr Glu
465 470 475 480
Val Glu Asp Asp Ser Ala Ser Tyr Leu Lys Phe Gly Asn Tyr Leu Gln
485 490 495
Ser Gln Tyr Pro Gln Gly Ser Lys Asp Cys Gly Ser His Gly Asp Ser
500 505 510
Asp Ser Phe Arg Ala Cys Tyr Glu Asp Ile Gly Ile Thr Glu Ala Lys
515 520 525
Ile Thr Met Thr Asn Val Ser Tyr Thr Arg Ile Thr Lys
530 535 540
<210> 206
<211> 1626
<212> DNA
<213> Microbulbifer degradans
<400> 206
ctcgcgctta gtgcatcttt aacacaggct gcgaccattt ctaactcagg cttcgaaagt 60
ggctttgacg gttggacaga cactgaccct tcggcacttt ctagcgatgc aaataacggc 120
agccgatctg ccaaaattac tggctccgcc ggccgtgtag atcaagacgt tgcggtaacc 180
ccaaacacca actaccagtt aactgcatat gtacttggca gtggacgcgt gggtgttaac 240
accggcactg cagtttatga cgaagccgta aacacaagca gctggagcaa agtaactgtt 300
aactttaact ctggctctgc aaactctgta gaagtgtttg gtaagtacaa tagtggtact 360
ggccgctttg acgatttcag cttagttgag acgggaacac cgactcctac gccaacacct 420
actcctacgc ccacgcctac accagcgggc tgtaacagct taaataccat cgacattagc 480
tctgctaccg atgatggctc tcacgatggt cacggcccac acctagctgt ggacggcgat 540
ttatcagctg attcacgctg gtcttctaaa ggcgacggca aagcgattac tttagactta 600
ggcgccgaag ccacagttcg tcaactaaaa acggcttggt acaaaggtga ctcacgtacc 660
gcttacttcg atgtagaaac gtctaccgat aagagcaact ggtctaccgc tttaagcaac 720
gtgcagtcac aaggtagcac tggccttaaa tcaaacagca ttgacgacgt aacagcgcgt 780
tacgtacgta ttgttggtca cggtaactcg tcgaatactt ggaacagctt gattgaagca 840
caagtattgg gttgtgctgg cactgtaacg ccaacaccta ctcctacgcc tacaccaact 900
ccgaccccaa cgcctactcc tacgccaact ccatctggct caaaaattcc tgaaagcatt 960
acaaacagcg atgtttggga tttggaaggc gagaacccac acccattggt agacccttac 1020
acgttggaat tcgtacctct tgaagcacgc gtaacgactc caaacggtaa cggctggcgc 1080
cacgagtaca aaatcgcttc tagcgagcgt actgcgatga ctgctaccta cgaagatttc 1140
tctgcaacta ttaaagtaga cctatctact ggcggtaaga caattgttgc acagcatcac 1200
gcaggcgaca ctggcaccat catgaaacta tacgtttccg acacaagcga atctggcttt 1260
ttcgatagcg tagcggcaaa cggcattttc gacgtgtatg ttcgtattcg taacaccagc 1320
ggtgttgaag agaagaaacc attgggcaca atccgctctg gcgactcttt cagcttccac 1380
gtacttaaca actacggcgt tgtaaaagta tctgcctttg gtaaaaacct agagacagaa 1440
gtagaagacg attctgcatc ttacttgaag tttggtaact acctacaatc gcaataccca 1500
caaggtagca aagattgtgg ttcacatggc gactccgatt cgttccgtgc ctgctacgaa 1560
gatataggca ttaccgaagc gaaaatcacc atgactaacg tttcttacac gcgtatcact 1620
aagtaa 1626
<210> 207
<211> 1033
<212> PRT
<213> Microbulbifer degradans
<400> 207
Met Gly Glu Ala Pro Val Lys Lys Gln Ala Tyr Thr Thr Thr Pro Val
1 5 10 15
Lys Pro Asn Ala Leu Gly Leu Ala Ile Arg Thr Leu Ala Leu Gly Gly
20 25 30
Leu Ala Ala Gly Leu Val Asn Val Ala His Ala Asp Leu Leu Ser Val
35 40 45
Asn Lys Thr Ala Thr Ala Ser Ser Glu Met Gln Ala Ala Ala Tyr Ala
50 55 60
Phe Asp Asn Asn Gln Asn Thr Arg Trp Glu Ser Ala His Ala Val Asp
65 70 75 80
Pro Thr Ser Ile Ser Val Asp Leu Gly Glu Thr Tyr Asp Leu Asp Ser
85 90 95
Ile Val Val His Trp Glu Ala Ala Asn Ala Ala Ser Tyr Thr Ile Glu
100 105 110
Gly Ser Asp Asn Gly Val Asn Trp Thr Gln Ile Gly Thr Tyr Thr Gly
115 120 125
Gly Thr Phe Gly Asn Arg Thr Asp Thr Val Asn Val Asp Gly Asn Tyr
130 135 140
Arg His Val Arg Leu Asn Gly Thr Gln Arg Ser Asp Gly Asn Ala Trp
145 150 155 160
Gly Tyr Ser Ile Trp Glu Leu Glu Val His Gly Thr Glu Val Thr Glu
165 170 175
Pro Pro Val Val Glu Pro Pro Thr Glu Pro Gly Glu Asn Leu Ala Ile
180 185 190
Tyr Gly Thr Ala Thr Ala Ser Ser Gly Asn Ala Asp Val Ala Ile Asp
195 200 205
Asn Asn Ala Gly Thr Arg Trp Glu Ser Asp His Gly Ile Asp Pro Ser
210 215 220
Ser Phe Thr Leu Asp Leu Gly Ala Thr Tyr Ser Leu Asn Gln Val Val
225 230 235 240
Ile Asp Trp Glu Ala Ala Asn Ala Lys Val Tyr Ala Ile Gln Gly Ser
245 250 255
Asn Asp Gly Thr Asn Phe Thr Thr Leu Ala Asn Tyr Ser Gly Gly Glu
260 265 270
Phe Gly Thr Arg Thr Asp Thr Leu Asn Ile Ala Gly Asp Tyr Arg Tyr
275 280 285
Val Arg Leu Leu Gly Thr Glu Arg Ser Asp Gly Asn Ala Trp Gly Tyr
290 295 300
Ser Ile Trp Glu Phe Lys Val Tyr Gly Gly Gly Lys Thr Glu Pro Pro
305 310 315 320
Val Thr Glu Pro Pro Val Thr Glu Pro Pro Val Val Glu Pro Pro Ile
325 330 335
Phe Thr Asp Leu Asn Tyr Gln Pro Leu Phe Asn Asn Thr His Ser Pro
340 345 350
Asp Thr Ala Gln Glu Trp Tyr Thr Lys Pro Asp Gly Thr Val Val Thr
355 360 365
Ile Ala Ser Gly Arg Ala Arg Ser Arg His Glu Ser Glu Asp Ile Phe
370 375 380
Tyr Thr Phe Pro Thr His Tyr Phe Glu His Arg Thr Phe Glu Ile Glu
385 390 395 400
Ile His Asp His Thr Pro Lys Gly Gln Asn Leu Val Glu Val Phe Tyr
405 410 415
His Pro Glu Tyr Ala Asn Tyr Val Pro Pro Gly Cys Arg Ser Ser Tyr
420 425 430
Ser Asn Val Trp Arg Ala Asp Phe Asn Asn Asn Ala Gly Met Asp Glu
435 440 445
Lys Leu Gln Thr Ala Thr Pro Asp Gly Lys Gly Glu Arg Trp Val Cys
450 455 460
Arg Ile Gln Arg Asp Ala His Asn Gly Asp Asp Gly Ile Leu Asp Val
465 470 475 480
Gly Ser Trp Met Glu Phe Glu Leu Gln Gln Phe Leu Gly Leu Tyr Glu
485 490 495
Gly Asp Pro Asn Val Arg Gly Gln Ala Val Tyr Tyr Thr Asp Thr Tyr
500 505 510
Arg Phe Lys Leu Gly Gln Pro Gly Ile Tyr Ile Val Gly Asp Glu Ala
515 520 525
Met Glu Glu Lys Ile Arg Ala Gly Gly Arg Ala Thr Ala Pro Tyr Val
530 535 540
Lys Gly Gly Asp Ser Val Pro Val Asn Glu Val Ile Ser Val Asn Gly
545 550 555 560
Asp Asn Thr Leu Thr Tyr Lys Val Met Ala Asn Gly Lys Trp Thr Gln
565 570 575
Lys Asp Asn Pro Asn Gly Thr Val Val Thr Phe Pro Ile Arg Asp Gly
580 585 590
Ile Glu Val Tyr Asp Asn Tyr Val Val Ala Ser Gly Val Ala Asp Trp
595 600 605
Thr Thr Tyr Phe Arg Glu Ala Leu Asn Ile Gln Trp Asp Thr His Asn
610 615 620
Ala Phe Met Gln Gly Arg Arg Val Phe His Thr Arg Met Asp Thr Gly
625 630 635 640
Val His Glu Glu Val Gly Asn Pro Asp Phe Pro Glu Leu Ala Asn Ile
645 650 655
Ala Asp Gly Leu Met Val Lys Asn Ser Cys Leu Gly Cys His Val Asn
660 665 670
Asn Gly Arg Gly Ile Ala Pro Gln Asn Gly Ala Leu Leu Asp Thr Leu
675 680 685
Val Val Lys Val Gly Ser Gly Ala Phe Asp Asn Leu Gly Gln Pro Gln
690 695 700
Pro His Ser Tyr Tyr Gly Gly Val Leu Gln Asn Leu Ser Leu Asp Ala
705 710 715 720
Ala Val Pro Ala Glu Gly Ser Val Arg Val Thr Tyr Thr Ala Gln Asn
725 730 735
Gly Thr Phe Asn Asp Gly Thr Gly Tyr Ser Leu Gln Val Pro Thr Tyr
740 745 750
Ser Leu Glu Met Asn Asp Thr Asn Gly Gly Ala Ile Gln His Ile Ser
755 760 765
Pro Arg Met Pro Gln Asn Ile Thr Gly Leu Gly Leu Leu Glu Ala Leu
770 775 780
Pro Glu Asn Glu Ile Leu Ala Trp His Asp Pro Asp Asp Ser Asn Gly
785 790 795 800
Asp Gly Ile Ser Gly Arg Ala Asn Val Val Thr Ser Pro Glu Thr Gly
805 810 815
Gln Gln Phe Ile Gly Arg Phe Gly Trp Lys Ala Ser Ser Ala Ser Leu
820 825 830
Arg Asp Phe Ala Ala Thr Ala Leu Ser Gly Asp Met Gly Val Asn Thr
835 840 845
Ser Val Leu Pro Asn Ala Asp Cys Gly Ala Gln Gln Thr Ala Cys Ile
850 855 860
Gln Asn Ser Gly Gln Gly Val Glu Leu Ser Asp Leu Arg Leu Asn Glu
865 870 875 880
Leu Val Val Tyr Leu Gln Ala Leu Gly Ala Pro Ser Arg Arg Pro Glu
885 890 895
Thr Val Asp Gln Pro Met Val Val Ala Gly Glu Gln Arg Phe Thr Asp
900 905 910
Ile Gly Cys Ala Ser Cys His Arg Pro Glu Met Asn Thr Gly His Lys
915 920 925
His Asp Leu Ala Glu Leu Arg Gly Asn Val Ile Arg Pro Tyr Thr Asp
930 935 940
Met Leu Leu His Asp Met Gly Pro Gly Leu Ala Asp Ser Leu Thr Gln
945 950 955 960
Ala Pro Glu Leu Asn Arg Glu Trp Arg Thr Ala Pro Leu Trp Gly Leu
965 970 975
Gly Met Asn Leu Ala Val Asn Gly His Asp Asn Leu Leu His Asp Gly
980 985 990
Arg Ala Arg Ser Ile Glu Glu Ala Ile Leu Trp His Gly Gly Glu Ala
995 1000 1005
Gln Ala Ser Asn Asn Ala Tyr Lys Ala Leu Ser Ala Gln Gln Arg Ala
1010 1015 1020
Glu Leu Ile Ala Phe Leu Arg Ser Leu
1025 1030
<210> 208
<211> 3102
<212> DNA
<213> Microbulbifer degradans
<400> 208
gtgggagaag cacctgtgaa aaaacaagct tatacaacta caccagttaa accaaatgcg 60
ttaggtttag ccattcgcac actggccctt ggtggcctag ctgcgggact tgtaaacgta 120
gcgcatgcag atttattgtc ggtaaataaa actgctacgg ccagcagtga aatgcaagct 180
gcagcttacg cattcgataa caatcaaaat acgcgctggg aaagtgcgca cgcggttgat 240
ccaaccagca taagtgtaga cttgggcgaa acctacgatt tagatagcat tgtcgttcac 300
tgggaagctg ccaacgctgc aagctacacc atagagggct ccgataacgg tgttaactgg 360
acgcaaatag gcacctatac cggtggtacc tttggcaata gaaccgatac cgtaaacgta 420
gatggtaatt accgtcacgt gcgattaaat ggtactcagc gcagcgatgg caatgcatgg 480
ggttactcca tatgggagct ggaagttcac gggacagaag taaccgaacc acctgtagtg 540
gaacctccta cagagcctgg cgagaatttg gcaatatatg gcacggcaac cgccagctct 600
ggtaacgccg atgtagctat tgataataac gctggtacgc gctgggagag tgaccacggc 660
atcgaccctt ctagttttac tttagaccta ggcgcaacct acagcttgaa ccaagtggtt 720
atagattggg aggccgccaa cgccaaggtg tatgctatcc aaggctctaa cgatggcact 780
aactttacta cccttgctaa ttacagcggc ggtgaattcg gcacgcgtac cgatacatta 840
aatattgctg gtgattatcg ctatgtacgc ctgttgggta ctgagcgcag cgatggcaac 900
gcatggggct attctatttg ggaatttaaa gtatatggcg gtggaaaaac tgagccaccg 960
gttaccgagc cacctgtgac agaaccgcca gtggtagagc cccccatttt taccgaccta 1020
aattatcagc cactgtttaa caacacacac agcccagata ctgcgcaaga gtggtacacc 1080
aagcccgacg gtacagtggt aacaattgca agtggccgcg cgcgttcacg tcacgaatcg 1140
gaagatattt tttacacttt cccaacgcat tattttgagc accgtacatt tgaaatagaa 1200
attcacgacc atacccctaa gggccaaaat ttggtggagg ttttctacca cccagaatac 1260
gccaactatg tgccgccggg ttgtcgctct tcttacagca atgtttggcg tgcagacttt 1320
aacaacaacg ccggcatgga tgaaaaactg caaaccgcca ccccagacgg taaaggtgag 1380
cgttgggtat gccgcataca gcgcgacgcg cacaatggcg acgacggcat tttagatgtg 1440
ggttcatgga tggagtttga attacaacaa ttcttaggct tatatgaagg cgaccctaat 1500
gtgcgtggcc aggccgttta ctacaccgat acataccgtt ttaaattagg ccagcccggt 1560
atttatattg tgggcgatga agccatggag gaaaaaattc gtgcgggtgg ccgcgctaca 1620
gcgccttatg taaaaggcgg cgactcggta cctgttaacg aagttatttc ggttaatggc 1680
gacaacacac ttacctacaa agtaatggca aatggtaagt ggacgcaaaa agataacccc 1740
aacggcactg tagttacttt ccctattcgc gatggcattg aggtatacga taactacgta 1800
gtggcgagtg gcgtggccga ttggaccacc tatttccgcg aagcgttaaa tattcagtgg 1860
gatacgcaca acgcatttat gcaggggcgg cgcgtgtttc atacgcgtat ggataccggc 1920
gtacacgaag aagtgggtaa cccagacttt cccgaattgg caaatattgc cgatggttta 1980
atggtgaaaa attcgtgctt gggttgtcat gtaaacaacg gccgcggtat tgctccgcaa 2040
aacggcgcgt tattagatac gctagtcgtt aaagtaggct cgggtgcgtt cgataacctt 2100
ggccagccac aaccacacag ttattacggt ggtgtactgc aaaacctatc actagatgct 2160
gctgtgcctg cagagggttc ggttcgtgta acttacaccg cgcaaaacgg aacctttaac 2220
gacggcacag gctacagctt gcaagtgcct acctattcgt tagaaatgaa cgataccaac 2280
ggcggggcta ttcagcatat ttcaccgcgc atgccacaaa acataactgg tttaggctta 2340
ttagaagcct tacccgaaaa tgaaattttg gcatggcacg acccagatga tagcaatggc 2400
gatggtattt ctggccgcgc taatgttgtt acctcaccag aaacggggca acagtttata 2460
ggtcgctttg gctggaaagc gtcatcagca agtttgcgtg attttgctgc aacggcttta 2520
agtggcgata tgggtgtaaa tacctctgtg ttacccaatg cagattgtgg cgcacaacaa 2580
accgcctgta ttcaaaatag cggtcagggc gtagagctaa gcgacctgcg tttgaatgaa 2640
ctggtggttt acttgcaagc gctgggcgcg ccttcgcgca ggccagaaac tgtagaccag 2700
ccaatggtag tggccggtga gcagcgtttt accgatatag gctgtgcctc gtgtcaccga 2760
ccagaaatga acaccggtca taagcacgat ttagctgagt tgcgtggcaa cgttattcgc 2820
ccctataccg atatgctgtt acacgatatg ggcccaggtt tggcagatag cttaactcaa 2880
gcgccagagc taaatcgcga atggcgtaca gcgccgctgt ggggcttggg catgaacttg 2940
gctgtaaacg gtcacgacaa cttgctccac gatggtcgcg cacgcagtat tgaagaagcc 3000
attttatggc acggtgggga agcgcaagcc agtaataatg cctataaagc actaagtgcg 3060
cagcagcgag cagagttaat agcgttttta cgctcactct aa 3102
<210> 209
<211> 557
<212> PRT
<213> Microbulbifer degradans
<400> 209
Met Leu His Ile Ser Val Gln Pro Asn Leu Thr Arg Ser Ile Ser Gly
1 5 10 15
Phe Leu Leu Gly Leu Ala Gly Val Val Ser Gly Ser Ala Tyr Ala Gln
20 25 30
Trp Asn Pro Ala Pro Asn Trp Glu Asp Ser Tyr Ser Val Asn Gly Val
35 40 45
Cys Tyr Cys Asn Ser Ser Asn Tyr Asp His Gly Leu Ser Ala Lys Thr
50 55 60
Ala Pro Thr Pro Ile Gly Glu Leu Asn Val Val Asp Ile Cys Thr Asp
65 70 75 80
Ile Lys Ala Val Leu Gly Glu Gly Ala Thr Asn Gly Arg Ile Pro Phe
85 90 95
Asn Asp Ile Gln Cys Gly Asn Gly Pro Ala Asn Asp Ala Ala Asp Glu
100 105 110
Ala Gly Cys Pro Gly Arg Val Asp Ile Gly Ser Ala Gly Cys Asp Val
115 120 125
Ile Gly Pro Lys Trp Asp Leu Val Ser Val Tyr Gly Pro Trp Pro Asp
130 135 140
Gly Gly Leu Asn Arg Asp Ala Trp Glu Val Ser Ala Ser Asn Gly Ser
145 150 155 160
Gly Ser Ala Gln Leu Ala Leu Asp Gly Leu Ala Ser Thr Arg Trp Ala
165 170 175
Thr Gly Val Phe Gln Ser Pro Gly Gln Tyr Phe Asp Ile Asp Phe Gln
180 185 190
Asp Ala Leu Thr Phe Asp Ser Ile Val Leu Ala Thr Thr Glu Asn Pro
195 200 205
Glu Asp Tyr Pro Arg Ala Tyr Glu Val Tyr Ile Ser Ser Asn Gly Ser
210 215 220
Asp Trp Gly Val Pro Val Val Thr Gly Ala Gly Asn Gly Ser Thr Thr
225 230 235 240
Thr Ile Glu Leu Asp Thr Val Thr Thr Arg Tyr Leu Arg Ile Leu Gln
245 250 255
Thr Gly Ser Ser Ser Asn Arg Trp Trp Ser Ile His Glu Phe Asn Ile
260 265 270
Phe Asn Ser Gly Pro Ile Pro Pro Glu Pro Glu Tyr Pro Ala Leu Asp
275 280 285
Arg Ser Asp Trp Thr Val Ser Ala Ser Val Asn Gly Ala Glu Ala Glu
290 295 300
Phe Ala Ile Asp Ser Ser Pro Asn Thr Arg Trp Asp Thr Ala Gln Ser
305 310 315 320
Gln Arg Ala Gly Gln Ser Phe Glu Val Asp Leu Gly Glu Leu Asn Thr
325 330 335
Leu Ala Ala Ile Glu Leu Asp Ser Ala Gly Ser Ala Asn Asp Tyr Pro
340 345 350
Arg Gly Tyr Ala Val Tyr Val Ser Asn Asp Gly Ser Asn Trp Gly Ser
355 360 365
Ala Ile Ala Ser Gly Ala Ala Val Ser Ala Ser Thr Thr Ile Glu Phe
370 375 380
Thr Pro Val Ser Ala Arg Phe Val Lys Ile Glu Gln Thr Gly Gly Asp
385 390 395 400
Gly His Tyr Trp Trp Ser Ile His Asn Leu Asn Ile Phe Gly Glu Pro
405 410 415
Ser Asp Asn Pro Gln Pro Thr Ile Glu Leu Leu Asp Ser Thr Pro Trp
420 425 430
Ser Leu Ala Ala Asn Arg Arg Asn Ser Ala Ala Gly Asn Ala Ile Asp
435 440 445
Asn Asn Gln Ser Thr Arg Trp Thr Thr Gly Gln Thr Gln Arg Asp Gly
450 455 460
Gln Thr Phe Glu Ile Asp Leu Ser Thr Val Gln Thr Phe Ser Arg Ile
465 470 475 480
Val Leu Asp Ser Ala Ala Ser Asp Asp Asp His Pro Arg Asn Tyr Glu
485 490 495
Leu Tyr Val Ser Asn Asp Gly Ser Asn Trp Gly Ser Pro Val Ala Thr
500 505 510
Gly Ala Gly Asp Ser Ser Gly Val Thr Val Ile Asp Phe Pro Ser Val
515 520 525
Thr Ala Arg Tyr Val Leu Ile Ala Gln Ala Gly Ser Asp Ser Ser His
530 535 540
Trp Trp Ser Ile His Glu Leu Ser Ile Tyr Asn Val Gln
545 550 555
<210> 210
<211> 1674
<212> DNA
<213> Microbulbifer degradans
<400> 210
atgctacaca tctctgttca accaaatctt acccgctcta tttctgggtt cttgttgggc 60
ctagcgggcg ttgtctcggg ctcggcttac gcgcagtgga acccagcacc caactgggag 120
gacagctatt ctgtgaatgg agtttgctac tgtaattcca gtaattatga tcatgggcta 180
agcgccaaaa cagcacctac tcccattggc gagcttaacg ttgtcgatat atgtaccgac 240
atcaaagccg tgcttggtga gggggcaacg aatggacgta ttccatttaa cgacattcag 300
tgtggtaacg ggcccgccaa cgatgctgcc gatgaagcgg ggtgcccagg ccgagtggat 360
attggctcgg cagggtgtga tgtgattggg cctaaatggg atttggttag cgtttacggg 420
ccatggccag atggcgggct taatagggat gcttgggaag tgagtgcctc aaacggttct 480
ggtagcgcgc aactggcgct tgatggcttg gcgagtacgc gctgggcgac gggtgtgttt 540
caatcgccag gtcagtattt tgatattgat tttcaagacg cgctaacttt cgattcaata 600
gtacttgcta ccaccgagaa cccagaagat tacccaaggg cttacgaggt ttacatttca 660
tcaaatggca gcgactgggg agtaccagtg gtgacgggag ccggtaacgg ctccactaca 720
actatcgagc tagatactgt taccactcgc tacttaagaa ttttacaaac cggcagctca 780
tctaaccgtt ggtggtccat tcatgaattt aacatattca attctgggcc aattccgcca 840
gagccagagt acccggcatt agatagaagt gactggacag tttctgcatc ggtaaatggc 900
gcagaagccg aattcgctat cgactcttca cccaatacgc ggtgggatac agcacaaagc 960
caacgagcag ggcagagttt tgaggtagat ttaggagaac tcaacacgct agcggccatt 1020
gagctggatt cagcaggcag tgccaatgac tatccgcggg gttacgccgt ttatgtatcg 1080
aacgatggct caaactgggg gagcgctatc gcttctggcg ctgcagtaag cgcgtccaca 1140
acaatagaat ttactccagt aagcgctcgc ttcgttaaaa tagagcagac gggaggtgac 1200
ggacactatt ggtggtctat acacaactta aatatttttg gtgagccttc agataaccca 1260
caaccgacta tagagctact cgatagcact ccctggtctc tagccgcgaa caggcgaaat 1320
agcgcggctg gcaatgccat agataataac cagtctacac gatggaccac gggccagacc 1380
cagcgcgatg gtcagacgtt cgaaattgac ctgagcactg tgcaaacctt tagccgtatt 1440
gttttagatt ccgcagcgag tgacgatgat caccctcgca attatgaact ttatgtctct 1500
aacgatggtt ctaattgggg cagcccagtg gcaaccggtg caggagatag tagcggagta 1560
acggtaatag atttccccag cgtgaccgct cgctatgtac ttatagctca ggctggctcg 1620
gatagctcac actggtggtc cattcacgag ttgagtattt acaacgtaca ataa 1674
<210> 211
<211> 674
<212> PRT
<213> Microbulbifer degradans
<400> 211
Met Lys Ile Ile Arg Leu Leu Val Leu Cys Leu Gly Val Val Ser Ser
1 5 10 15
Val Val Ala Val Ala Ser Ser Ser Glu Pro His Asp Glu Gly Asn Ala
20 25 30
Leu Glu Ser Ala Leu Trp Asp Thr Leu Thr Ile Pro Val Cys Trp Glu
35 40 45
Asn Pro Glu Ala Phe Pro Val Glu Glu Gln Ala Trp Val Arg Lys Ala
50 55 60
Val Glu Arg Thr Trp Glu Gln Glu Ser Leu Val Arg Phe Thr Gly Trp
65 70 75 80
Gly Glu Cys Gly Ala Asn Asp Asp Gly Ile Arg Ile Leu Val Asp Asp
85 90 95
Val Gly Pro His Val Lys Gln Leu Gly Ser Arg Leu Asp Gly Tyr Val
100 105 110
Asn Gly Met Val Leu Asn His Thr Phe Gln Asn Trp Gly Thr Ser Cys
115 120 125
Ser Tyr Arg Arg Gln Tyr Cys Ala Glu Val Ile Ala Val His Glu Phe
130 135 140
Gly His Ala Leu Gly Phe Ala His Glu Gln Asn Arg Asp Asp Thr Asp
145 150 155 160
Asp Met Cys Ala Ala Glu Ala Gln Gly Thr Asp Gly Asp Ile Tyr Val
165 170 175
Gly Ala Trp Asp Leu Asp Ser Val Leu Asn Tyr Cys Asn Pro Glu Trp
180 185 190
Asn Gly Ala Gly Asn Leu Ser Asp Thr Asp Ile Glu Met Val Gln Leu
195 200 205
Phe Tyr Gly Gln Pro Ile Glu Thr Ser Gly Asn Glu Pro Val Ala Ile
210 215 220
Cys Ser Val Ser Ala Ser Ser Ser Asp Gly Asn Ile Ala Glu Asn Thr
225 230 235 240
Leu Asp Gly Asp Tyr Ala Thr Arg Trp Ser Ala Asn Gly Asp Gly Glu
245 250 255
Trp Ile Gln Phe Asn Leu Cys Glu Ser Gln Val Ala Asp Arg Val Glu
260 265 270
Leu Ala Trp Tyr Lys Gly Asp Thr Arg Ser Ser Thr Phe Ser Ile Glu
275 280 285
Tyr Ile Thr Thr Asp Gly Tyr Val Trp Tyr Thr Thr Pro Val Arg Arg
290 295 300
Tyr Ser Ser Gly Ala Ser Leu Gly Leu Glu Ser Ala Ser Phe Thr Ala
305 310 315 320
Ala Glu Ile Gln Ser Leu Arg Ile Thr Gly Tyr Gly Asn Ser Ser Asn
325 330 335
Thr Trp Asn Ser Ile Thr Glu Ala Val Ile Tyr Thr Pro Ser Ala Gly
340 345 350
Leu Asp Tyr Pro Asn Leu Val Ala Pro Thr Asp Val Ala Ala Thr Val
355 360 365
Thr Gly Gln Ser Gln Ile Thr Ile Ser Trp Ala Asp Thr Asn Ser Glu
370 375 380
Glu Glu Ser Tyr Phe Val Glu Ala Arg Ile Gly Gly Asn Asp Phe Phe
385 390 395 400
Ala Ile Gly Leu Thr Glu Ala Asn Ala Thr Ser Tyr Thr His Thr Asp
405 410 415
Leu Ile Ala Glu Gly Leu Tyr Glu Tyr Arg Val Thr Ala Val Ser Gly
420 425 430
Ile Ile Arg Ser Asn Phe Ala Ala Ala Ser Val Tyr Phe Gln Pro Ala
435 440 445
Gly Gly Ser Gln Ala Asn Leu Val Arg Pro Glu Asn Phe Thr Val Thr
450 455 460
Ala Asn Ala Gln Gly Gln Ile Ala Leu Ser Trp Val Asp Val Ser Glu
465 470 475 480
Gly Glu Glu Gly Tyr Thr Leu Glu Tyr Lys Leu Ser Ser Glu Ala Glu
485 490 495
Phe Thr Val Ile Glu Leu Ala Ala Asn Ala Asp Ser Ala Ile Val Ser
500 505 510
Asp Leu Ala Ala Gly Ser Tyr Tyr Phe Arg Val Ser Ser Tyr Phe Gly
515 520 525
Asn Asn Ala Ser Glu Tyr Ser Glu Leu Leu Val Thr Val Phe Ala Ser
530 535 540
Glu Ala Thr Ile Val Pro Val Asp Val Tyr Ala Ser Ser Asp Asp Gly
545 550 555 560
Asn Val Ala Ser Asn Val Phe Asp Asn Asp Tyr Ser Thr Arg Trp Ser
565 570 575
Ala Phe Gly Val Gly Glu Asn Leu Thr Ile Ala Leu Gly Asp Asn Tyr
580 585 590
Leu Val Thr Asp Met Arg Ile Ala Trp Tyr Lys Gly Asp Gln Arg Gln
595 600 605
Thr Arg Phe Gln Val Glu Val Ser Asp Asp Asn Gln Thr Trp Val Gln
610 615 620
Val Phe Asp Gly Ile Asn Ser Gly Glu Ser Leu Ser Leu Glu Thr Thr
625 630 635 640
Phe Gly Gly Asp Tyr Arg Ala Ser Tyr Ile Arg Ile Ile Gly Leu Gly
645 650 655
Asn Glu Phe Asn Asn Trp Asn Ser Ile Thr Glu Val Ser Ile Lys Gly
660 665 670
His Leu
<210> 212
<211> 2025
<212> DNA
<213> Microbulbifer degradans
<400> 212
gtgaaaatca tacggttgct tgttttgtgt ttgggggtag taagttctgt ggttgcggta 60
gctagtagct cagagccaca tgatgaaggt aatgcattag agagtgcatt gtgggatacg 120
ttaactattc ctgtgtgctg ggagaatcca gaggcttttc ctgtagaaga gcaagcttgg 180
gtaagaaaag cggttgagcg tacttgggag caggagtcat tagtccgctt caccggttgg 240
ggcgagtgtg gggcgaacga tgatgggata cgtattttag tggatgatgt tgggccgcat 300
gttaagcagc taggctcgcg tttggatggc tatgttaatg gcatggtact aaaccataca 360
ttccaaaatt ggggcaccag ctgcagttac cgtcgtcaat attgtgcaga agtgattgca 420
gttcatgagt ttggtcatgc tctaggcttt gcccatgaac aaaacagaga tgacaccgat 480
gatatgtgcg cggccgaggc gcagggtact gacggcgaca tttatgtcgg cgcctgggac 540
ttggattcgg ttttaaacta ttgcaaccct gagtggaatg gcgcgggcaa tctaagcgat 600
accgatatag agatggttca gctgttttac ggtcagccca ttgaaacgtc tggtaacgag 660
cctgtagcta tttgtagtgt tagcgccagt tcaagcgatg gcaatattgc agaaaatacg 720
ctagacggcg attacgctac gcgttggtct gcaaacggcg acggcgagtg gatacaattt 780
aatttgtgtg aaagccaagt cgcggatcgt gtcgagttag cttggtataa gggagatact 840
cgctcaagta ccttctctat tgagtatata actaccgatg gttacgtttg gtataccacg 900
cctgttcgtc gatattcttc cggtgcttcg ctaggcttgg aatctgcgtc atttactgca 960
gctgaaattc agtccttgcg tattactggc tacggcaact cgtctaacac ttggaatagt 1020
attaccgaag cggtaattta tactccaagc gctgggctag attaccctaa cttagttgca 1080
cctacagatg tggcggctac ggtaacaggg caatcgcaaa ttactatttc atgggcagat 1140
actaatagcg aagaagaaag ctattttgtt gaggcgcgta ttggtggcaa cgattttttc 1200
gctataggtc ttactgaggc gaacgcaaca tcttatacgc ataccgacct tattgctgaa 1260
ggcctttacg agtatcgtgt tactgccgta agcggcatta ttcgctcaaa ctttgcagct 1320
gcaagtgtat attttcaacc tgcaggcggc tcgcaggcta atcttgtgcg cccagaaaac 1380
ttcactgtaa cagctaatgc gcaagggcaa atagcactaa gttgggtgga tgtaagtgaa 1440
ggcgaagagg gttatacctt agagtataag cttagtagtg aagctgaatt tactgtaatt 1500
gagctagctg cgaatgccga ttctgcaatt gttagcgatt tggctgcggg tagttactat 1560
tttcgtgtaa gcagttattt cggcaacaat gcttctgaat acagcgagct tttggttact 1620
gtgtttgcat cagaagcgac tattgtgccg gttgatgttt atgcatctag tgatgatggc 1680
aatgtggcaa gcaatgtttt tgataatgat tactctacgc gatggtcggc ttttggtgta 1740
ggtgagaatt taaccattgc gctgggggat aattatctcg ttaccgatat gcgcatagcg 1800
tggtacaaag gtgatcaacg tcaaacccgc tttcaggtag aggtgagtga tgataaccaa 1860
acctgggtgc aagtgtttga tgggattaac tcgggtgagt cgttaagttt agagacgacg 1920
tttggcggtg attatcgcgc aagctatatt cgaattatcg gtttgggtaa tgagtttaat 1980
aactggaaca gtattactga ggtaagcatt aaagggcacc tctag 2025
<210> 213
<211> 818
<212> PRT
<213> Microbulbifer degradans
<400> 213
Met Ile Lys Leu Ser His Leu Lys His Cys Arg Lys Phe Cys Ile Ser
1 5 10 15
Leu Leu Cys Ala Leu Gly Met Ala Asn Ala His Ala Ala Leu Asn Val
20 25 30
Thr Ala Ser Ala Asp Asp Gly Asn Val Pro Ala Asn Thr Leu Asp Asp
35 40 45
Asn Ile Asp Thr Arg Trp Ser Ala Asn Gly Ser Gly Gln Trp Ile Glu
50 55 60
Tyr Asp Leu Gly Ala Thr His Thr Val Asp Ala Val Gln Ile Ala Phe
65 70 75 80
Phe Arg Gly Asp Val Arg Asp Ala Thr Ile Asp Ile Gln Val Ser Asn
85 90 95
Asp Gly Gly Asn Trp Gln Thr Leu Phe Ser Gly Thr Pro Pro Thr Arg
100 105 110
Thr Leu Ala Gln Gln His Phe Glu Leu Asp Asp Thr Ser Ala Arg Tyr
115 120 125
Val Arg Ile Val Gly Tyr Gly Asn Ser Gln Asn Asn Trp Asn Ser Ile
130 135 140
Thr Glu Phe Asp Val Val Thr Leu Ala Ser Gly Glu Asn Ile Ala Leu
145 150 155 160
Gly Lys Ala Thr Ser Gln Ser Ser Thr Gly Tyr Glu Gly Val Ser Ser
165 170 175
Arg Ala Val Asp Gly Asn Thr Asn Gly Asn Trp Asn Gln Gly Ser Ile
180 185 190
Thr His Thr Asn Asn Glu Tyr Gln Pro Trp Trp Gln Val Asp Leu Gly
195 200 205
Ser Val Arg Ser Ile Asp Gln Val Asn Leu Trp Asn Arg Thr Asn Cys
210 215 220
Cys Ser Ser Arg Leu Ser Ala Phe Tyr Val Leu Val Ser Asp Val Pro
225 230 235 240
Phe Thr Ser Gln Thr Leu Ser Gly Ala Leu Ser Gln Ala Gly Val Ser
245 250 255
Ala Tyr Tyr Phe Asn Asp Thr Ala Gly Ser Pro Thr Glu Ile Asn Ile
260 265 270
Asp Arg Thr Gly Arg Tyr Val Arg Val Gln Leu Ser Gly Thr Asn Pro
275 280 285
Leu Ser Leu Ala Glu Val Glu Val Ile Glu Gly Ser Glu Ile Val Pro
290 295 300
Pro Ala Pro Thr Gly Pro Asp Ala Ser Trp Thr Tyr Cys Ala Ala Glu
305 310 315 320
Arg Glu Gln Cys Ala Phe Ser Asn Ile Lys Glu Val Ala Tyr Gly Ala
325 330 335
Gly Asp Ser Trp Asn Tyr Ser Val Glu Leu Asp Gly Val Thr Cys Asn
340 345 350
Asn Thr Asn Leu Gly Asp Pro Val Arg Gly Thr Val Lys Ser Cys Trp
355 360 365
Val Arg Asn Ala Gln Gln Asn Tyr Val Ala Val Arg Asn Leu Ala Glu
370 375 380
Leu Gln Asn Ala Ile Ser Asn Ser Asn Gln His Ile Arg Met Lys Arg
385 390 395 400
Gly Val Tyr Glu Ala Thr Ala Leu Met Ser Asp Asn Thr Thr Val Phe
405 410 415
Arg Phe Asp Gly Ala Asn Asn Val Leu Asp Phe Thr Gly Val Thr Ile
420 425 430
Gln Val Pro Thr Lys Leu Leu Asn Ser Met Ser Thr Gln Pro Ile His
435 440 445
Ser Gln Val Thr Tyr Asp Val Met Gly Asp Asn Ile Thr Phe Leu Asn
450 455 460
Gly Thr Phe Glu Asn Thr Tyr Pro Asn Gly Gln His Asp Val Thr Asp
465 470 475 480
Phe Thr Ala His Asn Lys Asn Pro Asp Tyr Trp Pro Ala Arg Gln Met
485 490 495
Thr Glu Phe Arg Val Trp Gly Asn Gly Val Gln Phe Leu Asn Asn Thr
500 505 510
Ile Thr Val Arg Gly Ser Tyr Pro Tyr Gly Tyr Gly Asp Met Leu Gly
515 520 525
Lys Gly Ala Gly Ser Ala Val Tyr Leu Arg Lys His Ala Gly Val Gln
530 535 540
Ile Ser Gly Asp Asn Val Leu Ile Asp Gly Met Lys Leu Thr Val Leu
545 550 555 560
Ala Phe Gly His Gly Ile Phe Met Gln Gly Ala Asp Asn Thr Val Ile
565 570 575
Lys Asn Ser Val Val Gln Gly Arg Met Arg Leu Gly Ala Asp Met Tyr
580 585 590
Asn Asp Gly Pro Asp Ser Leu Met Gly Pro Phe Asn Phe Glu Gln Gln
595 600 605
Tyr Pro Asp His Phe Val Gly Leu Pro Ile Val Arg Asp Leu Met Tyr
610 615 620
Asn Leu Thr Glu Asp Gly Ile Arg Ala Tyr Thr Gln Gly Thr Lys Leu
625 630 635 640
Asp Gly Ser Val Val Arg Thr Gly Ala Ile Thr Val Glu Asp Thr Lys
645 650 655
Val Ile Asn Met Arg Gly Cys Ile Thr Thr Pro Leu Ala Ser Lys Pro
660 665 670
Ser Tyr Ile Lys Asn Val Glu Ile Gln Gly Cys Ser Val Gly Tyr Ala
675 680 685
Leu Ala Asn Asn Ser Asp Val Ile Asn Ser Arg Gly Asp Ala Gly Tyr
690 695 700
Gly Pro Leu Leu His Ser Thr Tyr Asp Thr Arg Asn Asn Ala Asn Val
705 710 715 720
Glu Ile Thr Val Thr Asn Ile Pro Ser Thr Gly Ser His Ala Phe Ala
725 730 735
Tyr Ile Ala Gly Ser Gly His Asn Ile Thr Phe Leu Ser Asp Gly Ser
740 745 750
Asn Pro Asp Val Pro Arg Glu Ile Arg Val Gly Asp Thr Gly Asp Arg
755 760 765
Trp Ala Gly Asp Pro Ser Tyr Gln Asp Ala Ala Asn Ile Leu Leu Ile
770 775 780
Asn Glu Thr Asn Gln Pro Val Val Leu Thr Glu Thr Ser Arg Asn Ile
785 790 795 800
Thr Gly Glu Ser Val Gly Glu Val Thr Asp Asn Gly Thr Gly Asn Ser
805 810 815
Leu Lys
<210> 214
<211> 2457
<212> DNA
<213> Microbulbifer degradans
<400> 214
atgataaaac tatctcacct caaacactgc aggaaatttt gtatttcttt gctttgcgcg 60
ctgggcatgg ctaatgctca cgccgcatta aatgtaactg catctgcgga tgacggcaat 120
gtgcccgcga ataccttaga tgacaatata gatacgcgct ggtcggcgaa tggatctggg 180
cagtggattg aatatgattt gggcgcgaca cacactgtgg atgctgtgca aatagcattt 240
tttcgcggag atgtgcgcga tgcaaccatc gacattcaag tgtcgaacga tggcggcaat 300
tggcaaacac ttttttcggg tacgccacca acccgaacct tagcgcagca acattttgag 360
ttggatgata cttctgcgcg ctatgtgcgt attgtaggtt atggtaacag ccaaaataat 420
tggaacagta ttaccgagtt cgatgtggtt acgcttgcaa gtggagaaaa tattgctctt 480
ggtaaagcta catcccaatc atccactggc tatgagggtg tatctagccg tgcggtagat 540
ggcaacacca atggtaactg gaatcaaggc tcgattaccc acaccaataa cgagtatcaa 600
ccttggtggc aagtggatct aggctcagtt agatctatcg accaagttaa cttgtggaat 660
cgtaccaact gctgcagctc gcgtttatcg gcgttttatg tgttggtgtc cgatgtgccc 720
tttacatcgc aaaccttaag cggtgcgctt agtcaagcgg gtgtaagtgc ttattatttt 780
aatgatactg cgggcagccc aaccgaaata aatatagatc gcacaggtcg ctatgtgcgt 840
gtacaacttt ctggcacaaa cccattaagc ctagcggaag ttgaagttat tgaaggcagc 900
gaaattgttc caccagcacc aaccggccca gatgcatcat ggacttattg tgccgccgag 960
cgcgagcaat gtgcattttc taatataaaa gaagtggcct acggcgctgg cgatagctgg 1020
aactactctg tcgagctaga cggcgtaact tgtaataata ctaacttggg tgaccccgtt 1080
cgcggcacgg ttaaatcgtg ctgggtacgc aacgcacaac aaaattatgt ggcggtgcga 1140
aaccttgctg aattgcaaaa tgctatttcc aatagcaatc aacacattcg catgaagcgc 1200
ggtgtgtatg aagctacagc gttaatgtct gataacacca ccgtatttcg attcgacggt 1260
gcgaataatg tattggattt taccggtgta accattcagg tacccactaa actcttaaac 1320
agtatgagta cccagcctat tcactcgcaa gtaacctacg atgtgatggg cgataacatt 1380
acttttttaa acggtacttt cgaaaatacc tatccgaatg gccagcacga tgtaacagat 1440
tttaccgccc ataataaaaa cccagactat tggcccgcgc gtcaaatgac ggagtttaga 1500
gtgtggggga acggcgtaca gtttttaaat aacaccatta ctgtgcgcgg ttcatacccg 1560
tatggctacg gtgatatgtt aggtaagggc gccggttctg cagtgtattt acgcaaacat 1620
gctggtgtac aaatttctgg cgacaatgta ttaattgacg gcatgaaatt aaccgtactt 1680
gcattcggcc acggcatttt tatgcagggt gcagataaca cagtaattaa aaactctgtt 1740
gtacaagggc gtatgcgctt aggcgccgat atgtataacg acgggccaga ctccttaatg 1800
gggccgttta attttgagca gcaataccca gatcactttg tgggcctgcc aattgtgcgc 1860
gacctaatgt acaaccttac cgaagatggc atacgcgctt atacacaagg caccaaatta 1920
gatggtagtg tggtgcgtac tggtgcaatt actgtagaag acaccaaagt aattaatatg 1980
cgcggttgta ttactacgcc attggcttct aagccaagtt atataaaaaa tgtagaaatt 2040
caaggttgta gtgtgggtta cgcgctggcg aataatagcg atgtaatcaa ttcgcgcggt 2100
gatgcaggct acggtccgct attgcactct acgtacgata cgcgcaacaa cgccaatgtc 2160
gaaataaccg ttaccaatat tccatctacc ggctcacacg cttttgcgta tattgcaggc 2220
tctggtcata acattacctt cctaagtgat ggcagcaacc cagatgtgcc gcgcgaaata 2280
cgcgtaggcg atactggcga ccgctgggct ggcgacccaa gctatcaaga tgcagcaaat 2340
atattactca taaacgaaac caaccaaccg gtggtgttaa ctgaaaccag tagaaatatt 2400
actggtgaaa gtgtgggtga ggtgactgat aatgggacgg gaaatagctt gaaatag 2457
SEQUENCE LISTING
<110> TAYLOR, LARRY EDMUND
WEINER, RONALD M.
HUTCHESON, STEVEN WAYNE
EKBORG, NATHAN A.
HOWARD, MICHAEL
<120> ENZYME SYSTEMS FOR SACCHARIFICATION OF PLANT CELL WALL POLYSACCHARIDES
<130> 108172-00124
<140> 11/519,104
<141> 2006-09-12
<150> 11/121,154
<151> 2005-05-04
<150> 60/567,971
<151> 2004-05-04
<160> 214
<170> PatentIn version 3.3
<210> 1
<211> 1167
<212> PRT
<213> Microbulbifer degradans
<400> 1
Met Thr Ile Lys Arg Trp Pro Phe Asp Arg Lys Gly Pro Pro Lys Lys
1 5 10 15
Pro Asn Ala Lys Lys Leu Leu Ala Ser Leu Ala Ala Ala Leu Ser Leu
20 25 30
Thr Ala Met Gln Ser Thr Ala Ala Val Glu Pro Leu Gln Thr Ser Gly
35 40 45
Asn Gln Ile Leu Val Gly Asn Gln Ala Lys Ala Leu Gly Gly His Ser
50 55 60
Leu Phe Trp His Asn Val Pro Ala Ala Gly Ser Leu Tyr Asn Ala Asp
65 70 75 80
Thr Val Ser Arg Leu Lys Asn Asp Trp Asn Ser Lys Val Ile Arg Ala
85 90 95
Ala Ile Gly Val Glu Val Pro Phe Asn Ser Glu Asn Thr Tyr Ile Gly
100 105 110
Asn Lys Gly Ser Ser Leu Ala Ala Ile Asp Arg Val Val Asn Ala Ala
115 120 125
Val Ala Asn Asp Met Tyr Val Ile Ile Asp Phe His Thr His His Ala
130 135 140
Asp Gln Val Glu Asn Val Ala His Asp Phe Phe Asn Glu Val Ser Ser
145 150 155 160
Arg Tyr Gly His Leu Asn Asn Val Ile Tyr Glu Val Phe Asn Glu Pro
165 170 175
Glu Trp Cys Gly Glu His Gly Arg Trp Ala Ser Thr Ile Lys Pro Tyr
180 185 190
Ala Glu Arg Val Ile Gln Thr Ile Arg Asn Asn Asp Pro Asp Asn Leu
195 200 205
Val Ile Val Gly Thr Thr Cys Phe Ser Gln Asp Val Asp Val Ala Ala
210 215 220
Ala Asp Pro Ile Asn Asp Val Asn Val Ala Tyr Thr Leu His Phe Tyr
225 230 235 240
Ala Ala Thr Pro Ala His Gln Gln Pro Leu Arg Asp Lys Ala Gln Thr
245 250 255
Ala Leu Asp Arg Gly Ala Pro Leu Phe Val Thr Glu Trp Gly Thr Thr
260 265 270
Thr Phe Thr Gly Asp Gly Phe Val Asp Glu Ala Gln Thr Arg Thr Trp
275 280 285
Ile Asn Trp Leu Asn Glu Arg Gly Ile Ser His Val Asn Trp Ser Ala
290 295 300
Ser Thr Gln Pro Glu Ser Ser Ala Ile Trp Asn Gly Asp Met Thr Tyr
305 310 315 320
Lys His Ser Gly Leu Leu Val Gly Glu Leu Val Gln Gln Thr Asn Gly
325 330 335
Thr Thr Thr Pro Pro Thr Gly Glu Ile Ser Gly Pro Cys Asp Leu His
340 345 350
Phe Val Pro Ala Lys Ala Glu Ala Glu Ser Phe Cys Thr Ala Lys Gly
355 360 365
Ile Gln Phe Glu Thr Thr Thr Asp Thr Gly Gly Gly Gln Asn Met Gly
370 375 380
Trp Leu Asp Ala Gly Asp Trp Val Thr Phe Asp Val Asp Val Pro Ala
385 390 395 400
Ser Gly Gln Tyr Leu Ile Asp Tyr Arg Val Ala Ser Glu Leu Gly Asp
405 410 415
Gly Arg Phe Arg Thr Glu Ala Ala Asn Gly Thr Ala Leu Gly Thr Ile
420 425 430
Ser Val Pro Asn Thr Gly Gly Trp Gln Asn Trp Gln Thr His Thr His
435 440 445
Thr Val Gln Leu Ser Gln Gly Thr Gln Thr Val Lys Leu Val Ala Glu
450 455 460
Thr Gly Gly Trp Asn Leu Asn Trp Phe Glu Val Arg Ala Gly Glu Val
465 470 475 480
Cys Glu Gly Ala Asp Cys Pro Cys Glu Gly Ala Glu Cys Pro Cys Pro
485 490 495
Asp Cys Asn Gly Thr Pro Val Lys Phe Glu Ala Glu Thr Phe Val Ala
500 505 510
Met Gln Gly Val Gln Leu Glu Asn Thr Ser Asp Val Gly Gly Gly Gln
515 520 525
Asn Val Gly Tyr Ile Asp Ser Gly Asp Trp Ile Thr Tyr Asn Gly Ala
530 535 540
Leu Pro Ala Ser Ala Asp Asn Arg Tyr Val Val Ser Tyr Arg Val Ala
545 550 555 560
Arg Gln Pro Ser Gly Asn Ala Lys Phe Lys Ile Glu Gln Pro Gly Gly
565 570 575
Ala Ala Val Tyr Gly Glu Ile Ser Val Pro Ser Thr Gly Gly Trp Gln
580 585 590
Thr Trp Thr Thr Ile Ser His Thr Ile Thr Ile Pro Ala Asn Ala Asn
595 600 605
Gly Phe Ala Leu Ala Ala Ile Asp Gly Gly Trp Asn Ile Asn Trp Ile
610 615 620
Glu Ile Lys Pro Ala Thr Thr Gln Pro Pro Glu Pro Ile Asn Pro Leu
625 630 635 640
Lys Leu Gln Ala Glu Asp Tyr Ile Asn Phe Asn Asp Thr Thr Pro Gly
645 650 655
Asn Glu Gly Gly Ala His Arg Ser Asp Asp Val Asp Ile Gln Ala Thr
660 665 670
Thr Asp Thr Gly Gly Gly Phe Asn Val Gly Trp Val Asp Ala Gly Glu
675 680 685
Trp Leu Glu Tyr Glu Phe Phe Leu Glu Ser Pro Asp Phe Tyr Ala Ala
690 695 700
Asp Val Arg Val Ala Ser Asp Gln Thr Gly Gly Ala Leu Gln Leu Gln
705 710 715 720
Ile Asp Gly Gln Asn Val Gly Gln Ala Ile Thr Val Gly Asn Thr Gly
725 730 735
Gly Trp Gln Ala Trp Thr Thr Lys Asn Thr Leu Ile Gly Asp Leu Ser
740 745 750
Ala Gly Thr His Thr Leu Arg Val Tyr Ala Gln Ser Gly Pro Leu Asn
755 760 765
Leu Asn Trp Val Glu Leu Lys Arg Thr Thr Pro Ala Pro Ala Thr Ser
770 775 780
Cys Phe Asn Ile Ala Glu Asp Arg Leu Asn Val His Leu Asp Ala His
785 790 795 800
Cys Thr Ala Gly Ser Asn Leu Gln Tyr Asn Trp Asp Phe Gly Asp Gly
805 810 815
Asn Ser Ala Thr Gly Val Ala Thr Ser His Ser Tyr Tyr Thr Ser Gly
820 825 830
Thr Tyr Thr Ile Thr Leu Thr Val Ser Asp Thr Arg Thr Thr Asp Thr
835 840 845
Ser Ser Gln Gln Val Thr Val Asp Phe Ser Ala Pro Ala Gly Pro Val
850 855 860
Asp Phe Tyr Gly Glu Leu Met Val Asn Gly Asn Arg Ile His Gly Glu
865 870 875 880
Lys Thr Gly Glu Pro Ala Gln Val Arg Gly Met Ser Phe Phe Trp Ser
885 890 895
Asn Thr Gly Trp Gly Gln Glu Lys Trp Trp Asn Ala Ser Thr Val Asp
900 905 910
Arg Met Val Asp Glu Phe Lys Val Glu Leu Val Arg Gly Ala Met Gly
915 920 925
Thr Asp Glu Gly Gly Gly Tyr Leu His Asp Ala Ser Asn Lys Ala Arg
930 935 940
Leu Gln Ala Val Val Glu Gln Ala Ile Ala Arg Asn Val Tyr Val Ile
945 950 955 960
Ile Asp Trp His Thr His His Ala Glu Asp Asn Ile Ala Glu Ala Ile
965 970 975
Thr Phe Phe Ser Glu Met Ala Gln Leu Tyr Gly His His Asp Asn Val
980 985 990
Ile Phe Glu Ile Tyr Asn Glu Pro Leu Asn Thr Thr Ser Trp Gly Thr
995 1000 1005
Ile Lys His Tyr Ala Glu Gln Val Ile Pro Ala Ile Arg Ala His Ser
1010 1015 1020
Asp Asn Leu Ile Val Val Gly Thr Arg Thr Trp Ser Gln Asn Val Asp
1025 1030 1035 1040
Glu Ala Ala Phe Asp Lys Ile Asn Asp Ser Asn Thr Ala Tyr Ala Leu
1045 1050 1055
His Phe Tyr Val Gly Ser His Gly Asn His Val Arg Asn Leu Ala Gln
1060 1065 1070
Thr Ala Leu Asn Asn Gly Ala Ala Ile Phe Ala Ser Glu Trp Gly Ile
1075 1080 1085
Trp Pro Asn Asn Asn Tyr Asp Gly Met Asn Ala Asp Asp Trp Met Asn
1090 1095 1100
Phe Leu Asp Gln Asn Lys Ile Ser Trp Ala Asn Trp Ala Ile Ser Asp
1105 1110 1115 1120
Lys Val Asp Pro Asn Thr Gly Gln Leu Glu Pro Pro Ser Met Phe Asn
1125 1130 1135
Pro Asp Gly Ser Leu Ser Ser Asn Gly Gln Tyr Val Val Asn Lys Leu
1140 1145 1150
Asn Glu Tyr Ala Ala Gln Ala Pro Trp Arg Glu Ala Ile Ala Asn
1155 1160 1165
<210> 2
<211> 3504
<212> DNA
<213> Microbulbifer degradans
<400> 2
atgacaatta aacgttggcc gttcgaccga aaaggcccac ctaaaaaacc taacgctaaa 60
aaattactcg caagcttagc ggctgcacta agcttaaccg ccatgcaaag cactgcagcg 120
gtagagccat tacaaaccag cggcaatcaa attcttgttg gcaaccaagc caaagccctt 180
ggcggccaca gcttgttttg gcataacgtg ccggcagcag gcagcttata caatgcagat 240
acagtaagca ggcttaagaa tgattggaac tccaaggtta ttcgggccgc aattggggtt 300
gaagtacctt tcaattcaga aaacacctac ataggcaata agggcagctc gctggccgca 360
atagaccgcg tagttaatgc cgctgttgcc aacgatatgt atgtgattat cgattttcat 420
actcaccatg cagatcaagt agaaaacgtt gcccacgact ttttcaacga agtttctagc 480
cgttacggtc atttaaacaa tgttatttat gaagtattta acgagccaga atggtgtggc 540
gagcacggtc ggtgggcatc taccattaag ccctacgccg agcgcgttat ccaaaccatt 600
cgcaacaatg acccagacaa cctagtaata gtaggcacta cctgtttctc gcaagatgta 660
gatgtagccg cagccgaccc cattaacgat gtaaacgtgg cctatacgct acacttttac 720
gcagccaccc ctgcccacca gcaacccttg cgcgacaagg cccaaaccgc gctcgaccgc 780
ggcgcgccac tatttgtaac cgaatggggt acaaccacat ttacaggtga tggttttgta 840
gatgaggcgc aaacgcgcac atggattaac tggttaaacg aacgcggtat tagccacgtt 900
aactggtcgg cgtctaccca gccagaaagc tcagctatat ggaatggcga catgacctac 960
aagcattcgg gcttattggt tggcgaactg gtgcaacaaa caaatggcac aaccacgcca 1020
ccaaccggtg aaataagtgg cccgtgcgat ttacattttg tacctgccaa agccgaggct 1080
gaaagcttct gtaccgccaa aggcattcaa tttgaaacca ccaccgacac gggcggcggc 1140
caaaacatgg gctggctaga tgccggcgac tgggtaactt ttgatgtaga tgtacctgct 1200
agcggccaat atttaataga ttaccgcgta gcatcagagc taggtgatgg tcggttccgc 1260
accgaagccg ccaacggcac tgcccttggc acaatatctg tacccaatac cggcggctgg 1320
cagaattggc aaacgcacac acacacagtg caactctcgc aaggcacaca aaccgttaaa 1380
ctagttgccg aaactggtgg ctggaactta aattggtttg aagtgcgcgc aggtgaggtg 1440
tgcgaaggcg ctgactgccc atgtgaagga gccgaatgcc cttgcccaga ttgcaacggc 1500
acaccggtta agtttgaggc agaaacgttt gtggctatgc aaggcgtgca gctagaaaac 1560
acatccgatg tgggcggcgg ccaaaacgtt ggctacattg atagcggcga ctggataact 1620
tacaacgggg ccttgcccgc aagtgcagac aaccgctatg tagtgtctta tagagtagcg 1680
cgtcaaccta gcggcaatgc caaatttaaa atagaacagc caggtggagc agcggtatat 1740
ggcgaaattt cggtgcccag caccggcggc tggcaaacat ggacaaccat tagccacacc 1800
ataacaattc ccgctaacgc aaacggcttt gcactagcag caatagatgg cggttggaat 1860
ataaactgga tagaaataaa accggcgacc actcaaccac ccgagccaat caacccgtta 1920
aaacttcaag ctgaagatta catcaacttt aacgacacca cccccggtaa cgaaggcggt 1980
gcacacagaa gcgatgatgt agatattcaa gcaactaccg ataccggtgg cggttttaat 2040
gttggctggg tagacgctgg cgaatggcta gagtatgagt tctttttaga gtctcctgat 2100
ttttatgcag ctgatgtacg ggttgcttca gaccaaactg gcggcgcact gcaactacaa 2160
atagatggcc aaaacgttgg ccaagccatt accgttggca acaccggtgg ctggcaagcg 2220
tggacaacca aaaacacact cattggcgac ctaagtgcag gcacccacac gttgcgtgta 2280
tacgcgcaaa gcggcccatt aaatttaaac tgggtagagc taaagcgtac aacgcccgca 2340
ccagccactt cgtgttttaa tattgccgaa gaccgcttaa acgttcacct agatgcgcac 2400
tgtactgcag gcagcaacct gcaatacaat tgggattttg gtgacggcaa cagcgcaacc 2460
ggcgtagcca ctagccacag ctactacact agcggcactt acaccattac cttaaccgtt 2520
agtgataccc gcaccacaga cacctctagc caacaggtaa cggtagattt ttctgcccct 2580
gcaggccctg tggattttta cggcgaacta atggtgaatg gcaaccgcat tcacggcgaa 2640
aaaaccggcg aacccgcaca agtacgcggc atgagctttt tttggagcaa caccggttgg 2700
ggccaagaaa aatggtggaa cgccagcacc gtggaccgca tggttgatga gttcaaagta 2760
gaacttgtgc gcggcgcaat gggcactgat gaaggcggcg gttatttaca cgacgcgtct 2820
aataaggctc gcttacaagc agttgttgaa caagccattg cacgcaatgt gtatgtaatt 2880
atcgactggc acacccacca tgccgaagat aacattgccg aagccattac attctttagc 2940
gaaatggcgc agctttatgg ccaccacgac aacgtgattt tcgagattta caacgagcca 3000
ttaaacacca caagctgggg cactattaag cactacgctg aacaagttat tcctgctatt 3060
cgcgctcatt ccgataattt aattgttgtg ggcacgcgca cctggtcgca aaacgtagac 3120
gaagccgcgt tcgataaaat taacgacagc aacaccgcct acgccctgca cttttatgtt 3180
ggctcgcacg gcaaccacgt tcgcaaccta gcacaaaccg cactaaacaa cggcgcggct 3240
atttttgcta gcgaatgggg aatttggcca aacaacaact acgatggcat gaacgccgac 3300
gattggatga actttttaga ccaaaacaaa atatcttggg ctaactgggc catatccgac 3360
aaagtagacc ccaacacagg ccaactagaa ccacccagca tgttcaaccc agacggcagc 3420
ctaagcagta atggtcaata tgtagtgaac aaactaaatg aatacgcagc acaagcaccg 3480
tggagggagg caatcgctaa ttga 3504
<210> 3
<211> 133
<212> PRT
<213> Microbulbifer degradans
<400> 3
Met Val Val Ser Leu Ala Asp Asn Ser Ala Gly Ala Ile Ser Cys Trp
1 5 10 15
His Ala Lys Ala Ser Pro Pro Glu Glu Leu Glu Glu Leu Leu Asp Glu
20 25 30
Glu Leu Glu Leu Asp Glu Leu Glu Leu Glu Glu Glu Leu Glu Glu Leu
35 40 45
Val Glu Glu Leu Leu Glu Glu Leu Leu Asp Glu Leu Leu Leu Asp Glu
50 55 60
Leu Glu Asp Glu Pro Leu Ala Ala Pro Pro Ser Leu Pro Pro Pro Gln
65 70 75 80
Ala Val Ser Pro Ala Lys Gln Leu Ile Ser Ser Ala Asp Phe Lys Lys
85 90 95
Val Ser Phe Arg Val Gln Leu Asn Ala Val Arg Val Lys Ser Lys Arg
100 105 110
Asn Ile Asn His Ser Arg Ile Phe Leu Phe Trp Leu Phe Ser His Phe
115 120 125
Arg Ser Arg Arg Cys
130
<210> 4
<211> 566
<212> PRT
<213> Microbulbifer degradans
<400> 4
Met Phe Leu Leu Asp Phe Thr Arg Thr Ala Phe Ser Cys Thr Arg Lys
1 5 10 15
Leu Thr Phe Leu Lys Ser Ala Leu Leu Ile Ser Cys Phe Ala Gly Leu
20 25 30
Thr Ala Cys Gly Gly Gly Ser Asp Gly Gly Ala Ala Ser Gly Ser Ser
35 40 45
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
50 55 60
Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
65 70 75 80
Ser Ser Ser Ser Asn Ser Ser Ser Ser Ser Ser Gly Gly Asp Ala Leu
85 90 95
Ala Cys Gln His Glu Met Ala Pro Ala Leu Leu Ser Ala Ser Asp Thr
100 105 110
Thr Met Val Gln Ala Glu Tyr Tyr Asp Thr Cys Ala Ser Ser Ala Leu
115 120 125
Asp Asn Thr Thr Gly Asn Ser Gly Gly Glu Leu Arg Thr Asp Asp Val
130 135 140
Asp Ile Val Ala Ile Ala Asp Gly Tyr Ala Ile Thr Asp Met Gln Ser
145 150 155 160
Gly Glu Tyr Val Glu Tyr Ser Leu Thr Val Gln Thr Ser Gly Leu Phe
165 170 175
Asp Ile Ser Phe Ala Val Gln Pro His Ala Ala Asn Thr Ala Gly Leu
180 185 190
Ala Leu Ser Val Asp Gly Ala Val Leu Gly Thr Val Asp Ile Ala Ala
195 200 205
Asn Asp Ser Thr Ala Phe Gly Glu Tyr Thr Leu Asn Gly Val Tyr Ile
210 215 220
Ser Asp Gly Ala Gln Val Ile Arg Val Thr Met Ala Gly Glu Gly Ala
225 230 235 240
Ala Ile Gly Leu Asp Ser Ile Ala Phe Asn Tyr Thr Asp Asn Thr Val
245 250 255
Tyr Thr Pro Glu Asn Ala Val Leu Gly Met Gly Ile Gly Ile Asn Leu
260 265 270
Gly Asn Thr Leu Asp Ala Phe Pro Asn Glu Gly Asp Trp Ala Pro Ala
275 280 285
Ala Gln Glu Tyr Tyr Phe Lys Ala Tyr Lys Asp Ala Gly Phe Arg His
290 295 300
Val Arg Ile Pro Ala Thr Trp Asp Asp His Thr Ala Asp Thr Ala Pro
305 310 315 320
Tyr Ala Val Asn Ala Ala Arg Met Asp Arg Thr Glu Gln Ile Val Asp
325 330 335
Trp Ala Leu Ala Gln Gly Tyr Phe Val Ile Leu Asn Ala His His Glu
340 345 350
His Trp Leu Lys Glu Asn Tyr Gly Asn Gln Thr Tyr Arg Asp Arg Phe
355 360 365
Asp Ala Ile Trp Gln Gln Ile Ala Glu Arg Phe Lys Asn Lys Ser Ala
370 375 380
Arg Leu Met Phe Glu Ile Leu Asn Glu Pro Asn Gly Met Thr Val Ala
385 390 395 400
Asp Val Asp Asp Leu Asn Pro Arg Ile Leu Asp Ile Ile Arg Glu Thr
405 410 415
Asn Pro Thr Arg Leu Val Val Phe Ser Gly Asn Gly Tyr Thr Pro Val
420 425 430
Asp Ala Leu Leu Ala Ala Ala Ile Pro Asn Asp Asp Tyr Leu Ile Gly
435 440 445
Asn Phe His Ser Tyr Asp Pro Trp Gln Phe Gly Gly Gln Cys Val Arg
450 455 460
Ser Trp Gly Thr Glu Gln Asp Tyr Thr Asp Leu Glu Asn Ile Tyr Lys
465 470 475 480
Arg Ala Asn Thr Trp Ser Glu Gln His Asp Ile Pro Val Met Val Asn
485 490 495
Glu Phe Gly Ala Ala His Tyr Asp Phe Thr Ala Pro Gln Asn Val Cys
500 505 510
Asn Gln Gln Ala Arg Leu Ala Tyr Leu Gly Ala His Ala Thr Phe Ala
515 520 525
Ile Gln Tyr Gly Phe Gly Ala Ser Val Trp Asp Asp Gly Gly Ser Phe
530 535 540
Glu Val Tyr Lys Arg Gly Glu Asn Ser Trp Arg Glu Ala Lys Asp Val
545 550 555 560
Leu Val Ala Pro Asn Pro
565
<210> 5
<211> 1701
<212> DNA
<213> Microbulbifer degradans
<400> 5
atgtttcttt tagactttac ccgcactgcg tttagctgta cacgaaagct tacctttttg 60
aaatccgcgc tacttataag ctgctttgcc gggcttactg cctgtggtgg cgggagtgat 120
ggcggtgctg caagtggctc atcctctagc tcgtctagca gcagttcgtc tagtagctct 180
tcgagcagtt cttcaactag ttcctcaagc tcctcttcaa gctctagttc gtccagttcc 240
agctcttcgt ctaatagttc ctctagctcc tctggtggcg atgctttagc gtgccagcat 300
gaaatggcac cagcgctatt atctgcaagt gatactacca tggtgcaagc ggagtattac 360
gatacctgtg cttcttcggc attagataac accactggta acagtggcgg tgagttgcga 420
actgacgatg tagatatagt ggccattgcg gacggctatg ctattacgga tatgcagtca 480
ggcgagtacg tagaatattc actaacagtg caaacttccg gtttgtttga cattagtttt 540
gcggtacagc cgcacgcagc taatactgcc ggtttggcgc tgagtgtaga tggcgcagtg 600
ttaggcacag ttgatattgc cgctaatgac agcaccgcat ttggcgaata tacgcttaac 660
ggcgtgtaca taagcgatgg cgcgcaagta ataagggtaa ccatggccgg cgaaggcgct 720
gctattgggt tagattccat tgcctttaat tacaccgata ataccgttta caccccagaa 780
aacgccgtgt tgggtatggg aataggtatt aacctaggca ataccttaga tgccttcccc 840
aacgaaggtg actgggcacc ggctgcgcag gaatactatt ttaaagccta caaggatgca 900
ggtttccgcc atgtacgcat cccagcaact tgggatgatc acacggctga tacagccccc 960
tacgctgtaa atgcagcacg tatggatcgc actgagcaga ttgtagattg ggccttggcg 1020
cagggctatt tcgtaattct taatgcccac cacgaacact ggctaaaaga aaactacggc 1080
aatcaaacat accgcgatcg ctttgatgca atttggcagc aaattgccga acgctttaag 1140
aataagtcgg ctcgcttaat gtttgagata ctcaatgagc caaacggcat gacagtggcc 1200
gatgtggatg acctcaaccc acgtattctc gatattattc gcgaaaccaa tcccacgcga 1260
ttggtagtgt tctctggtaa tgggtatacc cctgtggatg ccttacttgc ggctgcaatc 1320
cctaatgatg attaccttat tggtaacttt cactcctacg acccttggca gtttggcggt 1380
cagtgcgtac gatcgtgggg tacagagcaa gattacaccg acctagagaa catatataag 1440
cgcgcaaata cttggtctga gcagcacgac atacccgtta tggtgaacga atttggcgct 1500
gcccattacg attttactgc accgcagaat gtatgtaacc agcaggctcg tttggcttat 1560
ttaggtgccc atgccacatt tgctattcag tacggctttg gcgcaagtgt atgggacgac 1620
ggtggatcat ttgaggtgta caagcgcggt gaaaatagct ggcgcgaagc taaagatgta 1680
ttagtggcgc caaacccgta g 1701
<210> 6
<211> 451
<212> PRT
<213> Microbulbifer degradans
<400> 6
Met Arg Ile Ile Thr Ala Phe Ala Val Met Leu Leu Cys Ile Thr Gly
1 5 10 15
Cys Ser Gly Ser Gly Ala Ser Asp Ser Pro Gln Ala Ser Asn Ser Ser
20 25 30
Ser Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
35 40 45
Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser
50 55 60
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Gly Glu Ala Leu
65 70 75 80
Tyr Pro Ser Tyr Asn Thr Asn Pro Pro Ala Pro Asp Met Thr Gly Met
85 90 95
Thr Ser Thr Ala Thr Gln Leu Ala Asp Arg Ile Thr Val Gly Trp Asn
100 105 110
Ile Gly Asn Thr Leu Glu Ala Ile Gly Gly Glu Thr Asn Trp Gly Asn
115 120 125
Pro Leu Val Thr Asn Glu Leu Ile Gln Ala Val Lys Ala Ser Gly Phe
130 135 140
Asp Ser Ile Arg Ile Pro Ala Ala Trp Asp Gln Tyr Ala Asn Gln Glu
145 150 155 160
Thr Ala Ala Ile Asp Ile Asn Trp Leu Asn Arg Val Lys Gln Val Val
165 170 175
Gln Tyr Ser Ile Asp Asn Asp Met Val Val Val Leu Asn Ile His Trp
180 185 190
Asp Gly Gly Trp Leu Glu Arg Asn Val Glu Pro Ser Glu Gln Val Ala
195 200 205
Val Asn Ala Lys Gln Lys Ala Tyr Trp Glu Gln Ile Ala Thr His Leu
210 215 220
Arg Asp Phe Asp Glu Arg Leu Ile Phe Ala Ser Ala Asn Glu Pro His
225 230 235 240
Val Glu Thr Glu Ala Gln Met Ala Val Leu Asn Val Tyr His Gln Thr
245 250 255
Phe Val Asp Thr Val Arg Ala Thr Gly Gly Lys Asn Ala Tyr Arg Val
260 265 270
Leu Val Leu Gln Gly Pro Lys Thr Asp Ile Glu Thr Thr Ser Leu Leu
275 280 285
Trp Thr Gln Met Pro Gln Asp Ser Ala Val Asn Lys Leu Met Ala Glu
290 295 300
Leu His Phe Tyr Thr Pro Tyr Asn Phe Thr Leu Met Asn Val Asp Glu
305 310 315 320
Ser Trp Gly Asn Gln Phe Tyr Tyr Trp Gly Glu Gly Asn His Ser Thr
325 330 335
Thr Asp Thr Gly Arg Asn Pro Thr Trp Gly Glu Glu Ala Thr Val Asp
340 345 350
Ser Leu Leu Ala Ile Thr Lys Gln Gln Phe Val Asp Gln Gly Ile Pro
355 360 365
Val Ile Ile Gly Glu Tyr Gly Ala Gln Arg Arg Asp Asn Leu Thr Gly
370 375 380
Asp Glu Leu Ala Leu His Leu Gln Ser Arg Asn Tyr Tyr Leu Lys Tyr
385 390 395 400
Val Thr Gln Lys Cys Val Glu Leu Gly Leu Lys Pro Phe Tyr Trp Asp
405 410 415
Thr Gly Gly Leu Asp Asn Asn Gln Ser Gly Leu Phe Asn Arg Ser Thr
420 425 430
Tyr Gln Val Phe Asp Gln Asn Ala Leu Asp Ala Ile Met Glu Gly Ala
435 440 445
Arg Gly Glu
450
<210> 7
<211> 1356
<212> DNA
<213> Microbulbifer degradans
<400> 7
atgagaataa taacggcgtt tgcagttatg ctgctatgca taacaggctg tagcggatcg 60
ggcgcgagtg atagcccgca agcatccaat tcgtcttcgg gcagttcttc tagctctagc 120
agttcgtcaa gttcgagcag ttcctctagt tcgtcgtcta gctcttcaac aagctctagc 180
agctcatcta gctccagctc atcatcaagc tctagcagtt cttcgggcgg cgaagcgctt 240
tacccaagct acaatacaaa cccgccagcg ccagatatga ccggcatgac aagtactgcc 300
acacaactag cagatcgtat aaccgtgggc tggaatattg gtaacacgct agaggcaata 360
ggcggcgaaa ccaactgggg taacccgctg gttactaacg aattaattca agcggtaaaa 420
gccagtggct ttgattccat tcgtataccc gccgcgtggg atcaatacgc caaccaagaa 480
acggccgcaa tagatataaa ctggctaaac cgcgttaaac aagttgtgca atacagcata 540
gataacgaca tggtggtagt gctaaacatc cactgggatg gcggttggct agagcgcaat 600
gtagagccca gcgagcaagt agcagtaaat gcaaaacaaa aagcctattg ggaacaaatt 660
gccactcacc tgcgcgactt tgacgagcgc ctaatatttg ccagcgccaa cgaaccccat 720
gtagaaaccg aagcacaaat ggccgtacta aacgtatacc atcaaacgtt tgtagataca 780
gtgcgtgcaa ctggcggtaa aaatgcttac cgcgtactgg tattgcaggg gccaaaaaca 840
gatatagaaa ccacctcgct attgtggacc caaatgccgc aagatagcgc cgtaaataaa 900
cttatggcag agctacactt ctataccccg tacaacttta cgttaatgaa tgtagatgaa 960
agctggggca accagttcta ctactggggc gaaggtaatc attccactac cgacacaggc 1020
cgcaacccaa cctggggcga agaagcaaca gtagattcac tgctggcaat taccaaacaa 1080
cagtttgtgg accaaggtat acccgtaatt attggcgaat acggtgcaca acgccgcgat 1140
aaccttaccg gcgatgaatt ggccctgcac ttacaatcgc gcaactacta cttaaaatac 1200
gttactcaaa aatgtgtaga gctaggctta aaaccttttt attgggatac cggcggctta 1260
gacaacaatc aatctggcct gtttaatcgc agtacctacc aagtatttga tcaaaatgcc 1320
ctagatgcca ttatggaagg ggccagaggg gaataa 1356
<210> 8
<211> 621
<212> PRT
<213> Microbulbifer degradans
<400> 8
Met Leu Lys His Gln Phe Ser Lys Ala Leu Arg Ala Leu Gly Phe Gly
1 5 10 15
Gly Ala Val Phe Ala Ala Ser Leu Met Ala Ser Gln Ala Ser Ala Leu
20 25 30
Glu Cys Glu His Ser Ile Ser Asn Asp Trp Gly Ala Gly Phe Thr Gly
35 40 45
Ala Met Lys Val Thr Asn Asn Asp Ser Ser Pro Ile Thr Gly Trp Arg
50 55 60
Val Glu Trp Ala Tyr Ser Gly Asn Val Asn Ile Val Asn Ser Trp Asn
65 70 75 80
Ala Ser Val Thr Lys Gly Ser Asn Tyr Val Ala Val Asp Ala Gly Trp
85 90 95
Asn Gly Asn Leu Gln Pro Ser Gln Ser Thr Glu Phe Gly Leu Gln Gly
100 105 110
Asp Gly Ala Asp Arg Asn Val Thr Ile Ile Ser Cys Val Ala Glu Gly
115 120 125
Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
130 135 140
Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr
145 150 155 160
Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Thr Ser Ser
165 170 175
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Gly Asn Cys Val
180 185 190
Ala Met Cys Asn Trp Tyr Gly Glu Asn Arg Pro Val Cys Ala Asn Gln
195 200 205
Asn Thr Gly Trp Gly Trp Glu Asn Asn Gln Ser Cys Ile Gly Ala Asn
210 215 220
Thr Cys Asn Asp Gln Trp Gly Asp Gly Gly Val Val Ser Ser Cys Gly
225 230 235 240
Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser
245 250 255
Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Ser
260 265 270
Ser Ser Ser Ser Ser Gly Gly Leu Ser Ala Val Glu Phe Ser Gln Gln
275 280 285
Met Gly Leu Gly Trp Asn Leu Gly Asn Ser Leu Glu Ala Ile Gly Gly
290 295 300
Glu Thr Ala Trp Gly Asn Pro Met Val Thr Gln Gln Leu Ile Asn Ser
305 310 315 320
Ile Lys Ala Ala Gly Phe Asp Thr Ile Arg Ile Pro Val Ala Trp Ser
325 330 335
Gln Phe Ser Asp Glu Ala Asn Phe Val Ile Asn Ser Asn Trp Ile Ala
340 345 350
Arg Val Glu Glu Val Val Asn Tyr Ala Leu Ser Ala Asp Met Tyr Val
355 360 365
Val Met Asn Gln His Trp Asp Gly Gly Trp Met Gln Pro Thr Tyr Ala
370 375 380
Gln Gln Glu Tyr Val Asn Asn Arg Leu Gln Ile Met Trp Thr Gln Ile
385 390 395 400
Ala Asn His Phe Lys Asp Tyr Asp Ser Arg Leu Leu Phe Ala Gly Thr
405 410 415
Asn Glu Val Met Val Glu Gly Asp Tyr Gly Thr Pro Thr Phe Glu Tyr
420 425 430
Tyr Thr Val Gln Asn Ser Phe Asn Gln Thr Phe Val Asp Ala Val Arg
435 440 445
Ala Thr Gly Gly Ala Asn Ala Ser Arg Tyr Leu Val Val Gln Gly Phe
450 455 460
Asn Thr Asn Ile Asp His Thr Val Asn Phe Ala Val Val Pro Thr Asp
465 470 475 480
Pro Ala Thr Asn Arg Leu Met Met Glu Val His Tyr Tyr Asp Pro Tyr
485 490 495
Asn Phe Thr Leu Asn Thr Asn Ser Asn Ile Thr Gln Trp Gly Val Ile
500 505 510
Ala Thr Asp Pro Ser Val Thr Glu Thr Trp Ala Asn Glu Ser Tyr Val
515 520 525
Asp Ala Thr Phe Gln Lys Met Lys Thr Asn Phe Val Asp Gln Gly Ile
530 535 540
Ala Val Ile Leu Gly Glu Tyr Gly Val Val Ser Arg Ala Asn Val Ala
545 550 555 560
Gly His Glu Thr Tyr Arg Glu Tyr Trp Asn Gln Tyr Ile Thr Gln Ser
565 570 575
Ala Val Asp His Gly Met Val Pro Ile Tyr Trp Asp Asn Gly Tyr Ser
580 585 590
Gly Asp Gly Gly Met Ala Leu Phe Asp Arg Ala Ser Gly Asn Gln Leu
595 600 605
Tyr Pro Asn Ile Ile Asn Ala Ile Ile Asn Ala Gly Asn
610 615 620
<210> 9
<211> 85
<212> PRT
<213> Microbulbifer degradans
<400> 9
Met Leu Glu Glu Glu Leu Glu Val Glu Leu Glu Glu Glu Leu Val Glu
1 5 10 15
Glu Leu Glu Leu Leu Asp Glu Glu Val Leu Glu Leu Asp Glu Glu Leu
20 25 30
Leu Glu Glu Leu Glu Glu Leu Asp Glu Leu Glu Asp Asp Pro Pro Ser
35 40 45
Ala Thr Gln Leu Ile Met Val Thr Phe Leu Ser Ala Pro Ser Pro Cys
50 55 60
Lys Pro Asn Ser Val Asp Trp Leu Gly Cys Lys Leu Pro Phe His Pro
65 70 75 80
Ala Ser Thr Ala Thr
85
<210> 10
<211> 1866
<212> DNA
<213> Microbulbifer degradans
<400> 10
atgttgaaac atcaattcag caaagcgctg cgtgcgctag gctttggtgg ggctgtgttt 60
gcggcatcgc taatggctag ccaagcaagt gcccttgagt gtgagcattc aatcagtaat 120
gattggggcg ccggctttac cggtgcaatg aaagttacca ataatgactc tagccccatt 180
accggttggc gggtcgaatg ggcgtatagc ggcaatgtaa atattgttaa ttcgtggaac 240
gcctcagtaa caaaaggcag taattatgtt gccgtagatg ccggatggaa tggtaattta 300
cagccgagcc aatctaccga atttggctta cagggtgatg gcgccgatag aaatgtaacc 360
attattagtt gtgttgccga aggcggatca tcttctagtt catcaagttc ttccagctcc 420
tcaagtagtt cttcatctag ctcaagtact tcttcatcga gtagctcaag ttcctcgacg 480
agctcttctt ctagttcgac ttctagctct tcttcaagca cctcttctag ctcatcgtcc 540
agttcatcaa gctcttcttc gggcggcaac tgtgttgcaa tgtgtaattg gtacggtgaa 600
aaccgccctg tttgtgccaa tcaaaatact ggttgggggt gggaaaacaa ccaaagctgt 660
ataggtgcaa acacctgtaa cgatcaatgg ggcgacgggg gcgtggtgtc cagctgtggt 720
acgtctagct cttcatccag ttcttcgtcc agttcgtcta ccagttcatc ctcgtcttct 780
agctcgagca ccagctctac aagcagctca tcaagctcta gttcgtcgtc tggtgggtta 840
agcgcggtag agttttcgca gcaaatgggc ttggggtgga atcttggaaa ctccctagaa 900
gcgattggtg gcgaaaccgc gtggggcaac ccaatggtta cgcagcaatt aattaactcc 960
ataaaagctg ctgggttcga cactattcgc attccggttg cgtggagcca attctcggac 1020
gaagctaatt ttgttatcaa tagcaattgg attgcacgcg tagaagaagt agtgaactac 1080
gcattgagcg ccgatatgta cgtggtaatg aaccaacatt gggacggcgg ttggatgcag 1140
cccacatatg cacagcaaga atatgttaac aatcgcttgc aaattatgtg gacgcaaata 1200
gctaatcact ttaaagatta cgatagtcgc ttactgtttg caggcaccaa cgaagtgatg 1260
gtggaaggcg attacggtac gcccaccttc gaatactaca cagtacaaaa tagctttaac 1320
caaacgtttg tggatgctgt acgtgcaacc ggtggcgcta atgctagccg ttacttagtg 1380
gtacaggggt ttaataccaa catagatcac acggtgaact tcgcggtagt gccaaccgac 1440
ccggcaacaa acaggttaat gatggaagta cactattacg acccctataa ctttacgtta 1500
aataccaaca gcaacattac tcagtggggc gtaattgcaa ctgaccctag cgttaccgaa 1560
acatgggcga atgaatctta tgtggatgcg actttccaaa aaatgaaaac taacttcgtt 1620
gatcaaggta tagcggtaat tttaggtgag tacggggttg tatcgcgcgc gaatgtggcc 1680
gggcacgaaa cttaccgaga gtattggaac caatacatta ctcaatctgc ggtagatcat 1740
ggaatggtgc ctatttattg ggataacggt tattccggtg atggtggtat ggcattgttt 1800
gatcgcgcca gtggcaatca actttacccc aatattatta acgcaattat caatgccggt 1860
aactaa 1866
<210> 11
<211> 673
<212> PRT
<213> Microbulbifer degradans
<400> 11
Met Leu Ile Gly Thr Val Thr Ala Ser Ala Leu Val Gly Arg Gly Arg
1 5 10 15
Gly Thr Pro Lys Lys Ile Ile Asn Lys Gly Ser Ile Met Trp Gln Ile
20 25 30
Asn Lys Ser Ala Leu Ala Ala Val Val Leu Val Cys Ser Ser Ser Ser
35 40 45
Phe Ala Gln Ser Ala Cys Asp Thr Gln Arg Ile Glu Ala Glu Asn Tyr
50 55 60
Val Ala Met Ser Gly Ile Gln Thr Glu Ser Thr Ala Asp Thr Gly Gly
65 70 75 80
Gly Leu Asn Val Gly Trp Ile Asp Ala Gly Asp Trp Leu Ser Tyr Gln
85 90 95
Val Asn Leu Pro Ala Ala Gly Gln Tyr Glu Val Arg Tyr Arg Val Ala
100 105 110
Ser Arg Asn Gly Gly Gly Val Leu Arg Leu Glu Gly Asn Ala Gly Gln
115 120 125
Thr Leu Tyr Gly Thr Met Asn Val Pro Asn Thr Gly Gly Trp Gln Asn
130 135 140
Trp Gln Thr Leu Ser His Ser Val Thr Leu Ala Ala Gly Glu Gln Ser
145 150 155 160
Ile Gly Ile Gly Val Pro Ser Gly Gly Phe Asn Ile Asn Trp Leu Glu
165 170 175
Phe Val Pro Leu Asp Cys Ser Gly Pro Ile Asp Pro Pro Ile Asn Pro
180 185 190
Pro Ser Asn Cys Ala Ser Ile Val Phe Glu Ala Glu Asn Tyr Asp Gln
195 200 205
Met Ser Gly Ile Arg Thr Gln Thr Thr Ser Asp Thr Gly Gly Gly Leu
210 215 220
Asn Val Gly Trp Ile Asp Ala Gly Asp Trp Leu Ser Tyr Ala Thr Val
225 230 235 240
Asn Ile Pro Ser Thr Gln Val Tyr Asn Phe Glu Tyr Arg Val Ala Ser
245 250 255
Pro Asn Gly Gly Ser Phe Asn Leu Gln Gly Ser Ala Gly Ala Glu Asn
260 265 270
Phe Asp Thr Ala Thr Leu Pro Asn Thr Gly Gly Trp Gln Asn Trp Thr
275 280 285
Thr Val Thr Gly Ser Ala Leu Leu Pro Ala Gly Asn Val Asn Phe Gly
290 295 300
Ile Ser Ala Ile Thr Gly Gly Trp Asn Ile Asn Trp Phe Lys Ala Thr
305 310 315 320
Pro Glu Ser Cys Asp Asp Ile Asn Pro Pro Ser Thr Gly Ile Thr Ala
325 330 335
Lys Gln Ala Ala Ala Ala Met Gly Lys Gly Phe Asn Leu Gly Gln Met
340 345 350
Phe Glu Ser Thr Gln His Pro Arg Thr Phe Asn Ala Ala Lys Ser Lys
355 360 365
Ile Asp Ala Tyr Tyr Asn Met Gly Tyr Arg Asn Val Arg Ile Pro Ile
370 375 380
Thr Trp Thr Glu Ala Val Gly Gly Asn Arg Leu Val Ala Asp Ala Asn
385 390 395 400
Val Gly Ala Val Asn Arg Asn His Ser Arg Leu Ala Val Ile Thr Gln
405 410 415
Val Val Asp Tyr Ala Leu Ser Leu Pro Gly Met Tyr Val Val Ile Asn
420 425 430
Ala His His Glu Gly Gly Leu Lys Thr Asn Asn Arg Trp Trp Val Leu
435 440 445
Glu Thr Leu Trp Ala Asp Ile Ala Asp Ile Phe Lys Asp Arg Asp His
450 455 460
Arg Leu Leu Phe Glu Ile Leu Asn Glu Pro His Leu Ser Asp Ala Asn
465 470 475 480
Lys Ser Pro Met Pro Pro Ala Asn Leu Arg Phe Met Thr Gly Lys Ala
485 490 495
Tyr Asn Lys Ile Arg Ala Ile Asp Ala Gln Arg Ile Val Ile Ile Gly
500 505 510
Gly Asn Gln Trp Phe Gly Ala Gly Glu Met Ala Asn Val Trp Pro Asn
515 520 525
Leu Asn Asp Val Gly Gly Gly Ser Asp Ala Tyr Val Met Ala Thr Phe
530 535 540
His His Tyr Asp Pro Trp Ser Phe Ser Gly Asp Asn Gln Gly Asp Tyr
545 550 555 560
Ala Asp Ala Trp Thr Leu Ser Asn Val Gly Asn Pro Met Asp Ile Met
565 570 575
Gln Ser Trp Ala Asn Gly Val Gly Gln Gly Met Pro Val Tyr Ile Gly
580 585 590
Glu Trp Gly Val Gly Trp Gly Ser Arg Tyr Ser Ala Met Gln Cys Asn
595 600 605
Asn Ile Arg Tyr Trp Tyr Gln Leu Phe Asp Ala Ser Tyr Ala Ser Ala
610 615 620
Lys Gly Gln Pro Thr Ala Val Trp Asp Asp Gly Gly Trp Phe Lys Ile
625 630 635 640
Phe Asp His Gly Thr Asn Ser Phe Asn Asn Asn Leu Ala Gln Cys Ile
645 650 655
Gly Gly Asn Cys Ala Trp Asp Gly Ala Asp Arg Phe Asn Ser Gly Cys
660 665 670
Asn
<210> 12
<211> 2022
<212> DNA
<213> Microbulbifer degradans
<400> 12
atgttgattg gtactgttac ggcttcagca ctggttggtc gaggccgtgg cacccctaaa 60
aaaataatca acaagggttc tattatgtgg caaatcaaca aatcggcttt agcggccgtg 120
gtattagtgt gttcctcatc tagctttgcg caatctgcat gtgacactca acgcattgaa 180
gccgaaaatt acgtggcaat gagtggtatt caaaccgaaa gcacggcaga cactggtggc 240
ggtttaaatg tgggctggat agacgccggc gactggctta gttaccaagt taacctacct 300
gctgcagggc agtacgaggt gcgctatcgc gttgccagta gaaatggcgg cggtgtactt 360
cggttagagg gcaatgccgg tcaaaccttg tatggaacta tgaatgtacc caacacgggt 420
ggctggcaaa attggcaaac cctttctcat tcagtgacat tagcggcagg agagcagtct 480
attggtattg gtgtgccaag cggcgggttt aatattaatt ggctggagtt cgtaccttta 540
gattgcagtg ggccaatcga cccgcccatt aacccacctt cgaactgcgc gagcattgta 600
ttcgaggccg aaaattacga tcaaatgagc ggcattagaa cgcaaaccac aagtgatacc 660
ggaggcggct taaatgtggg gtggatagat gctggcgact ggcttagcta tgccactgtg 720
aatatcccca gcacgcaggt gtacaatttt gaataccgtg tggctagccc taatggcggc 780
agttttaatt tgcagggttc ggctggcgca gagaattttg ataccgctac tttgcccaat 840
acgggtggtt ggcaaaattg gacaacggta acaggctcgg cgcttttacc tgctggcaat 900
gtgaatttcg gtattagtgc gattactggt ggctggaata taaactggtt taaagctaca 960
ccagagagct gtgatgatat aaaccctcca agtaccggta ttactgctaa gcaagcagcg 1020
gcagccatgg gcaaggggtt taatttgggg caaatgttcg aaagtacgca acacccaaga 1080
acatttaatg ctgcaaaaag taaaatagat gcttactaca atatgggcta cagaaatgtg 1140
cgcatcccta ttacttggac tgaagccgta ggcggaaaca ggcttgttgc agatgcaaat 1200
gtaggcgcag tcaatcgcaa ccactctcgc ttagctgtaa ttactcaagt agtagattac 1260
gcgctttcgc tacccggcat gtacgtggtt attaatgcgc atcacgaagg tggattaaaa 1320
accaataatc gctggtgggt gttagaaact ctgtgggcag atattgccga tatatttaaa 1380
gacagagatc accgtttgct atttgaaata ttaaacgagc cacacctaag cgatgccaat 1440
aagtcgccta tgccccccgc caatttgcgt tttatgacgg gcaaagccta taacaaaatt 1500
cgcgcgatag atgcgcagcg aatcgttatt attggtggca accagtggtt tggtgcaggt 1560
gaaatggcaa acgtatggcc aaaccttaat gatgttggcg gcggttccga tgcatatgta 1620
atggctactt ttcaccatta cgacccgtgg tcgtttagtg gcgataacca aggcgattac 1680
gccgatgctt ggacgctatc taacgtgggt aacccaatgg atataatgca aagctgggca 1740
aacggcgtag gccaaggtat gcctgtgtat attggcgagt ggggcgtagg ttggggcagc 1800
cgctacagcg ccatgcagtg caataatatt cgctattggt accagctgtt cgacgcgagc 1860
tatgcctcgg caaaaggcca gcctacggca gtgtgggatg acggcggttg gtttaaaata 1920
ttcgaccacg gtaccaacag cttcaataat aatttagccc aatgtattgg tggaaactgc 1980
gcttgggatg gcgccgatag atttaattct ggctgtaatt aa 2022
<210> 13
<211> 365
<212> PRT
<213> Microbulbifer degradans
<400> 13
Met Arg Thr Thr Lys Phe Leu Ala Leu Ala Leu Cys Leu Leu Ala Ser
1 5 10 15
Ala Ser Ala Leu Ser Ala Asn Asn Ser Ala Pro Ser Asn Asp Trp Trp
20 25 30
Asp Ile Pro Tyr Pro Ser Gln Phe Asp Val Lys Ser Leu Lys Thr Gln
35 40 45
Ser Phe Ile Ser Val Lys Gly Asn Lys Phe Ile Asp Asp Lys Gly Lys
50 55 60
Thr Phe Thr Phe Arg Gly Val Asn Ile Ala Asp Thr Gly Lys Leu Leu
65 70 75 80
Ser Gln Asn Gln Trp Gln Lys Ser Leu Phe Glu Glu Leu Ala Asn Asn
85 90 95
Trp Gly Val Asn Thr Ile Arg Leu Pro Ile His Pro Val Ser Trp Arg
100 105 110
Lys Leu Gly Pro Asp Val Tyr Leu Gly His Ile Asp Glu Ala Val Arg
115 120 125
Trp Ala Asn Asp Leu Gly Ile Tyr Leu Ile Leu Asp Trp His Ser Ile
130 135 140
Gly Tyr Leu Pro Thr Glu Gln Tyr Gln His Pro Met Tyr Asp Thr Thr
145 150 155 160
Ile Lys Glu Thr Arg Asp Phe Trp Arg Arg Ile Thr Phe Arg Tyr Lys
165 170 175
Asn Val Pro Thr Val Ala Val Tyr Glu Leu Phe Asn Glu Pro Thr Thr
180 185 190
Met Gly Asn Thr Leu Gly Glu Arg Asn Trp Ala Glu Trp Lys Thr Leu
195 200 205
Asn Glu Ser Leu Ile Asp Met Ile Tyr Ala Ser Asp Lys Thr Val Ile
210 215 220
Pro Leu Val Ala Gly Phe Asn Trp Ala Tyr Asp Leu Ser Pro Ile Lys
225 230 235 240
Lys Ala Pro Ile Glu Arg Glu Gly Ile Ala Tyr Ala Ala His Pro Tyr
245 250 255
Pro Gln Lys Ala Lys Pro Glu Val Lys Asn Asp Lys Asn Phe Phe Lys
260 265 270
Leu Trp Asp Glu Lys Trp Gly Phe Ala Ala Asp Thr Tyr Pro Val Ile
275 280 285
Ala Thr Glu Leu Gly Trp Val Gln Pro Asp Gly Tyr Gly Ala His Ile
290 295 300
Pro Val Lys Asp Asp Gly Ser Tyr Gly Pro Arg Ile Val Lys Tyr Met
305 310 315 320
Gln Lys Lys Gly Val Ser Tyr Thr Val Trp Val Phe Asp Pro Asp Trp
325 330 335
Ser Pro Thr Met Ile Asn Asp Trp Asp Phe Thr Pro Ser Glu Gln Gly
340 345 350
Ala Phe Phe Lys Gln Val Met Leu Glu Ala Lys Lys Arg
355 360 365
<210> 14
<211> 1098
<212> DNA
<213> Microbulbifer degradans
<400> 14
atgcgcacaa ccaaatttct tgcgcttgca ctctgcttgc tggcctcagc cagtgcactg 60
agtgcaaata acagcgcccc atcaaacgac tggtgggata taccctaccc gagccaattc 120
gatgtaaaaa gccttaaaac gcaaagtttt atatcggtaa aaggtaacaa gttcattgat 180
gataagggca aaaccttcac ttttagaggg gtaaacattg ccgatacagg taagctactt 240
agccaaaatc aatggcaaaa atcgctgttt gaagagctgg ctaataactg gggggtaaat 300
actattcgcc tgcctattca ccctgtaagt tggcgtaaac ttgggccaga cgtttattta 360
ggccacatcg atgaggcggt acgctgggcg aatgatttag gtatttacct tattcttgat 420
tggcactcca ttggctattt gcccaccgag caataccaac accccatgta cgacaccacc 480
attaaagaaa cccgcgactt ttggcgcaga attacgttcc gctacaaaaa cgtgcccacc 540
gtagcggtat acgaattatt taatgagcca accaccatgg gtaacaccct aggcgaacgc 600
aactgggccg agtggaaaac cttaaatgaa agcctaattg atatgatata tgccagtgac 660
aaaaccgtca ttccgctggt tgcaggcttc aactgggcct atgatttatc gccaatcaaa 720
aaggcaccta tcgagcgtga aggcattgct tacgccgcac acccctaccc gcaaaaggcg 780
aaaccagagg ttaagaacga taaaaacttc ttcaaactgt gggacgaaaa gtggggcttt 840
gctgcagaca cctaccctgt aatagcaaca gagctaggct gggtacaacc cgatggttat 900
ggtgcccaca tacccgttaa agacgacggc agttacggcc cccgcatagt gaagtatatg 960
cagaaaaaag gcgtttctta cacggtatgg gtattcgacc ccgactggag cccaacaatg 1020
attaacgact gggattttac ccccagcgag caaggcgcgt tttttaaaca ggttatgcta 1080
gaagctaaaa aacgctaa 1098
<210> 15
<211> 638
<212> PRT
<213> Microbulbifer degradans
<400> 15
Met Thr Phe Thr Arg Met Lys Ser Ser His Gln Gly Ala Cys Arg Pro
1 5 10 15
Arg Ser Ser Thr Leu Gln Arg Leu Ile Ala Ser Ser Leu Thr Thr Ala
20 25 30
Cys Leu Leu Ala Ala Ser Thr Phe Ala Asp Val Ala Pro Leu Thr Val
35 40 45
Asp Gly Asn Arg Ile Leu Ser Gly Gly Gln Glu Ala Ser Phe Ala Gly
50 55 60
Asn Ser Leu Phe Trp Ser Asn Asn Tyr Trp Gly Gly Glu Lys Tyr Tyr
65 70 75 80
Thr Ala Glu Thr Val Asn Trp Leu Lys Gln Asp Trp Gly Ala Thr Leu
85 90 95
Val Arg Ala Ala Met Gly Val Glu Asp Asn Gly Gly Tyr Leu Asp Asp
100 105 110
Lys Glu Gly Asn Lys Gln Lys Val Lys Thr Val Val Asp Ala Ala Ile
115 120 125
Ala Asn Asp Met Tyr Val Ile Ile Asp Trp His Ser His His Ala Glu
130 135 140
Asp His Lys Ser Glu Ala Ile Ala Phe Phe Glu Asp Met Ala Arg Thr
145 150 155 160
Tyr Gly Asn Lys Lys His Val Ile Tyr Glu Ile Tyr Asn Glu Pro Leu
165 170 175
Gln Ile Ser Trp Ser Asn Thr Ile Lys Pro Tyr Ala Glu Asp Val Ile
180 185 190
Arg Ala Ile Arg Ala Ile Asp Pro Asp Asn Leu Ile Ile Val Gly Thr
195 200 205
Pro Thr Trp Ser Gln Asp Val Asp Val Ala Ser Gln Asp Pro Ile Thr
210 215 220
Gly Tyr Ala Asn Ile Ala Tyr Thr Leu His Phe Tyr Ala Gly Thr His
225 230 235 240
Lys Gln Ser Leu Arg Asp Lys Ala Gln Thr Ala Leu Asn Asn Gly Ile
245 250 255
Ala Leu Phe Ala Thr Glu Trp Gly Thr Val Asn Ala Asn Gly Asp Gly
260 265 270
Ala Val Asn Thr Thr Glu Thr Asp Lys Trp Met Thr Phe Phe Lys Thr
275 280 285
Asn His Ile Ser His Ala Asn Trp Ala Leu Asn Asp Lys Ser Glu Gly
290 295 300
Ala Ser Ala Leu Asn Pro Gly Ala Ser Pro Asn Gly Asn Trp Ser Asn
305 310 315 320
Ala Asp Leu Thr Thr Ser Gly Lys Tyr Val Lys Asn Ile Ile Lys Asn
325 330 335
Trp Asn Asp Gly Thr Pro Gly Gly Ser Ser Ser Ser Ser Ser Gly Gly
340 345 350
Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Asn Ser Ser Ser Gly
355 360 365
Ala Gly Lys Val Asn Leu Pro Ala Arg Ile Glu Ala Glu Asn Tyr Asn
370 375 380
Ser Ala Pro Val Glu Thr Thr Ala Gly Asn Ser Gly Gly Ser Val Ser
385 390 395 400
Gln Cys Thr Tyr Arg Gly Leu Asn Val Asp Val Gln Asp Ala Ser Glu
405 410 415
Gly Thr Cys Asn Ile Gly Trp Thr Ala Ala Gly Glu Lys Val Thr Tyr
420 425 430
Asn Ile Gly Thr Ala Asn Asn Thr Tyr Asn Ile Ala Leu Arg Thr Ala
435 440 445
Ser Leu Asp Ala Gly Lys Arg Val Ser Val Tyr Val Gly Asn Thr Leu
450 455 460
Ala Asp Thr Ile Ser Thr Gln Gly Gly Gly Trp Gln Asn Trp Lys Thr
465 470 475 480
Gln Thr Ile Pro Asn Val Tyr Ile Pro Ser Asn Ser Val Ile Thr Val
485 490 495
Glu Phe Tyr Asp Gly Arg Thr Asn Leu Asn Tyr Leu Asn Ile Ser Ala
500 505 510
Ala Ser Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser
515 520 525
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Gly Gly Ser Cys
530 535 540
Ser Ser Tyr Ile Asp Ile Pro Trp Asn Thr Arg Thr Glu Val Thr Leu
545 550 555 560
Thr Ser Gly Ala Cys Val Arg Phe Asn Gln Asn Leu Ser Gly Lys Thr
565 570 575
Leu Gln Val Trp Asp Ser Asp Ala Asn Ser Ser Cys Asp Phe Arg Gly
580 585 590
Thr Val Thr Thr Val Gly Gly Thr Gly Ser Leu Asn Val Ser Ser Asn
595 600 605
Tyr Val Ser Ser Lys Ser Leu Thr Gly Thr Lys Leu Thr Phe Asn Ser
610 615 620
Ala Ser Asn Asn Asn Cys Lys Tyr Val Lys Val Arg Ala Tyr
625 630 635
<210> 16
<211> 1917
<212> DNA
<213> Microbulbifer degradans
<400> 16
atgactttca caagaatgaa atcatcacac caaggcgcgt gtcgaccaag gtcttccacc 60
ctacagcgac taatcgcctc atcacttacc accgcatgtt tgctagcagc gtctactttt 120
gccgacgtag cgccgttaac cgtagatggc aaccgcattc tcagcggtgg ccaagaggct 180
agctttgccg gtaacagttt gttttggagc aacaattatt ggggcggtga gaaatactac 240
acagccgaaa ctgttaactg gttaaaacaa gactggggcg caacactagt gcgcgcggcc 300
atgggtgtag aagataacgg cggctaccta gatgacaaag aaggcaacaa acaaaaggta 360
aaaaccgttg tagatgctgc tattgccaac gacatgtatg taattatcga ttggcacagc 420
caccacgccg aagaccacaa aagtgaagcc attgcttttt ttgaggatat ggcgcgcacc 480
tacggcaata aaaaacacgt tatttacgaa atttataacg agcctttaca aatttcgtgg 540
agcaacacaa ttaaacccta cgccgaagat gtaattagag ctattcgcgc gatagacccc 600
gacaacttaa ttattgttgg tacgccaacg tggtcgcaag atgtagacgt agcatcgcaa 660
gaccccatta ccggctacgc caatattgcc tacacattgc acttttacgc aggcacccac 720
aaacaatctt tacgagacaa agcgcaaacc gcacttaaca acggcatagc gcttttcgca 780
acagagtggg gaacagtaaa tgcaaacggt gatggcgctg taaacaccac cgaaacagac 840
aagtggatga cgttctttaa aaccaaccac ataagccacg caaactgggc gctaaacgac 900
aaatcagaag gcgcttctgc attaaacccc ggagccagcc ccaatggcaa ctggagcaac 960
gccgacttaa ccacatcggg taagtacgta aaaaacatta tcaaaaactg gaacgacggc 1020
acgccgggag gcagctcttc aagctcgtcc ggcggctcaa ccagttcctc ctcaagctca 1080
tctagctcta attccagctc tggtgctggc aaagtaaatt tacccgcacg cattgaagcc 1140
gaaaactata acagtgcacc ggtagaaaca actgcaggca atagtggcgg cagcgtttca 1200
caatgtacat acagagggct aaatgtagac gtacaagacg caagcgaagg cacttgtaat 1260
attggctgga cagcagcagg cgaaaaagtt acctacaaca taggcacagc aaataatact 1320
tacaatattg cacttcgcac cgcatcgctt gatgcaggca agcgcgtatc ggtatatgta 1380
ggcaacaccc tcgccgacac aataagcacc caaggtggcg gctggcaaaa ttggaagacg 1440
caaaccatcc ccaatgtata tattccatca aactcagtta ttaccgtgga attctacgat 1500
ggccgcacca accttaacta cttaaacatt agtgcagctt cggggtcttc ctcttcaagc 1560
tcctcatcta gctcgtcaac gtctagctct tcttcgagct catcttctag ctcttcaggt 1620
ggtggcagtt gtagcagcta tatagatata ccttggaata ctcgcaccga agttacccta 1680
acaagtggcg cctgcgttcg ctttaaccaa aacctttcgg gcaaaaccct acaagtgtgg 1740
gatagcgatg caaactcatc gtgcgatttc cggggcacag ttacaacagt aggcggcact 1800
ggcagtttaa atgtaagcag caactatgtt tcgtctaaga gcctaacagg aaccaaactt 1860
acatttaatt cagcaagtaa taacaattgt aagtacgtta aagttcgtgc ttattag 1917
<210> 17
<211> 630
<212> PRT
<213> Microbulbifer degradans
<400> 17
Met Lys Ser Ala Thr Thr Asn Gln Ser Arg Ala Arg Ser Ser Ala Phe
1 5 10 15
Lys Asn Met Leu Ala Ala Ser Leu Ala Gly Leu Gly Leu Leu Ser Ala
20 25 30
Ser Ala Phe Ala Asp Val Ala Pro Leu Thr Val Asp Gly Asn Lys Ile
35 40 45
Leu Ser Gly Gly Gln Gln Ala Ser Phe Ala Gly Asn Ser Leu Phe Trp
50 55 60
Ser Asn Asn Gly Trp Gly Gly Glu Lys Tyr Tyr Thr Ala Gly Thr Val
65 70 75 80
Glu Trp Leu Lys Gln Asp Trp Gly Ser Asn Leu Val Arg Ala Ala Met
85 90 95
Gly Val Asp Glu Asn Gly Gly Tyr Leu Glu Asp Pro Ala Gly Asn Lys
100 105 110
Ala Lys Val Thr Thr Val Val Asp Ala Ala Ile Ala Asn Asp Met Tyr
115 120 125
Val Ile Ile Asp Trp His Ser His His Ala Glu Asp Tyr Gln Asn Gln
130 135 140
Ala Ile Ser Phe Phe Gln Asp Met Ala Arg Thr Tyr Gly Asn Asn Asn
145 150 155 160
Asn Val Ile Tyr Glu Ile Tyr Asn Glu Pro Leu Gln Val Ser Trp Ser
165 170 175
Gly Thr Ile Lys Pro Tyr Ala Glu Ala Val Ile Gly Ala Ile Arg Ala
180 185 190
Ile Asp Pro Asp Asn Leu Ile Ile Val Gly Thr Pro Thr Trp Ser Gln
195 200 205
Asp Val Asp Val Ala Ser Arg Asp Pro Ile Thr Gln Tyr Ser Asn Ile
210 215 220
Ala Tyr Thr Ile His Phe Tyr Ala Gly Thr His Lys Gln Ser Leu Arg
225 230 235 240
Asp Lys Ala Gln Thr Ala Leu Asn Asn Gly Ile Ala Leu Phe Ala Thr
245 250 255
Glu Trp Gly Thr Val Asn Ala Asn Gly Asp Gly Gly Val Asp Ala Ala
260 265 270
Glu Thr Asp Arg Trp Met Gln Phe Phe Lys Ala Asn His Ile Ser His
275 280 285
Ala Asn Trp Ala Leu Asn Asp Lys Ala Glu Gly Ser Ser Ala Leu Lys
290 295 300
Pro Gly Ser Asn Ala Asn Gly Gly Trp Ser Asn Ser Asp Leu Thr Ala
305 310 315 320
Ser Gly Thr Tyr Val Lys Asn Leu Ile Lys Thr Trp Asn Asp Gly Ser
325 330 335
Pro Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser
340 345 350
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly
355 360 365
Gly Thr Asn Leu Pro Ala Arg Ile Glu Ala Glu Asn Tyr Asp Ser Ala
370 375 380
Pro Val Glu Thr Thr Ala Gly Asn Ser Gly Ser Pro Thr Asn Cys Ser
385 390 395 400
Tyr Lys Gly Met Gly Val Asp Val Glu Asn Ser Thr Glu Gly Ala Cys
405 410 415
Asn Ile Gly Trp Thr Ala Ala Gly Glu Lys Val Thr Tyr Asn Ile Gly
420 425 430
Asn Ala Asp Gly Thr Tyr Asp Ile Ala Leu Arg Val Ala Ser Met Asp
435 440 445
Ala Gly Lys Arg Ile Ser Val His Val Asn Asn Ser Leu Ala Asp Thr
450 455 460
Val Thr Thr Gln Gly Gly Gly Trp Gln Ala Trp Thr Thr Glu Thr Ile
465 470 475 480
Ser Asn Val Tyr Ile Pro Ser Asn Ser Val Ile Thr Val Glu Phe Tyr
485 490 495
Asp Ser Gly Ser Asn Leu Asn Phe Leu Asn Ile Thr Glu Ser Ser Gly
500 505 510
Thr Glu Pro Pro Val Glu Pro Pro Val Glu Pro Pro Val Glu Pro Pro
515 520 525
Val Asp Asn Gly Asn Phe Pro Cys Asn Asp Gly Asn Ser Thr Leu Ala
530 535 540
Asn Asn Gly Ala Ser Ile Asn Leu Asn Gln Gly Ala Cys Val Lys Tyr
545 550 555 560
Asn His Gly Trp Gly Asp Ile Arg Leu Gly Thr Trp Ser Gly Asn Gly
565 570 575
Thr Ile Arg Tyr Asp Val Leu Asp Cys Asn Asn Asn Val Met Ser Asp
580 585 590
Ile Ala Gln Lys Leu Asn Asp Phe Thr Ala Val Asp Thr Ala Thr Met
595 600 605
Asn Cys Ala His Tyr Ile Tyr Val Lys Gln Ala Pro Ser Ser Tyr Thr
610 615 620
Leu Gln Phe Gly Ser Trp
625 630
<210> 18
<211> 1893
<212> DNA
<213> Microbulbifer degradans
<400> 18
atgaaatcag caaccacaaa tcaatcgagg gcacgcagta gcgcctttaa aaatatgttg 60
gcggcatcgc tcgcaggttt agggctacta tcagcttctg catttgccga tgtagccccg 120
ctaaccgtag acggcaataa aattcttagc ggtggccagc aagccagttt tgccggtaat 180
agcttatttt ggtctaacaa tggctggggc ggtgagaagt attacacggc cggtaccgtt 240
gaatggctaa agcaagactg gggcagtaat ttagttcgcg ccgcaatggg tgtcgatgaa 300
aacggcggct acttagaaga cccagcagga aacaaagcga aagtaacaac cgttgtagat 360
gcagccatcg ctaacgatat gtatgtaatt atcgattggc acagccacca cgccgaagac 420
taccaaaacc aagccattag ctttttccaa gatatggctc gcacctacgg taacaacaac 480
aacgttatat acgaaattta taacgagcca ttacaggttt cttggagcgg caccatcaag 540
ccttacgcag aagcggtaat tggcgcaatt cgcgcaatcg acccagataa ccttattatt 600
gtgggcacgc ctacttggtc gcaggatgta gacgtagcct cgcgcgaccc catcacgcag 660
tacagcaaca ttgcctacac tattcacttt tatgcgggca cccacaaaca atccctacgc 720
gataaagcac aaaccgcatt aaataatggt attgctttgt ttgctaccga atggggtaca 780
gtaaatgcca acggtgacgg cggtgtagac gcagccgaaa ctgatcgttg gatgcagttt 840
tttaaagcga atcatataag ccatgccaac tgggccttaa acgataaagc cgaaggctct 900
tctgcattaa agcctggctc taacgcaaac ggcggctgga gcaattccga cttaaccgcc 960
tctggtacct atgttaaaaa cttaattaaa acatggaacg acggctcacc gagcagcagc 1020
tcatctagca gcaccagttc ttcttcaagc agctcctcgt ctagtagctc atcatctagc 1080
agctcttcat ctagtagttc tggcggtacc aatttacccg cgcgcattga agcagaaaac 1140
tacgatagcg caccggtaga aaccactgca ggtaatagcg gctcacccac caattgttcg 1200
tataaaggta tgggcgtaga tgtagaaaac tctactgaag gtgcttgtaa tattggctgg 1260
actgcggcag gcgaaaaagt aacttacaac attggcaatg ccgatggcac ttacgatatt 1320
gcattgcgcg tagcctctat ggatgcgggc aaacgtatct ctgtgcatgt aaacaacagc 1380
ctagcagata ccgtaaccac acaaggtggc ggctggcagg catggactac cgaaaccatt 1440
tctaacgtgt atatcccatc aaactcggta attaccgttg agttttacga tagtggctct 1500
aacctaaact ttttaaacat taccgaaagc tcgggtaccg aaccacctgt agaaccaccc 1560
gttgagccgc cagtagaacc acccgtagac aacggtaact tcccatgtaa cgacggtaac 1620
tctacgcttg ccaacaacgg cgcctccatt aaccttaacc aaggagcgtg tgttaaatac 1680
aatcacggct ggggcgatat tcgtttaggc acctggagcg gcaacggtac cattcgatac 1740
gacgtactag actgcaataa caacgtaatg agtgatattg cacaaaaact taatgacttt 1800
actgctgtag acaccgcaac aatgaactgc gcacactaca tttatgtaaa acaagcccct 1860
agcagctaca ccctgcaatt tggtagctgg tag 1893
<210> 19
<211> 725
<212> PRT
<213> Microbulbifer degradans
<400> 19
Met Lys Ile Asn Thr Leu Phe Thr Pro Leu Arg Thr Val Gly Ala Ala
1 5 10 15
Val Ala Ile Ala Leu Ser Pro Val Ala Phe Ala Asp Val Thr Cys Glu
20 25 30
Val Thr Asn Phe Asn Gln Trp Asn Ser Gly Tyr Gln Ala Asp Val Arg
35 40 45
Val Thr Asn Ser Gly Ser Ala Val Ser Gly Trp Thr Val Asn Leu Asn
50 55 60
Phe Ala Ser Ala Pro Gln Met Thr Asn Gly Trp Asn Ala Ala Leu Ser
65 70 75 80
Thr Ser Gly Asn Thr Ile Ser Ala Ser Asn Ile Ser Trp Asn Gly Asn
85 90 95
Leu Gly Asn Gly Gln Ser Thr Ser Phe Gly Phe Gln Gly Asn Ser Asn
100 105 110
Gly Asn Leu Ala Thr Pro Thr Cys Val Gly Ser Gly Thr Gly Ser Ser
115 120 125
Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Thr
130 135 140
Ser Ser Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Gly
145 150 155 160
Gly Glu Cys Val Glu Met Cys Lys Trp Tyr Gln Asp Ala Pro Arg Pro
165 170 175
Leu Cys Asn Asn Gln Asp Ser Gly Trp Gly Trp Glu Asn Asn Gln Ser
180 185 190
Cys Ile Gly Arg Thr Thr Cys Asn Ser Gln Ser Gly Asn Gly Gly Val
195 200 205
Ile Asn Ser Cys Pro Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr
210 215 220
Ser Ser Thr Ser Ser Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Thr
225 230 235 240
Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser
245 250 255
Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Ser Ser Gly Gly Gly Val
260 265 270
Phe Arg Val Asp Ala Thr Gly Asn Ile Thr Lys Asn Gly Glu Val Leu
275 280 285
Pro Val Arg Cys Gly Asn Trp Phe Gly Leu Glu Gly Gln His Glu Pro
290 295 300
Ser Asp Ala Gln Asn Asn Pro Gly Gly Ala Pro Leu Glu Leu Tyr Val
305 310 315 320
Gly Asn Met Trp Trp Val Asp Ser Gly Arg Thr Ile Gln Gln Thr Met
325 330 335
Ser Glu Ile Thr Ala Gln Gly Ile Asn Met Val Arg Leu Pro Ile Ala
340 345 350
Pro Gln Thr Leu Asn Pro Asn Asp Pro Gln Gly Val Gly Asp Val Arg
355 360 365
Asn Gly Gly Val Leu Lys Asn His Glu Ser Val Gln Gln Thr Asn Ala
370 375 380
Arg Gln Ala Leu Glu Asp Phe Ile Val Gln Ala Asn Glu Asn Asp Ile
385 390 395 400
Gln Val Leu Ile Asp Ile His Ser Cys Ser Asn Tyr Val Gly Trp Arg
405 410 415
Ala Gly Arg Leu Asp Ala Glu Pro Pro Tyr Val Asp Ala Thr Arg Val
420 425 430
Gly Tyr Asp Phe Thr Arg Glu Asp Tyr Ser Cys Gly Thr Asn Val Gly
435 440 445
Pro Gly Val Thr Val His Glu Tyr Asn Glu Glu Ile Trp Leu Asn Asn
450 455 460
Leu Arg Glu Ile Ala Gly Leu Ser Glu Ser Leu Gly Val Asp Asn Ile
465 470 475 480
Ile Gly Ile Asp Ile Phe Asn Glu Pro Trp Asp Tyr Thr Trp Glu Glu
485 490 495
Trp Lys Ala Leu Ser Glu Ser Ala Tyr Gln Ala Ile Ser Glu Val Asn
500 505 510
Pro Asp Ile Leu Ile Phe Val Glu Gly Val Ala Gly Gly Thr Gly Ala
515 520 525
Gly Val Asp Val Pro His Gly Asp Glu Ser Ser Asn Pro Asn Trp Gly
530 535 540
Glu Asn Phe Tyr Pro Ala Gln Thr Ala Pro Leu Asn Ile Pro Lys Asp
545 550 555 560
Arg Leu Val Ile Ser Pro His Thr Tyr Gly Pro Ser Val Phe Val Gln
565 570 575
Arg Gln Phe Met Asp Pro Asn Asp Pro Glu Cys Val Gly Leu Glu Gly
580 585 590
Asp Glu Ala Ala Glu Ala Gly Cys Gln Ile Val Ile Asp Tyr Ala Thr
595 600 605
Leu Ala Ala Gly Trp Asp Glu His Phe Gly Phe Leu Arg Glu Gln Gly
610 615 620
Phe Ala Met Val Val Gly Glu Phe Gly Gly Asn Met Asp Trp Pro Asn
625 630 635 640
Gly Thr Arg Gln Ala Glu Lys Asp Met Trp Ser His Ile Thr Pro Gly
645 650 655
Ile Asp Arg Gln Trp Gln Glu Ala Phe Val Asp Tyr Met Val Glu Lys
660 665 670
Asn Ile Gln Ala Cys Tyr Trp Ser Ile Asn Pro Glu Ser Gly Asp Thr
675 680 685
Gly Gly Trp Tyr Gly His Glu Tyr Asp Pro Val Ser Asn Asp Ala Gly
690 695 700
Trp Gly Arg Trp Leu Asp Phe Asp Ser Arg Lys Thr Asn Leu Leu Lys
705 710 715 720
Glu Leu Trp Gly Ile
725
<210> 20
<211> 71
<212> PRT
<213> Microbulbifer degradans
<400> 20
Met Leu Glu Val Glu Leu Leu Leu Val Glu Leu Val Glu Leu Asp Glu
1 5 10 15
Val Leu Glu Val Leu Leu Val Glu Leu Asp Glu Val Leu Glu Val Leu
20 25 30
Asp Glu Leu Val Asp Glu Val Leu Glu Glu Leu Glu Glu Leu Glu Glu
35 40 45
Leu Gly Gln Leu Leu Ile Thr Pro Pro Leu Pro Asp Trp Leu Leu Gln
50 55 60
Val Val Arg Pro Ile Gln Leu
65 70
<210> 21
<211> 2178
<212> DNA
<213> Microbulbifer degradans
<400> 21
atgaaaatca acactctctt tacgcctttg cgtactgtgg gtgctgcagt tgcgatagct 60
ttatcgcctg tagcctttgc agacgtaacg tgcgaagtaa cgaactttaa ccagtggaat 120
agtggctacc aagccgatgt tcgtgttaca aacagcggta gcgctgttag tggctggacc 180
gtaaatttaa attttgcctc agccccgcaa atgacaaatg gctggaacgc agctttgagt 240
actagcggca atacaattag tgcatctaat attagttgga atggcaattt gggtaatggt 300
cagtccacca gctttggttt tcagggcaat tcaaatggta acttggcaac gccaacgtgt 360
gtaggtagcg gtacggggtc ttctagcagc tcttcatcca gctctacttc tagcacaagc 420
tcatcatcta caagttcttc tagcacgtct tctactagct ctagcagttc atcctctggt 480
ggtgaatgtg tagaaatgtg taagtggtat caagatgcac cgcgcccatt atgtaataat 540
caagacagtg gttggggttg ggaaaacaat caaagctgta ttggtcgcac tacttgtaac 600
agccaatctg gcaatggtgg tgtaattaat agttgcccaa gttcttcaag ttcttcaagt 660
tcttctagca cttcgtctac cagctcatct agtacttcaa gtacttcatc gagctcaaca 720
agtagtactt caagcacttc atcaagttcc acaagctcta ctagcagcag ctcaacctct 780
agcactagct cgtcgtcttc aagtggtggt ggagtattcc gcgtagatgc taccggtaat 840
attactaaaa atggtgaagt actgcctgtt cgttgtggta actggtttgg tctagagggc 900
cagcacgagc cttcagatgc gcaaaataac ccaggcggtg cgccgcttga attatatgtt 960
ggcaacatgt ggtgggtaga tagtggccgc actattcagc aaaccatgag cgaaattacc 1020
gcccaaggta tcaacatggt tcgcttgcct attgcaccgc aaacattaaa ccctaacgac 1080
cctcaaggtg tgggtgatgt gcgcaacggc ggcgtgctta aaaatcacga atctgtgcag 1140
caaaccaatg cacgtcaagc gttagaagac ttcattgttc aagctaacga aaatgacatt 1200
caagtgctaa ttgatattca ctcttgtagt aactacgtgg gttggcgtgc aggccgttta 1260
gatgcagagc ctccttatgt ggatgcaacg cgagtgggtt atgactttac ccgtgaagat 1320
tattcttgtg gcaccaatgt gggcccaggt gtaactgtgc acgagtacaa cgaggaaatt 1380
tggttaaaca acttgcgtga gattgctggt ttatctgaat ccttgggcgt tgataatatt 1440
atcggtatcg atatttttaa cgaaccatgg gattacactt gggaagagtg gaaagcactt 1500
tctgaaagcg cttatcaagc cattagcgaa gttaacccag atattctaat ctttgttgag 1560
ggtgttgcag gcggcacggg tgctggtgtt gatgtgccac atggagacga gtcttctaac 1620
cctaactggg gcgaaaactt ttatcctgcg caaactgctc cgcttaatat tccaaaagat 1680
cgtctagtta tttcaccgca tacctatggc ccatctgtat ttgttcagcg tcaatttatg 1740
gacccgaatg atccagagtg tgttggttta gaaggtgatg aggcggctga agctggctgt 1800
caaattgtta tcgattatgc aaccttagca gctggttggg atgagcattt cggcttctta 1860
cgtgagcaag gctttgccat ggtagtgggt gagtttggtg gcaacatgga ttggccaaat 1920
ggcacgcgcc aagcagaaaa agatatgtgg agccacatca cccctggaat cgacagacag 1980
tggcaagaag cgtttgttga ctacatggtt gagaaaaaca tccaagcttg ttactggtca 2040
attaacccag agtctggcga cactggcggt tggtatggtc acgagtacga ccctgtttct 2100
aacgatgcag gttgggggcg ttggttagac ttcgattctc gcaaaactaa cttacttaaa 2160
gagctttggg gtatttaa 2178
<210> 22
<211> 610
<212> PRT
<213> Microbulbifer degradans
<400> 22
Met Met Tyr Thr Asn Leu Phe Asn Leu Lys Lys His Leu Phe Gln Thr
1 5 10 15
Ser Leu Lys Leu Leu Ala Cys Ala Thr Leu Ile Gly Gly Thr Leu Asn
20 25 30
Ala Ala Ala Asp Val Pro Ala Met Ser Val Gln Gly Asn Lys Val Leu
35 40 45
Val Gly Gly Glu Val Lys Ser Leu Gly Gly Met Ser Tyr Phe Trp Ser
50 55 60
Asn Asn Gly Trp Gly Gly Glu Lys Tyr Tyr Asn Ala Ser Thr Val Ser
65 70 75 80
Tyr Phe Lys Gln Asp Trp Lys Ala Ser Ile Val Arg Ala Ala Met Gly
85 90 95
Val Glu Asp Ala Gly Gly Tyr Phe Asp Asp Pro Gln Gly Ser Lys Gln
100 105 110
Lys Val Arg Thr Ile Val Asp Ala Ala Ile Ala Asn Asp Met Tyr Val
115 120 125
Ile Ile Asp Trp His Ser His Tyr Ala Asn Thr His Asp Trp Ala Ala
130 135 140
Ala Val Gln Phe Phe Gln Glu Met Ala Arg Asp Tyr Gly Gln Tyr Asn
145 150 155 160
Asn Val Ile Tyr Glu Val Tyr Asn Glu Pro Leu Asp Ile Pro Trp Gly
165 170 175
His Ile Lys Ser Tyr Ala Glu Thr Val Ile Asp Ala Ile Arg Ala Ile
180 185 190
Asp Pro Asp Asn Val Ile Val Val Gly Thr Pro Arg Trp Ser Gln Gly
195 200 205
Val Lys Glu Ala Ser Trp Asp Pro Ile Asn Arg Asn Asn Ile Ala Tyr
210 215 220
Thr Leu His Phe Tyr Ser Gly Ser His Gly Gln Trp Leu Arg Asn Asp
225 230 235 240
Ala Ala Glu Ala Met Ser Asn Gly Ile Ala Leu Phe Val Thr Glu Trp
245 250 255
Gly Ser Val Asn Ala Asn Gly Asp Gly Ala Val Asn Glu Gly Glu Thr
260 265 270
Ala Ala Trp Met Asn Phe Met Arg Asp Asn Gly Ile His His Ala Asn
275 280 285
Trp Ser Val Asn Asp Lys Ala Glu Gly Ala Ser Ala Leu Asn Pro Gly
290 295 300
Ala Ser Ala Thr Gly Gly Trp Gly Asp Gly Asp Leu Thr Trp Ser Gly
305 310 315 320
His Val Val Arg Gly Tyr Leu Arg Asp Trp Asn Gln Ile Gly Ser Gly
325 330 335
Asn Gly Asn Gly Asn Gly Thr Gly Cys Thr Glu Val Ser Leu Pro Gly
340 345 350
Thr Ile Glu Ala Glu Ala Tyr Cys Ala Met Asp Gly Ile Gln Thr Glu
355 360 365
Asn Thr Asn Asp Thr Asn Gly Gly Ser Asn Val Gly Tyr Ile Asp Ala
370 375 380
Gly Asp Trp Met Ser Tyr Ser Val Asn Val Ala Asn Ala Gly Thr Tyr
385 390 395 400
Thr Val Ser Tyr Arg Val Ala Ser Leu Gly Gly Gly Gly Val Leu Ser
405 410 415
Ile Glu Asn Ala Gly Gly Ser Pro Val Tyr Gly Thr Leu Asn Val Pro
420 425 430
Gln Thr Gly Gly Trp Gln Glu Trp Thr Thr Val Ser His Asp Ile Ser
435 440 445
Leu Gln Ala Gly Gln Gln Asn Ile Gly Ile Ala Ala Ile Glu Gly Gly
450 455 460
Phe Asn Ile Asn Trp Ile Ala Leu Thr Pro Ala Gly Thr Asn Pro Asn
465 470 475 480
Pro Val Gln Ser Ile Thr Leu Gln Ala Glu Asp Tyr Ser Phe Met Ser
485 490 495
Gly Val Gln Val Glu Asn Thr Ser Asp Asn Gly Gly Gly Met Asn Val
500 505 510
Gly Trp Leu Asp Ala Gly Asp Trp Leu Ala Tyr His Gly Val Asn Ile
515 520 525
Pro Thr Ser Gly Gln Tyr Thr Ile Thr Tyr Arg Val Ala Ser Gln Ser
530 535 540
Gly Gly Gly Ser Leu Gln Leu Glu Gln Ala Gly Gly Gly Val Val Tyr
545 550 555 560
Gly Asn Leu Asn Val Pro Ser Thr Gly Gly Trp Gln Asn Trp Val Asp
565 570 575
Val Ser His Thr Val Thr Leu Asn Ala Gly Val Gln Asp Phe Gly Leu
580 585 590
Gly Ile Thr Ser Gly Gly Phe Asn Ile Asn Trp Ile Lys Val Glu Ala
595 600 605
Ile His
610
<210> 23
<211> 1833
<212> DNA
<213> Microbulbifer degradans
<400> 23
atgatgtaca caaacctctt taatttaaaa aagcacctct ttcaaacctc acttaaacta 60
ctggcctgcg ccacattaat tggcggcacc ctaaacgcag ccgctgacgt gccagcaatg 120
tccgtacaag gcaataaagt actggtgggc ggtgaagtta aaagccttgg aggtatgagc 180
tatttttggt ctaacaacgg ctggggcggc gagaaatact acaacgcttc taccgttagt 240
tacttcaagc aagactggaa ggcatccatt gttcgagctg caatgggggt agaagatgcc 300
ggcggctact tcgatgaccc gcagggctct aagcaaaaag ttcgtacaat agtagatgcc 360
gccattgcga atgatatgta cgtcattatc gattggcact cacattacgc caacacccac 420
gactgggcag ccgctgtgca atttttccaa gaaatggcac gtgactatgg ccaatacaat 480
aatgtgattt atgaggtata caacgaacca ctggatatcc cttggggcca cataaaaagc 540
tacgccgaaa cggtaattga tgccattcgc gcaattgacc cagataacgt gatcgtagta 600
ggcactcctc gctggtcgca gggggtaaaa gaagcgtcat gggacccaat caaccgcaat 660
aatattgcct acacgctgca cttctattca ggtagtcatg gccaatggct gcgcaacgac 720
gcagcagaag ctatgagtaa tggtattgcc ttgtttgtta ctgaatgggg cagcgtaaat 780
gccaatggcg atggcgcagt caacgaaggc gaaaccgcag cgtggatgaa cttcatgcgc 840
gataacggta tccatcacgc aaactggtct gtaaacgaca aagcagaggg tgcatctgca 900
cttaaccctg gcgccagtgc cacaggtggt tggggcgacg gcgatttgac ttggtctggc 960
catgttgtgc gcggctacct gcgcgactgg aaccaaattg gttctggcaa tggtaacggc 1020
aacggcacag gctgcaccga ggttagccta ccaggcacga tagaagcgga agcctactgc 1080
gcaatggatg gtatccaaac cgaaaacacc aacgacacca acggcggcag taacgtgggc 1140
tacatagatg ctggcgactg gatgagctac agcgtaaacg ttgctaacgc aggcacttat 1200
accgtgtctt accgcgtggc tagccttggc ggcggcggtg ttctaagcat tgaaaatgcc 1260
ggcggctcgc ccgtttatgg cacgctgaat gtaccgcaaa ctggcggctg gcaagaatgg 1320
accactgtat ctcacgatat tagcttgcaa gccggccaac aaaacattgg catagcggca 1380
atagaaggtg gttttaacat caactggata gccctaaccc ctgctggcac caaccccaac 1440
ccagtgcaaa gtattacctt acaagcagaa gactactcct ttatgagtgg cgtgcaggta 1500
gaaaatacta gcgacaatgg cggcggtatg aacgtaggct ggttagatgc tggcgactgg 1560
cttgcctacc acggcgtaaa cattccaacc tctggccaat acaccataac ttaccgagta 1620
gccagccaaa gcggtggtgg aagcctgcag ctagaacaag caggtggcgg cgttgtttac 1680
ggtaacctga acgtaccaag cactggcggc tggcagaact gggtagacgt aagccatacc 1740
gttaccctta acgctggtgt acaagatttt gggttaggta ttactagtgg tggcttcaat 1800
attaactgga taaaagtcga ggcaattcac taa 1833
<210> 24
<211> 791
<212> PRT
<213> Microbulbifer degradans
<400> 24
Met Leu Ala Ser Asn Lys Asn Ser Lys Leu Ala Asn Ser Glu Gln His
1 5 10 15
Arg Pro Tyr Lys Thr Arg Thr Ala Arg Trp Leu Thr Gly Ser Gly Val
20 25 30
Ile Ala Ser Ser Leu Leu Phe Ser Ala Gln Ser Phe Ala Ala Gln Cys
35 40 45
Glu Tyr Ile Ile Ser Asn Glu Trp Asn Ser Gly Phe Thr Gly Ala Val
50 55 60
Arg Ile Thr Asn Asn Gly Thr Thr Pro Ile Asn Gly Trp Asp Val Ser
65 70 75 80
Trp Gln Tyr Ala Gly Asp Ala Val Thr Ser Ser Trp Asn Ala Asn Val
85 90 95
Ser Gly Ser Asn Pro Val Ser Ala Thr Pro Leu Ser Trp Asn Ala Asn
100 105 110
Ile Gln Pro Gly Gln Ser Val Glu Phe Gly Phe Gln Gly Ser Lys Ala
115 120 125
Gly Ser Asn Ala Glu Ile Pro Thr Val Thr Gly Ala Val Cys Asp Ser
130 135 140
Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
145 150 155 160
Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser
165 170 175
Thr Ser Ser Ser Ser Ser Ser Ser Gly Ser Ser Gly Thr Gly Gly Ile
180 185 190
Ala Cys Thr Val Gly Asn Ala Asn Ile Trp Gly Ser Gly Tyr Gln Leu
195 200 205
Asp Met Gln Val Val Asn Asn Gly Thr Ala Ala Val Ser Ser Trp Asp
210 215 220
Val Thr Met Ala Phe Gly Glu Ala Pro Gln Arg Thr Gly Gly Trp Asn
225 230 235 240
Ala Asn Phe Val Glu Ser Gly Asn Thr Ile Val Ala Ser Asn Ile Ser
245 250 255
Trp Asn Gly Asn Leu Ala Pro Gly Gln Ser Ala Ser Phe Gly Ile Gln
260 265 270
Gly Asn His Asp Gly Ser Phe Gly Gly Val Thr Cys Asn Gly Ala Ser
275 280 285
Ser Ser Gly Ser Ser Ser Ser Gly Ser Ser Thr Ser Ser Ser Ser Ser
290 295 300
Ser Ser Ser Ser Ser Gly Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser
305 310 315 320
Ser Thr Gly Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr
325 330 335
Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Ser
340 345 350
Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
355 360 365
Ser Ser Ser Thr Ser Gly Ser Gly Ala Gly Phe Asp Asn Pro Phe Ile
370 375 380
Gly Gly Lys Trp Tyr Val Asp Pro Val Trp Ser Ala Lys Ala Ala Ala
385 390 395 400
Glu Pro Asn Gly Ser Leu Ile Ala Asn Tyr Asn Thr Ala Val Trp Met
405 410 415
Asp Arg Ile Gly Ala Ile Glu Gly Pro Glu Asp Gly Asp Gly Met Gly
420 425 430
Leu Glu Glu His Leu Asp Glu Ala Leu Ala Gln Gly Ala Asp Ile Phe
435 440 445
Met Phe Val Val Tyr Asp Leu Pro Asn Arg Asp Cys Ala Ala Leu Ala
450 455 460
Ser Ser Gly Glu Leu Leu Ile Ala Glu Asn Gly Phe Glu Arg Tyr Gln
465 470 475 480
Asn Glu Tyr Ile Gly Pro Ile Val Asp Ile Leu Ser Lys Pro Ala Tyr
485 490 495
Ser Ser Leu Arg Ile Ile Ala Ile Ile Glu Val Asp Ser Leu Pro Asn
500 505 510
Leu Val Thr Asn Leu Asn Ile Gln Lys Cys Val Glu Ala Asn Gly Pro
515 520 525
Gly Gly Tyr Val Asp Gly Ile Gln His Ala Leu Asn Glu Leu Asn Thr
530 535 540
Leu Asp Asn Val Tyr Pro Tyr Val Asp Ile Ala His Ser Gly Trp Leu
545 550 555 560
Gly Trp Ser Asp Asn Phe Ala Gly Ala Thr Lys Leu Ile Gly Asp Ala
565 570 575
Ile Lys Gly Thr Asn Lys Gly Val Asn Ser Ile Ala Gly Phe Val Ser
580 585 590
Asn Ser Ser Asn Tyr Thr Pro Val Thr Glu Pro Tyr Leu Pro Asn Pro
595 600 605
Thr Leu Gln Ile Gly Ser Asn Gln Val Arg Ser Ala Asp Phe Tyr Glu
610 615 620
Trp Thr Met Tyr Phe Glu Glu Leu Ser Phe Val Gln Asp Trp Arg Gln
625 630 635 640
Ala Met Ile Gln Gln Gly Phe Pro Glu Ser Ile Gly Met Leu Ile Asp
645 650 655
Thr Ala Arg Asn Gly Trp Gly Gly Pro Asp Arg Pro Thr Gly Glu Ser
660 665 670
Thr Ser Thr Asp Leu Asn Thr Tyr Val Asn Glu Ser Arg Ile Asp Arg
675 680 685
Arg Gln His Arg Gly Asn Trp Cys Asn Gln Pro Gly Gly Val Gly Phe
690 695 700
Arg Pro Gln Ala Ala Pro Glu Pro Gly Val Asp Ala Tyr Val Trp Val
705 710 715 720
Lys Pro Gln Gly Glu Ser Asp Gly Ile Ser Asp Pro Asn Phe Pro Ile
725 730 735
Asp Pro Asn Asp Pro Ala Lys Gln His Asp Pro Met Cys Asp Pro Asn
740 745 750
Ala Pro Asn Arg Asp Asn Asn Ala Val Gly Thr Gly Ala Leu Asp Asn
755 760 765
Ala Pro His Ala Gly Arg Trp Phe Pro Glu Ala Phe Gln Ile Leu Ile
770 775 780
Glu Asn Ala Tyr Pro Pro Leu
785 790
<210> 25
<211> 65
<212> PRT
<213> Microbulbifer degradans
<400> 25
Met Pro Leu Glu Pro Asp Asp Glu Leu Asp Glu Glu Val Leu Glu Glu
1 5 10 15
Leu Asp Glu Glu Leu Leu Val Leu Leu Glu Leu Leu Glu Glu Leu Asp
20 25 30
Glu Leu Asp Asp Glu Leu Glu Leu Glu Glu Leu Glu Pro Leu Ser His
35 40 45
Thr Ala Pro Val Thr Val Gly Ile Ser Ala Leu Glu Pro Ala Leu Leu
50 55 60
Pro
65
<210> 26
<211> 112
<212> PRT
<213> Microbulbifer degradans
<400> 26
Met Asn Gly Leu Ser Lys Pro Ala Pro Leu Pro Glu Val Glu Glu Leu
1 5 10 15
Glu Leu Glu Glu Asp Glu Leu Glu Leu Asp Asp Val Leu Glu Leu Glu
20 25 30
Leu Glu Leu Glu Leu Val Glu Leu Val Leu Glu Leu Glu Leu Glu Glu
35 40 45
Val Leu Glu Val Glu Glu Leu Leu Glu Leu Glu Val Glu Pro Val Glu
50 55 60
Glu Leu Glu Leu Glu Leu Glu Val Leu Glu Pro Asp Glu Leu Asp Glu
65 70 75 80
Leu Leu Asp Glu Leu Val Leu Glu Pro Glu Leu Asp Glu Pro Glu Leu
85 90 95
Glu Ala Pro Leu Gln Val Thr Pro Pro Lys Glu Pro Ser Trp Phe Pro
100 105 110
<210> 27
<211> 2376
<212> DNA
<213> Microbulbifer degradans
<400> 27
atgttggctt ctaataaaaa tagtaagctg gcaaactctg agcaacaccg cccttataaa 60
acccgcacag cgcgctggtt aaccgggtct ggggttattg cttcaagttt gcttttttct 120
gcgcagagtt ttgcggcgca atgtgaatac atcattagca atgaatggaa cagcggcttt 180
actggcgcag ttcgcattac taataatggc actactccca tcaatggctg ggatgttagc 240
tggcagtatg ccggcgatgc agtcaccagc agctggaacg cgaatgtttc tggctcgaac 300
cccgtttctg ctacaccatt aagctggaat gccaacattc aacccggtca aagcgttgag 360
tttggttttc agggcagcaa agccggctcc aatgcagaaa ttccaaccgt taccggcgcg 420
gtatgtgata gcggctctag ctcttccagc tccagctcat catctagttc atcaagctct 480
tctagtagct caagcagcac tagcagctcc tcgtccagct cttcaagcac ctcttcgtct 540
agctcatcat ctggctccag tggcacaggt ggtattgcgt gtactgtagg caatgcgaat 600
atttggggct cgggctacca gctggacatg caagttgtta acaacggcac cgctgcagta 660
agcagttggg acgtaaccat ggcattcggc gaggcaccac agcgcaccgg tggctggaac 720
gcaaactttg tagagtcagg caataccatt gttgcgagca acattagctg gaacggcaac 780
ctcgcaccgg ggcaatcagc ttcgtttggt attcaaggga accacgacgg ctcttttggc 840
ggcgtaacct gtaacggcgc ttcaagctct ggctcgtcta gttctggctc tagcaccagc 900
tcatcaagta gctcatccag ttcgtctggc tctagcactt ctagctctag ctcaagctcc 960
tctactggtt ctacctctag ctctagtagc tcttcaactt ctagcacttc ttcaagttct 1020
agctctagca ccagctccac gagttctagc tccagttcta gctcgagtac atcgtctagt 1080
tccagctcat cttcctcaag ctctagctct tctacttcag gcagtggcgc aggttttgac 1140
aacccgttca ttggcggcaa gtggtatgta gacccagtat ggtcagcaaa agctgcagca 1200
gagccaaacg gttcacttat tgccaactac aacacggcag tttggatgga tcgcattggt 1260
gcgattgaag gcccagaaga tggcgatggt atgggcttag aagaacactt agatgaagct 1320
ttagcacaag gtgcagacat ctttatgttc gtggtatacg acctaccaaa ccgcgactgt 1380
gcagctttgg cctcaagtgg cgaactactc attgccgaga acggttttga gcgctatcaa 1440
aatgagtaca ttggcccaat cgtagatata ctcagcaagc ccgcgtattc tagcttgcgt 1500
attatcgcga ttattgaagt ggattctcta cccaacctcg ttaccaacct caacattcaa 1560
aaatgtgttg aagcgaatgg cccgggtggg tacgtagacg gtatccaaca tgcacttaac 1620
gagctaaaca cgcttgataa tgtgtaccca tacgtcgata ttgctcactc aggctggcta 1680
ggctggagcg acaacttcgc cggcgccacc aagcttattg gtgatgcaat taaaggcaca 1740
aacaaaggtg taaacagtat tgcaggcttt gtaagtaact cttctaacta cacacctgtg 1800
actgaaccat acctacctaa ccctaccttg caaattggta gcaaccaagt tcgatctgcg 1860
gatttctacg agtggaccat gtacttcgaa gaacttagct ttgtacaaga ttggcgccaa 1920
gccatgattc agcaaggctt cccagaatca attggtatgc ttattgatac cgcacgtaat 1980
ggctggggtg gacctgaccg tccaactggt gagtctacat ctaccgacct caacacctat 2040
gtgaatgaat cgcgtataga ccgccgtcag catcgcggaa actggtgtaa ccagcccggt 2100
ggtgttggct tccgtccgca agcggcacca gaaccaggtg tagacgctta cgtttgggtt 2160
aagccacaag gtgagtcgga tggtattagt gatcctaact tccctatcga ccctaacgac 2220
ccagctaaac agcacgaccc aatgtgtgat ccaaacgcac ctaaccgcga taacaatgcg 2280
gttggcacag gcgcgctaga taacgctcca catgctggtc gctggttccc agaagcattc 2340
caaatactta tagaaaacgc ctacccaccg ctatag 2376
<210> 28
<211> 578
<212> PRT
<213> Microbulbifer degradans
<400> 28
Met Asn Lys Val Lys Val Leu Ala Leu Cys Ala Ser Val Ala Val Met
1 5 10 15
Ile Gly Cys Ser Asp Ala Asp Thr Lys Leu Ala Asn Ser Ala Lys Ala
20 25 30
Glu Val Gly Phe Thr Lys Val Asn Gln Leu Gly Tyr Leu Pro Ala Ala
35 40 45
Lys Lys Leu Ala Val Val Pro Ala Val Ala Ala Ala Lys Phe Asp Ile
50 55 60
Ile Asp Val Thr Ser Gly Lys Val Ala Phe Thr Gly Ser Leu Ser Asp
65 70 75 80
Val Lys Ser Trp Ser Ala Met Gly Asp Glu Ser Phe Lys Leu Ala Asp
85 90 95
Phe Ser Ala Leu Gln Ala Glu Gly Ser Tyr Arg Leu Val Val Gln Gly
100 105 110
Val Ser Asp Ser Tyr Thr Phe Asp Ile Ser Pro Ser Val Tyr Ser Gln
115 120 125
Ala His Asp Gly Ala Leu Lys Ala Tyr Tyr Tyr Asn Arg Ala Ser Thr
130 135 140
Glu Leu Thr Glu Gln Tyr Ala Gly Val Tyr Ala Arg Pro Ala Gly His
145 150 155 160
Pro Asp Thr Asp Val Arg Ile Phe Asp Asn Ala Ala Ser Ala Ala Arg
165 170 175
Pro Ala Asp Thr Ser Phe Ala Ala Pro Lys Gly Trp Tyr Asp Ala Gly
180 185 190
Asp Tyr Gly Lys Tyr Ile Val Asn Ser Gly Ile Ser Thr Tyr Thr Leu
195 200 205
Met Ala Ala Tyr Glu His Phe Pro Ser Phe Tyr Lys Gln Arg Asp Ile
210 215 220
Asp Ile Pro Glu Ser Gly Asp Ala Val Pro Asp Ile Leu Asp Glu Val
225 230 235 240
Met Trp Asn Leu Glu Trp Met Gln Val Met Gln Asp Pro Asn Asp Gly
245 250 255
Gly Val Tyr His Lys Leu Thr Thr Leu Asn Phe Ser Gly Ala Val Met
260 265 270
Pro His Glu Ala Thr Ala Gln Arg Tyr Phe Ile Lys Lys Ser Thr Ala
275 280 285
Ala Thr Leu Asp Phe Ala Ala Val Met Ala Thr Ala Ser Arg Val Tyr
290 295 300
Ala Pro Phe Glu Gly Ala Phe Pro Gly Lys Ser Ala Ala Tyr Arg Gln
305 310 315 320
Ala Ala Ile Ala Ala Trp Glu Trp Ala Gln Ala Asn Pro Ser Glu Thr
325 330 335
Tyr Ser Gln Thr Pro Leu Ser Lys Val Gln Thr Gly Ala Tyr Gly Asp
340 345 350
Lys Lys Leu Asn Asp Glu Phe Ala Trp Ala Ala Ala Glu Leu Phe Ile
355 360 365
Leu Thr Gly Glu Gln Lys Tyr Trp Gln Ala Phe Asn Lys Gln Lys Val
370 375 380
Gln Ala Gly Glu Ser Ser Trp Ala Asn Val Ala Gly Leu Gly Phe Ile
385 390 395 400
Ser Leu Ala Asn Asn Ala Arg Ser Leu Leu Asn Glu Ala Gln Tyr Lys
405 410 415
Thr Val Thr Asp Ser Ile Val Arg Ala Ala Asp Ser Leu Leu Val Thr
420 425 430
Tyr Lys Glu Asn Ala Tyr Gln Val Pro Ile Gly Asn Lys Asp Phe Phe
435 440 445
Trp Gly Gly Asn Ser Gly Thr Leu Asn Arg Ala Trp Val Leu Leu Glu
450 455 460
Ala Asn Lys Ile Lys Pro Gln Gln Glu Tyr Ile Asp Ala Ala Leu Ala
465 470 475 480
Ala Val Asp Tyr Ile Tyr Gly Arg Asn Pro Thr Asn Tyr Ser Phe Val
485 490 495
Thr Gly Phe Gly Asp Asn Pro Ala Val Gly Ile His His Arg Pro Ser
500 505 510
Tyr Ala Asp Gly Ile Lys Ala Pro Val Pro Gly Trp Leu Ala Gly Gly
515 520 525
Ala His Asn Gly Lys Gln Asp Gly Cys Glu Tyr Pro Ser Asp Ala Pro
530 535 540
Ala Lys Ser Tyr Leu Asp Asp Trp Cys Ser Tyr Ser Thr Asn Glu Ile
545 550 555 560
Ala Ile Asn Trp Asn Ala Pro Leu Val Tyr Ile Leu Ala Ala Val Asn
565 570 575
Asn Leu
<210> 29
<211> 1737
<212> DNA
<213> Microbulbifer degradans
<400> 29
atgaacaaag ttaaagtttt agcgctgtgt gccagtgtgg ctgtaatgat aggttgcagt 60
gatgccgaca ctaaattagc taactcggcc aaggccgagg tgggctttac caaagtgaat 120
cagctgggtt atttgcccgc ggccaaaaag ctggcggtgg tacccgccgt tgcagctgca 180
aaattcgaca taatcgatgt aactagcggt aaagtagcgt ttacggggag tttaagcgac 240
gtaaaaagct ggagcgcgat gggggacgaa tctttcaagt tggcagactt tagcgccctg 300
caagccgaag ggagttaccg cttagttgtt cagggtgtga gtgattctta caccttcgat 360
attagcccaa gtgtatatag ccaagcgcac gatggagccc ttaaagccta ttactataat 420
cgagcgagca cagagttaac agaacagtac gccggggtgt atgcgcgacc tgcggggcac 480
ccagataccg acgtacgcat attcgataac gccgcctcag ccgcgcgccc agcagataca 540
agctttgctg caccaaaggg ttggtacgat gctggcgatt acggcaagta cattgttaac 600
agtggtattt ccacttacac cctaatggct gcgtacgagc atttcccgtc gttttacaag 660
caacgcgata tagatattcc cgaatctggc gatgccgtac cggatattct cgacgaggta 720
atgtggaacc ttgaatggat gcaggtcatg caagacccga acgacggcgg tgtgtaccac 780
aagcttacca ccctgaattt ttctggcgca gtcatgccgc acgaagcgac tgcgcagcgc 840
tattttatta aaaaatctac cgctgcaacg ctagattttg ccgcggttat ggccactgca 900
agccgagtat acgcaccgtt cgaaggtgct tttcctggta aatcagctgc ttatcgacag 960
gcggccattg ctgcgtggga gtgggcacaa gcaaacccta gtgagacata ttcgcagaca 1020
ccgctgagca aagttcaaac cggcgcctat ggtgataaaa agttaaacga tgaatttgcg 1080
tgggcggccg cagagttgtt tatattgacc ggcgagcaaa aatactggca ggcgtttaac 1140
aagcaaaaag tgcaggcggg tgagtctagc tgggcgaatg ttgcggggtt ggggtttatt 1200
tccttggcca ataatgcgcg cagcctgtta aacgaagctc aatacaaaac cgttaccgat 1260
tcaattgttc gcgctgcaga tagcttgctt gttacttaca aagagaatgc ctaccaagta 1320
cccattggca acaaagattt tttctggggt ggcaattccg gcacgttaaa tcgcgcttgg 1380
gttttgcttg aggccaataa aattaaaccg cagcaagaat acatcgatgc tgcacttgcc 1440
gcggtggatt atatttatgg tcgcaaccct accaactact cttttgtcac tgggtttggc 1500
gataaccctg cggtgggtat ccatcatcgt ccatcctatg ccgatggcat taaagcccct 1560
gtgcctggtt ggcttgcggg cggtgcgcac aatggcaagc aagatggttg tgagtaccct 1620
tccgatgcac cggcaaaatc ctatctagac gactggtgca gttactccac caacgaaatt 1680
gctattaatt ggaatgcgcc gttagtttac atactggctg cggtaaataa tttgtag 1737
<210> 30
<211> 867
<212> PRT
<213> Microbulbifer degradans
<400> 30
Met Asn Leu Thr Ser Ile Met Phe Glu Gln Ser Val Lys Lys Val Ala
1 5 10 15
Lys Ser Ala Ile Ala Val Ala Val Ala Ser Ala Val Thr Leu Ser Ala
20 25 30
Ala Gln Ala Glu Val Gly Asn Pro Arg Val Asn Gln Val Gly Tyr Ile
35 40 45
Pro Asn Gly Ala Lys Val Ala Ser Tyr Val Ala Pro Ser Asn Thr Ala
50 55 60
Gln Thr Trp Gln Leu Leu Arg Asn Gly Ser Val Val Ala Ser Gly Thr
65 70 75 80
Thr Thr Pro Lys Gly Thr Asp Ala Ala Ser Gly Asp Asn Ile His His
85 90 95
Ile Asp Phe Ser Ala Val Ser Ala Thr Gly Glu Gly Phe Ser Leu Leu
100 105 110
Val Gly Gly Asp Glu Ser Tyr Pro Phe Glu Ile Ser Ala Asp Ala Phe
115 120 125
Thr Pro Val Leu Tyr Asp Ser Ile Arg Tyr Phe Tyr His Asn Arg Ser
130 135 140
Gly Ile Ala Ile Glu Thr Gln Tyr Thr Gly Gly Gly Asn Gly Ser Tyr
145 150 155 160
Ala Ala Asn Ala Gln Trp Ala Arg Pro Ala Gly His Ile Asn Gln Asn
165 170 175
Ala Asn Gln Gly Asp Asn Ala Val Pro Cys Trp Ser Gly Ser Gly Cys
180 185 190
Asn Tyr Ala Leu Asp Val Thr Lys Gly Trp Tyr Asp Ala Gly Asp His
195 200 205
Gly Lys Tyr Val Val Asn Gly Gly Ile Ser Val Trp Lys Leu Leu Asn
210 215 220
Met Tyr Glu Arg Ala Leu His Ile Ser Gly Ser Gln Asn Lys Tyr Ala
225 230 235 240
Asp Gly Thr Leu Asn Ile Pro Glu Ser Gly Asn Gly Val Ala Asp Ile
245 250 255
Leu Asp Glu Ala Arg Trp Gln Met Glu Phe Leu Leu Ala Met Gln Val
260 265 270
Pro Glu Gly Glu Ala Lys Ala Gly Met Val His His Lys Met His Asp
275 280 285
Val Gly Trp Thr Gly Leu Pro Leu Ala Pro His Glu Asp Asn Arg Glu
290 295 300
Arg Ala Leu Val Pro Pro Ser Val Thr Ala Thr Leu Asn Val Ala Ala
305 310 315 320
Thr Gly Ala Gln Cys Ala Arg Leu Phe Asp Glu Ile Asp Ala Ser Phe
325 330 335
Ala Ala Ser Cys Leu Thr Ala Ala Glu Arg Ala Trp Asp Ala Ala Leu
340 345 350
Gln Asn Pro Asn Asp Val Tyr Thr Gly Gly Tyr Asp Asn Gly Gly Gly
355 360 365
Gly Tyr Gly Asp Glu Val Ala Asp Asp Glu Phe Phe Trp Ala Ala Ala
370 375 380
Glu Leu Tyr Ile Thr Thr Gly Asp Ser Lys Tyr Leu Ser Thr Ile Asn
385 390 395 400
Asn Tyr Asn Val Thr Arg Ile Asp Trp Gly Trp Pro Asp Thr Glu Leu
405 410 415
Pro Ala Leu Met Ser Leu Ala Val Val Pro Ala Asn His Thr Ala Asn
420 425 430
Leu Arg Ala Thr Ala Arg Ala Lys Ile Val Glu Ile Ala Asp Thr His
435 440 445
Val Ala Thr Ser Asn Ala Ala Gly Tyr Leu Thr Pro Ser Ser Ala Leu
450 455 460
Asp Tyr Tyr Trp Gly Ser Asn Asn Gly Val Ala Asn Lys Ile Ala Leu
465 470 475 480
Leu Gly Leu Ala Tyr Asp Phe Thr Gly Asp Asp Val Tyr Ala Lys Thr
485 490 495
Val Ser Lys Ala Val Asn Tyr Leu Phe Gly Asn Asn Thr Leu Ser Phe
500 505 510
Ser Tyr Ile Ser Gly His Gly Glu Asn Ala Leu Gln Gln Pro His His
515 520 525
Arg Phe Trp Ala Gly Ala Leu Asn Gly Ser Tyr Pro Trp Leu Pro Pro
530 535 540
Gly Ala Leu Ser Gly Gly Pro Asn Ala Gly Leu Glu Asp Gly Val Ala
545 550 555 560
Ala Ala Ala Leu Ser Ala Cys Val Ser Thr Pro Ala Lys Cys Tyr Met
565 570 575
Asp Asp Ile Glu Ser Trp Ser Thr Asn Glu Ile Thr Ile Asn Trp Asn
580 585 590
Gly Ala Leu Val Trp Ala Met Ala Phe Tyr Asp Asp Tyr Ala Asp Ser
595 600 605
Gly Ser Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
610 615 620
Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser
625 630 635 640
Ser Ser Ser Ser Gly Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser
645 650 655
Ser Ser Ser Ser Ser Ser Ser Gly Gly Glu Cys Val Glu Met Cys Lys
660 665 670
Trp Tyr Gln Asp Ala Pro Arg Pro Leu Cys Asn Asn Gln Asn Ser Gly
675 680 685
Trp Gly Trp Glu Asn Gln Gln Ser Cys Ile Gly Arg Thr Thr Cys Glu
690 695 700
Ser Gln Ser Gly Asn Gly Gly Val Ile Asn Ser Cys Gly Thr Ser Ser
705 710 715 720
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
725 730 735
Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser
740 745 750
Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Gly Gly
755 760 765
Val Ala Gly Val Ala Cys Ala Val Thr Lys Met Asn His Trp Gly Ser
770 775 780
Gly Tyr Gln Leu Asp Val Thr Val Ser Asn Asn Gly Ala Ala Ala Val
785 790 795 800
Ser Gly Trp Ser Ile Glu Leu Asp Phe Gly Glu Ser Pro Gln Leu Thr
805 810 815
Gly Ser Trp Asn Ala Ala Val Ser Ala Ser Gly Asn Thr Val Ser Ala
820 825 830
Thr Asn Ile Ser Trp Asn Gly Asn Leu Ser Ala Gly Gln Ser Thr Ser
835 840 845
Phe Gly Met Gln Gly Asn Ser Asp Gly Ser Leu Ser Thr Pro Ser Cys
850 855 860
Leu Val Lys
865
<210> 31
<211> 49
<212> PRT
<213> Microbulbifer degradans
<400> 31
Met Glu Glu Leu Leu Glu Glu Leu Glu Pro Leu Asp Glu Glu Leu Leu
1 5 10 15
Leu Glu Asp Asp Glu Leu Glu Val Glu Leu Glu Glu Leu Asp Glu Glu
20 25 30
Leu Asp Glu Leu Glu Glu Leu Asp Glu Leu Glu Pro Leu Pro Glu Ser
35 40 45
Ala
<210> 32
<211> 2604
<212> DNA
<213> Microbulbifer degradans
<400> 32
atgaatctta cttcaatcat gtttgaacaa tcagtaaaaa aagtcgctaa gtcagccatt 60
gccgtggcag ttgcttcggc ggttacctta agtgcggcgc aggccgaggt gggtaaccca 120
cgtgttaacc aagtaggcta tatacccaat ggtgccaaag ttgccagtta tgttgcgcca 180
tcaaatacgg cacaaacgtg gcagttactg cgtaatggca gtgtggttgc aagtggcact 240
acaaccccaa agggtacaga tgcagcctcg ggtgacaata ttcaccatat cgatttttct 300
gcggtgagtg caaccggcga aggttttagt ttgcttgtgg gcggcgatga aagttacccc 360
tttgaaattt ctgccgacgc atttacaccg gttttatacg attccatccg ttacttttat 420
cacaaccgtt cgggtatcgc gattgaaacg cagtacaccg gtggcggtaa cggtagctac 480
gcggcgaatg ctcagtgggc taggcccgca ggtcacatta atcaaaatgc taaccaaggc 540
gataatgcgg tgccgtgttg gtcgggcagt ggttgcaact acgccttaga cgtaactaaa 600
ggttggtacg atgccggtga ccacggtaaa tatgttgtaa acggtggcat ttccgtatgg 660
aagctattaa acatgtacga gcgtgccttg cacattagtg gcagccaaaa taaatacgcc 720
gacggtacat taaatattcc tgaaagcggc aatggcgtgg cggatatttt ggatgaagct 780
cgctggcaaa tggagttttt attagccatg caagtgccag agggcgaagc gaaagctggc 840
atggtgcacc acaaaatgca cgatgtgggt tggacaggct tgccactagc accccatgaa 900
gataatcgcg agcgcgcgct tgtgccgcct tcggttactg caacccttaa cgttgcggcc 960
acaggcgcgc agtgtgcgcg tttatttgac gaaatagatg cgagttttgc agcaagttgt 1020
ttaactgccg cagagcgcgc atgggatgca gccctgcaaa accctaacga tgtttacact 1080
ggcggctacg ataatggcgg cggtggttac ggcgatgaag tggcggacga cgagtttttc 1140
tgggctgctg ctgagttata cattaccact ggcgatagca aatatctttc aaccattaac 1200
aactacaatg taacgcgcat tgattggggc tggccagata ccgagttgcc tgcgttgatg 1260
tcgttagcgg ttgtgcctgc taatcacacc gcaaatttgc gtgcgactgc tcgtgcaaaa 1320
attgtagaaa ttgcagatac ccatgtcgct accagtaatg ctgccggcta tttaacacca 1380
tcgtccgcgc tggattacta ctggggttct aacaatggcg tagccaataa aattgcgtta 1440
cttggtttgg catacgattt tactggcgat gacgtttacg cgaaaacggt gtcgaaagca 1500
gttaactatt tatttggtaa taatacctta tcgttttctt atatttctgg gcatggcgaa 1560
aatgctttgc aacagccgca tcaccgcttt tgggctgggg cattaaatgg aagttaccca 1620
tggttgccgc ctggtgcgct ttctggtggc cctaacgcag ggttagaaga tggcgttgcc 1680
gccgccgcgc taagtgcttg tgtttcaacg cctgccaaat gctatatgga tgatattgaa 1740
tcttggtcga ccaacgaaat tactattaac tggaatggtg cattggtttg ggcaatggcg 1800
ttttatgatg actacgccga ttcgggtagc ggttctagct cgtcaagttc ttctagctca 1860
tctagctctt cgtcaagttc ttccagttcg acttctagct cgtcgtcttc tagtagtagc 1920
tcttcgtcga gcggctcgag ttcttctagc agctcttcca catccagttc cagctcttcg 1980
agttcatcgg gtggggagtg tgtagaaatg tgtaagtggt atcaagatgc accgcgccct 2040
ctatgcaata accaaaacag cggttgggga tgggagaacc agcagagttg tattggtaga 2100
acaacttgcg aaagtcaaag tggcaatggt ggagtgatta attcgtgcgg cacgtctagc 2160
tcgagctctt catctagctc tagcagtagc tcttcgagtt catccagctc ttctagcagt 2220
tcttccacat caagctcgtc gagtagttcg tcttctagct cttctagttc gacttcaagt 2280
tcttcgtcga gcagttcagg gggcgttgca ggtgtggctt gtgcggtaac caaaatgaac 2340
cattggggca gcggatatca attagatgta acagtttcta ataatggtgc tgcagcggta 2400
agtggttgga gtattgaact cgattttggt gaatcgccac agcttactgg tagttggaat 2460
gctgctgtat cggcatctgg taatactgta tcggctacta acattagttg gaacggtaat 2520
ttaagcgctg ggcaatctac ctcttttggt atgcagggta attcagatgg ttcgctgagc 2580
acgccaagct gtttagttaa gtaa 2604
<210> 33
<211> 1072
<212> PRT
<213> Microbulbifer degradans
<400> 33
Met Lys Asn Thr Leu Ser Phe Lys Thr Ser Leu Leu Ala Gly Leu Val
1 5 10 15
Ala Ser Ser Leu Leu Val Ala Ala Cys Gln Gly Val Lys Gln Gln Thr
20 25 30
Glu Ala Thr Gln Thr Lys His Asn Ile Thr Leu Trp Pro Gln Ala Ser
35 40 45
Ser Pro Val Ile Lys Ser Pro Asp Tyr Glu Ala Glu Val Glu Ala Lys
50 55 60
Val Glu Ala Leu Leu Gly Gln Met Thr Leu Glu Gln Lys Val Gly Gln
65 70 75 80
Ile Leu Gln Pro Glu Ile Gln Ser Ile Lys Pro His Glu Val Lys Glu
85 90 95
Tyr His Ile Gly Ser Val Leu Asn Gly Gly Gly Ser Met Pro Asn Arg
100 105 110
Ile Glu Asn Ala Pro Pro Ile Glu Trp Val Lys Leu Ala Asp Ala Phe
115 120 125
Tyr Asp Ala Ser Met Asp Asp Ser Asp Gly Gly Ile Ala Ile Pro Ile
130 135 140
Ile Trp Gly Thr Asp Ala Val His Gly His Gly Asn Val Thr Gly Ala
145 150 155 160
Thr Ile Phe Pro His Asn Ile Gly Leu Gly Ala Ala Arg Asn Pro Ala
165 170 175
Leu Ile Glu Lys Ile Gly Glu Ile Thr Ala Lys Glu Val Arg Ala Thr
180 185 190
Gly Ile Glu Trp Ile Phe Gly Pro Thr Leu Ala Val Ala Gln Asn Asp
195 200 205
Leu Trp Gly Arg Thr Tyr Glu Ser Tyr Ser Glu Asp Pro Ala Ile Val
210 215 220
Ala Asp Tyr Ala Ser Ala Met Val Val Gly Met Gln Gly Lys Val Asp
225 230 235 240
Asp Ser Asp Phe Leu Ser Thr Asn Arg Val Val Ala Thr Ala Lys His
245 250 255
Phe Leu Ala Asp Gly Gly Thr Leu Gly Gly Asn Asp Gln Gly Asp Ala
260 265 270
Arg Ile Ser Glu Glu Glu Leu Val Gln Ile His Asn Ala Gly Tyr Val
275 280 285
Pro Ala Ile Glu Ser Gly Val Gln Thr Val Met Ala Ser Phe Ser Leu
290 295 300
Trp Asn Gly Val Lys Met His Gly Asn Asn Tyr Leu Leu Thr Gln Ala
305 310 315 320
Leu Lys Glu Arg Met Gly Phe Asp Gly Phe Ile Val Gly Asp Trp Asn
325 330 335
Gly His Gly Gln Val Pro Gly Cys Thr Asn Glu Ser Cys Pro Gln Ser
340 345 350
Leu Asn Ala Gly Leu Asp Met Tyr Met Val Pro Tyr Asp Trp Lys Lys
355 360 365
Leu Tyr Arg Asn Leu Ile Ser Gln Val Gln Ser Gly Glu Ile Ala Pro
370 375 380
Ser Arg Leu Asp Asp Ala Val Arg Arg Ile Leu Arg Val Lys Ile Arg
385 390 395 400
Ala Asn Leu Trp Ala Ala Lys Pro Ser Glu Arg Ile Asn Leu Ala Thr
405 410 415
Ile Asp Glu Val Val Gly His Ala Asn His Arg Glu Val Ala Arg Gln
420 425 430
Ala Val Arg Glu Ser Leu Val Leu Leu Lys Asn Lys Asn Ser Val Leu
435 440 445
Pro Ile Ala Ala Asn Lys Thr Val Leu Val Ala Gly Asp Gly Ala Asp
450 455 460
Asn Ile Gly Lys Gln Ser Gly Gly Trp Ser Val Ser Trp Gln Gly Thr
465 470 475 480
Gly Asn Thr Asn Ala Ser Phe Pro Gly Gly Thr Ser Ile Tyr Lys Gly
485 490 495
Ile Ala Asp Ala Val Thr Gln Gly Gly Gly Lys Ala Thr Leu Ser Val
500 505 510
Asp Gly Ser Tyr Lys Thr Lys Pro Asp Val Ala Ile Val Val Ile Gly
515 520 525
Glu Asp Pro Tyr Ala Glu Gly Gln Gly Asp Arg Asn Ser Leu Glu Phe
530 535 540
Glu Pro Val Asn Lys Lys Ser Leu Glu Leu Leu Lys Lys Leu Lys Ala
545 550 555 560
Asp Gly Ile Pro Val Val Thr Val Phe Ile Ser Gly Arg Pro Met Trp
565 570 575
Ala Asn Pro Glu Ile Asn Ala Ser Asp Ala Phe Val Ala Ala Trp Leu
580 585 590
Pro Gly Ser Glu Gly Gln Gly Val Ala Asp Val Leu Ile Gly Asn Ala
595 600 605
Asn Gly Lys Pro Arg Phe Asp Phe Lys Gly Thr Leu Ser Phe Ser Trp
610 615 620
Pro Lys Leu Pro Thr Gln Gly Leu Leu Asn Pro Thr His Pro Asn Tyr
625 630 635 640
Asp Pro Leu Phe Lys Leu Gly Tyr Gly Leu Thr Tyr Ala Ser Ser Glu
645 650 655
Thr Gly Pro Glu Gln Leu Ala Glu Asp Val Glu Gly Val Asp Lys Gly
660 665 670
Ser Thr Gly Asp Ile Asn Phe Tyr Val Gly Arg Thr Leu Glu Pro Trp
675 680 685
Glu Val Phe Val Arg Thr Pro Glu Ser Ser Gln Arg Leu Ser Gly Pro
690 695 700
Phe Ala Asp Leu Gly Asn Ala Ser Val Arg Thr Ser Asp Met Gln Val
705 710 715 720
Gln Glu Asp Ala Leu Thr Phe Thr Trp Gly Gly Ser Trp Met Ser Ile
725 730 735
Leu Gly Ile Glu Gly Gly Arg Gly Tyr Asp Leu Ser Ser Gln Tyr Lys
740 745 750
Glu Gly Gly Val Ile Ser Phe Asn Phe Asn Ser Ile Asp Met Ala Lys
755 760 765
Gly Asp Leu Lys Val Gln Met Ala Cys Gly Glu Gly Cys Thr Arg Glu
770 775 780
Val Asp Ile Thr Thr Ile Ala Arg Asp Leu Glu Gly Lys Gly Trp Gln
785 790 795 800
Ser Leu Thr Val Pro Leu Ala Cys Phe Ala His Glu Gly Asp Asp Phe
805 810 815
Thr His Ile Thr Ala Pro Phe Asn Leu Phe Ala Gly Gly Lys Gly Gln
820 825 830
Val Ala Val Ala Asn Ile Arg Ile Leu Arg Ala Gly Thr Gln Thr Val
835 840 845
Pro Cys Val Leu Pro Lys Asp Val Ser Val Thr Pro Glu Pro Leu Asn
850 855 860
Ala Ser Trp Ala Ile Asp Trp Trp Met Pro Arg His Lys Glu Lys Leu
865 870 875 880
Ala Arg Ile Gln Gln Gly Asn Val Asp Leu Leu Met Ile Gly Asp Ser
885 890 895
Ile Thr His Gly Trp Glu Asp Ala Gly Lys Asp Val Trp Ala Gln Tyr
900 905 910
Tyr Ala His Arg Asn Ala Val Asp Leu Gly Phe Ser Gly Asp Arg Thr
915 920 925
Glu Asn Val Leu Trp Arg Leu Gln His Gly Glu Ala Asp Gly Ile Lys
930 935 940
Pro Lys Val Ala Val Val Met Ile Gly Thr Asn Asn Ala Gly His Arg
945 950 955 960
His Glu Pro Ser His Tyr Thr Ala Lys Gly Val Ala Ala Val Val Ala
965 970 975
Glu Leu Gln Lys Arg Leu Pro Glu Thr Lys Ile Leu Leu Leu Gly Ile
980 985 990
Phe Pro Arg Gly Glu Thr Ser Glu Asp Pro Leu Arg Val Leu Asn Ala
995 1000 1005
Lys Thr Asn Thr Leu Leu Ala Lys Met Ala Asp Gly Glu Lys Val Val
1010 1015 1020
Tyr Leu Asn Ile Asn Lys Thr Phe Leu Asp Glu Asn Gly Val Leu Pro
1025 1030 1035 1040
Lys Asp Ile Met Pro Asp Leu Leu His Pro Asn Glu Lys Gly Tyr Ala
1045 1050 1055
Leu Trp Ala Lys Ala Met Glu Pro Thr Leu Lys Lys Met Leu Gly Glu
1060 1065 1070
<210> 34
<211> 3219
<212> DNA
<213> Microbulbifer degradans
<400> 34
atgaaaaata ctttatcctt taaaacatcc ttgcttgcgg gcttggtggc atccagttta 60
ctggttgcgg cctgtcaggg tgttaaacag caaacggaag ctactcagac aaagcacaat 120
attaccttat ggccgcaggc gtctagccct gtaataaagt cgccagatta cgaagcggaa 180
gtggaagcca aggtagaagc gttgttagga caaatgacgc tagagcaaaa agtagggcaa 240
atcctacagc cagaaattca atctattaag ccgcatgaag taaaagaata ccacattggc 300
tctgtactaa atggtggtgg ctctatgcct aaccgcatag aaaatgcgcc gcccattgaa 360
tgggtaaaat tggccgatgc cttttacgat gcctctatgg acgattctga cggtggaatc 420
gcaattccca ttatttgggg taccgatgcc gtacacggtc acggcaatgt aactggcgca 480
accatattcc cgcataacat aggccttggt gctgcacgca acccagcgct tatcgaaaaa 540
attggcgaaa taacggcaaa agaagtacgc gcaaccggca ttgaatggat atttggccca 600
actttggccg tagcgcaaaa cgatttatgg ggccgcactt acgaaagcta ctcggaagac 660
ccagccatag tggccgacta cgccagtgcc atggtggtag gtatgcaggg caaagtggac 720
gacagcgatt ttctgtccac taatcgcgta gttgccacag caaagcactt tttagctgac 780
ggcggtacct taggaggcaa cgatcaaggt gatgcgcgca taagcgaaga agagttggtg 840
caaattcata atgcgggcta tgtgcctgcc attgaatcgg gcgtgcaaac ggttatggcc 900
agtttctctt tgtggaatgg cgtaaaaatg catggtaaca actacctact tacccaagca 960
cttaaagagc gtatggggtt tgatggtttt atagtagggg attggaatgg ccacgggcag 1020
gtacctgggt gcaccaacga atcttgccct caatcgctaa acgccggttt agatatgtac 1080
atggtgcctt acgattggaa aaaactgtac agaaacttaa ttagccaagt gcaatcgggt 1140
gaaattgccc caagccgttt agatgacgct gtacgccgta ttcttcgggt aaaaattcgc 1200
gctaatttgt gggctgcgaa accttcagag cgaattaatc tagccactat tgacgaggtg 1260
gttggccacg caaaccaccg tgaggtagcg cggcaggcgg tgcgagaaag tttagtattg 1320
ttaaaaaata aaaatagcgt actgcctatt gctgccaata aaaccgtgct ggttgcaggt 1380
gacggcgccg ataatattgg caaacaatct ggcggttgga gtgtaagctg gcagggcact 1440
ggtaacacca atgcatcctt ccccggtggt acatctattt ataaaggtat tgccgatgca 1500
gtcactcagg gcggcggtaa agctacgctt tctgtggatg gcagctacaa aactaaaccc 1560
gatgttgcca ttgtggtaat aggcgaagac ccttacgccg aaggccaagg cgaccgcaat 1620
agtttagagt tcgagccggt gaataaaaaa tcgcttgagc tattaaaaaa attaaaagca 1680
gatggcatac ccgttgtaac agtatttatt tctggccgac ctatgtgggc taacccagaa 1740
attaacgcgt ctgatgcatt tgttgccgcg tggttacctg gctctgaagg gcagggcgta 1800
gcagatgtac ttataggcaa cgccaacggc aagcctcgtt ttgatttcaa gggcaccttg 1860
tcgttctctt ggcctaagct gccgacccaa ggcttgctca acccaacgca ccccaactac 1920
gacccgttat ttaaattggg atacgggcta acttatgcct cgagtgaaac tggcccagag 1980
caattggcgg aagatgttga aggtgtagat aaaggctcaa ccggcgacat taatttttat 2040
gttggccgca cattagagcc gtgggaagtg tttgttcgaa ctcctgaaag ttcgcagcgt 2100
ttaagtggcc catttgcaga cttaggcaat gccagtgtgc gtaccagtga tatgcaggta 2160
caagaagatg cccttacttt tacttggggc ggtagctgga tgtctattct gggaatagaa 2220
ggagggcgcg gttacgacct ttcttcgcaa tataaagaag gcggagtaat aagctttaac 2280
ttcaattcaa tagatatggc taaaggcgat ttaaaagtac aaatggcctg tggtgaaggt 2340
tgcacgcgtg aagtagatat cacaactatc gcacgcgact tggaaggcaa aggctggcag 2400
tcgttaacag tgcccttagc gtgctttgca cacgaaggcg acgatttcac ccatattact 2460
gcgccgttta acttatttgc cggtggaaaa ggtcaagttg ctgtagccaa cattcgcata 2520
ctgcgcgccg gtacacaaac cgtgccgtgt gtattgccta aagatgtttc cgtaacgcca 2580
gagccgctga atgctagctg ggcgatagat tggtggatgc cgcgccacaa agaaaaactg 2640
gcgcgtatcc agcaaggtaa tgtggattta ctaatgattg gcgattccat tacccacggc 2700
tgggaagatg caggtaaaga cgtgtgggcg caatattacg cgcaccgcaa tgcagtggac 2760
ttaggcttta gtggcgaccg aaccgaaaac gtattgtggc gcttacagca cggcgaagca 2820
gacggtatta agcctaaagt ggcagtggtt atgattggta ccaacaatgc cggccatcgt 2880
cacgagcctt cgcactacac agccaagggt gttgcggctg tcgttgctga attgcaaaaa 2940
cgattgcctg aaacaaagat attattactg ggtatattcc ctcgcggcga aaccagtgaa 3000
gaccctttgc gggtattaaa tgccaaaacc aatactcttt tggcgaaaat ggccgacgga 3060
gagaaggtgg tgtatttgaa tatcaataaa acgtttttag atgaaaacgg cgtattgcct 3120
aaagatataa tgcccgacct attgcacccc aatgaaaagg ggtacgcatt gtgggcgaaa 3180
gcgatggaac ccacccttaa aaaaatgctg ggcgaatag 3219
<210> 35
<211> 862
<212> PRT
<213> Microbulbifer degradans
<400> 35
Met Leu Lys Lys Ile Asn Lys Lys Gly Leu Ala Leu Ser Leu Ala Ile
1 5 10 15
Ala Ala Met Leu Ser Gly Cys Asn Glu Gly Asp Ser Asn Lys Thr Lys
20 25 30
Pro Ser Ala Glu Thr Leu Ser Ala Thr Gln Ala Ser Asn Thr Val Ala
35 40 45
Asn Pro Ser Ile Trp Pro Lys Val Thr Ser Lys Val Ala Lys Asp Ala
50 55 60
Lys Met Glu Ala Asp Ile Ser Ala Ile Leu Ser Gly Met Thr Leu Glu
65 70 75 80
Gln Lys Val Ala Gln Met Ile Gln Pro Glu Ile Arg Ala Phe Ser Lys
85 90 95
Glu Asp Met Lys Lys Tyr Gly Phe Gly Ser Tyr Leu Asn Gly Gly Gly
100 105 110
Ala Phe Pro Asn Asp Asn Lys His Ser Thr Met Ala Asp Trp Val Ala
115 120 125
Leu Ala Asp Asp Met Tyr Glu Ala Ser Ile Asp Asp Ser Ile Asp Gly
130 135 140
Ser Thr Ile Pro Thr Met Trp Gly Thr Asp Ala Val His Gly His Asn
145 150 155 160
Asn Val Val Lys Ala Thr Ile Phe Pro His Asn Ile Gly Leu Gly Ala
165 170 175
Met His Asn Pro Lys Leu Met Gln Gln Ile Gly Ala Ala Thr Ala Lys
180 185 190
Val Val Gln Val Thr Gly Ile Asp Trp Val Phe Ala Pro Thr Val Ala
195 200 205
Val Val Arg Asp Asp Arg Trp Gly Arg Thr Tyr Glu Gly Tyr Ser Glu
210 215 220
Asp Pro Ala Ile Val Lys Glu Tyr Ala Arg Ala Met Val Ile Gly Met
225 230 235 240
Gln Gly Glu Ala Asn Ser Glu Ala Phe Met Gly Asp Gly Thr Val Ile
245 250 255
Ala Thr Ala Lys His Phe Leu Gly Asp Gly Gly Thr Asp Lys Gly Asp
260 265 270
Asp Gln Gly Asn Asn Leu Ser Thr Glu Gln Glu Leu Ile Asp Ile His
275 280 285
Ala Gln Gly Tyr Ile Ser Ala Ile Glu Glu Gly Val Gln Thr Ile Met
290 295 300
Ala Ser Phe Asn Ser Trp Asn Gly Glu Lys Met His Gly Asn Lys Ser
305 310 315 320
Leu Leu Thr Asp Val Leu Lys Lys Gln Met Gly Phe Asp Gly Leu Val
325 330 335
Val Gly Asp Trp Asp Gly His Gly Gln Val Lys Gly Cys Ser Asn Ala
340 345 350
Ser Cys Ala Gln Ala Ile Asn Ala Gly Val Asp Ile Ile Met Val Pro
355 360 365
Asn Glu Trp Lys Pro Met Phe Glu Asn Thr Val Ala Gln Val Lys Ser
370 375 380
Gly Glu Ile Ser Glu Ala Arg Ile Asn Asp Ala Val Thr Arg Ile Leu
385 390 395 400
Arg Val Lys Met Arg Ala Gly Ile Phe Asp Gly Val Lys Pro Ser Asp
405 410 415
Arg Ala Phe Ala Ala Glu Glu Lys Tyr Leu Gly Ser Ala Glu Asn Arg
420 425 430
Ala Ile Ala Arg Gln Ala Val Arg Glu Ser Leu Val Leu Leu Lys Asn
435 440 445
Gln Asn Lys Leu Leu Pro Leu Asp Arg Lys Met Asn Val Leu Met Ala
450 455 460
Gly Ser Gly Ala Asp Asn Ile Gly Lys Gln Ser Gly Gly Trp Thr Leu
465 470 475 480
Ser Trp Gln Gly Thr Gly Asn Val Asn Ser Asp Phe Pro Gly Ala Thr
485 490 495
Ser Ile Tyr Asp Gly Val Asn Gln Val Val Ser Ser Ala Gly Gly Lys
500 505 510
Val Glu Leu Ser Glu Asn Gly Asn Tyr Gln Ala Lys Pro Asp Val Ala
515 520 525
Ile Val Val Phe Gly Glu Asn Pro Tyr Ala Glu Gly Val Gly Asp Ile
530 535 540
Glu Gly Ile Glu Tyr Gln Leu Asn Asn Lys Arg Asp Ile Asn Leu Leu
545 550 555 560
Gln Lys Leu Lys Ala Asp Gly Ile Pro Val Val Ser Val Phe Leu Thr
565 570 575
Gly Arg Pro Leu Trp Val Asn Lys Glu Leu Asn Ala Ser Asp Ala Phe
580 585 590
Val Ala Ala Trp Leu Pro Gly Ser Glu Gly Val Gly Val Ser Asp Val
595 600 605
Leu Phe Lys Lys Ala Asp Gly Ser Ile Asn Tyr Asp Phe Lys Gly Lys
610 615 620
Leu Thr Tyr Ser Trp Pro Lys Tyr Asp Asp Gln Val Val Ile Asn Lys
625 630 635 640
Gly Asp Lys Asp Tyr Ala Pro Leu Tyr Pro Tyr Gly Tyr Gly Leu Thr
645 650 655
Tyr Ser Asp Val Asp Thr Gln Gly Asp Asp Leu Pro Glu Glu Thr Lys
660 665 670
Val Lys Ile Gly Arg Ala Asp Asp Glu Pro Met Ala Ile Phe Asp Ser
675 680 685
Leu Pro Gln Ser Asp Leu Gly Phe Phe Leu Gly Asp Lys Ala Asn Trp
690 695 700
Val Val Pro Ile Ala Thr Ser Val Val Thr Thr His Asn Ser Asp Asn
705 710 715 720
Leu Thr Met Arg Thr Tyr Asn Trp Lys Val Gln Glu Asp Ala Arg Gln
725 730 735
Leu Ile Trp Lys Gly Asp Ser Lys Ala Asn Ala Phe Phe Ala Trp Pro
740 745 750
Asp Pro His Asn Met Gln Gly Met Leu Glu His Lys Ala Ala Tyr Ser
755 760 765
Phe Ser Ile Lys Val Asp Lys Ala Pro Ala Gly Asp Leu Thr Leu Gly
770 775 780
Ile His Cys Met Glu Glu Cys Gly Lys Lys Leu Val Leu Asn Glu Ala
785 790 795 800
Leu Ser Lys Ile Pro Ala Gly Glu Trp Gly Glu Leu Thr Ile Asp Leu
805 810 815
Ala Cys Ile Ala Asp Ala Glu Ala Leu Ala Glu Val Arg Ser Pro Phe
820 825 830
Met Leu Ser Thr Asp Ala Pro Ala Ser Ile Val Phe Gly Asp Val Lys
835 840 845
Leu Val Pro Gly Gly Ala Asp Ser Ala Ala Ile Lys Cys Asp
850 855 860
<210> 36
<211> 2540
<212> DNA
<213> Microbulbifer degradans
<400> 36
atgctcaaaa agataaacaa gaaaggtctt gctttaagct tagcaattgc agcaatgcta 60
agcggctgca acgaaggcga cagcaacaaa accaaaccaa gtgcggaaac cctctccgct 120
actcaagcca gtaacactgt agccaacccc agcatttggc ccaaggtaac tagcaaggtt 180
gccaaagacg ccaaaatgga agcagatata agcgcaatac tcagcggtat gacccttgag 240
caaaaagtag cccaaatgat ccaacccgaa attcgtgcct tcagcaaaga agacatgaaa 300
aagtatggtt ttggctccta ccttaacggt ggcggcgcat tccctaacga caacaaacat 360
tccaccatgg ccgactgggt tgccctagcc gacgacatgt atgaagcctc tatagacgac 420
agcatagacg gcagcactat tccaaccatg tggggtaccg atgcagtaca cggccacaac 480
aacgtggtta aagcgactat tttcccacac aacattggcc ttggcgccat gcataacccc 540
aagctcatgc agcaaatagg cgctgccacg gctaaagtgg tacaagttac tggtatcgac 600
tgggtatttg cgcccactgt tgcggtagtg cgcgacgacc gctggggccg tacttacgag 660
ggctactctg aagaccccgc catagtaaaa gaatacgctc gcgccatggt tattggcatg 720
cagggcgaag ccaatagcga agcgtttatg ggtgacggca ctgttatagc caccgccaaa 780
cactttttgg gcgatggcgg caccgacaaa ggcgacgacc aaggcaacaa cttatccacc 840
gaacaagaat taattgatat tcacgcccaa ggctatataa gcgccattga agaaggtgtg 900
caaactatca tggcatcttt caatagctgg aatggcgaaa agatgcacgg caataaatct 960
ctgcttaccg atgtccttaa aaagcaaatg ggctttgacg gtttggtggt tggcgattgg 1020
gatggccacg gccaagtaaa aggttgctct aatgcaagct gtgcccaagc catcaacgcc 1080
ggtgtcgata tcatcatggt acccaatgag tggaaaccca tgttcgaaaa caccgttgca 1140
caagttaaaa gcggcgaaat ctctgaagcg cgaattaacg atgcagttac ccgtatttta 1200
cgtgtaaaaa tgcgcgctgg tattttcgac ggtgttaaac catcggatcg cgccttcgca 1260
gcagaagaaa aatacctagg ctctgccgaa aaccgcgcta tcgctcgtca agctgtacgc 1320
gaatcgttag tgttgcttaa aaaccaaaac aaactgctgc cattagaccg caaaatgaac 1380
gttttaatgg cgggttctgg cgcagacaac atcggcaagc aaagtggtgg ttggacatta 1440
agctggcagg gtactggcaa cgtgaacagc gacttccctg gcgcaacatc tatttacgac 1500
ggcgttaacc aagtagtgag cagcgctggc ggtaaagtag agctaagcga aaacggcaac 1560
taccaagcca aaccagatgt agcgattgta gtatttggtg aaaaccctta cgcagaaggc 1620
gtaggcgata ttgaaggtat tgaataccaa ctaaacaata agcgcgatat caatttgtta 1680
caaaaactca aagccgatgg cattcctgtt gtatcggtat tcttaaccgg tcgtccactt 1740
tgggtaaaca aagagcttaa tgcctccgat gcttttgttg cagcttggct gccaggctct 1800
gaaggtgtag gcgtttctga tgtgctattc aaaaaagccg acggtagtat taactacgac 1860
tttaaaggca agctaactta ctcttggcca aagtatgatg accaagtagt aataaacaaa 1920
ggcgacaaag attacgcccc gctttaccct tatggttacg gcttaaccta cagcgatgtt 1980
gacacccaag gtgacgactt acctgaagaa accaaagtta aaattggccg cgctgacgac 2040
gagccaatgg ccatcttcga cagcctaccc caaagcgacc tcggcttctt ccttggcgac 2100
aaagccaact gggtagtacc tattgcaaca agtgtagtta caacgcacaa cagcgataac 2160
ctaaccatgc gcacctacaa ctggaaagta caagaagatg ctcgccagtt aatttggaaa 2220
ggcgacagca aagccaatgc cttctttgca tggccagacc cacacaatat gcaaggcatg 2280
ttagaacaca aagcggctta cagctttagc attaaagtag ataaagcacc cgctggcgac 2340
ctaacactag gcatacactg catggaagaa tgcggtaaaa aacttgtgct taacgaagcg 2400
cttagcaaaa ttcctgctgg tgagtgggga gagctaacaa tagatctagc ttgcatagca 2460
gatgccgaag ccttggccga agttcgctca cccttcatgc taagcaccga tgcacccgca 2520
tctatcgtgt ttggcgatgt 2540
<210> 37
<211> 862
<212> PRT
<213> Microbulbifer degradans
<400> 37
Met Leu Lys Lys Ile Asn Lys Lys Gly Leu Ala Leu Ser Leu Ala Ile
1 5 10 15
Ala Ala Met Leu Ser Gly Cys Asn Glu Gly Asp Ser Asn Lys Thr Lys
20 25 30
Pro Ser Ala Glu Thr Leu Ser Ala Thr Gln Ala Ser Asn Thr Val Ala
35 40 45
Asn Pro Ser Ile Trp Pro Lys Val Thr Ser Lys Val Ala Lys Asp Ala
50 55 60
Lys Met Glu Ala Asp Ile Ser Ala Ile Leu Ser Gly Met Thr Leu Glu
65 70 75 80
Gln Lys Val Ala Gln Met Ile Gln Pro Glu Ile Arg Ala Phe Ser Lys
85 90 95
Glu Asp Met Lys Lys Tyr Gly Phe Gly Ser Tyr Leu Asn Gly Gly Gly
100 105 110
Ala Phe Pro Asn Asp Asn Lys His Ser Thr Met Ala Asp Trp Val Ala
115 120 125
Leu Ala Asp Asp Met Tyr Glu Ala Ser Ile Asp Asp Ser Ile Asp Gly
130 135 140
Ser Thr Ile Pro Thr Met Trp Gly Thr Asp Ala Val His Gly His Asn
145 150 155 160
Asn Val Val Lys Ala Thr Ile Phe Pro His Asn Ile Gly Leu Gly Ala
165 170 175
Met His Asn Pro Lys Leu Met Gln Gln Ile Gly Ala Ala Thr Ala Lys
180 185 190
Val Val Gln Val Thr Gly Ile Asp Trp Val Phe Ala Pro Thr Val Ala
195 200 205
Val Val Arg Asp Asp Arg Trp Gly Arg Thr Tyr Glu Gly Tyr Ser Glu
210 215 220
Asp Pro Ala Ile Val Lys Glu Tyr Ala Arg Ala Met Val Ile Gly Met
225 230 235 240
Gln Gly Glu Ala Asn Ser Glu Ala Phe Met Gly Asp Gly Thr Val Ile
245 250 255
Ala Thr Ala Lys His Phe Leu Gly Asp Gly Gly Thr Asp Lys Gly Asp
260 265 270
Asp Gln Gly Asn Asn Leu Ser Thr Glu Gln Glu Leu Ile Asp Ile His
275 280 285
Ala Gln Gly Tyr Ile Ser Ala Ile Glu Glu Gly Val Gln Thr Ile Met
290 295 300
Ala Ser Phe Asn Ser Trp Asn Gly Glu Lys Met His Gly Asn Lys Ser
305 310 315 320
Leu Leu Thr Asp Val Leu Lys Lys Gln Met Gly Phe Asp Gly Leu Val
325 330 335
Val Gly Asp Trp Asp Gly His Gly Gln Val Lys Gly Cys Ser Asn Ala
340 345 350
Ser Cys Ala Gln Ala Ile Asn Ala Gly Val Asp Ile Ile Met Val Pro
355 360 365
Asn Glu Trp Lys Pro Met Phe Glu Asn Thr Val Ala Gln Val Lys Ser
370 375 380
Gly Glu Ile Ser Glu Ala Arg Ile Asn Asp Ala Val Thr Arg Ile Leu
385 390 395 400
Arg Val Lys Met Arg Ala Gly Ile Phe Asp Gly Val Lys Pro Ser Asp
405 410 415
Arg Ala Phe Ala Ala Glu Glu Lys Tyr Leu Gly Ser Ala Glu Asn Arg
420 425 430
Ala Ile Ala Arg Gln Ala Val Arg Glu Ser Leu Val Leu Leu Lys Asn
435 440 445
Gln Asn Lys Leu Leu Pro Leu Asp Arg Lys Met Asn Val Leu Met Ala
450 455 460
Gly Ser Gly Ala Asp Asn Ile Gly Lys Gln Ser Gly Gly Trp Thr Leu
465 470 475 480
Ser Trp Gln Gly Thr Gly Asn Val Asn Ser Asp Phe Pro Gly Ala Thr
485 490 495
Ser Ile Tyr Asp Gly Val Asn Gln Val Val Ser Ser Ala Gly Gly Lys
500 505 510
Val Glu Leu Ser Glu Asn Gly Asn Tyr Gln Ala Lys Pro Asp Val Ala
515 520 525
Ile Val Val Phe Gly Glu Asn Pro Tyr Ala Glu Gly Val Gly Asp Ile
530 535 540
Glu Gly Ile Glu Tyr Gln Leu Asn Asn Lys Arg Asp Ile Asn Leu Leu
545 550 555 560
Gln Lys Leu Lys Ala Asp Gly Ile Pro Val Val Ser Val Phe Leu Thr
565 570 575
Gly Arg Pro Leu Trp Val Asn Lys Glu Leu Asn Ala Ser Asp Ala Phe
580 585 590
Val Ala Ala Trp Leu Pro Gly Ser Glu Gly Val Gly Val Ser Asp Val
595 600 605
Leu Phe Lys Lys Ala Asp Gly Ser Ile Asn Tyr Asp Phe Lys Gly Lys
610 615 620
Leu Thr Tyr Ser Trp Pro Lys Tyr Asp Asp Gln Val Val Ile Asn Lys
625 630 635 640
Gly Asp Lys Asp Tyr Ala Pro Leu Tyr Pro Tyr Gly Tyr Gly Leu Thr
645 650 655
Tyr Ser Asp Val Asp Thr Gln Gly Asp Asp Leu Pro Glu Glu Thr Lys
660 665 670
Val Lys Ile Gly Arg Ala Asp Asp Glu Pro Met Ala Ile Phe Asp Ser
675 680 685
Leu Pro Gln Ser Asp Leu Gly Phe Phe Leu Gly Asp Lys Ala Asn Trp
690 695 700
Val Val Pro Ile Ala Thr Ser Val Val Thr Thr His Asn Ser Asp Asn
705 710 715 720
Leu Thr Met Arg Thr Tyr Asn Trp Lys Val Gln Glu Asp Ala Arg Gln
725 730 735
Leu Ile Trp Lys Gly Asp Ser Lys Ala Asn Ala Phe Phe Ala Trp Pro
740 745 750
Asp Pro His Asn Met Gln Gly Met Leu Glu His Lys Ala Ala Tyr Ser
755 760 765
Phe Ser Ile Lys Val Asp Lys Ala Pro Ala Gly Asp Leu Thr Leu Gly
770 775 780
Ile His Cys Met Glu Glu Cys Gly Lys Lys Leu Val Leu Asn Glu Ala
785 790 795 800
Leu Ser Lys Ile Pro Ala Gly Glu Trp Gly Glu Leu Thr Ile Asp Leu
805 810 815
Ala Cys Ile Ala Asp Ala Glu Ala Leu Ala Glu Val Arg Ser Pro Phe
820 825 830
Met Leu Ser Thr Asp Ala Pro Ala Ser Ile Val Phe Gly Asp Val Lys
835 840 845
Leu Val Pro Gly Gly Ala Asp Ser Ala Ala Ile Lys Cys Asp
850 855 860
<210> 38
<211> 2589
<212> DNA
<213> Microbulbifer degradans
<400> 38
atgctcaaaa agataaacaa gaaaggtctt gctttaagct tagcaattgc agcaatgcta 60
agcggctgca acgaaggcga cagcaacaaa accaaaccaa gtgcggaaac cctctccgct 120
actcaagcca gtaacactgt agccaacccc agcatttggc ccaaggtaac tagcaaggtt 180
gccaaagacg ccaaaatgga agcagatata agcgcaatac tcagcggtat gacccttgag 240
caaaaagtag cccaaatgat ccaacccgaa attcgtgcct tcagcaaaga agacatgaaa 300
aagtatggtt ttggctccta ccttaacggt ggcggcgcat tccctaacga caacaaacat 360
tccaccatgg ccgactgggt tgccctagcc gacgacatgt atgaagcctc tatagacgac 420
agcatagacg gcagcactat tccaaccatg tggggtaccg atgcagtaca cggccacaac 480
aacgtggtta aagcgactat tttcccacac aacattggcc ttggcgccat gcataacccc 540
aagctcatgc agcaaatagg cgctgccacg gctaaagtgg tacaagttac tggtatcgac 600
tgggtatttg cgcccactgt tgcggtagtg cgcgacgacc gctggggccg tacttacgag 660
ggctactctg aagaccccgc catagtaaaa gaatacgctc gcgccatggt tattggcatg 720
cagggcgaag ccaatagcga agcgtttatg ggtgacggca ctgttatagc caccgccaaa 780
cactttttgg gcgatggcgg caccgacaaa ggcgacgacc aaggcaacaa cttatccacc 840
gaacaagaat taattgatat tcacgcccaa ggctatataa gcgccattga agaaggtgtg 900
caaactatca tggcatcttt caatagctgg aatggcgaaa agatgcacgg caataaatct 960
ctgcttaccg atgtccttaa aaagcaaatg ggctttgacg gtttggtggt tggcgattgg 1020
gatggccacg gccaagtaaa aggttgctct aatgcaagct gtgcccaagc catcaacgcc 1080
ggtgtcgata tcatcatggt acccaatgag tggaaaccca tgttcgaaaa caccgttgca 1140
caagttaaaa gcggcgaaat ctctgaagcg cgaattaacg atgcagttac ccgtatttta 1200
cgtgtaaaaa tgcgcgctgg tattttcgac ggtgttaaac catcggatcg cgccttcgca 1260
gcagaagaaa aatacctagg ctctgccgaa aaccgcgcta tcgctcgtca agctgtacgc 1320
gaatcgttag tgttgcttaa aaaccaaaac aaactgctgc cattagaccg caaaatgaac 1380
gttttaatgg cgggttctgg cgcagacaac atcggcaagc aaagtggtgg ttggacatta 1440
agctggcagg gtactggcaa cgtgaacagc gacttccctg gcgcaacatc tatttacgac 1500
ggcgttaacc aagtagtgag cagcgctggc ggtaaagtag agctaagcga aaacggcaac 1560
taccaagcca aaccagatgt agcgattgta gtatttggtg aaaaccctta cgcagaaggc 1620
gtaggcgata ttgaaggtat tgaataccaa ctaaacaata agcgcgatat caatttgtta 1680
caaaaactca aagccgatgg cattcctgtt gtatcggtat tcttaaccgg tcgtccactt 1740
tgggtaaaca aagagcttaa tgcctccgat gcttttgttg cagcttggct gccaggctct 1800
gaaggtgtag gcgtttctga tgtgctattc aaaaaagccg acggtagtat taactacgac 1860
tttaaaggca agctaactta ctcttggcca aagtatgatg accaagtagt aataaacaaa 1920
ggcgacaaag attacgcccc gctttaccct tatggttacg gcttaaccta cagcgatgtt 1980
gacacccaag gtgacgactt acctgaagaa accaaagtta aaattggccg cgctgacgac 2040
gagccaatgg ccatcttcga cagcctaccc caaagcgacc tcggcttctt ccttggcgac 2100
aaagccaact gggtagtacc tattgcaaca agtgtagtta caacgcacaa cagcgataac 2160
ctaaccatgc gcacctacaa ctggaaagta caagaagatg ctcgccagtt aatttggaaa 2220
ggcgacagca aagccaatgc cttctttgca tggccagacc cacacaatat gcaaggcatg 2280
ttagaacaca aagcggctta cagctttagc attaaagtag ataaagcacc cgctggcgac 2340
ctaacactag gcatacactg catggaagaa tgcggtaaaa aacttgtgct taacgaagcg 2400
cttagcaaaa ttcctgctgg tgagtgggga gagctaacaa tagatctagc ttgcatagca 2460
gatgccgaag ccttggccga agttcgctca cccttcatgc taagcaccga tgcacccgca 2520
tctatcgtgt ttggcgatgt gaagttagta cctggcggtg cagatagcgc agctattaag 2580
tgtgactaa 2589
<210> 39
<211> 461
<212> PRT
<213> Microbulbifer degradans
<400> 39
Met Lys Thr Phe Asn Pro Asp Phe Val Trp Gly Ala Ala Ser Ser Ala
1 5 10 15
Tyr Gln Val Glu Gly Ala Thr Thr Thr Asp Gly Arg Gly Pro Ser Ile
20 25 30
Trp Asp Ala Phe Ser Ser Ile Pro Gly Lys Thr Tyr His Asn Gln Asn
35 40 45
Ala Asp Ile Ala Cys Asp His Tyr Asn Arg Trp Gln Glu Asp Val Ala
50 55 60
Ile Met Lys Glu Met Gly Leu Lys Ala Tyr Arg Phe Ser Ile Ser Trp
65 70 75 80
Ser Arg Ile Phe Pro Thr Gly Arg Gly Glu Val Asn Glu Lys Gly Val
85 90 95
Ala Phe Tyr Asn Asn Leu Ile Asp Glu Leu Ile Lys Asn Asp Ile Thr
100 105 110
Pro Trp Val Thr Leu Phe His Trp Asp Phe Pro Leu Ala Leu Gln Met
115 120 125
Glu Met Asp Gly Leu Leu Asn Pro Ala Ile Ala Asp Glu Phe Ala Asn
130 135 140
Tyr Ala Lys Leu Cys Phe Ala Arg Phe Gly Asp Arg Val Thr His Trp
145 150 155 160
Ile Thr Leu Asn Glu Pro Trp Cys Ser Ala Met Leu Gly His Gly Met
165 170 175
Gly Ser Lys Ala Pro Gly Arg Val Ser Lys Asp Glu Pro Tyr Ile Ala
180 185 190
Ala His Asn Leu Leu Arg Ala His Gly Lys Met Val Asp Ile Tyr Arg
195 200 205
Arg Glu Phe Gln Pro Thr Gln Lys Gly Met Ile Gly Ile Ala Asn Asn
210 215 220
Cys Asp Trp Arg Glu Pro Lys Thr Asp Ser Glu Leu Asp Lys Lys Ala
225 230 235 240
Ala Glu Arg Ala Leu Glu Phe Phe Val Ser Trp Phe Ala Asp Pro Ile
245 250 255
Tyr Leu Gly Asp Tyr Pro Ala Ser Met Arg Glu Arg Leu Gly Glu Arg
260 265 270
Leu Pro Thr Phe Ser Asp Glu Asp Ile Ala Leu Ile Lys Asn Ser Ser
275 280 285
Asp Phe Phe Gly Leu Asn His Tyr Thr Thr Met Leu Ala Glu Gln Thr
290 295 300
His Glu Gly Asp Val Val Glu Asp Thr Ile Arg Gly Asn Gly Gly Ile
305 310 315 320
Ser Glu Asp Gln Met Val Thr Leu Ser Lys Asp Pro Ser Trp Glu Gln
325 330 335
Thr Asp Met Glu Trp Ser Ile Val Pro Trp Gly Cys Lys Lys Leu Leu
340 345 350
Ile Trp Leu Ser Glu Arg Tyr Asn Tyr Pro Asp Ile Tyr Ile Thr Glu
355 360 365
Asn Gly Cys Ala Leu Pro Asp Glu Asp Asp Val Asn Ile Ala Ile Asn
370 375 380
Asp Thr Arg Arg Val Asp Phe Tyr Arg Gly Tyr Ile Asp Ala Cys His
385 390 395 400
Gln Ala Ile Glu Ala Gly Val Lys Leu Lys Gly Tyr Phe Ala Trp Thr
405 410 415
Leu Met Asp Asn Tyr Glu Trp Glu Glu Gly Tyr Thr Lys Arg Phe Gly
420 425 430
Leu Asn His Val Asp Phe Thr Thr Gly Lys Arg Thr Pro Lys Gln Ser
435 440 445
Ala Ile Trp Tyr Ser Thr Leu Ile Lys Asp Gly Gly Phe
450 455 460
<210> 40
<211> 1386
<212> DNA
<213> Microbulbifer degradans
<400> 40
atgaaaacct ttaacccaga tttcgtatgg ggagcagcca gttccgccta tcaggtagaa 60
ggcgccacca ccaccgatgg cagaggcccc agtatttggg atgcgttcag ttccattccc 120
ggtaaaacct accacaacca aaacgccgac atagcctgcg accactacaa ccgctggcaa 180
gaagacgtgg ccataatgaa agagatgggg ctaaaggctt accgcttttc tatttcttgg 240
tcgcgcatat tccctactgg gcgcggcgaa gttaacgaaa aaggcgtagc cttttacaac 300
aaccttatcg acgaattaat aaaaaacgac attacccctt gggtaaccct atttcactgg 360
gactttcctc tggcactgca aatggaaatg gacggcctac ttaaccccgc catcgccgac 420
gaattcgcca actacgccaa gctgtgtttc gcgcgctttg gcgaccgcgt tacccactgg 480
attaccctaa acgaaccttg gtgcagtgcc atgcttggcc acggcatggg cagcaaagcc 540
cctggccgcg tatctaagga tgaaccctat atagccgccc acaacttgct gcgtgcacac 600
ggcaaaatgg tagatattta ccggcgcgaa tttcagccca cacaaaaagg catgataggc 660
atagccaaca attgcgactg gcgcgaaccc aaaaccgatt ctgaattaga taaaaaagca 720
gccgagcgcg ccctagaatt ttttgtaagc tggtttgccg accccattta tttgggcgac 780
tacccagcca gcatgcgcga gcgcttgggt gagcgtttac ccacctttag cgacgaagac 840
attgcgctaa taaaaaactc tagcgacttt tttggtttga atcactacac caccatgctt 900
gccgaacaaa cccacgaagg tgacgttgtt gaagatacta ttcgcggcaa cggcggcata 960
tcggaagacc aaatggtcac cctctccaaa gacccaagct gggaacaaac cgacatggag 1020
tggagcattg tgccctgggg ctgtaaaaaa ttattaatct ggttaagcga gcgctacaac 1080
taccccgaca tttacattac cgaaaacggc tgcgccctac ccgacgaaga cgacgtaaac 1140
atagccatta acgatacacg ccgcgtagat ttttaccgcg gttatatcga tgcgtgtcac 1200
caagcaatag aggccggcgt aaaactaaaa ggctattttg catggacact tatggataac 1260
tacgaatggg aagaaggcta caccaaacgc tttggcttaa accatgtaga tttcaccaca 1320
ggcaaacgca cacctaaaca gtctgcaatt tggtatagca cgttaattaa agatggtggg 1380
ttctag 1386
<210> 41
<211> 444
<212> PRT
<213> Microbulbifer degradans
<400> 41
Met Asn Arg Leu Thr Leu Pro Pro Ser Ser Arg Leu Arg Ser Lys Glu
1 5 10 15
Phe Thr Phe Gly Val Ala Thr Ser Ser Tyr Gln Ile Glu Gly Gly Ile
20 25 30
Asp Ser Arg Leu Pro Cys Asn Trp Asp Thr Phe Cys Glu Gln Pro Asn
35 40 45
Thr Ile Ile Asp Asn Thr Asn Gly Ala Ile Ala Cys Asp His Ile Asn
50 55 60
Arg Trp Gln Asp Asp Ile Glu Leu Ile Ala Asn Leu Gly Val Asp Ala
65 70 75 80
Tyr Arg Phe Ser Ile Ala Trp Gly Arg Val Ile Asn Leu Asp Gly Ser
85 90 95
Leu Asn Asn Glu Gly Val Thr Phe Tyr Lys Asn Ile Leu Thr Lys Leu
100 105 110
Arg Glu Lys Asn Leu Lys Ala Tyr Ile Thr Leu Tyr His Trp Asp Leu
115 120 125
Pro Gln His Leu Glu Asp Ala Gly Gly Trp Leu Asn Arg Asp Thr Ala
130 135 140
Tyr Lys Phe Arg Asp Tyr Val Asn Leu Ile Thr Gln Ala Leu Asp Asp
145 150 155 160
Asp Val Phe Cys Tyr Thr Thr Leu Asn Glu Pro Phe Cys Ser Ala Tyr
165 170 175
Leu Gly Tyr Glu Ile Gly Val His Ala Pro Gly Ile Lys Asp Leu Ala
180 185 190
Ser Gly Arg Lys Ala Ala His His Leu Leu Leu Ala His Gly Leu Ala
195 200 205
Met Gln Val Leu Arg Lys Asn Cys Pro Asn Ser Leu Ser Gly Ile Val
210 215 220
Leu Asn Met Ser Pro Cys Tyr Ala Gly Ser Asn Ala Gln Ala Asp Ile
225 230 235 240
Asp Ala Ala Lys Arg Ala Asp Asp Leu Leu Phe Gln Trp Tyr Ala Gln
245 250 255
Pro Leu Leu Thr Gly Cys Tyr Pro Asp Ala Ile Asn Ser Leu Pro Asp
260 265 270
Asn Ala Lys Pro Pro Ile Cys Glu Gly Asp Met Ala Leu Ile Ser Gln
275 280 285
Pro Leu Asp Tyr Leu Gly Leu Asn Tyr Tyr Thr Arg Ala Val Phe Phe
290 295 300
Ala Asp Gly Asn Gly Gly Phe Thr Glu Gln Val Pro Glu Gly Val Glu
305 310 315 320
Leu Thr Asp Met Gly Trp Glu Val Tyr Pro Gln Gly Leu Thr Asp Leu
325 330 335
Leu Ile Asp Leu Asn Gln Arg Tyr Thr Leu Pro Pro Leu Leu Ile Thr
340 345 350
Glu Asn Gly Ala Ala Met Val Asp Glu Leu Val Asn Gly Glu Val Asn
355 360 365
Asp Ile Ala Arg Ile Asn Tyr Phe Gln Thr His Leu Gln Ala Val His
370 375 380
Asn Ala Ile Glu Gln Gly Val Asp Val Arg Gly Tyr Phe Ala Trp Ser
385 390 395 400
Leu Met Asp Asn Phe Glu Trp Ala Leu Gly Tyr Ser Lys Arg Phe Gly
405 410 415
Ile Thr Tyr Val Asp Tyr Gln Thr Gln Lys Arg Thr Leu Lys Ala Ser
420 425 430
Gly His Ala Phe Ala Glu Phe Val Ser Ser Arg Ser
435 440
<210> 42
<211> 1335
<212> DNA
<213> Microbulbifer degradans
<400> 42
atgaatagac ttacactacc gccttcttct cgtttgcgca gcaaagagtt tacctttggt 60
gttgcaacgt cgtcttacca aattgaaggc ggcatagatt ctcgcctgcc ctgtaattgg 120
gatacgttct gtgagcagcc caataccatt attgataaca ccaacggcgc cattgcttgc 180
gaccacataa atagatggca agacgatata gaacttattg ccaacctagg ggtagatgcc 240
taccgctttt ctattgcgtg gggccgtgtt attaatttag acggcagcct caataatgaa 300
ggcgttacat tttacaaaaa tattttaact aagcttcgcg aaaagaattt aaaagcttat 360
ataacgctat accactggga cttgccacaa catttagaag atgctggcgg ctggcttaac 420
cgcgataccg cctacaagtt tcgcgactat gtaaacctta taacccaagc gcttgatgac 480
gatgtatttt gctacacaac gttaaacgag cccttttgca gtgcctacct tggctatgaa 540
attggtgtac acgcaccggg tataaaagac ttagccagtg ggcgcaaagc cgcacaccat 600
ttattacttg cccatggctt agctatgcaa gtgctgcgaa aaaactgccc caatagttta 660
agcggcatag tgttaaacat gagcccttgt tacgccggca gcaacgcaca agcagatata 720
gatgcagcaa aacgcgcgga cgatttatta tttcagtggt atgcacaacc gctacttact 780
ggctgctacc ctgatgcaat aaacagcctg ccagacaatg ccaaaccacc tatttgtgaa 840
ggcgacatgg cgttaataag ccaaccttta gattatttag gccttaacta ctatacccgc 900
gcagtatttt ttgccgacgg taatggcggt tttaccgaac aagtacctga gggtgtagag 960
ctaaccgata tgggctggga agtttacccg caaggcttaa ccgatttact aatagaccta 1020
aaccaacgct ataccctacc cccgttactt attaccgaaa acggcgcagc aatggtggac 1080
gaacttgtta acggcgaagt taacgatatt gcccgaataa attattttca aacccattta 1140
caagcggtac acaacgccat tgaacaaggt gttgatgtac gcggttattt tgcttggagc 1200
ctaatggata attttgagtg ggcactgggt tacagcaaac gattcggtat tacctatgta 1260
gattaccaaa cacaaaagcg aacgctaaaa gccagcggcc acgcatttgc tgagtttgtc 1320
tcgagtagga gctaa 1335
<210> 43
<211> 866
<212> PRT
<213> Microbulbifer degradans
<400> 43
Met Leu Leu Ser Leu Lys Asn Thr Gln Leu Lys Arg Ser Met Asn Met
1 5 10 15
Asn Leu Lys His Leu Phe Leu Val Ala Leu Ala Leu Asn Ile Ala Ala
20 25 30
Cys Asn Val Lys Glu Pro Ala Ala Thr Asn Asp Asn His Ile Ser Tyr
35 40 45
Gln Ala Ala Arg Glu Ala Arg Leu Ala Lys Val Glu Ala Glu Val Glu
50 55 60
Arg Leu Leu Pro Leu Leu Thr Leu Glu Glu Lys Ala Ser Leu Val His
65 70 75 80
Ala Asn Ser Lys Phe Ser Ile Ala Ser Ile Glu Arg Leu Gly Ile His
85 90 95
Glu Met Trp Met Ser Asp Gly Pro His Gly Val Arg Tyr Gln Ile Glu
100 105 110
Arg His Gly Trp Ala Pro Ala Gly Trp Thr Asp Asp Asn Ser Thr Tyr
115 120 125
Leu Pro Pro Leu Thr Thr Val Ala Ala Ser Trp Asn Pro Glu Ile Ala
130 135 140
Ala Leu His Gly Asp Val Leu Gly Ala Glu Ala Arg His Arg Arg Lys
145 150 155 160
Asp Val Ile Leu Gly Pro Gly Val Asn Leu Ala Arg Leu Pro Leu Tyr
165 170 175
Gly Arg Asn Phe Glu Tyr Met Gly Glu Asp Pro Phe Leu Ala Ser Arg
180 185 190
Leu Ala Val Ala Glu Ile Lys Ala Ile Gln Glu Asn Asp Val Ala Ala
195 200 205
Cys Ile Lys His Phe Ala Leu Asn Asn Gln Glu Leu Asn Arg Thr Gly
210 215 220
Val Asn Ala Lys Pro Asp Glu Arg Thr Leu Arg Glu Val Tyr Leu Pro
225 230 235 240
Ala Phe Glu Ala Ala Val Lys Glu Ala Gly Val His Thr Ile Met Gly
245 250 255
Ala Tyr Asn Glu Phe Arg Gly Thr Asn Ala Asn Gln Ser Lys His Leu
260 265 270
Val Met Asp Ile Leu Lys Gly Glu Trp Gly Tyr Lys Gly Val Leu Leu
275 280 285
Thr Asp Trp Asn Val Asp Ile Asn Thr Tyr Asp Ala Ala Val Asn Gly
290 295 300
Leu Asp Ile Glu Met Gly Thr Asn Val Asp Ser Tyr Asp Asp Tyr Met
305 310 315 320
Leu Ala Gln Pro Met Ile Asp Met Ile Lys Ala Gly Ser Ile Pro Glu
325 330 335
Ser Val Leu Asp Asp Lys Val Arg Arg Ile Leu Arg Val Gln Leu Ser
340 345 350
Ile Gly Met Met Asp Lys Tyr Arg Leu Ser Gly Glu Arg Asn Thr Ala
355 360 365
Lys His His Glu Ala Ala Arg Lys Ile Ala Ser Glu Gly Ile Val Leu
370 375 380
Leu Lys Asn Glu Asn Ile Leu Pro Leu Asn Lys Asn Lys Ile Lys Asn
385 390 395 400
Val Leu Val Leu Gly Pro Asn Ala Asp Lys Val His Gly Leu Gly Gly
405 410 415
Gly Ser Ser Glu Val Pro Ala Leu Tyr Glu Ile Thr Pro Leu Gln Gly
420 425 430
Leu Lys Gln Lys Leu Gly Asp Asn Val Asn Ile Thr Val Met Arg Ala
435 440 445
Arg Tyr Asp Gly Val Leu Met Pro Ile Ala Ser Asp Tyr Val Thr Ser
450 455 460
Arg His Trp Thr Gly Thr Pro Ala Trp Asn Met Val Arg Tyr Ser Asp
465 470 475 480
Ala Ala Arg Thr Gln Ala Ile Gly Asp Ser Ala Ile Val Asp Ser Ala
485 490 495
Tyr Ser Ser Pro Ala Gly Thr Thr Lys Glu Tyr Val Thr Met Thr Ala
500 505 510
Thr Ile Lys Pro Leu Lys Ser Gly Glu His Thr Leu Lys Thr Ser Val
515 520 525
Met Gly Asp Phe Glu Leu Lys Ile Asn Gly Lys Thr Thr Val Lys His
530 535 540
Ser Ser Thr Ser Gly Asp Val Val Thr Gln Lys Ile Ala Leu Asn Gly
545 550 555 560
Gly Glu Thr Tyr Ser Phe Glu Ile Leu Tyr Ser Gly Asn Lys Asn Phe
565 570 575
Thr Leu Gly Trp Asp Ala Pro Gly Asp Leu Phe Thr Ala Glu Lys Glu
580 585 590
Tyr Ile Ala Ala Ala Lys Lys Ala Asp Val Val Phe Tyr Phe Gly Gly
595 600 605
Leu Thr His Gly Asp Asp Arg Glu Ala Ile Asp Arg Pro His Met Lys
610 615 620
Leu Pro Asn His Gln Asp Pro Val Ile Ser Lys Val Leu Ala Ala Asn
625 630 635 640
Pro Asn Thr Val Val Phe Leu Ile Ala Gly Ser Ala Val Glu Met Pro
645 650 655
Trp Ala Asp Lys Ala Lys Ala Ile Val Trp Gly Trp Tyr Gly Gly Met
660 665 670
Glu Ala Gly Asn Ala Tyr Ala Asp Met Leu Phe Gly Asp Thr Asn Pro
675 680 685
Ser Gly Lys Met Pro Ile Thr Leu Pro Lys Ala Leu Glu Asp Thr Ala
690 695 700
Pro Ile Ala Leu Asn Asp Tyr Asn Pro Val Glu Ser Leu Tyr Thr Glu
705 710 715 720
Gly Val Phe Ile Gly Tyr Arg Trp Phe Glu Lys Gln Asn Ile Glu Pro
725 730 735
Leu Phe Pro Phe Gly His Gly Leu Ser Tyr Thr Gln Phe Lys Tyr Asn
740 745 750
Asn Ile Lys Leu Ser Ser Ala Asn Ile Lys Gly Asp Gln Thr Val Thr
755 760 765
Val Ser Ala Thr Ile Thr Asn Thr Gly Lys Val Ala Gly Ala Glu Val
770 775 780
Val Gln Leu Tyr Leu His Asp Glu Gln Ala Ser Val Glu Arg Pro Ala
785 790 795 800
Lys Glu Leu Lys Gly Phe Gln Lys Val Phe Leu Lys Pro Gly Glu Ser
805 810 815
Lys Ala Val Asn Ile Thr Leu Asn Lys Arg Ala Leu Ser Phe Trp Asp
820 825 830
Glu Asn Ser Asn Asp Trp Leu Ala Glu Thr Gly Lys Phe Asn Val Leu
835 840 845
Leu Gly Ala Ser Val Ser Asp Ile Arg Leu Gln Thr Ser Phe Gln Tyr
850 855 860
Gln Gln
865
<210> 44
<211> 2601
<212> DNA
<213> Microbulbifer degradans
<400> 44
atgctgctaa gcttaaaaaa cactcaactc aaaagaagta tgaacatgaa ccttaaacac 60
ctctttctgg ttgctttggc gctaaatatt gctgcgtgca atgtaaaaga gcccgcggcg 120
acaaatgata accacattag ctaccaagcc gctcgcgaag cgcgcttggc aaaagttgaa 180
gccgaagttg aacgcctgct gccactatta acactagaag aaaaagcctc tttggttcat 240
gcgaacagca aattctctat cgcctctatc gagcggctag gcattcacga aatgtggatg 300
tctgatggcc cccacggcgt gcgctatcaa atcgaacgcc acggctgggc accagcaggc 360
tggacagatg acaactccac ttacttacca ccgcttacta ccgtagccgc cagctggaac 420
cccgaaatag ctgcccttca cggcgatgta ctcggcgcag aagctcgcca ccgccgtaaa 480
gatgtaatat taggcccagg cgtaaactta gctcgcctgc cactttatgg tcgtaacttt 540
gaatatatgg gtgaagaccc cttcttggca tcacgtcttg ctgtggcaga aattaaagcc 600
attcaagaaa atgacgtggc cgcctgtatc aaacatttcg cgcttaacaa tcaagagctg 660
aatcgcaccg gcgtaaacgc caaacccgat gaacgcacat tacgcgaagt gtatttaccc 720
gccttcgaag ccgccgttaa agaagcgggc gtgcacacca taatgggggc ctacaatgaa 780
tttcgcggta ccaacgccaa ccaaagcaaa catttagtaa tggatattct aaaaggcgaa 840
tggggctaca aaggcgtgtt actcacagac tggaacgtag atatcaacac ttacgatgcc 900
gctgttaacg gcctcgatat cgaaatgggt acaaatgtag atagctacga cgactacatg 960
cttgcccaac caatgatcga catgattaaa gcgggcagca ttccagagtc agtacttgat 1020
gataaagttc gtcgcatact gcgcgtgcaa ctcagcatag gcatgatgga caaataccgc 1080
ttatctggtg agcgcaatac tgccaagcat cacgaagctg cacgcaaaat tgcatctgaa 1140
ggtattgtgc tactaaaaaa tgaaaacatt ctgccgctaa ataaaaacaa aattaaaaac 1200
gtattggtgc ttggccccaa cgcagacaaa gtgcacggtt taggcggtgg ctcgtcagaa 1260
gtgccagcac tttatgaaat aaccccgtta caagggttaa aacagaagct gggagataat 1320
gtaaacatta ccgttatgcg cgcacgctat gacggtgtgt taatgcctat cgccagtgat 1380
tatgttactt ctcgtcactg gaccggcaca cctgcatgga acatggtgcg ttactcggat 1440
gctgcgcgca cccaagctat tggcgactcc gccattgttg attcggctta ttcttcgcct 1500
gcaggcacga ctaaagaata cgtcaccatg accgccacaa ttaaaccgtt aaaatcgggc 1560
gagcacacac tcaaaacatc ggtgatgggc gatttcgaat taaaaattaa cggtaaaacc 1620
acagtaaaac atagcagcac tagcggcgat gtagtaaccc aaaaaatcgc cctcaacggc 1680
ggtgaaacat acagcttcga aattttatac agcggcaata aaaactttac cttgggctgg 1740
gatgcaccgg gagatttatt taccgcagaa aaagaataca tagccgccgc gaaaaaagcg 1800
gatgtagtgt tttactttgg cggcctaacc cacggcgacg accgcgaagc aattgaccgc 1860
cctcacatga agctgcctaa ccatcaagac ccagttatta gcaaagtatt agctgcaaac 1920
ccgaacacgg ttgtattttt aattgcaggc tctgctgtag aaatgccgtg ggccgataaa 1980
gctaaagcta ttgtgtgggg ctggtatggc ggtatggagg ccggtaacgc ctacgccgat 2040
atgctatttg gcgataccaa ccccagcggc aaaatgccaa taactttacc aaaggcactg 2100
gaagatactg ctccaatcgc actgaatgat tacaaccctg ttgaatcact ctacaccgag 2160
ggcgtgttta ttggttaccg ctggttcgaa aaacaaaaca tcgagccgct attcccgttc 2220
ggtcatggtt tgtcttatac ccagtttaag tacaacaata taaagctctc tagcgcgaac 2280
attaaaggcg accaaaccgt caccgtaagc gcaaccatta ccaatactgg caaagtggcc 2340
ggcgctgaag ttgtacaact gtatttgcat gacgagcaag caagcgtaga acgcccagca 2400
aaagaactta aaggtttcca aaaagtgttt ttaaagccgg gtgaaagcaa agcggtaaat 2460
attacgctta ataaacgcgc cctttcattt tgggatgaaa acagcaacga ctggcttgca 2520
gaaacaggta aatttaatgt gctattgggc gcatcagtaa gcgatatacg cttacaaact 2580
agcttccaat accagcagta a 2601
<210> 45
<211> 811
<212> PRT
<213> Microbulbifer degradans
<400> 45
Met Lys Phe Gly His Phe Asp Asp Lys Ala Arg Glu Tyr Val Ile Thr
1 5 10 15
Asp Pro Lys Thr Pro Tyr Pro Trp Ile Asn Tyr Leu Gly Asn Glu Asp
20 25 30
Phe Phe Ser Leu Val Ser Asn Thr Gly Gly Gly Tyr Ser Phe Tyr Lys
35 40 45
Asp Ala Lys Phe Arg Arg Leu Thr Arg Tyr Arg Tyr Asn Asn Val Pro
50 55 60
Val Asp Asn Gly Gly Lys Tyr Phe Tyr Ile Asn Asp Ser Gly Asp Val
65 70 75 80
Trp Ser Pro Gly Trp Lys Pro Val Lys Ala Glu Leu Asp Ala Tyr Ser
85 90 95
Cys Ala His Gly Leu Ser Tyr Thr Arg Ile Thr Gly Glu Arg Asn Gly
100 105 110
Ile Gln Ala Glu Val Leu Ser Phe Ile Pro Leu Gly Thr Trp Ala Glu
115 120 125
Ile Gln Lys Val Ser Leu Lys Asn Thr Ser Gly Ala Thr Lys Lys Phe
130 135 140
Lys Leu Phe Ser Phe Ala Glu Trp Cys Leu Trp Asn Ala Glu Asp Asp
145 150 155 160
Met Thr Asn Phe Gln Arg Asn Phe Ser Thr Gly Glu Val Glu Val Glu
165 170 175
Asp Ser Val Ile Tyr His Lys Thr Glu Phe Lys Glu Arg Arg Asn His
180 185 190
Tyr Ala Phe Tyr Ser Val Asn Ala Pro Ile Gln Gly Phe Asp Thr Asp
195 200 205
Arg Asp Lys Trp Lys Gly Leu Tyr Asn Asp Phe Asp Lys Pro Asp Ala
210 215 220
Val Phe Glu Gly Glu Pro Arg Asn Ser Glu Ala His Gly Trp Ser Pro
225 230 235 240
Ile Ala Ser His Tyr Leu Glu Val Glu Leu Ala Pro Gly Glu Ser Lys
245 250 255
Asp Leu Ile Phe Val Leu Gly Tyr Ile Glu Val Ala Pro Glu Asn Lys
260 265 270
Trp Glu Ser Lys Gly Val Ile Asn Lys Ser Pro Ala Lys Glu Leu Ile
275 280 285
Ala Arg Phe Asp Ser Val Glu Lys Val Asp Ala Glu Leu Thr Lys Leu
290 295 300
Ala Asp Tyr Trp Ala Asn Leu Leu Ser Thr Tyr Ser Val Glu Ser Gly
305 310 315 320
Asp Glu Lys Leu Asp Arg Met Val Asn Ile Trp Asn Gln Tyr Gln Cys
325 330 335
Met Val Thr Phe Asn Met Ser Arg Ser Ala Ser Phe Phe Glu Ser Gly
340 345 350
Ile Gly Arg Gly Met Gly Phe Arg Asp Ser Asn Gln Asp Leu Ile Gly
355 360 365
Phe Val His Gln Val Pro Glu Arg Ala Arg Glu Arg Ile Ile Asp Ile
370 375 380
Ala Ser Thr Gln Phe Glu Asp Gly Ser Ala Tyr His Gln Tyr Gln Pro
385 390 395 400
Leu Thr Lys Arg Gly Asn Asn Ala Ile Gly Gly Asn Phe Asn Asp Asp
405 410 415
Pro Leu Trp Leu Ile Leu Ser Thr Thr Asp Tyr Ile Lys Glu Thr Gly
420 425 430
Asp Phe Ser Ile Leu Glu Glu Gln Val Pro Tyr Asp Asn Asp Ala Ser
435 440 445
Lys Ala Thr Ser His Phe Glu His Leu Lys Arg Ser Phe Tyr His Thr
450 455 460
Val Asn Asn Leu Gly Pro His Gly Leu Pro Leu Ile Gly Arg Ala Asp
465 470 475 480
Trp Asn Asp Cys Leu Asn Leu Asn Cys Phe Ser Glu Asp Pro Asn Glu
485 490 495
Ser Phe Gln Thr Thr Gly Asn Lys Thr Gly Arg Thr Ala Glu Ser Leu
500 505 510
Met Ile Ala Gly Leu Phe Val Leu Tyr Gly Asn Glu Phe Val Lys Leu
515 520 525
Cys Arg Glu Ile Gly Gln Asp Gly Glu Ala Ala Glu Ala Gln Ala His
530 535 540
Ile Asp Gln Met Val Glu Ala Val Lys Lys His Gly Trp Asp Gly Glu
545 550 555 560
Trp Phe Leu Arg Ala Tyr Asp Tyr Tyr Gly Lys Lys Val Gly Ser Lys
565 570 575
Glu Asn Glu Glu Gly Lys Ile Phe Ile Glu Ser Gln Gly Phe Cys Gly
580 585 590
Met Ala Gly Ile Gly Leu Glu Asp Gly Leu Val Glu Lys Ser Met Asp
595 600 605
Ser Val Lys Glu Trp Leu Asp Cys Asp Tyr Gly Ile Val Leu Gln Gln
610 615 620
Pro Ala Phe Thr Lys Tyr Tyr Ile Glu Tyr Gly Glu Ile Ser Thr Tyr
625 630 635 640
Pro Ala Gly Tyr Lys Glu Asn Ala Gly Ile Phe Cys His Asn Asn Pro
645 650 655
Trp Ile Met Ile Thr Glu Thr Leu Leu Gly Arg Gly Asp Lys Ala Phe
660 665 670
Glu Tyr Tyr Arg Lys Ile Ala Pro Ala Tyr Leu Glu Glu Ile Ser Asp
675 680 685
Leu His Lys Val Glu Pro Tyr Ala Tyr Cys Gln Met Ile Ala Gly Lys
690 695 700
Asp Ala Tyr Leu Pro Gly Glu Gly Lys Asn Ser Trp Leu Thr Gly Thr
705 710 715 720
Ala Ser Trp Asn Phe Ala Ala Ile Thr Gln Tyr Ile Leu Gly Val Lys
725 730 735
Pro Asp Tyr Ser Gly Leu Ala Ile Asn Pro Cys Ile Pro Ser Ser Trp
740 745 750
Asp Gly Phe Lys Val Thr Arg Lys Tyr Arg Gly Ala Thr Tyr Asn Ile
755 760 765
Ile Val Thr Asn Pro Thr His Val Ser Lys Gly Val Lys Ser Leu Thr
770 775 780
Leu Asn Gly Asn Ala Ile Asp Gly Tyr Ile Val Pro Pro Gln Gln Ala
785 790 795 800
Gly Thr Val Cys Asn Val Glu Val Thr Leu Gly
805 810
<210> 46
<211> 2436
<212> DNA
<213> Microbulbifer degradans
<400> 46
atgaaatttg ggcactttga cgacaaagca cgcgagtatg taattaccga cccgaaaact 60
ccctacccgt ggataaacta cttaggcaac gaagacttct tcagcctagt atctaacact 120
gggggtggct acagttttta caaagatgca aagttccgtc gtttaacacg ctatagatac 180
aacaacgtac ccgtagacaa cggcggtaaa tatttttaca tcaatgatag tggcgatgta 240
tggagccccg gttggaagcc ggtaaaagca gagctagacg catacagctg cgctcacggc 300
cttagctaca cccgcattac cggcgaaaga aacggcattc aagcggaagt acttagcttt 360
atccctctcg gcacttgggc cgaaattcaa aaagttagcc ttaagaatac ctctggcgct 420
accaaaaaat ttaaactgtt ttctttcgcc gaatggtgcc tatggaacgc agaagatgac 480
atgaccaact tccaacgcaa cttctccacc ggtgaagtag aggtggaaga ctctgttatt 540
tatcacaaga cagaatttaa agagcgccgc aatcattacg cattctactc tgtaaacgca 600
ccaattcagg gcttcgacac cgacagagac aaatggaaag gcttgtacaa cgattttgat 660
aaacccgatg ccgtttttga aggcgagcct cgcaactccg aagcgcacgg ctggtcgcca 720
attgcatctc actatctaga agtggagctc gcaccaggcg aaagcaaaga cttaattttt 780
gtgcttggct atatagaagt tgccccagaa aacaaatggg aatcaaaggg cgttatcaac 840
aagtctccag ccaaagaact tattgcgcgt ttcgatagcg tagaaaaagt agatgccgag 900
ttaaccaagc tagccgatta ttgggcaaat ttgctttcta cttacagcgt agaaagtggc 960
gacgaaaagc tagaccgcat ggtaaatatt tggaaccaat accagtgtat ggtgacattt 1020
aatatgagtc gctctgcgtc tttcttcgaa tctggcattg gccgtggtat gggcttccgc 1080
gattccaatc aggatttgat aggctttgta caccaagtac ccgagcgcgc ccgcgaacgc 1140
ataattgata ttgcttctac tcagtttgaa gacggttcgg cctaccacca gtatcagcct 1200
ttaaccaaac gcggcaacaa cgcaattggc ggcaacttta acgatgaccc tctttggcta 1260
atcctttcta ccaccgatta cataaaagag actggcgatt tctctatttt agaagagcaa 1320
gtgccttacg ataatgatgc gagcaaagcc acaagtcatt ttgaacattt aaagcgctcg 1380
ttttatcaca cggttaataa tttaggccca catggcttgc cacttattgg tcgcgccgac 1440
tggaacgact gcctaaacct aaactgcttt agtgaagacc ctaacgaatc attccaaacc 1500
acgggcaaca aaaccggcag aacggctgag tcgttaatga ttgcaggttt atttgtttta 1560
tacggcaacg agtttgtaaa actgtgccgt gaaataggcc aagacggaga agcggcagaa 1620
gcccaagccc atattgacca aatggtagaa gctgtgaaaa agcacggctg ggatggcgag 1680
tggtttttgc gtgcttacga ctactacggt aaaaaagtag gcagtaaaga aaacgaagaa 1740
ggcaaaatat ttatcgaatc gcaaggtttc tgcggcatgg caggaatcgg cctagaagac 1800
ggccttgtcg aaaaatcgat ggattctgtt aaagaatggt tagattgcga ttacggtatt 1860
gtgttgcagc aaccggcgtt taccaagtac tacatagagt atggtgaaat ctccacctac 1920
cctgctggct acaaagagaa cgcaggtatc ttctgccaca acaacccgtg gattatgatc 1980
accgaaactt tgcttggccg cggtgacaaa gcctttgaat actaccgcaa aattgcacct 2040
gcatacctag aggaaattag cgatcttcac aaagtagagc cttacgccta ctgccagatg 2100
attgcaggta aagatgccta cttacctggc gagggtaaaa actcatggct aacagggacc 2160
gcttcgtgga acttcgctgc aattactcag tacattttag gcgtaaaacc agactatagc 2220
ggtttagcaa ttaacccttg cataccgtct agctgggatg gctttaaagt tacccgtaag 2280
tatcgcggcg caacctataa catcatcgta accaacccaa cccatgtaag caaaggcgta 2340
aaatcgctca ccctaaatgg caacgctatt gatggctaca tagtgccacc gcaacaagct 2400
ggcaccgtat gtaacgtaga agttacattg ggctaa 2436
<210> 47
<211> 788
<212> PRT
<213> Microbulbifer degradans
<400> 47
Met Leu Lys Ala Ile Asn Asn Gly Glu Arg Tyr Gln Leu Thr Ser Pro
1 5 10 15
Thr Ala Met Pro Gln Ser Ala Ser Phe Leu Trp Asn Lys Lys Met Met
20 25 30
Ile Gln Val Asn Cys Arg Gly Tyr Ala Val Ala Gln Phe Met Gln Pro
35 40 45
Glu Pro Ala Lys Tyr Ala Tyr Ala Pro Asn Leu Glu Ala Lys Thr Phe
50 55 60
Met Gln Pro Glu Gln Pro Tyr Tyr Ala His His Pro Gly Arg Phe Phe
65 70 75 80
Tyr Ile Lys Asp Glu Glu Thr Gly Glu Ile Phe Ser Ala Pro Tyr Glu
85 90 95
Pro Val Arg Ser Gln Leu Asn Asn Phe Ser Phe Asn Ala Gly Lys Ser
100 105 110
Asp Ile Ser Trp His Ile Ala Ala Leu Gly Ile Glu Val Glu Leu Cys
115 120 125
Leu Ser Leu Pro Val Asp Asp Val Val Glu Leu Trp Glu Leu Lys Ile
130 135 140
Lys Asn Gly Gly Ala Gln Pro Arg Lys Leu Ser Ile Tyr Pro Tyr Phe
145 150 155 160
Pro Val Gly Tyr Met Ser Trp Met Asn Gln Ser Gly Asp Tyr Ser Gln
165 170 175
Thr Ala Gly Gly Ile Ile Ala Ser Cys Val Thr Pro Tyr Gln Lys Val
180 185 190
Ala Asp Tyr Phe Lys Asn Lys Asp Phe Lys Asp Lys Thr Phe Phe Leu
195 200 205
His Glu Thr Ala Pro Ala Ala Trp Glu Val Asn Gln Lys Asn Phe Glu
210 215 220
Gly Glu Gly Gly Leu His Asn Pro Asn Ala Ile Gln Gln Glu Thr Leu
225 230 235 240
Gly Cys Gly Asn Ala Leu Tyr Glu Thr Pro Thr Ala Val Leu Gln Tyr
245 250 255
Arg Arg Glu Leu Ala Ala Gln Glu Gln Gln Thr Phe Arg Phe Ile Phe
260 265 270
Gly Pro Ala Phe Asp Glu Ser Glu Ala Ile Ala Leu Arg Asn Lys Tyr
275 280 285
Leu Ser Ala Glu Gly Phe Ala Lys Ala Lys Ser Glu Tyr Gln Thr Tyr
290 295 300
Ile Thr Ser Gly Lys Gly Cys Leu Gln Ile Asn Thr Pro Asp Pro Glu
305 310 315 320
Leu Asn Asn Phe Val Asn His Trp Leu Pro Arg Gln Val Phe Tyr His
325 330 335
Gly Asp Val Asn Arg Leu Thr Thr Asp Pro Gln Thr Arg Asn Tyr Ile
340 345 350
Gln Asp Asn Met Gly Met Ser Tyr Ile Lys Pro Asn Ile Thr Arg Gln
355 360 365
Ala Phe Leu His Ala Leu Ser Gln Gln Glu Glu Ser Gly Ala Met Pro
370 375 380
Asp Gly Ile Leu Leu Leu Glu Gly Ala Glu Leu Lys Tyr Ile Asn Gln
385 390 395 400
Ile Pro His Thr Asp His Cys Val Trp Leu Pro Val Cys Met Gln Ala
405 410 415
Tyr Leu Asp Glu Thr Asn Asp Tyr Ala Leu Leu Asp Glu Ile Val Pro
420 425 430
Tyr Ala Ser Gly Glu Lys Arg Glu Thr Val Glu Gln His Met His His
435 440 445
Ala Met Arg Trp Leu Leu Gln Ala Arg Asp Glu Arg Gly Leu Ser Phe
450 455 460
Ile Ala Gln Gly Asp Trp Cys Asp Pro Met Asn Met Val Gly Tyr Lys
465 470 475 480
Gly Lys Gly Val Ser Gly Trp Leu Ser Val Ala Thr Ala Tyr Ala Leu
485 490 495
Asn Leu Trp Ala Asp Val Cys Glu Gln Arg Gln Gln Asn Ser Cys Ala
500 505 510
Asn Glu Phe Arg Gln Gly Ala Lys Asp Ile Asn Ala Ala Val Asn Lys
515 520 525
His Ile Trp Asp Gly Glu Trp Phe Gly Arg Gly Ile Thr Asp Asp Gly
530 535 540
Val Leu Phe Gly Thr Ser Lys Asp Lys Glu Gly Arg Ile Phe Leu Asn
545 550 555 560
Pro Gln Ser Trp Ala Ile Leu Gly Gly Ala Ala Asp Glu Gln Lys Ile
565 570 575
Pro Cys Leu Leu Asp Ala Val Glu Gln Gln Leu Glu Thr Pro Tyr Gly
580 585 590
Val Met Met Leu Ala Pro Ala Phe Thr Ala Met Arg Asp Asp Val Gly
595 600 605
Arg Val Thr Gln Lys Phe Pro Gly Ser Ala Glu Asn Gly Ser Val Tyr
610 615 620
Asn His Ala Ala Val Phe Tyr Ile Phe Ser Leu Leu Ser Ile Gly Glu
625 630 635 640
Ser Glu Arg Ala Tyr Lys Leu Leu Arg Gln Met Leu Pro Gly Pro Asp
645 650 655
Glu Ala Asp Leu Leu Gln Arg Gly Gln Leu Pro Val Phe Ile Pro Asn
660 665 670
Tyr Tyr Arg Gly Ala Tyr Tyr Gln His Pro Arg Thr Ala Gly Arg Ser
675 680 685
Ser Gln Leu Phe Asn Thr Gly Thr Val Ser Trp Val Tyr Arg Cys Leu
690 695 700
Ile Glu Gly Val Phe Gly Leu Lys Gly Ser Pro Gln Gly Leu Val Val
705 710 715 720
Gln Pro Gln Leu Pro Val Ala Trp Gln Thr Ala Glu Ala Val Arg Glu
725 730 735
Phe Arg Gly Ala Thr Phe Asn Val Ser Tyr Arg Lys Ser Ser Asp Ile
740 745 750
Lys Glu Met Glu Ile Gln Leu Asn Glu Ser Val Ile Ser Gly Asn Thr
755 760 765
Ile Ser Asp Ile Thr Ala Gly Ala Thr Tyr Gln Leu Thr Val Leu Leu
770 775 780
Pro Ala Thr His
785
<210> 48
<211> 2367
<212> DNA
<213> Microbulbifer degradans
<400> 48
atgttaaaag ccattaacaa cggcgaacgc tatcaactca ctagccctac cgctatgccg 60
caaagcgcat cgtttttatg gaataaaaaa atgatgatac aagtaaattg ccgcggctac 120
gccgttgcgc aatttatgca gccagaacca gccaaatacg cttacgcacc caatctggaa 180
gcaaaaacat ttatgcaacc agagcaaccc tattacgcgc atcaccccgg gcgctttttc 240
tatataaaag atgaagagac aggcgagatt ttttcggcac cctacgagcc tgtgcgcagc 300
cagctgaaca actttagctt taacgcaggc aagagcgata taagctggca tattgccgct 360
ttaggcattg aagtagagct atgtcttagc ctgccggtgg acgatgtagt agaattgtgg 420
gaactaaaaa taaaaaacgg cggcgcgcaa cctcgtaaac tcagtattta cccgtacttt 480
cctgtgggtt acatgtcgtg gatgaatcaa tctggtgact acagccaaac cgccggcggc 540
attattgcca gctgcgtaac gccttatcaa aaagtcgccg actactttaa gaataaagac 600
tttaaagata aaacgttctt tcttcacgaa accgccccag cagcatggga agtaaaccag 660
aaaaacttcg aaggcgaagg cgggttgcac aaccccaacg ccatacaaca agaaacgctg 720
ggctgcggca acgcattgta cgaaacgccc acagcggtat tgcaataccg ccgcgaactt 780
gcagcgcaag agcagcaaac ctttcgcttt atttttggcc cagcatttga cgagagcgaa 840
gccattgcac tgcgcaataa gtatttatct gccgaaggtt ttgccaaagc aaaaagcgaa 900
taccaaacct atataacgag cggcaaaggc tgcttgcaaa ttaacacccc agacccagaa 960
ctaaacaact ttgtaaacca ctggctaccg cgccaagtgt tttatcacgg cgatgtaaac 1020
cggttaacca ccgacccgca aacgcgcaat tatattcaag acaatatggg catgagctac 1080
attaagccca acattacgcg gcaggcgttt ttacatgcct taagccagca ggaagaaagc 1140
ggtgcaatgc ccgacggcat tttattgctt gaaggcgccg agcttaaata cataaaccaa 1200
ataccccata ccgatcactg cgtttggctg ccggtgtgta tgcaagccta tttggatgaa 1260
accaatgact acgccctatt agacgaaata gtaccctatg cgagtggcga gaagcgcgaa 1320
actgttgagc aacatatgca tcacgctatg cgctggcttt tgcaagcacg cgacgaacgc 1380
ggcctaagct ttatcgcaca gggcgactgg tgcgacccca tgaacatggt gggctacaag 1440
ggcaaagggg tatccggctg gctttcagtc gctaccgctt atgcattaaa cctgtgggca 1500
gatgtttgcg aacaacggca gcaaaacagt tgcgccaacg aatttagaca gggcgctaaa 1560
gatataaacg cggcggtaaa caagcatatt tgggatggcg aatggtttgg ccgcggcatt 1620
acagatgacg gcgtactgtt tggcaccagc aaagataaag aaggcagaat ttttctaaac 1680
ccacaaagct gggcaatact tggcggcgcc gccgacgaac aaaaaatccc atgcctgcta 1740
gacgcagtag agcaacaact ggaaacccct tacggcgtaa tgatgctggc ccccgcgttt 1800
accgccatgc gcgatgacgt aggccgagtt acccaaaaat tcccaggctc tgcagaaaac 1860
ggctctgttt ataatcacgc ggcggtgttt tatatattta gcttgttatc cattggcgag 1920
agcgaacgcg catataaact gctacgccaa atgctgcctg ggccagatga agccgatctt 1980
ttacagcgcg gccaactgcc agtattcata cctaactatt atcgcggcgc atactaccag 2040
cacccccgca ccgccggtcg ctctagccag ctctttaata cgggtacagt ctcgtgggtt 2100
taccgctgct taattgaagg ggtattcggc ttgaaaggct cgccacaagg cttagttgta 2160
caaccgcaac tgcctgtcgc ctggcaaaca gcagaagccg ttagggaatt tagaggcgca 2220
acgtttaacg tgagctaccg caaaagcagc gatataaaag aaatggaaat acagctaaat 2280
gaatcggtaa taagtggcaa caccatctcc gacatcaccg ccggcgcgac ctatcaatta 2340
accgttctat tacctgccac acactaa 2367
<210> 49
<211> 574
<212> PRT
<213> Microbulbifer degradans
<400> 49
Met Lys Lys Leu Ile Lys Pro Thr Leu Ser Trp Val Ala Gly Val Ala
1 5 10 15
Leu Ser Leu Gly Ile Ala Gln Gly Ala Gly Ala Gln Asn Val Gln Phe
20 25 30
Val Gly Asn Ile Thr Thr Asn Gly Ser Val Arg Asn Asp Phe Met Asp
35 40 45
Tyr Trp Asp Gln Ile Thr Pro Glu Asn Glu Gly Lys Trp Gly Ser Val
50 55 60
Glu Arg Ser Arg Asp Asn Tyr Ser Trp Ser Gly Gln Asp Ala Ala Tyr
65 70 75 80
Asn Phe Ala Arg Ala Asn Gly Ile Pro Phe Lys Ala His Thr Leu Val
85 90 95
Trp Gly Ser Gln Tyr Pro Ser Trp Ile Asn Asn Leu Ser Asn Ala Glu
100 105 110
Lys Ala Ala Glu Ile Glu Glu Trp Ile Arg Asp Tyr Cys Asn Arg Tyr
115 120 125
Pro Ala Thr Asp Ile Ile Asp Val Val Asn Glu Ala Thr Pro Gly His
130 135 140
Ala Pro Ala Asn Tyr Ala Arg Asp Ala Phe Gly Asp Asn Trp Ile Ile
145 150 155 160
Lys Ser Phe Gln Leu Ala Arg Gln Tyr Cys Pro Asn Ala Thr Leu Val
165 170 175
Leu Asn Asp Tyr Asn Val Leu Ile Trp Asn Thr Asn Asp Phe Ile Ala
180 185 190
Met Ala Gln Pro Val Ile Asn Ala Gly Val Val Asp Ala Leu Gly Leu
195 200 205
Gln Ala His Gly Leu Glu Ser Leu Ser Ala Ser Gln Leu Lys Ser Thr
210 215 220
Leu Asp Arg Ile Ala Asn Leu Gly Leu Pro Ile Tyr Ile Ser Glu Tyr
225 230 235 240
Asp Val Arg Ser Thr Asn Asp Gln Glu Gln Leu Arg Ile Met Arg Asp
245 250 255
Gln Phe Pro Val Phe Tyr Asn His Pro Ser Val Arg Gly Ile Thr Leu
260 265 270
Trp Gly Tyr Met Val Gly Ala Thr Trp Arg Glu Gly Thr Gly Leu Ile
275 280 285
Arg Ala Asp Gly Ser His Arg Pro Ala Met Thr Trp Leu Met Asn Tyr
290 295 300
Leu Glu Asn Asn Arg Gly Gly Ser Thr Ser Ser Ser Ser Ser Ser Ser
305 310 315 320
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Ser Ser Ser Gly
325 330 335
Gly Pro Ser Ser Leu Thr Val Glu Leu Glu Ser Leu Ser Asp Ser Ser
340 345 350
Asn Phe Ser Pro Phe Ser Val Gln Ser Asp Ser Ser Ala Ala Gly Gly
355 360 365
Gln Tyr Val Val Trp Pro Asn Asn Gly Asn Gln Ile Val Ser Ser Pro
370 375 380
Ser Asp Ser Ala Ser Gly Gln Ile Gln Val His Phe Thr Leu Ser Gln
385 390 395 400
Ser Ala Asp Val Gln Phe Gln Ile Arg Ala Asp Leu Ala Asn Gly Asn
405 410 415
Asp Asp Ser Phe Tyr Tyr Lys Leu Asp Ser Gly Ser Trp Asn Thr Gln
420 425 430
Asn Asn Ala Ser Thr Ser Gly Trp Gly Thr Leu Thr Pro Ala Thr Phe
435 440 445
Ser Asn Val Ser Thr Gly Ser His Thr Leu His Ile Leu Arg Arg Glu
450 455 460
Asp Gly Ala Lys Leu Asp Lys Val Thr Leu Asn Ala Ser Val Gly Gln
465 470 475 480
Val Ser Ala Ser Thr Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
485 490 495
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Ala Ala Val Ala
500 505 510
Ser Cys Asp Gly Val Asn Glu Tyr Pro Ser Trp Thr Ala Lys Asp Trp
515 520 525
Ser Gly Gly Asp Tyr Asn His Ala Asn Ser Gly Asp Tyr Met Ser Tyr
530 535 540
Gln Gly Val Leu Tyr Arg Ala Asn Trp Tyr Thr Ala Thr Val Pro Gly
545 550 555 560
Ser Asp Ser Ser Trp Thr Arg Val Gly Asp Cys Asn Phe Val
565 570
<210> 50
<211> 1725
<212> DNA
<213> Microbulbifer degradans
<400> 50
atgaagaagt taattaagcc tacgctatcg tgggttgcag gagttgcttt gtcgctgggt 60
attgcccagg gagctggtgc tcaaaatgtg cagtttgttg gtaatatcac taccaatggt 120
agtgtgcgca acgacttcat ggactactgg gatcagatta ccccagagaa tgaaggcaag 180
tggggctcgg tggagcgcag tcgcgacaac tattcatgga gcggccaaga tgccgcctac 240
aattttgccc gtgccaacgg catcccattt aaagcacata ctttagtatg gggcagtcaa 300
tatcccagct ggataaacaa tttaagtaac gcggaaaaag ccgctgagat tgaagagtgg 360
attcgcgatt actgtaaccg ttacccagcc accgatatta tcgatgttgt caatgaagca 420
acgccgggcc acgcgccagc aaattatgct cgcgatgcat ttggcgacaa ctggataatc 480
aagtccttcc agctggcacg tcagtactgc cccaatgcca cgttagtgtt gaacgactac 540
aacgtactta tttggaacac caatgatttt atagcgatgg cccagccggt aattaacgcc 600
ggagtagtag atgctttggg tttgcaggcc cacggtctgg agagcctttc tgcgtcgcaa 660
ttaaaatcga ctctggatcg tatcgccaat ttgggtttgc caatttatat ctctgaatac 720
gatgttcgca gcaccaatga tcaggagcag ctgcgtatta tgcgtgatca attccctgta 780
ttttacaacc acccaagtgt acgtggcata actttgtggg gttatatggt gggggccacc 840
tggcgagaag gcacaggttt gattcgtgct gatggctccc atcgtccagc gatgacctgg 900
ttgatgaact atctggagaa caatcgtggc ggctcaacct cttcaagtag ttcatcctcc 960
tctagcagtt cgtcttccag tagttcttct tcgggaagtt cctctggtgg cccaagtagt 1020
ttgacggtag agctagaatc tttgtcggat agcagtaact tttcgccatt ctcggtacag 1080
agtgacagca gcgcagcggg cggccagtac gtggtatggc ctaacaacgg caatcagatt 1140
gtaagctcac cctccgatag cgccagcggg caaattcagg tgcactttac cctgtcgcaa 1200
tcggcggatg tgcaatttca gattcgtgca gacctagcta acggcaatga cgactctttt 1260
tattacaagc tggactcagg ctcttggaat actcagaaca acgcttccac gtctggttgg 1320
ggcaccttaa ccccagcaac tttctctaat gtatccacag gatcccatac cttacacatt 1380
ctccgcagag aagatggggc gaaactcgat aaggtaactc tgaatgcttc agttggtcag 1440
gtttccgcta gtacaggcag tagctccagc tcttccagca gctccagttc atccagcagt 1500
tctagttctt caagcagcag cggcgcggca gtcgcaagtt gtgacggtgt taatgaatac 1560
cccagctgga cagcaaaaga ttggtctggg ggtgactata accacgccaa tagcggtgac 1620
tacatgagct atcagggtgt tctatatcga gcaaactggt acaccgcaac tgttcctgga 1680
agtgattctt cctggactcg agttggcgat tgcaattttg tgtaa 1725
<210> 51
<211> 619
<212> PRT
<213> Microbulbifer degradans
<400> 51
Met Ile Lys Leu Arg Gln Ser Ile His Gly Ala Leu Ala Arg Thr Val
1 5 10 15
Gly Ile Ile Ser Ile Ser Thr Gly Leu Val Leu Ala Ala Gln Thr Ala
20 25 30
Ser Ala Ala Cys Glu Tyr Thr Val Thr Asn Ser Trp Gly Ser Gly Phe
35 40 45
Thr Ala Ser Ile Arg Ile Thr Asn Asp Thr Gly Ser Ala Val Asn Gly
50 55 60
Trp Ala Val Asn Trp Gln Tyr Ala Asn Gly Asn Arg Val Thr Asn Ser
65 70 75 80
Trp Asn Ala Thr Leu Ser Gly Asn Asn Pro Tyr Ser Ala Ser Asn Ile
85 90 95
Gly Trp Asn Gly Gly Ile Gln Pro Gly Gln Ser Val Glu Phe Gly Phe
100 105 110
Gln Gly Thr Ala Asn Gly Ala Ala Glu Thr Pro Ala Val Thr Gly Ala
115 120 125
Val Cys Ala Thr Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
130 135 140
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
145 150 155 160
Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly
165 170 175
Ala Asn Cys Val Glu Met Cys Lys Trp Tyr Gln Asp Ala Pro Arg Pro
180 185 190
Leu Cys Asn Asn Gln Asn Ser Gly Trp Gly Trp Glu Asn Asn Gln Ser
195 200 205
Cys Ile Gly Arg Ala Thr Cys Glu Ser Gln Pro Ser Asn Ala Gly Gly
210 215 220
Val Val Asn Ser Cys Pro Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
225 230 235 240
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
245 250 255
Ser Thr Ser Ser Ser Ser Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser
260 265 270
Ser Ser Ser Ser Gly Ser Ala Ala Asn Leu Tyr Thr Leu Ala Asp Phe
275 280 285
Pro Ile Gly Val Ala Val Thr Ala Gly Asn Glu Ser Arg Ser Phe Leu
290 295 300
Ser Ile Ala Ala Lys Glu Ala Thr Val Lys Lys His Phe Asp Gln Ile
305 310 315 320
Thr Ala Gly Asn Ile Met Lys Met Ser Tyr Leu His Pro Ser Glu Asn
325 330 335
Ser Tyr Thr Phe Ser Gln Ala Asp Ala Met Val Asn Trp Ala Asn Ser
340 345 350
Asn Gly Val Ser Val His Gly His Thr Phe Ile Trp His Ser Asp Tyr
355 360 365
Gln Val Pro Asn Trp Met Asn Asn Tyr Ser Gly Asn Phe Ala Ser Met
370 375 380
Met Asp Thr His Val Thr Thr Ile Ala Asp His Phe Glu Gly Arg Val
385 390 395 400
Val Ser Trp Asp Val Val Asn Glu Ala Ile Asp Glu Ser Gln Ser Ser
405 410 415
Cys Tyr Arg Asn Ser Leu Phe Tyr Gln Arg Leu Gly Lys Ala Tyr Ile
420 425 430
Ala Asn Ala Phe Arg Ala Ala Arg Ala Ala Asp Pro Ser Val Glu Leu
435 440 445
Tyr Tyr Asn Asp Tyr Asp Thr Glu Gly Gly Asn Ala Asn Lys Leu Asn
450 455 460
Cys Leu Leu Gln Leu Val Asp Asp Leu Gln Ala Asn Asn Val Pro Ile
465 470 475 480
Asp Gly Val Gly Phe Gln Met His Val Gln Ile Asp Trp Pro Ser Thr
485 490 495
Ser Asn Ile Ala Ala Ala Phe Gln Ala Ile Val Asp Arg Gly Leu Lys
500 505 510
Val Lys Ile Thr Glu Leu Asp Val Pro Ile Asn Asn Pro Tyr Gly Ser
515 520 525
Gly Ser Phe Pro Gln Tyr Ser Thr Tyr Thr Ser Gln Ala Ala Ala Leu
530 535 540
Gln Lys Ala Arg Tyr Lys Ser Ile Val Lys Thr Tyr Leu Thr Val Val
545 550 555 560
Pro Ala His Leu Arg Gly Gly Leu Thr Val Trp Gly Ile Trp Asp Gly
565 570 575
Asp Ser Trp Leu Leu Asp Phe Asp Asn Arg Gln Gly Ala Asp Asp Trp
580 585 590
Pro Leu Leu Phe Ser Gly Pro Ala Asn Gly Pro Tyr Val Glu Lys Glu
595 600 605
Ala Phe Tyr Gly Val Ala Glu Ala Leu Thr Glu
610 615
<210> 52
<211> 50
<212> PRT
<213> Microbulbifer degradans
<400> 52
Met Leu Asp Glu Leu Leu Asp Glu Leu Glu Leu Leu Leu Glu Leu Glu
1 5 10 15
Leu Leu Asp Glu Glu Leu Glu Leu Glu Leu Glu Glu Leu Asp Pro Val
20 25 30
Ala His Thr Ala Pro Val Thr Ala Gly Val Ser Ala Ala Pro Leu Ala
35 40 45
Val Pro
50
<210> 53
<211> 103
<212> PRT
<213> Microbulbifer degradans
<400> 53
Met Gly Lys Ser Ala Lys Val Tyr Lys Phe Ala Ala Glu Pro Leu Glu
1 5 10 15
Leu Glu Leu Leu Glu Glu Val Glu Leu Val Leu Asp Glu Leu Glu Leu
20 25 30
Val Asp Glu Leu Glu Leu Leu Asp Glu Leu Glu Leu Leu Asp Glu Leu
35 40 45
Glu Leu Glu Glu Leu Glu Glu Leu Leu Glu Leu Asp Gly Gln Leu Phe
50 55 60
Thr Thr Pro Pro Ala Leu Asp Gly Cys Asp Ser His Val Ala Arg Pro
65 70 75 80
Ile Gln Leu Trp Leu Phe Ser Gln Pro Gln Pro Leu Phe Trp Leu Leu
85 90 95
His Lys Gly Arg Gly Ala Ser
100
<210> 54
<211> 1860
<212> DNA
<213> Microbulbifer degradans
<400> 54
atgatcaagc tacgtcaatc tatccacggc gccttggcgc gtaccgtggg cataataagt 60
ataagcaccg gacttgtact cgcagcgcaa actgcaagtg cagcctgtga atacaccgta 120
accaattcgt ggggttcggg ttttaccgcg agtattcgca taacaaacga taccggtagc 180
gcagtaaacg gttgggcggt taactggcaa tacgctaatg gcaaccgtgt aacaaattca 240
tggaacgcta cgctgtctgg caataaccct tatagcgcca gcaatattgg ttggaacggc 300
ggtattcaac ctgggcagtc ggtggaattt ggttttcaag gcacggctaa tggcgcggca 360
gaaacaccag cggtaacggg ggctgtatgt gctacagggt ctagctcttc cagctcaagc 420
tctagttctt cgtctagcag ctctagttct agcagcagtt cgagttcatc gagtagctcg 480
tctagcactt caagctcgtc atctagcagc tcttccagtt catcgggcgc aaactgtgta 540
gaaatgtgta agtggtatca agatgcgcct cgccctttat gcaataacca aaatagtggt 600
tggggttggg aaaacaacca aagttgtatc ggccgagcaa cgtgcgaatc gcaaccgtct 660
aatgcgggtg gggtggtaaa tagttgtccg tctagttcta gcagctcttc aagttcctct 720
agttcgagct cgtctagcag ttcaagctcg tctagcagtt caagctcgtc tactagttct 780
agttcatcaa gcacaagttc tacttcttca agtagttcta gctctagcgg ttctgctgca 840
aacttatata ccttggcaga tttccccatt ggcgttgctg taactgcggg taatgagagc 900
cgtagctttt tatctattgc tgcgaaagag gcaactgtta aaaaacactt cgaccaaatt 960
acagccggta acattatgaa gatgagttac ttgcacccat ccgaaaatag ctacaccttt 1020
agtcaagcgg atgccatggt taactgggca aatagcaacg gcgtaagtgt gcacggccat 1080
acttttattt ggcattccga ttaccaagta ccaaattgga tgaataatta cagcggtaat 1140
tttgcgtcta tgatggatac ccacgtaacc actattgccg atcattttga aggccgagta 1200
gtaagctggg atgtggtaaa cgaagctatc gatgagagcc aatctagttg ttatcgcaac 1260
tctttgtttt accagcgttt aggtaaagct tatattgcca atgcgttccg cgcggcccga 1320
gcagcagacc ctagcgtaga gttgtattac aacgattacg ataccgaagg tggcaatgcc 1380
aataagttaa attgcttgtt gcaattagtc gatgacttgc aagcgaacaa tgtgcctatc 1440
gatggtgtgg gctttcaaat gcacgtgcaa attgattggc ccagcaccag caatattgct 1500
gcggctttcc aagctattgt ggatcgcggc ttaaaggtaa aaattactga gctggatgtg 1560
cctattaata acccttatgg cagtggttca ttcccgcaat attcaactta cacgtcacaa 1620
gccgctgcgt tgcaaaaggc gcgttataaa tccattgtaa aaacctactt gactgttgtg 1680
ccagcgcatt tgcgcggggg cttaaccgta tggggtatat gggatggtga tagctggttg 1740
ttagattttg ataatcgtca aggcgctgat gattggccgc tattatttag tggcccagct 1800
aatggcccct atgtagaaaa agaagcattc tatggcgtgg cagaggcgct tacagaatag 1860
<210> 55
<211> 470
<212> PRT
<213> Microbulbifer degradans
<220>
<221> MOD_RES
<222> (301)..(457)
<223> Variable amino acid
<400> 55
Met Asn Cys Thr Arg Arg Asn Ile Val Lys Ala Gly Leu Leu Gly Ser
1 5 10 15
Ala Phe Val Ala Leu Pro Ala Val Ala Arg Ala Leu Pro Gly Leu Ala
20 25 30
Thr Lys Phe Arg Asp Gln Phe Tyr Val Gly Thr Ala Val Ser Ala Arg
35 40 45
Ser Leu Asn Thr Pro Ser Gly Ala Phe Ala Ala Thr Val Ala His Gln
50 55 60
Phe Asn Ala Leu Thr Ala Glu Asn Ala Met Lys Pro Ala Leu Leu Gln
65 70 75 80
Pro Gln Met Gly Glu Trp Arg Trp Gln Asp Ala Asp Ala Ile Val Arg
85 90 95
Phe Ala Glu Gln His Gln Met Leu Met His Gly His Thr Leu Val Trp
100 105 110
His Ser Gln Thr Pro Asp Trp Phe Phe Gln Asn Lys Gln Gly Glu Pro
115 120 125
Ala Asp Lys Ala Thr Leu Tyr Arg Arg Gln Glu Glu Tyr Ile Asn Ala
130 135 140
Val Val Gly Arg Tyr Lys Gly Arg Val His Ser Trp Asp Val Val Asn
145 150 155 160
Glu Ala Glu Asp Glu Gly Lys Gly Trp Arg Lys Ser His Trp Tyr Asn
165 170 175
Ile Cys Gly Pro Glu Phe Met Glu Arg Ala Phe Arg Leu Ala His Ala
180 185 190
Ala Asp Pro Lys Ala His Leu Cys Tyr Asn Asp Tyr Asn Met His Leu
195 200 205
Pro Gln Lys Arg Glu Phe Leu Val Lys Leu Phe Lys Asp Tyr Ile Lys
210 215 220
Arg Gly Val Pro Ile His Gly Val Gly Leu Gln Gly His Val Gly Leu
225 230 235 240
Asp Tyr Pro Ser Leu Asp Glu Leu Glu Lys Thr Ile Val Ala Met Ala
245 250 255
Asp Leu Gly Leu Lys Val His Ile Thr Glu Leu Asp Val Asp Val Leu
260 265 270
Pro Ala Pro Trp Gln Leu Ala Ser Ala Asp Ile Ser Thr Lys Phe Glu
275 280 285
Tyr Asp Lys Ser Leu Asn Pro Tyr Val Asp Gly Leu Xaa Xaa Xaa Xaa
290 295 300
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
305 310 315 320
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
325 330 335
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
340 345 350
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
355 360 365
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
370 375 380
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
385 390 395 400
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
405 410 415
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
420 425 430
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
435 440 445
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Met Tyr Lys Met Arg Leu
450 455 460
Pro Ala Lys Ile Leu Ala
465 470
<210> 56
<211> 1413
<212> DNA
<213> Microbulbifer degradans
<220>
<221> modified_base
<222> (902)..(1369)
<223> a, c, g, or t
<400> 56
gtgaattgta cgcgtaggaa tatagtaaaa gcaggccttc ttggctcggc attcgtcgcc 60
ctgcctgccg tggcgcgcgc gctgcctgga ttggccacga aatttcgcga tcagttttac 120
gtgggcactg cggttagtgc gcgctcactt aatacgccca gcggcgcgtt tgcagccact 180
gtcgcgcatc aattcaatgc actaaccgct gaaaacgcca tgaagcccgc cttacttcaa 240
ccacaaatgg gggagtggcg ctggcaggat gccgatgcca ttgtgagatt tgccgagcag 300
catcagatgc taatgcatgg tcacaccctt gtgtggcatt cgcaaacgcc agattggttc 360
ttccaaaaca agcagggcga accggcagac aaagcaaccc tataccgcag gcaagaggag 420
tatatcaatg ccgtagttgg gcgctataaa gggcgggtac actcgtggga tgtggtgaat 480
gaagcagaag atgagggtaa aggctggcgc aagagccact ggtataacat ttgtgggcca 540
gagtttatgg aacgagcctt tcgcttagct cacgcagcgg acccaaaagc acacttatgt 600
tacaacgatt acaatatgca cttgccgcaa aagcgcgaat ttttggttaa gttattcaaa 660
gactacatta agcgcggcgt gcctattcac ggcgtagggt tgcaggggca tgtgggctta 720
gactacccct cgctggacga gttggaaaaa accatcgtgg ccatggccga tttaggtcta 780
aaagtacaca ttacagaatt ggatgtagat gtattacccg cgccatggca actagctagc 840
gcagatataa gtactaaatt cgagtacgac aaaagcttaa acccgtacgt tgatggtttg 900
cnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 960
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1020
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1140
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1200
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1260
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1320
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnt gactatgtac 1380
aaaatgcgct tgcccgcgaa aatattagct taa 1413
<210> 57
<211> 1186
<212> PRT
<213> Microbulbifer degradans
<400> 57
Met Leu Arg Ser Thr Gln Ser Thr Pro Ile Val Lys Arg Lys Ile Ser
1 5 10 15
Ala Tyr Val Gly Trp Gly Leu Cys Val Leu Leu Ser Val Cys Thr Ala
20 25 30
Ser Ile Ser Trp Ala Gly Asn Pro Ile Val Ser His Val Tyr Thr Ala
35 40 45
Asp Pro Ala Ala Arg Val Ile Asn Gly Arg Ala Tyr Val Met Val Thr
50 55 60
His Asp Gln Asp Asn Gln Asn Asp Tyr Gly Gly Leu Ile Asp Tyr Tyr
65 70 75 80
Leu Phe Ser Ser Asp Asp Met Val Asn Trp Gln Asp His Gly Ile Val
85 90 95
Trp Asn Ser Arg Thr Asp Ser Ser Trp Ala Ser Leu Ala Tyr Ala Pro
100 105 110
Asp Phe Ile Glu Arg Asn Gly Lys Tyr Tyr Leu Tyr Phe Pro Asn Gly
115 120 125
Ala Asn Ser Ile Gly Val Ala Val Ala Asp Ser Pro Glu Gly Pro Tyr
130 135 140
Thr Asp Pro Leu Gly Arg Pro Leu Val Asp Arg Asn Thr Pro Asn Ala
145 150 155 160
Asn Val Asp Trp Leu Phe Asp Pro Gly Val Phe Ile Asp Asp Asp Gly
165 170 175
Gln Ala Phe Leu Tyr Phe Gly Gly Gly Ala Asp Gly Thr Ala Arg Val
180 185 190
Ile Arg Leu Asn Asn Asp Met Ile Ser Thr Ser Gly Ala Ala Ile Ser
195 200 205
Ile Asp Val Pro Asn Phe Phe Glu Ala Leu Tyr Met His Lys Arg Asn
210 215 220
Gly Ile Tyr Tyr Leu Ser Tyr Ser Thr Asn Pro Ser Ala Gly Met Ser
225 230 235 240
Ile Asp Tyr Met Thr Ser Asn Asn Pro Thr Ser Gly Phe Thr His Arg
245 250 255
Gly Thr Ile Leu Pro Asn Pro Trp Glu Asn Asn Ser Asn Asn Asn His
260 265 270
Gln Ser Ile Ile Glu Phe Asn Asn Glu Trp Tyr Ile Phe Tyr His Asn
275 280 285
Arg Ala Val Ala Asn Thr Arg Gly Asp Ser Thr Phe Ser Arg Ser Ile
290 295 300
Asn Val Asp Arg Leu Tyr Tyr Asn Ser Asp Gly Ser Ile Arg Glu Val
305 310 315 320
Asn Ala Ser Ser Ile Gly Val Pro Ala Val Arg Asn Val Asn Ala Phe
325 330 335
Ser Ile Asn Gln Ala Glu Thr Phe Asp Gln Glu Gly Gly Ile Glu Thr
340 345 350
Glu Pro Ser Ser Glu Gly Thr Leu Asn Ile Gln Met Gly Pro Gly Asp
355 360 365
Trp Val Lys Val Ala Asn Val Asp Phe Gly Asn Gly Ala Thr Gln Phe
370 375 380
Asn Ala Arg Val Ala Ser Ala Ile Asp Asn Ser Lys Leu Glu Ile Ile
385 390 395 400
Leu Gly Ser Leu Ser Asn Thr Pro His Ala Ser Leu Glu Ile Thr Asn
405 410 415
Thr Gly Gly Trp Gln Asn Trp Gln Thr Gln Ser Thr Ser Phe Asn Ala
420 425 430
Ile Thr Gly Val His Asp Val Tyr Leu Arg Gly Thr Ser Gly His Asn
435 440 445
Leu Asn Trp Phe Glu Phe Glu Gly Glu Asn Asn Gly Gly Ser Ser Gln
450 455 460
Leu Thr Val Glu Leu Glu Asp Leu Ala Ser Gln Ser Leu Phe Ala Pro
465 470 475 480
Leu Ser Val Arg Ser Asp Asn Met Ala Asn Asn Gly Ala Tyr Ile Glu
485 490 495
Trp Ser Asn Asp Gly Ser Asn Gln Ile Leu Ser Val Ala Ser Glu Gln
500 505 510
Ser Gln Gly Gln Ile Ser Val Pro Phe Thr Leu Ser Gln Ala Ser Asp
515 520 525
Val Glu Phe Asn Val Arg Val Asn Leu Ala Asn Gly Asn Asp Asp Ser
530 535 540
Phe Tyr Tyr Lys Leu Asn Ser Asn Ser Trp Gln Thr Phe Asn Asn Gln
545 550 555 560
Ala Thr Thr Gly Trp Gln Val Leu Thr Pro Asn Thr Phe Thr Gly Leu
565 570 575
Ser Pro Gly Asn His Ile Leu Thr Leu Leu Arg Arg Glu Asp Gly Ala
580 585 590
Lys Leu Asp Thr Leu Thr Leu Val Ala Ser Ala Gly Ser Ile Gln Thr
595 600 605
Asn Asn Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Thr Thr Ser
610 615 620
Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Gly Ala Ala Pro Thr Gly
625 630 635 640
Asn Val Thr Tyr Ser Ile Asn Val Thr Asn Asp Trp Gln Ser Gly Tyr
645 650 655
Cys Ala Glu Leu Thr Val Thr Asn Asn Thr Asn Asn Ala Leu Gln Trp
660 665 670
Gln Ala Ser Val Ser Met Ser Asp Ser Val Asp Ser Met Trp Asn Ala
675 680 685
Ser Trp Ser Gln Ser Gly Asn Ile Leu Asn Val Ser Gly Val Glu Trp
690 695 700
Asn Asn Thr Leu Gln Ala Gly Gln Ser Gln Ser Gly Ile Gly Phe Cys
705 710 715 720
Ala Thr Arg Ala Ser Ser Ser Ser Ser Ser Ser Thr Thr Ser Ser Thr
725 730 735
Ser Gly Ser Thr Ser Ser Ser Ser Ser Ser Gly Gly Tyr Thr Val Pro
740 745 750
Ser Asn Asn Phe Ala Val Asn Gly Gly Val Glu Asn Asn Leu Gln Ser
755 760 765
Trp Gly Ala Thr Ala Gly Ser Val Thr Arg Ser Thr Glu Gln Arg Tyr
770 775 780
Ser Gly Asn Ala Ser Ala Arg Ile Thr Asn Arg Ala Glu Asn Trp His
785 790 795 800
Gly Leu Thr Phe Ser Val Gly Glu Leu Thr Gln Gly Asn Leu Tyr Glu
805 810 815
Val Ala Val Trp Val Lys Leu Ala Ala Gly Ser Ala Asp Thr Pro Ile
820 825 830
Thr Leu Thr Ala Lys Arg Gln Asn Asp Ser Asp Asp Ser Thr Tyr Asn
835 840 845
Glu Tyr Thr Gly Ile Val Thr Thr Ile Ala Asn Asp Ser Glu Trp Val
850 855 860
Leu Leu His Gly Gln Tyr Thr Gln Thr Gly Thr Ala Phe Glu His Phe
865 870 875 880
Ile Ile Glu Ser Glu Ser Asp Ser Val Ser Phe Tyr Ala Asp Glu Phe
885 890 895
Ser Ile Gly Gly Glu Val Thr Pro Lys Asn Glu Val Gly Phe Phe Val
900 905 910
Gly Asn Ile Thr Thr Asn Gly Asn Val Arg Asn Asp Phe Thr Gln Tyr
915 920 925
Trp Asp Gln Leu Thr Pro Glu Asn Glu Gly Lys Trp Gly Ser Val Glu
930 935 940
Arg Thr Arg Asp Val Tyr Asp Trp Ser Gly Leu Asp Arg Ala Tyr Asn
945 950 955 960
Tyr Ala Lys Gln Asn Asn Ile Pro Phe Lys Gln His Thr Met Val Trp
965 970 975
Gly Ser Gln Gln Pro Asn Trp Ile Asp Ser Leu Ser Pro Ala Glu Gln
980 985 990
Ala Ala Glu Ile Glu Glu Trp Ile Arg Asp Tyr Cys Ala Arg Tyr Pro
995 1000 1005
Asp Thr Glu Met Ile Asp Val Val Asn Glu Ala Thr Leu Gly His Ala
1010 1015 1020
Pro Ala Asn Tyr Ala Ala Ser Ala Phe Gly Asn Asn Trp Ile Ile Arg
1025 1030 1035 1040
Ser Phe Glu Leu Thr Arg Gln Tyr Cys Pro Asn Ser Ile Leu Ile Leu
1045 1050 1055
Asn Asp Tyr Asn Val Leu Ser Trp Asn Thr Gln Glu Phe Ile Gln Met
1060 1065 1070
Ala Thr Pro Ala Val Asn Ala Gly Val Val Asp Ala Ile Gly Leu Gln
1075 1080 1085
Ala His Gly Leu Ala Asp Trp Ser Leu Ser Asp Leu Glu Thr Lys Leu
1090 1095 1100
Asn Gln Val Ala Ala Leu Gly Leu Pro Ile Tyr Ile Ser Glu Tyr Asp
1105 1110 1115 1120
Ile Glu Lys Thr Asn Asp Gln Glu Gln Leu Arg Val Met Gln Thr Gln
1125 1130 1135
Phe Pro Leu Phe Tyr Asn His Pro Ser Val Lys Gly Ile Thr Ile Trp
1140 1145 1150
Gly Tyr Val Val Gly Ala Thr Trp Arg Asp Gly Thr Gly Leu Leu His
1155 1160 1165
Ser Asn Gly Thr Pro Arg Pro Ala Leu Thr Trp Leu Met Asp Tyr Leu
1170 1175 1180
Asn Arg
1185
<210> 58
<211> 3561
<212> DNA
<213> Microbulbifer degradans
<400> 58
atgttgcgaa gcacccaatc aacacccatt gttaagcgaa agatttctgc ctatgtaggt 60
tggggtctgt gcgtgttact tagcgtctgc acggcctcga tctcttgggc aggtaaccct 120
attgtgtctc atgtatatac cgcagaccct gctgcacggg taataaacgg aagagcctat 180
gtaatggtta cccacgatca ggataaccaa aatgattacg gtggtttgat tgattactac 240
ctgttctcat cggacgatat ggttaattgg caagatcacg gtattgtgtg gaattctcga 300
acagacagta gttgggccag tcttgcttac gccccagatt ttatcgagcg caatggaaag 360
tactacctgt actttcccaa cggcgcaaac tctattggtg tcgctgtggc cgatagccct 420
gagggcccct atactgatcc actcggtagg ccgctggttg accgcaatac ccccaatgcc 480
aatgttgact ggctgttcga tcccggtgta tttattgatg acgacggaca agcctttttg 540
tactttggtg gaggcgctga tggaaccgcg cgcgttattc gtttaaataa cgacatgata 600
agtaccagtg gtgcagccat aagtattgac gtacctaact tctttgaagc gctatacatg 660
cataagcgca acggcattta ctacttatcc tactcgacca accccagcgc ggggatgagc 720
atagattaca tgacgagtaa taaccctacc tcagggttca cccatcgcgg caccattttg 780
cccaaccctt gggaaaataa ttccaataac aaccaccagt caattattga atttaataac 840
gaatggtaca ttttttacca caatagagct gtcgcaaata cgcggggcga tagtaccttt 900
tcccgctcta ttaacgtgga tcgtctttac tacaattccg acggcagtat tcgagaagta 960
aatgccagtt caataggtgt acccgcggta cgtaatgtta atgctttttc cataaaccaa 1020
gcagaaacat tcgatcaaga aggtggcata gaaactgagc cgtcttctga aggtaccttg 1080
aatattcaga tgggcccagg agattgggta aaagttgcta acgtcgattt tggtaacggc 1140
gccacacaat ttaacgctcg agttgctagc gcaatcgata attcaaagct ggaaattatt 1200
ttaggcagtc tcagtaatac cccgcatgcc tcgctcgaaa ttaccaacac aggcgggtgg 1260
caaaattggc aaacacaaag cacaagtttt aatgcaataa ctggtgttca cgatgtatac 1320
ctgcgcggta cttctgggca caacctaaat tggtttgaat ttgaaggcga aaataatgga 1380
ggaagcagtc agctaacggt tgagttggaa gacttggctt cgcaatctct ttttgctccc 1440
cttagcgtac gctccgataa catggctaat aacggcgctt acattgaatg gagtaatgat 1500
gggagcaatc agattctcag tgtggccagc gagcaatcgc aaggccaaat cagtgtccca 1560
tttactctat cgcaagcttc cgatgtcgaa tttaacgtac gcgtgaatct tgctaatggc 1620
aatgatgatt cgttttatta caagctaaac agtaatagct ggcagacttt taataatcaa 1680
gctaccactg gttggcaggt gctcacgccc aacaccttca ctggtcttag ccctgggaat 1740
cacattctta ctctacttcg gcgtgaagat ggcgccaaat tagataccct cacgttggta 1800
gcctccgcgg gcagtattca aaccaataac agctcatcaa gttctacctc cagcagtagt 1860
tcaacgacta gctcaagttc aaccagctcg agtagttcct ctggcgccgc gccaactggt 1920
aacgttactt actctataaa tgttactaac gactggcaaa gtggttattg tgcggagctt 1980
accgttacca ataacacgaa caacgctctg cagtggcaag ctagtgtttc tatgagcgat 2040
agtgtcgaca gtatgtggaa tgctagctgg tcgcagagcg ggaacatact taacgtaagc 2100
ggggtagagt ggaataatac gttgcaagca gggcaaagcc agagtggcat aggattttgt 2160
gctacacgtg ccagctcgtc ttcctccagc tctacaacaa gttctacttc cggttccaca 2220
tcaagctcta gttcatcggg aggctatacc gttccgagta ataatttcgc agtcaatggt 2280
ggtgtagaaa acaacctgca gagctggggc gcaacggcgg gttcagtaac acgttctact 2340
gaacaacgtt atagcggaaa cgcaagcgcg cgtataacaa atcgagcaga aaactggcac 2400
ggtttgacgt tcagtgttgg tgagcttacg caaggcaacc tgtacgaagt tgcggtgtgg 2460
gtaaaacttg cggcaggcag tgcggacaca cctattacgc ttaccgccaa acgacaaaat 2520
gatagcgacg attccactta taacgaatat accggcatag tcacgaccat tgctaacgat 2580
tctgaatggg tgctgctgca cgggcaatac actcaaactg gcacagcgtt tgagcatttt 2640
attatcgagt cagaaagcga tagcgtaagt ttttatgccg atgagttttc tattggtgga 2700
gaggtcacgc ccaaaaacga agtgggattt tttgtgggta acattaccac taatggcaat 2760
gtgcgcaatg attttactca gtactgggat caactaacac cagaaaatga aggaaagtgg 2820
ggttcggtag aacgcactcg tgatgtgtat gattggagtg gactagacag agcctataac 2880
tacgccaaac aaaataatat tccgtttaaa cagcatacta tggtgtgggg tagccaacag 2940
cccaactgga ttgattcgct cagcccagca gaacaggctg cagagataga ggagtggata 3000
agagattatt gtgcgcgcta tcctgatact gaaatgattg atgtggtaaa cgaagcaacg 3060
ctgggccatg ctcctgctaa ctacgcggcg agtgcgtttg gcaataattg gatcattcgt 3120
tcgttcgagc ttactcgtca atattgtcct aacagcattt taatattgaa cgattacaat 3180
gttttaagtt ggaacactca agagtttatc cagatggcta ctccggctgt caatgcaggc 3240
gttgtagatg caattggatt acaagcacac ggcttagcgg attggtcttt aagtgattta 3300
gaaaccaaac taaaccaggt tgcggcattg ggtttaccca tttatatatc cgaatacgat 3360
atagaaaaaa ctaacgacca agaacagctg cgcgtaatgc aaactcagtt cccgctgttt 3420
tataaccatc catcggtgaa aggcattact atttgggggt atgttgttgg ggctacttgg 3480
cgcgatggga cgggattgtt gcacagtaac ggaacaccca gaccggcact tacttggtta 3540
atggattact tgaatagata g 3561
<210> 59
<211> 670
<212> PRT
<213> Microbulbifer degradans
<400> 59
Met Val Ile Ile Thr Met Lys Ala Gly Leu Leu Leu Arg Ile Leu Leu
1 5 10 15
Thr Val Leu Ala Leu Asn Met Leu Ala Ala Cys Gly Gly Ser Ser Ser
20 25 30
Asn Thr Lys Glu Pro Val Thr Gln Pro Glu Pro Glu Pro Glu Gln Gln
35 40 45
Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu
50 55 60
Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Met Glu Pro Gln
65 70 75 80
Pro Glu Pro Gln Ala Pro Pro Ala Gly Gly Val Ser Ile Ile Asp Thr
85 90 95
Asn Pro Asn Asn Ala Ser Phe Trp Ala Gly Ser Asn Asn Gly Asp Val
100 105 110
Gly Ser Arg Ala Val Ile Asp Val Asp His Pro Glu Phe Ser Gln Ala
115 120 125
Thr Arg Ile Thr Val Ser Asn Pro Ala Ser Asp Tyr Trp Asn Gly Gln
130 135 140
Leu Ser Phe Pro Leu Asn Ala Ser Val Ala Ala Gly Asp Val Val Leu
145 150 155 160
Val Arg Leu Tyr Met Arg Ser Val Glu Asn Thr Tyr Glu Ser Gly Ala
165 170 175
Ser Phe Thr Thr Val Phe Ile Glu Asp Asn Ile Asp Phe Thr Lys Phe
180 185 190
Leu Asn Arg Glu Ile Thr Ala Ala Gln Asp Trp Val Glu Tyr Tyr Leu
195 200 205
Pro Ala Glu Ile Thr Asp Asn His Ala Thr Gly Glu Val Gly Leu Arg
210 215 220
Ile Gly Phe Gly Ala Gly Pro Arg Ala Gln Val Phe Asp Ile Gly Gly
225 230 235 240
Val Glu Leu Leu His Tyr Thr Asn Thr Asp Ile Ser Ala Met Pro Ser
245 250 255
Thr Arg Pro Ser Tyr Glu Gly Arg Glu Pro Asp Ala Ala Trp Arg Thr
260 265 270
Ala Ala Ala Glu Arg Ile Glu Gln His Arg Lys Gly Asp Phe Glu Leu
275 280 285
Thr Val Val Asp Asp Gly Asn Pro Ile Ala Asn Ala Thr Ile Asp Val
290 295 300
Asp Phe Gln Lys His Ala Tyr His Phe Gly Ser Val Thr Val Gly His
305 310 315 320
Leu Leu Met Gly Thr Ser Glu Asp Ser Ala Ile Tyr Arg Glu Lys Val
325 330 335
Leu Glu Leu Phe Asn Gln Ser Gly Pro Glu Asn Asp Leu Lys Trp Gly
340 345 350
Pro Trp Glu Gly Glu Trp Gly Asn Asn Phe Asn Gln Thr Gln Thr Leu
355 360 365
Asn Gly Leu Gln Trp Leu Arg Asp Asn Gly Leu Tyr Thr Arg Gly His
370 375 380
Val Met Val Trp Pro Ser Lys Arg Asn Leu Pro Asn Leu Met Gln Gln
385 390 395 400
Tyr Leu Pro Glu Gly Asp Pro Ala Ser Ala Asn Pro Glu Ala Lys Gln
405 410 415
Val Val Leu Asp His Ile Asp Asp Ile Ala Thr Ala Thr Ala Asn Tyr
420 425 430
Leu Asp Glu Trp Asp Val Leu Asn Glu Pro Tyr Asp Asn His Tyr Leu
435 440 445
Met Asp Ala Phe Gly Asp Ser Val Met Val Asp Trp Phe Asn Arg Ala
450 455 460
Arg Thr Asn Leu Pro Ala His Gly Leu Tyr Ile Asn Asp Tyr Ser Ile
465 470 475 480
Leu Ser Ala Gly Gly Arg Asn Phe Ala His Gln Glu His Tyr Thr Asn
485 490 495
Thr Ile Gln Tyr Leu Val Asp Asn Asn Ala Pro Ile Thr Gly Ile Gly
500 505 510
Leu Gln Ser His Phe Gly Asp Ser Pro Thr Ala Ile Thr Arg Ile Tyr
515 520 525
Glu Ile Ile Asp Gln Tyr Ser Thr Ala Phe Pro Gln Leu Asp Ile Arg
530 535 540
Ala Thr Glu Phe Asp Val Ser Thr Thr Asp Glu Asp Leu Gln Ala Asp
545 550 555 560
Phe Thr Arg Asp Phe Leu Thr Ile Phe Phe Ser His Pro Lys Thr Val
565 570 575
Gly Val Gln Leu Trp Gly Phe Trp Ala Asn Ala His Trp Tyr Pro Asn
580 585 590
Ala Ala Leu Tyr Asp Ala Asp Trp Arg Glu Lys Pro Asn Ala Leu Ala
595 600 605
Trp Lys Glu Gln Ile Phe Asn Glu Trp Trp Asn Asp Phe Asp Gly Thr
610 615 620
Thr Asn Ala Gln Gly Lys Phe Asp Glu Arg Gly Phe Tyr Gly Asp Tyr
625 630 635 640
Gln Val Thr Val Thr Val Gly Glu Glu Gln Gln Ile Phe Thr Phe Ser
645 650 655
Leu Val Lys Gly Gly Glu Gln Asn Phe Ser Phe Glu Trp Gln
660 665 670
<210> 60
<211> 2013
<212> DNA
<213> Microbulbifer degradans
<400> 60
atggtcataa taactatgaa agccggttta cttctacgca tcctattaac tgtactcgcg 60
ctcaatatgc ttgccgcatg tggcggtagt tctagcaata ccaaagaacc cgttacccag 120
ccggaaccag agccagagca gcagccagaa ccagaaccag agccagagcc agagccagag 180
ccagaaccag agccagaacc agagccagaa ccagagccag agcctgaaat ggaaccgcag 240
ccagagccac aagcgccgcc tgcaggtggt gtatctatca ttgataccaa ccccaacaat 300
gcatcgtttt gggcaggctc aaacaatggt gatgtgggca gtagggctgt tatagatgtc 360
gatcaccccg aatttagcca agcgacgcgc ataaccgtaa gcaaccccgc tagcgactat 420
tggaatggtc agctctcctt cccgcttaat gcgagtgtgg cggcggggga tgtagtatta 480
gtgcgtttgt acatgcgctc ggtggagaat acttacgaat cgggtgctag ttttactacc 540
gtatttattg aagacaacat cgactttact aaatttttaa accgcgaaat aaccgccgcg 600
caagattggg tagagtatta cctacccgca gaaattaccg ataaccatgc aaccggtgaa 660
gtgggcttgc gcattggctt tggcgctggc cctagggcgc aggtgtttga tattggcggt 720
gtagagctat tgcattacac caatactgat ataagcgcta tgcctagtac acgcccaagt 780
tacgaaggcc gcgagccaga tgccgcatgg cgtacagcgg cggcagagcg aattgagcag 840
caccgcaaag gcgactttga gctaacagta gtggacgatg gcaaccctat cgccaatgcc 900
accatagatg tagattttca aaaacacgcc tatcattttg gctcggtaac tgttggccat 960
ctattgatgg gcaccagtga agatagcgcc atttaccgcg aaaaagtgct cgagctattt 1020
aaccaaagtg gcccagaaaa cgatttaaag tggggcccat gggaaggcga gtggggcaac 1080
aattttaacc aaactcaaac cctaaacggc ttgcagtggc tgcgcgataa cggcctgtac 1140
acacgtggcc atgtaatggt ttggccttct aagcgcaact tgccaaactt aatgcagcaa 1200
tatttaccag aaggcgaccc cgccagcgcc aacccagaag caaaacaagt ggtgctggat 1260
cacatcgatg atatagcaac cgcaacagct aattatttag atgagtggga tgtactaaac 1320
gagccttacg acaaccacta tttaatggat gcctttggcg atagtgtaat ggtggattgg 1380
tttaatcgcg cgcgtactaa cctgcctgcg cacggtttgt acataaacga ttacagtatt 1440
ttatctgcgg gcgggcgcaa ttttgctcac caagaacact acaccaacac gattcaatat 1500
ttggtcgata acaacgcacc catcaccggt ataggtttgc aaagtcactt tggcgactcg 1560
cctacagcca ttacgcgtat ttacgaaatt attgatcaat acagtaccgc gtttccgcag 1620
ttagatattc gcgcaacgga atttgacgta agtacaacag atgaagacct gcaggcagat 1680
tttacccgcg acttcttaac gatattcttt agccacccta aaacagtggg tgtgcagttg 1740
tggggttttt gggcaaatgc acattggtac cctaatgcag cgctttatga tgccgattgg 1800
cgagaaaagc ccaatgcact agcttggaaa gagcaaattt ttaacgagtg gtggaacgac 1860
tttgacggca cgaccaacgc acagggtaaa tttgatgaac gcggttttta cggcgattac 1920
caagtaactg taaccgtagg tgaagagcag caaattttta cctttagcct agttaaaggc 1980
ggcgaacaaa actttagttt tgagtggcaa tag 2013
<210> 61
<211> 275
<212> PRT
<213> Microbulbifer degradans
<400> 61
Met Asn Ile Lys Thr Phe Phe Pro Ala Leu Ile Ala Ser Val Phe Leu
1 5 10 15
Leu Ile Asn Ala Ser Thr Gly Tyr Ala Ala Ser Ile Thr Lys Thr Leu
20 25 30
Cys Asn Pro Ala Asp Ser Asp Asn Gly Tyr Gly Ala Gly Thr Phe Asn
35 40 45
Gly Lys Phe Tyr Ser Trp Phe Glu Leu Ser Gln Glu Asp Ile Thr Asp
50 55 60
Cys Asp Thr Lys Ile Gly Phe Tyr Asn Glu Thr Asn Arg His Phe Arg
65 70 75 80
Val Glu Trp Asn Val Ala Gln Ser Trp Gly Glu Asp Ala Ile Gly Gly
85 90 95
Met Gly Trp Ser Ser Gly Ser Arg Asp Arg Lys Ile Gly Tyr Asn Val
100 105 110
Gly Gln Leu Thr Thr Asn Ser Ser Ile Gln Lys Ala Leu Val Ala Met
115 120 125
Tyr Gly Trp Ser Cys Ser Thr Ser Gly Gly Asn Gln Ile Ser Gln Glu
130 135 140
Tyr Tyr Val Val Asp Thr Trp Asp Gly Gly Lys Phe Val Pro Trp Asp
145 150 155 160
Glu Asn Ala Asn Asn Gly Asn Gly Ala Pro Ala Gln Ser Val Gly Thr
165 170 175
Val Ser Ala Asn Gly Ala Thr Tyr Asp Val Tyr Lys Val Arg Arg Asn
180 185 190
Gly Ala Gln Tyr Cys Phe Asn Gly Ser Ser Arg Ser Phe Asp Gln Phe
195 200 205
Trp Ser Val Arg Arg Thr Pro Arg Ala Ile Asn Gly Asn Arg Asn Met
210 215 220
Asp Phe Arg Pro His Ala Asn Arg Trp Asp Asn Ser Asp Leu Gly Phe
225 230 235 240
Lys Val Asp Gly Leu Ser Ser Gly Tyr Gln Ile Leu Ala Val Glu Ile
245 250 255
Phe Gly Asp Ala Asn Leu Arg His Lys Gly Ala Ala Asp Ile Thr Leu
260 265 270
Trp Pro Arg
275
<210> 62
<211> 828
<212> DNA
<213> Microbulbifer degradans
<400> 62
atgaacataa aaacattctt ccccgcactt attgcaagtg tatttttatt aattaacgcc 60
agcactggct atgcagcaag cattaccaaa acgctttgca acccagccga ttccgataac 120
ggctacggtg caggaacctt caatggcaaa ttttattctt ggtttgagtt aagccaagaa 180
gacattaccg attgcgatac aaaaattggt ttttacaacg aaaccaatcg acactttagg 240
gtggagtgga atgttgctca atcttgggga gaagatgcaa ttggtggaat gggttggagc 300
tctggctcga gagatagaaa aataggttac aacgttggcc aacttacaac taattcttct 360
attcaaaaag cattggttgc tatgtatggc tggtcttgct ctaccagtgg tggcaaccaa 420
atatcacaag aatattatgt agtggataca tgggacggcg gcaagtttgt gccttgggat 480
gaaaacgcaa ataatggcaa cggtgctcca gcacagagtg taggaacagt tagcgctaat 540
ggtgcaacat acgatgttta taaggttcgc cgcaacggtg cgcaatattg ttttaatggc 600
agcagccgct cgtttgatca gttttggagt gtgcgtagaa cgcctagagc gattaacggc 660
aaccgtaata tggattttcg cccgcacgcc aaccgctggg acaacagtga cctaggtttt 720
aaagttgacg ggttaagcag cggttaccaa attttagcgg ttgaaatatt tggtgatgcg 780
aacctaagac ataaaggtgc agcagatatt actttatggc cacgctaa 828
<210> 63
<211> 767
<212> PRT
<213> Microbulbifer degradans
<400> 63
Met Lys Ser Ile Asn Val Cys Gly Arg Arg Leu Lys Gln Ala Leu Ala
1 5 10 15
Ala Ile Ala Thr Ala Ala Ala Thr Leu Trp Phe Thr Pro Val Asp Ala
20 25 30
Gln Thr Leu Thr Ser Asn Gln Thr Gly Thr His Gly Gly Tyr Tyr Tyr
35 40 45
Ser Phe Trp Thr Asp Ser Ala Gly Thr Val Ser Met Thr Leu Gly Asn
50 55 60
Gly Gly Asn Tyr Ser Ser Ser Trp Ser Asn Thr Gly Asn Trp Val Gly
65 70 75 80
Gly Lys Gly Trp Gln Thr Gly Gly Arg Lys Thr Val Asn Tyr Ser Gly
85 90 95
Thr Phe Asn Pro Ser Gly Asn Gly Tyr Leu Thr Leu Tyr Gly Trp Thr
100 105 110
Gln Asn Pro Leu Ile Glu Tyr Tyr Ile Ile Glu Ser Trp Gly Thr Tyr
115 120 125
Arg Pro Gly Glu Ser Gly Thr Tyr Tyr Gly Thr Val Asn Thr Asp Gly
130 135 140
Gly Thr Tyr Asp Ile Tyr Arg Thr Gln Arg Val Asn Gln Pro Ser Ile
145 150 155 160
Glu Gly Thr Ala Thr Phe Tyr Gln Tyr Trp Ser Val Arg Gln Gln Lys
165 170 175
Arg Val Gly Gly Thr Ile Thr Thr Gly Asn His Phe Asp Ala Trp Ala
180 185 190
Ser His Gly Leu Asn Leu Gly Thr His Asn Tyr Met Val Met Ala Thr
195 200 205
Glu Gly Tyr Gln Ser Ser Gly Asn Ser Asn Ile Thr Val Ser Glu Gly
210 215 220
Ser Gly Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Thr Gly Gly Pro
225 230 235 240
Ser Gly Thr Asn Ile Val Val Arg Ala Gln Gly Val Ser Gly Gln Glu
245 250 255
His Ile Asn Leu Ile Ile Gly Gly Asn Val Val Ala Asp Trp Thr Leu
260 265 270
Ser Thr Ser Met Gln Asp Tyr Thr Tyr Thr Gly Asn Ala Ala Gly Asp
275 280 285
Leu Gln Val Glu Tyr Asp Asn Asp Ala Ser Gly Arg Asp Val Glu Leu
290 295 300
Asp Tyr Val Tyr Val Asn Gly Glu Ile Arg Gln Ala Glu Asp Met Glu
305 310 315 320
Tyr Asn Thr Ala Thr Tyr Ser Gly Glu Cys Gly Gly Gly Ser Tyr Ser
325 330 335
Gln Thr Met His Cys Ser Gly Val Ile Gly Phe Gly Asp Thr Ser Asp
340 345 350
Cys Phe Ser Gly Asn Cys Asn Gly Ala Ser Ser Thr Ser Ser Ser Ser
355 360 365
Ser Ser Ser Ser Thr Ser Ser Ser Thr Ser Ser Gly Gly Asn Asn Asn
370 375 380
Ser Gly Ile Thr Val Arg Ala Arg Gly Thr Asn Gly Asp Glu His Ile
385 390 395 400
Asn Leu Ile Val Gly Gly Asn Ile Val Gly Asn Trp Thr Leu Thr Thr
405 410 415
Ser Asn Gln Asn Tyr Val Tyr Asn Gly Asn Ala Ser Gly Asp Val Glu
420 425 430
Val Gln Phe Asp Asn Asp Ala Asn Gly Arg Asp Val Ile Leu Asp Tyr
435 440 445
Val Ile Val Asn Gly Glu Thr Arg Gln Ala Glu Asp Met Glu Tyr Asn
450 455 460
Thr Ala Thr Tyr Ser Gly Ser Cys Gly Gly Gly Ser Tyr Ser Glu Thr
465 470 475 480
Met His Cys Ser Gly Glu Ile Gly Phe Gly His Thr Asp Asp Cys Phe
485 490 495
Ser Gly Asn Cys Thr Ser Ser Ser Gly Thr Thr Gly Ser Ser Gly Gly
500 505 510
Thr Ser Ser Asn Asn Gly Thr Ser Ser Cys Asn Gly Tyr Val Gly Ile
515 520 525
Thr Phe Asp Asp Gly Pro Gly Asn Asn Thr Ala Thr Leu Ile Asn Leu
530 535 540
Leu Gln Gln Asn Asn Leu Thr Pro Val Thr Trp Phe Asn Thr Gly Gln
545 550 555 560
Asn Ile Ala Ala Asn Thr Gly Gln Phe Ala Gln Gln Lys Ser Val Gly
565 570 575
Glu Ile Gln Asn His Ser Tyr Thr His Ser His Met Leu Asn Trp Ser
580 585 590
Tyr Gln Gln Val Arg Asp Glu Leu Ala Ser Thr Asn Gln Ala Ile Val
595 600 605
Asn Ala Gly Gly Ala Thr Pro Thr Leu Phe Arg Pro Pro Tyr Gly Glu
610 615 620
Thr Asn Ser Thr Ile Asn Gln Ala Ala Gln Asp Leu Gly Leu Arg Val
625 630 635 640
Ile Thr Trp Asp Val Asp Ser Arg Asp Trp Asp Gly Ala Ser Ala Ser
645 650 655
Ala Ile Ala Asn Ser Ala Asn Gln Leu Gln Asn Gly Gln Val Ile Leu
660 665 670
Met His Asp Ala Ser Tyr Asn Asn Thr Asn Gly Ala Ile Ser Gln Phe
675 680 685
Ala Ala Asn Leu Arg Ala Arg Gly Leu Cys Ala Gly Lys Ile Asp Pro
690 695 700
Ser Thr Gly Arg Ala Val Ala Pro Ser Thr Asn Thr Gly Gly Asn Thr
705 710 715 720
Gly Ser Asn Thr Gly Asn Gly Gly Asn Gly Gly Met Cys Asn Trp Tyr
725 730 735
Gly Thr Ser Ile Pro Leu Cys Gln Thr Thr Asn Asp Gly Trp Gly Trp
740 745 750
Glu Asn Ser Gln Ser Cys Val Ser Gln Asn Thr Cys Asn Ser Gln
755 760 765
<210> 64
<211> 2304
<212> DNA
<213> Microbulbifer degradans
<400> 64
atgaagtcaa tcaatgtatg cggcagacgc ctcaagcaag ccctcgcagc aatagcaacc 60
gctgcagcaa ctctctggtt tacgccagtg gatgcacaaa ccttaacctc aaaccaaact 120
ggtactcatg gtggttacta ctattccttc tggaccgaca gtgctggcac tgtttctatg 180
acactcggca atggcggcaa ttacagttca tcgtggagca ataccggtaa ctgggtggga 240
ggtaaaggct ggcaaacggg gggacgcaaa accgtaaact attccggtac gtttaacccc 300
tcgggcaatg gttatttaac cctctacggt tggacccaaa acccactcat tgaatactac 360
atcattgaaa gctggggcac ctatcgccca ggtgaaagcg gaacctacta cggcaccgtc 420
aacaccgatg gcggcactta cgatatttat cgcacccaac gcgttaacca accgtcaatt 480
gaaggcactg caacgtttta tcagtactgg agtgttaggc aacaaaaacg cgtaggcggc 540
accataacaa ccggcaacca ttttgatgcg tgggcgagtc atggccttaa cctaggcaca 600
cacaattaca tggtaatggc caccgaaggt tatcaaagta gcggcaactc caatattacc 660
gttagcgaag gcagcggttc gagcagtact agttcgagta gctctagcac cggtggccca 720
agtggtacca atattgttgt gcgcgcacaa ggtgtaagcg gccaagaaca tatcaattta 780
attattggcg gtaacgtagt ggcagactgg acgctttcaa ccagcatgca agattacacc 840
tacaccggta atgccgcagg cgacctgcaa gtagaatacg acaacgatgc tagtggtcgc 900
gatgtagagc tagactatgt gtatgtgaat ggcgaaattc gtcaagcaga agacatggaa 960
tacaacaccg caacttacag tggtgaatgt ggtggcggtt cctattcgca aaccatgcac 1020
tgcagcggtg taattggctt tggcgatacc agtgattgtt ttagcggcaa ctgtaatggt 1080
gcatcttcta caagttctag ttcgtctagt agctcaacca gctctagcac aagctctggc 1140
ggtaacaata acagcggcat tactgttcgc gcacgcggta ccaatggcga tgaacatatc 1200
aaccttattg ttggcggcaa tatagtaggc aattggacgc tcaccaccag caaccaaaat 1260
tatgtttaca acggcaatgc atctggtgat gtagaagtac aattcgacaa cgatgccaac 1320
ggtcgcgatg ttattctcga ttacgtaata gtaaatggcg aaactcgcca agcggaagat 1380
atggaataca acacggcgac ctacagcggt tcctgtggtg gtggctccta ttcggaaaca 1440
atgcactgca gcggcgaaat tggttttggt cacaccgacg attgctttag tggaaattgc 1500
actagcagca gcggcacaac cggtagctct ggaggaacat caagcaataa cggtacaagt 1560
agctgtaacg gttatgtagg tattaccttc gatgatggcc caggcaataa caccgctaca 1620
ttaataaact tactacaaca aaataactta accccagtaa cttggtttaa cacaggccaa 1680
aatattgctg ccaatacagg tcagtttgcc cagcaaaaaa gtgttggtga aattcaaaac 1740
cacagctaca cccattccca tatgcttaat tggagctatc aacaagttcg cgacgaactc 1800
gccagcacca atcaagctat tgtgaatgct gggggcgcaa cgccaactct attccgtccg 1860
ccttatggcg aaacaaactc caccattaat caagcggcac aagatttagg cctgcgcgta 1920
ataacctggg atgtagattc gcgcgattgg gatggcgcaa gcgcttcagc tattgccaac 1980
tcggctaatc agttgcaaaa cggccaagta attttgatgc acgatgccag ctacaacaat 2040
accaacggag ccatatcaca atttgcagcc aatctaagag caagagggct atgtgcaggt 2100
aaaatagacc caagcactgg ccgcgcagtt gcaccaagca caaataccgg cggcaacact 2160
ggcagcaata caggaaatgg cggtaatggc ggcatgtgta actggtacgg caccagcatt 2220
ccattatgcc aaactaccaa cgacggttgg ggctgggaaa actcacaaag ctgcgtttcg 2280
caaaatacct gtaactcaca ataa 2304
<210> 65
<211> 360
<212> PRT
<213> Microbulbifer degradans
<400> 65
Met Cys Leu Lys Ile Asn Arg Cys Trp Val Phe Val Trp Leu Cys Ile
1 5 10 15
Cys Ala Thr Thr Ala His Ser Glu Thr Tyr Val Pro Ala Asp Asn Asp
20 25 30
Gln Tyr Leu Tyr Thr Gly Arg Ile Asp Phe Ser Asp Ile Lys Ala Pro
35 40 45
Ser Leu Ser Trp Pro Gly Thr Ser Ile Lys Ala Asn Phe Thr Gly Glu
50 55 60
His Leu Glu Val Val Leu Asp Asp Gln Asn Gly Lys Asn Phe Phe Asn
65 70 75 80
Val Ile Ile Asp Gly Asn Asp Arg Phe Pro Tyr Val Leu Glu Ala Lys
85 90 95
Gln Gly Glu His Arg Tyr Leu Ile Ser Ser Ala Leu Ser Lys Gly Lys
100 105 110
His Ser Val Glu Ile Tyr Lys Arg Thr Glu Gly Glu Glu Gly Ala Thr
115 120 125
Leu Phe Lys Gly Leu Trp Leu Ala Asp Asp Ser Tyr Leu Leu Lys Pro
130 135 140
Pro Lys Arg Pro Lys Arg Arg Ile Glu Ile Tyr Gly Asp Ser Ile Thr
145 150 155 160
Ser Gly Met Gly Asn Glu Gly Ala Asp Asn Gly Ala Asp His Leu Gly
165 170 175
Ser Glu Lys Asn Asn Tyr Leu Ala Tyr Gly Ala Ile Thr Ala Arg Asn
180 185 190
Leu Asn Ala Glu Leu His Thr Ile Ser Gln Ser Gly Ile Gly Val Met
195 200 205
Val Ser Trp Phe Pro Phe Ile Met Pro Gln Phe Tyr Asn Gln Leu Ser
210 215 220
Ala Val Gly Asn Asn Asp Ser Ile Trp Asp Phe Lys Gln Trp Thr Pro
225 230 235 240
His Val Val Val Ile Asn Leu Met Gln Asn Asp Ser Trp Leu Ile Asp
245 250 255
Arg Glu Lys Arg Leu Thr Pro Ile Pro Ala Asp Ala Gln Arg Ile Ala
260 265 270
His Tyr Gln Ala Phe Val Gln Ser Ile Arg Ala Glu Tyr Pro Lys Ala
275 280 285
Gln Ile Ile Cys Ala Leu Gly Ser Met Asp Ala Thr Ala Asn Glu Lys
290 295 300
Trp Pro Asn Tyr Val Arg Glu Ala Val Lys Asn Met Gln Asp Asn Gly
305 310 315 320
Asp Asn Lys Ile Asp Thr Ile Phe Phe Glu Tyr Ile Gly Tyr Gly Gln
325 330 335
His Pro Arg Val Ala Gln His Asn Ala Asn Ala Asp Lys Leu Thr Lys
340 345 350
Phe Ile Lys Lys Lys Met Lys Trp
355 360
<210> 66
<211> 1083
<212> DNA
<213> Microbulbifer degradans
<400> 66
atgtgcctaa aaataaaccg gtgctgggtg tttgtttggt tgtgtatttg cgcaactact 60
gcccatagtg aaacctacgt acccgcagat aacgaccaat acctttatac cggccgtata 120
gattttagcg atataaaagc accctcgcta agctggcccg gcacaagtat aaaagccaac 180
tttaccggcg aacatttaga ggtagtgtta gacgatcaaa acggtaagaa tttttttaat 240
gtgattatcg acggtaacga tcgatttcct tatgtgctag aagctaaaca aggtgagcat 300
cgatatttaa tttcttctgc gctaagcaag ggcaagcaca gcgtagaaat ttataaacgt 360
acagaaggcg aagagggcgc aacgctattt aaagggcttt ggttagccga tgatagttat 420
ttattaaaac cccctaaacg cccaaaacgc agaatagaaa tttatggtga ctcaattaca 480
agcggtatgg gtaacgaagg cgcagataac ggcgccgacc atttgggctc cgaaaaaaat 540
aattaccttg cctatggggc tattaccgca cgcaatttaa acgccgagct acataccatt 600
tcgcaaagcg gtattggggt aatggtaagt tggtttccgt ttattatgcc gcagttttac 660
aaccagctaa gtgctgttgg taataatgat tccatatggg actttaaaca atggacgccc 720
catgtagttg taataaacct aatgcaaaac gatagctggc taatagatag agaaaagcgc 780
cttacgccaa ttcctgcaga tgcacaacgc atagcccatt atcaagcgtt tgtgcaaagc 840
attcgtgccg aataccccaa ggcgcaaata atatgcgcac tgggcagtat ggatgcaacc 900
gcaaacgaaa aatggccaaa ctacgtgcgc gaagctgtaa aaaatatgca agataatggc 960
gataataaaa tcgatactat tttctttgaa tacatcggct acggccaaca cccgcgcgta 1020
gcgcaacaca atgcgaatgc agataagtta actaaattta ttaagaaaaa aatgaaatgg 1080
tag 1083
<210> 67
<211> 973
<212> PRT
<213> Microbulbifer degradans
<400> 67
Met Asn Tyr Tyr Leu Asn Lys Lys Arg Leu Gly Gln Leu Leu Thr Gly
1 5 10 15
Ala Ala Ile Ile Pro Val Leu Tyr Ala Cys Gly Ser Gln Glu Lys Asn
20 25 30
Val Glu Pro Ala Thr Val Asn Trp His Lys Thr Ser Asp Gly Val Val
35 40 45
Val Ser Leu Gln Asp Ser Glu Ala Lys Lys Val Arg Leu Gln Val Ile
50 55 60
Asn Asp Arg Ile Val Arg Val Thr Ala Thr Pro Gln Gln Asp Phe Asn
65 70 75 80
Asn Leu Pro Asn Thr Leu Met Val Val Ala Lys Pro Glu Gln Thr Ala
85 90 95
Phe Glu Val Lys Gln Asn Asp Ala Ser Val Val Leu Ser Thr Ala Asp
100 105 110
Leu Ser Ala Glu Val Ser Leu Val Thr Gly Val Val Ser Phe Lys Asp
115 120 125
Glu His Gly Lys Val Leu Thr Thr Glu Val Asp Arg Gly Asn Phe Gly
130 135 140
Ala Val Thr Arg Asp Pro Gly Val Val Asp Ala Asp Ser Phe Ala Ile
145 150 155 160
Arg Gln Gln Phe Thr Ser Asp Glu Asn Glu Gly Tyr Tyr Gly Leu Gly
165 170 175
Gln Gln Gln Asp Gly Glu Val Asn Tyr Ala Gly Asp Asn Val Glu Leu
180 185 190
Thr Thr Tyr Asn Leu Glu Ile Ser Ile Pro Tyr Val Val Ser Ser Lys
195 200 205
Asp Tyr Ala Leu Leu Trp Asn Asn Thr Ser Ile Ser Arg Leu Gly Asp
210 215 220
Pro Asn Pro Pro Glu Pro Leu Lys Glu Gly Phe Lys Leu Phe Asp Ala
225 230 235 240
Asn Gly Asn Pro Gly Gly Leu Thr Ala Arg Tyr Phe Asp Gly Asp Lys
245 250 255
Leu Leu Leu Glu Arg Val Glu Ala Asp Leu Asp Tyr Gln Phe Leu Ala
260 265 270
Gln Gly Ser Asn Arg Thr Thr Pro Met Pro Asp Glu Thr Ala Asp Ala
275 280 285
Lys Asn Leu Arg Ile Glu Trp Glu Gly Ser Ile Glu Ser Asp Thr Asn
290 295 300
Gly Val His Glu Leu Lys Met Tyr Ser Ser Gly Tyr Ala Lys Leu Tyr
305 310 315 320
Leu Asn Gly Glu Leu Val Leu Asp Arg Trp Arg Met Asn Trp Asn Pro
325 330 335
Trp Tyr His Asn Thr Lys Leu Glu Met Gln Ala Gly Lys Lys Val Ala
340 345 350
Leu Lys Leu Asp Trp Gln Val Asp Gly Gly Tyr Met Arg Ile Lys Gln
355 360 365
His Lys Pro Leu Pro Val Ala Glu Gln Gly Arg Leu Ser Ile Ala Ser
370 375 380
Asp Thr Ala Lys Ala Ile Asp Tyr Tyr Phe Val Val Gly Asp Asn Lys
385 390 395 400
Asp Glu Leu Val Ser Gly Tyr Arg Thr Leu Thr Gly Lys Ala Val Met
405 410 415
Leu Pro Lys Trp Val Phe Gly Phe Trp Gln Ser Arg Glu Arg Tyr Lys
420 425 430
Thr Gln Asp Glu Ile Ile Asp Ala Leu Gln Glu Tyr Arg Asp Arg Lys
435 440 445
Ile Pro Ile Asp Asn Ile Val Leu Asp Trp Ser Tyr Trp Pro Gln Asp
450 455 460
Ala Trp Gly Ser His Asp Phe Asp Glu Gln Phe Phe Pro Asp Pro Ser
465 470 475 480
Ala Leu Val Asp Lys Val His Glu Leu Asn Gly Asn Ile Met Ile Ser
485 490 495
Val Trp Pro Lys Phe Tyr Pro Thr Thr Asp Asn Tyr Lys Ala Leu Asn
500 505 510
Ala Lys Gly Cys Met Phe Asn Lys Asn Ile Glu Gln Lys Asn Leu Asp
515 520 525
Trp Ile Gly Glu Gly Tyr Leu Asn Gly Phe Tyr Asp Ala Tyr Asn Pro
530 535 540
Glu Cys Arg Glu Met Phe Trp Ala Gln Ile Arg Asp Lys Ile Asn Val
545 550 555 560
His Gly Phe Asp Ala Trp Trp Leu Asp Ala Val Glu Pro Asp Ile His
565 570 575
Ser Asn Leu Ser Phe Glu His Arg Lys Asp Leu Met Thr Pro Asn Ala
580 585 590
Leu Gly Thr Gly Ala Glu Val Phe Asn Ala Tyr Ala Leu Pro His Ala
595 600 605
Glu Thr Val Tyr Gln Gly Glu Arg Arg Asp Asp Gly Asp Lys Arg Ala
610 615 620
Phe Ile Leu Thr Arg Ser Gly Phe Ala Gly Ile Gln Arg Thr Gly Ser
625 630 635 640
Ala Ile Trp Ser Gly Asp Val Val Ser Arg Trp Ser Asp Leu Lys Glu
645 650 655
Gln Ile Ala Ala Gly Val Gly Val Gly Ile Ser Gly Met Pro Tyr Trp
660 665 670
Thr Phe Asp Ile Gly Gly Phe Thr Pro Glu Asp Arg Tyr Arg Tyr Ser
675 680 685
Ala Lys Gly Ser Val Gly His Phe Ser Met Met Asn Glu Ser Glu Val
690 695 700
Pro Glu Trp Gln Glu Ile Asn Leu Arg Trp Phe Gln Phe Gly Thr Phe
705 710 715 720
Val Pro Leu Phe Arg Ser His Gly Gln Asn Pro Tyr Arg Glu Ile Tyr
725 730 735
Asn Ile Ala Asp Lys Gly Thr Glu Val Tyr Asp Ser Met Val Trp Tyr
740 745 750
Thr Lys Thr Arg Tyr Arg Leu Met Pro Tyr Ile Tyr Ser Leu Val Gly
755 760 765
Asp Ala His His Lys Asp Gly Thr Phe Met Arg Ala Leu Val Met Asp
770 775 780
Phe Pro Ser Asp Leu Asn Val Arg Asp Ile Asn Asp Gln Tyr Met Phe
785 790 795 800
Gly Pro Ala Leu Leu Val Asn Pro Val Ser Glu Phe Lys Ala Arg Ser
805 810 815
Arg Asp Val Tyr Leu Pro Ala Gly Ala Asp Trp Tyr Asp Phe Tyr Thr
820 825 830
Gly Val Lys His Thr Gly Gly Lys Thr Ile Lys Ala Asp Ala Pro Leu
835 840 845
Ala Lys Met Pro Ile Phe Val Lys Ala Gly Ser Ile Ile Pro Thr Gly
850 855 860
Val Glu Ile Gln His Val Tyr Asp Lys Pro Asp Ala Pro Tyr Thr Leu
865 870 875 880
Asn Val Tyr Thr Gly Ala Asn Gly Ser Phe Glu Ile Tyr Glu Asp Asp
885 890 895
Gly Lys Thr Tyr Ala Tyr Glu Gln Gly Ala Trp Ala Arg Ile Pro Val
900 905 910
Ser Tyr Asn Asp Lys Thr Gly Glu Leu Thr Ile Gly Asp Arg Val Gly
915 920 925
Ser Phe Glu Gly Met Thr Lys Glu Arg Glu Phe Arg Val Arg Trp Ile
930 935 940
Ser Ala Lys Arg Asp Asp Ala Ala Asn Phe Asp Thr Gly Val Ala Lys
945 950 955 960
Ala Val Thr Tyr Thr Gly Lys Ala Ile Thr Ile Lys Arg
965 970
<210> 68
<211> 2922
<212> DNA
<213> Microbulbifer degradans
<400> 68
gtgaattatt atttaaacaa aaagcgactg gggcaattgc tcaccggcgc ggccattatt 60
cccgtgctat atgcatgtgg ctcacaggaa aaaaacgtag agcctgcaac ggttaattgg 120
cataaaacaa gcgacggcgt cgttgtaagc ttgcaagata gcgaagcaaa aaaagtgcgc 180
ttgcaagtca ttaacgatcg gatagtacgt gttaccgcta cgccacagca ggatttcaac 240
aacctgccaa atacgcttat ggtggtggcc aagcccgagc aaacggcgtt tgaagttaaa 300
caaaacgatg catctgttgt gttatcaacg gcagatctat ctgccgaagt gtcattagta 360
actggtgttg taagttttaa agatgagcac ggcaaggtgc ttacaacaga agttgatcgc 420
ggcaattttg gggcggtaac ccgcgaccca ggtgtggtgg acgccgattc atttgctatt 480
cgccaacagt ttacaagcga cgaaaatgaa ggctactacg gtttaggtca gcagcaggat 540
ggcgaagtaa actacgctgg cgataacgta gagttaacaa cttacaactt agaaatttct 600
ataccttatg ttgtatcaag caaagattac gcgctgctat ggaacaatac ctcaatttct 660
cgtttgggcg accccaatcc acccgagcca ctaaaagagg gctttaaact ctttgacgct 720
aatggtaacc ccggcgggct aaccgcacgt tattttgatg gcgataaatt actgctcgag 780
cgtgtagagg ccgatttaga ttatcaattt ttagcgcaag gtagtaatcg cactacgccc 840
atgcctgatg aaaccgctga tgcaaaaaat ctgcgtattg aatgggaagg tagtatcgaa 900
tccgatacca acggtgtgca cgagttaaaa atgtattcca gtggctacgc taaattgtat 960
ttgaatggcg agttagtgtt agatcgctgg cgtatgaact ggaacccttg gtatcacaac 1020
accaagttag aaatgcaggc cggtaaaaaa gttgcattaa agttagattg gcaagtagat 1080
ggtggttata tgcgcataaa acagcataaa ccactgccgg tagcagagca gggacgtttg 1140
tctattgctt ccgataccgc gaaagccatt gattactact ttgtagttgg cgataacaag 1200
gatgagttgg tgtctggcta ccgtacgctc acaggtaaag cagtgatgct acctaagtgg 1260
gtgtttggtt tttggcaaag ccgcgagcgc tataaaacac aagatgaaat tatcgacgcc 1320
ttgcaagaat accgcgatcg taaaattcct atcgataaca ttgtattaga ttggagttat 1380
tggcctcagg atgcatgggg tagtcatgat ttcgacgagc aatttttccc cgacccatct 1440
gcactagtag ataaagtaca cgagctaaac ggcaatatta tgatttccgt atggcctaag 1500
ttttacccta caaccgacaa ctacaaagcg ctaaacgcta aaggttgtat gtttaataaa 1560
aacatcgagc agaaaaacct cgattggatt ggcgagggtt acctaaatgg cttttacgat 1620
gcctataacc cagagtgccg tgaaatgttt tgggcgcaaa ttcgcgataa gatcaatgtg 1680
cacggtttcg atgcttggtg gttagatgcg gtagagccag atatccattc caacctttct 1740
tttgagcacc gcaaagattt aatgacaccc aatgcactcg gcaccggtgc cgaagtgttt 1800
aacgcttacg ctttgccgca cgcagaaact gtttaccaag gcgagcgtag agatgacggt 1860
gacaagcgcg catttattct aacgcgttct gggtttgccg gtattcagcg caccggttcg 1920
gctatttgga gtggcgatgt ggtatcgcgc tggtccgact taaaagaaca aattgcagca 1980
ggtgtgggcg tgggcatttc tggtatgccg tattggacgt tcgatatcgg tggctttact 2040
ccagaagatc gctaccgtta tagcgccaaa ggttctgttg gtcatttctc tatgatgaac 2100
gaatcggaag tgcctgaatg gcaagaaatc aatctgcgtt ggttccaatt tggtaccttt 2160
gtgccgctgt ttaggtccca cggccaaaac ccatatcgcg aaatatataa catcgccgat 2220
aaaggcaccg aggtatacga cagcatggtg tggtacacca aaactcgcta tcgcttaatg 2280
ccttatattt attcgttagt tggcgatgct caccacaaag acggcacctt tatgcgcgct 2340
ctggtgatgg atttccctag cgaccttaat gtgcgcgata ttaacgacca gtatatgttt 2400
ggccccgcgc tactcgtaaa ccctgtgtcg gaatttaaag cgcgttcacg ggatgtgtat 2460
ctacctgcgg gcgcagattg gtacgatttc tatacaggtg tgaagcacac aggtggtaaa 2520
accattaagg ccgatgcacc gcttgccaaa atgcctattt ttgttaaggc cggctctatt 2580
attccaacag gtgtagaaat ccagcatgtg tacgataagc ccgatgctcc ttacaccctt 2640
aacgtgtata ccggtgcgaa tggcagcttc gaaatttatg aagatgacgg caaaacctac 2700
gcttacgagc aaggggcttg ggcgcgcatt cccgtttcgt acaacgataa aaccggtgag 2760
ctaaccattg gcgatcgcgt aggtagcttt gagggaatga ccaaagagcg cgaattccgc 2820
gtgcgctgga tatctgccaa gcgagacgat gccgccaatt tcgatacagg tgtggccaaa 2880
gccgttacct atacgggtaa ggcaataacc attaagcgct aa 2922
<210> 69
<211> 893
<212> PRT
<213> Microbulbifer degradans
<400> 69
Met Asn Lys His Phe Leu Val Gly Val Ile Thr Leu Gly Val Ile Leu
1 5 10 15
Gln Gly Leu Thr Ala Cys Ser Lys Ser Ala Ala Pro Asn Ala Asn Gln
20 25 30
Pro Gln Asp Thr Ala Ala Ser Thr Ala Thr Tyr Pro Phe Arg Asp Ala
35 40 45
Ser Leu Ser Val Asp Ala Arg Val Asp Asp Leu Val Ser Arg Leu Thr
50 55 60
Thr Thr Glu Lys Ile Ala Gln Met Phe Asn Asp Thr Pro Ala Ile Glu
65 70 75 80
Arg Leu Gly Ile Pro Ala Tyr Asn Trp Trp Asn Glu Ser Leu His Gly
85 90 95
Val Ala Arg Ala Gly Lys Ala Thr Val Tyr Pro Gln Ala Ile Gly Leu
100 105 110
Ala Ser Thr Phe Asp Glu Asp Leu Met Leu Arg Val Ala Thr Ser Ile
115 120 125
Ser Asp Glu Gly Arg Ala Lys Tyr His Asp Phe Leu Ser Lys Asp Val
130 135 140
Arg Thr Ile Tyr Gly Gly Leu Thr Phe Trp Ser Pro Asn Ile Asn Ile
145 150 155 160
Phe Arg Asp Pro Arg Trp Gly Arg Gly Gln Glu Thr Tyr Gly Glu Asp
165 170 175
Pro Phe Leu Thr Gly Arg Met Ala Ile Asn Phe Val Lys Gly Ile Gln
180 185 190
Gly Glu Asn Asp Asn Ser Asp Tyr Leu Lys Ala Val Ala Thr Ile Lys
195 200 205
His Tyr Ala Val His Ser Gly Pro Glu Lys Thr Arg His Ser Asp Asp
210 215 220
Tyr His Pro Thr Arg Lys Asp Leu Phe Glu Thr Tyr Leu Pro Ala Phe
225 230 235 240
Arg Met Ala Ile Ala Glu Thr Asn Val Gln Ser Leu Met Cys Ala Tyr
245 250 255
Asn Arg Val Asp Gly Ala Pro Ala Cys Gly Asn Asn Glu Leu Met Gln
260 265 270
Glu Ile Leu Arg Gly Asp Met Gly Phe Asn Gly Tyr Val Val Ser Asp
275 280 285
Cys Gly Ala Ile Ala Asp Phe Tyr Glu Ser Arg Ser His His Val Val
290 295 300
Asp Ser Pro Ala Glu Ala Ala Ala Trp Ala Val Lys Ser Gly Thr Asp
305 310 315 320
Leu Asn Cys Gly Asp Ser His Gly Asn Thr Tyr Thr Asn Leu His Tyr
325 330 335
Ala Leu Gln Gln Gly Leu Ile Thr Glu Asp Tyr Ile Asp Ile Ala Val
340 345 350
Lys Arg Leu Phe Lys Ala Arg Ile Lys Leu Gly Met Phe Asp Glu Gln
355 360 365
Asp Arg Val Pro Tyr Ser Glu Ile Gly Met Asp Val Val Gly Ser Pro
370 375 380
Lys His Leu Ala Leu Thr Gln Glu Ala Ala Glu Lys Ser Ile Val Leu
385 390 395 400
Leu Lys Asn Asn Gly Val Leu Pro Leu Lys Ala Gly Val Lys Val Ala
405 410 415
Val Ile Gly Pro Asn Ala Val Asp Glu Asp Val Leu Val Gly Asn Tyr
420 425 430
His Gly Val Pro Val Lys Pro Val Leu Pro Leu Glu Gly Ile Val Asn
435 440 445
Arg Val Gly Glu Ala Asn Val Phe Tyr Ala Pro Gly Ser Ala Gln Ile
450 455 460
Ala Asp Ile Tyr Ser His Tyr Glu Pro Ile Ser Ala Glu Asn Phe Tyr
465 470 475 480
His Lys Asp Ala Asn Gly Asn Leu Ala Ala Gly Leu Lys Ala Glu Tyr
485 490 495
Tyr Ala Asp Tyr Tyr Asn Ala Ala Glu Ile Asn Asp Asp Thr Phe Ser
500 505 510
Ala Thr Pro Ala Leu Asn Arg Ile Asp Ala Asp Ile Asn Phe Ser Trp
515 520 525
Pro Val Ser Pro Ile Asp Asn Ser Leu Asp Asp Glu Phe Ser Ala Val
530 535 540
Trp Thr Gly Ile Leu Lys Pro Lys Lys Ser Gly Ser Tyr Arg Phe Ser
545 550 555 560
Gly Thr Val Ala Leu Ala Ile Asn Gly Lys Pro Val Asn Gly Ala Val
565 570 575
Asn Leu Lys Ala Gly Glu Ser Tyr Asn Ile Lys Ala Ile Phe Gly Val
580 585 590
Gln Lys Trp Trp Pro Val Asn Ala Ile His Pro Tyr Gly Lys Leu Thr
595 600 605
Trp Leu Asp Glu Ser Arg Asp Leu Glu Glu Glu Ala Leu Ala Ala Ala
610 615 620
Arg Lys Ala Asp Val Ile Ile Phe Met Gly Gly Ile Asp Ala His Leu
625 630 635 640
Glu Gly Glu Glu Met Pro Leu Glu Leu Asp Gly Phe Thr His Gly Asp
645 650 655
Arg Thr His Ile Asn Leu Pro Lys Val Gln Thr Asn Leu Leu Lys Gln
660 665 670
Leu Lys Ala Thr Gly Lys Pro Val Val Met Val Asn Phe Ser Gly Ser
675 680 685
Ala Met Ala Leu Asn Trp Glu Ser Glu Lys Leu Asp Ala Ile Leu Gln
690 695 700
Ala Phe Tyr Pro Gly Glu Ala Thr Gly Thr Ala Leu Ala Asn Ile Leu
705 710 715 720
Trp Gly Asp Val Ser Pro Ser Gly Arg Leu Pro Val Thr Phe Tyr Lys
725 730 735
Gly Val Asp Asp Leu Pro Ala Phe Asn Asp Tyr His Met Glu Asn Arg
740 745 750
Thr Tyr Lys Phe Tyr Arg Gly Glu Pro Leu Tyr Ala Phe Gly His Gly
755 760 765
Leu Gly Tyr Val Asp Phe Ala Tyr Asn Asn Leu Val Val Ala Asn Thr
770 775 780
Ala Glu Ala Gly Lys Ala Leu Pro Ile Ala Val Ser Val Thr Asn Thr
785 790 795 800
Gly Lys Met Gln Ala Glu Asp Val Ala Gln Val Tyr Ile Ser Leu Leu
805 810 815
Asp Ala Pro Ala Asn Thr Pro Ile Arg Asp Leu Lys Ala Phe Lys Arg
820 825 830
Thr Lys Leu Ala Ala Gly Glu Ser Thr Glu Leu Glu Phe Asn Leu Pro
835 840 845
Ala Arg Val Leu Thr Tyr Ile Asp Asp Asn Gly Lys Thr Gln Thr Tyr
850 855 860
Thr Gly Arg Val Glu Val Thr Val Gly Ser Gly Gln Lys Gly Tyr Val
865 870 875 880
Lys Glu Asn Ala Ile Ala Val Ala Thr Ile Asn Val Gln
885 890
<210> 70
<211> 2682
<212> DNA
<213> Microbulbifer degradans
<400> 70
atgaataaac actttttagt aggtgtaatt acgttagggg taattctgca ggggctaact 60
gcatgtagca aaagcgctgc acctaatgcc aatcaaccgc aagataccgc agctagtacg 120
gctacctacc cgtttaggga tgcaagctta agtgtagatg cccgcgtaga cgacttggta 180
tcgcgtttaa ccacaaccga aaaaattgcc caaatgttta acgatacgcc cgcaatcgag 240
cgattgggta ttcccgccta caattggtgg aacgaatcgt tgcacggtgt ggcccgtgcg 300
ggtaaagcaa cggtataccc gcaggcaata ggcttagcgt ctacatttga tgaagactta 360
atgttgcgcg tggctacttc tatttctgat gaggggcgcg ctaagtatca cgacttccta 420
tcgaaagacg tgcgcaccat atacggcggg cttacctttt ggtcgccaaa tattaatatc 480
ttccgcgacc cgcgttgggg cagggggcaa gaaacctacg gtgaagaccc gttcttaacg 540
gggcgtatgg ccattaattt tgttaagggt attcaaggcg aaaacgacaa cagcgattac 600
ctaaaagccg tagcgacaat taagcactat gccgtacaca gcggccccga aaaaacgcgt 660
cattcggatg actaccatcc aacccgtaaa gatttattcg aaacctattt gcctgcattt 720
cgcatggcaa tagcagagac taacgtgcaa tcgttaatgt gtgcctacaa ccgtgtagat 780
ggggcacctg cctgtggcaa taatgaatta atgcaagaaa ttttgcgtgg cgatatgggc 840
tttaacggtt atgtcgtgtc tgactgtggc gccattgccg atttttacga gagtagatcg 900
caccacgtgg ttgactcacc tgcagaggct gcagcgtggg ccgttaaatc gggtaccgat 960
ttaaactgtg gcgattcaca tggcaatacc tacaccaacc tgcattacgc gttacagcaa 1020
ggtttaatta cagaagatta tattgatata gcggtaaagc gtttgtttaa agcgcgtatt 1080
aagcttggca tgtttgacga gcaagaccgc gtgccttaca gcgaaattgg tatggatgtt 1140
gtaggttcac ctaagcacct agcgctaacc caagaagcgg cagaaaaatc tattgtgctg 1200
ctaaaaaaca atggtgtatt gccattaaaa gcaggggtaa aggtagccgt aatagggcca 1260
aatgcagttg atgaagatgt attggtaggc aactaccacg gcgtaccagt gaaacctgtg 1320
ttgccgctag aggggattgt taatcgtgtt ggcgaggcca acgtatttta tgccccaggc 1380
agtgcacaaa tagccgatat atacagccac tacgaaccga taagtgcaga aaatttttat 1440
cataaagatg caaatggtaa tttagctgca ggcttaaaag cagagtatta cgccgattat 1500
tacaacgcag ctgaaattaa cgacgatacc tttagcgcaa ccccagcgtt aaatagaatt 1560
gatgcagata ttaatttctc ttggcctgta tcgcctattg ataattcgtt agatgatgaa 1620
tttagtgcag tatggacagg catacttaaa ccgaaaaagt cgggtagcta ccgtttctcg 1680
ggcacggttg cattagccat taacggcaaa cctgttaatg gggctgttaa cctaaaggca 1740
ggtgaaagct ataacataaa agctattttt ggcgtgcaaa aatggtggcc cgttaatgca 1800
atacacccgt acggaaaact tacttggcta gatgagtcgc gcgatttaga agaagaggca 1860
ttagctgctg cccgaaaagc cgatgtgatt atttttatgg gcggtataga tgcgcacctt 1920
gaaggcgaag aaatgccgct agagctagat ggctttactc acggtgatcg tacgcacatt 1980
aatttaccta aagtacaaac caatttgctt aaacaattaa aagcaacggg taaacctgtt 2040
gtaatggtta actttagtgg tagtgccatg gctttaaatt gggaaagcga aaagctagac 2100
gcaatactgc aagcgtttta cccaggtgaa gcaaccggta cagcgttagc taatattttg 2160
tggggcgatg taagcccgag tggccgctta cctgtaacct tttacaaagg cgtagacgat 2220
ctaccagcat ttaatgatta ccacatggaa aaccgcacct ataaatttta ccgcggtgag 2280
cctttgtatg catttggcca cggtttaggt tacgttgatt ttgcttataa caatttagtc 2340
gtagcaaata ctgcagaagc gggcaaagcg ctacctatag ctgtaagcgt aaccaatacc 2400
ggtaaaatgc aagcagaaga cgttgcccaa gtttatataa gtttgctaga tgcccccgca 2460
aacacgccca tccgcgattt aaaagcgttt aaacgtacca agcttgcggc aggcgaaagc 2520
accgagcttg aatttaactt gccggcgaga gtgcttacct atatagacga taatggtaaa 2580
acccaaacct atactggcag ggtagaagtt actgttggct ctgggcaaaa gggatacgta 2640
aaagaaaatg cgatagctgt agcgactatt aacgttcagt ag 2682
<210> 71
<211> 317
<212> PRT
<213> Microbulbifer degradans
<400> 71
Met Tyr Thr Tyr Val Ser Ala Ile Ala Leu Phe Ile Phe Ser Ile Ala
1 5 10 15
Ser Ser Cys Cys Val Ala Gln Asn Pro Leu Asp Phe Gly Ser Asn Ile
20 25 30
Lys Thr Ala Asp Pro Ser Gly His Ile Trp Ala Asp Gly Arg Met Tyr
35 40 45
Leu Tyr Thr Ser His Asp Gln Glu Cys Gln Glu Asp Phe Tyr Met Lys
50 55 60
Asp Trp His Thr Phe Ser Ser Ser Asp Leu Ile Asn Trp Thr Ala His
65 70 75 80
Gly Pro Ser Leu Ser Val Ala Asp Ile Thr Trp Ala Asp Asn Tyr Ala
85 90 95
Trp Ala Pro Asp Ala Ala Tyr Lys Asn Gly Lys Tyr Tyr Leu Phe Phe
100 105 110
Pro Ala Gly Thr Gly Val Lys Asp Arg Val Asn Pro Glu Lys Ser Thr
115 120 125
Lys Trp Met Gly Ile Gly Val Ala Val Ser Asp Ser Pro Thr Gly Pro
130 135 140
Phe Lys Asp Ala Ile Gly Ala Pro Leu Trp Thr Asp Pro Tyr Ala Asn
145 150 155 160
Asp Pro Ser Ile Phe Ile Asp Asp Asp Gly Lys Gly Tyr Leu Tyr Phe
165 170 175
His Gly Lys Gly Ala Asp Tyr Leu Val Ala Glu Met Ala Asp Asp Leu
180 185 190
Leu Ser Val Lys Gly Glu Phe His Lys Met Asp Met Gly Gly Tyr Glu
195 200 205
Pro Lys Met Glu Gly Pro Trp Val Phe Lys Arg Glu Gly Met Tyr Tyr
210 215 220
Phe Thr Met Pro Glu Asn Asn Arg Ser Leu Ala Tyr Tyr Met Ala Lys
225 230 235 240
Ser Pro Phe Gly Pro Trp Glu Tyr Lys Gly Ile Phe Met Gln Glu Glu
245 250 255
Gly Gly Asn Asn His His Ser Ile Val Gln Phe Lys Gly Lys Trp Ile
260 265 270
Leu Phe Tyr His Arg Trp Leu Met Gly Glu Gly Glu Cys Lys Lys Lys
275 280 285
Gln Arg His Thr Ala Ala Glu Tyr Leu His Phe Asn Ala Asp Gly Thr
290 295 300
Ile Lys Glu Val Lys Arg Thr Arg Glu Gly Leu Thr Lys
305 310 315
<210> 72
<211> 954
<212> DNA
<213> Microbulbifer degradans
<400> 72
atgtacacat atgtatccgc catagcacta tttatatttt caattgcctc gtcgtgttgt 60
gttgcccaaa acccgctcga ctttggcagt aatattaaaa ccgcagatcc gtctggccat 120
atatgggctg atggcagaat gtacctttac acctcgcacg accaagaatg ccaagaagat 180
ttttatatga aggattggca taccttttcg tccagcgact taataaattg gactgcccac 240
ggcccaagtt tatctgtagc ggatattacg tgggcagata actacgcatg ggcgcccgac 300
gcggcctata aaaatgggaa gtactatttg ttctttccgg cgggaaccgg tgttaaagat 360
agagtaaacc ccgaaaaaag cactaagtgg atgggcattg gtgttgcagt aagcgatagc 420
cctacaggcc cctttaaaga tgcgattggc gcccccttgt ggaccgaccc ctatgccaac 480
gacccaagta tttttataga tgatgacggc aagggctact tatattttca cggtaaaggt 540
gcagactacc tagtagccga aatggcagac gatttactga gtgtaaaagg tgagtttcac 600
aaaatggata tgggcggtta cgagccaaaa atggagggcc cttgggtttt taagcgcgag 660
ggaatgtatt actttaccat gccagaaaac aatcgttcac ttgcttacta tatggcgaaa 720
tctccctttg ggccgtggga atacaagggc atttttatgc aagaagaagg cggtaacaac 780
caccattcta ttgtgcaatt taaaggcaag tggatattgt tttatcaccg ctggttaatg 840
ggcgaaggcg agtgtaaaaa gaagcaacgc cacaccgcag cggaatacct tcactttaat 900
gccgacggca caattaaaga agtaaaaaga acgcgcgagg ggttaactaa gtag 954
<210> 73
<211> 577
<212> PRT
<213> Microbulbifer degradans
<400> 73
Met Lys Ile Lys Cys Leu Leu Leu Ala Val Tyr Ala Gly Leu Leu Ala
1 5 10 15
Ala Cys Ala Leu Asp Ala Pro Leu Lys Thr Ser Ser Lys Pro Leu Ala
20 25 30
His Phe Ser Trp Phe Glu Tyr Gln Gly Asn Asp Glu Ile Phe Lys Ala
35 40 45
Pro Leu Ala Ser Asn Gln Tyr Gln Asn Pro Ile Leu Ala Gly Tyr His
50 55 60
Pro Asp Pro Ser Ile Val Arg Val Gly Glu Asp Tyr Tyr Leu Val Asn
65 70 75 80
Ser Thr Phe Gly Phe Tyr Pro Gly Ile Pro Val Phe His Ser Arg Asp
85 90 95
Leu Val Asn Trp Thr Gln Leu Gly Asn Ala Ile His Arg Pro Glu Gln
100 105 110
Leu Ser Phe Asp Gly Ile His Leu Gly Tyr Asn Gly Val Tyr Ala Pro
115 120 125
Ala Ile Glu Tyr Arg Asp Gly Thr Phe Tyr Val Ile Asn Thr Cys Val
130 135 140
Ala Cys Gly Gly Asn Phe Ile Val Thr Ala Thr Asn Pro Ala Gly Pro
145 150 155 160
Trp Ser Asp Pro Ile Trp Leu Pro Glu Val Ile Gly Ile Asp Pro Ser
165 170 175
Leu Phe Phe Asp Glu Asp Gly Lys Thr Tyr Ile Val His His Arg Asn
180 185 190
Pro Pro Val Gln Lys Tyr Pro Ala His Thr Ala Leu Trp Val Met Glu
195 200 205
Val Asp Ser Lys Thr Phe Ala Pro Val Ser Asp Asp Val Met Leu Val
210 215 220
Asp Gly Gly Asp Glu Ala Pro Trp His Thr Glu Tyr Ile Glu Gly Pro
225 230 235 240
His Ile Tyr Lys Ile Asp Gly Thr Tyr Tyr Leu Tyr Ala Pro Gly Gly
245 250 255
Gly Thr Gly Tyr Phe His Gly Gln Leu Val Tyr Arg Ser Asp Asn Val
260 265 270
Phe Gly Pro Tyr Glu Ala Asn Pro Asn Asn Pro Val Leu Thr Gln Val
275 280 285
Gly Leu Pro Asp Asp Arg Glu His Pro Val Thr Ala Thr Gly His Ala
290 295 300
Asp Leu Phe Gln Asp Thr Asn Gly Asp Trp Trp Thr Val Phe Leu Gly
305 310 315 320
Thr Arg Val Tyr Asp Leu Ala Lys Pro Pro Gln Asp Pro Gly Asn Phe
325 330 335
Ala Thr Gly Arg Glu Thr Phe Met Leu Pro Val Thr Trp Gln Asn Gly
340 345 350
Trp Pro His Val Leu Glu Lys Gly Glu Ala Val Pro Tyr Arg Val Thr
355 360 365
Lys Pro Lys Leu Pro Ala Gly Lys Pro Ala Pro Arg Ala Met Thr Gly
370 375 380
Asn Phe Thr Val Arg Glu Glu Phe Thr Asn Ala Ser Leu Ala Pro His
385 390 395 400
Trp Leu Phe Val Arg Thr Pro Arg Ser Lys Trp Trp Gln Thr Gly Asn
405 410 415
Gly Glu Leu Ile Leu Glu Ala Arg Ala Asp Thr Ile Gly Ala Val Asn
420 425 430
Gln Pro Ser Phe Ile Gly Arg Arg Leu Ala His Met Thr Ala Ser Phe
435 440 445
Ala Thr Gln Leu Thr Phe Asn Pro His Thr Val Gly Asp Glu Ala Gly
450 455 460
Leu Leu Ala Val Gln Asn Asp Glu His Phe Tyr Ala Phe Gly Leu Gly
465 470 475 480
Leu Asn Ser Lys Gly Gln Thr Val Leu Arg Val Arg Lys Lys Ala Gly
485 490 495
Lys Asn Glu Ser Ile Arg Gly Asp Thr Val Ala Glu Gln Val Val Lys
500 505 510
Leu Lys His Gly His Pro Ile Tyr Leu Arg Val Asn Ile Gly Lys Ala
515 520 525
Glu Leu Asn Phe Ala Tyr Ser Thr Asn Gly Lys Arg Tyr Thr Thr Leu
530 535 540
Leu Asn Gln Ala Asp Ala Asn Leu Leu Thr Thr Ala Lys Ala Gly Gly
545 550 555 560
Phe Thr Gly Ala Val Val Gly Met Tyr Ala Glu Ser Thr Ala Gln Gln
565 570 575
Asn
<210> 74
<211> 1734
<212> DNA
<213> Microbulbifer degradans
<400> 74
atgaaaatta agtgcttact ccttgctgtt tacgcgggtc tacttgcggc ttgcgcgctg 60
gacgcgccgc tcaaaacctc aagtaaaccg ctagcgcatt tttcgtggtt tgaatatcaa 120
ggtaacgacg agatatttaa ggctccactc gcctcaaatc aataccaaaa ccccatactc 180
gccggctacc acccagaccc aagtattgtg cgagtaggcg aagattatta tttggtgaac 240
tccacctttg gcttctaccc tggcattcca gtatttcaca gccgtgactt agtgaattgg 300
acccaactgg gtaacgctat tcaccgccca gagcaacttt catttgatgg tattcactta 360
ggctacaacg gcgtttatgc accggcaatc gaataccgcg acgggacctt ttacgtaata 420
aatacctgcg tagcctgcgg aggaaatttt atcgttaccg ccaccaatcc cgcgggcccc 480
tggtcagacc caatatggct accagaggta attggcatag acccctcgct atttttcgac 540
gaggacggca aaacctatat cgtgcatcat cgtaatccac ctgtgcagaa ataccctgcc 600
cacacagccc tgtgggtaat ggaagttgac tccaaaacat ttgcgccggt atctgacgat 660
gtaatgcttg tggacggtgg cgacgaagcg ccatggcaca cagaatatat tgaagggccg 720
catatatata aaattgatgg cacctactac ctctatgccc ctggtggcgg cacgggatac 780
ttccacggcc aattggtgta tagatctgac aatgtatttg gaccctacga agccaacccc 840
aataaccctg tgttgactca agttggttta cccgacgaca gagaacaccc tgtaacggca 900
acgggtcatg cagatttatt tcaagatacc aacggcgact ggtggacggt atttctgggt 960
actcgcgttt acgatttagc taagccacca caagaccccg gcaattttgc caccggacgc 1020
gaaacattta tgttgccagt aacatggcaa aacggctggc cacacgtgct cgaaaaaggc 1080
gaggctgtgc cctaccgagt aaccaaaccc aaattacctg caggcaaacc cgccccgcgc 1140
gcaatgactg gaaactttac tgtgcgcgag gaatttacca acgcttcgct tgccccccac 1200
tggctatttg ttcgcacacc gcgttccaaa tggtggcaaa caggtaatgg cgaacttatt 1260
ttagaagcgc gcgccgatac cattggggca gttaaccagc cgtcgtttat tggccgacgg 1320
ctcgctcata tgacggcctc cttcgccacc caactaacct ttaacccaca caccgttggc 1380
gacgaagcag ggttactcgc cgtacaaaac gacgaacact tttacgcctt tggcctaggg 1440
ttaaacagta aagggcaaac cgttttgcgc gtgcgtaaaa aagcgggtaa aaatgaatcg 1500
ataaggggag atacggttgc cgagcaggtt gttaagctta agcacggcca ccctatttac 1560
ctgcgtgtaa atataggtaa agccgaatta aatttcgcgt atagcaccaa cggcaaacgc 1620
tacaccacct tgttgaacca agccgatgcc aacctactta ccacagctaa agcgggcggg 1680
tttactggcg cagtagtggg tatgtacgcc gaatccaccg cacaacaaaa ctaa 1734
<210> 75
<211> 566
<212> PRT
<213> Microbulbifer degradans
<400> 75
Met Arg Leu Leu Pro Ile Leu Leu Val Ser Leu Leu Pro Leu Leu Ser
1 5 10 15
Ser Cys Thr Ser Ala Ile Asn Gly Gln Gln Asn Ser Gln Thr Ser Pro
20 25 30
Val Phe Asp Trp Phe Glu Tyr Ala Gly Ser Asp Ala Leu Tyr Asn Thr
35 40 45
Val Ala Pro Ser Lys Asn Ala Tyr Thr Asn Pro Val Ile Lys Gly Phe
50 55 60
Tyr Pro Asp Pro Ser Ile Val Arg Val Gly Ala Asp Tyr Tyr Leu Val
65 70 75 80
Asn Ser Ser Phe Gly Tyr Phe Pro Gly Val Pro Ile Phe His Ser Thr
85 90 95
Asp Leu Val Asn Trp Val Gln Ile Gly Asn Ile Leu Glu Arg Pro Ser
100 105 110
Gln Leu Gln Ile Pro Ser Gly Met Gly Val Ser Arg Gly Ile Phe Ala
115 120 125
Pro Thr Leu Arg His His Asn Gly Ile Phe Tyr Met Ile Thr Thr Met
130 135 140
Val Asp Gly Gly Gly Asn Phe Ile Val Thr Ala Lys Asn Pro Ala Gly
145 150 155 160
Pro Trp Ser Asp Pro Val Trp Leu Pro Glu Val Gly Gly Ile Asp Pro
165 170 175
Asp Leu Phe Phe Asp Asp Asn Gly Lys Ala Tyr Ile Leu Asn Asn Asp
180 185 190
Ala Pro Ile Gly Glu Pro Leu Tyr Asp Gly His Arg Ala Ile Trp Ile
195 200 205
Arg Glu Phe Asp Leu Ala Thr Leu Lys Thr Val Gly Asp Ala Lys Leu
210 215 220
Ile Val Asn Gly Gly Val Asp Ile Thr Thr Lys Pro Val Trp Ile Glu
225 230 235 240
Gly Pro His Leu Phe Lys Asn Lys Gly Ala Tyr Tyr Leu Ile Asn Ala
245 250 255
Glu Gly Gly Thr Ser Val Asn His Ser Gln Val Val Phe Lys Ala Gln
260 265 270
Ser Pro Trp Gly Pro Tyr Ile Pro Trp Glu Asn Asn Pro Ile Leu Thr
275 280 285
Gln Arg His Leu Pro Ala Asp Arg Ala Asn Pro Val Thr Ser Val Gly
290 295 300
His Val Asp Leu Val Gln Thr Gln His Gly Asp Trp Trp Ala Val Phe
305 310 315 320
Leu Gly Cys Arg Pro Tyr Lys Asp Asn Tyr Tyr Asn Thr Gly Arg Glu
325 330 335
Thr Phe Leu Leu Pro Val Asp Trp Ser Gly Glu Tyr Pro Val Ile Leu
340 345 350
Arg Gly Asp Ala Glu Val Pro Tyr His His Gln Arg Pro Gln Leu Gly
355 360 365
Ala Ser Gln Gln Pro Ala Ile Ala Leu Ser Gly Asn Phe Ile Glu Arg
370 375 380
Asp Glu Phe Asp Ser Ala Leu Lys Leu Tyr Trp Arg Lys Val Arg Thr
385 390 395 400
Pro Thr Asn Asn Phe Thr Asp Leu Thr Ser Gln Lys Gly Lys Leu Val
405 410 415
Leu Thr Ala Asn Asn Thr Asp Leu Ser Asp Phe Gly Ser Pro Ala Phe
420 425 430
Ile Ala Arg Ala Gln Gln His Leu Thr Gly Ser Ala Thr Thr Lys Leu
435 440 445
Val Tyr Thr Pro Pro His Val Gly Asp Lys Ala Gly Ile Ala Ala Phe
450 455 460
Gln Asn Asp Glu Tyr Phe Tyr Ala Leu Thr Val Thr Lys Asn Asn Ser
465 470 475 480
Gly Leu Ala Ile Gln Leu Glu Lys Gln Leu Gly Lys Asn Lys Glu Ile
485 490 495
Val Ala Gln Tyr Pro Leu Gln Glu Lys Thr Leu Arg Asn Gly Leu Tyr
500 505 510
Leu Lys Ile Glu Phe Asn Asn Asp Lys Tyr Asp Phe Ser Tyr Ser Thr
515 520 525
Asn Asn Thr Lys Trp Gln Ser Val Gly Glu Thr Gln Asp Gly Thr Ile
530 535 540
Leu Ser Thr Gln Ser Ala Gly Gly Phe Val Gly Ala Thr Leu Gly Ile
545 550 555 560
Phe Ala Tyr Thr Ala His
565
<210> 76
<211> 1701
<212> DNA
<213> Microbulbifer degradans
<400> 76
atgcgacttt tacctatctt actcgttagc ttacttccac tgctctcaag ctgcacaagc 60
gccataaacg ggcaacaaaa tagccaaacc tcgcctgtat ttgattggtt tgaatacgcg 120
ggaagcgatg ctttatacaa cacggttgcg ccaagtaaaa atgcctatac caacccagta 180
ataaaagggt tttatccaga tccaagcatt gtaagagtgg gagcagatta ctacctcgtg 240
aactcttcat ttggctactt ccctggcgtg ccgatatttc atagcacaga tttagtgaat 300
tgggttcaaa taggtaatat tctcgagcgc ccatcacaat tacaaatacc cagcggcatg 360
ggtgtgtcgc gaggtatatt cgccccaaca ctgcgccacc acaacggtat tttttacatg 420
attactacaa tggtagacgg tggcggcaat tttattgtta ctgcaaaaaa ccccgcaggc 480
ccttggtcgg acccagtatg gttacctgaa gtgggcggta tagacccaga tttatttttt 540
gatgacaacg gcaaagccta catacttaac aacgacgccc ccattggcga gccgctttac 600
gatggccacc gagccatttg gattcgcgaa ttcgacttag ccacattaaa aaccgttggc 660
gacgccaagt taatagtaaa cggcggtgta gatataacta ccaaacccgt ttggatagaa 720
ggcccacacc ttttcaaaaa taaaggcgct tactatttaa ttaatgcaga aggtggcacc 780
agcgtgaatc acagccaagt tgtatttaaa gcgcaaagcc cttgggggcc gtatattcct 840
tgggaaaaca atccaatttt aacacagcgc catttaccgg ctgatcgcgc caaccccgtc 900
acatccgttg gccatgtcga tttagtacaa actcaacatg gcgactggtg ggcggtattt 960
ttaggctgca ggccctataa agataactac tacaataccg gccgcgaaac atttttatta 1020
ccggtagatt ggtctggcga ataccccgtc attcttcgcg gcgatgccga ggtgccctat 1080
catcaccaac gcccccaatt gggagcatcc caacaaccag ccattgccct tagcggtaac 1140
tttattgagc gcgatgaatt tgactcagca cttaaacttt attggcgcaa ggttcgcacc 1200
cccacaaaca actttacaga tttaacctct caaaaaggca agcttgtttt aactgcaaac 1260
aatacagatt taagcgactt tggatcacca gcatttattg cgcgcgcaca gcagcaccta 1320
acaggcagcg caacaaccaa actggtttac acacccccac acgtgggcga caaagcgggt 1380
attgctgcct ttcaaaacga tgagtatttt tatgcgctta ccgttacaaa aaataatagc 1440
ggccttgcca tacaactaga aaaacaactt ggcaagaaca aagaaattgt tgcgcaatat 1500
ccactacaag aaaaaacgct tcgcaatggc ttatatttga aaatagaatt taataacgac 1560
aaatatgatt tcagctattc cacaaataac accaagtggc aatcggtagg cgaaacacaa 1620
gatggaacta tattaagcac gcaaagtgca ggcgggtttg taggtgccac gctaggtata 1680
tttgcatata ccgcgcacta a 1701
<210> 77
<211> 319
<212> PRT
<213> Microbulbifer degradans
<400> 77
Met Ser Met Phe Asn Lys Lys Thr Leu Ala Ala Gly Ile Val Ala Ala
1 5 10 15
Cys Leu Thr Asn Val Ser Ala Ser Tyr Ala Ala Asn Pro Ala Ile Thr
20 25 30
Asp Thr His Thr Ala Asp Pro Ala Ala Leu Val His Gly Asp Thr Val
35 40 45
Tyr Leu Tyr Val Gly Asn Asp Glu Ala Lys Asp Asn Arg Val Phe Tyr
50 55 60
Asp Leu Lys Lys Trp Leu Val Tyr Ser Ser Lys Asp Met Val Asn Trp
65 70 75 80
Thr Asn His Gly Ser Pro Leu Ala Ala Thr Asp Phe Lys Trp Ala Ser
85 90 95
Gly Asp Ala Trp Ala Ala His Thr Val Glu Lys Asp Gly Lys Phe Tyr
100 105 110
Trp Tyr Thr Thr Val Arg His Ala Thr Ile Asn Gly Phe Ala Ile Gly
115 120 125
Val Ala Val Ser Asp Ser Pro Thr Gly Pro Phe Lys Asp Ala Leu Gly
130 135 140
Lys Ala Leu Ile Ser Asn Asp Met Thr Thr Asp Thr Asp Ile Asp Trp
145 150 155 160
Asp Asp Ile Asp Pro Ala Val Phe Ile Asp Asp Asp Gly Gln Ala Tyr
165 170 175
Ile Phe Trp Gly Asn Thr Lys Pro Arg Trp Ala Lys Leu Lys Pro Asn
180 185 190
Met Ile Glu Leu Asp Gly Pro Ile His Ala Ile Asp Ile Pro His Phe
195 200 205
Thr Glu Ala Leu Tyr Val His Lys His Gly Glu Tyr Tyr Tyr Leu Ser
210 215 220
Tyr Ala Thr Gly Phe Pro Glu Lys Thr Ala Tyr Ala Met Ser Lys Ser
225 230 235 240
Ile Glu Gly Pro Trp Glu Tyr Lys Gly Ile Leu Asn Glu Leu Ala Gly
245 250 255
Asn Ser Asn Thr Asn His Gln Ser Val Ile Asp Phe Lys Gly Lys Ser
260 265 270
Tyr Phe Ile Tyr His Asn Gly Gly Leu Gly Gln Asp Gly Gly Ser Phe
275 280 285
Arg Arg Ser Val Cys Ile Asp Tyr Leu Asn Tyr Asn Ala Asp Gly Thr
290 295 300
Ile Lys Arg Ile Val Met Thr Ser Glu Gly Val Asp Pro Val Lys
305 310 315
<210> 78
<211> 960
<212> DNA
<213> Microbulbifer degradans
<400> 78
gtgagtatgt ttaataaaaa aacactagca gccggtattg tagctgcatg tttaactaac 60
gtaagtgcaa gctatgctgc caaccccgca attaccgata ctcacacggc cgatcccgct 120
gcgttagtgc acggcgatac cgtttatttg tacgtgggta acgatgaagc gaaggataac 180
cgcgtatttt acgatcttaa aaaatggttg gtgtattcat caaaagatat ggtgaactgg 240
accaatcacg gttcgccgtt agctgcaacg gattttaagt gggccagcgg cgatgcgtgg 300
gcggcgcaca cggtagaaaa agatggcaag ttttattggt ataccacggt gcgtcacgca 360
accattaatg gttttgccat tggcgttgca gtaagtgata gccctacagg gccattcaaa 420
gatgctttgg gtaaagcact aataagtaat gacatgacca ccgataccga tattgattgg 480
gacgatatag acccagcagt atttattgac gacgatggcc aagcgtatat tttttggggc 540
aacaccaaac cgcgctgggc caagttaaaa cccaatatga ttgaactaga tggacctatt 600
cacgcaatcg atattccaca ctttaccgaa gcgctatacg tgcacaaaca cggtgaatat 660
tactacttaa gctatgcgac aggctttcca gaaaaaacag cttacgctat gagcaaatct 720
atagaagggc cgtgggaata caaaggcatt cttaatgaat tggctggtaa ctcaaatact 780
aatcaccaat ctgtcatcga ttttaagggc aagtcatact ttatttatca caatggtggc 840
ttgggtcaag atggcggtag cttccgtcgc agtgtatgta tcgattattt gaactacaac 900
gcggatggta ctatcaagcg aattgtaatg acatcagaag gtgtagaccc agttaaataa 960
<210> 79
<211> 385
<212> PRT
<213> Microbulbifer degradans
<400> 79
Met Pro Glu His Thr Arg Lys Arg Leu Leu Ser Thr Leu Gly Leu Ala
1 5 10 15
Leu Ser Gly Thr Ala Ile Thr Leu Thr Leu Val Gly Cys Gly Lys Asp
20 25 30
Asn Pro Ala Thr Gln Thr Glu Gly Ser His Ser Ala Gly His Thr Glu
35 40 45
Val Ala Ala Glu Gln Thr His Asp Ile Gly Gly Pro Gly Pro Glu Gly
50 55 60
Lys Pro Ile Asn Asp Pro Leu Val Thr His Ile Tyr Thr Ala Asp Pro
65 70 75 80
Ser Ala His Val Phe Asp Gly Lys Leu Tyr Ile Tyr Pro Ser His Asp
85 90 95
Val Glu Ala Gly Ile Pro Gln Asn Asp Asn Gly Asp His Phe Asp Met
100 105 110
Arg Asp Tyr His Val Leu Ser Met Glu Glu Pro Gly Gly Lys Val Thr
115 120 125
Asp His Gly Val Ala Leu Ala Arg Glu Asp Val Ala Trp Ala Gly Arg
130 135 140
Gln Leu Trp Ala Pro Asp Ala Ala Glu Lys Asp Gly Thr Tyr Tyr Leu
145 150 155 160
Tyr Phe Pro Met Lys Asp Lys Asp Asp Ile Phe Arg Ile Gly Val Ala
165 170 175
Ser Gly Ser Thr Pro Tyr Gly Pro Phe Lys Ala Glu Pro Glu Pro Met
180 185 190
Pro Gly Ser Tyr Ser Ile Asp Pro Ser Val Phe Gln Asp Gly Asp Asp
195 200 205
Tyr Tyr Met Tyr Ile Gly Gly Ile Trp Gly Gly Gln Leu Gln Arg Trp
210 215 220
Thr Thr Gly Glu Tyr Asn Pro Glu Asp Val Tyr Pro Ala Asp Asp Glu
225 230 235 240
Pro Ala Leu Leu Pro Lys Met Ala Lys Leu Ser Ala Asp Met Lys Ser
245 250 255
Phe Ala Glu Pro Leu Arg Asp Ile Gln Ile Leu Asp Glu Asn Gly Glu
260 265 270
Leu Ile Lys Ala Gly Asp Asn Asp Arg Arg Phe Phe Glu Ala Ala Trp
275 280 285
Val His Lys Tyr Asn Gly Lys Tyr Tyr Leu Ser Tyr Ser Thr Gly Asp
290 295 300
Thr His Tyr Ile Val Tyr Ala Ile Gly Asp Asn Pro Tyr Gly Pro Phe
305 310 315 320
Thr Tyr Gln Gly Val Val Leu Asn Pro Val Ile Gly Trp Thr Asn His
325 330 335
His Ser Ile Ala Glu Phe Lys Gly Lys Trp Tyr Leu Phe Tyr His Asp
340 345 350
Ser Ser Leu Ser Gly Gly Val Thr His Leu Arg Ser Val Lys Met Thr
355 360 365
Glu Leu Thr His Asn Pro Asp Gly Thr Ile Gln Thr Ile Asn Ala Tyr
370 375 380
Lys
385
<210> 80
<211> 1158
<212> DNA
<213> Microbulbifer degradans
<400> 80
atgccagaac atacgcgtaa gcgcttacta tcaaccttag gcctagcttt atcgggcaca 60
gctataaccc taacgcttgt ggggtgcggt aaagacaacc ccgcaactca aacagaaggc 120
agccacagcg ctggccatac agaagttgcc gcagaacaaa cacacgacat aggcggccca 180
ggccctgagg gcaagccaat taacgacccg cttgttaccc acatatacac cgcagaccct 240
tctgcccatg tgtttgacgg caaactttat atttacccat cgcacgatgt ggaagcgggt 300
attccgcaaa acgataacgg cgatcacttc gatatgcgcg attatcacgt gctttccatg 360
gaagagcctg gtggcaaagt caccgatcac ggcgtagccc ttgcgcgcga agatgtagct 420
tgggctggtc gccaactgtg ggcgcccgat gcggctgaaa aagacggcac ttactacctg 480
tatttcccca tgaaagataa ggatgacatc ttccgcattg gtgtcgccag tggcagtacc 540
ccttatggcc catttaaagc cgagccagag ccaatgcccg gcagctatag catagaccca 600
agcgtatttc aggatggcga cgactactac atgtacatag gtggtatttg gggcggccag 660
ttgcagcgtt ggacaaccgg tgagtacaac ccagaagatg tatacccagc ggatgacgag 720
cctgcgctat tacctaaaat ggccaagcta agtgcagata tgaaaagctt tgccgagcca 780
ttaagagaca ttcaaatttt ggatgaaaat ggcgagctaa ttaaagctgg cgataacgac 840
cgacgtttct tcgaagccgc gtgggtacac aaatataacg gcaagtatta cttgagctat 900
tcaaccggtg acacccacta tattgtgtat gccattggcg ataacccata cggcccgttt 960
acttaccagg gtgtagtgct caaccccgtt attggttgga ctaaccatca ctcaattgct 1020
gaatttaaag gtaagtggta tttgttctac cacgatagtt cgctttccgg tggtgtaaca 1080
catttgcgca gcgtgaaaat gacagagcta actcacaacc cagatggcac tatccaaacc 1140
attaatgcct ataagtaa 1158
<210> 81
<211> 738
<212> PRT
<213> Microbulbifer degradans
<400> 81
Met Ala Thr Leu Gly Val Asn Ala Ala Lys Phe Ala Met Phe Ala Ala
1 5 10 15
Ile Cys Leu Gln Phe Ser Val Ala Glu Ala Ala Lys Ser Arg Asp Gly
20 25 30
Tyr Gly Leu Trp Leu Asp Tyr Gln Pro Ile Thr Asn Thr Arg Glu Arg
35 40 45
Glu Gly Tyr Ile Lys Ala Leu Ser Pro Trp Gln Val Glu Gly Glu Ala
50 55 60
Ala Thr Ala Asp Phe Ile Arg Gln Glu Leu Thr Ala Ala Leu Gly Ala
65 70 75 80
Met Leu Gly Val Glu Ala Gly Pro Val Gly Asp Tyr Thr His Asn Ser
85 90 95
Leu Ala His Pro Val Ala Arg Leu Leu Val Ala Thr Pro Glu Glu Ser
100 105 110
Ala Val Ile Arg Ser Leu Ala Leu Gly Asp Ala Leu Thr Arg Val Gly
115 120 125
Gln Glu Gly Tyr Leu Ile Lys Thr Thr Arg Tyr Arg Asp Lys Pro Ile
130 135 140
Thr Ile Val Thr Ala Asn Thr His Ala Gly Leu Leu Tyr Gly Thr Phe
145 150 155 160
Lys Leu Leu Gln Leu Leu Gln Thr Gly Gln Ala Val Ser Asn Leu Ala
165 170 175
Ile Glu Ser Ala Pro Ala Thr Lys Leu Arg Val Leu Asn His Trp Asp
180 185 190
Asn Leu Asp Arg Tyr Val Glu Arg Gly Tyr Ala Gly Glu Ser Ile Trp
195 200 205
Asn Trp His Lys Leu Pro His Tyr Lys Ser Gln Arg Tyr Tyr Asp Tyr
210 215 220
Ala Arg Ala Asn Ala Ser Ile Gly Ile Asn Gly Val Val Leu Asn Asn
225 230 235 240
Val Asn Ala Asp Pro Leu Ile Leu Thr Pro Gln Tyr Leu Val Lys Val
245 250 255
Lys Ala Leu Ala Asp Ile Phe Arg Pro Tyr Gly Ile Lys Val Tyr Leu
260 265 270
Ser Val Lys Phe Ser Ser Pro Asn Leu Ile Gly Gly Leu Pro Thr Ser
275 280 285
Asp Pro Leu Asp Lys Asn Val Gln Ala Trp Trp Gln Ala Lys Ala Asn
290 295 300
Glu Ile Tyr Ser Leu Ile Pro Asp Phe Gly Gly Phe Leu Val Lys Ala
305 310 315 320
Asn Ser Glu Gly Gln Pro Gly Pro Gly Asp Phe Gly Arg Ser His Ala
325 330 335
Gln Gly Ala Asn Met Leu Ala Asp Ala Leu Ala Pro His Gly Gly Asn
340 345 350
Val Met Trp Arg Ala Phe Val Tyr Asn Val Glu Ala Asn Val Glu Arg
355 360 365
Ser Lys Gln Ala Tyr Asn Glu Phe Lys Pro Leu Asp Gly Thr Phe Arg
370 375 380
Gln Asn Val Leu Val Gln Val Lys Asn Gly Pro Ile Asp Phe Gln Pro
385 390 395 400
Arg Glu Pro Phe Ser Pro Leu Phe Gly Ala Met Pro Lys Thr Pro Leu
405 410 415
Met Met Glu Phe Gln Ile Thr Gln Glu Tyr Leu Gly Phe Ser Thr His
420 425 430
Leu Val Tyr Leu Gly Pro Leu Tyr Glu Glu Val Leu Lys Ala Asp Thr
435 440 445
Tyr Ala Lys Gly Ala Gly Ser Thr Val Ala Lys Val Val Asp Gly Ser
450 455 460
Leu Tyr Gly His Gly Ile Thr Gly Met Ala Gly Val Ala Asn Ile Gly
465 470 475 480
Ser Asp Arg Asn Trp Thr Gly His Ile Phe Gly Gln Ala Asn Trp Tyr
485 490 495
Val Phe Gly Gln Leu Ala Trp Asn Pro Glu Val Ser Thr Lys Gln Ile
500 505 510
Ala Asp Asp Trp Ile Arg Met Thr Leu Thr Arg Asp Asp Lys Ala Val
515 520 525
Asn Thr Ile Arg Ala Met Met Met Ala Ser Arg Glu Thr Ala Val Asn
530 535 540
Tyr Met Thr Pro Leu Gly Leu His His Ile Met Gly Trp Gly His His
545 550 555 560
Tyr Gly Pro Ala Pro Trp Ile Gly Glu Gln Lys Pro Asp Trp Met Arg
565 570 575
Glu Asp Trp Thr Ser Val Tyr Tyr His Ser Ala Asn Ala Thr Gly Leu
580 585 590
Gly Lys Asp Arg Thr Ala Ser Gly Ser Asn Val Ile Ala Gln Tyr His
595 600 605
Ala Pro Leu Arg Gln Ala Tyr Ser Asp Pro Lys Thr Thr Pro Thr Glu
610 615 620
Leu Leu Leu Trp Phe His His Leu Pro Trp His Tyr Glu Leu Ala Asn
625 630 635 640
Gly Asn Ser Leu Trp His Glu Leu Val Ala Arg Tyr Tyr Leu Gly Ala
645 650 655
Gln Ala Val Ala Glu Met Ala Lys Thr Trp Asp Gly Leu Glu Ala Asn
660 665 670
Ile Pro Pro Gln Leu Phe Lys Gln Val Gln Met Ala Leu Ala Ile Gln
675 680 685
Thr Gln Glu Ala Ala Trp Trp Arg Asp Ala Cys Val Leu Tyr Phe Gln
690 695 700
Ser Tyr Ser Lys Gln Ser Leu Pro Glu Gly Phe Ala Lys Pro Lys His
705 710 715 720
Ser Leu Glu Tyr Tyr Lys Gly Leu Ser Phe Pro His Ala Pro Gly Asp
725 730 735
Gly Arg
<210> 82
<211> 2217
<212> DNA
<213> Microbulbifer degradans
<400> 82
atggctactt tgggggtaaa tgccgctaag tttgccatgt ttgcagctat ttgcttgcag 60
tttagtgtcg ccgaagcggc taaaagccgc gatggatatg ggctgtggtt agattaccag 120
ccaattacca atacccgcga acgcgagggc tatataaaag cattaagccc atggcaggta 180
gaaggcgaag ctgcaactgc cgattttatt cggcaagagc ttactgcagc gttgggcgct 240
atgcttggcg ttgaggctgg tccagtgggt gattacaccc ataactccct cgctcaccct 300
gtggcgcggc tattggttgc aactccagaa gaaagcgctg ttattcgctc tttggcttta 360
ggcgatgctt taactcgagt agggcaagag gggtacctta ttaaaaccac gcgttaccgt 420
gacaagccta tcaccattgt taccgcgaac acgcatgcag gcctgctgta tggcacattc 480
aaactactgc agctgctgca aacagggcag gccgtttcta atttagctat tgagtccgcc 540
ccagcaacca aactgcgtgt gcttaaccac tgggataacc tcgatcgcta tgtggagcgc 600
ggctatgccg gtgagtctat ttggaactgg cacaagctgc cgcactacaa atcgcagcgc 660
tactacgatt acgctcgcgc taacgcgtcc attggtatta acggtgtggt actaaacaat 720
gttaacgccg accccttaat tcttaccccg cagtaccttg taaaagtaaa agcactggca 780
gatattttta ggccctacgg cattaaagtt tatctttcgg tgaagtttag ctcgccgaat 840
cttattggcg ggctgccaac atccgacccg ttagataaaa atgtgcaagc ttggtggcaa 900
gcgaaagcga atgaaattta ctcgctcatt cccgactttg gtggcttttt agtaaaagcg 960
aattcggaag ggcagcccgg cccaggggac tttggccgca gccatgcaca aggggcaaat 1020
atgttggccg atgcactggc accccatggc ggcaatgtaa tgtggcgcgc gtttgtatat 1080
aacgtagaag ccaatgtgga gcgatccaag caggcataca acgaatttaa gccattagac 1140
ggtaccttta ggcaaaacgt attggtgcaa gtaaaaaatg ggccaattga ttttcagcca 1200
cgtgaaccgt ttagcccgct gtttggtgct atgcccaaaa cgccgttaat gatggagttt 1260
caaattactc aggagtactt ggggtttagt actcaccttg tttacttggg gccgctgtac 1320
gaagaagtac ttaaggccga tacctatgcg aagggggcag ggtctactgt tgcgaaggtg 1380
gtcgatggct cgctctacgg gcacggtata acgggtatgg ctggggtagc taatattggc 1440
agcgatcgca attggaccgg ccatattttc ggccaagcca actggtatgt atttggccaa 1500
ttggcgtgga accccgaggt aagcactaag caaatagccg atgattggat tcgcatgaca 1560
ctcacccgcg acgataaagc ggtaaacacc attcgcgcaa tgatgatggc cagccgcgaa 1620
acggcggtta actacatgac gcccctgggg ctgcatcaca ttatggggtg ggggcaccac 1680
tacggcccag cgccgtggat aggcgagcaa aaacccgatt ggatgcgtga agattggaca 1740
tctgtttact atcatagcgc aaacgccaca gggctaggca aagatagaac agcttctggc 1800
agcaatgtca tagcgcaata ccacgcccct ttacggcagg cctatagcga cccgaaaacc 1860
acgcccaccg agttgctatt gtggtttcat catttgcctt ggcattatga attagcgaat 1920
ggcaatagcc tgtggcatga actggtagcg cgttactatt taggcgcgca ggctgtggca 1980
gaaatggcca aaacgtggga tggcctagaa gctaatatcc ccccgcagct attcaaacaa 2040
gtacaaatgg cgctggctat tcaaacccaa gaagccgcgt ggtggcgcga tgcctgcgtg 2100
ctgtattttc aaagctattc taagcagtcg ctacccgagg gctttgcaaa acctaagcac 2160
tcgctcgaat actataaagg gttaagcttc ccgcatgcgc cgggtgacgg gcgttaa 2217
<210> 83
<211> 1316
<212> PRT
<213> Microbulbifer degradans
<400> 83
Met Arg Asn Lys Leu Gly Ser Met Leu Lys Met Ser Ala Ala Ile Gly
1 5 10 15
Gly Leu Val Ala Ala Gly Ser Ala Val Ala Gly Pro Val Gly Phe Ala
20 25 30
Ser Leu Asn Gly Gly Thr Thr Gly Gly Ala Gly Gly Gln Val Val Tyr
35 40 45
Ala Ser Thr Gly Ala Glu Ile Asn Gln Ala Met Cys Asn Arg Ala Ser
50 55 60
Asp Asp Thr Pro Leu Ile Ile Tyr Val Thr Gly Thr Ile Asn His Gly
65 70 75 80
Asn Thr Ala Lys Tyr Ser Gly Ser Cys Asp Thr Thr Ala Asp Glu Ile
85 90 95
Gln Phe Lys Gly Val Lys Asn Ile Ser Leu Ile Gly Thr Gly Ser Gly
100 105 110
Ala Val Phe Asp Gln Ile Gly Ile His Leu Arg Asp Thr Ser Asn Ile
115 120 125
Ile Leu Gln Asn Leu His Ile Lys Asn Val Lys Lys Ser Gly Ser Pro
130 135 140
Thr Ser Asn Gly Gly Asp Ala Ile Gly Met Glu Ser Gly Val Tyr Asn
145 150 155 160
Val Trp Val Asp His Cys Glu Leu Glu Ala Ser Gly Gly Glu Ser Asp
165 170 175
Gly Tyr Asp Ser Leu Leu Asp Met Lys Ala Thr Thr Gln Tyr Val Thr
180 185 190
Val Ser Tyr Thr Tyr Tyr His Asp Ser Gly Arg Gly Gly Leu Met Gly
195 200 205
Ser Ser Asp Ser Asp Asp Thr Asn Thr Phe Val Thr Phe His His Asn
210 215 220
Tyr Tyr Glu Asn Met Asp Ser Arg Leu Pro Leu Leu Arg His Gly Thr
225 230 235 240
Ala His Ala Phe Asn Asn Tyr Tyr Asn Gly Ile Ala Lys Ser Gly Met
245 250 255
Asn Pro Arg Ile Gly Gly Gln Ile Lys Ala Glu Asn Asn Tyr Phe Glu
260 265 270
Asn Ala His Asn Pro Ile Gly Thr Phe Tyr Thr Asp Asp Met Gly Tyr
275 280 285
Trp Asp Leu Arg Gly Asn Ile Phe Gly Ser Asn Val Thr Trp Ala Ser
290 295 300
Ala Asp Asp Glu Thr Pro Ala Gly Pro Asn Pro Thr Ser Thr Thr Ser
305 310 315 320
Ile His Ile Ser Tyr Pro Tyr Asp Leu Asp Asp Ala Ala Cys Val Pro
325 330 335
Asp Ile Val Lys Ser Thr Ala Gly Val Gly Thr Gly Leu Ala Val Ser
340 345 350
Asp Gly Ser Cys Thr Ile Thr Thr Pro Pro Ser Thr Ser Ser Ser Ser
355 360 365
Ser Ser Ser Ser Ser Thr Ser Ser Thr Gly Ser Ser Ser Ser Ser Ser
370 375 380
Ser Ser Ser Ser Ser Ser Ser Ser Ser Asn Gly Gly Ser Leu Val Leu
385 390 395 400
Gly Asn Asn Leu Ser Ile Gly Ala Gly Ser Asp Gly Ser Ser Lys Gly
405 410 415
Ala Gly Ser Tyr Gly Asn Val Arg Asp Gly Asp Val Ser Ser Tyr Trp
420 425 430
Ala Pro Ser Gly Ser Thr Gly Arg Val Ser Ile Lys Trp Ser Gly Ser
435 440 445
Gln Thr Val Asn Ala Ile Val Ile Lys Glu Ala Ala Gly Tyr Glu Gly
450 455 460
Asn Ile Ser Gly Trp Gln Val Thr Asp Asn Asp Thr Gly Ala Val Leu
465 470 475 480
Ala Ala Gly Ser Ser Val Gly Thr Ile Thr Phe Asp Ala Val Thr Thr
485 490 495
Ser Lys Ile Asn Phe Glu Ile Thr Ser Ser Asn Gly Thr Pro Thr Val
500 505 510
Ala Glu Phe Glu Thr Tyr Asn Ala Thr Gly Ser Ser Ser Ser Ser Ser
515 520 525
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
530 535 540
Ser Ser Ser Ser Ser Ser Ser Thr Gly Gly Thr Ala Thr Leu Ser Thr
545 550 555 560
Thr Val Ser Gly Asp Gln Val Thr Leu Asn Trp Ser Val Asn Asn Ala
565 570 575
Thr Val Thr Gly Gln Gln Ile Tyr Arg Asp Val Asp Ser Asp Pro Ala
580 585 590
Gly Arg Val Arg Ile Ala Ser Gly Val Thr Gly Asn Thr Tyr Thr Asp
595 600 605
Thr Gly Leu Ala Asn Gly Thr Tyr Tyr Tyr Trp Val Lys Val Thr Asp
610 615 620
Ser Asn Ser Ala Thr Ile Asn Ser Asn Tyr Ser Glu Ala Gln Val Asn
625 630 635 640
Val Tyr Thr Thr Ser Thr Thr Thr Phe Glu Glu Asp Ala Gly Tyr Cys
645 650 655
Ser Val Asp Gly Ser Val Asp Ser Asn Asn Ser Gly Phe Ala Gly Ser
660 665 670
Gly Phe Ala Asn Thr Asp Asn Ala Ser Gly Asn Gly Val Asn Tyr Ala
675 680 685
Val Ser Val Pro Val Ala Gly Val Tyr Thr Leu Gln Val Arg Phe Ala
690 695 700
Asn Gly Ser Ser Ala Arg Pro Ala Asp Val Leu Val Asn Tyr Gly Asn
705 710 715 720
Ala Gly Val Phe Asp Leu Pro Ser Thr Gly Ser Trp Thr Ser Trp Ser
725 730 735
Asn Ser Asn Glu Ile Ser Val Asn Leu Val Ala Gly Asn Asn Ile Ile
740 745 750
Arg Leu Glu Ala Thr Thr Ser Gly Gly Leu Ala Asn Ile Asp Ser Leu
755 760 765
Ser Val Thr Gly Val Glu Pro Ser Ala Gly Asp Cys Asn Gly Ser Val
770 775 780
Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Thr Ser
785 790 795 800
Ser Ser Ser Thr Ser Ser Gly Gly Ser Ser Thr Ser Ser Ser Ser Thr
805 810 815
Ser Ser Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Thr Ser Ser Ser
820 825 830
Ser Thr Ser Ser Thr Ser Ser Ser Ser Gly Gly Gly Thr Ala Ser Cys
835 840 845
Glu Gln Leu Ile Asn Asp Pro Ser Val Asn Trp Asp Glu Ser Ala Leu
850 855 860
Ala Ser Glu Gln Glu Ile Val Ala Cys Leu Ala Gln Ser Leu Gly Ser
865 870 875 880
Pro Val Gly Phe Gly Glu Gly Thr Thr Gly Gly Tyr Asp Pro Ser Gly
885 890 895
Gly Ser Asn Leu Val Val Ile Lys Lys Asn Ile Gly Ile Ser Val Glu
900 905 910
Gln Gln Ile Leu Asp Ala Ile Ser Thr Glu Asn His Asn Trp Ile Val
915 920 925
Phe Asp Lys Asp Asp Phe Ala Ala Arg Thr Ala Val Ala Met Tyr Arg
930 935 940
Leu Asp Cys Asp Asn Ala Asp Val Arg Ser Ala Leu Gly Gly Ala Ser
945 950 955 960
Ala Ala Gln Cys Arg Asp His Ile Ala Trp Cys Ser Ala Asn Gly Ile
965 970 975
Ser Asp Glu His Asp Cys Glu Asn Glu Phe Phe Asn Asn Arg Leu Asn
980 985 990
Asp Ser Asp Leu Pro Ile Arg Asn Gln Met Ile Gln Ser Asn Thr Thr
995 1000 1005
Ile Asp Gly Arg Gly Ala Asn Ala Tyr Phe Phe Phe Asn Gly Phe Ser
1010 1015 1020
Ile Gly Lys Asp Ser Ser Gly Ala Ser Leu Tyr Ala Ala Gln Asn Val
1025 1030 1035 1040
Ile Val Thr Asn Asn Glu Phe Ile Gly Ala Gly His Thr Glu Asp His
1045 1050 1055
Asp Leu Asp Pro Asp Met Ile Arg Ser Thr Gly Glu Ser Asn Lys Ile
1060 1065 1070
Trp Ile His Gln Asn Thr Phe Asp His Thr Gly Asp Ser Ala Phe Asp
1075 1080 1085
Val Lys Val Gly Ala Tyr Asp Ile Thr Ile Ser Phe Asn Lys Leu Val
1090 1095 1100
Asn Val Lys Arg Ala Ala Leu His Gly Ser Ser Asp Ser Arg Ala Ile
1105 1110 1115 1120
Asn Ser Gln Ile Thr Thr Thr Met His Asn Asn Leu Phe Tyr Thr Ser
1125 1130 1135
Asp Asp Gln Tyr Ala Leu Ser Thr Tyr Asp Thr Leu Arg Arg Val Pro
1140 1145 1150
Leu Met Arg Arg Gly Gln Ser His Met Phe Asn Asn Val Phe Tyr Gly
1155 1160 1165
Tyr Arg Lys Asp Ile Leu Ser Val Arg Val Gly Gly Arg Ile Ala Phe
1170 1175 1180
Glu Asp Asn Ile Ile Leu Asn Lys Glu Ser Ser Ser Thr Pro Gly Asp
1185 1190 1195 1200
Gly Leu Lys Lys Gly Asp Asp Met Glu Tyr Tyr Val Glu Thr Leu Leu
1205 1210 1215
Arg Asp Phe Arg Glu Gly Gly Leu Glu Ile Ser Gly Ser Tyr Val Ser
1220 1225 1230
Phe Ala Asp Ser Ala Cys Asn Ser Tyr Gly Ala Ser Gly Asp Leu Thr
1235 1240 1245
Ala Ser His Gly Ala Thr Pro Asp Met Phe Asp Asp Tyr Ser Ser Ala
1250 1255 1260
Ser Lys Asn Thr Ile Ser Ala Asn Arg Phe Val Ala Gly Asp Asp Leu
1265 1270 1275 1280
Thr Asp Tyr Val Phe Ala Thr Ala Gly Lys Gly Gly Lys Ala Pro Tyr
1285 1290 1295
Val Ser Thr Phe Thr Ala Gly Gln Asn Ser Leu Ile Ser Gln Ala Asn
1300 1305 1310
Pro Val Cys Gln
1315
<210> 84
<211> 93
<212> PRT
<213> Microbulbifer degradans
<400> 84
Met Pro Pro Val Glu Leu Leu Asp Glu Leu Leu Asp Glu Glu Leu Asp
1 5 10 15
Glu Leu Leu Asp Glu Glu Leu Glu Glu Leu Glu Leu Leu Glu Glu Leu
20 25 30
Glu Pro Val Ala Leu Tyr Val Ser Asn Ser Ala Thr Val Gly Val Pro
35 40 45
Leu Glu Glu Val Ile Ser Lys Leu Ile Leu Leu Val Val Thr Ala Ser
50 55 60
Asn Val Ile Val Pro Thr Leu Glu Pro Ala Ala Asn Thr Ala Pro Val
65 70 75 80
Ser Leu Ser Val Thr Cys Gln Pro Leu Ile Leu Pro Ser
85 90
<210> 85
<211> 3951
<212> DNA
<213> Microbulbifer degradans
<400> 85
atgagaaata aattaggctc aatgttaaaa atgagcgcag ccattggcgg tttagttgca 60
gcgggttccg ctgttgcagg cccagttggt ttcgcaagtt taaacggcgg cactaccggc 120
ggcgcgggcg ggcaagttgt atatgctagc accggtgctg aaattaacca ggctatgtgt 180
aatcgcgcaa gcgacgatac accgctaatt atttatgtga cgggtaccat taaccacggt 240
aacaccgcca agtattctgg tagctgcgat accactgcag atgaaattca gtttaaaggt 300
gtaaaaaata tatcgttgat aggaacgggc agcggtgctg tgttcgatca aatcggtatt 360
cacctacgcg atacctcgaa tattattttg caaaatttgc atattaaaaa cgttaagaag 420
tctggttcgc ctacttcgaa tggcggtgac gctattggta tggaatctgg cgtatacaat 480
gtgtgggtag accactgtga gctagaagct tcaggcggtg aaagtgatgg atatgattca 540
ttgctagata tgaaagccac cacgcagtat gtaacggttt cttacactta ctatcacgat 600
tctggtcgcg gtggtttaat ggggtctagt gatagcgacg ataccaatac cttcgtcacc 660
ttccaccaca actactacga aaatatggat tcgcgcttgc cgttactgcg tcacggtaca 720
gctcatgcat ttaacaacta ctataatggt attgctaaat ctggcatgaa cccacgtata 780
ggtgggcaaa taaaagcgga aaacaattac ttcgaaaatg cgcacaaccc aattggtact 840
ttttatacag acgatatggg ttactgggac ttacgcggca atatatttgg cagtaacgta 900
acgtgggcgt ctgcggatga tgaaacccct gcaggcccga acccaacatc cactacgtct 960
attcatattt cttaccccta tgatctagat gacgctgctt gtgtgcctga tattgtaaaa 1020
tccacagcag gtgtgggtac tggcctagcg gtttcagacg gaagctgcac cataacaacg 1080
ccaccttcaa cgagttcgtc tagctccagt tctagctcaa cctcgtcgac tggttcgagt 1140
tcgtcttcaa gctcttcctc ttcaagcagc tctagctcca atggcggcag cttagtatta 1200
ggtaacaacc tttcaattgg tgctggctct gatggtagta gcaagggagc aggttcgtac 1260
ggcaatgtgc gcgatggcga tgtaagtagc tattgggcgc cgagtggcag tactggtcgt 1320
gtttcaatta aatggagcgg cagccaaact gttaacgcta ttgttattaa agaagcggca 1380
ggctatgaag gtaatattag tggttggcaa gtaactgata acgataccgg tgcagtattg 1440
gctgctggct caagcgtagg cacaattacg tttgatgcgg taacgactag caagatcaat 1500
ttcgaaatta cttcttctaa cggtacacca acggtagcgg aattcgaaac atataatgct 1560
acaggttcta gctcttcaag cagctcgagt tcttctagct cttcatcaag tagctcgtct 1620
agttcttcat cgagcagttc gtctagtagt tctacaggcg gcaccgctac tttaagtaca 1680
acggtttctg gcgatcaagt aacgttgaat tggagcgtaa ataatgcaac cgtaactggt 1740
cagcaaattt atcgcgatgt ggattcagac ccagctggcc gtgtgcgcat tgcatccggt 1800
gtaactggaa atacttacac agataccggt ttggctaacg gaacttatta ctactgggta 1860
aaagtaaccg attcaaactc agctacaatt aattccaact actcagaagc gcaagtgaat 1920
gtttatacaa catctactac aacgtttgaa gaggatgcgg gttattgctc ggtagacggt 1980
tcagtagata gcaataacag tggctttgct ggcagtggtt ttgctaatac cgataatgct 2040
tcgggtaatg gcgtaaacta cgcggtaagc gtacccgttg ctggtgtgta cacgctgcaa 2100
gtgcgttttg ctaatggctc aagtgcacgt cctgctgatg tgttagtgaa ctatggtaac 2160
gccggtgtat ttgatctgcc tagcacaggt tcttggacca gctggagcaa ctcaaacgaa 2220
attagcgtta acttagttgc tggcaataat attattcgtt tagaggctac cacgtctggc 2280
ggcttggcga atattgatag cctatctgta acgggtgtag agccttctgc aggtgactgt 2340
aacggtagtg ttggttctag cagttctagt tcttccagct cttctacttc tagcaccagt 2400
tcatctagca ctagctctgg tggcagctct actagctcaa gctcaacgtc tagctcatct 2460
acaagctcta catctagtag ttcaaccagc tctagcagca cgtcttctac ctcaagctct 2520
tcgggcggtg gtacggcaag ttgtgagcag ttgattaacg atccaagtgt taactgggat 2580
gagtctgcac tggcttcaga gcaagaaatt gtagcctgtt tggctcagtc tctaggtagc 2640
cctgttggct ttggggaagg tactaccggt ggttacgatc caagtggcgg cagcaacctt 2700
gttgttatta aaaagaacat aggtatttct gttgagcaac aaattttgga tgctataagc 2760
accgaaaacc acaactggat tgtgttcgac aaagatgatt ttgctgcgcg cactgcggta 2820
gcgatgtatc gcttagattg tgacaatgcc gatgtgcgtt cagcattggg tggcgcaagt 2880
gctgcacaat gtcgcgatca tatagcttgg tgttctgcta atggtatttc tgacgagcat 2940
gactgtgaaa atgaattctt taacaaccgt ttaaatgatt cagatttgcc aatccgcaat 3000
caaatgattc agtcaaacac taccattgat ggtcgtggtg caaacgcata cttcttcttt 3060
aatggtttct ccattggtaa agatagcagt ggtgcaagct tgtacgcagc gcaaaatgtg 3120
attgtaacga ataacgagtt tattggtgcc ggtcacactg aagatcacga tctagaccca 3180
gatatgattc gatctactgg cgaatcgaac aaaatttgga ttcaccaaaa cacgttcgac 3240
catactggtg attctgcgtt tgacgtaaag gtgggtgctt acgatataac aatatcattc 3300
aataagttgg tgaacgtgaa gcgtgctgcg ctacatggtt caagtgatag ccgagcaatt 3360
aactcgcaaa tcacaaccac tatgcacaac aacctgttct atacttcaga tgatcaatac 3420
gcgctaagta cctacgacac tttgcgtcgt gtaccgctaa tgcgtcgcgg tcaatcacac 3480
atgtttaaca acgttttcta cggttaccgt aaagatattc taagcgtgcg tgttggcggt 3540
cgtatcgcct ttgaagataa cattattttg aataaagaaa gcagctctac cccaggtgat 3600
ggcctgaaga aaggcgacga catggaatac tatgttgaaa ccttgttgcg cgacttccgt 3660
gagggtgggt tagaaattag cggtagctat gtatcgtttg cagatagcgc ttgtaattcc 3720
tatggcgcat cgggtgactt aaccgcatcg catggtgcta cgccagatat gtttgatgat 3780
tacagctctg catctaaaaa tactatatca gccaatcgct ttgttgctgg cgatgactta 3840
actgactatg tatttgctac tgcaggtaag ggcggtaaag cgccttatgt ttccaccttt 3900
actgctgggc aaaatagcct tatttcacag gctaacccag tttgtcagta g 3951
<210> 86
<211> 427
<212> PRT
<213> Microbulbifer degradans
<400> 86
Met Asn Lys Asn Asn Val Ile Ala Tyr Leu Leu Ile Ser Thr Phe Leu
1 5 10 15
Leu Phe Ser Ala Thr Val Phe Ala Val Lys Pro Ser Asn Ala Glu Thr
20 25 30
Arg Tyr Ser Ala Met Gly Ala Asp Thr Pro Ala Gly Leu Gly Gly Thr
35 40 45
Leu Pro Asp Gly Gln Ser Arg Ile Val Arg Val Thr Asn Leu Asn Ala
50 55 60
Ser Gly Glu Gly Ser Leu Ala Trp Ala Leu Gly Leu Ala Arg Pro Arg
65 70 75 80
Val Val Val Phe Glu Val Gly Gly Val Ile Asp Leu Ala Gly Gln Ser
85 90 95
Ile Thr Val Thr Gln Pro Phe Leu Thr Val Ala Gly Gln Ser Ala Pro
100 105 110
Ala Pro Gly Ile Thr Leu Ile Arg Gly Gly Leu Asn Ile Arg Thr His
115 120 125
Asp Val Arg Val Gln His Ile Arg Val Arg Pro Gly Asp Asn Leu Gln
130 135 140
Pro Lys Arg Ser Gly Trp Glu Ser Asp Gly Ile Ser Val Ala Gly Glu
145 150 155 160
Asn Ala Lys Asp Val His Ile Asp His Val Ser Val Ser Trp Ala Val
165 170 175
Asp Glu Asn Leu Ser Ala Ser Gly Asn Arg Tyr Lys Gly Tyr Gly Gln
180 185 190
Thr Ala Glu Arg Val Thr Phe Ser Asn Asn Leu Ile Ala Glu Ala Leu
195 200 205
Asp Tyr Ala Ser His Lys Lys Gly Lys His Ser Lys Gly Leu Leu Val
210 215 220
His Asp Tyr Val Arg Asp Val Ala Val Val Arg Asn Leu Phe Val Ser
225 230 235 240
Asn Asp Arg Arg Asn Pro Tyr Phe Lys Ala His Thr Ile Gly Phe Val
245 250 255
Ala Asn Asn Ile Ile Tyr Asn Ala Gly Asn Ala Ala Ile Gln Val Asn
260 265 270
Tyr Ile Glu Arg Glu Trp Glu Gly Gln Ser Thr Gly Pro Ala Asn Ala
275 280 285
Arg Val Ala Val Val Asn Asn Gln Leu Val Tyr Gly Arg Asp Thr Tyr
290 295 300
Ser Asp Leu Ala Leu Val Ser Val Arg Gly Asp Ala Tyr Leu Thr Gly
305 310 315 320
Asn Ser Val Thr Asn Leu Met Gly Glu Pro Met Pro Ile Thr Glu Gly
325 330 335
Ala Val Asn Ser Leu Ala Ser Ala Pro Ser Trp Leu Thr Gly Tyr Glu
340 345 350
Leu Trp Asp Ala Asp Glu Met Arg Glu Leu Leu Val Ala Ser Val Gly
355 360 365
Ala Thr Pro Trp Ala Arg Asp Ala Ile Asp Thr Arg Ile Ile Asn Gly
370 375 380
Val Ala Thr Gly Lys Ala Arg Ile Ile Asp Ser Gln Gln Asp Val Gly
385 390 395 400
Gly Tyr Pro Ser Tyr Lys Gln Thr Asn Lys Lys Phe Asp Ile Pro Asp
405 410 415
Asp Lys Ile Ala Glu Trp Leu Leu Gly Tyr Leu
420 425
<210> 87
<211> 1284
<212> DNA
<213> Microbulbifer degradans
<400> 87
atgaataaaa ataatgtaat tgcttatctg ctaatttcaa cttttctatt attttctgcg 60
actgtgttcg cggttaaacc cagcaacgct gaaacccgat attccgcaat gggtgcagat 120
accccagcag gtttgggagg cactttgcca gatggtcagt ctcgtatcgt tagggtgact 180
aatttaaatg caagtgggga gggctcgctc gcatgggcgc tgggtttagc tcgaccacgc 240
gtagtggtgt tcgaagttgg cggtgttata gatcttgctg ggcaaagtat taccgtcacg 300
cagccattcc ttactgttgc cggtcagtcg gcgccagcac cgggtattac attaattcgc 360
ggcggtttaa atatacgaac ccacgatgta agagtgcagc atattcgcgt gcggccggga 420
gataacttac aaccaaagcg ctccggctgg gaaagtgacg gtatatctgt ggccggtgaa 480
aatgccaaag atgtacatat agatcatgta tcggtaagtt gggcggtaga tgaaaacctc 540
tccgcttcgg ggaatcgtta caaaggttac ggtcaaaccg ctgagcgggt aacgtttagt 600
aataatctca ttgccgaagc gttagattat gccagccata aaaaaggcaa acactctaag 660
ggattattgg tacacgatta tgtgcgagat gttgccgtag ttagaaattt gtttgtgtct 720
aatgatcgtc gcaacccgta ctttaaagcg cacaccatag gttttgtagc aaataatatt 780
atttacaatg cgggtaatgc cgctatacag gttaactata ttgagcgtga gtgggagggc 840
cagagtacag gcccagctaa tgctagagta gcggtggtaa ataaccagtt agtttacggc 900
cgcgatacat actcagactt ggcgctagtg tctgtgcgtg gggatgctta tttgacgggt 960
aatagcgtta caaatttaat gggcgagccc atgcctatta cggaaggggc ggttaattct 1020
ttagcctctg caccttcatg gttaacaggt tatgagttgt gggatgctga cgagatgcgt 1080
gagctgctag tagccagtgt tggtgcaaca ccctgggcca gggatgcgat agatacccga 1140
ataattaatg gggtggcaac ggggaaggcg cgaataatag atagccagca agatgtgggt 1200
ggctacccga gctataagca aacaaataaa aaatttgata taccagacga caaaattgcc 1260
gaatggttac tgggttacct gtaa 1284
<210> 88
<211> 769
<212> PRT
<213> Microbulbifer degradans
<400> 88
Met Arg Asn Thr Lys His Leu Leu Asn Ser Gly Ala Val Leu Leu Ala
1 5 10 15
Ser Ser Ile Ala Thr Ala Ala Met Ala Gly Pro Val Gly Phe Ala Ser
20 25 30
Leu Asn Gly Gly Thr Thr Gly Gly Gln Gly Gly Gln Val Val Tyr Ala
35 40 45
Asn Thr Gly Thr Gln Ile Asn Glu Ala Met Cys Asn Arg Pro Ser His
50 55 60
Asp Thr Pro Leu Ile Ile Tyr Val Ser Gly Thr Ile Asn His Gly Asn
65 70 75 80
Thr Glu Lys Val Ser Gly Asn Cys Asp Thr Thr Gly Asp Glu Ile Gln
85 90 95
Phe Lys Lys Val Lys Asn Leu Ser Leu Ile Gly Thr Gly Asn Gly Ala
100 105 110
Val Phe Asp Gln Ile Gly Ile His Leu Arg Glu Thr Ser Asn Ile Ile
115 120 125
Leu Gln Asn Leu His Ile Lys Asn Val Lys Lys Ser Gly Ser Pro Thr
130 135 140
Ser Asn Gly Gly Asp Ala Ile Gly Met Glu Ser Gly Val Tyr Asn Val
145 150 155 160
Trp Val Asp His Cys Glu Leu Glu Ala Ser Gly Gly Glu Lys Asp Gly
165 170 175
Tyr Asp Ser Leu Leu Asp Met Lys Ala Thr Thr Gln Tyr Val Thr Val
180 185 190
Ser Tyr Thr Tyr Tyr His Asp Ser Gly Arg Gly Gly Leu Met Gly Ser
195 200 205
Ser Asp Ser Asp Asp Thr Asn Thr Tyr Val Thr Phe His His Asn Tyr
210 215 220
Tyr Lys Asn Met Asp Ser Arg Leu Pro Leu Leu Arg His Gly Thr Ala
225 230 235 240
His Ala Phe Asn Asn Tyr Tyr Asp Gly Ile Thr Lys Ser Gly Met Asn
245 250 255
Pro Arg Ile Gly Gly Gln Ile Lys Ala Glu Asn Asn Tyr Phe Glu Asn
260 265 270
Ala His Asn Pro Ile Gly Thr Phe Tyr Thr Asn Asp Met Gly Tyr Trp
275 280 285
Asp Leu Ser Gly Asn Ile Phe Gly Asn Asn Val Thr Trp Ala Ser Ala
290 295 300
Asp Asp Glu Thr Pro Ala Gly Pro Asn Pro Gln Ser Thr Thr Ser Ile
305 310 315 320
His Ile Ser Tyr Pro Tyr Ser Leu Asp Asp Ala Thr Cys Val Pro Lys
325 330 335
Ile Val Lys Ala Thr Ala Gly Val Gly Asn Gly Leu Ala Val Ser Thr
340 345 350
Gly Gly Ser Asn Cys Gly Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser
355 360 365
Ser Ser Ser Thr Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Ser Ser
370 375 380
Ser Asn Ser Ser Ser Gly Gly Ser Gly Val Asn Leu Ser Ile Gly Ala
385 390 395 400
Gly Ser Asp Gly Ser Ser Lys Gly Ala Gly Ser Tyr Gly Asp Val Arg
405 410 415
Asp Gly Asn Met Ser Thr Tyr Trp Ala Pro Ser Gly Ser Thr Gly Arg
420 425 430
Val Ser Ile Lys Trp Ser Ser Ala Thr Thr Val Ser Ser Ile Val Ile
435 440 445
Lys Glu Ala Ala Gly Phe Glu Gly Asn Ile Thr Gly Trp Gln Val Val
450 455 460
Asn Asn Glu Asn Gly Ala Val Leu Lys Ser Gly Ser Asn Ala Gly Val
465 470 475 480
Ile Ser Phe Ser Pro Val Ser Thr Thr Lys Leu Asn Phe Glu Ile Thr
485 490 495
Ser Ser Asn Gly Met Pro Thr Val Ala Glu Phe Glu Thr Tyr Ser Gly
500 505 510
Thr Val Gly Gly Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
515 520 525
Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Gly Gly Ser Ala
530 535 540
Asn Leu Gly Thr Ser Val Ser Gly Asp Gln Val Ser Leu Asn Trp Ser
545 550 555 560
Thr Ser Asn Ile Asp Val Gly Ser Gln Gln Val Tyr Arg Asp Thr Asp
565 570 575
Ser Asn Pro Ser Gly Arg Val Arg Ile Ser Ala Gly Val Ser Gly Asn
580 585 590
Ser Tyr Thr Asp Tyr Gly Leu Ala Ser Gly Thr Tyr Tyr Tyr Trp Ile
595 600 605
Lys Ile Thr Asp Gln Asn Gly Val Val Tyr Asn Thr Asn Ala Ala Glu
610 615 620
Ala Val Val Gly Ser Gln Ala Pro Thr Thr Phe Thr Ala Gln Glu Ser
625 630 635 640
Ala Gly Phe Cys Ser Val Asn Gly Ser Val Asp Ser Asn Asn Ala Gly
645 650 655
Tyr Thr Gly Asp Gly Phe Val Asn Thr Asp Asn Ala Ser Gly Asn Ala
660 665 670
Ala Val Tyr Ala Phe Asn Ala Pro Ser Ala Gly Met Tyr Ser Leu Gln
675 680 685
Val Arg Tyr Ala Asn Gly Ser Ser Ala Arg Pro Gly Asp Val Leu Val
690 695 700
Asn Ala Gly Asn Ile Gly Thr Phe Asp Phe Ser Ser Thr Gly Ser Trp
705 710 715 720
Thr Ser Trp Ala Asn Ser Asn Glu Leu Ser Ala Tyr Phe Ser Ala Gly
725 730 735
Asn Asn Thr Val Arg Ile Gln Ala Thr Asn Ser Gly Gly Leu Pro Asn
740 745 750
Leu Asp Ser Val Ser Val Thr Gly Asn Ala Pro Ala Ala Gly Asn Cys
755 760 765
Asn
<210> 89
<211> 2310
<212> DNA
<213> Microbulbifer degradans
<400> 89
atgagaaaca ctaaacacct actcaatagc ggtgccgtat tgctagctag cagtattgca 60
acagctgcga tggcggggcc tgtgggcttt gcatcgctta atggtggcac aacgggcggt 120
caaggcggtc aagttgtata cgccaacacc ggtacacaaa ttaacgaagc catgtgtaac 180
cgcccatctc acgatacgcc gttaattatt tatgtatccg gtaccattaa ccacggcaac 240
accgaaaagg tgtcgggtaa ttgcgataca accggcgacg agattcagtt taaaaaagtt 300
aaaaacctat cgttaattgg tactggtaac ggtgcggtgt ttgatcaaat aggtattcat 360
ttacgcgaaa cctccaatat tattctgcaa aaccttcata ttaaaaatgt taaaaaatcg 420
ggttctccaa cctccaatgg cggcgatgca attggaatgg agtctggcgt atacaatgtg 480
tgggtggatc actgtgagct agaagcatcc ggtggtgaaa aagatggtta cgattcattg 540
ctagatatga aagcaaccac gcagtatgta accgtttctt acacctatta ccacgattca 600
ggccgcggtg gtttaatggg gtcgagcgat agtgacgata ccaacaccta cgtgactttc 660
caccacaatt actacaaaaa tatggattca cgcttaccgc ttttacgcca cggtactgcg 720
catgccttta acaactatta cgatggcatt accaaatctg gtatgaaccc ccgtataggc 780
ggtcaaataa aagcagaaaa taactatttc gaaaacgcac acaacccaat aggtacgttt 840
tacacaaacg atatgggtta ctgggactta agcggcaata tatttggcaa caacgtaacg 900
tgggcgtctg cggatgatga aacccctgca gggccgaatc cacaatccac aacgtccatt 960
catatttctt acccctacag cttggatgac gcaacgtgcg tgccgaagat tgtaaaagct 1020
actgcgggtg tgggtaacgg tttggctgtg tctaccggtg gtagcaattg cggtacttct 1080
agctcttcgt ctagcagctc ttcgtccagc tctacctcgt ctaccagttc tacatctagc 1140
agttcttcat catccaatag ttcttctggt ggctcaggtg taaacctttc aattggcgca 1200
ggttctgatg gtagcagtaa aggtgcgggc tcttatggcg atgtgcgcga tggcaatatg 1260
agcacctact gggcaccgag tggcagcact ggtcgcgtat ctattaagtg gagttcggca 1320
acgacggtaa gtagcattgt tattaaagaa gcggctggct ttgaaggtaa cattactggc 1380
tggcaagttg taaacaacga gaatggcgca gtattaaaaa gcggctctaa cgctggcgta 1440
atttcttttt ctccggtttc tactacgaag ttaaatttcg aaattacctc ttcgaacggc 1500
atgcccacgg ttgcagaatt tgaaacctat agcggcacgg ttggtggtac ttcgtcatcc 1560
agttcttcaa gtagttcgtc aagcagttct tcaagcagct caagttcaac gagcagctct 1620
ggtggttccg cgaatctagg tacctcggta agtggcgatc aggtttcact taattggtct 1680
acctcaaaca ttgatgtggg ttcgcagcaa gtttatcgcg ataccgattc aaacccatct 1740
ggtcgtgtgc gtatctctgc aggcgtttct ggtaattcgt ataccgatta cggcttagct 1800
agcggtactt attactactg gataaaaatt accgatcaaa acggtgttgt ttacaacaca 1860
aatgccgcag aagcggttgt aggcagccaa gcgccaacga cgtttactgc gcaagagtct 1920
gcgggtttct gctctgttaa cggttctgtt gattccaata acgctggcta cactggcgat 1980
ggttttgtaa ataccgataa cgccagtggc aatgcagcag tttatgcctt caatgcacct 2040
agtgcgggta tgtatagctt gcaggttcgc tacgcaaacg gttctagtgc acgcccaggt 2100
gatgtgctgg tcaacgctgg caatattgga acatttgatt tttccagtac cggttcttgg 2160
acatcttggg caaacagtaa tgagttaagt gcgtacttct ctgcgggtaa caatactgtt 2220
cgtattcaag ctactaactc tggcggctta cctaacttag atagtgtttc tgtaacgggt 2280
aatgcaccag cggcaggcaa ctgtaattaa 2310
<210> 90
<211> 594
<212> PRT
<213> Microbulbifer degradans
<400> 90
Met Lys Ile Phe Lys Leu Leu Leu Met Phe Val Leu Ala His Asn Leu
1 5 10 15
Val Ala Cys Gly Gly Ser Asn Asp Gly Gly Glu Ile Glu Leu Asn Phe
20 25 30
Gly Glu Glu Asn Thr Pro Glu Pro Glu Thr Glu Pro Glu Ala Glu Pro
35 40 45
Glu Gly Glu Pro Glu Gly Glu Pro Glu Gly Glu Pro Glu Gly Glu Pro
50 55 60
Glu Gly Glu Pro Glu Gly Glu Thr Ala Asp Ala Thr Ala Asp Ala Gly
65 70 75 80
Phe Ala Gly His Asn Phe Asn Leu Thr Gly Gly Glu Gly Gly Thr Ala
85 90 95
Tyr Thr Val Asn Asn Gly Lys Asp Leu Gln Thr Val Leu Asp Asn Ala
100 105 110
Lys Ser Ser Asn Ser Pro Val Ile Ile Tyr Val Asp Gly Thr Ile Asn
115 120 125
Ser Phe Asn Ser Ala Asn Gly Asn Gln Pro Ile Gln Ile Lys Asp Met
130 135 140
Asp Asn Val Ser Ile Ile Gly Tyr Gly Ala Glu Ala Thr Phe Asp Gly
145 150 155 160
Val Gly Ile Ala Ile Arg Arg Ala Asn Asn Ile Ile Ile Arg Asn Leu
165 170 175
Thr Phe Lys Ser Val Leu Thr Glu Gly Lys Asp Ala Ile Ser Ile Glu
180 185 190
Gly Asp Asp Asp Gly Ser Thr Thr Ser Asn Ile Trp Val Asp His Asn
195 200 205
Glu Phe Tyr Ser Ala Pro Thr Ala Asp Lys Asp Phe Tyr Asp Gly Leu
210 215 220
Ile Asp Ser Lys Ser Gly Ala Ser Asn Ile Thr Ile Ser Tyr Asn Tyr
225 230 235 240
Leu His Asp His Trp Lys Ala Ser Leu His Gly His Thr Glu Asn Asp
245 250 255
Glu Gly Ala His Asn Thr Asp Arg Lys Ile Thr Phe His His Asn Arg
260 265 270
Phe Glu Asn Ile Glu Ser Arg Leu Pro Leu Phe Arg Arg Gly Val Gly
275 280 285
His Leu Tyr Asn Asn Tyr Tyr Lys Asp Val Gly Ser Thr Ala Ile Asn
290 295 300
Ser Arg Ile Gly Ala Glu Leu Leu Ile Glu Asn Asn Val Phe Glu Asp
305 310 315 320
Ser Gln Asn Pro Ile Val Ser Phe Tyr Ser Asp Val Ile Gly Tyr Trp
325 330 335
Asn Thr Ser Gly Asn Leu Phe Thr Asn Val Thr Trp Thr Thr Pro Gly
340 345 350
Thr Gly Glu Val Ser Ala Gly Ala Thr Gln Thr Pro Thr Ser Asp Tyr
355 360 365
Val Val Pro Tyr Ser Tyr Thr Leu Met Pro Ala Ala Asp Val Lys Ala
370 375 380
His Val Ile Ala Ser Ala Gly Val Gly Lys Ile Asp Gln Thr Gly Leu
385 390 395 400
Thr Ile Pro Asp Pro Val Thr Pro Glu Gly Asp Leu Gly Glu Pro Glu
405 410 415
Ala Pro Val Gln Gly Asp Val Ser Leu Pro Tyr Thr Glu Asn Phe Ala
420 425 430
Ala Thr Asp Ala Ala Asn Phe Phe Ser Ala Ala Tyr Arg Asp Ile Thr
435 440 445
Gly Ser Ala Gly Thr Ser Thr Pro Met Tyr His Arg Val Thr Gly Thr
450 455 460
Val Glu Ile Asn Ala Gln Gln Leu Asp Met Thr Gly Ala Arg Val Ser
465 470 475 480
Ile Gly Asn Thr Thr Pro Ser Val Ser Thr Thr Gly Ala Asp Thr Thr
485 490 495
Thr Thr Gly Val Leu Asp Leu Ser Ala Pro Tyr Thr Val Ser Phe Lys
500 505 510
Val Val Ser Val Gly Gly Thr Leu Thr Lys Lys Phe Gln Ile Tyr Val
515 520 525
Asp Asn Asn Thr Ser Ala Ser Gly Asp Ser Ile His Gly Gly Ser Ser
530 535 540
Arg Phe Tyr Ser Glu Thr Leu Asp Ser Leu Val Ala Gly Gln Thr Tyr
545 550 555 560
Thr Val Thr Gly Phe Thr Ala Thr Asn Ser Ser Phe Ile Thr Leu Arg
565 570 575
Thr Glu Ser Ser Gly Gln Ile Val Leu Asp Asp Leu Ser Ile Gln Ala
580 585 590
Ala Glu
<210> 91
<211> 1785
<212> DNA
<213> Microbulbifer degradans
<400> 91
atgaaaattt ttaaattgtt attaatgttt gtactcgccc acaacttagt tgcttgtggt 60
ggcagtaacg acggtggtga aattgaatta aactttggcg aagaaaacac accagagcca 120
gaaaccgaac cagaagctga gcctgaagga gaaccagagg gcgagccgga gggagaacct 180
gaaggagagc ctgaagggga accagagggc gaaacagccg acgcaaccgc agatgctggc 240
ttcgccggcc acaattttaa tcttaccggt ggcgaaggcg gcacagccta taccgttaat 300
aacggcaaag atttgcaaac agttttagac aacgccaaat cgagtaattc accggtcatt 360
atttacgtag acggcaccat aaattcgttt aactctgcca acggcaacca gcctattcaa 420
attaaagata tggataacgt atctataatt ggttacggcg ccgaagcaac atttgacggt 480
gttggtatag caatacgccg cgccaacaac attattattc gcaaccttac ttttaaaagc 540
gtccttaccg aaggtaaaga tgcaattagt atagaaggtg atgacgacgg cagcaccacg 600
tcaaacattt gggttgatca caacgaattc tacagcgccc caacggcaga caaagatttt 660
tacgacggtt taatcgatag taaaagcggc gcgagcaaca ttactatttc ttacaactac 720
ctgcacgacc attggaaagc atcgttacac ggccataccg aaaatgacga aggtgcacac 780
aacaccgacc gcaaaattac tttccaccac aaccgttttg agaatattga atcgcgttta 840
ccgctgttcc gtcgcggtgt aggccatttg tacaataact actacaaaga cgtaggctca 900
acggctatca actcacgtat tggtgccgag ttattaattg agaataacgt ttttgaagat 960
tcacaaaacc cgattgtctc tttttactct gacgtaattg ggtactggaa cacctcaggc 1020
aacctcttca ccaatgtaac ttggacaacc ccaggtactg gcgaagtatc tgcaggcgca 1080
acacaaacgc caacctcaga ttacgtagtg ccatacagct acacgcttat gccggcagcc 1140
gatgtaaaag cccacgtcat tgcgagtgca ggcgttggca aaatagacca gacagggctt 1200
accattccag accccgttac ccctgaaggc gacctaggtg aaccagaagc cccagtgcaa 1260
ggtgatgtaa gcctacctta cactgaaaat tttgccgcca ctgacgccgc caatttcttt 1320
agcgccgcgt accgcgatat tactggctct gctggcacca gcacacccat gtaccaccgc 1380
gtaaccggca cggtggaaat taacgcacag caattggata tgactggcgc acgcgtatca 1440
attggcaaca caacgccaag tgtaagcaca accggtgcag acaccactac aacgggcgta 1500
ttagatttaa gcgcgcccta caccgtaagc tttaaagtgg taagcgtagg cggcacccta 1560
actaagaaat ttcaaatata tgtagacaac aatacctctg ccagcggcga ctctattcac 1620
ggcggctcat cgcgctttta cagtgaaact ttagactcgc tagttgcagg ccaaacctac 1680
acagtaaccg gctttaccgc caccaacagc tctttcataa cattacgtac cgaaagtagc 1740
ggccaaattg tattagatga cctaagtatt caagccgcag aataa 1785
<210> 92
<211> 425
<212> PRT
<213> Microbulbifer degradans
<400> 92
Met Phe Lys Tyr Ala Leu Tyr Val Val Ala Leu Val Ala Gly Val Val
1 5 10 15
Val Ser Leu Ala Ala Cys Ser Lys Arg Ala Thr Gln Gln Val Glu Thr
20 25 30
Glu Phe Tyr Glu Ile Asn Glu Arg Gly Gly Asp Asp Gly Arg Leu Leu
35 40 45
Arg Val Val Asn Leu Asn Asn Gln Gly Val Gly Ser Leu Arg Trp Ala
50 55 60
Leu Ala Gln Thr Gly Ala Arg Lys Ile Ile Phe Asp Val Gly Gly Val
65 70 75 80
Ile Asp Leu Glu Glu Lys Ser Leu Lys Ile Arg Glu Ala His Val Thr
85 90 95
Ile Ala Gly Glu Thr Ala Pro Ser Pro Gly Ile Thr Leu Ile Lys Gly
100 105 110
Gly Leu Arg Ile Glu Thr His Asn Val Lys Val Ser His Leu Met Ile
115 120 125
Arg Pro Gly Asp Ala Gly Tyr Ser Lys Gly Gln Gly Trp Lys Pro Asp
130 135 140
Gly Ile Thr Ile Tyr Gly Ser Lys Ala Arg His Val Val Ile Asp His
145 150 155 160
Cys Ser Val Thr Trp Ala Val Asp Glu Asn Ile Ala Val Ser Gly Pro
165 170 175
Ala Asp Lys Gly Ala Glu Ala Thr Ala Gly Lys Val Leu Ile Arg Asn
180 185 190
Ser Ile Ile Ala Glu Ala Leu Ser Asn Ala Ser His Pro Glu Gly Glu
195 200 205
His Ser Lys Gly Ile Leu Ile His Asn Asn Val Gln His Val Ser Leu
210 215 220
Val Asn Asn Leu Leu Ala His Asn Arg Arg Arg Asn Pro Tyr Phe Lys
225 230 235 240
Ala Gly Thr Thr Gly Ile Val Ile Gly Asn Ile Ile Tyr Asn Pro Gly
245 250 255
Lys Arg Ala Ile His Met Ser Ser Gly Arg Ala Asp Ala Ser Leu Pro
260 265 270
Thr Leu Ser Ile Thr Gly Asn Leu Phe Ile Pro Ala Ala Asn Thr Ser
275 280 285
Pro Asn Leu Ser Leu Ile Ser Asn Tyr Gly Lys Ile Tyr Ser Ser Gly
290 295 300
Asn Leu Val Gln Gly Glu Ser Arg Pro Ile Thr Asp Gly Lys Ser Ile
305 310 315 320
Ser Leu Thr Ala Pro Pro Leu Gln Gln Val Gly Ile Asn Leu Thr Asp
325 330 335
Thr Gly Thr Gln Asn Asp Phe Cys Gln Thr Leu Ser Asn Ala Gly Ala
340 345 350
Arg Pro Trp Asp Pro Asp Pro Ile Asp Ile Arg Ile Lys Thr Gln Leu
355 360 365
Leu Ala Gly Glu Gly Arg Ile Ile Asp Ser Gln Ser Glu Val Gly Gly
370 375 380
Tyr Pro Ile His Asn Val Asn Asn Lys Glu Thr Ala Glu Ala Gly Ser
385 390 395 400
Thr Gln Gly Asp Gly Met Leu Gln Phe Asp Thr Glu Leu Leu Arg Lys
405 410 415
Ile Pro Asn Leu Cys Ser Gly Met Met
420 425
<210> 93
<211> 1278
<212> DNA
<213> Microbulbifer degradans
<400> 93
atgtttaagt atgcgctata tgttgtggcc ttggtggctg gcgtagtggt tagcttagca 60
gcctgcagca agagagctac gcagcaagta gagactgaat tctacgagat taatgagcgg 120
ggcggagatg atggtcgcct gctacgcgtt gtgaatttaa ataatcaggg ggttggctct 180
ttgcgctggg cgttggcgca gacaggtgct agaaaaataa ttttcgatgt agggggggtg 240
atagatctag aagaaaaatc gctcaagatt cgtgaagccc atgtgacaat cgctggagag 300
acggccccat caccgggtat cacccttatc aagggcggac taagaataga aacccacaat 360
gtcaaagttt cgcaccttat gattaggcct ggtgacgcag ggtactccaa aggtcaaggt 420
tggaaacccg acggcataac tatatatggc agcaaagcga ggcatgttgt tattgatcat 480
tgctcggtta catgggctgt cgacgaaaat atcgcagtat ctggcccagc agataagggg 540
gcagaggcta ccgcgggtaa ggttcttatt cgcaattcaa ttattgccga agcgctaagc 600
aatgcatccc acccagaggg ggagcattct aagggcatac tcatacacaa caatgtgcag 660
catgtaagct tggttaacaa tttgttggct cacaataggc gaagaaaccc ttattttaag 720
gcgggtacaa cgggaattgt aattggcaat ataatttata acccagggaa acgtgctatt 780
catatgtctt cgggccgtgc tgatgctagc ctgccaacac tgtcaattac agggaatttg 840
tttattccag cagctaatac ttcccccaac cttagtttga taagcaacta tggaaaaatt 900
tattcgagtg gaaacttggt gcagggggag agcaggccga taactgatgg taaaagtatt 960
tcattgactg cgccgccgct acagcaagta ggcataaatt taaccgatac aggtacacaa 1020
aacgactttt gccaaaccct aagcaacgca ggtgcccgcc cgtgggaccc cgacccgatt 1080
gatattagaa taaaaacaca acttttggct ggcgaaggac gaattattga tagccagagc 1140
gaggttggag gctacccaat acataacgtc aataacaaag aaaccgctga agcaggctct 1200
acgcaggggg atggtatgtt gcagtttgat actgagttat tgagaaaaat accaaacctg 1260
tgtagtggca tgatgtaa 1278
<210> 94
<211> 772
<212> PRT
<213> Microbulbifer degradans
<400> 94
Met Arg Asp Ile Thr Met Lys Asn Asn Lys Phe Arg Ser Ser Phe Thr
1 5 10 15
Leu Lys Lys Leu Thr Pro Phe Phe Val Ala Gly Thr Met Leu Gly Gly
20 25 30
Ser Asn Ala Trp Ala Gly Cys Asp Tyr Thr Val Thr Asn Gln Trp Gly
35 40 45
Ser Gly Phe Thr Gly Asn Val Arg Ile Thr Asn Ser Gly Asn Thr Pro
50 55 60
Thr Asn Gly Trp Ala Val Asn Trp Gln Tyr Ala Gly Asp Asn Arg Ile
65 70 75 80
Ser Asn Ser Trp Gly Ala Gln Leu Ser Gly Ser Asn Pro Tyr Ser Ala
85 90 95
Thr Ala Glu Ser Trp Asn Ala Val Ile Gln Pro Ser Gln Ser Ile Glu
100 105 110
Ile Gly Phe Gln Gly Thr Gly Asp Gly Asn Glu Ile Pro Thr Ile Asn
115 120 125
Gly Asp Val Cys Gln Thr Ser Ser Gly Ser Thr Ser Ser Ser Ser Ser
130 135 140
Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser
145 150 155 160
Ser Asn Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
165 170 175
Ser Gly Ser Thr Thr Gly Tyr Ile His Ile Glu Glu Asn Glu Leu Gly
180 185 190
Phe Cys Tyr Val Gln Gly Ser Ile Asp Ser Asn Asn Gly Gly Phe Thr
195 200 205
Gly Thr Gly Phe Ala Asn Thr Asp Asn Val Asn Gly Ser Gln Ile Asn
210 215 220
Trp Lys Val Asn Val Asp Phe Asp Gly Tyr Tyr Ala Leu Glu Trp Arg
225 230 235 240
Tyr Ala Asn Gly Ser Gly Thr Ala Arg Thr Ala Ser Val Ser Ala Asn
245 250 255
Gly Ala Gln Ser Glu Ile Ser Phe Pro Thr Thr Gly Ser Trp Asp Ser
260 265 270
Trp Leu Leu Asp Ser Thr Thr Leu Phe Leu Lys Ala Gly Val Asn Asp
275 280 285
Val Ile Leu Ser Ala Asn Thr Ser Ser Gly Leu Ala Asn Ile Asp Ser
290 295 300
Leu Thr Val His Gly Asp Gly Val Ala Ala Ala Asp Cys Asn Thr Asp
305 310 315 320
Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr
325 330 335
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser
340 345 350
Gly Gly Pro Gln Ile Leu Lys Ala Phe Pro Thr Ala Glu Gly Tyr Gly
355 360 365
Lys Ile Thr Ala Gly Gly Arg Gly Gly Asp Val Tyr Ile Val Thr Asn
370 375 380
Leu Asn Asp Ser Gly Ala Gly Ser Leu Arg Gln Ala Val Glu Ala Ser
385 390 395 400
Gly Pro Arg Thr Val Val Phe Glu Val Ser Gly Thr Ile Thr Leu Asn
405 410 415
Lys Pro Leu Thr Ile Lys Asn Asn Asn Ile Thr Ile Ala Gly Gln Thr
420 425 430
Ala Pro Gly Asp Gly Ile Thr Leu Arg Lys His Asn Phe Ser Ile Gln
435 440 445
Ala Asp Asp Val Ile Val Arg Tyr Ile Arg Val Arg Phe Gly Asp Glu
450 455 460
Thr Leu Thr Asp Ser Asp Ala Ile Ser Met Arg Tyr Gln Lys Asn Ile
465 470 475 480
Ile Leu Asp His Val Ser Ala Ser Trp Gly Asp Asp Glu Thr Leu Ser
485 490 495
Leu Tyr His Gly Glu Asn Ile Thr Val Gln Trp Ser Met Ile Thr Glu
500 505 510
Thr Leu Asn Arg Gly Gly Glu His Ala Phe Ala Ala Ile Trp Gly Ser
515 520 525
Pro Phe Ser Thr Phe His His Asn Leu Ile Ala His Asn Val Ala Arg
530 535 540
Asn Val Arg Phe Ala Ser Gly Ser Gly Tyr Thr Asp Tyr Arg Asn Asn
545 550 555 560
Val Val Tyr Asn Trp Gly Tyr Ser Ser Thr His Gly Gly Glu Ala Gln
565 570 575
Gln Val Gly Asn Ala Asn Phe Asn Phe Thr Thr Val Asn Met Val Gly
580 585 590
Asn Tyr Tyr Lys Pro Gly Pro Arg Thr Glu Ser Gly Val Arg Ser Arg
595 600 605
Leu Leu Thr Pro Asn Thr Arg Asn Gly Asp Ala Asp Leu Gly Ser Phe
610 615 620
Tyr Val Ser Gly Asn His Met Val Gly Ser Pro Asn Val Thr Ala Asp
625 630 635 640
Asn Ser Ile Gly Val Ser Asn Lys Asn Ala Leu Ile Ser Ser Pro Trp
645 650 655
Asn Ser Met Lys Ile Glu Gly Glu Gln Thr Ala Glu Gln Ala Tyr Glu
660 665 670
Ser Val Leu Ala Tyr Ala Gly Ala Ser Lys Val Arg Asp Ser Val Asp
675 680 685
Thr Arg Ile Ile Glu Glu Val Arg Thr Gly Thr Ala Thr Tyr Gly Gly
690 695 700
Asn Gly Ile Ile Glu Ser Gln Asn Glu Val Gly Gly Trp Pro Gln Leu
705 710 715 720
Arg Ser Glu Thr Pro Pro Gln Asp Ser Asp Arg Asp Gly Met Pro Asp
725 730 735
Asp Trp Glu Arg Ala Asn Asn Leu Asn Pro Phe Asn Ala Ala Asp Arg
740 745 750
Asn Thr Lys Asp Ser Ile Gly Tyr Thr Met Leu Glu Arg Tyr Ile Asn
755 760 765
Gly Leu Val Asp
770
<210> 95
<211> 2319
<212> DNA
<213> Microbulbifer degradans
<400> 95
atgagagata tcacgatgaa gaataataaa ttcaggtcgt cttttacatt aaaaaaactc 60
acaccgtttt ttgttgcggg caccatgctt ggcggttcca acgcctgggc tggctgcgac 120
tatacggtca ctaatcagtg gggctcaggc tttactggca acgttcgtat aactaatagc 180
ggtaacacgc caacaaatgg ttgggctgtt aactggcagt acgctggcga taatcgtatt 240
agtaatagct ggggagcaca gctttcaggg tcgaacccat actctgccac ggcagaaagc 300
tggaatgctg ttattcagcc tagtcagtcc atagaaattg ggtttcaagg taccggcgac 360
ggaaatgaaa taccaactat aaatggcgat gtttgccaga ctagcagcgg aagtacttca 420
tccagctcat cttcaagtac gtcttctagc agctcttcaa gctcgtccac tagcagctct 480
tcaaacagct cttctagctc tagctcgtcc agctcgtcta gctcctcctc tggctctaca 540
actggatata ttcatataga agagaatgaa cttggttttt gttatgtaca aggttccatt 600
gactccaaca acggtggctt taccggcaca ggctttgcca ataccgataa cgttaatggc 660
tcacagatta actggaaagt aaatgtcgac tttgatggat attatgcgct cgaatggcgc 720
tatgcgaatg gctccggcac cgcgcgcact gcaagcgtta gcgctaatgg agcacaaagc 780
gaaatttcct tccctacaac aggttcgtgg gatagctggt tattagacag cactacccta 840
tttttaaaag ctggcgtaaa cgacgtaata ttgagcgcaa atacaagcag tggcctagcg 900
aacatagatt cacttacagt gcacggtgat ggcgtagctg cggcagactg taatactgat 960
ggaagctcaa gcagcagctc tagttcaagc tctagttcca gctcaacttc tagtagctcc 1020
tcaagttcca gctcgtctag cacgtccagc tcttctggtg gcccgcaaat attaaaagca 1080
ttccccaccg cagaaggcta cggaaaaata accgcaggtg gtcgtggtgg cgatgtctat 1140
atagttacaa acctgaatga ctcaggcgcg ggtagtttgc gtcaggccgt agaggcatct 1200
ggccctagaa ccgttgtgtt cgaagtgtct ggaaccatca ctctaaataa accactcaca 1260
atcaaaaata ataacatcac aatagcagga caaactgcac caggcgatgg cattacactt 1320
agaaagcaca acttttctat ccaagctgat gatgtcatcg tacgttacat acgtgttcgc 1380
tttggtgatg aaaccctaac cgattctgat gcgatttcca tgaggtacca aaaaaatatt 1440
attttggatc atgtgagtgc tagctgggga gatgatgaaa ccttatctct ttatcacggc 1500
gaaaatatca ctgtgcaatg gagcatgatt acagagaccc tcaatcgtgg cggcgaacat 1560
gcattcgcag ctatatgggg ttcgcctttt agtaccttcc accacaattt aattgctcac 1620
aatgttgcga gaaacgttcg ctttgcgtcg ggttccggtt atacggatta tcgtaacaat 1680
gtcgtatata actggggcta tagcagcaca cacggaggcg aagctcaaca agttggcaac 1740
gctaatttta atttcaccac cgtcaatatg gtcggcaact attacaaacc tgggccgaga 1800
actgaatctg gcgttcgtag tcgactactt acacctaaca cgcgtaacgg cgatgcggac 1860
ttaggtagtt tttacgtttc tggtaaccac atggttggca gcccaaatgt aactgcagac 1920
aactcgattg gcgtatcgaa taaaaatgcc ttaataagta gcccttggaa ttcaatgaaa 1980
atagaaggcg aacaaacagc tgagcaagca tatgagtcag ttcttgctta cgcaggtgca 2040
tctaaagtac gcgactcggt agatactcgt attattgaag aagtacgtac aggcacagct 2100
acttatggtg gaaacggcat aattgaatcg cagaatgaag tgggtggttg gccacaactt 2160
agaagtgaaa cgcccccgca agacagtgat cgcgacggaa tgccagatga ctgggaacgc 2220
gcgaacaacc taaatccatt caacgcagcc gatagaaaca ctaaagacag tattggctac 2280
acaatgttag agcgatatat taacgggctt gttgattaa 2319
<210> 96
<211> 511
<212> PRT
<213> Microbulbifer degradans
<400> 96
Met Leu Arg Ile Pro Lys Ala Trp Leu Ala Leu Pro Leu Val Leu Gly
1 5 10 15
Ser Thr Asn Leu Tyr Ala Gln Val Thr Cys Ser Ile Ser Asn Thr Asn
20 25 30
Val Trp Asn Asn Gly Tyr Thr Val Asn Val Asn Val Thr Asn Thr Gly
35 40 45
Ser Ser Gln Val Gly Ser Trp Gln Val Pro Ile Asn Phe Ser Glu Pro
50 55 60
Pro Gln Val Ser Ser Gly Trp Asn Ala Ile Leu Ser Thr Asn Gly Asn
65 70 75 80
Thr Val Thr Ala Gly Asn Ile Gly Trp Asn Gly Asn Leu Asn Pro Gly
85 90 95
Gln Ser Ala Ser Phe Gly Phe Gln Gly Gly His Asp Gly Ser Phe Val
100 105 110
Glu Pro Thr Cys Ser Gly Gly Gly Ser Ser Thr Ser Ser Ser Ser Ser
115 120 125
Ser Ser Ser Ser Ser Thr Ser Ser Thr Ser Ser Ser Ser Thr Ser Ser
130 135 140
Ser Ser Ser Ser Ser Ser Gly Gly Ser Glu Leu Leu Ile Gln Glu Asn
145 150 155 160
Ala Ser Gly Phe Cys Arg Val Asp Gly Ser Ile Asp Asn Asn Asn Ser
165 170 175
Gly Tyr Thr Gly Ser Gly Phe Ala Asn Thr Glu Asn Gln Asn Gly Ser
180 185 190
Ala Val Glu Tyr Ala Leu Asn Val Pro Ser Asn Gly Asn Tyr Leu Leu
195 200 205
Asp Ala Arg Tyr Ala Ser Ala Thr Thr Arg Ser Ala Ser Val Val Val
210 215 220
Asn Gly Ser Ser Val Gly Ser Phe Ser Phe Pro Ser Thr Gly Ser Trp
225 230 235 240
Thr Ser Trp Thr Val Asp Ser Ala Asn Val Pro Leu Lys Gly Gly Asn
245 250 255
Asn Ile Val Arg Ile Val Ala Thr Asn Ser Ser Gly Leu Pro Asn Ile
260 265 270
Asp Ser Leu Lys Val Ile Gly Thr Asn Pro Ser Ala Gly Ser Cys Ser
275 280 285
Ser Asn Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
290 295 300
Ser Asn Ser Gly Gly Lys Gly Ser Ser Cys Arg Ser Thr Gly Ser Gln
305 310 315 320
Ser Val Ser Ser Thr Ile Lys Val Thr Ser Gly Thr Phe Asp Gly Asn
325 330 335
Cys Lys Thr Tyr Asn Pro Thr Ser Ala Leu Gly Asp Gly Ser Gln Ser
340 345 350
Glu Ser Gln Lys Pro Ala Phe Arg Val Glu Asn Gly Ala Thr Leu Lys
355 360 365
Asn Val Ile Leu Gly Asn Asn Gly Val Asp Gly Ile His Val Tyr Asn
370 375 380
Gly Gly Thr Leu Asp Asn Ile Arg Trp Thr Asn Val Gly Glu Asp Ala
385 390 395 400
Met Thr Val Lys Ser Glu Gly Asn Val Thr Val Ser Asn Ile Glu Gly
405 410 415
Tyr Asp Gly Ser Asp Lys Phe Ile Gln Val Asn Ala Val Thr Asn Leu
420 425 430
Lys Val Ser Asn Cys Ile Val Asp Lys Met Gly Lys Phe Leu Arg Gln
435 440 445
Asn Gly Gly Lys Thr Phe Ala Met Ser Val Thr Val Asp Asn Cys Asp
450 455 460
Ile Ser Asn Met Gly Glu Gly Val Phe Arg Ser Asp Ser Pro Asn Ala
465 470 475 480
Thr Ala Arg Ile Thr Asn Ser Arg Leu Lys Asn Ala Gly Asp Ile Cys
485 490 495
Ile Gly Lys Trp Lys Ser Cys Thr Ser Ser Asn Ile Thr Ser Phe
500 505 510
<210> 97
<211> 1536
<212> DNA
<213> Microbulbifer degradans
<400> 97
atgttgcgaa tccccaaggc ttggctggca cttccacttg tactgggaag taccaatcta 60
tacgctcaag taacttgcag tatctctaac accaatgttt ggaataacgg atacaccgtt 120
aatgttaatg taaccaacac aggctcttca caggttggtt cttggcaggt tcctattaat 180
ttttctgagc cacctcaagt aagcagcggc tggaatgcaa tattaagcac aaacggaaac 240
accgtaactg ccggcaatat tggttggaat ggtaatttaa atcccggcca aagcgcctcc 300
tttggttttc aaggtggcca cgatggcagc tttgtggagc ccacctgctc gggcggaggc 360
tctagcacta gctcaagcag ctctagtagt tctagctcaa caagttctac cagttcttca 420
tccacaagtt caagtagctc ttctagctcc ggcggctctg aacttttaat ccaagaaaat 480
gcatccggct tctgccgtgt ggacggatcg atagataaca ataactcagg ctataccggt 540
agtggctttg ccaacaccga gaaccaaaac ggttccgcag ttgaatacgc acttaacgtt 600
ccctctaatg ggaattatct cctcgacgct cgatatgcaa gcgctactac acgatcggct 660
agcgtggtag ttaatggatc ttcagtaggc agctttagtt ttccatctac gggttcgtgg 720
acaagctgga cagttgactc cgccaacgtt ccgttaaaag gcgggaataa tattgttcga 780
attgttgcaa ctaacagcag cggattacct aatattgatt cattaaaggt aataggcacc 840
aacccgtcag ccggcagttg ttcaagcaac tcgtcatcca ctagttcatc gtctagctca 900
agttcatcaa gcagtaactc cggtggcaaa ggctctagct gccgttctac aggcagtcaa 960
tctgtttcct ctactattaa agttactagc gggactttcg atgggaactg taaaacgtat 1020
aaccctacaa gtgcccttgg cgatggcagt caatcagaaa gccagaaacc ggcattccga 1080
gtggagaacg gcgcaacact caaaaacgtg attctaggca acaatggcgt agacggtatt 1140
catgtttata acggcggcac cttggataac atccgctgga ccaatgtggg tgaagatgca 1200
atgaccgtta aatctgaagg aaacgttacc gtttcaaata ttgagggtta tgacggttca 1260
gataaattta tacaagtaaa cgcagttacc aacctaaagg tttctaattg cattgtagat 1320
aaaatgggta aatttttacg tcagaatggc ggtaaaactt tcgctatgtc tgtaaccgta 1380
gataattgtg atatctcaaa tatgggtgaa ggtgttttcc gctcagacag cccaaatgca 1440
acagcgagaa tcacaaatag ccgattaaaa aatgcaggcg acatttgtat tggtaagtgg 1500
aaaagctgca catcttccaa cattaccagc ttctaa 1536
<210> 98
<211> 455
<212> PRT
<213> Microbulbifer degradans
<400> 98
Met Ile Met Met Arg Asn Lys Ile Leu Leu Ala Leu Val Leu Cys Gly
1 5 10 15
Ala Ser Ala Ser Ala Phe Ala Ala Ser Asn Arg Pro Ser Gly Tyr Thr
20 25 30
Thr Ile Cys Lys Thr Asp Gln Thr Cys Ser Val Ser Ser Ser Thr Asn
35 40 45
Val Ala Phe Gly Ala Ala Gly Lys Phe Val Tyr Lys Val Ile Asn Gly
50 55 60
Thr Phe Thr Cys Asn Thr Ser Thr Phe Gly Ser Asp Pro Asn Pro Ala
65 70 75 80
Lys Ser Val Lys Glu Cys Ser Val Pro Thr Asn Gly Ser Ser Ser Thr
85 90 95
Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly
100 105 110
Ser Ser Ser Ser Cys Gly Thr Gly Gly Gly Ala Thr Val Cys Leu Ser
115 120 125
Ala Gly Gly Gly Ser Asn Asp Ile Asp Leu Thr Trp Thr Val Ser Gly
130 135 140
Ser Ile Ser Ser Ala Gln Val Tyr Arg Asp Thr Asp Ser Asn Pro Ser
145 150 155 160
Gly Arg Thr Arg Ile Ala Gln Leu Gly Gly Asp Ala Arg Ser Tyr Ser
165 170 175
Asp Thr Asn Val Ser Ala Gly Lys Gln Tyr Tyr Tyr Trp Ile Lys Phe
180 185 190
Gly Ala Asn Gly Ser Asn Tyr Asn Ser Asn Ala Ala Ser Ala Thr Tyr
195 200 205
Ser Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
210 215 220
Ser Gly Gly Ser Ala Glu Cys Lys Ala Gly Ala Thr Ile Ser Gly Lys
225 230 235 240
Thr Val Asp Cys Gly Gly Lys Glu Ile Gly Leu Ser Cys Ser Gly Asp
245 250 255
Ser Glu Thr Gln Pro Pro Val Leu Thr Leu Lys Asn Ala Thr Ile Lys
260 265 270
Asn Leu Val Ile Ser Ala Lys Gly Gly Ser Asp Gly Ile His Cys Thr
275 280 285
Gly Asn Cys Thr Met Glu Asn Val Val Trp Lys Asp Ile Cys Glu Asp
290 295 300
Ala Ala Thr Asn Lys Thr Asp Gly Ile Thr Met Thr Ile Ile Gly Gly
305 310 315 320
Ser Ala Tyr Asn Ser Thr Ser Gly Tyr Gly Gly Lys Pro Asp Lys Val
325 330 335
Phe Gln His Asn Ser Lys Asn Ser Thr Thr Val Ile Lys Gly Gly Phe
340 345 350
Thr Leu Thr Gly Glu His Gly Lys Leu Trp Arg Ser Cys Gly Asn Cys
355 360 365
Thr Asn Asn Gly Gly Pro Arg Asn Val Thr Ile Asp Asn Val Lys Val
370 375 380
Asp Ala Lys Ile Gly Ser Ile Val Gly Val Asn Arg Asn Tyr Gly Asp
385 390 395 400
Lys Ala Thr Ile Lys Asn Leu Lys Ile Lys Asp Tyr Lys Ser Gly Ser
405 410 415
Pro Lys Val Cys Glu Glu Tyr Lys Gly Val Gln Lys Gly Ser Gly Glu
420 425 430
Ser Ser Lys Tyr Gly Glu Tyr Trp Asp Thr Ala Asn Cys Asp Val Ser
435 440 445
Lys Ser Asp Val Ser Ala Leu
450 455
<210> 99
<211> 60
<212> PRT
<213> Microbulbifer degradans
<400> 99
Met Pro His Glu Leu Leu Glu Pro Asp Glu Leu Leu Glu Leu Glu Leu
1 5 10 15
Glu Glu Leu Leu Val Leu Leu Val Leu Glu Glu Pro Leu Val Gly Thr
20 25 30
Glu His Ser Phe Thr Asp Leu Ala Gly Leu Gly Ser Leu Pro Lys Val
35 40 45
Asp Val Leu Gln Val Lys Val Pro Leu Ile Thr Leu
50 55 60
<210> 100
<211> 1368
<212> DNA
<213> Microbulbifer degradans
<400> 100
gtgataatga tgcgtaataa aatcctattg gcgcttgtat tgtgtggagc ttctgcctct 60
gcctttgcgg ctagtaatcg tcctagtggt tacacaacta tctgtaaaac cgatcaaact 120
tgttctgtaa gctcgtctac caacgttgcg ttcggcgctg ctggtaagtt tgtttacaaa 180
gtaattaacg gtacctttac ttgtaataca tctacttttg gcagcgatcc taaccctgct 240
aaatctgtaa aagaatgttc tgtacctact aatggttctt ctagcactag cagcaccagt 300
agttcttcta gctctagttc aagtagctca tcgggttcaa gcagctcatg tggcactggt 360
ggcggcgcaa ctgtatgttt aagtgcaggt ggcggcagta acgatatcga tttaacttgg 420
acagtatctg gttctatttc tagcgctcag gtttaccgcg acacagattc taaccctagt 480
ggtcgcacac gtattgctca attaggtggc gatgcaagaa gctatagcga tacgaatgtt 540
agtgctggta agcagtacta ctactggatt aagtttggcg ccaacggctc taactacaat 600
tcgaatgcgg cttctgctac ctatagtggt tcaagtagct catcgagttc ttcaagctct 660
tctagttcct catctggcgg ctcggctgaa tgtaaagctg gcgctactat ttctggtaaa 720
accgtagatt gcggtggtaa agaaattggc ttgtcgtgct cgggtgatag tgaaactcaa 780
ccaccagtat taacgcttaa aaatgccacc attaaaaact tggtaatttc tgctaaaggt 840
gggtccgacg gtattcactg tactggcaac tgcaccatgg aaaatgttgt ttggaaagat 900
atctgtgaag atgctgctac caacaaaacc gacggtatta ccatgaccat tattggtggt 960
agtgcgtata actctacaag cggttacggc ggcaagccag ataaggtttt ccaacataac 1020
tctaaaaaca gtactactgt aattaaaggc ggctttacat taacaggtga gcacggcaaa 1080
ttgtggcgtt catgtggtaa ctgtactaat aacggcggcc cacgtaatgt gactatcgac 1140
aacgttaaag tagacgcgaa aataggcagt attgttggcg ttaaccgcaa ctatggcgat 1200
aaggcaacaa tcaaaaactt aaagattaaa gactacaaat ctggtagccc caaagtgtgt 1260
gaagaataca agggtgtaca aaagggtagt ggcgagtctt ctaagtatgg cgaatactgg 1320
gatactgcaa actgcgatgt aagtaaatca gatgtgtctg ctctttaa 1368
<210> 101
<211> 424
<212> PRT
<213> Microbulbifer degradans
<400> 101
Met Phe Asn Lys Ile Leu Val Ala Val Gly Leu Leu Ala Ala Ser Leu
1 5 10 15
Ser Val His Ala Ala Thr Asn Arg Pro Ser Gly Tyr Thr Thr Ile Cys
20 25 30
Lys Val Gly Glu Thr Cys Ser Val Ser Gln Ser Thr Asn Val Ala Phe
35 40 45
Gly Ala Ser Gly Gln Phe Val Tyr Lys Val Leu Asn Gly Ser Phe Ser
50 55 60
Cys Ser Val Ser Thr Phe Gly Ser Asp Pro Ile Pro Ser Lys Ser Val
65 70 75 80
Lys Glu Cys Ser Ile Pro Ser Asn Gly Ser Ser Ser Ser Gly Ser Ser
85 90 95
Ser Ser Ser Ser Ser Ser Ser Ser Gly Ser Ser Ser Gly Gly Gly Cys
100 105 110
Gly Ser Gly Gly Gly Ser Thr Val Cys Leu Ser Ala Ser Gly Ser Ser
115 120 125
Asn Gly Ile Asn Leu Ser Trp Ser Val Ser Gly Ser Ile Ser Ser Val
130 135 140
Gln Leu Tyr Arg Asp Thr Asp Ser Asn Pro Ser Gly Arg Thr Arg Ile
145 150 155 160
Ala Ser Val Ser Ser Ser Thr Thr Ser Phe Ser Asp Thr Gly Ala Ala
165 170 175
Ser Gly Thr Thr Tyr Tyr Tyr Trp Val Lys Tyr Tyr Val Asn Gly Thr
180 185 190
Ala Tyr Asn Ser Gly Val Ala Ser Ala Val Arg Gly Ser Ser Ser Ser
195 200 205
Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Gly Gly Lys Gly
210 215 220
Ser Ser Cys Ser Ser Thr Gly Ser Gln Ser Val Ser Ser Thr Ile Lys
225 230 235 240
Val Thr Ser Gly Thr Tyr Asp Gly Gly Cys Lys Thr Phe Asn Pro Thr
245 250 255
Ser Ala Leu Gly Asp Gly Ser Gln Ser Glu Ser Gln Lys Pro Ala Phe
260 265 270
Arg Val Glu Asn Gly Ala Thr Leu Lys Asn Val Ile Ile Gly Asn Asn
275 280 285
Gly Val Asp Gly Ile His Val Tyr Asn Gly Gly Thr Leu Asn Asn Ile
290 295 300
Leu Trp Thr Asn Val Gly Glu Asp Ala Met Thr Val Lys Ser Glu Gly
305 310 315 320
Asn Val Thr Val Thr Asn Val Glu Gly Tyr Asp Gly Glu Asp Lys Phe
325 330 335
Ile Gln Val Asn Ala Val Thr Asn Leu Lys Val Ser Asn Cys Ile Val
340 345 350
Asn Lys Met Gly Lys Phe Leu Arg Gln Asn Gly Gly Lys Thr Phe Ala
355 360 365
Met Ser Val Ser Val Asp Asn Cys Asp Ile Ser Asn Met Gly Glu Gly
370 375 380
Ile Phe Arg Ser Asp Ser Pro Asn Ala Thr Ala Val Ile Thr Asn Ser
385 390 395 400
Arg Leu Arg Asn Ala Gly Asp Ile Cys Ile Gly Ala Trp Lys Ser Cys
405 410 415
Lys Ser Ser Asn Ile Ser Ser Phe
420
<210> 102
<211> 1275
<212> DNA
<213> Microbulbifer degradans
<400> 102
atgtttaaca agatactcgt tgcagtagga ttacttgcgg ctagcctttc tgtgcacgcc 60
gcaacaaacc gcccaagtgg ttatacaaca atttgtaagg ttggtgaaac atgctcggta 120
agtcagtcta cgaatgtagc ctttggcgcg tctgggcagt ttgtgtataa agtattaaac 180
ggtagctttt cttgtagtgt ttctacgttt ggtagtgacc ctattccttc taaatctgta 240
aaagaatgtt caatcccatc aaacggctct agctcttctg gctcgtcttc atcttcgtct 300
agcagctctt ccggtagctc ttctggtggt ggctgtggca gcggtggtgg ttctacggtg 360
tgcttatcgg cctcgggttc tagcaatggt atcaatttaa gttggtctgt atctggttct 420
atatcttccg tgcagcttta tcgcgatacc gattcaaacc caagcggtcg cacgcgtatt 480
gctagtgtat ctagctctac tactagcttt agtgataccg gcgcggcatc gggcaccact 540
tattactact gggttaaata ttatgtaaat ggtactgctt acaactcggg tgttgcttct 600
gcggtgcgcg gttcttctag ctctagtagt tcaagttctt ccagcacttc tagcagttct 660
ggtggaaaag gttctagttg tagctctact ggtagccaat ctgtgtcttc tactattaag 720
gtaaccagcg gtacttacga tggtggttgt aaaacattta accctaccag tgctttgggt 780
gatggtagcc aatctgaaag ccaaaaacct gctttccgtg tagaaaacgg tgcaacgtta 840
aagaatgtaa ttattggcaa taacggtgtg gatggtattc acgtttacaa cggcggtacg 900
ttaaataata ttctttggac taacgtaggt gaagatgcca tgaccgttaa gtctgaaggt 960
aacgtgacgg taaccaatgt tgaaggctat gacggcgaag ataagtttat tcaggtaaac 1020
gcagtgacta acttaaaagt ttctaactgt attgtgaata aaatgggtaa gtttcttcgt 1080
cagaatggtg gtaaaacatt tgccatgtcg gtaagtgtag ataactgcga tatatctaat 1140
atgggtgaag gtatcttccg ttcagacagc ccgaacgcta cagcggttat tactaacagc 1200
cgtttacgca acgctgggga tatttgtatt ggggcttgga aaagttgtaa atcttccaat 1260
atcagcagct tttaa 1275
<210> 103
<211> 392
<212> PRT
<213> Microbulbifer degradans
<400> 103
Met Lys Lys Leu Ile Leu Met Val Ala Leu Leu Ala Phe Ser Val Ser
1 5 10 15
Ser Phe Ala Ala Leu Ser Ser Gly Arg Tyr Ile Ile Val Ser Lys Leu
20 25 30
Asn Gly Asn Ala Leu Asp Val Asp Ser Phe Ser Thr Ala Asp Gly Ala
35 40 45
Asn Val Met Gln Trp Phe Ala Leu Gly Gly Val Asn Gln Gln Phe Asp
50 55 60
Val Ala Val Leu Ser Asp Gly Ser Tyr Ser Ile Arg Pro Val His Ser
65 70 75 80
Gly Lys Ser Leu Asp Val Tyr Ala Trp Asn Ala Asp Asp Gly Ala Glu
85 90 95
Leu Arg Gln Trp Ala Tyr Thr Gly Ala Asp Asn Gln Arg Trp Tyr Ile
100 105 110
Asp Asn Gln Ser Gly Asp Tyr Tyr Ser Ile Thr Ser Lys Phe Ser Gly
115 120 125
Arg Ala Leu Asp Val Trp Gly Met Ser Met Tyr Thr Gly Ala Asp Val
130 135 140
Arg Leu Tyr Ser Tyr Trp Gly Gly Ala Gly Gln Leu Trp Thr Phe Gln
145 150 155 160
Lys Val Gly Ser Ser Ser Glu Cys Tyr Ala Gly Ala Thr Leu Thr Asn
165 170 175
Arg Phe Val Asp Cys Gly Gly Lys Thr Ile Gly Leu Ser Cys Val Gly
180 185 190
Asp Ser Glu Thr Gln Gly Ala Val Leu Thr Leu Lys Asn Ser Ser Ile
195 200 205
Arg Asn Val Lys Leu Ala Ala Asn Gly Gly Ala Asp Gly Ile His Cys
210 215 220
Thr Ser Gly Asn Cys Thr Leu Ala Asp Val Val Trp Asn Asp Ile Cys
225 230 235 240
Glu Asp Ala Ala Thr Asn Lys Ser Glu Gly Gly Thr Leu Thr Ile Val
245 250 255
Gly Gly Ser Ala Tyr Asn Ser Thr Gly Gly Tyr Gly Gly Thr Pro Asp
260 265 270
Lys Ile Phe Gln His Asn Ser Lys Asn Ser Thr Thr Ile Val Ala Gly
275 280 285
Gly Phe Thr Ala Tyr Gly Thr His Gly Lys Leu Trp Arg Ser Cys Gly
290 295 300
Asn Cys Thr Asn Asn Gly Gly Pro Arg Asn Leu Leu Val Tyr Ser Val
305 310 315 320
Asn Ile Asp Ala Ser Ile Gly Ala Ile Ala Gly Val Asn Arg Asn Tyr
325 330 335
Gly Asp Arg Ala Thr Ile Arg Asp Leu Lys Ile Lys Asn Tyr Ser Ser
340 345 350
Gly Ser Pro His Val Cys Asp Glu Tyr Gln Gly Val Gln Lys Gly Asn
355 360 365
Ser Ser Thr Lys Tyr Gly Glu Tyr Trp Asn Thr Ala Ser Cys Asp Val
370 375 380
Ser Arg Ser Asp Val Ser Gly Leu
385 390
<210> 104
<211> 1179
<212> DNA
<213> Microbulbifer degradans
<400> 104
atgaaaaaac ttatccttat ggtggcgctg ttggctttta gtgttagttc ttttgctgca 60
ctgtcttcag gccgctacat tattgtttct aaacttaatg gcaacgcgtt agatgtagat 120
agctttagca ccgcagatgg cgccaatgtt atgcagtggt ttgctttggg tggtgtgaac 180
cagcagtttg acgtggcagt gcttagcgat ggcagttact ccatacgacc agtgcacagc 240
ggtaagtcat tagatgtata tgcgtggaac gcagacgatg gtgcggaact tcgtcagtgg 300
gcatacacag gcgcagataa ccaacgttgg tatatcgata atcaaagtgg cgattactat 360
tcaattacgt ctaaatttag cgggcgcgca ttggatgtat ggggtatgag tatgtacacc 420
ggcgcagatg tccgccttta ttcatattgg ggcggcgcgg ggcagctgtg gaccttccaa 480
aaggtaggta gctcaagtga gtgttacgca ggtgctacgt taacaaaccg ctttgtggat 540
tgtggcggca aaacaatagg ccttagttgt gtaggcgata gtgaaactca aggcgcggtg 600
ctaaccctta aaaactcgtc cattcgcaat gttaagttgg ctgcaaacgg tggtgcggat 660
ggcattcact gcactagtgg caactgcaca ttagccgacg ttgtttggaa cgatatttgt 720
gaagatgctg ccacgaataa gtctgaaggt ggcaccctga ctattgtggg tggttcggcg 780
tataactcta ctggcgggta tggtggtaca ccggataaaa tttttcagca caactcgaaa 840
aacagcacaa caattgttgc cggcggcttc actgcatatg gtacccacgg taagttgtgg 900
cgctcgtgtg gtaactgtac aaacaacggc ggtccgcgta atttactggt ttatagcgtg 960
aatattgacg caagtattgg cgcaattgct ggtgttaacc gcaattacgg cgatagagcg 1020
accattcgcg acctaaaaat aaagaattat tcttctggca gcccgcatgt gtgtgacgaa 1080
tatcaaggcg tacagaaggg caattcttct acaaaatatg gcgagtactg gaataccgca 1140
agttgtgatg tttcgcggtc agatgtaagt gggctttaa 1179
<210> 105
<211> 733
<212> PRT
<213> Microbulbifer degradans
<400> 105
Met Phe Arg Tyr Ile Leu Thr Ala Phe Ala Leu Val Ala Ala Ala Ser
1 5 10 15
Cys Ala Gln Ala Ala Thr Asn Arg Pro Ser Gly Tyr Thr Thr Ile Cys
20 25 30
Lys Thr Asn Gln Thr Cys Ser Val Ser Ser Pro Thr Asn Val Ala Phe
35 40 45
Gly Ala Ser Gly Lys Phe Thr Phe Lys Val Leu Asn Gly Ser Phe Val
50 55 60
Cys Ser Val Ala Thr Phe Gly Ser Asp Pro Asn Pro Ala Lys Ser Ala
65 70 75 80
Lys Glu Cys Ser Ile Pro Ser Asp Gly Ser Ser Ser Thr Ser Ser Thr
85 90 95
Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser
100 105 110
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
115 120 125
Ser Ser Ser Ser Ser Gly Ser Ser Gln Ala Gly Cys Gly Ser Gly Gly
130 135 140
Gly Ala Thr Val Cys Leu Ser Ala Thr Asp Thr Ala Ser Ala Ile Asn
145 150 155 160
Leu Asn Trp Thr Val Ser Gly Ser Leu Ser Ser Val Gln Val Tyr Arg
165 170 175
Asp Thr Asp Pro Asn Pro Ser Gly Arg Thr Arg Leu Thr Ser Leu Ser
180 185 190
Pro Ser Val Thr Ser Tyr Thr Asp Asn Asn Ala Gln Ala Gly Thr Thr
195 200 205
Tyr Tyr Tyr Trp Ile Lys Phe Gly Ala Asn Gly Ser Asn Tyr Asn Ser
210 215 220
Gly Ala Ala Ser Ala Val Ile Ala Asn Thr Gly Asn Asp Asp Glu Gly
225 230 235 240
Cys Gly Ser Asp Val Cys Leu Thr Ala Thr Ala Asn Ile Gly Ser Ile
245 250 255
Gly Leu Ser Trp Gly Ser Ser Ala Ala Leu Thr Ser Val Gln Ile Tyr
260 265 270
Arg Asp Thr Asp Ser Asn Pro Ser Gly Arg Thr Arg Ile Ala Ser Leu
275 280 285
Ser Thr Ser Ala Thr Ser Phe Thr Asp Ser Thr Thr Ala Val Gly Thr
290 295 300
Thr Tyr Tyr Tyr Trp Val Lys Tyr Gly Leu Asn Gly Ser Gln Leu Asn
305 310 315 320
Ser Asn Val Ala Ser Ala Thr Ala Leu Gln Asn Asn Thr Gly Asn Ala
325 330 335
Ser Cys Pro Gly Glu Thr Ser Gly Glu Thr Ala Ala Thr Val Tyr Tyr
340 345 350
Val Thr Pro Asn Gly Ser Ala Ser Ala Ser Gly Asn Ser Phe Ala Ser
355 360 365
Ala Met Asp Ile Asp Thr Ala Leu Ser Ile Val Gly Ala Gly Gln Met
370 375 380
Ile Leu Met Gln Pro Gly Thr Tyr Thr Val Ala Tyr Ser Ala Gly Asn
385 390 395 400
Lys Asn Thr Lys Val Leu Ser Arg Ser Gly Ala Ala Gly Ala Pro Ile
405 410 415
Lys Met Val Ala Ala Asn Cys Gly Arg Ala Val Phe Asp Phe Ser Phe
420 425 430
Pro Glu Arg Glu Trp Val Gln Asp Ser Tyr Gly Phe Phe Leu Thr Gly
435 440 445
Asp Tyr Trp Tyr Phe Lys Gly Ile Glu Ile Thr Arg Ala Gly Tyr Gln
450 455 460
Gly Val Tyr Val Thr Gly Ala His Asn Thr Phe Glu Asn Cys Ala Phe
465 470 475 480
Tyr Tyr Asn Arg Asn Thr Gly Leu Glu Ile Asn Lys Gly Gly Ser Tyr
485 490 495
Thr Thr Val Ile Asn Ser Asp Ala Tyr Arg Asn Tyr Asp Pro Lys Lys
500 505 510
Asn Gly Ser Met Ala Asp Gly Phe Gly Pro Lys Gln Thr Gln Gly Pro
515 520 525
Gly Asn Lys Phe Ile Gly Cys Arg Ala Trp Glu Asn Ser Asp Asp Gly
530 535 540
Phe Asp Leu Tyr Asp Ser Pro Glu Glu Val Thr Ile Glu Asn Ser Trp
545 550 555 560
Ala Phe Arg Asn Gly Val Asp Val Trp Gly Tyr Gly Gly Phe Ala Gly
565 570 575
Asn Gly Asn Gly Phe Lys Leu Gly Gly Asn His Val Ala Ala Asn Asn
580 585 590
Arg Ile Thr Asn Ser Val Ala Phe Gly Asn Pro Val Lys Gly Phe Asp
595 600 605
Gln Asn Asn Asn Ala Gly Gly Ile Thr Val Leu Asn Cys Thr Ala Tyr
610 615 620
Ala Asn Gly Thr Asn Tyr Gly Phe Gly Asn Asn Leu Asn Ser Gly Glu
625 630 635 640
Gln His Tyr Phe Arg Asn Asn Val Ser Val Ser Gly Ala Val Asn Ile
645 650 655
Ser Asn Ala Asp Asn Lys Tyr Asn Ser Trp Asn Gly Gly Val Thr Ala
660 665 670
Ser Thr Ala Asp Phe Glu Asn Val Asp Leu Ser Lys Ala Thr Ala Ala
675 680 685
Arg Asn Ile Asp Gly Ser Leu Pro Asn Asn Gly Leu Phe Arg Leu Lys
690 695 700
Ser Gly Ser Asp Leu Ile Asp Ala Gly Val Glu Val Gly Leu Pro Ser
705 710 715 720
Asn Gly Ser Ala Pro Asp Met Gly Ala Phe Glu Ala Asn
725 730
<210> 106
<211> 2202
<212> DNA
<213> Microbulbifer degradans
<400> 106
atgtttcgat acatccttac tgctttcgca ttggtggcag cggcttcctg cgcgcaagca 60
gctaccaatc gccctagcgg ttacaccacc atatgtaaaa ccaatcaaac ctgctctgtt 120
tcaagcccta ccaatgtggc gtttggcgca tcgggtaaat ttacctttaa agtgcttaat 180
ggctcttttg tttgtagcgt agccactttt ggctccgacc ctaacccggc taaaagtgct 240
aaagagtgct ctattccgtc ggatggctct tccagcacct ctagtacttc gagcacatcg 300
tctagttcta gtagttcatc tagcagcaca agttcaagca gcagctctag cagttcttct 360
agctcttcgt cttccagtag ctcaagttct agttcaagtg gctcttcgca agctggctgc 420
ggtagtggcg ggggagcaac ggtttgttta tcggcaaccg atacagctag tgctatcaat 480
ttaaattgga cagtgagtgg ctcgctgtcg agcgtgcagg tgtatcgcga taccgatccc 540
aacccaagtg gacgtacgcg gttaacgtcg ttaagccctt cggtaacaag ctacaccgac 600
aacaacgcac aggccggtac aacctactat tactggatta aattcggcgc aaacggcagc 660
aactataatt ccggtgctgc gtcggctgta atagccaata ccggtaacga tgatgaaggt 720
tgtggtagtg atgtgtgctt gacggcaacc gctaatattg gctctatagg cttaagctgg 780
ggctcttctg ccgctttaac cagcgtacaa atttatcgcg atacagattc aaacccaagt 840
gggcgtacac gtattgcatc gcttagcact tctgcaacca gctttaccga ttcaaccact 900
gcagtaggca caacgtatta ttactgggtt aaatacggct taaacggcag ccagctaaac 960
tctaatgttg catctgccac tgctttgcaa aataacactg gcaacgcaag ttgccccggt 1020
gaaacaagtg gcgaaactgc agcaaccgtg tattacgtaa caccaaacgg ttcggctagt 1080
gcaagcggca atagctttgc atctgcaatg gatatagaca cagcactttc aattgtaggc 1140
gcggggcaaa tgatattaat gcagcctggt acctacaccg ttgcctatag tgcgggtaat 1200
aaaaatacca aagtgctttc gcgctcgggt gccgctggtg cacctatcaa aatggtggca 1260
gccaattgcg gtcgtgcagt gtttgatttt tcgttcccag aacgtgagtg ggtgcaagat 1320
tcttacggct ttttcttaac tggcgattac tggtatttta aaggaataga aattacccgt 1380
gcaggctacc aaggtgtgta tgtaacgggt gcgcacaaca catttgaaaa ttgcgccttt 1440
tattacaacc gcaatacagg tttagaaatt aacaaaggcg ggtcttacac caccgtcatt 1500
aattcagatg cctatcgcaa ttacgatccc aagaaaaacg gcagcatggc cgatggcttt 1560
ggccctaaac aaacccaagg cccaggcaat aaatttattg gttgccgcgc gtgggaaaac 1620
tccgacgatg gatttgacct gtacgatagc ccagaagaag taactattga aaatagctgg 1680
gcatttcgca acggtgtaga tgtatggggt tacggtggtt ttgcgggtaa tggcaacggc 1740
tttaaattgg gcggtaacca cgtggctgca aacaatcgta ttaccaactc ggttgcgttc 1800
ggcaaccccg taaaaggttt tgatcaaaac aataatgccg gcggtattac agtgcttaat 1860
tgcacagcct acgccaacgg cactaactac ggctttggca acaacttaaa ctcgggtgag 1920
caacactact tccgcaataa tgtttctgta tctggcgctg tgaatattag caatgccgac 1980
aacaaataca attcgtggaa cggcggagta acagcatcca cggcagattt tgaaaacgta 2040
gatttatcca aagccaccgc tgcacgtaac atagatggca gcctgccaaa caacggccta 2100
ttccgcttaa aaagcggcag cgatttaata gacgccggtg tagaggtagg tttaccaagt 2160
aacggcagcg cgcccgatat gggagcgttc gaagctaact ag 2202
<210> 107
<211> 700
<212> PRT
<213> Microbulbifer degradans
<400> 107
Met Lys Asn Val Phe Asn Thr Gln Lys Thr Lys Arg His Cys Asn Tyr
1 5 10 15
Ala Tyr Asn Ala Lys Ala Ala Lys Pro Phe Ser Gln Lys Ala Leu Val
20 25 30
Gln Lys Cys Ala Ala Ala Ala Leu Ser Val Gly Leu Leu Gly Ala Val
35 40 45
Gly Asn Ala Tyr Ala Ile Ser Cys Ser Ala Thr Ala Asp Thr Trp Gly
50 55 60
Gly Gly Tyr Val Leu Asn Val Thr Val Thr Asn Asp Thr Asn Asn Ala
65 70 75 80
Ile Ser Asn Trp Ala Leu Ala Leu Asn Tyr Asp Gln Ala Ala Ala Ile
85 90 95
Thr Asn Ser Trp Asn Ala Ser Val Ser Ala Asn Gly Asn Val Val Asn
100 105 110
Ala Thr Asn Ile Gly Trp Asn Gly Asn Leu Ala Ala Gly Gln Ser Thr
115 120 125
Ser Phe Gly Leu Gln Gly Thr Tyr Thr Gly Asn Phe Ser Leu Pro Val
130 135 140
Cys Val Gly Gln Gly Gln Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser
145 150 155 160
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
165 170 175
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
180 185 190
Ser Ser Ser Ser Thr Ser Ser Thr Gly Gly Ser Ser Glu Leu Thr Ile
195 200 205
Gln Glu Asp Asn Ser Gly Phe Cys Gly Val Asp Gly Ser Ile Asp Ser
210 215 220
Asn Asn Ser Gly Phe Thr Gly Ser Gly Phe Ala Asn Thr Asp Asn Ala
225 230 235 240
Thr Gly Lys Ser Val Asp Trp Ser Val Ser Val Pro Tyr Ser Gly Asn
245 250 255
Tyr Leu Leu Glu Trp Arg Tyr Ala Asn Gly Ser Gly Asn Asn Arg Ala
260 265 270
Gly Ala Ile Glu Val Asn Gly Asn Ala Arg Gly Asn Gln Ser Phe Pro
275 280 285
Thr Thr Gly Ala Trp Thr Ser Trp Thr Thr Ala Ser Ala Asn Val Ser
290 295 300
Leu Asp Ala Gly Thr Asn Leu Ile Ser Leu Val Ala Ser Thr Gly Glu
305 310 315 320
Gly Leu Gly Asn Ile Asp Ser Leu Thr Val Ile Gly Asn Asp Ile Gln
325 330 335
Thr Gly Ala Cys Asp Ser Thr Gly Ser Ser Ser Ser Ser Ser Ser Ser
340 345 350
Ser Ser Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser
355 360 365
Ser Ser Ser Ser Gly Ala Pro Met Leu Pro Gln Ala Gly Asn Pro Ile
370 375 380
Asn Gly Lys Phe Gly Lys Tyr Lys Ser Trp Gln Lys Gly Ser Leu Ser
385 390 395 400
Ala Asp Lys Gln Phe Ala Asp Ile Leu Leu Ser His Gln Tyr Thr Asn
405 410 415
Gly Gly Phe Pro Lys Asn Gln Ala Tyr Asp Ser Met Gly Ser Gly Gly
420 425 430
Asn Ser Ala Gly Thr Ile Asp Asn Asp Ala Thr Thr Thr Glu Leu Leu
435 440 445
Phe Leu Ala Asp Val Tyr Gln Arg Thr Gly Glu Thr Lys Tyr Arg Asp
450 455 460
Gly Ala Arg Lys Ala Leu Asp Phe Leu Leu Asp Met Gln Tyr Ser Ser
465 470 475 480
Gly Gly Trp Pro Gln Tyr Tyr Pro Val Arg Ser Gly Tyr Tyr Glu His
485 490 495
Val Thr Phe Asn Asp Asp Ala Met Ala Arg Val Leu Ile Val Leu Asp
500 505 510
Lys Ala Lys Gln Gly Val Ala Pro Leu Asn Gly Asp Leu Leu Thr Ser
515 520 525
Asn Gln Arg Ala Arg Leu Ser Ser Ala Val Asn Lys Gly Val Asp Tyr
530 535 540
Ile Leu Lys Ser Gln Trp Arg Gln Asn Gly Thr Leu Thr Val Trp Cys
545 550 555 560
Ala Gln His Gly Lys Asp Asp Tyr Leu Pro Lys Lys Ala Arg Ala Tyr
565 570 575
Glu Leu Glu Ser Leu Ser Gly Ser Glu Ser Val Leu Val Val Ala Phe
580 585 590
Leu Met Ser Gln Pro Gln Thr Pro Glu Ile Lys Thr Ala Val Lys Ala
595 600 605
Ala Ile Asn Trp Phe Arg Ser Pro Asn Thr Tyr Leu Ala Gly Tyr Thr
610 615 620
Tyr Asp Ser Ser Arg Lys Gly Asp Gly Asn Ser Pro Ile Val Ala Lys
625 630 635 640
Ser Gly Ser Lys Met Trp Tyr Arg Phe Tyr Asp Leu Asn Thr Asn Arg
645 650 655
Gly Phe Phe Ser Asp Arg Asp Ser Arg Lys Val Tyr Asp Ile Leu Asp
660 665 670
Ile Ser Thr Glu Arg Lys Asp Gly Tyr Arg Trp Gly Gly Asp Tyr Gly
675 680 685
Ser Gly Ile Ile Ser Tyr Ala Glu Ser Val Gly Tyr
690 695 700
<210> 108
<211> 72
<212> PRT
<213> Microbulbifer degradans
<400> 108
Met Val Asn Ser Leu Leu Pro Pro Val Glu Leu Val Asp Glu Asp Asp
1 5 10 15
Glu Leu Glu Asp Glu Glu Leu Leu Glu Leu Leu Glu Glu Leu Leu Glu
20 25 30
Glu Leu Glu Asp Glu Leu Leu Asp Glu Leu Glu Glu Leu Asp Glu Leu
35 40 45
Leu Leu Val Leu Leu Leu Glu Glu Glu Leu Trp Pro Trp Pro Thr Gln
50 55 60
Thr Gly Lys Leu Lys Leu Pro Val
65 70
<210> 109
<211> 2103
<212> DNA
<213> Microbulbifer degradans
<400> 109
atgaaaaacg tatttaatac acaaaaaacc aagcggcatt gcaactatgc ctataacgcg 60
aaagctgcca aacccttcag ccaaaaagca ctggtacaaa agtgtgcggc tgcggcattg 120
tctgttggct tactgggggc cgtgggtaat gcgtatgcaa tatcctgttc ggcaactgct 180
gatacctggg gtggtggcta tgtgctaaat gtgaccgtta ctaacgacac aaataatgca 240
attagcaatt gggcgctagc tttaaattac gatcaagctg cagccataac taattcgtgg 300
aacgcgagtg tgtctgcaaa tggcaatgtg gttaatgcta ccaacattgg ttggaatggc 360
aatttagcgg ctggccaaag tacaagtttt ggtttgcaag gcacctatac cggcaacttt 420
agtttacctg tctgtgttgg ccagggccaa agctcttcct ctagtagcag tactagtagt 480
agttcgtcca gctcttcgag ttcatccagc agttcatctt ccagctcttc aagtagttct 540
tcaagcagct ctagtagttc ttcatcttct agctcatcgt cttcgtccac cagttcaact 600
ggtggaagta gtgagttaac cattcaagaa gataatagcg gcttttgcgg ggttgatggt 660
tcaatcgatt ccaataattc aggctttacc ggaagcggct tcgccaatac cgataacgcg 720
acaggcaaaa gcgtagattg gagtgtgagc gttccttata gtggcaacta tttgcttgaa 780
tggcgttacg caaatggttc cggtaataac cgtgctggcg cgattgaagt aaatggtaac 840
gctcgtggta atcaaagctt tcctactaca ggggcctgga ctagctggac aacggcaagt 900
gctaacgtaa gtttggatgc aggtacaaac ttaattagct tggttgcatc tacgggcgaa 960
ggcttaggga atattgattc acttactgtc attggtaacg atattcagac cggcgcttgt 1020
gattctacag ggtctagctc ttccagtagc tctagttcct ccagtacttc tagctcaagt 1080
tccagctcca gttcaagtac cagtagttct agcagcggtg cgcccatgtt acctcaagct 1140
gggaacccca ttaatggcaa gtttggcaaa tataaatctt ggcaaaaagg aagcttgtct 1200
gccgataagc aatttgcaga catccttcta tcgcaccaat ataccaatgg cggatttccc 1260
aaaaaccaag cctacgacag tatgggtagc ggtggtaaca gtgcgggcac aatcgacaac 1320
gatgccacaa caacagagtt gttattctta gctgatgtgt accagcgtac tggtgaaacc 1380
aaataccgag acggtgcgcg taaagcgtta gatttccttt tggatatgca gtattcatcg 1440
ggcggctggc cgcaatacta ccctgtgcgc agtggctact acgagcatgt aacatttaac 1500
gacgatgcaa tggcgcgagt gctaattgtt ttagataaag cgaaacaagg tgtggcgccg 1560
ctaaatggcg atctattaac atctaaccag cgtgcgcgtt taagcagtgc ggttaataag 1620
ggcgtggatt acattcttaa atcgcagtgg cgtcaaaacg gaaccttaac tgtttggtgt 1680
gcgcaacatg gtaaagatga ttatctacct aaaaaggcgc gtgcttacga gctggaatcg 1740
ttgagtggta gcgaatcggt attggtagtt gcattcctta tgtctcaacc tcaaacccct 1800
gaaattaaaa cggcggttaa ggctgctatc aattggttta gaagccccaa tacttactta 1860
gctggttaca cttacgattc atctagaaaa ggcgacggca acagccccat cgtcgcgaaa 1920
agcggtagta aaatgtggta tcgcttctac gacctaaata ctaaccgtgg cttcttcagt 1980
gatagagata gcagaaaagt ttacgatatt ttagatattt ctacagagcg taaagatggc 2040
tatcgttggg gcggtgatta tggctccggc atcattagtt acgcggaaag cgttggttac 2100
taa 2103
<210> 110
<211> 96
<212> PRT
<213> Microbulbifer degradans
<400> 110
Met Ser Val Pro Ala Leu Pro Pro Glu Glu Leu Leu Glu Leu Glu Asp
1 5 10 15
Glu Leu Leu Glu Leu Glu Asp Glu Leu Leu Glu Leu Asp Glu Leu Leu
20 25 30
Glu Leu Asp Glu Leu Leu Glu Leu Val Glu Leu Leu Glu Leu Glu Glu
35 40 45
Leu Val Glu Ala Phe Ile Ser Leu Ala Gly Pro Glu Leu Pro Pro Pro
50 55 60
Gln Ala Leu Asn Ser Val Ala Ala Ser Ala Ser Val Ile Arg Arg Arg
65 70 75 80
Lys Phe Ile Ile Val Ala Leu Tyr Val Phe Ile Leu Arg Ala His Lys
85 90 95
<210> 111
<211> 574
<212> PRT
<213> Microbulbifer degradans
<400> 111
Met Asn Phe Leu Arg Leu Ile Thr Leu Ala Leu Ala Ala Thr Leu Leu
1 5 10 15
Ser Ala Cys Gly Gly Gly Ser Ser Gly Pro Ala Lys Glu Ile Asn Ala
20 25 30
Ser Thr Ser Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser
35 40 45
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
50 55 60
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Gly Ser Ala Gly
65 70 75 80
Thr Leu Thr Ile Gln Glu Ser Glu Lys Gly Phe Cys Thr Val Asn Gly
85 90 95
Glu Ile Val Asn Asn His Glu Gly Tyr Ser Gly Thr Gly Phe Val Asp
100 105 110
Thr Ala Asn Ala Glu Gly Ala Ser Ile Thr Trp Lys Val Asp Val Asp
115 120 125
Gly Gly Asn Tyr Asp Val Ser Val Arg Phe Ala Asn Gly Ser Thr Ala
130 135 140
Arg Gly Ala Thr Leu Ser Ser Asn Glu Ile Asn Thr Thr Tyr Gly Phe
145 150 155 160
Ala Thr Thr Gly Asp Trp Ala Thr Trp Ala Asp Glu Thr His Thr Val
165 170 175
Ser Leu Ala Ala Gly Glu Asn Thr Ile Gln Leu Ser Ala Leu Thr Ala
180 185 190
Gly Gly Leu Pro Asn Val Asp Ala Ile Thr Ile Ala Gly Ala Gly Val
195 200 205
Leu Ala Ala Asp Cys Ala Thr Glu His Thr Gly Pro Met Leu Ser Gln
210 215 220
Thr Gly Asn Pro Ile Tyr Thr Glu Leu Asn Asn Tyr Lys Ser Trp Leu
225 230 235 240
Thr Gly Ser Gly Thr Thr Ala Ala Lys Leu Ala Ala Asp Lys Thr Ile
245 250 255
Ala Asp Asn Met Ile Thr Trp Gln Met Pro His Gly Gly Phe Tyr Lys
260 265 270
Tyr Gly Val Ser Lys Tyr Ser Ser Ala Trp Asn Gly Ser Asp Ala Arg
275 280 285
Ser Gly Trp Thr Gly Ala Asn Gly Val Glu Leu Gly Thr Ile Asp Asn
290 295 300
Asp Ala Thr Val Ser Glu Leu Leu Phe Leu Ala Asp Val Tyr Lys Arg
305 310 315 320
Ser Gly Glu Thr Lys Tyr Arg Asp Ala Ala Arg Ser Ala Leu Glu Phe
325 330 335
Leu Leu Thr Met Gln Tyr Ser Thr Gly Gly Trp Pro Gln Val Tyr Pro
340 345 350
Ala Arg Thr Gly Thr Ser Tyr Ser Asn His Val Thr Phe Asn Asp Asn
355 360 365
Ala Met Ala Arg Val Leu Ile Leu Leu Asp Lys Ala Ala Arg Leu Glu
370 375 380
Ala Pro Leu Asp Gly Asp Ile Phe Thr Thr Asp Gln His Thr Arg Ile
385 390 395 400
Thr Thr Ala Ile Asn Gly Gly Ile Asp Phe Ile Leu Asn Ala Gln Ile
405 410 415
Val Gln Gly Asp Val Lys Thr Val Trp Cys Ala Gln His Asp Pro Tyr
420 425 430
Thr Tyr Glu Ala Lys Ala Ala Arg Ser Tyr Glu Leu Ala Ser Lys Ser
435 440 445
Gly Lys Glu Ser Val Leu Val Val Ala Phe Leu Met Thr Arg Pro Gln
450 455 460
Ser Glu Ala Ile Glu Asn Ala Val Lys Ala Ala Leu Ala Trp Tyr Arg
465 470 475 480
Asn Pro Asn Val Gln Val Ala Asn Thr Glu Tyr Val Lys Arg Thr Asn
485 490 495
Asn Asp Asp Asn Tyr Asn Pro Ile Gln Thr Lys Ala Gly Ser Thr Met
500 505 510
Trp Tyr Arg Phe Tyr Asp Leu Asp Gln Asp Val Gly Phe Phe Ser Gly
515 520 525
Arg Ser Ala Ser Asp Asn Pro Ala Gly Asn Gly Lys Gln Tyr Asp Ile
530 535 540
Met Leu Ile Glu Pro Glu Arg Arg Tyr Gly Tyr Glu Trp Gly Gly Asn
545 550 555 560
Tyr Gly Lys Lys Ile Ile Asp Tyr Ala Asn Ser Val Gly Tyr
565 570
<210> 112
<211> 1725
<212> DNA
<213> Microbulbifer degradans
<400> 112
atgaattttc tacgccttat tacactcgca ctcgccgcaa cactattaag tgcctgcggt 60
ggaggtagtt ctggcccagc caaagagata aacgcttcca ccagttcctc tagctctagc 120
agctccacga gttctagcag ctcatccagc tctagtagtt cgtctagctc aagcagctcg 180
tcttccagtt ccagcagctc atcctctagc tctagcagct cttctggtgg tagcgccggc 240
acactcacta ttcaggaaag cgaaaaaggc ttttgcactg ttaacggtga gattgtgaat 300
aatcacgagg ggtatagcgg tacaggtttt gtagacaccg ccaatgccga aggcgccagc 360
attacttgga aggtagatgt cgatggcggc aattatgatg taagtgtgcg attcgcaaac 420
ggttccactg cacgcggcgc aacgcttagc agtaacgaaa taaataccac ctatggtttt 480
gctaccactg gcgactgggc cacatgggca gacgaaaccc acaccgtttc gttagccgca 540
ggcgaaaaca ccatccagct aagtgcactt accgcgggtg gcttacccaa tgtagacgct 600
attactattg caggtgcagg tgtactcgca gcagactgcg ccacagagca tactgggcca 660
atgctttcgc aaacaggcaa ccctatttat accgagttaa ataattacaa gtcctggcta 720
acgggtagcg gcacaacagc agccaaatta gccgcagata aaacaattgc cgacaatatg 780
attacctggc aaatgcctca cggtggtttt tataaatacg gcgtatctaa atacagctcg 840
gcttggaacg gtagcgatgc ccgctctggc tggactgggg ccaacggcgt tgagcttggc 900
acaattgata atgatgcaac cgttagcgaa ttattatttt tagcggacgt atataaacgc 960
agcggtgaaa ctaaatatag agatgccgca agaagcgcgt tggaattttt acttaccatg 1020
caatattcca ctggcggttg gccacaggtt taccctgcgc gcactggcac cagttactcc 1080
aatcacgtta cgtttaacga taacgccatg gctcgtgtac ttattttatt ggataaagcc 1140
gcgcgattag aagcaccact cgatggcgac atttttacca cagaccagca cacgcgtatt 1200
actaccgcaa taaatggcgg catcgatttt attttgaatg cgcaaatagt acagggcgac 1260
gtgaaaaccg tttggtgtgc gcaacacgac ccttatacct acgaggcaaa agcagctcgc 1320
tcttatgagt tggcctctaa aagcggtaaa gaatctgtat tggttgtagc gtttttaatg 1380
acacgcccgc aaagcgaagc catagaaaat gccgtgaaag cagcccttgc ttggtaccgc 1440
aacccaaatg ttcaagtcgc caacaccgag tatgtaaaac gcacaaataa cgatgacaac 1500
tacaacccga tacaaacgaa agcaggtagc actatgtggt accgctttta cgatttagac 1560
caagacgttg gattctttag cggccgctct gcaagtgaca acccagcagg taacggtaag 1620
caatacgaca ttatgcttat tgaacccgag cgcaggtatg gctatgaatg gggtggcaat 1680
tacggcaaaa aaataatcga ttacgctaat tcggtagggt attaa 1725
<210> 113
<211> 463
<212> PRT
<213> Microbulbifer degradans
<400> 113
Met Tyr Lys Ile Ser Arg Arg Thr Thr Leu Lys Gly Leu Gly Leu Thr
1 5 10 15
Cys Leu Ala Gly Cys Thr Thr Ser Leu Pro Thr Leu Glu Gln Asp Pro
20 25 30
Trp Ala Phe Ala Gln Asn Ile Ala Asp Asn Thr Thr Ile Pro Thr Phe
35 40 45
Pro Asn Lys Glu Phe Asn Leu Leu Glu Phe Gly Gly Lys Glu Gly Ser
50 55 60
Asp Asn Thr Leu Ala Phe Lys Lys Ala Ile Ala Ala Cys Ser Lys Ala
65 70 75 80
Gly Gly Gly Lys Val Val Val Pro Ala Gly Arg Phe Glu Thr Gly Ala
85 90 95
Ile His Leu Glu Ser Asn Val Asn Leu His Ile Ser Glu Gly Ala Thr
100 105 110
Ile Ala Phe Phe Thr Asp Pro Lys Tyr Tyr Leu Pro Ala Val Phe Thr
115 120 125
Arg Trp Glu Gly Met Glu Cys Met Gly Tyr Ser Pro Leu Ile Tyr Ala
130 135 140
Tyr Gly Lys Thr Asn Ile Ala Ile Thr Gly Lys Gly Thr Leu Asp Gly
145 150 155 160
Gln Ala Asp Pro Thr His Trp Trp Ala Trp Lys Gly Asn Lys Glu Trp
165 170 175
Gly Val Glu Gly Tyr Pro Ser Gln Lys Glu Ser Arg Asn Gln Leu Phe
180 185 190
Ala Gln Ala Glu Ala Gly Asp Pro Val Arg Glu Arg Val Tyr Ala Asp
195 200 205
Gly His Tyr Leu Arg Pro Ser Phe Val Gln Pro Tyr Lys Cys Glu Asn
210 215 220
Val Leu Ile Glu Asp Ile Thr Ile Ile Asn Ala Pro Phe Trp Leu Leu
225 230 235 240
His Pro Thr Leu Ser Gln Asn Val Thr Val Arg Gly Val His Leu Glu
245 250 255
Ser Leu Gly Pro Asn Ser Asp Gly Cys Asp Pro Glu Ser Cys Lys Asn
260 265 270
Val Val Ile Glu Asn Cys Phe Phe Asn Thr Gly Asp Asp Cys Ile Ala
275 280 285
Ile Lys Ser Gly Arg Asn Asn Asp Gly Arg Arg Leu Ala Thr Pro Thr
290 295 300
Glu Asn Val Ile Ile Arg Asn Cys Lys Met Glu Ala Gly His Gly Gly
305 310 315 320
Val Val Ile Gly Ser Glu Ile Ser Gly Gly Val Arg Asn Val Phe Ala
325 330 335
Glu Asn Asn Val Met Ser Ser Pro Asp Leu Glu Lys Gly Ile Arg Ile
340 345 350
Lys Thr Asn Ser Val Arg Gly Gly Leu Leu Glu Asn Ile Tyr Val Arg
355 360 365
Asn Cys Thr Ile Gly Glu Val Gln Gln Ala Ile Val Ile Asn Phe Gln
370 375 380
Tyr Glu Glu Gly Asp Ala Gly Lys Phe Asp Pro Thr Val Arg Asn Val
385 390 395 400
Glu Ile Arg Asn Leu Val Cys Gln His Ala Leu Gln Val Phe Asn Ile
405 410 415
Arg Gly Phe Glu Arg Ala Pro Ile Gln Asn Phe Arg Ile Ile Asp Ser
420 425 430
Thr Phe Val Arg Gly Asp Asn Pro Gly Val Ile Glu His Thr Thr Gly
435 440 445
Leu Val Ile Asp Asn Val Gln Val Asn Gly Lys Ala Phe Asn Ile
450 455 460
<210> 114
<211> 1392
<212> DNA
<213> Microbulbifer degradans
<400> 114
atgtataaaa tttcacgccg cacaacactc aaaggcttag gcctaacttg cctagccggc 60
tgcaccacca gcctacccac actagagcaa gacccatggg cttttgcaca aaacatagcg 120
gacaacacca ccatccccac attcccaaac aaagaattta atttactcga attcggcggc 180
aaagaaggga gcgacaacac cctcgccttc aaaaaagcga ttgcagcatg cagcaaagca 240
ggtggcggca aggtggtagt acccgcagga cgatttgaga caggcgccat ccacttagag 300
tcgaacgtta accttcatat tagcgaaggc gctaccatcg ccttttttac cgaccccaaa 360
tattacctgc ctgcggtttt cactcgctgg gaaggcatgg agtgcatggg ctactcaccc 420
cttatatacg cctacggcaa aaccaacata gccattaccg gtaaaggcac cctcgacggt 480
caagccgacc caacgcactg gtgggcatgg aaaggcaaca aagaatgggg cgtagagggc 540
tacccaagcc aaaaggaaag ccgcaaccaa ctatttgccc aagcagaagc tggcgacccc 600
gttagagagc gcgtgtatgc agacggccac tacctgcgcc cctcgtttgt gcaaccctac 660
aagtgcgaaa acgtgctgat agaagacata actattatca acgctccctt ctggttgcta 720
caccccaccc tttcacaaaa cgtcactgta cgcggtgttc acctagaaag cctaggcccc 780
aactcggatg gctgcgatcc tgaaagctgt aagaatgtag ttatcgaaaa ctgctttttt 840
aataccggtg acgactgtat cgctattaaa tctggccgca acaacgatgg ccgcaggctt 900
gccacaccta ccgagaacgt gattattcgc aactgtaaaa tggaagcggg tcacggtggc 960
gtagttatag gctcagaaat ttctggcggc gtgcgcaatg tgtttgccga aaataacgta 1020
atgagcagcc ccgatttaga gaaaggcatt cgcattaaaa ccaactctgt gcgcggcgga 1080
ctgctagaga acatctatgt gcgcaactgc accataggcg aagtacaaca agccattgtt 1140
attaacttcc aatacgaaga aggcgatgcg ggtaaatttg accccaccgt gcgcaatgta 1200
gaaatacgca atttggtctg ccagcacgcc ttacaagtgt ttaacatccg cggttttgag 1260
cgcgccccca ttcaaaactt taggataatc gacagcacct ttgtgcgtgg tgacaaccca 1320
ggcgtaattg aacataccac agggttagtt atcgacaacg tccaagtcaa cggcaaagcg 1380
tttaacatct ag 1392
<210> 115
<211> 1084
<212> PRT
<213> Microbulbifer degradans
<400> 115
Met Leu Asp Met Thr Lys Arg Thr Leu Ser Ala Leu Leu Ala Leu Cys
1 5 10 15
Ala Thr Leu Thr Ala Cys Gly Gly Gly Asp Ile Thr Ser Gly Gly Asp
20 25 30
Ala Ile Pro Ala Val Asn Gln Pro Ala Pro Val Gln Glu Pro Glu Pro
35 40 45
Glu Pro Glu Pro Gln Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro
50 55 60
Glu Pro Glu Gly Ala Trp Thr Cys Pro Glu Thr Gly Phe Tyr Phe Cys
65 70 75 80
Asp Asp Phe Glu Asp Gly Thr Phe Asp Asp Lys Trp Asp Asp Leu Ile
85 90 95
Ala Thr Tyr Asp Leu Pro Ser Pro Gly Val Phe Asp Ile Leu Asp Glu
100 105 110
Ala Ser Gly Lys Ser Leu Arg Phe Thr Ala Gly Thr Arg Gly Gly Asp
115 120 125
Leu Ala Asp Gly Glu Leu Ile Val Val Lys Asp Thr Ala Phe Glu Asn
130 135 140
Val Thr Asn Ala Asp Tyr Ser Leu Glu Tyr Arg Ile Arg Pro Arg Asn
145 150 155 160
Asn Gly Asn Thr Gly Asn Lys Tyr Leu His Ala Met Ser Arg Tyr Glu
165 170 175
Gly Pro Lys Glu Tyr Tyr Phe Gly Gly Leu Ser Met Gln Gly Ser Thr
180 185 190
Ala Ser Thr Gln Val Glu Ala Gly Phe Val Leu Pro Glu Asn Thr Thr
195 200 205
Ser Ile Ser Asn Arg Leu Val Gln Ala Lys Tyr Pro Leu Glu Leu Gly
210 215 220
Thr Thr Gly Met Ser Asp Gly Tyr Trp Tyr Glu Val Arg Phe Asp Met
225 230 235 240
Ile Gly Asn Thr Gly Thr Ile Tyr Leu Asp Gly Glu Pro Gln Gly Ser
245 250 255
Phe Thr Asp Ala Asp Gly Leu Tyr Pro Leu Thr Gly Lys Ile Gly Phe
260 265 270
Met Thr Tyr Asn Arg Ser Phe Glu Ile Asp Trp Val Arg Val Gly Asp
275 280 285
Pro Ala Ile Lys Pro Val Gln Leu Ser Leu Asp Tyr Ala Ser Pro Leu
290 295 300
Trp Glu Ala Ala Ala Asp Gln Asp Pro Leu Asn Val Thr Val Thr Ala
305 310 315 320
Ile Gln Ser Asp Gly Val Thr Ala Asp Thr Phe Thr Ala Val Ser Ser
325 330 335
Asp Thr Asn Val Val Thr Thr Ser Ile Ala Asn Asn Val Val Thr Ile
340 345 350
Thr Pro Val Ala Gln Gly Ser Ala Thr Val Thr Phe Thr Ala Gly Ser
355 360 365
Asp Ala Asn Arg Val Lys Thr Ile Asp Val Glu Ile Ala Arg Ala Phe
370 375 380
Val Met Ser Thr Thr Asp Tyr Gly Asp Ile Ala Ser Lys Val Thr Pro
385 390 395 400
Thr Val Gly Met Thr Asp Ala Asn Pro Asp Ala His Leu Ser Ile Thr
405 410 415
Phe Asp Ser Ala Pro Thr Leu Ser Gly Val Gly Ser Ile Arg Ile Tyr
420 425 430
Asn Ala Ala Asp Asp Ser Glu Val Asp Val Ile Arg Leu Thr Asp Glu
435 440 445
Ser Asp Ala Leu Gly Tyr Ala Gly Gln Ala Asn Lys Arg Glu Leu Asn
450 455 460
Thr Thr Pro Val Tyr Leu Asp Gly Asn Thr Leu His Val Ser Pro His
465 470 475 480
Ser Asn Ala Leu Ala Tyr Gly Gln Asp Tyr Tyr Val Ala Ile Gly Asp
485 490 495
Asn Val Leu Thr Gly Ala Thr Leu Asn Thr Ile Ala Phe Asp Gly Leu
500 505 510
Gly Lys Asn Ala Gly Trp Thr Phe Ser Thr Lys Ala Ser Ala Pro Thr
515 520 525
Gly Asn Thr Val Thr Val Asp Asp Asp Ala Ser Ala Asp Phe Ser Thr
530 535 540
Val Gln Gly Ala Leu Asn Tyr Ala Met Ala Asn Thr Thr Asp Asp Ser
545 550 555 560
Ile Thr Ile Asn Ile Ala Asn Gly Asn Tyr Tyr Glu Pro Leu Tyr Leu
565 570 575
Ala Glu Arg Asn Asn Val Thr Leu Lys Gly Glu Ser Arg Asp Gly Val
580 585 590
Val Ile His Tyr Asn Asn His Glu Ala Met Asn Gly Gly Ser Thr Gly
595 600 605
Arg Ala Asn Phe Tyr Val Ala Asn Ser Asp Met Leu Thr Leu Glu Thr
610 615 620
Leu Thr Leu Lys Asn Gly His Gln Arg Thr Gly Gly Gly Asp Gln Ala
625 630 635 640
Glu Thr Ile Tyr Phe Asn Ser Ser Ser Asn Thr Asp Arg Leu Ile Ala
645 650 655
Lys Gly Ala Ala Phe Ile Ser Glu Gln Asp Thr Leu Leu Leu Lys Gly
660 665 670
Tyr Asn Trp Phe Tyr Asn Ser Leu Val Val Gly Asn Val Asp Phe Ile
675 680 685
Trp Gly Tyr Ser Ala Val Thr Leu Phe Glu Glu Thr Glu Ile Arg Ser
690 695 700
Ile Ala Asp Ser Lys Pro Gly Ala Gly Asp Ser Gly Gly Tyr Ile Leu
705 710 715 720
Gln Ala Arg Thr Pro Leu Glu Thr Asp Leu Gly Phe Val Phe Leu Asn
725 730 735
Ser Glu Leu Thr Lys Ala Thr Gly Val Asn Gly Asn Glu Ile Gly Asp
740 745 750
Gly Lys Thr Tyr Leu Ala Arg Ser Gly Gly Ser Thr Gly Tyr Phe Asp
755 760 765
Asn Ile Ser Phe Ile Asn Thr Lys Met Gly Ser His Ile Ala Asp Ile
770 775 780
Gly Phe Ala Tyr Ala Asp Ile Asn Gly Gln Pro Ala Pro Asn Pro Ala
785 790 795 800
Val Ala Thr Ala Asp Ala Gly Trp Arg Glu Phe Gly Ser Met Asp Ser
805 810 815
Ala Gly Thr Ala Leu Asp Val Ser Ala Arg Cys Gly Asp Ser Gly Ser
820 825 830
Cys Ile Gln Leu Thr Gln Ala Gln Val Asp Ala Gln Tyr Cys Asn Arg
835 840 845
Ala Gln Ile Phe Ala Ser Trp Asn Asp Trp Thr Gly Trp Asp Pro Leu
850 855 860
Pro Glu Asp Thr Ser Asp Asp Ala Cys Ala Asp Pro Val Ile Pro Gly
865 870 875 880
Ala Val Thr Trp Thr Gly Ile Ala Met Ser Leu Gly Gly Ser Thr Thr
885 890 895
Ser Val Ser Gly Asn Ile Thr Glu Gln Thr Asp Ser Asn Ile Thr Phe
900 905 910
Thr Ala Asp Gly Gly Lys Phe Glu Ser Ser Lys Leu Ser Thr Tyr Phe
915 920 925
Ala Tyr Gln Glu Leu Thr Gly Asp Phe Val Ile Ser Ala Lys Ala Lys
930 935 940
Thr Ile Gly Leu Leu Arg Glu Asn Gly Ser Tyr Gln Phe Pro Thr Gly
945 950 955 960
Ile Leu Met Cys Val Cys Asp Ala Ala Ala Ala Thr Thr Gly Leu Met
965 970 975
Gly His Ala Ser Leu Asn Asp Ile Thr Val Asp Thr Thr Val Asn Leu
980 985 990
Val Ala Thr Tyr Gly His Ile Gln Thr Thr Ala Gly Ser Trp Asn Lys
995 1000 1005
Thr Gly Thr Thr Asp Val Thr Ala Gly Asp Asn Leu Tyr Ile Gln Leu
1010 1015 1020
Glu Arg Ala Gly Asn Ser Tyr Thr Ala Arg Tyr Ser Thr Asp Gly Gly
1025 1030 1035 1040
Ala Thr Tyr Ser Asn Ile Gly Gly Ser Ser Phe Thr Asp Thr Leu Pro
1045 1050 1055
Asp Thr Leu Lys Val Gly Phe Phe Ala Thr Pro Asn Asn Thr Gly Glu
1060 1065 1070
Gln Thr Phe Val Tyr Glu Asp Ile Gln Ile Thr Gln
1075 1080
<210> 116
<211> 3255
<212> DNA
<213> Microbulbifer degradans
<400> 116
atgctcgata tgacaaaacg aactctatct gcgttgttag ccctgtgcgc aacattaacg 60
gcttgcggtg gtggcgatat aaccagcggc ggcgatgcta taccggcagt aaaccaaccc 120
gccccagtac aagagcctga acctgaacct gaaccacaac cggaacccga acccgaaccc 180
gagcccgagc cagaaccaga gggcgcgtgg acctgcccag aaacaggctt ctacttctgt 240
gacgactttg aagacggcac gtttgatgac aagtgggacg atctcattgc cacatacgac 300
ctaccaagcc ctggtgtatt cgacatatta gacgaagcaa gcggcaaatc tttgcgcttt 360
acagcaggca cccgtggcgg tgacttagca gatggcgaac ttattgttgt aaaagataca 420
gcattcgaaa atgtaaccaa cgcagattac tccttagagt accgtattcg cccgcgcaac 480
aacggcaaca caggcaacaa gtacctgcac gctatgtcgc gctacgaagg ccctaaagaa 540
tattactttg gcggtttaag catgcaaggc tctactgcaa gtacgcaagt agaagcaggt 600
ttcgtattgc cagaaaacac cactagcatt agcaaccgct tggtgcaggc caagtacccg 660
ttagagctag gtacaacagg catgagcgac ggctactggt acgaagtacg cttcgatatg 720
ataggcaata caggcaccat ttacctagat ggcgaaccac aaggcagctt taccgatgcc 780
gatggccttt acccattaac aggtaaaatt ggctttatga cttacaaccg ctcattcgaa 840
attgattggg tgcgagtagg cgacccagct attaagcctg tacaactttc actggattac 900
gccagcccgc tatgggaagc agcggcagac caagacccgc taaacgtaac agttactgcc 960
atacaaagcg atggcgtaac agcagatacc tttaccgcag ttagcagcga taccaatgta 1020
gtaaccacaa gtattgcaaa taacgtagta accattaccc ctgtagctca aggtagtgcc 1080
accgtgacct ttaccgctgg ttcagatgct aatcgcgtta aaacaattga tgtagaaatt 1140
gcacgcgcgt ttgttatgtc tactaccgac tacggcgata tagcttctaa ggtaacacca 1200
actgttggta tgactgacgc caacccagac gcacatttaa gcattacatt cgatagcgca 1260
cctaccctaa gcggtgttgg ctctatacgt atatacaatg cagcagacga tagcgaagta 1320
gatgttattc gccttaccga cgaaagtgat gcattgggtt acgccggcca agccaacaag 1380
cgtgaattaa ataccacacc ggtttacttg gatggcaaca ccctacacgt tagcccacac 1440
agtaacgcac ttgcctacgg ccaagactac tacgttgcca ttggcgataa cgtacttacc 1500
ggcgcaacac taaacaccat tgcgtttgat ggtttaggta aaaacgcggg ttggactttc 1560
tctaccaaag cctctgcccc taccggcaac accgtaactg tagacgacga tgcaagtgca 1620
gatttcagca cagtacaagg tgcgttgaac tatgctatgg caaataccac ggacgattca 1680
atcaccatta acattgctaa cggcaactac tacgagccgc tatatctagc agagcgcaac 1740
aacgtaacgc taaaaggtga aagccgcgac ggcgttgtta ttcattacaa caaccacgaa 1800
gccatgaacg gtggcagcac tggccgcgca aacttctatg ttgccaactc agacatgcta 1860
accctagaaa cgctaaccct taaaaacggt catcagcgca ctggtggtgg cgaccaagca 1920
gaaactatct acttcaatag cagcagcaat accgatcgct taattgccaa aggcgctgct 1980
tttattagtg aacaagatac gctgttactt aaaggctaca actggttcta caactcgctt 2040
gtggtaggta acgtagactt tatttggggc tacagcgcag taaccttgtt tgaagaaaca 2100
gaaattcgat ctattgccga ctctaaacca ggtgcgggcg actcgggtgg ctatattctg 2160
caagcgcgta cgccactaga aacagacctt ggctttgttt tcttaaatag cgaattaaca 2220
aaagctaccg gcgtaaacgg taacgaaatt ggcgatggca aaacctacct tgcgcgcagc 2280
ggcggcagca cgggttactt cgataatatt tcgtttatta acaccaaaat gggtagccat 2340
attgccgaca taggcttcgc ctacgccgac attaacggtc aacctgcccc taacccagcg 2400
gtagctactg ctgacgcagg ctggcgtgaa tttggcagca tggattctgc aggcacggct 2460
ctagatgtat ctgcacgctg cggtgatagc ggcagctgta tccaacttac gcaagcacaa 2520
gtagatgcgc agtactgtaa ccgcgcgcaa atttttgcta gctggaacga ttggacaggc 2580
tgggacccgt tgccagaaga tacctctgac gatgcctgtg ctgaccccgt tatacctggt 2640
gcagtaacgt ggactggcat tgcaatgagc cttgggggtt ctacaacatc tgtttccggc 2700
aacattaccg agcaaacaga cagcaatatt acattcactg cagacggcgg taagtttgaa 2760
tcgagcaaac tttcaactta cttcgcttat caagaattga ctggcgactt tgtaattagc 2820
gccaaagcta aaaccattgg cttactgcgc gaaaacggca gctaccagtt ccctacaggc 2880
atattgatgt gtgtttgcga tgcggcagcg gcaacaactg gcttaatggg ccacgccagc 2940
ctcaatgaca ttacagttga tactactgtg aatttagttg ccacctacgg ccacattcaa 3000
accacagctg gtagctggaa taaaactgga acgactgacg taaccgctgg cgacaacctg 3060
tatatacagc tagagcgcgc aggtaatagt tataccgcac gctactcgac tgatggcggt 3120
gccacctata gcaacattgg tggcagctca tttacagaca cccttccaga cacacttaaa 3180
gtgggtttct tcgctacgcc taacaacacc ggtgagcaaa ctttcgttta cgaagatata 3240
caaatcactc agtaa 3255
<210> 117
<211> 391
<212> PRT
<213> Microbulbifer degradans
<400> 117
Met Gly Met Gly Thr Lys Ile Asn Phe Leu Leu Leu Gly Phe Ile Leu
1 5 10 15
Ser Ala Cys Ser Leu Ser Gly Cys Ala Asp Lys Ile Lys Arg Asn Thr
20 25 30
Pro Leu Thr Glu Thr Ala Leu Pro Ser Lys Lys Ile Leu Tyr Val Gln
35 40 45
Thr Glu Val Cys Asp Pro Pro Ser Gln Leu Thr Gly Ser Cys Tyr Asn
50 55 60
Ser Leu Gln Arg Ala Ile Asp Val Ala His Thr Val Pro Ser Ala Thr
65 70 75 80
His Val Thr Ile Glu Met Ala Ala Gly His Tyr His Glu Arg Ile Val
85 90 95
Leu Ser Arg Gly Asn Ile Asp Ile Val Gly Ala Gly Lys Asn Lys Thr
100 105 110
Tyr Val Gln Tyr Asn Leu Asn Ala Glu Gln Gly Lys Ala Tyr His Arg
115 120 125
Asp Gly Trp Gly Thr Pro Gly Ser Ala Thr Phe Thr Ile Asn Ala Ser
130 135 140
Glu Val Asn Val Ser Asp Leu Thr Ile Glu Asn Thr Phe Asp Phe Leu
145 150 155 160
Arg Asn Asp Ser Lys Asp Lys Thr Asp Pro Ser Lys Val Arg Ala Ser
165 170 175
Gln Gly Val Ala Leu Leu Leu Asp Glu His Ser Asp Lys Val Ala Leu
180 185 190
Tyr Arg Val Gly Leu Tyr Gly Tyr Gln Asp Thr Leu Phe Ala Asn Gly
195 200 205
Lys Arg Ala Phe Ile Tyr Gln Ser Asp Ile Ala Gly Asn Val Asp Phe
210 215 220
Ile Phe Gly Ala Gly Gln Val Val Ile Glu Asn Ser Arg Val Ile Ser
225 230 235 240
Arg Pro Arg Gly Lys Ala Ile Ala Ser Asn Glu Ile Ala Gly Tyr Ile
245 250 255
Thr Ala Pro Ser Thr Asn Ile Thr Asp Ala Phe Gly Leu Val Phe Ile
260 265 270
Asn Ser Arg Leu Glu Arg Glu Gln Gly Val Ala Asp Ala Ser Val Thr
275 280 285
Leu Gly Arg Pro Trp His Pro Thr Thr Asn Phe Ser Asp Gly Arg Tyr
290 295 300
Ala Asp Pro Asn Ala Ile Gly His Ala Leu Phe Phe Asn Cys Phe Met
305 310 315 320
Asp Ala His Ile His Pro Ala Arg Trp Ser Ser Met Lys Gly Thr Ala
325 330 335
Lys Asp Gly Ser Lys Thr Leu Val Phe Thr Pro Glu Gln Ser Arg Phe
340 345 350
Phe Glu Val Gln Ser Phe Gly Pro Ser Gly Asn Asp Glu Val Thr Thr
355 360 365
Ser Tyr His Ser Leu Ser Ala Asp Ser Leu Arg Glu Gln Ala Leu Gly
370 375 380
Asp Trp Asn Val Ser Ile Asn
385 390
<210> 118
<211> 1176
<212> DNA
<213> Microbulbifer degradans
<400> 118
gtgggtatgg gtacaaaaat taattttttg ctattaggtt ttattttgtc tgcatgttca 60
ttaagtggct gcgccgacaa aataaagcgc aatacgcctt taaccgaaac ggcattgccg 120
agtaaaaaaa tactctatgt acaaaccgaa gtatgtgacc cgccttcgca attaacgggt 180
agttgctaca acagcttgca gcgagctatt gacgtggcgc acacagtgcc gtctgctacc 240
catgtaacca ttgaaatggc tgcggggcat taccacgaac gcattgtgct tagccgtggc 300
aatatcgata ttgttggtgc aggtaaaaat aaaacctatg ttcaatacaa cctgaatgcc 360
gagcagggta aagcttatca ccgcgacggt tggggtactc ctggctcagc tacatttacc 420
attaatgcca gtgaagtaaa tgttagcgat ttaactatcg aaaatacttt cgacttttta 480
agaaatgatt caaaagataa aaccgaccct tcaaaagtga gggcatcgca aggcgttgca 540
ttattattgg atgaacacag cgataaggtt gcgctgtatc gagtaggcct atacggatac 600
caagacaccc tatttgcaaa tggaaagcgt gcatttatct accaatcaga tattgcaggc 660
aatgttgatt ttatttttgg cgctggccaa gtggttatag aaaatagtcg tgttatttct 720
aggccgcgcg gcaaagccat tgcttccaat gaaattgccg gctatatcac agcgccatcc 780
accaatatta cggacgcctt tggtctggtt tttattaata gtcgattaga acgtgagcaa 840
ggcgtggcag atgcgtcggt caccttgggt cgcccttggc accctacaac caatttcagc 900
gatggccgat atgccgaccc aaacgcgatt ggccatgcgc tattttttaa ctgctttatg 960
gatgcgcata ttcaccccgc gagatggtct agcatgaaag gcaccgctaa agacggcagt 1020
aaaacgctag tgtttacgcc cgagcaatcg cgtttttttg aagttcagtc ctttggcccc 1080
agcggcaacg atgaagtaac cacctcgtat cattcgttaa gcgccgactc attacgcgaa 1140
caagcgctcg gcgattggaa tgtatcaatt aactaa 1176
<210> 119
<211> 914
<212> PRT
<213> Microbulbifer degradans
<400> 119
Met Ser Ala Leu Thr Arg Pro Lys Phe Gly Ala His Thr Lys Leu Phe
1 5 10 15
His Ala Ile Lys His Ala Leu Thr Pro Val Ile Phe Leu Gly Ala Ala
20 25 30
Ala Phe Pro Leu Ala Ala His Ser Gln Tyr Asn Met Glu Asn Leu Asp
35 40 45
Arg Gly Leu Val Ala Ile Asp Arg Lys Asp Gly Ser Val Leu Val Ser
50 55 60
Trp Arg Trp Leu Gly Gln Glu Pro Asp Asn Thr Ser Phe Asn Val Tyr
65 70 75 80
Arg Asn Gly Thr Leu Leu Thr Ser Ser Pro Leu Thr Asn Lys Thr Asn
85 90 95
Phe Val Asp Thr Ser Gly Asn Pro Asn Ala Asn Tyr Ala Val Glu Ala
100 105 110
Ile Val Asn Gly Ala Ser Gln Ser Leu Ala Thr Thr His Val Trp Ser
115 120 125
Asp Ile Tyr Arg Thr Ile Pro Leu Gln Arg Pro Pro Gly Gly Thr Thr
130 135 140
Pro Asp Gly Val Ala Tyr Thr Tyr Ser Pro Asn Asp Ile Ser Ala Ala
145 150 155 160
Asp Leu Asp Gly Asp Gly Gly Tyr Glu Leu Ile Val Lys Trp Asp Pro
165 170 175
Ser Asn Ala Lys Asp Asn Ser Gln Ser Gly Tyr Thr Gly Asn Val Tyr
180 185 190
Leu Asp Ala Tyr Glu Ile Ser Gly Glu Phe Met Trp Arg Ile Asp Leu
195 200 205
Gly Arg Asn Ile Arg Ala Gly Ala His Tyr Thr Gln Phe Leu Ala Phe
210 215 220
Asp Phe Asp Ser Asp Gly Lys Ala Glu Val Ala Val Lys Thr Ala Asp
225 230 235 240
Ala Thr Lys Asp Ser Gln Gly Val Val Ile Gly Asp Ser Asn Ala Asp
245 250 255
Tyr Arg Asn Ser Ala Gly Tyr Val Leu Ser Gly Pro Glu Tyr Leu Thr
260 265 270
Met Phe Glu Gly Gln Thr Gly Arg Ala Leu Asn Thr Val Asn Tyr Val
275 280 285
Pro Ala Arg Gly Ser Val Ser Ser Trp Gly Asp Asn Tyr Gly Asn Arg
290 295 300
Val Asp Arg Phe Leu Gly Gly Val Ala Tyr Leu Asp Gly Gln Asn Pro
305 310 315 320
Ser Leu Ile Met Ser Arg Gly Tyr Tyr Thr Arg Thr Val Val Ala Ala
325 330 335
Trp Asp Trp Arg Asn Gly Gln Leu Ser Gln Arg Trp Val Phe Asp Ser
340 345 350
Asn Thr Ser Gly Asn Ser Ser Tyr Ala Gly Gln Gly Ala His Ser Leu
355 360 365
Thr Ile Gly Asp Val Asp Ala Asp Gly Lys Gln Glu Ile Val Phe Gly
370 375 380
Ala Met Thr Ile Asp Asp Asn Gly Thr Gly Leu Asn Asn Thr Arg Leu
385 390 395 400
Gly His Gly Asp Ala Leu His Leu Ser Asp Met Asp Pro Ser Asn Pro
405 410 415
Gly Leu Glu Val Phe Met Val His Glu Cys Pro Ser Cys Tyr Gly Glu
420 425 430
His Gly Ile Glu Met His Asp Ala Ala Thr Gly Gln Ile Leu Trp Ser
435 440 445
His Pro Gly Asp Tyr Ile Asp Ile Gly Arg Gly Val Ala Met Asp Ile
450 455 460
Asp Pro Arg Tyr Ala Gly Tyr Glu Ala Trp Ala Ser Arg Gly Gly Leu
465 470 475 480
Tyr Ser Ala Lys Gly Glu Thr Ile Ser Ser Thr Arg Pro Ser Gln Ile
485 490 495
Asn Phe Ala Ala Trp Trp Asp Gly Asp Leu Leu Arg Glu Ile Leu Asp
500 505 510
Asn Asn Tyr Ile Asn Lys Trp Asn Tyr Thr Ala Ser Ser Thr Thr Arg
515 520 525
Leu Leu Ser Ala Gly Asn Tyr Gly Ala Ala Ser Asn Asn Gly Thr Lys
530 535 540
Ala Thr Pro Gly Leu Ser Ala Asp Ile Leu Gly Asp Trp Arg Glu Glu
545 550 555 560
Val Val Trp Arg Asn Ser Asn Asn Gln Glu Leu Met Val Phe Thr Thr
565 570 575
Pro His Glu Ser Glu Tyr Arg Leu Arg Thr Leu Met His Asp Pro Gln
580 585 590
Tyr Arg Thr Ala Ile Ala Trp Gln Asn Val Gly Tyr Asn Gln Pro Pro
595 600 605
His Pro Ser Tyr Phe Leu Gly Ala Gly Met Thr Thr Pro Asn Gln Pro
610 615 620
His Ile Thr Ile Val Gly Glu Gly Thr Val Gln Pro Pro Ala Pro Thr
625 630 635 640
Gly Asp Ala Ile Gln Glu Asn Ala Thr Gly Phe Cys Gly Tyr Glu Gly
645 650 655
Thr Ile Asp Ser Asn His Ser Gly Tyr Thr Gly Ala Gly Phe Thr Asn
660 665 670
Thr Thr Asn Ala Thr Gly Ala Gly Ile Asn Trp Asn Leu His Ala Ser
675 680 685
Thr Ala Gly Thr Tyr Arg Leu Ser Met Arg Tyr Ala Asn Gly Ser Thr
690 695 700
Ala Arg Gly Ala Val Leu Asn Val Glu Thr Thr Gly Asn Ser Tyr Pro
705 710 715 720
Met Ala Phe Ala Pro Thr Ser Thr Trp Thr Asn Trp Gln Glu Glu Tyr
725 730 735
Val Asp Ala His Leu Asn Ala Gly Tyr Asn Ser Ile Arg Leu Glu Ala
740 745 750
Asn Gln Ala Ala Gly Leu Pro Asn Leu Asp Ala Ile Tyr Leu Ala Asp
755 760 765
Gly Leu Thr Ala Ala Ala Cys Gly Gln Thr Ser Ser Ser Ser Ser Ser
770 775 780
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
785 790 795 800
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Gly Thr Thr Ala
805 810 815
Gly Leu Ala Cys Ala Asn Gly Ser Thr Asp Thr Trp Gly Thr Gly Phe
820 825 830
Val Leu Asn Gly Phe Gln Val Glu Asn Glu Gly Gln Gln Ala Thr Asn
835 840 845
Asn Trp Gln Val Thr Leu Gln Phe Asp Gln Pro Val Asn Ile Thr Asn
850 855 860
Ala Trp Gly Val Asn Val Glu Thr Thr Gly Thr Thr Val Val Ala Thr
865 870 875 880
Ser Val Gly Tyr Asn Ser His Leu Asn Pro Gly Gln Ser Ala Ser Phe
885 890 895
Gly Met Gln Gly Thr Ser Ala Thr Ala Val Ser Asn Pro Leu Cys Ser
900 905 910
Ala Gln
<210> 120
<211> 82
<212> PRT
<213> Microbulbifer degradans
<400> 120
Met Ala Cys Trp Pro Ser Phe Ser Thr Trp Lys Pro Phe Asn Thr Asn
1 5 10 15
Pro Val Pro His Val Ser Val Leu Pro Leu Ala Gln Ala Arg Pro Ala
20 25 30
Val Val Pro Pro Asp Glu Leu Leu Leu Glu Leu Leu Leu Glu Leu Leu
35 40 45
Leu Glu Leu Glu Glu Leu Leu Glu Glu Leu Glu Glu Leu Leu Glu Asp
50 55 60
Glu Leu Leu Leu Asp Val Trp Pro His Ala Ala Ala Val Arg Pro Ser
65 70 75 80
Ala Arg
<210> 121
<211> 2745
<212> DNA
<213> Microbulbifer degradans
<400> 121
atgtcggcac tcactcgtcc caaatttggt gcacacacca aactgttcca cgctataaag 60
catgcgttaa cccccgttat atttttaggt gctgctgctt tcccacttgc cgctcacagc 120
caatacaaca tggaaaacct cgaccgcggg ttggtggcca tagatcgcaa agacggcagc 180
gtattagtaa gctggcgctg gttggggcaa gagccagata acaccagctt caacgtatat 240
agaaacggca cactattaac cagctccccc cttaccaata aaacaaactt tgtagatacc 300
agcggcaacc caaacgctaa ttacgcggta gaagccatag taaacggcgc gagccaatca 360
ctagccacca cacatgtatg gagcgatata taccgcacta ttccgctgca aagaccacca 420
ggtggcacca cccccgacgg agtggcatac acctatagcc ccaacgatat aagcgcagcc 480
gatttagacg gcgatggcgg ttacgagcta atcgtaaaat gggacccctc caacgcaaaa 540
gacaactcgc aaagtggcta caccggcaat gtgtatctcg acgcttacga aatatctggc 600
gagtttatgt ggcgcataga cctaggccgc aatattcgag caggcgccca ctacacgcaa 660
tttttagcat tcgattttga tagcgatggc aaagcagaag ttgcagttaa aaccgcagac 720
gccaccaaag acagccaagg agtagtaata ggagacagca atgccgatta ccgcaacagt 780
gcaggctatg ttttatctgg ccccgaatac ctcaccatgt ttgaaggcca aaccggtaga 840
gcgctaaata ccgttaacta cgtacccgcg cgcggcagtg tttctagctg gggcgacaac 900
tacggcaacc gtgttgatag atttttaggt ggcgtagcct acttagatgg ccaaaaccca 960
agtttgatta tgtcgcgcgg gtactacacc cgcaccgtgg tagcagcatg ggactggcgc 1020
aatgggcagc ttagccaacg ctgggtattt gattccaaca ccagtggcaa cagcagctat 1080
gcaggccaag gcgcgcacag ccttaccatt ggcgatgtag atgcagacgg aaaacaagaa 1140
attgtatttg gcgccatgac catagatgac aacggcaccg gcttgaacaa tacccgccta 1200
ggtcacggcg atgcactgca cctatccgac atggacccaa gcaacccagg cttagaagtg 1260
tttatggtgc acgaatgccc ctcttgctat ggcgagcacg gaatagaaat gcacgatgcc 1320
gccaccggcc aaatactatg gagccaccca ggcgactaca tagacatagg ccgaggtgta 1380
gccatggata tagacccacg ttatgccggc tacgaagctt gggcttcgcg cggtggttta 1440
tacagtgcaa aaggcgaaac aatatcgagc acgcgcccgt cgcaaataaa ctttgccgct 1500
tggtgggatg gcgacttact gcgcgaaata ctcgataaca attacataaa caaatggaac 1560
tacaccgcca gctctaccac gcgcttgcta agcgcaggta actacggtgc agcatcgaac 1620
aacggcacca aggctacacc agggctttcg gccgatattc ttggtgactg gcgcgaagaa 1680
gtggtatggc gcaacagcaa caaccaagaa cttatggtgt ttaccacccc acacgaaagt 1740
gaataccgct taagaaccct aatgcacgac ccgcaatacc gcacagccat agcttggcaa 1800
aacgtgggct acaaccaacc acctcacccc tcttactttt tgggtgcggg tatgactacc 1860
cccaaccaac cacacataac catagttggc gaaggcacag tgcagcctcc agcccctaca 1920
ggcgacgcaa tacaagaaaa cgccaccggc ttttgcggct acgaaggcac tatagatagc 1980
aaccactctg gctatacagg cgcgggcttt actaacacta ccaacgcaac aggtgcaggc 2040
attaactgga acctacatgc gtccaccgcc ggtacatacc gcctaagcat gcgttacgcg 2100
aatggcagta cggcacgagg tgctgtgcta aacgtagaaa caaccggtaa tagctacccc 2160
atggcatttg caccaacaag cacatggaca aactggcaag aagaatatgt agacgcccac 2220
ctaaacgccg gctacaacag cattcggctt gaagcaaacc aagcggcagg tttgcccaac 2280
ctcgatgcca tctacctagc cgatggtctt acggcggcag cgtgcggcca aacatctagc 2340
agcagctcgt cctccagtag ctcctctagt tcttccagca gctcttcaag ctctagtagc 2400
agttccagta gtagctcgag cagcagctca tccggcggta caacagcagg cctagcttgc 2460
gccaacggca gtacagatac atggggaacg gggtttgtgt taaacggctt ccaagtagaa 2520
aacgaaggcc agcaagccac caataactgg caagtgacac ttcaattcga ccagcccgta 2580
aacataacca acgcatgggg tgtaaacgta gaaacaacag gcacaaccgt tgtagcaaca 2640
agcgtaggct acaacagcca cctaaacccg gggcaaagtg ctagctttgg aatgcaagga 2700
acatcggcaa cggcggtaag caacccgcta tgcagtgccc agtaa 2745
<210> 122
<211> 789
<212> PRT
<213> Microbulbifer degradans
<400> 122
Met Arg Ser Leu Ala Pro Ile Lys Ile Arg Glu Lys Ile Arg Glu Thr
1 5 10 15
Leu Met Phe Asn Ile Arg Ala Trp Gln Leu Asp Leu Pro Leu Ala Leu
20 25 30
Leu Ala Phe Ser Ser Thr Ser Tyr Ala Ile Asp Asn Gly Thr Tyr Thr
35 40 45
Ile Gln Ser Lys His Ser Gly Lys Val Val Glu Val Ala Ala Gly Ser
50 55 60
Val Asp Asp Ala Ala Asn Val Ala Gln Trp Pro Ser Asn Gly His Pro
65 70 75 80
Thr Gln Gln Trp Ile Ile Thr Gln Ile Ser Gly Asp Asp Tyr Ser Val
85 90 95
Ile Asn Val Asn Ser Gly Lys Ala Met Glu Val Tyr Asp Phe Gly Thr
100 105 110
Thr Asp Gly Gly Asn Ile Val Gln Tyr Pro Tyr Trp Gly Gly Ala Pro
115 120 125
Gln Leu Trp Thr Ile Thr Asp Gln Gly Gly Tyr Tyr Ser Leu Ile Asn
130 135 140
Lys His Ser Gly Lys Ala Leu Asp Leu Leu Asn Trp Asp Thr Thr Asp
145 150 155 160
Gly Ala Asn Ile Gly Gln Trp Ala Trp Trp Gly Gly Asp Ala Gln Leu
165 170 175
Trp Ala Leu Asn Thr Val Gln Pro Ser Thr Val Thr Phe Thr Leu Glu
180 185 190
Glu Asn Gln Ala Gly Phe Cys Ser Val Asp Gly Ser Ile Asp Ser Asn
195 200 205
His Thr Gly Tyr Thr Gly Ser Gly Phe Ala Asn Thr Thr Asn Ala Asn
210 215 220
Gly Gln Gly Val Asn Trp Ser Val Asn Val Ala Thr Ala Gly Thr Tyr
225 230 235 240
Thr Phe Thr Trp Arg Tyr Ala Gly Thr Ser Asn Arg Pro Ala Asn Leu
245 250 255
Leu Ile Asp Gly Ser Thr Gln Val Ser Gly Ile Ala Leu Asn Ser Thr
260 265 270
Gly Ala Trp Ala Thr Trp Ala Asn Ser Ala Glu Ile Ser Val Trp Leu
275 280 285
Asp Thr Gly Val His Ser Leu Arg Leu Gln Ala Thr Thr Ser Ala Gly
290 295 300
Leu Pro Asn Ile Asp Ser Leu Ser Ile Thr Gly Gln Ser Ala Ala Ala
305 310 315 320
Gly Asn Cys Ser Gly Ala Ile Glu Pro Ile Thr Phe Ala Thr Pro Ser
325 330 335
Phe Thr Asn Ile Ala Val His Asp Pro Ser Val Ile Glu Ala Asn His
340 345 350
Gln Tyr Tyr Val Phe Gly Ser His Leu Ser Val Ala Lys Thr Pro Asp
355 360 365
Leu Lys Asn Trp Ser Arg Val Ala Asp Gly Val Thr Thr Asn Asn Pro
370 375 380
Leu Phe Asn Asp Val Thr Ser Glu Leu Ala Glu Ala Leu Ala Trp Ala
385 390 395 400
Glu Thr Thr Thr Leu Trp Ala Pro Asp Val Thr Tyr Val Asn Gly Arg
405 410 415
Tyr Leu Met Tyr Tyr Asn Ala Cys Arg Gly Asp Ser Pro Leu Ser Ala
420 425 430
Met Gly Ile Ala Ser Ser Asn Asn Ile Glu Gly Pro Tyr Thr Asn Asp
435 440 445
Gly Ile Phe Leu Lys Ser Gly Met Trp Gly Gln Thr Ser Glu Asp Gly
450 455 460
Thr Val Tyr Asp Ala Thr Val His Pro Asn Ala Val Asp Pro Val Ile
465 470 475 480
Phe Ser Asp Ala Asn Asn Arg Met Trp Met Thr Tyr Gly Ser Tyr Ser
485 490 495
Gly Gly Ile Phe Ile Met Glu Leu Asn Pro Ser Thr Gly Phe Pro Tyr
500 505 510
Ala Gly Gln Gly Tyr Gly Lys His Leu Met Gly Gly Asn His Ala Arg
515 520 525
Ile Glu Gly Ala Tyr Thr Ile Tyr Ser Pro Glu Thr Gly Tyr Tyr Tyr
530 535 540
Met Tyr Val Ser Tyr Gly Gly Leu Gly Ala Asp Gly Gly Tyr Asn Val
545 550 555 560
Arg Val Ala Arg Ala Thr Ser Pro Asp Gly Pro Tyr Tyr Asp Ala Asn
565 570 575
Gly Thr Asn Met Ala Asn Val Lys Ser Asn Pro Ser Leu Pro Leu Phe
580 585 590
Asp Asp Ala Ser Ile Ala Pro His Gly Val Lys Leu Met Gly Asn His
595 600 605
Val Phe Ser Gly Thr Asn Asn Val Leu Gly Tyr Val Ser Pro Gly His
610 615 620
Asn Ser Ala Tyr Arg Asp Ala Thr Thr Gly Gln Thr Phe Leu Leu Phe
625 630 635 640
His Thr Arg Phe Pro Gly Arg Gly Glu Glu His Glu Val Arg Val His
645 650 655
Glu Val Phe Tyr Asn Asp Ala Gly Trp Pro Val Ile Ala Pro Leu Arg
660 665 670
Tyr Ala Gln Lys Val Asp Ala Asn Asn Pro Asn Arg Ser Ala Ser Glu
675 680 685
Leu Asn Ala Val Tyr Ala Ser Glu Leu Pro Gly Ser Tyr Gln Leu Ile
690 695 700
Asn His Gly Lys Asp Ile Ser Ala Thr Ile Lys Asn Ser Val Asn Ile
705 710 715 720
Thr Leu Asn Ser Asn Gly Ser Ile Ser Gly Glu Leu Ser Gly Ser Trp
725 730 735
Thr Tyr Asn Ala Asn Thr Arg Asn Thr Val Ile Thr Val Ala Gly Val
740 745 750
Ala Tyr Arg Gly Val Val Ser Arg Gln Trp Asn Gln Ala Arg Asn Arg
755 760 765
Phe Glu Val Thr Phe Ser Ala Leu Ser Ala Asp Gly Thr Ala Ile Trp
770 775 780
Gly Val Asn Ser Asp
785
<210> 123
<211> 2370
<212> DNA
<213> Microbulbifer degradans
<400> 123
gtgcgcagtc tagcccccat aaaaataaga gagaaaataa gagagacact catgtttaat 60
atacgcgctt ggcaacttga cttgcccttg gcgttattgg cgttttcgtc tacaagttac 120
gctatcgata acggcactta cacaattcaa tctaaacaca gtggcaaggt tgtagaagtg 180
gccgcaggca gtgtagatga tgctgcaaat gtggcccaat ggcccagtaa tggccatcct 240
acccagcagt ggataattac ccaaattagc ggcgatgatt actcggtaat aaatgtaaat 300
agcggcaaag ctatggaggt atacgacttc ggcaccacgg acggcggcaa catagtgcaa 360
tacccctact ggggaggcgc cccccagttg tggacaatta cagatcaagg cggttattac 420
agcttaataa acaaacacag tggtaaagct ttagatttgt taaattggga taccacagac 480
ggcgccaata taggccagtg ggcttggtgg ggcggcgatg cgcaactgtg ggcactgaac 540
acagtgcaac ccagcacagt caccttcaca cttgaagaaa accaagcggg tttttgcagc 600
gtagatggca gcatagatag caatcatacg ggctataccg gcagcgggtt tgccaataca 660
accaatgcaa atggccaagg agttaattgg tcggtaaatg tagccacagc tggtacctat 720
acgtttacat ggcgctatgc gggcactagc aaccgcccag ccaacttgct aatagatggc 780
agcacacagg tttcgggcat tgctttaaat tcaaccggcg catgggcaac ttgggctaat 840
agtgcagaaa taagtgtttg gcttgacaca ggtgtgcact cacttcggct gcaagccaca 900
accagcgcgg gcttacctaa tatagattcg cttagcataa ccggccaaag cgcagcagcg 960
ggcaactgca gcggcgcaat tgaacctatc acttttgcca caccaagctt taccaatata 1020
gcggtgcacg acccctcggt aatagaagca aaccatcaat actacgtatt tggctcccac 1080
ctttctgtgg ctaaaacgcc cgacctaaaa aactggtcgc gcgtggcgga tggtgtaacc 1140
accaataacc cattgtttaa cgatgtaacc agcgaacttg cagaagcatt agcttgggca 1200
gaaaccacta ccctgtgggc gccagatgtt acctatgtaa atggtcggta tttgatgtat 1260
tacaacgcct gccgtggcga ctcaccactg tcggctatgg gtattgcttc ttcgaacaac 1320
atagaaggcc cttacactaa cgatggtata ttccttaaat ctggaatgtg gggccaaacc 1380
agcgaagatg gcactgtgta cgacgcaacc gtgcacccca atgctgtgga ccccgttatt 1440
tttagcgacg caaataatcg catgtggatg acctacggtt cgtattcggg tggtattttt 1500
attatggagt taaacccatc cacggggttc ccttacgcgg ggcaaggtta tggtaaacat 1560
ttaatgggtg gcaaccacgc gcgcattgaa ggcgcttaca ccatctacag cccagaaacg 1620
ggctactact atatgtatgt aagctacggt ggcctaggcg cagatggcgg ctataacgtt 1680
cgtgtggccc gagcaactag cccagacggc ccctactatg atgccaacgg caccaatatg 1740
gccaacgtaa aaagcaaccc aagcttgcca ctgttcgacg acgccagcat agcaccccac 1800
ggtgtaaaac ttatgggtaa ccacgtgttt agcggcacta acaatgtact tggttacgta 1860
tcaccagggc acaactctgc ataccgtgac gccactaccg gccaaacatt tttactattc 1920
cacacacgct tccctgggcg cggcgaagag catgaagtgc gagtgcatga agtgttctac 1980
aacgatgcag gctggccggt aatagcacca ttgcgctatg cccaaaaagt agatgccaac 2040
aacccaaata gaagtgcgag cgagctaaat gcagtgtacg caagcgaact gccaggtagc 2100
tatcagctaa ttaaccacgg caaagacata agcgcgacaa ttaaaaattc cgttaacatc 2160
acgctaaaca gcaacggcag tatctctggc gagctatctg gcagctggac atacaacgcc 2220
aacacccgca ataccgtaat caccgttgcc ggtgtggcct accgcggcgt ggtatctcgc 2280
caatggaacc aagcacgcaa ccgcttcgaa gtcaccttca gcgccctatc tgcagacggc 2340
acagcaatat ggggggtgaa cagcgactaa 2370
<210> 124
<211> 362
<212> PRT
<213> Microbulbifer degradans
<400> 124
Met Ser Ser Phe Ile Met Asp Lys Ser Gln Leu Gln Ser Gly Phe Ala
1 5 10 15
Phe Lys Thr Ser Gly Phe Asn Val Leu Ile Val Val Thr Phe Leu Ala
20 25 30
Leu Leu Ala Ala Leu Val Gly Cys Ser Ser Ala Lys Leu Ala Pro Val
35 40 45
Ala Ser Pro Ser Leu Pro Gln Pro Leu Val Ala Gln Arg Ala Asp Pro
50 55 60
Trp Val His Lys His Ser Asp Gly Tyr Tyr Tyr Phe Ile Ala Thr Val
65 70 75 80
Pro Ala Tyr Asp Arg Leu Glu Met Arg Arg Ala Thr Thr Ile Ala Gly
85 90 95
Leu Arg Ser Ala Pro Ala Val Val Val Trp Gln Arg Asn Thr Ile Gly
100 105 110
Gly Met Ser Ala Asn Ile Trp Ala Pro Glu Leu His Phe Ile Asp Gly
115 120 125
Lys Trp Tyr Ile Tyr Val Ala Ala Ala Thr Asp His Asn Lys Pro Trp
130 135 140
Thr Ile Arg Met His Thr Leu Ser Asn Ala Ser Ala Asn Pro Met Gln
145 150 155 160
Gly Glu Trp Gln Glu Glu Gly Arg Phe His Thr Pro Leu Asp Thr Phe
165 170 175
Ser Leu Asp Ala Thr Thr Phe Glu His Arg Gly Lys Arg Tyr Leu Val
180 185 190
Trp Ala Gln Gln Asn Glu Ala Arg Thr Tyr Asn Ser Ala Leu Leu Ile
195 200 205
Ala Gln Met Asp Ser Pro Thr Ser Ile Thr Gly Pro Ile Val Thr Leu
210 215 220
Ser Glu Pro Thr Leu Pro Trp Glu Ile Ile Gly His Lys Val Asn Glu
225 230 235 240
Gly Ala Ala Val Ile Lys His Gly Lys Arg Ile Phe Ile Ser Tyr Ser
245 250 255
Ala Ser Ala Thr Asp His Asn Tyr Ala Met Gly Leu Leu Trp Ala Asp
260 265 270
Glu Asn Ala Asp Leu Leu Asp Ala Ala Ser Trp Thr Lys Ser Pro Glu
275 280 285
Pro Val Phe Tyr Ser Asn Glu Gln Leu Lys Arg Phe Gly Pro Gly His
290 295 300
Asn Cys Phe Val Lys Ala Glu Asp Gly Val Thr Asp Leu Met Val Tyr
305 310 315 320
His Ala Arg Asp Tyr Lys Glu Ile Asp Gly Glu Pro Leu Arg Asp Pro
325 330 335
Asn Arg His Thr Arg Val Arg Lys Val Tyr Trp Asp Glu Gln Gly Met
340 345 350
Pro Asp Phe Arg Gln His Glu Ala Asp Leu
355 360
<210> 125
<211> 1089
<212> DNA
<213> Microbulbifer degradans
<400> 125
atgagtagct tcataatgga taaaagccag ctacaaagtg gttttgcgtt taaaacaagt 60
ggttttaacg tgctgattgt tgtcacattt ttggcattgc ttgcagcgct tgttggctgc 120
agcagcgcca agctcgcacc cgtcgcctct cctagtttac cgcagccatt agtggcccaa 180
cgggcagacc cttgggtgca caagcacagc gatggttatt actactttat agcaacggta 240
ccagcatacg accgcttaga aatgcgtagg gccacaacca tagcaggctt acgtagcgcg 300
cccgctgtag tggtatggca gcgcaatact attggaggta tgagcgcgaa tatttgggcg 360
cccgagctgc attttattga tggtaaatgg tacatctatg tagcggctgc caccgatcac 420
aacaagccgt ggacaattcg tatgcacacg ctttccaatg catcggccaa ccctatgcaa 480
ggtgagtggc aagaagaggg gcgctttcat acaccgctag atactttctc gctagatgcc 540
acaacctttg agcacagggg taaacgctat ttagtatggg cgcaacagaa tgaagcccgt 600
acttataact cggcgttact tatagcgcaa atggatagcc ctacaagtat tactggcccc 660
attgttacct taagtgaacc gacattaccg tgggaaatta ttggccataa ggttaatgag 720
ggtgcggcag taattaaaca cggtaagcgt atttttataa gttattccgc cagtgcgacc 780
gatcataact atgcgatggg tttgttatgg gcagacgaaa acgctgattt gctcgacgca 840
gcaagctgga ccaagtcacc cgagcctgta ttttactcaa acgaacaatt aaagcgcttt 900
ggccctggcc ataattgttt tgttaaagct gaagatggtg ttaccgattt aatggtgtac 960
cacgcgcgtg attataaaga gatagatggt gagccattgc gagacccaaa ccgccacacg 1020
cgggtgcgca aagtgtattg ggatgaacag ggcatgccgg attttcgtca acatgaagca 1080
gacctatag 1089
<210> 126
<211> 314
<212> PRT
<213> Microbulbifer degradans
<400> 126
Met Lys Lys Leu Ser Pro Leu Ile Glu Gln Arg Ala Asp Pro Tyr Ile
1 5 10 15
Tyr Lys His Thr Asp Gly Tyr Tyr Tyr Phe Thr Ala Ser Val Pro Ala
20 25 30
Tyr Asp Gly Ile Glu Leu Arg Arg Ala Lys Thr Ile Gln Ala Leu Ala
35 40 45
Thr Ala Glu Thr Val Met Val Trp Arg Lys Pro Ser Glu Gly Asp Tyr
50 55 60
Ser Glu Leu Ile Trp Ala Pro Glu Ile His Phe Asn Met Gly Ala Trp
65 70 75 80
Tyr Val Tyr Phe Ala Ala Ala Pro Ser Arg Glu Ile Lys Phe Asp Leu
85 90 95
Phe Gln His Arg Met Tyr Ala Ile Ser Cys Ser Asp Ala Asn Pro Leu
100 105 110
Thr Gly Glu Trp Ile Phe Glu Gly Lys Ile Asp Ser Gly Ile Asp Ala
115 120 125
Phe Cys Leu Asp Ala Thr Thr Phe Thr His Ser Asn Glu Leu Tyr Tyr
130 135 140
Val Trp Ala Gln Lys Glu Leu Asp Val Arg Gly Asn Ser Asn Leu Met
145 150 155 160
Ile Ala Lys Met Glu Thr Pro Thr Lys Leu Ala Thr Lys Pro Val Arg
165 170 175
Leu Ser Lys Pro Glu Tyr Asp Trp Glu Ile Gln Gly Phe Trp Val Asn
180 185 190
Glu Gly Pro Ser Ile Val Lys His Gly Ser Arg Ile Phe Ile Ser Tyr
195 200 205
Ser Gly Ser Ala Thr Asp Glu Arg Tyr Ala Met Gly Ile Leu Trp Ala
210 215 220
Glu Gln Ser Ala Asp Leu Leu Asp Pro Ala Ser Trp Thr Lys Ser Val
225 230 235 240
Glu Pro Val Leu Val Ser Glu Pro Ser Glu Lys Val Phe Gly Pro Gly
245 250 255
His Asn Ser Phe Thr Val Asp Glu Glu Gly Asn Asp Met Leu Val Tyr
260 265 270
His Ala Arg Asn Tyr Thr Glu Ile Glu Gly Asp Pro Leu Trp Asp Pro
275 280 285
Asn Arg His Thr Tyr Val Lys Lys Leu Arg Trp Asp Glu Thr Gly Met
290 295 300
Pro Ile Phe Gly Ser Pro Ala Phe Glu Glu
305 310
<210> 127
<211> 945
<212> DNA
<213> Microbulbifer degradans
<400> 127
gtgaaaaaat tatcgcctct tatagagcaa cgtgcagacc cttatattta taaacacacc 60
gacggctact attattttac ggcttcggtg cccgcctatg atggcataga actgcgtcgc 120
gcaaaaacta tacaagcgtt agccaccgca gaaaccgtta tggtgtggcg caagccaagc 180
gaaggtgatt atagtgagct tatttgggcg ccagaaatac actttaatat gggggcttgg 240
tatgtatatt ttgctgcggc tccatcacgt gaaattaagt tcgatttatt tcaacaccgc 300
atgtatgcca ttagctgtag cgatgccaac ccgctaacag gtgaatggat atttgaaggt 360
aaaatagata gcggcataga tgcattctgt ttagatgcca ccacctttac tcacagcaat 420
gagctctact atgtttgggc gcaaaaagaa ttagatgttc gcggcaactc taatttgatg 480
atcgcaaaaa tggaaacgcc caccaagctt gccaccaagc ccgtgcgttt atctaaaccc 540
gaatacgact gggagattca gggtttttgg gttaacgaag gcccatccat tgttaagcac 600
ggctcacgta tttttatttc ttattctggc tctgctaccg atgagcgcta cgcaatgggt 660
attttgtggg cagaacaaag cgcagactta ctagacccag caagttggac caagtcggta 720
gagcctgtat tagtttctga accctctgaa aaagtatttg gcccaggcca caatagtttt 780
actgtggatg aagagggtaa cgatatgttg gtgtatcatg ctcgcaatta taccgaaatt 840
gaaggcgacc cgctgtggga cccaaatcgt catacttacg ttaaaaaatt gcgctgggat 900
gaaacaggca tgcctatttt tggcagccct gcgtttgaag agtag 945
<210> 128
<211> 350
<212> PRT
<213> Microbulbifer degradans
<400> 128
Asn Gly Tyr Tyr Ala Val Leu Asn Lys His Ser Gly Lys Ala Leu Asp
1 5 10 15
Leu Tyr Gly Phe Asp Thr Ser Asn Gly Ala Asn Ile Ala Gln Trp Ala
20 25 30
Phe Trp Gly Gly Asp Pro Gln Gln Trp Gln Phe Thr Lys Ile Ala Asn
35 40 45
Val Gly Ala Pro Pro Val Asp Thr Ser Thr Thr Asn Gly Ala Thr Asn
50 55 60
His Trp Ser Leu Thr Gly Asn Leu Val Thr His Asp Pro Thr Met Ala
65 70 75 80
Tyr Glu Asn Gly Ser Trp Trp Leu Tyr Gln Thr Gly Glu Gly Ile Tyr
85 90 95
Gly Lys Tyr Ser Ala Asn Gly Leu Ala Trp Asp Gly Leu Pro Ser Val
100 105 110
Phe Pro Asn Gly Leu Ser Trp Trp Lys Thr Tyr Val Pro Gly Gln Ser
115 120 125
Asn Asn Asp Val Trp Ala Pro Asp Val Arg Thr Tyr Asn Gly Arg Val
130 135 140
Tyr Leu Tyr Tyr Ser Ile Ser Thr Phe Gly Ser Arg Val Ser Ala Ile
145 150 155 160
Gly Leu Ala Ser Ala Ser Ser Leu Ala Ala Ser Asp Trp Gln Asp His
165 170 175
Gly Leu Val Ile Asn Thr Thr Ser Ser Ser Asp Trp Asn Ala Ile Asp
180 185 190
Pro Asp Leu Val Val Asp Glu His Gly Asn Pro Trp Leu Thr Met Gly
195 200 205
Ser Trp Asn Ser Gly Ile Lys Val Met Arg Leu Asn Pro Ile Thr Met
210 215 220
Lys Pro Ile Gly Thr Leu Tyr Ser Ile Ala Gln Lys Gly Gly Gly Ile
225 230 235 240
Glu Ala Pro Ser Ile Val Tyr Arg Arg Gly Tyr Tyr Tyr Leu Phe Val
245 250 255
Ser Ile Gly Lys Cys Cys Ala Gly Val Asp Ser Thr Tyr Gln Ile Ala
260 265 270
Tyr Gly Arg Ser Thr Ser Ile Thr Gly Pro Tyr Leu Asp Lys Asn Gly
275 280 285
Asn Asp Met Met Ser Gly Gly Gly Ser Ile Leu Asp Ala Gly Asn Asn
290 295 300
Val Trp Val Gly Pro Gly Gly Gln Asp Ile Ile Asn Thr Asp Val Ile
305 310 315 320
Val Arg His Ala Tyr Asp Ala Thr Asp Ala Gly Thr Pro Lys Met Ile
325 330 335
Ile Ser Thr Leu Asn Trp Asp Ala Asn Gly Trp Pro Lys Tyr
340 345 350
<210> 129
<211> 1053
<212> DNA
<213> Microbulbifer degradans
<400> 129
aatggctatt atgccgtgct aaataaacac agcggcaaag cgttagattt gtatggtttt 60
gatacgtcta acggcgcgaa tattgcgcaa tgggcctttt ggggcgggga cccgcagcag 120
tggcaattta ccaaaatcgc caatgtaggt gcgccgccag tagatacatc taccaccaac 180
ggtgcaacca accactggtc cttaaccggt aatctagtga ctcacgaccc cacaatggcc 240
tacgaaaacg gctcatggtg gttgtatcaa accggcgagg gaatttacgg taagtattca 300
gccaatggtt tggcgtggga tggcttacct tctgtgtttc ccaatggttt aagttggtgg 360
aagacctatg tacccggcca gtcgaacaac gatgtatggg cgcctgatgt acgcacttat 420
aatgggcggg tttatttgta ctattccatc tctacttttg gctcgcgtgt atctgccatt 480
ggtttggcgt cggcatcgag tttggctgcg agtgattggc aggaccacgg cttagtaatt 540
aataccacct catctagcga ttggaatgcg atcgacccag atttagtggt cgatgagcat 600
ggcaaccctt ggttaacaat gggaagttgg aacagcggta ttaaagtgat gcgcttgaac 660
cccattacca tgaagccaat tggcacactt tattctattg cgcaaaaggg cggcggtatt 720
gaagcgcctt ctattgtgta tcgccgtggg tattactatt tatttgtttc tatcggcaaa 780
tgctgtgcgg gcgtagatag cacctatcaa attgcttacg ggcgctctac aagtattacc 840
ggcccttatt tggataagaa cggcaacgat atgatgagtg gtggtggcag tattttagat 900
gcgggcaaca acgtgtgggt tggccctggt gggcaagata ttattaacac cgatgtcatt 960
gtgcgccacg cgtacgatgc cacagatgca ggcacaccta agatgattat tagtaccttg 1020
aattgggatg ctaatggatg gccgaaatac tag 1053
<210> 130
<211> 346
<212> PRT
<213> Microbulbifer degradans
<400> 130
Met Leu Asn Lys Asn Lys Arg Pro Ile Thr Phe Ala Leu Val Val Ser
1 5 10 15
Leu Leu Ala Leu Leu Ala Leu Ala Gly Cys Ser Glu Ala Lys Gln Val
20 25 30
Ser Ile His Asp Pro Val Met Ile Lys Glu Gly Asp Thr Tyr Tyr Leu
35 40 45
Phe Ser Thr Gly Pro Gly Ile Thr Met Tyr Ser Ser Ser Asp Met Lys
50 55 60
Asn Trp Arg Arg Glu Gly Glu Val Phe Asn Gln Ala Pro Ser Trp Ala
65 70 75 80
Ser Asn Ala Val Pro Tyr Phe Lys Gly His Leu Trp Ala Pro Asp Ile
85 90 95
Ile Glu Lys Asp Gly Leu Phe Tyr Leu Tyr Tyr Ser Val Ser Ala Phe
100 105 110
Gly Lys Asn Thr Ser Gly Ile Gly Val Thr Val Ser Pro Thr Leu Asn
115 120 125
Pro Arg Ala Pro Asn Tyr Gly Trp Gln Asp Lys Gly Met Val Leu Arg
130 135 140
Ser Val Pro Glu Arg Asp Glu Trp Asn Ala Ile Asp Pro Asn Ile Val
145 150 155 160
Val Asp Asn Asn Gly Thr Ala Trp Met Ala Phe Gly Ser Phe Trp Gln
165 170 175
Ser Leu Lys Met Val Ala Leu Asp Ser Ser Trp Thr Lys Ile Ala Glu
180 185 190
Pro Gln Gln Trp His Thr Ile Ala Ala Leu Pro Lys Gly Ser Met Pro
195 200 205
Thr Gly Asp Ala Val Lys Asp Gly Glu Ile Glu Ala Pro Phe Ile Phe
210 215 220
Lys Lys Asn Asp Asp Tyr Phe Leu Phe Val Ser Trp Gly Lys Cys Cys
225 230 235 240
Arg Lys Asp Glu Ser Thr Tyr Arg Leu Ala Met Gly Arg Ser Lys Asn
245 250 255
Thr Thr Gly Pro Phe Leu Asp Lys Asn Gly Lys Asp Leu Ala Gln Gly
260 265 270
Gly Gly Thr Leu Leu Ile Ser Gly Asn Lys Asn Trp Pro Gly Leu Gly
275 280 285
His Asn Ser Ala Tyr Thr Phe Asp Gly Lys Asp Trp Leu Val Leu His
290 295 300
Ala Tyr Glu Ser Ala Asp Asn Gly Leu Gln Lys Leu Lys Ile Leu Glu
305 310 315 320
Ile Asn Trp Asp Lys Asp Gly Trp Pro Thr Val Asp Thr Lys Glu Leu
325 330 335
Asp Glu Phe Val Ser Ile Glu Leu Thr Gln
340 345
<210> 131
<211> 1041
<212> DNA
<213> Microbulbifer degradans
<400> 131
atgcttaaca aaaacaaacg cccaattaca ttcgctttag tcgttagcct cttagccctg 60
cttgcccttg caggctgcag cgaggcaaaa caagtaagca tccacgaccc agtaatgatt 120
aaagaaggtg acacctacta cttgtttagc actggccccg gcataacaat gtatagctct 180
agcgatatga aaaactggcg ccgcgaaggc gaagtattta atcaagcccc tagttgggcc 240
tccaacgccg taccctattt taaaggccac ctgtgggcac ccgacatcat tgaaaaagat 300
ggtctgtttt acctctacta ttctgtgtct gcttttggaa agaacacatc cggcattggc 360
gttaccgtat cgcccacgct taacccacgc gcgcccaatt acggttggca agataaaggc 420
atggtattgc gcagcgtgcc tgagcgcgac gagtggaacg ctatcgaccc caatattgtg 480
gtagataaca acggcaccgc atggatggct tttggctcct tttggcaaag cttaaaaatg 540
gtggcactag acagcagctg gacaaaaata gctgagcctc aacagtggca taccatagca 600
gccttaccca aaggcagtat gcccacaggc gacgcagtaa aggacggcga aatagaagct 660
ccttttattt ttaaaaagaa cgacgattac tttttgtttg taagttgggg taaatgctgc 720
cgcaaagatg aaagcaccta ccgcctagca atgggccgca gcaaaaatac taccggtcca 780
ttcttagata aaaacggcaa agacctcgcc caaggtggtg gcaccctatt aataagtggc 840
aacaaaaact ggcccggctt aggccacaac agcgcctaca ccttcgacgg caaagattgg 900
cttgtgctac acgcctatga atctgcagat aacggtttac aaaaactaaa aatattagaa 960
ataaactggg ataaagacgg ctggccaact gtagatacca aagaactgga tgagtttgtt 1020
agtattgaat taactcaata a 1041
<210> 132
<211> 665
<212> PRT
<213> Microbulbifer degradans
<400> 132
Met Asp Asn Ile Met Lys Met Ile Lys Leu Ala Leu Ala Val Thr
1 5 10 15
Leu Ala Val Trp Val Ala Gly Cys Thr Asn Gln Ala Gly Leu Asn Ala
20 25 30
Glu Asn Lys Asn Ile Glu Arg Gln Thr Ile Asn Ser Pro Asp Lys Ser
35 40 45
Leu Lys Val Arg Leu Ser Leu Asp Glu Ser Gly Lys Val Phe Tyr Ser
50 55 60
Ile Ser Arg Asn Gly Glu Gln Val Met Leu Pro Ser Gln Leu Gly Val
65 70 75 80
Glu Leu Asn Ser Gln Ala Phe Thr Asp Gly Leu Thr Ile Thr Asp Val
85 90 95
Asp Ala Gly Lys Val Asn Asp Ser Tyr Thr Leu Leu His Gly Lys Gln
100 105 110
Arg Asp Ile Thr Tyr Asn Ala Asn Glu Lys Ile Tyr Ser Leu Lys Asn
115 120 125
Lys Gln Gly Asp Lys Leu Ile Ile Ala Phe Arg Val Ser Asn Asp Gly
130 135 140
Val Ala Phe Gln Tyr Arg Phe Pro Asn Thr Ala Lys Gln Leu Leu Ala
145 150 155 160
Val Lys Lys Glu Ile Thr Ser Phe Ala Phe Glu His Thr Thr Lys Ala
165 170 175
Trp Leu Gln Pro Ile Ala Val Ala Gln Thr Gly Trp Ala Asn Thr Asn
180 185 190
Pro Ser Tyr Glu Glu His Tyr Gln Met Asn Ile Pro Val Asp Thr Val
195 200 205
Ser Pro Ser Pro Ala Gly Trp Val Phe Pro Ala Leu Phe Lys Ala Asn
210 215 220
Lys His Trp Leu Leu Ile Thr Glu Ala Gly Met Asn Gly Asp Tyr His
225 230 235 240
Ala Ser Arg Leu His Ala Glu Ser Pro Asn Gly Glu Tyr Ser Leu Gly
245 250 255
Ile Pro Met Ala Ala Glu Val Phe Glu Gln Asp Gly Asn Lys Gly Ala
260 265 270
Leu Leu Ala Gln Ser Asn Thr Ala Phe His Ser Pro Trp Arg Val Ile
275 280 285
Leu Val Gly Gly Leu Asp Thr Ile Ile Ala Ser Thr Leu Gly Thr Asp
290 295 300
Leu Ala Asp Pro Ala Ile Ala Lys Met Asp Phe Val Lys Pro Gly Thr
305 310 315 320
Ala Ser Trp Ser Trp Ala Leu Leu Lys Asp Glu Ser Val Asn Tyr Glu
325 330 335
Thr Ser Leu Glu Phe Ile Asp Tyr Ala Ala Glu Met Gly Trp Asp Tyr
340 345 350
Thr Leu Val Asp Ala Asp Trp Asp Arg Arg Ile Gly Tyr Glu Arg Thr
355 360 365
Ala Gln Leu Ala Ala Tyr Ala Gln Ser Lys Asn Val Gly Leu Leu Val
370 375 380
Trp Tyr Asn Ser Ser Gly Asp Trp Asn Thr Thr Glu Tyr Ser Pro Lys
385 390 395 400
Ser Ala Leu Leu Asp Arg Asp Lys Arg Arg Ala Glu Phe Ala Arg Leu
405 410 415
Gln Asn Met Gly Val Lys Gly Val Lys Ile Asp Phe Phe Pro Gly Asp
420 425 430
Gly Lys Ser Val Met Ala Tyr Tyr Asn Asp Leu Ala Lys Asp Ala Ala
435 440 445
Asp Tyr Asn Leu Leu Val Asn Tyr His Gly Ser Ser Leu Pro Arg Gly
450 455 460
Leu His Arg Thr Tyr Pro Asn Ile Met Thr Met Glu Ser Val His Gly
465 470 475 480
Phe Glu Met Ile Thr Phe Met Gln Pro Ser Ala Asp Lys Ala Ala Thr
485 490 495
His Met Ala Ile Leu Pro Phe Thr Arg Asn Ala Phe Asp Pro Met Asp
500 505 510
Phe Thr Pro Thr Thr Phe Ser Asp Ile Pro Asn Ile Glu Arg Arg Thr
515 520 525
Ser Asn Gly Phe Glu Leu Ala Leu Pro Val Leu Phe Leu Ser Gly Leu
530 535 540
Gln His Ile Ala Glu Thr Ala Gln Gly Met Ala Thr Asn Ala Pro Asp
545 550 555 560
Tyr Val Lys Ala Tyr Met Arg Asp Ile Pro Val Leu Trp Asp Glu Ser
565 570 575
Lys Leu Ile Asp Gly Met Pro Gly Glu His Val Val Ile Ala Arg Lys
580 585 590
His Gly Glu Arg Trp Phe Val Ala Gly Ile Asn Ala Thr Asn Glu Ala
595 600 605
Ile Asn Leu Glu Met Asn Phe Asp Phe Ala Leu Gly Lys Gln Gly Thr
610 615 620
Leu Ile Thr Asp Ser Asn Ile Asn Thr Lys Gly Val Glu Ser Phe Thr
625 630 635 640
Ser His Thr Ile Thr Ala Thr Lys Asn Asn Ala Leu Thr Val Lys Ala
645 650 655
Asn Gly Gly Phe Val Ile Val Phe Asn
660 665
<210> 133
<211> 619
<212> PRT
<213> Microbulbifer degradans
<400> 133
Met Ala Ala Gly Gln Ile Ile Ser Leu Glu Val Lys Val Lys Lys Ile
1 5 10 15
Glu Glu Ile Met Lys His Thr Ala Arg Thr Ile Ala Leu Gly Ala Thr
20 25 30
Gly Ala Ala Leu Leu Thr Gly Leu Ile Ala Cys Asn Gly Thr Asn Val
35 40 45
Asn Thr Asn Gly Asp Thr Gln Gln Ala Ser Ile Lys Lys Ala Pro Glu
50 55 60
Gly Met Phe Ala Asn Pro Leu Phe Ala Asn Gly Ala Asp Pro Trp Leu
65 70 75 80
Glu Tyr Tyr Asp Gly Asn Tyr Tyr Leu Thr Thr Thr Thr Trp Thr Ser
85 90 95
Gln Leu Val Met Arg Lys Ser Pro Thr Leu Asp Gly Leu Ser Thr Ala
100 105 110
Leu Pro Val Asn Val Trp Ser Asp Ser Asp Leu Thr Arg Cys Cys Asn
115 120 125
Phe Trp Ala Phe Glu Phe His Arg Leu Asn Gly Pro Asn Gly Trp Arg
130 135 140
Trp Tyr Leu Met Tyr Thr Ser Gly Gln His Gly Thr Leu Asp His Gln
145 150 155 160
His Leu Ser Val Leu Glu Ser Val Gly Asp Asp Pro Met Gly Pro Tyr
165 170 175
Thr Tyr Lys Gly Glu Met Met Pro Asn Thr Trp Asn Ile Asp Gly Ser
180 185 190
Tyr Leu Glu His Asn Gly Gln Leu Tyr Leu Leu Trp Ser Glu Trp Val
195 200 205
Gly Asp Glu Gln Gln Asn Phe Ile Ser Lys Met Thr Thr Pro Trp Ser
210 215 220
Ile Glu Gly Pro Arg Ala Leu Leu Thr Arg Pro Glu Ala Glu Trp Glu
225 230 235 240
Lys Ser Gly Arg Lys Val Asn Glu Gly Pro Glu Ile Leu Lys Lys Asp
245 250 255
Gly Arg Thr Phe Leu Ile Tyr Ser Ala Ser Tyr Cys Asp Thr Pro Asp
260 265 270
Tyr Lys Leu Ala Met Lys Glu Leu Thr Gly Asp Asp Pro Met Asn Ser
275 280 285
Glu His Trp Thr Lys Tyr Asp Lys Pro Val Phe Glu Arg Gly Asn Gly
290 295 300
Val Phe Ala Pro Gly His Asn Gly Phe Phe Lys Ser Pro Asp Gly Thr
305 310 315 320
Glu Asp Trp Ile Val Tyr His Gly Asn Ser Lys Glu Glu His Gly Cys
325 330 335
Gly Ala Thr Arg Ser Val Arg Ala Gln Lys Phe Thr Trp Asn Thr Asp
340 345 350
Gly Thr Pro Asn Phe Gly Glu Pro Ile Pro Glu Gly Gln Phe Leu Pro
355 360 365
Leu Pro Ser Gly Glu Asn Gly Pro Leu Val Thr Ala Leu Gln Gly Ala
370 375 380
Arg Ile Gln Leu Arg Asn Gly Glu Ser Cys Leu Leu Ala Glu Gly Lys
385 390 395 400
Glu Leu Lys Gln Gly Ser Cys Gln Ala Glu Ala Ser Leu Trp Val Met
405 410 415
Asp Asn Thr Ala Asp Asn His Tyr Arg Phe Gly Asn Val Ala Ser Asn
420 425 430
Leu Phe Leu Thr Ala Asp Glu Gly Leu Ser Gln Ser Ala Trp Val Asn
435 440 445
Thr Ala Ser Gln Arg Trp Ala Leu Asn Ala Gly Glu Gly Asn Phe Val
450 455 460
Ala Phe Thr Asn Lys Tyr Thr Gly Asp Ala Leu Met Gln Asn Asn Trp
465 470 475 480
Gln Ile Leu Pro Val Gly Lys Val Ala Ile Ser Ser Ile Gln Ser Gly
485 490 495
Arg Val Leu Gln Ala Cys Asp Lys Asn Ser Ala Asn Val Asn Gln Gly
500 505 510
Gly Trp Gln Gly Arg Ala Cys Gln Ala Trp Gln Phe Asn Pro Ala Ser
515 520 525
Glu Gly His Val Gln Ile Lys Thr Gly Asn Gln Cys Leu Thr Val Glu
530 535 540
Asn Lys Ser Ile Val Pro Gly Thr Asn Val Ile Ala Gly Glu Cys Glu
545 550 555 560
Ser Thr Ser Ser Gln Trp Leu Tyr Gln Leu Asp Lys Glu Gly Arg Ala
565 570 575
Thr Phe Thr Asn Arg Glu Ser Lys Gln Arg Leu Asp Leu Ala Asn Cys
580 585 590
Gly Leu Ala Asp Gly Thr Asn Phe Ala Gln Ala Pro Ala Leu Asp Thr
595 600 605
Ile Cys Gln Ala Phe Gln Val Arg Tyr Leu Pro
610 615
<210> 134
<211> 1860
<212> DNA
<213> Microbulbifer degradans
<400> 134
gtggccgcag gccaaataat ttcactggag gttaaagtga aaaagataga agaaataatg 60
aaacacacag cgcgcactat agcgctaggt gcaacaggtg ccgccttgct aacggggtta 120
attgcctgta acggtaccaa tgtgaataca aacggggata cccaacaagc aagcattaaa 180
aaagcgccag aaggcatgtt tgccaacccg ttgttcgcca atggcgcaga cccttggtta 240
gagtattacg atggcaatta ctacctcact accaccacat ggacatcgca
Claims (23)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/519,104 US8273557B2 (en) | 2004-05-04 | 2006-09-12 | Hydrolytic enzyme mixtures for saccharification of lignocellulosic polysaccharides |
US11/519,104 | 2006-09-12 |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20090088856A (en) | 2009-08-20 |
Family
ID=39184291
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020097007532A KR20090088856A (en) | 2006-09-12 | 2007-09-11 | Enzyme systems for saccharification of plant cell wall polysaccharides |
Country Status (6)
Country | Link |
---|---|
US (2) | US8273557B2 (en) |
EP (1) | EP2059604B1 (en) |
KR (1) | KR20090088856A (en) |
CN (1) | CN101636499A (en) |
AU (1) | AU2007294906A1 (en) |
WO (1) | WO2008033330A2 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101322797B1 (en) * | 2010-12-30 | 2013-10-29 | 고려대학교 산학협력단 | Method for introducing Genes into Saccharophagus degradans and Expression of Genes in Saccharophagus degradans |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8273557B2 (en) | 2004-05-04 | 2012-09-25 | University Of Maryland | Hydrolytic enzyme mixtures for saccharification of lignocellulosic polysaccharides |
US7365180B2 (en) * | 2004-05-04 | 2008-04-29 | University Of Maryland | Plant wall degradative compounds and systems |
US7977076B2 (en) * | 2006-12-29 | 2011-07-12 | Genifuel Corporation | Integrated processes and systems for production of biofuels using algae |
JP2010525815A (en) * | 2007-04-30 | 2010-07-29 | ユニバーシティー オブ メリーランド | Expression of carbohydrase during degradation of whole plant material by Saccharophagus degradans |
EP2192177A1 (en) | 2008-11-28 | 2010-06-02 | Total S.A. | Cellulase Cel5H related reagents and their use in microorganisms |
WO2010099406A2 (en) * | 2009-02-27 | 2010-09-02 | University Of Maryland | Processes for plant polysaccharide conversion |
WO2010118007A2 (en) * | 2009-04-06 | 2010-10-14 | University Of Maryland | Enhanced cellulase expression in s. degradans |
CN109295031B (en) * | 2018-10-23 | 2020-06-19 | 南京工业大学 | Antifungal protein β-1,3-glucanase, engineering bacteria containing antifungal protein β-1,3-glucanase and application of antifungal protein β-1,3-glucanase |
CN110643620B (en) * | 2019-10-22 | 2021-05-28 | 怀化学院 | High-activity poria cocos cellulose endonuclease gene and protein and recombinant vector thereof |
CN111154740B (en) * | 2020-02-07 | 2022-03-04 | 中国农业大学 | Husky microvesicle bacterium beta-galactosidase and coding gene and application thereof |
CN114540328A (en) * | 2022-02-18 | 2022-05-27 | 浙江工业大学 | Temperature-sensitive alpha-amylase and preparation method and application thereof |
CN116103317A (en) * | 2023-02-17 | 2023-05-12 | 大连工业大学 | Gene of carbohydrate binding module, fusion enzyme and application |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5366558A (en) | 1979-03-23 | 1994-11-22 | Brink David L | Method of treating biomass material |
US6333181B1 (en) | 1997-04-07 | 2001-12-25 | University Of Florida Research Foundation, Inc. | Ethanol production from lignocellulose |
US5916780A (en) | 1997-06-09 | 1999-06-29 | Iogen Corporation | Pretreatment process for conversion of cellulose to fuel ethanol |
AU9374298A (en) | 1997-09-12 | 1999-04-05 | Oceanix Biosciences Corporation | Preparation and use of biofilm-degrading, multiple-specificit y, hydrolytic enzyme mixtures |
US20050136426A1 (en) | 2003-06-27 | 2005-06-23 | Michael Howard | Bacterial phytochelatin synthetase |
US7384772B2 (en) | 2003-06-27 | 2008-06-10 | The University Of Maryland | Chitin degradative systems |
US7365180B2 (en) * | 2004-05-04 | 2008-04-29 | University Of Maryland | Plant wall degradative compounds and systems |
US8273557B2 (en) | 2004-05-04 | 2012-09-25 | University Of Maryland | Hydrolytic enzyme mixtures for saccharification of lignocellulosic polysaccharides |
JP2010525815A (en) | 2007-04-30 | 2010-07-29 | ユニバーシティー オブ メリーランド | Expression of carbohydrase during degradation of whole plant material by Saccharophagus degradans |
2006
- 2006-09-12 US US11/519,104, published as US8273557B2 (en), not active: Expired - Fee Related
2007
- 2007-09-11 EP EP07838012.8A, published as EP2059604B1 (en), not active: Not-in-force
- 2007-09-11 AU AU2007294906A, published as AU2007294906A1 (en), not active: Abandoned
- 2007-09-11 CN CN200780041931A, published as CN101636499A (en), active: Pending
- 2007-09-11 WO PCT/US2007/019708, published as WO2008033330A2 (en), active: Application Filing
- 2007-09-11 KR KR1020097007532A, published as KR20090088856A (en), not active: Application Discontinuation
2012
- 2012-08-22 US US13/592,006, published as US8835139B2 (en), not active: Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
EP2059604A4 (en) | 2009-10-21 |
US20070292929A1 (en) | 2007-12-20 |
AU2007294906A1 (en) | 2008-03-20 |
WO2008033330A2 (en) | 2008-03-20 |
EP2059604A2 (en) | 2009-05-20 |
WO2008033330A3 (en) | 2008-12-18 |
US20130196401A1 (en) | 2013-08-01 |
EP2059604B1 (en) | 2013-05-01 |
CN101636499A (en) | 2010-01-27 |
US8835139B2 (en) | 2014-09-16 |
US8273557B2 (en) | 2012-09-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR20090088856A (en) | | Enzyme systems for saccharification of plant cell wall polysaccharides |
US10889844B2 (en) | | Methods of degrading or hydrolyzing a polysaccharide |
El-Sersy et al. | | Optimization, economization and characterization of cellulase produced by marine Streptomyces ruber |
Marjamaa et al. | | Novel Penicillium cellulases for total hydrolysis of lignocellulosics |
US7365180B2 (en) | | Plant wall degradative compounds and systems |
Odeniyi et al. | | Production characteristics and properties of cellulase/polygalacturonase by a Bacillus coagulans strain from a fermenting palm-fruit industrial residue |
US20170218354A1 (en) | | Highly potent cellulolytic enzyme preparations and processes for producing same |
JP6562735B2 (en) | | New xylanase |
WO2013176205A1 (en) | | Xylanase, and method for producing sugar using same |
WO2010118007A2 (en) | | Enhanced cellulase expression in s. degradans |
WO2021153587A1 (en) | | Filamentous fungus trichoderma mutant strain |
Amat et al. | | Biomass hydrolyzing enzymes from plant pathogen Xanthomonas axonopodis pv. punicae: optimizing production and characterization |
Gajula et al. | | Fermentation of enzymatically saccharified groundnut shell for fuel ethanol production by Pichia stipitis NCIM 3498 |
Rajoka et al. | | Kinetics and thermodynamics of the native and mutated extracellular endoglucanases from Cellulomonas biazotea |
Mardina et al. | | Optimization of cellulase production from a thermohalophilic bacterium PLS 75 isolated from underwater fumaroles |
US20220356499A1 (en) | | Endoglucanase, and use thereof |
JP5547688B2 (en) | | A novel cellulase derived from Thermosporos thris hazakensis |
Hu et al. | | Screening of highly efficient fungi for the degradation of lignocelluloses by ionic liquids-assisted cellulase |
JP2023145205A (en) | | Enzyme composition for biomass saccharification |
JP2019193669A (en) | | Novel xylanase |
Hamisu et al. | | Subcloning of Fusarium oxysporum Endoglucanase Gene into pET39b (+) vector and Expression in Escherichia coli |
WO2017037654A1 (en) | | Cellulolytic enzyme composition |
Henriksen | | The application of pectinolytic enzymes in the conversion of lignocellulosic biomass to fuel ethanol |
Aguiar | | CYTA-Journal of Food |
Saxena | | Dolamani Amat, Anju Arora, Lata Nain |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| A201 | Request for examination | |
| E902 | Notification of reason for refusal | |
| E601 | Decision to refuse application | |