KR20070056002A - 울케니아의 pufa―pks 유전자 - Google Patents
울케니아의 pufa―pks 유전자 Download PDFInfo
- Publication number
- KR20070056002A KR20070056002A KR1020067023437A KR20067023437A KR20070056002A KR 20070056002 A KR20070056002 A KR 20070056002A KR 1020067023437 A KR1020067023437 A KR 1020067023437A KR 20067023437 A KR20067023437 A KR 20067023437A KR 20070056002 A KR20070056002 A KR 20070056002A
- Authority
- KR
- South Korea
- Prior art keywords
- ala
- val
- leu
- glu
- gly
- Prior art date
Links
- 108090000623 proteins and genes Proteins 0.000 title abstract description 46
- 241001491678 Ulkenia Species 0.000 title description 34
- 238000004519 manufacturing process Methods 0.000 claims abstract description 25
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 8
- 108020004414 DNA Proteins 0.000 claims description 76
- 235000020777 polyunsaturated fatty acids Nutrition 0.000 claims description 31
- 150000001413 amino acids Chemical class 0.000 claims description 29
- 101000787132 Acidithiobacillus ferridurans Uncharacterized 8.2 kDa protein in mobL 3'region Proteins 0.000 claims description 18
- 101000827262 Acidithiobacillus ferrooxidans Uncharacterized 18.9 kDa protein in mobE 3'region Proteins 0.000 claims description 18
- 101000811747 Antithamnion sp. UPF0051 protein in atpA 3'region Proteins 0.000 claims description 18
- 101000827607 Bacillus phage SPP1 Uncharacterized 8.5 kDa protein in GP2-GP6 intergenic region Proteins 0.000 claims description 18
- 101000961975 Bacillus thuringiensis Uncharacterized 13.4 kDa protein Proteins 0.000 claims description 18
- 101000964407 Caldicellulosiruptor saccharolyticus Uncharacterized 10.7 kDa protein in xynB 3'region Proteins 0.000 claims description 18
- 101000768777 Haloferax lucentense (strain DSM 14919 / JCM 9276 / NCIMB 13854 / Aa 2.2) Uncharacterized 50.6 kDa protein in the 5'region of gyrA and gyrB Proteins 0.000 claims description 18
- 101000607404 Infectious laryngotracheitis virus (strain Thorne V882) Protein UL24 homolog Proteins 0.000 claims description 18
- 101000735632 Klebsiella pneumoniae Uncharacterized 8.8 kDa protein in aacA4 3'region Proteins 0.000 claims description 18
- 101000818100 Spirochaeta aurantia Uncharacterized 12.7 kDa protein in trpE 5'region Proteins 0.000 claims description 18
- 101001037658 Streptomyces coelicolor (strain ATCC BAA-471 / A3(2) / M145) Glucokinase Proteins 0.000 claims description 18
- 101000666833 Autographa californica nuclear polyhedrosis virus Uncharacterized 20.8 kDa protein in FGF-VUBI intergenic region Proteins 0.000 claims description 16
- 101000977027 Azospirillum brasilense Uncharacterized protein in nodG 5'region Proteins 0.000 claims description 16
- 101000962005 Bacillus thuringiensis Uncharacterized 23.6 kDa protein Proteins 0.000 claims description 16
- 101000785191 Drosophila melanogaster Uncharacterized 50 kDa protein in type I retrotransposable element R1DM Proteins 0.000 claims description 16
- 101000747704 Enterobacteria phage N4 Uncharacterized protein Gp1 Proteins 0.000 claims description 16
- 101000861206 Enterococcus faecalis (strain ATCC 700802 / V583) Uncharacterized protein EF_A0048 Proteins 0.000 claims description 16
- 101000769180 Escherichia coli Uncharacterized 11.1 kDa protein Proteins 0.000 claims description 16
- 101000976301 Leptospira interrogans Uncharacterized 35 kDa protein in sph 3'region Proteins 0.000 claims description 16
- 101000658690 Neisseria meningitidis serogroup B Transposase for insertion sequence element IS1106 Proteins 0.000 claims description 16
- 101000748660 Pseudomonas savastanoi Uncharacterized 21 kDa protein in iaaL 5'region Proteins 0.000 claims description 16
- 101000584469 Rice tungro bacilliform virus (isolate Philippines) Protein P1 Proteins 0.000 claims description 16
- 101000818096 Spirochaeta aurantia Uncharacterized 15.5 kDa protein in trpE 3'region Proteins 0.000 claims description 16
- 101000766081 Streptomyces ambofaciens Uncharacterized HTH-type transcriptional regulator in unstable DNA locus Proteins 0.000 claims description 16
- 101000804403 Synechococcus elongatus (strain PCC 7942 / FACHB-805) Uncharacterized HIT-like protein Synpcc7942_1390 Proteins 0.000 claims description 16
- 101000750910 Synechococcus elongatus (strain PCC 7942 / FACHB-805) Uncharacterized HTH-type transcriptional regulator Synpcc7942_2319 Proteins 0.000 claims description 16
- 101000644897 Synechococcus sp. (strain ATCC 27264 / PCC 7002 / PR-6) Uncharacterized protein SYNPCC7002_B0001 Proteins 0.000 claims description 16
- 101000916336 Xenopus laevis Transposon TX1 uncharacterized 82 kDa protein Proteins 0.000 claims description 16
- 101001000760 Zea mays Putative Pol polyprotein from transposon element Bs1 Proteins 0.000 claims description 16
- 101000678262 Zymomonas mobilis subsp. mobilis (strain ATCC 10988 / DSM 424 / LMG 404 / NCIMB 8938 / NRRL B-806 / ZM1) 65 kDa protein Proteins 0.000 claims description 16
- 101000977023 Azospirillum brasilense Uncharacterized 17.8 kDa protein in nodG 5'region Proteins 0.000 claims description 15
- 101000961984 Bacillus thuringiensis Uncharacterized 30.3 kDa protein Proteins 0.000 claims description 15
- 101000644901 Drosophila melanogaster Putative 115 kDa protein in type-1 retrotransposable element R1DM Proteins 0.000 claims description 15
- 101000747702 Enterobacteria phage N4 Uncharacterized protein Gp2 Proteins 0.000 claims description 15
- 101000758599 Escherichia coli Uncharacterized 14.7 kDa protein Proteins 0.000 claims description 15
- 101000768930 Lactococcus lactis subsp. cremoris Uncharacterized protein in pepC 5'region Proteins 0.000 claims description 15
- 101000976302 Leptospira interrogans Uncharacterized protein in sph 3'region Proteins 0.000 claims description 15
- 101000778886 Leptospira interrogans serogroup Icterohaemorrhagiae serovar Lai (strain 56601) Uncharacterized protein LA_2151 Proteins 0.000 claims description 15
- 101001121571 Rice tungro bacilliform virus (isolate Philippines) Protein P2 Proteins 0.000 claims description 15
- 101000818098 Spirochaeta aurantia Uncharacterized protein in trpE 3'region Proteins 0.000 claims description 15
- 101001026590 Streptomyces cinnamonensis Putative polyketide beta-ketoacyl synthase 2 Proteins 0.000 claims description 15
- 101000750896 Synechococcus elongatus (strain PCC 7942 / FACHB-805) Uncharacterized protein Synpcc7942_2318 Proteins 0.000 claims description 15
- 101000916321 Xenopus laevis Transposon TX1 uncharacterized 149 kDa protein Proteins 0.000 claims description 15
- 101000760088 Zymomonas mobilis subsp. mobilis (strain ATCC 10988 / DSM 424 / LMG 404 / NCIMB 8938 / NRRL B-806 / ZM1) 20.9 kDa protein Proteins 0.000 claims description 15
- 238000000034 method Methods 0.000 claims description 15
- 108090000790 Enzymes Proteins 0.000 claims description 11
- 102000004190 Enzymes Human genes 0.000 claims description 10
- 102000053602 DNA Human genes 0.000 claims description 9
- 239000002028 Biomass Substances 0.000 claims description 8
- 230000004071 biological effect Effects 0.000 claims description 8
- 150000007523 nucleic acids Chemical class 0.000 claims description 8
- 239000002773 nucleotide Substances 0.000 claims description 7
- 125000003729 nucleotide group Chemical group 0.000 claims description 7
- 229930001119 polyketide Natural products 0.000 claims description 7
- 108020004707 nucleic acids Proteins 0.000 claims description 6
- 102000039446 nucleic acids Human genes 0.000 claims description 6
- 230000035897 transcription Effects 0.000 claims description 6
- 238000013518 transcription Methods 0.000 claims description 6
- 108020004511 Recombinant DNA Proteins 0.000 claims description 5
- 230000000694 effects Effects 0.000 claims description 5
- 125000000830 polyketide group Chemical group 0.000 claims description 3
- 230000008569 process Effects 0.000 claims description 3
- 125000003275 alpha amino acid group Chemical group 0.000 claims 7
- 230000000295 complement effect Effects 0.000 claims 1
- 108010030975 Polyketide Synthases Proteins 0.000 abstract description 18
- 235000014113 dietary fatty acids Nutrition 0.000 abstract description 10
- 229930195729 fatty acid Natural products 0.000 abstract description 10
- 239000000194 fatty acid Substances 0.000 abstract description 10
- 150000004665 fatty acids Chemical class 0.000 abstract description 9
- 230000002255 enzymatic effect Effects 0.000 abstract description 7
- 241001298226 Ulkenia sp. Species 0.000 description 80
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 56
- 108010087924 alanylproline Proteins 0.000 description 49
- MBMBGCFOFBJSGT-KUBAVDMBSA-N all-cis-docosa-4,7,10,13,16,19-hexaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCC(O)=O MBMBGCFOFBJSGT-KUBAVDMBSA-N 0.000 description 48
- 108010050848 glycylleucine Proteins 0.000 description 41
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 38
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 36
- 108010005233 alanylglutamic acid Proteins 0.000 description 35
- 108010049041 glutamylalanine Proteins 0.000 description 35
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 28
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 27
- 108010047495 alanylglycine Proteins 0.000 description 26
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 26
- 108010034529 leucyl-lysine Proteins 0.000 description 26
- KPDTZVSUQCBOAE-HTFCKZLJSA-N (2s)-2-[[(2s)-1-[(2s)-2-[[(2s)-2-[[(2s)-2-aminopropanoyl]amino]propanoyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]propanoic acid Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O KPDTZVSUQCBOAE-HTFCKZLJSA-N 0.000 description 25
- 235000020669 docosahexaenoic acid Nutrition 0.000 description 25
- 108010003700 lysyl aspartic acid Proteins 0.000 description 25
- 108010061238 threonyl-glycine Proteins 0.000 description 25
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 24
- 229940090949 docosahexaenoic acid Drugs 0.000 description 24
- 230000015572 biosynthetic process Effects 0.000 description 23
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 22
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 21
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 21
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 21
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 20
- 241000880493 Leptailurus serval Species 0.000 description 20
- 108700026244 Open Reading Frames Proteins 0.000 description 20
- 108010009298 lysylglutamic acid Proteins 0.000 description 20
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 19
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 19
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 18
- QGRJTULYDZUBAY-ZPFDUUQYSA-N Met-Ile-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGRJTULYDZUBAY-ZPFDUUQYSA-N 0.000 description 18
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 18
- 108010056582 methionylglutamic acid Proteins 0.000 description 18
- 108010079364 N-glycylalanine Proteins 0.000 description 17
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 17
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 17
- 241000894007 species Species 0.000 description 17
- 238000003786 synthesis reaction Methods 0.000 description 17
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 16
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 16
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 16
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 16
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 16
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 16
- 108010047857 aspartylglycine Proteins 0.000 description 16
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 16
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 15
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 15
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 15
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 15
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 15
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 15
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 15
- 230000014509 gene expression Effects 0.000 description 15
- 239000013612 plasmid Substances 0.000 description 15
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 14
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 14
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 14
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 14
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 14
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 14
- 108010044940 alanylglutamine Proteins 0.000 description 14
- YZXBAPSDXZZRGB-DOFZRALJSA-N arachidonic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O YZXBAPSDXZZRGB-DOFZRALJSA-N 0.000 description 14
- 108010077245 asparaginyl-proline Proteins 0.000 description 14
- 210000004027 cell Anatomy 0.000 description 14
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 13
- 108010093581 aspartyl-proline Proteins 0.000 description 13
- 108010057821 leucylproline Proteins 0.000 description 13
- 101100512078 Caenorhabditis elegans lys-1 gene Proteins 0.000 description 12
- 241000196324 Embryophyta Species 0.000 description 12
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 12
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 12
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 12
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 12
- 108010015792 glycyllysine Proteins 0.000 description 12
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 11
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 11
- 235000001014 amino acid Nutrition 0.000 description 11
- 108010038633 aspartylglutamate Proteins 0.000 description 11
- 108010037850 glycylvaline Proteins 0.000 description 11
- 108010017391 lysylvaline Proteins 0.000 description 11
- 244000005700 microbiome Species 0.000 description 11
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 10
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 10
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 10
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 10
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 10
- DTOSIQBPPRVQHS-PDBXOOCHSA-N alpha-linolenic acid Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCC(O)=O DTOSIQBPPRVQHS-PDBXOOCHSA-N 0.000 description 10
- 108010054155 lysyllysine Proteins 0.000 description 10
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 9
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 9
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 9
- 108020004705 Codon Proteins 0.000 description 9
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 9
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 9
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 9
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 9
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 9
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 9
- 108091034117 Oligonucleotide Proteins 0.000 description 9
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 9
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 9
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 9
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 9
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 9
- 125000002252 acyl group Chemical group 0.000 description 9
- JAZBEHYOTPTENJ-JLNKQSITSA-N all-cis-5,8,11,14,17-icosapentaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O JAZBEHYOTPTENJ-JLNKQSITSA-N 0.000 description 9
- 108010013835 arginine glutamate Proteins 0.000 description 9
- 235000020673 eicosapentaenoic acid Nutrition 0.000 description 9
- 229960005135 eicosapentaenoic acid Drugs 0.000 description 9
- JAZBEHYOTPTENJ-UHFFFAOYSA-N eicosapentaenoic acid Natural products CCC=CCC=CCC=CCC=CCC=CCCCC(O)=O JAZBEHYOTPTENJ-UHFFFAOYSA-N 0.000 description 9
- 229940088598 enzyme Drugs 0.000 description 9
- 108010089804 glycyl-threonine Proteins 0.000 description 9
- 108010064235 lysylglycine Proteins 0.000 description 9
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 8
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 8
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 8
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 8
- 241000588724 Escherichia coli Species 0.000 description 8
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 8
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 8
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 8
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 8
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 8
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 8
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 8
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 8
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 8
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 8
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 8
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 8
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 8
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 8
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 8
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 8
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 8
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 8
- 108010041407 alanylaspartic acid Proteins 0.000 description 8
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 8
- 238000006243 chemical reaction Methods 0.000 description 8
- 108010016616 cysteinylglycine Proteins 0.000 description 8
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 8
- 108010010147 glycylglutamine Proteins 0.000 description 8
- 108010077515 glycylproline Proteins 0.000 description 8
- 108010040030 histidinoalanine Proteins 0.000 description 8
- 108010000761 leucylarginine Proteins 0.000 description 8
- 108010068488 methionylphenylalanine Proteins 0.000 description 8
- 108010029020 prolylglycine Proteins 0.000 description 8
- 230000009466 transformation Effects 0.000 description 8
- 239000013598 vector Substances 0.000 description 8
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 description 7
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 7
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 7
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 7
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 7
- ORXCYAFUCSTQGY-FXQIFTODSA-N Asn-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N ORXCYAFUCSTQGY-FXQIFTODSA-N 0.000 description 7
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 7
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 7
- 241000894006 Bacteria Species 0.000 description 7
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 7
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 7
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 7
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 7
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 7
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 7
- 108010047562 NGR peptide Proteins 0.000 description 7
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 7
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 7
- 235000020661 alpha-linolenic acid Nutrition 0.000 description 7
- 235000021342 arachidonic acid Nutrition 0.000 description 7
- 229940114079 arachidonic acid Drugs 0.000 description 7
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 7
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 7
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 7
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 7
- 229960004488 linolenic acid Drugs 0.000 description 7
- 239000000203 mixture Substances 0.000 description 7
- 239000003921 oil Substances 0.000 description 7
- 235000020660 omega-3 fatty acid Nutrition 0.000 description 7
- 108010012581 phenylalanylglutamate Proteins 0.000 description 7
- 108010051242 phenylalanylserine Proteins 0.000 description 7
- 108010070643 prolylglutamic acid Proteins 0.000 description 7
- 108010053725 prolylvaline Proteins 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- 108010071207 serylmethionine Proteins 0.000 description 7
- 238000012546 transfer Methods 0.000 description 7
- 108010073969 valyllysine Proteins 0.000 description 7
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 6
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 6
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 6
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 6
- OFIYLHVAAJYRBC-HJWJTTGWSA-N Arg-Ile-Phe Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O OFIYLHVAAJYRBC-HJWJTTGWSA-N 0.000 description 6
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 6
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 6
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 6
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 6
- 108050002233 Beta-ketoacyl synthases Proteins 0.000 description 6
- 102000011802 Beta-ketoacyl synthases Human genes 0.000 description 6
- 108010078791 Carrier Proteins Proteins 0.000 description 6
- 241000233866 Fungi Species 0.000 description 6
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 6
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 6
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 6
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 6
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 6
- 108010065920 Insulin Lispro Proteins 0.000 description 6
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 6
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 6
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 6
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 6
- 108090000854 Oxidoreductases Proteins 0.000 description 6
- 102000004316 Oxidoreductases Human genes 0.000 description 6
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 6
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 6
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 6
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 6
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 6
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 6
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 6
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 6
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 6
- 108010070944 alanylhistidine Proteins 0.000 description 6
- 108010062796 arginyllysine Proteins 0.000 description 6
- 108010060199 cysteinylproline Proteins 0.000 description 6
- 108010078144 glutaminyl-glycine Proteins 0.000 description 6
- 108010079547 glutamylmethionine Proteins 0.000 description 6
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- 108010031719 prolyl-serine Proteins 0.000 description 6
- 108010004914 prolylarginine Proteins 0.000 description 6
- 108010026333 seryl-proline Proteins 0.000 description 6
- 241000251468 Actinopterygii Species 0.000 description 5
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 5
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 5
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 5
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 5
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 5
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 5
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 5
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 5
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 5
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 5
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 5
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 5
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 5
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 5
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 5
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 5
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 5
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 5
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 5
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 5
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 5
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 5
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 5
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 5
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 5
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 5
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 5
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 5
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 5
- 108090001042 Hydro-Lyases Proteins 0.000 description 5
- 102000004867 Hydro-Lyases Human genes 0.000 description 5
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 5
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 5
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 5
- 108090000769 Isomerases Proteins 0.000 description 5
- 102000004195 Isomerases Human genes 0.000 description 5
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 5
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 5
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 5
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 5
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 5
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 5
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 5
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 5
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 5
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 5
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 5
- 241001465754 Metazoa Species 0.000 description 5
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 5
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 5
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 5
- 108091081024 Start codon Proteins 0.000 description 5
- APIQKJYZDWVOCE-VEVYYDQMSA-N Thr-Asp-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O APIQKJYZDWVOCE-VEVYYDQMSA-N 0.000 description 5
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 5
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 5
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 5
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 5
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 5
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 5
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 5
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 5
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 5
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 5
- 108010011559 alanylphenylalanine Proteins 0.000 description 5
- 108010070783 alanyltyrosine Proteins 0.000 description 5
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 5
- 108010060035 arginylproline Proteins 0.000 description 5
- 230000000052 comparative effect Effects 0.000 description 5
- 230000004136 fatty acid synthesis Effects 0.000 description 5
- 235000019688 fish Nutrition 0.000 description 5
- 239000012634 fragment Substances 0.000 description 5
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 5
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 5
- 108010012058 leucyltyrosine Proteins 0.000 description 5
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 5
- 150000003881 polyketide derivatives Chemical class 0.000 description 5
- 108010048818 seryl-histidine Proteins 0.000 description 5
- 108010051110 tyrosyl-lysine Proteins 0.000 description 5
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 5
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 4
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 4
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 4
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 4
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 4
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 4
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 4
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 4
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 4
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 4
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 4
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 4
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 4
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 4
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 4
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 4
- VWWAFGHMPWBKEP-GMOBBJLQSA-N Asp-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)O)N VWWAFGHMPWBKEP-GMOBBJLQSA-N 0.000 description 4
- WDMNFNXKGSLIOB-GUBZILKMSA-N Asp-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N WDMNFNXKGSLIOB-GUBZILKMSA-N 0.000 description 4
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 4
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 4
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 4
- 101000584877 Clostridium pasteurianum Putative peroxiredoxin in rubredoxin operon Proteins 0.000 description 4
- YFXFOZPXVFPBDH-VZFHVOOUSA-N Cys-Ala-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CS)C(O)=O YFXFOZPXVFPBDH-VZFHVOOUSA-N 0.000 description 4
- 101000618323 Enterobacteria phage T4 Uncharacterized 7.3 kDa protein in mobB-Gp55 intergenic region Proteins 0.000 description 4
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 4
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 4
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 4
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 4
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 4
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 4
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 4
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 4
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 4
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 4
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 4
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 4
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 4
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 4
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 4
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 4
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 4
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 4
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 4
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 4
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 4
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 4
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 4
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 4
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 4
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 4
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 4
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 4
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 4
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 4
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 4
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 4
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 4
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 4
- DAHQKYYIXPBESV-UWVGGRQHSA-N Lys-Met-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O DAHQKYYIXPBESV-UWVGGRQHSA-N 0.000 description 4
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 4
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 4
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 4
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 4
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 4
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 4
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 4
- QVIZLAUEAMQKGS-GUBZILKMSA-N Pro-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 QVIZLAUEAMQKGS-GUBZILKMSA-N 0.000 description 4
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 4
- 101001056912 Saccharopolyspora erythraea 6-deoxyerythronolide-B synthase EryA1, modules 1 and 2 Proteins 0.000 description 4
- 101001056914 Saccharopolyspora erythraea 6-deoxyerythronolide-B synthase EryA3, modules 5 and 6 Proteins 0.000 description 4
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 4
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 4
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 4
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 4
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 4
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 4
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 4
- 101000819251 Staphylococcus aureus Uncharacterized protein in ileS 3'region Proteins 0.000 description 4
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 4
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 4
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 4
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 4
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 4
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 4
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 4
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 4
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 4
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 4
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 4
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 4
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 4
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 4
- 235000004279 alanine Nutrition 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- 108010009297 diglycyl-histidine Proteins 0.000 description 4
- 238000000605 extraction Methods 0.000 description 4
- 239000000835 fiber Substances 0.000 description 4
- 229940013317 fish oils Drugs 0.000 description 4
- 108091008053 gene clusters Proteins 0.000 description 4
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 4
- 108010036413 histidylglycine Proteins 0.000 description 4
- 108010092114 histidylphenylalanine Proteins 0.000 description 4
- 108010018006 histidylserine Proteins 0.000 description 4
- PHTQWCKDNZKARW-UHFFFAOYSA-N isoamylol Chemical compound CC(C)CCO PHTQWCKDNZKARW-UHFFFAOYSA-N 0.000 description 4
- 108010078274 isoleucylvaline Proteins 0.000 description 4
- 108010091871 leucylmethionine Proteins 0.000 description 4
- KQQKGWQCNNTQJW-UHFFFAOYSA-N linolenic acid Natural products CC=CCCC=CCC=CCCCCCCCC(O)=O KQQKGWQCNNTQJW-UHFFFAOYSA-N 0.000 description 4
- 235000020978 long-chain polyunsaturated fatty acids Nutrition 0.000 description 4
- 108010005942 methionylglycine Proteins 0.000 description 4
- 230000037361 pathway Effects 0.000 description 4
- 238000006722 reduction reaction Methods 0.000 description 4
- 230000002441 reversible effect Effects 0.000 description 4
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 4
- 210000001519 tissue Anatomy 0.000 description 4
- 230000009261 transgenic effect Effects 0.000 description 4
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 3
- 108700016155 Acyl transferases Proteins 0.000 description 3
- 102000057234 Acyl transferases Human genes 0.000 description 3
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 3
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 3
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 3
- IYCZBJXFSZSHPN-DLOVCJGASA-N Ala-Cys-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IYCZBJXFSZSHPN-DLOVCJGASA-N 0.000 description 3
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 3
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 3
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 3
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 3
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 3
- XCZXVTHYGSMQGH-NAKRPEOUSA-N Ala-Ile-Met Chemical compound C[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C([O-])=O XCZXVTHYGSMQGH-NAKRPEOUSA-N 0.000 description 3
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 3
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 3
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 3
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 3
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 3
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 3
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 3
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 3
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 3
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 3
- KGSJCPBERYUXCN-BPNCWPANSA-N Arg-Ala-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KGSJCPBERYUXCN-BPNCWPANSA-N 0.000 description 3
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 3
- YUGFLWBWAJFGKY-BQBZGAKWSA-N Arg-Cys-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O YUGFLWBWAJFGKY-BQBZGAKWSA-N 0.000 description 3
- DGFGDPVSDQPANQ-XGEHTFHBSA-N Arg-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N)O DGFGDPVSDQPANQ-XGEHTFHBSA-N 0.000 description 3
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 3
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 3
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 3
- RIIVUOJDDQXHRV-SRVKXCTJSA-N Arg-Lys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O RIIVUOJDDQXHRV-SRVKXCTJSA-N 0.000 description 3
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 3
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 3
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 3
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 3
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 3
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 3
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 3
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 3
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 3
- OGMDXNFGPOPZTK-GUBZILKMSA-N Asn-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N OGMDXNFGPOPZTK-GUBZILKMSA-N 0.000 description 3
- UYXXMIZGHYKYAT-NHCYSSNCSA-N Asn-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N UYXXMIZGHYKYAT-NHCYSSNCSA-N 0.000 description 3
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 3
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 3
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 3
- RLHANKIRBONJBK-IHRRRGAJSA-N Asn-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N RLHANKIRBONJBK-IHRRRGAJSA-N 0.000 description 3
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 3
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 3
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 3
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 3
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 3
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 3
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 3
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 3
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 3
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 3
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 3
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 3
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 3
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 3
- 241000351920 Aspergillus nidulans Species 0.000 description 3
- 241000701489 Cauliflower mosaic virus Species 0.000 description 3
- KLLFLHBKSJAUMZ-ACZMJKKPSA-N Cys-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N KLLFLHBKSJAUMZ-ACZMJKKPSA-N 0.000 description 3
- OXOQBEVULIBOSH-ZDLURKLDSA-N Cys-Gly-Thr Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O OXOQBEVULIBOSH-ZDLURKLDSA-N 0.000 description 3
- MKMKILWCRQLDFJ-DCAQKATOSA-N Cys-Lys-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MKMKILWCRQLDFJ-DCAQKATOSA-N 0.000 description 3
- UDDITVWSXPEAIQ-IHRRRGAJSA-N Cys-Phe-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UDDITVWSXPEAIQ-IHRRRGAJSA-N 0.000 description 3
- TXCCRYAZQBUCOV-CIUDSAMLSA-N Cys-Pro-Gln Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O TXCCRYAZQBUCOV-CIUDSAMLSA-N 0.000 description 3
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 3
- 241000195619 Euglena gracilis Species 0.000 description 3
- 241000206602 Eukaryota Species 0.000 description 3
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 3
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 3
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 3
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 3
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 3
- WTJIWXMJESRHMM-XDTLVQLUSA-N Gln-Tyr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O WTJIWXMJESRHMM-XDTLVQLUSA-N 0.000 description 3
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 3
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 3
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 3
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 3
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 3
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 3
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 3
- ZZIFPJZQHRJERU-WDSKDSINSA-N Glu-Cys-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZZIFPJZQHRJERU-WDSKDSINSA-N 0.000 description 3
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 3
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 3
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 3
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 3
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 3
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 3
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 3
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 3
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 3
- YUXIEONARHPUTK-JBACZVJFSA-N Glu-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CCC(=O)O)N YUXIEONARHPUTK-JBACZVJFSA-N 0.000 description 3
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 3
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 3
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 3
- GUOWMVFLAJNPDY-CIUDSAMLSA-N Glu-Ser-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GUOWMVFLAJNPDY-CIUDSAMLSA-N 0.000 description 3
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 3
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 3
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 3
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 3
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 3
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 3
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 3
- LGQZOQRDEUIZJY-YUMQZZPRSA-N Gly-Cys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CS)NC(=O)CN)C(O)=O LGQZOQRDEUIZJY-YUMQZZPRSA-N 0.000 description 3
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 3
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 3
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 3
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 3
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 3
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 3
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 3
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 3
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 3
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 3
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 3
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 3
- BXDLTKLPPKBVEL-FJXKBIBVSA-N Gly-Thr-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O BXDLTKLPPKBVEL-FJXKBIBVSA-N 0.000 description 3
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 3
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 3
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 3
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- QEYUCKCWTMIERU-SRVKXCTJSA-N His-Lys-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QEYUCKCWTMIERU-SRVKXCTJSA-N 0.000 description 3
- UMBKDWGQESDCTO-KKUMJFAQSA-N His-Lys-Lys Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O UMBKDWGQESDCTO-KKUMJFAQSA-N 0.000 description 3
- JATYGDHMDRAISQ-KKUMJFAQSA-N His-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O JATYGDHMDRAISQ-KKUMJFAQSA-N 0.000 description 3
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 3
- 101000833492 Homo sapiens Jouberin Proteins 0.000 description 3
- 101001122476 Homo sapiens Mu-type opioid receptor Proteins 0.000 description 3
- 101000651236 Homo sapiens NCK-interacting protein with SH3 domain Proteins 0.000 description 3
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 3
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 3
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 3
- XLDYDEDTGMHUCZ-GHCJXIJMSA-N Ile-Asp-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N XLDYDEDTGMHUCZ-GHCJXIJMSA-N 0.000 description 3
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 3
- LWWILHPVAKKLQS-QXEWZRGKSA-N Ile-Gly-Met Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N LWWILHPVAKKLQS-QXEWZRGKSA-N 0.000 description 3
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 3
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 3
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 3
- 102100024407 Jouberin Human genes 0.000 description 3
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 3
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 3
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 3
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 3
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 3
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 3
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 3
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 3
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 3
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 3
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 3
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 3
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 3
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 3
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 3
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 3
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 3
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 3
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 3
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 3
- QONKWXNJRRNTBV-AVGNSLFASA-N Leu-Pro-Met Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N QONKWXNJRRNTBV-AVGNSLFASA-N 0.000 description 3
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 3
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 3
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 3
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 3
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 3
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 3
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 3
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 3
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 3
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 3
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 3
- JZMGVXLDOQOKAH-UWVGGRQHSA-N Lys-Gly-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O JZMGVXLDOQOKAH-UWVGGRQHSA-N 0.000 description 3
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 3
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 description 3
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 3
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 3
- 108010003266 Lys-Leu-Tyr-Asp Proteins 0.000 description 3
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 3
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 3
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 3
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 3
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 3
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 3
- GVKINWYYLOLEFQ-XIRDDKMYSA-N Lys-Trp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O GVKINWYYLOLEFQ-XIRDDKMYSA-N 0.000 description 3
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 3
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 3
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 3
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 3
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 3
- ZMYHJISLFYTQGK-FXQIFTODSA-N Met-Asp-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMYHJISLFYTQGK-FXQIFTODSA-N 0.000 description 3
- IZLCDZDNZFEDHB-DCAQKATOSA-N Met-Cys-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N IZLCDZDNZFEDHB-DCAQKATOSA-N 0.000 description 3
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 3
- BQHLZUMZOXUWNU-DCAQKATOSA-N Met-Pro-Glu Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BQHLZUMZOXUWNU-DCAQKATOSA-N 0.000 description 3
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 3
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 3
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 3
- 241000294598 Moritella marina Species 0.000 description 3
- 102100028647 Mu-type opioid receptor Human genes 0.000 description 3
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 3
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 3
- 108010066427 N-valyltryptophan Proteins 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- 108010065395 Neuropep-1 Proteins 0.000 description 3
- 101100131043 Oryza sativa subsp. japonica MOF1 gene Proteins 0.000 description 3
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 3
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 3
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 3
- WURZLPSMYZLEGH-UNQGMJICSA-N Phe-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N)O WURZLPSMYZLEGH-UNQGMJICSA-N 0.000 description 3
- 241000607568 Photobacterium Species 0.000 description 3
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 3
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 3
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 3
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 3
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 3
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 3
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 3
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 3
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 3
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 3
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 3
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 3
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 3
- 241000169446 Promethis Species 0.000 description 3
- 108010076504 Protein Sorting Signals Proteins 0.000 description 3
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 3
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 3
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 3
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 3
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 3
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 3
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 3
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 3
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 3
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 3
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 3
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 3
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 3
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 3
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 3
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 3
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 3
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 3
- 241000863430 Shewanella Species 0.000 description 3
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 3
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 3
- KZUJCMPVNXOBAF-LKXGYXEUSA-N Thr-Cys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KZUJCMPVNXOBAF-LKXGYXEUSA-N 0.000 description 3
- UZJDBCHMIQXLOQ-HEIBUPTGSA-N Thr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O UZJDBCHMIQXLOQ-HEIBUPTGSA-N 0.000 description 3
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 3
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 3
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 3
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 3
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 3
- PZSDPRBZINDEJV-HTUGSXCWSA-N Thr-Phe-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PZSDPRBZINDEJV-HTUGSXCWSA-N 0.000 description 3
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 3
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 3
- PXYJUECTGMGIDT-WDSOQIARSA-N Trp-Arg-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 PXYJUECTGMGIDT-WDSOQIARSA-N 0.000 description 3
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 3
- BOMYCJXTWRMKJA-RNXOBYDBSA-N Trp-Phe-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N BOMYCJXTWRMKJA-RNXOBYDBSA-N 0.000 description 3
- DVLHKUWLNKDINO-PMVMPFDFSA-N Trp-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DVLHKUWLNKDINO-PMVMPFDFSA-N 0.000 description 3
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 3
- NSOMQRHZMJMZIE-GVARAGBVSA-N Tyr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NSOMQRHZMJMZIE-GVARAGBVSA-N 0.000 description 3
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 3
- LTSIAOZUVISRAQ-QWRGUYRKSA-N Tyr-Gly-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O LTSIAOZUVISRAQ-QWRGUYRKSA-N 0.000 description 3
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 3
- WOAQYWUEUYMVGK-ULQDDVLXSA-N Tyr-Lys-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOAQYWUEUYMVGK-ULQDDVLXSA-N 0.000 description 3
- SBLZVFCEOCWRLS-BPNCWPANSA-N Tyr-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SBLZVFCEOCWRLS-BPNCWPANSA-N 0.000 description 3
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 3
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 3
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 3
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 3
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 3
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 3
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 3
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 3
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 3
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 3
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 3
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 3
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 3
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 3
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 3
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 3
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 3
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 3
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 3
- RSEIVHMDTNNEOW-JYJNAYRXSA-N Val-Trp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CS)C(=O)O)N RSEIVHMDTNNEOW-JYJNAYRXSA-N 0.000 description 3
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 3
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 3
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 3
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 108010066119 arginyl-leucyl-aspartyl-serine Proteins 0.000 description 3
- 230000008827 biological function Effects 0.000 description 3
- 229910052799 carbon Inorganic materials 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 3
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 3
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 3
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 3
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 3
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 230000036541 health Effects 0.000 description 3
- 108010025306 histidylleucine Proteins 0.000 description 3
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 3
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 3
- 150000002632 lipids Chemical class 0.000 description 3
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 3
- 125000000346 malonyl group Chemical group C(CC(=O)*)(=O)* 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 235000016709 nutrition Nutrition 0.000 description 3
- 235000020665 omega-6 fatty acid Nutrition 0.000 description 3
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 3
- 108010079317 prolyl-tyrosine Proteins 0.000 description 3
- 235000018102 proteins Nutrition 0.000 description 3
- 102000004169 proteins and genes Human genes 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 3
- 108700004896 tripeptide FEG Proteins 0.000 description 3
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 3
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 2
- 108010019608 3-Oxoacyl-(Acyl-Carrier-Protein) Synthase Proteins 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 2
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 2
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 2
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 2
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 2
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 2
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 2
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 2
- WQVYAWIMAWTGMW-ZLUOBGJFSA-N Ala-Asp-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WQVYAWIMAWTGMW-ZLUOBGJFSA-N 0.000 description 2
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 2
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 2
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 2
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 2
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 2
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 2
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 2
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 2
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 2
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 2
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 2
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 2
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 2
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 2
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 2
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 2
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 2
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- 101000645498 Alkalihalobacillus pseudofirmus (strain ATCC BAA-2126 / JCM 17055 / OF4) Uncharacterized protein BpOF4_10220 Proteins 0.000 description 2
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 2
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 2
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 2
- QEKBCDODJBBWHV-GUBZILKMSA-N Arg-Arg-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O QEKBCDODJBBWHV-GUBZILKMSA-N 0.000 description 2
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 2
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 2
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 2
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 2
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 2
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 2
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 2
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 2
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 2
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 2
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 2
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 2
- QHVRVUNEAIFTEK-SZMVWBNQSA-N Arg-Pro-Trp Chemical compound N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O QHVRVUNEAIFTEK-SZMVWBNQSA-N 0.000 description 2
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 2
- VJIQPOJMISSUPO-BVSLBCMMSA-N Arg-Trp-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VJIQPOJMISSUPO-BVSLBCMMSA-N 0.000 description 2
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 2
- WTUZDHWWGUQEKN-SRVKXCTJSA-N Arg-Val-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O WTUZDHWWGUQEKN-SRVKXCTJSA-N 0.000 description 2
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 2
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 2
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 2
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 2
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 2
- UEONJSPBTSWKOI-CIUDSAMLSA-N Asn-Gln-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O UEONJSPBTSWKOI-CIUDSAMLSA-N 0.000 description 2
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 2
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 2
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 2
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 2
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 2
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 2
- ULZOQOKFYMXHPZ-AQZXSJQPSA-N Asn-Trp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ULZOQOKFYMXHPZ-AQZXSJQPSA-N 0.000 description 2
- KTDWFWNZLLFEFU-KKUMJFAQSA-N Asn-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KTDWFWNZLLFEFU-KKUMJFAQSA-N 0.000 description 2
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 2
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 2
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 2
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 2
- ZRAOLTNMSCSCLN-ZLUOBGJFSA-N Asp-Cys-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)O ZRAOLTNMSCSCLN-ZLUOBGJFSA-N 0.000 description 2
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 2
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- OOXKFYNWRVGYFM-XIRDDKMYSA-N Asp-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CC(=O)O)N OOXKFYNWRVGYFM-XIRDDKMYSA-N 0.000 description 2
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 2
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 2
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 2
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 2
- NONWUQAWAANERO-BZSNNMDCSA-N Asp-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 NONWUQAWAANERO-BZSNNMDCSA-N 0.000 description 2
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 2
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 2
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 2
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 2
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 2
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 2
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 2
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 2
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 2
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 2
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 2
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 102000014914 Carrier Proteins Human genes 0.000 description 2
- 108010049994 Chloroplast Proteins Proteins 0.000 description 2
- 101001132313 Clostridium pasteurianum 34.2 kDa protein in rubredoxin operon Proteins 0.000 description 2
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 2
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 2
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 2
- CEZSLNCYQUFOSL-BQBZGAKWSA-N Cys-Arg-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O CEZSLNCYQUFOSL-BQBZGAKWSA-N 0.000 description 2
- XIZWKXATMJODQW-KKUMJFAQSA-N Cys-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N XIZWKXATMJODQW-KKUMJFAQSA-N 0.000 description 2
- VPQZSNQICFCCSO-BJDJZHNGSA-N Cys-Leu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VPQZSNQICFCCSO-BJDJZHNGSA-N 0.000 description 2
- AFYGNOJUTMXQIG-FXQIFTODSA-N Cys-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N AFYGNOJUTMXQIG-FXQIFTODSA-N 0.000 description 2
- DTFJUSWYECELTM-BPUTZDHNSA-N Cys-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O DTFJUSWYECELTM-BPUTZDHNSA-N 0.000 description 2
- ALNKNYKSZPSLBD-ZDLURKLDSA-N Cys-Thr-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ALNKNYKSZPSLBD-ZDLURKLDSA-N 0.000 description 2
- 101000618325 Enterobacteria phage T4 Uncharacterized 12.4 kDa protein in mobB-Gp55 intergenic region Proteins 0.000 description 2
- 101000653284 Enterobacteria phage T4 Uncharacterized 9.4 kDa protein in Gp31-cd intergenic region Proteins 0.000 description 2
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 2
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 2
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 2
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 2
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 2
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 2
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 2
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 2
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 2
- OREPWMPAUWIIAM-ZPFDUUQYSA-N Gln-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N OREPWMPAUWIIAM-ZPFDUUQYSA-N 0.000 description 2
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 2
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 2
- SJMJMEWQMBJYPR-DZKIICNBSA-N Gln-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N SJMJMEWQMBJYPR-DZKIICNBSA-N 0.000 description 2
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 2
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 2
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 2
- PNAOVYHADQRJQU-GUBZILKMSA-N Glu-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N PNAOVYHADQRJQU-GUBZILKMSA-N 0.000 description 2
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 2
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 2
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 2
- BKRQSECBKKCCKW-HVTMNAMFSA-N Glu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N BKRQSECBKKCCKW-HVTMNAMFSA-N 0.000 description 2
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 2
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 2
- XNOWYPDMSLSRKP-GUBZILKMSA-N Glu-Met-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O XNOWYPDMSLSRKP-GUBZILKMSA-N 0.000 description 2
- GMAGZGCAYLQBKF-NHCYSSNCSA-N Glu-Met-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GMAGZGCAYLQBKF-NHCYSSNCSA-N 0.000 description 2
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 2
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 2
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 2
- KXRORHJIRAOQPG-SOUVJXGZSA-N Glu-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KXRORHJIRAOQPG-SOUVJXGZSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- XIJOPMSILDNVNJ-ZVZYQTTQSA-N Glu-Val-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIJOPMSILDNVNJ-ZVZYQTTQSA-N 0.000 description 2
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 2
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 2
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 2
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 2
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 2
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 2
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 2
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 2
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 2
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 2
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 2
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 2
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 2
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 2
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 2
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 2
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 2
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 2
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 2
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 2
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 2
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 2
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 2
- ICUTTWWCDIIIEE-BQBZGAKWSA-N Gly-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN ICUTTWWCDIIIEE-BQBZGAKWSA-N 0.000 description 2
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 2
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 2
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 2
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 2
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 2
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 2
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 2
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 2
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 2
- TTZAWSKKNCEINZ-AVGNSLFASA-N His-Arg-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O TTZAWSKKNCEINZ-AVGNSLFASA-N 0.000 description 2
- OBTMRGFRLJBSFI-GARJFASQSA-N His-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OBTMRGFRLJBSFI-GARJFASQSA-N 0.000 description 2
- WTJBVCUCLWFGAH-JUKXBJQTSA-N His-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WTJBVCUCLWFGAH-JUKXBJQTSA-N 0.000 description 2
- IWXMHXYOACDSIA-PYJNHQTQSA-N His-Ile-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O IWXMHXYOACDSIA-PYJNHQTQSA-N 0.000 description 2
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 2
- LDFWDDVELNOGII-MXAVVETBSA-N His-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N LDFWDDVELNOGII-MXAVVETBSA-N 0.000 description 2
- ZUELLZFHJUPFEC-PMVMPFDFSA-N His-Phe-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CN=CN1 ZUELLZFHJUPFEC-PMVMPFDFSA-N 0.000 description 2
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 2
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 2
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 2
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 2
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 2
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 2
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 2
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 2
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 2
- LEDRIAHEWDJRMF-CFMVVWHZSA-N Ile-Asn-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LEDRIAHEWDJRMF-CFMVVWHZSA-N 0.000 description 2
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 2
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 2
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 2
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 2
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 2
- APDIECQNNDGFPD-PYJNHQTQSA-N Ile-His-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N APDIECQNNDGFPD-PYJNHQTQSA-N 0.000 description 2
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 2
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 2
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 2
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 2
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 2
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 2
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 2
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 2
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 2
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 2
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 2
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 2
- QQVXERGIFIRCGW-NAKRPEOUSA-N Ile-Ser-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N QQVXERGIFIRCGW-NAKRPEOUSA-N 0.000 description 2
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 2
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 2
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 2
- IPFKIGNDTUOFAF-CYDGBPFRSA-N Ile-Val-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IPFKIGNDTUOFAF-CYDGBPFRSA-N 0.000 description 2
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- 101001110310 Lentilactobacillus kefiri NADP-dependent (R)-specific alcohol dehydrogenase Proteins 0.000 description 2
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 2
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 2
- YORLGJINWYYIMX-KKUMJFAQSA-N Leu-Cys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YORLGJINWYYIMX-KKUMJFAQSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 2
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 2
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 2
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 2
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 2
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 2
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 2
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 2
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 2
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 2
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 2
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 2
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 2
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 2
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 2
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 2
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 2
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 2
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 2
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 2
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 2
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 2
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 2
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- GGNOBVSOZPHLCE-GUBZILKMSA-N Lys-Gln-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GGNOBVSOZPHLCE-GUBZILKMSA-N 0.000 description 2
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 2
- CRNNMTHBMRFQNG-GUBZILKMSA-N Lys-Glu-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N CRNNMTHBMRFQNG-GUBZILKMSA-N 0.000 description 2
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 2
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 2
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 2
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 2
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 2
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 2
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 2
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 2
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 2
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 2
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 2
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 2
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 2
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 2
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 2
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 2
- BXNZDLVLGYYFIB-FXQIFTODSA-N Met-Asn-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BXNZDLVLGYYFIB-FXQIFTODSA-N 0.000 description 2
- JQECLVNLAZGHRQ-CIUDSAMLSA-N Met-Asp-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O JQECLVNLAZGHRQ-CIUDSAMLSA-N 0.000 description 2
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 2
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 2
- YAWKHFKCNSXYDS-XIRDDKMYSA-N Met-Glu-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N YAWKHFKCNSXYDS-XIRDDKMYSA-N 0.000 description 2
- SLQDSYZHHOKQSR-QXEWZRGKSA-N Met-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCSC SLQDSYZHHOKQSR-QXEWZRGKSA-N 0.000 description 2
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 2
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 2
- KBTQZYASLSUFJR-KKUMJFAQSA-N Met-Phe-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KBTQZYASLSUFJR-KKUMJFAQSA-N 0.000 description 2
- RSOMVHWMIAZNLE-HJWJTTGWSA-N Met-Phe-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSOMVHWMIAZNLE-HJWJTTGWSA-N 0.000 description 2
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 2
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 2
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 2
- 241000592260 Moritella Species 0.000 description 2
- 101100202339 Mus musculus Slc6a13 gene Proteins 0.000 description 2
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 2
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 2
- FGXIJNMDRCZVDE-KKUMJFAQSA-N Phe-Cys-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N FGXIJNMDRCZVDE-KKUMJFAQSA-N 0.000 description 2
- ZFVWWUILVLLVFA-AVGNSLFASA-N Phe-Gln-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N ZFVWWUILVLLVFA-AVGNSLFASA-N 0.000 description 2
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 2
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 2
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 2
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 2
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 2
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 2
- XEXSSIBQYNKFBX-KBPBESRZSA-N Phe-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=CC=C1 XEXSSIBQYNKFBX-KBPBESRZSA-N 0.000 description 2
- MYQCCQSMKNCNKY-KKUMJFAQSA-N Phe-His-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O)N MYQCCQSMKNCNKY-KKUMJFAQSA-N 0.000 description 2
- FXPZZKBHNOMLGA-HJWJTTGWSA-N Phe-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FXPZZKBHNOMLGA-HJWJTTGWSA-N 0.000 description 2
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 2
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 2
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 2
- RTUWVJVJSMOGPL-KKUMJFAQSA-N Phe-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RTUWVJVJSMOGPL-KKUMJFAQSA-N 0.000 description 2
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 2
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 2
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 2
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 2
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- KDIIENQUNVNWHR-JYJNAYRXSA-N Pro-Arg-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KDIIENQUNVNWHR-JYJNAYRXSA-N 0.000 description 2
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 2
- DRIJZWBRGMJCDD-DCAQKATOSA-N Pro-Gln-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O DRIJZWBRGMJCDD-DCAQKATOSA-N 0.000 description 2
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 2
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 2
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 2
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 2
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 2
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 description 2
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 2
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 2
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 2
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 2
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 2
- RJTUIDFUUHPJMP-FHWLQOOXSA-N Pro-Trp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CN=CN4)C(=O)O RJTUIDFUUHPJMP-FHWLQOOXSA-N 0.000 description 2
- SNSYSBUTTJBPDG-OKZBNKHCSA-N Pro-Trp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N4CCC[C@@H]4C(=O)O SNSYSBUTTJBPDG-OKZBNKHCSA-N 0.000 description 2
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 2
- 101000758676 Pyrococcus woesei Uncharacterized 24.7 kDa protein in gap 5'region Proteins 0.000 description 2
- 101100202330 Rattus norvegicus Slc6a11 gene Proteins 0.000 description 2
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 2
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 2
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 2
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 2
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 2
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 2
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 2
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 2
- COLJZWUVZIXSSS-CIUDSAMLSA-N Ser-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N COLJZWUVZIXSSS-CIUDSAMLSA-N 0.000 description 2
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 2
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 2
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 2
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 2
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 2
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 2
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 2
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 2
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 2
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 2
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 2
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- XPVIVVLLLOFBRH-XIRDDKMYSA-N Ser-Trp-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)CO)C(O)=O XPVIVVLLLOFBRH-XIRDDKMYSA-N 0.000 description 2
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 2
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- 102100029437 Serine/threonine-protein kinase A-Raf Human genes 0.000 description 2
- 101000691656 Streptomyces venezuelae Narbonolide/10-deoxymethynolide synthase PikA1, modules 1 and 2 Proteins 0.000 description 2
- 101000691655 Streptomyces venezuelae Narbonolide/10-deoxymethynolide synthase PikA2, modules 3 and 4 Proteins 0.000 description 2
- 101000691658 Streptomyces venezuelae Narbonolide/10-deoxymethynolide synthase PikA3, module 5 Proteins 0.000 description 2
- 101001125873 Streptomyces venezuelae Narbonolide/10-deoxymethynolide synthase PikA4, module 6 Proteins 0.000 description 2
- 108010006785 Taq Polymerase Proteins 0.000 description 2
- 108020005038 Terminator Codon Proteins 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 2
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 2
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 2
- LKEKWDJCJSPXNI-IRIUXVKKSA-N Thr-Glu-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LKEKWDJCJSPXNI-IRIUXVKKSA-N 0.000 description 2
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 2
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 2
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 2
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 2
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 2
- PCMDGXKXVMBIFP-VEVYYDQMSA-N Thr-Met-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMDGXKXVMBIFP-VEVYYDQMSA-N 0.000 description 2
- CGCMNOIQVAXYMA-UNQGMJICSA-N Thr-Met-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CGCMNOIQVAXYMA-UNQGMJICSA-N 0.000 description 2
- GUHLYMZJVXUIPO-RCWTZXSCSA-N Thr-Met-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GUHLYMZJVXUIPO-RCWTZXSCSA-N 0.000 description 2
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 2
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 2
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 2
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 2
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 2
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 2
- 241000723873 Tobacco mosaic virus Species 0.000 description 2
- 102000004357 Transferases Human genes 0.000 description 2
- 108090000992 Transferases Proteins 0.000 description 2
- WQYPAGQDXAJNED-AAEUAGOBSA-N Trp-Cys-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N WQYPAGQDXAJNED-AAEUAGOBSA-N 0.000 description 2
- BIBZRFIKOLGWFQ-XIRDDKMYSA-N Trp-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O BIBZRFIKOLGWFQ-XIRDDKMYSA-N 0.000 description 2
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 2
- CDRYEAWHKJSGAF-BPNCWPANSA-N Tyr-Ala-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O CDRYEAWHKJSGAF-BPNCWPANSA-N 0.000 description 2
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 2
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 2
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 2
- IJUTXXAXQODRMW-KBPBESRZSA-N Tyr-Gly-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O IJUTXXAXQODRMW-KBPBESRZSA-N 0.000 description 2
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 2
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 2
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 2
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 2
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 2
- AUZADXNWQMBZOO-JYJNAYRXSA-N Tyr-Pro-Arg Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 AUZADXNWQMBZOO-JYJNAYRXSA-N 0.000 description 2
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 2
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 2
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 2
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 2
- SMUWZUSWMWVOSL-JYJNAYRXSA-N Tyr-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SMUWZUSWMWVOSL-JYJNAYRXSA-N 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 2
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 2
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 2
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 2
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 2
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 2
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 2
- WBUOKGBHGDPYMH-GUBZILKMSA-N Val-Cys-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)C(C)C WBUOKGBHGDPYMH-GUBZILKMSA-N 0.000 description 2
- OXVPMZVGCAPFIG-BQFCYCMXSA-N Val-Gln-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N OXVPMZVGCAPFIG-BQFCYCMXSA-N 0.000 description 2
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 2
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 2
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 2
- YTUABZMPYKCWCQ-XQQFMLRXSA-N Val-His-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N YTUABZMPYKCWCQ-XQQFMLRXSA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 2
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 2
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 2
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 2
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 2
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 2
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 2
- IRAUYEAFPFPVND-UVBJJODRSA-N Val-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 IRAUYEAFPFPVND-UVBJJODRSA-N 0.000 description 2
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 2
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 2
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 2
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- 108010087049 alanyl-alanyl-prolyl-valine Proteins 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 229940088710 antibiotic agent Drugs 0.000 description 2
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010036533 arginylvaline Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 210000000170 cell membrane Anatomy 0.000 description 2
- 239000000460 chlorine Substances 0.000 description 2
- 210000003763 chloroplast Anatomy 0.000 description 2
- 239000013611 chromosomal DNA Substances 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000006482 condensation reaction Methods 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 108010022240 delta-8 fatty acid desaturase Proteins 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 108010060455 des-Tyr- beta-casomorphin Proteins 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 235000004626 essential fatty acids Nutrition 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 235000019867 fractionated palm kernal oil Nutrition 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 235000020664 gamma-linolenic acid Nutrition 0.000 description 2
- VZCCETWTMQHEPK-QNEBEIHSSA-N gamma-linolenic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/CCCCC(O)=O VZCCETWTMQHEPK-QNEBEIHSSA-N 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 230000035764 nutrition Effects 0.000 description 2
- 229940012843 omega-3 fatty acid Drugs 0.000 description 2
- 229940033080 omega-6 fatty acid Drugs 0.000 description 2
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 108010025488 pinealon Proteins 0.000 description 2
- 239000002244 precipitate Substances 0.000 description 2
- 230000009465 prokaryotic expression Effects 0.000 description 2
- 235000013930 proline Nutrition 0.000 description 2
- 239000011541 reaction mixture Substances 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 230000005026 transcription initiation Effects 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 108010029384 tryptophyl-histidine Proteins 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 2
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- DVSZKTAMJJTWFG-SKCDLICFSA-N (2e,4e,6e,8e,10e,12e)-docosa-2,4,6,8,10,12-hexaenoic acid Chemical compound CCCCCCCCC\C=C\C=C\C=C\C=C\C=C\C=C\C(O)=O DVSZKTAMJJTWFG-SKCDLICFSA-N 0.000 description 1
- RRBGTUQJDFBWNN-MUGJNUQGSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-2,6-diaminohexanoyl]amino]hexanoyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O RRBGTUQJDFBWNN-MUGJNUQGSA-N 0.000 description 1
- WRIDQFICGBMAFQ-UHFFFAOYSA-N (E)-8-Octadecenoic acid Natural products CCCCCCCCCC=CCCCCCCC(O)=O WRIDQFICGBMAFQ-UHFFFAOYSA-N 0.000 description 1
- SKPQXOSVPKPXML-ULQDDVLXSA-N 2-[[(2s)-1-[(2s)-3-phenyl-2-[[(2s)-pyrrolidine-2-carbonyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]acetic acid Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H]1NCCC1)CC1=CC=CC=C1 SKPQXOSVPKPXML-ULQDDVLXSA-N 0.000 description 1
- QVOBNSFUVPLVPE-ROUUACIJSA-N 2-[[(2s)-2-[[2-[[(2s)-2-amino-3-phenylpropanoyl]amino]acetyl]amino]-3-phenylpropanoyl]amino]acetic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 QVOBNSFUVPLVPE-ROUUACIJSA-N 0.000 description 1
- LQJBNNIYVWPHFW-UHFFFAOYSA-N 20:1omega9c fatty acid Natural products CCCCCCCCCCC=CCCCCCCCC(O)=O LQJBNNIYVWPHFW-UHFFFAOYSA-N 0.000 description 1
- ALYNCZNDIQEVRV-UHFFFAOYSA-N 4-aminobenzoic acid Chemical compound NC1=CC=C(C(O)=O)C=C1 ALYNCZNDIQEVRV-UHFFFAOYSA-N 0.000 description 1
- GZJLLYHBALOKEX-UHFFFAOYSA-N 6-Ketone, O18-Me-Ussuriedine Natural products CC=CCC=CCC=CCC=CCC=CCC=CCCCC(O)=O GZJLLYHBALOKEX-UHFFFAOYSA-N 0.000 description 1
- QSBYPNXLFMSGKH-UHFFFAOYSA-N 9-Heptadecensaeure Natural products CCCCCCCC=CCCCCCCCC(O)=O QSBYPNXLFMSGKH-UHFFFAOYSA-N 0.000 description 1
- 102000009153 ACT domains Human genes 0.000 description 1
- 108050000029 ACT domains Proteins 0.000 description 1
- 241000224424 Acanthamoeba sp. Species 0.000 description 1
- 241000228431 Acremonium chrysogenum Species 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- JYEBJTDTPNKQJG-FXQIFTODSA-N Ala-Asn-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N JYEBJTDTPNKQJG-FXQIFTODSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- 108010040956 Ala-Asp-Glu-Leu Proteins 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 1
- ZDYNWWQXFRUOEO-XDTLVQLUSA-N Ala-Gln-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDYNWWQXFRUOEO-XDTLVQLUSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 1
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 1
- ZPXCNXMJEZKRLU-LSJOCFKGSA-N Ala-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 ZPXCNXMJEZKRLU-LSJOCFKGSA-N 0.000 description 1
- FDAZDMAFZYTHGS-XVYDVKMFSA-N Ala-His-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FDAZDMAFZYTHGS-XVYDVKMFSA-N 0.000 description 1
- GRPHQEMIFDPKOE-HGNGGELXSA-N Ala-His-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GRPHQEMIFDPKOE-HGNGGELXSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 1
- RUQBGIMJQUWXPP-CYDGBPFRSA-N Ala-Leu-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O RUQBGIMJQUWXPP-CYDGBPFRSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 1
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 1
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- BHTBAVZSZCQZPT-GUBZILKMSA-N Ala-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N BHTBAVZSZCQZPT-GUBZILKMSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- UCDOXFBTMLKASE-HERUPUMHSA-N Ala-Ser-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N UCDOXFBTMLKASE-HERUPUMHSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 1
- XMIAMUXIMWREBJ-HERUPUMHSA-N Ala-Trp-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XMIAMUXIMWREBJ-HERUPUMHSA-N 0.000 description 1
- FSXDWQGEWZQBPJ-HERUPUMHSA-N Ala-Trp-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FSXDWQGEWZQBPJ-HERUPUMHSA-N 0.000 description 1
- IDLBLNBDLCTPGC-HERUPUMHSA-N Ala-Trp-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CS)C(=O)O)N IDLBLNBDLCTPGC-HERUPUMHSA-N 0.000 description 1
- YXXPVUOMPSZURS-ZLIFDBKOSA-N Ala-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 YXXPVUOMPSZURS-ZLIFDBKOSA-N 0.000 description 1
- SFPRJVVDZNLUTG-OWLDWWDNSA-N Ala-Trp-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFPRJVVDZNLUTG-OWLDWWDNSA-N 0.000 description 1
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- XKXAZPSREVUCRT-BPNCWPANSA-N Ala-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=C(O)C=C1 XKXAZPSREVUCRT-BPNCWPANSA-N 0.000 description 1
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- 241001136782 Alca Species 0.000 description 1
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 1
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 1
- 101000758020 Alkalihalobacillus pseudofirmus (strain ATCC BAA-2126 / JCM 17055 / OF4) Uncharacterized aminotransferase BpOF4_10225 Proteins 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 241000143060 Americamysis bahia Species 0.000 description 1
- 102100030343 Antigen peptide transporter 2 Human genes 0.000 description 1
- 241000003610 Aplanochytrium Species 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 1
- JTKLCCFLSLCCST-SZMVWBNQSA-N Arg-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JTKLCCFLSLCCST-SZMVWBNQSA-N 0.000 description 1
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 1
- CPSHGRGUPZBMOK-CIUDSAMLSA-N Arg-Asn-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CPSHGRGUPZBMOK-CIUDSAMLSA-N 0.000 description 1
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- ASQYTJJWAMDISW-BPUTZDHNSA-N Arg-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N ASQYTJJWAMDISW-BPUTZDHNSA-N 0.000 description 1
- DQNLFLGFZAUIOW-FXQIFTODSA-N Arg-Cys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DQNLFLGFZAUIOW-FXQIFTODSA-N 0.000 description 1
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- DJAIOAKQIOGULM-DCAQKATOSA-N Arg-Glu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O DJAIOAKQIOGULM-DCAQKATOSA-N 0.000 description 1
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- XUUXCWCKKCZEAW-YFKPBYRVSA-N Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N XUUXCWCKKCZEAW-YFKPBYRVSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 1
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 1
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 1
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 1
- KZXPVYVSHUJCEO-ULQDDVLXSA-N Arg-Phe-Lys Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 KZXPVYVSHUJCEO-ULQDDVLXSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- JJIBHAOBNIFUEL-SRVKXCTJSA-N Arg-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCN=C(N)N)N JJIBHAOBNIFUEL-SRVKXCTJSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- DRDWXKWUSIKKOB-PJODQICGSA-N Arg-Trp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O DRDWXKWUSIKKOB-PJODQICGSA-N 0.000 description 1
- QUBKBPZGMZWOKQ-SZMVWBNQSA-N Arg-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QUBKBPZGMZWOKQ-SZMVWBNQSA-N 0.000 description 1
- FSPQNLYOFCXUCE-BPUTZDHNSA-N Arg-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FSPQNLYOFCXUCE-BPUTZDHNSA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 1
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 1
- AYZAWXAPBAYCHO-CIUDSAMLSA-N Asn-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N AYZAWXAPBAYCHO-CIUDSAMLSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 1
- GMCOADLDNLGOFE-ZLUOBGJFSA-N Asn-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N GMCOADLDNLGOFE-ZLUOBGJFSA-N 0.000 description 1
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- HLTLEIXYIJDFOY-ZLUOBGJFSA-N Asn-Cys-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O HLTLEIXYIJDFOY-ZLUOBGJFSA-N 0.000 description 1
- LUVODTFFSXVOAG-ACZMJKKPSA-N Asn-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N LUVODTFFSXVOAG-ACZMJKKPSA-N 0.000 description 1
- QGNXYDHVERJIAY-ACZMJKKPSA-N Asn-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGNXYDHVERJIAY-ACZMJKKPSA-N 0.000 description 1
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- OWUCNXMFJRFOFI-BQBZGAKWSA-N Asn-Gly-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OWUCNXMFJRFOFI-BQBZGAKWSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- MHBUWPFQNPJTAS-QAETUUGQSA-N Asn-Leu-Phe-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 MHBUWPFQNPJTAS-QAETUUGQSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- NUCUBYIUPVYGPP-XIRDDKMYSA-N Asn-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(N)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O NUCUBYIUPVYGPP-XIRDDKMYSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- NTWOPSIUJBMNRI-KKUMJFAQSA-N Asn-Lys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTWOPSIUJBMNRI-KKUMJFAQSA-N 0.000 description 1
- QDXQWFBLUVTOFL-FXQIFTODSA-N Asn-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)N)N QDXQWFBLUVTOFL-FXQIFTODSA-N 0.000 description 1
- VOKWBBBXJONREA-DCAQKATOSA-N Asn-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N VOKWBBBXJONREA-DCAQKATOSA-N 0.000 description 1
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 1
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- QIRJQYQOIKBPBZ-IHRRRGAJSA-N Asn-Tyr-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QIRJQYQOIKBPBZ-IHRRRGAJSA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- ZVTDYGWRRPMFCL-WFBYXXMGSA-N Asp-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N ZVTDYGWRRPMFCL-WFBYXXMGSA-N 0.000 description 1
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- HTOZUYZQPICRAP-BPUTZDHNSA-N Asp-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N HTOZUYZQPICRAP-BPUTZDHNSA-N 0.000 description 1
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- VHWNKSJHQFZJTH-FXQIFTODSA-N Asp-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N VHWNKSJHQFZJTH-FXQIFTODSA-N 0.000 description 1
- MJKBOVWWADWLHV-ZLUOBGJFSA-N Asp-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)O MJKBOVWWADWLHV-ZLUOBGJFSA-N 0.000 description 1
- QQXOYLWJQUPXJU-WHFBIAKZSA-N Asp-Cys-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O QQXOYLWJQUPXJU-WHFBIAKZSA-N 0.000 description 1
- ACEDJCOOPZFUBU-CIUDSAMLSA-N Asp-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N ACEDJCOOPZFUBU-CIUDSAMLSA-N 0.000 description 1
- KVPHTGVUMJGMCX-BIIVOSGPSA-N Asp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)C(=O)O KVPHTGVUMJGMCX-BIIVOSGPSA-N 0.000 description 1
- DZQKLNLLWFQONU-LKXGYXEUSA-N Asp-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)O DZQKLNLLWFQONU-LKXGYXEUSA-N 0.000 description 1
- LXKLDWVHXNZQGB-SRVKXCTJSA-N Asp-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)O LXKLDWVHXNZQGB-SRVKXCTJSA-N 0.000 description 1
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 1
- UFAQGGZUXVLONR-AVGNSLFASA-N Asp-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)O UFAQGGZUXVLONR-AVGNSLFASA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 1
- BIVYLQMZPHDUIH-WHFBIAKZSA-N Asp-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)O BIVYLQMZPHDUIH-WHFBIAKZSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 1
- LBFYTUPYYZENIR-GHCJXIJMSA-N Asp-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N LBFYTUPYYZENIR-GHCJXIJMSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- QBJCJWAZOPCNIX-JPLJXNOCSA-N Asp-Leu-Phe-Val Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 QBJCJWAZOPCNIX-JPLJXNOCSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- AKKUDRZKFZWPBH-SRVKXCTJSA-N Asp-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N AKKUDRZKFZWPBH-SRVKXCTJSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 1
- WWOYXVBGHAHQBG-FXQIFTODSA-N Asp-Met-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O WWOYXVBGHAHQBG-FXQIFTODSA-N 0.000 description 1
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 1
- SJLDOGLMVPHPLZ-IHRRRGAJSA-N Asp-Met-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SJLDOGLMVPHPLZ-IHRRRGAJSA-N 0.000 description 1
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- YRZIYQGXTSBRLT-AVGNSLFASA-N Asp-Phe-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YRZIYQGXTSBRLT-AVGNSLFASA-N 0.000 description 1
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 1
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 1
- NBKLEMWHDLAUEM-CIUDSAMLSA-N Asp-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N NBKLEMWHDLAUEM-CIUDSAMLSA-N 0.000 description 1
- OZBXOELNJBSJOA-UBHSHLNASA-N Asp-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OZBXOELNJBSJOA-UBHSHLNASA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 1
- CXEFNHOVIIDHFU-IHPCNDPISA-N Asp-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N CXEFNHOVIIDHFU-IHPCNDPISA-N 0.000 description 1
- KNDCWFXCFKSEBM-AVGNSLFASA-N Asp-Tyr-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KNDCWFXCFKSEBM-AVGNSLFASA-N 0.000 description 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 1
- OQMGSMNZVHYDTQ-ZKWXMUAHSA-N Asp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N OQMGSMNZVHYDTQ-ZKWXMUAHSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- GYNUXDMCDILYIQ-QRTARXTBSA-N Asp-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N GYNUXDMCDILYIQ-QRTARXTBSA-N 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 241000606125 Bacteroides Species 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- 102100021277 Beta-secretase 2 Human genes 0.000 description 1
- 101710150190 Beta-secretase 2 Proteins 0.000 description 1
- 244000178993 Brassica juncea Species 0.000 description 1
- 235000011332 Brassica juncea Nutrition 0.000 description 1
- 235000014700 Brassica juncea var napiformis Nutrition 0.000 description 1
- 101000946068 Caenorhabditis elegans Ceramide glucosyltransferase 3 Proteins 0.000 description 1
- 101100275473 Caenorhabditis elegans ctc-3 gene Proteins 0.000 description 1
- 102100027667 Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 2 Human genes 0.000 description 1
- ZAMOUSCENKQFHK-UHFFFAOYSA-N Chlorine atom Chemical compound [Cl] ZAMOUSCENKQFHK-UHFFFAOYSA-N 0.000 description 1
- 208000017667 Chronic Disease Diseases 0.000 description 1
- 235000010523 Cicer arietinum Nutrition 0.000 description 1
- 244000045195 Cicer arietinum Species 0.000 description 1
- 101000744710 Clostridium pasteurianum Uncharacterized glutaredoxin-like 8.6 kDa protein in rubredoxin operon Proteins 0.000 description 1
- RGJOEKWQDUBAIZ-IBOSZNHHSA-N CoASH Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-IBOSZNHHSA-N 0.000 description 1
- 102100032182 Crooked neck-like protein 1 Human genes 0.000 description 1
- 241000199913 Crypthecodinium Species 0.000 description 1
- 241000199912 Crypthecodinium cohnii Species 0.000 description 1
- AMRLSQGGERHDHJ-FXQIFTODSA-N Cys-Ala-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMRLSQGGERHDHJ-FXQIFTODSA-N 0.000 description 1
- GRNOCLDFUNCIDW-ACZMJKKPSA-N Cys-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N GRNOCLDFUNCIDW-ACZMJKKPSA-N 0.000 description 1
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 1
- UKVGHFORADMBEN-GUBZILKMSA-N Cys-Arg-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UKVGHFORADMBEN-GUBZILKMSA-N 0.000 description 1
- DEVDFMRWZASYOF-ZLUOBGJFSA-N Cys-Asn-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DEVDFMRWZASYOF-ZLUOBGJFSA-N 0.000 description 1
- WVJHEDOLHPZLRV-CIUDSAMLSA-N Cys-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N WVJHEDOLHPZLRV-CIUDSAMLSA-N 0.000 description 1
- SBMGKDLRJLYZCU-BIIVOSGPSA-N Cys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N)C(=O)O SBMGKDLRJLYZCU-BIIVOSGPSA-N 0.000 description 1
- CPTUXCUWQIBZIF-ZLUOBGJFSA-N Cys-Asn-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CPTUXCUWQIBZIF-ZLUOBGJFSA-N 0.000 description 1
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 1
- WXKWQSDHEXKKNC-ZKWXMUAHSA-N Cys-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N WXKWQSDHEXKKNC-ZKWXMUAHSA-N 0.000 description 1
- PFAQXUDMZVMADG-AVGNSLFASA-N Cys-Gln-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PFAQXUDMZVMADG-AVGNSLFASA-N 0.000 description 1
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 1
- RWAZRMXTVSIVJR-YUMQZZPRSA-N Cys-Gly-His Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CNC=N1)C(O)=O RWAZRMXTVSIVJR-YUMQZZPRSA-N 0.000 description 1
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 description 1
- IZUNQDRIAOLWCN-YUMQZZPRSA-N Cys-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N IZUNQDRIAOLWCN-YUMQZZPRSA-N 0.000 description 1
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 1
- XCDDSPYIMNXECQ-NAKRPEOUSA-N Cys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CS XCDDSPYIMNXECQ-NAKRPEOUSA-N 0.000 description 1
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 1
- ZGERHCJBLPQPGV-ACZMJKKPSA-N Cys-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N ZGERHCJBLPQPGV-ACZMJKKPSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- JTEGHEWKBCTIAL-IXOXFDKPSA-N Cys-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N)O JTEGHEWKBCTIAL-IXOXFDKPSA-N 0.000 description 1
- IQXSTXKVEMRMMB-XAVMHZPKSA-N Cys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N)O IQXSTXKVEMRMMB-XAVMHZPKSA-N 0.000 description 1
- UGPCUUWZXRMCIJ-KKUMJFAQSA-N Cys-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CS)N UGPCUUWZXRMCIJ-KKUMJFAQSA-N 0.000 description 1
- VRJZMZGGAKVSIQ-SRVKXCTJSA-N Cys-Tyr-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VRJZMZGGAKVSIQ-SRVKXCTJSA-N 0.000 description 1
- VXDXZGYXHIADHF-YJRXYDGGSA-N Cys-Tyr-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VXDXZGYXHIADHF-YJRXYDGGSA-N 0.000 description 1
- DGQJGBDBFVGLGL-ZKWXMUAHSA-N Cys-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DGQJGBDBFVGLGL-ZKWXMUAHSA-N 0.000 description 1
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 1
- 238000007400 DNA extraction Methods 0.000 description 1
- 101710088194 Dehydrogenase Proteins 0.000 description 1
- 241000199914 Dinophyceae Species 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 102000002322 Egg Proteins Human genes 0.000 description 1
- 108010000912 Egg Proteins Proteins 0.000 description 1
- 101000653283 Enterobacteria phage T4 Uncharacterized 11.5 kDa protein in Gp31-cd intergenic region Proteins 0.000 description 1
- 101000618324 Enterobacteria phage T4 Uncharacterized 7.9 kDa protein in mobB-Gp55 intergenic region Proteins 0.000 description 1
- 241001646716 Escherichia coli K-12 Species 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 241000195620 Euglena Species 0.000 description 1
- 108010039731 Fatty Acid Synthases Proteins 0.000 description 1
- 108010087894 Fatty acid desaturases Proteins 0.000 description 1
- 241000589565 Flavobacterium Species 0.000 description 1
- 101150094690 GAL1 gene Proteins 0.000 description 1
- 101150038242 GAL10 gene Proteins 0.000 description 1
- 102100028501 Galanin peptides Human genes 0.000 description 1
- 102100024637 Galectin-10 Human genes 0.000 description 1
- 102100039555 Galectin-7 Human genes 0.000 description 1
- 241000192128 Gammaproteobacteria Species 0.000 description 1
- 241000702463 Geminiviridae Species 0.000 description 1
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 1
- DTCCMDYODDPHBG-ACZMJKKPSA-N Gln-Ala-Cys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O DTCCMDYODDPHBG-ACZMJKKPSA-N 0.000 description 1
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 1
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 1
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 1
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 1
- CKNUKHBRCSMKMO-XHNCKOQMSA-N Gln-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O CKNUKHBRCSMKMO-XHNCKOQMSA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 1
- FJAYYNIXQNERSO-ACZMJKKPSA-N Gln-Cys-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FJAYYNIXQNERSO-ACZMJKKPSA-N 0.000 description 1
- CXFUMJQFZVCETK-FXQIFTODSA-N Gln-Cys-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O CXFUMJQFZVCETK-FXQIFTODSA-N 0.000 description 1
- QFTRCUPCARNIPZ-XHNCKOQMSA-N Gln-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)C(=O)O QFTRCUPCARNIPZ-XHNCKOQMSA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 1
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 1
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 1
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 1
- YXQCLIVLWCKCRS-RYUDHWBXSA-N Gln-Gly-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N)O YXQCLIVLWCKCRS-RYUDHWBXSA-N 0.000 description 1
- NROSLUJMIQGFKS-IUCAKERBSA-N Gln-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N NROSLUJMIQGFKS-IUCAKERBSA-N 0.000 description 1
- NNXIQPMZGZUFJJ-AVGNSLFASA-N Gln-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N NNXIQPMZGZUFJJ-AVGNSLFASA-N 0.000 description 1
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 1
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- GQZDDFRXSDGUNG-YVNDNENWSA-N Gln-Ile-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O GQZDDFRXSDGUNG-YVNDNENWSA-N 0.000 description 1
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- GURIQZQSTBBHRV-SRVKXCTJSA-N Gln-Lys-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GURIQZQSTBBHRV-SRVKXCTJSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- QMVCEWKHIUHTSD-GUBZILKMSA-N Gln-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QMVCEWKHIUHTSD-GUBZILKMSA-N 0.000 description 1
- KFHASAPTUOASQN-JYJNAYRXSA-N Gln-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KFHASAPTUOASQN-JYJNAYRXSA-N 0.000 description 1
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 1
- UTOQQOMEJDPDMX-ACZMJKKPSA-N Gln-Ser-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O UTOQQOMEJDPDMX-ACZMJKKPSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- OKQLXOYFUPVEHI-CIUDSAMLSA-N Gln-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N OKQLXOYFUPVEHI-CIUDSAMLSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 1
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- RGNMNWULPAYDAH-JSGCOSHPSA-N Gln-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N RGNMNWULPAYDAH-JSGCOSHPSA-N 0.000 description 1
- BETSEXMYBWCDAE-SZMVWBNQSA-N Gln-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BETSEXMYBWCDAE-SZMVWBNQSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- UQKVUFGUSVYJMQ-IRIUXVKKSA-N Gln-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N)O UQKVUFGUSVYJMQ-IRIUXVKKSA-N 0.000 description 1
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 1
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- GJLXZITZLUUXMJ-NHCYSSNCSA-N Gln-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GJLXZITZLUUXMJ-NHCYSSNCSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- NADWTMLCUDMDQI-ACZMJKKPSA-N Glu-Asp-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NADWTMLCUDMDQI-ACZMJKKPSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 1
- UMHRCVCZUPBBQW-GARJFASQSA-N Glu-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UMHRCVCZUPBBQW-GARJFASQSA-N 0.000 description 1
- YHOJJFFTSMWVGR-HJGDQZAQSA-N Glu-Met-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YHOJJFFTSMWVGR-HJGDQZAQSA-N 0.000 description 1
- UERORLSAFUHDGU-AVGNSLFASA-N Glu-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UERORLSAFUHDGU-AVGNSLFASA-N 0.000 description 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 1
- CHDWDBPJOZVZSE-KKUMJFAQSA-N Glu-Phe-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CHDWDBPJOZVZSE-KKUMJFAQSA-N 0.000 description 1
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 1
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- ARIORLIIMJACKZ-KKUMJFAQSA-N Glu-Pro-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ARIORLIIMJACKZ-KKUMJFAQSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 1
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- MIWJDJAMMKHUAR-ZVZYQTTQSA-N Glu-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N MIWJDJAMMKHUAR-ZVZYQTTQSA-N 0.000 description 1
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 description 1
- 102100022624 Glucoamylase Human genes 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- PYUCNHJQQVSPGN-BQBZGAKWSA-N Gly-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)CN=C(N)N PYUCNHJQQVSPGN-BQBZGAKWSA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 1
- MXXXVOYFNVJHMA-IUCAKERBSA-N Gly-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN MXXXVOYFNVJHMA-IUCAKERBSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 1
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- GNBMOZPQUXTCRW-STQMWFEESA-N Gly-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)CN)C(O)=O)=CNC2=C1 GNBMOZPQUXTCRW-STQMWFEESA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 1
- GYAUWXXORNTCHU-QWRGUYRKSA-N Gly-Cys-Tyr Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 GYAUWXXORNTCHU-QWRGUYRKSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- NTOWAXLMQFKJPT-YUMQZZPRSA-N Gly-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN NTOWAXLMQFKJPT-YUMQZZPRSA-N 0.000 description 1
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 1
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- IVSWQHKONQIOHA-YUMQZZPRSA-N Gly-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN IVSWQHKONQIOHA-YUMQZZPRSA-N 0.000 description 1
- CQIIXEHDSZUSAG-QWRGUYRKSA-N Gly-His-His Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 CQIIXEHDSZUSAG-QWRGUYRKSA-N 0.000 description 1
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 1
- UUWOBINZFGTFMS-UWVGGRQHSA-N Gly-His-Met Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O UUWOBINZFGTFMS-UWVGGRQHSA-N 0.000 description 1
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 1
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 1
- LPHQAFLNEHWKFF-QXEWZRGKSA-N Gly-Met-Ile Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LPHQAFLNEHWKFF-QXEWZRGKSA-N 0.000 description 1
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- ISSDODCYBOWWIP-GJZGRUSLSA-N Gly-Pro-Trp Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISSDODCYBOWWIP-GJZGRUSLSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- UMRIXLHPZZIOML-OALUTQOASA-N Gly-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN UMRIXLHPZZIOML-OALUTQOASA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 1
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- FULZDMOZUZKGQU-ONGXEEELSA-N Gly-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN FULZDMOZUZKGQU-ONGXEEELSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- 241000208818 Helianthus Species 0.000 description 1
- 235000003222 Helianthus annuus Nutrition 0.000 description 1
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 1
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 description 1
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 1
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 1
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 1
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 1
- LSQHWKPPOFDHHZ-YUMQZZPRSA-N His-Asp-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LSQHWKPPOFDHHZ-YUMQZZPRSA-N 0.000 description 1
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 1
- WGHJXSONOOTTCZ-JYJNAYRXSA-N His-Glu-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WGHJXSONOOTTCZ-JYJNAYRXSA-N 0.000 description 1
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 1
- ZUPVLBAXUUGKKN-VHSXEESVSA-N His-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CN=CN2)N)C(=O)O ZUPVLBAXUUGKKN-VHSXEESVSA-N 0.000 description 1
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 1
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 1
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 1
- LQGCNWWLGGMTJO-ULQDDVLXSA-N His-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N LQGCNWWLGGMTJO-ULQDDVLXSA-N 0.000 description 1
- YIGCZZKZFMNSIU-RWMBFGLXSA-N His-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N YIGCZZKZFMNSIU-RWMBFGLXSA-N 0.000 description 1
- SAPLASXFNUYUFE-CQDKDKBSSA-N His-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N SAPLASXFNUYUFE-CQDKDKBSSA-N 0.000 description 1
- VDHOMPFVSABJKU-ULQDDVLXSA-N His-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N VDHOMPFVSABJKU-ULQDDVLXSA-N 0.000 description 1
- XIGFLVCAVQQGNS-IHRRRGAJSA-N His-Pro-His Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 XIGFLVCAVQQGNS-IHRRRGAJSA-N 0.000 description 1
- ABCCKUZDWMERKT-AVGNSLFASA-N His-Pro-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O ABCCKUZDWMERKT-AVGNSLFASA-N 0.000 description 1
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 1
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 1
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 1
- FOCSWPCHUDVNLP-PMVMPFDFSA-N His-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC4=CN=CN4)N FOCSWPCHUDVNLP-PMVMPFDFSA-N 0.000 description 1
- WSXNWASHQNSMRX-GVXVVHGQSA-N His-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSXNWASHQNSMRX-GVXVVHGQSA-N 0.000 description 1
- FBOMZVOKCZMDIG-XQQFMLRXSA-N His-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBOMZVOKCZMDIG-XQQFMLRXSA-N 0.000 description 1
- 101000652582 Homo sapiens Antigen peptide transporter 2 Proteins 0.000 description 1
- 101000725947 Homo sapiens Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 2 Proteins 0.000 description 1
- 101000736065 Homo sapiens DNA replication complex GINS protein PSF2 Proteins 0.000 description 1
- 101100121078 Homo sapiens GAL gene Proteins 0.000 description 1
- 101000608772 Homo sapiens Galectin-7 Proteins 0.000 description 1
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- CCHSQWLCOOZREA-GMOBBJLQSA-N Ile-Asp-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N CCHSQWLCOOZREA-GMOBBJLQSA-N 0.000 description 1
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 1
- SRGRINJFBHKHAC-NAKRPEOUSA-N Ile-Cys-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(=O)O)N SRGRINJFBHKHAC-NAKRPEOUSA-N 0.000 description 1
- SYVMEYAPXRRXAN-MXAVVETBSA-N Ile-Cys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SYVMEYAPXRRXAN-MXAVVETBSA-N 0.000 description 1
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- OVPYIUNCVSOVNF-KQXIARHKSA-N Ile-Gln-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N OVPYIUNCVSOVNF-KQXIARHKSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- OEQKGSPBDVKYOC-ZKWXMUAHSA-N Ile-Gly-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OEQKGSPBDVKYOC-ZKWXMUAHSA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- AREBLHSMLMRICD-PYJNHQTQSA-N Ile-His-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AREBLHSMLMRICD-PYJNHQTQSA-N 0.000 description 1
- ZXIGYKICRDFISM-DJFWLOJKSA-N Ile-His-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZXIGYKICRDFISM-DJFWLOJKSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- FCWFBHMAJZGWRY-XUXIUFHCSA-N Ile-Leu-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N FCWFBHMAJZGWRY-XUXIUFHCSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 1
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 1
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 1
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- BQIIHAGJIYOQBP-YFYLHZKVSA-N Ile-Trp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N BQIIHAGJIYOQBP-YFYLHZKVSA-N 0.000 description 1
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 1
- WRDTXMBPHMBGIB-STECZYCISA-N Ile-Tyr-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 WRDTXMBPHMBGIB-STECZYCISA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- 108090001061 Insulin Proteins 0.000 description 1
- 102000004877 Insulin Human genes 0.000 description 1
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 1
- 241000003482 Japonochytrium Species 0.000 description 1
- 229930194542 Keto Natural products 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 241001467308 Labyrinthuloides Species 0.000 description 1
- 241001491666 Labyrinthulomycetes Species 0.000 description 1
- 108010059881 Lactase Proteins 0.000 description 1
- 235000014647 Lens culinaris subsp culinaris Nutrition 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 1
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- IIKJNQWOQIWWMR-CIUDSAMLSA-N Leu-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N IIKJNQWOQIWWMR-CIUDSAMLSA-N 0.000 description 1
- KWURTLAFFDOTEQ-GUBZILKMSA-N Leu-Cys-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KWURTLAFFDOTEQ-GUBZILKMSA-N 0.000 description 1
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 1
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 1
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- PPQRKXHCLYCBSP-IHRRRGAJSA-N Leu-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N PPQRKXHCLYCBSP-IHRRRGAJSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 1
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 1
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- HWMQRQIFVGEAPH-XIRDDKMYSA-N Leu-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 HWMQRQIFVGEAPH-XIRDDKMYSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- URHJPNHRQMQGOZ-RHYQMDGZSA-N Leu-Thr-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O URHJPNHRQMQGOZ-RHYQMDGZSA-N 0.000 description 1
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 1
- FPFOYSCDUWTZBF-IHPCNDPISA-N Leu-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]([NH3+])CC(C)C)C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 FPFOYSCDUWTZBF-IHPCNDPISA-N 0.000 description 1
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 1
- MSFITIBEMPWCBD-ULQDDVLXSA-N Leu-Val-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MSFITIBEMPWCBD-ULQDDVLXSA-N 0.000 description 1
- 235000004431 Linum usitatissimum Nutrition 0.000 description 1
- 240000006240 Linum usitatissimum Species 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 1
- ZTPWXNOOKAXPPE-DCAQKATOSA-N Lys-Arg-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N ZTPWXNOOKAXPPE-DCAQKATOSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 1
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- PHHYNOUOUWYQRO-XIRDDKMYSA-N Lys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N PHHYNOUOUWYQRO-XIRDDKMYSA-N 0.000 description 1
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 1
- RDIILCRAWOSDOQ-CIUDSAMLSA-N Lys-Cys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RDIILCRAWOSDOQ-CIUDSAMLSA-N 0.000 description 1
- XFBBBRDEQIPGNR-KATARQTJSA-N Lys-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O XFBBBRDEQIPGNR-KATARQTJSA-N 0.000 description 1
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 1
- PGBPWPTUOSCNLE-JYJNAYRXSA-N Lys-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N PGBPWPTUOSCNLE-JYJNAYRXSA-N 0.000 description 1
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- KZOHPCYVORJBLG-AVGNSLFASA-N Lys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N KZOHPCYVORJBLG-AVGNSLFASA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- DKTNGXVSCZULPO-YUMQZZPRSA-N Lys-Gly-Cys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O DKTNGXVSCZULPO-YUMQZZPRSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- KZJQUYFDSCFSCO-DLOVCJGASA-N Lys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N KZJQUYFDSCFSCO-DLOVCJGASA-N 0.000 description 1
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 1
- FGMHXLULNHTPID-KKUMJFAQSA-N Lys-His-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CN=CN1 FGMHXLULNHTPID-KKUMJFAQSA-N 0.000 description 1
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- PINHPJWGVBKQII-SRVKXCTJSA-N Lys-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N PINHPJWGVBKQII-SRVKXCTJSA-N 0.000 description 1
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 1
- MTBBHUKKPWKXBT-ULQDDVLXSA-N Lys-Met-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MTBBHUKKPWKXBT-ULQDDVLXSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 1
- CENKQZWVYMLRAX-ULQDDVLXSA-N Lys-Phe-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CENKQZWVYMLRAX-ULQDDVLXSA-N 0.000 description 1
- JCVOHUKUYSYBAD-DCAQKATOSA-N Lys-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CS)C(=O)O JCVOHUKUYSYBAD-DCAQKATOSA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 1
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- CTJUSALVKAWFFU-CIUDSAMLSA-N Lys-Ser-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N CTJUSALVKAWFFU-CIUDSAMLSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- DYJOORGDQIGZAS-DCAQKATOSA-N Lys-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N DYJOORGDQIGZAS-DCAQKATOSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- KTINOHQFVVCEGQ-XIRDDKMYSA-N Lys-Trp-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(O)=O)C(O)=O KTINOHQFVVCEGQ-XIRDDKMYSA-N 0.000 description 1
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- LTYOQGRJFJAKNA-KKIMTKSISA-N Malonyl CoA Natural products S(C(=O)CC(=O)O)CCNC(=O)CCNC(=O)[C@@H](O)C(CO[P@](=O)(O[P@](=O)(OC[C@H]1[C@@H](OP(=O)(O)O)[C@@H](O)[C@@H](n2c3ncnc(N)c3nc2)O1)O)O)(C)C LTYOQGRJFJAKNA-KKIMTKSISA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 1
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 1
- ZEDVFJPQNNBMST-CYDGBPFRSA-N Met-Arg-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZEDVFJPQNNBMST-CYDGBPFRSA-N 0.000 description 1
- SBSIKVMCCJUCBZ-GUBZILKMSA-N Met-Asn-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N SBSIKVMCCJUCBZ-GUBZILKMSA-N 0.000 description 1
- IVCPHARVJUYDPA-FXQIFTODSA-N Met-Asn-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IVCPHARVJUYDPA-FXQIFTODSA-N 0.000 description 1
- SQUTUWHAAWJYES-GUBZILKMSA-N Met-Asp-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SQUTUWHAAWJYES-GUBZILKMSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- PNDCUTDWYVKBHX-IHRRRGAJSA-N Met-Asp-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PNDCUTDWYVKBHX-IHRRRGAJSA-N 0.000 description 1
- RPEPZINUYHUBKG-FXQIFTODSA-N Met-Cys-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O RPEPZINUYHUBKG-FXQIFTODSA-N 0.000 description 1
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 1
- RZJOHSFAEZBWLK-CIUDSAMLSA-N Met-Gln-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N RZJOHSFAEZBWLK-CIUDSAMLSA-N 0.000 description 1
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 1
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 1
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- BCRQJDMZQUHQSV-STQMWFEESA-N Met-Gly-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BCRQJDMZQUHQSV-STQMWFEESA-N 0.000 description 1
- OBCRZLRPJFNLAN-DCAQKATOSA-N Met-His-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OBCRZLRPJFNLAN-DCAQKATOSA-N 0.000 description 1
- BKIFWLQFOOKUCA-DCAQKATOSA-N Met-His-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N BKIFWLQFOOKUCA-DCAQKATOSA-N 0.000 description 1
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 1
- RVYDCISQIGHAFC-ZPFDUUQYSA-N Met-Ile-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O RVYDCISQIGHAFC-ZPFDUUQYSA-N 0.000 description 1
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 1
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 1
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 1
- OCRSGGIJBDUXHU-WDSOQIARSA-N Met-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OCRSGGIJBDUXHU-WDSOQIARSA-N 0.000 description 1
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 1
- CRVSHEPROQHVQT-AVGNSLFASA-N Met-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N CRVSHEPROQHVQT-AVGNSLFASA-N 0.000 description 1
- XOFDBXYPKZUAAM-GUBZILKMSA-N Met-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N XOFDBXYPKZUAAM-GUBZILKMSA-N 0.000 description 1
- FBLBCGLSRXBANI-KKUMJFAQSA-N Met-Phe-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FBLBCGLSRXBANI-KKUMJFAQSA-N 0.000 description 1
- WNJXJJSGUXAIQU-UFYCRDLUSA-N Met-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 WNJXJJSGUXAIQU-UFYCRDLUSA-N 0.000 description 1
- NTYQUVLERIHPMU-HRCADAONSA-N Met-Phe-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N NTYQUVLERIHPMU-HRCADAONSA-N 0.000 description 1
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 1
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 1
- SBFPAAPFKZPDCZ-JYJNAYRXSA-N Met-Pro-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SBFPAAPFKZPDCZ-JYJNAYRXSA-N 0.000 description 1
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 1
- XLTSAUGGDYRFLS-UMPQAUOISA-N Met-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCSC)N)O XLTSAUGGDYRFLS-UMPQAUOISA-N 0.000 description 1
- CULGJGUDIJATIP-STQMWFEESA-N Met-Tyr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 CULGJGUDIJATIP-STQMWFEESA-N 0.000 description 1
- WOGNGBROIHHFAO-JYJNAYRXSA-N Met-Tyr-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCSC)C(=O)O)N WOGNGBROIHHFAO-JYJNAYRXSA-N 0.000 description 1
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 1
- KPVLLNDCBYXKNV-CYDGBPFRSA-N Met-Val-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KPVLLNDCBYXKNV-CYDGBPFRSA-N 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- ACFIXJIJDZMPPO-NNYOXOHSSA-N NADPH Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](OP(O)(O)=O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 ACFIXJIJDZMPPO-NNYOXOHSSA-N 0.000 description 1
- 208000012902 Nervous system disease Diseases 0.000 description 1
- 208000025966 Neurological disease Diseases 0.000 description 1
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 1
- ZQPPMHVWECSIRJ-UHFFFAOYSA-N Oleic acid Natural products CCCCCCCCC=CCCCCCCCC(O)=O ZQPPMHVWECSIRJ-UHFFFAOYSA-N 0.000 description 1
- 239000005642 Oleic acid Substances 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 108020002230 Pancreatic Ribonuclease Proteins 0.000 description 1
- 102000005891 Pancreatic ribonuclease Human genes 0.000 description 1
- 241000562398 Phaeomonas Species 0.000 description 1
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 1
- YRKFKTQRVBJYLT-CQDKDKBSSA-N Phe-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 YRKFKTQRVBJYLT-CQDKDKBSSA-N 0.000 description 1
- LBSARGIQACMGDF-WBAXXEDZSA-N Phe-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 LBSARGIQACMGDF-WBAXXEDZSA-N 0.000 description 1
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 1
- LNIIRLODKOWQIY-IHRRRGAJSA-N Phe-Asn-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LNIIRLODKOWQIY-IHRRRGAJSA-N 0.000 description 1
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 1
- UEXCHCYDPAIVDE-SRVKXCTJSA-N Phe-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEXCHCYDPAIVDE-SRVKXCTJSA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 1
- KKYHKZCMETTXEO-AVGNSLFASA-N Phe-Cys-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKYHKZCMETTXEO-AVGNSLFASA-N 0.000 description 1
- VLZGUAUYZGQKPM-DRZSPHRISA-N Phe-Gln-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VLZGUAUYZGQKPM-DRZSPHRISA-N 0.000 description 1
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 1
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 1
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 1
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 1
- OPEVYHFJXLCCRT-AVGNSLFASA-N Phe-Gln-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O OPEVYHFJXLCCRT-AVGNSLFASA-N 0.000 description 1
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 1
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 1
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- OVJMCXAPGFDGMG-HKUYNNGSSA-N Phe-Gly-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OVJMCXAPGFDGMG-HKUYNNGSSA-N 0.000 description 1
- SFKOEHXABNPLRT-KBPBESRZSA-N Phe-His-Gly Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)NCC(O)=O SFKOEHXABNPLRT-KBPBESRZSA-N 0.000 description 1
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 1
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 1
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 1
- SZYBZVANEAOIPE-UBHSHLNASA-N Phe-Met-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SZYBZVANEAOIPE-UBHSHLNASA-N 0.000 description 1
- OAOLATANIHTNCZ-IHRRRGAJSA-N Phe-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N OAOLATANIHTNCZ-IHRRRGAJSA-N 0.000 description 1
- FUAIIFPQELBNJF-ULQDDVLXSA-N Phe-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FUAIIFPQELBNJF-ULQDDVLXSA-N 0.000 description 1
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 1
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 1
- FENSZYFJQOFSQR-FIRPJDEBSA-N Phe-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FENSZYFJQOFSQR-FIRPJDEBSA-N 0.000 description 1
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 1
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 1
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 1
- YVXPUUOTMVBKDO-IHRRRGAJSA-N Phe-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CS)C(=O)O YVXPUUOTMVBKDO-IHRRRGAJSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 1
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 1
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 1
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- CXMSESHALPOLRE-MEYUZBJRSA-N Phe-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O CXMSESHALPOLRE-MEYUZBJRSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 1
- NWVMQNAELALJFW-RNXOBYDBSA-N Phe-Trp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NWVMQNAELALJFW-RNXOBYDBSA-N 0.000 description 1
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 1
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 1
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 1
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 1
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 1
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 1
- 102000011755 Phosphoglycerate Kinase Human genes 0.000 description 1
- 108700019535 Phosphoprotein Phosphatases Proteins 0.000 description 1
- 102000045595 Phosphoprotein Phosphatases Human genes 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 241001208362 Photobacterium profundum SS9 Species 0.000 description 1
- 241000031611 Pinguiochrysis Species 0.000 description 1
- 241000705982 Pinguiococcus Species 0.000 description 1
- 101710159752 Poly(3-hydroxyalkanoate) polymerase subunit PhaE Proteins 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 241000031608 Polypodochrysis Species 0.000 description 1
- FELJDCNGZFDUNR-WDSKDSINSA-N Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FELJDCNGZFDUNR-WDSKDSINSA-N 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- ONPFOYPPPOHMNH-UVBJJODRSA-N Pro-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ONPFOYPPPOHMNH-UVBJJODRSA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 1
- INXAPZFIOVGHSV-CIUDSAMLSA-N Pro-Asn-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 INXAPZFIOVGHSV-CIUDSAMLSA-N 0.000 description 1
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 1
- OGRYXQOUFHAMPI-DCAQKATOSA-N Pro-Cys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O OGRYXQOUFHAMPI-DCAQKATOSA-N 0.000 description 1
- CKXMGSJPDQXBPG-JYJNAYRXSA-N Pro-Cys-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O CKXMGSJPDQXBPG-JYJNAYRXSA-N 0.000 description 1
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 1
- ODPIUQVTULPQEP-CIUDSAMLSA-N Pro-Gln-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ODPIUQVTULPQEP-CIUDSAMLSA-N 0.000 description 1
- PZSCUPVOJGKHEP-CIUDSAMLSA-N Pro-Gln-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PZSCUPVOJGKHEP-CIUDSAMLSA-N 0.000 description 1
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 1
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- FDINZVJXLPILKV-DCAQKATOSA-N Pro-His-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O FDINZVJXLPILKV-DCAQKATOSA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 1
- BAKAHWWRCCUDAF-IHRRRGAJSA-N Pro-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CN=CN1 BAKAHWWRCCUDAF-IHRRRGAJSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- NFLNBHLMLYALOO-DCAQKATOSA-N Pro-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 NFLNBHLMLYALOO-DCAQKATOSA-N 0.000 description 1
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- INDVYIOKMXFQFM-SRVKXCTJSA-N Pro-Lys-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O INDVYIOKMXFQFM-SRVKXCTJSA-N 0.000 description 1
- MZNUJZBYRWXWLQ-AVGNSLFASA-N Pro-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 MZNUJZBYRWXWLQ-AVGNSLFASA-N 0.000 description 1
- WLJYLAQSUSIQNH-GUBZILKMSA-N Pro-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@@H]1CCCN1 WLJYLAQSUSIQNH-GUBZILKMSA-N 0.000 description 1
- JFBJPBZSTMXGKL-JYJNAYRXSA-N Pro-Met-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JFBJPBZSTMXGKL-JYJNAYRXSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 1
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- NBDHWLZEMKSVHH-UVBJJODRSA-N Pro-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 NBDHWLZEMKSVHH-UVBJJODRSA-N 0.000 description 1
- QHSSUIHLAIWXEE-IHRRRGAJSA-N Pro-Tyr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O QHSSUIHLAIWXEE-IHRRRGAJSA-N 0.000 description 1
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 101710130262 Probable Vpr-like protein Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 101000961876 Pyrococcus woesei Uncharacterized protein in gap 3'region Proteins 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 101000912235 Rebecca salina Acyl-lipid (7-3)-desaturase Proteins 0.000 description 1
- 108020005091 Replication Origin Proteins 0.000 description 1
- 108010003581 Ribulose-bisphosphate carboxylase Proteins 0.000 description 1
- 101001056915 Saccharopolyspora erythraea 6-deoxyerythronolide-B synthase EryA2, modules 3 and 4 Proteins 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- DBIDZNUXSLXVRG-FXQIFTODSA-N Ser-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N DBIDZNUXSLXVRG-FXQIFTODSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- KCFKKAQKRZBWJB-ZLUOBGJFSA-N Ser-Cys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O KCFKKAQKRZBWJB-ZLUOBGJFSA-N 0.000 description 1
- ZHYMUFQVKGJNRM-ZLUOBGJFSA-N Ser-Cys-Asn Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O ZHYMUFQVKGJNRM-ZLUOBGJFSA-N 0.000 description 1
- SNNSYBWPPVAXQW-ZLUOBGJFSA-N Ser-Cys-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O SNNSYBWPPVAXQW-ZLUOBGJFSA-N 0.000 description 1
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 1
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 1
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- YMAWDPHQVABADW-CIUDSAMLSA-N Ser-Gln-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O YMAWDPHQVABADW-CIUDSAMLSA-N 0.000 description 1
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 1
- DGPGKMKUNGKHPK-QEJZJMRPSA-N Ser-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGPGKMKUNGKHPK-QEJZJMRPSA-N 0.000 description 1
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 1
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 1
- AXVNLRQLPLSIPQ-FXQIFTODSA-N Ser-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N AXVNLRQLPLSIPQ-FXQIFTODSA-N 0.000 description 1
- ASGYVPAVFNDZMA-GUBZILKMSA-N Ser-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N ASGYVPAVFNDZMA-GUBZILKMSA-N 0.000 description 1
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- QPPYAWVLAVXISR-DCAQKATOSA-N Ser-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QPPYAWVLAVXISR-DCAQKATOSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- RTXKJFWHEBTABY-IHPCNDPISA-N Ser-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CO)N RTXKJFWHEBTABY-IHPCNDPISA-N 0.000 description 1
- YXGCIEUDOHILKR-IHRRRGAJSA-N Ser-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CO)N YXGCIEUDOHILKR-IHRRRGAJSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000863432 Shewanella putrefaciens Species 0.000 description 1
- 101000877236 Siganus canaliculatus Acyl-CoA Delta-4 desaturase Proteins 0.000 description 1
- 101000819248 Staphylococcus aureus Uncharacterized protein in ileS 5'region Proteins 0.000 description 1
- 235000021355 Stearic acid Nutrition 0.000 description 1
- 102100028897 Stearoyl-CoA desaturase Human genes 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 101001099217 Thermotoga maritima (strain ATCC 43589 / DSM 3109 / JCM 10099 / NBRC 100826 / MSB8) Triosephosphate isomerase Proteins 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- UTSWGQNAQRIHAI-UNQGMJICSA-N Thr-Arg-Phe Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UTSWGQNAQRIHAI-UNQGMJICSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- ASJDFGOPDCVXTG-KATARQTJSA-N Thr-Cys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ASJDFGOPDCVXTG-KATARQTJSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 1
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- KLCCPYZXGXHAGS-QTKMDUPCSA-N Thr-His-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N)O KLCCPYZXGXHAGS-QTKMDUPCSA-N 0.000 description 1
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- JRAUIKJSEAKTGD-TUBUOCAGSA-N Thr-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N JRAUIKJSEAKTGD-TUBUOCAGSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 1
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 description 1
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- VEENWOSZGWWKHW-SZZJOZGLSA-N Thr-Trp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N)O VEENWOSZGWWKHW-SZZJOZGLSA-N 0.000 description 1
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- 241001467333 Thraustochytriaceae Species 0.000 description 1
- 241001298230 Thraustochytrium sp. Species 0.000 description 1
- 241000702295 Tomato golden mosaic virus Species 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- BDWDMRSGCXEDMR-WFBYXXMGSA-N Trp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BDWDMRSGCXEDMR-WFBYXXMGSA-N 0.000 description 1
- OETOOJXFNSEYHQ-WFBYXXMGSA-N Trp-Ala-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 OETOOJXFNSEYHQ-WFBYXXMGSA-N 0.000 description 1
- AOAMKFFPFOPMLX-BVSLBCMMSA-N Trp-Arg-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 AOAMKFFPFOPMLX-BVSLBCMMSA-N 0.000 description 1
- NAQBQJOGGYGCOT-QEJZJMRPSA-N Trp-Asn-Gln Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O NAQBQJOGGYGCOT-QEJZJMRPSA-N 0.000 description 1
- LAIUAVGWZYTBKN-VHWLVUOQSA-N Trp-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O LAIUAVGWZYTBKN-VHWLVUOQSA-N 0.000 description 1
- VKMOGXREKGVZAF-QEJZJMRPSA-N Trp-Asp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VKMOGXREKGVZAF-QEJZJMRPSA-N 0.000 description 1
- LHHDBONOFZDWMW-AAEUAGOBSA-N Trp-Asp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LHHDBONOFZDWMW-AAEUAGOBSA-N 0.000 description 1
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 1
- WPSYJHFHZYJXMW-JSGCOSHPSA-N Trp-Gln-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O WPSYJHFHZYJXMW-JSGCOSHPSA-N 0.000 description 1
- HRKOLWXWQSDMSK-XIRDDKMYSA-N Trp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HRKOLWXWQSDMSK-XIRDDKMYSA-N 0.000 description 1
- NOFFAYIYPAUNRM-HKUYNNGSSA-N Trp-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC2=CNC3=CC=CC=C32)N NOFFAYIYPAUNRM-HKUYNNGSSA-N 0.000 description 1
- HNIWONZFMIPCCT-SIXJUCDHSA-N Trp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N HNIWONZFMIPCCT-SIXJUCDHSA-N 0.000 description 1
- KWTRGSQOQHZKIA-PMVMPFDFSA-N Trp-Lys-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CCCCN)C(O)=O)C1=CC=C(O)C=C1 KWTRGSQOQHZKIA-PMVMPFDFSA-N 0.000 description 1
- CSOBBJWWODOYGW-ILWGZMRPSA-N Trp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)C(=O)O CSOBBJWWODOYGW-ILWGZMRPSA-N 0.000 description 1
- UQHPXCFAHVTWFU-BVSLBCMMSA-N Trp-Phe-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UQHPXCFAHVTWFU-BVSLBCMMSA-N 0.000 description 1
- GQNCRIFNDVFRNF-BPUTZDHNSA-N Trp-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O GQNCRIFNDVFRNF-BPUTZDHNSA-N 0.000 description 1
- DYIXEGROAOVQPK-VFAJRCTISA-N Trp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DYIXEGROAOVQPK-VFAJRCTISA-N 0.000 description 1
- UPUNWAXSLPBMRK-XTWBLICNSA-N Trp-Thr-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UPUNWAXSLPBMRK-XTWBLICNSA-N 0.000 description 1
- SGQSAIFDESQBRA-IHPCNDPISA-N Trp-Tyr-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SGQSAIFDESQBRA-IHPCNDPISA-N 0.000 description 1
- UIRVSEPRMWDVEW-RNXOBYDBSA-N Trp-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N UIRVSEPRMWDVEW-RNXOBYDBSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 1
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 1
- WDIJBEWLXLQQKD-ULQDDVLXSA-N Tyr-Arg-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O WDIJBEWLXLQQKD-ULQDDVLXSA-N 0.000 description 1
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 1
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 1
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 1
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 1
- BVWADTBVGZHSLW-IHRRRGAJSA-N Tyr-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BVWADTBVGZHSLW-IHRRRGAJSA-N 0.000 description 1
- HGEHWFGAKHSIDY-SRVKXCTJSA-N Tyr-Asp-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O HGEHWFGAKHSIDY-SRVKXCTJSA-N 0.000 description 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 1
- NLMXVDDEQFKQQU-CFMVVWHZSA-N Tyr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLMXVDDEQFKQQU-CFMVVWHZSA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 1
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 1
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- KEANSLVUGJADPN-LKTVYLICSA-N Tyr-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N KEANSLVUGJADPN-LKTVYLICSA-N 0.000 description 1
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 1
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 1
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 1
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 1
- BBSPTGPYIPGTKH-JYJNAYRXSA-N Tyr-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BBSPTGPYIPGTKH-JYJNAYRXSA-N 0.000 description 1
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 1
- FGVFBDZSGQTYQX-UFYCRDLUSA-N Tyr-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O FGVFBDZSGQTYQX-UFYCRDLUSA-N 0.000 description 1
- XYBNMHRFAUKPAW-IHRRRGAJSA-N Tyr-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XYBNMHRFAUKPAW-IHRRRGAJSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 1
- NXPDPYYCIRDUHO-ULQDDVLXSA-N Tyr-Val-His Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=C(O)C=C1 NXPDPYYCIRDUHO-ULQDDVLXSA-N 0.000 description 1
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- JFAWZADYPRMRCO-UBHSHLNASA-N Val-Ala-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JFAWZADYPRMRCO-UBHSHLNASA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- JOQSQZFKFYJKKJ-GUBZILKMSA-N Val-Arg-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JOQSQZFKFYJKKJ-GUBZILKMSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 1
- WKWJJQZZZBBWKV-JYJNAYRXSA-N Val-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WKWJJQZZZBBWKV-JYJNAYRXSA-N 0.000 description 1
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 1
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- FRUYSSRPJXNRRB-GUBZILKMSA-N Val-Cys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FRUYSSRPJXNRRB-GUBZILKMSA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- IWZYXFRGWKEKBJ-GVXVVHGQSA-N Val-Gln-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IWZYXFRGWKEKBJ-GVXVVHGQSA-N 0.000 description 1
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 1
- MHAHQDBEIDPFQS-NHCYSSNCSA-N Val-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C MHAHQDBEIDPFQS-NHCYSSNCSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- OXGVAUFVTOPFFA-XPUUQOCRSA-N Val-Gly-Cys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OXGVAUFVTOPFFA-XPUUQOCRSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 1
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 1
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 1
- KNYHAWKHFQRYOX-PYJNHQTQSA-N Val-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N KNYHAWKHFQRYOX-PYJNHQTQSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 1
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 1
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- VSCIANXXVZOYOC-AVGNSLFASA-N Val-Pro-His Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VSCIANXXVZOYOC-AVGNSLFASA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- HVRRJRMULCPNRO-BZSNNMDCSA-N Val-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 HVRRJRMULCPNRO-BZSNNMDCSA-N 0.000 description 1
- UFCHCOKFAGOQSF-BQFCYCMXSA-N Val-Trp-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N UFCHCOKFAGOQSF-BQFCYCMXSA-N 0.000 description 1
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 1
- 241000607598 Vibrio Species 0.000 description 1
- 241000219094 Vitaceae Species 0.000 description 1
- 125000002777 acetyl group Chemical group [H]C([H])([H])C(*)=O 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 239000010775 animal oil Substances 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010089975 arginyl-glycyl-aspartyl-serine Proteins 0.000 description 1
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000004888 barrier function Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 230000002457 bidirectional effect Effects 0.000 description 1
- 230000000975 bioactive effect Effects 0.000 description 1
- 230000002210 biocatalytic effect Effects 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 230000003925 brain function Effects 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 150000001721 carbon Chemical group 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 210000002230 centromere Anatomy 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 229910052801 chlorine Inorganic materials 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 230000001684 chronic effect Effects 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- RGJOEKWQDUBAIZ-UHFFFAOYSA-N coenzime A Natural products OC1C(OP(O)(O)=O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-UHFFFAOYSA-N 0.000 description 1
- 239000005516 coenzyme A Substances 0.000 description 1
- 229940093530 coenzyme a Drugs 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 238000006114 decarboxylation reaction Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 108010037489 delta-4 fatty acid desaturase Proteins 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- KDTSHFARGAKYJN-UHFFFAOYSA-N dephosphocoenzyme A Natural products OC1C(O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 KDTSHFARGAKYJN-UHFFFAOYSA-N 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 230000004392 development of vision Effects 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- 235000015872 dietary supplement Nutrition 0.000 description 1
- KAUVQQXNCKESLC-UHFFFAOYSA-N docosahexaenoic acid (DHA) Natural products COC(=O)C(C)NOCC1=CC=CC=C1 KAUVQQXNCKESLC-UHFFFAOYSA-N 0.000 description 1
- 235000013345 egg yolk Nutrition 0.000 description 1
- 210000002969 egg yolk Anatomy 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- UKFXDFUAPNAMPJ-UHFFFAOYSA-N ethylmalonic acid Chemical compound CCC(C(O)=O)C(O)=O UKFXDFUAPNAMPJ-UHFFFAOYSA-N 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 235000019197 fats Nutrition 0.000 description 1
- 125000001924 fatty-acyl group Chemical group 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 235000012041 food component Nutrition 0.000 description 1
- 239000005417 food ingredient Substances 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- VZCCETWTMQHEPK-UHFFFAOYSA-N gamma-Linolensaeure Natural products CCCCCC=CCC=CCC=CCCCCC(O)=O VZCCETWTMQHEPK-UHFFFAOYSA-N 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 229960002733 gamolenic acid Drugs 0.000 description 1
- 230000007614 genetic variation Effects 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 235000021021 grapes Nutrition 0.000 description 1
- 208000019622 heart disease Diseases 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 235000020256 human milk Nutrition 0.000 description 1
- 210000004251 human milk Anatomy 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 229940125396 insulin Drugs 0.000 description 1
- QXJSBBXBKPUZAA-UHFFFAOYSA-N isooleic acid Natural products CCCCCCCC=CCCCCCCCCC(O)=O QXJSBBXBKPUZAA-UHFFFAOYSA-N 0.000 description 1
- 125000000468 ketone group Chemical group 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 229940116108 lactase Drugs 0.000 description 1
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 150000002617 leukotrienes Chemical class 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 150000004668 long chain fatty acids Chemical class 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- LTYOQGRJFJAKNA-DVVLENMVSA-N malonyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 LTYOQGRJFJAKNA-DVVLENMVSA-N 0.000 description 1
- LTYOQGRJFJAKNA-VFLPNFFSSA-N malonyl-coa Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCSC(=O)CC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 LTYOQGRJFJAKNA-VFLPNFFSSA-N 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- ZIYVHBGGAOATLY-UHFFFAOYSA-N methylmalonic acid Chemical group OC(=O)C(C)C(O)=O ZIYVHBGGAOATLY-UHFFFAOYSA-N 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 239000004570 mortar (masonry) Substances 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 210000005036 nerve Anatomy 0.000 description 1
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- QIQXTHQIDYTFRH-UHFFFAOYSA-N octadecanoic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O QIQXTHQIDYTFRH-UHFFFAOYSA-N 0.000 description 1
- OQCDKBAXFALNLD-UHFFFAOYSA-N octadecanoic acid Natural products CCCCCCCC(C)CCCCCCCCC(O)=O OQCDKBAXFALNLD-UHFFFAOYSA-N 0.000 description 1
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 1
- 239000006014 omega-3 oil Substances 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 229940094443 oxytocics prostaglandins Drugs 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 230000000858 peroxisomal effect Effects 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 150000003904 phospholipids Chemical class 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 150000003071 polychlorinated biphenyls Chemical class 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 150000003148 prolines Chemical class 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 150000003815 prostacyclins Chemical class 0.000 description 1
- 150000003180 prostaglandins Chemical class 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 102000037983 regulatory factors Human genes 0.000 description 1
- 108091008025 regulatory factors Proteins 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 201000000980 schizophrenia Diseases 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000001932 seasonal effect Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 238000004904 shortening Methods 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 239000008117 stearic acid Substances 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 239000008399 tap water Substances 0.000 description 1
- 235000020679 tap water Nutrition 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 230000009772 tissue formation Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000014723 transformation of host cell by virus Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 235000014393 valine Nutrition 0.000 description 1
- 125000002987 valine group Chemical class [H]N([H])C([H])(C(*)=O)C([H])(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
- C12P7/6436—Fatty acid esters
- C12P7/6445—Glycerides
- C12P7/6472—Glycerides containing polyunsaturated fatty acid [PUFA] residues, i.e. having two or more double bonds in their backbone
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- General Chemical & Material Sciences (AREA)
- Oil, Petroleum & Natural Gas (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicinal Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Immobilizing And Processing Of Enzymes And Microorganisms (AREA)
Abstract
본 발명은 폴리케타이드 합성효소(PKS)에 특이적인 서열을 코딩하는 유전자에 관한 것이다. 그로부터 합성되는 PKS는 PUFAs(polysaturated fatty acids:고도불포화 지방산)을 생산하는 효소 능력에 의해 특징된다. 본 발명은 또한 재조합 및/또는 형질전환된 유기체의 생산을 위한 뉴클레오타이드 서열의 용도뿐만 아니라 그에 상응하는 DNA 서열의 동정에 관한 것이다.
Description
본 발명은 폴리케타이드 합성효소(PKS)에 특이적인 서열을 코딩하는 유전자에 관한 것이다. 그로부터 합성되는 PKS는 PUFAs(polysaturated fatty acids:고도불포화 지방산)을 생산하는 효소 능력에 의해 특징된다. 본 발명은 또한 재조합 및/또는 형질전환된 유기체의 생산을 위한 뉴클레오타이드 서열의 용도뿐만 아니라 그에 상응하는 DNA 서열의 동정에 관한 것이다.
PUFAs(polysaturated fatty acids:고도불포화 지방산)이라는 용어는 탄소원소 12개 이상의 사슬 길이를 가지고, 적어도 두 개의 이중결합을 가지는 다중 불포화된 긴-사슬 지방산을 의미한다. PUFA의 두 개의 주요 패밀리로는 오메가-3와 오메가-6 지방산이 있는데, 이들은 알킬 말단에 대한 첫번째 이중 결합의 위치에 따라 나뉜다. 이는 세포막의 주요성분으로, 세포막에서 지질, 특히 인지질 내에 존재한다. PUFAs는 또한 사람과 동물 내에서 프로스타글란딘(prostaglandins), 로이코트리엔(leukotrienes), 프로스타싸이클린(prostacyclins)과 같은 중요한 분자의 예비 단계로서 작용한다(A.P. Simopoulos, essential fatty Acids in health and chronic disease, Am. J. Clin. Nutr. 1999(70), pp. 560-569). 오메가-3 지방산의 중요한 대표적 물질로는 DHA(docosahexaenoic acid)과 EPA(eicosapentaenoic aid)가 있는데, 이는 어류 오일과 해양 미생물에서 발견될 수 있다. 오메가-6 지방산의 주요 대표적 물질로는 ARA(arachidonic acid)이 있는데, 그것은 예를 들면, 단섬유 균류에서 발생하지만, 또한 간과 콩팥과 같은 동물 조직으로부터 분리할 수도 있다. DHA와 ARA는 사람의 모유 내에서 서로 근접하여 발견된다.
PUFAs는 인간의 적절한 성장, 특히 뇌의 발달, 조직 형성 및 그것의 회복을 위해 필수적이다. 그래서, DHA는 사람 세포막, 특히 신경의 주요 성분이다. 더구나, DNA는 뇌 기능의 발달에 있어서 중요한 역할을 하며, 시력의 발달에 있어서도 필수적이다. DHA와 EPA 등의 오메가-3 PUFAs는 영양 보충제로 사용되는데, DHA의 충분한 공급에 의한 균형잡힌 영양상태가 특정 질병을 예방하는데 있어서 유익하기 때문이다(A.P. Simopoulos, Essential fatty acids in health and chronic deseasee, American Journal of Clinical Nutrition 1999(70), pp. 560-569). 예를 들면, 인슐린-비의존성 당뇨병을 가진 성인에게서는 DHA의 결핍이 나타나며, 심장병과 관련하여서도 DHA의 불균형은 이후 문제를 일으킨다. 알츠하이머 또는 정신분열증과 같은 신경 질환도 낮은 DHA 레벨을 동반한다.
DHA를 상업적으로 추출하기 위한 수많은 공급원으로는, 해양 냉수어의 오일, 난황 조각 또는 해양 미생물 등이 있다. n-3 PUFA의 추출에 적합한 미생물로는, 비브리오(예를 들면, Vibrio marinus) 속의 박테리아 또는 편조류(Dinophyta)가 있고, 특히 C. cohnii와 같은 크립테코디니움(Crypthecodinium) 속, 또는 파에오모나스(Phaeomonas), 핑귀오키리시스(Pinguiochrysis), 핑귀오코쿠스(Pinguiococcus) 및 폴리포도크리시스(Polypodochrysis)와 같은 스트라메노필레스(또는 라비린툴로미코타(Labyrinthulomycota))이 있다. PUFA 생산을 위한 다른 바람직한 미생물로는 트라우스토키트리알레스(트라우스키트리이데아) 목(目), 자포노키트리움(Japonochytrium), 치조키트리움(Schizochytrium), 트라우스토키트리움, 알쏘니아(Althornia), 라비린툴로이데스(Labyrinthuloides), 아플라노키트리움(Aplanochytrium) 및 울케니아(Ulkenia) 등이 있다.
PUFA의 공급원으로는 식물 또는 동물이 상업적으로 사용되는데, 그것으로부터 추출된 오일은 종종 매우 이질적인 조성에 의해 특징화된다. 이런 방법으로 추출된 오일은 하나 또는 여러 개의 PUFAs를 풍부하게 하기 위해 고가의 정제 과정을 필요로 한다. 그러한 공급원으로부터의 PUFA의 공급은 또한 조절되지 않는 변동에 당면하게 된다. 그래서, 질병과 날씨 영향이 동물과 또한 식물 산출을 감소시킬 수 있다. 어류로부터의 PUFA의 추출은 계절 변동에 영향을 받기 쉬우며, 또한 물고기 남획이나 기온의 변동(예를 들면, Nino) 때문에 일시적으로 급격히 감소될 수 있다. 동물 오일, 특히 물고기 오일은 먹이 사슬에 의한 환경으로부터 유해 물질이 축적될 수도 있다. 동물은 폴리염화비페닐과 같은 유기염소에 의해 급격한 스트레스를 받는다고 알려져 있는데, 특히 상업적 양식장에서 물고기 소비의 측면에서 건강을 해친다(Hites et al. 2004, Global assessment of organic contaminants in farmed salmon, Science 303, pp. 226-229). 물고기 생산의 질에서 결과적 손실은, 오메가-3 PUFA 공급원으로서의 어류와 어류 오일에 대한 소비자의 감소된 수용이라는 결과를 가져온다. 더구나, 어류로부터의 DHA 정제는 상대적으로 고도 기술 장비를 요구하므로 값이 비싸다. 반면, 세포의 총 지방 조성의 대략 50%양의 몇몇 해양 미생물 내에 존재하는 DHA는 큰 발효조에서 상대적으로 경제적으로 배양할 수 있다. 미생물의 다른 장점은 그로부터 추출한 오일의 조성은 몇몇 조성으로 한정된다는 것이다.
도코사헥산(DHA; 22:6, n-3)과 에이코사펜타에녹산(EPA; 20:5, n-3)와 같은긴-사슬 PUFA의 생합성을 위한 다양한 생촉매 경로가 알려져 있다. 진핵 생물에서 긴-사슬 PUFA를 생산하기 위한 전통적 생합성 경로는 리놀렌산(LA; 18:2, n-6)과 알파리놀렌산의 델타-6 역포화로 시작된다. 그 결과, 리놀렌산으로부터 감마리놀렌산(GLA; 18:3, n-6)이, 그리고 알파리놀렌산으로부터 옥타데카테트라엔산(OTA; 18:4, n-3)이 합성된다. 이러한 역포화 단계는 델타-5 역포화뿐 아니라, 연장 단계에 의해 n-3 지방산뿐 아니라 n-6에서도 일어나서 결과적으로 아라키돈산(ARA; 20:4, n-6)과 에이코사펜타에녹산(EPA; 20:5, n-3)으로 귀착된다. 에이코사펜타에녹산(EPA; 20:5, n-3)으로부터 시작된 도코사헥산(DHA; 22:6, n-3)의 합성은 두 가지 서로 다른 생합성 경로로 일어날 수 있다. 선형 생합성 경로라 불리는, 두 개의 추가 탄소에 의한 에이코사펜타에녹산(EPA; 20:5, n-3)의 연장은 도코사헥산(DHA; 22:6, n-3)의 형성을 위해 델타-4 역포화에 연이어 발생한다. 이런 생합성 경로의 존재는 트라우스토키트리움과 유글레나와 같은 유기체 내의 델타-4 탈포화효소의 존재에 의해 확증될 수 있었다(Qiu, et al., Identification of a delta 4 fatty acid desaturase from Thraustochytrium sp. involved in the biosynthesis of docosahexaenoic acid by heterologous expression in Saccharomyces cerevisiae and Brassica juncea., J. Biol. Chem. 276 (2001), pp. 31561-31,566 and Meyer et al., Biosynthesis of docosahexaenoic acid in Euglena gracilis: Biochemical and molecular evidence for the involvement of a delta 4 fatty acyl group desaturase. Biochemistry 42 (2003), pp. 9779-9788).
에이코사펜타에녹산(EPA; 20:5, n-3)으로부터 시작된 도코사헥산(DHA; 22:6, n-3)의 두 번째 합성 경로는, Sprecher pathway로 불리는 것으로, 델타-4 탈포화에 의존한다. 그것은 테트라코사펜타에녹산(24:5, n-3)와 테트라코사헥산(24:6, n-3)에 대한 연속적인 델타-6 역포화에 대한 두 개의 탄소 유닛에 의한 두 개의 성공적인 연장 단계로 구성된다. 그리고 나서 산화(peroxisomal β oxidation)의 결과로서 두 개의 탄소 유닛에 의한 단축으로 페록시도코사헥산이 형성된다(H. Sprecher, Metabolism of highly unsaturated n-3 and n-6 fatty acids. Biochimica et Biophysica Acta 1486 (2000), pp. 219-231). 이 두 번째 생합성 경로는 포유동물에서 두드러진 DHA 합성 경로이다(Leonard et al., Identification and expression of mammalian long-chain PUFA elongation enzymes. Lipids 37 (2002), pp. 733-740). C20 PUFA의 형성을 위한 다른 생합성 경로는 델타 6 역포화효소 활성이 부족한 몇몇 유기체에 존재한다. 이런 유기체에는, 원생생물 가시아메바 종(Acanthamoeba sp.)과 Euglena gracilis가 포함된다. 다른 C20 PUFA 합성에서의 첫번째 단계는 두 개의 탄소 유닛에 의한 C18 지방산, 리놀렌산(LA; 18:2, n-6) 및 알파리놀린산(ALA; 18:3. n-3)의 연장에 존재한다. 지방산 에이코사펜타에녹산(20:2, n-6)과 에이코사펜타에녹산(20:3, n-3)은 그리고 나서 델타 8 역포화효소와 연속적인 델타 5 역포화에 의해 아라키도닉산(ARA; 20:4, n-6) 및/또는 에이코사펜타에녹산(EPA; 20:5, n-3)으로 전환된다(Sayanova and Napier, Eicosapentaenoic acid: Biosynthetic routes and the potential for synthesis in transgenic plants. Phytochemistry 65 (2004), pp. 147-158; Wallis and Browse; The delta-8 desaturase of Euglena gracilis: An alternate pathway for synthesis of 20-carbon polyunsaturated fatty acids. Arch. Biochem. Biophys. 362 (1999), pp. 307-316).
고등 식물은 초기 단계로부터 C20 PUFA를 합성하는 능력을 가지고 있지 않다. 그들은 다양한 역포화효소(desaturase)를 통해, 스테아린산(18:0), 올렌산(C18: 1; 델타-9 역포화효소), 리놀렌산(18:2, n-6, 델타 12 역포화효소)와 알파 리놀렌산(18:3, n-3; 델타 15 역포화효소)로부터 개시하여 형성한다.
그러나 특정 해양 미생물은 EPA와 DHA의 생산에 있어서 완전히 서로 다른 생합성 경로를 가진다. 이러한 PUFA-생산 미생물에는 사이토파가 플라보박테리움 박테로이드(cytophaga flavobacterium bacteroides) 그룹의 몇몇 종과 현재의 진핵 원생 생물, 치조키트리움 종. ATCC 20888뿐만 아니라, 해양 감마 프로테오박테리아를 대표적으로 포함한다(Metz et al. 2001, Production of polyunsaturated fatty acids by polyketide synthases in both prokaryotes and eukaryotes. Science 293:290-293). 그들은 폴리케타이드 합성효소(PKS)로 불리는 효소에 의해 긴-사슬 PUFA를 합성한다. 이러한 PKSs는 케타이드 유닛으로 구성되는 두 번째 대사산물의 합성을 촉매하는 큰 효소이다(G.W. Wallis, J.L. Watts and J. Browse, Polyunsaturated fatty acid synthesis: what will they think of next? Trends in Biochemical Sciences 27 (9) pp. 467-473). 폴리케타이드의 합성은 수많은 효소 작용을 포함하는데, 그것은 지방산 합성과 유사하다(Hopwood & Sherman Annu. Rev. Genet. 24 (1990) pp. 37-66; Katz & Donadio Annu. Rev. of Microbiol. 47 (1993) pp. 875-912).
서로 다른 PUFA-PKSs(PIFA-합성 PKSs)의 유전자 서열은 이미 알려져 있다. 그래서, 해양 미생물 Shewanella 종으로부터 단리된 38kb 유전자 단편은 에이이코사펜타엔산(eicosapentaenoic acid: EPA)의 생산에 대한 정보를 포함한다. 이러한 단편의 연속적인 시퀀싱은 8개의 해독프래임(open reading grames(ORFS))을 동정하였다(H. Takeyama et al., Microbiology 143(1997) pp. 2725-2731). Shewanella의 이러한 5개의 해독틀은 폴리케타이드 합성 유전자와 매우 관련이 있다. 마찬가지로, 미국 특허 US 5,798,259는 Shewanella putrefaciens SCRC-2874의 EPA 유전자 클러스터에 대해 개시한다. PUFA-PKS 유전자는 또한 해양 원핵생물 Photobacterium profundum strain SS9(Allen and Bartlett, Microbiology 2002, 148 pp. 1903-1913)와 Moritella marina strain MP-1, earlier Vibrio marinus (Tanaka at al., Biotechnol. Letters 1999, 21, pp. 939-945)에서도 발견된다. 유사 PUFA-생산, PKS-같은 ORFs는 또한 진핵 원핵생물 치조키트리움에서 동정될 수 있다(Metz et al, Science 293 (2001) pp. 290-293 and US Pat. No. 6,556,583, WO02/083870 A2). 치조키트리움에서 검출된 3개의 ORFs는 부분적으로 Shewanell의 EPA 유전자 클러스터와 부분적으로 동일하다. 몇몇 원핵생물과 진핵생물 치조키트리움에 보존된 PKS 유전자의 존재는 원핵과 진핵 생물 사이의 PUFA-PKS유전자의 균일한 유전자 이동 가능성에 대해 암시한다.
정상적으로 PUFAs를 생산하지 않는 미생물에서 단리된 유전자 클러스터를 사용하여, PUFAs의 형질전환 생산이 가능하다는 것은 이미 알려져 있다. 그래서, Shewanella 종. SCRC-2738 클러스터 내에 존재하는 상기 5개의 ORFs(열린 해독틀)은 IPA가 없는 생산자 E.coli와 Synechoccus 종 내의 측정 가능한 양의 EPA를 생산하는데 있어서 충분하다(Yazawa, Lipids 1996, 31, pp. 297-300 and Takayama et al., Microbiology 1997, 143, pp. 2725-2731).
일반적으로, 넒은 범위의 PUFAs의 생산에 대한 새로운 PUFA 생산자는 항상 필요하다. 이러한 생산이 예를 들면, 원핵생물, 원생생물 또는 식물 내에서 일어나는지 어떤지는 중요하지 않다. 목표는 항상 경제적으로 가능한한 많은 양의, 높은-질의 PUFAs를 생산하는 것, 그리고 가능한 어느 정도는 환경을 보호하는 것이다. 본 발명은 이러한 목표를 수행하기 위해, 특히 효과적인 PUFA 생산자인 울케니아 종으로부터 적절한 PUFA-PKS 유전자를 개시한다.
당해 기술분야에서, 본 발명은 PUFAs의 생산에 매우 적합한 DHA를 생산하는 미생물인 울케니아 종으로부터 PUFA-PKS 유전자를 동정하고 추출하는 것에 대한 것이다. 또한, 그들의 조절요소뿐만 아니라 그러한 유전자의 위치와 배열에 대한 지식을 얻는 것이다. 이로부터 얻은 지식, 특히 이로부터 얻은 핵산 물질은 형질전환된 유기체뿐 아니라 동계의 PUFA-PKS 유전자의 발현을 강화하게 한다.
본 발명에서 초기에 언급된 것에서 쉽게 유도되거나 결론지을 수 있는 명백히 언급되지 않은 것뿐만 아니라 상기한 과제들도 본 발명의 청구항에서 정의된 요지에 의해 해결된다.
1. 아래와 같은 특징을 갖는, PUFA-PKS
a.서열번호 6(ORF 1), 7(ORF 2), 8 및/또는 80(ORF 3)중 적어도 하나의 아미노산 서열을 포함하는 것으로, 적어도 70%, 바람직하게는 80%, 더 바람직하게는 적어도 90%, 더욱 바람직하게는 적어도 99% 그리고 매우 특히 바람직하게는 100%의 상동성 서열을 가지고, 적어도 하나의 PUFA-PKS 도메인의 생물학적 활성을 가지거나, 또는
b. 서열번호 32, 34, 45, 58, 59, 60, 61, 72, 74 및/또는 77 중 적어도 하나의 아미노산 서열을 포함하고, 적어도 그와 70%, 바람직하게는 80%, 더 바람직하게는 적어도 90%, 더욱 바람직하게는 적어도 99% 그리고 매우 특히 바람직하게는 100%의 상동성 서열을 가지고, 적어도 하나의 PUFA-PKS 도메인의 생물학적 활성을 가진다.
2. 10 이상의 ACP 도메인을 갖는, 제 1항에 따른 분리된 PUFA-PKS.
또한, 본 발명에 있어서 바람직한 PUFA-PKS는 서열번호 6(ORF 1), 7(ORF 2), 및/또는 8 및/또는 80(ORF 3) 서열의 적어도 500개의 연속적인 아미노산 서열과 적어도 70%, 바람직하게는 80%, 더 바람직하게는 90%, 더욱 바람직하게는 99% 동일한, 적어도 하나의 아미노산 서열을 포함하는 것이다.
또한, 본 발명은 바람직한 아미노산 서열, 서열번호 6(ORF 1), 7(ORF 2) 및/또는 8 및/또는 80(ORF 3)의 적어도 500개의 연속적인 아미노산 서열과 적어도 70%, 바람직하게는 80%, 더 바람직하게는 90%, 더욱 바람직하게는 99% 동일한 아미노산 서열에 관한 것이다.
더욱 바람직한 양태에 있어서, 본 발명은 선행 청구항 중 하나에 따른 PUFA-PKS를 코딩하는 분리된 DNA 분자에 관한 것이다.
더욱 바람직하게는, 서열번호 6(ORF 1), 7(ORF 2) 및/또는 8 및/또는 80(ORF 3)의 적어도 500개의 연속적인 아미노산 서열과 적어도 70%, 동일한 아미노산 서열을 코드하는 것으로 특징된다.
또한, 본 발명은 서열번호 3, 4, 5 및/또는 9 서열의 적어도 500개의 연속적인 뉴클레오타이드와 적어도 70%, 바람직하게는 80%, 더 바람직하게는 90%, 더욱 바람직하게는 95% 동일한 분리된 DNA 분자에 관한 것이다.
더욱 바람직한 양태로서, 본 발명은 전사를 조절하는 DNA 서열 중 적어도 하나와 기능적으로 연결된 이전에 기술된 DNA 분자 중 하나를 포함하는 재조합 DNA 분자에 관한 것으로, 바람직하게는 서열번호 3, 4 와 5 및/또는 9 또는 그의 기능적 변이체 뿐만 아니라 적어도 500개의 뉴클레오타이드의 일부분으로 구성되는 그룹으로부터 선택된다.
본 발명의 더 바람직한 양태는, 이전에 기술된 재조합 DNA 분자를 포함하는 재조합 숙주 세포에 관한 것이다.
본 발명의 바람직한 양태는, 적어도 10 ACP 도메인을 가지는 본 발명에 따른 PUFA-PKS를 내생적으로 나타내는 재조합 숙주 세포에 관한 것이다.
또한, 본 발명의 더 바람직한 양태는, PUFA, 바람직하게는 DHA를 포함하는 오일의 생산 방법에 관한 것으로, 이러한 방법에 의해 생산된 오일뿐 아니라, 그러한 재조합 숙주 세포의 배양을 포함한다.
또한, 본 발명의 더 바람직한 양태는, PUFA, 바람직하게는 DHA를 포함하는 바이오매스의 생산 방법에 관한 것으로, 이 방법에 의해 생산된 바이오매스뿐만 아니라 그러한 재조합 숙주 세포의 배양을 포함한다.
그러므로, 본 발명의 더 바람직한 양태는, 청구항 15항에 따른 재조합 바이오매스에 관한 것으로, 그 바이오매스는 청구항 8에 따른 핵산 및/또는 청구항 1에 따른 아미노산 서열 또는 그것에 대해 적어도 500개의 연속적인 아미노산 상동의 부분을 포함하는 것이다.
본 발명은 또한 더욱 바람직한 일 양태로서, 인공적인 폴리케타이드, 예를 들면, 폴리케타이드 항생물질 및/또는 새로운, 변화된 지방산을 생산하기 위한, 서열번호 6, 7, 8 및/또는 80, 서열번호 32, 33, 34, 45, 58, 59, 60, 61, 72, 74 및/또는 77로 구성되는 PUFA-PKS의 개별적인 효소 도메인의 용도에 관한 것이다.
본 발명에 따르면, 핵산의 경우 동일성은 비교되는 가닥의 특정 위치에 있어서의 동일한 염기쌍을 나타낸다. 그러나, 차이는 있을 수 있다. 동일성 값 계산에 대한 가능성은 blastn과 fasta프로그램에 의해 %로 나타냈다.
아미노산과 관련하여, 상동성이라는 개념은 예를 들어, 아미노산 서열에서의 보존적 변화를 포함하는 것으로, 그것은 기능 및/또는 단백질의 구조에 상당한 영향을 미치지는 않는다. 심지어 프로그램에 의해 계산되는 그러한 상동성 값, 예를 들면 blastp, Matrix PAM30, Gap Penalties: 9, Extension: 1은 당업자에게 알려진 것이다(Altschul at al., NAR 25, 3389-3402).
울케니아 종의 PUFA-PKS유전자의 서열 정보는 서열번호 3 내지 5 및/또는 9, 서열번호 1과 2로 정의되는 핵산과 아미노산 서열은 두 개의 분리된 코스미드(cosmid)의 전체 유전자 DNA서열을 나타내는데 이용된다(실시예 2와 실시예 3을 보라). 뒷부분에서는 이웃한 조절 서열(flanking regulatory sequence)뿐 아니라 PUFA 합성에 필수적인 3개의 관련된 열린 해독틀 ORFs 1-3에 대한 정보의 부분에 대해 포함한다. 또한, 유전자 서열로부터 유도될 수 있는 단백질 서열은 그것의 결과로 표현된다.
발명은 또한 고도의 순수 PUFAs의 생산을 위해 본 발명에 따른 핵산으로 숙주 유기체를 상동 및 이종 형질 전환하는 방법에 대해 포함한다. 그 분리된 열린 해독틀은 PUFA, 특히 DHA, EPA 및 DPA를 생산하는 형질전환 유기체에서뿐 아니라 동계에서도 바람직하다.
그 생산된 PUFAs는 바람직하게는 바이오매스의 조성 또는 오일로 존재한다.
본 발명 이전에는, 단지 진핵 생물 유기체, 원생 생물 치조키트리움의 PUFA-PKS 유전자에 대해서만 알려져 있었다(US Patent No. 6, 566, 583, WO 02/083870). 그리고나서 cDNA와 염색체 DNA로부터 부분적으로 서열 데이터가 결정되었다. 처음으로, PUFA 합성에 필수적인 진핵 원생 생물의 모든 PUFA-PKS 유전자가 본 발명의 염색체 DNA로부터 완전히 개시되었다. 이러한 결과는 울케니아 종으로부터 이전에 알려진 PUFA-PKS를 코딩하는 유전자의 결정뿐 아니라, 추가적으로 전사의 프로모터와 종결부위와 같은 이웃 조절 요소에 대해서도 제공한다. 게다가, 염색체 서열 정보는 각각의 PUFA-PKS 유전자의 위치와 배열에 대한 통찰도 가능하게 한다.
슈와넬라, 포토박테리움 또는 모리텔라와 같은, 이전에는 원핵생물 PUFA-PKS 대표로 알려졌던 클러스터가 더 이상 존재하지 않는다는 것은 매우 놀랍다. 처음 동정된 코스미드(서열번호 1)는 울케니아에 끼어들어간 독립적인 ORFs의 선형 배열을 보여주며, 또한 독립적인 ORFs의 해독틀 방향이 거꾸로 향한 것을 볼 수 있다(도 1). 이는 아마도 거대 유전자 전이의 결과인 거 같다. 독립적인 ORFs는 또한 각각 전이의 결과로서 서로로부터 명백히 큰 간격을 보여준다. 그래서, 두 개의 ORFs 1과 2는 대략 13 kb의 간격을 가진다. 세 번째 ORF는 그 후의 코스미드(서열번호 2)까지 이러한 관계에 있어서는 동정되지 못하고, 두 개의 코스미드(서열번호 1과 2) 사이의 부분적 동일성을 발견할 수 없다(도 1). 이는 울케니아 종의 ORF가 더 이상 두 개의 ORFs 1과 2의 부근의 공간에 위치하지 않는다는 것을 의미한다. 이로부터, 상기 언급된 원핵생물의 대표로 알려진 PUFA 유전자 클러스터가, 진핵생물 울케니아 종에서는 더 이상 존재하지 않는다는 결론을 내릴 수 있다. 유전자 상에서의 원핵생물 치조키트리움의 독립적인 PUFA-PKS 유전자의 위치와 배열은 부분적으로 결정되었는데(WO 02/083870), 이는 ORFs A와 B의 상반하는 방향을 보여준다. 그러나, 그것은 단지 4224 염기쌍에 의해 서로로부터 분리되어있다. 이 서열 부분은 특허 출원 WO 02/083870에서 양방향을 가지는 유전자 내 부분으로 논의되는 것이다. 양방향 프로모터 요소는 동종의 ORFs 1과 2 사이의 것으로, 최소한 울케니아에 대해서는, 울케니아에서 검출된 12.95 kb의 간격으로 인하여 있을 수 있는 것으로 보여진다. 울케니아의 ORF1 과 OFR2 사이의 12.95 kb 지역 내에 명백한 ORFs가 존재하는지는 명백하지 않다. 이는 거대한 재조합 및/또는 전사가 일어나는 지역에 대한 것이다. 전이효소-같은 사건이 몇몇 반복 서열에서 일어날 수 있다.
울케니아 종의 PUFA-PKS가 DHA 생산자인 모리텔라(5 x ACP) 와 치조키트리움(9 x ACP)뿐만 아니라 EPA 생산자인 슈와넬라(6 x ACP)와 포토박테리움(5 x ACP)의 PUFA-PKS와 비교하여, 10 ACP 도메인을 가진 아실 운반 단백질의 수많은 반복을 가진다는 것은 더욱 놀랍다(도 3). 이는 울케니아 종으로부터 추출된 PUFA-PKS가 관련된 원생생물 치조키트리움의 PUFA-PKS로부터 상대적으로 아미노산 서열이 빗나갔을뿐 아니라 구조적으로도 독특하다는 것을 의미한다. 또 다른 특이한 점은 울케니아 종의 세 번째 ORF는 치조키트리움의 ORF C에 대해 상대적으로 38개의 아미노산이 단축되어 있으며, 치조키트리움에서는 존재하지 않는 알라닌-풍부 도메인을 추가적으로 포함한다는 것이다. 흥미롭게도, 이 서열은 현재 ORF 1의 독립적인 ACT 도메인들 사이에 있는 영역과 비슷하고, 어쩌면 링커 지역으로 나타난다. 알라닌 연속부분이 독립적인 프롤린과 발린에 의해 끼어들어갔다는 사실뿐 아니라 서열의 길이에 있어서도 유사성을 갖는다. ORF 3의 아미노산의 큰 부분, 30 아미노산 길이가, 탈수효소/이성화효소 도메인 사이에 있어서, 치조키트리움 ORF C와 비교하여 부족하여 결실이 있다는 결론을 내릴 수 있다(도 6). 결과적으로, 서로에 대한 짧은 간격에 있어서 상응하는 단백질에 위치하는 이러한 도메인은, 효소 활성에 영향을 줄 수 있다. ORF 3에 있어서, 5' 말단에 위치하는 ATG 코돈은 시작 코돈으로 생각할 수 있으며, 이론적으로 심지어 ORF 최대는 1848 아미노산 길이가 존재할 수 있다(서열번호 9와 80). 심지어 동시에 ORF 3의 변이를 일으키는 것이 이러한 관계에 있어서는 가능하다.
구체적으로, 울케니아 종의 ORF 1(서열번호 3과 6)은 베타 케토아실 합성효소(서열번호 14와 32)로 불리는 도메인을 포함하는데, 그것은 모티브(DXAC)에 의해 특징된다(서열번호 12와 30). 울케니아 ORF 1의 효소 도메인의 활성 중심지에 대한 이러한 모티브는 바람직하게는 17 아미노산(GMNCVVDAACASSLIAV 서열번호 11과 29)의 영역으로부터 확장될 수 있다. 전체 베타 케토아실 합성효소 도메인은 N 말단(서열번호 10과 2)과 C 말단(서열번호 13과 31) 부분으로 구분할 수 있다. 베타 케토아실 합성효소 도메인의 생물학적 작용은 지방산 및/또는 PKS 합성에서 축화 반응의 촉매 작용이다. 아실 그룹은 치오에스테르 결합에 기초한 연장을 위해 효소 도메인의 활성 중심의 시스테인 그룹에 결합하며, 아실 운반 단백질의 말로닐 그룹의 탄소 원자 2로 CO2를 해리하면서, 몇 단계에 걸쳐 이동한다. 베타 케토아실 합성효소 도메인은 말로닐 CoA-ACP 전이효소 도메인(서열번호 15와 33)의 뒤에 온다. 이 도메인은 아실 운반 단백질(ACP)에 있는 4'-포스포판테테인 그룹으로의 말로닐CoA의 이동을 촉매한다. 말로닐CoA-ACP 이동효소 도메인은 또한 그것이 다른 선형 탄소 사슬로 분지하는 동안에 메틸-또는 에틸 말로네이트를 ACP로 이동시킨다. 그리고 나서 링커 영역은 알라닌-풍부 서열 부분(서열번호 16과 34)의 뒤에 오는데, 그것은 아실 운반 단백질 도메인(ACP 영역)의 10 반복을 (17-26 and 35-44) 포함한다. 이 ACP 도메인은 첫부분에 알라닌과 프롤린을 포함하는 링커 영역에 의해 서로 나뉘어진다. 각각의 ACP 도메인은 4'-포스포판테테인 분자(LGXDS(L/I))에 대한 결합 모티브에 의해 특징지어진다. 그 4'-포스포판테테인 분자는 여기서 모티브 내의 보존된 세린에 결합하는데, 그 ACP 도메인은 4'-포스포판테테인 그룹을 통해 지방산 및/또는 폴리케타이드 사슬의 성장을 위한 운반자로서 도움을 준다. 케토리덕테이즈(ketoreductase)(서열번호 27과 45)와 부분적으로 동일한 서열이 그 후에 따라온다. 이러한 도메인의 생물학적 기능은 3-케토아실-ACP의 NADPH-의존 환원에 있다. 이는 지방산 합성에 있어서 첫 번째 환원 작용을 나타낸다. 이 반응은 또한 폴리케타이스 합성에 있어서 연속적으로 일어난다(도 3을 보라).
울케니아 종의 ORF 2(서열번호 4와 7)는 또한 베타 케토아실 합성효소 도메인(서열번호 50과 58)과 함께 시작하는데, 그것은 모티브(DXAC)(서열번호 48과 56)에 의해 특징된다. 울케니아 ORF2의 효소 도메인의 활성 중심에 대한 모티브는 바람직하게는 17 아미노산(PLHYSVDAACATALYVL)(서열번호 47과 55)의 범위로 확장될 수 있다. 전체 베타 케토아실 합성효소 도메인은 N-말단(서열번호 46과 54)와 C-말단(서열번호 49와 57) 부분으로 나눌 수 있다. 메타 케토아실 합성효소 도메인에 대해 상응하는 이러한 도메인의 생물학적 활성은 ORF 1에 기술하였다. 케토합성효소(kethosynthases)는 연장에 있어서의 주요 부분으로 역할하며, 지방산 합성의 다른 효소보다 높은 기질 특이성을 보여준다. 이는 베타 케토아실 합성효소 도메인에 대한 더 작은 부분의 동일성을 가지는 서열 부분에 의해 다시 이어진다. 또한, 이 도메인은 활성 중심에 대한 모티브 DXAC가 결실되어 있다. 그것은 타입 Ⅱ PKS-유사 시스템(서열번호 51과 59)으로부터의 사슬 길이 인자(CLF)로 불리는 특성을 가진다. CLF 아미노산 서열은 부분적으로 케토합성효소와 동일하지만, 시스테인 그룹에 상응하는 활성 중심은 갖지 못하는 특성이 있다. PKS 시스템에서 CLFs의 부분은 현재 쟁점이 되는 부분에서 논의되고 있다. 최근 결과에 의하면, CLF 도메인 부분은 말로닐 ACP의 카르복시이탈반응에 있다. 생성된 아세틸 그룹은 연속적으로 베타 케토아실 합성효소의 활성 중심에 결합할 수 있고, 그래서 소위 초기 응축 반응의 기초로 나타난다. CLF-일치 서열은 또한 분자의 PKS 시스템 내의 load 도메인으로 발견된다. CLF 서열 특성을 갖는 도메인은 이전에 알려진 PUFA-PKS 시스템 모두에 존재한다. 이는 아실 이동효소 도메인(서열번호 52와 60)에 의해 이어진다. 이 도메인은 아실로부터 코엔자임A 또는 ACP도메인으로의 이동과 같은 수많은 아실 이동을 촉매한다. ORF 2의 종결 도메인은 산화환원효소(서열번호 53과 61)와 부분적으로 상동성을 보이며, 심한 엔오일 환원효소(enoly reductase)를 나타낸다. 엔오일 환원효소 도메인의 생물학적 활성은 지방산 합성의 두 번째 환원 반응에 존재한다. 그것은 지방산 아실 ACP의 이동된 이중 결합의 환원을 촉매한다(도 2 참조).
울케니아 종의 ORF 3(서열번호 5와 6)는 두 개의 탈수효소/이성화효소 도메인(서열번호 66, 68, 72 및 74)으로 구성된다. 두 개의 도메인 모두 직접적으로 시스테인에 인접한 "활성화 부분" 히스티딘을 포함한다(서열번호 69와 75뿐만 아니라 서열번호 67과 73). 이러한 도메인의 생물학적 기능은 H2O로부터 분리된 지방산 또는 폴리케타이드 분자로의 트랜스 이중 결합의 삽입과 그 후의, 시스 이성체 형태로의 이중 결합의 전환이다. 두 번째 탈수효소/이성화효소 도메인은 알라닌-풍부 지역(서열번호 70과 76)으로 융합되는데, 알려진 기능은 없으나 링커 영역으로 나타난다. 이는 울케니아의 엔오일 환원효소 도메인에 높은 부분의 상동성을 갖는 엔오일 환원효소 도메인(서열번호 71과 77)은 ORF 2에 이미 존재한다. 그것의 생물학적 기능은 이미 상기 기술된 엔오일 활성효소 도메인에 상응한다(도 2 참조).
울케니아 종의 ORF 1에 대한 시작 ATG 코돈의 앞에 프로모터 서열로 바람직하게는 2000bp(서열번호 62)가 존재한다. 특히 바람직하게는 시작 코돈 앞에 1500 bp, 더욱 바람직하게는 1000bp가 있다.
바람직하게는 2000bp(서열번호 63)은 ORF 1에 대한 종결 서열로서 종결 코돈 TAA 뒤에 존재할 수 있다. 더 바람직하게는 1500 bp, 더욱 바람직하게는 1000bp가 종결 코돈 뒤에 올 수 있다. ORF 1의 mRNA 합성에 대한 가능한 종결 시그널은, 염기 서열 AATAAA로, 종결 코돈 TAA 뒤의 412 bp가 존재한다.
바람직하게는 2000bp(서열번호 64)가 울케니아 종의 ORF 2에 대한 시작 ATG 코돈의 앞에 프로모터 서열로 존재한다. 더 바람직하게는 1500 bp, 더욱 바람직하게는 1000bp가 시작 코돈 앞에 올 수 있다.
바람직하게는 2000bp(서열번호 65)가 ORF 2에 대한 종결 서열로서 종결 코돈 TAA 뒤에 존재할 수 있다. ORF 2의 mRNA 합성에 대한 가능한 종결 시그널은, 염기 서열 AATAAA를 갖는 것으로, 종결 코돈 TAA 뒤에 1650 bp가 존재할 수 있다.
바람직하게는 2000bp(서열번호 78)가 울케니아 종의 ORF 3에 대한 시작 ATG 코돈의 앞에 프로모터 서열로 존재한다. 더 바람직하게는 1500 bp, 더욱 바람직하게는 1000bp가 시작 코돈 앞에 올 수 있다.
바람직하게는 2000bp(서열번호 79)가 ORF 3에 대한 종결 서열로서 종결 코돈 TAA 뒤에 존재할 수 있다. ORF 3의 mRNA 합성에 대한 가능한 종결 시그널은, 염기 서열 AATAAA를 갖는 것으로, 종결 코돈 TAA 뒤에 4229 bp가 존재할 수 있다.
예를 들면 DHA와 같은 PUFA는 E.coli와 같은 숙주에서 이종적으로 뿐만 아니라 울케니아 종에서 상동적으로, 본 발명에서 검출된 서열 정보를 사용하여 생산할 수 있다. 본 발명에 따른 핵산 서열은 PUFA의 생산을 증가시키는데 사용할 수 있는데, 예를 들면, PUFA-생산 유기체에서의 수많은 PUFA-PKS 유전자를 증대하는데 사용될 수 있다. 당연히, ACP 도메인을 코딩하는 서열 부분과 같은 독립적인 핵산 부분은, 상동의 또는 이종의 생산 유기체에서도 증식시킬 수 있다. 구체적으로, ACP 도메인은 생산을 증대시키기 위해, PUFA 합성에 필수적인 공동인자 4-포스파판테테인에 대한 결합 부분으로 스스로를 나타낸다. 당연히, 프로모터, 종결부위와 인핸서와 같은 다른 조절 인자의 사용은 유전학적으로 수정된 PUFA 생산자의 생산을 증대시키는 결과를 가져올 수 있다. 독립적인 서열 부분의 유전적 변이는 결과물의 구조적 변화와 서로 다른 PUFAs의 생산이라는 결과를 가져올 수 있다. 더구나, PUFA 합성효소의 폴리케타이드 합성효소에 대한 유사성은 혼합된 시스템의 해석을 가능하게 한다. 이는 소위 결합성 있는 생합성은 새로운 인공 바이오액티브 물질의 생산을 가능하게 한다. 예를 들면, 형질전환된 미생물에서 생산된 새로운 폴리케타이드 항생제로는 PKS-와 PUFA-PKS 단위의 혼합에 의한 것을 생각할 수 있다.
PUFA 유전자의 이종 조직의 발현에 적합한 숙주세포로는, E.coli 이외에도 사카로마이스세레비시애(Saccharomyce cerevisiae)와 피치아파스토리스(Pichia Pastoris)와 같은 효모, 또는 아스퍼길루스니둘란스(Aspergillus nidulans)와 아크레모니움키리소게눔(Acremonium chrysogenum)과 같은 단섬유 균류가 있다. PUFA-생산 식물은, 예를 들면, 콩, 포도, 해바라기, 아마 또는 그 이외의 것으로 바람직하게는 오일이 풍부한 식물에, 본 발명에 따른 유전자를 도입함으로써 생산할 수 있다. 효과적인 이종조직에서의 PUFA 유전자의 발현을 위해, 예를 들면, 4-포스포판테테인 이동효소와 같은 다른 보조 유전자도 또한 사용될 수 있다. 또한, 숙주-특이적인 프로모터/오퍼레이터 시스템이 유전자 발현을 강화하거나 유도하기 위해 사용될 수도 있다.
원핵 생물의 발현 시스템의 대다수가 이종조직에서의 PUFA 생산을 위해 사용될 수 있다. 또한, PUFA 유전자, 프로모터, 리보솜 결합 사이트와 전사 종결인자를 포함하는 발현벡터를 조작할 수 있다. E.coli 트립토판 합성과 람다 파지의 프로모터의 프로모터/오퍼레이터는 E.coli의 그러한 조절 요소의 일례이다. 마찬가지로, 엠피실린, 테트라시클린 또는 클로람페니콜에 대해 내성을 갖는 것과 같은 선택표지가 적절한 벡터에 사용될 수 있다. E.coli로의 형질변환에 매우 적합한 벡터로는 pBR322, pCQV2와 pUC 플라스미드와 그의 유도체가 있다. 이러한 플라스미드는 박테리아 요소뿐 아니라 바이러스성 요소도 포함할 수 있다. 예를 들면, JM 101, JM109, RR 1, HB101, DH1 또는 AG1와 같은, E.coli K12에서 유래된 모든 계통이 E. Coli 숙주 계통으로 사용될 수 있다. 당연히, 모든 다른 통상의 원핵 생물의 발현 시스템에서 이종조직의 PUFA 생산에 대해 사용될 수 있다(Sambrook et al. 참조). 숙주 시스템으로서 오일-형성 박테리아의 사용도 가능하다.
예를 들면, 효모와 같은 균류뿐 아니라 포유동물, 식물과 곤충 세포도 진핵생물 발현 시스템으로 사용될 수 있다. 효모 시스템의 경우에 있어서, 해당 과정에서의 효소 유전자에 대한 전사 개시 요소도 사용될 수 있다. 이는 알코올 탈수소효소, 글리세롤 알데하이드-3-포스페이트 탈수소화효소, 포스포글루코이소머라아제(phosphoglukoisomerase), 포스포글리세레이트 키나아제 등의 조절요소를 포함한다. 그러나, 산성 포스파타아제, 락타아제, 메탈로티오네인 또는 글루코아밀라아제와 같은 유전자로부터의 조절 요소도 사용될 수 있다. 프로모터도 발현을 강화하거나 유도하기 위해 사용될 수 있다. 갈락토오스에 의해 유도되는 프로모터(GAL1, GAL7 and GAL10)가 또한 특히 사용될 수 있다(Lue et al. 1987 Mol. Cell. Biol. 7, p. 3446 ff. and Johnston 1987 Mircobiol. Rev. 51, p. 458 ff.). 바람직한 3' 말단 서열은 또한 효모로부터 유래된 것이다. 시작 코돈(ATG) 주위에 인접한 뉴클레오타이드 서열이 효모 유전자의 발현에 영향을 주기 때문에, 효모의 효율적인 전사 개시 서열이 또한 바람직하다. 효모 플라스미드를 사용하는 경우, 효모의 복제 기원을 포함하는 선택 표지를 포함한다. 이 선택 표지는 예를 들면, LEU, TRP 또는 HIS와 같은 영양요구성 표지인 것이 바람직하다. 그러한 효모 플라스미드는 소위 YRps (Yeast Replicating plasmids), YCps (Yeast Centromere plasmids) 및 YEps (Yeast Episomal plasmids)라고 불린다. 복제 기원이 없는 플라스미드는 Yips (Yeast Integrating plasmids)로, 그것은 게놈 내에 전환된 DNA를 통합하는데 사용된다. pPICZ 플라스미드 뿐만 아니라 pYES2 와 pYX424플라스미드도 적합하다(special interest).
만약, 아스퍼길러스 니둘란스(Aspergillus nidulans)와 같은 단섬유 균류가 이종조직의 PUFA 생산자로 사용된다면, 상응하는 유기체의 프로모터를 사용할 수 있다. 강화된 발현을 위한 gpdA 프로모터와 발현 유도를 위한 alcA 프로모터가 그 예로 사용될 수 있다. pHELP와 같은 효모 플라스미드(D.J. Balance and G. Turner (1985) Development of a high-frequency transforming vector for Aspergillus nidulans. Gene 36, 321-331) 와 ura, bio 또는 paba와 같은 선택 표지를 사용하는 것이 단섬유 균류의 형질전환에 있어서 바람직하다. 심지어 단섬유 균류의 3' 조절 요소도 바람직하다.
곤충 세포에서의 PUFA의 생산이 바큘로바이러스 발현 시스템에 의해 일어날 수 있다. 그러한 발현 시스템은 예를 들면, Clonetech 또는 Invitrogen에 의해 경제적으로 가능하다.
아그로박테리움 또는 콜리플라워 모자이크바이러스(CaMV), 제미니바이러스(Geminivirus), 토마토 골든 모자이크바이러스 또는 타바코 모자이크바이러스(TMV)의 Ti 플라스미드와 같은 벡터가 식물 형질전환을 위해 사용될 수 있다. 바람직한 프로모터로는 CaMV의 35S 프로모터가 있다. 식물의 형질전환을 위한 더 큰 실현성을 위해 칼슘 포스페이트 방법, 폴리에틸렌 글리콜 방법, 미량주사(microinjection), 전기천공법 또는 원생생물의 리포펙션(lipofection)이 사용될 수 있다. DNA-전하를 띈 극미립자(유전자 총)로 충격을 가하여 형질전환하는 것이 또한 바람직하다. 식물에서의 대안 PUFA 생산이 엽록체의 형질전환의 결과 가능하다. 예를 들면, N-말단 리더 펩타이드는 엽록체의 단백질의 이동을 가능하게 한다. 바람직한 리더 펩타이드는 루비스코(ribulose biphosphate carboxylase)의 작은 서브유닛으로부터 유래한 것이지만, 다른 엽록체 단백질의 리더 펩타이드도 사용될 수 있다. 다른 가능성은 엽록체 게놈의 안정적인 형질전환에 의해 제공된다. 구체적으로 생탄도법(biolistic)과 다른 방법에 이를 위해 고려될 수 있다(Blowers et al. Plant Cell 1989 1 pp. 123-132, Kline et al. Nature 1987 327 pp. 70-73 and Schrier et al. Embo J. 4 pp. 25-32).
포유동물 세포에 대해서도 경제적으로 가능한 발현 시스템이 사용될 수 있다. 다른 것들 중에서, 바이러스성과 비-바이러스성 형질전환과 예를 들면, 렌티바이러스 또는 아데노바이러스 시스템 또는 Invitrogen의 T-Rex 시스템과 같은 발현 시스템이 실시예에서 사용될 수 있다. Invitrogen의 Flp-In 시스템이 또한 포유동물 세포에서의 목적 DNA의 삽입을 위해 존재한다.
본 발명에 따른 방법을 구성하는 핵산과 아미노산을 다음의 몇몇 실시예를 사용하여 기술하였다. 그러나, 그 서열과 본 발명은 이러한 실시예에 한정되지 않는다.
도 1은 울케니아 종 게놈의 PUFA-PKSd의 위치에 대해 나타낸다. 또한, 이러한 유전자에 의해 암호화되는 PUFA-PKS의 개별적인 도메인에 대해 나타낸 것이다(KS: 케토 합성효소, MAT: 말로닐-CoA:, ACP: 아실 전이효소, ACP: 아실 운반 단백질, KR: 케토 환원효소, CLF: 사슬 길이 인자, AT: 아실 전이효소, ER: 엔오일 환원효소 및 DH: 탈수효소/이성화효소).
도 2는 모리텔라 마리나(GenBank accession no.: AB025342.1) , 포토박테리움 프로펀덤 SS9(GenBank accession no.: AF409100), 슈와넬라 종. SCRC-2783(GenBank accession no.: U73935.1) 및 치조키트리움(GenBank accession nos.: AF378327, AF378328, AF378329).의 동족의 ORFs와 상응하는 울케니아 종의 ORF 2와 ORF 3을 비교한 것을 나타낸 것이다. 진화 단계에 있어서의 독립적인 ORFs 사이의 유전자 전위를 또한 도메인 구조와 함께 나타낸 것이다.
도 3은 모리텔라 마리나(GenBank accession no.: AB025342.1) , 포토박테리움 프로펀덤 SS9(GenBank accession no.: AF409100), 슈와넬라 종. SCRC-2783(GenBank accession no.: U73935.1) 및 치조키트리움(GenBank accession nos.: AF378327, AF378328, AF378329)의 동족의 ORFs와 상응하는 울케니아 종의 ORF 1을 비교한 것을 나타낸 것이다. ACP도메인의 수와 연속 아미노산 서열 LGIDSIKRVEIL의 반복이 강조되었다.
도 4는 치조키트리움의 ORF와 울케니아 종의 ORF1의 비교서열을 나타낸 것이다. 두 서열의 부분적 상동성 정도는 대략 81.5%이다.
도 5는 치조키트리움의 ORF B와 울케니아 종의 ORF 2의 비교서열을 나타낸 것이다. 두 서열의 부분적 상동성 정도는 대략 75.9%이다.
도 6은 치조키트리움의 ORF C와 울케니아 종의 ORF 3의 비교서열을 나타낸 것이다. 두 서열의 부분적 상동성 정도는 대략 80.0%이다.
도 7은 데이터뱅크 서열(Swiss-PROT All library)과 실시예 1에 개시된 PCR 산물을, FASTAX를 수행하여 비교한 서열을 나타낸 것이다.
도 8은 실시예 2의 코스미드 뱅크를 생산하기 위해 사용했던, 코스미드 SuperCosl(Stragagene)의 벡터 카드를 나타낸 것이다.
도 9는 데이터뱅크 서열(Swiss-PROT All library)과 실시예 3에 개시된 PCR 산물을 BLASTX를 수행하여 비교한 서열을 나타낸 것이다.
실시예
실시예
1:
울케니아
종
SAM2179
로부터 분리된
DNA
의
PUFA
-
PKS
-특이 서열의 증폭
1.1
PUFA
-
PKS
를 코딩하는 유전자를 포함하는 게놈
DNA
의 분리
50 ml DH1 배지(50 g/l 글루코오스; 12.5 g/l 효모 추출물; 16.65 g/l Tropic Marin; pH 6.0)를 250 ml 삼각플라스크에 흐름막이를 사용하여 울케니아 종. SAM 2179(Ulkenia spec BP-5601; WO9803671)와 섞어넣고, 28 ℃, 150 rpm에서 48 시간 동안 배양한다. 그리고 세포를 계속적으로 수돗물에서 세척하여 원심분리하고, -85 ℃에서 세포 침전물을 냉동시킨다. 세포 침전물의 정밀 검사를 위해, 막자사발에 옮기고 액체 질소를 넣고 막자로 빻아, 고운 가루로 분쇄한다. 그리고 나 서, 가루로 만들어진 세포 물질의 대략 1/10th를 2 ml 용해 버퍼 (50 mM tris/Cl pH 7.2; 50 mM EDTA; 3% (v/v) SDA; 0.01% (v/v) 2-머캅토에탄올)와 혼합하고, 68 ℃에서 1시간 동안 2 ml의 페놀/클로로폼/이소아밀알콜(25:24:1)를 계속적으로 첨가하면서 배양하면서 흔든 다음, 100000 rpm에서 20분 동안 원심분리시킨다. 상층의 액상을 제거한 후, 600 ㎕의 새로운 반응 용기로 각각 옮기고, 각각에 페놀/클로로폼/이소아밀알콜(25:24:1)를 혼합하여, 흔든 다음 13000 rpm에서 15분 동안 원심분리시킨다. 각각의 상층액 400 ㎕를 새로운 반응 용기에 옮기고, 1 ml 에탄올(100%)을 매번 첨가하면서 2번 내지 3번 전화시킨다. 그리고나서, 침전된 DNA를 유리 막대기로 감아서, 70% 에탄올로 세척하여 건조시키고, 50 ㎕ H2Odist에 2 ㎕ RNase A와 혼합하여 용해시키고, 4℃에서 보관한다.
1.2 모티브-특정
올리고뉴클레오타이드를
사용하는
PCR
반응
PCR 프라이머 MOF1와 MOR1를 모티브-특정 올리고뉴클레오타이드로 사용하였다.
MOF1: 5'- CTC GGC ATT GAC TCC ATC - 3'(서열번호 81)
MOR1: 5'- GAG AAT CTC GAC ACG CTT - 3'(서열번호 82)
1.1 에서 기술한 울케니아 종. SAM2179로부터 얻은 DNA는 1:100로 희석한다. 이 희석된 용액의 2 ㎕ 는 50 ㎕ 부피의 PCR 반응 혼합물(1 x 버퍼 (시그마; dNTPs (각 200 μM); MOF1 (20 pmol), MOR1 (20 pmol)과 2.5U Taq-DNA 중합효소 (Sigma) 로 옮긴다. PCR은 아래의 조건에서 수행된다: 처음 변성은 94 ℃에서 3분, 94 ℃에서 1분 동안, 각각 30 순환을 계속적으로 수행하고, 55 ℃에서 1분, 72 ℃에서 1분, 그리고 마지막으로 72 ℃에서 8분 동안 수행한다. PCR 산물은 겔 전기영동으로 분석하고, 적절한 크기의 단편은 벡터 pCR2.1 TOPO에 T/A 클로닝(Invitrogen)한다. E. coli TOP 10F'에 형질변환시킨 뒤, 플라스미드 DNA를 분리하고(Qiaprep Spin, QUAGEN) 서열화한다.
얻어진 서열 데이터(서열번호 1)를 공식적으로 얻은 EMBL Nucleotide Sequence Database(http://www.ebi.ac.uk/embl/)와 비교하고 평가하였다. 울케니아 종. SAM 2179의 주요 PCR 산물에 대해 FASTAX로 얻은 비교 서열을 얻었는데, 이는 치조키트리움 종. ATCC 20888(도 7)로부터 PUFA-PKS(ORF A; ORF: 열린 해독틀)의 아실 운반 단백질과 부분적으로, 아미노산 레벨에서 대략 90% 동일하다. 울케니아 종. SAM 2179에서의 PUFA-PKS를 검출하기 위해 단 한 번의 PCR 실험만 수행하는 것은 놀라운 일이다. 이는 사용된 올리고뉴클레오티드의 높은 유효성을 보여준다.
실시예
2:
울케니아
종.
SAM
2179의 유전자
DNA
로부터의 유전자 뱅크의 생산.
울케니아 종. SAM 2179의 유전자 DNA 50 ㎍를 2.5U Sau3AI의 500 ㎕ 내에 부분적으로 쪼개서 37 ℃에서 2분 동안 연속적으로 직접적으로 동일 부피의 페놀/클로로폼에 침전시킨 다음, H2Odist에서 에탄올로 침전시킨다. 그리고나서 생산자의 지 시에 따라, 쪼개진 Sau3AI에 유전자 DNA를 SAP(Shrimp Alkaline Phosphatase; Roche) 로 dephosphorylate 시킨다. 65 ℃에서 20분 동안 가열하여 계속적으로 효소를 불활성시킨다. 코스미드 Supercos I (Stratagene, figure 8)를 벡터로 사용하였다. 10 ㎍ Supercos I를 37 ℃에서 XbaI로 몇 시간 동안 완전히 분리시킨다. 효소를 65 ℃에서 20분 동안 가열하여 불활성화시키고, 생산자의 지시에 따라 65 ℃에서 20분 동안 반응 용기를 가열하여 효소를 불활성시킨다. XbaI 조각과 dephosphorylate된 Supercos I 코스미드를 37 ℃에서 몇 시간 동안 BamHI로 완전히 분리시킨다. 제거된 코스미드 DNA는 페놀/클로로폼으로 침전시키고, H2Odist 에서 에탄올로 계속적으로 침전시킨다. 1 ㎍ 코스미드 DNA를 연결시키기 위해, XbaI 와 BamHI 로 분열한 뒤, 3.5 ㎕ Sau3AI 분열된 유전자 DNA를 부피 20 ㎕ 내에서 결합하고, 몇 시간 동안 생산자의 지시에 따라 T4 리가아제(Biolabs)로 연결시킨다. 결합 batch의 대략 1/7th를 생산자의 지시에 따라, 연속적으로 파지 내에 Gigapack III XL Packaging 추출물(Stratagene)을 사용하여 넣는다. 그리고나서E.co.i XL1-Blue MR의 트랜스펙션을 위해 사용하였다. 유전자뱅크로부터의 PUFA-PKS-특이적 코스미드의 추출은 Ulkenia-PKS-특이적 올리고뉴클레오타이드 PSF2: 5'- ATT ACT CCT CTC TGC ATC CGT - 3'(서열번호 83)와 PSR2: 5'- GCC GAA GAC AGC ATC AAA CTC 3'(서열번호 84)를 사용한 PCR 스크리닝의 형태로 QIAGEN company (Hilden, Germany)에 의해 연속적으로 일어났다. 검출된 코스미드 클론 c19f09의 코스미드 DNA를 그 후에 추출하고 서열화하였다(서열번호 1).
실시예
3:
울케니아
종
ORF
3의 동정
울케니아 종. SAM 2179의 ORF 3를 동정하기 위하여, 올리고뉴클레오타이드를 서로 다른 PUFA-PKS의 고도로 보존된 서열 부분으로부터 얻었다. 흥미롭게도, PCR 증폭에 적합한 고도의 부분적 상동성이 개체 종 사이의 탈수효소/이성화효소를 암호화하는 서열 부분 내에서 나타났다.
3.1
PUFA
-
PKS
를 암호화하는 유전자를 포함하는 게놈
DNA
의 추출
실시예 1.1을 보라.
3.2
PUFA
-
PKS
-특이적
올리고뉴클레오타이드를
사용한
PCR
반응
다음은 PUFA-PKS-특이적 올리고뉴클레오타이드로 사용된 PCR 프라이머이다:
CFOR1: 5'-GTC GAG AGT GGC CAG TGC GAT -3'(서열번호 85)
CREV3: 5'-AAA GTG GCA GGG AAA GTA CCA -3'(서열번호 86).
3.1에서 기술된 울케니아 종. 2179의 게놈 DNA를 1:10의 비율로 희석하여, 희석된 용액의 2 ㎕ 는 50 ㎕ 부피의 PCR 반응 혼합물(1 x 버퍼 (시그마; dNTPs (각 200 μM); CFOR1(20 pmol), CREV(20 pmol)과 2.5U Taq-DNA 중합효소 (Sigma)로 옮긴다. PCR은 아래의 조건에서 수행된다: 처음 변성은 94 ℃에서 3분, 94 ℃에서 1분 동안, 각각 30 순환을 계속적으로 수행하고, 55 ℃에서 1분, 72 ℃에서 1분, 그리고 마지막으로 72 ℃에서 8분 동안 수행한다. PCR 산물은 겔 전기영동으로 분 석하고, 적절한 크기의 단편은 벡터 pCR2.1 TOPO에 T/A 클로닝(Invitrogen)한다. E. coli TOP 10F'에 형질변환시킨 뒤, 플라스미드 DNA를 분리하고 (Qiaprep Spin, QUAGEN) 부분적으로 서열화한다.
얻어진 서열 데이터를 공식적으로 얻은 EMBL Nucleotide Sequence Database(http://www.ebi.ac.uk/embl/)와 비교하고 평가하였다. 울케니아 종. SAM 2179의 주요 PCR 산물에 대해 FASTAX로 얻은 비교 서열을 얻었는데, 이는 치조키트리움 종. ATCC 20888(도 9)로부터 PUFA-PKS(ORF A; ORF: 열린 해독틀) 합성효소의 ORF C와 아미노산 레벨에서 대략 80% 상동성을 갖는다. 울케니아 종. SAM 2179에서의 PUFA-PKS를 검출하기 위해 단 한 번의 PCR 실험만 수행하는 것은 놀라운 일이다. 이는 사용된 올리고뉴클레오티드의 높은 유효성을 보여준다. 실시예 2에서 기술된 유전자 뱅크로부터 PUFA-PKS-특이적인 코스미드의 추출은 QIAGEN 회사(Hilden, Germany)에 의해, PCR에 이미 사용된 올리고뉴클레오타이드 CFOR1: 5'- GTC GAG AGT GGC CAG TGC GAT - 3'(서열번호 85)와 CREV3: 5' - AAA GTG GCA GGG AAA GTA CCA - 3'(서열번호 86)를 사용한 PCR 스크리닝의 형태로 수행한다. 코스미드 클론 058G09의 코스미드 DNA를 그 후 추출하고 서열화하였다(서열번호 2).
<110> Nutrinova Nutrition Specialties and Food Ingredients GmbH
<120> PUFA-PKS Gene from Ulkenia
<130> IPA9610-696
<160> 86
<170> PatentIn version 3.1
<210> 1
<211> 43372
<212> DNA
<213> Ulkenia sp.
<400> 1
ggatccacag cgttcattta ctcaagatca cactcgtgtg cagtccttga accttgggaa 60
agctcatgtc tctaggtatt gctgtcatgg tttgaaattt tgtcctcaaa agaatcgctt 120
gtaatttttc acttggtggg gtgcacaatg gtctctcaga accatctgct ctaaggagtc 180
ctactgacac ctacctacca cccttccttc atacccatgc ctactaacca acctattgat 240
aactctaacc agggttctat gataggcaaa tcagccaatc tcccgtggaa attagtcttt 300
tcaatcgttg gccagcaagc accatcgcaa cgacagcgct gcatcagcag gaactcgagt 360
acgcttcacc gtcatcgtca tcggtatcac cactattcat gaaatcagaa cctagtcacc 420
cagttacttt ttacgaggca gttgattctg tggagagatg ctcctgatca atggatatgt 480
ctattttatc tacaggtcac acataatcaa tcattcgggg tcatgatttt ccgccatggc 540
gatagtccaa aaaaactcag gaggcaaaat cattgttcaa tttacaacta cccacggagt 600
aaattaatgt aagagctcca atttacaggc aggtatatca tcacggtgtg ctgcagtagg 660
ttctgggtta tcatcctcaa tcattcataa acataacatt cattcataaa cataacattc 720
attcattcat aaacataaca ttcattcatt cattcactca ttcactcatt cattcattca 780
ctcattaatc cgcttaattt aactttaaat tgattgattg attgattgat ggcagaacca 840
cctattagca attggttact ccttgtattg aaaggcctga ataagtaagc aagcaagcca 900
ttggtaaacc ttcctcgccg cgactcgagc gacctcgaga gcggtctgag tgagtctctc 960
acgcaggccc cccgcctcct gagccgtctg tctcgctcaa ctgaagctcc gacaagccaa 1020
gctcacagct gcaagcttgc aagcaagctc gcttctgtct actcgtcctg catcgaatca 1080
acaaccttct cttacgccat gacggacgcc tcttccgaga tgcgcaagcg taagcgctac 1140
gcataccgca tcctcactga tgagtcatcc tcctcccatg caccctctgc tgaggatggt 1200
tccgtgcagg actctcgtat gctccgccat gccggcagca tctgggatgc cgaagagcgc 1260
cgccgcgctg gcaaaatgtc ctcttccgca actgcagcca tgtccagtgt acctcctgga 1320
gaggaactct ggcttgtgtc tatccctgcg gacttcgacg cccatgacct caatggcctt 1380
cgcctgtctg ggaagaagcc cctcgcggac caagaaatcc aaattggcgc tacccacacg 1440
ctcactgctg acctgctctc gggctcttct caggtgcggt gcctgcgccc tactagctcc 1500
tatgtcaacg gcctgaggct tacaccgcct gccgcgcgtg ttttccacgt cgtagagcgt 1560
gatgccgctg atgatgaggc cagtgaagcg ggaggcagtg cccaagagga ggaggagcgc 1620
ctgcgcaagg ctgaagaggt cgtcaagaga cttttgccga agccgcgtga gcaaattgaa 1680
tttaggactt tttctatggc cgacaaagag gaactgctga agcgcatgca aaaggcaaag 1740
gcgcgtggag agaagaagag gggcagaaac gcgattaagg aagaagcaga agacgaggag 1800
gacaaggagg aagagaagtt ggtggccaag acagcaaaga aggacaagaa gaagggcaag 1860
aaggaaaagg agaaaaggcg caagtctgtg gcctgagctg gaaacccctt taaagtgaat 1920
aaaggctgtc ttgacatgtt caagaacgct tattcgatac atgaagacgt gctctggggt 1980
tatttcgatg aagcctgatc taaatactag tctgcttcag aatcatgcac agtgttcaaa 2040
ttgattctta actacagcct acgctgaagt tcagcttcaa attttggtct attttgaagt 2100
tcttcaccga aagtcatttc tagagtcccg ccccaaagtc tgatctacac tctctactcc 2160
attaccgcta atatccttta caactcttat ctttttcgac ttcttcaagc gctaaggagc 2220
ggaccactaa actgatgcaa gcttgcatca actctacgac cttttttatg tcaacacaag 2280
ttctggcctt acgctgaact cgtctctgat acacaatatg caacgaacac cgccaagacg 2340
gtcgctcatg cacatacgca cacatatata caaccaaaca tacaaataaa cacataagca 2400
ttggtcaagc cagctacagg accaatattc catcttttgc tgcttttctg caatttgggc 2460
cgctttttta tgtttggctg tatatatttt tcttggcatg caacctaaca agacacatga 2520
gcagaaaaaa taaatacggt caaagtcttg tctctgatgc tcatgtcttt cttctaatct 2580
taccagcgag aagacctttc taaagaataa tatcacatat actcaattgt ccaaattgct 2640
ttcaataagc attctttact ggatagctct cgccaaactg tcattcttag gaacactgct 2700
aatacgtggc tgaaagcact cccaacatgc acttttattc ctatgcattt tcttcttgga 2760
gctcaatttg acaaaatgcc ggtcgataag ctcgcggtct tgactttgat gcttacttcc 2820
ttgtttaact cgaaaacctt ctcatggctc attggaaaat catcaaatgg attatctatc 2880
atcttcactt aacccaattt ttgtttctct aaaacagccc caactatttt ttaaagaaat 2940
ttgtgtgctc tatcttctgt ttgcaactca aactaacaag ccacatcaac aaacatttat 3000
ttttttcaaa cttgataact ttagaccaac tttgcatcct cgatgctcgg gactccatct 3060
taccccttgt caggtatgaa gcatctgatg aagcttgcag tattattacc ttttccagaa 3120
cactactgct accttcaaag atttgttcat ttcttttctt tgggggaaac aatgaatgct 3180
gattacccga agcgtaatat ggttgttgca tatattcaaa tattttaaac cttctaagta 3240
tttatatgat aggtatatgt tatttttaaa gacctttaat gcagttattt catatcaata 3300
accaagctct cgcagttttg cgctgtactg gcagtggtgg aggacccgtt gatctttata 3360
aaataggatc actggaggaa ggtgagacca ggaaactaag actatataag tttgtgggtt 3420
tctgtcattg tcactgacaa ggatcaaagt tatcctaatg cagagcatcc aacctttgtc 3480
tcagggaccc acccaatcca ctcttcaagt tttcactttc aatttcaggc caatttaaga 3540
caggaataca actcaaacta aatcaggatt cttctttttt aactcccagt catgcgatct 3600
ttaaaattga tcacattgcc ggcataataa ccatgggttt cgcaacttcc tccctggttt 3660
ctttgccaaa taaaacttcc acacactcga gagcaaactc cattgccgtg ccaggccctc 3720
tagacgtcac aattttggcc tcgtgctcaa ccaccacgcg atcctctgaa catcctcctt 3780
gagcgctctc aagatctttc gcaaatgccg gatggcacgt ggctctgcgc cctttcacaa 3840
tacccaaagg tgctagcacc accgctggag cggcgcaaat tgcggcaacc caggctccgc 3900
gcgagttctg tgccaacaat agcgagcgga gaggctcgct cgcggcgaga tttgaagcgc 3960
cgggcatccc gccaggaacg attacgaggt cgaaagatgg agatgtagaa ctggcatcca 4020
ggaggtcatc caaacgaaca tcagcctcaa tacgcacgcc tcgagaacag gtgagggttt 4080
ttccgctatc gcaaacagcg gcaacaatca cggaggcacg ggctctgcgc aggacatcaa 4140
tcggaatcac gctttccatc tcttcactgc catccgccat gacaaccagc acggacggtg 4200
gggaggaagg ggaagaagac atcgtcgaat tatgggaaac gtcgagactg gagcaagcgg 4260
gggcgattgt ttaagcgagc acaaagtgac gaggaattga gttacaatgt gaatctatag 4320
ataaataggt acctgtgcct tgcgacgaca gaaagatatt ttctcataat aggcctatct 4380
aaaaccaata attttgaaca ttttcatcat tgacgaaaag ctcctgcctt ccaaattgga 4440
agtgactatc cttaatatag tgcaataacg cattggacca aacagaatcc tcctggaggt 4500
gaccaccatg ttaggacctt gaacttcgca attgattggt ttcgaccttt tctccctcct 4560
tttataaaat aagcggctca aattaattag cctatcacgg tttctctagt ttttgggggt 4620
ttcgctatta tttggttatt atgaacaaat gtacagcttc ttacttacca gcctcctcgt 4680
tcagcatggt gaatgcatga aataaggaat caacttcatg actcatgctc tgcgtacaac 4740
attagattat ttttgcatgt ggtgttgaaa gtaagtcttc aagtcttttt cgtcaggata 4800
aaaactttct ttcatttgaa gttgtatgca agtcgcacca agatgtgatg actattttgc 4860
ttttcattaa ctttcctttg cagcaaaaaa gctctgtgcc tatgaaagcg ttagaactta 4920
cttatataac ctccaaatgg tagtgactat tccacctaaa ttacatatca taatgattta 4980
agtctttgtt aaaaagtgga tgtttggtaa gaaactggaa taactaaggg accactaagc 5040
tccagacact acaagtgaag caaatcttca atttaaatta tcaaagtact tcaaccaaaa 5100
ttttagcgtc tcaacaagta cccttcgtgt gctatcccgg aggcaatcac atgtgcacaa 5160
gtaacgatgt tgaacgtacc tatggctctg gtttattttg gcagccatga gcaacgcaac 5220
actgaccgta tctttctcta cgctacaatg tcctccgcca agcaaaaaga gaatatccca 5280
gctcatttgc aaagccgaga ttttattcct gccagtggtg tcaactggtc atttacggag 5340
aggattgcac ttcaaagcca tgcaatgaat gtggtattat ccacgacaat cttggaaaat 5400
ccaagctttt aaaatgcccc aaaaccatgc aaacacgtag ccgatcgtga tatccacgcc 5460
ctccagctgc gccacctatc caaggacatg gtttaagaat tgtcgtttgg tcatatgtta 5520
gttttcaacc cgcaattggg ccttagtcca ccttgttacc ataggaaatg caagctttgc 5580
aaattttgta ggctaatctc taagtgtagc ttttgtcatt gtaaagacac aattcattga 5640
catgaggttg aaagctgttc tcatatgtaa caatccgcaa cattgactac gtcacatgtt 5700
cgtgcataga gggaacactt atcttgcata gtatgccctc acaactctcc tcccccgtac 5760
agcaatcgca cgcaccatca tttattcaaa tgagacaata cttgctatcg tcccgattgc 5820
tctttagttg gacatagaac taaatgcgcg tcgcgatgcg accggaaagg tttaccagca 5880
gactgttctg caatcgttcc gtaccctatt tcacaacatt agtcgatcga tcagaacaaa 5940
tcaagataga acctgcagga ggggtcgcgc aaagtttagg cacccaggca cagccgctct 6000
gtaagtggat tttcattcaa ttgtggtcct gtgcattcat tgtttgctcg tgtagcaaat 6060
agaaccacaa ggggttttgc agaaagaaaa caaggatcat ggggcgaaac cgaggccaga 6120
cggcgggacc actcgaccgc cagtcgaggt tcatgaccaa ggttctgcgg caccgcgcgg 6180
cagacatggg tcttgaaatg cgttcagatg ggtttgtgcg cgtagaagac cttctgaaac 6240
ttcagcaact taaagacatt ggccttgagg atgtcaaagc tattgttgct gctgataaca 6300
aacagcgatt tggccttcag caggaagagg accagacctg gtggattcgt gccaaccaag 6360
gtcactctat ggctagtgtc gagacagaag atcttcttga ggaggttgac ctcgatggga 6420
tttctctctg tttgcacggc acctatttgc ggttctggcc attgatagta cgcgatggtt 6480
taaagcgtat gcaacgtaac catatccact ttgcaacagg ccttcccggg gacgatggtg 6540
tccttagtgg atttcgcaac tctgctgagg tgcttattta tcttgatacc gtgcaggcga 6600
aaaaagctgg actcaaaatg tatcgctctg caaaccaggt gctcctaagt ccaggtcttg 6660
gcgacagtgg agtaatccct gtcaccttgt ttgctaaggc tgtcgagcgc cgctctggaa 6720
agctactttg gccaatagag gaaggtaaag agtcgcaacc ccctacagcg cctacttcag 6780
accaccaacc tcgacaagga caactagcaa gtaagcgaaa agctggtggc cacaacaaga 6840
aactatcgca catgcttagc cgtgtcctgc ggcactctgc agttgatgaa ggaatcacca 6900
ttcgtgaaga tggcttcgtg cgccttgaag atctccaaac caaactcaag cgtttcgaaa 6960
atgtaactct tgatgacgtt caagctgtgg tgcgtgacaa tgacaaacaa cgcttcacac 7020
tacgccagga gtcagacggg tcctggatta ttcgcgcaaa ccaaggtcat tccatggctg 7080
ttgtcaaaga atcttttctc ttgcgggaac ttgaccctac cacaattgat gtgtgtcttc 7140
atggtactta caaagaagct tgggcaaaga ttcgaaaaac tggtctctcg cgcatgaacc 7200
gaaaccatat tcactttgct cgtggattgc cctccgactc caatggtgtt atcagtggca 7260
tgcggaaatc atgcgaagta catctctata ttgatgcctc tgcagcaggc aaagatggga 7320
ttaaattctt tgaatctgac aacggtgtta tcttaagtcc tggtaatggt gatggcatta 7380
tccctcctaa atactttaag tctgtcacag atcgccaagg cgcttcctta gaaaacctaa 7440
aatgacaaat tatgtagatc ttagttgttg aggacttcat gtcctttttg ttgtttgatt 7500
ccttgtatag cttatacacc ctggttatgt acattgtcat tcttgttaga ggcaattctt 7560
catctttgat tgatattcta tagaacttcc tcatgggtgt acctatacac aattatttat 7620
tataccgtgt gatattgtga ggttctaaag ttagcatcgc ctctgacacc tatgatggat 7680
gcagagtgac gccaatcctt cctctatatt gtgcgtgcct gctcgagaat caaatgatgt 7740
taaaagtcgt cttcattcat tatataacag agcataatgg aataataaaa ggaggcagga 7800
gacaagggta cttctgttgt gtaaaattcc attactatgt tcgtgtatag tagtattcct 7860
tgcctttagg atagtaggga agatattctc tgtgactttc acctacttca ctcttatgca 7920
agctcttatg caatcacaga tggatgtaga ttccgcttct tcattctcac tacgagaaca 7980
gcgcaactac aaatcttaag gactgtcaac tggcctgaaa tagtgaccaa ttatatattc 8040
caaaataaat ttatttgtat aaaattgtaa agatgcagca tgatagctta ggtacacata 8100
aacaacggtt aagtgtatag ggatacgcaa acgcaagcga gaacatgcaa gcgagaccat 8160
cgcctttcac cataatgtta taaatgtcta ttcttctgcc aagagcacga tacactcaac 8220
gttggtctaa gcactaaaga cagcatgtat ttatgtaagg acaacaacaa gcacctatac 8280
ctcaaaactt agtaataggc ttactaaaca ttctaacact atgatcttca tgtgaaaata 8340
ctcagcagca tggatgttga agctccacaa atggaataca gaaaacacaa tctagcaaga 8400
cgatgaaaat tgttcttagg tttcaggatc agaataacca aaatgcgcac cacacctgtt 8460
tctgatgctg tagctgtcat gttatggtaa aaacgtgcac agggcaccac tagcctgtta 8520
ttgtgtcgat tttgatacag tttatcacac gagagcttac tgactatgtt gtagaatgta 8580
aataccctat tcaaataacc ttgtggacac actcatccaa catactctac tcaactctta 8640
ctaaaacaac caaaagattc cgctgaacta gaccaaaata atttgagtga tatgctgcaa 8700
ttcgtttgaa cacaatacat gtattgatgg ctgagatatg acttgccaaa gattgttcgt 8760
tgcaattaaa gtttactctc tgagtgcata tactcaatac aatgcagctt tatcgtggaa 8820
atccgggcta agcatgccat taggacccta tagcaggctc tgggcacgat ctttatatct 8880
tagcgatagt ttgtgcagca aaataatgga taaatcaaac ttcaacgagt cttaattcat 8940
agtttcgaat ccctacgagg ctatatatat aaagaaggtg tgagtcgaca gcacagttat 9000
gtaggaaaag ttataattat gtggaaaata accttagttg tcgaatcgtg gtgaataaaa 9060
gcttcattta agcgttttca gagatgccgg agcccatacc aaatattaat ttgctcaaag 9120
tcatcaattt cttatttgat agaatctaaa acagctttat attatatgaa gagcatatat 9180
attttaagct agtttagact tcaaccaagg ggatccaatt ttcgctcgtc actctgcgtc 9240
aaggtcgttt gcaaaaacat caaatctggt gcaagctcaa atgactaggg tcaataagga 9300
ctcctactaa ttatagttgt cactattatt tccactagga accgataaaa cagatgtaat 9360
taactctctt ggcgcttacc ttgtatagca agagtaaaga gtaaatgatg cggcaaaaac 9420
tatctctgtt acttatatgt tatagagtgc attggctgcg ccatgccata tgatagtagg 9480
taaactttgg aagttgaaag gggcgagaaa gggatcacag gtgatctata tataaaatgc 9540
aaatgaaaat tttaaagttt ggaaagttta tatgcgacac ataaaattat aatttgcata 9600
tgtggattaa gtgaatggaa tgagtctagc tataactact acctatccct atcataatca 9660
tgggaacaga tcaggagcaa attgggctta caggcgctca gtgggcacgt agatgtcatc 9720
aatctcggca gcaacctgct tggcgttagc cttcagcggg gcattacgga cagcttcgag 9780
gcggcgcaag aagcaggcac cacggaggat ctgcaagttg atttgcacaa catcggggta 9840
ctcgttggca acggcggggt caaggtaggt acccttgatg aagtcgttga aagatccaat 9900
cgctgggcca caccaaacct ggtagtccat ggcacggtcc gggatgccag cgtttgccca 9960
gaagctcgcc aaaccaaggt accagcggaa gcacaaggac atcttaagct tggggtcacg 10020
ctccgcgcgc tcaatcttct ccgggttctg caacctgttg atgtagaagt ccttggtctc 10080
ttcccaaact tctgacagag acttcttgaa aatgcgcttc tccacacgtt ccagctctcc 10140
aggagccatg gactcaaagg agtcatactt gacgaagagc tcatagagct tgttggcacg 10200
cgaggggaac atagttccct tcttgagcac ctggagcttg acaccttcct caaacatgtc 10260
agctgctggg gccatgcaga tgtcggagta ggtggcttgt gagagctgct tgcgaacggt 10320
gtcacaggtt ccagcttgct tactcatctg gtttacggta ccagtgacga tgaaggccgc 10380
gcccatgttg aaggtggcaa tggcggcctg agggcatcca atgccaccac cagcaccaac 10440
gcgaacgcga aggtgggcag ggtagccgca ctccttgtgc agacgatcac ggaggttgac 10500
aatgagaggg aggatgacgt ggatggggcg gttatcggtg tggccaccgg agtccgcctc 10560
aacggcaatg tcgtctgcca caggcactgt gcgtgcgaga gcagcctgct cttgggtgat 10620
ctcgccggac ttcagcagct tctcgaggag attctcgggc gcgggacgga taaacattgc 10680
ggcaagctct gtgcgagaaa ccttaccgat gacgcggttc ttaataaccg tggagccatc 10740
agcagcgcga gagagacctg cagcacggta gcgcacgagc tgcggggtca aggtcataaa 10800
ggcggaggct tcaacgacag tgacgccctt ctcgaggaag aggtcgacgt tacccttctc 10860
gaggttgctg tcgaagggag agtggatgag gttgacagcg taagggccct tgggcagttc 10920
agcctggata gcttcgagag ccttgcgtac ggtggcgata ggaagaccac cagcaccgag 10980
agaaccaagg atgccgcgct ttccggcagc gataaccatc tcagcggatg caatgccctt 11040
tgccatggcg ccggtgtaca tgggggcgga tacaccatat gtctccatga aggcacggct 11100
gccaagatcc ttgatatcgc acttgggcac aacaatagat gcttcacttg ggcttgcttc 11160
aacgagatca ccgttggcgt tgacaccaag catcaaagtg ctgttgagct ccaaaagttt 11220
ggcacggaga gcctcagagg aagccacaac agctccggag acgctgcggg ccggagcagc 11280
aggggcaagg gcaggagcag cggaaggttt gttatccttg ttgagaatag ggtcgcgtgt 11340
ggcgtcttgc tcctgaatgt cgagacgctc catgaacttg ggggcaatag gctgcatctt 11400
gcgagcctgg ataagagcct cgatcttggg gtccgcagga ggaagcttag ctagcacctg 11460
gggcggcacg agctgctttt tggggtcata gcgaccattg accacaatct tacgcaagaa 11520
cttgttctta gtaggcttct tgccagccac catatcgttg taactctgcg tagcctcctc 11580
aacagtctcg gggtggtaca gaggggagac cttcacgcca ggcacgcggt gggcttggag 11640
agaggcaacc agcttgacca tggttgtcca agcattctcg ttctggcggt ccatggatcc 11700
ggtgacaaaa ggcttgctat ttccaagggt ggcgcgaatt gcggcgctac ggtgggcgtt 11760
gggaccagtc tcaacaaaga cgtcaaagtt cttgtcgcta acggtcttgg cgatcttagg 11820
aaagtctgcc tgaacagtgt acagctgtgc tgcgtattca ccaaagctgg gtgcgtactc 11880
gtcgctggct ccagtggact tgttaacaag cttcttctgg ttgacgctcg tgtacaggtc 11940
aaggccggca acctcgggaa tctcgaggac gctatggatc tcagcgatct gcttgccgta 12000
cggctcgacc acggggcagt ggccacacat accaaggtcc acgggcaaag cagggaggtt 12060
gctgctcagg cgagcaatgg cagccttgca atcttcaggc ttgccactga tgagagcact 12120
gttggcatcg ttgacaatgg tcaagtgcac gtacttattg ttggggccga tggccgcttc 12180
aacggcctcg cgggttccac gtaccacgta tccttgccag aactcgctga caggggtatc 12240
ttggggaata ttccaggcct tgcggagggc gtcaaactca acagcgaggg ccttacgcca 12300
gacctccgag ttgcggagtt tagttgtcag ctcctcagag acaaggccgt tcttctcaga 12360
aaaggcaaaa accatggaaa tctctccaag gctcagtccg aaagcagcct tgggctggat 12420
gccaagcacg tcgcgagcga tgtgggtgaa gcacatggac atgagaatac cgagtcggaa 12480
catctccacc tggttgcggt tgaactcatc ttcctgcgcc ttaagctcct ccttcgtcga 12540
ggcgcgcggg atcaaccatc tgtcgccttg atcccaaagc ttgttggtct tggcgtttac 12600
aaactcgtga agttcgggcc agatgcggtg aatgtcaagg ccgataccat agtaagggct 12660
tcggccttcg ccgtacataa acgcaacgcg atcgcttgac agtggcttgg gtgcaaagtg 12720
gctgcccgag ggtgatgtcc agtcgcggcc catcttaaga ctccgcggga tgcccttgga 12780
ggcgagttca agctccttct ggagcttact aggagaggtc accaggcaca gagcgaaggc 12840
cggcaacggg gtcttggtct cctgggcaat gctctcgccg agcaactcca taaaagcaag 12900
acgtacatta gcgctaggct gggcgaggcg ctcgcggagc ttgtcaacac gctgcgtgat 12960
agcgtcatgg gagtctccgc ggattacgag gagtttgacg gcatcgtcat cgagcgaaat 13020
gcggctcttg gtctcgtggt ggccctccac atcagagagc agcaccgtgt agcatgaacg 13080
ggtctcggaa acacctgaga cagctgcgtg gcggcgagct ccagggttct tcaaccaggc 13140
ccgcgaggac tggcacgcgt acagagactt gccccactgt gtctcaggtg caggctcctc 13200
ccaggaggcg ccgtttgagg gcaagtagcg gttgtacaga cagagagccg tcttgatgag 13260
actggcagct cctgaggcgt agccggtgtc accgacagtg gacttgacgc tgctgacagc 13320
gacgttgtgg ggctccacag cttcgttgct agagcgctgg ctgagaatgg cctcaatgcc 13380
gcggatttcc tcctcagcag tgagttcctt aggcagaacg gaggggttct tgaggtggcg 13440
ggcagagtca gcggagagct cgagcatctc aacgtccttg gggttgacgc gagcctgggc 13500
gagagcctcc tccatgcagg ctgccggcat gttgccgggc acgatagcgt ccatgcaggc 13560
gtaaatgcgt tcgtccttgg tgcagtcgct ctcgcgcttg aggacgaggg caccacatcc 13620
ctcaccaaca aagtagccgt cagcgccgga gtcgaagctg gcccgcgggc tctcctgctc 13680
cgagaccttg aaacgacgcg acttcacgta gagattctca gcgctggcgc aaagatccac 13740
accggcgatc actacggcct cgacctcgcc agtctcgagc aagtacttgc ccaactctgc 13800
gcaacggtag acggagttgt tgccctctgt gatggtgaaa gaaggaccct cgaaacccca 13860
ttgtgaagac acgcgggtgg ccacgaggtt gccgatgtag gatgtgtacg aggtagcggt 13920
accgcaatcg ttgatgtagg acatcatatc attgagggct gaagcggctt cgggacgagc 13980
acgctccttg agggcaacgc gggcgcggtg acggtagagc tcaaggtcag tgccaaggcc 14040
gacgaagaca gcgaccttac ctcccttctt gaggccagag ttgagaatgg cacggtcgat 14100
ggttgtgaca gcaagtagct gcatggggcg caacatgtcg tctggcgtca tgggcgtgcg 14160
caggcggcta aagtccacct cgacgtcctc aatgtagcat ccgtggggca cctccttgac 14220
accgcacagg tccaaaaagt ccttgtcttt accaaggaaa cgccagcgct tctcaggcaa 14280
tggcacagca ccatgttggc cattgtagat ggcacgctca aaggcgtcca ggcccttgag 14340
ggagccgaag gtggcatcca taccggtaat agcaatgcgc atgttgccct ccccgccaca 14400
acgtgagctg agggaactga tgctatcgtg ggtggcacag gcagccttgg agcggtcaaa 14460
ctcctcaaag actgcgtggg cgttggtgcc accaaagccg aaagcggaga gaccagcgcg 14520
cttgggctcg ccctcagtgt cgggccatgg gatgggctca gagaccacaa gcgggtccat 14580
ttgggaagat ccatcgacac caggagtggg cgggatcaca ccatgcttca tggcaaggag 14640
taccttgcac atgcctgcga aaccagctgc aacgagtgtg tggccaaagt tacccttgga 14700
gcttccaaag cgaggcacct tgccctcgaa gcaagccttg acggcatcaa tctcaacgcg 14760
gtctccctgg ggagtacccg ttgcgtggca ctcgacgtac tggatcttgt gcgggtgcac 14820
gttgacgcgc ttgtaggtat caatgaggca ggacttctcg ctgggcaagt gcggcttgag 14880
gggaagacca cagccagcat tgctgatggt agcaccgagc agagtaccgt aaatgtggtc 14940
tccatcgcga atagcgtcgt caaggcgctt gagaaccata atggcaccac cttcaccagg 15000
ggtgagaccc tgactgtcct tgtgaagcgg gtacgagatg ccgtctcccg atacaggcat 15060
ggcctggaaa gtggagaatc cggagagaat gaaaaagggc tccgggaagc aagttgcacc 15120
agcgagcatg acatcagcag caccggaaac gaggtggtcc tgggcgaggc gaaggacgta 15180
aagggcggtg gcacaggcag catcgacaga gtagtgaaga ggaccgaggt tgagctcttc 15240
tgctacgaag gatgccgggt ccataaagat gcggcggtca ccagcctcgg ggttctgcga 15300
ctgctcacgc tcggaccact tggaggcatc cttgaagacg cgagcgccga gtttcttttc 15360
gacgtggttt tggtacacat tgaggagttc gccctggagg ttgtccatgg gaaaggacag 15420
gcatccgctc acaataccgc accttgtaga gtcggagacc gatgtctcgg agagagcctt 15480
cttggagagc ttaaggagaa gctcgtgttc gttatcgacg gagtcatcga cgcagccgta 15540
gttctcgttg caaaaggtat ctgcaaattt gctacgctct gctttgaagt gctcggctcg 15600
cttgttggat ccgaggcgtt tatcgctaat cttagtccat gcagcctcac cgcccatgac 15660
tactttccag aactcttcct tgtctttgca gcccgcgtat tgcacggcca tgcccaccac 15720
ggcaatgcgc ttctcgtcgt gcatttcgtg agcagcgctc acattcttgc gagaggccat 15780
ctttttgctt tcttgttgct gcttactgta aacaaaaaaa agagcttgcg tgtcacctga 15840
ccggcacttt tagatcgatc aaaaagcggt cgtgtagatg gtttgctttg gaggagatgt 15900
ataaatgatg tgattgacta ccttgagcaa gtgattacag ggatgccaga gcaatcaaat 15960
aatcaatcag ttaatcaacg ccgtaataaa ggctatcaat caatcaatca atcaatcagc 16020
caactagcta gccgaagctg cgatggactg gcgtttggac agcgcgaagc tgtaggaact 16080
ggcgccgcac gagctgcgag gctgccaagc tagaggctgt ctgcctttgt ctcactcctt 16140
ttccgaggaa ggagagagag agagagagag agagagagag tggggggatg aaagtttgga 16200
tgcacgatgc gtgctttgtg gtttgtttcc ttgtttcttt ctttgcttgt tttttctctc 16260
tttttctttg ttattttgtc tctcttgaag caaatagaaa gaacctcgaa ctagacgctc 16320
caaagggtct tcaagaggtc tcgaaggcta ggctggcgaa agcgcgcacg ctggtcaagc 16380
aagcaagcaa agcaagcagg caagcaagca agcaagcaag caagcaaagc aagcaagggg 16440
tggattccac gaatgcgaga agtcaaaact ctgcttcaaa cagagaacaa atgggcaaac 16500
gaatgaggat aaatgagcaa ctaagtgaag tttacatttt caaaactcaa caaaacgatt 16560
acccaatcaa ctatgagacg cgcagacgtc tgcggcagca tctcttttat gattttcaaa 16620
aacaaaaaca aaaaccaaaa caaaataatt tgcaacaaat taatgaaaag cgaaacaaca 16680
aacagaaaca ttgtttaaac taaaaagtca tttttattga aaatctgttc ttttcatctg 16740
tacgtatgta tgtttgtatg tacacacttt gcttcatcgg tttattcgag tgctcttcat 16800
tcttgaaatt gccttagttc ttgctgttat aactgtcaaa caaacctcgc gaccttgaca 16860
agcagctcca cctcaccttc gggcctgctc gtttgccttt ctcgcttttt tcgcgatctt 16920
ctgccatcct tgcctactct gtccttatct catcaggctg ctgcggcctc ttgacctagc 16980
agttcaagta taattaattt gaaaataaac aaaaaaacac tgccacttat tatgcagatg 17040
gcactctctc agtgttgcaa aagtagagtg aaattctggt ttacaaaaaa tatttattta 17100
ataaacaaat aaaataaata taaattcatg ttatgttaga tcattttatt ttgttttctg 17160
agggcgcgat aaacgcttac ttgagaacca agaaaagcaa gaaaagcaaa ggtgcgaaag 17220
aagcaaacac attgatttcc ctagttccca ccacttcttt ctttctttgt ttgtatattt 17280
gtttgtttct ttctttcctg ctttgttttg tttgttttgt ttgttttgtt tgtttgtctg 17340
tttgtctgtt tatctgtttg ttagtttgtt agttactaga ctgctaattg atttgaaaac 17400
caagccaaac ccacgcaatg aatacgcaga aagcacagct aaaaagaaga agaagaggag 17460
gaattccgaa tcaggcgaga aagtctcgaa agcagtgcac caaaatcctc atttggaatc 17520
aaagccctcc ttcccagcga ctacggaggc ccacgacgac gacgacgccg acgacgccgc 17580
ccgcccgccc atcctcctct ctctccgcct gctcctcgtc ttctccctcc ctccctccct 17640
ccctcgcgca cgccgctccg aatggaatga catgactgac gcaagcgcgc aatggccgcc 17700
gtgcgatggc tcgaagcagc atcgcatcgc attgcattgg cattattcat tgattcattc 17760
attgattcat tcattaattt attcatttta attcattcat tcattcaatc attcatttat 17820
tcattcattc attaatttat tcattttaat tcattcatta atttattcat taatttattc 17880
atttttattc attcatactc ccgagcgcta cccggcgcta ggtgggtgct aggcgtggat 17940
ggagcggacc tctctgccag cagaaagagg aatgaatcta tctggatact gcgcgcagct 18000
tcttgcttgc tttgcttcaa cttgcttgca aacagccagg aggccgaacg gcttcgaccg 18060
ctcagcgtgt tcgccagcaa agaaccacct ccgccctcgc agtcgccgga tggatgaacg 18120
agcgaatgcg aatcctcctc cgatcttgaa cctcgaacct tcaatcaact tgccttaatt 18180
ttactttcat gactctcact attttaaata tacatgtatg tatgtatgta tgtatgtatg 18240
tatgtatgaa tgcacctcat actgataggg acctgcgggg gactgatacc acctgtctga 18300
atcaatttgc gagaccgcga gactgagtgg caggtagtag ctagctaagt agctgcctaa 18360
gagtctatcg gcatgcatga atcaaaaact atcatgtcaa tgttcctttg aggcttcgaa 18420
gtccgtcatt tgtcacgaaa ggttttgggt gaacgatcca ctgtttcgag agagatggtg 18480
tgaatgtata ggtgatagtt gccgagctgg cgagccgtcc caagcggtgc cggcactcac 18540
ccggctgaag cttcttacat gctctccgtt cataatcgtc caaattgatc ctgattcatg 18600
attcatgatt catgattcat gatgacacga gttggagttg gacgataagt cagcgctcgc 18660
tcaaccaaac tacctctgct cgcctagctg ctgttaggta gtgctactga ggcaggaccc 18720
aacttgaagc tacctactgc ctaggtattc ctacgctgtt tcgctgattt gcaatctctt 18780
cgttaccaag agataaaatt aacgagttat gacattgcgt atgcagacta cataataaag 18840
attgtgtcat ttatttataa gtggaaaggt gtaagatcaa gaactaagca ctaggtagca 18900
attaggcgtt atttgttagc gcgtggaaga aaatgcctct ggacagatag ctattaatag 18960
ctattaatag ccggtgttgt atttacaacc ttctgaaaga atttctccat agaggaaagt 19020
aaagaaacat cttattctgt gaaaagagat aaacaacttt ctagaaaatg gatgacagag 19080
caaagaaggt cgatcgtctt caaccgcaga tctgggaatg ctaaggttgg cgccaggctt 19140
acattatgcg tcatgctgac caaagggcgt aaagtgccga tgggcatccg atatatgcgc 19200
gttcaaggtg aggaattcaa gatcatcaag tttgtttgaa tttcgaggtt gaaaacacag 19260
agttttgaca atcgatcaat caatcaatca atcaatcaat caatttaaaa ccaatttaaa 19320
accaaatgaa tgagtgaatg agtgaatgac tgaatcaatt taaactaaat gaatgaatga 19380
atttaaaacc aaatgaatga gtccttagcg atttcaagtt ctgcagtgaa atctacaaat 19440
ctacgacgaa agtagtgaga tcgtatcaac gtgtatagac agacaatgat gctgcggata 19500
cctaagtgct tgcgtggagg gactacgatg cagatcccga gttttaggtc ctagttcctc 19560
cgttctctgg taaaaaagaa agcctctcct tcttgacgcc attcagcgac gtggaacaag 19620
cgagacagag gcacaagttt tggagtcatt gagtcgggtc tgctctgctt tgaggatgaa 19680
ccaacgacct tcggagtctt gcagatagat ggtccattct tcaaacgaca cagagatcgt 19740
cgtctcgcgt aagttggcag tgggtctaga gctagctaaa aacatctgac agagagcaca 19800
tacagagcta aagaggagtg tactcggcaa aatagcgtgg acggatgaca tcatcaatcg 19860
ctcagctttt tcgtttctta ccaaaaaatt gacaaaccag agaaataaat agattgactc 19920
aacaaattaa attaaaacaa taaattaaaa aagatctctt aaagaagttt tctgaaagaa 19980
accaaaaaca ataaactctg cgacaagaac ttgaggccag aagggatgaa gaaggtacgt 20040
atctagatgg tgactgggga cacaaagaag caaggtctga attctcagaa gccagctgca 20100
gccagccagc tactaggagt gtctgccagc tccgtcgtca tgccacgagt gtccctgcca 20160
acgcttcaag cgtacttgca acttttattt gattactaca ttactacatt ataacttcat 20220
ctatagcttt aaaaaggaaa taaaggaaat aataaaataa atcaaataat ggtaaaaagt 20280
tataaataat caacgactaa aaaggaattt tattcgaagg tcctcggcag gaaataagtg 20340
gaatcaaaga gaaggcggga acggtaggga ccatacatga tagtcccaaa ctgaggaact 20400
acgaattgcg gggctaagca aattcatagg atcccagtta gggacagacc ctcgaggtcc 20460
gagttggtat cctgggccaa agcttgcgca agggtgctct agagctacaa ctcaatacca 20520
gtagttgcat ggccatctct gatagctttc ttcatgaata tggggtgagc ttagagacaa 20580
gcagtagaca ctctgtgacc tacgagctat atttgctgtc gcagagcatc tcctcaaaat 20640
aattcatcga agaaagacgg attgaaagtt ttgccttatt tgaacaaagt taatatttta 20700
actctcggta gttaaaccat gatagctcat ttatagcgta ggctgacaca gaagcgtagg 20760
ggcttagacg tcatgatgat tcgtgatgaa ataaatcaag gattctcgaa cgttgacacg 20820
cgcaatggag cgtgccaatg tcaaaagggt attgctgtat catcaacgta ggtaggtagt 20880
caaacgggct acagctctgt cctattcact cactaagaca aaatgttttc tctcaaacgg 20940
ccagctcgaa agtaatattg ggagcaagaa tgaaaatcat tctccagtac acttgcagtg 21000
agatcaagtt tcaagaccat caaacgatac gatacaggag gtactatctt tgctgaagtc 21060
agtagcagca gcattacgag cctggtagat ataaattgat aaaaagacaa gaggtatatc 21120
atatttcaga gtagagtaca tactgagctg gaaacataaa actagtgcac gcaatcgacg 21180
gttcaacttt tctcaagacg cttccagtcg tttcttaatt agctcagatg gtagcaaaag 21240
tgatatgcgc atcagacttt cgtaaacgta aaactcggca tctgtagatg ttgagtcatt 21300
gttttcttca ataatttact tctcgcagca gtgcacttgg aaaggtttgt caagtttgac 21360
ccagctaatg aaacacaaca tcatcaggcg gggctcgaaa agtagatctg aaagtctata 21420
aagaatgaaa gttactctca acacagaaag caatttgtgc aaacataaga gagaatggcg 21480
tctatgctgc aagagaaaat tcgacggtcg catcatagtc gtctacactg ctgtgcatgg 21540
gcaatttata atatcatgtc tgatcacggt ttctgagaac atttaaacga aataagtcaa 21600
aacgaatgcg ctctgtcgcg attatagttt tgttctgaca gtaactccta accaaagggc 21660
caaataagga cgagagaata aaatagattg ctctctcact tcggacccag gaatcccgaa 21720
tttatataat ttcaatgtac tcacgtaaca ctgacaagct atgcggcgtc aataactcat 21780
ccacgttggg agaatctcga aacaacgcaa cgagttattt tatcctgatt aataatctag 21840
cttgaaccgt ttgttgtaac tagaacccaa gctgcaaaga gctacaacca aggtttgatt 21900
tcgttccaag ctaacatgaa actctcaaac ttcgtcgatt tttttaatgt ttgtcaaaaa 21960
cctagtacag cggtcctagg taccgatttg agaagcaggc aacccgctta taaataaaag 22020
aaaaagagtc tttattattt tataaataga aaaaacttta attgggacaa tattctttat 22080
gtgttctctg tcttcttcct tcatgtatga cgtaatgatc atgctccttt catctccttc 22140
cttccaaaaa gttcattttt cctactaggt ctttttcaaa attaaaaata taattaagta 22200
agaaagaaag aaggaaagaa agaaaaacct gggtactaat cagtgtgata tgaggtgaat 22260
ggtggttttg ttttacttct cggaagtgtc gagtcctata aggagcacta tacctatcct 22320
agacgctttt ggtaccaagc cctgcgcggc aggcatacgt cagcaagcta cgatagcagt 22380
acacgctact cagaaaggcc tagtgaggta ggcgagcagg aagtagtgct cttgcgtcat 22440
gcttatgatg gcatcagcca cgcgagaacc tcattcgaat agtccttttg caattcattc 22500
acgcatgcat gcattgatgc ctgctacaga gtagctagtg agagagtatg atacttagtt 22560
agtgctactt atgcgttgtc acctatgcaa tagcattgga tagaaggaat cagattcacc 22620
gctgactctc gctgagagta agggccatac gcagtgctcc tgagttgttt cattaaacgg 22680
acttcaagct gagttctggc taggcacctg gtagctgggg ctagagggta cctacctacc 22740
tacctactga tagctaactt tcaaatgagg aaagattgga gattgaatag aaagaaagtg 22800
atacatactg tcagccgtat cgaaactccg aagtggcacg cggatggcgt cagcaaactg 22860
ccgtagcaag tgaataacgc acatctcaat tgggacgtcc atgaaaacaa aaaacaaaaa 22920
agcaaaaaaa agttgcaatc gatcatgaat cgtgctgatt catgggttgc ttgcttagtt 22980
gttatgctgg agggtgtcga gacttggatc tggtgagcag tgcgctctcc actcaagttg 23040
gaccctttgg tatcagggga gtgcgagtgg gcacactacc atagtatcct aaattacctc 23100
tacgttttga ttgcctttga tcacagcaga taattttcaa tttaaataaa aatcataaaa 23160
agaagaagaa gaagaagaaa gaaagtgaag gtggcgtttc tgatgtcatc attttcgcag 23220
tgcttcccag cgaagattta ctgtgaacta ctacgcatgt gagtatggca agcactgggt 23280
aagtaggtac ctaccactac catgttgtaa aacaaaacaa ggaatatgtt agctagaaca 23340
gagcgaatcc ggtgtgagtg ggagtcatca tcagatattg aaagttgtcc tctcaattaa 23400
tataaatatt tctaactaaa gcaattaaac atatatttat taatttaatt ataaattaaa 23460
taaatatgct gggtgggtcc gagtcattct gactatcatc tatgatgttt aataataaaa 23520
tattgaaagc agtcaaggtt atttggaatt atgggatgat cgtgatctgt gtatcattct 23580
gcatcattgt ggatgctggc ctacgaaact acgacggcat tgcaattgcc acctggcggt 23640
gcgatcgcgt gcactcctgc aattgcgagt gtcttccgcc ggcttcaagt tgaggtgctg 23700
cgacagtgcg ggcccagagc tcctaacatt tcgtggatga ccgactgact cagacagagg 23760
tctctcaagc ttagaaagtg cgctgcaaaa aagggcgcta gctagataag atacgagtga 23820
gtgagtgagt gagtgagtga gtgagtgagg ttctagctag tgctcctccc aaatcttgga 23880
gtgccgatgc tcgagaatac atacatactt caagacacga agaacttgaa cccgaagacg 23940
aatgccgtct tcgacgtcat ctttgccgtc gtcatggccc actgcagcaa cgatccagtg 24000
cgtgcgagca gcagggccag cccacgatca cgcagctcgt cgggctggac ttggctcaat 24060
gaatgaatga atcaatcaat gaaagaatga ctcaatgaat caatgaatca gcaagttgcc 24120
accaaagccc atcgcaacga cgggtcctgc ctgcgtgcgc cattcttagg atccagagca 24180
agcaagatct tcttcaccta tcgctcagca agcgagaacg caacctccct ctgcatcatg 24240
atgcaggata agtaagataa atccatcttg gacctcgagc tcaaatcgac gcttgctgca 24300
tctatctatc tttgtatcta tctatgtatc tatctttgta tctatgtgtc tatctatctc 24360
tctgcgtgcc tcgtcgtgtt tttgaaaagg agtttcgatc gtggcccaat cggaagagaa 24420
ggctctctct ccctctctct ctctctctct ctctctgcat cgcacagacc aatgagcctt 24480
gcggcaacac agcttcaact tcattgcagg atccaatcca tccaaggcat cgcttgggct 24540
ctcagtgaat gaattcgacc aaagctcgtt ggcaggcaga caaggcctgg acaacataaa 24600
gcaagggggc acgaaggcaa gatggcaagg aggcagagca ggcaccagcg actgcgatgc 24660
tggcgagaga agatcaaggc aaagcagagg ctgcaagcaa gctctgcagt agccacctcc 24720
tcagcagatt cgtcaagatc gggcaaactt cgtctgtggc tgccacgcca gagcagagca 24780
tgcctgcttc atgatccatg ctcaagaaag aaagacagac aagacagaca agacagatag 24840
atggatgaca gcgaacttac atttgcagac ttcgaaggtg cctgacgggt attggtgcca 24900
ctaagacgag aaggagcact tgcttccaga tcgctcacgc cgctcacatc accatgctac 24960
gtcttcaata cgcctggtcc ggttcgcaag agccgcgcgc cggcgattgg gcgaaaggcg 25020
gaggagtcga ggtacgcgtt atcagcagaa tgtaggaaca ccgcgacgcg gccgacgacg 25080
ctggtgagga ggaagaaaga cctggcgcct gtacgtacgt acctacgttc tagcagtagc 25140
ttgaagtgga ctgtgggtcc cctccatctt cttcaagacc ttcaagttgc ttgctgacgg 25200
catcgctgtt tgtttgtggc tgttaggtag gtaggtagct agctagctat agctgtgtcc 25260
tagctgcaca gggagcactc agcctctttc ctagtttctt tggttctgtg cttgtttttc 25320
tagcgagtcg tgcaaataac ctgcggcggc cacgagaagt ccgcgttgag gcgatcttgc 25380
gccagtgcgg cagttgccat cactcgtgca gacagagttg agttgcttct caatcgttac 25440
caatcgctcc aagcaggcct agacatagat tttccttctc tggaccatct actaaaatga 25500
tcaagttaga taggtagata gatagataga tagatagcta gggagatact aggcaccttc 25560
tatgccggca cgtctcgaac aaagcgaaga aagagctgtg ggcaagagca ctcattttga 25620
tcgtagatga tcgtagacgc gctgtagagg agagctctta gtggcggcta ctgtgatgga 25680
ctatgagagg ggacttcgca agacctgtct cggtcgcacg tagctgtggg aagcgagaac 25740
ccgcagagga ctgattctga ttagtgcgga taacttggtc gaggaagagc ggggacccgc 25800
agggaacccg catagcagcg acgttggcac ccgacgacgc tagggcaaag acgcagcatg 25860
cgtgcgaggt gcctataagc tgcgcaattc agagaattaa gacagcagcg ctgggaagga 25920
aggaggagat ttgaaggctc ggcgggagct gtcgagatgg aggcaggcag gcaagcaagc 25980
aagcaagcga aagaggcggc cagggctcgc gtcgaagccg ctgatggacg agagaatcgc 26040
acgaagaaga atacggagtg tttgttttca aagccaaaga aagccaaagc caaagccaat 26100
tcgttcgttc gtgagttaac ttattattta atttaattga catcttcatt tactactgtt 26160
gttatctatt atttatttat ttatttattt atttatttat ttatttattt atttatttat 26220
ttatttattt atttatttat ttatttattt atttatttat tgtttatatt tttttaaatt 26280
aaaaaaattc aaaattcaaa attcaaaatt cacgaataaa ttgcacttga aggagatgaa 26340
gcaaagcttt gtttcttcta aaaagagtat aaataataca aagtgatgac ggaaagaagc 26400
atcattctga tggtaagcac ttcggcaaga tgcacgcact agcacttgtc gccttgcttg 26460
cgatccgcgg aggtaatagt ggaggcgaaa gaaggagttc attcctgtta tttcgcgctg 26520
gggttacagc agtgccaaga tttcgaatat ttgaattttt gaatttttga atttttggat 26580
cttcgttccc cttcttcctg aactgttcaa acgactcgga ggttgtcgat cggatcactc 26640
aatctctcaa tctctcactc actcactcac tcactttttc tcagctgcct gatccttcgc 26700
aatgctcgcg aagcgcgagg gatatgcgtg ggcgagcacg caccatcttc tctccacgcg 26760
taaagaagag cagagccaga ggcaggtagg tatctccacc catctcaggc tgtgacttct 26820
ttgtttcttt ctttctttgc ttgttttctg ttctctctct gtgctctgtc cacacgagaa 26880
agagaaagag agagagaaag aaccacgggt ttatagagcg cactcgtcct tcctgcttca 26940
gcagaaagca ctgcgtagga gaactacggg ggaggaggaa gcacgcacgg aggaggcgtg 27000
gaaggaagga ggagacagag agagagagac actgagggac agagggggag aggcagaggg 27060
agaggcatct gatgtttgcg agaaaccaat aagttttgaa agtgatttga tttagctgat 27120
tgactgatct atggcctgaa agaaagcttt taaagcggag ggagatagat gacgagggca 27180
gctgcgatgg cgtacggcgc atccgtctct ctctgtgtct ctctctcttt ctctctcgtc 27240
agggcgtgga gacctcggaa gctgcacgcg gcgcggtgag gaggcagggc agcagaggga 27300
gaggagagat cccagagtcg aagagcattg attgattgca gatgatcttg ggcaacgcgc 27360
gtcagcttga gcgaggaatg ctttggactt caggttcttc gcttctgtgt ttcattcttt 27420
ctcgaagaaa gaaagaatga aagaaagaga gaaagaaaga aagaaagaaa gaaagaaaga 27480
aagaaagaaa gaatgaatga atgaaagaaa gagagaaaga aagaacgaat gaaagaaaga 27540
gagaaagaat caaagagaaa gcgcattcgc agttcttctt cgtgaaagaa aaggaaaaga 27600
gaggcgatgg taggctctga tctcatcatt tctggtttct ctgttgtacc tgtactctgt 27660
gcttgtggcc ttgcgaaggc tgaagacgcc atgcagacaa ccacgcctcc gcagagactt 27720
tgcgggaaag cagagggctt ctcgccactc tcgaagaaac gagctcgcca gttttcgggg 27780
ttgttctcag aattgcgagt gttggcttta tatgggatga tggtatggca cttcgtcatc 27840
gttactctcg ctcgcttgct tacgaagatt ttcaaaaggg cgaaagaagt gctcagcttt 27900
taaaataaag tcacaccaaa gactaggccg catagcagaa agctaaagta aacccaatct 27960
gtctgaagag agtgtcgtgg ttagatactt acgcaagagt ttaaaagctg taaatagtac 28020
aggaacaaaa acaaataaat atatatatat tcttttttat tagtaaaaca tgaaaccaaa 28080
aaactccttt aaaataaaat aaaataaaat aaaataaaat aaaataaaat aaatttacta 28140
ctatatatac atatatatat acaataaata aaaacaactt tttcagacca gaaaaagact 28200
gagaaaaaag gaaactaatg actctcgagc accgagagcg atataagagt ggattatatt 28260
tgctaggccc accacgagtg agtcccctag gaggaagcgc cctctgagac aggagcagag 28320
gcgtcgctgg tgctccaaaa agcgacggcg aatggaaagc aaaacccttt cgagggaggc 28380
ttgtggccgt gactattcaa atctccagca tctcagctcc agcacagcag aagctacctc 28440
gcttctcagc tctagctatc acatcgatcg cagcatctag ctcgtagaca gctagcgccg 28500
caccttcccc caaatcaact tgggcaactt aactcttttt tcaccagaac tcctcttttc 28560
ctttaatctt cgaaaagaag acgaataaaa gagataatcc tctgccgcag cacattctaa 28620
aagaaaagcg gcatactggc gtaggcaaga ctttcaagct cttcctcgcc tccaccccgt 28680
atttccctgt tcatctttgt gaaacgagga aacaagaaat tttataggac aagatggctc 28740
aacgtgagaa ccgtctcgag gccaacatgg atacccgcat cgctgtgatc ggcatgtccg 28800
ccatcctccc ctgcggtacc accgttcgtg agtcttggga ggctatccgc gatggtatcg 28860
actgcctcag tgatctcccc gaggaccgcg tcgatgtgac cgcctacttc gacccggtca 28920
agaccaccaa ggataagatc tactgcaaac gtggtggatt catccctgag tacgacttcg 28980
acgcccgtga gttcggcctc aacatgtttc agatggagga ctccgacgca aaccaaaccg 29040
tcaccctcct caaggtcaag gaggccctcg aggacgctgg catcgaagcc ctcagcaagg 29100
aaaagaagaa cattggatgt gttctcggta tcggtggtgg ccagaagtcc agccacgagt 29160
tctactcccg cttaaactat gttgtcgttg agaaggtcct tcgcaagatg ggcatgcctg 29220
aggaggatgt tcaagctgct gttgagaagt acaaggccaa cttccctgag tggcgccttg 29280
actccttccc cggtttcctc ggcaacgtta ctgccggtcg ctgtaccaac accttcaacc 29340
tcgatggtat gaactgtgtc gtcgatgctg cctgtgctag ttctctcatc gccgttaagg 29400
ttgccattga tgagcttctc cacggagact gtgacatgat gatcactggt gctacctgca 29460
cggataactc catcggtatg tacatggcct tctccaagac cccggtgttc tctaccgacc 29520
ctagcgtccg cgcatacgat gagaagacca agggtatgct tattggcgaa ggctctgcca 29580
tgcttgtgct taaacgttac gccgacgctg ttcgtgatgg tgacgagatt cacgctgtca 29640
ttcgcggctg cgcctcttcc tctgacggta aggcctccgg tatttacacc ccgaccatct 29700
ctggtcaaga ggaggctctt cgccgtgcct acatgcgcgc taacgtcgat cccgccaccg 29760
tcactcttgt tgagggccac ggtaccggta cccccgttgg tgaccgtatt gagctcaccg 29820
ctctccgtaa cctcttcgac agtgcctacg gcaacgagaa ggagaaggtc gctgttggca 29880
gcattaagtc caacatcggt cacctcaagg ctgtcgccgg tcttgccggt atgatcaagg 29940
tcatcatggc cctcaagcat aagactcttc cggccaccat caacgttgat gagcccccta 30000
agctttacga caacactccc atcaccgact catcgctgta cattaacacg atgaaccgtc 30060
cgtggttccc tgctccgggt gtgccccgtc gcgctggtat ctccagtttc ggttttggtg 30120
gtgccaacta ccacgccgtt cttgaggaag ccgagcccga gcaccagaag gcttaccgtc 30180
tcaacaaacg cccccagccg gtgcttctga tggcatcttc aacccaggct cttgcttccc 30240
tctgtgaagc ccagcttaag gaattcgaga aggctatcga ggagaacaag accgtcaaga 30300
acactgctta catcaagtgc gtcgacttct gtgagaagtt caagttccct ggatctatcc 30360
cgagctctaa cgctcgcctc ggttttcttg tcaaggaggc cgatgatgcc accgagaccc 30420
tccgtgccat cgttgcccag ttccaaaagt cagctggcaa ggattcttgg caccttcccc 30480
gccagggtgt gagctttcgt gctcagggca tcaacaccac tggtggtgtc gctgccctct 30540
tctctggcca gggtgctcag tacacccaca tgttcagcga ggtcgccatg aactggcctc 30600
agttccgtga gagcatctct gacatggatc gtgcccaggc taaggttgct ggcgctgaca 30660
aggactacga gcgtgtctcc caagtcctct acccgcgtaa gccttataac tctgagcccg 30720
agcaggacca caagaagatc tccctgacct catactctca gccctctacc ctcgcctgcg 30780
ctcttggtgc ctacgagatc ttcaagcagg ctggtttcaa gcccgacttc gctgccggtc 30840
actctctcgg tgagtttgcg gccctctacg ctgctgactg cgtcaaccgt gacgacctct 30900
ttgagctcgt gtgccgtcgt gcccgcatca tgggtggcaa ggatgcacct gctaccccca 30960
agggatgcat ggctgctgtc attggaccca atgccgagaa gatccagatt cgcactgctg 31020
atgtctggct cggcaactgc aactcccctt cgcagactgt catcaccggc tctgttgagg 31080
gtatcaagaa ggagtccgag cttctccaga gtgagggctt ccgtgttgtc cccctcgcct 31140
gcgagagtgc cttccactca ccgcagatgc aaaacgcctc ctctgccttc aaggatgttc 31200
tctccaaggt tgccttccgt cagcctagcg cccagaccaa gctcttcagc aacgtgtctg 31260
gcgagaccta ctccaacaat gcccaggacc tccttaagga gcacatgacc agcagtgtta 31320
agttcatctc tcaggttcgc aacatgcact ctgctggtgc tcgcatcttt gtcgagtttg 31380
gccccaagca ggtgctctct aagcttgttt ccgagaccct caaggacgat ccttccatta 31440
tcactatctc tgtcaaccct tcctctggca aggatgccga tattcagctt cgcgaggctg 31500
ctgtgcagct cgttgttgct ggagtcaacc ttcagggctt cgacaagtgg gacgcacctg 31560
acgccacccg ccttcagccg attaagaaga agaagactac tcttcgtctc tcggctgcca 31620
cttacgtgtc tgacaagacc aagaaggctc gcgaggctgc catgaacgac ggccgcatgc 31680
tcagctgtgt cagcaaggtc atcgcccccc ctgacgccaa gcccattgtg gacaccaagg 31740
ctcaggagga ggttgctcgt ctccagaagc agcttcagga tgcccaggcc cagatccaga 31800
aggccaaggc cgatgctgct gaggctgaca agaagcttgc cgctgctaag gatgaggcca 31860
agcgtgccgc cgcttctgca cctgtgcaga agcaggttga caccaccatt gttgataagc 31920
accgtgctat cctcaagtct atgcttgctg agcttgactg ctactccact cctggtgctg 31980
tgtccagctc tttccaggca cctgttgctg ctacccctgc tccggtcgct gcgcctgttg 32040
cagctgctcc tgctccggct gtcaacaatg ctctccttgc caaggctgag tctgttgtca 32100
tggaggttct tgccgccaag actggttacg agactgacat gatcgagccc gacatggagc 32160
tcgagactga gctcggcatt gactctatca agcgtgtcga gattctctct gaggtccagg 32220
cccagctcaa cgtcgaggcc aaggatgttg atgctcttag ccgcacccgc accgtcggtg 32280
aggttgtcaa cgccatgaag gctgagatcg ctggcagctc tggtgctgcc gctgctgccc 32340
cggccccggt tgctgctgct cccgctgccc ctgcccctgc tgtcaacagc gctcttcttg 32400
ccaaggctga gactgttgtc atggaggttc ttgccgccaa gactggttac gagactgaca 32460
tgattgagcc cgacatggag ctcgagactg agctcggcat tgactccatc aagcgtgtcg 32520
agattctctc tgaggttcag gcccagctca acgttgaggc caaggatgtt gatgctctta 32580
gccgcacccg caccgttggt gaggttgtca acgccatgaa ggctgagatc gctggcagct 32640
ctggtgctgc cgctgctgcc ccggcccctg ttgctgctgc tccggcgccc gtcgctgccg 32700
ctgcccctgc tgtcagcagc gctctccttg agaaggctga gtctgttgtc atggaggttc 32760
ttgccgccaa gactggttac gagactgaca tgattgaggc cgacatggag ctcgagactg 32820
agctcggcat tgactccatc aagcgtgtcg agattctctc tgaggtccag gcccagctca 32880
acgtcgaggc caaggatgtc gatgctctta gccgcacccg caccgttggt gaggttgtca 32940
acgccatgaa ggctgagatc gctggcagct ctggtgctgc tgccccggcc ccggtcgctg 33000
cggcccctgc tccggtcgct gccgctgccc ctgctgtcaa cagcgctctt cttgagaagg 33060
ctgagactgt tgtcatggag gttcttgccg ccaagactgg ttacgagact gacatgatcg 33120
agcccgacat ggagctcgag actgagctcg gcattgactc tatcaagcgt gtcgagattc 33180
tctctgaggt ccaggcccag ctcaacgttg aggccaagga tgttgatgct cttagccgca 33240
cccgcaccgt tggtgaggtt gtcaacgcca tgaaggctga gatcgctggc agctctggtg 33300
ctgccgctgc tgccccggcc ccggttgctg ctgctcccgc tcccgtcgct gcccctgctg 33360
tcagcagcgc tctccttgag aaggctgagt ctgtcgtcat ggaggttctt gccgccaaga 33420
ctggttacga gactgacatg attgaggccg acatggagct cgagactgag ctcggcattg 33480
actccatcaa gcgtgtcgag attctctctg aggtccaggc ccagctcaac gttgaggcca 33540
aggatgtcga tgctcttagc cgcacccgca ccgttggtga ggttgtcaac gccatgaagg 33600
ctgagatcgc tggcagctct ggtgctgccg ctgctgcccc ggcccctgtt gctgcctctc 33660
ccgctcccgt cgctgccgct gcccctgctg tcagcagcgc tctccttgag aaggccgaat 33720
ctgttgtcat ggaggttctc gccgccaaga ctggttacga gactgacatg attgaggctg 33780
acatggagct cgagactgag ctcggcattg actctatcaa gcgtgtcgag attctctctg 33840
aggtccaggc tatgcttaac gttgaggcca aggatgttga tgctcttagc cgcacccgca 33900
ccgttggtga ggttgtcaac gccatgaagg ctgagatcgc tggcagctct ggtgccgccg 33960
ctgctgcccc ggccccggtt gctgctgctc cggcgcccgt cactgccgct gcccctgctg 34020
tcagcagcgc tctccttgag aaggccgaat ctgttgtcat ggaggttctc gccgccaaga 34080
ctggttacga gactgacatg attgaggccg acatggagct cgagactgag cttggcattg 34140
actccatcaa gcgtgtcgag attctctctg aggtccaggc tatgcttaac gtcgaggcca 34200
aggatgttga tgctcttagc cgcacccgca ccgttggtga ggttgtcaac gccatgaagg 34260
ctgagattgc tagcagctct ggtgctgctg cccctgctcc ggctgctgcc gttgcaccgg 34320
cccctgctgc tgcccctgct gtcagcagcg ctctccttga gaaggccgaa tctgttgtca 34380
tggaggttct cgccgccaag actggttacg agactgacat gattgaggcc gacatggagc 34440
tcgagactga gctcggcatt gactctatca agcgtgtcga gattctctct gaggtccagg 34500
ctatgcttaa cgttgaggcc aaggatgttg atgctcttag ccgcacccgc accgttggtg 34560
aggttgtcaa cgccatgaag gctgagattg ctagcagctc tggtgctgct gcccctgctc 34620
ctgctgctgc cgctgcaccg gcccctgctg ctgcccctgc tgtcagcagc gctcttcttg 34680
agaaggctga gtctgttgtc atggaggttc tcgccgccaa gactggttac gagactgaca 34740
tgattgaggc cgacatggag ctcgagactg agcttggcat tgactccatc aagcgtgtcg 34800
agattctctc tgaggtccag gctatgctta acgttgaggc caaggatgtt gatgctctta 34860
gccgcacccg caccgttggt gaggttgtca acgccatgaa ggctgagatt gctagcagct 34920
ctggtgctgc tgcccctgct cctgctgctg ccgctgcacc ggcccctgct gctgcccctg 34980
ctgtcagcag cgctcttctt gagaaggctg agtctgttgt catggaggtt ctcgccgcca 35040
agactggtta cgagactgac atgattgagg ccgacatgga gctcgagact gagcttggca 35100
ttgactccat caagcgtgtc gagattctct ctgaggtcca ggctatgctt aacgttgagg 35160
ccaaggatgt tgatgctctt agccgcaccc gcaccgttgg tgaggttgtc aacgccatga 35220
aggctgagat cgctggcagc tctggtgctg ctactgcctc tgcccctgct gctgcagctg 35280
ccgcccctgc tatcaagatc tccactgttc acggtgctga ctgcgatgac ctctctgtga 35340
tgtctgctga gcttgtcgac attcgtcgcg ctgatgagct ccttcttgag cgccctgaga 35400
accgcccggt ccttattgtc gatgatggta ccgagctcac ctctgctctg gttcgtgttc 35460
ttggtgctgg tgctgtagtt cttacctttg acggtcttca gttggctcag cgtgctggtg 35520
ctgctgttcg ccatgtccag gtgaaggacc tctccgctga gagtgccgag aaggctatca 35580
aggaggctga gcaacgcttc ggccagcttg gaggcttcat ctctcagcag gctgagcgct 35640
ttgcccctgc tgacattctt ggtttcaccc tcatgtgcgc taagtttgcc aaggcttccc 35700
tctgcacccc tgtgcagggt ggccgtgcct tcttcattgg tgtggcccgt cttgacggtc 35760
gccttggttt cacctcccag ggatctactg actccctcac acgtgcccag cgtggtgcta 35820
tcttcggcct ctgcaagacc attggccttg agtggtctgc taacgaagtg ttcgcccgcg 35880
gtattgatat tgctcgtgag gtccaccctg aagatgctgc cgtcgccatc actcgcgaaa 35940
tgtcctgcgc tgacaaccgt atccgcgagg tcggcattgg cctcaaccag aagcgctgca 36000
ccatccgtgc tgtggacctc aagccgggtg cccccaagat ccagatcagc caggatgacg 36060
ttctccttgt gtctggtggt gctcgtggta ttactcctct ctgcatccgt gagatcaccc 36120
gtcaggtccg cggtggtaag tacattctcc tcggtcgctc caaggtccct gctggtgagc 36180
ctgcttggtg caacggtgtt tctgatgacg atcttggcaa ggctgctatg caggagctga 36240
agcgtgcttt ctccgccggt gagggcccca agcccacccc gatgacccac aagaagctcg 36300
ttggcactat tgctggtgcc cgtgaggttc gttcctcaat tgctaacatt gaggctctcg 36360
gtggcaaggc aatctactcc tcttgtgatg tgaactctgc tgctgatgtc gccaaggctg 36420
ttcgcgaggc tgaggctcag cttggcgccc gtgtaactgg tgtcgtccac gcttctggtg 36480
tccttcgtga ccgcctcatt gagcagaagc gccccgatga gtttgatgct gtcttcggca 36540
ccaaggtgac tggtctcgag aacctctttg gtgccattga catggccaac cttaagcacc 36600
tcgtcctctt cagctctctt gctggtttcc acggcaacat tggtcagtct gactacgcca 36660
tggctaacga ggccctcaac aagatgggtc ttgagctctc tgaccgtgtg tccgtgaagt 36720
ctatttgctt cggcccctgg gatggtggca tggttacccc ccagctcaag aagcagttcc 36780
agtctatggg tgttcagatc atcccccgtg agggtggtgc cgatactgtg gctcgcattg 36840
tcctcggctc ctcccctgct gagatccttg ttggcaactg gaccactccc accaagaagg 36900
ttggcagtga gcccgttgtg atccaccgca agatcagcgc tgcatccaac ccttttctta 36960
aggaccacgt catccagggt cgctgtgtgc tccccatgac cattgctgtg ggctgccttg 37020
ctgagacctg cctgggtcag ttccctggat actccctctg ggctattgag gatgctcaac 37080
tcttcaaggg tgtcaccgtt gacggtgatg tcaactgtga gatcactctc aagccttccc 37140
agggtactgc cggccgcgtt atgattcagg ccaccctgaa gaccttcgct agcggcaagc 37200
ttgttccggc ttaccgtgcc gtgatcgttc tctccactca gggaaagccc cctgctgcta 37260
ctacttccca gaccccctct ctccaggctg atcctgctgc ccgtggcaac ccttacgacg 37320
gcaagaccct cttccacggc cctgccttcc agggtcttaa ggagatcatc tcttgcaaca 37380
agtctcagct tgtcgccgag tgcaccttca ttccgtcttc cgagagcgct ggtgagttcg 37440
cttctgacta cgagtcccac aaccctttcg tcaacgacat tgctttccag gccatgctcg 37500
tctggattcg ccgcaccctc ggccaggctg ccctccccaa ctctatccag cgcattgtgc 37560
agcaccgtgc tcttccccag gacaagccct tctacttgac cctcaagagc aacagcgcga 37620
gtggccactc tcagcacaag acctccgttc agtttcacaa cgagcagggt gacctcttcg 37680
tggacatcca ggcttccgtc acctcttctg actcccttgc cttctaaagt tgtgaggctg 37740
tcttgtcttg tcagtcgcga aagtgtaagc aagaactttg tcatacaaag aagcaaccaa 37800
cttccgaacc aacacacctt gtaggattac aaccacaact ttctataaat agtgcgcaag 37860
aataaccagt aagctatcct tcgtgtacct gttacaacaa cgacattttt acttgatctt 37920
cctacttgtg atgggtagtc ccggcttgta ctgacagtga tgccacagca gagtagatca 37980
ctgtgaataa gtaaataagc ctacttatta tattcccaaa gtactcgctg ggatattatt 38040
agtatcacga aaagtgatat gttttataac tcgcttgtct tgccaagatc taaccttttt 38100
tttttaaatg gccaaaaagt cgccagaaca catcttacaa taaacaaaaa tttagattat 38160
atcgtatgta taatgtataa tatattatat tattatatac atacgatata atctaaagcc 38220
attccagact tattcggtga tgaaaaatgc tttcccagct ttatacaaac tattcaaaaa 38280
gttgcatgac ccattttcag atatatttaa tagtataaga ttatgtccat ttgttttcaa 38340
agttattcaa gagtttacat cttgaagttt catcccttta ctactacact gtttttcgtt 38400
tgggtttttt ctctaacggc gaaagaaaca agtcaccaag cttaactagt aggcatcttt 38460
gtggtgacga aattaaagtt gaatatataa attatagtta gtcattatgg aatctcagtt 38520
tgaacgaagc taagctattt ataaaaatca ctgcatggag ataatacttg aattttgatg 38580
atagtgttta tgaagaagtt taatcttgct ttttattaat gttattctct aatatagaaa 38640
tatttcaata aaaaaatcat atgaagggat aataaataca gagaatgatc gttatcattt 38700
gatatgtcga acgctaatct atcatcttat ctaggaaaca aaggtggaaa taaaggaaag 38760
ccctacacga gttaattcct caaacgaact actttggatt atcaaatcca actgctgaca 38820
ctggatacat gcatgtattt agtgggtgtt actgtacttc cttatttcct ttaattcaat 38880
tgtcttgatt tttacttcgg agattctact tgaaaatcat ctcccttcac ttccggttat 38940
acagaaagac ccttcaattc gaatgctggc caggtacaat aactatcagc gattcccctc 39000
cactagacat gaccgactgt aagcacctca acccgatttc aagcaacaca tgatgactag 39060
ctgtttccgc aaaacaacaa ataagagagg tagtggaaaa cacccagttc gctcgagctc 39120
ccctagtaga ttcgacattc actttctatt tgattgctaa ttgtgggtcc ggctatttaa 39180
ggaaagaact gatgaaagtc cacctcacgc aatcaaatcg cggtctagtt ggaagctaca 39240
atggccgacg tatgcgcgcc tctatctttt aggattgtag aacagggcgg caatctgcta 39300
acataaattt aataccttgc tcaagctgct ttccatactt ttcaatccat ttgtgataat 39360
cttgcaatgg accaatctcc aaatctgtag aagcaataac aaggacatcg cagggtcccg 39420
gttcgtttgc atgctcgtct tctggtgcca caacaatgct gcctgttatt atctcatgag 39480
agtctttata ctgcggatcc gtggctatag cgtgaataaa cgttgtgcgc aagcctatat 39540
cctcgcgatg gagatactgg cctgctacag tttgcgttcg tctgcctacg acaacgcatg 39600
gaacattctt tggtgtgcga gtgggccgta gcgttcgacc ctgggcaagg aagccatgca 39660
gacgtgattc cgagaggcca tctcgcgtgt aagacttatc ccaattttct ggatcctcta 39720
atttccagct agccataagc tcagtcaaca gaccaagcgt tcttgatctt ctttctaggt 39780
caaatacatc ttgatggaag cctgcagtaa tttctttgta agatttggaa acgacgttct 39840
tgaaatgaac acaaactgat attgcattca tgggtgcagg tgacagttgc aaatgaactg 39900
aaatgtctgg agaaaagttg aggaagcgtg gtttataaag cggccaagct gtcctcgcat 39960
gcgcaagacc tagtatatta ctaatgactc tgcgaccaca atcctccatg cgttcaaact 40020
tgctatgcgg aattccacga atgatgttac cttgaggatt tggggctctc caaaggagct 40080
gttgcagttg ctgtacgtat tcgcggtgtt cgcggacctg atctcgaagt cgggcatttt 40140
cctctgagca aggccctaca ggtggaaatc tgcacagcat attgtatgtt ctctctagat 40200
gtactgcccg ttgccgcaaa tgagctacat ccatctccag tttatttact gtgtcttcga 40260
gcgcaaacct ttcacagcgc ctgcgtttgc gttcatttct cgaaatctct cgccgccgct 40320
gcctgattcg ttctgcgcga tcaactcggt catcccctgt gtagcttggt gatgacgtgg 40380
atccatcttg tgaggcgtca aagccagaca ctgcctttac ttctaaatct cgccattcat 40440
ctgcaaaatc cctatatcct tccccataag tgtaatcgtc actacctatc aattctgtag 40500
atgccgcatc tacagtccta attatttgag gatttccttg cattgtaaag caaagatact 40560
cggaggctgg atttgtcaca aaaggtacga cagccctatt gatcaaattg aaggaagggg 40620
attgctttta ccagtacacg atgttactgt tgttgctatt gttgttgttc ccaatttctt 40680
cagacgtagc gtgccgcttc tgacattgcc aatagctgct tgtctttggt cttctttggg 40740
gaatgggcca gtaaaagaaa ccctaggcag ttcgattatc tactaatcta aagaacctgt 40800
ggcccctttc ccctcaaccc acgcccttcg ttgctctctt cggtcggtga agcgtttaga 40860
tgcgaggttt cctccactac gtgcttcttc aatgctaaac gcccaagtca actgaggaca 40920
ctgaaagcct gcacggagca gaagacccac acagacggtc gcaggatcaa ccctacctac 40980
gcctcgttgc cacgatggtc gctgccgatc ctcgatctct cgtcgattat tggtctcctg 41040
ttgcgctctt ggccacgcgg ccactcagac tctgcttctg tggcttctca ctgacgtgat 41100
gtagaaagaa atagaaagca cagagccact ttaaaaggaa aaggggaaag cagagaggaa 41160
agggaaaaag aagacctcag attgactcag agattgactc aatcgacgag agaatggaag 41220
ggaatggacg ccacggagac agaggcgcag cgagacggag cgagacggag gtaggcagag 41280
gcagaggcag aggtggaggc gaggggccgg gttgtcggca ctggcagagg gagagagaga 41340
gaaggagagg cggaccagtt tgaaaactct cgccagcttc gatagccgta ctcggtatgt 41400
atgtatgtat gtatgtatgt atgtatgtat gtatgtatgt atgtatgtat gcactcttct 41460
acttgtttcc aatgtgctgt tctatgcttt acagtgtttt ccgcgctcgc tacttgctac 41520
tttcatcagt ctgtctgcct gaggcggcgg tgatgcagaa tgcacctagg tacctatttg 41580
tcgccaactt tggatttgcg tggcggcagg attcctcttc tcctgcactt tgtttcgact 41640
cgccttagaa gggttgttgg aagacgccta aacgggtatt gcccggagat aggtgctgct 41700
ggtagctcat gtagatagtt cgttaggtag ttacactgga acagacagac gctctgtgtt 41760
tcgtggtgtt gcaggtcatg gactcagagg ggctgcgtga gttttgtgtt cgagagcaga 41820
gtgttgatat tcttttatgg gcaggacaca ttgcaacttg aagtaccgtg gttgtaacta 41880
caggacctcc atctgaagcg cggcatcacg tgaaaaagaa atgaaatgaa gagggaaagg 41940
acacccaaag gttcataatg tttggtttgc aaaggttatt cgaaagacac cttcttcgtg 42000
gtagatggtg attctgtcga aactgccgag attttgctga gagtgaacca aagcagggtt 42060
ttgagataga agaatcaatc gtgcatggac aacctattcg taggattgtt atagctgttg 42120
tttgttatag gtcaaacttt atagcttcaa cccctcgctg gcaagtacga agggaaagtg 42180
taaatataca ttcttggttt aacgcataat ctcaagagct tccatgctga aaagttagat 42240
agtatattct tctgatttta catatttaaa ccaagtaaac aagttccacc aagggactta 42300
cttggcaact taaccatggt catcataatt tgcgcatcac ttagatcact acgttaacat 42360
tcgttcttga tctcttcgag cgcctaaata agcaaactgg cagcgaatta ggtcaccata 42420
tttttccaag gaggaaaaac tgtattgtgc tacccgttgt ggtgtaaaac ttgtaattct 42480
tcgcatctct aattcctatc gttaaacttg tcatcttact ttctggaagg aagcttggta 42540
tctcagaaaa tcgaactttg caataatacg aaagcacaag taagggttta tggcagcata 42600
acattgtctt aagaaattga atttaaaagc agaccgaatg caccgcagaa tacattgtaa 42660
attggtgcca aatattatga gtagcaatca tcaatctaac gcacgatttt ttgaagaagt 42720
acaatacaaa tttccccgtc gtagagaatc aaatggtttt acacatctat ttcaacactt 42780
ttcttggatt gtgatttcat atcaagacaa ggcttaaatg atcttggctt tctctgcaag 42840
agcggttctc caaatttcct ctcctgtttc tggattcatg tcaaaacata gtttaacaat 42900
agaaagaagg tgaccaggta ggtacgcaat aatagtttcc gcaatgaatt ggggcttgta 42960
gcgtgcagag aaatgcatga gatatagggc ctggcagttg tccaatgcac ctcgttttgc 43020
aaacctcgcg agctcttcaa tgtggatgtg gccacgctct ctagcaaagg agatatcgcc 43080
atcaaaaaat gtaagctcca tgcaaagtgt ggcagcctga agaaatagag cctcagggat 43140
atccagggcg tctataattg tgtcacctgt atatgcaaat tcaatcgttt ctctgtacac 43200
gaaatcctca ggcataggag gtgacttttg aaagcgcttt cgtttctctt ctggactgag 43260
gttggcaagc tctgacctaa gctcttttcg tttcgtcttc accgcatagc ctacagaggg 43320
aactctatgc atcgtcttgc acaccacaac acttgcatct cctcctagat cc 43372
<210> 2
<211> 39976
<212> DNA
<213> Ulkenia sp.
<220>
<221> misc_feature
<222> (32086)
<223> n steht f? irgendeine Base
<220>
<221> misc_feature
<222> (32086)
<223> n steht f? irgendeine Base
<220>
<221> misc_feature
<222> (32084)
<223> n steht f? irgendeine Base
<400> 2
tcaagaattc gcggccgcaa ttaaccctca ctaaagggat ctgatgaact tggagcaaga 60
ataagaaatc catccattca agtcagcaca cccgatggca tcatcaatct tcgtcaactc 120
tttgtgcagg cagattggtg cttcgggcaa tcaatcggtt gacggattga ttgatcaatc 180
gctttgcttg cttgcttgct tgcttgcttg caattgatcg gcaaaagagg ccatccatcg 240
tagagcgtgc aatcttcaat gctctagcta gaggcgccat caggtagtta gttagctagc 300
tcgttagtta gttgctcttc ctgaaactaa caatgtatga catcagcatc atcgttcttt 360
cttctttatc catccaggat ccttcttttc aattcgtttg ttttgttttg tcttgttttg 420
tctttttctt tcaatgcaag catctcttaa ttcaacaaac caaacgaacc aagagatgaa 480
actcaaaaaa cgttttaaaa taaacaaaca attaaaatca aatagaaaat gaaattgaaa 540
gcacttttgt tttcgcctct ctagagagct agctatagct acctactatt cgttctcgct 600
cttcgtcgtc gggactgctg catcctgtca ttatcgggcc ctaagagtgc cctagtctta 660
gaaattgatg gcgataagat ggcggtcttt cttatccttc ttctcgttgc tgctgctgtg 720
ctctttgcct ctcggatcct tttgtttaca gctggccagt cagtcagaca gtcagttaat 780
cgattaacag gcaagcaagc aagcaagcaa gcacgcaagt cagccagctg gatagacagt 840
tagatagatc gtggcgtcgt cgttggcttc gtcgctgttt tggtgcttga ggattcgaag 900
tgcacgaggt tccttctacc tacagctctt cctttcactc ttcacctatt attatgcgct 960
gcaagttctt ttcgaaaggc tttttcttct ttcattctct ttcttttggc ctttgcgtta 1020
cagagcggag acgcctagtt ttatagatct aaataaacaa gagggaggac aacagaggcg 1080
gaaaacaagc aagttcaaga cggcaagaaa gcagcgcctt tgtttctttg tttcttttgt 1140
ttcttttcaa aagagccctt cctcggaaag ctttctttct ctcttgagcc aacttgaatt 1200
cgaatctgat cttcaaagcg agttagttcc tcaggcgcca ggcacctctc tccctccctc 1260
cctccctcta tcgcaggcag gccagcgtga cacctgtgac agcaggcagc tcaggcgtgc 1320
atgcaacgaa ggcgttgact catgcattgg cgctcactca ctcactcact cactcactca 1380
ctcgcgtacg tacgcacgca cactcacgca ctcacgcact caatcactca atcactcact 1440
cactcactca ctcactcacg ccagcattct cgaggagagg ccatgcgtag gtgaggtacg 1500
aaggaaagga gtccatagtt tggaggcgat gatggcgaat tgcagagcat aacagtgcag 1560
agggagaaac ttacatccat tcatacgtag ggaggcgcat acttacgtaa ctaagtgcaa 1620
tcggtggatc aagaaagaag gaatgaaaga atgaatgaag gaatgaatga aagaaagaaa 1680
gaaagaataa atgaataaat gaatgaatga atgaatgaat gaatgaatag ataaatgaat 1740
gaaagaaaga gccccgctta tttggtatcg atctcattgc aaatgttcct gaaagttgct 1800
tatttgcctc acaactatga gtaggtagtg atgataataa tagtaattgc tattgctatt 1860
acttgaattt gaatttgaat ttgaattcag gtagacaata aaataagatt agcaaaacat 1920
tttgagagga agcagaggat atgcagtgca aaaggaggtc ccgagtttcg atcttctttg 1980
cacctgctac gtatctagtg cacgtagagc aagaaagaat gaaagaaaga acgaaagaaa 2040
gaaagagaga gagagagaga gagagagaga gaaagcgaag atgatagcgg agagaactct 2100
tcttcgcagt cactctgttt ctcagtcagt cccgcaacca ataacaactc gaactcgcag 2160
cagtgttctt cggagtgcca gcgctcgctc gcactgcgtc ggcacagcag cagcagcagc 2220
aggccccgcg ctcgctgcac tcagcccggg caggagcaac agctgctgag cagctgaggc 2280
cagctggctg gcggctcgcc tcgcctcgcc tcgcgtcgcg tcgcgagaga aagcgatcga 2340
ccaactgtca atcgattatt cgagtccttc gagcgcttta tagggcactg attgatcact 2400
cattgattca ttgactcatt tattctttgc gtggtcagcc aaacggcgtt agcattgggc 2460
aaagcgggtc tttgctttgc tctaaaatag atttgctcgc gagagtacgt acttgcagga 2520
gtaggtaggc tctgcctagt acctgggcat ttgaatattt gaacttcgaa cttcgttgag 2580
tatctgaata tttgaatatc tgaatatttg aatttcgaaa gtttgaatat ttgaatattt 2640
gaattttgga atattggaat agctgggttt ggagataaga cttactaagc taagcgccga 2700
cgtaagagcg gcgagtaaat ccacacacaa gagagaggca gagagagagg gagggagaca 2760
actcgcgcag gcaagctgag cccactggac gcacggggcg cgtcccccct gacgggcgct 2820
ctggtggtgg cgtgtttggg agggttttgc atgcttgtga taggggctct ggcgcgggct 2880
ctgtacggtg cttggagatg cacgggcagg gcgagagagg ggacgggttc ccgggaggcg 2940
ctgcttggag gtgctgagag ggagggagaa ggcgtgcttt gcgatgcgcg gggcgaccta 3000
ggcgctgctg cgcggtgcag cagcagggac ctcggacgtg agtcgaagcc gtctgcagag 3060
gagatggtag aagggccgcg gattggtagc agagaagagg aaatagaaga agaagaagaa 3120
atagaagaag aagaaataga agaagaagaa atagaagaag aagaggagga cgggcaggcg 3180
ggaaagatgg agaaaggact cgcggcggga aaacaagaga atgtgaactt gggcttgaac 3240
tttggtttga atttgaatgt ggagaacgag gggttgaatt tgagtttgaa tttgaaagaa 3300
aacttacgga aagaaagttt agttgaaagt gagaaagaaa aaaatgagaa agaaaaagag 3360
aaagaaaaag agaaagaaaa agagaaagaa aaagagaaag aaaaagagaa agaaaaagag 3420
aaagaaaaag agaaagaaaa agagaaagaa aaagagaaag aaaaagagaa agaaaaagag 3480
aaagaaaaag aagaagaaaa agaagaagaa aaagagaaag aaaaagagaa agaaaaagag 3540
aaagaaaaag aagaaggaga tttaaaaagt tgtttagttg aaaaaggaga aggaggaaga 3600
agcagcgaca gcggcagaag aagaagtagt tgttgtaaga ggggaacgga ggcagtagca 3660
gtggagcagg cggaggcgac agcaaacctc gaactcgacc ccgtcgagcc gcagcaagaa 3720
caagagcccg accaggtgga cgaggacgag gtccgcttgt tgtcaggaac aacagaagtt 3780
gcaggactag ccgagagtgc taccactgca attcttagat ccacagacgc aagagcagaa 3840
aacttacaac tgctcgccac aacacaagaa ccaccttcag atacaaccag gttcgagaac 3900
tccacaagtc tagaagcagc aacagctcta gcagataatc aaacaggtcc agaaaaagct 3960
acgactagaa gagaaattat cgagtcgcaa cttgcaacca tggccactcg cgtgaagacc 4020
aacaagaaac catgctggga gatgaccaag gaggagctca ccagcggcaa gaacgtcgtt 4080
ttcgactatg acgagctcct tgagttcgcc gagggtgaca tcagcaaggt cttcggcccc 4140
gaattcagcc agatcgacca gtacaagcgt cgcgttcgtc tccccgcccg cgagtacctc 4200
ctcgtcaccc gcgtcaccct catggacgcc gaggtcaaca actaccgcgt cggtgcccgc 4260
atggtcactg agtacgacct ccccgtcaac ggtgagctct ctgagggtgg tgactgcccc 4320
tgggccgtgc tcgtcgagag tggtcagtgt gatctcatgc tcatctccta catgggtatt 4380
gacttccaga acaagagcga ccgcgtctac cgtctgctca acaccaccct caccttctac 4440
ggtgttgccc aggagggcga gaccctggag tacgacatcc gcgtgaccgg cttcgccaag 4500
cgtctcgacg gtgacatctc catgttcttc ttcgagtacg actgctacgt caacggccgt 4560
ctcctcatcg agatgcgcga cggctgtgcc ggtttcttca ccaacgagga gctcgccgcc 4620
ggcaagggtg tcgtctttac ccgcgctgat ctcctcgccc gcgagaagac caagaagcag 4680
gacatcaccc cgtacgccat tgccccgcgt cttaacaaga ccgttctcaa cgagactgag 4740
atgcagtccc tcgtggacaa gaactggacc aaggttttcg gccccgagaa cggcatggac 4800
cagatcaact acaaactctg cgcccgtaag atgctcatga ttgaccgcgt caccaagatt 4860
gactacaccg gtggccccta cggccttggt cttctcgttg gtgagaagat cctcgagcgc 4920
gaccactggt actttccgtg ccacttcgtc ggagaccagg tcatggctgg atccctcgtg 4980
tctgacggct gcagccagct cctcaagatg tacatgctct ggctcggcct ccaccttaag 5040
accggtccct tcgacttccg ccccgtcaac ggccacccca acaaggtccg ctgccgtggc 5100
cagatctccc cgcacaaggg taagctcgta tacgtcatgg agatcaagga gatgggctac 5160
gacgaggctg gtgacccgta cgccatcgcc gatgtcaaca ttctcgacat tgacttcgag 5220
aagggccaga ctttcgacct tgccaacctc cacgagtacg gcaagggcga cctcaacaag 5280
aagatcgtcg tcgacttcaa gggtattgcc ctcaagctcc agaagcgctc tggccctgcc 5340
gttgtcgctc ccgagaagcc cctcgctctc aacaaggacc tttgcgcccc ggctgttgag 5400
gccatccctg agcacatcct caagggcgat gctcttgccc ctaaccagat gacctggcac 5460
ccgatgtcca agatcgctgg caaccccacg ccctcgttct ctccctcggc ctaccctccc 5520
cgtcccatca ccttcacccc gttccccggc aacaagaacg acaacaacca cgtgcccggc 5580
gagatgccgc tctcgtggta caacatggct gagttcatgg ccggcaaggt cagcctctgc 5640
ctcggccctg agttcgccaa gttcgatgac tccaacacca gccgcagccc tgcatgggac 5700
cttgctcttg tgactcgtgt ggtctccgtt tctgacatgg agtgggtcca gtggaagaac 5760
gtggactgca acccgtccaa gggaaccatg gttggcgagt tcgactgccc catcgacgcc 5820
tggttcttcc agggatcttg taacgacggc cacatgccgt actccatcct catggagatc 5880
gccctccaga cctctggtgt cctcacctct gtgctcaagg ccccgctcac catggagaag 5940
aaggacattc tcttccgcaa ccttgacgcc aacgccgaga tggttcgctc tgatattgac 6000
ctccgcggca agaccatcca caacctcacc aagtgtaccg gctacagcat gctcggagac 6060
atgggtgtcc accgcttcag cttcgagctc tctgttgatg gtgtagtctt ctacaagggt 6120
accacctcct tcggctggtt cgtccctgag gtcttcatct cccagactgg tctcgacaac 6180
ggtcgccgca cccagccctg gcacattgag tccaaggtgc cttccgccca ggtcctcacc 6240
tacgacgtta cccccaacgg tgccggtcgc acccagctct acgccaacgc ccccaagggc 6300
gctcagctca ctcgccgctg gaaccagtgc cagtaccttg acaccatcga ccttgtggtc 6360
gccggtggct ccgccggtct tggctacggt catggccgca agcaggtgaa ccccaaggac 6420
tggttcttct cgtgccactt ctggttcgac tccgtcatgc ccggctcgct cggtgtggag 6480
tctatgttcc agctcgtcga gtccatcgct gtcaagcagg acctcgccgg caagtacggc 6540
atcaccaacc cgaccttcgc tcatgctccg ggcaagatct cctggaagta ccgtggtcag 6600
ctcaccccca cctccaagtt catggactcc gaggcccaca ttgtctccat cgaggcccac 6660
gacggcgtcg tcgacatcgt tgccaatggt aacctctggg ctgatggcct ccgcgtctac 6720
aacgtcagca acatccgtgt gcgcattgtt gctggcgccg cccctgctgc tgctgctgct 6780
gctgctgctg ttgctgctcc ggctgccgcc cctgctccgg ttgctgcatc tggccctgcc 6840
cagaccatca ccctcaagca gctcaaggct gagcttcttg acgttgagaa gcctctctac 6900
atctcctcca gcaacggcca ggtcaagaag cacgccgatg tggctggtgg ccaggccacc 6960
attgtgcagg cttgcagcct cagtgacctc ggtgatgaag gcttcatgaa gacctacggt 7020
gttgtggctc ctctctacac cggtgccatg gccaagggta ttgcctctgc tgaccttgtg 7080
attgccactg gtaagcgcaa gatcctcggt tccttcggtg ctggcggtct ccccatgcac 7140
attgtccgtg ccgctgttga gaagatccag gctgagctcc cgaacggccc cttcgccgtc 7200
aacctcatcc actccccctt cgatagcaac cttgagaagg gcaacgttga cctcttcctc 7260
gagaagggcg ttactgtcgt cgaggcctcc gccttcatga ccttgacccc gcaagtcgtc 7320
cgctaccgtg ctgctggtct ttcccgtaac gctgatggct ccattaacat caagaaccgc 7380
atcatcggta aggtctcccg taccgagctc gctgagatgt tcatccgccc tgccccgcag 7440
aacctcctcg acaagctcat ccagtctggt gagattacca aggagcaggc tgagcttgcc 7500
aagctcgtcc ccgtcgccga cgacatcgcc gtcgaggccg actctggtgg ccacaccgac 7560
aaccgcccca tccacgtcat cctccccctt atcatcaacc tccgcaaccg cctccacaag 7620
gagtgcggct accccgctca cctccgcgtg cgcgttggag ctggtggtgg tgttggatgc 7680
ccccaggccg ctgccgctgc tctcgctatg ggtgctgcct tccttgttac cggcactgtc 7740
aaccaggtcg ccaagcagtc cggcacctgc gacaatgtcc gcaagcagct ctgcatggcc 7800
acctactctg acgtctgcat ggctcccgct gctgacatgt tcgaggaggg cgtcaagctc 7860
caggtcctca agaagggaac catgttcccg tccagggcta acaagctcta cgagctcttc 7920
tgcaagtacg actccttcga gtccatgcct gccacagagc tcgagcgtgt tgagaagcgc 7980
atcttccagt gccctcttgc tgatgtctgg gctgagacct ccgacttcta catcaaccgc 8040
ctccacaacc cggagaagat cacccgtgcc gagcgtgacc ccaagctcaa gatgtctctc 8100
tgcttccgct ggtaccttgg tcttgcctct cgctgggcca acaccggtga ggctggacgc 8160
gtcatggact accaggtctg gtgtggccct gccattggag ccttcaacga cttcatcaag 8220
ggctcctacc ttgacccggc cgtctctggt gagtacccgg acgtcgtgca gatcaacttg 8280
cagatccttc gcggtgcctg ctacctccgc cgtctcaatg tcatccgcaa cgacccgcgt 8340
gtcagcattg aggtcgagga tgctgagttc gtctacgagc ccaccaacgc cctctaagcg 8400
agttatatct gtctagaaaa cttggcatgg ctagcaattt atgtctagct attccataca 8460
cacggtaatg ccagtagcct gttagttata gctcttttgg ttgttgtctc acaatacact 8520
gacatcagca gaacaaaatg aaaggggcct tggctaccat gaaatcaata cttcaaaagg 8580
tctcttggtt tctttactcg catgtcgcta tttacttaca ttcctcgagt acataacata 8640
tcatacatca aagaaattaa aaagaaaaca aacattcaaa tatgcattac tttccctact 8700
gtactagtaa gtacgtttct ggtattaagt tgttttttct caaaagaaca atgtgcttac 8760
ttgtaaaatc cacagctgct tacttgtaag cctcaactag ttagtgatgt gattatcata 8820
aaatgttcga cactgtacct cctttccagc tatcttccta cacctcctct gacgcaggtt 8880
gacggaggag gcgtgggggt tgattgaagt gcaacacaac gttttgttta agatattcct 8940
tgccttggcc gactccaaat ggatagcaca gaagcctaat gataatttga attaatttta 9000
tttcgagctt atttaatgct cttatcagag tccgtaggta tctcttttcc tactaattgt 9060
tgaaaaagga tgttttggac atagcaggtc atcatactat ttggttccat caaattcata 9120
tccatttctt tcgttcaagt gcttcccttc ctacttatta tatatattat atatccataa 9180
atgtaaaaga gacgattacg aatactttgc atacatgtat agcgaaacag agatggtagc 9240
aaaagttcac cttcactaat ctaagaatct ctccacgtgg gtaaaaactt cagcagtaag 9300
attgtaaatg atgtccaaga acaaaacgtc atgctagtcc aggggttact gagctaacga 9360
ttaataatgt ttcgtagtct tcctaattgc accatcaaaa cttgtctgca caagttttaa 9420
agtattggag cctttactga agaatcagag gacatagatg gggcacgttc gccttgaaaa 9480
aaatagtctt ctttacctgc atggtgttac aaacaaaaac gagttgaaaa tagctgtgca 9540
aggaggcaaa catgattgga aaagaaaaac gaggggaccc ttatacagga gggcgccaca 9600
tagtagaatg agtagattgt tagagtaggg tacgctttat gtgattgatt gaatgggcga 9660
gtgaaagttg ctgtcaaggt tctaaacaaa aggatgtttg agtttgtgag tattgtttgc 9720
ggcaaaaaga ttcagtagag agaaatgcac aaaaagataa tacgtgtgta gggcgattat 9780
ggaggcatgc atttggggga aatcatcgca tgcgcatgag tttctccatc tgccgaatct 9840
ttgcaaaggc attttcaagc tccatttgca tagcgtaggc ttgctgctca aactgagcgc 9900
gctgatgcgc cagattttct tcatgtcttt tgttcaaact acgctcaaga ccctcaagag 9960
ccgcaacctt gagcttgcgt tccttttgct gaatctccat aactcttcgt ttcacctgga 10020
gctcaatttc tgcagcatcc gtggtctttg cagcggcctg tgcgtcttgt gcggcctgtg 10080
cgttgtttgc gagctccttt cgcagctcct ccatctccgc gttctttttc tcctccatcc 10140
atttggcacc gagtttggca gcttgatcga tgcggccctt gagaacttct tcgttctcct 10200
caagttctgc gatacgcgcg tgtaagccga ggatctcctc cgagacagcc tcgccattga 10260
tcattatttc acttcccgag tcttgaatga caacatcagc cttggtgcca ggttcaccgg 10320
tatctcgctc gcaaccctgc tggcgcatag acagcataag gcgcgcatta tcctcacgca 10380
gatcatccac ctgttctgat aaaagtttga ctgcctgctc aagattacgg gggttcactt 10440
cgtgaaaaat ttcttgaagg tctcgaagct cagaaagctt ggcagagcaa gtgtgcatcg 10500
ctctgcactt tttaagacgt gcaagtgcat catcaagttt ggcattattt accttcatgg 10560
aggcttcagc tacttcggct tcttcgatta caattttctg cagctctaca acatcatggc 10620
caattaactt gcgatgcagc tcggcaatca ccccatgcat cttttcggta tggcctggac 10680
gcgcctcatc ctgcgttctt cggatctcct cctctagttc tcgatttaga cgaagggctg 10740
gtccaagggg cgggtaatta gcctgagtca agccaagctc tgttgctagt ccaaggcagt 10800
cggaaagtcg cagccggtcc ctatcagaaa cagccttttg caagtctacg ctcaaacgca 10860
cttcttgagc cttgcgcacc atcttcggtt ctgcctgtcg cagaagtttc gagtcgtagc 10920
cagcttgcca cgctagcacg atggcacgcg caagtgacct cagttgaccg ctgttcatgg 10980
cagacttgag caacattttg atttgcacaa atacctcatc tgattcatca tcttcagctt 11040
cctcaagctc tgcaggtgtc ttgcgctctc cagagacttg aagagcaggg ttcaaaccgc 11100
cctccaggac ctcgctcgca agcgcctcct ctgtctcagc tttgcgcaat agcgcagcag 11160
cattctccgc cattgtgttt gtcactcacg agattaatat cgttgccaga gtatacggta 11220
atgcgagtta aggattcaca gaatctctca aattaatctt ttcacctaat gatatccaca 11280
aaacgttgca atcgctcagc ccaacgacaa gcgtgcttct tgttttaaga ctgcaactgc 11340
tcctttttct attagtcaat atggaccgtc ctccaaacgt ccagaaaata gcacagaatt 11400
taccagcagc cgctgcagac aagaagtgca agagagcagg caagcaagtg agggtttgag 11460
caaataggcc aacctctcca cgcagaattc tagggtcgca accggaactc acagtcctta 11520
gaaaccgtgc gaagccctgg gctcaacttc aatttgtcca cgggaccttc agcaagcacc 11580
aagctcagca gcgtgaaggc aggcgctgac cacagtttga gctcagaggg cttggtgtgc 11640
ctcgcgattg atattgaagt caattgcgca ggacggcagc aacggaccag gtggtgaaga 11700
aggtaatctc cagcggagtg atgatggagc tcgaccgact actccggaat cgaccagggg 11760
aggtgcgggc gcccttcaca agcgggcgag aggcagggga gagaaggctc gactccacgt 11820
cttgaagcgt gtacgtgtgc gcgctcacgc gtgcgacacg ccggcaaggg cgccttagtg 11880
gcctgctgct gctgctggtc gccacgctgc gagcccaaga gatttgaatt gaactcgaag 11940
aaaataacta tcatttatca attccaatca atcaatgcat tatgaagcac ctctgaagtg 12000
aactattctc ctctccaata tacaacaaaa aacacacaca gtgggtttta ccctataacc 12060
tattgttccg cgagcgatca actactctat agagcgaatg accagttttt ctttctttct 12120
ttctttcttt ctttctttct ttctttcttt ctttctttct ttctttcttt ctttctgttt 12180
tcctatctaa taaccccttt aatcgaggaa acctttcgat ttaaaaggaa agctctgtct 12240
gtatatatct gttacagata ctgctatcat gccatgcaga aagaaacaca aaagaaaaac 12300
aaaagaaaga gagaaagaga gaaagaaaga gagaaagaaa gaaagaaaga aagaaagaag 12360
agcttttctc aatcggtttc ctcatcgacc gctcacatat ctacgattgt ggcaaagaaa 12420
gaaagaaaga aagaaggaaa gcctcagcag agtccgcacg aaagccttca ttgagccacc 12480
atgtcgtggt ccgctgcagt cagtgccgcc tctctgtgaa ttgagtgagt gagtgagtga 12540
gtgagttggt tggttagtta gttagtgcct cttcagctca aagcctttca cggtcgctct 12600
tcgagcgttt gctttttcat aaacaaataa acaaaccatc gaacgaacca tcgaacgaac 12660
gaacaatggt accccagaat agacggaatt aattgctaag taaaccagta acagtaagtt 12720
agtgtttctg acctgagccg ttttctttat ttattcctct cagctctgtg aagagaattt 12780
gggatgaaaa gaaacgtttt tatttattta aaagtttagt aacaagaaaa acatggtccc 12840
tcttcttcct tcatgtaaaa ataagtaagt aaaaaaaaga aaagaaaaaa aaaaaagctt 12900
ttaaagtagt aaagcgaggt agagataaaa gttctttctc agggctccta gtaggcactt 12960
aggaggtacg tctaagaccg cctcgtggga agaaaagaga aaacaagaag agaaaagaga 13020
gagagaaaca gcgctgaccc gagaggctca tgcgcagagc ccaaatctgc ccaactttgg 13080
caaaatgcag cgccgcctct gcggcggaga cggtcatgtg aatccgcaga gctgcacgca 13140
cgcgtcacag gctacagctg gatatttttt atacgagccc gcgcgagacc gcggcggaga 13200
aacggggtcc cgcgcgaagg gcctctgaaa agcaggcagc gaaccaggcc tgcaccagcg 13260
ccgacctccg cgagacttcc ttcgatctca ggaaggacct tctgaagagt ggctcaaagc 13320
agcgcaggcg gaggcagcgg cggagggcac gcccagcgag ggcatcggct cgaggctcca 13380
gggctgccag gtcgcgaggc atgcacggcc tcggttcgtg atcttggccc tgccgggtgt 13440
gccgggatcc aatatggtgc gcaccgtttt tgaagctgtc gctcttttct cgcgtcgcac 13500
attacgatgc gcagaactga gtgagtggac aaacgaagag ggcgatcgat ggcttggaat 13560
gcgaactccg tccatcgaca tcgacatcga tcaacccatc gacccatcca ctccgtgcac 13620
aagctgcact ccgtgcacaa gctggagacg agcgaccgaa gaggtgacga ttcgctctcg 13680
ctcgggatgc ttggatgatt ggatgattgg gtgcacgagc tgccacttgt tgttcttgtg 13740
ttgttcttgt tgctgttctt cttcttcttg gcggtcgttg agcgaatgcg ctgtttgtcg 13800
agaaccatga aatgagcgtc ttgaatatgg gtggcctcgg gaatccgcag aacgatggta 13860
tcgcattcgc atccctggtt gcaagaaggc ttgcgatgag gtaagcacat gccgactcgc 13920
cgatcgacca gcgcgggcct ctgtgccgaa ggagcgacag cttggacgca ggggaatggg 13980
gcctcgaagt tcttgtggtc actcaggaca gaaactcttg ttttaatttt tctagttgct 14040
tagctcaagt tagttagcca gttggctagt ttgcttttaa ttaaaaatga agaaaactaa 14100
aattgagttc tcaagtctga aagaacaagc aaacaaaagc gaaggatgtg ctgtgcatgc 14160
acgagcttcg gctcaggcag aggaagattg ccagctcgca tgaccttgga tcttccatac 14220
tgcgtaatgc tgagcgtcag agaaagatgc gggccaggtg ccggaagata taccttcatg 14280
gactttccgc agaggtgaag atcagcgatg atcatgtgga agtgacacga cgcacctcga 14340
gcatcccagg aattgcagtg tttgcccagg caggcagtga gtgcctggtc aattatggaa 14400
tagtcaatct agtaatatga gtgagtggaa ggcagaaaat aatttccatt ccttcattcc 14460
atgactagct gcatcaacat catgatgttg cttcagctcg tcagcagggt gaacaacgtg 14520
cgggctagaa gaattagaaa agaacaatga gtgtctatga atgcatgaga atcgagtgta 14580
atgcaataca gaaacgtgag aaattgcagg attgattaga aagtattagt agggcaagaa 14640
cagagagatt agagaagtga aaagggatga cggtgaaacc agtgtagtcg tagtaaagag 14700
tggcttgcaa ataggtgcac cgcatccatc aattggtcaa cgagcaaatt agtgcagcca 14760
gcgtactagc tatttactgc gacgatgtaa cgaagtcctc caaggacgcg tacacggtgg 14820
ccggcaagtc ttcattggcc ttgagcttgt ccaagataat gcggggaaac tggattgcct 14880
ggtcaatcac agccttacgg gcctgctcat cgttgacaag tgtagggtcg cgctcgaggt 14940
ggtcggtgaa gcgagagtgc aaatcaagag ctacagcaat gatgcgctca gccttgagct 15000
catcacggtc ccactcaact tcctgttgca taagcgaacg gatgcgcttt acaaggcagt 15060
ctcggacctt gggcaggtca ttaataccat cgtaataaat actcatgacc ttccacatgt 15120
tgggctcact cttacactta gaacgcattt tgtcaaagag ttcctccaat tgtgcgcgca 15180
aactagacac aagttcaggg ctcatctcct tacgaggggc ctccggcgtg tgcgggtctt 15240
cactagaggt ctgagaatcc gacttacgga cagagaccaa ggcatctaca actacaagga 15300
gactctgtaa gtcaaccatc tcagcaatgg cctcacgagt gccagcgcgc acatccacaa 15360
ggttgctcat accatcaatg gccatacccc actgatgagt gttgacagcc aacataatgt 15420
agttctccca aatacgccag ttggaacgtg actgacgaac ggcctcaata acagccttca 15480
aggcagcagg gtagtcgttg agctgaatca aaatcgaact caagttagcc caagcatcac 15540
cgctatccgg atcctggcga gtcacatgcg caaaggctgt acgggccaag gtccactgtt 15600
caaggcgcat agcgcatgag ccaaggcgga accacgactc cgggtacaac gggttgatct 15660
taagggcatc ctgaagatgg tcaatactct cctgcaaatc accgcggtca aatgccatga 15720
gggccaattc acgcttagca cgcgcatggc gcttgccaga aaactcccac gccttgctga 15780
accagtcctc atcctgaagc aaagagccca aaacgcacat aaggtgtgca gtgggctcaa 15840
ccgcaaggcg ctcacggatc aacttctcag cacgagcgcg cttgtccatg atcacaaggc 15900
agtccacagc ctcttcccaa agacgaacct cctcaaagat ctgcaacgca ctaccagcag 15960
cgcccacttc atagtacaat tgcgcgaggc cgcgcttgag ctcccaaact gcgggccagg 16020
atagcgcgtg cagaaaggcg agacgctcgg tcactggggc agcgttgtcc acgtcacgct 16080
ggcgaggctg tgttggtgtg agccggtcag tctgctggtc aacgaggact tgcatctgta 16140
aaatggcacg ctccttggtc ttgttgcgct caaactcaag ctgagactta attagaagtg 16200
cagtggagta taccatccag ttttcaggac tctggaggac acgctccacg taagcaagca 16260
tctcctctgc agtgagagcc tccatagcgt agctgttctt cacatccata cacaagccga 16320
gcacgataca ttggtccaaa aggctcaacg ttccgcggcg catgcgagcc tcctcatcac 16380
tggtctcctt tgcgtactgg atctcctcgt gaagtggagt ctcagcgtca acttcttcga 16440
ggcgcacctt ggggataccg agaacagaag tggatcctac aatctcgcca ccgttctctt 16500
cctcagttgc atcattatca tcctctgcca caatctcgtt gggtacctcg gtctcgatgg 16560
ccttgactgc aggggaatta gaattcttag gagcttcagt gtcttgctcg cgattttctg 16620
ggtctttggt ggtagctgac gatgcaagaa gaataagctg ggtcttctct tgtttctgaa 16680
acttggtacg ctttcccatt acaccagtca tctgaatatt cagctgtgct gtttcctttg 16740
cctttgcaaa agcacgctta gctccatctg cagccttgaa cttgtgccgt gctacaccgc 16800
attcaaccca cacgagggac tccagaagtt tatcatctgg gtacatgatc tgcactgctc 16860
ggaccgtgcg agcaaatcct ctctctgcct cctgctcgag gctaggagcc gctgcctcct 16920
tggcttcaag ggtctcctgg tgaacaacag cagatcgtgc tgcccaccag ctaggtgtca 16980
gcaaatgccg aagagctccg cggactgtgt tggagatctg ttcatgaggg ttagcgctca 17040
tgccgccgtt ggtctttccg ggaagatcgt cctcgccatc ttcttcatct tcatcagcgc 17100
caatgacagc cccgttctcg tccacaaggc gagcctcagc gagcatatca gttgggtcca 17160
cagcaccagg ggttactgtc ggattagcga cgacgcgaag aatgacgcga gcaacaagca 17220
agaagtgaag gtacttggca tcacggtaaa cctcctcacc gttcgcgcca agcataagag 17280
ctgttcttgc gtggagctcc ggatacgcgt ccaactgtga ctcgtggaag ctggcatcct 17340
tgccagacac agcggcagca gccttagcat cggaggcgag agcggagaga gcagtctcca 17400
catacctgag acccggctca gaggcagact tggagctagt atctacagaa ttcttggggg 17460
cattggggtt gtcgtagacg ccgcgaacaa aagggagggg gtagaacttg tcaataccat 17520
ggctagagac aggaggacct gtccagttgg cctggacgaa gatgtgaaga caagcaacac 17580
ctgcgaacat gcaggccata gctcgcagag cacgttggtt tacgtggtcc tcaggactcg 17640
agacactcgg gtccttgaag gagctgcgac ctgtgcaggg gccaatacca gtctcgatgt 17700
gctcaacaac acgctcatga agaaaacgac cttcacggtt tttaacgcga cttacagagt 17760
atcccttctt aagatcttct gcggcaaaga ggccttgcgc cgcaggagag gcgaggacct 17820
cgaagaaatc gccttgggca agagcgcacg ccatacgaag gacctcaagt tggagctctt 17880
tcacttcagg ctcgtcacgg agctcctcgc ggtcatctgc agcagccgat gcagccacaa 17940
gaacatcttc gagaccctcg gcattgctct cgagagcgag acgctcgaca agtcgcagcg 18000
agtaaagagg tttcgcaact ccagagccct ttttggagtt gaagacacca gcgccgttaa 18060
gatcaccatc gtcagcctcg tcgatctcat catcaggacc gtcagggagc tcaaagtcct 18120
gtggaaggcc taggaactcg ttcaggtctg cgtcagagct cgaatccgac gcgtagtccg 18180
ccatcctggc ctacaggacc gccgaaacag gttgcggcag ccgcccaaag tctaagctgc 18240
aagagtcaac cctcaatcgc gagcttgcgg cacaacgtcg ccgcaggatc tcgcgccaag 18300
acgtctccaa atgcaagtct ggtgctcaag tcatcctggc cacccgcgcc tttgcccctt 18360
aagctaggtc acctacctta aaccagagtt gccccgcggt gtcatattgt aaacatttta 18420
taacaatata cgtcatatta aaaacctaga tgtggggaca atgttataaa taagtaacaa 18480
atatagacta catcgagaag aaagaattct tcggcactcc gtgtgagttt gggcgaaact 18540
gcaatcacga agccatgcaa agtcttcgta tatctgagtg gagcctcgct ggagagaaga 18600
ccccatgtga atgggtgtag aacgacgaat ctacgcagcg ttgtctccgt tgagacgctc 18660
tgtccagata tgaggtccct cactattctc gtatttgatc atgccaagca tctccagttc 18720
caacaatgga gttttctatt gaaagaacat agacatgttt ggaacggttc ctttcagagg 18780
ggaaaaacta atcaaaaatc aattgaggaa tgcagggggg ttatttgctg cagttttagc 18840
aataaaataa aaatcctttg ttgatgtgat ttcattcgtt cctttgacat tcaatcattg 18900
aattgctctt caccggagct tttcaaggtg cccaactgcg atctccgctg cggctgctcg 18960
cggccgggct ctgagctcta tctccgtgtg ggaggcggga agccagcagg tgcggcgacc 19020
ctctccaaat agaggccgcg gcgaccttga ggcactcgcg tggcgggcgg attggcgatt 19080
ctgtgttcaa ccgagatatt tcatacatat tatttgctaa ttattagcaa atagaaataa 19140
atatacagac tttgcaagct cagtagagaa agtgaagatc caaaatgtcg gcctcttcct 19200
cgcaatctac ttcggagcag cgcaagtcac gcgtggcgta cttttacaaa cctgagattg 19260
gcagctacta ctatgggtaa gttagtatgg gaaaattggc gacagaaaaa tataataaaa 19320
aaagcaactg tatcgccacc gtttattcac ggtagttaga aggtatttgc ttcctgcgca 19380
cactcgatct gcaggatgta catgtcttga gtggcattgt ccaacgatcg ttctgtttgg 19440
cggaacattg cttttaaaca aaaacgagat agtgaatata ttctacccaa ctaccaccat 19500
ccggtttaag gagacaaata aatctgtctt tcgacccagg ataaggaggc ttgcatggga 19560
atcttttata atctagtctt tatgtcaaat tttcgcaggt tccagcctac catctctcat 19620
gctatttgtg attgcacaag atgatatgaa agtaaagaaa caaggcaaag gatataagat 19680
gcataaggat gtgcagaaaa ctaactagaa acattcatgt gatgaaacct tcctcttgaa 19740
aactcacctc ggtttgtttt ggatcttggt ttgtctttgc tcactttttt tcattattta 19800
cagcccgtcc catccgatga agcctcaccg cctgaaactg actcacaacc tgcttcttac 19860
atacggactc ttccgacaca tggaagttct gcgcccgcac gacgcgactg cggaagacat 19920
ggagcgtttc cactcgcacg aatatgttga ctttctaaag cgcatttctc ccgacaccga 19980
gcaagagttc gagaagcaaa tgacccgttt caacgttggt ccctattctg attgccctat 20040
ttttgacggc ttatacaatt ttatgtctag ctgctccggc gcatcgttgg atgccgcaat 20100
taagatcaac cacggacagg ccgatgtttg tgtcaactgg tctggtggtc ttcaccacgc 20160
aaagaagggt gaagcttctg gtttttgcta catcaacgat attgttctct gtattgttga 20220
gctcctcaag tatcaccctc gtgtactcta tgtggatatc gacattcacc atggtgacgg 20280
agttgaggaa gcgttttaca caaccaatcg tgtgatgacc tgctcttttc acaagtatgg 20340
tgacttcttt cccggtagtg gtgcctacac agataccggc gctcgcgctg gtaagaacta 20400
cgccgtaaac tttccgctca aggatggtct tgacgatgcc agctttgaga gcatcttcaa 20460
gcctgttctt gatggcatca tgaagcactt tcagcccggt gctgtggtga tgtgctgtgg 20520
tgctgattcc atctctggtg atcgccttgg gtgctggaac atgtcattgc gaggccatgg 20580
ctacgctgta cagtacgtga aatcctttgg cgtacctgtt gtgcttcttg gtggtggagg 20640
ttacaccccg cgtaacgtgg ctcgctgctg ggcttacgaa accggcattg cactcggcaa 20700
gcatgaggat atgcagaatg atattccatg gaacaactac cacaactact ttggccctaa 20760
ccatcttctt cacattactc ctgacccgca gatgaagaac gccaattcac gcacctacat 20820
ggacaagtac accaacatta ttctcgagaa cctttcgaag cttgaagcgg tgcccagtgt 20880
acagttccaa gatcgcccta acgactttgc aaacccagat gagcgtgctc gtattgctct 20940
tgacaacgct gaccctgatg aaaaggatta cattcaacgt cctcagcacg aggccgaata 21000
ttacgaagac gagaaacacc aagactcgga ccgtcccaat ccggctgatg gtggtgccga 21060
ctcaaaggta aagtctgaaa aatcctcagg cgatggagct gcggacgaag cggagaccgg 21120
atccagaaag ccttacaaaa agggcactga atgcggtggt ctacttgaaa ttgacgaggc 21180
tgtcatggaa gtggactcca atgaagcgcc caaggagact gctcctgctt cagattctgc 21240
tatcaagact gaggatgctc ctgctgctga gtctgctgcc tccccctcgg atgccaaggc 21300
ctaaacatga agactttgtt ttaatgcaat agacgtgctc ttttgctgct cgagtagcgg 21360
caaccctagt gccatgtcct ccttttttct tactcacttc tctctctacc tttgaaagag 21420
accaagtgga accaagcagc catttctgtg ttccacattg caatagatta tcttttaaca 21480
attctcatac atacatattt tcttcatttt tcttttctat gtatttttaa aataaaatat 21540
aacaacaaag tagtagtttg tatgaatttc ggccatgcag gtgacaaaag gtgaaagtaa 21600
tgagcgtcat tttggatcac attaccagcg aatccactca acgactcttc tcttctcgag 21660
ctttagaagc tgactgtgag ataatagaac agagcacggt ccatcaatca aaatacataa 21720
ttagctcgca atagcttcgc ctcacagtga tcgtttcacc tcatgatacc cttgttgggc 21780
gctcgctctt aggctctccc ttgttgttat atgatgcaac gatcatctaa gtgctgtccg 21840
cagtcatcaa gacatcctat tctgtagcaa gcaagcaagc aagcaagcta gctagtttag 21900
ctggctagct agtttagctg gctgagttcg cagtgaataa acaattaaca cctcaagtct 21960
tgaaggagca ggaaacttgg ctcctatgat atgccatcct ggaaggccat gttttggggg 22020
gtatgagaga caggtctttc cttttctact ctggttcggt ggatgacgag acaacaacca 22080
gacgtcccgc ctagtacctg ggtggtcgat ctgtcctccg ttcactccga gtgcagggct 22140
tgtgggacga ctcgctctgt tgaattgagg tccttcacgc gagcctatct gggcatcgat 22200
cgacctcatc catcaacaca cacacatatg ttcaatccgc gccaccctcg ctgactccca 22260
gactgcccag cgaaactttg aaaacttccc catctcgaaa cagcactccc aaaagacgca 22320
cacaagcaac gcttgagcct aggcaggctc tccgctggac gcacaaacca cctcgcagcc 22380
atccactctc tgactcccca agcatgcatg gccttctccc tcgatttggc gcttcgcgtt 22440
gctgtcttcg aagtcctcaa acacgaactt ttcactaatc atcctcgacc tcagcaggat 22500
gccccccctc ctaagctctg tttgctatgt atttattaga ggaaggacgg caagctgggg 22560
gtctgcggaa cgcattttgg gggtttgaaa attttcgaat tttcaaactc cccgaaacgg 22620
ccatggtttc ttccgagaag cggtagttag gtggggaaat gagagcacgg cggagttggc 22680
gagaagcata aatctgggcg ggcaagcaaa ccccaaacta tcctgcaatc aacaaaacac 22740
acgcactccg caatcaactt gcaccgtaag tctttggaat tgattatggt atctgcttcg 22800
ccgtcttcaa ctttaacttt gcgcctcgca acgagacttt gttttgtaat gtgcctttag 22860
atttgacgaa acatctttaa gcgagatagt acagcagcgc gttggtacca agagagatag 22920
atcctgggac cttttgaaat aaataaactg tgtgatgaac ggtcgactaa ctgggcttgt 22980
aattgatata ttgatgatac tcttggtcca catgggagtg agcacagtcc acaaacaact 23040
tgctaaccca cacaaaaacc tcccaaactt gcagacccgt tctgcattct tgtaaacaca 23100
taatcacaca gcacacataa tcacaatgac ctacggcaca gcacacaact acgtgcagga 23160
gcagattgag ttggacgaat gcttcaacaa ctttggcgaa gaagtgagca gctctgttga 23220
gcctcggtgg cagcgcaagg ccttggccgc tcgcactccc aagtctagcc gcaagcgtag 23280
ccgcaccggc aagaccccga gcaagggcaa gtctacgccc cagcacgacc gattcatccc 23340
caaccgtggc gccatggacc tcgctaacgc tcacttcaac ctcatgaagg agaacagcag 23400
ctccgcctct aaccagtgcg agtcccctac tcgtgctgaa ttcaacaagg ctttggcgtc 23460
cagcatgggt gcgggtgagt cccgtgtttt ggccttcaag aagaaggctc cggcaccgcc 23520
tgagggatat gaaaactccc tcaaggtttt gtacacgcag aacaaggaga agatggcgcg 23580
cactcagaag cccgttcgtc acattccttc ggcaccggag cgtatcctcg acgcacccga 23640
cctcttggac gactactacc tcaaccttgt cgactggggc gcctccaaca tgctcgccgt 23700
ggcccttggc cagacggtgt acttgtggaa cgccgagacc ggcggcattg aggagctctg 23760
ccagtgtgat gccgaggatg actacatcac ctcggttaag tttgttcagg agggcggtgg 23820
ctacttggct gtgggcacga acttcagcga gaccaagctc tttgatgtgg agacctgcaa 23880
gcttctccgc aacatggacg gtcacagctc tcgcgtgtcc tcgctctcgt ggaaccagca 23940
catcctttcc agtggcagcc gcgactcgac tattgtgcac cacgacgttc gcgtggccag 24000
ccacaaggtc ggtgttcttg agggtcacgt gcaggaggtc tgtgggcttt catggtcccc 24060
ggatggccag accttggcct ccggaggcaa cgacaacctg ctgtgcctct gggacgctcg 24120
ttactctggc gacggtcgct cccagcagac cgtgcagacc ccgcgtctta agatcgctga 24180
ccacctcgct gctgtgaagg ctcttgcctg gtgcccgcac cagcgcaatg tccttgccag 24240
cggaggtggt actgccgatc gcacgattaa gatctggaac gctgccaatg gcgcctgcct 24300
caacagcgtc gacactggat cccaggtgtg ctccctcctc tggaacccac acgagaagga 24360
gcttctgtct tctcacggct tcagtgagaa ccagctcagt ctctggaagt tcccttccat 24420
ggctcgtgtc aaggatcttc gcagccactc cgctcgcgtt ctccacttgg cgatgtctcc 24480
ggacggaacc actgtctgct ccgctgctgc tgacgagacc cttcgattct ggaaggtctt 24540
cgaggcagct aacccggtca agcgcaacaa gcgcgccgct ggagctgcca ctgcctctca 24600
cggtggcctc gcccgcatga gcatccggta agtttccccc cttcccttgt ccggttaatt 24660
cactttcgac tactgtctta cacagaagca aagcatggtt atgcaagcaa acttgctggc 24720
atgctctctt ttgtctcttc agtagcgaga ggccgtggtc aaggggctca tgcgggagct 24780
ccaatgtaat ctaccaccac ccggcctctc atgtatacat atatatatat ctatttatat 24840
gctgatcatg atgcaaaaaa atcccacgcc gtcatactaa agcgcgtcag tgtttacaat 24900
actgttggcg tatagttcgg tagtgaaaat taaaatcctt cagggtttgt acctatagct 24960
tttggtgatg aatgtgatct actactactg acgtgacaga agcaacaatt cttgtgaatc 25020
tgacttcttt tttgtgtatt ctatttcgca tgactgcctg attgtatgat atgggtctga 25080
tttggtcgac tgtactctat tttgcatgcc atgtaacttt ttgttcgatt atactatgaa 25140
tctgtggcaa cttttgctga gaagaaggga tggcagacag tttgattttc ttgatcaatg 25200
tgtttcgctg tcccgctgtg ttgaaagaat gcagtaaatg acccgagtat cggactggag 25260
tgcgtatgtt tcacgctgcc ttatgaatcc ccaggggttc gcagcagcac tttccctcgt 25320
ctgtctctgt gtttgctgtt tgttcgctcg taaatgtgtt ttgcctgtat catatgcatg 25380
taggatagaa agttattacg cagtgtgtat tatagattta tggaagatca ggtggactcg 25440
tatatgctga ctggtgggta tgcttcacgg gatactcgca ttaagttcaa attcgaggca 25500
atggttgctg ctgaagtcgc tgacgaagga gagctcattg ttcttgtcgc caatttgtaa 25560
gtaggtggca cctgattcct ctttcctctg ggaagagatg cagcgctctt gggatcagtt 25620
tctctctcaa tcacgcttgc cgagcagttt ttagtagcaa gcaataggtc tttaatgact 25680
tctagaacta gatgagcagg tatttgcatc atgcaaggct ggcatgtttg gtggctttgc 25740
aatttctctg tcttgaactt agctggatag atagcgagag agtgaagttg gtacaaacat 25800
aaccgacagc atgtagccgc tgccttcgct cgcagctcta gcgctcgcct gcagagacgg 25860
aagagtgtat aattgcccag tgtcaacttt tgggtggtgg gtctgactca caatcaatgg 25920
taccgttcag gtatctttcg gtagattatg acactggcca cttttctgaa gtgatttgag 25980
atttggtatc gatgatgaag agtgagagaa ttttgaaaga aatacctcat taacttccaa 26040
tagtcagtat cttgatgaaa aacgctgacc tgaaagctgc gcgtgttttg ttgacacggt 26100
ccttttattt tgttttttga tgatctattg gtacttatac ctgcgatttt tcttttgcaa 26160
gctaaggcac attcgacttt gtctagaagg aaagtgatca tcacgcttcg gcacacatct 26220
gttttcctca gttaagtttt cttcttggtt caggtatggt attacatgca ggaagaaagg 26280
ggatgcgggg acagccgtat agatgccacc aactttaaca tggtttgtgt tttggggaaa 26340
caaggaaaga gagcatacgc tatgagctac ttaaactagt gacacaagaa gcaacttatc 26400
ataccggaga tcacaatgga gtgattaggt tctatcagat agtagaagca gagtatgcga 26460
cctgcggtgg ctacgtacat gggtgaaaat aatagaacac ctcgcgtagc gtcgaaaacc 26520
gcctcgtaga ctctgtgtca ggtatgaacc acccactttt tttgtcctct ttatctccac 26580
actatttcct tcatggagac aaactcattc tcgaaagaca aacaatcaaa tcaatccatt 26640
accctcatgt tctcatgatg ggtatgttat acatatatgt ctcagacata tgtttatcct 26700
ttttaaaaca catacttaat aggcacttag cactgttact gctatagaaa actcatccat 26760
tcaagaggag ggagagaaca gagttggcaa aatcttggaa gggcaaagtt tatagcaagt 26820
aagtagtagc acagagagag tattatgtat gtgttcatct agcaaaatct aaatagaaga 26880
gccgatcgac tcagtcagtt gtaattagga ctagtcgtta atcatgacat ggctcataaa 26940
caactagtca gtttcttgat ttacttggca ctcaggaaca aagtatgttg ccatccctgg 27000
gcaatagatt tgatcccgtg cgttgagata aagcttgcca aggtcgggtc atgtaactgc 27060
agaggcactg ggcgtagatt ccagtcccag acataaggaa cagcaagatc ctcaccaacc 27120
acgcaaatgc cctcagttcc aattgtaact tcaagctgag gagtcttgtg ctcggcggaa 27180
agctcgaaag gggtaaaaac aggtacaggg tcaaggactg tgcgagctgt ggccttgtat 27240
ttgttggtgg acttccaaaa tccctcctcc atgaatggtt caatctgctt ggtcacagcc 27300
tcggagcttg aagtttcctt gtcggacatg agaccccact ggtaaagctt gcagccgtgg 27360
ccctgagaat ctttaactaa agcgacataa ctctgcgggc ctgcccaaat gtcaagcacc 27420
gggcccgcct cagggccgaa ctcgacctct cttggtgaaa cctggtcctc gtagttgctt 27480
ttggcctcgt ccaactggcg agattgcatc ttgccccaca caaagacacg accatccttg 27540
agcaaggctg cgctgctgtt catgccagca gcaaccttga tggccggtcc aggtagatct 27600
ctgacctctt gcatgacgaa gaagtcgtca ataccgcgca gaccgattcc gagttgaccg 27660
cgctgtccct tgccccacgc gaagactttg ccgctcactt tcgtagccac aactccgtgt 27720
ctgaatccca acgcaacgct ggccacggca tcatcatctt caggaagacc aattgtagtc 27780
cttgggtccc agaagtacga gtctgtggtt cccgtcgcgc actgtccata gacattctcg 27840
ccaaatacaa aaagcgtgtc cgtttccttc gtaatgaaag ctgtcacacc ggcaccacac 27900
acaacttccc gaataggttc tgtcgagtaa ccctcaaact ttgtctcaag accccttttc 27960
cgtgagtcct gctcaatatc gtcctcacca aggtccttgt acaccttgta gctcaatacc 28020
tcctttggct caatcgcatc cacagatgtc tccacaccca tcatccgcat gacatactgc 28080
atcaccatgc ttgatttcgc atagcgaccc agacgcacag taagccgggt atcgtgggtg 28140
cgaccaaaga gataaacgcg accttgggcg tcaagaacgg cgctgtggcc aaagcccgct 28200
gcaagtttta caggctgggc ctgcttggtg tcgaggtcgc cgtggatctg tgtagggctg 28260
tcagcgttat cgagactacc tgtaccgagg gcaccgttga taccaatgcc tcgagcccat 28320
acgccgcgaa gggcggttcc ggcagaagag ctaagcatcc gcttggcacc tgttagggag 28380
cccagcgccg tcatggtggt ggtctgtatg tcaatgtatc tgtagaaagg cagccagcta 28440
actaaccagc tgtactgtga accacagaag aggcttttgc aaaagatgct cgagagcaaa 28500
atggatgatc ggtggagatg cggagaagcg cacagcacga tccgagtccg aacttgattg 28560
aactcaagtt cggagtttgc aatttttcta caactaggta taccttcgta gtatcacgta 28620
gtaggtggta gtactagtag tcctttgaat tgcggcaggg aatttacgac agcaactctg 28680
gtaaattaat ttaggacgcc tcttttgtac taaagtcctt ctctttagaa cggaaagaac 28740
atatgatatt gagacatcat gaggacatgg gaaagggttg tgcatctttg gaactgtatt 28800
gcccagtatg gctggacttc accttggact tattcataga atgaccacag ctattcctgg 28860
ggtagatgga ggtctgacaa tgctcgagct aaccctgccc atccatgatc aagacgcacc 28920
caagcactat ggccgcaagt ttcagttcat ggagagcaga gctgctcaaa tttagcttct 28980
gcggtcgatt ggtcttggca caaccgctct taagagtcat ctacgacagg ctaccatcca 29040
ctcaagataa aaatggactc acagatagat agatagatag atagatagat agatagatgg 29100
caggcgacca atcgcagcgc actctcgctc tcaagatatg cccgcccatc gaaacacggc 29160
cttctcatgc ggcctgtttc gtctcaagct cgagcaggcg tcggcccatg ctccagcgca 29220
acgggcccgc aactttcagt ttcgagcttg gtcttgcttt tgagtttgct tttgcttttg 29280
agtttgagtt tgagtttgag tttgagttca aaattcaaat tcttcaaatt caaattcttc 29340
gaattcaaac tcaaattgga gaatccatct tttcaaaaac tcaattcacg ctctcgaaga 29400
agttcaaact ccgcagtcgc atccagctga ggcacgcact ccccatcgca tcgccggcgc 29460
tctctcctcg ctcctgccgc gtctaagcgt gctcgcgtct ctgtcctgct gctgcttgct 29520
tgccagtatc tccacttctc gcgagcagaa ggaggacgag cagaagaaga aggaaggatc 29580
aagaatcatc aagaaggaac actctctttg tttctgtggt tcgtcattag tttgttgtag 29640
cttgaaggag aaggagaaga cggagaagat ggagaagaag ggaatgaaca gcagtggcgt 29700
ttatctgtct ctagctagct aggtacctta cctaccaggt agagttagga ggagaggata 29760
gccgagacta aggaagcaag ccgtagtttt attttactat gtctgttgtt ctttctctcg 29820
actaccttct ctcgctaccc ccgtgggaag gaggtctctt gtgtcgagtc tgatccacgt 29880
ggacgcctcg aggatcttcc ctcgcacccc gggcccggtc gctgccggtg caaaacctcc 29940
tcagtggcct tgctcgcgct gtgtgctttc gttcctgcgt ctggaacgtc agatagcaga 30000
taaagagata taagatagtt agttgacgga agcagtcaaa gcaaacctcg aacggattga 30060
agcgaagcga ggacgctctc gcctctttgc tgactgctcc gcctattgct gctctggccc 30120
tcactctgag atattactat gtctgaacct gccgcagccg caccgccggc cgagcccaaa 30180
tcgtcgtggg cggatgaagt cgataatgac acggagggag acgctgtggc cgctctgagc 30240
gaacatgcgg ctaagttgga cctcgacgtc cacggagctc cagacctgca cagcggtgct 30300
cttgtagtac gcgaggccgg gtgccccgtg gacgagccca agacgcaggc agtgacaagt 30360
ttctcagccc ttgcgattga tgacgacctc aagaagtcta tcgcgaacgt caagggctgg 30420
agcactatgt ctaagatcca gcaaattgga cttccgcttg tgatcagcga ccctccacga 30480
aaccttatcg ggcaggctca agccggcacg ggtaagaccg gtacctttgt catctctatg 30540
cttgcaagga tctctgcaga taagaagccc agcacgcctc aggccattat cttggctgta 30600
actcaggagc tgtgcacgca gattgcacag gaggtcaacg cactgggatc cgacaagggc 30660
attaaagcac gcagagttat gtctgctagg tccaaaaatg gacccctcgc ggaagggagc 30720
gcggcggcgc cgtgggcact tagtgaaggt gaagactttg atgagcaggt cgttgtggga 30780
acacctggaa tggtcaagaa ctacctcaaa aatgccatgg gacgcaagaa gcgcaagccc 30840
atgatcgatc cgtctgagtg ccgcgttctt attcttgatg aagctgacaa gatggtgcag 30900
cagccacctc acggatttgg acaggacgtt caggagattc gcgacattat tctcaagaag 30960
cgcaaggaca agccgtgcca aattttgctc ttttcggcca ccttcaccga aaatgtacga 31020
cagattgccc gccagttcgt tggtggacat gacatggacg agtccaagta ccacgagatc 31080
acgctgcgca aggaggatgt cactctcgac aaagtcgtca acttcgttgt ctatattgga 31140
gacgagaatg agcgcaacga agaggaaatc tataagaaga agtttgaggc cattaatgag 31200
atctgggaga acctctctca gctcagcgag gggcagtccg ttatcttttg caatcgtaaa 31260
gatcgtgtac aacgcctcgc ggattatctt cgcgggctaa acttcccggt cggtcagatc 31320
catggtgaca tggataaggc cgagcgtgac attgtgctca gtgagttcaa gcgcggtgag 31380
cgcaaggctc tcgtttctac tgatgtcacc tcgcgcggta ttgacaaccc caatgtgact 31440
ttagttatta atgtcgacct tcctgttaac cgcgagcagg aagctgaccc ggagaatttt 31500
gtgcacagga taggccgctc gggacgttgg actaagaagg gtgcttctgt ttctcttgtg 31560
gctcgcagcc ctgccttccg tgaccttggc ctcatgaagg acattgagcg tgcactcttc 31620
gctaatgcag aggtaaaccg tccgcttatc cccgtcgatg atctctccaa ccttgagagc 31680
aagatcattt ctgctcttga agcatacaac taagtgccta cctaccttaa tcagccctta 31740
tcacttgcat tgcgagcccg ggtttccgca gcgcttgccc tgtgttgcta gagactgggc 31800
aagctggctc gcctgtctct ttctcgcatt caacaatgca ttcaccgttt ctcctagctg 31860
cacccgccct ctctcttgcg cccacgacaa gaaaaataca gttcatatca gcatcccccc 31920
caaaacaacc ataacaatta cgtaaatgaa ggccgtttat tctaccgtgc atcatgagca 31980
ctgcaccttt tctctcctcc atcgcgcctt ataccgataa acaaaaaata gataacacct 32040
ttttgtagag caaccaccac cattgtttcc cttccctccc tccncnctcc ctcccaaaat 32100
aacttgcttt gtttgtacgg cgttccttct atctactttt tctttaatct tcaatcatgt 32160
ctgacggttc ctttacttat tatgcgttgt tttattcggt cacaaggagg tacagccttg 32220
atggtcctgc gatagatgcc gtactttatt gtcatatgtt tataactttt aaaaaattaa 32280
ttttttagta cttatattca aaattcaaaa ttcaaaatat aaaattcaaa attcaaaaat 32340
tcaaaaattc gaaattcgaa attcgaaatt caatttagat tgtaatctga ttatctttga 32400
atccgtcacc ttctttttat tattttttaa aataatttat ttttaatgtt tttagttaag 32460
ctaattttgt aaaaacaatt atattgttat aataacctta tcacctgaat aataagatag 32520
aaaacgaaga tgcatcctta cctcagcata agaccaaaca gactaaaacg aaacatcttg 32580
gattgcattt tgtctcgact atatcccatc tcaagagagc aataaaagtt attactgagc 32640
cttttcaagt cagaaatgtg tagtcgtgtt caaatttgaa ctttagtttt cgctaaataa 32700
catataagat ctgaattttg caacgactgt gacacaacac tttggttctc aagagaacac 32760
aagttcttgg ttggccagtg cttgttattc cgtatagtat tttgggataa tggacaagga 32820
tccaaaccaa gcacaattga gaagcataat tgcaacacca aacctgaaaa gtaactattt 32880
tgaagacatt accttgtggt gcagtttgat cgatacgaga gcaacgaacg gagcattgag 32940
gttaagcgag gggagtcaaa gaaagttatg ggacaggcac tcaactccac gatgaatgcc 33000
atgcatgtat ccaaggctgg ctgctcctct gggtggatgg gtgtcggggc acatgattat 33060
gtagaggaca aagatgtccc ttctcttgag ccttctgagc atagccaggc accttttcgt 33120
tgttcttgcg tacaatctcg ggttgtaggc cccaaaagtc acgttgaaaa ggtaatgggc 33180
tcacgatgtt gtcaaagccc tcgatgtagc gcgggcaaag gcacgcttgc agaactcgac 33240
gaggtcatgg acaacaaagt ccgaatttct ctagacgttg gcgaagacgt cgatgtcggc 33300
catgaagtcg gcaaagaaga taagacgagg ggcaaggcga ctcttcatga tggaggtgtt 33360
agagacaaac tcagaatcgc tgatggtgtt attggctcta attatgttga tctcaaggcc 33420
aaagaaagtc atcaagcacc aggctcggtt atccaatcgc agcggcactc tcgagccgag 33480
aagtaccgcc gagcagacgc tgttcgcgaa gctcttcgcg atttgtcttc tttctcgcca 33540
agtacttcaa tgaatacttt tcctgactct tcgagccgaa caacacctgc atgctcccct 33600
gaatcagaaa ctagccttga tgaggagaag gagaatatag ggctggtaaa taacgttcta 33660
cttgaggaag aacacgttag tcgcccacga tcaatgacgt ttgatgcttc actttcgatg 33720
acggagctgg aaacccaaaa cgaagtggag cacgctgtgt tgacttcgtc tgtcatgtat 33780
gcagccgaga aaactctaag ttttattaag gagaattccg gagaattggg caaacatatc 33840
ggaaccgaag gcggaagtaa tatcaaagac attgttgaag aacatgcaaa tcaaaaatcg 33900
caagaaagtg ataatgaaat gtttatgagg ttgcttgaag atctgcctac tcaggcccaa 33960
caagtagttt ccgaaagttt gggaacacct actaccaaac atcattactt ttccagcgcc 34020
aacacgagca gtggagcatc gcgaagcttg cagtcaggtc gatcaagcac cccaaactgt 34080
gtcacggtat ctccatgcac agagctgggc tctcctcgtt gcgggcttga ctctgtactt 34140
ggtaaccaaa ttgatgaaaa acatggtgaa gggcttgacg atcaccatag gatcccgcag 34200
tttgatctct tacaacatga gcttttacaa gatagcaact ctattacagc acacagagat 34260
ggtgaaacga cttcgtcccc agttgcctgg gctggagatc ttcaagatga tcttacgcgc 34320
tctctgttga cagaagttga acatcctttc atctgtcgag aaacaaatat accaccggtc 34380
cattcaaaag ggaacgaggg tttgagaaca tgcaatggtt cgtcgcatag atctagtctg 34440
ggagcaattt tgcacgagat tctcgaaacc aagggagact ttcgtaaaaa cggtgaactg 34500
atcaccgacc tcgacatctt cctaggcgat aaattgccaa aaggcaaaac attttggtcg 34560
ctcttgacaa gtagcgagct aggtgagctt ggtgaaagag ttgaactcga aataatgagc 34620
cgccccctcg cgcaccagcc ttaccgagaa tcactctggt gtgttgcatt tcagacaatc 34680
cagctcactc cctatcgcca aagattggcg ctcagctgtc gcgatagact tttgcctcac 34740
gagcgggctt taagcgggtt ctccattgct caactaggtc gtgcgtgttt tgtacttcgg 34800
caaaggctcg tagactgctt ccaccacaac ggcaggataa agttcaaatg ttacaggcga 34860
acatgcaagt tgctggaagc aaggatgtgg caatgagcct caaaacatag gcttggcaca 34920
gggtgttgaa gcgcctttct gagacccatg aaactcctag tttgtttgct ttgcatcgct 34980
ctgtatcaat cgtgccgcat gcaaatgcaa taagctaaca ctcaaatcat ggtacagtct 35040
tttaatttgg accgagtcta gggcacccga ggcatttcga tgcaaacatc tttctcatca 35100
aagacttatt taggcgagtt aggcattgga gctcaccttc cctggcaggt cgcctttacg 35160
tggtaagtta tataagtcaa gaggaaaacc cgagcgacgc tggtctctat aagattgaca 35220
gatccctgga ggtgataaag gttgtatcgt acaacttgtt ctacgagaat caaatcttgt 35280
acgctccaag ccagcagctt gaaattggca gatgagttgt atctgcgtca ggagttatca 35340
gagagcttac tggactatca aatggtagac atgttgacac tgcgcacctg aaaagctctg 35400
ccaagcacct ccgctcccca gaaagcctgg tttacatgaa gtgtgatgta gtctgcagtt 35460
caagatctaa tctcatcaga gagcgcttag tacccattgg tgatctgtca cattttgagg 35520
ctacgcacag tttggatgac gctcttcgcg ctgtatgcaa cacatccgac gaacgagatg 35580
aacctacttc caaagactcg tgtgctggtt ggcgcgcggc ctagacctgg tcggggcact 35640
ggcgcatgct atgagattgc tggacgcgaa aaatgtggcg aagctgtgta cgcagtgaac 35700
tggggtgcca aatcaatgat tctaagagtg tttgccccaa agtatggctt aaaatgtttc 35760
aaactaccca agggttcccc gacatgaggc cacatgtggg aagtgtattt gccccccatt 35820
tgagaagttg ggacagagcg cttcgtcagg gatgatcatg aagcatgttc tatgaacttg 35880
caccacttgt ttagaacgga agtgtggctg gaatgaaacc tatatgtcag catatctgcg 35940
ggtaatcccc aactacataa tatttgctgg tatgcttgct ttaagcagca atcaagtttc 36000
tagcaacagg gtaataacca ggtcaccggt caatcgcaca atggcctttt tagttcggaa 36060
aatttgacaa cctgtggatg tttggggagt ccatggataa atgtggagct gtttggtgta 36120
acagaacatt gcaaagggtg acgccttaga tccttttctc atgacaggct tcgatcacaa 36180
agttgtacac tttcaaggtt gtaggtgcgt attgaacttg gcatttctgg aacaaacaga 36240
cactatatct cgaatctggg tctgcctgcc cctctagctc aggccctgat agtttgacta 36300
gagcatcgcc gtctcgtgta ttctctccga atctttctgc acattgagtt agacttctcg 36360
tcgtgtttgg agcatgtgta aatacatcag cgatattttt ttactcctaa aaatggcaaa 36420
ttcgcattta cctactgcaa ataatgaatc aaaatgagga aacaatgtgc tatatgaacc 36480
gtgctctttg gaacacaaat aaaaaataaa taaagtcaaa gatcgtgcca aatccgccca 36540
acttgagaga aaggcttggc tggtgacctg ccctgttgtg gcatcatcct atcttggctg 36600
ccgccctcca aagagaaatg tgagcctcgg aagagcgggc taggctggta accaatgaga 36660
gctatgtaaa tagcaaagga agagagaata aatctttggg aataaacctg tcagcaaggc 36720
tccaaagctt gctttctggg caaggcttac atgttgcttg atatgatttc acagaagcat 36780
ttggacacgc caaactctgc tactttgact gtgcctaggt ggtaaaccaa gcaactgcta 36840
tctttgacgc caccatgcag gtttccatca aaatagagat agaggagaag ttaccatatt 36900
tgaatccacc aattcttcaa gtgtgtggag acgctcgagt aatgagcata cttgaggaag 36960
atgctcatgg accttccgtg tgtttttctc ccgaggtatt acacgatatt ttcgtatttg 37020
caatgttgca gagtcttgat atcgtgtgac agtggaaaca aatgctacag ttgattcctt 37080
gatccccttc atcgcaaaga gcttgttatt ctctataata agagctagtt accggcaccg 37140
tagtcgcttt tgctcagcaa gtggcccttt tccagcatga gataagacct cctaattttg 37200
gctcgttttc tgattacaaa tgaaggtcct tgccaactac accatggtca cagctttctc 37260
tgccgagctc agggatgcaa ctgtcggctt agacaccaag tcagcgtcgg ttgcaagtgc 37320
tgcttctgag agctgactgc tgtagtgtgt gggtttgctc cacctatgag tgggtatgag 37380
taggtctgct ccacctatga ggaccaccaa gtttgctctc catgtgctac agcgcctgcg 37440
tctcttgtgc ggtgagacat attttttgag cttggtcttt acgaaatgaa ggcctgcgac 37500
agacaacgat cgcaacaatt ctgcctcgaa ggcgcttatc cctacgtaga cgtaggtctc 37560
tgttcccact aaagccactc ctgcgtcaat agaacaaaag caaaagctct tatggctgct 37620
gtacaaatag agtaaaactt cacctttcta ctcgtaacac tacagttata agtagcaagt 37680
caatcagagc aagacctttg cgagtaaacc tgcattgctc tatcgcagtc ttccagcatc 37740
ttcgcgaggc ggtctcgcac aacttcagtc agtctgtaat aacaggagct ttagcaccag 37800
ccaaagcagt tgcgttgcaa ccagcagaag acttggcatc atgctcattc ccgctgtgga 37860
cgtggccgtg ggcggtgctg tggcgtcctc tgagaagttt gatctcctca aacgcctgag 37920
ttggtgcggc ccccttcgca tcatcccttc acttgactct gtctccgcac caagtgtggg 37980
tgcccctgag gagaaggact tctggaaatc tgctgttcgc aagtggggca aagctttgtg 38040
ttcgtaccct tgccaagttg gtcccatcgc cgctacaagc gttgaggaag tgacgcaatg 38100
gctcaacgaa ggcgctgtcc aagtcattgt tgagggttct ttcgacgacc tcgaggacat 38160
tgcttcgcag cttcctcgtg aacgtcttgt tgccagattt tccgagaagg tccttgaaga 38220
cgacggtctc ctgagcaaac tttctggcag cgttgggggc gtttcaatta tttctgaggc 38280
caaaaattct gaagaagtcg tcaaggtcgc agagagggca tggcagcttt tgggaaaacg 38340
ccttgctatc gcattagagg tccccgagat cgaggccgga ggcgaggcgc agaagattaa 38400
caaccagctt gttggtaagc tccatggact ccactccaca gactttcctg tgaacgttgt 38460
gtctgagaac gtttccatgc caacagaagg gtctcttgcg acagatactg actcagaagc 38520
tgccttttgc gtggcaaggt cttttgtagc gtgccttcgc accgaccgta cagatggtct 38580
ctttgcgacg gtcgtcaccg atgagaatgg cgtggcactt ggcctcgtgt actccagcga 38640
acagtctgtg gttgcctcgt tggcgtgtgg ccgcggcgtg tactggtcaa gatccaggca 38700
gagtctgtgg cgcaagggcg acacaagcgg tgcctttcag gagcttgtgt ctatcgcatt 38760
tgactgtgat gccgacgcga tgaggttcaa ggtgcgccag cgtggaaacc ctcctgcatt 38820
ttgccatcaa cagacccgca catgctgggg ttatgacggt ggcatccccc acctctttcg 38880
cactcttgag tcccgcaagc ttaacgcccc agaaggatca tacacaaaac gtctttttga 38940
ggacaaggca ttgctgcgta acaagctcat tgaggaggca caagaggtaa ttgaggctat 39000
tgaggagaat gacccagagc atgttgcccg cgaggtcgca gacctcgcat acttcctctt 39060
tgccgcgtgc acgtgcggaa atgcgtcgct cgaggacgtt acacggcagc ttgacatgcg 39120
ttccctcaag gtcaagcgga ggccaggcaa tgcaaaggca gatcgcatcg ctgctggtga 39180
ggcagttctc caggctcagc agcagaaaaa gtctgcagag gagcccccag cagctcccaa 39240
ggaccaggcc taaattgcat gcttattatt acacccaaat cctgcttatt gtgacttgtc 39300
tgcacccttt tcacattgaa gaagcgtgtt ttcttacccg tcacaccacc actaagtctc 39360
atcctttctt tcttaccttt ttactagtcc gaacgatata aactttatct ttgcaaggct 39420
cttgttatac tgcaattgtt atttagtttg ttttctattg ataggcaaac cagacgtaat 39480
cgtctgagag tgtttgaaga ggataaaaca aagaatcatt aacaggtttt gtgtttctgt 39540
acacttgaat agttttatgc ctatctactt ctagagcctg ggcggagttg gcatttgtat 39600
aatctcaaca ttcgataaca aattgcttca aatgaagaac aaaaacagga aatgatttga 39660
attaaaatct aatatttgta gaaaagaaaa agcgagctga catcattcca tcaaattgac 39720
caattgactc cttagcacag tagatatttc ctaaacgact tcaactcatt cctcattatc 39780
ctcgctgttc ctgcttccgt gagtaccctt gctgattcgt acttccaaat cgccgccatc 39840
ctcccggtca tcatcatctt cgtcatcttc gtcttcatca tcagcccctg acgaggagta 39900
aatgtcaagg taaggtttgg gattctcgag ctttcgcaat tctccaatac ttattggttg 39960
gccacagacc ggatcc 39976
<210> 3
<211> 8994
<212> DNA
<213> Ulkenia sp.
<400> 3
atggctcaac gtgagaaccg tctcgaggcc aacatggata cccgcatcgc tgtgatcggc 60
atgtccgcca tcctcccctg cggtaccacc gttcgtgagt cttgggaggc tatccgcgat 120
ggtatcgact gcctcagtga tctccccgag gaccgcgtcg atgtgaccgc ctacttcgac 180
ccggtcaaga ccaccaagga taagatctac tgcaaacgtg gtggattcat ccctgagtac 240
gacttcgacg cccgtgagtt cggcctcaac atgtttcaga tggaggactc cgacgcaaac 300
caaaccgtca ccctcctcaa ggtcaaggag gccctcgagg acgctggcat cgaagccctc 360
agcaaggaaa agaagaacat tggatgtgtt ctcggtatcg gtggtggcca gaagtccagc 420
cacgagttct actcccgctt aaactatgtt gtcgttgaga aggtccttcg caagatgggc 480
atgcctgagg aggatgttca agctgctgtt gagaagtaca aggccaactt ccctgagtgg 540
cgccttgact ccttccccgg tttcctcggc aacgttactg ccggtcgctg taccaacacc 600
ttcaacctcg atggtatgaa ctgtgtcgtc gatgctgcct gtgctagttc tctcatcgcc 660
gttaaggttg ccattgatga gcttctccac ggagactgtg acatgatgat cactggtgct 720
acctgcacgg ataactccat cggtatgtac atggccttct ccaagacccc ggtgttctct 780
accgacccta gcgtccgcgc atacgatgag aagaccaagg gtatgcttat tggcgaaggc 840
tctgccatgc ttgtgcttaa acgttacgcc gacgctgttc gtgatggtga cgagattcac 900
gctgtcattc gcggctgcgc ctcttcctct gacggtaagg cctccggtat ttacaccccg 960
accatctctg gtcaagagga ggctcttcgc cgtgcctaca tgcgcgctaa cgtcgatccc 1020
gccaccgtca ctcttgttga gggccacggt accggtaccc ccgttggtga ccgtattgag 1080
ctcaccgctc tccgtaacct cttcgacagt gcctacggca acgagaagga gaaggtcgct 1140
gttggcagca ttaagtccaa catcggtcac ctcaaggctg tcgccggtct tgccggtatg 1200
atcaaggtca tcatggccct caagcataag actcttccgg ccaccatcaa cgttgatgag 1260
ccccctaagc tttacgacaa cactcccatc accgactcat cgctgtacat taacacgatg 1320
aaccgtccgt ggttccctgc tccgggtgtg ccccgtcgcg ctggtatctc cagtttcggt 1380
tttggtggtg ccaactacca cgccgttctt gaggaagccg agcccgagca ccagaaggct 1440
taccgtctca acaaacgccc ccagccggtg cttctgatgg catcttcaac ccaggctctt 1500
gcttccctct gtgaagccca gcttaaggaa ttcgagaagg ctatcgagga gaacaagacc 1560
gtcaagaaca ctgcttacat caagtgcgtc gacttctgtg agaagttcaa gttccctgga 1620
tctatcccga gctctaacgc tcgcctcggt tttcttgtca aggaggccga tgatgccacc 1680
gagaccctcc gtgccatcgt tgcccagttc caaaagtcag ctggcaagga ttcttggcac 1740
cttccccgcc agggtgtgag ctttcgtgct cagggcatca acaccactgg tggtgtcgct 1800
gccctcttct ctggccaggg tgctcagtac acccacatgt tcagcgaggt cgccatgaac 1860
tggcctcagt tccgtgagag catctctgac atggatcgtg cccaggctaa ggttgctggc 1920
gctgacaagg actacgagcg tgtctcccaa gtcctctacc cgcgtaagcc ttataactct 1980
gagcccgagc aggaccacaa gaagatctcc ctgacctcat actctcagcc ctctaccctc 2040
gcctgcgctc ttggtgccta cgagatcttc aagcaggctg gtttcaagcc cgacttcgct 2100
gccggtcact ctctcggtga gtttgcggcc ctctacgctg ctgactgcgt caaccgtgac 2160
gacctctttg agctcgtgtg ccgtcgtgcc cgcatcatgg gtggcaagga tgcacctgct 2220
acccccaagg gatgcatggc tgctgtcatt ggacccaatg ccgagaagat ccagattcgc 2280
actgctgatg tctggctcgg caactgcaac tccccttcgc agactgtcat caccggctct 2340
gttgagggta tcaagaagga gtccgagctt ctccagagtg agggcttccg tgttgtcccc 2400
ctcgcctgcg agagtgcctt ccactcaccg cagatgcaaa acgcctcctc tgccttcaag 2460
gatgttctct ccaaggttgc cttccgtcag cctagcgccc agaccaagct cttcagcaac 2520
gtgtctggcg agacctactc caacaatgcc caggacctcc ttaaggagca catgaccagc 2580
agtgttaagt tcatctctca ggttcgcaac atgcactctg ctggtgctcg catctttgtc 2640
gagtttggcc ccaagcaggt gctctctaag cttgtttccg agaccctcaa ggacgatcct 2700
tccattatca ctatctctgt caacccttcc tctggcaagg atgccgatat tcagcttcgc 2760
gaggctgctg tgcagctcgt tgttgctgga gtcaaccttc agggcttcga caagtgggac 2820
gcacctgacg ccacccgcct tcagccgatt aagaagaaga agactactct tcgtctctcg 2880
gctgccactt acgtgtctga caagaccaag aaggctcgcg aggctgccat gaacgacggc 2940
cgcatgctca gctgtgtcag caaggtcatc gccccccctg acgccaagcc cattgtggac 3000
accaaggctc aggaggaggt tgctcgtctc cagaagcagc ttcaggatgc ccaggcccag 3060
atccagaagg ccaaggccga tgctgctgag gctgacaaga agcttgccgc tgctaaggat 3120
gaggccaagc gtgccgccgc ttctgcacct gtgcagaagc aggttgacac caccattgtt 3180
gataagcacc gtgctatcct caagtctatg cttgctgagc ttgactgcta ctccactcct 3240
ggtgctgtgt ccagctcttt ccaggcacct gttgctgcta cccctgctcc ggtcgctgcg 3300
cctgttgcag ctgctcctgc tccggctgtc aacaatgctc tccttgccaa ggctgagtct 3360
gttgtcatgg aggttcttgc cgccaagact ggttacgaga ctgacatgat cgagcccgac 3420
atggagctcg agactgagct cggcattgac tctatcaagc gtgtcgagat tctctctgag 3480
gtccaggccc agctcaacgt cgaggccaag gatgttgatg ctcttagccg cacccgcacc 3540
gtcggtgagg ttgtcaacgc catgaaggct gagatcgctg gcagctctgg tgctgccgct 3600
gctgccccgg ccccggttgc tgctgctccc gctgcccctg cccctgctgt caacagcgct 3660
cttcttgcca aggctgagac tgttgtcatg gaggttcttg ccgccaagac tggttacgag 3720
actgacatga ttgagcccga catggagctc gagactgagc tcggcattga ctccatcaag 3780
cgtgtcgaga ttctctctga ggttcaggcc cagctcaacg ttgaggccaa ggatgttgat 3840
gctcttagcc gcacccgcac cgttggtgag gttgtcaacg ccatgaaggc tgagatcgct 3900
ggcagctctg gtgctgccgc tgctgccccg gcccctgttg ctgctgctcc ggcgcccgtc 3960
gctgccgctg cccctgctgt cagcagcgct ctccttgaga aggctgagtc tgttgtcatg 4020
gaggttcttg ccgccaagac tggttacgag actgacatga ttgaggccga catggagctc 4080
gagactgagc tcggcattga ctccatcaag cgtgtcgaga ttctctctga ggtccaggcc 4140
cagctcaacg tcgaggccaa ggatgtcgat gctcttagcc gcacccgcac cgttggtgag 4200
gttgtcaacg ccatgaaggc tgagatcgct ggcagctctg gtgctgctgc cccggccccg 4260
gtcgctgcgg cccctgctcc ggtcgctgcc gctgcccctg ctgtcaacag cgctcttctt 4320
gagaaggctg agactgttgt catggaggtt cttgccgcca agactggtta cgagactgac 4380
atgatcgagc ccgacatgga gctcgagact gagctcggca ttgactctat caagcgtgtc 4440
gagattctct ctgaggtcca ggcccagctc aacgttgagg ccaaggatgt tgatgctctt 4500
agccgcaccc gcaccgttgg tgaggttgtc aacgccatga aggctgagat cgctggcagc 4560
tctggtgctg ccgctgctgc cccggccccg gttgctgctg ctcccgctcc cgtcgctgcc 4620
cctgctgtca gcagcgctct ccttgagaag gctgagtctg tcgtcatgga ggttcttgcc 4680
gccaagactg gttacgagac tgacatgatt gaggccgaca tggagctcga gactgagctc 4740
ggcattgact ccatcaagcg tgtcgagatt ctctctgagg tccaggccca gctcaacgtt 4800
gaggccaagg atgtcgatgc tcttagccgc acccgcaccg ttggtgaggt tgtcaacgcc 4860
atgaaggctg agatcgctgg cagctctggt gctgccgctg ctgccccggc ccctgttgct 4920
gcctctcccg ctcccgtcgc tgccgctgcc cctgctgtca gcagcgctct ccttgagaag 4980
gccgaatctg ttgtcatgga ggttctcgcc gccaagactg gttacgagac tgacatgatt 5040
gaggctgaca tggagctcga gactgagctc ggcattgact ctatcaagcg tgtcgagatt 5100
ctctctgagg tccaggctat gcttaacgtt gaggccaagg atgttgatgc tcttagccgc 5160
acccgcaccg ttggtgaggt tgtcaacgcc atgaaggctg agatcgctgg cagctctggt 5220
gccgccgctg ctgccccggc cccggttgct gctgctccgg cgcccgtcac tgccgctgcc 5280
cctgctgtca gcagcgctct ccttgagaag gccgaatctg ttgtcatgga ggttctcgcc 5340
gccaagactg gttacgagac tgacatgatt gaggccgaca tggagctcga gactgagctt 5400
ggcattgact ccatcaagcg tgtcgagatt ctctctgagg tccaggctat gcttaacgtc 5460
gaggccaagg atgttgatgc tcttagccgc acccgcaccg ttggtgaggt tgtcaacgcc 5520
atgaaggctg agattgctag cagctctggt gctgctgccc ctgctccggc tgctgccgtt 5580
gcaccggccc ctgctgctgc ccctgctgtc agcagcgctc tccttgagaa ggccgaatct 5640
gttgtcatgg aggttctcgc cgccaagact ggttacgaga ctgacatgat tgaggccgac 5700
atggagctcg agactgagct cggcattgac tctatcaagc gtgtcgagat tctctctgag 5760
gtccaggcta tgcttaacgt tgaggccaag gatgttgatg ctcttagccg cacccgcacc 5820
gttggtgagg ttgtcaacgc catgaaggct gagattgcta gcagctctgg tgctgctgcc 5880
cctgctcctg ctgctgccgc tgcaccggcc cctgctgctg cccctgctgt cagcagcgct 5940
cttcttgaga aggctgagtc tgttgtcatg gaggttctcg ccgccaagac tggttacgag 6000
actgacatga ttgaggccga catggagctc gagactgagc ttggcattga ctccatcaag 6060
cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat 6120
gctcttagcc gcacccgcac cgttggtgag gttgtcaacg ccatgaaggc tgagattgct 6180
agcagctctg gtgctgctgc ccctgctcct gctgctgccg ctgcaccggc ccctgctgct 6240
gcccctgctg tcagcagcgc tcttcttgag aaggctgagt ctgttgtcat ggaggttctc 6300
gccgccaaga ctggttacga gactgacatg attgaggccg acatggagct cgagactgag 6360
cttggcattg actccatcaa gcgtgtcgag attctctctg aggtccaggc tatgcttaac 6420
gttgaggcca aggatgttga tgctcttagc cgcacccgca ccgttggtga ggttgtcaac 6480
gccatgaagg ctgagatcgc tggcagctct ggtgctgcta ctgcctctgc ccctgctgct 6540
gcagctgccg cccctgctat caagatctcc actgttcacg gtgctgactg cgatgacctc 6600
tctgtgatgt ctgctgagct tgtcgacatt cgtcgcgctg atgagctcct tcttgagcgc 6660
cctgagaacc gcccggtcct tattgtcgat gatggtaccg agctcacctc tgctctggtt 6720
cgtgttcttg gtgctggtgc tgtagttctt acctttgacg gtcttcagtt ggctcagcgt 6780
gctggtgctg ctgttcgcca tgtccaggtg aaggacctct ccgctgagag tgccgagaag 6840
gctatcaagg aggctgagca acgcttcggc cagcttggag gcttcatctc tcagcaggct 6900
gagcgctttg cccctgctga cattcttggt ttcaccctca tgtgcgctaa gtttgccaag 6960
gcttccctct gcacccctgt gcagggtggc cgtgccttct tcattggtgt ggcccgtctt 7020
gacggtcgcc ttggtttcac ctcccaggga tctactgact ccctcacacg tgcccagcgt 7080
ggtgctatct tcggcctctg caagaccatt ggccttgagt ggtctgctaa cgaagtgttc 7140
gcccgcggta ttgatattgc tcgtgaggtc caccctgaag atgctgccgt cgccatcact 7200
cgcgaaatgt cctgcgctga caaccgtatc cgcgaggtcg gcattggcct caaccagaag 7260
cgctgcacca tccgtgctgt ggacctcaag ccgggtgccc ccaagatcca gatcagccag 7320
gatgacgttc tccttgtgtc tggtggtgct cgtggtatta ctcctctctg catccgtgag 7380
atcacccgtc aggtccgcgg tggtaagtac attctcctcg gtcgctccaa ggtccctgct 7440
ggtgagcctg cttggtgcaa cggtgtttct gatgacgatc ttggcaaggc tgctatgcag 7500
gagctgaagc gtgctttctc cgccggtgag ggccccaagc ccaccccgat gacccacaag 7560
aagctcgttg gcactattgc tggtgcccgt gaggttcgtt cctcaattgc taacattgag 7620
gctctcggtg gcaaggcaat ctactcctct tgtgatgtga actctgctgc tgatgtcgcc 7680
aaggctgttc gcgaggctga ggctcagctt ggcgcccgtg taactggtgt cgtccacgct 7740
tctggtgtcc ttcgtgaccg cctcattgag cagaagcgcc ccgatgagtt tgatgctgtc 7800
ttcggcacca aggtgactgg tctcgagaac ctctttggtg ccattgacat ggccaacctt 7860
aagcacctcg tcctcttcag ctctcttgct ggtttccacg gcaacattgg tcagtctgac 7920
tacgccatgg ctaacgaggc cctcaacaag atgggtcttg agctctctga ccgtgtgtcc 7980
gtgaagtcta tttgcttcgg cccctgggat ggtggcatgg ttacccccca gctcaagaag 8040
cagttccagt ctatgggtgt tcagatcatc ccccgtgagg gtggtgccga tactgtggct 8100
cgcattgtcc tcggctcctc ccctgctgag atccttgttg gcaactggac cactcccacc 8160
aagaaggttg gcagtgagcc cgttgtgatc caccgcaaga tcagcgctgc atccaaccct 8220
tttcttaagg accacgtcat ccagggtcgc tgtgtgctcc ccatgaccat tgctgtgggc 8280
tgccttgctg agacctgcct gggtcagttc cctggatact ccctctgggc tattgaggat 8340
gctcaactct tcaagggtgt caccgttgac ggtgatgtca actgtgagat cactctcaag 8400
ccttcccagg gtactgccgg ccgcgttatg attcaggcca ccctgaagac cttcgctagc 8460
ggcaagcttg ttccggctta ccgtgccgtg atcgttctct ccactcaggg aaagccccct 8520
gctgctacta cttcccagac cccctctctc caggctgatc ctgctgcccg tggcaaccct 8580
tacgacggca agaccctctt ccacggccct gccttccagg gtcttaagga gatcatctct 8640
tgcaacaagt ctcagcttgt cgccgagtgc accttcattc cgtcttccga gagcgctggt 8700
gagttcgctt ctgactacga gtcccacaac cctttcgtca acgacattgc tttccaggcc 8760
atgctcgtct ggattcgccg caccctcggc caggctgccc tccccaactc tatccagcgc 8820
attgtgcagc accgtgctct tccccaggac aagcccttct acttgaccct caagagcaac 8880
agcgcgagtg gccactctca gcacaagacc tccgttcagt ttcacaacga gcagggtgac 8940
ctcttcgtgg acatccaggc ttccgtcacc tcttctgact cccttgcctt ctaa 8994
<210> 4
<211> 6093
<212> DNA
<213> Ulkenia sp.
<400> 4
atggcctctc gcaagaatgt gagcgctgct cacgaaatgc acgacgagaa gcgcattgcc 60
gtggtgggca tggccgtgca atacgcgggc tgcaaagaca aggaagagtt ctggaaagta 120
gtcatgggcg gtgaggctgc atggactaag attagcgata aacgcctcgg atccaacaag 180
cgagccgagc acttcaaagc agagcgtagc aaatttgcag ataccttttg caacgagaac 240
tacggctgcg tcgatgactc cgtcgataac gaacacgagc ttctccttaa gctctccaag 300
aaggctctct ccgagacatc ggtctccgac tctacaaggt gcggtattgt gagcggatgc 360
ctgtcctttc ccatggacaa cctccagggc gaactcctca atgtgtacca aaaccacgtc 420
gaaaagaaac tcggcgctcg cgtcttcaag gatgcctcca agtggtccga gcgtgagcag 480
tcgcagaacc ccgaggctgg tgaccgccgc atctttatgg acccggcatc cttcgtagca 540
gaagagctca acctcggtcc tcttcactac tctgtcgatg ctgcctgtgc caccgccctt 600
tacgtccttc gcctcgccca ggaccacctc gtttccggtg ctgctgatgt catgctcgct 660
ggtgcaactt gcttcccgga gccctttttc attctctccg gattctccac tttccaggcc 720
atgcctgtat cgggagacgg catctcgtac ccgcttcaca aggacagtca gggtctcacc 780
cctggtgaag gtggtgccat tatggttctc aagcgccttg acgacgctat tcgcgatgga 840
gaccacattt acggtactct gctcggtgct accatcagca atgctggctg tggtcttccc 900
ctcaagccgc acttgcccag cgagaagtcc tgcctcattg atacctacaa gcgcgtcaac 960
gtgcacccgc acaagatcca gtacgtcgag tgccacgcaa cgggtactcc ccagggagac 1020
cgcgttgaga ttgatgccgt caaggcttgc ttcgagggca aggtgcctcg ctttggaagc 1080
tccaagggta actttggcca cacactcgtt gcagctggtt tcgcaggcat gtgcaaggta 1140
ctccttgcca tgaagcatgg tgtgatcccg cccactcctg gtgtcgatgg atcttcccaa 1200
atggacccgc ttgtggtctc tgagcccatc ccatggcccg acactgaggg cgagcccaag 1260
cgcgctggtc tctccgcttt cggctttggt ggcaccaacg cccacgcagt ctttgaggag 1320
tttgaccgct ccaaggctgc ctgtgccacc cacgatagca tcagttccct cagctcacgt 1380
tgtggcgggg agggcaacat gcgcattgct attaccggta tggatgccac cttcggctcc 1440
ctcaagggcc tggacgcctt tgagcgtgcc atctacaatg gccaacatgg tgctgtgcca 1500
ttgcctgaga agcgctggcg tttccttggt aaagacaagg actttttgga cctgtgcggt 1560
gtcaaggagg tgccccacgg atgctacatt gaggacgtcg aggtggactt tagccgcctg 1620
cgcacgccca tgacgccaga cgacatgttg cgccccatgc agctacttgc tgtcacaacc 1680
atcgaccgtg ccattctcaa ctctggcctc aagaagggag gtaaggtcgc tgtcttcgtc 1740
ggccttggca ctgaccttga gctctaccgt caccgcgccc gcgttgccct caaggagcgt 1800
gctcgtcccg aagccgcttc agccctcaat gatatgatgt cctacatcaa cgattgcggt 1860
accgctacct cgtacacatc ctacatcggc aacctcgtgg ccacccgcgt gtcttcacaa 1920
tggggtttcg agggtccttc tttcaccatc acagagggca acaactccgt ctaccgttgc 1980
gcagagttgg gcaagtactt gctcgagact ggcgaggtcg aggccgtagt gatcgccggt 2040
gtggatcttt gcgccagcgc tgagaatctc tacgtgaagt cgcgtcgttt caaggtctcg 2100
gagcaggaga gcccgcgggc cagcttcgac tccggcgctg acggctactt tgttggtgag 2160
ggatgtggtg ccctcgtcct caagcgcgag agcgactgca ccaaggacga acgcatttac 2220
gcctgcatgg acgctatcgt gcccggcaac atgccggcag cctgcatgga ggaggctctc 2280
gcccaggctc gcgtcaaccc caaggacgtt gagatgctcg agctctccgc tgactctgcc 2340
cgccacctca agaacccctc cgttctgcct aaggaactca ctgctgagga ggaaatccgc 2400
ggcattgagg ccattctcag ccagcgctct agcaacgaag ctgtggagcc ccacaacgtc 2460
gctgtcagca gcgtcaagtc cactgtcggt gacaccggct acgcctcagg agctgccagt 2520
ctcatcaaga cggctctctg tctgtacaac cgctacttgc cctcaaacgg cgcctcctgg 2580
gaggagcctg cacctgagac acagtggggc aagtctctgt acgcgtgcca gtcctcgcgg 2640
gcctggttga agaaccctgg agctcgccgc cacgcagctg tctcaggtgt ttccgagacc 2700
cgttcatgct acacggtgct gctctctgat gtggagggcc accacgagac caagagccgc 2760
atttcgctcg atgacgatgc cgtcaaactc ctcgtaatcc gcggagactc ccatgacgct 2820
atcacgcagc gtgttgacaa gctccgcgag cgcctcgccc agcctagcgc taatgtacgt 2880
cttgctttta tggagttgct cggcgagagc attgcccagg agaccaagac cccgttgccg 2940
gccttcgctc tgtgcctggt gacctctcct agtaagctcc agaaggagct tgaactcgcc 3000
tccaagggca tcccgcggag tcttaagatg ggccgcgact ggacatcacc ctcgggcagc 3060
cactttgcac ccaagccact gtcaagcgat cgcgttgcgt ttatgtacgg cgaaggccga 3120
agcccttact atggtatcgg ccttgacatt caccgcatct ggcccgaact tcacgagttt 3180
gtaaacgcca agaccaacaa gctttgggat caaggcgaca gatggttgat cccgcgcgcc 3240
tcgacgaagg aggagcttaa ggcgcaggaa gatgagttca accgcaacca ggtggagatg 3300
ttccgactcg gtattctcat gtccatgtgc ttcacccaca tcgctcgcga cgtgcttggc 3360
atccagccca aggctgcttt cggactgagc cttggagaga tttccatggt ttttgccttt 3420
tctgagaaga acggccttgt ctctgaggag ctgacaacta aactccgcaa ctcggaggtc 3480
tggcgtaagg ccctcgctgt tgagtttgac gccctccgca aggcctggaa tattccccaa 3540
gatacccctg tcagcgagtt ctggcaagga tacgtggtac gtggaacccg cgaggccgtt 3600
gaagcggcca tcggccccaa caataagtac gtgcacttga ccattgtcaa cgatgccaac 3660
agtgctctca tcagtggcaa gcctgaagat tgcaaggctg ccattgctcg cctgagcagc 3720
aacctccctg ctttgcccgt ggaccttggt atgtgtggcc actgccccgt ggtcgagccg 3780
tacggcaagc agatcgctga gatccatagc gtcctcgaga ttcccgaggt tgccggcctt 3840
gacctgtaca cgagcgtcaa ccagaagaag cttgttaaca agtccactgg agccagcgac 3900
gagtacgcac ccagctttgg tgaatacgca gcacagctgt acactgttca ggcagacttt 3960
cctaagatcg ccaagaccgt tagcgacaag aactttgacg tctttgttga gactggtccc 4020
aacgcccacc gtagcgccgc aattcgcgcc acccttggaa atagcaagcc ttttgtcacc 4080
ggatccatgg accgccagaa cgagaatgct tggacaacca tggtcaagct ggttgcctct 4140
ctccaagccc accgcgtgcc tggcgtgaag gtctcccctc tgtaccaccc cgagactgtt 4200
gaggaggcta cgcagagtta caacgatatg gtggctggca agaagcctac taagaacaag 4260
ttcttgcgta agattgtggt caatggtcgc tatgacccca aaaagcagct cgtgccgccc 4320
caggtgctag ctaagcttcc tcctgcggac cccaagatcg aggctcttat ccaggctcgc 4380
aagatgcagc ctattgcccc caagttcatg gagcgtctcg acattcagga gcaagacgcc 4440
acacgcgacc ctattctcaa caaggataac aaaccttccg ctgctcctgc ccttgcccct 4500
gctgctccgg cccgcagcgt ctccggagct gttgtggctt cctctgaggc tctccgtgcc 4560
aaacttttgg agctcaacag cactttgatg cttggtgtca acgccaacgg tgatctcgtt 4620
gaagcaagcc caagtgaagc atctattgtt gtgcccaagt gcgatatcaa ggatcttggc 4680
agccgtgcct tcatggagac atatggtgta tccgccccca tgtacaccgg cgccatggca 4740
aagggcattg catccgctga gatggttatc gctgccggaa agcgcggcat ccttggttct 4800
ctcggtgctg gtggtcttcc tatcgccacc gtacgcaagg ctctcgaagc tatccaggct 4860
gaactgccca agggccctta cgctgtcaac ctcatccact ctcccttcga cagcaacctc 4920
gagaagggta acgtcgacct cttcctcgag aagggcgtca ctgtcgttga agcctccgcc 4980
tttatgacct tgaccccgca gctcgtgcgc taccgtgctg caggtctctc tcgcgctgct 5040
gatggctcca cggttattaa gaaccgcgtc atcggtaagg tttctcgcac agagcttgcc 5100
gcaatgttta tccgtcccgc gcccgagaat ctcctcgaga agctgctgaa gtccggcgag 5160
atcacccaag agcaggctgc tctcgcacgc acagtgcctg tggcagacga cattgccgtt 5220
gaggcggact ccggtggcca caccgataac cgccccatcc acgtcatcct ccctctcatt 5280
gtcaacctcc gtgatcgtct gcacaaggag tgcggctacc ctgcccacct tcgcgttcgc 5340
gttggtgctg gtggtggcat tggatgccct caggccgcca ttgccacctt caacatgggc 5400
gcggccttca tcgtcactgg taccgtaaac cagatgagta agcaagctgg aacctgtgac 5460
accgttcgca agcagctctc acaagccacc tactccgaca tctgcatggc cccagcagct 5520
gacatgtttg aggaaggtgt caagctccag gtgctcaaga agggaactat gttcccctcg 5580
cgtgccaaca agctctatga gctcttcgtc aagtatgact cctttgagtc catggctcct 5640
ggagagctgg aacgtgtgga gaagcgcatt ttcaagaagt ctctgtcaga agtttgggaa 5700
gagaccaagg acttctacat caacaggttg cagaacccgg agaagattga gcgcgcggag 5760
cgtgacccca agcttaagat gtccttgtgc ttccgctggt accttggttt ggcgagcttc 5820
tgggcaaacg ctggcatccc ggaccgtgcc atggactacc aggtttggtg tggcccagcg 5880
attggatctt tcaacgactt catcaagggt acctaccttg accccgccgt tgccaacgag 5940
taccccgatg ttgtgcaaat caacttgcag atcctccgtg gtgcctgctt cttgcgccgc 6000
ctcgaagctg tccgtaatgc cccgctgaag gctaacgcca agcaggttgc tgccgagatt 6060
gatgacatct acgtgcccac tgagcgcctg taa 6093
<210> 5
<211> 4398
<212> DNA
<213> Ulkenia sp.
<400> 5
atggccactc gcgtgaagac caacaagaaa ccatgctggg agatgaccaa ggaggagctc 60
accagcggca agaacgtcgt tttcgactat gacgagctcc ttgagttcgc cgagggtgac 120
atcagcaagg tcttcggccc cgaattcagc cagatcgacc agtacaagcg tcgcgttcgt 180
ctccccgccc gcgagtacct cctcgtcacc cgcgtcaccc tcatggacgc cgaggtcaac 240
aactaccgcg tcggtgcccg catggtcact gagtacgacc tccccgtcaa cggtgagctc 300
tctgagggtg gtgactgccc ctgggccgtg ctcgtcgaga gtggtcagtg tgatctcatg 360
ctcatctcct acatgggtat tgacttccag aacaagagcg accgcgtcta ccgtctgctc 420
aacaccaccc tcaccttcta cggtgttgcc caggagggcg agaccctgga gtacgacatc 480
cgcgtgaccg gcttcgccaa gcgtctcgac ggtgacatct ccatgttctt cttcgagtac 540
gactgctacg tcaacggccg tctcctcatc gagatgcgcg acggctgtgc cggtttcttc 600
accaacgagg agctcgccgc cggcaagggt gtcgtcttta cccgcgctga tctcctcgcc 660
cgcgagaaga ccaagaagca ggacatcacc ccgtacgcca ttgccccgcg tcttaacaag 720
accgttctca acgagactga gatgcagtcc ctcgtggaca agaactggac caaggttttc 780
ggccccgaga acggcatgga ccagatcaac tacaaactct gcgcccgtaa gatgctcatg 840
attgaccgcg tcaccaagat tgactacacc ggtggcccct acggccttgg tcttctcgtt 900
ggtgagaaga tcctcgagcg cgaccactgg tactttccgt gccacttcgt cggagaccag 960
gtcatggctg gatccctcgt gtctgacggc tgcagccagc tcctcaagat gtacatgctc 1020
tggctcggcc tccaccttaa gaccggtccc ttcgacttcc gccccgtcaa cggccacccc 1080
aacaaggtcc gctgccgtgg ccagatctcc ccgcacaagg gtaagctcgt atacgtcatg 1140
gagatcaagg agatgggcta cgacgaggct ggtgacccgt acgccatcgc cgatgtcaac 1200
attctcgaca ttgacttcga gaagggccag actttcgacc ttgccaacct ccacgagtac 1260
ggcaagggcg acctcaacaa gaagatcgtc gtcgacttca agggtattgc cctcaagctc 1320
cagaagcgct ctggccctgc cgttgtcgct cccgagaagc ccctcgctct caacaaggac 1380
ctttgcgccc cggctgttga ggccatccct gagcacatcc tcaagggcga tgctcttgcc 1440
cctaaccaga tgacctggca cccgatgtcc aagatcgctg gcaaccccac gccctcgttc 1500
tctccctcgg cctaccctcc ccgtcccatc accttcaccc cgttccccgg caacaagaac 1560
gacaacaacc acgtgcccgg cgagatgccg ctctcgtggt acaacatggc tgagttcatg 1620
gccggcaagg tcagcctctg cctcggccct gagttcgcca agttcgatga ctccaacacc 1680
agccgcagcc ctgcatggga ccttgctctt gtgactcgtg tggtctccgt ttctgacatg 1740
gagtgggtcc agtggaagaa cgtggactgc aacccgtcca agggaaccat ggttggcgag 1800
ttcgactgcc ccatcgacgc ctggttcttc cagggatctt gtaacgacgg ccacatgccg 1860
tactccatcc tcatggagat cgccctccag acctctggtg tcctcacctc tgtgctcaag 1920
gccccgctca ccatggagaa gaaggacatt ctcttccgca accttgacgc caacgccgag 1980
atggttcgct ctgatattga cctccgcggc aagaccatcc acaacctcac caagtgtacc 2040
ggctacagca tgctcggaga catgggtgtc caccgcttca gcttcgagct ctctgttgat 2100
ggtgtagtct tctacaaggg taccacctcc ttcggctggt tcgtccctga ggtcttcatc 2160
tcccagactg gtctcgacaa cggtcgccgc acccagccct ggcacattga gtccaaggtg 2220
ccttccgccc aggtcctcac ctacgacgtt acccccaacg gtgccggtcg cacccagctc 2280
tacgccaacg cccccaaggg cgctcagctc actcgccgct ggaaccagtg ccagtacctt 2340
gacaccatcg accttgtggt cgccggtggc tccgccggtc ttggctacgg tcatggccgc 2400
aagcaggtga accccaagga ctggttcttc tcgtgccact tctggttcga ctccgtcatg 2460
cccggctcgc tcggtgtgga gtctatgttc cagctcgtcg agtccatcgc tgtcaagcag 2520
gacctcgccg gcaagtacgg catcaccaac ccgaccttcg ctcatgctcc gggcaagatc 2580
tcctggaagt accgtggtca gctcaccccc acctccaagt tcatggactc cgaggcccac 2640
attgtctcca tcgaggccca cgacggcgtc gtcgacatcg ttgccaatgg taacctctgg 2700
gctgatggcc tccgcgtcta caacgtcagc aacatccgtg tgcgcattgt tgctggcgcc 2760
gcccctgctg ctgctgctgc tgctgctgct gttgctgctc cggctgccgc ccctgctccg 2820
gttgctgcat ctggccctgc ccagaccatc accctcaagc agctcaaggc tgagcttctt 2880
gacgttgaga agcctctcta catctcctcc agcaacggcc aggtcaagaa gcacgccgat 2940
gtggctggtg gccaggccac cattgtgcag gcttgcagcc tcagtgacct cggtgatgaa 3000
ggcttcatga agacctacgg tgttgtggct cctctctaca ccggtgccat ggccaagggt 3060
attgcctctg ctgaccttgt gattgccact ggtaagcgca agatcctcgg ttccttcggt 3120
gctggcggtc tccccatgca cattgtccgt gccgctgttg agaagatcca ggctgagctc 3180
ccgaacggcc ccttcgccgt caacctcatc cactccccct tcgatagcaa ccttgagaag 3240
ggcaacgttg acctcttcct cgagaagggc gttactgtcg tcgaggcctc cgccttcatg 3300
accttgaccc cgcaagtcgt ccgctaccgt gctgctggtc tttcccgtaa cgctgatggc 3360
tccattaaca tcaagaaccg catcatcggt aaggtctccc gtaccgagct cgctgagatg 3420
ttcatccgcc ctgccccgca gaacctcctc gacaagctca tccagtctgg tgagattacc 3480
aaggagcagg ctgagcttgc caagctcgtc cccgtcgccg acgacatcgc cgtcgaggcc 3540
gactctggtg gccacaccga caaccgcccc atccacgtca tcctccccct tatcatcaac 3600
ctccgcaacc gcctccacaa ggagtgcggc taccccgctc acctccgcgt gcgcgttgga 3660
gctggtggtg gtgttggatg cccccaggcc gctgccgctg ctctcgctat gggtgctgcc 3720
ttccttgtta ccggcactgt caaccaggtc gccaagcagt ccggcacctg cgacaatgtc 3780
cgcaagcagc tctgcatggc cacctactct gacgtctgca tggctcccgc tgctgacatg 3840
ttcgaggagg gcgtcaagct ccaggtcctc aagaagggaa ccatgttccc gtccagggct 3900
aacaagctct acgagctctt ctgcaagtac gactccttcg agtccatgcc tgccacagag 3960
ctcgagcgtg ttgagaagcg catcttccag tgccctcttg ctgatgtctg ggctgagacc 4020
tccgacttct acatcaaccg cctccacaac ccggagaaga tcacccgtgc cgagcgtgac 4080
cccaagctca agatgtctct ctgcttccgc tggtaccttg gtcttgcctc tcgctgggcc 4140
aacaccggtg aggctggacg cgtcatggac taccaggtct ggtgtggccc tgccattgga 4200
gccttcaacg acttcatcaa gggctcctac cttgacccgg ccgtctctgg tgagtacccg 4260
gacgtcgtgc agatcaactt gcagatcctt cgcggtgcct gctacctccg ccgtctcaat 4320
gtcatccgca acgacccgcg tgtcagcatt gaggtcgagg atgctgagtt cgtctacgag 4380
cccaccaacg ccctctaa 4398
<210> 6
<211> 2997
<212> PRT
<213> Ulkenia sp.
<400> 6
Met Ala Gln Arg Glu Asn Arg Leu Glu Ala Asn Met Asp Thr Arg Ile
1 5 10 15
Ala Val Ile Gly Met Ser Ala Ile Leu Pro Cys Gly Thr Thr Val Arg
20 25 30
Glu Ser Trp Glu Ala Ile Arg Asp Gly Ile Asp Cys Leu Ser Asp Leu
35 40 45
Pro Glu Asp Arg Val Asp Val Thr Ala Tyr Phe Asp Pro Val Lys Thr
50 55 60
Thr Lys Asp Lys Ile Tyr Cys Lys Arg Gly Gly Phe Ile Pro Glu Tyr
65 70 75 80
Asp Phe Asp Ala Arg Glu Phe Gly Leu Asn Met Phe Gln Met Glu Asp
85 90 95
Ser Asp Ala Asn Gln Thr Val Thr Leu Leu Lys Val Lys Glu Ala Leu
100 105 110
Glu Asp Ala Gly Ile Glu Ala Leu Ser Lys Glu Lys Lys Asn Ile Gly
115 120 125
Cys Val Leu Gly Ile Gly Gly Gly Gln Lys Ser Ser His Glu Phe Tyr
130 135 140
Ser Arg Leu Asn Tyr Val Val Val Glu Lys Val Leu Arg Lys Met Gly
145 150 155 160
Met Pro Glu Glu Asp Val Gln Ala Ala Val Glu Lys Tyr Lys Ala Asn
165 170 175
Phe Pro Glu Trp Arg Leu Asp Ser Phe Pro Gly Phe Leu Gly Asn Val
180 185 190
Thr Ala Gly Arg Cys Thr Asn Thr Phe Asn Leu Asp Gly Met Asn Cys
195 200 205
Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala Val Lys Val Ala
210 215 220
Ile Asp Glu Leu Leu His Gly Asp Cys Asp Met Met Ile Thr Gly Ala
225 230 235 240
Thr Cys Thr Asp Asn Ser Ile Gly Met Tyr Met Ala Phe Ser Lys Thr
245 250 255
Pro Val Phe Ser Thr Asp Pro Ser Val Arg Ala Tyr Asp Glu Lys Thr
260 265 270
Lys Gly Met Leu Ile Gly Glu Gly Ser Ala Met Leu Val Leu Lys Arg
275 280 285
Tyr Ala Asp Ala Val Arg Asp Gly Asp Glu Ile His Ala Val Ile Arg
290 295 300
Gly Cys Ala Ser Ser Ser Asp Gly Lys Ala Ser Gly Ile Tyr Thr Pro
305 310 315 320
Thr Ile Ser Gly Gln Glu Glu Ala Leu Arg Arg Ala Tyr Met Arg Ala
325 330 335
Asn Val Asp Pro Ala Thr Val Thr Leu Val Glu Gly His Gly Thr Gly
340 345 350
Thr Pro Val Gly Asp Arg Ile Glu Leu Thr Ala Leu Arg Asn Leu Phe
355 360 365
Asp Ser Ala Tyr Gly Asn Glu Lys Glu Lys Val Ala Val Gly Ser Ile
370 375 380
Lys Ser Asn Ile Gly His Leu Lys Ala Val Ala Gly Leu Ala Gly Met
385 390 395 400
Ile Lys Val Ile Met Ala Leu Lys His Lys Thr Leu Pro Ala Thr Ile
405 410 415
Asn Val Asp Glu Pro Pro Lys Leu Tyr Asp Asn Thr Pro Ile Thr Asp
420 425 430
Ser Ser Leu Tyr Ile Asn Thr Met Asn Arg Pro Trp Phe Pro Ala Pro
435 440 445
Gly Val Pro Arg Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly Gly Ala
450 455 460
Asn Tyr His Ala Val Leu Glu Glu Ala Glu Pro Glu His Gln Lys Ala
465 470 475 480
Tyr Arg Leu Asn Lys Arg Pro Gln Pro Val Leu Leu Met Ala Ser Ser
485 490 495
Thr Gln Ala Leu Ala Ser Leu Cys Glu Ala Gln Leu Lys Glu Phe Glu
500 505 510
Lys Ala Ile Glu Glu Asn Lys Thr Val Lys Asn Thr Ala Tyr Ile Lys
515 520 525
Cys Val Asp Phe Cys Glu Lys Phe Lys Phe Pro Gly Ser Ile Pro Ser
530 535 540
Ser Asn Ala Arg Leu Gly Phe Leu Val Lys Glu Ala Asp Asp Ala Thr
545 550 555 560
Glu Thr Leu Arg Ala Ile Val Ala Gln Phe Gln Lys Ser Ala Gly Lys
565 570 575
Asp Ser Trp His Leu Pro Arg Gln Gly Val Ser Phe Arg Ala Gln Gly
580 585 590
Ile Asn Thr Thr Gly Gly Val Ala Ala Leu Phe Ser Gly Gln Gly Ala
595 600 605
Gln Tyr Thr His Met Phe Ser Glu Val Ala Met Asn Trp Pro Gln Phe
610 615 620
Arg Glu Ser Ile Ser Asp Met Asp Arg Ala Gln Ala Lys Val Ala Gly
625 630 635 640
Ala Asp Lys Asp Tyr Glu Arg Val Ser Gln Val Leu Tyr Pro Arg Lys
645 650 655
Pro Tyr Asn Ser Glu Pro Glu Gln Asp His Lys Lys Ile Ser Leu Thr
660 665 670
Ser Tyr Ser Gln Pro Ser Thr Leu Ala Cys Ala Leu Gly Ala Tyr Glu
675 680 685
Ile Phe Lys Gln Ala Gly Phe Lys Pro Asp Phe Ala Ala Gly His Ser
690 695 700
Leu Gly Glu Phe Ala Ala Leu Tyr Ala Ala Asp Cys Val Asn Arg Asp
705 710 715 720
Asp Leu Phe Glu Leu Val Cys Arg Arg Ala Arg Ile Met Gly Gly Lys
725 730 735
Asp Ala Pro Ala Thr Pro Lys Gly Cys Met Ala Ala Val Ile Gly Pro
740 745 750
Asn Ala Glu Lys Ile Gln Ile Arg Thr Ala Asp Val Trp Leu Gly Asn
755 760 765
Cys Asn Ser Pro Ser Gln Thr Val Ile Thr Gly Ser Val Glu Gly Ile
770 775 780
Lys Lys Glu Ser Glu Leu Leu Gln Ser Glu Gly Phe Arg Val Val Pro
785 790 795 800
Leu Ala Cys Glu Ser Ala Phe His Ser Pro Gln Met Gln Asn Ala Ser
805 810 815
Ser Ala Phe Lys Asp Val Leu Ser Lys Val Ala Phe Arg Gln Pro Ser
820 825 830
Ala Gln Thr Lys Leu Phe Ser Asn Val Ser Gly Glu Thr Tyr Ser Asn
835 840 845
Asn Ala Gln Asp Leu Leu Lys Glu His Met Thr Ser Ser Val Lys Phe
850 855 860
Ile Ser Gln Val Arg Asn Met His Ser Ala Gly Ala Arg Ile Phe Val
865 870 875 880
Glu Phe Gly Pro Lys Gln Val Leu Ser Lys Leu Val Ser Glu Thr Leu
885 890 895
Lys Asp Asp Pro Ser Ile Ile Thr Ile Ser Val Asn Pro Ser Ser Gly
900 905 910
Lys Asp Ala Asp Ile Gln Leu Arg Glu Ala Ala Val Gln Leu Val Val
915 920 925
Ala Gly Val Asn Leu Gln Gly Phe Asp Lys Trp Asp Ala Pro Asp Ala
930 935 940
Thr Arg Leu Gln Pro Ile Lys Lys Lys Lys Thr Thr Leu Arg Leu Ser
945 950 955 960
Ala Ala Thr Tyr Val Ser Asp Lys Thr Lys Lys Ala Arg Glu Ala Ala
965 970 975
Met Asn Asp Gly Arg Met Leu Ser Cys Val Ser Lys Val Ile Ala Pro
980 985 990
Pro Asp Ala Lys Pro Ile Val Asp Thr Lys Ala Gln Glu Glu Val Ala
995 1000 1005
Arg Leu Gln Lys Gln Leu Gln Asp Ala Gln Ala Gln Ile Gln Lys Ala
1010 1015 1020
Lys Ala Asp Ala Ala Glu Ala Asp Lys Lys Leu Ala Ala Ala Lys Asp
1025 1030 1035 1040
Glu Ala Lys Arg Ala Ala Ala Ser Ala Pro Val Gln Lys Gln Val Asp
1045 1050 1055
Thr Thr Ile Val Asp Lys His Arg Ala Ile Leu Lys Ser Met Leu Ala
1060 1065 1070
Glu Leu Asp Cys Tyr Ser Thr Pro Gly Ala Val Ser Ser Ser Phe Gln
1075 1080 1085
Ala Pro Val Ala Ala Thr Pro Ala Pro Val Ala Ala Pro Val Ala Ala
1090 1095 1100
Ala Pro Ala Pro Ala Val Asn Asn Ala Leu Leu Ala Lys Ala Glu Ser
1105 1110 1115 1120
Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met
1125 1130 1135
Ile Glu Pro Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile
1140 1145 1150
Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Gln Leu Asn Val Glu
1155 1160 1165
Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val
1170 1175 1180
Val Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala
1185 1190 1195 1200
Ala Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Ala Pro Ala Pro Ala
1205 1210 1215
Val Asn Ser Ala Leu Leu Ala Lys Ala Glu Thr Val Val Met Glu Val
1220 1225 1230
Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met
1235 1240 1245
Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile
1250 1255 1260
Leu Ser Glu Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp
1265 1270 1275 1280
Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys
1285 1290 1295
Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro
1300 1305 1310
Val Ala Ala Ala Pro Ala Pro Val Ala Ala Ala Ala Pro Ala Val Ser
1315 1320 1325
Ser Ala Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala
1330 1335 1340
Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu
1345 1350 1355 1360
Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser
1365 1370 1375
Glu Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu
1380 1385 1390
Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu
1395 1400 1405
Ile Ala Gly Ser Ser Gly Ala Ala Ala Pro Ala Pro Val Ala Ala Ala
1410 1415 1420
Pro Ala Pro Val Ala Ala Ala Ala Pro Ala Val Asn Ser Ala Leu Leu
1425 1430 1435 1440
Glu Lys Ala Glu Thr Val Val Met Glu Val Leu Ala Ala Lys Thr Gly
1445 1450 1455
Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr Glu Leu
1460 1465 1470
Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala
1475 1480 1485
Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg
1490 1495 1500
Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Gly Ser
1505 1510 1515 1520
Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro Val Ala Ala Ala Pro Ala
1525 1530 1535
Pro Val Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu
1540 1545 1550
Ser Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp
1555 1560 1565
Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser
1570 1575 1580
Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Gln Leu Asn Val
1585 1590 1595 1600
Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu
1605 1610 1615
Val Val Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala
1620 1625 1630
Ala Ala Ala Pro Ala Pro Val Ala Ala Ser Pro Ala Pro Val Ala Ala
1635 1640 1645
Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser Val
1650 1655 1660
Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile
1665 1670 1675 1680
Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys
1685 1690 1695
Arg Val Glu Ile Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu Ala
1700 1705 1710
Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val
1715 1720 1725
Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala
1730 1735 1740
Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val Thr Ala Ala Ala
1745 1750 1755 1760
Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser Val Val Met
1765 1770 1775
Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala
1780 1785 1790
Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val
1795 1800 1805
Glu Ile Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu Ala Lys Asp
1810 1815 1820
Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala
1825 1830 1835 1840
Met Lys Ala Glu Ile Ala Ser Ser Ser Gly Ala Ala Ala Pro Ala Pro
1845 1850 1855
Ala Ala Ala Val Ala Pro Ala Pro Ala Ala Ala Pro Ala Val Ser Ser
1860 1865 1870
Ala Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala
1875 1880 1885
Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu
1890 1895 1900
Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu
1905 1910 1915 1920
Val Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser
1925 1930 1935
Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile
1940 1945 1950
Ala Ser Ser Ser Gly Ala Ala Ala Pro Ala Pro Ala Ala Ala Ala Ala
1955 1960 1965
Pro Ala Pro Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys
1970 1975 1980
Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu
1985 1990 1995 2000
Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile
2005 2010 2015
Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Met Leu
2020 2025 2030
Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val
2035 2040 2045
Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Ser Ser Ser Gly
2050 2055 2060
Ala Ala Ala Pro Ala Pro Ala Ala Ala Ala Ala Pro Ala Pro Ala Ala
2065 2070 2075 2080
Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser Val Val
2085 2090 2095
Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu
2100 2105 2110
Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg
2115 2120 2125
Val Glu Ile Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu Ala Lys
2130 2135 2140
Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn
2145 2150 2155 2160
Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Thr Ala Ser
2165 2170 2175
Ala Pro Ala Ala Ala Ala Ala Ala Pro Ala Ile Lys Ile Ser Thr Val
2180 2185 2190
His Gly Ala Asp Cys Asp Asp Leu Ser Val Met Ser Ala Glu Leu Val
2195 2200 2205
Asp Ile Arg Arg Ala Asp Glu Leu Leu Leu Glu Arg Pro Glu Asn Arg
2210 2215 2220
Pro Val Leu Ile Val Asp Asp Gly Thr Glu Leu Thr Ser Ala Leu Val
2225 2230 2235 2240
Arg Val Leu Gly Ala Gly Ala Val Val Leu Thr Phe Asp Gly Leu Gln
2245 2250 2255
Leu Ala Gln Arg Ala Gly Ala Ala Val Arg His Val Gln Val Lys Asp
2260 2265 2270
Leu Ser Ala Glu Ser Ala Glu Lys Ala Ile Lys Glu Ala Glu Gln Arg
2275 2280 2285
Phe Gly Gln Leu Gly Gly Phe Ile Ser Gln Gln Ala Glu Arg Phe Ala
2290 2295 2300
Pro Ala Asp Ile Leu Gly Phe Thr Leu Met Cys Ala Lys Phe Ala Lys
2305 2310 2315 2320
Ala Ser Leu Cys Thr Pro Val Gln Gly Gly Arg Ala Phe Phe Ile Gly
2325 2330 2335
Val Ala Arg Leu Asp Gly Arg Leu Gly Phe Thr Ser Gln Gly Ser Thr
2340 2345 2350
Asp Ser Leu Thr Arg Ala Gln Arg Gly Ala Ile Phe Gly Leu Cys Lys
2355 2360 2365
Thr Ile Gly Leu Glu Trp Ser Ala Asn Glu Val Phe Ala Arg Gly Ile
2370 2375 2380
Asp Ile Ala Arg Glu Val His Pro Glu Asp Ala Ala Val Ala Ile Thr
2385 2390 2395 2400
Arg Glu Met Ser Cys Ala Asp Asn Arg Ile Arg Glu Val Gly Ile Gly
2405 2410 2415
Leu Asn Gln Lys Arg Cys Thr Ile Arg Ala Val Asp Leu Lys Pro Gly
2420 2425 2430
Ala Pro Lys Ile Gln Ile Ser Gln Asp Asp Val Leu Leu Val Ser Gly
2435 2440 2445
Gly Ala Arg Gly Ile Thr Pro Leu Cys Ile Arg Glu Ile Thr Arg Gln
2450 2455 2460
Val Arg Gly Gly Lys Tyr Ile Leu Leu Gly Arg Ser Lys Val Pro Ala
2465 2470 2475 2480
Gly Glu Pro Ala Trp Cys Asn Gly Val Ser Asp Asp Asp Leu Gly Lys
2485 2490 2495
Ala Ala Met Gln Glu Leu Lys Arg Ala Phe Ser Ala Gly Glu Gly Pro
2500 2505 2510
Lys Pro Thr Pro Met Thr His Lys Lys Leu Val Gly Thr Ile Ala Gly
2515 2520 2525
Ala Arg Glu Val Arg Ser Ser Ile Ala Asn Ile Glu Ala Leu Gly Gly
2530 2535 2540
Lys Ala Ile Tyr Ser Ser Cys Asp Val Asn Ser Ala Ala Asp Val Ala
2545 2550 2555 2560
Lys Ala Val Arg Glu Ala Glu Ala Gln Leu Gly Ala Arg Val Thr Gly
2565 2570 2575
Val Val His Ala Ser Gly Val Leu Arg Asp Arg Leu Ile Glu Gln Lys
2580 2585 2590
Arg Pro Asp Glu Phe Asp Ala Val Phe Gly Thr Lys Val Thr Gly Leu
2595 2600 2605
Glu Asn Leu Phe Gly Ala Ile Asp Met Ala Asn Leu Lys His Leu Val
2610 2615 2620
Leu Phe Ser Ser Leu Ala Gly Phe His Gly Asn Ile Gly Gln Ser Asp
2625 2630 2635 2640
Tyr Ala Met Ala Asn Glu Ala Leu Asn Lys Met Gly Leu Glu Leu Ser
2645 2650 2655
Asp Arg Val Ser Val Lys Ser Ile Cys Phe Gly Pro Trp Asp Gly Gly
2660 2665 2670
Met Val Thr Pro Gln Leu Lys Lys Gln Phe Gln Ser Met Gly Val Gln
2675 2680 2685
Ile Ile Pro Arg Glu Gly Gly Ala Asp Thr Val Ala Arg Ile Val Leu
2690 2695 2700
Gly Ser Ser Pro Ala Glu Ile Leu Val Gly Asn Trp Thr Thr Pro Thr
2705 2710 2715 2720
Lys Lys Val Gly Ser Glu Pro Val Val Ile His Arg Lys Ile Ser Ala
2725 2730 2735
Ala Ser Asn Pro Phe Leu Lys Asp His Val Ile Gln Gly Arg Cys Val
2740 2745 2750
Leu Pro Met Thr Ile Ala Val Gly Cys Leu Ala Glu Thr Cys Leu Gly
2755 2760 2765
Gln Phe Pro Gly Tyr Ser Leu Trp Ala Ile Glu Asp Ala Gln Leu Phe
2770 2775 2780
Lys Gly Val Thr Val Asp Gly Asp Val Asn Cys Glu Ile Thr Leu Lys
2785 2790 2795 2800
Pro Ser Gln Gly Thr Ala Gly Arg Val Met Ile Gln Ala Thr Leu Lys
2805 2810 2815
Thr Phe Ala Ser Gly Lys Leu Val Pro Ala Tyr Arg Ala Val Ile Val
2820 2825 2830
Leu Ser Thr Gln Gly Lys Pro Pro Ala Ala Thr Thr Ser Gln Thr Pro
2835 2840 2845
Ser Leu Gln Ala Asp Pro Ala Ala Arg Gly Asn Pro Tyr Asp Gly Lys
2850 2855 2860
Thr Leu Phe His Gly Pro Ala Phe Gln Gly Leu Lys Glu Ile Ile Ser
2865 2870 2875 2880
Cys Asn Lys Ser Gln Leu Val Ala Glu Cys Thr Phe Ile Pro Ser Ser
2885 2890 2895
Glu Ser Ala Gly Glu Phe Ala Ser Asp Tyr Glu Ser His Asn Pro Phe
2900 2905 2910
Val Asn Asp Ile Ala Phe Gln Ala Met Leu Val Trp Ile Arg Arg Thr
2915 2920 2925
Leu Gly Gln Ala Ala Leu Pro Asn Ser Ile Gln Arg Ile Val Gln His
2930 2935 2940
Arg Ala Leu Pro Gln Asp Lys Pro Phe Tyr Leu Thr Leu Lys Ser Asn
2945 2950 2955 2960
Ser Ala Ser Gly His Ser Gln His Lys Thr Ser Val Gln Phe His Asn
2965 2970 2975
Glu Gln Gly Asp Leu Phe Val Asp Ile Gln Ala Ser Val Thr Ser Ser
2980 2985 2990
Asp Ser Leu Ala Phe
2995
<210> 7
<211> 2030
<212> PRT
<213> Ulkenia sp.
<400> 7
Met Ala Ser Arg Lys Asn Val Ser Ala Ala His Glu Met His Asp Glu
1 5 10 15
Lys Arg Ile Ala Val Val Gly Met Ala Val Gln Tyr Ala Gly Cys Lys
20 25 30
Asp Lys Glu Glu Phe Trp Lys Val Val Met Gly Gly Glu Ala Ala Trp
35 40 45
Thr Lys Ile Ser Asp Lys Arg Leu Gly Ser Asn Lys Arg Ala Glu His
50 55 60
Phe Lys Ala Glu Arg Ser Lys Phe Ala Asp Thr Phe Cys Asn Glu Asn
65 70 75 80
Tyr Gly Cys Val Asp Asp Ser Val Asp Asn Glu His Glu Leu Leu Leu
85 90 95
Lys Leu Ser Lys Lys Ala Leu Ser Glu Thr Ser Val Ser Asp Ser Thr
100 105 110
Arg Cys Gly Ile Val Ser Gly Cys Leu Ser Phe Pro Met Asp Asn Leu
115 120 125
Gln Gly Glu Leu Leu Asn Val Tyr Gln Asn His Val Glu Lys Lys Leu
130 135 140
Gly Ala Arg Val Phe Lys Asp Ala Ser Lys Trp Ser Glu Arg Glu Gln
145 150 155 160
Ser Gln Asn Pro Glu Ala Gly Asp Arg Arg Ile Phe Met Asp Pro Ala
165 170 175
Ser Phe Val Ala Glu Glu Leu Asn Leu Gly Pro Leu His Tyr Ser Val
180 185 190
Asp Ala Ala Cys Ala Thr Ala Leu Tyr Val Leu Arg Leu Ala Gln Asp
195 200 205
His Leu Val Ser Gly Ala Ala Asp Val Met Leu Ala Gly Ala Thr Cys
210 215 220
Phe Pro Glu Pro Phe Phe Ile Leu Ser Gly Phe Ser Thr Phe Gln Ala
225 230 235 240
Met Pro Val Ser Gly Asp Gly Ile Ser Tyr Pro Leu His Lys Asp Ser
245 250 255
Gln Gly Leu Thr Pro Gly Glu Gly Gly Ala Ile Met Val Leu Lys Arg
260 265 270
Leu Asp Asp Ala Ile Arg Asp Gly Asp His Ile Tyr Gly Thr Leu Leu
275 280 285
Gly Ala Thr Ile Ser Asn Ala Gly Cys Gly Leu Pro Leu Lys Pro His
290 295 300
Leu Pro Ser Glu Lys Ser Cys Leu Ile Asp Thr Tyr Lys Arg Val Asn
305 310 315 320
Val His Pro His Lys Ile Gln Tyr Val Glu Cys His Ala Thr Gly Thr
325 330 335
Pro Gln Gly Asp Arg Val Glu Ile Asp Ala Val Lys Ala Cys Phe Glu
340 345 350
Gly Lys Val Pro Arg Phe Gly Ser Ser Lys Gly Asn Phe Gly His Thr
355 360 365
Leu Val Ala Ala Gly Phe Ala Gly Met Cys Lys Val Leu Leu Ala Met
370 375 380
Lys His Gly Val Ile Pro Pro Thr Pro Gly Val Asp Gly Ser Ser Gln
385 390 395 400
Met Asp Pro Leu Val Val Ser Glu Pro Ile Pro Trp Pro Asp Thr Glu
405 410 415
Gly Glu Pro Lys Arg Ala Gly Leu Ser Ala Phe Gly Phe Gly Gly Thr
420 425 430
Asn Ala His Ala Val Phe Glu Glu Phe Asp Arg Ser Lys Ala Ala Cys
435 440 445
Ala Thr His Asp Ser Ile Ser Ser Leu Ser Ser Arg Cys Gly Gly Glu
450 455 460
Gly Asn Met Arg Ile Ala Ile Thr Gly Met Asp Ala Thr Phe Gly Ser
465 470 475 480
Leu Lys Gly Leu Asp Ala Phe Glu Arg Ala Ile Tyr Asn Gly Gln His
485 490 495
Gly Ala Val Pro Leu Pro Glu Lys Arg Trp Arg Phe Leu Gly Lys Asp
500 505 510
Lys Asp Phe Leu Asp Leu Cys Gly Val Lys Glu Val Pro His Gly Cys
515 520 525
Tyr Ile Glu Asp Val Glu Val Asp Phe Ser Arg Leu Arg Thr Pro Met
530 535 540
Thr Pro Asp Asp Met Leu Arg Pro Met Gln Leu Leu Ala Val Thr Thr
545 550 555 560
Ile Asp Arg Ala Ile Leu Asn Ser Gly Leu Lys Lys Gly Gly Lys Val
565 570 575
Ala Val Phe Val Gly Leu Gly Thr Asp Leu Glu Leu Tyr Arg His Arg
580 585 590
Ala Arg Val Ala Leu Lys Glu Arg Ala Arg Pro Glu Ala Ala Ser Ala
595 600 605
Leu Asn Asp Met Met Ser Tyr Ile Asn Asp Cys Gly Thr Ala Thr Ser
610 615 620
Tyr Thr Ser Tyr Ile Gly Asn Leu Val Ala Thr Arg Val Ser Ser Gln
625 630 635 640
Trp Gly Phe Glu Gly Pro Ser Phe Thr Ile Thr Glu Gly Asn Asn Ser
645 650 655
Val Tyr Arg Cys Ala Glu Leu Gly Lys Tyr Leu Leu Glu Thr Gly Glu
660 665 670
Val Glu Ala Val Val Ile Ala Gly Val Asp Leu Cys Ala Ser Ala Glu
675 680 685
Asn Leu Tyr Val Lys Ser Arg Arg Phe Lys Val Ser Glu Gln Glu Ser
690 695 700
Pro Arg Ala Ser Phe Asp Ser Gly Ala Asp Gly Tyr Phe Val Gly Glu
705 710 715 720
Gly Cys Gly Ala Leu Val Leu Lys Arg Glu Ser Asp Cys Thr Lys Asp
725 730 735
Glu Arg Ile Tyr Ala Cys Met Asp Ala Ile Val Pro Gly Asn Met Pro
740 745 750
Ala Ala Cys Met Glu Glu Ala Leu Ala Gln Ala Arg Val Asn Pro Lys
755 760 765
Asp Val Glu Met Leu Glu Leu Ser Ala Asp Ser Ala Arg His Leu Lys
770 775 780
Asn Pro Ser Val Leu Pro Lys Glu Leu Thr Ala Glu Glu Glu Ile Arg
785 790 795 800
Gly Ile Glu Ala Ile Leu Ser Gln Arg Ser Ser Asn Glu Ala Val Glu
805 810 815
Pro His Asn Val Ala Val Ser Ser Val Lys Ser Thr Val Gly Asp Thr
820 825 830
Gly Tyr Ala Ser Gly Ala Ala Ser Leu Ile Lys Thr Ala Leu Cys Leu
835 840 845
Tyr Asn Arg Tyr Leu Pro Ser Asn Gly Ala Ser Trp Glu Glu Pro Ala
850 855 860
Pro Glu Thr Gln Trp Gly Lys Ser Leu Tyr Ala Cys Gln Ser Ser Arg
865 870 875 880
Ala Trp Leu Lys Asn Pro Gly Ala Arg Arg His Ala Ala Val Ser Gly
885 890 895
Val Ser Glu Thr Arg Ser Cys Tyr Thr Val Leu Leu Ser Asp Val Glu
900 905 910
Gly His His Glu Thr Lys Ser Arg Ile Ser Leu Asp Asp Asp Ala Val
915 920 925
Lys Leu Leu Val Ile Arg Gly Asp Ser His Asp Ala Ile Thr Gln Arg
930 935 940
Val Asp Lys Leu Arg Glu Arg Leu Ala Gln Pro Ser Ala Asn Val Arg
945 950 955 960
Leu Ala Phe Met Glu Leu Leu Gly Glu Ser Ile Ala Gln Glu Thr Lys
965 970 975
Thr Pro Leu Pro Ala Phe Ala Leu Cys Leu Val Thr Ser Pro Ser Lys
980 985 990
Leu Gln Lys Glu Leu Glu Leu Ala Ser Lys Gly Ile Pro Arg Ser Leu
995 1000 1005
Lys Met Gly Arg Asp Trp Thr Ser Pro Ser Gly Ser His Phe Ala Pro
1010 1015 1020
Lys Pro Leu Ser Ser Asp Arg Val Ala Phe Met Tyr Gly Glu Gly Arg
1025 1030 1035 1040
Ser Pro Tyr Tyr Gly Ile Gly Leu Asp Ile His Arg Ile Trp Pro Glu
1045 1050 1055
Leu His Glu Phe Val Asn Ala Lys Thr Asn Lys Leu Trp Asp Gln Gly
1060 1065 1070
Asp Arg Trp Leu Ile Pro Arg Ala Ser Thr Lys Glu Glu Leu Lys Ala
1075 1080 1085
Gln Glu Asp Glu Phe Asn Arg Asn Gln Val Glu Met Phe Arg Leu Gly
1090 1095 1100
Ile Leu Met Ser Met Cys Phe Thr His Ile Ala Arg Asp Val Leu Gly
1105 1110 1115 1120
Ile Gln Pro Lys Ala Ala Phe Gly Leu Ser Leu Gly Glu Ile Ser Met
1125 1130 1135
Val Phe Ala Phe Ser Glu Lys Asn Gly Leu Val Ser Glu Glu Leu Thr
1140 1145 1150
Thr Lys Leu Arg Asn Ser Glu Val Trp Arg Lys Ala Leu Ala Val Glu
1155 1160 1165
Phe Asp Ala Leu Arg Lys Ala Trp Asn Ile Pro Gln Asp Thr Pro Val
1170 1175 1180
Ser Glu Phe Trp Gln Gly Tyr Val Val Arg Gly Thr Arg Glu Ala Val
1185 1190 1195 1200
Glu Ala Ala Ile Gly Pro Asn Asn Lys Tyr Val His Leu Thr Ile Val
1205 1210 1215
Asn Asp Ala Asn Ser Ala Leu Ile Ser Gly Lys Pro Glu Asp Cys Lys
1220 1225 1230
Ala Ala Ile Ala Arg Leu Ser Ser Asn Leu Pro Ala Leu Pro Val Asp
1235 1240 1245
Leu Gly Met Cys Gly His Cys Pro Val Val Glu Pro Tyr Gly Lys Gln
1250 1255 1260
Ile Ala Glu Ile His Ser Val Leu Glu Ile Pro Glu Val Ala Gly Leu
1265 1270 1275 1280
Asp Leu Tyr Thr Ser Val Asn Gln Lys Lys Leu Val Asn Lys Ser Thr
1285 1290 1295
Gly Ala Ser Asp Glu Tyr Ala Pro Ser Phe Gly Glu Tyr Ala Ala Gln
1300 1305 1310
Leu Tyr Thr Val Gln Ala Asp Phe Pro Lys Ile Ala Lys Thr Val Ser
1315 1320 1325
Asp Lys Asn Phe Asp Val Phe Val Glu Thr Gly Pro Asn Ala His Arg
1330 1335 1340
Ser Ala Ala Ile Arg Ala Thr Leu Gly Asn Ser Lys Pro Phe Val Thr
1345 1350 1355 1360
Gly Ser Met Asp Arg Gln Asn Glu Asn Ala Trp Thr Thr Met Val Lys
1365 1370 1375
Leu Val Ala Ser Leu Gln Ala His Arg Val Pro Gly Val Lys Val Ser
1380 1385 1390
Pro Leu Tyr His Pro Glu Thr Val Glu Glu Ala Thr Gln Ser Tyr Asn
1395 1400 1405
Asp Met Val Ala Gly Lys Lys Pro Thr Lys Asn Lys Phe Leu Arg Lys
1410 1415 1420
Ile Val Val Asn Gly Arg Tyr Asp Pro Lys Lys Gln Leu Val Pro Pro
1425 1430 1435 1440
Gln Val Leu Ala Lys Leu Pro Pro Ala Asp Pro Lys Ile Glu Ala Leu
1445 1450 1455
Ile Gln Ala Arg Lys Met Gln Pro Ile Ala Pro Lys Phe Met Glu Arg
1460 1465 1470
Leu Asp Ile Gln Glu Gln Asp Ala Thr Arg Asp Pro Ile Leu Asn Lys
1475 1480 1485
Asp Asn Lys Pro Ser Ala Ala Pro Ala Leu Ala Pro Ala Ala Pro Ala
1490 1495 1500
Arg Ser Val Ser Gly Ala Val Val Ala Ser Ser Glu Ala Leu Arg Ala
1505 1510 1515 1520
Lys Leu Leu Glu Leu Asn Ser Thr Leu Met Leu Gly Val Asn Ala Asn
1525 1530 1535
Gly Asp Leu Val Glu Ala Ser Pro Ser Glu Ala Ser Ile Val Val Pro
1540 1545 1550
Lys Cys Asp Ile Lys Asp Leu Gly Ser Arg Ala Phe Met Glu Thr Tyr
1555 1560 1565
Gly Val Ser Ala Pro Met Tyr Thr Gly Ala Met Ala Lys Gly Ile Ala
1570 1575 1580
Ser Ala Glu Met Val Ile Ala Ala Gly Lys Arg Gly Ile Leu Gly Ser
1585 1590 1595 1600
Leu Gly Ala Gly Gly Leu Pro Ile Ala Thr Val Arg Lys Ala Leu Glu
1605 1610 1615
Ala Ile Gln Ala Glu Leu Pro Lys Gly Pro Tyr Ala Val Asn Leu Ile
1620 1625 1630
His Ser Pro Phe Asp Ser Asn Leu Glu Lys Gly Asn Val Asp Leu Phe
1635 1640 1645
Leu Glu Lys Gly Val Thr Val Val Glu Ala Ser Ala Phe Met Thr Leu
1650 1655 1660
Thr Pro Gln Leu Val Arg Tyr Arg Ala Ala Gly Leu Ser Arg Ala Ala
1665 1670 1675 1680
Asp Gly Ser Thr Val Ile Lys Asn Arg Val Ile Gly Lys Val Ser Arg
1685 1690 1695
Thr Glu Leu Ala Ala Met Phe Ile Arg Pro Ala Pro Glu Asn Leu Leu
1700 1705 1710
Glu Lys Leu Leu Lys Ser Gly Glu Ile Thr Gln Glu Gln Ala Ala Leu
1715 1720 1725
Ala Arg Thr Val Pro Val Ala Asp Asp Ile Ala Val Glu Ala Asp Ser
1730 1735 1740
Gly Gly His Thr Asp Asn Arg Pro Ile His Val Ile Leu Pro Leu Ile
1745 1750 1755 1760
Val Asn Leu Arg Asp Arg Leu His Lys Glu Cys Gly Tyr Pro Ala His
1765 1770 1775
Leu Arg Val Arg Val Gly Ala Gly Gly Gly Ile Gly Cys Pro Gln Ala
1780 1785 1790
Ala Ile Ala Thr Phe Asn Met Gly Ala Ala Phe Ile Val Thr Gly Thr
1795 1800 1805
Val Asn Gln Met Ser Lys Gln Ala Gly Thr Cys Asp Thr Val Arg Lys
1810 1815 1820
Gln Leu Ser Gln Ala Thr Tyr Ser Asp Ile Cys Met Ala Pro Ala Ala
1825 1830 1835 1840
Asp Met Phe Glu Glu Gly Val Lys Leu Gln Val Leu Lys Lys Gly Thr
1845 1850 1855
Met Phe Pro Ser Arg Ala Asn Lys Leu Tyr Glu Leu Phe Val Lys Tyr
1860 1865 1870
Asp Ser Phe Glu Ser Met Ala Pro Gly Glu Leu Glu Arg Val Glu Lys
1875 1880 1885
Arg Ile Phe Lys Lys Ser Leu Ser Glu Val Trp Glu Glu Thr Lys Asp
1890 1895 1900
Phe Tyr Ile Asn Arg Leu Gln Asn Pro Glu Lys Ile Glu Arg Ala Glu
1905 1910 1915 1920
Arg Asp Pro Lys Leu Lys Met Ser Leu Cys Phe Arg Trp Tyr Leu Gly
1925 1930 1935
Leu Ala Ser Phe Trp Ala Asn Ala Gly Ile Pro Asp Arg Ala Met Asp
1940 1945 1950
Tyr Gln Val Trp Cys Gly Pro Ala Ile Gly Ser Phe Asn Asp Phe Ile
1955 1960 1965
Lys Gly Thr Tyr Leu Asp Pro Ala Val Ala Asn Glu Tyr Pro Asp Val
1970 1975 1980
Val Gln Ile Asn Leu Gln Ile Leu Arg Gly Ala Cys Phe Leu Arg Arg
1985 1990 1995 2000
Leu Glu Ala Val Arg Asn Ala Pro Leu Lys Ala Asn Ala Lys Gln Val
2005 2010 2015
Ala Ala Glu Ile Asp Asp Ile Tyr Val Pro Thr Glu Arg Leu
2020 2025 2030
<210> 8
<211> 1465
<212> PRT
<213> Ulkenia sp.
<400> 8
Met Ala Thr Arg Val Lys Thr Asn Lys Lys Pro Cys Trp Glu Met Thr
1 5 10 15
Lys Glu Glu Leu Thr Ser Gly Lys Asn Val Val Phe Asp Tyr Asp Glu
20 25 30
Leu Leu Glu Phe Ala Glu Gly Asp Ile Ser Lys Val Phe Gly Pro Glu
35 40 45
Phe Ser Gln Ile Asp Gln Tyr Lys Arg Arg Val Arg Leu Pro Ala Arg
50 55 60
Glu Tyr Leu Leu Val Thr Arg Val Thr Leu Met Asp Ala Glu Val Asn
65 70 75 80
Asn Tyr Arg Val Gly Ala Arg Met Val Thr Glu Tyr Asp Leu Pro Val
85 90 95
Asn Gly Glu Leu Ser Glu Gly Gly Asp Cys Pro Trp Ala Val Leu Val
100 105 110
Glu Ser Gly Gln Cys Asp Leu Met Leu Ile Ser Tyr Met Gly Ile Asp
115 120 125
Phe Gln Asn Lys Ser Asp Arg Val Tyr Arg Leu Leu Asn Thr Thr Leu
130 135 140
Thr Phe Tyr Gly Val Ala Gln Glu Gly Glu Thr Leu Glu Tyr Asp Ile
145 150 155 160
Arg Val Thr Gly Phe Ala Lys Arg Leu Asp Gly Asp Ile Ser Met Phe
165 170 175
Phe Phe Glu Tyr Asp Cys Tyr Val Asn Gly Arg Leu Leu Ile Glu Met
180 185 190
Arg Asp Gly Cys Ala Gly Phe Phe Thr Asn Glu Glu Leu Ala Ala Gly
195 200 205
Lys Gly Val Val Phe Thr Arg Ala Asp Leu Leu Ala Arg Glu Lys Thr
210 215 220
Lys Lys Gln Asp Ile Thr Pro Tyr Ala Ile Ala Pro Arg Leu Asn Lys
225 230 235 240
Thr Val Leu Asn Glu Thr Glu Met Gln Ser Leu Val Asp Lys Asn Trp
245 250 255
Thr Lys Val Phe Gly Pro Glu Asn Gly Met Asp Gln Ile Asn Tyr Lys
260 265 270
Leu Cys Ala Arg Lys Met Leu Met Ile Asp Arg Val Thr Lys Ile Asp
275 280 285
Tyr Thr Gly Gly Pro Tyr Gly Leu Gly Leu Leu Val Gly Glu Lys Ile
290 295 300
Leu Glu Arg Asp His Trp Tyr Phe Pro Cys His Phe Val Gly Asp Gln
305 310 315 320
Val Met Ala Gly Ser Leu Val Ser Asp Gly Cys Ser Gln Leu Leu Lys
325 330 335
Met Tyr Met Leu Trp Leu Gly Leu His Leu Lys Thr Gly Pro Phe Asp
340 345 350
Phe Arg Pro Val Asn Gly His Pro Asn Lys Val Arg Cys Arg Gly Gln
355 360 365
Ile Ser Pro His Lys Gly Lys Leu Val Tyr Val Met Glu Ile Lys Glu
370 375 380
Met Gly Tyr Asp Glu Ala Gly Asp Pro Tyr Ala Ile Ala Asp Val Asn
385 390 395 400
Ile Leu Asp Ile Asp Phe Glu Lys Gly Gln Thr Phe Asp Leu Ala Asn
405 410 415
Leu His Glu Tyr Gly Lys Gly Asp Leu Asn Lys Lys Ile Val Val Asp
420 425 430
Phe Lys Gly Ile Ala Leu Lys Leu Gln Lys Arg Ser Gly Pro Ala Val
435 440 445
Val Ala Pro Glu Lys Pro Leu Ala Leu Asn Lys Asp Leu Cys Ala Pro
450 455 460
Ala Val Glu Ala Ile Pro Glu His Ile Leu Lys Gly Asp Ala Leu Ala
465 470 475 480
Pro Asn Gln Met Thr Trp His Pro Met Ser Lys Ile Ala Gly Asn Pro
485 490 495
Thr Pro Ser Phe Ser Pro Ser Ala Tyr Pro Pro Arg Pro Ile Thr Phe
500 505 510
Thr Pro Phe Pro Gly Asn Lys Asn Asp Asn Asn His Val Pro Gly Glu
515 520 525
Met Pro Leu Ser Trp Tyr Asn Met Ala Glu Phe Met Ala Gly Lys Val
530 535 540
Ser Leu Cys Leu Gly Pro Glu Phe Ala Lys Phe Asp Asp Ser Asn Thr
545 550 555 560
Ser Arg Ser Pro Ala Trp Asp Leu Ala Leu Val Thr Arg Val Val Ser
565 570 575
Val Ser Asp Met Glu Trp Val Gln Trp Lys Asn Val Asp Cys Asn Pro
580 585 590
Ser Lys Gly Thr Met Val Gly Glu Phe Asp Cys Pro Ile Asp Ala Trp
595 600 605
Phe Phe Gln Gly Ser Cys Asn Asp Gly His Met Pro Tyr Ser Ile Leu
610 615 620
Met Glu Ile Ala Leu Gln Thr Ser Gly Val Leu Thr Ser Val Leu Lys
625 630 635 640
Ala Pro Leu Thr Met Glu Lys Lys Asp Ile Leu Phe Arg Asn Leu Asp
645 650 655
Ala Asn Ala Glu Met Val Arg Ser Asp Ile Asp Leu Arg Gly Lys Thr
660 665 670
Ile His Asn Leu Thr Lys Cys Thr Gly Tyr Ser Met Leu Gly Asp Met
675 680 685
Gly Val His Arg Phe Ser Phe Glu Leu Ser Val Asp Gly Val Val Phe
690 695 700
Tyr Lys Gly Thr Thr Ser Phe Gly Trp Phe Val Pro Glu Val Phe Ile
705 710 715 720
Ser Gln Thr Gly Leu Asp Asn Gly Arg Arg Thr Gln Pro Trp His Ile
725 730 735
Glu Ser Lys Val Pro Ser Ala Gln Val Leu Thr Tyr Asp Val Thr Pro
740 745 750
Asn Gly Ala Gly Arg Thr Gln Leu Tyr Ala Asn Ala Pro Lys Gly Ala
755 760 765
Gln Leu Thr Arg Arg Trp Asn Gln Cys Gln Tyr Leu Asp Thr Ile Asp
770 775 780
Leu Val Val Ala Gly Gly Ser Ala Gly Leu Gly Tyr Gly His Gly Arg
785 790 795 800
Lys Gln Val Asn Pro Lys Asp Trp Phe Phe Ser Cys His Phe Trp Phe
805 810 815
Asp Ser Val Met Pro Gly Ser Leu Gly Val Glu Ser Met Phe Gln Leu
820 825 830
Val Glu Ser Ile Ala Val Lys Gln Asp Leu Ala Gly Lys Tyr Gly Ile
835 840 845
Thr Asn Pro Thr Phe Ala His Ala Pro Gly Lys Ile Ser Trp Lys Tyr
850 855 860
Arg Gly Gln Leu Thr Pro Thr Ser Lys Phe Met Asp Ser Glu Ala His
865 870 875 880
Ile Val Ser Ile Glu Ala His Asp Gly Val Val Asp Ile Val Ala Asn
885 890 895
Gly Asn Leu Trp Ala Asp Gly Leu Arg Val Tyr Asn Val Ser Asn Ile
900 905 910
Arg Val Arg Ile Val Ala Gly Ala Ala Pro Ala Ala Ala Ala Ala Ala
915 920 925
Ala Ala Val Ala Ala Pro Ala Ala Ala Pro Ala Pro Val Ala Ala Ser
930 935 940
Gly Pro Ala Gln Thr Ile Thr Leu Lys Gln Leu Lys Ala Glu Leu Leu
945 950 955 960
Asp Val Glu Lys Pro Leu Tyr Ile Ser Ser Ser Asn Gly Gln Val Lys
965 970 975
Lys His Ala Asp Val Ala Gly Gly Gln Ala Thr Ile Val Gln Ala Cys
980 985 990
Ser Leu Ser Asp Leu Gly Asp Glu Gly Phe Met Lys Thr Tyr Gly Val
995 1000 1005
Val Ala Pro Leu Tyr Thr Gly Ala Met Ala Lys Gly Ile Ala Ser Ala
1010 1015 1020
Asp Leu Val Ile Ala Thr Gly Lys Arg Lys Ile Leu Gly Ser Phe Gly
1025 1030 1035 1040
Ala Gly Gly Leu Pro Met His Ile Val Arg Ala Ala Val Glu Lys Ile
1045 1050 1055
Gln Ala Glu Leu Pro Asn Gly Pro Phe Ala Val Asn Leu Ile His Ser
1060 1065 1070
Pro Phe Asp Ser Asn Leu Glu Lys Gly Asn Val Asp Leu Phe Leu Glu
1075 1080 1085
Lys Gly Val Thr Val Val Glu Ala Ser Ala Phe Met Thr Leu Thr Pro
1090 1095 1100
Gln Val Val Arg Tyr Arg Ala Ala Gly Leu Ser Arg Asn Ala Asp Gly
1105 1110 1115 1120
Ser Ile Asn Ile Lys Asn Arg Ile Ile Gly Lys Val Ser Arg Thr Glu
1125 1130 1135
Leu Ala Glu Met Phe Ile Arg Pro Ala Pro Gln Asn Leu Leu Asp Lys
1140 1145 1150
Leu Ile Gln Ser Gly Glu Ile Thr Lys Glu Gln Ala Glu Leu Ala Lys
1155 1160 1165
Leu Val Pro Val Ala Asp Asp Ile Ala Val Glu Ala Asp Ser Gly Gly
1170 1175 1180
His Thr Asp Asn Arg Pro Ile His Val Ile Leu Pro Leu Ile Ile Asn
1185 1190 1195 1200
Leu Arg Asn Arg Leu His Lys Glu Cys Gly Tyr Pro Ala His Leu Arg
1205 1210 1215
Val Arg Val Gly Ala Gly Gly Gly Val Gly Cys Pro Gln Ala Ala Ala
1220 1225 1230
Ala Ala Leu Ala Met Gly Ala Ala Phe Leu Val Thr Gly Thr Val Asn
1235 1240 1245
Gln Val Ala Lys Gln Ser Gly Thr Cys Asp Asn Val Arg Lys Gln Leu
1250 1255 1260
Cys Met Ala Thr Tyr Ser Asp Val Cys Met Ala Pro Ala Ala Asp Met
1265 1270 1275 1280
Phe Glu Glu Gly Val Lys Leu Gln Val Leu Lys Lys Gly Thr Met Phe
1285 1290 1295
Pro Ser Arg Ala Asn Lys Leu Tyr Glu Leu Phe Cys Lys Tyr Asp Ser
1300 1305 1310
Phe Glu Ser Met Pro Ala Thr Glu Leu Glu Arg Val Glu Lys Arg Ile
1315 1320 1325
Phe Gln Cys Pro Leu Ala Asp Val Trp Ala Glu Thr Ser Asp Phe Tyr
1330 1335 1340
Ile Asn Arg Leu His Asn Pro Glu Lys Ile Thr Arg Ala Glu Arg Asp
1345 1350 1355 1360
Pro Lys Leu Lys Met Ser Leu Cys Phe Arg Trp Tyr Leu Gly Leu Ala
1365 1370 1375
Ser Arg Trp Ala Asn Thr Gly Glu Ala Gly Arg Val Met Asp Tyr Gln
1380 1385 1390
Val Trp Cys Gly Pro Ala Ile Gly Ala Phe Asn Asp Phe Ile Lys Gly
1395 1400 1405
Ser Tyr Leu Asp Pro Ala Val Ser Gly Glu Tyr Pro Asp Val Val Gln
1410 1415 1420
Ile Asn Leu Gln Ile Leu Arg Gly Ala Cys Tyr Leu Arg Arg Leu Asn
1425 1430 1435 1440
Val Ile Arg Asn Asp Pro Arg Val Ser Ile Glu Val Glu Asp Ala Glu
1445 1450 1455
Phe Val Tyr Glu Pro Thr Asn Ala Leu
1460 1465
<210> 9
<211> 5547
<212> DNA
<213> Ulkenia sp.
<400> 9
atgcttgtga taggggctct ggcgcgggct ctgtacggtg cttggagatg cacgggcagg 60
gcgagagagg ggacgggttc ccgggaggcg ctgcttggag gtgctgagag ggagggagaa 120
ggcgtgcttt gcgatgcgcg gggcgaccta ggcgctgctg cgcggtgcag cagcagggac 180
ctcggacgtg agtcgaagcc gtctgcagag gagatggtag aagggccgcg gattggtagc 240
agagaagagg aaatagaaga agaagaagaa atagaagaag aagaaataga agaagaagaa 300
atagaagaag aagaggagga cgggcaggcg ggaaagatgg agaaaggact cgcggcggga 360
aaacaagaga atgtgaactt gggcttgaac tttggtttga atttgaatgt ggagaacgag 420
gggttgaatt tgagtttgaa tttgaaagaa aacttacgga aagaaagttt agttgaaagt 480
gagaaagaaa aaaatgagaa agaaaaagag aaagaaaaag agaaagaaaa agagaaagaa 540
aaagagaaag aaaaagagaa agaaaaagag aaagaaaaag agaaagaaaa agagaaagaa 600
aaagagaaag aaaaagagaa agaaaaagag aaagaaaaag aagaagaaaa agaagaagaa 660
aaagagaaag aaaaagagaa agaaaaagag aaagaaaaag aagaaggaga tttaaaaagt 720
tgtttagttg aaaaaggaga aggaggaaga agcagcgaca gcggcagaag aagaagtagt 780
tgttgtaaga ggggaacgga ggcagtagca gtggagcagg cggaggcgac agcaaacctc 840
gaactcgacc ccgtcgagcc gcagcaagaa caagagcccg accaggtgga cgaggacgag 900
gtccgcttgt tgtcaggaac aacagaagtt gcaggactag ccgagagtgc taccactgca 960
attcttagat ccacagacgc aagagcagaa aacttacaac tgctcgccac aacacaagaa 1020
ccaccttcag atacaaccag gttcgagaac tccacaagtc tagaagcagc aacagctcta 1080
gcagataatc aaacaggtcc agaaaaagct acgactagaa gagaaattat cgagtcgcaa 1140
cttgcaacca tggccactcg cgtgaagacc aacaagaaac catgctggga gatgaccaag 1200
gaggagctca ccagcggcaa gaacgtcgtt ttcgactatg acgagctcct tgagttcgcc 1260
gagggtgaca tcagcaaggt cttcggcccc gaattcagcc agatcgacca gtacaagcgt 1320
cgcgttcgtc tccccgcccg cgagtacctc ctcgtcaccc gcgtcaccct catggacgcc 1380
gaggtcaaca actaccgcgt cggtgcccgc atggtcactg agtacgacct ccccgtcaac 1440
ggtgagctct ctgagggtgg tgactgcccc tgggccgtgc tcgtcgagag tggtcagtgt 1500
gatctcatgc tcatctccta catgggtatt gacttccaga acaagagcga ccgcgtctac 1560
cgtctgctca acaccaccct caccttctac ggtgttgccc aggagggcga gaccctggag 1620
tacgacatcc gcgtgaccgg cttcgccaag cgtctcgacg gtgacatctc catgttcttc 1680
ttcgagtacg actgctacgt caacggccgt ctcctcatcg agatgcgcga cggctgtgcc 1740
ggtttcttca ccaacgagga gctcgccgcc ggcaagggtg tcgtctttac ccgcgctgat 1800
ctcctcgccc gcgagaagac caagaagcag gacatcaccc cgtacgccat tgccccgcgt 1860
cttaacaaga ccgttctcaa cgagactgag atgcagtccc tcgtggacaa gaactggacc 1920
aaggttttcg gccccgagaa cggcatggac cagatcaact acaaactctg cgcccgtaag 1980
atgctcatga ttgaccgcgt caccaagatt gactacaccg gtggccccta cggccttggt 2040
cttctcgttg gtgagaagat cctcgagcgc gaccactggt actttccgtg ccacttcgtc 2100
ggagaccagg tcatggctgg atccctcgtg tctgacggct gcagccagct cctcaagatg 2160
tacatgctct ggctcggcct ccaccttaag accggtccct tcgacttccg ccccgtcaac 2220
ggccacccca acaaggtccg ctgccgtggc cagatctccc cgcacaaggg taagctcgta 2280
tacgtcatgg agatcaagga gatgggctac gacgaggctg gtgacccgta cgccatcgcc 2340
gatgtcaaca ttctcgacat tgacttcgag aagggccaga ctttcgacct tgccaacctc 2400
cacgagtacg gcaagggcga cctcaacaag aagatcgtcg tcgacttcaa gggtattgcc 2460
ctcaagctcc agaagcgctc tggccctgcc gttgtcgctc ccgagaagcc cctcgctctc 2520
aacaaggacc tttgcgcccc ggctgttgag gccatccctg agcacatcct caagggcgat 2580
gctcttgccc ctaaccagat gacctggcac ccgatgtcca agatcgctgg caaccccacg 2640
ccctcgttct ctccctcggc ctaccctccc cgtcccatca ccttcacccc gttccccggc 2700
aacaagaacg acaacaacca cgtgcccggc gagatgccgc tctcgtggta caacatggct 2760
gagttcatgg ccggcaaggt cagcctctgc ctcggccctg agttcgccaa gttcgatgac 2820
tccaacacca gccgcagccc tgcatgggac cttgctcttg tgactcgtgt ggtctccgtt 2880
tctgacatgg agtgggtcca gtggaagaac gtggactgca acccgtccaa gggaaccatg 2940
gttggcgagt tcgactgccc catcgacgcc tggttcttcc agggatcttg taacgacggc 3000
cacatgccgt actccatcct catggagatc gccctccaga cctctggtgt cctcacctct 3060
gtgctcaagg ccccgctcac catggagaag aaggacattc tcttccgcaa ccttgacgcc 3120
aacgccgaga tggttcgctc tgatattgac ctccgcggca agaccatcca caacctcacc 3180
aagtgtaccg gctacagcat gctcggagac atgggtgtcc accgcttcag cttcgagctc 3240
tctgttgatg gtgtagtctt ctacaagggt accacctcct tcggctggtt cgtccctgag 3300
gtcttcatct cccagactgg tctcgacaac ggtcgccgca cccagccctg gcacattgag 3360
tccaaggtgc cttccgccca ggtcctcacc tacgacgtta cccccaacgg tgccggtcgc 3420
acccagctct acgccaacgc ccccaagggc gctcagctca ctcgccgctg gaaccagtgc 3480
cagtaccttg acaccatcga ccttgtggtc gccggtggct ccgccggtct tggctacggt 3540
catggccgca agcaggtgaa ccccaaggac tggttcttct cgtgccactt ctggttcgac 3600
tccgtcatgc ccggctcgct cggtgtggag tctatgttcc agctcgtcga gtccatcgct 3660
gtcaagcagg acctcgccgg caagtacggc atcaccaacc cgaccttcgc tcatgctccg 3720
ggcaagatct cctggaagta ccgtggtcag ctcaccccca cctccaagtt catggactcc 3780
gaggcccaca ttgtctccat cgaggcccac gacggcgtcg tcgacatcgt tgccaatggt 3840
aacctctggg ctgatggcct ccgcgtctac aacgtcagca acatccgtgt gcgcattgtt 3900
gctggcgccg cccctgctgc tgctgctgct gctgctgctg ttgctgctcc ggctgccgcc 3960
cctgctccgg ttgctgcatc tggccctgcc cagaccatca ccctcaagca gctcaaggct 4020
gagcttcttg acgttgagaa gcctctctac atctcctcca gcaacggcca ggtcaagaag 4080
cacgccgatg tggctggtgg ccaggccacc attgtgcagg cttgcagcct cagtgacctc 4140
ggtgatgaag gcttcatgaa gacctacggt gttgtggctc ctctctacac cggtgccatg 4200
gccaagggta ttgcctctgc tgaccttgtg attgccactg gtaagcgcaa gatcctcggt 4260
tccttcggtg ctggcggtct ccccatgcac attgtccgtg ccgctgttga gaagatccag 4320
gctgagctcc cgaacggccc cttcgccgtc aacctcatcc actccccctt cgatagcaac 4380
cttgagaagg gcaacgttga cctcttcctc gagaagggcg ttactgtcgt cgaggcctcc 4440
gccttcatga ccttgacccc gcaagtcgtc cgctaccgtg ctgctggtct ttcccgtaac 4500
gctgatggct ccattaacat caagaaccgc atcatcggta aggtctcccg taccgagctc 4560
gctgagatgt tcatccgccc tgccccgcag aacctcctcg acaagctcat ccagtctggt 4620
gagattacca aggagcaggc tgagcttgcc aagctcgtcc ccgtcgccga cgacatcgcc 4680
gtcgaggccg actctggtgg ccacaccgac aaccgcccca tccacgtcat cctccccctt 4740
atcatcaacc tccgcaaccg cctccacaag gagtgcggct accccgctca cctccgcgtg 4800
cgcgttggag ctggtggtgg tgttggatgc ccccaggccg ctgccgctgc tctcgctatg 4860
ggtgctgcct tccttgttac cggcactgtc aaccaggtcg ccaagcagtc cggcacctgc 4920
gacaatgtcc gcaagcagct ctgcatggcc acctactctg acgtctgcat ggctcccgct 4980
gctgacatgt tcgaggaggg cgtcaagctc caggtcctca agaagggaac catgttcccg 5040
tccagggcta acaagctcta cgagctcttc tgcaagtacg actccttcga gtccatgcct 5100
gccacagagc tcgagcgtgt tgagaagcgc atcttccagt gccctcttgc tgatgtctgg 5160
gctgagacct ccgacttcta catcaaccgc ctccacaacc cggagaagat cacccgtgcc 5220
gagcgtgacc ccaagctcaa gatgtctctc tgcttccgct ggtaccttgg tcttgcctct 5280
cgctgggcca acaccggtga ggctggacgc gtcatggact accaggtctg gtgtggccct 5340
gccattggag ccttcaacga cttcatcaag ggctcctacc ttgacccggc cgtctctggt 5400
gagtacccgg acgtcgtgca gatcaacttg cagatccttc gcggtgcctg ctacctccgc 5460
cgtctcaatg tcatccgcaa cgacccgcgt gtcagcattg aggtcgagga tgctgagttc 5520
gtctacgagc ccaccaacgc cctctaa 5547
<210> 10
<211> 837
<212> DNA
<213> Ulkenia sp.
<400> 10
acccgcatcg ctgtgatcgg catgtccgcc atcctcccct gcggtaccac cgttcgtgag 60
tcttgggagg ctatccgcga tggtatcgac tgcctcagtg atctccccga ggaccgcgtc 120
gatgtgaccg cctacttcga cccggtcaag accaccaagg ataagatcta ctgcaaacgt 180
ggtggattca tccctgagta cgacttcgac gcccgtgagt tcggcctcaa catgtttcag 240
atggaggact ccgacgcaaa ccaaaccgtc accctcctca aggtcaagga ggccctcgag 300
gacgctggca tcgaagccct cagcaaggaa aagaagaaca ttggatgtgt tctcggtatc 360
ggtggtggcc agaagtccag ccacgagttc tactcccgct taaactatgt tgtcgttgag 420
aaggtccttc gcaagatggg catgcctgag gaggatgttc aagctgctgt tgagaagtac 480
aaggccaact tccctgagtg gcgccttgac tccttccccg gtttcctcgg caacgttact 540
gccggtcgct gtaccaacac cttcaacctc gatggtatga actgtgtcgt cgatgctgcc 600
tgtgctagtt ctctcatcgc cgttaaggtt gccattgatg agcttctcca cggagactgt 660
gacatgatga tcactggtgc tacctgcacg gataactcca tcggtatgta catggccttc 720
tccaagaccc cggtgttctc taccgaccct agcgtccgcg catacgatga gaagaccaag 780
ggtatgctta ttggcgaagg ctctgccatg cttgtgctta aacgttacgc cgacgct 837
<210> 11
<211> 51
<212> DNA
<213> Ulkenia sp.
<400> 11
ggtatgaact gtgtcgtcga tgctgcctgt gctagttctc tcatcgccgt t 51
<210> 12
<211> 12
<212> DNA
<213> Ulkenia sp.
<400> 12
gatgctgcct gt 12
<210> 13
<211> 522
<212> DNA
<213> Ulkenia sp.
<400> 13
cacgctgtca ttcgcggctg cgcctcttcc tctgacggta aggcctccgg tatttacacc 60
ccgaccatct ctggtcaaga ggaggctctt cgccgtgcct acatgcgcgc taacgtcgat 120
cccgccaccg tcactcttgt tgagggccac ggtaccggta cccccgttgg tgaccgtatt 180
gagctcaccg ctctccgtaa cctcttcgac agtgcctacg gcaacgagaa ggagaaggtc 240
gctgttggca gcattaagtc caacatcggt cacctcaagg ctgtcgccgg tcttgccggt 300
atgatcaagg tcatcatggc cctcaagcat aagactcttc cggccaccat caacgttgat 360
gagcccccta agctttacga caacactccc atcaccgact catcgctgta cattaacacg 420
atgaaccgtc cgtggttccc tgctccgggt gtgccccgtc gcgctggtat ctccagtttc 480
ggttttggtg gtgccaacta ccacgccgtt cttgaggaag cc 522
<210> 14
<211> 1380
<212> DNA
<213> Ulkenia sp.
<400> 14
acccgcatcg ctgtgatcgg catgtccgcc atcctcccct gcggtaccac cgttcgtgag 60
tcttgggagg ctatccgcga tggtatcgac tgcctcagtg atctccccga ggaccgcgtc 120
gatgtgaccg cctacttcga cccggtcaag accaccaagg ataagatcta ctgcaaacgt 180
ggtggattca tccctgagta cgacttcgac gcccgtgagt tcggcctcaa catgtttcag 240
atggaggact ccgacgcaaa ccaaaccgtc accctcctca aggtcaagga ggccctcgag 300
gacgctggca tcgaagccct cagcaaggaa aagaagaaca ttggatgtgt tctcggtatc 360
ggtggtggcc agaagtccag ccacgagttc tactcccgct taaactatgt tgtcgttgag 420
aaggtccttc gcaagatggg catgcctgag gaggatgttc aagctgctgt tgagaagtac 480
aaggccaact tccctgagtg gcgccttgac tccttccccg gtttcctcgg caacgttact 540
gccggtcgct gtaccaacac cttcaacctc gatggtatga actgtgtcgt cgatgctgcc 600
tgtgctagtt ctctcatcgc cgttaaggtt gccattgatg agcttctcca cggagactgt 660
gacatgatga tcactggtgc tacctgcacg gataactcca tcggtatgta catggccttc 720
tccaagaccc cggtgttctc taccgaccct agcgtccgcg catacgatga gaagaccaag 780
ggtatgctta ttggcgaagg ctctgccatg cttgtgctta aacgttacgc cgacgctgtt 840
cgtgatggtg acgagattca cgctgtcatt cgcggctgcg cctcttcctc tgacggtaag 900
gcctccggta tttacacccc gaccatctct ggtcaagagg aggctcttcg ccgtgcctac 960
atgcgcgcta acgtcgatcc cgccaccgtc actcttgttg agggccacgg taccggtacc 1020
cccgttggtg accgtattga gctcaccgct ctccgtaacc tcttcgacag tgcctacggc 1080
aacgagaagg agaaggtcgc tgttggcagc attaagtcca acatcggtca cctcaaggct 1140
gtcgccggtc ttgccggtat gatcaaggtc atcatggccc tcaagcataa gactcttccg 1200
gccaccatca acgttgatga gccccctaag ctttacgaca acactcccat caccgactca 1260
tcgctgtaca ttaacacgat gaaccgtccg tggttccctg ctccgggtgt gccccgtcgc 1320
gctggtatct ccagtttcgg ttttggtggt gccaactacc acgccgttct tgaggaagcc 1380
1380
<210> 15
<211> 996
<212> DNA
<213> Ulkenia sp.
<400> 15
ctcttctctg gccagggtgc tcagtacacc cacatgttca gcgaggtcgc catgaactgg 60
cctcagttcc gtgagagcat ctctgacatg gatcgtgccc aggctaaggt tgctggcgct 120
gacaaggact acgagcgtgt ctcccaagtc ctctacccgc gtaagcctta taactctgag 180
cccgagcagg accacaagaa gatctccctg acctcatact ctcagccctc taccctcgcc 240
tgcgctcttg gtgcctacga gatcttcaag caggctggtt tcaagcccga cttcgctgcc 300
ggtcactctc tcggtgagtt tgcggccctc tacgctgctg actgcgtcaa ccgtgacgac 360
ctctttgagc tcgtgtgccg tcgtgcccgc atcatgggtg gcaaggatgc acctgctacc 420
cccaagggat gcatggctgc tgtcattgga cccaatgccg agaagatcca gattcgcact 480
gctgatgtct ggctcggcaa ctgcaactcc ccttcgcaga ctgtcatcac cggctctgtt 540
gagggtatca agaaggagtc cgagcttctc cagagtgagg gcttccgtgt tgtccccctc 600
gcctgcgaga gtgccttcca ctcaccgcag atgcaaaacg cctcctctgc cttcaaggat 660
gttctctcca aggttgcctt ccgtcagcct agcgcccaga ccaagctctt cagcaacgtg 720
tctggcgaga cctactccaa caatgcccag gacctcctta aggagcacat gaccagcagt 780
gttaagttca tctctcaggt tcgcaacatg cactctgctg gtgctcgcat ctttgtcgag 840
tttggcccca agcaggtgct ctctaagctt gtttccgaga ccctcaagga cgatccttcc 900
attatcacta tctctgtcaa cccttcctct ggcaaggatg ccgatattca gcttcgcgag 960
gctgctgtgc agctcgttgt tgctggagtc aacctt 996
<210> 16
<211> 3510
<212> DNA
<213> Ulkenia sp.
<400> 16
gcccaggccc agatccagaa ggccaaggcc gatgctgctg aggctgacaa gaagcttgcc 60
gctgctaagg atgaggccaa gcgtgccgcc gcttctgcac ctgtgcagaa gcaggttgac 120
accaccattg ttgataagca ccgtgctatc ctcaagtcta tgcttgctga gcttgactgc 180
tactccactc ctggtgctgt gtccagctct ttccaggcac ctgttgctgc tacccctgct 240
ccggtcgctg cgcctgttgc agctgctcct gctccggctg tcaacaatgc tctccttgcc 300
aaggctgagt ctgttgtcat ggaggttctt gccgccaaga ctggttacga gactgacatg 360
atcgagcccg acatggagct cgagactgag ctcggcattg actctatcaa gcgtgtcgag 420
attctctctg aggtccaggc ccagctcaac gtcgaggcca aggatgttga tgctcttagc 480
cgcacccgca ccgtcggtga ggttgtcaac gccatgaagg ctgagatcgc tggcagctct 540
ggtgctgccg ctgctgcccc ggccccggtt gctgctgctc ccgctgcccc tgcccctgct 600
gtcaacagcg ctcttcttgc caaggctgag actgttgtca tggaggttct tgccgccaag 660
actggttacg agactgacat gattgagccc gacatggagc tcgagactga gctcggcatt 720
gactccatca agcgtgtcga gattctctct gaggttcagg cccagctcaa cgttgaggcc 780
aaggatgttg atgctcttag ccgcacccgc accgttggtg aggttgtcaa cgccatgaag 840
gctgagatcg ctggcagctc tggtgctgcc gctgctgccc cggcccctgt tgctgctgct 900
ccggcgcccg tcgctgccgc tgcccctgct gtcagcagcg ctctccttga gaaggctgag 960
tctgttgtca tggaggttct tgccgccaag actggttacg agactgacat gattgaggcc 1020
gacatggagc tcgagactga gctcggcatt gactccatca agcgtgtcga gattctctct 1080
gaggtccagg cccagctcaa cgtcgaggcc aaggatgtcg atgctcttag ccgcacccgc 1140
accgttggtg aggttgtcaa cgccatgaag gctgagatcg ctggcagctc tggtgctgct 1200
gccccggccc cggtcgctgc ggcccctgct ccggtcgctg ccgctgcccc tgctgtcaac 1260
agcgctcttc ttgagaaggc tgagactgtt gtcatggagg ttcttgccgc caagactggt 1320
tacgagactg acatgatcga gcccgacatg gagctcgaga ctgagctcgg cattgactct 1380
atcaagcgtg tcgagattct ctctgaggtc caggcccagc tcaacgttga ggccaaggat 1440
gttgatgctc ttagccgcac ccgcaccgtt ggtgaggttg tcaacgccat gaaggctgag 1500
atcgctggca gctctggtgc tgccgctgct gccccggccc cggttgctgc tgctcccgct 1560
cccgtcgctg cccctgctgt cagcagcgct ctccttgaga aggctgagtc tgtcgtcatg 1620
gaggttcttg ccgccaagac tggttacgag actgacatga ttgaggccga catggagctc 1680
gagactgagc tcggcattga ctccatcaag cgtgtcgaga ttctctctga ggtccaggcc 1740
cagctcaacg ttgaggccaa ggatgtcgat gctcttagcc gcacccgcac cgttggtgag 1800
gttgtcaacg ccatgaaggc tgagatcgct ggcagctctg gtgctgccgc tgctgccccg 1860
gcccctgttg ctgcctctcc cgctcccgtc gctgccgctg cccctgctgt cagcagcgct 1920
ctccttgaga aggccgaatc tgttgtcatg gaggttctcg ccgccaagac tggttacgag 1980
actgacatga ttgaggctga catggagctc gagactgagc tcggcattga ctctatcaag 2040
cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat 2100
gctcttagcc gcacccgcac cgttggtgag gttgtcaacg ccatgaaggc tgagatcgct 2160
ggcagctctg gtgccgccgc tgctgccccg gccccggttg ctgctgctcc ggcgcccgtc 2220
actgccgctg cccctgctgt cagcagcgct ctccttgaga aggccgaatc tgttgtcatg 2280
gaggttctcg ccgccaagac tggttacgag actgacatga ttgaggccga catggagctc 2340
gagactgagc ttggcattga ctccatcaag cgtgtcgaga ttctctctga ggtccaggct 2400
atgcttaacg tcgaggccaa ggatgttgat gctcttagcc gcacccgcac cgttggtgag 2460
gttgtcaacg ccatgaaggc tgagattgct agcagctctg gtgctgctgc ccctgctccg 2520
gctgctgccg ttgcaccggc ccctgctgct gcccctgctg tcagcagcgc tctccttgag 2580
aaggccgaat ctgttgtcat ggaggttctc gccgccaaga ctggttacga gactgacatg 2640
attgaggccg acatggagct cgagactgag ctcggcattg actctatcaa gcgtgtcgag 2700
attctctctg aggtccaggc tatgcttaac gttgaggcca aggatgttga tgctcttagc 2760
cgcacccgca ccgttggtga ggttgtcaac gccatgaagg ctgagattgc tagcagctct 2820
ggtgctgctg cccctgctcc tgctgctgcc gctgcaccgg cccctgctgc tgcccctgct 2880
gtcagcagcg ctcttcttga gaaggctgag tctgttgtca tggaggttct cgccgccaag 2940
actggttacg agactgacat gattgaggcc gacatggagc tcgagactga gcttggcatt 3000
gactccatca agcgtgtcga gattctctct gaggtccagg ctatgcttaa cgttgaggcc 3060
aaggatgttg atgctcttag ccgcacccgc accgttggtg aggttgtcaa cgccatgaag 3120
gctgagattg ctagcagctc tggtgctgct gcccctgctc ctgctgctgc cgctgcaccg 3180
gcccctgctg ctgcccctgc tgtcagcagc gctcttcttg agaaggctga gtctgttgtc 3240
atggaggttc tcgccgccaa gactggttac gagactgaca tgattgaggc cgacatggag 3300
ctcgagactg agcttggcat tgactccatc aagcgtgtcg agattctctc tgaggtccag 3360
gctatgctta acgttgaggc caaggatgtt gatgctctta gccgcacccg caccgttggt 3420
gaggttgtca acgccatgaa ggctgagatc gctggcagct ctggtgctgc tactgcctct 3480
gcccctgctg ctgcagctgc cgcccctgct 3510
<210> 17
<211> 219
<212> DNA
<213> Ulkenia sp.
<400> 17
ctccttgcca aggctgagtc tgttgtcatg gaggttcttg ccgccaagac tggttacgag 60
actgacatga tcgagcccga catggagctc gagactgagc tcggcattga ctctatcaag 120
cgtgtcgaga ttctctctga ggtccaggcc cagctcaacg tcgaggccaa ggatgttgat 180
gctcttagcc gcacccgcac cgtcggtgag gttgtcaac 219
<210> 18
<211> 219
<212> DNA
<213> Ulkenia sp.
<400> 18
cttcttgcca aggctgagac tgttgtcatg gaggttcttg ccgccaagac tggttacgag 60
actgacatga ttgagcccga catggagctc gagactgagc tcggcattga ctccatcaag 120
cgtgtcgaga ttctctctga ggttcaggcc cagctcaacg ttgaggccaa ggatgttgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210> 19
<211> 219
<212> DNA
<213> Ulkenia sp.
<400> 19
ctccttgaga aggctgagtc tgttgtcatg gaggttcttg ccgccaagac tggttacgag 60
actgacatga ttgaggccga catggagctc gagactgagc tcggcattga ctccatcaag 120
cgtgtcgaga ttctctctga ggtccaggcc cagctcaacg tcgaggccaa ggatgtcgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210> 20
<211> 219
<212> DNA
<213> Ulkenia sp.
<400> 20
cttcttgaga aggctgagac tgttgtcatg gaggttcttg ccgccaagac tggttacgag 60
actgacatga tcgagcccga catggagctc gagactgagc tcggcattga ctctatcaag 120
cgtgtcgaga ttctctctga ggtccaggcc cagctcaacg ttgaggccaa ggatgttgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210> 21
<211> 219
<212> DNA
<213> Ulkenia sp.
<400> 21
ctccttgaga aggctgagtc tgtcgtcatg gaggttcttg ccgccaagac tggttacgag 60
actgacatga ttgaggccga catggagctc gagactgagc tcggcattga ctccatcaag 120
cgtgtcgaga ttctctctga ggtccaggcc cagctcaacg ttgaggccaa ggatgtcgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210> 22
<211> 219
<212> DNA
<213> Ulkenia sp.
<400> 22
ctccttgaga aggccgaatc tgttgtcatg gaggttctcg ccgccaagac tggttacgag 60
actgacatga ttgaggctga catggagctc gagactgagc tcggcattga ctctatcaag 120
cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210> 23
<211> 219
<212> DNA
<213> Ulkenia sp.
<400> 23
ctccttgaga aggccgaatc tgttgtcatg gaggttctcg ccgccaagac tggttacgag 60
actgacatga ttgaggccga catggagctc gagactgagc ttggcattga ctccatcaag 120
cgtgtcgaga ttctctctga ggtccaggct atgcttaacg tcgaggccaa ggatgttgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210> 24
<211> 219
<212> DNA
<213> Ulkenia sp.
<400> 24
ctccttgaga aggccgaatc tgttgtcatg gaggttctcg ccgccaagac tggttacgag 60
actgacatga ttgaggccga catggagctc gagactgagc tcggcattga ctctatcaag 120
cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210> 25
<211> 219
<212> DNA
<213> Ulkenia sp.
<400> 25
cttcttgaga aggctgagtc tgttgtcatg gaggttctcg ccgccaagac tggttacgag 60
actgacatga ttgaggccga catggagctc gagactgagc ttggcattga ctccatcaag 120
cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210> 26
<211> 219
<212> DNA
<213> Ulkenia sp.
<400> 26
cttcttgaga aggctgagtc tgttgtcatg gaggttctcg ccgccaagac tggttacgag 60
actgacatga ttgaggccga catggagctc gagactgagc ttggcattga ctccatcaag 120
cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210> 27
<211> 609
<212> DNA
<213> Ulkenia sp.
<400> 27
aagaagctcg ttggcactat tgctggtgcc cgtgaggttc gttcctcaat tgctaacatt 60
gaggctctcg gtggcaaggc aatctactcc tcttgtgatg tgaactctgc tgctgatgtc 120
gccaaggctg ttcgcgaggc tgaggctcag cttggcgccc gtgtaactgg tgtcgtccac 180
gcttctggtg tccttcgtga ccgcctcatt gagcagaagc gccccgatga gtttgatgct 240
gtcttcggca ccaaggtgac tggtctcgag aacctctttg gtgccattga catggccaac 300
cttaagcacc tcgtcctctt cagctctctt gctggtttcc acggcaacat tggtcagtct 360
gactacgcca tggctaacga ggccctcaac aagatgggtc ttgagctctc tgaccgtgtg 420
tccgtgaagt ctatttgctt cggcccctgg gatggtggca tggttacccc ccagctcaag 480
aagcagttcc agtctatggg tgttcagatc atcccccgtg agggtggtgc cgatactgtg 540
gctcgcattg tcctcggctc ctcccctgct gagatccttg ttggcaactg gaccactccc 600
accaagaag 609
<210> 28
<211> 279
<212> PRT
<213> Ulkenia sp.
<400> 28
Thr Arg Ile Ala Val Ile Gly Met Ser Ala Ile Leu Pro Cys Gly Thr
1 5 10 15
Thr Val Arg Glu Ser Trp Glu Ala Ile Arg Asp Gly Ile Asp Cys Leu
20 25 30
Ser Asp Leu Pro Glu Asp Arg Val Asp Val Thr Ala Tyr Phe Asp Pro
35 40 45
Val Lys Thr Thr Lys Asp Lys Ile Tyr Cys Lys Arg Gly Gly Phe Ile
50 55 60
Pro Glu Tyr Asp Phe Asp Ala Arg Glu Phe Gly Leu Asn Met Phe Gln
65 70 75 80
Met Glu Asp Ser Asp Ala Asn Gln Thr Val Thr Leu Leu Lys Val Lys
85 90 95
Glu Ala Leu Glu Asp Ala Gly Ile Glu Ala Leu Ser Lys Glu Lys Lys
100 105 110
Asn Ile Gly Cys Val Leu Gly Ile Gly Gly Gly Gln Lys Ser Ser His
115 120 125
Glu Phe Tyr Ser Arg Leu Asn Tyr Val Val Val Glu Lys Val Leu Arg
130 135 140
Lys Met Gly Met Pro Glu Glu Asp Val Gln Ala Ala Val Glu Lys Tyr
145 150 155 160
Lys Ala Asn Phe Pro Glu Trp Arg Leu Asp Ser Phe Pro Gly Phe Leu
165 170 175
Gly Asn Val Thr Ala Gly Arg Cys Thr Asn Thr Phe Asn Leu Asp Gly
180 185 190
Met Asn Cys Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala Val
195 200 205
Lys Val Ala Ile Asp Glu Leu Leu His Gly Asp Cys Asp Met Met Ile
210 215 220
Thr Gly Ala Thr Cys Thr Asp Asn Ser Ile Gly Met Tyr Met Ala Phe
225 230 235 240
Ser Lys Thr Pro Val Phe Ser Thr Asp Pro Ser Val Arg Ala Tyr Asp
245 250 255
Glu Lys Thr Lys Gly Met Leu Ile Gly Glu Gly Ser Ala Met Leu Val
260 265 270
Leu Lys Arg Tyr Ala Asp Ala
275
<210> 29
<211> 17
<212> PRT
<213> Ulkenia sp.
<400> 29
Gly Met Asn Cys Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala
1 5 10 15
Val
<210> 30
<211> 4
<212> PRT
<213> Ulkenia sp.
<400> 30
Asp Ala Ala Cys
1
<210> 31
<211> 174
<212> PRT
<213> Ulkenia sp.
<400> 31
His Ala Val Ile Arg Gly Cys Ala Ser Ser Ser Asp Gly Lys Ala Ser
1 5 10 15
Gly Ile Tyr Thr Pro Thr Ile Ser Gly Gln Glu Glu Ala Leu Arg Arg
20 25 30
Ala Tyr Met Arg Ala Asn Val Asp Pro Ala Thr Val Thr Leu Val Glu
35 40 45
Gly His Gly Thr Gly Thr Pro Val Gly Asp Arg Ile Glu Leu Thr Ala
50 55 60
Leu Arg Asn Leu Phe Asp Ser Ala Tyr Gly Asn Glu Lys Glu Lys Val
65 70 75 80
Ala Val Gly Ser Ile Lys Ser Asn Ile Gly His Leu Lys Ala Val Ala
85 90 95
Gly Leu Ala Gly Met Ile Lys Val Ile Met Ala Leu Lys His Lys Thr
100 105 110
Leu Pro Ala Thr Ile Asn Val Asp Glu Pro Pro Lys Leu Tyr Asp Asn
115 120 125
Thr Pro Ile Thr Asp Ser Ser Leu Tyr Ile Asn Thr Met Asn Arg Pro
130 135 140
Trp Phe Pro Ala Pro Gly Val Pro Arg Arg Ala Gly Ile Ser Ser Phe
145 150 155 160
Gly Phe Gly Gly Ala Asn Tyr His Ala Val Leu Glu Glu Ala
165 170
<210> 32
<211> 460
<212> PRT
<213> Ulkenia sp.
<400> 32
Thr Arg Ile Ala Val Ile Gly Met Ser Ala Ile Leu Pro Cys Gly Thr
1 5 10 15
Thr Val Arg Glu Ser Trp Glu Ala Ile Arg Asp Gly Ile Asp Cys Leu
20 25 30
Ser Asp Leu Pro Glu Asp Arg Val Asp Val Thr Ala Tyr Phe Asp Pro
35 40 45
Val Lys Thr Thr Lys Asp Lys Ile Tyr Cys Lys Arg Gly Gly Phe Ile
50 55 60
Pro Glu Tyr Asp Phe Asp Ala Arg Glu Phe Gly Leu Asn Met Phe Gln
65 70 75 80
Met Glu Asp Ser Asp Ala Asn Gln Thr Val Thr Leu Leu Lys Val Lys
85 90 95
Glu Ala Leu Glu Asp Ala Gly Ile Glu Ala Leu Ser Lys Glu Lys Lys
100 105 110
Asn Ile Gly Cys Val Leu Gly Ile Gly Gly Gly Gln Lys Ser Ser His
115 120 125
Glu Phe Tyr Ser Arg Leu Asn Tyr Val Val Val Glu Lys Val Leu Arg
130 135 140
Lys Met Gly Met Pro Glu Glu Asp Val Gln Ala Ala Val Glu Lys Tyr
145 150 155 160
Lys Ala Asn Phe Pro Glu Trp Arg Leu Asp Ser Phe Pro Gly Phe Leu
165 170 175
Gly Asn Val Thr Ala Gly Arg Cys Thr Asn Thr Phe Asn Leu Asp Gly
180 185 190
Met Asn Cys Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala Val
195 200 205
Lys Val Ala Ile Asp Glu Leu Leu His Gly Asp Cys Asp Met Met Ile
210 215 220
Thr Gly Ala Thr Cys Thr Asp Asn Ser Ile Gly Met Tyr Met Ala Phe
225 230 235 240
Ser Lys Thr Pro Val Phe Ser Thr Asp Pro Ser Val Arg Ala Tyr Asp
245 250 255
Glu Lys Thr Lys Gly Met Leu Ile Gly Glu Gly Ser Ala Met Leu Val
260 265 270
Leu Lys Arg Tyr Ala Asp Ala Val Arg Asp Gly Asp Glu Ile His Ala
275 280 285
Val Ile Arg Gly Cys Ala Ser Ser Ser Asp Gly Lys Ala Ser Gly Ile
290 295 300
Tyr Thr Pro Thr Ile Ser Gly Gln Glu Glu Ala Leu Arg Arg Ala Tyr
305 310 315 320
Met Arg Ala Asn Val Asp Pro Ala Thr Val Thr Leu Val Glu Gly His
325 330 335
Gly Thr Gly Thr Pro Val Gly Asp Arg Ile Glu Leu Thr Ala Leu Arg
340 345 350
Asn Leu Phe Asp Ser Ala Tyr Gly Asn Glu Lys Glu Lys Val Ala Val
355 360 365
Gly Ser Ile Lys Ser Asn Ile Gly His Leu Lys Ala Val Ala Gly Leu
370 375 380
Ala Gly Met Ile Lys Val Ile Met Ala Leu Lys His Lys Thr Leu Pro
385 390 395 400
Ala Thr Ile Asn Val Asp Glu Pro Pro Lys Leu Tyr Asp Asn Thr Pro
405 410 415
Ile Thr Asp Ser Ser Leu Tyr Ile Asn Thr Met Asn Arg Pro Trp Phe
420 425 430
Pro Ala Pro Gly Val Pro Arg Arg Ala Gly Ile Ser Ser Phe Gly Phe
435 440 445
Gly Gly Ala Asn Tyr His Ala Val Leu Glu Glu Ala
450 455 460
<210> 33
<211> 332
<212> PRT
<213> Ulkenia sp.
<400> 33
Leu Phe Ser Gly Gln Gly Ala Gln Tyr Thr His Met Phe Ser Glu Val
1 5 10 15
Ala Met Asn Trp Pro Gln Phe Arg Glu Ser Ile Ser Asp Met Asp Arg
20 25 30
Ala Gln Ala Lys Val Ala Gly Ala Asp Lys Asp Tyr Glu Arg Val Ser
35 40 45
Gln Val Leu Tyr Pro Arg Lys Pro Tyr Asn Ser Glu Pro Glu Gln Asp
50 55 60
His Lys Lys Ile Ser Leu Thr Ser Tyr Ser Gln Pro Ser Thr Leu Ala
65 70 75 80
Cys Ala Leu Gly Ala Tyr Glu Ile Phe Lys Gln Ala Gly Phe Lys Pro
85 90 95
Asp Phe Ala Ala Gly His Ser Leu Gly Glu Phe Ala Ala Leu Tyr Ala
100 105 110
Ala Asp Cys Val Asn Arg Asp Asp Leu Phe Glu Leu Val Cys Arg Arg
115 120 125
Ala Arg Ile Met Gly Gly Lys Asp Ala Pro Ala Thr Pro Lys Gly Cys
130 135 140
Met Ala Ala Val Ile Gly Pro Asn Ala Glu Lys Ile Gln Ile Arg Thr
145 150 155 160
Ala Asp Val Trp Leu Gly Asn Cys Asn Ser Pro Ser Gln Thr Val Ile
165 170 175
Thr Gly Ser Val Glu Gly Ile Lys Lys Glu Ser Glu Leu Leu Gln Ser
180 185 190
Glu Gly Phe Arg Val Val Pro Leu Ala Cys Glu Ser Ala Phe His Ser
195 200 205
Pro Gln Met Gln Asn Ala Ser Ser Ala Phe Lys Asp Val Leu Ser Lys
210 215 220
Val Ala Phe Arg Gln Pro Ser Ala Gln Thr Lys Leu Phe Ser Asn Val
225 230 235 240
Ser Gly Glu Thr Tyr Ser Asn Asn Ala Gln Asp Leu Leu Lys Glu His
245 250 255
Met Thr Ser Ser Val Lys Phe Ile Ser Gln Val Arg Asn Met His Ser
260 265 270
Ala Gly Ala Arg Ile Phe Val Glu Phe Gly Pro Lys Gln Val Leu Ser
275 280 285
Lys Leu Val Ser Glu Thr Leu Lys Asp Asp Pro Ser Ile Ile Thr Ile
290 295 300
Ser Val Asn Pro Ser Ser Gly Lys Asp Ala Asp Ile Gln Leu Arg Glu
305 310 315 320
Ala Ala Val Gln Leu Val Val Ala Gly Val Asn Leu
325 330
<210> 34
<211> 1170
<212> PRT
<213> Ulkenia sp.
<400> 34
Ala Gln Ala Gln Ile Gln Lys Ala Lys Ala Asp Ala Ala Glu Ala Asp
1 5 10 15
Lys Lys Leu Ala Ala Ala Lys Asp Glu Ala Lys Arg Ala Ala Ala Ser
20 25 30
Ala Pro Val Gln Lys Gln Val Asp Thr Thr Ile Val Asp Lys His Arg
35 40 45
Ala Ile Leu Lys Ser Met Leu Ala Glu Leu Asp Cys Tyr Ser Thr Pro
50 55 60
Gly Ala Val Ser Ser Ser Phe Gln Ala Pro Val Ala Ala Thr Pro Ala
65 70 75 80
Pro Val Ala Ala Pro Val Ala Ala Ala Pro Ala Pro Ala Val Asn Asn
85 90 95
Ala Leu Leu Ala Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala
100 105 110
Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu
115 120 125
Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu
130 135 140
Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser
145 150 155 160
Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile
165 170 175
Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro Val Ala Ala
180 185 190
Ala Pro Ala Ala Pro Ala Pro Ala Val Asn Ser Ala Leu Leu Ala Lys
195 200 205
Ala Glu Thr Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu
210 215 220
Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr Glu Leu Gly Ile
225 230 235 240
Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Gln Leu
245 250 255
Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val
260 265 270
Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly
275 280 285
Ala Ala Ala Ala Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val
290 295 300
Ala Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu
305 310 315 320
Ser Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp
325 330 335
Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser
340 345 350
Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Gln Leu Asn Val
355 360 365
Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu
370 375 380
Val Val Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala
385 390 395 400
Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val Ala Ala Ala Ala
405 410 415
Pro Ala Val Asn Ser Ala Leu Leu Glu Lys Ala Glu Thr Val Val Met
420 425 430
Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro
435 440 445
Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val
450 455 460
Glu Ile Leu Ser Glu Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp
465 470 475 480
Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala
485 490 495
Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro
500 505 510
Ala Pro Val Ala Ala Ala Pro Ala Pro Val Ala Ala Pro Ala Val Ser
515 520 525
Ser Ala Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala
530 535 540
Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu
545 550 555 560
Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser
565 570 575
Glu Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu
580 585 590
Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu
595 600 605
Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro Val Ala
610 615 620
Ala Ser Pro Ala Pro Val Ala Ala Ala Ala Pro Ala Val Ser Ser Ala
625 630 635 640
Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
645 650 655
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr
660 665 670
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
675 680 685
Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
690 695 700
Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala
705 710 715 720
Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro Val Ala Ala Ala
725 730 735
Pro Ala Pro Val Thr Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu
740 745 750
Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys Thr Gly
755 760 765
Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu
770 775 780
Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala
785 790 795 800
Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg
805 810 815
Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Ser Ser
820 825 830
Ser Gly Ala Ala Ala Pro Ala Pro Ala Ala Ala Val Ala Pro Ala Pro
835 840 845
Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser
850 855 860
Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met
865 870 875 880
Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile
885 890 895
Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu
900 905 910
Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val
915 920 925
Val Asn Ala Met Lys Ala Glu Ile Ala Ser Ser Ser Gly Ala Ala Ala
930 935 940
Pro Ala Pro Ala Ala Ala Ala Ala Pro Ala Pro Ala Ala Ala Pro Ala
945 950 955 960
Val Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val
965 970 975
Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met
980 985 990
Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile
995 1000 1005
Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp
1010 1015 1020
Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys
1025 1030 1035 1040
Ala Glu Ile Ala Ser Ser Ser Gly Ala Ala Ala Pro Ala Pro Ala Ala
1045 1050 1055
Ala Ala Ala Pro Ala Pro Ala Ala Ala Pro Ala Val Ser Ser Ala Leu
1060 1065 1070
Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys Thr
1075 1080 1085
Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu
1090 1095 1100
Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln
1105 1110 1115 1120
Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr
1125 1130 1135
Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Gly
1140 1145 1150
Ser Ser Gly Ala Ala Thr Ala Ser Ala Pro Ala Ala Ala Ala Ala Ala
1155 1160 1165
Pro Ala
1170
<210> 35
<211> 73
<212> PRT
<213> Ulkenia sp.
<400> 35
Leu Leu Ala Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210> 36
<211> 73
<212> PRT
<213> Ulkenia sp.
<400> 36
Leu Leu Ala Lys Ala Glu Thr Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210> 37
<211> 73
<212> PRT
<213> Ulkenia sp.
<400> 37
Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210> 38
<211> 73
<212> PRT
<213> Ulkenia sp.
<400> 38
Leu Leu Glu Lys Ala Glu Thr Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210> 39
<211> 73
<212> PRT
<213> Ulkenia sp.
<400> 39
Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210> 40
<211> 73
<212> PRT
<213> Ulkenia sp.
<400> 40
Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210> 41
<211> 73
<212> PRT
<213> Ulkenia sp.
<400> 41
Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210> 42
<211> 73
<212> PRT
<213> Ulkenia sp.
<400> 42
Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210> 43
<211> 73
<212> PRT
<213> Ulkenia sp.
<400> 43
Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210> 44
<211> 73
<212> PRT
<213> Ulkenia sp.
<400> 44
Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210> 45
<211> 203
<212> PRT
<213> Ulkenia sp.
<400> 45
Lys Lys Leu Val Gly Thr Ile Ala Gly Ala Arg Glu Val Arg Ser Ser
1 5 10 15
Ile Ala Asn Ile Glu Ala Leu Gly Gly Lys Ala Ile Tyr Ser Ser Cys
20 25 30
Asp Val Asn Ser Ala Ala Asp Val Ala Lys Ala Val Arg Glu Ala Glu
35 40 45
Ala Gln Leu Gly Ala Arg Val Thr Gly Val Val His Ala Ser Gly Val
50 55 60
Leu Arg Asp Arg Leu Ile Glu Gln Lys Arg Pro Asp Glu Phe Asp Ala
65 70 75 80
Val Phe Gly Thr Lys Val Thr Gly Leu Glu Asn Leu Phe Gly Ala Ile
85 90 95
Asp Met Ala Asn Leu Lys His Leu Val Leu Phe Ser Ser Leu Ala Gly
100 105 110
Phe His Gly Asn Ile Gly Gln Ser Asp Tyr Ala Met Ala Asn Glu Ala
115 120 125
Leu Asn Lys Met Gly Leu Glu Leu Ser Asp Arg Val Ser Val Lys Ser
130 135 140
Ile Cys Phe Gly Pro Trp Asp Gly Gly Met Val Thr Pro Gln Leu Lys
145 150 155 160
Lys Gln Phe Gln Ser Met Gly Val Gln Ile Ile Pro Arg Glu Gly Gly
165 170 175
Ala Asp Thr Val Ala Arg Ile Val Leu Gly Ser Ser Pro Ala Glu Ile
180 185 190
Leu Val Gly Asn Trp Thr Thr Pro Thr Lys Lys
195 200
<210> 46
<211> 780
<212> DNA
<213> Ulkenia sp.
<400> 46
aagcgcattg ccgtggtggg catggccgtg caatacgcgg gctgcaaaga caaggaagag 60
ttctggaaag tagtcatggg cggtgaggct gcatggacta agattagcga taaacgcctc 120
ggatccaaca agcgagccga gcacttcaaa gcagagcgta gcaaatttgc agataccttt 180
tgcaacgaga actacggctg cgtcgatgac tccgtcgata acgaacacga gcttctcctt 240
aagctctcca agaaggctct ctccgagaca tcggtctccg actctacaag gtgcggtatt 300
gtgagcggat gcctgtcctt tcccatggac aacctccagg gcgaactcct caatgtgtac 360
caaaaccacg tcgaaaagaa actcggcgct cgcgtcttca aggatgcctc caagtggtcc 420
gagcgtgagc agtcgcagaa ccccgaggct ggtgaccgcc gcatctttat ggacccggca 480
tccttcgtag cagaagagct caacctcggt cctcttcact actctgtcga tgctgcctgt 540
gccaccgccc tttacgtcct tcgcctcgcc caggaccacc tcgtttccgg tgctgctgat 600
gtcatgctcg ctggtgcaac ttgcttcccg gagccctttt tcattctctc cggattctcc 660
actttccagg ccatgcctgt atcgggagac ggcatctcgt acccgcttca caaggacagt 720
cagggtctca cccctggtga aggtggtgcc attatggttc tcaagcgcct tgacgacgct 780
780
<210> 47
<211> 51
<212> DNA
<213> Ulkenia sp.
<400> 47
cctcttcact actctgtcga tgctgcctgt gccaccgccc tttacgtcct t 51
<210> 48
<211> 12
<212> DNA
<213> Ulkenia sp.
<400> 48
gatgctgcct gt 12
<210> 49
<211> 477
<212> DNA
<213> Ulkenia sp.
<400> 49
tacggtactc tgctcggtgc taccatcagc aatgctggct gtggtcttcc cctcaagccg 60
cacttgccca gcgagaagtc ctgcctcatt gatacctaca agcgcgtcaa cgtgcacccg 120
cacaagatcc agtacgtcga gtgccacgca acgggtactc cccagggaga ccgcgttgag 180
attgatgccg tcaaggcttg cttcgagggc aaggtgcctc gctttggaag ctccaagggt 240
aactttggcc acacactcgt tgcagctggt ttcgcaggca tgtgcaaggt actccttgcc 300
atgaagcatg gtgtgatccc gcccactcct ggtgtcgatg gatcttccca aatggacccg 360
cttgtggtct ctgagcccat cccatggccc gacactgagg gcgagcccaa gcgcgctggt 420
ctctccgctt tcggctttgg tggcaccaac gcccacgcag tctttgagga gtttgac 477
<210> 50
<211> 1278
<212> DNA
<213> Ulkenia sp.
<400> 50
aagcgcattg ccgtggtggg catggccgtg caatacgcgg gctgcaaaga caaggaagag 60
ttctggaaag tagtcatggg cggtgaggct gcatggacta agattagcga taaacgcctc 120
ggatccaaca agcgagccga gcacttcaaa gcagagcgta gcaaatttgc agataccttt 180
tgcaacgaga actacggctg cgtcgatgac tccgtcgata acgaacacga gcttctcctt 240
aagctctcca agaaggctct ctccgagaca tcggtctccg actctacaag gtgcggtatt 300
gtgagcggat gcctgtcctt tcccatggac aacctccagg gcgaactcct caatgtgtac 360
caaaaccacg tcgaaaagaa actcggcgct cgcgtcttca aggatgcctc caagtggtcc 420
gagcgtgagc agtcgcagaa ccccgaggct ggtgaccgcc gcatctttat ggacccggca 480
tccttcgtag cagaagagct caacctcggt cctcttcact actctgtcga tgctgcctgt 540
gccaccgccc tttacgtcct tcgcctcgcc caggaccacc tcgtttccgg tgctgctgat 600
gtcatgctcg ctggtgcaac ttgcttcccg gagccctttt tcattctctc cggattctcc 660
actttccagg ccatgcctgt atcgggagac ggcatctcgt acccgcttca caaggacagt 720
cagggtctca cccctggtga aggtggtgcc attatggttc tcaagcgcct tgacgacgct 780
attcgcgatg gagaccacat ttacggtact ctgctcggtg ctaccatcag caatgctggc 840
tgtggtcttc ccctcaagcc gcacttgccc agcgagaagt cctgcctcat tgatacctac 900
aagcgcgtca acgtgcaccc gcacaagatc cagtacgtcg agtgccacgc aacgggtact 960
ccccagggag accgcgttga gattgatgcc gtcaaggctt gcttcgaggg caaggtgcct 1020
cgctttggaa gctccaaggg taactttggc cacacactcg ttgcagctgg tttcgcaggc 1080
atgtgcaagg tactccttgc catgaagcat ggtgtgatcc cgcccactcc tggtgtcgat 1140
ggatcttccc aaatggaccc gcttgtggtc tctgagccca tcccatggcc cgacactgag 1200
ggcgagccca agcgcgctgg tctctccgct ttcggctttg gtggcaccaa cgcccacgca 1260
gtctttgagg agtttgac 1278
<210> 51
<211> 801
<212> DNA
<213> Ulkenia sp.
<400> 51
atgcgcattg ctattaccgg tatggatgcc accttcggct ccctcaaggg cctggacgcc 60
tttgagcgtg ccatctacaa tggccaacat ggtgctgtgc cattgcctga gaagcgctgg 120
cgtttccttg gtaaagacaa ggactttttg gacctgtgcg gtgtcaagga ggtgccccac 180
ggatgctaca ttgaggacgt cgaggtggac tttagccgcc tgcgcacgcc catgacgcca 240
gacgacatgt tgcgccccat gcagctactt gctgtcacaa ccatcgaccg tgccattctc 300
aactctggcc tcaagaaggg aggtaaggtc gctgtcttcg tcggccttgg cactgacctt 360
gagctctacc gtcaccgcgc ccgcgttgcc ctcaaggagc gtgctcgtcc cgaagccgct 420
tcagccctca atgatatgat gtcctacatc aacgattgcg gtaccgctac ctcgtacaca 480
tcctacatcg gcaacctcgt ggccacccgc gtgtcttcac aatggggttt cgagggtcct 540
tctttcacca tcacagaggg caacaactcc gtctaccgtt gcgcagagtt gggcaagtac 600
ttgctcgaga ctggcgaggt cgaggccgta gtgatcgccg gtgtggatct ttgcgccagc 660
gctgagaatc tctacgtgaa gtcgcgtcgt ttcaaggtct cggagcagga gagcccgcgg 720
gccagcttcg actccggcgc tgacggctac tttgttggtg agggatgtgg tgccctcgtc 780
ctcaagcgcg agagcgactg c 801
<210> 52
<211> 792
<212> DNA
<213> Ulkenia sp.
<400> 52
gctgctttcg gactgagcct tggagagatt tccatggttt ttgccttttc tgagaagaac 60
ggccttgtct ctgaggagct gacaactaaa ctccgcaact cggaggtctg gcgtaaggcc 120
ctcgctgttg agtttgacgc cctccgcaag gcctggaata ttccccaaga tacccctgtc 180
agcgagttct ggcaaggata cgtggtacgt ggaacccgcg aggccgttga agcggccatc 240
ggccccaaca ataagtacgt gcacttgacc attgtcaacg atgccaacag tgctctcatc 300
agtggcaagc ctgaagattg caaggctgcc attgctcgcc tgagcagcaa cctccctgct 360
ttgcccgtgg accttggtat gtgtggccac tgccccgtgg tcgagccgta cggcaagcag 420
atcgctgaga tccatagcgt cctcgagatt cccgaggttg ccggccttga cctgtacacg 480
agcgtcaacc agaagaagct tgttaacaag tccactggag ccagcgacga gtacgcaccc 540
agctttggtg aatacgcagc acagctgtac actgttcagg cagactttcc taagatcgcc 600
aagaccgtta gcgacaagaa ctttgacgtc tttgttgaga ctggtcccaa cgcccaccgt 660
agcgccgcaa ttcgcgccac ccttggaaat agcaagcctt ttgtcaccgg atccatggac 720
cgccagaacg agaatgcttg gacaaccatg gtcaagctgg ttgcctctct ccaagcccac 780
cgcgtgcctg gc 792
<210> 53
<211> 1302
<212> DNA
<213> Ulkenia sp.
<400> 53
agccgtgcct tcatggagac atatggtgta tccgccccca tgtacaccgg cgccatggca 60
aagggcattg catccgctga gatggttatc gctgccggaa agcgcggcat ccttggttct 120
ctcggtgctg gtggtcttcc tatcgccacc gtacgcaagg ctctcgaagc tatccaggct 180
gaactgccca agggccctta cgctgtcaac ctcatccact ctcccttcga cagcaacctc 240
gagaagggta acgtcgacct cttcctcgag aagggcgtca ctgtcgttga agcctccgcc 300
tttatgacct tgaccccgca gctcgtgcgc taccgtgctg caggtctctc tcgcgctgct 360
gatggctcca cggttattaa gaaccgcgtc atcggtaagg tttctcgcac agagcttgcc 420
gcaatgttta tccgtcccgc gcccgagaat ctcctcgaga agctgctgaa gtccggcgag 480
atcacccaag agcaggctgc tctcgcacgc acagtgcctg tggcagacga cattgccgtt 540
gaggcggact ccggtggcca caccgataac cgccccatcc acgtcatcct ccctctcatt 600
gtcaacctcc gtgatcgtct gcacaaggag tgcggctacc ctgcccacct tcgcgttcgc 660
gttggtgctg gtggtggcat tggatgccct caggccgcca ttgccacctt caacatgggc 720
gcggccttca tcgtcactgg taccgtaaac cagatgagta agcaagctgg aacctgtgac 780
accgttcgca agcagctctc acaagccacc tactccgaca tctgcatggc cccagcagct 840
gacatgtttg aggaaggtgt caagctccag gtgctcaaga agggaactat gttcccctcg 900
cgtgccaaca agctctatga gctcttcgtc aagtatgact cctttgagtc catggctcct 960
ggagagctgg aacgtgtgga gaagcgcatt ttcaagaagt ctctgtcaga agtttgggaa 1020
gagaccaagg acttctacat caacaggttg cagaacccgg agaagattga gcgcgcggag 1080
cgtgacccca agcttaagat gtccttgtgc ttccgctggt accttggttt ggcgagcttc 1140
tgggcaaacg ctggcatccc ggaccgtgcc atggactacc aggtttggtg tggcccagcg 1200
attggatctt tcaacgactt catcaagggt acctaccttg accccgccgt tgccaacgag 1260
taccccgatg ttgtgcaaat caacttgcag atcctccgtg gt 1302
<210> 54
<211> 260
<212> PRT
<213> Ulkenia sp.
<400> 54
Lys Arg Ile Ala Val Val Gly Met Ala Val Gln Tyr Ala Gly Cys Lys
1 5 10 15
Asp Lys Glu Glu Phe Trp Lys Val Val Met Gly Gly Glu Ala Ala Trp
20 25 30
Thr Lys Ile Ser Asp Lys Arg Leu Gly Ser Asn Lys Arg Ala Glu His
35 40 45
Phe Lys Ala Glu Arg Ser Lys Phe Ala Asp Thr Phe Cys Asn Glu Asn
50 55 60
Tyr Gly Cys Val Asp Asp Ser Val Asp Asn Glu His Glu Leu Leu Leu
65 70 75 80
Lys Leu Ser Lys Lys Ala Leu Ser Glu Thr Ser Val Ser Asp Ser Thr
85 90 95
Arg Cys Gly Ile Val Ser Gly Cys Leu Ser Phe Pro Met Asp Asn Leu
100 105 110
Gln Gly Glu Leu Leu Asn Val Tyr Gln Asn His Val Glu Lys Lys Leu
115 120 125
Gly Ala Arg Val Phe Lys Asp Ala Ser Lys Trp Ser Glu Arg Glu Gln
130 135 140
Ser Gln Asn Pro Glu Ala Gly Asp Arg Arg Ile Phe Met Asp Pro Ala
145 150 155 160
Ser Phe Val Ala Glu Glu Leu Asn Leu Gly Pro Leu His Tyr Ser Val
165 170 175
Asp Ala Ala Cys Ala Thr Ala Leu Tyr Val Leu Arg Leu Ala Gln Asp
180 185 190
His Leu Val Ser Gly Ala Ala Asp Val Met Leu Ala Gly Ala Thr Cys
195 200 205
Phe Pro Glu Pro Phe Phe Ile Leu Ser Gly Phe Ser Thr Phe Gln Ala
210 215 220
Met Pro Val Ser Gly Asp Gly Ile Ser Tyr Pro Leu His Lys Asp Ser
225 230 235 240
Gln Gly Leu Thr Pro Gly Glu Gly Gly Ala Ile Met Val Leu Lys Arg
245 250 255
Leu Asp Asp Ala
260
<210> 55
<211> 17
<212> PRT
<213> Ulkenia sp.
<400> 55
Pro Leu His Tyr Ser Val Asp Ala Ala Cys Ala Thr Ala Leu Tyr Val
1 5 10 15
Leu
<210> 56
<211> 4
<212> PRT
<213> Ulkenia sp.
<400> 56
Asp Ala Ala Cys
1
<210> 57
<211> 159
<212> PRT
<213> Ulkenia sp.
<400> 57
Tyr Gly Thr Leu Leu Gly Ala Thr Ile Ser Asn Ala Gly Cys Gly Leu
1 5 10 15
Pro Leu Lys Pro His Leu Pro Ser Glu Lys Ser Cys Leu Ile Asp Thr
20 25 30
Tyr Lys Arg Val Asn Val His Pro His Lys Ile Gln Tyr Val Glu Cys
35 40 45
His Ala Thr Gly Thr Pro Gln Gly Asp Arg Val Glu Ile Asp Ala Val
50 55 60
Lys Ala Cys Phe Glu Gly Lys Val Pro Arg Phe Gly Ser Ser Lys Gly
65 70 75 80
Asn Phe Gly His Thr Leu Val Ala Ala Gly Phe Ala Gly Met Cys Lys
85 90 95
Val Leu Leu Ala Met Lys His Gly Val Ile Pro Pro Thr Pro Gly Val
100 105 110
Asp Gly Ser Ser Gln Met Asp Pro Leu Val Val Ser Glu Pro Ile Pro
115 120 125
Trp Pro Asp Thr Glu Gly Glu Pro Lys Arg Ala Gly Leu Ser Ala Phe
130 135 140
Gly Phe Gly Gly Thr Asn Ala His Ala Val Phe Glu Glu Phe Asp
145 150 155
<210> 58
<211> 426
<212> PRT
<213> Ulkenia sp.
<400> 58
Lys Arg Ile Ala Val Val Gly Met Ala Val Gln Tyr Ala Gly Cys Lys
1 5 10 15
Asp Lys Glu Glu Phe Trp Lys Val Val Met Gly Gly Glu Ala Ala Trp
20 25 30
Thr Lys Ile Ser Asp Lys Arg Leu Gly Ser Asn Lys Arg Ala Glu His
35 40 45
Phe Lys Ala Glu Arg Ser Lys Phe Ala Asp Thr Phe Cys Asn Glu Asn
50 55 60
Tyr Gly Cys Val Asp Asp Ser Val Asp Asn Glu His Glu Leu Leu Leu
65 70 75 80
Lys Leu Ser Lys Lys Ala Leu Ser Glu Thr Ser Val Ser Asp Ser Thr
85 90 95
Arg Cys Gly Ile Val Ser Gly Cys Leu Ser Phe Pro Met Asp Asn Leu
100 105 110
Gln Gly Glu Leu Leu Asn Val Tyr Gln Asn His Val Glu Lys Lys Leu
115 120 125
Gly Ala Arg Val Phe Lys Asp Ala Ser Lys Trp Ser Glu Arg Glu Gln
130 135 140
Ser Gln Asn Pro Glu Ala Gly Asp Arg Arg Ile Phe Met Asp Pro Ala
145 150 155 160
Ser Phe Val Ala Glu Glu Leu Asn Leu Gly Pro Leu His Tyr Ser Val
165 170 175
Asp Ala Ala Cys Ala Thr Ala Leu Tyr Val Leu Arg Leu Ala Gln Asp
180 185 190
His Leu Val Ser Gly Ala Ala Asp Val Met Leu Ala Gly Ala Thr Cys
195 200 205
Phe Pro Glu Pro Phe Phe Ile Leu Ser Gly Phe Ser Thr Phe Gln Ala
210 215 220
Met Pro Val Ser Gly Asp Gly Ile Ser Tyr Pro Leu His Lys Asp Ser
225 230 235 240
Gln Gly Leu Thr Pro Gly Glu Gly Gly Ala Ile Met Val Leu Lys Arg
245 250 255
Leu Asp Asp Ala Ile Arg Asp Gly Asp His Ile Tyr Gly Thr Leu Leu
260 265 270
Gly Ala Thr Ile Ser Asn Ala Gly Cys Gly Leu Pro Leu Lys Pro His
275 280 285
Leu Pro Ser Glu Lys Ser Cys Leu Ile Asp Thr Tyr Lys Arg Val Asn
290 295 300
Val His Pro His Lys Ile Gln Tyr Val Glu Cys His Ala Thr Gly Thr
305 310 315 320
Pro Gln Gly Asp Arg Val Glu Ile Asp Ala Val Lys Ala Cys Phe Glu
325 330 335
Gly Lys Val Pro Arg Phe Gly Ser Ser Lys Gly Asn Phe Gly His Thr
340 345 350
Leu Val Ala Ala Gly Phe Ala Gly Met Cys Lys Val Leu Leu Ala Met
355 360 365
Lys His Gly Val Ile Pro Pro Thr Pro Gly Val Asp Gly Ser Ser Gln
370 375 380
Met Asp Pro Leu Val Val Ser Glu Pro Ile Pro Trp Pro Asp Thr Glu
385 390 395 400
Gly Glu Pro Lys Arg Ala Gly Leu Ser Ala Phe Gly Phe Gly Gly Thr
405 410 415
Asn Ala His Ala Val Phe Glu Glu Phe Asp
420 425
<210> 59
<211> 267
<212> PRT
<213> Ulkenia sp.
<400> 59
Met Arg Ile Ala Ile Thr Gly Met Asp Ala Thr Phe Gly Ser Leu Lys
1 5 10 15
Gly Leu Asp Ala Phe Glu Arg Ala Ile Tyr Asn Gly Gln His Gly Ala
20 25 30
Val Pro Leu Pro Glu Lys Arg Trp Arg Phe Leu Gly Lys Asp Lys Asp
35 40 45
Phe Leu Asp Leu Cys Gly Val Lys Glu Val Pro His Gly Cys Tyr Ile
50 55 60
Glu Asp Val Glu Val Asp Phe Ser Arg Leu Arg Thr Pro Met Thr Pro
65 70 75 80
Asp Asp Met Leu Arg Pro Met Gln Leu Leu Ala Val Thr Thr Ile Asp
85 90 95
Arg Ala Ile Leu Asn Ser Gly Leu Lys Lys Gly Gly Lys Val Ala Val
100 105 110
Phe Val Gly Leu Gly Thr Asp Leu Glu Leu Tyr Arg His Arg Ala Arg
115 120 125
Val Ala Leu Lys Glu Arg Ala Arg Pro Glu Ala Ala Ser Ala Leu Asn
130 135 140
Asp Met Met Ser Tyr Ile Asn Asp Cys Gly Thr Ala Thr Ser Tyr Thr
145 150 155 160
Ser Tyr Ile Gly Asn Leu Val Ala Thr Arg Val Ser Ser Gln Trp Gly
165 170 175
Phe Glu Gly Pro Ser Phe Thr Ile Thr Glu Gly Asn Asn Ser Val Tyr
180 185 190
Arg Cys Ala Glu Leu Gly Lys Tyr Leu Leu Glu Thr Gly Glu Val Glu
195 200 205
Ala Val Val Ile Ala Gly Val Asp Leu Cys Ala Ser Ala Glu Asn Leu
210 215 220
Tyr Val Lys Ser Arg Arg Phe Lys Val Ser Glu Gln Glu Ser Pro Arg
225 230 235 240
Ala Ser Phe Asp Ser Gly Ala Asp Gly Tyr Phe Val Gly Glu Gly Cys
245 250 255
Gly Ala Leu Val Leu Lys Arg Glu Ser Asp Cys
260 265
<210> 60
<211> 264
<212> PRT
<213> Ulkenia sp.
<400> 60
Ala Ala Phe Gly Leu Ser Leu Gly Glu Ile Ser Met Val Phe Ala Phe
1 5 10 15
Ser Glu Lys Asn Gly Leu Val Ser Glu Glu Leu Thr Thr Lys Leu Arg
20 25 30
Asn Ser Glu Val Trp Arg Lys Ala Leu Ala Val Glu Phe Asp Ala Leu
35 40 45
Arg Lys Ala Trp Asn Ile Pro Gln Asp Thr Pro Val Ser Glu Phe Trp
50 55 60
Gln Gly Tyr Val Val Arg Gly Thr Arg Glu Ala Val Glu Ala Ala Ile
65 70 75 80
Gly Pro Asn Asn Lys Tyr Val His Leu Thr Ile Val Asn Asp Ala Asn
85 90 95
Ser Ala Leu Ile Ser Gly Lys Pro Glu Asp Cys Lys Ala Ala Ile Ala
100 105 110
Arg Leu Ser Ser Asn Leu Pro Ala Leu Pro Val Asp Leu Gly Met Cys
115 120 125
Gly His Cys Pro Val Val Glu Pro Tyr Gly Lys Gln Ile Ala Glu Ile
130 135 140
His Ser Val Leu Glu Ile Pro Glu Val Ala Gly Leu Asp Leu Tyr Thr
145 150 155 160
Ser Val Asn Gln Lys Lys Leu Val Asn Lys Ser Thr Gly Ala Ser Asp
165 170 175
Glu Tyr Ala Pro Ser Phe Gly Glu Tyr Ala Ala Gln Leu Tyr Thr Val
180 185 190
Gln Ala Asp Phe Pro Lys Ile Ala Lys Thr Val Ser Asp Lys Asn Phe
195 200 205
Asp Val Phe Val Glu Thr Gly Pro Asn Ala His Arg Ser Ala Ala Ile
210 215 220
Arg Ala Thr Leu Gly Asn Ser Lys Pro Phe Val Thr Gly Ser Met Asp
225 230 235 240
Arg Gln Asn Glu Asn Ala Trp Thr Thr Met Val Lys Leu Val Ala Ser
245 250 255
Leu Gln Ala His Arg Val Pro Gly
260
<210> 61
<211> 434
<212> PRT
<213> Ulkenia sp.
<400> 61
Ser Arg Ala Phe Met Glu Thr Tyr Gly Val Ser Ala Pro Met Tyr Thr
1 5 10 15
Gly Ala Met Ala Lys Gly Ile Ala Ser Ala Glu Met Val Ile Ala Ala
20 25 30
Gly Lys Arg Gly Ile Leu Gly Ser Leu Gly Ala Gly Gly Leu Pro Ile
35 40 45
Ala Thr Val Arg Lys Ala Leu Glu Ala Ile Gln Ala Glu Leu Pro Lys
50 55 60
Gly Pro Tyr Ala Val Asn Leu Ile His Ser Pro Phe Asp Ser Asn Leu
65 70 75 80
Glu Lys Gly Asn Val Asp Leu Phe Leu Glu Lys Gly Val Thr Val Val
85 90 95
Glu Ala Ser Ala Phe Met Thr Leu Thr Pro Gln Leu Val Arg Tyr Arg
100 105 110
Ala Ala Gly Leu Ser Arg Ala Ala Asp Gly Ser Thr Val Ile Lys Asn
115 120 125
Arg Val Ile Gly Lys Val Ser Arg Thr Glu Leu Ala Ala Met Phe Ile
130 135 140
Arg Pro Ala Pro Glu Asn Leu Leu Glu Lys Leu Leu Lys Ser Gly Glu
145 150 155 160
Ile Thr Gln Glu Gln Ala Ala Leu Ala Arg Thr Val Pro Val Ala Asp
165 170 175
Asp Ile Ala Val Glu Ala Asp Ser Gly Gly His Thr Asp Asn Arg Pro
180 185 190
Ile His Val Ile Leu Pro Leu Ile Val Asn Leu Arg Asp Arg Leu His
195 200 205
Lys Glu Cys Gly Tyr Pro Ala His Leu Arg Val Arg Val Gly Ala Gly
210 215 220
Gly Gly Ile Gly Cys Pro Gln Ala Ala Ile Ala Thr Phe Asn Met Gly
225 230 235 240
Ala Ala Phe Ile Val Thr Gly Thr Val Asn Gln Met Ser Lys Gln Ala
245 250 255
Gly Thr Cys Asp Thr Val Arg Lys Gln Leu Ser Gln Ala Thr Tyr Ser
260 265 270
Asp Ile Cys Met Ala Pro Ala Ala Asp Met Phe Glu Glu Gly Val Lys
275 280 285
Leu Gln Val Leu Lys Lys Gly Thr Met Phe Pro Ser Arg Ala Asn Lys
290 295 300
Leu Tyr Glu Leu Phe Val Lys Tyr Asp Ser Phe Glu Ser Met Ala Pro
305 310 315 320
Gly Glu Leu Glu Arg Val Glu Lys Arg Ile Phe Lys Lys Ser Leu Ser
325 330 335
Glu Val Trp Glu Glu Thr Lys Asp Phe Tyr Ile Asn Arg Leu Gln Asn
340 345 350
Pro Glu Lys Ile Glu Arg Ala Glu Arg Asp Pro Lys Leu Lys Met Ser
355 360 365
Leu Cys Phe Arg Trp Tyr Leu Gly Leu Ala Ser Phe Trp Ala Asn Ala
370 375 380
Gly Ile Pro Asp Arg Ala Met Asp Tyr Gln Val Trp Cys Gly Pro Ala
385 390 395 400
Ile Gly Ser Phe Asn Asp Phe Ile Lys Gly Thr Tyr Leu Asp Pro Ala
405 410 415
Val Ala Asn Glu Tyr Pro Asp Val Val Gln Ile Asn Leu Gln Ile Leu
420 425 430
Arg Gly
<210> 62
<211> 2000
<212> DNA
<213> Ulkenia sp.
<400> 62
gagcacgcac catcttctct ccacgcgtaa agaagagcag agccagaggc aggtaggtat 60
ctccacccat ctcaggctgt gacttctttg tttctttctt tctttgcttg ttttctgttc 120
tctctctgtg ctctgtccac acgagaaaga gaaagagaga gagaaagaac cacgggttta 180
tagagcgcac tcgtccttcc tgcttcagca gaaagcactg cgtaggagaa ctacggggga 240
ggaggaagca cgcacggagg aggcgtggaa ggaaggagga gacagagaga gagagacact 300
gagggacaga gggggagagg cagagggaga ggcatctgat gtttgcgaga aaccaataag 360
ttttgaaagt gatttgattt agctgattga ctgatctatg gcctgaaaga aagcttttaa 420
agcggaggga gatagatgac gagggcagct gcgatggcgt acggcgcatc cgtctctctc 480
tgtgtctctc tctctttctc tctcgtcagg gcgtggagac ctcggaagct gcacgcggcg 540
cggtgaggag gcagggcagc agagggagag gagagatccc agagtcgaag agcattgatt 600
gattgcagat gatcttgggc aacgcgcgtc agcttgagcg aggaatgctt tggacttcag 660
gttcttcgct tctgtgtttc attctttctc gaagaaagaa agaatgaaag aaagagagaa 720
agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa tgaatgaatg aaagaaagag 780
agaaagaaag aacgaatgaa agaaagagag aaagaatcaa agagaaagcg cattcgcagt 840
tcttcttcgt gaaagaaaag gaaaagagag gcgatggtag gctctgatct catcatttct 900
ggtttctctg ttgtacctgt actctgtgct tgtggccttg cgaaggctga agacgccatg 960
cagacaacca cgcctccgca gagactttgc gggaaagcag agggcttctc gccactctcg 1020
aagaaacgag ctcgccagtt ttcggggttg ttctcagaat tgcgagtgtt ggctttatat 1080
gggatgatgg tatggcactt cgtcatcgtt actctcgctc gcttgcttac gaagattttc 1140
aaaagggcga aagaagtgct cagcttttaa aataaagtca caccaaagac taggccgcat 1200
agcagaaagc taaagtaaac ccaatctgtc tgaagagagt gtcgtggtta gatacttacg 1260
caagagttta aaagctgtaa atagtacagg aacaaaaaca aataaatata tatatattct 1320
tttttattag taaaacatga aaccaaaaaa ctcctttaaa ataaaataaa ataaaataaa 1380
ataaaataaa ataaaataaa tttactacta tatatacata tatatataca ataaataaaa 1440
acaacttttt cagaccagaa aaagactgag aaaaaaggaa actaatgact ctcgagcacc 1500
gagagcgata taagagtgga ttatatttgc taggcccacc acgagtgagt cccctaggag 1560
gaagcgccct ctgagacagg agcagaggcg tcgctggtgc tccaaaaagc gacggcgaat 1620
ggaaagcaaa accctttcga gggaggcttg tggccgtgac tattcaaatc tccagcatct 1680
cagctccagc acagcagaag ctacctcgct tctcagctct agctatcaca tcgatcgcag 1740
catctagctc gtagacagct agcgccgcac cttcccccaa atcaacttgg gcaacttaac 1800
tcttttttca ccagaactcc tcttttcctt taatcttcga aaagaagacg aataaaagag 1860
ataatcctct gccgcagcac attctaaaag aaaagcggca tactggcgta ggcaagactt 1920
tcaagctctt cctcgcctcc accccgtatt tccctgttca tctttgtgaa acgaggaaac 1980
aagaaatttt ataggacaag 2000
<210> 63
<211> 2000
<212> DNA
<213> Ulkenia sp.
<400> 63
agttgtgagg ctgtcttgtc ttgtcagtcg cgaaagtgta agcaagaact ttgtcataca 60
aagaagcaac caacttccga accaacacac cttgtaggat tacaaccaca actttctata 120
aatagtgcgc aagaataacc agtaagctat ccttcgtgta cctgttacaa caacgacatt 180
tttacttgat cttcctactt gtgatgggta gtcccggctt gtactgacag tgatgccaca 240
gcagagtaga tcactgtgaa taagtaaata agcctactta ttatattccc aaagtactcg 300
ctgggatatt attagtatca cgaaaagtga tatgttttat aactcgcttg tcttgccaag 360
atctaacctt ttttttttaa atggccaaaa agtcgccaga acacatctta caataaacaa 420
aaatttagat tatatcgtat gtataatgta taatatatta tattattata tacatacgat 480
ataatctaaa gccattccag acttattcgg tgatgaaaaa tgctttccca gctttataca 540
aactattcaa aaagttgcat gacccatttt cagatatatt taatagtata agattatgtc 600
catttgtttt caaagttatt caagagttta catcttgaag tttcatccct ttactactac 660
actgtttttc gtttgggttt tttctctaac ggcgaaagaa acaagtcacc aagcttaact 720
agtaggcatc tttgtggtga cgaaattaaa gttgaatata taaattatag ttagtcatta 780
tggaatctca gtttgaacga agctaagcta tttataaaaa tcactgcatg gagataatac 840
ttgaattttg atgatagtgt ttatgaagaa gtttaatctt gctttttatt aatgttattc 900
tctaatatag aaatatttca ataaaaaaat catatgaagg gataataaat acagagaatg 960
atcgttatca tttgatatgt cgaacgctaa tctatcatct tatctaggaa acaaaggtgg 1020
aaataaagga aagccctaca cgagttaatt cctcaaacga actactttgg attatcaaat 1080
ccaactgctg acactggata catgcatgta tttagtgggt gttactgtac ttccttattt 1140
cctttaattc aattgtcttg atttttactt cggagattct acttgaaaat catctccctt 1200
cacttccggt tatacagaaa gacccttcaa ttcgaatgct ggccaggtac aataactatc 1260
agcgattccc ctccactaga catgaccgac tgtaagcacc tcaacccgat ttcaagcaac 1320
acatgatgac tagctgtttc cgcaaaacaa caaataagag aggtagtgga aaacacccag 1380
ttcgctcgag ctcccctagt agattcgaca ttcactttct atttgattgc taattgtggg 1440
tccggctatt taaggaaaga actgatgaaa gtccacctca cgcaatcaaa tcgcggtcta 1500
gttggaagct acaatggccg acgtatgcgc gcctctatct tttaggattg tagaacaggg 1560
cggcaatctg ctaacataaa tttaatacct tgctcaagct gctttccata cttttcaatc 1620
catttgtgat aatcttgcaa tggaccaatc tccaaatctg tagaagcaat aacaaggaca 1680
tcgcagggtc ccggttcgtt tgcatgctcg tcttctggtg ccacaacaat gctgcctgtt 1740
attatctcat gagagtcttt atactgcgga tccgtggcta tagcgtgaat aaacgttgtg 1800
cgcaagccta tatcctcgcg atggagatac tggcctgcta cagtttgcgt tcgtctgcct 1860
acgacaacgc atggaacatt ctttggtgtg cgagtgggcc gtagcgttcg accctgggca 1920
aggaagccat gcagacgtga ttccgagagg ccatctcgcg tgtaagactt atcccaattt 1980
tctggatcct ctaatttcca 2000
<210> 64
<211> 2000
<212> DNA
<213> Ulkenia sp.
<400> 64
aaattaatga atgaatcaat gaatgaatca atgaataatg ccaatgcaat gcgatgcgat 60
gctgcttcga gccatcgcac ggcggccatt gcgcgcttgc gtcagtcatg tcattccatt 120
cggagcggcg tgcgcgaggg agggagggag ggagggagaa gacgaggagc aggcggagag 180
agaggaggat gggcgggcgg gcggcgtcgt cggcgtcgtc gtcgtcgtgg gcctccgtag 240
tcgctgggaa ggagggcttt gattccaaat gaggattttg gtgcactgct ttcgagactt 300
tctcgcctga ttcggaattc ctcctcttct tcttcttttt agctgtgctt tctgcgtatt 360
cattgcgtgg gtttggcttg gttttcaaat caattagcag tctagtaact aacaaactaa 420
caaacagata aacagacaaa cagacaaaca aacaaaacaa acaaaacaaa caaaacaaag 480
caggaaagaa agaaacaaac aaatatacaa acaaagaaag aaagaagtgg tgggaactag 540
ggaaatcaat gtgtttgctt ctttcgcacc tttgcttttc ttgcttttct tggttctcaa 600
gtaagcgttt atcgcgccct cagaaaacaa aataaaatga tctaacataa catgaattta 660
tatttatttt atttgtttat taaataaata ttttttgtaa accagaattt cactctactt 720
ttgcaacact gagagagtgc catctgcata ataagtggca gtgttttttt gtttattttc 780
aaattaatta tacttgaact gctaggtcaa gaggccgcag cagcctgatg agataaggac 840
agagtaggca aggatggcag aagatcgcga aaaaagcgag aaaggcaaac gagcaggccc 900
gaaggtgagg tggagctgct tgtcaaggtc gcgaggtttg tttgacagtt ataacagcaa 960
gaactaaggc aatttcaaga atgaagagca ctcgaataaa ccgatgaagc aaagtgtgta 1020
catacaaaca tacatacgta cagatgaaaa gaacagattt tcaataaaaa tgacttttta 1080
gtttaaacaa tgtttctgtt tgttgtttcg cttttcatta atttgttgca aattattttg 1140
ttttggtttt tgtttttgtt tttgaaaatc ataaaagaga tgctgccgca gacgtctgcg 1200
cgtctcatag ttgattgggt aatcgttttg ttgagttttg aaaatgtaaa cttcacttag 1260
ttgctcattt atcctcattc gtttgcccat ttgttctctg tttgaagcag agttttgact 1320
tctcgcattc gtggaatcca ccccttgctt gctttgcttg cttgcttgct tgcttgcttg 1380
cctgcttgct ttgcttgctt gcttgaccag cgtgcgcgct ttcgccagcc tagccttcga 1440
gacctcttga agaccctttg gagcgtctag ttcgaggttc tttctatttg cttcaagaga 1500
gacaaaataa caaagaaaaa gagagaaaaa acaagcaaag aaagaaacaa ggaaacaaac 1560
cacaaagcac gcatcgtgca tccaaacttt catcccccca ctctctctct ctctctctct 1620
ctctctctcc ttcctcggaa aaggagtgag acaaaggcag acagcctcta gcttggcagc 1680
ctcgcagctc gtgcggcgcc agttcctaca gcttcgcgct gtccaaacgc cagtccatcg 1740
cagcttcggc tagctagttg gctgattgat tgattgattg attgatagcc tttattacgg 1800
cgttgattaa ctgattgatt atttgattgc tctggcatcc ctgtaatcac ttgctcaagg 1860
tagtcaatca catcatttat acatctcctc caaagcaaac catctacacg accgcttttt 1920
gatcgatcta aaagtgccgg tcaggtgaca cgcaagctct tttttttgtt tacagtaagc 1980
agcaacaaga aagcaaaaag 2000
<210> 65
<211> 2000
<212> DNA
<213> Ulkenia sp.
<400> 65
gcccaatttg ctcctgatct gttcccatga ttatgatagg gataggtagt agttatagct 60
agactcattc cattcactta atccacatat gcaaattata attttatgtg tcgcatataa 120
actttccaaa ctttaaaatt ttcatttgca ttttatatat agatcacctg tgatcccttt 180
ctcgcccctt tcaacttcca aagtttacct actatcatat ggcatggcgc agccaatgca 240
ctctataaca tataagtaac agagatagtt tttgccgcat catttactct ttactcttgc 300
tatacaaggt aagcgccaag agagttaatt acatctgttt tatcggttcc tagtggaaat 360
aatagtgaca actataatta gtaggagtcc ttattgaccc tagtcatttg agcttgcacc 420
agatttgatg tttttgcaaa cgaccttgac gcagagtgac gagcgaaaat tggatcccct 480
tggttgaagt ctaaactagc ttaaaatata tatgctcttc atataatata aagctgtttt 540
agattctatc aaataagaaa ttgatgactt tgagcaaatt aatatttggt atgggctccg 600
gcatctctga aaacgcttaa atgaagcttt tattcaccac gattcgacaa ctaaggttat 660
tttccacata attataactt ttcctacata actgtgctgt cgactcacac cttctttata 720
tatatagcct cgtagggatt cgaaactatg aattaagact cgttgaagtt tgatttatcc 780
attattttgc tgcacaaact atcgctaaga tataaagatc gtgcccagag cctgctatag 840
ggtcctaatg gcatgcttag cccggatttc cacgataaag ctgcattgta ttgagtatat 900
gcactcagag agtaaacttt aattgcaacg aacaatcttt ggcaagtcat atctcagcca 960
tcaatacatg tattgtgttc aaacgaattg cagcatatca ctcaaattat tttggtctag 1020
ttcagcggaa tcttttggtt gttttagtaa gagttgagta gagtatgttg gatgagtgtg 1080
tccacaaggt tatttgaata gggtatttac attctacaac atagtcagta agctctcgtg 1140
tgataaactg tatcaaaatc gacacaataa caggctagtg gtgccctgtg cacgttttta 1200
ccataacatg acagctacag catcagaaac aggtgtggtg cgcattttgg ttattctgat 1260
cctgaaacct aagaacaatt ttcatcgtct tgctagattg tgttttctgt attccatttg 1320
tggagcttca acatccatgc tgctgagtat tttcacatga agatcatagt gttagaatgt 1380
ttagtaagcc tattactaag ttttgaggta taggtgcttg ttgttgtcct tacataaata 1440
catgctgtct ttagtgctta gaccaacgtt gagtgtatcg tgctcttggc agaagaatag 1500
acatttataa cattatggtg aaaggcgatg gtctcgcttg catgttctcg cttgcgtttg 1560
cgtatcccta tacacttaac cgttgtttat gtgtacctaa gctatcatgc tgcatcttta 1620
caattttata caaataaatt tattttggaa tatataattg gtcactattt caggccagtt 1680
gacagtcctt aagatttgta gttgcgctgt tctcgtagtg agaatgaaga agcggaatct 1740
acatccatct gtgattgcat aagagcttgc ataagagtga agtaggtgaa agtcacagag 1800
aatatcttcc ctactatcct aaaggcaagg aatactacta tacacgaaca tagtaatgga 1860
attttacaca acagaagtac ccttgtctcc tgcctccttt tattattcca ttatgctctg 1920
ttatataatg aatgaagacg acttttaaca tcatttgatt ctcgagcagg cacgcacaat 1980
atagaggaag gattggcgtc 2000
<210> 66
<211> 1212
<212> DNA
<213> Ulkenia sp.
<400> 66
ggcaagaacg tcgttttcga ctatgacgag ctccttgagt tcgccgaggg tgacatcagc 60
aaggtcttcg gccccgaatt cagccagatc gaccagtaca agcgtcgcgt tcgtctcccc 120
gcccgcgagt acctcctcgt cacccgcgtc accctcatgg acgccgaggt caacaactac 180
cgcgtcggtg cccgcatggt cactgagtac gacctccccg tcaacggtga gctctctgag 240
ggtggtgact gcccctgggc cgtgctcgtc gagagtggtc agtgtgatct catgctcatc 300
tcctacatgg gtattgactt ccagaacaag agcgaccgcg tctaccgtct gctcaacacc 360
accctcacct tctacggtgt tgcccaggag ggcgagaccc tggagtacga catccgcgtg 420
accggcttcg ccaagcgtct cgacggtgac atctccatgt tcttcttcga gtacgactgc 480
tacgtcaacg gccgtctcct catcgagatg cgcgacggct gtgccggttt cttcaccaac 540
gaggagctcg ccgccggcaa gggtgtcgtc tttacccgcg ctgatctcct cgcccgcgag 600
aagaccaaga agcaggacat caccccgtac gccattgccc cgcgtcttaa caagaccgtt 660
ctcaacgaga ctgagatgca gtccctcgtg gacaagaact ggaccaaggt tttcggcccc 720
gagaacggca tggaccagat caactacaaa ctctgcgccc gtaagatgct catgattgac 780
cgcgtcacca agattgacta caccggtggc ccctacggcc ttggtcttct cgttggtgag 840
aagatcctcg agcgcgacca ctggtacttt ccgtgccact tcgtcggaga ccaggtcatg 900
gctggatccc tcgtgtctga cggctgcagc cagctcctca agatgtacat gctctggctc 960
ggcctccacc ttaagaccgg tcccttcgac ttccgccccg tcaacggcca ccccaacaag 1020
gtccgctgcc gtggccagat ctccccgcac aagggtaagc tcgtatacgt catggagatc 1080
aaggagatgg gctacgacga ggctggtgac ccgtacgcca tcgccgatgt caacattctc 1140
gacattgact tcgagaaggg ccagactttc gaccttgcca acctccacga gtacggcaag 1200
ggcgacctca ac 1212
<210> 67
<211> 21
<212> DNA
<213> Ulkenia sp.
<400> 67
tggtactttc cgtgccactt c 21
<210> 68
<211> 1197
<212> DNA
<213> Ulkenia sp.
<400> 68
gtgcccggcg agatgccgct ctcgtggtac aacatggctg agttcatggc cggcaaggtc 60
agcctctgcc tcggccctga gttcgccaag ttcgatgact ccaacaccag ccgcagccct 120
gcatgggacc ttgctcttgt gactcgtgtg gtctccgttt ctgacatgga gtgggtccag 180
tggaagaacg tggactgcaa cccgtccaag ggaaccatgg ttggcgagtt cgactgcccc 240
atcgacgcct ggttcttcca gggatcttgt aacgacggcc acatgccgta ctccatcctc 300
atggagatcg ccctccagac ctctggtgtc ctcacctctg tgctcaaggc cccgctcacc 360
atggagaaga aggacattct cttccgcaac cttgacgcca acgccgagat ggttcgctct 420
gatattgacc tccgcggcaa gaccatccac aacctcacca agtgtaccgg ctacagcatg 480
ctcggagaca tgggtgtcca ccgcttcagc ttcgagctct ctgttgatgg tgtagtcttc 540
tacaagggta ccacctcctt cggctggttc gtccctgagg tcttcatctc ccagactggt 600
ctcgacaacg gtcgccgcac ccagccctgg cacattgagt ccaaggtgcc ttccgcccag 660
gtcctcacct acgacgttac ccccaacggt gccggtcgca cccagctcta cgccaacgcc 720
cccaagggcg ctcagctcac tcgccgctgg aaccagtgcc agtaccttga caccatcgac 780
cttgtggtcg ccggtggctc cgccggtctt ggctacggtc atggccgcaa gcaggtgaac 840
cccaaggact ggttcttctc gtgccacttc tggttcgact ccgtcatgcc cggctcgctc 900
ggtgtggagt ctatgttcca gctcgtcgag tccatcgctg tcaagcagga cctcgccggc 960
aagtacggca tcaccaaccc gaccttcgct catgctccgg gcaagatctc ctggaagtac 1020
cgtggtcagc tcacccccac ctccaagttc atggactccg aggcccacat tgtctccatc 1080
gaggcccacg acggcgtcgt cgacatcgtt gccaatggta acctctgggc tgatggcctc 1140
cgcgtctaca acgtcagcaa catccgtgtg cgcattgttg ctggcgccgc ccctgct 1197
<210> 69
<211> 21
<212> DNA
<213> Ulkenia sp.
<400> 69
tggttcttct cgtgccactt c 21
<210> 70
<211> 90
<212> DNA
<213> Ulkenia sp.
<400> 70
gctggcgccg cccctgctgc tgctgctgct gctgctgctg ttgctgctcc ggctgccgcc 60
cctgctccgg ttgctgcatc tggccctgcc 90
<210> 71
<211> 1299
<212> DNA
<213> Ulkenia sp.
<400> 71
gaaggcttca tgaagaccta cggtgttgtg gctcctctct acaccggtgc catggccaag 60
ggtattgcct ctgctgacct tgtgattgcc actggtaagc gcaagatcct cggttccttc 120
ggtgctggcg gtctccccat gcacattgtc cgtgccgctg ttgagaagat ccaggctgag 180
ctcccgaacg gccccttcgc cgtcaacctc atccactccc ccttcgatag caaccttgag 240
aagggcaacg ttgacctctt cctcgagaag ggcgttactg tcgtcgaggc ctccgccttc 300
atgaccttga ccccgcaagt cgtccgctac cgtgctgctg gtctttcccg taacgctgat 360
ggctccatta acatcaagaa ccgcatcatc ggtaaggtct cccgtaccga gctcgctgag 420
atgttcatcc gccctgcccc gcagaacctc ctcgacaagc tcatccagtc tggtgagatt 480
accaaggagc aggctgagct tgccaagctc gtccccgtcg ccgacgacat cgccgtcgag 540
gccgactctg gtggccacac cgacaaccgc cccatccacg tcatcctccc ccttatcatc 600
aacctccgca accgcctcca caaggagtgc ggctaccccg ctcacctccg cgtgcgcgtt 660
ggagctggtg gtggtgttgg atgcccccag gccgctgccg ctgctctcgc tatgggtgct 720
gccttccttg ttaccggcac tgtcaaccag gtcgccaagc agtccggcac ctgcgacaat 780
gtccgcaagc agctctgcat ggccacctac tctgacgtct gcatggctcc cgctgctgac 840
atgttcgagg agggcgtcaa gctccaggtc ctcaagaagg gaaccatgtt cccgtccagg 900
gctaacaagc tctacgagct cttctgcaag tacgactcct tcgagtccat gcctgccaca 960
gagctcgagc gtgttgagaa gcgcatcttc cagtgccctc ttgctgatgt ctgggctgag 1020
acctccgact tctacatcaa ccgcctccac aacccggaga agatcacccg tgccgagcgt 1080
gaccccaagc tcaagatgtc tctctgcttc cgctggtacc ttggtcttgc ctctcgctgg 1140
gccaacaccg gtgaggctgg acgcgtcatg gactaccagg tctggtgtgg ccctgccatt 1200
ggagccttca acgacttcat caagggctcc taccttgacc cggccgtctc tggtgagtac 1260
ccggacgtcg tgcagatcaa cttgcagatc cttcgcggt 1299
<210> 72
<211> 404
<212> PRT
<213> Ulkenia sp.
<400> 72
Gly Lys Asn Val Val Phe Asp Tyr Asp Glu Leu Leu Glu Phe Ala Glu
1 5 10 15
Gly Asp Ile Ser Lys Val Phe Gly Pro Glu Phe Ser Gln Ile Asp Gln
20 25 30
Tyr Lys Arg Arg Val Arg Leu Pro Ala Arg Glu Tyr Leu Leu Val Thr
35 40 45
Arg Val Thr Leu Met Asp Ala Glu Val Asn Asn Tyr Arg Val Gly Ala
50 55 60
Arg Met Val Thr Glu Tyr Asp Leu Pro Val Asn Gly Glu Leu Ser Glu
65 70 75 80
Gly Gly Asp Cys Pro Trp Ala Val Leu Val Glu Ser Gly Gln Cys Asp
85 90 95
Leu Met Leu Ile Ser Tyr Met Gly Ile Asp Phe Gln Asn Lys Ser Asp
100 105 110
Arg Val Tyr Arg Leu Leu Asn Thr Thr Leu Thr Phe Tyr Gly Val Ala
115 120 125
Gln Glu Gly Glu Thr Leu Glu Tyr Asp Ile Arg Val Thr Gly Phe Ala
130 135 140
Lys Arg Leu Asp Gly Asp Ile Ser Met Phe Phe Phe Glu Tyr Asp Cys
145 150 155 160
Tyr Val Asn Gly Arg Leu Leu Ile Glu Met Arg Asp Gly Cys Ala Gly
165 170 175
Phe Phe Thr Asn Glu Glu Leu Ala Ala Gly Lys Gly Val Val Phe Thr
180 185 190
Arg Ala Asp Leu Leu Ala Arg Glu Lys Thr Lys Lys Gln Asp Ile Thr
195 200 205
Pro Tyr Ala Ile Ala Pro Arg Leu Asn Lys Thr Val Leu Asn Glu Thr
210 215 220
Glu Met Gln Ser Leu Val Asp Lys Asn Trp Thr Lys Val Phe Gly Pro
225 230 235 240
Glu Asn Gly Met Asp Gln Ile Asn Tyr Lys Leu Cys Ala Arg Lys Met
245 250 255
Leu Met Ile Asp Arg Val Thr Lys Ile Asp Tyr Thr Gly Gly Pro Tyr
260 265 270
Gly Leu Gly Leu Leu Val Gly Glu Lys Ile Leu Glu Arg Asp His Trp
275 280 285
Tyr Phe Pro Cys His Phe Val Gly Asp Gln Val Met Ala Gly Ser Leu
290 295 300
Val Ser Asp Gly Cys Ser Gln Leu Leu Lys Met Tyr Met Leu Trp Leu
305 310 315 320
Gly Leu His Leu Lys Thr Gly Pro Phe Asp Phe Arg Pro Val Asn Gly
325 330 335
His Pro Asn Lys Val Arg Cys Arg Gly Gln Ile Ser Pro His Lys Gly
340 345 350
Lys Leu Val Tyr Val Met Glu Ile Lys Glu Met Gly Tyr Asp Glu Ala
355 360 365
Gly Asp Pro Tyr Ala Ile Ala Asp Val Asn Ile Leu Asp Ile Asp Phe
370 375 380
Glu Lys Gly Gln Thr Phe Asp Leu Ala Asn Leu His Glu Tyr Gly Lys
385 390 395 400
Gly Asp Leu Asn
<210> 73
<211> 7
<212> PRT
<213> Ulkenia sp.
<400> 73
Trp Tyr Phe Pro Cys His Phe
1 5
<210> 74
<211> 399
<212> PRT
<213> Ulkenia sp.
<400> 74
Val Pro Gly Glu Met Pro Leu Ser Trp Tyr Asn Met Ala Glu Phe Met
1 5 10 15
Ala Gly Lys Val Ser Leu Cys Leu Gly Pro Glu Phe Ala Lys Phe Asp
20 25 30
Asp Ser Asn Thr Ser Arg Ser Pro Ala Trp Asp Leu Ala Leu Val Thr
35 40 45
Arg Val Val Ser Val Ser Asp Met Glu Trp Val Gln Trp Lys Asn Val
50 55 60
Asp Cys Asn Pro Ser Lys Gly Thr Met Val Gly Glu Phe Asp Cys Pro
65 70 75 80
Ile Asp Ala Trp Phe Phe Gln Gly Ser Cys Asn Asp Gly His Met Pro
85 90 95
Tyr Ser Ile Leu Met Glu Ile Ala Leu Gln Thr Ser Gly Val Leu Thr
100 105 110
Ser Val Leu Lys Ala Pro Leu Thr Met Glu Lys Lys Asp Ile Leu Phe
115 120 125
Arg Asn Leu Asp Ala Asn Ala Glu Met Val Arg Ser Asp Ile Asp Leu
130 135 140
Arg Gly Lys Thr Ile His Asn Leu Thr Lys Cys Thr Gly Tyr Ser Met
145 150 155 160
Leu Gly Asp Met Gly Val His Arg Phe Ser Phe Glu Leu Ser Val Asp
165 170 175
Gly Val Val Phe Tyr Lys Gly Thr Thr Ser Phe Gly Trp Phe Val Pro
180 185 190
Glu Val Phe Ile Ser Gln Thr Gly Leu Asp Asn Gly Arg Arg Thr Gln
195 200 205
Pro Trp His Ile Glu Ser Lys Val Pro Ser Ala Gln Val Leu Thr Tyr
210 215 220
Asp Val Thr Pro Asn Gly Ala Gly Arg Thr Gln Leu Tyr Ala Asn Ala
225 230 235 240
Pro Lys Gly Ala Gln Leu Thr Arg Arg Trp Asn Gln Cys Gln Tyr Leu
245 250 255
Asp Thr Ile Asp Leu Val Val Ala Gly Gly Ser Ala Gly Leu Gly Tyr
260 265 270
Gly His Gly Arg Lys Gln Val Asn Pro Lys Asp Trp Phe Phe Ser Cys
275 280 285
His Phe Trp Phe Asp Ser Val Met Pro Gly Ser Leu Gly Val Glu Ser
290 295 300
Met Phe Gln Leu Val Glu Ser Ile Ala Val Lys Gln Asp Leu Ala Gly
305 310 315 320
Lys Tyr Gly Ile Thr Asn Pro Thr Phe Ala His Ala Pro Gly Lys Ile
325 330 335
Ser Trp Lys Tyr Arg Gly Gln Leu Thr Pro Thr Ser Lys Phe Met Asp
340 345 350
Ser Glu Ala His Ile Val Ser Ile Glu Ala His Asp Gly Val Val Asp
355 360 365
Ile Val Ala Asn Gly Asn Leu Trp Ala Asp Gly Leu Arg Val Tyr Asn
370 375 380
Val Ser Asn Ile Arg Val Arg Ile Val Ala Gly Ala Ala Pro Ala
385 390 395
<210> 75
<211> 7
<212> PRT
<213> Ulkenia sp.
<400> 75
Trp Phe Phe Ser Cys His Phe
1 5
<210> 76
<211> 30
<212> PRT
<213> Ulkenia sp.
<400> 76
Ala Gly Ala Ala Pro Ala Ala Ala Ala Ala Ala Ala Ala Val Ala Ala
1 5 10 15
Pro Ala Ala Ala Pro Ala Pro Val Ala Ala Ser Gly Pro Ala
20 25 30
<210> 77
<211> 433
<212> PRT
<213> Ulkenia sp.
<400> 77
Glu Gly Phe Met Lys Thr Tyr Gly Val Val Ala Pro Leu Tyr Thr Gly
1 5 10 15
Ala Met Ala Lys Gly Ile Ala Ser Ala Asp Leu Val Ile Ala Thr Gly
20 25 30
Lys Arg Lys Ile Leu Gly Ser Phe Gly Ala Gly Gly Leu Pro Met His
35 40 45
Ile Val Arg Ala Ala Val Glu Lys Ile Gln Ala Glu Leu Pro Asn Gly
50 55 60
Pro Phe Ala Val Asn Leu Ile His Ser Pro Phe Asp Ser Asn Leu Glu
65 70 75 80
Lys Gly Asn Val Asp Leu Phe Leu Glu Lys Gly Val Thr Val Val Glu
85 90 95
Ala Ser Ala Phe Met Thr Leu Thr Pro Gln Val Val Arg Tyr Arg Ala
100 105 110
Ala Gly Leu Ser Arg Asn Ala Asp Gly Ser Ile Asn Ile Lys Asn Arg
115 120 125
Ile Ile Gly Lys Val Ser Arg Thr Glu Leu Ala Glu Met Phe Ile Arg
130 135 140
Pro Ala Pro Gln Asn Leu Leu Asp Lys Leu Ile Gln Ser Gly Glu Ile
145 150 155 160
Thr Lys Glu Gln Ala Glu Leu Ala Lys Leu Val Pro Val Ala Asp Asp
165 170 175
Ile Ala Val Glu Ala Asp Ser Gly Gly His Thr Asp Asn Arg Pro Ile
180 185 190
His Val Ile Leu Pro Leu Ile Ile Asn Leu Arg Asn Arg Leu His Lys
195 200 205
Glu Cys Gly Tyr Pro Ala His Leu Arg Val Arg Val Gly Ala Gly Gly
210 215 220
Gly Val Gly Cys Pro Gln Ala Ala Ala Ala Ala Leu Ala Met Gly Ala
225 230 235 240
Ala Phe Leu Val Thr Gly Thr Val Asn Gln Val Ala Lys Gln Ser Gly
245 250 255
Thr Cys Asp Asn Val Arg Lys Gln Leu Cys Met Ala Thr Tyr Ser Asp
260 265 270
Val Cys Met Ala Pro Ala Ala Asp Met Phe Glu Glu Gly Val Lys Leu
275 280 285
Gln Val Leu Lys Lys Gly Thr Met Phe Pro Ser Arg Ala Asn Lys Leu
290 295 300
Tyr Glu Leu Phe Cys Lys Tyr Asp Ser Phe Glu Ser Met Pro Ala Thr
305 310 315 320
Glu Leu Glu Arg Val Glu Lys Arg Ile Phe Gln Cys Pro Leu Ala Asp
325 330 335
Val Trp Ala Glu Thr Ser Asp Phe Tyr Ile Asn Arg Leu His Asn Pro
340 345 350
Glu Lys Ile Thr Arg Ala Glu Arg Asp Pro Lys Leu Lys Met Ser Leu
355 360 365
Cys Phe Arg Trp Tyr Leu Gly Leu Ala Ser Arg Trp Ala Asn Thr Gly
370 375 380
Glu Ala Gly Arg Val Met Asp Tyr Gln Val Trp Cys Gly Pro Ala Ile
385 390 395 400
Gly Ala Phe Asn Asp Phe Ile Lys Gly Ser Tyr Leu Asp Pro Ala Val
405 410 415
Ser Gly Glu Tyr Pro Asp Val Val Gln Ile Asn Leu Gln Ile Leu Arg
420 425 430
Gly
<210> 78
<211> 2000
<212> DNA
<213> Ulkenia sp.
<400> 78
gcacgtagag caagaaagaa tgaaagaaag aacgaaagaa agaaagagag agagagagag 60
agagagagag agaaagcgaa gatgatagcg gagagaactc ttcttcgcag tcactctgtt 120
tctcagtcag tcccgcaacc aataacaact cgaactcgca gcagtgttct tcggagtgcc 180
agcgctcgct cgcactgcgt cggcacagca gcagcagcag caggccccgc gctcgctgca 240
ctcagcccgg gcaggagcaa cagctgctga gcagctgagg ccagctggct ggcggctcgc 300
ctcgcctcgc ctcgcgtcgc gtcgcgagag aaagcgatcg accaactgtc aatcgattat 360
tcgagtcctt cgagcgcttt atagggcact gattgatcac tcattgattc attgactcat 420
ttattctttg cgtggtcagc caaacggcgt tagcattggg caaagcgggt ctttgctttg 480
ctctaaaata gatttgctcg cgagagtacg tacttgcagg agtaggtagg ctctgcctag 540
tacctgggca tttgaatatt tgaacttcga acttcgttga gtatctgaat atttgaatat 600
ctgaatattt gaatttcgaa agtttgaata tttgaatatt tgaattttgg aatattggaa 660
tagctgggtt tggagataag acttactaag ctaagcgccg acgtaagagc ggcgagtaaa 720
tccacacaca agagagaggc agagagagag ggagggagac aactcgcgca ggcaagctga 780
gcccactgga cgcacggggc gcgtcccccc tgacgggcgc tctggtggtg gcgtgtttgg 840
gagggttttg catgcttgtg ataggggctc tggcgcgggc tctgtacggt gcttggagat 900
gcacgggcag ggcgagagag gggacgggtt cccgggaggc gctgcttgga ggtgctgaga 960
gggagggaga aggcgtgctt tgcgatgcgc ggggcgacct aggcgctgct gcgcggtgca 1020
gcagcaggga cctcggacgt gagtcgaagc cgtctgcaga ggagatggta gaagggccgc 1080
ggattggtag cagagaagag gaaatagaag aagaagaaga aatagaagaa gaagaaatag 1140
aagaagaaga aatagaagaa gaagaggagg acgggcaggc gggaaagatg gagaaaggac 1200
tcgcggcggg aaaacaagag aatgtgaact tgggcttgaa ctttggtttg aatttgaatg 1260
tggagaacga ggggttgaat ttgagtttga atttgaaaga aaacttacgg aaagaaagtt 1320
tagttgaaag tgagaaagaa aaaaatgaga aagaaaaaga gaaagaaaaa gagaaagaaa 1380
aagagaaaga aaaagagaaa gaaaaagaga aagaaaaaga gaaagaaaaa gagaaagaaa 1440
aagagaaaga aaaagagaaa gaaaaagaga aagaaaaaga gaaagaaaaa gaagaagaaa 1500
aagaagaaga aaaagagaaa gaaaaagaga aagaaaaaga gaaagaaaaa gaagaaggag 1560
atttaaaaag ttgtttagtt gaaaaaggag aaggaggaag aagcagcgac agcggcagaa 1620
gaagaagtag ttgttgtaag aggggaacgg aggcagtagc agtggagcag gcggaggcga 1680
cagcaaacct cgaactcgac cccgtcgagc cgcagcaaga acaagagccc gaccaggtgg 1740
acgaggacga ggtccgcttg ttgtcaggaa caacagaagt tgcaggacta gccgagagtg 1800
ctaccactgc aattcttaga tccacagacg caagagcaga aaacttacaa ctgctcgcca 1860
caacacaaga accaccttca gatacaacca ggttcgagaa ctccacaagt ctagaagcag 1920
caacagctct agcagataat caaacaggtc cagaaaaagc tacgactaga agagaaatta 1980
tcgagtcgca acttgcaacc 2000
<210> 79
<211> 4683
<212> DNA
<213> Ulkenia sp.
<400> 79
gcgagttata tctgtctaga aaacttggca tggctagcaa tttatgtcta gctattccat 60
acacacggta atgccagtag cctgttagtt atagctcttt tggttgttgt ctcacaatac 120
actgacatca gcagaacaaa atgaaagggg ccttggctac catgaaatca atacttcaaa 180
aggtctcttg gtttctttac tcgcatgtcg ctatttactt acattcctcg agtacataac 240
atatcataca tcaaagaaat taaaaagaaa acaaacattc aaatatgcat tactttccct 300
actgtactag taagtacgtt tctggtatta agttgttttt tctcaaaaga acaatgtgct 360
tacttgtaaa atccacagct gcttacttgt aagcctcaac tagttagtga tgtgattatc 420
ataaaatgtt cgacactgta cctcctttcc agctatcttc ctacacctcc tctgacgcag 480
gttgacggag gaggcgtggg ggttgattga agtgcaacac aacgttttgt ttaagatatt 540
ccttgccttg gccgactcca aatggatagc acagaagcct aatgataatt tgaattaatt 600
ttatttcgag cttatttaat gctcttatca gagtccgtag gtatctcttt tcctactaat 660
tgttgaaaaa ggatgttttg gacatagcag gtcatcatac tatttggttc catcaaattc 720
atatccattt ctttcgttca agtgcttccc ttcctactta ttatatatat tatatatcca 780
taaatgtaaa agagacgatt acgaatactt tgcatacatg tatagcgaaa cagagatggt 840
agcaaaagtt caccttcact aatctaagaa tctctccacg tgggtaaaaa cttcagcagt 900
aagattgtaa atgatgtcca agaacaaaac gtcatgctag tccaggggtt actgagctaa 960
cgattaataa tgtttcgtag tcttcctaat tgcaccatca aaacttgtct gcacaagttt 1020
taaagtattg gagcctttac tgaagaatca gaggacatag atggggcacg ttcgccttga 1080
aaaaaatagt cttctttacc tgcatggtgt tacaaacaaa aacgagttga aaatagctgt 1140
gcaaggaggc aaacatgatt ggaaaagaaa aacgagggga cccttataca ggagggcgcc 1200
acatagtaga atgagtagat tgttagagta gggtacgctt tatgtgattg attgaatggg 1260
cgagtgaaag ttgctgtcaa ggttctaaac aaaaggatgt ttgagtttgt gagtattgtt 1320
tgcggcaaaa agattcagta gagagaaatg cacaaaaaga taatacgtgt gtagggcgat 1380
tatggaggca tgcatttggg ggaaatcatc gcatgcgcat gagtttctcc atctgccgaa 1440
tctttgcaaa ggcattttca agctccattt gcatagcgta ggcttgctgc tcaaactgag 1500
cgcgctgatg cgccagattt tcttcatgtc ttttgttcaa actacgctca agaccctcaa 1560
gagccgcaac cttgagcttg cgttcctttt gctgaatctc cataactctt cgtttcacct 1620
ggagctcaat ttctgcagca tccgtggtct ttgcagcggc ctgtgcgtct tgtgcggcct 1680
gtgcgttgtt tgcgagctcc tttcgcagct cctccatctc cgcgttcttt ttctcctcca 1740
tccatttggc accgagtttg gcagcttgat cgatgcggcc cttgagaact tcttcgttct 1800
cctcaagttc tgcgatacgc gcgtgtaagc cgaggatctc ctccgagaca gcctcgccat 1860
tgatcattat ttcacttccc gagtcttgaa tgacaacatc agccttggtg ccaggttcac 1920
cggtatctcg ctcgcaaccc tgctggcgca tagacagcat aaggcgcgca ttatcctcac 1980
gcagatcatc cacctgttct gataaaagtt tgactgcctg ctcaagatta cgggggttca 2040
cttcgtgaaa aatttcttga aggtctcgaa gctcagaaag cttggcagag caagtgtgca 2100
tcgctctgca ctttttaaga cgtgcaagtg catcatcaag tttggcatta tttaccttca 2160
tggaggcttc agctacttcg gcttcttcga ttacaatttt ctgcagctct acaacatcat 2220
ggccaattaa cttgcgatgc agctcggcaa tcaccccatg catcttttcg gtatggcctg 2280
gacgcgcctc atcctgcgtt cttcggatct cctcctctag ttctcgattt agacgaaggg 2340
ctggtccaag gggcgggtaa ttagcctgag tcaagccaag ctctgttgct agtccaaggc 2400
agtcggaaag tcgcagccgg tccctatcag aaacagcctt ttgcaagtct acgctcaaac 2460
gcacttcttg agccttgcgc accatcttcg gttctgcctg tcgcagaagt ttcgagtcgt 2520
agccagcttg ccacgctagc acgatggcac gcgcaagtga cctcagttga ccgctgttca 2580
tggcagactt gagcaacatt ttgatttgca caaatacctc atctgattca tcatcttcag 2640
cttcctcaag ctctgcaggt gtcttgcgct ctccagagac ttgaagagca gggttcaaac 2700
cgccctccag gacctcgctc gcaagcgcct cctctgtctc agctttgcgc aatagcgcag 2760
cagcattctc cgccattgtg tttgtcactc acgagattaa tatcgttgcc agagtatacg 2820
gtaatgcgag ttaaggattc acagaatctc tcaaattaat cttttcacct aatgatatcc 2880
acaaaacgtt gcaatcgctc agcccaacga caagcgtgct tcttgtttta agactgcaac 2940
tgctcctttt tctattagtc aatatggacc gtcctccaaa cgtccagaaa atagcacaga 3000
atttaccagc agccgctgca gacaagaagt gcaagagagc aggcaagcaa gtgagggttt 3060
gagcaaatag gccaacctct ccacgcagaa ttctagggtc gcaaccggaa ctcacagtcc 3120
ttagaaaccg tgcgaagccc tgggctcaac ttcaatttgt ccacgggacc ttcagcaagc 3180
accaagctca gcagcgtgaa ggcaggcgct gaccacagtt tgagctcaga gggcttggtg 3240
tgcctcgcga ttgatattga agtcaattgc gcaggacggc agcaacggac caggtggtga 3300
agaaggtaat ctccagcgga gtgatgatgg agctcgaccg actactccgg aatcgaccag 3360
gggaggtgcg ggcgcccttc acaagcgggc gagaggcagg ggagagaagg ctcgactcca 3420
cgtcttgaag cgtgtacgtg tgcgcgctca cgcgtgcgac acgccggcaa gggcgcctta 3480
gtggcctgct gctgctgctg gtcgccacgc tgcgagccca agagatttga attgaactcg 3540
aagaaaataa ctatcattta tcaattccaa tcaatcaatg cattatgaag cacctctgaa 3600
gtgaactatt ctcctctcca atatacaaca aaaaacacac acagtgggtt ttaccctata 3660
acctattgtt ccgcgagcga tcaactactc tatagagcga atgaccagtt tttctttctt 3720
tctttctttc tttctttctt tctttctttc tttctttctt tctttctttc tttctttctg 3780
ttttcctatc taataacccc tttaatcgag gaaacctttc gatttaaaag gaaagctctg 3840
tctgtatata tctgttacag atactgctat catgccatgc agaaagaaac acaaaagaaa 3900
aacaaaagaa agagagaaag agagaaagaa agagagaaag aaagaaagaa agaaagaaag 3960
aagagctttt ctcaatcggt ttcctcatcg accgctcaca tatctacgat tgtggcaaag 4020
aaagaaagaa agaaagaagg aaagcctcag cagagtccgc acgaaagcct tcattgagcc 4080
accatgtcgt ggtccgctgc agtcagtgcc gcctctctgt gaattgagtg agtgagtgag 4140
tgagtgagtt ggttggttag ttagttagtg cctcttcagc tcaaagcctt tcacggtcgc 4200
tcttcgagcg tttgcttttt cataaacaaa taaacaaacc atcgaacgaa ccatcgaacg 4260
aacgaacaat ggtaccccag aatagacgga attaattgct aagtaaacca gtaacagtaa 4320
gttagtgttt ctgacctgag ccgttttctt tatttattcc tctcagctct gtgaagagaa 4380
tttgggatga aaagaaacgt ttttatttat ttaaaagttt agtaacaaga aaaacatggt 4440
ccctcttctt ccttcatgta aaaataagta agtaaaaaaa agaaaagaaa aaaaaaaaag 4500
cttttaaagt agtaaagcga ggtagagata aaagttcttt ctcagggctc ctagtaggca 4560
cttaggaggt acgtctaaga ccgcctcgtg ggaagaaaag agaaaacaag aagagaaaag 4620
agagagagaa acagcgctga cccgagaggc tcatgcgcag agcccaaatc tgcccaactt 4680
tgg 4683
<210> 80
<211> 1848
<212> PRT
<213> Ulkenia sp.
<400> 80
Met Leu Val Ile Gly Ala Leu Ala Arg Ala Leu Tyr Gly Ala Trp Arg
1 5 10 15
Cys Thr Gly Arg Ala Arg Glu Gly Thr Gly Ser Arg Glu Ala Leu Leu
20 25 30
Gly Gly Ala Glu Arg Glu Gly Glu Gly Val Leu Cys Asp Ala Arg Gly
35 40 45
Asp Leu Gly Ala Ala Ala Arg Cys Ser Ser Arg Asp Leu Gly Arg Glu
50 55 60
Ser Lys Pro Ser Ala Glu Glu Met Val Glu Gly Pro Arg Ile Gly Ser
65 70 75 80
Arg Glu Glu Glu Ile Glu Glu Glu Glu Glu Ile Glu Glu Glu Glu Ile
85 90 95
Glu Glu Glu Glu Ile Glu Glu Glu Glu Glu Asp Gly Gln Ala Gly Lys
100 105 110
Met Glu Lys Gly Leu Ala Ala Gly Lys Gln Glu Asn Val Asn Leu Gly
115 120 125
Leu Asn Phe Gly Leu Asn Leu Asn Val Glu Asn Glu Gly Leu Asn Leu
130 135 140
Ser Leu Asn Leu Lys Glu Asn Leu Arg Lys Glu Ser Leu Val Glu Ser
145 150 155 160
Glu Lys Glu Lys Asn Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu
165 170 175
Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu
180 185 190
Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu
195 200 205
Lys Glu Lys Glu Lys Glu Glu Glu Lys Glu Glu Glu Lys Glu Lys Glu
210 215 220
Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Glu Gly Asp Leu Lys Ser
225 230 235 240
Cys Leu Val Glu Lys Gly Glu Gly Gly Arg Ser Ser Asp Ser Gly Arg
245 250 255
Arg Arg Ser Ser Cys Cys Lys Arg Gly Thr Glu Ala Val Ala Val Glu
260 265 270
Gln Ala Glu Ala Thr Ala Asn Leu Glu Leu Asp Pro Val Glu Pro Gln
275 280 285
Gln Glu Gln Glu Pro Asp Gln Val Asp Glu Asp Glu Val Arg Leu Leu
290 295 300
Ser Gly Thr Thr Glu Val Ala Gly Leu Ala Glu Ser Ala Thr Thr Ala
305 310 315 320
Ile Leu Arg Ser Thr Asp Ala Arg Ala Glu Asn Leu Gln Leu Leu Ala
325 330 335
Thr Thr Gln Glu Pro Pro Ser Asp Thr Thr Arg Phe Glu Asn Ser Thr
340 345 350
Ser Leu Glu Ala Ala Thr Ala Leu Ala Asp Asn Gln Thr Gly Pro Glu
355 360 365
Lys Ala Thr Thr Arg Arg Glu Ile Ile Glu Ser Gln Leu Ala Thr Met
370 375 380
Ala Thr Arg Val Lys Thr Asn Lys Lys Pro Cys Trp Glu Met Thr Lys
385 390 395 400
Glu Glu Leu Thr Ser Gly Lys Asn Val Val Phe Asp Tyr Asp Glu Leu
405 410 415
Leu Glu Phe Ala Glu Gly Asp Ile Ser Lys Val Phe Gly Pro Glu Phe
420 425 430
Ser Gln Ile Asp Gln Tyr Lys Arg Arg Val Arg Leu Pro Ala Arg Glu
435 440 445
Tyr Leu Leu Val Thr Arg Val Thr Leu Met Asp Ala Glu Val Asn Asn
450 455 460
Tyr Arg Val Gly Ala Arg Met Val Thr Glu Tyr Asp Leu Pro Val Asn
465 470 475 480
Gly Glu Leu Ser Glu Gly Gly Asp Cys Pro Trp Ala Val Leu Val Glu
485 490 495
Ser Gly Gln Cys Asp Leu Met Leu Ile Ser Tyr Met Gly Ile Asp Phe
500 505 510
Gln Asn Lys Ser Asp Arg Val Tyr Arg Leu Leu Asn Thr Thr Leu Thr
515 520 525
Phe Tyr Gly Val Ala Gln Glu Gly Glu Thr Leu Glu Tyr Asp Ile Arg
530 535 540
Val Thr Gly Phe Ala Lys Arg Leu Asp Gly Asp Ile Ser Met Phe Phe
545 550 555 560
Phe Glu Tyr Asp Cys Tyr Val Asn Gly Arg Leu Leu Ile Glu Met Arg
565 570 575
Asp Gly Cys Ala Gly Phe Phe Thr Asn Glu Glu Leu Ala Ala Gly Lys
580 585 590
Gly Val Val Phe Thr Arg Ala Asp Leu Leu Ala Arg Glu Lys Thr Lys
595 600 605
Lys Gln Asp Ile Thr Pro Tyr Ala Ile Ala Pro Arg Leu Asn Lys Thr
610 615 620
Val Leu Asn Glu Thr Glu Met Gln Ser Leu Val Asp Lys Asn Trp Thr
625 630 635 640
Lys Val Phe Gly Pro Glu Asn Gly Met Asp Gln Ile Asn Tyr Lys Leu
645 650 655
Cys Ala Arg Lys Met Leu Met Ile Asp Arg Val Thr Lys Ile Asp Tyr
660 665 670
Thr Gly Gly Pro Tyr Gly Leu Gly Leu Leu Val Gly Glu Lys Ile Leu
675 680 685
Glu Arg Asp His Trp Tyr Phe Pro Cys His Phe Val Gly Asp Gln Val
690 695 700
Met Ala Gly Ser Leu Val Ser Asp Gly Cys Ser Gln Leu Leu Lys Met
705 710 715 720
Tyr Met Leu Trp Leu Gly Leu His Leu Lys Thr Gly Pro Phe Asp Phe
725 730 735
Arg Pro Val Asn Gly His Pro Asn Lys Val Arg Cys Arg Gly Gln Ile
740 745 750
Ser Pro His Lys Gly Lys Leu Val Tyr Val Met Glu Ile Lys Glu Met
755 760 765
Gly Tyr Asp Glu Ala Gly Asp Pro Tyr Ala Ile Ala Asp Val Asn Ile
770 775 780
Leu Asp Ile Asp Phe Glu Lys Gly Gln Thr Phe Asp Leu Ala Asn Leu
785 790 795 800
His Glu Tyr Gly Lys Gly Asp Leu Asn Lys Lys Ile Val Val Asp Phe
805 810 815
Lys Gly Ile Ala Leu Lys Leu Gln Lys Arg Ser Gly Pro Ala Val Val
820 825 830
Ala Pro Glu Lys Pro Leu Ala Leu Asn Lys Asp Leu Cys Ala Pro Ala
835 840 845
Val Glu Ala Ile Pro Glu His Ile Leu Lys Gly Asp Ala Leu Ala Pro
850 855 860
Asn Gln Met Thr Trp His Pro Met Ser Lys Ile Ala Gly Asn Pro Thr
865 870 875 880
Pro Ser Phe Ser Pro Ser Ala Tyr Pro Pro Arg Pro Ile Thr Phe Thr
885 890 895
Pro Phe Pro Gly Asn Lys Asn Asp Asn Asn His Val Pro Gly Glu Met
900 905 910
Pro Leu Ser Trp Tyr Asn Met Ala Glu Phe Met Ala Gly Lys Val Ser
915 920 925
Leu Cys Leu Gly Pro Glu Phe Ala Lys Phe Asp Asp Ser Asn Thr Ser
930 935 940
Arg Ser Pro Ala Trp Asp Leu Ala Leu Val Thr Arg Val Val Ser Val
945 950 955 960
Ser Asp Met Glu Trp Val Gln Trp Lys Asn Val Asp Cys Asn Pro Ser
965 970 975
Lys Gly Thr Met Val Gly Glu Phe Asp Cys Pro Ile Asp Ala Trp Phe
980 985 990
Phe Gln Gly Ser Cys Asn Asp Gly His Met Pro Tyr Ser Ile Leu Met
995 1000 1005
Glu Ile Ala Leu Gln Thr Ser Gly Val Leu Thr Ser Val Leu Lys Ala
1010 1015 1020
Pro Leu Thr Met Glu Lys Lys Asp Ile Leu Phe Arg Asn Leu Asp Ala
1025 1030 1035 1040
Asn Ala Glu Met Val Arg Ser Asp Ile Asp Leu Arg Gly Lys Thr Ile
1045 1050 1055
His Asn Leu Thr Lys Cys Thr Gly Tyr Ser Met Leu Gly Asp Met Gly
1060 1065 1070
Val His Arg Phe Ser Phe Glu Leu Ser Val Asp Gly Val Val Phe Tyr
1075 1080 1085
Lys Gly Thr Thr Ser Phe Gly Trp Phe Val Pro Glu Val Phe Ile Ser
1090 1095 1100
Gln Thr Gly Leu Asp Asn Gly Arg Arg Thr Gln Pro Trp His Ile Glu
1105 1110 1115 1120
Ser Lys Val Pro Ser Ala Gln Val Leu Thr Tyr Asp Val Thr Pro Asn
1125 1130 1135
Gly Ala Gly Arg Thr Gln Leu Tyr Ala Asn Ala Pro Lys Gly Ala Gln
1140 1145 1150
Leu Thr Arg Arg Trp Asn Gln Cys Gln Tyr Leu Asp Thr Ile Asp Leu
1155 1160 1165
Val Val Ala Gly Gly Ser Ala Gly Leu Gly Tyr Gly His Gly Arg Lys
1170 1175 1180
Gln Val Asn Pro Lys Asp Trp Phe Phe Ser Cys His Phe Trp Phe Asp
1185 1190 1195 1200
Ser Val Met Pro Gly Ser Leu Gly Val Glu Ser Met Phe Gln Leu Val
1205 1210 1215
Glu Ser Ile Ala Val Lys Gln Asp Leu Ala Gly Lys Tyr Gly Ile Thr
1220 1225 1230
Asn Pro Thr Phe Ala His Ala Pro Gly Lys Ile Ser Trp Lys Tyr Arg
1235 1240 1245
Gly Gln Leu Thr Pro Thr Ser Lys Phe Met Asp Ser Glu Ala His Ile
1250 1255 1260
Val Ser Ile Glu Ala His Asp Gly Val Val Asp Ile Val Ala Asn Gly
1265 1270 1275 1280
Asn Leu Trp Ala Asp Gly Leu Arg Val Tyr Asn Val Ser Asn Ile Arg
1285 1290 1295
Val Arg Ile Val Ala Gly Ala Ala Pro Ala Ala Ala Ala Ala Ala Ala
1300 1305 1310
Ala Val Ala Ala Pro Ala Ala Ala Pro Ala Pro Val Ala Ala Ser Gly
1315 1320 1325
Pro Ala Gln Thr Ile Thr Leu Lys Gln Leu Lys Ala Glu Leu Leu Asp
1330 1335 1340
Val Glu Lys Pro Leu Tyr Ile Ser Ser Ser Asn Gly Gln Val Lys Lys
1345 1350 1355 1360
His Ala Asp Val Ala Gly Gly Gln Ala Thr Ile Val Gln Ala Cys Ser
1365 1370 1375
Leu Ser Asp Leu Gly Asp Glu Gly Phe Met Lys Thr Tyr Gly Val Val
1380 1385 1390
Ala Pro Leu Tyr Thr Gly Ala Met Ala Lys Gly Ile Ala Ser Ala Asp
1395 1400 1405
Leu Val Ile Ala Thr Gly Lys Arg Lys Ile Leu Gly Ser Phe Gly Ala
1410 1415 1420
Gly Gly Leu Pro Met His Ile Val Arg Ala Ala Val Glu Lys Ile Gln
1425 1430 1435 1440
Ala Glu Leu Pro Asn Gly Pro Phe Ala Val Asn Leu Ile His Ser Pro
1445 1450 1455
Phe Asp Ser Asn Leu Glu Lys Gly Asn Val Asp Leu Phe Leu Glu Lys
1460 1465 1470
Gly Val Thr Val Val Glu Ala Ser Ala Phe Met Thr Leu Thr Pro Gln
1475 1480 1485
Val Val Arg Tyr Arg Ala Ala Gly Leu Ser Arg Asn Ala Asp Gly Ser
1490 1495 1500
Ile Asn Ile Lys Asn Arg Ile Ile Gly Lys Val Ser Arg Thr Glu Leu
1505 1510 1515 1520
Ala Glu Met Phe Ile Arg Pro Ala Pro Gln Asn Leu Leu Asp Lys Leu
1525 1530 1535
Ile Gln Ser Gly Glu Ile Thr Lys Glu Gln Ala Glu Leu Ala Lys Leu
1540 1545 1550
Val Pro Val Ala Asp Asp Ile Ala Val Glu Ala Asp Ser Gly Gly His
1555 1560 1565
Thr Asp Asn Arg Pro Ile His Val Ile Leu Pro Leu Ile Ile Asn Leu
1570 1575 1580
Arg Asn Arg Leu His Lys Glu Cys Gly Tyr Pro Ala His Leu Arg Val
1585 1590 1595 1600
Arg Val Gly Ala Gly Gly Gly Val Gly Cys Pro Gln Ala Ala Ala Ala
1605 1610 1615
Ala Leu Ala Met Gly Ala Ala Phe Leu Val Thr Gly Thr Val Asn Gln
1620 1625 1630
Val Ala Lys Gln Ser Gly Thr Cys Asp Asn Val Arg Lys Gln Leu Cys
1635 1640 1645
Met Ala Thr Tyr Ser Asp Val Cys Met Ala Pro Ala Ala Asp Met Phe
1650 1655 1660
Glu Glu Gly Val Lys Leu Gln Val Leu Lys Lys Gly Thr Met Phe Pro
1665 1670 1675 1680
Ser Arg Ala Asn Lys Leu Tyr Glu Leu Phe Cys Lys Tyr Asp Ser Phe
1685 1690 1695
Glu Ser Met Pro Ala Thr Glu Leu Glu Arg Val Glu Lys Arg Ile Phe
1700 1705 1710
Gln Cys Pro Leu Ala Asp Val Trp Ala Glu Thr Ser Asp Phe Tyr Ile
1715 1720 1725
Asn Arg Leu His Asn Pro Glu Lys Ile Thr Arg Ala Glu Arg Asp Pro
1730 1735 1740
Lys Leu Lys Met Ser Leu Cys Phe Arg Trp Tyr Leu Gly Leu Ala Ser
1745 1750 1755 1760
Arg Trp Ala Asn Thr Gly Glu Ala Gly Arg Val Met Asp Tyr Gln Val
1765 1770 1775
Trp Cys Gly Pro Ala Ile Gly Ala Phe Asn Asp Phe Ile Lys Gly Ser
1780 1785 1790
Tyr Leu Asp Pro Ala Val Ser Gly Glu Tyr Pro Asp Val Val Gln Ile
1795 1800 1805
Asn Leu Gln Ile Leu Arg Gly Ala Cys Tyr Leu Arg Arg Leu Asn Val
1810 1815 1820
Ile Arg Asn Asp Pro Arg Val Ser Ile Glu Val Glu Asp Ala Glu Phe
1825 1830 1835 1840
Val Tyr Glu Pro Thr Asn Ala Leu
1845
<210> 81
<211> 18
<212> DNA
<213> artificail sequence
<400> 81
ctcggcattg actccatc 18
<210> 82
<211> 18
<212> DNA
<213> artificail sequence
<400> 82
gagaatctcg acacgctt 18
<210> 83
<211> 21
<212> DNA
<213> artificail sequence
<400> 83
attactcctc tctgcatccg t 21
<210> 84
<211> 21
<212> DNA
<213> artificail sequence
<400> 84
gccgaagaca gcatcaaact c 21
<210> 85
<211> 21
<212> DNA
<213> artificail sequence
<400> 85
gtcgagagtg gccagtgcga t 21
<210> 86
<211> 21
<212> DNA
<213> artificail sequence
<400> 86
aaagtggcag ggaaagtacc a 21
Claims (17)
- a. 서열번호 6(ORF 1), 7(ORF 2), 8 및/또는 80(ORF 3)중 적어도 하나의 아미노산 서열을 포함하는 것으로, 적어도 70%, 바람직하게는 80%, 더 바람직하게는 적어도 90%, 더욱 바람직하게는 적어도 99% 그리고 매우 특히 바람직하게는 100%의 상동성 서열을 가지고, 적어도 하나의 PUFA-PKS 도메인의 생물학적 활성을 가지거나, 또는b. 서열번호 32, 34, 45, 58, 59, 60, 61, 72, 74 및/또는 77 중 적어도 하나의 아미노산 서열을 포함하고, 적어도 그와 70%, 바람직하게는 80%, 더 바람직하게는 적어도 90%, 더욱 바람직하게는 적어도 99% 그리고 매우 특히 바람직하게는 100%의 상동성 서열을 가지고, 적어도 하나의 PUFA-PKS 도메인의 생물학적 활성을 가지는 것을 특징으로 하는 PUFA-PKS.
- 10개 이상의 ACP 도메인을 갖는, 제 1항에 따라 분리된 PUFA-PKS.
- 제 1항 또는 제 2항에 있어서, 상기 PUFA-PKS는 서열번호 6(ORF 1), 7(ORF 2), 및 8 및/또는 80(ORF 3) 서열의 적어도 500개의 연속적인 아미노산과 적어도 70%, 바람직하게는 적어도 80%, 더 바람직하게는 적어도 90%, 더욱 바람직하게는 적어도 99%의 동종성을 가지는, 적어도 하나의 아미노산 서열을 포함하고, PUFA-PKS의 적어도 하나의 도메인의 생물학적 활성을 갖는 것을 특징으로 하는 PUFA-PKS.
- 서열번호 6(ORF 1), 7(ORF 2) 및 8 및/또는 80(ORF 3) 서열의 적어도 500개의 연속적인 아미노산 서열과 적어도 70%, 바람직하게는 적어도 80%, 더 바람직하게는 적어도 90%, 더욱 바람직하게는 적어도 99% 동일하며, PUFA-PKS의 적어도 하나의 도메인의 생물학적 활성을 갖는 아미노산 서열.
- 제 1항 내지 제 4항 중 어느 한 항에 따른 아미노산 서열을 암호화하는 분리된 DNA 분자와, 그에 완전히 상보적인 DNA.
- 제 5항에 있어서, 분리된 DNA 분자가 서열번호 3, 4 및 5 및/또는 9 서열의 적어도 500개 연속적인 뉴클레오타이드와, 적어도 70%, 바람직하게는 적어도 80%, 더 바람직하게는 적어도 90%, 더욱 바람직하게는 적어도 95% 상동성을 갖는 것을 특징으로 하는 분리된 DNA 분자.
- 제 5항 또는 제 6항에 있어서, 서열번호 6(ORF 1), 7(ORF 2)와 8 및/또는 80(ORF 3) 서열의 적어도 500개의 연속적인 아미노산과 적어도 70% 상동성을 가지는 아미노산 서열을 코드하는 것을 특징으로 하는 DNA 분자,
- 전사를 조절하는 적어도 하나의 DNA 서열과 기능적으로 연결된 것으로, 바람직하게는 서열번호 XX-YY(종결부위/프로모터) 또는 그의 기능적 변이체뿐만 아니라 적어도 500개의 뉴클레오타이드의 일부분으로 구성되는 그룹으로부터 선택된, 제 5항, 제6항 및/또는 제7항 중 어느 한 항에 따른 DNA 분자 중 하나를 포함하는 재조합 DNA 분자.
- 제 8항에 따른 재조합 DNA 분자를 포함하는 재조합 숙주 세포.
- 제 9항에 있어서, 적어도 하나 이상의 PUFA-PKS의 추가 도메인의 활성을 갖는 제 1항에 따른 PUFA-PKS를 내생적으로 발현하는 재조합 숙주 세포.
- 전사를 조절하는 원소가 서열번호 XX-YY(종결부위/프로모터) 또는 그의 기능적 변이체뿐 아니라 적어도 500개 뉴클레오타이드의 일부분으로 구성되는 그룹으로부터 선택되는 재조합 DNA 구조물을 포함하는 재조합 숙주 세포.
- 제 9항 또는 제10항에 따른 숙주 세포의 배양물을 포함하는, PUFA, 바람직하게는 DHA를 함유하는 오일을 생산하는 방법.
- 제 12항의 방법에 따라 생산된 오일.
- 제 9항 또는 제10항에 따른 숙주 세포의 배양물을 포함하는, PUFA, 바람직하게는 DHA를 함유하는 바이오매스를 생산하는 방법.
- 제 14항의 방법에 따라 생산된 바이오매스.
- 제 8항에 따른 핵산 및/또는 제 1항에 따른 아미노산 서열 또는 이들에 상동 적인 적어도 50개의 연속적인 아미노산 부분을 포함하는, 제 15항에 따른 재조합 바이오매스.
- PUFA-PKS를 포함하는 서열번호 6, 7, 8 및/또는 80의 독립적인 효소 도메인의 인공적인 폴리케타이드를 생산하기 위한 용도.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE102004017370A DE102004017370A1 (de) | 2004-04-08 | 2004-04-08 | PUFA-PKS Gene aus Ulkenia |
DE102004017370.2 | 2004-04-08 | ||
PCT/EP2005/003701 WO2005097982A2 (de) | 2004-04-08 | 2005-04-08 | Pufa-pks gene aus ulkenia |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020137020015A Division KR20130114225A (ko) | 2004-04-08 | 2005-04-08 | 울케니아의 pufa―pks 유전자 |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20070056002A true KR20070056002A (ko) | 2007-05-31 |
KR101484097B1 KR101484097B1 (ko) | 2015-01-23 |
Family
ID=35062272
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020067023437A KR101484097B1 (ko) | 2004-04-08 | 2005-04-08 | 울케니아의 pufa―pks 유전자 |
KR1020137020015A KR20130114225A (ko) | 2004-04-08 | 2005-04-08 | 울케니아의 pufa―pks 유전자 |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020137020015A KR20130114225A (ko) | 2004-04-08 | 2005-04-08 | 울케니아의 pufa―pks 유전자 |
Country Status (11)
Country | Link |
---|---|
US (1) | US7939305B2 (ko) |
EP (1) | EP1733029A2 (ko) |
JP (2) | JP2007532104A (ko) |
KR (2) | KR101484097B1 (ko) |
CN (2) | CN101087882A (ko) |
AU (1) | AU2005231964B2 (ko) |
BR (1) | BRPI0509747A (ko) |
CA (1) | CA2563427A1 (ko) |
DE (1) | DE102004017370A1 (ko) |
IL (1) | IL178613A0 (ko) |
WO (1) | WO2005097982A2 (ko) |
Families Citing this family (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5340742A (en) | 1988-09-07 | 1994-08-23 | Omegatech Inc. | Process for growing thraustochytrium and schizochytrium using non-chloride salts to produce a microfloral biomass having omega-3-highly unsaturated fatty acids |
US8003772B2 (en) | 1999-01-14 | 2011-08-23 | Martek Biosciences Corporation | Chimeric PUFA polyketide synthase systems and uses thereof |
KR20090064603A (ko) * | 2000-01-28 | 2009-06-19 | 마텍 바이오싸이언스스 코포레이션 | 발효기 내에서 진핵 미생물의 고밀도 배양에 의한 고도불포화 지방산을 함유하는 지질의 증진된 생산 방법 |
CA3056110C (en) | 2004-04-22 | 2020-07-14 | Surinder Pal Singh | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
DK1756280T3 (en) | 2004-04-22 | 2015-02-02 | Commw Scient Ind Res Org | SYNTHESIS OF CHAIN, polyunsaturated fatty acids BY RECOMBINANT CELLS |
CA2647150A1 (en) | 2006-03-15 | 2007-09-20 | Martek Biosciences Corporation | Plant seed oils containing polyunsaturated fatty acids |
EP2059588A4 (en) | 2006-08-29 | 2010-07-28 | Commw Scient Ind Res Org | FATTY ACID SYNTHESIS |
ES2644883T3 (es) | 2008-11-18 | 2017-11-30 | Commonwealth Scientific And Industrial Research Organisation | Enzimas y métodos para producir ácidos grasos omega-3 |
CN102741267B (zh) | 2009-03-19 | 2020-06-23 | 帝斯曼知识产权资产有限公司 | 多不饱和脂肪酸合酶核酸分子和多肽,及其组合物、制备方法和用途 |
CN103415611B (zh) | 2010-10-01 | 2016-08-10 | 国立大学法人九州大学 | 原生藻菌的转化方法 |
US8816111B2 (en) | 2012-06-15 | 2014-08-26 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising polyunsaturated fatty acids |
CN111154724B (zh) | 2013-12-18 | 2024-02-06 | 联邦科学技术研究组织 | 包含二十二碳六烯酸的提取的植物脂质 |
WO2015196250A1 (en) | 2014-06-27 | 2015-12-30 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising docosapentaenoic acid |
EP3099782B1 (en) | 2014-01-28 | 2019-03-20 | DSM IP Assets B.V. | Factors for the production and accumulation of polyunsaturated fatty acids (pufas) derived from pufa synthases |
CA2977271A1 (en) * | 2015-03-02 | 2016-09-09 | Synthetic Genomics, Inc. | Regulatory elements from labyrinthulomycetes microorganisms |
CA3017225A1 (en) | 2016-03-16 | 2017-09-21 | Synthetic Genomics, Inc. | Production of proteins in labyrinthulomycetes |
CN109477079A (zh) | 2016-05-12 | 2019-03-15 | 帝斯曼知识产权资产管理有限公司 | 增加微藻中ω-3多不饱和脂肪酸产量的方法 |
JOP20170154B1 (ar) | 2016-08-01 | 2023-03-28 | Omeros Corp | تركيبات وطرق لتثبيط masp-3 لعلاج أمراض واضطرابات مختلفة |
US10633454B2 (en) | 2016-11-01 | 2020-04-28 | Conagen Inc. | Expression of modified glycoproteins and glycopeptides |
WO2018219171A1 (zh) * | 2017-05-31 | 2018-12-06 | 厦门汇盛生物有限公司 | 一株生产dha和epa的细菌、该细菌基因组中的6个基因片段及它们的应用 |
CN108753810B (zh) * | 2018-05-22 | 2021-06-18 | 昆明理工大学 | 一种转录调节蛋白基因orf2的用途 |
JPWO2020032258A1 (ja) * | 2018-08-10 | 2021-08-12 | 協和発酵バイオ株式会社 | 多価不飽和脂肪酸を生産する微生物及び多価不飽和脂肪酸の製造法 |
CN112601808A (zh) * | 2018-08-10 | 2021-04-02 | 协和发酵生化株式会社 | 生产二十碳五烯酸的微生物和二十碳五烯酸的制造方法 |
CN110577921B (zh) * | 2019-05-28 | 2021-04-02 | 浙江工业大学 | 产两性霉素b的重组结节链霉菌及其应用 |
CN114107074B (zh) * | 2021-11-18 | 2024-04-09 | 厦门大学 | 一种过表达3-酮酰基合酶基因的裂殖壶菌基因工程菌株的构建方法及其应用 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5798259A (en) * | 1992-05-15 | 1998-08-25 | Sagami Chemical Research Center | Gene coding for eicosapentaenoic acid synthesizing enzymes and process for production of eicosapentaenoic acid |
EP2280062A3 (en) * | 1996-03-28 | 2011-09-28 | DSM IP Assets B.V. | Preparation of microbial polyunsaturated fatty acid containing oil from pasteurised biomass |
US6566583B1 (en) * | 1997-06-04 | 2003-05-20 | Daniel Facciotti | Schizochytrium PKS genes |
TWI426126B (zh) * | 2001-04-16 | 2014-02-11 | Dsm Ip Assets Bv | 多不飽和脂肪酸(pufa)聚乙醯合成酶系統及其用途(二) |
-
2004
- 2004-04-08 DE DE102004017370A patent/DE102004017370A1/de not_active Withdrawn
-
2005
- 2005-04-08 AU AU2005231964A patent/AU2005231964B2/en not_active Ceased
- 2005-04-08 KR KR1020067023437A patent/KR101484097B1/ko not_active IP Right Cessation
- 2005-04-08 CN CNA2005800188787A patent/CN101087882A/zh active Pending
- 2005-04-08 KR KR1020137020015A patent/KR20130114225A/ko not_active Application Discontinuation
- 2005-04-08 US US11/547,921 patent/US7939305B2/en not_active Expired - Fee Related
- 2005-04-08 EP EP05751638A patent/EP1733029A2/de not_active Ceased
- 2005-04-08 CA CA002563427A patent/CA2563427A1/en not_active Abandoned
- 2005-04-08 CN CN201410175661.8A patent/CN103981156A/zh active Pending
- 2005-04-08 BR BRPI0509747-9A patent/BRPI0509747A/pt not_active IP Right Cessation
- 2005-04-08 JP JP2007506732A patent/JP2007532104A/ja active Pending
- 2005-04-08 WO PCT/EP2005/003701 patent/WO2005097982A2/de active Application Filing
-
2006
- 2006-10-15 IL IL178613A patent/IL178613A0/en unknown
-
2012
- 2012-07-27 JP JP2012167374A patent/JP2012205595A/ja active Pending
Also Published As
Publication number | Publication date |
---|---|
DE102004017370A1 (de) | 2005-10-27 |
AU2005231964B2 (en) | 2012-03-08 |
US20090093033A1 (en) | 2009-04-09 |
CA2563427A1 (en) | 2005-10-20 |
IL178613A0 (en) | 2007-02-11 |
CN103981156A (zh) | 2014-08-13 |
AU2005231964A1 (en) | 2005-10-20 |
WO2005097982A2 (de) | 2005-10-20 |
WO2005097982A3 (de) | 2007-04-05 |
BRPI0509747A (pt) | 2007-09-25 |
KR20130114225A (ko) | 2013-10-16 |
JP2012205595A (ja) | 2012-10-25 |
EP1733029A2 (de) | 2006-12-20 |
CN101087882A (zh) | 2007-12-12 |
JP2007532104A (ja) | 2007-11-15 |
US7939305B2 (en) | 2011-05-10 |
KR101484097B1 (ko) | 2015-01-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR101484097B1 (ko) | 울케니아의 pufa―pks 유전자 | |
AU2018203835B2 (en) | Recombinant dna constructs and methods for modulating expression of a target gene | |
AU2017248519B2 (en) | Isolated Polynucleotides And Polypetides, And Methods Of Using Same For Increasing Nitrogen Use Efficiency, Yield, Growth Rate, Vigor, Biomass, Oil Content, And/Or Abiotic Stress Tolerance | |
AU2016202373C1 (en) | Isolated Polynucleotides and Polypeptides and Methods of Using Same for Increasing Plant Yield | |
AU2020223681B2 (en) | Plant regulatory elements and uses thereof | |
KR102184432B1 (ko) | 식물체에서 dha 및 다른 lc-pufa의 생산 | |
AU2021225152A1 (en) | Isolated polypeptides and polynucleotides useful for increasing nitrogen use efficiency, abiotic stress tolerance, yield and biomass in plants | |
KR101524398B1 (ko) | Pufa 폴리케티드 신타제 시스템을 이용한 이종 생물체내 다불포화 지방산의 제조 | |
AU2021282499A1 (en) | Isolated polynucleotides and polypeptides, and methods of using same for increasing nitrogen use efficiency, yield, growth rate, vigor, biomass, oil content, and/or abiotic stress tolerance | |
KR20070084187A (ko) | Pufa 폴리케티드 신타제 시스템 및 그의 용도 | |
KR20180127526A (ko) | 식물에서 dha 및 다른 lc-pufas의 생산 | |
BRPI0618965A2 (pt) | construção de ácido nucléico, polipeptìdeo isolado compreendendo uma sequência de aminoácido, célula de planta compreendendo um polinucleotìdeo exógeno, método para aumentar a toleráncia de uma planta a uma condição de estresse, método para aumentar a biomassa, vigor e/ou rendimento de uma planta, método para aumentar a eficiência do uso de fertilizante e/ou absorção de uma planta e célula de planta | |
KR20200111172A (ko) | 네페탈락톨 산화 환원 효소, 네페탈락톨 합성 효소, 및 네페탈락톤을 생산할 수 있는 미생물 | |
RU2728854C2 (ru) | Получение омега 3 длинноцепочечных полиненасыщенных жирных кислот из масличных культур при использовании синтаз pufa траустохидридов | |
AU2022202318A1 (en) | Methods of increasing specific plants traits by over-expressing polypeptides in a plant | |
CN1352680A (zh) | 分离自植物细胞的组合物以及它们在植物细胞信号修饰中的应用 | |
AU2020210193B2 (en) | Isolated polynucleotides and polypeptides, and methods of using same for increasing plant yield and/or agricultural characteristics | |
AU2017204404B2 (en) | Isolated Polynucleotides and Polypeptides, and Methods of Using Same for Increasing Plant Yield and/or Agricultural Characteristics | |
CN116635401A (zh) | 具有改进性能的基因修饰的甲基菌 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
AMND | Amendment | ||
E902 | Notification of reason for refusal | ||
AMND | Amendment | ||
E902 | Notification of reason for refusal | ||
AMND | Amendment | ||
E601 | Decision to refuse application | ||
A107 | Divisional application of patent | ||
AMND | Amendment | ||
J201 | Request for trial against refusal decision | ||
B601 | Maintenance of original decision after re-examination before a trial | ||
J301 | Trial decision |
Free format text: TRIAL DECISION FOR APPEAL AGAINST DECISION TO DECLINE REFUSAL REQUESTED 20130726 Effective date: 20141119 |
|
S901 | Examination by remand of revocation | ||
GRNO | Decision to grant (after opposition) | ||
GRNT | Written decision to grant | ||
LAPS | Lapse due to unpaid annual fee |