CA2365096A1 - Polyunsaturated fatty acid (pufa) elongase from caenorhabditis elegans - Google Patents
Polyunsaturated fatty acid (pufa) elongase from caenorhabditis elegans Download PDFInfo
- Publication number
- CA2365096A1 CA2365096A1 CA002365096A CA2365096A CA2365096A1 CA 2365096 A1 CA2365096 A1 CA 2365096A1 CA 002365096 A CA002365096 A CA 002365096A CA 2365096 A CA2365096 A CA 2365096A CA 2365096 A1 CA2365096 A1 CA 2365096A1
- Authority
- CA
- Canada
- Prior art keywords
- polypeptide
- leu
- phe
- pufa
- ala
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 235000020777 polyunsaturated fatty acids Nutrition 0.000 title claims abstract description 23
- 241000244203 Caenorhabditis elegans Species 0.000 title description 4
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 62
- 229920001184 polypeptide Polymers 0.000 claims abstract description 60
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 60
- 235000020978 long-chain polyunsaturated fatty acids Nutrition 0.000 claims abstract description 6
- 239000000194 fatty acid Substances 0.000 claims description 36
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 32
- 235000014113 dietary fatty acids Nutrition 0.000 claims description 26
- 229930195729 fatty acid Natural products 0.000 claims description 26
- 241001465754 Metazoa Species 0.000 claims description 22
- 238000000034 method Methods 0.000 claims description 17
- 108020004414 DNA Proteins 0.000 claims description 16
- 229910052799 carbon Inorganic materials 0.000 claims description 14
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 12
- 238000006243 chemical reaction Methods 0.000 claims description 12
- 241000124008 Mammalia Species 0.000 claims description 10
- YZXBAPSDXZZRGB-DOFZRALJSA-N arachidonic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O YZXBAPSDXZZRGB-DOFZRALJSA-N 0.000 claims description 10
- -1 carbon fatty acid Chemical class 0.000 claims description 10
- 210000002472 endoplasmic reticulum Anatomy 0.000 claims description 8
- 239000008194 pharmaceutical composition Substances 0.000 claims description 8
- 230000009261 transgenic effect Effects 0.000 claims description 8
- 238000004519 manufacturing process Methods 0.000 claims description 7
- 230000037361 pathway Effects 0.000 claims description 7
- 235000015872 dietary supplement Nutrition 0.000 claims description 6
- 241000206602 Eukaryota Species 0.000 claims description 5
- 239000002253 acid Substances 0.000 claims description 5
- 238000007792 addition Methods 0.000 claims description 5
- 235000021342 arachidonic acid Nutrition 0.000 claims description 5
- 229940114079 arachidonic acid Drugs 0.000 claims description 5
- JAZBEHYOTPTENJ-JLNKQSITSA-N all-cis-5,8,11,14,17-icosapentaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O JAZBEHYOTPTENJ-JLNKQSITSA-N 0.000 claims description 4
- MBMBGCFOFBJSGT-KUBAVDMBSA-N all-cis-docosa-4,7,10,13,16,19-hexaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCC(O)=O MBMBGCFOFBJSGT-KUBAVDMBSA-N 0.000 claims description 4
- JAZBEHYOTPTENJ-UHFFFAOYSA-N eicosapentaenoic acid Natural products CCC=CCC=CCC=CCC=CCC=CCCCC(O)=O JAZBEHYOTPTENJ-UHFFFAOYSA-N 0.000 claims description 4
- 239000000203 mixture Substances 0.000 claims description 4
- SECPZKHBENQXJG-FPLPWBNLSA-N palmitoleic acid Chemical compound CCCCCC\C=C/CCCCCCCC(O)=O SECPZKHBENQXJG-FPLPWBNLSA-N 0.000 claims description 4
- 241000251539 Vertebrata <Metazoa> Species 0.000 claims description 3
- 238000012217 deletion Methods 0.000 claims description 3
- 230000037430 deletion Effects 0.000 claims description 3
- WBBQTNCISCKUMU-PDBXOOCHSA-N (13Z,16Z,19Z)-docosatrienoic acid Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCCCCCC(O)=O WBBQTNCISCKUMU-PDBXOOCHSA-N 0.000 claims description 2
- FPRKGXIOSIUDSE-SYACGTDESA-N (2z,4z,6z,8z)-docosa-2,4,6,8-tetraenoic acid Chemical compound CCCCCCCCCCCCC\C=C/C=C\C=C/C=C\C(O)=O FPRKGXIOSIUDSE-SYACGTDESA-N 0.000 claims description 2
- YUFFSWGQGVEMMI-JLNKQSITSA-N (7Z,10Z,13Z,16Z,19Z)-docosapentaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCCCCC(O)=O YUFFSWGQGVEMMI-JLNKQSITSA-N 0.000 claims description 2
- 241000251468 Actinopterygii Species 0.000 claims description 2
- 241000282693 Cercopithecidae Species 0.000 claims description 2
- 235000021294 Docosapentaenoic acid Nutrition 0.000 claims description 2
- 235000021292 Docosatetraenoic acid Nutrition 0.000 claims description 2
- 239000004606 Fillers/Extenders Substances 0.000 claims description 2
- 235000021319 Palmitoleic acid Nutrition 0.000 claims description 2
- 230000037429 base substitution Effects 0.000 claims description 2
- SECPZKHBENQXJG-UHFFFAOYSA-N cis-palmitoleic acid Natural products CCCCCCC=CCCCCCCCC(O)=O SECPZKHBENQXJG-UHFFFAOYSA-N 0.000 claims description 2
- 239000003085 diluting agent Substances 0.000 claims description 2
- 235000020669 docosahexaenoic acid Nutrition 0.000 claims description 2
- 229940090949 docosahexaenoic acid Drugs 0.000 claims description 2
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 230000003028 elevating effect Effects 0.000 claims 1
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 30
- 108090000623 proteins and genes Proteins 0.000 description 29
- 108700026244 Open Reading Frames Proteins 0.000 description 23
- 241000196324 Embryophyta Species 0.000 description 19
- 150000004665 fatty acids Chemical class 0.000 description 19
- 239000000758 substrate Substances 0.000 description 19
- 101100118654 Caenorhabditis elegans elo-1 gene Proteins 0.000 description 17
- 102000036181 Fatty Acid Elongases Human genes 0.000 description 15
- 108010058732 Fatty Acid Elongases Proteins 0.000 description 15
- 102000004190 Enzymes Human genes 0.000 description 12
- 108090000790 Enzymes Proteins 0.000 description 12
- 210000000349 chromosome Anatomy 0.000 description 12
- VZCCETWTMQHEPK-QNEBEIHSSA-N gamma-linolenic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/CCCCC(O)=O VZCCETWTMQHEPK-QNEBEIHSSA-N 0.000 description 12
- 230000015572 biosynthetic process Effects 0.000 description 11
- 238000003786 synthesis reaction Methods 0.000 description 11
- 235000019387 fatty acid methyl ester Nutrition 0.000 description 9
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical group [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 7
- 150000001413 amino acids Chemical class 0.000 description 7
- 230000014509 gene expression Effects 0.000 description 7
- 150000002632 lipids Chemical class 0.000 description 7
- 125000000539 amino acid group Chemical group 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 6
- 108010050848 glycylleucine Proteins 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- 108010087894 Fatty acid desaturases Proteins 0.000 description 5
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 108010084572 phenylalanyl-valine Proteins 0.000 description 5
- 150000003408 sphingolipids Chemical class 0.000 description 5
- 239000013598 vector Substances 0.000 description 5
- 241000219194 Arabidopsis Species 0.000 description 4
- 102000009114 Fatty acid desaturases Human genes 0.000 description 4
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- FOCSWPCHUDVNLP-PMVMPFDFSA-N His-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC4=CN=CN4)N FOCSWPCHUDVNLP-PMVMPFDFSA-N 0.000 description 4
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 4
- KFKWRHQBZQICHA-STQMWFEESA-N Leu-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 4
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 4
- 230000006870 function Effects 0.000 description 4
- 238000002290 gas chromatography-mass spectrometry Methods 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 108010087823 glycyltyrosine Proteins 0.000 description 4
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 4
- 108010054155 lysyllysine Proteins 0.000 description 4
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 4
- 102000004169 proteins and genes Human genes 0.000 description 4
- 108010003137 tyrosyltyrosine Proteins 0.000 description 4
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 3
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 3
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- 108010066427 N-valyltryptophan Proteins 0.000 description 3
- IEHDJWSAXBGJIP-RYUDHWBXSA-N Phe-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 IEHDJWSAXBGJIP-RYUDHWBXSA-N 0.000 description 3
- LFCQXIXJQXWZJI-BZSNNMDCSA-N Tyr-His-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N)O LFCQXIXJQXWZJI-BZSNNMDCSA-N 0.000 description 3
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 3
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 3
- 238000012512 characterization method Methods 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 238000004817 gas chromatography Methods 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 3
- 108010012581 phenylalanylglutamate Proteins 0.000 description 3
- 108010073101 phenylalanylleucine Proteins 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 2
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- 101100225658 Arabidopsis thaliana ELP4 gene Proteins 0.000 description 2
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 2
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 2
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 2
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 2
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 2
- 241001072256 Boraginaceae Species 0.000 description 2
- 235000007689 Borago officinalis Nutrition 0.000 description 2
- DPUOLQHDNGRHBS-UHFFFAOYSA-N Brassidinsaeure Natural products CCCCCCCCC=CCCCCCCCCCCCC(O)=O DPUOLQHDNGRHBS-UHFFFAOYSA-N 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- URXZXNYJPAJJOQ-UHFFFAOYSA-N Erucic acid Natural products CCCCCCC=CCCCCCCCCCCCC(O)=O URXZXNYJPAJJOQ-UHFFFAOYSA-N 0.000 description 2
- 208000033962 Fontaine progeroid syndrome Diseases 0.000 description 2
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 2
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 2
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 2
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 2
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 2
- FLUVGKKRRMLNPU-CQDKDKBSSA-N His-Ala-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FLUVGKKRRMLNPU-CQDKDKBSSA-N 0.000 description 2
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 2
- KRBMQYPTDYSENE-BQBZGAKWSA-N His-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 KRBMQYPTDYSENE-BQBZGAKWSA-N 0.000 description 2
- WMDZARSFSMZOQO-DRZSPHRISA-N Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WMDZARSFSMZOQO-DRZSPHRISA-N 0.000 description 2
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 2
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 2
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 2
- BZUOLKFQVVBTJY-SLBDDTMCSA-N Ile-Trp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BZUOLKFQVVBTJY-SLBDDTMCSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- HSQGMTRYSIHDAC-BQBZGAKWSA-N Leu-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(O)=O HSQGMTRYSIHDAC-BQBZGAKWSA-N 0.000 description 2
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 2
- LKXANTUNFMVCNF-IHPCNDPISA-N Leu-His-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LKXANTUNFMVCNF-IHPCNDPISA-N 0.000 description 2
- LHSGPCFBGJHPCY-STQMWFEESA-N Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-STQMWFEESA-N 0.000 description 2
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 2
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 2
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 2
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 2
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 2
- YQAIUOWPSUOINN-IUCAKERBSA-N Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN YQAIUOWPSUOINN-IUCAKERBSA-N 0.000 description 2
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 2
- 108010052285 Membrane Proteins Proteins 0.000 description 2
- JHKXZYLNVJRAAJ-WDSKDSINSA-N Met-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(O)=O JHKXZYLNVJRAAJ-WDSKDSINSA-N 0.000 description 2
- ADHNYKZHPOEULM-BQBZGAKWSA-N Met-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O ADHNYKZHPOEULM-BQBZGAKWSA-N 0.000 description 2
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 2
- CULGJGUDIJATIP-STQMWFEESA-N Met-Tyr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 CULGJGUDIJATIP-STQMWFEESA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- MIDZLCFIAINOQN-WPRPVWTQSA-N Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 MIDZLCFIAINOQN-WPRPVWTQSA-N 0.000 description 2
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 2
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 2
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 2
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 2
- IEOHQGFKHXUALJ-JYJNAYRXSA-N Phe-Met-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IEOHQGFKHXUALJ-JYJNAYRXSA-N 0.000 description 2
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 2
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- 101100118655 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) ELO1 gene Proteins 0.000 description 2
- YMAWDPHQVABADW-CIUDSAMLSA-N Ser-Gln-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O YMAWDPHQVABADW-CIUDSAMLSA-N 0.000 description 2
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 2
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 2
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 2
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 2
- WLQRIHCMPFHGKP-PMVMPFDFSA-N Trp-Leu-Phe Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CC(C)C)C(O)=O)C1=CC=CC=C1 WLQRIHCMPFHGKP-PMVMPFDFSA-N 0.000 description 2
- RCMHSGRBJCMFLR-BPUTZDHNSA-N Trp-Met-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 RCMHSGRBJCMFLR-BPUTZDHNSA-N 0.000 description 2
- BOBZBMOTRORUPT-XIRDDKMYSA-N Trp-Ser-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 BOBZBMOTRORUPT-XIRDDKMYSA-N 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- GYKDRHDMGQUZPU-MGHWNKPDSA-N Tyr-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GYKDRHDMGQUZPU-MGHWNKPDSA-N 0.000 description 2
- ZSXJENBJGRHKIG-UWVGGRQHSA-N Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZSXJENBJGRHKIG-UWVGGRQHSA-N 0.000 description 2
- JAQGKXUEKGKTKX-HOTGVXAUSA-N Tyr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 JAQGKXUEKGKTKX-HOTGVXAUSA-N 0.000 description 2
- 235000021322 Vaccenic acid Nutrition 0.000 description 2
- UWHZIFQPPBDJPM-FPLPWBNLSA-M Vaccenic acid Natural products CCCCCC\C=C/CCCCCCCCCC([O-])=O UWHZIFQPPBDJPM-FPLPWBNLSA-M 0.000 description 2
- JXGWQYWDUOWQHA-DZKIICNBSA-N Val-Gln-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N JXGWQYWDUOWQHA-DZKIICNBSA-N 0.000 description 2
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 2
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 2
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 2
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 2
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- 108010081404 acein-2 Proteins 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 238000002869 basic local alignment search tool Methods 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- DPUOLQHDNGRHBS-KTKRTIGZSA-N erucic acid Chemical compound CCCCCCCC\C=C/CCCCCCCCCCCC(O)=O DPUOLQHDNGRHBS-KTKRTIGZSA-N 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- 108010071185 leucyl-alanine Proteins 0.000 description 2
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 230000014759 maintenance of location Effects 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 235000021281 monounsaturated fatty acids Nutrition 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 235000003441 saturated fatty acids Nutrition 0.000 description 2
- 150000004671 saturated fatty acids Chemical class 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 108010009106 sphingolipid desaturase Proteins 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- UWHZIFQPPBDJPM-BQYQJAHWSA-N trans-vaccenic acid Chemical compound CCCCCC\C=C\CCCCCCCCCC(O)=O UWHZIFQPPBDJPM-BQYQJAHWSA-N 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- 108010077037 tyrosyl-tyrosyl-phenylalanine Proteins 0.000 description 2
- 229940035893 uracil Drugs 0.000 description 2
- 230000035899 viability Effects 0.000 description 2
- LGHXTTIAZFVCCU-SSVNFBSYSA-N (2E,4E,6E,8E)-octadeca-2,4,6,8-tetraenoic acid Chemical compound CCCCCCCCC\C=C\C=C\C=C\C=C\C(O)=O LGHXTTIAZFVCCU-SSVNFBSYSA-N 0.000 description 1
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- JFOWDKWFHZIMTR-RUCXOUQFSA-N (2s)-2-aminopentanedioic acid;(2s)-2,5-diamino-5-oxopentanoic acid Chemical compound OC(=O)[C@@H](N)CCC(N)=O.OC(=O)[C@@H](N)CCC(O)=O JFOWDKWFHZIMTR-RUCXOUQFSA-N 0.000 description 1
- OYHQOLUKZRVURQ-NTGFUMLPSA-N (9Z,12Z)-9,10,12,13-tetratritiooctadeca-9,12-dienoic acid Chemical compound C(CCCCCCC\C(=C(/C\C(=C(/CCCCC)\[3H])\[3H])\[3H])\[3H])(=O)O OYHQOLUKZRVURQ-NTGFUMLPSA-N 0.000 description 1
- AMBKWKJGMIHTJR-UHFFFAOYSA-N 2-[2-[2-[(2-azaniumyl-3-methylbutanoyl)amino]propanoylamino]propanoylamino]-3-phenylpropanoate Chemical compound CC(C)C(N)C(=O)NC(C)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=CC=C1 AMBKWKJGMIHTJR-UHFFFAOYSA-N 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- YBPLKDWJFYCZSV-ZLUOBGJFSA-N Ala-Asn-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N YBPLKDWJFYCZSV-ZLUOBGJFSA-N 0.000 description 1
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- CRWFEKLFPVRPBV-CIUDSAMLSA-N Ala-Gln-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CRWFEKLFPVRPBV-CIUDSAMLSA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- ZPXCNXMJEZKRLU-LSJOCFKGSA-N Ala-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 ZPXCNXMJEZKRLU-LSJOCFKGSA-N 0.000 description 1
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 1
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- UWIQWPWWZUHBAO-ZLIFDBKOSA-N Ala-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)CC(C)C)C(O)=O)=CNC2=C1 UWIQWPWWZUHBAO-ZLIFDBKOSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- BUQICHWNXBIBOG-LMVFSUKVSA-N Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)N BUQICHWNXBIBOG-LMVFSUKVSA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- WZGZDOXCDLLTHE-SYWGBEHUSA-N Ala-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 WZGZDOXCDLLTHE-SYWGBEHUSA-N 0.000 description 1
- QDGMZAOSMNGBLP-MRFFXTKBSA-N Ala-Trp-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N QDGMZAOSMNGBLP-MRFFXTKBSA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- YYOVLDPHIJAOSY-DCAQKATOSA-N Arg-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N YYOVLDPHIJAOSY-DCAQKATOSA-N 0.000 description 1
- JSLGXODUIAFWCF-WDSKDSINSA-N Arg-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O JSLGXODUIAFWCF-WDSKDSINSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 1
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- KSUALAGYYLQSHJ-RCWTZXSCSA-N Arg-Met-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSUALAGYYLQSHJ-RCWTZXSCSA-N 0.000 description 1
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 1
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- QADCERNTBWTXFV-JSGCOSHPSA-N Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(O)=O)=CNC2=C1 QADCERNTBWTXFV-JSGCOSHPSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- CIBWFJFMOBIFTE-CIUDSAMLSA-N Asn-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N CIBWFJFMOBIFTE-CIUDSAMLSA-N 0.000 description 1
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 1
- FFMIYIMKQIMDPK-BQBZGAKWSA-N Asn-His Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 FFMIYIMKQIMDPK-BQBZGAKWSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 1
- FTNRWCPWDWRPAV-BZSNNMDCSA-N Asn-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTNRWCPWDWRPAV-BZSNNMDCSA-N 0.000 description 1
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 1
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 1
- SONUFGRSSMFHFN-IMJSIDKUSA-N Asn-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O SONUFGRSSMFHFN-IMJSIDKUSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- FYRVDDJMNISIKJ-UWVGGRQHSA-N Asn-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FYRVDDJMNISIKJ-UWVGGRQHSA-N 0.000 description 1
- JPPLRQVZMZFOSX-UWJYBYFXSA-N Asn-Tyr-Ala Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 JPPLRQVZMZFOSX-UWJYBYFXSA-N 0.000 description 1
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 1
- LRCIOEVFVGXZKB-BZSNNMDCSA-N Asn-Tyr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LRCIOEVFVGXZKB-BZSNNMDCSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- UZNSWMFLKVKJLI-VHWLVUOQSA-N Asp-Ile-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UZNSWMFLKVKJLI-VHWLVUOQSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- NTQDELBZOMWXRS-IWGUZYHVSA-N Asp-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O NTQDELBZOMWXRS-IWGUZYHVSA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 1
- 208000016444 Benign adult familial myoclonic epilepsy Diseases 0.000 description 1
- RRIJEABIXPKSGP-FXQIFTODSA-N Cys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CS RRIJEABIXPKSGP-FXQIFTODSA-N 0.000 description 1
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 1
- SBMGKDLRJLYZCU-BIIVOSGPSA-N Cys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N)C(=O)O SBMGKDLRJLYZCU-BIIVOSGPSA-N 0.000 description 1
- YHDXIZKDOIWPBW-WHFBIAKZSA-N Cys-Gln Chemical compound SC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O YHDXIZKDOIWPBW-WHFBIAKZSA-N 0.000 description 1
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 1
- LVNMAAGSAUGNIC-BQBZGAKWSA-N Cys-His Chemical compound SC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 LVNMAAGSAUGNIC-BQBZGAKWSA-N 0.000 description 1
- ZXCAQANTQWBICD-DCAQKATOSA-N Cys-Lys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N ZXCAQANTQWBICD-DCAQKATOSA-N 0.000 description 1
- CNBIWHCVAZHRBI-IHRRRGAJSA-N Cys-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N CNBIWHCVAZHRBI-IHRRRGAJSA-N 0.000 description 1
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 1
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 1
- DRXOWZZHCSBUOI-YJRXYDGGSA-N Cys-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N)O DRXOWZZHCSBUOI-YJRXYDGGSA-N 0.000 description 1
- VRJZMZGGAKVSIQ-SRVKXCTJSA-N Cys-Tyr-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VRJZMZGGAKVSIQ-SRVKXCTJSA-N 0.000 description 1
- 241000132179 Eurotium medium Species 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- PHZYLYASFWHLHJ-FXQIFTODSA-N Gln-Asn-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PHZYLYASFWHLHJ-FXQIFTODSA-N 0.000 description 1
- LLVXTGUTDYMJLY-GUBZILKMSA-N Gln-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LLVXTGUTDYMJLY-GUBZILKMSA-N 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- MFLMFRZBAJSGHK-ACZMJKKPSA-N Gln-Cys-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N MFLMFRZBAJSGHK-ACZMJKKPSA-N 0.000 description 1
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 1
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- CLSDNFWKGFJIBZ-YUMQZZPRSA-N Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(N)=O CLSDNFWKGFJIBZ-YUMQZZPRSA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- GURIQZQSTBBHRV-SRVKXCTJSA-N Gln-Lys-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GURIQZQSTBBHRV-SRVKXCTJSA-N 0.000 description 1
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 1
- VHLZDSUANXBJHW-QWRGUYRKSA-N Gln-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHLZDSUANXBJHW-QWRGUYRKSA-N 0.000 description 1
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 1
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 1
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 1
- YRHZWVKUFWCEPW-GLLZPBPUSA-N Gln-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YRHZWVKUFWCEPW-GLLZPBPUSA-N 0.000 description 1
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 1
- XMWNHGKDDIFXQJ-NWLDYVSISA-N Gln-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XMWNHGKDDIFXQJ-NWLDYVSISA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- HKTRDWYCAUTRRL-YUMQZZPRSA-N Glu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 HKTRDWYCAUTRRL-YUMQZZPRSA-N 0.000 description 1
- BIHMNDPWRUROFZ-JYJNAYRXSA-N Glu-His-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BIHMNDPWRUROFZ-JYJNAYRXSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- LGWUJBCIFGVBSJ-CIUDSAMLSA-N Glu-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LGWUJBCIFGVBSJ-CIUDSAMLSA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- YSWHPLCDIMUKFE-QWRGUYRKSA-N Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YSWHPLCDIMUKFE-QWRGUYRKSA-N 0.000 description 1
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- XZRZILPOZBVTDB-GJZGRUSLSA-N Gly-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)CN)C(O)=O)=CNC2=C1 XZRZILPOZBVTDB-GJZGRUSLSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- IEFJWDNGDZAYNZ-BYPYZUCNSA-N Gly-Glu Chemical compound NCC(=O)N[C@H](C(O)=O)CCC(O)=O IEFJWDNGDZAYNZ-BYPYZUCNSA-N 0.000 description 1
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- JBCLFWXMTIKCCB-VIFPVBQESA-N Gly-Phe Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-VIFPVBQESA-N 0.000 description 1
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- FULZDMOZUZKGQU-ONGXEEELSA-N Gly-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN FULZDMOZUZKGQU-ONGXEEELSA-N 0.000 description 1
- VPZXBVLAVMBEQI-VKHMYHEASA-N Glycyl-alanine Chemical compound OC(=O)[C@H](C)NC(=O)CN VPZXBVLAVMBEQI-VKHMYHEASA-N 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 1
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 1
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 1
- GHAFKUCRIVBLDJ-IHRRRGAJSA-N His-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N GHAFKUCRIVBLDJ-IHRRRGAJSA-N 0.000 description 1
- ORZGPQXISSXQGW-IHRRRGAJSA-N His-His-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O ORZGPQXISSXQGW-IHRRRGAJSA-N 0.000 description 1
- FSOXZQBMPBQKGJ-QSFUFRPTSA-N His-Ile-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 FSOXZQBMPBQKGJ-QSFUFRPTSA-N 0.000 description 1
- JJHWJUYYTWYXPL-PYJNHQTQSA-N His-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CN=CN1 JJHWJUYYTWYXPL-PYJNHQTQSA-N 0.000 description 1
- WYSJPCTWSBJFCO-AVGNSLFASA-N His-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N WYSJPCTWSBJFCO-AVGNSLFASA-N 0.000 description 1
- RLAOTFTXBFQJDV-KKUMJFAQSA-N His-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CN=CN1 RLAOTFTXBFQJDV-KKUMJFAQSA-N 0.000 description 1
- YXXKBPJEIYFGOD-MGHWNKPDSA-N His-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N YXXKBPJEIYFGOD-MGHWNKPDSA-N 0.000 description 1
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 1
- CUEQQFOGARVNHU-VGDYDELISA-N His-Ser-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUEQQFOGARVNHU-VGDYDELISA-N 0.000 description 1
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 description 1
- DAKSMIWQZPHRIB-BZSNNMDCSA-N His-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DAKSMIWQZPHRIB-BZSNNMDCSA-N 0.000 description 1
- BCSGDNGNHKBRRJ-ULQDDVLXSA-N His-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N BCSGDNGNHKBRRJ-ULQDDVLXSA-N 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- BALLIXFZYSECCF-QEWYBTABSA-N Ile-Gln-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N BALLIXFZYSECCF-QEWYBTABSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- UCGDDTHMMVWVMV-FSPLSTOPSA-N Ile-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(O)=O UCGDDTHMMVWVMV-FSPLSTOPSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- KOPIAUWNLKKELG-SIGLWIIPSA-N Ile-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N KOPIAUWNLKKELG-SIGLWIIPSA-N 0.000 description 1
- BCVIOZZGJNOEQS-XKNYDFJKSA-N Ile-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)[C@@H](C)CC BCVIOZZGJNOEQS-XKNYDFJKSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- FCWFBHMAJZGWRY-XUXIUFHCSA-N Ile-Leu-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N FCWFBHMAJZGWRY-XUXIUFHCSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- FJWALBCCVIHZBS-QXEWZRGKSA-N Ile-Met-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N FJWALBCCVIHZBS-QXEWZRGKSA-N 0.000 description 1
- UOPBQSJRBONRON-STECZYCISA-N Ile-Met-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOPBQSJRBONRON-STECZYCISA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- USXAYNCLFSUSBA-MGHWNKPDSA-N Ile-Phe-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N USXAYNCLFSUSBA-MGHWNKPDSA-N 0.000 description 1
- RENBRDSDKPSRIH-HJWJTTGWSA-N Ile-Phe-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O RENBRDSDKPSRIH-HJWJTTGWSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- FBGXMKUWQFPHFB-JBDRJPRFSA-N Ile-Ser-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N FBGXMKUWQFPHFB-JBDRJPRFSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- KWHFUMYCSPJCFQ-NGTWOADLSA-N Ile-Thr-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N KWHFUMYCSPJCFQ-NGTWOADLSA-N 0.000 description 1
- MUFXDFWAJSPHIQ-XDTLVQLUSA-N Ile-Tyr Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 MUFXDFWAJSPHIQ-XDTLVQLUSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 1
- BCXBIONYYJCSDF-CIUDSAMLSA-N Ile-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(O)=O BCXBIONYYJCSDF-CIUDSAMLSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- HHSJMSCOLJVTCX-ZDLURKLDSA-N L-Glutaminyl-L-threonine Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(N)=O HHSJMSCOLJVTCX-ZDLURKLDSA-N 0.000 description 1
- QLROSWPKSBORFJ-BQBZGAKWSA-N L-Prolyl-L-glutamic acid Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 QLROSWPKSBORFJ-BQBZGAKWSA-N 0.000 description 1
- VYZAGTDAHUIRQA-WHFBIAKZSA-N L-alanyl-L-glutamic acid Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O VYZAGTDAHUIRQA-WHFBIAKZSA-N 0.000 description 1
- QOOWRKBDDXQRHC-BQBZGAKWSA-N L-lysyl-L-alanine Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN QOOWRKBDDXQRHC-BQBZGAKWSA-N 0.000 description 1
- YUGVQABRIJXYNQ-CIUDSAMLSA-N Leu-Ala-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YUGVQABRIJXYNQ-CIUDSAMLSA-N 0.000 description 1
- YUGVQABRIJXYNQ-UHFFFAOYSA-N Leu-Ala-Ala Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C)C(O)=O YUGVQABRIJXYNQ-UHFFFAOYSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 1
- SENJXOPIZNYLHU-IUCAKERBSA-N Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-IUCAKERBSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- DXYBNWJZJVSZAE-GUBZILKMSA-N Leu-Gln-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DXYBNWJZJVSZAE-GUBZILKMSA-N 0.000 description 1
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- LESXFEZIFXFIQR-LURJTMIESA-N Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(O)=O LESXFEZIFXFIQR-LURJTMIESA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- XWOBNBRUDDUEEY-UWVGGRQHSA-N Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XWOBNBRUDDUEEY-UWVGGRQHSA-N 0.000 description 1
- AZLASBBHHSLQDB-GUBZILKMSA-N Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(C)C AZLASBBHHSLQDB-GUBZILKMSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 1
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- OTXBNHIUIHNGAO-UWVGGRQHSA-N Leu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN OTXBNHIUIHNGAO-UWVGGRQHSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 1
- POMXSEDNUXYPGK-IHRRRGAJSA-N Leu-Met-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N POMXSEDNUXYPGK-IHRRRGAJSA-N 0.000 description 1
- MJTOYIHCKVQICL-ULQDDVLXSA-N Leu-Met-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MJTOYIHCKVQICL-ULQDDVLXSA-N 0.000 description 1
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 1
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 1
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 1
- VTJUNIYRYIAIHF-IUCAKERBSA-N Leu-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O VTJUNIYRYIAIHF-IUCAKERBSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- URHJPNHRQMQGOZ-RHYQMDGZSA-N Leu-Thr-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O URHJPNHRQMQGOZ-RHYQMDGZSA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- YWFZWQKWNDOWPA-XIRDDKMYSA-N Leu-Trp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O YWFZWQKWNDOWPA-XIRDDKMYSA-N 0.000 description 1
- HQBOMRTVKVKFMN-WDSOQIARSA-N Leu-Trp-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O HQBOMRTVKVKFMN-WDSOQIARSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- JPNRPAJITHRXRH-BQBZGAKWSA-N Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O JPNRPAJITHRXRH-BQBZGAKWSA-N 0.000 description 1
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 1
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- CIOWSLJGLSUOME-BQBZGAKWSA-N Lys-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O CIOWSLJGLSUOME-BQBZGAKWSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- FGMHXLULNHTPID-KKUMJFAQSA-N Lys-His-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CN=CN1 FGMHXLULNHTPID-KKUMJFAQSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- NVGBPTNZLWRQSY-UWVGGRQHSA-N Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN NVGBPTNZLWRQSY-UWVGGRQHSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- ZOKVLMBYDSIDKG-CSMHCCOUSA-N Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ZOKVLMBYDSIDKG-CSMHCCOUSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- YUTZYVTZDVZBJJ-IHPCNDPISA-N Lys-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 YUTZYVTZDVZBJJ-IHPCNDPISA-N 0.000 description 1
- MYTOTTSMVMWVJN-STQMWFEESA-N Lys-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MYTOTTSMVMWVJN-STQMWFEESA-N 0.000 description 1
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 1
- XATKLFSXFINPSB-JYJNAYRXSA-N Lys-Tyr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O XATKLFSXFINPSB-JYJNAYRXSA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- XBAJINCXDBTJRH-WDSOQIARSA-N Lys-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N XBAJINCXDBTJRH-WDSOQIARSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 1
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- QXEVZBXTDTVPCP-GMOBBJLQSA-N Met-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCSC)N QXEVZBXTDTVPCP-GMOBBJLQSA-N 0.000 description 1
- HKRYNJSKVLZIFP-IHRRRGAJSA-N Met-Asn-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HKRYNJSKVLZIFP-IHRRRGAJSA-N 0.000 description 1
- MCNGIXXCMJAURZ-VEVYYDQMSA-N Met-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)N)O MCNGIXXCMJAURZ-VEVYYDQMSA-N 0.000 description 1
- UJDMTKHGWSBHBX-IHRRRGAJSA-N Met-Cys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UJDMTKHGWSBHBX-IHRRRGAJSA-N 0.000 description 1
- AWOMRHGUWFBDNU-ZPFDUUQYSA-N Met-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N AWOMRHGUWFBDNU-ZPFDUUQYSA-N 0.000 description 1
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 1
- QXOHLNCNYLGICT-YFKPBYRVSA-N Met-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(O)=O QXOHLNCNYLGICT-YFKPBYRVSA-N 0.000 description 1
- ABHVWYPPHDYFNY-WDSOQIARSA-N Met-His-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CN=CN1 ABHVWYPPHDYFNY-WDSOQIARSA-N 0.000 description 1
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 1
- GETCJHFFECHWHI-QXEWZRGKSA-N Met-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCSC)N GETCJHFFECHWHI-QXEWZRGKSA-N 0.000 description 1
- AFVOKRHYSSFPHC-STECZYCISA-N Met-Ile-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFVOKRHYSSFPHC-STECZYCISA-N 0.000 description 1
- PBOUVYGPDSARIS-IUCAKERBSA-N Met-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(C)C PBOUVYGPDSARIS-IUCAKERBSA-N 0.000 description 1
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 1
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 description 1
- JKXVPNCSAMWUEJ-GUBZILKMSA-N Met-Met-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O JKXVPNCSAMWUEJ-GUBZILKMSA-N 0.000 description 1
- OIFHHODAXVWKJN-ULQDDVLXSA-N Met-Phe-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 OIFHHODAXVWKJN-ULQDDVLXSA-N 0.000 description 1
- NTYQUVLERIHPMU-HRCADAONSA-N Met-Phe-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N NTYQUVLERIHPMU-HRCADAONSA-N 0.000 description 1
- LQTGGXSOMDSWTQ-UNQGMJICSA-N Met-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCSC)N)O LQTGGXSOMDSWTQ-UNQGMJICSA-N 0.000 description 1
- QEDGNYFHLXXIDC-DCAQKATOSA-N Met-Pro-Gln Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O QEDGNYFHLXXIDC-DCAQKATOSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 1
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 1
- ZBLSZPYQQRIHQU-RCWTZXSCSA-N Met-Thr-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ZBLSZPYQQRIHQU-RCWTZXSCSA-N 0.000 description 1
- LIIXIZKVWNYQHB-STECZYCISA-N Met-Tyr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LIIXIZKVWNYQHB-STECZYCISA-N 0.000 description 1
- FZDOBWIKRQORAC-ULQDDVLXSA-N Met-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N FZDOBWIKRQORAC-ULQDDVLXSA-N 0.000 description 1
- VYXIKLFLGRTANT-HRCADAONSA-N Met-Tyr-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N VYXIKLFLGRTANT-HRCADAONSA-N 0.000 description 1
- GHQFLTYXGUETFD-UFYCRDLUSA-N Met-Tyr-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N GHQFLTYXGUETFD-UFYCRDLUSA-N 0.000 description 1
- BJFJQOMZCSHBMY-YUMQZZPRSA-N Met-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(O)=O BJFJQOMZCSHBMY-YUMQZZPRSA-N 0.000 description 1
- PVSPJQWHEIQTEH-JYJNAYRXSA-N Met-Val-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PVSPJQWHEIQTEH-JYJNAYRXSA-N 0.000 description 1
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 1
- 101001014220 Monascus pilosus Dehydrogenase mokE Proteins 0.000 description 1
- 241000907999 Mortierella alpina Species 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 241000244206 Nematoda Species 0.000 description 1
- 101000573542 Penicillium citrinum Compactin nonaketide synthase, enoyl reductase component Proteins 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 1
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 1
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- DJPXNKUDJKGQEE-BZSNNMDCSA-N Phe-Asp-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DJPXNKUDJKGQEE-BZSNNMDCSA-N 0.000 description 1
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 1
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 1
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- JWBLQDDHSDGEGR-DRZSPHRISA-N Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWBLQDDHSDGEGR-DRZSPHRISA-N 0.000 description 1
- MIICYIIBVYQNKE-QEWYBTABSA-N Phe-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MIICYIIBVYQNKE-QEWYBTABSA-N 0.000 description 1
- GXDPQJUBLBZKDY-IAVJCBSLSA-N Phe-Ile-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GXDPQJUBLBZKDY-IAVJCBSLSA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- NRKNYPRRWXVELC-NQCBNZPSSA-N Phe-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=CC=C3)N NRKNYPRRWXVELC-NQCBNZPSSA-N 0.000 description 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 1
- RFCVXVPWSPOMFJ-STQMWFEESA-N Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RFCVXVPWSPOMFJ-STQMWFEESA-N 0.000 description 1
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 1
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 1
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 1
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 1
- YVIVIQWMNCWUFS-UFYCRDLUSA-N Phe-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N YVIVIQWMNCWUFS-UFYCRDLUSA-N 0.000 description 1
- OKQQWSNUSQURLI-JYJNAYRXSA-N Phe-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N OKQQWSNUSQURLI-JYJNAYRXSA-N 0.000 description 1
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 1
- WEQJQNWXCSUVMA-RYUDHWBXSA-N Phe-Pro Chemical compound C([C@H]([NH3+])C(=O)N1[C@@H](CCC1)C([O-])=O)C1=CC=CC=C1 WEQJQNWXCSUVMA-RYUDHWBXSA-N 0.000 description 1
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 1
- ROHDXJUFQVRDAV-UWVGGRQHSA-N Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ROHDXJUFQVRDAV-UWVGGRQHSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- NYQBYASWHVRESG-MIMYLULJSA-N Phe-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 NYQBYASWHVRESG-MIMYLULJSA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 1
- YCEWAVIRWNGGSS-NQCBNZPSSA-N Phe-Trp-Ile Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)C1=CC=CC=C1 YCEWAVIRWNGGSS-NQCBNZPSSA-N 0.000 description 1
- LKRUQZQZMXMKEQ-SFJXLCSZSA-N Phe-Trp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKRUQZQZMXMKEQ-SFJXLCSZSA-N 0.000 description 1
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- XBCOOBCTVMMQSC-BVSLBCMMSA-N Phe-Val-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 XBCOOBCTVMMQSC-BVSLBCMMSA-N 0.000 description 1
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 1
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- KTFZQPLSPLWLKN-KKUMJFAQSA-N Pro-Gln-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KTFZQPLSPLWLKN-KKUMJFAQSA-N 0.000 description 1
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- RVQDZELMXZRSSI-IUCAKERBSA-N Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 RVQDZELMXZRSSI-IUCAKERBSA-N 0.000 description 1
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 1
- INDVYIOKMXFQFM-SRVKXCTJSA-N Pro-Lys-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O INDVYIOKMXFQFM-SRVKXCTJSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 1
- VPBQDHMASPJHGY-JYJNAYRXSA-N Pro-Trp-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CO)C(=O)O VPBQDHMASPJHGY-JYJNAYRXSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- MUPFEKGTMRGPLJ-RMMQSMQOSA-N Raffinose Natural products O(C[C@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@@H](O[C@@]2(CO)[C@H](O)[C@@H](O)[C@@H](CO)O2)O1)[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 MUPFEKGTMRGPLJ-RMMQSMQOSA-N 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- RZEQTVHJZCIUBT-WDSKDSINSA-N Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RZEQTVHJZCIUBT-WDSKDSINSA-N 0.000 description 1
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- BLPYXIXXCFVIIF-FXQIFTODSA-N Ser-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N)CN=C(N)N BLPYXIXXCFVIIF-FXQIFTODSA-N 0.000 description 1
- WOUIMBGNEUWXQG-VKHMYHEASA-N Ser-Gly Chemical compound OC[C@H](N)C(=O)NCC(O)=O WOUIMBGNEUWXQG-VKHMYHEASA-N 0.000 description 1
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 1
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 1
- MLSQXWSRHURDMF-GARJFASQSA-N Ser-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N)C(=O)O MLSQXWSRHURDMF-GARJFASQSA-N 0.000 description 1
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 1
- BXLYSRPHVMCOPS-ACZMJKKPSA-N Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO BXLYSRPHVMCOPS-ACZMJKKPSA-N 0.000 description 1
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 1
- NERYDXBVARJIQS-JYBASQMISA-N Ser-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N)O NERYDXBVARJIQS-JYBASQMISA-N 0.000 description 1
- PZHJLTWGMYERRJ-SRVKXCTJSA-N Ser-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O PZHJLTWGMYERRJ-SRVKXCTJSA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 1
- OSFZCEQJLWCIBG-BZSNNMDCSA-N Ser-Tyr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSFZCEQJLWCIBG-BZSNNMDCSA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- ILVGMCVCQBJPSH-WDSKDSINSA-N Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO ILVGMCVCQBJPSH-WDSKDSINSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 1
- IOWJRKAVLALBQB-IWGUZYHVSA-N Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O IOWJRKAVLALBQB-IWGUZYHVSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- BWUHENPAEMNGQJ-ZDLURKLDSA-N Thr-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O BWUHENPAEMNGQJ-ZDLURKLDSA-N 0.000 description 1
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- QFCQNHITJPRQTB-IEGACIPQSA-N Thr-Lys-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O QFCQNHITJPRQTB-IEGACIPQSA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- APIDTRXFGYOLLH-VQVTYTSYSA-N Thr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O APIDTRXFGYOLLH-VQVTYTSYSA-N 0.000 description 1
- CGCMNOIQVAXYMA-UNQGMJICSA-N Thr-Met-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CGCMNOIQVAXYMA-UNQGMJICSA-N 0.000 description 1
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- ZOCJFNXUVSGBQI-HSHDSVGOSA-N Thr-Trp-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ZOCJFNXUVSGBQI-HSHDSVGOSA-N 0.000 description 1
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 1
- XVHAUVJXBFGUPC-RPTUDFQQSA-N Thr-Tyr-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XVHAUVJXBFGUPC-RPTUDFQQSA-N 0.000 description 1
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 1
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 1
- RNFZZCMCRDFNAE-WFBYXXMGSA-N Trp-Asn-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O RNFZZCMCRDFNAE-WFBYXXMGSA-N 0.000 description 1
- XZSJDSBPEJBEFZ-QRTARXTBSA-N Trp-Asn-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O XZSJDSBPEJBEFZ-QRTARXTBSA-N 0.000 description 1
- YRXXUYPYPHRJPB-RXVVDRJESA-N Trp-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N YRXXUYPYPHRJPB-RXVVDRJESA-N 0.000 description 1
- ORQGVWIUHICVKE-KCTSRDHCSA-N Trp-His-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O ORQGVWIUHICVKE-KCTSRDHCSA-N 0.000 description 1
- BYSKNUASOAGJSS-NQCBNZPSSA-N Trp-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N BYSKNUASOAGJSS-NQCBNZPSSA-N 0.000 description 1
- YDTKYBHPRULROG-LTHWPDAASA-N Trp-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YDTKYBHPRULROG-LTHWPDAASA-N 0.000 description 1
- BVZABQIRMYTKCF-JSGCOSHPSA-N Trp-Met Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCSC)C(O)=O)=CNC2=C1 BVZABQIRMYTKCF-JSGCOSHPSA-N 0.000 description 1
- HWCBFXAWVTXXHZ-NYVOZVTQSA-N Trp-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N HWCBFXAWVTXXHZ-NYVOZVTQSA-N 0.000 description 1
- WBZOZLNLXVBCNW-LTHWPDAASA-N Trp-Thr-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)[C@@H](C)O)=CNC2=C1 WBZOZLNLXVBCNW-LTHWPDAASA-N 0.000 description 1
- PKZIWSHDJYIPRH-JBACZVJFSA-N Trp-Tyr-Gln Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKZIWSHDJYIPRH-JBACZVJFSA-N 0.000 description 1
- MXKUGFHWYYKVDV-SZMVWBNQSA-N Trp-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(C)C)C(O)=O MXKUGFHWYYKVDV-SZMVWBNQSA-N 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- JXNRXNCCROJZFB-RYUDHWBXSA-N Tyr-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JXNRXNCCROJZFB-RYUDHWBXSA-N 0.000 description 1
- KDGFPPHLXCEQRN-STECZYCISA-N Tyr-Arg-Ile Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDGFPPHLXCEQRN-STECZYCISA-N 0.000 description 1
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 1
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 1
- ZQOOYCZQENFIMC-STQMWFEESA-N Tyr-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=C(O)C=C1 ZQOOYCZQENFIMC-STQMWFEESA-N 0.000 description 1
- KEANSLVUGJADPN-LKTVYLICSA-N Tyr-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N KEANSLVUGJADPN-LKTVYLICSA-N 0.000 description 1
- GFJXBLSZOFWHAW-JYJNAYRXSA-N Tyr-His-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GFJXBLSZOFWHAW-JYJNAYRXSA-N 0.000 description 1
- HHFMNAVFGBYSAT-IGISWZIWSA-N Tyr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HHFMNAVFGBYSAT-IGISWZIWSA-N 0.000 description 1
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 1
- AZZLDIDWPZLCCW-ZEWNOJEFSA-N Tyr-Ile-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O AZZLDIDWPZLCCW-ZEWNOJEFSA-N 0.000 description 1
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 1
- BJCILVZEZRDIDR-PMVMPFDFSA-N Tyr-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 BJCILVZEZRDIDR-PMVMPFDFSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 1
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 1
- NKMFRGPKTIEXSK-ULQDDVLXSA-N Tyr-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NKMFRGPKTIEXSK-ULQDDVLXSA-N 0.000 description 1
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 1
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 1
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 1
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 1
- VNYDHJARLHNEGA-RYUDHWBXSA-N Tyr-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 VNYDHJARLHNEGA-RYUDHWBXSA-N 0.000 description 1
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 1
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 1
- BCOBSVIZMQXKFY-KKUMJFAQSA-N Tyr-Ser-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O BCOBSVIZMQXKFY-KKUMJFAQSA-N 0.000 description 1
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 1
- BMPPMAOOKQJYIP-WMZOPIPTSA-N Tyr-Trp Chemical compound C([C@H]([NH3+])C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C([O-])=O)C1=CC=C(O)C=C1 BMPPMAOOKQJYIP-WMZOPIPTSA-N 0.000 description 1
- BXJQKVDPRMLGKN-PMVMPFDFSA-N Tyr-Trp-Leu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(O)=O)C1=CC=C(O)C=C1 BXJQKVDPRMLGKN-PMVMPFDFSA-N 0.000 description 1
- JRMCISZDVLOTLR-BVSLBCMMSA-N Tyr-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N JRMCISZDVLOTLR-BVSLBCMMSA-N 0.000 description 1
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 1
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 1
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 1
- BUPRFDPUIJNOLS-UFYCRDLUSA-N Tyr-Tyr-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O BUPRFDPUIJNOLS-UFYCRDLUSA-N 0.000 description 1
- TYGHOWWWMTWVKM-HJOGWXRNSA-N Tyr-Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 TYGHOWWWMTWVKM-HJOGWXRNSA-N 0.000 description 1
- KSGKJSFPWSMJHK-JNPHEJMOSA-N Tyr-Tyr-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSGKJSFPWSMJHK-JNPHEJMOSA-N 0.000 description 1
- FZADUTOCSFDBRV-RNXOBYDBSA-N Tyr-Tyr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 FZADUTOCSFDBRV-RNXOBYDBSA-N 0.000 description 1
- VKYDVKAKGDNZED-STECZYCISA-N Tyr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N VKYDVKAKGDNZED-STECZYCISA-N 0.000 description 1
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 1
- MUPFEKGTMRGPLJ-UHFFFAOYSA-N UNPD196149 Natural products OC1C(O)C(CO)OC1(CO)OC1C(O)C(O)C(O)C(COC2C(C(O)C(O)C(CO)O2)O)O1 MUPFEKGTMRGPLJ-UHFFFAOYSA-N 0.000 description 1
- HSRXSKHRSXRCFC-WDSKDSINSA-N Val-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(O)=O HSRXSKHRSXRCFC-WDSKDSINSA-N 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 1
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- KOPBYUSPXBQIHD-NRPADANISA-N Val-Cys-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KOPBYUSPXBQIHD-NRPADANISA-N 0.000 description 1
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 1
- PGBJAZDAEWPDAA-NHCYSSNCSA-N Val-Gln-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N PGBJAZDAEWPDAA-NHCYSSNCSA-N 0.000 description 1
- UPJONISHZRADBH-XPUUQOCRSA-N Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UPJONISHZRADBH-XPUUQOCRSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 1
- HQYVQDRYODWONX-DCAQKATOSA-N Val-His-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N HQYVQDRYODWONX-DCAQKATOSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- PNVLWFYAPWAQMU-CIUDSAMLSA-N Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)C(C)C PNVLWFYAPWAQMU-CIUDSAMLSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- WDIWOIRFNMLNKO-ULQDDVLXSA-N Val-Leu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WDIWOIRFNMLNKO-ULQDDVLXSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 1
- ZEBRMWPTJNHXAJ-JYJNAYRXSA-N Val-Phe-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N ZEBRMWPTJNHXAJ-JYJNAYRXSA-N 0.000 description 1
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 1
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- GVRKWABULJAONN-VQVTYTSYSA-N Val-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVRKWABULJAONN-VQVTYTSYSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- VEYJKJORLPYVLO-RYUDHWBXSA-N Val-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VEYJKJORLPYVLO-RYUDHWBXSA-N 0.000 description 1
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 1
- MOJFVLVTLZDQGW-AVGNSLFASA-N Val-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C MOJFVLVTLZDQGW-AVGNSLFASA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 101710175177 Very-long-chain 3-oxoacyl-CoA reductase Proteins 0.000 description 1
- 241000221013 Viscum album Species 0.000 description 1
- 210000003486 adipose tissue brown Anatomy 0.000 description 1
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 210000004027 cell Anatomy 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 238000003200 chromosome mapping Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 239000006071 cream Substances 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 229910001873 dinitrogen Inorganic materials 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 230000005014 ectopic expression Effects 0.000 description 1
- 150000002066 eicosanoids Chemical class 0.000 description 1
- IQLUYYHUNSSHIY-HZUMYPAESA-N eicosatetraenoic acid Chemical compound CCCCCCCCCCC\C=C\C=C\C=C\C=C\C(O)=O IQLUYYHUNSSHIY-HZUMYPAESA-N 0.000 description 1
- 235000004626 essential fatty acids Nutrition 0.000 description 1
- 208000016427 familial adult myoclonic epilepsy Diseases 0.000 description 1
- 108010060641 flavanone synthetase Proteins 0.000 description 1
- ZGNITFSDLCMLGI-UHFFFAOYSA-N flubendiamide Chemical compound CC1=CC(C(F)(C(F)(F)F)C(F)(F)F)=CC=C1NC(=O)C1=CC=CC(I)=C1C(=O)NC(C)(C)CS(C)(=O)=O ZGNITFSDLCMLGI-UHFFFAOYSA-N 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- KZNQNBZMBZJQJO-YFKPBYRVSA-N glyclproline Chemical compound NCC(=O)N1CCC[C@H]1C(O)=O KZNQNBZMBZJQJO-YFKPBYRVSA-N 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 238000000265 homogenisation Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000001764 infiltration Methods 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 229960004488 linolenic acid Drugs 0.000 description 1
- XIXADJRWDQXREU-UHFFFAOYSA-M lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 1
- 210000001853 liver microsome Anatomy 0.000 description 1
- 210000005228 liver tissue Anatomy 0.000 description 1
- 239000006210 lotion Substances 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010043322 lysyl-tryptophyl-alpha-lysine Proteins 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 150000004702 methyl esters Chemical class 0.000 description 1
- 239000006151 minimal media Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 235000021436 nutraceutical agent Nutrition 0.000 description 1
- 235000020660 omega-3 fatty acid Nutrition 0.000 description 1
- 235000020665 omega-6 fatty acid Nutrition 0.000 description 1
- 229940094443 oxytocics prostaglandins Drugs 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 150000003180 prostaglandins Chemical class 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- MUPFEKGTMRGPLJ-ZQSKZDJDSA-N raffinose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO[C@@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)O1 MUPFEKGTMRGPLJ-ZQSKZDJDSA-N 0.000 description 1
- 230000007115 recruitment Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- FKHIFSZMMVMEQY-UHFFFAOYSA-N talc Chemical compound [Mg+2].[O-][Si]([O-])=O FKHIFSZMMVMEQY-UHFFFAOYSA-N 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 235000021122 unsaturated fatty acids Nutrition 0.000 description 1
- 150000004670 unsaturated fatty acids Chemical class 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- YSGSDAIMSCVPHG-UHFFFAOYSA-N valyl-methionine Chemical compound CSCCC(C(O)=O)NC(=O)C(N)C(C)C YSGSDAIMSCVPHG-UHFFFAOYSA-N 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 238000011179 visual inspection Methods 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1025—Acyltransferases (2.3)
- C12N9/1029—Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P43/00—Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/05—Animals comprising random inserted nucleic acids (transgenic)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Medicinal Chemistry (AREA)
- Genetics & Genomics (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- General Chemical & Material Sciences (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Pharmacology & Pharmacy (AREA)
- Microbiology (AREA)
- Animal Behavior & Ethology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Coloring Foods And Improving Nutritive Qualities (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Acyclic And Carbocyclic Compounds In Medicinal Compositions (AREA)
Abstract
A isolated polypeptide comprising a functional long chain polyunsaturated fatty acid (PUFA) elongase.
Description
POLYSATURATED FATTY ACID (PUFA) ELONGASE FROM CAENORHABDITIS ELEGANS
The present invention relates to polyunsaturated fatty acid (PUFA) elongases.
More specifically, the invention relates to a DNA sequence from C. elegans encoding a PUFA
elongase.
Unsaturated fatty acids are essential components required for normal cellular function, being involved in a diverse number of roles ranging from membrane fluidity to acting as signal molecules (Gill, L, Valivety, R. (1997). Trends ~iotechnol. 15, 401-409;
Broun, P., et al (1999) Ann. Rev. Nutr. 19, 197-216). In particular, the class of fatty acids known as the polyunsaturated fatty acids (PUFAs) has attracted considerable interest as pharmaceutical and nutraceutical compounds (Broun supra; Horrobin, D. F. (1990) Reviews in Contemp Pharmacotherpy 1, 1-45).
The synthesis of PUFAs i.e. fatty acids of 18 carbons or more in length and containing two or more double bonds, is thought to be catalyzed in a variety of organisms by a specific fatty acid elongase enzyme. This elongase is responsible for the addition of 2 carbon units to an 18 carbon PUFA, resulting in a 20 carbon fatty acid. An example of this reaction is the elongation of Y-linolenic acid (GLA; 18:306'~~12) to di-homo-y-linolenic acid (DHGLA;
20:348°"°'4) in which the tri-unsaturated 18 carbon fatty acid is elongated by the addition of a two carbon unit to yield the tri-unsaturated 20 carbon fatty acid. Since there is considerable interest in the production of long chain PUFAs of more than 18 carbons in chain length, for example arachidonic acid and eicosapentanoic acid, the identification of this enzyme is of both academic and commercial interest.
At present, there are no examples of identified cloned genes encoding PUFA
elongases, though a number of genes encoding enzymes likely to be involved in other aspects of lipid synthesis have been identified. For example, an Arabidopsis gene (FAE1) has been shown to be required for the synthesis of very long chain monounsaturated fatty acids (such as erucic acid; 20:14") (James, D. W. et al, (1995) Plant Cell7, 309-319).
However, it is clear that this enzyme does not recognize di- and tri-unsaturated 18 carbon fatty acids, for example, linoleic acid, 18:209~'z or a,-linolenic acid, 18:39''2,15 respectively, as substrates, SUBSTITUTE SHEET (RULE 26) and is therefore not involved in the synthesis of long chain PUFAs (Millar &
Kunst (1997), Plant Journal 12, 121-131). This in itself is not surprising; since, of the plant kingdom, only a very few lower plant species, such as the moss Physcomicotrella patens (Girke et al., (1998), Plant J, 15: 39-48); are capable of synthesising long chain PLJFAs, and therefore Arabidopsis would not be expected to contain any such enzymes (Napier et al.
(1997), Biochem J, 328: 717-720; Napier et al., (1999) Trends in Plant Sci 4, 2-5).
A schematic diagram representing a generalized pathway for the product of PLTFAs is shown in Figure 1. Biochemical characterisation of mammalian elongation systems (most notably from liver microsomes) has indicated that a mammalian elongase consists of four subunits, made up of a condensing enzyme, a (3-ketoreductase, a dehydrase and an enoyl reductase (reviewed in Cinti, D. L., et al (1992) Prog. Lipid Res. 31, 1-51).
The Arabidopsis FAEI gene product encodes a polypeptide of 56kDa, which shows very limited homology to condensing enzymes such as chalcone synthase and stillbene synthase (James, D. W. supra). Although FAEI is normally only expressed in seed tissues, ectopic expression in non-seed tissue (or heterologously in yeast) revealed that FAEI
could direct the synthesis of erucic acid (Millar, A. A., Kunst, L. (1997) Plant J. 12, 121-131).
Three fatty acid elongase activities have been characterised from the yeast S.
cerevisiae.
Again, this organism does not synthesis PUFAs, and therefore does not contain genes encoding a PUFA elongase. One gene ELO1, was identified on the basis of a screen to isolate mutants defective in elongation of 14 carbon (i.e. medium) chain saturated fatty acids (Toke & Martin (1996) J Biol Chem 271, 18413-18422). Complementation of elol mutants restored viability, and the ELOl gene product was shown to encode a polypeptide which was responsible for the specific elongation of 14:0 fatty acids to 16:0 fatty acids.
Two related genes were also detected in the genome of S. cerevisiae, and their function determined by disruption. These two genes, subsequently named EL02 and EL03, were shown to be involved in the elongation of the very long chain saturated fatty acids found in sphingolipid molecules (Oh et al (1997), J. Biol Chem 272, 17376-17384). In particular, EL02 was required for elongation of fatty acids up to 24 carbons, and EL03 was required for elongation of the 24 carbon fatty acid to 26 carbons. However, neither gene was essential for viability. Examination of the these three fatty acid elongases revealed the presence of a conserved "histidine box" motif (Shanklin et al., (1994), Biochemistry, 33, 12787-12794) (His-X-X-His-His, where X is any amino acid) towards the centre of the polypeptide sequences. Importantly, there was no detectable homology between the yeast elongases (EL01,2,3) and the plant very long chain mono-unsaturated fatty acid elongase (FAE1) (Oh et al, supra).
In order to identify genes encoding PUFA elongases, it is necessary to study systems in which the synthesis of PUFAs is well documented; a good example of this is the model animal system C. elegans, a small free-living worm (Tanaka et al., (1996), Lipids 31, 1173-1178). C. elegans, like most other animals, and in contrast to higher plants, synthesises PUFAs such as arachidonic acid (AA; ZO:4 05~g~31,14) as precursors to a class of molecules known as the eicosanoids, which in turn serve as precursors for compounds such as prostaglandins and leucotrienes (Horrobin, (1990), Reviews in Contemp Pharmacotherpy, 1:1-45). The presence of AA and other long chain polyunsaturated fatty acids in C. elegans is well documented (Tanaka et al., (1996), Lipids 31, 1173-1178). The complete sequence of the nematode's genome is now publicly available (The C elegans consortium, 1998, Science 282, 2012-2018: Database at http:llwww.sanger.ac.uklProjectslC elganslblast server,shtml).
An object of the invention is to provide an isolated PUFA elongase.
Using the above-mentioned C. elegans genomic sequence, together with suitable search strings, the inventors identified eight related putative open reading frames (ORFs) encoding for PUFA elongases. A number of different search criteria were applied to identify a number of (ORFs) which were likely to encode polypeptides with fatty acid elongase activities. These ORFs were then subject to functional characterisation by heterologous expression in yeast, allowing the identification of a PUFA elongase.
Accordingly, a first aspect of the invention provides an isolated polypeptide comprising a functional long chain polyunsaturated fatty acid (PUFA) elongase i.e. the polypeptide has the function of extending the chain length of an 18 carbon PUFA to 20 carbons in length.
The present invention relates to polyunsaturated fatty acid (PUFA) elongases.
More specifically, the invention relates to a DNA sequence from C. elegans encoding a PUFA
elongase.
Unsaturated fatty acids are essential components required for normal cellular function, being involved in a diverse number of roles ranging from membrane fluidity to acting as signal molecules (Gill, L, Valivety, R. (1997). Trends ~iotechnol. 15, 401-409;
Broun, P., et al (1999) Ann. Rev. Nutr. 19, 197-216). In particular, the class of fatty acids known as the polyunsaturated fatty acids (PUFAs) has attracted considerable interest as pharmaceutical and nutraceutical compounds (Broun supra; Horrobin, D. F. (1990) Reviews in Contemp Pharmacotherpy 1, 1-45).
The synthesis of PUFAs i.e. fatty acids of 18 carbons or more in length and containing two or more double bonds, is thought to be catalyzed in a variety of organisms by a specific fatty acid elongase enzyme. This elongase is responsible for the addition of 2 carbon units to an 18 carbon PUFA, resulting in a 20 carbon fatty acid. An example of this reaction is the elongation of Y-linolenic acid (GLA; 18:306'~~12) to di-homo-y-linolenic acid (DHGLA;
20:348°"°'4) in which the tri-unsaturated 18 carbon fatty acid is elongated by the addition of a two carbon unit to yield the tri-unsaturated 20 carbon fatty acid. Since there is considerable interest in the production of long chain PUFAs of more than 18 carbons in chain length, for example arachidonic acid and eicosapentanoic acid, the identification of this enzyme is of both academic and commercial interest.
At present, there are no examples of identified cloned genes encoding PUFA
elongases, though a number of genes encoding enzymes likely to be involved in other aspects of lipid synthesis have been identified. For example, an Arabidopsis gene (FAE1) has been shown to be required for the synthesis of very long chain monounsaturated fatty acids (such as erucic acid; 20:14") (James, D. W. et al, (1995) Plant Cell7, 309-319).
However, it is clear that this enzyme does not recognize di- and tri-unsaturated 18 carbon fatty acids, for example, linoleic acid, 18:209~'z or a,-linolenic acid, 18:39''2,15 respectively, as substrates, SUBSTITUTE SHEET (RULE 26) and is therefore not involved in the synthesis of long chain PUFAs (Millar &
Kunst (1997), Plant Journal 12, 121-131). This in itself is not surprising; since, of the plant kingdom, only a very few lower plant species, such as the moss Physcomicotrella patens (Girke et al., (1998), Plant J, 15: 39-48); are capable of synthesising long chain PLJFAs, and therefore Arabidopsis would not be expected to contain any such enzymes (Napier et al.
(1997), Biochem J, 328: 717-720; Napier et al., (1999) Trends in Plant Sci 4, 2-5).
A schematic diagram representing a generalized pathway for the product of PLTFAs is shown in Figure 1. Biochemical characterisation of mammalian elongation systems (most notably from liver microsomes) has indicated that a mammalian elongase consists of four subunits, made up of a condensing enzyme, a (3-ketoreductase, a dehydrase and an enoyl reductase (reviewed in Cinti, D. L., et al (1992) Prog. Lipid Res. 31, 1-51).
The Arabidopsis FAEI gene product encodes a polypeptide of 56kDa, which shows very limited homology to condensing enzymes such as chalcone synthase and stillbene synthase (James, D. W. supra). Although FAEI is normally only expressed in seed tissues, ectopic expression in non-seed tissue (or heterologously in yeast) revealed that FAEI
could direct the synthesis of erucic acid (Millar, A. A., Kunst, L. (1997) Plant J. 12, 121-131).
Three fatty acid elongase activities have been characterised from the yeast S.
cerevisiae.
Again, this organism does not synthesis PUFAs, and therefore does not contain genes encoding a PUFA elongase. One gene ELO1, was identified on the basis of a screen to isolate mutants defective in elongation of 14 carbon (i.e. medium) chain saturated fatty acids (Toke & Martin (1996) J Biol Chem 271, 18413-18422). Complementation of elol mutants restored viability, and the ELOl gene product was shown to encode a polypeptide which was responsible for the specific elongation of 14:0 fatty acids to 16:0 fatty acids.
Two related genes were also detected in the genome of S. cerevisiae, and their function determined by disruption. These two genes, subsequently named EL02 and EL03, were shown to be involved in the elongation of the very long chain saturated fatty acids found in sphingolipid molecules (Oh et al (1997), J. Biol Chem 272, 17376-17384). In particular, EL02 was required for elongation of fatty acids up to 24 carbons, and EL03 was required for elongation of the 24 carbon fatty acid to 26 carbons. However, neither gene was essential for viability. Examination of the these three fatty acid elongases revealed the presence of a conserved "histidine box" motif (Shanklin et al., (1994), Biochemistry, 33, 12787-12794) (His-X-X-His-His, where X is any amino acid) towards the centre of the polypeptide sequences. Importantly, there was no detectable homology between the yeast elongases (EL01,2,3) and the plant very long chain mono-unsaturated fatty acid elongase (FAE1) (Oh et al, supra).
In order to identify genes encoding PUFA elongases, it is necessary to study systems in which the synthesis of PUFAs is well documented; a good example of this is the model animal system C. elegans, a small free-living worm (Tanaka et al., (1996), Lipids 31, 1173-1178). C. elegans, like most other animals, and in contrast to higher plants, synthesises PUFAs such as arachidonic acid (AA; ZO:4 05~g~31,14) as precursors to a class of molecules known as the eicosanoids, which in turn serve as precursors for compounds such as prostaglandins and leucotrienes (Horrobin, (1990), Reviews in Contemp Pharmacotherpy, 1:1-45). The presence of AA and other long chain polyunsaturated fatty acids in C. elegans is well documented (Tanaka et al., (1996), Lipids 31, 1173-1178). The complete sequence of the nematode's genome is now publicly available (The C elegans consortium, 1998, Science 282, 2012-2018: Database at http:llwww.sanger.ac.uklProjectslC elganslblast server,shtml).
An object of the invention is to provide an isolated PUFA elongase.
Using the above-mentioned C. elegans genomic sequence, together with suitable search strings, the inventors identified eight related putative open reading frames (ORFs) encoding for PUFA elongases. A number of different search criteria were applied to identify a number of (ORFs) which were likely to encode polypeptides with fatty acid elongase activities. These ORFs were then subject to functional characterisation by heterologous expression in yeast, allowing the identification of a PUFA elongase.
Accordingly, a first aspect of the invention provides an isolated polypeptide comprising a functional long chain polyunsaturated fatty acid (PUFA) elongase i.e. the polypeptide has the function of extending the chain length of an 18 carbon PUFA to 20 carbons in length.
This polypeptide can be used to elevate PUFA levels in animals, thereby providing a ready source of PLTFAs.
The polypeptide may be from a eukaryote.
The polypeptide may comprise at least a portion of the amino acid shown in SEQ
>D. 15, or variants thereof.
For the purposes of the present application, the term "variant" in relation to a certain sequence means a protein or polypeptide which is derived from the sequence through the insertion or deletion of one or more amino acid residues or the substitution of one or more ammo acid residues with amino acid residues having similar properties, e.g.
the replacement of a polar amino acid residue with another polar amino acid residue, or the replacement of a non-polar amino acid residue with another non-polar amino acid residue. In all cases, variants must have an elongase function as defined herein.
A second aspect of the invention provides a polypeptide having at least 60 %
homology to a polypeptide according to a first aspect of the invention. The polypeptide may have at least $0%, or as much as 90% or more homology to a polypeptide according to a first aspect of the invention.
The polypeptide according to either aspect of the invention may include a sequence motif responsible for Endoplasmic Reticulum (ER) - retention. This allows the polypeptide to be specifically located or targeted to the ER of a cell.
The polypeptide may also be able to elongate palmitoleic acid (PA; 16:1~~) to vacceric acid (VA; 18:14"). Thus, the polypeptide is also capable of elongation of a ~9-monounsaturated 16C fatty acid.
Preferably, the polypeptide is from an animal, more preferably, the animal is an invertebrate such as a worm. Where the animal is a worm, it is preferably C. elegans.
Alternatively, the animal is a vertebrate, preferably a mammal such as a human, rat or mouse.
A third aspect of the invention provides an isolated DNA sequence, preferably a cDNA
sequence, encoding a polypeptide according to a first or second aspect of the invention.
This DNA sequence may be used to engineer transgenic organisms.
Preferably, the DNA sequence comprises the sequence shown in SEQ 1D NO: 7 or variants of that sequence due, for example, to base substitutions, deletions, and/or additions.
A fourth aspect of the invention provides an engineered organism, such as a transgenic animal, engineered to express a polypeptide according to a first or second aspect of the invention. The engineered organism may be engineered to express elevated levels of the polypeptide, thereby providing a supply of polypeptide at a reduced cost as a reduced number of organisms need be used.
Preferably, the engineered organism is a mammal such as a rat, mouse or monkey.
A fifth aspect of the invention provides an engineered organism containing a synthetic pathway for the production of a polypeptide according to a first or second aspect of the invention. This has the advantage of allowing greater control over the production of PUFAs by the pathway by an organism.
The pathway may include D'-fatty acid desaturase, and/or 46-fatty acid desaturase.
The engineered organism according to a fourth or fifth aspect of the invention may be a lower eukaryote, such as yeast. Alternatively, the transgenic organism may be a fish.
A sixth aspect of the invention provides a transgenic plant engineered to express a polypeptide according to a first aspect of the invention.
A seventh aspect of the invention provides a transgenic plant containing a DNA
sequence according to a third aspect of the invention.
An eighth aspect of the invention provides a method of producing a PUFA
comprising carrying out an elongase reaction catalysed by a polypeptide according to a first or second aspect of the invention.
The PUFA may be di-homo-gamma-linoleic acid (20:308'"''4), arachidonic acid (20:4058°"''4), eicosapentanoic acid (20:5~5~8~"~'4~"), docosatrienoic acid (22:3~3~'6~'9), docosatetraenoic acid (22:4~'~'°~'3~'6), docosapentaenoic acid (22:50'''°~~3,16,19) or docosahexaenoic acid (22:54''''°,13.16,19)_ The PLJFA may be a 24 carbon fatty acid with at least 4 double bonds.
A ninth aspect of the invention provides a PUFA produced by a method according to an eighth aspect of the invention.
The PLTFA may be used in foodstuffs, dietary supplements or pharmaceutical compositions.
A tenth aspect of the invention provides a foodstuff comprising a PUFA
according to a fifth aspect of the invention. The foodstuff can be fed to an animal.
An eleventh aspect of the invention provides a dietary supplement comprising a PUFA
according to a fifth aspect of the invention. The dietary supplement can be supplied to an animal to augment its PUFA levels.
An twelfth aspect of the invention provides a pharmaceutical composition comprising a polypeptide according to a first or second aspect of the invention or a PLTFA
according to a ninth aspect of the invention.
Preferably, the pharmaceutical composition comprises a pharmaceutically-acceptable diluent, earner, excipient or extender. This allows the composition to be supplied in a form which best suits the pharmaceutical application in question. Fox example, a topical application would preferably be a cream or lotion, whereas if the composition was to be ingested a different form would be more suitable.
A thirteenth aspect of the invention provides a method of treatment of an animal, such as a mammal, or a plant, comprising supplying to the animal or plant a DNA sequence according to a third aspect of the invention, a foodstuff according to a tenth aspect of the invention, a dietary supplement according to an eleventh aspect of the invention, a pharmaceutical composition according to a twelfth aspect of the invention or a PUFA according to a ninth aspect of the invention.
Preferably, the mammal is a human.
The invention will now be further described, by way of example only, with reference to SEQ
>D1 to 16, and Figures 2 to 11, in which;
SEQ 1D 1 to 8 show the putative ORFs encoding PUFA elongases A to H
respectively; and SEQ ID9 to 16 show the deduced amino acid sequences of the putative ORFs of SEQ ID
NO: 1 to 8 respectively; and Figures 2 to 9 show hydrophobicity plots for each of PUFA elongases A to H
respectively.
Figure 10 shows an amino acid sequence line-up comparing the C'. elegans ORF
F56H11.4 (Z68749) with related sequences.
Figure 11 shows chromatograms of fatty acid methyl esters from transformed yeast.
Introduction to general strategy Initially the C. elegans databases were searched for any sequences which showed low levels of homology to yeast ELO genes (EL02 and EL03) using the TBLASTN programme. A
similar search was carried out using short (20 to 50 amino acid) stretches of ELO genes which were conserved amongst the three ELO polypeptide sequences. C. elegans sequences which were identified by this method were then used themselves as search probes, to identify any related C. elegans genes which the initial search with the yeast sequences failed to identify. This was necessary because the level of homology between the yeast ELO genes and ~ worm genes is always low (see BLAST scores later). To allow for a more sensitive search of worm sequences, a novel approach was adopted to circumvent the major drawback with searches using the BLAST programmes, namely that the search string (i.e.
the input search motif) must be longer than 1 S characters for the algorithm to work.
Thus, if it was desired to search for a short motif (like a histidine box), then the BLAST
programme would not be capable of doing this. A complete list of all the predicted ORFs present in the C.
elegans genome exists as a database called Wormpep, which is freely available from the Sanger WWW site (http://www.sanger.ac.uk/Projects/C_eleganslwebace front end.shtml).
The latest version of Wormpep was down loaded to the hard disc of a Pentium PC, and re-formatted as a Microsoft Word6 document, resulting in a document of about 3,500 pages.
This was then searched using the "Search & Replace" function of Word6, which also allows for the introduction of "wildcard" characters into the search motif. So, for example, it is possible to search both for the short text string HPGG, which would identify any predicted worm ORF present in the Wormpep 3,500 page document containing this motif, or alternatively search with HPGX (where X is a wild card character). Clearly, such (manual) searches of a 3,500 page document are extremely time-consuming and demanding, also requiring visual inspection of each and every identified ORF. For example, searching with a motif such as HXXHH identifies in excess of 300 different ORFs. However, by using a number of different short search strings (as outlined below), and combining these with other methods for identifying putative elongase enzymes, a number of candidate ORFs have been identified.
Database search using the FAE1 polypeptide sequence As a negative control, to demonstrate that the FAEl gene sequence was unlikely to provide a useful search sequence in the identification of C.elegans sequences encoding for PUFA
elongases, the GenBank databases (http://www.ncbi.nlm.nih.gov/Web/Search/index.html) were searched using the Arabidopis FAE1 polypeptide sequence to identify related genes or expressed sequence transcripts (ESTs). GenBank is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences (Nucleic Acid Research (1998) 26, 1-7). There are approximately 2,162,000,000 bases in 3,044,000 sequence records as of December 1998. The search was carried out using the BLAST2 (Basic Local Alignment Search Tool) algorithm (Altschul et al., (1990) ,I Mol Biol 215,403,410) Although a number of plant ORFs and ESTs were reported as being related, no animal sequences were identified by this search, confirming the observation that FAE1 was unlikely to be a suitable candidate as a search template for PUFA elongases.
Database search using yeast ELO sequences Using the three yeast fatty acid elongase sequences (ELO 1, 2, 3) as probes, a number of putative ORFs in the DNA of C. elegans-derived cosmid sequences which form the C.
elegans genomic sequence database were identified. Moreover, an extensive and time-consuming search of a downloaded copy of the WormPep database (ftp://ftp.sanger.ac.uk./pub/databases/wormpep) using manual search strings in MSWord 6, identified a number of C. elegans ORFs which contained presumptive histidine boxes.
Wormpep contains predicted proteins from the Caenorhabditis elegans genome sequence project, which is carried out jointly by the Sanger Centre in Cambridge, UK
and Genome Sequencing Center in St. Louis, USA. The current Wormpep database, Wormpep 16, contains 16,332 protein sequences (7,120,115 residues). Search strings used included [HXXHH], [HXXXHH], [QXXHH] and [YHH]. Comparison of the data from the two different searches indicated a small (<10) number of putative ORFs as candidate elongases.
The histidine box motifs are shown in bold in SEQ >D 9 to 16.
Hydrophobicity plot analysis Since the fatty acid elongase reaction is predicted to be carried out on the cytosolic face of the endomembrane system (Toke & Martin (1996), supra; Oh et al (1997), supra), the putative C. elegans ORFs were examined for potential membrane spanning domains, via Kyte & Doolittle hydrophobicity plots (J. Mol Biol, (1982), 157, 105-.132).
This revealed a number of ORFs with possible membrane-spanning domains, and also indicated a degree of similarity in the secondary-structure of a number of identified ORFs.
Screening for ER-retention signal sequences The inventors postulated that since fatty acid elongases are expected to be endoplasmic reticulum (ER) membrane proteins, they might be expected to have peptide signals which are responsible for "ER-retention". In the case of ER membrane proteins, this signal often takes the form of a C-terminal motif [K-K-XZ_,-Stop], or similar variants thereof (Jackson et al., (1990), EMBO J., 9, 3153-3162). Further sequence analysis of the C.
elegans putative elongases revealed that 4 ORFs (F41H10.7, F41H10.8, F56H11.4, Y53F4B.c) had C-terminal motifs that exactly matched this search pattern, and that a further 2 ORFs (F11E6.5, C40H1.4) had related sequences. These sequence motifs are underlined in SEQ
m 9 to 13, 15 and 16.
Chromosome mapping Since the inventors had previously observed that C.elegans genes involved in the synthesis of PUFA may exist in tandem (for example the 05 and 06 desaturases required for AA and GLA synthesis, respectively, are < 1 kB apart on chromosome N (Michaelson et al., (1998), FEBS Letts 439, 215-218), the positions of the putative C. elegans elongase ORFs were determined using the Sanger Centre's WebAce C. elegans server (http://www.sanger.ac.uk/Projects/C elegans/webace-front end.shtml).. This indicated that two pairs of putative elongases were in close proximity to each other on the C. elegans chromosome N.
F41H10.7 and F41H10.8 were identified as being approximately 10 Kb apart on chromosome N, and F56H11.3 and F56H11.4 were identified as being approximately 2 Kb apart on chromosome N.
Putative C. elegans fatty acid elongases The positions of the putative OIRF's in the C. elegans genome are shown below i.e.
chromosome number, and map position in centiMorgans, together with the GenBank database accession numbers.
The designations used employ the same method as used on the Sanger Centre's C
elegans database, i.e. ORF C40H1.4 is predicted coding sequence 4 on cosmid C40H1.
Elon ase Cosmid Saner ID GenBank Acc Chromosome Code A C40H 1.4 Z 19154 III
B D2024.3 U41011 N, 7.68 C F11E6.5 281058 IV, 18.8 D F41H10.7* U61954 N, 29.8 E F41H10.8* U61954 N, 29.8 F F56H11.3# 268749 IV, 2.5 G F56H11.4~ 268749 IV, 2.5 H Y53F4B.c 292860 II
* ors indicates genes in tandem Comparison of C. elegans putative elongase ORFs with yeast genes:
Each of the three yeast ELO polypeptides were compared against all of the worm putative elongase translated ORF sequences, and then ranked in order of similarity (as measured by the BLAST score) (Altschul et al (1990), supra) The results are shown below, with the ORF sequences ranked from most similar to least similar, and the BLAST scores are shown in brackets:
Yeast ELO1 (14 to 16 carbon fatty acid elongase) G (262) > E (241) > D (225) > C (219) > A (216) > F (215) > H (197) > B (172) Yeast EL02 (24 carbon sphingolipid elongase) E(231)>C(226)>G(189)>A(181)>F(166)>D(150)>H(141)>B(140) Yeast EL03 (24 to 26 sphingolipid elongase) D(171)>G(163)>F(154)>A(152)>E(150)>C(131)>B(132)>H(128) It is clear from the numeric values of the BLAST scores that the sequences are related, but the levels of homology are low. For comparison, the BLAST score for homology between two related worm proteins, the 05 and the 46 desaturase is in excess of 500.
Analysis of potential sphingolipid ancestry Previously, the inventors had noted the similarities between the fatty acid D6 desaturase and sphingolipid desaturases in plants, and that the two distinct enzymes could have arisen from one ancestral gene. Moreover, it was considered likely that the sphingolipid desaturase predated the fatty acid desaturase, and may in fact have been the ancesteral progenitor.
Therefore it is plausible that the next step in the arachidonic acid biosynthetic pathway has also evolved from the sphingolipid metabolic pathway. It is therefore considered highly significant that some of the C. elegans ORF putative elongases have similarity to sphingolipid enzymes. For this reason, these ORFs are considered to be very clear candidates for PUFA elongases. It has previously been considered that the C.
elegans 05 and D6 fatty acid desaturases have evolved from 1 ancestral gene (Michaelson et al., ( 1998), FEBS Letts 439, 215-218). It is also significant that one pair of C. elegans putative elongase ORFs (F & G) genetically maps close to the 05/06 fatty acid desaturase genes, with both gene pairs being located at the top end of chromosome IV.
Cosmid San eg r ID GenBank Acc Chromosome Encoded Peptide Code W08D2.4 270271 IV, 3.06 06 fatty acid desaturase T13F2.1 281122 IV, 3.06 45 fatty acid desaturase Cloning of Desaturase and Elongase Genes in Yeast Expression Vectors Putative elongases sequences F56H11.4 and F41H10.8 were cloned by PCR into the pYES2 vector (Invitrogen). A C. elegans mixed stage cDNA library was used as a PCR
template. F56H11.4 was amplified using primers:
56h114.for 5'-GCGGGTACCATGGCTCAGCATCCGCTC-3' and;
56h1 l4.rev 5'-GCGGGATCCTTAGTTGTTCTTCTTCTT-3'.
F41H10.8 was amplified using primers:
41h108.for 5'-GCGGGTACCATGCCACAGGGAGAAGTC-3' and;
41h108.rev 5'-GCGGGATCCTTATTCAATTTTTCTTTT-3'.
Amplified sequences were then restricted using Kpnl and BamHI (underlined in the forward and reverse primers, respectively), purified using the Qiagen PCR
purification kit, and ligated into a KpnI/BamHI cut pYes2 vector.
An ORF encoding the Mortierella alpina 45-fatty acid desaturase (Michaelson, L. V., et al (1998) J. Biol. Chem. 273, 19055-19059) was amplified using primers:
MadS.for 5'-GCGAATTCACCATGGGTACGGACCAAGGA-3' and;
MadS.rev 5'-GCGGAGCTCCTACTCTTCCTTGGGACG-3', and restricted using EcoRI and SacI, gel purified as described and ligated into a EcoRIlSacI
cut pESC-TRP vector (Stratagene) to generate pESC/~5.
An ORF encoding the borage O6-fatty acid desaturase (Sayanova, O., et al (1997) Proc.
Natl. Acad. Sci USA 94, 4211-4216) was restricted from pGEM3 using BamHI and XhoI
and ligated into a BamHIlXhoI cut pESC-TRP vector to generate pESC/0''.
A double construct was also generated by ligating the BamHIlXhoI borage 46 insert into the pESC/OS construct described previously, generating pESC/(05,06).
Functional Characterisation in Yeast Elongases and desaturase constructs were introduced in Saccharomyces cerevisiae W303-lA using a lithium acetate based method (Elble, R. (1992) Biotechniques 13, 18-20) and expression of the transgenes was induced by addition of galactose to 2%
(w/v) as described in Napier et al (Napier, J. A., et al (1998) Biochem J330, 611-614;
Michaelson L. V., supra; Michaelson, L. V., (1998) FEBS Letts 439, 215-218). Yeast transformants containing pYES2-derived constructs were grown on synthetic minimal media (SD, the composition of which is defined in Sherman, F ( 1991 ) Methods in Enzymology 194, 3-21 );
synthetic minimal medium minus uracil; pESC-derived constructs were grown on SD
minimal medium minus tryptophan. Co-transformed yeast (containing both pYES2 and pESC derivatives) were grown on SD minimal medium minus uracil and tryptophan.
Prior to induction, cultures were grown in the presence of 2% raffinose and supplemented with 0.5 mM of the appropriate fatty acid substrate in the presence of 1 % tergitol-(NP40) (Sigma). All cultures were then grown for a further 48-h unless indicated.
Fatty Acid Analysis To identify the elongation reaction responsible for the synthesis of di-homo-y-linolenic acid (DHGLA; 20:308"~'4) from GLA, this latter fatty acid was supplied as the (exogenous) substrate.
Lipids were extracted from transformed and control yeast by homogenisation in MeOH-CHC13 using a modification of the method of Bligh and Dyer (Dickenson &
Lester (1999) Biochim Biophys Acta 1426, 347-357). The resulting CHC13 phase was evaporated to dryness under nitrogen gas and the samples were transmethylated with 1M HCl in methanol at 80 °C for 1 hour. Fatty acid methyl esters (FAMES) were extracted in hexane and purified using a small column packed with Florisil. Analysis of FAMES was conducted using a Hewlett Packard 5880A Series Gas Chromatograph equipped with a 25M x 0.32mm RSL-SOOBP bonded capilliary column and a flame ionisation detector.
Fatty acids were identified by comparison of retention times with FAME
standards (Sigma) WO 00/$5330 CA 02365096 2001-09-14 pCT/GB00/01035 separated on the same GC. Quantitation was carried out using peak height area integrals expressed as a total of all integrals (Bligh, E.G. & Dyer, W.J. (1959) Can. J.
Biochem.
Physiol. 37, 911-917).
Total fatty acids extracted from yeast cultures were analysed by gas chromatography (GC) of methyl ester derivatives. Lipids were extracted, transmethylated and the fatty acid methyl esters (FAMEs) analysed as described by Sayanova et al.
Figure 11 shows chromatograms of fatty acid methyl esters from yeast transformed with the control (empty) plasmid pYES2 (Fig. 11A) or with ORF F56H11.4 in pYES2 (Fig.
11B).
Exogenous substrate in the form of GLA was supplied to the cultures. Two novel peaks are observed in (B); these peaks (annotated as 20:3 and 18:1 *) were identified (against known standards) as DHGLA and vaccenic acid, respectively. Detection was by flame ionisation.
One cDNA ORF tested in this manner displayed a high level of elongase activity on the GLA substrate, converting 44% to DHGLA. The identity of this elongation product was confirmed as DHGLA by comparison with a known standard (the standards used were known standards for either DHGLA, AA, EPA or VA from Sigma Chemicals, Ltd.), using GCMS analysis using a Kratos MS80RFA (Napier, J. A., supra; Michaelson, L. V., supra;
Michaelson, L. V., supra). The deduced amino acid sequence of the functional elongase clone identified it as being encoded by the C. elegans gene F56H11.4, and comparison with the yeast ELO genes showed low homology confined to a few short amino acid motifs (see Fig. 10). Some similarity with a mouse gene Cig30 (Tvrdik, P., (1997) J. Biol.
Chem. 272, 31738-31746), which has been implicated in the recruitment of brown adipose tissue in liver tissue, was also observed, as well as a potential human homologue encoded by a gene located on chromosome 4q25, BAC 207d4. The most closely related C. elegans ORFs, F41H10.8 (U61954) and F56H11.3 (Z68749) are also shown, as is part of a related human gene present on chromosome N (present on BAC clone B207d4; AC004050). The GenBank accession numbers are given for all sequences.
The range of fatty acids synthesised by C. elegans can potentially require a number of different elongation reactions (Tanaka, T., (1996) Lipids 31, 1173-1178). The substrate-specificity of the F56H11.4 PUFA elongase was therefore determined using a range of exogenously supplied fatty acids. This revealed that GLA is the major substrate, with a number of other fatty acids being elongated at a lower efficiency (see Table 1 ).
Although most of these substrates are polyunsaturated fatty acids, it was unexpectedly observed that palmitoleoic. acid (PA; 16:1 09) was also elongated by F56H11.4 to yield vaccenic acid (VA; 18:1 0"). The biosynthetic pathway for VA is unclear, but the data indicate that it may be synthesised by elongation of 0~-monounsaturated 16C
fatty acid.
The C. elegans PUFA elongase ORF F56H11.4 maps to the top of chromosome IV (at 4.32 cM) with a related sequence (F56H11.3; 51 % similarity) located 1,824bp downstream.
Another C. elegans gene (F41H10.8) was also observed, which is present on chromosome IV, and which shows a slightly higher level (53%) of similarity to the PUFA
elongase than F56H11.3 (see Fig. 10). However, when a PCR product encoding ORF F41H10.8 was expressed in yeast in a manner identical to that used for F56H11.4, the former failed to direct the elongation of any fatty acids, despite the provision of a range of substrates (see Table II).
In order to reconstitute the PUFA biosynthetic pathway in a heterologous system, the PLTFA elongase F56H11.4 was expressed in yeast in conjunction with either the 06- or OS-fatty acid desaturases previously isolated and characterised by the inventor (Napier, J.
A., supra; Michaelson, L. V., supra). Expression of the OG-fatty acid desaturase and F56H11.4 was carried out in the presence of two different substrates (LA or ALA) while the 4'-fatty acid desaturase and the elongase were expressed in the presence of GLA only.
This demonstrated that was possible to combine a desaturase and an elongase in yeast to generate significant amounts of a final "product" (see Table III). In the case of the elongase and the 4G-fatty acid desaturase, the reactions proved highly efficient with the production of 4.5% of DHGLA from the LA substrate. This resulted from 25%
desaturation of the LA substrate to GLA, which was then elongated to DHGLA at a similar level of efficiency (18%). This is lower than the % conversion observed for GLA when supplied exogenously (see Table I), indicating that the in vivo production of substrates for elongation may be rate-limiting.
If ALA was used as a substrate, 27% of this was initially O6-desaturated to yield octadecatetraenoic acid (OTA; 18:4 06.9,'z,'s) but only 8% of was subsequently elongated to yield eicosatetraenoic acid (20:4 ~8~"~'4~"). Thus, the conversion efficiency of ALA to the final 20-carbon tetraenoic PUFA was only about 2.2%.
Since DHGLA is an n-6 fatty acid, whilst the OTA-derived eicostetraenoic acid is an n-3 type, this demonstrates that the elongase is capable of accepting both forms of essential fatty acid, albeit with different efficiencies. Verification was also provided that the 20C
PUFAs synthesised in the yeast expression system were generated by the O6-desaturation of 18C substrates which were subsequently elongated, as the O6-desaturase showed no activity on 20:2 or 20:3 substrates (see Table IIII).
The combination of the D5-desaturase and the elongase also demonstrated that these two enzymes could work in tandem, although the efficiency of this overall conversion was lower (3.3% AA from GLA) which was due to the previously observed low activity of the OS-desaturase enzyme itself (Michaelson, L. V., supra; Michaelson, L. V., supra). Thus, although nearly 45% of the GLA substrate was elongated to DHGLA, only 7.5% of this was then desaturated to AA (see Table III).
Finally, the production of either AA or eicosapentanoic acid (EPA;
20:5~5~8~"~'4~") in yeast from dienoic or trienoic 18 carbon substrates was achieved via expression of all three enzymes (the two desaturases and the F56H11.4 PUFA elongase) simultaneously.
As shown in Table IV, small but significant amounts of AA were produced when the yeast was supplied with the 18C dienoic fatty acid LA.
GC-Mass Spectroscopy (MS) Analysis Peak identification and confirmation were carried out by GC-MS using a Kratos using known standards (Sigma). The identity of this 20C PLTFA was verified by GCMS, indicating that the conversion efficiency from LA was 0.65%. When ALA was used as a substrate, 12.5% of the (46-desaturated and elongated) eicosatetraenoic n-3 fatty acid was OS-desaturated, resulting in a total conversion of 0.3% of the ALA substrate to EPA (the identity of EPA was confirmed by GCMS).
Expression of C. elegans elongase in plants In order to express C. elegans elongase in plants, the following protocol is an example of a process which can be used to create the transgenic plants. C. elegans ORF
sequence can be subcloned into a plant expression vector pJD330, which comprises a viral 35S
promoter, and a Nos terminator. The resulting cassette or promoter/coding sequence/terminator can then be subcloned into the plant binary transformation vector pain 19, and the resulting plasmid introduced into Agrobacterium tumefaciens. This Agrobacterium strain can then be used to transform Arabidopsis by the vacuum-infiltration of inflorescences, and the seeds harvested and plated onto selective media containing kanamycin. Since pain 19 confers resistance to this antibotic, only transformed plant material will grow.
Resistant lines can therefore be identified and self fertilized to produce homozygous material.
Leaf material can then be analyzed for expression of C. elegans elongase.
Fatty acid methyl ester analysis can be carried out as previously described.
.. .r n ..a , n~, ~~ ..
~ ~ V, ~ C
N ,n , , . , N . .
, , , , .
r oanon o +a +~ ~ +~
+ Ya+~~~, ~~~nn N V1 r, ~O . ~ , , , . , , , , O
rr y~ CO
r N "', n .
C .-, , ~ ~O +1 +1 e0 0~
A
q~
Y, n, ,. v.., O,v W
N
p .~-v ~ ..i ~ M
O O O
~
~ 'H .-n '~
.w p v, a, a aA
O ~
~ .-i , v . Y n , v , M n ~
~
~
r. h C
+1 'H
+1 . ~
b M
~
' n . n . , n . v n , .
f ~ Y
~
~
~
+ y'~.
y'~e ae r.
.. -. .. n , , , v1 N h ... M ~ v , , , t~ , W
.~'!0 0 o -~i ~i +1 +1 '~
.. o ao .r . . . .
'' 'o ~c v . .
r ~ . .
n ~
, , n I ii ~
+ '~ g fl +
n ~0 0~
N M VI ~ t.i , , ~ , , v n W 1 , ~
M
+1 1t '1i n f Y ~ n , . , , . .
, . , v .
.
yl rr ~) ~ ~
M M
O O
+ 0~ T
n ,O , , , .
:g.~~a , , , , , , , m~~~
+1 -H 'i~
. +1 ~
.
, , , , , ~ , ~ h . , , , , , Y
~r ' p~ ~ p, ooO~o'o~po~~ ~AN~ i~
SUBSTITUTE SHEET (RULE 26) WO 00/55330 2p PCT/GB00/01035 Ov N N
oo C M M N
~
+I +I +I
+I
' (~ N
~ M
M N '~ M
Vi NMV1.-l ~~ i~~~~N ~i ~
a ~.
w O M ~ N
N
i-I +I ii +I
+ O I~ M
~ M
M ~ ~ N
~n N M ~t ~ ~ ~ ~ ~ i ~
~ ~ N ~ O
N O~ M
C O '-' .-.
~
O O
d' 00 .d.
'~'~
-H
M vi vi ~: ~
N -r ~ d' ~ ~ i v ~
W O~ ~ ~ ~ ~
d a d N N 'n O CO O
M """
~ O O
+ 00 ~O
-. vi O
~
N
. mt ~ ~ ~ ~
~ ~
N M N
O O '~ O
O
-H +I i1 ~ 1-1 ' ~ ~ +~ rr rr V V' M ~D
0 ~ O
N N v1 ~ ~ ~ ~ ~ ~ i -~ M ~ ~ i O
_ x _ ,~ _ w O N N
M
o +~+~ +~
+ os ~r '~ '~
M N " ~
'"~ Q~ O
N vi s C M ~ i ~ ~ ~ ~
N ~
OW O ~D
O .-1 O
'."' .-:
-f-I -fi i-i +I
O 'n N
~ .--~
~ h N M
d a U v0 N ~D
d' O N
i-I +i +i +i + ~ V7 M
~ N
00 M ~ ~!' M ~O -~ O
N
-~
' n ~ ~ . ~ ~
M ~ ~ ~
N ~D
v~
O O '~
O
-H -H
+I
~ Qv O~ O
'~ 'd' .-~ i!1 i ~ ~ ~ ~
v1 N ~ ~ i O~ l~
M
O O "' .-.
+I ii O ii + O Ov ~ u1 ~ O N
~
.-~ v~
~t N
b N
d b y # ~ o b aa d a d d ' ~' ooaoxoa W ' A
o~o' - ..
o o d H o v~ ,~ .-, ,-, .-.a c~ N A s t~ ~
,-. ,.aN w a w .-.
WO 00/55330 2 j PCT/GB00/01035 00 N V1 h M M ~ N M
-fl -!-i ~ -H
i-I
' 00 ~ ~ O N
Qv V1 ~. M 01 N N ~ ~!1 ~ ~ ~ ~ ~i ~ ~
...r ~ ~ ~ ~ ~ ~ i ~r d ~ a _ d' M 00 N M 00 N
Ov w +i -+~ o +I +~ -: o o -~ O~ ~G '~'~ N 'H -H
l'~ ~I
h ~t ~ N ~ M
N N ~ ~n .-.,~ ~' ~ ~ C~
N ~ ~ O~ ~
M l~
Oy N et O O O
o ii +I -ti r; .--~ V: pp O
N C~ ~ v~ ~n ~O ~ ~
~O ~ I~ 00 M ~t N
M
~ C O G O ~ ~ O O
O
.o + ,~. ii ~i M M -H i-I
-H ii ~-I
C1 d r, M d; N N ,n d: '~t 'ct h ~O C;
...y'i O ~O t'~ ~ M ~~ ~ OG
V1 N W .-n~ ~ Wn ~i;
rr C '-' O N
'n O
-H ii ~ ii +I
' t~ ~O ~ N
wt M et' ~ ~
N N Wi .-r ~ ~ ~ ~ ~~ ~ i i M ~ ~ ~
a ~D N ~h N
~; .-. M r; N 'n t~ M N
~ O O ~ O N O
,i~ C
-1-l~ Ov 'f'~ -H -H
~ N ~ O -i-I
00 00 ~ ~ Os M ~ I~
cJ ~ v1 t~
.-r .r O et ~ t~ M n ~i .-i .-i t~ N ~ .--n ~ ~ oO
~
v1 I~ N
r; n1 ~; ~; 'n r; -O O
O ' N I~ ~
N ~ ~ .-~ ..~ 00 ~
N et 'd' u'W,i i ~ W1iv n i v ~ i i i i d M 00 .-r .-; N N ~t M
r.;
N +i +f '~ ~ O
+I
C ~' t~ O ~ ~ M
N ~: ~ N o0 O
,ri N d' vi rt ~ i mt' ~
.-, v ~ ~
b N
d V ~ G c~
' ~ ~
a o -~ cv, o d d cv M ~r W d d .-. *. d ~ d d x aH
b ~~~~~~d aH o o ors o d H v ~ ~ ~~~~~-.,~ ~ac~oNA N ~Nw a c~oa~
C40H1.4 atggagcttgccgagttctggaatgatctcaacaccttcaccatctacggaccgaatcac acagatatgaccacaaaatacaaatattcatatcacttcccaggtgaacaggtggcggat ccgcagtattggacgattttattccagaaatattggtatcattcgatcacaatatcagtt ctttatttcattttaattaaggtgattcaaaagtttatggagaatcgaaaaccattcact ttgaaatacccattgattctttggaatggagctcttgcagcattcagtataattgccaca ttgcggttctctattgatcctctacgatcactatatgctgaaggattctacaaaactctg tgctattcgtgtaatccaactgatgtggctgcattttggagctttgcattcgctctttcc aagattgttgaacttggagacactatgttcattattttgagaaaacggccattgatcttt ttacactactatcatcatgcagcagtgttaatctacactgtccattctggtgccgagcat actgcagctggtcgtttctacatcctaatgaactacttcgcacattctctcatgtatact tactacacagtttctgccatgggatacagattaccgaaatgggtatcaatgactgtcaca actgttcaaacaactcaaatgttagctggagtcggaataacttggatggtgtacaaagtg aaaactgaatacaagcttccttgtcaacaatccgtagccaatttgtatctcgcattcgtc atctatgtcacatttgccattcttttcattcaattcttcgtcaaggcatacattatcaag tcgtcgaagaagtcgaaatcggtgaagaacgaataa D2024.3 atggcaaaatacgactacaatccgaagtatgggttagaaaattacagcatattccttccc tttgagacatcttttgatgcatttcgatcgacaacatggatgcaaaatcactggtatcaa tcaattacagcatctgtcgtgtatgtagccgtcatttttacaggaaagaaggtggttctc atctacaaaaaatcacgagttattacttttgagtctagccttcagaatgcaattaagaat cgaaaccgaaaatcacttaatagttctcaaatgtttcagattatggaaaagtacaagccc ttccaactggacacaccactcttcgtctggaattcatttttagccattttctcaattctc gggttcctccgaatgacacctgaatttgtatggagttggtcagcagaaggaaactcattc aaatattcaatttgtcattcatcttatgctcaaggagtcactggtttctggactgaacaa ttcgcaatgagcaaacttttcgagctcatcgacacaatcttcatcgttcttcgtaaacgt ccactcatcttccttcactggtatcatcatgtaactgttatgatctacacatggcacgcg tacaaggatcacactgcatcaggacggtggttcatttggatgaattatggagttcatgct cttatgtattcctactatgctcttcgttctctgaaattccgtcttccaaaacaaatggca atggttgttactactctccaacttgctcaaatggttatgggagtaatcatcggagtcact gtctaccgtatcaagtcatcgggtgaatactgccaacagacatgggacaatttgggatta tgctttggagtttatttcacatatttccttcttttcgccaacttcttctaccatgcatat gttaagaaaaacaaccgtacagtaaattatgaaaataattcaaaaaatttccccgatctc gttttaatttacctgagaaaaaaggtttcaagaaaatcgaaaaatcggcaatgttcagaa aataattataaaattcaattttcatcaaattttgttaatgttgatggaaaaaaacataag aaaacatatgaacttattcttccaagaagaaaaatgaccacaattttaacttttctattt ggaaaaaatcgaattttttcgaaatatcagaaaaatcgaaaaaacatttcgattcctgtt gatttcgaaattctggagccaaaagaagatatcaatgctaacatcgctgagccatccatc acaacgaggtccgccgccgcacgaagaaaagttcaaaaagctgattag F11E6.5 atggcagcagcacaaacaagtccagcagccacgctcgtcgatgttttgacaaaaccatgg agtctggatcagactgattcttacatgtctacatttgtaccattatcctataaaatcatg attggttatctcgtcaccatctacttcgggcaaaaattaatggctcacagaaaaccattc gatctccaaaatacacttgctctctggaacttcgggttttcactgttctcgggaatcgcc gcctataagcttattccagaactattcggagttttcatgaaggacgggtttgtcgcttcc tactgtcaaaacgagaactactacaccgatgcatcaactggattctggggctgggccttt gtgatgtcgaaagctccagaactaggggatactatgttcttggtccttcgtaaaaaacca gttatcttcatgcactggtatcatcatgccctcacatttgtctacgcagtagtcacatac tctgagcatcaggcatgggctcgttggtctttggctctcaaccttgccgtccacactgtt atgtatttctacttcgccgttcgcgccttgaacatccaaactccacgcccagtggcaaag ttcatcactactattcaaattgtccaatttgtcatctcatgctacatttttgggcatttg gtattcattaagtctgctgattctgttcctggttgcgctgttagctggaatgtgctatcg atcggaggactcatgtacatcagttatttgttcctttttgccaagttcttctacaaggcc tacattcaaaaacgctcaccaaccaaaaccagcaagcaggagtag F41H10.7 atgtcatcggacgatcgtggcactagaaccttcaagatgatggatcaaattcttggaaca aacttcacttatgaaggtgccaaagaagttgctcgaggccttgaaggtttctcagcaaag cttgccgtcggatatattgccactatttttggactgaaatattatatgaaagaccgaaaa gccttcgatctcagtactccattaaacatttggaatggtattctttcgacattcagctta ttgggattcttattcacttttcctactttgttatcagttatcagaaaggatggatttagt cacacctattcccatgtctctgagctttacactgacagtacctctggatattggatcttc ctttgggttatctcaaagattccggaacttttggatacagtattcattgttcttcgcaag agaccacttattttcatgcactggtaccatcacgcattgaccggttactatgctcttgtc tgctaccatgaggatgctgtccatatggtttgggttgtatggatgaattatattattcat gcattcatgtatggatactatcttctgaaatctctgaaagttccaattccaccatcagtt gctcaagcaatcaccacatctcaaatggttcaattcgcagttgccattttcgcacaagtt catgtttcctataaacactatgttgagggagttgaaggattagcctactcgttcagagga acagctatcggatttttcatgcttactacctacttctatctatggattcaattctacaaa gagcactatcttaagaatggaggcaaaaagtacaatttggcaaaggatcaggcaaaaact caaacaaagaaggctaactaa F41H10.8 (ce477) atgccacagg gagaagtctc attctttgag gtgctgacaa ctgctccatt cagtcatgag ctctcaaaaa agcatattgc acagactcag tatgctgctt tctggatctc aatggcatat gttgtcgtta tttttgggct caaggctgtc atgacaaacc gaaaaccatt tgatctcacg ggaccactga atctctggaa tgcgggtctt gctattttct caactctcgg atcacttgcc actacatttg gacttctcca cgagttcttc agccgtggat ttttcgaatc ttacattcac atcggagact tttataatgg actttctgga atgttcacat ggcttttcgt tctctcaaaa gttgctgaat tcggagatac actttttatt attcttcgta aaaagccatt gatgttcctt cattggtatc atcatgtgct tacaatgaat tatgctttta tgtcatttga agctaatttg ggatttaata cttggattac atggatgaat ttctcagttc actcaattat gtatggatat tatatgcttc gttcttttgg tgtcaaggtt ccagcatgga ttgccaagaa tattacaaca atgcaaattc ttcaattcgt tattactcat ttcattcttt tccacgttgg atatttggca gttactggac aatctgttga ctcaactcca ggatattatt ggttctgcct tctcatggaa atctcttatg tcgttctgtt cggaaacttc tactatcaat catacatcaa gggaggtggc aagaagttta atgcagagaa gaagactgaa aagaaaattg aataa F56H11.3 atgtatttgaattatttcgcgacggaaatcttccatcgtagtgcggtttgtgaaacagaa gcttgtcgctcgtcaaaaataatgattgctgacgtgttcaaatggaaattcgatgcaaac gaattgtggagtcttttaacgaatcaggatgaagttttcccgcatattagagcacggcga ttcattcaagaacattttggtctattcgtccagatggcaattgcatatgtcattttggtg ttctcaatcaaaaggttcatgagggatcgtgaaccatttcaactcaccacagctcttcgt ctctggaacttcttcctctccgtcttctcaatttatggttcctggacaatgtttccattt atggttcaacaaataagactttatggtctctacggatgtggatgcgaagcactttcaaac cttccgagtcaagcagaatattggcttttcctgacgatcttgtccaaagctgtggagttt gttgatacatttttcttggttctccggaaaaaaccactcatcttcctacactggtatcat catatggcaacatttgtcttcttctgcagtaattacccgactccatcgtcacaatcacgc gtcggagttatcgtcaacctgttcgtgcatgccttcatgtacccatactatttcacccga tcaatgaacatcaaagttcctgcgaaaatttcaatggctgttacagttcttcaattgact caattcatgtgctttatctatggatgtactctcatgtactactcgttggccactaatcag gcacgatacccctcaaatacacctgcgacactccaatgtttgtcctacactctacatttg ctttga F56H11.4 (Ce 166) atg gctcagcatc cgctcgttca acggcttctc gatgtcaaat tcgacacgaa acgatttgtg gctattgcta ctcatgggcc aaagaatttc cctgacgcaga aggtcgcaa gttctttgct gatcactttg atgttactat tcaggcttcaa tcctgtacat ggtcgttgtg ttcggaacaa aatggttcat gcgtaatcgt caaccattcc aattgactat tccactcaac atctggaatttcatcctcgc cgcattttcc atcgcaggag ctgtcaaaat gaccccagag ttctttggaa ccattgccaa caaaggaatt gtcgatcctactgc aaagtgtttg atttcacgaa aggagagaat ggatactgggt gtggctctt catggcttcc aaacttttcg aacttgttga caccatcttc ttggttctccgtaaacgtcc actcatgttc cttcactggt atcaccatat tctcaccatg atctacgcctggtactctca tccattgacc ccaggattca acagatacgg aatttatctt aactttgtcg tccacgcctt catgtactct tactacttcc ttcgctcgat gaagattcgc gtgc caggattcatcgccca agctatcaca tctcttcaaa tcgttcaatt catcatctc WO 00155330 PCTlGB00I01035 t tgcgccgttcttgctcatct tggttatctc atgcacttca ccaat gccaactgt gatttcgagc catcagtatt caagctcgca gttttcatgg acacaacata cttggctctt ttcgtcaact tcttcctcca atcatatgtt ctccgcggag gaaaagacaa gtacaaggca gtgccaaaga agaagaacaa ctaa Y53F4B.c atgtcggccg aagtgtccga acgattcaaa gtttggacag gaaacaatga gaccatcatc tattccccat tcgagtacga ttccacgttg ctcatcgagt catgtcggtg tacttatcag ctgcttatat tattgcgaca aatttattac agagatatat ggagtcacgg aaacctaaaa cttttactag catggaacgg ttttttggca gtgttcagta ttatgggtac atggagattt ggaatcgaat tctacgatgc tgttttcaga agaggcttca tcgattcgat ctgcctggct gtaaatccac gttcaccgtc cgcattctgg gcatgcatgt tcgctctatc gaaaatcgcc gagtttgggg acacgatgtt cttggtgctg aggaaacggc cggttatatt ccttcactgg tatcatcacg ctgttgttct gatcctttct tggcatgctg caatcgaact cacagctcca ggacgctggt ttatttttat gaactatttg gtgcattcaa taatgtatac atactacgca ataacatcaa tcggctatcg tcttcccaaa atcgtttcaa tgactgttac attccttcaa actcttcaaa tgctcattgg tgtcagcatt tcttgcattg tgctttattt gaagcttaat ggagagatgt gccaacaatc ctacgacaat ctggcgttga gcttcggaat ctacgcctca ttcctggtgc tattctccag tttcttcaac aatgcatatt tggtaaaaaa ggacaagaaa cccgatgtga agaaggatta a A
251 SVANLYLAFV IYVTFAILFI QFFVKAYIIK SSKKSKSVKN E*
B
401 DFEILEPKED INANIAEPSI TTRSAAARRK VQKAD*
C
251 FLFAKFFYKA YIQKRSPTKT SK E*
D
251 YFYLWIQFYK EHYLKNGGKK YNLAKDQAKT QTKKAN*
E
MPQGEVSFFE VLTTAPFSHE LSKKHIAQTQ YAAFWISMAY VWIFGLKAV
MTNRKPFDLT GPLNLWNAGL AIFSTLGSLA TTFGLLHEFF SRGFFESYIH
IGDFYNGLSG MFTWLFVLSK VAEFGDTLFI ILRKKPLMFL HWYHHVLTMN
YAFMSFEANL GFNTWITWMN FSVHSIMYGY YMLRSFGVKV PAWIAKNITT
MQILQFVITH FILFHVGYLA VTGQSVDSTP GYYWFCLLME ISYWLFGNF
YYQSYIKGGG KKFNAEKKTE KKIE*
F
251 LMYYSLATNQ ARYPSNTPAT LQCLSYTLHL L*
G
MAQHPLVQRL LDVKFDTKRF VAIATHGPKN FPDAEGRKFF ADHFDVTIQA
SILYMVVVFG TKWFMRNRQP FQLTIPLNIW NFILAAFSIA GAVKMTPEFF
GTIANKGIVA SYCKVFDFTK GENGYWVWLF MASKLFELVD TIFLVLRKRP
LMFLHV~IYHHI LTMIYAWYSH PLTPGFNRYG IYLNFVVHAF MYSYYFLRSM
KIRVPGFIAQ AITSLQIVQF IISCAVLAHL GYLMHFTNAN CDFEPSVFKL
AVFMDTTYLA LFVNFFLQSY VLRGGKDKYK AVPKKKNN*
H
MSAEVSERFKVWTGNNETIIYSPFEYDSTLLIESCRCTYQLLILLRQI
YYRDIWSHGNLKACDXLLLAWNGFLAVFSIMGTWRFGIEFYDAVFRXG
FIXSICLAVNPRSPSAFWACMFALSKIAEFGDTMFLVLRKRPVIFLHWYHH
AVVLILSWHAAIELTAPGRWFIFMNYLVHSIMYTYYAITSIGYRXPKIVSMT
VTFLQTLQMLIGVSISCIVLYLKLNGEMCQQSYDNLALSFGIYASFLVLSSFF
NNAYLVKKDKKPDVKKD*
ELONGASE.APP.txt SEQUENCE LISTING
<110> the university of Bristol <120> POLYUNSATURATED FATTY ACID (PUFA) ELONGASE FROM CAENORHABDITIS ELEGANS
<130> >1470003 <140> PCT/GB00/01035 <141> March 20, 2000 <160> 22 <170> Patentln Ver. 2.1 <210> 1 <211> 27 <212> DNA
<213> C. elegans <400> 1 gcgggtacca tggctcagca tccgctc 27 <210> 2 <211> 27 <212> DNA
<213> C. elegans <400> 2 gcgggatcct tagttgttct tcttctt 27 <210> 3 <211> 27 <212> DNA
<213> C. elegans <400> 3 gcgggtacca tgccacaggg agaagtc 27 <210> 4 <211> 27 <212> DNA
<213> c. elegans <400> 4 gcgggatcct tattcaattt ttctttt 27 <210> 5 <211> 29 <212> DNA
<213> C. elegans <400> 5 gcgaattcac catgggtacg gaccaagga 2g <210> 6 <211> 27 <212> DNA
ELONGASE.APP.tXt <213> C. ~legans <400> 6 gcggagctcc tactcttcct tgggacg 27 <210> 7 <211> 876 <212> DNA
<213> c. elegans <400> 7 atggagcttg ccgagttctg gaatgatctc aacaccttca ccatctacgg accgaatcac 60 acagatatga ccacaaaata caaatattca tatcacttcc caggtgaaca ggtggcggat 120 ccgcagtatt ggacgatttt attccagaaa tattggtatc attcgatcac aatatcagtt 180 ctttatttca ttttaattaa ggtgattcaa aagtttatgg agaatcgaaa accattcact 240 ttgaaatacc cattgattct ttggaatgga gctcttgcag cattcagtat aattgccaca 300 ttgcggttct ctattgatcc tctacgatca ctatatgctg aaggattcta caaaactctg 360 tgctattcgt gtaatccaac tgatgtggct gcattttgga gctttgcatt cgctctttcc 420 aagattgttg aacttggaga cactatgttc attattttga gaaaacggcc attgatcttt 480 ttacactact atcatcatgc agcagtgtta atctacactg tccattctgg tgccgagcat 540 actgcagctg gtcgtttcta catcctaatg aactacttcg cacattctct catgtatact 600 tactacacag tttctgccat gggatacaga ttaccgaaat gggtatcaat gactgtcaca 660 actgttcaaa caactcaaat gttagctgga gtcggaataa cttggatggt gtacaaagtg 720 aaaactgaat acaagcttcc ttgtcaacaa tccgtagcca atttgtatct cgcattcgtc 780 atctatgtca catttgccat tcttttcatt caattcttcg tcaaggcata cattatcaag 840 tcgtcgaaga agtcgaaatc ggtgaagaac gaataa <210> 8 <211> 1308 <212> DNA
<213> c. elegans <400> 8 atggcaaaat acgactacaa tccgaagtat gggttagaaa attacagcat attccttccc 60 tttgagacat cttttgatgc atttcgatcg acaacatgga tgcaaaatca ctggtatcaa 120 tcaattacag catctgtcgt gtatgtagcc gtcattttta caggaaagaa ggtggttctc 180 atctacaaaa aatcacgagt tattactttt gagtctagcc ttcagaatgc aattaagaat 240 cgaaaccgaa aatcacttaa tagttctcaa atgtttcaga ttatggaaaa gtacaagccc 300 ttccaactgg acacaccact cttcgtctgg aattcatttt tagccatttt ctcaattctc 360 gggttcctcc gaatgacacc tgaatttgta tggagttggt cagcagaagg aaactcattc 420 aaatattcaa tttgtcattc atcttatgct caaggagtca ctggtttctg gactgaacaa 480 ttcgcaatga gcaaactttt cgagctcatc gacacaatct tcatcgttct tcgtaaacgt 540 ccactcatct tccttcactg gtatcatcat gtaactgtta tgatctacac atggcacgcg 600 tacaaggatc acactgcatc aggacggtgg ttcatttgga tgaattatgg agttcatgct 660 cttatgtatt cctactatgc tcttcgttct ctgaaattcc gtcttccaaa acaaatggca 720 atggttgtta ctactctcca acttgctcaa atggttatgg gagtaatcat cggagtcact 780 gtctaccgta tcaagtcatc gggtgaatac tgccaacaga catgggacaa tttgggatta 840 tgctttggag tttatttcac atatttcctt cttttcgcca acttcttcta ccatgcatat 900 gttaagaaaa acaaccgtac agtaaattat gaaaataatt caaaaaattt ccccgatctc 960 gttttaattt acctgagaaa aaaggtttca agaaaatcga aaaatcggca atgttcagaa 1020 aataattata aaattcaatt ttcatcaaat tttgttaatg ttgatggaaa aaaacataag 1080 aaaacatatg aacttattct tccaagaaga aaaatgacca caattttaac ttttctattt 1140 ggaaaaaatc gaattttttc gaaatatcag aaaaatcgaa aaaacatttc gattcctgtt 1200 gatttcgaaa ttctggagcc aaaagaagat atcaatgcta acatcgctga gccatccatc 1260 acaacgaggt ccgccgccgc acgaagaaaa gttcaaaaag ctgattag 1308 <210> 9 <211> 825 <212> DNA
<213> c. elegans ELONGASE.APP.txt <400> 9 atggcagcag cacaaacaag tccagcagcc acgctcgtcg atgttttgac aaaaccatgg 60 agtctggatc agactgattc ttacatgtct acatttgtac cattatccta taaaatcatg 120 attggttatc tcgtcaccat ctacttcggg caaaaattaa tggctcacag aaaaccattc 180 gatctccaaa atacacttgc tctctggaac ttcgggtttt cactgttctc gggaatcgcc 240 gcctataagc ttattccaga actattcgga gttttcatga aggacgggtt tgtcgcttcc 300 tactgtcaaa acgagaacta ctacaccgat gcatcaactg gattctgggg ctgggccttt 360 gtgatgtcga aagctccaga actaggggat actatgttct tggtccttcg taaaaaacca 420 gttatcttca tgcactggta tcatcatgcc ctcacatttg tctacgcagt agtcacatac 480 tctgagcatc aggcatgggc tcgttggtct ttggctctca accttgccgt ccacactgtt 540 atgtatttct acttcgccgt tcgcgccttg aacatccaaa ctccacgccc agtggcaaag 600 ttcatcacta ctattcaaat tgtccaattt gtcatctcat gctacatttt tgggcatttg 660 gtattcatta agtctgctga ttctgttcct ggttgcgctg ttagctggaa tgtgctatcg 720 atcggaggac tcatgtacat cagttatttg ttcctttttg ccaagttctt ctacaaggcc 780 tacattcaaa aacgctcacc aaccaaaacc agcaagcagg agtag g25 <210> 10 <211> 861 <212> DNA
<213> C. elegans <400> 10 atgtcatcgg acgatcgtgg cactagaacc ttcaagatga tggatcaaat tcttggaaca 60 aacttcactt atgaaggtgc caaagaagtt gctcgaggcc ttgaaggttt ctcagcaaag 120 cttgccgtcg gatatattgc cactattttt ggactgaaat attatatgaa agaccgaaaa 180 gccttcgatc tcagtactcc attaaacatt tggaatggta ttctttcgac attcagctta 240 ttgggattct tattcacttt tcctactttg ttatcagtta tcagaaagga tggatttagt 300 cacacctatt cccatgtctc tgagctttac actgacagta cctctggata ttggatcttc 360 ctttgggtta tctcaaagat tccggaactt ttggatacag tattcattgt tcttcgcaag 420 agaccactta ttttcatgca ctggtaccat cacgcattga ccggttacta tgctcttgtc 480 tgctaccatg aggatgctgt ccatatggtt tgggttgtat ggatgaatta tattattcat 540 gcattcatgt atggatacta tcttctgaaa tctctgaaag ttccaattcc accatcagtt 600 gctcaagcaa tcaccacatc tcaaatggtt caattcgcag ttgccatttt cgcacaagtt 660 catgtttcct ataaacacta tgttgaggga gttgaaggat tagcctactc gttcagagga 720 acagctatcg gatttttcat gcttactacc tacttctatc tatggattca attctacaaa 780 gagcactatc ttaagaatgg aggcaaaaag tacaatttgg caaaggatca ggcaaaaact 840 caaacaaaga aggctaacta a 861 <210> 11 <211> 825 <212> DNA
<213> C. elegans <400> 11 atgccacagg gagaagtctc attctttgag gtgctgacaa ctgctccatt cagtcatgag 60 ctctcaaaaa agcatattgc acagactcag tatgctgctt tctggatctc aatggcatat 120 gttgtcgtta tttttgggct caaggctgtc atgacaaacc gaaaaccatt tgatctcacg 180 ggaccactga atctctggaa tgcgggtctt gctattttct caactctcgg atcacttgcc 240 actacatttg gacttctcca cgagttcttc agccgtggat ttttcgaatc ttacattcac 300 atcggagact tttataatgg actttctgga atgttcacat ggcttttcgt tctctcaaaa 360 gttgctgaat tcggagatac actttttatt attcttcgta aaaagccatt gatgttcctt 420 cattggtatc atcatgtgct tacaatgaat tatgctttta tgtcatttga agctaatttg 480 ggatttaata cttggattac atggatgaat ttctcagttc actcaattat gtatggatat 540 tatatgcttc gttcttttgg tgtcaaggtt ccagcatgga ttgccaagaa tattacaaca 600 atgcaaattc ttcaattcgt tattactcat ttcattcttt tccacgttgg atatttggca 660 gttactggac aatctgttga ctcaactcca ggatattatt ggttctgcct tctcatggaa 720 atctcttatg tcgttctgtt cggaaacttc tactatcaat catacatcaa gggaggtggc 780 aagaagttta atgcagagaa gaagactgaa aagaaaattg aataa 825 ELONGASE.APP.tXt <210> 12 <211> 846 <212> DNA
<213> C. elegans <400> 12 atgtatttga attatttcgc gacggaaatc ttccatcgta gtgcggtttg tgaaacagaa 60 gcttgtcgct cgtcaaaaat aatgattgct gacgtgttca aatggaaatt cgatgcaaac 120 gaattgtgga gtcttttaac gaatcaggat gaagttttcc cgcatattag agcacggcga 180 ttcattcaag aacattttgg tctattcgtc cagatggcaa ttgcatatgt cattttggtg 240 ttctcaatca aaaggttcat gagggatcgt gaaccatttc aactcaccac agctcttcgt 300 ctctggaact tcttcctctc cgtcttctca atttatggtt cctggacaat gtttccattt 360 atggttcaac aaataagact ttatggtctc tacggatgtg gatgcgaagc actttcaaac 420 cttccgagtc aagcagaata ttggcttttc ctgacgatct tgtccaaagc tgtggagttt 480 gttgatacat ttttcttggt tctccggaaa aaaccactca tcttcctaca ctggtatcat 540 catatggcaa catttgtctt cttctgcagt aattacccga ctccatcgtc acaatcacgc 600 gtcggagtta tcgtcaacct gttcgtgcat gccttcatgt acccatacta tttcacccga 660 tcaatgaaca tcaaagttcc tgcgaaaatt tcaatggctg ttacagttct tcaattgact 720 caattcatgt gctttatcta tggatgtact ctcatgtact actcgttggc cactaatcag 780 gcacgatacc cctcaaatac acctgcgaca ctccaatgtt tgtcctacac tctacatttg 840 ctttga 846 <210> 13 <211> 866 <212> DNA
<213> C. elegans <400> 13 atggctcagc atccgctcgt tcaacggctt ctcgatgtca aattcgacac gaaacgattt 60 gtggctattg ctactcatgg gccaaagaat ttccctgacg cagaaggtcg caagttcttt 120 gctgatcact ttgatgttac tattcaggct tcaatcctgt acatggtcgt tgtgttcgga 180 acaaaatggt tcatgcgtaa tcgtcaacca ttccaattga ctattccact caacatctgg 240 aatttcatcc tcgccgcatt ttccatcgca ggagctgtca aaatgacccc agagttcttt 300 ggaaccattg ccaacaaagg aattgtcgat cctactgcaa agtgtttgat ttcacgaaag 360 gagagaatgg atactgggtg tggctcttca tggcttccaa acttttcgaa cttgttgaca 420 ccatcttctt ggttctccgt aaacgtccac tcatgttcct tcactggtat caccatattc 480 tcaccatgat ctacgcctgg tactctcatc cattgacccc aggattcaac agatacggaa 540 tttatcttaa ctttgtcgtc cacgccttca tgtactctta ctacttcctt cgctcgatga 600 agattcgcgt gccaggattc atcgcccaag ctatcacatc tcttcaaatc gttcaattca 660 tcatctcttg cgccgttctt gctcatcttg gttatctcat gcacttcacc aatgccaact 720 gtgatttcga gccatcagta ttcaagctcg cagttttcat ggacacaaca tacttggctc 780 ttttcgtcaa cttcttcctc caatcatatg ttctccgcgg aggaaaagac aagtacaagg 840 cagtgccaaa gaagaagaac aactaa 866 <210> 14 <211> 801 <212> DNA
<213> C. elegans <400> 14 atgtcggccg aagtgtccga acgattcaaa gtttggacag gaaacaatga gaccatcatc 60 tattccccat tcgagtacga ttccacgttg ctcatcgagt catgtcggtg tacttatcag 120 ctgcttatat tattgcgaca aatttattac agagatatat ggagtcacgg aaacctaaaa 180 cttttactag catggaacgg ttttttggca gtgttcagta ttatgggtac atggagattt 240 ggaatcgaat tctacgatgc tgttttcaga agaggcttca tcgattcgat ctgcctggct 300 gtaaatccac gttcaccgtc cgcattctgg gcatgcatgt tcgctctatc gaaaatcgcc 360 gagtttgggg acacgatgtt cttggtgctg aggaaacggc cggttatatt ccttcactgg 420 tatcatcacg ctgttgttct gatcctttct tggcatgctg caatcgaact cacagctcca 480 ggacgctggt ttatttttat gaactatttg gtgcattcaa taatgtatac atactacgca 540 ataacatcaa tcggctatcg tcttcccaaa atcgtttcaa tgactgttac attccttcaa 600 actcttcaaa tgctcattgg tgtcagcatt tcttgcattg tgctttattt gaagcttaat 660 ELONGASE.APP.tXt ggagagatgt gccaacaatc ctacgacaat ctggcgttga gcttcggaat ctacgcctca 720 ttcctggtgc tattctccag tttcttcaac aatgcatatt tggtaaaaaa ggacaagaaa 780 cccgatgtga agaaggatta a <210>
<211>
<212>
PRT
<213> elegans C.
<400>
MetGlu LeuAla GluPheTrpAsn AspLeuAsn ThrPheThr IleTyr GlyPro AsnHis ThrAspMetThr ThrLysTyr LysTyrSer TyrHis PhePro GlyGlu GlnValAlaAsp ProGlnTyr TrpThrIle LeuPhe GlnLys TyrTrp TyrHisSerIle ThrIleSer ValLeuTyr PheIle LeuIle LysVal IleGlnLysPhe MetGluAsn ArgLysPro PheThr LeuLys TyrPro LeuIleLeuTrp AsnGlyAla LeuAlaAla PheSer IleIle AlaThr LeuArgPheSer IleAspPro LeuArgSer LeuTyr AlaGlu GlyPhe TyrLysThrLeu CysTyrSer CysAsnPro ThrAsp ValAla AlaPhe TrpSerPheAla PheAlaLeu SerLysIle ValGlu LeuGly AspThr MetPheIleIle LeuArgLys ArgProLeu IlePhe LeuHis TyrTyr HisHisAlaAla ValLeuIle TyrThrVal HisSer GlyAla GluHis ThrAlaAlaGly ArgPheTyr IleLeuMet AsnTyr PheAla HisSer LeuMetTyrThr TyrTyrThr ValSerAla MetGly TyrArg LeuPro LysTrpValSer MetThrVal ThrThrVal GlnThr ThrGln MetLeu AlaGlyValGly IleThrTrp MetValTyr LysVal LysThr GluTyr LysLeuProCys GlnGlnSer ValAlaAsn LeuTyr LeuAla PheVal IleTyrValThr PheAlaIle LeuPheIle GlnPhe PheVal LysAla TyrIleIleLys SerSerLys LysSerLys SerVal ELONGASE.APP.tXt ' 2?5 280 285 Lys Asn Glu <210>
<211>
<212>
PRT
<213> elegan s c.
<400>
MetAla LysTyrAsp TyrAsnPro LysTyrGly LeuGluAsn TyrSer IlePhe LeuProPhe GluThrSer PheAspAla PheArgSer ThrThr TrpMet GlnAsnHis TrpTyrGln SerIleThr AlaSerVal ValTyr ValAla ValIlePhe ThrGlyLys LysValVal LeuIleTyr LysLys SerArg ValIleThr PheGluSer SerLeuGln AsnAlaIle LysAsn ArgAsn ArgLysSer LeuAsnSer SerGlnMet PheGlnIle MetGlu LysTyr LysProPhe GlnLeuAsp ThrProLeu PheValTrp AsnSer PheLeu AlaIlePhe SerIleLeu GlyPheLeu ArgMetThr ProGlu PheVal TrpSerTrp SerAlaGlu GlyAsnSer PheLysTyr SerIle cysHis SerSerTyr AlaGlnGly ValThrGly PheTrpThr GluGln PheAla MetSerLys LeuPheGlu LeuIleAsp ThrIlePhe IleVal LeuArg LysArgPro LeuIlePhe LeuHisTrp TyrHisHis ValThr ValMet IleTyrThr TrpHisAla TyrLysAsp HisThrAla SerGly ArgTrp PheIleTrp MetAsnTyr GlyValHis AlaLeuMet TyrSer TyrTyr AlaLeuArg SerLeuLys PheArgLeu ProLysGln MetAla MetVal ValThrThr LeuGlnLeu AlaGlnMet ValMetGly ValIle IleGly ValThrVal TyrArgIle LysSerSer GlyGluTyr cysGln S
ELONGASE.APP.tXt Gln Thr Trp Asp Asn Leu Gly Leu Cys Phe Gly Val Tyr Phe Thr Tyr Phe Leu Leu Phe Ala Asn Phe Phe Tyr His Ala Tyr Val Lys Lys Asn Asn Arg Thr Val Asn Tyr Glu Asn Asn Ser Lys Asn Phe Pro Asp Leu Val Leu Ile Tyr Leu Arg Lys Lys Val Ser Arg Lys Ser Lys Asn Arg Gln Cys Ser Glu Asn Asn Tyr Lys Ile Gln Phe Ser Ser Asn Phe Val Asn Val Asp Gly Lys Lys His Lys Lys Thr Tyr Glu Leu Ile Leu Pro Arg Arg Lys Met Thr Thr Ile Leu Thr Phe Leu Phe Gly Lys Asn Arg Ile Phe Ser Lys Tyr Gln Ly5 Asn Arg Lys Asn Ile Ser Ile Pro Val Asp Phe Glu Ile Leu Glu Pro Lys Glu Asp Ile Asn Ala Asn Ile Ala Glu Pro Ser Ile Thr Thr Arg Ser Ala Ala Ala Arg Arg Lys Val Gln Lys Ala Asp <210> 17 <211> 274 <212> PRT
<213> C. elegans <400> 17 Met Ala Ala Ala Gln Thr Ser Pro Ala Ala Thr Leu Val Asp Val Leu Thr Lys Pro Trp Ser Leu Asp Gln Thr Asp Ser Tyr Met Ser Thr Phe Val Pro Leu Ser Tyr Lys Ile Met Ile Gly Tyr Leu Val Thr Ile Tyr Phe Gly Gln Lys Leu Met Ala His Arg Lys Pro Phe Asp Leu Gln Asn Thr Leu Ala Leu Trp Asn Phe Gly Phe Ser Leu Phe Ser Gly Ile Ala Ala Tyr Lys Leu Ile Pro Glu Leu Phe Gly Val Phe Met Lys Asp Gly Phe Val Ala Ser Tyr Cys Gln Asn Glu Asn Tyr Tyr Thr Asp Ala Ser ELONGASE.APP.txt Thr Gly Phe Trp Gly Trp Ala Phe Val Met Ser Lys Ala Pro Glu Leu Gly Asp Thr Met Phe Leu Val Leu Arg Lys Lys Pro Val Ile Phe Met His Trp Tyr His His Ala Leu Thr Phe Val Tyr Ala Val Val Thr Tyr Ser Glu His Gln Ala Trp Ala Arg Trp Ser Leu Ala Leu Asn Leu Ala Val His Thr Val Met Tyr Phe Tyr Phe Ala Val Arg Ala Leu Asn Ile Gln Thr Pro Arg Pro Val Ala Lys Phe Ile Thr Thr Ile Gln Ile Val Gln Phe Val Ile Ser Cys Tyr Ile Phe Gly His Leu Val Phe Ile Lys Ser Ala Asp Ser Val Pro Gly Cys Ala Val Ser Trp Asn Val Leu Ser Ile Gly Gly Leu Met Tyr Ile Ser Tyr Leu Phe Leu Phe Ala Lys Phe Phe Tyr Lys Ala Tyr Ile Gln Lys Arg Ser Pro Thr Lys Thr Ser Lys Gln Glu <210> 18 <211> 286 <212> PRT
<213> C. elegans <400> 18 Met Ser Ser Asp Asp Arg Gly Thr Arg Thr Phe Lys Met Met Asp Gln Ile Leu Gly Thr Asn Phe Thr Tyr Glu Gly Ala Lys Glu Val Ala Arg Gly Leu Glu Gly Phe Ser Ala Lys Leu Ala Val Gly Tyr Ile Ala Thr Ile Phe Gly Leu Lys Tyr Tyr Met Lys Asp Arg Lys Ala Phe Asp Leu Ser Thr Pro Leu Asn Ile Trp Asn Gly Ile Leu Ser Thr Phe Ser Leu Leu Gly Phe Leu Phe Thr Phe Pro Thr Leu Leu Ser Val Ile Arg Lys Asp Gly Phe Ser His Thr Tyr Ser His Val Ser Glu Leu Tyr Thr Asp Ser Thr Ser Gly Tyr Trp Ile Phe Leu Trp Val Ile Ser Lys Ile Pro ELONGASE.APP.tXt ' lI5 120 125 Glu Leu Leu Asp Thr Val Phe Ile Val Leu Arg Lys Arg Pro Leu Ile Phe Met His Trp Tyr His His Ala Leu Thr Gly Tyr Tyr Ala Leu Val Cys Tyr His Glu Asp Ala Val His Met Val Trp Val Val Trp Met Asn Tyr Ile Ile His Ala Phe Met Tyr Gly Tyr Tyr Leu Leu Lys Ser Leu Lys Val Pro Ile Pro Pro Ser Val Ala Gln Ala Ile Thr Thr Ser Gln Met Val Gln Phe Ala Val Ala Ile Phe Ala Gln Val His Val Ser Tyr Lys His Tyr Val Glu Gly Val Glu Gly Leu Ala Tyr Ser Phe Arg Gly Thr Ala Ile Gly Phe Phe Met Leu Thr Thr Tyr Phe Tyr Leu Trp Ile Gln Phe Tyr Lys Glu His Tyr Leu Lys Asn Gly Gly Lys Lys Tyr Asn Leu Ala Lys Asp Gln Ala Lys Thr Gln Thr Lys Lys Ala Asn <210> 19 <211> 274 <212> PRT
<213> C. elegans <400> 19 Met Pro Gln Gly Glu Val Ser Phe Phe Glu Val Leu Thr Thr Ala Pro Phe Ser His Glu Leu Ser Lys Lys His Ile Ala Gln Thr Gln Tyr Ala Ala Phe Trp Ile Ser Met Ala Tyr Val Val Val Ile Phe Gly Leu Lys Ala Val Met Thr Asn Arg Lys Pro Phe Asp Leu Thr Gly Pro Leu Asn Leu Trp Asn Ala Gly Leu Ala Ile Phe Ser Thr Leu Gly Ser Leu Ala Thr Thr Phe Gly Leu Leu His Glu Phe Phe Ser Arg Gly Phe Phe Glu Ser Tyr Ile His Ile Gly Asp Phe Tyr Asn Gly Leu Ser Gly Met Phe Thr Trp Leu Phe Val Leu Ser Lys Val Ala Glu Phe Gly Asp Thr Leu ELONGASE.APP.tXt Phe Ile Ile Leu Arg Lys Lys Pro Leu Met Phe Leu His Trp Tyr His His Val Leu Thr Met Asn Tyr Ala Phe Met Ser Phe Glu Ala Asn Leu Gly Phe Asn Thr Trp Ile Thr Trp Met Asn Phe Ser Val His Ser Ile Met Tyr Gly Tyr Tyr Met Leu Arg Ser Phe Gly Val Lys Val Pro Ala Trp Ile Ala Lys Asn Ile Thr Thr Met Gln Ile Leu Gln Phe Val Ile Thr His Phe Ile Leu Phe His Val Gly Tyr Leu Ala Val Thr Gly Gln Ser Val Asp Ser Thr Pro Gly Tyr Tyr Trp Phe Cys Leu Leu Met Glu Ile Ser Tyr Val Val Leu Phe Gly Asn Phe Tyr Tyr Gln Ser Tyr Ile Lys Gly Gly Gly Lys Lys Phe Asn Ala Glu Lys Lys Thr Glu Lys Lys Ile Glu <210> 20 <211> 281 <212> PRT
<213> C. elegans <400> 20 Met Tyr Leu Asn Tyr Phe Ala Thr Glu Ile Phe His Arg Ser Ala Val Cys Glu Thr Glu Ala Cys Arg Ser Ser Lys Ile Met Ile Ala Asp Val Phe Lys Trp Lys Phe Asp Ala Asn Glu Leu Trp Ser Leu Leu Thr Asn Gln Asp Glu Val Phe Pro His Ile Arg Ala Arg Arg Phe Ile Gln Glu His Phe Gly Leu Phe Val Gln Met Ala Ile Ala Tyr Val Ile Leu Val Phe Ser Ile Lys Arg Phe Met Arg Asp Arg Glu Pro Phe Gln Leu Thr Thr Ala Leu Arg Leu Trp Asn Phe Phe Leu Ser Val Phe Ser Ile Tyr Gly Ser Trp Thr Met Phe Pro Phe Met Val Gln Gln Ile Arg Leu Tyr ELONGASE.APP.tXt Gly Leu Tyr Gly Cys Gly Cys Glu Ala Leu Ser Asn Leu Pro Ser Gln Ala Glu Tyr Trp Leu Phe Leu Thr Ile Leu Ser Lys Ala Val Glu Phe Val Asp Thr Phe Phe Leu Val Leu Arg Lys Lys Pro Leu Ile Phe Leu His Trp Tyr His His Met Ala Thr Phe Val Phe Phe Cys Ser Asn Tyr Pro Thr Pro Ser Ser Gln Ser Arg Val Gly Val Ile Val Asn Leu Phe Val His Ala Phe Met Tyr Pro Tyr Tyr Phe Thr Arg Ser Met Asn Ile Lys Val Pro Ala Lys Ile Ser Met Ala Val Thr Val Leu Gln Leu Thr Gln Phe Met Cys Phe Ile Tyr Gly Cys Thr Leu Met Tyr Tyr Ser Leu Ala Thr Asn Gln Ala Arg Tyr Pro Ser Asn Thr Pro Ala Thr Leu Gln Cys Leu Ser Tyr Thr Leu His Leu Leu <210> 21 <211> 288 <212> PRT
<213> C. elegans <400> 21 Met Ala Gln His Pro Leu Val Gln Arg Leu Leu Asp Val Lys Phe Asp Thr Lys Arg Phe Val Ala Ile Ala Thr His Gly Pro Lys Asn Phe Pro Asp Ala Glu Gly Arg Lys Phe Phe Ala Asp His Phe Asp Val Thr Ile Gln Ala Ser Ile Leu Tyr Met Val Val Val Phe Gly Thr Lys Trp Phe Met Arg Asn Arg Gln Pro Phe Gln Leu Thr Ile Pro Leu Asn Ile Trp Asn Phe Ile Leu Ala Ala Phe Ser Ile Ala Gly Ala Val Lys Met Thr Pro Glu Phe Phe Gly Thr Ile Ala Asn Lys Gly Ile Val Ala Ser Tyr Cys Lys Val Phe Asp Phe Thr Lys Gly Glu Asn Gly Tyr Trp Val Trp Leu Phe Met Ala Ser Lys Leu Phe Glu Leu Val Asp Thr Ile Phe Leu ELONGASE.APP.tXt 130 ' 135 140 Val Leu Arg Lys Arg Pro Leu Met Phe Leu His Trp Tyr His His Ile Leu Thr Met Ile Tyr Ala Trp Tyr Ser His Pro Leu Thr Pro Gly Phe Asn Arg Tyr Gly Ile Tyr Leu Asn Phe Val Val His Ala Phe Met Tyr Ser Tyr Tyr Phe Leu Arg Ser Met Lys Ile Arg Val Pro Gly Phe Ile Ala Gln Ala Ile Thr Ser Leu Gln Ile Val Gln Phe Ile Ile Ser Cys Ala Val Leu Ala His Leu Gly Tyr Leu Met His Phe Thr Asn Ala Asn Cys Asp Phe Glu Pro Ser Val Phe Lys Leu Ala Val Phe Met Asp Thr Thr Tyr Leu Ala Leu Phe Val Asn Phe Phe Leu Gln Ser Tyr Val Leu Arg Gly Gly Lys Asp Lys Tyr Lys Ala Val Pro Lys Lys Lys Asn Asn <210> 22 <211> 269 <212> PRT
<213> C. elegans <400> 22 Met Ser Ala Glu Val Ser Glu Arg Phe Lys Val Trp Thr Gly Asn Asn Glu Thr Ile Ile Tyr Ser Pro Phe Glu Tyr Asp Ser Thr Leu Leu Ile Glu Ser Cys Arg Cys Thr Tyr Gln Leu Leu Ile Leu Leu Arg Gln Ile Tyr Tyr Arg Asp Ile Trp Ser His Gly Asn Leu Lys Ala Cys Asp Xaa Leu Leu Leu Ala Trp Asn Gly Phe Leu Ala Val Phe Ser Ile Met Gly Thr Trp Arg Phe Gly Ile Glu Phe Tyr Asp Ala Val Phe Arg Xaa Gly Phe Ile Xaa Ser Ile Cys Leu Ala Val Asn Pro Arg Ser Pro Ser Ala Phe Trp Ala Cys Met Phe Ala Leu Ser Lys Ile Ala Glu Phe Gly Asp ELONGASE.APP.tXt Thr Met PheLeu ValLeuArg LysArgProVal IlePheLeu HisTrp Tyr His HisAla ValValLeu IleLeuSerTrp HisAlaAla IleGlu Leu Thr AlaPro GlyArgTrp PheIlePheMet AsnTyrLeu ValHis Ser Ile MetTyr ThrTyrTyr AlaIleThrSer IleGlyTyr ArgXaa Pro Lys IleVal SerMetThr ValThrPheLeu GlnThrLeu GlnMet Leu Ile GlyVal SerIleSer CysIleValLeu TyrLeuLys LeuAsn Gly Glu MetCys GlnGlnSer TyrAspAsnLeu AlaLeuSer PheGly Ile Tyr AlaSer PheLeuVal LeuSerSerPhe PheAsnAsn AlaTyr Leu Val LysLys AspLysLys ProAspValLys LysAsp gcgggtacca~ggctcagcatccgctco PRIMER1.GBS.tXt PRIMER2.GBS.tXt gcgggatcct~agttgttcttcttctto PRIMER3.GBS.tXt gcgggtaccatgccacagggagaagtco PRIMER4.GBS.tXt gcgggatccttattcaatttttctttto PRIMERS.GBS.tXt gcgaattcaccatgggtacggaccaagga~
PRIMER6.GBS.tXt gcggagctcctactcttccttgggacgo SEQID1.GBS.tXt atggagcttgccgagttctggaatgatctcaacaccttcaccatctacggaccgaatcac acagatatgaccacaaaatacaaatattcatatcacttcccaggtgaacaggtggcggat ccgcagtattggacgattttattccagaaatattggtatcattcgatcacaatatcagtt ctttatttcattttaattaaggtgattcaaaagtttatggagaatcgaaaaccattcact ttgaaatacccattgattctttggaatggagctcttgcagcattcagtataattgccaca ttgcggttctctattgatcctctacgatcactatatgctgaaggattctacaaaactctg tgctattcgtgtaatccaactgatgtggctgcattttggagctttgcattcgctctttcc aagattgttgaacttggagacactatgttcattattttgagaaaacggccattgatcttt ttacactactatcatcatgcagcagtgttaatctacactgtccattctggtgccgagcat actgcagctggtcgtttctacatcctaatgaactacttcgcacattctctcatgtatact tactacacagtttctgccatgggatacagattaccgaaatgggtatcaatgactgtcaca actgttcaaacaactcaaatgttagctggagtcggaataacttggatggtgtacaaagtg aaaactgaatacaagcttccttgtcaacaatccgtagccaatttgtatctcgcattcgtc atctatgtcacatttgccattcttttcattcaattcttcgtcaaggcatacattatcaag tcgtcgaagaagtcgaaatcggtgaagaacgaataao SEQID2.GBS.tXt atggcaaaatacgactacaatccgaagtatgggttagaaaattacagcatattccttccc tttgagacatcttttgatgcatttcgatcgacaacatggatgcaaaatcactggtatcaa tcaattacagcatctgtcgtgtatgtagccgtcatttttacaggaaagaaggtggttctc atctacaaaaaatcacgagttattacttttgagtctagccttcagaatgcaattaagaat cgaaaccgaaaatcacttaatagttctcaaatgtttcagattatggaaaagtacaagccc ttccaactggacacaccactcttcgtctggaattcatttttagccattttctcaattctc gggttcctccgaatgacacctgaatttgtatggagttggtcagcagaaggaaactcattc aaatattcaatttgtcattcatcttatgctcaaggagtcactggtttctggactgaacaa ttcgcaatgagcaaacttttcgagctcatcgacacaatcttcatcgttcttcgtaaacgt ccactcatcttccttcactggtatcatcatgtaactgttatgatctacacatggcacgcg tacaaggatcacactgcatcaggacggtggttcatttggatgaattatggagttcatgct cttatgtattcctactatgctcttcgttctctgaaattccgtcttccaaaacaaatggca atggttgttactactctccaacttgctcaaatggttatgggagtaatcatcggagtcact gtctaccgtatcaagtcatcgggtgaatactgccaacagacatgggacaatttgggatta tgctttggagtttatttcacatatttccttcttttcgccaacttcttctaccatgcatat gttaagaaaaacaaccgtacagtaaattatgaaaataattcaaaaaatttccccgatctc gttttaatttacctgagaaaaaaggtttcaagaaaatcgaaaaatcggcaatgttcagaa aataattataaaattcaattttcatcaaattttgttaatgttgatggaaaaaaacataag aaaacatatgaacttattcttccaagaagaaaaatgaccacaattttaacttttctattt ggaaaaaatcgaattttttcgaaatatcagaaaaatcgaaaaaacatttcgattcctgtt gatttcgaaattctggagccaaaagaagatatcaatgctaacatcgctgagccatccatc acaacgaggtccgccgccgcacgaagaaaagttcaaaaagctgattago SEQID3.GBS.txt atggcagcagcacaaacaagtccagcagccacgctcgtcgatgttttgacaaaaccatgg agtctggatcagactgattcttacatgtctacatttgtaccattatcctataaaatcatg attggttatctcgtcaccatctacttcgggcaaaaattaatggctcacagaaaaccattc gatctccaaaatacacttgctctctggaacttcgggttttcactgttctcgggaatcgcc gcctataagcttattccagaactattcggagttttcatgaaggacgggtttgtcgcttcc tactgtcaaaacgagaactactacaccgatgcatcaactggattctggggctgggccttt gtgatgtcgaaagctccagaactaggggatactatgttcttggtccttcgtaaaaaacca gttatcttcatgcactggtatcatcatgccctcacatttgtctacgcagtagtcacatac tctgagcatcaggcatgggctcgttggtctttggctctcaaccttgccgtccacactgtt atgtatttctacttcgccgttcgcgccttgaacatccaaactccacgcccagtggcaaag ttcatcactactattcaaattgtccaatttgtcatctcatgctacatttttgggcatttg gtattcattaagtctgctgattctgttcctggttgcgctgttagctggaatgtgctatcg atcggaggactcatgtacatcagttatttgttcctttttgccaagttcttctacaaggcc tacattcaaaaacgctcaccaaccaaaaccagcaagcaggagtago atgtcatcgg'acgatc t SEQzo4.GBS.txt g ggcactagaaccttcaagatgatggatcaaattcttggaaca aacttcacttatgaaggtgccaaagaagttgctcgaggccttgaaggtttctcagcaaag cttgccgtcggatatattgccactatttttggactgaaatattatatgaaagaccgaaaa gccttcgatctcagtactccattaaacatttggaatggtattctttcgacattcagctta ttgggattcttattcacttttcctactttgttatcagttatcagaaaggatggatttagt cacacctattcccatgtctctgagctttacactgacagtacctctggatattggatcttc ctttgggttatctcaaagattccggaacttttggatacagtattcattgttcttcgcaag agaccacttattttcatgcactggtaccatcacgcattgaccggttactatgctcttgtc tgctaccatgaggatgctgtccatatggtttgggttgtatggatgaattatattattcat gcattcatgtatggatactatcttctgaaatctctgaaagttccaattccaccatcagtt gctcaagcaatcaccacatctcaaatggttcaattcgcagttgccattttcgcacaagtt catgtttcctataaacactatgttgagggagttgaaggattagcctactcgttcagagga acagctatcggatttttcatgcttactacctacttctatctatggattcaattctacaaa gagcactatcttaagaatggaggcaaaaagtacaatttggcaaaggatcaggcaaaaact caaacaaagaaggctaactaao SEQIDS.GBS.txt atgccacagggagaagtctcattctttgaggtgctgacaactgctccattcagtcatgag ctctcaaaaaagcatattgcacagactcagtatgctgctttctggatctcaatggcatat gttgtcgttatttttgggctcaaggctgtcatgacaaaccgaaaaccatttgatctcacg ggaccactgaatctctggaatgcgggtcttgctattttctcaactctcggatcacttgcc actacatttggacttctccacgagttcttcagccgtggatttttcgaatcttacattcac atcggagacttttataatggactttctggaatgttcacatggcttttcgttctctcaaaa gttgctgaattcggagatacactttttattattcttcgtaaaaagccattgatgttcctt cattggtatcatcatgtgcttacaatgaattatgcttttatgtcatttgaagctaatttg ggatttaatacttggattacatggatgaatttctcagttcactcaattatgtatggatat tatatgcttcgttcttttggtgtcaaggttccagcatggattgccaagaatattacaaca atgcaaattcttcaattcgttattactcatttcattcttttccacgttggatatttggca gttactggacaatctgttgactcaactccaggatattattggttctgccttctcatggaa atctcttatgtcgttctgttcggaaacttctactatcaatcatacatcaagggaggtggc aagaagtttaatgcagagaagaagactgaaaagaaaattgaataaa SEQID6.GBS.txt atgtatttgaattatttcgcgacggaaatcttccatcgtagtgcggtttgtgaaacagaa gcttgtcgctcgtcaaaaataatgattgctgacgtgttcaaatggaaattcgatgcaaac gaattgtggagtcttttaacgaatcaggatgaagttttcccgcatattagagcacggcga ttcattcaagaacattttggtctattcgtccagatggcaattgcatatgtcattttggtg ttctcaatcaaaaggttcatgagggatcgtgaaccatttcaactcaccacagctcttcgt ctctggaacttcttcctctccgtcttctcaatttatggttcctggacaatgtttccattt atggttcaacaaataagactttatggtctctacggatgtggatgcgaagcactttcaaac cttccgagtcaagcagaatattggcttttcctgacgatcttgtccaaagctgtggagttt gttgatacatttttcttggttctccggaaaaaaccactcatcttcctacactggtatcat catatggcaacatttgtcttcttctgcagtaattacccgactccatcgtcacaatcacgc gtcggagttatcgtcaacctgttcgtgcatgccttcatgtacccatactatttcacccga tcaatgaacatcaaagttcctgcgaaaatttcaatggctgttacagttcttcaattgact caattcatgtgctttatctatggatgtactctcatgtactactcgttggccactaatcag gcacgatacccctcaaatacacctgcgacactccaatgtttgtcctacactctacatttg ctttgao SEQID7.GBS.txt atggctcagcatccgctcgttcaacggcttctcgatgtcaaattcgacacgaaacgattt gtggctattgctactcatgggccaaagaatttccctgacgcagaaggtcgcaagttcttt gctgatcactttgatgttactattcaggcttcaatcctgtacatggtcgttgtgttcgga acaaaatggttcatgcgtaatcgtcaaccattccaattgactattccactcaacatctgg aatttcatcctcgccgcattttccatcgcaggagctgtcaaaatgaccccagagttcttt ggaaccattgccaacaaaggaattgtcgatcctactgcaaagtgtttgatttcacgaaag gagagaatggatactgggtgtggctcttcatggcttccaaacttttcgaacttgttgaca ccatcttcttggttctccgtaaacgtccactcatgttccttcactggtatcaccatattc tcaccatgatctacgcctggtactctcatccattgaccccaggattcaacagatacggaa tttatcttaactttgtcgtccacgccttcatgtactcttactacttccttcgctcgatga agattcgcgtgccaggattcatcgcccaagctatcacatctcttcaaatcgttcaattca tcatctcttgcgccgttcttgctcatcttggttatctcatgcacttcaccaatgccaact gtgatttcgagccatcagtattcaagctcgcagttttcatggacacaacatacttggctc ttttcgtcaacttcttcctccaatcatatgttctccgcggaggaaaagacaagtacaagg cagtgccaaagaagaagaacaactaa~
SEQID8.GBS.tXt atgtcggccgaagtgtccgaacgattcaaagtttggacaggaaacaatgagaccatcatc tattccccattcgagtacgattccacgttgctcatcgagtcatgtcggtgtacttatcag ctgcttatattattgcgacaaatttattacagagatatatggagtcacggaaacctaaaa cttttactagcatggaacggttttttggcagtgttcagtattatgggtacatggagattt ggaatcgaattctacgatgctgttttcagaagaggcttcatcgattcgatctgcctggct gtaaatccacgttcaccgtccgcattctgggcatgcatgttcgctctatcgaaaatcgcc gagtttggggacacgatgttcttggtgctgaggaaacggccggttatattccttcactgg tatcatcacgctgttgttctgatcctttcttggcatgctgcaatcgaactcacagctcca ggacgctggtttatttttatgaactatttggtgcattcaataatgtatacatactacgca ataacatcaatcggctatcgtcttcccaaaatcgtttcaatgactgttacattccttcaa actcttcaaatgctcattggtgtcagcatttcttgcattgtgctttatttgaagcttaat ggagagatgtgccaacaatcctacgacaatctggcgttgagcttcggaatctacgcctca ttcctggtgctattctccagtttcttcaacaatgcatatttggtaaaaaaggacaagaaa cccgatgtgaagaaggattaao SEQID9.GBS.txt MELAEFWNDL~ITFTIYGPNHTDMTTKYKYSYHFPGEQVADPQYWTILFQKYWYHSITISV
LYFILIKVIQKFMENRKPFTLKYPLILWNGALAAFSIIATLRFSIDPLRSLYAEGFYKTL
CYSCNPTDVAAFWSFAFALSKIVELGDTMFIILRKRPLIFLHYYHHAAVLIYTVHSGAEH
TAAGRFYILMNYFAHSLMYTYYTVSAMGYRLPKWVSMTVTTVQTTQMLAGVGITWMVYKV
KTEYKLPCQQSVANLYLAFVIYVTFAILFIQFFVKAYIIKSSKKSKSVKNEO
SEQID10.GBS.tXt MAKYDYNPKItGLENYSIFLPFETSFDAFRSTTWMQNHWYQSITASVVYVAVIFTGKKWL
IYKKSRVITFESSLQNAIKNRNRKSLNSSQMFQIMEKYKPFQLDTPLFVWNSFLAIFSIL
GFLRMTPEFVWSWSAEGNSFKYSICHSSYAQGVTGFWTEQFAMSKLFELIDTIFIVLRKR
PLIFLHWYHHVTVMIYTWHAYKDHTASGRWFIWMNYGVHALMYSYYALRSLKFRLPKQMA
MVVTTLQLAQMVMGVIIGVTVYRIKSSGEYCQQTWDNLGLCFGVYFTYFLLFANFFYHAY
VKKNNRTVNYENNSKNFPDLVLIYLRKKVSRKSKNRQCSENNYKIQFSSNFVNVDGKKHK
KTYELILPRRKMTTILTFLFGKNRIFSKYQKNRKNISIPVDFEILEPKEDINANIAEPSI
TTRSAAARRKVQKADD
SEQID11.GBS.tXt MAAAQTSPAA'fLVDVLTKPWSLDQTDSYMSTFVPLSYKIMIGYLVTIYFGQKLMAHRKPF
DLQNTLALWNFGFSLFSGIAAYKLIPELFGVFMKDGFVASYCQNENYYTDASTGFWGWAF
VMSKAPELGDTMFLVLRKKPVIFMHWYHHALTFVYAVVTYSEHQAWARWSLALNLAVHTV
MYFYFAVRALNIQTPRPVAKFITTIQIVQFVISCYIFGHLVFIKSADSVPGCAVSWNVLS
IGGLMYISYLFLFAKFFYKAYIQKRSPTKTSKQE~
SEQID12.GBS.txt MSSDDRGTRl'~FKMMDQILGTNFTYEGAKEVARGLEGFSAKLAVGYIATIFGLKYYMKDRK
AFDLSTPLNIWNGILSTFSLLGFLFTFPTLLSVIRKDGFSHTYSHVSELYTDSTSGYWIF
LWVISKIPELLDTVFIVLRKRPLIFMHWYHHALTGYYALVCYHEDAVHMVWVVWMNYIIH
AFMYGYYLLKSLKVPIPPSVAQAITTSQMVQFAVAIFAQVHVSYKHYVEGVEGLAYSFRG
TAIGFFMLTTYFYLWIQFYKEHYLKNGGKKYNLAKDQAKTQTKKAN~
SEQID13.GBS.tXt MPQGEVSFFE~VLTTAPFSHELSKKHIAQTQYAAFWISMAYVVVIFGLKAVMTNRKPFDLT
GPLNLWNAGLAIFSTLGSLATTFGLLHEFFSRGFFESYIHIGDFYNGLSGMFTWLFVLSK
VAEFGDTLFIILRKKPLMFLHWYHHVLTMNYAFMSFEANLGFNTWITWMNFSVHSIMYGY
YMLRSFGVKVPAWIAKNITTMQILQFVITHFILFHVGYLAVTGQSVDSTPGYYWFCLLME
ISYWLFGNFYYQSYIKGGGKKFNAEKKTEKKIEO
SEQID14.GBS.txt MYLNYFATEI~FHRSAVCETEACRSSKIMIADVFKWKFDANELWSLLTNQDEVFPHIRARR
FIQEHFGLFVQMAIAYVILVFSIKRFMRDREPFQLTTALRLWNFFLSVFSIYGSWTMFPF
MVQQIRLYGLYGCGCEALSNLPSQAEYWLFLTILSKAVEFVDTFFLVLRKKPLIFLHWYH
HMATFVFFCSNYPTPSSQSRVGVIVNLFVHAFMYPYYFTRSMNIKVPAKISMAVTVLQLT
QFMCFIYGCTLMYYSLATNQARYPSNTPATLQCLSYTLHLLD
SEQID15.GBS.tXt MAQHPLVQRL'~LDVKFDTKRFVAIATHGPKNFPDAEGRKFFADHFDVTIQASILYMVWFG
TKWFMRNRQPFQLTIPLNIWNFILAAFSIAGAVKMTPEFFGTIANKGIVASYCKVFDFTK
GENGYWVWLFMASKLFELVDTIFLVLRKRPLMFLHWYHHILTMIYAWYSHPLTPGFNRYG
IYLNFVVHAFMYSYYFLRSMKIRVPGFIAQAITSLQIVQFIISCAVLAHLGYLMHFTNAN
CDFEPSVFKLAVFMDTTYLALFVNFFLQSYVLRGGKDKYKAVPKKKNNO
SEQID16.GBS.txt MSAEVSERFICVWTGNNETIIYSPFEYDSTLLIESCRCTYQLLILLRQIYYRDIWSHGNLK
ACDXLLLAWNGFLAVFSIMGTWRFGIEFYDAVFRXGFIXSICLAVNPRSPSAFWACMFAL
SKIAEFGDTMFLVLRKRPVIFLHWYHHAWLILSWHAAIELTAPGRWFIFMNYLVHSIMY
TYYAITSIGYRXPKIVSMTVTFLQTLQMLIGVSISCIVLYLKLNGEMCQQSYDNLALSFG
IYASFLVLSSFFNNAYLVKKDKKPDVKKD~
The polypeptide may be from a eukaryote.
The polypeptide may comprise at least a portion of the amino acid shown in SEQ
>D. 15, or variants thereof.
For the purposes of the present application, the term "variant" in relation to a certain sequence means a protein or polypeptide which is derived from the sequence through the insertion or deletion of one or more amino acid residues or the substitution of one or more ammo acid residues with amino acid residues having similar properties, e.g.
the replacement of a polar amino acid residue with another polar amino acid residue, or the replacement of a non-polar amino acid residue with another non-polar amino acid residue. In all cases, variants must have an elongase function as defined herein.
A second aspect of the invention provides a polypeptide having at least 60 %
homology to a polypeptide according to a first aspect of the invention. The polypeptide may have at least $0%, or as much as 90% or more homology to a polypeptide according to a first aspect of the invention.
The polypeptide according to either aspect of the invention may include a sequence motif responsible for Endoplasmic Reticulum (ER) - retention. This allows the polypeptide to be specifically located or targeted to the ER of a cell.
The polypeptide may also be able to elongate palmitoleic acid (PA; 16:1~~) to vacceric acid (VA; 18:14"). Thus, the polypeptide is also capable of elongation of a ~9-monounsaturated 16C fatty acid.
Preferably, the polypeptide is from an animal, more preferably, the animal is an invertebrate such as a worm. Where the animal is a worm, it is preferably C. elegans.
Alternatively, the animal is a vertebrate, preferably a mammal such as a human, rat or mouse.
A third aspect of the invention provides an isolated DNA sequence, preferably a cDNA
sequence, encoding a polypeptide according to a first or second aspect of the invention.
This DNA sequence may be used to engineer transgenic organisms.
Preferably, the DNA sequence comprises the sequence shown in SEQ 1D NO: 7 or variants of that sequence due, for example, to base substitutions, deletions, and/or additions.
A fourth aspect of the invention provides an engineered organism, such as a transgenic animal, engineered to express a polypeptide according to a first or second aspect of the invention. The engineered organism may be engineered to express elevated levels of the polypeptide, thereby providing a supply of polypeptide at a reduced cost as a reduced number of organisms need be used.
Preferably, the engineered organism is a mammal such as a rat, mouse or monkey.
A fifth aspect of the invention provides an engineered organism containing a synthetic pathway for the production of a polypeptide according to a first or second aspect of the invention. This has the advantage of allowing greater control over the production of PUFAs by the pathway by an organism.
The pathway may include D'-fatty acid desaturase, and/or 46-fatty acid desaturase.
The engineered organism according to a fourth or fifth aspect of the invention may be a lower eukaryote, such as yeast. Alternatively, the transgenic organism may be a fish.
A sixth aspect of the invention provides a transgenic plant engineered to express a polypeptide according to a first aspect of the invention.
A seventh aspect of the invention provides a transgenic plant containing a DNA
sequence according to a third aspect of the invention.
An eighth aspect of the invention provides a method of producing a PUFA
comprising carrying out an elongase reaction catalysed by a polypeptide according to a first or second aspect of the invention.
The PUFA may be di-homo-gamma-linoleic acid (20:308'"''4), arachidonic acid (20:4058°"''4), eicosapentanoic acid (20:5~5~8~"~'4~"), docosatrienoic acid (22:3~3~'6~'9), docosatetraenoic acid (22:4~'~'°~'3~'6), docosapentaenoic acid (22:50'''°~~3,16,19) or docosahexaenoic acid (22:54''''°,13.16,19)_ The PLJFA may be a 24 carbon fatty acid with at least 4 double bonds.
A ninth aspect of the invention provides a PUFA produced by a method according to an eighth aspect of the invention.
The PLTFA may be used in foodstuffs, dietary supplements or pharmaceutical compositions.
A tenth aspect of the invention provides a foodstuff comprising a PUFA
according to a fifth aspect of the invention. The foodstuff can be fed to an animal.
An eleventh aspect of the invention provides a dietary supplement comprising a PUFA
according to a fifth aspect of the invention. The dietary supplement can be supplied to an animal to augment its PUFA levels.
An twelfth aspect of the invention provides a pharmaceutical composition comprising a polypeptide according to a first or second aspect of the invention or a PLTFA
according to a ninth aspect of the invention.
Preferably, the pharmaceutical composition comprises a pharmaceutically-acceptable diluent, earner, excipient or extender. This allows the composition to be supplied in a form which best suits the pharmaceutical application in question. Fox example, a topical application would preferably be a cream or lotion, whereas if the composition was to be ingested a different form would be more suitable.
A thirteenth aspect of the invention provides a method of treatment of an animal, such as a mammal, or a plant, comprising supplying to the animal or plant a DNA sequence according to a third aspect of the invention, a foodstuff according to a tenth aspect of the invention, a dietary supplement according to an eleventh aspect of the invention, a pharmaceutical composition according to a twelfth aspect of the invention or a PUFA according to a ninth aspect of the invention.
Preferably, the mammal is a human.
The invention will now be further described, by way of example only, with reference to SEQ
>D1 to 16, and Figures 2 to 11, in which;
SEQ 1D 1 to 8 show the putative ORFs encoding PUFA elongases A to H
respectively; and SEQ ID9 to 16 show the deduced amino acid sequences of the putative ORFs of SEQ ID
NO: 1 to 8 respectively; and Figures 2 to 9 show hydrophobicity plots for each of PUFA elongases A to H
respectively.
Figure 10 shows an amino acid sequence line-up comparing the C'. elegans ORF
F56H11.4 (Z68749) with related sequences.
Figure 11 shows chromatograms of fatty acid methyl esters from transformed yeast.
Introduction to general strategy Initially the C. elegans databases were searched for any sequences which showed low levels of homology to yeast ELO genes (EL02 and EL03) using the TBLASTN programme. A
similar search was carried out using short (20 to 50 amino acid) stretches of ELO genes which were conserved amongst the three ELO polypeptide sequences. C. elegans sequences which were identified by this method were then used themselves as search probes, to identify any related C. elegans genes which the initial search with the yeast sequences failed to identify. This was necessary because the level of homology between the yeast ELO genes and ~ worm genes is always low (see BLAST scores later). To allow for a more sensitive search of worm sequences, a novel approach was adopted to circumvent the major drawback with searches using the BLAST programmes, namely that the search string (i.e.
the input search motif) must be longer than 1 S characters for the algorithm to work.
Thus, if it was desired to search for a short motif (like a histidine box), then the BLAST
programme would not be capable of doing this. A complete list of all the predicted ORFs present in the C.
elegans genome exists as a database called Wormpep, which is freely available from the Sanger WWW site (http://www.sanger.ac.uk/Projects/C_eleganslwebace front end.shtml).
The latest version of Wormpep was down loaded to the hard disc of a Pentium PC, and re-formatted as a Microsoft Word6 document, resulting in a document of about 3,500 pages.
This was then searched using the "Search & Replace" function of Word6, which also allows for the introduction of "wildcard" characters into the search motif. So, for example, it is possible to search both for the short text string HPGG, which would identify any predicted worm ORF present in the Wormpep 3,500 page document containing this motif, or alternatively search with HPGX (where X is a wild card character). Clearly, such (manual) searches of a 3,500 page document are extremely time-consuming and demanding, also requiring visual inspection of each and every identified ORF. For example, searching with a motif such as HXXHH identifies in excess of 300 different ORFs. However, by using a number of different short search strings (as outlined below), and combining these with other methods for identifying putative elongase enzymes, a number of candidate ORFs have been identified.
Database search using the FAE1 polypeptide sequence As a negative control, to demonstrate that the FAEl gene sequence was unlikely to provide a useful search sequence in the identification of C.elegans sequences encoding for PUFA
elongases, the GenBank databases (http://www.ncbi.nlm.nih.gov/Web/Search/index.html) were searched using the Arabidopis FAE1 polypeptide sequence to identify related genes or expressed sequence transcripts (ESTs). GenBank is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences (Nucleic Acid Research (1998) 26, 1-7). There are approximately 2,162,000,000 bases in 3,044,000 sequence records as of December 1998. The search was carried out using the BLAST2 (Basic Local Alignment Search Tool) algorithm (Altschul et al., (1990) ,I Mol Biol 215,403,410) Although a number of plant ORFs and ESTs were reported as being related, no animal sequences were identified by this search, confirming the observation that FAE1 was unlikely to be a suitable candidate as a search template for PUFA elongases.
Database search using yeast ELO sequences Using the three yeast fatty acid elongase sequences (ELO 1, 2, 3) as probes, a number of putative ORFs in the DNA of C. elegans-derived cosmid sequences which form the C.
elegans genomic sequence database were identified. Moreover, an extensive and time-consuming search of a downloaded copy of the WormPep database (ftp://ftp.sanger.ac.uk./pub/databases/wormpep) using manual search strings in MSWord 6, identified a number of C. elegans ORFs which contained presumptive histidine boxes.
Wormpep contains predicted proteins from the Caenorhabditis elegans genome sequence project, which is carried out jointly by the Sanger Centre in Cambridge, UK
and Genome Sequencing Center in St. Louis, USA. The current Wormpep database, Wormpep 16, contains 16,332 protein sequences (7,120,115 residues). Search strings used included [HXXHH], [HXXXHH], [QXXHH] and [YHH]. Comparison of the data from the two different searches indicated a small (<10) number of putative ORFs as candidate elongases.
The histidine box motifs are shown in bold in SEQ >D 9 to 16.
Hydrophobicity plot analysis Since the fatty acid elongase reaction is predicted to be carried out on the cytosolic face of the endomembrane system (Toke & Martin (1996), supra; Oh et al (1997), supra), the putative C. elegans ORFs were examined for potential membrane spanning domains, via Kyte & Doolittle hydrophobicity plots (J. Mol Biol, (1982), 157, 105-.132).
This revealed a number of ORFs with possible membrane-spanning domains, and also indicated a degree of similarity in the secondary-structure of a number of identified ORFs.
Screening for ER-retention signal sequences The inventors postulated that since fatty acid elongases are expected to be endoplasmic reticulum (ER) membrane proteins, they might be expected to have peptide signals which are responsible for "ER-retention". In the case of ER membrane proteins, this signal often takes the form of a C-terminal motif [K-K-XZ_,-Stop], or similar variants thereof (Jackson et al., (1990), EMBO J., 9, 3153-3162). Further sequence analysis of the C.
elegans putative elongases revealed that 4 ORFs (F41H10.7, F41H10.8, F56H11.4, Y53F4B.c) had C-terminal motifs that exactly matched this search pattern, and that a further 2 ORFs (F11E6.5, C40H1.4) had related sequences. These sequence motifs are underlined in SEQ
m 9 to 13, 15 and 16.
Chromosome mapping Since the inventors had previously observed that C.elegans genes involved in the synthesis of PUFA may exist in tandem (for example the 05 and 06 desaturases required for AA and GLA synthesis, respectively, are < 1 kB apart on chromosome N (Michaelson et al., (1998), FEBS Letts 439, 215-218), the positions of the putative C. elegans elongase ORFs were determined using the Sanger Centre's WebAce C. elegans server (http://www.sanger.ac.uk/Projects/C elegans/webace-front end.shtml).. This indicated that two pairs of putative elongases were in close proximity to each other on the C. elegans chromosome N.
F41H10.7 and F41H10.8 were identified as being approximately 10 Kb apart on chromosome N, and F56H11.3 and F56H11.4 were identified as being approximately 2 Kb apart on chromosome N.
Putative C. elegans fatty acid elongases The positions of the putative OIRF's in the C. elegans genome are shown below i.e.
chromosome number, and map position in centiMorgans, together with the GenBank database accession numbers.
The designations used employ the same method as used on the Sanger Centre's C
elegans database, i.e. ORF C40H1.4 is predicted coding sequence 4 on cosmid C40H1.
Elon ase Cosmid Saner ID GenBank Acc Chromosome Code A C40H 1.4 Z 19154 III
B D2024.3 U41011 N, 7.68 C F11E6.5 281058 IV, 18.8 D F41H10.7* U61954 N, 29.8 E F41H10.8* U61954 N, 29.8 F F56H11.3# 268749 IV, 2.5 G F56H11.4~ 268749 IV, 2.5 H Y53F4B.c 292860 II
* ors indicates genes in tandem Comparison of C. elegans putative elongase ORFs with yeast genes:
Each of the three yeast ELO polypeptides were compared against all of the worm putative elongase translated ORF sequences, and then ranked in order of similarity (as measured by the BLAST score) (Altschul et al (1990), supra) The results are shown below, with the ORF sequences ranked from most similar to least similar, and the BLAST scores are shown in brackets:
Yeast ELO1 (14 to 16 carbon fatty acid elongase) G (262) > E (241) > D (225) > C (219) > A (216) > F (215) > H (197) > B (172) Yeast EL02 (24 carbon sphingolipid elongase) E(231)>C(226)>G(189)>A(181)>F(166)>D(150)>H(141)>B(140) Yeast EL03 (24 to 26 sphingolipid elongase) D(171)>G(163)>F(154)>A(152)>E(150)>C(131)>B(132)>H(128) It is clear from the numeric values of the BLAST scores that the sequences are related, but the levels of homology are low. For comparison, the BLAST score for homology between two related worm proteins, the 05 and the 46 desaturase is in excess of 500.
Analysis of potential sphingolipid ancestry Previously, the inventors had noted the similarities between the fatty acid D6 desaturase and sphingolipid desaturases in plants, and that the two distinct enzymes could have arisen from one ancestral gene. Moreover, it was considered likely that the sphingolipid desaturase predated the fatty acid desaturase, and may in fact have been the ancesteral progenitor.
Therefore it is plausible that the next step in the arachidonic acid biosynthetic pathway has also evolved from the sphingolipid metabolic pathway. It is therefore considered highly significant that some of the C. elegans ORF putative elongases have similarity to sphingolipid enzymes. For this reason, these ORFs are considered to be very clear candidates for PUFA elongases. It has previously been considered that the C.
elegans 05 and D6 fatty acid desaturases have evolved from 1 ancestral gene (Michaelson et al., ( 1998), FEBS Letts 439, 215-218). It is also significant that one pair of C. elegans putative elongase ORFs (F & G) genetically maps close to the 05/06 fatty acid desaturase genes, with both gene pairs being located at the top end of chromosome IV.
Cosmid San eg r ID GenBank Acc Chromosome Encoded Peptide Code W08D2.4 270271 IV, 3.06 06 fatty acid desaturase T13F2.1 281122 IV, 3.06 45 fatty acid desaturase Cloning of Desaturase and Elongase Genes in Yeast Expression Vectors Putative elongases sequences F56H11.4 and F41H10.8 were cloned by PCR into the pYES2 vector (Invitrogen). A C. elegans mixed stage cDNA library was used as a PCR
template. F56H11.4 was amplified using primers:
56h114.for 5'-GCGGGTACCATGGCTCAGCATCCGCTC-3' and;
56h1 l4.rev 5'-GCGGGATCCTTAGTTGTTCTTCTTCTT-3'.
F41H10.8 was amplified using primers:
41h108.for 5'-GCGGGTACCATGCCACAGGGAGAAGTC-3' and;
41h108.rev 5'-GCGGGATCCTTATTCAATTTTTCTTTT-3'.
Amplified sequences were then restricted using Kpnl and BamHI (underlined in the forward and reverse primers, respectively), purified using the Qiagen PCR
purification kit, and ligated into a KpnI/BamHI cut pYes2 vector.
An ORF encoding the Mortierella alpina 45-fatty acid desaturase (Michaelson, L. V., et al (1998) J. Biol. Chem. 273, 19055-19059) was amplified using primers:
MadS.for 5'-GCGAATTCACCATGGGTACGGACCAAGGA-3' and;
MadS.rev 5'-GCGGAGCTCCTACTCTTCCTTGGGACG-3', and restricted using EcoRI and SacI, gel purified as described and ligated into a EcoRIlSacI
cut pESC-TRP vector (Stratagene) to generate pESC/~5.
An ORF encoding the borage O6-fatty acid desaturase (Sayanova, O., et al (1997) Proc.
Natl. Acad. Sci USA 94, 4211-4216) was restricted from pGEM3 using BamHI and XhoI
and ligated into a BamHIlXhoI cut pESC-TRP vector to generate pESC/0''.
A double construct was also generated by ligating the BamHIlXhoI borage 46 insert into the pESC/OS construct described previously, generating pESC/(05,06).
Functional Characterisation in Yeast Elongases and desaturase constructs were introduced in Saccharomyces cerevisiae W303-lA using a lithium acetate based method (Elble, R. (1992) Biotechniques 13, 18-20) and expression of the transgenes was induced by addition of galactose to 2%
(w/v) as described in Napier et al (Napier, J. A., et al (1998) Biochem J330, 611-614;
Michaelson L. V., supra; Michaelson, L. V., (1998) FEBS Letts 439, 215-218). Yeast transformants containing pYES2-derived constructs were grown on synthetic minimal media (SD, the composition of which is defined in Sherman, F ( 1991 ) Methods in Enzymology 194, 3-21 );
synthetic minimal medium minus uracil; pESC-derived constructs were grown on SD
minimal medium minus tryptophan. Co-transformed yeast (containing both pYES2 and pESC derivatives) were grown on SD minimal medium minus uracil and tryptophan.
Prior to induction, cultures were grown in the presence of 2% raffinose and supplemented with 0.5 mM of the appropriate fatty acid substrate in the presence of 1 % tergitol-(NP40) (Sigma). All cultures were then grown for a further 48-h unless indicated.
Fatty Acid Analysis To identify the elongation reaction responsible for the synthesis of di-homo-y-linolenic acid (DHGLA; 20:308"~'4) from GLA, this latter fatty acid was supplied as the (exogenous) substrate.
Lipids were extracted from transformed and control yeast by homogenisation in MeOH-CHC13 using a modification of the method of Bligh and Dyer (Dickenson &
Lester (1999) Biochim Biophys Acta 1426, 347-357). The resulting CHC13 phase was evaporated to dryness under nitrogen gas and the samples were transmethylated with 1M HCl in methanol at 80 °C for 1 hour. Fatty acid methyl esters (FAMES) were extracted in hexane and purified using a small column packed with Florisil. Analysis of FAMES was conducted using a Hewlett Packard 5880A Series Gas Chromatograph equipped with a 25M x 0.32mm RSL-SOOBP bonded capilliary column and a flame ionisation detector.
Fatty acids were identified by comparison of retention times with FAME
standards (Sigma) WO 00/$5330 CA 02365096 2001-09-14 pCT/GB00/01035 separated on the same GC. Quantitation was carried out using peak height area integrals expressed as a total of all integrals (Bligh, E.G. & Dyer, W.J. (1959) Can. J.
Biochem.
Physiol. 37, 911-917).
Total fatty acids extracted from yeast cultures were analysed by gas chromatography (GC) of methyl ester derivatives. Lipids were extracted, transmethylated and the fatty acid methyl esters (FAMEs) analysed as described by Sayanova et al.
Figure 11 shows chromatograms of fatty acid methyl esters from yeast transformed with the control (empty) plasmid pYES2 (Fig. 11A) or with ORF F56H11.4 in pYES2 (Fig.
11B).
Exogenous substrate in the form of GLA was supplied to the cultures. Two novel peaks are observed in (B); these peaks (annotated as 20:3 and 18:1 *) were identified (against known standards) as DHGLA and vaccenic acid, respectively. Detection was by flame ionisation.
One cDNA ORF tested in this manner displayed a high level of elongase activity on the GLA substrate, converting 44% to DHGLA. The identity of this elongation product was confirmed as DHGLA by comparison with a known standard (the standards used were known standards for either DHGLA, AA, EPA or VA from Sigma Chemicals, Ltd.), using GCMS analysis using a Kratos MS80RFA (Napier, J. A., supra; Michaelson, L. V., supra;
Michaelson, L. V., supra). The deduced amino acid sequence of the functional elongase clone identified it as being encoded by the C. elegans gene F56H11.4, and comparison with the yeast ELO genes showed low homology confined to a few short amino acid motifs (see Fig. 10). Some similarity with a mouse gene Cig30 (Tvrdik, P., (1997) J. Biol.
Chem. 272, 31738-31746), which has been implicated in the recruitment of brown adipose tissue in liver tissue, was also observed, as well as a potential human homologue encoded by a gene located on chromosome 4q25, BAC 207d4. The most closely related C. elegans ORFs, F41H10.8 (U61954) and F56H11.3 (Z68749) are also shown, as is part of a related human gene present on chromosome N (present on BAC clone B207d4; AC004050). The GenBank accession numbers are given for all sequences.
The range of fatty acids synthesised by C. elegans can potentially require a number of different elongation reactions (Tanaka, T., (1996) Lipids 31, 1173-1178). The substrate-specificity of the F56H11.4 PUFA elongase was therefore determined using a range of exogenously supplied fatty acids. This revealed that GLA is the major substrate, with a number of other fatty acids being elongated at a lower efficiency (see Table 1 ).
Although most of these substrates are polyunsaturated fatty acids, it was unexpectedly observed that palmitoleoic. acid (PA; 16:1 09) was also elongated by F56H11.4 to yield vaccenic acid (VA; 18:1 0"). The biosynthetic pathway for VA is unclear, but the data indicate that it may be synthesised by elongation of 0~-monounsaturated 16C
fatty acid.
The C. elegans PUFA elongase ORF F56H11.4 maps to the top of chromosome IV (at 4.32 cM) with a related sequence (F56H11.3; 51 % similarity) located 1,824bp downstream.
Another C. elegans gene (F41H10.8) was also observed, which is present on chromosome IV, and which shows a slightly higher level (53%) of similarity to the PUFA
elongase than F56H11.3 (see Fig. 10). However, when a PCR product encoding ORF F41H10.8 was expressed in yeast in a manner identical to that used for F56H11.4, the former failed to direct the elongation of any fatty acids, despite the provision of a range of substrates (see Table II).
In order to reconstitute the PUFA biosynthetic pathway in a heterologous system, the PLTFA elongase F56H11.4 was expressed in yeast in conjunction with either the 06- or OS-fatty acid desaturases previously isolated and characterised by the inventor (Napier, J.
A., supra; Michaelson, L. V., supra). Expression of the OG-fatty acid desaturase and F56H11.4 was carried out in the presence of two different substrates (LA or ALA) while the 4'-fatty acid desaturase and the elongase were expressed in the presence of GLA only.
This demonstrated that was possible to combine a desaturase and an elongase in yeast to generate significant amounts of a final "product" (see Table III). In the case of the elongase and the 4G-fatty acid desaturase, the reactions proved highly efficient with the production of 4.5% of DHGLA from the LA substrate. This resulted from 25%
desaturation of the LA substrate to GLA, which was then elongated to DHGLA at a similar level of efficiency (18%). This is lower than the % conversion observed for GLA when supplied exogenously (see Table I), indicating that the in vivo production of substrates for elongation may be rate-limiting.
If ALA was used as a substrate, 27% of this was initially O6-desaturated to yield octadecatetraenoic acid (OTA; 18:4 06.9,'z,'s) but only 8% of was subsequently elongated to yield eicosatetraenoic acid (20:4 ~8~"~'4~"). Thus, the conversion efficiency of ALA to the final 20-carbon tetraenoic PUFA was only about 2.2%.
Since DHGLA is an n-6 fatty acid, whilst the OTA-derived eicostetraenoic acid is an n-3 type, this demonstrates that the elongase is capable of accepting both forms of essential fatty acid, albeit with different efficiencies. Verification was also provided that the 20C
PUFAs synthesised in the yeast expression system were generated by the O6-desaturation of 18C substrates which were subsequently elongated, as the O6-desaturase showed no activity on 20:2 or 20:3 substrates (see Table IIII).
The combination of the D5-desaturase and the elongase also demonstrated that these two enzymes could work in tandem, although the efficiency of this overall conversion was lower (3.3% AA from GLA) which was due to the previously observed low activity of the OS-desaturase enzyme itself (Michaelson, L. V., supra; Michaelson, L. V., supra). Thus, although nearly 45% of the GLA substrate was elongated to DHGLA, only 7.5% of this was then desaturated to AA (see Table III).
Finally, the production of either AA or eicosapentanoic acid (EPA;
20:5~5~8~"~'4~") in yeast from dienoic or trienoic 18 carbon substrates was achieved via expression of all three enzymes (the two desaturases and the F56H11.4 PUFA elongase) simultaneously.
As shown in Table IV, small but significant amounts of AA were produced when the yeast was supplied with the 18C dienoic fatty acid LA.
GC-Mass Spectroscopy (MS) Analysis Peak identification and confirmation were carried out by GC-MS using a Kratos using known standards (Sigma). The identity of this 20C PLTFA was verified by GCMS, indicating that the conversion efficiency from LA was 0.65%. When ALA was used as a substrate, 12.5% of the (46-desaturated and elongated) eicosatetraenoic n-3 fatty acid was OS-desaturated, resulting in a total conversion of 0.3% of the ALA substrate to EPA (the identity of EPA was confirmed by GCMS).
Expression of C. elegans elongase in plants In order to express C. elegans elongase in plants, the following protocol is an example of a process which can be used to create the transgenic plants. C. elegans ORF
sequence can be subcloned into a plant expression vector pJD330, which comprises a viral 35S
promoter, and a Nos terminator. The resulting cassette or promoter/coding sequence/terminator can then be subcloned into the plant binary transformation vector pain 19, and the resulting plasmid introduced into Agrobacterium tumefaciens. This Agrobacterium strain can then be used to transform Arabidopsis by the vacuum-infiltration of inflorescences, and the seeds harvested and plated onto selective media containing kanamycin. Since pain 19 confers resistance to this antibotic, only transformed plant material will grow.
Resistant lines can therefore be identified and self fertilized to produce homozygous material.
Leaf material can then be analyzed for expression of C. elegans elongase.
Fatty acid methyl ester analysis can be carried out as previously described.
.. .r n ..a , n~, ~~ ..
~ ~ V, ~ C
N ,n , , . , N . .
, , , , .
r oanon o +a +~ ~ +~
+ Ya+~~~, ~~~nn N V1 r, ~O . ~ , , , . , , , , O
rr y~ CO
r N "', n .
C .-, , ~ ~O +1 +1 e0 0~
A
q~
Y, n, ,. v.., O,v W
N
p .~-v ~ ..i ~ M
O O O
~
~ 'H .-n '~
.w p v, a, a aA
O ~
~ .-i , v . Y n , v , M n ~
~
~
r. h C
+1 'H
+1 . ~
b M
~
' n . n . , n . v n , .
f ~ Y
~
~
~
+ y'~.
y'~e ae r.
.. -. .. n , , , v1 N h ... M ~ v , , , t~ , W
.~'!0 0 o -~i ~i +1 +1 '~
.. o ao .r . . . .
'' 'o ~c v . .
r ~ . .
n ~
, , n I ii ~
+ '~ g fl +
n ~0 0~
N M VI ~ t.i , , ~ , , v n W 1 , ~
M
+1 1t '1i n f Y ~ n , . , , . .
, . , v .
.
yl rr ~) ~ ~
M M
O O
+ 0~ T
n ,O , , , .
:g.~~a , , , , , , , m~~~
+1 -H 'i~
. +1 ~
.
, , , , , ~ , ~ h . , , , , , Y
~r ' p~ ~ p, ooO~o'o~po~~ ~AN~ i~
SUBSTITUTE SHEET (RULE 26) WO 00/55330 2p PCT/GB00/01035 Ov N N
oo C M M N
~
+I +I +I
+I
' (~ N
~ M
M N '~ M
Vi NMV1.-l ~~ i~~~~N ~i ~
a ~.
w O M ~ N
N
i-I +I ii +I
+ O I~ M
~ M
M ~ ~ N
~n N M ~t ~ ~ ~ ~ ~ i ~
~ ~ N ~ O
N O~ M
C O '-' .-.
~
O O
d' 00 .d.
'~'~
-H
M vi vi ~: ~
N -r ~ d' ~ ~ i v ~
W O~ ~ ~ ~ ~
d a d N N 'n O CO O
M """
~ O O
+ 00 ~O
-. vi O
~
N
. mt ~ ~ ~ ~
~ ~
N M N
O O '~ O
O
-H +I i1 ~ 1-1 ' ~ ~ +~ rr rr V V' M ~D
0 ~ O
N N v1 ~ ~ ~ ~ ~ ~ i -~ M ~ ~ i O
_ x _ ,~ _ w O N N
M
o +~+~ +~
+ os ~r '~ '~
M N " ~
'"~ Q~ O
N vi s C M ~ i ~ ~ ~ ~
N ~
OW O ~D
O .-1 O
'."' .-:
-f-I -fi i-i +I
O 'n N
~ .--~
~ h N M
d a U v0 N ~D
d' O N
i-I +i +i +i + ~ V7 M
~ N
00 M ~ ~!' M ~O -~ O
N
-~
' n ~ ~ . ~ ~
M ~ ~ ~
N ~D
v~
O O '~
O
-H -H
+I
~ Qv O~ O
'~ 'd' .-~ i!1 i ~ ~ ~ ~
v1 N ~ ~ i O~ l~
M
O O "' .-.
+I ii O ii + O Ov ~ u1 ~ O N
~
.-~ v~
~t N
b N
d b y # ~ o b aa d a d d ' ~' ooaoxoa W ' A
o~o' - ..
o o d H o v~ ,~ .-, ,-, .-.a c~ N A s t~ ~
,-. ,.aN w a w .-.
WO 00/55330 2 j PCT/GB00/01035 00 N V1 h M M ~ N M
-fl -!-i ~ -H
i-I
' 00 ~ ~ O N
Qv V1 ~. M 01 N N ~ ~!1 ~ ~ ~ ~ ~i ~ ~
...r ~ ~ ~ ~ ~ ~ i ~r d ~ a _ d' M 00 N M 00 N
Ov w +i -+~ o +I +~ -: o o -~ O~ ~G '~'~ N 'H -H
l'~ ~I
h ~t ~ N ~ M
N N ~ ~n .-.,~ ~' ~ ~ C~
N ~ ~ O~ ~
M l~
Oy N et O O O
o ii +I -ti r; .--~ V: pp O
N C~ ~ v~ ~n ~O ~ ~
~O ~ I~ 00 M ~t N
M
~ C O G O ~ ~ O O
O
.o + ,~. ii ~i M M -H i-I
-H ii ~-I
C1 d r, M d; N N ,n d: '~t 'ct h ~O C;
...y'i O ~O t'~ ~ M ~~ ~ OG
V1 N W .-n~ ~ Wn ~i;
rr C '-' O N
'n O
-H ii ~ ii +I
' t~ ~O ~ N
wt M et' ~ ~
N N Wi .-r ~ ~ ~ ~ ~~ ~ i i M ~ ~ ~
a ~D N ~h N
~; .-. M r; N 'n t~ M N
~ O O ~ O N O
,i~ C
-1-l~ Ov 'f'~ -H -H
~ N ~ O -i-I
00 00 ~ ~ Os M ~ I~
cJ ~ v1 t~
.-r .r O et ~ t~ M n ~i .-i .-i t~ N ~ .--n ~ ~ oO
~
v1 I~ N
r; n1 ~; ~; 'n r; -O O
O ' N I~ ~
N ~ ~ .-~ ..~ 00 ~
N et 'd' u'W,i i ~ W1iv n i v ~ i i i i d M 00 .-r .-; N N ~t M
r.;
N +i +f '~ ~ O
+I
C ~' t~ O ~ ~ M
N ~: ~ N o0 O
,ri N d' vi rt ~ i mt' ~
.-, v ~ ~
b N
d V ~ G c~
' ~ ~
a o -~ cv, o d d cv M ~r W d d .-. *. d ~ d d x aH
b ~~~~~~d aH o o ors o d H v ~ ~ ~~~~~-.,~ ~ac~oNA N ~Nw a c~oa~
C40H1.4 atggagcttgccgagttctggaatgatctcaacaccttcaccatctacggaccgaatcac acagatatgaccacaaaatacaaatattcatatcacttcccaggtgaacaggtggcggat ccgcagtattggacgattttattccagaaatattggtatcattcgatcacaatatcagtt ctttatttcattttaattaaggtgattcaaaagtttatggagaatcgaaaaccattcact ttgaaatacccattgattctttggaatggagctcttgcagcattcagtataattgccaca ttgcggttctctattgatcctctacgatcactatatgctgaaggattctacaaaactctg tgctattcgtgtaatccaactgatgtggctgcattttggagctttgcattcgctctttcc aagattgttgaacttggagacactatgttcattattttgagaaaacggccattgatcttt ttacactactatcatcatgcagcagtgttaatctacactgtccattctggtgccgagcat actgcagctggtcgtttctacatcctaatgaactacttcgcacattctctcatgtatact tactacacagtttctgccatgggatacagattaccgaaatgggtatcaatgactgtcaca actgttcaaacaactcaaatgttagctggagtcggaataacttggatggtgtacaaagtg aaaactgaatacaagcttccttgtcaacaatccgtagccaatttgtatctcgcattcgtc atctatgtcacatttgccattcttttcattcaattcttcgtcaaggcatacattatcaag tcgtcgaagaagtcgaaatcggtgaagaacgaataa D2024.3 atggcaaaatacgactacaatccgaagtatgggttagaaaattacagcatattccttccc tttgagacatcttttgatgcatttcgatcgacaacatggatgcaaaatcactggtatcaa tcaattacagcatctgtcgtgtatgtagccgtcatttttacaggaaagaaggtggttctc atctacaaaaaatcacgagttattacttttgagtctagccttcagaatgcaattaagaat cgaaaccgaaaatcacttaatagttctcaaatgtttcagattatggaaaagtacaagccc ttccaactggacacaccactcttcgtctggaattcatttttagccattttctcaattctc gggttcctccgaatgacacctgaatttgtatggagttggtcagcagaaggaaactcattc aaatattcaatttgtcattcatcttatgctcaaggagtcactggtttctggactgaacaa ttcgcaatgagcaaacttttcgagctcatcgacacaatcttcatcgttcttcgtaaacgt ccactcatcttccttcactggtatcatcatgtaactgttatgatctacacatggcacgcg tacaaggatcacactgcatcaggacggtggttcatttggatgaattatggagttcatgct cttatgtattcctactatgctcttcgttctctgaaattccgtcttccaaaacaaatggca atggttgttactactctccaacttgctcaaatggttatgggagtaatcatcggagtcact gtctaccgtatcaagtcatcgggtgaatactgccaacagacatgggacaatttgggatta tgctttggagtttatttcacatatttccttcttttcgccaacttcttctaccatgcatat gttaagaaaaacaaccgtacagtaaattatgaaaataattcaaaaaatttccccgatctc gttttaatttacctgagaaaaaaggtttcaagaaaatcgaaaaatcggcaatgttcagaa aataattataaaattcaattttcatcaaattttgttaatgttgatggaaaaaaacataag aaaacatatgaacttattcttccaagaagaaaaatgaccacaattttaacttttctattt ggaaaaaatcgaattttttcgaaatatcagaaaaatcgaaaaaacatttcgattcctgtt gatttcgaaattctggagccaaaagaagatatcaatgctaacatcgctgagccatccatc acaacgaggtccgccgccgcacgaagaaaagttcaaaaagctgattag F11E6.5 atggcagcagcacaaacaagtccagcagccacgctcgtcgatgttttgacaaaaccatgg agtctggatcagactgattcttacatgtctacatttgtaccattatcctataaaatcatg attggttatctcgtcaccatctacttcgggcaaaaattaatggctcacagaaaaccattc gatctccaaaatacacttgctctctggaacttcgggttttcactgttctcgggaatcgcc gcctataagcttattccagaactattcggagttttcatgaaggacgggtttgtcgcttcc tactgtcaaaacgagaactactacaccgatgcatcaactggattctggggctgggccttt gtgatgtcgaaagctccagaactaggggatactatgttcttggtccttcgtaaaaaacca gttatcttcatgcactggtatcatcatgccctcacatttgtctacgcagtagtcacatac tctgagcatcaggcatgggctcgttggtctttggctctcaaccttgccgtccacactgtt atgtatttctacttcgccgttcgcgccttgaacatccaaactccacgcccagtggcaaag ttcatcactactattcaaattgtccaatttgtcatctcatgctacatttttgggcatttg gtattcattaagtctgctgattctgttcctggttgcgctgttagctggaatgtgctatcg atcggaggactcatgtacatcagttatttgttcctttttgccaagttcttctacaaggcc tacattcaaaaacgctcaccaaccaaaaccagcaagcaggagtag F41H10.7 atgtcatcggacgatcgtggcactagaaccttcaagatgatggatcaaattcttggaaca aacttcacttatgaaggtgccaaagaagttgctcgaggccttgaaggtttctcagcaaag cttgccgtcggatatattgccactatttttggactgaaatattatatgaaagaccgaaaa gccttcgatctcagtactccattaaacatttggaatggtattctttcgacattcagctta ttgggattcttattcacttttcctactttgttatcagttatcagaaaggatggatttagt cacacctattcccatgtctctgagctttacactgacagtacctctggatattggatcttc ctttgggttatctcaaagattccggaacttttggatacagtattcattgttcttcgcaag agaccacttattttcatgcactggtaccatcacgcattgaccggttactatgctcttgtc tgctaccatgaggatgctgtccatatggtttgggttgtatggatgaattatattattcat gcattcatgtatggatactatcttctgaaatctctgaaagttccaattccaccatcagtt gctcaagcaatcaccacatctcaaatggttcaattcgcagttgccattttcgcacaagtt catgtttcctataaacactatgttgagggagttgaaggattagcctactcgttcagagga acagctatcggatttttcatgcttactacctacttctatctatggattcaattctacaaa gagcactatcttaagaatggaggcaaaaagtacaatttggcaaaggatcaggcaaaaact caaacaaagaaggctaactaa F41H10.8 (ce477) atgccacagg gagaagtctc attctttgag gtgctgacaa ctgctccatt cagtcatgag ctctcaaaaa agcatattgc acagactcag tatgctgctt tctggatctc aatggcatat gttgtcgtta tttttgggct caaggctgtc atgacaaacc gaaaaccatt tgatctcacg ggaccactga atctctggaa tgcgggtctt gctattttct caactctcgg atcacttgcc actacatttg gacttctcca cgagttcttc agccgtggat ttttcgaatc ttacattcac atcggagact tttataatgg actttctgga atgttcacat ggcttttcgt tctctcaaaa gttgctgaat tcggagatac actttttatt attcttcgta aaaagccatt gatgttcctt cattggtatc atcatgtgct tacaatgaat tatgctttta tgtcatttga agctaatttg ggatttaata cttggattac atggatgaat ttctcagttc actcaattat gtatggatat tatatgcttc gttcttttgg tgtcaaggtt ccagcatgga ttgccaagaa tattacaaca atgcaaattc ttcaattcgt tattactcat ttcattcttt tccacgttgg atatttggca gttactggac aatctgttga ctcaactcca ggatattatt ggttctgcct tctcatggaa atctcttatg tcgttctgtt cggaaacttc tactatcaat catacatcaa gggaggtggc aagaagttta atgcagagaa gaagactgaa aagaaaattg aataa F56H11.3 atgtatttgaattatttcgcgacggaaatcttccatcgtagtgcggtttgtgaaacagaa gcttgtcgctcgtcaaaaataatgattgctgacgtgttcaaatggaaattcgatgcaaac gaattgtggagtcttttaacgaatcaggatgaagttttcccgcatattagagcacggcga ttcattcaagaacattttggtctattcgtccagatggcaattgcatatgtcattttggtg ttctcaatcaaaaggttcatgagggatcgtgaaccatttcaactcaccacagctcttcgt ctctggaacttcttcctctccgtcttctcaatttatggttcctggacaatgtttccattt atggttcaacaaataagactttatggtctctacggatgtggatgcgaagcactttcaaac cttccgagtcaagcagaatattggcttttcctgacgatcttgtccaaagctgtggagttt gttgatacatttttcttggttctccggaaaaaaccactcatcttcctacactggtatcat catatggcaacatttgtcttcttctgcagtaattacccgactccatcgtcacaatcacgc gtcggagttatcgtcaacctgttcgtgcatgccttcatgtacccatactatttcacccga tcaatgaacatcaaagttcctgcgaaaatttcaatggctgttacagttcttcaattgact caattcatgtgctttatctatggatgtactctcatgtactactcgttggccactaatcag gcacgatacccctcaaatacacctgcgacactccaatgtttgtcctacactctacatttg ctttga F56H11.4 (Ce 166) atg gctcagcatc cgctcgttca acggcttctc gatgtcaaat tcgacacgaa acgatttgtg gctattgcta ctcatgggcc aaagaatttc cctgacgcaga aggtcgcaa gttctttgct gatcactttg atgttactat tcaggcttcaa tcctgtacat ggtcgttgtg ttcggaacaa aatggttcat gcgtaatcgt caaccattcc aattgactat tccactcaac atctggaatttcatcctcgc cgcattttcc atcgcaggag ctgtcaaaat gaccccagag ttctttggaa ccattgccaa caaaggaatt gtcgatcctactgc aaagtgtttg atttcacgaa aggagagaat ggatactgggt gtggctctt catggcttcc aaacttttcg aacttgttga caccatcttc ttggttctccgtaaacgtcc actcatgttc cttcactggt atcaccatat tctcaccatg atctacgcctggtactctca tccattgacc ccaggattca acagatacgg aatttatctt aactttgtcg tccacgcctt catgtactct tactacttcc ttcgctcgat gaagattcgc gtgc caggattcatcgccca agctatcaca tctcttcaaa tcgttcaatt catcatctc WO 00155330 PCTlGB00I01035 t tgcgccgttcttgctcatct tggttatctc atgcacttca ccaat gccaactgt gatttcgagc catcagtatt caagctcgca gttttcatgg acacaacata cttggctctt ttcgtcaact tcttcctcca atcatatgtt ctccgcggag gaaaagacaa gtacaaggca gtgccaaaga agaagaacaa ctaa Y53F4B.c atgtcggccg aagtgtccga acgattcaaa gtttggacag gaaacaatga gaccatcatc tattccccat tcgagtacga ttccacgttg ctcatcgagt catgtcggtg tacttatcag ctgcttatat tattgcgaca aatttattac agagatatat ggagtcacgg aaacctaaaa cttttactag catggaacgg ttttttggca gtgttcagta ttatgggtac atggagattt ggaatcgaat tctacgatgc tgttttcaga agaggcttca tcgattcgat ctgcctggct gtaaatccac gttcaccgtc cgcattctgg gcatgcatgt tcgctctatc gaaaatcgcc gagtttgggg acacgatgtt cttggtgctg aggaaacggc cggttatatt ccttcactgg tatcatcacg ctgttgttct gatcctttct tggcatgctg caatcgaact cacagctcca ggacgctggt ttatttttat gaactatttg gtgcattcaa taatgtatac atactacgca ataacatcaa tcggctatcg tcttcccaaa atcgtttcaa tgactgttac attccttcaa actcttcaaa tgctcattgg tgtcagcatt tcttgcattg tgctttattt gaagcttaat ggagagatgt gccaacaatc ctacgacaat ctggcgttga gcttcggaat ctacgcctca ttcctggtgc tattctccag tttcttcaac aatgcatatt tggtaaaaaa ggacaagaaa cccgatgtga agaaggatta a A
251 SVANLYLAFV IYVTFAILFI QFFVKAYIIK SSKKSKSVKN E*
B
401 DFEILEPKED INANIAEPSI TTRSAAARRK VQKAD*
C
251 FLFAKFFYKA YIQKRSPTKT SK E*
D
251 YFYLWIQFYK EHYLKNGGKK YNLAKDQAKT QTKKAN*
E
MPQGEVSFFE VLTTAPFSHE LSKKHIAQTQ YAAFWISMAY VWIFGLKAV
MTNRKPFDLT GPLNLWNAGL AIFSTLGSLA TTFGLLHEFF SRGFFESYIH
IGDFYNGLSG MFTWLFVLSK VAEFGDTLFI ILRKKPLMFL HWYHHVLTMN
YAFMSFEANL GFNTWITWMN FSVHSIMYGY YMLRSFGVKV PAWIAKNITT
MQILQFVITH FILFHVGYLA VTGQSVDSTP GYYWFCLLME ISYWLFGNF
YYQSYIKGGG KKFNAEKKTE KKIE*
F
251 LMYYSLATNQ ARYPSNTPAT LQCLSYTLHL L*
G
MAQHPLVQRL LDVKFDTKRF VAIATHGPKN FPDAEGRKFF ADHFDVTIQA
SILYMVVVFG TKWFMRNRQP FQLTIPLNIW NFILAAFSIA GAVKMTPEFF
GTIANKGIVA SYCKVFDFTK GENGYWVWLF MASKLFELVD TIFLVLRKRP
LMFLHV~IYHHI LTMIYAWYSH PLTPGFNRYG IYLNFVVHAF MYSYYFLRSM
KIRVPGFIAQ AITSLQIVQF IISCAVLAHL GYLMHFTNAN CDFEPSVFKL
AVFMDTTYLA LFVNFFLQSY VLRGGKDKYK AVPKKKNN*
H
MSAEVSERFKVWTGNNETIIYSPFEYDSTLLIESCRCTYQLLILLRQI
YYRDIWSHGNLKACDXLLLAWNGFLAVFSIMGTWRFGIEFYDAVFRXG
FIXSICLAVNPRSPSAFWACMFALSKIAEFGDTMFLVLRKRPVIFLHWYHH
AVVLILSWHAAIELTAPGRWFIFMNYLVHSIMYTYYAITSIGYRXPKIVSMT
VTFLQTLQMLIGVSISCIVLYLKLNGEMCQQSYDNLALSFGIYASFLVLSSFF
NNAYLVKKDKKPDVKKD*
ELONGASE.APP.txt SEQUENCE LISTING
<110> the university of Bristol <120> POLYUNSATURATED FATTY ACID (PUFA) ELONGASE FROM CAENORHABDITIS ELEGANS
<130> >1470003 <140> PCT/GB00/01035 <141> March 20, 2000 <160> 22 <170> Patentln Ver. 2.1 <210> 1 <211> 27 <212> DNA
<213> C. elegans <400> 1 gcgggtacca tggctcagca tccgctc 27 <210> 2 <211> 27 <212> DNA
<213> C. elegans <400> 2 gcgggatcct tagttgttct tcttctt 27 <210> 3 <211> 27 <212> DNA
<213> C. elegans <400> 3 gcgggtacca tgccacaggg agaagtc 27 <210> 4 <211> 27 <212> DNA
<213> c. elegans <400> 4 gcgggatcct tattcaattt ttctttt 27 <210> 5 <211> 29 <212> DNA
<213> C. elegans <400> 5 gcgaattcac catgggtacg gaccaagga 2g <210> 6 <211> 27 <212> DNA
ELONGASE.APP.tXt <213> C. ~legans <400> 6 gcggagctcc tactcttcct tgggacg 27 <210> 7 <211> 876 <212> DNA
<213> c. elegans <400> 7 atggagcttg ccgagttctg gaatgatctc aacaccttca ccatctacgg accgaatcac 60 acagatatga ccacaaaata caaatattca tatcacttcc caggtgaaca ggtggcggat 120 ccgcagtatt ggacgatttt attccagaaa tattggtatc attcgatcac aatatcagtt 180 ctttatttca ttttaattaa ggtgattcaa aagtttatgg agaatcgaaa accattcact 240 ttgaaatacc cattgattct ttggaatgga gctcttgcag cattcagtat aattgccaca 300 ttgcggttct ctattgatcc tctacgatca ctatatgctg aaggattcta caaaactctg 360 tgctattcgt gtaatccaac tgatgtggct gcattttgga gctttgcatt cgctctttcc 420 aagattgttg aacttggaga cactatgttc attattttga gaaaacggcc attgatcttt 480 ttacactact atcatcatgc agcagtgtta atctacactg tccattctgg tgccgagcat 540 actgcagctg gtcgtttcta catcctaatg aactacttcg cacattctct catgtatact 600 tactacacag tttctgccat gggatacaga ttaccgaaat gggtatcaat gactgtcaca 660 actgttcaaa caactcaaat gttagctgga gtcggaataa cttggatggt gtacaaagtg 720 aaaactgaat acaagcttcc ttgtcaacaa tccgtagcca atttgtatct cgcattcgtc 780 atctatgtca catttgccat tcttttcatt caattcttcg tcaaggcata cattatcaag 840 tcgtcgaaga agtcgaaatc ggtgaagaac gaataa <210> 8 <211> 1308 <212> DNA
<213> c. elegans <400> 8 atggcaaaat acgactacaa tccgaagtat gggttagaaa attacagcat attccttccc 60 tttgagacat cttttgatgc atttcgatcg acaacatgga tgcaaaatca ctggtatcaa 120 tcaattacag catctgtcgt gtatgtagcc gtcattttta caggaaagaa ggtggttctc 180 atctacaaaa aatcacgagt tattactttt gagtctagcc ttcagaatgc aattaagaat 240 cgaaaccgaa aatcacttaa tagttctcaa atgtttcaga ttatggaaaa gtacaagccc 300 ttccaactgg acacaccact cttcgtctgg aattcatttt tagccatttt ctcaattctc 360 gggttcctcc gaatgacacc tgaatttgta tggagttggt cagcagaagg aaactcattc 420 aaatattcaa tttgtcattc atcttatgct caaggagtca ctggtttctg gactgaacaa 480 ttcgcaatga gcaaactttt cgagctcatc gacacaatct tcatcgttct tcgtaaacgt 540 ccactcatct tccttcactg gtatcatcat gtaactgtta tgatctacac atggcacgcg 600 tacaaggatc acactgcatc aggacggtgg ttcatttgga tgaattatgg agttcatgct 660 cttatgtatt cctactatgc tcttcgttct ctgaaattcc gtcttccaaa acaaatggca 720 atggttgtta ctactctcca acttgctcaa atggttatgg gagtaatcat cggagtcact 780 gtctaccgta tcaagtcatc gggtgaatac tgccaacaga catgggacaa tttgggatta 840 tgctttggag tttatttcac atatttcctt cttttcgcca acttcttcta ccatgcatat 900 gttaagaaaa acaaccgtac agtaaattat gaaaataatt caaaaaattt ccccgatctc 960 gttttaattt acctgagaaa aaaggtttca agaaaatcga aaaatcggca atgttcagaa 1020 aataattata aaattcaatt ttcatcaaat tttgttaatg ttgatggaaa aaaacataag 1080 aaaacatatg aacttattct tccaagaaga aaaatgacca caattttaac ttttctattt 1140 ggaaaaaatc gaattttttc gaaatatcag aaaaatcgaa aaaacatttc gattcctgtt 1200 gatttcgaaa ttctggagcc aaaagaagat atcaatgcta acatcgctga gccatccatc 1260 acaacgaggt ccgccgccgc acgaagaaaa gttcaaaaag ctgattag 1308 <210> 9 <211> 825 <212> DNA
<213> c. elegans ELONGASE.APP.txt <400> 9 atggcagcag cacaaacaag tccagcagcc acgctcgtcg atgttttgac aaaaccatgg 60 agtctggatc agactgattc ttacatgtct acatttgtac cattatccta taaaatcatg 120 attggttatc tcgtcaccat ctacttcggg caaaaattaa tggctcacag aaaaccattc 180 gatctccaaa atacacttgc tctctggaac ttcgggtttt cactgttctc gggaatcgcc 240 gcctataagc ttattccaga actattcgga gttttcatga aggacgggtt tgtcgcttcc 300 tactgtcaaa acgagaacta ctacaccgat gcatcaactg gattctgggg ctgggccttt 360 gtgatgtcga aagctccaga actaggggat actatgttct tggtccttcg taaaaaacca 420 gttatcttca tgcactggta tcatcatgcc ctcacatttg tctacgcagt agtcacatac 480 tctgagcatc aggcatgggc tcgttggtct ttggctctca accttgccgt ccacactgtt 540 atgtatttct acttcgccgt tcgcgccttg aacatccaaa ctccacgccc agtggcaaag 600 ttcatcacta ctattcaaat tgtccaattt gtcatctcat gctacatttt tgggcatttg 660 gtattcatta agtctgctga ttctgttcct ggttgcgctg ttagctggaa tgtgctatcg 720 atcggaggac tcatgtacat cagttatttg ttcctttttg ccaagttctt ctacaaggcc 780 tacattcaaa aacgctcacc aaccaaaacc agcaagcagg agtag g25 <210> 10 <211> 861 <212> DNA
<213> C. elegans <400> 10 atgtcatcgg acgatcgtgg cactagaacc ttcaagatga tggatcaaat tcttggaaca 60 aacttcactt atgaaggtgc caaagaagtt gctcgaggcc ttgaaggttt ctcagcaaag 120 cttgccgtcg gatatattgc cactattttt ggactgaaat attatatgaa agaccgaaaa 180 gccttcgatc tcagtactcc attaaacatt tggaatggta ttctttcgac attcagctta 240 ttgggattct tattcacttt tcctactttg ttatcagtta tcagaaagga tggatttagt 300 cacacctatt cccatgtctc tgagctttac actgacagta cctctggata ttggatcttc 360 ctttgggtta tctcaaagat tccggaactt ttggatacag tattcattgt tcttcgcaag 420 agaccactta ttttcatgca ctggtaccat cacgcattga ccggttacta tgctcttgtc 480 tgctaccatg aggatgctgt ccatatggtt tgggttgtat ggatgaatta tattattcat 540 gcattcatgt atggatacta tcttctgaaa tctctgaaag ttccaattcc accatcagtt 600 gctcaagcaa tcaccacatc tcaaatggtt caattcgcag ttgccatttt cgcacaagtt 660 catgtttcct ataaacacta tgttgaggga gttgaaggat tagcctactc gttcagagga 720 acagctatcg gatttttcat gcttactacc tacttctatc tatggattca attctacaaa 780 gagcactatc ttaagaatgg aggcaaaaag tacaatttgg caaaggatca ggcaaaaact 840 caaacaaaga aggctaacta a 861 <210> 11 <211> 825 <212> DNA
<213> C. elegans <400> 11 atgccacagg gagaagtctc attctttgag gtgctgacaa ctgctccatt cagtcatgag 60 ctctcaaaaa agcatattgc acagactcag tatgctgctt tctggatctc aatggcatat 120 gttgtcgtta tttttgggct caaggctgtc atgacaaacc gaaaaccatt tgatctcacg 180 ggaccactga atctctggaa tgcgggtctt gctattttct caactctcgg atcacttgcc 240 actacatttg gacttctcca cgagttcttc agccgtggat ttttcgaatc ttacattcac 300 atcggagact tttataatgg actttctgga atgttcacat ggcttttcgt tctctcaaaa 360 gttgctgaat tcggagatac actttttatt attcttcgta aaaagccatt gatgttcctt 420 cattggtatc atcatgtgct tacaatgaat tatgctttta tgtcatttga agctaatttg 480 ggatttaata cttggattac atggatgaat ttctcagttc actcaattat gtatggatat 540 tatatgcttc gttcttttgg tgtcaaggtt ccagcatgga ttgccaagaa tattacaaca 600 atgcaaattc ttcaattcgt tattactcat ttcattcttt tccacgttgg atatttggca 660 gttactggac aatctgttga ctcaactcca ggatattatt ggttctgcct tctcatggaa 720 atctcttatg tcgttctgtt cggaaacttc tactatcaat catacatcaa gggaggtggc 780 aagaagttta atgcagagaa gaagactgaa aagaaaattg aataa 825 ELONGASE.APP.tXt <210> 12 <211> 846 <212> DNA
<213> C. elegans <400> 12 atgtatttga attatttcgc gacggaaatc ttccatcgta gtgcggtttg tgaaacagaa 60 gcttgtcgct cgtcaaaaat aatgattgct gacgtgttca aatggaaatt cgatgcaaac 120 gaattgtgga gtcttttaac gaatcaggat gaagttttcc cgcatattag agcacggcga 180 ttcattcaag aacattttgg tctattcgtc cagatggcaa ttgcatatgt cattttggtg 240 ttctcaatca aaaggttcat gagggatcgt gaaccatttc aactcaccac agctcttcgt 300 ctctggaact tcttcctctc cgtcttctca atttatggtt cctggacaat gtttccattt 360 atggttcaac aaataagact ttatggtctc tacggatgtg gatgcgaagc actttcaaac 420 cttccgagtc aagcagaata ttggcttttc ctgacgatct tgtccaaagc tgtggagttt 480 gttgatacat ttttcttggt tctccggaaa aaaccactca tcttcctaca ctggtatcat 540 catatggcaa catttgtctt cttctgcagt aattacccga ctccatcgtc acaatcacgc 600 gtcggagtta tcgtcaacct gttcgtgcat gccttcatgt acccatacta tttcacccga 660 tcaatgaaca tcaaagttcc tgcgaaaatt tcaatggctg ttacagttct tcaattgact 720 caattcatgt gctttatcta tggatgtact ctcatgtact actcgttggc cactaatcag 780 gcacgatacc cctcaaatac acctgcgaca ctccaatgtt tgtcctacac tctacatttg 840 ctttga 846 <210> 13 <211> 866 <212> DNA
<213> C. elegans <400> 13 atggctcagc atccgctcgt tcaacggctt ctcgatgtca aattcgacac gaaacgattt 60 gtggctattg ctactcatgg gccaaagaat ttccctgacg cagaaggtcg caagttcttt 120 gctgatcact ttgatgttac tattcaggct tcaatcctgt acatggtcgt tgtgttcgga 180 acaaaatggt tcatgcgtaa tcgtcaacca ttccaattga ctattccact caacatctgg 240 aatttcatcc tcgccgcatt ttccatcgca ggagctgtca aaatgacccc agagttcttt 300 ggaaccattg ccaacaaagg aattgtcgat cctactgcaa agtgtttgat ttcacgaaag 360 gagagaatgg atactgggtg tggctcttca tggcttccaa acttttcgaa cttgttgaca 420 ccatcttctt ggttctccgt aaacgtccac tcatgttcct tcactggtat caccatattc 480 tcaccatgat ctacgcctgg tactctcatc cattgacccc aggattcaac agatacggaa 540 tttatcttaa ctttgtcgtc cacgccttca tgtactctta ctacttcctt cgctcgatga 600 agattcgcgt gccaggattc atcgcccaag ctatcacatc tcttcaaatc gttcaattca 660 tcatctcttg cgccgttctt gctcatcttg gttatctcat gcacttcacc aatgccaact 720 gtgatttcga gccatcagta ttcaagctcg cagttttcat ggacacaaca tacttggctc 780 ttttcgtcaa cttcttcctc caatcatatg ttctccgcgg aggaaaagac aagtacaagg 840 cagtgccaaa gaagaagaac aactaa 866 <210> 14 <211> 801 <212> DNA
<213> C. elegans <400> 14 atgtcggccg aagtgtccga acgattcaaa gtttggacag gaaacaatga gaccatcatc 60 tattccccat tcgagtacga ttccacgttg ctcatcgagt catgtcggtg tacttatcag 120 ctgcttatat tattgcgaca aatttattac agagatatat ggagtcacgg aaacctaaaa 180 cttttactag catggaacgg ttttttggca gtgttcagta ttatgggtac atggagattt 240 ggaatcgaat tctacgatgc tgttttcaga agaggcttca tcgattcgat ctgcctggct 300 gtaaatccac gttcaccgtc cgcattctgg gcatgcatgt tcgctctatc gaaaatcgcc 360 gagtttgggg acacgatgtt cttggtgctg aggaaacggc cggttatatt ccttcactgg 420 tatcatcacg ctgttgttct gatcctttct tggcatgctg caatcgaact cacagctcca 480 ggacgctggt ttatttttat gaactatttg gtgcattcaa taatgtatac atactacgca 540 ataacatcaa tcggctatcg tcttcccaaa atcgtttcaa tgactgttac attccttcaa 600 actcttcaaa tgctcattgg tgtcagcatt tcttgcattg tgctttattt gaagcttaat 660 ELONGASE.APP.tXt ggagagatgt gccaacaatc ctacgacaat ctggcgttga gcttcggaat ctacgcctca 720 ttcctggtgc tattctccag tttcttcaac aatgcatatt tggtaaaaaa ggacaagaaa 780 cccgatgtga agaaggatta a <210>
<211>
<212>
PRT
<213> elegans C.
<400>
MetGlu LeuAla GluPheTrpAsn AspLeuAsn ThrPheThr IleTyr GlyPro AsnHis ThrAspMetThr ThrLysTyr LysTyrSer TyrHis PhePro GlyGlu GlnValAlaAsp ProGlnTyr TrpThrIle LeuPhe GlnLys TyrTrp TyrHisSerIle ThrIleSer ValLeuTyr PheIle LeuIle LysVal IleGlnLysPhe MetGluAsn ArgLysPro PheThr LeuLys TyrPro LeuIleLeuTrp AsnGlyAla LeuAlaAla PheSer IleIle AlaThr LeuArgPheSer IleAspPro LeuArgSer LeuTyr AlaGlu GlyPhe TyrLysThrLeu CysTyrSer CysAsnPro ThrAsp ValAla AlaPhe TrpSerPheAla PheAlaLeu SerLysIle ValGlu LeuGly AspThr MetPheIleIle LeuArgLys ArgProLeu IlePhe LeuHis TyrTyr HisHisAlaAla ValLeuIle TyrThrVal HisSer GlyAla GluHis ThrAlaAlaGly ArgPheTyr IleLeuMet AsnTyr PheAla HisSer LeuMetTyrThr TyrTyrThr ValSerAla MetGly TyrArg LeuPro LysTrpValSer MetThrVal ThrThrVal GlnThr ThrGln MetLeu AlaGlyValGly IleThrTrp MetValTyr LysVal LysThr GluTyr LysLeuProCys GlnGlnSer ValAlaAsn LeuTyr LeuAla PheVal IleTyrValThr PheAlaIle LeuPheIle GlnPhe PheVal LysAla TyrIleIleLys SerSerLys LysSerLys SerVal ELONGASE.APP.tXt ' 2?5 280 285 Lys Asn Glu <210>
<211>
<212>
PRT
<213> elegan s c.
<400>
MetAla LysTyrAsp TyrAsnPro LysTyrGly LeuGluAsn TyrSer IlePhe LeuProPhe GluThrSer PheAspAla PheArgSer ThrThr TrpMet GlnAsnHis TrpTyrGln SerIleThr AlaSerVal ValTyr ValAla ValIlePhe ThrGlyLys LysValVal LeuIleTyr LysLys SerArg ValIleThr PheGluSer SerLeuGln AsnAlaIle LysAsn ArgAsn ArgLysSer LeuAsnSer SerGlnMet PheGlnIle MetGlu LysTyr LysProPhe GlnLeuAsp ThrProLeu PheValTrp AsnSer PheLeu AlaIlePhe SerIleLeu GlyPheLeu ArgMetThr ProGlu PheVal TrpSerTrp SerAlaGlu GlyAsnSer PheLysTyr SerIle cysHis SerSerTyr AlaGlnGly ValThrGly PheTrpThr GluGln PheAla MetSerLys LeuPheGlu LeuIleAsp ThrIlePhe IleVal LeuArg LysArgPro LeuIlePhe LeuHisTrp TyrHisHis ValThr ValMet IleTyrThr TrpHisAla TyrLysAsp HisThrAla SerGly ArgTrp PheIleTrp MetAsnTyr GlyValHis AlaLeuMet TyrSer TyrTyr AlaLeuArg SerLeuLys PheArgLeu ProLysGln MetAla MetVal ValThrThr LeuGlnLeu AlaGlnMet ValMetGly ValIle IleGly ValThrVal TyrArgIle LysSerSer GlyGluTyr cysGln S
ELONGASE.APP.tXt Gln Thr Trp Asp Asn Leu Gly Leu Cys Phe Gly Val Tyr Phe Thr Tyr Phe Leu Leu Phe Ala Asn Phe Phe Tyr His Ala Tyr Val Lys Lys Asn Asn Arg Thr Val Asn Tyr Glu Asn Asn Ser Lys Asn Phe Pro Asp Leu Val Leu Ile Tyr Leu Arg Lys Lys Val Ser Arg Lys Ser Lys Asn Arg Gln Cys Ser Glu Asn Asn Tyr Lys Ile Gln Phe Ser Ser Asn Phe Val Asn Val Asp Gly Lys Lys His Lys Lys Thr Tyr Glu Leu Ile Leu Pro Arg Arg Lys Met Thr Thr Ile Leu Thr Phe Leu Phe Gly Lys Asn Arg Ile Phe Ser Lys Tyr Gln Ly5 Asn Arg Lys Asn Ile Ser Ile Pro Val Asp Phe Glu Ile Leu Glu Pro Lys Glu Asp Ile Asn Ala Asn Ile Ala Glu Pro Ser Ile Thr Thr Arg Ser Ala Ala Ala Arg Arg Lys Val Gln Lys Ala Asp <210> 17 <211> 274 <212> PRT
<213> C. elegans <400> 17 Met Ala Ala Ala Gln Thr Ser Pro Ala Ala Thr Leu Val Asp Val Leu Thr Lys Pro Trp Ser Leu Asp Gln Thr Asp Ser Tyr Met Ser Thr Phe Val Pro Leu Ser Tyr Lys Ile Met Ile Gly Tyr Leu Val Thr Ile Tyr Phe Gly Gln Lys Leu Met Ala His Arg Lys Pro Phe Asp Leu Gln Asn Thr Leu Ala Leu Trp Asn Phe Gly Phe Ser Leu Phe Ser Gly Ile Ala Ala Tyr Lys Leu Ile Pro Glu Leu Phe Gly Val Phe Met Lys Asp Gly Phe Val Ala Ser Tyr Cys Gln Asn Glu Asn Tyr Tyr Thr Asp Ala Ser ELONGASE.APP.txt Thr Gly Phe Trp Gly Trp Ala Phe Val Met Ser Lys Ala Pro Glu Leu Gly Asp Thr Met Phe Leu Val Leu Arg Lys Lys Pro Val Ile Phe Met His Trp Tyr His His Ala Leu Thr Phe Val Tyr Ala Val Val Thr Tyr Ser Glu His Gln Ala Trp Ala Arg Trp Ser Leu Ala Leu Asn Leu Ala Val His Thr Val Met Tyr Phe Tyr Phe Ala Val Arg Ala Leu Asn Ile Gln Thr Pro Arg Pro Val Ala Lys Phe Ile Thr Thr Ile Gln Ile Val Gln Phe Val Ile Ser Cys Tyr Ile Phe Gly His Leu Val Phe Ile Lys Ser Ala Asp Ser Val Pro Gly Cys Ala Val Ser Trp Asn Val Leu Ser Ile Gly Gly Leu Met Tyr Ile Ser Tyr Leu Phe Leu Phe Ala Lys Phe Phe Tyr Lys Ala Tyr Ile Gln Lys Arg Ser Pro Thr Lys Thr Ser Lys Gln Glu <210> 18 <211> 286 <212> PRT
<213> C. elegans <400> 18 Met Ser Ser Asp Asp Arg Gly Thr Arg Thr Phe Lys Met Met Asp Gln Ile Leu Gly Thr Asn Phe Thr Tyr Glu Gly Ala Lys Glu Val Ala Arg Gly Leu Glu Gly Phe Ser Ala Lys Leu Ala Val Gly Tyr Ile Ala Thr Ile Phe Gly Leu Lys Tyr Tyr Met Lys Asp Arg Lys Ala Phe Asp Leu Ser Thr Pro Leu Asn Ile Trp Asn Gly Ile Leu Ser Thr Phe Ser Leu Leu Gly Phe Leu Phe Thr Phe Pro Thr Leu Leu Ser Val Ile Arg Lys Asp Gly Phe Ser His Thr Tyr Ser His Val Ser Glu Leu Tyr Thr Asp Ser Thr Ser Gly Tyr Trp Ile Phe Leu Trp Val Ile Ser Lys Ile Pro ELONGASE.APP.tXt ' lI5 120 125 Glu Leu Leu Asp Thr Val Phe Ile Val Leu Arg Lys Arg Pro Leu Ile Phe Met His Trp Tyr His His Ala Leu Thr Gly Tyr Tyr Ala Leu Val Cys Tyr His Glu Asp Ala Val His Met Val Trp Val Val Trp Met Asn Tyr Ile Ile His Ala Phe Met Tyr Gly Tyr Tyr Leu Leu Lys Ser Leu Lys Val Pro Ile Pro Pro Ser Val Ala Gln Ala Ile Thr Thr Ser Gln Met Val Gln Phe Ala Val Ala Ile Phe Ala Gln Val His Val Ser Tyr Lys His Tyr Val Glu Gly Val Glu Gly Leu Ala Tyr Ser Phe Arg Gly Thr Ala Ile Gly Phe Phe Met Leu Thr Thr Tyr Phe Tyr Leu Trp Ile Gln Phe Tyr Lys Glu His Tyr Leu Lys Asn Gly Gly Lys Lys Tyr Asn Leu Ala Lys Asp Gln Ala Lys Thr Gln Thr Lys Lys Ala Asn <210> 19 <211> 274 <212> PRT
<213> C. elegans <400> 19 Met Pro Gln Gly Glu Val Ser Phe Phe Glu Val Leu Thr Thr Ala Pro Phe Ser His Glu Leu Ser Lys Lys His Ile Ala Gln Thr Gln Tyr Ala Ala Phe Trp Ile Ser Met Ala Tyr Val Val Val Ile Phe Gly Leu Lys Ala Val Met Thr Asn Arg Lys Pro Phe Asp Leu Thr Gly Pro Leu Asn Leu Trp Asn Ala Gly Leu Ala Ile Phe Ser Thr Leu Gly Ser Leu Ala Thr Thr Phe Gly Leu Leu His Glu Phe Phe Ser Arg Gly Phe Phe Glu Ser Tyr Ile His Ile Gly Asp Phe Tyr Asn Gly Leu Ser Gly Met Phe Thr Trp Leu Phe Val Leu Ser Lys Val Ala Glu Phe Gly Asp Thr Leu ELONGASE.APP.tXt Phe Ile Ile Leu Arg Lys Lys Pro Leu Met Phe Leu His Trp Tyr His His Val Leu Thr Met Asn Tyr Ala Phe Met Ser Phe Glu Ala Asn Leu Gly Phe Asn Thr Trp Ile Thr Trp Met Asn Phe Ser Val His Ser Ile Met Tyr Gly Tyr Tyr Met Leu Arg Ser Phe Gly Val Lys Val Pro Ala Trp Ile Ala Lys Asn Ile Thr Thr Met Gln Ile Leu Gln Phe Val Ile Thr His Phe Ile Leu Phe His Val Gly Tyr Leu Ala Val Thr Gly Gln Ser Val Asp Ser Thr Pro Gly Tyr Tyr Trp Phe Cys Leu Leu Met Glu Ile Ser Tyr Val Val Leu Phe Gly Asn Phe Tyr Tyr Gln Ser Tyr Ile Lys Gly Gly Gly Lys Lys Phe Asn Ala Glu Lys Lys Thr Glu Lys Lys Ile Glu <210> 20 <211> 281 <212> PRT
<213> C. elegans <400> 20 Met Tyr Leu Asn Tyr Phe Ala Thr Glu Ile Phe His Arg Ser Ala Val Cys Glu Thr Glu Ala Cys Arg Ser Ser Lys Ile Met Ile Ala Asp Val Phe Lys Trp Lys Phe Asp Ala Asn Glu Leu Trp Ser Leu Leu Thr Asn Gln Asp Glu Val Phe Pro His Ile Arg Ala Arg Arg Phe Ile Gln Glu His Phe Gly Leu Phe Val Gln Met Ala Ile Ala Tyr Val Ile Leu Val Phe Ser Ile Lys Arg Phe Met Arg Asp Arg Glu Pro Phe Gln Leu Thr Thr Ala Leu Arg Leu Trp Asn Phe Phe Leu Ser Val Phe Ser Ile Tyr Gly Ser Trp Thr Met Phe Pro Phe Met Val Gln Gln Ile Arg Leu Tyr ELONGASE.APP.tXt Gly Leu Tyr Gly Cys Gly Cys Glu Ala Leu Ser Asn Leu Pro Ser Gln Ala Glu Tyr Trp Leu Phe Leu Thr Ile Leu Ser Lys Ala Val Glu Phe Val Asp Thr Phe Phe Leu Val Leu Arg Lys Lys Pro Leu Ile Phe Leu His Trp Tyr His His Met Ala Thr Phe Val Phe Phe Cys Ser Asn Tyr Pro Thr Pro Ser Ser Gln Ser Arg Val Gly Val Ile Val Asn Leu Phe Val His Ala Phe Met Tyr Pro Tyr Tyr Phe Thr Arg Ser Met Asn Ile Lys Val Pro Ala Lys Ile Ser Met Ala Val Thr Val Leu Gln Leu Thr Gln Phe Met Cys Phe Ile Tyr Gly Cys Thr Leu Met Tyr Tyr Ser Leu Ala Thr Asn Gln Ala Arg Tyr Pro Ser Asn Thr Pro Ala Thr Leu Gln Cys Leu Ser Tyr Thr Leu His Leu Leu <210> 21 <211> 288 <212> PRT
<213> C. elegans <400> 21 Met Ala Gln His Pro Leu Val Gln Arg Leu Leu Asp Val Lys Phe Asp Thr Lys Arg Phe Val Ala Ile Ala Thr His Gly Pro Lys Asn Phe Pro Asp Ala Glu Gly Arg Lys Phe Phe Ala Asp His Phe Asp Val Thr Ile Gln Ala Ser Ile Leu Tyr Met Val Val Val Phe Gly Thr Lys Trp Phe Met Arg Asn Arg Gln Pro Phe Gln Leu Thr Ile Pro Leu Asn Ile Trp Asn Phe Ile Leu Ala Ala Phe Ser Ile Ala Gly Ala Val Lys Met Thr Pro Glu Phe Phe Gly Thr Ile Ala Asn Lys Gly Ile Val Ala Ser Tyr Cys Lys Val Phe Asp Phe Thr Lys Gly Glu Asn Gly Tyr Trp Val Trp Leu Phe Met Ala Ser Lys Leu Phe Glu Leu Val Asp Thr Ile Phe Leu ELONGASE.APP.tXt 130 ' 135 140 Val Leu Arg Lys Arg Pro Leu Met Phe Leu His Trp Tyr His His Ile Leu Thr Met Ile Tyr Ala Trp Tyr Ser His Pro Leu Thr Pro Gly Phe Asn Arg Tyr Gly Ile Tyr Leu Asn Phe Val Val His Ala Phe Met Tyr Ser Tyr Tyr Phe Leu Arg Ser Met Lys Ile Arg Val Pro Gly Phe Ile Ala Gln Ala Ile Thr Ser Leu Gln Ile Val Gln Phe Ile Ile Ser Cys Ala Val Leu Ala His Leu Gly Tyr Leu Met His Phe Thr Asn Ala Asn Cys Asp Phe Glu Pro Ser Val Phe Lys Leu Ala Val Phe Met Asp Thr Thr Tyr Leu Ala Leu Phe Val Asn Phe Phe Leu Gln Ser Tyr Val Leu Arg Gly Gly Lys Asp Lys Tyr Lys Ala Val Pro Lys Lys Lys Asn Asn <210> 22 <211> 269 <212> PRT
<213> C. elegans <400> 22 Met Ser Ala Glu Val Ser Glu Arg Phe Lys Val Trp Thr Gly Asn Asn Glu Thr Ile Ile Tyr Ser Pro Phe Glu Tyr Asp Ser Thr Leu Leu Ile Glu Ser Cys Arg Cys Thr Tyr Gln Leu Leu Ile Leu Leu Arg Gln Ile Tyr Tyr Arg Asp Ile Trp Ser His Gly Asn Leu Lys Ala Cys Asp Xaa Leu Leu Leu Ala Trp Asn Gly Phe Leu Ala Val Phe Ser Ile Met Gly Thr Trp Arg Phe Gly Ile Glu Phe Tyr Asp Ala Val Phe Arg Xaa Gly Phe Ile Xaa Ser Ile Cys Leu Ala Val Asn Pro Arg Ser Pro Ser Ala Phe Trp Ala Cys Met Phe Ala Leu Ser Lys Ile Ala Glu Phe Gly Asp ELONGASE.APP.tXt Thr Met PheLeu ValLeuArg LysArgProVal IlePheLeu HisTrp Tyr His HisAla ValValLeu IleLeuSerTrp HisAlaAla IleGlu Leu Thr AlaPro GlyArgTrp PheIlePheMet AsnTyrLeu ValHis Ser Ile MetTyr ThrTyrTyr AlaIleThrSer IleGlyTyr ArgXaa Pro Lys IleVal SerMetThr ValThrPheLeu GlnThrLeu GlnMet Leu Ile GlyVal SerIleSer CysIleValLeu TyrLeuLys LeuAsn Gly Glu MetCys GlnGlnSer TyrAspAsnLeu AlaLeuSer PheGly Ile Tyr AlaSer PheLeuVal LeuSerSerPhe PheAsnAsn AlaTyr Leu Val LysLys AspLysLys ProAspValLys LysAsp gcgggtacca~ggctcagcatccgctco PRIMER1.GBS.tXt PRIMER2.GBS.tXt gcgggatcct~agttgttcttcttctto PRIMER3.GBS.tXt gcgggtaccatgccacagggagaagtco PRIMER4.GBS.tXt gcgggatccttattcaatttttctttto PRIMERS.GBS.tXt gcgaattcaccatgggtacggaccaagga~
PRIMER6.GBS.tXt gcggagctcctactcttccttgggacgo SEQID1.GBS.tXt atggagcttgccgagttctggaatgatctcaacaccttcaccatctacggaccgaatcac acagatatgaccacaaaatacaaatattcatatcacttcccaggtgaacaggtggcggat ccgcagtattggacgattttattccagaaatattggtatcattcgatcacaatatcagtt ctttatttcattttaattaaggtgattcaaaagtttatggagaatcgaaaaccattcact ttgaaatacccattgattctttggaatggagctcttgcagcattcagtataattgccaca ttgcggttctctattgatcctctacgatcactatatgctgaaggattctacaaaactctg tgctattcgtgtaatccaactgatgtggctgcattttggagctttgcattcgctctttcc aagattgttgaacttggagacactatgttcattattttgagaaaacggccattgatcttt ttacactactatcatcatgcagcagtgttaatctacactgtccattctggtgccgagcat actgcagctggtcgtttctacatcctaatgaactacttcgcacattctctcatgtatact tactacacagtttctgccatgggatacagattaccgaaatgggtatcaatgactgtcaca actgttcaaacaactcaaatgttagctggagtcggaataacttggatggtgtacaaagtg aaaactgaatacaagcttccttgtcaacaatccgtagccaatttgtatctcgcattcgtc atctatgtcacatttgccattcttttcattcaattcttcgtcaaggcatacattatcaag tcgtcgaagaagtcgaaatcggtgaagaacgaataao SEQID2.GBS.tXt atggcaaaatacgactacaatccgaagtatgggttagaaaattacagcatattccttccc tttgagacatcttttgatgcatttcgatcgacaacatggatgcaaaatcactggtatcaa tcaattacagcatctgtcgtgtatgtagccgtcatttttacaggaaagaaggtggttctc atctacaaaaaatcacgagttattacttttgagtctagccttcagaatgcaattaagaat cgaaaccgaaaatcacttaatagttctcaaatgtttcagattatggaaaagtacaagccc ttccaactggacacaccactcttcgtctggaattcatttttagccattttctcaattctc gggttcctccgaatgacacctgaatttgtatggagttggtcagcagaaggaaactcattc aaatattcaatttgtcattcatcttatgctcaaggagtcactggtttctggactgaacaa ttcgcaatgagcaaacttttcgagctcatcgacacaatcttcatcgttcttcgtaaacgt ccactcatcttccttcactggtatcatcatgtaactgttatgatctacacatggcacgcg tacaaggatcacactgcatcaggacggtggttcatttggatgaattatggagttcatgct cttatgtattcctactatgctcttcgttctctgaaattccgtcttccaaaacaaatggca atggttgttactactctccaacttgctcaaatggttatgggagtaatcatcggagtcact gtctaccgtatcaagtcatcgggtgaatactgccaacagacatgggacaatttgggatta tgctttggagtttatttcacatatttccttcttttcgccaacttcttctaccatgcatat gttaagaaaaacaaccgtacagtaaattatgaaaataattcaaaaaatttccccgatctc gttttaatttacctgagaaaaaaggtttcaagaaaatcgaaaaatcggcaatgttcagaa aataattataaaattcaattttcatcaaattttgttaatgttgatggaaaaaaacataag aaaacatatgaacttattcttccaagaagaaaaatgaccacaattttaacttttctattt ggaaaaaatcgaattttttcgaaatatcagaaaaatcgaaaaaacatttcgattcctgtt gatttcgaaattctggagccaaaagaagatatcaatgctaacatcgctgagccatccatc acaacgaggtccgccgccgcacgaagaaaagttcaaaaagctgattago SEQID3.GBS.txt atggcagcagcacaaacaagtccagcagccacgctcgtcgatgttttgacaaaaccatgg agtctggatcagactgattcttacatgtctacatttgtaccattatcctataaaatcatg attggttatctcgtcaccatctacttcgggcaaaaattaatggctcacagaaaaccattc gatctccaaaatacacttgctctctggaacttcgggttttcactgttctcgggaatcgcc gcctataagcttattccagaactattcggagttttcatgaaggacgggtttgtcgcttcc tactgtcaaaacgagaactactacaccgatgcatcaactggattctggggctgggccttt gtgatgtcgaaagctccagaactaggggatactatgttcttggtccttcgtaaaaaacca gttatcttcatgcactggtatcatcatgccctcacatttgtctacgcagtagtcacatac tctgagcatcaggcatgggctcgttggtctttggctctcaaccttgccgtccacactgtt atgtatttctacttcgccgttcgcgccttgaacatccaaactccacgcccagtggcaaag ttcatcactactattcaaattgtccaatttgtcatctcatgctacatttttgggcatttg gtattcattaagtctgctgattctgttcctggttgcgctgttagctggaatgtgctatcg atcggaggactcatgtacatcagttatttgttcctttttgccaagttcttctacaaggcc tacattcaaaaacgctcaccaaccaaaaccagcaagcaggagtago atgtcatcgg'acgatc t SEQzo4.GBS.txt g ggcactagaaccttcaagatgatggatcaaattcttggaaca aacttcacttatgaaggtgccaaagaagttgctcgaggccttgaaggtttctcagcaaag cttgccgtcggatatattgccactatttttggactgaaatattatatgaaagaccgaaaa gccttcgatctcagtactccattaaacatttggaatggtattctttcgacattcagctta ttgggattcttattcacttttcctactttgttatcagttatcagaaaggatggatttagt cacacctattcccatgtctctgagctttacactgacagtacctctggatattggatcttc ctttgggttatctcaaagattccggaacttttggatacagtattcattgttcttcgcaag agaccacttattttcatgcactggtaccatcacgcattgaccggttactatgctcttgtc tgctaccatgaggatgctgtccatatggtttgggttgtatggatgaattatattattcat gcattcatgtatggatactatcttctgaaatctctgaaagttccaattccaccatcagtt gctcaagcaatcaccacatctcaaatggttcaattcgcagttgccattttcgcacaagtt catgtttcctataaacactatgttgagggagttgaaggattagcctactcgttcagagga acagctatcggatttttcatgcttactacctacttctatctatggattcaattctacaaa gagcactatcttaagaatggaggcaaaaagtacaatttggcaaaggatcaggcaaaaact caaacaaagaaggctaactaao SEQIDS.GBS.txt atgccacagggagaagtctcattctttgaggtgctgacaactgctccattcagtcatgag ctctcaaaaaagcatattgcacagactcagtatgctgctttctggatctcaatggcatat gttgtcgttatttttgggctcaaggctgtcatgacaaaccgaaaaccatttgatctcacg ggaccactgaatctctggaatgcgggtcttgctattttctcaactctcggatcacttgcc actacatttggacttctccacgagttcttcagccgtggatttttcgaatcttacattcac atcggagacttttataatggactttctggaatgttcacatggcttttcgttctctcaaaa gttgctgaattcggagatacactttttattattcttcgtaaaaagccattgatgttcctt cattggtatcatcatgtgcttacaatgaattatgcttttatgtcatttgaagctaatttg ggatttaatacttggattacatggatgaatttctcagttcactcaattatgtatggatat tatatgcttcgttcttttggtgtcaaggttccagcatggattgccaagaatattacaaca atgcaaattcttcaattcgttattactcatttcattcttttccacgttggatatttggca gttactggacaatctgttgactcaactccaggatattattggttctgccttctcatggaa atctcttatgtcgttctgttcggaaacttctactatcaatcatacatcaagggaggtggc aagaagtttaatgcagagaagaagactgaaaagaaaattgaataaa SEQID6.GBS.txt atgtatttgaattatttcgcgacggaaatcttccatcgtagtgcggtttgtgaaacagaa gcttgtcgctcgtcaaaaataatgattgctgacgtgttcaaatggaaattcgatgcaaac gaattgtggagtcttttaacgaatcaggatgaagttttcccgcatattagagcacggcga ttcattcaagaacattttggtctattcgtccagatggcaattgcatatgtcattttggtg ttctcaatcaaaaggttcatgagggatcgtgaaccatttcaactcaccacagctcttcgt ctctggaacttcttcctctccgtcttctcaatttatggttcctggacaatgtttccattt atggttcaacaaataagactttatggtctctacggatgtggatgcgaagcactttcaaac cttccgagtcaagcagaatattggcttttcctgacgatcttgtccaaagctgtggagttt gttgatacatttttcttggttctccggaaaaaaccactcatcttcctacactggtatcat catatggcaacatttgtcttcttctgcagtaattacccgactccatcgtcacaatcacgc gtcggagttatcgtcaacctgttcgtgcatgccttcatgtacccatactatttcacccga tcaatgaacatcaaagttcctgcgaaaatttcaatggctgttacagttcttcaattgact caattcatgtgctttatctatggatgtactctcatgtactactcgttggccactaatcag gcacgatacccctcaaatacacctgcgacactccaatgtttgtcctacactctacatttg ctttgao SEQID7.GBS.txt atggctcagcatccgctcgttcaacggcttctcgatgtcaaattcgacacgaaacgattt gtggctattgctactcatgggccaaagaatttccctgacgcagaaggtcgcaagttcttt gctgatcactttgatgttactattcaggcttcaatcctgtacatggtcgttgtgttcgga acaaaatggttcatgcgtaatcgtcaaccattccaattgactattccactcaacatctgg aatttcatcctcgccgcattttccatcgcaggagctgtcaaaatgaccccagagttcttt ggaaccattgccaacaaaggaattgtcgatcctactgcaaagtgtttgatttcacgaaag gagagaatggatactgggtgtggctcttcatggcttccaaacttttcgaacttgttgaca ccatcttcttggttctccgtaaacgtccactcatgttccttcactggtatcaccatattc tcaccatgatctacgcctggtactctcatccattgaccccaggattcaacagatacggaa tttatcttaactttgtcgtccacgccttcatgtactcttactacttccttcgctcgatga agattcgcgtgccaggattcatcgcccaagctatcacatctcttcaaatcgttcaattca tcatctcttgcgccgttcttgctcatcttggttatctcatgcacttcaccaatgccaact gtgatttcgagccatcagtattcaagctcgcagttttcatggacacaacatacttggctc ttttcgtcaacttcttcctccaatcatatgttctccgcggaggaaaagacaagtacaagg cagtgccaaagaagaagaacaactaa~
SEQID8.GBS.tXt atgtcggccgaagtgtccgaacgattcaaagtttggacaggaaacaatgagaccatcatc tattccccattcgagtacgattccacgttgctcatcgagtcatgtcggtgtacttatcag ctgcttatattattgcgacaaatttattacagagatatatggagtcacggaaacctaaaa cttttactagcatggaacggttttttggcagtgttcagtattatgggtacatggagattt ggaatcgaattctacgatgctgttttcagaagaggcttcatcgattcgatctgcctggct gtaaatccacgttcaccgtccgcattctgggcatgcatgttcgctctatcgaaaatcgcc gagtttggggacacgatgttcttggtgctgaggaaacggccggttatattccttcactgg tatcatcacgctgttgttctgatcctttcttggcatgctgcaatcgaactcacagctcca ggacgctggtttatttttatgaactatttggtgcattcaataatgtatacatactacgca ataacatcaatcggctatcgtcttcccaaaatcgtttcaatgactgttacattccttcaa actcttcaaatgctcattggtgtcagcatttcttgcattgtgctttatttgaagcttaat ggagagatgtgccaacaatcctacgacaatctggcgttgagcttcggaatctacgcctca ttcctggtgctattctccagtttcttcaacaatgcatatttggtaaaaaaggacaagaaa cccgatgtgaagaaggattaao SEQID9.GBS.txt MELAEFWNDL~ITFTIYGPNHTDMTTKYKYSYHFPGEQVADPQYWTILFQKYWYHSITISV
LYFILIKVIQKFMENRKPFTLKYPLILWNGALAAFSIIATLRFSIDPLRSLYAEGFYKTL
CYSCNPTDVAAFWSFAFALSKIVELGDTMFIILRKRPLIFLHYYHHAAVLIYTVHSGAEH
TAAGRFYILMNYFAHSLMYTYYTVSAMGYRLPKWVSMTVTTVQTTQMLAGVGITWMVYKV
KTEYKLPCQQSVANLYLAFVIYVTFAILFIQFFVKAYIIKSSKKSKSVKNEO
SEQID10.GBS.tXt MAKYDYNPKItGLENYSIFLPFETSFDAFRSTTWMQNHWYQSITASVVYVAVIFTGKKWL
IYKKSRVITFESSLQNAIKNRNRKSLNSSQMFQIMEKYKPFQLDTPLFVWNSFLAIFSIL
GFLRMTPEFVWSWSAEGNSFKYSICHSSYAQGVTGFWTEQFAMSKLFELIDTIFIVLRKR
PLIFLHWYHHVTVMIYTWHAYKDHTASGRWFIWMNYGVHALMYSYYALRSLKFRLPKQMA
MVVTTLQLAQMVMGVIIGVTVYRIKSSGEYCQQTWDNLGLCFGVYFTYFLLFANFFYHAY
VKKNNRTVNYENNSKNFPDLVLIYLRKKVSRKSKNRQCSENNYKIQFSSNFVNVDGKKHK
KTYELILPRRKMTTILTFLFGKNRIFSKYQKNRKNISIPVDFEILEPKEDINANIAEPSI
TTRSAAARRKVQKADD
SEQID11.GBS.tXt MAAAQTSPAA'fLVDVLTKPWSLDQTDSYMSTFVPLSYKIMIGYLVTIYFGQKLMAHRKPF
DLQNTLALWNFGFSLFSGIAAYKLIPELFGVFMKDGFVASYCQNENYYTDASTGFWGWAF
VMSKAPELGDTMFLVLRKKPVIFMHWYHHALTFVYAVVTYSEHQAWARWSLALNLAVHTV
MYFYFAVRALNIQTPRPVAKFITTIQIVQFVISCYIFGHLVFIKSADSVPGCAVSWNVLS
IGGLMYISYLFLFAKFFYKAYIQKRSPTKTSKQE~
SEQID12.GBS.txt MSSDDRGTRl'~FKMMDQILGTNFTYEGAKEVARGLEGFSAKLAVGYIATIFGLKYYMKDRK
AFDLSTPLNIWNGILSTFSLLGFLFTFPTLLSVIRKDGFSHTYSHVSELYTDSTSGYWIF
LWVISKIPELLDTVFIVLRKRPLIFMHWYHHALTGYYALVCYHEDAVHMVWVVWMNYIIH
AFMYGYYLLKSLKVPIPPSVAQAITTSQMVQFAVAIFAQVHVSYKHYVEGVEGLAYSFRG
TAIGFFMLTTYFYLWIQFYKEHYLKNGGKKYNLAKDQAKTQTKKAN~
SEQID13.GBS.tXt MPQGEVSFFE~VLTTAPFSHELSKKHIAQTQYAAFWISMAYVVVIFGLKAVMTNRKPFDLT
GPLNLWNAGLAIFSTLGSLATTFGLLHEFFSRGFFESYIHIGDFYNGLSGMFTWLFVLSK
VAEFGDTLFIILRKKPLMFLHWYHHVLTMNYAFMSFEANLGFNTWITWMNFSVHSIMYGY
YMLRSFGVKVPAWIAKNITTMQILQFVITHFILFHVGYLAVTGQSVDSTPGYYWFCLLME
ISYWLFGNFYYQSYIKGGGKKFNAEKKTEKKIEO
SEQID14.GBS.txt MYLNYFATEI~FHRSAVCETEACRSSKIMIADVFKWKFDANELWSLLTNQDEVFPHIRARR
FIQEHFGLFVQMAIAYVILVFSIKRFMRDREPFQLTTALRLWNFFLSVFSIYGSWTMFPF
MVQQIRLYGLYGCGCEALSNLPSQAEYWLFLTILSKAVEFVDTFFLVLRKKPLIFLHWYH
HMATFVFFCSNYPTPSSQSRVGVIVNLFVHAFMYPYYFTRSMNIKVPAKISMAVTVLQLT
QFMCFIYGCTLMYYSLATNQARYPSNTPATLQCLSYTLHLLD
SEQID15.GBS.tXt MAQHPLVQRL'~LDVKFDTKRFVAIATHGPKNFPDAEGRKFFADHFDVTIQASILYMVWFG
TKWFMRNRQPFQLTIPLNIWNFILAAFSIAGAVKMTPEFFGTIANKGIVASYCKVFDFTK
GENGYWVWLFMASKLFELVDTIFLVLRKRPLMFLHWYHHILTMIYAWYSHPLTPGFNRYG
IYLNFVVHAFMYSYYFLRSMKIRVPGFIAQAITSLQIVQFIISCAVLAHLGYLMHFTNAN
CDFEPSVFKLAVFMDTTYLALFVNFFLQSYVLRGGKDKYKAVPKKKNNO
SEQID16.GBS.txt MSAEVSERFICVWTGNNETIIYSPFEYDSTLLIESCRCTYQLLILLRQIYYRDIWSHGNLK
ACDXLLLAWNGFLAVFSIMGTWRFGIEFYDAVFRXGFIXSICLAVNPRSPSAFWACMFAL
SKIAEFGDTMFLVLRKRPVIFLHWYHHAWLILSWHAAIELTAPGRWFIFMNYLVHSIMY
TYYAITSIGYRXPKIVSMTVTFLQTLQMLIGVSISCIVLYLKLNGEMCQQSYDNLALSFG
IYASFLVLSSFFNNAYLVKKDKKPDVKKD~
Claims (40)
1. An isolated polypeptide comprising a functional long chain polyunsaturated fatty acid (PUFA) elongase as herein defined.
2. A polypeptide according to claim 1 wherein the polypeptide is from a eukaryote.
3. A polypeptide according to claim 1 or claim 2 wherein the polypeptide has at least a portion of the amino acid sequence shown in SEQ ID 15, or variants thereof.
4. A polypeptide having at least 60% homology to a polypeptide according to claim 3 and having a PUFA elongase function.
5. A polypeptide according to claim 4 having at least 80% homology.
6. A polypeptide according to claim 5 having at least 90% homology.
7. A polypeptide according to any preceding claim wherein the polypeptide sequence includes a sequence motif responsible for Endoplasmic Reticulum (ER) -retention.
8. A polypeptide according to any preceding claim wherein the polypeptide is capable of elongating palmitoleic acid (PA; 16:1.DELTA.9) to vacceric acid (VA;
18:1.DELTA.11).
18:1.DELTA.11).
9. A polypeptide according to any preceding claim wherein the polypeptide is from an animal.
10. A polypeptide according to claim 9 wherein the animal is an invertebrate.
11. A polypeptide according to claim 10 wherein the invertebrate is a worm.
12. A polypeptide according to claim 11 wherein the worm is C. elegans.
13. A polypeptide according to claim 9 wherein the animal is a vertebrate.
14. A polypeptide according to claim 13 wherein the vertebrate is a mammal.
15. A polypeptide according to claim 14 wherein the mammal is a human, rat or mouse.
16. A DNA sequence encoding a polypeptide according to any preceding claim.
17. A DNA sequence according to claim 16 wherein the DNA comprises the sequence shown in SEQ ID 7 or variants of that sequence due to base substitutions, deletions and/or additions.
18. An engineered organism engineered to express a polypeptide according to any one of claims 1 to 15.
19. An engineered organism according to claim 18 wherein the animal is a mammal.
20. An engineered organism according to claim 19 wherein the mammal is a rat, mouse or monkey.
21. An engineered organism containing a synthetic pathway for the production of a polypeptide according to any one of claims 1 to 15.
22. An engineered organism according to claim 21 wherein the pathway includes .DELTA.5-fatty acid desaturase.
23. An engineered organism according to claim 21 or 22 wherein the pathway includes .DELTA.6-fatty acid desaturase.
24. An engineered organism according to any one of claims 21 to 23 wherein the animal is a lower eukaryote.
25. An engineered organism according to claim 24 wherein the lower eukaryote is a yeast.
26. An engineered organism according to claim 18 wherein the animal is a fish.
27. A transgenic plant engineered to express a polypeptide according to any one of claims 1 to 15.
28. A transgenic plant containing a DNA sequence according to claim 16 or 17.
29. A method of producing a PUFA comprising carrying out an elongase reaction catalysed by a polypeptide according to any one of claims 1 to 15.
30. A method according to claim 29 wherein the PUFA is di-homo-gamma-linoleic acid (20:3.DELTA.8,11,14), arachidonic acid (20:4.DELTA.5,8,11,14), eicosapentanoic acid (20:5.DELTA.5,8,11,14,17), docosatrienoic acid (22:3.DELTA.3,16,19), docosatetraenoic acid (22:4.DELTA.7,10,13,16), docosapentaenoic acid (22:5.DELTA.7,10,13,16,19) or docosahexaenoic acid (22:4.DELTA.4,7,10,13,16,19).
31. A method according to claim 29 wherein the PITFA is a 24 carbon fatty acid with at least 4 double bonds.
32. A PUFA produced by a method according to any one of claims 29 to 31.
33. A foodstuff comprising a PUFA according to claim 32.
34. A dietary supplement comprising a PUFA according to claim 32.
35. A pharmaceutical composition comprising a polypeptide according to any one of claims 1 to 15.
36. A pharmaceutical composition comprising a PUFA according to claim 32.
37. A pharmaceutical composition according to claim 35 or claim 36 wherein the composition comprises a pharmaceutically-acceptable diluent, carrier, excipient or extender.
38. A method of elevating the PUFA levels of an animal or a plant by supplying to the animal or plant a polypeptide according to any of claims 1 to 15, a DNA
sequence according to claim 16 or claim 17, a foodstuff according to claim 33, a dietary supplement according to claim 34, a pharmaceutical composition according to any of claims 35 to 37 or a PUFA according to claim 32.
sequence according to claim 16 or claim 17, a foodstuff according to claim 33, a dietary supplement according to claim 34, a pharmaceutical composition according to any of claims 35 to 37 or a PUFA according to claim 32.
39. A method of treatment according to claim 38 wherein the animal is a mammal.
40. A method of treatment according to claim 39 wherein the mammal is a human.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB9906307.5 | 1999-03-18 | ||
GBGB9906307.5A GB9906307D0 (en) | 1999-03-18 | 1999-03-18 | Novel polypeptides |
GB0003869A GB0003869D0 (en) | 2000-02-18 | 2000-02-18 | Elongase II |
GB0003869.5 | 2000-02-18 | ||
PCT/GB2000/001035 WO2000055330A1 (en) | 1999-03-18 | 2000-03-20 | Polysaturated fatty acid (pufa) elongase from caenorhabditis elegans |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2365096A1 true CA2365096A1 (en) | 2000-09-21 |
Family
ID=26243677
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002365096A Abandoned CA2365096A1 (en) | 1999-03-18 | 2000-03-20 | Polyunsaturated fatty acid (pufa) elongase from caenorhabditis elegans |
Country Status (6)
Country | Link |
---|---|
EP (1) | EP1161542A1 (en) |
JP (1) | JP2002538826A (en) |
AU (1) | AU3308500A (en) |
CA (1) | CA2365096A1 (en) |
NO (1) | NO20014542D0 (en) |
WO (1) | WO2000055330A1 (en) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6403349B1 (en) * | 1998-09-02 | 2002-06-11 | Abbott Laboratories | Elongase gene and uses thereof |
GB0107510D0 (en) * | 2001-03-26 | 2001-05-16 | Univ Bristol | New elongase gene and a process for the production of -9-polyunsaturated fatty acids |
CA2449479A1 (en) * | 2001-06-05 | 2002-12-12 | Exelixis, Inc. | Map3ks as modifiers of the p53 pathway and methods of use |
US7705202B2 (en) | 2002-03-16 | 2010-04-27 | The University Of York | Transgenic plants expressing enzymes involved in fatty acid biosynthesis |
MX353906B (en) * | 2003-08-01 | 2018-02-02 | Basf Plant Science Gmbh | Method for the production of multiply-unsaturated fatty acids in transgenic organisms. |
DK1756280T3 (en) | 2004-04-22 | 2015-02-02 | Commw Scient Ind Res Org | SYNTHESIS OF CHAIN, polyunsaturated fatty acids BY RECOMBINANT CELLS |
EP2357243B1 (en) | 2004-04-22 | 2018-12-12 | Commonwealth Scientific and Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
DK1766023T3 (en) | 2004-06-04 | 2010-12-13 | Fluxome Sciences As | Metabolically engineered cells for the production of polyunsaturated fatty acids |
EP2059588A4 (en) | 2006-08-29 | 2010-07-28 | Commw Scient Ind Res Org | Synthesis of fatty acids |
US7892792B2 (en) | 2008-06-27 | 2011-02-22 | Indian Institute Of Science | Cells expressing Pichia cytochrome C |
CN113957105B (en) | 2008-11-18 | 2024-11-01 | 联邦科学技术研究组织 | Enzymes and methods for producing omega-3 fatty acids |
UA127917C2 (en) | 2012-06-15 | 2024-02-14 | Коммонвелт Сайнтіфік Енд Індастріел Рісерч Організейшн | A recombinant brassica napus cell containing long-chain polyunsaturated fatty acids, a transgenic brassica napus plant and seeds, a method for producing an extracted plant lipid, a food product and an ethyl ester of polyunsaturated fatty acids |
US9725399B2 (en) | 2013-12-18 | 2017-08-08 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising long chain polyunsaturated fatty acids |
KR102527795B1 (en) | 2014-06-27 | 2023-05-02 | 커먼웰쓰 사이언티픽 앤 인더스트리알 리서치 오거니제이션 | Lipid comprising docosapentaenoic acid |
CN104920307B (en) * | 2015-06-19 | 2018-01-26 | 安徽省农业科学院农业工程研究所 | A kind of Caenorthaditis elegans ws123 fixing means suitable for single-particle microbeam device |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6403349B1 (en) * | 1998-09-02 | 2002-06-11 | Abbott Laboratories | Elongase gene and uses thereof |
-
2000
- 2000-03-20 AU AU33085/00A patent/AU3308500A/en not_active Abandoned
- 2000-03-20 EP EP00911091A patent/EP1161542A1/en not_active Withdrawn
- 2000-03-20 WO PCT/GB2000/001035 patent/WO2000055330A1/en not_active Application Discontinuation
- 2000-03-20 JP JP2000605748A patent/JP2002538826A/en active Pending
- 2000-03-20 CA CA002365096A patent/CA2365096A1/en not_active Abandoned
-
2001
- 2001-09-18 NO NO20014542A patent/NO20014542D0/en unknown
Also Published As
Publication number | Publication date |
---|---|
AU3308500A (en) | 2000-10-04 |
JP2002538826A (en) | 2002-11-19 |
EP1161542A1 (en) | 2001-12-12 |
WO2000055330A1 (en) | 2000-09-21 |
NO20014542L (en) | 2001-09-18 |
NO20014542D0 (en) | 2001-09-18 |
WO2000055330A9 (en) | 2002-08-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10174297B2 (en) | Fatty acid desaturases from primula | |
CA2365096A1 (en) | Polyunsaturated fatty acid (pufa) elongase from caenorhabditis elegans | |
US7135617B2 (en) | Diacylglycerol acyl transferase proteins | |
JP5501952B2 (en) | Δ8 desaturases and their use in making polyunsaturated fatty acids | |
CA2519169C (en) | Transgenic plants expressing enzymes involved in fatty acid biosynthesis | |
Murata et al. | Acyl-lipid desaturases and their importance in the tolerance and acclimatization to cold of cyanobacteria. | |
US6489461B1 (en) | Nucleic acid sequences encoding proteins involved in fatty acid beta-oxidation and methods of use | |
Sperling et al. | Plant sphingolipids: structural diversity, biosynthesis, first genes and functions | |
KR102527795B1 (en) | Lipid comprising docosapentaenoic acid | |
EP1911837B1 (en) | FAD5-2 fatty acid desaturase family member and uses thereof | |
JP5769374B2 (en) | Mutant Δ8 desaturase genes modified by targeted mutagenesis and their use in the production of polyunsaturated fatty acids | |
WO1998046764A1 (en) | Methods and compositions for synthesis of long chain polyunsaturated fatty acids in plants | |
BRPI0710368A2 (en) | isolated polynucleotides, polypeptide, isolated nucleic acid molecule, recombinant dna construction, yarrowia sp cell, transformation methods for transformed plant production cells, seeds, methods for making long-chain polyunsaturated fatty cells, polyunsaturated fatty acids long-chain plant cells, oil, oilseeds, oil or by-products, food or feed, method of making long-chain polyunsaturated fatty acids in cells and plant offspring | |
BRPI0711028A2 (en) | "isolated polynucleotide, recombinant DNA construct, cell, method for transforming a cell, method for producing a transgenic plant, transgenic seeds, long chain polyunsaturated fatty acid production method, oils or subporudots to produce, at least one Polyunsaturated Fatty Acid in a Vegetable Cell of an Oilseed, Plants and Isolated Nucleic Acid Molecule " | |
WO1998046764A9 (en) | Methods and compositions for synthesis of long chain polyunsaturated fatty acids in plants | |
JP2009517019A (en) | Δ9 elongases and their use in the production of polyunsaturated fatty acids | |
JP2010524468A (en) | Δ9 elongase and their use in making polyunsaturated fatty acids | |
Kim et al. | Functional characterization of a delta 6-desaturase gene from the black seabream (Acanthopagrus schlegeli) | |
WO2009001315A2 (en) | Use of a class of genes encoding lysophospholipid acyl transferases for application in agriculture, biotechnology, and medicine | |
Libisch et al. | Chimeras of Δ6-fatty acid and Δ8-sphingolipid desaturases | |
CA2343969A1 (en) | Novel plant acyltransferases | |
US8269062B2 (en) | Lunaria annua, Cardamine graeca and Teesdalia nudicaulis FAE genes and their use in producing nervonic and eicosenoic acids in seed oils | |
Shockey et al. | Cloning, functional analysis, and subcellular localization of two isoforms of NADH: cytochrome b5 reductase from developing seeds of tung (Vernicia fordii) | |
Makarenko et al. | Fatty acid composition of lipids in the calluses of two pine species Pinus sibirica and Pinus sylvestris | |
Zhao et al. | A complete biosynthetic pathway of the long-chain polyunsaturated fatty acids in an amphidromous fish, ayu sweetfish Plecoglossus altivelis (Stomiati; Osmeriformes) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
FZDE | Dead |