CA2322729C - Modification of fatty acid metabolism in plants - Google Patents
Modification of fatty acid metabolism in plants Download PDFInfo
- Publication number
- CA2322729C CA2322729C CA2322729A CA2322729A CA2322729C CA 2322729 C CA2322729 C CA 2322729C CA 2322729 A CA2322729 A CA 2322729A CA 2322729 A CA2322729 A CA 2322729A CA 2322729 C CA2322729 C CA 2322729C
- Authority
- CA
- Canada
- Prior art keywords
- promoter
- plant
- ala
- gly
- leu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 230000004048 modification Effects 0.000 title abstract description 9
- 238000012986 modification Methods 0.000 title abstract description 9
- 230000004129 fatty acid metabolism Effects 0.000 title description 4
- 238000000034 method Methods 0.000 claims abstract description 68
- 102000004190 Enzymes Human genes 0.000 claims abstract description 62
- 108090000790 Enzymes Proteins 0.000 claims abstract description 62
- 238000007254 oxidation reaction Methods 0.000 claims abstract description 61
- 235000014113 dietary fatty acids Nutrition 0.000 claims abstract description 48
- 229930195729 fatty acid Natural products 0.000 claims abstract description 48
- 239000000194 fatty acid Substances 0.000 claims abstract description 48
- 150000004665 fatty acids Chemical class 0.000 claims abstract description 48
- 229920000903 polyhydroxyalkanoate Polymers 0.000 claims abstract description 39
- 230000009261 transgenic effect Effects 0.000 claims abstract description 34
- 238000004519 manufacturing process Methods 0.000 claims abstract description 27
- 239000005014 poly(hydroxyalkanoate) Substances 0.000 claims abstract description 23
- 230000003647 oxidation Effects 0.000 claims abstract description 22
- 108010023922 Enoyl-CoA hydratase Proteins 0.000 claims abstract description 6
- 102000011426 Enoyl-CoA hydratase Human genes 0.000 claims abstract description 6
- 241000196324 Embryophyta Species 0.000 claims description 142
- 108090000623 proteins and genes Proteins 0.000 claims description 119
- 108020004414 DNA Proteins 0.000 claims description 55
- 230000000694 effects Effects 0.000 claims description 41
- 108700019146 Transgenes Proteins 0.000 claims description 28
- 210000004027 cell Anatomy 0.000 claims description 25
- LWTDZKXXJRRKDG-KXBFYZLASA-N (-)-phaseollin Chemical compound C1OC2=CC(O)=CC=C2[C@H]2[C@@H]1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-KXBFYZLASA-N 0.000 claims description 20
- 244000068988 Glycine max Species 0.000 claims description 19
- 235000010469 Glycine max Nutrition 0.000 claims description 19
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 17
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 16
- 210000002706 plastid Anatomy 0.000 claims description 16
- 108010010718 poly(3-hydroxyalkanoic acid) synthase Proteins 0.000 claims description 16
- 101710089395 Oleosin Proteins 0.000 claims description 14
- 241000894006 Bacteria Species 0.000 claims description 12
- 241000589776 Pseudomonas putida Species 0.000 claims description 12
- 210000000172 cytosol Anatomy 0.000 claims description 12
- 102000002932 Thiolase Human genes 0.000 claims description 11
- 108060008225 Thiolase Proteins 0.000 claims description 11
- 210000003470 mitochondria Anatomy 0.000 claims description 11
- 101710163504 Phaseolin Proteins 0.000 claims description 10
- 230000001580 bacterial effect Effects 0.000 claims description 10
- 210000002824 peroxisome Anatomy 0.000 claims description 10
- LWTDZKXXJRRKDG-UHFFFAOYSA-N phaseollin Natural products C1OC2=CC(O)=CC=C2C2C1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-UHFFFAOYSA-N 0.000 claims description 10
- 241000219198 Brassica Species 0.000 claims description 8
- 235000011331 Brassica Nutrition 0.000 claims description 7
- 241000589516 Pseudomonas Species 0.000 claims description 7
- 240000008042 Zea mays Species 0.000 claims description 6
- 244000020551 Helianthus annuus Species 0.000 claims description 5
- 235000003222 Helianthus annuus Nutrition 0.000 claims description 5
- 240000004658 Medicago sativa Species 0.000 claims description 5
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 claims description 5
- 241001465754 Metazoa Species 0.000 claims description 5
- 244000061176 Nicotiana tabacum Species 0.000 claims description 5
- 235000002637 Nicotiana tabacum Nutrition 0.000 claims description 5
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 5
- 108091000039 acetoacetyl-CoA reductase Proteins 0.000 claims description 5
- 244000060011 Cocos nucifera Species 0.000 claims description 4
- 235000013162 Cocos nucifera Nutrition 0.000 claims description 4
- 235000004431 Linum usitatissimum Nutrition 0.000 claims description 4
- 240000006240 Linum usitatissimum Species 0.000 claims description 4
- 101710202365 Napin Proteins 0.000 claims description 4
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 4
- 108090000848 Ubiquitin Proteins 0.000 claims description 4
- 102000044159 Ubiquitin Human genes 0.000 claims description 4
- 230000008488 polyadenylation Effects 0.000 claims description 4
- 101710168820 2S seed storage albumin protein Proteins 0.000 claims description 3
- 241000588986 Alcaligenes Species 0.000 claims description 3
- 235000017060 Arachis glabrata Nutrition 0.000 claims description 3
- 244000105624 Arachis hypogaea Species 0.000 claims description 3
- 235000010777 Arachis hypogaea Nutrition 0.000 claims description 3
- 235000018262 Arachis monticola Nutrition 0.000 claims description 3
- 235000003351 Brassica cretica Nutrition 0.000 claims description 3
- 235000003343 Brassica rupestris Nutrition 0.000 claims description 3
- 241000219193 Brassicaceae Species 0.000 claims description 3
- 244000020518 Carthamus tinctorius Species 0.000 claims description 3
- 235000003255 Carthamus tinctorius Nutrition 0.000 claims description 3
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 claims description 3
- 235000012343 cottonseed oil Nutrition 0.000 claims description 3
- 235000009973 maize Nutrition 0.000 claims description 3
- 230000004060 metabolic process Effects 0.000 claims description 3
- 235000010460 mustard Nutrition 0.000 claims description 3
- 235000020232 peanut Nutrition 0.000 claims description 3
- 230000002708 enhancing effect Effects 0.000 claims description 2
- 230000002538 fungal effect Effects 0.000 claims 3
- 241001133760 Acoelorraphe Species 0.000 claims 2
- 101710146995 Acyl carrier protein Proteins 0.000 claims 2
- 241000588722 Escherichia Species 0.000 claims 2
- 102000009114 Fatty acid desaturases Human genes 0.000 claims 2
- 108010087894 Fatty acid desaturases Proteins 0.000 claims 2
- 108700039691 Genetic Promoter Regions Proteins 0.000 claims 2
- 108020004999 messenger RNA Proteins 0.000 claims 2
- 229920002494 Zein Polymers 0.000 claims 1
- 239000005019 zein Substances 0.000 claims 1
- 229940093612 zein Drugs 0.000 claims 1
- 239000002028 Biomass Substances 0.000 abstract description 14
- 229920000642 polymer Polymers 0.000 abstract description 14
- 230000015572 biosynthetic process Effects 0.000 abstract description 12
- 238000003786 synthesis reaction Methods 0.000 abstract description 7
- 241000607516 Aeromonas caviae Species 0.000 abstract description 5
- 241001464837 Viridiplantae Species 0.000 abstract description 2
- 230000003247 decreasing effect Effects 0.000 abstract description 2
- 239000013612 plasmid Substances 0.000 description 37
- 102000004169 proteins and genes Human genes 0.000 description 35
- 239000013615 primer Substances 0.000 description 33
- 235000018102 proteins Nutrition 0.000 description 33
- 230000009466 transformation Effects 0.000 description 33
- 238000006243 chemical reaction Methods 0.000 description 26
- 238000003556 assay Methods 0.000 description 21
- 108090000765 processed proteins & peptides Proteins 0.000 description 20
- 239000012634 fragment Substances 0.000 description 18
- 239000000758 substrate Substances 0.000 description 18
- 108091034117 Oligonucleotide Proteins 0.000 description 17
- 208000037534 Progressive hemifacial atrophy Diseases 0.000 description 17
- 238000012017 passive hemagglutination assay Methods 0.000 description 17
- 102000004196 processed proteins & peptides Human genes 0.000 description 17
- 230000037361 pathway Effects 0.000 description 16
- 239000013598 vector Substances 0.000 description 16
- 101710088194 Dehydrogenase Proteins 0.000 description 15
- 230000004927 fusion Effects 0.000 description 15
- 229920001184 polypeptide Polymers 0.000 description 15
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 14
- 239000002609 medium Substances 0.000 description 14
- 230000008685 targeting Effects 0.000 description 14
- 239000000203 mixture Substances 0.000 description 13
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 description 12
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 12
- 150000007970 thio esters Chemical class 0.000 description 12
- 229930006000 Sucrose Natural products 0.000 description 11
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 11
- 238000003752 polymerase chain reaction Methods 0.000 description 11
- 239000005720 sucrose Substances 0.000 description 11
- 238000004458 analytical method Methods 0.000 description 10
- 101150007867 rbfox2 gene Proteins 0.000 description 10
- 108700011203 Phaseolus vulgaris phaseolin Proteins 0.000 description 9
- 150000001413 amino acids Chemical class 0.000 description 9
- BOPGDPNILDQYTO-NNYOXOHSSA-N nicotinamide-adenine dinucleotide Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 BOPGDPNILDQYTO-NNYOXOHSSA-N 0.000 description 9
- 239000000047 product Substances 0.000 description 9
- 230000001105 regulatory effect Effects 0.000 description 9
- 238000005119 centrifugation Methods 0.000 description 8
- 108010049041 glutamylalanine Proteins 0.000 description 8
- 230000012010 growth Effects 0.000 description 8
- 239000000543 intermediate Substances 0.000 description 8
- 230000001404 mediated effect Effects 0.000 description 8
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 8
- 239000006870 ms-medium Substances 0.000 description 8
- 239000000523 sample Substances 0.000 description 8
- 102000004879 Racemases and epimerases Human genes 0.000 description 7
- 108090001066 Racemases and epimerases Proteins 0.000 description 7
- OJFDKHTZOUZBOS-CITAKDKDSA-N acetoacetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 OJFDKHTZOUZBOS-CITAKDKDSA-N 0.000 description 7
- 125000002252 acyl group Chemical group 0.000 description 7
- 238000010276 construction Methods 0.000 description 7
- 210000000805 cytoplasm Anatomy 0.000 description 7
- 108010050848 glycylleucine Proteins 0.000 description 7
- 210000001519 tissue Anatomy 0.000 description 7
- 108010003902 Acetyl-CoA C-acyltransferase Proteins 0.000 description 6
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- 102000004195 Isomerases Human genes 0.000 description 6
- 108090000769 Isomerases Proteins 0.000 description 6
- 108010092854 aspartyllysine Proteins 0.000 description 6
- 238000010367 cloning Methods 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 108010078144 glutaminyl-glycine Proteins 0.000 description 6
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 6
- 150000002632 lipids Chemical class 0.000 description 6
- 239000002773 nucleotide Substances 0.000 description 6
- 125000003729 nucleotide group Chemical group 0.000 description 6
- 108010061238 threonyl-glycine Proteins 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- 102000004672 Acetyl-CoA C-acyltransferase Human genes 0.000 description 5
- 241000589158 Agrobacterium Species 0.000 description 5
- 241000588724 Escherichia coli Species 0.000 description 5
- 241000206602 Eukaryota Species 0.000 description 5
- 229920003266 Leaf® Polymers 0.000 description 5
- 241000124008 Mammalia Species 0.000 description 5
- 108010047290 Multifunctional Enzymes Proteins 0.000 description 5
- 102000006833 Multifunctional Enzymes Human genes 0.000 description 5
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 5
- 230000009418 agronomic effect Effects 0.000 description 5
- 108010047495 alanylglycine Proteins 0.000 description 5
- 230000008676 import Effects 0.000 description 5
- 108010034529 leucyl-lysine Proteins 0.000 description 5
- 239000012528 membrane Substances 0.000 description 5
- 108010012581 phenylalanylglutamate Proteins 0.000 description 5
- 108010051242 phenylalanylserine Proteins 0.000 description 5
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 4
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 4
- NWBJYWHLCVSVIJ-UHFFFAOYSA-N N-benzyladenine Chemical compound N=1C=NC=2NC=NC=2C=1NCC1=CC=CC=C1 NWBJYWHLCVSVIJ-UHFFFAOYSA-N 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- 101100281518 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) fox-2 gene Proteins 0.000 description 4
- 241000589538 Pseudomonas fragi Species 0.000 description 4
- 102000018120 Recombinases Human genes 0.000 description 4
- 108010091086 Recombinases Proteins 0.000 description 4
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- 229940098773 bovine serum albumin Drugs 0.000 description 4
- 230000015556 catabolic process Effects 0.000 description 4
- 229920001577 copolymer Polymers 0.000 description 4
- 230000001086 cytosolic effect Effects 0.000 description 4
- 238000004520 electroporation Methods 0.000 description 4
- 230000004133 fatty acid degradation Effects 0.000 description 4
- 239000004459 forage Substances 0.000 description 4
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 4
- 229930027917 kanamycin Natural products 0.000 description 4
- 229960000318 kanamycin Drugs 0.000 description 4
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 4
- 229930182823 kanamycin A Natural products 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 238000012544 monitoring process Methods 0.000 description 4
- 239000000178 monomer Substances 0.000 description 4
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 4
- 210000003463 organelle Anatomy 0.000 description 4
- 230000000858 peroxisomal effect Effects 0.000 description 4
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 4
- 238000003976 plant breeding Methods 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 239000000725 suspension Substances 0.000 description 4
- 238000001262 western blot Methods 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 3
- 108010001058 Acyl-CoA Dehydrogenase Proteins 0.000 description 3
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 3
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 3
- 241000252867 Cupriavidus metallidurans Species 0.000 description 3
- 241000233866 Fungi Species 0.000 description 3
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 3
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 3
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 3
- 206010021929 Infertility male Diseases 0.000 description 3
- 108010025815 Kanamycin Kinase Proteins 0.000 description 3
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 3
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 3
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 3
- 108090001030 Lipoproteins Proteins 0.000 description 3
- 102000004895 Lipoproteins Human genes 0.000 description 3
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 3
- 208000007466 Male Infertility Diseases 0.000 description 3
- 101000913652 Mus musculus Fibronectin type III domain-containing protein 5 Proteins 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- 102000004316 Oxidoreductases Human genes 0.000 description 3
- 108090000854 Oxidoreductases Proteins 0.000 description 3
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 3
- 229920000331 Polyhydroxybutyrate Polymers 0.000 description 3
- 101100173179 Pseudomonas fragi fadA gene Proteins 0.000 description 3
- 108010079005 RDV peptide Proteins 0.000 description 3
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 3
- XJLXINKUBYWONI-DQQFMEOOSA-N [[(2r,3r,4r,5r)-5-(6-aminopurin-9-yl)-3-hydroxy-4-phosphonooxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(2s,3r,4s,5s)-5-(3-carbamoylpyridin-1-ium-1-yl)-3,4-dihydroxyoxolan-2-yl]methyl phosphate Chemical compound NC(=O)C1=CC=C[N+]([C@@H]2[C@H]([C@@H](O)[C@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-DQQFMEOOSA-N 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 108010013835 arginine glutamate Proteins 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000001851 biosynthetic effect Effects 0.000 description 3
- 210000004899 c-terminal region Anatomy 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- KFWWCMJSYSSPSK-PAXLJYGASA-N crotonoyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)/C=C/C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 KFWWCMJSYSSPSK-PAXLJYGASA-N 0.000 description 3
- 238000006731 degradation reaction Methods 0.000 description 3
- 239000000835 fiber Substances 0.000 description 3
- 108010010147 glycylglutamine Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- 238000003306 harvesting Methods 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- OOYGSFOGFJDDHP-KMCOLRRFSA-N kanamycin A sulfate Chemical compound OS(O)(=O)=O.O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N OOYGSFOGFJDDHP-KMCOLRRFSA-N 0.000 description 3
- 229960002064 kanamycin sulfate Drugs 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 108010064235 lysylglycine Proteins 0.000 description 3
- 108010005942 methionylglycine Proteins 0.000 description 3
- 108010085203 methionylmethionine Proteins 0.000 description 3
- 238000002703 mutagenesis Methods 0.000 description 3
- 231100000350 mutagenesis Toxicity 0.000 description 3
- 235000019198 oils Nutrition 0.000 description 3
- 239000002245 particle Substances 0.000 description 3
- 239000008188 pellet Substances 0.000 description 3
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 3
- 239000005015 poly(hydroxybutyrate) Substances 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 108010090894 prolylleucine Proteins 0.000 description 3
- 210000001938 protoplast Anatomy 0.000 description 3
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 3
- 239000004460 silage Substances 0.000 description 3
- -1 stems Substances 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 108010006229 Acetyl-CoA C-acetyltransferase Proteins 0.000 description 2
- 102000002735 Acyl-CoA Dehydrogenase Human genes 0.000 description 2
- 102000004539 Acyl-CoA Oxidase Human genes 0.000 description 2
- 108020001558 Acyl-CoA oxidase Proteins 0.000 description 2
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 2
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 2
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 2
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 2
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 2
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- 241000219194 Arabidopsis Species 0.000 description 2
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 2
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 2
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 2
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 2
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 2
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 2
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- 235000006463 Brassica alba Nutrition 0.000 description 2
- 244000140786 Brassica hirta Species 0.000 description 2
- 240000002791 Brassica napus Species 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical group [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- 241001390275 Carinata Species 0.000 description 2
- 241000195493 Cryptophyta Species 0.000 description 2
- 239000003155 DNA primer Substances 0.000 description 2
- 229930182566 Gentamicin Natural products 0.000 description 2
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 2
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 2
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- QYPKJXSMLMREKF-BPUTZDHNSA-N Glu-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N QYPKJXSMLMREKF-BPUTZDHNSA-N 0.000 description 2
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 2
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 2
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 2
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 2
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 2
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 2
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 2
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 2
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 2
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- 241001509401 Gordonia rubripertincta Species 0.000 description 2
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 2
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 2
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 2
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 2
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 2
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 2
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- FOEHRHOBWFQSNW-KATARQTJSA-N Leu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N)O FOEHRHOBWFQSNW-KATARQTJSA-N 0.000 description 2
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 2
- BTSXLXFPMZXVPR-DLOVCJGASA-N Lys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BTSXLXFPMZXVPR-DLOVCJGASA-N 0.000 description 2
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 2
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 2
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 2
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 2
- WXJXYMFUTRXRGO-UWVGGRQHSA-N Met-His-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 WXJXYMFUTRXRGO-UWVGGRQHSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 2
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 2
- 241000589781 Pseudomonas oleovorans Species 0.000 description 2
- 241000187563 Rhodococcus ruber Species 0.000 description 2
- 241000190984 Rhodospirillum rubrum Species 0.000 description 2
- 241000235070 Saccharomyces Species 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- XUIMIQQOPSSXEZ-UHFFFAOYSA-N Silicon Chemical compound [Si] XUIMIQQOPSSXEZ-UHFFFAOYSA-N 0.000 description 2
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- 108091036066 Three prime untranslated region Proteins 0.000 description 2
- HJXOFWKCWLHYIJ-SZMVWBNQSA-N Trp-Lys-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HJXOFWKCWLHYIJ-SZMVWBNQSA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 2
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 2
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 2
- 241000589153 Zoogloea ramigera Species 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 2
- 239000003905 agrochemical Substances 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 229920001222 biopolymer Polymers 0.000 description 2
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 2
- 239000007844 bleaching agent Substances 0.000 description 2
- FPPNZSSZRUTDAP-UWFZAAFLSA-N carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 2
- 229960003669 carbenicillin Drugs 0.000 description 2
- 125000004432 carbon atom Chemical group C* 0.000 description 2
- 230000019522 cellular metabolic process Effects 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 235000005822 corn Nutrition 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- 230000009260 cross reactivity Effects 0.000 description 2
- 238000005520 cutting process Methods 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 239000012153 distilled water Substances 0.000 description 2
- 238000006911 enzymatic reaction Methods 0.000 description 2
- 238000006345 epimerization reaction Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 150000002185 fatty acyl-CoAs Chemical class 0.000 description 2
- 239000012869 germination medium Substances 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 230000002363 herbicidal effect Effects 0.000 description 2
- 239000004009 herbicide Substances 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 230000006801 homologous recombination Effects 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 230000036571 hydration Effects 0.000 description 2
- 238000006703 hydration reaction Methods 0.000 description 2
- 150000001261 hydroxy acids Chemical class 0.000 description 2
- 229910052588 hydroxylapatite Inorganic materials 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- JTEDVYBZBROSJT-UHFFFAOYSA-N indole-3-butyric acid Chemical compound C1=CC=C2C(CCCC(=O)O)=CNC2=C1 JTEDVYBZBROSJT-UHFFFAOYSA-N 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 230000037356 lipid metabolism Effects 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 230000002503 metabolic effect Effects 0.000 description 2
- 150000004702 methyl esters Chemical class 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 239000003921 oil Substances 0.000 description 2
- XYJRXVWERLGGKC-UHFFFAOYSA-D pentacalcium;hydroxide;triphosphate Chemical compound [OH-].[Ca+2].[Ca+2].[Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O XYJRXVWERLGGKC-UHFFFAOYSA-D 0.000 description 2
- 101150048611 phaC gene Proteins 0.000 description 2
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 2
- 238000004382 potting Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 239000012492 regenerant Substances 0.000 description 2
- 230000008929 regeneration Effects 0.000 description 2
- 238000011069 regeneration method Methods 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 229920006395 saturated elastomer Polymers 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 230000008117 seed development Effects 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 229910052710 silicon Inorganic materials 0.000 description 2
- 239000010703 silicon Substances 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- SUKJFIGYRHOWBL-UHFFFAOYSA-N sodium hypochlorite Chemical compound [Na+].Cl[O-] SUKJFIGYRHOWBL-UHFFFAOYSA-N 0.000 description 2
- 238000000527 sonication Methods 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 229940027257 timentin Drugs 0.000 description 2
- 230000005030 transcription termination Effects 0.000 description 2
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 2
- 239000011782 vitamin Substances 0.000 description 2
- 229940088594 vitamin Drugs 0.000 description 2
- 229930003231 vitamin Natural products 0.000 description 2
- 235000013343 vitamin Nutrition 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- OFHXPCLWHLXQHT-JKQORVJESA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2,6-diaminohexanoyl]amino]-3-methylbutanoyl]amino]-4-methylpentanoyl]amino]butanedioic acid Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN OFHXPCLWHLXQHT-JKQORVJESA-N 0.000 description 1
- SGIWPAGWLKAFCF-FLKCQLHMSA-N (2s)-2-[[(2s)-2-[[(2s)-4-amino-2-[[(2s)-2-[[(2s)-2-amino-3-methylbutanoyl]amino]-3-methylbutanoyl]amino]-4-oxobutanoyl]amino]-3-carboxypropanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C SGIWPAGWLKAFCF-FLKCQLHMSA-N 0.000 description 1
- GYNQVPIDAQTZOY-ROUUACIJSA-N (2s)-2-[[2-[[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]amino]acetyl]amino]-3-phenylpropanoic acid Chemical compound C([C@H](N)C(=O)NCC(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 GYNQVPIDAQTZOY-ROUUACIJSA-N 0.000 description 1
- QHHKKMYHDBRONY-WZZMXTMRSA-N (R)-3-hydroxybutanoyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C[C@H](O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 QHHKKMYHDBRONY-WZZMXTMRSA-N 0.000 description 1
- WHBMMWSBFZVSSR-GSVOUGTGSA-N (R)-3-hydroxybutyric acid Chemical compound C[C@@H](O)CC(O)=O WHBMMWSBFZVSSR-GSVOUGTGSA-N 0.000 description 1
- VAAHKRMGOFIORX-RLVCGTLZSA-N (R)-3-hydroxyhexanoyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C[C@H](O)CCC)O[C@H]1N1C2=NC=NC(N)=C2N=C1 VAAHKRMGOFIORX-RLVCGTLZSA-N 0.000 description 1
- YYGYPCRWZMLSGK-MXYRDGFCSA-N (R)-3-hydroxypentanoyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C[C@H](O)CC)O[C@H]1N1C2=NC=NC(N)=C2N=C1 YYGYPCRWZMLSGK-MXYRDGFCSA-N 0.000 description 1
- YZAZXIUFBCPZGB-QZOPMXJLSA-N (z)-octadec-9-enoic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O.CCCCCCCC\C=C/CCCCCCCC(O)=O YZAZXIUFBCPZGB-QZOPMXJLSA-N 0.000 description 1
- 108090000344 1,4-alpha-Glucan Branching Enzyme Proteins 0.000 description 1
- 102000003925 1,4-alpha-Glucan Branching Enzyme Human genes 0.000 description 1
- XWTNPSHCJMZAHQ-QMMMGPOBSA-N 2-[[2-[[2-[[(2s)-2-amino-4-methylpentanoyl]amino]acetyl]amino]acetyl]amino]acetic acid Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(=O)NCC(O)=O XWTNPSHCJMZAHQ-QMMMGPOBSA-N 0.000 description 1
- 108700020831 3-Hydroxyacyl-CoA Dehydrogenase Proteins 0.000 description 1
- 102100021834 3-hydroxyacyl-CoA dehydrogenase Human genes 0.000 description 1
- 108020005029 5' Flanking Region Proteins 0.000 description 1
- IJJWOSAXNHWBPR-HUBLWGQQSA-N 5-[(3as,4s,6ar)-2-oxo-1,3,3a,4,6,6a-hexahydrothieno[3,4-d]imidazol-4-yl]-n-(6-hydrazinyl-6-oxohexyl)pentanamide Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)NCCCCCC(=O)NN)SC[C@@H]21 IJJWOSAXNHWBPR-HUBLWGQQSA-N 0.000 description 1
- 102100037768 Acetyl-CoA acetyltransferase, mitochondrial Human genes 0.000 description 1
- 108010016219 Acetyl-CoA carboxylase Proteins 0.000 description 1
- 102000000452 Acetyl-CoA carboxylase Human genes 0.000 description 1
- 241000589291 Acinetobacter Species 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 102000002296 Acyl-CoA Dehydrogenases Human genes 0.000 description 1
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 1
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical group C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 1
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 1
- FSBCNCKIQZZASN-GUBZILKMSA-N Ala-Arg-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O FSBCNCKIQZZASN-GUBZILKMSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- HWPXGQCMZITGFN-XVYDVKMFSA-N Ala-Cys-His Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HWPXGQCMZITGFN-XVYDVKMFSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- FDAZDMAFZYTHGS-XVYDVKMFSA-N Ala-His-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FDAZDMAFZYTHGS-XVYDVKMFSA-N 0.000 description 1
- SHKGHIFSEAGTNL-DLOVCJGASA-N Ala-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 SHKGHIFSEAGTNL-DLOVCJGASA-N 0.000 description 1
- FAJIYNONGXEXAI-CQDKDKBSSA-N Ala-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 FAJIYNONGXEXAI-CQDKDKBSSA-N 0.000 description 1
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 1
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 1
- XCZXVTHYGSMQGH-NAKRPEOUSA-N Ala-Ile-Met Chemical compound C[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C([O-])=O XCZXVTHYGSMQGH-NAKRPEOUSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 1
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- XAXHGSOBFPIRFG-LSJOCFKGSA-N Ala-Pro-His Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XAXHGSOBFPIRFG-LSJOCFKGSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- 241000190857 Allochromatium vinosum Species 0.000 description 1
- 108700021822 Arabidopsis oleosin Proteins 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 1
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 1
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 1
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 1
- NIUDXSFNLBIWOB-DCAQKATOSA-N Arg-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NIUDXSFNLBIWOB-DCAQKATOSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 1
- DRDWXKWUSIKKOB-PJODQICGSA-N Arg-Trp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O DRDWXKWUSIKKOB-PJODQICGSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- GMCOADLDNLGOFE-ZLUOBGJFSA-N Asn-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N GMCOADLDNLGOFE-ZLUOBGJFSA-N 0.000 description 1
- CUQUEHYSSFETRD-ACZMJKKPSA-N Asn-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N CUQUEHYSSFETRD-ACZMJKKPSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- QRHYAUYXBVVDSB-LKXGYXEUSA-N Asn-Cys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QRHYAUYXBVVDSB-LKXGYXEUSA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- QYXNFROWLZPWPC-FXQIFTODSA-N Asn-Glu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QYXNFROWLZPWPC-FXQIFTODSA-N 0.000 description 1
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- YGHCVNQOZZMHRZ-DJFWLOJKSA-N Asn-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N YGHCVNQOZZMHRZ-DJFWLOJKSA-N 0.000 description 1
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- GFGUPLIETCNQGF-DCAQKATOSA-N Asn-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O GFGUPLIETCNQGF-DCAQKATOSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- LUJQEUOZJUWRRX-BPUTZDHNSA-N Asn-Trp-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(O)=O LUJQEUOZJUWRRX-BPUTZDHNSA-N 0.000 description 1
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- WJHYGGVCWREQMO-GHCJXIJMSA-N Asp-Cys-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WJHYGGVCWREQMO-GHCJXIJMSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 1
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 1
- YRZIYQGXTSBRLT-AVGNSLFASA-N Asp-Phe-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YRZIYQGXTSBRLT-AVGNSLFASA-N 0.000 description 1
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 1
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- 241000133676 Aspergillus latus Species 0.000 description 1
- 241000589151 Azotobacter Species 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 108010077805 Bacterial Proteins Proteins 0.000 description 1
- 108010018763 Biotin carboxylase Proteins 0.000 description 1
- 235000011293 Brassica napus Nutrition 0.000 description 1
- 241000222178 Candida tropicalis Species 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- 102000005870 Coenzyme A Ligases Human genes 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 229920000742 Cotton Polymers 0.000 description 1
- 244000168525 Croton tiglium Species 0.000 description 1
- 101710190853 Cruciferin Proteins 0.000 description 1
- 244000024469 Cucumis prophetarum Species 0.000 description 1
- 240000008067 Cucumis sativus Species 0.000 description 1
- 235000010799 Cucumis sativus var sativus Nutrition 0.000 description 1
- VCIIDXDOPGHMDQ-WDSKDSINSA-N Cys-Gly-Gln Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VCIIDXDOPGHMDQ-WDSKDSINSA-N 0.000 description 1
- YKKHFPGOZXQAGK-QWRGUYRKSA-N Cys-Gly-Tyr Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YKKHFPGOZXQAGK-QWRGUYRKSA-N 0.000 description 1
- DVIHGGUODLILFN-GHCJXIJMSA-N Cys-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DVIHGGUODLILFN-GHCJXIJMSA-N 0.000 description 1
- KXUKWRVYDYIPSQ-CIUDSAMLSA-N Cys-Leu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUKWRVYDYIPSQ-CIUDSAMLSA-N 0.000 description 1
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 1
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 1
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 206010011878 Deafness Diseases 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 241000701484 Figwort mosaic virus Species 0.000 description 1
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 1
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 1
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- NXPXQIZKDOXIHH-JSGCOSHPSA-N Gln-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N NXPXQIZKDOXIHH-JSGCOSHPSA-N 0.000 description 1
- SBHVGKBYOQKAEA-SDDRHHMPSA-N Gln-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SBHVGKBYOQKAEA-SDDRHHMPSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 1
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 1
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 1
- DQLVHRFFBQOWFL-JYJNAYRXSA-N Gln-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)O DQLVHRFFBQOWFL-JYJNAYRXSA-N 0.000 description 1
- MFORDNZDKAVNSR-SRVKXCTJSA-N Gln-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O MFORDNZDKAVNSR-SRVKXCTJSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 1
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- FQFWFZWOHOEVMZ-IHRRRGAJSA-N Glu-Phe-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FQFWFZWOHOEVMZ-IHRRRGAJSA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- 108010068370 Glutens Proteins 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 1
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 1
- CQIIXEHDSZUSAG-QWRGUYRKSA-N Gly-His-His Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 CQIIXEHDSZUSAG-QWRGUYRKSA-N 0.000 description 1
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 1
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- RUDRIZRGOLQSMX-IUCAKERBSA-N Gly-Met-Met Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O RUDRIZRGOLQSMX-IUCAKERBSA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 1
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 1
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 1
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 1
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- XINDHUAGVGCNSF-QSFUFRPTSA-N His-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XINDHUAGVGCNSF-QSFUFRPTSA-N 0.000 description 1
- VCDNHBNNPCDBKV-DLOVCJGASA-N His-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VCDNHBNNPCDBKV-DLOVCJGASA-N 0.000 description 1
- CIWILNZNBPIHEU-DCAQKATOSA-N His-Arg-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O CIWILNZNBPIHEU-DCAQKATOSA-N 0.000 description 1
- JFFAPRNXXLRINI-NHCYSSNCSA-N His-Asp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JFFAPRNXXLRINI-NHCYSSNCSA-N 0.000 description 1
- MFQVZYSPCIZFMR-MGHWNKPDSA-N His-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N MFQVZYSPCIZFMR-MGHWNKPDSA-N 0.000 description 1
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 1
- YXASFUBDSDAXQD-UWVGGRQHSA-N His-Met-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O YXASFUBDSDAXQD-UWVGGRQHSA-N 0.000 description 1
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 1
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 description 1
- PZUZIHRPOVVHOT-KBPBESRZSA-N His-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CN=CN1 PZUZIHRPOVVHOT-KBPBESRZSA-N 0.000 description 1
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 1
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 1
- 101001083553 Homo sapiens Hydroxyacyl-coenzyme A dehydrogenase, mitochondrial Proteins 0.000 description 1
- 102100030358 Hydroxyacyl-coenzyme A dehydrogenase, mitochondrial Human genes 0.000 description 1
- 206010020649 Hyperkeratosis Diseases 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 1
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- OVPYIUNCVSOVNF-KQXIARHKSA-N Ile-Gln-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N OVPYIUNCVSOVNF-KQXIARHKSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 1
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 1
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 1
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 1
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- FTUZWJVSNZMLPI-RVMXOQNASA-N Ile-Met-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N FTUZWJVSNZMLPI-RVMXOQNASA-N 0.000 description 1
- BKPPWVSPSIUXHZ-OSUNSFLBSA-N Ile-Met-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N BKPPWVSPSIUXHZ-OSUNSFLBSA-N 0.000 description 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
- BJECXJHLUJXPJQ-PYJNHQTQSA-N Ile-Pro-His Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N BJECXJHLUJXPJQ-PYJNHQTQSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 1
- ZSESFIFAYQEKRD-CYDGBPFRSA-N Ile-Val-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N ZSESFIFAYQEKRD-CYDGBPFRSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 239000007836 KH2PO4 Substances 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 1
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 1
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- 108010011449 Long-chain-fatty-acid-CoA ligase Proteins 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 1
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- GZGWILAQHOVXTD-DCAQKATOSA-N Lys-Met-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O GZGWILAQHOVXTD-DCAQKATOSA-N 0.000 description 1
- DAHQKYYIXPBESV-UWVGGRQHSA-N Lys-Met-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O DAHQKYYIXPBESV-UWVGGRQHSA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 1
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 1
- ZMYHJISLFYTQGK-FXQIFTODSA-N Met-Asp-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMYHJISLFYTQGK-FXQIFTODSA-N 0.000 description 1
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- XPCLRYNQMZOOFB-ULQDDVLXSA-N Met-His-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N XPCLRYNQMZOOFB-ULQDDVLXSA-N 0.000 description 1
- HWROAFGWPQUPTE-OSUNSFLBSA-N Met-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCSC)N HWROAFGWPQUPTE-OSUNSFLBSA-N 0.000 description 1
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 1
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 1
- MSSJHBAKDDIRMJ-SRVKXCTJSA-N Met-Lys-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MSSJHBAKDDIRMJ-SRVKXCTJSA-N 0.000 description 1
- WUYLWZRHRLLEGB-AVGNSLFASA-N Met-Met-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O WUYLWZRHRLLEGB-AVGNSLFASA-N 0.000 description 1
- WNJXJJSGUXAIQU-UFYCRDLUSA-N Met-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 WNJXJJSGUXAIQU-UFYCRDLUSA-N 0.000 description 1
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 1
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 1
- FNYBIOGBMWFQRJ-SRVKXCTJSA-N Met-Pro-Met Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N FNYBIOGBMWFQRJ-SRVKXCTJSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 1
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 1
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 1
- 229920001736 Metabolix Polymers 0.000 description 1
- 241000589308 Methylobacterium extorquens Species 0.000 description 1
- 102000002568 Multienzyme Complexes Human genes 0.000 description 1
- 108010093369 Multienzyme Complexes Proteins 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 230000010718 Oxidation Activity Effects 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 241000589597 Paracoccus denitrificans Species 0.000 description 1
- 108010085186 Peroxisomal Targeting Signals Proteins 0.000 description 1
- 240000009164 Petroselinum crispum Species 0.000 description 1
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 1
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 1
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 1
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 1
- MMYUOSCXBJFUNV-QWRGUYRKSA-N Phe-Gly-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N MMYUOSCXBJFUNV-QWRGUYRKSA-N 0.000 description 1
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 1
- OVJMCXAPGFDGMG-HKUYNNGSSA-N Phe-Gly-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OVJMCXAPGFDGMG-HKUYNNGSSA-N 0.000 description 1
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 1
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 1
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 1
- PTLMYJOMJLTMCB-KKUMJFAQSA-N Phe-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N PTLMYJOMJLTMCB-KKUMJFAQSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 1
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 1
- QTDBZORPVYTRJU-KKXDTOCCSA-N Phe-Tyr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O QTDBZORPVYTRJU-KKXDTOCCSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- 244000010815 Phlomis lychnitis Species 0.000 description 1
- IAJOBQBIJHVGMQ-UHFFFAOYSA-N Phosphinothricin Natural products CP(O)(=O)CCC(N)C(O)=O IAJOBQBIJHVGMQ-UHFFFAOYSA-N 0.000 description 1
- 108700001094 Plant Genes Proteins 0.000 description 1
- 108010064851 Plant Proteins Proteins 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 1
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 1
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- BFXZQMWKTYWGCF-PYJNHQTQSA-N Pro-His-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BFXZQMWKTYWGCF-PYJNHQTQSA-N 0.000 description 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 1
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 1
- HATVCTYBNCNMAA-AVGNSLFASA-N Pro-Leu-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O HATVCTYBNCNMAA-AVGNSLFASA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- KLOQCCRTPHPIFN-DCAQKATOSA-N Pro-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 KLOQCCRTPHPIFN-DCAQKATOSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 1
- BVRBCQBUNGAWFP-KKUMJFAQSA-N Pro-Tyr-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O BVRBCQBUNGAWFP-KKUMJFAQSA-N 0.000 description 1
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- 102000015176 Proton-Translocating ATPases Human genes 0.000 description 1
- 108010039518 Proton-Translocating ATPases Proteins 0.000 description 1
- 241000927377 Pseudomonas acidophila Species 0.000 description 1
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 1
- 101100119698 Pseudomonas fragi fadB gene Proteins 0.000 description 1
- 101100463818 Pseudomonas oleovorans phaC1 gene Proteins 0.000 description 1
- 241000589774 Pseudomonas sp. Species 0.000 description 1
- 108091007187 Reductases Proteins 0.000 description 1
- 241001148115 Rhizobium etli Species 0.000 description 1
- 101100297400 Rhizobium meliloti (strain 1021) phaAB gene Proteins 0.000 description 1
- 241000191043 Rhodobacter sphaeroides Species 0.000 description 1
- 241000190967 Rhodospirillum Species 0.000 description 1
- 108010003581 Ribulose-bisphosphate carboxylase Proteins 0.000 description 1
- 108010016634 Seed Storage Proteins Proteins 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- WXWDPFVKQRVJBJ-CIUDSAMLSA-N Ser-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N WXWDPFVKQRVJBJ-CIUDSAMLSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 1
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- ZFVFHHZBCVNLGD-GUBZILKMSA-N Ser-His-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFVFHHZBCVNLGD-GUBZILKMSA-N 0.000 description 1
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 1
- CJINPXGSKSZQNE-KBIXCLLPSA-N Ser-Ile-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O CJINPXGSKSZQNE-KBIXCLLPSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 1
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- 241000589196 Sinorhizobium meliloti Species 0.000 description 1
- 244000300264 Spinacia oleracea Species 0.000 description 1
- 235000009337 Spinacia oleracea Nutrition 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 108010039811 Starch synthase Proteins 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 241000192581 Synechocystis sp. Species 0.000 description 1
- 102000005488 Thioesterase Human genes 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 1
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 1
- VOGXLRKCWFLJBY-HSHDSVGOSA-N Thr-Arg-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VOGXLRKCWFLJBY-HSHDSVGOSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- YAAPRMFURSENOZ-KATARQTJSA-N Thr-Cys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O YAAPRMFURSENOZ-KATARQTJSA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- PKUJMYZNJMRHEZ-XIRDDKMYSA-N Trp-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKUJMYZNJMRHEZ-XIRDDKMYSA-N 0.000 description 1
- MHCLIYHJRXZBGJ-AAEUAGOBSA-N Trp-Gly-Cys Chemical compound N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)NCC(=O)N[C@@H](CS)C(=O)O MHCLIYHJRXZBGJ-AAEUAGOBSA-N 0.000 description 1
- YYXIWHBHTARPOG-HJXMPXNTSA-N Trp-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YYXIWHBHTARPOG-HJXMPXNTSA-N 0.000 description 1
- UPNRACRNHISCAF-SZMVWBNQSA-N Trp-Lys-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 UPNRACRNHISCAF-SZMVWBNQSA-N 0.000 description 1
- VCGOTJGGBXEBFO-FDARSICLSA-N Trp-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VCGOTJGGBXEBFO-FDARSICLSA-N 0.000 description 1
- VNRTXOUAOUZCFW-WDSOQIARSA-N Trp-Val-His Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O VNRTXOUAOUZCFW-WDSOQIARSA-N 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 1
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 1
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 1
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 1
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 1
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 1
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 1
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 1
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 1
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 1
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 1
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 1
- SMKXLHVZIFKQRB-GUBZILKMSA-N Val-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N SMKXLHVZIFKQRB-GUBZILKMSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 1
- VDPRBUOZLIFUIM-GUBZILKMSA-N Val-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N VDPRBUOZLIFUIM-GUBZILKMSA-N 0.000 description 1
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 1
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- OUUBKKIJQIAPRI-LAEOZQHASA-N Val-Gln-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OUUBKKIJQIAPRI-LAEOZQHASA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- HLBHFAWNMAQGNO-AVGNSLFASA-N Val-His-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N HLBHFAWNMAQGNO-AVGNSLFASA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- WDIWOIRFNMLNKO-ULQDDVLXSA-N Val-Leu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WDIWOIRFNMLNKO-ULQDDVLXSA-N 0.000 description 1
- JKHXYJKMNSSFFL-IUCAKERBSA-N Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN JKHXYJKMNSSFFL-IUCAKERBSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 1
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 1
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 1
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 102100023870 YLP motif-containing protein 1 Human genes 0.000 description 1
- 101100012514 Yersinia enterocolitica serotype O:8 / biotype 1B (strain NCTC 13174 / 8081) fadJ gene Proteins 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 229940100228 acetyl coenzyme a Drugs 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 108010028939 alanyl-alanyl-lysyl-alanine Proteins 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 230000003466 anti-cipated effect Effects 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 101150103518 bar gene Proteins 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 230000006652 catabolic pathway Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 210000003763 chloroplast Anatomy 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- MYSWGUAQZAJSOK-UHFFFAOYSA-N ciprofloxacin Chemical compound C12=CC(N3CCNCC3)=C(F)C=C2C(=O)C(C(=O)O)=CN1C1CC1 MYSWGUAQZAJSOK-UHFFFAOYSA-N 0.000 description 1
- 239000005516 coenzyme A Substances 0.000 description 1
- 229940093530 coenzyme a Drugs 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000009833 condensation Methods 0.000 description 1
- 230000005494 condensation Effects 0.000 description 1
- 239000013068 control sample Substances 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 102000046543 delta(3)-delta(2)-enoyl-CoA isomerase activity proteins Human genes 0.000 description 1
- 108700011235 delta(3)-delta(2)-enoyl-CoA isomerase activity proteins Proteins 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000004136 fatty acid synthesis Effects 0.000 description 1
- 239000010685 fatty oil Substances 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 239000011888 foil Substances 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 239000012737 fresh medium Substances 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 238000012268 genome sequencing Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- IAJOBQBIJHVGMQ-BYPYZUCNSA-N glufosinate-P Chemical compound CP(O)(=O)CC[C@H](N)C(O)=O IAJOBQBIJHVGMQ-BYPYZUCNSA-N 0.000 description 1
- IAJOBQBIJHVGMQ-BYPYZUCNSA-M glufosinate-P zwitterion(1-) Chemical compound CP([O-])(=O)CC[C@H](N)C(O)=O IAJOBQBIJHVGMQ-BYPYZUCNSA-M 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- KYYWBEYKBLQSFW-UHFFFAOYSA-N hexadecanoic acid Chemical compound CCCCCCCCCCCCCCCC(O)=O.CCCCCCCCCCCCCCCC(O)=O KYYWBEYKBLQSFW-UHFFFAOYSA-N 0.000 description 1
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 229920001519 homopolymer Polymers 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000010952 in-situ formation Methods 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 1
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 108010043322 lysyl-tryptophyl-alpha-lysine Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 108010083942 mannopine synthase Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 229940127554 medical product Drugs 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- 210000004897 n-terminal region Anatomy 0.000 description 1
- 239000006272 natural pesticide Substances 0.000 description 1
- 238000011587 new zealand white rabbit Methods 0.000 description 1
- 108010058731 nopaline synthase Proteins 0.000 description 1
- RQFLGKYCYMMRMC-UHFFFAOYSA-N octadecanoic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O.CCCCCCCCCCCCCCCCCC(O)=O RQFLGKYCYMMRMC-UHFFFAOYSA-N 0.000 description 1
- TVMXDCGIABBOFY-UHFFFAOYSA-N octane Chemical compound CCCCCCCC TVMXDCGIABBOFY-UHFFFAOYSA-N 0.000 description 1
- 239000012074 organic phase Substances 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 235000011197 perejil Nutrition 0.000 description 1
- 101150046540 phaA gene Proteins 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 210000000745 plant chromosome Anatomy 0.000 description 1
- 230000037039 plant physiology Effects 0.000 description 1
- 235000021118 plant-derived protein Nutrition 0.000 description 1
- 230000008119 pollen development Effects 0.000 description 1
- 108010078304 poly-beta-hydroxybutyrate polymerase Proteins 0.000 description 1
- 229920000728 polyester Polymers 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 1
- 238000010248 power generation Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- QAQREVBBADEHPA-IEXPHMLFSA-N propionyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC)O[C@H]1N1C2=NC=NC(N)=C2N=C1 QAQREVBBADEHPA-IEXPHMLFSA-N 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 229920001169 thermoplastic Polymers 0.000 description 1
- 239000004416 thermosoftening plastic Substances 0.000 description 1
- 108020002982 thioesterase Proteins 0.000 description 1
- 238000007079 thiolysis reaction Methods 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 150000003626 triacylglycerols Chemical class 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 239000007160 ty medium Substances 0.000 description 1
- 108010012567 tyrosyl-glycyl-glycyl-phenylalanyl Proteins 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 238000003260 vortexing Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8287—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for fertility modification, e.g. apomixis
- C12N15/8289—Male sterility
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
- C12N15/8247—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine involving modified lipid metabolism, e.g. seed oil composition
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Chemical & Material Sciences (AREA)
- Molecular Biology (AREA)
- Wood Science & Technology (AREA)
- Organic Chemistry (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Cell Biology (AREA)
- Nutrition Science (AREA)
- Oil, Petroleum & Natural Gas (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
Methods and systems to modify fatty acid biosynthesis and oxidation in plants to make new polymers are provided. Two enzymes are essential: a hydratase such as D-specific enoyl-CoA hydratase, for example, the hydratase obtained from Aeromonas caviae, and a .beta.-oxidation enzyme system. Some plants have a .beta.-oxidation enzyme system which is sufficient to modify polymer synthesis when the plants are engineered to express the hydratase. Examples demonstrate production of polymer by expression of these enzymes in transgenic plants.
Examples also demonstrate that modifications in fatty acid biosynthesis can be used to alter plant phenotypes, decreasing or eliminating seed production and increasing green plant biomass, as well as producing polyhydroxyalkanoates.
Examples also demonstrate that modifications in fatty acid biosynthesis can be used to alter plant phenotypes, decreasing or eliminating seed production and increasing green plant biomass, as well as producing polyhydroxyalkanoates.
Description
MODIFICATION OF FATTY ACID METABOLISM IN PLANTS
Background Of The Invention The present invention is generally in the field of transgenic plant systems for the production of polyhydroxyalkanoate materials, modification of triglycerides and fatty acids, and methods for altering seed production in plants.
Methods for producing stable transgenic plants for agronomic crops have been developed over the last 1 S years. Crops have been genetically modified for improvements in both input and output traits. In the former traits, tolerance to specific agrochemicals has been engineered into crops, and specific natural pesticides, such as the Bacillus thuringenesis toxin, have been expressed directly in the plant. There also has been significant progress in developing male sterility systems for the production of hybrid plants.
I 5 With respect to output traits, crops are being modified to increase the value of the product, generally the seed, grain, or fiber of the plant. Critical metabolic targets include the modification of starch, fatty acid, and oil biosynthetic pathways.
There is considerable commercial interest in producing microbial polyhydroxyalkanoate (PHA) biopolymers in plant crops. See, for example, U.S. Patent Nos. 5,245,023 and 5,250,430 to Peoples and Sinskey; U.S.
Patent No. 5,502,273 to Bright et al.; U.S. Patent No. 5,534,432 to Peoples and Sinskey; U.S. Patent No. 5,602,321 to John; U.S. Patent No. 5,610,041 to Somerville et al.; PCT WO 91/00917; PCT~WO-~2%19'74~; PCT WO
93/02187; PCT WO 93/02194; PCT WO 94/12014; Poirier et al., Science 256:520-23 (1992); van der Leij & Witholt, Can. J. Microbiol.
41(supplement):222-38 (1995); Nawrath & Poirier, The International Symposium on Bacterial Polyhydroxyalkanoates, (Eggink et al., eds.) Davos Switzerland (August 18-23, 1996); and Williams and Peoples, CHEMTECH
26: 38-44 (1996). PHAs are natural, thermoplastic polyesters and can be processed by traditional polymer techniques for use in an enormous variety of applications, including consumer packaging, disposable diaper linings and garbage bags, food and medical products.
Early studies on the production of polyhydroxybutyrate in the chloroplasts of the experimental plant system Arabidopsis thaliana resulted in the accumulation of up to 14% of the leaf dry weight as PHB (Nawrath et al., 1993). Arabidopsis, however, has no agronomic value. Moreover, in order to economically produce PHAs in agronomic crops, it is desirable to produce the PHAs in the seeds, so that the current infrastructure for harvesting and processing seeds can be utilized. The options for recovery of the PHAs from plant seeds (PCT WO 97/15681) and the end use applications (Williams & Peoples, CHEMTECH 26:38-44 (1996)) are significantly ai~ected by the polymer composition. Therefore, it would be advantageous to develop transgenic plant systems that produce PHA polymers having a well-defined composition.
Careful selection of the PHA biosynthetic enzymes on the basis of their substrate specificity allows for the production of PHA polymers of defined composition in transgenic systems (U.S. Patent Nos. 5,229,279;
5,245,023; 5,250,430; 5,480,794; 5,512,669; 5,534,432; 5,661,026; and 5,663,063).
In bacteria, each PHA group is produced by a specific pathway. In the case of the short pendant group PHAs, three enzymes are involved: (3-ketothiolase, acetoacetyl-CoA reductase, and PHA synthase. The homopolymer PHB, for example, is produced by the condensation of two molecules of acetyl-coenzyme A to give acetoacetyl-coenzyme A. The latter then is reduced to the chiral intermediate R-3-hydroxybutyryl-coenzyme A
by the reductase, and subsequently polymerized by the PHA synthase enzyme. The PHA synthase notably has a relatively wide substrate specificity which allows it to polymerize C3-CS hydroxy acid monomers including both 4-hydroxy and 5-hydroxy acid units. This biosynthetic pathway is found in a number of bacteria such as Alcaligenes eutrophus, A.
latus, AZOtobacter vinlandii, and Zoogloea ramigera. Long pendant group PHAs are produced for example by many different Pseudomonas bacteria.
Their biosynthesis involve the (3-oxidation of fatty acids and fatty acid synthesis as routes to the hydroxyacyl-coenzyme A monomeric units. The latter then are converted by PHA syntheses which have substrate specificities favoring the larger C6-C 14 monomeric units (Peoples & Sinskey, 1990).
In the case of the PHB-co-HX copolymers which usually are produced from cells grown on fatty acids, a combination of these routes can be responsible for the formation of the different monomeric units. Indeed, analysis of the DNA locus encoding the PHA synthase gene in Aeromonas caviae, which produces the copolymer PHB-co-3-hydroxyhexanoate, was used to identify a gene encoding a D-specific enoyl-CoA hydratase responsible for the production of the D-~i-hydroxybutyryl-CoA and D-p -hydroxyhexanoyl-CoA units (Fukui & Doi, J. Bacteriol. 179:4821-30 (1997); Fukui et. al., J. Bacteriol. 180:667-73 (1998)). Other sources of such hydratase genes and enzymes include Alcaligenes, Pseudomonas, and Rhodospirillum.
The enzymes PHA synthase, acetoacetyl-CoA reductase, and (3-ketothiolase, which produce the short pendant group PHAs in A. eutrophus, are coded by an operon comprising the phbC phbA phbB genes; Peoples et al., 1987; Peoples & Sinskey, 1989). In the Pseudomonas organisms, the PHA syntheses responsible for production of the long pendant group PHAs have been found to be encoded on the pha locus, specifically by the phaA
and phaC genes (LJ.S. Patent Nos. 5,245,023 and 5,250,430; Huisman et. al., J. Biol. Chem. 266:2191-98 (1991)). Since these earlier studies, a range of PHA biosynthetic genes have been isolated and characterized or identified from genome sequencing projects. Examples of known PHA biosynthetic genes are disclosed in the following references: Aeronomas caviae (Fukui &
Doi, 1997, J. Bacteriol. 179:4821-30); Alcadigenes eutrophus (U.S. Patent Nos. 5,245,023; 5,250,430; 5,512,669; and 5,661,026; Peoples & Sinskey, J.
Biol. Chem. 264:15298-03 (1989)); Acinetobacter (Schembri et. al., FEMS
Microbiol. Lett. 118:145-52 (1994)); Chromatium vinosum (Liebergesell &
Steinbuchel, Eur. J. Biochem. 209:135-50 (1992)); Methylobacterium extorquens (Valentin & Steinbuchel, Appl. Microbiol. Biotechnol. 39:309-17 (1993)); Nocardia corallina (GENBANK Accession No. AF019964; Hall et.
al., 1998, Can. J. Microbiol. 44:687-69); Paracoccus denitrificans (LJeda et al., J. Bacteriol. 178:774-79 {1996}; Yabutani et. al., FEMSMicrobiol. Lett.
133:85-90 (1995)); Pseudomonas acidophila (LJmeda et. al., 1998, Applied Biochemistry and Biotechnology, 70-72:341-52); Pseudomonas sp. 61-3 (Matsusaki et al., 1998, J. Bacteriol. 180:6459-67); Nocardia corallina;
Pseudomonas aeruginosa (Timm & Steinbuchel, Eur. J. Biochem. 209:15-30 (1992)); P. oleovorans (U.S. Patent Nos. 5,245,023 and 5,250,430; Huisman et. al., J. Biol. Chem. 266 4 :2191-98 (1991); Rhizobium etli (Cevallos et.
al., J. Bacteriol. 178:1646-54 (1996)); R. meliloti (Tombolini et. al., Microbiology 141:2553-59 (1995)); Rhodococcus ruber (Pieper-Furst &
Steinbuchel, FEMS Microbiol. Lett. 75:73-79 (1992)); Rhodospirillum rubrum (Hustede et. al., FEMSMicrobiol. Lett 93:285-90 (1992));
Rhodobacter sphaeroides (Hustede et. al., FEMS Microbiol. Rev. 9:217-30 (1992); Biotechnol. Lett. 15:709-14 (1993); Synechocystis sp. (DNA Res.
Background Of The Invention The present invention is generally in the field of transgenic plant systems for the production of polyhydroxyalkanoate materials, modification of triglycerides and fatty acids, and methods for altering seed production in plants.
Methods for producing stable transgenic plants for agronomic crops have been developed over the last 1 S years. Crops have been genetically modified for improvements in both input and output traits. In the former traits, tolerance to specific agrochemicals has been engineered into crops, and specific natural pesticides, such as the Bacillus thuringenesis toxin, have been expressed directly in the plant. There also has been significant progress in developing male sterility systems for the production of hybrid plants.
I 5 With respect to output traits, crops are being modified to increase the value of the product, generally the seed, grain, or fiber of the plant. Critical metabolic targets include the modification of starch, fatty acid, and oil biosynthetic pathways.
There is considerable commercial interest in producing microbial polyhydroxyalkanoate (PHA) biopolymers in plant crops. See, for example, U.S. Patent Nos. 5,245,023 and 5,250,430 to Peoples and Sinskey; U.S.
Patent No. 5,502,273 to Bright et al.; U.S. Patent No. 5,534,432 to Peoples and Sinskey; U.S. Patent No. 5,602,321 to John; U.S. Patent No. 5,610,041 to Somerville et al.; PCT WO 91/00917; PCT~WO-~2%19'74~; PCT WO
93/02187; PCT WO 93/02194; PCT WO 94/12014; Poirier et al., Science 256:520-23 (1992); van der Leij & Witholt, Can. J. Microbiol.
41(supplement):222-38 (1995); Nawrath & Poirier, The International Symposium on Bacterial Polyhydroxyalkanoates, (Eggink et al., eds.) Davos Switzerland (August 18-23, 1996); and Williams and Peoples, CHEMTECH
26: 38-44 (1996). PHAs are natural, thermoplastic polyesters and can be processed by traditional polymer techniques for use in an enormous variety of applications, including consumer packaging, disposable diaper linings and garbage bags, food and medical products.
Early studies on the production of polyhydroxybutyrate in the chloroplasts of the experimental plant system Arabidopsis thaliana resulted in the accumulation of up to 14% of the leaf dry weight as PHB (Nawrath et al., 1993). Arabidopsis, however, has no agronomic value. Moreover, in order to economically produce PHAs in agronomic crops, it is desirable to produce the PHAs in the seeds, so that the current infrastructure for harvesting and processing seeds can be utilized. The options for recovery of the PHAs from plant seeds (PCT WO 97/15681) and the end use applications (Williams & Peoples, CHEMTECH 26:38-44 (1996)) are significantly ai~ected by the polymer composition. Therefore, it would be advantageous to develop transgenic plant systems that produce PHA polymers having a well-defined composition.
Careful selection of the PHA biosynthetic enzymes on the basis of their substrate specificity allows for the production of PHA polymers of defined composition in transgenic systems (U.S. Patent Nos. 5,229,279;
5,245,023; 5,250,430; 5,480,794; 5,512,669; 5,534,432; 5,661,026; and 5,663,063).
In bacteria, each PHA group is produced by a specific pathway. In the case of the short pendant group PHAs, three enzymes are involved: (3-ketothiolase, acetoacetyl-CoA reductase, and PHA synthase. The homopolymer PHB, for example, is produced by the condensation of two molecules of acetyl-coenzyme A to give acetoacetyl-coenzyme A. The latter then is reduced to the chiral intermediate R-3-hydroxybutyryl-coenzyme A
by the reductase, and subsequently polymerized by the PHA synthase enzyme. The PHA synthase notably has a relatively wide substrate specificity which allows it to polymerize C3-CS hydroxy acid monomers including both 4-hydroxy and 5-hydroxy acid units. This biosynthetic pathway is found in a number of bacteria such as Alcaligenes eutrophus, A.
latus, AZOtobacter vinlandii, and Zoogloea ramigera. Long pendant group PHAs are produced for example by many different Pseudomonas bacteria.
Their biosynthesis involve the (3-oxidation of fatty acids and fatty acid synthesis as routes to the hydroxyacyl-coenzyme A monomeric units. The latter then are converted by PHA syntheses which have substrate specificities favoring the larger C6-C 14 monomeric units (Peoples & Sinskey, 1990).
In the case of the PHB-co-HX copolymers which usually are produced from cells grown on fatty acids, a combination of these routes can be responsible for the formation of the different monomeric units. Indeed, analysis of the DNA locus encoding the PHA synthase gene in Aeromonas caviae, which produces the copolymer PHB-co-3-hydroxyhexanoate, was used to identify a gene encoding a D-specific enoyl-CoA hydratase responsible for the production of the D-~i-hydroxybutyryl-CoA and D-p -hydroxyhexanoyl-CoA units (Fukui & Doi, J. Bacteriol. 179:4821-30 (1997); Fukui et. al., J. Bacteriol. 180:667-73 (1998)). Other sources of such hydratase genes and enzymes include Alcaligenes, Pseudomonas, and Rhodospirillum.
The enzymes PHA synthase, acetoacetyl-CoA reductase, and (3-ketothiolase, which produce the short pendant group PHAs in A. eutrophus, are coded by an operon comprising the phbC phbA phbB genes; Peoples et al., 1987; Peoples & Sinskey, 1989). In the Pseudomonas organisms, the PHA syntheses responsible for production of the long pendant group PHAs have been found to be encoded on the pha locus, specifically by the phaA
and phaC genes (LJ.S. Patent Nos. 5,245,023 and 5,250,430; Huisman et. al., J. Biol. Chem. 266:2191-98 (1991)). Since these earlier studies, a range of PHA biosynthetic genes have been isolated and characterized or identified from genome sequencing projects. Examples of known PHA biosynthetic genes are disclosed in the following references: Aeronomas caviae (Fukui &
Doi, 1997, J. Bacteriol. 179:4821-30); Alcadigenes eutrophus (U.S. Patent Nos. 5,245,023; 5,250,430; 5,512,669; and 5,661,026; Peoples & Sinskey, J.
Biol. Chem. 264:15298-03 (1989)); Acinetobacter (Schembri et. al., FEMS
Microbiol. Lett. 118:145-52 (1994)); Chromatium vinosum (Liebergesell &
Steinbuchel, Eur. J. Biochem. 209:135-50 (1992)); Methylobacterium extorquens (Valentin & Steinbuchel, Appl. Microbiol. Biotechnol. 39:309-17 (1993)); Nocardia corallina (GENBANK Accession No. AF019964; Hall et.
al., 1998, Can. J. Microbiol. 44:687-69); Paracoccus denitrificans (LJeda et al., J. Bacteriol. 178:774-79 {1996}; Yabutani et. al., FEMSMicrobiol. Lett.
133:85-90 (1995)); Pseudomonas acidophila (LJmeda et. al., 1998, Applied Biochemistry and Biotechnology, 70-72:341-52); Pseudomonas sp. 61-3 (Matsusaki et al., 1998, J. Bacteriol. 180:6459-67); Nocardia corallina;
Pseudomonas aeruginosa (Timm & Steinbuchel, Eur. J. Biochem. 209:15-30 (1992)); P. oleovorans (U.S. Patent Nos. 5,245,023 and 5,250,430; Huisman et. al., J. Biol. Chem. 266 4 :2191-98 (1991); Rhizobium etli (Cevallos et.
al., J. Bacteriol. 178:1646-54 (1996)); R. meliloti (Tombolini et. al., Microbiology 141:2553-59 (1995)); Rhodococcus ruber (Pieper-Furst &
Steinbuchel, FEMS Microbiol. Lett. 75:73-79 (1992)); Rhodospirillum rubrum (Hustede et. al., FEMSMicrobiol. Lett 93:285-90 (1992));
Rhodobacter sphaeroides (Hustede et. al., FEMS Microbiol. Rev. 9:217-30 (1992); Biotechnol. Lett. 15:709-14 (1993); Synechocystis sp. (DNA Res.
3:109-36 (1996)); Thiocapsiae violacea (Appl. Microbiol. Biotechnol.
38:493-501 (1993)) and Zoogloea ramigera (Peoples et. al., J. Biod. Chem.
262:97-102 (1987); Peoples & Sinskey, Molecular Microbiology 3:349-57 ( 1989)). The availability of these genes or their published DNA sequences should provide a range of options for producing PHAs.
PHA synthases suitable for producing PHB-co-HH copolymers comprising from 1-99% HH monomers are encoded by the Rhodococcus ruber, Rhodospirillum rubrum, Thiocapsiae violacea, and Aeromonas caviae PHA synthase genes. PHA synthases useful for incorporating 3-hydroxyacids of 6-12 carbon atoms in addition to R-3-hydroxybutyrate i.e.
for producing biological polymers equivalent to the chemically synthesized copolymers described in PCT WO 95/20614, PCT WO 95/20615, and PCT
WO 95/20621 have been identified in a number of Pseudomonas and other bacteria (Steinbuchel & Wiese, Appl. Microbiol. Biotechnol. 37:691-97 (1992); Valentin et al., Appl. Microbiol: Biotechnol. 36:507-14 (1992);
Valentin et al., Appl. Microbiol. Biotechnol. 40:710-16 (1994); Lee et al., Appl. Microbiol. Biotechnol. 42:901-09 (1995); Kato et al., Appl. Microbiol.
38:493-501 (1993)) and Zoogloea ramigera (Peoples et. al., J. Biod. Chem.
262:97-102 (1987); Peoples & Sinskey, Molecular Microbiology 3:349-57 ( 1989)). The availability of these genes or their published DNA sequences should provide a range of options for producing PHAs.
PHA synthases suitable for producing PHB-co-HH copolymers comprising from 1-99% HH monomers are encoded by the Rhodococcus ruber, Rhodospirillum rubrum, Thiocapsiae violacea, and Aeromonas caviae PHA synthase genes. PHA synthases useful for incorporating 3-hydroxyacids of 6-12 carbon atoms in addition to R-3-hydroxybutyrate i.e.
for producing biological polymers equivalent to the chemically synthesized copolymers described in PCT WO 95/20614, PCT WO 95/20615, and PCT
WO 95/20621 have been identified in a number of Pseudomonas and other bacteria (Steinbuchel & Wiese, Appl. Microbiol. Biotechnol. 37:691-97 (1992); Valentin et al., Appl. Microbiol: Biotechnol. 36:507-14 (1992);
Valentin et al., Appl. Microbiol. Biotechnol. 40:710-16 (1994); Lee et al., Appl. Microbiol. Biotechnol. 42:901-09 (1995); Kato et al., Appl. Microbiol.
Biotechnol 45:363-70 (1996); Abe et al., Int. J. Biol. Macromol. 16:115-19 (1994); Valentin et al., Appl. Microbiol. Biotechnol. 46:261-67 (1996)) and can readily be isolated as described in U.S. Patent Nos. 5,245,023 and 5,250,430. The PHA synthase from P. oleovorans (U.S. Patent Nos.
5,245,023 and 5,250,430; Huisman et. al., J. Biol. Chem. 266(4): 2191-98 (1991)) is suitable for producing the long pendant group PHAs. Plant genes encoding (3-ketothiolase also have been identified (Vollack & Bach, Plant Physiol. 111:1097-107 (1996)).
Despite this ability to modify monomer composition by selection of 10 the syntheses and substrates, it is desirable to modify other features of polymer biosynthesis, such as that which involves fatty acid metabolism.
It is therefore an object of the present invention to provide a method and DNA constructs to introduce fatty acid oxidation enzyme systems for manipulating the cellular metabolism of plants.
15 It is another object of the present invention to provide methods for enhancing the production of PHAs in plants, preferably in the oilseeds thereof.
Summary Of The Invention 20 Methods and systems to modify fatty acid biosynthesis and oxidation in plants to make new polymers are described. Two enzymes are essential: a hydratase such as D-specific enoyl-CoA hydratase, for example, the hydratase obtained from Aeromonas caviae, and a (3-oxidation enzyme system. Some plants have a j3-oxidation enzyme system which is sufficient 25 to modify polymer synthesis when the plants are engineered to express the hydratase.
Examples demonstrate production of polymer by expression of these enzymes in transgenic plants. Examples also demonstrate that modifications in fatty acid biosynthesis can be used to alter plant phenotypes, decreasing or 30 eliminating seed production and increasing green plant biomass, as well as producing PHAs.
Brief Description of the Drawings Figure 1 is a schematic of fatty acid ~i-oxidation routes to produce polyhydroxyalkanoate monomers.
Figure 2 is a schematic showing plasmid constructs pSBS2024 and pSBS2025.
Figures 3A and 3B are schematics showing plasmid constructs pCGmf124 and pCGmf125.
Figures 4A and 4B are schematics showing plasmid constructs pmf1249 and pmf1254.
Figures SA and SB are schematics showing plasmid constructs pCGmf224 and pCGmfZ25.
Figures 6A and 6B are schematics showing plasmid constructs pCGmfIP2S and pCGmf2PlS.
Detailed Description Of The Invention Methods and DNA constrocts for manipulating the cellular metabolism of plants by introducing fatty acid oxidation enzyme systems into the cytoplasm or plastids of developing oilseeds or green tissue are provided. Fatty acid oxidation systems typically comprise several enzyme activities including a ~i-ketothiolase enzyme activity which utilizes a broad range of ~i-ketoacyl-CoA substrates.
It surprisingly was found that expression of at least one of these transgenes from the bean phaseolin promoter results in male sterility.
Interestingly, these plants did not set seed, but instead produced higher than normal levels of biomass (e.g., leafs, stems, stalks). Therefore the methods and constructs described herein also can be used to create male sterile plants, for example, for hybrid production or to increase the production of biomass of forage, such as alfalfa or tobacco. Plants generated using these methods and DNA constructs are useful for producing polyhydroxyalkanoate biopolymers or for producing novel oil compositions.
The methods described herein include the subsequent incorporation of additional transgenes, in particular encoding additional enzymes involved in fatty acid oxidation or polyhydroxyalkanoate biosynthesis. For polyhydroxyalkanoate biosynthesis, the methods include the incorporation of transgenes encoding enzymes, such as NADH and/or NADPH acetoacetyl-CoenzymeA reductases, PHB synthases, PHA synthases, acetoacetyl-CoA
thiolase, hydroxyacyl-CoA epimerases, delta3-cis-delta2-trans enoyl-CoA
isomerases, acyl-CoA dehydrogenase, acyl-CoA oxidase and enoyl-CoA
hydratases by subsequent transformation of the transgenic plants produced using the methods and DNA constructs described herein or by traditional plant breeding methods.
I. Plant Expression Systems In a preferred embodiment, the fatty acid oxidation transgenes are expressed from a seed specific promoter, and the proteins are expressed in the cytoplasm of the developing oilseed. In an alternate preferred embodiment, fatty acid oxidation transgenes are expressed from a seed specific promoter and the expressed proteins are directed to the plastids using plastid targeting signals. In another preferred embodiment, the fatty acid oxidation transgenes are expressed directly from the plastid chromosome where they have been integrated by homologous recombination. The fatty acid oxidation transgenes may also be expressed throughout the entire plant tissue from a constitutive promoter. It is also useful to be able to control the expression of these transgenes by using promoters that can be activated following the application of an agrochemical or other active ingredient to the crop in the field. Additional control of the expression of these genes encompassed by the methods described herein include the use of recombinase technologies for targeted insertion of the transgenes into specific chromosomal sites in the plant chromosome or to regulate the expression of the transgenes.
The methods described herein involve a plant seed having a genome including (a) a promoter operably linked to a first DNA sequence and a 3'-untranslated region, wherein the first DNA sequence encodes a fatty acid oxidation polypeptide and optionally (b) a promoter operably linked to a second DNA sequence and a 3'-untranslated region, wherein the second WO 99145122 PC'TNS99/04999 DNA sequence encodes a fatty acid oxidation polypeptide. Expression of the two transgenes provides the plant with a functional fatty acid f3-oxidation system having at least (3-ketothiolase, dehydrogenase and hydratase activities in the cytoplasm or plastids other than peroxisomes or glyoxisomes. The first and/or second DNA sequence may be isolated from bacteria, yeast, fungi, algae, plants, or animals. It is preferable that at least one of the DNA
sequences encodes a polypeptide with at least two, and preferably three, enzyme activities.
Transformation Vectors DNA constructs useful in the methods described herein include transformation vectors capable of introducing transgenes into plants. Several plant transformation vector options are available, including those described in "Gene Transfer to Plants" (Potrykus, et al., eds.) Springer-Verlag Berlin Heidelberg New York (1995); "Transgenic Plants: A Production System for Industrial and Pharmaceutical Proteins" (Owen, et al., eds.) John Wiiey &
Sons Ltd. England (1996); and "Methods in Plant Molecular Biology: A
Laboratory Course Manual" (Malign, et al. eds.) Cold Spring Laboratory Press, New York (1995). Plant transformation vectors generally include one or more coding sequences of interest under the transcriptional control of S' and 3' regulatory sequences, including a promoter, a transcription termination and/or polyadenylation signal, and a selectable or screenable marker gene. The usual requirements for 5' regulatory sequences include a promoter, a transcription termination and/or a polyadenylation signal. For the expression of two or more polypeptides from a single transcript, additional RNA processing signals and ribozyme sequences can be engineered into the construct (LJ.S. Patent No. 5,519,164). This approach has the advantage of locating multiple transgenes in a single locus, which is advantageous in subsequent plant breeding efforts. An additional approach is to use a vector to specifically transform the plant plastid chromosome by homologous recombination (U.S. Patent No. 5,545,818), in which case it is possible to take advantage of the prokaryotic nature of the plastid genome and insert a number of transgenes as an operon.
Promoters A large number of plant promoters are known and result in either constitutive, or environmentally or developmentally regulated expression of the gene of interest. Plant promoters can be selected to control the expression of the transgene in different plant tissues or organelles for all of which methods are known to those skilled in the art (Gasser & Fraley, Science 244:1293-99 (1989)). The S' end of the transgene may be engineered to include sequences encoding plastid or other subcellular organelle targeting peptides linked in-frame with the transgene. Suitable constitutive plant promoters include the cauliflower mosaic virus 355 promoter (CaM~ and enhanced CaMV promoters (Odell et. al., Nature, 313: 810 (1985)), actin promoter (McElroy et al., Plant Cell 2:163-71 (1990)}, AdhI promoter (Fromm et. al., BiolTechnology 8:833-39 (1990);
Kyozuka et al., Mol. Gen. Genet. 228:40-48 ( 1991 )), ubiquitin promoters, the 15 Figwort mosaic virus promoter, mannopine synthase promoter, nopaline synthase promoter and octopine synthase promoter. Useful regulatable promoter systems include spinach nitrate-inducible promoter, heat shock promoters, small subunit of ribulose biphosphate carboxylase promoters and chemically inducible promoters (U.S. Patent No. 5,364,780 to Hershey et al.).
In a preferred embodiment of the methods described herein, the transgenes are expressed only in the developing seeds. Promoters suitable for this purpose include the napin gene promoter (LJ.S. Patent Nos. 5,420,034 and 5,608,152), the acetyl-CoA carboxylase promoter (LT.S. Patent Nos.
25 5,420,034 and 5,608,152), 2S albumin promoter, seed storage protein promoter, phaseolin promoter (Slightom et. al., Proc. Natl. Acad. Sci. USA
80:1897-1901 (1983)), oleosin promoter (Plant et. al., Plant Mol. Biol.
25:193-205 (1994); Rowley et al., Biochim. Biophys. Acta. 1345:1-4 (199?};
U.S. Patent No. 5,650,554; and PCT WO 93/20216), zero promoter, glutelin promoter, starch synthase promoter, and starch branching enzyme promoter.
The transformation of suitable agronomic plant hosts using these vectors can be accomplished with a variety of methods and plant tissues.
Representative plants useful in the methods disclosed herein include the Brassica family including napus, rappa, sp. carinata and juncea; maize;
soybean; cottonseed; sunflower; palm; coconut; safflower; peanut; mustards including Sinapis alba; and flax. Crops harvested as biomass, such as silage 5 corn, alfalfa, or tobacco, also are useful with the methods disclosed herein.
Representative tissues for transformation using these vectors include protoplasts, cells, callus tissue, leaf discs, pollen, and meristems.
Representative transformation procedures include Agrobacterium-mediated transformation, biolistics, microinjection, electroporation, polyethylene glycol-mediated protoplast transformation, liposome-mediated transformation, and silicon fiber-mediated transformation (U. S. Patent No.
5,464,765; "Gene Transfer to Plants" (Potrykus, et al., eds.) Springer-Verlag Berlin Heidelberg New York (1995); "Transgenic Plants: A Production System for Industrial and Pharmaceutical Proteins" (Oven, et al., eds.) John 15 Wiley & Sons Ltd. England (1996); and "Methods in Plant Molecular Biology: A Laboratory Course Manual" (Malign, et al. eds.) Cold Spring Laboratory Press, New York (1995)).
II. Methods for Making and Screening for Transgenic Plants In order to generate transgenic plants using the constructs described 20 herein, the following procedures can be used to obtain a transformed plant expressing the transgenes subsequent to transformation: select the plant cells that have been transformed on a selective medium; regenerate the plant cells that have been transformed to produce differentiated plants; select transformed plants expressing the transgene at such that the level of desired 25 polypeptide is obtained in the desired tissue and cellular location.
For the specific crops useful for practicing the described methods, transformation procedures have been established, as described for example, in "Gene Transfer to Plants" (Potrykus, et al., eds.) Springer-Verlag Berlin Heidelberg New York (1995); "Transgenic Plants: A Production System for 30 Industrial and Pharmaceutical Proteins" (Owen, et al., eds.) John Wiley &
Sons Ltd. England (1996); and "Methods in Plant Molecular Biology: A
Laboratory Course Manual" (Malign, et al. eds.) Cold Spring Laboratory Press, New York (1995).
Brassica napus can be transformed as described, for example, in U. S.
Patent Nos. 5,188,958 and 5,463,174. Other Brassica such as rappa, carinata and juncea as well as Sinapis alba can be transformed as described S by Moloney et. aL, Plant Cell Reports 8:238-42 (1989). Soybean can be transformed by a number of reported procedures (U.S. Patent Nos.
5,015,580; 5,015,944; 5,024,944; 5,322,783; 5,416,011; and 5,169,770).
Several transformation procedures have been reported for the production of transgenic maize plants including pollen transformation (U.S. Patent No.
5,629,183), silicon fiber-mediated transformation (U.S. Patent No.
5,464,765), electroporation of protoplasts (LJ.S. Patent Nos. 5,231,019;
5,472,869; and 5,384,253) gene gun (U.S. Patent Nos. 5,538,877 and 5,538,880 and Agrobacterium-mediated transformation (EP 0 604 662 Al;
PCT WO 94/00977}. The Agrobacterium-mediated procedure is particularly preferred, since single integration events of the transgene constructs are more readily obtained using this procedure, which greatly facilitates subsequent plant breeding. Cotton can be transformed by particle bombardment (U.S.
Patent Nos. 5,004,863 and 5,159,135). Sunflower can be transformed using a combination of particle bombardment and Agrobacterium infection (EP 0 486 233 A2; U.S. Patent No. 5,030,572). Flax can be transformed by either particle bombardment or Agrobacterium-mediated transformation.
Recombinase technologies include the cre-lox, FLP/FRT, and Gin systems.
Methods for utilizing these technologies are described for example in U.S.
Patent No. 5,527,695 to Hodges et al.; Dale & Ow, Proc. Natl. Acad. Sci.
USA 88:10558-62 (1991); Medberry et. al., Nucleic AcidsRes. 23:485-90 (1995).
Selectable Marker Genes Selectable marker genes useful in practicing the methods described herein include the neomycin phosphotransferase gene nptlI (U.S. Patent Nos.
5,034,322 and 5,530,196), hygromycin resistance gene (U.S. Patent No.
5,668,298), bar gene encoding resistance to phosphinothricin (U.S. Patent No. 5,276,268). EP 0 530 129 Al describes a positive selection system which enables the transformed plants to outgrow the non-transformed lines by expressing a transgene encoding an enzyme that activates an inactive compound added to the growth media. Screenable marker genes useful in the methods herein include the (3-glucuronidase gene (Jefferson et. al., EMBO J. 6:3901-07 (1987); U.S. Patent No. 5,268,463) and native or modified green fluorescent protein gene (Cubitt et. al., Trends Biochem Sci.
20:448-55 (1995); Pang et. al., Plant Physiol. 112:893-900 (1996)). Some of these markers have the added advantage of introducing a trait, such as herbicide resistance, into the plant of interest, thereby providing an additional agronomic value on the input side.
In a preferred embodiment of the methods described herein, more than one gene product is expressed in the plant. This expression can be achieved via a number of different methods, including (1) introducing the encoding DNAs in a single transformation event where all necessary DNAs are on a single vector; (2) introducing the encoding DNAs in a co-transformation event where all necessary DNAs are on separate vectors but introduced into plant cells simultaneously; (3) introducing the encoding DNAs by independent transformation events successively into the plant cells i.e. transformation of transgenic plant cells expressing one or more of the encoding DNAs with additional DNA constructs; and (4) transformation of each of the required DNA constructs by separate transformation events, obtaining transgenic plants expressing the individual proteins and using traditional plant breeding methods to incorporate the entire pathway into a single plant.
III. ~i-Oxidation Enzyme Pathways Production of PHAs in the cytosol of plants requires the cytosolic localization of enzymes that are able to produce R-3-hydroxyacyl CoA
thioesters as substrates for PHA synthases. Both eukaryotes and prokaryotes possess a (3-oxidation pathway for fatty acid degradation that consists of a series of enzymes that convert fatty acyl CoA thioesters to acetyl CoA.
While these pathways proceed via intermediate 3-hydroxyacyl CoA, the stereochemistry of this intermediate varies among organisms. For example, the j3-oxidation pathways of bacteria and the peroxisomal pathway of higher eukaryotes degrade fatty acids to acetyl CoA via S-3-hydroxyacyl CoA
(Schultz, "Oxidation of Fatty Acids" in Biochemistry of Lipids, Lipoproteins andMembranes (Vance et al., eds.) pp. 101-06 (Elsevier, Amsterdam 1991)).
5 In Escherichia coli, an epimerase activity encoded by the ~3-oxidation multifunctional enzyme complex is capable of converting S-3-hydroxyacyl CoA to R-3-hydroxyacyl CoA. Yeast possesses a peroxisomal localized fatty acid degradation pathway that proceeds via intermediate R-3-hydroxyacyl CoA (Hiltunen, et al. J. Biol. Chem. 267: 6646-53 (1992);
10 Filppula, et al. J. Biol. Chem. 270:27453-57 (1995)), such that no epimerase activity is required to produce PHAs.
Plants, like other higher eukaryotes, possesses a [i-oxidation pathway for fatty acid degradation localized subcellularly in the peroxisomes (Gerhardt, "Catabolism of Fatty Acids [a and (3 Oxidation]" in Lipid 15 Metabolism in Plants (Moose, Jr., ed.) pp. 527-65 (CRC Press, Boca Raton, Florida 1993)). Production of PHAs in the cytosol of plants therefore necessitates the cytosolic expression of a (3-oxidation pathway, for conversion of fatty acids to R-3-hydroxyacyl CoA thioesters of the correct chain length, as well as cytosolic expression of an appropriate PHA synthase, 20 to polymerize R-3-hydroxyacyl CoA to polymer.
Fatty acids are synthesized as saturated acyl-ACP thioesters in the plastids of plants (Hartwood, "Plant Lipid Metabolism" in Plant Biochemistry (Dey et al., eds.} pp. 237-72 (Academic Press, San Diego 1997}). Prior to export from the plastid into the cytosol, the majority of fatty 25 acids are desaturated via a d9 desaturase. The pool of newly synthesized fatty acids in most oilseed crops consists predominantly of oleic acid (cis 9-octadecenoic acid), stearic acid (octadecanoic acid), and palmitic acid (hexadecanoic acid). However, some plants, such as coconut and palm kernel, synthesize shorter chain fatty acids (C8-14). The fatty acid is 30 released from ACP via a thioesterase and subsequently converted to an acyl-CoA thioester via an acyl CoA synthetase located in the plastid membrane (Andrews, et al., "Fatty acid and lipid biosynthesis and degradation" in Plant Physiology, Biochemistry, and Molecular Biology (Dennis et al., eds.) pp.
345-46 (Longman Scientific & Technical, Essex, England 1990); Harwood, "Plant Lipid Metabolism" in Plant Biochemistry (Dey et al., eds) p. 246 (Academic Press, San Diego 1997)).
The cytosolic conversion of the pool of newly synthesized acyl CoA
thioesters via fatty acid degradation pathways and the conversion of intermediates from these series of reactions to R-3-hydroxyacyl-CoA
substrates for PHA syntheses can be achieved via the enzyme reactions outlined in Figure 1. The PHA synthase substrates are C4-C16 R-3-10 hydroxyacyl CoAs. For saturated fatty acyl CoAs, conversion to R-3-hydroxyacyl CoA thioesters using fatty acids degradation pathways necessitates the following sequence of reactions: conversion of the acyl CoA
thioester to traps-2-enoyl-CoA (reaction 1 ), hydration of traps-2-enoyl-CoA
to R-3-hyddroxy acyl CoA (reaction 2a, e.g. yeast system operates through 15 this route and the Aeromonas caviae D-specific hydratase yields C4-C7 R-3-hydroxyacyl-CoAs), hydration of traps-2-enoyl-CoA to S-3-hydroxy acyl CoA (reaction 2b), and epimerization of S-3-hydroxyacyl CoA to R-3-hydroxyacyl CoA (reaction 5, e.g. cucumber tetrafunctional protein, bacterial systems). If 3-hydroxyacyl CoA is not polymerized by PHA synthase 20 forming PHA, it can proceed through the remainder of the (3-oxidation pathway as follows: oxidation of 3-hydroxyacyl CoA to form (3-keto acyl CoA (reaction 3) followed by thiolysis in the presence of CoA to yield acetyl CoA and a saturated acyl CoA thioester shorter by two carbon units (reaction 4). The acyl CoA thioester produced in reaction 4 is free to re-enter the (3-25 oxidation pathway at reaction 1 and the acetyl-CoA produced can be converted to R-3-hydroxyacyl CoA by the action of (3-ketothiolase (reaction 7) and NADH or NADPH acetoacetyl-CoA reductase (reaction 6). This latter route is useful for producing R-3-hydroxybutyryl-CoA, R-3-hydroxyvaleryl-CoA and R-3-hydroxyhexanoyl-CoA. The 8-3-30 hydroxyacids of four to sixteen carbon atoms produced by this series of enzymatic reactions can be polymerized by PHA syntheses expressed from a transgene, or transgenes in the case of the two subunit synthase enzymes, into PHA polymers.
For A9 unsaturated fatty acyl CoAs, a variation of the reaction sequences described is required. Three cycles of (3-oxidation, as detailed in Figure l, will remove six carbon units yielding an unsaturated acyl CoA
thioester with a cis double bond at position 3. Conversion of the cis double bond at position 3 to a traps double bond at position 2, catalyzed by D3-cis-~2-traps-enoyl CoA isomerase will allow the ~i-oxidation reaction sequences outlined in Figure 1 to proceed. This enzyme activity is present on the microbial (3-oxidation complexes and the plant tetrafunctional protein, but not on the yeast foxl.
Acyl CoA thioesters also can be degraded to a (3-keto acyl CoA and converted to R-3-hydroxyacyl CoA via a NADH or NADPH dependent reductase (reaction 6).
Multifunctional enzymes that encode S-specific hydratase, S-specific 1 S dehydrogenase, ~i-ketothiolase, epimerase and D3-cis-D2-traps-enoyl CoA
isomerase activities have been found in bacteria such as Escherichia coli (Spratt, et al., J. Bacteriol. 158:535-42 (1984)) and Pseudomonas fragi (Immure, et al., J. Biochem. 107:184-89 ( 1990)). The multifunctional enzyme complexes consist of two copies of each of two subunits such that catalytically active protein forms a heterotetramer. The hydratase, dehydrogenase, epimerase, and 43-cis-DZ-traps-enoyl CoA isomerase activities are located on one subunit, whereas the thiolase is located on another subunit. The genes encoding the enzymes from organisms such as E.
coli (Spratt, et al., J. Bacteriol. 158:535-42 (1984); DiRusso, J. Bacteriod.
172:6459-68 (1990)) and P. fragi (Sato, et al., J. Biochem. 111:8-15 (1992)) have been isolated and sequenced and are suitable for practicing the methods described herein. Furthermore, the E. coli enzyme system has been subjected to site-directed mutagenesis analysis to identify amino acid residues critical to the individual enzyme activities (He & Yang, Biochemistry 35:9625-30 (1996); Yang et. al., Biochemistry 34:6641-47 (1995); He & Yang, Biochemishy 36:11044-49 (1997); He et. al., Biochemistry 36:261-68 (1997); Yang & Elzinga, J. Biol. Chem. 268:6588-92 (1993)). These mutant genes also could be used in some embodiments of the methods described herein.
Mammals, such as rat, possess a trifunctional (3-oxidation enzyme in their peroxisomes that contains hydratase, dehydrogenase, and D3-cis-42-5 traps-enoyl CoA isomerase activities. The trifunctional enzyme from rat liver has been isolated and has been found to be monomeric with a molecular weight of 78 kDa (Palosaari, et al., J. Biol. Chem. 265:2446-49 (1990)).
Unlike the bacterial system, thiolase activity is not part of the multienzyme protein (Schultz, "Oxidation of Fatty Acids" in Biochemistry ojLipids, 10 Lipoproteins and Membranes (Vance et al., eds) p. 95 (Elsevier, Amsterdam (1991)). Epimerization in rat occurs by the combined activities of two distinct hydratases, one which converts R-3-hydroxyacyl CoA to traps-2-enoyl CoA, and another which converts traps-2-enoyl CoA to S-3-hydroxyacyl CoA (Smeland, et al., Biochemical and Biophysical Research 15 Communications 160:988-92 (1989)). Mammals also possess (3-oxidation pathways in their mitochondria that degrade fatty acids to acetyl CoA via intermediate S-3-hydroxyacyl CoA (Schultz, "Oxidation of Fatty Acids" in Biochemistry of Lipids, Lipoproteins and Membranes (Vance et al., eds) p.
96 (Elsevier, Amsterdam (1991)). Genes encoding mitochondria) (i-20 oxidation activities have been isolated from several animals including a Rat mitochondria) long chain acyl CoA hydratase/3-hydroxy acyl CoA
dehydrogenase (GENBANK Accession # D 16478) and a Rat mitochondria) thiolase (GENBANK Accession #s D13921, D00511).
Yeast possesses a multifunctional enzyme, Fox2, that differs from the 25 (3-oxidation complexes of bacteria and higher eukaryotes in that it proceeds via a R-3-hydroxyacyl CoA intermediate instead of S-3-hydroxyacyl CoA
(Hiltunen, et al., J. Biol. Chem. 267:6646-53 (1992)). Fox2 possesses R-specific hydratase and R-specific dehydrogenase enzyme activities. This enzyme does not possess the ~3-cis-02-traps-enoyl CoA isomerase activity 30 needed for degradation of 09-cis-hydroxyacyl CoAs to form R-3-hydroxyacyl CoAs. The gene encoding jox2 from yeast has been isolated and sequenced and encodes a 900 amino acid protein. The DNA sequence of the structural gene and amino acid sequence of the encoded polypeptide is shown in SEQ ID NO:1 and SEQ ID N0:2.
Plants have a tetrafunctional protein similar to the yeast Fox2, but also encoding a 03-cis-02-traps-enoyl CoA isomerase activity (Muller et., al., J. Biol. Chem. 269:20475-81 ( 1994)). The DNA sequence of the cDNA and amino acid sequence of the encoded polypeptide is shown in SEQ ID N0:3 and SEQ ID N0:4.
IV. Targeting of Enzymes to the Cytoplasm of Oil Seed Crops Engineering PHA production in the cytoplasm of plants requires 10 directing the expression of (3-oxidation to the cytosol of the plant. No targeting signals are present in the bacterial systems, such as faoAB. In fungi, yeast, plants, and mammals, (3-oxidation occurs in subcellular organelles. Typically, the genes are expressed from the nuclear chromosome, and the polypeptides synthesized in the cytoplasm are directed 15 to these organelles by the presence of specific amino acid sequences. To practice the methods described herein using genes isolated from eukaryotic sources, e.g., fatty acid oxidation enzymes from eukaryotic sources, such as yeast, fungi, plants, and mammals, the removal or modification of subcellular targeting signals is required to direct the enzymes to the cytosol.
20 It may be useful to add signals for directing proteins to the endoplasmic reticulum. Peptides useful in this process are well known in the art. The general approach is to modify the transgene by inserting a DNA sequence specifying an ER targeting peptide sequence to form a chimeric gene.
Eukaryotic acyl CoA dehydrogenases, as well as other mitrochondrial 25 proteins, are targeted to the mitochondria via leader peptides on the N-terminus of the protein that are usually 20-60 amino acids long (Norwich, Current Opinion in Cell Biology, 2:625-33 (1990)). Despite the lack of an obvious consensus sequence for mitochondria) import leader peptides, mutagenesis of key residues in the leader sequence have been demonstrated 30 to prevent the import of the mitochrondrial protein. For example, the import of Saccharomyces cerevisiae F1-ATPase was prevented by mutagenesis of its leader sequence, resulting in the accumulation of the modified precursor protein in the cytoplasm (Bedwell, et al., Mol. Cell Biol. 9:1014-25 (1989)) Three eukaryotic peroxisomal targeting signals have been reported (Gould, et al., J. Cell Biol. 108:1657-64 (1989); Brickner, et al., J. Plant Physiol., 113:1213-21 (1997)). The tripeptide targeting signal S/A/C-K/H/R-L occurs at the C-terminal end of many peroxisomal proteins (Gould, et al., J. Cell Biol. 108:1657-64 ( 1989)). Mutagenesis of this sequence has been shown to prevent import of proteins into peroxisomes. Some peroxisomal proteins do not contain the tripeptide at the C-terminal end of the protein. For these proteins, it has been suggested that targeting occurs via the tripeptide in an internal position within the protein sequence (Gould, et al., J. Cell Biol. 108:1657-64 (1989)) or via an unknown, unrelated sequence (Brickner, et al., .l. Plant Physiol. 113:1213-21 (1997)). The results of in vitro peroxisomal targeting experiments with fragments of acyl CoA oxidase from Candida tropicalis appear to support the latter theory and suggest that there are two separate targeting signals within the internal amino acid sequence of the polypeptide (Small, et al., The EMBO Journal 7:1167-73 (1988)). In the aforementioned study, the targeting signals were localized to two regions of 118 amino acids in length, and neither of regions was found to contain the targeting signal S/A/C-K/H/R-L. A small number of peroxisomal proteins appear to contain an amino terminal leader sequence for import into peroxisomes (Brickner, et al., J. Plant Physiol. 113:1213-21 (1997)). These targeting signals can be deleted or altered by site directed mutagenesis.
' V. Cultivation and Harvesting of Transgenic Plant The transgenic plants can be grown using standard cultivation techniques. The plant or plant part also can be harvested using standard equipment and methods. The PHAs can be recovered from the plant or plant part using known techniques such as solvent extraction in conjunction with traditional seed processing technologies, as described in PCT WO 97/15681, or can be used directly, for example, as animal feed, where it is unnecessary to extract the PHA from the plant biomass.
Several Iines which did not produce seed, produced much higher levels of biomass. produced much higher levels of biomass. This phenotype therefore may be useful as a means to increase the amount of green biomass produced per acre for silage, forage, or other biomass crops. End uses include the more cost effective production of forage crops for animal feed or as energy crops for electric power generation. Other uses include increasing biomass levels in crops, such as alfalfa or tobacco, for subsequent recovery of industrial products, such as PHAs by extraction.
The compositions and methods of preparation and use thereof described herein are further described by the following non-limiting examples.
Example 1: Isolation and Characterization of the Pseudomonas putida faoAB Genes and Fao Enzyme All DNA manipulations, including PCR, DNA sequencing E. coli transformation, and plasmids purification, were performed using standard procedures, as described, for example, by Sambrook et. al., Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press, New York (1989)). The genes encoding faoAB from Pseudomonas putida were isolated using a probe generated from P. putida genomic DNA by PCR
(polymerase chain reaction) using primers 1 and 2 possessing homology to faoB from Pseudomonas fragi (Sato, et al., J. Biochem. 111:8-15 (1992)).
Primer 1:
5' gat ggg ccg ctc caa ggg tgg 3' (SEQ D7 NO:S) Primer 2:
5' caa ccc gaa ggt gcc gcc att 3' (SEQ >D N0:6) A 1.1 kb DNA fragment was purified from the PCR reaction and used as a probe to screen a P. putida genomic library constructed in plasmid pBKCMV using the lambda ZAP expression system (Stratagene). Plasmid pMFXI was selected from the positive clones and the DNA sequence of the insert containing the faoAB genes and flanking sequences determined. This is shown in SEQ ID N0:7. A fragment containing faoAB was subcloned with the native P. putida ribosome binding site intact into the expression WO 99/45122 PC'T/US99/04999 vector pTRCN forming plasmid pMFX3 as follows. Plasmid pMFXI was digested with BsrG I. The resulting protruding ends were filled in with Klenow. Digestion with Hind III yielded a 3.39 kb blunt ended/Hind III
fragment encoding FaoAB. The expression vector pTRCN was digested with Sma llHind III and ligated with the faoAB fragment forming the 7.57 kb plasmid pMFX3.
Enzymes in the FaoAB multienzyme complex were assayed as follows. Hydratase activity was assayed by monitoring the conversion of NAD to NADH using the coupling enzyme L-~i-hydroxyacyl CoA
10 dehydrogenase as previously described, except that assays were run in the presence of CoA (Filppula, et al., J. Biol. Chem. 270:27453-57 (1995)).
Severe product inhibitation of the coupling enzyme was observed in the absence of CoA. The assay contained (1 mL final volume) 60 p,M crotonyl CoA, 50 p.M Tris-CI, pH 9, 50 p,g bovine serum albumin per mL, 50 mM
KCI, 1 mM NAD, 7 pg L-specific ~i-hydroxyacyl CoA dehydrogenase from porcine heart per mL, and 0.25 mM CoA. The assay was initiated with the addition of FaoAB to the assay mixture. A control assay was performed without substrate to determine the rate of consumption of NAD in the absence of the hydratase generated product, S-hydroxybutyryl CoA. One unit of activity is defined as the consumption of one p,Mol of NAD per min (ESao = 6220 M'1 cni ~ ).
Hydroxyacyl CoA dehydrogenase was assayed in the reverse direction with acetoacetyl CoA as the substrate by monitoring the conversion of NADH to NAD at 340 nm (Binstock, et aL, Methods in Enzymology, 71:403 (1981)). The assay contained (1 mL final volume) 0.1 M KH2P04, pH 7, 0.2 mg bovine serum albumin per mL, 0.1 mM NADH, and 33 p,M
acetoacetyl CoA. The assay was initiated with the addition of FaoAB to the assay mixture. When necessary, enzyme samples were diluted in 0.1 M
KH2POa, pH 7, containing 1 mg bovine serum albumin per mL. A control 30 assay was performed without substrate acetoacetyl CoA to detect the rate of consumption of NADH in the crude due to enzymes other than hydroxyacyl CoA dehydrogenase. One unit of activity is defined as the consumption of WO 99/45122 PC'TNS99/04999 one p.Mol of NADH per minute (s3~o= 6220 M-'crri').
HydroxyacylCoA dehydrogenase was assayed in the forward direction with crotonyl CoA as a substrate by monitoring the conversion of NAD -to NADH at 340 nm (Binstock, et al., Methods in Enzymology, 71:403 5 (1981)). The assay mixture contained (1 mL final volume) 0.1 M KH2P04, pH 8, 0.3 mg bovine serum albumin per mL, 2 mM ~i-mercaptoethanol, 0.25 mM CoA, 30 p,M crotonyl CoA, and an aliquot of FaoAB. The reaction was preincubated for a couple of minutes to allow in situ formation of S-hydroxybutyryl CoA. The assay then was initiated by the addition of NAD
10 (0.45 mM). A control assay was performed without substrate to detect the rate of consumption of NAD due to enzymes other than hydroxyacyl CoA
dehydrogenase. One unit of activity is defined as the consumption of one p.Mol ofNAD per minute (E3ao= 6220 M-'crri') Thiolase activity was determined by monitoring the decrease in 15 absorption at 304 nm due to consumption of substrate acetoacetyl CoA as previously described with some modifications (Palmer, et al., J. Biol. Chem.
266:1-7 (1991)). The assay contained (final volume 1 mL) 62.4 mM Tris-Cl, pH 8.1, 4.8 mM MgCl2, 62.5 p.M CoA, and 62.5 p.M acetoacetyl CoA. The assay was initiated with the addition of FaoAB to the assay mixture. A
20 control sample without enzyme was performed for each assay to detect the rate of substrate degradation of pH 8.1 in the absence of enzyme. One unit of activity is defined as the consumption of one N.MoI of substrate acetoacetyl CoA per minute (s3ao= 169001V1-'cni').
Epimerase activity was assayed as previously described (Binstock, et 25 al., Methods in Enrymology, 71:403 (1981)) except that R-3-hydroxyacyl CoA thioesters were utilized instead of D,L-3-hydroxyacyl CoA mixtures.
The assay contained (final volume 1 mL) 30 p,M R-3-hydroxyacyl CoA, ISO
mM KH2PO4 (pH 8), 0.3 mg/mL BSA, 0.5 mM NAD, 0.1 mM CoA, and 7 p,g/mL L-specific ~i-hydroxyacyl CoA dehydrogenase from porcine heart.
30 The assay was initiated with the addition of FaoAB.
For expression of FaoAB in DHSoc/pMFX3, cultures were grown in 2xTY medium at 30 °C. 2xTY medium contains (per L) 16 g tryptone, 10 g yeast, and 5 g NaCI. A starter culture was grown overnight and used to inoculate (1% inoculum) fresh medium (100 mL in a 250 mL Erlenmeyer flask for small scale growths; 1.5L in a 2.8L flask for large scale growths).
Cells were induced with 0.4 n~IVI IPTG when the absorbance at 600 nm was in the range of 0.4 to 0.6. Cells were cultured an additional 4 h prior to harvest. Cells were lysed by sonication, and the insoluble matter was removed from the soluble proteins by centrifugation. Acyl CoA
dehydrogenase activity was monitored in the reverse direction to ensure activity of the FaoA subunit (SEQ m N0:31 ) and thiolase activity was assayed to determine activity of the Fao subunit. FaoAB in DHSoc/pMFX3 contained dehydrogenase and thiolase activity values of 4.3 and 0.99 U/mg, respectively, which is significantly more than the 0.0074 and 0.0033 U/mg observed for dehydrogenase and thiolase, respectively, in control strain DHSoc/pTRCN.
1 S FaoAB was purified from DHSoc/pMFX3 using a modified procedure previously described for the purification of FaoAB from Pseudomonas fragi (Imamura, et al., .I. Biochem. 107:184-89 ( 1990)). Thiolase activity (assayed in the forward direction) and dehydrogenase activities (assayed in the reverse direction) were monitored throughout the purification. Three liters of DHSoc/pMFX3 cells (2 X 1.5 L aliquots in 2.8 L Erlenmeyer flasks) were grown in 2 x TY medium using the cell growth procedure previously described for preparing cells for enzyme activity analysis. Cells {15.8 g) were resuspended in 32 mL of 10 mM KH2P04, pH 7, and lysed by sonication. Soluble proteins were removed from insoluble cells debris by centrifugation (18,000 RPM, 30 min., 4 °C). The soluble extract was made 50% in acetone and the precipitated protein was isolated by centrifugation and redissolved in 10 mM KHZP04, pH 7. The sample was adjusted to 33%
saturation with (NH4)2504 and the soluble and insoluble proteins were separated by centrifugation. The resulting supernatant was adjusted to 56%
saturation with (NHa)2504 and the insoluble pellet was isolated by centrifugation and dissolved in 10 mM KH2P04, pH 7. The sample was heated at 50°C for 30 min. and the soluble proteins were isolated by centrifugation and dialyzed in a 6,000 to 8,000 molecular weight cut off membrane in 10 mM KH2POa, pH 7 (2 X 3L; 20 h). The sample was loaded on a Toyo Jozo DEAF FPLC column (3 cm x 14 cm) that previously had been equilibrated in 10 mM KHzP04, pH 7. The protein was eluted with a linear gradient (100 mL by 100 mL; 0 to 500 mM NaCI in 10 KHZP04, pH 7) at a flow of 3 mL/min. FaoAB eluted between 300 and 325 mM NaCI. The sample was dialyzed in a 50,000 molecular weight cut off membrane in I O
mM KH2POa, pH 7 (1 X 2L; 15h) prior to loading on a macro-prep hydroxylapatite 18/30 (Biorad) FPLC column (2 cm x 15 cm) that previously had been equilibrated in 10 mM KH2P04, pH 7. The protein was eluted with a linear gradient (250 mL by 250 mL; 10 to 500 mM KH2P04, pH 7) at a flow rate of 3 mL/min. FaoAB eluted between 70 and 130 mM KH2P04, The fractions containing activity were concentrated to 9 mL using a MIZLIPORETM 100,000 molecular weight cutoff concentrator. The buffer was exchanged 3 times with 10 mM KHZP04, pH 7 containing 20% sucrose and frozen at -70°C. Enzyme activities of the hydroxylapatite purified fraction were assayed with a range of substrates. The results are shown in Table 1 below.
T~AIP i ~ Fnwmp Cuhctratec and Activities En me Substrate Activit U/m h dratase croton 1 CoA 8.8 deh dro enase forwardcroton 1 CoA 0.46 deh dro enase reverseacetoace 1 CoA 29 thiolase acetoace 1 CoA 9.9 a imerase R-3-h drox octan 0.022 1 CoA
a imerase R-3- h drox hexan 0.0029 1 CoA
epimerase R-3- hydroxybutyryl0.000022 CoA
Example 2: Production of Antibodies to the FaoAB and FaoAB Polypeptides Following purification of the FaoAB protein as described in Example 1, a sample was separated by SDS-PAGE. The protein band corresponding to the FaoA (SEQ ID N0:31) and FaoB (SEQ ID N0:26) was excised and used to immunize New Zealand white rabbits with complete Freunds adjuvant. Boosts were performed using incomplete Freunds at three week - intervals. Antibodies were recovered from serum by affinity chromatography on Protein A columns (Pharmacia) and tested against the antigen by Western blotting procedures. Control extracts of Brassica seeds were used to test for cross reactivity to plant proteins. No cross reactivity S was detected.
Example 3: Construction of Plasmids for Expression of the Pseudomonas putido fao AB Genes in Transgenic Oilseeds Construction of pSBS2024 Oligonucleotide primers GVR471 5'-CGGTACCCATTGTACTCCCAGTATCAT-3' (SEQ ID N0:8) and GVR472 S'-CA TTTAAA TAGTAGAGTATTGAATATG-3' (SEQ ID N0:9) homologous to sequences flanking the 5' and 3' ends (underlined), respectively, of the bean phaseolin promoter (SEQ 117 NO:10; Slightom et al., 1983) were designed with the addition of KpnI {in italics, nucleotides 1-in SEQ ID N0:8) and SwaI (in italics, nucleotides 1-9 in SEQ ID N0:9) at the 5' ends of GVR471 and GVR472, respectively. These restriction sites were incorporated to facilitate cloning. The primers were used to amplify a 1.4 kb phaseolin promoter, which was cloned at the SmaI site in pUCl9 by blunt ended ligation. The designated plasmid, pCPPI (see Figure 2) was cut with SaII and SwaI and ligated to a SaIIlSwaI phaseolin terminator (SEQ ID
N0:27). The bean phaseolin terminator sequence encompassing the ' polyadeilylation signals was amplified' using the following PCR'piiliiers: ' GVR396:
5'-GATTTAAATGCAAGCTTAAATAAGTATGAACTAAAATGC-3' (SEQ ID N0:22) and GVR397:
S'-CGGTACCTTAGTTGGTAGGGTGCTA-3' (SEQ ID N0.23) and the 1.2Kb fragment (SEQ ID N0:27) cloned into Sall-SaI site of pCCPI
to obtain pSBS2024 (Figure 2). The resulting plasmid which contains a unique HindIII site for cloning was called pSBS2024 (Figure 2).
Construction of pSBS2025 A soybean oleosin promoter fragment (SEQ ID NO:l 1; Rowley et al., 1997) was simplified with primers that flank the DNA sequence.
Primer JA408 5' -TCTAGATACATCCATTTCTTAATATAATCCTCTTATTC-3' (SEQ ID N0:12) contains sequences that are complementary to the 5' end (underlined).
Primer np 1 5' -CA TTTAAA TGGTTAAGGTGAAGGTAGGGCT-3' (SEQ ID N0:13) contains sequences homologous to the 3' end (underlined) of the promoter fragment. The restriction sites Xbal (in italics) and SwaI (in italics) were incorporated at the 5' end of JA408 and npl, respectively, to facilitate cloning. The primers were used to amplify a 975 by promoter fragment, which then was cloned into Small site of pUCl9 (see Figure 2). The resulting plasmid, pCSPI, was cut with Salt and Swal and iigated to the soybean terminator (SEQ ID N0:28). The soybean oleosin terminator was amplified by PCR using the following primers:
JA410:
5'-AAGCTTACGTGATGAGTATTAATGTGTTGTTATG-3' (SEQ ID N0:29) and JA411:
'S'-TCfiAGACAATTCATCAAATACAAATCACATTGCC-3' (SEQ ID N0:30) and the 225 by fragment cloned into the SaII-SwaI site of pCSPl to obtain plasmid pSBS2025 (Figure 6). The designated plasmid, pSBS2025, earned a unique HindIII site for cloning (Figure 2).
Construction of Promoter-coding Sequence Fusions Two oligonucleotide primers were synthesized:
np2 5'AA GCTT'AAAATGATTTACGAAGGTAAAGCC-3' (SEQ ID N0:14) homologous to nucleotides 553 to 573 of the 5' flanking sequences, and np3 5' A TTGCTTT CAGTTGAAGCGCTG-3' (SEQ ID NO:15) complementary to nucleotides 2700 to 2683 flanking the 3' end of mfl (faoA, SEQ ID N0:24) of plasmid pmfx3. A HindIII (in italics) site was introduced at the S' end of primers np2 and np3 to facilitate cloning. In addition, a 3 by AAA sequence (bold) was incorporated to obtain a more favorable sequence surrounding the plant translational initiation codon. Primers np2 and np3 were used to amplify the fragment and cloned into SmaI site of pUCl9. The resulting plasmid was called pCmfl (Figures 3A and 3B). Plasmid pBmfl was constructed in a similar process {Figures SA and SB). In order to generate a HindIII {in italics) at 5' and 3' ends of the mFl (faoB) gene (SEQ
ID N0:25) for cloning, a second set of synthetic primers were designed.
Primers np4 5' AAGCTTAAAATGAGCCTGAATCCAAGAGAC-3' (SEQ ID N0:16) complementary to 5' (nucleotides 2732-2752 bp) and np5 5'AAGCTTTCAGACGCGTTCGAAGACAGTG -3' ' w (SEQ ID N0:17) homologous to 3' (nucleotides 3907-3886 bp) sequences of mf2 (faoB, SEQ
ID N0:25) of plasmid pmfx3 were used in a PCR reaction to amplify the 1.17 kb DNA fragment. The resulting PCR product was cloned into the EcoRV site of pBluescript. The plasmid was referred to as pBmf2.
Both plasmids were individually cut with HindIII and their inserts cloned in plasmids pSBS2024 and pSBS2025, which had previously been linearized with the same restriction enzyme. As a result, the following WO 99/45122 PC'T/US99/04999 plasmids were generated: pmf124 and pmf125 (Figures 3A and 3B) and pmfZ24 and pmf225 (Figures SA and SB) containing the Fao genes (mfl and mf2) fused to either the phaseolin or soybean promoters. DNA sequence analysis confirmed the correct promoter-coding sequence-termination sequence fusions for pmf124, pmf125, pmfZ24, and pmf225.
Example 4: Assembly of Promoter-coding Sequence Fusions into Plant Transformation Vectors After obtaining plasmids pmf124, pmf125, pmf224, and pmf225, promoter-coding sequence fusions were independently cloned into the binary vectors, pCGN1559 (McBride and Summerfelt, 1990) containing the CaMV
35S promoter driving the expression of NPTII gene (conferring resistance to the antibiotic kanamycin) and pSBS2004 containing a parsley ubiquitin promoter driving the PPT gene, which confers resistance to the herbicide phosphinothricine. Binary vectors suitable for this purpose with a variety of selectable markers can be obtained from several sources.
The phaseolin-mf21 fusion cassette was released from the parent plasmid withXbaI and ligated with pCGN1559, which had been linearized with the same restriction enzyme. The resulting plasmid was designated pCGmf124 (Figures 3A and 3B). Plasmid pCGmf125 containing the soybean-mfl fusion was constructed in a similar way (Figures 3A and 3B), except that both pmf125 and pCGNl 559 were cut with BamHI before ligation.
Construction of pmf1249 an pmf1254 The plasmid pSBS2004 was linearized with BamHI fragment containing the soybean-mfl fusion. This plasmid was designated pmf1254 (Figures 4A and 4B). Similarly, the Xbal phaseolin-mfl fusion fragment was ligated to pSBS2004 which had been linearized with the same restriction enzyme. The resulting plasmid was designated pmfI249 (Figures 4A and 4B).
Construction of pCGmfz24 and pCGmf225 The phaseolin-mfZ and soybean-mf2 fusions were constructed by excising the fusions from the vector by cutting with either BamHI or XbaI, and cloned into pCGNI559 which had been linearized with either restriction enzyme (Figures SA and SB).
Construction of ~CGmfl P2S and pCGmf2P 1 S
The two expression cassettes containing the promoter-coding sequence fusions were assembled on the same binary vector as follows:
Plasmid pmf124 containing the phaseolin-mfl fusion was cut with BamHI
and cloned into the BamHI site of pCGN1559 to create pCGmfB 124. This plasmid then was linearized with XbaI and Iigated to the XbaI fragment of pmf225 containing the soybean-mf2 fusion. The final plasmid was designated pCGmfl P2S (Figures 6A and 6B). Plasmid pCGmfZP 1 S was assembled in similar manner. The phaseolin-mfZ fusion was released from pmf224 by cutting with BamHI and cloned at the BamHI site of pCGN 1559.
The resulting plasmid, pCGmfB224, was linearized with XbaI and ligated to the Xbal fragment of pmf125 containing the soybean-mfl fusion (Figures 6A
and 6B).
Example 5: Transformation of Brassica Brassica seeds were surface sterilized in 10% commercial bleach (Javex, Colgate-Palmolive) for 30 min. with gentle shaking. The seeds were washed three times in sterile distilled water. Seeds were placed in germination medium comprising Murashige-Skoog (MS) salts and vitamins, 3% (w/v) sucrose and 0.7% (w/v) phytagar, pH 5.8 at a density of 20 per ' ' plate and maintained at 24 °C and a 16 h light 7 8 h dark' photoperiod 'at a light intensity of 60-80 ,uEni 2s'1 for four to five days.
Each of the constructs, pCGmf124, pCGmf125, pCGmf224, pCGmfIP2S, and pCGmf2PlS were introduced intoAgrobacterium tumefacians strain EHA101 (Hood et al., J. Bacteriol. 168:1291-1301 (1986)) by electroporation. Prior to transformation of cotyledonary petioles, single colonies of strain EHA101 harboring each construct were grown in 5 ml of minimal medium supplemented with 100 mg kanamycin per liter and 100 mg gentamycin per liter for 48 hr at 28 °C. One milliliter of bacterial suspension was pelletized by centrifugation for 1 min in a microfuge. The pellet was resuspended in 1 ml minimal medium.
For transformation, cotyledons were excised from 4 day old, or in some cases S day old, seedlings, so that they included approximately 2 mm of petiole at the base. Individual cotyledons with the cut surface of their petioles were immersed in diluted bacterial suspension for 1 s and immediately embedded to a depth of approximately 2 mm in co-cultivation medium, MS medium with 3% (w/v) sucrose and 0.7% phytagar and enriched with 20 ,uM benzyladenine. The inoculated cotyledons were plated at a density of 10 per plate and incubated under the same growth conditions for 48 h. After co-cultivation, the cotyledons then were transferred to regeneration medium comprising MS medium supplemented with 3%
sucrose, 20 pM benzyladenine, 0.7% (w/v) phytagar, pH 5.8, 300 mg timentinin per liter, and 20 mg kanamycin sulfate per liter.
After two to three weeks, regenerant shoots obtained were cut and maintained on "shoot elongation" medium (MS medium containing, 3%
sucrose, 300 mg timentin per liter, 0.7% (w/v) phytagar, 300 mg timentinin per liter, and 20 mg kanamycin sulfate per liter, pH 5.8) in Magenta jars.
The elongated shoots were transferred to "rooting" medium comprising MS
medium, 3% sucrose, 2 mg indole butyric acid per liter, 0.7% phytagar, and 500 mg carbenicillin per liter. After roots emerged, plantlets were transferred to potting mix (Redi Earth, W.R. Grace and Co.). The plants were maintained in a misting chamber (75% relative humidity) under the same growth conditions. Two to three weeks after growth, leaf samples were taken for neomycin phosphotransferase (NPTII) assays (Moloney et al., Plant Cell Reports 8:238-42 (1989)).
Seeds from the FaoA and FaoB transgenic lines can be analyzed for expression of the fatty acid oxidation polypeptides by western blotting using the anti-FaoA and anti-FaoB antibodies. The FaoB polypeptide (SEQ ID
N0:26) is not functional in the absence of the FaoA gene product; however, the FaoAB gene product has enzyme activity.
Transgenic lines expressing the FaoA and FaoB complex are obtained by crossing the FaoA and FaoB transgenic lines expressing the individual polypeptides and seeds analyzed by western blotting and enzymes assays as described.
Example 6: Transformation of B. napes cv. Westar and Analysis of Transgenic Lines Transformation The protocol used was adopted from a procedure described by Moloney et ad. ( 1989). Seeds of Brassica napes cv. Westar were surface sterilized in 10% commercial bleach (Javex, Colgate-Palmolive Canada Inc.) for 30 min with gentle shaking. The seeds were washed three times in sterile distilled water. Seeds were placed on germination medium comprising Murashige-Skoog (MS} salts and vitamins, 3% sucrose and 0.7% phytagar, pH 5.8 at a density of 20 per plate and maintained at 24 °C in a 16 h light/8 h dark photoperiod at a light intensity of 60-80 p,Eni 2s 1 for four to five days.
Each of the constructs, pCGmf124, pCGmf125, pCGmf224, pCGmfZ25, pCGmfIP2S, and pCGmfZPIS were introduced into Agrobacterium tumefaciens strain EHA101 (Hood et al. 1986} by electroporation. Prior to transformation of cotyledonary petioles, single colonies of strain EHA101 harboring each construct were grown in 5 mL of minimal medium supplemented with 100 mg kanamycin per liter, and 100 mg gentamycin per liter for 48 h at 28 °C. One milliliter of bacterial suspension was pelletized by centrifugation for 1 min in a microfuge. The pellet was resuspended in 1 mL minimal medium.
For transformation, cotyledons were excised from four-day-old, or in some cases five-day-old, seedlings so that they included approximately 2 mm of petiole at the base. Individual cotyledons with the cut surface of their petioles were immersed in diluted bacterial suspension for 1 s and immediately embedded to a depth of approximately 2 mm in co-cultivation medium, MS medium with 3% sucrose and 0.7% phytagar, enriched with 20 N,M benzyladenine. The inoculated cotyledons were plated at a density of 10 per plate and incubated under the same growth conditions for 48 h. After co-cultivation, the cotyledons then were transferred to regeneration medium, which comprised MS medium supplemented with 3% sucrose, 20 p.M
benzyladenine, 0.7% phytagar, pH 5.8, 300 mg timentinin per liter, and 20 mg kanamycin sulfate per liter.
After two to three weeks, regenerant shoots were obtained, cut, and maintained on "shoot elongation" medium (MS medium containing 3%
sucrose, 300mg timentin per liter, 0.7% phytagar, and 20 mg kanamycin per liter, pH 5.8) in Magenta jars. The elongated shoots then were transferred to "rooting" medium, which comprised MS medium, 3% sucrose, 2 mg indole butyric acid per liter, 0.7% phytagar and 500 mg carbenicillin per liter.
After roots emerged, the plantlets were transferred to potting mix (Redi Earth, W.R. Grace and Co. Canada Ltd.). The plants were maintained in a misting chamber (75% RH) under the same growth conditions. Two to three weeks after growth, leaf samples were taken for neomycin phosphotransferase (NPT II) assays (Moloney et al. 1989). The results are shown in Table 2 below. The data show the number of plants that were confirmed to be transformed.
Table 2: NPT II Activity in Transformed Plants Constructs No. of NPTII NPTII No. of plants plants assayedconfirmedconfirmed transformed CGmfl24 47 27 23 33 CGmf125 37 28 18 18 CGmfZ24 49 40 30 39 CGmf225 52 37 28 34 SpCGmfIP2S 27 27 21 21 6 CGmfl,P
pCGmf124 -bean pltaseolin regulatory sequences driving FaoA gene 'pCGmf125 - soybean oleosin regulatory sequences driving FaoA gene 3pCGmf224 - be n phaseolin regulatory sequences driving FaoB gene 4pCGmfl25 - soybean oleosin regulatory sequences driving FaoB gene SpCGmf192S - bean phaseolin and soybean oleosin regulatory sequences driving FaoA &
FaoB genes, respectively 6pCGmflPlS -bean phaseolin and soybean oleosin regulatory sequences driving FaoB &
FaoA genes, respectively The fate of the transforming DNA was investigated for sixteen randomly selected transgenic lines. Southern DNA hybridization analysis showed that the FaoA and/or FaoB were integrated into the genomes of the transgenic lines tested.
Approximately 80% of the pmf124 transgenic plants in which the FaoA gene is expressed from the strong bean phaseolin promoter were observed to be male sterile. Clearly high level expression of the FaoA gene 5 from this promoter results in functional expression of the FaoA gene product which impairs seed and/or pollen development. This result was very unexpected, since it was not anticipated that the plant cells would be capable of carrying out the first step in the (i-oxidation pathway in the cytosol.
This result, however, provides additional applications for expressing ~i-oxidation 10 genes in plants for male sterility for hybrid production or to prevent the production of seed. It was also note that in a side-by-side comparison with normal transgenic lines, the pmf124 lines produced much higher levels of biomass, presumably due to the elimination of seed development. This phenotype therefore may be useful as a means to increase the amount of 15 green biomass produced per acre for silage, forage, or other biomass crops.
Here, the use of an inducible promoter system or recombinase technology could be used to produce seed for planting. Seven of the sterile plants were successfully cross-pollinated with pollen from pmfl25 transgenic lines and set seeds.
20 Northern analysis on RNA from seeds from pmf224 lines containing the phaseolin promoter-FaoB constructs showed a signal indicative of the expected 1.2 kb transcript in all the samples tested except the control.
Northern analysis on RNA from seeds from pmf125 lines containing the weak soybean oleolsin promoter-FaoA constructs revealed a transcript of the .
25 expected size of 2.1 kb. Western blotting on 300-500 leg of protein from approximately 80% of seeds of pmf125 plants where the FaoA gene is expressed from the relatively weak soybean oleosin promoter were inconclusive, although a weak signal was detected in one transgenic line.
Fatt~Acid Analysis 30 Given the unexpected results indicating a strong metabolic effect of expressing the FaoA gene from the strong bean phaseolin promoter in seeds, the fatty acid profile of the seeds from transgenic lines expressing the FaoA
gene from the weak soybean oleosin promoter was analyzed. Seeds expressing only the FaoA gene or also expressing the FaoB gene from the bean phaseolin promoter were examined. The analysis was carried out as described in Millar et al., The Plant Cell 11:1889-902 (1998). Seed fatty 5 acid methyl esters (FAMES) were prepared by placing ten seeds of B. napus in 15 x 45-mm screw capped glass tubes and heating at 80 °C in 0.75 mL
of 1N methanolic HCl reagent (Supelco, PA) and 10 gL of 1 mg 17:0 methyl ester (internal standard) per mL overnight. After cooling the samples, the FAMES were extracted with 0.3 mL hexane and 0.5 mL 0.9% NaCI by 10 vortexing vigorously. The samples were allowed to stand to separate the phases, and 300 pL of the organic phase was drawn and analyzed on a Hewlett-Packard gas chromatograph.
Fatty acid profile analysis indicated the presence of an additional component or enhanced component in the lipid profile in all of the transgenic 15 plants expressing the FaoA gene SEQ m N0:24 which was absent from the control plants. This result again proves conclusively that the FaoA gene is being transcribed and translated and that the FaoA polypeptide SEQ ID NO:
27 is catalytically active. This peak also was observed in eleven additional transgenic plants harboring SoyP-FaoA, PhaP-FaoA-SoyP-FaoB, SoyP-20 FaoA-PhaP-FaoB genes and a sterile (PhaP-FaoA) plant cross-pollinated with SoyP-FaoB. These data clearly demonstrate functional expression of the FaoA gene and that even the very low levels of expression are sufficient to change the lipid profile of the seed. Adapting the methods described herein, one of skill in the art can express these genes at levels intermediate 25 between that obtained with the phaseolin promoter and the soybean oleosin promoter using other promoters such as the Arabidopsis oleosin promoter, napin promoter, or cruciferin promoter, and can use inducible promoter systems or recombinase technologies to control when fatty acid oxidation transgenes are expressed.
Example 7: Yeast ~i-oxidation Multi-functional Enzyme Complex S. cerevisiae contains a (i-oxidation pathway that proceeds via R-WO 99/45122 PC'fNS99/04999 hydroxyacyl CoA rather than the S-3-hydroxyacyl CoA observed in bacteria and higher eukaryotes. The fox2 gene from yeast encodes a hydratase that produces R-3-hydroxyacyl CoA from traps-2-enoyl-CoA and a dehydrogenase that utilizes R-3-hydroxyacyl-CoA to produce ~i-keto acyl CoAs.
The fox2 gene (sequence shown in SEQ ID NO:1) was isolated from S. cerevisiae genomic DNA by PCR in two pieces. Primers N-fox2b and N-bamfox2b were utilized to PCR a 1.1 kb SmaIBamHI fragment encoding the N-terminal region of Fox2, and primers C-fox2 and C-bamfox2 were utilized 10 to PCR a 1.6 kb BamHI/XbaI fragment encoding the C-terminal Fox2 region. The full fox2 gene was reconstructed via subcloning in vector pTRCN.
N-fox2b fox2 tcc ccc ggg agg agg ttt tta tta tgc ctg gaa att tat cct tca aag ata gag tt 1 S (SEQ ID N0:18) N-bamfox2b fox2 aaggatccttgatgtcatttacaactacc (SEQ ID NO:I9) C-fox2 fox2 get cta gat agg gaa aga tgt atg taa g (SEQ ID N0:20) 20 C-bamfox2 fox2 tgacatcaaggatcctttt (SEQ ID N0:21) The foil gene, however, does not possess a p-ketothiolase activity and this activity must be supplied by a second transgene. Representative sources of such a gene include algae, bacteria, yeast, plants, and mammals. The 25 bacterium Alcaligenes eutrophus possesses a broad specificity (3-ketothiolase gene suitable for use in the methods described herein. It can be readily isolated using the acetoacetyl-CoA thiolase gene as a hybridization probe, as described in U.S. Patent 5,661,026 to Peoples et al. This enzyme also has been purified (Haywood et al., FEMSMicro. Lett. 52:91 (1988)), and the 30 purified enzyme is useful for preparing antibodies or determining protein sequence information as a basis for the isolation of the gene.
Example 8: Plant ~i-Oxidation Gene The DNA sequence of the cDNA encoding (3-oxidation tetrafunctional protein, shown in SEQ ID N0:4, can be isolated as described in Preisig-Muller et al., J. Biol. Chem. 269:20475-8I (1994). The equivalent gene can be isolated from other plant species including Arabidopsis, Brassica, soybean, sunflower, and corn using similar procedures or by screening genomic libraries, many of which are commercially available, for example from Clontech Laboratories Inc., Palo Alto, California, USA. A
peroxisomal targeting sequence P-R-M was identified at the carboxy I O terminus of the protein. Constructs suitable for expressing in the plant cytosol can be prepared by PCR amplification of this gene using primers designed to delete this sequence.
SSQUENCB LISTING
<110> Metabolix, Inc.
<120> Modification of Fatty Acid Metabolism in Plants <130> MBX 024 <140>
<141>
<150> 60/077,107 <151> 1998-03-06 <160> 31 <170> PatentIn Ver. 2.0 <210> 1 <211> 2703 <212> DNA
<213> Saccharomyces <220>
<221> gene <222> (1)..(2703) <223> fox2 gene <400> 1 atgcctggaa atttatcctt caaagataga gttgttgtaa tcacgggcgc tggagggggc 60 ttaggtaagg tgtatgcact agcttacgca agcagaggtg caaaagtggt cgtcaatgat 120 ctaggtggca ctttgggtgg ttcaggacat aactccaaag ctgcagactt agtggtggat 180 gagataaaaa aagccggagg tatagctgtg gcaaattacg actctgttaa tgaaaatgga 240 gagaaaataa ttgaaacggc tataaaagaa ttcggcaggg ttgatgtact aattaacaac 300 gctggaatat taagggatgt ttcatttgca aagatgacag aacgtgagtt tgcatctgtg 360 ~w ~gtagatgttc atttgacagg tggctataag ctatcgcgtg°~c'tgettggcc ttatatgcgc 420 tctcagaaat ttggtagaat cattaacacc gcttcccctg ccggtctatt tggaaatttt 480 ggtcaagcta attattcagc agctaaaatg ggcttagttg gtttggcgga aaccctcgcg 540 aaggagggtg ecaaatacaa cattaatgtt aattcaattg cgccattggc tagatcacgt 600 atgacagaaa acgtgttacc accacatatc ttgaaacagt taggaccgga aaaaattgtt 660 cccttagtac tctatttgac acacgaaagt acgaaagtgt caaactccat ttttgaactc 720 gctgctggat tctttggaca gctcagatgg gagaggtctt ctggacaaat tttcaateca 780 gaccccaaga catatactcc tgaagcaatt ttaaataagt ggaaggaaat cacagactat 840 agggacaagc catttaacaa aactcagcat ccatatcaac tctcggatta taatgattta 900 atcaccaaag caaaaaaatt acctcccaat gaacaaggct cagtgsaaat caagtcgctt 960 tgcaacaaag tcgtagtagt tacgggtgca ggaggtggtc ttgggaagtc tcatgcaatc 1020 tggtttgcac ggtacggtgc gaaggtagtt gtaaatgaca tcaaggatcc tttttcagtt 1080 gttgaagaaa taaataaact atatggtgaa ggcacagcca ttccagattc ccatgatgtg 1140 gtcaccgaag ctcctctcat tatccaaact gcaataagta agtttcagag agtagacatc 1200 ttggtcaata acgctggtat tttgcgtgac aaatcttttt taaaaatgaa agatgaggaa 1260 tggtttgctg tcctgaaagt ccaccttttt tccacatttt cattgtcaaa agcagtatgg 1320 ccaatattta ecaaacaaaa gtctggattt attatcsata ctacttctac ctcaggaatt 1380 tatggtaatt ttggacaggc caattatgcc gctgcaaaag ccgccatttt aggattcagt 1440 aaaactattg cactggaagg tgccaagaga ggaattattg ttaatgttat cgctcctcat 1500 gcagaaacgg ctatgacaaa gactatattc tcggagaagg aattatcaaa ccactttgat 1560 gcatctcaag tetccccact tgttgttttg ttggcatctg sagaactaca aaagtattct 1620 ggaagaaggg ttattggcca attattcgaa gttggcggtg gttggtgtgg gcaaaccaga 1680 tggcaaagaa gttccggtta tgtttctatt aaagagacta ttgaaccgga agaaattaaa 1740 gaaaattgga accacatcac tgatttcagt cgcaacacta tcascccgag ctccacagag 1800 gagtcttcta tggcaacctt gcaagccgtg caaaaagcgc actcttcaaa ggagttggat 1860 gatggattat tcaagtacac taccaaggat tgtatcttgt acaatttagg acttggatgc 1920 acaagcaaag agcttaagta cacctacgag aatgatccag acttccaagt tttgcccacg 1980 ttcgccgtca ttccatttat gcaagctact gccacactag ctatggacaa tttagtcgat 2040 aacttcaatt atgcaatgtt actgcatgga gaacaatatt ttaagctctg cacgccgaca 2100 atgccaagta atggaactct aaagacactt gctaaacctt tacaagtact tgacaagaat 2160 ggtaaagccg ctttagttgt tggtggcttc gaaacttatg acattaaaac taagaaactc 2220 atagcttata acgaaggatc gttcttcatc aggggcgcac atgtacctcc agaaaaggaa 2280 gtgagggatg ggaaaagagc caagtttgct gtccaaaatt ttgaagtgcc acatggaaag 2340 gtaccagatt ttgaggcgga gatttctacg aataaagatc aagccgcatt gtacaggtta 2400 tctggcgatt tcaatccttt acatatcgat cccacgctag ccaaagcagt tsaatttcct 2460 acgccaattc tgcatgggct ttgtacatta ggtattagtg cgaaagcatt gtttgaacat 2520 tatggtccat atgaggagtt gaaagtgaga tttaccaatg ttgttttccc aggtgatact 2580 ctaaaggtta aagcttggaa gcaaggctcg gttgtcgttt ttcaaacaat tgatacgacc 2640 agaaacgtca ttgtattgga taacgccgct gtaaaactat cgcaggcaaa atctaaacta 2700 taa 2703 <210> 2 <211> 900 <212> PRT
<213> Saccharomyces <220>
<221> PEPTIDE
<2,22..> ~1) .,. (900) , .. . , . , ~ ., t <223> fox2 encoded polypeptide <400> 2 Met Pro Gly Asn Leu Ser Phe Lys Asp Arg Val Val Val Ile Thr Gly Ala Gly Gly Gly Lau Gly Lys Val Tyr Ala Leu Ala Tyr Ala Ser Arg Gly Ala Lys Val Val Val Asn Asp Leu Gly Gly Thr Leu Gly Gly Ser Gly His Asn Ser Lys Ala Ala Asp Leu Val Val Asp Glu Ile Lys Lys Ala Gly Gly Ile Ala Val Ala Asn Tyr Asp Ser Val Asn Glu Asn Gly Glu Lys Ile Ile Glu Thr Ala Ile Lys Glu Phe Gly Arg Val Asp Val Leu Ile Asn Asn Ala Gly Ile Leu Arg Asp Val Ser Phe Ala Lys Met Thr Glu Arg Glu Phe Ala Ser Val Val Asp Val His Leu Thr Gly Gly Tyr Lys Leu Ser Arg Ala Ala Trp Pro Tyr Mat Arg Ser Gln Lys Phe Gly Arg Ile Ile Asn Thr Ala Ser Pro Ala Gly Leu Phe Gly Asn Phe Gly Gln Ala Asn Tyr Ser Ala Ala Lys Met Gly Leu Val Gly Leu Ala Glu Thr Leu Ala Lys Glu Gly Ala Lys Tyr Asn Ile Asn Val Asn Ser Ile Ala Pro Leu Ala Arg Ser Arg Mat Thr Glu Asn Val Leu Pro Pro His Ile Leu Lys Gln Leu Gly Pro Glu Lys Ile Val Pro Leu Val Leu Tyr Leu Thr His Glu Ser Thr Lys Val Ser Asn Ser Ile Phe Glu Leu Ala Ala Gly Phe Phe Gly Gln Leu Arg Trp Glu Arg Ser Ser Gly Gln Ile Phe Asn Pro Asp Pro Lys Thr Tyr Thr Pro Glu Ala Ile Leu Asn Lys Trp Lys Glu Ile Thr Asp Tyr Arg Asp Lys Pro Phe Asn Lys Thr Gln His Pro Tyr Gln Leu Ser Asp Tyr Asn Asp Leu Ile Thr Lys Ala Lys Lys Leu Pro Pro Asn Glu Gln Gly Ser Val Lys Ile Lys Ser Leu Cys Asn Lys Val Val Val Val Thr Gly Ala Gly Gly Gly Leu Gly Lys Ser His Ala Ile Trp Phe Als Arg Tyr Gly Ala Lys Val Val Val Asn Asp Ile Lys Asp Pro Phe Ser Val Val Glu Glu Ile Aan Lys Leu Tyr Gly Glu Gly Thr Ala Ile Pro Asp Ser His Asp Val Val Thr Glu Ala Pro Leu Ile Ile Gln Thr Ala Ile Ser Lys Phe Gln Arg Val Asp Ile Leu Val Asn Asn Ala Gly Ile Leu Arg Asp Lys Sar Phe Leu Lys Met Lys Asp Glu Glu Trp Phe Ala Val Leu Lys Val His Leu Phe Ser Thr Phe Ser Leu Ser Lys Ala Val Trp Pro Ile Phe Thr Lys Gln Lys Ser Gly Phe Ile Ile Asn Thr Thr Ser Thr Ser Gly Ile Tyr Gly Aan Phe Gly Gln Ala Asn Tyr Ala Ala Ala Lys Ala Ala Ile Leu Gly Phe Ser Lys Thr Ile Ala Leu Glu Gly Ala Lys Arg Gly Ile Ile Val Asn Val Ile Ala Pro His Ala Glu Thr Ala Met Thr Lys Thr Ile Phe Ser Glu Lys Glu Leu Ser Asn His Phe Asp Ala Ser Gln Val Ser Pro Leu Val Val Leu Leu Ala Ser Glu Glu Leu Gln Lys Tyr Ser Gly Arg Arg Val Ile Gly Gln Leu Phe Glu Val Gly Gly Gly Trp Cys Gly Gln Thr Arg Trp Gln Arg Ser Ser Gly Tyr Val Ser Ile Lys Glu Thr Ile Glu Pro Glu Glu Ile Lys Glu Asn Trg Asn His Ile Thr Asp Phe Ser Arg Asn Thr Ile Asn Pro Ser Ser Thr Glu Glu Ser Ser Met Ala Thr Leu Gln Ala Val Gln Lys Ala His Ser Ser Lys Glu Leu Asp Asp Gly Leu Phe Lys Tyr Thr Thr Lys Asp Cys Ile Leu Tyr Asn Leu Gly Leu Gly Cys Thr Ser Lys Glu Leu Lys Tyr Thr Tyr Glu Asn Asp Pro Asp Phe Gln Val Leu Pro Thr Phe Ala Val Ile Pro Phe Met Gln Ala Thr Ala Thr Leu Ala Met Asp Asn Leu Val Asp Asn Phe Asn Tyr Ala Mat Leu Leu His Gly Glu Gln Tyr Phe Lys Leu Cys Thr Pro Thr Met Pro Ser Asn Gly Thr Leu Lys Thr Leu Ala Lys Pro Leu Gln Val Leu Asp Lys Asn Gly Lys Ala Ala Leu Val Val Gly Gly Phe Glu Thr Tyr Asp Ile Lys Thr Lys Lys Leu Ile Ala Tyr Asn Glu Gly Ser Phe Phe Ile Arg Gly Ala His Val Pro Pro Glu Lys Glu Val Arg Asp Gly Lys Arg Ala Lys Phe Ala Val Gln Asn Phe Glu Val Pro His Gly Lys Val Pro Asp Phe Glu Ala Glu Ile Ser Thr Asn Lys Asp Gln Ala Ala Leu Tyr Arg Leu Ser Gly Asp Phe Asn Pro Leu His Ile Asp Pro Thr Leu Ala Lys Ala Val Lys Phe Pro Thr Pro Ile Leu His Gly Leu Cys Thr Leu Gly Ile Ser Ala Lys Ala Leu Phe Glu His Tyr Gly Pro Tyr Glu Glu Leu Lys Val Arg Phe Thr Asn Val Val Phe Pro Gly Asp Thr Leu Lys Val Lys Ale Trp Lys Gln Gly Ser Val Val Val Phe Gln Thr Ile Asp Thr Thr Arg Asn Val Ile Val Leu Asp Asn Ala Ala Val Lys Leu Ser Gln Ala Lys Ser Lys Leu <210> 3 <211> 2177 <212> DNA
<213> Artificial Sequence <220>
<221> gene <222> (1)..(2177) <223> tetrafunctional beta-oxidation gene <220>
<223> Description of Artificial Sequence <400> 3 atgggaagca atgcaaaagg aagaacggta atggaggtgg gaactgatgg agtagcaata 60 atcaccatca tcaaccctcc agttaactcc ttgtcttttg atgtgttatt cagcctgaga 120 gatagttatg aacaagcctt'gagaagagat gatgtgaagg caattgttgt tacaggtgca 180 aagggaaaat tttctggtgg ctttgatata actgcttttg gtgtactcca aggaggaaag 240 ggggagcaac caaatgttag aaacatatca attgaaatga tcactgatat ttttgaagct 300 gcccgaaaac ctgcggttgc agcgatagat ggacttgctt tgggtggagg gttggaggtt 360 gccatggctt gtcatgctcg aatatcaact cctaccgctc aattagggtt gcctgaactt 420 cagctcggaa taattcctgg ttttggagga acacaacggc ttccacgtct tgttggtctc 480 tcaaaggccc tagaaatgat gttgacgtca aagccaatta aaggacaaga agctcattct 540 ttggggttag tggatgccat tgtccctccc gaagagttga tcaacactgc acgtagatgg 600 gctcttgaaa tcctagagcg gagaagacca tgggttcaca gtcttcacag gactgacaag 660 ttagagtctc ttgctgaggc taggaaaata tttaacttag ctagagctca ggcaaagaaa 720 cagtacccaa atcttaagca tacaattgcc tgcattgatg ctgttgaaac gggtgtcgtc 780 tctggccctc gtgctggact ttggaaggag gctgaagaat ttcagggact cctacattct 840 gatacttgca aaagcttaat tcatatcttc tttgcccagc gttcaacaac taaggtacct 900 ggagttactg atctgggttt ggtaccgaga caaatcaaga aagttgctat tgtcggagga 960 ggattaatgg gatctggtat agctacagca ttgattctta gcaactatca tgtggtactt 1020 aaagaagtga acgataagtt cttgcaggct ggcattgaca gagtcagagc aaacctacaa 1080 agccgagtca aaaaagggaa tatgactaat gagaaattcg aaasgagtat ttctttactc 1140 aagggagttc ttaactacga aagttttaaa gatgtagata tggtgataga ggctgttatt 1200 gagaatgttt ctttgaagca acaaatcttt tctgatcttg aaaaatattg ccccccacat 1260 tgcatgcttg ctactaatac ttccacaata gacttggagt tgattggaga gagaataaag 1320 tctcgtgaca ggattattgg aacagctgca caagtgattg ttgatctgct agatgttggg 1380 aagaatataa agaaaacacc agttgtcgtt ggaaattgta caggttttgc tgtcaacaga 1440 atgttttttc cctactctca ggctgcaatt ttacttgcag aacatggggt agatccctat 1500 cagattgaca gggctatttc caagtttgga atgccaatgg gacccttcag gttgtgcgac 1560 cttgttggtt ttggtgtggc agcagcaact gccagtcagt ttgttcaagc ttttccagaa 1620 agaacttata aatcgatgct aattcctctg atgcaagagg ataagaatgc aggtgaatcc 1680 actcgtaaag gtttctatgt ctatgacaag aaccgaaaag ctgggccaaa tccagagtta 1740 aagaaatata tcgagaaggc taggaacagt tctggtgttt cagttgatcc taagctcaca 1800 aagttacccg aaaaggacat tgtggagatg atatttttcc cagtggtgaa tgaagcatgt 1860 cgtgtccttg ccgaaggcat agcagtcaaa gcagctgacc tggacattgc tggtgtaatg 1920 ggaatgggtt tcccctccta cggggaggac tcatgttctg ggcggattct cttggatcaa 1980 attacatcta ttcaaggttg gaggaatggt cgaaacagta tggtggattt ttcaagccct 2040 gtggatactt agctgaaagg gctgttcagg gtgcaactct gagtgctccg ggtggccatg 2100 ctaaacctcg aatgtaagct catttcttta gtcctgctca catcatgccg ctattggaaa 2160 ttgtccgtac caagcat 2177 <210> 4 <211> 725 <212> PRT
<213> Cucumber <220>
<221> PEPTIDE
<222> (1)..(725) <223> tetrafunctional beta-oxidation protein <400> 4 Met Gly Ser Asn Ala Lys Gly Arg Thr Val Mat Glu Val Gly Thr Asp Gly Val Ala Ile Ile Thr Ile Ile Asn Pro Pro Val Asn Sar Leu Ser Phe Asp Val Lau Phe Ser Leu Arg Asp Sar Tyr Glu Gln Ala Leu Arg Arg Asp Asp Val Lys Ala Ile Val Val Thr Gly Als Lya Gly Lys Phe Ser Gly Gly Phe Asp Ile Thr Ala Phe Gly Val Leu Gln Gly Gly Lys Gly Glu Gln Pro Asn Val Arg Asn Ile Ser Ile Glu Met Ile Thr Asp Ile Phe Glu Ala Ala Arg Lys Pro Ala Val Ala Ala Ile Asp Gly Leu Ala Leu Gly Gly Gly Leu Glu Val Ala Met Ala Cys His Ala Arg Ile Ser Thr Pro Thr Ala Gln Leu Gly Leu Pro Glu Lau Gln Leu Gly Ile Ile Pro Gly Phe Gly Gly Thr Gln Arg Lau Pro Arg Leu Val C~ly Leu Ser Lys Ala Leu Glu Met Met Leu Thr Ser Lys Pro Ile Lys Gly Gln Glu Ala His Ser Leu Gly Leu Val Asp Ala Ile Val Pro Pro Glu Glu Leu Ile Asn Thr Ala Arg Arg Trp Ala Leu Glu Ile Leu Glu Arg Arg Arg Pro Trp Val His Ser Leu His Arg Thr Asp Lys Leu Glu Ser Leu Ala Glu Ala Arg Lys Ile Phe Asn Lau Ala Arg Ala Gln Ala Lys Lys Gln Tyr Pro Aan Leu Lys His Thr Ile Ala Cys Ile Asp Ala Val Glu Thr Gly Val Val Sar Gly Pro Arg Ala Gly Leu Trp Lys Glu Ala Glu 260 265 270' Glu Phe Gln Gly Leu Leu His Ser Asp Thr Cys Lys Ser Leu Ile His Ile Phe Phe Ala Gln Arg Ser Thr Thr Lys Val Pro Gly Val Thr Asp Leu Gly Leu Val Pro Arg Gln Ile Lys Lys Val Ala Ile Val Gly Gly Gly Leu Met Gly Ser Gly Ile Ala Thr Ala Leu Ile Leu Ser Asn Tyr His Val Val Leu Lys Glu Val Asn Asp Lys Phe Leu Gln Ala Gly Ile Asp Arg Val Arg Ala Asn Leu Gln Sar Arg Val Lys Lys Gly Asn Met Thr Asn Glu Lys Phe Glu Lys Ser Ile Ser Leu Leu Lys Gly Val Leu Asn Tyr Glu Ser Phe Lys Asp Val Asp Mat Val Ile Glu Ala Val Ile Glu Asn Val Ser Leu Lys Gln Gln Ile Phe Ser Asp Leu Glu Lys Tyr Cys Pro Pro His Cys Mat Leu Ala Thr Asn Thr Ser Thr Ile Asp Leu Glu Leu Ile Gly Glu Arg Ile Lys Ser Arg Asp Arg Ile Ile Gly His Thr Ala Ala Gln Val Ile Val Asp Leu Leu Asp Val Gly Lys Asn Ile Lys Lys Thr Pro Val Val Val Gly Asn Cys Thr Gly Phe Ala Val Asn Arg Met Phe Phe Pro Tyr Ser Gln Ala Ala Ile Leu Leu Ala Glu His Gly Val Asp Pro Tyr Gln Ile Asp Arg Ala Ile Ser Lys Phe Gly Met Pro Met Gly Pro Phe Arg Leu Cys Asp Leu Val Gly Phe Gly Val Ala Ala Ala Thr Ala Ser Gln Phe Val Gln Ala Phe Pro Glu Arg Thr Tyr Lys Ser Met Leu Ile Pro Leu Met Gln Glu Asp Lys Asn Ala Gly Glu Ser Thr Arg Lys Gly Phe Tyr Val Tyr Asp Lys Asn Arg Lys Ala Gly Pro Asn Pro Glu Leu Lys Lys Tyr Ile Glu Lys Ala Arg Asn Ser Ser Gly Val Ser Val Asp Pro Lys Leu Thr Lys Leu Pro Glu Lys Asp Ile Val Glu Met Ile Phe Phe Pro Val Val Asn Glu Ala Cys Arg Val Leu Ala Glu Gly Ile Ala Val Lys Ala Ala Asp Leu Asp Ile Ala Gly Val Met Gly Met Gly Phe Pro Ser Tyr Arg Gly Gly Leu Met Phe Trp Ala Asp Ser Leu Gly Ser Asn Tyr Ile Tyr Ser Arg Leu Glu Glu Trp Ser Lys Gln Tyr Gly Gly Phe Phe Lys Pro Cys Gly Tyr Leu Ala Glu Arg Ala Val Gln Gly Ala Thr Leu Ser Ala Pro Gly Gly His Ala Lys Pro Arg Mat Ala His Phe Phe Sar Pro Ala His Ile Met Pro Leu Leu Glu Ile Val Arg Thr Lys <210> 5 <211> 21 <212> DNA
<213> Artificial Sequence <220>
,. .~ . <223> Description of Artificial Sequence:
oligonucleotide primer- Primer 1 <400> 5 gatgggccgc tccaagggtg g 21 <210> 6 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Seguence:
oligonucleotide primer- Primer 2 <400> 6 caacccgaag gtgccgccat t 21 <210> 7 <211> 3907 <212> DNA
<213> Pseudomonas putida <220>
<221> gene <222> (1)..(3907) <223> faoAB gene sequeace <400> 7 gatggcgttt ttactgaaaa tttgcctccg gccatagaat ctcctacggg ggcatccagg 60 ctggtcaatc tgttctgtaa cgacaaagcg gcggctcggc ataaccctga aggggtgggg 120 tgccaagtcg ccgctttgcg ttctgctgcg cagaaaaggc caggcaggcc gggttattca 180 ttcaagcaag tgccagtgta gtcgccttgg ctcttcgcga agcgcaaagc aataagccga 240 tgcttgtggt caatcgctga catgaatgac cggcgatcac tgtcatcatg ttttgtaccc 300 ggacgataga aaatgactgg aaagtcgcgt tttcgccgtg ctcgtgagca tttgacaacg 360 cccatctttt cggagaatgt gtgcacaccc aaatcaaacg ggcgtatgaa ttgagcgttt 420 gtatgtcatg acgccaagcc atacttccga ctagcgcgct gacgggtgtg cctggggcaa 480 ggcagcttag gcctgcttct gctgcagcga tcggcgtgta cagttcagct tccatatcgt 540 ggagatcagt tgatgattta cgaaggtaaa gccatcacgg ttaaggctct tgaaagtggc 600 atcgtcgagc tcaagttcga cctcaagggt gagtccgtca acaagttcaa ccgccttacc 660 ctgaacgagc tgcgccaggc cgtcgatgcc atccgggccg atgcttcggt caagggcgtg 720 atcgtcagga gtggcaagga cgtgttcatc gtcggcgccg acatcaccga gttcgtcgac 780 aacttcaagc tgcctgaggc cgaactggtc gctggcaacc tggaagccaa tcgcatcttc 840 aacgcgttcg aagacctcga agtgccgacc gttgccgcca tcaacggcat cgcgctgggc 900 ggcggcctgg aaatgtgcct ggcggccgac taccgggtca tgtccaccag cgccaggatc 960 ggcctgccgg aagtcaagct gggtatctac ccgggctttg gcggtaccgt gcgcctgccg 1020 cgcctgatcg gctcggacaa cgccatcgag tggatcgccg ccggcaagga aaaccgtgcc 1080 gaagatgccc tgaaagtggg ggccgtcgac gccgtggtcg cccctgagct gctgctggcc 1140 w ° ggtgccctcg'-'acctgatcaa gcgtgccatc agtggcgagc 'tgg~ctacaa ggccaagcgc 1200 cagccgaagc tggaaaagct caagctcaat gccatcgagc agatgatggc cttcgagact 1260 gccaagggct tcgtcgctgg ccaggccggc ccgaactacc cggccccggt cgaagcgatc 1320 aagagcatcc agaaagccgc caacttcggt cgcgacaagg ccctggaagt cgaagccgca 1380 ggctttgcca agctggccaa gacctctgtc gccgagagcc tgatcggctt gttcctcaac 1440 gatcaggaac tcaagcgcaa ggccaaggcg catgacgaga tcgcccacga cgtgaagcag 1500 gccgccgtgc tcggcgccgg catcatgggc ggcggtatcg cctaccagtc ggcggtcaaa 1560 ggtacgccga tcctgatgaa ggacatccgc gaggaagcca ttcagctggg tctgaacgag 1620 gcctccaagt tgcttggcaa ccgcgtcgag aagggccgcc tgaccccggc caagatggcc 1680 gaggccctca acgccattcg cccgaccctg tcctatggcg atttcgccaa tgtcgacatc 1740 gtcgtcgagg ctgtggtcga gaacccgaag gtcaagcaag cggtactggc ggaagtggaa 1800 ggccaggtga aggacgatgc gatcctcgct tccaacacct ctaccatctc catcaacctg 1860 ctggccaagg cgctcaagcg cccggaaaac ttcgtcggca tgcacttctt caacccggtg 1920 il cacatgatgc cgctggttga agtgatccgt ggcgagaagt ccagtgacgt ggcggtcgcc 1980 accaccgtgg cctacgccaa gaaaatgggc aagaacccga tcgtggtcaa cgactgcccg 2040 ggctttttgg tcaaccgcgt gctgttcccg tactttggcg gttttgccaa gctggtcagc 2100 gccggtgtcg acttcgtgcg catcgacaag gtcatggaga agttcggctg gccgatgggc 2160 ccagcctact tgatggacgt ggtcggcatc gacaccggcc accacggccg tgacgtcatg 2220 gccgaaggct tcccggatcg catgaaggac gagcgccgct cggcagtcga cgcgttgtac 2280 gaggccaacc gcctgggcca gaagaacggt aagggcttct acgcctacga aaccgacaag 2340 cgcggcaagc cgaagaaggt cttcgatgcc sccgtgctcg acgtgctcaa accgatcgtg 2400 ttcgagcagc gtgaagtcac tgacgaagac atcatcaact ggatgatggt cccgctgtgc 2460 cttgagaccg tgcgttgcct ggaagacggc atcgtcgaaa ccgctgccga agccgacatg 2520 ggcctggtct acggcattgg tttccctccc ttccgcggtg gtgcgctgcg ttacatcgac 2580 tcgatcggtg tggccgaatt cgtcgccctg gccgatcagt atgccgacct ggggccgctg 2640 taccacccga ccgccaagct gcgtgaaatg gccaagaacg gccagcgctt cttcaactga 2700 gcggtcaacg agctagagcg agagatttga tatgagcctg aatccaagag acgtggtgat 2760 tgtcgacttc ggtcgcacgc caatgggccg ctccaagggt ggcatgcacc gcaacacccg 2820 cgccgaagac atgtcggcgc acctgatcag caagctgctg gaacgcaacg gcaaggtcga 2880 cccgaaagaa gtcgaggacg tgatctgggg ctgcgtcaac cagaccctgg agcagggctg 2940 gaacatcgcc cgcatggctt cgctgatgac cccgatcccg cacacctctg cggcgcagac 3000 cgtcagccgc ctgtgcggct cgtccatgag cgcgctgcac acggccgccc aggcgatcat 3060 gaccggtaac ggtgatgtgt tcgtggtcgg tggcgtggag cacatgggcc acgtcagcat 3120 gatgcatggc gtagacccca acccgcacct gtccttgcat gccgccaagg cttccgggat 3180 gatgggcctg actgcagaaa tgctcggcaa gatgcacggc atcacccgtg agcagcagga 3240 cctgttcggc ttgcgttcgc accagctggc ccacaaggcc acggtcgaag gcaagttcaa 3300 ggacgagatc atcccgatgc agggctacga cgagaacggc ttcctgaagg tgttcgattt 3360 cgacgaaacc attcgcccgg aaaccaccct cgaaggcctg gcatcgctca agcctgcgtt 3420 caacccgaaa ggcggtacgg tcacggccgg tacctcgtcg cagatcaccg acggcgcctc 3480 gtgcatgatc gtcatgtccg gtcagcgtgc catggacctc ggtatccagc cattggcggt 3540 gatccgttcg atggcagtgg ccggtgtcga cccggcaatc atgggctacg gcccggtgcc 3600 atcgacccag aaagccctca agcgtgcggg cttgaccatg gccgatatcg acttcatcga 3660 gctcaacgaa gccttcgctg cgcaggccct gcccgtgctg aaagacttga aagtgctcga 3720 caagatggat gagaaggtta acctgcacgg cggcgccatt gctttgggcc acccgttcgg 3780 ttgctccggg gcgcggattt ccggcaccct gctcaacgtc atgaagcaaa atggcggtac 3840 gctgggtgtt gcgaccatgt gcgtcggcct gggccaaggt atcaccactg tcttcgaacg 3900 cgtctga 3907 <210> g _ , ...~.. _", .. ... . ._ . ,.
<211> 27 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence oligonucleotide primer- Primer (iVR471 <400> 8 cggtacccat tgtactccca gtatcat 27 <210> 9 <211> 27 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer GVR472 <400> 9 catttaaata gtagagtatt gaatatg 27 <210> 10 <211> 1558 <212> DNA
<213> Bean Phaseolin <220>
<221> promoter <222> (1)..(1558) <400> 10 cggtacccat tgtactccca gtatcattat agtgaaagtt ttggctctct cgccggtggt 60 tttttacctc tatttaaagg ggttttccac ctaaaaattc tggtatcatt ctcactttac 120 ttgttacttt aatttctcat aatctttggt tgaaattatc acgcttccgc acacgatatc 180 cctacaaatt tattatttgt taaacatttt caaaccgcat aaaattttat gaagtcccgt 240 ctatctttaa tgtagtctaa cattttcata ttgaaatata taatttactt aattttagcg 300 ttggtagaaa gcataaagat ttattcttat tcttcttcat ataaatgttt aatatacaat 360 ataaacaaat tctttacctt aagaaggatt tcccatttta tattttaaaa atatatttat 420 caaatatttt tcaaccacgt aaatctcata ataataagtt gtttcaaaag taataaaatt 480 taactccata atttttttat tcgactgatc ttsaagcaac acccagtgac acaactagcc 540 atttttttct ttgaataaaa aaatccaatt atcattgtat tttttttata caatgaaaat 600 ttcaccaaac aatcatttgt ggtatttctg aagcaagtca tgttatgcaa aattctataa 660 ttcccatttg acactacgga agtaactgaa gatctgcttt tacatgcgag acacatcttc 720 taaagtaatt ttaataatag ttactatatt caagatttca tatatcaaat actcaatatt 780 acttctaaaa aattaattag atataattaa astattactt ttttaatttt aagtttaatt 840 --~.f gttgaatttg tgactattgart~ttattattc tactatgttt aaattgtttt atagatagtt'900 taaagtaaat ataagtaatg tagtagagtg ttagagtgtt accctaaacc ataaactata 960 acatttatgg tggactaatt ttcatatatt tcttattgct tttacctttt cttggtatgt 1020 aagtccgtaa ctagaattac agtgggttgc catggcactc tgtggtcttt tggttcatgc 1080 atgggtcttg cgcaagaaaa agacaaagaa caaagaaaaa agacaaaaca gagagacaaa 1140 acgcaatcac acaaccaact caaattagtc actggctgat caagatcgcc gcgtccatgt 1200 atgtctaaat gccatgcaaa gcaacacgtg cttaacatgc actttaaatg gctcacccat 1260 ctcaacccac acacaaacac attgcctttt tcttcatcat caccacaacc acctgtatat 1320 attcattctc ttccgccacc tcaatttctt cacttcaaca cacgtcaacc tgcatatgcg 1380 tgtcatccca tgcccaaatc tccatgcatg ttccaaccac cttctctctt atataatacc 1440 tataaatacc tctaatatca ctcacttctt tcatcatcca tccatccaga gtactactac 1500 tctactacta taatacccca acccaactca tattcaatac tactctacta tttaaatg 1558 <210>11 <211>983 <212>DNA
<213>Soybean oleosin <220>
<221> promoter <222> (1)..(983) <400> 11 tctagataca tccatttctt aatataatcc tcttattcaa attgcaattg cccagatctc 60 tgtatggact atggctcgag gaatccatac atagagacat tecccactcg tatacttgta 120 tctgtcaaat gcctattgtt gttgaaataa agtaacttat gcatcaattt aaaacaatca 180 gttaggtaat ggacattaat gtaaagtgac ttatagaatc catatttgta taacaaagat 240 gttgagacat tgactattgt gtgctatttc aaaaagatct gataaaatat ttaataaaac 300 attaaaagta aataataatc gctaaacatc atcaaaatat taatcattag attcaatttt 360 tgcaattaaa accagaagaa ctaaatctac attaccatac ttaagactat acatagagaa 420 ttaaacttaa tgttatattt aaggaaaagg acgaaactta naacttaaat acaattgtat 480 gattaatttt agtattgtct ttaatgagaa ttaaagtttt attcactaat ttatgattat 540 ttcatttact aatttatgta atgtgatttc aataagtgag gtsaactccg attgattgaa 600 gataccacca acaccaacac caccaccacc tgcgaaactg tacgtatctc aattgtcctt 660 aataaaaatg taaatagtac attattctcc ttgcctgtca ttatttatgt gcccccagct 720 ttaatttttc tgatgtactt aacccagggc aaactgaaac atgttcctca tgcasagccc 780 caactcatca tgcatcatgt acgtgtcatc atccagcaac tccacttttg ctatataact 840 cctcccccat cacactcccc atctctctaa cacacacata cccccaacta acaataattc 900 cttcacttgc agaacttagt tctctgttgc atcatcatca tcttcattag tgttagccct 960 aacttcacct taaccattta atg 983 <210> 12 <211> 38 <212> DNA
<213> Artificial Seguence <220>
<223> Description of Artificial Sequence:
< , ... ,. " a ~, oligonucleotide primer- Primer JA408 , _ . . . s , " . . _ ".. , ... . . :. ., <400> 12 tctagataca tccatttett aatataatcc tcttattc 38 <210> 13 <211> 30 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- npl <400> 13 catttaaatg gttaaggtga aggtagggct 30 <210> 14 <211> 30 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequences oligonucleotide primer- Primer apt <400> 14 aagcttaaaa tgatttacga aggtaaagcc 30 <210> 15 <211> 22 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer np3 <400> 15 attgctttca gttgaagcgc tg 22 <210> 16 <211> 30 <212> DNA
<213> Artificial Sequence <a2o>
<223> Description of Artificial Sequence:
,.":, : . , of gonucleotide primer- Primer ap4 . . ~ , ,. , . . . . . .
<400> 16 aagcttaaaa tgagcctgaa tccaagagac 30 <210>17 <211>28 <212>DNA
<213>Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer np5 <400> 17 aagctttcag acgcgttcga agacagtg 28 <210> 18 <211> 56 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer N-fox2b <400> 18 tcccccggga ggaggttttt attatgcctg gaaatttatc cttcaaagat agagtt 56 <210> 19 <211> 29 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequences oligonucleotide primer- Primer N-bamfox2b <400> 19 aaggatcctt gatgtcattt acaactacc 29 <210> 20 <211> 28 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer C-fox2~ 't,~'e" , ~ ~ ' ~ .
<400> 20 gctctagata gggaaagatg tatgtaag 28 <210> 21 <211> 19 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer C-bamfox2 WO 99/45122 PC'T/US99/04999 <400> 21 tgacatcaag gatcctttt 19 <210> 22 <211> 39 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer CiVR396 <400> 22 gatttaaatg caagcttaaa taagtatgaa ctaaaatgc 39 <210> 23 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequences oligonucleotide primer- Primer G1VR397 <400> 23 cggtacctta gttggtaggg tgcta 25 <210> 24 <211> 2162 <212> DNA
<213> Pseudomonas putida <220>
<221> gene <222> (1) .. (216-2) . , ~ ,..,. . , <223> FaoA gene sequence <400> 24 aagcttaaat gatttacgaa ggtaaagcca tcacggttaa ggctcttgaa agtggcatcg 60 tcgagctcaa gttcgacctc aagggtgagt ccgtcaacaa gttcaaccgc cttaccctga 120 acgagctgcg ccaggccgtc gatgccatcc gggccgatgc ttcggtcaag ggcgtgatcg 180 tcaggagtgg caaggacgtg ttcatcgtcg gcgccgacat caccgagttc gtcgacaact 240 tcaagctgcc tgaggccgaa ctggtcgctg gcaacctgga agccaatcgc atcttcaacg 300 cgttcgaaga cctcgaagtg ccgaccgttg ccgccatcaa cggcatcgcg ctgggcggcg 360 gcctggaaat gtgcctggcg gccgactacc gggtcatgtc caccagcgcc aggatcggcc 420 tgccggaagt caagctgggt atctacccgg gctttggcgg taccgtgcgc ctgccgcgcc 480 tgatcggctc ggacaacgcc atcgagtgga tcgccgccgg caaggaaaac cgtgccgaag 540 atgccctgaa agtgggggcc gtcgacgccg tggtcgcccc tgagctgctg ctggccggtg 600 ccctcgacct gatcaagcgt gccatcagtg gcgagctgga ctacaaggccwaagcgccagc 660 cgaagctgga aaagctcaag ctcaatgcca tcgagcagat gatggccttc gagactgcca 720 agggcttcgt cgctggccag gccggcccga actacccggc cccggtcgaa gcgatcaaga 780 gcatccagaa agccgccaac ttcggtcgcg acaaggccct ggaagtcgaa gccgcaggct 840 ttgccaagct ggccaagacc tctgtcgccg agagcctgat cggcttgttc ctcaacgatc 900 aggaactcaa gcgcaaggcc aaggcgcatg acgagatcgc ccacgacgtg aagcaggccg 960 ccgtgctcgg cgccggcatc atgggcggcg gtatcgccta ccagtcggcg gtcaaaggta 1020 cgccgatcct gatgaaggac atccgcgagg aagccattca gctgggtctg aacgaggcct 1080 ccaagttgct tggcaaccgc gtcgagaagg gccgcctgac cccggccaag atggccgagg 1140 ccctcaacgc cattcgcccg accctgtcct atggcgattt cgccaatgtc gacatcgtcg 1200 tcgaggctgt ggtcgagaac ccgaaggtca agcaagcggt actggcggaa gtggaaggcc 1260 aggtgaagga cgatgcgatc ctcgcttcca acacctctac catctccatc aacctgctgg 1320 ccaaggcgct caagcgcccg gaaaacttcg tcggcatgca cttcttcaac ccggtgcaca 1380 tgatgccgct ggttgaagtg atccgtggcg agaagtccag tgacgtggcg gtcgccacca 1440 ccgtggccta cgccaagaaa atgggcaaga acccgatcgt ggtcaacgac tgcccgggct 1500 ttttggtcaa ccgcgtgctg ttcccgtact ttggcggttt tgccaagctg gtcagcgccg 1560 gtgtcgactt cgtgcgcatc gacsaggtca tggagaagtt cggctggccg atgggcccag 1620 cctacttgat ggacgtggtc ggcatcgaca ccggccacca cggccgtgac gtcatggccg 1680 aaggcttccc ggatcgcatg aaggacgagc gccgctcggc agtcgacgcg ttgtacgagg 1740 ccaaccgcct gggccagaag aacggtaagg gcttctacgc ctacgaaacc gacaagcgcg 1800 gcaagccgaa gaaggtcttc gatgccaccg tgctcgacgt gctcaaaccg atcgtgttcg 1860 agcagcgtga agtcactgac gaagacatca tcaactggat gatggtcccg ctgtgccttg 1920 agaccgtgcg ttgcctggaa gacggcatcg tcgsaaccgc tgccgaagcc gacatgggcc 1980 tggtctacgg cattggtttc cctcccttcc gcggtggtgc gctgcgttac atcgactcga 2040 tcggtgtggc cgaattcgtc gccctggccg atcagtatgc cgacctgggg ccgctgtacc 2100 acccgaccgc caagctgcgt gaaatggcca agaacggcca gcgcttcttc aactgaaagc 2160 tt 2162 <210> 25 <211> 1190 <212> DNA
<213> Pseudomonas putida <220>
<221> gene .
<222> (1)..(1190) <223> FaoB gene sequence <400> 25 aagcttaaat gagcctgaat ccaagagacg tggtgattgt cgacttcggt cgcacgccaa 60 tgggccgctc caagggtggc atgcaccgca acacccgcgc cgaagacatg tcggcgcacc 120 tgatcagcaa gctgctggaa cgcaacggca aggtcgaccc gaaagaagtc gaggacgtga 180 tctggggctg cgtcaaccag accctggagc agggctggaa catcgcccgc atggcttcgc 240 tgatgacccc gatcccgcac acctctgcgg cgcagaccgt cagccgcctg tgcggctcgt 300 ccatgagcgc gctgcacacg gccgcccagg cgatcatgac cggtaacggt gatgtgttcg 360 tggtcggtgg cgtggagcac atgggccacg tcagcatgat gcatggcgta gaccccaacc 420 cgcacctgtc cttgcatgcc gccaaggctt ccgggatgat gggcctgact gcagaaatgc 480 tcggcaagat gcacggcatc acccgtgagc agcaggacct gttcggcttg cgttcgcacc 540 agctggccca caaggccacg gtcgaaggca agttcaagga cgagatcatc~ccgatgcagg 600 gctacgacga gaacggcttc ctgaaggtgt tcgatttcga cgaaaccatt cgcccggaaa 660 ccaccctcga aggcctggca tcgctcaagc ctgcgttcaa cccgaaaggc ggtacggtca 720 cggccggtac ctcgtcgcag atcaccgacg gcgcctcgtg catgatcgtc atgtccggtc 780 agcgtgccat ggacctcggt atccagccat tggcggtgat ccgttcgatg gcagtggccg 840 gtgtcgaccc ggcaatcatg ggctacggcc cggtgccatc gacccagaaa gccctcaagc 900 gtgcgggctt gaccatggcc gatatcgact tcatcgagct caacgaagcc ttcgctgcgc 960 aggccctgcc cgtgctgaaa gacttgaaag tgctcgacsa gatggatgag aaggttaacc 1020 tgcacggcgg cgccattgct ttgggccacc cgttcggttg ctccggggcg cggatttccg 1080 gcaccctgct caacgtcatg aagcaaaatg gcggtacgct gggtgttgcg accatgtgcg 1140 tcggcctggg ccaaggtatc accactgtct tcgaacgcgt ctgaaagctt 1190 <210> 26 <211> 391 <212> PRT
<213> Pseudomonas putida <220>
<221> P$PTID$
<222> (1) .. (391) <223> FaoB amino acid sequence <400> 26 Met Ser Leu Asn Pro Arg Asp Val Val Ile Val Asp Phe Gly Arg Thr Pro Met Gly Arg Ser Lys Gly Gly Mat His Arg Asn Thr Arg Ala Glu Asp Mat Ser Ala His Leu Ile Ser Lys Leu Leu Glu Arg Asn Gly Lys Val Asp Pro Lys Glu Val Glu Asp Val Ile Trp Gly Cys Val Asn Gln Thr Leu Glu Gln Gly Trp Asa Ile Ala Arg Met Ala Ser Leu Met Thr Pro Ile Pro His Thr Ser Ala Ala Gln Thr Val Ser Arg Leu Cys Gly Ser Ser Mat Ser Ala Leu His Thr Ala Ala Gln Ala Ile Met Thr Gly Asn Gly Asp Val Phe Val Val Gly Gly Val Glu His Met Gly His Val Ser Mat Met His Gly Val Asp Pro Asn Pro His Leu Ser Leu His Ala Ala Lys Ala Ser Gly Met Met Gly Leu Thr Ala Glu Met Leu Gly Lys Met His Gly Ile Thr Arg Glu Gln Gln Asp Leu Phe Gly Leu Arg Ser His Gln Lau Ala His Lys Ala Thr Val Glu Gly Lys Phe Lys Asp Glu Ile Ile Pro Met Gln Gly Tyr Asp Glu Asn Gly Phe Lau Lys Val Phe Asp Phe Asp Glu Thr Ile Arg Pro Glu Thr Thr Leu Glu Gly Leu Ala Ser Leu Lys Pro Ala Phe Asn Pro Lys Gly Gly Thr Val Thr Ala Gly Thr Ser Ser Gln Ile Thr Asp Gly Ala Ser Cys Mat Ile Val Met Ser Gly Gln Arg Ala Met Asp Leu Gly Ile Gln Pro Leu Ala Val Ile Arg Ser Met Ala Val Ala Gly Val Asp Pro Ala Ile Met Gly Tyr Gly Pro Val Pro Ser Thr Gln Lys Ala Leu Lys Arg Ala Gly Leu Thr Met Ala Aap Ile Asp Phe Ile Glu Leu Asn Glu Ala Phe Ala Als Gln Ala Leu Pro Val Leu Lys Asp Leu Lys Val Leu Asp Lys Met Asp Glu Lys Val Asn Leu His Gly Gly Ala Ile Ala Leu Gly His Pro Phe Gly Cys Ser Gly Ala Arg Ile Ser Gly Thr Leu Leu Asn Val Met Lys Gln Asn Gly Gly Thr Leu Gly Val Ala Thr Mat Cys Val Gly Leu Gly Gln Gly Ile Thr Thr Val Phe Glu Arg Val <210> 27 <211> 1244 <212> DNA
<213> Bean Phaseolin <220>
<221> termiaator <222> (1)..(1244) <400> 27 gatttaaatg caagcttaaa taagtatgaa ctaaaatgca tgtaggtgta agagctcatg 60 gagagcatgg aatattgtat ccgaccatgt aacagtataa taactgagct ccatctcact 120 tcttctatga ataaacaaag gatgttatga tatattaaca ctctatctat gcaccttatt 180 gttctatgat aaatttcctc ttattattat aaatcatctg aatcgtgacg gcttatggaa 240 tgcttcaaat agtacaaaaa caaatgtgta ctataagact ttctaaacaa ttctaacttt 300 agcattgtga acgagacata agtgttaaga agacataaca attataatgg aagaagtttg 360 tctccattta tatattatat attacccact tatgtattat attaggatgt taaggagaca 420 taacaattat aaagagagaa gtttgtatcc atttatatat tatatactac ccatttatat 480 attatactta tccacttatt taatgtcttt ataaggtttg atccatgata tttetaatat 540 tttagttgat atgtatatga aagggtacta tttgaactct cttactctgt ataaaggttg 600 gatcatcctt aaagtgggtc tatttaattt tattgcttct tacagataaa aaaaaaatta 660 tgagttggtt tgataaaata ttgaaggatt taaaataata ataaataata aataacatat 720 aatatatgta tataaattta ttataatata acatttatct ataaaaaagt aaatattgtc 780 ataaatctat acaatcgttt agccttgctg gacgactctc aattatttaa acgagagtaa 840 acatatttga ctttttggtt atttaacaaa ttattattta acactatatg aaattttttt 900 tttttatcgg caaggaaata aaattaaatt aggagggaca atggtgtgtc ccaatcctta 960 tacaaccaac ttccacagga aggtcaggtc ggggacaaca aaaaaacagg caagggaaat 1020 tttttaattt gggttgtctt gtttgctgca taatttatgc agtaaaacac tacacataac 1080 ccttttagca gtagagcaat ggttgaccgt gtgcttagct tcttttattt tattttttta 1140 tcagcaaaga ataaataaaa taaaatgaga cacttcaggg atgtttcaac ccttatacaa 1200 aaccccaaaa acaagtttcc tagcacccta ccaactaagg tacc 1244 <210> 28 <211> 225 <212> DNA
<213> Soybean oleosin <220>
<221> tex~iaator <222> (1)..(225) <400> 28 aagcttacgt gatgagtatt aatgtgttgt tatgaactta tgatgttggt ttatgtgggg 60 aaataaatga tgtatgtacc tcttcttgcc tatgtagtag gtttgggtgt tttgttgtct 120 agctttgctt atttagtaat tagtagaagg gatgttcgtt cgtgtctcat aaaaaggggt 180 actaccactc tggcaatgtg atttgtattt gatgaattgt ctaga 225 <210> 29 <211> 34 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer JA410 <400> 29 aagcttacgt gatgagtatt aatgtgttgt tatg 34 <210> 30 <211> 34 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer JA411 <400> 30 tctagacaat tcatcaaata caaatcacat tgcc 34 <210> 31 <211> 715 <212> PRT
<213> Pseudomonas putida <220>
<221> PgPTIDE
<222> (1) . . (715) <223> FaoA amino acid sequence <400> 31 Mat Ile Tyr Glu Gly Lys Ala Ile Thr Val Lys Ala Leu Glu Ser Gly Ile Val Glu Leu Lys Phe Asp Leu Lys Gly Glu Ser Val Asn Lys Phe Asn Arg Lau Thr Leu Asn Glu Leu Arg Gln Ala Val Asp Ala Ile Arg Ala Asp Ala Ser Val Lys Gly Val Ile Val Arg Ser Gly Lys Asp Val Phe Ile Val Gly Ala Asp Ile Thr Glu Phe Val Asp Asn Phe Lys Leu Pro Glu Ala Glu Leu Val Ala Gly Asn Leu Glu Ala Asn Arg Ile Phe Asn Ala Phe Glu Asp Leu Glu Val Pro Thr Val Ala Ala Ile Asn Gly Ile Ala Leu Gly Gly Gly Leu Glu Met Cys Leu Ala Ala Aap Tyr Arg Val Mat Ser Thr Ser Ala Arg Ile Gly Leu Pro Glu Val Lys Leu Gly Ile Tyr Pro Gly Phe Gly Gly Thr Val Arg Leu Pro Arg Leu Ile Gly Ser Asp Asn Ala Ile Glu Trp Ile Ala Ala Gly Lys Glu Asn Arg Ala Glu Asp Ala Leu Lys Val Gly Ala Val Asp Ala Val Val Ala Pro Glu Leu Leu Leu Ala Gly Ala Leu Asp Leu Ile Lys Arg Ala Ile Ser Gly Glu Leu Asp Tyr Lys Ala Lys Arg Gln Pro Lys Leu Glu Lys Leu Lys Leu Asn Ala Ile Glu Gln Met Met Ala Phe Glu Thr Ala Lys Gly Phe wVal-Ala~Gly Gln Ala Gly Pro Asn Tyr Pro Ala Pro Val Glu Ala Ile Lys Ser Ile Gln Lys Ala Als Asn Pha Gly Arg Asp Lys Ala Leu Glu Val Glu Ala Ala Gly Phe Ala Lys Leu Ala Lys Thr Ser Val Ala Glu Ser Leu Ile Gly Leu Phe Leu Asn Asp Gln Glu Leu Lys Arg Lys Ala Lys Ala His Asp Glu Ile Ala His Asp Val Lys ~Gln Ala Ala Val Leu Gly Ala Gly Ile Met Gly Gly Gly Ile Ala Tyr Gln Ser Ala Val Lys Gly Thr Pro Ile Leu Mat Lys Asp Ile Arg Glu Glu Ala Ile Gln Leu Gly Leu Asn Glu Ala Ser Lys Leu Leu Gly Asn Arg Val Glu Lys Gly Arg Leu Thr Pro Ala Lys Met Ala Glu Ala Leu Asn Ala Ile Arg Pro Thr Leu Ser Tyr Gly Asp Phe Ala Asn Val Asp Ile Val Val Glu Ala Val Val Glu Asn Pro Lys Val Lys Gln Ala Val Leu Ala Glu Val Glu Gly Gln Val Lys Asp Asp Ala Ile Leu Ala Ser Asn Thr Ser Thr Ile Ser Ile Asn Leu Leu Ala Lys Ala Leu Lys Arg Pro Glu Aan Phe Val Gly Met His Phe Phe Asn Pro Val His Met Met Pro Leu Val Glu Val Ile Arg Gly Glu Lys Ser Ser Asp Val Ala Val Ala Thr Thr Val Ala Tyr Ala Lys Lys Met Gly Lys Asn Pro Ile Val Val Asn Asp Cys Pro Gly''Phe'Leu'Val Asn Arg Val Leu Phe Pro Tyr Phe Gly Gly Phe Ala '' Lys Leu Val Ser Ala Gly Val Asp Phe Val Arg Ile Asp Lys Val Met Glu Lys Phe Gly Trp Pro Met Gly Pro Ala Tyr Leu Met Asp Val Val Gly Ile Asp Thr Gly His His Gly Arg Asp Val Met Ala Glu Gly Phe Pro Asp Arg Mat Lys Asp Glu Arg Arg Ser Ala val Asp Ala Leu Tyr WO 99!45122 PCTNS99/04999 Glu Ala Asn Arg Leu Gly Gln Lys Asn Gly Lys Gly Phe Tyr Ala Tyr Glu Thr Asp Lys Arg Gly Lys Pro Lys Lys Val Phe Aap Ala Thr Val Leu Asp Val Leu Lys Pro Ile Val Phe Glu Gla Arg Glu Val Thr Asp Glu Asp Ile Ile Asn Trp Met Met Val Pro Leu Cys Leu Glu Thr Val Arg Cys Leu Glu Asp Gly Ile Val Glu Thr Ala Ala Glu Ala Asp Met Gly Leu Val Tyr Gly Ile Gly Phe Pro Pro Phe Arg Gly Gly Ala Leu Arg Tyr Ile Asp Ser Ile Gly Val Ala Glu Phe Val Ala Leu Ala Asp Gln Tyr Ala Asp Leu Gly Pro Leu Tyr His Pro Thr Ala Lys Leu Arg Glu Mat Ala Lys Asn Gly Gln Arg Phe Phe Aan
5,245,023 and 5,250,430; Huisman et. al., J. Biol. Chem. 266(4): 2191-98 (1991)) is suitable for producing the long pendant group PHAs. Plant genes encoding (3-ketothiolase also have been identified (Vollack & Bach, Plant Physiol. 111:1097-107 (1996)).
Despite this ability to modify monomer composition by selection of 10 the syntheses and substrates, it is desirable to modify other features of polymer biosynthesis, such as that which involves fatty acid metabolism.
It is therefore an object of the present invention to provide a method and DNA constructs to introduce fatty acid oxidation enzyme systems for manipulating the cellular metabolism of plants.
15 It is another object of the present invention to provide methods for enhancing the production of PHAs in plants, preferably in the oilseeds thereof.
Summary Of The Invention 20 Methods and systems to modify fatty acid biosynthesis and oxidation in plants to make new polymers are described. Two enzymes are essential: a hydratase such as D-specific enoyl-CoA hydratase, for example, the hydratase obtained from Aeromonas caviae, and a (3-oxidation enzyme system. Some plants have a j3-oxidation enzyme system which is sufficient 25 to modify polymer synthesis when the plants are engineered to express the hydratase.
Examples demonstrate production of polymer by expression of these enzymes in transgenic plants. Examples also demonstrate that modifications in fatty acid biosynthesis can be used to alter plant phenotypes, decreasing or 30 eliminating seed production and increasing green plant biomass, as well as producing PHAs.
Brief Description of the Drawings Figure 1 is a schematic of fatty acid ~i-oxidation routes to produce polyhydroxyalkanoate monomers.
Figure 2 is a schematic showing plasmid constructs pSBS2024 and pSBS2025.
Figures 3A and 3B are schematics showing plasmid constructs pCGmf124 and pCGmf125.
Figures 4A and 4B are schematics showing plasmid constructs pmf1249 and pmf1254.
Figures SA and SB are schematics showing plasmid constructs pCGmf224 and pCGmfZ25.
Figures 6A and 6B are schematics showing plasmid constructs pCGmfIP2S and pCGmf2PlS.
Detailed Description Of The Invention Methods and DNA constrocts for manipulating the cellular metabolism of plants by introducing fatty acid oxidation enzyme systems into the cytoplasm or plastids of developing oilseeds or green tissue are provided. Fatty acid oxidation systems typically comprise several enzyme activities including a ~i-ketothiolase enzyme activity which utilizes a broad range of ~i-ketoacyl-CoA substrates.
It surprisingly was found that expression of at least one of these transgenes from the bean phaseolin promoter results in male sterility.
Interestingly, these plants did not set seed, but instead produced higher than normal levels of biomass (e.g., leafs, stems, stalks). Therefore the methods and constructs described herein also can be used to create male sterile plants, for example, for hybrid production or to increase the production of biomass of forage, such as alfalfa or tobacco. Plants generated using these methods and DNA constructs are useful for producing polyhydroxyalkanoate biopolymers or for producing novel oil compositions.
The methods described herein include the subsequent incorporation of additional transgenes, in particular encoding additional enzymes involved in fatty acid oxidation or polyhydroxyalkanoate biosynthesis. For polyhydroxyalkanoate biosynthesis, the methods include the incorporation of transgenes encoding enzymes, such as NADH and/or NADPH acetoacetyl-CoenzymeA reductases, PHB synthases, PHA synthases, acetoacetyl-CoA
thiolase, hydroxyacyl-CoA epimerases, delta3-cis-delta2-trans enoyl-CoA
isomerases, acyl-CoA dehydrogenase, acyl-CoA oxidase and enoyl-CoA
hydratases by subsequent transformation of the transgenic plants produced using the methods and DNA constructs described herein or by traditional plant breeding methods.
I. Plant Expression Systems In a preferred embodiment, the fatty acid oxidation transgenes are expressed from a seed specific promoter, and the proteins are expressed in the cytoplasm of the developing oilseed. In an alternate preferred embodiment, fatty acid oxidation transgenes are expressed from a seed specific promoter and the expressed proteins are directed to the plastids using plastid targeting signals. In another preferred embodiment, the fatty acid oxidation transgenes are expressed directly from the plastid chromosome where they have been integrated by homologous recombination. The fatty acid oxidation transgenes may also be expressed throughout the entire plant tissue from a constitutive promoter. It is also useful to be able to control the expression of these transgenes by using promoters that can be activated following the application of an agrochemical or other active ingredient to the crop in the field. Additional control of the expression of these genes encompassed by the methods described herein include the use of recombinase technologies for targeted insertion of the transgenes into specific chromosomal sites in the plant chromosome or to regulate the expression of the transgenes.
The methods described herein involve a plant seed having a genome including (a) a promoter operably linked to a first DNA sequence and a 3'-untranslated region, wherein the first DNA sequence encodes a fatty acid oxidation polypeptide and optionally (b) a promoter operably linked to a second DNA sequence and a 3'-untranslated region, wherein the second WO 99145122 PC'TNS99/04999 DNA sequence encodes a fatty acid oxidation polypeptide. Expression of the two transgenes provides the plant with a functional fatty acid f3-oxidation system having at least (3-ketothiolase, dehydrogenase and hydratase activities in the cytoplasm or plastids other than peroxisomes or glyoxisomes. The first and/or second DNA sequence may be isolated from bacteria, yeast, fungi, algae, plants, or animals. It is preferable that at least one of the DNA
sequences encodes a polypeptide with at least two, and preferably three, enzyme activities.
Transformation Vectors DNA constructs useful in the methods described herein include transformation vectors capable of introducing transgenes into plants. Several plant transformation vector options are available, including those described in "Gene Transfer to Plants" (Potrykus, et al., eds.) Springer-Verlag Berlin Heidelberg New York (1995); "Transgenic Plants: A Production System for Industrial and Pharmaceutical Proteins" (Owen, et al., eds.) John Wiiey &
Sons Ltd. England (1996); and "Methods in Plant Molecular Biology: A
Laboratory Course Manual" (Malign, et al. eds.) Cold Spring Laboratory Press, New York (1995). Plant transformation vectors generally include one or more coding sequences of interest under the transcriptional control of S' and 3' regulatory sequences, including a promoter, a transcription termination and/or polyadenylation signal, and a selectable or screenable marker gene. The usual requirements for 5' regulatory sequences include a promoter, a transcription termination and/or a polyadenylation signal. For the expression of two or more polypeptides from a single transcript, additional RNA processing signals and ribozyme sequences can be engineered into the construct (LJ.S. Patent No. 5,519,164). This approach has the advantage of locating multiple transgenes in a single locus, which is advantageous in subsequent plant breeding efforts. An additional approach is to use a vector to specifically transform the plant plastid chromosome by homologous recombination (U.S. Patent No. 5,545,818), in which case it is possible to take advantage of the prokaryotic nature of the plastid genome and insert a number of transgenes as an operon.
Promoters A large number of plant promoters are known and result in either constitutive, or environmentally or developmentally regulated expression of the gene of interest. Plant promoters can be selected to control the expression of the transgene in different plant tissues or organelles for all of which methods are known to those skilled in the art (Gasser & Fraley, Science 244:1293-99 (1989)). The S' end of the transgene may be engineered to include sequences encoding plastid or other subcellular organelle targeting peptides linked in-frame with the transgene. Suitable constitutive plant promoters include the cauliflower mosaic virus 355 promoter (CaM~ and enhanced CaMV promoters (Odell et. al., Nature, 313: 810 (1985)), actin promoter (McElroy et al., Plant Cell 2:163-71 (1990)}, AdhI promoter (Fromm et. al., BiolTechnology 8:833-39 (1990);
Kyozuka et al., Mol. Gen. Genet. 228:40-48 ( 1991 )), ubiquitin promoters, the 15 Figwort mosaic virus promoter, mannopine synthase promoter, nopaline synthase promoter and octopine synthase promoter. Useful regulatable promoter systems include spinach nitrate-inducible promoter, heat shock promoters, small subunit of ribulose biphosphate carboxylase promoters and chemically inducible promoters (U.S. Patent No. 5,364,780 to Hershey et al.).
In a preferred embodiment of the methods described herein, the transgenes are expressed only in the developing seeds. Promoters suitable for this purpose include the napin gene promoter (LJ.S. Patent Nos. 5,420,034 and 5,608,152), the acetyl-CoA carboxylase promoter (LT.S. Patent Nos.
25 5,420,034 and 5,608,152), 2S albumin promoter, seed storage protein promoter, phaseolin promoter (Slightom et. al., Proc. Natl. Acad. Sci. USA
80:1897-1901 (1983)), oleosin promoter (Plant et. al., Plant Mol. Biol.
25:193-205 (1994); Rowley et al., Biochim. Biophys. Acta. 1345:1-4 (199?};
U.S. Patent No. 5,650,554; and PCT WO 93/20216), zero promoter, glutelin promoter, starch synthase promoter, and starch branching enzyme promoter.
The transformation of suitable agronomic plant hosts using these vectors can be accomplished with a variety of methods and plant tissues.
Representative plants useful in the methods disclosed herein include the Brassica family including napus, rappa, sp. carinata and juncea; maize;
soybean; cottonseed; sunflower; palm; coconut; safflower; peanut; mustards including Sinapis alba; and flax. Crops harvested as biomass, such as silage 5 corn, alfalfa, or tobacco, also are useful with the methods disclosed herein.
Representative tissues for transformation using these vectors include protoplasts, cells, callus tissue, leaf discs, pollen, and meristems.
Representative transformation procedures include Agrobacterium-mediated transformation, biolistics, microinjection, electroporation, polyethylene glycol-mediated protoplast transformation, liposome-mediated transformation, and silicon fiber-mediated transformation (U. S. Patent No.
5,464,765; "Gene Transfer to Plants" (Potrykus, et al., eds.) Springer-Verlag Berlin Heidelberg New York (1995); "Transgenic Plants: A Production System for Industrial and Pharmaceutical Proteins" (Oven, et al., eds.) John 15 Wiley & Sons Ltd. England (1996); and "Methods in Plant Molecular Biology: A Laboratory Course Manual" (Malign, et al. eds.) Cold Spring Laboratory Press, New York (1995)).
II. Methods for Making and Screening for Transgenic Plants In order to generate transgenic plants using the constructs described 20 herein, the following procedures can be used to obtain a transformed plant expressing the transgenes subsequent to transformation: select the plant cells that have been transformed on a selective medium; regenerate the plant cells that have been transformed to produce differentiated plants; select transformed plants expressing the transgene at such that the level of desired 25 polypeptide is obtained in the desired tissue and cellular location.
For the specific crops useful for practicing the described methods, transformation procedures have been established, as described for example, in "Gene Transfer to Plants" (Potrykus, et al., eds.) Springer-Verlag Berlin Heidelberg New York (1995); "Transgenic Plants: A Production System for 30 Industrial and Pharmaceutical Proteins" (Owen, et al., eds.) John Wiley &
Sons Ltd. England (1996); and "Methods in Plant Molecular Biology: A
Laboratory Course Manual" (Malign, et al. eds.) Cold Spring Laboratory Press, New York (1995).
Brassica napus can be transformed as described, for example, in U. S.
Patent Nos. 5,188,958 and 5,463,174. Other Brassica such as rappa, carinata and juncea as well as Sinapis alba can be transformed as described S by Moloney et. aL, Plant Cell Reports 8:238-42 (1989). Soybean can be transformed by a number of reported procedures (U.S. Patent Nos.
5,015,580; 5,015,944; 5,024,944; 5,322,783; 5,416,011; and 5,169,770).
Several transformation procedures have been reported for the production of transgenic maize plants including pollen transformation (U.S. Patent No.
5,629,183), silicon fiber-mediated transformation (U.S. Patent No.
5,464,765), electroporation of protoplasts (LJ.S. Patent Nos. 5,231,019;
5,472,869; and 5,384,253) gene gun (U.S. Patent Nos. 5,538,877 and 5,538,880 and Agrobacterium-mediated transformation (EP 0 604 662 Al;
PCT WO 94/00977}. The Agrobacterium-mediated procedure is particularly preferred, since single integration events of the transgene constructs are more readily obtained using this procedure, which greatly facilitates subsequent plant breeding. Cotton can be transformed by particle bombardment (U.S.
Patent Nos. 5,004,863 and 5,159,135). Sunflower can be transformed using a combination of particle bombardment and Agrobacterium infection (EP 0 486 233 A2; U.S. Patent No. 5,030,572). Flax can be transformed by either particle bombardment or Agrobacterium-mediated transformation.
Recombinase technologies include the cre-lox, FLP/FRT, and Gin systems.
Methods for utilizing these technologies are described for example in U.S.
Patent No. 5,527,695 to Hodges et al.; Dale & Ow, Proc. Natl. Acad. Sci.
USA 88:10558-62 (1991); Medberry et. al., Nucleic AcidsRes. 23:485-90 (1995).
Selectable Marker Genes Selectable marker genes useful in practicing the methods described herein include the neomycin phosphotransferase gene nptlI (U.S. Patent Nos.
5,034,322 and 5,530,196), hygromycin resistance gene (U.S. Patent No.
5,668,298), bar gene encoding resistance to phosphinothricin (U.S. Patent No. 5,276,268). EP 0 530 129 Al describes a positive selection system which enables the transformed plants to outgrow the non-transformed lines by expressing a transgene encoding an enzyme that activates an inactive compound added to the growth media. Screenable marker genes useful in the methods herein include the (3-glucuronidase gene (Jefferson et. al., EMBO J. 6:3901-07 (1987); U.S. Patent No. 5,268,463) and native or modified green fluorescent protein gene (Cubitt et. al., Trends Biochem Sci.
20:448-55 (1995); Pang et. al., Plant Physiol. 112:893-900 (1996)). Some of these markers have the added advantage of introducing a trait, such as herbicide resistance, into the plant of interest, thereby providing an additional agronomic value on the input side.
In a preferred embodiment of the methods described herein, more than one gene product is expressed in the plant. This expression can be achieved via a number of different methods, including (1) introducing the encoding DNAs in a single transformation event where all necessary DNAs are on a single vector; (2) introducing the encoding DNAs in a co-transformation event where all necessary DNAs are on separate vectors but introduced into plant cells simultaneously; (3) introducing the encoding DNAs by independent transformation events successively into the plant cells i.e. transformation of transgenic plant cells expressing one or more of the encoding DNAs with additional DNA constructs; and (4) transformation of each of the required DNA constructs by separate transformation events, obtaining transgenic plants expressing the individual proteins and using traditional plant breeding methods to incorporate the entire pathway into a single plant.
III. ~i-Oxidation Enzyme Pathways Production of PHAs in the cytosol of plants requires the cytosolic localization of enzymes that are able to produce R-3-hydroxyacyl CoA
thioesters as substrates for PHA synthases. Both eukaryotes and prokaryotes possess a (3-oxidation pathway for fatty acid degradation that consists of a series of enzymes that convert fatty acyl CoA thioesters to acetyl CoA.
While these pathways proceed via intermediate 3-hydroxyacyl CoA, the stereochemistry of this intermediate varies among organisms. For example, the j3-oxidation pathways of bacteria and the peroxisomal pathway of higher eukaryotes degrade fatty acids to acetyl CoA via S-3-hydroxyacyl CoA
(Schultz, "Oxidation of Fatty Acids" in Biochemistry of Lipids, Lipoproteins andMembranes (Vance et al., eds.) pp. 101-06 (Elsevier, Amsterdam 1991)).
5 In Escherichia coli, an epimerase activity encoded by the ~3-oxidation multifunctional enzyme complex is capable of converting S-3-hydroxyacyl CoA to R-3-hydroxyacyl CoA. Yeast possesses a peroxisomal localized fatty acid degradation pathway that proceeds via intermediate R-3-hydroxyacyl CoA (Hiltunen, et al. J. Biol. Chem. 267: 6646-53 (1992);
10 Filppula, et al. J. Biol. Chem. 270:27453-57 (1995)), such that no epimerase activity is required to produce PHAs.
Plants, like other higher eukaryotes, possesses a [i-oxidation pathway for fatty acid degradation localized subcellularly in the peroxisomes (Gerhardt, "Catabolism of Fatty Acids [a and (3 Oxidation]" in Lipid 15 Metabolism in Plants (Moose, Jr., ed.) pp. 527-65 (CRC Press, Boca Raton, Florida 1993)). Production of PHAs in the cytosol of plants therefore necessitates the cytosolic expression of a (3-oxidation pathway, for conversion of fatty acids to R-3-hydroxyacyl CoA thioesters of the correct chain length, as well as cytosolic expression of an appropriate PHA synthase, 20 to polymerize R-3-hydroxyacyl CoA to polymer.
Fatty acids are synthesized as saturated acyl-ACP thioesters in the plastids of plants (Hartwood, "Plant Lipid Metabolism" in Plant Biochemistry (Dey et al., eds.} pp. 237-72 (Academic Press, San Diego 1997}). Prior to export from the plastid into the cytosol, the majority of fatty 25 acids are desaturated via a d9 desaturase. The pool of newly synthesized fatty acids in most oilseed crops consists predominantly of oleic acid (cis 9-octadecenoic acid), stearic acid (octadecanoic acid), and palmitic acid (hexadecanoic acid). However, some plants, such as coconut and palm kernel, synthesize shorter chain fatty acids (C8-14). The fatty acid is 30 released from ACP via a thioesterase and subsequently converted to an acyl-CoA thioester via an acyl CoA synthetase located in the plastid membrane (Andrews, et al., "Fatty acid and lipid biosynthesis and degradation" in Plant Physiology, Biochemistry, and Molecular Biology (Dennis et al., eds.) pp.
345-46 (Longman Scientific & Technical, Essex, England 1990); Harwood, "Plant Lipid Metabolism" in Plant Biochemistry (Dey et al., eds) p. 246 (Academic Press, San Diego 1997)).
The cytosolic conversion of the pool of newly synthesized acyl CoA
thioesters via fatty acid degradation pathways and the conversion of intermediates from these series of reactions to R-3-hydroxyacyl-CoA
substrates for PHA syntheses can be achieved via the enzyme reactions outlined in Figure 1. The PHA synthase substrates are C4-C16 R-3-10 hydroxyacyl CoAs. For saturated fatty acyl CoAs, conversion to R-3-hydroxyacyl CoA thioesters using fatty acids degradation pathways necessitates the following sequence of reactions: conversion of the acyl CoA
thioester to traps-2-enoyl-CoA (reaction 1 ), hydration of traps-2-enoyl-CoA
to R-3-hyddroxy acyl CoA (reaction 2a, e.g. yeast system operates through 15 this route and the Aeromonas caviae D-specific hydratase yields C4-C7 R-3-hydroxyacyl-CoAs), hydration of traps-2-enoyl-CoA to S-3-hydroxy acyl CoA (reaction 2b), and epimerization of S-3-hydroxyacyl CoA to R-3-hydroxyacyl CoA (reaction 5, e.g. cucumber tetrafunctional protein, bacterial systems). If 3-hydroxyacyl CoA is not polymerized by PHA synthase 20 forming PHA, it can proceed through the remainder of the (3-oxidation pathway as follows: oxidation of 3-hydroxyacyl CoA to form (3-keto acyl CoA (reaction 3) followed by thiolysis in the presence of CoA to yield acetyl CoA and a saturated acyl CoA thioester shorter by two carbon units (reaction 4). The acyl CoA thioester produced in reaction 4 is free to re-enter the (3-25 oxidation pathway at reaction 1 and the acetyl-CoA produced can be converted to R-3-hydroxyacyl CoA by the action of (3-ketothiolase (reaction 7) and NADH or NADPH acetoacetyl-CoA reductase (reaction 6). This latter route is useful for producing R-3-hydroxybutyryl-CoA, R-3-hydroxyvaleryl-CoA and R-3-hydroxyhexanoyl-CoA. The 8-3-30 hydroxyacids of four to sixteen carbon atoms produced by this series of enzymatic reactions can be polymerized by PHA syntheses expressed from a transgene, or transgenes in the case of the two subunit synthase enzymes, into PHA polymers.
For A9 unsaturated fatty acyl CoAs, a variation of the reaction sequences described is required. Three cycles of (3-oxidation, as detailed in Figure l, will remove six carbon units yielding an unsaturated acyl CoA
thioester with a cis double bond at position 3. Conversion of the cis double bond at position 3 to a traps double bond at position 2, catalyzed by D3-cis-~2-traps-enoyl CoA isomerase will allow the ~i-oxidation reaction sequences outlined in Figure 1 to proceed. This enzyme activity is present on the microbial (3-oxidation complexes and the plant tetrafunctional protein, but not on the yeast foxl.
Acyl CoA thioesters also can be degraded to a (3-keto acyl CoA and converted to R-3-hydroxyacyl CoA via a NADH or NADPH dependent reductase (reaction 6).
Multifunctional enzymes that encode S-specific hydratase, S-specific 1 S dehydrogenase, ~i-ketothiolase, epimerase and D3-cis-D2-traps-enoyl CoA
isomerase activities have been found in bacteria such as Escherichia coli (Spratt, et al., J. Bacteriol. 158:535-42 (1984)) and Pseudomonas fragi (Immure, et al., J. Biochem. 107:184-89 ( 1990)). The multifunctional enzyme complexes consist of two copies of each of two subunits such that catalytically active protein forms a heterotetramer. The hydratase, dehydrogenase, epimerase, and 43-cis-DZ-traps-enoyl CoA isomerase activities are located on one subunit, whereas the thiolase is located on another subunit. The genes encoding the enzymes from organisms such as E.
coli (Spratt, et al., J. Bacteriol. 158:535-42 (1984); DiRusso, J. Bacteriod.
172:6459-68 (1990)) and P. fragi (Sato, et al., J. Biochem. 111:8-15 (1992)) have been isolated and sequenced and are suitable for practicing the methods described herein. Furthermore, the E. coli enzyme system has been subjected to site-directed mutagenesis analysis to identify amino acid residues critical to the individual enzyme activities (He & Yang, Biochemistry 35:9625-30 (1996); Yang et. al., Biochemistry 34:6641-47 (1995); He & Yang, Biochemishy 36:11044-49 (1997); He et. al., Biochemistry 36:261-68 (1997); Yang & Elzinga, J. Biol. Chem. 268:6588-92 (1993)). These mutant genes also could be used in some embodiments of the methods described herein.
Mammals, such as rat, possess a trifunctional (3-oxidation enzyme in their peroxisomes that contains hydratase, dehydrogenase, and D3-cis-42-5 traps-enoyl CoA isomerase activities. The trifunctional enzyme from rat liver has been isolated and has been found to be monomeric with a molecular weight of 78 kDa (Palosaari, et al., J. Biol. Chem. 265:2446-49 (1990)).
Unlike the bacterial system, thiolase activity is not part of the multienzyme protein (Schultz, "Oxidation of Fatty Acids" in Biochemistry ojLipids, 10 Lipoproteins and Membranes (Vance et al., eds) p. 95 (Elsevier, Amsterdam (1991)). Epimerization in rat occurs by the combined activities of two distinct hydratases, one which converts R-3-hydroxyacyl CoA to traps-2-enoyl CoA, and another which converts traps-2-enoyl CoA to S-3-hydroxyacyl CoA (Smeland, et al., Biochemical and Biophysical Research 15 Communications 160:988-92 (1989)). Mammals also possess (3-oxidation pathways in their mitochondria that degrade fatty acids to acetyl CoA via intermediate S-3-hydroxyacyl CoA (Schultz, "Oxidation of Fatty Acids" in Biochemistry of Lipids, Lipoproteins and Membranes (Vance et al., eds) p.
96 (Elsevier, Amsterdam (1991)). Genes encoding mitochondria) (i-20 oxidation activities have been isolated from several animals including a Rat mitochondria) long chain acyl CoA hydratase/3-hydroxy acyl CoA
dehydrogenase (GENBANK Accession # D 16478) and a Rat mitochondria) thiolase (GENBANK Accession #s D13921, D00511).
Yeast possesses a multifunctional enzyme, Fox2, that differs from the 25 (3-oxidation complexes of bacteria and higher eukaryotes in that it proceeds via a R-3-hydroxyacyl CoA intermediate instead of S-3-hydroxyacyl CoA
(Hiltunen, et al., J. Biol. Chem. 267:6646-53 (1992)). Fox2 possesses R-specific hydratase and R-specific dehydrogenase enzyme activities. This enzyme does not possess the ~3-cis-02-traps-enoyl CoA isomerase activity 30 needed for degradation of 09-cis-hydroxyacyl CoAs to form R-3-hydroxyacyl CoAs. The gene encoding jox2 from yeast has been isolated and sequenced and encodes a 900 amino acid protein. The DNA sequence of the structural gene and amino acid sequence of the encoded polypeptide is shown in SEQ ID NO:1 and SEQ ID N0:2.
Plants have a tetrafunctional protein similar to the yeast Fox2, but also encoding a 03-cis-02-traps-enoyl CoA isomerase activity (Muller et., al., J. Biol. Chem. 269:20475-81 ( 1994)). The DNA sequence of the cDNA and amino acid sequence of the encoded polypeptide is shown in SEQ ID N0:3 and SEQ ID N0:4.
IV. Targeting of Enzymes to the Cytoplasm of Oil Seed Crops Engineering PHA production in the cytoplasm of plants requires 10 directing the expression of (3-oxidation to the cytosol of the plant. No targeting signals are present in the bacterial systems, such as faoAB. In fungi, yeast, plants, and mammals, (3-oxidation occurs in subcellular organelles. Typically, the genes are expressed from the nuclear chromosome, and the polypeptides synthesized in the cytoplasm are directed 15 to these organelles by the presence of specific amino acid sequences. To practice the methods described herein using genes isolated from eukaryotic sources, e.g., fatty acid oxidation enzymes from eukaryotic sources, such as yeast, fungi, plants, and mammals, the removal or modification of subcellular targeting signals is required to direct the enzymes to the cytosol.
20 It may be useful to add signals for directing proteins to the endoplasmic reticulum. Peptides useful in this process are well known in the art. The general approach is to modify the transgene by inserting a DNA sequence specifying an ER targeting peptide sequence to form a chimeric gene.
Eukaryotic acyl CoA dehydrogenases, as well as other mitrochondrial 25 proteins, are targeted to the mitochondria via leader peptides on the N-terminus of the protein that are usually 20-60 amino acids long (Norwich, Current Opinion in Cell Biology, 2:625-33 (1990)). Despite the lack of an obvious consensus sequence for mitochondria) import leader peptides, mutagenesis of key residues in the leader sequence have been demonstrated 30 to prevent the import of the mitochrondrial protein. For example, the import of Saccharomyces cerevisiae F1-ATPase was prevented by mutagenesis of its leader sequence, resulting in the accumulation of the modified precursor protein in the cytoplasm (Bedwell, et al., Mol. Cell Biol. 9:1014-25 (1989)) Three eukaryotic peroxisomal targeting signals have been reported (Gould, et al., J. Cell Biol. 108:1657-64 (1989); Brickner, et al., J. Plant Physiol., 113:1213-21 (1997)). The tripeptide targeting signal S/A/C-K/H/R-L occurs at the C-terminal end of many peroxisomal proteins (Gould, et al., J. Cell Biol. 108:1657-64 ( 1989)). Mutagenesis of this sequence has been shown to prevent import of proteins into peroxisomes. Some peroxisomal proteins do not contain the tripeptide at the C-terminal end of the protein. For these proteins, it has been suggested that targeting occurs via the tripeptide in an internal position within the protein sequence (Gould, et al., J. Cell Biol. 108:1657-64 (1989)) or via an unknown, unrelated sequence (Brickner, et al., .l. Plant Physiol. 113:1213-21 (1997)). The results of in vitro peroxisomal targeting experiments with fragments of acyl CoA oxidase from Candida tropicalis appear to support the latter theory and suggest that there are two separate targeting signals within the internal amino acid sequence of the polypeptide (Small, et al., The EMBO Journal 7:1167-73 (1988)). In the aforementioned study, the targeting signals were localized to two regions of 118 amino acids in length, and neither of regions was found to contain the targeting signal S/A/C-K/H/R-L. A small number of peroxisomal proteins appear to contain an amino terminal leader sequence for import into peroxisomes (Brickner, et al., J. Plant Physiol. 113:1213-21 (1997)). These targeting signals can be deleted or altered by site directed mutagenesis.
' V. Cultivation and Harvesting of Transgenic Plant The transgenic plants can be grown using standard cultivation techniques. The plant or plant part also can be harvested using standard equipment and methods. The PHAs can be recovered from the plant or plant part using known techniques such as solvent extraction in conjunction with traditional seed processing technologies, as described in PCT WO 97/15681, or can be used directly, for example, as animal feed, where it is unnecessary to extract the PHA from the plant biomass.
Several Iines which did not produce seed, produced much higher levels of biomass. produced much higher levels of biomass. This phenotype therefore may be useful as a means to increase the amount of green biomass produced per acre for silage, forage, or other biomass crops. End uses include the more cost effective production of forage crops for animal feed or as energy crops for electric power generation. Other uses include increasing biomass levels in crops, such as alfalfa or tobacco, for subsequent recovery of industrial products, such as PHAs by extraction.
The compositions and methods of preparation and use thereof described herein are further described by the following non-limiting examples.
Example 1: Isolation and Characterization of the Pseudomonas putida faoAB Genes and Fao Enzyme All DNA manipulations, including PCR, DNA sequencing E. coli transformation, and plasmids purification, were performed using standard procedures, as described, for example, by Sambrook et. al., Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press, New York (1989)). The genes encoding faoAB from Pseudomonas putida were isolated using a probe generated from P. putida genomic DNA by PCR
(polymerase chain reaction) using primers 1 and 2 possessing homology to faoB from Pseudomonas fragi (Sato, et al., J. Biochem. 111:8-15 (1992)).
Primer 1:
5' gat ggg ccg ctc caa ggg tgg 3' (SEQ D7 NO:S) Primer 2:
5' caa ccc gaa ggt gcc gcc att 3' (SEQ >D N0:6) A 1.1 kb DNA fragment was purified from the PCR reaction and used as a probe to screen a P. putida genomic library constructed in plasmid pBKCMV using the lambda ZAP expression system (Stratagene). Plasmid pMFXI was selected from the positive clones and the DNA sequence of the insert containing the faoAB genes and flanking sequences determined. This is shown in SEQ ID N0:7. A fragment containing faoAB was subcloned with the native P. putida ribosome binding site intact into the expression WO 99/45122 PC'T/US99/04999 vector pTRCN forming plasmid pMFX3 as follows. Plasmid pMFXI was digested with BsrG I. The resulting protruding ends were filled in with Klenow. Digestion with Hind III yielded a 3.39 kb blunt ended/Hind III
fragment encoding FaoAB. The expression vector pTRCN was digested with Sma llHind III and ligated with the faoAB fragment forming the 7.57 kb plasmid pMFX3.
Enzymes in the FaoAB multienzyme complex were assayed as follows. Hydratase activity was assayed by monitoring the conversion of NAD to NADH using the coupling enzyme L-~i-hydroxyacyl CoA
10 dehydrogenase as previously described, except that assays were run in the presence of CoA (Filppula, et al., J. Biol. Chem. 270:27453-57 (1995)).
Severe product inhibitation of the coupling enzyme was observed in the absence of CoA. The assay contained (1 mL final volume) 60 p,M crotonyl CoA, 50 p.M Tris-CI, pH 9, 50 p,g bovine serum albumin per mL, 50 mM
KCI, 1 mM NAD, 7 pg L-specific ~i-hydroxyacyl CoA dehydrogenase from porcine heart per mL, and 0.25 mM CoA. The assay was initiated with the addition of FaoAB to the assay mixture. A control assay was performed without substrate to determine the rate of consumption of NAD in the absence of the hydratase generated product, S-hydroxybutyryl CoA. One unit of activity is defined as the consumption of one p,Mol of NAD per min (ESao = 6220 M'1 cni ~ ).
Hydroxyacyl CoA dehydrogenase was assayed in the reverse direction with acetoacetyl CoA as the substrate by monitoring the conversion of NADH to NAD at 340 nm (Binstock, et aL, Methods in Enzymology, 71:403 (1981)). The assay contained (1 mL final volume) 0.1 M KH2P04, pH 7, 0.2 mg bovine serum albumin per mL, 0.1 mM NADH, and 33 p,M
acetoacetyl CoA. The assay was initiated with the addition of FaoAB to the assay mixture. When necessary, enzyme samples were diluted in 0.1 M
KH2POa, pH 7, containing 1 mg bovine serum albumin per mL. A control 30 assay was performed without substrate acetoacetyl CoA to detect the rate of consumption of NADH in the crude due to enzymes other than hydroxyacyl CoA dehydrogenase. One unit of activity is defined as the consumption of WO 99/45122 PC'TNS99/04999 one p.Mol of NADH per minute (s3~o= 6220 M-'crri').
HydroxyacylCoA dehydrogenase was assayed in the forward direction with crotonyl CoA as a substrate by monitoring the conversion of NAD -to NADH at 340 nm (Binstock, et al., Methods in Enzymology, 71:403 5 (1981)). The assay mixture contained (1 mL final volume) 0.1 M KH2P04, pH 8, 0.3 mg bovine serum albumin per mL, 2 mM ~i-mercaptoethanol, 0.25 mM CoA, 30 p,M crotonyl CoA, and an aliquot of FaoAB. The reaction was preincubated for a couple of minutes to allow in situ formation of S-hydroxybutyryl CoA. The assay then was initiated by the addition of NAD
10 (0.45 mM). A control assay was performed without substrate to detect the rate of consumption of NAD due to enzymes other than hydroxyacyl CoA
dehydrogenase. One unit of activity is defined as the consumption of one p.Mol ofNAD per minute (E3ao= 6220 M-'crri') Thiolase activity was determined by monitoring the decrease in 15 absorption at 304 nm due to consumption of substrate acetoacetyl CoA as previously described with some modifications (Palmer, et al., J. Biol. Chem.
266:1-7 (1991)). The assay contained (final volume 1 mL) 62.4 mM Tris-Cl, pH 8.1, 4.8 mM MgCl2, 62.5 p.M CoA, and 62.5 p.M acetoacetyl CoA. The assay was initiated with the addition of FaoAB to the assay mixture. A
20 control sample without enzyme was performed for each assay to detect the rate of substrate degradation of pH 8.1 in the absence of enzyme. One unit of activity is defined as the consumption of one N.MoI of substrate acetoacetyl CoA per minute (s3ao= 169001V1-'cni').
Epimerase activity was assayed as previously described (Binstock, et 25 al., Methods in Enrymology, 71:403 (1981)) except that R-3-hydroxyacyl CoA thioesters were utilized instead of D,L-3-hydroxyacyl CoA mixtures.
The assay contained (final volume 1 mL) 30 p,M R-3-hydroxyacyl CoA, ISO
mM KH2PO4 (pH 8), 0.3 mg/mL BSA, 0.5 mM NAD, 0.1 mM CoA, and 7 p,g/mL L-specific ~i-hydroxyacyl CoA dehydrogenase from porcine heart.
30 The assay was initiated with the addition of FaoAB.
For expression of FaoAB in DHSoc/pMFX3, cultures were grown in 2xTY medium at 30 °C. 2xTY medium contains (per L) 16 g tryptone, 10 g yeast, and 5 g NaCI. A starter culture was grown overnight and used to inoculate (1% inoculum) fresh medium (100 mL in a 250 mL Erlenmeyer flask for small scale growths; 1.5L in a 2.8L flask for large scale growths).
Cells were induced with 0.4 n~IVI IPTG when the absorbance at 600 nm was in the range of 0.4 to 0.6. Cells were cultured an additional 4 h prior to harvest. Cells were lysed by sonication, and the insoluble matter was removed from the soluble proteins by centrifugation. Acyl CoA
dehydrogenase activity was monitored in the reverse direction to ensure activity of the FaoA subunit (SEQ m N0:31 ) and thiolase activity was assayed to determine activity of the Fao subunit. FaoAB in DHSoc/pMFX3 contained dehydrogenase and thiolase activity values of 4.3 and 0.99 U/mg, respectively, which is significantly more than the 0.0074 and 0.0033 U/mg observed for dehydrogenase and thiolase, respectively, in control strain DHSoc/pTRCN.
1 S FaoAB was purified from DHSoc/pMFX3 using a modified procedure previously described for the purification of FaoAB from Pseudomonas fragi (Imamura, et al., .I. Biochem. 107:184-89 ( 1990)). Thiolase activity (assayed in the forward direction) and dehydrogenase activities (assayed in the reverse direction) were monitored throughout the purification. Three liters of DHSoc/pMFX3 cells (2 X 1.5 L aliquots in 2.8 L Erlenmeyer flasks) were grown in 2 x TY medium using the cell growth procedure previously described for preparing cells for enzyme activity analysis. Cells {15.8 g) were resuspended in 32 mL of 10 mM KH2P04, pH 7, and lysed by sonication. Soluble proteins were removed from insoluble cells debris by centrifugation (18,000 RPM, 30 min., 4 °C). The soluble extract was made 50% in acetone and the precipitated protein was isolated by centrifugation and redissolved in 10 mM KHZP04, pH 7. The sample was adjusted to 33%
saturation with (NH4)2504 and the soluble and insoluble proteins were separated by centrifugation. The resulting supernatant was adjusted to 56%
saturation with (NHa)2504 and the insoluble pellet was isolated by centrifugation and dissolved in 10 mM KH2P04, pH 7. The sample was heated at 50°C for 30 min. and the soluble proteins were isolated by centrifugation and dialyzed in a 6,000 to 8,000 molecular weight cut off membrane in 10 mM KH2POa, pH 7 (2 X 3L; 20 h). The sample was loaded on a Toyo Jozo DEAF FPLC column (3 cm x 14 cm) that previously had been equilibrated in 10 mM KHzP04, pH 7. The protein was eluted with a linear gradient (100 mL by 100 mL; 0 to 500 mM NaCI in 10 KHZP04, pH 7) at a flow of 3 mL/min. FaoAB eluted between 300 and 325 mM NaCI. The sample was dialyzed in a 50,000 molecular weight cut off membrane in I O
mM KH2POa, pH 7 (1 X 2L; 15h) prior to loading on a macro-prep hydroxylapatite 18/30 (Biorad) FPLC column (2 cm x 15 cm) that previously had been equilibrated in 10 mM KH2P04, pH 7. The protein was eluted with a linear gradient (250 mL by 250 mL; 10 to 500 mM KH2P04, pH 7) at a flow rate of 3 mL/min. FaoAB eluted between 70 and 130 mM KH2P04, The fractions containing activity were concentrated to 9 mL using a MIZLIPORETM 100,000 molecular weight cutoff concentrator. The buffer was exchanged 3 times with 10 mM KHZP04, pH 7 containing 20% sucrose and frozen at -70°C. Enzyme activities of the hydroxylapatite purified fraction were assayed with a range of substrates. The results are shown in Table 1 below.
T~AIP i ~ Fnwmp Cuhctratec and Activities En me Substrate Activit U/m h dratase croton 1 CoA 8.8 deh dro enase forwardcroton 1 CoA 0.46 deh dro enase reverseacetoace 1 CoA 29 thiolase acetoace 1 CoA 9.9 a imerase R-3-h drox octan 0.022 1 CoA
a imerase R-3- h drox hexan 0.0029 1 CoA
epimerase R-3- hydroxybutyryl0.000022 CoA
Example 2: Production of Antibodies to the FaoAB and FaoAB Polypeptides Following purification of the FaoAB protein as described in Example 1, a sample was separated by SDS-PAGE. The protein band corresponding to the FaoA (SEQ ID N0:31) and FaoB (SEQ ID N0:26) was excised and used to immunize New Zealand white rabbits with complete Freunds adjuvant. Boosts were performed using incomplete Freunds at three week - intervals. Antibodies were recovered from serum by affinity chromatography on Protein A columns (Pharmacia) and tested against the antigen by Western blotting procedures. Control extracts of Brassica seeds were used to test for cross reactivity to plant proteins. No cross reactivity S was detected.
Example 3: Construction of Plasmids for Expression of the Pseudomonas putido fao AB Genes in Transgenic Oilseeds Construction of pSBS2024 Oligonucleotide primers GVR471 5'-CGGTACCCATTGTACTCCCAGTATCAT-3' (SEQ ID N0:8) and GVR472 S'-CA TTTAAA TAGTAGAGTATTGAATATG-3' (SEQ ID N0:9) homologous to sequences flanking the 5' and 3' ends (underlined), respectively, of the bean phaseolin promoter (SEQ 117 NO:10; Slightom et al., 1983) were designed with the addition of KpnI {in italics, nucleotides 1-in SEQ ID N0:8) and SwaI (in italics, nucleotides 1-9 in SEQ ID N0:9) at the 5' ends of GVR471 and GVR472, respectively. These restriction sites were incorporated to facilitate cloning. The primers were used to amplify a 1.4 kb phaseolin promoter, which was cloned at the SmaI site in pUCl9 by blunt ended ligation. The designated plasmid, pCPPI (see Figure 2) was cut with SaII and SwaI and ligated to a SaIIlSwaI phaseolin terminator (SEQ ID
N0:27). The bean phaseolin terminator sequence encompassing the ' polyadeilylation signals was amplified' using the following PCR'piiliiers: ' GVR396:
5'-GATTTAAATGCAAGCTTAAATAAGTATGAACTAAAATGC-3' (SEQ ID N0:22) and GVR397:
S'-CGGTACCTTAGTTGGTAGGGTGCTA-3' (SEQ ID N0.23) and the 1.2Kb fragment (SEQ ID N0:27) cloned into Sall-SaI site of pCCPI
to obtain pSBS2024 (Figure 2). The resulting plasmid which contains a unique HindIII site for cloning was called pSBS2024 (Figure 2).
Construction of pSBS2025 A soybean oleosin promoter fragment (SEQ ID NO:l 1; Rowley et al., 1997) was simplified with primers that flank the DNA sequence.
Primer JA408 5' -TCTAGATACATCCATTTCTTAATATAATCCTCTTATTC-3' (SEQ ID N0:12) contains sequences that are complementary to the 5' end (underlined).
Primer np 1 5' -CA TTTAAA TGGTTAAGGTGAAGGTAGGGCT-3' (SEQ ID N0:13) contains sequences homologous to the 3' end (underlined) of the promoter fragment. The restriction sites Xbal (in italics) and SwaI (in italics) were incorporated at the 5' end of JA408 and npl, respectively, to facilitate cloning. The primers were used to amplify a 975 by promoter fragment, which then was cloned into Small site of pUCl9 (see Figure 2). The resulting plasmid, pCSPI, was cut with Salt and Swal and iigated to the soybean terminator (SEQ ID N0:28). The soybean oleosin terminator was amplified by PCR using the following primers:
JA410:
5'-AAGCTTACGTGATGAGTATTAATGTGTTGTTATG-3' (SEQ ID N0:29) and JA411:
'S'-TCfiAGACAATTCATCAAATACAAATCACATTGCC-3' (SEQ ID N0:30) and the 225 by fragment cloned into the SaII-SwaI site of pCSPl to obtain plasmid pSBS2025 (Figure 6). The designated plasmid, pSBS2025, earned a unique HindIII site for cloning (Figure 2).
Construction of Promoter-coding Sequence Fusions Two oligonucleotide primers were synthesized:
np2 5'AA GCTT'AAAATGATTTACGAAGGTAAAGCC-3' (SEQ ID N0:14) homologous to nucleotides 553 to 573 of the 5' flanking sequences, and np3 5' A TTGCTTT CAGTTGAAGCGCTG-3' (SEQ ID NO:15) complementary to nucleotides 2700 to 2683 flanking the 3' end of mfl (faoA, SEQ ID N0:24) of plasmid pmfx3. A HindIII (in italics) site was introduced at the S' end of primers np2 and np3 to facilitate cloning. In addition, a 3 by AAA sequence (bold) was incorporated to obtain a more favorable sequence surrounding the plant translational initiation codon. Primers np2 and np3 were used to amplify the fragment and cloned into SmaI site of pUCl9. The resulting plasmid was called pCmfl (Figures 3A and 3B). Plasmid pBmfl was constructed in a similar process {Figures SA and SB). In order to generate a HindIII {in italics) at 5' and 3' ends of the mFl (faoB) gene (SEQ
ID N0:25) for cloning, a second set of synthetic primers were designed.
Primers np4 5' AAGCTTAAAATGAGCCTGAATCCAAGAGAC-3' (SEQ ID N0:16) complementary to 5' (nucleotides 2732-2752 bp) and np5 5'AAGCTTTCAGACGCGTTCGAAGACAGTG -3' ' w (SEQ ID N0:17) homologous to 3' (nucleotides 3907-3886 bp) sequences of mf2 (faoB, SEQ
ID N0:25) of plasmid pmfx3 were used in a PCR reaction to amplify the 1.17 kb DNA fragment. The resulting PCR product was cloned into the EcoRV site of pBluescript. The plasmid was referred to as pBmf2.
Both plasmids were individually cut with HindIII and their inserts cloned in plasmids pSBS2024 and pSBS2025, which had previously been linearized with the same restriction enzyme. As a result, the following WO 99/45122 PC'T/US99/04999 plasmids were generated: pmf124 and pmf125 (Figures 3A and 3B) and pmfZ24 and pmf225 (Figures SA and SB) containing the Fao genes (mfl and mf2) fused to either the phaseolin or soybean promoters. DNA sequence analysis confirmed the correct promoter-coding sequence-termination sequence fusions for pmf124, pmf125, pmfZ24, and pmf225.
Example 4: Assembly of Promoter-coding Sequence Fusions into Plant Transformation Vectors After obtaining plasmids pmf124, pmf125, pmf224, and pmf225, promoter-coding sequence fusions were independently cloned into the binary vectors, pCGN1559 (McBride and Summerfelt, 1990) containing the CaMV
35S promoter driving the expression of NPTII gene (conferring resistance to the antibiotic kanamycin) and pSBS2004 containing a parsley ubiquitin promoter driving the PPT gene, which confers resistance to the herbicide phosphinothricine. Binary vectors suitable for this purpose with a variety of selectable markers can be obtained from several sources.
The phaseolin-mf21 fusion cassette was released from the parent plasmid withXbaI and ligated with pCGN1559, which had been linearized with the same restriction enzyme. The resulting plasmid was designated pCGmf124 (Figures 3A and 3B). Plasmid pCGmf125 containing the soybean-mfl fusion was constructed in a similar way (Figures 3A and 3B), except that both pmf125 and pCGNl 559 were cut with BamHI before ligation.
Construction of pmf1249 an pmf1254 The plasmid pSBS2004 was linearized with BamHI fragment containing the soybean-mfl fusion. This plasmid was designated pmf1254 (Figures 4A and 4B). Similarly, the Xbal phaseolin-mfl fusion fragment was ligated to pSBS2004 which had been linearized with the same restriction enzyme. The resulting plasmid was designated pmfI249 (Figures 4A and 4B).
Construction of pCGmfz24 and pCGmf225 The phaseolin-mfZ and soybean-mf2 fusions were constructed by excising the fusions from the vector by cutting with either BamHI or XbaI, and cloned into pCGNI559 which had been linearized with either restriction enzyme (Figures SA and SB).
Construction of ~CGmfl P2S and pCGmf2P 1 S
The two expression cassettes containing the promoter-coding sequence fusions were assembled on the same binary vector as follows:
Plasmid pmf124 containing the phaseolin-mfl fusion was cut with BamHI
and cloned into the BamHI site of pCGN1559 to create pCGmfB 124. This plasmid then was linearized with XbaI and Iigated to the XbaI fragment of pmf225 containing the soybean-mf2 fusion. The final plasmid was designated pCGmfl P2S (Figures 6A and 6B). Plasmid pCGmfZP 1 S was assembled in similar manner. The phaseolin-mfZ fusion was released from pmf224 by cutting with BamHI and cloned at the BamHI site of pCGN 1559.
The resulting plasmid, pCGmfB224, was linearized with XbaI and ligated to the Xbal fragment of pmf125 containing the soybean-mfl fusion (Figures 6A
and 6B).
Example 5: Transformation of Brassica Brassica seeds were surface sterilized in 10% commercial bleach (Javex, Colgate-Palmolive) for 30 min. with gentle shaking. The seeds were washed three times in sterile distilled water. Seeds were placed in germination medium comprising Murashige-Skoog (MS) salts and vitamins, 3% (w/v) sucrose and 0.7% (w/v) phytagar, pH 5.8 at a density of 20 per ' ' plate and maintained at 24 °C and a 16 h light 7 8 h dark' photoperiod 'at a light intensity of 60-80 ,uEni 2s'1 for four to five days.
Each of the constructs, pCGmf124, pCGmf125, pCGmf224, pCGmfIP2S, and pCGmf2PlS were introduced intoAgrobacterium tumefacians strain EHA101 (Hood et al., J. Bacteriol. 168:1291-1301 (1986)) by electroporation. Prior to transformation of cotyledonary petioles, single colonies of strain EHA101 harboring each construct were grown in 5 ml of minimal medium supplemented with 100 mg kanamycin per liter and 100 mg gentamycin per liter for 48 hr at 28 °C. One milliliter of bacterial suspension was pelletized by centrifugation for 1 min in a microfuge. The pellet was resuspended in 1 ml minimal medium.
For transformation, cotyledons were excised from 4 day old, or in some cases S day old, seedlings, so that they included approximately 2 mm of petiole at the base. Individual cotyledons with the cut surface of their petioles were immersed in diluted bacterial suspension for 1 s and immediately embedded to a depth of approximately 2 mm in co-cultivation medium, MS medium with 3% (w/v) sucrose and 0.7% phytagar and enriched with 20 ,uM benzyladenine. The inoculated cotyledons were plated at a density of 10 per plate and incubated under the same growth conditions for 48 h. After co-cultivation, the cotyledons then were transferred to regeneration medium comprising MS medium supplemented with 3%
sucrose, 20 pM benzyladenine, 0.7% (w/v) phytagar, pH 5.8, 300 mg timentinin per liter, and 20 mg kanamycin sulfate per liter.
After two to three weeks, regenerant shoots obtained were cut and maintained on "shoot elongation" medium (MS medium containing, 3%
sucrose, 300 mg timentin per liter, 0.7% (w/v) phytagar, 300 mg timentinin per liter, and 20 mg kanamycin sulfate per liter, pH 5.8) in Magenta jars.
The elongated shoots were transferred to "rooting" medium comprising MS
medium, 3% sucrose, 2 mg indole butyric acid per liter, 0.7% phytagar, and 500 mg carbenicillin per liter. After roots emerged, plantlets were transferred to potting mix (Redi Earth, W.R. Grace and Co.). The plants were maintained in a misting chamber (75% relative humidity) under the same growth conditions. Two to three weeks after growth, leaf samples were taken for neomycin phosphotransferase (NPTII) assays (Moloney et al., Plant Cell Reports 8:238-42 (1989)).
Seeds from the FaoA and FaoB transgenic lines can be analyzed for expression of the fatty acid oxidation polypeptides by western blotting using the anti-FaoA and anti-FaoB antibodies. The FaoB polypeptide (SEQ ID
N0:26) is not functional in the absence of the FaoA gene product; however, the FaoAB gene product has enzyme activity.
Transgenic lines expressing the FaoA and FaoB complex are obtained by crossing the FaoA and FaoB transgenic lines expressing the individual polypeptides and seeds analyzed by western blotting and enzymes assays as described.
Example 6: Transformation of B. napes cv. Westar and Analysis of Transgenic Lines Transformation The protocol used was adopted from a procedure described by Moloney et ad. ( 1989). Seeds of Brassica napes cv. Westar were surface sterilized in 10% commercial bleach (Javex, Colgate-Palmolive Canada Inc.) for 30 min with gentle shaking. The seeds were washed three times in sterile distilled water. Seeds were placed on germination medium comprising Murashige-Skoog (MS} salts and vitamins, 3% sucrose and 0.7% phytagar, pH 5.8 at a density of 20 per plate and maintained at 24 °C in a 16 h light/8 h dark photoperiod at a light intensity of 60-80 p,Eni 2s 1 for four to five days.
Each of the constructs, pCGmf124, pCGmf125, pCGmf224, pCGmfZ25, pCGmfIP2S, and pCGmfZPIS were introduced into Agrobacterium tumefaciens strain EHA101 (Hood et al. 1986} by electroporation. Prior to transformation of cotyledonary petioles, single colonies of strain EHA101 harboring each construct were grown in 5 mL of minimal medium supplemented with 100 mg kanamycin per liter, and 100 mg gentamycin per liter for 48 h at 28 °C. One milliliter of bacterial suspension was pelletized by centrifugation for 1 min in a microfuge. The pellet was resuspended in 1 mL minimal medium.
For transformation, cotyledons were excised from four-day-old, or in some cases five-day-old, seedlings so that they included approximately 2 mm of petiole at the base. Individual cotyledons with the cut surface of their petioles were immersed in diluted bacterial suspension for 1 s and immediately embedded to a depth of approximately 2 mm in co-cultivation medium, MS medium with 3% sucrose and 0.7% phytagar, enriched with 20 N,M benzyladenine. The inoculated cotyledons were plated at a density of 10 per plate and incubated under the same growth conditions for 48 h. After co-cultivation, the cotyledons then were transferred to regeneration medium, which comprised MS medium supplemented with 3% sucrose, 20 p.M
benzyladenine, 0.7% phytagar, pH 5.8, 300 mg timentinin per liter, and 20 mg kanamycin sulfate per liter.
After two to three weeks, regenerant shoots were obtained, cut, and maintained on "shoot elongation" medium (MS medium containing 3%
sucrose, 300mg timentin per liter, 0.7% phytagar, and 20 mg kanamycin per liter, pH 5.8) in Magenta jars. The elongated shoots then were transferred to "rooting" medium, which comprised MS medium, 3% sucrose, 2 mg indole butyric acid per liter, 0.7% phytagar and 500 mg carbenicillin per liter.
After roots emerged, the plantlets were transferred to potting mix (Redi Earth, W.R. Grace and Co. Canada Ltd.). The plants were maintained in a misting chamber (75% RH) under the same growth conditions. Two to three weeks after growth, leaf samples were taken for neomycin phosphotransferase (NPT II) assays (Moloney et al. 1989). The results are shown in Table 2 below. The data show the number of plants that were confirmed to be transformed.
Table 2: NPT II Activity in Transformed Plants Constructs No. of NPTII NPTII No. of plants plants assayedconfirmedconfirmed transformed CGmfl24 47 27 23 33 CGmf125 37 28 18 18 CGmfZ24 49 40 30 39 CGmf225 52 37 28 34 SpCGmfIP2S 27 27 21 21 6 CGmfl,P
pCGmf124 -bean pltaseolin regulatory sequences driving FaoA gene 'pCGmf125 - soybean oleosin regulatory sequences driving FaoA gene 3pCGmf224 - be n phaseolin regulatory sequences driving FaoB gene 4pCGmfl25 - soybean oleosin regulatory sequences driving FaoB gene SpCGmf192S - bean phaseolin and soybean oleosin regulatory sequences driving FaoA &
FaoB genes, respectively 6pCGmflPlS -bean phaseolin and soybean oleosin regulatory sequences driving FaoB &
FaoA genes, respectively The fate of the transforming DNA was investigated for sixteen randomly selected transgenic lines. Southern DNA hybridization analysis showed that the FaoA and/or FaoB were integrated into the genomes of the transgenic lines tested.
Approximately 80% of the pmf124 transgenic plants in which the FaoA gene is expressed from the strong bean phaseolin promoter were observed to be male sterile. Clearly high level expression of the FaoA gene 5 from this promoter results in functional expression of the FaoA gene product which impairs seed and/or pollen development. This result was very unexpected, since it was not anticipated that the plant cells would be capable of carrying out the first step in the (i-oxidation pathway in the cytosol.
This result, however, provides additional applications for expressing ~i-oxidation 10 genes in plants for male sterility for hybrid production or to prevent the production of seed. It was also note that in a side-by-side comparison with normal transgenic lines, the pmf124 lines produced much higher levels of biomass, presumably due to the elimination of seed development. This phenotype therefore may be useful as a means to increase the amount of 15 green biomass produced per acre for silage, forage, or other biomass crops.
Here, the use of an inducible promoter system or recombinase technology could be used to produce seed for planting. Seven of the sterile plants were successfully cross-pollinated with pollen from pmfl25 transgenic lines and set seeds.
20 Northern analysis on RNA from seeds from pmf224 lines containing the phaseolin promoter-FaoB constructs showed a signal indicative of the expected 1.2 kb transcript in all the samples tested except the control.
Northern analysis on RNA from seeds from pmf125 lines containing the weak soybean oleolsin promoter-FaoA constructs revealed a transcript of the .
25 expected size of 2.1 kb. Western blotting on 300-500 leg of protein from approximately 80% of seeds of pmf125 plants where the FaoA gene is expressed from the relatively weak soybean oleosin promoter were inconclusive, although a weak signal was detected in one transgenic line.
Fatt~Acid Analysis 30 Given the unexpected results indicating a strong metabolic effect of expressing the FaoA gene from the strong bean phaseolin promoter in seeds, the fatty acid profile of the seeds from transgenic lines expressing the FaoA
gene from the weak soybean oleosin promoter was analyzed. Seeds expressing only the FaoA gene or also expressing the FaoB gene from the bean phaseolin promoter were examined. The analysis was carried out as described in Millar et al., The Plant Cell 11:1889-902 (1998). Seed fatty 5 acid methyl esters (FAMES) were prepared by placing ten seeds of B. napus in 15 x 45-mm screw capped glass tubes and heating at 80 °C in 0.75 mL
of 1N methanolic HCl reagent (Supelco, PA) and 10 gL of 1 mg 17:0 methyl ester (internal standard) per mL overnight. After cooling the samples, the FAMES were extracted with 0.3 mL hexane and 0.5 mL 0.9% NaCI by 10 vortexing vigorously. The samples were allowed to stand to separate the phases, and 300 pL of the organic phase was drawn and analyzed on a Hewlett-Packard gas chromatograph.
Fatty acid profile analysis indicated the presence of an additional component or enhanced component in the lipid profile in all of the transgenic 15 plants expressing the FaoA gene SEQ m N0:24 which was absent from the control plants. This result again proves conclusively that the FaoA gene is being transcribed and translated and that the FaoA polypeptide SEQ ID NO:
27 is catalytically active. This peak also was observed in eleven additional transgenic plants harboring SoyP-FaoA, PhaP-FaoA-SoyP-FaoB, SoyP-20 FaoA-PhaP-FaoB genes and a sterile (PhaP-FaoA) plant cross-pollinated with SoyP-FaoB. These data clearly demonstrate functional expression of the FaoA gene and that even the very low levels of expression are sufficient to change the lipid profile of the seed. Adapting the methods described herein, one of skill in the art can express these genes at levels intermediate 25 between that obtained with the phaseolin promoter and the soybean oleosin promoter using other promoters such as the Arabidopsis oleosin promoter, napin promoter, or cruciferin promoter, and can use inducible promoter systems or recombinase technologies to control when fatty acid oxidation transgenes are expressed.
Example 7: Yeast ~i-oxidation Multi-functional Enzyme Complex S. cerevisiae contains a (i-oxidation pathway that proceeds via R-WO 99/45122 PC'fNS99/04999 hydroxyacyl CoA rather than the S-3-hydroxyacyl CoA observed in bacteria and higher eukaryotes. The fox2 gene from yeast encodes a hydratase that produces R-3-hydroxyacyl CoA from traps-2-enoyl-CoA and a dehydrogenase that utilizes R-3-hydroxyacyl-CoA to produce ~i-keto acyl CoAs.
The fox2 gene (sequence shown in SEQ ID NO:1) was isolated from S. cerevisiae genomic DNA by PCR in two pieces. Primers N-fox2b and N-bamfox2b were utilized to PCR a 1.1 kb SmaIBamHI fragment encoding the N-terminal region of Fox2, and primers C-fox2 and C-bamfox2 were utilized 10 to PCR a 1.6 kb BamHI/XbaI fragment encoding the C-terminal Fox2 region. The full fox2 gene was reconstructed via subcloning in vector pTRCN.
N-fox2b fox2 tcc ccc ggg agg agg ttt tta tta tgc ctg gaa att tat cct tca aag ata gag tt 1 S (SEQ ID N0:18) N-bamfox2b fox2 aaggatccttgatgtcatttacaactacc (SEQ ID NO:I9) C-fox2 fox2 get cta gat agg gaa aga tgt atg taa g (SEQ ID N0:20) 20 C-bamfox2 fox2 tgacatcaaggatcctttt (SEQ ID N0:21) The foil gene, however, does not possess a p-ketothiolase activity and this activity must be supplied by a second transgene. Representative sources of such a gene include algae, bacteria, yeast, plants, and mammals. The 25 bacterium Alcaligenes eutrophus possesses a broad specificity (3-ketothiolase gene suitable for use in the methods described herein. It can be readily isolated using the acetoacetyl-CoA thiolase gene as a hybridization probe, as described in U.S. Patent 5,661,026 to Peoples et al. This enzyme also has been purified (Haywood et al., FEMSMicro. Lett. 52:91 (1988)), and the 30 purified enzyme is useful for preparing antibodies or determining protein sequence information as a basis for the isolation of the gene.
Example 8: Plant ~i-Oxidation Gene The DNA sequence of the cDNA encoding (3-oxidation tetrafunctional protein, shown in SEQ ID N0:4, can be isolated as described in Preisig-Muller et al., J. Biol. Chem. 269:20475-8I (1994). The equivalent gene can be isolated from other plant species including Arabidopsis, Brassica, soybean, sunflower, and corn using similar procedures or by screening genomic libraries, many of which are commercially available, for example from Clontech Laboratories Inc., Palo Alto, California, USA. A
peroxisomal targeting sequence P-R-M was identified at the carboxy I O terminus of the protein. Constructs suitable for expressing in the plant cytosol can be prepared by PCR amplification of this gene using primers designed to delete this sequence.
SSQUENCB LISTING
<110> Metabolix, Inc.
<120> Modification of Fatty Acid Metabolism in Plants <130> MBX 024 <140>
<141>
<150> 60/077,107 <151> 1998-03-06 <160> 31 <170> PatentIn Ver. 2.0 <210> 1 <211> 2703 <212> DNA
<213> Saccharomyces <220>
<221> gene <222> (1)..(2703) <223> fox2 gene <400> 1 atgcctggaa atttatcctt caaagataga gttgttgtaa tcacgggcgc tggagggggc 60 ttaggtaagg tgtatgcact agcttacgca agcagaggtg caaaagtggt cgtcaatgat 120 ctaggtggca ctttgggtgg ttcaggacat aactccaaag ctgcagactt agtggtggat 180 gagataaaaa aagccggagg tatagctgtg gcaaattacg actctgttaa tgaaaatgga 240 gagaaaataa ttgaaacggc tataaaagaa ttcggcaggg ttgatgtact aattaacaac 300 gctggaatat taagggatgt ttcatttgca aagatgacag aacgtgagtt tgcatctgtg 360 ~w ~gtagatgttc atttgacagg tggctataag ctatcgcgtg°~c'tgettggcc ttatatgcgc 420 tctcagaaat ttggtagaat cattaacacc gcttcccctg ccggtctatt tggaaatttt 480 ggtcaagcta attattcagc agctaaaatg ggcttagttg gtttggcgga aaccctcgcg 540 aaggagggtg ecaaatacaa cattaatgtt aattcaattg cgccattggc tagatcacgt 600 atgacagaaa acgtgttacc accacatatc ttgaaacagt taggaccgga aaaaattgtt 660 cccttagtac tctatttgac acacgaaagt acgaaagtgt caaactccat ttttgaactc 720 gctgctggat tctttggaca gctcagatgg gagaggtctt ctggacaaat tttcaateca 780 gaccccaaga catatactcc tgaagcaatt ttaaataagt ggaaggaaat cacagactat 840 agggacaagc catttaacaa aactcagcat ccatatcaac tctcggatta taatgattta 900 atcaccaaag caaaaaaatt acctcccaat gaacaaggct cagtgsaaat caagtcgctt 960 tgcaacaaag tcgtagtagt tacgggtgca ggaggtggtc ttgggaagtc tcatgcaatc 1020 tggtttgcac ggtacggtgc gaaggtagtt gtaaatgaca tcaaggatcc tttttcagtt 1080 gttgaagaaa taaataaact atatggtgaa ggcacagcca ttccagattc ccatgatgtg 1140 gtcaccgaag ctcctctcat tatccaaact gcaataagta agtttcagag agtagacatc 1200 ttggtcaata acgctggtat tttgcgtgac aaatcttttt taaaaatgaa agatgaggaa 1260 tggtttgctg tcctgaaagt ccaccttttt tccacatttt cattgtcaaa agcagtatgg 1320 ccaatattta ecaaacaaaa gtctggattt attatcsata ctacttctac ctcaggaatt 1380 tatggtaatt ttggacaggc caattatgcc gctgcaaaag ccgccatttt aggattcagt 1440 aaaactattg cactggaagg tgccaagaga ggaattattg ttaatgttat cgctcctcat 1500 gcagaaacgg ctatgacaaa gactatattc tcggagaagg aattatcaaa ccactttgat 1560 gcatctcaag tetccccact tgttgttttg ttggcatctg sagaactaca aaagtattct 1620 ggaagaaggg ttattggcca attattcgaa gttggcggtg gttggtgtgg gcaaaccaga 1680 tggcaaagaa gttccggtta tgtttctatt aaagagacta ttgaaccgga agaaattaaa 1740 gaaaattgga accacatcac tgatttcagt cgcaacacta tcascccgag ctccacagag 1800 gagtcttcta tggcaacctt gcaagccgtg caaaaagcgc actcttcaaa ggagttggat 1860 gatggattat tcaagtacac taccaaggat tgtatcttgt acaatttagg acttggatgc 1920 acaagcaaag agcttaagta cacctacgag aatgatccag acttccaagt tttgcccacg 1980 ttcgccgtca ttccatttat gcaagctact gccacactag ctatggacaa tttagtcgat 2040 aacttcaatt atgcaatgtt actgcatgga gaacaatatt ttaagctctg cacgccgaca 2100 atgccaagta atggaactct aaagacactt gctaaacctt tacaagtact tgacaagaat 2160 ggtaaagccg ctttagttgt tggtggcttc gaaacttatg acattaaaac taagaaactc 2220 atagcttata acgaaggatc gttcttcatc aggggcgcac atgtacctcc agaaaaggaa 2280 gtgagggatg ggaaaagagc caagtttgct gtccaaaatt ttgaagtgcc acatggaaag 2340 gtaccagatt ttgaggcgga gatttctacg aataaagatc aagccgcatt gtacaggtta 2400 tctggcgatt tcaatccttt acatatcgat cccacgctag ccaaagcagt tsaatttcct 2460 acgccaattc tgcatgggct ttgtacatta ggtattagtg cgaaagcatt gtttgaacat 2520 tatggtccat atgaggagtt gaaagtgaga tttaccaatg ttgttttccc aggtgatact 2580 ctaaaggtta aagcttggaa gcaaggctcg gttgtcgttt ttcaaacaat tgatacgacc 2640 agaaacgtca ttgtattgga taacgccgct gtaaaactat cgcaggcaaa atctaaacta 2700 taa 2703 <210> 2 <211> 900 <212> PRT
<213> Saccharomyces <220>
<221> PEPTIDE
<2,22..> ~1) .,. (900) , .. . , . , ~ ., t <223> fox2 encoded polypeptide <400> 2 Met Pro Gly Asn Leu Ser Phe Lys Asp Arg Val Val Val Ile Thr Gly Ala Gly Gly Gly Lau Gly Lys Val Tyr Ala Leu Ala Tyr Ala Ser Arg Gly Ala Lys Val Val Val Asn Asp Leu Gly Gly Thr Leu Gly Gly Ser Gly His Asn Ser Lys Ala Ala Asp Leu Val Val Asp Glu Ile Lys Lys Ala Gly Gly Ile Ala Val Ala Asn Tyr Asp Ser Val Asn Glu Asn Gly Glu Lys Ile Ile Glu Thr Ala Ile Lys Glu Phe Gly Arg Val Asp Val Leu Ile Asn Asn Ala Gly Ile Leu Arg Asp Val Ser Phe Ala Lys Met Thr Glu Arg Glu Phe Ala Ser Val Val Asp Val His Leu Thr Gly Gly Tyr Lys Leu Ser Arg Ala Ala Trp Pro Tyr Mat Arg Ser Gln Lys Phe Gly Arg Ile Ile Asn Thr Ala Ser Pro Ala Gly Leu Phe Gly Asn Phe Gly Gln Ala Asn Tyr Ser Ala Ala Lys Met Gly Leu Val Gly Leu Ala Glu Thr Leu Ala Lys Glu Gly Ala Lys Tyr Asn Ile Asn Val Asn Ser Ile Ala Pro Leu Ala Arg Ser Arg Mat Thr Glu Asn Val Leu Pro Pro His Ile Leu Lys Gln Leu Gly Pro Glu Lys Ile Val Pro Leu Val Leu Tyr Leu Thr His Glu Ser Thr Lys Val Ser Asn Ser Ile Phe Glu Leu Ala Ala Gly Phe Phe Gly Gln Leu Arg Trp Glu Arg Ser Ser Gly Gln Ile Phe Asn Pro Asp Pro Lys Thr Tyr Thr Pro Glu Ala Ile Leu Asn Lys Trp Lys Glu Ile Thr Asp Tyr Arg Asp Lys Pro Phe Asn Lys Thr Gln His Pro Tyr Gln Leu Ser Asp Tyr Asn Asp Leu Ile Thr Lys Ala Lys Lys Leu Pro Pro Asn Glu Gln Gly Ser Val Lys Ile Lys Ser Leu Cys Asn Lys Val Val Val Val Thr Gly Ala Gly Gly Gly Leu Gly Lys Ser His Ala Ile Trp Phe Als Arg Tyr Gly Ala Lys Val Val Val Asn Asp Ile Lys Asp Pro Phe Ser Val Val Glu Glu Ile Aan Lys Leu Tyr Gly Glu Gly Thr Ala Ile Pro Asp Ser His Asp Val Val Thr Glu Ala Pro Leu Ile Ile Gln Thr Ala Ile Ser Lys Phe Gln Arg Val Asp Ile Leu Val Asn Asn Ala Gly Ile Leu Arg Asp Lys Sar Phe Leu Lys Met Lys Asp Glu Glu Trp Phe Ala Val Leu Lys Val His Leu Phe Ser Thr Phe Ser Leu Ser Lys Ala Val Trp Pro Ile Phe Thr Lys Gln Lys Ser Gly Phe Ile Ile Asn Thr Thr Ser Thr Ser Gly Ile Tyr Gly Aan Phe Gly Gln Ala Asn Tyr Ala Ala Ala Lys Ala Ala Ile Leu Gly Phe Ser Lys Thr Ile Ala Leu Glu Gly Ala Lys Arg Gly Ile Ile Val Asn Val Ile Ala Pro His Ala Glu Thr Ala Met Thr Lys Thr Ile Phe Ser Glu Lys Glu Leu Ser Asn His Phe Asp Ala Ser Gln Val Ser Pro Leu Val Val Leu Leu Ala Ser Glu Glu Leu Gln Lys Tyr Ser Gly Arg Arg Val Ile Gly Gln Leu Phe Glu Val Gly Gly Gly Trp Cys Gly Gln Thr Arg Trp Gln Arg Ser Ser Gly Tyr Val Ser Ile Lys Glu Thr Ile Glu Pro Glu Glu Ile Lys Glu Asn Trg Asn His Ile Thr Asp Phe Ser Arg Asn Thr Ile Asn Pro Ser Ser Thr Glu Glu Ser Ser Met Ala Thr Leu Gln Ala Val Gln Lys Ala His Ser Ser Lys Glu Leu Asp Asp Gly Leu Phe Lys Tyr Thr Thr Lys Asp Cys Ile Leu Tyr Asn Leu Gly Leu Gly Cys Thr Ser Lys Glu Leu Lys Tyr Thr Tyr Glu Asn Asp Pro Asp Phe Gln Val Leu Pro Thr Phe Ala Val Ile Pro Phe Met Gln Ala Thr Ala Thr Leu Ala Met Asp Asn Leu Val Asp Asn Phe Asn Tyr Ala Mat Leu Leu His Gly Glu Gln Tyr Phe Lys Leu Cys Thr Pro Thr Met Pro Ser Asn Gly Thr Leu Lys Thr Leu Ala Lys Pro Leu Gln Val Leu Asp Lys Asn Gly Lys Ala Ala Leu Val Val Gly Gly Phe Glu Thr Tyr Asp Ile Lys Thr Lys Lys Leu Ile Ala Tyr Asn Glu Gly Ser Phe Phe Ile Arg Gly Ala His Val Pro Pro Glu Lys Glu Val Arg Asp Gly Lys Arg Ala Lys Phe Ala Val Gln Asn Phe Glu Val Pro His Gly Lys Val Pro Asp Phe Glu Ala Glu Ile Ser Thr Asn Lys Asp Gln Ala Ala Leu Tyr Arg Leu Ser Gly Asp Phe Asn Pro Leu His Ile Asp Pro Thr Leu Ala Lys Ala Val Lys Phe Pro Thr Pro Ile Leu His Gly Leu Cys Thr Leu Gly Ile Ser Ala Lys Ala Leu Phe Glu His Tyr Gly Pro Tyr Glu Glu Leu Lys Val Arg Phe Thr Asn Val Val Phe Pro Gly Asp Thr Leu Lys Val Lys Ale Trp Lys Gln Gly Ser Val Val Val Phe Gln Thr Ile Asp Thr Thr Arg Asn Val Ile Val Leu Asp Asn Ala Ala Val Lys Leu Ser Gln Ala Lys Ser Lys Leu <210> 3 <211> 2177 <212> DNA
<213> Artificial Sequence <220>
<221> gene <222> (1)..(2177) <223> tetrafunctional beta-oxidation gene <220>
<223> Description of Artificial Sequence <400> 3 atgggaagca atgcaaaagg aagaacggta atggaggtgg gaactgatgg agtagcaata 60 atcaccatca tcaaccctcc agttaactcc ttgtcttttg atgtgttatt cagcctgaga 120 gatagttatg aacaagcctt'gagaagagat gatgtgaagg caattgttgt tacaggtgca 180 aagggaaaat tttctggtgg ctttgatata actgcttttg gtgtactcca aggaggaaag 240 ggggagcaac caaatgttag aaacatatca attgaaatga tcactgatat ttttgaagct 300 gcccgaaaac ctgcggttgc agcgatagat ggacttgctt tgggtggagg gttggaggtt 360 gccatggctt gtcatgctcg aatatcaact cctaccgctc aattagggtt gcctgaactt 420 cagctcggaa taattcctgg ttttggagga acacaacggc ttccacgtct tgttggtctc 480 tcaaaggccc tagaaatgat gttgacgtca aagccaatta aaggacaaga agctcattct 540 ttggggttag tggatgccat tgtccctccc gaagagttga tcaacactgc acgtagatgg 600 gctcttgaaa tcctagagcg gagaagacca tgggttcaca gtcttcacag gactgacaag 660 ttagagtctc ttgctgaggc taggaaaata tttaacttag ctagagctca ggcaaagaaa 720 cagtacccaa atcttaagca tacaattgcc tgcattgatg ctgttgaaac gggtgtcgtc 780 tctggccctc gtgctggact ttggaaggag gctgaagaat ttcagggact cctacattct 840 gatacttgca aaagcttaat tcatatcttc tttgcccagc gttcaacaac taaggtacct 900 ggagttactg atctgggttt ggtaccgaga caaatcaaga aagttgctat tgtcggagga 960 ggattaatgg gatctggtat agctacagca ttgattctta gcaactatca tgtggtactt 1020 aaagaagtga acgataagtt cttgcaggct ggcattgaca gagtcagagc aaacctacaa 1080 agccgagtca aaaaagggaa tatgactaat gagaaattcg aaasgagtat ttctttactc 1140 aagggagttc ttaactacga aagttttaaa gatgtagata tggtgataga ggctgttatt 1200 gagaatgttt ctttgaagca acaaatcttt tctgatcttg aaaaatattg ccccccacat 1260 tgcatgcttg ctactaatac ttccacaata gacttggagt tgattggaga gagaataaag 1320 tctcgtgaca ggattattgg aacagctgca caagtgattg ttgatctgct agatgttggg 1380 aagaatataa agaaaacacc agttgtcgtt ggaaattgta caggttttgc tgtcaacaga 1440 atgttttttc cctactctca ggctgcaatt ttacttgcag aacatggggt agatccctat 1500 cagattgaca gggctatttc caagtttgga atgccaatgg gacccttcag gttgtgcgac 1560 cttgttggtt ttggtgtggc agcagcaact gccagtcagt ttgttcaagc ttttccagaa 1620 agaacttata aatcgatgct aattcctctg atgcaagagg ataagaatgc aggtgaatcc 1680 actcgtaaag gtttctatgt ctatgacaag aaccgaaaag ctgggccaaa tccagagtta 1740 aagaaatata tcgagaaggc taggaacagt tctggtgttt cagttgatcc taagctcaca 1800 aagttacccg aaaaggacat tgtggagatg atatttttcc cagtggtgaa tgaagcatgt 1860 cgtgtccttg ccgaaggcat agcagtcaaa gcagctgacc tggacattgc tggtgtaatg 1920 ggaatgggtt tcccctccta cggggaggac tcatgttctg ggcggattct cttggatcaa 1980 attacatcta ttcaaggttg gaggaatggt cgaaacagta tggtggattt ttcaagccct 2040 gtggatactt agctgaaagg gctgttcagg gtgcaactct gagtgctccg ggtggccatg 2100 ctaaacctcg aatgtaagct catttcttta gtcctgctca catcatgccg ctattggaaa 2160 ttgtccgtac caagcat 2177 <210> 4 <211> 725 <212> PRT
<213> Cucumber <220>
<221> PEPTIDE
<222> (1)..(725) <223> tetrafunctional beta-oxidation protein <400> 4 Met Gly Ser Asn Ala Lys Gly Arg Thr Val Mat Glu Val Gly Thr Asp Gly Val Ala Ile Ile Thr Ile Ile Asn Pro Pro Val Asn Sar Leu Ser Phe Asp Val Lau Phe Ser Leu Arg Asp Sar Tyr Glu Gln Ala Leu Arg Arg Asp Asp Val Lys Ala Ile Val Val Thr Gly Als Lya Gly Lys Phe Ser Gly Gly Phe Asp Ile Thr Ala Phe Gly Val Leu Gln Gly Gly Lys Gly Glu Gln Pro Asn Val Arg Asn Ile Ser Ile Glu Met Ile Thr Asp Ile Phe Glu Ala Ala Arg Lys Pro Ala Val Ala Ala Ile Asp Gly Leu Ala Leu Gly Gly Gly Leu Glu Val Ala Met Ala Cys His Ala Arg Ile Ser Thr Pro Thr Ala Gln Leu Gly Leu Pro Glu Lau Gln Leu Gly Ile Ile Pro Gly Phe Gly Gly Thr Gln Arg Lau Pro Arg Leu Val C~ly Leu Ser Lys Ala Leu Glu Met Met Leu Thr Ser Lys Pro Ile Lys Gly Gln Glu Ala His Ser Leu Gly Leu Val Asp Ala Ile Val Pro Pro Glu Glu Leu Ile Asn Thr Ala Arg Arg Trp Ala Leu Glu Ile Leu Glu Arg Arg Arg Pro Trp Val His Ser Leu His Arg Thr Asp Lys Leu Glu Ser Leu Ala Glu Ala Arg Lys Ile Phe Asn Lau Ala Arg Ala Gln Ala Lys Lys Gln Tyr Pro Aan Leu Lys His Thr Ile Ala Cys Ile Asp Ala Val Glu Thr Gly Val Val Sar Gly Pro Arg Ala Gly Leu Trp Lys Glu Ala Glu 260 265 270' Glu Phe Gln Gly Leu Leu His Ser Asp Thr Cys Lys Ser Leu Ile His Ile Phe Phe Ala Gln Arg Ser Thr Thr Lys Val Pro Gly Val Thr Asp Leu Gly Leu Val Pro Arg Gln Ile Lys Lys Val Ala Ile Val Gly Gly Gly Leu Met Gly Ser Gly Ile Ala Thr Ala Leu Ile Leu Ser Asn Tyr His Val Val Leu Lys Glu Val Asn Asp Lys Phe Leu Gln Ala Gly Ile Asp Arg Val Arg Ala Asn Leu Gln Sar Arg Val Lys Lys Gly Asn Met Thr Asn Glu Lys Phe Glu Lys Ser Ile Ser Leu Leu Lys Gly Val Leu Asn Tyr Glu Ser Phe Lys Asp Val Asp Mat Val Ile Glu Ala Val Ile Glu Asn Val Ser Leu Lys Gln Gln Ile Phe Ser Asp Leu Glu Lys Tyr Cys Pro Pro His Cys Mat Leu Ala Thr Asn Thr Ser Thr Ile Asp Leu Glu Leu Ile Gly Glu Arg Ile Lys Ser Arg Asp Arg Ile Ile Gly His Thr Ala Ala Gln Val Ile Val Asp Leu Leu Asp Val Gly Lys Asn Ile Lys Lys Thr Pro Val Val Val Gly Asn Cys Thr Gly Phe Ala Val Asn Arg Met Phe Phe Pro Tyr Ser Gln Ala Ala Ile Leu Leu Ala Glu His Gly Val Asp Pro Tyr Gln Ile Asp Arg Ala Ile Ser Lys Phe Gly Met Pro Met Gly Pro Phe Arg Leu Cys Asp Leu Val Gly Phe Gly Val Ala Ala Ala Thr Ala Ser Gln Phe Val Gln Ala Phe Pro Glu Arg Thr Tyr Lys Ser Met Leu Ile Pro Leu Met Gln Glu Asp Lys Asn Ala Gly Glu Ser Thr Arg Lys Gly Phe Tyr Val Tyr Asp Lys Asn Arg Lys Ala Gly Pro Asn Pro Glu Leu Lys Lys Tyr Ile Glu Lys Ala Arg Asn Ser Ser Gly Val Ser Val Asp Pro Lys Leu Thr Lys Leu Pro Glu Lys Asp Ile Val Glu Met Ile Phe Phe Pro Val Val Asn Glu Ala Cys Arg Val Leu Ala Glu Gly Ile Ala Val Lys Ala Ala Asp Leu Asp Ile Ala Gly Val Met Gly Met Gly Phe Pro Ser Tyr Arg Gly Gly Leu Met Phe Trp Ala Asp Ser Leu Gly Ser Asn Tyr Ile Tyr Ser Arg Leu Glu Glu Trp Ser Lys Gln Tyr Gly Gly Phe Phe Lys Pro Cys Gly Tyr Leu Ala Glu Arg Ala Val Gln Gly Ala Thr Leu Ser Ala Pro Gly Gly His Ala Lys Pro Arg Mat Ala His Phe Phe Sar Pro Ala His Ile Met Pro Leu Leu Glu Ile Val Arg Thr Lys <210> 5 <211> 21 <212> DNA
<213> Artificial Sequence <220>
,. .~ . <223> Description of Artificial Sequence:
oligonucleotide primer- Primer 1 <400> 5 gatgggccgc tccaagggtg g 21 <210> 6 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Seguence:
oligonucleotide primer- Primer 2 <400> 6 caacccgaag gtgccgccat t 21 <210> 7 <211> 3907 <212> DNA
<213> Pseudomonas putida <220>
<221> gene <222> (1)..(3907) <223> faoAB gene sequeace <400> 7 gatggcgttt ttactgaaaa tttgcctccg gccatagaat ctcctacggg ggcatccagg 60 ctggtcaatc tgttctgtaa cgacaaagcg gcggctcggc ataaccctga aggggtgggg 120 tgccaagtcg ccgctttgcg ttctgctgcg cagaaaaggc caggcaggcc gggttattca 180 ttcaagcaag tgccagtgta gtcgccttgg ctcttcgcga agcgcaaagc aataagccga 240 tgcttgtggt caatcgctga catgaatgac cggcgatcac tgtcatcatg ttttgtaccc 300 ggacgataga aaatgactgg aaagtcgcgt tttcgccgtg ctcgtgagca tttgacaacg 360 cccatctttt cggagaatgt gtgcacaccc aaatcaaacg ggcgtatgaa ttgagcgttt 420 gtatgtcatg acgccaagcc atacttccga ctagcgcgct gacgggtgtg cctggggcaa 480 ggcagcttag gcctgcttct gctgcagcga tcggcgtgta cagttcagct tccatatcgt 540 ggagatcagt tgatgattta cgaaggtaaa gccatcacgg ttaaggctct tgaaagtggc 600 atcgtcgagc tcaagttcga cctcaagggt gagtccgtca acaagttcaa ccgccttacc 660 ctgaacgagc tgcgccaggc cgtcgatgcc atccgggccg atgcttcggt caagggcgtg 720 atcgtcagga gtggcaagga cgtgttcatc gtcggcgccg acatcaccga gttcgtcgac 780 aacttcaagc tgcctgaggc cgaactggtc gctggcaacc tggaagccaa tcgcatcttc 840 aacgcgttcg aagacctcga agtgccgacc gttgccgcca tcaacggcat cgcgctgggc 900 ggcggcctgg aaatgtgcct ggcggccgac taccgggtca tgtccaccag cgccaggatc 960 ggcctgccgg aagtcaagct gggtatctac ccgggctttg gcggtaccgt gcgcctgccg 1020 cgcctgatcg gctcggacaa cgccatcgag tggatcgccg ccggcaagga aaaccgtgcc 1080 gaagatgccc tgaaagtggg ggccgtcgac gccgtggtcg cccctgagct gctgctggcc 1140 w ° ggtgccctcg'-'acctgatcaa gcgtgccatc agtggcgagc 'tgg~ctacaa ggccaagcgc 1200 cagccgaagc tggaaaagct caagctcaat gccatcgagc agatgatggc cttcgagact 1260 gccaagggct tcgtcgctgg ccaggccggc ccgaactacc cggccccggt cgaagcgatc 1320 aagagcatcc agaaagccgc caacttcggt cgcgacaagg ccctggaagt cgaagccgca 1380 ggctttgcca agctggccaa gacctctgtc gccgagagcc tgatcggctt gttcctcaac 1440 gatcaggaac tcaagcgcaa ggccaaggcg catgacgaga tcgcccacga cgtgaagcag 1500 gccgccgtgc tcggcgccgg catcatgggc ggcggtatcg cctaccagtc ggcggtcaaa 1560 ggtacgccga tcctgatgaa ggacatccgc gaggaagcca ttcagctggg tctgaacgag 1620 gcctccaagt tgcttggcaa ccgcgtcgag aagggccgcc tgaccccggc caagatggcc 1680 gaggccctca acgccattcg cccgaccctg tcctatggcg atttcgccaa tgtcgacatc 1740 gtcgtcgagg ctgtggtcga gaacccgaag gtcaagcaag cggtactggc ggaagtggaa 1800 ggccaggtga aggacgatgc gatcctcgct tccaacacct ctaccatctc catcaacctg 1860 ctggccaagg cgctcaagcg cccggaaaac ttcgtcggca tgcacttctt caacccggtg 1920 il cacatgatgc cgctggttga agtgatccgt ggcgagaagt ccagtgacgt ggcggtcgcc 1980 accaccgtgg cctacgccaa gaaaatgggc aagaacccga tcgtggtcaa cgactgcccg 2040 ggctttttgg tcaaccgcgt gctgttcccg tactttggcg gttttgccaa gctggtcagc 2100 gccggtgtcg acttcgtgcg catcgacaag gtcatggaga agttcggctg gccgatgggc 2160 ccagcctact tgatggacgt ggtcggcatc gacaccggcc accacggccg tgacgtcatg 2220 gccgaaggct tcccggatcg catgaaggac gagcgccgct cggcagtcga cgcgttgtac 2280 gaggccaacc gcctgggcca gaagaacggt aagggcttct acgcctacga aaccgacaag 2340 cgcggcaagc cgaagaaggt cttcgatgcc sccgtgctcg acgtgctcaa accgatcgtg 2400 ttcgagcagc gtgaagtcac tgacgaagac atcatcaact ggatgatggt cccgctgtgc 2460 cttgagaccg tgcgttgcct ggaagacggc atcgtcgaaa ccgctgccga agccgacatg 2520 ggcctggtct acggcattgg tttccctccc ttccgcggtg gtgcgctgcg ttacatcgac 2580 tcgatcggtg tggccgaatt cgtcgccctg gccgatcagt atgccgacct ggggccgctg 2640 taccacccga ccgccaagct gcgtgaaatg gccaagaacg gccagcgctt cttcaactga 2700 gcggtcaacg agctagagcg agagatttga tatgagcctg aatccaagag acgtggtgat 2760 tgtcgacttc ggtcgcacgc caatgggccg ctccaagggt ggcatgcacc gcaacacccg 2820 cgccgaagac atgtcggcgc acctgatcag caagctgctg gaacgcaacg gcaaggtcga 2880 cccgaaagaa gtcgaggacg tgatctgggg ctgcgtcaac cagaccctgg agcagggctg 2940 gaacatcgcc cgcatggctt cgctgatgac cccgatcccg cacacctctg cggcgcagac 3000 cgtcagccgc ctgtgcggct cgtccatgag cgcgctgcac acggccgccc aggcgatcat 3060 gaccggtaac ggtgatgtgt tcgtggtcgg tggcgtggag cacatgggcc acgtcagcat 3120 gatgcatggc gtagacccca acccgcacct gtccttgcat gccgccaagg cttccgggat 3180 gatgggcctg actgcagaaa tgctcggcaa gatgcacggc atcacccgtg agcagcagga 3240 cctgttcggc ttgcgttcgc accagctggc ccacaaggcc acggtcgaag gcaagttcaa 3300 ggacgagatc atcccgatgc agggctacga cgagaacggc ttcctgaagg tgttcgattt 3360 cgacgaaacc attcgcccgg aaaccaccct cgaaggcctg gcatcgctca agcctgcgtt 3420 caacccgaaa ggcggtacgg tcacggccgg tacctcgtcg cagatcaccg acggcgcctc 3480 gtgcatgatc gtcatgtccg gtcagcgtgc catggacctc ggtatccagc cattggcggt 3540 gatccgttcg atggcagtgg ccggtgtcga cccggcaatc atgggctacg gcccggtgcc 3600 atcgacccag aaagccctca agcgtgcggg cttgaccatg gccgatatcg acttcatcga 3660 gctcaacgaa gccttcgctg cgcaggccct gcccgtgctg aaagacttga aagtgctcga 3720 caagatggat gagaaggtta acctgcacgg cggcgccatt gctttgggcc acccgttcgg 3780 ttgctccggg gcgcggattt ccggcaccct gctcaacgtc atgaagcaaa atggcggtac 3840 gctgggtgtt gcgaccatgt gcgtcggcct gggccaaggt atcaccactg tcttcgaacg 3900 cgtctga 3907 <210> g _ , ...~.. _", .. ... . ._ . ,.
<211> 27 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence oligonucleotide primer- Primer (iVR471 <400> 8 cggtacccat tgtactccca gtatcat 27 <210> 9 <211> 27 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer GVR472 <400> 9 catttaaata gtagagtatt gaatatg 27 <210> 10 <211> 1558 <212> DNA
<213> Bean Phaseolin <220>
<221> promoter <222> (1)..(1558) <400> 10 cggtacccat tgtactccca gtatcattat agtgaaagtt ttggctctct cgccggtggt 60 tttttacctc tatttaaagg ggttttccac ctaaaaattc tggtatcatt ctcactttac 120 ttgttacttt aatttctcat aatctttggt tgaaattatc acgcttccgc acacgatatc 180 cctacaaatt tattatttgt taaacatttt caaaccgcat aaaattttat gaagtcccgt 240 ctatctttaa tgtagtctaa cattttcata ttgaaatata taatttactt aattttagcg 300 ttggtagaaa gcataaagat ttattcttat tcttcttcat ataaatgttt aatatacaat 360 ataaacaaat tctttacctt aagaaggatt tcccatttta tattttaaaa atatatttat 420 caaatatttt tcaaccacgt aaatctcata ataataagtt gtttcaaaag taataaaatt 480 taactccata atttttttat tcgactgatc ttsaagcaac acccagtgac acaactagcc 540 atttttttct ttgaataaaa aaatccaatt atcattgtat tttttttata caatgaaaat 600 ttcaccaaac aatcatttgt ggtatttctg aagcaagtca tgttatgcaa aattctataa 660 ttcccatttg acactacgga agtaactgaa gatctgcttt tacatgcgag acacatcttc 720 taaagtaatt ttaataatag ttactatatt caagatttca tatatcaaat actcaatatt 780 acttctaaaa aattaattag atataattaa astattactt ttttaatttt aagtttaatt 840 --~.f gttgaatttg tgactattgart~ttattattc tactatgttt aaattgtttt atagatagtt'900 taaagtaaat ataagtaatg tagtagagtg ttagagtgtt accctaaacc ataaactata 960 acatttatgg tggactaatt ttcatatatt tcttattgct tttacctttt cttggtatgt 1020 aagtccgtaa ctagaattac agtgggttgc catggcactc tgtggtcttt tggttcatgc 1080 atgggtcttg cgcaagaaaa agacaaagaa caaagaaaaa agacaaaaca gagagacaaa 1140 acgcaatcac acaaccaact caaattagtc actggctgat caagatcgcc gcgtccatgt 1200 atgtctaaat gccatgcaaa gcaacacgtg cttaacatgc actttaaatg gctcacccat 1260 ctcaacccac acacaaacac attgcctttt tcttcatcat caccacaacc acctgtatat 1320 attcattctc ttccgccacc tcaatttctt cacttcaaca cacgtcaacc tgcatatgcg 1380 tgtcatccca tgcccaaatc tccatgcatg ttccaaccac cttctctctt atataatacc 1440 tataaatacc tctaatatca ctcacttctt tcatcatcca tccatccaga gtactactac 1500 tctactacta taatacccca acccaactca tattcaatac tactctacta tttaaatg 1558 <210>11 <211>983 <212>DNA
<213>Soybean oleosin <220>
<221> promoter <222> (1)..(983) <400> 11 tctagataca tccatttctt aatataatcc tcttattcaa attgcaattg cccagatctc 60 tgtatggact atggctcgag gaatccatac atagagacat tecccactcg tatacttgta 120 tctgtcaaat gcctattgtt gttgaaataa agtaacttat gcatcaattt aaaacaatca 180 gttaggtaat ggacattaat gtaaagtgac ttatagaatc catatttgta taacaaagat 240 gttgagacat tgactattgt gtgctatttc aaaaagatct gataaaatat ttaataaaac 300 attaaaagta aataataatc gctaaacatc atcaaaatat taatcattag attcaatttt 360 tgcaattaaa accagaagaa ctaaatctac attaccatac ttaagactat acatagagaa 420 ttaaacttaa tgttatattt aaggaaaagg acgaaactta naacttaaat acaattgtat 480 gattaatttt agtattgtct ttaatgagaa ttaaagtttt attcactaat ttatgattat 540 ttcatttact aatttatgta atgtgatttc aataagtgag gtsaactccg attgattgaa 600 gataccacca acaccaacac caccaccacc tgcgaaactg tacgtatctc aattgtcctt 660 aataaaaatg taaatagtac attattctcc ttgcctgtca ttatttatgt gcccccagct 720 ttaatttttc tgatgtactt aacccagggc aaactgaaac atgttcctca tgcasagccc 780 caactcatca tgcatcatgt acgtgtcatc atccagcaac tccacttttg ctatataact 840 cctcccccat cacactcccc atctctctaa cacacacata cccccaacta acaataattc 900 cttcacttgc agaacttagt tctctgttgc atcatcatca tcttcattag tgttagccct 960 aacttcacct taaccattta atg 983 <210> 12 <211> 38 <212> DNA
<213> Artificial Seguence <220>
<223> Description of Artificial Sequence:
< , ... ,. " a ~, oligonucleotide primer- Primer JA408 , _ . . . s , " . . _ ".. , ... . . :. ., <400> 12 tctagataca tccatttett aatataatcc tcttattc 38 <210> 13 <211> 30 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- npl <400> 13 catttaaatg gttaaggtga aggtagggct 30 <210> 14 <211> 30 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequences oligonucleotide primer- Primer apt <400> 14 aagcttaaaa tgatttacga aggtaaagcc 30 <210> 15 <211> 22 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer np3 <400> 15 attgctttca gttgaagcgc tg 22 <210> 16 <211> 30 <212> DNA
<213> Artificial Sequence <a2o>
<223> Description of Artificial Sequence:
,.":, : . , of gonucleotide primer- Primer ap4 . . ~ , ,. , . . . . . .
<400> 16 aagcttaaaa tgagcctgaa tccaagagac 30 <210>17 <211>28 <212>DNA
<213>Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer np5 <400> 17 aagctttcag acgcgttcga agacagtg 28 <210> 18 <211> 56 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer N-fox2b <400> 18 tcccccggga ggaggttttt attatgcctg gaaatttatc cttcaaagat agagtt 56 <210> 19 <211> 29 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequences oligonucleotide primer- Primer N-bamfox2b <400> 19 aaggatcctt gatgtcattt acaactacc 29 <210> 20 <211> 28 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer C-fox2~ 't,~'e" , ~ ~ ' ~ .
<400> 20 gctctagata gggaaagatg tatgtaag 28 <210> 21 <211> 19 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer C-bamfox2 WO 99/45122 PC'T/US99/04999 <400> 21 tgacatcaag gatcctttt 19 <210> 22 <211> 39 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer CiVR396 <400> 22 gatttaaatg caagcttaaa taagtatgaa ctaaaatgc 39 <210> 23 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequences oligonucleotide primer- Primer G1VR397 <400> 23 cggtacctta gttggtaggg tgcta 25 <210> 24 <211> 2162 <212> DNA
<213> Pseudomonas putida <220>
<221> gene <222> (1) .. (216-2) . , ~ ,..,. . , <223> FaoA gene sequence <400> 24 aagcttaaat gatttacgaa ggtaaagcca tcacggttaa ggctcttgaa agtggcatcg 60 tcgagctcaa gttcgacctc aagggtgagt ccgtcaacaa gttcaaccgc cttaccctga 120 acgagctgcg ccaggccgtc gatgccatcc gggccgatgc ttcggtcaag ggcgtgatcg 180 tcaggagtgg caaggacgtg ttcatcgtcg gcgccgacat caccgagttc gtcgacaact 240 tcaagctgcc tgaggccgaa ctggtcgctg gcaacctgga agccaatcgc atcttcaacg 300 cgttcgaaga cctcgaagtg ccgaccgttg ccgccatcaa cggcatcgcg ctgggcggcg 360 gcctggaaat gtgcctggcg gccgactacc gggtcatgtc caccagcgcc aggatcggcc 420 tgccggaagt caagctgggt atctacccgg gctttggcgg taccgtgcgc ctgccgcgcc 480 tgatcggctc ggacaacgcc atcgagtgga tcgccgccgg caaggaaaac cgtgccgaag 540 atgccctgaa agtgggggcc gtcgacgccg tggtcgcccc tgagctgctg ctggccggtg 600 ccctcgacct gatcaagcgt gccatcagtg gcgagctgga ctacaaggccwaagcgccagc 660 cgaagctgga aaagctcaag ctcaatgcca tcgagcagat gatggccttc gagactgcca 720 agggcttcgt cgctggccag gccggcccga actacccggc cccggtcgaa gcgatcaaga 780 gcatccagaa agccgccaac ttcggtcgcg acaaggccct ggaagtcgaa gccgcaggct 840 ttgccaagct ggccaagacc tctgtcgccg agagcctgat cggcttgttc ctcaacgatc 900 aggaactcaa gcgcaaggcc aaggcgcatg acgagatcgc ccacgacgtg aagcaggccg 960 ccgtgctcgg cgccggcatc atgggcggcg gtatcgccta ccagtcggcg gtcaaaggta 1020 cgccgatcct gatgaaggac atccgcgagg aagccattca gctgggtctg aacgaggcct 1080 ccaagttgct tggcaaccgc gtcgagaagg gccgcctgac cccggccaag atggccgagg 1140 ccctcaacgc cattcgcccg accctgtcct atggcgattt cgccaatgtc gacatcgtcg 1200 tcgaggctgt ggtcgagaac ccgaaggtca agcaagcggt actggcggaa gtggaaggcc 1260 aggtgaagga cgatgcgatc ctcgcttcca acacctctac catctccatc aacctgctgg 1320 ccaaggcgct caagcgcccg gaaaacttcg tcggcatgca cttcttcaac ccggtgcaca 1380 tgatgccgct ggttgaagtg atccgtggcg agaagtccag tgacgtggcg gtcgccacca 1440 ccgtggccta cgccaagaaa atgggcaaga acccgatcgt ggtcaacgac tgcccgggct 1500 ttttggtcaa ccgcgtgctg ttcccgtact ttggcggttt tgccaagctg gtcagcgccg 1560 gtgtcgactt cgtgcgcatc gacsaggtca tggagaagtt cggctggccg atgggcccag 1620 cctacttgat ggacgtggtc ggcatcgaca ccggccacca cggccgtgac gtcatggccg 1680 aaggcttccc ggatcgcatg aaggacgagc gccgctcggc agtcgacgcg ttgtacgagg 1740 ccaaccgcct gggccagaag aacggtaagg gcttctacgc ctacgaaacc gacaagcgcg 1800 gcaagccgaa gaaggtcttc gatgccaccg tgctcgacgt gctcaaaccg atcgtgttcg 1860 agcagcgtga agtcactgac gaagacatca tcaactggat gatggtcccg ctgtgccttg 1920 agaccgtgcg ttgcctggaa gacggcatcg tcgsaaccgc tgccgaagcc gacatgggcc 1980 tggtctacgg cattggtttc cctcccttcc gcggtggtgc gctgcgttac atcgactcga 2040 tcggtgtggc cgaattcgtc gccctggccg atcagtatgc cgacctgggg ccgctgtacc 2100 acccgaccgc caagctgcgt gaaatggcca agaacggcca gcgcttcttc aactgaaagc 2160 tt 2162 <210> 25 <211> 1190 <212> DNA
<213> Pseudomonas putida <220>
<221> gene .
<222> (1)..(1190) <223> FaoB gene sequence <400> 25 aagcttaaat gagcctgaat ccaagagacg tggtgattgt cgacttcggt cgcacgccaa 60 tgggccgctc caagggtggc atgcaccgca acacccgcgc cgaagacatg tcggcgcacc 120 tgatcagcaa gctgctggaa cgcaacggca aggtcgaccc gaaagaagtc gaggacgtga 180 tctggggctg cgtcaaccag accctggagc agggctggaa catcgcccgc atggcttcgc 240 tgatgacccc gatcccgcac acctctgcgg cgcagaccgt cagccgcctg tgcggctcgt 300 ccatgagcgc gctgcacacg gccgcccagg cgatcatgac cggtaacggt gatgtgttcg 360 tggtcggtgg cgtggagcac atgggccacg tcagcatgat gcatggcgta gaccccaacc 420 cgcacctgtc cttgcatgcc gccaaggctt ccgggatgat gggcctgact gcagaaatgc 480 tcggcaagat gcacggcatc acccgtgagc agcaggacct gttcggcttg cgttcgcacc 540 agctggccca caaggccacg gtcgaaggca agttcaagga cgagatcatc~ccgatgcagg 600 gctacgacga gaacggcttc ctgaaggtgt tcgatttcga cgaaaccatt cgcccggaaa 660 ccaccctcga aggcctggca tcgctcaagc ctgcgttcaa cccgaaaggc ggtacggtca 720 cggccggtac ctcgtcgcag atcaccgacg gcgcctcgtg catgatcgtc atgtccggtc 780 agcgtgccat ggacctcggt atccagccat tggcggtgat ccgttcgatg gcagtggccg 840 gtgtcgaccc ggcaatcatg ggctacggcc cggtgccatc gacccagaaa gccctcaagc 900 gtgcgggctt gaccatggcc gatatcgact tcatcgagct caacgaagcc ttcgctgcgc 960 aggccctgcc cgtgctgaaa gacttgaaag tgctcgacsa gatggatgag aaggttaacc 1020 tgcacggcgg cgccattgct ttgggccacc cgttcggttg ctccggggcg cggatttccg 1080 gcaccctgct caacgtcatg aagcaaaatg gcggtacgct gggtgttgcg accatgtgcg 1140 tcggcctggg ccaaggtatc accactgtct tcgaacgcgt ctgaaagctt 1190 <210> 26 <211> 391 <212> PRT
<213> Pseudomonas putida <220>
<221> P$PTID$
<222> (1) .. (391) <223> FaoB amino acid sequence <400> 26 Met Ser Leu Asn Pro Arg Asp Val Val Ile Val Asp Phe Gly Arg Thr Pro Met Gly Arg Ser Lys Gly Gly Mat His Arg Asn Thr Arg Ala Glu Asp Mat Ser Ala His Leu Ile Ser Lys Leu Leu Glu Arg Asn Gly Lys Val Asp Pro Lys Glu Val Glu Asp Val Ile Trp Gly Cys Val Asn Gln Thr Leu Glu Gln Gly Trp Asa Ile Ala Arg Met Ala Ser Leu Met Thr Pro Ile Pro His Thr Ser Ala Ala Gln Thr Val Ser Arg Leu Cys Gly Ser Ser Mat Ser Ala Leu His Thr Ala Ala Gln Ala Ile Met Thr Gly Asn Gly Asp Val Phe Val Val Gly Gly Val Glu His Met Gly His Val Ser Mat Met His Gly Val Asp Pro Asn Pro His Leu Ser Leu His Ala Ala Lys Ala Ser Gly Met Met Gly Leu Thr Ala Glu Met Leu Gly Lys Met His Gly Ile Thr Arg Glu Gln Gln Asp Leu Phe Gly Leu Arg Ser His Gln Lau Ala His Lys Ala Thr Val Glu Gly Lys Phe Lys Asp Glu Ile Ile Pro Met Gln Gly Tyr Asp Glu Asn Gly Phe Lau Lys Val Phe Asp Phe Asp Glu Thr Ile Arg Pro Glu Thr Thr Leu Glu Gly Leu Ala Ser Leu Lys Pro Ala Phe Asn Pro Lys Gly Gly Thr Val Thr Ala Gly Thr Ser Ser Gln Ile Thr Asp Gly Ala Ser Cys Mat Ile Val Met Ser Gly Gln Arg Ala Met Asp Leu Gly Ile Gln Pro Leu Ala Val Ile Arg Ser Met Ala Val Ala Gly Val Asp Pro Ala Ile Met Gly Tyr Gly Pro Val Pro Ser Thr Gln Lys Ala Leu Lys Arg Ala Gly Leu Thr Met Ala Aap Ile Asp Phe Ile Glu Leu Asn Glu Ala Phe Ala Als Gln Ala Leu Pro Val Leu Lys Asp Leu Lys Val Leu Asp Lys Met Asp Glu Lys Val Asn Leu His Gly Gly Ala Ile Ala Leu Gly His Pro Phe Gly Cys Ser Gly Ala Arg Ile Ser Gly Thr Leu Leu Asn Val Met Lys Gln Asn Gly Gly Thr Leu Gly Val Ala Thr Mat Cys Val Gly Leu Gly Gln Gly Ile Thr Thr Val Phe Glu Arg Val <210> 27 <211> 1244 <212> DNA
<213> Bean Phaseolin <220>
<221> termiaator <222> (1)..(1244) <400> 27 gatttaaatg caagcttaaa taagtatgaa ctaaaatgca tgtaggtgta agagctcatg 60 gagagcatgg aatattgtat ccgaccatgt aacagtataa taactgagct ccatctcact 120 tcttctatga ataaacaaag gatgttatga tatattaaca ctctatctat gcaccttatt 180 gttctatgat aaatttcctc ttattattat aaatcatctg aatcgtgacg gcttatggaa 240 tgcttcaaat agtacaaaaa caaatgtgta ctataagact ttctaaacaa ttctaacttt 300 agcattgtga acgagacata agtgttaaga agacataaca attataatgg aagaagtttg 360 tctccattta tatattatat attacccact tatgtattat attaggatgt taaggagaca 420 taacaattat aaagagagaa gtttgtatcc atttatatat tatatactac ccatttatat 480 attatactta tccacttatt taatgtcttt ataaggtttg atccatgata tttetaatat 540 tttagttgat atgtatatga aagggtacta tttgaactct cttactctgt ataaaggttg 600 gatcatcctt aaagtgggtc tatttaattt tattgcttct tacagataaa aaaaaaatta 660 tgagttggtt tgataaaata ttgaaggatt taaaataata ataaataata aataacatat 720 aatatatgta tataaattta ttataatata acatttatct ataaaaaagt aaatattgtc 780 ataaatctat acaatcgttt agccttgctg gacgactctc aattatttaa acgagagtaa 840 acatatttga ctttttggtt atttaacaaa ttattattta acactatatg aaattttttt 900 tttttatcgg caaggaaata aaattaaatt aggagggaca atggtgtgtc ccaatcctta 960 tacaaccaac ttccacagga aggtcaggtc ggggacaaca aaaaaacagg caagggaaat 1020 tttttaattt gggttgtctt gtttgctgca taatttatgc agtaaaacac tacacataac 1080 ccttttagca gtagagcaat ggttgaccgt gtgcttagct tcttttattt tattttttta 1140 tcagcaaaga ataaataaaa taaaatgaga cacttcaggg atgtttcaac ccttatacaa 1200 aaccccaaaa acaagtttcc tagcacccta ccaactaagg tacc 1244 <210> 28 <211> 225 <212> DNA
<213> Soybean oleosin <220>
<221> tex~iaator <222> (1)..(225) <400> 28 aagcttacgt gatgagtatt aatgtgttgt tatgaactta tgatgttggt ttatgtgggg 60 aaataaatga tgtatgtacc tcttcttgcc tatgtagtag gtttgggtgt tttgttgtct 120 agctttgctt atttagtaat tagtagaagg gatgttcgtt cgtgtctcat aaaaaggggt 180 actaccactc tggcaatgtg atttgtattt gatgaattgt ctaga 225 <210> 29 <211> 34 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer JA410 <400> 29 aagcttacgt gatgagtatt aatgtgttgt tatg 34 <210> 30 <211> 34 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:
oligonucleotide primer- Primer JA411 <400> 30 tctagacaat tcatcaaata caaatcacat tgcc 34 <210> 31 <211> 715 <212> PRT
<213> Pseudomonas putida <220>
<221> PgPTIDE
<222> (1) . . (715) <223> FaoA amino acid sequence <400> 31 Mat Ile Tyr Glu Gly Lys Ala Ile Thr Val Lys Ala Leu Glu Ser Gly Ile Val Glu Leu Lys Phe Asp Leu Lys Gly Glu Ser Val Asn Lys Phe Asn Arg Lau Thr Leu Asn Glu Leu Arg Gln Ala Val Asp Ala Ile Arg Ala Asp Ala Ser Val Lys Gly Val Ile Val Arg Ser Gly Lys Asp Val Phe Ile Val Gly Ala Asp Ile Thr Glu Phe Val Asp Asn Phe Lys Leu Pro Glu Ala Glu Leu Val Ala Gly Asn Leu Glu Ala Asn Arg Ile Phe Asn Ala Phe Glu Asp Leu Glu Val Pro Thr Val Ala Ala Ile Asn Gly Ile Ala Leu Gly Gly Gly Leu Glu Met Cys Leu Ala Ala Aap Tyr Arg Val Mat Ser Thr Ser Ala Arg Ile Gly Leu Pro Glu Val Lys Leu Gly Ile Tyr Pro Gly Phe Gly Gly Thr Val Arg Leu Pro Arg Leu Ile Gly Ser Asp Asn Ala Ile Glu Trp Ile Ala Ala Gly Lys Glu Asn Arg Ala Glu Asp Ala Leu Lys Val Gly Ala Val Asp Ala Val Val Ala Pro Glu Leu Leu Leu Ala Gly Ala Leu Asp Leu Ile Lys Arg Ala Ile Ser Gly Glu Leu Asp Tyr Lys Ala Lys Arg Gln Pro Lys Leu Glu Lys Leu Lys Leu Asn Ala Ile Glu Gln Met Met Ala Phe Glu Thr Ala Lys Gly Phe wVal-Ala~Gly Gln Ala Gly Pro Asn Tyr Pro Ala Pro Val Glu Ala Ile Lys Ser Ile Gln Lys Ala Als Asn Pha Gly Arg Asp Lys Ala Leu Glu Val Glu Ala Ala Gly Phe Ala Lys Leu Ala Lys Thr Ser Val Ala Glu Ser Leu Ile Gly Leu Phe Leu Asn Asp Gln Glu Leu Lys Arg Lys Ala Lys Ala His Asp Glu Ile Ala His Asp Val Lys ~Gln Ala Ala Val Leu Gly Ala Gly Ile Met Gly Gly Gly Ile Ala Tyr Gln Ser Ala Val Lys Gly Thr Pro Ile Leu Mat Lys Asp Ile Arg Glu Glu Ala Ile Gln Leu Gly Leu Asn Glu Ala Ser Lys Leu Leu Gly Asn Arg Val Glu Lys Gly Arg Leu Thr Pro Ala Lys Met Ala Glu Ala Leu Asn Ala Ile Arg Pro Thr Leu Ser Tyr Gly Asp Phe Ala Asn Val Asp Ile Val Val Glu Ala Val Val Glu Asn Pro Lys Val Lys Gln Ala Val Leu Ala Glu Val Glu Gly Gln Val Lys Asp Asp Ala Ile Leu Ala Ser Asn Thr Ser Thr Ile Ser Ile Asn Leu Leu Ala Lys Ala Leu Lys Arg Pro Glu Aan Phe Val Gly Met His Phe Phe Asn Pro Val His Met Met Pro Leu Val Glu Val Ile Arg Gly Glu Lys Ser Ser Asp Val Ala Val Ala Thr Thr Val Ala Tyr Ala Lys Lys Met Gly Lys Asn Pro Ile Val Val Asn Asp Cys Pro Gly''Phe'Leu'Val Asn Arg Val Leu Phe Pro Tyr Phe Gly Gly Phe Ala '' Lys Leu Val Ser Ala Gly Val Asp Phe Val Arg Ile Asp Lys Val Met Glu Lys Phe Gly Trp Pro Met Gly Pro Ala Tyr Leu Met Asp Val Val Gly Ile Asp Thr Gly His His Gly Arg Asp Val Met Ala Glu Gly Phe Pro Asp Arg Mat Lys Asp Glu Arg Arg Ser Ala val Asp Ala Leu Tyr WO 99!45122 PCTNS99/04999 Glu Ala Asn Arg Leu Gly Gln Lys Asn Gly Lys Gly Phe Tyr Ala Tyr Glu Thr Asp Lys Arg Gly Lys Pro Lys Lys Val Phe Aap Ala Thr Val Leu Asp Val Leu Lys Pro Ile Val Phe Glu Gla Arg Glu Val Thr Asp Glu Asp Ile Ile Asn Trp Met Met Val Pro Leu Cys Leu Glu Thr Val Arg Cys Leu Glu Asp Gly Ile Val Glu Thr Ala Ala Glu Ala Asp Met Gly Leu Val Tyr Gly Ile Gly Phe Pro Pro Phe Arg Gly Gly Ala Leu Arg Tyr Ile Asp Ser Ile Gly Val Ala Glu Phe Val Ala Leu Ala Asp Gln Tyr Ala Asp Leu Gly Pro Leu Tyr His Pro Thr Ala Lys Leu Arg Glu Mat Ala Lys Asn Gly Gln Arg Phe Phe Aan
Claims (24)
1. A method for manipulating the metabolism of a plant, comprising expressing transgenes encoding fatty acid .beta.-oxidation enzyme activities in cytosol, plastids other than the peroxisomes or glyoxisomes, or mitochondria of the plant.
2. The method of claim 1 wherein the fatty acid .beta.-oxidation enzymes are expressed from genes selected from the group consisting of bacterial, yeast, fungal, plant, and mammalian.
3. The method of claim 1 wherein the fatty acid .beta.-oxidation enzymes are expressed from genes from bacteria selected from the group consisting of Escherichia, Pseudomonas, Alcaligenes, and Coryneform.
4. The method claim 3 wherein the genes are Pseudomonas putida faoAB.
5. The method of claim 1 further comprising expressing genes of bacterial, fungal, yeast, plant or animal origin encoding enzymes selected from the group consisting of polyhydroxyalkanoate synthases, acetoacetyl-CoA reductases, .beta.-ketoacyl-CoA thiolases, and enoyl-CoA hydratases, wherein the enzymes encoded by these genes are directed to the cytosol, plastids other than the peroxisomes or glyoxisomes, or mitochondria of the plant.
6. A DNA construct for use in a method of manipulating the metabolism of a plant cell comprising, in phase, (a) a promoter region functional in a plant;
(b) a structural DNA sequence encoding at least one fatty acid .beta.-oxidation enzyme activity; and (c) a 3' nontranslated region of a gene naturally expressed in a plant, wherein the nontranslated region encodes a signal sequence for polyadenylation of mRNA.
(b) a structural DNA sequence encoding at least one fatty acid .beta.-oxidation enzyme activity; and (c) a 3' nontranslated region of a gene naturally expressed in a plant, wherein the nontranslated region encodes a signal sequence for polyadenylation of mRNA.
7. The DNA construct of claim 6 wherein the promoter is a seed specific promoter.
8. The DNA construct of claim 7 wherein the seed specific promoter is selected from the group consisting of napin promoter, phaseolin promoter, oleosin promoter, 2S albumin promoter, zein promoter, .beta.-conglycinin promoter, acyl-carrier protein promoter, and fatty acid desaturase promoter.
9. The DNA construct of claim 6 wherein the promoter is a constitutive promoter.
10. The DNA construct of claim 6 wherein the promoter is selected from the group consisting of CaMV 35S promoter, enhanced CaMV 35S
promoter, and ubiquitin promoter.
promoter, and ubiquitin promoter.
11. A method for enhancing the biological production of polyhydroxyalkanoates in a transgenic plant, comprising expressing genes encoding heterologous fatty acid .beta.-oxidation enzymes in cytosol, plastids other than the peroxisomes or glyoxisomes, or mitochondria of the plant.
12. The method of claim 11 wherein the transgenic plant is selected from the group consisting of Brassica, maize, soybean, cottonseed, sunflower, palm, coconut, safflower, peanut, mustards, flax, tobacco, and alfalfa.
13. A transgenic plant or part thereof comprising heterologous genes encoding fatty acid .beta.-oxidation enzymes in cytosol, plastids other than the peroxisomes or glyoxisomes, or mitochondria of the plant.
14. The plant or part thereof of claim 13 wherein the fatty acid .beta.-oxidation enzymes are expressed from genes selected from the group consisting of bacterial, yeast, fungal, plant, and mammalian.
15. The plant or part thereof of claim 14 wherein the fatty acid oxidation enzymes are expressed from genes from bacteria selected from the group consisting of Escherichia, Pseudomonas, Alcaligenes, and Coryneform.
16. The plant or part thereof of claim 15 wherein the genes are Pseudomonas putida faoAB.
17. The plant or part thereof of claim 13 further comprising genes encoding enzymes selected from the group consisting of polyhydroxyalkanoate synthases, acetoacetyl-CoA reductases, .beta.-ketoacyl-CoA thiolases, and enoyl-CoA hydratases.
18. The plant or part thereof of claim 13 wherein the plant is selected from the group consisting of Brassica, maize, soybean, cottonseed, sunflower, palm, coconut, safflower, peanut, mustards, flax, tobacco, and alfalfa.
19. The plant or part thereof of claim 13 comprising a DNA
construct comprising, in phase, (a) a promoter region functional in a plant;
(b) a structural DNA sequence encoding at least one fatty acid .beta.-oxidation enzyme activity; and (c) a 3' nontranslated region of a gene naturally expressed in a plant, wherein the nontranslated region encodes a signal sequence for polyadenylation of mRNA.
construct comprising, in phase, (a) a promoter region functional in a plant;
(b) a structural DNA sequence encoding at least one fatty acid .beta.-oxidation enzyme activity; and (c) a 3' nontranslated region of a gene naturally expressed in a plant, wherein the nontranslated region encodes a signal sequence for polyadenylation of mRNA.
20. The plant or part thereof of claim 19 wherein the promoter is a seed specific promoter.
21. The plant or part thereof of claim 20 wherein the seed specific promoter is selected from the group consisting of napin promoter, phaseolin promoter, oleosin promoter, 2S albumin promoter, zero promoter, .beta.-conglycinin promoter, acyl-carrier protein promoter, and fatty acid desaturase promoter.
22. The plant or part thereof of claim 19 wherein the promoter is a constitutive promoter.
23. The plant or part thereof of claim 20 wherein the promoter is selected from the group consisting of CaMV 35S promoter, enhanced CaMV
35S promoter, and ubiquitin promoter.
35S promoter, and ubiquitin promoter.
24. A method of preventing or suppressing seed production in a plant, comprising expressing heterologous genes encoding fatty acid .beta.-oxidation enzymes in cytosol or plastids other than the peroxisomes, glyoxisomes or mitochondria of the plant.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US7710798P | 1998-03-06 | 1998-03-06 | |
US60/077,107 | 1998-03-06 | ||
PCT/US1999/004999 WO1999045122A1 (en) | 1998-03-06 | 1999-03-05 | Modification of fatty acid metabolism in plants |
Publications (2)
Publication Number | Publication Date |
---|---|
CA2322729A1 CA2322729A1 (en) | 1999-09-10 |
CA2322729C true CA2322729C (en) | 2010-06-15 |
Family
ID=22136102
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA2322729A Expired - Fee Related CA2322729C (en) | 1998-03-06 | 1999-03-05 | Modification of fatty acid metabolism in plants |
Country Status (6)
Country | Link |
---|---|
EP (1) | EP1060250A1 (en) |
JP (2) | JP2002505107A (en) |
AU (1) | AU750057B2 (en) |
CA (1) | CA2322729C (en) |
MX (1) | MXPA00008674A (en) |
WO (1) | WO1999045122A1 (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003501065A (en) * | 1999-06-08 | 2003-01-14 | カルジーン エルエルシー | Nucleic acid sequences encoding proteins involved in β-oxidation of fatty acids and their use |
FI111087B (en) * | 1999-08-03 | 2003-05-30 | Tuomo Glumoff | Process for Control of Polyhydroxyalkanoate Synthesis as Precursor Molecules Functioning (3R) |
AU7741200A (en) * | 1999-09-29 | 2001-04-30 | E.I. Du Pont De Nemours And Company | Production of polyhydroxyalkanoate in plants |
US7176349B1 (en) | 1999-09-29 | 2007-02-13 | Pioneer Hi-Bred International, Inc. | Production of polyhydroxyalkanoate in plants |
US6914170B2 (en) | 2000-07-06 | 2005-07-05 | Pioneer Hi-Bred International, Inc. | Methods for regulating beta-oxidation in plants |
US6806401B2 (en) | 2000-12-27 | 2004-10-19 | Pioneer Hi-Bred International, Inc. | OAR polynucleotides, polypeptides and their use in PHA production in plants |
JP2002262886A (en) * | 2001-03-14 | 2002-09-17 | Inst Of Physical & Chemical Res | Plant synthesizing co-polyester consisting of short chain fatty acid-derived monomer and method for producing polyester |
US7098298B2 (en) * | 2003-11-21 | 2006-08-29 | Kaneka Corporation | Method for producing polyhydroxyalkanoate crystal |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5610041A (en) * | 1991-07-19 | 1997-03-11 | Board Of Trustees Operating Michigan State University | Processes for producing polyhydroxybutyrate and related polyhydroxyalkanoates in the plastids of higher plants |
GB9306490D0 (en) * | 1993-03-29 | 1993-05-19 | Zeneca Ltd | Plant gene specifying acetyl,coenzyme a carboxylase and transformed plants containing same |
DE19531678C2 (en) * | 1995-08-29 | 1997-10-02 | Ahmed Dr Sheriff | Process for preventing or preventing the formation of breathing plant tissue, in particular for producing seedless plants by means of additional cystosolic pyruvate, phosphate dikinase |
CN1196655A (en) * | 1995-09-21 | 1998-10-21 | 华盛顿州立大学研究基金会 | Conditionally male-fertile plants and methods and compositions for restoring fertility thereof |
US5958745A (en) * | 1996-03-13 | 1999-09-28 | Monsanto Company | Methods of optimizing substrate pools and biosynthesis of poly-β-hydroxybutyrate-co-poly-β-hydroxyvalerate in bacteria and plants |
AU739905B2 (en) * | 1996-08-08 | 2001-10-25 | Max-Planck-Gesellschaft Zur Forderung Der Wissenschaften E.V. | Transgenic plant cells and plants with modified acetyl coa production |
US5750848A (en) * | 1996-08-13 | 1998-05-12 | Monsanto Company | DNA sequence useful for the production of polyhydroxyalkanoates |
EP0894864A1 (en) * | 1997-07-29 | 1999-02-03 | Axel Dr. Brennicke | Genes encoding enzymes of the ACDH-family in plants and methods for producing transgenic seeds or plants with enhanced content or altered compsition of fatty acids and/or amino acids |
-
1999
- 1999-03-05 AU AU30714/99A patent/AU750057B2/en not_active Ceased
- 1999-03-05 MX MXPA00008674A patent/MXPA00008674A/en not_active IP Right Cessation
- 1999-03-05 WO PCT/US1999/004999 patent/WO1999045122A1/en active IP Right Grant
- 1999-03-05 JP JP2000534653A patent/JP2002505107A/en not_active Withdrawn
- 1999-03-05 EP EP99912316A patent/EP1060250A1/en not_active Withdrawn
- 1999-03-05 CA CA2322729A patent/CA2322729C/en not_active Expired - Fee Related
-
2009
- 2009-08-04 JP JP2009181255A patent/JP2009291204A/en not_active Withdrawn
Also Published As
Publication number | Publication date |
---|---|
WO1999045122A1 (en) | 1999-09-10 |
MXPA00008674A (en) | 2003-07-14 |
AU750057B2 (en) | 2002-07-11 |
JP2009291204A (en) | 2009-12-17 |
CA2322729A1 (en) | 1999-09-10 |
EP1060250A1 (en) | 2000-12-20 |
AU3071499A (en) | 1999-09-20 |
JP2002505107A (en) | 2002-02-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4121541B2 (en) | Production of intermediate chain length polyhydroxyalkanoates from fatty acid biosynthetic pathways | |
JP4755109B2 (en) | Transgenic plant with reduced level of saturated fatty acid and method for producing the same | |
AU763296B2 (en) | Limanthes oil genes | |
EP2478105A1 (en) | Generation of high polyhydroxybutrate producing oilseeds | |
JP2009291204A (en) | Modification of fatty acid metabolism in plant | |
AU770120B2 (en) | Plant multi-gene expression constructs | |
US20090138992A1 (en) | Limnanthes oil genes | |
WO1999035278A1 (en) | Biosynthesis of medium chain length polyhydroxyalkanoates | |
US6586658B1 (en) | Modification of fatty acid metabolism in plants | |
MXPA01001049A (en) | Enzymes for biopolymer production. | |
AU2010295638B2 (en) | Generation of high polyhydroxybutrate producing oilseeds |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
MKLA | Lapsed |
Effective date: 20130305 |