CN1732262A - 调节产纤维植物中纤维素生物合成的方法和手段 - Google Patents
调节产纤维植物中纤维素生物合成的方法和手段 Download PDFInfo
- Publication number
- CN1732262A CN1732262A CNA2003801057971A CN200380105797A CN1732262A CN 1732262 A CN1732262 A CN 1732262A CN A2003801057971 A CNA2003801057971 A CN A2003801057971A CN 200380105797 A CN200380105797 A CN 200380105797A CN 1732262 A CN1732262 A CN 1732262A
- Authority
- CN
- China
- Prior art keywords
- seq
- plant
- sequence
- nucleotide sequence
- nucleotide
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 66
- 239000000835 fiber Substances 0.000 title claims abstract description 34
- 230000008166 cellulose biosynthesis Effects 0.000 title abstract 4
- 241000196324 Embryophyta Species 0.000 claims abstract description 203
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 185
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 45
- 239000002773 nucleotide Substances 0.000 claims description 168
- 125000003729 nucleotide group Chemical group 0.000 claims description 168
- 229920002678 cellulose Polymers 0.000 claims description 85
- 239000001913 cellulose Substances 0.000 claims description 84
- 229920000742 Cotton Polymers 0.000 claims description 62
- 108020004414 DNA Proteins 0.000 claims description 52
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 45
- 235000018102 proteins Nutrition 0.000 claims description 44
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 43
- 239000004753 textile Substances 0.000 claims description 41
- 230000000295 complement effect Effects 0.000 claims description 29
- 230000008488 polyadenylation Effects 0.000 claims description 25
- 230000000694 effects Effects 0.000 claims description 21
- 230000005030 transcription termination Effects 0.000 claims description 21
- 230000009467 reduction Effects 0.000 claims description 17
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 15
- 230000014509 gene expression Effects 0.000 claims description 14
- 230000035772 mutation Effects 0.000 claims description 14
- 235000013311 vegetables Nutrition 0.000 claims description 12
- 230000008569 process Effects 0.000 claims description 10
- 102000040650 (ribonucleotides)n+m Human genes 0.000 claims description 9
- 238000007380 fibre production Methods 0.000 claims description 7
- 229930182555 Penicillin Natural products 0.000 claims description 5
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 claims description 5
- 238000013467 fragmentation Methods 0.000 claims description 5
- 238000006062 fragmentation reaction Methods 0.000 claims description 5
- 229940049954 penicillin Drugs 0.000 claims description 5
- 238000010189 synthetic method Methods 0.000 claims description 3
- 230000002194 synthesizing effect Effects 0.000 claims 1
- 230000002708 enhancing effect Effects 0.000 abstract description 2
- 240000002024 Gossypium herbaceum Species 0.000 abstract 1
- 235000004341 Gossypium herbaceum Nutrition 0.000 abstract 1
- 210000004027 cell Anatomy 0.000 description 105
- 235000010980 cellulose Nutrition 0.000 description 69
- 244000299507 Gossypium hirsutum Species 0.000 description 55
- 241000219198 Brassica Species 0.000 description 36
- 235000003351 Brassica cretica Nutrition 0.000 description 35
- 235000003343 Brassica rupestris Nutrition 0.000 description 35
- 240000001307 Myosotis scorpioides Species 0.000 description 35
- QKSKPIVNLNLAAV-UHFFFAOYSA-N bis(2-chloroethyl) sulfide Chemical compound ClCCSCCCl QKSKPIVNLNLAAV-UHFFFAOYSA-N 0.000 description 35
- 235000010460 mustard Nutrition 0.000 description 35
- 230000008859 change Effects 0.000 description 30
- 108091034117 Oligonucleotide Proteins 0.000 description 29
- 230000012010 growth Effects 0.000 description 28
- 102000004190 Enzymes Human genes 0.000 description 27
- 108090000790 Enzymes Proteins 0.000 description 27
- 229940088598 enzyme Drugs 0.000 description 27
- 101100031399 Arabidopsis thaliana PSL5 gene Proteins 0.000 description 25
- 230000000692 anti-sense effect Effects 0.000 description 22
- 239000002299 complementary DNA Substances 0.000 description 20
- 230000002950 deficient Effects 0.000 description 15
- 240000000047 Gossypium barbadense Species 0.000 description 14
- 235000009429 Gossypium barbadense Nutrition 0.000 description 14
- 230000006870 function Effects 0.000 description 13
- 238000009396 hybridization Methods 0.000 description 13
- 102000039446 nucleic acids Human genes 0.000 description 13
- 108020004707 nucleic acids Proteins 0.000 description 13
- 150000007523 nucleic acids Chemical class 0.000 description 13
- 235000001014 amino acid Nutrition 0.000 description 11
- 150000001413 amino acids Chemical class 0.000 description 11
- 230000003197 catalytic effect Effects 0.000 description 11
- XUWPJKDMEZSVTP-LTYMHZPRSA-N kalafungina Chemical compound O=C1C2=C(O)C=CC=C2C(=O)C2=C1[C@@H](C)O[C@H]1[C@@H]2OC(=O)C1 XUWPJKDMEZSVTP-LTYMHZPRSA-N 0.000 description 11
- 101100449952 Arabidopsis thaliana KOR gene Proteins 0.000 description 10
- 150000004676 glycans Chemical class 0.000 description 10
- 101100136092 Drosophila melanogaster peng gene Proteins 0.000 description 9
- 102000003886 Glycoproteins Human genes 0.000 description 9
- 108090000288 Glycoproteins Proteins 0.000 description 9
- 230000004988 N-glycosylation Effects 0.000 description 9
- 230000002441 reversible effect Effects 0.000 description 9
- 235000018322 upland cotton Nutrition 0.000 description 9
- 102100031819 Kappa-type opioid receptor Human genes 0.000 description 8
- 210000002421 cell wall Anatomy 0.000 description 8
- 230000008961 swelling Effects 0.000 description 8
- 241000699666 Mus <mouse, genus> Species 0.000 description 7
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 7
- 108020001588 κ-opioid receptors Proteins 0.000 description 7
- 241000219195 Arabidopsis thaliana Species 0.000 description 6
- 229920000018 Callose Polymers 0.000 description 6
- 108091026890 Coding region Proteins 0.000 description 6
- 102100024295 Maltase-glucoamylase Human genes 0.000 description 6
- 240000007594 Oryza sativa Species 0.000 description 6
- 235000007164 Oryza sativa Nutrition 0.000 description 6
- 244000061456 Solanum tuberosum Species 0.000 description 6
- 235000002595 Solanum tuberosum Nutrition 0.000 description 6
- 108010028144 alpha-Glucosidases Proteins 0.000 description 6
- 238000013459 approach Methods 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 239000012634 fragment Substances 0.000 description 6
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 6
- 229920001282 polysaccharide Polymers 0.000 description 6
- 239000005017 polysaccharide Substances 0.000 description 6
- 238000012545 processing Methods 0.000 description 6
- 238000003908 quality control method Methods 0.000 description 6
- 238000003757 reverse transcription PCR Methods 0.000 description 6
- 238000012360 testing method Methods 0.000 description 6
- 210000001519 tissue Anatomy 0.000 description 6
- 229920002472 Starch Polymers 0.000 description 5
- 230000003321 amplification Effects 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 239000011575 calcium Substances 0.000 description 5
- 108010040093 cellulose synthase Proteins 0.000 description 5
- 238000013461 design Methods 0.000 description 5
- 230000004069 differentiation Effects 0.000 description 5
- 108010015792 glycyllysine Proteins 0.000 description 5
- 210000003097 mucus Anatomy 0.000 description 5
- 238000003199 nucleic acid amplification method Methods 0.000 description 5
- 238000011160 research Methods 0.000 description 5
- 235000009566 rice Nutrition 0.000 description 5
- 235000019698 starch Nutrition 0.000 description 5
- 239000008107 starch Substances 0.000 description 5
- 239000000758 substrate Substances 0.000 description 5
- 108010080629 tryptophan-leucine Proteins 0.000 description 5
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 5
- 101100227400 Arabidopsis thaliana FLS4 gene Proteins 0.000 description 4
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 4
- 102000053642 Catalytic RNA Human genes 0.000 description 4
- 108090000994 Catalytic RNA Proteins 0.000 description 4
- 229920002307 Dextran Polymers 0.000 description 4
- 101150062179 II gene Proteins 0.000 description 4
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 4
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 4
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 108010047857 aspartylglycine Proteins 0.000 description 4
- 238000005336 cracking Methods 0.000 description 4
- 230000007547 defect Effects 0.000 description 4
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 4
- 230000003203 everyday effect Effects 0.000 description 4
- -1 mannose oligosaccharides Chemical class 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 230000000394 mitotic effect Effects 0.000 description 4
- 108091092562 ribozyme Proteins 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- 230000009261 transgenic effect Effects 0.000 description 4
- 108010044292 tryptophyltyrosine Proteins 0.000 description 4
- 229920001817 Agar Polymers 0.000 description 3
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 3
- 101100031398 Arabidopsis thaliana PSL4 gene Proteins 0.000 description 3
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 3
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 3
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 3
- 108700023372 Glycosyltransferases Proteins 0.000 description 3
- 102000051366 Glycosyltransferases Human genes 0.000 description 3
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 3
- 108010079364 N-glycylalanine Proteins 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 3
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 3
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 3
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 3
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 3
- 239000008272 agar Substances 0.000 description 3
- 108010087924 alanylproline Proteins 0.000 description 3
- 108010060035 arginylproline Proteins 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000001851 biosynthetic effect Effects 0.000 description 3
- 238000009395 breeding Methods 0.000 description 3
- 210000000170 cell membrane Anatomy 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 239000002131 composite material Substances 0.000 description 3
- 230000007812 deficiency Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000004720 fertilization Effects 0.000 description 3
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 3
- 229930182470 glycoside Natural products 0.000 description 3
- 150000002338 glycosides Chemical class 0.000 description 3
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 3
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- 238000005213 imbibition Methods 0.000 description 3
- 231100000225 lethality Toxicity 0.000 description 3
- 150000002632 lipids Chemical class 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- FIGCMWLCRVTBTQ-SMFCIRJCSA-N nona-N-acetyl-beta-D-glucosamine Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO[C@@H]2O[C@H](CO[C@@H]3O[C@H](CO[C@@H]4O[C@H](CO[C@@H]5O[C@H](CO[C@@H]6O[C@H](CO[C@@H]7O[C@H](CO[C@@H]8O[C@H](CO[C@@H]9O[C@H](CO)[C@@H](O)[C@H](O)[C@H]9NC(C)=O)[C@@H](O)[C@H](O)[C@H]8NC(C)=O)[C@@H](O)[C@H](O)[C@H]7NC(C)=O)[C@@H](O)[C@H](O)[C@H]6NC(C)=O)[C@@H](O)[C@H](O)[C@H]5NC(C)=O)[C@@H](O)[C@H](O)[C@H]4NC(C)=O)[C@@H](O)[C@H](O)[C@H]3NC(C)=O)[C@@H](O)[C@H](O)[C@H]2NC(C)=O)[C@@H](O)[C@@H]1O FIGCMWLCRVTBTQ-SMFCIRJCSA-N 0.000 description 3
- 229920001542 oligosaccharide Polymers 0.000 description 3
- 210000000056 organ Anatomy 0.000 description 3
- 230000037361 pathway Effects 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 229920001184 polypeptide Polymers 0.000 description 3
- 239000011148 porous material Substances 0.000 description 3
- 108090000765 processed proteins & peptides Proteins 0.000 description 3
- 102000004196 processed proteins & peptides Human genes 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 230000008929 regeneration Effects 0.000 description 3
- 238000011069 regeneration method Methods 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 239000000523 sample Substances 0.000 description 3
- 230000028327 secretion Effects 0.000 description 3
- 230000005082 stem growth Effects 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 108010073969 valyllysine Proteins 0.000 description 3
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 2
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- DRARURMRLANNLS-GUBZILKMSA-N Ala-Met-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O DRARURMRLANNLS-GUBZILKMSA-N 0.000 description 2
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 2
- 101100166839 Arabidopsis thaliana CESA1 gene Proteins 0.000 description 2
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 2
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 2
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 2
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 2
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 2
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 2
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 2
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 2
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 2
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 2
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 2
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 2
- RYEWQKQXRJCHIO-SRVKXCTJSA-N Asp-Asn-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RYEWQKQXRJCHIO-SRVKXCTJSA-N 0.000 description 2
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 2
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 2
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 2
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 2
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- 241000193830 Bacillus <bacterium> Species 0.000 description 2
- 229920002498 Beta-glucan Polymers 0.000 description 2
- 241000701489 Cauliflower mosaic virus Species 0.000 description 2
- 108091092236 Chimeric RNA Proteins 0.000 description 2
- 241000020428 Colea Species 0.000 description 2
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 2
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 2
- 244000004281 Eucalyptus maculata Species 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 2
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 2
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 2
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 2
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 2
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 2
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 2
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 2
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 2
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 2
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 2
- 240000001814 Gossypium arboreum Species 0.000 description 2
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 2
- CSTNMMIHMYJGFR-IHRRRGAJSA-N His-His-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 CSTNMMIHMYJGFR-IHRRRGAJSA-N 0.000 description 2
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 2
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- 101150100969 KOR gene Proteins 0.000 description 2
- 241000235058 Komagataella pastoris Species 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 2
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 2
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 2
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 2
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 2
- KZJQUYFDSCFSCO-DLOVCJGASA-N Lys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N KZJQUYFDSCFSCO-DLOVCJGASA-N 0.000 description 2
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 2
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 2
- 108010031099 Mannose Receptor Proteins 0.000 description 2
- 108010038016 Mannose-1-phosphate guanylyltransferase Proteins 0.000 description 2
- CULGJGUDIJATIP-STQMWFEESA-N Met-Tyr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 CULGJGUDIJATIP-STQMWFEESA-N 0.000 description 2
- 102000005431 Molecular Chaperones Human genes 0.000 description 2
- 108010006519 Molecular Chaperones Proteins 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 240000002853 Nelumbo nucifera Species 0.000 description 2
- 235000006508 Nelumbo nucifera Nutrition 0.000 description 2
- 235000006510 Nelumbo pentapetala Nutrition 0.000 description 2
- 108010065395 Neuropep-1 Proteins 0.000 description 2
- 101100168995 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cyt-1 gene Proteins 0.000 description 2
- OKQQWSNUSQURLI-JYJNAYRXSA-N Phe-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N OKQQWSNUSQURLI-JYJNAYRXSA-N 0.000 description 2
- 241000364051 Pima Species 0.000 description 2
- 241000219000 Populus Species 0.000 description 2
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 description 2
- QBFONMUYNSNKIX-AVGNSLFASA-N Pro-Arg-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QBFONMUYNSNKIX-AVGNSLFASA-N 0.000 description 2
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 2
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 2
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 2
- FFSLAIOXRMOFIZ-GJZGRUSLSA-N Pro-Gly-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)CNC(=O)[C@@H]1CCCN1 FFSLAIOXRMOFIZ-GJZGRUSLSA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- CDGABSWLRMECHC-IHRRRGAJSA-N Pro-Lys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CDGABSWLRMECHC-IHRRRGAJSA-N 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 2
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 2
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- 108700005078 Synthetic Genes Proteins 0.000 description 2
- 108020005038 Terminator Codon Proteins 0.000 description 2
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 2
- 241000219870 Trifolium subterraneum Species 0.000 description 2
- IQGJAHMZWBTRIF-UBHSHLNASA-N Trp-Asp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IQGJAHMZWBTRIF-UBHSHLNASA-N 0.000 description 2
- LFGHEUIUSIRJAE-TUSQITKMSA-N Trp-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N LFGHEUIUSIRJAE-TUSQITKMSA-N 0.000 description 2
- STKZKWFOKOCSLW-UMPQAUOISA-N Trp-Thr-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 STKZKWFOKOCSLW-UMPQAUOISA-N 0.000 description 2
- FQNUWOHNGJWNLM-QWRGUYRKSA-N Tyr-Cys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FQNUWOHNGJWNLM-QWRGUYRKSA-N 0.000 description 2
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 2
- VPEFOFYNHBWFNQ-UFYCRDLUSA-N Tyr-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 VPEFOFYNHBWFNQ-UFYCRDLUSA-N 0.000 description 2
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 2
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 2
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 2
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 2
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 241000618809 Vitales Species 0.000 description 2
- 230000002378 acidificating effect Effects 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 230000001488 breeding effect Effects 0.000 description 2
- 150000001720 carbohydrates Chemical class 0.000 description 2
- 235000014633 carbohydrates Nutrition 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 230000002498 deadly effect Effects 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 150000002031 dolichols Chemical class 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 230000030279 gene silencing Effects 0.000 description 2
- 238000012226 gene silencing method Methods 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 2
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 101150023613 mev-1 gene Proteins 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 238000005554 pickling Methods 0.000 description 2
- 230000008635 plant growth Effects 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- 239000011591 potassium Substances 0.000 description 2
- 229910052700 potassium Inorganic materials 0.000 description 2
- 239000000441 potassium aluminium silicate Substances 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 230000008521 reorganization Effects 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 230000003248 secreting effect Effects 0.000 description 2
- 239000002689 soil Substances 0.000 description 2
- 239000010421 standard material Substances 0.000 description 2
- 239000004575 stone Substances 0.000 description 2
- OHKOGUYZJXTSFX-KZFFXBSXSA-N ticarcillin Chemical compound C=1([C@@H](C(O)=O)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)C=CSC=1 OHKOGUYZJXTSFX-KZFFXBSXSA-N 0.000 description 2
- 229960004075 ticarcillin disodium Drugs 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000005026 transcription initiation Effects 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- CNKBMTKICGGSCQ-ACRUOGEOSA-N (2S)-2-[[(2S)-2-[[(2S)-2,6-diamino-1-oxohexyl]amino]-1-oxo-3-phenylpropyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CNKBMTKICGGSCQ-ACRUOGEOSA-N 0.000 description 1
- INOZZBHURUDQQR-AJNGGQMLSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(1h-imidazol-5-yl)propanoyl]amino]-3-carboxypropanoyl]amino]-4-carboxybutanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 INOZZBHURUDQQR-AJNGGQMLSA-N 0.000 description 1
- RRBGTUQJDFBWNN-MUGJNUQGSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-2,6-diaminohexanoyl]amino]hexanoyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O RRBGTUQJDFBWNN-MUGJNUQGSA-N 0.000 description 1
- FYGDTMLNYKFZSV-URKRLVJHSA-N (2s,3r,4s,5s,6r)-2-[(2r,4r,5r,6s)-4,5-dihydroxy-2-(hydroxymethyl)-6-[(2r,4r,5r,6s)-4,5,6-trihydroxy-2-(hydroxymethyl)oxan-3-yl]oxyoxan-3-yl]oxy-6-(hydroxymethyl)oxane-3,4,5-triol Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1OC1[C@@H](CO)O[C@@H](OC2[C@H](O[C@H](O)[C@H](O)[C@H]2O)CO)[C@H](O)[C@H]1O FYGDTMLNYKFZSV-URKRLVJHSA-N 0.000 description 1
- NKDFYOWSKOHCCO-YPVLXUMRSA-N 20-hydroxyecdysone Chemical compound C1[C@@H](O)[C@@H](O)C[C@]2(C)[C@@H](CC[C@@]3([C@@H]([C@@](C)(O)[C@H](O)CCC(C)(O)C)CC[C@]33O)C)C3=CC(=O)[C@@H]21 NKDFYOWSKOHCCO-YPVLXUMRSA-N 0.000 description 1
- 101710134784 Agnoprotein Proteins 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- NFDVJAKFMXHJEQ-HERUPUMHSA-N Ala-Asp-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NFDVJAKFMXHJEQ-HERUPUMHSA-N 0.000 description 1
- OILNWMNBLIHXQK-ZLUOBGJFSA-N Ala-Cys-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O OILNWMNBLIHXQK-ZLUOBGJFSA-N 0.000 description 1
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- 108010076441 Ala-His-His Proteins 0.000 description 1
- ATAKEVCGTRZKLI-UWJYBYFXSA-N Ala-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 ATAKEVCGTRZKLI-UWJYBYFXSA-N 0.000 description 1
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 1
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 1
- QJABSQFUHKHTNP-SYWGBEHUSA-N Ala-Ile-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QJABSQFUHKHTNP-SYWGBEHUSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- OQWQTGBOFPJOIF-DLOVCJGASA-N Ala-Lys-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N OQWQTGBOFPJOIF-DLOVCJGASA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- JNLDTVRGXMSYJC-UVBJJODRSA-N Ala-Pro-Trp Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JNLDTVRGXMSYJC-UVBJJODRSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- YXXPVUOMPSZURS-ZLIFDBKOSA-N Ala-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 YXXPVUOMPSZURS-ZLIFDBKOSA-N 0.000 description 1
- CWRBRVZBMVJENN-UVBJJODRSA-N Ala-Trp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N CWRBRVZBMVJENN-UVBJJODRSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- ANNKVZSFQJGVDY-XUXIUFHCSA-N Ala-Val-Pro-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ANNKVZSFQJGVDY-XUXIUFHCSA-N 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- 101100223804 Arabidopsis thaliana At5g56368 gene Proteins 0.000 description 1
- 101100223805 Arabidopsis thaliana At5g56369 gene Proteins 0.000 description 1
- 101000908020 Arabidopsis thaliana Cellulose synthase A catalytic subunit 1 [UDP-forming] Proteins 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- PVSNBTCXCQIXSE-JYJNAYRXSA-N Arg-Arg-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PVSNBTCXCQIXSE-JYJNAYRXSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- 108010010777 Arg-Gly-Asp-Gly Proteins 0.000 description 1
- QKSAZKCRVQYYGS-UWVGGRQHSA-N Arg-Gly-His Chemical compound N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QKSAZKCRVQYYGS-UWVGGRQHSA-N 0.000 description 1
- GFMWTFHOZGLTLC-AVGNSLFASA-N Arg-His-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O GFMWTFHOZGLTLC-AVGNSLFASA-N 0.000 description 1
- CRCCTGPNZUCAHE-DCAQKATOSA-N Arg-His-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 CRCCTGPNZUCAHE-DCAQKATOSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- VVJTWSRNMJNDPN-IUCAKERBSA-N Arg-Met-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O VVJTWSRNMJNDPN-IUCAKERBSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 1
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 1
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 1
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 1
- AKEBUSZTMQLNIX-UWJYBYFXSA-N Asn-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N AKEBUSZTMQLNIX-UWJYBYFXSA-N 0.000 description 1
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- BGINHSZTXRJIPP-FXQIFTODSA-N Asn-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BGINHSZTXRJIPP-FXQIFTODSA-N 0.000 description 1
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 1
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 description 1
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 1
- WQLJRNRLHWJIRW-KKUMJFAQSA-N Asn-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)O WQLJRNRLHWJIRW-KKUMJFAQSA-N 0.000 description 1
- UYXXMIZGHYKYAT-NHCYSSNCSA-N Asn-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N UYXXMIZGHYKYAT-NHCYSSNCSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 1
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 1
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 1
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 1
- DBWYWXNMZZYIRY-LPEHRKFASA-N Asp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O DBWYWXNMZZYIRY-LPEHRKFASA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- WJHYGGVCWREQMO-GHCJXIJMSA-N Asp-Cys-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WJHYGGVCWREQMO-GHCJXIJMSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- NRIFEOUAFLTMFJ-AAEUAGOBSA-N Asp-Gly-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NRIFEOUAFLTMFJ-AAEUAGOBSA-N 0.000 description 1
- CRNKLABLTICXDV-GUBZILKMSA-N Asp-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N CRNKLABLTICXDV-GUBZILKMSA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- SWTQDYFZVOJVLL-KKUMJFAQSA-N Asp-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)O SWTQDYFZVOJVLL-KKUMJFAQSA-N 0.000 description 1
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- AKKUDRZKFZWPBH-SRVKXCTJSA-N Asp-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N AKKUDRZKFZWPBH-SRVKXCTJSA-N 0.000 description 1
- FQHBAQLBIXLWAG-DCAQKATOSA-N Asp-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N FQHBAQLBIXLWAG-DCAQKATOSA-N 0.000 description 1
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 1
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 1
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- GZYDPEJSZYZWEF-MXAVVETBSA-N Asp-Val-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O GZYDPEJSZYZWEF-MXAVVETBSA-N 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 235000011331 Brassica Nutrition 0.000 description 1
- 240000002791 Brassica napus Species 0.000 description 1
- 235000011293 Brassica napus Nutrition 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 101100022254 Candida albicans MAL2 gene Proteins 0.000 description 1
- 244000025254 Cannabis sativa Species 0.000 description 1
- 235000012766 Cannabis sativa ssp. sativa var. sativa Nutrition 0.000 description 1
- 235000012765 Cannabis sativa ssp. sativa var. spontanea Nutrition 0.000 description 1
- 108010059892 Cellulase Proteins 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 235000011777 Corchorus aestuans Nutrition 0.000 description 1
- 240000004792 Corchorus capsularis Species 0.000 description 1
- 235000010862 Corchorus capsularis Nutrition 0.000 description 1
- 244000050510 Cunninghamia lanceolata Species 0.000 description 1
- BNRHLRWCERLRTQ-BPUTZDHNSA-N Cys-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N BNRHLRWCERLRTQ-BPUTZDHNSA-N 0.000 description 1
- FEJCUYOGOBCFOQ-ACZMJKKPSA-N Cys-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N FEJCUYOGOBCFOQ-ACZMJKKPSA-N 0.000 description 1
- WAJDEKCJRKGRPG-CIUDSAMLSA-N Cys-His-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N WAJDEKCJRKGRPG-CIUDSAMLSA-N 0.000 description 1
- CUXIOFHFFXNUGG-HTFCKZLJSA-N Cys-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CS)N CUXIOFHFFXNUGG-HTFCKZLJSA-N 0.000 description 1
- RJPKQCFHEPPTGL-ZLUOBGJFSA-N Cys-Ser-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RJPKQCFHEPPTGL-ZLUOBGJFSA-N 0.000 description 1
- QAFSMQPTMRDQCK-BPUTZDHNSA-N Cys-Trp-Met Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCSC)C(O)=O)NC(=O)[C@@H](N)CS)=CNC2=C1 QAFSMQPTMRDQCK-BPUTZDHNSA-N 0.000 description 1
- NBSCHQHZLSJFNQ-QTVWNMPRSA-N D-Mannose-6-phosphate Chemical compound OC1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H](O)[C@@H]1O NBSCHQHZLSJFNQ-QTVWNMPRSA-N 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- 102100039371 ER lumen protein-retaining receptor 1 Human genes 0.000 description 1
- 244000166124 Eucalyptus globulus Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 101000893906 Fowl adenovirus A serotype 1 (strain CELO / Phelps) Protein GAM-1 Proteins 0.000 description 1
- WQOBEIJLPIXUTJ-RTIPJVFBSA-N Glc(a1-3)Glc(a1-3)Man(a1-2)Man(a1-2)Man(a1-3)[Man(a1-2)Man(a1-3)[Man(a1-2)Man(a1-6)]Man(a1-6)]Man(b1-4)GlcNAc(b1-4)GlcNAc Chemical compound O[C@@H]1[C@@H](NC(=O)C)C(O)O[C@H](CO)[C@H]1O[C@H]1[C@H](NC(C)=O)[C@@H](O)[C@H](O[C@H]2[C@H]([C@@H](O[C@@H]3[C@H]([C@@H](O)[C@H](O)[C@@H](CO)O3)O[C@@H]3[C@H]([C@@H](O)[C@H](O)[C@@H](CO)O3)O[C@@H]3[C@H]([C@@H](O[C@@H]4[C@@H]([C@@H](O[C@@H]5[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O5)O)[C@H](O)[C@@H](CO)O4)O)[C@H](O)[C@@H](CO)O3)O)[C@H](O)[C@@H](CO[C@@H]3[C@H]([C@@H](O[C@@H]4[C@H]([C@@H](O)[C@H](O)[C@@H](CO)O4)O[C@@H]4[C@H]([C@@H](O)[C@H](O)[C@@H](CO)O4)O)[C@H](O)[C@@H](CO[C@@H]4[C@H]([C@@H](O)[C@H](O)[C@@H](CO)O4)O[C@@H]4[C@H]([C@@H](O)[C@H](O)[C@@H](CO)O4)O)O3)O)O2)O)[C@@H](CO)O1 WQOBEIJLPIXUTJ-RTIPJVFBSA-N 0.000 description 1
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- UVAOVENCIONMJP-GUBZILKMSA-N Gln-Cys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O UVAOVENCIONMJP-GUBZILKMSA-N 0.000 description 1
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- BJPPYOMRAVLXBY-YUMQZZPRSA-N Gln-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N BJPPYOMRAVLXBY-YUMQZZPRSA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- VXQOONWNIWFOCS-HGNGGELXSA-N Glu-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N VXQOONWNIWFOCS-HGNGGELXSA-N 0.000 description 1
- BRKUZSLQMPNVFN-SRVKXCTJSA-N Glu-His-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BRKUZSLQMPNVFN-SRVKXCTJSA-N 0.000 description 1
- ZJFNRQHUIHKZJF-GUBZILKMSA-N Glu-His-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ZJFNRQHUIHKZJF-GUBZILKMSA-N 0.000 description 1
- WDTAKCUOIKHCTB-NKIYYHGXSA-N Glu-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N)O WDTAKCUOIKHCTB-NKIYYHGXSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 1
- XNOWYPDMSLSRKP-GUBZILKMSA-N Glu-Met-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O XNOWYPDMSLSRKP-GUBZILKMSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- JPUNZXVHHRZMNL-XIRDDKMYSA-N Glu-Pro-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JPUNZXVHHRZMNL-XIRDDKMYSA-N 0.000 description 1
- ARIORLIIMJACKZ-KKUMJFAQSA-N Glu-Pro-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ARIORLIIMJACKZ-KKUMJFAQSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- 241000589236 Gluconobacter Species 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 1
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 1
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- UWQDKRIZSROAKS-FJXKBIBVSA-N Gly-Met-Thr Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWQDKRIZSROAKS-FJXKBIBVSA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- GULGDABMYTYMJZ-STQMWFEESA-N Gly-Trp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O GULGDABMYTYMJZ-STQMWFEESA-N 0.000 description 1
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- 108010031186 Glycoside Hydrolases Proteins 0.000 description 1
- 102000005744 Glycoside Hydrolases Human genes 0.000 description 1
- 241000219146 Gossypium Species 0.000 description 1
- 235000009438 Gossypium Nutrition 0.000 description 1
- 235000014751 Gossypium arboreum Nutrition 0.000 description 1
- 235000009432 Gossypium hirsutum Nutrition 0.000 description 1
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical group C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- PDSUIXMZYNURGI-AVGNSLFASA-N His-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 PDSUIXMZYNURGI-AVGNSLFASA-N 0.000 description 1
- YPLYIXGKCRQZGW-SRVKXCTJSA-N His-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YPLYIXGKCRQZGW-SRVKXCTJSA-N 0.000 description 1
- AVQOSMRPITVTRB-CIUDSAMLSA-N His-Asn-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AVQOSMRPITVTRB-CIUDSAMLSA-N 0.000 description 1
- FPNWKONEZAVQJF-GUBZILKMSA-N His-Asn-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FPNWKONEZAVQJF-GUBZILKMSA-N 0.000 description 1
- YOSQCYUFZGPIPC-PBCZWWQYSA-N His-Asp-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YOSQCYUFZGPIPC-PBCZWWQYSA-N 0.000 description 1
- PYNUBZSXKQKAHL-UWVGGRQHSA-N His-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O PYNUBZSXKQKAHL-UWVGGRQHSA-N 0.000 description 1
- STOOMQFEJUVAKR-KKUMJFAQSA-N His-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 STOOMQFEJUVAKR-KKUMJFAQSA-N 0.000 description 1
- QEYUCKCWTMIERU-SRVKXCTJSA-N His-Lys-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QEYUCKCWTMIERU-SRVKXCTJSA-N 0.000 description 1
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 1
- PGXZHYYGOPKYKM-IHRRRGAJSA-N His-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CCCCN)C(=O)O PGXZHYYGOPKYKM-IHRRRGAJSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 1
- YKUAGFAXQRYUQW-KKUMJFAQSA-N His-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O YKUAGFAXQRYUQW-KKUMJFAQSA-N 0.000 description 1
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 1
- PUFNQIPSRXVLQJ-IHRRRGAJSA-N His-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N PUFNQIPSRXVLQJ-IHRRRGAJSA-N 0.000 description 1
- 101000812437 Homo sapiens ER lumen protein-retaining receptor 1 Proteins 0.000 description 1
- 101000613620 Homo sapiens Protein mono-ADP-ribosyltransferase PARP15 Proteins 0.000 description 1
- 240000005979 Hordeum vulgare Species 0.000 description 1
- 235000007340 Hordeum vulgare Nutrition 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- TVSPLSZTKTUYLV-ZPFDUUQYSA-N Ile-Glu-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O TVSPLSZTKTUYLV-ZPFDUUQYSA-N 0.000 description 1
- JXMSHKFPDIUYGS-SIUGBPQLSA-N Ile-Glu-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N JXMSHKFPDIUYGS-SIUGBPQLSA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 1
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- CEPIAEUVRKGPGP-DSYPUSFNSA-N Ile-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 CEPIAEUVRKGPGP-DSYPUSFNSA-N 0.000 description 1
- VUPHVQCDULLACF-NAKRPEOUSA-N Ile-Met-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N VUPHVQCDULLACF-NAKRPEOUSA-N 0.000 description 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- CZWANIQKACCEKW-CYDGBPFRSA-N Ile-Pro-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N CZWANIQKACCEKW-CYDGBPFRSA-N 0.000 description 1
- XOZOSAUOGRPCES-STECZYCISA-N Ile-Pro-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XOZOSAUOGRPCES-STECZYCISA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- WKSHBPRUIRGWRZ-KCTSRDHCSA-N Ile-Trp-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N WKSHBPRUIRGWRZ-KCTSRDHCSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 101710130583 Kappa-type opioid receptor Proteins 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 241000283986 Lepus Species 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 1
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- WXDRGWBQZIMJDE-ULQDDVLXSA-N Leu-Phe-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O WXDRGWBQZIMJDE-ULQDDVLXSA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- HWMQRQIFVGEAPH-XIRDDKMYSA-N Leu-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 HWMQRQIFVGEAPH-XIRDDKMYSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- FPFOYSCDUWTZBF-IHPCNDPISA-N Leu-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]([NH3+])CC(C)C)C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 FPFOYSCDUWTZBF-IHPCNDPISA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- 102000001109 Leukocyte L1 Antigen Complex Human genes 0.000 description 1
- 108010069316 Leukocyte L1 Antigen Complex Proteins 0.000 description 1
- 240000006240 Linum usitatissimum Species 0.000 description 1
- 235000004431 Linum usitatissimum Nutrition 0.000 description 1
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 1
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 1
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 1
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- GFWLIJDQILOEPP-HSCHXYMDSA-N Lys-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N GFWLIJDQILOEPP-HSCHXYMDSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- XFOAWKDQMRMCDN-ULQDDVLXSA-N Lys-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)CC1=CC=CC=C1 XFOAWKDQMRMCDN-ULQDDVLXSA-N 0.000 description 1
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- CFOLERIRBUAYAD-HOCLYGCPSA-N Lys-Trp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O CFOLERIRBUAYAD-HOCLYGCPSA-N 0.000 description 1
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- IYXDSYWCVVXSKB-CIUDSAMLSA-N Met-Asn-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IYXDSYWCVVXSKB-CIUDSAMLSA-N 0.000 description 1
- HKRYNJSKVLZIFP-IHRRRGAJSA-N Met-Asn-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HKRYNJSKVLZIFP-IHRRRGAJSA-N 0.000 description 1
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 1
- ZMYHJISLFYTQGK-FXQIFTODSA-N Met-Asp-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMYHJISLFYTQGK-FXQIFTODSA-N 0.000 description 1
- PNDCUTDWYVKBHX-IHRRRGAJSA-N Met-Asp-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PNDCUTDWYVKBHX-IHRRRGAJSA-N 0.000 description 1
- RMHHNLKYPOOKQN-FXQIFTODSA-N Met-Cys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O RMHHNLKYPOOKQN-FXQIFTODSA-N 0.000 description 1
- RNAGAJXCSPDPRK-KKUMJFAQSA-N Met-Glu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 RNAGAJXCSPDPRK-KKUMJFAQSA-N 0.000 description 1
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 1
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 1
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 1
- WNJXJJSGUXAIQU-UFYCRDLUSA-N Met-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 WNJXJJSGUXAIQU-UFYCRDLUSA-N 0.000 description 1
- NTYQUVLERIHPMU-HRCADAONSA-N Met-Phe-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N NTYQUVLERIHPMU-HRCADAONSA-N 0.000 description 1
- KYJHWKAMFISDJE-RCWTZXSCSA-N Met-Thr-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCSC KYJHWKAMFISDJE-RCWTZXSCSA-N 0.000 description 1
- RUTZUJXAVNWLQP-BVSLBCMMSA-N Met-Tyr-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 RUTZUJXAVNWLQP-BVSLBCMMSA-N 0.000 description 1
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 1
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 1
- 101100446038 Mus musculus Fabp5 gene Proteins 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 102000005877 Peptide Initiation Factors Human genes 0.000 description 1
- 108010044843 Peptide Initiation Factors Proteins 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 1
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 1
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 1
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 1
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 1
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 1
- PBXYXOAEQQUVMM-ULQDDVLXSA-N Phe-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PBXYXOAEQQUVMM-ULQDDVLXSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- OXKJSGGTHFMGDT-UFYCRDLUSA-N Phe-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CC=CC=C1 OXKJSGGTHFMGDT-UFYCRDLUSA-N 0.000 description 1
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- YMTMNYNEZDAGMW-RNXOBYDBSA-N Phe-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N YMTMNYNEZDAGMW-RNXOBYDBSA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 1
- ABEFOXGAIIJDCL-SFJXLCSZSA-N Phe-Thr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 ABEFOXGAIIJDCL-SFJXLCSZSA-N 0.000 description 1
- WSAPMHXTQAOAQQ-BVSLBCMMSA-N Phe-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=CC=C3)N WSAPMHXTQAOAQQ-BVSLBCMMSA-N 0.000 description 1
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 1
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 1
- 241000218657 Picea Species 0.000 description 1
- 235000008331 Pinus X rigitaeda Nutrition 0.000 description 1
- 235000011613 Pinus brutia Nutrition 0.000 description 1
- 241000018646 Pinus brutia Species 0.000 description 1
- 241001529246 Platymiscium Species 0.000 description 1
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 1
- 208000020584 Polyploidy Diseases 0.000 description 1
- 206010036590 Premature baby Diseases 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 1
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 1
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 1
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- BFXZQMWKTYWGCF-PYJNHQTQSA-N Pro-His-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BFXZQMWKTYWGCF-PYJNHQTQSA-N 0.000 description 1
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- SRBFGSGDNNQABI-FHWLQOOXSA-N Pro-Leu-Trp Chemical compound N([C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C(=O)[C@@H]1CCCN1 SRBFGSGDNNQABI-FHWLQOOXSA-N 0.000 description 1
- AUYKOPJPKUCYHE-SRVKXCTJSA-N Pro-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 AUYKOPJPKUCYHE-SRVKXCTJSA-N 0.000 description 1
- DYMPSOABVJIFBS-IHRRRGAJSA-N Pro-Phe-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CS)C(=O)O DYMPSOABVJIFBS-IHRRRGAJSA-N 0.000 description 1
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 1
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 1
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 1
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 1
- FYXCBXDAMPEHIQ-FHWLQOOXSA-N Pro-Trp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCCCN)C(=O)O FYXCBXDAMPEHIQ-FHWLQOOXSA-N 0.000 description 1
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 description 1
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- 101710124584 Probable DNA-binding protein Proteins 0.000 description 1
- 102100040846 Protein mono-ADP-ribosyltransferase PARP15 Human genes 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 238000010240 RT-PCR analysis Methods 0.000 description 1
- 108700005079 Recessive Genes Proteins 0.000 description 1
- 102000052708 Recessive Genes Human genes 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108010003581 Ribulose-bisphosphate carboxylase Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 108010016634 Seed Storage Proteins Proteins 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 1
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 1
- XPVIVVLLLOFBRH-XIRDDKMYSA-N Ser-Trp-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)CO)C(O)=O XPVIVVLLLOFBRH-XIRDDKMYSA-N 0.000 description 1
- TYIHBQYLIPJSIV-NYVOZVTQSA-N Ser-Trp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)NC(=O)[C@H](CO)N TYIHBQYLIPJSIV-NYVOZVTQSA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- BIWBTRRBHIEVAH-IHPCNDPISA-N Ser-Tyr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BIWBTRRBHIEVAH-IHPCNDPISA-N 0.000 description 1
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- XUIMIQQOPSSXEZ-UHFFFAOYSA-N Silicon Chemical compound [Si] XUIMIQQOPSSXEZ-UHFFFAOYSA-N 0.000 description 1
- 240000003768 Solanum lycopersicum Species 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 1
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- FKIGTIXHSRNKJU-IXOXFDKPSA-N Thr-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CN=CN1 FKIGTIXHSRNKJU-IXOXFDKPSA-N 0.000 description 1
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 1
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 1
- SIEZEMFJLYRUMK-YTWAJWBKSA-N Thr-Met-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N)O SIEZEMFJLYRUMK-YTWAJWBKSA-N 0.000 description 1
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 1
- GYUUYCIXELGTJS-MEYUZBJRSA-N Thr-Phe-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O GYUUYCIXELGTJS-MEYUZBJRSA-N 0.000 description 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- TZQWJCGVCIJDMU-HEIBUPTGSA-N Thr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N)O TZQWJCGVCIJDMU-HEIBUPTGSA-N 0.000 description 1
- BJJRNAVDQGREGC-HOUAVDHOSA-N Thr-Trp-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O BJJRNAVDQGREGC-HOUAVDHOSA-N 0.000 description 1
- KHTIUAKJRUIEMA-HOUAVDHOSA-N Thr-Trp-Asp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 KHTIUAKJRUIEMA-HOUAVDHOSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- 101710120037 Toxin CcdB Proteins 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- NULQKGDFWHIGMD-NYVOZVTQSA-N Trp-Cys-Trp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NULQKGDFWHIGMD-NYVOZVTQSA-N 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- VISUNEBASWEMCU-SZMVWBNQSA-N Trp-Glu-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N VISUNEBASWEMCU-SZMVWBNQSA-N 0.000 description 1
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 1
- WLQRIHCMPFHGKP-PMVMPFDFSA-N Trp-Leu-Phe Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CC(C)C)C(O)=O)C1=CC=CC=C1 WLQRIHCMPFHGKP-PMVMPFDFSA-N 0.000 description 1
- SNWIAPVRCNYFNI-SZMVWBNQSA-N Trp-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SNWIAPVRCNYFNI-SZMVWBNQSA-N 0.000 description 1
- TUUXFNQXSFNFLX-XIRDDKMYSA-N Trp-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N TUUXFNQXSFNFLX-XIRDDKMYSA-N 0.000 description 1
- WMIUTJPFHMMUGY-ZFWWWQNUSA-N Trp-Pro-Gly Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)NCC(=O)O WMIUTJPFHMMUGY-ZFWWWQNUSA-N 0.000 description 1
- HIZDHWHVOLUGOX-BPUTZDHNSA-N Trp-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O HIZDHWHVOLUGOX-BPUTZDHNSA-N 0.000 description 1
- SEXRBCGSZRCIPE-LYSGOOTNSA-N Trp-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O SEXRBCGSZRCIPE-LYSGOOTNSA-N 0.000 description 1
- WNGMGTMSUBARLB-RXVVDRJESA-N Trp-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)N)C(=O)NCC(O)=O)=CNC2=C1 WNGMGTMSUBARLB-RXVVDRJESA-N 0.000 description 1
- FHHYVSCGOMPLLO-IHPCNDPISA-N Trp-Tyr-Asp Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 FHHYVSCGOMPLLO-IHPCNDPISA-N 0.000 description 1
- PKZIWSHDJYIPRH-JBACZVJFSA-N Trp-Tyr-Gln Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKZIWSHDJYIPRH-JBACZVJFSA-N 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- WPVGRKLNHJJCEN-BZSNNMDCSA-N Tyr-Asp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WPVGRKLNHJJCEN-BZSNNMDCSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 1
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- DZKFGCNKEVMXFA-JUKXBJQTSA-N Tyr-Ile-His Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O DZKFGCNKEVMXFA-JUKXBJQTSA-N 0.000 description 1
- LQGDFDYGDQEMGA-PXDAIIFMSA-N Tyr-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N LQGDFDYGDQEMGA-PXDAIIFMSA-N 0.000 description 1
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 1
- ZMKDQRJLMRZHRI-ACRUOGEOSA-N Tyr-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N ZMKDQRJLMRZHRI-ACRUOGEOSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 1
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 1
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 1
- UUJHRSTVQCFDPA-UFYCRDLUSA-N Tyr-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 UUJHRSTVQCFDPA-UFYCRDLUSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 1
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- CWOSXNKDOACNJN-BZSNNMDCSA-N Val-Arg-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N CWOSXNKDOACNJN-BZSNNMDCSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 1
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- OJPRSVJGNCAKQX-SRVKXCTJSA-N Val-Met-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OJPRSVJGNCAKQX-SRVKXCTJSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- SDHZOOIGIUEPDY-JYJNAYRXSA-N Val-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 SDHZOOIGIUEPDY-JYJNAYRXSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 1
- OEVFFOBAXHBXKM-HSHDSVGOSA-N Val-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N)O OEVFFOBAXHBXKM-HSHDSVGOSA-N 0.000 description 1
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 1
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- PCBMYXLJUKBODW-UHFFFAOYSA-N [Ru].ClOCl Chemical compound [Ru].ClOCl PCBMYXLJUKBODW-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010070113 alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase I Proteins 0.000 description 1
- WQZGKKKJIJFFOK-PQMKYFCFSA-N alpha-D-mannose Chemical compound OC[C@H]1O[C@H](O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-PQMKYFCFSA-N 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 238000004500 asepsis Methods 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- DKXNBNKWCZZMJT-VOXHDCLVSA-N beta-D-glucosyl-(1->4)-aldehydo-D-mannose Chemical compound O=C[C@@H](O)[C@@H](O)[C@@H]([C@H](O)CO)O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O DKXNBNKWCZZMJT-VOXHDCLVSA-N 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 235000009120 camo Nutrition 0.000 description 1
- 230000011748 cell maturation Effects 0.000 description 1
- 210000003850 cellular structure Anatomy 0.000 description 1
- 229940106157 cellulase Drugs 0.000 description 1
- 235000005607 chanvre indien Nutrition 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 238000005352 clarification Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 239000002361 compost Substances 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000007598 dipping method Methods 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- CEYULKASIQJZGP-UHFFFAOYSA-L disodium;2-(carboxymethyl)-2-hydroxybutanedioate Chemical compound [Na+].[Na+].[O-]C(=O)CC(O)(C(=O)O)CC([O-])=O CEYULKASIQJZGP-UHFFFAOYSA-L 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000004992 fission Effects 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 238000010363 gene targeting Methods 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 125000003147 glycosyl group Chemical group 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010084264 glycyl-glycyl-cysteine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 239000011487 hemp Substances 0.000 description 1
- 239000000833 heterodimer Substances 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010041601 histidyl-aspartyl-glutamyl-leucine Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- 238000005286 illumination Methods 0.000 description 1
- 238000012744 immunostaining Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 101150083490 mal1 gene Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000001000 micrograph Methods 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000004660 morphological change Effects 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 239000003471 mutagenic agent Substances 0.000 description 1
- 150000002482 oligosaccharides Chemical class 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000000123 paper Substances 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 239000003415 peat Substances 0.000 description 1
- 108010091617 pentalysine Proteins 0.000 description 1
- 239000000575 pesticide Substances 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 239000001739 pinus spp. Substances 0.000 description 1
- 238000003976 plant breeding Methods 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 230000010287 polarization Effects 0.000 description 1
- 230000008119 pollen development Effects 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000001172 regenerating effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 239000004576 sand Substances 0.000 description 1
- 230000010153 self-pollination Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 238000012882 sequential analysis Methods 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 238000010008 shearing Methods 0.000 description 1
- 229910052710 silicon Inorganic materials 0.000 description 1
- 239000010703 silicon Substances 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000001488 sodium phosphate Substances 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 210000001541 thymus gland Anatomy 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 230000002792 vascular Effects 0.000 description 1
- 230000017260 vegetative to reproductive phase transition of meristem Effects 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 239000002023 wood Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
- C12N15/8245—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine involving modified carbohydrate or sugar alcohol metabolism, e.g. starch biosynthesis
- C12N15/8246—Non-starch polysaccharides, e.g. cellulose, fructans, levans
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Molecular Biology (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Nutrition Science (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Botany (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
Abstract
本发明提供了用于在产纤维植物例如棉属植物中调节(即增强或降低)纤维素生物合成和/或纤维品质的嵌合基因,以及利用所述嵌合基因调节纤维素生物合成的方法。本发明还提供了包括本发明的嵌合基因的植物细胞和植物以及此类包括本发明的嵌合基因的植物的种子。本发明还提供了用于在包括一种特定植物品种的不同基因型或变种的群体中鉴别编码参与纤维素生物合成的蛋白质的基因的等位基因变异的方法。
Description
发明领域
本发明涉及农业生物工程学领域。更具体地,本发明提供了参与纤维素生物合成的新的基因以及使用此类基因调节产纤维植物例如棉属植物中的纤维素生物合成的方法。本发明也提供了用于在产纤维植物群体中鉴别并分离这些基因的与产生的纤维的品质有关联的等位基因的方法。
背景技术
纤维素是高等植物细胞壁的主要结构多糖。β-1,4-联葡糖残基(β-1,4-1inked glucosyl residues)的链在合成后迅速组装形成坚硬的、在化学性质上具有抵抗力的微纤维。其机械特性及其在细胞壁的定位影响了细胞在不同反向的相对伸展性,并决定成熟细胞和器官的最终机械特性。这些机械特性对于木材、纸张、纺织品和化学工业极为重要。
很多用于纺织业的高质量纤维是棉属植物产生的。全世界的大约90%的棉属植物是陆地棉(Gossypium hirsutum L.),而海岛棉(Gossypiumbarbadense)占大约8%。
已经在多种植物中通过突变分析鉴定了若干种参与纤维素生物合成的基因。鼠耳芥(Arabidopsis thaliana)的突变株说明,体内纤维素合成需要编码糖基转移酶的AtCesA基因家族成员的活性(Arioli等,1998;Taylor等,1999;Fagard等,2000;Taylor等,2000;Scheible等,2001;Burn等,2002a;Desprez等,2002)、编码膜相关性内-1,4-β-D-葡聚糖酶的AtKOR1基因(At5g49720)活性(Nicol等,1998;Zuo等,2000;Lane等,2001;Sato等,2001)、编码一种功能未知的质膜蛋白的KOBITO1的活性(Pagant等,2002)和编码ER中进行N-糖基化/品质控制途径的酶的基因的活性(Lukowitz等,2001;Bum等,2002b;Gillmor等,2002)。
内-1,4-β-D-葡聚糖酶在纤维素合成中的功能仍有待确定,但其缺乏BnCel16的对结晶纤维素的活性,BnCel16是一种相关的欧洲油菜(Brassica napus)酶(Mlhj等,2001),这提示这种酶可能裂解非结晶葡聚糖链,例如脂质连接引物(lipid-linked primer)或葡聚糖供体(Williamson等,2001;Peng等,2002)。番茄Ce13(LeCe13)是第一种得到鉴定的膜相关性内-1,4-β-D-葡聚糖酶(Brummel1等,1997),而LeCe13的抗体检测到棉纤维蛋白在除草剂抑制纤维素合成的过程中被上调(Peng等,2001)。棉纤维膜成分的体外纤维素合成活性需要Ca2+,而由于外源性的Ca2+-非依赖性内-1,4-β-D-葡聚糖酶可使得纤维素合成活性恢复,因此认为KOR(GhKOR)的棉属植物直向同源物是内源性Ca2+-依赖性因子(Peng等,2002)。截短形式的BnCel16在体外显示出Ca2+-依赖性(Mlhj等,2001)。
进一步遗传学数据指向应答于N-糖基化/品质控制途径的酶缺陷的纤维素合成。这些步骤出现于ER而不是在质膜,且因此可能通过为质膜提供关键的糖蛋白而仅仅间接作用于合成。当富甘露糖寡糖Glc3Man9GlcNac2在ER膜的多萜醇上组装并被转移到含有Asn-X-Ser或Asn-X-Thr基序(其中X是除Pro以外的任何氨基酸)的新合成的蛋白质的Asn残基的时候即开始N-糖基化。
通过葡糖苷酶I和II进一步加工糖蛋白,N-糖基化与负责保证新合成的蛋白质正确折叠的品质控制途径相交(Helenius and Aebi,2001;Vitale,2001)。葡糖苷酶I去除末端α-1,2-联葡糖残基以产生Glc2Man9GlcNac2,而葡糖苷酶II去除下一个α-1,3-葡糖残基。携带所得GlcMan9GlcNac2的多肽特异性结合分子伴侣(钙连接蛋白和钙网蛋白)以及或许其他的可促进新合成的蛋白质的正确折叠的蛋白质。当葡糖苷酶II将结合分子伴侣所需的最后的Glc残基去除后,糖蛋白释放所述分子伴侣。然后糖蛋白葡糖基转移酶将一个Glc残基再次结合到非正确折叠的糖蛋白的Man9GlcNAc2,如此使得它们可再次结合分子伴侣并仍拥有正确折叠的机会。不过,正确折叠的蛋白质则不能被该酶再次葡糖基化,而是通过分泌途径前进以进一步加工和输送。
该途径中的若干个环节的缺陷可影响纤维素合成。序列分析显示,马铃薯MAL1基因编码一种葡糖苷酶II,且反义抑制降低了葡糖苷酶II活性(Taylor等,2000a)。与对照相比,当生长于田间环境时,M4LJ反义植物积聚较少的纤维素,不过,在暖房条件下没有明显的表型。胚芽致死knopf突变体缺乏葡糖苷酶I并严重缺乏纤维素(Gillmor等,2002)。最后,胚芽致死cyt1突变体是来自甘露糖-1-磷酸guanylyltransferase缺陷的纤维素缺陷体,该酶产生组装高甘露糖寡糖所需的(包括其他)UDP-Man,高甘露糖寡糖自多萜醇转移至新生的蛋白质(Lukowitz等,2001)。影响纤维素合成的突变主要针对那些N-糖基化途径与品质控制途径发生交叉的早期步骤。似乎特别重要的是品质控制,而不是在重要蛋白质上产生成熟聚糖,这是由于一种N-乙酰氨基葡萄糖转移酶I的缺陷没有产生可检测到的表型,该缺陷阻断了高尔基体中形成成熟的N-联聚糖的步骤(von Schaewen等,1993)。
Baskin等,1992描述了鼠耳芥属突变体,这些突变体显示出根部径向膨胀,这些突变体称为rsw1、rsw2和rsw3。这些突变系显示出纤维素的产生具有选择性降低(Peng等,2000)。
WO98/00549一般性涉及分离的基因,这些基因编码参与植物中纤维素生物合成的多肽,还涉及转基因植物,其表达有义或反义方向的所述基因或作为核酶、共抑制或基因导向分子的所述基因。更具体地,其请求保护一种核酸分子,其分离自鼠耳芥、稻米(Oryza sativa)、小麦、大麦、玉米、芸苔(Brassica ssp.)、陆地棉和桉(Eucalyptus ssp),其编码一种对纤维素生物合成十分重要的酶,特别是纤维素合酶及其同源物、类似物和衍生物,及其在产生表达改变了的纤维素生物合成特性的转基因植物中的用途。
WO 98/50568公开了使用编码内-1,4-β-葡聚糖酶的核苷酸序列在植物中抑制细胞的生长。所述核苷酸序列与鼠耳芥属KOR蛋白序列或如下所述的一种蛋白序列完全或部分相对应,所述的一种蛋白序列的N端与所述KOR的前107个氨基酸具有至少40%相同性且优选地具有70%相同性。
WO 97/24448描述了重组的和分离的核酸,所述核酸编码一种植物α-葡糖苷酶。其也提供了一种反义核苷酸以及所述分离的或重组的序列和反义序列的用途。该发明的用途包括增强和降低α-葡糖苷酶的表达以及提供新的淀粉。
WO 00/08175涉及编码来自马铃薯的具有α-葡糖苷酶活性的蛋白质的核酸分子。该发明也涉及用于产生合成改性淀粉的转基因植物细胞和植物的方法。该发明进一步涉及含有所述核酸分子的载体和宿主细胞、由该发明的方法得到的植物细胞和植物、由所述植物细胞合成的淀粉以及产生此类淀粉的方法。
WO 98/39455公开了一种参与微生物合成纤维素的基因和酶。提供了编码纤维素酶、纤维素合酶复合体和α-葡糖苷酶的特定的基因。
WO9818949和US6271443提供了来自棉属植物(陆地棉)的两种植物cDNA克隆,它们是编码纤维素合酶的催化亚基的细菌CelA基因的同源物。其还提供了纤维素合酶的编码区的基因组启动子区域。还提供了在改良棉纤维和木材品质中使用纤维素合酶的方法。
不过,现有技术仍然亟需那些已知参与纤维素生物合成的基因的替代物,且没有公开参与纤维素生物合成的野生型基因的核苷酸序列以及在rsw3突变体鼠耳芥系中是突变的核苷酸序列。此外,现有技术没有公开来自棉属植物的参与纤维素生物合成的RSW2或RSW3的棉属植物同源物基因。
这些以及其他的问题通过以下公开的本发明的各种不同实施方式和权利要求书而得以解决。
发明内容
本发明的一个目的是提供一种用于增加产纤维植物,特别是棉属植物中的纤维素生物合成,例如棉绒纤维生物合成的方法,其包括以下步骤:
(a)给所述产纤维植物的细胞提供一种嵌合基因,所述嵌合基因包括以下可操纵连接的DNA片段:
i)在所述植物的所述细胞中可表达的启动子,例如组成型启动子、纤维特异性启动子或者苹果青霉素启动子;
ii)编码包括SEQ ID No.5或SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质(或者所述蛋白质的具有同样酶活性的变体)的DNA区域,例如SEQ ID No.1的自第位核苷酸至第1986位核苷酸的核苷酸序列或SEQ ID No.2的自第47位核苷酸至第1906位核苷酸的核苷酸序列或SEQ ID No.3的或SEQ ID No.4的自第2位核苷酸至第1576位核苷酸的核苷酸序列或SEQ ID No.9的核苷酸序列;
iii)参与转录终止和聚腺苷酸化的3’区域。
本发明的另一个目的是提供一种用于降低产纤维植物,特别是棉属植物中的纤维素生物合成,例如绒毛纤维生物合成的方法,其包括给所述产纤维植物的细胞提供一种能够降低所述产纤维植物的一种内源性基因的表达的嵌合基因的步骤,其中所述内源性基因编码一种包括SEQID No.5或SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质或其变体,所述变体具有同样的酶活性。所述导入的嵌合基因可包括:一种具有21个连续的核苷酸的核苷酸序列,其选自编码一种包括SEQ ID No.5或SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质的核苷酸序列,例如SEQ ID No.1或SEQ ID No.2或SEQ ID No.3或SEQ ID No.4或SEQ ID No.9的核苷酸序列或其互补序列,所述核苷酸序列可操纵地连接于在植物中可表达的启动子例如组成型启动子或绒毛纤维特异性启动子,以及参与转录终止和聚腺苷酸化的3’区域。所述嵌合基因也可包括:一种具有21个连续的核苷酸的第一核苷酸序列,其选自编码一种包括SEQ ID No.5或SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质的核苷酸序列,例如SEQ ID No.1、SEQ ID No.2、SEQ ID No.3和SEQ ID No.4或SEQID No.9的核苷酸序列,和一种与所述第一核苷酸序列互补的第二核苷酸序列,所述核苷酸序列可操纵地连接于在植物中可表达的启动子,以及参与转录终止和聚腺苷酸化的3’区域,由此在所述嵌合基因转录后形成一种RNA,所述RNA可在所述第一和第二核苷酸序列之间形成一个双链RNA区域。
本发明进一步涉及一种用于增加产纤维植物中,特别是棉属植物中的纤维素生物合成的嵌合基因,其包括以下可操纵连接的DNA片段:在所述植物的所述细胞中可表达的启动子,例如组成型启动子、(棉绒)纤维特异性启动子或苹果青霉素启动子;编码包括SEQ ID No.6或SEQID No.7或SEQ ID No.8的氨基酸序列的蛋白质或其变体的DNA区域,所述变体具有同样的酶活性,例如SEQ ID No.1的自第121位核苷酸至第1986位核苷酸或SEQ ID No.2的自第47位核苷酸至第1906位核苷酸或SEQ ID No.3的或SEQ ID No.4的自第2位核苷酸至第1576位核苷酸或SEQ ID No.9的核苷酸序列;以及参与转录终止和聚腺苷酸化的3’端区域。
本发明也涉及一种用于降低产纤维植中,特别是棉属植物中的纤维素生物合成的嵌合基因,其包括一种可操纵地连接于在植物中可表达的启动子的具有21个连续的核苷酸的核苷酸序列,所述序列选自编码一种包括SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质的核苷酸序列,例如SEQ ID No.2、SEQ ID No.3或SEQ ID No.4或SEQ ID No.9的核苷酸序列或其互补序列,以及参与转录终止和聚腺苷酸化的3’区域。
本发明还涉及一种用于降低产纤维植物中,特别是棉属植物中的纤维素生物合成的嵌合基因,其包括可操纵地连接于在植物中可表达的启动子的一种具有21个连续的核苷酸的第一核苷酸序列和一种与所述第一核苷酸序列互补的第二核苷酸序列以及参与转录终止和聚腺苷酸化的3’区域,所述第一核苷酸序列选自编码一种包括SEQ ID No.6或SEQID No.7或SEQ ID No.8的氨基酸序列的蛋白质的核苷酸序列,由此在所述嵌合基因转录后形成一种RNA,所述RNA可在所述第一和第二核苷酸序列之间形成一个双链RNA区域。
本发明的另一方面提供了包含本发明的嵌合基因的植物细胞和植物,以及包含本发明的嵌合基因的此类植物的种子。此类植物优选地基本上由包含本发明的嵌合基因的植物细胞组成。
因此本发明涉及本发明的嵌合基因在调节产纤维植物的,例如棉属植物的,纤维素生物合成和纤维品质中的用途。
本发明的另一方面还在于提供了一种用于在包括一种特定植物品种的不同基因型或变种的群体中鉴别编码参与纤维素生物合成的蛋白质的基因的等位基因变异的方法,所述特定植物品种优选地是产纤维植物品种,例如棉属植物,所述等位基因变异单独地或者组合地与纤维素生产的数量和/品质以及纤维生产有关联,所述方法的步骤包括:
a)提供包含不同等位基因形式的编码包括SEQ ID No.5、6、7或8的氨基酸序列的蛋白质的核苷酸序列的特定植物品种或杂种繁殖植物品种的不同变种或基因型的群体;
b)确定所述群体中每一个个体的与纤维生产和/或纤维素生物合成有关的参数;
c)确定所述群体中每一个个体是否存在一种特定等位基因形式的编码包括SEQ ID No.5、6、7或8的氨基酸序列的蛋白质的核苷酸序列;以及
d)将出现特定的纤维或纤维素参数与是否存在一种特定等位基因形式的所述核苷酸序列或此类等位基因形式的一种特定组合相关联。
附图说明
图1:蛋白质GhKOR(SEQ ID No.6)、LeCel3(保藏号T07612)和AtKOR1(保藏号At5g49720;SEQ ID No.5)与BnCel16(保藏号CAB51903)的ClustalW比对。突出的部分是:极性化导向基序,涉及导向细胞板(cell plate)(Zuo等,2000);推定的靠近N端的跨膜区(跨膜);4个潜在参与催化的保守残基(Asp-198、Asp-201、His-516和E-555;标记为o)以及与家族9糖苷水解酶非常相似的代表性部分;富Pro且具有内-1,4-β-葡聚糖酶家族的膜结合成员的特征的C端区域;8个推定的N-糖基化位点(Asn-X-Ser/Thr;标记为G1至G8)。
图2:通过以可操纵地连接于CaMV 35S启动子的GhKOR1 cDNA(SEQ ID No.2)进行转染而与rsw2-1互补。(A)在29℃暴露2天后,rsw2-1根发生膨胀,而野生型(Co)和含有AtKOR1或GhKOR的互补植物则不发生膨胀。(B)两株植物的成熟的茎,各自为rsw2-1(左)野生型和表达GhKOR的rsw2-1。在21℃生长于罐中的植株的照片,直至开始抽苔(bolting),此时将抽苔切下并将植株转移至29℃以使抽苔再生。
图3:编码葡糖苷酶II基因的突变引起径向膨胀。(a)以自野生型基因组扩增的5.8kB片段转化的rsw3互补根部径向膨胀。Columbia野生型(左)、rsw3(中)和以葡糖苷酶II基因的基因组拷贝转化的rsw3卡那霉素抗性T1幼苗(右)。野生型基因抑制径向膨胀。在拍照前,将所有植物转移至30℃放置2天。(b)rsw3突变是插入性突变体5GT5691的等位基因突变,其在葡糖苷酶II基因的第一个外显子中含有Ds元件。Columbia野生型(左)、rsw3(中)和来自5GT5691与rsw3杂交的杂合F1植物。F1杂合子和rsw3纯合子显示温度诱导的径向膨胀。在拍照前,将所有植物转移至30℃放置2天。
图4:Aglu-3/RSW3序列(Genbank NP_201189)与来自马铃薯(保藏号T07391)、小鼠(NP_032086)和裂殖酵母(CAB65603)的ER固有的葡糖苷酶II的序列的比对。Monroe等(1999)的进化枝2证实了这种高保守性。它们包括若干参与催化的残基(Asp 512和Asp 617;*)。rsw3-1突变的位点(Ser599●)在这些共有序列附近并在这些以及其他葡糖苷酶II序列中是保守的。预测的N端信号序列示于框中。在C端没有HDELER-保留序列。
图5:推荐的鼠耳芥属(At5g56360)的和水稻(我们修改的BAA88186)的β-亚基与来自小鼠(AAC53183)的和裂殖酵母(BAA13906)的葡糖苷酶IIβ-亚基的比对。注意推定的N端信号序列(框中)、C端H/VDEL ER-保留信号和靠近N端的甘露糖受体同源区域(MHR)。给出了MHR中的6个半胱氨酸(f酵母中仅为4个)的编号,并标出了参与底物结合的R和Y残基(●)和半胱氨酸5和6之间的底物识别环。在该序列的其他部分,注意N端和C端结构域具有相对高水平的相似性,而中部区域的相似性要低得多并具有植物特异性插入体。
图6:在所有测试的鼠耳芥属组织中均检测到α-亚基(a)和β-亚基(b)的mRNA。使用来自根(道1)、完整莲座叶(2)、叶片(3)、成熟的茎组织(4)、茎生叶片(5)、花芽(6)、花(7)、长角果(8)、黑暗中生长的胚轴(9)的mRNA的RT-PCR。(在另一个实验中证实黑暗中生长的胚轴存在β-亚基)。
图7:rsw3的形态学。
(a)幼苗的根系显示在膨胀和停止延长之前侧根延伸一定距离。植物在21℃生长5天并在30℃生长6天。比例尺=2mm。
(b)根继续生长产生密集的、高度分枝的根系,并在30℃生长21天的植物上产生大量密集的极小的叶片。比例尺=5mm。
(c)在黑暗中21℃生长3天并在30℃生长3天的胚轴。左起:野生型、rsw1-1、rsw2-1、rsw3、rsw1-1rsw2-1、rsw1-1rsw3。相对于其他单突变体,rsw3对胚轴的影响较弱,而rsw1-1rsw3较rsw1-1rsw2-1更弱。比例尺=5mm。
(d)在30℃在琼脂上生长35天的rsw3的光镜检查。自若干个花结出现接近正常大小的花芽的微弱荧光(右上和左下)。比例尺=5mm。
(e)在30℃生长21天的rsw3植物的扫描电子显微镜并显示出现多个花结。比例尺=1mm。
(f)(e)中环形部分的细节显示细小的叶片的非常复杂的排列,其中很多带有近似正常大小和形态的毛状体。比例尺=200μm。
(g)在30℃生长10天的植物上的野生型叶片表面的扫描电子显微镜照片。注意那些清晰显示出的细胞边界、气孔和毛状体。
(h)rsw3叶片的表面显示出铺路石状细胞(pavement cells)的轮廓明显欠清晰、在其次生细胞环的顶部有明显塌陷的毛状体(CT)以及许多气孔,其保卫细胞突出于叶片表面以上。(g)和(h)的比例尺=100μm。
图8:rsw3的茎生长和再生性发生。
(a和b)21℃(a)和30℃(b)时Columbia野生型、rsw3、rsw1和rsw1rsw3双突变体的次生茎延长的动力学。所有植物生长于21℃直至茎开始出现。将其切下并在指定的温度再次生长次生抽苔。单突变体与野生型在21℃仅有极小的差异,不过双突变体延长得更加缓慢且最终高度明显更矮小。在30℃达到的最终高度差异很大,这与其经由的轨迹一样。rsw1延长得更加缓慢,但延长一直持续,至少与其野生型一样。rsw3延长得几乎与野生型一样迅速,但4天后停止延长达大约6天。rsw1rsw3双突变体延长比较不迅速,并在大约第5天停止延长。
(c和d)光镜显示野生型具有间隔良好的花(c)以及rsw3的因其早期停止延长而形成的成簇的花(d)。
(e和f)冷扫描电子显微镜显示了野生型(e)和rsw3(f)的花芽,花芽的大小相似,但rsw3过早开花。注意rsw3的柱头(St)的未成熟状态和萼片(Se)上的细胞的不规则形状。(e)和(f)的比例尺=200μm。
(g和h)冷扫描电子显微镜显示出保持在21℃(g)和30℃(h)的植物产生的吸液后的rsw3的种子。30℃的种子皱缩并缺乏21℃的种子所具有的清晰的细胞图案。
(i-n)吸液后的种子以钌红染色后的光镜照片,显示表面黏液覆层。野生型(i,j)、rsw1(k,l)、rsw3(m,n)。i,k,m中的种子产生于21℃的植物,j,l,n中的种子产生于30℃的植物。分泌黏液的通常是30℃的rsw1(l)和野生型(j)而不是rsw3(n)。
发明的详细描述
本发明基于鉴定了已经发生突变的鼠耳芥属突变体rsw3的野生型基因并阐明了其功能。本发明人还鉴定了相应于rsw2和rsw3鼠耳芥属突变体中的突变基因的棉属植物基因。这些棉属植物基因参与纤维素的产生。
在一个实施方式中,本发明涉及一种用于在植物中增加纤维素产生的方法,其步骤包括为植物的细胞提供一种嵌合基因,所述嵌合基因包括可操纵地连接于一种DNA区域的在植物中可表达的启动子以及参与转录终止和聚腺苷酸化的3’区域,所述DNA区域编码一种包括SEQ IDNo.5、SEQ ID No.6、SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质或其变体,所述变体具有与所述蛋白质相似的活性。所述植物可以是产纤维植物例如棉属植物,纤维素产生增加可导致产生更大量的棉纤维,优选地为棉绒纤维,或导致产生的棉纤维的长度发生改变或长度增加,或品质的改变例如抗张强度提高。
在此,“嵌合基因”或“嵌合核酸”指的是任何基因或任何核酸,其在正常情况下不存在于具体的真核生物物种中,或者指的是任何基因,其中启动子在天然情况下不与部分或全部被转录的DNA区域相关联,或不与该基因的至少一种其他调节区域相关联。
在此,术语“启动子”指的是任何在转录起始过程中被DNA-依赖性RNA-聚合酶识别并(直接或间接)结合的DNA。启动子包括转录起始位点和与转录起始因子和RNA聚合酶的结合位点,并可以包含多种可与基因表达调节蛋白结合的其他位点(如增强子)。在此,术语“调节区域”指的是任何参与驱动转录并控制(如调节)特定DNA序列的转录时机和水平的DNA,所述特定DNA序列例如为编码蛋白质或多肽的DNA。例如,5′调节区域(或“启动子区域”)是位于编码序列上游(即5′)的DNA序列并包含启动子和5′非翻译前导序列。3′调节区域是位于编码序列下游(即3′)的DNA序列并包含合适的转录终止(和/或调节)信号,包括一或多个聚腺苷酸化信号。
在本发明的一个实施方式中,启动子是组成型启动子。在本发明的另一个实施方式中,启动子活性可被外在的或内在的刺激增强(诱导型启动子),例如但不限于激素、化学品、机械刺激、非生物性或生物学应激条件。能够以时间的或空间的方式来调节启动子的活性(组织特异性启动子;发育调节的启动子)。
在本发明的一个具体实施方式中,启动子是植物可操纵的(即植物可表达的)启动子。在此,术语“植物可表达的启动子”指的是能够在植物细胞内控制(起始)转录的DNA序列。这包括任一一种植物来源的启动子,但也可以是非植物来源的但能够在植物细胞中指导转录的启动子,即特定的病毒或细菌来源的启动子,例如CaMV 35S启动子(Hapster等,1988)、地三叶草(subterranean clover)病毒启动子No 4.或No.7(WO9606932)或T-DNA基因启动子,但也可以是组织特异性或器官特异性启动子,包括但不限于种子特异性启动子(如WO89/03887)、器官原基特异性启动子(An等,1996)、茎特异性启动子(Keller等,1988)、叶片特异性启动子(Hudspeth等,1989)、叶肉特异性启动子(例如光诱导型Rubisco启动子)、根特异性启动子(Keller等,1989)、块茎特异性启动子(Keil等,1989)、血管组织特异性启动子(Peleman等,1989)、雄蕊选择性启动子(WO 89/10396,WO 92/13956)等等。
优选的植物可表达的启动子包括纤维特异性和/或次生细胞壁特异性启动子,可根据WO 98/18949、WO98/00549或US5932713所述对其进行分离。WO98/18949或US 6,271,443所公开的启动子也是合适的,特别优选的是棉绒纤维特异性启动子。
在上述方法的一个实施方式中,编码包含SEQ ID No.5、SEQ IDNo.6、SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质的DNA区域包含的核苷酸序列为SEQ ID No.1的第121至1986位核苷酸、SEQID No.2的第47至1906位核苷酸、SEQ ID No.3或SEQ ID No.4的第2至1576位核苷酸、或SEQ ID No.9。
在上述方法的另一个实施方式中,DNA区域编码包含SEQ ID No.5、SEQ ID No.6、SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质的变体。在此,“变体”蛋白质指的是这样的蛋白质,所述蛋白质通过取代、缺失、插入,使得其中一或多个氨基酸与具有SEQ ID No.5、SEQ ID No.6、SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质中的相应位置的氨基酸不同;且所述蛋白质具有由SEQ ID No.5、SEQ IDNo.6、SEQ ID No.7或SEQ ID No.8编码的蛋白质的至少一种功能,例如同样的酶或催化活性。获得变体的方法例如位点特异性诱变方法以及鉴定由这些变体序列所编码的酶活性的测试方法是本领域已知的。优选的取代也称为保守性取代,其中多肽中的一种氨基酸残基被替换为另一种具有相似化学性质的天然存在的氨基酸,例如GlyAla,ValIleLeu,AspGlu,LysArg,AsnGln或PheTrpTyr。
根据说明书,编码变体蛋白质的等位基因形式的核苷酸序列可在严格性条件下通过文库杂交进行鉴定,例如不同变种或植物品系的cDNA或基因组文库,特别是棉属植物变种和植物品系。上述优选的编码区域的功能性等价物是在严格性条件下与编码SEQ ID No.5、6、7或8的氨基酸序列的核苷酸序列或SEQ ID No.1、2、3、4或9的核苷酸序列杂交的核苷酸序列或其足够大的一部分(优选地大约25个连续的核苷酸,特别优选地至少大约50个连续的核苷酸,更加特别优选地至少大约100个连续的核苷酸),并且其编码一种功能性蛋白质,该蛋白质能够互补鼠耳芥属的rsw2或rsw3突变体品系中的至少一种功能,但优选地能够互补全部受影响的能够功能。也可采用例如聚合酶链反应进行扩增而鉴别并分离此类核苷酸,扩增可使用合适的寡核苷酸对,其具有SEQ ID No.1、SEQ ID N.2、SEQ ID No.3、SEQ ID No.4或SEQ ID No.9的核苷酸的至少大约25个连续的核苷酸,特别优选地至少大约50个连续的核苷酸,更加特别优选地至少大约100个连续的核苷酸。
“严格性杂交条件”在此指的是,如果探针与靶序列之间具有至少95%且优选地至少97%的序列相同性,则通常会发生杂交。严格性杂交条件的实例是在包括50%甲酰胺,5×SSC(150mM NaCl,15mM柠檬酸钠),50mM磷酸钠(pH 7.6),5×Denhardt溶液,10%硫酸右旋糖苷和20μg/ml变性的剪断的载体DNA例如鲑精DNA的溶液中孵育过夜,随后在大约65℃在0.1×SSC中洗涤杂交支持物。其他的杂交和洗涤条件是已知的并可参照Sambrook等,Molecular Cloning:A Laboratory Manual,Second Edition,Cold Spring Harbor,NY(1989),特别是11章。
在本发明的另一方面,所鉴定的基因可用于在植物如产纤维植物中,例如棉属植物中,降低纤维素的生物合成。因此,本发明的另一个实施方式提供了一种在植物如产纤维植物中,特别是棉属植物中,降低纤维素生物合成的方法,其步骤包括给所述产纤维植物的细胞提供一种嵌合基因,所述嵌合基因能够降低所述产纤维植物的一种内源性基因的表达,其中所述内源性基因编码一种包括SEQ ID No.5或SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质或其变体,所述变体具有相同的功能或酶活性。
在本发明的这种方法的一个实施方式中,将嵌合基因提供给所述植物的细胞,其中所述嵌合基因包括一种可操纵地连接于在植物中可表达的启动子的具有21个连续的核苷酸的序列以及参与转录终止和聚腺苷酸化的3’区域,该序列选自编码一种包括SEQ ID No.5或SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质的核苷酸序列,例如为一种选自SEQ ID No.1或SEQ ID No.2或SEQ ID No.3或SEQID No.4或SEQ ID No.9的核苷酸序列的具有21个连续的核苷酸的核苷酸序列(所谓“有义”RNA介导的基因沉默)。在本发明的这种方法的另一个实施方式中,将嵌合基因提供给所述植物的细胞,其中所述嵌合基因包括一种可操纵地连接于在植物中可表达的启动子的具有21个连续的核苷酸的序列以及参与转录终止和聚腺苷酸化的3’区域,该序列选自编码一种包括SEQ ID No.5或SEQ ID No.6或SEQ ID No.7或SEQ IDNo.8的氨基酸序列的蛋白质的核苷酸序列的互补序列,例如为一种选自SEQ ID No.1或SEQ ID No.2或SEQ ID No.3或SEQ ID No.4或SEQ ID No.9的核苷酸序列的互补序列的具有21个连续的核苷酸的核苷酸序列(所谓“反义”RNA介导的基因沉默)。
反义或有义核苷酸序列的长度可以是自大约21个核苷酸(nt)直至长度等同于靶核酸的长度(以核苷酸计)。优选地,所述反义或有义核苷酸序列的总长度为至少大约50nt,100nt,150nt,200nt,或500nt。可以预期的是,除了靶核酸的总长度以外,反义或有义核苷酸序列的总长度没有上限。不过,从实际出发(例如为了嵌合基因的稳定性),预期反义或有义核苷酸序列的长度不应该超过5000nt,特别不应该超过2500nt,且应该在大约1000nt以内。
应该可以理解的是,反义或有义核苷酸序列的总长度越长,对总反义或有义核苷酸序列与靶基因的相应的序列或其互补序列之间的序列相同性的要求也就越不严格。优选地,总反义核苷酸序列应该与相应的靶序列的互补序列具有至少大约75%的序列相同性,优选地至少大约80%,更加优选地至少大约85%,相当优选地大约90%,特别优选地大约95%,更加特别优选地大约100%,相当特别优选的是与靶核酸的相应部分的互补序列完全相同。不过,优选地所述反义或有义核苷酸序列总是包括一种大约20-21nt的序列,其与靶核酸的相应部分或其互补序列具有100%的序列相同性。优选地,为了计算序列相同性并设计相应的反义或有义序列,应当将缺口数量降至最低,特别是对于那些更短的反义或有义序列。
在本发明中,两种相关的核苷酸或氨基酸序列的“序列相同性”,以百分数表示,指的是两种最佳比对的序列中的具有相同残基的位置数目(×100)除以进行比较的位置数目。缺口是比对中的一种位置,在该处,残基出现于一条序列中但未出现于另一条中,缺口被视为不具有相同性的残基位置。通过Needleman和Wunsch算法(Needleman andWunsch 1970)进行两序列的比对。采用标准的软件程序可常规进行计算机辅助序列比对,例如采用GAP,其是Wisconsin Package Version 10.1(Genetics Computer Group,Madison,Wisconsin,USA)的一部分,使用默认评分阵列,缺口创建罚分为50,缺口延伸罚分为3。
本发明的另一种实施方式涉及用于降低产纤维植物的内源性基因表达的方法,其中所述内源性基因编码包括SEQ ID No.5或SEQ ID No.6或SEQ ID No.7或SEQ ID No.8或的氨基酸序列的蛋白质或其变体,该方法使用置于植物中可表达的启动子控制下的DNA区域,该区域的转录产生所谓双链RNA分子,其既包括有义序列又包括反义序列,能够形成如WO 99/53050所述的双链RNA分子(其全部内容在此并入作为参考)。
因此,在本发明的一个方面,可给植物细胞提供一种嵌合基因,其包括可操纵地连接于一种DNA区域的在植物中可表达的启动子,该DNA区域包含编码区域的一部分,所述部分包含来自编码具有SEQ IDNo.5、6、7或8的氨基酸序列的蛋白质的核酸的编码区域的至少20或21个连续的核苷酸(所谓有义部分),以及一种DNA序列,所述DNA序列至少包含该有义部分的至少20或21个核苷酸的互补DNA序列,但其能够完全互补于该有义部分(所谓反义部分)。所述嵌合基因可包含额外的区域,例如在植物中具有转录终止和聚腺苷酸化功能的区域。转录后可产生一种RNA,其可在有义和反义区域的互补部分之间形成一种双链RNA茎。在有义和反义核苷酸序列之间可存在间隔区域。所述嵌合基因可进一步包含内含子序列,优选地位于间隔区域内部。
在本发明的另一个实施方式涉及一种用于降低产纤维植物的内源性基因的表达的嵌合基因,其中所述内源性基因编码一种包括SEQ ID No.5或SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质或其变体,所述变体具有相同的功能或酶活性,所述嵌合基因编码一种核酶,其识别并裂解一种RNA,所述RNA具有编码一种包括SEQ IDNo.5或SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质或其变体的RNA的核苷酸序列。在另一个实施方式中,所述核酶识别并裂解一种RNA,所述RNA具有包括SEQ ID No.1、2、3或4的核苷酸序列的RNA的核苷酸序列。Haseloff和Gerlach(1988)已经公开了设计并使用核酶的方法,也参见WO 89/05852。
显然,只要RNA分子的核苷酸序列是通引用相应的DNA分子的核苷酸序列进行限定时,核苷酸序列中的胸腺嘧啶(T)均应替换为尿嘧啶(U)。引用的究竟是RNA或DNA分子可从本申请的上下文进行判断。在本发明的另一个实施方式中,提供了核酸(DNA或RNA分子),其可用于改变植物中的纤维素生物合成。因此本发明提供了一种嵌合基因(DNA分子),其包含以下可操纵连接的DNA片段:
i)在所述植物的所述细胞中可表达的启动子;
ii)包含具有至少21个核苷酸的核苷酸序列的DNA区域,所述核苷酸序列选自编码包含SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质(或具有相同酶活性的该蛋白质的变体)的核苷酸序列,例如SEQ ID No.1、2、3、4或9核苷酸序列;和/或
iii)包含具有至少21个核苷酸的核苷酸序列的DNA区域,所述核苷酸序列选自编码包含SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质(或具有相同酶活性的该蛋白质的变体)的核苷酸序列的互补序列,例如SEQ ID No.1、2、3、4或9核苷酸序列;和
iv)参与转录终止和聚腺苷酸化的3’端区域。
本发明还提供了可自本发明的嵌合基因获得的RNA分子。此类RNA分子通过在体内或体外对所述嵌合基因进行转录产生。它们可以通过体外转录而获得,其中转录区域置于一种启动子的控制下,该启动子被单亚基RNA聚合酶识别,其选自噬菌体例如SP6、T3或T7。或者,可通过本领域已知的方法在体外合成所述RNA分子。此外,本领域已知可对RNA核糖核苷骨架进行化学修饰以使得嵌合RNA分子更加稳定。
以上已经根据所提供的方法描述了嵌合基因或RNA分子用于改变纤维素生物合成的不同实施方式并可根据实际情况对这些实施方式进行必要的修改。
可将嵌合基因或RNA稳定地或瞬时地提供给植物细胞。典型地,可通过将嵌合基因整合入植物细胞的基因组而稳定地提供嵌合基因或RNA分子。将嵌合基因导入植物的方法是本领域已知的,并包括土壤杆菌介导的转化、微粒枪输送、微注射、对完好的细胞进行电穿孔、聚乙二醇介导的原生质体转化、原生质体电穿孔、脂质体介导的转化、硅晶须介导的转化等等。以这些方式获得的转化的细胞可再生为成熟的能够繁殖的植物。
在另一个实施方式中,本发明的嵌合基因或嵌合RNA分子可提供在一种DNA或RNA分子,所述DNA或RNA分子能够在植物细胞中自主复制,例如病毒载体。也可将本发明的嵌合基因或RNA分子瞬时地提供给植物的细胞。
本发明的目的还在于提供含有本发明的嵌合基因或RNA分子的植物细胞和植物。通过常规育种方法产生的那些包含本发明的嵌合基因的植物的配子、种子、胚芽、合子或体细胞、后代或杂合体,也属于本发明的范围内。
本发明的方法和手段特别适合用于棉属植物(陆地棉和海岛棉均可),特别适于Coker 312、Coker310、Coker 5Acala SJ-5、GSC25110、FiberMax819、FiberMax832、FiberMax966、FiberMax958、FiberMax989、FiberMax5024(和具有除草剂或害虫抗性性状的FiberMax转基因变种)Siokra 1-3、T25、GSA75、Acala SJ2、AcalaSJ4、Acala SJ5、Acala SJ-C1、Acala B1644、Acala B1654-26、AcalaB1654-43、Acala B3991、Acala GC356、Acala GC510、Acala GAM1、Acala C1、Acala Royale、Acala Maxxa、Acala Prema、Acala B638、Acala B1810、Acala B2724、Acala B4894、Acala B5002、非 Acala“picker”Siokra、“stripper”变种FC2017、Coker 315、STONEVILLE506、STONEVILLE 825、DP50、DP61、DP90、DP77、DES119、McN235、HBX87、HBX191、HBX107、FC 3027、CHEMBRED A1、CHEMBRED A2、CHEMBRED A3、CHEMBRED A4、CHEMBREDB1、CHEMBRED B2、CHEMBRED B3、CHEMBRED C1、CHEMBRED C2、CHEMBRED C3、CHEMBRED C4、PAYMASTER145、HS26、HS46、SICALA、PIMA S6和ORO BLANCO PIMA。FiberMax是Australia of Cotton Seed Distributors Pty Ltd.的注册商标。
不过,这些方法和手段也可用于其他植物品种,例如大麻、黄麻、亚麻和木本植物,包括但不限于松(Pinus spp.)、杨(Populus spp.)、杉(Picea spp)和桉,等等。
在另一个实施方式中,提供了一种用于在一种特定植物品种(优选地为一种产纤维植物品种)的不同基因型或变种的群体中鉴别编码参与纤维素生物合成的蛋白质的基因的等位基因变异的方法,所述等位基因变异单独地或者组合地与纤维素生产的数量和/品质以及纤维生产有关联。所述方法包括以下步骤:
a)提供包含不同等位基因形式的编码包括SEQ ID No.5、6、7或8的氨基酸序列的蛋白质的核苷酸序列的特定植物品种或杂种繁殖植物品种的不同变种或基因型的群体。可采用本申请其他部分所述的方法来鉴别所述不同等位基因形式。优选地提供一种分异群体(segregatingpopulation),其中存在编码参与纤维素生物合成的蛋白质的基因的不同组合的等位基因变异。用于产生分异群体的方法是植物育种领域所熟知的。
b)确定所述群体中每一个个体的与纤维生产和/或纤维素生物合成有关的参数;
c)确定所述群体中每一个个体是否存在一种特定等位基因形式的编码包括SEQ ID No.5、6、7或8的氨基酸序列的蛋白质的核苷酸序列;以及
d)将出现特定的纤维或纤维素参数与是否存在一种特定等位基因形式的所述核苷酸序列或此类等位基因形式的一种特定组合相关联。
根据得到的信息可筛选那些对纤维素生物合成或纤维生产具有所需影响的等位基因。根据得到的信息,可通过使用常规的分子生物学技术来确定是否存在等位基因的形式,由此加快育种程序以分离或产生具有特定纤维或纤维素特征的变种或加快回交程序。本领域已知用于确定多倍体植物的等位基因形式的方法,包括例如Denaturing High-Performance Liquid Chromatography(DHPLC;Underhill等,1997Genome Research 7:996-1005)。显然,在育种或回交程序中,不仅等位基因本身的序列可用于确定其存在与否,也可使用那些与所需等位基因邻近的核苷酸序列,优选地与其直接邻近的并且是连续的核苷酸序列,这些序列只能在有丝分裂过程中通过重组在有丝分裂过程中以低频率自所述等位基因中分离。
在此,“杂种繁殖植物品种”是一种品种,其能够与产纤维植物例如棉属植物杂交(包括使用杂交等技术)并能够产生子代植物。杂种繁殖植物品种可包括所述产纤维植物的野生亲缘植物。通常,对于棉属植物,指的是在海岛棉与陆地棉(G.hirsitum)之间进行杂交的杂种繁殖和在两株海岛棉或两株陆地棉(G.hirsitum)亲代之间进行杂交的种内杂交。
以下非限制性实例描述了用于在产纤维植物中调节纤维素生物合成的方法和手段。除非在实施例中另有说明,所有重组DNA技术均按照以下出版物所述的标准技术进行:Sambrook等,(1989)MolecularCloning:A Laboratory Manual,Second Edition,Cold Spring HarborLaboratory Press,NY和Ausubel等,(1994)Current Protocols in MolecularBiology,Current Protocols,USA(Volumes 1和2)。用于植物分子研究的标准材料和方法见R.D.D.Croy的Plant Molecular Biology Labfax(1993),由BIOS Scientific Publications Ltd(UK)和Blackwell ScientificPublications,UK联合出版。其他标准分子生物学技术的参考书包括Sambrook和Russell(2001)Molecular Cloning:A Laboratory Manual,ThirdEdition,Cold Spring Harbor Laboratory Press,NY,Volumes I and II ofBrown(1998)Molecular Biology LabFax,Second Edition,Academic Press(UK)。用于聚合酶链反应的标准材料和方法可参见:Dieffenbach和Dveksler(1995)PCR Primer:A Laboratory Manual,Cold Spring HarborLaboratory Press,以及McPherson等,(2000)PCR-Basics:FromBackground to Bench,First Edition,Springer Verlag,Germany。
在整个说明书和实施例中,引用了以下序列:
SEQ ID No.1:鼠耳芥属核苷酸序列rsw2(基因组;保藏号At5g4970)
SEQ ID No.2:棉属植物核苷酸序列rsw2(cDNA)
SEQ ID No.3:鼠耳芥属核苷酸序列rsw3(基因组)
SEQ ID No.4:棉属植物核苷酸序列rsw3(相应于3′端;cDNA)
SEQ ID No.5:鼠耳芥属氨基酸序列rsw2
SEQ ID No.6:棉属植物氨基酸序列rsw2
SEQ ID No.7:鼠耳芥属氨基酸序列rsw3
SEQ ID No.8:棉属植物氨基酸序列rsw3(部分)
SEQ ID No.9:鼠耳芥属核苷酸序列rsw2(cDNA)
SEQ ID No.10:寡核苷酸PCR引物(正向rsw2棉属植物)
SEQ ID No.11:寡核苷酸PCR引物(反向rsw2棉属植物)
SEQ ID No.12:寡核苷酸PCR引物(正向LFY3)
SEQ ID No.13:寡核苷酸PCR引物(反向LFY3)
SEQ ID No.14:寡核苷酸PCR引物(正向MBK5/α)
SEQ ID No.15:寡核苷酸PCR引物(反向MBK5/α)
SEQ ID No.16:寡核苷酸PCR引物(在葡糖苷酶IIα正向)
SEQ ID No.17:寡核苷酸PCR引物(在葡糖苷酶IIα反向)10
SEQ ID No.18:寡核苷酸PCR引物(在葡糖苷酶IIβ正向)
SEQ ID No.19:寡核苷酸PCR引物(在葡糖苷酶IIβ反向)
SEQ ID No.20:寡核苷酸PCR引物(分离的基因组拷贝RSW3的正向引物)
SEQ ID No.21:寡核苷酸PCR引物(分离的基因组拷贝RSW3的反向引物)
SEQ ID No.22:寡核苷酸PCR引物(正向RWS3同源物棉属植物)
SEQ ID No.23:寡核苷酸PCR引物(反向RSW3同源物棉属植物)。
实施例
实施例1.分离GhKOR基因(与鼠耳芥属的rsw2突变相应的棉属植物基因)的全长cDNA
NCBI EST数据库中有7个来自Gossypium arboreum 7-10dpa(花期后天数(days post anthesis))纤维文库的EST,其均与AtKOR1的序列具有相似性。上述7个EST中的5个序列是相同的。将三种不同的棉属植物EST与AtKOR1 cDNA进行比对发现,棉属植物克隆AW726657含有ATG起始密码子和47bp的5’非翻译区。克隆BE052640跨越了KOR基因的中部区域并与克隆AW668085重叠,AW668085在与AtKOR1相同的位置上含有TGA终止密码子并含有126bp的3’非翻译序列。ORF的翻译显示出与AtKOR1蛋白的区域具有>80%的氨基酸序列相同性。使用了针对G.arboreum EST的5’和3’非翻译区而设计的引物,自来自陆地棉栽培品种Siokra 1-4的18dpa纤维cDNA文库,扩增一1.9kb PCR产物。正向引物是5’-CCGCTCGAGCGGGCATTTTCCGCCCACTA-3’(SEQ ID No.10),且反向引物是5’-CGGGATCCCGTCACACATGGACAGAAGAA-3’(SEQ ID No.11)。通过对来自陆地棉的18dpa纤维的棉属植物cDNA文库进行PCR而产生棉属植物KOR基因的全长cDNA,并对若干个扩增的产物进行测序(SEQ ID No.2)。encoded a protein该cDNA编码一种具有619个氨基酸(SEQ ID No.6)的蛋白质(GhKOR),其与LeCel3(86%氨基酸相同性)、AtKOR1(82%氨基酸相同性)和BnCel16(82%相同性)高度相似(图1)。所有蛋白质共同具有:参与将AtKOR1导向细胞板的极化导向基序(Zuo等,2000);靠近N端的推定的跨膜区域;作为与糖苷水解酶家族9具有强相似性部分的4个可能参与催化作用的保守残基(Asp-198、Asp-201、His-516和E-555;Nicol等,1998);富Pro的且具有内-1,4-β-D-葡聚糖酶家族的膜结合成员的特征的C端区域;位于N端结构域的8个推定的N-糖基化位点(Asn-X-Ser/Thr),推测其在糖基化过程中位于ER内腔(仅存在于GhKOR的一个额外的位点(14-16位残基)可能朝向细胞浆)。
实施例2.以GhKOR互补鼠耳芥属rsw2-1变体
按照以下方式将编码GhKOR的棉属植物PCR产物克隆到CaMV35S启动子之后:正向引物插入XhoI位点(下划线标出),反向引物插入BamHI位点(下划线标出),使得扩增的1.9kb片段连接于载体pART7的合适位点(Gleave,1992)。这将cDNA以有义方向置于花椰菜花叶病毒(cauliflower mosaic virus)35S启动子之后。通过以NotI消化将完整的表达盒分离并克隆入双载体pART27的相应位点。对扩增产物进行测序以确认其相同性。将这一构建体导入根癌突然杆菌(Agrobacteriumtumefaciens)菌株AGL1并用于通过植物浸渍法(floral dipping)转化rsw2-1突变体和野生型Columbia(Clough和Bent,1998)。
在含有卡那霉素(50μg ml-1)和替卡西林钠/克拉维酸钾(100μg ml-1)的Hoagland培养皿上选择卡那霉素抗性转化体,转移至不含选择物的垂直Hoagland培养皿,在29℃放置2天后筛选根膨胀的转化体。自10株显示野生型表型的单独的T1植株收集T2种子并在T2代检验互补表型的遗传性。对在29℃暴露2天的卡那霉素抗性纯合子T3幼苗的根进行拍照。其他植物在21℃生长于罐中直至开始抽苔,切下抽苔,然后转移至29℃并再生次生抽苔,成熟后拍照。rsw2-1与Columbia在At5g49720具有一个单一核苷酸的改变,其在AtKOR1中将Gly-429取代为Arg,并产生了一种易受温度影响的表型(Baskin等,1992;Lane等,2001)。植物或生长于罐中(泥炭∶堆肥∶沙子的1∶1∶1混合)或无菌生长于培养皿中(MS或Hoagland的培养基加上琼脂)(Bum等,2002a)。除非特别说明,否则生长箱提供100μmol m-2s-1的连续光照,21℃。rsw2突变体的根显示出易受温度影响的径向膨胀(Baskin等,1992),而茎显示出对延长所产生的易受温度影响的抑制(Lane等,2001)。
75株卡那霉素抗性T1幼苗中的63株的根在29℃两天后没有出现膨胀。野生型表型稳定地异常给T3代,并且根(图2A)和茎(图2B)在限制性温度条件下正常延长。卡那霉素抗性纯合子T3植株的茎生长在数量上与野生型没有区别。因此鉴别了一种编码AtKOR1的棉属植物同源物的基因,且发现其能够在rsw2-1纤维素合成突变体中在功能上替代鼠耳芥属基因。
这将涉及在鼠耳芥属中纠正减数分裂和细胞延伸的缺陷的GhKOR(Nicol等,1998;Zuo等,2000;Lane等,2001;Sato等,2001)以及与纤维素合成机制的其他元件和/或产物的适当相互作用。以前的研究鉴别了一种棉纤维蛋白质,其在免疫学上与LeCel3相关(Peng等,2001),而间接证据提示其参与棉属植物纤维膜在体外合成纤维素(Peng等,2002)。与LeCel3、BnCel16和AtKOR1的相似性包括所有已知的具有重要功能的主要特征以及那些目前尚不清楚其功能的特征,例如富Pro C端。内-1,4-β-D-葡聚糖酶在纤维素合成中的作用尚不清楚,不过可能涉及自脂质结合的引物或供体切断尚未结晶的葡聚糖(Williamson等,2001;Peng等,2002)。
实施例3:鉴定并分离鼠耳芥的rsw3突变体中发生突变的基因
rsw3等位基因是一种基于图谱策略而鉴定的单一孟德尔隐性基因座(Baskin等,1992)。rsw3与可见标记物品系W9杂交产生的F2子代将RSW3与第5号染色体短臂上的yi相连锁。对rsw3(Columbia背景)与Landsberg erecta ecotype杂交产生的F2群体进行筛选以产生具有根膨胀表型的植物。使用FastDNA试剂盒(BIO 101,Carlsbad,CA)自每株植物的2-3个莲座叶制备DNA,并使用LFY3(正向引物5’-GACGGCGTCTAGAAGATTC-3’(SEQ ID No.12),反向5’-TAACTTATCGGGCTTCTGC-3’;SEQ ID No.13;以RsaI裂解)和MBK5/α(正向5‘-CCCTCGCTTGGTACAAGGTAT-3’(SEQ ID No.14)和反向5’-TCCTGATCCTCTCACCACGTA-3’(SEQ ID No.15)绘制图谱。使用来自Landsberg erecta ecotype的杂交种的F2,将RSW3绘制于距离LFY3基因座的6cM处(4out of 70条染色体中有4条显示出杂交事件),因此将RSW3定位于yi和LFY3之间。对另外372条染色体的分析鉴定了MBK5/α和rsw3之间的一个重组事件,概念性图谱距离为0.27cM。在rsw3中对该区域中的若干该候选基因进行测序。P1克隆mgi19(AB007646)上的一个基因(At5g63840),编码一种葡糖苷酶II的推定的催化亚基,而rsw3等位基因显示具有一个T到C的取代,根据推定,其造成蛋白质中的Ser599被取代为Phe(野生型RSW3基因的核苷酸序列如SEQ ID No.3所示,所编码的蛋白质的氨基酸序列如SEQ ID No.7所示)。
RSW3序列与糖苷水解酶家族31(Henrissat,1991;Henrissat andBairoch,1993)的序列从大约150位残基起具有很高的相似性。Monroe等通过针对与α-葡糖苷酶的相似性对鼠耳芥属EST进行搜索鉴定了RSW3葡糖苷酶II基因,并将其命名为Aglu-3(Monroe等,1999)。其蛋白产物形成若干葡糖苷酶II的进化枝,这些葡糖苷酶II的催化活性均分别是已知的。它们均与鼠耳芥属的质外体α-糖苷酶分开,而Aglu-3/RSW3与之仅具有8%的序列相同性。图4显示了含有Aglu3/RSW3的进化枝的两种特征基序,其被认为包括催化残基和底物结合残基。Aglu3/RSW3含有这些基序中的所有保守残基,以及被认为是催化残基的Asp512和Asp617(Frandsen和Svensson,1998)。Ser599在rsw3中是突变的,其可能在功能上是重要的,因为其在来自小鼠(NP 032086)、人类(NP 055425)、猪(AAB49757)、粘液菌(AAB18921)、马铃薯(P07391)和棉属植物(见下)的同源基因产物中均是保守的,而且在相关性更加遥远的由鼠耳芥属基因Aglu-1和Aglu-2(Monroe等,1999)编码的质外体α-葡糖苷酶中也是保守的。鼠耳芥属Aglu-3/RSW3基因似乎是单拷贝的,其跨越3.84kb,具有5个内含子,并编码一种推定的2766bp转录产物,推定的翻译产物为104kDa。
最近的生物化学(Trombetta等,1996)和遗传学研究(D’Alessio等,1999;Pelletier等,2000)提示,天然的哺乳动物和酵母的葡糖苷酶II由一种催化性α-链(Aglu-3/RSW3与之同源)和一种较小的非催化性β-链组成,其在ER中保持异二聚体形式。为了确定鼠耳芥属是否含有β-亚基的直向同源物,以小鼠β-亚基进行对NCBI数据库进行BLAST搜索。未知蛋白At5g56360(来自染色体5的P1克隆MCD7(AB009049)上的蛋白质MCD7.9)与小鼠β-亚基具有27%的氨基酸相同性和42%的相似性。水稻染色体1存在一种密切相关的序列(Genbank BAA88186),但其中的终止密码子使其在496位残基后截断。理论上对PAC克隆P0038F12(AP000836)的邻近3’序列进行翻译并考虑到推测的剪切位点,提示其可能编码一种全长的β-亚基,后者与鼠耳芥属基因产物非常相似。该基因产物的推测的序列可得到与推测的外显子相匹配的一种EST(AU030896)的支持。因此图5包括我们对该全长水稻蛋白质的建议。鼠耳芥属、水稻、小鼠和Schizosaccharomyces pombe序列共同具有:C端的HDEL ER-保留信号;其N端的推定的前导序列;富半胱氨酸的N端区域;C端位于HDEL序列前方的MHR(甘露糖受体同源区域)(Munro,2001);预期可能形成螺旋卷曲的序列富含酸性残基且两侧具有在程序(“螺旋”和“Paircoil”)中得到高分的区域的中心区域(Berger等,1995;Lupas等,1991)。
Munro(2001)将MRH结构域与碳水化合物识别联系起来。其包含一种与阳离子依赖性甘露糖-6-磷酸受体具有相似性的区域,所述受体的结晶结构是已知的。至关重要的保守部分(图5)包括形成3个二硫键(不过S.pombe蛋白质缺乏半胱氨酸1和2)的6个Cys残基、半胱氨酸5和6之间的底物识别环以及参与配体结合的Y和R残基(Roberts等,1998)。小鼠α和β亚基之间的相互作用被绘制于β-亚基N端第118位残基(其在所有序列中均相当保守)至273-400位残基(其不具有保守性)(Arendt和Ostergaard,2000)。不过,图5显示所有序列均具有高百分比的酸性氨基酸。
如下所述采用RT-PCR分析编码α-和β-亚基的基因的表达。按照生产商的说明,以RQ1无Rnase的Dnase(Promega,Madison,WI)处理RNA(Parcy等,1994)。针对鼠耳芥属葡糖苷酶II的α-和β-亚基编码区域的3’末端设计PCR引物:
α-正向5’-CGTAGTGGTCTACTGGTTCAA-3’(SEQ ID No.16),
α-反向5’-TGAGCTGTGTCCCAAGAGGAT-3’(SEQ ID No.17),
β-正向5’-GGTGATGAGGATACCAGCGAT-3’(SEQ ID No.18),
β-反向5’-CCCACTCCCTAACCGGAGTTT-3’(SEQ ID No.19)。
每一种引物跨越一个内含子,如此可将RT-PCR产物与基因组DNA和mRNA区别开(对于α-亚基为724bp对452bp,对于β-亚基为996对474)。按照生产商的说明,使用Gibco BRL Superscript一步法RT-PCR试剂盒进行RT-PCR,RT-PCR循环为48℃-45分钟,94℃ 2分钟,(94℃/30秒,54℃/1分钟,68℃/2分钟)×45次,72℃/7分钟。
RT-PCR在所有测试的鼠耳芥属组织中检测到编码α-和β-亚基的基因的表达(图6),但是,在所采用的条件下,无法明确说明相对表达水平。鼠耳芥属的低数目的EST(α-亚基13个,β-亚基4个)说明两种均非高表达(作为对照,在相似的搜索中,一种参与纤维素合成的糖基转移酶AtCesA1/RSW1检测到40个EST)。
实施例4:通过鼠耳芥属基因的基因组拷贝互补rsw3突变
通过PCR扩增BAC F20A11产生了包括830bp的启动子区域的基因组拷贝的葡糖苷酶II的α-亚基,使用了正向引物:5’-CCGCTCGAGCGGTTTCACTCACAACTGTGGTCTCT-3’(SEQ ID No.20)和反向引物:5’-CCGCTCGAGCGGTCTCCTAAGTCCTAACCCCATA-3’(SEQ ID No.21)。两种引物均包括XhoI位点(下划线标出),其能够使得扩增的5.8kb片段连接至双载体pBin19中的SalI位点。扩增产物具有不同于基因组序列的单碱基对改变(C至T)。这造成以Leu替代Ser 142(该残基在马铃薯中是保守的,但在其他物种则不是)(图4),但并不削弱该片段互补rsw3的能力。将该构建体导入根癌土壤杆菌菌株AGL1并用于通过植物浸渍法转化rsw3突变体(Clough和Bent,1998)。在21℃在含有卡那霉素(50μg ml-1)和替卡西林钠/克拉维酸钾(100μg ml-1)的Hoaglands培养皿中选择卡那霉素抗性转化体。将幼苗转移至垂直Hoagland培养皿并在30℃放置2天以筛选根部膨胀。当在21℃生长5天且随后在30℃生长2天后,卡那霉素抗性T1子代显示出野生型的根(图3a)。花序表型(见后)也被互补。
rsw3与标记的突变体SGT5691(Parinov等,1999)之间的杂交种提供了另一个证据,突变体SGT5691在编码推定的糖苷酶II的基因的第一个启动子中含有Ds元件。其大概代表一种无效等位基因,且该突变具有纯合子致死性,因此采用看起来为野生型的半合子植物进行杂交。Ds元件中的NPTII基因将卡那霉素抗性赋予接受来自SGT5691的标记等位基因的F1植物。所有卡那霉素抗性F1幼苗(含有无效等位基因和对温度敏感的等位基因)的根在21℃看起来为野生型,但在30℃发生膨胀(图3b)。这证实Ds插入突变体与EMS产生的突变体rsw3是等位基因,且葡糖苷酶II缺陷引起径向膨胀。
实施例5:对鼠耳芥属中与rsw3突变相关的其他表型的研究
rsw3在容许温度21℃象野生型一样生长,当转移至30℃时幼苗根部膨胀。根部膨出的细胞(Baskin等,1992)通常位于根须的基部,提示RSW3在根须发育的早期阶段具有作用。仅当在48小时之内将膨胀的初生根放回容许温度时其才能重新开始延长,但根可继续产生侧根(图7a)。这些侧根的原基在转移至31℃时是不可见的,侧根先延长数毫米,随之膨胀并停止生长。因此,成熟的生长性植株的根系短小而呈高度分支状(图7b)。双重纤维素缺陷突变体rsw1-rsw3处于限制性温度24小时仅出现根尖膨胀,但由于处于这种高温条件下任何更长的时间均导致死亡,因此在限制性温度条件下24小时后膨胀过程可能已经减弱。
对于在黑暗中生长的胚轴的表型,rsw3的表型明显若于rsw1-1和rsw2-1,而rsw1-1rsw3的表型若于rsw1-1rsw2-1(图7c)。rsw3在光亮处的花结生长被强烈抑制,且许多细小的叶片堆积成密集的垫状,其中无法识别规则的叶序(图7d-f)。野生型叶片的复杂铺路石细胞形状(图7g)在rsw3中变得简单,气孔自叶片表面突出,且一些毛状体似乎要破裂(图5h)。一些拥挤的花结开始出现细小的花序(图7d),不过它们出现得明显迟于野生型花序(对于生长于琼脂中的植物来说为28.6士0.5天对15.5+0.17天;平均值+SE,n=98(rsw3),n=45(野生型))。细小的rsw3花序上的少数花朵是基本上完全长大的,不过花药丝、雌蕊和萼片轻度缩短,而芽在柱头受精之前提早开放(与下文中讨论的图8e、f所示的来自土壤中生长的rsw3植株的芽相似)。
为了研究突变对茎生长的直接作用,将野生型和rsw3在21℃生长于土壤中,这样随后花序的发育将不会受限于仅提供很少光合作用产物的小花结。在这样的条件下,rsw3的花结与野生型十分相似,而再生性生长按照正常时间开始。
切下初生抽苔并在21℃或30℃再生次生抽苔(图6a,b)。rsw3和rsw1-1在21℃的再生遵循略微呈S形的曲线,相对于野生型来说在生长速度和最终高度上没有统计学显著意义的降低。Rsw1-1rsw3的速度和最终高度显示出明显的下降。不过,在30℃,rsw3在少数几天中的生长速度与野生型相似,但其在大约第5天停止延长,而野生型则继续延长到第16天,rsw1-1甚至更长(图8b)。rsw1-1rsw2(Lane等,2001)在30℃没有再生出次生抽苔,而rsw1-1rsw3仅生长大约35mm(图8b),产生极少的花且没有产生种子。
我们每天测量了茎生长的增量以及外层细胞的长度,外层细胞在抽苔生长至大约一半时已经离开延长区(表1)。这样可以估计那时的细胞流(cell flux)(每天离开延长区的细胞数),因为每天的生长增量=细胞长度x细胞流。生长于21℃的rsw3的细胞流或细胞长度没有明显降低。rsw1-1rsw3在21℃的组成型表型完全归因于细胞长度的降低。在30℃,相对于野生型来说,rsw1-1的细胞长度降低57%,而细胞流降低35%。
此类分析要求植物的生长速度、延长区的长度等等均处于近乎稳定的状态。不过,当rsw3和rsw1-1rsw3的延长迅速减慢的时候条件远远达不到稳定状态,因此无法准确推算出这些表型的细胞流。为了至少对在生长减慢的情况下细胞长度是如何改变有一些了解,我们在rsw3茎的大约80mm高度处测量了细胞长度(图8b显示,当这些细胞离开延长区时,茎应该是接近其生长期的终点,这是因为根据那时生长区的长度,那时总的植物高度应该已经超过80mm;野生型为40mm,根据Fukaki等,1996)。即便如此,rsw3的细胞也仅仅略微短于野生型(表1),提示对于茎延长减慢来说,细胞产生速度的下降可能比细胞体积增加的降低更加重要。相反,当茎的延长正在减慢时,我们对30mm的rsw1-1rsw3茎进行取样用于细胞成熟(图8b),细胞的长度降低了57%(表1)。这与双突变体中存在rsw1-1使得平衡明显向细胞长度降低倾斜是一致的。
在一个更加简单的系统中验证了有关细胞分裂和细胞体积增加的结论,该系统使用冷扫描电子显微镜来检测受精的柱头的花中的雄蕊花丝(表2)。结果是相似的:rsw3植物同样显示出细胞数量减少较细胞长度降低的比例更高,并且双突变体rsw1-1rsw3显示出细胞长度进一步下降但细胞数量没有额外降低。Rsw1-1显示出细胞长度较细胞数量降低更明显(表2)。在30℃进行再生的野生型和rsw3的茎在第一朵花开花前均达到大约相同的高度,即使它们的最终高度可能差别很大(图8b)。野生型茎在延长停止前产生大约27朵具有良好间隔的花,但rsw3在延长停止前仅产生大约6朵间隔紧密的花,留下一个花簇(图8c,d)。rsw3花芽在柱头受精之前即提早开花(图8e,f)。
在持续生长于其限制性温度的rsw3植物的细小的抽苔形成很少的花且没有形成种子(图7d)。对于那些在21℃具有完全生长力的植物,既便是在31℃形成的大得多的抽苔上的花(图8d,f)也仅仅拮出极少的种子。这种种子(图8g,h)是皱缩的(可能是由于种子贮存蛋白质积聚量降低所致;Boisson等,2001),其表面缺乏生长于30℃的野生型或生长于21℃的rsw3所具有的规则的细胞结构,并且其在吸涨后仅有极少的分泌黏液(图8i-n)。黏液分泌减少在纤维素缺陷体突变体是不常见的:rsw1-1(CesA1糖基转移酶缺陷;图8k,l)和rsw2-1(KOR内-1,4β-葡聚糖酶缺陷)具有正常的黏液层。
为了将在单倍体阶段对花粉和胚珠发育的影响与在二倍体阶段的影响分开,我们检测了半合子Ds突变体SGT5691(一种推定的葡糖苷酶II催化亚基的无效等位基因)中的结籽(seed set)。通过自身传粉,结籽分异为147个卡那霉素抗性个体和153个敏感个体。显性纯合子致死性等位基因的比例低于预期的2∶1,说明无效等位基因影响的是花粉和/或胚珠的有丝分裂后发育。通过在半合子标记突变体与Landsberg erecta(该突变体的适当的野生型)之间进行交互杂交,我们区分了对雄性和雌性途径的影响。如果花粉或胚珠的发育不受影响,则卡那霉素抗性和敏感性植株将按照1∶1分异,如果无效等位基因降低花粉或胚珠的繁殖力,则比例会降低。来自Ds标记突变体的花粉产生的分异比例为1∶16(6个抗性个体,94个敏感性个体),提示Ds标记花粉产生有活力的种子的能力下降了94%(相对于野生型)。这与当Ds标记胚珠与野生型花粉杂交时的41%的降低程度(比例为1∶1.7,37∶63个体)相当。因此,葡糖苷酶II的无效等位基因对花粉发育单倍体阶段的影响比其对有丝分裂后胚珠的影响要严重得多。
生长于31℃的rsw3的7天幼苗的根仅含有野生型纤维素的51%(表示为每毫克组织干重),该数字与CesA1糖基转移酶(rsw1-1)和KOR内-1,4-β-葡聚糖酶(rsw2-1)中的单个氨基酸取代(Peng等,2000)产生的结果相当。形态学改变提示所有三种基因对于在初生细胞壁中产生纤维素均是必需的。
rsw3幼苗中来源于高尔基体的非纤维素多糖的产生基本没有改变(Peng等,2000)。对纤维素产生的选择性与在葡糖苷酶I缺陷(Gillmor等,2002)中所见的情况相当,葡糖苷酶I产生用于葡糖苷酶II加工的最初底物。其超出了在鼠耳芥属的胚芽致死性cyt1突变体(甘露糖-1-磷酸guanylyltransferase缺陷)(Lukowitz等,2001)和通过反义下调MALl(编码葡糖苷酶IIα-亚基)的马铃薯(Taylor等,2000a)中所见的选择性,其中非纤维素多糖和木质素出现复杂的改变。因此,我们认为,与高尔基体中的非纤维素多糖的合成相比,纤维素合成对N-聚糖加工缺陷的敏感性要高得多。
rsw3中的来源于高尔基体的种子黏液的分泌明显减少,但rsw1-1或rsw2-1则没有明显减少。黏液能够产生,但保留在细胞内(可能是因为纤维素缺陷引起的结构改变),或者黏液产生本身可能是减少的。许多发育阻滞减少黏液的产生(Western等,2001;Western等,2000),不过我们还不能排除这样一种可能性,即rsw3具有产生组成黏液的特定非纤维素多糖所需的高尔基体酶的缺陷型加工。
雄蕊花丝中的细胞数量和体积提示,rsw3对细胞分离的影响强于对细胞体积增加的影响。茎的细胞长度数据与这一发现相符和。rsw3对细胞分裂的显著影响可以解释为何其表型在黑暗中生长的缺少细胞分裂的胚轴中相当弱(Gendreau等,1997)。由于对细胞分裂的影响明显强于对细胞体积增加的影响,rsw3更加类似于rsw2-1(Bum等,2002)而不是rsw1-1(Burn等,2002)或携带针对RSW1/CesA1或CesA3的反义构建体的植物(Bum等,2002),后者对细胞长度的影响更加严重(尽管CesA1的改变对分裂速度没有什么影响,但CesA1可能在分裂的根细胞中表达,因为当rsw1-1处于其限制性温度时,细胞的细胞壁超微结构有改变(Sugimoto等,2001)并膨胀(Baskin等,1992;Beemster和Baskin,1998))。
尽管rsw3中的纤维素生物合成被削弱,但rsw3影响纤维素合成的机制还不清楚。正如在葡糖苷酶I突变(Boisson等,2001)中所注意到的,无法在高尔基体中组装成熟N-联聚糖的突变体显示出最低程度的表型(von Schaewen等,1993),这说明关键蛋白质缺少成熟N-联聚糖并不会引起明显的糖苷酶II缺陷时所见的表型。Glc1Man9GlcNAc2和Man9GlcNAc2产生速度的降低可能会同时减慢糖蛋白/分子伴侣复合体的形成和解离,于是产生一个瓶颈,可降低分泌途径上其他位点的糖蛋白的稳态水平。由于糖蛋白参与多种植物过程,因此还不清楚为何与例如非纤维素多糖合成相比,纤维素合成对ER中的加工缺陷要敏感得多。
当Gillmor等(2002)没有检测到knopf(葡糖苷酶I缺陷)发生SDS-PAGE迁移改变或改变N-糖苷酶F处理时,且当他们没有在knopf中观察到采用非定量免疫染色可观察到的CesA丰度的改变时,他们认为CesA蛋白不发生糖基化。KOR内-1,4-β-葡聚糖酶是一种更好的候选物。当在巴斯德毕赤氏酵母(pichia pastoris)中异源性表达时,KOR的欧洲油菜直向同源物的一种可溶性片段发生大量N-糖基化,且这种N-聚糖对于其体外活性是必需的(Molhoi等,2001)。rsw3和rsw2-1表型对细胞分裂的影响比对细胞体积增加的影响更大,而rsw1-1表型正好与此相反,这进一步提供了证据,支持KOR是一种靶向。
rsw1-1和rsw2-1突变影响了编码那些可能直接参与纤维素合成的质膜酶的基因,而在限制性温度条件下酶性能的改变将迅速影响纤维素合成。相反,rsw3编码一种ER内的加工酶,仅当其限制了对纤维素合成的位点的正确折叠的糖蛋白的供应时,其性能的改变才会降低纤维素合成。当将这三种突变体转移至较高温度时,会在不同的时间出现可见的表型,这似乎反映了它们不同的作用模式。径向膨胀在rsw3出现较晚(潜伏期>24h,而rsw1-1和rsw2-1<12h),而在最初的12小时内高温实际上加速了根的延长,尽管不及野生型(Baskin等,1992)。与此不同,rsw1-1或rsw2-1在最初12小时即减慢,根部明显膨胀,且rsw1-1在4小时内即出现细胞壁超微结构的改变(Sugimoto等,2001)。
已经发现,rsw3的编码推定的糖苷酶IIα-亚基的基因是突变的,已经鉴定了由两种植物基因组编码的推定的β-亚基,并发现rsw3表型的许多方面是初生细胞壁中纤维素合成降低的结果。对细胞分裂的影响似乎比对细胞体积增加的影响更大,这说明KOR内-1,4-β-葡聚糖酶(其突变同样明显影响细胞分裂)可能是受到加工缺陷影响的糖蛋白。除了在纤维素合成中的作用,葡糖苷酶II的温度敏感型等位基因也有助于对ER中的N-糖基化和质量控制的研究,并有助于确立其与其他发育和生理学过程之间的联系。
实施例6:自棉属植物中分离相应于RSW3的(部分)cDNA
使用序列RSW3搜索dbEST鉴定到一种Gossypium aboreumcDNA,具有833bp的高质量序列。使用自EST设计的引物自陆地棉cDNA的18dpa纤维文库来扩增一种700bp产物,使用了如下引物:
Cot-rsw3f 5′-CGGGATGAAGAGGATGTAGAG 3′(SEQ ID No.22)
Cot-rsq3r 5′-GAACCCCTGAGATGATCCCAA 3′(SEQ ID No.23)
将PCR产物用作探针以鉴定更长的cDNA。鉴定到5种推定的克隆并对2种进行了测序。三种克隆有部分重叠,并组装了棉属植物RSW3同源物的cDNA序列(SEQ ID No.4)。编码N端的区域是缺失的。
实施例7:在棉属植物中表达RSW2/RSW3嵌合基因
将分离自鼠耳芥属或棉属植物的相应于RSW2或RSW3的cDNA可操纵地连接于启动子如苹果青霉素启动子和参与转录终止和聚腺苷酸化的3’-端区域。
其次,将选自分离自鼠耳芥属或棉属植物的RSW2或RSW3基因的大约100bp长的片段克隆入置于启动子例如CaMV 35S启动子控制下的反向重复序列。
将嵌合基因导入进一步含有选择标记基因的T-DNA载体,并将得到的T-DNA载体导入含有辅助Ti质粒的根癌土壤杆菌菌株。使用这些土壤杆菌菌株获得转基因棉属植物。
如WO 98/00549所述进一步分析表达不同转基因拷贝的植物的细胞壁成分,例如纤维素、非晶体β-1,4葡聚糖聚合体、淀粉和碳水化合物含量。
表1.根据细胞长度以及,如果出现接近稳定的生长速度时,根据细胞流(每天离开延长区的细胞数),来分析茎延长的速度。结果表示为平均值±SE,n=5。使用T-检验分析其与野生型之间差异的统计学显著性(*=p<0.05;**=p<0.01;***=5p<0.001)。
生长速度(mm/天) | 细胞流(天-1) | 细胞长度(μm) | ||
21℃ | Columbia | 38.7±1.0 | 101±3.5 | 384±4.0 |
rsw3 | 38.4±1.4 | 95.9±4.6 | 402±7.0 | |
rsw1 | 38.9±1.6 | 102±6.9 | 382±9.8 | |
rsw1rsw3 | 30.2±1.9** | 100±7.6 | 299±8.4** | |
30℃ | Columbia | 53.8±1.2 | 133±2.7 | 404±3.2 |
rsw3 | 41.8±3.1** | 378±22 | ||
rsw1 | 15.2±1.4*** | 87.2±7.0** | 174±5.8*** | |
rswlrsw3 | 13.6±1.8*** | 173±15*** |
表2.生长于30℃的成熟雄蕊花丝中的细胞长度和数量。结果表示为平均值±SE,n>7。使用T-检验分析其与野生型之间差异的统计学显著性(*=p<0.05;**=p<0.01;***p=<0.001)。
总长度(μm) | 细胞数 | 细胞长度(μm) | |
Columbia | 2407_38 | 17.0_1.0 | 152.7_6.2 |
rsw3 | 1458_52*** | 11.4_0.3*** | 127.0_0.1** |
rsw1-1 | 1050_57*** | 15.0_0.4 | 72.7_9.8*** |
rsw1-1rsw3 | 415_41*** | 12.4_0.5*** | 29.4_2.1*** |
参考文献:
Arioli et al..(1998)Science 279:717-720.
Arendt et al.(2000)Glycobiology 10:487-492.
Baskin et al..Aust.J.Plant Phys.19:427-437.
Beemster et al.(1998)Plant Physiol,116:1515-1526.
Berger et al.(1995)Proc.Nat Acad.Sci.USA 92:8259-8263.
Boisson et al.(2001)EMBO J.,20:1010-1019.
Brada et al.(1984)Eur.J.Biochem.,141:149-156.
Brummell et al.Proc.Nat.Acad.Sci.USA 94:4794-4799.
Burn et al (2002a)Plant Physiol.129:797-807.
Clough et al.(1998)Plant J.16:735-743.
D’Alessio et al.(1999)J.Biol.Chem.,274:25899-25905.
Desprez,et al.(2002)Plant Physiol.128:482-490.
Fagard et al.(2000)Plant Cell 12:2409-2424.
Frandsen et al.(1998)Plant Mol.Biol.37:1-13
Fukaki et al.(1996)Plant Physiol.110,933-943.
Gendreau et al.(1997)Plant Physiol.,114:295-305.
Gillmor et al.(2002)J.Cell Biol.156:1003-1013.
Gleave,A.P.(1992)Plant Mol.Biol.20:1203-1207.
Helenius et al.(2001)Science 291:2364-2369.
Henrissat (1991)Biochem.J.280:309-316
Henrissat and Bairoch (1993)Biochem.J.293:781-788.
Hino and Rothman (1985)Biochemistry 24:800-805.
Kimura et a1.(1999)Plant Cell 11:2075-2085.
Lane et al.(2001)Plant Physiol.126:278-288.
Lukowitz et al.(2001)Proc.Nat.Acad.Sci.USA 98:2262-2267.
Lupas et al.(2001)Plant Physiol.127:674-684.
Mlhj et al.(2001)Plant Physiol.127:674-684.
Monroe et al.(1999)Plant Physiol.119:385-397.
Munro (2001)Curr.Biol.11:R499-501.
Murashige and Skoog (1962)Phys.Plant 15:473-497.
Nicol et al.(1998)EMBO J.17:5563-5576.
Pagant et al.(2002)Plant Cell 14:2001-2013.
Parcy et al.(1994)Plant Cell 6:1567-1582
Parinov et al.(1999)Plant Cell 11:2263-2270.
Pelletier et al.(2000)Glycobiology 10:815-827.
Peng et al.(2000)Planta 211:406-414.
Peng et al.(2002)Science 295:147-150.
Peng et al.(2001)PlantPhysiol.126:981-992.
Roberts et al.(1998)Cell 93:639-648.
Sato et al.(2001)Plant Cell Physiol 42:251-263.
Scheible et al.(2001)Proc.Nat.Acad.Sci.USA 98:10079-10084.
Silk et al.(1989)Plant Physiol.90:708-713.
Sugimoto et al.(2001)Protoplasma 215:172-183.
Taylor et al.(2000a)Plant J.24:305-316
Taylor et al.(2000)Plant Cell 12:2529-2539.
Taylor et al.(1999)Plant Cell 11:769-780.
Treml et al.(2000)Glycobiology 10:493-502.
Trombetta et al.(2001)Biochemistry 40:10717-10722.
Trombetta et al.(1996)J.Biol.Chem.271:27509-27516.
Vitale (2001)Plant Cell 13:1260-1262
von Schaewen et al.(1993)Plant Physiol 102:1109-1118.
Western et al.(2001)Plant Physiol 127:998-1011.
Western et al.(2000)Plant Physiol 122:345-355.
Williamson et al.(2001a)Protoplasma 215:116-127.
Williamson et al.(2001)Cell.Molec.Life Sci.58:1475-1490.
Zuo et al.(2000)Plant Cell 12:1137-1152.
序列表
<110>澳大利亚国立大学
<120>调节产纤维植物中纤维素生物合成的方法和手段
<130>501987/MRO
<150>US 60/432,674
<151>2002-12-12
<160>23
<170>PatentIn version 3.1
<210>1
<211>2303
<212>DNA
<213>Arabidopsis thaliana
<220>
<221>misc_feature
<222>(121)..(1986)
<223>coding RSW2
<400>1
acatttcttc acttccacac acttttactt ctttctctct tctcttctct tctccagatc 60
tgatcccaaa cctttgattc attgttgttg ttctctgctg ctttatcaga gagcatcatc 120
atgtacggaa gagatccatg gggaggtcca ttggagataa acactgcaga ttccgccacc 180
gacgatgatc gtagtcggaa tttaaacgat ttggatcgtg cggctctttc acgtccacta 240
gatgagacgc agcagagttg gttacttggt ccaacggagc agaagaagaa gaagtacgtc 300
gatctcggtt gtattatcgt tagccgcaag atcttcgtct ggactgttgg tactcttgtt 360
gccgccgcgt tactcgccgg attcattacc ttgatcgtta aaactgtgcc gcgtcatcat 420
cctaagactc cgccgccgga taattatact atagctctac acaaagctct taagttcttc 480
aatgctcaga aatctgggaa attgccaaag cataataacg tgtcatggag aggtaattct 540
gggcttcaag atgggaaagg tgaaacagga agcttctata aagatttggt gggaggttat 600
tatgatgctg gtgatgctat caagttcaat ttccccatgg cttatgctat gactatgttg 660
agctggagtg ttattgaata tagtgctaaa tacgaagctg ctggtgagct cactcatgtt 720
aaggagctta tcaaatgggg aactgattac tttctcaaga ctttcaatag tactgctgat 780
tccattgatg atcttgtgtc acaggttgga tcagggaata ctgatgatgg aaatacagat 840
cctaatgacc attactgttg gatgcgacct gaggatatgg actataaaag gcccgtgact 900
acttgtaatg gtggatgttc ggatctcgct gcagagatgg cagctgctct ggcttcagca 960
tctattgtat tcaaggataa caaggaatat tctaaaaagc ttgtccatgg tgctaaggtg 1020
gtgtatcagt ttggaaggac gaggagaggg agatatagtg caggcactgc ggaatctagc 1080
aagttctata attcaagtat gtattgggat gagttcattt ggggtggtgc ttggatgtat 1140
tatgctaccg gaaatgtaac gtatctcaat ctaatcaccc aacctactat ggccaagcat 1200
gctggtgcct tctggggtgg cccttactat ggtgtattta gctgggacaa caagcttgct 1260
ggtgctcagt tgctgttgag ccggttgagg ttgtttctga gtcctggata tccatatgaa 1320
gaaattctaa ggaccttcca caatcagacc aqcatagtca tgtgctcata cttgcctatt 1380
ttcaacaaat ttaacagaac caatggaggt ttaatagagt tgaatcatgg agctccacag 1440
ccgctgcaat attctgtaaa tgcagctttc ttagcgactc tatacagtga ttatctggat 1500
gctgctgata ctcctggatg gtactgtgga cctaatttct attcgacaag tgtgctacgt 1560
gactttgcta gatcccagat tgattatata ctgggtaaaa accctcggaa aatgagttat 1620
gtcgttggtt ttggcacaaa atacccaaga catgtgcatc acagaggagc ttcgataccc 1680
aagaacaaag tcaagtataa ctgcaaagga ggatggaaat ggagagacag caagaaacca 1740
aacccaaaca cgattgaagg agccatggtt gctggtcctg acaagcgcga cgggtaccgt 1800
gatgtccgta tgaactacaa ctacactgaa ccgactcttg caggcaatgc tggtctagtc 1860
gcagctcttq tggcattatc gggtgaagaa gaagccaccg gtaagataga caaaaacact 1920
attttctcag ctgttcctcc tttgttccct actccaccac ctccaccagc accatggaaa 1980
ccttgagaaa gctagacttg tgtgattctg tcgctgctgc caaaaaaaat gaatgaggta 2040
agaaggattt gggtgtgaga ccagaagatt agaagctaaa cacaagtcag ccataaccaa 2100
actactaagg atttcatttg gctttactag atacaaacac ggggtgggtt actttaccac 2160
aagcattgtc tttcttttct ttttttgggt tgctgttttg ttcttgtgag atatcatata 2220
tatctatgcg ttttactctg tatatgtttg ataccaaact tgtattcttt gataaacaat 2280
ttaatgaact gtattaaact ttt 2303
<210>2
<211>2031
<212>DNA
<213>Artificial Sequence
<220>
<223>cDNA RSW2 homologue from cotton
<220>
<221>misc_feature
<222>(47)..(1906)
<223>coding region RSW2 homologue
<400>2
ggcacgagcc tgcattttcc gcccactact cttccaaatc ctcatcatgt acggcagaga 60
tccgtgggga ggtcccctgg agataaacgc cactgattct gccactgacg acgacaggag 120
caggaatctg caggacctgg atagggctgc actctctcgc cccttggacg agactcagca 180
aagctggctg cttggccccg gggagcaaaa gaagaagaag aagtacgttg atctcggatg 240
tatcattgtg agccgcaaga tctttgtatg gaccgtgggg accctgctag tctccgccct 300
cctggccgga ctcatcaccc tcatcgtcaa gactgtccca cgtcatcacc accgccactc 360
tccgcccgat aactacactc tggctcttca caaggcgctc atgttcttta atgctcagcg 420
ttctqgaaag ctgcccaagc ataataatgt gtcgtggaga gggaactcgg gcctccaaga 480
tggcaaatcc gatccctccg ttttgatgaa agatctggtc ggcggatatt acgatgctgg 540
agatgctatc aagtttaact ttcctgcatc tttttcaatg actatgttga gctggagtgt 600
catcgaatac agtgctaaat acgaggctgc cggcgagctc aatcatgtta aagagatcat 660
caaatggggt actgattatc ttctgaagac cttcaacaat actqctgata ccattgacag 720
gattgctgcg caggtaggga taggagatac atctggagga agttcagccc caaatgatca 780
ttattgctgg atgcgccctg aggacattga ttacccccgt cctgtatatg aatgtcatag 840
ttgctccgat cttgctgctg aaatggctgc tgctttggct tctgcttcca tcgttttcaa 900
agacaacaaa gcatactctc aaaagcttgt ccatggtgcc cgaacactct ttatgtttgc 960
tagggatcaa agaggcagat atagtgctgg tggttctgac cctgccctct tttataattc 1020
ctcaagttac tgggatgagt ttgtttgggg tggagcctgg ttatactatg ccactgggaa 1080
ttcatcctat cttcagttag ctactcatcc taaacttgcc aagcatgctg gtgctttctg 1140
gggtggccca gattatggtg ttcttagctg ggataataag cttgctggtg ctcaggtgct 1200
tctgagccga ttgagattgt ttttgagtcc tgggtatcca tatgaggaaa tattgagtac 1260
qtttcataat caaaccaqca taattatqtq ctcattcctt ccggttttca ctagctttaa 1320
tagaacaaaa ggaggtttga ttcagttaaa ccatggaagg cctcagccac tgcaatacgt 1380
agtcaatgca gccttcttag ccgccctata tagtgattat cttgatactg ctgatacacc 1440
tggatggtat tgtggtccca atttctattc aactgatgtc ctgcgtgaat ttgccaaaac 1500
ccagattgat tatatccttg gcaaaaatcc tcgaaaaatg agctatgttg tgggctttgg 1560
taaccattat ccaaagcatg ttcaccatag aggggcatct atccctaaga ataagatcaa 1620
atataactgt aaagggggat ggaaatggag ggatacgtca aaaccaaacc ccaacacact 1680
tgtgggagcc atggtagctg gacctgacaa gcatgatggg tttcgtgatg ttcgcaccaa 1740
ctacaactat acggagccaa ctctagcagg caacgcaggg ttggttgctg cactcgtggc 1800
attgtctggt gacaagqcaa ccgtgattga caagaatacc attttttctg cagttccacc 1860
aatqtttcct acaccaccac cacctccqqc accttgqaaa ccatgaaaac gttttgatct 1920
ttcttctgtc catgtgtgac ttacagtctg atgattttgg aattagtttt tggtacgtaa 1980
atgaccttgg aagtgtaagt aacgcaaaag gcaagacagg agatgagtga t 2031
<210>3
<211>4500
<212>DNA
<213>Arabidopsis thaliana
<400>3
gtgtactgcg agaactgctt attacataca tggcagataa tccgcgtaga agaagggttt 60
aacggagacg aatttgaact ctccgacgaa ataatcgtct tctccggcat catcttcaga 120
aagctattcc aaattagggt tttgactttt gattgaagaa gacaggtcta gaaacttaca 180
tacaccaatt ttaaaatcga gtttgggccg aattatggac cgtactttgg gctatgggcc 240
ttcattttaa taaacaggtc ggatatatcc accggacccg gaatgatcgt cttcctcagt 300
gttgtatttt ggctttcctc attgcttcct caatctaagg atttccatga acaaggaact 360
aaaatgagat ctcttctctt tgtactatca ctcatttgct tttgctctca aacagcactt 420
tcatggaaga aggaagagtt tcgcagctgt gaccaaactc cattttgtaa acgcgctcga 480
tctcgtactc ccggcgcgtg ttctctaatt gtcggcqatg tttccatcac tgatggagat 540
ctcgtagcga agcttctacc gaaagcgcct aatcaaggcg atggggatca gatcaagccg 600
ttgattcttt ctctctcagt ttacaaggat gggatcgtgc ggcttaaaat cgatgaggac 660
cattcgttga acccgccgaa gaagaggttc caagttcctg atgtggtagt gtctgagttt 720
gaggagaaga agatctggct gcagaaagta gcgacggaga cgatctctgg agacactagt 780
ccgtcttcag tagtttatgt atccgatggt tacgaggcgg tggtgcgaca cgatccgttt 840
gaggtgtatg tgcgtgagaa atcaggtgat cgccgtcgcg ttgtgtcatt gaattctcat 900
ggattatttg attttgagca gttggggagg aaaactgaag gagataactg ggaagagaaa 960
tttaggactc atacagattc tagaccatct ggtcctcaat ctattagttt cgatgtttcg 1020
ttttatgatt ccagtttcgt ttatggaatt cctgaacacg ccactagctt cgcgttgaag 1080
cctaccaagg gtcctggagt tgaggaatct gaaccctaca ggctttttaa tctagatgtg 1140
tttgaatacg atcatgaatc accgtttggg ctttacgggt cgattccgtt catggtttcg 1200
catgggaagt ctggtaaaac ttcaggattt ttctggttga atgctgcgga aatgcagatt 1260
gatgtgttgg ctaatggttg ggatgcagag agtggtattt ctttgccttc tagtcacagt 1320
aggatcgaca cattctggat gagcgaggca gggattgtgg atacattctt tttcgttggg 1380
cctgagccaa aggatgttgt aaagcagtat gcaagtqtga caggtacttc agccatgcct 1440
cagttgtttg ccactggtta tcatcaatgt aggtggaact acaaagatga ggaggatgtg 1500
gcacaggtgg actcgaaatt cgatgaacac gatattcctt atgatgttct ctggcttgac 1560
attgagcata cagatgggaa gagatacttt acatgggata gtgtgttgtt tcctcatcca 1620
gaggagatgc aaaagaaatt ggctgcaaag ggtaggaaga tggtgaccat tgtggatcct 1680
catatcaaga gggatgactc atacttctta cacaaagagg ctactcaqat gggatactat 1740
gttaaggatt catctggaaa agactttgat ggttggtgct ggcctggttc atcatcttac 1800
attgatatgt tgagcccaga gattagaaaa tggtggggtg ggaggttctc gtataagaac 1860
tatgttggtt caactccatc attgtacacc tggaatgaca tgaatgagcc ttctgtattc 1920
aatggtcccg aggtataact ttctgtctga atggtctttt tttcttgttc cgttattgtt 1980
tttctgtaat ctgtatagct catttctcat attcattttg ggattgcagt tgaatatagc 2040
aatccattgt ttttctattg cacaattatg gatatgtttg aactctgata gattatacat 2100
cccttatctt gcatactatg acacctttta ttaattattg cactactaaa gcaagtattt 2160
taagatccat tttatgttta tqtggtttta cattggatat ttgtttctgt gacttcttta 2220
agagtggagt gtaagctatg gttgcatatc tccacctctg atttgcttat atcgtaqaaa 2280
gtttatcata tatgtaaagg tctattactg agatgaagac tggcactttt ttctttcttt 2340
tttgttggag taggttacta tgccaagaga tgcattacat gttgggggtg ttgaacacag 2400
agaagttcat aacgcatatg gatattactt ccacatggcg acttccgatg gacttgttat 2460
gcgtgaagaa ggaaaggata ggccttttgt attgtcaaga gcaatctttc ccggcactca 2520
aagatacgga gcaatttgga ctggagataa cacagccgaa tgggaacacc ttagagtctc 2580
cattccaatg atattgacac ttggtcttac tggaattaca ttctctggta caaacaaatt 2640
tagctgttca aattctgctg gcgttttttt tttctttctc aaatttaatg gaagttttct 2700
tttcttttgc aggagctgat attggtgggt tttttggaaa tcctgaacca gaacttctag 2760
ttaggtggta ccaagtgggt gcttactatc catttttcag gggtcatgct catcacgata 2820
ccaaaagacg agagccttgg ttgtttgggt aagatgtgat ttagtactta attttttctt 2880
gtcaagaggt attattttag tatgcggtcc aggtctagtc tatggatatt tgcttgatgg 2940
atgatcaaqc agattgaaat gtagtgatac tggttattga gaaaagaata caattgcgga 3000
aactaaaacc tggtgttgca ctctagtcag ttgattgtct aaatagttag gccattagtt 3060
tcatcaagta ggcattgcaa cggttgtcca gaagtctctc tgcctttgtt ttgctggctc 3120
ataaatgttg cactttctca ttcgaatcaa atcaatgttc tcttgtttca gtgaacggaa 3180
cacagaactc atgagagatg ccatacacac tcgttacaca ctgctcccat acttctacac 3240
gttgttcaga gaagcaaacg ttacgggtgt tcctgttgta cqcccattat ggatggaatt 3300
cccgcaagat gaagctactt ttagcaacga tgaagccttc atggtcggta gtggtctact 3360
ggttcaagga gtttacacca aggtacttga gcgctaagta caacttccta cttatttata 3420
ttttggcctt tgtatctctt tacttaatca tatactccag ataaatgatc aaaccctgcc 3480
acataccctc ttctcgtctt tctgcaaaat tagggaacaa cgcaagcttc cgtgtatttg 3540
cctggcaaag aatcatggta tgacttgaga aacggtaaga cttacgttgg aggcaagact 3600
cacaagatgg atgctccaga ggagagtatt cctgcgtttc aaaaggcagg aaccatcatc 3660
ccaaggaagg accggtttag gcgaagttcc tctcaaatgg acaatgatcc ttatactttg 3720
qtacqtacaa cacttqcatc acactqtttt atcatctgct atcaqcacca tqaacaaaqt 3780
aaaaccggtt ggtaaaaaga ttatctctga aagtgaaatc ccaatgataa actatgtgat 3840
ctaacatcta aaacccttca ggtggtagct ttgaacagtt ctcaagaagc agaaggtgaa 3900
ctctacatcg atgacggcaa aagctttgaa ttcagacgag gctcttacat ccatcgtcgc 3960
ttcgtcttct caaagggtgt tcttacatca acgaacttag ctcctccaga agctcgtctc 4020
tcttcccaat gcttgatcga cagaattatc ctcttgggac acagctcagg tccaaaatct 4080
gcgttggtgg aaccgttgaa tcaaaaggca gagattgaga tgggacctct gcgaatgggt 4140
gggcttgtag cttcctcggg tacaaaggtg ttgactatcc gcaaaccggg tgttcgagtg 4200
gaccaagact ggaccgtaaa gattctgtga ttgaacggtt tgaaccagtt tcactcatgg 4260
ccgttagaqt ggccgaaatc tgcttttccg gcgacggaat atcacacttt ttaatatatg 4320
tttggagatt tagacttaaa tagttgtaag agctaacagt ttgaaagtca ctttgcattg 4380
ttgtttatct tcatataaat gagtttagat tttgataatt tcagaattcg tggaatcata 4440
attaacaatt ttgataggga aaaataattt gtttttttta gtcagagggt caaataatct 4500
<210>4
<211>1773
<212>DNA
<213>Artificial Sequence
<220>
<223>cDNA RSW3 homologue from cotton(partial 3′end)
<220>
<221>misc_feature
<222>(2)..(1576)
<223>C-terminal part of the coding region
<400>4
atatgatgtt ttgtggcttg atattgagca tactgatgga aagaggtatt tcacatggga 60
taagatgcta ttcccacatc cagaagagat qcaaaggaaa ttggctgcca aaggtaggca 120
tatggtgaca attgtggatc cgcatattaa gagggatgag tcatttcact tgcacaagga 180
tgcttcccag agggggtatt atgtaaagga tgcaactggc aaggattatg atgggtggtg 240
ctggccaggc tcctcctcct acccagatat gttaaatccc gagattaggt catggtgggc 300
tgagaagttc tcctatgata attatgtcgg ttcaactcct tcattgtaca tttggaatga 360
catgaatgag ccttctgtgt ttaatggacc tgaggtgaca atgcccagag atgctttaca 420
tgttggtgga gtggaacatc gggagttaca taatgcctat ggatattact tccacatggc 480
aacagctgaa ggccttctaa agcgtggaga tggtaaggac agaccttttg tcttgtccag 540
agcattcttt gctggaagtc aaaggtatgg agcagtctgg actggtgata attcggcaga 600
ttgggatcat ctcagggttt cagtcccaat ggttttgacg cttggtctta ctggaatgac 660
attctctggg gctgatgttg gtggattttt tggcaatcct gagcctgagt tattagtgcg 720
ttggtatcaa cttggtgctt attatccttt ctttagaggt catgctcatc atgacacaaa 780
aagacgagag ccttggttgt ttggtgaacg aaataccgca cttatgagag atgccatacg 840
aattcgttac accttgcttc catacttcta cacattattc agagaagcaa atgttagtgg 900
tgttcctgtt gtacggccat tatggatgga gttcccatct gatgaagcag ctttcagcaa 960
tgatgaagcc ttcatggttg ggaacagtct tttagtacaa gggatctata ctgcaagggc 1020
taaacatgca tcagtatatt tgcctgggaa ggaatcgtgg tacgacctta gaacaggaac 1080
tgcatataag ggaggaaagg tccataaact tgaagtttca gaagagagca ttcctgcttt 1140
ccaaagagct ggcacaatag tgccaagaaa agaccggttc cgtagaagct ccacacaaat 1200
ggtgcatgat ccttacacac tqqtaatagc tctgaacagt tcccaagcag ctgaaggtga 1260
actctatgtt gatgatggaa aaagctatga cttcaaacat ggggcataca tccatcgccg 1320
ctttgtgttc tcgaatgggc atctaacatc ctctcccgtt ggcaactcta ggttttcgtc 1380
tgactgcatt atcgagcggg ttattcttct tggatttacc cctggggcta aaactgctct 1440
tgtcgaacca ggaaatcaga aggctgaaat cgaacttggt ccacttcggt tcgggggaca 1500
acatgctgct gttgctgtaa ccatccggaa gcctggtgtg agggtggctg aagattggaa 1560
gataaaaatt ttgtaggatg tctatttagt tcggtgaaaa tgtaatgcca agtaaagctc 1620
tcctgctact tcgttattct cgacttttta gagtttatga tggagaaaac tggaaagccg 1680
ttgacatttc cttcgttcaa tttactttct acttttaaga atttaaaaaa aaagtcgacg 1740
cggccgcgaa ttccggaccg gtacctgcag gcg 1773
<210>5
<211>621
<212>PRT
<213>Arabidopsis thaliana
<400>5
Met Tyr Gly Arg Asp Pro Trp Gly Gly Pro Leu Glu Ile Asn Thr Ala
1 5 10 15
Asp Ser Ala Thr Asp Asp Asp Arg Ser Arg Asn Leu Asn Asp Leu Asp
20 25 30
Arg Ala Ala Leu Ser Arg Pro Leu Asp Glu Thr Gln Gln Ser Trp Leu
35 40 45
Leu Gly Pro Thr Glu Gln Lys Lys Lys Lys Tyr Val Asp Leu Gly Cys
50 55 60
Ile Ile Val Ser Arg Lys Ile Phe Val Trp Thr Val Gly Thr Leu Val
65 70 75 80
Ala Ala Ala Leu Leu Ala Gly Phe Ile Thr Leu Ile Val Lys Thr Val
85 90 95
Pro Arg His His Pro Lys Thr Pro Pro Pro Asp Asn Tyr Thr Ile Ala
100 105 110
Leu His Lys Ala Leu Lys Phe Phe Asn Ala Gln Lys Ser Gly Lys Leu
115 120 125
pro Lys His Asn Asn Val Ser Trp Arg Gly Asn Ser Gly Leu Gln Asp
130 135 140
Gly Lys Gly Glu Thr Gly Ser Phe Tyr Lys Asp Leu Val Gly Gly Tyr
145 150 155 160
Tyr Asp Ala Gly Asp Ala Ile Lys Phe Asn Phe Pro Met Ala Tyr Ala
165 170 175
Met Thr Met Leu Ser Trp Ser Val Ile Glu Tyr Ser Ala Lys Tyr Glu
180 185 190
Ala Ala Gly Glu Leu Thr His Val Lys Glu Leu Ile Lys Trp Gly Thr
195 200 205
Asp Tyr Phe Leu Lys Thr Phe Asn Ser Thr Ala Asp Ser Ile Asp Asp
210 215 220
Leu Val Ser Gln Val Gly Ser Gly Asn Thr Asp Asp Gly Asn Thr Asp
225 230 235 240
Pro Asn Asp His Tyr Cys Trp Met Arg Pro Glu Asp Met Asp Tyr Lys
245 250 255
Arg Pro Val Thr Thr Cys Asn Gly Gly Cys Ser Asp Leu Ala Ala Glu
260 265 270
Met Ala Ala Ala Leu Ala Ser Ala Ser Ile Val Phe Lys Asp Asn Lys
275 280 285
Glu Tyr Ser Lys Lys Leu Val His Gly Ala Lys Val Val Tyr Gln Phe
290 295 300
Gly Arg Thr Arg Arg Gly Arg Tyr Ser Ala Gly Thr Ala Glu Ser Ser
305 310 315 320
Lys Phe Tyr Asn Ser Ser Met Tyr Trp Asp Glu Phe Ile Trp Gly Gly
325 330 335
Ala Trp Met Tyr Tyr Ala Thr Gly Asn Val Thr Tyr Leu Asn Leu Ile
340 345 350
Thr Gln Pro Thr Met Ala Lys His Ala Gly Ala Phe Trp Gly Gly Pro
355 360 365
Tyr Tyr Gly Val Phe Ser Trp Asp Asn Lys Leu Ala Gly Ala Gln Leu
370 375 380
Leu Leu Ser Arg Leu Arg Leu Phe Leu Ser Pro Gly Tyr Pro Tyr Glu
385 390 395 400
Glu Ile Leu Arq Thr Phe His Asn Gln Thr Ser Ile Val Met Cys Ser
405 410 415
Tyr Leu Pro Ile Phe Asn Lys Phe Asn Arg Thr Asn Gly Gly Leu Ile
420 425 430
Glu Leu Asn His Gly Ala Pro Gln Pro Leu Gln Tyr Ser Val Asn Ala
435 440 445
Ala Phe Leu Ala Thr Leu Tyr Ser Asp Tyr Leu Asp Ala Ala Asp Thr
450 455 460
Pro Gly Trp Tyr Cys Gly Pro Asn Phe Tyr Ser Thr Ser Val Leu Arg
465 470 475 480
Asp Phe Ala Arg Ser Gln Ile Asp Tyr Ile Leu Gly Lys Asn Pro Arg
485 490 495
Lys Met Ser Tyr Val Val Gly Phe Gly Thr Lys Tyr Pro Arg His Val
500 505 510
His His Arg Gly Ala Ser Ile Pro Lys Asn Lys Val Lys Tyr Asn Cys
515 520 525
Lys Gly Gly Trp Lys Trp Arg Asp Ser Lys Lys Pro Asn Pro Asn Thr
530 535 540
Ile Glu Gly Ala Met Val Ala Gly Pro Asp Lys Arg Asp Gly Tyr Arg
545 550 555 560
Asp Val Arg Met Asn Tyr Asn Tyr Thr Glu Pro Thr Leu Ala Gly Asn
565 570 575
Ala Gly Leu Val Ala Ala Leu Val Ala Leu Ser Gly Glu Glu Glu Ala
580 585 590
Thr Gly Lys Ile Asp Lys Asn Thr Ile Phe Ser Ala Val Pro Pro Leu
595 600 605
Phe Pro Thr Pro Pro Pro Pro Pro Ala Pro Trp Lys Pro
610 615 620
<210>6
<211>619
<212>PRT
<213>cotton
<400>6
Met Tyr Gly Arg Asp Pro Trp Gly Gly Pro Leu Glu Ile Asn Ala Thr
1 5 10 15
Asp Ser Ala Thr Asp Asp Asp Arg Ser Arg Asn Leu Gln Asp Leu Asp
20 25 30
Arg Ala Ala Leu Ser Arg Pro Leu Asp Glu Thr Gln Gln Ser Trp Leu
35 40 45
Leu Gly Pro Gly Glu Gln Lys Lys Lys Lys Lys Tyr Val Asp Leu Gly
50 55 60
Cys Ile Ile Val Ser Arg Lys Ile Phe Val Trp Thr Val Gly Thr Leu
65 70 75 80
Leu Val Ser Ala Leu Leu Ala Gly Leu Ile Thr Leu Ile Val Lys Thr
85 90 95
Val Pro Arg His His His Arg His Ser Pro Pro Asp Asn Tyr Thr Leu
100 105 110
Ala Leu His Lys Ala Leu Met Phe Phe Asn Ala Gln Arg Ser Gly Lys
115 120 125
Leu Pro Lys His Asn Asn Val Ser Trp Arg Gly Asn Ser Gly Leu Gln
130 135 140
Asp Gly Lys Ser Asp Pro Ser Val Leu Met Lys Asp Leu VaL Gly Gly
145 150 155 160
Tyr Tyr Asp Ala Gly Asp Ala Ile Lys Phe Asn Phe Pro Ala Ser Phe
165 170 175
Ser Met Thr Met Leu Ser Trp Ser Val Ile Glu Tyr Ser Ala Lys Tyr
180 185 190
Glu Ala Ala Gly Glu Leu Asn His Val Lys Glu Ile Ile Lys Trp Gly
195 200 205
Thr Asp Tyr Leu Leu Lys Thr Phe Asn Asn Thr Ala Asp Thr Ile Asp
210 215 220
Arg Ile Ala Ala Gln Val Gly Ile Gly Asp Thr Ser Gly Gly Ser Ser
225 230 235 240
Ala Pro Asn Asp His Tyr Cys Trp Met Arg Pro Glu Asp Ile Asp Tyr
245 250 255
Pro Arg Pro Val Tyr Glu Cys His Ser Cys Ser Asp Leu Ala Ala Glu
260 265 270
Met Ala Ala Ala Leu Ala Ser Ala Ser Ile Val Phe Lys Asp Asn Lys
275 280 285
Ala Tyr Ser Gln Lys Leu Val His Gly Ala Arg Thr Leu Phe Met Phe
290 295 300
Ala Arg Asp Gln Arg Gly Arg Tyr Ser Ala Gly Gly Ser Asp Pro Ala
305 310 315 320
Leu Phe Tyr Asn Ser Ser Ser Tyr Trp Asp Glu Phe Val Trp Gly Gly
325 330 335
Ala Trp Leu Tyr Tyr Ala Thr Gly Asn Ser Ser Tyr Leu Gln Leu Ala
340 345 350
Thr His Pro Lys Leu Ala Lys His Ala Gly Ala Phe Trp Gly Gly Pro
355 360 365
Asp Tyr Gly Val Leu Ser Trp Asp Asn Lys Leu Ala Gly Ala Gln Val
370 375 380
Leu Leu Ser Arg Leu Arg Leu Phe Leu Ser Pro Gly Tyr Pro Tyr Glu
385 390 395 400
Glu Ile Leu Ser Thr Phe His Asn Gln Thr Ser Ile Ile Met Cys Ser
405 410 415
Phe Leu Pro Val Phe Thr Ser Phe Asn Arg Thr Lys Gly Gly Leu Ile
420 425 430
Gln Leu Asn His Gly Arg Pro Gln Pro Leu Gln Tyr Val Val Asn Ala
435 440 445
Ala Phe Leu Ala Ala Leu Tyr Ser Asp Tyr Leu Asp Thr Ala Asp Thr
450 455 460
Pro Gly Trp Tyr Cys Gly Pro Asn Phe Tyr Ser Thr Asp Val Leu Arg
465 470 475 480
Glu Phe Ala Lys Thr Gln Ile Asp Tyr Ile Leu Gly Lys Asn Pro Arg
485 490 495
Lys Met Ser Tyr Val Val Gly Phe Gly Asn His Tyr Pro Lys His Val
500 505 510
His His Arg Gly Ala Ser Ile Pro Lys Asn Lys Ile Lys Tyr Asn Cys
515 520 525
Lys Gly Gly Trp Lys Trp Arg Asp Thr Ser Lys Pro Asn Pro Asn Thr
530 535 540
Leu Val Gly Ala Met Val Ala Gly Pro Asp Lys His Asp Gly Phe Arg
545 550 555 560
Asp Val Arg Thr Asn Tyr Asn Tyr Thr Glu Pro Thr Leu Ala Gly Asn
565 570 575
Ala Gly Leu Val Ala Ala Leu Val Ala Leu Ser Gly Asp Lys Ala Thr
580 585 590
Val Ile Asp Lys Asn Thr Ile Phe Ser Ala Val Pro Pro Met Phe Pro
595 600 605
Thr Pro Pro Pro Pro Pro Ala Pro Trp Lys Pro
610 615
<210>7
<211>921
<212>PRT
<213>Arabidopsis thaliana
<400>7
Met Arg Ser Leu Leu Phe Val Leu Ser Leu Ile Cys Phe Cys Ser Gln
1 5 10 15
Thr Ala Leu Ser Trp Lys Lys Glu Glu Phe Arg Ser Cys Asp Gln Thr
20 25 30
Pro Phe Cys Lys Arg Ala Arg Ser Arg Thr Pro Gly Ala Cys Ser Leu
35 40 45
Ile Val Gly Asp Val Ser Ile Thr Asp Gly Asp Leu Val Ala Lys Leu
50 55 60
Leu Pro Lys Ala Pro Asn Gln Gly Asp Gly Asp Gln Ile Lys Pro Leu
65 70 75 80
Ile Leu Ser Leu Ser Val Tyr Lys Asp Gly Ile Val Arg Leu Lys Ile
85 90 95
Asp Glu Asp His Ser Leu Asn Pro Pro Lys Lys Arg Phe Gln Val Pro
100 105 110
Asp Val Val Val Ser Glu Phe Glu Glu Lys Lys Ile Trp Leu Gln Lys
115 120 125
Val Ala Thr Glu Thr Ile Ser Gly Asp Thr Ser Pro Ser Ser Val Val
130 135 140
Tyr Val Ser Asp Gly Tyr Glu Ala Val Val Arg His Asp Pro Phe Glu
145 150 155 160
Val Tyr Val Arg Glu Lys Ser Gly Asp Arg Arg Arg Val Val Ser Leu
165 170 175
Asn Ser His Gly Leu Phe Asp Phe Glu Gln Leu Gly Arg Lys Thr Glu
180 185 190
Gly Asp Asn Trp Glu Glu Lys Phe Arg Thr His Thr Asp Ser Arg Pro
195 200 205
Ser Gly Pro Gln Ser Ile Ser Phe Asp Val Ser Phe Tyr Asp Ser Ser
210 215 220
Phe Val Tyr Gly Ile Pro Glu His Ala Thr Ser Phe Ala Leu Lys Pro
225 230 235 240
Thr Lys Gly Pro Gly Val Glu Glu Ser Glu Pro Tyr Arg Leu Phe Asn
245 250 255
Leu Asp Val Phe Glu Tyr Asp His Glu Ser Pro Phe Gly Leu Tyr Gly
260 265 270
Ser Ile Pro Phe Met Val Ser His Gly Lys Ser Gly Lys Thr Ser Gly
275 280 285
Phe Phe Trp Leu Asn Ala Ala Glu Met Gln Ile Asp Val Leu Ala Asn
290 295 300
Gly Trp Asp Ala Glu Ser Gly Ile Ser Leu Pro Ser Ser His Ser Arg
305 310 315 320
Ile Asp Thr Phe Trp Met Ser Glu Ala Gly Ile Val Asp Thr Phe Phe
325 330 335
Phe Val Gly Pro Glu Pro Lys Asp Val Val Lys Gln Tyr Ala Ser Val
340 345 350
Thr Gly Thr Ser Ala Met Pro Gln Leu Phe Ala Thr Gly Tyr His Gln
355 360 365
Cys Arg Trp Asn Tyr Lys Asp Glu Glu Asp Val Ala Gln Val Asp Ser
370 375 380
Lys Phe Asp Glu His Asp Ile Pro Tyr Asp Val Leu Trp Leu Asp Ile
385 390 395 400
Glu His Thr Asp Gly Lys Arg Tyr Phe Thr Trp Asp Ser Val Leu Phe
405 410 415
Pro His Pro Glu Glu Met Gln Lys Lys Leu Ala Ala Lys Gly Arg Lys
420 425 430
Met Val Thr Ile Val Asp Pro His Ile Lys Arg Asp Asp Ser Tyr Phe
435 440 445
Leu His Lys Glu Ala Thr Gln Met Gly Tyr Tyr Val Lys Asp Ser Ser
450 455 460
Gly Lys Asp Phe Asp Gly Trp Cys Trp Pro Gly Ser Ser Ser Tyr Ile
465 470 475 480
Asp Met Leu Ser Pro Glu Ile Arg Lys Trp Trp Gly Gly Arg Phe Ser
485 490 495
Tyr Lys Asn Tyr Val Gly Ser Thr Pro Ser Leu Tyr Thr Trp Asn Asp
500 505 510
Met Asn Glu Pro Ser Val Phe Asn Gly Pro Glu Val Thr Met Pro Arg
515 520 525
Asp Ala Leu His Val Gly Gly Val Glu His Arg Glu Val His Asn Ala
530 535 540
Tyr Gly Tyr Tyr Phe His Met Ala Thr Ser Asp Gly Leu Val Met Arg
545 550 555 560
Glu Glu Gly Lys Asp Arg Pro Phe Val Leu Ser Arg Ala Ile Phe Pro
565 570 575
Gly Thr Gln Arg Tyr Gly Ala Ile Trp Thr Gly Asp Asn Thr Ala Glu
580 585 590
Trp Glu His Leu Arg Val Ser Ile Pro Met Ile Leu Thr Leu Gly Leu
595 600 605
Thr Gly Ile Thr Phe Ser Gly Ala Asp Ile Gly Gly Phe Phe Gly Asn
610 615 620
Pro Glu Pro Glu Leu Leu Val Arg Trp Tyr Gln Val Gly Ala Tyr Tyr
625 630 635 640
Pro Phe Phe Arg Gly His Ala His His Asp Thr Lys Arg Arg Glu Pro
645 650 655
Trp Leu Phe Gly Glu Arg Asn Thr Glu Leu Met Arg Asp Ala Ile His
660 665 670
Thr Arg Tyr Thr Leu Leu Pro Tyr Phe Tyr Thr Leu Phe Arg Glu Ala
675 680 685
Asn Val Thr Gly Val Pro Val Val Arg Pro Leu Trp Met Glu Phe Pro
690 695 700
Gln Asp Glu Ala Thr Phe Ser Asn Asp Glu Ala Phe Met Val Gly Ser
705 7l0 715 720
Gly Leu Leu Val Gln Gly Val Tyr Thr Lys Gly Thr Thr Gln Ala Ser
725 730 735
Val Tyr Leu Pro Gly Lys Glu Ser Trp Tyr Asp Leu Arg Asn Gly Lys
740 745 750
Thr Tyr Val Gly Gly Lys Thr His Lys Met Asp Ala Pro Glu Glu Ser
755 760 765
Ile Pro Ala Phe Gln Lys Ala Gly Thr Ile Ile Pro Arg Lys Asp Arg
770 775 780
Phe Arg Arg Ser Ser Ser Gln Met Asp Asn Asp Pro Tyr Thr Leu Val
785 790 795 800
Val Ala Leu Asn Ser Ser Gln Glu Ala Glu Gly Glu Leu Tyr Ile Asp
805 810 815
Asp Gly Lys Ser Phe Glu Phe Arg Arg Gly Ser Tyr Ile His Arg Arg
820 825 830
Phe Val Phe Ser Lys Gly Val Leu Thr Ser Thr Asn Leu Ala Pro Pro
835 840 845
Glu Ala Arg Leu Ser Ser Gln Cys Leu Ile Asp Arg Ile Ile Leu Leu
850 855 860
Gly His Ser Ser Gly Pro Lys Ser Ala Leu Val Glu Pro Leu Asn Gln
865 870 875 880
Lys Ala Glu Ile Glu Met Gly Pro Leu Arg Met Gly Gly Leu Val Ala
885 890 895
Ser Ser Gly Thr Lys Val Leu Thr Ile Arg Lys Pro Gly Val Arg Val
900 905 910
Asp Gln Asp Trp Thr Val Lys Ile Leu
915 920
<210>8
<211>524
<212>PRT
<213>cotton
<400>8
Tyr Asp Val Leu Trp Leu Asp Ile Glu His Thr Asp Gly Lys Arg Tyr
1 5 10 15
Phe Thr Trp Asp Lys Met Leu Phe Pro His Pro Glu Glu Met Gln Arg
20 25 30
Lys Leu Ala Ala Lys Gly Arg His Met Val Thr Ile Val Asp Pro His
35 40 45
Ile Lys Arg Asp Glu Ser Phe His Leu His Lys Asp Ala Ser Gln Arg
50 55 60
Gly Tyr Tyr Val Lys Asp Ala Thr Gly Lys Asp Tyr Asp Gly Trp Cys
65 70 75 80
Trp Pro Gly Ser Ser Ser Tyr Pro Asp Met Leu Asn Pro Glu Ile Arg
85 90 95
Ser Trp Trp Ala Glu Lys Phe Ser Tyr Asp Asn Tyr Val Gly Ser Thr
100 105 110
Pro Ser Leu Tyr Ile Trp Asn Asp Met Asn Glu Pro Ser Val Phe Asn
115 120 125
Gly Pro Glu Val Thr Met Pro Arg Asp Ala Leu His Val Gly Gly Val
130 135 140
Glu His Arg Glu Leu His Asn Ala Tyr Gly Tyr Tyr Phe His Met Ala
145 150 155 160
Thr Ala Glu Gly Leu Leu Lys Arg Gly Asp Gly Lys Asp Arg Pro Phe
165 170 175
Val Leu Ser Arg Ala Phe Phe Ala Gly Ser Gln Arg Tyr Gly Ala Val
180 185 190
Trp Thr Gly Asp Asn Ser Ala Asp Trp Asp His Leu Arg Val Ser Val
195 200 205
Pro Met Val Leu Thr Leu Gly Leu Thr Gly Met Thr Phe Ser Gly Ala
210 215 220
Asp Val Gly Gly Phe Phe Gly Asn Pro Glu Pro Glu Leu Leu Val Arg
225 230 235 240
Trp Tyr Gln Leu Gly Ala Tyr Tyr Pro Phe Phe Arg Gly His Ala His
245 250 255
His Asp Thr Lys Arg Arg Glu Pro Trp Leu Phe Gly Glu Arg Asn Thr
260 265 270
Ala Leu Met Arg Asp Ala Ile Arg Ile Arg Tyr Thr Leu Leu Pro Tyr
275 280 285
Phe Tyr Thr Leu Phe Arg Glu Ala Asn Val Ser Gly Val Pro Val Val
290 295 300
Arg Pro Leu Trp Met Glu Phe Pro Ser Asp Glu Ala Ala Phe Ser Asn
305 310 315 320
Asp Glu Ala Phe Met Val Gly Asn Ser Leu Leu Val Gln Gly Ile Tyr
325 330 335
Thr Ala Arg Ala Lys His Ala Ser Val Tyr Leu Pro Gly Lys Glu Ser
340 345 350
Trp Tyr Asp Leu Arg Thr Gly Thr Ala Tyr Lys Gly Gly Lys Val His
355 360 365
Lys Leu Glu Val Ser Glu Glu Ser Ile Pro Ala Phe Gln Arg Ala Gly
370 375 380
Thr Ile Val Pro Arg Lys Asp Arg Phe Arg Arg Ser Ser Thr Gln Met
385 390 395 400
Val His Asp Pro Tyr Thr Leu Val Ile Ala Leu Asn Ser Ser Gln Ala
405 410 415
Ala Glu Gly Glu Leu Tyr Val Asp Asp Gly Lys Ser Tyr Asp Phe Lys
420 425 430
His Gly Ala Tyr Ile His Arg Arg Phe Val Phe Ser Asn Gly His Leu
435 440 445
Thr Ser Ser Pro Val Gly Asn Ser Arg Phe Ser Ser Asp Cys Ile Ile
450 455 460
Glu Arg Val Ile Leu Leu Gly Phe Thr Pro Gly Ala Lys Thr Ala Leu
465 470 475 480
Val Glu Pro Gly Asn Gln Lys Ala Glu Ile Glu Leu Gly Pro Leu Arg
485 490 495
Phe Gly Gly Gln His Ala Ala Val Ala Val Thr Ile Arg Lys Pro Gly
500 505 510
Val Arg Val Ala Glu Asp Trp Lys Ile Lys Ile Leu
515 520
<210>9
<211>2766
<212>DNA
<213>Arabidopsis thaliana
<400>9
atgagatctc ttctctttgt actatcactc atttgctttt gctctcaaac agcactttca 60
tggaagaagg aagagtttcg caqctgtgac caaactccat tttgtaaacg cgctcgatct 120
cgtactcccg gcgcgtgttc tctaattgtc ggcgatgttt ccatcactga tggagatctc 180
gtagcgaagc ttctaccgaa agcgcctaat caaggcgatg gggatcagat caagccgttg 240
attctttctc tctcagttta caaggatggg atcgtgcggc ttaaaatcga tgaggaccat 300
tcgttgaacc cgccgaagaa gaggttccaa gttcctgatg tggtagtgtc tgagtttgag 360
gagaagaaga tctggctgca gaaagtagcg acggagacga tctctggaga cactagtccg 420
tcttcagtag tttatgtatc cgatggttac gaggcggtgg tgcgacacga tccgtttgag 480
gtgtatgtgc gtgagaaatc aggtgatcgc cgtcgcgttg tgtcattgaa ttctcatgga 540
ttatttgatt ttgagcagtt ggggaggaaa actgaaggag ataactggga agagaaattt 600
aggactcata cagattctag accatctggt cctcaatcta ttagtttcga tgtttcgttt 660
tatgattcca gtttcgttta tggaattcct gaacacgcca ctagcttcgc gttgaagcct 720
accaagggtc ctggagttga ggaatctgaa ccctacaggc tttttaatct agatgtgttt 780
gaatacgatc atgaatcacc gtttgggctt tacgggtcga ttccgttcat ggtttcgcat 840
ggqaagtctg gtaaaacttc aggatttttc tggttgaatg ctgcggaaat gcagattgat 900
gtgttggcta atggttggga tgcagagagt ggtatttctt tgccttctag tcacagtagg 960
atcgacacat tctggatgag cgaggcaggg attgtggata cattcttttt cgttgggcct 1020
gagccaaagg atgttgtaaa gcagtatgca agtgtgacag gtacttcagc catgcctcag 1080
ttgtttgcca ctggttatca tcaatgtagg tggaactaca aagatgagga ggatgtggca 1140
caggtggact cgaaattcga tgaacacgat attccttatg atgttctctg gcttgacatt 1200
gagcatacag atgggaagag atactttaca tgggatagtg tgttgtttcc tcatccagag 1260
gagatgcaaa agaaattggc tgcaaagggt aggaagatgg tgaccattgt ggatcctcat 1320
atcaagaggg atgactcata cttcttacac aaaqaggcta ctcagatggg atactatgtt 1380
aaggattcat ctggaaaaga ctttgatggt tggtgctggc ctggttcatc atcttacatt 1440
gatatgttga gcccagagat tagaaaatgg tggggtggga ggttctcgta taagaactat 1500
gttggttcaa ctccatcatt gtacacctgg aatgacatga atgagccttc tgtattcaat 1560
ggtcccgagg ttactatgcc aagagatgca ttacatgttg gqggtgttga acacagagaa 1620
gttcataacg catatggata ttacttccac atggcgactt ccgatggact tgttatgcgt 1680
gaagaaggaa aggataggcc ttttgtattg tcaagagcaa tctttcccgg cactcaaaga 1740
tacggagcaa tttggactgg agataacaca gccgaatggg aacaccttag agtctccatt 1800
ccaatgatat tgacacttgg tcttactgga attacattct ctqgagctga tattggtqgg 1860
ttttttggaa atcctgaacc agaacttcta gttaggtggt accaagtggg tgcttactat 1920
ccatttttca ggggtcatgc tcatcacgat accaaaagac gagagccttg gttgtttggt 1980
gaacggaaca cagaactcat gagagatgcc atacacactc gttacacact gctcccatac 2040
ttctacacgt tgttcagaga agcaaacgtt acgggtgttc ctgttgtacg cccattatgg 2100
atggaattcc cgcaagatga agctactttt agcaacgatg aagccttcat ggtcggtagt 2160
ggtctactgg ttcaaggagt ttacaccaag ggaacaacgc aagcttccgt gtatttgcct 2220
ggcaaagaat catggtatga cttgagaaac ggtaagactt acgttggagg caagactcac 2280
aagatggatg ctccagagga gagtattcct gcgtttcaaa aggcaggaac catcatccca 2340
aggaaggacc ggtttaggcg aagttcctct caaatggaca atgatcctta tactttggtg 2400
gtagctttga acagttctca agaagcagaa ggtgaactct acatcgatga cggcaaaagc 2460
tttgaattca gacgaggctc ttacatccat cgtcgcttcg tcttctcaaa gggtgttctt 2520
acatcaacga acttagctcc tccagaagct cgtctctctt cccaatgctt gatcgacaga 2580
attatcctct tgggacacag ctcaggtcca aaatctgcgt tggtggaacc gttgaatcaa 2640
aaggcagaga ttgagatggg acctctgcga atgggtgggc ttgtagcttc ctcgggtaca 2700
aaggtgttga ctatccgcaa accgggtgtt cgagtggacc aagactggac cgtaaagatt 2760
ctgtga 2766
<210>10
<211>29
<212>DNA
<213>Artificial sequence
<220>
<223>oligonucleotide PCR primer
<400>10
ccgctcgagc gggcattttc cgcccacta 29
<210>11
<211>29
<212>DNA
<213>Artificial Sequence
<220>
<223>oligonucleotide PCR primer
<400>11
cgggatcccg tcacacatgg acagaagaa 29
<210>12
<211>19
<212>DNA
<213>Artificial Sequence
<220>
<223>oligonucleotide PCR primer
<400>12
gacggcgtct agaagattc 19
<210>13
<211>19
<212>DNA
<213>Artificial Sequence
<220>
<223>oligonucleotide PCR primer
<400>13
taacttatcg ggcttctgc 19
<210>14
<211>21
<212>DNA
<213>Artificial Sequence
<220>
<223>oligonucleotide PCR primer
<400>14
ccctcgcttg gtacaaggta t 21
<210>15
<211>21
<212>DNA
<213>Artificial Sequence
<220>
<223>oligonucleotide PCR primer
<400>15
tcctqatcct ctcaccacgt a 21
<210>16
<211>21
<212>DNA
<213>Artificial Sequence
<220>
<223>oligonucleotide PCR primer
<400>16
cgtagtggtc tactggttca a 21
<210>17
<211>21
<212>DNA
<213>Artificial Sequence
<220>
<223>oligonucleotide PCR primer
<400>17
tgagctgtgt cccaagagga t 21
<210>18
<211>21
<212>DNA
<213>Artificial Sequence
<220>
<223>oligonucleotide PCR primer
<400>18
ggtgatgagg ataccagcga t 21
<210>19
<211>21
<212>DNA
<213>Artificial Sequence
<220>
<223>oligonucleotide PCR primer
<400>19
cccactccct aaccggagtt t 21
<210>20
<211>35
<212>DNA
<213>Artificial Sequence
<220>
<223>oligonucleotide PCR primer
<400>20
ccgctcgagc ggtttcactc acaactgtgg tctct 35
<210>21
<211>34
<212>DNA
<213>Artificial Sequence
<220>
<223>oligonucleotide PCR primer
<400>21
ccgctcgagc ggtctcctaa gtcctaaccc cata 34
<210>22
<211>21
<212>DNA
<213>Artificial Sequence
<220>
<223>oligonucleotide PCR primer
<400>22
cgggatgaag aggatgtaga g 21
<210>23
<211>21
<212>DNA
<213>Artificial Sequence
<220>
<223>oligonucleotide PCR primer
<400>23
gaacccctga gatgatccca a 21
Claims (42)
1.一种用于在产纤维植物中增加纤维素生物合成的方法,包括给所述产纤维植物的细胞提供一种嵌合基因的步骤,所述嵌合基因包括以下可操纵连接的DNA片段:
i)在所述植物的所述细胞中可表达的启动子;
ii)编码包括SEQ ID No.5或SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白或其变体的DNA区域,所述变体具有同样的酶活性;
iii)参与转录终止和聚腺苷酸化的3’区域。
2.权利要求1的方法,其中所述产纤维植物是棉属植物。
3.权利要求1或2的方法,其中所述DNA区域包括SEQ ID No.1的自第121位核苷酸至第1986位核苷酸的核苷酸序列、或SEQ ID No.2的自第47位核苷酸至第1906位核苷酸的核苷酸序列、或SEQ ID No.3的或SEQ ID No.4的自第2位核苷酸至第1576位核苷酸的核苷酸序列或SEQ ID No.9的核苷酸序列。
4.权利要求1至3中任一项的方法,其中所述启动子是组成型启动子。
5.权利要求1至3中任一项的方法,其中所述启动子是纤维特异性启动子。
6.权利要求1至3中任一项的方法,其中所述启动子是苹果青霉素启动子。
7.权利要求1至6中任一项的方法,其中在棉绒纤维中增加所述纤维素生物合成。
8.一种用于在产纤维植物中降低纤维素生物合成的方法,包括给所述产纤维植物的细胞提供一种嵌合基因的步骤,所述嵌合基因能够降低所述产纤维植物的一种内源性基因的表达,其中所述内源性基因编码包括SEQ ID No.5或SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质或其变体,所述变体具有同样的酶活性。
9.权利要求8的方法,其中所述产纤维植物是棉属植物。
10.权利要求8或9的方法,其中所述嵌合基因包括一种可操纵地连接于在植物中可表达的启动子的具有21个连续的核苷酸的核苷酸序列以及参与转录终止和聚腺苷酸化的3’区域,该序列选自编码一种包括SEQ ID No.5或SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质的核苷酸序列。
11.权利要求8或9的方法,其中所述嵌合基因包括一种可操纵地连接于在植物中可表达的启动子的具有21个连续的核苷酸的序列以及参与转录终止和聚腺苷酸化的3’区域,该序列选自SEQ ID No.1或SEQID No.2或SEQ ID No.3或SEQ ID No.4或SEQ ID No.9的核苷酸序列。
12.权利要求8或9的方法,其中所述嵌合基因包括一种可操纵地连接于在植物中可表达的启动子的具有21个连续的核苷酸的核苷酸序列以及参与转录终止和聚腺苷酸化的3’区域,该序列选自编码一种包括SEQ ID No.5或SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质的核苷酸序列的互补序列。
13.权利要求8或9的方法,其中所述嵌合基因包括一种具有21个连续的核苷酸的核苷酸序列,该序列选自SEQ ID No.1或SEQ ID No.2或SEQ ID No.3或SEQ ID No.4或SEQ ID No.9的核苷酸序列的互补序列。
14. 权利要求8或9的方法,其中所述嵌合基因包括可操纵地连接于在植物中可表达的启动子的一种选自编码一种包括SEQ ID No.5或SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质的核苷酸序列的具有21个连续的核苷酸的第一核苷酸序列和一种与所述第一核苷酸序列互补的第二核苷酸序列,以及参与转录终止和聚腺苷酸化的3’区域,由此在所述嵌合基因转录后形成一种RNA,所述RNA可在所述第一和第二核苷酸序列之间形成一个双链RNA区域。
15.权利要求8或9的方法,其中所述嵌合基因包括可操纵地连接于在植物中可表达的启动子的一种选自SEQ ID No.1或SEQ ID No.2或SEQ ID No.3或SEQ ID No.4或SEQ ID No.9的核苷酸序列的具有21个连续的核苷酸的第一核苷酸序列和一种与所述第一核苷酸序列互补的第二核苷酸序列,以及参与转录终止和聚腺苷酸化的3’区域,由此在所述嵌合基因转录后形成一种RNA,所述RNA可在所述第一和第二核苷酸序列之间形成一个双链RNA区域。
16.权利要求8至15中任一项的方法,其中所述在植物中可表达的启动子是组成型启动子。
17.权利要求8至15中任一项的方法,其中所述在植物中可表达的启动子是绒毛纤维特异性启动子。
18.权利要求8至17中任一项的方法,其中所述纤维素生物合成在绒毛纤维产生中被降低。
19.一种用于在产纤维植物中增加纤维素生物合成的嵌合基因,其包括以下可操纵连接的DNA片段:
i)在所述植物的所述细胞中可表达的启动子;
ii)编码包括SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质或其变体的DNA区域,所述变体具有同样的酶活性;以及
iii)参与转录终止和聚腺苷酸化的3’端区域。
20.权利要求19的嵌合基因,其中所述产纤维植物是棉属植物。
21.权利要求19或20的嵌合基因,其中所述DNA区域包括SEQID No.2或SEQ ID No.3或SEQ ID No.4的核苷酸序列。
22.权利要求19至21中任一项的嵌合基因,其中所述启动子是组成型启动子。
23.权利要求19至21中任一项的嵌合基因,其中所述启动子是纤维特异性启动子。
24.权利要求19至21中任一项的嵌合基因,其中所述启动子是苹果青霉素启动子。
25.一种用于在产纤维植物中降低纤维素生物合成的嵌合基因,其包含一种可操纵地连接于在植物中可表达的启动子的具有21个连续的核苷酸的核苷酸序列以及参与转录终止和聚腺苷酸化的3,区域,所述核苷酸序列选自编码一种包括SEQ ID No.6或SEQ ID No.7或SEQ IDNo.8的氨基酸序列的蛋白质的核苷酸序列。
26.一种用于在产纤维植物中降低纤维素生物合成的嵌合基因,其包含一种可操纵地连接于在植物中可表达的启动子的具有21个连续的核苷酸的序列以及参与转录终止和聚腺苷酸化的3’区域,所述序列选自SEQ ID No.2或SEQ ID No.3或SEQ ID No.4的核苷酸序列。
27.一种用于在产纤维植物中降低纤维素生物合成的嵌合基因,其包含一种可操纵地连接于在植物中可表达的启动子的具有21个连续的核苷酸的核苷酸序列以及参与转录终止和聚腺苷酸化的3’区域,该序列选自编码一种包括SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质的核苷酸序列的互补序列。
28.一种用于在产纤维植物中降低纤维素生物合成的嵌合基因,其包含一种具有21个连续的核苷酸的核苷酸序列,所述序列选自SEQID No.2或SEQ ID No.3或SEQ ID No.4的核苷酸序列的互补序列。
29.一种用于在产纤维植物中降低纤维素生物合成的嵌合基因,其包含可操纵地连接于在植物中可表达的启动子的一种选自编码一种包括SEQ ID No.6或SEQ ID No.7或SEQ ID No.8的氨基酸序列的蛋白质的核苷酸序列的具有21个连续的核苷酸的第一核苷酸序列和一种与所述第一核苷酸序列互补的第二核苷酸序列,以及参与转录终止和聚腺苷酸化的3’区域,由此在所述嵌合基因转录后形成一种RNA,所述RNA可在所述第一和第二核苷酸序列之间形成一个双链RNA区域。
30.一种用于在产纤维植物中降低纤维素生物合成的嵌合基因,其包含可操纵地连接于在植物中可表达的启动子的一种选自SEQ ID No.2或SEQ ID No.3或SEQ ID No.4的核苷酸序列的具有21个连续的核苷酸的第一核苷酸序列和一种与所述第一核苷酸序列互补的第二核苷酸序列,以及参与转录终止和聚腺苷酸化的3’区域,由此在所述嵌合基因转录后形成一种RNA,所述RNA可在所述第一和第二核苷酸序列之间形成一个双链RNA区域。
31.权利要求25至30中任一项的嵌合基因,其中所述产纤维植物是棉属植物。
32.权利要求25至31中任一项的嵌合基因,其中所述在植物中可表达的启动子是组成型启动子。
33.权利要求25至31中任一项的嵌合基因,其中所述在植物中可表达的启动子是绒毛纤维特异性启动子。
34.一种植物细胞,其包含权利要求19至33中任一项的嵌合基因。
35.一种植物,其包含权利要求34的植物细胞。
36.权利要求35的植物,其中所述植物基本上由包含权利要求19至33中任一项的嵌合基因的植物细胞组成。
37.权利要求31或32的植物的种子,所述种子包含权利要求19至33中任一项的嵌合基因。
38.权利要求19至33中任一项的嵌合基因在调节产纤维植物的纤维素生物合成和纤维品质中的用途。
39.权利要求38的嵌合基因的用途,其中所述产纤维植物是棉属植物。
40.一种用于在包括一种特定植物品种的不同基因型或变种的群体中鉴别编码参与纤维素生物合成的蛋白质的基因的等位基因变异的方法,所述等位基因变异单独地或者组合地与纤维素生产的数量和/品质以及纤维生产有关联,所述方法的步骤包括:
a)提供包含不同等位基因形式的编码包括SEQ ID No.5、6、7或8的氨基酸序列的蛋白质的核苷酸序列的特定植物品种或杂种繁殖植物品种的不同变种或基因型的群体;
b)确定所述群体中每一个个体的与纤维生产和/或纤维素生物合成有关的参数;
c)确定所述群体中每一个个体是否存在一种特定等位基因形式的编码包括SEQ ID No.5、6、7或8的氨基酸序列的蛋白质的核苷酸序列;以及
d)将出现特定的纤维或纤维素参数与是否存在一种特定等位基因形式的所述核苷酸序列或此类等位基因形式的一种特定组合相关联。
41.权利要求40的方法,其中所述植物品种是产纤维植物品种。
42.权利要求41的方法,其中所述产纤维植物品种是棉属植物。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US43267402P | 2002-12-12 | 2002-12-12 | |
US60/432,674 | 2002-12-12 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1732262A true CN1732262A (zh) | 2006-02-08 |
CN100415885C CN100415885C (zh) | 2008-09-03 |
Family
ID=32507980
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB2003801057971A Expired - Fee Related CN100415885C (zh) | 2002-12-12 | 2003-12-12 | 调节产纤维植物中纤维素生物合成的方法和手段 |
Country Status (9)
Country | Link |
---|---|
US (2) | US7482508B2 (zh) |
EP (1) | EP1573010A4 (zh) |
CN (1) | CN100415885C (zh) |
AR (2) | AR042447A1 (zh) |
AU (2) | AU2003285210B2 (zh) |
BR (1) | BR0316748A (zh) |
MX (1) | MXPA05005106A (zh) |
WO (1) | WO2004053129A1 (zh) |
ZA (1) | ZA200503587B (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105612251A (zh) * | 2013-09-24 | 2016-05-25 | 拜尔作物科学公司 | 异转糖基酶及其用途 |
CN106399358A (zh) * | 2016-06-03 | 2017-02-15 | 华南农业大学 | 莲纤维素合酶基因NnuCESA4的应用 |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN100404681C (zh) * | 2005-04-29 | 2008-07-23 | 中国科学院遗传与发育生物学研究所 | 棉花纤维的一种特异表达启动子及其应用 |
UY33705A (es) * | 2010-11-03 | 2012-04-30 | Yissum Res Dev Co | Plantas transgenicas con rendimientos de sacarificacion mejorados y metodos para generarlas |
CN113019015B (zh) * | 2019-12-24 | 2022-05-17 | 东北林业大学 | 一种饼形木纤维陶瓷发动机空气滤芯 |
CN114656542B (zh) * | 2020-12-23 | 2023-03-28 | 中国农业大学 | 玉米抗盐碱蛋白质及其编码基因与应用 |
US20240102017A1 (en) * | 2022-05-10 | 2024-03-28 | Amylyx Pharmaceuticals, Inc. | Oligonucleotide compositions and methods thereof |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB9526613D0 (en) | 1995-12-28 | 1996-02-28 | Scottish Crop Research Inst | Sequence |
FR2763077B1 (fr) | 1997-05-07 | 1999-08-13 | Agronomique Inst Nat Rech | Controle de la croissance cellulaire chez une plante par l'utilisation d'un gene d'endo-1,4-beta-d-glucanase |
JP3914348B2 (ja) | 1998-05-29 | 2007-05-16 | 王子製紙株式会社 | 植物の細胞壁成分を改変する方法 |
US6316698B1 (en) * | 1998-11-10 | 2001-11-13 | E. I. Du Pont De Nemours & Company | Plant alpha-glucosidase II homologs |
US20030221218A1 (en) * | 2002-05-17 | 2003-11-27 | The Regents Of The University Of California | Bioengineering cotton fiber properties |
-
2003
- 2003-12-12 US US10/733,407 patent/US7482508B2/en not_active Expired - Fee Related
- 2003-12-12 EP EP03778160A patent/EP1573010A4/en not_active Withdrawn
- 2003-12-12 MX MXPA05005106A patent/MXPA05005106A/es active IP Right Grant
- 2003-12-12 AU AU2003285210A patent/AU2003285210B2/en not_active Ceased
- 2003-12-12 AR ARP030104589A patent/AR042447A1/es not_active Application Discontinuation
- 2003-12-12 BR BR0316748-8A patent/BR0316748A/pt not_active IP Right Cessation
- 2003-12-12 WO PCT/AU2003/001660 patent/WO2004053129A1/en not_active Application Discontinuation
- 2003-12-12 CN CNB2003801057971A patent/CN100415885C/zh not_active Expired - Fee Related
-
2005
- 2005-05-05 ZA ZA200503587A patent/ZA200503587B/en unknown
-
2007
- 2007-09-21 US US11/902,478 patent/US8049066B2/en not_active Expired - Fee Related
-
2009
- 2009-12-15 AU AU2009250962A patent/AU2009250962A1/en not_active Abandoned
-
2010
- 2010-11-10 AR ARP100104163A patent/AR078957A2/es unknown
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105612251A (zh) * | 2013-09-24 | 2016-05-25 | 拜尔作物科学公司 | 异转糖基酶及其用途 |
CN105612251B (zh) * | 2013-09-24 | 2021-01-26 | 拜尔作物科学公司 | 异转糖基酶及其用途 |
CN106399358A (zh) * | 2016-06-03 | 2017-02-15 | 华南农业大学 | 莲纤维素合酶基因NnuCESA4的应用 |
Also Published As
Publication number | Publication date |
---|---|
EP1573010A1 (en) | 2005-09-14 |
AR042447A1 (es) | 2005-06-22 |
AU2009250962A1 (en) | 2010-01-14 |
MXPA05005106A (es) | 2005-12-14 |
WO2004053129A1 (en) | 2004-06-24 |
CN100415885C (zh) | 2008-09-03 |
AU2003285210B2 (en) | 2009-10-29 |
BR0316748A (pt) | 2005-10-18 |
US7482508B2 (en) | 2009-01-27 |
AU2003285210A1 (en) | 2004-06-30 |
US8049066B2 (en) | 2011-11-01 |
US20080235825A1 (en) | 2008-09-25 |
AR078957A2 (es) | 2011-12-14 |
US20040268433A1 (en) | 2004-12-30 |
ZA200503587B (en) | 2006-10-25 |
EP1573010A4 (en) | 2007-10-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1202246C (zh) | 获得修饰表型的方法和措施 | |
CN1249245C (zh) | 种子散失 | |
CN1170940C (zh) | 生长得到改造的植物以及获得此种植物的方法 | |
CN1250730C (zh) | 提高棉纤维品质的方法 | |
CN1289680C (zh) | 调节真核细胞中程序性细胞死亡的方法和手段 | |
CN1202255C (zh) | 编码具有果糖基转移酶活性的蛋白质的核酸分子及制备长链菊粉的方法 | |
CN1750752A (zh) | 使用短dsRNA序列在植物中进行有效的基因沉默 | |
CN1705748A (zh) | 用于增加植物中总的油类水平的方法 | |
CN1201012C (zh) | 改变植物开花时间的方法 | |
CN1624135A (zh) | 编码植物脱氧海普赖氨酸合成酶的dna,植物真核起始因子5a,转基因植物和控制植物衰老和程序性细胞死亡的方法 | |
CN1169960C (zh) | 非天然存在的植物细胞、包含其的植物组织及其产生方法 | |
CN1753992A (zh) | 种子中蛋白质含量降低的植物及其制备方法和利用方法 | |
CN1246894A (zh) | 具有降低的α葡聚糖L-或H-型块茎磷酸化酶活性水平的低冷甜化转基因马铃薯 | |
CN1487997A (zh) | 诱导植物开花的hd3a基因及其应用 | |
CN1891827A (zh) | 具有迟发性种子散布的特性的种子植物 | |
CN1313616C (zh) | 表达葡糖基转移酶核酸的转基因细胞 | |
CN101037693A (zh) | 新的CkNHX基因及其剪切修饰基因CkNHXn,以及培育耐逆植物的方法 | |
CN1861791A (zh) | 水稻抗白叶枯病隐性基因xal3和它的等位显性基因Xal3 | |
CN1639337A (zh) | 具有催泪成分合成酶活性的蛋白质或多肽、编码该蛋白质或多肽的 DNA、利用该 DNA制造具有催泪成分合成酶活性的蛋白质或多肽的制造方法以及具有抑制该蛋白质或多肽的mRNA翻译的功能的核酸分子 | |
CN1732262A (zh) | 调节产纤维植物中纤维素生物合成的方法和手段 | |
CN1283796C (zh) | 编码半胱氨酸蛋白酶的基因和启动子及生产雄性不育水稻的方法 | |
CN1240837C (zh) | 植物纤维的修饰 | |
CN1202254C (zh) | 水稻抗白叶枯病基因Xa26(t) | |
CN1198921C (zh) | 棉子糖合成酶基因、棉子糖的制备方法及其转化的植物 | |
CN1860230A (zh) | 赋予对黄单胞菌引起的细菌性黑枯病的抗性的来自水稻的核酸 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C17 | Cessation of patent right | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20080903 Termination date: 20111212 |