CN101031650A - 改变种子表型的方法和组合物 - Google Patents
改变种子表型的方法和组合物 Download PDFInfo
- Publication number
- CN101031650A CN101031650A CNA2004800373016A CN200480037301A CN101031650A CN 101031650 A CN101031650 A CN 101031650A CN A2004800373016 A CNA2004800373016 A CN A2004800373016A CN 200480037301 A CN200480037301 A CN 200480037301A CN 101031650 A CN101031650 A CN 101031650A
- Authority
- CN
- China
- Prior art keywords
- plant
- nucleic acid
- ser
- glu
- lys
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims description 103
- 239000000203 mixture Substances 0.000 title description 21
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 claims abstract description 308
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 191
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 185
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 185
- 229940104302 cytosine Drugs 0.000 claims abstract description 141
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 56
- 229920001184 polypeptide Polymers 0.000 claims abstract description 41
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 40
- 241000196324 Embryophyta Species 0.000 claims description 321
- 239000002773 nucleotide Substances 0.000 claims description 168
- 125000003729 nucleotide group Chemical group 0.000 claims description 168
- 108020004414 DNA Proteins 0.000 claims description 137
- 230000001105 regulatory effect Effects 0.000 claims description 70
- 210000004027 cell Anatomy 0.000 claims description 52
- 230000007067 DNA methylation Effects 0.000 claims description 40
- 230000010152 pollination Effects 0.000 claims description 37
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 29
- 240000007594 Oryza sativa Species 0.000 claims description 27
- 240000008042 Zea mays Species 0.000 claims description 27
- 230000000692 anti-sense effect Effects 0.000 claims description 27
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 22
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 claims description 19
- 235000005822 corn Nutrition 0.000 claims description 19
- 238000009396 hybridization Methods 0.000 claims description 19
- 230000009261 transgenic effect Effects 0.000 claims description 18
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 17
- 239000000463 material Substances 0.000 claims description 17
- 230000000295 complement effect Effects 0.000 claims description 16
- 241000209510 Liliopsida Species 0.000 claims description 14
- 241001233957 eudicotyledons Species 0.000 claims description 13
- 230000002452 interceptive effect Effects 0.000 claims description 13
- 230000008569 process Effects 0.000 claims description 13
- 230000010153 self-pollination Effects 0.000 claims description 11
- 102000002322 Egg Proteins Human genes 0.000 claims description 6
- 108010000912 Egg Proteins Proteins 0.000 claims description 6
- 210000004681 ovum Anatomy 0.000 claims description 6
- 210000002711 centrocyte Anatomy 0.000 claims description 5
- 210000001161 mammalian embryo Anatomy 0.000 claims description 4
- 230000008521 reorganization Effects 0.000 claims 1
- 230000014509 gene expression Effects 0.000 abstract description 35
- 108060004795 Methyltransferase Proteins 0.000 abstract description 6
- 102000016397 Methyltransferase Human genes 0.000 abstract description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 36
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 35
- 108010057821 leucylproline Proteins 0.000 description 34
- 230000000875 corresponding effect Effects 0.000 description 32
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 30
- 229940024606 amino acid Drugs 0.000 description 28
- 235000001014 amino acid Nutrition 0.000 description 28
- 108090000623 proteins and genes Proteins 0.000 description 28
- 150000001413 amino acids Chemical group 0.000 description 27
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 27
- 108010050848 glycylleucine Proteins 0.000 description 27
- 235000007164 Oryza sativa Nutrition 0.000 description 26
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 22
- 108010062796 arginyllysine Proteins 0.000 description 21
- 108010038633 aspartylglutamate Proteins 0.000 description 21
- 235000009566 rice Nutrition 0.000 description 21
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 20
- 108010054155 lysyllysine Proteins 0.000 description 20
- 108010034529 leucyl-lysine Proteins 0.000 description 19
- 241000219195 Arabidopsis thaliana Species 0.000 description 17
- 101100170937 Mus musculus Dnmt1 gene Proteins 0.000 description 17
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 17
- 241000219198 Brassica Species 0.000 description 16
- 241000880493 Leptailurus serval Species 0.000 description 16
- 108091030071 RNAI Proteins 0.000 description 16
- 108010047495 alanylglycine Proteins 0.000 description 16
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 16
- 230000009368 gene silencing by RNA Effects 0.000 description 16
- 108010049041 glutamylalanine Proteins 0.000 description 16
- 235000003351 Brassica cretica Nutrition 0.000 description 15
- 235000003343 Brassica rupestris Nutrition 0.000 description 15
- 240000001307 Myosotis scorpioides Species 0.000 description 15
- QKSKPIVNLNLAAV-UHFFFAOYSA-N bis(2-chloroethyl) sulfide Chemical compound ClCCSCCCl QKSKPIVNLNLAAV-UHFFFAOYSA-N 0.000 description 15
- 108010009298 lysylglutamic acid Proteins 0.000 description 15
- 108010064235 lysylglycine Proteins 0.000 description 15
- 235000010460 mustard Nutrition 0.000 description 15
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 14
- 230000008859 change Effects 0.000 description 14
- 108010078144 glutaminyl-glycine Proteins 0.000 description 14
- 108010017391 lysylvaline Proteins 0.000 description 14
- 108010070643 prolylglutamic acid Proteins 0.000 description 14
- 238000004458 analytical method Methods 0.000 description 13
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 13
- 108020003589 5' Untranslated Regions Proteins 0.000 description 12
- 108010078791 Carrier Proteins Proteins 0.000 description 12
- 102000004190 Enzymes Human genes 0.000 description 12
- 108090000790 Enzymes Proteins 0.000 description 12
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 12
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 12
- 108010092854 aspartyllysine Proteins 0.000 description 12
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 12
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 11
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 11
- 108010013835 arginine glutamate Proteins 0.000 description 11
- 108010047857 aspartylglycine Proteins 0.000 description 11
- 230000002018 overexpression Effects 0.000 description 11
- 235000018102 proteins Nutrition 0.000 description 11
- 102000004169 proteins and genes Human genes 0.000 description 11
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 10
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 10
- 108010079364 N-glycylalanine Proteins 0.000 description 10
- 108010041407 alanylaspartic acid Proteins 0.000 description 10
- 108010005233 alanylglutamic acid Proteins 0.000 description 10
- 108010087924 alanylproline Proteins 0.000 description 10
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 10
- 108010010147 glycylglutamine Proteins 0.000 description 10
- 108010015792 glycyllysine Proteins 0.000 description 10
- 238000003752 polymerase chain reaction Methods 0.000 description 10
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 9
- 235000010469 Glycine max Nutrition 0.000 description 9
- 244000068988 Glycine max Species 0.000 description 9
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 9
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 9
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 9
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 9
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 9
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 9
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 9
- 240000003768 Solanum lycopersicum Species 0.000 description 9
- 206010000210 abortion Diseases 0.000 description 9
- 231100000176 abortion Toxicity 0.000 description 9
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 9
- 108010093581 aspartyl-proline Proteins 0.000 description 9
- 150000001875 compounds Chemical class 0.000 description 9
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 9
- 108010090894 prolylleucine Proteins 0.000 description 9
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 8
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 8
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 8
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 8
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 8
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 8
- 241000208125 Nicotiana Species 0.000 description 8
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 8
- 240000004713 Pisum sativum Species 0.000 description 8
- 235000010582 Pisum sativum Nutrition 0.000 description 8
- 241000169446 Promethis Species 0.000 description 8
- 241000209140 Triticum Species 0.000 description 8
- 235000021307 Triticum Nutrition 0.000 description 8
- 239000002253 acid Substances 0.000 description 8
- 230000001143 conditioned effect Effects 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 8
- 108010087823 glycyltyrosine Proteins 0.000 description 8
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 8
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 8
- 108020004999 messenger RNA Proteins 0.000 description 8
- 108010026333 seryl-proline Proteins 0.000 description 8
- 108010051110 tyrosyl-lysine Proteins 0.000 description 8
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 7
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 7
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 7
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 7
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 7
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 7
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 7
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 7
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 7
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 7
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 7
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 7
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 7
- 108010060035 arginylproline Proteins 0.000 description 7
- 108010079547 glutamylmethionine Proteins 0.000 description 7
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 7
- 108010037850 glycylvaline Proteins 0.000 description 7
- 108010036413 histidylglycine Proteins 0.000 description 7
- 108010092114 histidylphenylalanine Proteins 0.000 description 7
- 108010051242 phenylalanylserine Proteins 0.000 description 7
- 108010053725 prolylvaline Proteins 0.000 description 7
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 7
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 6
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 6
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 6
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 6
- 244000144730 Amygdalus persica Species 0.000 description 6
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 6
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 6
- 108091035707 Consensus sequence Proteins 0.000 description 6
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 6
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 6
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 6
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 6
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 6
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 6
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 6
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 6
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 6
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 6
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 6
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 6
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 6
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 6
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 6
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 6
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 6
- IYXDSYWCVVXSKB-CIUDSAMLSA-N Met-Asn-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IYXDSYWCVVXSKB-CIUDSAMLSA-N 0.000 description 6
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 6
- MRNRMSDVVSKPGM-AVGNSLFASA-N Phe-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRNRMSDVVSKPGM-AVGNSLFASA-N 0.000 description 6
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 6
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 6
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 6
- 235000006040 Prunus persica var persica Nutrition 0.000 description 6
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 6
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 6
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 6
- 108091036066 Three prime untranslated region Proteins 0.000 description 6
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 6
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 108010071207 serylmethionine Proteins 0.000 description 6
- 108010061238 threonyl-glycine Proteins 0.000 description 6
- 108010003137 tyrosyltyrosine Proteins 0.000 description 6
- 238000011144 upstream manufacturing Methods 0.000 description 6
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 5
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 5
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 5
- PVSNBTCXCQIXSE-JYJNAYRXSA-N Arg-Arg-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PVSNBTCXCQIXSE-JYJNAYRXSA-N 0.000 description 5
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 5
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 5
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 5
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 5
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 5
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 5
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 5
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 5
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 5
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 5
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 5
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 5
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 5
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 5
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 5
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 5
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 5
- SAEBUDRWKUXLOM-ACZMJKKPSA-N Glu-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O SAEBUDRWKUXLOM-ACZMJKKPSA-N 0.000 description 5
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 5
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 5
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 5
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 5
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 5
- DXUJSRIVSWEOAG-NAKRPEOUSA-N Ile-Arg-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N DXUJSRIVSWEOAG-NAKRPEOUSA-N 0.000 description 5
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 5
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 5
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 5
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 5
- 108010065920 Insulin Lispro Proteins 0.000 description 5
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 5
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 5
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 5
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 5
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 5
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 5
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 5
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 5
- OYHQOLUKZRVURQ-HZJYTTRNSA-N Linoleic acid Chemical compound CCCCC\C=C/C\C=C/CCCCCCCC(O)=O OYHQOLUKZRVURQ-HZJYTTRNSA-N 0.000 description 5
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 5
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 5
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 5
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 5
- SBSIKVMCCJUCBZ-GUBZILKMSA-N Met-Asn-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N SBSIKVMCCJUCBZ-GUBZILKMSA-N 0.000 description 5
- UJDMTKHGWSBHBX-IHRRRGAJSA-N Met-Cys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UJDMTKHGWSBHBX-IHRRRGAJSA-N 0.000 description 5
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 5
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 5
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 5
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 5
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 5
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 5
- ZTVCLZLGHZXLOT-ULQDDVLXSA-N Pro-Glu-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O ZTVCLZLGHZXLOT-ULQDDVLXSA-N 0.000 description 5
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 5
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 5
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 5
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 5
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 5
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 5
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 5
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 5
- QFCQNHITJPRQTB-IEGACIPQSA-N Thr-Lys-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O QFCQNHITJPRQTB-IEGACIPQSA-N 0.000 description 5
- YTYHAYZPOARHAP-HOCLYGCPSA-N Trp-Lys-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N YTYHAYZPOARHAP-HOCLYGCPSA-N 0.000 description 5
- AGDDLOQMXUQPDY-BZSNNMDCSA-N Tyr-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O AGDDLOQMXUQPDY-BZSNNMDCSA-N 0.000 description 5
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 5
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 5
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 5
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 5
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 5
- 108010070783 alanyltyrosine Proteins 0.000 description 5
- DTOSIQBPPRVQHS-PDBXOOCHSA-N alpha-linolenic acid Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCC(O)=O DTOSIQBPPRVQHS-PDBXOOCHSA-N 0.000 description 5
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 5
- 230000021759 endosperm development Effects 0.000 description 5
- 239000012634 fragment Substances 0.000 description 5
- 230000002068 genetic effect Effects 0.000 description 5
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 5
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 5
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 5
- 108010003700 lysyl aspartic acid Proteins 0.000 description 5
- -1 methane amide Chemical class 0.000 description 5
- 108010084572 phenylalanyl-valine Proteins 0.000 description 5
- 239000000523 sample Substances 0.000 description 5
- 230000007226 seed germination Effects 0.000 description 5
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 5
- 241000894007 species Species 0.000 description 5
- 230000009466 transformation Effects 0.000 description 5
- 108010073969 valyllysine Proteins 0.000 description 5
- 108010009962 valyltyrosine Proteins 0.000 description 5
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 4
- ZBMRKNMTMPPMMK-UHFFFAOYSA-N 2-amino-4-[hydroxy(methyl)phosphoryl]butanoic acid;azane Chemical compound [NH4+].CP(O)(=O)CCC(N)C([O-])=O ZBMRKNMTMPPMMK-UHFFFAOYSA-N 0.000 description 4
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 4
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 4
- NFDVJAKFMXHJEQ-HERUPUMHSA-N Ala-Asp-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NFDVJAKFMXHJEQ-HERUPUMHSA-N 0.000 description 4
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 4
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 4
- 241000219194 Arabidopsis Species 0.000 description 4
- 108700040775 Arabidopsis MET1 Proteins 0.000 description 4
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 4
- QEHMMRSQJMOYNO-DCAQKATOSA-N Arg-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QEHMMRSQJMOYNO-DCAQKATOSA-N 0.000 description 4
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 4
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 4
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 4
- SGAUXNZEFIEAAI-GARJFASQSA-N Asn-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)C(=O)O SGAUXNZEFIEAAI-GARJFASQSA-N 0.000 description 4
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 4
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 4
- AKPLMZMNJGNUKT-ZLUOBGJFSA-N Asp-Asp-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O AKPLMZMNJGNUKT-ZLUOBGJFSA-N 0.000 description 4
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 4
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 4
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 4
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 4
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 4
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 4
- 108090000994 Catalytic RNA Proteins 0.000 description 4
- 102000053642 Catalytic RNA Human genes 0.000 description 4
- AMRLSQGGERHDHJ-FXQIFTODSA-N Cys-Ala-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMRLSQGGERHDHJ-FXQIFTODSA-N 0.000 description 4
- KEBJBKIASQVRJS-WDSKDSINSA-N Cys-Gln-Gly Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N KEBJBKIASQVRJS-WDSKDSINSA-N 0.000 description 4
- OETOANMAHTWESF-KKUMJFAQSA-N Cys-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CS)N OETOANMAHTWESF-KKUMJFAQSA-N 0.000 description 4
- 230000030933 DNA methylation on cytosine Effects 0.000 description 4
- 241000206602 Eukaryota Species 0.000 description 4
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 4
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 4
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 4
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 4
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 4
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 4
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 4
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 4
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 4
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 4
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 4
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 4
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 4
- ZKONLKQGTNVAPR-DCAQKATOSA-N Glu-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N ZKONLKQGTNVAPR-DCAQKATOSA-N 0.000 description 4
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 4
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 4
- QEJKKJNDDDPSMU-KKUMJFAQSA-N Glu-Tyr-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O QEJKKJNDDDPSMU-KKUMJFAQSA-N 0.000 description 4
- LSYFGBRDBIQYAQ-FHWLQOOXSA-N Glu-Tyr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LSYFGBRDBIQYAQ-FHWLQOOXSA-N 0.000 description 4
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 4
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 4
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 4
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 4
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 4
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 4
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 4
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 4
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 4
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 4
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 4
- HDXNWVLQSQFJOX-SRVKXCTJSA-N His-Arg-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HDXNWVLQSQFJOX-SRVKXCTJSA-N 0.000 description 4
- PYNUBZSXKQKAHL-UWVGGRQHSA-N His-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O PYNUBZSXKQKAHL-UWVGGRQHSA-N 0.000 description 4
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 4
- DEOQGJUXUQGUJN-KKUMJFAQSA-N His-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DEOQGJUXUQGUJN-KKUMJFAQSA-N 0.000 description 4
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 4
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 4
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 4
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 4
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 4
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 4
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 4
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 4
- JTBFQNHKNRZJDS-SYWGBEHUSA-N Ile-Trp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C)C(=O)O)N JTBFQNHKNRZJDS-SYWGBEHUSA-N 0.000 description 4
- WKSHBPRUIRGWRZ-KCTSRDHCSA-N Ile-Trp-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N WKSHBPRUIRGWRZ-KCTSRDHCSA-N 0.000 description 4
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 4
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 4
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 4
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 4
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 4
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 4
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 4
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 4
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 4
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 4
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 4
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 4
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 4
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 4
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 4
- ZAENPHCEQXALHO-GUBZILKMSA-N Lys-Cys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZAENPHCEQXALHO-GUBZILKMSA-N 0.000 description 4
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 4
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 4
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 4
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 4
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 4
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 4
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 4
- XMQZLGBUJMMODC-AVGNSLFASA-N Met-His-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O XMQZLGBUJMMODC-AVGNSLFASA-N 0.000 description 4
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 4
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 4
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 4
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 4
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 4
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 4
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 4
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 4
- DIZLUAZLNDFDPR-CIUDSAMLSA-N Pro-Cys-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 DIZLUAZLNDFDPR-CIUDSAMLSA-N 0.000 description 4
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 4
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 4
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 4
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 4
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 4
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 4
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 4
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 4
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 4
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 4
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 4
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 4
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 4
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 4
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 4
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 4
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 4
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 4
- VAIWUNAAPZZGRI-IHPCNDPISA-N Ser-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N VAIWUNAAPZZGRI-IHPCNDPISA-N 0.000 description 4
- 244000062793 Sorghum vulgare Species 0.000 description 4
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 4
- LKEKWDJCJSPXNI-IRIUXVKKSA-N Thr-Glu-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LKEKWDJCJSPXNI-IRIUXVKKSA-N 0.000 description 4
- 108700009124 Transcription Initiation Site Proteins 0.000 description 4
- BSSJIVIFAJKLEK-XIRDDKMYSA-N Trp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BSSJIVIFAJKLEK-XIRDDKMYSA-N 0.000 description 4
- OTWIOROMZLNAQC-XIRDDKMYSA-N Trp-His-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OTWIOROMZLNAQC-XIRDDKMYSA-N 0.000 description 4
- IVBJBFSWJDNQFW-XIRDDKMYSA-N Trp-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IVBJBFSWJDNQFW-XIRDDKMYSA-N 0.000 description 4
- CRWOSTCODDFEKZ-HRCADAONSA-N Tyr-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CRWOSTCODDFEKZ-HRCADAONSA-N 0.000 description 4
- HGEHWFGAKHSIDY-SRVKXCTJSA-N Tyr-Asp-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O HGEHWFGAKHSIDY-SRVKXCTJSA-N 0.000 description 4
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 4
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 4
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 4
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 4
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 4
- 235000007244 Zea mays Nutrition 0.000 description 4
- 108010044940 alanylglutamine Proteins 0.000 description 4
- 235000020661 alpha-linolenic acid Nutrition 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 108010008355 arginyl-glutamine Proteins 0.000 description 4
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 4
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 4
- 108010089804 glycyl-threonine Proteins 0.000 description 4
- 230000012010 growth Effects 0.000 description 4
- 108010085325 histidylproline Proteins 0.000 description 4
- 108010018006 histidylserine Proteins 0.000 description 4
- 230000005764 inhibitory process Effects 0.000 description 4
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 4
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 4
- 229960004232 linoleic acid Drugs 0.000 description 4
- 229960004488 linolenic acid Drugs 0.000 description 4
- KQQKGWQCNNTQJW-UHFFFAOYSA-N linolenic acid Natural products CC=CCCC=CCC=CCCCCCCCC(O)=O KQQKGWQCNNTQJW-UHFFFAOYSA-N 0.000 description 4
- 108010072591 lysyl-leucyl-alanyl-arginine Proteins 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 108010022588 methionyl-lysyl-proline Proteins 0.000 description 4
- 108010012581 phenylalanylglutamate Proteins 0.000 description 4
- 108010073101 phenylalanylleucine Proteins 0.000 description 4
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 4
- 239000002243 precursor Substances 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 108010031719 prolyl-serine Proteins 0.000 description 4
- 108010029020 prolylglycine Proteins 0.000 description 4
- 230000003252 repetitive effect Effects 0.000 description 4
- 230000002441 reversible effect Effects 0.000 description 4
- 108091092562 ribozyme Proteins 0.000 description 4
- 238000011218 seed culture Methods 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- TUNFSRHWOTWDNC-UHFFFAOYSA-N tetradecanoic acid Chemical compound CCCCCCCCCCCCCC(O)=O TUNFSRHWOTWDNC-UHFFFAOYSA-N 0.000 description 4
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 4
- 238000013519 translation Methods 0.000 description 4
- 108010084932 tryptophyl-proline Proteins 0.000 description 4
- 108010038745 tryptophylglycine Proteins 0.000 description 4
- CNKBMTKICGGSCQ-ACRUOGEOSA-N (2S)-2-[[(2S)-2-[[(2S)-2,6-diamino-1-oxohexyl]amino]-1-oxo-3-phenylpropyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CNKBMTKICGGSCQ-ACRUOGEOSA-N 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 3
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 3
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 3
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 3
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 3
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 3
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 3
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 3
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 3
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 3
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 3
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 3
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 3
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 3
- QDGMZAOSMNGBLP-MRFFXTKBSA-N Ala-Trp-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N QDGMZAOSMNGBLP-MRFFXTKBSA-N 0.000 description 3
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 3
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 3
- 101100170933 Arabidopsis thaliana DMT1 gene Proteins 0.000 description 3
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 3
- QIWYWCYNUMJBTC-CIUDSAMLSA-N Arg-Cys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIWYWCYNUMJBTC-CIUDSAMLSA-N 0.000 description 3
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 3
- NYZGVTGOMPHSJW-CIUDSAMLSA-N Arg-Glu-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N NYZGVTGOMPHSJW-CIUDSAMLSA-N 0.000 description 3
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 3
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 3
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 3
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 3
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 3
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 3
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 3
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 3
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 3
- HUZGPXBILPMCHM-IHRRRGAJSA-N Asn-Arg-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HUZGPXBILPMCHM-IHRRRGAJSA-N 0.000 description 3
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 3
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 3
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 3
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 3
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 3
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 3
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 3
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 3
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 3
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 3
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 3
- 244000075850 Avena orientalis Species 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 3
- VIRYODQIWJNWNU-NRPADANISA-N Cys-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N VIRYODQIWJNWNU-NRPADANISA-N 0.000 description 3
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 3
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 3
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 3
- BETSEXMYBWCDAE-SZMVWBNQSA-N Gln-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BETSEXMYBWCDAE-SZMVWBNQSA-N 0.000 description 3
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 3
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 3
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 3
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 3
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 3
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 3
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 3
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 3
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 3
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 3
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 3
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 3
- NPMSEUWUMOSEFM-CIUDSAMLSA-N Glu-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N NPMSEUWUMOSEFM-CIUDSAMLSA-N 0.000 description 3
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 3
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 3
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 3
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 3
- OLTHVCNYJAALPL-BHYGNILZSA-N Glu-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OLTHVCNYJAALPL-BHYGNILZSA-N 0.000 description 3
- QGAJQIGFFIQJJK-IHRRRGAJSA-N Glu-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QGAJQIGFFIQJJK-IHRRRGAJSA-N 0.000 description 3
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 3
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 3
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 3
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 3
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 3
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 3
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 3
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 3
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 3
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 3
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 3
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 3
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 3
- VLIJYPMATZSOLL-YUMQZZPRSA-N Gly-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VLIJYPMATZSOLL-YUMQZZPRSA-N 0.000 description 3
- QLQDIJBYJZKQPR-BQBZGAKWSA-N Gly-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN QLQDIJBYJZKQPR-BQBZGAKWSA-N 0.000 description 3
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 3
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 3
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 3
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 3
- FPNWKONEZAVQJF-GUBZILKMSA-N His-Asn-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FPNWKONEZAVQJF-GUBZILKMSA-N 0.000 description 3
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 3
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 3
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 3
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 3
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 3
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 3
- JXMSHKFPDIUYGS-SIUGBPQLSA-N Ile-Glu-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N JXMSHKFPDIUYGS-SIUGBPQLSA-N 0.000 description 3
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 3
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 3
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 3
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 3
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 3
- AKQFLPNANHNTLP-VKOGCVSHSA-N Ile-Pro-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N AKQFLPNANHNTLP-VKOGCVSHSA-N 0.000 description 3
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 3
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 3
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 3
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 3
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 3
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 3
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 3
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 3
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 3
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 3
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 3
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 3
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 3
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 3
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 3
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 3
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 3
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 3
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 3
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 3
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 3
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 3
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 3
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 3
- VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 description 3
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 3
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 3
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 3
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 3
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 3
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 3
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 3
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 3
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 3
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 3
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 3
- CNXOBMMOYZPPGS-NUTKFTJISA-N Lys-Trp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O CNXOBMMOYZPPGS-NUTKFTJISA-N 0.000 description 3
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 3
- 241000220225 Malus Species 0.000 description 3
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 3
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 3
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 3
- MUBZPKHOEPUJKR-UHFFFAOYSA-N Oxalic acid Chemical compound OC(=O)C(O)=O MUBZPKHOEPUJKR-UHFFFAOYSA-N 0.000 description 3
- 244000046052 Phaseolus vulgaris Species 0.000 description 3
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 3
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 3
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 3
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 3
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 3
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 3
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 3
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 3
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 3
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 3
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 3
- JUJCUYWRJMFJJF-AVGNSLFASA-N Pro-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 JUJCUYWRJMFJJF-AVGNSLFASA-N 0.000 description 3
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 3
- QCMYJBKTMIWZAP-AVGNSLFASA-N Pro-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 QCMYJBKTMIWZAP-AVGNSLFASA-N 0.000 description 3
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 3
- DMNANGOFEUVBRV-GJZGRUSLSA-N Pro-Trp-Gly Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)O)C(=O)[C@@H]1CCCN1 DMNANGOFEUVBRV-GJZGRUSLSA-N 0.000 description 3
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 3
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 3
- 241000220324 Pyrus Species 0.000 description 3
- 241000209056 Secale Species 0.000 description 3
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 3
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 3
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 3
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 3
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 3
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 3
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 3
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 3
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 3
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 3
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 3
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 3
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 3
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 3
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 3
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 3
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 3
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 3
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 3
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 3
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 3
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 3
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 3
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 3
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 3
- 244000098338 Triticum aestivum Species 0.000 description 3
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 3
- VMBBTANKMSRJSS-JSGCOSHPSA-N Trp-Glu-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VMBBTANKMSRJSS-JSGCOSHPSA-N 0.000 description 3
- OZUJUVFWMHTWCZ-HOCLYGCPSA-N Trp-Gly-His Chemical compound N[C@@H](Cc1c[nH]c2ccccc12)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OZUJUVFWMHTWCZ-HOCLYGCPSA-N 0.000 description 3
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 3
- FFCRCJZJARTYCG-KKUMJFAQSA-N Tyr-Cys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O FFCRCJZJARTYCG-KKUMJFAQSA-N 0.000 description 3
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 3
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 3
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 3
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 3
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 3
- FGVFBDZSGQTYQX-UFYCRDLUSA-N Tyr-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O FGVFBDZSGQTYQX-UFYCRDLUSA-N 0.000 description 3
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 3
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 3
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 3
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 3
- MYLNLEIZWHVENT-VKOGCVSHSA-N Val-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N MYLNLEIZWHVENT-VKOGCVSHSA-N 0.000 description 3
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 3
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 3
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 3
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 3
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 3
- SDHZOOIGIUEPDY-JYJNAYRXSA-N Val-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 SDHZOOIGIUEPDY-JYJNAYRXSA-N 0.000 description 3
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 3
- 241000482268 Zea mays subsp. mays Species 0.000 description 3
- 150000007513 acids Chemical class 0.000 description 3
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 3
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 3
- 108010011559 alanylphenylalanine Proteins 0.000 description 3
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- GHVNFZFCNZKVNT-UHFFFAOYSA-N decanoic acid Chemical compound CCCCCCCCCC(O)=O GHVNFZFCNZKVNT-UHFFFAOYSA-N 0.000 description 3
- 230000007812 deficiency Effects 0.000 description 3
- 230000029087 digestion Effects 0.000 description 3
- 238000006073 displacement reaction Methods 0.000 description 3
- POULHZVOKOAJMA-UHFFFAOYSA-N dodecanoic acid Chemical compound CCCCCCCCCCCC(O)=O POULHZVOKOAJMA-UHFFFAOYSA-N 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 3
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 3
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 3
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 108010028295 histidylhistidine Proteins 0.000 description 3
- 230000009027 insemination Effects 0.000 description 3
- 238000007689 inspection Methods 0.000 description 3
- 108010027338 isoleucylcysteine Proteins 0.000 description 3
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- HEBKCHPVOIAQTA-UHFFFAOYSA-N meso ribitol Natural products OCC(O)C(O)C(O)CO HEBKCHPVOIAQTA-UHFFFAOYSA-N 0.000 description 3
- 229960004452 methionine Drugs 0.000 description 3
- 108010005942 methionylglycine Proteins 0.000 description 3
- 108010068488 methionylphenylalanine Proteins 0.000 description 3
- 108010018625 phenylalanylarginine Proteins 0.000 description 3
- 108010025488 pinealon Proteins 0.000 description 3
- 230000008488 polyadenylation Effects 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- 108010005652 splenotritin Proteins 0.000 description 3
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 3
- 108010044292 tryptophyltyrosine Proteins 0.000 description 3
- RRBGTUQJDFBWNN-MUGJNUQGSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-2,6-diaminohexanoyl]amino]hexanoyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O RRBGTUQJDFBWNN-MUGJNUQGSA-N 0.000 description 2
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 2
- 235000016626 Agrimonia eupatoria Nutrition 0.000 description 2
- 244000307697 Agrimonia eupatoria Species 0.000 description 2
- 241000589158 Agrobacterium Species 0.000 description 2
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 2
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 2
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 2
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 2
- WQVYAWIMAWTGMW-ZLUOBGJFSA-N Ala-Asp-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WQVYAWIMAWTGMW-ZLUOBGJFSA-N 0.000 description 2
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 2
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 2
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 2
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 2
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 2
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 2
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 2
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 2
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 2
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 2
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 2
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 2
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 2
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 2
- OMCKWYSDUQBYCN-FXQIFTODSA-N Ala-Ser-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O OMCKWYSDUQBYCN-FXQIFTODSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 2
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 2
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 2
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 2
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 2
- ANNKVZSFQJGVDY-XUXIUFHCSA-N Ala-Val-Pro-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ANNKVZSFQJGVDY-XUXIUFHCSA-N 0.000 description 2
- 235000003840 Amygdalus nana Nutrition 0.000 description 2
- 235000011446 Amygdalus persica Nutrition 0.000 description 2
- 235000001271 Anacardium Nutrition 0.000 description 2
- 241000693997 Anacardium Species 0.000 description 2
- 235000003911 Arachis Nutrition 0.000 description 2
- 244000105624 Arachis hypogaea Species 0.000 description 2
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 2
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 2
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 2
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 2
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 2
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 2
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 2
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 2
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 2
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 2
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 2
- MMGCRPZQZWTZTA-IHRRRGAJSA-N Arg-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N MMGCRPZQZWTZTA-IHRRRGAJSA-N 0.000 description 2
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 2
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 2
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 2
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 2
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 2
- JQHASVQBAKRJKD-GUBZILKMSA-N Arg-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JQHASVQBAKRJKD-GUBZILKMSA-N 0.000 description 2
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 2
- GMCOADLDNLGOFE-ZLUOBGJFSA-N Asn-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N GMCOADLDNLGOFE-ZLUOBGJFSA-N 0.000 description 2
- HUAOKVVEVHACHR-CIUDSAMLSA-N Asn-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N HUAOKVVEVHACHR-CIUDSAMLSA-N 0.000 description 2
- PAXHINASXXXILC-SRVKXCTJSA-N Asn-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)O PAXHINASXXXILC-SRVKXCTJSA-N 0.000 description 2
- HLTLEIXYIJDFOY-ZLUOBGJFSA-N Asn-Cys-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O HLTLEIXYIJDFOY-ZLUOBGJFSA-N 0.000 description 2
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 2
- IHUJUZBUOFTIOB-QEJZJMRPSA-N Asn-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N IHUJUZBUOFTIOB-QEJZJMRPSA-N 0.000 description 2
- MECFLTFREHAZLH-ACZMJKKPSA-N Asn-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N MECFLTFREHAZLH-ACZMJKKPSA-N 0.000 description 2
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 2
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 2
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 2
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 2
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 2
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 2
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 2
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 2
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 2
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 2
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 2
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 2
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 2
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 2
- IPPFAOCLQSGHJV-WFBYXXMGSA-N Asn-Trp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O IPPFAOCLQSGHJV-WFBYXXMGSA-N 0.000 description 2
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 2
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 2
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 2
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 2
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 2
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 2
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 2
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 2
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 2
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 2
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 2
- RWHHSFSWKFBTCF-KKUMJFAQSA-N Asp-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N RWHHSFSWKFBTCF-KKUMJFAQSA-N 0.000 description 2
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 2
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 2
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 2
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 2
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 2
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 2
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 2
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 2
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 2
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 2
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 2
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 2
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 2
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 2
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 2
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 2
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 2
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 2
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 2
- 235000005340 Asparagus officinalis Nutrition 0.000 description 2
- 241001106067 Atropa Species 0.000 description 2
- 235000005781 Avena Nutrition 0.000 description 2
- 241000193830 Bacillus <bacterium> Species 0.000 description 2
- 235000002566 Capsicum Nutrition 0.000 description 2
- 240000008574 Capsicum frutescens Species 0.000 description 2
- 241000219109 Citrullus Species 0.000 description 2
- 241000207199 Citrus Species 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- 241000723377 Coffea Species 0.000 description 2
- 235000010071 Cucumis prophetarum Nutrition 0.000 description 2
- 244000024469 Cucumis prophetarum Species 0.000 description 2
- 241000219122 Cucurbita Species 0.000 description 2
- HRJLVSQKBLZHSR-ZLUOBGJFSA-N Cys-Asn-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O HRJLVSQKBLZHSR-ZLUOBGJFSA-N 0.000 description 2
- NDUSUIGBMZCOIL-ZKWXMUAHSA-N Cys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N NDUSUIGBMZCOIL-ZKWXMUAHSA-N 0.000 description 2
- ATPDEYTYWVMINF-ZLUOBGJFSA-N Cys-Cys-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O ATPDEYTYWVMINF-ZLUOBGJFSA-N 0.000 description 2
- ZVNFONSZVUBRAV-CIUDSAMLSA-N Cys-Gln-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)CN=C(N)N ZVNFONSZVUBRAV-CIUDSAMLSA-N 0.000 description 2
- UXUSHQYYQCZWET-WDSKDSINSA-N Cys-Glu-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O UXUSHQYYQCZWET-WDSKDSINSA-N 0.000 description 2
- SDXQKJAWASHMIZ-CIUDSAMLSA-N Cys-Glu-Met Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O SDXQKJAWASHMIZ-CIUDSAMLSA-N 0.000 description 2
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 2
- OTXLNICGSXPGQF-KBIXCLLPSA-N Cys-Ile-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTXLNICGSXPGQF-KBIXCLLPSA-N 0.000 description 2
- VFGADOJXRLWTBU-JBDRJPRFSA-N Cys-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N VFGADOJXRLWTBU-JBDRJPRFSA-N 0.000 description 2
- IDFVDSBJNMPBSX-SRVKXCTJSA-N Cys-Lys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O IDFVDSBJNMPBSX-SRVKXCTJSA-N 0.000 description 2
- CNAMJJOZGXPDHW-IHRRRGAJSA-N Cys-Pro-Phe Chemical compound N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O CNAMJJOZGXPDHW-IHRRRGAJSA-N 0.000 description 2
- YQEHNIKPAOPBNH-DCAQKATOSA-N Cys-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N YQEHNIKPAOPBNH-DCAQKATOSA-N 0.000 description 2
- RGHNJXZEOKUKBD-SQOUGZDYSA-N D-gluconic acid Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C(O)=O RGHNJXZEOKUKBD-SQOUGZDYSA-N 0.000 description 2
- 241000208175 Daucus Species 0.000 description 2
- 244000000626 Daucus carota Species 0.000 description 2
- 235000002767 Daucus carota Nutrition 0.000 description 2
- 241000220223 Fragaria Species 0.000 description 2
- VZCYOOQTPOCHFL-OWOJBTEDSA-N Fumaric acid Chemical compound OC(=O)\C=C\C(O)=O VZCYOOQTPOCHFL-OWOJBTEDSA-N 0.000 description 2
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 2
- IPHGBVYWRKCGKG-FXQIFTODSA-N Gln-Cys-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O IPHGBVYWRKCGKG-FXQIFTODSA-N 0.000 description 2
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 2
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 2
- KQOPMGBHNQBCEL-HVTMNAMFSA-N Gln-His-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KQOPMGBHNQBCEL-HVTMNAMFSA-N 0.000 description 2
- DAAUVRPSZRDMBV-KBIXCLLPSA-N Gln-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DAAUVRPSZRDMBV-KBIXCLLPSA-N 0.000 description 2
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 2
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 2
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 2
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 2
- YJSCHRBERYWPQL-DCAQKATOSA-N Gln-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N YJSCHRBERYWPQL-DCAQKATOSA-N 0.000 description 2
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 2
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 2
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 2
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 2
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 2
- NKSGKPWXSWBRRX-ACZMJKKPSA-N Glu-Asn-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NKSGKPWXSWBRRX-ACZMJKKPSA-N 0.000 description 2
- BUVMZWZNWMKASN-QEJZJMRPSA-N Glu-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 BUVMZWZNWMKASN-QEJZJMRPSA-N 0.000 description 2
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 2
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 2
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 2
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 2
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 2
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 2
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 2
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 2
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 2
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 2
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 2
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 2
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 2
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 2
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 2
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 2
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 2
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 2
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 2
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 2
- JVZLZVJTIXVIHK-SXNHZJKMSA-N Glu-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N JVZLZVJTIXVIHK-SXNHZJKMSA-N 0.000 description 2
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 2
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 2
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 2
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 2
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 2
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 2
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 2
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 2
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 2
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 2
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 2
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 2
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 2
- GZBZACMXFIPIDX-WHFBIAKZSA-N Gly-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)C(=O)O GZBZACMXFIPIDX-WHFBIAKZSA-N 0.000 description 2
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 2
- YZPVGIVFMZLQMM-YUMQZZPRSA-N Gly-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN YZPVGIVFMZLQMM-YUMQZZPRSA-N 0.000 description 2
- NTOWAXLMQFKJPT-YUMQZZPRSA-N Gly-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN NTOWAXLMQFKJPT-YUMQZZPRSA-N 0.000 description 2
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 2
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- ZKLYPEGLWFVRGF-IUCAKERBSA-N Gly-His-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZKLYPEGLWFVRGF-IUCAKERBSA-N 0.000 description 2
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 2
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 2
- ICUTTWWCDIIIEE-BQBZGAKWSA-N Gly-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN ICUTTWWCDIIIEE-BQBZGAKWSA-N 0.000 description 2
- MXIULRKNFSCJHT-STQMWFEESA-N Gly-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 MXIULRKNFSCJHT-STQMWFEESA-N 0.000 description 2
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 2
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 2
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 2
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 2
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 2
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 2
- 235000009438 Gossypium Nutrition 0.000 description 2
- 241000219146 Gossypium Species 0.000 description 2
- 241000208818 Helianthus Species 0.000 description 2
- CNHSMSFYVARZLI-YJRXYDGGSA-N His-His-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CNHSMSFYVARZLI-YJRXYDGGSA-N 0.000 description 2
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 2
- ZVKDCQVQTGYBQT-LSJOCFKGSA-N His-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O ZVKDCQVQTGYBQT-LSJOCFKGSA-N 0.000 description 2
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 description 2
- JUCZDDVZBMPKRT-IXOXFDKPSA-N His-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O JUCZDDVZBMPKRT-IXOXFDKPSA-N 0.000 description 2
- DLTCGJZBNFOWFL-LKTVYLICSA-N His-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N DLTCGJZBNFOWFL-LKTVYLICSA-N 0.000 description 2
- 241000209219 Hordeum Species 0.000 description 2
- 108700039609 IRW peptide Proteins 0.000 description 2
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 2
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 2
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 2
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 2
- UQXADIGYEYBJEI-DJFWLOJKSA-N Ile-His-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N UQXADIGYEYBJEI-DJFWLOJKSA-N 0.000 description 2
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 2
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 2
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 2
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 2
- IALVDKNUFSTICJ-GMOBBJLQSA-N Ile-Met-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IALVDKNUFSTICJ-GMOBBJLQSA-N 0.000 description 2
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 2
- CZWANIQKACCEKW-CYDGBPFRSA-N Ile-Pro-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N CZWANIQKACCEKW-CYDGBPFRSA-N 0.000 description 2
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 2
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 2
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 2
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 2
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 2
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 2
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 2
- AHLPHDHHMVZTML-BYPYZUCNSA-N L-Ornithine Chemical compound NCCC[C@H](N)C(O)=O AHLPHDHHMVZTML-BYPYZUCNSA-N 0.000 description 2
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- 229930195722 L-methionine Natural products 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- 241000208822 Lactuca Species 0.000 description 2
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 2
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 2
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 2
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 2
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 2
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 2
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 2
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 2
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 2
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 2
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 2
- VTJUNIYRYIAIHF-IUCAKERBSA-N Leu-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O VTJUNIYRYIAIHF-IUCAKERBSA-N 0.000 description 2
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 2
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 2
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 2
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 2
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 2
- 241000208204 Linum Species 0.000 description 2
- 241000209082 Lolium Species 0.000 description 2
- 241000219745 Lupinus Species 0.000 description 2
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 2
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 2
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 2
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 2
- BRSGXFITDXFMFF-IHRRRGAJSA-N Lys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N BRSGXFITDXFMFF-IHRRRGAJSA-N 0.000 description 2
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 2
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 2
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 2
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 description 2
- AIPHUKOBUXJNKM-KKUMJFAQSA-N Lys-Cys-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O AIPHUKOBUXJNKM-KKUMJFAQSA-N 0.000 description 2
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 2
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 2
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 2
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 2
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 2
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 2
- GHOIOYHDDKXIDX-SZMVWBNQSA-N Lys-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 GHOIOYHDDKXIDX-SZMVWBNQSA-N 0.000 description 2
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 2
- KNKJPYAZQUFLQK-IHRRRGAJSA-N Lys-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N KNKJPYAZQUFLQK-IHRRRGAJSA-N 0.000 description 2
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 2
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 2
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 2
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 2
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 2
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 2
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 2
- SKUOQDYMJFUMOE-ULQDDVLXSA-N Lys-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N SKUOQDYMJFUMOE-ULQDDVLXSA-N 0.000 description 2
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 2
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 2
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 2
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 2
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 2
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 2
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 2
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 2
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 2
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 2
- 240000003183 Manihot esculenta Species 0.000 description 2
- 241000219823 Medicago Species 0.000 description 2
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 2
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 2
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 2
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 2
- QGRJTULYDZUBAY-ZPFDUUQYSA-N Met-Ile-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGRJTULYDZUBAY-ZPFDUUQYSA-N 0.000 description 2
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 2
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 2
- JYPITOUIQVSCKM-IHRRRGAJSA-N Met-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCSC)N JYPITOUIQVSCKM-IHRRRGAJSA-N 0.000 description 2
- IRVONVRHHJXWTK-RWMBFGLXSA-N Met-Lys-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N IRVONVRHHJXWTK-RWMBFGLXSA-N 0.000 description 2
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 244000061176 Nicotiana tabacum Species 0.000 description 2
- 241000795633 Olea <sea slug> Species 0.000 description 2
- 241000209094 Oryza Species 0.000 description 2
- 241000209117 Panicum Species 0.000 description 2
- 235000006443 Panicum miliaceum subsp. miliaceum Nutrition 0.000 description 2
- 235000009037 Panicum miliaceum subsp. ruderale Nutrition 0.000 description 2
- 235000011096 Papaver Nutrition 0.000 description 2
- 240000001090 Papaver somniferum Species 0.000 description 2
- 241000218196 Persea Species 0.000 description 2
- 241000219833 Phaseolus Species 0.000 description 2
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 2
- KKYHKZCMETTXEO-AVGNSLFASA-N Phe-Cys-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKYHKZCMETTXEO-AVGNSLFASA-N 0.000 description 2
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 2
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 2
- XXAOSEUPEMQJOF-KKUMJFAQSA-N Phe-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XXAOSEUPEMQJOF-KKUMJFAQSA-N 0.000 description 2
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 2
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 2
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 2
- PBWNICYZGJQKJV-BZSNNMDCSA-N Phe-Phe-Cys Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O PBWNICYZGJQKJV-BZSNNMDCSA-N 0.000 description 2
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 2
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 2
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- 235000005205 Pinus Nutrition 0.000 description 2
- 241000218602 Pinus <genus> Species 0.000 description 2
- 241000219843 Pisum Species 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 2
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 2
- VPVHXWGPALPDGP-GUBZILKMSA-N Pro-Asn-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPVHXWGPALPDGP-GUBZILKMSA-N 0.000 description 2
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 2
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 2
- AIZVVCMAFRREQS-GUBZILKMSA-N Pro-Cys-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AIZVVCMAFRREQS-GUBZILKMSA-N 0.000 description 2
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 2
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 2
- BODDREDDDRZUCF-QTKMDUPCSA-N Pro-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2)O BODDREDDDRZUCF-QTKMDUPCSA-N 0.000 description 2
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 2
- RYJRPPUATSKNAY-STECZYCISA-N Pro-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@@H]2CCCN2 RYJRPPUATSKNAY-STECZYCISA-N 0.000 description 2
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 2
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 2
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 2
- AUYKOPJPKUCYHE-SRVKXCTJSA-N Pro-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 AUYKOPJPKUCYHE-SRVKXCTJSA-N 0.000 description 2
- LGMBKOAPPTYKLC-JYJNAYRXSA-N Pro-Phe-Arg Chemical compound C([C@@H](C(=O)N[C@@H](CCCNC(=N)N)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 LGMBKOAPPTYKLC-JYJNAYRXSA-N 0.000 description 2
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 2
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 2
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 2
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 2
- JRBWMRUPXWPEID-JYJNAYRXSA-N Pro-Trp-Cys Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CS)C(=O)O)C(=O)[C@@H]1CCCN1 JRBWMRUPXWPEID-JYJNAYRXSA-N 0.000 description 2
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 2
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 2
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 2
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 2
- 241000220299 Prunus Species 0.000 description 2
- 235000011432 Prunus Nutrition 0.000 description 2
- 240000005809 Prunus persica Species 0.000 description 2
- 244000184734 Pyrus japonica Species 0.000 description 2
- LCTONWCANYUPML-UHFFFAOYSA-N Pyruvic acid Chemical compound CC(=O)C(O)=O LCTONWCANYUPML-UHFFFAOYSA-N 0.000 description 2
- 108700005075 Regulator Genes Proteins 0.000 description 2
- 235000003846 Ricinus Nutrition 0.000 description 2
- 241000322381 Ricinus <louse> Species 0.000 description 2
- 241000780602 Senecio Species 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 2
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 2
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 2
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 2
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 2
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 2
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 2
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 2
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 2
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 2
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 2
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 2
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 2
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 2
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 2
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 2
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 2
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 2
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 2
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 2
- IFLVBVIYADZIQO-DCAQKATOSA-N Ser-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N IFLVBVIYADZIQO-DCAQKATOSA-N 0.000 description 2
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 2
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 2
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 2
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 2
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 2
- 239000004141 Sodium laurylsulphate Substances 0.000 description 2
- 235000002634 Solanum Nutrition 0.000 description 2
- 241000207763 Solanum Species 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 2
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 2
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 2
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 2
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 2
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 2
- ZLNWJMRLHLGKFX-SVSWQMSJSA-N Thr-Cys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZLNWJMRLHLGKFX-SVSWQMSJSA-N 0.000 description 2
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 2
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 2
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 2
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 2
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 2
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 2
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 2
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 2
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 2
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- IJKNKFJZOJCKRR-GBALPHGKSA-N Thr-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 IJKNKFJZOJCKRR-GBALPHGKSA-N 0.000 description 2
- NJGMALCNYAMYCB-JRQIVUDYSA-N Thr-Tyr-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJGMALCNYAMYCB-JRQIVUDYSA-N 0.000 description 2
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- SAKLWFSRZTZQAJ-GQGQLFGLSA-N Trp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SAKLWFSRZTZQAJ-GQGQLFGLSA-N 0.000 description 2
- RIKLKPANMFNREP-FDARSICLSA-N Trp-Met-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)=CNC2=C1 RIKLKPANMFNREP-FDARSICLSA-N 0.000 description 2
- ZHDQRPWESGUDST-JBACZVJFSA-N Trp-Phe-Gln Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ZHDQRPWESGUDST-JBACZVJFSA-N 0.000 description 2
- WSMVEHPVOYXPAQ-XIRDDKMYSA-N Trp-Ser-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N WSMVEHPVOYXPAQ-XIRDDKMYSA-N 0.000 description 2
- UIDJDMVRDUANDL-BVSLBCMMSA-N Trp-Tyr-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UIDJDMVRDUANDL-BVSLBCMMSA-N 0.000 description 2
- CRCHQCUINSOGFD-JBACZVJFSA-N Trp-Tyr-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N CRCHQCUINSOGFD-JBACZVJFSA-N 0.000 description 2
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 2
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 2
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 2
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 2
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 2
- AZZLDIDWPZLCCW-ZEWNOJEFSA-N Tyr-Ile-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O AZZLDIDWPZLCCW-ZEWNOJEFSA-N 0.000 description 2
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 2
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 2
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 2
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 2
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 2
- KRXFXDCNKLANCP-CXTHYWKRSA-N Tyr-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 KRXFXDCNKLANCP-CXTHYWKRSA-N 0.000 description 2
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 2
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 2
- 108091023045 Untranslated Region Proteins 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 2
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 2
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 2
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 2
- VXCAZHCVDBQMTP-NRPADANISA-N Val-Cys-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VXCAZHCVDBQMTP-NRPADANISA-N 0.000 description 2
- LMSBRIVOCYOKMU-NRPADANISA-N Val-Gln-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N LMSBRIVOCYOKMU-NRPADANISA-N 0.000 description 2
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 2
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 2
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 2
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 2
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 2
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 2
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 2
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 2
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 2
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 2
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 2
- QPPZEDOTPZOSEC-RCWTZXSCSA-N Val-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N)O QPPZEDOTPZOSEC-RCWTZXSCSA-N 0.000 description 2
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 2
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 2
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 2
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 2
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 2
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 2
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 2
- GTACFKZDQFTVAI-STECZYCISA-N Val-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 GTACFKZDQFTVAI-STECZYCISA-N 0.000 description 2
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 2
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 2
- ZHWZDZFWBXWPDW-GUBZILKMSA-N Val-Val-Cys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O ZHWZDZFWBXWPDW-GUBZILKMSA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- 241000219873 Vicia Species 0.000 description 2
- 241000219977 Vigna Species 0.000 description 2
- 235000009392 Vitis Nutrition 0.000 description 2
- 241000219095 Vitis Species 0.000 description 2
- 241000209149 Zea Species 0.000 description 2
- 108010081404 acein-2 Proteins 0.000 description 2
- 244000193174 agave Species 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 229960005261 aspartic acid Drugs 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- LGJMUZUPVCAVPU-UHFFFAOYSA-N beta-Sitostanol Natural products C1CC2CC(O)CCC2(C)C2C1C1CCC(C(C)CCC(CC)C(C)C)C1(C)CC2 LGJMUZUPVCAVPU-UHFFFAOYSA-N 0.000 description 2
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 2
- 239000001390 capsicum minimum Substances 0.000 description 2
- 238000000546 chi-square test Methods 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 235000020971 citrus fruits Nutrition 0.000 description 2
- 239000013599 cloning vector Substances 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 230000008034 disappearance Effects 0.000 description 2
- UKMSUNONTOPOIO-UHFFFAOYSA-N docosanoic acid Chemical compound CCCCCCCCCCCCCCCCCCCCCC(O)=O UKMSUNONTOPOIO-UHFFFAOYSA-N 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 235000013399 edible fruits Nutrition 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 230000008175 fetal development Effects 0.000 description 2
- 229960002989 glutamic acid Drugs 0.000 description 2
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 2
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 2
- BXWNKGSJHAJOGX-UHFFFAOYSA-N hexadecan-1-ol Chemical compound CCCCCCCCCCCCCCCCO BXWNKGSJHAJOGX-UHFFFAOYSA-N 0.000 description 2
- IPCSVZSSVZVIGE-UHFFFAOYSA-N hexadecanoic acid Chemical compound CCCCCCCCCCCCCCCC(O)=O IPCSVZSSVZVIGE-UHFFFAOYSA-N 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 2
- 229960003136 leucine Drugs 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 235000020778 linoleic acid Nutrition 0.000 description 2
- OYHQOLUKZRVURQ-IXWMQOLASA-N linoleic acid Natural products CCCCC\C=C/C\C=C\CCCCCCCC(O)=O OYHQOLUKZRVURQ-IXWMQOLASA-N 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 230000013011 mating Effects 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- VNWKTOKETHGBQD-UHFFFAOYSA-N methane Natural products C VNWKTOKETHGBQD-UHFFFAOYSA-N 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 230000001035 methylating effect Effects 0.000 description 2
- 230000011987 methylation Effects 0.000 description 2
- 238000007069 methylation reaction Methods 0.000 description 2
- QIQXTHQIDYTFRH-UHFFFAOYSA-N octadecanoic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O QIQXTHQIDYTFRH-UHFFFAOYSA-N 0.000 description 2
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 2
- 238000012856 packing Methods 0.000 description 2
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 238000005554 pickling Methods 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- 239000003755 preservative agent Substances 0.000 description 2
- 230000002335 preservative effect Effects 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 235000014774 prunus Nutrition 0.000 description 2
- 230000005070 ripening Effects 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 230000008117 seed development Effects 0.000 description 2
- 230000035040 seed growth Effects 0.000 description 2
- 210000000582 semen Anatomy 0.000 description 2
- 229960001153 serine Drugs 0.000 description 2
- 239000001509 sodium citrate Substances 0.000 description 2
- 235000019333 sodium laurylsulphate Nutrition 0.000 description 2
- 229960002898 threonine Drugs 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- HRXKRNGNAMMEHJ-UHFFFAOYSA-K trisodium citrate Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O HRXKRNGNAMMEHJ-UHFFFAOYSA-K 0.000 description 2
- 229940038773 trisodium citrate Drugs 0.000 description 2
- 229960004799 tryptophan Drugs 0.000 description 2
- 235000013311 vegetables Nutrition 0.000 description 2
- 108010000998 wheylin-2 peptide Proteins 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- KZJWDPNRJALLNS-VPUBHVLGSA-N (-)-beta-Sitosterol Natural products O[C@@H]1CC=2[C@@](C)([C@@H]3[C@H]([C@H]4[C@@](C)([C@H]([C@H](CC[C@@H](C(C)C)CC)C)CC4)CC3)CC=2)CC1 KZJWDPNRJALLNS-VPUBHVLGSA-N 0.000 description 1
- CSVWWLUMXNHWSU-UHFFFAOYSA-N (22E)-(24xi)-24-ethyl-5alpha-cholest-22-en-3beta-ol Natural products C1CC2CC(O)CCC2(C)C2C1C1CCC(C(C)C=CC(CC)C(C)C)C1(C)CC2 CSVWWLUMXNHWSU-UHFFFAOYSA-N 0.000 description 1
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 1
- AAWZDTNXLSGCEK-LNVDRNJUSA-N (3r,5r)-1,3,4,5-tetrahydroxycyclohexane-1-carboxylic acid Chemical compound O[C@@H]1CC(O)(C(O)=O)C[C@@H](O)C1O AAWZDTNXLSGCEK-LNVDRNJUSA-N 0.000 description 1
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 description 1
- WRIDQFICGBMAFQ-UHFFFAOYSA-N (E)-8-Octadecenoic acid Natural products CCCCCCCCCC=CCCCCCCC(O)=O WRIDQFICGBMAFQ-UHFFFAOYSA-N 0.000 description 1
- RBNPOMFGQQGHHO-UHFFFAOYSA-N -2,3-Dihydroxypropanoic acid Natural products OCC(O)C(O)=O RBNPOMFGQQGHHO-UHFFFAOYSA-N 0.000 description 1
- UKAUYVFTDYCKQA-UHFFFAOYSA-N -2-Amino-4-hydroxybutanoic acid Natural products OC(=O)C(N)CCO UKAUYVFTDYCKQA-UHFFFAOYSA-N 0.000 description 1
- FRPZMMHWLSIFAZ-UHFFFAOYSA-N 10-undecenoic acid Chemical compound OC(=O)CCCCCCCCC=C FRPZMMHWLSIFAZ-UHFFFAOYSA-N 0.000 description 1
- QDGAVODICPCDMU-UHFFFAOYSA-N 2-amino-3-[3-[bis(2-chloroethyl)amino]phenyl]propanoic acid Chemical compound OC(=O)C(N)CC1=CC=CC(N(CCCl)CCCl)=C1 QDGAVODICPCDMU-UHFFFAOYSA-N 0.000 description 1
- LODHFNUFVRVKTH-ZHACJKMWSA-N 2-hydroxy-n'-[(e)-3-phenylprop-2-enoyl]benzohydrazide Chemical compound OC1=CC=CC=C1C(=O)NNC(=O)\C=C\C1=CC=CC=C1 LODHFNUFVRVKTH-ZHACJKMWSA-N 0.000 description 1
- LQJBNNIYVWPHFW-UHFFFAOYSA-N 20:1omega9c fatty acid Natural products CCCCCCCCCCC=CCCCCCCCC(O)=O LQJBNNIYVWPHFW-UHFFFAOYSA-N 0.000 description 1
- KLEXDBGYSOIREE-UHFFFAOYSA-N 24xi-n-propylcholesterol Natural products C1C=C2CC(O)CCC2(C)C2C1C1CCC(C(C)CCC(CCC)C(C)C)C1(C)CC2 KLEXDBGYSOIREE-UHFFFAOYSA-N 0.000 description 1
- QSBYPNXLFMSGKH-UHFFFAOYSA-N 9-Heptadecensaeure Natural products CCCCCCCC=CCCCCCCCC(O)=O QSBYPNXLFMSGKH-UHFFFAOYSA-N 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 1
- CCUAQNUWXLYFRA-IMJSIDKUSA-N Ala-Asn Chemical compound C[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC(N)=O CCUAQNUWXLYFRA-IMJSIDKUSA-N 0.000 description 1
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- JYEBJTDTPNKQJG-FXQIFTODSA-N Ala-Asn-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N JYEBJTDTPNKQJG-FXQIFTODSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 1
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 1
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 1
- 108010076441 Ala-His-His Proteins 0.000 description 1
- FAJIYNONGXEXAI-CQDKDKBSSA-N Ala-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 FAJIYNONGXEXAI-CQDKDKBSSA-N 0.000 description 1
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 1
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 1
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 1
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- JNLDTVRGXMSYJC-UVBJJODRSA-N Ala-Pro-Trp Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JNLDTVRGXMSYJC-UVBJJODRSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- 241000219318 Amaranthus Species 0.000 description 1
- 101100059544 Arabidopsis thaliana CDC5 gene Proteins 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 1
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- RWWPBOUMKFBHAL-FXQIFTODSA-N Arg-Asn-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O RWWPBOUMKFBHAL-FXQIFTODSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 1
- ALOVURZCXKYKJC-NAKRPEOUSA-N Arg-Asp-Gln-Ser Chemical compound N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O ALOVURZCXKYKJC-NAKRPEOUSA-N 0.000 description 1
- OSASDIVHOSJVII-WDSKDSINSA-N Arg-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N OSASDIVHOSJVII-WDSKDSINSA-N 0.000 description 1
- DQNLFLGFZAUIOW-FXQIFTODSA-N Arg-Cys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DQNLFLGFZAUIOW-FXQIFTODSA-N 0.000 description 1
- SVHRPCMZTWZROG-DCAQKATOSA-N Arg-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N SVHRPCMZTWZROG-DCAQKATOSA-N 0.000 description 1
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 1
- VDBKFYYIBLXEIF-GUBZILKMSA-N Arg-Gln-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VDBKFYYIBLXEIF-GUBZILKMSA-N 0.000 description 1
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 1
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- JTZUZBADHGISJD-SRVKXCTJSA-N Arg-His-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JTZUZBADHGISJD-SRVKXCTJSA-N 0.000 description 1
- ZJEDSBGPBXVBMP-PYJNHQTQSA-N Arg-His-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJEDSBGPBXVBMP-PYJNHQTQSA-N 0.000 description 1
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 1
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 1
- CFGHCPUPFHWMCM-FDARSICLSA-N Arg-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N CFGHCPUPFHWMCM-FDARSICLSA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- GIMTZGADWZTZGV-DCAQKATOSA-N Arg-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GIMTZGADWZTZGV-DCAQKATOSA-N 0.000 description 1
- CVXXSWQORBZAAA-SRVKXCTJSA-N Arg-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N CVXXSWQORBZAAA-SRVKXCTJSA-N 0.000 description 1
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 1
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- VIINVRPKMUZYOI-DCAQKATOSA-N Arg-Met-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIINVRPKMUZYOI-DCAQKATOSA-N 0.000 description 1
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 1
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 1
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 1
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- FOQFHANLUJDQEE-GUBZILKMSA-N Arg-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CS)C(=O)O FOQFHANLUJDQEE-GUBZILKMSA-N 0.000 description 1
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 1
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 1
- QHVRVUNEAIFTEK-SZMVWBNQSA-N Arg-Pro-Trp Chemical compound N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O QHVRVUNEAIFTEK-SZMVWBNQSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 1
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 1
- FSPQNLYOFCXUCE-BPUTZDHNSA-N Arg-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FSPQNLYOFCXUCE-BPUTZDHNSA-N 0.000 description 1
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 1
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- QQEWINYJRFBLNN-DLOVCJGASA-N Asn-Ala-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QQEWINYJRFBLNN-DLOVCJGASA-N 0.000 description 1
- GMRGSBAMMMVDGG-GUBZILKMSA-N Asn-Arg-Arg Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N GMRGSBAMMMVDGG-GUBZILKMSA-N 0.000 description 1
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 1
- HAJWYALLJIATCX-FXQIFTODSA-N Asn-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N HAJWYALLJIATCX-FXQIFTODSA-N 0.000 description 1
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 1
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- XWFPGQVLOVGSLU-CIUDSAMLSA-N Asn-Gln-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XWFPGQVLOVGSLU-CIUDSAMLSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- IKLAUGBIDCDFOY-SRVKXCTJSA-N Asn-His-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IKLAUGBIDCDFOY-SRVKXCTJSA-N 0.000 description 1
- QUAWOKPCAKCHQL-SRVKXCTJSA-N Asn-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QUAWOKPCAKCHQL-SRVKXCTJSA-N 0.000 description 1
- AITGTTNYKAWKDR-CIUDSAMLSA-N Asn-His-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O AITGTTNYKAWKDR-CIUDSAMLSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- HMUKKNAMNSXDBB-CIUDSAMLSA-N Asn-Met-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMUKKNAMNSXDBB-CIUDSAMLSA-N 0.000 description 1
- NNDSLVWAQAUPPP-GUBZILKMSA-N Asn-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N NNDSLVWAQAUPPP-GUBZILKMSA-N 0.000 description 1
- RLHANKIRBONJBK-IHRRRGAJSA-N Asn-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N RLHANKIRBONJBK-IHRRRGAJSA-N 0.000 description 1
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 1
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 1
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- MFMJRYHVLLEMQM-DCAQKATOSA-N Asp-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N MFMJRYHVLLEMQM-DCAQKATOSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- MUWDILPCTSMUHI-ZLUOBGJFSA-N Asp-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)O MUWDILPCTSMUHI-ZLUOBGJFSA-N 0.000 description 1
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 1
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- FMWHSNJMHUNLAG-FXQIFTODSA-N Asp-Cys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FMWHSNJMHUNLAG-FXQIFTODSA-N 0.000 description 1
- WXASLRQUSYWVNE-FXQIFTODSA-N Asp-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N WXASLRQUSYWVNE-FXQIFTODSA-N 0.000 description 1
- PJERDVUTUDZPGX-ZKWXMUAHSA-N Asp-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O PJERDVUTUDZPGX-ZKWXMUAHSA-N 0.000 description 1
- SMZCLQGDQMGESY-ACZMJKKPSA-N Asp-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N SMZCLQGDQMGESY-ACZMJKKPSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 1
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- CMCIMCAQIULNDJ-CIUDSAMLSA-N Asp-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N CMCIMCAQIULNDJ-CIUDSAMLSA-N 0.000 description 1
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 1
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 1
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- FQHBAQLBIXLWAG-DCAQKATOSA-N Asp-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N FQHBAQLBIXLWAG-DCAQKATOSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 1
- SJLDOGLMVPHPLZ-IHRRRGAJSA-N Asp-Met-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SJLDOGLMVPHPLZ-IHRRRGAJSA-N 0.000 description 1
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 1
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- ZARXTZFGQZBYFO-JQWIXIFHSA-N Asp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)N)C(O)=O)=CNC2=C1 ZARXTZFGQZBYFO-JQWIXIFHSA-N 0.000 description 1
- YODBPLSWNJMZOJ-BPUTZDHNSA-N Asp-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N YODBPLSWNJMZOJ-BPUTZDHNSA-N 0.000 description 1
- MRYDJCIIVRXVGG-QEJZJMRPSA-N Asp-Trp-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O MRYDJCIIVRXVGG-QEJZJMRPSA-N 0.000 description 1
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- OQMGSMNZVHYDTQ-ZKWXMUAHSA-N Asp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N OQMGSMNZVHYDTQ-ZKWXMUAHSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- GZYDPEJSZYZWEF-MXAVVETBSA-N Asp-Val-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O GZYDPEJSZYZWEF-MXAVVETBSA-N 0.000 description 1
- 235000007319 Avena orientalis Nutrition 0.000 description 1
- 235000007558 Avena sp Nutrition 0.000 description 1
- 101150077012 BEL1 gene Proteins 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 235000011331 Brassica Nutrition 0.000 description 1
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 description 1
- 235000006008 Brassica napus var napus Nutrition 0.000 description 1
- 240000000385 Brassica napus var. napus Species 0.000 description 1
- 240000007124 Brassica oleracea Species 0.000 description 1
- 235000003899 Brassica oleracea var acephala Nutrition 0.000 description 1
- 235000011301 Brassica oleracea var capitata Nutrition 0.000 description 1
- 235000001169 Brassica oleracea var oleracea Nutrition 0.000 description 1
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 description 1
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 1
- 235000004936 Bromus mango Nutrition 0.000 description 1
- SGNBVLSWZMBQTH-FGAXOLDCSA-N Campesterol Natural products O[C@@H]1CC=2[C@@](C)([C@@H]3[C@H]([C@H]4[C@@](C)([C@H]([C@H](CC[C@H](C(C)C)C)C)CC4)CC3)CC=2)CC1 SGNBVLSWZMBQTH-FGAXOLDCSA-N 0.000 description 1
- 244000045232 Canavalia ensiformis Species 0.000 description 1
- 239000005632 Capric acid (CAS 334-48-5) Substances 0.000 description 1
- WLYGSPLCNKYESI-RSUQVHIMSA-N Carthamin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1[C@@]1(O)C(O)=C(C(=O)\C=C\C=2C=CC(O)=CC=2)C(=O)C(\C=C\2C([C@](O)([C@H]3[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O3)O)C(O)=C(C(=O)\C=C\C=3C=CC(O)=CC=3)C/2=O)=O)=C1O WLYGSPLCNKYESI-RSUQVHIMSA-N 0.000 description 1
- 241000208809 Carthamus Species 0.000 description 1
- 235000003255 Carthamus tinctorius Nutrition 0.000 description 1
- 244000020518 Carthamus tinctorius Species 0.000 description 1
- LPZCCMIISIBREI-MTFRKTCUSA-N Citrostadienol Natural products CC=C(CC[C@@H](C)[C@H]1CC[C@H]2C3=CC[C@H]4[C@H](C)[C@@H](O)CC[C@]4(C)[C@H]3CC[C@]12C)C(C)C LPZCCMIISIBREI-MTFRKTCUSA-N 0.000 description 1
- 244000241235 Citrullus lanatus Species 0.000 description 1
- 235000012828 Citrullus lanatus var citroides Nutrition 0.000 description 1
- 235000005979 Citrus limon Nutrition 0.000 description 1
- 244000131522 Citrus pyriformis Species 0.000 description 1
- 240000000560 Citrus x paradisi Species 0.000 description 1
- 241000737241 Cocos Species 0.000 description 1
- 235000013162 Cocos nucifera Nutrition 0.000 description 1
- 244000060011 Cocos nucifera Species 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- AAWZDTNXLSGCEK-UHFFFAOYSA-N Cordycepinsaeure Natural products OC1CC(O)(C(O)=O)CC(O)C1O AAWZDTNXLSGCEK-UHFFFAOYSA-N 0.000 description 1
- CEZSLNCYQUFOSL-BQBZGAKWSA-N Cys-Arg-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O CEZSLNCYQUFOSL-BQBZGAKWSA-N 0.000 description 1
- NLCZGISONIGRQP-DCAQKATOSA-N Cys-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N NLCZGISONIGRQP-DCAQKATOSA-N 0.000 description 1
- OIMUAKUQOUEPCZ-WHFBIAKZSA-N Cys-Asn-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIMUAKUQOUEPCZ-WHFBIAKZSA-N 0.000 description 1
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 1
- ISWAQPWFWKGCAL-ACZMJKKPSA-N Cys-Cys-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISWAQPWFWKGCAL-ACZMJKKPSA-N 0.000 description 1
- YZKOXEJTLWZOQL-GUBZILKMSA-N Cys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N YZKOXEJTLWZOQL-GUBZILKMSA-N 0.000 description 1
- VKAWJBQTFCBHQY-GUBZILKMSA-N Cys-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N VKAWJBQTFCBHQY-GUBZILKMSA-N 0.000 description 1
- KABHAOSDMIYXTR-GUBZILKMSA-N Cys-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N KABHAOSDMIYXTR-GUBZILKMSA-N 0.000 description 1
- AOZBJZBKFHOYHL-AVGNSLFASA-N Cys-Glu-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O AOZBJZBKFHOYHL-AVGNSLFASA-N 0.000 description 1
- BSFFNUBDVYTDMV-WHFBIAKZSA-N Cys-Gly-Asn Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BSFFNUBDVYTDMV-WHFBIAKZSA-N 0.000 description 1
- SKSJPIBFNFPTJB-NKWVEPMBSA-N Cys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CS)N)C(=O)O SKSJPIBFNFPTJB-NKWVEPMBSA-N 0.000 description 1
- KKUVRYLJEXJSGX-MXAVVETBSA-N Cys-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N KKUVRYLJEXJSGX-MXAVVETBSA-N 0.000 description 1
- KXUKWRVYDYIPSQ-CIUDSAMLSA-N Cys-Leu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUKWRVYDYIPSQ-CIUDSAMLSA-N 0.000 description 1
- JXVFJOMFOLFPMP-KKUMJFAQSA-N Cys-Leu-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JXVFJOMFOLFPMP-KKUMJFAQSA-N 0.000 description 1
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 1
- LBSKYJOZIIOZIO-DCAQKATOSA-N Cys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N LBSKYJOZIIOZIO-DCAQKATOSA-N 0.000 description 1
- NIXHTNJAGGFBAW-CIUDSAMLSA-N Cys-Lys-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N NIXHTNJAGGFBAW-CIUDSAMLSA-N 0.000 description 1
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 1
- MBRWOKXNHTUJMB-CIUDSAMLSA-N Cys-Pro-Glu Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O MBRWOKXNHTUJMB-CIUDSAMLSA-N 0.000 description 1
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 1
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 1
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 1
- VRJZMZGGAKVSIQ-SRVKXCTJSA-N Cys-Tyr-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VRJZMZGGAKVSIQ-SRVKXCTJSA-N 0.000 description 1
- VXDXZGYXHIADHF-YJRXYDGGSA-N Cys-Tyr-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VXDXZGYXHIADHF-YJRXYDGGSA-N 0.000 description 1
- DGQJGBDBFVGLGL-ZKWXMUAHSA-N Cys-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DGQJGBDBFVGLGL-ZKWXMUAHSA-N 0.000 description 1
- NGOIQDYZMIKCOK-NAKRPEOUSA-N Cys-Val-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NGOIQDYZMIKCOK-NAKRPEOUSA-N 0.000 description 1
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- CKLJMWTZIZZHCS-UHFFFAOYSA-N D-OH-Asp Natural products OC(=O)C(N)CC(O)=O CKLJMWTZIZZHCS-UHFFFAOYSA-N 0.000 description 1
- HEBKCHPVOIAQTA-QWWZWVQMSA-N D-arabinitol Chemical compound OC[C@@H](O)C(O)[C@H](O)CO HEBKCHPVOIAQTA-QWWZWVQMSA-N 0.000 description 1
- RGHNJXZEOKUKBD-UHFFFAOYSA-N D-gluconic acid Natural products OCC(O)C(O)C(O)C(O)C(O)=O RGHNJXZEOKUKBD-UHFFFAOYSA-N 0.000 description 1
- RBNPOMFGQQGHHO-UWTATZPHSA-N D-glyceric acid Chemical compound OC[C@@H](O)C(O)=O RBNPOMFGQQGHHO-UWTATZPHSA-N 0.000 description 1
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 description 1
- CUOKHACJLGPRHD-BXXZVTAOSA-N D-ribono-1,4-lactone Chemical compound OC[C@H]1OC(=O)[C@H](O)[C@@H]1O CUOKHACJLGPRHD-BXXZVTAOSA-N 0.000 description 1
- ODBLHEXUDAPZAU-ZAFYKAAXSA-N D-threo-isocitric acid Chemical compound OC(=O)[C@H](O)[C@@H](C(O)=O)CC(O)=O ODBLHEXUDAPZAU-ZAFYKAAXSA-N 0.000 description 1
- 101710184591 DNA-cytosine methyltransferase Proteins 0.000 description 1
- ARVGMISWLZPBCH-UHFFFAOYSA-N Dehydro-beta-sitosterol Natural products C1C(O)CCC2(C)C(CCC3(C(C(C)CCC(CC)C(C)C)CCC33)C)C3=CC=C21 ARVGMISWLZPBCH-UHFFFAOYSA-N 0.000 description 1
- 241000512897 Elaeis Species 0.000 description 1
- 235000001942 Elaeis Nutrition 0.000 description 1
- 235000001950 Elaeis guineensis Nutrition 0.000 description 1
- 244000127993 Elaeis melanococca Species 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 241000218182 Eschscholzia Species 0.000 description 1
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- 229930091371 Fructose Natural products 0.000 description 1
- 239000005715 Fructose Substances 0.000 description 1
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 1
- PNNNRSAQSRJVSB-SLPGGIOYSA-N Fucose Natural products C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O PNNNRSAQSRJVSB-SLPGGIOYSA-N 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 1
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 1
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- JFOKLAPFYCTNHW-SRVKXCTJSA-N Gln-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N JFOKLAPFYCTNHW-SRVKXCTJSA-N 0.000 description 1
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 1
- PHZYLYASFWHLHJ-FXQIFTODSA-N Gln-Asn-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PHZYLYASFWHLHJ-FXQIFTODSA-N 0.000 description 1
- LLVXTGUTDYMJLY-GUBZILKMSA-N Gln-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LLVXTGUTDYMJLY-GUBZILKMSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 1
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 1
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- CITDWMLWXNUQKD-FXQIFTODSA-N Gln-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CITDWMLWXNUQKD-FXQIFTODSA-N 0.000 description 1
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 1
- GHYJGDCPHMSFEJ-GUBZILKMSA-N Gln-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GHYJGDCPHMSFEJ-GUBZILKMSA-N 0.000 description 1
- LVNILKSSFHCSJZ-IHRRRGAJSA-N Gln-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LVNILKSSFHCSJZ-IHRRRGAJSA-N 0.000 description 1
- IVCOYUURLWQDJQ-LPEHRKFASA-N Gln-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O IVCOYUURLWQDJQ-LPEHRKFASA-N 0.000 description 1
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- PXAFHUATEHLECW-GUBZILKMSA-N Gln-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N PXAFHUATEHLECW-GUBZILKMSA-N 0.000 description 1
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 1
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 1
- DWDBJWAXPXXYLP-SRVKXCTJSA-N Gln-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DWDBJWAXPXXYLP-SRVKXCTJSA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 1
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- GURIQZQSTBBHRV-SRVKXCTJSA-N Gln-Lys-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GURIQZQSTBBHRV-SRVKXCTJSA-N 0.000 description 1
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 1
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 1
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 1
- WBYHRQBKJGEBQJ-CIUDSAMLSA-N Gln-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CS)C(=O)O WBYHRQBKJGEBQJ-CIUDSAMLSA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- KPNWAJMEMRCLAL-GUBZILKMSA-N Gln-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KPNWAJMEMRCLAL-GUBZILKMSA-N 0.000 description 1
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- VOUSELYGTNGEPB-NUMRIWBASA-N Gln-Thr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O VOUSELYGTNGEPB-NUMRIWBASA-N 0.000 description 1
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- WTJIWXMJESRHMM-XDTLVQLUSA-N Gln-Tyr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O WTJIWXMJESRHMM-XDTLVQLUSA-N 0.000 description 1
- GTBXHETZPUURJE-KKUMJFAQSA-N Gln-Tyr-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GTBXHETZPUURJE-KKUMJFAQSA-N 0.000 description 1
- UQKVUFGUSVYJMQ-IRIUXVKKSA-N Gln-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N)O UQKVUFGUSVYJMQ-IRIUXVKKSA-N 0.000 description 1
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 1
- KHHDJQRWIFHXHS-NRPADANISA-N Gln-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHHDJQRWIFHXHS-NRPADANISA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- GJLXZITZLUUXMJ-NHCYSSNCSA-N Gln-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GJLXZITZLUUXMJ-NHCYSSNCSA-N 0.000 description 1
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 1
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- SVZIKUHLRKVZIF-GUBZILKMSA-N Glu-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N SVZIKUHLRKVZIF-GUBZILKMSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 1
- FLQAKQOBSPFGKG-CIUDSAMLSA-N Glu-Cys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLQAKQOBSPFGKG-CIUDSAMLSA-N 0.000 description 1
- OBIHEDRRSMRKLU-ACZMJKKPSA-N Glu-Cys-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OBIHEDRRSMRKLU-ACZMJKKPSA-N 0.000 description 1
- XKPOCESCRTVRPL-KBIXCLLPSA-N Glu-Cys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XKPOCESCRTVRPL-KBIXCLLPSA-N 0.000 description 1
- ISXJHXGYMJKXOI-GUBZILKMSA-N Glu-Cys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O ISXJHXGYMJKXOI-GUBZILKMSA-N 0.000 description 1
- ZXLZWUQBRYGDNS-CIUDSAMLSA-N Glu-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N ZXLZWUQBRYGDNS-CIUDSAMLSA-N 0.000 description 1
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 1
- VFZIDQZAEBORGY-GLLZPBPUSA-N Glu-Gln-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VFZIDQZAEBORGY-GLLZPBPUSA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- QYPKJXSMLMREKF-BPUTZDHNSA-N Glu-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N QYPKJXSMLMREKF-BPUTZDHNSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 1
- ZPASCJBSSCRWMC-GVXVVHGQSA-N Glu-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N ZPASCJBSSCRWMC-GVXVVHGQSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- OHWJUIXZHVIXJJ-GUBZILKMSA-N Glu-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N OHWJUIXZHVIXJJ-GUBZILKMSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- YHOJJFFTSMWVGR-HJGDQZAQSA-N Glu-Met-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YHOJJFFTSMWVGR-HJGDQZAQSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- PAZQYODKOZHXGA-SRVKXCTJSA-N Glu-Pro-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O PAZQYODKOZHXGA-SRVKXCTJSA-N 0.000 description 1
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- DDXZHOHEABQXSE-NKIYYHGXSA-N Glu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O DDXZHOHEABQXSE-NKIYYHGXSA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 1
- ZQNCUVODKOBSSO-XEGUGMAKSA-N Glu-Trp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZQNCUVODKOBSSO-XEGUGMAKSA-N 0.000 description 1
- YOTHMZZSJKKEHZ-SZMVWBNQSA-N Glu-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CCC(O)=O)=CNC2=C1 YOTHMZZSJKKEHZ-SZMVWBNQSA-N 0.000 description 1
- ZSIDREAPEPAPKL-XIRDDKMYSA-N Glu-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N ZSIDREAPEPAPKL-XIRDDKMYSA-N 0.000 description 1
- MIWJDJAMMKHUAR-ZVZYQTTQSA-N Glu-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N MIWJDJAMMKHUAR-ZVZYQTTQSA-N 0.000 description 1
- YSWHPLCDIMUKFE-QWRGUYRKSA-N Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YSWHPLCDIMUKFE-QWRGUYRKSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 1
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- GQGAFTPXAPKSCF-WHFBIAKZSA-N Gly-Ala-Cys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O GQGAFTPXAPKSCF-WHFBIAKZSA-N 0.000 description 1
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- SCCPDJAQCXWPTF-VKHMYHEASA-N Gly-Asp Chemical compound NCC(=O)N[C@H](C(O)=O)CC(O)=O SCCPDJAQCXWPTF-VKHMYHEASA-N 0.000 description 1
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 1
- YYQGVXNKAXUTJU-YUMQZZPRSA-N Gly-Cys-His Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O YYQGVXNKAXUTJU-YUMQZZPRSA-N 0.000 description 1
- LEGMTEAZGRRIMY-ZKWXMUAHSA-N Gly-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN LEGMTEAZGRRIMY-ZKWXMUAHSA-N 0.000 description 1
- IANBSEOVTQNGBZ-BQBZGAKWSA-N Gly-Cys-Met Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O IANBSEOVTQNGBZ-BQBZGAKWSA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- JLJLBWDKDRYOPA-RYUDHWBXSA-N Gly-Gln-Tyr Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JLJLBWDKDRYOPA-RYUDHWBXSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 1
- LUJVWKKYHSLULQ-ZKWXMUAHSA-N Gly-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN LUJVWKKYHSLULQ-ZKWXMUAHSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 1
- UWQDKRIZSROAKS-FJXKBIBVSA-N Gly-Met-Thr Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWQDKRIZSROAKS-FJXKBIBVSA-N 0.000 description 1
- MDKCBHZLQJZOCJ-STQMWFEESA-N Gly-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)CN MDKCBHZLQJZOCJ-STQMWFEESA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- LKJCZEPXHOIAIW-HOTGVXAUSA-N Gly-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN LKJCZEPXHOIAIW-HOTGVXAUSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 102100022087 Granzyme M Human genes 0.000 description 1
- BTEISVKTSQLKST-UHFFFAOYSA-N Haliclonasterol Natural products CC(C=CC(C)C(C)(C)C)C1CCC2C3=CC=C4CC(O)CCC4(C)C3CCC12C BTEISVKTSQLKST-UHFFFAOYSA-N 0.000 description 1
- 235000003222 Helianthus annuus Nutrition 0.000 description 1
- 244000020551 Helianthus annuus Species 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- JBJNKUOMNZGQIM-PYJNHQTQSA-N His-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JBJNKUOMNZGQIM-PYJNHQTQSA-N 0.000 description 1
- NOQPTNXSGNPJNS-YUMQZZPRSA-N His-Asn-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O NOQPTNXSGNPJNS-YUMQZZPRSA-N 0.000 description 1
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 1
- CHZKBLABUKSXDM-XIRDDKMYSA-N His-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC3=CN=CN3)N CHZKBLABUKSXDM-XIRDDKMYSA-N 0.000 description 1
- VOEGKUNRHYKYSU-XVYDVKMFSA-N His-Asp-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O VOEGKUNRHYKYSU-XVYDVKMFSA-N 0.000 description 1
- RXVOMIADLXPJGW-GUBZILKMSA-N His-Asp-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RXVOMIADLXPJGW-GUBZILKMSA-N 0.000 description 1
- UOAVQQRILDGZEN-SRVKXCTJSA-N His-Asp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UOAVQQRILDGZEN-SRVKXCTJSA-N 0.000 description 1
- LMMPTUVWHCFTOT-GARJFASQSA-N His-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O LMMPTUVWHCFTOT-GARJFASQSA-N 0.000 description 1
- LBHOVGUGOBINDL-KKUMJFAQSA-N His-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O LBHOVGUGOBINDL-KKUMJFAQSA-N 0.000 description 1
- ZNNNYCXPCKACHX-DCAQKATOSA-N His-Gln-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNNNYCXPCKACHX-DCAQKATOSA-N 0.000 description 1
- FMRKUXFLLPKVPG-JYJNAYRXSA-N His-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O FMRKUXFLLPKVPG-JYJNAYRXSA-N 0.000 description 1
- KNNSUUOHFVVJOP-GUBZILKMSA-N His-Glu-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N KNNSUUOHFVVJOP-GUBZILKMSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- NQKRILCJYCASDV-QWRGUYRKSA-N His-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 NQKRILCJYCASDV-QWRGUYRKSA-N 0.000 description 1
- RAVLQPXCMRCLKT-KBPBESRZSA-N His-Gly-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RAVLQPXCMRCLKT-KBPBESRZSA-N 0.000 description 1
- VTZYMXGGXOFBMX-DJFWLOJKSA-N His-Ile-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O VTZYMXGGXOFBMX-DJFWLOJKSA-N 0.000 description 1
- BRZQWIIFIKTJDH-VGDYDELISA-N His-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N BRZQWIIFIKTJDH-VGDYDELISA-N 0.000 description 1
- DYKZGTLPSNOFHU-DEQVHRJGSA-N His-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N DYKZGTLPSNOFHU-DEQVHRJGSA-N 0.000 description 1
- IWXMHXYOACDSIA-PYJNHQTQSA-N His-Ile-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O IWXMHXYOACDSIA-PYJNHQTQSA-N 0.000 description 1
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 1
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 1
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 1
- CKRJBQJIGOEKMC-SRVKXCTJSA-N His-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CKRJBQJIGOEKMC-SRVKXCTJSA-N 0.000 description 1
- BCZFOHDMCDXPDA-BZSNNMDCSA-N His-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)O BCZFOHDMCDXPDA-BZSNNMDCSA-N 0.000 description 1
- SGLXGEDPYJPGIQ-ACRUOGEOSA-N His-Phe-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N SGLXGEDPYJPGIQ-ACRUOGEOSA-N 0.000 description 1
- ULRFSEJGSHYLQI-YESZJQIVSA-N His-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ULRFSEJGSHYLQI-YESZJQIVSA-N 0.000 description 1
- HYWZHNUGAYVEEW-KKUMJFAQSA-N His-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HYWZHNUGAYVEEW-KKUMJFAQSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 1
- WSWAUVHXQREQQG-JYJNAYRXSA-N His-Tyr-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O WSWAUVHXQREQQG-JYJNAYRXSA-N 0.000 description 1
- ZHMZWSFQRUGLEC-JYJNAYRXSA-N His-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZHMZWSFQRUGLEC-JYJNAYRXSA-N 0.000 description 1
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 1
- 101000900697 Homo sapiens Granzyme M Proteins 0.000 description 1
- 101000949825 Homo sapiens Meiotic recombination protein DMC1/LIM15 homolog Proteins 0.000 description 1
- 101001046894 Homo sapiens Protein HID1 Proteins 0.000 description 1
- 235000007340 Hordeum vulgare Nutrition 0.000 description 1
- 240000005979 Hordeum vulgare Species 0.000 description 1
- 241000208278 Hyoscyamus Species 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 1
- UNDGQKWQNSTPPW-CYDGBPFRSA-N Ile-Arg-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N UNDGQKWQNSTPPW-CYDGBPFRSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- ZXJFURYTPZMUNY-VKOGCVSHSA-N Ile-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 ZXJFURYTPZMUNY-VKOGCVSHSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- FHCNLXMTQJNJNH-KBIXCLLPSA-N Ile-Cys-Gln Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)O FHCNLXMTQJNJNH-KBIXCLLPSA-N 0.000 description 1
- LDRALPZEVHVXEK-KBIXCLLPSA-N Ile-Cys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N LDRALPZEVHVXEK-KBIXCLLPSA-N 0.000 description 1
- WTOAPTKSZJJWKK-HTFCKZLJSA-N Ile-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WTOAPTKSZJJWKK-HTFCKZLJSA-N 0.000 description 1
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 1
- SYVMEYAPXRRXAN-MXAVVETBSA-N Ile-Cys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SYVMEYAPXRRXAN-MXAVVETBSA-N 0.000 description 1
- JHCVYQKVKOLAIU-NAKRPEOUSA-N Ile-Cys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N JHCVYQKVKOLAIU-NAKRPEOUSA-N 0.000 description 1
- KMBPQYKVZBMRMH-PEFMBERDSA-N Ile-Gln-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KMBPQYKVZBMRMH-PEFMBERDSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- HTDRTKMNJRRYOJ-SIUGBPQLSA-N Ile-Gln-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HTDRTKMNJRRYOJ-SIUGBPQLSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- URWXDJAEEGBADB-TUBUOCAGSA-N Ile-His-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N URWXDJAEEGBADB-TUBUOCAGSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- HUWYGQOISIJNMK-SIGLWIIPSA-N Ile-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HUWYGQOISIJNMK-SIGLWIIPSA-N 0.000 description 1
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 1
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 1
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- YTRFFJUOYBMLPN-UHFFFAOYSA-N Ile-Lys-Lys-Ser Chemical compound CCC(C)C(N)C(=O)NC(CCCCN)C(=O)NC(CCCCN)C(=O)NC(CO)C(O)=O YTRFFJUOYBMLPN-UHFFFAOYSA-N 0.000 description 1
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 1
- USXAYNCLFSUSBA-MGHWNKPDSA-N Ile-Phe-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N USXAYNCLFSUSBA-MGHWNKPDSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 1
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 1
- BBIXOODYWPFNDT-CIUDSAMLSA-N Ile-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O BBIXOODYWPFNDT-CIUDSAMLSA-N 0.000 description 1
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- JDCQDJVYUXNCGF-SPOWBLRKSA-N Ile-Ser-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JDCQDJVYUXNCGF-SPOWBLRKSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- XVUAQNRNFMVWBR-BLMTYFJBSA-N Ile-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N XVUAQNRNFMVWBR-BLMTYFJBSA-N 0.000 description 1
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 1
- HQLSBZFLOUHQJK-STECZYCISA-N Ile-Tyr-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HQLSBZFLOUHQJK-STECZYCISA-N 0.000 description 1
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 1
- DZMWFIRHFFVBHS-ZEWNOJEFSA-N Ile-Tyr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N DZMWFIRHFFVBHS-ZEWNOJEFSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- ODBLHEXUDAPZAU-FONMRSAGSA-N Isocitric acid Natural products OC(=O)[C@@H](O)[C@H](C(O)=O)CC(O)=O ODBLHEXUDAPZAU-FONMRSAGSA-N 0.000 description 1
- AYRXSINWFIIFAE-SCLMCMATSA-N Isomaltose Natural products OC[C@H]1O[C@H](OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C=O)[C@@H](O)[C@@H](O)[C@@H]1O AYRXSINWFIIFAE-SCLMCMATSA-N 0.000 description 1
- 108010079091 KRDS peptide Proteins 0.000 description 1
- CKLJMWTZIZZHCS-UWTATZPHSA-N L-Aspartic acid Natural products OC(=O)[C@H](N)CC(O)=O CKLJMWTZIZZHCS-UWTATZPHSA-N 0.000 description 1
- FFEARJCKVFRZRR-UHFFFAOYSA-N L-Methionine Natural products CSCCC(N)C(O)=O FFEARJCKVFRZRR-UHFFFAOYSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QLROSWPKSBORFJ-BQBZGAKWSA-N L-Prolyl-L-glutamic acid Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 QLROSWPKSBORFJ-BQBZGAKWSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- 125000000998 L-alanino group Chemical group [H]N([*])[C@](C([H])([H])[H])([H])C(=O)O[H] 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- SHZGCJCMOBCMKK-DHVFOXMCSA-N L-fucopyranose Chemical compound C[C@@H]1OC(O)[C@@H](O)[C@H](O)[C@@H]1O SHZGCJCMOBCMKK-DHVFOXMCSA-N 0.000 description 1
- UKAUYVFTDYCKQA-VKHMYHEASA-N L-homoserine Chemical compound OC(=O)[C@@H](N)CCO UKAUYVFTDYCKQA-VKHMYHEASA-N 0.000 description 1
- 229930182844 L-isoleucine Natural products 0.000 description 1
- 239000004395 L-leucine Substances 0.000 description 1
- 235000019454 L-leucine Nutrition 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- 125000000393 L-methionino group Chemical group [H]OC(=O)[C@@]([H])(N([H])[*])C([H])([H])C(SC([H])([H])[H])([H])[H] 0.000 description 1
- 229930182821 L-proline Natural products 0.000 description 1
- 125000000510 L-tryptophano group Chemical group [H]C1=C([H])C([H])=C2N([H])C([H])=C(C([H])([H])[C@@]([H])(C(O[H])=O)N([H])[*])C2=C1[H] 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 239000005639 Lauric acid Substances 0.000 description 1
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- PIHFVNPEAHFNLN-KKUMJFAQSA-N Leu-Cys-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N PIHFVNPEAHFNLN-KKUMJFAQSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 1
- HMDDEJADNKQTBR-BZSNNMDCSA-N Leu-His-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMDDEJADNKQTBR-BZSNNMDCSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- OTXBNHIUIHNGAO-UWVGGRQHSA-N Leu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN OTXBNHIUIHNGAO-UWVGGRQHSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 1
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 1
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- XGDCYUQSFDQISZ-BQBZGAKWSA-N Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O XGDCYUQSFDQISZ-BQBZGAKWSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- HOMFINRJHIIZNJ-HOCLYGCPSA-N Leu-Trp-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O HOMFINRJHIIZNJ-HOCLYGCPSA-N 0.000 description 1
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 1
- SXOFUVGLPHCPRQ-KKUMJFAQSA-N Leu-Tyr-Cys Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(O)=O SXOFUVGLPHCPRQ-KKUMJFAQSA-N 0.000 description 1
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 1
- SEOXPEFQEOYURL-PMVMPFDFSA-N Leu-Tyr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SEOXPEFQEOYURL-PMVMPFDFSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- QQXJROOJCMIHIV-AVGNSLFASA-N Leu-Val-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O QQXJROOJCMIHIV-AVGNSLFASA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 235000021353 Lignoceric acid Nutrition 0.000 description 1
- CQXMAMUUWHYSIY-UHFFFAOYSA-N Lignoceric acid Natural products CCCCCCCCCCCCCCCCCCCCCCCC(=O)OCCC1=CC=C(O)C=C1 CQXMAMUUWHYSIY-UHFFFAOYSA-N 0.000 description 1
- 241000227653 Lycopersicon Species 0.000 description 1
- 235000002262 Lycopersicon Nutrition 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- BTSXLXFPMZXVPR-DLOVCJGASA-N Lys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BTSXLXFPMZXVPR-DLOVCJGASA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- NPBGTPKLVJEOBE-IUCAKERBSA-N Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N NPBGTPKLVJEOBE-IUCAKERBSA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 1
- WLCYCADOWRMSAJ-CIUDSAMLSA-N Lys-Asn-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O WLCYCADOWRMSAJ-CIUDSAMLSA-N 0.000 description 1
- YKIRNDPUWONXQN-GUBZILKMSA-N Lys-Asn-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKIRNDPUWONXQN-GUBZILKMSA-N 0.000 description 1
- MKBIVWXCFINCLE-SRVKXCTJSA-N Lys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N MKBIVWXCFINCLE-SRVKXCTJSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- HGZHSNBZDOLMLH-DCAQKATOSA-N Lys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N HGZHSNBZDOLMLH-DCAQKATOSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 1
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 1
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- MWVUEPNEPWMFBD-SRVKXCTJSA-N Lys-Cys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCCN MWVUEPNEPWMFBD-SRVKXCTJSA-N 0.000 description 1
- WTZUSCUIVPVCRH-SRVKXCTJSA-N Lys-Gln-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WTZUSCUIVPVCRH-SRVKXCTJSA-N 0.000 description 1
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 1
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 1
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 1
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- GFWLIJDQILOEPP-HSCHXYMDSA-N Lys-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N GFWLIJDQILOEPP-HSCHXYMDSA-N 0.000 description 1
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- JQSIGLHQNSZZRL-KKUMJFAQSA-N Lys-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N JQSIGLHQNSZZRL-KKUMJFAQSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- ZCWWVXAXWUAEPZ-SRVKXCTJSA-N Lys-Met-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZCWWVXAXWUAEPZ-SRVKXCTJSA-N 0.000 description 1
- DAHQKYYIXPBESV-UWVGGRQHSA-N Lys-Met-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O DAHQKYYIXPBESV-UWVGGRQHSA-N 0.000 description 1
- WWEWGPOLIJXGNX-XUXIUFHCSA-N Lys-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N WWEWGPOLIJXGNX-XUXIUFHCSA-N 0.000 description 1
- INMBONMDMGPADT-AVGNSLFASA-N Lys-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N INMBONMDMGPADT-AVGNSLFASA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- BPDXWKVZNCKUGG-BZSNNMDCSA-N Lys-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCCN)N BPDXWKVZNCKUGG-BZSNNMDCSA-N 0.000 description 1
- IPTUBUUIFRZMJK-ACRUOGEOSA-N Lys-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 IPTUBUUIFRZMJK-ACRUOGEOSA-N 0.000 description 1
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- CTJUSALVKAWFFU-CIUDSAMLSA-N Lys-Ser-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N CTJUSALVKAWFFU-CIUDSAMLSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- DYJOORGDQIGZAS-DCAQKATOSA-N Lys-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N DYJOORGDQIGZAS-DCAQKATOSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- KTINOHQFVVCEGQ-XIRDDKMYSA-N Lys-Trp-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(O)=O)C(O)=O KTINOHQFVVCEGQ-XIRDDKMYSA-N 0.000 description 1
- ZFNYWKHYUMEZDZ-WDSOQIARSA-N Lys-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N ZFNYWKHYUMEZDZ-WDSOQIARSA-N 0.000 description 1
- ZVZRQKJOQQAFCF-ULQDDVLXSA-N Lys-Tyr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZVZRQKJOQQAFCF-ULQDDVLXSA-N 0.000 description 1
- YQAIUOWPSUOINN-IUCAKERBSA-N Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN YQAIUOWPSUOINN-IUCAKERBSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- 101150115300 MAC1 gene Proteins 0.000 description 1
- 241000121629 Majorana Species 0.000 description 1
- 235000011430 Malus pumila Nutrition 0.000 description 1
- 235000015103 Malus silvestris Nutrition 0.000 description 1
- 235000014826 Mangifera indica Nutrition 0.000 description 1
- 240000007228 Mangifera indica Species 0.000 description 1
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- 241000557163 Marchantia paleacea Species 0.000 description 1
- 241000334092 Marchantia paleacea subsp. diptera Species 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- PWPBGAJJYJJVPI-PJODQICGSA-N Met-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 PWPBGAJJYJJVPI-PJODQICGSA-N 0.000 description 1
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 1
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 1
- NSGXXVIHCIAISP-CIUDSAMLSA-N Met-Asn-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O NSGXXVIHCIAISP-CIUDSAMLSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- MCNGIXXCMJAURZ-VEVYYDQMSA-N Met-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)N)O MCNGIXXCMJAURZ-VEVYYDQMSA-N 0.000 description 1
- FVKRQMQQFGBXHV-QXEWZRGKSA-N Met-Asp-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FVKRQMQQFGBXHV-QXEWZRGKSA-N 0.000 description 1
- YLLWCSDBVGZLOW-CIUDSAMLSA-N Met-Gln-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O YLLWCSDBVGZLOW-CIUDSAMLSA-N 0.000 description 1
- OXHSZBRPUGNMKW-DCAQKATOSA-N Met-Gln-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OXHSZBRPUGNMKW-DCAQKATOSA-N 0.000 description 1
- GXYYFDKJHLRNSI-SRVKXCTJSA-N Met-Gln-His Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GXYYFDKJHLRNSI-SRVKXCTJSA-N 0.000 description 1
- RZJOHSFAEZBWLK-CIUDSAMLSA-N Met-Gln-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N RZJOHSFAEZBWLK-CIUDSAMLSA-N 0.000 description 1
- STTRPDDKDVKIDF-KKUMJFAQSA-N Met-Glu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 STTRPDDKDVKIDF-KKUMJFAQSA-N 0.000 description 1
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 description 1
- BCRQJDMZQUHQSV-STQMWFEESA-N Met-Gly-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BCRQJDMZQUHQSV-STQMWFEESA-N 0.000 description 1
- NHDMNXBBSGVYGP-PYJNHQTQSA-N Met-His-Ile Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)CC1=CN=CN1 NHDMNXBBSGVYGP-PYJNHQTQSA-N 0.000 description 1
- ZEVPMOHYCQFWSE-NAKRPEOUSA-N Met-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N ZEVPMOHYCQFWSE-NAKRPEOUSA-N 0.000 description 1
- ORRNBLTZBBESPN-HJWJTTGWSA-N Met-Ile-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ORRNBLTZBBESPN-HJWJTTGWSA-N 0.000 description 1
- ZIIMORLEZLVRIP-SRVKXCTJSA-N Met-Leu-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZIIMORLEZLVRIP-SRVKXCTJSA-N 0.000 description 1
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 1
- MSSJHBAKDDIRMJ-SRVKXCTJSA-N Met-Lys-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MSSJHBAKDDIRMJ-SRVKXCTJSA-N 0.000 description 1
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 1
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 1
- WXUUEPIDLLQBLJ-DCAQKATOSA-N Met-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N WXUUEPIDLLQBLJ-DCAQKATOSA-N 0.000 description 1
- CRVSHEPROQHVQT-AVGNSLFASA-N Met-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N CRVSHEPROQHVQT-AVGNSLFASA-N 0.000 description 1
- OBPCXINRFKHSRY-SDDRHHMPSA-N Met-Met-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N OBPCXINRFKHSRY-SDDRHHMPSA-N 0.000 description 1
- FBLBCGLSRXBANI-KKUMJFAQSA-N Met-Phe-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FBLBCGLSRXBANI-KKUMJFAQSA-N 0.000 description 1
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 1
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 1
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 1
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 1
- WXXNVZMWHOLNRJ-AVGNSLFASA-N Met-Pro-Lys Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O WXXNVZMWHOLNRJ-AVGNSLFASA-N 0.000 description 1
- FNYBIOGBMWFQRJ-SRVKXCTJSA-N Met-Pro-Met Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N FNYBIOGBMWFQRJ-SRVKXCTJSA-N 0.000 description 1
- KVNOBVKRBOYSIV-SZMVWBNQSA-N Met-Pro-Trp Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N KVNOBVKRBOYSIV-SZMVWBNQSA-N 0.000 description 1
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 1
- FDGAMQVRGORBDV-GUBZILKMSA-N Met-Ser-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCSC FDGAMQVRGORBDV-GUBZILKMSA-N 0.000 description 1
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 1
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- XTSBLBXAUIBMLW-KKUMJFAQSA-N Met-Tyr-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N XTSBLBXAUIBMLW-KKUMJFAQSA-N 0.000 description 1
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 1
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- XUYPXLNMDZIRQH-LURJTMIESA-N N-acetyl-L-methionine Chemical compound CSCC[C@@H](C(O)=O)NC(C)=O XUYPXLNMDZIRQH-LURJTMIESA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 239000005642 Oleic acid Substances 0.000 description 1
- ZQPPMHVWECSIRJ-UHFFFAOYSA-N Oleic acid Natural products CCCCCCCCC=CCCCCCCCC(O)=O ZQPPMHVWECSIRJ-UHFFFAOYSA-N 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- AHLPHDHHMVZTML-UHFFFAOYSA-N Orn-delta-NH2 Natural products NCCCC(N)C(O)=O AHLPHDHHMVZTML-UHFFFAOYSA-N 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 235000010617 Phaseolus lunatus Nutrition 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 1
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 1
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 1
- UEEVBGHEGJMDDV-AVGNSLFASA-N Phe-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEEVBGHEGJMDDV-AVGNSLFASA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- VUYCNYVLKACHPA-KKUMJFAQSA-N Phe-Asp-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VUYCNYVLKACHPA-KKUMJFAQSA-N 0.000 description 1
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 1
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 1
- VJLLEKDQJSMHRU-STQMWFEESA-N Phe-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O VJLLEKDQJSMHRU-STQMWFEESA-N 0.000 description 1
- RVRRHFPCEOVRKQ-KKUMJFAQSA-N Phe-His-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RVRRHFPCEOVRKQ-KKUMJFAQSA-N 0.000 description 1
- HTXVATDVCRFORF-MGHWNKPDSA-N Phe-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N HTXVATDVCRFORF-MGHWNKPDSA-N 0.000 description 1
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- NRKNYPRRWXVELC-NQCBNZPSSA-N Phe-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=CC=C3)N NRKNYPRRWXVELC-NQCBNZPSSA-N 0.000 description 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 1
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 1
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 1
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 1
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 1
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 1
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 1
- FENSZYFJQOFSQR-FIRPJDEBSA-N Phe-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FENSZYFJQOFSQR-FIRPJDEBSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- NJONQBYLTANINY-IHPCNDPISA-N Phe-Trp-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(N)=O)C(O)=O NJONQBYLTANINY-IHPCNDPISA-N 0.000 description 1
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 1
- CVAUVSOFHJKCHN-BZSNNMDCSA-N Phe-Tyr-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=CC=C1 CVAUVSOFHJKCHN-BZSNNMDCSA-N 0.000 description 1
- QUUCAHIYARMNBL-FHWLQOOXSA-N Phe-Tyr-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N QUUCAHIYARMNBL-FHWLQOOXSA-N 0.000 description 1
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 1
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 1
- 241001092090 Pittosporum Species 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- KIGGUSRFHJCIEJ-DCAQKATOSA-N Pro-Asp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O KIGGUSRFHJCIEJ-DCAQKATOSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 1
- ODPIUQVTULPQEP-CIUDSAMLSA-N Pro-Gln-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ODPIUQVTULPQEP-CIUDSAMLSA-N 0.000 description 1
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 1
- CMOIIANLNNYUTP-SRVKXCTJSA-N Pro-Gln-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CMOIIANLNNYUTP-SRVKXCTJSA-N 0.000 description 1
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 1
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- MZNUJZBYRWXWLQ-AVGNSLFASA-N Pro-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 MZNUJZBYRWXWLQ-AVGNSLFASA-N 0.000 description 1
- DYMPSOABVJIFBS-IHRRRGAJSA-N Pro-Phe-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CS)C(=O)O DYMPSOABVJIFBS-IHRRRGAJSA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- CZCCVJUUWBMISW-FXQIFTODSA-N Pro-Ser-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O CZCCVJUUWBMISW-FXQIFTODSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- VVEQUISRWJDGMX-VKOGCVSHSA-N Pro-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 VVEQUISRWJDGMX-VKOGCVSHSA-N 0.000 description 1
- DIDLUFMLRUJLFB-FKBYEOEOSA-N Pro-Trp-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CC=C(C=C4)O)C(=O)O DIDLUFMLRUJLFB-FKBYEOEOSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 102100022877 Protein HID1 Human genes 0.000 description 1
- 241001290151 Prunus avium subsp. avium Species 0.000 description 1
- 235000014443 Pyrus communis Nutrition 0.000 description 1
- AAWZDTNXLSGCEK-ZHQZDSKASA-N Quinic acid Natural products O[C@H]1CC(O)(C(O)=O)C[C@H](O)C1O AAWZDTNXLSGCEK-ZHQZDSKASA-N 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 241000220259 Raphanus Species 0.000 description 1
- 108091027981 Response element Proteins 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- JVWLUVNSQYXYBE-UHFFFAOYSA-N Ribitol Natural products OCC(C)C(O)C(O)CO JVWLUVNSQYXYBE-UHFFFAOYSA-N 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- YGSDEFSMJLZEOE-UHFFFAOYSA-N Salicylic acid Natural products OC(=O)C1=CC=CC=C1O YGSDEFSMJLZEOE-UHFFFAOYSA-N 0.000 description 1
- 235000007238 Secale cereale Nutrition 0.000 description 1
- 108091081021 Sense strand Proteins 0.000 description 1
- 229920002684 Sepharose Polymers 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- DBIDZNUXSLXVRG-FXQIFTODSA-N Ser-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N DBIDZNUXSLXVRG-FXQIFTODSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- ZHYMUFQVKGJNRM-ZLUOBGJFSA-N Ser-Cys-Asn Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O ZHYMUFQVKGJNRM-ZLUOBGJFSA-N 0.000 description 1
- WTPKKLMBNBCCNL-ACZMJKKPSA-N Ser-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N WTPKKLMBNBCCNL-ACZMJKKPSA-N 0.000 description 1
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 1
- KMWFXJCGRXBQAC-CIUDSAMLSA-N Ser-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N KMWFXJCGRXBQAC-CIUDSAMLSA-N 0.000 description 1
- XSYJDGIDKRNWFX-SRVKXCTJSA-N Ser-Cys-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XSYJDGIDKRNWFX-SRVKXCTJSA-N 0.000 description 1
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 1
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 1
- VMVNCJDKFOQOHM-GUBZILKMSA-N Ser-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N VMVNCJDKFOQOHM-GUBZILKMSA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- UICKAKRRRBTILH-GUBZILKMSA-N Ser-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N UICKAKRRRBTILH-GUBZILKMSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 1
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- SRKMDKACHDVPMD-SRVKXCTJSA-N Ser-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N SRKMDKACHDVPMD-SRVKXCTJSA-N 0.000 description 1
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 1
- JUTGONBTALQWMK-NAKRPEOUSA-N Ser-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N JUTGONBTALQWMK-NAKRPEOUSA-N 0.000 description 1
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- QPPYAWVLAVXISR-DCAQKATOSA-N Ser-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QPPYAWVLAVXISR-DCAQKATOSA-N 0.000 description 1
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 1
- BVLGVLWFIZFEAH-BPUTZDHNSA-N Ser-Pro-Trp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BVLGVLWFIZFEAH-BPUTZDHNSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- LDEBVRIURYMKQS-WISUUJSJSA-N Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO LDEBVRIURYMKQS-WISUUJSJSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- UQGAAZXSCGWMFU-UBHSHLNASA-N Ser-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N UQGAAZXSCGWMFU-UBHSHLNASA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 1
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 1
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 1
- OSFZCEQJLWCIBG-BZSNNMDCSA-N Ser-Tyr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSFZCEQJLWCIBG-BZSNNMDCSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000220261 Sinapis Species 0.000 description 1
- LGJMUZUPVCAVPU-JFBKYFIKSA-N Sitostanol Natural products O[C@@H]1C[C@H]2[C@@](C)([C@@H]3[C@@H]([C@H]4[C@@](C)([C@@H]([C@@H](CC[C@H](C(C)C)CC)C)CC4)CC3)CC2)CC1 LGJMUZUPVCAVPU-JFBKYFIKSA-N 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 235000009184 Spondias indica Nutrition 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 235000021355 Stearic acid Nutrition 0.000 description 1
- 229930182558 Sterol Natural products 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 101150088517 TCTA gene Proteins 0.000 description 1
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 1
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 1
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 1
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- DCCGCVLVVSAJFK-NUMRIWBASA-N Thr-Asp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O DCCGCVLVVSAJFK-NUMRIWBASA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- ODSAPYVQSLDRSR-LKXGYXEUSA-N Thr-Cys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O ODSAPYVQSLDRSR-LKXGYXEUSA-N 0.000 description 1
- YAAPRMFURSENOZ-KATARQTJSA-N Thr-Cys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O YAAPRMFURSENOZ-KATARQTJSA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- XUGYQLFEJYZOKQ-NGTWOADLSA-N Thr-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XUGYQLFEJYZOKQ-NGTWOADLSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- PUEWAXRPXOEQOW-HJGDQZAQSA-N Thr-Met-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O PUEWAXRPXOEQOW-HJGDQZAQSA-N 0.000 description 1
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 1
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 1
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 1
- DNCUODYZAMHLCV-XGEHTFHBSA-N Thr-Pro-Cys Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N)O DNCUODYZAMHLCV-XGEHTFHBSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 1
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 1
- NBIIPOKZPUGATB-BWBBJGPYSA-N Thr-Ser-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O NBIIPOKZPUGATB-BWBBJGPYSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 1
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- 241000219793 Trifolium Species 0.000 description 1
- 241001312519 Trigonella Species 0.000 description 1
- GHXXDFDIDHIEIL-WFBYXXMGSA-N Trp-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GHXXDFDIDHIEIL-WFBYXXMGSA-N 0.000 description 1
- CXUFDWZBHKUGKK-CABZTGNLSA-N Trp-Ala-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O)=CNC2=C1 CXUFDWZBHKUGKK-CABZTGNLSA-N 0.000 description 1
- MJBBMTOGSOSAKJ-HJXMPXNTSA-N Trp-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MJBBMTOGSOSAKJ-HJXMPXNTSA-N 0.000 description 1
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 1
- TZNNEYFZZAHLBL-BPUTZDHNSA-N Trp-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O TZNNEYFZZAHLBL-BPUTZDHNSA-N 0.000 description 1
- LAIUAVGWZYTBKN-VHWLVUOQSA-N Trp-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O LAIUAVGWZYTBKN-VHWLVUOQSA-N 0.000 description 1
- LHHDBONOFZDWMW-AAEUAGOBSA-N Trp-Asp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LHHDBONOFZDWMW-AAEUAGOBSA-N 0.000 description 1
- JZHJLBPBQKPTNX-UBHSHLNASA-N Trp-Cys-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 JZHJLBPBQKPTNX-UBHSHLNASA-N 0.000 description 1
- DZIKVMCFXIIETR-JSGCOSHPSA-N Trp-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O DZIKVMCFXIIETR-JSGCOSHPSA-N 0.000 description 1
- WKCFCVBOFKEVKY-HSCHXYMDSA-N Trp-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WKCFCVBOFKEVKY-HSCHXYMDSA-N 0.000 description 1
- UMIACFRBELJMGT-GQGQLFGLSA-N Trp-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UMIACFRBELJMGT-GQGQLFGLSA-N 0.000 description 1
- XXJDYWYVZBHELV-TUSQITKMSA-N Trp-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)N[C@@H](CCCCN)C(=O)O)N XXJDYWYVZBHELV-TUSQITKMSA-N 0.000 description 1
- TYYLDKGBCJGJGW-WMZOPIPTSA-N Trp-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 TYYLDKGBCJGJGW-WMZOPIPTSA-N 0.000 description 1
- BABINGWMZBWXIX-BPUTZDHNSA-N Trp-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BABINGWMZBWXIX-BPUTZDHNSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 239000006035 Tryptophane Substances 0.000 description 1
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 1
- 102000000852 Tumor Necrosis Factor-alpha Human genes 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- SDNVRAKIJVKAGS-LKTVYLICSA-N Tyr-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N SDNVRAKIJVKAGS-LKTVYLICSA-N 0.000 description 1
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 1
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 1
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 1
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 1
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 1
- QNJYPWZACBACER-KKUMJFAQSA-N Tyr-Asp-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O QNJYPWZACBACER-KKUMJFAQSA-N 0.000 description 1
- NLMXVDDEQFKQQU-CFMVVWHZSA-N Tyr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLMXVDDEQFKQQU-CFMVVWHZSA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- ZAGPDPNPWYPEIR-SRVKXCTJSA-N Tyr-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O ZAGPDPNPWYPEIR-SRVKXCTJSA-N 0.000 description 1
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 1
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 1
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 1
- LHTGRUZSZOIAKM-SOUVJXGZSA-N Tyr-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O LHTGRUZSZOIAKM-SOUVJXGZSA-N 0.000 description 1
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 1
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 1
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 1
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- JJNXZIPLIXIGBX-HJPIBITLSA-N Tyr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JJNXZIPLIXIGBX-HJPIBITLSA-N 0.000 description 1
- HHFMNAVFGBYSAT-IGISWZIWSA-N Tyr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HHFMNAVFGBYSAT-IGISWZIWSA-N 0.000 description 1
- FJBCEFPCVPHPPM-STECZYCISA-N Tyr-Ile-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O FJBCEFPCVPHPPM-STECZYCISA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 1
- KGSDLCMCDFETHU-YESZJQIVSA-N Tyr-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O KGSDLCMCDFETHU-YESZJQIVSA-N 0.000 description 1
- FWOVTJKVUCGVND-UFYCRDLUSA-N Tyr-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FWOVTJKVUCGVND-UFYCRDLUSA-N 0.000 description 1
- UPODKYBYUBTWSV-BZSNNMDCSA-N Tyr-Phe-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=C(O)C=C1 UPODKYBYUBTWSV-BZSNNMDCSA-N 0.000 description 1
- LRHBBGDMBLFYGL-FHWLQOOXSA-N Tyr-Phe-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LRHBBGDMBLFYGL-FHWLQOOXSA-N 0.000 description 1
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 1
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 1
- SOEGLGLDSUHWTI-STECZYCISA-N Tyr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 SOEGLGLDSUHWTI-STECZYCISA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 1
- LVILBTSHPTWDGE-PMVMPFDFSA-N Tyr-Trp-Lys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(O)=O)C1=CC=C(O)C=C1 LVILBTSHPTWDGE-PMVMPFDFSA-N 0.000 description 1
- LYPKCSYAKLTBHJ-ILWGZMRPSA-N Tyr-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CC=C(C=C4)O)N)C(=O)O LYPKCSYAKLTBHJ-ILWGZMRPSA-N 0.000 description 1
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 1
- ANHVRCNNGJMJNG-BZSNNMDCSA-N Tyr-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CS)C(=O)O)N)O ANHVRCNNGJMJNG-BZSNNMDCSA-N 0.000 description 1
- UUJHRSTVQCFDPA-UFYCRDLUSA-N Tyr-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 UUJHRSTVQCFDPA-UFYCRDLUSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- WOCYUGQDXPTQPY-FXQIFTODSA-N Val-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N WOCYUGQDXPTQPY-FXQIFTODSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- JOQSQZFKFYJKKJ-GUBZILKMSA-N Val-Arg-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JOQSQZFKFYJKKJ-GUBZILKMSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 1
- IVXJODPZRWHCCR-JYJNAYRXSA-N Val-Arg-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IVXJODPZRWHCCR-JYJNAYRXSA-N 0.000 description 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- CWOSXNKDOACNJN-BZSNNMDCSA-N Val-Arg-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N CWOSXNKDOACNJN-BZSNNMDCSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 1
- DCOOGDCRFXXQNW-ZKWXMUAHSA-N Val-Asn-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DCOOGDCRFXXQNW-ZKWXMUAHSA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- ZSZFTYVFQLUWBF-QXEWZRGKSA-N Val-Asp-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N ZSZFTYVFQLUWBF-QXEWZRGKSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- YLHLNFUXDBOAGX-DCAQKATOSA-N Val-Cys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YLHLNFUXDBOAGX-DCAQKATOSA-N 0.000 description 1
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 1
- OUUBKKIJQIAPRI-LAEOZQHASA-N Val-Gln-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OUUBKKIJQIAPRI-LAEOZQHASA-N 0.000 description 1
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 1
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 1
- AAOPYWQQBXHINJ-DZKIICNBSA-N Val-Gln-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AAOPYWQQBXHINJ-DZKIICNBSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 1
- OFQGGTGZTOTLGH-NHCYSSNCSA-N Val-Met-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N OFQGGTGZTOTLGH-NHCYSSNCSA-N 0.000 description 1
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 1
- RSGHLMMKXJGCMK-JYJNAYRXSA-N Val-Met-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N RSGHLMMKXJGCMK-JYJNAYRXSA-N 0.000 description 1
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- ODUHAIXFXFACDY-SRVKXCTJSA-N Val-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C ODUHAIXFXFACDY-SRVKXCTJSA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Chemical compound CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 240000004922 Vigna radiata Species 0.000 description 1
- 235000010721 Vigna radiata var radiata Nutrition 0.000 description 1
- 235000011469 Vigna radiata var sublobata Nutrition 0.000 description 1
- TVXBFESIOXBWNM-UHFFFAOYSA-N Xylitol Natural products OCCC(O)C(O)C(O)CCO TVXBFESIOXBWNM-UHFFFAOYSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- UELITFHSCLAHKR-UHFFFAOYSA-N acibenzolar-S-methyl Chemical compound CSC(=O)C1=CC=CC2=C1SN=N2 UELITFHSCLAHKR-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010066875 alanyl-prolyl-tryptophyl-cysteine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- 229930013930 alkaloid Natural products 0.000 description 1
- 150000003797 alkaloid derivatives Chemical class 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- WQZGKKKJIJFFOK-PQMKYFCFSA-N alpha-D-mannose Chemical compound OC[C@H]1O[C@H](O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-PQMKYFCFSA-N 0.000 description 1
- SRBFZHDQGSBBOR-LECHCGJUSA-N alpha-D-xylose Chemical compound O[C@@H]1CO[C@H](O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-LECHCGJUSA-N 0.000 description 1
- SRBFZHDQGSBBOR-QMKXCQHVSA-N alpha-L-arabinopyranose Chemical compound O[C@H]1CO[C@@H](O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-QMKXCQHVSA-N 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- WPYMKLBDIGXBTP-UHFFFAOYSA-N benzoic acid Chemical compound OC(=O)C1=CC=CC=C1 WPYMKLBDIGXBTP-UHFFFAOYSA-N 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- MJVXAPPOFPTTCA-UHFFFAOYSA-N beta-Sistosterol Natural products CCC(CCC(C)C1CCC2C3CC=C4C(C)C(O)CCC4(C)C3CCC12C)C(C)C MJVXAPPOFPTTCA-UHFFFAOYSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- NJKOMDUNNDKEAI-UHFFFAOYSA-N beta-sitosterol Natural products CCC(CCC(C)C1CCC2(C)C3CC=C4CC(O)CCC4C3CCC12C)C(C)C NJKOMDUNNDKEAI-UHFFFAOYSA-N 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- SGNBVLSWZMBQTH-PODYLUTMSA-N campesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CC[C@@H](C)C(C)C)[C@@]1(C)CC2 SGNBVLSWZMBQTH-PODYLUTMSA-N 0.000 description 1
- 235000000431 campesterol Nutrition 0.000 description 1
- KHAVLLBUVKBTBG-UHFFFAOYSA-N caproleic acid Natural products OC(=O)CCCCCCCC=C KHAVLLBUVKBTBG-UHFFFAOYSA-N 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 229960000541 cetyl alcohol Drugs 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 235000019693 cherries Nutrition 0.000 description 1
- 235000012000 cholesterol Nutrition 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000000749 co-immunoprecipitation Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 238000005336 cracking Methods 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 230000017858 demethylation Effects 0.000 description 1
- 238000010520 demethylation reaction Methods 0.000 description 1
- 108010033011 des-Arg- enterostatin Proteins 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 108010009297 diglycyl-histidine Proteins 0.000 description 1
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N dodecahydrosqualene Natural products CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- FARYTWBWLZAXNK-WAYWQWQTSA-N ethyl (z)-3-(methylamino)but-2-enoate Chemical compound CCOC(=O)\C=C(\C)NC FARYTWBWLZAXNK-WAYWQWQTSA-N 0.000 description 1
- 239000004744 fabric Substances 0.000 description 1
- 235000019197 fats Nutrition 0.000 description 1
- 239000001530 fumaric acid Substances 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 238000002290 gas chromatography-mass spectrometry Methods 0.000 description 1
- 239000000174 gluconic acid Substances 0.000 description 1
- 235000012208 gluconic acid Nutrition 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 1
- 108010037389 glutamyl-cysteinyl-lysine Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 229960002885 histidine Drugs 0.000 description 1
- 238000007901 in situ hybridization Methods 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- CDAISMWEOUEBRE-GPIVLXJGSA-N inositol Chemical compound O[C@H]1[C@H](O)[C@@H](O)[C@H](O)[C@H](O)[C@@H]1O CDAISMWEOUEBRE-GPIVLXJGSA-N 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 1
- DLRVVLDZNNYCBX-RTPHMHGBSA-N isomaltose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OC[C@@H]1[C@@H](O)[C@H](O)[C@@H](O)C(O)O1 DLRVVLDZNNYCBX-RTPHMHGBSA-N 0.000 description 1
- QXJSBBXBKPUZAA-UHFFFAOYSA-N isooleic acid Natural products CCCCCCCC=CCCCCCCCCC(O)=O QXJSBBXBKPUZAA-UHFFFAOYSA-N 0.000 description 1
- 108010043612 kentsin Proteins 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 229960000448 lactic acid Drugs 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- XIXADJRWDQXREU-UHFFFAOYSA-M lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 1
- 235000005739 manihot Nutrition 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 235000019713 millet Nutrition 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- VYQNWZOUAUKGHI-UHFFFAOYSA-N monobenzone Chemical compound C1=CC(O)=CC=C1OCC1=CC=CC=C1 VYQNWZOUAUKGHI-UHFFFAOYSA-N 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 239000002417 nutraceutical Substances 0.000 description 1
- 235000021436 nutraceutical agent Nutrition 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- OQCDKBAXFALNLD-UHFFFAOYSA-N octadecanoic acid Natural products CCCCCCCC(C)CCCCCCCCC(O)=O OQCDKBAXFALNLD-UHFFFAOYSA-N 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 229960003104 ornithine Drugs 0.000 description 1
- 235000006408 oxalic acid Nutrition 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 235000021017 pears Nutrition 0.000 description 1
- 229960005190 phenylalanine Drugs 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 239000002574 poison Substances 0.000 description 1
- 231100000614 poison Toxicity 0.000 description 1
- 229960001109 policosanol Drugs 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000007639 printing Methods 0.000 description 1
- 229960002429 proline Drugs 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 229940107700 pyruvic acid Drugs 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 230000008844 regulatory mechanism Effects 0.000 description 1
- 230000001850 reproductive effect Effects 0.000 description 1
- HEBKCHPVOIAQTA-ZXFHETKHSA-N ribitol Chemical compound OC[C@H](O)[C@H](O)[C@H](O)CO HEBKCHPVOIAQTA-ZXFHETKHSA-N 0.000 description 1
- 229960004889 salicylic acid Drugs 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- JXOHGGNKMLTUBP-HSUXUTPPSA-N shikimic acid Chemical compound O[C@@H]1CC(C(O)=O)=C[C@@H](O)[C@H]1O JXOHGGNKMLTUBP-HSUXUTPPSA-N 0.000 description 1
- JXOHGGNKMLTUBP-JKUQZMGJSA-N shikimic acid Natural products O[C@@H]1CC(C(O)=O)=C[C@H](O)[C@@H]1O JXOHGGNKMLTUBP-JKUQZMGJSA-N 0.000 description 1
- PCMORTLOPMLEFB-ONEGZZNKSA-N sinapic acid Chemical compound COC1=CC(\C=C\C(O)=O)=CC(OC)=C1O PCMORTLOPMLEFB-ONEGZZNKSA-N 0.000 description 1
- PCMORTLOPMLEFB-UHFFFAOYSA-N sinapinic acid Natural products COC1=CC(C=CC(O)=O)=CC(OC)=C1O PCMORTLOPMLEFB-UHFFFAOYSA-N 0.000 description 1
- KZJWDPNRJALLNS-VJSFXXLFSA-N sitosterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CC[C@@H](CC)C(C)C)[C@@]1(C)CC2 KZJWDPNRJALLNS-VJSFXXLFSA-N 0.000 description 1
- 235000015500 sitosterol Nutrition 0.000 description 1
- 229950005143 sitosterol Drugs 0.000 description 1
- NLQLSVXGSXCXFE-UHFFFAOYSA-N sitosterol Natural products CC=C(/CCC(C)C1CC2C3=CCC4C(C)C(O)CCC4(C)C3CCC2(C)C1)C(C)C NLQLSVXGSXCXFE-UHFFFAOYSA-N 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000012064 sodium phosphate buffer Substances 0.000 description 1
- 229940031439 squalene Drugs 0.000 description 1
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008117 stearic acid Substances 0.000 description 1
- 150000003432 sterols Chemical class 0.000 description 1
- 235000003702 sterols Nutrition 0.000 description 1
- LGJMUZUPVCAVPU-HRJGVYIJSA-N stigmastanol Chemical compound C([C@@H]1CC2)[C@@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@H](C)CC[C@@H](CC)C(C)C)[C@@]2(C)CC1 LGJMUZUPVCAVPU-HRJGVYIJSA-N 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 150000003505 terpenes Chemical class 0.000 description 1
- QZZGJDVWLFXDLK-UHFFFAOYSA-N tetracosanoic acid Chemical compound CCCCCCCCCCCCCCCCCCCCCCCC(O)=O QZZGJDVWLFXDLK-UHFFFAOYSA-N 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- ODBLHEXUDAPZAU-UHFFFAOYSA-N threo-D-isocitric acid Natural products OC(=O)C(O)C(C(O)=O)CC(O)=O ODBLHEXUDAPZAU-UHFFFAOYSA-N 0.000 description 1
- 230000025366 tissue development Effects 0.000 description 1
- 230000017423 tissue regeneration Effects 0.000 description 1
- VZCYOOQTPOCHFL-UHFFFAOYSA-N trans-butenedioic acid Natural products OC(=O)C=CC(O)=O VZCYOOQTPOCHFL-UHFFFAOYSA-N 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- WCTAGTRAWPDFQO-UHFFFAOYSA-K trisodium;hydrogen carbonate;carbonate Chemical compound [Na+].[Na+].[Na+].OC([O-])=O.[O-]C([O-])=O WCTAGTRAWPDFQO-UHFFFAOYSA-K 0.000 description 1
- 108010014563 tryptophyl-cysteinyl-serine Proteins 0.000 description 1
- 108010029384 tryptophyl-histidine Proteins 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 229960002703 undecylenic acid Drugs 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 238000005303 weighing Methods 0.000 description 1
- 239000000811 xylitol Substances 0.000 description 1
- 235000010447 xylitol Nutrition 0.000 description 1
- HEBKCHPVOIAQTA-SCDXWVJYSA-N xylitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)CO HEBKCHPVOIAQTA-SCDXWVJYSA-N 0.000 description 1
- 229960002675 xylitol Drugs 0.000 description 1
- 229960003487 xylose Drugs 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
- C12N15/8218—Antisense, co-suppression, viral induced gene silencing [VIGS], post-transcriptional induced gene silencing [PTGS]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- Chemical & Material Sciences (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Cell Biology (AREA)
- Virology (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
本发明公开了表达胞嘧啶DNA甲基转移酶并且可以用于赋予改变的种子表型,例如,种子重量增加的植物。还公开了这样的植物,其中内源胞嘧啶DNA甲基转移酶的表达被抑制并且显示了改变的种子表型,例如种子重量的增加。还公开了适合于赋予这样的表型的核酸和多肽。
Description
对相关申请的交叉参考
本申请要求在2003年10月14日提交的美国临时申请序列号60/510,924的35 U.S.C.§119(e)下的利益和优先权,将其全部内容并入本文作为参考。
技术领域
本发明涉及调节植物种子的表型的方法和材料。具体而言,本发明的特点是可以用于调节种子重量的核酸和植物。
背景
基因通常在生物,特别是生物的细胞的发育的过程中差异表达。可以将阐明和操控生物的暂时和空间基因表达图谱用于开发新的和被改善的生物产品。
在大量影响生物的基因表达图谱的调节机制中,基因甲基化的调节具有重要的作用。在许多情形中,通过特定的核苷酸序列的位点特异性的甲基化或去甲基化来调节基因的甲基化。
概述
本发明包括在植物的雄性配子体特异性的细胞或雌性配子体特异性细胞中调节胞嘧啶DNA甲基转移酶相关的核酸的转录和/或翻译。当在杂交中将这样的植物用作亲本时,得到的种子具有改变的种子表型,例如增加的种子重量。因此,本发明的特点是用于产生种子的方法。在一个方面,这些方法包括允许第一植物对第二植物进行授粉。所述第一植物具有第一重组核酸构建体,其包含与有效增加胞嘧啶DNA甲基化水平的第一核酸序列可操作连接的雄性配子体组织特异性调节元件。第二植物具有第二重组核酸构建体,其包含与有效减少胞嘧啶DNA甲基化的水平的第二核酸序列可操作连接的雌性配子体组织特异性调节元件。在第二植物上发育的种子的平均种子重量与在相应的对照植物上发育的种子的平均种子重量相比,重量增加,所述相应的对照植物缺乏第二重组核酸构建体,并且被缺乏第一重组核酸构建体的相应对照植物进行授粉。与在对照植物上形成的种子的平均种子重量相比,这些种子可以具有至少10%更高(例如,10%到约50%更高)的平均种子重量。
第一植物可以是近亲交配的(inbred),杂交的,异质的种群,或人造的种群(synthetic population)。第一植物对于重组核酸构建体而言可以是杂合的,或是纯合的。类似地,第二植物可以是近亲交配的,杂交的,异质的种群,或人造的种群,并且对于重组核酸构建体而言可以是纯合的,或是杂合的。第一和第二植物可以是双子叶植物。第一重组核酸构建体的核酸序列可以编码胞嘧啶DNA甲基转移酶,在所述甲基转移酶中具有一定区域,所述区域具有在SEQ ID NO:50中提出的共有序列。胞嘧啶DNA甲基转移酶可以与在SEQ ID NOS:28,30,34,36,38,和40中提出的拟南芥(Arabidopsis),桃,豌豆,胡萝卜,西红柿,或烟草的氨基酸序列之一具有50%或更多的序列同一性。第二重组核酸构建体的第二核酸序列可以被转录成干扰RNA或反义核酸。
第一和第二植物可以是单子叶植物。第一重组核酸构建体的第一核酸序列可以编码胞嘧啶DNA甲基转移酶,其与在SEQ ID NOS:44和46中显示的玉米或水稻胞嘧啶DNA甲基转移酶的氨基酸序列具有50%或更大的序列同一性(例如,70%,80,90%,或95%)。
在另一方面,本发明的特点是用于产生种子的方法,所述方法包括容许第一植物给第二植物授粉的步骤。所述第一植物具有重组核酸构建体,其包含与有效减少胞嘧啶DNA甲基化水平的第一核酸序列可操作地连接的雄性配子体组织特异性调节元件。在第二植物上发育的种子的平均种子重量与在相应的第二植物上发育的种子的平均种子重量相比,重量减少,所述相应的第二植物被缺乏重组核酸构建体的相应第一植物进行授粉。
在另一个方面,本发明的特点是用于产生种子的方法,其包含容许具有重组核酸构建体的植物的授粉,所述核酸构建体包括与有效减少胞嘧啶DNA甲基化水平的核酸序列可操作连接的雌性配子体组织特异性调节元件。授粉用缺乏重组核酸构建体的花粉进行。在所述植物上发育的种子的平均种子重量与在相应的植物上发育的种子的平均种子重量相比,重量增加,所述相应的植物缺乏重组核酸构建体,且被缺乏重组核酸构建体的植物授粉。被授粉的植物可以是双子叶植物或单子叶植物。雌性配子体组织特异性调节元件可以是,例如,拟南芥YP0102,YP0102a或YP0285启动子,SEQ ID NOS:6,25,或22。可以将有效减少胞嘧啶DNA甲基化水平的核酸序列转录成干扰RNA或反义RNA,并且可以具有10个核苷酸到4,500个核苷酸的长度,并且可以与来自在SEQ ID NOS:29,31,33,35,37,39,41中提出的拟南芥,桃,大豆,豌豆,胡萝卜,西红柿或烟草的核酸序列之一,或这些序列之一的互补序列具有70%或更大的序列同一性。这些核酸序列可以具有从20个核苷酸到1000个核苷酸的长度,与来自拟南芥,桃,豌豆,胡萝卜,西红柿或烟草的这些相同的核酸序列,或它们的补体之一具有80%或更大的序列同一性。备选地,核酸序列可以具有10个核苷酸到4,500个核苷酸的长度并且与在SEQ ID NOS:43,45,47,49提出的小麦,玉米,水稻或苔类植物核酸序列之一,或这些序列之一的互补序列具有70%或更大的序列同一性。这些核酸序列可以具有20个核苷酸到1,000个核苷酸的长度,并且可以与来自玉米,水稻,小麦或苔类植物的这些相同的核酸序列,或它们的互补序列之一具有80%或更大的序列同一性。授粉可以以来自非转基因植物的花粉进行。
本发明还以产生种子的方法为特征,所述方法包括容许具有重组核酸构建体的植物的授粉的步骤,所述核酸构建体包括与有效增加胞嘧啶DNA甲基化水平的核酸序列可操作地连接的雌性配子体组织特异性调节元件。授粉以缺乏重组核酸构建体的花粉进行。在该植物上发育的种子的平均种子重量比在相应的植物上发育的种子的平均种子重量减少,所述相应的植物缺乏重组核酸构建体,且被缺乏重组核酸构建体的植物进行授粉。
本发明还以用于产生种子的方法为特征,所述方法包括容许第一植物给第二植物授粉的步骤。第一植物具有重组核酸构建体,所述核酸构建体包括与有效增加胞嘧啶DNA甲基化的水平的核酸序列可操作地连接的雄性配子体组织特异性调节元件。在第二植物上发育的种子的平均种子重量与在相应的植物上发育的种子的平均种子重量相比,重量增加,所述相应的植物被缺乏或不表达重组核酸构建体的植物授粉。第一和第二植物可以是双子叶植物或单子叶植物。有效增加胞嘧啶DNA甲基化水平的核酸序列可以编码胞嘧啶DNA甲基转移酶,所述甲基转移酶包括本文所述的共有多肽区域。
本发明还以用于产生种子的方法为特征,所述方法包括容许在多种植物中进行授粉的步骤,所述植物包括多种第一植物。第一植物中的每种具有第一重组核酸构建体,所述核酸构建体包括与有效增加胞嘧啶DNA甲基化水平的核酸序列可操作连接的雄性配子体组织特异性调节元件,其中在授粉后在第一植物上发育的种子的平均种子重量与在相应的植物上发育的种子的平均种子重量相比,重量增加,所述相应的植物缺乏重组核酸构建体。所述授粉可以主要是自花授粉。多种第一植物可以是双子叶植物或单子叶植物。多种植物还可以包含多种第二植物。所述第二植物具有第二重组核酸构建体,其包括与有效减少胞嘧啶DNA甲基化水平的核酸序列可操作连接的雌性配子体组织特异性调节元件。在授粉后在第二植物上发育的种子具有的平均种子重量与在相应的植物上发育的种子的平均种子重量相比,重量增加,所述相应的植物缺乏重组核酸构建体。在被授粉的植物上发育的种子所具有的平均种子重量可以比在相应的植物上发育的种子的平均种子重量至少大10%,所述相应的植物缺乏重组核酸构建体。
本发明还以转基因宿主细胞为特征,所述宿主细胞包括重组核酸构建体,所述核酸构建体包括有效减少胞嘧啶DNA甲基化水平的核酸序列。该核酸序列与一个或多个调节元件可操作地连接,所述调节元件在植物雌性配子体细胞类型中赋予转录。该调节元件可以包括在SEQ ID NOS:6到27中提出的序列之一。在另一个方面,转基因宿主细胞可以包括重组核酸构建体,所述核酸构建体包括有效减少胞嘧啶DNA甲基化的水平的核酸序列,所述核酸序列与一个或多个调节元件可操作地连接,所述调节元件在植物雄性配子体细胞类型中赋予转录。
本发明还以转基因植物为特征,所述植物包括重组核酸构建体,所述构建体包括有效减少胞嘧啶DNA甲基化水平的核酸序列。该核酸序列与一个或多个调节元件可操作地连接,所述调节元件在雌性配子体细胞类型中赋予转录。调节元件可以包括在SEQ ID NOS:6到27中提出的序列之一。一个或多个调节元件可以赋予在与卵细胞,合子和胚胎相关的极性细胞核和中央细胞中的优先转录。所述植物可以是双子叶植物或单子叶植物。有效减少胞嘧啶DNA甲基化水平的核酸序列可以被转录成干扰RNA或反义RNA。该核酸序列可以具有10个核苷酸到4,500个核苷酸的长度并且与在SEQ ID NOS:29,31,33,35,37,39,41,43,45,47,49中提出的核酸序列之一,或这些序列之一的互补序列具有70%或更多的序列同一性。例如,这些核酸可以具有20个核苷酸到1,000个核苷酸的长度,并且与这些核酸序列,或它们的互补序列之一具有80%或更多的序列同一性。
本发明还以转基因植物为特征,所述转基因植物包括重组核酸构建体,所述核酸构建体包括有效减少胞嘧啶DNA甲基化水平的核酸序列,所述核酸序列与一个或多个调节元件可操作地连接,所述调节元件在雄性配子体细胞类型中赋予转录。
本发明还以生产的制品(article)为特征,所述制品包括包装物质和在包装物质中的两种或更多类型的种子。在一些实施方案中,由第一类型的种子培养的植物在雄性配子体细胞中过量表达胞嘧啶DNA甲基转移酶。由来自第二类型的种子培养的植物可以或不可以具有重组核酸构建体,其在雌性配子体细胞中抑制胞嘧啶DNA甲基转移酶的表达。在其它实施方案中,由第一类型的种子培养的植物缺乏重组核酸,其导致胞嘧啶DNA甲基转移酶在雄性配子体细胞中的过量表达,并且由来自第二类型的种子培养的植物具有重组核酸构建体,其抑制胞嘧啶DNA甲基转移酶在雌性配子体细胞中的表达。
本发明的一个或多个实施方案的细节将在下面的附图和描述中进行阐明。除非另外指出,用于本文的所有的技术和科学术语具有与本发明所属领域的普通技术人员通常所理解的相同的含义。尽管类似的方法和材料或本文所述的那些的等价物可以用在本发明的实践或检验中,下面描述适合的方法和材料。将本文提及的所有的出版物,专利申请,专利和其它的参考文献全文并入本文作为参考。为避免冲突,本说明书,包括定义将进行控制。此外,所述材料,方法和实施例仅是举例说明性的,而不意欲进行限制。从描述和图以及从权利要求,本发明的其它特点,目标和优势将变得显而易见。
附图描述
图1显示拟南芥的Met1基因组DNA序列。下划线的核苷酸表示用于制备实施例1的反义核酸构建体的基因组序列的部分。
图2是在胞嘧啶DNA甲基转移酶中的某些特点的概略示意。
在各个图中的相同的标记符号指示相同的元件。
详细描述
在一个方面,本发明提供调节植物中种子表型的方法。调节种子表型包括在生物诸如玉米(Zea mays)或大豆(Glycine max)的雄性配子体特异性细胞或雌性配子体特异性细胞中转录和/或翻译胞嘧啶DNA甲基转移酶相关的核酸。因此,在一些实施方案中,胞嘧啶DNA甲基转移酶可以在植物的雄性配子体细胞中进行表达,并且来自这样的植物的花粉可以用于产生具有增加的种子重量的种子。在其它的实施方案中,内源胞嘧啶DNA甲基转移酶的转录或翻译在植物的雄性配子体细胞中被抑制,来自这样的植物的花粉可以用于产生种子重量减少的种子。
在其它的实施方案中,胞嘧啶DNA甲基转移酶可以在植物的雌性配子体细胞中进行表达,并且在授粉后,可以形成种子重量减少的种子。在其它的实施方案中,内源胞嘧啶DNA甲基转移酶的转录或翻译在植物的雌性配子体细胞中被抑制,并且在授粉后,可以形成种子重量增加的种子。
通过在雄性配子体细胞中的过量表达或在雌性配子体细胞中的表达不足来调节种子的表型
在雄性配子体细胞中的过量表达
在第一个方面,本发明包括容许第一植物给第二植物授粉由此在第二植物上产生种子。该第一植物包含重组核酸构建体,其包括编码胞嘧啶DNA甲基转移酶多肽的核酸,所述核酸可操作地与一个或多个调节元件连接,所述调节元件在雄性配子体细胞或组织中赋予表达。通过在特异性雄性配子体细胞类型中表达甲基转移酶多肽,可能的是在第一植物中调节基因表达(例如,通过使在正常情况下具有转录活性的基因失活)和当第一植物用于给第二植物授粉时获得一种或多种有益的种子表型。
适合于用在本发明中的胞嘧啶DNA甲基转移酶可以通过评价甲基转移酶的基因中的失去功能型的突变体的表型进行特征鉴定。这些突变体显示在配子体组织中的胞嘧啶残基的总体甲基化不足。此外,尽管重复序列的甲基化不足可以更适中,这些突变体显示在基因组的单拷贝和重复序列中的总体胞嘧啶甲基化的减少。这些突变体的存在指示野生型负体(counterpart)是适合于用在本文所述的方法和组合物中的胞嘧啶DNA甲基转移酶。
许多胞嘧啶DNA甲基转移酶多肽适合于用在本文所述的方法中。一个这样的多肽是由拟南芥Met1基因编码的多肽。编码拟南芥Met1 DNA胞嘧啶甲基转移酶的核苷酸序列显示在SEQ ID NO:29中。拟南芥MET1的Genbank接受号是AT5G49160。此外,具有显示在SEQ ID NO:44中的氨基酸序列的玉米胞嘧啶DNA甲基转移酶,和具有显示在SEQ ID NO:46中的氨基酸序列的水稻胞嘧啶DNA甲基转移酶也是有用的。
生物表
生物 | 生物ID |
玉米 | 311987 |
大豆 | 3847 |
小麦(Triticum aestivum) | 4565 |
可以以多种方式来鉴定其它适合的胞嘧啶DNA甲基转移酶多肽。例如,可以通过从纯性幼苗中制备核提取物并用半甲基化的(CpI)n底物和放射性标记的S-腺苷-甲硫氨酸温育来自抽提物的溶解的蛋白来筛选候选甲基转移酶以鉴定具有胞嘧啶DNA甲基转移酶活性的多肽。见,例如,Kakutani等,Nucleic Acids Res.93:12406-12411(1995)。可以通过用TaqI消化总基因组DNA并用放射性标记消化物中的5′末端胞嘧啶来测量基因组中的总胞嘧啶甲基化水平。接着,将标记的DNA消化成单核苷酸并使用薄层色谱法来估计甲基化和未甲基化胞嘧啶的量。见,例如Kakutani,等.,Nucleic Acids Res.93:12406-12411(1995)。可以从在被HpaII或MspI消化的基因组DNA的DNA印迹中观察到的消化图谱来估计单拷贝和重复序列的甲基化。见,Jeddeloh等,Plant J.9:579-586(1996)和Finnegan等,Proc.Natl.Acad.Sci.USA 93:8449-8454(1996)。适合的胞嘧啶DNA甲基转移酶具有相应的失去功能的突变体,其显示在配子体组织中的胞嘧啶残基的总的甲基化不足,在基因组的单拷贝序列中的总胞嘧啶甲基化中的减少,和重复序列的更适中的甲基化不足。针对已知甲基转移酶的抗体的共免疫沉淀测定也可以用于鉴定候选多肽。鉴定候选多肽的另一种方式是通过甲基转移酶突变体的功能性互补。
还可以通过分析核苷酸和多肽序列的对比来鉴定适合的甲基转移酶的候选物。例如,进行在核苷酸或多肽序列的数据库上的查询可以鉴定胞嘧啶DNA甲基转移酶的直向同源物。序列分析可以包括使用已知甲基转移酶氨基酸序列的非丰余数据库的BLAST或PSI-BLAST分析。在具有大于40%序列同一性的数据库中的那些蛋白质是用于进一步评价作为甲基转移酶的适合性的候选物。如果需要,可以进行对这些候选物的人工检查从而缩小待进一步评价的候选物的数量。可以通过选择似乎具有被怀疑出现在甲基转移酶中的结构域的那些候选物来进行人工检查。适合的候选物包括SEQ ID NOS:42和48。
可以如下确定相对于另一个“靶”核酸或氨基酸序列的任何主题核酸或氨基酸序列(例如,拟南芥胞嘧啶DNA甲基转移酶,或玉米胞嘧啶DNA甲基转移酶)的百分比同一性。首先,可以使用来自包含BLASTN和BLASTP的BLASTZ的卓越版(例如,2.0.14版)中的BLAST2序列(B12seq)程序来对主题核酸或氨基酸序列进行比较和对比。可以在<www.fr.com/blast>或www.ncbi.nlm.nih.Gov获得BLASTZ的卓越版。解释怎样使用BLASTZ,和具体地B12seq程序的指导可以见于BLASTZ随附的“自述文件”。该程序还可以在Karlin等,1990,Proc.Natl.Acad.Sci.87:2264;Karlin等,1990,Proc.Natl.Acad.Sci.90:5873;和Altschul等,1997,Nucl.Acids Res.25:3389中进行详细描述。
使用BLASTN(用于比较核酸序列)或BLASTP(用于比较氨基酸序列)的算法的任一,B12seq进行在主题序列和靶序列之间的比较。典型地,当进行氨基酸序列的对比时,使用BLOSUM62计分矩阵(scoring matrix)的缺省参数,11的缺口存在罚分(gap existence cost),和1的延伸罚分(extensioncost),3的字长(word size),10的预期值(expect value),1的每个残基罚分和0.85的λ比率。输出文件包括在靶序列和主题序列之间的同源性对比区域。一旦进行对比,通过计算来自靶序列的连续的核苷酸或氨基酸残基的数量(即,排除缺口)来确定长度,所述靶序列与主题序列的对比开始自任意匹配的位点并结束在任何其它的匹配位点。匹配的位点是其中相同的核苷酸或氨基酸残基出现在靶序列和主题序列两者中的任一位点。可以将一个或多个残基的缺口插入靶序列或主题序列从而使在结构保守的结构域之间的序列比对最大化(例如,α-螺旋,β-折叠和环)。
通过计算在特定长度内的匹配位点的数目,将该数目除以长度,并将得到的值乘以100来确定特定长度内的百分比同一性。例如,如果(i)将500个氨基酸靶序列与主题氨基酸序列进行比较,(ii)B12seq程序呈现来自与主题序列的区域对比的靶序列的200个氨基酸,其中该200个氨基酸区域的第一和最后一个氨基酸是匹配的,和(iii)在那些200个对比氨基酸中匹配的数目是180,那么该500个氨基酸的靶序列包含200的长度和在该长度内的90%的序列同一性(即,180÷200×100=90)。在一些实施方案中,适合的胞嘧啶DNA甲基转移酶的氨基酸序列与拟南芥Met1胞嘧啶DNA甲基转移酶的氨基酸序列具有大于40%(例如,>80%,>70%,>60%,>50%或>40%)的序列同一性。在其它的实施方案中,适合的胞嘧啶DNA甲基转移酶的氨基酸序列与在SEQ ID NO:44中显示的玉米胞嘧啶DNA甲基转移酶或在SEQ ID NO:46中显示的水稻胞嘧啶DNA甲基转移酶的氨基酸序列具有大于40%的序列同一性(例如,>80%,>70%,>60%,>50%或>40%)。在其它的实施方案中,适合的胞嘧啶DNA甲基转移酶多肽的氨基酸序列具有1500-1600个氨基酸的总长度(例如,1520-1565,1522-1564,1522,1525,1534,1545,1554,1559,1564,或1566;多肽的区域的长度在350-390个氨基酸(例如,350-375,350-380,360-380,370-375,或365-375或372),并且与在SEQ ID NO:50中提出的氨基酸序列具有大于40%的序列同一性(例如,>80%,>70%,>60%,>50%或>40%)。
将理解的是与主题序列对比的核酸或氨基酸靶序列可以导致许多不同的长度,其中每个长度具有其本身的百分比同一性。还将理解的是适合的核酸的长度可以取决于意欲的用途,例如作为全长编码序列,作为反义序列,或RNAi序列。注意的是,可以将百分比同一性值四舍五入为最接近的十分之一。例如将78.11,78.12,78.13,和78.14四舍五入为78.1,而将78.15,78.16,78.17,78.18,和78.19四舍五入为78.2。还要注意的是长度值将总是整数。
在模板,或主题物,多肽中的保守区域的鉴定可以促进同源性多肽序列分析。可以通过将区域定位于模板多肽的原始氨基酸序列来鉴定保守区域,其是重复序列,形成一些次级结构(例如,螺旋和β折叠),建立带正电荷或负电荷的结构域,或代表蛋白质基序或结构域。见,例如,描述多种蛋白质基序和结构域的共有序列的Pfam网点在http://www.sanger.ac.uk/Pfam/和http://genome.wustl.edu/Pfam/。被包括在Pfam数据库的信息的描述见于Sonnhammer等,1998,Nucl.Acids Res.26:320-322;Sonnhammer等,1997,Proteins 28:405-420;和Bateman等,1999,Nucl.Acids Res.27:260-262。从该Pfam数据库,蛋白质基序和结构域的共有序列可以与模板多肽序列进行对比从而确定保守区域。
还可以通过对比来自紧密相关的植物种类的相同的或相关多肽的序列来确定保守区域。紧密相关的植物种类优选地来自相同的家族。或者,使用来自都是单子叶植物或都是双子叶植物的植物种类的序列来进行对比。在一些实施方案中,来自两种不同的植物种类的序列的对比是足够的。例如,可以将来自油菜(canola)和拟南芥的序列用于鉴定一个或多个保守区域。
典型地,显示至少约35%的氨基酸序列同一性的多肽对于鉴定保守的区域是有用的。相关蛋白质的保守区域有时显示至少40%的氨基酸序列同一性(例如,至少50%,至少60%;或至少70%,至少80%,或至少90%的氨基酸序列同一性)。在一些实施方案中,靶多肽和模板多肽的保守区域显示至少92,94,96,98,或99%的氨基酸序列同一性。氨基酸序列同一性可以从氨基酸和核苷酸序列中进行推断。
技术人员将认识到在编码序列中改变,添加,或缺失单个氨基酸或小百分比的氨基酸的对多肽的单个置换,缺失或添加是“保守修饰的变体”,其中所述改变导致了以化学上类似的氨基酸对氨基酸的置换。提供功能上类似的氨基酸的保守性置换表是本领域众所周知的。下面的6个基团每个都包含彼此进行保守性置换的氨基酸:
1)丙氨酸(A),丝氨酸(S),苏氨酸(T);
2)天冬氨酸(D),谷氨酸(E);
3)天冬酰胺(N),谷氨酰胺(Q);
4)精氨酸(R),赖氨酸(K);
5)异亮氨酸(I),亮氨酸(L),甲硫氨酸(M),缬氨酸(V);和
6)苯丙氨酸(F),酪氨酸(Y),色氨酸(W)。
(见,例如,Creighton,Proteins(1984))。
用于适合的胞嘧啶甲基转移酶的区域的共有序列显示于序列表中。将某些符号用在共有序列中从而代表在某些氨基酸残基上的适合的置换并且从而表示在某些位点上的可接受的长度改变:
+=″带正电荷的″例如,H,K,R
a=″脂族的″例如I,L,V,M
t=″小的″例如T,G,A
r=″芳香族的″例如F,Y,W
n=″带负电荷的″例如E,D
p=″极性的″例如N,Q
<#-#>=氨基酸的专用#,任意类型
(X,Y)=一个氨基酸残基,X或Y
在某些情形中,适合的甲基转移酶可以在多肽的共有功能结构域和/或保守区域的基础上进行合成,所述多肽是同源的甲基转移酶。可以通过如上所述的同源性多肽序列分析来鉴定共有结构域和保守区域。用作胞嘧啶DNA甲基转移酶的这些合成性多肽的适宜性可以在基于它们对基因组甲基化状态作用的基础上,或通过在序列表中显示的玉米,水稻,或拟南芥胞嘧啶DNA甲基转移酶的功能性互补来进行评价。
结构域是在多肽中连续的氨基酸的类群,其可以用于鉴定蛋白质家族和/或蛋白质的部分。这些结构域具有“指纹图谱”或“特征”,其可以包含保守的(1)一级序列,(2)次级结构,和/或(3)三维构象。通常,每个结构域都与保守的一级序列或序列基序相关。通常,这些保守的一级序列基序与特异性的体外和/或体内的活性相关。结构域可以是任何长度,包括完整的待转录的多核苷酸。可以用于鉴定直向同源的胞嘧啶DNA甲基转移酶的结构域的实例包括,而不限于,甲基转移酶活性结构域,“真核生物”结构域,TS结构域,BAH结构域,富半胱氨酸的结构域,GK重复结构域,和PC重复结构域。见,图2。
在第一植物中的重组核酸构建体包含一个或多个与编码胞嘧啶DNA甲基转移酶的序列可操作连接的调节元件。调节元件可以包括启动子序列,增强子序列,应答元件,蛋白质识别位点,调节核酸序列表达的可诱导元件,启动子控制元件,蛋白质结合序列,5′和3′UTRs,转录起始位点,终止序列,聚腺苷酸化序列,在氨基酸编码序列诸如分泌信号中的内含子和某些序列,和蛋白酶裂解位点。用于本文时,“可操作的连接”指以关于容许或促进核酸转录和/或翻译的这样的方式来相对于核酸将调节元件定位于构建体中。将被包括的元件的选择取决于一些因素,包括,但不限于,复制效率,选择性,可诱导性,需要的表达水平,和细胞或组织特异性。
典型地,启动子位于待转录的序列的5′,和邻近该序列的转录起始位点的位置。启动子在编码序列的第一外显子的上游和多个转录起始位点的第一个的上游。在某些实施方案中,将启动子位于编码序列的第一外显子的ATG上游约3,000个核苷酸的位置。在其它实施方案中,启动子位于多个转录起始位点的第一个的上游约2,000个核苷酸处。本发明的启动子包括下面定义的至少一个核心启动子。此外,启动子还可以包括至少一个控制元件诸如上游元件。这些元件包括UTRs和任选地,其它的影响多核苷酸转录的DNA序列诸如合成的上游元件。
5′未翻译的区域(UTR)被转录,但是未被翻译,并且位于转录物的起始位点和翻译起始密码子之间并且包括+1核苷酸。3′UTR可以位于在翻译终止密码子和转录物末端之间的地方。UTRs可以具有特定的功能诸如增加的mRNA信息稳定性或翻译弱化作用。3′UTRs的实例包括,但不限于,聚腺苷酸化信号和转录终止序列。
在这些实施方案中,可以使用在雄性配子体细胞中优先驱动转录的调节元件,所述细胞例如小孢子母细胞,或小孢子,包括营养细胞和在营养细胞中的分裂和形成精子细胞的细胞。然而,优选的是在成熟的花粉核中没有观察到转录。此外,在授精后在胚胎或胚乳中来自调节元件的转录不是理想的。因此,优选在授精后快速减少在胚乳组织中的转录。适合的雄性生殖组织特异性启动子是拟南芥YP0180启动子(SEQ ID NO:8)。
细胞类型或组织特异型启动子有时被观察到驱动除靶组织之外的组织中的可操作连接的序列的表达。因此,用于本文时,细胞类型或组织特异型启动子是在靶组织中驱动优先表达的启动子,但是还可以导致在其它细胞类型或组织中的一些表达。鉴定和表征植物基因组DNA中的调节元件的方法包括,例如,在下面的参考文献中描述的那些:Jordano,等,PlantCell,1:855-866(1989);Bustos,等,Plant Cell,1:839-854(1989);Green,等,EMBO J.,7:4035-4044(1988);Meier,等,Plant Cell,3:309-316(1991);和Zhang,等,Plant Physio.,110:1069-1079(1996)。
在雌性配子体细胞中的表达不足
在另一个方面,本发明提供通过在雌性配子发生过程中减少基因组胞嘧啶甲基化的程度来调节植物中的种子表型的方法。在该方面,在杂交中被用作雌性的植物包含核酸构建体,所述核酸构建体包含雌性配子体组织特异性调节元件,所述调节元件与有效减少总体胞嘧啶DNA甲基化水平的核酸序列可操作地连接。用缺乏核酸序列的花粉对植物进行授粉,并且在该植物上形成的种子具有的平均种子重量与在相应的植物上形成的种子的平均种子重量比较,重量增加,所述相应的植物缺乏所述核酸序列。
在该方面,重组核酸构建体可以结合抑制或阻止内源胞嘧啶DNA甲基转移酶的转录和/或翻译的序列。例如,可以使用反义序列。适合的反义序列包括反义核酸,所述反义核酸覆盖编码拟南芥Met1的氨基酸764到1535的基因的部分,或编码氨基酸644到1535的基因的部分,或编码氨基酸485到1535的基因的部分。这些反义核酸分别是约2.3kb,2.7kb,和3.2kb。
此外,在某种意义上,包含内源基因的整个或部分拷贝的构建体可以导致内源基因表达的抑制。因此,所述构建体可以结合编码已经出现在植物中的甲基转移酶的基因的另外的拷贝,或部分拷贝,即,具有与内源胞嘧啶DNA甲基转移酶的有义编码序列相似或相同的序列的DNA,但是所述DNA被转录成未被多腺苷酸化的mRNA,缺乏5′帽结构,或包含不可剪接的内含子。在另一个备选中,该构建体可以结合编码核酶的序列。
在另一个备选中,该构建体可以包括被转录成干扰RNA的序列。见,例如,美国专利6,753,139;美国专利出版物20040053876;和美国专利出版物20030175783。这样的RNA可以是与另一个RNA退火以形成干扰RNA的RNA。这样的RNA还可以是可以与其本身退火的RNA,例如,具有茎-环结构的双链RNA。双链RNA的茎部分的一条链包括与内源胞嘧啶DNA甲基转移酶的有义编码序列相似或相同的序列,并且该序列的长度是约10个核苷酸到约4,500个核苷酸。在一些实施方案中,茎部分与编码序列的5′UTR序列相似或相同。在一些实施方案中,茎部分与编码序列的3′UTR序列相似或相同。与有义编码序列,5′UTR,或3′UTR相似或相同的该序列的长度是10个核苷酸到500个核苷酸,15个核苷酸到300个核苷酸,20个核苷酸到100个核苷酸,或25个核苷酸到100个核苷酸。在一些实施方案中,与有义编码序列,5′UTR或3′UTR相似或相同的序列的长度可以是25个核苷酸到500个核苷酸,25个核苷酸到300个核苷酸,25个核苷酸到1,000个核苷酸,100个核苷酸到2,000个核苷酸,300个核苷酸到2,500个核苷酸,200个核苷酸到500个核苷酸,1,000个核苷酸到3,000个核苷酸,或200个核苷酸到1,000个核苷酸。双链RNA的茎部分的另一条链包括内源胞嘧啶DNA甲基转移酶的反义序列,可以具有与相应的茎部分的互补链的长度相比,更短,相同或更长的长度。双链RNA的环部分可以是10个核苷酸到5,000个核苷酸,例如15个核苷酸到1,000个核苷酸,20个核苷酸到500个核苷酸,或25个核苷酸到200个核苷酸。RNA的环部分可以包括内含子。见,例如WO 99/53050。
为了获得雌性配子体特异性表达,使用在雌性配子体组织中优先驱动转录的调节元件,诸如胚囊启动子。大多数适合的调节元件是在极核或中央细胞,或在极核的前体中,但是不在卵细胞或卵细胞的前体中优先驱动转录的调节元件。其转录模式从极核延伸到早期胚乳组织发育的调节元件也是可接受的,尽管最优选在授精后快速减少在胚乳组织中转录的调节元件。不优选在合子中或发育中的胚胎中的表达。
可以是适合的雌性生殖组织启动子包括来自下列基因的那些:玉米MAC1(见,Sheridan(1996)Genetics,142:1009-1020);玉米Cat3(见,GenBank号L05934;Abler(1993)Plant Mol.Biol.,22:10131-1038);拟南芥viviparous-1(见,Genbank号U93215);拟南芥atmycl(见,Urao(1996)PlantMol.Biol.,32:571-57;Conceicao(1994)Plant,5:493-505)。
其它的雌性配子体组织启动子包括来自下列基因的那些:拟南芥Fie(GenBank号AF129516);拟南芥Mea;和拟南芥Fis2(GenBank号AF096096);胚珠BEL1(Reiser(1995)Cell,83:735-742;Ray(1994)Proc.Natl.Acad.Sci.USA,91:5761-5765;GenBank号U39944);和拟南芥DMC1(见,GenBank号U76670)。
示范性的雌性配子体组织特异性启动子包括下列拟南芥启动子:YP0039(SEQ ID NO:10),YP0101(SEQ ID NO:11),YP0102(SEQ ID NO:6),YP0110(SEQ ID NO:9),YP0117(SEQ ID NO:7),YP0119(SEQ ID NO:12),YP0137(SEQ ID NO:13),DME启动子(SEQ ID NO:15),YP0285(SEQ ID NO:22)和YP0212(SEQ ID NO:14).
可以用在单子叶植物,诸如水稻中的启动子包括下列启动子:Y678g10p3(SEQ ID NO:20),p756a09p3(SEQ ID NO:21),Y790g04p3(SEQ ID NO:23),p780a10p3(SEQ ID NO:24),Y730e07p3(SEQ ID NO:26),Y760g09p3(SEQ ID NO:27),p530c10p3(SEQ ID NO:19),p524d05p3,(SEQ ID NO:18)p523d11p3(SEQ ID NO:17)和p472e10p3(SEQ ID NO:16)。
种子表型
显示如上所述的被调节的基因表达的生物可以在授粉后用于产生种子。这些种子相对于缺乏或不表达甲基转移酶多肽的生物而言,可以具有表型改变。例如,这些被调节的基因表达可以改变下列种子表型中的一种或多种:种子产量,种子组成,胚乳发育,胚胎发育,子叶发育,种子大小,种子发育时间,幼苗生长速率,或种子能育性。典型地,在成熟的种子上,以干重基础来测量表型诸如种子产量,种子组成,种子大小和种子重量。
当来自显示这样的表达的植物的花粉被用作杂交中的传粉者时,在雄性配子体细胞类型中的胞嘧啶DNA甲基转移酶多肽的表达可以导致平均种子重量增加约10%到约50%,例如,约10%到约40%,或约10%到约30%,或约10%或约20%,或约15%到约30%,或约15%到约25%。类似地,当内源胞嘧啶DNA甲基转移酶多肽的表达在雌性配子体细胞类型中被抑制,并且这样的植物在杂交中被用作雌性时,观察到约相同量级的平均种子重量的增加。
典型地,认为植物相对于相应的对照植物的表型诸如种子重量的差异在p≤0.05时具有统计学上的意义,其中进行了适当的参数或非参数统计,例如卡方检验,student’s t检验,Mann-Whitney检验,或F检验。在一些实施方案中,在p<0.01,p<0.005,或p<0.001时,差异才具有统计学意义。例如,在来自转基因测试植物的种子的种子重量与来自非转基因对照植物的种子的种子重量比较的统计学上的显著差异显示在该测试植物中存在的重组核酸改变了种子重量。
将理解的是,在杂交中的亲本都具有胞嘧啶DNA甲基转移酶的被调节的表达,并且由此获得了与其中仅有一个亲本植物具有被调节的甲基转移酶表达的杂交相比,种子表型更大的改变。因此,首先,传粉者植物可以在雄性配子体细胞中显示胞嘧啶DNA甲基转移酶的过量表达。第二,具种子的植物可以具有在雌性配子体细胞中被抑制的内源胞嘧啶DNA甲基转移酶的转录或翻译。在由第一植物授粉后,在第二植物上形成的种子具有与相应的第一和第二植物相比,增加的种子重量,所述相应的第一和第二植物分别不显示胞嘧啶DNA甲基转移酶的过量表达或抑制。这些种子的实例是雌性玉米植物与雄性玉米植物杂交的后代,所述雌性玉米植物含有重组核酸构建体,所述构建体包括与胞嘧啶DNA甲基转移酶序列可操作连接的YP0102a启动子,其通过RNAi机制减少甲基转移酶的活性的量,所述雄性玉米植物包含重组核酸构建体,所述构建体包括与全长的胞嘧啶DNA甲基转移酶编码序列可操作连接的雄性配子体启动子,其导致甲基转移酶的过量表达。
通过在雄性配子体细胞中表达不足或在雌性配子体细胞中的过量表达来调节种子表型
在雄性配子体细胞中的表达不足
在另一个方面中,本发明提供用于产生植物种子的方法,所述植物种子具有一种或多种改变的种子表型。所述方法包括容许第一植物给第二植物授粉的步骤。第一植物包含重组核酸构建体,所述重组构建体包括与有效减少胞嘧啶DNA甲基化水平的核酸序列可操作连接的一个或多个雄性配子体组织特异性调节元件。在授粉后,在第二植物上发育的种子具有的平均种子重量与在相应的植物上发育的种子的平均种子重量相比,重量减少,所述相应的植物由缺乏该核酸序列的植物授粉。本文描述了适合的雄性配子体细胞特异性调节元件。本文还描述了有效减少胞嘧啶DNA甲基化水平的核酸,并且该核酸包括反义序列,干扰RNA序列和核酶序列。
在雌性配子体细胞中的过量表达
在另一个方面,用于产生种子的方法可以包括容许包含重组核酸构建体的植物的授粉,所述核酸构建体包括与有效增加胞嘧啶DNA甲基化水平的核酸序列可操作连接的雌性配子体组织特异性调节元件。用于授粉的花粉缺乏这样的核酸序列。在这样的植物上发育的种子的平均种子重量与在相应的植物上发育的种子的平均种子重量相比,重量减少,所述相应的植物缺乏或不表达该核酸序列。本文描述了适合的雌性配子体细胞特异性调节元件。本文还描述了有效增加胞嘧啶DNA甲基化水平的核酸,并且该核酸包括本文所述的胞嘧啶DNA甲基转移酶的编码序列。
种子表型
显示如上所述的被调节的基因表达的生物可以在授粉后用于产生种子。这些种子相对于缺乏或不表达甲基转移酶多肽的生物而言,可以具有表型改变。例如,这些被调节的基因表达可以改变下列种子表型中的一种或多种:种子产量,种子组成,胚乳发育,胚胎发育,子叶发育,种子大小,种子发育时间,或种子能育性。典型地,在成熟的种子上,以干重基础来测量表型诸如种子产量,种子组成,种子大小和种子重量。
当来自显示这样的表达的植物的花粉被用作杂交中的传粉者时,在雄性配子体细胞类型中的内源胞嘧啶DNA甲基转移酶多肽的表达的抑制可以导致平均种子重量减少约10%到约50%,例如,约10%到约40%,或约10%到约30%,或约10%或约20%,或约15%到约30%,或约15%到约25%。类似地,当胞嘧啶DNA甲基转移酶多肽在雌性配子体细胞类型中进行表达,并且这样的植物在杂交中被用作雌性时,观察到约相同量级的平均种子重量的减少。
典型地,认为植物相对于相应的对照植物的表型诸如种子重量的差异在p≤0.05时具有统计学意义,其中进行了适当的参数或非参数统计,例如卡方检验,student’s t检验,Mann-Whitney检验,或F检验。在一些实施方案中,在p<0.01,p<0.005,或p<0.001时,差异才具有统计学意义。例如,在来自转基因测试植物的种子的种子重量与来自非转基因对照植物的种子的种子重量比较的统计学上的显著差异显示在该测试植物中存在的重组核酸改变了种子重量。
将理解的是,在杂交中的亲本都具有胞嘧啶DNA甲基转移酶的被调节的表达,并且由此获得了与其中仅有一个亲本植物具有被调节的甲基转移酶表达的杂交相比,种子表型更大的改变。因此,首先,传粉者植物可以在雄性配子体细胞中抑制内源胞嘧啶DNA甲基转移酶的转录或翻译。第二,具有种子的植物可以在雌性配子体细胞中表达胞嘧啶DNA甲基转移酶。在由第一植物授粉后,在第二植物上形成的种子具有与相应的第一和第二植物相比,减少的种子重量,所述相应的第一和第二植物分别不显示胞嘧啶DNA甲基转移酶的抑制或过量表达。
编码甲基转移酶的核酸
本发明还包括编码胞嘧啶DNA甲基转移酶多肽的核酸,与胞嘧啶DNA甲基转移酶具有同源性的核酸,例如胞嘧啶DNA甲基转移酶的反义序列,胞嘧啶DNA甲基转移酶的核酶序列或胞嘧啶DNA甲基转移酶的干扰RNA序列。用于本文时,核酸指RNA或DNA,包括cDNA,合成DNA或基因组DNA。该核酸可以是单或双链,并且如果是单链,可以是编码或非编码链的任一。关于核酸用于本文时,“分离的”指(i)编码本发明的多肽的部分或全部的天然存在的核酸,但是其不包含这样的序列,即在正常情况下位于基因组中的编码多肽的核酸的一侧或两侧的侧面的编码序列;(ii)结合到载体中或结合到生物的基因组DNA中的核酸从而使得到的分子与任何天然存在的载体或基因组DNA不相同;或(iii)cDNA,基因组核酸片段,由聚合酶链式反应(PCR)产生的片段,或限制性酶切片段。具体而言,被排除于该定义的是存在于核酸分子或细胞的混合物中的核酸。
适合的核酸的实例包括在序列表中显示的编码鼠耳芥(Arabidopsisthaliana),稻(Oryza sativa),玉米(Zea mays)胞嘧啶-5DNA甲基转移酶的核酸。在Genbank接受号AF063403和AC093713中描述了示范性核酸。然而,应该理解的是,本文公开的具有除了特定核苷酸序列之外的核苷酸序列的核酸仍旧可以编码具有例示的氨基酸序列的多肽。遗传密码子的兼并性是本领域众所周知的;即对于许多氨基酸而言,存在超过一种的作为氨基酸的密码子的核苷酸三联体。
重组核酸构建体可以包含除本文所述的其它序列的克隆载体序列。适合的克隆载体序列是可商购的,并且是普通技术人员常规使用的。本发明的核酸构建体还可以包含编码其它多肽的序列。这些多肽可以,例如,促进将核酸构建体引入宿主生物或维持在其中。其它的多肽还可以影响被编码的甲基转移酶的表达,活性,或生化或生理作用。备选地,可以在单独的核酸构建体上提供其它的多肽编码序列。
可以通过例如,DNA合成或聚合物酶链式反应(PCR)来获得编码胞嘧啶DNA甲基转移酶的核酸。PCR指一种方法或技术,其中对靶核酸进行扩增。PCR可以用于从DNA以及RNA中扩增特异性的序列,包括来自总基因组DNA的序列,或来自总细胞RNA的序列。例如,在PCR Primer:ALaboratory Manual,Dieffenbach,C.& Dveksler,G,Eds.,Cold SpringHarbor Laboratory Press,1995描述了各种PCR方法。通常,来自目标区域或除此之外的区域的末端的序列信息用于设计寡核苷酸引物,所述引物与待扩增的模板的相对链的序列相同或相似。可获得各种PCR策略,通过这些策略可以将位点特异性的核苷酸序列修饰引入模板核酸。
核酸的检测可以通过方法诸如琼脂糖凝胶的溴化乙锭染色,DNA或RNA印迹杂交,PCR或原位杂交来进行。杂交典型地包括DNA或RNA印迹法(见,例如,Sambrook等,1989,“Molecular Cloning,A LaboratoryManual”,第二版,Cold Spring Harbor Press,Plainview;NY的9.37-9.52部分)。探针应该在高严格条件下与核酸或其互补序列杂交。高严格条件可以包括,使用低离子强度和高温度洗涤,例如在65℃的0.015MNaCl/0.0015M柠檬酸钠(0.1X SSC),0.1%十二烷基硫酸钠(SDS)。此外,可以将变性剂,诸如甲酰胺在高严格杂交中进行使用,例如在42℃的50%甲酰胺与0.1%的牛血清白蛋白/0.1% Ficoll/0.1%聚乙烯基吡咯烷酮/pH6.5的50mM磷酸钠缓冲液与750mM NaCl,75mM柠檬酸钠。
真核生物
术语“宿主”或“宿主细胞”不仅包括原核生物,诸如大肠杆菌(E.coli),还包括真核生物,如真菌、昆虫、植物和动物细胞。动物细胞包括,例如COS细胞和HeLa细胞。真菌细胞包括酵母细胞,如酿酒酵母(Saccharomyces cereviseae)细胞。可以利用本领域那些普通技术人员已知的技术,如磷酸钙或醋酸锂沉淀、电穿孔、脂转染和微粒轰击,用DNA分子(例如,载体)转化或转染宿主细胞。含有载体的宿主细胞可以用于这样的目的如增殖载体,产生核酸(例如DNA或干扰RNA)或表达多肽或其片段。
植物
本发明中描述的真核生物是含有本文所述重组核酸构建体的植物,所述核酸构建体例如可操作性连接到雄性配子体特异性调节元件或雌性配子体特异性调节元件上的胞嘧啶DNA甲基转移酶编码序列或干扰RNA序列。
在上述方法中用作亲本的植物可以是对于重组构建体杂合或纯合的。不过,当核酸构建体编码胞嘧啶DNA甲基转移酶多肽时,使用对于构建体纯合的植物能够导致种子表型的改变,其比当使用杂合植物时获得的改变量级更大。另一方面,当所述核酸构建体编码核酸诸如反义序列、干扰RNA序列,或核酶时,杂合的植物经常能够导致种子表型的改变,其与用纯合植物观察到的改变一样大。
在另一方面,本发明以处理植物的方法为特征,所述方法包括将重组核酸构建体引入植物细胞中。将外源核酸引入单子叶植物和双子叶植物中的技术是本领域已知的,包括,而不限于,农杆菌介导的转化、病毒性载体介导的转化、电穿孔和基因枪转化,例如,美国专利5,204,253和6,013,863。如果细胞或组织培养物被用作转化的受体组织,可以通过本领域技术人员已知的技术由转化的培养物再生出植物。可以将转基因植物加入到培育程序中,例如,将编码多肽的核酸导入其它品系中,将核酸转移到其它物种中或用作进一步选择其它的所需特性。或者,可以对适用于这些技术的那些物种进行无性繁殖。后代包括特定植物或植物品系的后代。本发明植物的后代包括在F1,F2,F3上形成的种子,以及在随后世代的植物,或在BC1,BC2,BC3上形成的种子,以及在后世代的植物。由转基因植物产生的种子可以生长并且随后自交(或异型杂交和自交)以获得对于重组核酸构建体纯合的种子。
用来实施本发明的适合植物组包括双子叶植物,诸如红花,苜蓿,大豆,油菜籽(高芥子酸和油菜),或向日葵。单子叶植物如玉米,小麦,黑麦,大麦,燕麦,水稻,粟(millet),苋属植物或高粱属植物也是适合的。蔬菜作物或根作物如马铃薯,西瓜,椰菜,豌豆,甜玉米,爆裂种玉米(popcorn),西红柿,豆类(包括云豆,利马豆,干豆,绿豆)等也是适合的。水果作物如桃、梨、苹果、樱桃、橙、柠檬、葡萄柚(grapefruit)、李子、芒果和棕榈也是适合的。因而,本发明对于广泛的植物具有用途,包括来自腰果属(Anacardium),落花生属(Arachis),天门冬属(Asparagus),颠茄属(Atropa),燕麦属(Avena),芸苔属(Brassica),柑桔属(Citrus),西瓜属(Citrullus),辣椒属(Capsicum),红蓝花属(Carthamus),椰子属(Cocos),咖啡属(Coffea),香瓜属(Cucumis),南瓜属(Cucurbita),胡萝卜属(Daucus),油棕属(Elaeis),花菱草属(Eschscholzia),草莓属(Fragaria),大豆属(Glycine),棉属(Gossypium),向日葵属(Helianthus),Heterocallis,大麦属(Hordeum),天仙子属(Hyoscyamus),莴苣属(Lactuca),亚麻属(Linum),黑麦草属(Lolium),羽扇豆属(Lupins),番茄属(Lycopersicon),苹果属(Malus),木薯属(Manihot),Majorana,苜蓿属(Medicago),烟草属(Nicotiana),木犀榄属(Olea),稻属(Oryza),黍属(Panicum),Pannesetum,罂粟属(Papaver),鳄梨属(Persea),菜豆属(Phaseolus),松属(Pinus),Pistachia,豌豆属(Pisum),梨属(Pyrus),李属(Prunus),萝卜属(Raphanus),蓖麻属(Ricinus),黑麦属(Secale),千里光属(Senecio),白芥属(Sinapis),茄属(Solanum),高粱属(Sorghum),Theobromus,胡卢巴属(Trigonella),小麦属(Triticum),野豌豆属(Vicia),葡萄属(Vitis),豇豆属(Vigna)和玉蜀黍属(Zea)的物种。在液体培养基或在半固体培养基上培养的细胞和组织也是适合的。
改变植物种子表型,例如,增加或减小种子重量的能力,能够向农业生产者和向消费者提供利益。例如,平均种子重量的提高能够导致来自收获农作物的总体产量或收获指标提高,由此向农民提供经济利益。而且,平均种子重量的提高能够导致每平方英亩特殊种子成分(specialty seedcomponent)的更大收获,由此提供更大的田地使用效率。例示性的特殊种子成分包括药物,生物碱,类萜,抗体,特殊的淀粉,特殊的油,特殊的蛋白质,和营养药(nutraceuticals)诸如甾醇。相反地,使用本文公开的方法实现平均种子重量的减少能够导致水果或蔬菜作物由于较小的种子而被消费者优先选择。
种子组合物
在另一方面,本发明以一种植物种子组合物为特征,其包含至少两种类型的种子。所述两种类型可以是种群(例如,人造的种群),品系,近交系,杂种,或商业变种。人造的种群是这样一组个体植物,其成员是多亲本交配方案的后代,从而使得所述组作为一个整体代表了所有亲本的等位基因频率。见,例如,美国专利6,320,106。在组合物中每种类型的比例被测量为特定类型种子数除以组合物中种子总数,并且根据需要可以进行配制以符合基于地理位置、所需成熟度等的需求。第一类型的比例可以从大约80%到约99.9%,例如,80%,85%,90%,91%,92%,93%,94%,95%,96%,97%,98%,或99%。第二类型的比例可以从大约0.1%到大约20%,例如,0.5%,1%,2%,3%,4%,或5%。如果第三类型存在于组合物中,第三类型的比例可以从约0.1%到约5%,例如,0.5%,1%,2%,3%,4%,或5%。当配制大量的种子组合物时,或当对相同组合物进行重复配制时,样品中每种类型的比例可能有一些变化。取样误差是统计学上已知的。在本发明中,这些取样误差一般为预测比例的约±5%,例如,90%±4.5%,或5%±0.25%。能够以大约35千克(kg)或更多,大约100kg或更多,大约1,000kg或更多,大约10,000kg或更多,或大约50,000kg或更多的量配制种子组合物。在一些实施方案中,植物种子组合物还包含其它类型,例如,大约0.1到大约5%的第三类型的种子。
由第一类型的种子生长的植物能够在雄性配子体细胞中过量表达胞嘧啶DNA甲基转移酶。由第二类型的种子生长的植物可以或不可以具有一种重组核酸构建体,其在雌性配子体细胞中抑制胞嘧啶DNA甲基转移酶的表达。
例如,本发明的种子组合物可以由两种玉米杂种制得。第一玉米杂种可以构成组合物中90%的种子并且具有构建体,所述构建体包含可操作地连接到有效减少总胞嘧啶DNA甲基化作用水平的核酸序列上的雌性配子体组织特异性调节元件。如果需要,所述第一玉米杂种可以是雄性不育的。第二玉米杂种可以构成组合物中10%的种子并且具有在雄性配子体组织中表达胞嘧啶DNA甲基转移酶的构建体。或者,两种杂种的其中之一不包含本文所述的核酸构建体。在培养了这些组合物的其中之一后,来自第二杂种的花粉将对第一杂种的穗进行授粉,导致对于组合物的所有植物的收获的作物种子重量提高。制备和培养两种种子类型的其它技术在U.S.5,004,864中进行了描述并且对本文所述的方法可以采用这些技术及其改进。还见,U.S.5,706,603。
一般,通过本领域已知的方法将每种类型的种子的基本上均一的混合物在包装材料中进行保存和包装以形成一项制品。这种种子袋优选地具有包装标记附于袋上,例如,系在包装材料上的标签或标记,印在包装材料上的标记或插入袋中的标记。所述包装标记表明其中的种子是多种类型,例如,两种不同类型的混合物。所述包装标记可以表明生长自这些种子的植物相对于相应的对照植物产生具有提高的种子重量的收获作物。
在本发明种子组合物中的类型一般具有相同或非常相似的成熟度,即,相同或非常相似的从萌发到作物种子成熟的天数。然而,在一些实施方案中,本发明的种子组合物中的一种或多种类型与组合物中的其它类型相比可以具有不同的相对成熟度,即,组合物中一种类型的从萌发到成熟种子的天数与组合物中另一种类型的天数在统计学上显著不同。
在下列实施例中对本发明作进一步描述,所述实施例并不限制本发明的范围。
实施例
实施例1:反义拟南芥甲基转移酶构建体
基于图1中所示的拟南芥基因组DNA序列的下划线部分,制备拟南芥Met1胞嘧啶DNA甲基转移酶基因组序列的反义核酸。所述反义核酸的长度为大约2.7kb;其序列显示于序列表中。
利用包含左和右农杆菌T-DNA边界的载体制成Met1反义核酸构建体。将2.7kb Met1反义片段可操作地连接到来自FIE的启动子,其优先驱动在胚囊发育期间于雌性配子体组织中的转录,并且插在T-DNA边界之间。所述启动子的序列显示于SEQ ID NO:5中。还见,美国专利出版物20030126642。所述启动子促进极核,中央细胞和胚乳发育早期部分中的表达,但不驱动卵细胞、合子或雄性配子体组织中可检测的表达。所述反义片段还可操作性地连接到nos 3′末端序列。命名为pRP:Met1 a/s的构建体,也在左和右T-DNA边界之间包含一条可选择的标记基因。
实施例2:含有拟南芥甲基转移酶反义构建体的转基因植物分析
下列符号被用在实施例中除非另外指定:T1:第一代转化体;T2:第二代,自花授粉的T1植物的后代,T3:第三代,自花授粉的T2植物的后代;T4:第四代,自花授粉的T3植物的后代。
通过基本上如在Bechtold,N.等,C.R.Acad.Sci.Paris,316:1194-1199(1993)中所述的植物浸渍法将实施例1的pRP:Met 1a/s反义构建体引入Arabidopsis Columbia中。回收了二十三种独立的转化体。将T1种子萌发并让其自花授粉。在14种转化体中,T2种子的大小是野生型的,在一些或许多长角果中具有败育的胚珠。在这14种转化体中的一种中,一些T2种子是白色的。
在9种转化体中,T2种子的大小可以是野生型的,也可以更大。一些长角果具有败育的种子。将来自这9种转化体每一种的T2种子的样品萌发并通过PCR分析来分析pRP:Met 1a/s构建体的存在。所述9种转化体中的八种被发现以预期的3∶1比例对于pRP∶Met 1a/s构建体发生分离,表示构建体插入在单一位置上。将单一位置转化体培养到成熟并让其自花授粉。对来自所述8种转化体每一种的200T3种子的三个重复样品进行称重。所述8种转化体的5种的平均T3种子重量高于野生型Columbia植物的平均种子重量。
将来自所述8种单一位点转化体的T3种子萌发并让得到的植物进行自花授粉。测量T3植物上的长角果并收集和测量成熟T4种子。源自T2植物#23和T1转化事件#34的十种纯合T3植物的结果,以及源自T2植物#20和T1转化事件#34的五种纯合T3植物的结果显示于表1中。
源自T2植物#23和T1转化事件#32的十种纯合T3植物的结果,以及源自T2植物#13和T1转化事件#32的五种纯合T3植物的结果显示于表2中。
表1
来自事件#34的两种T3纯合子的T4种子的分析
野生型(Col) | #23(10种植物) | #20(5种植物) | |
表型 | |||
平均种子重量±SE(μg/种子) | 23.00±0.273(n=10) | 26.47±0.498(n=10) | 26.88±0.412(n=5) |
最小种子重量 | 21.52 | 24.62 | 25.93 |
最大种子重量 | 23.97 | 29.07 | 28.03 |
P值(种子重量) | -- | 2.218E-05 | 3.055E-05 |
长角果长度±SE | 14.3±0.13(n=30) | 14.5±0.12(n=30) | 14.9±0.19(n=15) |
(mm) | |||
可见的种子数/长角果±SE | 57.4±0.92(n=30) | 52.5±0.95(n=30) | 56.4±1.07(n=15) |
败育的种子数/长角果±SE | 0.6±0.0.27(n=30) | 0.4±0.18(n=30) | 0.0±0.07(n=15) |
败育的% | 0.9±0.43% | 0.7±0.34% | 0±0.1% |
表2
来自事件#32的两种T3纯合子的T4种子的分析
表型 | 野生型(Col) | #23(10种植物) | #13(5种植物) |
平均种子重量±SE(μg/种子) | 22.44±0.180(n=10) | 25.99±0.193(n=10) | 26.51±0.429(n=5) |
最小种子重量 | 21.28 | 25.10 | 25.33 |
最大种子重量 | 23.07 | 26.94 | 27.87 |
P值(种子重量) | 8.14E-11 | 1.10E-07 | |
长角果长度±SE(mm) | 15.3±0.22(n=30) | 15.9±0.20(n=30) | 16.2±0.24(n=15) |
可见的种子数/长角果±SE | 63.3±1.52(n=30) | 61.6±1.56(n=30) | 67.1±1.56(n=15) |
败育的种子数/长角果±SE | 0.3±0.30(n=30) | 0.2±0.15(n=30) | 0.7±0.33(n=15) |
败育的% | 0.5±0.50%(n=30) | 0.3±0.30%(n=30) | 1.2±0.53%(n=15) |
结果显示对于事件#34的后代来说,T4世代种子中的平均种子重量分别提高了15.1%和16.9%。结果显示对于事件#32的后代来说,T4世代种子中的平均种子重量分别提高了15.8%和18.1%。
实施例3:拟南芥甲基转移酶有义构建体
构建含有全长拟南芥Met1甲基转移酶编码序列的核酸。所述核酸长度为大约4.5kb。通过在有义方向将4.5kb Met1核酸可操作地连接到启动子上而形成Met1有义核酸构建体,所述启动子优先在胚囊发育期间于雌性配子体组织中驱动转录。所述启动子促进极核,中央细胞和胚乳发育早期部分中的表达,但不驱动卵细胞、合子或雄性配子体组织中可检测的表达。所述启动子还在胚乳发育的早期部分中驱动表达。所述有义构建体被命名为pRP:Met1s。
实施例4:包含拟南芥甲基转移酶有义构建体的转基因植物的分析
通过基本上如在Bechtold,N.等,C.R.Acad.Sci.Paris,316:1194-1199(1993)中所述的植物浸渍法将实施例3的pRP:Met1s构建体引入拟南芥Wassilewskija(WS)中。回收了十一种独立的转化体。培养T1转化体并让其自花授粉。三种所述转化体产生了T2长角果,其具有野生型种子、小种子和一些败育的胚珠。将来自事件#1的T2种子萌发并让得到的植物自花授粉。测量T2植物上的长角果并收集和测量成熟的T3种子。来自T1转化体,事件#1其中一种的成熟T3种子被观察成为两种类别,看起来具有正常大小的种子和看起来具有更小大小的种子。对两种类型的种子的样品进行分析并将结果显示于表3中。
表3
来自事件#1的T3种子的分析
事件#1的T3种子的重量 | |||
表型 | 野生型(Ws) | I类 | #1 II类 |
平均种子重量±SE(μg/种子) | 20.33±0.329(n=5) | 20.35±0.297(n=5) | 13.75±0.477(n=5) |
最小种子重量 | 19.33 | 19.73 | 12.45 |
最大种子重量 | 21.38 | 21.38 | 15.10 |
P值(种子重量) | 0.959 | 3.25202E-06 | |
长角果长度±SE(mm) | 15.5±0.24(n=15) | 14.9±0.35(n=15) | 11.5±0.45(n=15) |
可见的种子数/长角果±SE | 60.1±1.91(n=15) | 60.4±2.62(n=15) | 47±1.31(n=15) |
败育的种子数/长角果±SE | 2.3±0.76(n=15) | 2.4±0.71(n=15) | 0.6±0.62(n=15) |
败育的% | 1.9±0.82%(n=15) | 2.6±1.17%(n=15) | 1.5±1.54%(n=15) |
结果表明II类种子具有比对照W/S种子小32.5%的平均重量。
实施例5:拟南芥甲基转移酶反义构建体
将实施例1的2.7kb反义核酸可操作地连接到拟南芥DME启动子核酸上。DME启动子的核苷酸序列显示于Kinoshita等,Proc.Natl.Acad.Sci.98:14156-14161(2001)中。将DME:Met1a/s构建体引入如在Bechtold,N.等,C.R.Acad.Sci.Paris,316:1194-1199(1993)中所述的拟南芥栽培品种WS中。将成熟T1种子萌发并让其自花授粉。来自独立转化体的成熟T2种子被观察到分成两种类别,看起来具有正常大小的种子和看起来具有更大大小的种子。将每种类别的T2种子萌发并让其自花授粉。对T3种子分析平均种子重量和DME:Met1 a/s转基因的存在。
实施例6:转基因拟南芥种子的组合物
收集来自实施例2中所述的纯合植物(#34-20和#34-23)的T3种子和来自#34-20和#34-23的两种子代植物(#34-20-10,#34-20-13,#34-23-04和#34-23-06)的T4种子。相对于由品系#34-16-04收集的非转基因T4分离种子中的水平,测量每批种子中82种化合物的水平。进行分析的化合物为:L-丙氨酸,甘氨酸,L-缬氨酸,L-亮氨酸,L-异亮氨酸,L-丝氨酸,L-脯氨酸,L-苏氨酸,高丝氨酸,反式-4-L-羟脯氨酸,L-天冬氨酸,L-蛋氨酸,L-半胱氨酸,L-谷氨酸,L-谷氨酰胺,L-苯丙氨酸,L-天冬酰胺,L-鸟氨酸,L-赖氨酸,L-组氨酸,L-色氨酸,DL-乳酸,羟基乙酸,丙酮酸,草酸,磷酸,甘油酸,苯甲酸,延胡索酸,琥珀酸,柠苹酸,苹果酸,2-羟基苯甲酸,核糖酸-γ-内酯,α-酮戊二酸,奎尼酸,莽草酸,柠檬酸,异柠檬酸,3-磷酸甘油酸,葡萄糖酸,木糖/阿拉伯糖,岩藻糖,果糖、甘露糖、半乳糖、葡萄糖、蔗糖、麦芽糖、海藻糖、异麦芽糖、gycerol、核糖醇、木糖醇/阿拉伯糖醇、甘露醇、环己六醇、麦芽糖醇,十一烯酸、辛酸(C8:0),癸酸(C10:0),月桂酸(C12:0),肉豆蔻酸(C14:0),棕榈酸(C16:0),硬脂酸(C18:0),油酸(C18:1),亚油酸(C18:2),亚麻酸(C18:3),二十二烷酸(C22:0),二十四烷酸(C24:0),L-十四烷醇,十六烷醇,L-十八烷醇,L-二十二烷醇,L-二十八烷醇,L-三十烷醇,角鲨烯,胆甾醇,豆甾烷醇,谷甾醇和菜油甾醇。
从每批种子中以两次重复或三次重复进行抽提以产生重复样品进行GC-MS分析。相对于内部标准和相对于对照水平进行标准化的数据的检查显示含有pRP:Met1 a/s构建体的种子的组合物与对照种子对于82种化合物中的80种而言基本上无差别。相对于对照种子来自#34-23-04,#34-23-06和#34-20-10植物的T4种子的亚油酸和亚麻酸含量减少。相对于对照种子来自#34-20-13植物的T4种子的亚油酸和亚麻酸含量有极微弱的减少。在亲本#34-23或#34-20T3种子中没有观察到亚油酸或亚麻酸含量的减少。
实施例7:包含拟南芥甲基转移酶RNAi构建体的转基因植物的分析
通过将CaMV35S启动子可操作地连接到有效转录成干扰RNA的序列上来形成RNAi构建体。所述RNAi序列包含有义方向上的大约2.7kb的拟南芥Met1序列以及nos终止子序列的反向重复。利用标准分子生物学技术形成所述构建体。见,Brummell等,Plant J.,33:793-800(2004)。将所述构建体插入到载体中,所述载体包含赋予对除草剂Basta的抗性的可选择标记基因。
通过实施例2中所述的农杆菌介导的方法将RNAi构建体载体导入拟南芥中。在对Basta抗性进行选择后再生出八种独立的T1植物,让所述植物进行自花授粉。对来自T1植物的无性组织分析内源Met1转录物的量。作为对照,空白RNAi载体也被引入拟南芥中,所述空白RNAi载体中将CaMV35S启动子可操作性地连接到反向nos终止子序列,并且在发育的相同阶段对来自对照植物的无性组织进行分析。结果显示T1植物中的内源转录物水平范围为对照量的15%到58%。
实施例8:包含水稻甲基转移酶RNAi构建体的转基因植物的分析
下列符号用于本实施例中:T0:由转化的组织培养物再生的植物;T1:第一代,自花授粉的T0植物的后代;T2:第二代,自花授粉的T1植物的后代;T3:第三代,自花授粉的T2植物的后代。
通过将CaMV35S启动子可操作地连接到有效转录成干扰RNA的序列上来形成RNAi构建体。所述RNAi序列包含约600个核苷酸的水稻胞嘧啶DNA甲基转移酶有义链(N端区)以及nos终止子序列的反向重复。使用标准分子生物学技术来制备构建体。35S::水稻Met::反向nos构建体的序列显示于SEQ ID NO:1中。所述构建体的水稻Met部分显示于SEQ ID NO:2中。将所述构建体插入到载体中,所述载体包含赋予对除草剂Basta的抗性的可选择标记基因。
通过农杆菌介导的转化方法将RNAi构建体载体引入水稻栽培品种Kitaake的组织培养物中。由对于Basta抗性选择的组织再生出来自十二种独立事件的T0植物并让其自花授粉。对十二种事件的转化组织分析存在的特异性甲基转移酶的内源转录物的量,其预计会被RNAi构建体所影响。作为对照,在发育的相同阶段分析来自转基因KitaakeT0组织植物的组织培养样品,所述植物包含这样一种载体,其具有连接到反向nos终止子上的35S启动子但缺少甲基转移酶RNAi。结果显示T0植物中的内源转录物水平范围为对照量的2%到53%。
除了使用约600个核苷酸的水稻甲基转移酶C端区的区域之外,以相同的方式制成第二RNAi构建体。第二构建体的序列显示于SEQ ID NO:3中。所述第二构建体的水稻Met部分显示于SEQ ID NO:4中。通过农杆菌介导的方法将第二RNAi构建体引入水稻栽培品种Kitaake中。
已经描述了本发明的若干实施方案。然而,将要理解的是可以在不背离发明的精神和范围的前提下进行各种改进。因此,其它实施方案在下列权利要求的范围之内。
序列表
<110>塞雷斯公司
<120>改变种子表型的方法和组合物
<130>18207-002W01
<150>US 60/510,924
<151>2003-10-14
<160>50
<170>FastSEQ for Windows Version 4.0
<210>1
<211>8812
<212>DNA
<213>人工序列
<220>
<223>合成产生的构建体
<221>misc_feature
<222>(0)...(0)
<223>NB42-35S-OsMET1Nt-RNAi #14
<400>1
aaatccaagc tcgatctagt aacatagatg acaccgcgcg cgataattta tcctagtttg 60
cgcgctatat tttgttttct atcgcgtatt aaatgtataa ttgcgggact ctaatcataa 120
aaacccatct cataaataac gtcatgcatt acatgttaat tattacatgc ttaacgtaat 180
tcaacagaaa ttatatgata atcatcgcaa gaccggcaac aggattcaat cttaagaaac 240
tttattgcca aatgtttgaa cgatcgagcg ctagcgccta tatcgctagc gatcgcgagc 300
tacgtacaca tcatgcatcg cgatcgagct tcgcgatcgt tcaaacattt ggcaataaag 360
tttcttaaga ttgaatcctg ttgccggtct tgcgatgatt atcatataat ttctgttgaa 420
ttacgttaag catgtaataa ttaacatgta atgcatgacg ttatttatga gatgggtttt 480
tatgattaga gtcccgcaat tatacattta atacgcgata gaaaacaaaa tatagcgcgc 540
aaactaggat aaattatcgc gcgcggtgtc atctatgtta ctagatcgct agatttcaca 600
tacaccaaaa aaatgctgca taattctcgg ggcagcaagt cggttacccg gccgccgtgc 660
tggaccgggt tgaatggtgc ccgtaacttt cggtagagcg gacggccaat actcaacttc 720
aaggaatctc acccatgcgc gccggcgggg aaccggagtt cccttcagtg aacgttatta 780
gttcgccgct cggtgtgtcg tagatactag cccctggggc cttttgaaat ttgaataaga 840
tttatgtaat cagtctttta ggtttgaccg gttctgccgc tttttttaaa attggatttg 900
taataataaa acgcaattgt ttgttattgt ggcgctctat catagatgtc gctataaacc 960
tattcagcac aatatattgt tttcatttta atattgtaca tataagtagt agggtacaat 1020
cagtaaattg aacggagaat attattcata aaaatacgat agtaacgggt gatatattca 1080
ttagaatgaa ccgaaaccgg cggtaaggat ctgagctaca catgctcagg ttttttacaa 1140
cgtgcacaac agaattgaaa gcaaatatca tgcgatccta gaattaattc aggtaggtca 1200
gatttgagta acaggtctaa caggtctagg aggagcagga agctcgaaat ctctttgcca 1260
gaatccaaca tcatgccatc ctccatgctt gtatccagca gctctaagag ttcctctagc 1320
agtgtatcca agagcctcat gaagtctaac agaaggatcg ttaggaagtc caataacagc 1380
aacaacagac ttgaatcctt gagcctccat agacttaaga agatgagtgt aaagagtaga 1440
tccaagtcca agtctttgat gtctatgaga aacgtaaaca gtagactcaa cagtccaatc 1500
gtaagcgttt ctagccttcc aaggtccagc gtaagcaatt ccagcaacaa ctccctcaac 1560
ctcagcaaca agccaagggt atctatcttg aagtctctca agatcatcga tccactcttg 1620
aggagtttga ggctcagttc tgaagttaac agtagaagtc tcaatgtaat ggttaacaat 1680
atcacaaaca gcagccatat cagcagcagt agcaggtcta atctcaacag gtcttctctc 1740
aggagacatt ttgtttagct gtcaaaacaa aaacaaaaat cgaaacatca gaatcaacaa 1800
aaatacatca accatcaact atacaacaac caaaacgtca acaatataat caaacacaga 1860
tccactgaaa caaaaccaca tatcaccagt tgagctatca tatcaaacca cgagacaaca 1920
ggtatatcaa atctaaggaa catcaccaac caaatacatc agaatcaact ataaccagag 1980
cagatacaga tcgacatgat aaaaaacatg cgaagacgat atcaaaacta aacgctatca 2040
attaatcaga ggattataca tcagactcaa taggaacaat attgatcgac gagtaaacgg 2100
atctaaagct agagaatcaa aagcagtata acaacagcaa agaataagcg ataatcacag 2160
tcaatataga gctaaaacta agaatctaaa ccctaaacag ctacaataat cataagaaga 2220
tgaagatcgg agacactaaa gagagaaaat atctaacctg caagtaagaa tctgaaagga 2280
gtcttgcggc tacgaaaatg tgagaaatat gagagcgcac cctaatcctg gtcgactcga 2340
gggtacttat agctacgagg tgtctagggt tttcgctttc tctttgtggt tctactttta 2400
ctaatttgcc cttacgcgtt ttgggccttt ctattttttt ggttgtgaat ttacccaaca 2460
aagaattaca aaaatggatc cacaaaattc tcatacattt ttttcttcaa tttgaaatgt 2520
taaatagctt ataattatgt gttgtttggt taagaaattg tataattgta taaatttttt 2580
tataaaaaaa ctctcttgat gatcgaaaag gtgacggaaa accctagccg tcatgagttg 2640
gctttgatag atctatggaa ttaaattaat actagtatat aaattgataa atcgaaatta 2700
cagcctaatt aatgggacat aaaacatata tttatctggc gccagaattc gaagctaaat 2760
gccatggatg tttaaaccta aaaacgtccg caatgtgtta ttaagttgtc taagcgtcaa 2820
tttgtttaca ccacaatata tcctgccacc agccagccaa cagctccccg accggcagct 2880
cggcacaaaa tcaccactcg atacaggcag cccatcagtc cgggacggcg tcagcgggag 2940
agccgttgta aggcggcaga ctttgctcat gttaccgatg ctattcggaa gaacggcagc 3000
ccttgtgtag ggcttattat gcacgcttaa aaataataaa agcagacttg acctgatagt 3060
ttggctgtga gcaattatgt gcttagtgca tctaacgctt gagttaagcc gcgccgcgaa 3120
gcggcgtcgg cttgaacgaa ttgttagaca ttatttgccg actaccttgg tgatctcgcc 3180
tttcacgtag tggacaaatt cttccaactg atctgcgcgc gaggccaagc gatcttcttc 3240
ttgtccaaga taagcctgtc tagcttcaag tatgacgggc tgatactggg ccggcaggcg 3300
ctccattgcc cagtcggcag cgacatcctt cggcgcgatt ttgccggtta ctgcgctgta 3360
ccaaatgcgg gacaacgtaa gcactacatt tcgctcatcg ccagcccagt cgggcggcga 3420
gttccatagc gttaaggttt catttagcgc ctcaaataga tcctgttcag gaaccggatc 3480
aaagagttcc tccgccgctg gacctaccaa ggcaacgcta tgttctcttg cttttgtcag 3540
caagatagcc agatcaatgt cgatcgtggc tggctcgaag atacctgcaa gaatgtcatt 3600
gcgctgccat tctccaaatt gcagttcgcg cttagctgga taacgccacg gaatgatgtc 3660
gtcgtgcaca acaatggtga cttctacagc gcggagaatc tcgctctctc caggggaagc 3720
cgaagtttcc aaaaggtcgt tgatcaaagc tcgccgcgtt gtttcatcaa gccttacggt 3780
caccgtaacc agcaaatcaa tatcactgtg tggcttcagg ccgccatcca ctgcggagcc 3840
gtacaaatgt acggccagca acgtcggttc gagatggcgc tcgatgacgc caactacctc 3900
tgatagttga gtcgatactt cggcgatcac cgcttccctc atgatgttta actttgtttt 3960
agggcgactg ccctgctgcg taacatcgtt gctgctccat aacatcaaac atcgacccac 4020
ggcgtaacgc gcttgctgct tggatgcccg aggcatagac tgtaccccaa aaaaacagtc 4080
ataacaagcc atgaaaaccg ccactgcgcc gttaccaccg ctgcgttcgg tcaaggttct 4140
ggaccagttg cgtgagcgca tacgctactt gcattacagc ttacgaaccg aacagggcgc 4200
tcttccgctc gccctttggc gcgccggatt atctggacac caaggcacca ggcgggtcaa 4260
atcaggaata agggcacatt gccccggcgt gagtcggggc aatcccgcaa ggagggtgaa 4320
tgaatcggac gtttgaccgg aaggcataca ggcaagaact gatcgacgcg gggttttccg 4380
ccgaggatgc cgaaaccatc gcaagccgca ccgtcatgcg tgcgccccgc gaaaccttcc 4440
agtccgtcgg ctcgatggtc cagcaagcta cggccaagat cgagcgcgac agcgtgcaac 4500
tggctccccc tgccctgccc gcgccatcgg ccgccgtgga gcgttcgcgt cgtctcgaac 4560
aggaggcggc aggtttggcg aagtcgatga ccatcgacac gcgaggaact atgacgacca 4620
agaagcgaaa aaccgccggc gaggacctgg caaaacaggt cagcgaggcc aagcaggccg 4680
cgttgctgaa acacacgaag cagcagatca aggaaatgca gctttccttg ttcgatattg 4740
cgccgtggcc ggacacgatg cgagcgatgc caaacgacac ggcccgctct gccctgttca 4800
ccacgcgcaa caagaaaatc ccgcgcgagg cgctgcaaaa caaggtcatt ttccacgtca 4860
acaaggacgt gaagatcacc tacaccggcg tcgagctgcg ggccgacgat gacgaactgg 4920
tgtggcagca ggtgttggag tacgcgaagc gcacccctat cggcgagccg atcaccttca 4980
cgttctacga gctttgccag gacctgggct ggtcgatcaa tggccggtat tacacgaagg 5040
ccgaggaatg cctgtcgcgc ctacaggcga cggcgatggg cttcacgtcc gaccgcgttg 5100
ggcacctgga atcggtgtcg ctgctgcacc gcttccgcgt cctggaccgt ggcaagaaaa 5160
cgtcccgttg ccaggtcctg atcgacgagg aaatcgtcgt gctgtttgct ggcgaccact 5220
acacgaaatt catatgggag aagtaccgca agctgtcgcc gacggcccga cggatgttcg 5280
actatttcag ctcgcaccgg gagccgtacc cgctcaagct ggaaaccttc cgcctcatgt 5340
gcggatcgga ttccacccgc gtgaagaagt ggcgcgagca ggtcggcgaa gcctgcgaag 5400
agttgcgagg cagcggcctg gtggaacacg cctgggtcaa tgatgacctg gtgcattgca 5460
aacgctaggg ccttgtgggg tcagttccgg gcgcgcctga agtacatcac cgacgagcaa 5520
ggcaagaccg agcgcctttc cgacgctcac cgggctggtt gccctcgccg ctgggctggc 5580
ggccgtctat ggccctgcaa acgcgccaga aacgccgtcg aagccgtgtg cgagacaccg 5640
cggccgccgg cgttgtggat acctcgcgga aaacttggcc ctcactgaca gatgaggggc 5700
ggacgttgac acttgagggg ccgactcacc cggcgcggcg ttgacagatg aggggcaggc 5760
tcgatttcgg ccggcgacgt ggagctggcc agcctcgcaa atcggcgaaa acgcctgatt 5820
ttacgcgagt ttcccacaga tgatgtggac aagcctgggg ataagtgccc tgcggtattg 5880
acacttgagg ggcgcgacta ctgacagatg aggggcgcga tccttgacac ttgaggggca 5940
gagtgctgac agatgggggg cgcacctatt gacatttgag gggctgtcca caggctgaaa 6000
atccagcatt tgcaagggtt tccgcccgtt tttcggccac cgctaacctg tcttttaacc 6060
tgcttttaaa ccaatattta taaaccttgt ttttaaccag ggctgcgccc tgtgcgcgtg 6120
accgcgcacg ccgaaggggg gtgccccccc ttctcgaacc ctcccggccc gctaaaaggg 6180
cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc 6240
tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat 6300
gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt 6360
ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg 6420
aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc 6480
tcctattccg accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt 6540
ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa 6600
gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta 6660
tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa 6720
caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa 6780
ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttacctt 6840
cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt 6900
ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat 6960
cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat 7020
gagattatca aaaaggatct tcacctagat ccttttaggg ctaccatgga ggcggcggcc 7080
aatcttgctt gtctcgctgg ccggcgccag atctggggaa ccctgtggtt ggcatgcaca 7140
tacaaatgga cgaacggata aaccttttca cgccctttta aatatccgat tattctaata 7200
aacgctcttt tctcttaggt ttacccgcca atatatcctg tcaaacactg atagtttaaa 7260
ctgaaggcgg gaaacgacaa tctgatctct aggtccccag attagccttt tcaatttcag 7320
aaagaatgct aacccacaga tggttagaga ggcttacgca gcaggtctca tcaagacgat 7380
ctacccgagc aataatctcc aggaaatcaa ataccttccc aagaaggtta aagatgcagt 7440
caaaagattc aggactaact gcatcaagaa cacagagaaa gatatatttc tcaagatcag 7500
aagtactatt ccagtatgga cgattcaagg cttgcttcac aaaccaaggc aagtaataga 7560
gattggagtc tctaaaaagg tagttcccac tgaatcaaag gccatggagt caaagattca 7620
aatagaggac ctaacagaac tcgccgtaaa gactggcgaa cagttcatac agagtctctt 7680
acgactcaat gacaagaaga aaatcttcgt caacatggtg gagcacgaca cacttgtcta 7740
ctccaaaaat atcaaagata cagtctcaga agaccaaagg gcaattgaga cttttcaaca 7800
aagggtaata tccggaaacc tcctcggatt ccattgccca gctatctgtc actttattgt 7860
gaagatagtg gaaaaggaag gtggctccta caaatgccat cattgcgata aaggaaaggc 7920
catcgttgaa gatgcctctg ccgacagtgg tcccaaagat ggacccccac ccacgaggag 7980
catcgtggaa aaagaagacg ttccaaccac gtcttcaaag caagtggatt gatgtgatat 8040
ctccactgac gtaagggatg acgcacaatc ccactatcct tcgcaagacc cttcctctat 8100
ataaggaagt tcatttcatt tggagagaac acgggggact ctagtgggcc ctaagcttca 8160
tttaaatcca ctgcagtggt tccaagaaag agagcaatgg tgccactgaa cctggtaatg 8220
agcctgttgc cagcaagaga ccgaagagag cagctgcctg ttctaacttc aaagagaagt 8280
cattggactt atcagaaaaa gattcaatta tcacaatcaa ggaaagtcgg gttgaagaga 8340
aggaaataga ggctgttaat ttgacaagga cgggacctga agatggtcaa ccttgcagaa 8400
aaatcatcga tttcatctta catgatggag atggtaatct gcaacccttt gaaatgtctg 8460
aagttgatga cattttcata acagctctta tcatgccctt ggatgatgat ctggaaaagg 8520
ataggggaaa gggaatatgt tgttcggggt ttggacgaat tgaaaactgg gcgatttctg 8580
gctatgatga aggtgctgca gtaatttggg tctcaacaga aacatcagat tacaaatgtg 8640
tgaagccagc aagcagttac agatcttatt ttgaacactt tagtgagaag gcacgtgtct 8700
gtgttgaagt ctataagaag ttagctagat cagttggtgg aaatcctcag gtggacttag 8760
aagaattaat tgctggtgtt gtccgttcca tccattgcac tggtctagac cc 8812
<210>2
<211>612
<212>DNA
<213>稻(oryza sativa)
<220>
<221>misc_feature
<222>(0)...(0)
<223>OsMet1的N-末端结构域
<400>2
ttccaagaaa gagagcaatg gtgccactga acctggtaat gagcctgttg ccagcaagag 60
accgaagaga gcagctgcct gttctaactt caaagagaag tcattggact tatcagaaaa 120
agattcaatt atcacaatca aggaaagtcg ggttgaagag aaggaaatag aggctgttaa 180
tttgacaagg acgggacctg aagatggtca accttgcaga aaaatcatcg atttcatctt 240
acatgatgga gatggtaatc tgcaaccctt tgaaatgtct gaagttgatg acattttcat 300
aacagctctt atcatgccct tggatgatga tctggaaaag gataggggaa agggaatatg 360
ttgttcgggg tttggacgaa ttgaaaactg ggcgatttct ggctatgatg aaggtgctgc 420
agtaatttgg gtctcaacag aaacatcaga ttacaaatgt gtgaagccag caagcagtta 480
cagatcttat tttgaacact ttagtgagaa ggcacgtgtc tgtgttgaag tctataagaa 540
gttagctaga tcagttggtg gaaatcctca ggtggactta gaagaattaa ttgctggtgt 600
tgtccgttcc at 612
<210>3
<211>8862
<212>DNA
<213>人工序列
<220>
<223>合成产生的构建体
<221>misc_featute
<222>(0)...(0)
<223>(NB42-35S-OsMET1ct-RNAi #2
<400>3
aaatccaagc tcgatctagt aacatagatg acaccgcgcg cgataattta tcctagtttg 60
cgcgctatat tttgttttct atcgcgtatt aaatgtataa ttgcgggact ctaatcataa 120
aaacccatct cataaataac gtcatgcatt acatgttaat tattacatgc ttaacgtaat 180
tcaacagaaa ttatatgata atcatcgcaa gaccggcaac aggattcaat cttaagaaac 240
tttattgcca aatgtttgaa cgatcgagcg ctagcgccta tatcgctagc gatcgcgagc 300
tacgtacaca tcatgcatcg cgatcgagct tcgcgatcgt tcaaacattt ggcaataaag 360
tttcttaaga ttgaatcctg ttgccggtct tgcgatgatt atcatataat ttctgttgaa 420
ttacgttaag catgtaataa ttaacatgta atgcatgacg ttatttatga gatgggtttt 480
tatgattaga gtcccgcaat tatacattta atacgcgata gaaaacaaaa tatagcgcgc 540
aaactaggat aaattatcgc gcgcggtgtc atctatgtta ctagatcgct agatttcaca 600
tacaccaaaa aaatgctgca taattctcgg ggcagcaagt cggttacccg gccgccgtgc 660
tggaccgggt tgaatggtgc ccgtaacttt cggtagagcg gacggccaat actcaacttc 720
aaggaatctc acccatgcgc gccggcgggg aaccggagtt cccttcagtg aacgttatta 780
gttcgccgct cggtgtgtcg tagatactag cccctggggc cttttgaaat ttgaataaga 840
tttatgtaat cagtctttta ggtttgaccg gttctgccgc tttttttaaa attggatttg 900
taataataaa acgcaattgt ttgttattgt ggcgctctat catagatgtc gctataaacc 960
tattcagcac aatatattgt tttcatttta atattgtaca tataagtagt agggtacaat 1020
cagtaaattg aacggagaat attattcata aaaatacgat agtaacgggt gatatattca 1080
ttagaatgaa ccgaaaccgg cggtaaggat ctgagctaca catgctcagg ttttttacaa 1140
cgtgcacaac agaattgaaa gcaaatatca tgcgatccta gaattaattc aggtaggtca 1200
gatttgagta acaggtctaa caggtctagg aggagcagga agctcgaaat ctctttgcca 1260
gaatccaaca tcatgccatc ctccatgctt gtatccagca gctctaagag ttcctctagc 1320
agtgtatcca agagcctcat gaagtctaac agaaggatcg ttaggaagtc caataacagc 1380
aacaacagac ttgaatcctt gagcctccat agacttaaga agatgagtgt aaagagtaga 1440
tccaagtcca agtctttgat gtctatgaga aacgtaaaca gtagactcaa cagtccaatc 1500
gtaagcgttt ctagccttcc aaggtccagc gtaagcaatt ccagcaacaa ctccctcaac 1560
ctcagcaaca agccaagggt atctatcttg aagtctctca agatcatcga tccactcttg 1620
aggagtttga ggctcagttc tgaagttaac agtagaagtc tcaatgtaat ggttaacaat 1680
atcacaaaca gcagccatat cagcagcagt agcaggtcta atctcaacag gtcttctctc 1740
aggagacatt ttgtttagct gtcaaaacaa aaacaaaaat cgaaacatca gaatcaacaa 1800
aaatacatca accatcaact atacaacaac caaaacgtca acaatataat caaacacaga 1860
tccactgaaa caaaaccaca tatcaccagt tgagctatca tatcaaacca cgagacaaca 1920
ggtatatcaa atctaaggaa catcaccaac caaatacatc agaatcaact ataaccagag 1980
cagatacaga tcgacatgat aaaaaacatg cgaagacgat atcaaaacta aacgctatca 2040
attaatcaga ggattataca tcagactcaa taggaacaat attgatcgac gagtaaacgg 2100
atctaaagct agagaatcaa aagcagtata acaacagcaa agaataagcg ataatcacag 2160
tcaatataga gctaaaacta agaatctaaa ccctaaacag ctacaataat cataagaaga 2220
tgaagatcgg agacactaaa gagagaaaat atctaacctg caagtaagaa tctgaaagga 2280
gtcttgcggc tacgaaaatg tgagaaatat gagagcgcac cctaatcctg gtcgactcga 2340
gggtacttat agctacgagg tgtctagggt tttcgctttc tctttgtggt tctactttta 2400
ctaatttgcc cttacgcgtt ttgggccttt ctattttttt ggttgtgaat ttacccaaca 2460
aagaattaca aaaatggatc cacaaaattc tcatacattt ttttcttcaa tttgaaatgt 2520
taaatagctt ataattatgt gttgtttggt taagaaattg tataattgta taaatttttt 2580
tataaaaaaa ctctcttgat gatcgaaaag gtgacggaaa accctagccg tcatgagttg 2640
gctttgatag atctatggaa ttaaattaat actagtatat aaattgataa atcgaaatta 2700
cagcctaatt aatgggacat aaaacatata tttatctggc gccagaattc gaagctaaat 2760
gccatggatg tttaaaccta aaaacgtccg caatgtgtta ttaagttgtc taagcgtcaa 2820
tttgtttaca ccacaatata tcctgccacc agccagccaa cagctccccg accggcagct 2880
cggcacaaaa tcaccactcg atacaggcag cccatcagtc cgggacggcg tcagcgggag 2940
agccgttgta aggcggcaga ctttgctcat gttaccgatg ctattcggaa gaacggcagc 3000
ccttgtgtag ggcttattat gcacgcttaa aaataataaa agcagacttg acctgatagt 3060
ttggctgtga gcaattatgt gcttagtgca tctaacgctt gagttaagcc gcgccgcgaa 3120
gcggcgtcgg cttgaacgaa ttgttagaca ttatttgccg actaccttgg tgatctcgcc 3180
tttcacgtag tggacaaatt cttccaactg atctgcgcgc gaggccaagc gatcttcttc 3240
ttgtccaaga taagcctgtc tagcttcaag tatgacgggc tgatactggg ccggcaggcg 3300
ctccattgcc cagtcggcag cgacatcctt cggcgcgatt ttgccggtta ctgcgctgta 3360
ccaaatgcgg gacaacgtaa gcactacatt tcgctcatcg ccagcccagt cgggcggcga 3420
gttccatagc gttaaggttt catttagcgc ctcaaataga tcctgttcag gaaccggatc 3480
aaagagttcc tccgccgctg gacctaccaa ggcaacgcta tgttctcttg cttttgtcag 3540
caagatagcc agatcaatgt cgatcgtggc tggctcgaag atacctgcaa gaatgtcatt 3600
gcgctgccat tctccaaatt gcagttcgcg cttagctgga taacgccacg gaatgatgtc 3660
gtcgtgcaca acaatggtga cttctacagc gcggagaatc tcgctctctc caggggaagc 3720
cgaagtttcc aaaaggtcgt tgatcaaagc tcgccgcgtt gtttcatcaa gccttacggt 3780
caccgtaacc agcaaatcaa tatcactgtg tggcttcagg ccgccatcca ctgcggagcc 3840
gtacaaatgt acggccagca acgtcggttc gagatggcgc tcgatgacgc caactacctc 3900
tgatagttga gtcgatactt cggcgatcac cgcttccctc atgatgttta actttgtttt 3960
agggcgactg ccctgctgcg taacatcgtt gctgctccat aacatcaaac atcgacccac 4020
ggcgtaacgc gcttgctgct tggatgcccg aggcatagac tgtaccccaa aaaaacagtc 4080
ataacaagcc atgaaaaccg ccactgcgcc gttaccaccg ctgcgttcgg tcaaggttct 4140
ggaccagttg cgtgagcgca tacgctactt gcattacagc ttacgaaccg aacagggcgc 4200
tcttccgctc gccctttggc gcgccggatt atctggacac caaggcacca ggcgggtcaa 4260
atcaggaata agggcacatt gccccggcgt gagtcggggc aatcccgcaa ggagggtgaa 4320
tgaatcggac gtttgaccgg aaggcataca ggcaagaact gatcgacgcg gggttttccg 4380
ccgaggatgc cgaaaccatc gcaagccgca ccgtcatgcg tgcgccccgc gaaaccttcc 4440
agtccgtcgg ctcgatggtc cagcaagcta cggccaagat cgagcgcgac agcgtgcaac 4500
tggctccccc tgccctgccc gcgccatcgg ccgccgtgga gcgttcgcgt cgtctcgaac 4560
aggaggcggc aggtttggcg aagtcgatga ccatcgacac gcgaggaact atgacgacca 4620
agaagcgaaa aaccgccggc gaggacctgg caaaacaggt cagcgaggcc aagcaggccg 4680
cgttgctgaa acacacgaag cagcagatca aggaaatgca gctttccttg ttcgatattg 4740
cgccgtggcc ggacacgatg cgagcgatgc caaacgacac ggcccgctct gccctgttca 4800
ccacgcgcaa caagaaaatc ccgcgcgagg cgctgcaaaa caaggtcatt ttccacgtca 4860
acaaggacgt gaagatcacc tacaccggcg tcgagctgcg ggccgacgat gacgaactgg 4920
tgtggcagca ggtgttggag tacgcgaagc gcacccctat cggcgagccg atcaccttca 4980
cgttctacga gctttgccag gacctgggct ggtcgatcaa tggccggtat tacacgaagg 5040
ccgaggaatg cctgtcgcgc ctacaggcga cggcgatggg cttcacgtcc gaccgcgttg 5100
ggcacctgga atcggtgtcg ctgctgcacc gcttccgcgt cctggaccgt ggcaagaaaa 5160
cgtcccgttg ccaggtcctg atcgacgagg aaatcgtcgt gctgtttgct ggcgaccact 5220
acacgaaatt catatgggag aagtaccgca agctgtcgcc gacggcccga cggatgttcg 5280
actatttcag ctcgcaccgg gagccgtacc cgctcaagct ggaaaccttc cgcctcatgt 5340
gcggatcgga ttccacccgc gtgaagaagt ggcgcgagca ggtcggcgaa gcctgcgaag 5400
agttgcgagg cagcggcctg gtggaacacg cctgggtcaa tgatgacctg gtgcattgca 5460
aacgctaggg ccttgtgggg tcagttccgg gcgcgcctga agtacatcac cgacgagcaa 5520
ggcaagaccg agcgcctttc cgacgctcac cgggctggtt gccctcgccg ctgggctggc 5580
ggccgtctat ggccctgcaa acgcgccaga aacgccgtcg aagccgtgtg cgagacaccg 5640
cggccgccgg cgttgtggat acctcgcgga aaacttggcc ctcactgaca gatgaggggc 5700
ggacgttgac acttgagggg ccgactcacc cggcgcggcg ttgacagatg aggggcaggc 5760
tcgatttcgg ccggcgacgt ggagctggcc agcctcgcaa atcggcgaaa acgcctgatt 5820
ttacgcgagt ttcccacaga tgatgtggac aagcctgggg ataagtgccc tgcggtattg 5880
acacttgagg ggcgcgacta ctgacagatg aggggcgcga tccttgacac ttgaggggca 5940
gagtgctgac agatgggggg cgcacctatt gacatttgag gggctgtcca caggctgaaa 6000
atccagcatt tgcaagggtt tccgcccgtt tttcggccac cgctaacctg tcttttaacc 6060
tgcttttaaa ccaatattta taaaccttgt ttttaaccag ggctgcgccc tgtgcgcgtg 6120
accgcgcacg ccgaaggggg gtgccccccc ttctcgaacc ctcccggccc gctaaaaggg 6180
cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc 6240
tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat 6300
gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt 6360
ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg 6420
aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc 6480
tcctattccg accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt 6540
ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa 6600
gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta 6660
tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa 6720
caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa 6780
ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttacctt 6840
cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt 6900
ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat 6960
cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat 7020
gagattatca aaaaggatct tcacctagat ccttttaggg ctaccatgga ggcggcggcc 7080
aatcttgctt gtctcgctgg ccggcgccag atctggggaa ccctgtggtt ggcatgcaca 7140
tacaaatgga cgaacggata aaccttttca cgccctttta aatatccgat tattctaata 7200
aacgctcttt tctcttaggt ttacccgcca atatatcctg tcaaacactg atagtttaaa 7260
ctgaaggcgg gaaacgacaa tctgatctct aggtccccag attagccttt tcaatttcag 7320
aaagaatgct aacccacaga tggttagaga ggcttacgca gcaggtctca tcaagacgat 7380
ctacccgagc aataatctcc aggaaatcaa ataccttccc aagaaggtta aagatgcagt 7440
caaaagattc aggactaact gcatcaagaa cacagagaaa gatatatttc tcaagatcag 7500
aagtactatt ccagtatgga cgattcaagg cttgcttcac aaaccaaggc aagtaataga 7560
gattggagtc tctaaaaagg tagttcccac tgaatcaaag gccatggagt caaagattca 7620
aatagaggac ctaacagaac tcgccgtaaa gactggcgaa cagttcatac agagtctctt 7680
acgactcaat gacaagaaga aaatcttcgt caacatggtg gagcacgaca cacttgtcta 7740
ctccaaaaat atcaaagata cagtctcaga agaccaaagg gcaattgaga cttttcaaca 7800
aagggtaata tccggaaacc tcctcggatt ccattgccca gctatctgtc actttattgt 7860
gaagatagtg gaaaaggaag gtggctccta caaatgccat cattgcgata aaggaaaggc 7920
catcgttgaa gatgcctctg ccgacagtgg tcccaaagat ggacccccac ccacgaggag 7980
catcgtggaa aaagaagacg ttccaaccac gtcttcaaag caagtggatt gatgtgatat 8040
ctccactgac gtaagggatg acgcacaatc ccactatcct tcgcaagacc cttcctctat 8100
ataaggaagt tcatttcatt tggagagaac acgggggact ctagtgggcc ctaagcttca 8160
tttaaatcca ctgcagtggt tgagctaggt ggttcagaca aaccaaagga tgggcaatca 8220
gagaactgtc ttgcaacact tgacattttt gctggttgtg gaggtttatc tgaaggattg 8280
cagcgatcag gattgtcact tactaaatgg gctattgaat atgaagaacc tgctggggat 8340
gcatttggtg aaaaccatcc agaagctgca gtatttgtcg aaaactgcaa tgtgattctg 8400
aaggcaatta tggacaagtg tggtgattct gatgattgca tctccacttc tgaggctgct 8460
gaacgagcag ctaaactttc tgaggacaag attaagaatc tgcccgtgcc tggcgaagta 8520
gaattcataa atggtggccc tccgtgtcag ggtttttctg ggatgaacag attcaatcaa 8580
agtccctgga gcaaagtcca gtgcgagatg atcttagcat tcctgtcatt tgcggagtat 8640
ttccgtccta gattctttct cttagaaaat gttaggaact ttgtctcgtt caacaaagga 8700
cagaccttca gattgacact ggcatcactc ctggagatgg gataccaggt ccgatttgga 8760
attttagagg caggggctta tggtgttgcg cagtccagga aaagggcatt catttgggcc 8820
gctgcacctg gagagactct tccattgcac tggtctagac cc 8862
<210>4
<211>662
<212>DNA
<213>稻(oryza sativa)
<220>
<221>misc feature
<222>(0)...(0)
<223>OsMET1的C-末端结构域
<400>4
ttgagctagg tggttcagac aaaccaaagg atgggcaatc agagaactgt cttgcaacac 60
ttgacatttt tgctggttgt ggaggtttat ctgaaggatt gcagcgatca ggattgtcac 120
ttactaaatg ggctattgaa tatgaagaac ctgctgggga tgcatttggt gaaaaccatc 180
cagaagctgc agtatttgtc gaaaactgca atgtgattct gaaggcaatt atggacaagt 240
gtggtgattc tgatgattgc atctccactt ctgaggctgc tgaacgagca gctaaacttt 300
ctgaggacaa gattaagaat ctgcccgtgc ctggcgaagt agaattcata aatggtggcc 360
ctccgtgtca gggtttttct gggatgaaca gattcaatca aagtccctgg agcaaagtcc 420
agtgcgagat gatcttagca ttcctgtcat ttgcggagta tttccgtcct agattctttc 480
tcttagaaaa tgttaggaac tttgtctcgt tcaacaaagg acagaccttc agattgacac 540
tggcatcact cctggagatg ggataccagg tccgatttgg aattttagag gcaggggctt 600
atggtgttgc gcagtccagg aaaagggcat tcatttgggc cgctgcacct ggagagactc 660
tt 662
<210>5
<211>1187
<212>DNA
<213>人工序列
<220>
<223>合成产生的
<221>misc_feature
<222>(0)...(0)
<223>启动子(FIE)
<400>5
ggatcccccg ggctgcagga attcgatatc aagcttatcg atgagtttct caaagtttgg 60
accttgatta tcttgtttgg agatgttcaa atcgttatat ccaaatagtg aacttctaat 120
tttctttttt gataatgtga cttatttgga aaagtattcc aaagtattca aataaaccct 180
ttaaaaatcc attaaataca ttttaaataa gtaaaatgct ctcaacgaag agatatcatg 240
gtaaataaca acagtgagag gataaaatgt taaatcaatt tatttacaac ttcaaatagg 300
cggacatcaa acctacttag cacactttct attttcaaat tggttatggt ttgtctatta 360
gttgttgcat ctatgttttt taattcttat atcggtgatc ttgattttgt tttggtgtat 420
ctaaaatcta ttttagttaa agtgcaagaa aataaaataa aaacttaagg taagagatga 480
aagtaagctt taaataaaac agagcacttc tatggtcgat tatagagcca agttcgttcc 540
tccattttgg cttaatgcaa tattacaagt aaatcttata aaactttcca taagtatcgt 600
attacccatg gatactatga tatataaact ctcggaggtg tagtccagaa gaaatgatcc 660
atatttgcat acagtaaact tgatggaaaa aatatgtggt actgttggaa ttgtagctat 720
tgagtatcaa atttgagaaa aaggtaaaaa aatatgtaaa atttgggtgg aagaaaagaa 780
ttacataaaa ttgagaaatg tatgtaattg acaaaataat gttttcaaaa cataaaaacg 840
tgataccatt taaatccaaa ccttatatca tttaaccatt tttagtaaaa ctaatagtaa 900
tgaatggtca ataatataag attacatatt aaataattac tactttcaga aaatttcaat 960
caaatctata atattccttt gaaaaaaaag aaagacaaat aggtaaactt cgatcgtatc 1020
aatcaaagaa tatatttatt tttcatcgta acgtttaatt ctaagtccta ttaaaaaacg 1080
ttaaatttga tttttcttac catttttttc taaaaggtga gttgtgtgtt gtgtcaggtc 1140
caaaataaaa gtttgtcgtg aggtcaaaat ctacggttac aggatcc 1187
<210>6
<211>1019
<212>DNA
<213>鼠耳芥(Arabidopsis thaliana)
<220>
<221>misc_feature
<222>(0)...(0)
<223>YP0102
<400>6
gaggtcagtg aagtcgattg ggatttggtt gataacgttt tactcgacta attatatact 60
tcagaaggat agtaatagaa taccaaaata attaaatgat tggttagtgc cttagtggag 120
actttttaac cgattctaat agactaatga tgtagctaag cattatttgg gatcatcact 180
gtttgaaaac gtgaaatgtg ataaaagtta tgaaacgatt aaaatataaa ataaccgtac 240
aaaacattat gtaccgtttt tttctctgtt cttttggcga tttggtttag ttcgttacac 300
tctaaatgtt attgcatata tatatataat gatgcatttg catctgagga acatataatt 360
ccggttaaca cttccaaatc ttatatccgt ctaggtaggg attttataaa tcatttgtgt 420
catcatgcgt tatgcttgtc ggctttgacc ataacgcaga gatatagaac tagcttttac 480
ttaactttta gatttattat ttgatctaga gttaagtgga gatatatagt gtttttgtta 540
gattattggt ggatgtgaga gtttgtcttt agtttcaagt tgagaatata aggcaagagg 600
agactctgag gcaatcagag gttttgattg gcaaaatatc caaaaggccc aaaccaagtc 660
gaagcccatc tcgtacaaaa aaagaaagag atctgtaaga aaaaatattc tttgatattc 720
ttacaaaaat aagtgtaaaa cttttattag tcaaaatctt caatctttaa aaactctcat 780
cactcctacg aaagcgcgtg agagttatga gacattcctt aatagcatta ctcacaagtc 840
acaagttcaa aacgtctgac tgaaacagaa acaagccttt gttgaagtct tgaagaagag 900
acattagtac tcgtcgtata gccataaaag gtaatatacg aaatttcttc gctaatctct 960
tcaccttcct ctacgcgttt cactttcact ttataaatcc aaatctccct tcgaaaaca 1019
<210>7
<211>1023
<212>DNA
<213>鼠耳芥(Arabidopsis thaliana)
<220>
<221>misc_feature
<222>(0)...(0)
<223>YP0117
<400>7
aagaagtcag tgagtcgatt ggatcacagt cctttatgat aaaacaaact cataattatt 60
ccaccgacaa catgcgtttt aaattatttt ttcttaaatt atattatatt atattgatat 120
caacctagct aaaataattc ggatggcgaa atcggacaat ttttaataga aaaaatgggt 180
atgaagatag tctatgattc cgttcttagc gactagaggg acctgctcaa atctcccggg 240
tgatacgcga tgtcaagctc aatagaaccc cacaaccgac gagaccgaga aatccttgat 300
ttgggctaga agattttgaa ataaatttaa tatattctaa gtaacttgct taaatttttt 360
ttcaaactct aaagacataa ctaacataaa gtaaaaaaaa aaaagttaat acatgggaag 420
aaaaaaatta aactaatgat tagctctcta acgtgtttaa tctcgtatca agtttttttt 480
tttaaattat attgctatta aaacattgta ctattgtttc tattttgttt agctattatt 540
cttgtgaaat gaaaagttgt gtttattcaa ttactaaatg gcaatattta tcttggaaaa 600
ctatacctct aattggatta ggccctagac atcctcttta gcttattgac gttaaaatta 660
ttcccaaaac tattaaagtt tagtagtttg aaagatgcat caagacctac tcagataggt 720
aaaagtagaa aactacagtt agtgtgatta tattttaaaa tatataaaac aatcttatta 780
aactaaatat tcaagatata tactcaaatg gaagataaaa acatttagtc tgttaccact 840
accagcctag ctagtcactg atagtcactt tggaactgag tagatatttg catcttgagt 900
taccatggac tcaaaagtcc aaaaagagac cccgagtgaa aatgctacca acttaataac 960
aaagaagaat ttacagcggt caaaaagtat ctataaatgg ttacacaaca gtagtcataa 1020
gca 1023
<210>8
<211>1005
<212>DNA
<213>鼠耳芥(Arabidopsis thaliana)
<220>
<221>misc_feature
<222>(0)...(0)
<223>YP0180
<400>8
ttattgttga aacggatggt atccagattc atagagttat acgttgttga cctcgtacag 60
gatgaattca ttatcttctt cttcttttgc agcatggcag gtgatcgatg ggtatgactt 120
gtgatgatag ccatgtccac caaatcagcc aagaaaagat caagacctcg gctgcttacg 180
ttctgttcta taaacgcctt gtagactaaa gaaactgaag cggaaaagac aagaaagagg 240
tatttgcatt tttgccgggt ttggcttatt taaaaacatc attggcttga ttctaattca 300
ctacaagatc aagatgaaag cagctctgcg ttgaggctaa tttacagaag agagagagag 360
agttgggaag aagagcaaaa gaccgagagg acatgttgcg gggaatttat tttattctta 420
caaaaattgg tatctgatta ttttattaac catattcaat tagagaatag aagaatagag 480
aaaagccctt ttgtgggata tggttctaaa ttgttgttta gttcttgtgt gtcagttttg 540
gctctcgtcg accaaagaag attaaagaaa cctctacctt attttaactc aattcttttg 600
tttttgcaat gtcctttgct ttccaaaatt gttagtctta cttttcacta ctttgataga 660
cattgccttt gcgtttccct gattaataag ccagagtact taaatcaaaa ttgactgttt 720
tgtgcatcct gcatcacgtt tccaatcaga accatagtgt tgtcgttgtg tcattatccg 780
aatttaagtg gagacattgg taagttattt ataaactaat tacaatctat ttttctaatt 840
atttcaaata acatatttaa gctctgtagc ttccactaga cggtgaagat ttgaagtgag 900
agctctcttt gcattgctca cccaccaatg gatctaccta cccttcttct tcttctccgc 960
cttttaaacc ctaaaagttt ctctttcctt caacaacgcc acaat 1005
<210>9
<211>1002
<212>DNA
<213>鼠耳芥(Arabidopsis thaliana)
<220>
<221>misc_feature
<222>(0)...(0)
<223>YP0110
<400>9
ggaacgttag ctgctatagc aaagcatgga atggcaatgt cagatccgga acctgaaata 60
aacgtgtatc agatcgcttc ttcggcgata aacccgctgg ttttcgaaga cttagcggag 120
cttctttata accactacaa aacatctcca tgcatggact ctaaaggtga tcctattatg 180
gtgcgtttga tgaaactttt caattccgtt gatgatttct cggatcattt gtggagagat 240
gctcaagaac ggagtgggtt gatgagtggt atgagttcag tggatagtaa gatgatgcag 300
aagctaaagt ttatatgcaa gaaatctgtt gaacaagcca aacaccttgc tactatttat 360
gagccataca ctttctatgg tggaaggtaa gacagaactc attaacattc taattcttag 420
agcagacaaa accggtaccc gcaaagtttt catctttttt ttttggtttc ttttacagat 480
ttgataacag caatacacag agattaatgg agaatatgtc agaggacgag aagagagaat 540
ttggatttga tgttggaagc attaactgga cggactacat tacaaacgtt cacattcccg 600
gtttaagaag gcatgtcttg aaaggaagag cttaactttg aatctcacta aaccagacca 660
aacagaatcg atcccttctt ttatcttttt atctttttct tttttcatta cgtgtaatcg 720
tgttgtgtct aatatatcag tttgatttgt aataatttga aaaaaaacgg aaatgttgtt 780
atctttaagt ttgcccaaaa tctatagtca tgttcgattc aagacaaagt ttaaagttac 840
aacctgtaaa aatattaata gtctctgatg taaacgtatc ttaaacaaaa ttattaaatg 900
ttgaagttag taacatacaa ttattaatga ataaatgttt aatcaattaa atgtcattta 960
gtgattgtcc tataaaatct cttgttttct tgttttatat ta 1002
<210>10
<211>1000
<212>DNA
<213>鼠耳芥(Arabidopsis thaliana)
<220>
<221>misc_feature
<222>(0)...(0)
<223>YP0039
<400>10
ccgttcgagt atttgaaaat ttcgggtaca cccgcctaaa taggcggacc ttatctagta 60
tatatataca tttgaactat attgtttact ttttagttga tttaggctat gtatgacatt 120
gacataaatc tacctgttat ttatcacgtg taattcgtgt aaagtgtaaa ctagaaagtt 180
caaatacgta tttgtttttg ttctgttata taggattgtc atagttgtaa atctacaatt 240
tattacaaca tgaataagta cacaagcaat gtaattggat ttaattgcta aactctttac 300
atggtcaatc taaatttgat aagaaatacg tcacatatta ctaagactga tagttttttt 360
gttgtcacca attatttttg ttaaattgac gaaaacaatt ccaaaaactc aaatgtacaa 420
aatcatacag tctcacaaac atctcataga gaaagatata aatctcccat atgggaacga 480
taacacgagg tcgaaatact attcgtaaaa ctaaaacgcc ttagttataa atcgttagtt 540
gtaaccgcgg tcgagaatac atacagatcc acgaaactac tactacacat gctgctgaat 600
tggaatttgg aaaagaccat cttctttagg aagagctcac ccaatgagtg acaaaggtgt 660
cggtggcttg ttttctaccc atatgtatac atcaaatggt agtttcatta acgtttggtt 720
ttgagaaaag taagactttg gctagtagct aggttcgtat ataataaact cttttgagaa 780
agttcatcac tggtggaaaa tgttaaaccg gttttttctc attttttccg ccatgttaac 840
caccggttta aaaagaccgt aacacattga aagattaata agggtatatt tgtaattacg 900
gtttgctggc aatttttaat tattatttta attagagaaa atagagaagc cctatcaatg 960
tacatggtat atatataaaa ggcaaaaccc tagaaaacga 1000
<210>11
<211>1000
<212>DNA
<213>鼠耳芥(Arabidopsis thaliana)
<220>
<221>misc_feature
<222>(0)...(0)
<223>YP0101
<400>11
ttctcgttct ctagaatatt gctggaccgg attaggtcaa tattattggg ccagattaga 60
tattgaattg tcgacgttgc ttacgttacg ttatatcttg tttaagaatt aaacctatcg 120
acttagtctt aattaagaaa acattgcctt aaattctctg gtctgcgacc gtttttttga 180
ccgttaaccc ctaattaaag aaacaaaata attatagaaa gagcactgaa atgtgattat 240
tttaacagta ctcttatgag aaaattcgta ctttttagtt ttttttttgt acaaatctct 300
aagaaaaaca ctactactaa ttaagaaacg tttcaaacaa ttttattttc gttggctcat 360
aatctttctt tctcggtccg ggactaaccg ttggcaaaaa aaaaaaaaaa gttgacaata 420
attattaaag cgtaaatcat acctctcaaa taaaaacttg aatttggaaa caaagacaac 480
taaaaaactc gaatttaaga gaattcctaa aatcaagtga agtatcatca cttggtaaaa 540
tttcataacc gttggcttct atttctatgt gtgccttggt ttgcaggaga taatatttca 600
tttccaacca atgatattcg tacacatagt caaacaaatg tttgtctttg ttattatatt 660
gagaaagaaa caagaaagag agagagagat agataagacg aaggaagtga agcttccaag 720
cgcccaccgt taaaaatctc gtgtgcaagt ttcaaataca agtggccggt ggtctccata 780
atttgatcgt catccaatta aaaaggaaga aaaagcgtgt tttatacaag aaaactcatt 840
aaaataaaag tccaaaatat ctaaacacta atctaccacg tctattacac acacacacac 900
acacttgatc ttaatttatt ttcaagattc aagaaaatac ccattccatt accacaactt 960
gaccacacgc ctatatataa aacataaaag ccctttcccc 1000
<210>12
<211>1000
<212>DNA
<213>鼠耳芥(Arabidopsis thaliana)
<220>
<221>misc_feature
<222>(0)...(0)
<223>YP0119
<400>12
taccaaaaat aaggagtttc caaaagatgg ttctgatgag aaacagagcc catccctctc 60
cttttcccct tcccatgaaa gaaatcggat ggtcctcctt caatgtcctc cacctactct 120
tctcttcttt ctttttttct ttcttattat taaccattta attaatttcc ccttcaattt 180
cagtttctag ttctgtaaaa agaaaataca catctcactt atagatatcc atatctattt 240
atatgcatgt atagagaata aaaaagtgtg agtttctagg tatgttgagt atgtgctgtt 300
tggacaattg ttagatgatc tgtccatttt tttctttttt cttctgtgta taaatatatt 360
tgagcacaaa gaaaaactaa taaccttctg ttttcagcaa gtagggtctt ataaccttca 420
aagaaatatt ccttcaattg aaaacccata aaccaaaata gatattacaa aaggaaagag 480
agatattttc aagaacaaca taattagaaa agcagaagca gcagttaagt ggtactgaga 540
taaatgatat agtttctctt caagaacagt ttctcattac ccaccttctc ctttttgctg 600
atctatcgta atcttgagaa ctcaggtaag gttgtgaata ttatgcacca ttcattaacc 660
ctaaaaataa gagatttaaa ataaatgttt cttctttctc tgattcttgt gtaaccaatt 720
catgggtttg atatgtttct tggttattgc ttatcaacaa agagatttga tcattataaa 780
gtagattaat aactcttaaa cacacaaagt ttctttattt tttagttaca tccctaattc 840
tagaccagaa catggatttg atctatttct tggttatgta ttcttgatca ggaaaaggga 900
tttgatcatc aagattagcc ttctctctct ctctctagat atctttcttg aatttagaaa 960
tctttattta attatttggt gatgtcatatataggatcaa 1000
<210>13
<211>1000
<212>DNA
<213>鼠耳芥(Arabidopsis thaliana)
<220>
<221>misc_feature
<222>(0)...(0)
<223>YP0137
<400>13
tggcacatgc tgaaaccccg agcatctctc cggaagacac gcgtcgttcg ctccaaagaa 60
aacagtcaca gctgccggag aatctccgcc gtcttcttct gccaccggaa aaactctctc 120
caccactttc agtgcccacc tcgtgttata tccactgtat cctcgtagca ccatatcagc 180
ctaataaaat tttatgtatc aaattttaag acatagccga aactacacta tactagacaa 240
taataatatg atttgtttcc tgaaaaatta tggtttcatg agaaacatta atcatctata 300
aaacaaatta gctatggcat cgaagagtta tcaatcaaaa cttatgaatc tttacttaat 360
atatacaaca tatctttacc ttgcggcgga gaagatcggc gagagaagca ccccagccac 420
cgtcactaaa ggattcttca gtgatggaat caccaaagag aaaaatcttc cgtctcatca 480
tcttccacac aatcttcttg agaaaatctg agagataaga taggtgtagt ggttttgctg 540
aagtgatcgt gtttgattta gtaaagaaat gctttattta ttgttgggg gaaacataaat 600
aaataaagta aaagtggatg cactaaatgc tttcacccac taatcaccga cctttcatgg 660
tttattgtga aatacactca tagatagaca tacaatacct tatgtacgta aataacattt 720
tatttgtcga cacttatgta agtaacgcat agattatttt ctatgtgatt gccactctca 780
gactctcagt ttcaaccaat aataacaata actacaacaa cattaatcat aaacatatgc 840
tctggtttac aattaaagct taaattaaga aactgtaaca acgttacaga aaaaaaatgt 900
tatttacgtt ttgtaagatt agtctctaga atcatcaccg ttttttatat attaatgatt 960
ctttcttata tataaaacct ttctcgaaat acccatgaaa 1000
<210>14
<211>985
<212>DNA
<213>鼠耳芥(Arabidopsis thaliana)
<220>
<221>misc_feature
<222>(0)...(0)
<223>YP0212
<400>14
tacactctta atttaattag agtaagagat caacaaaaat atagaatttt ctttatatcg 60
aagtgctacg accttatata tatagaaaaa aaagcatagg tgaatctcta aattgagatt 120
gtgctgtagt aaacatatta agtttttagt ttttttaaga aatgaatctt tttgttgatt 180
aattcaaact agtagtcatt aagattccgg agattccaat ttagaaaagt caaagattca 240
aagaacaagt ccaggtccac atgttgaatc cgattcatca tccactcatc cttcatatct 300
tcctccaccg tctccgccca aaaaatcaat aacaataaaa aatcctaaaa aaacatattt 360
gattttgaaa aaactttatc atatattata ttaattaaat agttatccga tgactcatcc 420
tatggtcagg gccttgctgt ctctgacgtc cttaattatc attattttta aatttgtctc 480
tctcagaaaa ttacgccaca atcttcctct ttcccttttc cgaaaacagc taatatttgt 540
ggacctaaac taaataacgt agcctctaga ttttatataa ttactaatac tatatgctac 600
tacttgttat tatttactcc aatcatatat gataccaatc aagaatcact acataagtag 660
aaaactttgc aatgagtcca ttaattaaaa ttaagaataa acttaaaatt ttatggtatt 720
ttaagattcc ctttggattg taatgacaag aaatcagcaa attagtcgta actcgtaaga 780
ataaacaaga tcaattttta ctttctttac aaagattccg ttgtaatttt agaaattttt 840
ttttgtcact gtttttttat agattaattt atctgcatca atccgattaa gaagtgtaca 900
catgggcatc tatatatatc taacaggtaa aacgtgtatg tacatgcata aggttttacg 960
tgcttctata aatatatggg gcagt 985
<210>15
<211>2066
<212>DNA
<213>鼠耳芥(Arabidopsis thaliana)
<220>
<221>misc_feature
<222>(0)...(0)
<223>DME启动子
<400>15
tggtgcaatt agaaacgaac atagtcgtaa aatacgagtt cggtgttata cctttattta 60
cgttaaaaaa atacgagaat tttgtgtcaa atttcaaatt aatttcatga atatatggaa 120
attattagat actctagcga aaatagtgat tatgagcgtt ttacaaaaat acgattttag 180
cattgaactt cctttatgta attcggtcaa atgttggcat gaagaagcaa gtttgcaaca 240
ttaaatttca tttaaaaatc gtgttgacat actttaaaat ctaaatatag gaagaagacc 300
aaaacattaa atttagtaag attctaatga acatttataa gttataactt ataaccaaca 360
aaagttgggt ttagcgttgt tgctttatct gaaaacttgc aaactaaacc attttaatag 420
gactaatgac aattaacaac aaaatacact taagcaacaa cgtcctcgtg aatataattt 480
gggcctcagg cccatattgc taacgccaac tgatatttca ctttattcct tcttcatctc 540
accacactct ctctctatct ctatctctaa cggcatagct gactcagtgt tctccggcat 600
tgactcgcct gagaatcaga aagcttagat cggtgagctt ttagctccat tttctgttta 660
tttacatatt atttcctttt tttctctctc ccttttttat ctggaatttg ttctgctaaa 720
ttttccagct gttacatttt ccgatcacga gaagaatcac tgggttttta tgttaatcaa 780
tacatgttcc tgttttctga tcataaatct cagctattaa cacctgattt tgattctgcg 840
taataaaaac ctctgatttg cttttatctt cactttcccc ataaacattg cttactttat 900
tcgctcttct tttaccgttt ccagctaaaa aattcttcgc tattcaatgt gtttctcgtt 960
ttgttgatga gaaaaatatc tgacaaaaaa tcatttattg cattttatgg tgcagattct 1020
tagttaatgt cgccttctct aaccaagtca gattaaaaag gagtgttcgt ccatgttgct 1080
ttgttttggt gtttggagag agttttcgga gagttaggtg agtgttattt ggggtgaggt 1140
agtgataagg tttgaagggg gagtgattca tcaagtgtgt tatgaattcg agggctgatc 1200
cgggggatag atattttcga gttcctttgg agaatcaaac tcaacaagag ttcatgggtt 1260
cttggattcc atttacaccc aaaaaaccta gatcaagtct gatggtagat gagagagtga 1320
taaaccagga tctaaatggg tttccaggtg gtgaatttgt agacagggga ttctgcaaca 1380
ctggtgtgga tcataatggg gtttttgatc atggtgctca tcagggcgtt accaacttaa 1440
gtatgatgat caatagctta gcgggatcac atgcacaagc ttggagtaat agtgagagag 1500
atcttttggg caggagtgag gtgacttctc ctttagcacc agttatcaga aacaccaccg 1560
gtaatgtaga gccggtcaat ggaaatttta cttcagatgt gggtatggta aatggtcctt 1620
tcacccagag tggcacttct caagctggct ataatgagtt tgaattggat gacttgttga 1680
atcctgatca gatgcccttc tccttcacaa gcttgctgag tggtggggat agcttattca 1740
aggttcgtca atgtgagtga tcaaatctat tttcagtttt tttttttccc tttcttccgt 1800
tcttgcagta cttagagtag aacatgaatt agaatatctt aagaaagtca tggttttgaa 1860
cagatggacc tccagcgtgt aacaagcctc tttacaattt gaattcacca attagaagag 1920
aagcagttgg gtcagtctgt gaaagttcgt ttcaatatgt accgtcaacg cccagtctgt 1980
tcagaacagg tgaaaagact ggattccttg aacagatagt tacaactact ggacatgaaa 2040
tcccagagcc gaaatctgac aaaagt 2066
<210>16
<211>1912
<212>DNA
<213>人工序列
<220>
<223>合成产生的
<221>misc_feature
<222>(0)...(0)
<223>5′-UTR p472e10p3_gDNA
<400>16
gcgtacatgg aagttttatg agattgtttt agcgttacat tattgttctc atgggttttg 60
ttgaaccgtg ctatagaacc agaacgaagc aatagtcacg taggataaac caaatcacct 120
tatctattag gtgtatatgg aagttttatg agattgtttt agcgttacgt tattgttctc 180
atggtttttg ctgaaccgtg ttatagaaga acccggaacg aagcaatagt cacgtaggat 240
aaaccaaatc accttatcta ttaggagtat atggaagttt tatgagattg ttttagcgtt 300
acattattgt tctcatggtt tttgctgaac cgtgttatag aacctagaac gaaacaatag 360
tcacatagga taaacaaaat caccttatct attaggtgta tatggaagtt ttatgagatt 420
gttttagcgc tacgttattg ttctcatggt ttttgttgaa ccatgttata gagcccgaaa 480
cgaaacaata gtcacatagg ataaatccaa atcatcttat ctattaggtg tatatggaag 540
ttttatgaga ctgttttagc tttacgttat tgttctcata gtttctgtag aaaccgtaac 600
ctgaaacaaa gcaaatggtt acataggaca aaccaaatca cacaaacttc actaattggt 660
aagcttggta ggctcgcagg aacgaaaaca caactaattg gtaaaataaa tcgcatttga 720
catatctagc taatccgatt aatcttatac tctcatcatc taatttttag ctgaccacca 780
gcttccaaat tttgaaattt gaagctttga ttataggatt tatttttcat ctaagtttac 840
tttccggtct tcgatttcaa attgataatg atacaaatat aaaaactttt acttttattt 900
gaaagccaaa tgaaaaatac cctgaaacga agaaaaagtc atttaagaca aacttagaga 960
taccccgatg tgtatgatca aaatggggtc tgatacactg ctgatcagtt cccacattga 1020
ttttggtgtg atattccgtt ccataatcgt ctttaaaaaa caaaagaggg aaaaaaacaa 1080
aacactatgc aaccgtgcaa atgaaagcat cgtcaaatga ttaaaaacgt caaaccaatt 1140
caatcaaccc caaactccaa accaactttt tttttctctt ttcttttttt tctttttgtc 1200
gatcttgagc gaagcaatcc tccaaagtcc aaaccaccaa tcgaagcaag aacacaaaaa 1260
caaaaaacag caccagcgaa ttcggtgccg cccatcggtt atggctctcg ccccacacat 1320
cttgcgttcc ttctcgcagc aaacatttcc caaatctcaa aaaaaaaaag aaagaaaaga 1380
aaaaccaaaa gaggaggatg ataccgtgat gacaccatgc aaggcagttc gtcacatgat 1440
ctggttcgct ccaaaaagct gatagtaaaa atcatcccaa aatatctcct cggagaaaaa 1500
ttcttaccac accgtccctc tcctgttcat ccctgttcgt ggccgaatct tttgttttta 1560
ccgaggaatc ttttgattag tggttgtagt gacatcatgg acagaagagg aggttggtaa 1620
ttaggcgggg taaaaaagga ccgaggcgac gcgagagctc gtctcctcca ctcctcgtcc 1680
tcgtcctcct cctcctcttc ctccattttt ttttcttttc tttttatttg attacgccgt 1740
cgctgtcgag tagcgcgtca gctgcatccg cggttataag tagcggccac cacccaccac 1800
ccccggcttc ctctcccact gcgccctccg cgtgagcggc agcaagtgtt cactgcgttc 1860
ttcttctcga tttatctttc ttggtttctt gatctgtagc ttattagcgg cc 1912
<210>17
<211>1946
<212>DNA
<213>人工序列
<220>
<223>合成产生的
<221>misc_feature
<222>(0)...(0)
<223>5′-UTR p523d11p3_gDNA
<400>17
ccctgattct tctgatggaa ctaggggagg ctgtgtggcc atttttcccg ttggagggtt 60
tcgtctagat ctgtcgggtg tgggacatgc ggattgcagg tgctgtcggt tgtgttggcg 120
gcggcgggtc ctgccgggat agttggccgc cgacggccgc ttggctgttg ggttgcacgg 180
tgtgtgctgg ctggtagcga ggatggtttt agggtgttgg gcgaaagctc tgtccgactc 240
atagccggcc tgacggcgat gaacgtcctt ggacatcatg caatgcccct cctggaggcg 300
tcgtcgcaag agcatctcca gtagagaccc taaatacaat tcctaaacag tttttaggtg 360
ctaaggacaa aaaataaact ccagcaaaac ccatactaca ggtcctaaaa taggaaggac 420
ctcaaatacc cctccgcagt ccctaggcct gggggctgta gaccgaggcc ctatcgccgt 480
ttttctacgc gggaggaaat ttcctgacgt gtggtgtctg tcttccctcc cgcggaatcg 540
ctgccacggc gccgatcttc gccagctcgc tggttccgcc gctcgtggcc gacggtgcga 600
ccatccagta cctccaccgg ccactgcttg tcgtccgcgt gcccgcttgc ttgttttttc 660
gtggtccttg atcagttcgc acactgatgc actatatggt agacaagaat gttctgaaat 720
tcatgaccat cagaaacatg ttctaaacaa tcctgctctc gattggttta tggctaactg 780
tggttctaaa cgatcatggc ataaaaatta ttgttctgtt cctttaaagt ttgtggtgct 840
tggtaggttg agacaattag gctgcttgca attatgcagt agttccttca aagattattc 900
tgcagtgttg ttcttttgtg tcagttgtga gttgaagttt aacttcaagg tttttttttt 960
ctaggaggat ttaagctctt tctgaagttt ctcagataga ttagattgga aaaggtatag 1020
agttaatttt atctattgat tatagttctt atttaattga actacgtagt gtcttgaata 1080
cttgccggta ggatttcact cccatgtttg agaattttga atttgaatta tggtatttaa 1140
aattatggat ttgaatacaa ttgaattcta tacattagaa atattcgtat ttgaattatt 1200
actatgttaa actaggtgta agcatagagt ataatcagaa atacaagaga aaaagaaatg 1260
ggggctaaga aatagggtct gctggtagag ttggaggtaa tttttgaatt cttagaaaat 1320
agggacagcc ctcattcaac ctttgaggac tctaaaatag ggactactgc tggagatgct 1380
ctaacaccct gttccccctt gctgctgggt gaaaaccctt tccagtctcc tgtttatgcg 1440
atggtggcgt cctttccgac gtcgtcacct tcttcaaggc atcgtttttg gagaaaccct 1500
gcaaccagtc cccctgcttt cccatccttc tcccctattc catcccctcc tcctcccctt 1560
ttcttctgtc aagggctcct atgcttggaa actctcatgt atctcttctc tgtaatatat 1620
tcaggtgggg aaatgttgga tttttattga ttggaatact gtattgggtc atctcggtga 1680
caccaaagct gtactttggt ggagtagcaa tctttgccct tattgaccgg ataggatttt 1740
ggttaaattt atctacgttt ttgtttgcgg ttcatctttt ttcctaccag tcttatacaa 1800
gatggtacag tttagcaact gattgttaca ttgcaatata taaatcgaag tgatagaagc 1860
cacctcaagt aaatctaact attgttcata attcaaaggt caagaccaat ttctcagttc 1920
ctgcgactgc gcgaaaaaac aaaacc 1946
<210>18
<211>1951
<212>DHA
<213>人工序列
<220>
<223>合成产生的
<221>misc_feature
<222>(0)...(0)
<223>5′-UTR p524d05p3_gDNA
<400>18
cgagatccac cgatggttta cgcgtacgcc gacggctcac acatcccccg gtgcccaaca 60
gaaaccacac accacccgca cgaaaaaaac cgaaccgcac gtgcgcgcgc gctccacgca 120
caccccaaac agacggcacg gcgggagcgc gcgcgcgcac gcgagccgag gagaaaacaa 180
acgggggaaa caagctggaa aagcaaaagg ggaaaagaac ggagcggagg cttcacccac 240
ggccaccgcg acgcgccacc agcgtgcggt gcaatgcaac gtacgccaag ccgaaacggc 300
aggcagcatc gcgcacgcac gcacacacag gccacagcac acgcgagcga cgtacgcgag 360
tgcatgcaga tgcatgcgcg gggctcgcgc gagaccggcc gatgggttcg cttctcttct 420
ctctcccgtc ccgttgcgtc gtcatagaca aaagtcggtt ttgcttttgg ttttttggct 480
ctgaggcact gacgtgcggg ccagcgtacg cctgcgtgcc ccgcatgtca tcgtcgacac 540
cggccgggga ccgggtaaaa tgtgttgcgg gagggagagg gggagagaga gatcgcgcgg 600
gcttcacgca acggcgctac aaatagccac ccacaccacc accccctctc tcaccattcc 660
ttcagttctt tgtctatctc aagacacaaa taactgcagt ctctctgtct ctctctctct 720
ctctctctct ctctctgctt cacttctctg cttgtgttgt tctgttgttc atcaggaaga 780
acatctgcaa gttatacata tatgtttata attctttgtt tcccctctta ttcagatcga 840
tcacatgcat ctttcattgc tcgtttttcc ttacaagtag tctcatacat gctaatttct 900
gtaaggtgtt gggctggaaa ttaattaatt aattaattga cttgccaaga tccatatata 960
tgtcctgata ttaaatcttc gttcgttatg tttggttagg ctgatcaatg ttattctaga 1020
gtctagaga aacacacccag gggttttcca actagctcca caagatggtg ggctagctga 1080
cctagatttg aagtctcact ccttataatt attttatatt agatcatttt ctaatattcg 1140
tgtctttttt tattctagag tctagatctt gtgttcaact ctcgttaaat catgtctctc 1200
gccactggag aaacagatca ggagggttta ttttgggtat aggtcaaagc taagattgaa 1260
attcacaaat agtaaaatca gaatccaacc aattttagta gccgagttgg tcaaaggaaa 1320
atgtatatag ctagatttat tgttttggca aaaaaaaatc tgaatatgca aaatacttgt 1380
atatctttgt attaagaaga tgaaaataag tagcagaaaa ttaaaaaatg gattatattt 1440
cctgggctaa aagaattgtt gatttggcac aattaaattc agtgtcaagg ttttgtgcaa 1500
gaattcagtg tgaaggaata gattctcttc aaaacaattt aatcattcat ctgatctgct 1560
caaagctctg tgcatctccg ggtgcaacgg ccaggatatt tattgtgcag taaaaaaatg 1620
tcatatcccc tagccaccca agaaactgct ccttaagtcc ttataagcac atatggcatt 1680
gtaatatata tgtttgagtt ttagcgacaa tttttttaaa aacttttggt cctttttatg 1740
aacgttttaa gtttcactgt cttttttttt cgaattttaa atgtagcttc aaattctaat 1800
ccccaatcca aattgtaata aacttcaatt ctcctaatta acatcttaat tcatttattt 1860
gaaaaccagt tcaaattctt ttaggctcac caaaccttaa acaattcaat tcagtgcaga 1920
gatcttccac agcaacagct agacaaccac c 1951
<210>19
<211>1836
<212>DNA
<213>人工序列
<220>
<223>合成产生的
<221>misc_feature
<222>(0)...(0)
<223>5′-UTR p530c10p3_gDNA
<400>19
gcctctcgac cacgagttta gcacttgtgc aacatatatg cgtgcgatga acatctactg 60
atgcgccatg cgaattttag cgttcgttca tgacgcttcc aacggcacag aggctgagca 120
gcagcatgca tgcatggctc ttgtgaaaac aaaaaaggtt actggtaaat gacatgctgc 180
tgtagctagc tagcagaatg caaggcccat gcatatgcaa tgctatgcga caagtacagt 240
accagcatgt atggtagcca gctaactaat ctatcagcag aggcagcaag ctcgtgcatg 300
gtgtgatgca cttctctcca gtaatctagt ggtaattttc acccaaagcg ttgctcatat 360
ggacagtaat tagtaatatt accaaggttc acaatcccgt tacctgacca aatactactc 420
acgaatggta tctctggttt tcgttaaaac cgttggtaaa ccagcaaaaa tagacaaaat 480
ttgtcaaaat tttaaatttt agtttttttt ttttaactta gccgggaaac cttgaagttt 540
gtgctgtcga gctgtcctgg gaaggacggt tttggttggg attgtgaacc ctggttactg 600
cacttcattt ttgaacagat attagtgcaa cagacaaatg ccaacgcatt tttttctgtt 660
taccggcaag ctgaagcttt tacgatcccc atacagccgt tgctgcaaac ctgccaagaa 720
agagcagcag aaacaggtgt cattttgtgg tggaaagcca agtaaagtaa acagaagatg 780
gaagatagtg aggaccaggg agtgaggcag gggacacatg gcccacgcct ccctgcacat 840
tttcgtgtat aaatacaggt ggatgcatcg ctctcccagc atccatcggt tctctgctct 900
gttcatccat agagtttcct cctcttctcc tttagtgcaa ggtagagaag agcatgtgtg 960
tgtgtgtgtg tgtgtgaact gtgaagtgca gagtgcttct gtagttctgt gttatgtcca 1020
tagtgatctt gttaggattg ttgctatgga tgcatgatgt tatggttgat ctctgaatta 1080
cagtagggac ttttctgaga tctctggatt agtggggggt gctaaatttt tttctggttg 1140
catcagcttg ggtttctggt attggtgtgg gttcttgctc tgaattttgg ttcagaatgt 1200
cgatttgttt gtgtttgttc tctgaagttg agagtagcta tgatccatcc agcacagaac 1260
tgcaggtcct gcctgccggc tgcatataca ggacatgcca ttttgcaagc tctgggctta 1320
tggtttctct tttggagttc ttcttcttgc atgatctgtg ttctctaaca aaggaagcaa 1380
gatttagcaa ctttattcag agacaagaaa aggatctggc aaccttttgt ttctgtttta 1440
tcctactcgt aaagattgtt atttaagcaa aaatttccca aaagttttaa atataatttc 1500
catgatgtgc cactctcatg tccttgaacc tggcactcat tatgggctcc tcagaagtgc 1560
tgtagctaat gtcactaatc ttttgtatct ttgttcatag tcttgtattt tatgatgctt 1620
atccctttgt gctttccatg tttgatgtcc aaatgtcatg gcaatgtttt tgacttctag 1680
taggggtttt agtacctttt tgttagataa gtacatccaa attctgttta tttattcaaa 1740
aatcattctg tttattcact gaaaacattt gtccattcaa tggactcata aactgtctgt 1800
gtttttcagg cttgaggatc catctagaag atagca 1836
<210>20
<211>1895
<212>DNA
<213>人工序列
<220>
<223>合成产生的
<221>misc_feature
<222>(0)...(0)
<223>5′-UTR y678g10p3_gDNA
<400>20
acaagcctat ttcaccctta caacaattcg gaagaatata gatgggtttt aaacatttga 60
taatatttgc tccccactca gatttggtta ctcgaaattg tacaagacct gacattcgtc 120
atctggacac tctagtagat aaacgtgctg gctgatgcta gataaacaga tgtaaagatg 180
accacttcac catcaaccgt aaaaccggac gaagatcacc aaaaattgat actttggagc 240
aacgataggc agcttcgatt cagatagcac aatacttaaa agaccacata ctagcatcga 300
attgatacat ctcccctcca aatgaggctc caaaaactat ccattgtttg atacagcaag 360
caataggatg tgagaaactg agattggcat ttgtatttca ctactctcat ctgatgagac 420
atgactaggc tgtaactgaa gctgaatcta aagaggaaga ttagtgtggg attgcagaca 480
aaactgctac tacttccttc ctgcactgca agaagaagaa atctgtatcc agtctgtgtt 540
gaaccccatt aaagcacaca cagcagcttc gattcagaca gcaaaagaaa cattctgata 600
gatagcatca aattgatact agtatttcgt ttgtgtcaaa aaaactctcg atatgtcgta 660
atcaaagctc gaaaatccca tttgtttgat acagcagcaa cagcaagaaa ggaaccccta 720
ctccgatcca gccactgaaa cagtactaat gaatccggat tcgcgcattc atcctatctg 780
atgtgatgaa aagaagctag agtataagaa tctaatctgg gagaaggttg aggtcagtcg 840
tcgaaggcgg atgaggggtc ggcgaggtgg gcgaagcggg cggcggaggc ggaggagagg 900
aggaggaact tgcggacgca ggacctgacg cagtcctcct ccttcttgcc gagggttcgc 960
cggtagaagg tggtgacgca gtcggagaag caccggtgcg acacccagtt gtacagccgt 1020
atcctgcgtg cacaaaaatc catccatcgc tactccactc tctctgcgag gaggaaggga 1080
aggaaagtaa gagattaaac gtacgcgtct cgggtctgga gcttgtcggc gacggcctcc 1140
atgcgcgcct tgtcctcctc ctcctccccg ccgccggcgg ccatggcggc ggcggcgtcc 1200
atgctcttct tcagtagcag cacaagaaga agaagaagaa ggagaaggag aaggagaagc 1260
gtagcccaag ccctaaggcc ctttagtata gttgaagtgg tgagatgggc cgtggtgggc 1320
cttcggtaat tgagcccatg ggctcaaccc cgaaaatgcc agtgggctag gtgaggtaaa 1380
ccgtgcacgt gacgctttca gtttcttttc ttttctttcc ttattatatc atcaaaaaaa 1440
gaaaagaaaa agagaaaaaa aggtatggaa gatactgtat agtatacgct agcagcataa 1500
gctccgtccg tataattatt tcttgtacgc atatgatgta cagtatgtat tttacgagct 1560
gtatactacc attgcgttgg atttatgctg gagctatttg cctatgtagt ggagtattct 1620
agaaggatgc ttgtgcgccg tccattgcct gcagaaacgg acggcgcggg tgggtgggcc 1680
ccacagggcg gtgactgacg cgtgggccac cacattggga tttggctttg ctttgctttc 1740
gtgccttgtc agccgctgcc cccggcccct tcttctcctt cttcttcttc ttcttcttct 1800
tctccctcac catcaccaac aagagagagg aggagtggat tcatcgatcg agaagtcgag 1860
gtagtacata cgttggattg gattggaggaggaga 1895
<210>21
<211>1773
<212>DNA
<213>人工序列
<220>
<223>合成产生的
<221>misc_feature
<222>(0)...(0)
<223>5′-UTR p756a09p3_gDNA
<400>21
tgtgcctgtg ccgctttaca gccaagccca ttagcaaggc tcaaagatgg gctagttttt 60
ctcggcccaa gccgtctgtt gaacagcgtg gagaaggcca cacggcccac gtgcatacgc 120
aggccgcgca ctggatttca agatgggctg cgcgaggtgg acggcccaga ttgctacggc 180
cttctacggc gtcacgtttt ttcgtggtgc ggctggtgcc cgtgcttcgc gtacacgaca 240
gtgtacacgc tgcactgcac tccaaagaaa tccgccgaaa gtgcagttat acgtagcgac 300
aatctgcaat acgtaccaac agccgaaagc atatatggac aagcgcccac gcaagccatc 360
agcacaaccc acacgaagag caggtttttt ttttcgaatc aagccatacg gtagtgcgac 420
gtttctattg atatagcagg aaaaaaaata caaatctata gcattgagag actagttagg 480
agaagaaaaa gacggccaca ccacatgcct acatctgatc ctgctactga aaacaaaaca 540
agcacacgac acctagaagg aatggttcac actaagagaa gttttaacaa aggagagagg 600
tggttgttgg aatcaacatg taattccaat agaaaaaaga acttgattag ttgtagtaat 660
ccgtaagtaa acagaatcat atagataatg gtacaagcct gacccagttg ttgatatttt 720
ttttaatctc cctgtcttgc acgtgcggta tagatgctaa tgtgatgtgg cagcaccgac 780
gtcacacctg tgacatctgg ccatatgtct acagctaatg ctgtgttttg ttcaattttt 840
attaaaggca aataaatatc tatatctacg gttgtgccta taccaattga agttatgtca 900
tatgaggcgt tttcgtgcta tctactgatg aaatttacct ctcgtacatc agaaccgtgc 960
aatatcatta cttatgtcag tgtaacggga taaattggta gagtttttga gagtggaagc 1020
ttcctgtttt ttcaaaattt ggtaagatag caataacaat aatgagtttg gtttgttgtc 1080
ctattaaaat ttggtaatgc caaaatttag tagggttaaa aataacaaca aagtaaatat 1140
tccttagttt aaattgtttt agttgaaggt taaacattac caaaaattgg taggttaaaa 120O
atgttaataa aaaaagcaaa gcccttagtt taaattgttt cagttgaatg ttcaacattg 1260
ctcacaaaat gttctcttaa atagtacttt attattacaa agagcatctg aatctgtatt 1320
aaaaaagtac aaaaaaaaac attctgaatc tagaaaggga aaatatctag aagcgactgc 1380
acgcggcccc cacgaaaagc ccatgcacgt gggccccatc ccgaaaaaag agcaacagcc 1440
tcaccgccta cctgcatgtg caagtggacg gtgcgcggct gcgcgccgca acgcgacgcc 1500
cccccccccc cccaccccac cacccaccgg ccccacacgt cagctataca gtgggaccca 1560
cccctccggc cccacatgtc agcaagacag tgatacctcc tcccccgcct cctcgcgcgg 1620
cgcgcaacgc acacgcttcc ccttcatctc agtcgcgcgg actcctcagt cctcacactc 1680
cccacgaact cgaatcccca actataaata atccaccgga aaattcacaa ttcgatcgcc 1740
tctctcgatc ggagatttcg caatttctcc gcc 1773
<210>22
<211>981
<212>DNA
<213>鼠耳芥(Arabidopsis thaliana)
<220>
<221>misc_feature
<222>(0)...(0)
<223>5′-UTR YP0285
<400>22
gggattatat atgatagacg attgtatttg cgggacattg agatgtttcc gaaaatagtc 60
atcaaatatc aaaccagaat ttgatgtgaa aacactaatt aaaacatata attgacaact 120
agactatatc atttgttaag ttgagcgttg aaagaaaatg aaagagtgta gactgtagta 180
cgtatgagtt tcccaaaaga tggtgcttga atattattgg gaagagactt tggttggttc 240
ggttgaatga agatttttac ctgccatgtt gatagagaaa ggcaaataaa tgtaggggtc 300
gatgtctaac gtaaagactg gatcaaccaa gagtcctcct cctcgtcttc accaaaaaaa 36O
aagagtcctc ctcgtggaaa cttatttctt ctccagccaa gatctcatct catctcttca 420
ctctatgaaa tataaaggaa tcttatggtt tttctaaaaa ctatagtacg tctatatacc 480
aaaggaaaca atataaaatc agttaatctg ataaattttg agtaaataat aaagttaact 540
ttgtacttac ctatatcaaa ctaattcaca aaataaagta ataataacaa agaattttta 600
gtagatccac aatatacaca cacactatga gaaatcataa tagagaattt taatgatttt 660
gtctaactca tagcaacaag tcgctttggc cgagtggtta aggcgtgtgc ctgctaagta 720
catgggctct gcccgcgaga gttcgaatct ctcaggcgac gtttcttttg ttttcggcca 780
taaaggaaaa agcccaatta acacgtctcg cttataagcc cataaagcaa acaatgggct 840
gtctctgtct cactcacaca cgcgttttcc tactttttga ctatttttat aaccggcggg 900
tctgacttaa ttagggtttt ctttaataat cagacactct ctcactcgtt tcgtcaacat 960
tgaacacaga caaaaccgcg t 981
<210>23
<211>1894
<212>DNA
<213>人工序列
<220>
<223>合成产生的
<221>misc_feature
<222>(0)...(0)
<223>5′-UTR y790g04p3_gDNA
<400>23
tccgcttgct tggagaattt tgcgcgttca caccggcaga actattattt ttagcttaat 60
caaaccggcc atgtgatccc tgattatttt ctgttttttt aactcaccaa atttatttca 120
aattagaaac atattacatg gttaacctta catttgaatg aactaaagca atcttcaaat 180
ctttcgcaaa gcatctttta ctaggatagg ctaggtgaga tatgttgtga caaacgtgag 240
ctggatcgat gctatagttt gtacacacct ttctcatata aagagtgata aaactccaag 300
gaaaaacaga ttagcacttt tttggggcca tcctaatgca agcaagcaag gcttatatgg 360
cctgtgcttt tttgctttaa taagcctttt agtccccttc cctagtctca tgaagttcat 420
ggcaccaaac acctcaacaa gtggcaaatg atgaaatgat gtaaatgcac aactacttta 480
ttttgggctg gacgtgttgg ttctcaactg aacctgcacc gctatcagac agtgtacata 540
acgcaatcgc tgagcaaagg aaacagaaag gctactgccc agcgccattt tatttggcca 600
tttctgctgc aaaagctctc tttatttgtt tctgaatatt tgaatgccaa tttggcgaca 660
ccaatttcta gagagtttcc gtggtggcaa gacaacctgg tacttattgt atagtgcttt 720
ccttttcgag ttgattttcc atttgcattt gcaaagattt atataacaaa tttgagtata 780
aagaatacat cagtgatgaa gtggcgtgac tggctcaaat cgagctaaga gagatcactc 840
gagcaataat gaacagtgaa tcagaataat ggatacgtta ctgtccagta cattgctact 900
gatccttgat gcgtgtgttt tgtggtgata agtttgagcc gtaaaagcag tggtcgaagc 960
taaacaaaac aacaccatca aaccaatttt ggagttttat ctgggatatt atgcgtggta 1020
gtggtattct tggatgcctt tggtgacata atttgttgtt gaccccaact ttttttaagg 1080
acaaaaatgt ttgtgtcaac actagtgtta ctatgtgccc atgtcatatg tacactgctt 1140
aagcggtgag caccagaaac atacaaccga tgaagcgtac gttgctcaca cgagcaaaag 1200
taactttggt gtaaagatat ttggctcttc tctagtttgt tggagcacat tacgttgcat 1260
tttcgaccta ttataagtca cactaaccat tttacatttt catgatctgc tcaatttcgt 1320
gcacacctcc tgtacatgtt aatttctctc tagtgctaat taacgatggg ctctgcacaa 1380
actcccctgg ttttgataca gacaagtcca attttattcc cgcttaaaac taacaaagct 1440
tgcattttat ctataacacg tctaatttct tgtgggcact gcacatattc ccctggtttt 1500
gatacaggcg tatccaaaat tcactcacac ttaaaagctc aaaaaagctc ccattttaat 1560
caccacacgt ctaacaaatt tcttgttcac atccacagaa gaagctatcc atgctgtact 1620
ttacattgca gtattagact ttttatacta cttttacatt acattattag accttttttt 1680
aacacaaaaa tccacctacc caaccaattt tttgccgggc tggtcctcct ccccccgcat 1740
gagccgcccg tgcgatgacg tctcccggtg ggtcacaccg tcacacaccg tgctataaat 1800
aggggggctt ggcctctccg ccatgagcac cacacttcac cagcttcgct ttgcacaaag 1860
cctcagtgcc tcactgcact tgcaccggtc acta 1894
<210>24
<211>1854
<212>DNA
<213>人工序列
<220>
<223>合成产生的
<221>misc_feature
<222>(0)...(0)
<223>5′-UTR p780a10p3_gDNA
<400>24
gggttacgaa ccgggactac aaagggtttc tccatcagtg cacactctaa agaaaatcta 60
gcaccaaccc aggttagccg ctatacatga ccggacgtca ccaaccctat ggaaggatat 120
gatgctgtta ggtacatgga ttagttgctg tctagattac gtgcaggtaa ttaacacatc 180
caggagaaaa cactggacag tgcgtacgta cttaattagt gatcaaccaa aaatatgcat 240
gatattgcaa tccagctaat tacgttaggt gcacataagc cagatgtagt ataagctaag 300
ccagccgttt ccatacgaca tatgcataag gatgcaatta tcctgatgca cgcttgattt 360
aatttgatgg gatgcgtaca tattttgatt ccttgtccta aagtatgcaa aaatccctgt 420
ccatcaggtg tgttgtctac acacggctat gtctcattgt gttatatatg ttgacttgaa 480
ctttttcgca aaatggattt cattaattgg ttccttttca aagtgacttt agtatattat 540
aggaaacggt gaagatgacc tctataccac ctaatttaat cgaccttgtg ttgttaggtg 600
gcacatcaaa tatcattatc tatatctcta cctatacctt atataagtaa cccaggggaa 660
aaaaatcgaa cccatgaatt gtgagatcac aattcagaga ttaaaacaag gtatgccaaa 720
tatgagtata tagtatacca tataaaataa ctcaaattcg aattaagaat aaacatgaaa 780
aatagcaatt ggctttgaag attaattacg tactctgctg aaaaaaaaac caaaagaatc 840
tggaaagaac ataagtgtga aatttcagta tcttctcaac agtacagaag aattatttat 900
attaaaaatt gcatcatttt tttggaaaag ggatatatat atatacacac acacacaaac 960
acacacacac acacacacac acacattcag acagaacata accatatagc catgcacccg 1020
accgatgcta acggctcaca ctcgccaaag tatggctagc taaattttga tcccatgaat 1080
tttctatact ctagcaggcc tatcttcagc caacatcttt ttaatttctt ccctaaccag 1140
aaattggtca tctaaggagt caatttttat tttctctaag ttcaaacaaa cttatttttt 1200
ttggggcgaa tgtacatcta acaggaccca caggtagacg tgattttttc taaaaaaaga 1260
tgttataaaa ttgcaccttg tatcaaaata ctttgacata tatacattcc aaagggagaa 1320
tatgttgcta gacacttgta ataattgatt ggttcagaaa ttaatcacta attgtccgta 1380
aagggtttaa ttaatcgtta gtggttacag ttggatgata tatgccaaaa tgaacggtga 1440
atttcgaatc tttcttgcat ctggtggcta ttaattactt taggagtaaa tttaaaaaac 1500
tatatgtatg ttaatatcaa actatcacaa actacttatt tgagacattg tattataaac 1560
tatagatttc gcaccaaaaa tatcacaaaa ctacatattt aaagcccaaa ctcaaaaaac 1620
tatggttttg ttatataaac gttatatgta aatatgtcaa ccaaacgtcg tcacatggag 1680
aaaccagata aaacagactg acagtctgga gaaccattaa aatcttacaa gatcacacac 1740
tgcaaactgc atgctctctc tccctctcaa cgcctatata agcacatcca tcccccctat 1800
gatcaaagca tcacagaaac cataaacaca caggcatctg attagagaaa tcta 1854
<210>25
<211>1000
<212>DNA
<213>
<220>鼠耳芥(Arabidopsis thaliana)
<221>misc_feature
<222>(0)...(0)
<223>5′-UTR YP0102a
<400>25
atttggttga taacgttttc actcgactaa ttatatactt cagaaggata gtaatagaat 60
accaaaataa ttaaatgatt ggttagtgcc ttagtggaga ctttttaacc gattctaata 120
gactaatgat gtagctaagc atttatttgg gatcatcact gtttgaaaac gtgaaatgtg 180
ataaaagtta tgaaacgatt aaaatataaa ataaccgtac aaaacattat gtaccgtttt 240
tttctctgtt cttttggcga tttggtttag ttcgttacac tctaaatgtt attgcagata 300
tatatataat gatgcatttg catctgagga acatataatt ccggttaaca cttccaaatc 360
ttatatccgt ctaggtaggg attttataaa tcatttgtgt catcatgcgt tatgcttgtc 420
ggctttgacc ataacgcaga gatatagaac tagcttttac ttaactttta gatttattat 480
ttgatctaga gttaagtgga gatatatagt gtttttgtta gattattggt ggatgtgaga 540
gtttgtcttt agtttcaagt tgagaatata aggcaagagg agactctgag gcaatcagag 600
gttttgattg gcaaaatatc caaaaggccc aaaccaagtc gaagcccatc tcgtacaaaa 660
aaagaaagag atctgtaaga aaaaatattc tttgatattc ttacaaaaat aagtgtaaaa 720
cttttattag tcaaaatctt caatctttaa aaactctcat cactcctacg aaagcgcgtg 780
agagttatga gacattcctt aatagcatta ctcacaagtc acaagttcaa aacgtctgac 840
tgaaacagaa acaagccttt gttgaagtct tgaagaagag acattagtac tcgtcgtata 900
gccataaaag gtaatatacg aaatttcttc gctaatctct tcaccttcct ctacgcgttt 960
cactttcact ttataaatcc aaatctccct tcgaaaacat 1000
<210>26
<211>1971
<212>DNA
<213>人工序列
<220>
<223>合成产生的
<221>misc_feature
<222>(0)...(0)
<223>5′-UTR y730e07p3_gDNA
<400>26
tcagggaggt atgtggttat cttgccttca agttttacat tttgtttcca tgatattcac 60
atgctgtatt gcaggttatt gctctttgtg atcatccatg cttgttggaa aaggaggaaa 120
ccaaatcatt gttcaggtga atatcggcac ctttatttca tcagcatcaa acagatatgc 180
agagaactta aatggagata tctagtgcaa acactcacat tcctttagtt tgcttaccat 240
atacttcatc cttttgtttc tctctactga ttgagttttg actagaaata ttacatgtta 300
gttgagcata ggagtttcaa aaaccaaaat cttattgaga aattttcaag gtggtttatc 360
cctagttaaa agggctagga ctaaatcgat taactatgca actggcatat caccctaact 420
taatttctaa aagagttctg ctcatgaact tccataaata gttgactatc atactgaaat 480
ttgaaattct agtgagtatc tgatgcccca tctttgctgc agtgctgatg ccatttctcg 540
agcaacaaca attcttgcct cgattcctgg aagagcaact ggagcataca gccacagcca 600
ggtgagcagc tcaggctgat acatttactc actacaaaga aaaaaaaaga atcttaattt 660
caccgtactc atttttccta gggcatcaaa gggctgcgtg atgcaattgc tgctggaatt 720
gcatcacgtg acggataccc tgcaaatgca gacgacattt tccttactga cggagcaagc 780
cctggagtag gaactttacc ttctttttaa atcttactgg acattttttg aataaacagg 840
aagcagttcg aatctcatta tgatgctatt ctccccctct gttttaggtt cacatgatga 900
tgcagttact gataaggaac gagaaagatg gcattctctg cccaattcct caatatcctt 960
tgtactcagc ctccattgct cttcatggtg gagctcttgt atgttttgaa ttctcagcac 1020
attttcaata tggctgcatt catgctgcac caaagcctaa ttgagagcat tttgttttag 1080
gtcccgtatt atcttaatga atcaacaggc tggggtttgg agatctctga ccttaagaag 1140
caactcgaag attctcggtt gaaaggcatt gatgttaggg ctttggtagt tatcaatcca 1200
ggaaatccaa ctgggcaggt ttgcattcat tgctttcttg tctaatttgg agagcatctt 1260
ggattgttgc aatttctgtt cacaccatat tctgcatgta tctacctaag gcatatatat 1320
ttgcaattct tgtatctttt tatgtgattt tccattgtta gggaacatat gtatttttgt 1380
ttgtctgcaa tgtgcatgaa gcatttgcag ctggtgcagg tacccaacaa aagaactgta 1440
atcatgtttt aattcatttg caggttcttg ctgaggaaaa ccaacgggac atagtgaagt 1500
tctgcaaaaa tgagggactt gttcttctgg ctgatgaggt aagcgattgt tacttgagca 1560
actccacaac aaactttcag ctgcttaatt ccttttcgct gtgctgtctg taacatcaac 1620
actattcata ttgataggtg taccaagaga acatctatgt tgacaacaag aaatttaact 1680
ctttcaagaa gatagcgaga tccatgggat acaacgagga tgatctccct ttagtatcat 1740
ttcaatctgt ttctaagggt aaatacgatg atctgttttc ttattttcta ttggcactgg 1800
attctcaaaa ggattttctt gctgacaaca ggatattatg gtgaatgtgg caaaagagga 1860
ggctacatgg agattactgg cttcagtgct ccagttagag agcagatcta caaagtggcg 1920
tcagtgaact tatgttccaa tatcactggc cagatccttg ccagcctcgt c 1971
<210>27
<211>1993
<212>DNA
<213>人工序列
<220>
<223>合成产生的
<221>misc_feature
<222>(0)...(0)
<223>5′-UTR y760g09p3_gDNA
<400>27
gcttggaaca gcagagattt ggcataagaa caaatttgta aatgtaattt gtatgatatt 60
gtagctagac tgtttggagc aaatcaattc cgtggcgcta caaaagaatc tctttttgaa 120
aaaactaaaa ttacaacaaa aacggcacgc tttgcaaacc atggtgtaac gtttgcccac 180
aacaacctgt ataagaaaac aagctttaca gcttcgtaca actctggtta gcaaactaat 240
tttgtcacgc taaggaatca gtttctcata gcaccgacca gtttcaccta taaattagag 300
gatactgcac agcccttgat cacaatacag tgcatttcta caatcttttg ttgcccattc 360
atctgggttt tcttctgctt cttttttttt cctagagagt acggttttct ttgtaattct 420
ttaatttgtt gcaaccatga atgtattggc atctaagatc ttcccttccc gctccaatgt 480
tgccagcgag caacaacaat cgaagcgcga gaaagcaact attgatgacg ctaagaactc 540
gtccaagaac aaaaatcttg accgcagtgt cgatgaggta accgatcttc cccacaaaac 600
atattcataa ataccattac ttgatttttt ttatggaatt ccttattcat gtagaacata 660
ttttctattt gatgaattct ccatgcatga tgtttcaatc ttcttttttt tattgtgtgg 720
agtatataaa agtaattaga atttgtagca cctggacata tgcagcaaat tattcatcta 780
ctactatagt tcggatttat ttttatcgat gcaaattgga tttggataga aatgtacatt 840
cttttatttt agtcagaata aaagtttctt ctatctagaa tatactataa taacatatct 900
atctaaaaca aatatggtac aacacacttg caactagcag caagttccct gaaagatgtt 960
tgtctaatgc tatggtgatc tctttcacta cagtttggtg tatgtgtgtc catagtagaa 1020
tatgagtcct gcaaaagcaa acatcatcat gccaacaaaa atggcccatg tgccatcaat 1080
aattcaaggt gcccgttgat gagtaacaga acatttgatt gtgtcaccct accacaaaca 1140
cacatggaag gccattgcat tccctataag gacatcatgg tcattccaaa atgtactgac 1200
acctgctcaa tgcagacaaa aaccccttca aaaaacagaa gaatctccct cttaaaaaaa 1260
ctgattaaat gattatttct gaaataaaaa tgttgagttt ttatttttaa atagtttata 1320
tcattctatt cttttagaaa cgtagtacaa acatagatac ttacagcgtg cgcatactca 1380
tctatataaa tgcacacctc tgaaaaacta aagagaagtg gaaaaaatgg caagatttac 1440
taataattag attatagttt ttcacatcta ataggaaaat tatagattaa ataatttttt 1500
gaaagaaaaa aatatttgaa aacttattta ttttcaagta tttgaaatta tttaaataaa 1560
gagtaaattt tagaaaacta caactacagt gaaaaaacta tcagtttgct ataactttta 1620
cgtgatatgt tgctacagtt gtcacctaca tgtcctgtag cagtatatca catcaaagtt 1680
gtagttttgt gataattttt catgctattg gtgcaaaaaa ctgaaataga tcattaatat 1740
tacagcaaac tgatagttct atcactgtag ttatagtttt ctgaaattta agatctaaaa 1800
gaagaaaaaa aggggggggg ggggggtgag atttacacac agccacacga cacgaggcag 1860
ggctacccca ctagacaatc tgtccactca ccactggcct cacttccttg atctcttctc 1920
gtcttctcca ccccgcacgc ggccaccccc gcagggaccc cgtgacccgc gcccgcgccc 1980
gcgcctcacc gca 1993
<210>28
<211>1534
<212>PRT
<213>鼠耳芥(Arabidopsis thaliana)
<220>
<221>肽
<222>(0)...(0)
<223>gi:10177145 DNA(胞嘧啶-5)-甲基转移酶(MET1)(At5g49160)
<400>28
Met Val Glu Asn Gly Ala Lys Ala Ala Lys Arg Lys Lys Arg Pro Leu
1 5 10 15
Pro Glu Ile Gln Glu Val Glu Asp Val Pro Arg Thr Arg Arg Pro Arg
20 25 30
Arg Ala Ala Ala Cys Thr Ser Phe Lys Glu Lys Ser Ile Arg Val Cys
35 40 45
Glu Lys Ser Ala Thr Ile Glu Val Lys Lys Gln Gln Ile Val Glu Glu
50 55 60
Glu Phe Leu Ala Leu Arg Leu Thr Ala Leu Glu Thr Asp Val Glu Asp
65 70 75 80
Arg Pro Thr Arg Arg Leu Asn Asp Phe Val Leu Phe Asp Ser Asp Gly
85 90 95
Val Pro Gln Pro Leu Glu Met Leu Glu Ile His Asp Ile Phe Val Ser
100 105 110
Gly Ala Ile Leu Pro Ser Asp Val Cys Thr Asp Lys Glu Lys Glu Lys
115 120 125
Gly Val Arg Cys Thr Ser Phe Gly Arg Val Glu His Trp Ser Ile Ser
130 135 140
Gly Tyr Glu Asp Gly Ser Pro Val Ile Trp Ile Ser Thr Glu Leu Ala
145 150 155 160
Asp Tyr Asp Cys Arg Lys Pro Ala Ala Ser Tyr Arg Lys Val Tyr Asp
165 170 175
Tyr Phe Tyr Glu Lys Ala Arg Ala Ser Val Ala Val Tyr Lys Lys Leu
180 185 190
Ser Lys Ser Ser Gly Gly Asp Pro Asp Ile Gly Leu Glu Glu Leu Leu
195 200 205
Ala Ala Val Val Arg Ser Met Ser Ser Gly Ser Lys Tyr Phe Ser Ser
210 215 220
Gly Ala Ala Ile Ile Asp Phe Val Ile Ser Gln Gly Asp Phe Ile Tyr
225 230 235 240
Asn Gln Leu Ala Gly Leu Asp Glu Thr Ala Lys Lys His Glu Ser Ser
245 250 255
Tyr Val Glu Ile Pro Val Leu Val Ala Leu Arg Glu Lys Ser Ser Lys
260 265 270
Ile Asp Lys Pro Leu Gln Arg Glu Arg Asn Pro Ser Asn Gly Val Arg
275 280 285
Ile Lys Glu Val Ser Gln Val Ala Glu Ser Glu Ala Leu Thr Ser Asp
290 295 300
Gln Leu Val Asp Gly Thr Asp Asp Asp Arg Arg Tyr Ala Ile Leu Leu
305 310 315 320
Gln Asp Glu Glu Asn Arg Lys Ser Met Gln Gln Pro Arg Lys Asn Ser
325 330 335
Ser Ser Gly Ser Ala Ser Asn Met Phe Tyr Ile Lys Ile Asn Glu Asp
340 345 350
Glu Ile Ala Asn Asp Tyr Pro Leu Pro Ser Tyr Tyr Lys Thr Ser Glu
355 360 365
Glu Glu Thr Asp Glu Leu Ile Leu Tyr Asp Ala Ser Tyr Glu Val Gln
370 375 380
Ser Glu His Leu Pro His Arg Met Leu His Asn Trp Ala Leu Tyr Asn
385 390 395 400
Ser Asp Leu Arg Phe Ile Ser Leu Glu Leu Leu Pro Met Lys Gln Cys
405 410 415
Asp Asp Ile Asp Val Asn Ile Phe Gly Ser Gly Val Val Thr Asp Asp
420 425 430
Asn Gly Ser Trp Ile Ser Leu Asn Asp Pro Asp Ser Gly Ser Gln Ser
435 440 445
His Asp Pro Asp Gly Met Cys Ile Phe Leu Ser Gln Ile Lys Glu Trp
450 455 460
Met Ile Glu Phe Gly Ser Asp Asp Ile Ile Ser Ile Ser Ile Arg Thr
465 470 475 480
Asp Val Ala Trp Tyr Arg Leu Gly Lys Pro Ser Lys Leu Tyr Ala Pro
485 490 495
Trp Trp Lys Pro Val Leu Lys Thr Ala Arg Val Gly Ile Ser Ile Leu
500 505 510
Thr Phe Leu Arg Val Glu Ser Arg Val Ala Arg Leu Ser Phe Ala Asp
515 520 525
Val Thr Lys Arg Leu Ser Gly Leu Gln Ala Asn Asp Lys Ala Tyr Ile
530 535 540
Ser Ser Asp Pro Leu Ala Val Glu Arg Tyr Leu Val Val His Gly Gln
545 550 555 560
Ile Ile Leu Gln Leu Phe Ala Val Tyr Pro Asp Asp Asn Val Lys Arg
565 570 575
Cys Pro Phe Val Val Gly Leu Ala Ser Lys Leu Glu Asp Arg His His
580 585 590
Thr Lys Trp Ile Ile Lys Lys Lys Lys Ile Ser Leu Lys Glu Leu Asn
595 600 605
Leu Asn Pro Arg Ala Gly Met Ala Pro Val Ala Ser Lys Arg Lys Ala
610 615 620
Met Gln Ala Thr Thy Thr Arg Leu Val Asn Arg Ile Trp Gly Glu Phe
625 630 635 640
Tyr Ser Asn Tyr Ser Pro Glu Asp Pro Leu Gln Ala Thr Ala Ala Glu
645 650 655
Asn Gly Glu Asp Glu Val Glu Glu Glu Gly Gly Asn Gly Glu Glu Glu
660 665 670
Val Glu Glu Glu Gly Glu Asn Gly Leu Thr Glu Asp Thr Val Pro Glu
675 680 685
Pro Val Glu Val Gln Lys Pro His Thr Pro Lys Lys Ile Arg Gly Ser
690 695 700
Ser Gly Lys Arg Glu Ile Lys Trp Asp Gly Glu Ser Leu Gly Lys Thr
705 710 715 720
Ser Ala Gly Glu Pro Leu Tyr Gln Gln Ala Leu Val Gly Gly Glu Met
725 730 735
Val Ala Val Gly Gly Ala Val Thr Leu Glu Val Asp Asp Pro Asp Glu
740 745 750
Met Pro Ala Ile tyr Phe Val Glu Tyr Met Phe Glu Ser Thr Asp His
755 760 765
Cys Lys Met Leu His Gly Arg Phe Leu Gln Arg Gly Ser Met Thr Val
770 775 780
Leu Gly Asn Ala Ala Asn Glu Arg Glu Leu Phe Leu Thr Asn Glu Cys
785 790 795 800
Met Thr Thr Gln Leu Lys Asp Ile Lys Gly Val Ala Ser Phe Glu Ile
805 810 815
Arg Ser Arg Pro Trp Gly His Gln Tyr Arg Lys Lys Asn Ile Thr Ala
820 825 830
Asp Lys Leu Asp Trp Ala Arg Ala Leu Glu Arg Lys Val Lys Asp Leu
835 840 845
Pro Thr Glu Tyr Tyr Cys Lys Ser Leu Tyr Ser Pro Glu Arg Gly Gly
850 855 860
Phe Phe Ser Leu Pro Leu Ser Asp Ile Gly Arg Ser Ser Gly Phe Cys
865 870 875 880
Thr Ser Cys Lys Ile Arg Glu Asp Glu Glu Lys Arg Ser Thr Ile Lys
885 890 895
Leu Asn Val Ser Lys Thr Gly Phe Phe Ile Asn Gly Ile Glu Tyr Ser
900 905 910
Val Glu Asp Phe Val Tyr Val Asn Pro Asp Ser Ile Gly Gly Leu Lys
915 920 925
Glu Gly Ser Lys Thr Ser Phe Lys Ser Gly Arg Asn Ile Gly Leu Arg
930 935 940
Ala Tyr Val Val Cys Gln Leu Lru Glu Ile Val Pro Lys Glu Ser Arg
945 950 955 960
Lys Ala Asp Leu Gly Ser Phe Asp Val Lys Val Arg Arg Phe Tyr Arg
965 970 975
Pro Glu Asp Val Ser Ala Glu Lys Ala Tyr Ala Ser Asp Ile Gln Glu
980 985 990
Leu Tyr Phe Ser Gln Asp Thr Val Val Leu Pro Pro Gly Ala Leu Glu
995 1000 1005
Gly Lys Cys Glu Val Arg Lys Lys Ser Asp Met Pro Leu Ser Arg Glu
1010 1015 1020
Tyr Pro Ile Ser Asp His Ile Phe Phe Cys Asp Leu Phe Phe Asp Thr
1025 1030 1035 1040
Ser Lys Gly Ser Leu Lys Gln Leu Pro Ala Asn Met Lys Pro Lys Phe
1045 1050 1055
Ser Thr Ile Lys Asp Asp Thr Leu Leu Arg Lys Lys Lys Gly Lys Gly
1060 1065 1070
Val Glu Ser Glu Ile Glu Ser Glu Ile Val Lys Pro Val Glu Pro Pro
1075 1080 1085
Lys Glu Ile Arg Leu Ala Thr Leu Asp Ile Phe Ala Gly Cys Gly Gly
1090 1095 1100
Leu Ser His Gly Leu Lys Lys Ala Gly Val Ser Asp Ala Lys Typ Ala
1105 1110 1115 1120
Ile Glu Tyr Glu Glu Pro Ala Gly Gln Ala Phe Lys Gln Asn His Pro
1125 1130 1135
Glu Ser Thr Val Phe Val Asp Asn Cys Asn Val Ile Leu Arg Ala Ile
1140 1145 1150
Met Glu Lys Gly Gly Asp Gln Asp Asp Cys Val Ser Thr Thr Glu Ala
1155 1160 1165
Asn Glu Leu Ala Ala Lys Leu Thr Glu Glu Gln Lys Ser Thr Leu Pro
1170 1175 1180
Leu Pro Gly Gln Val Asp Phe Ile Asn Gly Gly Pro Pro Cys Gln Gly
1185 1190 1195 1200
Phe Ser Gly Met Asn Arg Phe Asn Gln Ser Ser Trp Ser Lys Val Gln
1205 1210 1215
Cys Glu Met Ile Leu Ala Phe Leu Ser Phe Ala Asp Tyr Phe Arg Pro
1220 1225 1230
Arg Tyr Phe Leu Leu Glu Asn Val Arg Thr Phe Val Ser Phe Asn Lys
1235 1240 1245
Gly Gln Thr Phe Gln Leu Thr Leu Ala Ser Leu Leu Glu Met Gly Tyr
1250 1255 1260
Gln Val Arg Phe Gly Ile Leu Glu Ala Gly Ala Tyr Gly Val Ser Gln
1265 1270 1275 1280
Ser Arg Lys Arg Ala Phe Ile Trp Ala Ala Ala Pro Glu Glu Val Leu
1285 1290 1295
Pro Glu Trp Pro Glu Pro Met His Val Phe Gly Val Pro Lys Leu Lys
1300 1305 1310
Ile Ser Leu Ser Gln Gly Leu His Tyr Ala Ala Val Arg Ser Thr Ala
1315 1320 1325
Leu Gly Ala Pro Phe Arg Pro Ile Thr Val Arg Asp Thr Ile Gly Asp
1330 1335 1340
Leu Pro Ser Val Glu Asn Gly Asp Ser Arg Thr Asn Lys Glu Tyr Lys
1345 1350 1355 1360
Glu Val Ala Val Ser Trp Phe Gln Lys Glu Ile Arg Gly Asn Thr Ile
1365 1370 1375
Ala Leu Thr Asp His Ile Cys Lys Ala Met Asn Glu Leu Asn Leu Ile
1380 1385 1390
Arg Cys Lys Leu Ile Pro Thr Arg Pro Gly Ala Asp Trp His Asp Leu
1395 1400 1405
Pro Lys Arg Lys Val Thr Leu Ser Asp Gly Arg Val Glu Glu Met Ile
1410 1415 1420
Pro Phe Cys Leu Pro Asn Thr Ala Glu Arg His Asn Gly Trp Lys Gly
1425 1430 1435 1440
Leu Tyr Gly Arg Leu Asp Trp Gln Gly Asn Phe Pro Thr Ser Val Thr
1445 1450 1455
Asp Pro Gln Pro Met Gly Lys Val Gly Met Cys Phe His Pro Glu Gln
1460 1465 1470
His Arg Ile Leu Thr Val Arg Glu Cys Ala Arg Ser Gln Gly Phe Pro
1475 1480 1485
Asp Ser Tyr Glu Phe Ala Gly Asn Ile Asn His Lys His Arg Gln Ile
1490 1495 1500
Gly Asn Ala Val Pro Pro Pro Leu Ala Phe Ala Leu Gly Arg Lys Leu
1505 1510 1515 1520
Lys Glu Ala Leu His Leu Lys Lys Ser Pro Gln His Gln Pro
1525 1530
<210>29
<211>4845
<212>DNA
<213>鼠耳芥(Arabidopsis thaliana)
<220>
<221>misc_feature
<222>(0)...(0)
<223>NM_124293.3;GI:42568413;DNA(胞嘧啶-5-)-甲基转移酶
(ATHIM),(At5g49160)mRNA,完整的cds
<400>29
gaccaattag ggtttcgcaa tcttccagta gatttcgctt ctcaacggat tttgaaaatg 60
gtggaaaatg gggctaaagc tgcgaagcga aagaagagac cacttccaga gattcaagag 120
gtagaagatg tacctaggac gaggagacca aggcgtgctg cagcgtgtac cagtttcaag 180
gagaaatcta ttcgagtctg tgagaaatct gctactattg aagtaaagaa acagcagatt 240
gtggaggaag agtttctcgc gttacggtta acggctctgg aaactgatgt tgaagatcgt 300
ccaaccagga gactgaatga ttttgttttg tttgattcag atggagttcc acaacctctg 360
gagatgttgg agattcatga catattcgtt tcaggtgcta tcttaccttc agatgtgtgt 420
actgataagg agaaagagaa gggtgtgagg tgtacatcgt ttggacgggt tgagcattgg 480
agtatctctg gttatgaaga tggttcccct gttatttgga tctcaacgga attggcggat 540
tatgattgtc gtaaacctgc tgctagctac aggaaggttt atgattactt ctatgagaaa 500
gctcgtgctt cagtggctgt gtataagaaa ttgtccaagt catctggtgg ggatcctgat 660
ataggtcttg aggagttact tgcggcggtt gtcagatcaa tgagcagtgg aagcaagtac 720
ttttctagtg gtgcggcaat catcgatttt gttatatccc agggagattt tatatataac 780
caactcgctg gtttggatga gacagccaag aaacatgaat caagctatgt tgagattcct 840
gttcttgtag ctctcagaga gaagagtagt aagattgaca agcctctgca gagggaaaga 900
aacccatcta atggtgtgag gattaaagaa gtttctcaag ttgcggagag cgaggccttg 960
acatctgatc aactggttga tggtactgat gatgacagaa gatatgctat actcttacaa 1020
gacgaagaga ataggaaatc tatgcaacag cccagaaaaa acagcagctc aggttctgct 1080
tcaaatatgt tctacattaa gataaatgaa gatgagattg ccaatgatta tcctctccca 1140
tcgtactata agacctccga agaagaaaca gatgaactta tactttatga tgcttcctat 1200
gaggttcaat ctgaacacct gcctcacagg atgcttcaca actgggctct ttataactct 1260
gatttacgat tcatatcact ggaacttcta ccgatgaaac aatgtgatga tattgatgtc 1320
aacatttttg ggtcaggtgt ggtgactgat gataatggaa gttggatttc tttaaacgat 1380
cctgacagcg gttctcagtc acacgatcct gatgggatgt gcatattcct cagtcaaatt 1440
aaagaatgga tgattgagtt tgggagcgat gatattatct ccatttctat acgaacagat 1500
gtggcctggt accgtcttgg gaaaccatca aaactttatg ccccttggtg gaaacctgtt 1560
ctgaaaacag caagggttgg gataagcatt cttacttttc ttagggtgga aagtagggtt 1620
gctaggcttt catttgcaga tgtcacaaaa agactgtctg ggttacaggc gaatgataaa 1680
gcttacattt cttctgaccc cttggctgtt gagagatatt tggtcgtcca tgggcaaatt 1740
attttacagc tttttgcagt ttatccggac gacaatgtca aaaggtgtcc atttgttgtt 1800
ggtcttgcaa gcaaattgga ggataggcac cacacaaaat ggatcatcaa gaagaagaaa 1860
atttcgctga aggaactgaa tctgaatcca agggcaggca tggcaccagt agcatcgaag 1920
aggaaagcta tgcaagcaac aacaactcgc ctggtcaaca gaatttgggg agagttttac 1980
tccaattact ctccagagga tccattgcag gcgactgctg cagaaaatgg ggaggatgag 2040
gtggaagagg aaggcggaaa tggggaggaa gaggttgaag aggaaggtga aaatggtctc 2100
acagaggaca ctgtaccaga acctgttgag gttcagaagc ctcatactcc taagaaaatc 2160
cgaggcagtt ctggaaaaag ggaaataaaa tgggatggtg agagtctagg aaaaacttct 2220
gctggcgagc ctctctatca acaagccctt gttggagggg aaatggtggc tgtaggtggc 2280
gctgtcacct tggaagttga tgatccagat gaaatgccgg ccatctattt tgtggagtac 2340
atgttcgaaa gtacagatca ctgcaaaatg ttacatggta gattcttaca aagaggatct 2400
atgactgttc tggggaatgc tgctaacgag agggaactat tcctgactaa tgaatgcatg 2460
actacacagc tcaaggacat taaaggagta gccagttttg agattcgatc aaggccatgg 2520
gggcatcagt ataggaaaaa gaacatcact gcggataagc ttgactgggc tagagcatta 2580
gaaagaaaag taaaagattt gccaacagag tattactgca aaagcttgta ctcacctgag 2640
agagggggat tctttagtct tccactaagt gatattggtc gcagttctgg gttctgcact 2700
tcatgtaaga taagggagga tgaagagaag aggtctacaa ttaaactaaa tgtttcaaag 2760
acaggctttt tcatcaatgg gattgagtat tctgttgagg attttgtcta tgtcaaccct 2820
gactctattg gtgggttgaa ggagggtagt aaaacttctt ttaagtctgg gcgaaacatt 2880
gggttaagag cgtatgttgt ttgccaattg ctggaaattg ttccaaagga atctagaaag 2940
gctgatttgg gttcctttga tgttaaagtg agaaggtttt ataggcctga ggatgtttc
gcagagaagg cctatgcttc agacatccaa gaattgtatt tcagccagga cacagttgt
ctccctccag gtgctctaga gggaaaatgt gaagtaagaa agaaaagtga tatgccctt
tcccgtgaat atccaatatc agaccatatt ttcttctgtg atcttttctt tgacacctc
aaaggttctc tcaagcagct gcccgccaat atgaagccaa agttctctac tattaagga
gacacacttt taagaaagaa aaagggaaag ggagtagaga gtgaaattga gtctgagat
gtcaagcctg ttgagccacc taaagagatt cgtctggcta ctctagatat ttttgctgg
tgtggtggcc tgtctcatgg actgaaaaag gcgggtgtat ctgatgcaaa gtgggcgat
gagtatgaag agccagctgg gcaggctttt aaacaaaacc atcctgagtc aacagtttt
gttgacaact gcaatgtgat tcttagggct ataatggaga aaggtggaga tcaagatga
tgtgtctcta ctacagaggc aaatgaatta gcagctaaac taactgagga gcagaagag
actctgccac tgcctggtca agtggacttc atcaatggtg gacctccatg tcagggatt
tctggtatga acaggttcaa ccaaagctct tggagtaaag ttcagtgtga aatgatatt
gcattcttgt cctttgctga ctatttccgg ccaaggtatt ttcttctgga gaacgtgag
acctttgtgt cattcaataa agggcagaca tttcagctta ctttggcttc ccttctcga
atgggttacc aggtgagatt tggaatcctg gaggccggtg catatggagt atcccaatc
cgtaaacgag ctttcatttg ggctgctgca ccagaagaag ttctccctga atggcctga
ccgatgcatg tctttggtgt tccaaagttg aaaatctcac tatctcaagg tttacatta
gctgctgttc gtagtactgc acttggtgcc cctttccgtc caatcaccgt gagagacac
attggtgatc ttccatcagt agaaaacgga gactctagga caaacaaaga gtataaaga
gttgcagtct cgtggttcca aaaggagata agaggaaaca cgattgctct cactgatca
atctgcaagg ctatgaatga gcttaacctc attcgatgca aattaatccc aactaggcc
ggggctgatt ggcatgactt gccaaagaga aaggttacgt tatctgatgg gcgcgtaga
gaaatgattc ctttttgtct cccaaacaca gctgagcgcc acaacggttg gaagggact
tatgggagat tagattggca aggaaacttt ccgacttccg tcacggatcc tcagcccat
ggtaaggttg gaatgtgctt tcatcctgaa cagcacagaa tccttacagt ccgtgaatg
gcccgatctc aggggtttcc ggatagctac gagtttgcag ggaacataaa tcacaagca
aggcagattg ggaatgcagt ccctccacca ttggcatttg ctctaggtcg taagctcaa
gaagccctac atctcaagaa gtctcctcaa caccaaccct agataaccac ccaaatttg
catttccttt ttcaataata ttagtcatta tgatccttgt cttgaatgaa actcattgg
gctgatactt ttgataaaga aagcctacga agagtttttg tatattccgt attcggatt
aaaaatctca ttatacaagc aagcaatgat gtctatagac tatga
<210>30
<211>1564
<212>PRT
<213>桃(Prunus persica)
<220>
<221>肽
<222>(0)...(0)
<223>gi|37039880|gb|AAM96952.1|DNA(胞嘧啶-5-)-甲基转移酶
<400>30
Met Gly Ser Ala Ala Ala Ala Glu Ala Ala Glu Ala Ala Ala Leu Leu
1 5 10 15
Glu Ala Lys Gly Ala Asn Gly Thr Lys Pro Pro Ser Ser Ser Ser Ser
20 25 30
Gly Met Thr Lys Lys Lys Lys Gly Lys Gln Asp Ser Gln Lys Ala Ala
35 40 45
Pro Lys Ala Lys Lys Arg Asn Leu Pro Gln Ser Ser Glu Glu Glu Pro
50 55 60
Ser Arg Ser Arg Lys Met Pro Lys Arg Ala Ala Ala Cys Lys Asp Phe
65 70 75 80
Lys Asp Arg Ser Val His Ile Ser Glu Lys Ser Ser Leu Ile Glu Ser
85 90 95
Lys Glu Asp Gln Ile Val Glu Glu Glu Ile Leu Ala Val Arg Leu Thr
100 105 110
Cys Gly Pro Asp Gln Asp Ala Val Arg Pro Asn Arg Arg Leu Thr Asp
115 120 125
Phe Val Leu His Asp Ala Thr Gly Ser Ala Gln Pro Leu Glu Met Leu
130 135 140
Glu Val Ser Asp Met Phe Ile Ser Gly Ala Ile Leu Pro Leu Asn Glu
145 150 155 160
Ser Ser Asp Lys Asp Lys Gly Arg Ser Val Arg Cys Glu Gly Phe Gly
165 170 175
Arg Ile Glu Ser Trp Asp Ile Ser Gly Tyr Glu Asp Gly Ser Pro Val
180 185 190
Ile Trp Leu Ser Thr Glu Val Ala Asp Tyr Asp Cys Arg Lys Pro Ala
195 200 205
Ser Ser Tyr Lys Lys Tyr Phe Asp Gln Phe Phe Glu Lys Ala Arg Ala
210 215 220
Cys Ile Glu Val Tyr Lys Lys Leu Ser Lys Ser Asn Ser Asp Asn Ser
225 230 235 240
Asp Pro Thr Leu Asp Glu Leu Leu Ala Gly Ile Ala Arg Ser Met Ser
245 250 255
Gly Ser Lys Phe Phe Ser Gly Ser Ala Ser Val Lys Asp Phe Val Leu
260 265 270
Ser Gln Gly Glu Phe Ile Tyr Ala Gln Val Ile Gly Leu Glu Glu Thr
275 280 285
Ser Lys Lys Asn Asp Arg Pro Phe Ala Glu Leu Pro Val Leu Ala Ala
290 295 300
Leu Arg Asp Glu Ser Ile Lys Arg Gly Asn Phe Val Gln Ser Lys Pro
305 310 315 320
Gly Ile Ser Ser Gly Thr Leu Lys Ile Gly Gly Glu Asn Gly Val Asp
325 330 335
Ser Ala Gly Ser Ser Val Val Glu Ala Glu Glu Asn Glu Asp Ala Lys
340 345 350
Leu Ala Lys Leu Leu Gln Glu Glu Glu Tyr Trp Lys Ser Met Lys Gln
355 360 365
Arg Lys Arg Gln Gly Pro Ala Ser Val Ser Ser Lys Tyr Tyr Ile Lys
370 375 380
Ile Asn Glu Asp Glu Ile Ala Asn Asp Tyr Pro Leu Pro Ala Tyr Tyr
385 390 395 400
Lys Asn Cys Ile Glu Glu Thr Asp Glu Phe Ile Val Phe Asp Asn Glu
405 410 415
Phe Asp Ile Cys Asn Ala Asp Asp Leu Pro Arg Ser Met Leu His Asn
420 425 430
Trp Cys Leu Tyr Asn Ser Asp Ser Arg Leu Ile Ser Leu Glu Leu Leu
435 440 445
Pro Met Lys Pro Cys Ala Asp Ile Asp Val Thr Ile Phe Gly Ser Gly
450 455 460
Val Met Ser Glu Asp Asp Gly Ser Gly Phe Cys Leu Asp Ser Asp Gly
465 470 475 480
Thr Ser Ser Gly Pro Gly Ala Gln Asp Ala Asp Gly Met Pro Ile Tyr
485 490 495
Leu Ser Ala Ile Lys Glu Trp Met Ile Glu Leu Gly Ala Ser Met Val
500 505 510
Ser Ile Ser Ile Arg Thr Asp Met Ala Trp Tyr Arg Leu Gly Lys Pro
515 520 525
Ser Lys Gln Tyr Ala Leu Trp Tyr Glu Pro Ile Leu Arg Thr Ala Lys
530 535 540
Ile Gly Arg Ser Ile Ile Thr Met Leu Lys Asp Gln Ser Arg Val Ala
545 550 555 560
Arg Leu Ser Phe Ala Asp Val Ile Lys Arg Leu Ser Gly Phe Gln Lys
565 570 575
Asp His Cys Ala Tyr Ile Ser Ser Asp Pro Ala Phe Val Glu Lys Tyr
580 585 590
Val Val Val His Gly Gln Ile Ile Leu Gln Leu Phe Ser Glu Phe Pro
595 600 605
Asp Ala Gln Ile Lys Lys Cys Pro Phe Val Ile Gly Leu Thr Lys Lys
610 615 620
Met Glu Glu Arg His His Thr Lys Trp Leu Val Lys Lys Lys Lys Leu
625 630 635 640
Val Glu Lys Ser Glu Ser Asn Leu Asn Pro Arg Ala Ser Met Ala Pro
645 650 655
Val Val Ser Lys Arg Lys Thr Met Gln Ala Thr Thr Thr Arg Leu Ile
660 665 670
Asn Arg Ile Trp Gly Glu Tyr Tyr Ser Asn Tyr Ser Pro Glu Asp Ser
675 680 685
Lys Glu Gly Asp Ile Gly Glu Lys Lys Glu Glu Glu Glu Val Glu Glu
690 695 700
Glu Asp Val Glu Glu Asp Asp Val Glu Glu Asn Pro Thr Val Met Glu
705 710 715 720
Gln Ala Gln Lys Pro Ser Ser Ile Ser Arg Gln Thr Lys Ser Cys Leu
725 730 735
Asn Asn Arg Glu Ile Leu Trp Glu Gly Glu Pro Val Gly Gln Thr Cys
740 745 750
Ser Gly Glu Ala Leu Tyr Lys Arg Ala Ile Leu Trp Gly Glu Glu Ile
755 760 765
Ser Val Gly Gly Ala Val Leu Val Glu Leu Asp Glu Ser His Glu Leu
770 775 780
Pro Ala Ile Tyr Phe Val Glu Tyr Met Tyr Glu Thr Leu Asn Gly Ser
785 790 795 800
Lys Met Phe His Gly Arg Val Met Glu Arg Gly Ser Gln Thr Val Leu
805 810 815
Gly Asn Thr Ala Asn Glu Arg Glu Val Phe Leu Thr Asn Glu Cys Thr
820 825 830
Asn Leu Ala Leu Lys Glu Val Lys Gln Ala Ala Ala Val Gly Ile Lys
835 840 845
Val Met Pro Trp Gly His Gln Tyr Arg Lys Asp Asn Ala Asp Ala Asn
850 855 860
Arg Thr Asp Arg Ala Arg Ala Glu Glu Arg Lys Arg Lys Gly Leu Pro
865 870 875 880
Thr Glu Tyr Tyr Cys Lys Ser Leu Tyr Cys Pro Glu Arg Gly Ala Phe
885 890 895
Leu Ser Leu Ser Arg Asp Thr Met Gly Leu Gly Ser Gly Ala Cys His
900 905 910
Ser Cys Lys Met Asn Glu Ala Glu Glu Ala Lys Glu Val Phe Lys Val
915 920 925
Asn Ser Ser Lys Thr Gly Phe Val Tyr Arg Gly Val Glu Tyr Ser Val
930 935 940
His Asp Tyr Val Tyr Val Ser Pro His Tyr Phe Gly Val Glu Arg Met
945 950 955 960
Glu Thr Glu Ile Phe Lys Ala Gly Arg Asn Leu Val Leu Lys Ala Tyr
965 970 975
Val Val Cys Gln Val Leu Glu Ile Val Val Met Lys Glu Ser Lys Arg
980 985 990
Pro Glu Ile Glu Ser Thr Gln Val Lys Val Arg Arg Phe Phe Arg Pro
995 1000 1005
Glu Asp Ile Ser Val Glu Lys Ala Tyr Ser Ser Asp Ile Arg Glu Val
1010 1015 1020
Tyr Tyr Ser Glu Gln Thr His Ile Val Pro Val Asp Asn Ile Glu Arg
1025 1030 1035 1040
Lys Cys Glu Val Arg Lys Lys Ser Asp Leu Pro Val Cys Asn Ala Pro
1045 1050 1055
Val Ile Phe Gln His Ile Phe Phe Cys Glu His Leu Tyr Asp Pro Ser
1060 1065 1070
Lys Gly Ser Ile Lys Gln Leu Pro Ala His Ile Lys Leu Arg Tyr Ser
1075 1080 1085
Thr Gly Gly Gly His Ala Asp Ser Arg Lys Arg Lys Gly Lys Cys Lys
1090 1095 1100
Glu Gly Glu Asn Val Ser Glu Val Glu Asn Gln Arg Val Asp Ser Glu
1105 1110 1115 1120
Gln Lys Arg Leu Ala Thr Leu Asp Ile Phe Ala Gly Cys Gly Gly Leu
1125 1130 1135
Ser Asn Gly Leu Arg Gln Ser Gly Ala Ser Ile Thr Lys Trp Ala Ile
1140 1145 1150
Glu Tyr Glu Glu Pro Ala Gly Asp Ala Phe Lys Leu Asn His Pro Glu
1155 1160 1165
Ser Leu Val Phe Ile Asn Asn Cys Asn Val Ile Leu Arg Ala Val Met
1170 1175 1180
Glu Lys Cys Gly Asp Thr Asp Asp Cys Ile Ala Thr Ser Glu Ala Ala
1185 1190 1195 1200
Glu Leu Ala Ala Ser Leu Asp Glu Glu Val Lys Asn Asp Leu Pro Leu
1205 1210 1215
Pro Gly Gln Val Asp Phe Ile Asn Gly Gly Pro Pro Cys Arg Gly Phe
1220 1225 1230
Ser Gly Met Asn Arg Phe Thr Gln Ser Pro Trp Ile Lys Phe His Cys
1235 1240 1245
Lys Met Ile Trp Ala Cys Leu Ala Phe Ala Asp Tyr Phe Arg Pro Lys
1250 1255 1260
Leu Phe Pro Leu Glu Asn Val Arg Lys Phe Val Ser Phe Asn Lys Gly
1265 1270 1275 1280
Gln Thr Phe Gln Leu Thr Leu Ala Ser Leu Leu Glu Met Gly Tyr Gln
1285 1290 1295
Val Arg Phe Gly Ile Leu Glu Ala Gly Ala Tyr Gly Ile Ser Gln Ser
1300 1305 1310
Arg Lys Arg Ala Phe Ile Trp Ala Ala Ala Pro Glu Glu Val Leu Pro
1315 1320 1325
Glu Trp Pro Glu Pro Met His Val Phe Gly Val Pro Lys Leu Lys Ile
1330 1335 1340
Ser Leu Ser Gln Gly Leu His Tyr Ala Ala Val Arg Ser Thr Ala Leu
1345 1350 1355 1360
Gly Ala Pro Phe Arg Pro Ile Thr Val Arg Asp Thr Ile Gly Asp Leu
1365 1370 1375
Pro Ser Val Glu Asn Gly Asp Ser Arg Thr Asn Lys Glu Tyr Lys Glu
1380 1385 1390
Val Ala Val Ser Trp Phe Gln Lys Glu Ile Arg Gly Asn Thr Ile Ala
1395 1400 1405
Leu Thr Asp His Ile Cys Lys Ala Met Asn Glu Leu Asn Leu Ile Arg
1410 1415 1420
Cys Lys Leu Ile Pro Thr Arg Pro Gly Ala Asp Trp His Asp Leu Pro
1425 1430 1435 1440
Lys Arg Lys Val Thr Leu Ser Asp Gly Arg Val Glu Glu Met Thr Pro
1445 1450 1455
Phe Cys Leu Pro Asn Thr Ala Glu Arg His Asn Gly Trp Lys Gly Leu
1460 1465 1470
Tyr Gly Arg Leu Asp Trp Gln Gly Asn Phe Pro Thr Ser Val Thr Asp
1475 1480 1485
Pro Gln Pro Met Gly Lys Val Gly Met Cys Phe His Leu Glu Gln His
1490 1495 1500
Arg Ile Leu Thr Val Arg Glu Cys Ala Arg Ser Gln Gly Phe Pro Asp
1505 1510 1515 1520
Ser Tyr Glu Phe Ala Gly Asn Ile Asn His Lys His Arg Gln Ile Gly
1525 1530 1535
Asn Ala Val Pro Pro Thr Leu Ala Tyr Ala Leu Gly Thr Lys Leu Lys
1540 1545 1550
Glu Ala Ile Asp Ser Lys Arg Leu Ser Ser Gln Glu
1555 1560
<210>31
<211>4992
<212>DNA
<213>桃(Prunus persica)
<220>
<221>misc_feature
<222>(0)...(0)
<223>AY128652.1;GI:37039879;DNA(胞嘧啶-5-)-甲基转移酶mRNA,完整的cds
<400>31
tcagccctct cattacaccc cacattgcgc attctagggt ttcactggcg agtggggaga 60
aatgggttcc gcagcggcag cagaagcggc agaagcagca gcgctcttgg aggccaaagg 120
tgccaatggg actaaaccac catcttcgtc atcttcagga atgacgaaga agaagaaggg 180
taaacaagat tcccaaaagg cagcacctaa agctaagaag cgaaatttgc ctcagagcag 240
tgaagaagag ccttcccgat ctcggaaaat gccgaagcgg gctgctgctt gcaaagactt 300
taaggatagg tctgttcata tttctgagaa gtctagcctt attgaaagca aggaggacca 360
gatagtggag gaagaaattc ttgccgtacg cctgacttgt ggcccggacc aagatgctgt 420
gcgcccaaac agaagactga ctgattttgt tttgcatgat gcaactggtt ccgcacaacc 480
ccttgagatg ttggaagttt ctgacatgtt tatatctggt gctatattgc ctctcaatga 540
aagttctgac aaggacaagg gaagaagtgt tagatgtgaa ggtttcgggc ggatagaatc 600
ttgggacatc tctggttatg aagatggctc ccccgtaata tggctttcaa ctgaagttgc 660
tgattatgat tgccgtaaac cggccagtag ctacaagaaa tactttgatc aattctttga 720
gaaagcgcgt gcttgcatag aggtttacaa gaagctgtct aaatccaact ccgacaactc 780
cgaccccact cttgatgaat tgcttgctgg tattgcacga tcaatgagcg ggagcaaatt 840
cttttctggg agtgcatctg tcaaagactt tgttctatct caaggcgagt ttatttatgc 900
tcaagtaata ggtctggagg aaacatcaaa gaagaacgat cggccatttg cagagttacc 960
tgtccttgct gccctcagag atgagagtat aaagcgtgga aattttgtgc aatcaaaacc 1020
gggaatttca agtggtactt taaagattgg tggagagaac ggagtggatt cagctggttc 1080
atccgtagtt gaagctgagg aaaatgagga tgcaaagttg gcaaaactct tgcaagagga 1140
agaatactgg aagtcaatga aacaaagaaa gcgccagggt cctgcctctg tgtcaagcaa 1200
atactacatc aaaattaatg aagatgaaat tgccaatgat tatcctctac ccgcttatta 1260
caagaattgc attgaagaaa ctgatgagtt catagttttt gacaatgagt ttgatatctg 1320
taatgctgat gaccttcctc gaagtatgct tcataattgg tgtctataca actcggactc 1380
aagattgatt tcgctcgagc ttcttccaat gaaaccctgc gcagacattg atgttaccat 1440
tttcgggtca ggggttatga gtgaagatga tggaagtggc ttttgtcttg attctgatgg 1500
tacttcaagt ggtccaggag cccaggatgc tgatggaatg ccaatttact tgagtgcgat 1560
aaaggaatgg atgattgaat tgggagcatc aatggtttca atatcaatcc gaacagatat 1620
ggcctggtac agacttggca agccatctaa gcagtatgct ctgtggtatg aaccaattct 1680
gagaacagca aagattggga gaagtataat cactatgctg aaagatcaaa gtcgagtagc 1740
acggctttct ttcgcagatg tcattaagag actgtcaggg ttccaaaagg accattgtgc 1800
ttacatttct tctgatccag catttgttga gaagtatgtc gttgtccatg gacagataat 1860
actgcaactg ttttcagaat ttccagatgc gcagattaaa aaatgtccat ttgtgattgg 1920
tcttacaaag aaaatggagg agaggcacca tactaaatgg ttagtaaaga agaagaagct 1980
tgtggaaaag agtgaatcaa atttgaaccc aagggcatca atggcacctg tggtttccaa 2040
gaggaagaca atgcaagcta caacaacaag gctgatcaac agaatctggg gggagtacta 2100
ttcaaactac tctccagaag attcgaagga gggagatatt ggagaaaaga aagaggagga 2160
ggaagttgaa gaagaggatg tagaagagga tgatgtagaa gagaatccaa ctgtaatgga 2220
gcaagcccag aagccttctt caatttcaag acaaaccaaa tcatgcctca acaacaggga 2280
aattttgtgg gaaggggagc cagtgggcca aacatgttct ggtgaagctc tttataagcg 2340
tgccattctt tggggagaag aaatttctgt tggcggtgct gttttggtgg aacttgatga 2400
atcccatgaa cttcctgcca tttattttgt ggagtatatg tatgaaacat tgaatggaag 2460
caaaatgttt catggaagag tgatggagcg aggatcccag actgttcttg gcaacactgc 2520
caatgagagg gaggtatttt tgacaaatga gtgcacaaat ttggcattaa aggaagttaa 2580
acaggcagct gctgtgggca ttaaagtaat gccgtggggg catcagtata ggaaggataa 2640
tgctgatgct aacagaactg atagagcaag ggcagaagag aggaagagga agggtttgcc 2700
gactgaatat tactgtaaaa gcttgtattg cccagagaga ggtgctttcc ttagtctttc 2760
acgtgatact atgggtctgg gttctggtgc ctgccactct tgcaaaatga atgaagccga 2820
ggaggccaag gaagtattta aagtgaattc atcaaaaact ggttttgtat acaggggagt 2880
tgagtactca gttcatgatt atgtctatgt aagtccccat tattttggtg tggaaaggat 2940
ggaaactgaa attttcaagg ctggaaggaa tttggtgctg aaagcttatg tcgtgtgcca 3000
agtgctggag atagttgtta tgaaggagtc taaacgacct gaaatagaat ctacccaggt 3060
taaagtaaga agatttttca gaccagagga catatctgtt gagaaggcat acagttcgga 3120
tattagagag gtctactaca gtgaacaaac acacatcgtg cctgttgata atatagaaag 3180
aaaatgtgaa gtcagaaaga agagtgatct tccagtatgt aatgctcctg tcattttcca 3240
gcatattttc ttctgtgaac atctatatga tccttctaaa gggtctatta agcagttgcc 3300
agctcacatc aaactgaggt actcaacagg aggtgggcat gctgattcta gaaagagaaa 3360
gggcaagtgc aaagaaggag aaaatgtttc agaagttgag aaccagagag ttgattctga 3420
gcagaaacgc ctagccacat tggatatatt tgctggttgc ggtggcttgt ctaatgggtt 3480
gcgtcagtct ggtgcttcaa taaccaagtg ggcaattgag tatgaagagc ctgctgggga 3540
tgctttcaaa ctcaaccatc ctgagtcatt ggtttttatc aataactgca atgtgatctt 3600
aagggccgta atggaaaaat gtggggacac agatgattgt attgcaactt ctgaagctgc 3660
tgaattggct gcatcacttg atgaggaggt taaaaatgat ttgccgttgc cggggcaggt 3720
agatttcatc aatggaggac ctccatgccg gggtttctct ggaatgaata ggttcaccca 3780
aagcccttgg attaaatttc attgtaaaat gatttgggct tgcttagcct ttgccgacta 3840
cttccggcca aagttgttcc cgctggagaa tgtgaggaaa tttgtgtcat tcaataaagg 3900
gcagacattt cagcttactt tggcttccct tctcgaaatg ggttaccagg tgagatttgg 3960
aatcctggag gccggtgcat atggaatatc ccaatctcgt aaacgagctt tcatttgggc 4020
tgctgcacca gaagaagttc tccctgaatg gcctgagccg atgcatgtct ttggtgttcc 4080
aaagttgaa aatctcactat ctcaaggttt acattatgct gctgttcgta gtactgcact 4140
tggtgcccct ttccgtccaa tcaccgtgag agacacaatt ggtgatcttc catacgtaga 4200
aaacggagac tctaggacaa acaaagagta taaagaggtt gcagtctcgt ggttccaaaa 4260
ggagataaga ggaaacacga ttgctctcac tgatcatatc tgcaaggcta tgaatgagct 4320
taacctcatt cgatgcaaat taatcccaac taggcctggg gctgattggc atgacttgcc 4380
aaagagaaag gttacgttat ctgatgggcg cgtagaagaa atgactcctt tttgtctccc 4440
aaacacagct gagcgccaca acggttggaa gggactatat gggagattag attggcaagg 4500
aaactttccg acttccgtca cggatcctca gcccatgggt aaggttggaa tgtgctttca 4560
tcttgaacag cacagaatcc ttacagtccg tgaatgcgcc cgttctcagg ggtttccgga 4620
tagctacgag tttgcaggga acataaatca caagcacagg cagattggga atgcagttcc 4680
tcctactttg gcctatgcat tggggactaa actcaaggaa gcaattgaca gcaagaggtt 4740
gtcttcacaa gagtaagagt ggttgttgtt gtttgtttct atgtaatact gatagttcca 4800
tttggttgcc ttctaaggca aaaacacagc tcagtttgtt gtctttgatt ttcttcttat 4860
attgtgtttg taaacttgtc ttgattgagg aacttcaatt aaatacacac aagcattttt 4920
cttcaggaga caagtgtcac aaaagtttgg tacatatata tatttgaaat tattttactt 4980
tavtttagaaa aa 4992
<210>32
<211>265
<212>PRT
<213>大豆(Glycine max)
<220>
<221>肽
<222>(0)...(0)
<223>Ceres克隆:520982 Met1同系物
<400>32
Met Glu Lys Cys Gly Asp Thr Asp Asp Cys Ile Ser Thr Ser Glu Ala
1 5 10 15
Ala Glu Leu Ala Ala Lys Leu Asp Glu Lys Glu Ile Ser Ser Leu Pro
20 25 30
Met Pro Gly Gln Val Asp Phe Ile Asn Gly Gly Pro Pro Cys Gln Gly
35 40 45
Phe Ser Gly Met Asn Arg Phe Asn Gln Ser Ser Trp Ser Lys Val Gln
50 55 60
Cys Glu Met Ile Leu Ala Phe Leu Ser Phe Ala Asp Tyr Phe Arg Pro
65 70 75 80
Arg Tyr Phe Leu Leu Glu Asn Val Arg Asn Phe Val Ser Phe Asn Lys
85 90 95
Gly Gln Thr Phe Arg Leu Thr Leu Ala Ser Leu Leu Glu Met Gly Tyr
100 105 110
Gln Val Arg Phe Gly Ile Leu Glu Ala Gly Ala Tyr Gly Val Ser Gln
115 120 125
Ser Arg Lys Arg Ala Phe Ile Trp Ala Ala Ser Pro Glu Asp Val Leu
130 135 140
Pro Glu Trp Pro Glu Pro Met His Val Phe Ser Ala Pro Glu Leu Lys
145 150 155 160
Ile Thr Leu Ser Glu Asn Val Gln Tyr Ala Ala Val Arg Ser Thr Ala
165 170 175
Asn Gly Ala Pro Leu Arg Ser Ile Thr Va1 Gln Asp Thr Ile Gly Asp
180 185 190
Leu Pro Ala Val Gly Asn Gly Ala Ser Lys Gly Asn Met Glu Tyr Gln
195 200 205
Asn Asp Pro Val Ser Trp Phe Gln Lys Lys Ile Arg Gly Asp Met Val
210 215 220
Val Leu Thr Asp His Ile Ser Lys Glu Met Asn Glu Leu Asn Leu Ile
225 230 235 240
Arg Cys Gln Lys Ile Pro Lys Arg Pro Gly Ala Asp Trp Arg Asp Leu
245 250 255
Pro Glu Glu Lys Val Lys Leu Asn Ile
260 265
<210>33
<211>1594
<212>DNA
<213>大豆(Glycine max)
<220>
<221>misc_feature
<222>(0)...(0)
<223>Ceres克隆:520982 Met1同系物
<400>33
aattgcaatg ttattcttag ggctgtaatg gagaagtgtg gggacacaga tgattgtatc 60
tcaacatccg aagctgcaga attggctgca aagcttgatg agaaggaaat aagtagttta 120
ccaatgcctg gacaagttga tttcatcaat ggtggtcctc catgtcaggg tttctctggg 180
atgaataggt ttaaccagag cagttggagt aaagtccagt gtgagatgat attggcattc 240
ttatcctttg ccgattattt ccggccaagg tatttcttgt tggagaatgt gaggaacttt 300
gtgtctttca ataaagggca gacattccgt ttaactttgg cttcacttct tgagatgggc 360
tatcaggtga ggtttggtat ccttgaggct ggagcatatg gggtttccca gtcaagaaaa 420
agggcattca tatgggcagc ctctcctgag gatgtgcttc ctgaatggcc tgaaccaatg 480
catgtctttt cggcccctga gttgaagatt acattatcag aaaatgtcca gtatgctgct 540
gtccgcagta ctgcaaatgg tgctccatta cgttcaataa ctgttcaaga tactattggt 600
gatctcccag ctgttggcaa tggagcctca aaaggaaaca tggagtatca aaatgatcca 660
gtctcatggt ttcaaaagaa gattcgaggt gatatggttg tcttgactga tcatatatca 720
aaggagatga atgaattgaa cttgattcga tgccagaaaa ttcccaagag accaggcgct 780
gattggcgtg accttccaga agaaaaggtg aagttaaata tttgagtttt agcataacat 840
tttttgtgat ctatctaata tgtgaaatct aatgaaatgc agataaaatt gtctactgga 900
caagttgttg atttgatacc atggtgcttg ccaaacacgg ctaagcggca caatcagtgg 960
aagggactgt ttggcaggtt ggattggcaa gggaatttcc caacttccat tactgaccct 1020
cagccaatgg ggaaggttgg aatgtgcttc caccctgacc aagataggat tcttactgtt 1080
cgtgaatgtg ctcggtctca aggcttccca gatagctatc aatttgctgg caatatcata 1140
cacaagcacc ggcagattgg taatgctgtg cctcctcctc tggcatctgc attggggaga 1200
aagctcaagg aagcagtgga cagtaagagctccacttaga agatggggct tctacatttt 1260
ttgaaatatc atgcttattg tattcatatc agtcaccaag atattgcaaa tcattattca 1320
gggttccaga aactagaaac ccttgtatat agtgatatcc attggtcatt tgttttgagg 1380
ctaattcctt gtttaacttt cctcaaccaa ggaattgtat ggatgatgtt atgatgttca 1440
ttttctatca actagtattt tcttgattag ataatatttt ggctgtttat gacagaaatg 1500
gctgggaatt tagaattacc tcccaatgta tatagttgac aattgagacc aattttgtca 1560
ttttttttaa cttgttatga atatttgttg ttgc 1594
<210>34
<211>1554
<212>PRT
<213>豌豆(Pisum sativum)
<220>
<221>肽
<222>(0)...(0)
<223>gi|2654108|gb|AAC49931.1|胞嘧啶-5 DNA
甲基转移酶
<400>34
Met Gly Ser Ala Ser Leu Leu Asn Pro Ser Asp Ser Ser Leu Pro Gly
1 5 10 15
Gly Lys Asp Ser Thr Ser Lys Glu Glu Pro Val Ser Asn Thr Glu Gly
20 25 30
Glu Val Met Ala Gly Gly Lys Gln Lys Lys Arg Ser Leu Ser Glu Ser
35 40 45
Ser Glu Gln Pro Ala Pro Thr Arg Lys Val Pro Lys Arg Ser Ala Ser
50 55 60
Ala Ala Ser Lys Asn Leu Lys Glu Lys Ser Phe Ser Ile Ser Asp Lys
65 70 75 80
Ser Cys Leu Val Glu Thr Lys Lys Asp Gln Val Ala Glu Gly Glu Leu
85 90 95
Leu Ala Val Arg Met Thr Ala Gly Gln Glu Asp Asp Arg Pro Asn Arg
100 105 110
Arg Leu Thr Asp Phe Ile Leu His Asp Glu Ser Gly Ala Ala Gln Ala
115 120 125
Leu Glu Met Leu Glu Ile Lys Asp Leu Phe Ile Thr Gly Leu Ile Leu
130 135 140
Pro Leu Glu Gly Asn Ala Asp Lys Lys Lys Glu Gln Gly Val Arg Cys
145 150 155 160
His Gly Phe Gly Arg Ile Glu Ser Trp Asp Ile Ser Gly Tyr Glu Asp
165 170 175
Gly Ser Pro Val Ile Trp Ile Ser Thr Glu Ile Ala Asp Tyr Asp Cys
180 185 190
Gln Lys Pro Ala Gly Thr Tyr Lys Lys Tyr Tyr Asp Leu Phe Phe Glu
195 200 205
Lys Ala Arg Ala Cys Leu Glu Val Tyr Lys Lys Leu Ala Lys Ser Ser
210 215 220
Gly Gly Asp Pro Asp Ile Ser Leu Asp Glu Leu Leu Ala Gly Met Ala
225 230 235 240
Arg Ser Met Ser Gly Ser Lys Tyr Phe Ser Gly Thr Ala Ser Leu Lys
245 250 255
Glu Phe Ile Ile Ser Gln Gly Asp Phe Ile Tyr Lys Gln Leu Ile Gly
260 265 270
Leu Asp Thr Met Leu Lys Ala Asn Asp Lys Gly Phe Glu Asp Ile Pro
275 280 285
Ala Leu Ile Ala Leu Arg Asp Glu Ser Lys Lys Gln Ala His Phe Ala
290 295 300
Asn Thr Gln Val Arg Pro Ser Asn Ala Thr Leu Arg Ile Gly Ser Gly
305 310 315 320
Ile Val Asp Glu Glu Lys Lys Asn Gln Met Asp Ser Val Asp Glu Glu
325 330 335
Asp Glu Asp Ala Lys Leu Ala Arg Leu Leu Gln Asp Glu Glu Tyr Trp
340 345 350
Lys Ser Asn Arg Gln Arg Lys Asn Ser Arg Ser Ser Ser Ser Ser Asn
355 360 365
Lys Phe Tyr Ile Lys Ile Asn Glu Asp Glu Ile Ala Asn Asp Tyr Pro
370 375 380
Leu Pro Ala Tyr Tyr Lys Thr Ser Leu Gln Glu Thr Asp Glu Phe Ile
385 390 395 400
Val Phe Asp Asn Asp Cys Asp Ile Tyr Asp Thr Glu Asp Pro Ser Arg
405 410 415
Ser Met Leu His Asn Trp Ala Leu Tyr Asn Ser Asp Ser Arg Leu Ile
420 425 430
Ser Leu Glu Leu Leu Pro Met Lys Pro Cys Ser Glu Met Asp Val Thr
435 440 445
Ile Phe Gly Ser Gly Thr Met Thr Ser Asp Asp Gly Ser Gly Phe Asn
450 455 460
Leu Asp Thr Glu Ala Gly Gln Ser Ser Val Ala Ser Gly Ala Gln Asp
465 470 475 480
Thr Asp Gly Ile Pro Ile Tyr Leu Ser Ala Ile Lys Glu Trp Met Ile
485 490 495
Glu Phe Gly Ser Ser Met Val Phe Ile Ser Ile Arg Thr Asp Leu Ala
500 505 510
Gly Ile Gly Leu Gly Lys Pro Ser Lys Gln Tyr Thr Pro Trp Tyr Asp
515 520 525
Thr Val Leu Lys Thr Ala Arg Ile Ala Ile Ser Ile Ile Thr Leu Leu
530 535 540
Lys Glu Gln Ser Arg Val Ser Arg Leu Ser Phe Pro Asp Val Ile Lys
545 550 555 560
Lys Val Ser Glu Tyr Thr Gln Asp Asn Lys Ser Tyr Ile Ser Ser Asp
565 570 575
Pro Leu Ala Val Glu Arg Tyr Ile Val Val His Gly Gln Ile Ile Leu
580 585 590
Gln Leu Phe Ala Glu Phe Pro Asp Asp Lys Ile Arg Lys Ser Pro Phe
595 600 605
Val Thr Gly Leu Met Asn Lys Met Glu Glu Arg His His Thr Lys Trp
610 615 620
Leu Val Lys Lys Lys Lys Leu Ser Pro Lys Ser Glu Pro Asn Leu Asn
625 630 635 640
Pro Arg Ala Ala Met Ala Pro Val Val Ser Lys Arg Lys Ala Met Gln
645 650 655
Ala Thr Ala Thr Lys Leu Ile Asn Arg Ile Trp Gly Glu Tyr Tyr Ser
660 665 670
Asn His Leu Pro Glu Glu Ser Lys Glu Gly Thr Ala Ile Glu Glu Lys
675 680 685
Asp Asp Asp Glu Ala Glu Glu Gln Glu Glu Asn Glu Asp Glu Asp Ala
690 695 700
Glu Glu Glu Thr Val Leu Leu Glu Glu Thr Leu Lys Pro Arg Ile Val
705 710 715 720
Ser Lys Gln Ile Lys Ala Phe Ser Asp Asp Gly Glu Val Arg Trp Glu
725 730 735
Gly Val Pro Glu Arg Lys Thr Ser Ser Gly Leu Pro Leu Tyr Lys Gln
740 745 750
Ala Ile Ile His Gly Gly Ser Cys Phe Cys Gly Asn Ile Cys Val Ser
755 760 765
Arg Lys Leu Met Asn Gln Met Ser Phe Leu Ile Tyr Ile Thr Leu Asn
770 775 780
Ile Cys Leu Asn Pro Lys Asn Gly Glu Lys Met Phe His Gly Arg Met
785 790 795 800
Met Gln His Gly Cys His Thr Val Leu Gly Asn Ala Ala Ser Glu Arg
805 810 815
Glu Val Phe Leu Thr Asn Glu Cys Arg Asp Leu Gly Leu Gln Asp Val
820 825 830
Lys Gln Ile Asn Val Ala Ser Ile Arg Lys Thr Pro Trp Gly His Gln
835 840 845
His Arg Lys Ala Ser Ala Ala Ala Gly Lys Ile Asp Arg Glu Arg Ala
850 855 860
Asp Glu Arg Lys Lys Lys Gly Leu Pro Thr Glu Tyr Tyr Cys Lys Ser
865 870 875 880
Leu tyr Trp Pro Glu Arg Gly Ala Phe Phe Ser Leu Pro Phe Asp Thr
885 890 895
Leu Gly Leu Gly Ser Gly Val Cys His Ser Cys Asn Ile Gln Glu Ala
900 905 910
Asp Lys Ala Lys Glu Ile Phe Lys Val Asn Ser Ser Lys Ser Ser Phe
915 920 925
Val Leu Asp Gly Thr Glu Tyr Ser Leu Asn Asp Tyr Val Tyr Val Ser
930 935 940
Pro Phe Glu Phe Glu Glu Lys Ile Glu Gln Gly Thr His Lys Ser Gly
945 950 955 960
Arg Asn Val Gly Leu Lys Ala Phe Val Val Cys Gln Val Leu Glu Ile
965 970 975
Ile Ala Lys Lys Glu Thr Lys Gln Ala Glu Ile Lys Ser Thr Glu Leu
980 985 990
Lys Val Arg Arg Phe Phe Arg Pro Glu Asp Val Ser Ser Glu Lys Ala
995 1000 1005
Tyr Cys Ser Asp Val Gln Glu Val Tyr Phe Ser Asp Glu Thr Tyr Thr
1010 1015 1020
Ile Ser Val Gln Ser Val Glu Gly Lys Cys Glu Val Arg Lys Lys Ile
1025 1030 1035 1040
Asp Ile Pro Glu Gly Ser Ala Pro Gly Ala Phe His Asn Val Phe Phe
1045 1050 1055
Cys Glu Leu Leu Tyr Asp Pro Ala Thr Gly Ser Leu Lys Lys Leu Pro
1060 1065 1070
Ser His Ile Lys Val Lys Tyr Ser Ser Gly Pro Thr Ala Asp Asn Ala
1075 1080 1085
Ala Arg Lys Lys Lys Gly Lys Cys Lys Glu Gly Asp Ser Ile Ser Val
1090 1095 1100
Pro Asp Ile Lys Ser Lys Thr Ser Asn Glu Asn Arg Leu Ala Thr Leu
1105 1110 1115 1120
Asp Ile Phe Ala Gly Cys Gly Ala Leu Ser Glu Gly Leu His Lys Ser
1125 1130 1135
Gly Ala Ser Ser Thr Lys Trp Ala Ile Glu Tyr Glu Glu Pro Ala Gly
1140 1145 1150
Asn Ala Phe Lys Ala Asn His Pro Glu Ala Leu Val Phe Ile Asn Asn
1155 1160 1165
Cys Asn Val Ile Leu Arg Ala Ile Met Glu Lys Cys Gly Asp Ile Asp
1170 1175 1180
Glu Cys Ile Ser Thr Ala Glu Ala Ala Glu Leu Ala Ser Lys Leu Asp
1185 1190 1195 1200
Asp Lys Asp Leu Asn Ser Leu Pro Leu Pro Gly Gln Val Asp Phe Ile
1205 1210 1215
Asn Gly Gly Pro Pro Cys Gln Gly Phe Ser Gly Met Asn Arg Phe Asn
1220 1225 1230
Thr Ser Thr Trp Ser Lys Val Gln Cys Glu Met Ile Leu Ala Phe Leu
1235 1240 1245
Ser Phe Ala Asp Tyr Phe Arg Pro Arg Tyr Phe Leu Leu Glu Asn Val
1250 1255 1260
Arg Asn Phe Val Ser Phe Asn Lys Gly Gln Thr Phe Arg Leu Thr Leu
1265 1270 1275 1280
Ala Ser Leu Leu Glu Met Gly Tyr Gln Val Arg Phe Gly Ile Leu Glu
1285 1290 1295
Ala Gly Ala Phe Gly Val Ser Gln Ser Arg Lys Arg Ala Phe Ile Trp
1300 1305 1310
Ala Ala Ser Pro Glu Asp Val Leu Pro Glu Trp Pro Glu Pro Met His
1315 1320 1325
Val Phe Ser Ala Pro Glu Leu Lys Ile Thr Leu Ala Glu Asn Val Gln
1330 1335 1340
Tyr Ala Ala Val Cys Ser Thr Ala Asn Gly Ala Pro Leu Arg Ala Ile
1345 1350 1355 1360
Thr Val Arg Asp Thr Ile Gly Glu Leu Pro Ala Val Gly Asn Gly Ala
1365 1370 1375
Ser Arg Thr Asn Met Glu Tyr Gln Ser Asp Pro Ile Ser Trp Phe Gln
1380 1385 1390
Lys Lys Ile Arg Gly Asn Met Ala Val Leu Thr Asp His Ile Ser Lys
1395 1400 1405
Glu Met Asn Glu Leu Asn Leu Ile Arg Cys Gln Lys Ile Pro Lys Arg
1410 1415 1420
Pro Gly Cys Asp Trp Arg Asp Leu Pro Asp Glu Lys Ile Lys Leu Ser
1425 1430 1435 1440
Thr Gly Gln Leu Val Asp Leu Ile Pro Trp Cys Leu Pro His Thr Ala
1445 1450 1455
Lys Arg His Asn Gln Trp Lys Gly Leu Phe Gly Arg Leu Asp Trp Gln
1460 1465 1470
Gly Asn Phe Pro Thr Ser Ile Thr Asp Pro Gln Pro Met Gly Lys Val
1475 1480 1485
Gly Met Cys Phe His Pro Asp Gln Asp Arg Ile Leu Thr Val Arg Glu
1490 1495 1500
Cys Ala Arg Ser Gln Gly Phe Pro Asp His Tyr Gln Phe Ser Gly Asn
1505 1510 1515 1520
Ile Ile His Lys His Arg Gln Ile Gly Asn Ala Val Pro Pro Pro Leu
1525 1530 1535
Ala Phe Ala Leu Gly Arg Lys Leu Lys Glu Ala Leu Asp Ser Lys Ser
1540 1545 1550
Ala Asn
<210>35
<211>4910
<212>DNA
<213>豌豆(Pisum sativum)
<220>
<221>misc_feature
<222>(0)...(0)
<223>AF034419.1;GI:2654107;胞嘧啶-5 DNA
甲基转移酶mRNA,完整的cds
<400>35
cttcagatct acaacccgcg ttttggatac aaggaaaatt ttccaactca tgggttccgc 60
ttcgcttttg aatccctccg attcgtctct accgggtggc aaggacagca cgagtaaaga 120
agagcctgtt tcaaacactg aaggggaagt tatggctggt ggtaagcaaa agaagcgaag 180
tttgtcagag agcagtgagc agcctgctcc tactcggaaa gtgccgaaac gatctgcaag 240
tgcagcaagt aaaaatttga aggagaagtc tttttccata tctgataagt cttgtcttgt 300
tgaaactaag aaggatcagg ttgcagaagg agaattgcta gcagtccgta tgactgctgg 360
acaagaggat gaccgcccaa atagaagact tacagacttt atccttcatg atgaaagtgg 420
tgcagcacag gcacttgaga tgcttgaaat caaggattta ttcatcactg gacttatatt 480
gccactagaa ggaaatgctg acaagaaaaa agagcaaggt gttagatgtc atggttttgg 540
tcgaattgag tcatgggaca tatctggtta tgaggatggc tctccagtga tatggatttc 600
tactgagatt gctgactatg attgccagaa accagctggt acctacaaaa aatactatga 660
tcttttcttt gaaaaagctc gggcttgctt agaagtgtac aaaaaactag caaagtcttc 720
tgggggagat cctgacataa gccttgatga gttacttgct ggcatggcac ggtcaatgag 780
tggtagcaag tacttttctg gaactgcatc actaaaggaa ttcattattt ctcagggtga 840
ttttatttat aagcaactca ttggtttaga cacaatgttg aaggcaaatg acaaggggtt 900
tgaagatatt cctgctttga ttgctcttag agatgagagc aagaaacaag cacactttgc 960
aaacacacaa gtgaggccat caaatgcgac tttaaggatt ggttcgggaa ttgtagatga 1020
agagaaaaag aatcagatgg attctgtaga tgaagaggat gaggatgcaa agttagctcg 1080
actattgcag gatgaagagt attggaaatc taacaggcag aggaaaaact ctagatcatc 1140
atcttcatct aataaattct atatcaagat taatgaagat gagattgcaa atgattatcc 1200
tctccctgct tattataaaa cttctcttca agaaacggat gaatttatag tttttgataa 1260
tgactgtgac atatatgaca ctgaagatcc ttctagaagc atgttgcaca attgggcttt 1320
atacaactct gattctagat tgatttccct ggaacttctt cccatgaaac cttgttcaga 1380
gatggatgtt acaatctttg gatcaggtac aatgacttca gatgatggaa gtggtttcaa 1440
tcttgataca gaggctggcc aatcttccgt tgcttctgga gcacaagaca ctgatggtat 1500
tccaatttat ctgagtgcaa taaaagagtg gatgattgaa tttggatcat ctatggtttt 1560
catatccatc cgaacagatt tggctggtat aggacttggc aaaccatcaa agcagtacac 1620
tccttggtat gacacagtat tgaaaactgc aagaattgct ataagcatta tcacgttgtt 1680
gaaggagcag agccgtgtat cacggctttc atttccagat gttataaaaa aagtatctga 1740
gtatactcag gacaataagt catatatttc ttctgatcca ttggctgtag aaagatatat 1800
tgttgtccat ggacagataa ttctgcaact atttgcagaa tttccagatg acaagatcag 1860
gaagtctcct ttcgtgactg gtcttatgaa caaaatggaa gaaaggcacc ataccaaatg 1920
gttagtgaag aagaagaaac tgtcgccaaa gagtgagcca aatttgaatc ctagggcagc 1980
aatggctcct gttgtatcta aaaggaaagc tatgcaagct acagcaacaa agctaatcaa 2040
tagaatatgg ggtgagtatt actcaaacca cttacccgag gaatcaaaag aaggaactgc 2100
tattgaagaa aaggatgatg atgaagcaga ggaacaggaa gagaatgaag acgaggatgc 2160
tgaggaagag acagtactgt tggaggaaac actaaagcca cgtatagttt ccaaacagat 2220
taaagcattt tctgatgatg gagaggttag atgggaaggg gttcccgaaa ggaaaaccag 2280
ttctggattg cctctttata agcaggcaat tattcatgga ggaagttgtt tctgtgggaa 2340
tatctgtgtc agtcggaagt tgatgaatca gatgagcttc ctgatatata ttacattgaa 2400
tatatgtttg aatccaaaga atggggaaaa gatgtttcat ggtaggatga tgcaacatgg 2460
ttgtcacact gttcttggca atgccgcaag tgagagagag gtgtttttga ctaatgagtg 2520
cagggatttg ggactgcaag atgttaagca gataaatgtt gcaagcatcc gaaaaacacc 2580
ttgggggcat cagcatcgaa aggctagtaa tgctgcaggt aaaatcgata gagagagagc 2640
tgatgaaagg aagaagaaag gactgcctac tgaatattac tgtaaaagct tgtactggcc 2700
tgagaggggt gctttcttca gtcttccgtt tgatacgctg ggtttagggt ctggtgtctg 2760
tcactcttgc aatatacaag aagctgacaa ggcgaaggaa attttcaaag taaattcgtc 2820
taagtctagt tttgtattgg atggaacaga atattctctc aatgactatg tttatgtaag 2880
cccttttgaa tttgaggaaa agatagagca gggaactcat aagagtggga ggaatgtagg 2940
gctgaaagct tttgttgtat gccaagtgct cgagatcatt gccaaaaagg aaacaaaaca 3000
agctgaaata aaatctacag aactcaaagt cagaagattc tttcgaccag aagatgtatc 3060
aagtgagaaa gcatactgct ctgatgtaca agaggtgtat ttcagtgatg aaacatatac 3120
tatctctgtt caatctgtag aaggtaaatg tgaagtcagg aaaaagattg atatccctga 3180
aggaagtgcc cctggagcct ttcacaatgt ctttttctgt gaactcctgt atgatcctgc 3240
cacaggatcg ctcaagaagt tgccatctca tatcaaagta aaatattcta gtggacctac 3300
agctgataat gcagctagaa agaaaaaggg aaaatgtaaa gagggagata gcatttcagt 3360
gcctgatata aaaagtaaaa catcaaatga aaaccgttta gcaaccctgg acatttttgc 3420
aggatgcggt gccttatcag aggggttgca taagtctggt gcttcatcaa ctaaatgggc 3480
tattgaatat gaagaaccag ctggcaatgc attcaaagct aatcatcctg aagctttggt 3540
gtttattaac aactgtaatg taattctcag ggctataatg gagaaatgtg gagatataga 3600
tgaatgtatc tcaacagccg aggctgcaga attggcctct aagcttgatg ataaggattt 3660
gaatagttta ccattacctg ggcaagttga tttcattaat ggggggcctc catgccaggg 3720
tttctctggg atgaatagat ttaacacaag cacttggagt aaagtccagt gtgagatgat 3780
attagcgttc ttatcctttg ctgattattt ccggccgagg tatttcctct tggagaatgt 3840
gaggaacttt gtgtctttta ataaaggaca gactttccgt ttaactttgg cttcacttct 3900
cgagatgggt taccaggtga ggtttggtat cctcgaggct ggagcttttg gtgtttctca 3960
gtcaagaaaa agggcattta tatgggctgc ctctccagaa gatgtgcttc ctgagtggcc 4020
agaaccaatg catgtcttct ctgcccctga gttgaaaatc acattggcag aaaatgtcca 4080
gtatgctgcc gtctgcagta ctgcaaatgg tgctccgtta cgggcaataa ctgttcgtga 4140
taccattggt gaactcccag ctgttggcaa tggagcctct aggacaaaca tggagtatca 4200
aagcgatcct atctcgtggt ttcaaaagaa gatccgaggc aatatggctg tcttgactga 4260
tcatatatca aaggaaatga atgagttgaa cttgatccga tgtcagaaaa ttcctaagag 4320
accaggttgt gattggcgtg atcttccaga cgaaaagata aaactttcaa ctggacaact 4380
tgttgatttg ataccatggt gcttgccaca cacagctaag aggcataatc aatggaaggg 4440
actgtttggt aggttagatt ggcaagggaa tttcccaact tccatcaccg accctcaacc 4500
aatggggaag gttggaatgt gcttccatcc cgatcaagat agaattctta ctgttcgtga 4560
atgcgcccga tctcaaggct ttccagacta ctatcaattt tctggtaaca tcatacacaa 4620
gcacaggcag attggtaacg cggttcctcc tcctctggca tttgcattag gaaggaaact 4680
caaggaagca ttggatagta agagcgccaa ttagaggatt agggcgcatc tttcaaaaag 4740
catcttttta tcatatagtt ttgtctttca gtgttctgga aacaacccaa cccttgtata 4800
tagttgtttt cttggctatt tttcttagtt taatcaattc tttgtttaaa aggattgatg 4860
gaatggatta tgctataaaa ctcatttttt ctatcaaaaa aaaaaaaaaa 4910
<210>36
<211>1545
<212>PRT
<213>Daucus carota
<220>
<221>肽
<222>(0)...(0)
<223>gi|2895087|gb|AAC39355.1|Met1-类型
胞嘧啶DNA-甲基转移酶
<400>36
Met Gly Ser Ser Ala Val Val Asp Ala Pro Ala Leu Asp Ala Gly Leu
1 5 10 15
Glu Thr Lys Lys Asn Lys Arg Lys Asn Ala Asp Cys Asp Ser Glu Lys
20 25 30
Thr Ala Val Ser Gly Gln Lys Lys Gln Arg Ala His Ala Leu Lys Ser
35 40 45
Ser Glu Thr Pro Val Gly Ser Arg Lys Met Pro Lys Arg Ala Ala Ala
50 55 60
Cys Ala Asp Phe Lys Glu Lys Ser Ile Gln Ile Ser Lys Lys Ser Ser
65 70 75 80
Ile Ile Glu Thr Lys Lys Asp Arg Ser Val Asp Glu Glu Glu Val Ala
85 90 95
Val Arg Leu Thr Ala Gly Gln Glu Asp Gly Arg Pro Cys Arg Arg Leu
100 105 110
Thr Asp Phe Ile Phe His Asn Ser Asp Gly Ile Pro Gln Ala Phe Glu
115 120 125
Met Leu Glu Val Asp Asp Leu Tyr Ile Ser Gly Leu Ile Leu Pro Leu
130 135 140
Glu Asp Ser Ser Gln Lys Glu Ala Cys Ser Ile Lys Cys Glu Gly Phe
145 150 155 160
Gly Arg Ile Glu Asn Trp Ala Leu Ser Gly Tyr Glu Glu Gly Val Pro
165 170 175
Thr Ile Trp Val Ser Thr Asp Val Ala Asp Tyr Asp Cys Val Lys Pro
180 185 190
Ser Ala Ser Tyr Lys Lys His Tyr Glu His Leu Phe Ala Lys Ala Thr
195 200 205
Ala Cys Val Glu Val Tyr Lys Lys Leu Ser Lys Ser Ser Gly Gly Asn
210 215 220
Pro Asp Leu Ser Leu Asp Glu Leu Leu Ala Gly Val Val Arg Gly Leu
225 230 235 240
Ser Gly Met Lys Cys Phe Ser Arg Ser Val Ser Ile Lys Asp Phe Ile
245 250 255
Ile Ser Gln Gly Asp Phe Ile Tyr Asn Gln Leu Val Gly Leu Asp Glu
260 265 270
Thr Ser Lys Lys Thr Asp Gln Gln Phe Leu Glu Leu Pro Val Leu Ile
275 280 285
Ala Leu Arg Glu Glu Ser Ser Lys His Gly Asp Pro Ser Ile Gly Lys
290 295 300
Val Ala Ser Thr Asn Gly Thr Leu Thr Ile Gly Pro Lys Ile Lys Asp
305 310 315 320
Gly Glu Asn Lys Lys Asp Ser Ala Thr Glu Glu Asp Glu Gly Val Lys
325 330 335
Val Ala Arg Leu Leu Gln Glu Glu Glu Phe Trp Asn Ser Met Lys Gln
340 345 350
Lys Lys Gly Arg Gly Ser Ser Thr Ser Ser Asn Lys Tyr Tyr Ile Lys
355 360 365
Ile Asn Glu Asp Glu Ile Ala Asn Asp Tyr Pro Leu Pro Ala Tyr Tyr
370 375 380
Lys Thr Ala Asn Gln Glu Thr Asp Glu Tyr Ile Ile Phe Asp Gly Gly
385 390 395 400
Ala Asp Ala Cys Tyr Thr Asp Asp Leu Pro Arg Ser Met Leu His Asn
405 410 415
Trp Ala Leu Tyr Asn Ser Asp Ser Arg Leu Ile Ser Leu Glu Leu Leu
420 425 430
Pro Met Lys Gly Cys Ala Asp Ile Asp Val Thr Ile Phe Gly Ser Gly
435 440 445
Val Met Thr Glu Asp Asp Gly Thr Gly Phe Asn Leu Asp Gly Asp Thr
450 455 460
Ser Gln Ser Ser Ser Ala Gly Leu Gly Thr Ala Asn Val Asp Gly Ile
465 470 475 480
Pro Ile Tyr Leu Ser Ala Ile Lys Glu Trp Met Ile Glu Phe Gly Ser
485 490 495
Ser Met Val Phe Ile Ser Ile Arg Thr Asp Met Ala Trp Tyr Arg Leu
500 505 510
Gly Lys Pro Ser Lys Gln Tyr Ala Ser Trp Tyr Glu Pro Val Leu Lys
515 520 525
Thr Ala Arg Val Ala Ile Ser Ile Ile Thr Leu Leu Lys Glu Gln Ala
530 535 540
Arg Val Ser Arg Leu Ser Phe Met Asp Val Ile Lys Arg Val Ser Glu
545 550 555 560
Phe Glu Lys Gly His Pro Ala Tyr Ile Ser Ser Val Pro Ala Ala Val
565 570 575
Glu Arg Tyr Val Val Val His Gly Gln Ile Ile Leu Gln Gln Phe Leu
580 585 590
Glu Phe Pro Asp Glu Lys Ile Lys Lys Ser Ala Phe Val Ile Gly Leu
595 600 605
Thr Asn Lys Met Glu Glu Arg His His Thr Lys Trp Leu Met Lys Lys
610 615 620
Lys Lys Leu Leu Gln Arg Asp Glu Pro Asn Leu Asn Pro Arg Ala Ala
625 630 635 640
Leu Ala Pro Val Val Ser Lys Arg Lys Ala Met Gln Ala Thr Thr Thr
645 650 655
Arg Leu Ile Asn Arg Ile Trp Gly Glu Phe Tyr Ser Asn Tyr Ser Pro
660 665 670
Glu Asp Met Lys Glu Gly Ile Thr Gly Glu Asp Lys Glu Glu Glu Glu
675 680 685
Pro Glu Glu Gln Glu Glu Ile Glu Glu Glu Glu Glu Lys Glu Thr Leu
690 695 700
Thr Ala Leu Glu Lys Thr Pro Thr Pro Thr Ser Thr Pro Arg Lys Thr
705 710 715 720
Lys Ser Ile Pro Lys Val Lys Asp Ile Arg Trp Asn Arg Lys Ser Val
725 730 735
Gly Glu Thr Leu Ser Gly Glu Ala Leu Tyr Lys Gln Ala Ile Val Tyr
740 745 750
Gly Thr Glu Ile Ala Val Gly Gly Ala Val Leu Val Asp Asp Glu Ser
755 760 765
Ala Gln Leu Pro Ala Ile Tyr Tyr Val Glu Tyr Met Phe Glu Thr Leu
770 775 780
Asn Gly Ile Lys Met Leu His Gly Arg Met Leu Gln Gln Gly Ser Leu
785 790 795 800
Thr Ile Leu Gly Asn Thr Ala Asn Glu Cys Glu Val Phe Leu Thr Asn
805 810 815
Asp Cys Met Aap Phe Glu Leu Ala Asp Val Lys Lys Ala Val Val Glu
820 825 830
Ile Arg Ser Arg Pro Trp Gly His Gln Tyr Arg Lys Val Asn Ala Asn
835 840 845
Ala Asp Lys Ile Tyr Arg Ala Gly Val Glu Glu Arg Lys Lys Asn Gly
850 855 860
Leu Glu Thr Glu Tyr Tyr Cys Lys Ser Leu Tyr Cys Pro Asp Lys Gly
865 870 875 880
Ala Phe Leu Ser Leu Pro Leu Asn Ser Met Gly Leu Gly Ser Gly Ile
885 890 895
Cys Ser Ser Cys Lys Leu Asp Lys Asp Leu Thr Glu Lys Glu Lys Phe
900 905 910
Val Val His Ser Asp Lys Thr Ser Phe Val Phe Asn Gly Thr Glu Tyr
915 920 925
Ser Ile His Asp Phe Leu Tyr Val Ser Pro Gln Gln Phe Ser Thr Glu
930 935 940
Arg Val Gly Asn Glu Thr Phe Lys Gly Gly Arg Asn Val Gly Leu Lys
945 950 955 960
Ala Tyr Ala Ile Cys Gln Leu Leu Glu Ile Ile Val Pro Lys Ala Pro
965 970 975
Lys Gln Ala Glu Pro His Ser Thr Glu Ile Lys Val Arg Arg Phe Tyr
980 985 990
Arg Pro Glu Asp Ile Ser Asp Glu Lys Ala Tyr Cys Ser Asp Ile Arg
995 1000 1005
Glu Val Tyr Tyr Ser Glu Glu Thr His Thr Ile Asp Ala Glu Thr Val
1010 1015 1020
Glu Gly Arg Cys Glu Val Arg Lys Lys Asn Asp Leu Pro Ser Cys Asp
1025 1030 1035 1040
Ala Pro Thr Ile Phe Asp His Val Phe Phe Cys Glu Tyr Leu Tyr Asp
1045 1050 1055
Pro Ala Lys Gly Ser Leu Lys Gln Leu Pro Pro Asn Ile Lys Leu Arg
1060 1065 1070
Tyr Ser Ala Val Lys Gly Ala His Val Ser Ser Leu Arg Lys Asn Lys
1075 1080 1085
Gly Lys Cys Lys Glu Gly Glu Asp Asp Leu Asp Ser Leu Lys Ser Lys
1090 1095 1100
Val Asn Cys Leu Ala Thr Leu Asp Ile Phe Ala Gly Cys Gly Gly Leu
1105 1110 1115 1120
Ser Glu Gly Leu Gln Lys Ser Gly Val Cys Thr Thr Lys Trp Ala Ile
1125 1130 1135
Glu Tyr Glu Glu Ala Ala Gly Asp Ala Phe Lys Leu Asn His Pro Glu
114O 1145 1150
Ser Leu Met Phe Ile Asn Asn Cys Asn Val Ile Leu Lys Ala Ile Met
1155 1160 1165
Asp Lys Thr Gly Asp Ala Asp Asp Cys Ile Ser Thr Pro Glu Ala Ala
1170 1175 1180
Glu Leu Ala Ala Lys Leu Ser Glu Glu Glu Ile Lys Asn Leu Pro Leu
1185 1190 1195 1200
Pro Gly Gln Val Asp Phe Ile Asn Gly Gly Pro Pro Cys Gln Gly Phe
1205 1210 1215
Ser Gly Met Asn Arg Phe Asn Gln Ser Ser Trp Ser Lys Val Gln Cys
1220 1225 1230
Glu Met Ile Leu Ala Phe Leu Ser Phe Ala Asp tyr Tyr Arg Pro Lys
1235 1240 1245
Tyr Phe Leu Leu Glu Asn Val Arg Thr Phe Val Ser Phe Asn Lys Gly
1250 1255 1260
Gln Thr Phe Arg Leu Ala Ile Ala Ser Leu Leu Asp Met Gly Tyr Gln
1265 1270 1275 1280
Val Arg Phe Gly Ile Leu Glu Ala Gly Ala Tyr Gly Val Pro Gln Ser
1285 1290 1295
Arg Lys Arg Ala Phe Ile Trp Ala Ala Ser Pro Glu Glu Thr Leu Pro
1300 1305 1310
Glu Trp Pro Glu Pro Met His Val Phe Ala Ala Pro Glu Leu Lys Ile
1315 1320 1325
Ala Leu Pro Glu Asn Lys Tyr Tyr Ala Ala Val Arg Ser Thr Gln Thr
1330 1335 1340
Gly Ala Pro Phe Arg Ser Ile Thr Val Arg Asp Thr Ile Gly Asp Leu
1345 1350 1355 1360
Pro Met Val Ser Asn Gly Ala Ser Arg Thr Ser Ile Glu Tyr Gln Met
1365 1370 1375
Asp Pro Ile Ser Trp Phe Gln Lys Lys Ile Arg Ala Asn Met Met Val
1380 1385 1390
Leu Thr Asp His Ile Ser Lys Glu Met Asn Glu Leu Asn Leu Ile Arg
1395 1400 1405
Cys Gln Arg Ile Pro Lys Arg Arg Gly Ala Asp Trp Gln Asp Leu Pro
1410 1415 1420
Asp Glu Lys Val Lys Leu Ser Ser Gly Gln Leu Val Asp Leu Ile Pro
1425 1430 1435 1440
Trp Cys Leu Pro Asn Thr Ala Lys Arg His Asn Gln Trp Lys Gly Leu
1445 1450 1455
Phe Gly Arg Leu Asp Trp Glu Gly Ser Phe Pro Thr Ser Ile Thr Asp
1460 1465 1470
Pro Gln Pro Met Gly Lys Val Gly Met Cys Phe His Pro Asp Gln His
1475 1480 1485
Arg Ile Val Thr Val Arg Glu Cys Ala Arg Ser Gln Gly Phe Pro Asp
1490 1495 1500
Ser Tyr Gln Phe Tyr Gly Asn Ile Leu His Lys His Gln Gln Ile Gly
1505 1510 1515 1520
Asn Ala Val Pro Pro Pro Leu Ala Tyr Ala Leu Gly Met Lys Leu Lys
1525 1530 1535
Glu Ala Leu Glu Ser Lys Gly Cys Met
1540 1545
<210>37
<211>5097
<212>DNA
<213>Daucus carota
<220>
<221>misc_feature
<222>(0)...(0)
<223>AF007807.1;GI:2895086;Met1-类型 胞嘧啶
DNA-甲基转移酶mRNA,完整的cds
<400>37
atcgatttcc ccgaagaccc gaatcaaacc gggtcgggtc cattgcttta tgaaattgaa 60
ccgccaaaat gtatgggcgg gaaaggacaa ttaaaaaata tgtttgcgcg gttttttgtt 120
cttttccaaa atttgcagac gttttgggga taaataagag gacccagatc gataaagata 180
caagatagtc aaaagggtcc tataattcgt ggatttttag ttcagagttt gaattttttg 240
gttttgggtt cttgaaatct tggtttctgg ggtctttgtt tgatttgctt aatgggatct 300
tcagctgttg ttgatgctcc agctctcgat gcaggtcttg aaacgaagaa aaataagcga 360
aagaatgcag attgtgattc tgagaagaca gcagtaagtg gccaaaagaa acagagagca 420
catgccttaa agagtagtga gacacctgtt ggctcccgta aaatgccaaa gcgtgctgct 480
gcttgtgcag atttcaaaga gaaatctatt caaatatcta agaaatcttc aatcattgaa 540
acaaaaaagg accgttctgt agatgaagag gaagtagctg ttcggttaac ggctggacaa 600
gaagatggtc ggccatgtag gaggctaact gactttatat tccataattc tgatggcata 660
ccgcaggcct ttgaaatgtt ggaagttgat gatttatata tctctggcct gattttgcct 720
cttgaggaca gctcccaaaa ggaagcatgt agcatcaaat gtgaagggtt tggacgaatt 780
gagaactggg ctctatctgg ctatgaagaa ggggttccaa caatatgggt ctcaactgat 840
gttgcagatt atgattgtgt caaaccatca gctagttaca agaagcacta tgaacattta 900
tttgccaaag ctactgcttg tgttgaggtg tacaagaaac tgtcaaaatc ttcaggtgga 960
aatcctgatc tgagtttgga tgagttgctt gctggggttg ttcgtggact gagtggtatg 1020
aaatgctttt ctcgtagtgt atccatcaaa gatttcatta tatctcaggg tgactttatt 1080
tacaatcaac ttgttggctt ggatgagaca tctaagaaaa ctgatcagca atttcttgag 1140
ctaccagtcc ttatagcttt aagagaagaa agtagcaagc atggagaccc ttctatcgga 1200
aaggttgcat ctactaatgg aacattaaca attggtccaa aaattaaaga cggtgagaac 1260
aaaaaggatt ctgcaacaga ggaagatgag ggtgtaaaag tggcaagatt gttgcaggaa 1320
gaagagttct ggaactcaat gaagcagaaa aaaggccggg gatcaagcac ttcttctaac 1380
aaatattaca taaaaattaa tgaggatgag attgctaatg actatcctct accagcatat 1440
tacaagacag ctaaccaaga aacggatgaa tatataattt ttgatggcgg tgctgatgcg 1500
tgttatactg atgatttgcc tcgaagtatg cttcataact gggcattgta caactctgac 1560
tcgaggctca tttccttgga gctccttcca atgaaagggt gtgctgatat tgatgtcact 1620
atatttggat caggggtgat gactgaggat gatggaactg gattcaatct tgatggtgac 1680
acgtctcaat cttcctcagc tggattgggg acagcaaatg ttgatgggat cccaatatac 1740
ctgagtgcta taaaggaatg gatgattgaa tttggatcct caatggtttt tatatcaatt 1800
cgcacagata tggcctggta taggcttggt aagccatcaa aacagtatgc atcgtggtat 1860
gaaccagttc ttaaaacggc cagggtcgct ataagtatta ttacattatt aaaggagcag 1920
gccagggttt ctcgtctttc ttttatggat gtcattaaaa gagtttcgga gtttgaaaag 1980
ggtcatcctg cttacatatc atctgttccg gcagctgttg agagatatgt agttgtgcat 2040
ggacaaataa ttttgcagca gttcttagaa tttcctgatg agaagattaa aaagtctgca 2100
tttgtgattg gtctcacaaa caaaatggaa gaaaggcacc acactaaatg gcttatgaag 2160
aagaagaagt tattgcagag ggatgaacca aacttaaatc ccagagcagc cctagcccct 2220
gtagtgtcta aaaggaaggc tatgcaggca acaactacac gactaatcaa cagaatctgg 2280
ggtgagtttt attcgaacta ctctccagaa gatatgaaag agggaataac tggtgaagat 2340
aaggaggaag aagaacctga agagcaagag gaaattgagg aggaagagga gaaggaaaca 2400
ttgactgctt tagaaaaaac tcctacaccc acctcaacgc caagaaaaac aaaatcaatt 2460
cctaaagtga aggacataag gtggaaccgt aaatctgttg gtgaaacatt aagtggtgaa 2520
gctctataca aacaagcaat agtttatgga actgaaattg cagttggggg tgctgttctg 2580
gtggatgacg aatctgccca acttccagcc atctattacg tggagtacat gtttgaaact 2640
ttgaatggca taaaaatgct tcatgggaga atgttgcaac aaggatccct aacaatactc 2700
gggaatacag caaatgaatg tgaagtattt ctcacgaatg attgtatgga ttttgaatta 2760
gcggatgtta aaaaagctgt tgtagaaatt cggtcaaggc cttggggaca ccagtacaga 2820
aaagtgaatg caaatgctga taaaatctat agagcaggag ttgaggagag gaaaaagaat 2880
ggattggaaa ctgaatacta ttgcaaaagc ttgtattgtc cagataaagg tgcttttctt 2940
agccttcctc ttaatagtat gggtctgggt tcaggcatat gcagctcttg caaattagat 3000
aaagatctca ctgaaaaaga aaaatttgta gtccactcag acaagacaag ttttgtgttc 3060
aacggaactg aatattctat tcatgatttt ctctacgtga gtcctcagca atttagtaca 3120
gaaagggtag ggaatgaaac cttcaagggt ggaagaaatg tgggattaaa agcttatgct 3180
atatgtcaac tactcgaaat tattgtcccc aaggcaccca aacaagctga gccacattct 3240
actgagatta aggtaaggag attttaccgg ccagaagaca tttcagatga gaaggcatac 3300
tgctctgaca ttcgagaggt ttattacagc gaagaaacac atacaattga tgccgagaca 3360
gttgaaggga gatgtgaagt gaggaaaaag aatgatcttc catcatgcga tgcgcctact 342O
atttttgatc atgtattctt ttgcgaatat ctgtacgatc ctgctaaagg atctctcaaa 3480
cagttgccac caaatatcaa attgaggtat tcagctgtga agggtgcaca tgtttcttct 3540
cttagaaaga acaagggtaa gtgtaaggaa ggggaggatg atttagattc tctgaaatca 3600
aaagtaaact gtttggcaac cttagacatc tttgctggtt gcggaggcct ttcagaagga 3660
ttgcagaaat ccggtgtttg tacaacgaag tgggcaattg agtatgaaga ggctgctgga 3720
gatgcattta agcttaacca tccagagtcg ttgatgttta tcaataattg caatgttatt 3780
ttaaaggcta tcatggataa gactggagat gcagatgatt gtatttcaac cccagaggct 3840
gcagaattag ctgcaaaatt aagtgaggag gaaataaaga atttgccgct gccaggacaa 3900
gtggatttta ttaatggagg gcccccatgt cagggatttt ctggaatgaa tagatttaac 3960
caaagcagct ggagtaaagt ccagtgtgag atgattttgg cgttcttatc ctttgctgat 4020
tattatcgac caaagtattt tcttcttgag aatgtcagga cttttgtgtc cttcaacaag 4080
ggacagacat ttcgtctagc tatagcttca cttcttgata tgggttacca ggttcggttt 4140
ggtatacttg aggctggagc atatggagtt cctcagtcta ggaagcgagc atttatctgg 4200
gcagcatctc ctgaagaaac tctcccagag tggccagagc ctatgcatgt ctttgctgca 4260
ccagagctaa aaattgcatt accagaaaac aagtactatg ctgctgtccg gagtactcaa a320
actggggcac catttagatc aatcactgtt agggatacaa taggagatct tccgatggtt 4380
agcaatgggg catctaggac aagtatagag tatcaaatgg atcctatctc ctggttccaa 4440
aagaaaatcc gtgcaaacat gatggtcttg acagatcaca tatcaaaaga aatgaatgaa 4500
ctcaatctca ttcgctgtca aagaatccct aagcggcgag gtgctgattg gcaagacctt 4560
cctgatgaaa aggtcaagct gtcttccggg caattagttg acttgatacc ttggtgcctt 4620
ccaaatacag ccaagaggca caaccagtgg aaggggctgt tcggaaggtt ggactgggag 4680
ggaagttttc caacttctat cactgacccc caaccaatgg gaaaggtcgg aatgtgcttc 4740
catcctgatc agcacaggat tgtaacagtc cgagagtgtg ctcgttctca aggcttccca 4800
gatagctacc agttttatgg taacattcta cacaagcacc aacaaattgg aaacgctgtt 4860
cctcctcctc tggcgtatgc actggggatg aaactcaaag aagccttaga gagtaaaggg 4920
tgcatgtagt ttctcactca cttgcctcgc tagtctgatt gaactgatgc aagcaatttg 4980
taaattaaaa tctactgttt agtcgtcgtt tcgtgcttgc aatagaaagc aactagaatt 5040
gtcataggtc tttcgaaaca ttggatcaat agaaagcaac tagaattgtt gtaggtc 5097
<210>38
<211>1559
<212>PRT
<213>番茄(Lycopersicon esculentum)
<220>
<221>肽
<222>(0)...(0)
<223>gi|2887280|emb|CAA05207.1|DNA
胞嘧啶-5-甲基转移酶
<400>38
Met Ala Ser Pro Gln Pro Asn Ser Glu Ser Val Leu Glu Leu Pro Asn
1 5 10 15
Asn Asp Lys Ser Gly His Lys Lys Asn Lys Arg Lys Gln Asp Ser Val
20 25 30
Ser Lys Arg Lya Ala Ser Ala Thr Gly Lys Lys Glu Lys Lys Gln Ala
35 40 45
Val Ser Glu Thr Ile Glu Glu Pro Thr Ala Gly Arg Lys Arg Pro Lys
50 55 60
Arg Ala Ala Ala Cys Ser Asp Phe Lys Glu Lys Ser Val His Leu Ser
65 70 75 80
Lys Lys Ser Ser Val Ile Glu Thr Lys Lys Asp His Cys Val Asp Glu
85 90 95
Glu Asp Val Ala Ile Arg Leu Thr Ala Gly Leu Gln Glu Ser Gln Arg
100 105 110
Pro Cys Arg Arg Leu Thr Asp Phe Val Phe His Asn Ser Glu Gly Ile
115 120 125
Pro Gln Pro Phe Gly Met Ser Glu Val Asp Asp Leu Phe Ile Ser Gly
130 135 140
Leu Ile Leu Pro Leu Glu Asp Ser Leu Asp Lys Val Lys Ala Lys Gly
145 150 155 160
Ile Arg Cys Glu Gly Phe Gly Arg Ile Glu Glu Trp Ala Ile Ser Gly
165 170 175
Tyr Glu Asp Gly Thr Pro Val Ile Trp Ile Ser Thr Glu Thr Ala Asp
180 185 190
Tyr Asp Cys Leu Lys Pro Ser Gly Ser Tyr Lys Lys Phe Tyr Asp His
195 200 205
Phe Leu Ala Lys Ala Thr Ala Cys Val Glu Val Tyr Lys Lys Leu Ser
210 215 220
Lys Ser Ser Gly Gly Asn Pro Asp Leu Ser Leu Asp Glu Leu Leu Ala
225 230 235 240
Gly Val Val Arg Ala Met Thr Gly Ile Lys Cys Phe Ser Gly Gly Val
245 250 255
Ser Ile Arg Asp Phe Val Ile Thr Gln Gly Gly Phe Ile Tyr Lys Glu
260 265 270
Leu Ile Gly Leu Asp Asp Thr Ser Lys Lys Thr Asp Gln Leu Phe Val
275 280 285
Glu Leu Pro Val Leu Ala Ser Leu Arg Asp Glu Ser Ser Lys His Glu
290 295 300
Thr Leu Ala Gln Pro Glu Thr Ile Ser Ser Gly Asn Gly Leu Arg Ile
305 310 315 320
Gly Pro Lys Ala Gly Asn Gly Gly Asp Lys Ile Val Glu Ser Gly Leu
325 330 335
Ala Asn Gly Pro Ala Pro Glu Asp Glu Asp Leu Lys Leu Ala Lys Leu
340 345 350
Leu His Glu Glu Glu Tyr Trp Cys Ser Leu Lys Gln Lys Lys Asp Arg
355 360 365
Asn Thr Ser Ser Ser Ser Ser Lys Ile Tyr Ile Lys Ile Asn Glu Asp
370 375 380
Glu Ile Ala Ser Asp Tyr Pro Leu Pro Ala Tyr Tyr Lys Thr Ser Asn
385 390 395 400
Glu Glu Thr Asp Glu Tyr Ile Val Phe Asp Ser Gly Val Glu Thr Tyr
405 410 415
His Ile Asp Glu Leu Pro Arg Ser Met Leu His Asn Trp Ala Leu Tyr
420 425 430
Asn Ser Asp Ser Arg Leu Ile Ser Leu Glu Leu Leu Pro Met Lys Ala
435 440 445
Cys Ala Asp Ile Asp Val Thr Ile Phe Gly Ser Gly Val Met Thr Ala
450 455 460
Asp Asp Gly Ser Gly Tyr Asn Phe Asp Thr Asp Ala Asn His Ser Ser
465 470 475 480
Ser Gly Gly Ser Arg Ser Ala Glu Ile Asp Gly Met Pro Ile Tyr Leu
485 490 495
Ser Ala Ile Lys Glu Trp Met Ile Glu Phe Gly Ser Ser Met Ile Phe
500 505 510
Ile Ser Ile Arg Thr Asp Met Ala Trp Tyr Arg Leu Gly Lys Pro Leu
515 520 525
Lys Gln Tyr Ala Pro Trp Tyr Glu Pro Val Ile Lys Thr Ala Arg Leu
530 535 540
Ala Val Ser Ile Ile Thr Leu Leu Lys Glu Gln Asn Arg Val Ala Arg
545 550 555 560
Leu Ser Phe Gly Glu Val Ile Lys Arg Val Ser Glu Phe Lys Lys Asp
565 570 575
His Pro Ala Tyr Ile Ser Ser Asn Val Asp Ala Val Glu Arg Tyr Val
580 585 590
Val Val His Gly Gln Ile Ile Leu Gln Gln Phe Ser Glu Phe Pro Asp
595 600 605
Val Ser Ile Arg Asn Cys Ala Phe Ala Val Gly Leu Ser Arg Lys Met
610 615 620
Glu Glu Arg His His Thr Lys Trp Val Ile Lys Lys Lys Lys Val Met
625 630 635 640
Gln Arg Leu Glu Gln Asn Leu Asn Pro Arg Ala Ser Met Ala Pro Ser
645 650 655
Val Lys Arg Lys Ala Met Gln Ala Thr Thr Thr Arg Leu Ile Asn Arg
660 665 670
Ile Trp Gly Glu Tyr Tyr Ser Asn Tyr Ser Pro Glu Val Ser Lys Glu
675 680 685
Val Ala Asp Cys Glu Val Lys Asp Asp Glu Glu Pro Asp Glu Gln Glu
690 695 700
Glu Asn Glu Glu Asp Asp Val Pro Glu Arg Asn Leu Asp Val Pro Glu
705 710 715 720
Lys Ala His Thr Pro Ser Ser Thr Arg Arg His Ile Lys Ser Arg Ser
725 730 735
Asp Ser Lys Glu Ile Asn Trp Asp Gly Glu Ser Ile Gly Lys Thr Ala
740 745 750
Ser Gly Glu Gln Leu Phe Lys Lys Ala Arg Val His Gly His Glu Ile
755 760 765
Ala Val Gly Asp Ser Val Leu Val Glu His Asp Glu Pro Asp Glu Leu
770 775 780
Gly Cys Ile Tyr Phe Val Glu Tyr Met Phe Glu Lys Leu Asp Gly Ser
785 790 795 800
Lys Met Leu His Gly Lys Met Met Gln Arg Gly Ser Asp Thr Val Leu
805 810 815
Gly Asn Ala Ala Asn Glu Arg Glu Val Phe Leu Ile Asn Glu Cys Met
820 825 830
Asn Leu Gln Leu Gly Asp Val Lys Glu Ser Ile Ala Val Asn Ile Arg
835 840 845
Met Met Pro Trp Gly His Gln His Arg Asn Thr Asn Ala Asp Lys Leu
850 855 860
Glu Thr Ala Lys Ala Glu Asp Arg Lya Arg Lys Gly Leu Pro Thr Glu
865 870 875 880
Phe Tyr Cys Lys Ser Phe Tyr Arg Pro Glu Lys Gly Ala Phe Phe Arg
885 890 895
Leu Pro Phe Asp Lys Met Gly Leu Gly Asn Gly Leu Cys Tyr Ser Cys
900 905 910
Glu Leu Gln Gln Thr Asp Gln Glu Lys Glu Ser Phe Lys Phe Asp Met
915 920 925
Ser Lys Ser Ser Phe Val Tyr Leu Gly Thr Glu Tyr Ser Val Asp Asp
930 935 940
Phe Val Tyr Val Ser Pro Asp His Phe Thr Ala Glu Arg Gly Gly Asn
945 950 955 960
Gly Thr Phe Lys Ala Gly Arg Asn Val Gly Leu Met Ala Tyr Val Val
965 970 975
Cys Gln Leu Leu Glu Ile Val Gly Pro Lys Gly Ser Lys Gln Ala Lys
980 985 990
Val Asp Ser Thr Asn Val Lys Val Arg Arg Phe Phe Arg Pro Glu Asp
995 1000 1005
Ile Ser Ser Asp Lys Ala Tyr Ser Ser Asp Ile Arg Glu Ile Tyr Tyr
1010 1015 1020
Ser Glu Asp Ile His Thr Val Pro Val Glu Ile Ile Lys Gly Lys Cys
1025 1030 1035 1040
Glu Val Arg Lys Lys Tyr Asp Ile Ser Ser Glu Asp Val Pro Ala Met
1045 1050 1055
Phe Asp His Ile Phe Phe Cys Glu Tyr Leu Tyr Asp Pro Leu Asn Gly
1060 1065 1070
Ser Leu Lys Lys Leu Pro Ala Gln Ile Asn Leu Ile Leu Ser Lys Ile
1075 1080 1085
Lys Leu Asp Asp Ala Thr Ser Arg Lys Arg Lys Gly Lys Gly Lys Glu
1090 1095 1100
Gly Val Asp Glu Val Gly Glu Leu Asn Glu Thr Ser Pro Gln Asn Arg
1105 1110 1115 1120
Leu Ser Thr Leu Asp Ile Phe Ala Gly Cys Gly Gly Leu Ser Glu Gly
1125 1130 1135
Leu Gln His Ser Gly Val Thr Asp Thr Asn Trp Ala I1e Glu Tyr Glu
1140 1145 1150
Ala Pro Ala Gly Asp Ala Phe Arg Leu Asn His Pro Lys Thr Lys Val
1155 1160 1165
Phe Ile His Asn Cys Asn Val Ile Leu Arg Ala Val Met Gln Lys Cys
1170 1175 1180
Gly Asp Ser Asp Asp Cys Ile Ser Thr Pro Glu Ala Ser Glu Leu Ala
1185 1190 1195 1200
Ala Ala Met Asp Glu Ser Glu Leu Asn Ser Leu Pro Leu Pro Gly Gln
1205 1210 1215
Val Asp Phe Ile Asn Gly Gly Pro Pro Cys GlG Gly Phe Ser Gly Met
1220 1225 1230
Asn Arg Phe Asn Gln Ser Thr Trp Ser Lys Val Gln Cys Glu Met Ile
1235 1240 1245
Leu Ala Phe Leu Ser Phe Ala Asp Tyr Tyr Arg Pro Lys Phe Phe Leu
1250 1255 1260
Leu Glu Asn Val Arg Asn Phe Val Ser Phe Asn Gln Lys Gln Thr Phe
1265 1270 1275 1280
Arg Leu Thr Val Ala Ser Leu Leu Glu Met Gly Tyr Gln Val Arg Phe
1285 1290 1295
Gly Ile Leu Glu Ala Gly Ala Tyr Gly Val Pro Gln Ser Arg Lys Arg
1300 1305 1310
Ala Phe Ile Trp Ala Gly Ser Pro Glu Glu Val Leu Pro Glu Trp Pro
1315 1320 1325
Glu Pro Met His Val Phe Ala Val Pro Glu Leu Lye Ile Ala Leu Ser
1330 1335 1340
Glu Thr Ser Tyr Tyr Ala Ala Val Arg Ser Thr Ala Ser Gly Ala Pro
1345 1350 1355 1360
Phe Arg Ser Leu Thr Val Arg Asp Thr Ile Gly Asp Leu Pro Val Val
1365 1370 1375
Gly Asn Gly Ala Ser Lys Thr Cys Ile Glu Tyr Gln Gly Asp Pro Val
1380 1385 1390
Ser Trp Phe Gln Lys Lys Ile Arg Gly Ser Ser Ile Thr Leu Ser Asp
1395 1400 1405
His Ile Ser Lys Glu Met Asn Glu Leu Asn Leu Ile Arg Cys Gln Arg
1410 1415 1420
Ile Pro Lys Arg Pro Gly Ala Asp Trp Arg Asp Leu Glu Asp Glu Lys
1425 1430 1435 1440
Val Lys Leu Ser Asn Gly Gln Leu Val Asp Leu Ile Pro Trp Cys Leu
1445 1450 1455
Pro Asn Thr Ala Lys Arg His Asn Gln Trp Lys Gly Leu Phe Gly Arg
1460 1465 1470
Leu Asp Trp Asp Gly Asn Phe Pro Thr Ser Ile Thr Asp Pro Gln Pro
1475 1480 1485
Met Gly Lys Val Gly Met Cys Phe His Pro Asp Gln Asp Arg Ile Val
1490 1495 1500
Thr Val Arg Glu Cys Ala Arg Ser Gln Gly Phe Pro Asp Ser Tyr Gln
1505 1510 1515 1520
Phe Ala Gly Asn Ile Leu His Lys His Arg Gln Ile Gly Asn Ala Val
1525 1530 1535
Pro Pro Pro Leu Ala Tyr Ala Leu Gly Arg Lys Leu Lys Glu Ala Val
1540 1545 1550
Glu Ser Lys Asn Arg Leu Thr
1555
<210>39
<211>5377
<212>DNA
<213>番茄(Lycopersicon esculentum)
<220>
<221>misc_feature
<222>(0)...(0)
<223>AJ002140.1;GI:2887279;DNA(胞嘧啶-5-)-甲基转移酶mRNA
<400>39
cccgcccaaa tccccccaaa aacctatctc atttgtcctc ttcctgttgg agaactcagc 60
aacagcacac cccatctccc tcaacttctc cgccgcacca gcttctactc tccatttccg 120
ccgaaaaatc acctttcacc ggcaaagcag cagcctctgg tccctccttt atcttctccc 180
ttcgctgctc tggccaccat cgctggtcgc tggcgagaac tacaacgaaa atcccttcgc 240
ctccgctctc ctctctccct cttccgccgc cctgctcctc acttctcact tctccattga 300
agtcgacgga cgggataacg gcagcgacga ctgctccgcc cagctactgt ggcgaagtag 360
cagcaagctg tgaccagcaa actcggcaaa ctccggccaa agcagcgata acaactcagg 420
ccagcagtgg ggcagcaacg actgctctgc ccagcagctt tggcgacggc aagctgtgac 480
cagcaaactc cggcgaagca gcgataacaa ctcaggccag cagtggggca gcaactccgg 540
ttggtgagga tggcgtcacc ccaacctaat tcggagtcgg tattagaact tccgaacaac 600
gacaaatctg gacacaaaaa gaacaaacgc aaacaagatt ctgtgtcaaa aaggaaggca 660
tctgcaactg gtaagaagga aaagaaacag gctgtttctg aaactattga ggagcccact 720
gctggacgta aaaggcctaa gcgagctgct gcctgttcag attttaaaga gaaatctgtg 780
catttatcaa aaaagtcttc agtcattgaa acaaagaagg accattgtgt agacgaagag 840
gatgtagcta ttaggttaac tgcgggtctg caagagtctc aacgaccctg tagaagatta 900
acggattttg tttttcataa ctcagaagga ataccacaac cgtttggaat gtctgaggtt 960
gatgatctgt ttatcagtgg cctcatttta ccacttgagg acagtcttga caaagtaaaa 1020
gcaaaaggaa ttagatgtga aggctttggg cgtattgaag aatgggctat ctctggctat 1080
gaagatggaa ctcctgtcat atggatctca actgagacag ctgattatga ttgtttaaaa 1140
ccctcaggta gttataagaa gttttatgac cacttcttgg ccaaggcgac ggcttgcgtt 1200
gaggtttata agaagctttc aaagtcatct ggagggaatc ctgatttaag tcttgacgag 1260
ttgcttgcag gggttgtccg agcgatgact ggcataaaat gcttttcagg tggagtatcc 1320
atcagggact ttgtcatcac tcagggcggg ttcatatata aggaacttat tggtctggat 1380
gatacatcaa agaagactga tcaacttttt gttgagctac ctgtcctagc ttcccttaga 1440
gatgaaagca gcaagcacga gacacttgca caaccagaga ctatatcatc tggtaatggt 1500
ctacgtattg gcccaaaagc aggaaatgga ggagacaaga tagttgaatc tggtttggcc 1560
aatggtccag cgccagaaga tgaagatcta aaattggcta aattgttgca tgaagaggag 1620
tattggtgct ccttgaagca gaagaaagac cgtaatacat cttcctcatc cagcaaaata 1680
tacatcaaga tcaatgagga tgagattgca agtgattatc ctttacctgc atattacaaa 1740
acatctaatg aagagactga tgagtatatt gtctttgaca gtggggttga aacataccat 1800
attgatgagt tgcctcgcag catgcttcat aattgggcat tatacaactc ggactcaagg 1860
ctaatatctt tagaactgct gccaatgaaa gcttgtgctg atattgatgt aaccattttt 1920
gggtctggag tgatgactgc tgatgatggg tctggctaca attttgacac agatgctaat 1980
cattcctctt caggtggttc tagatcagct gaaattgatg gaatgccaat ttacctgagt 2040
gctataaaag aatggatgat tgagtttggg tcctcaatga tctttatatc aattcggact 2100
gatatggcct ggtataggct tgggaagcca ttgaaacagt atgctccttg gtacgaacca 2160
gtcataaaga ctgcaagatt ggcagtgagc atcattactt tgttaaagga acagaatcgt 2220
gtggctagac tttcttttgg agaagttatt aaaagggttt cagagttcaa gaaagaccat 2280
cctgcttata tatcatctaa tgtagatgca gtggaaaggt atgtggttgt acatgggcaa 2340
attattctcc agcagttttc tgaatttcct gatgtaagca ttaggaattg tgcatttgcg 2400
gttggtctct caaggaaaat ggaagagagg caccatacaa aatgggtgat taagaagaag 2460
aaggtgatgc agagactgga acagaactta aatcctagag catctatggc gccatctgta 2520
aaaaggaaag ctatgcaggc tactacaaca aggctaatca acagaatctg gggggaatac 2580
tattccaatt actcacccga ggtgtcaaag gaggtggctg attgtgaggt gaaggatgat 2640
gaagaaccag atgagcaaga ggaaaatgaa gaggatgatg ttccggagag gaacttggat 2700
gttccagaga aagctcatac accttcttct acaagaaggc atattaagtc acgttctgac 2760
agcaaagaaa taaactggga tggggaatcc ataggtaaaa cagcttctgg tgaacagttg 2820
tttaaaaaag ctagagttca tggacatgag atagctgttg gagattcagt tctagtggaa 2880
catgatgaac cagatgagct tggttgtatt tactttgttg aatacatgtt tgaaaaattg 2940
gatggtagca aaatgcttca tggaaaaatg atgcaacgag gatctgacac tgtacttgga 3000
aatgcagcta atgagagaga ggtatttttg atcaatgaat gcatgaatct gcaactagga 3060
gatgtcaaag aaagtatagc tgtcaatatc agaatgatgc cttggggaca ccagcataga 3120
aacacgaatg ctgataaact tgaaacagca aaagcagaag acagaaagag gaagggattg 3180
ccgacggaat tttactgcaa aagcttttat cgccctgaaa aaggtgcttt tttcagactc 3240
ccgtttgata agatgggcct tggtaatggt ttatgctact cttgtgagtt gcagcaaact 3300
gatcaggaaa aggaatcctt taagtttgat atgtccaaat ccagttttgt atatctgggg 3360
actgagtatt cagttgatga ctttgtttat gtaagccccg atcactttac tgcagaaaga 3420
gggggaaatg gaactttcaa agccggaaga aatgtggggt tgatggccta tgtagtatgt 3480
caattactag aaattgttgg acctaaggga tctaaacaag ctaaagtaga ttctacaaat 3540
gttaaagtca gaagattctt cagaccagag gatatatctt cagataaggc atactcttct 3600
gatatccggg agatctatta cagtgaagat atacatacag ttcctgtgga aataatcaaa 3660
ggaaaatgtg aagtgaggaa gaagtatgat atttcctctg aagatgtccc tgccatgttc 3720
gaccatattt tcttttgtga atatttgtat gatccattga atggatccct taagaagtta 3780
ccagctcaga taaacctgat attgtcaaaa attaagctag atgacgcaac atctaggaag 3840
aggaagggga agggaaaaga aggagtggat gaagttgggg aactaaatga aacttctcca 3900
cagaatcgtt tgtccacatt agatatcttt gctggttgtg gtggcttgtc tgaggggttg 3960
cagcattcgg gtgtcacaga tacaaattgg gcaattgaat acgaagcgcc tgctggagat 4020
gcatttagac ttaatcatcc aaagacaaag gtgttcatac ataattgcaa tgtgattttg 4080
agggctgtca tgcagaagtg tggagattct gatgactgta tctcaactcc agaggcttct 4140
gaattagctg cagcaatgga tgagagcgaa ctgaatagtt tgccactgcc tggacaagtt 4200
gatttcatta atggaggccc tccttgtcag gggttttctg gaatgaatag atttaatcag 4260
agcacctgga gtaaagtaca gtgtgagatg attctggcat ttttatcctt tgctgattat 4320
tatcggccca agttttttct cttggagaat gttaggaatt ttgtttcgtt caaccaaaaa 4380
caaacatttc gcttaactgt tgcttccctt cttgagatgg gttatcaggt taggtttggt 4440
atccttgaag ccggagcgta tggagttcct cagtctagga agagagcatt tatctgggct 4500
ggctccccag aggaggttct tccagagtgg ccagaaccaa tgcatgtttt tgctgtccca 4560
gaattaaaaa tcgcattatc tgaaacttca tactatgcag ctgtgaggag tactgctagt 4620
ggagctccat tccgttcact tactgtcaga gacacaattg gagatcttcc tgttgttggc 4680
aatggggcaa gcaagacttg catagagtat caaggtgatc cagtatcctg gttccaaaag 4740
aaaatccggg gcagctcaat aacattatct gatcacattt caaaagagat gaatgagctt 4800
aacctaatca ggtgccaaag aatccccaag cggccaggag ctgattggcg tgaccttgaa 4860
gatgaaaagg ttaaactatc taatggtcaa ctagttgatt tgattccatg gtgcctgcct 4920
aacactgcta agcggcacaa ccagtggaag gggctctttg gaaggttgga ttgggatggg 4980
aacttcccca cttctattac tgatccccag ccgatgggca aggtggggat gtgctttcat 5040
ccagatcaag acaggattgt tacagttcgt gaatgtgcac gttctcaagg tttcccagac 5100
agctaccaat ttgctggtaa catcttgcac aagcacaggc aaataggaaa tgctgttcca 5160
cctcctttgg catatgcgct tggaagaaaa ctcaaagaag ctgttgagag caaaaatagg 5220
ctcacttaga acttttttaa gctgtgaatt ttacatgcat gtcaattacc attcacattg 5280
ccaaattata tcagttactc atttattaaa tttgcagttt cacctataac cctctattta 5340
gaggttgggt tcaaacaaaa ttgattaaaa cattact 5377
<210>40
<211>1556
<212>PRT
<213>烟草(Nicotiana tabacum)
<220>
<221>肽
<222>(0)...(0)
<223>gi|7288140|dbj|BAA92852.1|DNA
胞嘧啶-5-甲基转移酶
<400>40
Met Ala Tyr Ser Phe Phe His Phe Phe Ala Gly Tyr Ser Gly His Lys
1 5 10 15
Lys Glu Lys Ser Lys Arg Asp Ser Val Ser Lys Arg Lys Ala Pro Ala
20 25 30
Thr Asp Lys Lys Glu Lys Lys Gln Pro Val Ser Glu Ala Ile Glu Glu
35 40 45
Pro Thr Ala Ala Arg Lys Arg Pro Lys Arg Ala Ala Ala Cys Ser Asn
50 55 60
Phe Lys Glu Lys Asn Val His Leu Ser Lys Asn Ser Ala Val Ile Glu
65 70 75 80
Thr Lys Lys Asp Gln Cys Val Glu Glu Glu Val Leu Ala Ile Arg Leu
85 90 95
Thr Ala Gly Leu Gln Asp Ser Gln Arg Pro Cys Arg Arg Leu Thr Asp
100 105 110
Phe Ile Phe His Asn Leu Glu Gly Ile Pro Gln Pro Phe Glu Met Ser
115 120 125
Glu Val Asp Asp Leu Phe Ile Thr Gly Leu Ile Leu Pro Leu Glu Asp
130 135 140
Asn Asn Asp Lys Glu Lys Ala Lys Gly Ile Arg Cys Glu Gly Phe Gly
145 150 155 160
Arg Ile Glu Glu Trp Ala Ile Ser Gly Tyr Glu Asp Gly Thr Pro Ile
165 170 175
Ile Trp Ile Ser Thr Glu Thr Ala Asp Tyr Asp Cys Lys Lys Pro Ser
180 185 190
Gly Gly Tyr Lys Lys Phe Tyr Asp His Phe Phe Ala Lys Ala Thr Ala
195 200 205
Cys Ile Glu Val Tyr Lys Lys Leu Ser Lys Ser Ser Gly Gly Asn Pro
210 215 220
Asp Leu Ser Leu Asp Gly Leu Leu Ala Gly Val Val Arg Ala Met Ser
225 230 235 240
Gly Leu Lys Cys Phe Ser Gly Gly Val Ser Ile Arg Asp Phe Leu Ile
245 250 255
Ser Gln Gly Glu Phe Val Tyr Lys Gln Leu Ile Gly Gln Asp Asp Thr
260 265 270
Ser Lys Lys Thr Asp Gln Leu Phe Val Glu Leu Pro Val Leu Ala Ser
275 280 285
Leu Arg Asp Glu Ser Ser Asn Gln Glu Met Leu Ser Gln Pro Glu Pro
290 295 300
Leu Ser Phe Gly Arg Thr Leu Thr Ile Gly Pro Lys Val Gly Lys Gly
305 310 315 320
Glu Gly Lys Arg Asp Gln Ser Asp Leu Thr Thr Gly Pro Glu Gln Glu
325 330 335
Glu Glu Asp Leu Lys Leu Ala Lys Leu Leu His Glu Gln Glu Tyr Trp
340 345 350
His Ser Leu Asn Gln Lys Thr Ser Arg Ser Thr Ser Ser Ser Ser Ser
355 360 365
Lys Phe Tyr Ile Lys Ile Asn Glu Asp Glu Ile Ala Ser Asp Tyr Pro
370 375 380
Leu Pro Ala Tyr Tyr Lys Thr Cys Asn Glu Glu Thr Asp Glu Tyr Ile
385 390 395 400
Val Phe Asp Ser Gly Val Asp Thr Tyr Tyr Ile Asp Asp Leu Pro Arg
405 410 415
Ser Met Leu His Asn Trp Ala Leu Tyr Asn Ser Asp Ser Arg Leu Ile
420 425 430
Ser Ser Glu Leu Leu Pro Met Lys Pro Cys Ala Asp Ile Asp Val Thr
435 440 445
Ile Phe Gly Ser Gly Val Met Thr Ala Asp Asp Gly Ser Gly Tyr Asn
450 455 460
Val Asp Ala Asp Ala Asn Asn Ser Ser Ser Gly Gly Ser Gly Ser Ala
465 470 475 480
Glu Ile Asp Gly Met Pro Ile Tyr Leu Ser Ala Ile Lys Glu Trp Met
485 490 495
Ile Glu Phe Gly Ser Ser Met Ile Phe Ile Ser Ile Arg Thr Asp Met
500 505 510
Ala Trp Tyr Arg Leu Gly Lys Pro Ser Lys Gln Tyr Ala Prp Trp Tyr
515 520 525
Glu Pro Val Leu Lys Thr Ala Lys Leu Ala Val Ser Ile Ile Thr Leu
530 535 540
Leu Lys Glu Gln Ser Arg Cys Ala Arg Leu Ser Phe Gly Asp Val Ile
545 550 555 560
Lys Arg Val Ser Glu Phe Lys Lys His His Pro Ala Tyr Ile Ser Ser
565 570 575
Asn Thr Asp Val Val Glu Arg Tyr Val Val Val His Gly Gln Ile Ile
580 585 590
Leu Gln Gln Phe Ser Glu Phe Pro Asp Glu Ser Ile Arg Lys Cys Ala
595 600 605
Phe Val Ile Gly Leu Ser Arg Lys Met Glu Glu Arg His His Thr Lys
610 615 620
Trp Leu Ile Lys Lys Lys Lys Val Val Gln Arg His Glu Gln Asn Leu
625 630 635 640
Asn Pro Arg Ala Ser Met Ala Pro Ser Val Lys Arg Lys Ala Met Gln
645 650 655
Ala Thr Thr Thr Arg Leu Ile Asn Arg Ile Trp Gly Glu Tyr Tyr Ser
660 665 670
Asn Thr Ser Pro Glu Thr Ser Lys Glu Val Val Ala Cys Glu Val Lys
675 680 685
Asp Asp Glu Glu Val Asp Glu Gln Glu Glu Asn Asp Glu Asp Asp Ala
690 695 700
Gln Glu Glu Asn Leu Glu Val Ser Glu Lys Thr His Thr Pro Cys Ser
705 710 715 720
Thr Arg Arg His Ile Lys Ser Arg Ser Asp Ser Lys Glu Ile Asn Trp
725 730 735
Asp Gly Glu Ser Ile Gly Lys Thr Ala Ser Gly Glu Leu Leu Phe Lys
740 745 750
Lys Pro Arg Ile His Gly Asn Glu Ile Ala Val Gly Asp Ser Val Leu
755 760 765
Val Glu His Asp Glu Pro Asp Glu Leu Pro Ser Ile Tyr Phe Val Glu
770 775 780
Tyr Met Phe Glu Lys Leu Asp Gly Ser Lys Met Leu His Gly Arg Met
785 790 795 800
Met Gln Arg Gly Ser Glu Thr Val Leu Gly Asn Ala Ala Asn Glu Arg
805 810 815
Glu Val Phe Leu Ile Asn Glu Cys Met Asp Leu Gln Leu Gly Asp Val
820 825 830
Lys Glu Ser Val Val Val Ser Ile Arg Met Met Pro Trp Gly His Gln
835 840 845
His Arg Lys Ala Asn Ala Tyr Val Asp Lys Leu Asp Arg Ala Lys Ala
850 855 860
Glu Asp Arg Lys Lys Lys Gly Leu Pro Ser Glu Phe Tyr Cys Lys Ser
865 870 875 880
Phe Tyr Gln Pro Asp Arg Gly Ala Phe Phe Arg Leu Pro Phe Asp Lys
885 890 895
Met Gly Leu Gly Asn Gly Leu Cys Tyr Ser Cys Glu Leu Gln Gln Ile
900 905 910
Asp Gln Glu Lys Glu Ser Phe Lys Leu Asp Met Ser Asn Ser Ser Phe
915 920 925
Val Tyr Leu Gly Thr Glu Tyr Ser Ile Asp Asp Phe Val Tyr Ile His
930 935 940
Pro Asp His Phe Ala Val Glu Arg Gly Gly Ser Gly Thr Phe Lys Ala
945 950 955 960
Gly Arg Asn Val Gly Leu Met Ala Thr Val Val Cys Gln Leu Ile Glu
965 970 975
Ile Ser Gly Pro Lys Gly Ser Lys Gln Ala Lys Val Asp Ser Thr Asn
980 985 990
Val Lys Val Arg Arg Phe Phe Arg Pro Glu Asp Ile Ser Ser Asp Lys
995 1000 1005
Ala Tyr Ser Ser Asp Ile Arg Glu Ile Tyr Tyr Ser Glu Glu Ile His
1010 1015 1020
Thr Val Pro Val Glu Thr Ile Glu Gly Lys Cys Glu Val Arg Lys Lys
1025 1030 1035 1040
Tyr Asp Ile Pro Ser Glu Asp Val Pro Ala Thr Phe Asp His Val Phe
1045 1050 1055
Phe Cys Glu Tyr Leu Tyr Asp Pro Leu Asn Gly Ser Leu Lys Gln Leu
1060 1065 1070
Pro Ala Gln Val Lys Leu Arg Phe Ser Arg Val Lys Leu Asp Asp Ala
1075 1080 1085
Ala Ser Arg Lys Arg Lys Gly Lys Gly Lys Glu Gly Glu Asp Glu Leu
1090 1095 1100
Arg Val Gly Gln Leu Asn Val Ala Ser Gln Gln Asn Arg Leu Ala Thr
1105 1110 1115 1120
Leu Asp Ile Phe Ala Gly Cys Gly Gly Leu Ser Glu Gly Leu Gln Arg
1125 1130 1135
Ser Gly Val Ser Asp Thr Lys Trp Ala Ile Glu Tyr Glu Glu Pro Ala
1140 1145 1150
Gly Asp Ala Phe Lys Leu Asn His Pro Glu Ala Lys Val Phe Ile Gln
1155 1160 1165
Asn Cys Asn Val Ile Leu Arg Ala Val Met Gln Lys Cys Gly Asp Ala
1170 1175 1180
Glu Asn Cys Ile Ser Thr Ser Glu Ala Ser Glu Leu Ala Ala Ala Met
1185 1190 1195 1200
Asp Glu Asn Glu Leu Asn Ser Leu Pro Leu Pro Gly Gln Val Asp Phe
1205 1210 1215
Ile Asn Gly Gly Pro Pro Cys Gln Gly Phe Ser Gly Met Asn Arg Phe
1220 1225 1230
Asn Gln Ser Thr Trp Ser Lys Val Gln Cys Glu Met Ile Leu Ala Phe
1235 1240 1245
Leu Ser Phe Ala Asp Tyr Tyr Arg Pro Lys Phe Phe Leu Leu Glu Asn
1250 1255 1260
Val Arg Asn Phe Val Ser Phe Asn Gln Lys Gln Thr Phe Arg Leu Thr
1265 1270 1275 1280
Val Ala Ser Leu Leu Glu Met Gly Tyr Gln Val Arg Phe Gly Ile Leu
1285 1290 1295
Glu Ala Gly Ala Phe Gly Val Pro Gln Ser Arg Lys Arg Ala Phe Ile
1300 1305 1310
Trp Ala Ala Ser Pro Glu Glu Ile Leu Pro Glu Trp Pro Glu Pro Met
2315 1320 1325
His Val Phe Gly Val Pro Glu Leu Lys Ile Thr Leu Ser Glu Thr Cys
1330 1335 1340
His Tyr Ala Ala Val Arg Ser Thr Ala Ser Gly Ala Pro Phe Arg Ser
1345 1350 1355 1360
Leu Thr Val Arg Asp Thr Ile Gly Asp Leu Pro Ala Val Gly Asn Gly
1365 1370 1375
Ala Ser Lys Thr Cys Ile Glu Tyr Gln Val Asp Pro Ile Ser Trp Phe
1380 1385 1390
Gln Arg Lys Ile Arg Gly Asn Ser Ile Thr Leu Ser Asp His Ile Thr
1395 1400 1405
Lys Glu Met Asn Glu Leu Asn Leu Ile Arg Cys Gln Arg Ile Pro Lys
1410 1415 1420
Arg Pro Gly Ala Asp Trp Arg Asp Leu Pro Asp Glu Lys Val Lys Leu
1425 1430 1435 1440
Cys Asn Gly Gln Leu Val Asp Leu Ile Pro Trp Cys Leu Pro Asn Thr
1445 1450 1455
Ala Lys Arg His Asn Gln Trp Lys Gly Leu Phe Gly Arg Leu Asp Trp
1460 1465 1470
Asp Gly Asn Phe Pro Thr Ser Phe Thr Asp Pro Gln Pro Met Gly Lys
1475 1480 1485
Val Gly Met Cys Phe His Pro Aap Gln Asp Arg Ile Val Thr Val Arg
1490 1495 1500
Glu Cys Ala Arg Ser Gln Gly Phe Pro Asp Ser Tyr Gln Phe Ala Gly
1505 1510 1515 1520
Asn Ile Leu His Lys His Arg Gln Ile Gly Asn Ala Val Pro Pro Pro
1525 1530 1535
Leu Ala Tyr Ala Leu Gly Arg Lys Leu Lys Glu Ala Val Glu Ser Lys
1540 1545 1550
Lys Arg Ser Thr
1555
<210>41
<211>4822
<212>DNA
<213>烟草(Nicotiana tabacum)
<220>
<221>misc_feature
<222>(0)...(0)
<223>AB030726.1;GI:7288139;DNA(胞嘧啶-5-)-甲基转移酶mRNA,完整的cds
<400>41
atggcttatt cttttttcca tttttttgct ggttattcag gacacaaaaa ggagaaaagc 60
aaacgagatt ctgtgtcaaa aaggaaggca cctgcaactg acaagaagga aaagaaacag 120
cctgtttctg aagctattga ggagcccact gctgcacgca aaaggcccaa gcgagctgct 180
gcttgttcaa attttaaaga gaaaaatgtt catttatcaa aaaattctgc agtcattgaa 240
acaaagaagg accaatgcgt agaggaagag gttttggcta ttcggttaac tgcgggtcta 300
caggattctc agcgaccctg tagaagacta acagatttta tctttcataa tttggaagga 360
ataccacaac cttttgaaat gtctgaagtt gatgatctgt ttattactgg tctcatttta 420
ccacttgagg acaataatga caaagaaaaa gcaaaaggaa ttagatgtga aggctttggg 480
cgtatagaag aatgggctat ctctggctat gaagatggaa ctcctatcat atggatctca 540
acagagacag ctgattatga ttgtaaaaaa ccctcaggtg gctataagaa gttttatgac 600
cacttcttcg ccaaagctac agcctgcatt gaggtttaca aaaagctgtc gaaatcttct 660
ggaggaaatc ctgatttaag ccttgatggg ttgcttgcag gggttgtccg agcaatgagt 720
ggtttaaaat gcttttcggg tggtgtatca atcagggact ttctcatttc tcagggagag 780
tttgtctata agcaacttat cggtcaggac gatacatcaa agaagactga tcagcttttt 840
gttgagttac ctgtcctggc ttcccttaga gatgaaagca gcaatcagga aatgctttca 900
caaccagagc ctttatcatt tggtaggact ctaactatag gtccaaaagt aggcaaagga 960
gaaggcaaga gagatcaatc tgatttaacc actggtccag aacaagaaga ggaagatctg 1020
aaattggcca aactgttaca tgaacaggag tactggcact ccttgaacca gaagacaagc 1080
cgtagtacat cttcctcatc tagcaaattt tacatcaaga tcaatgagga tgagattgca 1140
agtgattatc ctttacctgc atattacaag acatgtaatg aagagaccga tgagtatatc 1200
gtctttgaca gtggggttga tacatactat attgatgact tgcctcgcag tatgcttcat 1260
aattgggcat tgtacaactc agactcaaga ctaatttctt cagagctcct gcctatgaaa 1320
ccatgcgctg atattgatgt aaccatattt gggtctggag tgatgactgc tgatgatgga 1380
tctggataca atgttgatgc tgatgctaat aactcctctt caggtggttc tggatcagct 1440
gagattgatg gaatgccaat ttatttgagt gcaataaaag aatggatgat tgagtttggg 1500
tcctcgatga tctttatatc tattcggact gatatggcct ggtataggct tgggaagcca 1560
tcaaaacagt atgctccttg gtatgaacca gtcctaaaga ctgcgaagtt ggcagtgagc 1620
attattactt tgttaaagga acaaagtcgt tgtgctagac tttcttttgg agatgtcatt 1680
aaaagggttt cagagttcaa gaaacaccat cctgcttata tatcatctaa tacagatgtg 1740
gtggaaagat atgtggttgt acatggacag attattctgc agcagttttc agaatttcct 1800
gatgaaagca ttaggaaatg tgcatttgtg attggcctct caaggaaaat ggaggagagg 1860
caccatacaa aatggttgat taagaagaag aaggttgtgc agagacatga acagaactta 1920
aatcctagag catctatggc gccatctgta aaaaggaaag ctatgcaggc tactacaaca 1980
agactaatca acagaatctg gggggagtac tattccaatt actcacctga gacgtcaaag 2040
gaggttgttg cttgtgaggt gaaggatgat gaagaagtag atgagcagga ggaaaatgac 2100
gaggatgatg ctcaagagga gaacttggaa gtttcagaga aaactcatac accttgctct 2160
acaagaaggc atattaagtc acgttctgac agcaaagaaa taaactggga tggggaatcc 2220
ataggtaaaa cagcgtctgg tgaactgttg tttaaaaagc ctagaattca tggaaatgag 2280
attgctgttg gagattcagt tctggtggaa catgatgaac cagatgaact tccttctatt 2340
tactttgtcg aatacatgtt tgaaaaattg gatggtagca aaatgctcca tggaagaatg 2400
atgcaacggg gatctgaaac tgtacttgga aatgcagcta atgaaagaga ggtatttttg 2460
atcaatgaat gcatggattt gcaactagga gatgtcaaag aaagtgtagt tgtcagtatc 2520
aggatgatgc catggggaca tcagcataga aaagcgaatg cttatgttga taaacttgat 2580
agagcaaagg cagaagacag gaagaagaag ggattgccat ccgaatttta ttgcaaaagc 2640
ttttatcagc ctgacagagg tgctttcttc agacttccgt ttgataagat gggtcttggt 2700
aatggcttat gttactcctg tgagttgcag caaattgatc aggaaaagga atcttttaag 2760
ttggatatgt ccaactccag ttttgtatat ctggggactg agtattcaat tgatgacttt 2820
gtttatatac accctgatca ctttgctgta gaaagagggg gaagtggaac tttcaaagct 2880
gggagaaatg tggggttgat ggcctatgta gtgtgtcaac taatagagat ttctggcccc 2940
aagggatcta aacaagctaa agtagattct accaacgtca aagtcaggag attcttcaga 3000
ccagaggaca tttcttcaga taaggcatac tcttctgata ttcgggagat ctactatagt 3060
gaggagatac atacagttcc ggtagaaaca attgaaggta aatgtgaagt gaggaagaag 3120
tatgatattc cgtctgaaga tgtccctgcc acctttgacc atgttttctt ttgtgaatat 3180
ttgtatgatc cattgaatgg atccctcaaa cagttaccag ctcaggtaaa gctgagattc 3240
tcaagagtta aactagatga tgctgcatct aggaagagaa agggaaaagg caaggaagga 3300
gaggatgaac tgagagttgg gcaactaaat gtagcttctc aacagaatcg tttggccaca 3360
ctagatatct ttgctggttg tggtggcctg tctgaggggt tgcagcgttc gggtgtctca 3420
gatacaaaat gggcaattga atatgaagag cctgctggag atgcgtttaa acttaatcat 3480
ccagaggcaa aggtgttcat acagaattgc aatgtgattc tgagggctgt catgcaaaag 3540
tgtggagatg ctgagaactg tatctcaacc tcagaggctt ctgaattagc tgcagcaatg 3600
gatgagaacg aactgaatag tttgccactg ccaggacaag tggacttcat aaatggaggc 3660
cctccttgtc aggggttttc tggaatgaat agatttaatc agagcacctg gagtaaagtt 3720
cagtgcgaga tgattctggc atttttatcc tttgctgatt attatcggcc taagttcttt 3780
ctcttggaga atgttaggaa ttttgtgtcg ttcaaccaaa aacaaacatt tcgcttaact 3840
gttgcttccc ttcttgagat gggttatcag gtgaggtttg gtatccttga agctggagcg 3900
tttggagttc ctcagtctag gaagagagca tttatctggg ctgcttcccc agaggagatt 3960
cttccagagt ggccagaacc aatgcatgta tttggtgtcc cagaattaaa aatcacatta 4020
tctgaaactt gtcactatgc agctgtgagg agtactgcta gtggagctcc attccgttcg 4080
cttactgtca gagacacaat tggagatctt cctgctgttg gcaacggagc atccaagacc 4140
tgtatagagt atcaagttga cccgatatcc tggttccaaa ggaaaattcg gggcaactca 4200
ataacattat ccgatcacat tacgaaagag atgaacgagc ttaacctaat caggtgccaa 4260
agaattccta agcggccagg agccgactgg cgtgaccttc cggatgaaaa ggttaaacta 4320
tgtaatggtc aactggttga tttgattccg tggtgcctgc ctaacactgc taagaggcac 4380
aaccagtgga aggggctctt tgggaggttg gattgggatg ggaacttccc cacttccttt 4440
actgaccccc agccgatggg taaggtgggg atgtgttttc atcccgacca agacaggatt 4500
gttacagttc gtgaatgtgc gcgttctcaa ggtttcccag atagctatca atttgctggt 4560
aacattttgc acaagcacag gcaaatagga aatgctgttc cacctccttt ggcatatgca 4620
ctgggaagga aacttaagga agctgttgag agcaagaaga ggtccactta gaagtttgta 4680
aattttgtgg aacaagagat gagtggtcat actgcacctg aatttaagct ttcaaattta 4740
aatgtcaaac agcatgattc acatgtcaat tttctgttgt acaagatagc ttattgcaga 4800
atcaatgtta cataaaaaaa aa 4822
<210>42
<211>152
<212>PRT
<213>小麦(Triticum aestivum)
<220>
<221>肽
<222>(0)...(0)
<223>Ceres克隆:890048;Met1同系物
<221>变体
<222>142,143,146,148
<223>Xaa=任何氨基酸
<400>42
Asp His Ile Ser Lys Glu Met Asn Glu Leu Asn Leu Ile Arg Cys Lys
1 5 10 15
His Ile Pro Lys Arg Pro Gly Cys Asp Trp His Asp Leu Pro Asp Glu
20 25 30
Lys Val Lys Leu Ser Ser Gly Gln Met Val Asp Leu Ile Pro Trp Cys
35 40 45
Leu Pro Asn Thr Ala Lys Arg His Asn Gln Trp Lys Gly Leu Tyr Gly
50 55 60
Arg Leu Asp Trp Glu Gly Asn Phe Pro Thr Ser Val Thr Asp Pro Gln
65 70 75 80
Pro Met Gly Lys Val Gly Met Cys Phe His Pro Asp Gln Asp Arg Ile
85 90 95
Ile Thr Val Arg Glu Cys Ala Arg Ser Gln Gly Phe Pro Asp Ser Tyr
100 105 110
Gln Phe Ser Gly Thr Ile Gln Ser Lys His Arg Gln Ile Gly Asn Ala
115 120 125
Val Pro Pro Pro Leu Ala Phe Ala Leu Gly Arg Lys Leu Xaa Xaa Ala
130 135 140
Val Xaa Gly Xaa His Gln Gln Ala
145 150
<210>43
<211>457
<212>DNA
<213>小麦(Triticum aestivum)
<220>
<221>misc_feature
<222>(0)...(0)
<223>Ceres克隆:890048;Met1同系物
<400>43
cgatcacata tctaaggaga tgaatgaatt aaatctcata agatgcaaac atattcccaa 60
acgacctggt tgtgactggc atgacctgcc agatgagaag gtgaagctat cttctgggca 120
aatggtggac ctgatacctt ggtgcttgcc taacaccgct aaaaggcaca atcagtggaa 180
gggtctgtat gggaggttag attgggaggg caatttcccc acgtctgtga ctgatcctca 240
gccgatgggc aaggttggca tgtgcttcca ccctgaccag gataggatta tcacggtccg 300
cgaatgtgcg cgatctcagg gctttcctga cagctaccag ttttcgggca ccattcagag 360
caagcacagg cagattggca atgctgtgcc accccctctt gcctttgcgc ttgggaggaa 420
gctgamtsaa gccgttsatg ggaakcacca gcaggcc 457
<210>44
<211>1525
<212>PRT
<213>玉米(zea mays)
<220>
<221>肽
<222>(0)...(0)
<223>gi|3132825|gb|AAC16389.1|假定的
胞嘧啶-5-甲基转移酶
<400>44
Met Gln Ser Lys Ala Thr Lys Glu Gly Arg Gly Ile His Arg Lys Gln
1 5 10 15
Gln Ala Gly Glu Trp Ile Ser Gly Tyr Asn Arg Arg Gly Ala Ser Trp
20 25 30
Ser Arg Lys Ser Asp Gly His Val Thr Arg Lys Arg Pro Arg Arg Ser
35 40 45
Ala Ala Cys Ser Asp Phe Lys Glu Lys Ser Ile Arg Leu Ser Glu Lys
50 55 60
Lys Ser Val Val Met Val Lys Lys Asn Arg Met Glu Glu Glu Glu Val
65 70 75 80
Asp Ala Val Asn Leu Thr Lys Leu Gly Pro Glu Asp Pro Pro Pro Cys
85 90 95
Arg Lys Leu Ile Asp Phe Ile Leu His Asp Ala Glu Gly Asn Pro Gln
100 105 110
Pro Phe Glu Met Ser Glu Ile Asp Asp Phe Phe Ile Thr Ala Leu Ile
115 120 125
Met Pro Met Asp Asp Asp Leu Glu Lys Glu Arg Glu Arg Gly Val Arg
130 135 140
Cys Glu Gly Phe Gly Arg Ile Glu Asp Trp Asn Ile Ser Gly Tyr Asp
145 150 155 160
Glu Gly Thr Pro Val Ile Trp Val Ser Thr Asp Val Ala Asp Tyr Glu
165 170 175
Cys Val Lys Pro Ser Thr Asn Tyr Lys Ser Tyr Phe Asp His Phe Tyr
180 185 190
Glu Lys Ala Gln Val Cys Val Glu Val Phe Lys Lys Leu Ala Lys Ser
195 200 205
Val Gly Gly Asn Pro Asn Gln Gly Leu Asp Glu Leu Leu Ala Ser Val
210 215 220
Val Arg Ser Thr Asn Ala Met Lys Gly Tyr Ser Gly Thr Met Ser Lys
225 230 235 240
Asp Leu Val Ile Ser Ile Gly Glu Phe Val Tyr Asn Gln Leu Val Gly
245 250 255
Leu Asp Glu Thr Ser Asn Asn Asp Asp Glu Lys Phe Ala Thr Leu Pro
260 265 270
Val Leu Leu Ser Leu Arg Asp Gln Cys Arg Ser Arg Val Glu Leu Thr
275 280 285
Lys Leu Pro Ser Asn Phe Ser Asn Thr Ser Leu Lys Ile Lys Asp Ser
290 295 300
Glu Cys Asp Glu Thr Ala Glu Asp Asp Asp Asp Ala Lys Leu Ala Arg
305 310 315 320
Leu Leu Gln Gln Glu Glu Glu Trp Lys Met Met Lys Lys Gln Arg Gly
325 330 335
Arg Arg Gly Thr Pro Ser Gln Lys Asn Val Tyr Ile Lys Ile Ser Glu
340 345 350
Ala Glu Ile Ala Asn Asp Tyr Pro Leu Pro Ala Tyr Tyr Lys Pro Phe
355 360 365
Ser Gln Glu Met Asp Glu Tyr Ile Phe Asp Ser Asp Asp Ser Ile Phe
370 375 380
Ser Asp Asp Val Pro Val Arg Ile Leu Asn Asn Trp Thr Leu Tyr Asn
385 390 395 400
Ala Asp Ser Arg Leu Ile Ser Leu Glu Leu Ile Pro Met Lys Ser Gly
405 410 415
Ala Glu Asn Asp Val Val Val Phe Gly Ser Gly Phe Met Arg Asp Asp
420 425 430
Asp Gly Ser Cys Cys Ser Thr Ala Glu Ser Val Lys Ser Ser Ser Ser
435 440 445
Ser Ser Lys Ala Asp Gln Leu Asp Ala Gly Ile Pro Ile Tyr Leu Ser
450 455 460
Pro Ile Lys Glu Trp Ile Ile Glu Phe Gly Gly Ser Met Ile Cys Ile
465 470 475 480
Thr Ile Arg Thr Asp Va1 Ala Trp Tyr Lys Leu Arg Gln Pro Thr Lys
485 490 495
Gln Tyr Ala Pro Trp Cys Glu Pro Val Leu Lys Thr Ala Arg Leu Ala
500 505 510
Val Ser Ile Ile Thr Leu Leu Lys Glu Gln Ser Arg Ala Ser Lys Leu
515 520 525
Ser Phe Ala Asp Val Ile Arg Lys Val Ala Glu Phe Asp Lys Gly Asn
530 535 540
Pro Ala Phe Ile Ser Ser Asn Ile Thr Leu Val Glu Arg Tyr Ile Val
545 550 555 560
Val His Gly Gln Ile Ile Leu Gln Gln Phe Ala Asp Phe Pro Asp Glu
565 570 575
Thr Ile Arg Arg Ser Ala Phe Val Ser Gly Leu Leu Leu Lys Met Glu
580 585 590
Gln Arg Arg His Thr Lys Leu Val Met Lys Lys Lys Thr Gln Val Met
595 600 605
Arg Gly Glu Asn Leu Asn Pro Ser Ala Ala Met Gly Pro Ala Ser Arg
610 615 620
Lys Lys Ala Met Arg Ala Thr Thr Thr Arg Leu Ile Asn Arg Ile Trp
625 630 635 640
Ser Asp Tyr Tyr Ala His His Phe Pro Glu Asp Ser Lys Glu Gly Asp
645 650 655
Gly Asn Glu Thr Lys Glu Ile Asp Asp Glu Gln Glu Glu Asn Glu Asp
660 665 670
Glu Asp Ala Glu Asp Glu Gly Gln Ile Glu Glu Asn Ile Ser Lys Thr
675 680 685
Pro Pro Ser Thr Arg Ser Arg Lys Leu Leu Ser Gln Thr Cys Lys Glu
690 695 700
Ile Arg Trp Glu Gly Glu Thr Ser Gly Lys Thr Leu Ser Gly Glu Thr
705 710 715 720
Leu Tyr Lys Cys Ala Tyr Val Arg Glu Leu Arg Ile Pro Val Gly Gly
725 730 735
Thr Val Ala Leu Glu Asp Asp Ser Gly Asp Thr Val Ile Cys Phe Val
740 745 750
Glu Tyr Met Phe Gln Lys Val Asp Gly Ser Lys Met Val His Gly Arg
755 760 765
Ile Leu Gln Lys Gly Ser Gln Thr Ile Leu Gly Asn Ala Ala Asn Glu
770 775 780
Arg Glu Val Phe Leu Thr Asn Asp Cys Leu Glu Phe Lys Leu Asp Asp
785 790 795 800
Ile Lys Glu Leu Val Met Val Asp Ile Gln Ser Arg Pro Trp Gly His
805 810 815
Lys Tyr Arg Lys Glu Asn Ser Glu Ala Asp Lys Val Glu Gln Val Lys
820 825 830
Ala Glu Glu Arg Lys Lys Lys Gly Gln Pro Met Val Tyr Phe Cys Lys
835 840 845
Ser Leu Tyr Trp Pro Glu Lys Gly Ala Phe Phe Ala Leu Ser Arg Asp
850 855 860
Lys Met Gly Leu Gly Ser Gly Leu Cys Ser Ser Cys Asp Asn Ile Glu
865 870 875 880
Pro Asp Ser Asp Glu Leu Lys Ile Phe Ser Lys Thr Ser Phe Val Tyr
885 890 895
Arg Lys Val Thr Tyr Asn Val Asn Glu Phe Leu Tyr Ile Arg Pro Asp
900 905 910
Phe Phe Ala Glu Asp Glu Asp Arg Ala Thr Phe Lys Ala Gly Arg Asn
915 920 925
Val Gly Leu Lys Pro Tyr Ala Val Cys Gln Ile Leu Ser Ile Pro Glu
930 935 940
Gly Ala Gly Ser Lys Lys Leu Asn Pro Ala Ser Ala Asn Ile Ser Ala
945 950 955 960
Arg Arg Phe Tyr Arg Pro Asp Asp Ile Ser Ser Ala Lys Ala Tyr Ala
965 970 975
Ser Asp Ile Arg Glu Val Tyr Tyr Ser Glu Asp Val Ile Asp Val Pro
980 985 990
Val Asp Met Ile Glu Gly Lys Cys Glu Val Arg Lys Lys Asn Asp Leu
995 1000 1005
Ala Ser Ser Asp Leu Pro Val Met Phe Glu His Val Phe Phe Cys Glu
1010 1015 1020
Leu Ile Tyr Asp Arg Ala Ser Gly Ala Leu Lys Gln Leu Pro Pro Asn
1025 1030 1035 1040
Val Arg Phe Met Ser Met Val Gln Arg Thr Ser Ala Leu Lys Lys Asn
1045 1050 1055
Lys Gly Lys Gln Ile Cys Glu Pro Asp Gln Ile Asp Ser Gly Lys Trp
1060 1065 1070
Leu Asp Val Pro Lys Glu Asn Arg Leu Ala Thr Leu Asp Ile Phe Ala
1075 1080 1085
Gly Cys Gly Gly Leu Ser Glu Gly Leu Gln Gln Ala Gly Val Ser Phe
1090 1095 1100
Thr Lys Trp Ala Ile Glu Tyr Glu Glu Pro Ala Gly Glu Ala Phe Asn
1105 1110 1115 1120
Lys Asn His Pro Glu Ala Val Val Phe Val Asp Asn Cys Asn Val Ile
1125 1130 1135
Leu Lys Ala Ile Met Asp Lys Cys Gly Asp Thr Asp Asp Cys Val Ser
1140 1145 1150
Thr Ser Glu Ala Ala Glu Gln Ala Ala Lys Leu Pro Glu Val Asn Ile
1155 1160 1165
Asn Asn Leu Pro Val Pro Gly Glu Val Glu Phe Ile Asn Gly Gly Pro
1170 1175 1180
Pro Cys Gln Gly Phe Ser Gly Met Asn Arg Phe Asn Gln Ser Pro Trp
1185 1190 1195 1200
Ser Lys Val Gln Cys Glu Met Ile Leu Ala Phe Leu Ser Phe Ala Glu
1205 1210 1215
Tyr Phe Arg Pro Arg Phe Phe Leu Leu Glu Asn Val Arg Asn Phe Val
1220 1225 1230
Ser Phe Asn Lys Gly Gln Thr Phe Arg Leu Ala Val Ala Ser Leu Leu
1235 1240 1245
Glu Met Gly Tyr Gln Val Arg Phe Gly Ile Leu Glu Ala Gly Ala Phe
1250 1255 1260
Gly Val Ala Gln Ser Arg Lys Arg Ala Phe Ile Trp Ala Ala Ala Pro
1265 1270 1275 1280
Gly Glu Met Leu Pro Asp Trp Pro Glu Pro Met His Val Phe Ala Ser
1285 1290 1295
Pro Glu Leu Lys Ile Thr Leu Pro Asp Gly Gln Tyr Tyr Ala Ala Ala
1300 1305 1310
Arg Ser Thr Ala Gly Gly Ala Pro Phe Arg Ala Ile Thr Val Arg Asp
1315 1320 1325
Thr Ile Gly Asp Leu Pro Lys Val Gly Asn Gly Ala Ser Lys Leu Thr
1330 1335 1340
Leu Glu Tyr Gly Gly Glu Pro Val Ser Trp Phe Gln Lys Lys Ile Arg
1345 1350 1355 1360
Gly Ser Met Met Val Leu Asn Asp His Ile Ser Lys Glu Met Asn Glu
1365 1370 1375
Leu Asn Leu Ile Arg Cys Gln His Ile Pro Lys Arg Pro Gly Cys Asp
1380 1385 1390
Trp His Asp Leu Pro Asp Glu Lys Val Lys Leu Ser Asn Gly Gln Met
1395 1400 1405
Ala Asp Leu Ile Pro Trp Cys Leu Pro Asn Thr Ala Lys Arg His Asn
1410 1415 1420
Gln Trp Lys Gly Leu Tyr Gly Arg Leu Asp Trp Glu Gly Asn Phe Pro
1425 1430 1435 1440
Thr Ser Val Thr Asp Pro Gln Pro Met Gly Lys Val Gly Met Cys Phe
1445 1450 1455
His Pro Asp Gln Asp Arg Ile Ile Thr Val Arg Glu Cys Ala Arg Ser
1460 1465 1470
Gln Gly Phe Pro Asp Ser Tyr Glu Phe Ala Gly Asn Ile Gln Asn Lys
1475 1480 1485
His Arg Gln Ile Gly Asn Ala Val Pro Pro Pro Leu Ala Tyr Ala Leu
1490 1495 1500
Gly Arg Lys Leu Lys Glu Ala Val Asp Lys Arg Gln Glu Ala Ser Ala
1505 1510 1515 1520
Gly Val Pro Ala Pro
1525
<210>45
<211>7955
<212>DNA
<213>玉米(Zea mays)
<220>
<221>misc_feature
<222>(0)...(0)
<223>AF063403.1;GI:3132824;假定的胞嘧啶-5 DNA
甲基转移酶(ZMET1)基因,完整的cds
<400>45
ccctccactg ctcctacctt taacgaagca gcctggcagc acataaactt tcattttgaa 60
cttgttcaac ccgctgctgt gtttatggat ctttggcatc attgatggca ttaaactttt 120
gagtctggca cttactgatc tccaccttga accaggacat ttcttcatcc cattttgctt 180
cctttctgtt ctttgttgct ttctcaaatc ttccctaaac ccaaccaaat ttctttaaac 240
aaaaacgtgt atatgtgcat ttttagccca cacgcggatt cgagaacaag ctctatgagc 300
atcttcctcc ctattgactg tcaaaaaaaa gacggtgatg catgacacca cctcacctta 360
tcgaatcatg tctcccttgt tctgttctcc aaccatgctg cacacctgcc atttgtcata 420
tactcatcaa aattcatata aaacccccaa tcgtatcaat tccaatcccg tactagttaa 480
aagataacta tgtggagttg tcgcttcttc ccgtaatgta gttaagttag agggccctgg 540
tgtggcgctc ccgtcctggg tttgagcctt ggcattgcac cggtggtgca cccacctcat 600
ggctggtggc ggtgcaaatg gttctgtgac caccaatgaa gcgagtgcac atgaggttct 660
tgcctgcttt ccgtggtttg gtgggtccct atcttaatac agtcaaatgt acatctctcc 720
ttgatcaaat ttccccgtta acccatgtgg attatgtggt attgagtcgt aaatccatag 780
caagtcaaaa ttcatcacaa tccattccaa tacactccaa tccacatgga attggaataa 840
ccgaacaatg ccttagttgg aaatggagtc attccagtct cttacatctg acacaaatat 900
ctttcctgag ttgtgacaac cagtgttacc cagacatctg cgttcctttt ttttgcggag 960
ccagaaaact tgtcggtttc caagtggtgt accccccccc cccccccccc aatttttttt 1020
tgtcaaactg gacacctgca cccgtaccgg acacaaatac ccgcactagc atgtgtccca 1080
tgtgacactg gtaaagtatt tggcattttg tgttcccatt taccctccca taatggtaat 1140
gtcagttgtt gcagaatctt acgttttaag caaatcatgt gaattggtta ccgttttcct 1200
atacacattt cacatgaacc attgggattg gtattgcaac tatgataaca gaggtatgct 1260
gagtgttcag taaattcaaa ccatttttga ggatctattt tgtttctcca agggtacact 1320
ggtagattaa ttacataggc tctggcattc cagtggctta tattattatt ttttctttct 1380
attcttggaa tggtcggata ttaaactgcc taccttttaa aatgtggtct cctgatgcaa 1440
tattgtggct catgtagttt taaatttagg aaagggaaca ctatttacag gctacaactc 1500
cattttttac cactaatgac attttagaaa aaaaaatgaa ggtatttcta aatgatcttt 1560
tgtcttaaat attgtctttg ttgctgcact tcacaggtct atatttttct agttactgat 1620
agcaagcatt aacaatcttt tgtcatttgg tcagtattta ttctgttcct taaatctagt 1680
cagtcaccct aaccttcctt ttttgttgat tttgtgtttt gtctgcatct ctggccggtg 1740
tgtttttctt ttctttctgt tcacttttca gtactgctat tttaactttt gttcccctat 1800
ataggcatat atctgattga tatgctgacc aatgattttt caggaacaag aagatgcaga 1860
gcaaagccac aaaagaagga agaggaatcc acagaaaaca acaagctgga gaatggatct 1920
ctggatacaa cagaagaggt gcatcatgga gtcgaaaaag tgatggacat gttacccgca 1980
agagaccaag gagatcagcg gcctgttctg atttcaaaga gaaatccata cgcttatccg 2040
aaaaaaaatc tgttgtcatg gtcaagaaga atcggatgga ggaggaagaa gtagatgctg 2100
tcaatctgac aaaacttgga ccagaagatc caccaccttg ccggaagttg atcgatttta 2160
tcttgcatga tgcagaaggg aacccacaac cctttgaaat gtcagaaatt gatgacttct 2220
ttataacagc tcttatcatg cccatggatg atgatctaga aaaagagcgt gaaagaggag 2280
tacgctgtga aggatttggg cgaattgagg actggaatat ttctggttat gatgaaggta 2340
ctcctgtaat ctgggtgtca actgatgttg ctgactatga atgtgtgaaa ccatcaacca 2400
attacaaatc ttattttgac cacttctatg agaaggctca ggtgtgtgtt gaagttttca 2460
aaaagcttgc aaaatcagtt ggtgggaatc ctaaccaggg cctggatgaa ttgcttgcta 2520
gtgttgttcg gtcaaccaat gccatgaaag gatatagtgg aaccatgagc aaagatttgg 2580
tgatatccat tggagaattt gtatacaatc aacttgttgg tttggatgag acatcaaaca 2640
atgatgatga aaagtttgct accctgccag ttcttctttc tctaagagac cagtgcagat 2700
ctagggtgga actgaccaag ttgccctcta acttctcgaa cacaagtctg aaaattaagg 2760
actcagagtg tgatgagaca gcagaagacg atgatgatgc aaaattagct agattacttc 2820
aacaagaaga agaatggaaa atgatgaaga aacagagggg taggcgtgga acaccatccc 2880
agaaaaatgt ctacataaaa atcagtgaag ctgagattgc caatgactat ccccttcctg 2940
catactataa gccatttagc caggaaatgg atgaatacat atttgatagt gatgacagca 3000
tattttctga tgatgtgcca gttaggatac tcaataactg gacactgtac aatgcagatt 3060
ccaggcttat atctttggaa ttgatcccta tgaaatcagg ggcagaaaat gatgtggttg 3120
tctttggatc tggtttcatg agagatgatg atggcagttg ctgttctaca gctgagtctg 3180
tgaaatcttc gtcttcctcc agcaaagctg accaactgga tgcgggaatc cctatttatt 3240
tgagcccaat caaagaatgg attatagagt ttggtggctc aatgatttgt ataaccattc 3300
ggactgatgt ggcctggtaa gtaccctcag ctactttctt tcagtacact gcttcattat 3360
gtggtcatta actgtgttct taacagttgt gtcactgtat cctcttatac catttgaaca 3420
tcacttttag ctcttttaat ctttgctcca ttacaactta catttagagt tttatttcag 3480
gtacaagcta cgccaaccaa caaaacaata tgctccatgg tgtgagcctg tactgaaaac 3540
agcaaggctt gctgttagca tcattaccct gttgaaagag cagagtcgtg cctcaaagct 3600
ttcttttgct gatgtcataa gaaaagtagc tgaatttgac aaaggaaacc ctgcatttat 3660
atcttcaaac atcacacttg ttgagagata tattgtggtg catggacaga taatactcca 3720
gcagtttgca gattttccag atgagactat tcgtcggagt gcatttgtca gtggtctttt 3780
attgaagatg gaacagagga ggcatacaaa gttagttatg aagaaaaaaa ctcaagtgat 3840
gaggggagag aatctgaatc caagtgcagc aatgggtcca gcatcgagga aaaaagcaat 3900
gcgtgcaaca acaaccaggc tcatcaacag aatctggagt gattactatg cacatcattt 3960
ccctgaagat tccaaggagg gagatggaaa tgaaacaaaa gaaattgatg atgaacaaga 4020
agaaaatgaa gatgaggatg ctgaagatga aggacagatt gaggagaaca tctcaaagac 4080
tcctccatca acacggtccc ggaagttgct atcacaaact tgtaaggaaa tcagatggga 4140
aggtgaaaca tctgggaaaa cattgtctgg agaaactcta tataaatgtg cttatgttag 4200
ggaactcaga atacctgttg gtggaacagt ggctctagaa gatgattcag gagacacagt 4260
catttgtttt gttgagtaca tgttccagaa agttgatggt tcaaaaatgg ttcatgggag 4320
gattctgcaa aaagggtcac agacaattct tggcaatgca gcaaatgaga gggaggtttt 4380
cttaactaat gactgcttag aattcaaatt agatgacatc aaggaattgg taatggttga 4440
tatccaatca aggccttggg gtcacaagta cagaaaagag aattctgaag ctgataaagt 4500
tgagcaggtc aaagcagaag agagaaagaa aaagggccag cccatggtat atttctgcaa 4560
aagcttgtac tggcctgaga agggtgcctt ctttgccctc tcccgagata aaatgggtct 4620
tggtagtggt ttatgtagtt cttgtgataa tatagagcca gattctgatg aattgaaaat 4680
attctcgaag accagctttg tctacagaaa ggttacatat aatgtcaatg agtttttata 4740
cataagacct gatttttttg ctgaagatga ggatcgtgca accttcaagg ctggccgaaa 4800
tgtgggtcta aagccctatg cagtttgtca aatattgtcc atccctgaag gggctggatc 4860
taaaaaactc aatccagcat cagcaaatat cagtgctaga agattttaca gaccagatga 4920
catttcatca gccaaagcct atgcatctga catcagagag gtcatctttt ttttctatct 4980
tgtatgcttg atttatctac tccataactt cattgttact ttttctcaaa catgtgagca 5040
aatcctagag tcctgagaat ggtcattctt gtttctttct tgttaacttt agtttgttcg 5100
attcaggtct actatagtga ggatgtaatt gatgtgcctg tggatatgat agagggaaaa 5160
tgtgaggtta gaaagaagaa cgatcttgca agttcagacc ttccagtgat gtttgaacat 5220
gtatttttct gtgaacttat atatgaccgt gccagtggag ctctcaagca ggttagctgt 5280
actgtactga agttgctatt ctgattcatt gagtggcagt tttgatagtt tcctgaatgt 5340
gtgttccatg tctggagcag ttgcctccaa atgttaggtt tatgtctatg gtgcaaagga 5400
caagtgcgtt gaaaaagaac aaaggaaagc agatctgtga gcctgatcaa atagattcag 5460
gtaaatggtt ggatgtgcct aaagagaacc gtctagctac tcttgacatt tttgctggct 5520
gtggaggttt atcagaaggg ctgcagcaag ctggtatgta ttgttaacac tgatgctgta 5580
taccatgaac atgaccaaca aataaaaaat ttcctcattg ttcaatgctg taggtgtatc 5640
ttttacaaaa tgggcgattg aatacgaaga gcctgctggt gaagcattta ataaaaatca 5700
tccagaggct gtggtctttg tagataactg caatgtgatt ctaaagtaag tgcaaattgt 5760
ttgatgccat tattatattt tttgttgttg aacagaacca atatttttgg taatgcaggg 5820
caattatgga taaatgtggg gatactgatg attgtgtttc aacttctgaa gctgctgaac 5880
aagcagcaaa acttccagaa gtgaacatta ataatcttcc agtccctggc gaagttgaat 5940
tcataaatgg tggtcctccg tgtcaggttt gttattatct acagttctat gtataggcca 6000
gaaaatcatc agtcacctgt tcagttttgt cattcaaatg cttgaattgt ttattctttt 6060
gttgtcaggg attctctggg atgaatagat tcaaccaaag cccatggagt aaagttcagt 6120
gtgagatgat tctagcattc ctctcattcg ctgagtattt ccgtcccaga ttctttctgt 6180
tagaaaatgt tcggaacttt gtttccttca acaaagggca gaccttccgt ttggcagttg 6240
catctcttct ggagatggga taccaggtat ttctgttaat tcattatctg ctaagaccta 6300
tagcttacac tttttatggt ggtttaaatc tgtatactta gaaattgttt gccatttggt 6360
taggtccggt ttggaattct agaagcaggg gcttttggtg ttgcccagtc caggaaaagg 6420
gcgtttattt gggctgctgc acctggagag atgcttcctg attggccaga gccgatgcat 6480
gtgtttgcta gccctgagct gaagataaca ctgcctgatg gccaatacta tgcagctgca 6540
agaagcactg ctggtggagc gcctttccga gcgattactg ttagagatac aattggggat 6600
ctgcctaaag tgggaaatgg tgccagcaaa ctcacgcttg aggtaactgg tgcttcttga 6660
tcatctattt ttttcttttc tttgagttat atgctaaatg agctactgat tatcttgtgc 6720
agtatggagg tgagcccgtg tcttggttcc agaagaagat aagagggagt atgatggtac 6780
tgaatgatca catatctaag gagatgaatg agctgaacct aataaggtgt caacacattc 6840
cgaaacggcc gggttgtgat tggcatgacc taccggacga gaaggtaatt ttctgaaatc 6900
tgttgttata ttccttctgt ccatggagca ctgacccttg gcccttgcta ttcttacagg 6960
ttaagctgtc aaatgggcag atggctgacc tgataccttg gtgcctgccc aacacagcca 7020
agaggcacaa tcagtggaaa ggactgtacg ggaggctgga ctgggaaggc aacttcccca 7080
catccgtcac tgatccccag ccaatgggca aggtcggcat gtgcttccac cctgatcaag 7140
acaggatcat cacagtccgg gaatgtgctc ggtcacaggt aagctggtct acatccattt 7200
ccatctgcaa aatgacaatg acactcctgt ctaatatgat ccaatctttg ccgtgcaggg 7260
ctttcctgac agctatgaat ttgcgggcaa catccagaac aagcaccggc agattggcaa 7320
tgccgtgccc ccgcctcttg cctatgcact tgggaggaag ctcaaggaag ccgttgacaa 7380
gcgtcaggaa gccagcgcag gcgtgcctgc accatgagaa gttttccttc catcaaacca 7440
tgacccatga agctaagcgc tgaggtcgtc cttgaggacc agttaatttt ggttttatca 7500
gtcttaatgg actcctgaat gtatatgtta gagaagtgtc gattgttgat tgttaccctg 7560
attcagggta gcggttatat ctaaaaactt gagaaaatct agtgtactct agttgctatg 7620
tgttccattt tgttgactct aaactttcaa ctagttttgg tgattaatga caacatgaga 7680
ttaacttaaa ttttgtagag gtatttaaat taggccacta atagtgacta tttagtcgct 7740
caattttttt gcccctaatt atggaatttg ttttttaaag gatgaacaac aagattaaat 7800
ggattagttc aagtgtcgat tcgggctaag actatccgta gcggtttttt ctaacttttt 7860
ctctatgtgc cacctttata tcatgtcata ctagcaattc taattaattg gttaagggca 7920
tcctattaca tcattgtggt agcattgttt tgggt 7955
<210>46
<211>1522
<212>PRT
<213>稻(oryza sativa)
<220>
<221>肽
<222>(0)...(0)
<223>gi|18653391|gb|AAL77415.1|假定的
胞嘧啶-5DNA甲基转移酶(japonica cultivar-组)
<400>46
Met Asp Thr Cys Leu Tyr Gly Thr Lys Arg Arg Arg Ala Lys Val His
1 5 10 15
Lys Glu Asp Glu Pro Val Glu Asn Glu Asn Leu Glu Ser Glu Phe Asp
20 25 30
Val Ser Lys Lys Glu Ser Asn Gly Ala Thr Glu Pro Gly Asn Glu Pro
35 40 45
Val Ala Ser Lys Arg Pro Lys Arg Ala Ala Ala Cys Ser Asn Phe Lys
50 55 60
Glu Lys Ser Leu Asp Leu Ser Glu Lys Asp Ser Ile Ile Thr Ile Lys
65 70 75 80
Glu Ser Arg Val Glu Glu Lys Glu Ile Glu Ala Val Asn Leu Thr Arg
85 90 95
Thr Gly Pro Glu Asp Gly Gln Pro Cys Arg Lys Ile Ile Asp Phe Ile
100 105 110
Leu His Asp Gly Asp Gly Asn Leu Gln Pro Phe Glu Met Ser Glu Val
115 120 125
Asp Asp Ile Phe Ile Thr Ala Leu Ile Met pro Leu Asp Asp Asp Leu
130 135 140
Glu Lys Asp Arg Gly Lys Gly Ile Cys Cys Ser Gly phe Gly Arg Ile
145 150 155 160
Glu Asn Trp Ala Ile Ser Gly Tyr AspGlu Gly Ala Ala Val Ile Trp
165 170 175
Val Ser Thr Glu Thr Ser Asp Tyr Lys Cys Val Lys Pro Ala Ser Ser
180 185 190
Tyr Arg Ser Tyr Phe Glu His phe Ser Glu Lys Ala Arg Val Cys Val
195 200 205
Glu Val Tyr Lys Lys Leu Ala Arg Ser Val Gly Gly Asn Pro Gln Val
210 215 220
Asp Leu Glu Glu Leu Ile Ala Gly Val Val Arg Ser Ile Asn Ser Asn
225 230 235 240
Arg Ser Phe Asn Gly Thr Val Thr Lys Asp Phe Val Ile Ser Ser Gly
245 250 255
Glu Phe Ile Tyr Lys Gln Leu Ile Gly Leu Asp His Thr Ala Gly Asn
260 265 270
Asp Asp Glu Met Leu Ala Thr Leu Pro Val Leu Val Ala Leu Lys Asp
275 280 285
Glu Cys Lys Ser Arg Ala Gly Phe Thr His Leu Pro Ala Met Pro Ser
290 295 300
Asn Gly Thr Leu Arg Ile Lys Asp Gly Gln Asp Lys Gly Leu Thr Glu
305 310 315 320
Asp Glu Asp Ala Lys Leu Ala Arg Leu Leu Gln Glu Glu Glu Glu Trp
325 330 335
Lys Met Met Lys Glu Arg Gly Lys Arg Gly Thr Ser Gln Lys Asn Ile
340 345 350
Tyr Ile Lys Ile Cys Glu Thr Glu Ile Ala Asn Asp Tyr Pro Leu Pro
355 360 365
Ala Tyr Tyr Lys Pro Tyr Asn Gln Glu Met Asp Glu Tyr Ile Phe Asp
370 375 380
Ser Asp Ile Gly Met Tyr Ser Asp Asp Val Pro Val Arg Ile Leu Asp
385 390 395 400
Asn Trp Ala Leu Tyr Asn Ser Asp Ser Arg Leu Ile Ser Leu Glu Leu
405 410 415
Ile Pro Met Lys Ala Gly Ala Glu Asn Asp Ile Val Val Phe Gly Ser
420 425 430
Gly Phe Met Arg Glu Asp Asp Gly Ser Cys Cys Ser Thr Ala Glu Leu
435 440 445
Ala Gln Leu His Ser Ser Ser Ser Lys Ser Gly Arg Glu Asp Pro Gly
450 455 460
Val Pro Ile Tyr Leu Ser Pro Ile Lys Glu Trp Val Val Glu Phe Gly
465 470 475 480
Gly Ser Met Ile Cys Ile Thr Ile Arg Thr Asp Val Ala Trp Tyr Lys
485 490 495
Leu Arg Gln Pro Thr Lys Gln Tyr Ala Pro Trp Cys Glu Pro Val Leu
500 505 510
Lys Thr Ala Arg Leu Ala Val Ser Ile Ile Thr Leu Leu Lys Glu Gln
515 520 525
Ser Arg Ala Ser Lys Leu Ser Phe Ala Glu Val Ile Lys Lys Val Ala
530 535 540
Glu Phe Asp Ser Arg His Pro Ala Phe Ile Ser Ser Lys Ala Pro Thr
545 550 555 560
Val Glu Arg Tyr Val Val Val His Gly Gln Ile Ile Leu Gln Gln Phe
565 570 575
Ala Asp Phe Pro Asp Glu SerVal Lys Arg Cys Ala Phe Ile Thr Gly
580 585 590
Leu Leu Ala Lys Met Glu Glu Ser Arg His Thr Lys Leu Ala Ile Lys
595 600 605
Lys Lys Ser Gln Gln Met Arg Gly Glu Asn Leu Asn Pro Ser Ala Lys
610 615 620
Met Gly Pro Ile Leu Arg Lys Lys Leu Met Arg Ala Thr Thr Thr Met
625 630 635 640
Leu Ile Ser Lys Ile Trp Gly Glu Tyr Tyr Ala Thr Tyr Phe Pro Gly
645 650 655
Asp Thr Lys Glu Glu Asp Gln Asn Glu Pro Lys Glu Ile Asp Asp Asp
660 665 670
Gln Glu Glu Asn Glu Asp Asn Asp Ala Glu Glu Glu Val Asn Val Gln
675 680 685
Asp Glu Lys Ala Thr Arg Thr Pro Pro Ser Thr Arg Ser Arg Lys Ser
690 695 700
Ser Ala Asp Thr Arg Lys Glu Ile Lys Trp Glu Gly Gln Thr Ala Gly
705 710 715 720
Lys Thr Val Ser Gly Glu Val Leu Tyr Lys Cys Val Ile Val Gln Asp
725 730 735
Leu Ser Ile Ser Val Gly Ala Thr Val Thr Thr Glu Asp Asp Ser Gly
740 745 750
Glu Thr Ile Met Cys Phe Val Glu Tyr Met Tyr Glu Lys Leu Asp Gly
755 760 765
Lys Asn Met Ile His Gly Ile Ile Leu Gln Glu Gly Ser Gln Thr Val
770 775 780
Leu Gly Asn Ala Ala Asn Asp Arg Glu Val Phe Leu Thr Asn Asp Cys
785 790 795 800
Leu Glu Phe Glu Ala Ser Asp Ile Lys Glu Leu Val Thr Val Asn Ile
805 810 815
Gln Ser Leu Pro Trp Gly His Lys Tyr Arg Lys Glu Asn Ser Glu Ala
820 825 830
Lys Arg Ile Glu Lys Ala Lys Ala Glu Glu Arg Lys Arg Lys Gly Leu
835 840 845
Pro Val Glu Tyr Ile Cys Lys Ser Leu Tyr Trp Pro Glu Lys Gly Gly
850 855 860
Phe Phe Ser Leu Pro Tyr Asp Lys Ile Gly Asn Gly Thr Gly Ile Cys
865 870 875 880
Ser Ser Cys Glu Arg Lys Pro Val Gly Asn Glu Phe Lys Leu Leu Ser
885 890 895
Glu Ser Ser Phe Val Phe Glu Asn Ile Thr Tyr Asn Ile His Asp Phe
900 905 910
Leu Tyr Ile Arg Pro Glu Phe Phe Ser Gln Gly Glu Gly His Glu Thr
915 920 925
Tyr Lys Ala Gly Arg Asn Val Gly Leu Lys Pro Tyr Ala Val Cys His
930 935 940
Leu Leu Ser Val His Gly Pro Ala Gly Ser Arg Lys Ala Asn Pro Glu
945 950 955 960
Ser Thr Lys Val Lys Val Arg Arg Phe Tyr Arg Pro Asp Asp Ile Ser
965 970 975
Ser Thr Lys Ala Tyr Ser Ser Asp Ile Arg Glu Val Tyr Tyr Ser Glu
980 985 990
Asp Ile Ile Ser Val Pro Val Val Met Ile Glu Gly Lys Cys Glu Val
995 1000 1005
Arg Leu Lys Asp Asp Leu Pro Asn Ser Asp Leu Pro Ala Val Val Glu
1010 1015 1020
His Val Phe Cys Cys Glu Tyr Leu Tyr Asp Pro Ala Asn Gly Ala Leu
1025 1030 1035 1040
Lys Gln Leu Pro Pro Asn Val Arg Leu Val Thr Leu Thr Arg Lys Val
1045 1050 1055
Pro Ala Ser Lys Lys Asn Lys Gly Lys Gln Ile Cys Asp Ile Glu Leu
1060 1065 1070
Gly Gly Ser Asp Lys Pro Lys Asp Gly Gln Ser Glu Asn Cys Leu Ala
1075 1080 1085
Thr Leu Asp Ile Phe Ala Gly Cys Gly Gly Leu Ser Glu Gly Leu Gln
1090 1095 1100
Arg Ser Gly Leu Ser Leu Thr Lys Trp Ala Ile Glu Tyr Glu Glu Pro
1105 1110 1115 1120
Ala Gly Asp Ala Phe Gly Glu Asn His Pro Glu Ala Ala Val Phe Val
1125 1130 1135
Glu Asn Cys Asn Val Ile Leu Lys Ala Ile Met Asp Lys Cys Gly Asp
1140 1145 1150
Ser Asp Asp Cys Ile Ser Thr Ser Glu Ala Ala Glu Arg Ala Ala Lys
1155 1160 1165
Leu Ser Glu Asp Lys Ile Lys Asn Leu Pro Val Pro Gly Glu Val Glu
1170 1175 1180
Phe Ile Asn Gly Gly Pro Pro Cys Gln Gly Phe Ser Gly Met Asn Arg
1185 1190 1195 1200
Phe Asn Gln Ser Pro Trp Ser Lys Val Gln Cys Glu Met Ile Leu Ala
1205 1210 1215
Phe Leu Ser Phe Ala Glu Tyr Phe Arg Pro Arg Phe Phe Leu Leu Glu
1220 1225 1230
Asn Val Arg Asn Phe Val Ser Phe Asn Lys Gly Gln Thr Phe Arg Leu
1235 1240 1245
Thr Leu Ala Ser Leu Leu Glu Met Gly Tyr Gln Val Arg Phe Gly Ile
1250 1255 1260
Leu Glu Ala Gly Ala Tyr Gly Val Ala Gln Ser Arg Lys Arg Ala Phe
1265 1270 1275 1280
Ile Trp Ala Ala Ala Pro Gly Glu Thr Leu Pro Glu Trp Pro Glu Pro
1285 1290 1295
Met His Val Phe Ala Ser Pro Glu Leu Lys Ile Thr Leu Pro Asp Gly
1300 1305 1310
Lys Phe Tyr Ala Ala Val Lys Ser Thr Ala Ala Gly Ala Pro Phe Arg
1315 1320 1325
Ser Ile Thr Val Arg Asp Thr Ile Gly Asp Leu Pro Ala Val Glu Asn
1330 1335 1340
Gly Ala Gly Lys Pro Thr Ile Gln Tyr Gly Ser Gly Pro Val Ser Tro
1345 1350 1355 1360
Phe Gln Lys Lys Ile Arg Ser Asp Met Ala Ser Leu Asn Asp His Ile
1365 1370 1375
Ser Lys Glu Met Asn Glu Leu Asn Leu Ile Arg Cys Lys His vIle Pro
1380 1385 1390
Lys Arg Pro Gly Cys Asp Trp His Asp Leu Pro Asp Glu Lys Val Lys
1395 1400 1405
Leu Ser Thr Gly Gln Met Val Asp Leu Ile Pro Trp Cys Leu Pro Asn
1410 1415 1420
Thr Ala Lys Arg His Asn Gln Trp Lys Gly Leu Tyr Gly Arg Leu Asp
1425 1430 1435 1440
Trp Glu Gly Asn Phe Pro Thr Ser Val Thr Asp Pro Gln Pro Met Gly
1445 1450 1455
Lys Val Gly Met Cys Phe His Pro Glu Gln Asp Arg Ile Ile Thr Val
1460 1465 1470
Arg Glu Cys Ala Arg Ser Gln Gly Phe Pro Asp Ser Tyr Arg Phe Ala
1475 1480 1485
Gly Asn Ile Gln Asn Lys His Arg Gln Ile Gly Asn Ala Val Pro Pro
1490 1495 1500
Pro Leu Ala Tyr Ala Leu Gly Arg Lys Leu Lys Gln Ala Ile Asp Ala
1505 1510 1515 1520
Lys Arg
<210>47
<211>13449
<212>DNA
<213>稻(oryza sativa)
<220>
<221>misc_feature
<222>(0)...(0)
<223>AF462029.1;GI:18653390(japonica cultivar-组)
假定的胞嘧啶-5DNA甲基转移酶基因,完整的cds
<400>47
tcgaattgga gctcattagg aagttaggca accaaatata tagataatct agtattctgt 60
attcggttgg tccttcttat ttctaagtta ttctggaata agagaggaga aaaatcctaa 120
tgtgggcaca ctgattcatt ccatattttt tatgactctt gcaacgtatt cataaagcaa 180
tagtattgga gtaacaactc tacccagtgt ccaaccaaaa ttttgtaaat ttggggagtc 240
ttactgccaa cagctgccaa tagaaacact tgatctgtta ccagttaatt cttgtaatat 300
cgtgcttcaa ccattgattt atacctttaa ccagtctgca gaaagtgtat acatgaggag 360
ttcgtcgata aggagtcctg ctgctgccct acttgcaaca ttgatctggg gtgtgctccg 420
ttggagaaac tcaggtttgt ctttgtcact gtttccctga aatattttat ctacactatt 480
gcatacttct gaggcttaaa accattgttc agttattgta ttattttctt ttctatgtca 540
tgcccccaat ttgatattgc gatttcaatg cagcgcatgt acatacacct aagaagttag 600
tgacatttgg gagttgggag agttttttct ttaccacctc tgctctcttg catgtacaaa 660
tgagatagtc ttctgtgagc aatcatcctt cctccatcaa tataacaact attgtgtcct 720
gcttttttga tattattatg catacaacac tatgttcttc ctcaattgcc agtgataatc 780
gtgattttgc aataccagcc tagtgctgtg ccactgttct gtttgcatca catggactgt 840
tataacatca taataatctc catctgttca atcttggctt ggatggtctt aaagacttta 900
taattgtttt gtttttgctc ggtttatttt aatgattagt tgtttgttcg ttacaagtta 960
taacttcgaa aggggtgaga tgagcccaaa cttagttcac acgatcacac cttgattaat 1020
ctttagccgc ttgataccga gttatcagca gcatataaca aacaagtttt actttccctt 1080
tttgtgatat tgtagcttat ggtattttca attgtgtttt gttcagagtt gatcacagta 1140
tgcaattcgt aaggtcaagg atatttcctt tcaaacgaag gaaggttgag aacccagaaa 1200
tcatttgtcc agttgcgtca cctgtcaaga ggaaagaaag atcactatct tcacttacaa 1260
tccctgctcc tcaggtgtct atacagaaat gtttgacaaa gaggagaacg aaagcttcgt 1320
gcttacgcaa ctttcctttg gtatgtagtt ctcttgtaat agttcccctg agaatcggat 1380
gtgtgaagaa ttctaatttc ttatctgtaa atccactatt agaattgttc aaatgaatgt 1440
gactctgcat caccagaatg ctgtaagcaa ccctataggt taggtcatac catgtatttc 1500
cattgtttta gtgccataga gccatctaca tggtatgcca gctattgtcg tattattgca 1560
gctggcacaa tcacagatgt gtgctaggta acagttcttg gcactgtccg catgtgttat 1620
tggtatgcta aaggttgtga tagtcaacat gtgaatattt acgggctaat acataacaaa 1680
tggttatctg aatcttctga atttatttat agtccaggcg acattaatga tagatgcaca 1740
atacttttat tttgccttca aaactcaaat agtttcctac aaattgaggc aatccataga 1800
aatttcatac tccaaatacc tttgttccaa agacatatag gaaagtttcg ttaggattga 1860
aattcttcaa aaatcctcca aatacctttt gttgcaaagg agatgttatg catgaattaa 1920
atgcaaccga tgcccctccc ctccaaattc taaatcgatg cgatttttat tgctgttgtg 1980
tagagtattt ttttctggtc tgattttaat gaataaattt tcttttcccc gccttgtgat 2040
gtagcattct acctcgaggg gcagtaaaga tacatcaaag aaacttgggg gttggagacc 2100
attgggttgt caacttaagc ttggcaaaga caaaaaatct ctcaaatcaa gtgtaaaaga 2160
tacaaacaga accaaaagta agtctggtga tacagatgat ggtgctcctg ctagtaaagc 2220
aaaggctaga gaacccttta caagatatgg gcgtgcagct aagaggactg gaagaaagaa 2280
attgctcatg ttgaagaaca aaaagaaaag gttcaaggca aagcagccca gtaaaaagag 2340
gagattccga gcactatggt tttatctact tgctgctttt gaccagtgag aaacactaga 2400
cttgtcttgt gagatttgtg ccgtttcttc aaaggaaaag cagaaatcat gctttgtgtt 2460
tgtaattatt tcaggagagg agtaccaact ttaccacaac taccagcaaa gtatttgagg 2520
atcaagtgag ttatttggtt gcacttttct agtcaaaaca gtatttgtta agccacacta 2580
acaactgcct tttgttatat ttgtcaaagg gatgttgatt tgcctgcttc tattatacag 2640
aagtaccttg cacagaaact taacctctca agtgaaactg aggtatgtct ttacatattt 2700
ctttcttgta gtgatcatca aaaccagtct gttttttgtg ccagtatcca gattagttag 2760
aatcagcagt gagaaagaat tccatcaata tttggctttc ctgccaggga caagactggc 2820
taactggagc taaaatctca caatctaaac atagttagat gtagttatgg cacattctta 2880
gctgttcttt ggaggaaaac aacttacttc atgattatat ttgaaaaggc aagaaaattc 2940
accatactgt ttccaattgg tgcttgattg aaggctgaag ctgaccggtg ttcagttgat 3000
tttgtgctgt agatactttt gccactttgt tttgcgcata accacgtttg tctttgcaat 3060
gaacaacata actgttcgct gcttcaagtt tcatggtgct gcttgacact tgcaacatgt 3120
gaaattatta ccctatattg acaaaagttg acagaacatt ctgctcattg cacagcctat 3180
tttcttctgc tctcaccaac gagtcgtggt tgcaggtaga agtgttgtgt ggtggcaaag 3240
tagtgaacca agggatgaca ctgcatgatc tagcagattg ctggcttgag aaaggaccaa 3300
agagccgaat gcgctcatcg gtaggctccc cggccactgg attcatggtg acattgttct 3360
atagaaggcc agatgtggat gtgtcctcat ccccagctcc accccaacct gacactgaaa 3420
gttgccatag ctgatgcaga gctttgcttc gtttctgaac catctgtgat tggcttcact 3480
cattgggctg tcagcccttg attgatctgc gaatcggttc caatttgtgt gaggctcaag 3540
ccacaccaat tcactaatat gtagataaat gctactaatt ttacaagcca tgttgggctt 3600
ctgatcatct accttcttca accattattt tccttttttt tctcttcaac cattgtacct 3660
aggtatgtga tctgtaatga gatgttaatc gttaagtatc agttgttaga gctagggact 3720
tcatgttgtc tccacgtagc tagtagtact agattatgct tgtgtatttg aatttgctgg 3780
tgcattcatc atgcatgtat atatataagc tctggaattc cttgtagtta actggatgtt 3840
aagctgagaa tgtataagct ctagattatc agtttaccac tccatgcgtg atgaaacgtg 3900
ctactgctta ctctgttccc aaatataaat aacttgaaat tgtttaccag agatcaaggt 3960
ttgaaaagaa aatgctacag tgtttcttat taaggccgag tttagttaca aacttttttt 4020
tttcaaactt ccaacttttc catcacatca aaactttctt acacacataa actttcaact 4080
tttctttcac atcgttccaa tttcaaccaa acttcgaatt ttagcatgaa ctaaacacgc 4140
cctaagtgga gagtggttag ggctcattcg ggatgtaggt tgaacgaaca cagtgattgg 4200
aaaaaaaata ggaatgtgat aggaatacat gtacaaaaca tatgatttga atatacataa 4260
atttcgtagg aacagatggc taggtgaata cacagtacag ggtgtgttta gttcacgcca 4320
aaattggaag tttggttgaa attgaaacga tatgatggaa aagttgaaag tttgtgtgta 4380
taggaaagtt ttgatgtgat ggaaaagttg gaagtttgaa gaaaaagttt ggaattaaac 4440
ttggccacaa tctaattaac gcaatataaa ttcattttgt atttcattct cttctttttc 4500
tgcttattat tattaccccc tatgtttcaa aatgtttgac accgttgact ttttagcacg 4560
tgtttgacca ttcgttttat tcaaaaaatt taagtaatta tttattcttt tcatatcatt 4620
tgattcattg ttaaatatac tttcatgtac acatatagtt ttatatattt tacaaatttt 4680
tttagtaaga caaacggtca aacacgtgct aaaaagtcaa cggtgtaaaa cattttgaaa 4740
tggagagagt attattttta aaaggaagcc aaagtccaaa ctcgaaatat tcggggctcc 4800
cgcccaaggg gtcccgctct ctcgtctcct cgggactcag ccccaaaaac tcaaatcccc 4860
cgccttcttg tcccctccgc ttccccttcc acttccaccc ccacgtcgcc tcacctcgcc 4920
tcctctcccc ctccaaaccc caccaccaca gagaaaaccc cagggagaag gacaagggct 4980
ccacatacca acggcgccct cctcctcgac tagctcccgc cggtaatccc ctctccccct 5040
cctcgcgctg cttcgattgc ttggttcgcg tggcgcgatt gcgcgtgcgg tggtgggttt 5100
tggttggtag ttttgtctct gccttggttg cttgtggggg ttcgtcgctg atcgtggtgt 5160
cgtggggaga gctgatcgcg gtcgcgtgat gcggtgtgtc tgcggcgtct cggtcggctc 5220
gcgtccggct tcacgctgtg ttgttttctg acgcgatcgt acattcgccg agattttttt 5280
tttgggtgta tcggcgtggt gggtgaggcg gcgattttgt tcgtctgctc cgtatcatct 5340
tcgatcgctt gttccactgc tgatgctgtg cgcgagcgtg tcctctattt cgtctgtgcg 5400
aatgtggatg agtccatttg agtttttggt gccatttttt tcatgctccg agtagcatgg 5460
cgctttgatg gttaatgcgg ctggttttgt tgtgcagggg ttgcgaactc ggcgatggat 5520
cagtgacccg ccgtggtgaa gcctgcagat tctacctata aggtatatcg ccccacccct 5580
cttcctcgga tttactagta gctgaattgt tgttgactgg tgaatgataa tcgagcggaa 5640
gctgttcagg attttgcact gctgcttgtt atgctctgtg tggccctcta gtaatgtggc 5700
tttcattaaa tcagtggttg ctgcaccact gttgaaaatg cattgcacta tttacacatt 5760
caacaatcta tgtggtataa tactaatgag aataaagtgt ttattacatt attccattat 5820
ctaaataaat tatttaattt tagtgcactg tagtacgtta cacattcagc aatctatgtg 5880
gggttgttaa cttagcgcat ttgtgtttgc acatgggaat gggatcaact tgtgttgtac 5940
actagtataa ctgtagcttc cttatggagc ctagctcaat atgctataga aaccgtctgc 6000
tgaaactaat aggattctcc aagaaggaga tgatgcggtg atggtgtgtt gctgttatta 6060
ttttcttttt gtaaactgtt tttgatgtca tacaaacttc ttgatgtact tgctaaactc 6120
ttgagttttg cattttggtt cctatatttg ttttgaactt ctaattgaaa tgcctgcttg 6180
attccaaatt tacaggggag ctatggcgaa aagtccacgt tctgttgtta ccacaggtct 6240
tcttctgtct actttgtaac tgcttatctc actttcacat aacatctcat gcatttatga 6300
tttaactaag ttttagtcag atcgtaaggc agtttatgca gtgtagctta cagcttattt 6360
ttaccattgt gagttactga attcaactag agccacacac atatataatt atatgcctgc 6420
atatatcact cataatcact tggagttatc atgtttgatc ttgcttgcaa tctagaatct 6480
tgcaatagct ttctacatat acatgcttga cagttataag taagatgctg atgttgattt 6540
acttgtttat atttttaacg tgcttggttc atctgatgga tacatgcttg tatggtatcc 6600
aattatttca aattgtaata taaccaacac cattttgtct ctcaggaaca aaaaggcgta 6660
gagcaaaggt tcataaagaa gatgagcctg ttgagaatga aaacttggag agtgaatttg 6720
atgtttccaa gaaagagagc aatggtgcca ctgaacctgg taatgagcct gttgccagca 6780
agagaccgaa gagagcagct gcctgttcta acttcaaaga gaagtcattg gacttatcag 6840
aaaaagattc aattatcaca atcaaggaaa gtcgggttga agagaaggaa atagaggctg 6900
ttaatttgac aaggacggga cctgaagatg gtcaaccttg cagaaaaatc atcgatttca 6960
tcttacatga tggagatggt aatctgcaac cctttgaaat gtctgaagtt gatgacattt 7020
tcataacagc tcttatcatg cccttggatg atgatctgga aaaggatagg ggaaagggaa 7080
tatgttgttc ggggtttgga cgaattgaaa actgggcgat ttctggctat gatgaaggtg 7140
ctgcagtaat ttgggtctca acagaaacat cagattacaa atgtgtgaag ccagcaagca 7200
gttacagatc ttattttgaa cactttagtg agaaggcacg tgtctgtgtt gaagtctata 7260
agaagttagc tagatcagtt ggtggaaatc ctcaggtgga cttagaagaa ttaattgctg 7320
gtgttgtccg ttccattaat tcaaacagaa gcttcaacgg aacagtaacc aaagactttg 7380
tgatctcctc tggtgagttc atatataaac agcttattgg attagaccat acagctggca 7440
atgatgatga gatgttggcc acactgccag ttcttgttgc actgaaagat gaatgtaaat 7500
caagagcagg attcacacat ttgccagcta tgccctcgaa tggaactctg aggattaagg 7560
atgggcaaga caagggactg actgaggatg aggatgcaaa attagcaaga ctgttgcagg 7620
aagaggaaga atggaaaatg atgaagcaga gaggcaagcg tggaacttca cagaaaaata 7680
tctacatcaa gatttgtgaa actgaaattg ccaacgacta cccacttcca gcctactata 7740
aaccatataa ccaagaaatg gatgagtaca tatttgatag tgatattggt atgtattctg 7800
atgatgtacc tgtaagaatc cttgacaact gggctctata caattcagat tccagactca 7860
tttctttgga gctcatccct atgaaagctg gtgcagaaaa tgatattgtg gtatttggat 7920
ctggttttat gagagaggat gatggtagtt gctgttcaac agctgagcta gcacagttac 7980
attcttcctc aagtaaatct ggccgggaag atccaggagt tccaatttat ttgagcccaa 8040
ttaaagagtg ggttgtagaa tttggtggtt caatgatctg cataaccatt cgaactgacg 8100
ttgcttggta aataccctgg cagttctatt ttctttttgt attaccatta tctccaaggg 8160
gtaccatatt ttagctttgt tagtcttgat cattgccagc tcatgatgga aaaataaact 8220
caatgcattt cggataacat atcttacaca cacacacaca cacacacgaa tttggcattt 8280
tgtttgaagc atggaatttt gcaaccatgt tgtgtttacc ttctctctaa tttacatctg 8340
gtaatcaatt ccaggtacaa attacgccag ccaacaaagc aatatgctcc atggtgtgag 8400
cctgtgctga aaacagcaag gctagctgtt agtatcatca cccttttaaa agagcaaagt 8460
cgcgcttcaa agctttcttt tgctgaagtt atcaagaaag tagcagaatt tgacagtaga 8520
caccctgcat ttatatcatc gaaagcacca accgttgaaa gatatgtcgt ggtgcatgga 8580
cagataatac ttcagcagtt tgcagacttt ccagatgaat ctgtcaaacg gtgtgccttc 8640
atcacaggtc ttctagcaaa gatggaggaa agtaggcaca caaagttggc catcaagaaa 8700
aaatctcaac agatgagagg ggagaatctg aacccaagcg caaaaatggg tccaatactg 8760
agaaagaagc ttatgcgtgc tacaactaca atgttgatca gcaagatatg gggtgaatac 882O
tatgccactt atttccctgg ggatacaaag gaagaagatc agaatgaacc aaaggaaatt 8880
gatgatgatc aagaagaaaa tgaagacaat gatgctgaag aggaggtaaa tgttcaagat 8940
gagaaggcca caaggactcc accatcaaca cggtctagaa agtcgtcagc agatactcgc 9000
aaggaaatca aatgggaagg tcaaacagct ggaaaaacag tgtctggaga agttctgtac 9060
aaatgtgtta ttgttcaaga cctcagtatt tctgttggtg cgacagtcac aacagaggat 9120
gattcaggag aaaccatcat gtgttttgtt gagtatatgt atgagaaact tgatggtaaa 9180
aatatgattc atgggataat tctgcaagaa ggttcacaga ctgttcttgg caatgctgca 9240
aatgatagag aggttttctt gactaatgac tgtttagaat ttgaagcaag tgacatcaaa 9300
gagttggtga ctgttaatat ccaatcactg ccttggggcc acaagtacag aaaagagaat 9360
tctgaagcta agagaattga aaaggccaag gcagaggaga ggaaaaggaa gggcctgcca 9420
gtggaatata tttgcaaaag cttatactgg cctgagaaag gtggattctt ctcccttccg 9480
tatgataaaa ttggaaatgg cacaggcatc tgtagctcct gtgagagaaa accagttggc 9540
aatgaattca agttactttc tgagagcagc tttgtctttg agaatattac gtataacatc 9600
catgactttc tgtatatcag gcctgaattt ttctcccaag gggagggcca tgagacctac 9660
aaggctggaa ggaatgtggg tctaaaacct tatgcagtct gccatctgct gagtgttcat 9720
ggtcctgctg gatcaaggaa agctaatcca gaatcgacaa aagtgaaagt aagaaggttt 9780
taccgacctg atgacatttc atcaacaaaa gcctactcat cagacatccg agaggtttgc 9840
cttttttcca tcatctgcat cattggcaat actgtgattt cacctaaacc tatctttttt 9900
ggcctttggt atttgattgt tgtgtacttt gtgatttgat ccaggtgtac tacagtgaag 9960
atataataag tgtacctgtg gtgatgatag agggaaaatg tgaggttcga ctgaaggatg 10020
accttccaaa ttcagatctt ccagcggtgg ttgaacatgt cttttgttgt gaatatttat 10080
atgatcctgc taatggagct ctcaaacagg tcagctactg ccaaattttt cttcagaatc 10140
cctagttatc tgcattgttt ccactgggag atgtctttgt attattgacc gagcttgtct 10200
tgcatgatct ttaaccagct accgcccaat gttagacttg tgacactgac aaggaaggta 10260
cctgcttcaa aaaagaacaa aggaaagcaa atttgtgaca ttgagctagg tggttcagac 10320
aaaccaaagg atgggcaatc agagaactgt cttgcaacac ttgacatttt tgctggttgt 10380
ggaggtttat ctgaaggatt gcagcgatca ggtatgcttt gctcatgtag atgttgcttc 10440
ataggaacat tttgactcca gttaccttct gaccattgga ttgtacagga ttgtcactta 10500
ctaaatgggc tattgaatat gaagaacctg ctggggatgc atttggtgaa aaccatccag 10560
aagctgcagt atttgtcgaa aactgcaatg tgattctgaa gtacgccatt tttgtttacc 10620
ctctttgata tgcttatcat gtatatgtaa attgtatctt cagcacgtat ctctatacga 10680
tcatgcaggg caattatgga caagtgtggt gattctgatg attgcatctc cacttctgag 10740
gctgctgaac gagcagctaa actttctgag gacaagatta agaatctgcc cgtgcctggc 10800
gaagtagaat tcataaatgg tggccctccg tgtcaggtca gttgctatgt ggcttttgcc 10860
tgtataccag ggagctccta acaacacatt cgacattgca agccaattgc ttgacctttt 10920
gacctatcct tttttagggt ttttctggga tgaacagatt caatcaaagt ccctggagca 10980
aagtccagtg cgagatgatc ttagcattcc tgtcatttgc ggagtatttc cgtcctagat 11040
tctttctctt agaaaatgtt aggaactttg tctcgttcaa caaaggacag accttcagat 11100
tgacactggc atcactcctg gagatgggat accaggtgct tgacacttcc tcttcacttg 11160
tgcttgtgct atagcatttc catttctgta tacattctaa ccttgtttac atgttcttag 11220
gtccgatttg gaattttaga ggcaggggct tatggtgttg cgcagtccag gaaaagggca 11280
ttcatttggg ccgctgcacc tggagagact cttccagagt ggcctgaacc aatgcacgtc 11340
tttgctagcc ctgagctgaa aataactcta cctgatggca agttctacgc cgctgtcaag 11400
agcaccgctg caggagcccc tttccgctca attacagttc gagatacaat tggggatcta 11460
ccagctgtgg aaaatggcgc cggcaaacca acaattcagg tataccctac atatcgcact 11520
agcttcactc gccaagttct cctgttctta agctgccgct ttatgtcagt tgaataaact 11580
ttgtatgatg tgctacagta cggaagcggt cctgtgtctt ggttccagaa gaagattaga 11640
agcgacatgg cttcactgaa tgaccacata tctaaagaga tgaatgagct gaacctcata 11700
agatgcaagc acattccaaa gcgcccaggt tgcgactggc atgacctgcc agatgaaaag 11760
gtactaacat ttggccctct aattaacttc tcctgcctcc tgttttattt ttaaactctg 11820
taaacaccaa ttactgttca ttgactgtgc aagtacaggt gaagctgtcc acagggcaga 11880
tggtggactt gatcccttgg tgcttgccca acacagccaa aaggcacaat cagtggaaag 11940
gactgtacgg taggttggac tgggagggca atttccccac ttctgtaacg gatcctcagc 12000
caatggggaa ggtcggcatg tgcttccatc ctgagcagga caggatcatt actgtccgtg 12060
aatgtgctcg atcccaggta cacataccaa ttttcacacc ccatacattc actgctgcaa 12120
caggttaatg atgcttaact aatcatcaag tcattgacta acccaaacaa acaaattttc 12180
aggaagtttt atccttcaaa gtaaatttag tactacattt tgtctcaatc agcactgtag 12240
cagtagattt agttctttaa ccataaatca atggatatat tgtcatctct cttttcggca 12300
gaactgcttt gtccattcct tcttgaacct gttcaaacat gcattcattc taccgagatg 12360
ccattattgc atctgcaact ttgttgccct ttttctgaat cttctgatct gtttctgaat 12420
cttctgatct gttcctacat gacactgtca ccattgtatg cacgcagggc ttccccgata 12480
gctaccgttt cgctggcaac atccagaaca agcacaggca gatcgggaat gccgtgccac 12540
cgccccttgc ctatgccctc gggaggaagc tcaagcaagc catcgacgcc aagcgttgag 12600
tggcttttaa cttcactgca tcgccctcat tttttggtcg gtccaaatag gtttaactaa 12660
gcattacagt tttctatatt ttgtgagcaa ttggactcct aaaattaatt ctgggatggt 12720
tacatggatt accttttgta tatctaactt gctggtagga ctctgatacc atcaagatat 12780
tggttcatag aactatagaa gttcagaaga gaatcatagc actggggggg ggggggatag 12840
aaagcttttg taaacagtac aactcttatt aatatgactg caatatgatg aggattagca 12900
taatcagaat taattctcgt tttccagagt tgtgtattgg caaactggca atatcagctt 12960
ttgtgctagg caaacatgtc cctgcttcag gtcagtgcca cttgataata tacagctttc 13020
ttacacagct aattttttca aaataaatcc ttttcttgac ctgttggttt attcatatga 13080
acattcgatg tattgcattt tgatcttgat gttatgttca gttcacaact tgatttttct 13140
ttctttcttt ttattttgag aagggaagga tggatggctt acagttaggc aggctgacaa 13200
ttttcctcca aagcaacttg aaatcatcat aatcagccca aaaaattcac ccaaatgagc 13260
atactacatc aaacaaatgt aaaactccct tgaaaaatga aaacgaaaat tctatacaca 13320
acattgcaag ctacagaaat ccaagaacac aagcacaaga tcagaatcac atcaagaatc 13380
ctcttagaag aagaaaaaaa aacaccttcg tctcatctca tttcagtgtg ttgatgcttc 13440
ttcatcttg 13449
<210>48
<211>284
<212>PRT
<213>Marchantia paleacea var.diptera
<220>
<221>肽
<222>(0)...(0)
<223>gi|24416628|dbj|BAC22505.1|胞嘧啶甲基转移酶
<400>48
Gln Arg Val Trp Ser Lys Val Gln Cys Glu Met Ile Leu Ala Phe Leu
1 5 10 15
Ser Tyr Ala Asp Tyr Phe Arg Pro Arg Tyr Phe Leu Leu Glu Asn Val
20 25 30
Arg Asn Phe Val Ser Phe Asn Lys Gly Gln Thr Phe Arg Leu Thr Met
35 40 45
Ala Ser Leu Leu Glu Met Gly Tyr Gln Val Arg Phe Gly Val Leu Gln
50 55 60
Ala Gly Asn Phe Gly Val Ser Gln Ser Arg Lys Arg Ala Phe Ile Trp
65 70 75 80
Ala Ala Ala Pro Asp Glu Ser Leu Pro Asp Trp Pro Glu Ala Arg His
85 90 95
Val Ser Ala Ser Ser Gln Leu Gly Val Thr Leu Pro Gly Gly Gly Gln
100 105 110
Tyr Ala Ala Val Arg Asp Ala Gly Leu Gly Ala Pro Phe Arg Ala Ile
115 120 125
Thr Val Arg Asp Thr Ile Ala Asp Leu Pro Pro Val Ala Asn Gly Ala
130 135 140
Asp Thr Leu Lys Thr Val Tyr Thr Gln Pro Ala Glu Ser Trp Phe Gln
145 150 155 160
Met His Ile Arg Gly Lys Thr Asp Val Leu Thr Asp His Ile Ser Lys
165 170 175
Glu Met Asn Glu Leu Asn Leu Ile Arg Cys Gln Arg Ile Pro Lys Arg
180 185 190
Pro Gly Ala Asp Cys Arg Asp Leu Pro Ala Glu Lys Ile Lys Leu Ser
195 200 205
Thr Gly Gln Leu Val Asp Leu Ile Pro Trp Cys Leu Pro Asn Thr Ala
210 215 220
Ala Arg His Asn Gln Trp Lys Gly Leu Phe Gly Arg Leu Asp Trp Asp
225 230 235 240
Gly Asn Phe Pro Thr Ser Ile Thr Asp Pro Gln Pro Met Gly Lys Val
245 250 255
Gly Met Cys Phe His Pro Val Gln Asn Arg Ile Val Thr Val Arg Glu
260 265 270
Cys Ala Arg Ser Gln Gly Phe Pro Asp Ser Tyr Lys
275 280
<210>49
<211>855
<212>DNA
<213>Marchantia paleacea vard.iptera
<220>
<221>misc_feature
<222>(0)...(0)
<223>AB080617.1;GI:24416627;胞嘧啶甲基转移酶的基因,部分的cds
<400>49
tcaaagagta tggtctaaag tacaatgtga gatgattcta gcgttcttat cctacgccga 60
ctatttccgt cctcgatact tcttgcttga aaatgttcgg aacttcgtgt cattcaacaa 120
gggccaaact ttcagattaa caatggcctc tctcctcgag atgggttatc aggtacgctt 180
tggcgtccta caagctggga actttggtgt ttctcagtct aggaagaggg cattcatctg 240
ggcagcagct ccagatgagt cattaccaga ttggcctgag gccagacacg tctctgcaag 300
ctcacaacta ggagtaactt tgcctggtgg tgggcagtac gccgcagtga gagacgcagg 360
gctgggtgcc cctttcaggg ccattactgt cagagacaca atcgctgacc ttcccccggt 420
ggctaacggt gctgacaccc taaagacagt ctatacccaa cctgctgagt cgtggtttca 480
aatgcatatt agagggaaga ccgacgtatt gactgatcac atttccaagg aaatgaatga 540
actgaatttg attcgctgcc agcgtattcc caaaaggccc ggggccgatt gccgggatct 600
tcctgccgag aagattaaat tgtccacagg acaactggtc gacctcatac cctggtgcct 660
gcctaatacg gccgctcggc acaaccagtg gaagggtctc tttggacgtc ttgattggga 720
cggcaatttt cccacttcga tcaccgatcc tcagcccatg gggaaagtag gaatgtgctt 780
ccatcccgtt caaaatcgaa ttgtcacagt ccgagagtgt gcccgctctc aggggtttcc 840
ggattcctat aagtt 855
<210>50
<211>372
<212>PRT
<213>人工序列
<220>
<223>共有序列
<221>变体
<222>4
<223>Xaa=Gly或Cys
<221>变体
<222>7
<223>Xaa=Gln或Thr
<221>变体
<222>11,33,167,224,268,271,324,372
<223>Xaa=Ile,Leu,Val,或Met
<221>变体
<222>14,30,139,155,162,195,203,342
<223>Xaa=Thr,Gly,或Ala
<221>变体
<222>17
<223>Xaa=Asn或Ala
<221>变体
<222>24
<223>Xaa=Thr或Asp
<221>变体
<222>26,218,281
<223>Xaa=Glu或Lys
<221>变体
<222>27,165,296,340
<223>Xaa=Gln或Glu
<221>变体
<222>28
<223>Xaa=Lys或Ile
<221>变体
<222>90,344
<223>Xaa=Thr或Asn
<221>变体
<222>101,187,267
<223>Xaa=Arg或Gln
<221>变体
<222>142,270,319
<223>Xaa=Glu或Asp
<221>变体
<222>156,225
<223>Xaa=Val或Ala
<221>变体
<222>158
<223>Xaa=Lys或Glu
<221>变体
<222>166
<223>Xaa=Gly或Asn
<221>变体
<222>168
<223>Xaa=His或Gln
<221>变体
<222>177
<223>Xaa=Leu或Asn
<221>变体
<222>181
<223>Xaa=Phe或Leu
<221>变体
<222>183
<223>Xaa=Pro或Set
<221>变体
<222>197
<223>Xaa=Glu或Gly
<221>变体
<222>200
<223>Xaa=Asp或Ala
<221>变体
<222>202,254
<223>Xaa=His,Lys,或Arg
<221>变体
<222>205
<223>Xaa=Lys或Met
<221>变体
<222>208,243
<223>Xaa=Lys或Gln
<221>变体
<222>209
<223>Xaa=Glu或Asn
<221>变体
<222>210
<223>Xaa=Val或Asp
<221>变体
<222>211
<223>Xaa=Ala或Pro
<221>变体
<222>222
<223>Xaa=Asn或Asp
<221>变体
<222>223
<223>Xaa=Thr或Met
<221>变体
<222>231
<223>Xaa=Cys或Ser
<221>变体
<222>233
<223>Xaaa=Ala或Glu
<221>变体
<222>244,258,369,370
<223>Xaa=任何氨基酸
<221>变体
<222>247,262
<223>Xaa=Thr或Lys
<221>变体
<222>251
<223>Xaa=Ala或Cys
<221>变体
<222>259
<223>Xaa=Arg或Glu
<221>变体
<222>264
<223>Xaa=Ser或Asn
<221>变体
<222>265
<223>Xaa=Asp或Ser
<221>变体
<222>269
<223>Xaa=Glu或Val
<221>变体
<222>274
<223>Xaa=Phe,Tyr,或Trp
<221>变体
<222>285
<223>Xaa=Gly或Gln
<221>变体
<222>321
<223>Xaa=His或Asp
<221>变体
<222>347
<223>Xaa=His或Ser
<400>372
Met Glu Lys Xaa Gly Asp Xaa Asp Asp Cys Xaa Ser Thr Xaa Glu Ala
1 5 10 15
Xaa Glu Leu Ala Ala Lys Leu Xaa Glu Xaa Xaa Xaa Ser Xaa Leu Pro
20 25 30
Xaa Pro Gly Gln Val Asp Phe Ile Asn Gly Gly Pro Pro Cys Gln Gly
35 40 45
Phe Ser Gly Met Asn Arg Phe Asn Gln Ser Ser Trp Ser Lys Val Gln
50 55 60
Cys Glu Met Ile Leu Ala Phe Leu Ser Phe Ala Asp Tyr Phe Arg Pro
65 70 75 80
Arg Tyr Phe Leu Leu Glu Asn Val Arg Xaa Phe Val Ser Phe Asn Lys
85 90 95
Gly Gln Thr Phe Xaa Leu Thr Leu Ala Ser Leu Leu Glu Met Gly Tyr
100 105 110
Gln Val Arg Phe Gly Ile Leu Glu Ala Gly Ala Tyr Gly Val Ser Gln
115 120 125
Ser Arg Lys Arg Ala Phe Ile Trp Ala Ala Xaa Pro Glu Xaa Val Leu
130 135 140
Pro Glu Trp Pro Glu Pro Met His Val Phe Xaa Xaa Pro Xaa Leu Lys
145 150 155 160
Ile Xaa Leu Ser Xaa Xaa Xaa Xaa Tyr Ala Ala Val Arg Ser Thr Ala
165 170 175
Xaa Gly Ala Pro Xaa Arg Xaa Ile Thr Val Xaa Asp Thr Ile Gly Asp
180 185 190
Leu Pro Xaa Val Xaa Asn Gly Xaa Ser Xaa Xaa Asn Xaa Glu Tyr Xaa
195 200 205
Xaa Xaa Xaa Val Ser Trp Phe Gln Lys Xaa Ile Arg Gly Xaa Xaa Xaa
210 215 220
Xaa Leu Thr Asp His Ile Xaa Lys Xaa Met Asn Glu Leu Asn Leu Ile
225 230 235 240
Arg Cys Xaa Xaa Ile Pro Xaa Arg Pro Gly Xaa Asp Trp Xaa Asp Leu
245 250 255
Pro Xaa Xaa Lys Val Xaa Leu Xaa Xaa Gly Xaa Xaa Xaa Xaa Xaa Ile
260 265 270
Pro Xaa Cys Leu Pro Asn Thr Ala Xaa Arg His Asn Xaa Trp Lys Gly
275 280 285
Leu Tyr Gly Arg Leu Asp Trp Xaa Gly Asn Phe Pro Thr Ser Val Thr
290 295 300
Asp Pro Gln Pro Met Gly Lys Val Gly Met Cys Phe His Pro Xaa Gln
305 310 315 320
Xaa Arg Ile Xaa Thr Val Arg Glu Cys Ala Arg Ser Gln Gly Phe Pro
325 330 335
Asp Ser Tyr Xaa Phe Xaa Gly Xaa Ile Xaa Xaa Lys His Arg Gln Ile
340 345 350
Gly Asn Ala Val Pro Pro Pro Leu Ala Phe Ala Leu Gly Arg Lys Leu
355 360 365
Xaa Xaa Ala Xaa
370
Claims (72)
1.一种用于产生种子的方法,所述方法包括容许第一植物给第二植物授粉的步骤,所述第一植物具有第一重组核酸构建体,所述第一重组核酸构建体包括与有效增加胞嘧啶DNA甲基化水平的第一核酸序列可操作连接的雄性配子体组织特异性调节元件,所述第二植物具有第二重组核酸构建体,所述第二重组核酸构建体包括与有效减少胞嘧啶DNA甲基化水平的第二核酸序列可操作连接的雌性配子体组织特异性调节元件,其中在所述第二植物上发育的种子具有的平均种子重量,与在相应的第二植物上发育的种子的平均种子重量相比,重量增加,所述相应的第二植物缺乏所述第二重组核酸构建体,并且被缺乏所述第一重组核酸构建体的相应的第一植物进行授粉。
2.权利要求1的方法,其中所述第一植物是近亲交配的,杂交的,异质的种群或人造的种群。
3.权利要求1的方法,其中所述第二植物是近亲交配的,杂交的,异质的种群,或人造的种群。
4.权利要求1的方法,其中所述第一植物对于所述重组核酸构建体是杂合的。
5.权利要求1的方法,其中所述第一植物对于所述重组核酸构建体是纯合的。
6.权利要求1的方法,其中所述第二植物对于所述重组核酸构建体是杂合的。
7.权利要求1的方法,其中所述第二植物对于所述重组核酸构建体是纯合的。
8.权利要求1的方法,其中所述第一和第二植物是双子叶植物。
9.权利要求8的方法,其中所述第一重组核酸构建体的所述第一核酸序列编码胞嘧啶DNA甲基转移酶,所述甲基转移酶包含具有在SEQ IDNO:50中提出的序列的多肽区域。
10.权利要求8的方法,其中所述第一重组核酸构建体的所述第一核酸序列编码胞嘧啶DNA甲基转移酶,所述甲基转移酶与在SEQ ID NOS:28,30,34,36,38,和40中提出的序列之一具有50%或更大的序列同一性。
11.权利要求8的方法,其中所述第二重组核酸构建体的所述第二核酸序列被转录为干扰RNA。
12.权利要求8的方法,其中所述第二重组核酸构建体的所述第二核酸序列被转录为反义核酸。
13.权利要求1的方法,其中所述第一和第二植物是单子叶植物。
14.权利要求13的方法,其中所述第一重组核酸构建体的所述第一核酸序列编码胞嘧啶DNA甲基转移酶,所述甲基转移酶与在SEQ ID NOS:44和46中显示的氨基酸序列之一具有50%或更大的序列同一性。
15.权利要求14的方法,其中第一核酸序列与在SEQ ID NOS:44和46中显示的氨基酸序列之一具有80%或更大的序列同一性。
16.权利要求15的方法,其中第一核酸序列具有在SEQ ID NO:44中提出的氨基酸序列。
17.权利要求15的方法,其中第一核酸序列具有在SEQ ID NO:46中提出的氨基酸序列。
18.权利要求13的方法,其中所述第一和第二植物是玉米或水稻植物。
19.权利要求1的方法,其中所述雄性配子体组织特异性调节元件包括在SEQ ID NO:8中提出的序列。
20.权利要求1的方法,其中在所述被授粉的植物上发育的种子的平均种子重量比在相应的第二植物上发育的种子的平均种子重量至少大10%,所述相应的第二植物缺乏所述第二重组核酸构建体,并且被相应的缺乏所述第一重组核酸构建体的第一植物所授粉。
21.权利要求20的方法,其中在所述被授粉的植物上发育的种子的平均种子重量比在相应的第二植物上发育的种子的平均种子重量多大约10%到约50%,所述相应的第二植物缺乏所述第二重组核酸构建体,并且被缺乏第一重组核酸构建体的相应第一植物所授粉。
22.一种用于产生种子的方法,所述方法包括容许第一植物给第二植物授粉的步骤,所述第一植物具有重组核酸构建体,所述重组核酸构建体包括与有效减少胞嘧啶DNA甲基化水平的第一核酸序列可操作连接的雄性配子体组织特异性调节元件,其中在所述第二植物上发育的种子具有的平均种子重量,与在相应的第二植物上发育的种子的平均种子重量相比,重量减少,所述相应的第二植物被缺乏所述重组核酸构建体的相应的第一植物进行授粉。
23.一种用于产生种子的方法,所述方法包括容许植物授粉的步骤,所述植物具有重组核酸构建体,所述构建体包括与有效减少胞嘧啶DNA甲基化水平的核酸序列可操作连接的雌性配子体组织特异性调节元件,所述授粉以缺乏所述重组核酸构建体的花粉进行,其中在所述植物上发育的种子具有的平均种子重量与在相应的植物上发育的种子的平均种子重量相比,重量增加,所述相应的植物缺乏所述重组核酸构建体,并且由缺乏所述重组核酸构建体的植物进行授粉。
24.权利要求23的方法,其中所述被授粉的植物是双子叶植物。
25.权利要求24的方法,其中所述调节元件是选自由SEQ ID NOS:6,25,和22组成的组的雌性配子体组织特异性启动子。
26.权利要求24的方法,其中所述有效减少胞嘧啶DNA甲基化的水平的核酸序列被转录成干扰RNA。
27.权利要求26的方法,其中所述核酸序列具有10个核苷酸到4,500个核苷酸的长度,并且与在SEQ ID NOS:29,31,33,35,37,39,41中提出的核酸序列之一,或其互补序列之一具有70%或更大的序列同一性。
28.权利要求27的方法,其中所述核酸序列具有20个核苷酸到1,000个核苷酸的长度,并且与在SEQ ID NOS:29,31,33,35,37,39,41中提出的核酸序列之一,或其互补序列之一具有80%或更大的序列同一性。
29.权利要求23的方法,其中有效减少胞嘧啶DNA甲基化水平的所述核酸序列被转录为反义核酸。
30.权利要求23的方法,其中所述被授粉的植物是单子叶植物。
31.权利要求30的方法,其中所述有效减少胞嘧啶DNA甲基化水平的所述核酸序列被转录为干扰RNA。
32.权利要求31的方法,其中所述核酸序列具有10个核苷酸到4,500个核苷酸的长度,并且与在SEQ ID NOS:43,45,47,49中提出的核酸序列之一,或其互补序列之一具有70%或更大的序列同一性。
33.权利要求32的方法,其中所述核酸具有20个核苷酸到1,000个核苷酸的长度,并且与在SEQ ID NOS:43,45,47,49中提出的核酸序列之一,或其互补序列之一具有80%或更大的序列同一性。
34.权利要求30的方法,其中有效减少胞嘧啶DNA甲基化水平的所述核酸序列被转录成反义核酸。
35.权利要求23的方法,其中所述授粉以来自非转基因植物的花粉进行。
36.一种用于产生种子的方法,所述方法包括容许植物授粉的步骤,所述植物具有重组核酸构建体,所述重组核酸构建体包括与有效增加胞嘧啶DNA甲基化水平的核酸序列可操作连接的雌性配子体组织特异性调节元件,所述授粉以缺乏所述重组核酸构建体的花粉进行,其中在所述植物上发育的种子具有的平均种子重量与在相应的植物上发育的种子的平均种子重量相比,重量减少,所述相应的植物缺乏所述重组核酸构建体,并被缺乏所述重组核酸构建体的植物进行授粉。
37.一种用于产生种子的方法,所述方法包括容许第一植物给第二植物授粉的步骤,所述第一植物具有重组核酸构建体,所述重组核酸构建体包括与有效增加胞嘧啶DNA甲基化水平的核酸序列可操作连接的雄性配子体组织特异性调节元件,其中在所述第二植物上发育的种子的平均种子重量与在相应的植物上发育的种子的平均种子重量相比,重量增加,所述相应的植物被缺乏或不表达所述重组核酸构建体的植物所授粉。
38.权利要求37的方法,其中所述第一和第二植物是双子叶植物。
39.权利要求38的方法,其中有效增加胞嘧啶DNA甲基化水平的所述核酸序列编码胞嘧啶DNA甲基转移酶,所述甲基转移酶包括具有在SEQ ID NO:50中显示的氨基酸序列的多肽区域。
40.权利要求37的方法,其中所述雄性配子体组织特异性调节元件是SEQ ID NO:8拟南芥YP0180启动子。
41.权利要求37的方法,其中所述第一和第二植物是单子叶植物。
42.权利要求41的方法,其中所述核酸序列编码胞嘧啶DNA甲基转移酶,所述甲基转移酶与在SEQ ID NO:44和SEQ ID NO:46中显示的氨基酸序列之一具有50%或更大的序列同一性。
43.权利要求37的方法,其中在所述被授粉的植物上发育的种子的平均种子重量比所述相应的植物上发育的种子的平均种子重量多至少10%,所述相应的植物缺乏所述重组核酸构建体。
44.权利要求43的方法,其中在所述被授粉的植物上发育的种子的平均种子重量比在所述相应的植物上发育的种子的平均种子重量多约10%到约50%,所述相应的植物缺乏所述重组核酸构建体。
45.一种用于产生种子的方法,所述方法包括容许在多种植物中授粉的步骤,所述多种植物包括多种第一植物,所述第一植物的每一种都具有第一重组核酸构建体,所述第一重组核酸构建体包括与有效增加胞嘧啶DNA甲基化水平的核酸序列可操作连接的雄性配子体组织特异性调节元件,其中在授粉后在所述第一植物上发育的种子的平均种子重量与在相应的植物上发育的种子的平均种子重量相比,重量增加,所述相应的植物缺乏所述重组核酸构建体。
46.权利要求45的方法,其中所述授粉主要是自花授粉。
47.权利要求45的方法,其中所述多种第一植物是双子叶植物。
48.权利要求45的方法,其中所述多种植物还包含多种第二植物,所述第二植物具有第二重组核酸构建体,所述第二重组核酸构建体包括与有效减少胞嘧啶DNA甲基化水平的核酸序列可操作连接的雌性配子体组织特异性调节元件,并且其中在授粉后在所述第二植物上发育的种子的平均种子重量与在相应的植物上发育的种子的平均种子重量相比,重量增加,所述相应的植物缺乏所述重组核酸构建体。
49.权利要求48的方法,其中所述第一和第二植物是单子叶植物。
50.权利要求49的方法,其中所述多种植物还包括多种第二植物,所述第二植物具有重组的核酸构建体,所述核酸构建体包括与有效减少胞嘧啶DNA甲基化水平的核酸序列可操作连接的雌性配子体组织特异性调节元件,并且其中在授粉后在所述第二植物上发育的种子的平均种子重量与在相应的植物上发育的种子的平均种子重量相比,重量增加,所述相应的植物缺乏所述重组核酸构建体。
51.权利要求45的方法,其中在所述授粉的植物上发育的种子的平均种子重量比在所述相应的植物上发育的种子的平均种子重量多至少10%,所述相应的植物缺乏所述重组核酸构建体。
52.权利要求51的方法,其中在所述授粉的植物上发育的种子的平均种子重量比在所述相应的植物上发育的种子的平均种子重量多约10%到约50%,所述相应的植物缺乏所述重组核酸构建体。
53.一种转基因宿主细胞,其包括重组核酸构建体,所述重组核酸构建体包括有效减少胞嘧啶DNA甲基化水平的核酸序列,所述核酸序列与一个或多个调节元件可操作地连接,所述调节元件赋予植物雌性配子体细胞类型中的转录。
54.权利要求53的宿主细胞,其中所述一个或多个调节元件包括在SEQ ID NOS:6,22,和25中提出的序列之一。
55.一种转基因宿主细胞,其包括重组核酸构建体,所述构建体包括有效减少胞嘧啶DNA甲基化水平的核酸序列,所述核酸序列与一个或多个调节元件可操作地连接,所述调节元件在植物雄性配子体细胞类型中赋予转录。
56.权利要求55的宿主细胞,其中所述一个或多个调节元件包括在SEQ ID NO:8中提出的序列。
57.一种转基因植物,其包括重组核酸构建体,所述重组核酸构建体包括有效减少胞嘧啶DNA甲基化水平的核酸序列,所述核酸序列与一个或多个调节元件可操作地连接,所述调节元件在雌性配子体细胞类型中赋予转录。
58.权利要求57的植物,其中所述一个或多个调节元件相对于与卵细胞,合子和胚胎,在极性细胞核和中央细胞中赋予优先转录。
59.权利要求57的植物,其中所述一个或多个调节元件包括选自SEQID NOS:6-27的序列。
60.权利要求57的植物,其中所述植物是双子叶植物。
61.权利要求60的植物,其中有效减少胞嘧啶DNA甲基化水平的所述核酸序列被转录为干扰RNA。
62.权利要求61的植物,其中所述核酸序列具有10个核苷酸到4,500个核苷酸的长度并与在SEQ ID NOS:29,31,33,35,37,39和41中提出的核酸序列之一,或其互补序列之一具有70%或更多的序列同一性。63.权利要求62的植物,其中所述核酸具有20个核苷酸到1,000个核苷酸的长度,并且与在SEQID NOS:29,31,33,35,37,39,41中提出的核酸序列之一或其互补序列之一具有80%或更多的序列同一性。
64.权利要求60的植物,其中有效减少胞嘧啶DNA甲基化水平的所述核酸序列被转录为反义核酸。
65.权利要求57的植物,其中所述植物是单子叶植物。
66.权利要求65的植物,其中所述有效减少胞嘧啶DNA甲基化水平的核酸序列被转录为干扰RNA。
67.权利要求66的植物,其中所述核酸序列具有10个核苷酸到4,500个核苷酸的长度,并且与在SEQ ID NOS:43,45,47,49中提出的序列之一或其互补序列之一具有70%或更多的序列同一性。
68.权利要求67的植物,其中所述核酸具有20个核苷酸到1,000个核苷酸的长度,并且与在SEQ ID NOS:43,45,47,49中提出的核酸序列之一或其互补序列之一具有80%或更多的序列同一性。
69.权利要求65的植物,其中所述有效减少胞嘧啶DNA甲基化水平的核酸序列被转录为反义核酸。
70.一种转基因植物,其包括重组核酸构建体,所述重组核酸构建体包括有效减少胞嘧啶DNA甲基化水平的核酸序列,所述核酸序列与一个或多个调节元件可操作地连接,所述调节元件在雄性配子体细胞类型中赋予转录。
71.一种制品,其包括包装材料,和在所述包装材料中的至少第一类型的种子和第二类型的种子,其中所述第二类型的所述种子具有重组核酸构建体,所述重组核酸构建体包括与有效减少胞嘧啶DNA甲基化水平的核酸序列可操作连接的雌性配子体组织特异性调节元件。
72.权利要求71的制品,其中所述第一类型的种子是非转基因种子。
73.权利要求71的制品,其中所述种子是玉米种子。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US51092403P | 2003-10-14 | 2003-10-14 | |
US60/510,924 | 2003-10-14 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101031650A true CN101031650A (zh) | 2007-09-05 |
Family
ID=34465166
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2004800373016A Pending CN101031650A (zh) | 2003-10-14 | 2004-10-14 | 改变种子表型的方法和组合物 |
Country Status (7)
Country | Link |
---|---|
US (1) | US20050081261A1 (zh) |
EP (1) | EP1687438A4 (zh) |
CN (1) | CN101031650A (zh) |
AU (1) | AU2004282575A1 (zh) |
BR (1) | BRPI0415431A (zh) |
CA (1) | CA2542451A1 (zh) |
WO (1) | WO2005038040A2 (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108882691A (zh) * | 2015-11-18 | 2018-11-23 | 联邦科学技术研究组织 | 具有增厚的糊粉层的水稻谷粒 |
CN109288117A (zh) * | 2018-10-22 | 2019-02-01 | 福建中烟工业有限责任公司 | 一种组合物及其在卷烟中的应用 |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7169915B2 (en) * | 2003-10-14 | 2007-01-30 | Ceres, Inc. | Promoter, promoter control elements, and combinations, and uses thereof |
CA2598436A1 (en) * | 2005-02-22 | 2006-08-31 | Ceres, Inc. | Modulating plant alkaloids |
WO2006113481A1 (en) * | 2005-04-14 | 2006-10-26 | Ceres Inc. | Secondary metabolite production via manipulation of genome methylation |
US7312376B2 (en) * | 2005-04-20 | 2007-12-25 | Ceres, Inc. | Regulatory regions from Papaveraceae |
WO2006133461A1 (en) * | 2005-06-08 | 2006-12-14 | Ceres Inc. | Identification of terpenoid-biosynthesis related regulatory protein-regulatory region associations |
WO2007041536A2 (en) * | 2005-09-30 | 2007-04-12 | Ceres, Inc. | Modulating plant tocopherol levels |
US20090178160A1 (en) * | 2005-10-25 | 2009-07-09 | Joon-Hyun Park | Modulation of Triterpenoid Content in Plants |
US20070199090A1 (en) * | 2006-02-22 | 2007-08-23 | Nestor Apuya | Modulating alkaloid biosynthesis |
US20090222957A1 (en) * | 2006-04-07 | 2009-09-03 | Ceres Inc. | Regulatory protein-regulatory region associations related to alkaloid biosynthesis |
CA3029666A1 (en) * | 2016-06-30 | 2018-01-04 | Cold Spring Harbor Laboratory | Control of meiotic crossover in maize |
Family Cites Families (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5004864A (en) * | 1988-11-28 | 1991-04-02 | Iowa State University Research Foundation, Inc. | Dominant amylose-extender mutant of maize |
US6946587B1 (en) * | 1990-01-22 | 2005-09-20 | Dekalb Genetics Corporation | Method for preparing fertile transgenic corn plants |
US5204253A (en) * | 1990-05-29 | 1993-04-20 | E. I. Du Pont De Nemours And Company | Method and apparatus for introducing biological substances into living cells |
US5706603A (en) * | 1990-11-16 | 1998-01-13 | E. I. Du Pont De Nemours And Company | Production method for corn with enhanced quality grain traits |
US5512466A (en) * | 1990-12-26 | 1996-04-30 | Monsanto Company | Control of fruit ripening and senescence in plants |
US5773691A (en) * | 1992-03-19 | 1998-06-30 | E. I. Du Pont De Nemours And Company | Chimeric genes and methods for increasing the lysine and threonine content of the seeds of plants |
WO1995029246A1 (en) * | 1994-04-21 | 1995-11-02 | Zeneca Limited | Plant gene specifying acetyl coenzyme a carboxylase and transformed plants containing same |
AU7443596A (en) * | 1995-10-13 | 1997-04-30 | Purdue Research Foundation | Improvement of fruit quality by inhibiting production of lipoxygenase in fruits |
DE19608918A1 (de) * | 1996-03-07 | 1997-09-11 | Planttec Biotechnologie Gmbh | Nucleinsäuremoleküle, die neue Debranching-Enzyme aus Mais codieren |
WO1998004725A1 (en) * | 1996-07-31 | 1998-02-05 | Yale University | Methods for altering the rate of plant development and plants obtained therefrom |
US6011200A (en) * | 1997-07-30 | 2000-01-04 | Yale University | Methods for altering the rate of plant development and plants obtained therefrom |
US6429356B1 (en) * | 1996-08-09 | 2002-08-06 | Calgene Llc | Methods for producing carotenoid compounds, and specialty oils in plant seeds |
US6329567B1 (en) * | 1996-08-20 | 2001-12-11 | The Regents Of The University Of California | Methods for improving seeds |
AUPP249298A0 (en) * | 1998-03-20 | 1998-04-23 | Ag-Gene Australia Limited | Synthetic genes and genetic constructs comprising same I |
US6320106B1 (en) * | 1998-10-29 | 2001-11-20 | Pioneer Hi-Bred International, Inc. | Maize synthetic population PH9K0 |
GB9914210D0 (en) * | 1999-06-17 | 1999-08-18 | Danisco | Promoter |
US6538182B1 (en) * | 1999-07-06 | 2003-03-25 | Senesco, Inc. | DNA encoding a plant deoxyhypusine synthase, a plant eukaryotic initiation factor 5A, transgenic plants and a method for controlling senescence programmed and cell death in plants |
GB9918061D0 (en) * | 1999-07-30 | 1999-10-06 | Univ Bath | Modified plants |
DE19937643A1 (de) * | 1999-08-12 | 2001-02-22 | Aventis Cropscience Gmbh | Transgene Zellen und Pflanzen mit veränderter Aktivität des GBSSI- und des BE-Proteins |
GB9925459D0 (en) * | 1999-10-27 | 1999-12-29 | Plant Bioscience Ltd | Gene silencing |
WO2001053470A2 (en) * | 2000-01-24 | 2001-07-26 | Wisconsin Alumni Research Foundation | Nucleic acid and amino acid sequences encoding a de novo dna methyltransferase |
US6476296B1 (en) * | 2000-04-21 | 2002-11-05 | The Regents Of The University Of California | Nucleic acids that control seed and fruit development in plants |
EP1456379A4 (en) * | 2001-06-22 | 2006-06-07 | Univ California | COMPOSITIONS AND METHODS FOR MODULATING PLANT DEVELOPMENT |
CN1643147B (zh) * | 2002-03-14 | 2010-04-14 | 联邦科学和工业研究组织 | 监测和调节基因沉默的方法和工具 |
US20040053876A1 (en) * | 2002-03-26 | 2004-03-18 | The Regents Of The University Of Michigan | siRNAs and uses therof |
US7169915B2 (en) * | 2003-10-14 | 2007-01-30 | Ceres, Inc. | Promoter, promoter control elements, and combinations, and uses thereof |
US7402667B2 (en) * | 2003-10-14 | 2008-07-22 | Ceres, Inc. | Promoter, promoter control elements, and combinations, and uses thereof |
JP4312012B2 (ja) * | 2003-09-12 | 2009-08-12 | トヨタ自動車株式会社 | パラコート(登録商標)耐性遺伝子並びに維管束及びトライコーム特異的プロモーター |
-
2004
- 2004-10-14 BR BRPI0415431-2A patent/BRPI0415431A/pt not_active IP Right Cessation
- 2004-10-14 US US10/966,482 patent/US20050081261A1/en not_active Abandoned
- 2004-10-14 EP EP04795235A patent/EP1687438A4/en not_active Withdrawn
- 2004-10-14 WO PCT/US2004/034048 patent/WO2005038040A2/en active Search and Examination
- 2004-10-14 CN CNA2004800373016A patent/CN101031650A/zh active Pending
- 2004-10-14 AU AU2004282575A patent/AU2004282575A1/en not_active Abandoned
- 2004-10-14 CA CA002542451A patent/CA2542451A1/en not_active Abandoned
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108882691A (zh) * | 2015-11-18 | 2018-11-23 | 联邦科学技术研究组织 | 具有增厚的糊粉层的水稻谷粒 |
CN108882691B (zh) * | 2015-11-18 | 2023-07-07 | 联邦科学技术研究组织 | 具有增厚的糊粉层的水稻谷粒 |
CN109288117A (zh) * | 2018-10-22 | 2019-02-01 | 福建中烟工业有限责任公司 | 一种组合物及其在卷烟中的应用 |
CN109288117B (zh) * | 2018-10-22 | 2022-06-17 | 福建中烟工业有限责任公司 | 一种组合物及其在卷烟中的应用 |
Also Published As
Publication number | Publication date |
---|---|
EP1687438A2 (en) | 2006-08-09 |
AU2004282575A1 (en) | 2005-04-28 |
WO2005038040A3 (en) | 2006-11-09 |
AU2004282575A2 (en) | 2005-04-28 |
WO2005038040A2 (en) | 2005-04-28 |
BRPI0415431A (pt) | 2006-12-05 |
US20050081261A1 (en) | 2005-04-14 |
CA2542451A1 (en) | 2005-04-28 |
EP1687438A4 (en) | 2008-05-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1245516C (zh) | 编码乙酰乳酸合酶基因的基因 | |
CN1222621C (zh) | 新的植物质体启动子序列 | |
CN1268749C (zh) | 用于改变植物中酶和乙酰辅酶a水平的材料和方法 | |
CN1257978C (zh) | 编码小麦中参与淀粉合成的酶的核酸分子 | |
CN103756972B (zh) | 来自半片藻属的脂肪酸脱饱和酶的利用 | |
CN1946284A (zh) | 具有改良的生长特性的植物及其制备方法 | |
CN1541270A (zh) | 抗除草剂植物 | |
CN1871353A (zh) | 来自报春的脂肪酸去饱和酶 | |
CN1032030A (zh) | 具有草甘膦耐性的5-烯醇丙酮酰-3-磷酸莽草酸合酶 | |
CN1798843A (zh) | 植物中细胞分裂素活性的调节 | |
CN1671850A (zh) | 二酰甘油酰基转移酶核酸序列及相关产物 | |
CN1285875A (zh) | 突变的羟基苯丙酮酸双氧化酶、基dna序列和含该基因且耐除草剂的植物的分离 | |
CN1836045A (zh) | 用于早期种子发育的新型植物启动子 | |
CN1930293A (zh) | 具有降低的饱和脂肪酸水平的转基因植物及其制备方法 | |
CN101031650A (zh) | 改变种子表型的方法和组合物 | |
CN113930442A (zh) | 产生pufa的材料和方法及含有pufa的组合物 | |
CN1753992A (zh) | 种子中蛋白质含量降低的植物及其制备方法和利用方法 | |
CN1252097A (zh) | 转基因植物选择方法 | |
CN1247569A (zh) | 改变了甾醇生物合成途径的转基因植物 | |
CN1810977A (zh) | 增强植物和真菌中的2-乙酰基-1-吡咯的合成的核酸 | |
CN101080492A (zh) | 用于调节植物中油质蛋白表达的方法 | |
CN1852985A (zh) | 产生精细化学品的方法 | |
CN1582335A (zh) | 水稻转座子基因 | |
CN1246464C (zh) | 可增强植物对渗透压的抵抗力的新转录因子 | |
CN101061228A (zh) | 异戊烯基转移酶序列及其使用方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Open date: 20070905 |