CN114539371B - 小麦白粉病抗性相关蛋白MlWE18和MlIW172及其应用 - Google Patents
小麦白粉病抗性相关蛋白MlWE18和MlIW172及其应用 Download PDFInfo
- Publication number
- CN114539371B CN114539371B CN202011350857.8A CN202011350857A CN114539371B CN 114539371 B CN114539371 B CN 114539371B CN 202011350857 A CN202011350857 A CN 202011350857A CN 114539371 B CN114539371 B CN 114539371B
- Authority
- CN
- China
- Prior art keywords
- leu
- ser
- glu
- protein
- gly
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 159
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 79
- 235000021307 Triticum Nutrition 0.000 title claims abstract description 75
- 241000221785 Erysiphales Species 0.000 title claims abstract description 43
- 241000209140 Triticum Species 0.000 title claims abstract 5
- 241000196324 Embryophyta Species 0.000 claims abstract description 101
- 201000010099 disease Diseases 0.000 claims abstract description 53
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims abstract description 53
- 238000000034 method Methods 0.000 claims abstract description 20
- 239000012620 biological material Substances 0.000 claims abstract description 19
- 229920001184 polypeptide Polymers 0.000 claims abstract description 8
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 8
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 8
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 5
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 5
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 5
- 230000009261 transgenic effect Effects 0.000 claims description 30
- 230000014509 gene expression Effects 0.000 claims description 29
- 108020004414 DNA Proteins 0.000 claims description 23
- 208000035240 Disease Resistance Diseases 0.000 claims description 21
- 108020004707 nucleic acids Proteins 0.000 claims description 19
- 102000039446 nucleic acids Human genes 0.000 claims description 19
- 150000007523 nucleic acids Chemical class 0.000 claims description 19
- 239000013598 vector Substances 0.000 claims description 10
- 244000005700 microbiome Species 0.000 claims description 7
- 102000053602 DNA Human genes 0.000 claims description 6
- 210000000056 organ Anatomy 0.000 claims description 5
- 230000001105 regulatory effect Effects 0.000 claims description 4
- 230000002401 inhibitory effect Effects 0.000 claims description 2
- 230000001276 controlling effect Effects 0.000 claims 1
- 244000098338 Triticum aestivum Species 0.000 description 79
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 14
- 108010034529 leucyl-lysine Proteins 0.000 description 14
- 239000013604 expression vector Substances 0.000 description 12
- 150000001413 amino acids Chemical group 0.000 description 11
- 230000002068 genetic effect Effects 0.000 description 9
- XEPSCVXTCUUHDT-AVGNSLFASA-N Arg-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N XEPSCVXTCUUHDT-AVGNSLFASA-N 0.000 description 8
- 239000003795 chemical substances by application Substances 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 108010050848 glycylleucine Proteins 0.000 description 8
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 6
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 6
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- 108010092114 histidylphenylalanine Proteins 0.000 description 6
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 6
- 108010057821 leucylproline Proteins 0.000 description 6
- 108010056582 methionylglutamic acid Proteins 0.000 description 6
- 238000003259 recombinant expression Methods 0.000 description 6
- 240000008056 Triticum dicoccoides Species 0.000 description 5
- 235000018680 Triticum dicoccoides Nutrition 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 239000012634 fragment Substances 0.000 description 5
- 230000035772 mutation Effects 0.000 description 5
- 238000011160 research Methods 0.000 description 5
- 241000894007 species Species 0.000 description 5
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 4
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 4
- KJGNDQCYBNBXDA-GUBZILKMSA-N Arg-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N KJGNDQCYBNBXDA-GUBZILKMSA-N 0.000 description 4
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 4
- BSFFNUBDVYTDMV-WHFBIAKZSA-N Cys-Gly-Asn Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BSFFNUBDVYTDMV-WHFBIAKZSA-N 0.000 description 4
- SBDVXRYCOIEYNV-YUMQZZPRSA-N Cys-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N SBDVXRYCOIEYNV-YUMQZZPRSA-N 0.000 description 4
- HKALUUKHYNEDRS-GUBZILKMSA-N Cys-Leu-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HKALUUKHYNEDRS-GUBZILKMSA-N 0.000 description 4
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 4
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 4
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 4
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 4
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 4
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 4
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 4
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 4
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 4
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 4
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 4
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 4
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 4
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 4
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 4
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 4
- 108091081024 Start codon Proteins 0.000 description 4
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 4
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 4
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 4
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 4
- 108010044940 alanylglutamine Proteins 0.000 description 4
- 108010080488 arginyl-arginyl-leucine Proteins 0.000 description 4
- 108010077245 asparaginyl-proline Proteins 0.000 description 4
- 238000009395 breeding Methods 0.000 description 4
- 230000001488 breeding effect Effects 0.000 description 4
- 108010078144 glutaminyl-glycine Proteins 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 230000001939 inductive effect Effects 0.000 description 4
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 4
- 108010026333 seryl-proline Proteins 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 238000012795 verification Methods 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- 241000589158 Agrobacterium Species 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- 241000209510 Liliopsida Species 0.000 description 3
- 108091000080 Phosphotransferase Proteins 0.000 description 3
- 239000004480 active ingredient Substances 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 239000003623 enhancer Substances 0.000 description 3
- 241001233957 eudicotyledons Species 0.000 description 3
- 230000001976 improved effect Effects 0.000 description 3
- 238000013507 mapping Methods 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 230000017074 necrotic cell death Effects 0.000 description 3
- 108010058731 nopaline synthase Proteins 0.000 description 3
- 102000020233 phosphotransferase Human genes 0.000 description 3
- 230000008488 polyadenylation Effects 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- LWTDZKXXJRRKDG-KXBFYZLASA-N (-)-phaseollin Chemical compound C1OC2=CC(O)=CC=C2[C@H]2[C@@H]1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-KXBFYZLASA-N 0.000 description 2
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 2
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- IYCZBJXFSZSHPN-DLOVCJGASA-N Ala-Cys-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IYCZBJXFSZSHPN-DLOVCJGASA-N 0.000 description 2
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 2
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 2
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 2
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 2
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 2
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 2
- NJWJSLCQEDMGNC-MBLNEYKQSA-N Ala-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N)O NJWJSLCQEDMGNC-MBLNEYKQSA-N 0.000 description 2
- GRIFPSOFWFIICX-GOPGUHFVSA-N Ala-His-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GRIFPSOFWFIICX-GOPGUHFVSA-N 0.000 description 2
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 2
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 2
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- ZKEHTYWGPMMGBC-XUXIUFHCSA-N Ala-Leu-Leu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O ZKEHTYWGPMMGBC-XUXIUFHCSA-N 0.000 description 2
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 2
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 2
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 2
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 2
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 2
- RWDVGVPHEWOZMO-GUBZILKMSA-N Arg-Cys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCNC(N)=N)C(O)=O RWDVGVPHEWOZMO-GUBZILKMSA-N 0.000 description 2
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 2
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 2
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 2
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 2
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- JWCCFNZJIRZUCL-AVGNSLFASA-N Arg-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N JWCCFNZJIRZUCL-AVGNSLFASA-N 0.000 description 2
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 2
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 2
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 2
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 2
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 2
- VXLBDJWTONZHJN-YUMQZZPRSA-N Asn-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N VXLBDJWTONZHJN-YUMQZZPRSA-N 0.000 description 2
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 2
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 2
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 2
- LWXJVHTUEDHDLG-XUXIUFHCSA-N Asn-Leu-Leu-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LWXJVHTUEDHDLG-XUXIUFHCSA-N 0.000 description 2
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 2
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 2
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 2
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 2
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 2
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 2
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 2
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 2
- VHWNKSJHQFZJTH-FXQIFTODSA-N Asp-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N VHWNKSJHQFZJTH-FXQIFTODSA-N 0.000 description 2
- KVPHTGVUMJGMCX-BIIVOSGPSA-N Asp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)C(=O)O KVPHTGVUMJGMCX-BIIVOSGPSA-N 0.000 description 2
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 2
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 2
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 2
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 2
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 2
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 2
- KPNUCOPMVSGRCR-DCAQKATOSA-N Asp-His-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KPNUCOPMVSGRCR-DCAQKATOSA-N 0.000 description 2
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 2
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 2
- OEDJQRXNDRUGEU-SRVKXCTJSA-N Asp-Leu-His Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O OEDJQRXNDRUGEU-SRVKXCTJSA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 2
- WOKXEQLPBLLWHC-IHRRRGAJSA-N Asp-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 WOKXEQLPBLLWHC-IHRRRGAJSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- 241000895502 Blumeria graminis f. sp. tritici Species 0.000 description 2
- 241000701489 Cauliflower mosaic virus Species 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 2
- DCXGXDGGXVZVMY-GHCJXIJMSA-N Cys-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CS DCXGXDGGXVZVMY-GHCJXIJMSA-N 0.000 description 2
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 description 2
- XIZWKXATMJODQW-KKUMJFAQSA-N Cys-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N XIZWKXATMJODQW-KKUMJFAQSA-N 0.000 description 2
- SSNJZBGOMNLSLA-CIUDSAMLSA-N Cys-Leu-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O SSNJZBGOMNLSLA-CIUDSAMLSA-N 0.000 description 2
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 2
- CAXGCBSRJLADPD-FXQIFTODSA-N Cys-Pro-Asn Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O CAXGCBSRJLADPD-FXQIFTODSA-N 0.000 description 2
- HMWBPUDETPKSSS-DCAQKATOSA-N Cys-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCCN)C(=O)O HMWBPUDETPKSSS-DCAQKATOSA-N 0.000 description 2
- IWVNIQXKTIQXCT-SRVKXCTJSA-N Cys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O IWVNIQXKTIQXCT-SRVKXCTJSA-N 0.000 description 2
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 2
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 2
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 2
- LLRJEFPKIIBGJP-DCAQKATOSA-N Gln-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LLRJEFPKIIBGJP-DCAQKATOSA-N 0.000 description 2
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 2
- GXMBDEGTXHQBAO-NKIYYHGXSA-N Gln-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N)O GXMBDEGTXHQBAO-NKIYYHGXSA-N 0.000 description 2
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 2
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 2
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 2
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 2
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 2
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 2
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 2
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 2
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 2
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 2
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 2
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 2
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 2
- RXESHTOTINOODU-JYJNAYRXSA-N Glu-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RXESHTOTINOODU-JYJNAYRXSA-N 0.000 description 2
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 2
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 2
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 2
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 2
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 2
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 2
- JPXNYFOHTHSREU-UWVGGRQHSA-N Gly-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN JPXNYFOHTHSREU-UWVGGRQHSA-N 0.000 description 2
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 2
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 2
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 2
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 2
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 2
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 2
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 2
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 2
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 2
- PCPOYRCAHPJXII-UWVGGRQHSA-N Gly-Lys-Met Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PCPOYRCAHPJXII-UWVGGRQHSA-N 0.000 description 2
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 2
- WSWWTQYHFCBKBT-DVJZZOLTSA-N Gly-Thr-Trp Chemical compound C[C@@H](O)[C@H](NC(=O)CN)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O WSWWTQYHFCBKBT-DVJZZOLTSA-N 0.000 description 2
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 2
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 2
- 244000068988 Glycine max Species 0.000 description 2
- 235000010469 Glycine max Nutrition 0.000 description 2
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 2
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 2
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 2
- PMWSGVRIMIFXQH-KKUMJFAQSA-N His-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 PMWSGVRIMIFXQH-KKUMJFAQSA-N 0.000 description 2
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 2
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 2
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 2
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 2
- SWTSERYNZQMPBI-WDSOQIARSA-N His-Trp-Met Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCSC)C(O)=O)C1=CN=CN1 SWTSERYNZQMPBI-WDSOQIARSA-N 0.000 description 2
- MRVZCDSYLJXKKX-ACRUOGEOSA-N His-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CN=CN3)N MRVZCDSYLJXKKX-ACRUOGEOSA-N 0.000 description 2
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 2
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 2
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 2
- LVQDUPQUJZWKSU-PYJNHQTQSA-N Ile-Arg-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LVQDUPQUJZWKSU-PYJNHQTQSA-N 0.000 description 2
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 2
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 2
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 2
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 2
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 2
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 2
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 2
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 2
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 2
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 2
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 2
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 2
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 2
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 2
- BKPPWVSPSIUXHZ-OSUNSFLBSA-N Ile-Met-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N BKPPWVSPSIUXHZ-OSUNSFLBSA-N 0.000 description 2
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 2
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 2
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 2
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 2
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 2
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 2
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 2
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 2
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 2
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 2
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 2
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- KWURTLAFFDOTEQ-GUBZILKMSA-N Leu-Cys-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KWURTLAFFDOTEQ-GUBZILKMSA-N 0.000 description 2
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 2
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 2
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 2
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 2
- WMTOVWLLDGQGCV-GUBZILKMSA-N Leu-Glu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WMTOVWLLDGQGCV-GUBZILKMSA-N 0.000 description 2
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 2
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 2
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 2
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 2
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 2
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 2
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 2
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 2
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 2
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 2
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 2
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 2
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 2
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 2
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 2
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 2
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 2
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 2
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 2
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 2
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 2
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 2
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 2
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 2
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 2
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 2
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 2
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 2
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 2
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 2
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 2
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 2
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 2
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 2
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 2
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 2
- OEYKVQKYCHATHO-SZMVWBNQSA-N Lys-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N OEYKVQKYCHATHO-SZMVWBNQSA-N 0.000 description 2
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 2
- DZTDEZSHBVRUCQ-FXQIFTODSA-N Met-Asp-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N DZTDEZSHBVRUCQ-FXQIFTODSA-N 0.000 description 2
- CRGKLOXHKICQOL-GARJFASQSA-N Met-Gln-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N CRGKLOXHKICQOL-GARJFASQSA-N 0.000 description 2
- RZJOHSFAEZBWLK-CIUDSAMLSA-N Met-Gln-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N RZJOHSFAEZBWLK-CIUDSAMLSA-N 0.000 description 2
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 2
- SMVTWPOATVIXTN-NAKRPEOUSA-N Met-Ser-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SMVTWPOATVIXTN-NAKRPEOUSA-N 0.000 description 2
- RMLWDZINJUDMEB-IHRRRGAJSA-N Met-Tyr-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RMLWDZINJUDMEB-IHRRRGAJSA-N 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 2
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 2
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 2
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 2
- QEPZQAPZKIPVDV-KKUMJFAQSA-N Phe-Cys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N QEPZQAPZKIPVDV-KKUMJFAQSA-N 0.000 description 2
- ABQFNJAFONNUTH-FHWLQOOXSA-N Phe-Gln-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N ABQFNJAFONNUTH-FHWLQOOXSA-N 0.000 description 2
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 2
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 2
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 2
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 2
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 2
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 2
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 2
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 2
- YVXPUUOTMVBKDO-IHRRRGAJSA-N Phe-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CS)C(=O)O YVXPUUOTMVBKDO-IHRRRGAJSA-N 0.000 description 2
- BSJCSHIAMSGQGN-BVSLBCMMSA-N Phe-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O BSJCSHIAMSGQGN-BVSLBCMMSA-N 0.000 description 2
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 2
- YDUGVDGFKNXFPL-IXOXFDKPSA-N Phe-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YDUGVDGFKNXFPL-IXOXFDKPSA-N 0.000 description 2
- APECKGGXAXNFLL-RNXOBYDBSA-N Phe-Trp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 APECKGGXAXNFLL-RNXOBYDBSA-N 0.000 description 2
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 2
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 2
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 2
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 2
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 2
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 2
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 2
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 2
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 2
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 2
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 2
- ANESFYPBAJPYNJ-SDDRHHMPSA-N Pro-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ANESFYPBAJPYNJ-SDDRHHMPSA-N 0.000 description 2
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 2
- XSXABUHLKPUVLX-JYJNAYRXSA-N Pro-Ser-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O XSXABUHLKPUVLX-JYJNAYRXSA-N 0.000 description 2
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 2
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 2
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 2
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 2
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 2
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 2
- INCNPLPRPOYTJI-JBDRJPRFSA-N Ser-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N INCNPLPRPOYTJI-JBDRJPRFSA-N 0.000 description 2
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 2
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 2
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 2
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 2
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 2
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 2
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 2
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 2
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 2
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 2
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 2
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- NIOYDASGXWLHEZ-CIUDSAMLSA-N Ser-Met-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOYDASGXWLHEZ-CIUDSAMLSA-N 0.000 description 2
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 2
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 2
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 2
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 2
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 2
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- 240000003768 Solanum lycopersicum Species 0.000 description 2
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 2
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 2
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 2
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 2
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 2
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 2
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 2
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 2
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 2
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 2
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 2
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 2
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 2
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 2
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 2
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 2
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 2
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 2
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- DVIIYMVCSUQOJG-QEJZJMRPSA-N Trp-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DVIIYMVCSUQOJG-QEJZJMRPSA-N 0.000 description 2
- NXQAOORHSYJRGH-AAEUAGOBSA-N Trp-Gly-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 NXQAOORHSYJRGH-AAEUAGOBSA-N 0.000 description 2
- CFMGQWYCEJDTDG-XIRDDKMYSA-N Trp-Lys-Cys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(O)=O)=CNC2=C1 CFMGQWYCEJDTDG-XIRDDKMYSA-N 0.000 description 2
- IKUMWSDCGQVGHC-UMPQAUOISA-N Trp-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O IKUMWSDCGQVGHC-UMPQAUOISA-N 0.000 description 2
- XQMGDVVKFRLQKH-BBRMVZONSA-N Trp-Val-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O)=CNC2=C1 XQMGDVVKFRLQKH-BBRMVZONSA-N 0.000 description 2
- RWTFCAMQLFNPTK-UMPQAUOISA-N Trp-Val-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 RWTFCAMQLFNPTK-UMPQAUOISA-N 0.000 description 2
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 2
- OEVJGIHPQOXYFE-SRVKXCTJSA-N Tyr-Asn-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OEVJGIHPQOXYFE-SRVKXCTJSA-N 0.000 description 2
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 2
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 2
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 2
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 2
- CWVHKVVKAQIJKY-ACRUOGEOSA-N Tyr-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N CWVHKVVKAQIJKY-ACRUOGEOSA-N 0.000 description 2
- PLXQRTXVLZUNMU-RNXOBYDBSA-N Tyr-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC4=CC=C(C=C4)O)N PLXQRTXVLZUNMU-RNXOBYDBSA-N 0.000 description 2
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 2
- UUJHRSTVQCFDPA-UFYCRDLUSA-N Tyr-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 UUJHRSTVQCFDPA-UFYCRDLUSA-N 0.000 description 2
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 2
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 2
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 2
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 2
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 2
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 2
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 2
- LGXUZJIQCGXKGZ-QXEWZRGKSA-N Val-Pro-Asn Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N LGXUZJIQCGXKGZ-QXEWZRGKSA-N 0.000 description 2
- KJFBXCFOPAKPTM-BZSNNMDCSA-N Val-Trp-Val Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 KJFBXCFOPAKPTM-BZSNNMDCSA-N 0.000 description 2
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- UELITFHSCLAHKR-UHFFFAOYSA-N acibenzolar-S-methyl Chemical compound CSC(=O)C1=CC=CC2=C1SN=N2 UELITFHSCLAHKR-UHFFFAOYSA-N 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 2
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 2
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000012255 expression quantity analysis Methods 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 2
- 108010043293 glycyl-prolyl-glycyl-glycine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010081551 glycylphenylalanine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 230000002363 herbicidal effect Effects 0.000 description 2
- 239000004009 herbicide Substances 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 239000004615 ingredient Substances 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 238000011081 inoculation Methods 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- ZNJFBWYDHIGLCU-HWKXXFMVSA-N jasmonic acid Chemical compound CC\C=C/C[C@@H]1[C@@H](CC(O)=O)CCC1=O ZNJFBWYDHIGLCU-HWKXXFMVSA-N 0.000 description 2
- 230000003902 lesion Effects 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 2
- 108010053062 lysyl-arginyl-phenylalanyl-lysine Proteins 0.000 description 2
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 108010034507 methionyltryptophan Proteins 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 231100000350 mutagenesis Toxicity 0.000 description 2
- 239000013642 negative control Substances 0.000 description 2
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- YGSDEFSMJLZEOE-UHFFFAOYSA-N salicylic acid Chemical compound OC(=O)C1=CC=CC=C1O YGSDEFSMJLZEOE-UHFFFAOYSA-N 0.000 description 2
- 230000035807 sensation Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 108700004896 tripeptide FEG Proteins 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- 101150078635 18 gene Proteins 0.000 description 1
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 1
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000219193 Brassicaceae Species 0.000 description 1
- 101001033883 Cenchritis muricatus Protease inhibitor 2 Proteins 0.000 description 1
- 241000195493 Cryptophyta Species 0.000 description 1
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 1
- LMKYZBGVKHTLTN-NKWVEPMBSA-N D-nopaline Chemical compound NC(=N)NCCC[C@@H](C(O)=O)N[C@@H](C(O)=O)CCC(O)=O LMKYZBGVKHTLTN-NKWVEPMBSA-N 0.000 description 1
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 1
- 101150074155 DHFR gene Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 101150111720 EPSPS gene Proteins 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 1
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 1
- 239000005562 Glyphosate Substances 0.000 description 1
- HVLSXIKZNLPZJJ-TXZCQADKSA-N HA peptide Chemical compound C([C@@H](C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HVLSXIKZNLPZJJ-TXZCQADKSA-N 0.000 description 1
- 206010020649 Hyperkeratosis Diseases 0.000 description 1
- 108020005350 Initiator Codon Proteins 0.000 description 1
- 101100288095 Klebsiella pneumoniae neo gene Proteins 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- 102000002704 Leucyl aminopeptidase Human genes 0.000 description 1
- 108010004098 Leucyl aminopeptidase Proteins 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 241000710118 Maize chlorotic mottle virus Species 0.000 description 1
- 108091022912 Mannose-6-Phosphate Isomerase Proteins 0.000 description 1
- 208000031888 Mycoses Diseases 0.000 description 1
- 101710202365 Napin Proteins 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 101710089395 Oleosin Proteins 0.000 description 1
- 101710163504 Phaseolin Proteins 0.000 description 1
- IAJOBQBIJHVGMQ-UHFFFAOYSA-N Phosphinothricin Natural products CP(O)(=O)CCC(N)C(O)=O IAJOBQBIJHVGMQ-UHFFFAOYSA-N 0.000 description 1
- 108700001094 Plant Genes Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 108010016634 Seed Storage Proteins Proteins 0.000 description 1
- 244000062793 Sorghum vulgare Species 0.000 description 1
- 208000035199 Tetraploidy Diseases 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 240000002805 Triticum turgidum Species 0.000 description 1
- 235000007247 Triticum turgidum Nutrition 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 230000000172 allergic effect Effects 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 208000010668 atopic eczema Diseases 0.000 description 1
- 101150103518 bar gene Proteins 0.000 description 1
- 238000010352 biotechnological method Methods 0.000 description 1
- 229940027138 cambia Drugs 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 208000022602 disease susceptibility Diseases 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000003205 genotyping method Methods 0.000 description 1
- IAJOBQBIJHVGMQ-BYPYZUCNSA-N glufosinate-P Chemical compound CP(O)(=O)CC[C@H](N)C(O)=O IAJOBQBIJHVGMQ-BYPYZUCNSA-N 0.000 description 1
- XDDAORKBJWWYJS-UHFFFAOYSA-N glyphosate Chemical compound OC(=O)CNCP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-N 0.000 description 1
- 229940097068 glyphosate Drugs 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 101150054900 gus gene Proteins 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 101150029559 hph gene Proteins 0.000 description 1
- 238000005286 illumination Methods 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- ZNJFBWYDHIGLCU-UHFFFAOYSA-N jasmonic acid Natural products CCC=CCC1C(CC(O)=O)CCC1=O ZNJFBWYDHIGLCU-UHFFFAOYSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 235000019713 millet Nutrition 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 230000031787 nutrient reservoir activity Effects 0.000 description 1
- FJKROLUGYXJWQN-UHFFFAOYSA-N papa-hydroxy-benzoic acid Natural products OC(=O)C1=CC=C(O)C=C1 FJKROLUGYXJWQN-UHFFFAOYSA-N 0.000 description 1
- 230000003071 parasitic effect Effects 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 230000035515 penetration Effects 0.000 description 1
- LWTDZKXXJRRKDG-UHFFFAOYSA-N phaseollin Natural products C1OC2=CC(O)=CC=C2C2C1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-UHFFFAOYSA-N 0.000 description 1
- 238000003976 plant breeding Methods 0.000 description 1
- 230000037039 plant physiology Effects 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 238000012257 pre-denaturation Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 229960004889 salicylic acid Drugs 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 238000012250 transgenic expression Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8279—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance
- C12N15/8282—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance for fungal resistance
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Biochemistry (AREA)
- Wood Science & Technology (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Botany (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Plant Pathology (AREA)
- Gastroenterology & Hepatology (AREA)
- Microbiology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Medicinal Chemistry (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明公开了抗病蛋白及其相关生物材料与培育抗病植物的方法。该抗病蛋白为如下A1)、A2)或A3)或A4):A1)其氨基酸序列如序列表中SEQ ID NO:3所示;A2)其氨基酸序列如序列表中SEQ ID NO:6所示;A3)来源于小麦与A1)或A2)所示的蛋白质具有90%以上的同一性且与功能相同的蛋白质;A4)在A1)或A2)或A3)的N末端或/和C末端连接蛋白标签得到的融合蛋白质。该抗病蛋白及其编码基因可用于提高植物对白粉病的抗性。
Description
技术领域
本发明涉及生物技术领域中小麦白粉病抗性相关蛋白MlWE18和MlIW172及其应用。
背景技术
小麦白粉病是一种世界性真菌病害,是由专性寄生的布氏白粉菌(Blumeriagraminis f.sp.tritici)引起的,是严重威胁小麦生产的主要病害之一。选育和利用具有持久和广谱抗性的小麦品种,是防治小麦白粉病最为安全、有效和经济的措施。从普通小麦及其近缘种中发掘新的抗病基因、实现抗病基因多样化是小麦育种的重要目标之一。
在小麦属以及小麦近缘种属中已经正式命名的小麦抗白粉病基因位点有68个(McIntosh et al.2020),但其中只有12个基因被克隆。Pm3b是第一个被克隆的小麦抗白粉病基因,其通过图位克隆的方法获得,它编码一个典型的CC-NBS-LRR(CNL)类型的抗病蛋白(Yahiaoui et al.,2004)。
野生二粒小麦是普通六倍体小麦的四倍体祖先,复杂的环境演变以及长期的进化使得野生二粒小麦具有丰富的遗传多样性。目前已经从野生二粒小麦中挖掘到多个抗白粉病基因,但是只有Pm41(Li et al.,2020)被克隆,因此从野生二粒小麦中发掘并克隆小麦的主效抗白粉病基因对提高小麦的产量和品质具有重要意义。
发明内容
本发明所要解决的技术问题是如何调控植物的抗病性(如植物对白粉病的抗性)。
为了解决以上技术问题,本发明提供了一种来源于小麦的抗病蛋白,为如下A1)、A2)或A3)或A4):
A1)其氨基酸序列如序列表中SEQ ID NO:3所示;
A2)其氨基酸序列如序列表中SEQ ID NO:6所示;
A3)来源于小麦与A1)或A2)所示的蛋白质具有90%以上的同一性且与功能相同的蛋白质;
A4)在A1)或A2)或A3)的N末端或/和C末端连接蛋白标签得到的融合蛋白质。
上述蛋白质中,序列表中的SEQ ID NO:3和SEQ ID NO:6均由1454个氨基酸残基组成。
上述蛋白质可人工合成,也可先合成其编码基因,再进行生物表达得到。
上述蛋白质中,所述蛋白标签(protein-tag)是指利用DNA体外重组技术,与目的蛋白一起融合表达的一种多肽或者蛋白,以便于目的蛋白的表达、检测、示踪和/或纯化。所述蛋白标签可为Flag标签、His标签、MBP标签、HA标签、myc标签、GST标签和/或SUMO标签等。
上述蛋白质中,同一性是指氨基酸序列的同一性。可使用国际互联网上的同源性检索站点测定氨基酸序列的同一性,如NCBI主页网站的BLAST网页。例如,可在高级BLAST2.1中,通过使用blastp作为程序,将Expect值设置为10,将所有Filter设置为OFF,使用BLOSUM62作为Matrix,将Gap existence cost,Per residue gap cost和Lambda ratio分别设置为11,1和0.85(缺省值)并进行检索一对氨基酸序列的同一性进行计算,然后即可获得同一性的值(%)。
上述蛋白质中,所述90%以上的同一性可为至少91%、92%、95%、96%、98%、99%或100%的同一性。
与上述抗病蛋白相关的生物材料也属于本发明的保护范围。
本发明所提供的与上述抗病蛋白相关的生物材料,为下述B1)至B5)中的任一种:
B1)编码上述抗病蛋白的核酸分子;
B2)含有B1)所述核酸分子的表达盒;
B3)含有B1)所述核酸分子的重组载体、或含有B1)所述表达盒的重组载体;
B4)含有B1)所述核酸分子的重组微生物、或含有B2)所述表达盒的重组微生物、或含有B3)所述重组载体的重组微生物;
B5)含有B1)所述核酸分子的转基因植物细胞系、转基因植物组织或转基因植物器官。
其中,所述核酸分子可以是DNA,如cDNA、基因组DNA或重组DNA;所述核酸分子也可以是RNA,如mRNA或hnRNA等。
上述生物材料中,B1)所述核酸分子为如下b1)或b2)或b3)或b4):
b1)序列表中SEQ ID NO:1所示DNA分子;
b2)序列表中SEQ ID NO:2所示DNA分子;
b3)序列表中SEQ ID NO:4所示DNA分子;
b4)序列表中SEQ ID NO:5所示DNA分子。
其中,序列表中的SEQ ID NO:2和SEQ ID NO:5均由4365个核苷酸组成,其分别编码序列表中的SEQ ID NO:3和SEQ ID NO:6所示的蛋白质。
上述生物材料中,B2)所述的含有编码上述抗病蛋白的核酸分子的表达盒,是指能够在宿主细胞中表达上述抗病蛋白的DNA,该DNA不但可包括启动编码上述抗病蛋白的基因转录的启动子,还可包括终止编码上述抗病蛋白的基因转录的终止子。进一步,所述表达盒还可包括增强子序列。可用于本发明的启动子包括但不限于:组成型启动子,组织、器官和发育特异的启动子,和诱导型启动子。启动子的例子包括但不限于:花椰菜花叶病毒的组成型启动子35S;来自西红柿的创伤诱导型启动子,亮氨酸氨基肽酶("LAP",Chao等人(1999)Plant Physiology 120:979-992);来自烟草的化学诱导型启动子,发病机理相关1(PR1)(由水杨酸和BTH(苯并噻二唑-7-硫代羟酸S-甲酯)诱导);西红柿蛋白酶抑制剂II启动子(PIN2)或LAP启动子(均可用茉莉酮酸曱酯诱导);热休克启动子(美国专利5,187,267);四环素诱导型启动子(美国专利5,057,422);种子特异性启动子,如谷子种子特异性启动子pF128(CN101063139B(中国专利2007 1 0099169.7)),种子贮存蛋白质特异的启动子(例如,菜豆球蛋白、napin,oleosin和大豆beta conglycin的启动子(Beachy等人(1985)EMBOJ.4:3047-3053))。它们可单独使用或与其它的植物启动子结合使用。此处引用的所有参考文献均全文引用。合适的转录终止子包括但不限于:农杆菌胭脂碱合成酶终止子(NOS终止子)、花椰菜花叶病毒CaMV 35S终止子、tml终止子、豌豆rbcS E9终止子和胭脂氨酸和章鱼氨酸合酶终止子(参见,例如:Odell等人(I985)Nature 313:810;Rosenberg等人(1987)Gene,56:125;Guerineau等人(1991)Mol.Gen.Genet,262:141;Proudfoot(1991)Cell,64:671;Sanfacon等人Genes Dev.,5:141;Mogen等人(1990)Plant Cell,2:1261;Munroe等人(1990)Gene,91:151;Ballad等人(1989)Nucleic Acids Res.17:7891;Joshi等人(1987)Nucleic Acid Res.,15:9627)。
可用现有的植物表达载体构建含有编码上述抗病蛋白的基因的表达盒的重组表达载体。所述植物表达载体包括双元农杆菌载体和可用于植物微弹轰击的载体等。如pAHC25、pWMB123、pBin438、pCAMBIA1302、pCAMBIA2301、pCAMBIA1301、pCAMBIA1300、pBI121、pCAMBIA1391-Xa或pCAMBIA1391-Xb(CAMBIA公司)等。所述植物表达载体还可包含外源基因的3’端非翻译区域,即包含聚腺苷酸信号和任何其它参与mRNA加工或基因表达的DNA片段。所述聚腺苷酸信号可引导聚腺苷酸加入到mRNA前体的3’端,如农杆菌冠瘿瘤诱导(Ti)质粒基因(如胭脂碱合成酶基因Nos)、植物基因(如大豆贮存蛋白基因)3’端转录的非翻译区均具有类似功能。使用本发明的基因构建植物表达载体时,还可使用增强子,包括翻译增强子或转录增强子,这些增强子区域可以是ATG起始密码子或邻接区域起始密码子等,但必需与编码序列的阅读框相同,以保证整个序列的正确翻译。所述翻译控制信号和起始密码子的来源是广泛的,可以是天然的,也可以是合成的。翻译起始区域可以来自转录起始区域或结构基因。为了便于对转基因植物细胞或植物进行鉴定及筛选,可对所用植物表达载体进行加工,如加入可在植物中表达的编码可产生颜色变化的酶或发光化合物的基因(GUS基因、萤光素酶基因等)、抗生素的标记基因(如赋予对卡那霉素和相关抗生素抗性的nptII基因,赋予对除草剂膦丝菌素抗性的bar基因,赋予对抗生素潮霉素抗性的hph基因,和赋予对methatrexate抗性的dhfr基因,赋予对草甘磷抗性的EPSPS基因)或是抗化学试剂标记基因等(如抗除莠剂基因)、提供代谢甘露糖能力的甘露糖-6-磷酸异构酶基因。从转基因植物的安全性考虑,可不加任何选择性标记基因,直接以逆境筛选转化植株。
上述生物材料中,所述重组微生物具体可为酵母,细菌,藻和真菌。
为了解决上述技术问题,本发明还提供了植物抗病剂。
本发明所提供的植物抗病剂含有上述抗病蛋白或/和上述抗病蛋白相关的生物材料。
上述植物抗病剂的活性成分可为上述抗病蛋白或/和上述抗病蛋白相关的生物材料,上述植物抗病剂的活性成分还可含有其他生物成分或/和非生物成分,上述植物抗病剂的其他活性成分本领域技术人员可根据植物的抗病效果确定。
上述植物抗病剂中,所述植物抗病剂可为抗植物白粉病的药剂。
上述蛋白质、或上述生物材料的下述P1-P5中的任一种应用也属于本发明的保护范围:
P1、权利要求1所述蛋白质、或权利要求2或3所述生物材料在调控植物抗病性中的应用;
P2、权利要求1所述蛋白质、或权利要求2或3所述生物材料在制备提高植物抗病性的产品中的应用;
P3、权利要求1所述蛋白质、或权利要求2或3所述生物材料在培育抗病植物中的应用;
P4、权利要求1所述蛋白质、或权利要求2或3所述生物材料在制备植物抗病产品中的应用;
P5、权利要求1所述蛋白质、或权利要求2或3所述生物材料在植物育种中的应用。
为了解决上述技术问题,本发明还提供了一种培育抗病植物的方法。
本发明所提供的培育抗病植物的方法,包括提高目的植物中上述抗病蛋白或其编码基因的表达量,得到抗病植物;所述抗病植物的抗病性高于所述目的植物的抗病性。
上述方法中,所述提高目的植物中上述抗病蛋白或其编码基因的表达量可通过将所述蛋白的编码基因导入所述目的植物中实现。
上述方法中,其中所述编码基因可先进行如下修饰,再导入目的植物中,以达到更好的表达效果:
1)修饰邻近起始甲硫氨酸的基因序列,以使翻译有效起始;例如,利用在植物中已知的有效的序列进行修饰;
2)与各种植物表达的启动子连接,以利于其在植物中的表达;所述启动子可包括组成型、诱导型、时序调节、发育调节、化学调节、组织优选和组织特异性启动子;启动子的选择将随着表达时间和空间需要而变化,而且也取决于靶物种;例如组织或器官的特异性表达启动子,根据需要受体在发育的什么时期而定;尽管证明了来源于双子叶植物的许多启动子在单子叶植物中是可起作用的,反之亦然,但是理想地,选择双子叶植物启动子用于双子叶植物中的表达,单子叶植物的启动子用于单子叶植物中的表达;
3)与适合的转录终止子连接,也可以提高本发明基因的表达效率;例如来源于CaMV的tml,来源于rbcS的E9;任何已知在植物中起作用的可得到的终止子都可以与本发明基因进行连接;
4)引入增强子序列,如内含子序列(例如来源于Adhl和bronzel)和病毒前导序列(例如来源于TMV,MCMV和AMV)。
所述蛋白质的编码基因可通过使用Ti质粒,植物病毒栽体,直接DNA转化,微注射,电穿孔等常规生物技术方法导入植物细胞(Weissbach,1998,Method for PlantMolecular Biology VIII,Academy Press,New York,pp.411-463;Geiserson and Corey,1998,Plant Molecular Biology(2nd Edition)。
上述方法中,所述抗病植物可为转基因植物,也可为通过杂交等常规育种技术获得的植物。
为了解决上述技术问题,本发明还提供了一种降低植物抗病性的方法。
本发明所提供的降低植物抗病性的方法,包括抑制或降低目的植物中上述抗病蛋白编码基因的表达。
上述方法中,所述抑制或降低目的植物中上述抗病蛋白编码基因的表达课通过EMS诱变所述目的植物实现。
上述方法中,所述转基因植物理解为不仅包含第一代到第二代转基因植物,也包括其子代。对于转基因植物,可以在该物种中繁殖该基因,也可用常规育种技术将该基因转移进入相同物种的其它品种,特别包括商业品种中。所述转基因植物包括种子、愈伤组织、完整植株和细胞。
上文中,所述植物均为单子叶植物或双子叶植物。所述单子叶植物具体可为小麦,双子叶植物具体为十字花科植物,如拟南芥。
上文中,所述抗病性可为抗白粉病。
上文中,所述白粉病可由禾布氏白粉菌(Blumeria graminis f.sp.tritici)引起。
本发明的发明人从小麦品系3D249和野生二粒小麦IW172中分别克隆获得了MlWE18和MlIW172基因。并通过EMS诱变技术使野生二粒小麦IW172中的MlIW172基因发生突变,得到MlIW172突变体。MlIW172突变体与野生二粒小麦IW172相比,其对小麦白粉病的抗性显著降低。而将MlWE18和MlIW172基因分别转入高感白粉病小麦品种Fielder中,得到表达MlWE18或MlIW172基因的转基因小麦。表达MlWE18或MlIW172基因的转基因小麦与小麦Fielder相比,对白粉病的抗性显著提高。这表明MlWE18和MlIW172基因是与植物抗白粉病相关的基因,可调控小麦对白粉病的抗性。利用MlWE18和MlIW172及其编码基因可提高植物对白粉病的抗性。
附图说明
图1为蛋白MlWE18和MlIW172编码基因对比图。
图2为MlIW172突变体的表型及突变位点鉴定图。
图3为转MlWE18基因和转MlIW172基因的转基因小麦T1代家系的白粉病接种鉴定结果;其中(a)为转基因表达载体结构;(b)转MlWE18基因的T1代家系抗白粉病表型;(c)转MlIW172基因的T1代家系抗白粉病表型。
具体实施方式
下面结合具体实施方式对本发明进行进一步的详细描述,给出的实施例仅为了阐明本发明,而不是为了限制本发明的范围。以下提供的实施例可作为本技术领域普通技术人员进行进一步改进的指南,并不以任何方式构成对本发明的限制。
下述实施例中的实验方法,如无特殊说明,均为常规方法,按照本领域内的文献所描述的技术或条件或者按照产品说明书进行。下述实施例中所用的材料、试剂等,如无特殊说明,均可从商业途径得到。
下述实施例中的野生二粒小麦IW172(Ouyang et al.(2014)Fine physical andgenetic mapping of powdery mildew resistance gene MlIW172 originating fromwild emmer(Triticum dicoccoides).PLoS ONE 9,e100160.),公众可以从中国科学院遗传与发育研究所获得,以重复本申请实验,不可作为其它用途使用。
下述实施例中的小麦品种3D249(Han et al.(2009)Molecular mapping ofpowdery mildew resistance gene MlWE18 in wheat originated from wild emmer(Triticum turgidum var.dicoccoides).Acta Agron.Sin.35,1791–1797.),公众可以从中国科学院遗传与发育研究所获得,以重复本申请实验,不可作为其它用途使用。
下述实施例中的小麦fielder(Lu et al.(2020)A rare gain of functionmutation in a wheat tandem kinase confers resistance to powderymildew.Nat.Commun.11,680.),公众可以从中国科学院遗传与发育研究所获得,以重复本申请实验,不可作为其它用途使用。
下述实施例中的白粉菌生理小种E09(Lu et al.(2020)A rare gain offunction mutation in a wheat tandem kinase confers resistance to powderymildew.Nat.Commun.11,680.),公众可以从中国科学院遗传与发育研究所获得,以重复本申请实验,不可作为其它用途使用。
下述实施例中的植物表达载体pCAMBIA1300-Bar(Lu et al.(2020)A rare gainof function mutation in a wheat tandem kinase confers resistance to powderymildew.Nat.Commun.11,680.),公众可以从中国科学院遗传与发育研究所获得,以重复本申请实验,不可作为其它用途使用。
小麦白粉病病级标准(Ouyang et al.(2014)Fine physical and geneticmapping of powdery mildew resistance gene MlIW172 originating from wild emmer(Triticum dicoccoides).PLoS ONE 9,e100160.),具体见表1。
表1小麦白粉病病级标准
小麦白粉病病级(IT) | 小麦白粉病病症 |
0级 | 植株叶片无病斑和任何坏死反应 |
0;级 | 植株叶片出现坏死反应 |
1级 | 叶片病斑少,菌丝层稀薄可见绿色叶面,产孢量较少 |
2级 | 叶片病斑较少,但菌丝层较厚,不透绿,能产生一定量孢子 |
3级 | 叶片病斑较多,菌丝层厚,产孢量大,但病斑不连片 |
4级 | 叶片病斑多,菌丝层厚,产孢量大,病斑连成片 |
其中,0级代表免疫、0;级代表过敏性坏死、1级代表高抗、2级代表中抗、3级代表中感、4级代表高感。0~2级为抗病,3~4级为感病。
实施例1、小麦抗病基因MlWE18和MlIW172的克隆
提取小麦品系3D249的总RNA,并反转录为cDNA。经过大量序列分析、表达量分析与功能验证,从cDNA中发现了一个DNA编码序列,如序列表的序列2所示,其编码的蛋白质如序列表的序列3所示。
将序列表的序列3所示的蛋白质命名为MlWE18蛋白。将编码MlWE18蛋白的基因命名为MlWE18基因,其开放阅读框如序列表的序列2所示。
提取野生二粒小麦IW172的总RNA,并反转录为cDNA。经过大量序列分析、表达量分析与功能验证,从cDNA中发现了一个DNA编码序列,如序列表的序列5所示,其编码的蛋白质如序列表的序列6所示。
将序列表的序列6所示的蛋白质命名为MlIW172蛋白。将编码MlIW172蛋白的基因命名为MlIW172基因,其开放阅读框如序列表的序列5所示。
基因MlWE18和MlIW172均含有4,365个碱基,编码1,454个氨基酸。二者在基因区域(从起始密码子到终止密码子)只有4个SNP变异,其中2个引起同义氨基酸变异,两个为非同义氨基酸的变异(图1)。
实施例2、MlIW172基因功能分析
为验证MlIW172基因功能,利用0.5%的EMS溶液对15,000粒抗病亲本IW172的种子进行处理,最终获得了9份纯合感病突变体,利用高保真酶对9份突变体的候选基因进行扩增,发现9份突变体均在MlIW172发生了突变(图2a)。其中,Mut332和Mut1502翻译提前终止,其余7个突变体都是产生单个氨基酸的变异。由此表明该MlIW172基因能够有效对抗白粉菌(图2b)。在9个感病突变体中,3个发生在NB-ARC结构域,4个发生在LRRs结构域,其余两个发生在两种结构域之间的区域。
实施例3、转MlWE18和MlIW172基因小麦的获得及其鉴定
一、重组表达载体的构建
利用高保真DNA聚合酶进行扩增,经过扩增测序验证,从小麦品系3D249和野生二粒小麦IW172中各获得一个12230bp的DNA片段ProNLRWE18:NLRWE18(序列1所示)和ProNLRIW172:NLRIW172(序列4所示),其中包括起始密码子上游的2103bp,基因全长4365bp和终止密码子下游5762bp。
利用pEASY-Uni Seamless Cloning and Assembly Kit(CU101-01)同源重组试剂盒将pCAMBIA1300-Bar载体的BamHⅠ和HindⅢ酶切位点中间的片段分别替换为DNA片段ProNLRWE18:NLRWE18和DNA片段ProNLRIW172:NLRIW172,并保持pCAMBIA1300-Bar载体的其它序列不变,得到重组表达载体pCAMBIA1300-Bar-ProNLRWE18:NLRWE18(含有MlWE18基因,表达氨基酸序列如序列表中SEQ ID NO:3所示的蛋白质)和pCAMBIA1300-Bar-ProNLRIW172:NLRIW172(含有MlIW172基因,表达氨基酸序列如序列表中SEQ ID NO:6所示的蛋白质)。两个重组表达载体分别转化大肠杆菌,挑取单克隆进行测序验证。
二、转MlWE18和MlIW172基因小麦的获得
利用农杆菌介导的遗传转化方法分别将重组表达载体pCAMBIA1300-Bar-ProNLRWE18:NLRWE18和pCAMBIA1300-Bar-ProNRLIW172:NLRIW172导入受体小麦品种Fielder的愈伤组织中。愈伤组织分化的小苗转移到生长筛选培养基中,22-24℃光照培养。获得了转pCAMBIA1300-Bar-ProNLRWE18:NLRWE18再生小麦植株(简称转MlWE18基因的T0代转基因植株)和转pCAMBIA1300-Bar-ProNLRIW172:NLRIW172再生小麦植株(简称转MlIW172基因的T0代转基因植株)。经PCR检测,转MlWE18和MlIW172基因各获得4个T0代转基因阳性植株。将转MlWE18基因的4株T0代转基因植株命名为COM-WE18-1、COM-WE18-2、COM-WE18-3和COM-WE18-4;将转MlIW172基因的4株T0代转基因植株命名为COM-IW172-1、COM-IW172-2、COM-IW172-3和COM-IW172-4。
三、转MlWE18和MlIW172基因小麦的抗性鉴定
选取转基因植株COM-WE18-1、COM-WE18-2、COM-WE18-3、COM-WE18-4、COM-IW172-1、COM-IW172-2、COM-IW172-3和COM-IW172-4的T1代家系(分别命名为L-COM-WE18-1、L-COM-WE18-2、L-COM-WE18-3、L-COM-WE18-4、L-COM-IW172-1、L-COM-IW172-2、L-COM-IW172-3和L-COM-IW172-4)进行表型和基因型鉴定,每个家系随机选取18株转基因小麦作为待测小麦进行试验。利用未转基因的小麦品种Fielder为阴性对照进行抗性鉴定。
用白粉病生理小种E09分别接种待测小麦,进行小麦白粉病抗性鉴定。同时设置小麦Fielder的作为对照。具体方法如下:接种后两周左右,当感病对照品种Fielder充分发病时进行抗病性鉴定和记载,3天后复查一次。根据小麦白粉病病级标准(表1)调查发病病情。
同时,分别提取待测小麦的基因组DNA,将基因组DNA作为模板,利用上游引物5’-GTATGGCAGGGCAGACATTT-3’和下游引物5’-CATCAGACTTGAGGCCTTTC-3’进行PCR扩增,分别以重组表达质粒pCAMBIA1300-Bar-ProNLRWE18:NLRWE18和pCAMBIA1300-Bar-ProNLRIW172:NLRIW172为阳性对照,小麦Fielder的基因组DNA为阴性对照,进行PCR扩增,预期扩增产物片段大小约为500bp。
PCR反应程序如下:94℃预变性5min;94℃变性30s,58℃复性30s,72℃延伸30s,35个循环;72℃延伸10min。PCR扩增产物进行2%琼脂糖凝胶电泳检测,紫外拍照,记录结果。
结果表明(表2和图3):家系L-COM-WE18-1、L-COM-WE18-2、L-COM-WE18-3、L-COM-WE18-4、L-COM-IW172-1、L-COM-IW172-2、L-COM-IW172-3和L-COM-IW172-4中的转基因小麦均发生抗感分离,基因型鉴定结果与表型一致,表明MlWE18和MlIW172基因可以有效的对抗小麦白粉菌。
表2 MlWE18和MlIW172转基因家系基因型和表型鉴定结果
序列表
<120> 小麦白粉病抗性相关蛋白MlWE18和MlIW172及其应用
<130> GNCRJ202882
<160> 6
<170> SIPOSequenceListing 1.0
<210> 1
<211> 12230
<212> DNA
<213> 小麦 (Triticum aestivum L.)
<400> 1
ccgtcagcta gatagtaccc cttgttgtat tggtgcccat tgatctcgaa gttcaccgga 60
ggagaatggc cctcaacgag cttggcaaaa acaggagagc actgcagcac gttaatgtca 120
ttgtgagttc ctggtatacc aaagaaggag tgccaaattc agaggtcctg tgtggctacc 180
gcctcaagca ccacactgca accgcctttg acgcctttgt acatcccctg ccaaccaaat 240
gggcagttct tccattttcc aatgcatgca gtcgatgctt ccaagcatcc caggaaatcc 300
tcttactgca ttctgggcta ggatccgagc agtgtcttcc gcattgggtg ttctcaagta 360
ttgtggccca aacactgcca ccactgcccg acagaacttg tagaaacact ctatgctggt 420
ggactcggcc atgcgcccat agtcgtcgag tgaatcactt ggagctccat atgcaagcat 480
cctcatcgtt gtcgtgcact tttggatgga ggtgaatcca agagcgccgg tgcaatccat 540
cttgcacttg aagtagttgt cgaactcccg gatggaattc acaatcctga ggaagagctt 600
tcggctcatc cgataacggc gccgaaatgt tctctcgcca tgaagtggag catcgacgaa 660
gtagtcggag tagagcatgc agtagccttg cagacgatgc cggttctttg ctttcacccg 720
ccccggcgcc gagccacctc gccacggctt ttcattgctc gccagcagct gggcgagggt 780
ggcgagcacc atgagatgct cttcttcctg gacgtcggcc gcggcttcct cctccagtag 840
cgcggcgagc tcttcctcct catccgagtc catcgccgag gcaggcaaaa cgccgaacac 900
cttgcgctcg gtgggcgtgt acccgccatt aaaccgcgcc ttcgcggccg gaaacggcgg 960
ccggaaacgc ccagctgctg tgggaggggc tgccgcggcg aagtgctgct attttccggc 1020
ggggaatggc tatctagcgg agtagggcgg cggccgtcgc cgggatatag ctagtggtgg 1080
ccgagggcgc ggggggtgcg aggcgagtcg ggggaagaaa accttgactt ttcccctgtc 1140
ggtgtgggcc agacgtgctt ttccctagcg ccggagcccc caacggctcc ccagcgcgcc 1200
gggttcgccc tgtgaccgcc gggcggaaaa aaggtccgag ccggcgattt tcggcgtcct 1260
gggggcgcga ctgggccgtt ttttcggcgc cggcgccgaa aaagtggcct gagggggcct 1320
gttgggggcg cggctggaga tgccattagc aacggcgacg gcaagcttct gcctcggtcg 1380
ggataagaca aaccaatatt ttatttttca ttttattttc caatgacagg tggaccctat 1440
tgggacacac atgttatgat gtaacgctac caaacgtcgc cgttacctgt tacatgttaa 1500
gcgttgccat gtcagtctaa ttgtcggacc ccatctgtca taatctttct taacataatc 1560
agatcggctg atcagtgatt tttgcaaatc aattacacaa gacgtggtgt tttctgcaac 1620
aaaagtgtag tatagtgttt tcttttgacc tagccttcaa aataatggtt ttctgcaact 1680
tactcgtcat agccctaata cagaaaaaaa tggataaccg gctcccacat ctgtgtctac 1740
agaccaatgc tagagtaaat ttgcgaggca tgccgacttg cgatgcaata attcagaccc 1800
cacatgttaa atgtagcaat gaaaaccgac gtgacgggat acacaataat tgagagcgca 1860
catggtgtgt tggtggcgca gcttcacata gacatggaca tgacacgatg cgtttttttg 1920
ttcttctctt ggcttggaaa cgcagacgaa cgagtcttta tcagcagtgg tccgctgcca 1980
ttccatccac ccaaacccaa gagcattcgc ttcccagccc caagaaaaat cgcccaaaat 2040
cccacctgcc ataagccccc aataattgac ccctgctagc gccggcgtcg gcggccggag 2100
aagatggaat cggcgattgg cgcggcaacc gggctcgtcg gcagcgtggt gaatctgctc 2160
tccaacgagc tcgtgggggc gtacgtcgcc agcaccgagc tcggcctcaa ctccgtcgag 2220
atcaagaaag atctcctgcg cgcgcaagcg ctgttgcagg aggcccagac gaggggcccc 2280
aaggacaatc acggcctgaa ggggctgctg caggagctca ccgtcaaggc cgacgaggcc 2340
gaggatgcgc tggatgagct ccactacttc atcattcagg accagctcga cggcacccgc 2400
tatgcggtgc cggacctggg cgacgacctt cgcggtcatg ctcgccacgg tcgccatgct 2460
attcgccatg tcgtcggtaa ctgccttgca tgcttttctt gttcgtcgac tatgaaagat 2520
gatgatagtg gtggtggccg tggtgctgct gctgttccta ttaacccact caatgcaacc 2580
aagactgata gtgctggtca agatggtcca gttgaaaagc tgaaatttca tagagtggcc 2640
atgtccaaga aaattaaatc agtgatggag gaactacact ccctgtgtga gcccgtatcc 2700
aaattgctct gcataactcc acaccagggt agcactgaga ctgctgtcaa cctaaatcgt 2760
cctcctacgg gatcaatcag tgcgcagcat acattgtatg gcagggcaga cattttcgag 2820
ggaaccaaag atgatataac cagtggcaga ttctataccg aaaccctttc agttcttcct 2880
gtagttggtc caggtggcat tggaaagaca actttcgccc aatacctgta taatgataaa 2940
aggatacaag agcactttgc tgttagggtc tgggtatgtg tatcgactga ttttgatgtg 3000
cttaagctca gccaacagat ccttagctgc atacaaggaa gcaactctcc aaatcaaaca 3060
acaagtttag accagctcca aatatccatc gcagagaaaa tcatgtcgaa aaggtttcta 3120
attgtgtttg atgatatgtg gaaatgtagt aatcagggct gggaagacct gctagctcct 3180
ttgaggaaag gggaaaccaa gggtaacatg gttctcgtca caactcgatt tccatccata 3240
gctgaaatgg tgaaaacaac accttcaata ccactgaaag gcctcaagtc tgatgaattc 3300
ttcagattct ttgaagcatt catatttgat gaaaaaaaac ctgaatacca aggtgaccta 3360
gcaagtattg cacagactat tgctaagaaa ttgaagggtt cccctctagc aggcaaaacc 3420
gttggtcgat tattgaggaa ggagatttct cggaaacact ggatggaagt tctcgaaaac 3480
aacaaatggc aaaaccaaga aaatgatgat ggcattatga catctctcag gattagctat 3540
gattaccttc cttcccatct gaaaaaatgt ttccaatatt ttgccttatt ccctgaagat 3600
tacaagttta agaatttaga gattacatat ttctggattg caataggaat catagaaaaa 3660
gatgaggatt acatggaaca actagtggaa aatggttttc ttgtgaagga gaatgctcat 3720
tggtcaagcc aacagtacta tgtactgcat gatttactac atgaactgtc ccgatgtgtt 3780
tcctcacaag aatgcctcaa tatatatgat agtaaaaggt ttaaagctga tgccgttcca 3840
aaatctattc atcatttgtc cattaccatt gaagacaact atgatgtgac ttttactgaa 3900
gaaatgatta agttgaagag caaggtagac attgcaaatt tgcgggcttt gatgattttt 3960
agaagatatg gagaaacaat tggtgagatt ttaaatgata ctttcaagga tgtagagggt 4020
cttcgtgtcc tatttatagt tgtgcagtcc ccaaattcgt tcccatgtga tttctcaaaa 4080
cttatccacc tccgatacct aaaacttgga acatgggacg aagtgacttt gcctagcaca 4140
ctgtccagac tttatcacct aaaactattg gacctgagtg gttggcatgg tagttatgaa 4200
ttgcctagag acactagccg ccttatcagt ttatgccatt ttattgctgg aaaaaaattc 4260
cattgcaata ttgctgaggt tggaaagatg aagtgtttaa aggagcttaa agaattccat 4320
gtcatgaaag agagtgttgg atatgaactg tgtgagttgg ggcagttaac agatcttgga 4380
ggagaactct gtatacgtaa tcttgaaaag gtgacaacca aggatgaggc tatggaagcc 4440
aagcttgagt caaaaggtga tttgaaaggg ttggggttgg tttggggatc atctgatgcc 4500
gttgatggta caacaaaatt agttgatgtt cttgatgctc ttcagccaca ccctaacctt 4560
gcagcacttg gcatcatagg tcatggtggt acaagtggtc ctagttggtt gtgtggtgac 4620
attaactttg agttgttgga gtatcttcac ctagagggag tatcttgggt cactcttcct 4680
ccttttgagt acttgccgca gctcacatca ctcacattga gcaacattta tgaagttagt 4740
gagatccgcc atgtctttgg tggcgctaca aacaaaagtt ttttgcagtt gaagctgatt 4800
gtccttgatt cattgcctga acttattgag tgggttgggg tgcctaatga ttcgtttgca 4860
aggcttgaaa ctatttattg caggtcatgc cccaatctcc gtacgctacc cttcctgcgg 4920
gagtactctg ctggctgtta taaccatctg tcgaaacttg aaatttttga ttgcccaaag 4980
ttgtctctcc ctcccatgcc tcacagttcc acactgatat tttgtcatgt gagacagagt 5040
tccgcagagc tgacttattc tgcagggaac aacatgtata ttaatgggta taatagtggt 5100
gaggtggact tgcataacct gcgaaaagta gagaaaatga gtattacaga tgtatcacac 5160
atttcaatga cagagctgaa caatctgaaa tccctaagaa gactagatgt cagaagatgc 5220
agcttcacgt gccatggact gcaggacctc gcgtgcctcc aatatcttac agtatacaac 5280
tgtggcgtct tctttccatg gcctaccgaa gcagctcaca ccatcaagct tttccctgct 5340
tccctcgaga cacttgagat tgaaggagag tcaggcatgc agtcaatggc tctgctcagc 5400
aatctgaaat ccctaaggag actagatgtc agaagatgca gcatcacgtg ccatggactg 5460
caggacctcg catgcctcca atcacttaca gtacaagact gtggcaactt ctttccatgg 5520
cctaccgaag cagctcacac cgtcaatcct ttccctcaca ccatcaagcc tttccctgct 5580
tccctcgaga cacttgagat tgaaggagag ttaggcatgc agccagtggc tttgctcagc 5640
aatctgaaat ccctaagaag actagatgtc agaagatgca gcatcacgtg ccatggactg 5700
caggacctcg cgtgcctcca atcagttaca gtaaaagaat gtggcaactt ctttctgcgg 5760
cccatcgaag cagctcacac catcatcaat cctttgcctg cttccctcga agaacttgag 5820
attgatggag agtcaagcat ggagtcaatg gctctgctct cgaacctcac ttgtctcacc 5880
gatcttatac tggtgaactc tgagaattta acagtggatg gttttaatcc tctcatcaca 5940
gtcaacctca acagtttgaa gatgtacaac agaggaaatt gtcttagccg ttctatatca 6000
gcggatttgc tctcggaatt ggcggtagca aggaccaacc tgttaccgcc tgcaggctcc 6060
tatcaactac gagaacttac agtggattgc atctcagcag tgcttgttgc tcccatctgt 6120
agcctccttg ccgctaccct ccacacttta gaattctggt atgatcaccg ggcagaaagc 6180
ttcacggaag aggaagagag agctcttcag ctcctcacat ccctctgctt catcaatttt 6240
atggattgcc cgaatcttct gtgccttcct caaggcctac acagccttcc ttctcttaag 6300
acactatttg tccaagattg tcctaaaatc cgatctctgg acaaggggga ctttcccact 6360
tcactggagt gtctattagt gcagggttgc agtcccgggc tacaagagca agccaagaaa 6420
ttaaaaggaa cagaaccaga ttttaatgtg atacttgaac ttgagtagct caaggtaatt 6480
aacagaactc ctgctatcta tgtattcctt cttttcctac gaggatcaca attctctact 6540
gttttaaagt actaactagt aacttgttaa aatctgacag gtggattgcc gacacagctg 6600
ctcagtcgtt ttagctgcat tctgtacata agtcatgaaa cctctccgcc taccattgtg 6660
tacgtatatc ggtgcaagtg atcatgttgc atctccactt aatgtggtca atccagtcgc 6720
cggttaatta gtgagtaagt gcacgcaaaa ccagcgtttt gggtaccgag cgaaatttcg 6780
cgatctcgcg cggttaccgc gtttaccgtt gcccctcgag aaacactcat cccgagcaaa 6840
atatctcgaa tattttgaaa tttttgaatt caaacgttca ttgtctagtt aaataggcag 6900
catctcttat atacaaccag aactggtagt ggcctagcgg caaaggcatc atttccgtag 6960
cagcaagtcg cgggttcgag tctcgcctcg cgcaggttta tttttttcga aaaaaagaaa 7020
aagtttaaaa atactgcaat gaccagcagt cgaacttgcg acgtgcgtgc aagctacact 7080
aaccactacg ccaggaagtg tttggcgata cttatattag taaaatgcct aatatatcga 7140
cgtcaaaaat atttgaattc aaattttgat tttaaatttc gcccgagttt ttttcagaat 7200
ttcgcggtta ccgtggtaac cgcgaattcc ggtgaccctc gagaaaaaag gacagcttgg 7260
gatccaaaac ctggcgcaaa accaacttgt actttacgtg tttctatgtt ttatgatgta 7320
ctgctacctg actatataca tcttgatttt cttaacagct gcggtacctg tttttgtgtg 7380
attttatcac aaactgcagc atacttgccg ccgctatttg agactgagca ggagtactac 7440
agtatgcatg tttagcgggt agttgcatgg taaattttga aggagggcca agaggatctg 7500
gaggtgaatg catgcctata tatcaccaag aaagtggatt ctattttgct gagcatttaa 7560
atgttggttt tctaaccttc cgttcactac tctgagagtg gattggcaat cagaacttaa 7620
ttggtagcta gagtacatcc aaacaataaa agaagcacta acaaaagtat catcaagtag 7680
ctgtgcagtt ctaccaggac catttcaaag gttagtccgc cactcttttc ctctttcttt 7740
tccccttctt attctaagtt tatcatctca gtggttcttg tcactcccat atgcagttta 7800
ctcgccgcta actaccctct atacattgta attttctatt gatgatcgtg tggcaagttc 7860
acctgttgac caacggctca aagttatgac agagcaataa tatccttttt ttagtcttac 7920
actcattgct tggattctgc agaacaatta cgaccaacag tggattcaga agttggcatt 7980
aagttcaaag ctcggttgtt ccatcgaaac aacgagggta acaatggaag catctgtttg 8040
gagatctatg atggagagct gagtgcaaag gaagtagatt acttcccgtg gtgatccatg 8100
cgtataaaca agttacattt cgtatttact tgtggcataa ttcaggtttt cagttcgtgt 8160
atgtggtggc agcatcaccg tgtagcggtg tcggtgtgtc aaccctcaga atttcagttt 8220
cacttttacc aggttcagtc atgttgacca acgactcaaa gtaaggactg ggcaaactat 8280
atccatgatg atgaattcca tgaccttgct catcattttc tgagtattgt attcattact 8340
cgaattttgc agccaacagt gaattcagaa gccggtattt agaagctaaa ttggttgtta 8400
catccaaaca agaaaagtaa cctgcacaga gcctctgtct tgatggctgc atggagagct 8460
gcacaggtgt gcatgatcta ccatttgaaa gagcctctat tcatggtgca tgcattggct 8520
tccccagtga gttcacatct tgtctttcac aaaattagct atttgggtgg agagcaattt 8580
atcctctcgt aagttcatct atataagttg ttttagatga taagagtgct tgttacttat 8640
tgtatatggt gtaggcagca gaccttttct ccatttttgt tggtttaaac agtttttttc 8700
ttctgaatat atgtcgatgg taattaaaac aagttttcat gcgatattgg aacagattga 8760
gtatttgttc tttcagtcca attccactgt atccccaaca accgccaact acatcgtgtg 8820
caggcagatt tgttatctgg tgatgtttca gtgtcaacct ccaaagtttc agttccagtt 8880
tagcaggttc tccggcctgt tgaccaacca ctcaaagtaa tgactgagca attatttcca 8940
taaataatca attcaattat cttgttcatc agtcatcact ctatgatgtg tctctgcata 9000
acagttcaca cgcaacagtg aatttcgaga gctgcgcatt cagaagttca gctggctcgg 9060
ttgttccatc caaacaatat aagaaaccag ccgttatccc ttcccctggt gtctcgaatg 9120
gagagctaca aggttttgcg taatagacga gttccaaggt aattgagtat aaggccgcgc 9180
ttggattgag tgtaacttaa tacaggtgtg tgaattttac aagtgtattt tacagggcta 9240
gactaatttg cggccggaaa agatacagac gtacgggggc gtgtattttt tacaggtgta 9300
tcggaaggaa acacggtaat ccaaacgcgg cctaaggtga gtggcaacac acgcgttttc 9360
aaagaattaa gttagtaccg tacacattta catgttcaag ttttcggtgc tgactcgttc 9420
cattcagtgc aggtcagcag gtgtggtgtg tctcactcct ttggtgagca ctgaagcctg 9480
aggaagaatc tatggttcaa gatatgaagc agggtgcaac tgctctggct gacgcactga 9540
ccaaatgcgt cttgagacca aatgcgtctg tggtcttcca caggcgacca catcctctgg 9600
ccacgcagcc gtctttcatc tctgctcagg ccagacttgg ccccgccgcc acactctcca 9660
ccctctccct cttcatcgtc caccttgtta gttagtgtcc atagtcgctg ggttcagcta 9720
gcaaactctg taacgtgcac atacgtgaga tgcctatcta caccactcct tcaccgctca 9780
ccacgctgga tcgatcgaac aggtacaaca atggccagaa cttggcacac ataccagata 9840
atgatgatgc tgaattgatg atgatgatgt ccgtgttgga tgatgatgca atgcatgttt 9900
cttgtatgtt gaccattaca ttgattcttc gtactgtcaa cttacctaga gggcgtatcc 9960
ttgattcaaa attaaaagga actcacaagg ctggaacata ctaatgcagg tcagttccca 10020
agcaatggct tgagaagaaa aatgtatgta ttatatattt agaatggtga gtgtgcatgt 10080
atggccgaga aaacaaaaca atatttggtg taccggatgg ggcattcttc ttagttgagt 10140
ttgaagaagc tgagaaggcc aatgacacgc ttctccggaa atataattgg gcatcgcact 10200
ctgattgtcc actcttgccc tttatataag gtgtatttgt tttttgataa aattccaaaa 10260
tgtaaggtgc atttcatcta aatcctcgta attccctttt tatccctcta gaaagagaag 10320
agtatctccc acccgattga ttgtatctct ccttgtatga aaataaggaa actatctccc 10380
tctcgattgc atgcatctct tactttccta gactcgctga tttacttgcc actaaccaac 10440
aaatttttca agggtaattt tgttctaaca cctttataat tatgcgtctt ggtcaccatg 10500
ccaaaaataa tacaccttac acaaagaaac ggagggagta tttgctaaga gggtgtcaag 10560
aatgagaaga taacttctcc ggaaaaaaat gaaaaaaaaa taaacacagt gagaatagac 10620
taggccatct actgagtcac gggctgggcc ggtcctctca caaacagtta acggcccttt 10680
tggtaccatc gtttggtgtt gccctcggct gggcctccag tgggcaggac gacgaggtcg 10740
gacggacgaa cgtggtcggc gcggggagct gctcgtcgcc ggcgacaggt ttgcccctcc 10800
gccgactcga gtccaccacg caccgtgctc ccctcatgta actcacatcg ggtcccccta 10860
ccttgctctg tactgcagct ccggtgggcg ccaatggaga tcccaacgga tccaatcaat 10920
ggaggctcga gggagggaga ttttctccag ctccgtcagc catcgccgtc gccgtgctcg 10980
acactcacat ggacccaatc caacggtaaa atccatcatt ttcttctact ttcctaggaa 11040
ttccaccacc accaccatac ccgcattcac atggcactca cgactgcttg cttggtgctt 11100
gatcagtgtt ctggatggag ggaatctgga ccaatccgtt ggctgcacaa cacaacaatt 11160
aatagttgaa cagcgatagc ctcgactgcg ctagggaagg caccggtcag tcttctaatg 11220
caatcattct aaatgctcat cacctccgat ctttgtgttt ggattgtgtc aacaaagctg 11280
ctttcttgat gggcaccatg acaaccatgt tcctagtttc gtaccggctt ctcacatagt 11340
aatagtacat catcactaaa tcgatccaag cacgagtaca gttggctgta tgtaaaaaat 11400
cccacaagag agtatttcta accacgctca cataaaccga catggccaga gtcatgatac 11460
cgacataccg tgctgagcat ctctgtgcag gagggaatta aggtctcgac attcacacgg 11520
acccaaggcc aagagaaaag tagcatcggc catcattttc tttctaggaa tccatcgcta 11580
gctactacca cattcacatg gaacagacga attgcttggt agtcactgtt ttggatggat 11640
ccggaatagc ttcttggcta gcaacgaagc tggttacttg gccacacaga aaatatcttt 11700
gattgggacg gggatgaaga ttgtttggct ctttactaga tgagaatcct ttggctgcag 11760
catacaacaa taattgaacg gtgacagcat cgactttggc tagtcaccag tcaatctggc 11820
aaagtttttc tatgaactga tcagctcctc gtgtttggat gtatcaaagt tgttgctttc 11880
ttgatgggag ccatgacaac aatgtcacaa ggttcatgca tcgcggcttc ctagatgaca 11940
catgaataaa cgatcaaagc actagtgcag tccgctataa aagggatcaa ccagaccctc 12000
ttctcaactg cgtgcacata aacaacacac agcctttggt accatatcca ccacagcaca 12060
aactagacac tttttccctg tctgccccag ccccatagga gatggaggcg gcgattggcc 12120
tcgcaagcgg gctcgtcgac agcgtgctga ctctgctttc caatgagctc gtgggggcgt 12180
atgtagccag ctctgagcgt gacctgatgt tcacgcaagg actattgcac 12230
<210> 2
<211> 4365
<212> DNA
<213> 小麦 (Triticum aestivum L.)
<400> 2
atggaatcgg cgattggcgc ggcaaccggg ctcgtcggca gcgtggtgaa tctgctctcc 60
aacgagctcg tgggggcgta cgtcgccagc accgagctcg gcctcaactc cgtcgagatc 120
aagaaagatc tcctgcgcgc gcaagcgctg ttgcaggagg cccagacgag gggccccaag 180
gacaatcacg gcctgaaggg gctgctgcag gagctcaccg tcaaggccga cgaggccgag 240
gatgcgctgg atgagctcca ctacttcatc attcaggacc agctcgacgg cacccgctat 300
gcggtgccgg acctgggcga cgaccttcgc ggtcatgctc gccacggtcg ccatgctatt 360
cgccatgtcg tcggtaactg ccttgcatgc ttttcttgtt cgtcgactat gaaagatgat 420
gatagtggtg gtggccgtgg tgctgctgct gttcctatta acccactcaa tgcaaccaag 480
actgatagtg ctggtcaaga tggtccagtt gaaaagctga aatttcatag agtggccatg 540
tccaagaaaa ttaaatcagt gatggaggaa ctacactccc tgtgtgagcc cgtatccaaa 600
ttgctctgca taactccaca ccagggtagc actgagactg ctgtcaacct aaatcgtcct 660
cctacgggat caatcagtgc gcagcataca ttgtatggca gggcagacat tttcgaggga 720
accaaagatg atataaccag tggcagattc tataccgaaa ccctttcagt tcttcctgta 780
gttggtccag gtggcattgg aaagacaact ttcgcccaat acctgtataa tgataaaagg 840
atacaagagc actttgctgt tagggtctgg gtatgtgtat cgactgattt tgatgtgctt 900
aagctcagcc aacagatcct tagctgcata caaggaagca actctccaaa tcaaacaaca 960
agtttagacc agctccaaat atccatcgca gagaaaatca tgtcgaaaag gtttctaatt 1020
gtgtttgatg atatgtggaa atgtagtaat cagggctggg aagacctgct agctcctttg 1080
aggaaagggg aaaccaaggg taacatggtt ctcgtcacaa ctcgatttcc atccatagct 1140
gaaatggtga aaacaacacc ttcaatacca ctgaaaggcc tcaagtctga tgaattcttc 1200
agattctttg aagcattcat atttgatgaa aaaaaacctg aataccaagg tgacctagca 1260
agtattgcac agactattgc taagaaattg aagggttccc ctctagcagg caaaaccgtt 1320
ggtcgattat tgaggaagga gatttctcgg aaacactgga tggaagttct cgaaaacaac 1380
aaatggcaaa accaagaaaa tgatgatggc attatgacat ctctcaggat tagctatgat 1440
taccttcctt cccatctgaa aaaatgtttc caatattttg ccttattccc tgaagattac 1500
aagtttaaga atttagagat tacatatttc tggattgcaa taggaatcat agaaaaagat 1560
gaggattaca tggaacaact agtggaaaat ggttttcttg tgaaggagaa tgctcattgg 1620
tcaagccaac agtactatgt actgcatgat ttactacatg aactgtcccg atgtgtttcc 1680
tcacaagaat gcctcaatat atatgatagt aaaaggttta aagctgatgc cgttccaaaa 1740
tctattcatc atttgtccat taccattgaa gacaactatg atgtgacttt tactgaagaa 1800
atgattaagt tgaagagcaa ggtagacatt gcaaatttgc gggctttgat gatttttaga 1860
agatatggag aaacaattgg tgagatttta aatgatactt tcaaggatgt agagggtctt 1920
cgtgtcctat ttatagttgt gcagtcccca aattcgttcc catgtgattt ctcaaaactt 1980
atccacctcc gatacctaaa acttggaaca tgggacgaag tgactttgcc tagcacactg 2040
tccagacttt atcacctaaa actattggac ctgagtggtt ggcatggtag ttatgaattg 2100
cctagagaca ctagccgcct tatcagttta tgccatttta ttgctggaaa aaaattccat 2160
tgcaatattg ctgaggttgg aaagatgaag tgtttaaagg agcttaaaga attccatgtc 2220
atgaaagaga gtgttggata tgaactgtgt gagttggggc agttaacaga tcttggagga 2280
gaactctgta tacgtaatct tgaaaaggtg acaaccaagg atgaggctat ggaagccaag 2340
cttgagtcaa aaggtgattt gaaagggttg gggttggttt ggggatcatc tgatgccgtt 2400
gatggtacaa caaaattagt tgatgttctt gatgctcttc agccacaccc taaccttgca 2460
gcacttggca tcataggtca tggtggtaca agtggtccta gttggttgtg tggtgacatt 2520
aactttgagt tgttggagta tcttcaccta gagggagtat cttgggtcac tcttcctcct 2580
tttgagtact tgccgcagct cacatcactc acattgagca acatttatga agttagtgag 2640
atccgccatg tctttggtgg cgctacaaac aaaagttttt tgcagttgaa gctgattgtc 2700
cttgattcat tgcctgaact tattgagtgg gttggggtgc ctaatgattc gtttgcaagg 2760
cttgaaacta tttattgcag gtcatgcccc aatctccgta cgctaccctt cctgcgggag 2820
tactctgctg gctgttataa ccatctgtcg aaacttgaaa tttttgattg cccaaagttg 2880
tctctccctc ccatgcctca cagttccaca ctgatatttt gtcatgtgag acagagttcc 2940
gcagagctga cttattctgc agggaacaac atgtatatta atgggtataa tagtggtgag 3000
gtggacttgc ataacctgcg aaaagtagag aaaatgagta ttacagatgt atcacacatt 3060
tcaatgacag agctgaacaa tctgaaatcc ctaagaagac tagatgtcag aagatgcagc 3120
ttcacgtgcc atggactgca ggacctcgcg tgcctccaat atcttacagt atacaactgt 3180
ggcgtcttct ttccatggcc taccgaagca gctcacacca tcaagctttt ccctgcttcc 3240
ctcgagacac ttgagattga aggagagtca ggcatgcagt caatggctct gctcagcaat 3300
ctgaaatccc taaggagact agatgtcaga agatgcagca tcacgtgcca tggactgcag 3360
gacctcgcat gcctccaatc acttacagta caagactgtg gcaacttctt tccatggcct 3420
accgaagcag ctcacaccgt caatcctttc cctcacacca tcaagccttt ccctgcttcc 3480
ctcgagacac ttgagattga aggagagtta ggcatgcagc cagtggcttt gctcagcaat 3540
ctgaaatccc taagaagact agatgtcaga agatgcagca tcacgtgcca tggactgcag 3600
gacctcgcgt gcctccaatc agttacagta aaagaatgtg gcaacttctt tctgcggccc 3660
atcgaagcag ctcacaccat catcaatcct ttgcctgctt ccctcgaaga acttgagatt 3720
gatggagagt caagcatgga gtcaatggct ctgctctcga acctcacttg tctcaccgat 3780
cttatactgg tgaactctga gaatttaaca gtggatggtt ttaatcctct catcacagtc 3840
aacctcaaca gtttgaagat gtacaacaga ggaaattgtc ttagccgttc tatatcagcg 3900
gatttgctct cggaattggc ggtagcaagg accaacctgt taccgcctgc aggctcctat 3960
caactacgag aacttacagt ggattgcatc tcagcagtgc ttgttgctcc catctgtagc 4020
ctccttgccg ctaccctcca cactttagaa ttctggtatg atcaccgggc agaaagcttc 4080
acggaagagg aagagagagc tcttcagctc ctcacatccc tctgcttcat caattttatg 4140
gattgcccga atcttctgtg ccttcctcaa ggcctacaca gccttccttc tcttaagaca 4200
ctatttgtcc aagattgtcc taaaatccga tctctggaca agggggactt tcccacttca 4260
ctggagtgtc tattagtgca gggttgcagt cccgggctac aagagcaagc caagaaatta 4320
aaaggaacag aaccagattt taatgtgata cttgaacttg agtag 4365
<210> 3
<211> 1454
<212> PRT
<213> 小麦 (Triticum aestivum L.)
<400> 3
Met Glu Ser Ala Ile Gly Ala Ala Thr Gly Leu Val Gly Ser Val Val
1 5 10 15
Asn Leu Leu Ser Asn Glu Leu Val Gly Ala Tyr Val Ala Ser Thr Glu
20 25 30
Leu Gly Leu Asn Ser Val Glu Ile Lys Lys Asp Leu Leu Arg Ala Gln
35 40 45
Ala Leu Leu Gln Glu Ala Gln Thr Arg Gly Pro Lys Asp Asn His Gly
50 55 60
Leu Lys Gly Leu Leu Gln Glu Leu Thr Val Lys Ala Asp Glu Ala Glu
65 70 75 80
Asp Ala Leu Asp Glu Leu His Tyr Phe Ile Ile Gln Asp Gln Leu Asp
85 90 95
Gly Thr Arg Tyr Ala Val Pro Asp Leu Gly Asp Asp Leu Arg Gly His
100 105 110
Ala Arg His Gly Arg His Ala Ile Arg His Val Val Gly Asn Cys Leu
115 120 125
Ala Cys Phe Ser Cys Ser Ser Thr Met Lys Asp Asp Asp Ser Gly Gly
130 135 140
Gly Arg Gly Ala Ala Ala Val Pro Ile Asn Pro Leu Asn Ala Thr Lys
145 150 155 160
Thr Asp Ser Ala Gly Gln Asp Gly Pro Val Glu Lys Leu Lys Phe His
165 170 175
Arg Val Ala Met Ser Lys Lys Ile Lys Ser Val Met Glu Glu Leu His
180 185 190
Ser Leu Cys Glu Pro Val Ser Lys Leu Leu Cys Ile Thr Pro His Gln
195 200 205
Gly Ser Thr Glu Thr Ala Val Asn Leu Asn Arg Pro Pro Thr Gly Ser
210 215 220
Ile Ser Ala Gln His Thr Leu Tyr Gly Arg Ala Asp Ile Phe Glu Gly
225 230 235 240
Thr Lys Asp Asp Ile Thr Ser Gly Arg Phe Tyr Thr Glu Thr Leu Ser
245 250 255
Val Leu Pro Val Val Gly Pro Gly Gly Ile Gly Lys Thr Thr Phe Ala
260 265 270
Gln Tyr Leu Tyr Asn Asp Lys Arg Ile Gln Glu His Phe Ala Val Arg
275 280 285
Val Trp Val Cys Val Ser Thr Asp Phe Asp Val Leu Lys Leu Ser Gln
290 295 300
Gln Ile Leu Ser Cys Ile Gln Gly Ser Asn Ser Pro Asn Gln Thr Thr
305 310 315 320
Ser Leu Asp Gln Leu Gln Ile Ser Ile Ala Glu Lys Ile Met Ser Lys
325 330 335
Arg Phe Leu Ile Val Phe Asp Asp Met Trp Lys Cys Ser Asn Gln Gly
340 345 350
Trp Glu Asp Leu Leu Ala Pro Leu Arg Lys Gly Glu Thr Lys Gly Asn
355 360 365
Met Val Leu Val Thr Thr Arg Phe Pro Ser Ile Ala Glu Met Val Lys
370 375 380
Thr Thr Pro Ser Ile Pro Leu Lys Gly Leu Lys Ser Asp Glu Phe Phe
385 390 395 400
Arg Phe Phe Glu Ala Phe Ile Phe Asp Glu Lys Lys Pro Glu Tyr Gln
405 410 415
Gly Asp Leu Ala Ser Ile Ala Gln Thr Ile Ala Lys Lys Leu Lys Gly
420 425 430
Ser Pro Leu Ala Gly Lys Thr Val Gly Arg Leu Leu Arg Lys Glu Ile
435 440 445
Ser Arg Lys His Trp Met Glu Val Leu Glu Asn Asn Lys Trp Gln Asn
450 455 460
Gln Glu Asn Asp Asp Gly Ile Met Thr Ser Leu Arg Ile Ser Tyr Asp
465 470 475 480
Tyr Leu Pro Ser His Leu Lys Lys Cys Phe Gln Tyr Phe Ala Leu Phe
485 490 495
Pro Glu Asp Tyr Lys Phe Lys Asn Leu Glu Ile Thr Tyr Phe Trp Ile
500 505 510
Ala Ile Gly Ile Ile Glu Lys Asp Glu Asp Tyr Met Glu Gln Leu Val
515 520 525
Glu Asn Gly Phe Leu Val Lys Glu Asn Ala His Trp Ser Ser Gln Gln
530 535 540
Tyr Tyr Val Leu His Asp Leu Leu His Glu Leu Ser Arg Cys Val Ser
545 550 555 560
Ser Gln Glu Cys Leu Asn Ile Tyr Asp Ser Lys Arg Phe Lys Ala Asp
565 570 575
Ala Val Pro Lys Ser Ile His His Leu Ser Ile Thr Ile Glu Asp Asn
580 585 590
Tyr Asp Val Thr Phe Thr Glu Glu Met Ile Lys Leu Lys Ser Lys Val
595 600 605
Asp Ile Ala Asn Leu Arg Ala Leu Met Ile Phe Arg Arg Tyr Gly Glu
610 615 620
Thr Ile Gly Glu Ile Leu Asn Asp Thr Phe Lys Asp Val Glu Gly Leu
625 630 635 640
Arg Val Leu Phe Ile Val Val Gln Ser Pro Asn Ser Phe Pro Cys Asp
645 650 655
Phe Ser Lys Leu Ile His Leu Arg Tyr Leu Lys Leu Gly Thr Trp Asp
660 665 670
Glu Val Thr Leu Pro Ser Thr Leu Ser Arg Leu Tyr His Leu Lys Leu
675 680 685
Leu Asp Leu Ser Gly Trp His Gly Ser Tyr Glu Leu Pro Arg Asp Thr
690 695 700
Ser Arg Leu Ile Ser Leu Cys His Phe Ile Ala Gly Lys Lys Phe His
705 710 715 720
Cys Asn Ile Ala Glu Val Gly Lys Met Lys Cys Leu Lys Glu Leu Lys
725 730 735
Glu Phe His Val Met Lys Glu Ser Val Gly Tyr Glu Leu Cys Glu Leu
740 745 750
Gly Gln Leu Thr Asp Leu Gly Gly Glu Leu Cys Ile Arg Asn Leu Glu
755 760 765
Lys Val Thr Thr Lys Asp Glu Ala Met Glu Ala Lys Leu Glu Ser Lys
770 775 780
Gly Asp Leu Lys Gly Leu Gly Leu Val Trp Gly Ser Ser Asp Ala Val
785 790 795 800
Asp Gly Thr Thr Lys Leu Val Asp Val Leu Asp Ala Leu Gln Pro His
805 810 815
Pro Asn Leu Ala Ala Leu Gly Ile Ile Gly His Gly Gly Thr Ser Gly
820 825 830
Pro Ser Trp Leu Cys Gly Asp Ile Asn Phe Glu Leu Leu Glu Tyr Leu
835 840 845
His Leu Glu Gly Val Ser Trp Val Thr Leu Pro Pro Phe Glu Tyr Leu
850 855 860
Pro Gln Leu Thr Ser Leu Thr Leu Ser Asn Ile Tyr Glu Val Ser Glu
865 870 875 880
Ile Arg His Val Phe Gly Gly Ala Thr Asn Lys Ser Phe Leu Gln Leu
885 890 895
Lys Leu Ile Val Leu Asp Ser Leu Pro Glu Leu Ile Glu Trp Val Gly
900 905 910
Val Pro Asn Asp Ser Phe Ala Arg Leu Glu Thr Ile Tyr Cys Arg Ser
915 920 925
Cys Pro Asn Leu Arg Thr Leu Pro Phe Leu Arg Glu Tyr Ser Ala Gly
930 935 940
Cys Tyr Asn His Leu Ser Lys Leu Glu Ile Phe Asp Cys Pro Lys Leu
945 950 955 960
Ser Leu Pro Pro Met Pro His Ser Ser Thr Leu Ile Phe Cys His Val
965 970 975
Arg Gln Ser Ser Ala Glu Leu Thr Tyr Ser Ala Gly Asn Asn Met Tyr
980 985 990
Ile Asn Gly Tyr Asn Ser Gly Glu Val Asp Leu His Asn Leu Arg Lys
995 1000 1005
Val Glu Lys Met Ser Ile Thr Asp Val Ser His Ile Ser Met Thr Glu
1010 1015 1020
Leu Asn Asn Leu Lys Ser Leu Arg Arg Leu Asp Val Arg Arg Cys Ser
1025 1030 1035 1040
Phe Thr Cys His Gly Leu Gln Asp Leu Ala Cys Leu Gln Tyr Leu Thr
1045 1050 1055
Val Tyr Asn Cys Gly Val Phe Phe Pro Trp Pro Thr Glu Ala Ala His
1060 1065 1070
Thr Ile Lys Leu Phe Pro Ala Ser Leu Glu Thr Leu Glu Ile Glu Gly
1075 1080 1085
Glu Ser Gly Met Gln Ser Met Ala Leu Leu Ser Asn Leu Lys Ser Leu
1090 1095 1100
Arg Arg Leu Asp Val Arg Arg Cys Ser Ile Thr Cys His Gly Leu Gln
1105 1110 1115 1120
Asp Leu Ala Cys Leu Gln Ser Leu Thr Val Gln Asp Cys Gly Asn Phe
1125 1130 1135
Phe Pro Trp Pro Thr Glu Ala Ala His Thr Val Asn Pro Phe Pro His
1140 1145 1150
Thr Ile Lys Pro Phe Pro Ala Ser Leu Glu Thr Leu Glu Ile Glu Gly
1155 1160 1165
Glu Leu Gly Met Gln Pro Val Ala Leu Leu Ser Asn Leu Lys Ser Leu
1170 1175 1180
Arg Arg Leu Asp Val Arg Arg Cys Ser Ile Thr Cys His Gly Leu Gln
1185 1190 1195 1200
Asp Leu Ala Cys Leu Gln Ser Val Thr Val Lys Glu Cys Gly Asn Phe
1205 1210 1215
Phe Leu Arg Pro Ile Glu Ala Ala His Thr Ile Ile Asn Pro Leu Pro
1220 1225 1230
Ala Ser Leu Glu Glu Leu Glu Ile Asp Gly Glu Ser Ser Met Glu Ser
1235 1240 1245
Met Ala Leu Leu Ser Asn Leu Thr Cys Leu Thr Asp Leu Ile Leu Val
1250 1255 1260
Asn Ser Glu Asn Leu Thr Val Asp Gly Phe Asn Pro Leu Ile Thr Val
1265 1270 1275 1280
Asn Leu Asn Ser Leu Lys Met Tyr Asn Arg Gly Asn Cys Leu Ser Arg
1285 1290 1295
Ser Ile Ser Ala Asp Leu Leu Ser Glu Leu Ala Val Ala Arg Thr Asn
1300 1305 1310
Leu Leu Pro Pro Ala Gly Ser Tyr Gln Leu Arg Glu Leu Thr Val Asp
1315 1320 1325
Cys Ile Ser Ala Val Leu Val Ala Pro Ile Cys Ser Leu Leu Ala Ala
1330 1335 1340
Thr Leu His Thr Leu Glu Phe Trp Tyr Asp His Arg Ala Glu Ser Phe
1345 1350 1355 1360
Thr Glu Glu Glu Glu Arg Ala Leu Gln Leu Leu Thr Ser Leu Cys Phe
1365 1370 1375
Ile Asn Phe Met Asp Cys Pro Asn Leu Leu Cys Leu Pro Gln Gly Leu
1380 1385 1390
His Ser Leu Pro Ser Leu Lys Thr Leu Phe Val Gln Asp Cys Pro Lys
1395 1400 1405
Ile Arg Ser Leu Asp Lys Gly Asp Phe Pro Thr Ser Leu Glu Cys Leu
1410 1415 1420
Leu Val Gln Gly Cys Ser Pro Gly Leu Gln Glu Gln Ala Lys Lys Leu
1425 1430 1435 1440
Lys Gly Thr Glu Pro Asp Phe Asn Val Ile Leu Glu Leu Glu
1445 1450
<210> 4
<211> 12230
<212> DNA
<213> 小麦 (Triticum aestivum L.)
<400> 4
ccgtcagcta gatagtaccc cttgttgtat tggtgcccat tgatctcgaa gttcaccgga 60
ggagaatggc cctcaacgag cttggcaaaa acaggagagc actgcagcac gttaatgtca 120
ttgtgagttc ctggtatacc aaagaaggag tgccaaattc agaggtcctg tgtggctacc 180
gcctcaagca ccacactgca accgcctttg acgcctttgt acatcccctg ccaaccaaat 240
gggcagttct tccattttcc aatgcatgca gtcgatgctt ccaagcatcc caggaaatcc 300
tcttactgca ttctgggcta ggatccgagc agtgtcttcc gcattgggtg ttctcaagta 360
ttgtggccca aacactgcca ccactgcccg acagaacttg tagaaacact ctatgctggt 420
ggactcggcc atgcgcccat agtcgtcgag tgaatcactt ggagctccat atgcaagcat 480
cctcatcgtt gtcgtgcact tctggatgga ggtgaatcca agagcgccgg tgcaatccat 540
cttgcacttg aagtagttgt cgaactcccg gatggaattc acaatcctga ggaagagctt 600
tcggctcatc cgataacggc gccgaaatgt tctctcgcca tgaagtggag catcgacgaa 660
gtagtcggag tagagcatgc agtagccttg cagacgatgc cggttctttg ctttcacccg 720
ccccggcgcc gagccacctc gccacggctt ttcattgctc gccagcagct gggcgagggt 780
ggcgagcacc atgagatgct cttcttcctg gacgtcggcc gcggcttcct cctccagtag 840
cgcggcgagc tcttcctcct catccgagtc catcgccgag gcaggcaaaa cgccgaacac 900
cttgcgctcg gtgggcgtgt acccgccatt aaaccgcgcc ttcgcggccg gaaacggcgg 960
ccggaaacgc ccagctgctg tgggaggggc tgccgcggcg aagtgctgct attttccggc 1020
ggggaatggc tatctagcgg agtagggcgg cggccgtcgc cgggatatag ctagtggtgg 1080
ccgagggcgc ggggggtgcg aggcgagtcg ggggaagaaa accttgactt ttcccctgtc 1140
ggtgtgggcc agacgtgctt ttccctagcg ccggagcccc caacggctcc ccagcgcgcc 1200
gggttcgccc tgtgaccgcc gggcggaaaa aaggtccgag ccggcgattt tcggcgtcct 1260
gggggcgcga ctgggccgtt ttttcggcgc cggcgccgaa aaagtggcct gagggggcct 1320
gttgggggcg cggctggaga tgccattagc aacggcgacg gcaagcttct gcctcggtcg 1380
ggataagaca aaccaatatt ttatttttca ttttattttc caatgacagg tggaccctat 1440
tgggacacac atgttatgat gtaacgctac caaacgtcgc cgttacctgt tacatgttaa 1500
gcgttgccat gtcagtctaa ttgtcggacc ccatctgtca taatctttct taacataatc 1560
agatcggctg atcagtgatt tttgcaaatc aattacacaa gacgtggtgt tttctgcaac 1620
aaaagtgtag tatagtgttt tcttttgacc tagccttcaa aataatggtt ttctgcaact 1680
tactcgtcat agccctaata cagaaaaaaa tggataaccg gctcccacat ctgtgtctac 1740
agaccaatgc tagagtaaat ttgcgaggca tgccgacttg cgatgcaata attcagaccc 1800
cacatgttaa atgtagcaat gaaaaccgac gtgacgggat acacaataat tgagagcgca 1860
catggtgtgt tggtggcgca gcttcacata gacatggaca tgacacgatg cgtttttttg 1920
ttcttctctt ggcttggaaa cgcagacgaa cgagtcttta tcagcagtgg tccgctgcca 1980
ttccatccac ccaaacccaa gagcattcgc ttcccagccc caagaaaaat cgcccaaaat 2040
cccacctgcc ataagccccc aataattgac ccctgctagc gccggcgtcg gcggccggag 2100
aagatggaat cggcgattgg cgcggcaacc gggctcgtcg gcagcgtggt gaatctgctc 2160
tccaacgagc tcgtgggggc gtacgtcgcc agcaccgagc tcggcctcaa ctccgtcgag 2220
atcaagaaag atctcctgcg cgcgcaagcg ctgttgcagg aggcccagac gaggggcccc 2280
aaggacaatc acggcctgaa ggggctgctg caggagctca ccgtcaaggc cgacgaggcc 2340
gaggatgcgc tggatgagct ccactacttc atcattcagg accagctcga cggcacccgc 2400
tatgcggtgc cggacctggg cgacgacctt cgcggtcatg ctcgccacgg tcgccatgct 2460
attcgccatg tcgtcggtaa ctgccttgca tgcttttctt gttcgtcgac tatgaaagat 2520
gatgatagtg gtggtggccg tggtgctgct gctgttccta ttaacccact caatgcaacc 2580
aagactgata gtgctggtca agatggtcca gttgaaaagc tgaaatttca tagagtggcc 2640
atgtccaaga aaattaaatc agtgatggag gaactacact ccctgtgtga gcccgtatcc 2700
aaattgctct gcataactcc acaccagggt agcactgaga ctgctgtcaa cctaaatcgt 2760
cctcctacgg gatcaatcag tgcgcagcat acattgtatg gcagggcaga cattttcgag 2820
ggaaccaaag atgatataac cagtggcaga ttctataccg aaaccctttc agttcttcct 2880
gtagttggtc caggtggcat tggaaagaca actttcgccc aatacctgta taatgataaa 2940
aggatacaag agcactttgc tgttagggtc tgggtatgtg tatcgactga ttttgatgtg 3000
cttaagctca gccaacagat ccttagctgc atacaaggaa gcaactctcc aaatcaaaca 3060
acaagtttag accagctcca aatatccatc gcagagaaaa tcatgtcgaa aaggtttcta 3120
attgtgtttg atgatatgtg gaaatgtagt aatcagggct gggaagacct gctagctcct 3180
ttgaggaaag gggaaaccaa gggtaacatg gttctcgtca caactcgatt tccatccata 3240
gctgaaatgg tgaaaacaac accttcaata ccactgaaag gcctcaagtc tgatgaattc 3300
ttcagattct ttgaagcatt catatttgat gaaaaaaaac ctgaatacca aggtgaccta 3360
gcaagtattg cacagactat tgctaagaaa ttgaagggtt cccctctagc aggcaaaacc 3420
gttggtcgat tattgaggaa ggagatttct cggaaacact ggatggaagt tctcgaaaac 3480
aacaaatggc aaaaccaaga aaatgatgat ggcattatga catctctcag gattagctat 3540
gattaccttc cttcccatct gaaaaaatgt ttccaatatt ttgccttatt ccctgaagat 3600
tacaagttta agaatttaga gattacatat ttctggattg caataggaat catagaaaaa 3660
gatgaggatt acatggaaca actagtggaa aatggttttc ttgtgaagga gaatgctcat 3720
tggtcaagcc aacagtacta tgtactgcat gatttactac atgaactgtc ccgatgtgtt 3780
tcctcacaag aatgcctcaa tatatatgat agtaaaaggt ttaaagctga tgccgttcca 3840
aaatctattc atcatttgtc cattaccatt gaagacaact atgatgtgac ttttactgaa 3900
gaaatgatta agttgaagag caaggtagac attgcaaatt tgcgggcttt gatgattttt 3960
agaagatatg gagaaacaat tggtgagatt ttaaatgata ctttcaagga tgtagagggt 4020
cttcgtgtcc tatttatagt tgtgcagtcc ccaaattcgt tcccatgtga tttctcaaaa 4080
cttatccacc tccgatacct aaaacttgga acatgggacg aagtgacttt gcctagcaca 4140
ctgtccagac tttatcacct aaaactattg gacctgagtg gttggcatgg tagttatgaa 4200
ttgcctagag acactagccg ccttatcagt ttatgccatt ttattgctgg aaaaaaattc 4260
cattgcaata ttgctgaggt tggaaagatg aagtgtttaa aggagcttaa agaattccat 4320
gtcatgaaag agagtgttgg atatgaactg tgtgagttgg ggcagttaac agatcttgga 4380
ggagaactct gtatacgtaa tcttgaaaag gtgacaacca aggatgaggc tatggaagcc 4440
aagcttgagt caaaaggtga tttgaaaggg ttggggttgg tttggggatc atctgatgcc 4500
gttgatggta caacaaaatt agttgatgtt cttgatgctc ttcagccaca ccctaacctt 4560
gcagcacttg gcatcatagg tcatggtggt acaagtggtc ctagttggtt gtgtggtgac 4620
attaactttg agttgttgga gtatcttcac ctagagggag tatcttgggt cactcttcct 4680
ccttttgagt acttgccgca gctcacatca ctcacattga gcaacattta tgaagttagt 4740
gagatccgcc atgtctttgg tggcgctaca aacaaaagtt ttttgcagtt gaagctgatt 4800
gtccttgatt cattgcctga acttattgag tgggttgggg tgcctaatga ttcgtttgca 4860
aggcttgaaa ctatttattg caggtcatgc cccaatctcc gtacgctacc cttcctgcgg 4920
gagtactctg ctggctgtta taaccatctg tcgaaacttg aaatttttga ttgcccaaag 4980
ttgtctctcc ctcccatgcc tcacagttcc acactgatat tttgtcatgt gagacagagt 5040
tccgcagagc tgacttattc tgcagggaac aacatgtata ttaatgggta taatagtggt 5100
gaggtggact tgcataacct gcgaaaagta gagaaaatga gtattacaga tgtatcacac 5160
atttcaatga cagagctgaa caatctgaaa tccctaagaa gactagatgt cagaagatgc 5220
agcttcacgt gccatggact ccaggacctc gtgtgcctcc aatatcttac agtatacaac 5280
tgtggcgtct tctttccatg gcctaccgaa gcagctcaca ccatcaagct tttccctgct 5340
tccctcgaga cacttgagat tgaaggagag tcaggcatgc agtcaatggc tctgctcagc 5400
aatctgaaat ccctaaggag actagatgtc agaagatgca gcatcacgtg ccatggactg 5460
caggacctcg catgcctcca atcacttaca gtacaagact gtggcaactt ctttccatgg 5520
cctaccgaag cagctcacac cgtcaatcct ttccctcaca ccatcaagcc tttccctgct 5580
tccctcgaga cacttgagat tgaaggagag ttaggcatgc agccagtagc tttgctcagc 5640
aatctgaaat ccctaagaag actagatgtc agaagatgca gcatcacgtg ccatggactg 5700
caggacctcg cgtgcctcca atcagttaca gtaaaagaat gtggcaactt ctttctgcgg 5760
cccatcgaag cagctcacac catcatcaat cctttgcctg cttccctcga agaacttgag 5820
attgatggag agtcaagcat ggagtcaatg gctctgctct cgaacctcac ttgtctcacc 5880
gatcttatac tggtgaactc tgagaattta acagtggatg gttttaatcc tctcatcaca 5940
gtcaacctca acagtttgaa gatgtacaac agaggaaatt gtcttagccg ttctatatca 6000
gcggatttgc tctcggaatt ggcggtagca aggaccaacc tgttaccgcc tgcaggctcc 6060
tatcaactac gagaacttac agtggattgc atctcagcag tgcttgttgc tcccatctgt 6120
agcctccttg ccgctaccct ccacacttta gaattctggt atgatcaccg ggcagaaagc 6180
ttcacggaag aggaagagag agctcttcag ctcctcacat ccctctgctt catcaatttt 6240
atggattgcc cgaatcttct gtgccttcct caaggcctac acagccttcc ttctcttaag 6300
acactatttg tccaagattg tcctaaaatc cgatctctgg acaaggggga ctttcccact 6360
tcactggagt gtctattagt gcagggttgc agtcccgggc tacaagggca agccaagaaa 6420
ttaaaaggaa cagaaccaga ttttaatgtg atacttgaac ttgagtagct caaggtaatt 6480
aacagaactc ctgctatcta tgtattcctt cttttcctac gaggatcaca attctctact 6540
gttttaaagt actaactagt aacttgttaa atctgacagg tggattgccg acacagctgc 6600
tcagtcgttt tagctgcatt ctgtacataa gtcatgaaac ctctccgcct accattgtgt 6660
acgtatatcg gtgcaagtga tcatgttgca tctccactta atgtggtcaa tccagtcgcc 6720
ggttaattag tgagtaagtg cacgcaaaac cagcgttttg ggtaccgagc gaaatttcgc 6780
gatctcgcgc ggttaccgcg tttaccgttg cccctcgaga aacactcatc ccgagcaaaa 6840
tatctcgaat attttgaaat ttttgaattc aaacgttcat tgtctagtta aataggcagc 6900
atctcttata tacgaccaga actggtagtg gcctagcggc aaaggcatca tttccgtagc 6960
agcaagtcgc gggttcgagt ctcgcctcgc gcaggtttat ttttttcgaa aaaaagcaaa 7020
agtttaaaaa tactgcaatg accagcagtc gaacttgcga cgtgcgtgca agctacacta 7080
accactacgc caggaagtgt ttggcgatac ttatattagt aaaatgccta atatatcgac 7140
gtcaaaaata tttgaattca aattttgatt ttaaatttcg cccgagtttt tttcagaatt 7200
tcgcggttac cgtggtaacc gcgaattccg gtgaccctcg agaaaaaagg acagcttggg 7260
atccaaaacc tggcgcaaaa ccaacttgta ctttacgtgt ttctatgttt tatgatgtac 7320
tgctacctga ctatatacat cttgattttc ttaacagctg cggtacctgt ttttgtgtga 7380
ttttatcaca aactgcagca tacttgccgc cgctatttga gactgagcag gagtactaca 7440
gtatgcatgt ttagcgggta gttgcatggt aaattttgaa ggagggccaa gaggatctgg 7500
aggtgaatgc atgcctatat atcaccaaga aagtggattc tattttgctg agcatttaaa 7560
tgttggtttt ctaaccttcc gttcactact ctgagagtgg attggcaatc agaacttaat 7620
tggtagctag agtacatcca aacaataaaa gaagcactaa caaaagtatc atcaagtagc 7680
tgtgcagttc taccaggacc atttcaaagg ttagtccgcc actcttttcc tctttctttt 7740
ccccttctta ttctaagttt atcatctcag tggttcttgt cactcccata tgcagtttac 7800
tcgccgctaa ctaccctcta tacattgtaa ttttctattg atgatcgtgt ggcaagttca 7860
cctgttgacc aacggctcaa agttatgaca gagcaataat atcctttttt tagtcttaca 7920
ctcattgctt ggattctgca gaacaattac gaccaacagt ggattcagaa gttggcatta 7980
agttcaaagc tcggttgttc catcgaaaca acgagggtaa caatggaagc atctgtttgg 8040
agatctatga tggagagctg agtgcaaagg aagtagatta cttcccgtgg tgatccatgc 8100
gtataaacaa gttacatttc gtatttactt gtggcataat tcaggttttc agttcgtgta 8160
tgtggtggca gcatcaccgt gtagcggtgt cggtgtgtca accctcagaa tttcagtttc 8220
acttttacca ggttcagtca tgttgaccaa cgactcaaag taaggactgg gcaaactata 8280
tccatgatga tgaattccat gaccttgctc atcattttct gagtattgta ttcattactc 8340
gaattttgca gccaacagtg aattcagaag ccggtattta gaagctaaat tggttgttac 8400
atccaaacaa gaaaagtaac ctgcacagag cctctgtctt gatggctgca tggagagctg 8460
cacaggtgtg catgatctac catttgaaag agcctctatt catggtgcat gcattggctt 8520
ccccagtgag ttcacatctt gtctttcaca aaattagcta tttgggtgga gagcaattta 8580
tcctctcgta agttcatcta tataagttgt tttagatgat aagagtgctt gttacttatt 8640
gtatatggtg taggcagcag accttttctc catttttgtt ggtttaaaca gtttttttct 8700
tctgaatata tgtcgatggt aattaaaaca agttttcatg cgatattgga acaggttgag 8760
tatttgttct ttcagtccaa ttccactgta tccccaacaa ccgccaacta catcgtgtgc 8820
aggcagattt gttatctggt gatgtttcag tgtcaacctc caaagtttca gttccagttt 8880
agcaggttct ccggcctgtt gaccaaacca ctcaaagtaa tgactgagca attatttcca 8940
taaataatca attcaattat cttgttcatc agtcatcact ctatgatgtg tctctgcata 9000
acagttcaca cgcaacagtg aatttcgaga gctgcgcatt cagaagttca gctggctcgg 9060
ttgttccatc caaacaatat aagaaaccag ccgttatccc ttcccctggt gtctcgaatg 9120
gagagctaca aggttttgcg taatagacga gttccaaggt aattgagtat aaggccgcgc 9180
ttggattgag tgtaacttaa tacaggtgtg tgaattttac aagtgtattt tacagggcta 9240
gactaatttg cggccggaaa agatacagac gtacgggggc gtgtattttt tacaggtgta 9300
tcggaaggaa acacggtaat ccaaacgcgg cctaaggtga gtggcaacac acgcgttttc 9360
aaagaattaa gttagtaccg tacacattta catgttcatg ttttcggtgc tgactcgttc 9420
cattcagtgc aggtcagcag gtgtggtgtg tctcactcct ttggtgagca ctgaagcctg 9480
aggaagaatc tatggttcaa gatatgaagc agggtgcaac tgctctggct gacgcactga 9540
ccaaatgcgt cttgagacca aatgcgtctg tggtcttcca caggcgacca catcctctgg 9600
ccacgcagcc gtctttcatc tctgctcagg ccagacttgg ccccgccgcc acactctcca 9660
ccctctccct cttcatcgtc caccttgtta gttagtgtcc atagtcgctg ggttcagcta 9720
gcaaactctg taacgtgcac atacgtgaga tgcctatcta caccactcct tcaccgctca 9780
ccacgctgga tcgatcgaac aggtacaaca atggccagaa cttggcacac ataccagata 9840
atgatgatgc tgaattgatg atgatgatgt ccgtgttgga tgatgatgca atgcatgttt 9900
cttgtatgtt gaccattaca ttgattcttc gtactgtcaa cttacctaga gggcgtatcc 9960
ttgattcaaa attaaaagga actcacaagg ctggaacata ctaatgcagg tcagttccca 10020
agcaatggct tgagaagaaa aatgtatgta ttatatattt agaatggtga gtgtgcatgt 10080
atggccgaga aaacaaaaca atatttggtg taccggatgg ggcattcttc ttagttgagt 10140
ttgaagaagc tgagaaggcc aatgacacgc ttctccggaa atataattgg gcatcgcact 10200
ctgattgtcc actcttgccc tttatataag gtgtatttgt tttttgataa aattccaaaa 10260
tgtaaggtgc atttcatcta aatcctcgta attccctttt tatccctcta gaaagagaag 10320
agtatctccc acctgattga ttgtatctct ccttgtatga aaataaggaa actatctccc 10380
tctcgattgc atgcatctct tactttccta gactcgctga tttacttgcc actaaccaac 10440
aaatttttca agggtaattt tgttctaaca cctttataat tatgcgtctt ggtcaccatg 10500
ccaaaaataa tacaccttac acaaagaaac ggagggagta tttgctaaga gggtgtcaag 10560
aatgagaaga taacttctcc ggaaaaaaat gaaaaaaaaa taaacacagt gagaatagac 10620
taggccatct actgagtcac gggctgggcc ggtcctctca caaacagtta acggcccttt 10680
tggtaccatc gtttggtgtt gccctcggct gggcctccag tgggcaggac gacgaggtcg 10740
gacggacgaa cgtggtcggc gcggggagct gctcgtcgcc ggcgacaggt ttgcccctcc 10800
gccgactcga gtccaccacg caccgtgctc ccctcatgta actcacatcg ggtcccccta 10860
ccttgctctg tactgcagct ccggtgggcg ccaatggaga tcccaacgga tccaatcaat 10920
ggaggctcga gggagggaga ttttctccag ctccgtcagc catcgccgtc gccgtgctcg 10980
acactcacat ggacccaatc caacggtaaa atccatcatt ttcttctact ttcctaggaa 11040
ttccaccacc accaccatac ccgcattcac atggcactca cgactgcttg cttggtgctt 11100
gatcagtgtt ctggatggag ggaatctgga ccaatccgtt ggctgcacaa cacaacaatt 11160
aatagttgaa cagcgatagc ctcgactgcg ctagggaagg caccggtcag tcttctaatg 11220
caatcattct aaatgctcat cacctccgat ctttgtgttt ggattgtgtc aacaaagctg 11280
ctttcttgat gggcaccatg acaaccatgt tcctagtttc gtaccggctt ctcacatagt 11340
aatagtacat catcactaaa tcgatccaag cacgagtaca gttggctgta tgtaaaaaat 11400
cccacaagag agtatttcta accacgctca cataaaccga catggccaga gtcatgatac 11460
cgacataccg tgctgagcat ctctgtgcag gagggaatta aggtctcgac attcacacgg 11520
acccaaggcc aagagaaaag tagcatcggc catcattttc tttctaggaa tccatcgcta 11580
gctactacca cattcacatg gaacagacga attgcttggt agtcactgtt ttggatggat 11640
ccggaatagc ttcttggcta gcaacgaagc tggttacttg gccacacaga aaatatcttt 11700
gattgggacg gggatgaaga ttgtttggct ctttactaga tgagaatcct ttggctgcag 11760
catacaacaa taattgaacg gtgacagcat cgactttggc tagtcaccag tcaatctggc 11820
aaagtttttc tatgaactga tcagctcctc gtgtttggat gtatcaaagt tgttgctttc 11880
ttgatgggag ccatgacaac aatgtcacaa ggttcatgca tcgcggcttc ctagatgaca 11940
catgaataaa cgatcaaagc actagtgcag tccgctataa aagggatcaa ccagaccctc 12000
ttctcaactg cgtgcacata aacaacacac agcctttggt accatatcca ccacagcaca 12060
aactagacac tttttccctg tctgccccag ccccatagga gatggaggcg gcgattggcc 12120
tcgcaagcgg gctcgtcgac agcgtgctga ctctgctttc caatgagctc gtgggggcgt 12180
atgtagccag ctctgagcgt gacctgatgt tcacgcaagg actattgcac 12230
<210> 5
<211> 4365
<212> DNA
<213> 小麦 (Triticum aestivum L.)
<400> 5
atggaatcgg cgattggcgc ggcaaccggg ctcgtcggca gcgtggtgaa tctgctctcc 60
aacgagctcg tgggggcgta cgtcgccagc accgagctcg gcctcaactc cgtcgagatc 120
aagaaagatc tcctgcgcgc gcaagcgctg ttgcaggagg cccagacgag gggccccaag 180
gacaatcacg gcctgaaggg gctgctgcag gagctcaccg tcaaggccga cgaggccgag 240
gatgcgctgg atgagctcca ctacttcatc attcaggacc agctcgacgg cacccgctat 300
gcggtgccgg acctgggcga cgaccttcgc ggtcatgctc gccacggtcg ccatgctatt 360
cgccatgtcg tcggtaactg ccttgcatgc ttttcttgtt cgtcgactat gaaagatgat 420
gatagtggtg gtggccgtgg tgctgctgct gttcctatta acccactcaa tgcaaccaag 480
actgatagtg ctggtcaaga tggtccagtt gaaaagctga aatttcatag agtggccatg 540
tccaagaaaa ttaaatcagt gatggaggaa ctacactccc tgtgtgagcc cgtatccaaa 600
ttgctctgca taactccaca ccagggtagc actgagactg ctgtcaacct aaatcgtcct 660
cctacgggat caatcagtgc gcagcataca ttgtatggca gggcagacat tttcgaggga 720
accaaagatg atataaccag tggcagattc tataccgaaa ccctttcagt tcttcctgta 780
gttggtccag gtggcattgg aaagacaact ttcgcccaat acctgtataa tgataaaagg 840
atacaagagc actttgctgt tagggtctgg gtatgtgtat cgactgattt tgatgtgctt 900
aagctcagcc aacagatcct tagctgcata caaggaagca actctccaaa tcaaacaaca 960
agtttagacc agctccaaat atccatcgca gagaaaatca tgtcgaaaag gtttctaatt 1020
gtgtttgatg atatgtggaa atgtagtaat cagggctggg aagacctgct agctcctttg 1080
aggaaagggg aaaccaaggg taacatggtt ctcgtcacaa ctcgatttcc atccatagct 1140
gaaatggtga aaacaacacc ttcaatacca ctgaaaggcc tcaagtctga tgaattcttc 1200
agattctttg aagcattcat atttgatgaa aaaaaacctg aataccaagg tgacctagca 1260
agtattgcac agactattgc taagaaattg aagggttccc ctctagcagg caaaaccgtt 1320
ggtcgattat tgaggaagga gatttctcgg aaacactgga tggaagttct cgaaaacaac 1380
aaatggcaaa accaagaaaa tgatgatggc attatgacat ctctcaggat tagctatgat 1440
taccttcctt cccatctgaa aaaatgtttc caatattttg ccttattccc tgaagattac 1500
aagtttaaga atttagagat tacatatttc tggattgcaa taggaatcat agaaaaagat 1560
gaggattaca tggaacaact agtggaaaat ggttttcttg tgaaggagaa tgctcattgg 1620
tcaagccaac agtactatgt actgcatgat ttactacatg aactgtcccg atgtgtttcc 1680
tcacaagaat gcctcaatat atatgatagt aaaaggttta aagctgatgc cgttccaaaa 1740
tctattcatc atttgtccat taccattgaa gacaactatg atgtgacttt tactgaagaa 1800
atgattaagt tgaagagcaa ggtagacatt gcaaatttgc gggctttgat gatttttaga 1860
agatatggag aaacaattgg tgagatttta aatgatactt tcaaggatgt agagggtctt 1920
cgtgtcctat ttatagttgt gcagtcccca aattcgttcc catgtgattt ctcaaaactt 1980
atccacctcc gatacctaaa acttggaaca tgggacgaag tgactttgcc tagcacactg 2040
tccagacttt atcacctaaa actattggac ctgagtggtt ggcatggtag ttatgaattg 2100
cctagagaca ctagccgcct tatcagttta tgccatttta ttgctggaaa aaaattccat 2160
tgcaatattg ctgaggttgg aaagatgaag tgtttaaagg agcttaaaga attccatgtc 2220
atgaaagaga gtgttggata tgaactgtgt gagttggggc agttaacaga tcttggagga 2280
gaactctgta tacgtaatct tgaaaaggtg acaaccaagg atgaggctat ggaagccaag 2340
cttgagtcaa aaggtgattt gaaagggttg gggttggttt ggggatcatc tgatgccgtt 2400
gatggtacaa caaaattagt tgatgttctt gatgctcttc agccacaccc taaccttgca 2460
gcacttggca tcataggtca tggtggtaca agtggtccta gttggttgtg tggtgacatt 2520
aactttgagt tgttggagta tcttcaccta gagggagtat cttgggtcac tcttcctcct 2580
tttgagtact tgccgcagct cacatcactc acattgagca acatttatga agttagtgag 2640
atccgccatg tctttggtgg cgctacaaac aaaagttttt tgcagttgaa gctgattgtc 2700
cttgattcat tgcctgaact tattgagtgg gttggggtgc ctaatgattc gtttgcaagg 2760
cttgaaacta tttattgcag gtcatgcccc aatctccgta cgctaccctt cctgcgggag 2820
tactctgctg gctgttataa ccatctgtcg aaacttgaaa tttttgattg cccaaagttg 2880
tctctccctc ccatgcctca cagttccaca ctgatatttt gtcatgtgag acagagttcc 2940
gcagagctga cttattctgc agggaacaac atgtatatta atgggtataa tagtggtgag 3000
gtggacttgc ataacctgcg aaaagtagag aaaatgagta ttacagatgt atcacacatt 3060
tcaatgacag agctgaacaa tctgaaatcc ctaagaagac tagatgtcag aagatgcagc 3120
ttcacgtgcc atggactcca ggacctcgtg tgcctccaat atcttacagt atacaactgt 3180
ggcgtcttct ttccatggcc taccgaagca gctcacacca tcaagctttt ccctgcttcc 3240
ctcgagacac ttgagattga aggagagtca ggcatgcagt caatggctct gctcagcaat 3300
ctgaaatccc taaggagact agatgtcaga agatgcagca tcacgtgcca tggactgcag 3360
gacctcgcat gcctccaatc acttacagta caagactgtg gcaacttctt tccatggcct 3420
accgaagcag ctcacaccgt caatcctttc cctcacacca tcaagccttt ccctgcttcc 3480
ctcgagacac ttgagattga aggagagtta ggcatgcagc cagtagcttt gctcagcaat 3540
ctgaaatccc taagaagact agatgtcaga agatgcagca tcacgtgcca tggactgcag 3600
gacctcgcgt gcctccaatc agttacagta aaagaatgtg gcaacttctt tctgcggccc 3660
atcgaagcag ctcacaccat catcaatcct ttgcctgctt ccctcgaaga acttgagatt 3720
gatggagagt caagcatgga gtcaatggct ctgctctcga acctcacttg tctcaccgat 3780
cttatactgg tgaactctga gaatttaaca gtggatggtt ttaatcctct catcacagtc 3840
aacctcaaca gtttgaagat gtacaacaga ggaaattgtc ttagccgttc tatatcagcg 3900
gatttgctct cggaattggc ggtagcaagg accaacctgt taccgcctgc aggctcctat 3960
caactacgag aacttacagt ggattgcatc tcagcagtgc ttgttgctcc catctgtagc 4020
ctccttgccg ctaccctcca cactttagaa ttctggtatg atcaccgggc agaaagcttc 4080
acggaagagg aagagagagc tcttcagctc ctcacatccc tctgcttcat caattttatg 4140
gattgcccga atcttctgtg ccttcctcaa ggcctacaca gccttccttc tcttaagaca 4200
ctatttgtcc aagattgtcc taaaatccga tctctggaca agggggactt tcccacttca 4260
ctggagtgtc tattagtgca gggttgcagt cccgggctac aagggcaagc caagaaatta 4320
aaaggaacag aaccagattt taatgtgata cttgaacttg agtag 4365
<210> 6
<211> 1454
<212> PRT
<213> 小麦 (Triticum aestivum L.)
<400> 6
Met Glu Ser Ala Ile Gly Ala Ala Thr Gly Leu Val Gly Ser Val Val
1 5 10 15
Asn Leu Leu Ser Asn Glu Leu Val Gly Ala Tyr Val Ala Ser Thr Glu
20 25 30
Leu Gly Leu Asn Ser Val Glu Ile Lys Lys Asp Leu Leu Arg Ala Gln
35 40 45
Ala Leu Leu Gln Glu Ala Gln Thr Arg Gly Pro Lys Asp Asn His Gly
50 55 60
Leu Lys Gly Leu Leu Gln Glu Leu Thr Val Lys Ala Asp Glu Ala Glu
65 70 75 80
Asp Ala Leu Asp Glu Leu His Tyr Phe Ile Ile Gln Asp Gln Leu Asp
85 90 95
Gly Thr Arg Tyr Ala Val Pro Asp Leu Gly Asp Asp Leu Arg Gly His
100 105 110
Ala Arg His Gly Arg His Ala Ile Arg His Val Val Gly Asn Cys Leu
115 120 125
Ala Cys Phe Ser Cys Ser Ser Thr Met Lys Asp Asp Asp Ser Gly Gly
130 135 140
Gly Arg Gly Ala Ala Ala Val Pro Ile Asn Pro Leu Asn Ala Thr Lys
145 150 155 160
Thr Asp Ser Ala Gly Gln Asp Gly Pro Val Glu Lys Leu Lys Phe His
165 170 175
Arg Val Ala Met Ser Lys Lys Ile Lys Ser Val Met Glu Glu Leu His
180 185 190
Ser Leu Cys Glu Pro Val Ser Lys Leu Leu Cys Ile Thr Pro His Gln
195 200 205
Gly Ser Thr Glu Thr Ala Val Asn Leu Asn Arg Pro Pro Thr Gly Ser
210 215 220
Ile Ser Ala Gln His Thr Leu Tyr Gly Arg Ala Asp Ile Phe Glu Gly
225 230 235 240
Thr Lys Asp Asp Ile Thr Ser Gly Arg Phe Tyr Thr Glu Thr Leu Ser
245 250 255
Val Leu Pro Val Val Gly Pro Gly Gly Ile Gly Lys Thr Thr Phe Ala
260 265 270
Gln Tyr Leu Tyr Asn Asp Lys Arg Ile Gln Glu His Phe Ala Val Arg
275 280 285
Val Trp Val Cys Val Ser Thr Asp Phe Asp Val Leu Lys Leu Ser Gln
290 295 300
Gln Ile Leu Ser Cys Ile Gln Gly Ser Asn Ser Pro Asn Gln Thr Thr
305 310 315 320
Ser Leu Asp Gln Leu Gln Ile Ser Ile Ala Glu Lys Ile Met Ser Lys
325 330 335
Arg Phe Leu Ile Val Phe Asp Asp Met Trp Lys Cys Ser Asn Gln Gly
340 345 350
Trp Glu Asp Leu Leu Ala Pro Leu Arg Lys Gly Glu Thr Lys Gly Asn
355 360 365
Met Val Leu Val Thr Thr Arg Phe Pro Ser Ile Ala Glu Met Val Lys
370 375 380
Thr Thr Pro Ser Ile Pro Leu Lys Gly Leu Lys Ser Asp Glu Phe Phe
385 390 395 400
Arg Phe Phe Glu Ala Phe Ile Phe Asp Glu Lys Lys Pro Glu Tyr Gln
405 410 415
Gly Asp Leu Ala Ser Ile Ala Gln Thr Ile Ala Lys Lys Leu Lys Gly
420 425 430
Ser Pro Leu Ala Gly Lys Thr Val Gly Arg Leu Leu Arg Lys Glu Ile
435 440 445
Ser Arg Lys His Trp Met Glu Val Leu Glu Asn Asn Lys Trp Gln Asn
450 455 460
Gln Glu Asn Asp Asp Gly Ile Met Thr Ser Leu Arg Ile Ser Tyr Asp
465 470 475 480
Tyr Leu Pro Ser His Leu Lys Lys Cys Phe Gln Tyr Phe Ala Leu Phe
485 490 495
Pro Glu Asp Tyr Lys Phe Lys Asn Leu Glu Ile Thr Tyr Phe Trp Ile
500 505 510
Ala Ile Gly Ile Ile Glu Lys Asp Glu Asp Tyr Met Glu Gln Leu Val
515 520 525
Glu Asn Gly Phe Leu Val Lys Glu Asn Ala His Trp Ser Ser Gln Gln
530 535 540
Tyr Tyr Val Leu His Asp Leu Leu His Glu Leu Ser Arg Cys Val Ser
545 550 555 560
Ser Gln Glu Cys Leu Asn Ile Tyr Asp Ser Lys Arg Phe Lys Ala Asp
565 570 575
Ala Val Pro Lys Ser Ile His His Leu Ser Ile Thr Ile Glu Asp Asn
580 585 590
Tyr Asp Val Thr Phe Thr Glu Glu Met Ile Lys Leu Lys Ser Lys Val
595 600 605
Asp Ile Ala Asn Leu Arg Ala Leu Met Ile Phe Arg Arg Tyr Gly Glu
610 615 620
Thr Ile Gly Glu Ile Leu Asn Asp Thr Phe Lys Asp Val Glu Gly Leu
625 630 635 640
Arg Val Leu Phe Ile Val Val Gln Ser Pro Asn Ser Phe Pro Cys Asp
645 650 655
Phe Ser Lys Leu Ile His Leu Arg Tyr Leu Lys Leu Gly Thr Trp Asp
660 665 670
Glu Val Thr Leu Pro Ser Thr Leu Ser Arg Leu Tyr His Leu Lys Leu
675 680 685
Leu Asp Leu Ser Gly Trp His Gly Ser Tyr Glu Leu Pro Arg Asp Thr
690 695 700
Ser Arg Leu Ile Ser Leu Cys His Phe Ile Ala Gly Lys Lys Phe His
705 710 715 720
Cys Asn Ile Ala Glu Val Gly Lys Met Lys Cys Leu Lys Glu Leu Lys
725 730 735
Glu Phe His Val Met Lys Glu Ser Val Gly Tyr Glu Leu Cys Glu Leu
740 745 750
Gly Gln Leu Thr Asp Leu Gly Gly Glu Leu Cys Ile Arg Asn Leu Glu
755 760 765
Lys Val Thr Thr Lys Asp Glu Ala Met Glu Ala Lys Leu Glu Ser Lys
770 775 780
Gly Asp Leu Lys Gly Leu Gly Leu Val Trp Gly Ser Ser Asp Ala Val
785 790 795 800
Asp Gly Thr Thr Lys Leu Val Asp Val Leu Asp Ala Leu Gln Pro His
805 810 815
Pro Asn Leu Ala Ala Leu Gly Ile Ile Gly His Gly Gly Thr Ser Gly
820 825 830
Pro Ser Trp Leu Cys Gly Asp Ile Asn Phe Glu Leu Leu Glu Tyr Leu
835 840 845
His Leu Glu Gly Val Ser Trp Val Thr Leu Pro Pro Phe Glu Tyr Leu
850 855 860
Pro Gln Leu Thr Ser Leu Thr Leu Ser Asn Ile Tyr Glu Val Ser Glu
865 870 875 880
Ile Arg His Val Phe Gly Gly Ala Thr Asn Lys Ser Phe Leu Gln Leu
885 890 895
Lys Leu Ile Val Leu Asp Ser Leu Pro Glu Leu Ile Glu Trp Val Gly
900 905 910
Val Pro Asn Asp Ser Phe Ala Arg Leu Glu Thr Ile Tyr Cys Arg Ser
915 920 925
Cys Pro Asn Leu Arg Thr Leu Pro Phe Leu Arg Glu Tyr Ser Ala Gly
930 935 940
Cys Tyr Asn His Leu Ser Lys Leu Glu Ile Phe Asp Cys Pro Lys Leu
945 950 955 960
Ser Leu Pro Pro Met Pro His Ser Ser Thr Leu Ile Phe Cys His Val
965 970 975
Arg Gln Ser Ser Ala Glu Leu Thr Tyr Ser Ala Gly Asn Asn Met Tyr
980 985 990
Ile Asn Gly Tyr Asn Ser Gly Glu Val Asp Leu His Asn Leu Arg Lys
995 1000 1005
Val Glu Lys Met Ser Ile Thr Asp Val Ser His Ile Ser Met Thr Glu
1010 1015 1020
Leu Asn Asn Leu Lys Ser Leu Arg Arg Leu Asp Val Arg Arg Cys Ser
1025 1030 1035 1040
Phe Thr Cys His Gly Leu Gln Asp Leu Val Cys Leu Gln Tyr Leu Thr
1045 1050 1055
Val Tyr Asn Cys Gly Val Phe Phe Pro Trp Pro Thr Glu Ala Ala His
1060 1065 1070
Thr Ile Lys Leu Phe Pro Ala Ser Leu Glu Thr Leu Glu Ile Glu Gly
1075 1080 1085
Glu Ser Gly Met Gln Ser Met Ala Leu Leu Ser Asn Leu Lys Ser Leu
1090 1095 1100
Arg Arg Leu Asp Val Arg Arg Cys Ser Ile Thr Cys His Gly Leu Gln
1105 1110 1115 1120
Asp Leu Ala Cys Leu Gln Ser Leu Thr Val Gln Asp Cys Gly Asn Phe
1125 1130 1135
Phe Pro Trp Pro Thr Glu Ala Ala His Thr Val Asn Pro Phe Pro His
1140 1145 1150
Thr Ile Lys Pro Phe Pro Ala Ser Leu Glu Thr Leu Glu Ile Glu Gly
1155 1160 1165
Glu Leu Gly Met Gln Pro Val Ala Leu Leu Ser Asn Leu Lys Ser Leu
1170 1175 1180
Arg Arg Leu Asp Val Arg Arg Cys Ser Ile Thr Cys His Gly Leu Gln
1185 1190 1195 1200
Asp Leu Ala Cys Leu Gln Ser Val Thr Val Lys Glu Cys Gly Asn Phe
1205 1210 1215
Phe Leu Arg Pro Ile Glu Ala Ala His Thr Ile Ile Asn Pro Leu Pro
1220 1225 1230
Ala Ser Leu Glu Glu Leu Glu Ile Asp Gly Glu Ser Ser Met Glu Ser
1235 1240 1245
Met Ala Leu Leu Ser Asn Leu Thr Cys Leu Thr Asp Leu Ile Leu Val
1250 1255 1260
Asn Ser Glu Asn Leu Thr Val Asp Gly Phe Asn Pro Leu Ile Thr Val
1265 1270 1275 1280
Asn Leu Asn Ser Leu Lys Met Tyr Asn Arg Gly Asn Cys Leu Ser Arg
1285 1290 1295
Ser Ile Ser Ala Asp Leu Leu Ser Glu Leu Ala Val Ala Arg Thr Asn
1300 1305 1310
Leu Leu Pro Pro Ala Gly Ser Tyr Gln Leu Arg Glu Leu Thr Val Asp
1315 1320 1325
Cys Ile Ser Ala Val Leu Val Ala Pro Ile Cys Ser Leu Leu Ala Ala
1330 1335 1340
Thr Leu His Thr Leu Glu Phe Trp Tyr Asp His Arg Ala Glu Ser Phe
1345 1350 1355 1360
Thr Glu Glu Glu Glu Arg Ala Leu Gln Leu Leu Thr Ser Leu Cys Phe
1365 1370 1375
Ile Asn Phe Met Asp Cys Pro Asn Leu Leu Cys Leu Pro Gln Gly Leu
1380 1385 1390
His Ser Leu Pro Ser Leu Lys Thr Leu Phe Val Gln Asp Cys Pro Lys
1395 1400 1405
Ile Arg Ser Leu Asp Lys Gly Asp Phe Pro Thr Ser Leu Glu Cys Leu
1410 1415 1420
Leu Val Gln Gly Cys Ser Pro Gly Leu Gln Gly Gln Ala Lys Lys Leu
1425 1430 1435 1440
Lys Gly Thr Glu Pro Asp Phe Asn Val Ile Leu Glu Leu Glu
1445 1450
Claims (5)
1.一种蛋白质或其相关生物材料的下述P1-P5中的任一种应用:
P1、一种蛋白质或其相关生物材料在调控植物抗病性中的应用;
P2、一种蛋白质或其相关生物材料在制备提高植物抗病性的产品中的应用;
P3、一种蛋白质或其相关生物材料在培育抗病植物中的应用;
P4、一种蛋白质或其相关生物材料在制备植物抗病产品中的应用;
所述蛋白质为如下A1)或A2):
A1)其氨基酸序列如序列表中SEQ ID NO:6所示;
A2)在A1)的N末端或/和C末端连接蛋白标签得到的融合蛋白质;
所述相关生物材料为下述B1)至B7)中的任一种:
B1)编码所述蛋白质的核酸分子;
B2)含有B1)所述核酸分子的表达盒;
B3)含有B1)所述核酸分子的重组载体、或含有B2)所述表达盒的重组载体;
B4)含有B1)所述核酸分子的重组微生物、或含有B2)所述表达盒的重组微生物、或含有B3)所述重组载体的重组微生物;
B5)含有B1)所述核酸分子的转基因植物细胞系、或含有B2)所述表达盒的转基因植物细胞系;
B6)含有B1)所述核酸分子的转基因植物组织、或含有B2)所述表达盒的转基因植物组织;
B7)含有B1)所述核酸分子的转基因植物器官、或含有B2)所述表达盒的转基因植物器官;
所述抗病为抗白粉病;
所述植物为小麦。
2.根据权利要求1所述的应用,其特征在于:B1)所述核酸分子为如下b1)或b2):
b1)序列表中SEQ ID NO:4所示DNA分子;
b2)序列表中SEQ ID NO:5所示DNA分子。
3.一种培育抗病植物的方法,包括提高目的植物中蛋白质或其编码基因的表达量,得到抗病植物;所述抗病植物的抗病性高于所述目的植物的抗病性;
所述蛋白质为如下A1)或A2):
A1)其氨基酸序列如序列表中SEQ ID NO:6所示;
A2)在A1)的N末端或/和C末端连接蛋白标签得到的融合蛋白质;
所述抗病性为抗白粉病;
所述植物为小麦。
4.根据权利要求3所述的方法,其特征在于:所述提高目的植物中蛋白质或其编码基因的表达量是通过将所述蛋白质的编码基因导入所述目的植物实现的。
5.一种降低植物抗病性的方法,包括抑制或降低目的植物中蛋白质的编码基因的表达;
所述蛋白质为如下A1)或A2):
A1)其氨基酸序列如序列表中SEQ ID NO:6所示;
A2)在A1)的N末端或/和C末端连接蛋白标签得到的融合蛋白质;
所述抗病性为抗白粉病;
所述植物为小麦。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011350857.8A CN114539371B (zh) | 2020-11-26 | 2020-11-26 | 小麦白粉病抗性相关蛋白MlWE18和MlIW172及其应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011350857.8A CN114539371B (zh) | 2020-11-26 | 2020-11-26 | 小麦白粉病抗性相关蛋白MlWE18和MlIW172及其应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114539371A CN114539371A (zh) | 2022-05-27 |
CN114539371B true CN114539371B (zh) | 2023-11-24 |
Family
ID=81668270
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011350857.8A Active CN114539371B (zh) | 2020-11-26 | 2020-11-26 | 小麦白粉病抗性相关蛋白MlWE18和MlIW172及其应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114539371B (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114773443B (zh) * | 2022-04-20 | 2024-07-30 | 华南农业大学 | 一种抗大豆白粉病基因GmRmd1及编码蛋白和应用 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107236746A (zh) * | 2017-07-28 | 2017-10-10 | 福建农林大学 | 小麦白粉病抗性基因PmR2及其克隆与应用 |
CN111763249A (zh) * | 2019-03-08 | 2020-10-13 | 中国科学院遗传与发育生物学研究所 | 植物白粉病抗性相关蛋白Pm5e及其编码基因和应用 |
-
2020
- 2020-11-26 CN CN202011350857.8A patent/CN114539371B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107236746A (zh) * | 2017-07-28 | 2017-10-10 | 福建农林大学 | 小麦白粉病抗性基因PmR2及其克隆与应用 |
WO2019019421A1 (zh) * | 2017-07-28 | 2019-01-31 | 福建农林大学 | 小麦白粉病抗性基因PmR2及其克隆与应用 |
CN111763249A (zh) * | 2019-03-08 | 2020-10-13 | 中国科学院遗传与发育生物学研究所 | 植物白粉病抗性相关蛋白Pm5e及其编码基因和应用 |
Non-Patent Citations (7)
Also Published As
Publication number | Publication date |
---|---|
CN114539371A (zh) | 2022-05-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2019276382B2 (en) | Use of Yr4DS gene of Aegilops tauschii in stripe rust resistance breeding of Triticeae plants | |
US9663794B2 (en) | Heat-resistance rice gene OsZFP, screening marker and separation method thereof | |
KR101372114B1 (ko) | 벼 징크 핑거 단백질 전사 인자 dst 및 가뭄 및 염 내성을 조절하기 위한 이의 용도 | |
CN101365786B (zh) | 具有改良的生长特征的植物及其生产方法 | |
KR20120126061A (ko) | 향상된 수확량 관련 형질을 갖는 식물 및 이의 제조 방법 | |
CN110894220B (zh) | 种子相关蛋白在调控植物种子大小中的应用 | |
CA2475467C (en) | Gene conferring resistance to phytophthora infestans (late-blight) in solanaceae | |
CA2985273A1 (en) | Late blight resistance genes and methods | |
CN110041416B (zh) | GmABCA9基因在提高大豆蛋白含量和粒重中的应用 | |
CN114410651B (zh) | 玉米灰斑病抗性相关蛋白及其编码基因与应用 | |
KR100990370B1 (ko) | 벼 도열병균에 대한 내성을 증진시키는 유전자 및 이의용도 | |
CN101778942A (zh) | 产率相关性状增强的植物及制备其的方法 | |
CN116286724A (zh) | 凝集素类受体蛋白TaLecRLK2及其编码基因与应用 | |
CN111434679B (zh) | 株型相关蛋白在调控植物株型中的应用 | |
CN114539371B (zh) | 小麦白粉病抗性相关蛋白MlWE18和MlIW172及其应用 | |
WO2024174431A1 (zh) | 种子重量及产量相关的蛋白GmPHD6及其相关生物材料和应用 | |
CN102732531A (zh) | 一种水稻稻瘟病抗性基因RMg7或RMg8或RMg9及其应用 | |
CN113980919B (zh) | 调控玉米穗腐病抗性的dna序列及其突变体、分子标记和应用 | |
CN111826391A (zh) | 一种nhx2-gcd1双基因或其蛋白的应用 | |
CN112646820B (zh) | 改变玉米开花期的基因及方法 | |
CN110698552B (zh) | 水稻富含WD40重复蛋白OsWD40-141及其编码基因和应用 | |
CN111269915B (zh) | 白叶枯病抗性相关基因Xa39(t)及其相关生物材料与培育抗白叶枯病水稻的方法 | |
CN112961230B (zh) | 一种与植物耐盐相关的OsFLP蛋白质及其相关生物材料与应用 | |
CN110923214A (zh) | 抗病转TaM2K基因小麦的培育方法及其相关生物材料 | |
CN113462661B (zh) | 从玉米中分离的siz1蛋白及其编码基因和在品种改良中的应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |