CN1297661C - A rice blast resistance gene, its encoded protein and use thereof - Google Patents
A rice blast resistance gene, its encoded protein and use thereof Download PDFInfo
- Publication number
- CN1297661C CN1297661C CNB2003101184339A CN200310118433A CN1297661C CN 1297661 C CN1297661 C CN 1297661C CN B2003101184339 A CNB2003101184339 A CN B2003101184339A CN 200310118433 A CN200310118433 A CN 200310118433A CN 1297661 C CN1297661 C CN 1297661C
- Authority
- CN
- China
- Prior art keywords
- seq
- sequence
- gene
- protein
- ser
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 142
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 40
- 235000007164 Oryza sativa Nutrition 0.000 title abstract description 54
- 235000009566 rice Nutrition 0.000 title abstract description 51
- 240000007594 Oryza sativa Species 0.000 title description 4
- 201000010099 disease Diseases 0.000 claims abstract description 63
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims abstract description 63
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 9
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 3
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 3
- 150000001413 amino acids Chemical group 0.000 claims description 24
- 230000009261 transgenic effect Effects 0.000 claims description 12
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 9
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 9
- 239000004473 Threonine Substances 0.000 claims description 9
- 125000000539 amino acid group Chemical group 0.000 claims description 9
- 239000013604 expression vector Substances 0.000 claims description 8
- 239000012528 membrane Substances 0.000 claims description 8
- 101710186708 Agglutinin Proteins 0.000 claims description 3
- 101710146024 Horcolin Proteins 0.000 claims description 3
- 101710189395 Lectin Proteins 0.000 claims description 3
- 101710179758 Mannose-specific lectin Proteins 0.000 claims description 3
- 101710150763 Mannose-specific lectin 1 Proteins 0.000 claims description 3
- 101710150745 Mannose-specific lectin 2 Proteins 0.000 claims description 3
- 230000008034 disappearance Effects 0.000 claims description 2
- 239000002157 polynucleotide Substances 0.000 claims description 2
- 241000196324 Embryophyta Species 0.000 abstract description 54
- 241000209094 Oryza Species 0.000 abstract description 51
- 108020004414 DNA Proteins 0.000 abstract description 19
- 230000000694 effects Effects 0.000 abstract description 3
- 238000012258 culturing Methods 0.000 abstract 1
- 230000002708 enhancing effect Effects 0.000 abstract 1
- 235000018102 proteins Nutrition 0.000 description 28
- 235000001014 amino acid Nutrition 0.000 description 16
- 208000035240 Disease Resistance Diseases 0.000 description 12
- 239000002299 complementary DNA Substances 0.000 description 12
- 230000003321 amplification Effects 0.000 description 10
- 238000003199 nucleic acid amplification method Methods 0.000 description 10
- 244000052616 bacterial pathogen Species 0.000 description 8
- 238000009396 hybridization Methods 0.000 description 8
- 239000003550 marker Substances 0.000 description 8
- 230000014509 gene expression Effects 0.000 description 6
- 238000000034 method Methods 0.000 description 6
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 101001059454 Homo sapiens Serine/threonine-protein kinase MARK2 Proteins 0.000 description 5
- 102100028904 Serine/threonine-protein kinase MARK2 Human genes 0.000 description 5
- 210000004027 cell Anatomy 0.000 description 5
- 244000052769 pathogen Species 0.000 description 5
- 230000004044 response Effects 0.000 description 5
- 238000003757 reverse transcription PCR Methods 0.000 description 5
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- 108010076504 Protein Sorting Signals Proteins 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 238000011156 evaluation Methods 0.000 description 4
- 238000011081 inoculation Methods 0.000 description 4
- 238000013507 mapping Methods 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 230000001717 pathogenic effect Effects 0.000 description 4
- 238000000926 separation method Methods 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 3
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical group CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 3
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 3
- 241000244206 Nematoda Species 0.000 description 3
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 3
- 108091005682 Receptor kinases Proteins 0.000 description 3
- 240000003768 Solanum lycopersicum Species 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- 240000008042 Zea mays Species 0.000 description 3
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 3
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 235000005822 corn Nutrition 0.000 description 3
- 230000004665 defense response Effects 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 231100000252 nontoxic Toxicity 0.000 description 3
- 230000003000 nontoxic effect Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000010839 reverse transcription Methods 0.000 description 3
- 108700026215 vpr Genes Proteins 0.000 description 3
- 108010085238 Actins Proteins 0.000 description 2
- 241000345998 Calamus manan Species 0.000 description 2
- 101100364969 Dictyostelium discoideum scai gene Proteins 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- 240000005979 Hordeum vulgare Species 0.000 description 2
- 235000007340 Hordeum vulgare Nutrition 0.000 description 2
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- 101100364971 Mus musculus Scai gene Proteins 0.000 description 2
- 108091000080 Phosphotransferase Proteins 0.000 description 2
- 101000706985 Pinus strobus Putative disease resistance protein PS10 Proteins 0.000 description 2
- 102000001253 Protein Kinase Human genes 0.000 description 2
- 102100040631 Proton-activated chloride channel Human genes 0.000 description 2
- 101710101078 Proton-activated chloride channel Proteins 0.000 description 2
- 101150090155 R gene Proteins 0.000 description 2
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 2
- 206010042602 Supraventricular extrasystoles Diseases 0.000 description 2
- UTSWGQNAQRIHAI-UNQGMJICSA-N Thr-Arg-Phe Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UTSWGQNAQRIHAI-UNQGMJICSA-N 0.000 description 2
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 239000002523 lectin Substances 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- 239000002304 perfume Substances 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 102000020233 phosphotransferase Human genes 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 108060006633 protein kinase Proteins 0.000 description 2
- 235000012950 rattan cane Nutrition 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 238000007894 restriction fragment length polymorphism technique Methods 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 235000013599 spices Nutrition 0.000 description 2
- 239000007921 spray Substances 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- PENWAFASUFITRC-UHFFFAOYSA-N 2-(4-chlorophenyl)imidazo[2,1-a]isoquinoline Chemical group C1=CC(Cl)=CC=C1C1=CN(C=CC=2C3=CC=CC=2)C3=N1 PENWAFASUFITRC-UHFFFAOYSA-N 0.000 description 1
- 108091006112 ATPases Proteins 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 102000057290 Adenosine Triphosphatases Human genes 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- YBPLKDWJFYCZSV-ZLUOBGJFSA-N Ala-Asn-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N YBPLKDWJFYCZSV-ZLUOBGJFSA-N 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- CRWFEKLFPVRPBV-CIUDSAMLSA-N Ala-Gln-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CRWFEKLFPVRPBV-CIUDSAMLSA-N 0.000 description 1
- FAJIYNONGXEXAI-CQDKDKBSSA-N Ala-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 FAJIYNONGXEXAI-CQDKDKBSSA-N 0.000 description 1
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 1
- UWIQWPWWZUHBAO-ZLIFDBKOSA-N Ala-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)CC(C)C)C(O)=O)=CNC2=C1 UWIQWPWWZUHBAO-ZLIFDBKOSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- 101100529494 Arabidopsis thaliana RPW8.1 gene Proteins 0.000 description 1
- 101100529495 Arabidopsis thaliana RPW8.2 gene Proteins 0.000 description 1
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 1
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 1
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 1
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- JPPLRQVZMZFOSX-UWJYBYFXSA-N Asn-Tyr-Ala Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 JPPLRQVZMZFOSX-UWJYBYFXSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- DKQCWCQRAMAFLN-UBHSHLNASA-N Asp-Trp-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O DKQCWCQRAMAFLN-UBHSHLNASA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 235000016068 Berberis vulgaris Nutrition 0.000 description 1
- 241000335053 Beta vulgaris Species 0.000 description 1
- 241000895523 Blumeria graminis f. sp. hordei Species 0.000 description 1
- 240000008067 Cucumis sativus Species 0.000 description 1
- 235000010799 Cucumis sativus var sativus Nutrition 0.000 description 1
- CEZSLNCYQUFOSL-BQBZGAKWSA-N Cys-Arg-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O CEZSLNCYQUFOSL-BQBZGAKWSA-N 0.000 description 1
- WDQXKVCQXRNOSI-GHCJXIJMSA-N Cys-Asp-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WDQXKVCQXRNOSI-GHCJXIJMSA-N 0.000 description 1
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 1
- YRKJQKATZOTUEN-ACZMJKKPSA-N Cys-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N YRKJQKATZOTUEN-ACZMJKKPSA-N 0.000 description 1
- VIRYODQIWJNWNU-NRPADANISA-N Cys-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N VIRYODQIWJNWNU-NRPADANISA-N 0.000 description 1
- ZLHPWFSAUJEEAN-KBIXCLLPSA-N Cys-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N ZLHPWFSAUJEEAN-KBIXCLLPSA-N 0.000 description 1
- NIXHTNJAGGFBAW-CIUDSAMLSA-N Cys-Lys-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N NIXHTNJAGGFBAW-CIUDSAMLSA-N 0.000 description 1
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- ALNKNYKSZPSLBD-ZDLURKLDSA-N Cys-Thr-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ALNKNYKSZPSLBD-ZDLURKLDSA-N 0.000 description 1
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 1
- 206010011732 Cyst Diseases 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 1
- 101710096438 DNA-binding protein Proteins 0.000 description 1
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 1
- 229930182566 Gentamicin Natural products 0.000 description 1
- 101000722833 Geobacillus stearothermophilus 30S ribosomal protein S16 Proteins 0.000 description 1
- KWLMLNHADZIJIS-CIUDSAMLSA-N Gln-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N KWLMLNHADZIJIS-CIUDSAMLSA-N 0.000 description 1
- PONUFVLSGMQFAI-AVGNSLFASA-N Gln-Asn-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PONUFVLSGMQFAI-AVGNSLFASA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 1
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 1
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 1
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 1
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 1
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 1
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 1
- QVDGHDFFYHKJPN-QWRGUYRKSA-N Gly-Phe-Cys Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O QVDGHDFFYHKJPN-QWRGUYRKSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- NZOAFWHVAFJERA-OALUTQOASA-N Gly-Phe-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NZOAFWHVAFJERA-OALUTQOASA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- 102000011714 Glycine Receptors Human genes 0.000 description 1
- 108010076533 Glycine Receptors Proteins 0.000 description 1
- 102000002812 Heat-Shock Proteins Human genes 0.000 description 1
- 108010004889 Heat-Shock Proteins Proteins 0.000 description 1
- DFHVLUKTTVTCKY-PBCZWWQYSA-N His-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O DFHVLUKTTVTCKY-PBCZWWQYSA-N 0.000 description 1
- FYVHHKMHFPMBBG-GUBZILKMSA-N His-Gln-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FYVHHKMHFPMBBG-GUBZILKMSA-N 0.000 description 1
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 1
- OWYIDJCNRWRSJY-QTKMDUPCSA-N His-Pro-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O OWYIDJCNRWRSJY-QTKMDUPCSA-N 0.000 description 1
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 1
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 1
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 1
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- AMSYMDIIIRJRKZ-HJPIBITLSA-N Ile-His-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AMSYMDIIIRJRKZ-HJPIBITLSA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- REXAUQBGSGDEJY-IGISWZIWSA-N Ile-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N REXAUQBGSGDEJY-IGISWZIWSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- SNOUHRPNNCAOPI-SZMVWBNQSA-N Leu-Trp-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SNOUHRPNNCAOPI-SZMVWBNQSA-N 0.000 description 1
- UIIMIKFNIYPDJF-WDSOQIARSA-N Leu-Trp-Met Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCSC)C(O)=O)NC(=O)[C@@H](N)CC(C)C)=CNC2=C1 UIIMIKFNIYPDJF-WDSOQIARSA-N 0.000 description 1
- 108010006444 Leucine-Rich Repeat Proteins Proteins 0.000 description 1
- 241000209510 Liliopsida Species 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 1
- ZJSXCIMWLPSTMG-HSCHXYMDSA-N Lys-Trp-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJSXCIMWLPSTMG-HSCHXYMDSA-N 0.000 description 1
- 241001330975 Magnaporthe oryzae Species 0.000 description 1
- 240000004658 Medicago sativa Species 0.000 description 1
- 235000010624 Medicago sativa Nutrition 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- WUYLWZRHRLLEGB-AVGNSLFASA-N Met-Met-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O WUYLWZRHRLLEGB-AVGNSLFASA-N 0.000 description 1
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 235000003140 Panax quinquefolius Nutrition 0.000 description 1
- 240000005373 Panax quinquefolius Species 0.000 description 1
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 1
- OMHMIXFFRPMYHB-SRVKXCTJSA-N Phe-Cys-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OMHMIXFFRPMYHB-SRVKXCTJSA-N 0.000 description 1
- MYQCCQSMKNCNKY-KKUMJFAQSA-N Phe-His-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O)N MYQCCQSMKNCNKY-KKUMJFAQSA-N 0.000 description 1
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 1
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 1
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 1
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 1
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 1
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 1
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- 101150058540 RAC1 gene Proteins 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 241000589771 Ralstonia solanacearum Species 0.000 description 1
- 102100022122 Ras-related C3 botulinum toxin substrate 1 Human genes 0.000 description 1
- 241000124033 Salix Species 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 1
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- ZGFRMNZZTOVBOU-CIUDSAMLSA-N Ser-Met-Gln Chemical compound N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)O ZGFRMNZZTOVBOU-CIUDSAMLSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 1
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- 241000723873 Tobacco mosaic virus Species 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- UKWSFUSPGPBJGU-VFAJRCTISA-N Trp-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O UKWSFUSPGPBJGU-VFAJRCTISA-N 0.000 description 1
- SUEGAFMNTXXNLR-WFBYXXMGSA-N Trp-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O SUEGAFMNTXXNLR-WFBYXXMGSA-N 0.000 description 1
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 1
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- ARPONUQDNWLXOZ-KKUMJFAQSA-N Tyr-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ARPONUQDNWLXOZ-KKUMJFAQSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 1
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 1
- IRAUYEAFPFPVND-UVBJJODRSA-N Val-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 IRAUYEAFPFPVND-UVBJJODRSA-N 0.000 description 1
- SVLAAUGFIHSJPK-JYJNAYRXSA-N Val-Trp-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SVLAAUGFIHSJPK-JYJNAYRXSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 1
- 241001123669 Verticillium albo-atrum Species 0.000 description 1
- 108010081404 acein-2 Proteins 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 230000027455 binding Effects 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 238000010170 biological method Methods 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 238000000546 chi-square test Methods 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000007621 cluster analysis Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 208000031513 cyst Diseases 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 230000012202 endocytosis Effects 0.000 description 1
- 108010030074 endodeoxyribonuclease MluI Proteins 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 241001233957 eudicotyledons Species 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 229960002518 gentamicin Drugs 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 101150054900 gus gene Proteins 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- JEIPFZHSYJVQDO-UHFFFAOYSA-N iron(III) oxide Inorganic materials O=[Fe]O[Fe]=O JEIPFZHSYJVQDO-UHFFFAOYSA-N 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 210000004901 leucine-rich repeat Anatomy 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 238000012797 qualification Methods 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 230000008261 resistance mechanism Effects 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
- 229920002477 rna polymer Polymers 0.000 description 1
- 101150003708 rrs1 gene Proteins 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 238000010008 shearing Methods 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000005507 spraying Methods 0.000 description 1
- 238000012916 structural analysis Methods 0.000 description 1
- 230000008093 supporting effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 108700012359 toxins Proteins 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 108091005703 transmembrane proteins Proteins 0.000 description 1
- 102000035160 transmembrane proteins Human genes 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 230000001018 virulence Effects 0.000 description 1
Images
Landscapes
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
The present invention discloses a rice blast resistant gene, and an encoded protein and application thereof. The rice blast resistant gene provided by the present invention is one of the following nucleotide sequences: 1) SEQ ID No: 1 in a sequence list; 2)SEQ ID No: 2 in the sequence list; 3) SEQ ID No: 3 polyribonucleotide of a protein sequence in an encoding sequence list; 4) a DNA sequence which has more than 90 % of homology with DNA limited by the SEQ ID No: 1 or the SEQ ID No: 2 in the sequence list and can encode proteins having the same functions. The encoding protein of the rice blast resistant gene has an amino acid residue radical sequence of the SEQ ID No: 3 in the sequence list, or is derived by the SEQ ID No: 3 by replacing, deleting, or adding one or a plurality of amino acid residue radicals of the amino acid residue radical sequence of the SEQ ID No: 3, and has the same activity of the amino acid residue radical sequence of the SEQ ID No: 3. The gene of the present invention has significance for culturing disease resistant plant varieties, enhancing crop yield and enlarging crop planting areas.
Description
Technical field
The present invention relates to a kind of plant disease resistance genes and proteins encoded thereof and application, particularly relate to a kind of blast resistant gene and proteins encoded thereof and application.
Background technology
Plant tends to be subjected to the attack of various diseases such as bacterium, fungi, virus and nematode at occurring in nature, shows as disease-resistant (Resistance) or susceptible (Susceptibility) reaction.To disease resistance of plant and disease-resistant Study on Mechanism is the problem that plant pathology and breeding for disease resistance are extremely paid close attention to always.
From 1992 are cloned into first disease-resistant gene of plant Hm1 from corn since, be cloned into more than 40 plant disease-resistant (R) gene at present, these disease-resistant genes relate to Different Kinds of Pathogens microorganisms such as bacterium, fungi, virus and nematode.The most encoded protein products of these genes have similar constitutional features such as NBS (Nucleotidebinding site, nucleotide binding site), LRR (Leucine-rich repeat, rich leucine repeats), TM (Transmembrane domain, membrane spaning domain), PK (Protein kinase, protein kinase), LZ (Leucine zipper, leucine zipper), CC (Coiled Coil, coiled coil) and TIR (Toll-Interleukin-1 Receptor, Toll il-1 zone) etc.And whether contain these structural domains according to them, also these disease-resistant genes can be divided into LZ (CC)-NBS-LRR, TIR-NBS-LRR, TM-LRR, TM-LRR, 5 classes such as LRR-TM-STK.But, except above-mentioned 5 class R genes, also cloned the R gene of other specific type at present.As the Hm1 gene of corn is to belong to a genoid of representing in the plant disease resistance genes with the effect of the pathogenic bacteria affinity factor, its toxin reductase enzyme of encoding, because it doesn't matter for resistance that produces and the nontoxic gene of pathogenic bacteria, thereby its disease-resistant mechanism does not meet gene-for-gene theory; The Hs1 of the anti-Cyst nematode of beet
Pra-1The albumen of coded by said gene contains incomplete LRR and strides the film district; Membranin that contains 6 transmembrane protein spirals (Membranespanning helices) at least of the recessive mildew-resistance of barley (Erysiphegraminis f.sp.Hordei) mlo genes encoding, and do not have the similar structural domain of other R gene; The mildew-resistance gene RPW8 encoded protein of Arabidopis thaliana has a coiled coil district of striding in membrane structure and the born of the same parents, and comprises RPW8.1, and two different sites of RPW8.2 can produce resistance of wide spectrum to pathogenic bacteria; The resisting tobacco mosaic virus gene Rtm1 of Arabidopis thaliana and the Rtm2 heat shock protein of encoding respectively; The TIR-NBS-LRR albumen though the disease-resistant gene RRS1-R in the Arabidopis thaliana also encodes, but its proteic C-terminal comprises the nuclear localization signal structure that the transcription activating protein of being made up of 60 amino acid with WRKY family has similar structures, and this gene pairs pathogenic bacteria, Ralstoniasolanacearum, have non-microspecies specificity resistance, and be recessive inheritance; Tomato is to be similar to the cell surface Glycine Receptors albumen that participates in the endocytosis signal pathway to the disease-resistant gene Ve of pathogenic bacteria Verticillium alboatrum coding, in addition the anti-stem rust gene Rpg1 encoded protein of in barley, cloning recently contain two kinases districts and hydrophobicity more weak stride the film district.Therefore, the classification for disease-resistant gene is not a simple process.Only be cloned into more disease-resistant gene, just may understand the feature of plant disease resistance genes more all sidedly, and resolve the real mechanism of plant disease-resistant.
Paddy rice is one of main in the world food crop, is supporting the population in the whole world nearly 1/4.Great disease such as rice blast, bacterial leaf spot is the major reason that influences its output always, disease resistance and the resistance mechanism thermalization day by day of research paddy rice, as having located more than 20 rice blast resistance gene and more than 10 rice blast resistance QTLs site at present, localized bacterial leaf spot resistant ospc gene has also reached more than 20.But since nineteen ninety-five is cloned into first rice bacterial blight resistance gene Xa21, other three disease-resistant genes such as bacterial leaf spot resistant ospc gene Xa1, blast resisting Pi-b and Pi-ta only in paddy rice, have been cloned at present.This is for monocotyledonous model plant, and some is slow really for the progress of clone's resistant gene.And the Xa21 that is cloned, Xa1, four disease-resistant gene encoded protein products such as Pi-b, Pi-ta all contain NBS and LRR structure, belong to the big class of disease-resistant gene.And set out according to the disease-resistant gene constitutional features of on plant, being cloned into, infer in the paddy rice also should have polytype disease-resistant gene.Therefore, clone the disease-resistant gene more, that kind is wider more adding system in depth disclose the disease-resistant mechanism of paddy rice, also could be used for the disease resistance of genetically engineered improvement paddy rice better, improve its output.
The innovation and creation content
The purpose of this invention is to provide a kind of blast resistant gene and proteins encoded thereof.
Blast resistant gene provided by the present invention, name is called Pi-d2, derives from paddy rice (Oryza sativavar.Lansheng), is one of following nucleotide sequences:
1) the SEQ ID № in the sequence table: 1;
2) the SEQ ID № in the sequence table: 2;
3) SEQ ID № in the code sequence tabulation: the polynucleotide of 3 protein sequences.。
The proteins encoded Pi-d2 of blast resistant gene Pi-d2, be to have SEQ ID № in the sequence table: the protein of 3 amino acid residue sequences, or with SEQ ID №: 3 amino acid residue sequence is through replacement, disappearance or the interpolation of one or several amino-acid residue and have the № with SEQ ID: 3 amino acid residue sequence is identical active by SEQ ID №: 3 deutero-protein.
The serine/threonine protein acceptor class kinases that sequence 3 is made up of 825 amino-acid residues in the sequence table.In the sequence 3, being a membrane spaning domain (TM) from aminoterminal 12-34 amino acids residue sequence, is the structural domain with signal peptide function; From aminoterminal 48-165 amino acids residue sequence is an allosome recognition structure territory (B-Lectin) similar to exogenous agglutinin protein, participates in the identification of pathogen; From aminoterminal 436-458 amino acids residue sequence is a membrane spaning domain (TM), is the intermediary that above-mentioned recognition signal is passed to serine/threonine protein kitase (STYK) structural domain of carboxyl terminal; From aminoterminal 501-771 amino acids residue sequence is a serine/threonine protein kitase (STYK) structural domain, and its phosphorylation can be conducted signal to disease-resistant defense response system.Infer that signal peptide participates in identification to rice blast pathogen ZB15 nontoxic protein with allosome recognition structure territory, excites the STYK zone then.
Contain expression carrier of the present invention and clone and all belong to protection scope of the present invention.
Arbitrary segmental primer is to also within protection scope of the present invention among the amplification Pi-d2.
Utilize any carrier that can guide foreign gene in plant, to express, blast resistant gene Pi-d2 provided by the present invention is imported vegetable cell, can obtain disease-resistant or disease-resistant enhanced transgenic cell line and transfer-gen plant.Gene of the present invention can add any general promotor, strengthen promotor or inducible promoter in being building up to plant expression vector the time before its transcription initiation Nucleotide.For the ease of transgenic plant or transgenic plant cells being identified and being screened, can process employed carrier, as the antibiotic marker thing (gentamicin, kantlex etc.) that adds the alternative mark (gus gene, luciferase gene etc.) of plant or have resistance.For the security that transgenic plant discharge, when making up plant expression vector, also can not carry any marker gene, directly screen in seedling stage with the Pyricularia oryzae inoculation.The expression vector of Pi-d2 of the present invention can be by using conventional biological method transformed plant cells or tissues such as Ti-plasmids, Ri plasmid, plant viral vector, directly DNA conversion, microinjection, electricity be led, agriculture bacillus mediated or particle gun, and the plant transformed tissue cultivating become plant.By the plant transformed host both can be monocotyledons, also can be dicotyledons, as: paddy rice, wheat, corn, cucumber, tomato, willow, turfgrass, lucerne place etc.Gene pairs of the present invention is cultivated the disease-resistant plants kind, enlarges the crop-planting scope, and it is significant to improve crop yield.
The present invention will be further described below in conjunction with drawings and Examples.
Description of drawings
Figure 1A is the Fine Mapping synoptic diagram of Pi-d2
Figure 1B is the folded group's synoptic diagram of striding of Pi-d2 designation of chromosome zone
Fig. 2 is the predictive genes result of GENSCAN to the dna sequence dna of 180kb
Fig. 3 is the domain analyses of Pi-d2 proteins encoded
Fig. 4 A is the cluster analysis in Pi-d2 and Pto and Xa21 serine/threonine kinase district
Fig. 4 B is that the sequence similarity in Pi-d2 and Pto and Xa21 serine/threonine kinase district compares
Fig. 5 is the Southern results of hybridization of the copy number of Pi-d2 full-length gene in rice genome
Fig. 6 A is complementation test expression vector pZH01/Pi-d2
Fig. 6 B is that Pi-d2 transgenosis T0 is for the Molecular Detection result
Fig. 6 C is for the resistance qualification result to part transgenic line T1
Fig. 6 D is that the T2 of transgenic line T0-10 is for the Molecular Detection result
Fig. 7 A is the RT-PCR analytical results of Pi-d2 on transcriptional level
Fig. 7 B is the Northern analytical results of Pi-d2 on transcriptional level
Embodiment
Use the stronger rice blast physiological strain ZB15 of rice district, China south virulence paddy and susceptible rice varieties Lijiang xintuanheigu (LTH), south of the River perfume (or spice) glutinous (JNXN) and the F of hybridizing acquisition over the ground
1, BC
1F
1, F
2Colony inoculates evaluation, and the result is as shown in table 1, and ground paddy shows as disease-resistant to ZB15, Lijiang xintuanheigu, the south of the River fragrant glutinous all show as susceptible, the F of ground paddy and Lijiang xintuanheigu, the fragrant glutinous hybridization acquisition in the south of the River
1Plant also all shows as disease-resistant to ZB15, and ground paddy and Lijiang xintuanheigu, the fragrant glutinous hybridization in the south of the River, the B of colony of acquisition further backcrosses
1F
1In disease-resistant individual plant and susceptible individual plant meet 1: 1, and each F
2The disease-resistant susceptible separation of colony is than all meeting 3: 1.These results show that rice varieties ground paddy is controlled by endonuclear single-gene to the resistance of rice blast microspecies ZB15, and are complete dominant inheritance, name to be Pi-d2.
Table 1 parent ground paddy (Digu), Lijiang xintuanheigu (LTH), south of the River perfume (or spice) glutinous (JNXN) and filial generation thereof are to the disease-resistant susceptible situation (R=is disease-resistant, and S=is susceptible) of rice blast pathogenic bacteria microspecies ZB15
Parent and progeny population thereof | Disease-resistant, susceptible individual plant number | The expectation ratio | x 2 | P 0.05,0.01 | |
R | S | R∶S | |||
Digu | 37 | ||||
LTH | 32 | ||||
‘Digu/LTH’F 1 | 18 | ||||
‘LTH/Digu’F 1 | 26 | ||||
‘(Digu/LTH)/LTH’F 1 | 23 | 18 | 1∶1 | 0.2602 | 3.84-6.63 |
‘(LTH/Digu)/Digu’F 1 | 32 | 28 | 1∶1 | 0.2667 | 3.84-6.63 |
‘Digu/LTH’F 2 | 372 | 100 | 3∶1 | 3.406 | 3.84-6.63 |
‘LTH/Digu’F 2 | 422 | 118 | 3∶1 | 2.8643 | 3.84-6.63 |
JNXN | 21 | ||||
‘Digu/JNXN’F 1 | 17 | ||||
‘JNXN/Digu’F 1 | 23 | ||||
‘(Digu/JNXN)/JNXN’F 1 | 19 | 17 | 1∶1 | 0.1111 | 3.84-6.63 |
‘(JNXN/Digu)/Digu’F 1 | 21 | 16 | 1∶1 | 1.3889 | 3.84-6.63 |
‘Digu/JNXN’F 2 | 84 | 37 | 3∶1 | 2.0083 | 3.84-6.63 |
’JNXN/Digu’F 2 | 105 | 39 | 3∶1 | 0.3333 | 3.84-6.63 |
The Japan that contains known disease-resistant gene is differentiated the F of system and ZYQ8 and they and ground paddy hybridization acquisition with rice blast microspecies ZB15
1And F
2Colony inoculates evaluation.As shown in table 2, BL-1, K60, Pi-4 number, K1, Fu Jin, No. 5, rattan slope, plum rains are bright etc., and 7 kinds all show susceptible reaction to ZB15, and the F that obtains with the hybridization of ground paddy
1All show disease resistance response, hybridize the F that obtains with ground paddy
2Disease-resistant, susceptible separation meet 3: 1 than all, show that the contained blast resistant gene of the contained blast resistant gene Pi-d2 of ground paddy and these 7 kinds is different.
Table 2 ground paddy and each rice blast differential variety filial generation F
1And F
2Colony is to the disease-resistant susceptible separation case (R=is disease-resistant, and S=is susceptible) of rice blast pathogenic bacteria ZB15
Rice blast differential variety (disease-resistant gene) | Differential variety is disease-resistant/susceptible response situation | F 1Disease-resistant/susceptible response situation of plant | F 1Disease-resistant/susceptible response situation of plant | |||||
The plant sum | Disease-resistant strain number | Susceptible strain number | Chi-square test | |||||
The expectation ratio | x 2 | P 0.05,0.01 | ||||||
New No. 2 (Pik ') likes to know No. 5 (Pi-i) plum rains of the careless flute e of the rising sun (Pia) (Pik) BL-1 (Pi-b) K60 (Pi-k ") Pi.No.4 (Pi-ta ') K1 (Pi-ta) good fortune brocade (Pi-z) rattan slope bright (Pi-k ") blue or green No. 8 (Pi-11) of No. 1 (Pi-z ') K59 of Zhai (Pi-t) narrow leaf | S S S S S S S S S S R R R | R R R R R R R R R R R R R | 93 117 196 198 215 205 231 195 228 250 231 218 265 | 73 85 141 141 153 150 155 158 157 157 229 192 227 | 20 32 55 57 62 55 76 37 71 53 2 26 38 | 3∶1 3∶1 3∶1 3∶1 3∶1 3∶1 3∶1 3∶1 3∶1 3∶1 -- -- -- | 0.6057 0.3447 0.9800 1.3878 1.4900 0.2748 1.8185 0.8654 4.2632 1.7280 -- -- -- | 3.84-6.63 3.84-6.63 3.84-6.63 3.84-6.63 3.84-6.63 3.84-6.63 3.84-6.63 3.84-6.63 3.84-6.63 3.84-6.63 -- -- -- |
And rice varieties K59, the F of narrow leaf blue or green No. 8 and No. 1, Zhai and they and ground paddy hybridization acquisition
1ZB15 is all shown as disease resistance response, but its corresponding F
2Colony all separates and susceptible individual plant occurred, and its separation did not meet 3: 1 than both, did not meet 15: 1 yet, showed K59, may also contain the blast resistant gene to ZB15 among No. 1, narrow leaf blue or green No. 8 and the Zhai, but different with Pi-d2 in the ground paddy, nor equipotential.
With ground paddy and Lijiang xintuanheigu is the parent, has developed a F
2Colony, with ZB15 to F
2Colony inoculates processing, identify disease-resistant and susceptible individual plant, and disease-resistant individual plant and susceptible individual plant resisted, feel and hiving off, 6 individual plants of difference picked at random from disease-resistant and disease plant, get the equivalent blade, and, form disease-resistant and susceptible DNA pond with the blade of disease-resistant and susceptible individual plant mixed extraction DNA respectively, the disease-resistant and susceptible DNA pond of ZB15 is named respectively be BR15 and BS15.Successively parent ground paddy and Lijiang xintuanheigu are carried out dna polymorphism screen with being distributed in 12 chromosomal 127 RFLP marks of paddy rice and 322 SSR marks more fifty-fifty, find wherein to have 72 RFLP marks and 119 SSR marks can disclose the polymorphism of two parent DNA, and then analyze with these 191 molecule marker enantiopathies, susceptible DNA pond, and further with this assignment of genes gene mapping on paddy rice the 6th karyomit(e).
The Fine Mapping of embodiment 4, blast resistant gene
Utilizing ground paddy and Lijiang xintuanheigu to make up the F of expansion for the parent
2Colony, and with this F
2Colony plants in the greenhouse, and in its one core phase of three leaves, ZB15 carries out spray inoculation with the rice blast physiological strain, and keeps 100% humidity and 25-28 ℃ culture environment, inoculates after 10 days F
2Individual plant resists susceptible evaluation.Obtain the susceptible especially significantly F of about 4000 strains
2Individual plant is to these F
2The blade of individual plant mixes by 5 strains builds pond extraction DNA, obtains about 800 susceptible DNA ponds, is used for Fine Mapping.With with the molecule marker of Pi-d2 positioning analysis, and the sequences Design primer that provides according to CIDC's cara gene (http://btn.genomics.org.cn) and international genome plan (http://rgp.dna.affrc.go.jp), 800 susceptible DNA ponds are analyzed, be located between molecule marker CAPs1 and CAPs8, wherein molecule marker CAPs1 and CAPs8 and target gene have 1 and 3 exchanges respectively, and molecule marker CAPs2, dCAPs1, dCAPs2 then with target gene be divided into from, shown in Figure 1A.
The acquisition of embodiment 5, candidate gene
At first to molecule marker CAPs1 and CAPs8 near trans-regional PACs sequences several analyze the back and make up its PAC and stride folded group (shown in Figure 1B), and find this regional karyomit(e) exchange frequency very low (about 500kb/cM), therefore, be difficult to target gene is positioned at the limited range of one or two gene.So the physical groups that CAPs1 and CAPs8 stride folded 180KB has been carried out predictive genes with GENSCAN, predict the outcome as shown in Figure 2, this zone contains 33 genes (as shown in table 3) altogether, serine/threonine receptor kinase protein (STK Protein) wherein, DNA conjugated protein (DNA Binding PROTEIN), ATP zymoprotein (ATPases), unknown function albumen (Unkown function Protein), each 1 of the rich proline(Pro) structural domain (Proline rich domain) of shearing associated protein, 2 of ThermoScript II (Reverse transcriptase), 5 in a type games albumen (MviN-like protein) in the bacterium, infer albumen (Putative Protein) 21, infer that the gene of the serine/threonine protein receptor kinase of wherein encoding is likely target gene.So, according to its dna sequence dna design primer of transcribing the zone carry out 5 ' and 3 ' the terminal amplification (RapidAmplification cDNA Ends, RACE) and transcription amplification (RT-PCR) obtain its full-length cDNA.Detailed process is as follows: for 5 ' terminal amplification, with primer 5 ' (P) TGAATGGGTGAC 3 ' RNA (having removed DNA) in the paddy rice ground paddy is carried out reverse transcription earlier and obtain cDNA, be template with this cDNA then, use special primer A1:5 ' CTTGCAGTGGTTCACATGGC 3 ' and S1:5 ' CCAAAGCCAAAGACAGAGCC 3 ' to carry out first round amplification, be template with the first PCR product that obtains again, carry out two with special primer A2:5 ' CAGTCTGGTCTGCCAATCCT 3 ' and S2:5 ' CTTGCAGTGGTTCACATGGC 3 ' and take turns amplification.Similarly, for 3 ' terminal amplification, earlier template ribonucleic acid is carried out reverse transcription, use special primer Con3R:5 ' TCGACCCGCCCATCCTTGT 3 ' and Adapter:5 ' GACTCGAGTCGACATCGA3 ' then increasing that reverse transcription obtains with 5 ' GACTCGAGTCGACATCGA (T) 163 '.For the zone between 5 ' and 3 ' end, with the cDNA in 3 ' the terminal amplification procedure is template, with two couples of special primer S1+CON2R (5 '-CCAAAGCCAAAGACAGAGCC-3 ', and 5 '-ATTTGAAGGCGTTTGCGTAGA-3 ') and CON2F+CON3R (5 '-TTGGCTATCATAGGCGTCC-3 ' and 5 '-TCGACCCGCCCATCCTTGT-3 ') acquisition of increasing respectively.By this full-length cDNA of analysis revealed 825 amino acid of encoding altogether, shown in the sequence in the sequence table 3.Simultaneously with primer GenRF: upstream 5 ' end: 5 ' AGCATCAACATAGACGTAGCGTGG 3 ', downstream 3 ' end: 5 ' CTAGTTACAGATCACTGTGCCAT 3 '; Utilize high-fidelity enzyme Pfu amplification, obtained the genome sequence that this gene pairs is answered, shown in the sequence in the sequence table 1, by relatively its genome and cDNA sequence are found this gene, only contain an open reading frame, it has comprised the complete area of cDNA coding, and this result shows that this gene does not contain intron.
Table 3GENSCAN is to the predictive genes of the dna sequence dna of 180kb II as a result
The gene numbering | Amino acid length | Homologous |
1 | 825 | The serine/threonine |
2 | 358 | |
3 | 91 | Infer albumen |
4 | 383 | Infer albumen |
5 | 123 | Infer albumen |
6 | 484 | Infer albumen |
7 | 666 | Motion |
8 | 105 | Infer albumen |
9 | 150 | |
10 | 559 | Infer albumen |
11 | 335 | Infer albumen |
12 | 381 | Motion albumen |
13 | 877 | Infer albumen |
14 | 1027 | ThermoScript II |
15 | 445 | Infer albumen |
16 | 1001 | Motion albumen |
17 | 353 | Infer albumen |
The gene numbering | Amino acid length | Homologous gene |
18 | 450 | Infer albumen |
19 | 358 | Motion |
20 | 668 | Infer albumen |
21 | 448 | Motion albumen |
22 | 375 | |
23 | 691 | Rich proline(Pro) structural domain |
24 | 207 | Infer albumen |
25 | 357 | Unknown function albumen |
26 | 140 | Infer albumen |
27 | 750 | Infer albumen |
28 | 447 | Infer albumen |
29 | 690 | DNA is conjugated |
30 | 75 | Infer albumen |
31 | 261 | Infer albumen |
32 | 1721 | ThermoScript II |
33 | 911 | The ATP zymoprotein |
The structural analysis of embodiment 6, Pi-d2 proteins encoded
With Simple Modular Architecture Research Tool (SMART) (http://smart.embl-heidelberg.de/) method the coded albumen full length amino acid sequence of gene Pi-d2 is analyzed, the result as shown in Figure 3, this proteic amino least significant end promptly in sequence table the aminoterminal 1-32 amino acids residue sequence of sequence 3 be a membrane spaning domain (TM), be one and comprise 23 amino acid whose signal peptide structures (Signaling domain); The aminoterminal 48-165 amino acids residue sequence of sequence 3 is allosome recognition structure territories (B-Lectin) similar to exogenous agglutinin protein in sequence table, participates in the identification of pathogen; The aminoterminal 436-458 amino acids residue sequence of sequence 3 is membrane spaning domains (TM) in sequence table, is the intermediary that above-mentioned recognition signal is passed to serine/threonine protein kitase (STYK) structural domain of carboxyl terminal; The aminoterminal 501-771 amino acids residue sequence of sequence 3 is a serine/threonine protein kitase (STYK) structural domains in sequence table, and its phosphorylation can be conducted signal to disease-resistant defense response system; In sequence table the aminoterminal 419-431 amino acids residue sequence of sequence 3 and in sequence table the aminoterminal 797-808 amino acids residue sequence of sequence 3 be low complexity zone.Infer that signal peptide participates in identification to rice blast pathogen ZB15 nontoxic protein with allosome recognition structure territory, excite the activity in STYK zone then, thereby the intravital disease-resistant defense response of activated plant system realizes the disease resistance to pathogenic bacteria ZB15.
The comparison of embodiment 7, Pi-d2 and Pto and Xa21 serine/threonine kinase region amino acid sequence
Because in the present plant disease resistance genes of cloning, only there are the Pto of tomato and the Xa21 of paddy rice to have the serine/threonine kinase structural domain, therefore, from NCBI (http://www.ncbi.nlm.nih.gov/), search the aminoacid sequence of Pto and the corresponding STYK structural domain of Xa21 and compare with the aminoacid sequence of the STYK structural domain of Pi-d2, the result is shown in Fig. 4 A, show between Pi-d2 and Pto, Xa21 relative very conservatively, estimate that these zones are that such serine/threonine kinase performance is active necessary in some zone.Three's similarity comparative result show that the homology of the STYK of Pi-d2 and Pto, Xa21 is respectively 32.3%, 26.2%, and the homology of STYK only is 21.2% between Pto and Xa21 shown in Fig. 4 B.Explanation is on evolving, and Pi-d2 may be than nearer with Xa21 with the sibship of Pto.
Paddy, Lijiang xintuanheigu, the Taibei 309 (TP309) change film according to the rice material genomic dna of SDS method extraction after enzymes such as ScaI, DraI are cut over the ground, utilize the Pi-d2 full length cDNA sequence to be probe, carry out Southern hybridization, the result as shown in Figure 5, show that cutting three samples hybridization of Hou Digu, Lijiang xintuanheigu, TP309 through ScaI, DraI enzyme all shows a band line, and the hybrid belt line is all variant between disease-resistant variety ground paddy and susceptible variety Lijiang xintuanheigu, TP309.Show that this gene exists with single copy form in rice genome, and dna sequence dna there are differences in disease-resistant material and susceptible material.1,2,3 genomic dnas of representing paddy rice ground paddy, Lijiang xintuanheigu, the Taibei 309 respectively among the figure.
Embodiment 9, have complementary functions
Because this gene does not contain intron, paddy cDNA is a template with paddy rice ground, with 5 '-TTGGG
TCTAGAAGCATCAACATAGACGTAGCGTGG-3 ' and 5 '-TTTGC
GTCGACCTAGTTACAGATCACTGTGCCAT-3 ' is that (wherein band is respectively restriction enzyme XbaI in the zone of setting-out down to primer, the recognition site of SalI, the protection base is cut for enzyme in the italic zone), utilize the high-fidelity pcr amplification to obtain the total length of this gene, and by after repeatedly order-checking verifies that repeatedly its sequence is correct, it is structured in contains 35S promoter pZH01 and go up as the expression vector that has complementary functions, the result as shown in Figure 6A, show that gene Pi-d2 is implemented in the downstream of the tobacco mosaic disease virus promoter 35S of expression vector pZH01, the expression vector pZH01/Pi-d2 that obtains having complementary functions, and utilize the susceptible rice varieties TP309 of agrobacterium mediation converted, 41 transgenic lines have been obtained, to these 41 strains is to carry out Molecular Detection in T0 generation, and the primer is upstream a 5 ' end: 5 ' TTGGCTATCATAGGCGTCC 3 '; Downstream 3 ' end: 5 ' ATTTGAAGGCGTTTGCGTAGA 3 '.The result is shown in Fig. 6 B, wherein 3 strains are the genotype that shows as TP309,35 strain systems show as the heterozygous of ground paddy and TP309, the performance of 3 strain systems closely is a ground paddy type, showing has 38 positive transgenic lines, and wherein genotype three strains systems being ground paddy type are likely that multi-copy integration has taken place target gene.Among Fig. 6 B, A is the Taibei 309; B is a ground paddy; C is a Lijiang xintuanheigu; 1-41 is transgenosis T0 generation, and totally 41 strains are three kinds of molecule banding patterns; * be the Taibei 309 types (the strain system that does not change over to), totally 3; # is ground paddy type (may be multiple copied strain system), totally 3; Heterozygous, totally 35.
Tie up to T1 for spraying respectively and injection inoculation in seedling stage and division Sheng phase to wherein obtaining seed-bearing 20 strains, the result is shown in Fig. 6 C, show that the 8th, 10 strains tie up to T1 for obviously ZB15 being had resistance, further these two strains being tied up to T2 inoculates for individual plant, and disease-resistant, the susceptible individual plant that is to each strain carries out Molecular Detection, used special primer 5 ' TTGGCTATCATAGGCGTCC 3 ' and 5 ' ATTTGAAGGCGTTTGCGTAGA 3 ', and PCR product enzyme is cut rear electrophoresis with restriction enzyme MluI.The result shows in disease-resistant individual plant all can detect target gene shown in Fig. 6 D, then fails to detect the target transgenosis in susceptible individual plant, illustrates that transfer-gen plant is to be provided by target disease-resistant gene Pi-d2 to the resistance of ZB15 really.Among Fig. 6 D, DL2000 is a molecular weight standard; A, B, C represent disease-resistant ground paddy successively, susceptible Lijiang xintuanheigu, the susceptible Taibei 309; 1 to 12 represents the disease-resistant individual plant of this strain system respectively; The susceptible individual plant of this strain system of 13 to 17 expressions.
Further these two disease-resistant transgenic line offsprings are inoculated with other rice blast physiological strains, the result shows that these two transgenic lines are to rice blast microspecies ZB13, Zhong-10-8-14, Zh2-1, Zk-10-2 etc. are susceptible, this result has verified that further transgenic line is by due to the disease-resistant gene Pi-d2 that changes over to the disease resistance of ZB15, and has disease-resistant specificity.Simultaneously proved that also this gene is exactly target disease-resistant gene Pi-d2.
The expression analysis of embodiment 10, disease-resistant gene
With rice blast physiological strain ZB15 over the ground paddy and Lijiang xintuanheigu carry out spray inoculation in seedling stage and handle, by handling back 0,12,24,48, RNA is extracted in 96 hours (hr) sampling according to a conventional method then.And each sample RNA is carried out being used for RT-PCR after DNA removes analyze, primer is a pair of specificity amplification primer (5 '-TTGGCTATCATAGGCGTCC-3 ' and 5 '-ATTTGAAGGCGTTTGCGTAGA-3 ') in the gene Pi-d2, and the cDNA sequences Design Actin primer of the paddy rice Actin gene Rac1 that delivers according to (Plant Mol.Biol.14 (2), 163-171 (1990)) such as McElroy (5 '-CCTCGTCTCGACCTTGCTGGG-3 ' and 5 '-GAGAACAAGCAGGAGGACGGC-3 ') in contrast.The result is shown in Fig. 7 A, and disease-resistant gene Pi-d2 is not subjected to inducing of ZB15, and at rice root, stem and Ye Zhongjun expression is arranged.Also the result with RT-PCR is consistent shown in Fig. 7 B for the result of Northern.Show that this gene as other most resistant gene in plant of being cloned (as Xa21 etc.), is constitutive expression, its expression amount is not very high, obtains but all can detect by RT-PCR and Northern hybridization.
Sequence table
<160>3
<210>1
<211>6261
<212>DNA
<213〉paddy rice belongs to paddy rice (Oryza sativa var.Lansheng)
<400>1
agcatcaaca tagacgtagc gtggccgtat ccatttttta atgcatatat aaatttctcc 60
atcttttgcg atctctctgt tgctcacagt gtggccgtac acgctaaaca aatactccag 120
tactactact ccttattata ctcagcgtcc caaaatataa cttctatgct tcaaatttta 180
tctcgaaatt acactctcct cccaatcaat cacaaccttt caattccacc atttttataa 240
tcccgtattt aacaaacatt ttatatttca gaatgaaggt agttgtataa tattagtaat 300
aaataaacta ttcttttttc aaaaaataca ggacgcaatt tgataatgta aagcgtaaat 360
gcacacttaa ctagcacaac tctactaaat tcctttcaaa attctacacc atgagatctc 420
tggttctatt tagtgtctgt attgtatttt tttagtatga tatgatctaa acggtaaggt 480
aataataata acttttacta cctccattcc aaattagtag tcgctttcac ttttttcttt 540
tttcttgtaa cgtttgacca ttcgtcttat ttaaaaaatt agtgcaaata taaaaataga 600
gaagtcatac ttaaagtgct tttaataata aagcaaatca taaaaaaaac aaatattaat 660
tcgagtaaat tgcacccaca gtacaacaac ttgataggtg ggtgcgatat agtgcaagaa 720
cttgagaatt gaacgttcga gtgcaacaac ttgacaagtg ggtgcgtttt agtacaagaa 780
cttgacaatt tagtattttg gtgcatcaac taagctaagc atatgctagg tgagtttttc 840
tcacgatatc aatatgccat tacatagcaa tcatacggat attaccttgt aatattgaat 900
tttatcttta agaattatac tgcatatccc aatttacaac caattacctt taagaatttt 960
tgttttcttc acattcctaa atctcaaccg aaatataata atttctacat ctagttaatt 1020
gacttttagt tattagttat tagaataaaa atataaatat gcttatatgc aaatccacgc 1080
tagtatattg tcaaaataag aacttaaaaa ttgaatatgc ataaaaagtg ctccaaataa 1140
ttaagttgtc gcataacttc ttatcttatt gtatcaaaat cgtacgtatc ttacaagttg 1200
ttgcatttat acaatcactt ataaaatttt ttagctaaac tgcacatacc tgcgaagttg 1260
atgcactaga atgctcgttt ctcaagttat tgcaccaaat tgcacccacc tactaagttg 1320
atacatcatg ggtgtaattt actctattaa ttctataatt tttgaataag acgaaagatt 1380
aaaagttaaa aaaaactcaa agcgacaagt actgtaggac ggagaaagta tttattttta 1440
cgtgtcttgt agttaagtac agtatatttt taggtggagg gatgaataat ctccctggtt 1500
gagaggtgga gagagaagga tggtacccat tgaaaaatga agaattccaa taccacttgc 1560
aggtaggttg ttgctcgttt taattcctac tgccctcttg ctgctcacca actgctactc 1620
tctcctctcc caccatttcg tcaccctgcc tcctcctgcc tctgcctctg ctcggctagt 1680
ctagtgtact actctatctt ctcctcctct ctttctccta cccaaatcca tccacccaac 1740
atcttttttc tcagccgtcg tctcatctcg ccttcgtagc gttgcgtcgc gccgcgtcta 1800
ccctttccag gtgagcctcc ccgattacat cgctcgttct agctagctag ctagctagct 1860
cctgtgttct tgctgtttct ttgtttcttt ttgcgttttt aattttcctt tcttggatct 1920
ttctttgcta gctcgttttg atttgtgtgc ttctgtattt gtgtgttttg tgggcgaaag 1980
gggggctttc ttggttcttg ttccgggcga ttgttattct tgttctggag aatcggatta 2040
cgagtgtgcc ttgcagtgga gttttgtaat gttaggctcc aattgagcaa gaaaaagatg 2100
aggttcttgt taaattctag gttatccagc tcttgtttgg ttggctggtc tggaactctg 2160
aattccctct gtagcttttt actaggtaat ggaatcccct ttgtaatgca aaatgtagat 2220
gcctctgcca tggctatgtt ttatctttag atgtccggaa ttctcttcta agggagccta 2280
aacacggcag ttcagttcag attaattatg ggtctgggtt tgaagaaaag aataaaggga 2340
gtgaaataaa gaaacaaaca ggtctgaatt tgtgaccctg tttagctgaa agtgaaactg 2400
aaaaggggaa cagaaaggaa gaattgacca tcgatttaat gttaggtgtt actgcatatg 2460
tgcaagcaaa gattgttctt ttggcagtgg ataatgctct gatggtctct ggccgtgatc 2520
aagtggttgt ttttctctgc aggatgtggt ttaccttttg ctgttggtca aaaggaactt 2580
gtgccaccac tttctagttt gtttgttttg acacttggtg ctggcaattc ggttgctcac 2640
gggctgatca attccttttg ctgatttctt tttgagagaa atcccttttt ctttttcttt 2700
tttatatata tgccttttta tcttgcaact cattcatttc tcatgtttct gaaggtaacc 2760
tggataagac ggtggaggag ctatcaaatg tttatagagg agctcaagaa accaatagaa 2820
agctctatca ggaactgttc ccaattcagg tagtatactg gtttccgttt accaattgtt 2880
tcttgctatc gatcgatatc attaatcatc atatcttgtt caagccttct tgaaggttca 2940
aacatccttg taaatctgaa ctgagacatc atcttagtca ctgttgtttg tccagcagcc 3000
cacctgcatg ttcacatgta atctgacacc actaatttga acctactgtt actgtcaggc 3060
attgaaaaaa ccaggcatct ctgaatatcc tctatttgta gctatcatga acaacctgtt 3120
tgagacccct cataagattc ttgcatgctt gccagtgcaa tttagaatca agttacttag 3180
caaaattttc tgaataaatt ttgtagtatg ctcctcagct atatttattc tgcctatggt 3240
ctgatatttc tcaccaaaca gagcttctaa caatcttcat tctgtgcact gcagattatg 3300
caaatgtgtg gatggttact gaaggttgtt cgttgggaaa acttaaattg tgtgcacatg 3360
gaagctcatg gcaatcgtcg cagcagtcca acataccttg ttatgctgtg gatgatttcg 3420
gtagctagcc tattgataac atgtcgtggc agtatccaga agcaagttct ctttccaggg 3480
ttcactgccg cgcaaatgga ttacattgat aacgatggga tatttctgct ttctaatggc 3540
tctgtctttg gctttggttt tgtcacgagc aatgtctcag acaacacgtt ctacattctt 3600
gcagtggttc acatggccac tactaccaca gtctggtctg ccaatcctaa ctctcctgtc 3660
acccattcag atgacttttt tttcgacaag gatggcaatg ccttcctgca gtcaggagga 3720
ggctccaatg tatgggctgc caatatctcc gggaaaggga ctgccacctc tatgcaacta 3780
ctggactctg gcaatcttgt agtgcttggg aaagatgcct cttctcctct ctggcaaagt 3840
ttcagccatc cgacagacac tcttctgtct ggtcagaatt tcatcgaagg gatgacgctg 3900
atgagcaagt ccaacacagt acagaacatg acctatacac ttcagatcaa atctgggaac 3960
atgatgttat acgccggctt cgagacacct caaccatact ggtctgcaca gcaggatagc 4020
aggataattg tcaacaagaa cggtgacagc atctactctg caaacctcag ttcagcttct 4080
tggtccttct atgatcaatc agggtccctt ctatcacaac ttgtcatcgc gcaagaaaat 4140
gccaatgcca cattgtctgc tgtccttggt agtgatggat tgatagcttt ctatatgctg 4200
cagggtggaa atggcaagag taaattctcg atcacagttc cggcagactc ttgtgacatg 4260
ccagcctact gcagtcctta caccatttgc agtagtggga caggttgcca atgcccttcg 4320
gccctcggct cgtttgcaaa ctgcaatcct ggtgttacat cagcatgcaa atcgaacgag 4380
gagtttccgc tggttcaact ggatagtgga gttggatatg taggcactaa cttcttccct 4440
cctgcggcta agacgaacct tacgggttgt aagagtgcct gtacaggcaa ctgctcttgt 4500
gttgctgtgt tctttgatca atcttcaggc aattgtttcc ttttcaacca gatcggaagc 4560
ttgcagcaca aaggtgggaa tacaactcgt ttcgcatctt ttatcaaggt atcaagcaga 4620
ggaaaaggtg ggagtgatag tggcagtggg aagcacaata ccattattat tgtcattata 4680
ctcggaactt tggctatcat aggcgtcctt atttatattg gtttctggat ctacaagagg 4740
aagaggcatc ctccaccatc acaagacgac gctggttcat cggaagatga tggatttctg 4800
caaacaatat ccggagcacc agtgcggttc acttacaggg agctccagga tgcgacaagc 4860
aacttctgta acaagcttgg tcagggaggg tttggatctg tgtatcttgg tacactccca 4920
gacggcagtc gtattgctgt gaagaagctg gagggcatag gccaaggaaa gaaagagttc 4980
cgctctgagg taacgatcat tggtagtatc caccacatcc atcttgtcaa actccgaggc 5040
ttttgtactg agggaccaca caggcttctt gcctacgagt acatggcgaa tgggtcgctg 5100
gataagtgga ttttccattc taaagaagat gatcacctgc tcgactggga tacaaggttt 5160
aacattgcgc ttggaacggc aaagggattg gcatacctcc atcaggactg cgattcgaag 5220
attgtacact gtgacattaa gcctgagaat gttctacttg acgacaactt catcgcaaag 5280
gtatctgatt ttggccttgc caagttgatg accagggagc agagccatgt tttcactacg 5340
ctcagaggca cgcgtgggta ccttgcacct gagtggctca ccaactatgc catctcagag 5400
aagagtgatg tgtacagcta cggcatggtt ttgcttgaga taatcggtgg gaggaagagc 5460
tacgatccct cggagatctc cgagaaggct cacttccctt cctttgcatt caagaagctg 5520
gaggaaggtg atcttcagga catcttcgac gccaagctga agtacaatga caaggatggg 5580
cgggtcgaga ccgcgatcaa ggtcgcgctc tggtgcatcc aggatgattt ctaccagaga 5640
ccatccatgt caaaggttgt gcagatgctc gaaggcgtct gcgaggtgct ccagccaccg 5700
gtgtcgtcgc agatcgggta caggctctac gcaaacgcct tcaaatcgag cagcgaggag 5760
gggacttcat cagggatgtc ggactacaac agtgatgctc tgctttcagc tgtgaggctc 5820
tctggtccca gatgatgtga agaatcccat gtacagtgcc ttgtctagtt aggttgcaaa 5880
gtgtgcaaat tttgctgtag tttccagtgt tttggtgatc atttgcttca cactattgta 5940
catatcttct tggtcatttc tggtggtagt ttatacatat cttgctgatt atttatggtg 6000
gtagtttatc ggtgccattc tttttttgtt gcccttttgc ttatacataa ggtctccaaa 6060
acctttgaca attacctttt gtagttatgt cttagtaaaa ataataggaa atgcaatgat 6120
acaaaagcct ttttcatcag acctttcagt atcattttca agtcacaatt cttgtaacct 6180
tttgtgtatt caagaggtca ttgtttctga aatttgacat taaaaaaatg gcataacaat 6240
ggcacagtga tctgtaacta g 6261
<210>2
<211>2935
<212>DNA
<213〉paddy rice belongs to paddy rice (Oryza sativa var.Lansheng)
<400>2
tcttcattct gtgcactgca gattatgcaa atgtgtggat ggttactgaa ggttcagtcc 60
aacatacctt gttatgctgt ggatgatttc ggtagttcgt tgggaaaact taaattgtgt 120
gcacatggaa gctcatggca atcgtcgcag cagtccaaca taccttgtta tgctgtggat 180
gatttcggta gctagcctat tgataacatg tcgtggcagt atccagaagc aagttctctt 240
tccagggttc actgccgcgc aaatggatta cattgataac gatgggatat ttctgctttc 300
taatggctct gtctttggct ttggttttgt cacgagcaat gtctcagaca acacgttcta 360
cattcttgca gtggttcaca tggccactac taccacagtc tggtctgcca atcctaactc 420
tcctgtcacc cattcagatg actttttttt cgacaaggat ggcaatgcct tcctgcagtc 480
aggaggaggc tccaatgtat gggctgccaa tatctccggg aaagggactg ccacctctat 540
gcaactactg gactctggca atcttgtagt gcttgggaaa gatgcctctt ctcctctctg 600
gcaaagtttc agccatccga cagacactct tctgtctggt cagaatttca tcgaagggat 660
gacgctgatg agcaagtcca acacagtaca gaacatgacc tatacacttc agatcaaatc 720
tgggaacatg atgttatacg ccggcttcga gacacctcaa ccatactggt ctgcacagca 780
ggatagcagg ataattgtca acaagaacgg tgacagcatc tactctgcaa acctcagttc 840
agcttcttgg tccttctatg atcaatcagg gtcccttcta tcacaacttg tcatcgcgca 900
agaaaatgcc aatgccacat tgtctgctgt ccttggtagt gatggattga tagctttcta 960
tatgctgcag ggtggaaatg gcaagagtaa attctcgatc acagttccgg cagactcttg 1020
tgacatgcca gcctactgca gtccttacac catttgcagt agtgggacag gttgccaatg 1080
cccttcggcc ctcggctcgt ttgcaaactg caatcctggt gttacatcag catgcaaatc 1140
gaacgaggag tttccgctgg ttcaactgga tagtggagtt ggatatgtag gcactaactt 1200
cttccctcct gcggctaaga cgaaccttac gggttgtaag agtgcctgta caggcaactg 1260
ctcttgtgtt gctgtgttct ttgatcaatc ttcaggcaat tgtttccttt tcaaccagat 1320
cggaagcttg cagcacaaag gtgggaatac aactcgtttc gcatctttta tcaaggtatc 1380
aagcagagga aaaggtggga gtgatagtgg cagtgggaag cacaatacca ttattattgt 1440
cattatactc ggaactttgg ctatcatagg cgtccttatt tatattggtt tctggatcta 1500
caagaggaag aggcatcctc caccatcaca agacgacgct ggttcatcgg aagatgatgg 1560
atttctgcaa acaatatccg gagcaccagt gcggttcact tacagggagc tccaggatgc 1620
gacaagcaac ttctgtaaca agcttggtca gggagggttt ggatctgtgt atcttggtac 1680
actcccagac ggcagtcgta ttgctgtgaa gaagctggag ggcataggcc aaggaaagaa 1740
agagttccgc tctgaggtaa cgatcattgg tagtatccac cacatccatc ttgtcaaact 1800
ccgaggcttt tgtactgagg gaccacacag gcttcttgcc tacgagtaca tggcgaatgg 1860
gtcgctggat aagtggattt tccattctaa agaagatgat cacctgctcg actgggatac 1920
aaggtttaac attgcgcttg gaacggcaaa gggattggca tacctccatc aggactgcga 1980
ttcgaagatt gtacactgtg acattaagcc tgagaatgtt ctacttgacg acaacttcat 2040
cgcaaaggta tctgattttg gccttgccaa gttgatgacc agggagcaga gccatgtttt 2100
cactacgctc agaggcacgc gtgggtacct tgcacctgag tggctcacca actatgccat 2160
ctcagagaag agtgatgtgt acagctacgg catggttttg cttgagataa tcggtgggag 2220
gaagagctac gatccctcgg agatctccga aaaggctcac ttcccttcct ttgcattcaa 2280
gaagctggag gaaggtgatc ttcaggacat cttcgacgcc aagctgaagt acaatgacaa 2340
ggatgggcgg gtcgagaccg cgatcaaggt cgcgctctgg tgcatccagg atgatttcta 2400
ccagagacca tccatgtcaa aggttgtgca gatgctcgaa ggcgtctgcg aggtgctcca 2460
gccaccggtg tcgtcgcaga tcgggtacag gctctacgca aacgccttca aatcgagcag 2520
cgaggagggg acttcatcag ggatgtcgga ctacaacagt gatgctctgc tttcagctgt 2580
gaggctctct ggtcccagat gatgtgaaga atcccatgta cagtgccttg tctagttagg 2640
ttgcaaagtg tgcaaatttt gctgtagttt ccagtgtttt ggtgatcatt tgcttcacac 2700
tattgtacat atcttcttgg tcatttctgg tggtagttta tacatatctt gctgattatt 2760
tatggtggta gtttatcggt gccattcttt ttttgttgcc cttttgctta tacataaggt 2820
ctccaaaacc tttgacaatt accttttgta gttatgtctt ggtaaaaata ataggaaatg 2880
caatgataca aaagcctttt tcatcagaaa aaaaaaaaaa aaaaaaaaaa aaaaa 2935
<210>3
<211>825
<212>PRT
<213〉paddy rice belongs to paddy rice (Oryza sativa var.Lansheng)
<400>3
Met Glu Ala His Gly Asn Arg Arg Ser Ser Pro Thr Tyr Leu Val Met
1 5 10 15
Leu Trp Met Ile Ser Val Ala Ser Leu Leu Ile Thr Cys Arg Gly Ser
20 25 30
Ile Gln Lys Gln Val Leu Phe Pro Gly Phe Thr Ala Ala Gln Met Asp
35 40 45
Tyr Ile Asp Asn Asp Gly Ile Phe Leu Leu Ser Asn Gly Ser Val Phe
50 55 60
Gly Phe Gly Phe Val Thr Ser Asn Val Ser Asp Asn Thr Phe Tyr Ile
65 70 75 80
Leu Ala Val Val His Met Ala Thr Thr Thr Thr Val Trp Ser Ala Asn
85 90 95
Pro Asn Ser Pro Val Thr His Ser Asp Asp Phe Phe Phe Asp Lys Asp
100 105 110
Gly Asn Ala Phe Leu Gln Ser Gly Gly Gly Ser Asn Val Trp Ala Ala
115 120 125
Asn Ile Ser Gly Lys Gly Thr Ala Thr Ser Met Gln Leu Leu Asp Ser
130 135 140
Gly Asn Leu Val Val Leu Gly Lys Asp Ala Ser Ser Pro Leu Trp Gln
145 150 155 160
Ser Phe Ser His Pro Thr Asp Thr Leu Leu Ser Gly Gln Asn Phe Ile
165 170 175
Glu Gly Met Thr Leu Met Ser Lys Ser Asn Thr Val Gln Asn Met Thr
180 185 190
Tyr Thr Leu Gln Ile Lys Ser Gly Asn Met Met Leu Tyr Ala Gly Phe
195 200 205
Glu Thr Pro Gln Pro Tyr Trp Ser Ala Gln Gln Asp Ser Arg Ile Ile
210 215 220
Val Asn Lys Asn Gly Asp Ser Ile Tyr Ser Ala Asn Leu Ser Ser Ala
225 230 235 240
Ser Trp Ser Phe Tyr Asp Gln Ser Gly Ser Leu Leu Ser Gln Leu Val
245 250 255
Ile Ala Gln Glu Asn Ala Asn Ala Thr Leu Ser Ala Val Leu Gly Ser
260 265 270
Asp Gly Leu Ile Ala Phe Tyr Met Leu Gln Gly Gly Asn Gly Lys Ser
275 280 285
Lys Phe Ser Ile Thr Val Pro Ala Asp Ser Cys Asp Met Pro Ala Tyr
290 295 300
Cys Ser Pro Tyr Thr Ile Cys Ser Ser Gly Thr Gly Cys Gln Cys Pro
305 310 315 320
Ser Ala Leu Gly Ser Phe Ala Asn Cys Asn Pro Gly Val Thr Ser Ala
325 330 335
Cys Lys Ser Asn Glu Glu Phe Pro Leu Val Gln Leu Asp Ser Gly Val
340 345 350
Gly Tyr Val Gly Thr Asn Phe Phe Pro Pro Ala Ala Lys Thr Asn Leu
355 360 365
Thr Gly Cys Lys Ser Ala Cys Thr Gly Asn Cys Ser Cys Val Ala Val
370 375 380
Phe Phe Asp Gln Ser Ser Gly Asn Cys Phe Leu Phe Asn Gln Ile Gly
385 390 395 400
Ser Leu Gln His Lys Gly Gly Asn Thr Thr Arg Phe Ala Ser Phe Ile
405 410 415
Lys Val Ser Ser Arg Gly Lys Gly Gly Ser Asp Ser Gly Ser Gly Lys
420 425 430
His Asn Thr Ile Ile Ile Val Ile Ile Leu Gly Thr Leu Ala Ile Ile
435 440 445
Gly Val Leu Ile Tyr Ile Gly Phe Trp Ile Tyr Lys Arg Lys Arg His
450 455 460
Pro Pro Pro Ser Gln Asp Asp Ala Gly Ser Ser Glu Asp Asp Gly Phe
465 470 475 480
Leu Gln Thr Ile Ser Gly Ala Pro Val Arg Phe Thr Tyr Arg Glu Leu
485 490 495
Gln Asp Ala Thr Ser Asn Phe Cys Asn Lys Leu Gly Gln Gly Gly Phe
500 505 510
Gly Ser Val Tyr Leu Gly Thr Leu Pro Asp Gly Ser Arg Ile Ala Val
515 520 525
Lys Lys Leu Glu Gly Ile Gly Gln Gly Lys Lys Glu Phe Arg Ser Glu
530 535 540
Val Thr Ile Ile Gly Ser Ile His His Ile His Leu Val Lys Leu Arg
545 550 555 560
Gly Phe Cys Thr Glu Gly Pro His Arg Leu Leu Ala Tyr Glu Tyr Met
565 570 575
Ala Asn Gly Ser Leu Asp Lys Trp Ile Phe His Ser Lys Glu Asp Asp
580 585 590
His Leu Leu Asp Trp Asp Thr Arg Phe Asn Ile Ala Leu Gly Thr Ala
595 600 605
Lys Gly Leu Ala Tyr Leu His Gln Asp Cys Asp Ser Lys Ile Val His
610 615 620
Cys Asp Ile Lys Pro Glu Asn Val Leu Leu Asp Asp Asn Phe Ile Ala
625 630 635 640
Lys Val Ser Asp Phe Gly Leu Ala Lys Leu Met Thr Arg Glu Gln Ser
645 650 655
His Val Phe Thr Thr Leu Arg Gly Thr Arg Gly Tyr Leu Ala Pro Glu
660 665 670
Trp Leu Thr Asn Tyr Ala Ile Ser Glu Lys Ser Asp Val Tyr Ser Tyr
675 680 685
Gly Met Val Leu Leu Glu Ile Ile Gly Gly Arg Lys Ser Tyr Asp Pro
690 695 700
Ser Glu Ile Ser Glu Lys Ala His Phe Pro Ser Phe Ala Phe Lys Lys
705 710 715 720
Leu Glu Glu Gly Asp Leu Gln Asp Ile Phe Asp Ala Lys Leu Lys Tyr
725 730 735
Asn Asp Lys Asp Gly Arg Val Glu Thr Ala Ile Lys Val Ala Leu Trp
740 745 750
Cys Ile Gln Asp Asp Phe Tyr Gln Arg Pro Ser Met Ser Lys Val Val
755 760 765
Gln Met Leu Glu Gly Val Cys Glu Val Leu Gln Pro Pro Val Ser Ser
770 775 780
Gln Ile Gly Tyr Arg Leu Tyr Ala Asn Ala Phe Lys Ser Ser Ser Glu
785 790 795 800
Glu Gly Thr Ser Ser Gly Met Ser Asp Tyr Asn Ser Asp Ala Leu Leu
805 810 815
Ser Ala Val Arg Leu Ser Gly Pro Arg
820 825
Claims (9)
1, a kind of blast resistant gene is one of following nucleotide sequences:
1) the SEQ ID № in the sequence table: 1;
2) the SEQ ID № in the sequence table: 2;
3) SEQ ID № in the code sequence tabulation: the polynucleotide of 3 protein sequences.
2, gene according to claim 1 is characterized in that: described blast resistant gene is the SEQ ID № in the sequence table: 1 or SEQ ID №: 2.
3, gene according to claim 2 is characterized in that: the reading frame of described blast resistant gene is SEQ ID № in the sequence table: 1 SEQ ID № in 5 ' end the 3361st to the 5835th bit base or sequence table: 2 from 5 ' end the 125th to the 2599th bit base.
4, the proteins encoded of blast resistant gene, be to have SEQ ID № in the sequence table: the protein of 3 amino acid residue sequences, or with SEQ ID №: 3 amino acid residue sequence is through replacement, disappearance or the interpolation of one or several amino-acid residue and have the № with SEQ ID: 3 amino acid residue sequence is identical active by SEQ ID №: 3 deutero-protein.
5, protein according to claim 4 is characterized in that: the proteins encoded of described blast resistant gene is the SEQ ID № in the sequence table: 3.
6, according to claim 4 or 5 described protein, it is characterized in that: SEQ ID № in the described sequence table: 3 be a membrane spaning domain from aminoterminal 12-34 amino acids residue sequence; From aminoterminal 48-165 amino acids residue sequence is an allosome recognition structure territory similar to exogenous agglutinin protein; From aminoterminal 436-458 amino acids residue sequence is a membrane spaning domain; From aminoterminal 501-771 amino acids residue sequence is a serine/threonine protein kitase structural domain.
7, the expression vector that contains the described blast resistant gene of claim 1.
8, the transgenic cell line that contains the described blast resistant gene of claim 1.
9, the application of the described blast resistant gene of claim 1 in cultivating the disease-resistant plants kind.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB2003101184339A CN1297661C (en) | 2003-12-16 | 2003-12-16 | A rice blast resistance gene, its encoded protein and use thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB2003101184339A CN1297661C (en) | 2003-12-16 | 2003-12-16 | A rice blast resistance gene, its encoded protein and use thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1629293A CN1629293A (en) | 2005-06-22 |
CN1297661C true CN1297661C (en) | 2007-01-31 |
Family
ID=34843790
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB2003101184339A Expired - Fee Related CN1297661C (en) | 2003-12-16 | 2003-12-16 | A rice blast resistance gene, its encoded protein and use thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1297661C (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101062944B (en) * | 2007-05-16 | 2011-11-16 | 中国科学院遗传与发育生物学研究所 | Vegetable disease-resistant protein and its coding gene and application |
CN104372011B (en) * | 2014-09-11 | 2017-01-11 | 南京大学 | Rice blast resistance gene RMg41 and applications thereof |
CN104404052B (en) * | 2014-09-11 | 2017-01-25 | 南京大学 | Rice blast resistance gene RMg39 and its application |
CN107760692A (en) * | 2017-11-17 | 2018-03-06 | 四川大学 | Mannose-binding protein is used for the application of prepare transgenosis anti-rice blast rice |
CN109134633B (en) * | 2018-09-25 | 2020-10-09 | 四川农业大学 | Rice blast resistant protein and gene, isolated nucleic acid and application thereof |
CN114805507B (en) * | 2021-01-28 | 2024-02-09 | 中国科学院遗传与发育生物学研究所 | Rice OsREIN1 T219I Protein, encoding gene and application thereof |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1104251A (en) * | 1993-07-29 | 1995-06-28 | 农林水产省农业生物资源研究所 | Nucleic acid markers for rice blast resistance genes and rice blast resistance genes isolated by the use of these markers |
-
2003
- 2003-12-16 CN CNB2003101184339A patent/CN1297661C/en not_active Expired - Fee Related
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1104251A (en) * | 1993-07-29 | 1995-06-28 | 农林水产省农业生物资源研究所 | Nucleic acid markers for rice blast resistance genes and rice blast resistance genes isolated by the use of these markers |
Also Published As
Publication number | Publication date |
---|---|
CN1629293A (en) | 2005-06-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1154740C (en) | Root cortex specific gene promoter | |
CN1807453A (en) | Bacterial leaf spot resistance related protein and its coding gene and uses | |
CN1950509A (en) | High lysine maize compositions and methods for detection thereof | |
CN1201012C (en) | Alteration of flowering time in plants | |
CN1258593C (en) | Putrescine-N-methyl transferase promoter | |
CN1844396A (en) | Gene adjusting and controlling rice tillering angle and its coded protein and use | |
CN1844393A (en) | Resistance gene Pi37 against rice blast and use thereof | |
CN1854154A (en) | Rice blast resistant related protein, its coding gene and use | |
CN1831127A (en) | Key gene for controlling chlorophyll metabolism and method for establishing plant green residence character therewith | |
CN101062944A (en) | Vegetable disease-resistant protein and its coding gene and application | |
CN1861791A (en) | Recessive gene xa13 of rice bacterial blight resistance and its allelic dominant gene xa13 | |
CN1297661C (en) | A rice blast resistance gene, its encoded protein and use thereof | |
CN1202254C (en) | Paddy rice anti bacterial leaf-blight gene Xa26(t) | |
CN1751065A (en) | Gene resistant to aphis gossypii | |
CN1840542A (en) | Rice tillering related protein, genes encoding same, and use thereof | |
CN1821406A (en) | Resistance gene Pi 36 of rice blast and its use | |
CN1908171A (en) | Amylose content control gene DU1 of rice endosperm and application thereof | |
CN1306041C (en) | Molecular marker of rice blast resistance gene, its dedicated primer and application thereof | |
CN101050232A (en) | Pi15 resistance gene of rice blast, and application | |
CN1295334C (en) | Wheat antidisense related gene TaEDR1 and its application | |
CN1869232A (en) | Paddy rice hybrid fertility gene and its application | |
CN1570110A (en) | Rice grain gelatinization temperature main control gene ALK and its uses | |
CN1566146A (en) | Paddy rice stalk extension gene, coded protein and application thereof | |
CN1546666A (en) | Bacterial leaf spot resistance related gene of rice, protein and its uses | |
CN1816624A (en) | Rice transposon gene |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20070131 Termination date: 20141216 |
|
EXPY | Termination of patent right or utility model |