CN102041262B - 稻瘟病抗性基因Pik-p及其应用 - Google Patents
稻瘟病抗性基因Pik-p及其应用 Download PDFInfo
- Publication number
- CN102041262B CN102041262B CN 200910236466 CN200910236466A CN102041262B CN 102041262 B CN102041262 B CN 102041262B CN 200910236466 CN200910236466 CN 200910236466 CN 200910236466 A CN200910236466 A CN 200910236466A CN 102041262 B CN102041262 B CN 102041262B
- Authority
- CN
- China
- Prior art keywords
- leu
- gene
- ile
- ser
- gly
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 227
- 235000007164 Oryza sativa Nutrition 0.000 title claims abstract description 71
- 235000009566 rice Nutrition 0.000 title claims abstract description 71
- 240000007594 Oryza sativa Species 0.000 title claims 2
- 230000014509 gene expression Effects 0.000 claims abstract description 24
- 239000002773 nucleotide Substances 0.000 claims abstract description 14
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 14
- 239000002299 complementary DNA Substances 0.000 claims description 14
- 239000013604 expression vector Substances 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 241000209094 Oryza Species 0.000 abstract description 81
- 201000010099 disease Diseases 0.000 abstract description 60
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 60
- 241000196324 Embryophyta Species 0.000 abstract description 56
- 150000001413 amino acids Chemical class 0.000 abstract description 23
- 108090000765 processed proteins & peptides Proteins 0.000 abstract description 13
- 229920001184 polypeptide Polymers 0.000 abstract description 11
- 102000004196 processed proteins & peptides Human genes 0.000 abstract description 11
- 238000009399 inbreeding Methods 0.000 abstract 1
- 230000001131 transforming effect Effects 0.000 abstract 1
- 108020004414 DNA Proteins 0.000 description 33
- 230000009368 gene silencing by RNA Effects 0.000 description 22
- 108091030071 RNAI Proteins 0.000 description 21
- 108091000080 Phosphotransferase Proteins 0.000 description 18
- 102000020233 phosphotransferase Human genes 0.000 description 18
- 230000006870 function Effects 0.000 description 17
- 230000009182 swimming Effects 0.000 description 16
- 238000009395 breeding Methods 0.000 description 15
- 230000001488 breeding effect Effects 0.000 description 14
- 238000000034 method Methods 0.000 description 14
- 102000004169 proteins and genes Human genes 0.000 description 14
- 235000018102 proteins Nutrition 0.000 description 13
- 238000004458 analytical method Methods 0.000 description 12
- 125000000539 amino acid group Chemical group 0.000 description 11
- 238000006243 chemical reaction Methods 0.000 description 11
- 239000012634 fragment Substances 0.000 description 11
- 239000003550 marker Substances 0.000 description 11
- 235000001014 amino acid Nutrition 0.000 description 10
- 238000012408 PCR amplification Methods 0.000 description 9
- 108010005233 alanylglutamic acid Proteins 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 8
- 230000008659 phytopathology Effects 0.000 description 8
- 208000035240 Disease Resistance Diseases 0.000 description 7
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 7
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 7
- 241001330975 Magnaporthe oryzae Species 0.000 description 7
- 230000001580 bacterial effect Effects 0.000 description 7
- 230000000295 complement effect Effects 0.000 description 7
- 238000003167 genetic complementation Methods 0.000 description 7
- 238000011081 inoculation Methods 0.000 description 7
- 230000009466 transformation Effects 0.000 description 7
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 6
- 230000002068 genetic effect Effects 0.000 description 6
- 108010050848 glycylleucine Proteins 0.000 description 6
- 108010057821 leucylproline Proteins 0.000 description 6
- 241000209510 Liliopsida Species 0.000 description 5
- 108091034117 Oligonucleotide Proteins 0.000 description 5
- 244000052616 bacterial pathogen Species 0.000 description 5
- 210000004027 cell Anatomy 0.000 description 5
- 241001233957 eudicotyledons Species 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 238000012360 testing method Methods 0.000 description 5
- 108020005345 3' Untranslated Regions Proteins 0.000 description 4
- 108020003589 5' Untranslated Regions Proteins 0.000 description 4
- 108700028369 Alleles Proteins 0.000 description 4
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 108010078144 glutaminyl-glycine Proteins 0.000 description 4
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 4
- 230000005764 inhibitory process Effects 0.000 description 4
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 4
- 238000013507 mapping Methods 0.000 description 4
- 239000003147 molecular marker Substances 0.000 description 4
- 239000013642 negative control Substances 0.000 description 4
- 239000013641 positive control Substances 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 238000003757 reverse transcription PCR Methods 0.000 description 4
- 238000001228 spectrum Methods 0.000 description 4
- 238000006467 substitution reaction Methods 0.000 description 4
- 210000001519 tissue Anatomy 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 3
- 108700026244 Open Reading Frames Proteins 0.000 description 3
- 101000706985 Pinus strobus Putative disease resistance protein PS10 Proteins 0.000 description 3
- 230000008485 antagonism Effects 0.000 description 3
- 108010013835 arginine glutamate Proteins 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 238000013467 fragmentation Methods 0.000 description 3
- 238000006062 fragmentation reaction Methods 0.000 description 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 108010092114 histidylphenylalanine Proteins 0.000 description 3
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 3
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 230000009261 transgenic effect Effects 0.000 description 3
- GNYCTMYOHGBSBI-SVZOTFJBSA-N (3s,6r,9s,12r)-6,9-dimethyl-3-[6-[(2s)-oxiran-2-yl]-6-oxohexyl]-1,4,7,10-tetrazabicyclo[10.3.0]pentadecane-2,5,8,11-tetrone Chemical compound C([C@H]1C(=O)N2CCC[C@@H]2C(=O)N[C@H](C(N[C@H](C)C(=O)N1)=O)C)CCCCC(=O)[C@@H]1CO1 GNYCTMYOHGBSBI-SVZOTFJBSA-N 0.000 description 2
- 241000589158 Agrobacterium Species 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 2
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 2
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 2
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 2
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 2
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 2
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 2
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 2
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 2
- 241000228439 Bipolaris zeicola Species 0.000 description 2
- 108700003861 Dominant Genes Proteins 0.000 description 2
- 108700024394 Exon Proteins 0.000 description 2
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 2
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 2
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 2
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 2
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 2
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- 108010051041 HC toxin Proteins 0.000 description 2
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 2
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 2
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 2
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 2
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 2
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 2
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- SVJRVFPSHPGWFF-DCAQKATOSA-N Lys-Cys-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVJRVFPSHPGWFF-DCAQKATOSA-N 0.000 description 2
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 2
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 2
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 2
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 2
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 2
- 108091092878 Microsatellite Proteins 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 2
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 2
- 102000001253 Protein Kinase Human genes 0.000 description 2
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 2
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 2
- ASJDFGOPDCVXTG-KATARQTJSA-N Thr-Cys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ASJDFGOPDCVXTG-KATARQTJSA-N 0.000 description 2
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 2
- 108091036066 Three prime untranslated region Proteins 0.000 description 2
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 2
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 2
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 2
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 2
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- 240000008042 Zea mays Species 0.000 description 2
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 2
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000010835 comparative analysis Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 235000005822 corn Nutrition 0.000 description 2
- 108010004073 cysteinylcysteine Proteins 0.000 description 2
- 230000004665 defense response Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 235000013399 edible fruits Nutrition 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 235000013305 food Nutrition 0.000 description 2
- 244000053095 fungal pathogen Species 0.000 description 2
- 238000012252 genetic analysis Methods 0.000 description 2
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- GNYCTMYOHGBSBI-UHFFFAOYSA-N helminthsporium carbonum toxin Natural products N1C(=O)C(C)NC(=O)C(C)NC(=O)C2CCCN2C(=O)C1CCCCCC(=O)C1CO1 GNYCTMYOHGBSBI-UHFFFAOYSA-N 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 230000001965 increasing effect Effects 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 229910052742 iron Inorganic materials 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 108060006633 protein kinase Proteins 0.000 description 2
- 108020003175 receptors Proteins 0.000 description 2
- 102000005962 receptors Human genes 0.000 description 2
- 238000010839 reverse transcription Methods 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- CNKBMTKICGGSCQ-ACRUOGEOSA-N (2S)-2-[[(2S)-2-[[(2S)-2,6-diamino-1-oxohexyl]amino]-1-oxo-3-phenylpropyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CNKBMTKICGGSCQ-ACRUOGEOSA-N 0.000 description 1
- 101150028074 2 gene Proteins 0.000 description 1
- 102100037563 40S ribosomal protein S2 Human genes 0.000 description 1
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 1
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- JWUZOJXDJDEQEM-ZLIFDBKOSA-N Ala-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 JWUZOJXDJDEQEM-ZLIFDBKOSA-N 0.000 description 1
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- 101100038641 Arabidopsis thaliana RPP5 gene Proteins 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 1
- GFMWTFHOZGLTLC-AVGNSLFASA-N Arg-His-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O GFMWTFHOZGLTLC-AVGNSLFASA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 1
- POZKLUIXMHIULG-FDARSICLSA-N Arg-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCN=C(N)N)N POZKLUIXMHIULG-FDARSICLSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 1
- FXGMURPOWCKNAZ-JYJNAYRXSA-N Arg-Val-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FXGMURPOWCKNAZ-JYJNAYRXSA-N 0.000 description 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 1
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 1
- HUZGPXBILPMCHM-IHRRRGAJSA-N Asn-Arg-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HUZGPXBILPMCHM-IHRRRGAJSA-N 0.000 description 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- ZMWDUIIACVLIHK-GHCJXIJMSA-N Asn-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N ZMWDUIIACVLIHK-GHCJXIJMSA-N 0.000 description 1
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 1
- IKLAUGBIDCDFOY-SRVKXCTJSA-N Asn-His-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IKLAUGBIDCDFOY-SRVKXCTJSA-N 0.000 description 1
- SXNJBDYEBOUYOJ-DCAQKATOSA-N Asn-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N SXNJBDYEBOUYOJ-DCAQKATOSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 1
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 1
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 1
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 1
- WLKVEEODTPQPLI-ACZMJKKPSA-N Asp-Gln-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WLKVEEODTPQPLI-ACZMJKKPSA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- XSXVLWBWIPKUSN-UHFFFAOYSA-N Asp-Leu-Glu-Asp Chemical compound OC(=O)CC(N)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(O)=O)C(O)=O XSXVLWBWIPKUSN-UHFFFAOYSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- BOXNGMVEVOGXOJ-UBHSHLNASA-N Asp-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N BOXNGMVEVOGXOJ-UBHSHLNASA-N 0.000 description 1
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 101150079123 Bad gene Proteins 0.000 description 1
- 235000016068 Berberis vulgaris Nutrition 0.000 description 1
- 241000335053 Beta vulgaris Species 0.000 description 1
- 101100190464 Caenorhabditis elegans pid-2 gene Proteins 0.000 description 1
- 101100190466 Caenorhabditis elegans pid-3 gene Proteins 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 1
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 1
- PORWNQWEEIOIRH-XHNCKOQMSA-N Cys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)C(=O)O PORWNQWEEIOIRH-XHNCKOQMSA-N 0.000 description 1
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 1
- UQHYQYXOLIYNSR-CUJWVEQBSA-N Cys-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N)O UQHYQYXOLIYNSR-CUJWVEQBSA-N 0.000 description 1
- DVIHGGUODLILFN-GHCJXIJMSA-N Cys-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DVIHGGUODLILFN-GHCJXIJMSA-N 0.000 description 1
- IZUNQDRIAOLWCN-YUMQZZPRSA-N Cys-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N IZUNQDRIAOLWCN-YUMQZZPRSA-N 0.000 description 1
- NIXHTNJAGGFBAW-CIUDSAMLSA-N Cys-Lys-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N NIXHTNJAGGFBAW-CIUDSAMLSA-N 0.000 description 1
- TXGDWPBLUFQODU-XGEHTFHBSA-N Cys-Pro-Thr Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O TXGDWPBLUFQODU-XGEHTFHBSA-N 0.000 description 1
- IWVNIQXKTIQXCT-SRVKXCTJSA-N Cys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O IWVNIQXKTIQXCT-SRVKXCTJSA-N 0.000 description 1
- LHRCZIRWNFRIRG-SRVKXCTJSA-N Cys-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)O LHRCZIRWNFRIRG-SRVKXCTJSA-N 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- RKAQZCDMSUQTSS-FXQIFTODSA-N Gln-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKAQZCDMSUQTSS-FXQIFTODSA-N 0.000 description 1
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 1
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- IWUFOVSLWADEJC-AVGNSLFASA-N Gln-His-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IWUFOVSLWADEJC-AVGNSLFASA-N 0.000 description 1
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 1
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 1
- ILKYYKRAULNYMS-JYJNAYRXSA-N Gln-Lys-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ILKYYKRAULNYMS-JYJNAYRXSA-N 0.000 description 1
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 1
- ROHVCXBMIAAASL-HJGDQZAQSA-N Gln-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)N)N)O ROHVCXBMIAAASL-HJGDQZAQSA-N 0.000 description 1
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 1
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 1
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- ZCFNZTVIDMLUQC-SXNHZJKMSA-N Glu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZCFNZTVIDMLUQC-SXNHZJKMSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- AOCARQDSFTWWFT-DCAQKATOSA-N Glu-Met-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AOCARQDSFTWWFT-DCAQKATOSA-N 0.000 description 1
- RXESHTOTINOODU-JYJNAYRXSA-N Glu-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RXESHTOTINOODU-JYJNAYRXSA-N 0.000 description 1
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 1
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- QVXWAFZDWRLXTI-NWLDYVSISA-N Glu-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QVXWAFZDWRLXTI-NWLDYVSISA-N 0.000 description 1
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 1
- QGAJQIGFFIQJJK-IHRRRGAJSA-N Glu-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QGAJQIGFFIQJJK-IHRRRGAJSA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- SIYTVHWNKGIGMD-HOTGVXAUSA-N Gly-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)CN SIYTVHWNKGIGMD-HOTGVXAUSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 1
- LUJVWKKYHSLULQ-ZKWXMUAHSA-N Gly-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN LUJVWKKYHSLULQ-ZKWXMUAHSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 1
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 1
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 1
- WTUSRDZLLWGYAT-KCTSRDHCSA-N Gly-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN WTUSRDZLLWGYAT-KCTSRDHCSA-N 0.000 description 1
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- UCDWNBFOZCZSNV-AVGNSLFASA-N His-Arg-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O UCDWNBFOZCZSNV-AVGNSLFASA-N 0.000 description 1
- WZOGEMJIZBNFBK-CIUDSAMLSA-N His-Asp-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WZOGEMJIZBNFBK-CIUDSAMLSA-N 0.000 description 1
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 1
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 1
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 1
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 1
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 1
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 1
- DGLAHESNTJWGDO-SRVKXCTJSA-N His-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DGLAHESNTJWGDO-SRVKXCTJSA-N 0.000 description 1
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 1
- 101001098029 Homo sapiens 40S ribosomal protein S2 Proteins 0.000 description 1
- 206010020649 Hyperkeratosis Diseases 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 1
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- LDRALPZEVHVXEK-KBIXCLLPSA-N Ile-Cys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N LDRALPZEVHVXEK-KBIXCLLPSA-N 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- HTDRTKMNJRRYOJ-SIUGBPQLSA-N Ile-Gln-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HTDRTKMNJRRYOJ-SIUGBPQLSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- LNJLOZYNZFGJMM-DEQVHRJGSA-N Ile-His-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N LNJLOZYNZFGJMM-DEQVHRJGSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- MTONDYJJCIBZTK-PEDHHIEDSA-N Ile-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(=O)O)N MTONDYJJCIBZTK-PEDHHIEDSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 1
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 1
- VUPHVQCDULLACF-NAKRPEOUSA-N Ile-Met-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N VUPHVQCDULLACF-NAKRPEOUSA-N 0.000 description 1
- BKPPWVSPSIUXHZ-OSUNSFLBSA-N Ile-Met-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N BKPPWVSPSIUXHZ-OSUNSFLBSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 1
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 1
- WJBOZUVRPOIQNN-KJYZGMDISA-N Ile-Trp-His Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)C1=CN=CN1 WJBOZUVRPOIQNN-KJYZGMDISA-N 0.000 description 1
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- 102000019223 Interleukin-1 receptor Human genes 0.000 description 1
- 108050006617 Interleukin-1 receptor Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 1
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 1
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- LJKJVTCIRDCITR-SRVKXCTJSA-N Leu-Cys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LJKJVTCIRDCITR-SRVKXCTJSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- 108010006444 Leucine-Rich Repeat Proteins Proteins 0.000 description 1
- 240000006240 Linum usitatissimum Species 0.000 description 1
- 235000004431 Linum usitatissimum Nutrition 0.000 description 1
- 108700012133 Lycopersicon Pto Proteins 0.000 description 1
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 1
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 1
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 1
- CKSXSQUVEYCDIW-AVGNSLFASA-N Lys-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N CKSXSQUVEYCDIW-AVGNSLFASA-N 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- PHHYNOUOUWYQRO-XIRDDKMYSA-N Lys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N PHHYNOUOUWYQRO-XIRDDKMYSA-N 0.000 description 1
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- ZASPELYMPSACER-HOCLYGCPSA-N Lys-Gly-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZASPELYMPSACER-HOCLYGCPSA-N 0.000 description 1
- XDPLZVNMYQOFQZ-BJDJZHNGSA-N Lys-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N XDPLZVNMYQOFQZ-BJDJZHNGSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- PINHPJWGVBKQII-SRVKXCTJSA-N Lys-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N PINHPJWGVBKQII-SRVKXCTJSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- WWEWGPOLIJXGNX-XUXIUFHCSA-N Lys-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N WWEWGPOLIJXGNX-XUXIUFHCSA-N 0.000 description 1
- PIXVFCBYEGPZPA-JYJNAYRXSA-N Lys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N PIXVFCBYEGPZPA-JYJNAYRXSA-N 0.000 description 1
- MSSABBQOBUZFKZ-IHRRRGAJSA-N Lys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O MSSABBQOBUZFKZ-IHRRRGAJSA-N 0.000 description 1
- DYJOORGDQIGZAS-DCAQKATOSA-N Lys-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N DYJOORGDQIGZAS-DCAQKATOSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- ZEDVFJPQNNBMST-CYDGBPFRSA-N Met-Arg-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZEDVFJPQNNBMST-CYDGBPFRSA-N 0.000 description 1
- OLWAOWXIADGIJG-AVGNSLFASA-N Met-Arg-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(O)=O OLWAOWXIADGIJG-AVGNSLFASA-N 0.000 description 1
- XBYKTPZCWQQSGB-IHRRRGAJSA-N Met-Cys-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XBYKTPZCWQQSGB-IHRRRGAJSA-N 0.000 description 1
- RZJOHSFAEZBWLK-CIUDSAMLSA-N Met-Gln-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N RZJOHSFAEZBWLK-CIUDSAMLSA-N 0.000 description 1
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 1
- MSSJHBAKDDIRMJ-SRVKXCTJSA-N Met-Lys-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MSSJHBAKDDIRMJ-SRVKXCTJSA-N 0.000 description 1
- NLDXSXDCNZIQCN-ULQDDVLXSA-N Met-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=CC=C1 NLDXSXDCNZIQCN-ULQDDVLXSA-N 0.000 description 1
- FDGAMQVRGORBDV-GUBZILKMSA-N Met-Ser-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCSC FDGAMQVRGORBDV-GUBZILKMSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 241000244206 Nematoda Species 0.000 description 1
- 108091092724 Noncoding DNA Proteins 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 240000008467 Oryza sativa Japonica Group Species 0.000 description 1
- 208000025174 PANDAS Diseases 0.000 description 1
- 208000021155 Paediatric autoimmune neuropsychiatric disorders associated with streptococcal infection Diseases 0.000 description 1
- 240000000220 Panda oleosa Species 0.000 description 1
- 235000016496 Panda oleosa Nutrition 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- LBSARGIQACMGDF-WBAXXEDZSA-N Phe-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 LBSARGIQACMGDF-WBAXXEDZSA-N 0.000 description 1
- YMORXCKTSSGYIG-IHRRRGAJSA-N Phe-Arg-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N YMORXCKTSSGYIG-IHRRRGAJSA-N 0.000 description 1
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 1
- MRNRMSDVVSKPGM-AVGNSLFASA-N Phe-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRNRMSDVVSKPGM-AVGNSLFASA-N 0.000 description 1
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- CDQCFGOQNYOICK-IHRRRGAJSA-N Phe-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDQCFGOQNYOICK-IHRRRGAJSA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- YVXPUUOTMVBKDO-IHRRRGAJSA-N Phe-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CS)C(=O)O YVXPUUOTMVBKDO-IHRRRGAJSA-N 0.000 description 1
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 1
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- PZSCUPVOJGKHEP-CIUDSAMLSA-N Pro-Gln-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PZSCUPVOJGKHEP-CIUDSAMLSA-N 0.000 description 1
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 1
- RPLMFKUKFZOTER-AVGNSLFASA-N Pro-Met-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 RPLMFKUKFZOTER-AVGNSLFASA-N 0.000 description 1
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 1
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- GNFHQWNCSSPOBT-ULQDDVLXSA-N Pro-Trp-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)N)C(=O)O GNFHQWNCSSPOBT-ULQDDVLXSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- 108010009341 Protein Serine-Threonine Kinases Proteins 0.000 description 1
- 102000009516 Protein Serine-Threonine Kinases Human genes 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 1
- 101150085390 RPM1 gene Proteins 0.000 description 1
- 108010025216 RVF peptide Proteins 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- NJSPTZXVPZDRCU-UBHSHLNASA-N Ser-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N NJSPTZXVPZDRCU-UBHSHLNASA-N 0.000 description 1
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- ZGFRMNZZTOVBOU-CIUDSAMLSA-N Ser-Met-Gln Chemical compound N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)O ZGFRMNZZTOVBOU-CIUDSAMLSA-N 0.000 description 1
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 1
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- BVLGVLWFIZFEAH-BPUTZDHNSA-N Ser-Pro-Trp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BVLGVLWFIZFEAH-BPUTZDHNSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- SOACHCFYJMCMHC-BWBBJGPYSA-N Ser-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O SOACHCFYJMCMHC-BWBBJGPYSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- 240000003768 Solanum lycopersicum Species 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- UTSWGQNAQRIHAI-UNQGMJICSA-N Thr-Arg-Phe Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UTSWGQNAQRIHAI-UNQGMJICSA-N 0.000 description 1
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 1
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- MMTOHPRBJKEZHT-BWBBJGPYSA-N Thr-Cys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O MMTOHPRBJKEZHT-BWBBJGPYSA-N 0.000 description 1
- GUZGCDIZVGODML-NKIYYHGXSA-N Thr-Gln-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O GUZGCDIZVGODML-NKIYYHGXSA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- PZSDPRBZINDEJV-HTUGSXCWSA-N Thr-Phe-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PZSDPRBZINDEJV-HTUGSXCWSA-N 0.000 description 1
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 1
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 1
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- AXEJRUGTOJPZKG-XGEHTFHBSA-N Thr-Val-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O AXEJRUGTOJPZKG-XGEHTFHBSA-N 0.000 description 1
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 1
- 102000006289 Transcription Factor TFIIA Human genes 0.000 description 1
- 108010083262 Transcription Factor TFIIA Proteins 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- MVHHTXAUJCIOMZ-WDSOQIARSA-N Trp-Arg-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N MVHHTXAUJCIOMZ-WDSOQIARSA-N 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- DVWAIHZOPSYMSJ-ZVZYQTTQSA-N Trp-Glu-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 DVWAIHZOPSYMSJ-ZVZYQTTQSA-N 0.000 description 1
- DNUJCLUFRGGSDJ-YLVFBTJISA-N Trp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N DNUJCLUFRGGSDJ-YLVFBTJISA-N 0.000 description 1
- YDTKYBHPRULROG-LTHWPDAASA-N Trp-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YDTKYBHPRULROG-LTHWPDAASA-N 0.000 description 1
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 1
- YTYHAYZPOARHAP-HOCLYGCPSA-N Trp-Lys-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N YTYHAYZPOARHAP-HOCLYGCPSA-N 0.000 description 1
- UHXOYRWHIQZAKV-SZMVWBNQSA-N Trp-Pro-Arg Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O UHXOYRWHIQZAKV-SZMVWBNQSA-N 0.000 description 1
- SUEGAFMNTXXNLR-WFBYXXMGSA-N Trp-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O SUEGAFMNTXXNLR-WFBYXXMGSA-N 0.000 description 1
- SUGLEXVWEJOCGN-ONUFPDRFSA-N Trp-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)O SUGLEXVWEJOCGN-ONUFPDRFSA-N 0.000 description 1
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 1
- 102000000852 Tumor Necrosis Factor-alpha Human genes 0.000 description 1
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 1
- IXTQGBGHWQEEDE-AVGNSLFASA-N Tyr-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IXTQGBGHWQEEDE-AVGNSLFASA-N 0.000 description 1
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 1
- SFSZDJHNAICYSD-PMVMPFDFSA-N Tyr-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CC4=CC=C(C=C4)O)N SFSZDJHNAICYSD-PMVMPFDFSA-N 0.000 description 1
- HFJJDMOFTCQGEI-STECZYCISA-N Tyr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HFJJDMOFTCQGEI-STECZYCISA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 1
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 1
- JQOMHZMWQHXALX-FHWLQOOXSA-N Tyr-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JQOMHZMWQHXALX-FHWLQOOXSA-N 0.000 description 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- WOCYUGQDXPTQPY-FXQIFTODSA-N Val-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N WOCYUGQDXPTQPY-FXQIFTODSA-N 0.000 description 1
- WGHVMKFREWGCGR-SRVKXCTJSA-N Val-Arg-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WGHVMKFREWGCGR-SRVKXCTJSA-N 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- IQQYYFPCWKWUHW-YDHLFZDLSA-N Val-Asn-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N IQQYYFPCWKWUHW-YDHLFZDLSA-N 0.000 description 1
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 1
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 1
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 1
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 1
- ZLMFVXMJFIWIRE-FHWLQOOXSA-N Val-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N ZLMFVXMJFIWIRE-FHWLQOOXSA-N 0.000 description 1
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 244000000005 bacterial plant pathogen Species 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 238000003766 bioinformatics method Methods 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 101150039352 can gene Proteins 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 230000036978 cell physiology Effects 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 230000008260 defense mechanism Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 108010009297 diglycyl-histidine Proteins 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000012407 engineering method Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 101150054900 gus gene Proteins 0.000 description 1
- 101150047832 hpt gene Proteins 0.000 description 1
- 230000003301 hydrolyzing effect Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 210000004901 leucine-rich repeat Anatomy 0.000 description 1
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 244000000010 microbial pathogen Species 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 150000007523 nucleic acids Chemical group 0.000 description 1
- 238000002161 passivation Methods 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 230000002085 persistent effect Effects 0.000 description 1
- 238000003976 plant breeding Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 238000003753 real-time PCR Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000008521 reorganization Effects 0.000 description 1
- 230000008261 resistance mechanism Effects 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012882 sequential analysis Methods 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 230000001568 sexual effect Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 108091035539 telomere Proteins 0.000 description 1
- 210000003411 telomere Anatomy 0.000 description 1
- 102000055501 telomere Human genes 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 230000007923 virulence factor Effects 0.000 description 1
- 239000000304 virulence factor Substances 0.000 description 1
- 238000001086 yeast two-hybrid system Methods 0.000 description 1
Images
Landscapes
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
本发明公开了一种水稻稻瘟病抗性基因Pik-p的核苷酸序列及其编码的氨基酸多肽序列。该序列包含有2个基因,都属于CC-NBS-LRR抗性基因家族的成员,且皆为组成型表达的基因。本发明还涉及用该基因转化水稻或其它植物培育抗病品种以及根据该基因序列产生的分子标记在育种中的应用。
Description
技术领域
本发明涉及基因工程领域,具体涉及一种水稻稻瘟病抗性基因Pik-p的克隆及其应用。
背景技术
植物在生长的过程中,常常受到多种病原物的侵害,而植物则采取多种防御策略以保护自身,避免受其侵袭。植物中一个最重要的防御机理就是能够识别专性致病微生物的存在并启动自身的防御应答系统。植物对病原菌的识别由抗病基因所介导。因此,抗病基因产物结构的分析与研究,是了解植物抗病机制的基础,对于植物病害的预防和控制也具有重要的指导意义。
迄今为止,已经从单子叶植物和双子叶植物中分离了80多个抗病基因。对这些抗病基因的结构和产物的研究发现,虽然寄主植物不同,所拮抗的病原物也有真菌、细菌、病毒和线虫等差异,但抗病基因的结构和产物却有许多共同的结构特征,如C-端存在富亮氨酸重复序列(leucine-rich repeat,LRR),N-端存在核苷酸结合位点(nucleotide binding site,NBS),亮氨酸拉链(leucine zipper,LZ),卷曲螺旋结构域(coiled-coil,CC),跨膜结构域(transmembrane domain,TM),蛋白激酶(protein kinase,PK),以及果蝇Toll蛋白及哺乳动物白介素-1受体(Toll andinterleukin-1 receptor,TIR)等。根据它们所编码蛋白的结构特征,可将抗病基因分为7类(Hammond-Kosack&Jones,1997;Dangl&Jones 2001;Iyer&McCouch2004)。
第一类,毒素还原酶类抗病基因。如玉米抗病基因Hm1,它是第1个被克隆的植物抗病基因,它控制对真菌Cochliobolus carbonum小种1的抗性。Hm1编码的解毒酶能钝化病原真菌所产生的HC毒素,而HC毒素是真菌C.carbonum小种1产生的致病因子,它决定该病菌只能感染玉米的某些基因型(Johal等,1992)。第二类,NBS-LRR类抗病基因。它们编码的蛋白近N端为NBS,而近C端则由LRR组成。如RPS2(Bent et al.,1994)、RPM1(Grant et al.,1995)、I2(Simon et al.,1998);RPP5(Parker et al.,1997)、N(Dodd et al.,2001)、L6(Lawrence et al.,1995)、Mla1(Zhou et al.,2001)、Mla6(Halterman et al.,2001);水稻的抗病基因如Xa1(Yoshimura et al.,1998)、Pib(Wang et al.,1999)、Pita(Bryan et al.,2000)等。第三类,PK类抗病基因。如番茄Pto基因,其产物是一种位于细胞内的丝氨酸-苏氨酸蛋白激酶,没有LRR结构域(Martin et al.,1993)。第四类,LRR-TM类抗病基因。番茄抗叶霉病不同生理小种的基因Cf-2(Dixon et al.,1996)、Cf-4(Thomas et al.,1997)、Cf-5(Dixon et al.,1998)、Cf-9(Jones et al.,1994),以及甜菜抗胞囊线虫的基因Hs1pro-1(Cai et al.,1997)等。第五类,LRR-TM-PK类抗病基因,以水稻抗白叶枯病基因Xa21(Song et al.,1995)为代表。第六类,以拟南芥的RPW8为代表,其编码的蛋白仅含有完整的CC和NBS结构域(Xiao et al.,2001)。第七类,以水稻的xa5基因为代表,其编码的蛋白为一个转录因子(TFIIAγ)(Iyer&McCouch,2004)。
编码NBS-LRR类抗病蛋白的抗病基因是植物抗病基因中最大的一类抗病基因,根据NBS-LRR类抗病蛋白N末端的结构特点,又可将该类基因分为两大类TIR-NBS-LRR(TNL)和CC-NBS-LRR(CNL)(Meyers et al.,2002;Pan et al.,2000;Cannon et al.,2002;Richly et al.,2002)。TNL类抗病基因主要发现在双子叶植物,至今还没在单子叶植物基因组中发现(Bai et al.,2002;Meyers et al.,2002)。目前在单子叶植物中鉴定到的抗病基因主要编码CNL类抗病蛋白,双子叶植物中也存在大量的CNL类抗病基因,相对而言,单子叶植物中的CNL类抗病基因比双子叶植物中的更丰富多样(Cannon et al.,2002)。
研究表明,NBS、CC、TIR结构域可能参与信号传导(Hammond-Kosack&Jones1997),尽管这类R蛋白不具有内在的激酶活性,但NBS可以激活激酶或G蛋白,NBS具有结合ATP或GTP以及水解酶活性(Traut et al.,1994)。TIR结构可能参与植物抗病防卫反应下游的信号传导(Jones et al.,1994)。最近的证据表明,亚麻的L族多样性选择也发生在TIR区域,这个区域与相应的LRR区域共进化形成特异性(Luck et al.,2000)。LRR结构域的功能主要涉及到蛋白质-蛋白质和与配体的相互作用(Jones&Jones,1996;Kajava et al.,1998),据推测是抗病基因产物直接和病原菌无毒基因编码的产物或间接产物相互作用的部位(Bent,1996;Bakeret al.,1997)。Jia et al.(2000)通过酵母双杂交证明AVR-Pita编码的蛋白加工为一个176氨基酸的活性蛋白AVR-Pita176小种特异性的激发子,传递到植物细胞质特异地与Pita受体的LRD区域结合,从而激活细胞内Pita介导的防卫反应。当Pita中的Ala变为Ser后Av-rPita176不能与LRD结合,因而表现感病。这一结果直接证明了LRR结构域可能就是病原菌识别的区域;Pita与AvrPita的互作第一次从分子水平验证了水稻与稻瘟病菌的“基因对基因”关系。
水稻是世界上最重要的粮食作物之一,约有一半以上的人口以稻米为主食。由病原真菌Magnapothe oryzae(无性态:Pyricularia oryzae)引起的稻瘟病是对水稻生产危害最严重的病害之一,每年都造成严重的粮食损失。从环境保护与农业的可持续发展的观点来看,抗病品种的育成与利用是防治稻瘟病最安全有效的方法。但是,由于稻瘟病菌群体的多样性及易变性,加之人们缺乏对抗性基因的有效利用,以及对抗性机理缺乏充分的了解,以致抗病品种的感病化问题不但没有得到解决,反而因有效抗源基因的缺乏和抗病品种的短命化而成为育种学家最为棘手的问题。因此,发掘、鉴定和克隆抗病基因并合理地应用于抗病育种计划已成为农业科研中优先解决的重要问题。
随着分子生物学的快速发展,迄今,至少有80多个水稻稻瘟病主效抗性基因已经被报道,其中60个已经被分子定位。目前,除了水稻的第3染色体上还没有被鉴定到稻瘟病主效抗性基因之外,其余的11条染色体上均被鉴定到有主效抗性基因位点,并且有的染色体上含有多个稻瘟病抗性位点,而且有些基因位点的抗性基因是成簇存在的。目前,在稻瘟病抗性基因的克隆研究方面,正式报道的已有13个抗性基因,Pib(Wang et al.,1999),Pita(Bryan et al.,2000),Pi9(Qu et al.,2006),Pid2(Chen et al.,2006),Piz-t和Pi2(Zhou et al.,2006b),Pi36(Liu et al.,2007a),Pi37(Lin et al.,2007),Pik-m(Ashikawa et al.,2008),Pit(Hayashi et al.,2009),Pi5(Lee et al.,2009),Pid3(Shang et al.,2009)和pi21(Fukuoka et al.,2009)。
克隆抗病基因是对水稻抗性机理深入研究的前提,揭示水稻抗稻瘟病的分子机理可以更好地控制和降低稻瘟病菌对水稻的危害。同时,对克隆的抗病基因的修饰和改造,可以人为地控制和增加植物的抗病性,拓宽植物的抗谱。这些方面是采用常规植物育种和改良技术所不能达到的。
发明内容
本发明的目的是提供一种水稻稻瘟病抗性基因和包含调控这个基因的启动子的DNA片段。
本发明的另一个目的是提供上述稻瘟病抗性基因所编码的蛋白质。
本发明的另一个目的是提供上述含有上述抗性基因的载体。
本发明的另一个目的是提供上述载体的宿主细胞。
本发明的另一个目的是提供上述基因在制备转基因植物中的应用。
本发明的进一步目的是提供上述抗性基因产生的分子标记及其在选育对稻瘟病具有抗病性的水稻中的应用。
本发明从水稻K60中分离得到Pik-p基因,包括编码两个NBS-LRR类蛋白的基因Pikp3、Pikp4,其核苷酸序列如SEQ ID NO.1或SEQ ID NO.2所示,它们分别编码SEQ ID NO.3以及SEQ ID NO.4所示的氨基酸序列,结构如图5a和5b所示。它们的蛋白都包含2个主要的结构域:NBS和LRR区域,其中Pikp3 NBS结构域含有保守的kinase 1a:GLPGGGKTTVAR,kinase 2:KKYLIVIDDIW,kinase3a:DLGGRIIMTTRLNSI;Pikp4 NBS结构域含有保守的kinase 1a:VLSIVGFGGVGKTTIA,kinase 2:LEQLLAEKSYILLIDDIW和kinase 3a:EQVPEEIWKICGGLPLAIVT。而Pikp3蛋白的C-末端为16个不完整的LRR重复,其亮氨酸含量为14.0%;Pikp4蛋白的C-末端为13个不完整的LRR重复,其亮氨酸含量为17.0%(图5a,5b)。Pik-p基因中编码NBS或LRR的核苷酸片段可能具有独立的功能。将Pik-p基因中编码不同结构域的片段与其他核酸片段重组,可构成嵌合基因或蛋白质,使之具有新的功能。
应当理解,在不影响蛋白活性的前提下(不在蛋白的活性中心),本领域技术人员可对SEQ ID NO.3或SEQ ID NO.4所示的氨基酸序列进行各种取代、添加和/或缺失一个或几个氨基酸获得具有同等功能的氨基酸序列。此外,考虑到密码子的简并性,例如可在其编码区,在不改变氨基酸序列的条件下,或在其非编码区在不影响蛋白表达的条件下,对编码上述蛋白的基因序列进行修改。因此,本发明还包含对编码上述蛋白的基因序列进行的替换、添加和/或缺失一个或多个核苷酸,具有与上述编码基因具有相同功能的核苷酸序列。本发明还包括基于所述基因的正义序列或反义序列,包括含有所述核苷酸序列或其片段的克隆载体或表达载体、含有所述载体的宿主细胞、含有所述核苷酸序列或其片段的转化的植物细胞和转基因植物。
本发明同样包括将Pik-p抗性基因的主要结构部分有效地连接上合适的调节序列所形成的嵌合基因,以及在基因组中包含这种基因的植物和这种植物的种子。这种基因可以是天然的或是嵌合的。例如,将包含该基因的片段和一个组成型表达的启动子连接,该启动子可以在任何条件下和细胞发育的任何时期表达。这种组成型表达的启动子包括花椰菜花叶病毒35S的启动子等。另一方面,也可以将该基因和一个组织特异性表达的启动子或发育时期特异性表达的启动子或精确环境诱导的启动子连接,这些启动子称之为诱导型启动子。这样,环境的改变,发育时期的不同都可以改变该基因的表达。同样地,也可以将该基因的表达限定在某一个组织内,使由该基因诱导的抗病反应得到人为的控制。其中环境条件包含病原菌的攻击、厌氧条件和光等,组织和发育时期包括叶、果实、种子和花等。
本发明还进一步克隆得到所述基因的启动子,包含调控该基因的启动子的DNA片段分别如SEQ ID NO.5和SEQ ID NO.6所示。
根据本发明提供的Pik-p基因序列信息(SEQ ID No.1和SEQ ID No.2),本领域技术人员可以通过以下方法容易地获得与Pikp等同的基因:(1)通过数据库检索获得;(2)以Pik-p基因片段为探针筛选水稻或其它植物的基因组文库或cDNA文库获得;(3)根据Pik-p基因序列信息设计寡核苷酸引物,用PCR扩增的方法从水稻或其它植物的基因组、mRNA和cDNA中获取;(4)在Pik-p基因序列的基础上用基因工程方法改造获得;(5)用化学合成的方法获得该基因。
本发明提供的稻瘟病抗性基因Pik-p具有重要的应用价值,其赋予植物对稻瘟病菌(Magnaporthe oryzae)所引起的病害产生特异性的抗病反应。其适用于对所有对该病原菌敏感的植物,这些植物包括单子叶植物和双子叶植物。应用之一是将所述的Pik-p基因序列连接到任何一种植物转化载体,用任何一种转化方法将Pik-p抗病基因导入水稻或其他植物细胞,可获得表达所述基因的转基因抗病品种,从而应用于生产。本发明所述的基因构建到植物转化载体中,可以对所述基因或其调控序列适当修饰,也可以在其转录起始密码子前用其它启动子取代所述基因原有的启动子,从而拓宽和增强植物对病原菌的抗性。
本发明提供的抗性基因的另一个应用是根据所述基因序列信息产生特异性的分子标记,包括但不限于SNP(单核苷酸多态)、SSR(简单序列重复多态)、RFLP(限制性内切酶长度多态)、CAP(切割扩增片段多态)。用此标记可鉴定水稻或其它植物的抗性基因型,用于分子标记辅助选择育种,从而提高育种的选择效率。
本发明具有明显的优点和效果。将克隆的抗病基因转入感病的植物,有助于产生新的抗病植物。特别是可以用转化技术在植物中累加多个抗病基因,而不会产生传统育种技术中伴随出现的基因组中不良基因的连锁问题,并且可以缩短育种时间。抗病基因的克隆是克服传统育种中不能在植物种间转移抗病基因的问题的前提。
本发明能够进一步提供或应用利用上述DNA片段获得的抗病的转基因植株和相应的种子,以及用本发明的基因或基于该基因的重组体转化的植株或由这类植株获得的种子。可以用有性杂交的方式将本发明的基因转入其他的植株。
附图说明
图1.水稻稻瘟病抗性基因Pik-p的图位克隆技术路线图。
图2a.水稻稻瘟病抗性基因Pik-p之组成基因Pikp3之正向遗传互补T0植株的抗性鉴定实物图。图中所有转化植株都表现感病,说明单一的Pikp3不具备抗性功能。
图2b.水稻稻瘟病抗性基因Pik-p之另一组成基因Pikp4之正向遗传互补T0植株的抗性鉴定实物图。图中所有转化植株都表现感病,说明单一的Pikp4也不具备抗性功能。
图2c.水稻稻瘟病抗性基因Pik-p(由Pikp3和Pikp4组成)之正向遗传互补T0植株的抗性鉴定实物图。图中箭头1所指的是抗病植株,箭头2所指的是感病植株,说明抗性基因Pik-p由Pikp3和Pikp4组成才能表现功能。
图2d.水稻稻瘟病抗性基因Pik-p的正向遗传互补的部分T0转化体的选择标记GUS和HPT基因的PCR检测图。
图中泳道1:分子量标记DL2000;泳道2:H2O,阴性对照;泳道3:空载体,阳性对照;泳道4-12:T0转化体。结果表明,所有转化体都检测到了2个选择性标记的PCR扩增产物,说明正义转化构建体已经整合到了高感受体品种Q1063中。R:高抗;MR:中抗;S:感病。其中的感病个体(S)可能是由于外源基因的整合等问题,在受体品种中并没有得到充分的表达。
图3a.水稻稻瘟病抗性基因Pik-p反向遗传互补(RNAi)植株的抗性鉴定图。图中箭头1所指的是抗病植株,箭头2所指的是感病植株。
图3b.水稻稻瘟病抗性基因Pik-p之组成基因Pikp3之RNAi T0转化体基于载体特异性标记的PCR检测图。
图中泳道1:分子量标记DL2000;泳道2:H2O,阴性对照;泳道3:空载体,阳性对照;泳道4-15:RNAi T0转化体。其中,所有的感病个体(S)都检测到了载体特异性标记(pMCG)的PCR扩增产物,说明RNAi构建体已经整合到受体品种K60(含有Pik-p)中,由此干涉(抑制)了该基因的表达而表现感病;抗病个体(R)没有检测到载体特异性标记(pMCG)的PCR扩增产物,说明RNAi构建体没有整合到受体品种K60(含有Pik-p)中,由此该基因的表达没有受到干涉(抑制)而表现抗病。
图3c.水稻稻瘟病抗性基因Pik-p之另一个组成基因Pikp4之RNAi T0转化体基于载体特异性标记的PCR检测图。
图中泳道1:分子量标记DL2000;泳道2:H2O,阴性对照;泳道3:空载体,阳性对照;泳道4-18:RNAi T0转化体。其中,所有的感病个体(S)都检测到了载体特异性标记(pMCG)的PCR扩增产物,说明RNAi构建体已经整合到受体品种K60(含有Pik-p)中,由此干涉(抑制)了该基因的表达而表现感病;而抗病个体(R)都没有检测到载体特异性标记(pMCG)的PCR扩增产物,说明RNAi构建体没有整合进受体品种K60(含有Pik-p)中,由此该基因的表达没有受到干涉(抑制)而表现抗病。
图3d.水稻稻瘟病抗性基因Pik-p之组成基因Pikp4之RNAi 2个T1转化系基于载体特异性标记的共分离鉴定图。
上图和中图为RNAi T1转化系KP4RNAi-28;下图为RNAi T1转化系KP4RNAi-3。3个图中,泳道1:分子量标记DL2000;泳道2:H2O,阴性对照;泳道3:空载体,阳性对照;其余泳道:RNAi T1转化个体。结果表明,所有的感病后代个体(S)都检测到了载体特异性标记(GUS)的PCR扩增产物,而抗病个体(R)都没有检测到载体特异性标记的PCR扩增产物,说明2个RNAi T1转化系后代个体的基因型与表型共分离。
图4a.水稻稻瘟病抗性基因Pik-p之组成基因Pikp3的基因结构图。
图中浅色方框表示5’和3’-UTR;深色方框表示外显子;线条表示内含子;KP3RNAi1和KP3RNAi2表示RNAi干涉所用片段及其对应的蛋白质编码区。
图4b.水稻稻瘟病抗性基因Pik-p之另一组成基因Pikp4的基因结构图
图中浅色方框表示5’和3’-UTR;深色方框表示外显子;线条表示内含子;KP4RNAi、KP4 RNAi1和KP3RNAi2表示RNAi干涉所用片段及其对应的蛋白质编码区。
图5a.水稻稻瘟病抗性基因Pik-p之组成基因Pikp3编码的氨基酸多肽序列结构图。
图中CC结构域中的黑体字表示形成CC motif的氨基酸,NBS区域的黑体字表示NBS中保守的氨基酸序列。LRR区域后面还带有一段CNL(C端非LRR区域)氨基酸序列。双下线标示的4个氨基酸序列为等位基因特异的氨基酸替换(substitution,与已被克隆的Pik-m基因相比)。
图5b.水稻稻瘟病抗性基因Pik-p之组成基因Pikp4编码的氨基酸多肽序列结构图。
图中CC结构域中的带下划线的黑体字表示形成CC结构的氨基酸,NBS区域带下划线的黑体字表示NBS中保守的氨基酸序列。下部分独立列出的氨基酸残基为C-末端LRR区域。双下线标示的3个氨基酸序列为等位基因特异的氨基酸替换(substitution,与已被克隆的Pik-m基因相比)。
图6.水稻稻瘟病抗性基因Pik-p表达特性的定量RT-PCR检测图。
上图为组成基因Pikp3在不同的接种时间点(0hr,12hr,24hr,72hr)的相对表达水平,由此可以看出,该基因呈逐步上调后而下降的表达类型。上图为另一个组成基因Pikp4在不同的接种时间点(0hr,12hr,24hr,72hr)的相对表达水平,由此可以看出,该基因呈下调后上调而又下调的表达类型。2个组成基因都表现为组成型表达,因为接种之前和之后都能检测到2个基因的表达。
图7.利用水稻稻瘟病抗性基因Pik-p特异性分子标记的基因型鉴定图。图中,M:分子量标记DL2000;P1:抗性品种K60的基因型(Pikp/Pikp);P2:感病品种AS20-1的基因型(pikp/pikp);R1和R3:抗性个体的杂合基因型(Pikp/pikp);S1,S2,S3,S4:感病个体的基因型(pikp/pikp);R2:抗性个体的纯合基因型(Pikp/Pikp)。
具体实施方式
以下实施例进一步说明本发明的内容,但不应理解为对本发明的限制。在不背离本发明精神和实质的情况下,对本发明方法、步骤或条件所作的修改或替换,均属于本发明的范围。
若未特别指明,实施例中所用的技术手段为本领域技术人员所熟知的常规手段。
在本发明的实施例部分,我们阐述了Pik-p基因的分离过程(图1)和该基因的特点,分离的Pik-p基因能够与适当的载体连接,转入植物体中,使该植物体带有一定的抗性。
实施例1:抗性基因Pik-p的抗性特征
首先,为了比较和明确在Pik位点上4个等位基因(Pik,Pik-p,Pik-s,Pik-m)的抗谱,利用从广东(60个菌株),福建(40个),湖南(40个),贵州(60个),云南(43个),四川(66个),江苏(72个),辽宁(108个),吉林(60个),黑龙江(63个),收集的总计612个菌株对上述4个等位基因所持品种(分别是Kusabue,K60,Shin 2和Tsuyuake)进行了抗谱比较分析。结果表明,Pik-p基因表达了与其他3个等位基因不同的抗性反应;该基因在广东(96.7%)、江苏(90.9%)和四川(90.9%)表现高水平的抗性,由此说明该基因至少可在上述地区使用(Wang et al.2009,Phytopathology,99:900-905)。其次,利用上述612个菌株对Pik-p基因与目前中国水稻抗稻瘟病育种计划中常用的抗源基因(Pi1,Pi2,Pi3,Pi4等)进行了抗谱比较分析。结果表明,Pik-p基因表达与这些抗源基因差异明显的抗性反应,由此说明该基因可在上述育种计划中与这些抗源基因聚合使用,以获得更广、更持久的抗性(Wang et al.2009,Phytopathology,99:900-905)。
实施例2:水稻稻瘟病抗性基因Pik-p的遗传分析及初步定位
本发明对由粳稻抗病品种K60与2个籼稻感病品种AS20-1及Kasalath的杂交组合由来的F2群体(F2-1和F2-2),接种对双亲品种表现分明的非亲和性/亲和性反应的菌株M028(稻瘟病菌单一孢子分离的菌株)。结果表明,这2个F2群体中抗病植株与感病植株的分离比都符合3∶1。由此推断,K60所表现的对接种菌株M028的抗性是由一对显性基因控制的。对这2个F2群体对应的抗性基因进行连锁分析,结果表明它们受同一显性基因控制,因此,构建了由这2个F2群体由来的340个体和341个体组成的一个作图群体(Wang et al.2009,Phytopathology,99:900-905)。
前期的遗传分析表明Pik-p基因与Pik-m基因是等位的,而Li et al(2007,Molecular Breeding,20:179-188)对Pik-m基因进行了精细定位。因此,申请人首先利用Li et al(2007,Molecular Breeding,20:179-188)开发的微卫星(simplesequence repeat,SSR)标记(RM254,RM456C,RM224,RM7443,RM5926),进行分离群体分析(bulked-sgregant analysis,BSA)。结果表明,只有最靠近端粒的RM5926在2个群体中都表现连锁性多态,并分别鉴定到了28和14个重组体(Wang et al.2009,Phytopathology,99:900-905)。然后,选择Li et al(2007,Molecular Breeding,20:179-188)开发的靠近着丝粒的分子标记K37进行连锁分析,分别获得了3和2个重组体。根据上述结果推断,Pik-p基因位点被界定于RM5926——K37之间,其中鉴定到了总共47个不同的重组体。
实施例3:水稻稻瘟病抗性基因Pik-p的精细定位及电子物理作图
为了缩小Pik-p位点的染色体区域,进一步在RM5926——K37区域内,从Li等(2007)开发的9个标记中获得了7个多态性标记(RM144,K25,K15-2,K22,K28,K34,K39)。这些多态性标记对上述47个不同的重组体进行连锁分析的结果表明,Pik-p基因被界定于K28——K39之间约0.67cM的区域内。为了进一步缩小Pik-p位点的染色体区域,在K28——K39区域内开发了4个多态性标记(K33,K40,K41和K42),结果表明,这4个标记都与Pik-p位点表现完全的共分离。因此,Pik-p位点最终被界定于K28——K39之间约0.67cM的区域内,其中4个标记与之完全共分离(Wang et al.2009,Phytopathology,99:900-905)。
为了构建Pik-p位点的电子物理图,把与之连锁的标记通过生物信息学方法着陆于水稻品种Nipponbare的参考序列上,并由此构建了该基因位点的重叠群。由参考序列可以推测Pik-p位点被界定于侧翼标记K28和K39之间约126kb的范围内(Wang et al.2009,Phytopathology,99:900-905)。
实施例4:水稻稻瘟病抗性基因Pik-p候选基因的注释及序列分析
为了确定Pik-p的候选基因,申请人利用日本晴的参考序列,通过3种基因预测软件RiceGAAS(http://ricegaas.dna.affrc.go.jp)、Gramene(http://143.48.220.116/resources/)和Softberry的FGENESH(http://www.softberry.com)对目的基因区域进行了基因预测及注释分析,初步确定了Pik-p的候选抗性基因为4个核苷酸结合位点和富亮氨酸重复(nucleotide binding site-leucine-rich repeat,NBS-LRR)的候选基因(KP1,KP2,KP3和KP4)。
对4个候选基因进行基于基因特异性标记的存在/缺失(presence/absence,P/A)分析,结果表明,在日本晴参考序列存在的2个候选基因(KP1和KP2)在K60中是不存在的。因此,将KP3和KP4通过以下的基因功能互补实验确认其功能。
实施例5:水稻稻瘟病抗性基因Pik-p的正向遗传学互补实验及转化体的抗性鉴定
首先,利用高保真酶phusion结合长片段PCR(long-range PCR,LR-PCR)技术,以K60(Wang et al.2009,Phytopathology,99:900-905)的总DNA为模板,根据或参考Liu et al.(2007,Genetics,176:2541-2549)和Lin et al.(2007,Genetics,177:1871-1880)描述的实验方法和程序,分别扩增、克隆了KP3(正向引物:术,以K60(Wang et al.2009,Phytopathology,99:900-905)的总DNA为模板,根据或参考Liu et al.(2007,Genetics,176:2541-2549)和Lin et al.(2007,Genetics,177:1871-1880)描述的实验方法和程序,分别扩增、克隆了KP3(正向引物:TTTTGGCGCGCCGCCAGTGTCCACCAACCACAGTAATAAA;反向引物:TTTTGGCGCGCCGAACAGCCTGAGGAAGCAGAACATCGTC)和KP4(正向引物:TTTTGGCGCGCCGCAAGATCAGTACCATCACGAGTAATAGCA;反向引物:TTTTGGCGCGCCAGGGACAGAATGACAGTGTAAGTGAGTTTG),并进行了遗传互补实验,结果表明,这个2个候选基因都没有在感病受体品种Q1063(Lin et al.,2007,Genetics,177:1871-1880)中实现功能互补(表1;图2a,2b)。此外,对KP4进行超表达(overexpression,OX)分析,也没有获得功能互补的转化体(表1)。
表1.Pik-p 2个组成基因的正、反向遗传互补实验分析
a候选基因/构建体的缩写作:KP3,Pikp3;KP4,Pikp4;KP3+4,Pikp3+Pikp4;OX,过表达;RNAi,RNA干涉(RNA interference)。
b正向遗传学构建体转化到高度感病的品种Q1063中;反向遗传学构建体转化到Pik-p供体品种K60中。
c来自各个构建体的T0植株接种对Pik-p无毒性的菌株CHL381。R,抗病;MR,中抗;MS,中感;S,感病。
d正向遗传学构建体的功能互补成功率(%)为,R+MR/R+MR+MS+S*100;反向遗传学构建体的功能互补成功率(%)为,MS+S/R+MR+MS+S*100。
有趣的是,当将上述2个候选基因拼接为一个整体的基因进行遗传互补实验时,则获得了功能互补的转化体(表1;图2c)。利用转化载体中的GUS和潮霉素基因(HPT)标记对转化体进行了PCR检测。检测结果证明目基因片段已经导入这些转化体中。部分的阳性个体可能是由于外源基因的整合等问题,在受体品种中并没有得到充分的表达而表现感病(图2d)。
6:水稻稻瘟病抗性基因Pik-p的反向遗传学互补实验及转化体的抗性鉴定
申请人利用反向遗传学(RNAi)手段,对候选基因进行了进一步的功能验证分析。利用PCR技术扩增了一个含Pikp4 cDNA部分序列的516bp的片段(正向引物:CACCTTGAGCTGTCCGGCAAGTT,反向引物:CAGTGACGATGCCATCAACAA)并将其克隆至双元RNAi载体pANDA中(Mikiand Shimamoto,2004,Plant Cell Physiology,45:490-495;构建体命名为KP4RNAi),导入了农杆菌菌株。同样地,利用PCR技术扩增Pikp4 cDNA其他位置745bp (正向引物:TTTACTAGTGGTACCTCAGCGGTATCCGTGGTG,连续3个T为保护碱基,下划线表示为方便载体构建添加的2个酶切位点Spe I-Kpn I,反向引物:TTTGAGCTCGGATCCGCGCCTGAGGTGAGGTTCT,下划线表示为方便载体构建添加的2个酶切位点Sac I-BamH I,下同),586bp(正向引物:TTTACTAGTGGTACCTCACCGATGACGAGTCCC,反向引物:TTTGAGCTCGGATCCTGCCAGTGTCCACCAACC)的片段并将其克隆至另一个双元RNAi载体ds-pC1301(Yuan et al.,Planta,2007,226:953-960)上(构建体命名为KP4 RNAi1和KP4RNAi2)。运用相同的技术和方法构建了Pikp3基因的RNAi构建体750bp(正向引物:TTTACTAGTGGTACCTGGCCAGATCTCCCAATCGT,反向引物:TTTGAGCTCGGATCCTGTGCCTGTGCAGGCTCAGT)和489bp(正向引物:TTTACTAGTGGTACCTTCAATAGCTGAGAAGTGCC,反向引物:TTTGAGCTCGGATCCCCAAAGTAACCTTCTGCTTC)分别命名为构建体KP3RNAi1和KP3RNAi2)。然后,分别将上述构建体按照上述方法,对由抗病品种K60成熟种子诱导的愈伤组织进行了遗传转化,分别获得了248,261,403,411,467株的转化体,功能互补的成功率分别为47.2%,32.2%,15.1%,36.5%,22.0%(表1;图3a)。利用转化载体中的GUS基因及多位点克隆位点(pMCG)标记对转化体进行了PCR检测,结果表明,构建体已经导入这些转化体中,而且上述标记的基因型与表现型(抗、感)完全共分离(图3b,3c,3d)。也就是说,在抗病品种K60的遗传背景下,因为RNA干涉而抑制抗性基因Pik-p表达,使之表现为感病。
上述正、反遗传学的功能互补实验结果表明,抗性基因Pik-p由2个基因Pikp3和Pikp4组成才能表现其功能。
实施例6:Pik-p基因的结构
采用移步法对Pik-p的DNA序列进行了测定。利用RACE技术,获得了Pik-p的全长cDNA序列并对其进行了测序。Pik-p基因DNA长度为16.9kb(包含Pikp3和Pikp4),其中,Pikp3的全长cDNA序列为3652bp,含有一个3429bp的开放读码框,5’和3’非翻译区分别为60bp和163bp。通过比较基因组DNA和cDNA序列,发现该基因的开放读码框含有2个内含子和3个外显子(图4a)。另一个组成基因Pikp4的全长cDNA为3613bp,含有一个3066bp的开放读码框,5’和3’非翻译区分别为264bp和283bp。通过比较基因组DNA和cDNA序列,发现该基因的开放读码框含有1个内含子和2个外显子,而在5’非翻译区内有一个内含子(图4b)。
实施例7:Pik-p抗性蛋白的结构
Pik-p基因包含的2个组成基因编码的蛋白序列如序列表中Pikp3 SEQ ID No.3和Pikp4 SEQ ID No.4所示。Pikp3基因编码1个由1142个氨基酸残基组成的蛋白多肽,分子量为126.65KD,等电点为6.12。利用COIL分析表明该蛋白多肽有CC(coil-coil)结构域。Pikp3蛋白属于NBS-LRR蛋白,NBS结构域中保守的kinase1a(GLPGGGKTTVAR)位于该多肽的第291个氨基酸残基;kinase 2(KKYLIVIDDIW)位于该多肽的第377个氨基酸残基;kinase 3a(DLGGRIIMTTRLNSI)位于该多肽的第402个氨基酸残基。而该蛋白的C-端的610-960个氨基酸残基为16个不完整LRR重复,其亮氨酸含量为14.0%,其C末端还有个非LRR结构(CNL)的氨基酸序列(图5a)。图中双下线标示的4个氨基酸序列为等位基因特异的氨基酸替换(substitution,与已被克隆的Pik-m基因相比)。
Pikp4基因编码1个由1021个氨基酸残基组成的蛋白多肽,分子量为114.57KD,等电点为8.64。利用COIL分析表明该蛋白多肽没有CC(coil-coil)结构域,但是利用更为强大的预测工具Paircoil2分析发现有CC结构域。Pikp4蛋白属于NBS-LRR蛋白,NBS结构域中保守的kinase 1a(GFGGVGKTTIA)位于该多肽的第212个氨基酸残基;kinase 2(KSYILLIDDIW)位于该多肽的第330个氨基酸残基;kinase 3a(GGRIIVTTRFQAV)位于该多肽的第359个氨基酸残基。而该蛋白的C-末端的610-960个氨基酸残基为13个不完整LRR重复,其亮氨酸含量为17.0%(图5b)。图中双下线标示的3个氨基酸序列为等位基因特异的氨基酸替换(substitution,与已被克隆的Pik-m基因相比)。
实施例8:Pik-p基因的表达特性分析
利用定量RT-PCR技术对Pik-p基因的表达模式进行了分析。在抗病品种K60接种后不同的时间点(0hr,12hr,24hr,72hr)上采集叶片并提取其总RNA,利用反转录试剂盒SuperScriptTMReverse Transcriptase II进行反转录cDNA第一条链前合成。RT-PCR的引物为,
KP3RT-F:GAAGCTCTGATCAACGGTATTCC,
KP3RT-R:TCTTTGATCATCTTCGGGATACG;
KP4RT-F:TGAACTTCCACGATTGGATCCAC,
KP4RT-R:ACATGGTTCTTGAATACATCATGTC。
实时定量RT-PCR使用CFX96 Real-time PCR检测系统和SYBR Premix Ex Taq试剂盒(宝生物公司),操作按照试剂盒说明进行。实验结果表明,抗性品种在上述不同的接种时间点上均能扩增出特异的片段,说明Pik-p基因的2个组成基因均能在叶片组织中表达,即属于组成型表达的基因(图6)。
实施例9:转化Pki-p基因产生的抗性植株的应用
将Pik-p基因克隆到植物转化载体pCAMBIA1301,导入农杆菌菌株EHA105中,用于转化感病的水稻品种Q1063,获得了405株转化植株,其中82株表现抗性反应(表1)。这说明可以用pik-p基因转化水稻感病品种,经过自交、纯化选择等一系列育种程序之后,即可产生抗病品种而应用于生产。
实施例10:Pik-p基因序列在分子标记辅助选择育种中的应用
利用本发明提供的Pik-p基因序列信息可开发基因特异性分子标记,用于鉴定Pik-p位点上的各种基因型,即Pikp/Pikp、Pikp/pikp和pikp/pikp的植株,这些信息可在分子标记辅助选择育种过程中加以应用,以提高育种的目的性和效率。
本例根据Pikp3序列开发的基因特异性标记(CRG11F:GACGTCGTGAAGAAAGAAGC;CRG11R:CAATGTTTTCACTGCT CCCGT)对2个亲本及其杂交F2后代个体基因型的鉴定。结果如图7所示,抗性个体R2与亲本P1的电泳结果相同,两者基因型相同,均为纯合基因型(Pikp/Pikp)。
序列说明
SEQ ID NO.1&2是Pikp3和Pikp4的核苷酸序列,SEQ ID NO.3&4是SEQ ID NO.1&2的编码产物,SEQ ID NO.5&6是带有启动子的Pikp3和Pikp4核苷酸序列,SEQ ID NO.7&8和SEQID NO.9&10是分别用来扩增KP3和KP4的引物对,SEQ ID NO.11&12是扩增Pikp4 cDNA序列的一个516bp片段的引物对,SEQ ID NO.13&14和SEQ ID NO.15&16分别是引物对KP3RT-F、KP3RT-R和KP4RT-F、KP4RT-R,SEQ ID NO.17&18是引物对CRG11F&CRG11R,SEQ IDNO.19~24分别是Pikp3的kinase 1a、kinase 2、kinase 3a序列以及Pikp4的kinase 1a、kinase 2、kinase 3a序列。
参考文献:
1.Ashikawa I et al.Two adjacent nucleotide-binding site-leucine-rich repeat class genes are required to conferPikm-specific rice blast resistance.Genetics,2008,180:2267-2276.
2.Bai J et al.Diversity in nucleotide binding site-leucine-rich repeat genes in cereals.Genome Res,2002,12:1871-1884.
3.Baker B et al.Signaling in plant-microbe interactions.Science,1997,276:726-733.
4.Bent AF et al.RPS2 of Arabidopsis thaliana:a leucine-rich repeat class of plant disease resistance genes.Science,1994,265:1856-1860.
5.Bent AF.Plant Disease Resistance Genes:Function Meets Structure.Plant Cell,1996,8:1757-1771.
6.Bryan GT et al.A single amino acid difference distinguishes resistant and susceptible alleles of the rice blastresistance gene Pi-ta.Plant Cell,2000,12:2033-2045.
7.Cai D et al.Positional cloning of a gene for nematode resistance in sugar beet.Science,1997,275:832-834.
8.Cannon SB et al.Diversity,distribution,and ancient taxonomic relationships within the TIR and non-TIRNBS-LRR resistance gene subfamilies.J Mol Evo,2002,54:548-562.
9.Chen X et al.A B-lectin receptor kinase gene conferring rice blast resistance.Plant J,2006,46:794-804.
10.Dangl JL et al.Plant pathogens and integrated defence responses to infection.Nature,2001,411:826-833.
11.Dixon MB et al.The tomato Cf-2 disease resistance locus comprises two functional genes encoding leucine-richrepeat proteins.Cell,1996,84:451-459.
12.Dixon MS et al.The tomato Cf-5 disease resistance gene and six homologs show pronounced allelic variation inleucine-rich repeat copy number.Plant Cell,1998,10:1915-1925.
13.Dodds PN et al.Contrasting modes of evolution acting on the complex N locus for rust resistance in flax.Plant J,2001,27:439-452.
14.Fukuoka S et al..Loss of function of a proline-containing protein confers durable disease resistance in rice.Science,2009,325:998-1001.
15.Grant MR et al.Structure of the Arabidopsis RPM1 gene enabling dual specificity disease resistance.Science,1995,269:843-846.
16.Hayashi K and Yoshida H.Refunctionalization of the ancient rice blast disease resistance gene Pit by therecruitment of a retrotransposon as a promoter.Plant J,2009,57:413-425.
17.Hammond-Kosack KE et al.Plant disease resistance genes.Annu Rev Plant Physiol Plant Mol Biol,1997,48:573-607.
18.Iyer AS et al.The rice bacterial blight resistance gene xa5 encodes a hovel form of disease resistance.MolPlant-Microbe Interact,2004,17:1348-1354.
19.Jia YL et al.Direct interaction of resistance gene and avirulence gene products confers rice blast resistance.EMBO J,2000,19:4004-4014.
20.Johal GS et al.Reductase activity encoded by the HM1 disease resistance gene in maize.Science,1992,258:985-987.
21.Jones DA et al.Isolation of the tomato Cf-9 gene for resistance to Cladosporium fulvum by transposon tagging.Science,1994,266:789-793.
22.Jones DA,Jones JDG.The role of leucine-rich repeat proteins in plant defenses.Adv Bot Res Inc Adv PlantPathol,1996,24:89-167.
23.Kajava AV et al.Structural diversity of leucine-rich repeat proteins.J Mol Biol,1998,277:519-527.
24.Lawrence GJ et al.The L6 gene for flax rust resistance is related to the Arabidopsis bacterial resistance geneRPS2 and the tobacco viral resistance gene N.Plant Cell,1995,7:1195-1206.
25.Lee S et al.Rice Pi5-Mediated Resistance to Magnaporthe oryzae Requires the Presence of Two CC-NB-LRRGenes.Genetics,2009,181:1627-1638.
26.Lin F et al.The Blast Resistance Gene Pi37 Encodes an NBS-LRR Protein and is a Member of a ResistanceGene Cluster on Rice Chromosome 1.Genetics,2007,177:1871-1880.
27.Liu X et al.The in silico map-based cloning of Pi36,a rice coiled-coil nucleotide-binding site leucine-richrepeat gene that confers race-specific resistance to the blast fungus.Genetics,2007,176:2541-2549.
28.Luck JE et al.Regions outside of the leucine-rich repeats of flax rust resistance proteins play a role inspecificity determination.Plant Cell,2000,12:1367-1377.
29.Martin GB et al.Map-based cloning of a protein kinase gene conferring disease resistance in tomato.Science,1993,262:1432-1436.
30.Meyers BC et al.Plant disease resistance genes encode members of an ancient and diverse protein family withinthe nucleotide-binding superfamily.Plant J,1999,20:317-332.
31.Meyers BC et al.TIR-X and TIR-NBS proteins:two new families related to disease resistance TIR-NBS-LRRproteins encoded in Arabidopsis and other plant genomes.Plant J,2002,32:77-92.
32.Miki D and Shimamoto K.Simple RNAi vectors for stable and transient suppression of gene function in rice.Plant Cell Physiology,2004,45:490-495.
33.Pan Q et al.Divergent evolution of plant NBS-LRR resistance gene homologues in dicot and cereal genomes.JMol Evol,2000,50:203-213.
34.Parker JE et al.The Arabidopsis downy mildew resistance gene RPP5 shares similarity to the toll andinterleukin-1 receptors with N and L6.Plant Cell,1997,9:879-894.
35.Qu S et al.The broad-spectrum blast resistance gene Pi9 encodes a nucleotide-binding site-leucine-rich repeatprotein and is a member of a multigene family in rice.Genetics,2006,172:1901-1914.
36.Richly E et al.Mode of amplification and reorganization of resistance genes during recent Arabidopsis thalianaevolution.Mol Biol Evol,2002,19:76-84.
37.Shang J et al.Identification of a new rice blast resistance gene,Pid3,by genomewide comparison of pairednucleotide-binding site--leucine-rich repeat genes and their pseudogene alleles between the two sequenced ricegenomes.Genetics,2009,182:1303-1311.
38.Simons G et al.Dissection of the fusarium I2 gene cluster in tomato reveals six homologs and one active genecopy.Plant Cell,1998,10:1055-1068.
39.Song WY et al.A receptor kinase-like protein encoded by the rice disease resistance gene,Xa21.Science,1995,270:1772-1804.
40.Thomas CM et al.Characterization of the tomato Cf-4 gene for resistance to Cladosporium fulvum identifiessequences that determine recognitional specificity in Cf-4 and Cf-9.Plant Cell,1997,9:2209-2224.
41.Traut TW et al.The functions and consensus motifs of 9 types of peptide segments that form different types ofnucleotide-binding sites.Eur J Biochem,1994,229:9-19.
42.Wang L et al.Characterization of rice blast resistance genes in the Pik cluster and fine mapping of the Pik-plocus.Phytopathology,2009,99:900-905.
43.Wang ZX et al.The Pib gene for rice blast resistance belongs to the nucleotide binding and leucine-rich repeatclass of plant disease resistance genes.Plant J,1999,19:55-64.
44.Xiao S et al.Broad-spectrum mildew resistance in Arabidopsis thaliana mediated by RPW8.Science,2001,291:118-120.
45.Yoshimura S et al.Expression of Xa1,a bacterial blight-resistance gene in rice,is induced by bacterialinoculation.Proc Natl Acad Sci,1998,95:1663-1668.
46.Yuan B et al.Mitogen-activated protein kinase OsMPK6 negatively regulates rice diseaseresistance to bacterial pathogens.Planta,2007,226:953-960.
47.Zhou B et al.The eight amino-acid differences within three leucine-rich repeats between Pi2 and Piz-tresistance proteins determine the resistance specificity to Magnaporthe grisea.Mol Plant-Microbe Interac,2006,19:1216-1228.
序列表
<110>华南农业大学
<120>稻瘟病抗性基因Pik-p及其应用
<130>KHP09113305.2
<160>24
<170>PatentIn version 3.5
<210>1
<211>8096
<212>DNA
<213>稻属水稻(Orysa sativa L.)
<220>
<221>5’UTR
<222>(1)..(60)
<220>
<221>CDS
<222>(61)..(624),(775)..(1342),(4114)..(6410)
<220>
<221>3’UTR
<222>(6411)..(6573)
<400>1
gagagtagca ctcaacgcaa aaggggatcg gccgagccgc cgatcgaggc gcgagtaggg 60
atggaggcgg ctgccatggc cgtaaccgca gccacggggg ccttggcgcc cgtgctagtg 120
aagctggccg ctttgctgga cgacggggag tgcaatcttc tggaggggag ccggagcgac 180
gcagagttca tcagatccga gctggaggcc gttcattctc tcctcacccc aaatatcttg 240
gggaggatgg gggatgacga tgcggcgtgc aaggatggct tgattgcgga ggtccgggag 300
ctgtcctacg acctggatga tgccgtcgac gacttcttgg agctcaattt cgagcagcga 360
agaagcgcaa gccctttcgg tgagctcaag gcaagagttg aggagcatgt ctccaatcgc 420
ttctctgact ggaagctacc ggcggcgagc cttccgccgt cgtcggtaca ccgccgagct 480
ggcttgccgc caccagatgc agagctggtg gggatggaca aacgtatgga agagctcacc 540
aaattgctgg aacaagggag caatgatgct tcacgatggc gcaagcgaaa accgcatttc 600
ccgctcagaa aaacagggct aaaggtacgg ttggatctcg attcttcaca aatatagctt 660
cctccgatag cagtgccaat gaatttgttt acttctctct tgattcttta atttggaagt 720
actgtataac aaacatggag ggatcgtcat ttaacttaat tttttttgtt gcagcaaaaa 780
atcgtgatca aggttgccat ggagggcaat aattgccgtt caaaagcaat ggctttagtt 840
gcgagcactg gaggagtgga ctcggttgcg ctcgtaggtg atctaagaga caagatagag 900
gtggtcggtt atggcattga ccccatcaag ctgatctccg cgctccggaa gaaggtgggc 960
gatgcggagt tgctgcaggt cagccaagca aataaagatg tgaaggagac gacgccgatg 1020
cttgcgccgg tgaaatccat atgtgaattt cacaaggtca aaacagtttg catccttgga 1080
ttgccaggtg gaggcaaaac aacggttgcc agagaattat atgacgcctt gggaacgcac 1140
ttcccatgcc gggttttcgt gtcagtctct ccaagttcta gtcccagtcc caatctcaca 1200
aagactcttg cagacatttt cgctcaagca caactaggag taaccgatac acttagcaca 1260
ycatatggtg ggagtgggac cgggagagct cttcaacaac atctcatcga caacatatca 1320
gctttcctcc tcaacaaaaa gtaagcaata tcttatatct cttatctagc ttgtccttaa 1380
tccggcttcc caaattaaag tgaacaggct atatgtccat ataatgttgg gttttttttt 1440
tctgtcctcg atgaggtata tatcaagata tgccaaactt tttttaatta gcgttacttt 1500
attctaccta gtaaatagat acagaaatgg ggcaggcccc tcctttgtga caggccaatg 1560
gttttatgat gtggaaacaa ctcccttatc actgaagctc ttgctattac tcgtgatggt 1620
actgatcttg caaaggcttg cagtatttgt aaatccacag ggagtggtca tctttttttt 1680
tttaagataa tgaatagaaa tctggcctct atatagaaag ccaaaggtta cacaagtgca 1740
tacactccaa atctcaaagc tgagaaacac gaaaaacaaa cgaccaaaaa gaaaagacga 1800
taagattaag tttcaagctt ttcctccttg cttcagaaaa cagggaaatg cagagctagc 1860
ccgctcttcc cctgccatgc cctctcactg ccattgcaac taccacaaaa gctcccctgt 1920
ttcgccgtga gcgaagggag gggatggaac tagtctgctg ctttgttgtc gccaagcact 1980
ggtttcatgc tgcccaaaca acctatgccc cgctagcgat aacgagtctg ttgccgccat 2040
gctgttgtcg ctggagagca cttgtcgctg ctcagctatg tgttaagtaa gctactaaca 2100
atctctccat tctcttgtag actgcttata aaagattgta cttcaacttt ttattaatga 2160
aatagagttc ctgccttaca acaacaagaa cgagagagag agagagagcc aatatcacga 2220
gttagcactg tgaagacttc ctgactgtta tgtcttcaag aacagtataa aagcatttgc 2280
cactataaaa atatgatagc cctcaagatc attaaaaata tgtcatgttc ttttaccttc 2340
ataacaacca gtatcacaaa tatgaaaaca tttaaaggaa aatcagtatt tgccatttat 2400
gtaaagaaaa actagactag aactctgggc ttttatttta gtgcacactg tatacatgtg 2460
tgttctttga cgtatatact tgatattttt tcaaaaactt ttgggggcat atctgtaacc 2520
catgccccat gcatccaaag tagtagtgag acaactttcg tatcaatgga catgtggcat 2580
taaatttaat cccaatattt gttctctcga agtcctgaac atataattgg acaaaattat 2640
ataatttaat taggccaatt gagccatata tacacacact ataagaaact tctacaaaac 2700
aaacattcgc tccttttagg ttttgaccag tctatacatg aaccagccaa cggtgtgctg 2760
gtctgactat aaatgttttt ttagagagta aaggcacctt ccagtacctc tctcgaggac 2820
agtaaaaaca cccatgcttt aaggttatga acaacaaagc atagaacaac cacaagaaaa 2880
aaaaaaacaa ctagaacaag ttctggaact agttagggat tcgggaagcc attgcttgct 2940
ggcttgggac aagctatgcc tgccattgga ctatggtgga gactgggcat aaaggatctc 3000
aatctcttgg agattggcct atgtcctagg tggccatggc tgactgcaag gagatatatt 3060
tcttgcaacg gaaattttca ttgcaaaagc tgaaaatcgt tgcgatatgt acctattgtg 3120
acataattga atccattggg agccgtagca ataaatcccg tggtaatagg ctattgcaat 3180
gattttccga tggttgcata aaaaggtgct attacaatgg ttttgcattg ctattgcaac 3240
aacttgatat gtcataggta tttattgcaa cagtttgtac catgctgcaa taagatttcg 3300
caacacagac ctattcatta aatagcctat ttccacacat tcatttttgt ggcaataatt 3360
cagtgcgaca agcatgttct atagctataa cctgatgcaa cgggttagac ttgttgcagt 3420
aacttattag cacatttaat tttttactaa atttatattg attcaactat ttcaacatca 3480
ttcaatacag taaatactgg aaaatgcaag caatctccaa aaattcacca ataagagaaa 3540
cgaattaaat catatacaca attatttcac accgtacagt gcacttaagt gtaggaaata 3600
taatgctata tacatttcca accctcaata caggttctgt aaataatgaa aagaaacatc 3660
aatttgattt cttgaccggt ccctctgaga cacacagtgc tttcctcagc aacatcgttg 3720
ccgatctatg taacttgtaa actccttgtg cttaataaat tctggccagg gttacttcag 3780
cctgcatggc gaaaaaaaag aacttgttag ggagatcgca aacaaatgtt agctgcccca 3840
tttcttctct actgaccata aaagctctag gatcaggtca tgaaaacaat gccgtcagac 3900
cggaaatact caagattcag aacacgactc ctaggaactg caagtttaaa tttcaacaaa 3960
agcaaaatac aaccagatga caaactaata tatatcagct gactgttcat ctcaagtgga 4020
ctcacatctg tgaaattgtt agttctatgc atatatgcta atgtaatact aatatgctat 4080
tgaggcttat tactcttctc gacttttata ggtatctcat tgtaatcgat gacatttggc 4140
attgggaaga atgggaagtc atcagaaagt ccattcccaa gaatgatctg ggtggtagaa 4200
taatcatgac tactcgtctt aattcaatag ctgagaagtg ccacactgat gacaatgatg 4260
tttttgtcta cgaagttggg gatctagata ataatgatgc tttgtcgttg tcttggggga 4320
tagcaacaaa gtctggggca ggcaacagga tcggaactgg agaggataat ccatgctatg 4380
atattgtgaa catgtgttat ggtatgcctt tagcacttat ttggctgtcg tcagcattgg 4440
ttggagagat agaagaatta ggtggtgctg aagtgaaaaa atgtagggat ttgagacaca 4500
tagaggatgg tattttggac atcccatcct tacaaccatt ggcggagagt ttatgccttg 4560
gttataacca tcttcctctt tatctgagga ctttgttgtt gtactgtagt gcataccatt 4620
ggtctaacag aatcgaaagg ggtcgtctgg tcaggaggtg gattgcggaa ggatttgtgt 4680
cggaagagaa agaagcagaa ggttactttg gcgagcttat taacagagga tggattacgc 4740
agcacggaga caacaacagt tataattact atgagatcca ccccgtgatg ctggccttcc 4800
tgagatgcaa gtccaaggag tacaattttt taacatgctt gggtctggga tctgatacta 4860
gtactagtgc atcctcccca aggttgattc gccggctgtc tcttcagggg gggtatccag 4920
tggactgctt gtcaagcatg agtatggatg tgtcacacac ttgcagcctt gtcgtccttg 4980
gtgacgtggc gcgacccaag ggaatcccct tctatatgtt taagcgcttg cgagtgttgg 5040
accttgaaga taataaggat atacaggatt ctcatctgca gggcatatgt gaacagttaa 5100
gcctcagagt gaggtacctt ggtctcaagg gaacgcggat ccgaaagctc cctcaggaga 5160
tgaggaagct gaagcatttg gagattttgt atgtggggag cactcggatc agtgaacttc 5220
cgcaagagat tggagagctg aagcatctgc ggattctgga cgtgagaaac acggacatca 5280
ctgagctccc actgcagata cgggagctgc agcatctgca cactctggac gtgaggaaca 5340
ctccaatcag tgagctcccg ccgcaggttg gcaagctgca gaatctcaag attatgtgcg 5400
tgaggagcac tggggttagg gagctcccaa aggagattgg ggagctgaat catctacaga 5460
ctctggacgt gagaaacacg agggtgagag agctgccatg gcaagctggc cagatctccc 5520
aatcgttgcg cgtgcttgcc ggtgacagtg gcgatggcgt gcggttgccc gaaggcgtct 5580
gcgaagctct gatcaacggt attccagggg ctacgcgtgc aaaatgcagg gaggttctgt 5640
ccatcgcgat catcgatcgt ttcggacctc cccttgttgg gatattcaaa gttcccggca 5700
gtcatatgcg tatcccgaag atgatcaaag accacttccg cgttctttct tgcctagaca 5760
tcaggctctg ccacaagctt gaggatgatg accaaaagtt cctcgccgag atgcccaacc 5820
tgcagacgct cgtgctgagg ttcgaggccc taccaagaca acccataacc atcaacggca 5880
caggcttcca gatgctggag agcttccgtg tcgacagccg ggtgccaagg atagccttcc 5940
atgaagacgc catgcccaac ctcaagcttc tcgagttcaa gttctacgcc ggcccagcaa 6000
gcaacgatgc catcggcatc accaacctga agagcctcca aaaggtggtc tttcggtgct 6060
cgccatggta caagagcgac gcccctggca tcagcgccac cattgacgtc gtgaagaaag 6120
aagccgagga gcatcccaac cggccgatca ccctcctcat caatgctggg tataaggaga 6180
tatcaactga gtcacacggg agcagtgaaa acattgcggg cagcagtggg atcgatactg 6240
agcctgcaca ggcacagcat gataatctcc ctgctgttcg agatgactac aagggaaaag 6300
ggattcttct tgatggcagg tgtcctacct gcggccgagc gactaaaatt gaagaggaaa 6360
cccaagatcg agtagcagat attgaaattc aaacagaaac tactagctag ctagtaccgc 6420
tgtctaattt tatttgtatt aaatactctc cttcatgtaa tactctaagt ccccgctcaa 6480
tttttcttgc ctccgcaact ctacatgata ctcaaaactg atttcttacc tccctttgat 6540
tgtagaaaaa acatgggaag tttggaggat ttcaatccta tgaaaaggac tttgagaaac 6600
aaattacgga atcctatctt ttcctttaaa agatatataa tagaaaatcc tatagagatt 6660
tttataggaa agttaacaaa agcctcaatc ttatggagaa ttcctgtgtt tatttgcata 6720
gaggaaagtg agactcacct aatcaggggc ggttctagtg ggtggtccag gtgatccctg 6780
gaccacccta aaatttggcc catgagaacc gccccccccc tcctcctccc ggggttggtc 6840
gaaaccagaa aaaaggccac tgagctaagc tagggttagg gagaggtgta gccatgaact 6900
agccacagca gcacgcatcg ctggctgccg cctccgcctc tgcgtcagcg ccagcgtgcc 6960
gtctccccgg ctctcgactc tgcctccgcc tccgccttgg cgccgatgcg tcgcctcctc 7020
ggctcccgct tctacctcag cgccagcgtg ccgcctccgc cttagcgccg cctcctcgcc 7080
tcctcccttc cccgactcct ccagtccccc actcctgtcg gttgctatcg tcggcgggcg 7140
gcgaccaacg cctacaagcc aactacggct gcctgggcaa gccggcggcc acccccgccc 7200
ctggaaaggg aaaacattgc cattccacag gcggcgccag gtcccctttc tctctagctt 7260
tcttgttgct cagcaaagtt caaaggtcag tagacatttc ttgttggttt tttgcaatgt 7320
tgccctatgc agtagtgagt agtcagtact cagtagtagt gctagtgaaa aatttctgcc 7380
ttgagtgtag tgaaaagcaa gctgagaaat agcatgtgtg acaaattatt aaatgatggt 7440
ttaatcacat ctattgagcg ggttattttc tctcaagtta gcaaagaaga cattatgcat 7500
aatttcatgt ctatatatga acggagagta gaaaagaagt agggttttgt aatcttctac 7560
ttatctatta taagatattt gaatgattat gttctaactt tcgtaacttc aaatttagat 7620
gttggtttta agcaagttgt gatgttgtga gatatattgc tacatttcaa ttatattttt 7680
attatttttg acacatttgg cttgaatttg taacaatgtg acaatttgga gcatcaagta 7740
ggactacccg acgattcatg aaaccaatgt gccgatattt agcggttctt aatcatactg 7800
aggtaagaag gcaacatttt gcaaaagtag aatttcttca tgaaaatcag gcttatagct 7860
ttaatgttca caaactagct aagtttcctt gtaccttaga ggaggggcat catgttcgat 7920
tctcaaatac tccttgtaat ctaagtagga atgtaaataa tttggttgtt aatgaataaa 7980
atgccagaat tttattctaa aagaatatgt aattcaaatt aatattattc tagatgtatc 8040
ttgaatagcc cattagcact ggaccagtct ttcaagttgg gtcagtggtc ggtacc 8096
<210>2
<211>7543
<212>DNA
<213>稻属水稻(Orysa sativa L.)
<220>
<221>5’UTR
<222>(1)..(155),(1037)..(1145)
<220>
<221>CDS
<222>(1146)..(2137),(2302)..(4375)
<220>
<221>3’UTR
<222>(4376)..(4658)
<400>2
gcctggccag ctctggtcgg aggagggttc agttccatgg gtgcgccccc tgcccatccc 60
tccacttcta ctttcgtctt gagatctcgc cgtttctgat ctccttgcca tggcaaccag 120
ttgatttggc tgccaagatc gtccgtgctt tgcaaaggtg aacaatcttt ttcttcttgg 180
gctttcactt tgttgtgtta atagactcag cgccgtcgca acttgcaaat aaaaaggttc 240
agatcggaaa atggaaatac cgtacacagt agtctataga gaatcgacta gtatttcctg 300
actgtaagta aggaagttgt aggtaccatg ttctgatgat tatctttgaa gcttggaggg 360
ggacaaaaag ttgggagaga aacaagaatg tatttgttca cttgattggg ggcagtaggt 420
aaaatctcag cgttaggaat ttgcacctct ttattaagtt tgactaaatt tatagaaaaa 480
aattagcaac acttaaaaca ccaaattagt ttcattgaat ccaacattga atatattttg 540
ataatatgtt tactttgtgt tcaaaatgtt actatatttt tctataaatt tgatcaaatt 600
tcaataagtt tgactagaaa aaaaagtcaa accgacttat aatatgaata taaatggagg 660
gagtagatta taaattatct ttattataat actttgagtg tacaaacgaa tgaataaagt 720
ttacaaagtt gaaaagttat ttttaaatac tattccgata aacctgtata tagagaactt 780
gtgaaaaaaa acaccgtttg aaatgtgtct gtgataatct ataagaggaa cggggcctaa 840
atatattcct tttttcctac gaaaatgcaa gaagtattgc ctaatatatt gatagagctg 900
aagttttaaa gtaaaatgaa acatttgcat gaggcagcac tggacaaaaa gcgcttgtga 960
aaaaaatatg ttccaaatat tattaattca ttcatggtat ggtctttatt ttcattttcc 1020
ccccttatcc atgggcagga cacggaaatt ctcagcaggt tcgcgcttga tctgaactgt 1080
ctgtggctca tactctcttg tgcttgcgcc agctgaaagt tgcagtgaga agtacagaga 1140
acaagatgga gttggtggta ggtgcttccg aagccaccat gaaatctctc ttgggcaagc 1200
tgggcaatct tctagcccag gagtatgctc tcatcagcgg tatccgtggt gacatccagt 1260
acatcaatga cgagcttgcc agcatgcagg ccttcctccg tgatctcagc aacgtgccag 1320
agggtcacag tcatggccac cggatgaagg actggatgaa gcagatccga gacatcgcct 1380
atgatgttga ggactgtatc gatgactttg cccaccgcct ccctcaggat tccatcagcg 1440
atgccaaatg gtccttccta ctcacaaaaa tctatgaact atggacatgg tggccacgtc 1500
gtgtgattgc ttccaacatt gcccaactca aggtacgggc acaacagatc gcagatcgac 1560
gtagtagata cggagtgaac aacccagaac accttgacag tagcagcagt gccaggaccc 1620
gtgctgtcaa ttacgaaatt gctgagtatc aggtcacaag ccctcagatc attggtataa 1680
aggagcctgt ggggatgaag acggtcatgg aggagcttga ggtttggtta actaatcctc 1740
aagctgaaaa tgggcaagct gttctgtcca tagtcggttt tggaggtgtg ggaaagacta 1800
ccattgccac agcattgtac agaaaagtca gtgataaatt tcagtgccgg gcatcagtag 1860
ctgtgtctca gaactatgac caaggcaaag tcctcaatag tattctgagt caagtcagca 1920
atcaggagca gggcagcagc acaacaatta gtgagaaaaa gaacctcacc tcaggcgcta 1980
agagcatgtt gaagacagcc ctgtcactgc tcagaggtaa ttgtatatgt cagccagaaa 2040
atgatggaaa ccctgataat acaccaatca ggctgcagga aacaacggac gatgatcaaa 2100
accccagaaa actggaacag ctcctggccg aaaagaggta cctttttttg taaataaaat 2160
tgctttgctt atctgtaaat taacttactc atcccactct aaatctaatg tttatttcct 2220
tctatacaca gcacaactcc atcttttgaa tgggttttat ttttctcact tgtgctcatt 2280
ttttttttat catctctgca gttatatcct cttgattgat gacatttggt ctgccgaaac 2340
atgggagagt atcagatcga ttttgcctaa aaataataaa ggcggtagaa taatagtgac 2400
tacaagattt caagctgttg gttcaacatg ctcccctctt gaaactgatc gtttgcatac 2460
agttgatttt ctcaccgatg acgagtccca aaacttattc aatacaagta tttgtgaatc 2520
aaagataaga aaagatagca acaaagtaga cgagcaagtc cctgaggaaa tatggaaaat 2580
atgtggggga ttgcctttgg ccatagtcac catggctggt cttgtcgcct gcaacccaag 2640
gaaagcctgc tgcgattgga gtaaactttg caaatcatta tttccagagc aagaaactcc 2700
tcttaccctc gatggtgtta caaggatact ggattgttgt tacaatgatt tgcctgcgga 2760
tctgaagact tgcttattgt acttgagtat atttccgaag ggttggaaaa ttagtaggaa 2820
acgtttgtcc cggcgatgga tagctgaagg ttttgctaat gagaagcaag ggttaaccca 2880
ggaaagagtt gcagaggcat actttaatca actcacaaga aggaacttag tacgtcccat 2940
ggagcatggc agcaatggga aggtaaaaac gtttcaagtt catgacatgg ttcttgaata 3000
catcatgtcc aaatcaatcg aagagaattt tattactgtg gttggtggac actggcagat 3060
gactgcacca agcaataaag tccgtcgact gtcgatgcaa agcagtggat ccaatcgtgg 3120
aagttcaaca aaaggcctga acttggctca agtgagatca ctgacggtgt ttgggaacct 3180
gaaccatatg ccattccatt cattcaacta tgggataata caggtgctgg atcttgagga 3240
ctggaagggt ttgaaagaga gacatatgac ggagatatgt caaatgcttt tactcaagta 3300
tttgagcatc cgacgaacag aaatttccaa aattccctcc aagattcaga aacttgagta 3360
cttggaaact cttgacataa gggagacata tgtcagggac ctgcctaagt caatagtcca 3420
gctaaaacgg atcattagca tacttggagg gaataaaaac acacggaagg ggctgaggtt 3480
gcctcaagaa aaaagtaaga agccaattaa aaacccgtcg cctcaaggaa aaacaaagga 3540
gcccgcaaag aaaggattct tatcccaaga aaaaggtaaa ggcgcaatga aagcactccg 3600
tgtactgtca gggattgaga ttgttgagga atcatcagaa gtagctgcag gccttcatca 3660
gttgacaggg ctaaggaagc ttgccatata caagctcaat ataacaaagg gtggtgatac 3720
cttcaaacaa ttacagtcct ccattgagta ccttggcagc tgtggtctgc agactctggc 3780
catcaatgat gagaattctg aatttatcaa ctcactgggc gacatgcccg cgcctccaag 3840
atatcttgtc gcccttgagc tgtctggcaa gttggagaag ctacccaagt ggatcaccag 3900
catcactact ctcaacaagc taaccatatc tgtaacagtt cttaggactg aaactttgga 3960
gatcctccac attttacctt cattgttttc cctcaccttc gccttttcac ttagtgcagc 4020
gaagcaggat caggacataa taaaggacat ccttgagaat aataaattgg acagtgatgg 4080
ggaaatcgtc attccagctg aaggattcaa gagtcttaag ctgcttcgct tctttgcacc 4140
tttagtgccg aagctcagct ttttggacaa gaatgcaatg ccagcactcg aaatcattga 4200
aatgcggttt aaagacttcg aaggtctatt tggcatcgaa atccttgaaa atctccgtga 4260
ggtgcatctc aaagttagtg atggggcaga agcaataacc aagttccttg taaatgattt 4320
gaaggttaat actgagaaac caaaagtatt tgttgatggc atcgtcactg catgagaagt 4380
aaaattgctg caaatcggag aacttaccaa tcatctgagg cttcccctct attattactc 4440
tcttagaata tattgttatt attgctcacc ttgcaaaata aaatagggat ggcatagcat 4500
attgctacaa cgtaccatgg ttccatcata gttgatttca cttgtcatta cagtgtctgt 4560
tcagttgtgt tttctattaa taaaagggag atctccgcaa gaaaccatta ttatacttat 4620
attcggttat gaactctata aatgatggga ttgctatatt tatggtgcca caatttccat 4680
gagtgcggta tttttttttc tagcagttgg ctggtgttaa gatttgtgct gccattgctc 4740
ctctattatt ggtgcccaaa attacgcttt gcactatgtt cacagttgta aaacttacct 4800
aaatttcgtg tattagtact gtaatgttgt gattttcgcg tccagttata tttttttctt 4860
tgccagaagt ttgatttcaa ggtatatctg gtaagcttca gccggattcg ttaagttttt 4920
agtttaatca gctaaacttt taacagtatg catgcactag tttctcccgt tatcttatcc 4980
aatataatta atgggctaat tggatctatg ccattacaaa catggcaaga ttcagaaaaa 5040
tgctactaca atttgtctat tcatagctgt gctagtgaaa ttttacaaaa ttggaaccgt 5100
gctattgata tcacgttttc catccatctt ttcttttttt tccttttctt ctttatcccg 5160
tcttcttccc gcaccgacga gaggcgagtg gcggcgggac cggcaagcga tggcaggacc 5220
tcggcagctc agtggtggca gcggcaaaag ggaaagggga aggggtagga gagaacctgg 5280
cccgccgacg caagcatgtt ggtcgtcgtc ttccccgagg agcttgagca gtggcaatgg 5340
tagaggaatc caaggaggcc gaggaggttg agcggcacta atggtagagg atgccgagga 5400
ggtcgagcat cagcgagctt gccctgggtg ggcgagtgag ggagagctct cccaagtgtc 5460
tgcgaagctc tcatcttcgt tttcctcatc cccactttgc tcctcttcca ctactgctgt 5520
tgaagctagc ggccctctct caccagatct gggcggagct caagcgaggc gccaccaccg 5580
acgatggagc ttcgcgtgga ccccacagcc tcacccgatt gccatccctc ctaccctcat 5640
cactccatct ccaagcccaa atcccacctc ccccaccggc tcctcctcct cagcccctgc 5700
cgtggcagcc gaggccaccg atgggccaac cctcaacatc gtctcgaagc agctccgcgc 5760
actgtggaag aagcacaacc aaatcctcca gatggaggag tcgctcactg gcgggaggaa 5820
gctgaacaag gagcaggagg aggtgctccg atccaagccc atcgtcatcg cgctcattga 5880
tgagctcgag cggatgcgtg ccctgctcgc cgacgccctc gccgaggagc tctcctcctg 5940
ccccgcccct tccctatctg tcactccctc ctcctccacc tcctccggcg ctgatttgtc 6000
cgtcaaggat ctcctcacgc tcatctactt cggctccctc ttcgacgtca agtcgcagat 6060
cgagttcgtc accaccatgc tcgcacgcta aggagctaga ctgatgcatc acctacgact 6120
atgtccgtcc ttgccgcagc acgacttgct acggtcccgc ctcacgcccc gtcgctcgcc 6180
ggtcccgccg ccacttgctt ctcatcggtg cgggaagaag atgggagaaa gaaggaagaa 6240
aacgggaaaa aaaagagatg gatgaaaaac gtgacagcaa tagcacggtt ccaatttcgt 6300
aaaatttcaa tggcaccgct acgaatagaa gaattgtagt agcatttttc tgaatcaaca 6360
tatttgtaat ggcatggatc caataaaccc caaatttatt ttatgccaat acaaatattg 6420
gcaactggtg tcattaacgt ataaatgggt cgcctacata aaaccttccg cgtccttctt 6480
ggttgaacag gtaattaatt gcaaccattt tctgaaaaaa ttgactggaa gtctgaaacc 6540
gtccctgcta ctccctgaat gctttagttc aatctctatg aagctccatc caaacaaaat 6600
ctccagatga tatttcatta aaaacatcat gtccagaact gtccagtatc caagaatggc 6660
tatgatcttt tctgaattct ggactcggga cttatattac atgtatcaaa tccagttatt 6720
ttctggagct acaccagatg aaatcaccaa gtaaaacaat tatataagga tggtgagttg 6780
ttaattaatt acagctttga tagaccactc tgtttgttaa tgcctgacaa gagttcacaa 6840
agaaaaaaat acacaggatt acaaaacttt ttagatacta tctgatgtgt gttaattctc 6900
tcaaatacta tgtacctgaa caaacaagag agaacatgtt gtgaattgtg atctcctcat 6960
gcgagtgacc tgaaactctc tacagggtca gactcctcaa gcatcactgc agcctggaaa 7020
caatcagcag catcattgat cctcccatca ttcctgtgaa ctctccctaa atgaagccaa 7080
gccatcctgt tcgtaggctc aattctcagt gcatcagaga ggaagcatct cgctgcaggg 7140
agataccttg acccttgctt gcagagaagt gcaccaatgg ccaccttgga tggaacatgc 7200
tctagctcta tcgagaatgc gttgacgtag gcagccagtg cctccttgtt ctgatctcgc 7260
gcctcgagca tgaaacctga acttatagca tccacgagtc agattttgat gttgaattca 7320
gttgaatgca gagaattgtc atgttagtgt tacatagaag tggtgaaagt tcaagtttag 7380
ttgagttgat caccttctgc atgcattgtt gcagccgagt aggattttag ggctctagcc 7440
ttctgcagac atatctcagc gtctctccag attgagagac tggagtacag gtttgcaagt 7500
ccttgccata tttcaaactc acttacactg tcattctgtc cct 7543
<210>3
<211>1142
<212>PRT
<213>稻属水稻(Orysa sativa L.)
<400>3
Met Glu Ala Ala Ala Met Ala Val Thr Ala Ala Thr Gly Ala Leu Ala
1 5 10 15
Pro Val Leu Val Lys Leu Ala Ala Leu Leu Asp Asp Gly Glu Cys Asn
20 25 30
Leu Leu Glu Gly Ser Arg Ser Asp Ala Glu Phe Ile Arg Ser Glu Leu
35 40 45
Glu Ala Val His Ser Leu Leu Thr Pro Asn Ile Leu Gly Arg Met Gly
50 55 60
Asp Asp Asp Ala Ala Cys Lys Asp Gly Leu Ile Ala Glu Val Arg Glu
65 70 75 80
Leu Ser Tyr Asp Leu Asp Asp Ala Val Asp Asp Phe Leu Glu Leu Asn
85 90 95
Phe Glu Gln Arg Arg Ser Ala Ser Pro Phe Gly Glu Leu Lys Ala Arg
100 105 110
Val Glu Glu His Val Ser Asn Arg Phe Ser Asp Trp Lys Leu Pro Ala
115 120 125
Ala Ser Leu Pro Pro Ser Ser Val His Arg Arg Ala Gly Leu Pro Pro
130 135 140
Pro Asp Ala Glu Leu Val Gly Met Asp Lys Arg Met Glu Glu Leu Thr
145 150 155 160
Lys Leu Leu Glu Gln Gly Ser Asn Asp Ala Ser Arg Trp Arg Lys Arg
165 170 175
Lys Pro His Phe Pro Leu Arg Lys Thr Gly Leu Lys Gln Lys Ile Val
180 185 190
Ile Lys Val Ala Met Glu Gly Asn Asn Cys Arg Ser Lys Ala Met Ala
195 200 205
Leu Val Ala Ser Thr Gly Gly Val Asp Ser Val Ala Leu Val Gly Asp
210 215 220
Leu Arg Asp Lys Ile Glu Val Val Gly Tyr Gly Ile Asp Pro Ile Lys
225 230 235 240
Leu Ile Ser Ala Leu Arg Lys Lys Val Gly Asp Ala Glu Leu Leu Gln
245 250 255
Val Ser Gln Ala Asn Lys Asp Val Lys Glu Thr Thr Pro Met Leu Ala
260 265 270
Pro Val Lys Ser Ile Cys Glu Phe His Lys Val Lys Thr Val Cys Ile
275 280 285
Leu Gly Leu Pro Gly Gly Gly Lys Thr Thr Val Ala Arg Glu Leu Tyr
290 295 300
Asp Ala Leu Gly Thr His Phe Pro Cys Arg Val Phe Val Ser Val Ser
305 310 315 320
Pro Ser Ser Ser Pro Ser Pro Asn Leu Thr Lys Thr Leu Ala Asp Ile
325 330 335
Phe Ala Gln Ala Gln Leu Gly Val Thr Asp Thr Leu Ser Thr Ser Tyr
340 345 350
Gly Gly Ser Gly Thr Gly Arg Ala Leu Gln Gln His Leu Ile Asp Asn
355 360 365
Ile Ser Ala Phe Leu Leu Asn Lys Lys Tyr Leu Ile Val Ile Asp Asp
370 375 380
Ile Trp His Trp Glu Glu Trp Glu Val Ile Arg Lys Ser Ile Pro Lys
385 390 395 400
Asn Asp Leu Gly Gly Arg Ile Ile Met Thr Thr Arg Leu Asn Ser Ile
405 410 415
Ala Glu Lys Cys His Thr Asp Asp Asn Asp Val Phe Val Tyr Glu Val
420 425 430
Gly Asp Leu Asp Asn Asn Asp Ala Leu Ser Leu Ser Trp Gly Ile Ala
435 440 445
Thr Lys Ser Gly Ala Gly Asn Arg Ile Gly Thr Gly Glu Asp Asn Pro
450 455 460
Cys Tyr Asp Ile Val Asn Met Cys Tyr Gly Met Pro Leu Ala Leu Ile
465 470 475 480
Trp Leu Ser Ser Ala Leu Val Gly Glu Ile Glu Glu Leu Gly Gly Ala
485 490 495
Glu Val Lys Lys Cys Arg Asp Leu Arg His Ile Glu Asp Gly Ile Leu
500 505 510
Asp Ile Pro Ser Leu Gln Pro Leu Ala Glu Ser Leu Cys Leu Gly Tyr
515 520 525
Asn His Leu Pro Leu Tyr Leu Arg Thr Leu Leu Leu Tyr Cys Ser Ala
530 535 540
Tyr His Trp Ser Asn Arg Ile Glu Arg Gly Arg Leu Val Arg Arg Trp
545 550 555 560
Ile Ala Glu Gly Phe Val Ser Glu Glu Lys Glu Ala Glu Gly Tyr Phe
565 570 575
Gly Glu Leu Ile Asn Arg Gly Trp Ile Thr Gln His Gly Asp Asn Asn
580 585 590
Ser Tyr Asn Tyr Tyr Glu Ile His Pro Val Met Leu Ala Phe Leu Arg
595 600 605
Cys Lys Ser Lys Glu Tyr Asn Phe Leu Thr Cys Leu Gly Leu Gly Ser
610 615 620
Asp Thr Ser Thr Ser Ala Ser Ser Pro Arg Leu Ile Arg Arg Leu Ser
625 630 635 640
Leu Gln Gly Gly Tyr Pro Val Asp Cys Leu Ser Ser Met Ser Met Asp
645 650 655
Val Ser His Thr Cys Ser Leu Val Val Leu Gly Asp Val Ala Arg Pro
660 665 670
Lys Gly Ile Pro Phe Tyr Met Phe Lys Arg Leu Arg Val Leu Asp Leu
675 680 685
Glu Asp Asn Lys Asp Ile Gln Asp Ser His Leu Gln Gly Ile Cys Glu
690 695 700
Gln Leu Ser Leu Arg Val Arg Tyr Leu Gly Leu Lys Gly Thr Arg Ile
705 710 715 720
Arg Lys Leu Pro Gln Glu Met Arg Lys Leu Lys His Leu Glu Ile Leu
725 730 735
Tyr Val Gly Ser Thr Arg Ile Ser Glu Leu Pro Gln Glu Ile Gly Glu
740 745 750
Leu Lys His Leu Arg Ile Leu Asp Val Arg Asn Thr Asp Ile Thr Glu
755 760 765
Leu Pro Leu Gln Ile Arg Glu Leu Gln His Leu His Thr Leu Asp Val
770 775 780
Arg Asn Thr Pro Ile Ser Glu Leu Pro Pro Gln Val Gly Lys Leu Gln
785 790 795 800
Asn Leu Lys Ile Met Cys Val Arg Ser Thr Gly Val Arg Glu Leu Pro
805 810 815
Lys Glu Ile Gly Glu Leu Asn His Leu Gln Thr Leu Asp Val Arg Asn
820 825 830
Thr Arg Val Arg Glu Leu Pro Trp Gln Ala Gly Gln Ile Ser Gln Ser
835 840 845
Leu Arg Val Leu Ala Gly Asp Ser Gly Asp Gly Val Arg Leu Pro Glu
850 855 860
Gly Val Cys Glu Ala Leu Ile Asn Gly Ile Pro Gly Ala Thr Arg Ala
865 870 875 880
Lys Cys Arg Glu Val Leu Ser Ile Ala Ile Ile Asp Arg Phe Gly Pro
885 890 895
Pro Leu Val Gly Ile Phe Lys Val Pro Gly Ser His Met Arg Ile Pro
900 905 910
Lys Met Ile Lys Asp His Phe Arg Val Leu Ser Cys Leu Asp Ile Arg
915 920 925
Leu Cys His Lys Leu Glu Asp Asp Asp Gln Lys Phe Leu Ala Glu Met
930 935 940
Pro Asn Leu Gln Thr Leu Val Leu Arg Phe Glu Ala Leu Pro Arg Gln
945 950 955 960
Pro Ile Thr Ile Asn Gly Thr Gly Phe Gln Met Leu Glu Ser Phe Arg
965 970 975
Val Asp Ser Arg Val Pro Arg Ile Ala Phe His Glu Asp Ala Met Pro
980 985 990
Ash Leu Lys Leu Leu Glu Phe Lys Phe Tyr Ala Gly Pro Ala Ser Asn
995 1000 1005
Asp Ala Ile Gly Ile Thr Asn Leu Lys Ser Leu Gln Lys Val Val
1010 1015 1020
Phe Arg Cys Ser Pro Trp Tyr Lys Ser Asp Ala Pro Gly Ile Ser
1025 1030 1035
Ala Thr Ile Asp Val Val Lys Lys Glu Ala Glu Glu His Pro Asn
1040 1045 1050
Arg Pro Ile Thr Leu Leu Ile Asn Ala Gly Tyr Lys Glu Ile Ser
1055 1060 1065
Thr Glu Ser His Gly Ser Ser Glu Asn Ile Ala Gly Ser Ser Gly
1070 1075 1080
Ile Asp Thr Glu Pro Ala Gln Ala Gln His Asp Asn Leu Pro Ala
1085 1090 1095
Val Arg Asp Asp Tyr Lys Gly Lys Gly Ile Leu Leu Asp Gly Arg
1100 1105 1110
Cys Pro Thr Cys Gly Arg Ala Thr Lys Ile Glu Glu Glu Thr Gln
1115 1120 1125
Asp Arg Val Ala Asp Ile Glu Ile Gln Thr Glu Thr Thr Ser
1130 1135 1140
<210>4
<211>1021
<212>PRT
<213>稻属水稻(Orysa sativa L.)
<400>4
Met Glu Leu Val Val Gly Ala Ser Glu Ala Thr Met Lys Ser Leu Leu
1 5 10 15
Gly Lys Leu Gly Asn Leu Leu Ala Gln Glu Tyr Ala Leu Ile Ser Gly
20 25 30
lle Arg Gly Asp Ile Gln Tyr Ile Asn Asp Glu Leu Ala Ser Met Gln
35 40 45
Ala Phe Leu Arg Asp Leu Ser Asn Val Pro Glu Gly His Ser His Gly
50 55 60
His Arg Met Lys Asp Trp Met Lys Gln Ile Arg Asp Ile Ala Tyr Asp
65 70 75 80
Val Glu Asp Cys Ile Asp Asp Phe Ala His Arg Leu Pro Gln Asp Ser
85 90 95
Ile Ser Asp Ala Lys Trp Ser Phe Leu Leu Thr Lys Ile Tyr Glu Leu
100 105 110
Trp Thr Trp Trp Pro Arg Arg Val Ile Ala Ser Asn Ile Ala Gln Leu
115 120 125
Lys Val Arg Ala Gln Gln Ile Ala Asp Arg Arg Ser Arg Tyr Gly Val
130 135 140
Asn Asn Pro Glu His Leu Asp Ser Ser Ser Ser Ala Arg Thr Arg Ala
145 150 155 160
Val Asn Tyr Glu Ile Ala Glu Tyr Gln Val Thr Ser Pro Gln Ile Ile
165 170 175
Gly Ile Lys Glu Pro Val Gly Met Lys Thr Val Met Glu Glu Leu Glu
180 185 190
Val Trp Leu Thr Asn Pro Gln Ala Glu Asn Gly Gln Ala Val Leu Ser
195 200 205
Ile Val Gly Phe Gly Gly Val Gly Lys Thr Thr Ile Ala Thr Ala Leu
210 215 220
Tyr Arg Lys Val Ser Asp Lys Phe Gln Cys Arg Ala Ser Val Ala Val
225 230 235 240
Ser Gln Asn Tyr Asp Gln Gly Lys Val Leu Asn Ser Ile Leu Ser Gln
245 250 255
Val Ser Asn Gln Glu Gln Gly Ser Ser Thr Thr Ile Ser Glu Lys Lys
260 265 270
Asn Leu Thr Ser Gly Ala Lys Ser Met Leu Lys Thr Ala Leu Ser Leu
275 280 285
Leu Arg Gly Asn Cys Ile Cys Gln Pro Glu Asn Asp Gly Asn Pro Asp
290 295 300
Asn Thr Pro Ile Arg Leu Gln Glu Thr Thr Asp Asp Asp Gln Asn Pro
305 310 315 320
Arg Lys Leu Glu Gln Leu Leu Ala Glu Lys Ser Tyr Ile Leu Leu Ile
325 330 335
Asp Asp Ile Trp Ser Ala Glu Thr Trp Glu Ser Ile Arg Ser Ile Leu
340 345 350
Pro Lys Asn Asn Lys Gly Gly Arg Ile Ile Val Thr Thr Arg Phe Gln
355 360 365
Ala Val Gly Ser Thr Cys Ser Pro Leu Glu Thr Asp Arg Leu His Thr
370 375 380
Val Asp Phe Leu Thr Asp Asp Glu Ser Gln Asn Leu Phe Asn Thr Ser
385 390 395 400
Ile Cys Glu Ser Lys Ile Arg Lys Asp Ser Asn Lys Val Asp Glu Gln
405 410 415
Val Pro Glu Glu Ile Trp Lys Ile Cys Gly Gly Leu Pro Leu Ala Ile
420 425 430
Val Thr Met Ala Gly Leu Val Ala Cys Asn Pro Arg Lys Ala Cys Cys
435 440 445
Asp Trp Ser Lys Leu Cys Lys Ser Leu Phe Pro Glu Gln Glu Thr Pro
450 455 460
Leu Thr Leu Asp Gly Val Thr Arg Ile Leu Asp Cys Cys Tyr Asn Asp
465 470 475 480
Leu Pro Ala Asp Leu Lys Thr Cys Leu Leu Tyr Leu Ser Ile Phe Pro
485 490 495
Lys Gly Trp Lys Ile Ser Arg Lys Arg Leu Ser Arg Arg Trp Ile Ala
500 505 510
Glu Gly Phe Ala Asn Glu Lys Gln Gly Leu Thr Gln Glu Arg Val Ala
515 520 525
Glu Ala Tyr Phe Asn Gln Leu Thr Arg Arg Asn Leu Val Arg Pro Met
530 535 540
Glu His Gly Ser Asn Gly Lys Val Lys Thr Phe Gln Val His Asp Met
545 550 555 560
Val Leu Glu Tyr Ile Met Ser Lys Ser Ile Glu Glu Asn Phe Ile Thr
565 570 575
Val Val Gly Gly His Trp Gln Met Thr Ala Pro Ser Asn Lys Val Arg
580 585 590
Arg Leu Ser Met Gln Ser Ser Gly Ser Asn Arg Gly Ser Ser Thr Lys
595 600 605
Gly Leu Asn Leu Ala Gln Val Arg Ser Leu Thr Val Phe Gly Asn Leu
610 615 620
Asn His Met Pro Phe His Ser Phe Asn Tyr Gly Ile Ile Gln Val Leu
625 630 635 640
Asp Leu Glu Asp Trp Lys Gly Leu Lys Glu Arg His Met Thr Glu Ile
645 650 655
Cys Gln Met Leu Leu Leu Lys Tyr Leu Ser Ile Arg Arg Thr Glu Ile
660 665 670
Ser Lys Ile Pro Ser Lys Ile Gln Lys Leu Glu Tyr Leu Glu Thr Leu
675 680 685
Asp Ile Arg Glu Thr Tyr Val Arg Asp Leu Pro Lys Ser Ile Val Gln
690 695 700
Leu Lys Arg Ile Ile Ser Ile Leu Gly Gly Asn Lys Asn Thr Arg Lys
705 710 715 720
Gly Leu Arg Leu Pro Gln Glu Lys Ser Lys Lys Pro Ile Lys Asn Pro
725 730 735
Ser Pro Gln Gly Lys Thr Lys Glu Pro Ala Lys Lys Gly Phe Leu Ser
740 745 750
Gln Glu Lys Gly Lys Gly Ala Met Lys Ala Leu Arg Val Leu Ser Gly
755 760 765
Ile Glu Ile Val Glu Glu Ser Ser Glu Val Ala Ala Gly Leu His Gln
770 775 780
Leu Thr Gly Leu Arg Lys Leu Ala Ile Tyr Lys Leu Asn Ile Thr Lys
785 790 795 800
Gly Gly Asp Thr Phe Lys Gln Leu Gln Ser Ser Ile Glu Tyr Leu Gly
805 810 815
Ser Cys Gly Leu Gln Thr Leu Ala Ile Asn Asp Glu Asn Ser Glu Phe
820 825 830
Ile Asn Ser Leu Gly Asp Met Pro Ala Pro Pro Arg Tyr Leu Val Ala
835 840 845
Leu Glu Leu Ser Gly Lys Leu Glu Lys Leu Pro Lys Trp Ile Thr Ser
850 855 860
Ile Thr Thr Leu Asn Lys Leu Thr Ile Ser Val Thr Val Leu Arg Thr
865 870 875 880
Glu Thr Leu Glu Ile Leu His Ile Leu Pro Ser Leu Phe Ser Leu Thr
885 890 895
Phe Ala Phe Ser Leu Ser Ala Ala Lys Gln Asp Gln Asp Ile Ile Lys
900 905 910
Asp Ile Leu Glu Asn Asn Lys Leu Asp Ser Asp Gly Glu Ile Val Ile
915 920 925
Pro Ala Glu Gly Phe Lys Ser Leu Lys Leu Leu Arg Phe Phe Ala Pro
930 935 940
Leu Val Pro Lys Leu Ser Phe Leu Asp Lys Asn Ala Met Pro Ala Leu
945 950 955 960
Glu Ile Ile Glu Met Arg Phe Lys Asp Phe Glu Gly Leu Phe Gly Ile
965 970 975
Glu Ile Leu Glu Asn Leu Arg Glu Val His Leu Lys Val Ser Asp Gly
980 985 990
Ala Glu Ala Ile Thr Lys Phe Leu Val Asn Asp Leu Lys Val Asn Thr
995 1000 1005
Glu Lys Pro Lys Val Phe Val Asp Gly Ile Val Thr Ala
1010 1015 1020
<210>5
<211>9382
<212>DNA
<213>稻属水稻(Orysa sativa L.)
<220>
<221>promoter
<222>(1)..(1286)
<220>
<221>5’UTR
<222>(1287)..(1346)
<220>
<221>CDS
<222>(1347)..(1910),(2061)..(2628),(5400)..(7696)
<220>
<221>3’UTR
<222>(7697)..(7859)
<400>5
cagaagattg gggggggggg ggggggggga cagcggcgcg gtgggaagat tggggccttg 60
ggggtgaggc ttttgggcgg aggccaataa tactgcactg cattcgcaga ctcgttgact 120
gactcgtggt catgccgttt ctccctaata tttcctccgt cctttaatat aatattaaag 180
tctttaatat tatacttgta ctactccctc cattataaaa tataagtcat aaccacccct 240
aacccaaaca ccaagaaaat aattattatt atcttgtagt ttagaacatc ctaataaata 300
caatgcatgc atccaataga atcagagagc atgaataata atttaataaa aaagaggtca 360
tagttaattg cctatatgca tgcatgcctt atattatgaa acatctaaga aaagtggttg 420
tgccttatat tatggaatag agggagtcct atttaaccat ttatcttatt ttaaaaaaat 480
tatgcaaata ataaaaataa aaagttgtgc ttaaaagatt ttagataata aaataagtaa 540
aagaaaatta aataataatt ttcaaacttt tttttaataa aacgaaggat cgtactccct 600
ccattctaaa ttgatctacg tatttcatag gtacaccaag accaagaaaa actaataact 660
ctttcatatt atatttactc taacaacaaa ctcaatgcat gcatcatccc caatatttcc 720
tagccaatag caaatcaaga tattgcatgt gggttataaa tacttgtgtg catggaagca 780
tgcatcaatg tccatttact ccaatgcaca caaataacga atagacttaa taaatgaaca 840
caaatatata gatcatttag gaataaccta aaaaaactat atatagatca atttagaatg 900
aagggagtac aaagtaaaat gttaaaatac cttatattag agaacaaacg aactaattat 960
gtgaaagcag ttctctaacc cattgcgaga ggaccgagaa aaataccacc actttaagag 1020
cagttacaat agcaggctat aagccaactg caaacatatt ttaaggagat aaataaggag 1080
agagaagggt agcgggctat agatttgtag ccaactgtag cacagactcc aagacacagt 1140
gtgtatgaca ggtgggacca ggtattaata gtatagtact tgtatgaata agctattaga 1200
ttggctataa ataaattgaa gctactactt ggctgtactg ttaaccttgc tctaattaca 1260
ctcggagtcg gagagtcagg aggtgagaga gtagcactca acgcaaaagg ggatcggccg 1320
agccgccgat cgaggcgcga gtagggatgg aggcggctgc catggccgta accgcagcca 1380
cgggggcctt ggcgcccgtg ctagtgaagc tggccgcttt gctggacgac ggggagtgca 1440
atcttctgga ggggagccgg agcgacgcag agttcatcag atccgagctg gaggccgttc 1500
attctctcct caccccaaat atcttgggga ggatggggga tgacgatgcg gcgtgcaagg 1560
atggcttgat tgcggaggtc cgggagctgt cctacgacct ggatgatgcc gtcgacgact 1620
tcttggagct caatttcgag cagcgaagaa gcgcaagccc tttcggtgag ctcaaggcaa 1680
gagttgagga gcatgtctcc aatcgcttct ctgactggaa gctaccggcg gcgagccttc 1740
cgccgtcgtc ggtacaccgc cgagctggct tgccgccacc agatgcagag ctggtgggga 1800
tggacaaacg tatggaagag ctcaccaaat tgctggaaca agggagcaat gatgcttcac 1860
gatggcgcaa gcgaaaaccg catttcccgc tcagaaaaac agggctaaag gtacggttgg 1920
atctcgattc ttcacaaata tagcttcctc cgatagcagt gccaatgaat ttgtttactt 1980
ctctcttgat tctttaattt ggaagtactg tataacaaac atggagggat cgtcatttaa 2040
cttaattttt tttgttgcag caaaaaatcg tgatcaaggt tgccatggag ggcaataatt 2100
gccgttcaaa agcaatggct ttagttgcga gcactggagg agtggactcg gttgcgctcg 2160
taggtgatct aagagacaag atagaggtgg tcggttatgg cattgacccc atcaagctga 2220
tctccgcgct ccggaagaag gtgggcgatg cggagttgct gcaggtcagc caagcaaata 2280
aagatgtgaa ggagacgacg ccgatgcttg cgccggtgaa atccatatgt gaatttcaca 2340
aggtcaaaac agtttgcatc cttggattgc caggtggagg caaaacaacg gttgccagag 2400
aattatatga cgccttggga acgcacttcc catgccgggt tttcgtgtca gtctctccaa 2460
gttctagtcc cagtcccaat ctcacaaaga ctcttgcaga cattttcgct caagcacaac 2520
taggagtaac cgatacactt agcacaycat atggtgggag tgggaccggg agagctcttc 2580
aacaacatct catcgacaac atatcagctt tcctcctcaa caaaaagtaa gcaatatctt 2640
atatctctta tctagcttgt ccttaatccg gcttcccaaa ttaaagtgaa caggctatat 2700
gtccatataa tgttgggttt tttttttctg tcctcgatga ggtatatatc aagatatgcc 2760
aaactttttt taattagcgt tactttattc tacctagtaa atagatacag aaatggggca 2820
ggcccctcct ttgtgacagg ccaatggttt tatgatgtgg aaacaactcc cttatcactg 2880
aagctcttgc tattactcgt gatggtactg atcttgcaaa ggcttgcagt atttgtaaat 2940
ccacagggag tggtcatctt ttttttttta agataatgaa tagaaatctg gcctctatat 3000
agaaagccaa aggttacaca agtgcataca ctccaaatct caaagctgag aaacacgaaa 3060
aacaaacgac caaaaagaaa agacgataag attaagtttc aagcttttcc tccttgcttc 3120
agaaaacagg gaaatgcaga gctagcccgc tcttcccctg ccatgccctc tcactgccat 3180
tgcaactacc acaaaagctc ccctgtttcg ccgtgagcga agggagggga tggaactagt 3240
ctgctgcttt gttgtcgcca agcactggtt tcatgctgcc caaacaacct atgccccgct 3300
agcgataacg agtctgttgc cgccatgctg ttgtcgctgg agagcacttg tcgctgctca 3360
gctatgtgtt aagtaagcta ctaacaatct ctccattctc ttgtagactg cttataaaag 3420
attgtacttc aactttttat taatgaaata gagttcctgc cttacaacaa caagaacgag 3480
agagagagag agagccaata tcacgagtta gcactgtgaa gacttcctga ctgttatgtc 3540
ttcaagaaca gtataaaagc atttgccact ataaaaatat gatagccctc aagatcatta 3600
aaaatatgtc atgttctttt accttcataa caaccagtat cacaaatatg aaaacattta 3660
aaggaaaatc agtatttgcc atttatgtaa agaaaaacta gactagaact ctgggctttt 3720
attttagtgc acactgtata catgtgtgtt ctttgacgta tatacttgat attttttcaa 3780
aaacttttgg gggcatatct gtaacccatg ccccatgcat ccaaagtagt agtgagacaa 3840
ctttcgtatc aatggacatg tggcattaaa tttaatccca atatttgttc tctcgaagtc 3900
ctgaacatat aattggacaa aattatataa tttaattagg ccaattgagc catatataca 3960
cacactataa gaaacttcta caaaacaaac attcgctcct tttaggtttt gaccagtcta 4020
tacatgaacc agccaacggt gtgctggtct gactataaat gtttttttag agagtaaagg 4080
caccttccag tacctctctc gaggacagta aaaacaccca tgctttaagg ttatgaacaa 4140
caaagcatag aacaaccaca agaaaaaaaa aaacaactag aacaagttct ggaactagtt 4200
agggattcgg gaagccattg cttgctggct tgggacaagc tatgcctgcc attggactat 4260
ggtggagact gggcataaag gatctcaatc tcttggagat tggcctatgt cctaggtggc 4320
catggctgac tgcaaggaga tatatttctt gcaacggaaa ttttcattgc aaaagctgaa 4380
aatcgttgcg atatgtacct attgtgacat aattgaatcc attgggagcc gtagcaataa 4440
atcccgtggt aataggctat tgcaatgatt ttccgatggt tgcataaaaa ggtgctatta 4500
caatggtttt gcattgctat tgcaacaact tgatatgtca taggtattta ttgcaacagt 4560
ttgtaccatg ctgcaataag atttcgcaac acagacctat tcattaaata gcctatttcc 4620
acacattcat ttttgtggca ataattcagt gcgacaagca tgttctatag ctataacctg 4680
atgcaacggg ttagacttgt tgcagtaact tattagcaca tttaattttt tactaaattt 4740
atattgattc aactatttca acatcattca atacagtaaa tactggaaaa tgcaagcaat 4800
ctccaaaaat tcaccaataa gagaaacgaa ttaaatcata tacacaatta tttcacaccg 4860
tacagtgcac ttaagtgtag gaaatataat gctatataca tttccaaccc tcaatacagg 4920
ttctgtaaat aatgaaaaga aacatcaatt tgatttcttg accggtccct ctgagacaca 4980
cagtgctttc ctcagcaaca tcgttgccga tctatgtaac ttgtaaactc cttgtgctta 5040
ataaattctg gccagggtta cttcagcctg catggcgaaa aaaaagaact tgttagggag 5100
atcgcaaaca aatgttagct gccccatttc ttctctactg accataaaag ctctaggatc 5160
aggtcatgaa aacaatgccg tcagaccgga aatactcaag attcagaaca cgactcctag 5220
gaactgcaag tttaaatttc aacaaaagca aaatacaacc agatgacaaa ctaatatata 5280
tcagctgact gttcatctca agtggactca catctgtgaa attgttagtt ctatgcatat 5340
atgctaatgt aatactaata tgctattgag gcttattact cttctcgact tttataggta 5400
tctcattgta atcgatgaca tttggcattg ggaagaatgg gaagtcatca gaaagtccat 5460
tcccaagaat gatctgggtg gtagaataat catgactact cgtcttaatt caatagctga 5520
gaagtgccac actgatgaca atgatgtttt tgtctacgaa gttggggatc tagataataa 5580
tgatgctttg tcgttgtctt gggggatagc aacaaagtct ggggcaggca acaggatcgg 5640
aactggagag gataatccat gctatgatat tgtgaacatg tgttatggta tgcctttagc 5700
acttatttgg ctgtcgtcag cattggttgg agagatagaa gaattaggtg gtgctgaagt 5760
gaaaaaatgt agggatttga gacacataga ggatggtatt ttggacatcc catccttaca 5820
accattggcg gagagtttat gccttggtta taaccatctt cctctttatc tgaggacttt 5880
gttgttgtac tgtagtgcat accattggtc taacagaatc gaaaggggtc gtctggtcag 5940
gaggtggatt gcggaaggat ttgtgtcgga agagaaagaa gcagaaggtt actttggcga 6000
gcttattaac agaggatgga ttacgcagca cggagacaac aacagttata attactatga 6060
gatccacccc gtgatgctgg ccttcctgag atgcaagtcc aaggagtaca attttttaac 6120
atgcttgggt ctgggatctg atactagtac tagtgcatcc tccccaaggt tgattcgccg 6180
gctgtctctt cagggggggt atccagtgga ctgcttgtca agcatgagta tggatgtgtc 6240
acacacttgc agccttgtcg tccttggtga cgtggcgcga cccaagggaa tccccttcta 6300
tatgtttaag cgcttgcgag tgttggacct tgaagataat aaggatatac aggattctca 6360
tctgcagggc atatgtgaac agttaagcct cagagtgagg taccttggtc tcaagggaac 6420
gcggatccga aagctccctc aggagatgag gaagctgaag catttggaga ttttgtatgt 6480
ggggagcact cggatcagtg aacttccgca agagattgga gagctgaagc atctgcggat 6540
tctggacgtg agaaacacgg acatcactga gctcccactg cagatacggg agctgcagca 6600
tctgcacact ctggacgtga ggaacactcc aatcagtgag ctcccgccgc aggttggcaa 6660
gctgcagaat ctcaagatta tgtgcgtgag gagcactggg gttagggagc tcccaaagga 6720
gattggggag ctgaatcatc tacagactct ggacgtgaga aacacgaggg tgagagagct 6780
gccatggcaa gctggccaga tctcccaatc gttgcgcgtg cttgccggtg acagtggcga 6840
tggcgtgcgg ttgcccgaag gcgtctgcga agctctgatc aacggtattc caggggctac 6900
gcgtgcaaaa tgcagggagg ttctgtccat cgcgatcatc gatcgtttcg gacctcccct 6960
tgttgggata ttcaaagttc ccggcagtca tatgcgtatc ccgaagatga tcaaagacca 7020
cttccgcgtt ctttcttgcc tagacatcag gctctgccac aagcttgagg atgatgacca 7080
aaagttcctc gccgagatgc ccaacctgca gacgctcgtg ctgaggttcg aggccctacc 7140
aagacaaccc ataaccatca acggcacagg cttccagatg ctggagagct tccgtgtcga 7200
cagccgggtg ccaaggatag ccttccatga agacgccatg cccaacctca agcttctcga 7260
gttcaagttc tacgccggcc cagcaagcaa cgatgccatc ggcatcacca acctgaagag 7320
cctccaaaag gtggtctttc ggtgctcgcc atggtacaag agcgacgccc ctggcatcag 7380
cgccaccatt gacgtcgtga agaaagaagc cgaggagcat cccaaccggc cgatcaccct 7440
cctcatcaat gctgggtata aggagatatc aactgagtca cacgggagca gtgaaaacat 7500
tgcgggcagc agtgggatcg atactgagcc tgcacaggca cagcatgata atctccctgc 7560
tgttcgagat gactacaagg gaaaagggat tcttcttgat ggcaggtgtc ctacctgcgg 7620
ccgagcgact aaaattgaag aggaaaccca agatcgagta gcagatattg aaattcaaac 7680
agaaactact agctagctag taccgctgtc taattttatt tgtattaaat actctccttc 7740
atgtaatact ctaagtcccc gctcaatttt tcttgcctcc gcaactctac atgatactca 7800
aaactgattt cttacctccc tttgattgta gaaaaaacat gggaagtttg gaggatttca 7860
atcctatgaa aaggactttg agaaacaaat tacggaatcc tatcttttcc tttaaaagat 7920
atataataga aaatcctata gagattttta taggaaagtt aacaaaagcc tcaatcttat 7980
ggagaattcc tgtgtttatt tgcatagagg aaagtgagac tcacctaatc aggggcggtt 8040
ctagtgggtg gtccaggtga tccctggacc accctaaaat ttggcccatg agaaccgccc 8100
cccccctcct cctcccgggg ttggtcgaaa ccagaaaaaa ggccactgag ctaagctagg 8160
gttagggaga ggtgtagcca tgaactagcc acagcagcac gcatcgctgg ctgccgcctc 8220
cgcctctgcg tcagcgccag cgtgccgtct ccccggctct cgactctgcc tccgcctccg 8280
ccttggcgcc gatgcgtcgc ctcctcggct cccgcttcta cctcagcgcc agcgtgccgc 8340
ctccgcctta gcgccgcctc ctcgcctcct cccttccccg actcctccag tcccccactc 8400
ctgtcggttg ctatcgtcgg cgggcggcga ccaacgccta caagccaact acggctgcct 8460
gggcaagccg gcggccaccc ccgcccctgg aaagggaaaa cattgccatt ccacaggcgg 8520
cgccaggtcc cctttctctc tagctttctt gttgctcagc aaagttcaaa ggtcagtaga 8580
catttcttgt tggttttttg caatgttgcc ctatgcagta gtgagtagtc agtactcagt 8640
agtagtgcta gtgaaaaatt tctgccttga gtgtagtgaa aagcaagctg agaaatagca 8700
tgtgtgacaa attattaaat gatggtttaa tcacatctat tgagcgggtt attttctctc 8760
aagttagcaa agaagacatt atgcataatt tcatgtctat atatgaacgg agagtagaaa 8820
agaagtaggg ttttgtaatc ttctacttat ctattataag atatttgaat gattatgttc 8880
taactttcgt aacttcaaat ttagatgttg gttttaagca agttgtgatg ttgtgagata 8940
tattgctaca tttcaattat atttttatta tttttgacac atttggcttg aatttgtaac 9000
aatgtgacaa tttggagcat caagtaggac tacccgacga ttcatgaaac caatgtgccg 9060
atatttagcg gttcttaatc atactgaggt aagaaggcaa cattttgcaa aagtagaatt 9120
tcttcatgaa aatcaggctt atagctttaa tgttcacaaa ctagctaagt ttccttgtac 9180
cttagaggag gggcatcatg ttcgattctc aaatactcct tgtaatctaa gtaggaatgt 9240
aaataatttg gttgttaatg aataaaatgc cagaatttta ttctaaaaga atatgtaatt 9300
caaattaata ttattctaga tgtatcttga atagcccatt agcactggac cagtctttca 9360
agttgggtca gtggtcggta cc 9382
<210>6
<211>8806
<212>DNA
<213>稻属水稻(Orysa sativa L.)
<220>
<221>promoter
<222>(1)..(1263)
<220>
<221>5’UTR
<222>(1264)..(1418),(2300)..(2408)
<220>
<221>CDS
<222>(2409)..(3400),(3565)..(5638)
<220>
<221>3’UTR
<222>(5639)..(5921)
<400>6
gagtgtaatt agagcaaggt taacagtaca gccaagtagt agcttcaatt tatttatagc 60
caatctaata gcttattcat acaagtacta tactattaat acctggtccc acctgtcata 120
cacactgtgt cttggagtct gtgctacagt tggctacaaa tctatagccc gctacccttc 180
tctctcctta tttatctcct taaaatatgt ttgcagttgg cttatagcct gctattgtaa 240
ctgctcttaa agtggtggta tttttctcgg tcctctcgca atgggttaga gaactgcttt 300
cacataatta gttcgtttgt tctctaatat aaggtatttt aacattttac tttgtactcc 360
cttcattcta aattgatcta tatatagttt ttttaggtta ttcctaaatg atctatatat 420
ttgtgttcat ttattaagtc tattcgttat ttgtgtgcat tggagtaaat ggacattgat 480
gcatgcttcc atgcacacaa gtatttataa cccacatgca atatcttgat ttgctattgg 540
ctaggaaata ttggggatga tgcatgcatt gagtttgttg ttagagtaaa tataatatga 600
aagagttatt agtttttctt ggtcttggtg tacctatgaa atacgtagat caatttagaa 660
tggagggagt acgatccttc gttttattaa aaaaaagttt gaaaattatt atttaatttt 720
cttttactta ttttattatc taaaatcttt taagcacaac tttttatttt tattatttgc 780
ataatttttt taaaataaga taaatggtta aataggactc cctctattcc ataatataag 840
gcacaaccac ttttcttaga tgtttcataa tataaggcat gcatgcatat aggcaattaa 900
ctatgacctc ttttttatta aattattatt catgctctct gattctattg gatgcatgca 960
ttgtatttat taggatgttc taaactacaa gataataata attattttct tggtgtttgg 1020
gttaggggtg gttatgactt atattttata atggagggag tagtacaagt ataatattaa 1080
agactttaat attatattaa aggacggagg aaatattagg gagaaacggc atgaccacga 1140
gtcagtcaac gagtctgcga atgcagtgca gtattattgg cctccgccca aaagcctcac 1200
ccccaaggcc ccaatcttcc caccgcgccg ctgtcccccc cccccccccc ccccaatctt 1260
ctggcctggc cagctctggt cggaggaggg ttcagttcca tgggtgcgcc ccctgcccat 1320
ccctccactt ctactttcgt cttgagatct cgccgtttct gatctccttg ccatggcaac 1380
cagttgattt ggctgccaag atcgtccgtg ctttgcaaag gtgaacaatc tttttcttct 1440
tgggctttca ctttgttgtg ttaatagact cagcgccgtc gcaacttgca aataaaaagg 1500
ttcagatcgg aaaatggaaa taccgtacac agtagtctat agagaatcga ctagtatttc 1560
ctgactgtaa gtaaggaagt tgtaggtacc atgttctgat gattatcttt gaagcttgga 1620
gggggacaaa aagttgggag agaaacaaga atgtatttgt tcacttgatt gggggcagta 1680
ggtaaaatct cagcgttagg aatttgcacc tctttattaa gtttgactaa atttatagaa 1740
aaaaattagc aacacttaaa acaccaaatt agtttcattg aatccaacat tgaatatatt 1800
ttgataatat gtttactttg tgttcaaaat gttactatat ttttctataa atttgatcaa 1860
atttcaataa gtttgactag aaaaaaaagt caaaccgact tataatatga atataaatgg 1920
agggagtaga ttataaatta tctttattat aatactttga gtgtacaaac gaatgaataa 1980
agtttacaaa gttgaaaagt tatttttaaa tactattccg ataaacctgt atatagagaa 2040
cttgtgaaaa aaaacaccgt ttgaaatgtg tctgtgataa tctataagag gaacggggcc 2100
taaatatatt ccttttttcc tacgaaaatg caagaagtat tgcctaatat attgatagag 2160
ctgaagtttt aaagtaaaat gaaacatttg catgaggcag cactggacaa aaagcgcttg 2220
tgaaaaaaat atgttccaaa tattattaat tcattcatgg tatggtcttt attttcattt 2280
tcccccctta tccatgggca ggacacggaa attctcagca ggttcgcgct tgatctgaac 2340
tgtctgtggc tcatactctc ttgtgcttgc gccagctgaa agttgcagtg agaagtacag 2400
agaacaagat ggagttggtg gtaggtgctt ccgaagccac catgaaatct ctcttgggca 2460
agctgggcaa tcttctagcc caggagtatg ctctcatcag cggtatccgt ggtgacatcc 2520
agtacatcaa tgacgagctt gccagcatgc aggccttcct ccgtgatctc agcaacgtgc 2580
cagagggtca cagtcatggc caccggatga aggactggat gaagcagatc cgagacatcg 2640
cctatgatgt tgaggactgt atcgatgact ttgcccaccg cctccctcag gattccatca 2700
gcgatgccaa atggtccttc ctactcacaa aaatctatga actatggaca tggtggccac 2760
gtcgtgtgat tgcttccaac attgcccaac tcaaggtacg ggcacaacag atcgcagatc 2820
gacgtagtag atacggagtg aacaacccag aacaccttga cagtagcagc agtgccagga 2880
cccgtgctgt caattacgaa attgctgagt atcaggtcac aagccctcag atcattggta 2940
taaaggagcc tgtggggatg aagacggtca tggaggagct tgaggtttgg ttaactaatc 3000
ctcaagctga aaatgggcaa gctgttctgt ccatagtcgg ttttggaggt gtgggaaaga 3060
ctaccattgc cacagcattg tacagaaaag tcagtgataa atttcagtgc cgggcatcag 3120
tagctgtgtc tcagaactat gaccaaggca aagtcctcaa tagtattctg agtcaagtca 3180
gcaatcagga gcagggcagc agcacaacaa ttagtgagaa aaagaacctc acctcaggcg 3240
ctaagagcat gttgaagaca gccctgtcac tgctcagagg taattgtata tgtcagccag 3300
aaaatgatgg aaaccctgat aatacaccaa tcaggctgca ggaaacaacg gacgatgatc 3360
aaaaccccag aaaactggaa cagctcctgg ccgaaaagag gtaccttttt ttgtaaataa 3420
aattgctttg cttatctgta aattaactta ctcatcccac tctaaatcta atgtttattt 3480
ccttctatac acagcacaac tccatctttt gaatgggttt tatttttctc acttgtgctc 3540
attttttttt tatcatctct gcagttatat cctcttgatt gatgacattt ggtctgccga 3600
aacatgggag agtatcagat cgattttgcc taaaaataat aaaggcggta gaataatagt 3660
gactacaaga tttcaagctg ttggttcaac atgctcccct cttgaaactg atcgtttgca 3720
tacagttgat tttctcaccg atgacgagtc ccaaaactta ttcaatacaa gtatttgtga 3780
atcaaagata agaaaagata gcaacaaagt agacgagcaa gtccctgagg aaatatggaa 3840
aatatgtggg ggattgcctt tggccatagt caccatggct ggtcttgtcg cctgcaaccc 3900
aaggaaagcc tgctgcgatt ggagtaaact ttgcaaatca ttatttccag agcaagaaac 3960
tcctcttacc ctcgatggtg ttacaaggat actggattgt tgttacaatg atttgcctgc 4020
ggatctgaag acttgcttat tgtacttgag tatatttccg aagggttgga aaattagtag 4080
gaaacgtttg tcccggcgat ggatagctga aggttttgct aatgagaagc aagggttaac 4140
ccaggaaaga gttgcagagg catactttaa tcaactcaca agaaggaact tagtacgtcc 4200
catggagcat ggcagcaatg ggaaggtaaa aacgtttcaa gttcatgaca tggttcttga 4260
atacatcatg tccaaatcaa tcgaagagaa ttttattact gtggttggtg gacactggca 4320
gatgactgca ccaagcaata aagtccgtcg actgtcgatg caaagcagtg gatccaatcg 4380
tggaagttca acaaaaggcc tgaacttggc tcaagtgaga tcactgacgg tgtttgggaa 4440
cctgaaccat atgccattcc attcattcaa ctatgggata atacaggtgc tggatcttga 4500
ggactggaag ggtttgaaag agagacatat gacggagata tgtcaaatgc ttttactcaa 4560
gtatttgagc atccgacgaa cagaaatttc caaaattccc tccaagattc agaaacttga 4620
gtacttggaa actcttgaca taagggagac atatgtcagg gacctgccta agtcaatagt 4680
ccagctaaaa cggatcatta gcatacttgg agggaataaa aacacacgga aggggctgag 4740
gttgcctcaa gaaaaaagta agaagccaat taaaaacccg tcgcctcaag gaaaaacaaa 4800
ggagcccgca aagaaaggat tcttatccca agaaaaaggt aaaggcgcaa tgaaagcact 4860
ccgtgtactg tcagggattg agattgttga ggaatcatca gaagtagctg caggccttca 4920
tcagttgaca gggctaagga agcttgccat atacaagctc aatataacaa agggtggtga 4980
taccttcaaa caattacagt cctccattga gtaccttggc agctgtggtc tgcagactct 5040
ggccatcaat gatgagaatt ctgaatttat caactcactg ggcgacatgc ccgcgcctcc 5100
aagatatctt gtcgcccttg agctgtctgg caagttggag aagctaccca agtggatcac 5160
cagcatcact actctcaaca agctaaccat atctgtaaca gttcttagga ctgaaacttt 5220
ggagatcctc cacattttac cttcattgtt ttccctcacc ttcgcctttt cacttagtgc 5280
agcgaagcag gatcaggaca taataaagga catccttgag aataataaat tggacagtga 5340
tggggaaatc gtcattccag ctgaaggatt caagagtctt aagctgcttc gcttctttgc 5400
acctttagtg ccgaagctca gctttttgga caagaatgca atgccagcac tcgaaatcat 5460
tgaaatgcgg tttaaagact tcgaaggtct atttggcatc gaaatccttg aaaatctccg 5520
tgaggtgcat ctcaaagtta gtgatggggc agaagcaata accaagttcc ttgtaaatga 5580
tttgaaggtt aatactgaga aaccaaaagt atttgttgat ggcatcgtca ctgcatgaga 5640
agtaaaattg ctgcaaatcg gagaacttac caatcatctg aggcttcccc tctattatta 5700
ctctcttaga atatattgtt attattgctc accttgcaaa ataaaatagg gatggcatag 5760
catattgcta caacgtacca tggttccatc atagttgatt tcacttgtca ttacagtgtc 5820
tgttcagttg tgttttctat taataaaagg gagatctccg caagaaacca ttattatact 5880
tatattcggt tatgaactct ataaatgatg ggattgctat atttatggtg ccacaatttc 5940
catgagtgcg gtattttttt ttctagcagt tggctggtgt taagatttgt gctgccattg 6000
ctcctctatt attggtgccc aaaattacgc tttgcactat gttcacagtt gtaaaactta 6060
cctaaatttc gtgtattagt actgtaatgt tgtgattttc gcgtccagtt atattttttt 6120
ctttgccaga agtttgattt caaggtatat ctggtaagct tcagccggat tcgttaagtt 6180
tttagtttaa tcagctaaac ttttaacagt atgcatgcac tagtttctcc cgttatctta 6240
tccaatataa ttaatgggct aattggatct atgccattac aaacatggca agattcagaa 6300
aaatgctact acaatttgtc tattcatagc tgtgctagtg aaattttaca aaattggaac 6360
cgtgctattg atatcacgtt ttccatccat cttttctttt ttttcctttt cttctttatc 6420
ccgtcttctt cccgcaccga cgagaggcga gtggcggcgg gaccggcaag cgatggcagg 6480
acctcggcag ctcagtggtg gcagcggcaa aagggaaagg ggaaggggta ggagagaacc 6540
tggcccgccg acgcaagcat gttggtcgtc gtcttccccg aggagcttga gcagtggcaa 6600
tggtagagga atccaaggag gccgaggagg ttgagcggca ctaatggtag aggatgccga 6660
ggaggtcgag catcagcgag cttgccctgg gtgggcgagt gagggagagc tctcccaagt 6720
gtctgcgaag ctctcatctt cgttttcctc atccccactt tgctcctctt ccactactgc 6780
tgttgaagct agcggccctc tctcaccaga tctgggcgga gctcaagcga ggcgccacca 6840
ccgacgatgg agcttcgcgt ggaccccaca gcctcacccg attgccatcc ctcctaccct 6900
catcactcca tctccaagcc caaatcccac ctcccccacc ggctcctcct cctcagcccc 6960
tgccgtggca gccgaggcca ccgatgggcc aaccctcaac atcgtctcga agcagctccg 7020
cgcactgtgg aagaagcaca accaaatcct ccagatggag gagtcgctca ctggcgggag 7080
gaagctgaac aaggagcagg aggaggtgct ccgatccaag cccatcgtca tcgcgctcat 7140
tgatgagctc gagcggatgc gtgccctgct cgccgacgcc ctcgccgagg agctctcctc 7200
ctgccccgcc ccttccctat ctgtcactcc ctcctcctcc acctcctccg gcgctgattt 7260
gtccgtcaag gatctcctca cgctcatcta cttcggctcc ctcttcgacg tcaagtcgca 7320
gatcgagttc gtcaccacca tgctcgcacg ctaaggagct agactgatgc atcacctacg 7380
actatgtccg tccttgccgc agcacgactt gctacggtcc cgcctcacgc cccgtcgctc 7440
gccggtcccg ccgccacttg cttctcatcg gtgcgggaag aagatgggag aaagaaggaa 7500
gaaaacggga aaaaaaagag atggatgaaa aacgtgacag caatagcacg gttccaattt 7560
cgtaaaattt caatggcacc gctacgaata gaagaattgt agtagcattt ttctgaatca 7620
acatatttgt aatggcatgg atccaataaa ccccaaattt attttatgcc aatacaaata 7680
ttggcaactg gtgtcattaa cgtataaatg ggtcgcctac ataaaacctt ccgcgtcctt 7740
cttggttgaa caggtaatta attgcaacca ttttctgaaa aaattgactg gaagtctgaa 7800
accgtccctg ctactccctg aatgctttag ttcaatctct atgaagctcc atccaaacaa 7860
aatctccaga tgatatttca ttaaaaacat catgtccaga actgtccagt atccaagaat 7920
ggctatgatc ttttctgaat tctggactcg ggacttatat tacatgtatc aaatccagtt 7980
attttctgga gctacaccag atgaaatcac caagtaaaac aattatataa ggatggtgag 8040
ttgttaatta attacagctt tgatagacca ctctgtttgt taatgcctga caagagttca 8100
caaagaaaaa aatacacagg attacaaaac tttttagata ctatctgatg tgtgttaatt 8160
ctctcaaata ctatgtacct gaacaaacaa gagagaacat gttgtgaatt gtgatctcct 8220
catgcgagtg acctgaaact ctctacaggg tcagactcct caagcatcac tgcagcctgg 8280
aaacaatcag cagcatcatt gatcctccca tcattcctgt gaactctccc taaatgaagc 8340
caagccatcc tgttcgtagg ctcaattctc agtgcatcag agaggaagca tctcgctgca 8400
gggagatacc ttgacccttg cttgcagaga agtgcaccaa tggccacctt ggatggaaca 8460
tgctctagct ctatcgagaa tgcgttgacg taggcagcca gtgcctcctt gttctgatct 8520
cgcgcctcga gcatgaaacc tgaacttata gcatccacga gtcagatttt gatgttgaat 8580
tcagttgaat gcagagaatt gtcatgttag tgttacatag aagtggtgaa agttcaagtt 8640
tagttgagtt gatcaccttc tgcatgcatt gttgcagccg agtaggattt tagggctcta 8700
gccttctgca gacatatctc agcgtctctc cagattgaga gactggagta caggtttgca 8760
agtccttgcc atatttcaaa ctcacttaca ctgtcattct gtccct 8806
<210>7
<211>40
<212>DNA
<213>人工序列
<400>7
ttttggcgcg ccgccagtgt ccaccaacca cagtaataaa 40
<210>8
<211>40
<212>DNA
<213>人工序列
<400>8
ttttggcgcg ccgaacagcc tgaggaagca gaacatcgtc 40
<210>9
<211>42
<212>DNA
<213>人工序列
<400>9
ttttggcgcg ccgcaagatc agtaccatca cgagtaatag ca 42
<210>10
<211>42
<212>DNA
<213>人工序列
<400>10
ttttggcgcg ccagggacag aatgacagtg taagtgagtt tg 42
<210>11
<211>23
<212>DNA
<213>人工序列
<400>11
caccttgagc tgtccggcaa gtt 23
<210>12
<211>21
<212>DNA
<213>人工序列
<400>12
cagtgacgat gccatcaaca a 21
<210>13
<211>33
<212>DNA
<213>人工序列
<400>13
tttactagtg gtacctcagc ggtatccgtg gtg 33
<210>14
<211>34
<212>DNA
<213>人工序列
<400>14
tttgagctcg gatccgcgcc tgaggtgagg ttct 34
<210>15
<211>33
<212>DNA
<213>人工序列
<400>15
tttactagtg gtacctcacc gatgacgagt ccc 33
<210>16
<211>33
<212>DNA
<213>人工序列
<400>16
tttgagctcg gatcctgcca gtgtccacca acc 33
<210>17
<211>35
<212>DNA
<213>人工序列
<400>17
tttactagtg gtacctggcc agatctccca atcgt 35
<210>18
<211>35
<212>DNA
<213>人工序列
<400>18
tttgagctcg gatcctgtgc ctgtgcaggc tcagt 35
<210>19
<211>35
<212>DNA
<213>人工序列
<400>19
tttactagtg gtaccttcaa tagctgagaa gtgcc 35
<210>20
<211>35
<212>DNA
<213>人工序列
<400>20
tttgagctcg gatccccaaa gtaaccttct gcttc 35
<210>21
<211>23
<212>DNA
<213>人工序列
<400>21
gaagctctga tcaacggtat tcc 23
<210>22
<211>23
<212>DNA
<213>人工序列
<400>22
tctttgatca tcttcgggat acg 23
<210>23
<211>23
<212>DNA
<213>人工序列
<400>23
tgaacttcca cgattggatc cac 23
<210>24
<211>25
<212>DNA
<213>人工序列
<400>24
acatggttct tgaatacatc atgtc 25
<210>25
<211>20
<212>DNA
<213>人工序列
<400>25
gacgtcgtga agaaagaagc 20
<210>26
<211>21
<212>DNA
<213>人工序列
<400>26
caatgttttc actgctcccg t 21
<210>27
<211>12
<212>PRT
<213>稻属水稻(Orysa sativa L.)
<400>27
Gly Leu Pro Gly Gly Gly Lys Thr Thr Val Ala Arg
1 5 10
<210>28
<211>11
<212>PRT
<213>稻属水稻(Orysa sativa L.)
<400>28
Lys Lys Tyr Leu Ile Val Ile Asp Asp Ile Trp
1 5 10
<210>29
<211>15
<212>PRT
<213>稻属水稻(Orysa sativa L.)
<400>29
Asp Leu Gly Gly Arg Ile Ile Met Thr Thr Arg Leu Asn Ser Ile
1 5 10 15
<210>30
<211>11
<212>PRT
<213>稻属水稻(Orysa sativa L.)
<400>30
Gly Phe Gly Gly Val Gly Lys Thr Thr Ile Ala
1 5 10
<210>31
<211>11
<212>PRT
<213>稻属水稻(Orysa sativa L.)
<400>31
Lys Ser Tyr Ile Leu Leu Ile Asp Asp Ile Trp
1 5 10
<210>32
<211>13
<212>PRT
<213>稻属水稻(Orysa sativa L.)
<400>32
Gly Gly Arg Ile Ile Val Thr Thr Arg Phe Gln Ala Val
1 5 10
Claims (6)
1.稻瘟病抗性基因Pik-p,其编码如SEQ ID NO.3所示的氨基酸序列。
2.如权利要求1所述的基因,其特征在于,其核苷酸序列如SEQID NO.1所示。
3.权利要求1或2所述基因的cDNA序列。
4.权利要求1或2所述基因编码的蛋白。
5.由权利要求1或2所述基因或权利要求3所述cDNA与启动子构建的表达盒。
6.含有权利要求1或2所述基因或权利要求3所述cDNA的表达载体。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 200910236466 CN102041262B (zh) | 2009-10-22 | 2009-10-22 | 稻瘟病抗性基因Pik-p及其应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 200910236466 CN102041262B (zh) | 2009-10-22 | 2009-10-22 | 稻瘟病抗性基因Pik-p及其应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102041262A CN102041262A (zh) | 2011-05-04 |
CN102041262B true CN102041262B (zh) | 2012-12-19 |
Family
ID=43907810
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 200910236466 Expired - Fee Related CN102041262B (zh) | 2009-10-22 | 2009-10-22 | 稻瘟病抗性基因Pik-p及其应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102041262B (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110904124A (zh) * | 2019-10-25 | 2020-03-24 | 华南农业大学 | 稻瘟病菌无毒基因AvrPit及其应用 |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102618533A (zh) * | 2012-03-08 | 2012-08-01 | 华南农业大学 | 稻瘟病抗性基因Pik-m功能特异性分子标记PikmFNP及其方法与应用 |
CN108588087B (zh) * | 2018-05-16 | 2022-06-03 | 南京农业大学 | 一种提高植物抗病性的基因GmLecRK-R及其应用 |
CN108977568B (zh) * | 2018-08-22 | 2021-11-09 | 福建省农业科学院生物技术研究所 | 一种稻瘟病抗性基因Pik-p功能特异性分子标记及其应用 |
CN113744800B (zh) * | 2021-06-09 | 2022-06-24 | 华南农业大学 | 一套具有包容性且精准鉴别并挖掘稻瘟病Pik抗病等位基因家族的技术体系及应用和分子标记 |
CN116640774B (zh) * | 2023-06-14 | 2024-02-09 | 江苏省农业科学院 | 水稻稻瘟病抗性基因Pik-W25及其应用 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1821406A (zh) * | 2006-03-06 | 2006-08-23 | 华南农业大学 | 水稻稻瘟病抗性基因Pi36及其应用 |
CN1844393A (zh) * | 2006-03-06 | 2006-10-11 | 华南农业大学 | 水稻稻瘟病抗性基因Pi37及其应用 |
-
2009
- 2009-10-22 CN CN 200910236466 patent/CN102041262B/zh not_active Expired - Fee Related
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1821406A (zh) * | 2006-03-06 | 2006-08-23 | 华南农业大学 | 水稻稻瘟病抗性基因Pi36及其应用 |
CN1844393A (zh) * | 2006-03-06 | 2006-10-11 | 华南农业大学 | 水稻稻瘟病抗性基因Pi37及其应用 |
Non-Patent Citations (2)
Title |
---|
Ashikawa I等.GenBank:AB462256.1.《GenBank》.2009, * |
Wang L等.Characterization of rice blast resistance genes in the Pik cluster and fine mapping of the Pik-p locus.《Phytopathology》.2009,第99卷(第8期),900-905. * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110904124A (zh) * | 2019-10-25 | 2020-03-24 | 华南农业大学 | 稻瘟病菌无毒基因AvrPit及其应用 |
Also Published As
Publication number | Publication date |
---|---|
CN102041262A (zh) | 2011-05-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102094027B (zh) | 稻瘟病抗性基因Pi7及其应用 | |
Fudal et al. | Heterochromatin-like regions as ecological niches for avirulence genes in the Leptosphaeria maculans genome: map-based cloning of AvrLm6 | |
Chen et al. | A Pid3 allele from rice cultivar Gumei2 confers resistance to Magnaporthe oryzae | |
CA2985273C (en) | Late blight resistance genes and methods | |
CN102041262B (zh) | 稻瘟病抗性基因Pik-p及其应用 | |
CN1555414B (zh) | 来源于植物的抗性基因 | |
BR112015007209B1 (pt) | Método de prevenção, redução ou retardo de infecção por phakopsora em plantas de soja, construção de vetor recombinante, método de produção de plantas transgênicas de soja e método de cultivo de plantas resistentes a ferrugem | |
CN101906427A (zh) | 稻瘟病抗性基因Pi1及其应用 | |
CN100569947C (zh) | 水稻稻瘟病抗性基因Pi37及其应用 | |
WO2014127835A1 (en) | Plant-derived resistance gene | |
EP1654276B1 (en) | Fungus resistant plants and their uses | |
KR100990370B1 (ko) | 벼 도열병균에 대한 내성을 증진시키는 유전자 및 이의용도 | |
KR20090038871A (ko) | 개선된 병원체 내성을 갖는 식물체의 생성 | |
CN102051368B (zh) | 稻瘟病抗性基因Pik及其应用 | |
AU2003226098B2 (en) | Generation of plants with improved pathogen resistance | |
US20060143734A1 (en) | Nucleic acids from rice conferring resistance to bacterial blight disease caused by xanthomonas spp | |
CN100556916C (zh) | 水稻稻瘟病抗性基因Pi36及其应用 | |
CN102021176A (zh) | 稻瘟病抗性基因Pik-h及其应用 | |
CN101050232B (zh) | 水稻稻瘟病抗性基因Pi15及其应用 | |
US20040006788A1 (en) | Procedures and materials for conferring disease resistance in plants | |
EP1516050B1 (en) | Generation of plants with improved pathogen resistance | |
CA2216406A1 (en) | Plant pathogen resistance genes and uses thereof | |
Qu et al. | The Broad-Spectrum Blast Resistance Gene Pi9 Encodes an NBS-LRR Protein and is a Member of a Multigene Family in Rice | |
Yang | Map-based Cloning of an Anthracnose Resistance Gene in Medicago truncatula | |
Lin et al. | The Blast Resistance Gene Pi37 Encodes an NBS-LRR Protein and is a Member of |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20121219 |
|
CF01 | Termination of patent right due to non-payment of annual fee |