CN110791487B - Rice receptor kinase gene LOC _ Os11g47290, and coding protein and application thereof - Google Patents
Rice receptor kinase gene LOC _ Os11g47290, and coding protein and application thereof Download PDFInfo
- Publication number
- CN110791487B CN110791487B CN201911164723.4A CN201911164723A CN110791487B CN 110791487 B CN110791487 B CN 110791487B CN 201911164723 A CN201911164723 A CN 201911164723A CN 110791487 B CN110791487 B CN 110791487B
- Authority
- CN
- China
- Prior art keywords
- os11g47290
- loc
- sequence
- rice
- protein
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 120
- 235000007164 Oryza sativa Nutrition 0.000 title claims abstract description 77
- 235000009566 rice Nutrition 0.000 title claims abstract description 77
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 68
- 108091005682 Receptor kinases Proteins 0.000 title abstract description 17
- 240000007594 Oryza sativa Species 0.000 title 1
- 241000209094 Oryza Species 0.000 claims abstract description 78
- 108091033409 CRISPR Proteins 0.000 claims abstract description 52
- 238000000034 method Methods 0.000 claims abstract description 33
- 230000001580 bacterial effect Effects 0.000 claims abstract description 24
- 238000010354 CRISPR gene editing Methods 0.000 claims abstract description 11
- 150000007523 nucleic acids Chemical class 0.000 claims description 43
- 108020004707 nucleic acids Proteins 0.000 claims description 36
- 102000039446 nucleic acids Human genes 0.000 claims description 36
- 239000002773 nucleotide Substances 0.000 claims description 24
- 125000003729 nucleotide group Chemical group 0.000 claims description 24
- 108020004414 DNA Proteins 0.000 claims description 20
- 108091027544 Subgenomic mRNA Proteins 0.000 claims description 18
- 238000010362 genome editing Methods 0.000 claims description 17
- 230000009261 transgenic effect Effects 0.000 claims description 14
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 10
- 230000002401 inhibitory effect Effects 0.000 claims description 9
- 239000000126 substance Substances 0.000 claims description 7
- 102000053602 DNA Human genes 0.000 claims description 6
- 108020001507 fusion proteins Proteins 0.000 claims description 3
- 102000037865 fusion proteins Human genes 0.000 claims description 3
- 238000004519 manufacturing process Methods 0.000 claims description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 2
- 108091026890 Coding region Proteins 0.000 claims description 2
- 230000005764 inhibitory process Effects 0.000 claims description 2
- 230000002452 interceptive effect Effects 0.000 claims description 2
- 230000009467 reduction Effects 0.000 claims description 2
- 241000196324 Embryophyta Species 0.000 abstract description 80
- 208000035240 Disease Resistance Diseases 0.000 abstract description 17
- 238000005516 engineering process Methods 0.000 abstract description 13
- 230000008827 biological function Effects 0.000 abstract description 5
- 241000589652 Xanthomonas oryzae Species 0.000 abstract description 4
- 238000009395 breeding Methods 0.000 abstract description 4
- 230000001488 breeding effect Effects 0.000 abstract description 4
- 201000010099 disease Diseases 0.000 abstract description 4
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 4
- 230000003902 lesion Effects 0.000 abstract description 3
- 238000012271 agricultural production Methods 0.000 abstract description 2
- 230000001276 controlling effect Effects 0.000 abstract description 2
- 230000001105 regulatory effect Effects 0.000 abstract description 2
- 230000006870 function Effects 0.000 abstract 1
- 239000013604 expression vector Substances 0.000 description 22
- 238000003259 recombinant expression Methods 0.000 description 22
- 239000013598 vector Substances 0.000 description 17
- 238000010276 construction Methods 0.000 description 10
- 230000000694 effects Effects 0.000 description 9
- 108010050848 glycylleucine Proteins 0.000 description 9
- 230000035772 mutation Effects 0.000 description 9
- 150000001413 amino acids Chemical group 0.000 description 8
- 206010020649 Hyperkeratosis Diseases 0.000 description 7
- 239000013612 plasmid Substances 0.000 description 7
- 241000894006 Bacteria Species 0.000 description 6
- 238000012408 PCR amplification Methods 0.000 description 6
- 238000001976 enzyme digestion Methods 0.000 description 6
- 238000012163 sequencing technique Methods 0.000 description 6
- 241001272684 Xanthomonas campestris pv. oryzae Species 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 5
- 239000001963 growth medium Substances 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 238000012216 screening Methods 0.000 description 5
- 230000009466 transformation Effects 0.000 description 5
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 4
- OJOBTAOGJIWAGB-UHFFFAOYSA-N acetosyringone Chemical compound COC1=CC(C(C)=O)=CC(OC)=C1O OJOBTAOGJIWAGB-UHFFFAOYSA-N 0.000 description 4
- 230000003321 amplification Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 239000012634 fragment Substances 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 238000007857 nested PCR Methods 0.000 description 4
- 238000003199 nucleic acid amplification method Methods 0.000 description 4
- 241000589158 Agrobacterium Species 0.000 description 3
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 3
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 3
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- 206010064571 Gene mutation Diseases 0.000 description 3
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 3
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 3
- 230000033228 biological regulation Effects 0.000 description 3
- 210000004027 cell Anatomy 0.000 description 3
- 238000012258 culturing Methods 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 239000003112 inhibitor Substances 0.000 description 3
- 210000001161 mammalian embryo Anatomy 0.000 description 3
- 230000001717 pathogenic effect Effects 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 239000000047 product Substances 0.000 description 3
- 210000001938 protoplast Anatomy 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 108010061238 threonyl-glycine Proteins 0.000 description 3
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 2
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 2
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 2
- 244000063299 Bacillus subtilis Species 0.000 description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 2
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 2
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 2
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 2
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 2
- 241000588746 Raoultella planticola Species 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 2
- KIMOCKLJBXHFIN-YLVFBTJISA-N Trp-Ile-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O)=CNC2=C1 KIMOCKLJBXHFIN-YLVFBTJISA-N 0.000 description 2
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 230000008929 regeneration Effects 0.000 description 2
- 238000011069 regeneration method Methods 0.000 description 2
- 238000002791 soaking Methods 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- NKDFYOWSKOHCCO-YPVLXUMRSA-N 20-hydroxyecdysone Chemical compound C1[C@@H](O)[C@@H](O)C[C@]2(C)[C@@H](CC[C@@]3([C@@H]([C@@](C)(O)[C@H](O)CCC(C)(O)C)CC[C@]33O)C)C3=CC(=O)[C@@H]21 NKDFYOWSKOHCCO-YPVLXUMRSA-N 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- WJRXVTCKASUIFF-FXQIFTODSA-N Ala-Cys-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WJRXVTCKASUIFF-FXQIFTODSA-N 0.000 description 1
- IXTPACPAXIOCRG-ACZMJKKPSA-N Ala-Glu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N IXTPACPAXIOCRG-ACZMJKKPSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- LTSBJNNXPBBNDT-HGNGGELXSA-N Ala-His-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)O LTSBJNNXPBBNDT-HGNGGELXSA-N 0.000 description 1
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- OTUQSEPIIVBYEM-IHRRRGAJSA-N Arg-Asn-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OTUQSEPIIVBYEM-IHRRRGAJSA-N 0.000 description 1
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- YBIAYFFIVAZXPK-AVGNSLFASA-N Arg-His-Arg Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YBIAYFFIVAZXPK-AVGNSLFASA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 1
- JOADBFCFJGNIKF-GUBZILKMSA-N Arg-Met-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O JOADBFCFJGNIKF-GUBZILKMSA-N 0.000 description 1
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 1
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 1
- XOZYYXMHMIEJET-XIRDDKMYSA-N Arg-Trp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XOZYYXMHMIEJET-XIRDDKMYSA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 1
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 1
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 1
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 1
- IKLAUGBIDCDFOY-SRVKXCTJSA-N Asn-His-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IKLAUGBIDCDFOY-SRVKXCTJSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 1
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- NRIFEOUAFLTMFJ-AAEUAGOBSA-N Asp-Gly-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NRIFEOUAFLTMFJ-AAEUAGOBSA-N 0.000 description 1
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- 208000035143 Bacterial infection Diseases 0.000 description 1
- MWVDDZUTWXFYHL-XKBZYTNZSA-N Cys-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O MWVDDZUTWXFYHL-XKBZYTNZSA-N 0.000 description 1
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 1
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 1
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 1
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 1
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- XOFYVODYSNKPDK-AVGNSLFASA-N Glu-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOFYVODYSNKPDK-AVGNSLFASA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- UERORLSAFUHDGU-AVGNSLFASA-N Glu-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UERORLSAFUHDGU-AVGNSLFASA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- HGJREIGJLUQBTJ-SZMVWBNQSA-N Glu-Trp-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O HGJREIGJLUQBTJ-SZMVWBNQSA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- PCPOYRCAHPJXII-UWVGGRQHSA-N Gly-Lys-Met Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PCPOYRCAHPJXII-UWVGGRQHSA-N 0.000 description 1
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 1
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 1
- ZZJVYSAQQMDIRD-UWVGGRQHSA-N Gly-Pro-His Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ZZJVYSAQQMDIRD-UWVGGRQHSA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 1
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- IHDKKJVBLGXLEL-STQMWFEESA-N Gly-Tyr-Met Chemical compound CSCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)CN)C(O)=O IHDKKJVBLGXLEL-STQMWFEESA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- FPNWKONEZAVQJF-GUBZILKMSA-N His-Asn-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FPNWKONEZAVQJF-GUBZILKMSA-N 0.000 description 1
- LDTJBEOANMQRJE-CIUDSAMLSA-N His-Cys-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LDTJBEOANMQRJE-CIUDSAMLSA-N 0.000 description 1
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 1
- MFQVZYSPCIZFMR-MGHWNKPDSA-N His-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N MFQVZYSPCIZFMR-MGHWNKPDSA-N 0.000 description 1
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 1
- WSSGUVAKYCQSCT-XUXIUFHCSA-N Ile-Met-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)O)N WSSGUVAKYCQSCT-XUXIUFHCSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- XHBYEMIUENPZLY-GMOBBJLQSA-N Ile-Pro-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O XHBYEMIUENPZLY-GMOBBJLQSA-N 0.000 description 1
- BJECXJHLUJXPJQ-PYJNHQTQSA-N Ile-Pro-His Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N BJECXJHLUJXPJQ-PYJNHQTQSA-N 0.000 description 1
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 1
- XOZOSAUOGRPCES-STECZYCISA-N Ile-Pro-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XOZOSAUOGRPCES-STECZYCISA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- AXZGZMGRBDQTEY-SRVKXCTJSA-N Leu-Gln-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O AXZGZMGRBDQTEY-SRVKXCTJSA-N 0.000 description 1
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 1
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 1
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 1
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 description 1
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- KZOHPCYVORJBLG-AVGNSLFASA-N Lys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N KZOHPCYVORJBLG-AVGNSLFASA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 1
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- QXEVZBXTDTVPCP-GMOBBJLQSA-N Met-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCSC)N QXEVZBXTDTVPCP-GMOBBJLQSA-N 0.000 description 1
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 1
- ZMYHJISLFYTQGK-FXQIFTODSA-N Met-Asp-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMYHJISLFYTQGK-FXQIFTODSA-N 0.000 description 1
- STTRPDDKDVKIDF-KKUMJFAQSA-N Met-Glu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 STTRPDDKDVKIDF-KKUMJFAQSA-N 0.000 description 1
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 1
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 240000008467 Oryza sativa Japonica Group Species 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 1
- LNIIRLODKOWQIY-IHRRRGAJSA-N Phe-Asn-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LNIIRLODKOWQIY-IHRRRGAJSA-N 0.000 description 1
- FSPGBMWPNMRWDB-AVGNSLFASA-N Phe-Cys-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FSPGBMWPNMRWDB-AVGNSLFASA-N 0.000 description 1
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 1
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 1
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 1
- SRILZRSXIKRGBF-HRCADAONSA-N Phe-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N SRILZRSXIKRGBF-HRCADAONSA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- ALJGSKMBIUEJOB-FXQIFTODSA-N Pro-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 ALJGSKMBIUEJOB-FXQIFTODSA-N 0.000 description 1
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- ZJXXCGZFYQQETF-CYDGBPFRSA-N Pro-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 ZJXXCGZFYQQETF-CYDGBPFRSA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 1
- 108091081021 Sense strand Proteins 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- YMAWDPHQVABADW-CIUDSAMLSA-N Ser-Gln-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O YMAWDPHQVABADW-CIUDSAMLSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 1
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 1
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- PUEWAXRPXOEQOW-HJGDQZAQSA-N Thr-Met-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O PUEWAXRPXOEQOW-HJGDQZAQSA-N 0.000 description 1
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 1
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 1
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 1
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- IOETTZIEIBVWBZ-GUBZILKMSA-N Val-Met-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N IOETTZIEIBVWBZ-GUBZILKMSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 241000746966 Zizania Species 0.000 description 1
- 235000002636 Zizania aquatica Nutrition 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 208000022362 bacterial infectious disease Diseases 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000012881 co-culture medium Substances 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 230000005782 double-strand break Effects 0.000 description 1
- 238000012214 genetic breeding Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 238000005286 illumination Methods 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 230000010355 oscillation Effects 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 230000002028 premature Effects 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- JQXXHWHPUNPDRT-WLSIYKJHSA-N rifampicin Chemical compound O([C@](C1=O)(C)O/C=C/[C@@H]([C@H]([C@@H](OC(C)=O)[C@H](C)[C@H](O)[C@H](C)[C@@H](O)[C@@H](C)\C=C\C=C(C)/C(=O)NC=2C(O)=C3C([O-])=C4C)C)OC)C4=C1C3=C(O)C=2\C=N\N1CC[NH+](C)CC1 JQXXHWHPUNPDRT-WLSIYKJHSA-N 0.000 description 1
- 229960001225 rifampicin Drugs 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- SUKJFIGYRHOWBL-UHFFFAOYSA-N sodium hypochlorite Chemical compound [Na+].Cl[O-] SUKJFIGYRHOWBL-UHFFFAOYSA-N 0.000 description 1
- 238000009331 sowing Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
- C12N15/8218—Antisense, co-suppression, viral induced gene silencing [VIGS], post-transcriptional induced gene silencing [PTGS]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8279—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance
- C12N15/8281—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance for bacterial resistance
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Medicinal Chemistry (AREA)
- Virology (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The invention discloses a rice receptor kinase gene LOC _ Os11g47290, and a coding protein and application thereof. The invention finds the application of the rice receptor kinase gene LOC _ Os11g47290 and the coding protein thereof in regulating and controlling the disease resistance of plants, and the resistance of rice to bacterial blight can be obviously improved by destroying the biological function of the coding protein of the LOC _ Os11g47290 gene. The invention realizes efficient LOC _ Os11g47290 gene site-directed knockout by using CRISPR/Cas9 technology, and the lesion length is obviously shortened after rice is subjected to site-directed knockout and inoculated with Xanthomonas oryzae GD1358 and V. The new function of the LOC _ Os11g47290 gene provided by the invention provides a new method for disease-resistant breeding of plants, and has very important application value in agricultural production.
Description
Technical Field
The invention relates to the technical field of plant genetic engineering, in particular to a rice receptor kinase gene LOC _ Os11g47290, and a coding protein and application thereof.
Background
Bacterial blight caused by Xanthomonas oryzae pv. oryzae is an important bacterial disease restricting rice production, and has serious harm to rice planting industry, and can generally reduce the yield of rice by about 20-30% and seriously reach 50%. The most economic and effective measure for preventing and treating the bacterial blight of rice is to culture and plant disease-resistant varieties by using the resistance genes. However, most of the currently reported 44 rice bacterial leaf blight resistance genes/loci (http:// www.shigen.nig.ac.jp/rice/oryzae base/gene/list) show the problems of narrow resistance spectrum or difficult utilization, and only the genes Xa3, Xa4, Xa21, Xa23 and the like are widely applied in production.
Since Xanthomonas oryzae rice pathogenic varieties are easy to mutate, the resistance of varieties is easy to lose due to the co-evolution of rice and Xanthomonas oryzae (Xanthomonas oryzae rice pathogenic varieties). Therefore, the disease resistance of rice varieties is improved by identifying and knocking out the bacterial leaf blight susceptibility gene, and the method has important application value for rice disease resistance breeding.
Disclosure of Invention
In order to solve the problems in the prior art, the invention aims to provide rice receptor kinase LOC _ Os11g47290 and application of a protein coded by the same in regulation and control of plant disease resistance.
The invention provides application of any one of the following substances A-C in regulation and control of plant disease resistance;
A. LOC _ Os11g47290 protein;
B. a nucleic acid encoding the LOC _ Os11g47290 protein;
C. an expression cassette, a recombinant vector or a recombinant microorganism comprising said nucleic acid.
The invention also provides application of substances shown as b1 or b2 in improving the disease resistance of plants: b1, substances that inhibit or reduce the activity or content of LOC _ Os11g47290 protein in plants; b2, a substance that inhibits or reduces expression in a plant of a nucleic acid encoding a LOC _ Os11g47290 protein.
In the above use, the LOC _ Os11g47290 protein is any one of (a1) to (a4) below:
(a1) Protein shown as a sequence 1 in a sequence table;
(a2) a fusion protein obtained by attaching a tag to the N-terminus or/and the C-terminus of the protein of (a 1);
(a3) protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues in the (a1) and is related to plant disease resistance;
(a4) and (b) a protein which has 98% or more identity to (a1) and is involved in plant disease resistance.
In the above application, the nucleic acid encoding LOC _ Os11g47290 protein is any one of the following (b1) - (b 3):
(b1) the coding region is a DNA molecule shown in a sequence 2 in a sequence table;
(b2) a DNA molecule having 95% or more identity to (b1) and encoding said protein;
(b3) a DNA molecule which hybridizes with the nucleotide sequence defined in any one of (b1) or (b2) under stringent conditions and encodes the protein.
The amino acid sequence shown as the sequence 1 is a coding protein sequence of the rice LOC _ Os11g47290 gene, and a person skilled in the art can substitute, delete and/or add one or more amino acids according to the amino acid sequence disclosed by the invention, conservative substitution of the amino acids and other conventional technical means in the field without influencing the activity of the amino acid sequence, so as to obtain a mutant with the same activity as the coding protein of the rice LOC _ Os11g47290 gene disclosed by the invention.
The nucleotide sequence shown in the sequence 2 is the nucleotide sequence of the rice LOC _ Os11g47290 gene. The rice LOC _ Os11g47290 gene provided by the invention can be any nucleotide sequence capable of coding the coding protein of the rice LOC _ Os11g47290 gene. Considering the degeneracy of codons and the preference of codons of different species, the skilled person can use codons suitable for the expression of a particular species as required.
In the application, the disease resistance is bacterial blight resistance.
In the above application, the substance represented by b1 or b2 is LOC _ Os11g47290 protein or an inhibitor of a nucleic acid encoded by LOC _ Os11g47290 protein, and the inhibitor can be a nucleic acid, and can also exist in the form of an expression cassette, a vector, a recombinant bacterium or a host cell containing the nucleic acid; the inhibition factors are specifically as follows:
1) interfering RNA;
2) CRISPR/Cas9 system;
in the CRISPR/Cas9 system, the target sequence of the sgRNA is a nucleotide sequence in a form of XXXGG in the nucleic acid encoding the LOC _ Os11g47290 protein, wherein XXX is a nucleic acid sequence of any 19-20bp in the nucleic acid encoding the LOC _ Os11g47290 protein, and N is any one base in A, T, G, C.
The CRISRP/Cas9 system can cut the XXXNGG form nucleotide sequence in the rice LOC _ Os11g47290 gene at the upstream 3-4bp of NGG to generate DNA double-strand break, thereby introducing the insertion deletion of the nucleotide sequence, further causing the translation of the gene to terminate in advance or the protein conformation to change, and finally destroying the biological function of the coding protein of the gene; wherein XXXNGG is nucleotide sequence, wherein XXX is nucleic acid sequence 19-20bp, N is any one base of A, T, G, C.
Preferably, XXX is a nucleic acid sequence of any 19-20bp in the first exon in a nucleic acid (genome) encoding a LOC _ Os11g47290 protein; in embodiments of the invention, more preferably, the sgRNA is sgRNA1 and/or sgRNA 2; the target sequence of the sgRNA1 is sequence 3 (from 367 th to 386 th of the rice LOC _ Os11g47290 gene); the target sequence of the sgRNA2 is sequence 4 (from 952 th to 971 th of the rice LOC _ Os11g47290 gene).
In embodiments of the invention, the pYLCRISPR/Cas9 system includes any one of the following recombinant vectors (CRISRP/Cas9 gene editing plasmid):
the recombinant expression vector pYLCRISPR/Cas9Pubi-H-47290-T1 contains genes encoding U6a-47290-sgRNA1 and Cas9, U6a-47290-sgRNA1 has a target sequence region consisting of 20 nucleotides, and the corresponding target sequence of the target sequence region on LOC _ Os11g47290 gene is 47290-T1 shown in sequence 3; the specific construction method of the recombinant expression vector is shown in the embodiment;
the recombinant expression vector pYLCRISPR/Cas9Pubi-H-47290-T2 contains genes encoding U6b-47290-sgRNA2 and Cas9, U6b-47290-sgRNA2 has a target sequence region consisting of 20 nucleotides, and the corresponding target sequence of the target sequence region on LOC _ Os11g47290 gene is 47290-T2 shown in sequence 4; the specific construction method of the recombinant expression vector is shown in the examples.
The recombinant expression vector pYLCRISPR/Cas9Pubi-H-47290 contains U6a-47290-sgRNA-1, U6b-47290-sgRNA-2 and Cas9 encoding genes, U6a-47290-sgRNA-1 has a target sequence region consisting of 20 nucleotides, and a target sequence corresponding to the target sequence region on LOC _ Os11g47290 gene is 47290-T1 shown in sequence 3; u6b-47290-sgRNA-2 has a target sequence binding region consisting of 20 nucleotides, and the target sequence region corresponds to a target sequence shown as sequence 4 in sequence 47290-T2 on LOC _ Os11g47290 gene; the specific construction method of the recombinant expression vector is shown in the examples.
In the application, the genetic breeding is to construct a transgenic plant resistant to bacterial blight.
The improvement of the disease resistance can be expressed as a reduction in the length of lesion of bacterial blight of plants.
Another object of the present invention is to provide a method for improving disease resistance of plants, which is any one of the following methods 1) to 3), and the method can improve the disease resistance of plants;
1) the method comprises the following steps: inhibiting or reducing the activity or content of LOC _ Os11g47290 protein in a target plant;
2) the method comprises the following steps: inhibiting or reducing expression of a nucleic acid encoding a LOC _ Os11g47290 protein in a plant of interest;
3) the method comprises the following steps: performing gene editing on a nucleic acid encoding the LOC _ Os11g47290 protein in a target plant;
The invention also provides a method for preparing the transgenic plant with high disease resistance, which is any one of the following methods 1) to 3),
1) the method comprises the following steps: inhibiting or reducing the activity or content of LOC _ Os11g47290 protein in a target plant to obtain a transgenic plant;
2) the method comprises the following steps: inhibiting or reducing the expression of LOC _ Os11g47290 protein coding nucleic acid in a target plant to obtain a transgenic plant;
3) the method comprises the following steps: carrying out gene editing on LOC _ Os11g47290 protein coding nucleic acid in a target plant to obtain a transgenic plant;
the disease resistance in the transgenic plant is higher than that of the target plant;
the target plant is a wild-type plant or a recipient plant.
The method also comprises the following steps: the plant with the edited LOC _ Os11g47290 protein coding nucleic acid is selected from the transgenic plant or the plant after gene editing, and is the transgenic plant with high disease resistance.
The method for selecting a plant with an edited LOC _ Os11g47290 protein-encoding nucleic acid from a transgenic plant or a gene-edited plant comprises 1) directly amplifying a vector fragment containing a nucleic acid inhibitor encoding the LOC _ Os11g47290 protein; 2) and amplifying and sequencing the edited genome segment containing the LOC _ Os11g47290 protein coding nucleic acid target sequence in the plant.
In the embodiment of the invention, the amplification primers used in the amplified and sequenced edited genome segment containing the LOC _ Os11g47290 protein coding nucleic acid target sequence in the plant are specifically a primer consisting of a single-stranded DNA molecule shown in a sequence 11 and a single-stranded DNA molecule shown in a sequence 12.
In the above method, the disease resistance is resistance to bacterial blight.
In the above method, the inhibiting or reducing the activity or content of LOC _ Os11g47290 protein in the plant, or the inhibiting or reducing the expression of a nucleic acid encoding LOC _ Os11g47290 protein in the plant is effected by gene editing of a nucleic acid encoding LOC _ Os11g47290 protein.
In the method, the gene editing is realized by means of a CRISPR/Cas9 system;
in the CRISPR/Cas9 system, the target sequence of the sgRNA is a nucleotide sequence in a form of XXXGG in the nucleic acid for coding the LOC _ Os11g47290 protein, wherein XXX is a nucleic acid sequence of any 19-20bp in the nucleic acid for coding the LOC _ Os11g47290 protein, and N is any one base in A, T, G, C;
preferably, XXX is a nucleic acid sequence of any 19-20bp in the first exon in a nucleic acid (genome) encoding a LOC _ Os11g47290 protein; in embodiments of the invention, more preferably, the sgRNA is sgRNA1 and/or sgRNA 2; the target sequence of the sgRNA1 is sequence 3 (from 367 th to 386 th of the rice LOC _ Os11g47290 gene); the target sequence of the sgRNA2 is sequence 4 (from 952 th to 971 th of the rice LOC _ Os11g47290 gene).
In the method, gene editing specifically comprises the steps of firstly constructing a CRISRP/Cas9 gene editing plasmid containing a sgRNA target sequence binding region shown in sequence 3 or/and sequence 4, and then transferring the CRISRP/Cas9 gene editing plasmid into a plant to realize gene editing. Wherein, the CRISRP/Cas9 gene editing plasmid is a II type CRISPR system. In embodiments of the invention, the pYLCISPR/Cas 9 system includes any one of the following recombinant vectors (pYLCISRP/Cas 9 gene editing plasmid): the recombinant expression vector pYLCISPR/Cas 9Pubi-H-47290-T1, the recombinant expression vector pYLCISPR/Cas 9Pubi-H-47290-T2 or the recombinant expression vector pYLCISPR/Cas 9 Pubi-H-47290.
Still another object of the present invention is to provide a specific sgRNA or an expression cassette, a vector, a host cell, an engineered bacterium or a transgenic plant cell line containing a gene encoding the sgRNA.
The specific sgRNA provided by the invention is sgRNA1 and/or sgRNA 2;
the target sequence of the sgRNA1 is a sequence 3; the target sequence of the sgRNA2 is sequence 4.
In the present invention, the plant may be a monocotyledonous plant or a dicotyledonous plant, preferably a host plant of the species blight bacterium, including but not limited to rice.
Experiments prove that the rice plant with LOC _ Os11g47290 gene mutation can be efficiently obtained by using the sgRNA to perform CRISRP/Cas 9-mediated gene editing, and the biological function of the encoding protein of the rice LOC _ Os11g47290 gene can be damaged by insertion or deletion from 367 th to 386 th or/and from 952 th to 971 th of the sequence shown in the sequence 2, so that the rice shows the property of improving the bacterial leaf blight resistance level, and the breeding efficiency of disease-resistant plants based on the LOC _ Os11g47290 gene mutation is effectively improved.
The invention has the beneficial effects that:
(1) the invention discovers that the rice receptor kinase gene LOC _ Os11g47290 and the coding protein thereof participate in regulating and controlling the immune response of rice to bacterial blight bacteria, and the resistance of the rice to the bacterial blight can be obviously improved by destroying the biological function of the coding protein of the LOC _ Os11g47290 gene. The experiment proves that: the LOC _ Os11g47290 is knocked out at fixed points to inoculate bacterial blight of rice, the length of inoculated bacterial blight V leaf spots is shortened by 14.4%, and the length of inoculated bacterial blight GD1358 leaf spots is shortened by 48.1%, so that the mutation of a nucleotide sequence of the LOC _ Os11g47290 gene to prepare the rice material for resisting bacterial blight has very important application value in agricultural production.
(2) The invention utilizes CRISRP/Cas9 technology to modify LOC _ Os11g47290 gene in a genome targeting way, and realizes efficient site-directed knockout of LOC _ Os11g 47290. The invention discovers that the site-directed knockout of LOC _ Os11g47290 can be efficiently realized by using nucleotide sequences from 367 th to 386 th or/and from 952 th to 971 th in a rice LOC _ Os11g47290 gene as a target sequence, and the biological function of a protein coded by the rice LOC _ Os11g47290 gene can be damaged by insertion or deletion from 367 th to 386 th or/and from 952 th to 971 th in the sequence shown in a sequence 2, so that the rice shows a trait of improving the bacterial leaf blight resistance level, and the breeding efficiency of disease-resistant plants based on the mutation of the LOC _ Os11g47290 gene is effectively improved.
Drawings
FIG. 1 is a sequencing peak diagram for vector activity detection based on pYRCISPR/Cas 9 technology rice receptor kinase LOC _ Os11g47290 site-directed knockout method provided in example 1 of the present invention.
FIG. 2 shows the mutation type of the nucleotide sequence of LOC _ Os11g47290 gene in Cas9-47290 homozygous mutant plant under Nipponbare background of rice provided in example 2 of the present invention; underlined nucleotide sequences are target sequences and replaced or inserted nucleotide sequences are in black boxes.
FIG. 3 shows the mutation type of the LOC _ Os11g47290 gene amino acid sequence in the Cas9-47290 homozygous mutant plant under Nipponbare background of rice provided in example 2 of the present invention.
FIG. 4 is a statistical chart of lesion phenotype after inoculating Cas9-47290 homozygous mutant plants with Klebsiella planticola GD1358 and Klebsiella planticola V respectively under Nipponbare background of rice provided in example 2 of the present invention, wherein P is < 0.05.
Detailed Description
The experimental procedures used in the following examples are all conventional procedures unless otherwise specified.
Materials, reagents and the like used in the following examples are commercially available unless otherwise specified.
The following examples are illustrated by Nipponbare (Oryza sativa ssp. japonica; non-patent documents describing this material are Yongqing Jian, Yonghong Wang, Dawei Xue, Jung Wang, Meixian Yan, Guifu Liu, Guojun Dong, Dali Zeng, Zefu Lu, Xudong Zhu, Qian Qian and Jianyang Li.Regulation of OsL 14 by OsmiR156 define ideal plant architecture in Nature Genetics, 2010,42, 541-544; publicly available from the Applicant).
Bacterial blight strain V: the resistance response to 5 races of southern east rice blight fungus was described in "Ying Xianhua, Wushang faithful. IRBB21(Xa 21.) plant protection journal, 2002,29(2): 97-100", which was obtained from the applicant after the consent of the first teacher of Guangdong academy of agricultural sciences.
Bacterial blight of rice yue 1358(GD 1358): described in "the research of pathogenic type of bacterial blight of rice in china, plant pathology newspaper, 1990, 20 (2): 81-88 ", the public is available from the applicant.
Example 1 site-directed knockout method of rice receptor kinase gene LOC _ Os11g47290 based on pYLCRISPR/Cas9 system
Sequence analysis and target sequence screening of rice receptor kinase gene LOC _ Os11g47290
The nucleotide sequence of the rice receptor kinase gene LOC _ Os11g47290 is shown as a sequence 2, and the amino acid sequence of the encoded protein is a sequence 1 in a sequence table. Sequence analysis shows that the gene comprises 7 exons, which are respectively the 1 st-1696 th site (first exon), the 2042 nd and 2941 th site (second exon), the 3130 and 3278 th sites (third exon), the 3677 th and 3890 th sites (fourth exon), the 6492 and 6611 th sites (fifth exon), the 6976 and 7019 th sites (sixth exon) and the 7163 and 7171 th sites (seventh exon) of the sequence 2 sequence.
The sequence on the first exon of the rice receptor kinase gene LOC _ Os11g47290 is the 47290-T1 and 47290-T2 target sequences of the site-directed knockout method of the rice receptor kinase gene LOC _ Os11g47290 based on pYLCRISPR/Cas9 system.
Through a large number of screens, the antisense strand from 367 to 386 of the first exon of the rice LOC _ Os11g47290 gene and the sense strand from 952 to 971 of the first exon are targeted by using the pYLCRISPR/Cas9 technology to be used as target sequences 47290-T1 and 47290-T2 respectively, wherein the target sequences are shown as a sequence 3 and a sequence 4.
pYLCRISPR/Cas9 System vectors (including pYLsgRNA-OsU6a, pYLsgRNA-OsU6b, and pYLCRISPR/Cas9Pubi-H vectors) non-patent literature describing this material is "Ma X., Zhang Q., Zhu Q., Liu W., Chen Y., Qiu R., Wang B., Yang Z., Li H., Lin Y., Xie Y., Shen R., Chen S., Wang Z., Cheng Z., Chen Y., Guo J., Chen L., Zhao X., Dong Z., and Liu Y. -G. (2015.) A Robust/85CRISPR/9 System for Con elementary, High-efficiency sample edition single electron oligo meter and plant 1274.8. After the subtropical agriculture biological resource protection of the university of south China's college of Life sciences and the consent of the national focus laboratory Liu dazzling teacher, the public can obtain the carrier from the applicant.
Design of pYLCRISPR/Cas9 system vector primer and construction of recombinant expression vector thereof
1. Design and synthesis of pYLCISPR/Cas 9 technology target sequence primer
Designing a target sequence primer targeting LOC _ Os11g47290 gene based on pYLCRISPR/Cas9 technology, wherein sequences of 47290-T1 target sequence primers 47290-gRT1 and 47290-U6aT1 are shown as sequence 5 and sequence 6 respectively; 47290-T2 target sequences primer 47290-gRT2 and primer 47290-U6bT2 are shown as sequence 7 and sequence 8 respectively;
relevant primers for 47290-T1 and 47290-T2, based on pYLCRISPR/Cas9 technology, were synthesized separately.
Sequence 5: 47290-gRT 1: 5'-TGCAGCCTTCCTATAACACCgttttagagctagaaat-3'
And (3) sequence 6: 47290-U6aT 1: 5'-GGTGTTATAGGAAGGCTGCACggcagccaagccagca-3'
And (3) sequence 7: 47290-gRT 2: 5'-CATGTGCCGGAATGGTTAGCgttttagagctagaaat-3'
And (2) sequence 8: 47290-U6bT 2: 5'-GCTAACCATTCCGGCACATGCaacacaagcggcagca-3' are provided.
2. Construction of pYLCRISPR/Cas9 technical recombinant expression vector
1) Construction of recombinant expression vectors containing 47290-T1 or 47290-T2 alone
The recombinant expression vector pYLCRISPR/Cas9Pubi-H-47290-T1 contains encoding genes of U6a-47290-sgRNA1 and Cas9, U6a-47290-sgRNA1 has a target sequence region consisting of 20 nucleotides, and the corresponding target sequence of the target sequence region on LOC _ Os11g47290 gene is 47290-T1 shown in sequence 3; the specific construction method of the recombinant expression vector is as follows:
Using pYLsgRNA-OsU6a vector as template, using primers UF (5'-CTCCGTTTTACCTGTGGAATCG-3') and 47290-U6aT1 to perform PCR amplification, and naming the correct sequence as U6aT 1; using pYLsgRNA-OsU6a vector as template, using primers gR-R (5'-CGGAGGAAAATTCCATCCAC-3') and 47290-gRT1 to perform PCR amplification, and naming the correct sequence as gRT 1; the two fragments were ligated together by means of nested PCR using primers Pps-GGL (5'-TTCAGAGGTCTCTCTCGACTAGTATGGAATCGGCAGCAAAGG-3') and Pgs-GGR (5'-AGCGTGGGTCTCGACCGACGCGTATCCATCCACTCCAAGCTC-3') and named U6a-47290-sgRNA 1;
then, U6a-47290-sgRNA1 and pYRCISPR/Cas 9Pubi-H are subjected to enzyme digestion by BsaI enzyme, and a vector pYRCISPR/Cas 9Pubi-H-47290-T1 is obtained in a way of enzyme digestion and side ligation.
The recombinant expression vector pYLCRISPR/Cas9Pubi-H-47290-T2 contains genes encoding U6b-47290-sgRNA2 and Cas9, U6b-47290-sgRNA2 has a target sequence region consisting of 20 nucleotides, and the corresponding target sequence of the target sequence region on LOC _ Os11g47290 gene is 47290-T2 shown in sequence 4; the specific construction method of the recombinant expression vector is as follows:
using pYLsgRNA-OsU6b vector as template, using primers UF (5'-CTCCGTTTTACCTGTGGAATCG-3') and 47290-U6bT2 to perform PCR amplification, and naming the correct sequence as U6bT 2; using pYLsgRNA-OsU6b vector as template, using primers gR-R (5'-CGGAGGAAAATTCCATCCAC-3') and 47290-gRT2 to perform PCR amplification, and naming the correct sequence as gRT 2; the two fragments were ligated together by means of nested PCR using primers Pps-GGL (5'-TTCAGAGGTCTCTCTCGACTAGTATGGAATCGGCAGCAAAGG-3') and Pgs-GGR (5'-AGCGTGGGTCTCGACCGACGCGTATCCATCCACTCCAAGCTC-3') and named U6b-47290-sgRNA 2;
Then BsaI enzyme is used for simultaneously carrying out enzyme digestion on the U6b-47290-sgRNA2 and the pYLCRISPR/Cas9Pubi-H vector, and the vector pYLCRISPR/Cas9Pubi-H-47290-T2 is obtained in a mode of enzyme digestion and connection.
2) Construction of recombinant expression vectors containing 47290-T1 and 47290-T2
The pYRCISPR/Cas 9Pubi-H-47290 contains genes encoding U6a-47290-sgRNA-1, U6b-47290-sgRNA-2 and Cas9, U6a-47290-sgRNA-1 has a target sequence region consisting of 20 nucleotides, and the corresponding target sequence of the target sequence region on the LOC _ Os11g47290 gene is 47290-T1 shown in a sequence 3; u6b-47290-sgRNA-2 has a target sequence region of 20 nucleotides, and the target sequence region corresponds to a target sequence of 47290-T2 shown in sequence 4 on LOC _ Os11g47290 gene. The specific construction method comprises the following steps: 1) u6aT1 and gRT1 are connected together by nested PCR by using primers Pps-GGL (5'-TTCAGAGGTCTCTCTCGACTAGTATGGAATCGGCAGCAAAGG-3') and Pgs-GG2(5 ' -AGCGTGGGTCTCGTCAGGGTCCATCCACTCCAAGCTC-3), and are named as U6 a-47290-sgRNA-1; 2) u6bT2 and gRT2 are connected together by nested PCR by using primers Pps-GG2 (5'-TTCAGAGGTCTCTCTGACACTGGAATCGGCAGCAAAGG-3') and Pgs-GGR (5'-AGCGTGGGTCTCGACCGACGCGTATCCATCCACTCCAAGCTC-3'), and are named as U6 b-47290-sgRNA-2; 3) the BsaI enzyme simultaneously carries out enzyme digestion on U6a-47290-sgRNA-1, U6b-47290-sgRNA-2 and pYLCRISPR/Cas9Pubi-H, and the vector pYLCRISPR/Cas9Pubi-H-47290 is obtained in a mode of enzyme digestion and connection.
3. Activity assay of recombinant expression vectors
The recombinant expression vectors pYLCRISPR/Cas9Pubi-H-47290-T1 and pYLCRISPR/Cas9Pubi-H-47290-T2 prepared in the above 2 are introduced into Nipponbare protoplasts of rice through PEG mediation respectively (see https:// bio-protocol. org/bio101/e1010125), and the protoplasts of plasmids pYLCRISPR/Cas9Pubi-H-47290-T1 and pYLCRISPR/Cas9Pubi-H-47290-T1 are obtained after 16 hours.
Genomic DNAs of protoplasts transiently transduced with plasmids pYLCRISPR/Cas9Pubi-H-47290-T1 and pYLCRISPR/Cas9Pubi-H-47290-T2 were extracted, respectively, a partial nucleotide sequence of LOC _ Os11g47290 gene was amplified using sequence 11 and sequence 12 in the sequence listing, and sequencing was performed.
The results are shown in FIG. 1, FIG. 1 is a sequencing peak diagram for vector activity detection based on a fixed-point knockout method of rice receptor kinase gene LOC _ Os11g47290 by pYLCRISPR/Cas9 technology, and it can be seen that the pYLCRISPR/Cas9 technology can induce the mutation of LOC _ Os11g47290 gene at target sequences 47290-T1 and 47290-T2.
4. Obtaining of recombinant Agrobacterium tumefaciens
And (3) carrying out heat shock transformation on the recombinant expression vector pYLCRISPR/Cas9Pubi-H-47290 obtained in the step (2) to obtain the recombinant agrobacterium containing the recombinant expression vector pYLCRISPR/Cas9Pubi-H-47290, which is named as EH105-Cas 9-47290.
Agrobacterium tumefaciens EHA 105: BioIVectror NTCC type culture Collection, commercially available.
Example 2 application of pYLCISPR/Cas 9 technology-based site-specific knockout method in rice variety
First, pYLCRISPR/Cas9 technology site-directed knockout of LOC _ Os11g47290 gene
Infecting a rice variety Nipponbare (hereinafter referred to as wild rice) mature embryo induced callus by recombinant agrobacterium EH105-Cas9-47290, and respectively naming obtained rice transformation plants as NIP-Cas 9-47290; the specific method of the experiment is as follows:
1. the recombinant Agrobacterium obtained in example 1 was inoculated into YEB liquid medium (containing 50. mu.g/ml kanamycin and 20. mu.g/ml rifampicin), and shake-cultured at 28 ℃ and 200rpm to OD600 of 0.6-0.8; centrifuging at 5000rpm and 4 deg.C for 5min, and resuspending thallus precipitate with AAM liquid culture medium (acetosyringone concentration of 200 μ M/L, pH 5.2) to OD600 of 0.6-0.8.
2. Respectively removing glumes of mature seeds of Nipponbare of a rice variety, soaking in 75% ethanol for 1min, then sterilizing in NaClO solution (mixed with water at a ratio of 1:2, and adding 1 drop of Tween 20) for 20min by oscillation, and repeating for 2 times. Washing with sterile water for several times until no foreign odor exists, inoculating sterilized Nipponbare seed of rice to NBD2 culture medium to induce callus, culturing in dark at 26 deg.C for 8-10 days, cutting off root and residual endosperm, and subculturing for 10 days to obtain mature embryo callus.
3. And (3) respectively soaking the mature embryo callus obtained in the step (2) in the recombinant agrobacterium tumefaciens resuspension obtained in the step (1), removing the rice material after 20-30min, inoculating the rice material on a co-culture medium (the concentration of the acetosyringone is 100 mu M/L and the pH value is 5.2) containing two layers of filter paper, and co-culturing for 3 days under the dark condition at the temperature of 26 ℃.
4. And (4) inoculating the callus co-cultured in the step (3) into a screening culture medium (the hygromycin concentration is 50mg/L and the pH value is 5.8), screening and culturing for 12 days under a dark condition at the temperature of 28 ℃, and transferring the resistant callus to a selection culture medium containing 50mg/L Hyg for continuous screening.
5. After repeated screening for 2 times, transferring the resistant callus to a differentiation medium (24 hours of illumination/day) for induced differentiation; when new rootless seedlings are generated, transferring regenerated seedlings to 1/2MS culture medium for root induction; and after the plantlets are thrilled, moving the plantlets into an artificial climate chamber for nutrient solution cultivation to obtain a regenerated plant NIP-Cas 9-47290.
6. After the obtained regeneration plant is transplanted to survive, extracting the total DNA of the leaves of the regeneration plant, carrying out PCR amplification on the self primer sequence 9 and the sequence 10 of the recombinant expression vector pYLCRISPR/Cas9Pubi-H-47290 to screen a positive transformation plant, and amplifying the plant with a 1225bp strip, namely the positive transformation plant.
The number of the detected regenerated plants, the number of the positive transformed plants and the percentage of the number of the positive transformed plants to the number of the detected regenerated plants, namely the positive rate (%) are counted, and the results are shown in table 1.
Table 1. positive rate detection result of pYRCISPR/Cas 9Pubi-H-47290 transformed rice variety
7. PCR amplification was carried out using the genome of the number of positive transformed plants as a template, and a sequence shown by sequence 11 for a specific primer LOC _ Os11g47290-TF for rice receptor kinase gene LOC _ Os11g47290 and sequence 12 for LOC _ Os11g47290-TF (a fragment containing 2 target sequences in amplification gene LOC _ Os11g 47290). Nipponbare was used as a control.
Sequencing and verifying the obtained 1096bp amplification product, and recording the mutation of the amplification product as the number of the mutant transformation plants compared with Nipponbare; the sequencing verification results are shown in table 2. The number of the regenerated plants, the number of the plants transformed with mutation and the percentage of the number of the plants transformed with mutation to the number of the regenerated plants, i.e., the mutation efficiency (%) were counted, and the results are shown in table 2.
Table 2. detection results of pYLCRISPR/Cas9Pubi-H-47290 induced mutation of rice receptor kinase gene LOC _ Os11g47290
Secondly, site-directed knockout of 47290 gene mutation transformed plant phenotype by pYLCRISPR/Cas9 technology
And collecting seeds of the 39 obtained mutation-transformed plants, and sowing to obtain T1 generation plants.
Extracting genome DNA of 200T 1 generation plants, amplifying by using a primer shown in a sequence 11 and a primer shown in a sequence 12 to obtain a PCR product, sending to sequence, selecting a homozygous mutant strain to obtain 1 homozygous mutant type, wherein the mutant type is 40.
The mutant forms of the homozygous mutant types are as follows: both target sequences 47290-T1 and 47290-T2 were mutated and base T was replaced with base A at position 371 and base G was replaced with base A at position 380 in target sequence 1; insertion of base T between positions 968 and 969 of the target sequence 2 leads to premature termination of translation of the gene LOC _ Os11g 47290. The nucleotide sequence of this homozygous mutant type is shown in FIG. 2, and the amino acid sequence is shown in FIG. 3. The plants with this mutant form were designated as T1 generation Cas9-47290 mutant plants.
T1 generation Cas9-47290 mutant plants (Cas9-47290) are inoculated with the white leaf blight bacteria GD1358 and V. Wild type nipponlily was used as a control. 15 strains are inoculated to each strain; the experiment was repeated 3 times and the results averaged.
After 14 days of inoculation with the bacterial blight strain GD1358 and V, leaf phenotype is observed.
The length of inoculated leaf spots was counted, and as a result, as shown in FIG. 4, the Cas9-47290 mutant exhibited a phenotype in which bacterial blight spots were shortened, the length of leaf spots inoculated with Bacillus subtilis GD1358 was shortened by 48.1%, and the length of leaf spots inoculated with Bacillus subtilis V was shortened by 14.4%, compared to the wild type (Nipponbare on the abscissa of the bar chart).
The results show that the Cas9-47290 mutant created by the CRISPR/Cas9 technology enhances the disease resistance of the rice to the white leaf blight bacteria GD1358 and V.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.
SEQUENCE LISTING
<110> institute of genetics and developmental biology, institute of academy of agricultural sciences, China
<120> rice receptor kinase gene LOC _ Os11g47290, and coding protein and application thereof
<160> 12
<170> PatentIn version 3.5
<210> 1
<211> 1044
<212> PRT
<213> Artificial sequence
<400> 1
Met Gly Val Gly Pro His Cys Thr Thr Ser Leu Leu Ile Ile Leu Ala
1 5 10 15
Val Val Ile Thr Ser Ser Leu Leu Thr Thr Thr Ile Lys Ala Asp Glu
20 25 30
Pro Ser Asn Asp Thr Asp Ile Ala Ala Leu Leu Ala Phe Lys Ala Gln
35 40 45
Phe Ser Asp Pro Leu Gly Phe Leu Arg Asp Gly Trp Arg Glu Asp Asn
50 55 60
Ala Ser Cys Phe Cys Gln Trp Ile Gly Val Ser Cys Ser Arg Arg Arg
65 70 75 80
Gln Arg Val Thr Ala Leu Glu Leu Pro Gly Ile Pro Leu Gln Gly Ser
85 90 95
Ile Thr Pro His Leu Gly Asn Leu Ser Phe Leu Tyr Val Leu Asn Leu
100 105 110
Ala Asn Thr Ser Leu Thr Gly Thr Leu Pro Gly Val Ile Gly Arg Leu
115 120 125
His Arg Leu Glu Leu Leu Asp Leu Gly Tyr Asn Ala Leu Ser Gly Asn
130 135 140
Ile Pro Ala Thr Ile Gly Asn Leu Thr Lys Leu Glu Leu Leu Asn Leu
145 150 155 160
Glu Phe Asn Gln Leu Ser Gly Pro Ile Pro Ala Glu Leu Gln Gly Leu
165 170 175
Arg Ser Leu Gly Ser Met Asn Leu Arg Arg Asn Tyr Leu Ser Gly Leu
180 185 190
Ile Pro Asn Ser Leu Phe Asn Asn Thr Pro Leu Leu Gly Tyr Leu Ser
195 200 205
Ile Gly Asn Asn Ser Leu Ser Gly Pro Ile Pro His Val Ile Phe Ser
210 215 220
Leu His Val Leu Gln Val Leu Val Leu Glu His Asn Gln Leu Ser Gly
225 230 235 240
Ser Leu Pro Pro Ala Ile Phe Asn Met Ser Arg Leu Glu Lys Leu Tyr
245 250 255
Ala Thr Arg Asn Asn Leu Thr Gly Pro Ile Pro Tyr Pro Ala Glu Asn
260 265 270
Gln Thr Leu Met Asn Ile Pro Met Ile Arg Val Met Cys Leu Ser Phe
275 280 285
Asn Gly Phe Ile Gly Arg Ile Pro Pro Gly Leu Ala Ala Cys Arg Lys
290 295 300
Leu Gln Met Leu Glu Leu Gly Gly Asn Leu Leu Thr Asp His Val Pro
305 310 315 320
Glu Trp Leu Ala Gly Leu Ser Leu Leu Ser Thr Leu Val Ile Gly Gln
325 330 335
Asn Glu Leu Val Gly Ser Ile Pro Val Val Leu Ser Asn Leu Thr Lys
340 345 350
Leu Thr Val Leu Asp Leu Ser Ser Cys Lys Leu Ser Gly Ile Ile Pro
355 360 365
Leu Glu Leu Gly Lys Met Thr Gln Leu Asn Ile Leu His Leu Ser Phe
370 375 380
Asn Arg Leu Thr Gly Pro Phe Pro Thr Ser Leu Gly Asn Leu Thr Lys
385 390 395 400
Leu Ser Phe Leu Gly Leu Glu Ser Asn Leu Leu Thr Gly Gln Val Pro
405 410 415
Glu Thr Leu Gly Asn Leu Arg Ser Leu Tyr Ser Leu Gly Ile Gly Lys
420 425 430
Asn His Leu Gln Gly Lys Leu His Phe Phe Ala Leu Leu Ser Asn Cys
435 440 445
Arg Glu Leu Gln Phe Leu Asp Ile Gly Met Asn Ser Phe Ser Gly Ser
450 455 460
Ile Ser Ala Ser Leu Leu Ala Asn Leu Ser Asn Asn Leu Gln Tyr Phe
465 470 475 480
Tyr Ala Asn Asp Asn Asn Leu Thr Gly Ser Ile Pro Ala Thr Ile Ser
485 490 495
Asn Leu Ser Asn Leu Asn Val Ile Gly Leu Phe Asp Asn Gln Ile Ser
500 505 510
Gly Thr Ile Pro Asp Ser Ile Met Leu Met Asp Asn Leu Gln Ala Leu
515 520 525
Asp Leu Ser Ile Asn Asn Leu Phe Gly Pro Ile Pro Gly Gln Ile Gly
530 535 540
Thr Pro Lys Gly Met Val Ala Leu Ser Leu Ser Gly Asn Asn Leu Ser
545 550 555 560
Ser Tyr Ile Pro Asn Gly Gly Ile Pro Lys Tyr Phe Ser Asn Leu Thr
565 570 575
Tyr Leu Thr Ser Leu Asn Leu Ser Phe Asn Asn Leu Gln Gly Gln Ile
580 585 590
Pro Ser Gly Gly Ile Phe Ser Asn Ile Thr Met Gln Ser Leu Met Gly
595 600 605
Asn Ala Gly Leu Cys Gly Ala Pro Arg Leu Gly Phe Pro Ala Cys Leu
610 615 620
Glu Lys Ser Asp Ser Thr Arg Thr Lys His Leu Leu Lys Ile Val Leu
625 630 635 640
Pro Thr Val Ile Val Ala Phe Gly Ala Ile Val Val Phe Leu Tyr Leu
645 650 655
Met Ile Ala Lys Lys Met Lys Asn Pro Asp Ile Thr Ala Ser Phe Gly
660 665 670
Ile Ala Asp Ala Ile Cys His Arg Leu Val Ser Tyr Gln Glu Ile Val
675 680 685
Arg Ala Thr Glu Asn Phe Asn Glu Asp Asn Leu Leu Gly Val Gly Ser
690 695 700
Phe Gly Lys Val Phe Lys Gly Arg Leu Asp Asp Gly Leu Val Val Ala
705 710 715 720
Ile Lys Ile Leu Asn Met Gln Val Glu Arg Ala Ile Arg Ser Phe Asp
725 730 735
Ala Glu Cys His Val Leu Arg Met Ala Arg His Arg Asn Leu Ile Lys
740 745 750
Ile Leu Asn Thr Cys Ser Asn Leu Asp Phe Arg Ala Leu Phe Leu Gln
755 760 765
Phe Met Pro Asn Gly Asn Leu Glu Ser Tyr Leu His Ser Glu Ser Arg
770 775 780
Pro Cys Val Gly Ser Phe Leu Lys Arg Met Glu Ile Met Leu Asp Val
785 790 795 800
Ser Met Ala Met Glu Tyr Leu His His Glu His His Glu Val Val Leu
805 810 815
His Cys Asp Leu Lys Pro Ser Asn Val Leu Phe Asp Glu Glu Met Thr
820 825 830
Ala His Val Ala Asp Phe Gly Ile Ala Lys Met Leu Leu Gly Asp Asp
835 840 845
Asn Ser Ala Val Ser Ala Ser Met Leu Gly Thr Ile Gly Tyr Met Ala
850 855 860
Pro Val Phe Glu Leu Gly Leu Leu Cys Ser Ala Asp Ser Pro Glu Gln
865 870 875 880
Arg Thr Ala Met Ser Asp Val Val Val Thr Leu Lys Lys Ile Arg Lys
885 890 895
Asp Tyr Val Lys Leu Met Ala Thr Thr Arg Pro Gly Lys Lys Leu Met
900 905 910
Ala Thr Thr Ala Asn Arg Thr Ser Lys Gly Pro Gln Asp Asn Arg Val
915 920 925
Phe Arg Glu His Ile Phe Arg Leu Ser Cys Thr Gln Lys Arg Ser Asp
930 935 940
Ser Gln Met Arg Ala Gly Thr Ile Thr Gly Ser His Leu Ser Lys Thr
945 950 955 960
Asn Glu Ala Val Gly Gln Arg Arg Lys Lys Val Val Gly Thr Gly Asp
965 970 975
Asn Ser Arg Pro Ala Leu His Glu Leu Gln Gly Arg Glu Lys Gly Lys
980 985 990
Glu Glu Gly Gly Arg Gly Gly Ser Asp Trp Ile Gly Gly Ala Gly Asp
995 1000 1005
Arg Trp Glu Arg Ser Pro Ala His Gln Ala Tyr Pro Leu Met Gly
1010 1015 1020
Pro Glu Arg Lys Glu His Ser Lys Glu Arg Lys Gly Arg Lys Arg
1025 1030 1035
Asn Gly Glu Glu Asn Phe
1040
<210> 2
<211> 7171
<212> DNA
<213> Artificial sequence
<400> 2
atgggtgttg gtcctcattg tactactagt ctgctaataa tactggccgt cgtcatcacg 60
tcgtctttgc tcacgacgac gatcaaggca gatgagccga gcaatgatac cgacatcgcc 120
gcgctgcttg ccttcaaggc acagttctct gaccctctgg gtttcctccg tgacggctgg 180
agggaggaca atgcatcctg cttctgccag tggatcggcg tgtcgtgcag ccgccgccgg 240
cagcgcgtca ccgccctgga gctgccgggc attcccctgc aagggtcgat cacccctcac 300
ctcggtaacc tctctttcct ctacgtcctc aacctcgcca acaccagcct cacggggaca 360
ctcccgggtg ttataggaag gctgcatcgc ctggagctcc ttgatcttgg ctacaatgcc 420
ctgtcaggta acatcccagc caccatagga aacctcacca aacttgagct tcttaatctc 480
gagtttaacc agctatctgg tccaatccca gcagagctgc agggcctgcg aagccttggc 540
agtatgaatc tccgtaggaa ctatctcagt ggcttgattc ccaacagtct attcaacaac 600
accccattgt taggttatct cagcattggc aacaacagct tgtcagggcc aataccgcac 660
gtgatattct cgttgcacgt gctgcaggtc cttgttctag agcacaatca attgtccggc 720
tcactgcccc cagccatctt caacatgtcc agacttgaaa agctgtatgc cactcgaaac 780
aatctcactg gacctatccc atacccagct gaaaaccaga ccttgatgaa catccccatg 840
attcgggtga tgtgtctctc tttcaacgga ttcataggcc gaattccacc tgggcttgcg 900
gcatgccgga aactccagat gcttgagtta ggtgggaatc tcttgacgga tcatgtgccg 960
gaatggttag cgggcttgtc cctgctaagc accttggtta taggtcagaa tgagcttgtc 1020
ggttcgatcc cagttgtgct aagcaatctc accaagctca ccgtgcttga tctgtcatct 1080
tgcaagctaa gtggaatcat tccattggaa ctaggaaaga tgacacaact caacatcttg 1140
cacctctcat ttaatcgcct aactggtcct tttcctacct cccttggtaa cttgacaaaa 1200
ttatcttttc taggattaga atctaacctg ctgaccggac aagtacctga gacccttggg 1260
aacctcaggt ctctatactc ccttggtatt ggaaagaatc atctacaagg gaaacttcac 1320
ttctttgccc ttctctccaa ttgtagggaa ctccaattcc tcgacatagg aatgaattct 1380
ttctcaggga gcatttctgc gagtttacta gcaaacctct ctaacaactt acaatatttt 1440
tatgcaaatg ataacaactt aactggcagt attcctgcta ccatatcaaa tctgtctaac 1500
ctgaatgtaa taggcctttt tgacaaccaa ataagcggca caattccaga ttctataatg 1560
ctaatggata atctacaggc attggacctc tctataaaca atttgtttgg accaatccca 1620
ggacaaattg gtaccccaaa aggaatggtc gcattatctc tcagtggcaa caatctttct 1680
agttacatcc ctaacggtgt tggcaatcta agcacgttgc agtactgatt tctgtcatat 1740
aataggttgt catcagttat acccgcaagc ttagttaatc ttagtaatct tctccaacta 1800
gatatttcta ataataactt aactggttca ttgccttctg atctcagttc cttcaaagta 1860
ataggcctaa tggacatctc agcaaataat ttggtcggta gcctcccaac ttcgttggga 1920
tagctccaac tgtcaagcta cctgaattta tctcaaaaca cattcaatga ttcaattcca 1980
gactctttca aaggtctaat taatttagaa acattggatc tgtctcataa caatctttca 2040
ggaggcatac caaaatactt ttccaactta acctatctta cttctttgaa cctctccttt 2100
aacaatctac aaggtcagat accaagtgga ggtatttttt caaacatcac tatgcaatct 2160
ttaatgggaa atgctggact ctgtggtgct ccacgtctgg gatttcctgc atgtctggag 2220
aagtccgact cgactagaac aaaacacttg ctgaagattg tgctccctac tgtcattgtg 2280
gcgtttggtg ccattgttgt gttcctatac ctaatgattg caaagaaaat gaaaaatcca 2340
gatattacgg cttcttttgg catagcagat gcgatttgcc acaggctagt gtcctaccaa 2400
gaaatcgttc gcgctaccga aaatttcaat gaggacaacc tacttggagt tggaagtttt 2460
ggcaaagttt tcaagggtcg gctggatgac ggtttggtgg ttgcaatcaa aatcctcaac 2520
atgcaggttg aacgagctat taggagtttt gatgcagagt gccatgtctt gcggatggcc 2580
agacatcgca acctgataaa gatactaaat acatgttcca acttggattt cagagcactg 2640
tttcttcagt tcatgcccaa tggaaacttg gagtcatact tgcactctga aagcaggcct 2700
tgtgtgggat cattcctcaa aaggatggag attatgctag atgtgtcaat ggctatggaa 2760
tatctgcatc atgaacacca tgaggttgtc ctgcattgtg atttgaaacc tagcaacgtg 2820
ctatttgatg aagagatgac tgcacatgta gcagactttg gcatagcaaa gatgttgtta 2880
ggggatgaca attcagcagt ttcagcaagc atgctaggca caattgggta catggcacca 2940
ggtacttaca ctagccaatc taaagccaca tacatttttt gttggagttg ttacaaattc 3000
acttcaagat attggccggg cttctgacga agcggagcgg aggcccgtcg ccggaggctt 3060
gggccggagc cagtgggttg gtgtatcgtg tagccgccgc aggtaacatg catggcttcc 3120
ttgtgccagt gttcgagctg ggcttgctct gttcggctga ctcccccgag caaaggacgg 3180
cgatgagcga tgtggtcgtg acactgaaga agattagaaa ggactatgtc aaattgatgg 3240
caaccacccg acccggcaaa aaattgatgg caaccacagt aagcgttgtg cagcagtgat 3300
tcatcgctct atcgttgtat atgagcgaat gaaatacata tcatttgcat cattttcttc 3360
ttctgcatta ggaatagcat cagtgatcga ttaccctttg tttgtatttg tgtatggttg 3420
aattgaatat atatatatat atatatatat atatataaca atttcgttgg tgtaaatatg 3480
tgattgaacc gctggtcaat aaatttgcat cacgaaaatg ggagtagatg atgtgctact 3540
tatgttttct tatttctggc caaaataaat aaataaaaaa ggaatattat cggcacagca 3600
tcacaactcc ggctcgttca gccttaaaca accacaatta acagtcctaa gcagagaaac 3660
ttaacaagct tttcaggcta acagaacatc aaaaggtcca caagacaaca gggtcttcag 3720
agagcacatc ttcaggctgt catgcacgca aaaaagatca gacagccaga tgagagcggg 3780
tacaataacc ggtagccatc tcagtaaaac taacgaggct gtaggacaga gaagaaaaaa 3840
ggtagtagga acgggcgaca atagtcggcc agctctacat gagctccaag gtgaaaaagc 3900
ctacaagata ggaattttca atttttaaaa atggccatgg gcccacaagt agtgagatgc 3960
atgcaatgaa agatgctagt cagttgggtc ccacaagaaa aagaaataca tctcgggatt 4020
aactgctagc ttactctagt taatatggac atggatggag ccggcagcca gcaatactat 4080
tgaacctgct taagagcaag tgcaatagta ggctatatac cagctataaa catactttaa 4140
agagataaag gaagagagag aggaatagca gattacagca cggattacaa gacgtaatat 4200
gtgtataaca tgtgagacca gatattaata gtatagtaag caactattgt atgaactagg 4260
aaggaagccc gcgcagatgc gcaggcatct aatttattgt tttatgattt atgaaataat 4320
gctaaaagat gtcatgtctc atctttttta tgctgtgaga taaactgact ttaatatact 4380
tagtaaatga taagaatgtc tccaagatta aaagaaaatc agtctgttcc gattacttat 4440
aaattctcct attgtcacca caacttatta ttattgtttt aaaagttatg atctataaat 4500
agataaaact ggatatgtac caataaaata agcctaatac atcattgata cactaaaata 4560
ttcactactt atctaagtgt tcatgtttca cttatttccg aatcaatata taaatatttt 4620
aaaaatacca cttataccaa ttaatgaaat aacaccgtta aagataaaca tgaatgcccc 4680
tctaacgggt gaagcccatc gtagccacca agactttctt tctctgttga gttttctaaa 4740
aaaaacgaaa ataaaatttt tgataaatta atccaaaaaa ttaaaactaa attggattta 4800
cttaaaaaat agactaaatt gttttttatc attttttaca tttgtagatt taaattagtg 4860
ttgtatattt atcaagtgat attagaaatc ctaaaatata atgtgaaact atttgaaaat 4920
taccacaaca ttaattatta tatgttggtc ccatgtgtca tgatgtattt aggaccataa 4980
tatacctttt ataagacata agtatttcat ttaattaaag catgaaacta atcacaaatt 5040
gaatataatt gttgcttact atactattaa tatatggtcc cacctgtcat acgcatattg 5100
cgtcttgtag tccgtgctgc agctggctac aaatctgtag cccgctgctc ttctctctca 5160
tcgtttatct cattaaaata tatttgtagc tggctaatag cttgctattt tacttgctct 5220
aagataacag tcctaccaat ttaaataaaa attaatgatg agattagaca aagtcacagg 5280
gtggcctaca agagttggta gaaatggcac ataagataaa atatgatata gtagtaaaaa 5340
tatagctaat atagctcagg aattggtgat ttaattaaaa aaacctacag ctgctaaagt 5400
gtatgcgtag atcgtctata aaaataattg tttatttaca acaccagatc acttctattg 5460
taaattccta gcgtttgcgc attcacactc agattactct ctatgaaaaa actgatctaa 5520
taatttatga taataaatta tgatgtacat gtatcatgta atttgaaaaa agaagaaaaa 5580
aaggagaatt aggatctgga tggcatcaat cgagctcgac gaggcagatc agtctccatg 5640
gaaatgataa caaattatga tgcacatgta ttgtgtaatt agaaataaaa ggagagaggg 5700
agaatccaga tatggatggc atcaagctaa cagaagcata agtatgtacg tacgtactta 5760
tgggcggaaa atctaatggg cgatcaatcg ctaccgcgat ctggtagcga tcgcatcccc 5820
tcccccctcc ctccttccac gtttcctttt ctggcaccgc attacttttc tattttagta 5880
aaatttatgc acctaaagtt tatacaccta aagtttatag acccaaagtt tagagaccca 5940
aagtttataa atcaaaagtt tatatatccg attcaaattt aaatttgaat tcaaatattt 6000
tttatatata gtatttatat acatctaaag tttatacacc taaagtttat agacccaaag 6060
tttataaatc aaaagtttat ataccctatt caaatttgaa tttgaaatat attcgattca 6120
aaatttgaat ttgaattcaa atatttttta tatatagtat ttagatacat ctaaagttta 6180
tacacctaaa gtttatagac ccaaagttta gagacccaaa gtttacatac ccaattcaaa 6240
tttgaatttg aattgtatcc gattcaaatt taaatttgaa ttcaaatatt tttatatata 6300
gtatttctat acatctaaag tttatacacc taaagtttat agaccgaaag tttttagtaa 6360
aaagtttata tacccgattc aaatttgaat ttgaattcaa atattagttt atagatccaa 6420
agtttataag tcaaaaattt acataaccgt ttcaattctg aatttaaatt taaatattta 6480
tggtgtagta ggaagagaaa aaggaaagga ggaagggggg agaggaggga gcgactggat 6540
aggaggggcg ggtgatcgct gggagcgatc acccgcccat caggcatacc cacttatggg 6600
tccggagaga aagggtgaga aagatttagt tttgatccaa cggttaatat tattgggtcc 6660
accactttaa ataaaaactt acggtcagat atttttcttt ttctcagaat attatttaaa 6720
ttattagagc gccatgtggc gacttaggag cgtttatata taggagtctc acgtggtagc 6780
ttgagagcgt acgtagaaag tttaatgaac ttttagtata taaataatag atagatagat 6840
ttaggatttt ctctaatttg ctagaacgcc acgtggtggc ctaggatcgt tattggtcca 6900
aataatatgt aattacttat ttatataaat aatatgtaat ttagaagtat acttattcaa 6960
tatgtattgt tgcagaacat agtaaagaga ggaaggggag gaaaagaaac ggagaggagg 7020
tactcataca tgacgcacgg caggattcct tctttgtttt ttagaaagta agagaataaa 7080
ccaatctaat ctaacggctt ggagtaacgg gcccactaat tttaatgaaa attaatggct 7140
agatgttttg ctttttttat agaatttcta g 7171
<210> 3
<211> 20
<212> DNA
<213> Artificial sequence
<400> 3
tgcagccttc ctataacacc 20
<210> 4
<211> 20
<212> DNA
<213> Artificial sequence
<400> 4
catgtgccgg aatggttagc 20
<210> 5
<211> 37
<212> DNA
<213> Artificial sequence
<400> 5
tgcagccttc ctataacacc gttttagagc tagaaat 37
<210> 6
<211> 37
<212> DNA
<213> Artificial sequence
<400> 6
ggtgttatag gaaggctgca cggcagccaa gccagca 37
<210> 7
<211> 37
<212> DNA
<213> Artificial sequence
<400> 7
catgtgccgg aatggttagc gttttagagc tagaaat 37
<210> 8
<211> 37
<212> DNA
<213> Artificial sequence
<400> 8
gctaaccatt ccggcacatg caacacaagc ggcagca 37
<210> 9
<211> 22
<212> DNA
<213> Artificial sequence
<400> 9
gcggtgtcat ctatgttact ag 22
<210> 10
<211> 22
<212> DNA
<213> Artificial sequence
<400> 10
ccgacataga tgcaataact tc 22
<210> 11
<211> 20
<212> DNA
<213> Artificial sequence
<400> 11
atgagccgag caatgatacc 20
<210> 12
<211> 20
<212> DNA
<213> Artificial sequence
<400> 12
ccaagggagg taggaaaagg 20
Claims (5)
1. The application of substances for inhibiting or reducing the expression of LOC _ Os11g47290 protein coding nucleic acid in rice in improving the bacterial leaf blight resistance of the rice or improving the bacterial leaf blight resistance of the rice:
the LOC _ Os11g47290 protein is any one of the following (a 1) - (a 2):
(a1) protein shown as a sequence 1 in a sequence table;
(a2) A fusion protein obtained by attaching a tag to the N-terminus or/and the C-terminus of the protein of (a 1);
the substances are as follows:
1) interfering RNA;
2) CRISPR/Cas9 system;
in the CRISPR/Cas9 system, the target sequence of sgRNA is the nucleotide sequence of XXXGG form in the encoding nucleic acid of LOC _ Os11g47290 protein, wherein XXX is the nucleic acid sequence of any 19-20 bp in the encoding nucleic acid of LOC _ Os11g47290 protein, and N is any one base in A, T, G, C.
2. Use according to claim 1, characterized in that:
the LOC _ Os11g47290 protein coding nucleic acid is a DNA molecule with a coding region shown as a sequence 2 in a sequence table.
3. Use according to claim 2, characterized in that:
the sgRNA is sgRNA1 and/or sgRNA 2;
the target sequence of the sgRNA1 is a sequence 3; the target sequence of the sgRNA2 is sequence 4.
4. A method for improving the bacterial leaf blight resistance of rice is any one of the following methods 1) to 2), and the bacterial leaf blight resistance of rice is improved;
1) the method comprises the following steps: inhibiting or reducing the expression of LOC _ Os11g47290 protein-encoding nucleic acid in target rice;
2) the method comprises the following steps: carrying out gene editing on LOC _ Os11g47290 protein coding nucleic acid in target rice;
Or, a method for producing a transgenic rice plant having high resistance to bacterial blight, which is any one of the following 1) to 2),
1) the method comprises the following steps: inhibiting or reducing the expression of LOC _ Os11g47290 protein coding nucleic acid in target rice to obtain transgenic rice;
2) the method comprises the following steps: carrying out gene editing on LOC _ Os11g47290 protein coding nucleic acid in target rice to obtain transgenic rice;
the bacterial leaf blight resistance of the transgenic rice is higher than that of the target rice;
the LOC _ Os11g47290 protein is any one of the following (a 1) - (a 2):
(a1) protein shown as a sequence 1 in a sequence table;
(a2) a fusion protein obtained by attaching a tag to the N-terminus or/and the C-terminus of the protein of (a 1);
the inhibition or reduction of the expression of the nucleic acid encoding the LOC _ Os11g47290 protein in the rice is realized by performing gene editing on the nucleic acid encoding the LOC _ Os11g47290 protein, wherein the gene editing is realized by virtue of a CRISPR/Cas9 system;
in the CRISPR/Cas9 system, the target sequence of sgRNA is the nucleotide sequence of XXXGG form in the encoding nucleic acid of LOC _ Os11g47290 protein, wherein XXX is the nucleic acid sequence of any 19-20 bp in the encoding nucleic acid of LOC _ Os11g47290 protein, and N is any one base in A, T, G, C.
5. The method of claim 4, wherein:
the sgRNA is sgRNA1 and/or sgRNA 2;
the target sequence of the sgRNA1 is a sequence 3; the target sequence of the sgRNA2 is sequence 4.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911164723.4A CN110791487B (en) | 2019-11-25 | 2019-11-25 | Rice receptor kinase gene LOC _ Os11g47290, and coding protein and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911164723.4A CN110791487B (en) | 2019-11-25 | 2019-11-25 | Rice receptor kinase gene LOC _ Os11g47290, and coding protein and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110791487A CN110791487A (en) | 2020-02-14 |
CN110791487B true CN110791487B (en) | 2022-05-24 |
Family
ID=69445910
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911164723.4A Active CN110791487B (en) | 2019-11-25 | 2019-11-25 | Rice receptor kinase gene LOC _ Os11g47290, and coding protein and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110791487B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111269305B (en) * | 2020-03-02 | 2021-06-15 | 中国农业科学院作物科学研究所 | Rice OsARFC1 gene and function and application of encoding protein thereof |
CN113403308B (en) * | 2020-12-25 | 2022-10-21 | 华南农业大学 | Method for improving bacterial leaf blight resistance of rice |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2002272291A (en) * | 2001-03-21 | 2002-09-24 | Idemitsu Kosan Co Ltd | Disease injury-resistant rice |
CN109369790A (en) * | 2018-12-04 | 2019-02-22 | 中国农业科学院作物科学研究所 | The white blight resistance-associated protein OsBBR1 of rice and its encoding gene and application |
CN109400688A (en) * | 2018-12-04 | 2019-03-01 | 中国农业科学院作物科学研究所 | The application of OsHAP2C and its encoding gene in adjusting and controlling rice bacterial leaf spot resistance |
-
2019
- 2019-11-25 CN CN201911164723.4A patent/CN110791487B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2002272291A (en) * | 2001-03-21 | 2002-09-24 | Idemitsu Kosan Co Ltd | Disease injury-resistant rice |
CN109369790A (en) * | 2018-12-04 | 2019-02-22 | 中国农业科学院作物科学研究所 | The white blight resistance-associated protein OsBBR1 of rice and its encoding gene and application |
CN109400688A (en) * | 2018-12-04 | 2019-03-01 | 中国农业科学院作物科学研究所 | The application of OsHAP2C and its encoding gene in adjusting and controlling rice bacterial leaf spot resistance |
Non-Patent Citations (2)
Title |
---|
Leucine Rich Repeat family protein [Oryza sativa Japonica Group],ABA95543.1;Buell,C.R.等;《Genbank》;20110505;第1-3页 * |
水稻白叶枯病抗性基因的鉴定、定位、克隆与育种应用;夏春等;《分子植物育种》;20121128;第10卷(第06期);第761-771页 * |
Also Published As
Publication number | Publication date |
---|---|
CN110791487A (en) | 2020-02-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2022135246A1 (en) | R gene for controlling matching of soybean-rhizobium, protein and use thereof | |
CN104450711B (en) | OsmiR156f genes are increasing paddy rice effective tillering and the application in improving paddy rice single plant yield | |
CN110878303B (en) | Rice Os11g0681100 gene and function and application of encoded protein thereof | |
CN111118005A (en) | MiRNA related to rice blast resistance, corresponding precursor and application | |
CN110791487B (en) | Rice receptor kinase gene LOC _ Os11g47290, and coding protein and application thereof | |
CN111116725B (en) | Gene Os11g0682000 and application of protein coded by same in regulation and control of bacterial leaf blight resistance of rice | |
CN110699369B (en) | Rice receptor kinase gene OsRLCK21 and protein coded by same and application thereof | |
CN101503690A (en) | Method for promoting plant seed augmentation and cotton fibre growth by using RDL1 gene | |
CN111676235B (en) | Application of GTP-binding protein gene GhROP6 in regulation and control of cotton fiber properties | |
CN109112138B (en) | Gene OsVAS1 for regulating and controlling ideal plant type of rice | |
CN109486840A (en) | The NmeCas9 gene of codon vegetalization transformation and its application | |
CN105695479B (en) | Chrysanthemum symmetry gene CmCYC2c and application thereof | |
CN111499709B (en) | RGN1 protein related to grain number per ear of rice as well as encoding gene and application thereof | |
CN110699363B (en) | Rice retrotransposon gene LOC _ Os11g45295, and coding protein and application thereof | |
CN115677839B (en) | Rice OsTOBP 1C protein and application of encoding gene thereof | |
CN110628813B (en) | Rice lipase gene Os07g0586800 and function and application of encoding protein thereof | |
WO2018184333A1 (en) | Use of protein nog1 in regulation of plant yield and grain number per ear | |
CN116003563B (en) | Application of calmodulin binding protein CaMBP in regulating cold tolerance of plant | |
CN112080481B (en) | Spike-type related gene OsFRS5 and application and phenotype recovery method thereof | |
CN104098662A (en) | Rice drought resistance related protein, coding gene and application thereof | |
CN113929757B (en) | Method for enhancing cold tolerance of rice by mutating calcium ion binding protein OsCIP1/2 | |
CN114516908B (en) | Rice grain shape regulatory protein HOS59, encoding gene and application thereof | |
CN113846120B (en) | Application of protein TaTIN103 in regulation and control of wheat tillering | |
CN110295192B (en) | Bivalent RNAi expression vector for constructing TYLCV and ToCV by Gateway technology and application thereof | |
CN108409845B (en) | Application of protein TaNRT2.5 in regulation and control of nitrogen fertilizer utilization efficiency of plants |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |