CN108034671B - Plasmid vector and method for establishing plant population by using same - Google Patents
Plasmid vector and method for establishing plant population by using same Download PDFInfo
- Publication number
- CN108034671B CN108034671B CN201711294415.4A CN201711294415A CN108034671B CN 108034671 B CN108034671 B CN 108034671B CN 201711294415 A CN201711294415 A CN 201711294415A CN 108034671 B CN108034671 B CN 108034671B
- Authority
- CN
- China
- Prior art keywords
- nucleotide sequence
- sequence
- plasmid vector
- gene
- enzyme cutting
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 239000013600 plasmid vector Substances 0.000 title claims abstract description 74
- 238000000034 method Methods 0.000 title claims abstract description 23
- 239000002773 nucleotide Substances 0.000 claims abstract description 122
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 122
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 87
- 102000004190 Enzymes Human genes 0.000 claims abstract description 67
- 108090000790 Enzymes Proteins 0.000 claims abstract description 67
- 230000014509 gene expression Effects 0.000 claims abstract description 53
- 238000003776 cleavage reaction Methods 0.000 claims abstract description 39
- 230000007017 scission Effects 0.000 claims abstract description 39
- 238000011144 upstream manufacturing Methods 0.000 claims abstract description 33
- 108091033409 CRISPR Proteins 0.000 claims abstract description 29
- 108020005004 Guide RNA Proteins 0.000 claims abstract description 22
- 241000196324 Embryophyta Species 0.000 claims description 70
- 238000005520 cutting process Methods 0.000 claims description 56
- 235000007164 Oryza sativa Nutrition 0.000 claims description 34
- 235000009566 rice Nutrition 0.000 claims description 33
- 239000013598 vector Substances 0.000 claims description 30
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 27
- 206010020649 Hyperkeratosis Diseases 0.000 claims description 20
- 230000009261 transgenic effect Effects 0.000 claims description 17
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 14
- 108700008625 Reporter Genes Proteins 0.000 claims description 12
- 229920002401 polyacrylamide Polymers 0.000 claims description 12
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 claims description 10
- 108091034117 Oligonucleotide Proteins 0.000 claims description 9
- 230000008685 targeting Effects 0.000 claims description 9
- 108091006047 fluorescent proteins Proteins 0.000 claims description 8
- 239000003550 marker Substances 0.000 claims description 7
- 238000012408 PCR amplification Methods 0.000 claims description 6
- 230000000295 complement effect Effects 0.000 claims description 6
- 238000012216 screening Methods 0.000 claims description 6
- 238000012258 culturing Methods 0.000 claims description 5
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 claims description 4
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 claims description 4
- 102000009572 RNA Polymerase II Human genes 0.000 claims description 4
- 108010009460 RNA Polymerase II Proteins 0.000 claims description 4
- 108091005948 blue fluorescent proteins Proteins 0.000 claims description 3
- 238000002744 homologous recombination Methods 0.000 claims description 3
- 230000006801 homologous recombination Effects 0.000 claims description 3
- 108091026890 Coding region Proteins 0.000 claims description 2
- 108010005774 beta-Galactosidase Proteins 0.000 claims description 2
- 210000001938 protoplast Anatomy 0.000 claims description 2
- 241000209094 Oryza Species 0.000 claims 2
- 108060001084 Luciferase Proteins 0.000 claims 1
- 230000010076 replication Effects 0.000 abstract description 8
- 108020004414 DNA Proteins 0.000 description 45
- 240000007594 Oryza sativa Species 0.000 description 32
- 239000005090 green fluorescent protein Substances 0.000 description 26
- 239000013612 plasmid Substances 0.000 description 22
- 230000035772 mutation Effects 0.000 description 20
- 239000012634 fragment Substances 0.000 description 17
- 101150102092 ccdB gene Proteins 0.000 description 11
- 238000010276 construction Methods 0.000 description 10
- 241000589158 Agrobacterium Species 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 8
- 230000009466 transformation Effects 0.000 description 8
- 101150066002 GFP gene Proteins 0.000 description 6
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 6
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 6
- 101710166076 Mitogen-activated protein kinase 5 Proteins 0.000 description 6
- 239000000203 mixture Substances 0.000 description 6
- 102000004169 proteins and genes Human genes 0.000 description 6
- 238000011160 research Methods 0.000 description 6
- 238000010354 CRISPR gene editing Methods 0.000 description 5
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- 230000029087 digestion Effects 0.000 description 5
- 238000001976 enzyme digestion Methods 0.000 description 5
- 108010034529 leucyl-lysine Proteins 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 102000012410 DNA Ligases Human genes 0.000 description 4
- 108010061982 DNA Ligases Proteins 0.000 description 4
- 108010062796 arginyllysine Proteins 0.000 description 4
- 210000004027 cell Anatomy 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 239000000843 powder Substances 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 3
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 3
- 229930006000 Sucrose Natural products 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 230000005782 double-strand break Effects 0.000 description 3
- 230000005284 excitation Effects 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 238000002703 mutagenesis Methods 0.000 description 3
- 231100000350 mutagenesis Toxicity 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 239000005720 sucrose Substances 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 238000012795 verification Methods 0.000 description 3
- 108091005957 yellow fluorescent proteins Proteins 0.000 description 3
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- 108700028369 Alleles Proteins 0.000 description 2
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 2
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 2
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 2
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 2
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 2
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 2
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 2
- GRRNUXAQVGOGFE-UHFFFAOYSA-N Hygromycin-B Natural products OC1C(NC)CC(N)C(O)C1OC1C2OC3(C(C(O)C(O)C(C(N)CO)O3)O)OC2C(O)C(CO)O1 GRRNUXAQVGOGFE-UHFFFAOYSA-N 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 2
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 2
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 2
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 2
- VWJFOUBDZIUXGA-AVGNSLFASA-N Lys-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N VWJFOUBDZIUXGA-AVGNSLFASA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 2
- 241000209504 Poaceae Species 0.000 description 2
- 108020005091 Replication Origin Proteins 0.000 description 2
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- OJOBTAOGJIWAGB-UHFFFAOYSA-N acetosyringone Chemical compound COC1=CC(C(C)=O)=CC(OC)=C1O OJOBTAOGJIWAGB-UHFFFAOYSA-N 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 230000030609 dephosphorylation Effects 0.000 description 2
- 238000006209 dephosphorylation reaction Methods 0.000 description 2
- 102000034287 fluorescent proteins Human genes 0.000 description 2
- 238000010363 gene targeting Methods 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- GRRNUXAQVGOGFE-NZSRVPFOSA-N hygromycin B Chemical compound O[C@@H]1[C@@H](NC)C[C@@H](N)[C@H](O)[C@H]1O[C@H]1[C@H]2O[C@@]3([C@@H]([C@@H](O)[C@@H](O)[C@@H](C(N)CO)O3)O)O[C@H]2[C@@H](O)[C@@H](CO)O1 GRRNUXAQVGOGFE-NZSRVPFOSA-N 0.000 description 2
- 229940097277 hygromycin b Drugs 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 230000005855 radiation Effects 0.000 description 2
- 108010054624 red fluorescent protein Proteins 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 229940027257 timentin Drugs 0.000 description 2
- 238000012800 visualization Methods 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 1
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 1
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 1
- TTXYKSADPSNOIF-IHRRRGAJSA-N Arg-Asp-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O TTXYKSADPSNOIF-IHRRRGAJSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 1
- UPKMBGAAEZGHOC-RWMBFGLXSA-N Arg-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O UPKMBGAAEZGHOC-RWMBFGLXSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- RIIVUOJDDQXHRV-SRVKXCTJSA-N Arg-Lys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O RIIVUOJDDQXHRV-SRVKXCTJSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- PYZPXCZNQSEHDT-GUBZILKMSA-N Arg-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PYZPXCZNQSEHDT-GUBZILKMSA-N 0.000 description 1
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 1
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 1
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 1
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- QNIACYURSSCLRP-GUBZILKMSA-N Asp-Lys-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O QNIACYURSSCLRP-GUBZILKMSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 1
- LZZYPRNAOMGNLH-UHFFFAOYSA-M Cetrimonium bromide Chemical compound [Br-].CCCCCCCCCCCCCCCC[N+](C)(C)C LZZYPRNAOMGNLH-UHFFFAOYSA-M 0.000 description 1
- 206010010144 Completed suicide Diseases 0.000 description 1
- UGPCUUWZXRMCIJ-KKUMJFAQSA-N Cys-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CS)N UGPCUUWZXRMCIJ-KKUMJFAQSA-N 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- -1 DSM2 Proteins 0.000 description 1
- 108091005941 EBFP Proteins 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- NNXIQPMZGZUFJJ-AVGNSLFASA-N Gln-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N NNXIQPMZGZUFJJ-AVGNSLFASA-N 0.000 description 1
- GIVHPCWYVWUUSG-HVTMNAMFSA-N Gln-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GIVHPCWYVWUUSG-HVTMNAMFSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 1
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 1
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- WTJIWXMJESRHMM-XDTLVQLUSA-N Gln-Tyr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O WTJIWXMJESRHMM-XDTLVQLUSA-N 0.000 description 1
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- KEBACWCLVOXFNC-DCAQKATOSA-N Glu-Arg-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KEBACWCLVOXFNC-DCAQKATOSA-N 0.000 description 1
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- ZXQPJYWZSFGWJB-AVGNSLFASA-N Glu-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N ZXQPJYWZSFGWJB-AVGNSLFASA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 1
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 1
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 1
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 1
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 1
- JPXNYFOHTHSREU-UWVGGRQHSA-N Gly-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN JPXNYFOHTHSREU-UWVGGRQHSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- MVADCDSCFTXCBT-CIUDSAMLSA-N His-Asp-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MVADCDSCFTXCBT-CIUDSAMLSA-N 0.000 description 1
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 1
- UOAVQQRILDGZEN-SRVKXCTJSA-N His-Asp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UOAVQQRILDGZEN-SRVKXCTJSA-N 0.000 description 1
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 1
- FSOXZQBMPBQKGJ-QSFUFRPTSA-N His-Ile-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 FSOXZQBMPBQKGJ-QSFUFRPTSA-N 0.000 description 1
- DAKSMIWQZPHRIB-BZSNNMDCSA-N His-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DAKSMIWQZPHRIB-BZSNNMDCSA-N 0.000 description 1
- 101001038784 Homo sapiens E3 ubiquitin-protein ligase listerin Proteins 0.000 description 1
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- HUWYGQOISIJNMK-SIGLWIIPSA-N Ile-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HUWYGQOISIJNMK-SIGLWIIPSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- RCMNUBZKIIJCOI-ZPFDUUQYSA-N Ile-Met-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RCMNUBZKIIJCOI-ZPFDUUQYSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- BJECXJHLUJXPJQ-PYJNHQTQSA-N Ile-Pro-His Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N BJECXJHLUJXPJQ-PYJNHQTQSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- SWNRZNLXMXRCJC-VKOGCVSHSA-N Ile-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 SWNRZNLXMXRCJC-VKOGCVSHSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- 102000016443 LTN1 Human genes 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 1
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 1
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- PXHCFKXNSBJSTQ-KKUMJFAQSA-N Lys-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)O PXHCFKXNSBJSTQ-KKUMJFAQSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- PHHYNOUOUWYQRO-XIRDDKMYSA-N Lys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N PHHYNOUOUWYQRO-XIRDDKMYSA-N 0.000 description 1
- KZOHPCYVORJBLG-AVGNSLFASA-N Lys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N KZOHPCYVORJBLG-AVGNSLFASA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- KKFVKBWCXXLKIK-AVGNSLFASA-N Lys-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCCN)N KKFVKBWCXXLKIK-AVGNSLFASA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 1
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 101150081330 MOC1 gene Proteins 0.000 description 1
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- OLWAOWXIADGIJG-AVGNSLFASA-N Met-Arg-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(O)=O OLWAOWXIADGIJG-AVGNSLFASA-N 0.000 description 1
- UAPZLLPGGOOCRO-IHRRRGAJSA-N Met-Asn-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N UAPZLLPGGOOCRO-IHRRRGAJSA-N 0.000 description 1
- PNDCUTDWYVKBHX-IHRRRGAJSA-N Met-Asp-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PNDCUTDWYVKBHX-IHRRRGAJSA-N 0.000 description 1
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 1
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 1
- MUDYEFAKNSTFAI-JYJNAYRXSA-N Met-Tyr-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O MUDYEFAKNSTFAI-JYJNAYRXSA-N 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 240000008467 Oryza sativa Japonica Group Species 0.000 description 1
- 101100043229 Oryza sativa subsp. japonica SPL14 gene Proteins 0.000 description 1
- 240000000220 Panda oleosa Species 0.000 description 1
- 235000016496 Panda oleosa Nutrition 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 1
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 1
- UEEVBGHEGJMDDV-AVGNSLFASA-N Phe-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEEVBGHEGJMDDV-AVGNSLFASA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- GDBOREPXIRKSEQ-FHWLQOOXSA-N Phe-Gln-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GDBOREPXIRKSEQ-FHWLQOOXSA-N 0.000 description 1
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 1
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 1
- DSXPMZMSJHOKKK-HJOGWXRNSA-N Phe-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DSXPMZMSJHOKKK-HJOGWXRNSA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 1
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 1
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 1
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 1
- 108700001094 Plant Genes Proteins 0.000 description 1
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- CDGABSWLRMECHC-IHRRRGAJSA-N Pro-Lys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CDGABSWLRMECHC-IHRRRGAJSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 101100041989 Schizosaccharomyces pombe (strain 972 / ATCC 24843) sds23 gene Proteins 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 1
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- JUTGONBTALQWMK-NAKRPEOUSA-N Ser-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N JUTGONBTALQWMK-NAKRPEOUSA-N 0.000 description 1
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- 240000006394 Sorghum bicolor Species 0.000 description 1
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 1
- 238000010459 TALEN Methods 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- IWAVRIPRTCJAQO-HSHDSVGOSA-N Thr-Pro-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O IWAVRIPRTCJAQO-HSHDSVGOSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- RSUXQZNWAOTBQF-XIRDDKMYSA-N Trp-Arg-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RSUXQZNWAOTBQF-XIRDDKMYSA-N 0.000 description 1
- MICFJCRQBFSKPA-UMPQAUOISA-N Trp-Met-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 MICFJCRQBFSKPA-UMPQAUOISA-N 0.000 description 1
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 1
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 1
- NJLQMKZSXYQRTO-FHWLQOOXSA-N Tyr-Glu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NJLQMKZSXYQRTO-FHWLQOOXSA-N 0.000 description 1
- GFJXBLSZOFWHAW-JYJNAYRXSA-N Tyr-His-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GFJXBLSZOFWHAW-JYJNAYRXSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- ZQGPWORGSNRQLN-NHCYSSNCSA-N Val-Asp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZQGPWORGSNRQLN-NHCYSSNCSA-N 0.000 description 1
- NYTKXWLZSNRILS-IFFSRLJSSA-N Val-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N)O NYTKXWLZSNRILS-IFFSRLJSSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- JVYIGCARISMLMV-HOCLYGCPSA-N Val-Gly-Trp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JVYIGCARISMLMV-HOCLYGCPSA-N 0.000 description 1
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- 230000002745 absorbent Effects 0.000 description 1
- 239000002250 absorbent Substances 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 238000012271 agricultural production Methods 0.000 description 1
- 238000007605 air drying Methods 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000005266 casting Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 108010082025 cyan fluorescent protein Proteins 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 239000012154 double-distilled water Substances 0.000 description 1
- 210000002257 embryonic structure Anatomy 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 108010045262 enhanced cyan fluorescent protein Proteins 0.000 description 1
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000008014 freezing Effects 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 239000003517 fume Substances 0.000 description 1
- 230000005251 gamma ray Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 238000000227 grinding Methods 0.000 description 1
- 230000035876 healing Effects 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000009456 molecular mechanism Effects 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 230000035764 nutrition Effects 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000008263 repair mechanism Effects 0.000 description 1
- 238000007480 sanger sequencing Methods 0.000 description 1
- 238000005204 segregation Methods 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 108700029760 synthetic LTSP Proteins 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 108010060175 trypsinogen activation peptide Proteins 0.000 description 1
- 239000007160 ty medium Substances 0.000 description 1
- 108010012567 tyrosyl-glycyl-glycyl-phenylalanyl Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 238000012070 whole genome sequencing analysis Methods 0.000 description 1
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
- C12N15/8213—Targeted insertion of genes into the plant genome by homologous recombination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Medicinal Chemistry (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The application relates to a plasmid vector and a method for establishing plant mutant population on a large scale by using the plasmid vector. The plasmid vector comprises a gene expression cassette for expressing a Cas9 protein, a first promoter, a nucleotide sequence comprising a suicide gene sequence, a gRNA scaffold element and a termination signal at the origin of replication, and at least one first enzyme cleavage site located 5 'of the nucleotide sequence comprising the suicide gene sequence and at least one second enzyme cleavage site located 3' of the nucleotide sequence comprising the suicide gene sequence; and the first and second enzyme cleavage sites are absent from other positions of the plasmid vector; wherein the first promoter is located upstream of the termination signal, the nucleotide sequence comprising a suicide gene sequence and the gRNA scaffold element and are both located between the first promoter and the termination signal.
Description
Technical Field
The present application relates to a plasmid vector and a method for establishing a plant population using the same.
Background
The CRISPR/Cas9 genome site-directed editing technology has become an important means for the current plant gene function and application research. The gene targeting technology can artificially modify genetic materials at specific sites of a genome in an organism, and genetic information change caused by the gene targeting technology can be stably transmitted and functionally presented in generations. The principle of genome site-specific editing is that an artificial nuclease is used for cutting at a target site of a biological genome to generate a DNA Double Strand Break (DSB), so that a DSB repair mechanism of a cell is activated to achieve the aim. This technique generally allows for simultaneous screening of mutations at as few as one site or at as few as a few sites.
Although large scale mutant population creation work has been done on plants using CRISPR/Cas9 technology. However, the lack of negative selection markers (such as suicide gene ccdB) in some plasmid vectors leads to a low positive rate during the construction of the vector library; and the plasmid vector containing the negative selection marker ccdB is constructed by only T4DNA ligase, and the process has high cost and efficiency but very complicated operation. In addition, the existing plasmid vector for CRISPR/Cas9 library construction does not contain a reporter gene for tracking, so that false positive materials cannot be directly removed in the preparation process of the mutant library, and the late stage is confronted with a complex and large-scale seed production process. Therefore, it cannot be effectively used for large-scale mutant population construction and screening.
In addition, rice (Oryza sativa L.) among plants is taken as an example: as the first crop to complete genome sequencing, rice research has entered the age of functional genomics. The key of the current rice research is to excavate relevant functional genes of important economic traits, explain the action mechanism of the genes and utilize the genes in production. Rice mutants are artificially created and screened by separating natural rice-induced mutants, or by physical mutagenesis means such as gamma ray and fast neutron radiation, chemical mutagenesis means such as MNU and EMS treatment, and biotechnology means such as T-DNA insertion, TALEN and CRISPR fixed-point editing technology, and the like, at present, researchers combine technologies such as map-based cloning and the like to separate and identify a plurality of key genes for controlling rice growth, development, stress resistance and nutrition absorption related processes such as MOC1, IPA1, DSM2, DLT and LTN1, and the molecular mechanism in the key genes is analyzed. However, compared with tens of thousands of functional genes carried by the whole genome of rice, the previous results are still cup of salary. How to comprehensively analyze and annotate the functions of all genes in the rice so as to finally improve the rice according to local conditions and better serve the agricultural production in China is the key point of the research on functional genomes of the rice. Among them, establishing the population resources of rice mutants is the most important. At present, a plurality of scientific research institutions in the countries such as the United states, China, Japan and the like establish huge rice EMS induced mutant populations, radiation induced mutant populations, T-DNA or transposon (Ac/Ds system, Tos17 system) insertion mutant populations. However, mutant groups constructed by physicochemical mutagenesis and insertion mutation have the problems of complex operation, low mutation rate, long time consumption for screening and separation, high cost and the like. For example, for the physicochemical mutation of rice mutant groups, the later stage relates to the fussy processes of hybridization, gene map-location cloning and the like; for the insertion mutant population, the target gene is easy to clone, but the coverage rate of the coding gene of the whole genome is not high. In particular, they all involve a large-scale seed production process in the later period, and consume a large amount of manpower and material resources.
Therefore, if a technology for constructing a plasmid library and creating a rice mutant population with high efficiency and high throughput by a one-step cloning method can be developed, false-positive transformation materials can be directly removed by methods such as fluorescence observation and the like so as to avoid the tedious work in the seed production process of the subsequent mutant population materials, save labor and cost, and greatly accelerate the steps of gene function research, new germplasm resource creation and variety improvement in the post-genomics era of rice.
Disclosure of Invention
To overcome the drawbacks of the prior art, one of the present applications provides a plasmid vector comprising a gene expression cassette comprising a Cas9 protein, a first promoter, a nucleotide sequence comprising a suicide gene sequence, a gRNA scaffold element and a termination signal, and at least one first enzyme cleavage site located 5 'of the nucleotide sequence comprising the suicide gene sequence and at least one second enzyme cleavage site located 3' of the nucleotide sequence comprising the suicide gene sequence; and the first and second enzyme cleavage sites are absent from other positions of the plasmid vector; wherein the first promoter is located upstream of the termination signal, and the nucleotide sequence comprising the suicide gene sequence and the gRNA scaffold element are both located between the first promoter and the termination signal.
In a specific embodiment, the first enzyme cleavage site and the second enzyme cleavage site may be the same or different. The first and second cleavage sites are set so that when the plasmid vector is used, the nucleotide sequence comprising the suicide gene sequence is excised, while the plasmid vector is linearized. For a plasmid vector which fails to successfully excise a nucleotide sequence containing a suicide gene sequence, the suicide gene exerts its characteristics to suicide the plasmid vector during transformation, thereby achieving the purpose of improving the positive rate.
In one embodiment, the plasmid vector further comprises a marker gene expression cassette.
In a specific embodiment, when a plurality of marker genes are contained, the marker genes may be used independently as individual expression cassettes or may be used in common as one expression cassette, which may be appropriately selected according to the ordinary skill in the art.
In one embodiment, the marker gene expression cassette comprises a selection gene expression cassette and a reporter gene expression cassette.
In a specific embodiment, the selection gene expression cassette comprises a selection gene expression cassette for use in plants.
In one embodiment, the selection gene expression cassette is a hygromycin resistance gene expression cassette or a G418 resistance gene expression cassette.
In one embodiment, the amino acid sequence encoded by the resistance gene or the nucleotide sequence of the resistance gene described above can be easily obtained by those skilled in the art, for example, the nucleotide sequence genebank accession number of the hygromycin resistance gene is KY420085.1, and the amino acid sequence genebank accession number of the hygromycin resistance gene is ASK 07515.1.
In a specific embodiment, the reporter gene is selected from at least one of a β galactosidase gene, a luciferase gene, a fluorescent protein gene, and a seed coat color gene.
In a specific embodiment, the reporter gene is selected from at least one of fluorescent protein genes. In one embodiment, the fluorescent protein gene is selected from at least one of a green fluorescent protein gene, a red fluorescent protein gene, a cyan fluorescent protein gene, a blue fluorescent protein gene, and a yellow fluorescent protein gene.
In one embodiment, the green fluorescent protein includes proteins that exhibit green fluorescence or enhanced green fluorescence after various mutations, such as enhanced green fluorescent protein. The red fluorescent protein includes various proteins which still show red fluorescence or red fluorescence enhancement after mutation, such as enhanced red fluorescent protein. Cyan fluorescent proteins include various proteins that exhibit cyan fluorescence or cyan fluorescence enhancement after mutation, such as enhanced cyan fluorescent protein. Blue fluorescent proteins include proteins that exhibit blue fluorescence or blue fluorescence enhancement after various mutations, such as enhanced blue fluorescent proteins. Yellow fluorescent proteins include proteins that exhibit yellow fluorescence or enhanced yellow fluorescence after various mutations, such as enhanced yellow fluorescent protein.
In a specific embodiment, the reporter gene expression cassette is selected from the group consisting of green fluorescent protein expression cassettes.
In one embodiment, the amino acid sequence encoded by the reporter gene or the sequence of the reporter gene (e.g., the amino acid sequence of a fluorescent protein or the nucleotide sequence of a fluorescent protein gene) can be readily obtained by one skilled in the art, for example, from genebank. For example, the nucleotide sequence genebank accession number of the green fluorescent protein is KY464890.1, and the encoded amino acid sequence genebank accession number is AQT 31663.1.
In a specific embodiment, the nucleotide sequence of the first promoter is selected from the group consisting of nucleotide sequences specific to the plant species to be mutated.
In a specific embodiment, the nucleotide sequence of the first promoter is selected from nucleotide sequences specific for gramineae plants.
In a specific embodiment, the gramineae is selected from at least one of rice, wheat, corn and sorghum.
In a specific embodiment, the first promoter is an RNA polymerase type III promoter.
In a specific embodiment, the nucleotide sequence of the first promoter is the nucleotide sequence shown as SEQ ID No. 1.
In a specific embodiment, the nucleotide sequence of the termination signal is the nucleotide sequence shown as SEQ ID No. 2.
In a specific embodiment, the nucleotide sequence of the gRNA scaffold element is set forth in SEQ ID No. 3.
In a specific embodiment, the gene expression cassette comprising a Cas9 protein expression cassette includes, from 5 'end to 3' end, a second promoter, a gene encoding a Cas9 protein, and a first terminator.
In a specific embodiment, the Cas9 protein may be a Cas9 protein conventional in the art, e.g., its amino acid sequence may be the amino acid sequence shown as SEQ ID No. 4.
In a specific embodiment, the nucleotide sequence of the gene encoding Cas9 protein is the nucleotide sequence shown as SEQ ID No. 5.
In a specific embodiment, the second promoter is an RNA polymerase II type promoter.
In a specific embodiment, the nucleotide sequence of the second promoter is as shown in SEQ ID No. 6.
In a specific embodiment, the nucleotide sequence of the first terminator is the 8 th to 260 th nucleotide sequences in genebank accession number FJ 362600.1.
In one embodiment, the selection gene expression cassette comprises a third promoter from 5 'to 3', a selection gene, and a second terminator.
In a specific embodiment, the third promoter is an RNA polymerase type II promoter.
In a specific embodiment, the nucleotide sequence of the third promoter is the 10382 th to 11162 th nucleotide sequences in genebank accession number FJ 362600.1.
In a specific embodiment, the nucleotide sequence of the second terminator is the 8 th to 260 th nucleotide sequences in genebank accession number FJ 362600.1.
In a specific embodiment, the reporter expression cassette comprises a fourth promoter from 5 'to 3', a reporter gene, and a third terminator.
In a specific embodiment, the fourth promoter is an RNA polymerase II type promoter.
In a specific embodiment, the nucleotide sequence of the fourth promoter is the 10382 th to 11162 th nucleotide sequences in genebank accession number FJ 362600.1.
In a specific embodiment, the nucleotide sequence of the third terminator is the 8 th to 260 th nucleotide sequences in genebank accession number FJ 362600.1.
In a specific embodiment, the suicide gene is the ccdB gene; preferably, the nucleotide sequence comprising the suicide gene sequence is shown in SEQ ID No. 7.
In a specific embodiment, the selection gene expression cassette further comprises a second selection gene expression cassette for use in bacteria.
In a specific embodiment, the second selection gene expression cassette is selected from at least one of a kanamycin resistance gene expression cassette, a penicillin resistance gene expression cassette, a tetracycline resistance gene expression cassette, a streptomycin resistance gene expression cassette.
In one embodiment, the amino acid sequence encoded by the resistance gene or the nucleotide sequence of the resistance gene can be easily obtained by those skilled in the art, for example, the kanamycin resistance gene sequence is the nucleotide sequence from 9156 to 9950 in genebank accession No. KX400856.1, and the encoded amino acid sequence is the amino acid sequence of genebank accession No. ASN 63838.1.
In a specific embodiment, the second selection gene expression cassette comprises, from 5 'to 3', a fifth promoter, a second selection gene and a fourth terminator.
In one embodiment, it is preferred that the fifth promoter is a promoter that can be used in bacteria.
In a specific embodiment, the nucleotide sequence of the fourth terminator is preferably the 8 th to 260 th nucleotide sequences in genebank accession number FJ 362600.1.
In one embodiment, the plasmid vector further comprises an origin of replication, which may be used in bacteria.
In a specific embodiment, the nucleotide sequence of the origin of replication has the sequence 4066 to 5066 nucleotides in KY 420084.1.
The second application provides a method for establishing a plant mutant population, which is obtained by targeting at least one plant endogenous gene, comprising the following steps:
the method comprises the following steps: obtaining at least one I element which comprises an I-1 nucleotide sequence, an I-2 nucleotide sequence and an I-3 nucleotide sequence from 5 'end to 3' end in sequence;
wherein the I-1 nucleotide sequence is identical to a sequence which is 20bp or more upstream of the 5' end from the first enzyme cutting site of the plasmid vector linearized by the first enzyme cutting site and the second enzyme cutting site; the I-2 nucleotide sequence is a target nucleotide sequence which is consistent with a part of nucleotide sequences on the plant endogenous gene; the I-3 nucleotide sequence is consistent with a sequence which is 20bp above the downstream of the 3' end from the second enzyme cutting site of the plasmid vector after being linearized by the first enzyme cutting site and the second enzyme cutting site;
step two: exchanging said I element by homologous recombination respectively onto said plasmid vector as defined in any of the present applications, resulting in a targeting vector in which said I element is located between said first promoter and a termination signal and said I-2 nucleotide sequence and said gRNA scaffold are transcriptionally fused;
step three: introducing the targeting vector into plant callus or plant protoplast, and culturing to obtain plant;
step four: screening the plant plants to obtain transgenic plants containing targeted plant endogenous genes; further, the transgenic plant containing the targeted plant endogenous gene is capable of producing transgenic plant seed containing the targeted plant endogenous gene.
In a specific embodiment, said I-1 nucleotide sequence is identical to a sequence 50-80bp upstream from the 5' end from said first cleavage site of said plasmid vector linearized with said first cleavage site and said second cleavage site.
In a specific embodiment, said nucleotide sequence I-3 is identical to a sequence 50-80bp downstream from the 3' end from said second cleavage site of said plasmid vector after being linearized by said first cleavage site and said second cleavage site.
In the present application, the principle of determining the target nucleotide sequence is the same as that of the conventional art. For example:
in a specific embodiment, the target nucleotide sequence is determined by:
1) determining at least one plant endogenous gene on a plant genome that is to be targeted;
2) searching the coding sequence of the plant endogenous gene or the reverse complementary sequence thereof for a PAM (Polyacrylamide) module sequence capable of being recognized by the Cas9 protein, and determining the nucleotide sequences 17 to 21 upstream of the 5 'end of the PAM module sequence as the target nucleotide sequence under the condition of ensuring that the nucleotide sequences 17 to 21 upstream of the 5' end of the PAM module sequence are specific sequences in the genome (namely, the sequences have high specificity in the genome and cannot be sequences with high sequence consistency in the genome).
In a specific embodiment, when the Cas9 protein is the amino acid sequence shown in SEQ ID No.4, the recognition PAM module is one of 5 ' -NGG-3 ', 5 ' -NGA-3 ', 5 ' -gann-3 ', 5 ' -AAGN-3 ', the target nucleotide sequence is 17 to 21 nucleotide sequences upstream of the 5 ' end of the PAM module, and nucleotide sequences containing five consecutive ts are eliminated; wherein N is one of A, G, C and T.
In one embodiment, at least one of the I-th elements is obtained by:
I) obtaining at least one oligonucleotide sequence comprising a nucleotide sequence II-1, a nucleotide sequence II-2, and a nucleotide sequence II-3 in that order from a 5 'end to a 3' end; wherein the II-1 nucleotide sequence is identical to a sequence which is 20bp or more upstream of the 5' end from the first enzyme cutting site of the plasmid vector linearized by the first enzyme cutting site and the second enzyme cutting site; the II-2 nucleotide sequence comprises a target nucleotide sequence identical to a partial nucleotide sequence on the endogenous gene of the plant; the II-3 nucleotide sequence is consistent with a sequence which is 20bp above the downstream of the 3' end from the second enzyme cutting site of the plasmid vector after being linearized by the first enzyme cutting site and the second enzyme cutting site;
II) obtaining an upstream primer and a downstream primer; wherein the upstream primer is consistent with a sequence which is more than 20bp from the first enzyme cutting site of the plasmid vector linearized by the first enzyme cutting site and the second enzyme cutting site to the upstream of the 5' end; the downstream primer is reversely complementary with a sequence which is more than 20bp from the second enzyme cutting site of the plasmid vector linearized by the first enzyme cutting site and the second enzyme cutting site to the downstream of the 3' end;
III) carrying out PCR amplification by taking the oligonucleotide sequence in the step I) as a template and the upstream primer and the downstream primer in the step II) as a primer pair, thereby obtaining the I element.
In a specific embodiment, said II-1 nucleotide sequence is identical to a sequence 20-35bp upstream from the 5' end of said plasmid vector from said first cleavage site after being linearized by said first cleavage site and said second cleavage site.
In a specific embodiment, said II-3 nucleotide sequence is identical to a sequence 20-35bp downstream from the 3' end from said second cleavage site of said plasmid vector after being linearized by said first cleavage site and said second cleavage site.
In a specific embodiment, the upstream primer is identical to a sequence 50-80bp upstream from the 5' end of the plasmid vector from the first cleavage site linearized by the first cleavage site and the second cleavage site.
In a specific embodiment, the downstream primer is reverse complementary to a sequence 50-80bp downstream from the 3' end from the second cleavage site of the plasmid vector after being linearized by the first cleavage site and the second cleavage site.
In one embodiment, when the plurality of oligonucleotide sequences of step I) is multiple, PCR amplification is performed in step III) using a mixture of the plurality of oligonucleotide sequences as a template, thereby obtaining a mixed plurality of the I element.
In one embodiment, when the I element is a plurality, the I-2 nucleotide sequences in the plurality are different in pairs.
In one embodiment, when the I element is a plurality of I elements, the I-1 nucleotide sequences in the plurality of I elements may be the same or different, but for the sake of simplicity of operation, it is preferred that the I-1 nucleotide sequences are the same.
In one embodiment, when the I element is a plurality of I elements, the I-3 nucleotide sequences in the plurality of I elements may be the same or different, but for the sake of simplicity of operation, it is preferred that the I-3 nucleotide sequences are the same.
In one embodiment, the I element is exchanged with the plasmid vector into a Cas9 vector by conventional molecular biology procedures, preferably Gibson assembly (Gibson assembly).
In one embodiment, the target nucleotide sequence may be a plurality of target nucleotide sequences, or, as it were, the I-th element may be a plurality of I-th elements. Thus, a plurality of I elements obtained by, for example, PCR, are mixed with the plasmid vector of any one of the present applications, and the plurality of I elements are ligated together with the plasmid vector by Gibbson assembly to obtain a targeting vector library. When the targeting vector is subjected to small-scale or large-scale plant transformation through conventional genetic manipulation, a mutant population of plants can be created.
In the present application, the nucleotide sequence is upstream of the portion near the 5 ' end and downstream of the portion near the 3 ' end, wherein the upstream and downstream are relative positions in the nucleotide sequence, and the nucleotide sequence located upstream is closer to the 5 ' end of the sequence. The beneficial effect of this application lies in:
the plasmid vector provided by the application greatly improves the positive rate of the obtained mutant population to 100% at most, and reduces the probability of false positive plants in the plant mutant population; meanwhile, the plasmid vector carries a green fluorescent protein gene expression cassette, and by means of the visualization of fluorescent protein, the single plant in the CRISPR mutant population can be tracked for the later generation, the T-DNA separation condition in the later generation plant and the like very conveniently, so that the field workload is greatly reduced.
Drawings
FIG. 1 shows a schematic flow chart of the construction of the vector pHZLib2 in example 1.
FIG. 2 shows typical mutation types of OsMPK1 and OsMPK3 mutant plants in OsMPK mutant population constructed using pHZLib 2.
FIG. 3 is a graph showing the results of the expression of GFP in rice resistance-cured rice after transformation of rice with plasmid pHZLIb2-OsMPKs, which was observed by confocal laser microscopy.
FIG. 4 is a graph showing the result of GFP expression in the roots of transgenic plants of the T0 generation after transformation of rice with plasmid pHZLIb2-OsMPKs in a stereoscopic fluorescence microscope.
Detailed Description
The present invention will be described in detail below with reference to examples and the accompanying drawings. These examples are illustrative only and do not limit the scope of the present invention in any way. It will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention, and that such changes and modifications may be made without departing from the spirit and scope of the invention.
Example 1: construction of recombinant plasmid
The technical route for constructing the vector is as follows:
1. construction of pUbi-ccdB-Cas9 recombinant plasmid
Hind III digestion was carried out on the self-contained binary vector of this laboratory, pH3, and the 8.8kb vector backbone was recovered and self-ligated using T4DNA Ligase (Takara), and then named pH 4. The elements included in pH4 are: CaMV35S promoter, hygromycin gene, NOS terminator, pVS1RepA, pVS1 origin of replication.A large 5.5kb fragment was recovered by double digestion at pH4 with BsaI and Hind III, which included the major elements: CaMV35S promoter, hygromycin gene and NOS terminator. Meanwhile, the pH4 is taken as a template, H3-F (SEQ ID No.8, a restriction enzyme BsaI restriction site is introduced) and H3-R (SEQ ID No.9) are taken as primers, and high fidelity enzyme I-5 is utilizedTMA small 3.7kb fragment (comprising the elements pVS1RepA and pVS1 origin of replication) was amplified from the pH4 plasmid at 2 × High-Fidelity Master Mix (available from Biotechnology Ltd., Beijing) and ligated to the above 5.5kb large fragment and named pH5. HindIII cleavage at pH5 was performed to recover the linearized vector backbone, comprising the elements: CaMV35S promoter, hygromycin gene, NOS terminator, pVS1RepA, pVS1 origin of replication.
BsmBI enzyme digestion is carried out on an own vector pGEMT of the laboratory, attR1R2, a 1kb fragment (comprising attR1, ccdB gene sequence and attR2) is recovered by gel casting and is connected with the linearized pH5 vector framework, sequencing is carried out to identify that the attR1R2 is inserted in the positive direction (namely the expression direction of GFP is consistent), the name is pH5-ccdB, and the expression direction of the ccdB gene is consistent with the expression direction of a promoter on the vector. The laboratory vector pGEMT, Ubip and pH5-ccdB were digested simultaneously with Hind III and SacI, respectively, and after ligation, the plasmid was named pH 5-ccdB-Ubip. The plasmid of pH5-ccdB-Ubip and the plasmid pUC57: Cas9 with artificially synthesized Cas9 fragment (SEQ ID No.5, the amino acid sequence of which is shown in SEQ ID No. 4) are subjected to BamHI and SpeI double enzyme digestion respectively, a 12kb vector framework and a 4.2kb Cas9 gene fragment are recovered and are connected, and the plasmid is named as pUbi-ccdB-Cas 9. The plasmid pUbi-ccdB-Cas9 was constructed as follows: CaMV35S promoter, hygromycin gene, NOS terminator, pVS1RepA, pVS1 origin of replication, attR1, ccdB gene sequence, attR2, Ubip promoter, Cas9 gene, NOS terminator.
2. pENTR4-gRNA recombinant plasmid construction
A792 bp GFP gene fragment (Biotechnology engineering, Shanghai, Inc.) was artificially synthesized and cloned into pUC57, and named pUC57: GFP. Plasmid pUC57, GFP and vector pUC19-3Flag were digested with BamHI and SpeI, and 792bp GFP gene fragment and 3.9kb pUC19-3Flag vector backbone (including the major elements: CaMV35S promoter) were recovered separately3Flag gene, NOS terminator, etc.), the GFP gene fragment was ligated to pUC19-3Flag vector using T4DNA ligase (Takara), named pUC19, GFP, and was used for future use after colony PCR and enzyme digestion verification. 35S-F3(SEQ ID No.10, restriction enzyme XhoI cleavage site is introduced) and NOS-R (SEQ ID No.11, restriction enzyme XhoI cleavage site is introduced) are used as primers, and I-5 is usedTM2 XHigh-Fidelity Master Mix was subjected to PCR amplification using vector PUC19: GFP as a template to obtain a target fragment of 1.8kb in size (i.e., GFP expression cassette: 35s-GFP-Nos, abbreviated as GFP-cassette, wherein 35s represents 35s promoter, GFP represents GFP gene, and Nos represents Nos terminator), digested with XhoI, and recovered. Meanwhile, XhoI digestion is carried out on pENTR4-gRNA which is a laboratory self-contained vector, a linearized fragment of 2.75kb (the element is U6 promoter, gRNA scaffold) is recovered, then dephosphorylation treatment is carried out on the linearized pENTR4-gRNA by using FastAP Thermosensive Alkaline Phosphatase (Thermo Scientific) to prevent self-ligation, and then the GFP fragment of 1.8kb is connected to the linearized pENTR4-gRNA fragment which is subjected to the dephosphorylation treatment by using T4DNA Ligase (Takara) to be named as pENTR4-gRNA, and sequencing is carried out for standby after colony PCR and enzyme digestion verification. The plasmid pENTR4-gRNA-GFP comprises the following components: u6 promoter, gRNA scaffold, CaMV35S promoter, GFP gene, NOS terminator.
3. Construction of pHZLib2 vector
The method comprises the steps of digesting pENTR4-gRNA-GFP with NheI, recovering a fragment of about 4kb (the elements are U6 promoter, gRNA scaffold, CaMV35S promoter, GFP gene and NOS terminator), exchanging a gRNA expression cassette (including a first promoter U6 promoter (SEQ ID No.1) + a termination signal (SEQ ID No.2) in pENTR4-gRNA3-GFP by gateway technology, wherein the gRNA expression cassette does not contain gRNA) and a GFP expression cassette into pUbi-ccdB-Cas9, and naming as pUbi-gRNA-GFP-Cas9 (the elements include CaMV35S promoter, hygromycin gene, NOS terminator, pVS1RepA, pVS1 replication origin, 685B 2, U6 promoter, gRNA scaffno, termination signal (SEQ ID No.2), CaMV35 promoter, NOS terminator, pVS1RepA, pVS1 replication origin, 685B 2, U6 promoter, and the DNA expression cassette is digested with correct sequence of the promoter, and the sequence is verified by digestion of the pUATTsRNA promoter, the CATV 35 promoter, the CATV terminator, the cDNA terminator, the CATV 9, the SEQ ID promoter, and the SEQ ID NO is verified by digestion of the DNA. ccdB-F1(SEQ ID No.12) and ccdB-R1(SEQ ID No.13) are used as referencesSubstance, utilizing I-5TM2 xHigh-Fidelity Master Mix takes a vector pGEMT-attR1R2 as a template to carry out PCR amplification, and a target fragment (containing ccdB gene) with the size of about 800bp is obtained; digesting the plasmid pUbi-gRNA-GFP-Cas9 by BsaI enzyme, recovering a linearization band of about 10kb, and then utilizing TreidefTMThe SoSoSoSoo Cloning Kit (TsingKe) inserts the ccdB gene into pUbi-gRNA3-GFP-rCas9 to obtain a final vector pHZLib2, and the final vector is subjected to colony PCR and enzyme digestion verification for later use. The plasmid pHZLib2 is composed of the following components: CaMV35S promoter (genebank accession number FJ362600.1, nucleotide sequence 10382 to 11162), hygromycin gene (genebank accession number KY420085.1), NOS terminator (genebank accession number FJ362600.1, nucleotide sequence 8 to 260), pVS1RepA (genebank accession number KY420084.1, nucleotide sequence 5755 to 6435), pVS1 origin of replication (genebank accession number KY420084.1, nucleotide sequence 4066 to 5066), attB1(SEQ ID 6314), U6 promoter (SEQ ID No.1), nucleic acid containing ccdB gene (SEQ ID No.7), gRNA scaffold (SEQ ID No.3), termination signal (SEQ ID No.2), CaMV35 promoter (genebank accession number FJ362600.1, nucleotide sequence 10382 to 11162), gene (SEQ ID No. 464890.1), nucleotide sequence FVbank accession number GFP 98), nucleotide sequence Uebk accession number GFP promoter (SEQ ID No.15) and nucleotide sequence 464890.1, NOS terminator (genebank accession number FJ362600.1, nucleotide sequences 8 to 260), Kan resistance gene expression cassette (genebank accession number KX400856.1, wherein the kana resistance gene is located at nucleotide sequences 9156 to 9950).
Example 2: plasmid library for constructing rice endogenous MPK gene family by using pHZLib2 system and rice transformation
1. Construction of plasmid library of pHZLib2-OsMPKs
Synthetic oligonucleotide sequence OsMPK1-oligo-1(SEQ ID No. 16: CCCGCGCGCTGTCGCTTGTGTG)TC ATCCAGTACAACATCTTGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAG, the target nucleotide sequence of OsMPK1 is underlined), OsMPK2-oligo-1(SEQ ID No. 17: CCCGCGCGCTGTCGCTTGTGTGATGGCCATCACGGTGGCATGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAG, the target nucleotide sequence of OsMPK2 is underlined),OsMPK3-oligo-1(SEQ ID No.18:CCCGCGCGCTGTCGCTTGTGTGAAGTATTACTACTCGATGGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAG, the target nucleotide sequence of OsMPK3 is underlined), OsMPK4-oligo-1(SEQ ID No. 19: CCCGCGCGCTGTCGCTTGTGTGCTAATGGCATGGGAAACCAGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAG, the target nucleotide sequence of OsMPK4 is underlined), OsMPK5-oligo-1(SEQ ID No. 20: CCCGCGCGCTGTCGCTTGTGTGTCAGGCC GACGATGACGCAGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAG, the target nucleotide sequence of OsMPK5 is underlined), OsMPK6-oligo-1(SEQ ID No. 21: CCCGCGCGCTGTCGCTT GTGTGTGTACGGGAACTTCTTCGGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAG, the target nucleotide sequence of OsMPK6 is underlined) and OsMPK12-oligo-1(SEQ ID No. 22: CCCGCGCGCTGTCGCTTGTGTGCCAACCAGTCGTCC AACGCGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAG underlined as the target nucleotide sequence of OsMPK 12), mixed at equal volume and equal concentration with the above seven oligonucleotides as templates, Array-F1(SEQ ID No. 23: CACATTGCCCAGCTAACTCGAACGCGACCAACTTATAAACCCGCGCGCTGTCGCTTGTGT) and Array-R1(SEQ ID No. 24: ACTCGGTG CCACTTTTTCAAGTTGATAACGGACTAGCCTTATTTTAACTTGCTATTTCTAGCTCTAAAAC) as primers, using I-5TMThe above 7 oligonucleotides were mixed and amplified at 2 Xhigh-Fidelity Master Mix, and the PCR product was recovered at approximately 140 bp. At the same time, the vector pHZLib2 was digested with BsaI, and the vector backbone of about 10kb (releasing the ccdB gene fragment) was recovered. Reuse of TreliefTMThe gRNA of the ccdB gene was replaced with the gRNA of the MPK gene by SoSoSoo Cloning Kit (TsingKe), and the replacement was named pHZLib 2-OsMPKs. Randomly picked 10 colonies were sequenced with up to 100% accuracy and contained all 7 grnas.
2. pHZLib2-OsMPKs plasmid library transformed japonica rice Kitaake
1) Rice callus induction:
treating the dehulled mature rice seeds with 50% commercial sterilizing solution for 25 minutes; cleaning with sterile water for 3-5 times, transferring the seeds to a sterile culture dish, and sucking out excessive water; placing the seeds on MSD plate (4.43g/L MS powder; 30g/L sucrose; 2 ml/L2, 4-D; 8g/L plant gel; pH5.7), culturing in light culture room for 10 days, inducing callus formation; embryos and shoots of the seeds were removed and the calli were transferred to a new MSD petri dish and cultured for 5 days until they could be used for agrobacterium transformation.
2) And (3) agrobacterium transformation:
the pHZLib2-OsMPKs plasmid library is transferred into an agrobacterium strain EHA105 by an electric shock method to construct the EHA105 strain containing the pHZLib2-OsMPKs plasmid library. Activating the EHA105 strain, and culturing in TY medium overnight at room temperature for 12 hours; the Agrobacterium was collected by centrifugation and resuspended in MSD solution to OD600Stand up to 0.2 for use.
3) Agrobacterium infection of rice callus:
placing the callus in the agrobacterium suspension for 30 minutes; removing the agrobacterium suspension, transferring the callus onto sterile absorbent paper to remove redundant agrobacterium liquid, transferring the callus onto a new MSD culture medium containing 100 mu M acetosyringone, and culturing at room temperature in a dark place for 2-3 days.
4) Rice resistance callus screening:
transferring the dark cultured callus onto MSD culture medium (100mg/L timentin; 50mg/L hygromycin B) for 2 weeks to 2 months until the surface of the callus shows resistant callus; the medium was changed every 2 weeks.
5) Resistant callus differentiation and rooting
Transferring the resistant callus to a regeneration culture medium (4.43g/L MS powder, 30g/L sucrose, 25g/L sorbitol, 0.5mg/L NAA, 3mg/L BA, 100mg/L timentin, 50mg/L hygromycin B, 12g/L agar powder, pH5.7) until the resistant callus grows into a plant seedling, and transferring the resistant callus once every 7-10 days; the seedlings were transferred to 1/2MS medium (2.21g/L MS powder; 15g/L sucrose; 8g/L plant gel; pH5.7) for rooting.
Example 3: detection of OsMPKs rice mutant population
1) Extraction of genomic DNA
Cutting about 0.1g of T0 plant leaves, quickly freezing by liquid nitrogen, and grinding by using a grinder; adding 600 μ l 2 × CTAB DNA extract (containing 1/1000 β -mercaptoethanol), shaking, mixing, and splitting at 65 deg.C for 45 min. Add 500. mu.l chloroform and shake vigorously to form an emulsion, centrifuge at 14000rpm for 10 min. After centrifugation, the supernatant was transferred to a 1.5ml centrifuge tubeAdding isopropanol with the same volume, reversing, mixing uniformly, centrifuging at 14000rpm for 10min, removing supernatant, adding 700 μ l 70% ethanol to wash white precipitate, centrifuging at 14000rpm for 5min, removing supernatant, and air drying in a fume hood for 10 min. Add 30. mu.l ddH2O dissolves the DNA. The DNA solution was stored at-20 ℃ for further use.
2) PCR amplification and sequencing detection of mutation sites
The detection sequencing of the gRNA of 109 randomly selected T0 plant seedlings was carried out by using a universal primer pair U6P-F3(SEQ ID No.25) and G4-R1(SEQ ID No.26), and the results showed that 16 transgenic plants each including OsMPK1, OsMPK4 and OsMPK5, 17 transgenic plants including OsMPK2, 15 transgenic plants including OsMPK3 and OsMPK12 and 14 transgenic plants including OsMPK 6. Designing a specific PCR primer according to the target site of the OsMPKs gene: OsMPK1-F1(SEQ ID No.27) and OsMPK1-R1(SEQ ID No. 28); OsMPK2-F1(SEQ ID No.29) and OsMPK2-R1(SEQ ID No. 30); OsMPK3-F1(SEQ ID No.31) and OsMPK3-R1(SEQ ID No. 32); OsMPK4-F1(SEQ ID No.33) and OsMPK4-R1(SEQ ID No. 34); OsMPK5-F1(SEQ ID No.35) and OsMPK5-R1(SEQ ID No. 36); OsMPK6-F1(SEQ ID No.37) and OsMPK6-R1(SEQ ID No. 38); OsMPK12-F1(SEQ ID No.39) and OsMPK12-R1(SEQ ID No. 40); by using I-5TM2 × High-Fidelity Master Mix, using the above-mentioned primers as template, respectively PCR-amplifying to obtain OsMPK1, OsMPK2, OsMPK3, OsMPK4, OsMPK5, OsMPK6 and OsMPK12 gene fragments, whose sizes are 655bp, 438bp, 399bp, 358bp, 468bp, 530bp and 466bp in turn. The PCR products were directly Sanger sequenced. Sanger sequencing results showed: 13 strains of each of the OsMPK1, OsMPK3 and OsMPK5 mutants, 14 strains of each of the OsMPK2 and OsMPK4 mutants, 12 strains of the OsMPK6 mutant and 15 strains of the OsMPK12 mutant have mutation efficiency of 81.25-100 percent and can generate various types of mutations. For example, 13 mutant plants and a #1 plant heterozygote with the mutation type of +6/-6 are detected by the gene OsMPK1, which indicates that the OsMPK1 double alleles of diploid rice are mutated and the mutation types are different; the #2 plant contains a wild-type gene, and the single gene is mutated into-1 + 2. For example, 13 mutant plants are detected together with the OsMPK3 gene, and the mutation type of the No.1 plant is + A, so that a double allele mutant homozygous material is obtained; #2 plantThe mutation type was + T/+ C, and a heterozygote with biallelic mutation was also obtained (FIG. 2).
The mutation rate of T0 transgenic plants constructed by pHZLib2-OsMPKs plasmid library is up to more than 80%, and the system has high mutation efficiency.
TABLE 2 efficiency of construction of OsMPKs mutant populations Using the cosmid library pHZLib2-OsMPKs
Example 4: green fluorescent protein GFP visualization of transgenic resistant calli
Transgenic resistant callus and wild Kitaake callus are respectively picked and observed under a handheld UV lamp, and meanwhile, the callus is made into a temporary slide glass to be observed under a laser confocal microscope. The results show that the cells of rice callus showed green color under the condition of GFP excitation light (FIG. 3, left), while the cells showed no green fluorescent protein expression without GFP excitation light (FIG. 3), and the right side of FIG. 3 shows the additive effect of the above two, from which the obvious green fluorescence was observed in the case of transgenic resistance healing (FIG. 3, right). Meanwhile, the roots of the rice T0 generation plants are observed under the ultraviolet excitation of a body type fluorescence microscope, and obvious green fluorescence can be observed at the roots of the transgenic plants (figure 4). The GFP in the system can be expressed normally in rice cells, and green fluorescence is shown, so that the system can be used for tracking transgenic plants of progeny segregation populations.
Sequence listing
<110> institute of plant protection of Chinese academy of agricultural sciences
<120> a plasmid vector and a method for establishing a plant population using the same
<130> LHA1760716
<160> 40
<170> SIPOSequenceListing 1.0
<210> 1
<211> 245
<212> DNA
<213> Artificial sequence (non)
<400> 1
ggatcatgaa ccaacggcct ggctgtattt ggtggttgtg tagggagatg gggagaagaa 60
aagcccgatt ctcttcgctg tgatgggctg gatgcatgcg ggggagcggg aggcccaagt 120
acgtgcacgg tgagcggccc acagggcgag tgtgagcgcg agaggcggga ggaacagttt 180
agtaccacat tgcccagcta actcgaacgc gaccaactta taaacccgcg cgctgtcgct 240
tgtgt 245
<210> 2
<211> 7
<212> DNA
<213> Artificial sequence (non)
<400> 2
ttttttt 7
<210> 3
<211> 76
<212> DNA
<213> Artificial sequence (non)
<400> 3
gttttagagc tagaaatagc aagttaaaat aaggctagtc cgttatcaac ttgaaaaagt 60
ggcaccgagt cggtgc 76
<210> 4
<211> 1417
<212> PRT
<213> Artificial sequence (non)
<400> 4
Met Asp Tyr Lys Asp His Asp Gly Asp Tyr Lys Asp His Asp Ile Asp
1 5 10 15
Tyr Lys Asp Asp Asp Asp Lys Met Ala Pro Lys Lys Lys Arg Lys Val
20 25 30
Gly Ile His Gly Val Pro Ala Ala Asp Lys Lys Tyr Ser Ile Gly Leu
35 40 45
Asp Ile Gly Thr Asn Ser Val Gly Trp Ala Val Ile Thr Asp Glu Tyr
50 55 60
Lys Val Pro Ser Lys Lys Phe Lys Val Leu Gly Asn Thr Asp Arg His
65 70 75 80
Ser Ile Lys Lys Asn Leu Ile Gly Ala Leu Leu Phe Asp Ser Gly Glu
85 90 95
Thr Ala Glu Ala Thr Arg Leu Lys Arg Thr Ala Arg Arg Arg Tyr Thr
100 105 110
Arg Arg Lys Asn Arg Ile Cys Tyr Leu Gln Glu Ile Phe Ser Asn Glu
115 120 125
Met Ala Lys Val Asp Asp Ser Phe Phe His Arg Leu Glu Glu Ser Phe
130 135 140
Leu Val Glu Glu Asp Lys Lys His Glu Arg His Pro Ile Phe Gly Asn
145 150 155 160
Ile Val Asp Glu Val Ala Tyr His Glu Lys Tyr Pro Thr Ile Tyr His
165 170 175
Leu Arg Lys Lys Leu Val Asp Ser Thr Asp Lys Ala Asp Leu Arg Leu
180 185 190
Ile Tyr Leu Ala Leu Ala His Met Ile Lys Phe Arg Gly His Phe Leu
195 200 205
Ile Glu Gly Asp Leu Asn Pro Asp Asn Ser Asp Val Asp Lys Leu Phe
210 215 220
Ile Gln Leu Val Gln Thr Tyr Asn Gln Leu Phe Glu Glu Asn Pro Ile
225 230 235 240
Asn Ala Ser Gly Val Asp Ala Lys Ala Ile Leu Ser Ala Arg Leu Ser
245 250 255
Lys Ser Arg Arg Leu Glu Asn Leu Ile Ala Gln Leu Pro Gly Glu Lys
260 265 270
Lys Asn Gly Leu Phe Gly Asn Leu Ile Ala Leu Ser Leu Gly Leu Thr
275 280 285
Pro Asn Phe Lys Ser Asn Phe Asp Leu Ala Glu Asp Ala Lys Leu Gln
290 295 300
Leu Ser Lys Asp Thr Tyr Asp Asp Asp Leu Asp Asn Leu Leu Ala Gln
305 310 315 320
Ile Gly Asp Gln Tyr Ala Asp Leu Phe Leu Ala Ala Lys Asn Leu Ser
325 330 335
Asp Ala Ile Leu Leu Ser Asp Ile Leu Arg Val Asn Thr Glu Ile Thr
340 345 350
Lys Ala Pro Leu Ser Ala Ser Met Ile Lys Arg Tyr Asp Glu His His
355 360 365
Gln Asp Leu Thr Leu Leu Lys Ala Leu Val Arg Gln Gln Leu Pro Glu
370 375 380
Lys Tyr Lys Glu Ile Phe Phe Asp Gln Ser Lys Asn Gly Tyr Ala Gly
385 390 395 400
Tyr Ile Asp Gly Gly Ala Ser Gln Glu Glu Phe Tyr Lys Phe Ile Lys
405 410 415
Pro Ile Leu Glu Lys Met Asp Gly Thr Glu Glu Leu Leu Val Lys Leu
420 425 430
Asn Arg Glu Asp Leu Leu Arg Lys Gln Arg Thr Phe Asp Asn Gly Ser
435 440 445
Ile Pro His Gln Ile His Leu Gly Glu Leu His Ala Ile Leu Arg Arg
450 455 460
Gln Glu Asp Phe Tyr Pro Phe Leu Lys Asp Asn Arg Glu Lys Ile Glu
465 470 475 480
Lys Ile Leu Thr Phe Arg Ile Pro Tyr Tyr Val Gly Pro Leu Ala Arg
485 490 495
Gly Asn Ser Arg Phe Ala Trp Met Thr Arg Lys Ser Glu Glu Thr Ile
500 505 510
Thr Pro Trp Asn Phe Glu Glu Val Val Asp Lys Gly Ala Ser Ala Gln
515 520 525
Ser Phe Ile Glu Arg Met Thr Asn Phe Asp Lys Asn Leu Pro Asn Glu
530 535 540
Lys Val Leu Pro Lys His Ser Leu Leu Tyr Glu Tyr Phe Thr Val Tyr
545 550 555 560
Asn Glu Leu Thr Lys Val Lys Tyr Val Thr Glu Gly Met Arg Lys Pro
565 570 575
Ala Phe Leu Ser Gly Glu Gln Lys Lys Ala Ile Val Asp Leu Leu Phe
580 585 590
Lys Thr Asn Arg Lys Val Thr Val Lys Gln Leu Lys Glu Asp Tyr Phe
595 600 605
Lys Lys Ile Glu Cys Phe Asp Ser Val Glu Ile Ser Gly Val Glu Asp
610 615 620
Arg Phe Asn Ala Ser Leu Gly Thr Tyr His Asp Leu Leu Lys Ile Ile
625 630 635 640
Lys Asp Lys Asp Phe Leu Asp Asn Glu Glu Asn Glu Asp Ile Leu Glu
645 650 655
Asp Ile Val Leu Thr Leu Thr Leu Phe Glu Asp Arg Glu Met Ile Glu
660 665 670
Glu Arg Leu Lys Thr Tyr Ala His Leu Phe Asp Asp Lys Val Met Lys
675 680 685
Gln Leu Lys Arg Arg Arg Tyr Thr Gly Trp Gly Arg Leu Ser Arg Lys
690 695 700
Leu Ile Asn Gly Ile Arg Asp Lys Gln Ser Gly Lys Thr Ile Leu Asp
705 710 715 720
Phe Leu Lys Ser Asp Gly Phe Ala Asn Arg Asn Phe Met Gln Leu Ile
725 730 735
His Asp Asp Ser Leu Thr Phe Lys Glu Asp Ile Gln Lys Ala Gln Val
740 745 750
Ser Gly Gln Gly Asp Ser Leu His Glu His Ile Ala Asn Leu Ala Gly
755 760 765
Ser Pro Ala Ile Lys Lys Gly Ile Leu Gln Thr Val Lys Val Val Asp
770 775 780
Glu Leu Val Lys Val Met Gly Arg His Lys Pro Glu Asn Ile Val Ile
785 790 795 800
Glu Met Ala Arg Glu Asn Gln Thr Thr Gln Lys Gly Gln Lys Asn Ser
805 810 815
Arg Glu Arg Met Lys Arg Ile Glu Glu Gly Ile Lys Glu Leu Gly Ser
820 825 830
Gln Ile Leu Lys Glu His Pro Val Glu Asn Thr Gln Leu Gln Asn Glu
835 840 845
Lys Leu Tyr Leu Tyr Tyr Leu Gln Asn Gly Arg Asp Met Tyr Val Asp
850 855 860
Gln Glu Leu Asp Ile Asn Arg Leu Ser Asp Tyr Asp Val Asp His Ile
865 870 875 880
Val Pro Gln Ser Phe Leu Lys Asp Asp Ser Ile Asp Asn Lys Val Leu
885 890 895
Thr Arg Ser Asp Lys Asn Arg Gly Lys Ser Asp Asn Val Pro Ser Glu
900 905 910
Glu Val Val Lys Lys Met Lys Asn Tyr Trp Arg Gln Leu Leu Asn Ala
915 920 925
Lys Leu Ile Thr Gln Arg Lys Phe Asp Asn Leu Thr Lys Ala Glu Arg
930 935 940
Gly Gly Leu Ser Glu Leu Asp Lys Ala Gly Phe Ile Lys Arg Gln Leu
945 950 955 960
Val Glu Thr Arg Gln Ile Thr Lys His Val Ala Gln Ile Leu Asp Ser
965 970 975
Arg Met Asn Thr Lys Tyr Asp Glu Asn Asp Lys Leu Ile Arg Glu Val
980 985 990
Lys Val Ile Thr Leu Lys Ser Lys Leu Val Ser Asp Phe Arg Lys Asp
995 1000 1005
Phe Gln Phe Tyr Lys Val Arg Glu Ile Asn Asn Tyr His His Ala His
1010 1015 1020
Asp Ala Tyr Leu Asn Ala Val Val Gly Thr Ala Leu Ile Lys Lys Tyr
1025 1030 1035 1040
Pro Lys Leu Glu Ser Glu Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp
1045 1050 1055
Val Arg Lys Met Ile Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr
1060 1065 1070
Ala Lys Tyr Phe Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu
1075 1080 1085
Ile Thr Leu Ala Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr
1090 1095 1100
Asn Gly Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala
1105 1110 1115 1120
Thr Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys
1125 1130 1135
Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro Lys
1140 1145 1150
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro Lys
1155 1160 1165
Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val Leu Val
1170 1175 1180
Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys Ser Val Lys
1185 1190 1195 1200
Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser Phe Glu Lys Asn
1205 1210 1215
Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys Glu Val Lys Lys Asp
1220 1225 1230
Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu Phe Glu Leu Glu Asn Gly
1235 1240 1245
Arg Lys Arg Met Leu Ala Ser Ala Gly Glu Leu Gln Lys Gly Asn Glu
1250 1255 1260
Leu Ala Leu Pro Ser Lys Tyr Val Asn Phe Leu Tyr Leu Ala Ser His
1265 1270 1275 1280
Tyr Glu Lys Leu Lys Gly Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu
1285 1290 1295
Phe Val Glu Gln His Lys His Tyr Leu Asp Glu Ile Ile Glu Gln Ile
1300 1305 1310
Ser Glu Phe Ser Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys
1315 1320 1325
Val Leu Ser Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln
1330 1335 1340
Ala Glu Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro
1345 1350 1355 1360
Ala Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr
1365 1370 1375
Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1380 1385 1390
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp Arg
1395 1400 1405
Pro Lys Lys Lys Arg Lys Val Gly Gly
1410 1415
<210> 5
<211> 4254
<212> DNA
<213> Artificial sequence (non)
<400> 5
atggactata aggatcacga tggcgactac aaggatcatg acattgacta taaggatgac 60
gacgataaga tggcacctaa gaagaaaagg aaagtcggca ttcatggcgt tccggcagcc 120
gacaaaaagt atagcatcgg cctcgatatt gggacaaact ctgtgggctg ggcggtaatt 180
accgacgagt acaaggtgcc tagtaagaaa tttaaagtgc tcggaaacac tgacaggcac 240
tctataaaga agaacctgat cggggcactg cttttcgact ccggagagac ggcggaggcg 300
acgcgtctca agcgtaccgc gcgccgcagg tacacaagaa ggaagaatag gatctgctac 360
ttgcaggaaa tcttcagtaa cgagatggcg aaggtcgacg atagtttctt tcatcggttg 420
gaagaatcgt tcctcgtaga ggaggacaaa aagcacgagc gtcacccaat attcgggaat 480
attgttgacg aggttgccta ccatgagaaa tatcctacaa tatatcacct ccgtaagaag 540
cttgtcgatt caactgataa ggctgatctc agactcatct atcttgccct cgcacatatg 600
attaagtttc gtggccactt cttgattgaa ggcgacctca acccggacaa ctcagatgtt 660
gacaagcttt ttatacagct cgtccagaca tataaccagc tgtttgaaga gaatcccatc 720
aatgcgagtg gggttgatgc taaggccatt ttgtccgcca ggttgtccaa atctcgcaga 780
ctggaaaacc tgatcgcaca gcttcccggt gaaaagaaaa acgggctctt cggcaatctc 840
atcgcactgt ccctcggcct caccccaaac ttcaagtcta acttcgacct ggccgaggat 900
gcgaagctcc agctgtcaaa agatacatac gacgacgatt tggacaatct gcttgcgcaa 960
ataggcgacc agtatgcgga cctgttcctg gctgccaaaa atctgtcaga tgcaatcctc 1020
ctgtccgata tattgcgtgt gaacaccgaa atcacgaagg caccgcttag cgcatccatg 1080
atcaagagat acgacgagca ccatcaggac ctcacactcc tcaaggcgct tgttcgtcag 1140
cagcttcccg agaaatataa ggaaattttt ttcgatcaaa gcaagaatgg atatgctggc 1200
tatattgacg gtggcgcttc gcaggaggag ttctataaat tcattaagcc gattctggag 1260
aagatggacg gaacggagga gctcctcgtc aagcttaacc gggaagacct gttgcggaag 1320
cagaggactt ttgataacgg ctctattccg caccaaatcc atctgggtga gttgcacgca 1380
atcttgagaa gacaagagga tttctacccg ttccttaagg ataacagaga gaagatagaa 1440
aaaatactga ccttcaggat accatactat gtgggcccac tggcgcgcgg aaatagtcgt 1500
ttcgcatgga tgactagaaa gtccgaagaa acgatcacgc catggaattt tgaggaagtg 1560
gtcgacaagg gcgcctctgc ccagagcttc atcgaaagga tgaccaattt tgacaaaaat 1620
ctgcctaacg aaaaggtgct tccgaagcac agcctgttgt atgaatactt cacagtttat 1680
aacgagctca ctaaggtcaa gtacgtcacg gagggcatgc gtaagcctgc tttcctgtct 1740
ggtgaacaaa aaaaggcgat tgtggacctc cttttcaaga cgaaccgtaa agttactgtg 1800
aagcaactga aagaggatta ctttaagaaa attgagtgct tcgacagtgt ggagatttcc 1860
ggtgtcgagg accggtttaa cgccagcctg ggtacgtatc atgacctgct taaaattatc 1920
aaggataaag atttcctgga taatgaagag aacgaagata tactggagga cattgtgttg 1980
actttgaccc tcttcgagga cagagagatg attgaggaaa gactgaagac ctacgcacac 2040
ctttttgatg acaaggtcat gaaacaactc aagcgccggc gctatactgg ctggggccgg 2100
ctttctcgca agctcatcaa tgggattcgg gataagcaat caggcaagac aattttggac 2160
ttcctcaaat ccgacggatt cgcaaatagg aattttatgc agctgataca tgacgactct 2220
ttgacattca aagaagacat acagaaggct caggtgtccg gccaaggaga ttctttgcac 2280
gagcatatcg ctaacttggc aggtagcccc gccataaaaa agggcattct tcaaacggta 2340
aaagttgttg acgaactcgt gaaggttatg ggccgtcata agccggaaaa cattgttatt 2400
gaaatggcta gggaaaatca gacgacccag aagggacaga aaaatagcag ggagcggatg 2460
aagagaattg aagagggaat taaggagctt ggatctcaga ttcttaagga gcaccctgtg 2520
gagaacaccc aacttcagaa tgaaaagctc tacctttact accttcaaaa cggccgggat 2580
atgtacgtcg atcaggaact tgacattaac cggttgagcg attatgacgt tgaccatatt 2640
gtgccccaat ctttccttaa agacgactct atcgacaata aagtgctgac gcgcagcgat 2700
aaaaatcgcg gtaagtcgga taatgtcccg tcggaagagg tggttaaaaa aatgaagaac 2760
tattggaggc aactcctgaa tgccaagctg atcactcaga ggaaattcga caatctcacc 2820
aaggcagaaa ggggtggact tagcgagctc gacaaggccg gttttatcaa aagacagctg 2880
gtggagacac gccaaatcac caaacacgtt gcccagatcc tggattcgag gatgaacacg 2940
aagtatgacg agaacgacaa gttgattagg gaagtcaagg tcatcacttt gaagtccaag 3000
ctggtgagcg actttcgcaa agacttccag ttttacaaag tcagggaaat taataactac 3060
caccacgccc acgacgccta ccttaacgcc gtggttggca cagcactcat caagaaatac 3120
cctaagctcg aatctgagtt cgtctatggc gactataagg tctacgacgt tagaaaaatg 3180
atcgcgaaat ctgagcagga aataggcaag gcaactgcca agtacttctt ctattccaat 3240
atcatgaact tttttaagac ggagattacc ctggcgaatg gtgagatccg caagcgccct 3300
ttgattgaga caaacggaga aacaggagag atcgtatggg acaaagggcg ggactttgct 3360
actgttagga aggtgctctc tatgccacaa gttaacattg tcaaaaaaac tgaagtgcag 3420
acaggtgggt ttagcaagga atctatcctg ccgaagagga actctgacaa gctgatcgcc 3480
cgcaagaaag attgggatcc gaaaaagtac ggaggattcg actcccccac agttgcgtac 3540
tccgtgcttg tcgtggccaa agtggagaag ggcaagtcta agaagctcaa gagcgtcaaa 3600
gagttgttgg ggatcacgat tatggagcgg tcgtctttcg aaaagaatcc gatagatttt 3660
ctcgaggcca agggttataa agaagtcaag aaggatctta tcatcaagct ccctaagtac 3720
tccctctttg agcttgaaaa cggacggaaa agaatgctgg cttcagcggg tgaacttcag 3780
aagggtaatg aactcgctct gccctcaaaa tatgtgaatt tcctttacct ggcatcacac 3840
tatgagaagc ttaaggggtc cccagaggac aacgagcaga agcaactgtt cgttgaacaa 3900
cacaagcact accttgacga gattatcgag caaatcagcg agtttagcaa gcgcgttata 3960
ctggcagacg caaatcttga taaggtcctt agcgcctaca acaagcatag agacaaaccc 4020
atccgggagc aggccgagaa cattattcat ctcttcacct tgacgaatct tggggccccg 4080
gccgcgttca agtacttcga tactaccata gacagaaagc gctatacatc gacaaaggaa 4140
gttcttgacg ccacgctgat ccaccaaagt ataacaggcc tctatgagac acgcatcgac 4200
ctttcgcagt tgggcggtga ccgccccaaa aagaagagga aagttggcgg gtga 4254
<210> 6
<211> 1706
<212> DNA
<213> Artificial sequence (non)
<400> 6
aaaaaattac cacatatttt ttttgtcaca cttgtttgaa gtgcagttta tctatcttta 60
tacatatatt taaactttac tctacgaata atataatcta tagtactaca ataatatcag 120
tgttttagag aatcatataa atgaacagtt agacatggtc taaaggacaa ttgagtattt 180
tgacaacagg actctacagt tttatctttt tagtgtgcat gtgttctcct ttttttttgc 240
aaatagcttc acctatataa tacttcatcc attttattag tacatccatt tagggtttag 300
ggttaatggt ttttatagac taattttttt agtacatcta ttttattcta ttttagcctc 360
taaattaaga aaactaaaac tctattttag tttttttatt taataattta gatataaaat 420
agaataaaat aaagtgacta aaaattaaac aaataccctt taagaaatta aaaaaactaa 480
ggaaacattt ttcttgtttc gagtagataa tgccagcctg ttaaacgccg tcgacgagtc 540
taacggacac caaccagcga accagcagcg tcgcgtcggg ccaagcgaag cagacggcac 600
ggcatctctg tcgctgcctc tggacccctc tcgagagttc cgctccaccg ttggacttgc 660
tccgctgtcg gcatccagaa attgcgtggc ggagcggcag acgtgagccg gcacggcagg 720
cggcctcctc ctcctctcac ggcacggcag ctacggggga ttcctttccc accgctcctt 780
cgctttccct tcctcgcccg ccgtaataaa tagacacccc ctccacaccc tctttcccca 840
acctcgtgtt gttcggagcg cacacacaca caaccagatc tcccccaaat ccacccgtcg 900
gcacctccgc ttcaaggtac gccgctcgtc ctcccccccc ccccctctct accttctcta 960
gatcggcgtt ccggtccatg gttagggccc ggtagttcta cttctgttca tgtttgtgtt 1020
agatccgtgt ttgtgttaga tccgtgctgc tagcgttcgt acacggatgc gacctgtacg 1080
tcagacacgt tctgattgct aacttgccag tgtttctctt tggggaatcc tgggatggct 1140
ctagccgttc cgcagacggg atcgatttca tgattttttt tgtttcgttg catagggttt 1200
ggtttgccct tttcctttat ttcaatatat gccgtgcact tgtttgtcgg gtcatctttt 1260
catgcttttt ttttgtcttg gttgtgatga tgtggtgtgg ttgggcggtc gttcattcgt 1320
tctagatcgg agtagaatac tgtttcaaac tacctggtgt atttattaat tttggaactg 1380
tatgtgtgtg tcatacatct tcatagttac gagtttaaga tggatggaaa tatcgatcta 1440
ggataggtat acatgttgat gtgggtttta ctgatgcata tacatgatgg catatgcagc 1500
atctattcat atgctctaac cttgagtacc tatctattat aataaacaag tatgttttat 1560
aattattttg atcttgatat acttggatga tggcatatgc agcagctata tgtggatttt 1620
tttagccctg ccttcatacg ctatttattt gcttggtact gtttcttttg tcgatgctca 1680
ccctgttgtt tggtgttact tctgca 1706
<210> 7
<211> 569
<212> DNA
<213> Artificial sequence (non)
<400> 7
tcaaaaagag gtatgctatg aagcagcgta ttacagtgac agttgacagc gacagctatc 60
agttgctcaa ggcatatatg atgtcaatat ctccggtctg gtaagcacaa ccatgcagaa 120
tgaagcccgt cgtctgcgtg ccgaacgctg gaaagcggaa aatcaggaag ggatggctga 180
ggtcgcccgg tttattgaaa tgaacggctc ttttgctgac gagaacaggg gctggtgaaa 240
tgcagtttaa ggtttacacc tataaaagag agagccgtta tcgtctgttt gtggatgtac 300
agagtgatat tattgacacg cccgggcgac ggatggtgat ccccctggcc agtgcacgtc 360
tgctgtcaga taaagtctcc cgtgaacttt acccggtggt gcatatcggg gatgaaagct 420
ggcgcatgat gaccaccgat atggccagtg tgccggtctc cgttatcggg gaagaagtgg 480
ctgatctcag ccaccgcgaa aatgacatca aaaacgccat taacctgatg ttctggggaa 540
tataaatgtc aggctccctt atacacagc 569
<210> 8
<211> 31
<212> DNA
<213> Artificial sequence (non)
<400> 8
gctggtctca ggaaacagac cttttcgacc t 31
<210> 9
<211> 19
<212> DNA
<213> Artificial sequence (non)
<400> 9
tctagaagct tggcactgg 19
<210> 10
<211> 28
<212> DNA
<213> Artificial sequence (non)
<400> 10
aggctcgagc aagacgatct acccgagc 28
<210> 11
<211> 33
<212> DNA
<213> Artificial sequence (non)
<400> 11
gaactcgagc cgatctagta acatagatga cac 33
<210> 12
<211> 47
<212> DNA
<213> Artificial sequence (non)
<400> 12
acccgcgcgc tgtcgcttgt gtagagacca ttggcggccg cattagg 47
<210> 13
<211> 48
<212> DNA
<213> Artificial sequence (non)
<400> 13
cttgctattt ctagctctaa aactgagacc gtcgacctgc agactggc 48
<210> 14
<211> 25
<212> DNA
<213> Artificial sequence (non)
<400> 14
acaagtttgt acaaaaaagc aggct 25
<210> 15
<211> 25
<212> DNA
<213> Artificial sequence (non)
<400> 15
acccagcttt cttgtacaaa gtggt 25
<210> 16
<211> 74
<212> DNA
<213> Artificial sequence (non)
<400> 16
cccgcgcgct gtcgcttgtg tgtcatccag tacaacatct tgttttagag ctagaaatag 60
caagttaaaa taag 74
<210> 17
<211> 74
<212> DNA
<213> Artificial sequence (non)
<400> 17
cccgcgcgct gtcgcttgtg tgatggccat cacggtggca tgttttagag ctagaaatag 60
caagttaaaa taag 74
<210> 18
<211> 73
<212> DNA
<213> Artificial sequence (non)
<400> 18
cccgcgcgct gtcgcttgtg tgaagtatta ctactcgatg gttttagagc tagaaatagc 60
aagttaaaat aag 73
<210> 19
<211> 74
<212> DNA
<213> Artificial sequence (non)
<400> 19
cccgcgcgct gtcgcttgtg tgctaatggc atgggaaacc agttttagag ctagaaatag 60
caagttaaaa taag 74
<210> 20
<211> 74
<212> DNA
<213> Artificial sequence (non)
<400> 20
cccgcgcgct gtcgcttgtg tgtcaggccg acgatgacgc agttttagag ctagaaatag 60
caagttaaaa taag 74
<210> 21
<211> 73
<212> DNA
<213> Artificial sequence (non)
<400> 21
cccgcgcgct gtcgcttgtg tgtgtacggg aacttcttcg gttttagagc tagaaatagc 60
aagttaaaat aag 73
<210> 22
<211> 74
<212> DNA
<213> Artificial sequence (non)
<400> 22
cccgcgcgct gtcgcttgtg tgccaaccag tcgtccaacg cgttttagag ctagaaatag 60
caagttaaaa taag 74
<210> 23
<211> 60
<212> DNA
<213> Artificial sequence (non)
<400> 23
cacattgccc agctaactcg aacgcgacca acttataaac ccgcgcgctg tcgcttgtgt 60
<210> 24
<211> 70
<212> DNA
<213> Artificial sequence (non)
<400> 24
actcggtgcc actttttcaa gttgataacg gactagcctt attttaactt gctatttcta 60
gctctaaaac 70
<210> 25
<211> 17
<212> DNA
<213> Artificial sequence (non)
<400> 25
ctgtgatggg ctggatg 17
<210> 26
<211> 18
<212> DNA
<213> Artificial sequence (non)
<400> 26
ctgcagaatt gcccttcg 18
<210> 27
<211> 20
<212> DNA
<213> Artificial sequence (non)
<400> 27
gccgaaagcg accaaatctc 20
<210> 28
<211> 20
<212> DNA
<213> Artificial sequence (non)
<400> 28
cgagctgtca tccaaaccca 20
<210> 29
<211> 19
<212> DNA
<213> Artificial sequence (non)
<400> 29
ctcctcccaa cgccattga 19
<210> 30
<211> 20
<212> DNA
<213> Artificial sequence (non)
<400> 30
tcacgatcac acgaggttga 20
<210> 31
<211> 20
<212> DNA
<213> Artificial sequence (non)
<400> 31
cttgaagact gggactgcgt 20
<210> 32
<211> 20
<212> DNA
<213> Artificial sequence (non)
<400> 32
gctcccgcaa agtccttagt 20
<210> 33
<211> 22
<212> DNA
<213> Artificial sequence (non)
<400> 33
atgccatgag atcttgtctt gc 22
<210> 34
<211> 21
<212> DNA
<213> Artificial sequence (non)
<400> 34
ttttcgttgg ttgcacggtt t 21
<210> 35
<211> 20
<212> DNA
<213> Artificial sequence (non)
<400> 35
gagagagcca ctagcagcag 20
<210> 36
<211> 20
<212> DNA
<213> Artificial sequence (non)
<400> 36
ccactcatcg acgcgtatca 20
<210> 37
<211> 20
<212> DNA
<213> Artificial sequence (non)
<400> 37
ccgagtcaaa aagaggggga 20
<210> 38
<211> 20
<212> DNA
<213> Artificial sequence (non)
<400> 38
ataacatttt tgggccgccg 20
<210> 39
<211> 20
<212> DNA
<213> Artificial sequence (non)
<400> 39
gtccgagctt ggaaggagaa 20
<210> 40
<211> 20
<212> DNA
<213> Artificial sequence (non)
<400> 40
caggtgccac gaagatctga 20
Claims (25)
1. A plasmid vector for rice comprising a gene expression cassette comprising a Cas9 protein, a first promoter, a nucleotide sequence comprising a suicide gene sequence, a gRNA scaffold element and a termination signal, and at least one first enzyme cleavage site located 5 'of the nucleotide sequence comprising the suicide gene sequence and at least one second enzyme cleavage site located 3' of the nucleotide sequence comprising the suicide gene sequence; and the first and second enzyme cleavage sites are absent from other positions of the plasmid vector;
wherein the first promoter is located upstream of the termination signal, the nucleotide sequence comprising a suicide gene sequence and the gRNA scaffold element and are both located between the first promoter and the termination signal;
obtaining at least one I element which comprises an I-1 nucleotide sequence, an I-2 nucleotide sequence and an I-3 nucleotide sequence from 5 'end to 3' end in sequence;
wherein the I-1 nucleotide sequence is identical to a sequence which is 20bp or more upstream of the 5' end from the first enzyme cutting site of the plasmid vector linearized by the first enzyme cutting site and the second enzyme cutting site;
the I-2 nucleotide sequence is a target nucleotide sequence which is consistent with a part of nucleotide sequences on plant endogenous genes;
the I-3 nucleotide sequence is consistent with a sequence which is 20bp above the downstream of the 3' end from the second enzyme cutting site of the plasmid vector after being linearized by the first enzyme cutting site and the second enzyme cutting site;
exchanging said I element by homologous recombination respectively onto said plasmid vector to obtain a targeting vector in which said I element is located between said first promoter and a termination signal and said I-2 nucleotide sequence and said gRNA scaffold are transcriptionally fused;
the amino acid sequence of the Cas9 protein is shown as SEQ ID No. 4; the nucleotide sequence of the gene for encoding the Cas9 protein is shown as SEQ ID No. 5;
the nucleotide sequence containing the suicide gene sequence is shown as SEQ ID No. 7;
the nucleotide sequence of the gRNA scafffold element is shown as SEQ ID No. 3;
the plasmid vector further comprises a marker gene expression cassette;
the marker gene expression cassette comprises a selection gene expression cassette and a reporter gene expression cassette;
the selection gene expression cassette comprises a selection gene expression cassette for use in a plant;
the reporter gene is selected from at least one of beta galactosidase gene, luciferase gene, fluorescent protein gene and seed coat color gene.
2. The plasmid vector of claim 1 wherein the selection gene expression cassette is a hygromycin resistance gene expression cassette or a G418 resistance gene expression cassette.
3. The plasmid vector of claim 1 wherein the reporter gene is selected from at least one of the fluorescent protein genes.
4. The plasmid vector according to claim 3, wherein the fluorescent protein gene is at least one selected from the group consisting of a green fluorescent protein gene, a red fluorescent protein gene, a cyan fluorescent protein gene, a blue fluorescent protein gene, and a yellow protein gene.
5. The plasmid vector of claim 1, wherein the nucleotide sequence of the first promoter is selected from the group consisting of nucleotide sequences specific to rice.
6. A plasmid vector according to claim 5, characterized in that the first promoter is an RNA polymerase type III promoter.
7. The plasmid vector of claim 6, wherein the nucleotide sequence of the first promoter is the nucleotide sequence shown as SEQ ID No. 1.
8. The plasmid vector of claim 1, wherein the nucleotide sequence of the termination signal is the nucleotide sequence shown as SEQ ID No. 2.
9. A plasmid vector according to any of claims 1-8, characterized in that the gene expression cassette comprising the expressed Cas9 protein comprises a second promoter from 5 'end to 3' end, a gene encoding the Cas9 protein and a first terminator.
10. The plasmid vector of claim 1 wherein the second promoter is an RNA polymerase II type promoter.
11. The plasmid vector of claim 10, wherein the nucleotide sequence of the second promoter is set forth in SEQ ID No. 6.
12. The plasmid vector according to claim 10, wherein the nucleotide sequence of the first terminator is the 8 th to 260 th nucleotide sequence of genebank accession number FJ 362600.1.
13. The plasmid vector of claim 1 wherein the selection gene expression cassette comprises a third promoter from 5 'to 3', a selection gene, and a second terminator.
14. A plasmid vector according to claim 13, wherein the third promoter is an RNA polymerase type II promoter.
15. The plasmid vector of claim 13, wherein the nucleotide sequence of the third promoter is the 10382 th to 11162 th nucleotide sequence of genebank accession number FJ 362600.1.
16. The plasmid vector of claim 13 wherein the nucleotide sequence of the second terminator is the 8 th to 260 th nucleotide sequence of genebank accession number FJ 362600.1.
17. The plasmid vector of claim 1 wherein the reporter expression cassette comprises a fourth promoter from 5 'to 3', a reporter gene, and a third terminator.
18. The plasmid vector of claim 17, wherein the fourth promoter is an RNA polymerase II type promoter; the nucleotide sequence of the fourth promoter is the 10382 to 11162 nucleotide sequences in genebank accession number FJ 362600.1.
19. The plasmid vector of claim 17, wherein the nucleotide sequence of the third terminator is the 8 th to 260 th nucleotide sequence of genebank accession number FJ 362600.1.
20. A method of establishing a mutant population of plants obtained by targeting at least one gene endogenous to a plant, comprising the steps of:
the method comprises the following steps: obtaining at least one I element which comprises an I-1 nucleotide sequence, an I-2 nucleotide sequence and an I-3 nucleotide sequence from 5 'end to 3' end in sequence;
wherein the I-1 nucleotide sequence is identical to a sequence which is 20bp or more upstream of the 5' end from the first enzyme cutting site of the plasmid vector linearized by the first enzyme cutting site and the second enzyme cutting site;
the I-2 nucleotide sequence is a target nucleotide sequence which is consistent with a part of nucleotide sequences on the plant endogenous gene;
the I-3 nucleotide sequence is consistent with a sequence which is 20bp above the downstream of the 3' end from the second enzyme cutting site of the plasmid vector after being linearized by the first enzyme cutting site and the second enzyme cutting site;
step two: exchanging said I element by homologous recombination respectively onto said plasmid vector of any one of claims 1-19 to obtain a targeting vector in which said I element is located between said first promoter and a termination signal and said I-2 nucleotide sequence and said gRNA scaffold are transcriptionally fused;
step three: introducing the targeting vector into plant callus or plant protoplast, and culturing to obtain plant;
step four: screening the plant plants to obtain transgenic plants containing targeted plant endogenous genes; further, the transgenic plant containing the targeted plant endogenous gene is capable of producing transgenic plant seed containing the targeted plant endogenous gene.
21. The method of claim 20, wherein the I-1 nucleotide sequence is identical to a sequence 50-80bp upstream from the 5' end of the first cleavage site of the plasmid vector linearized by the first cleavage site and the second cleavage site;
the I-3 nucleotide sequence is consistent with a sequence which is 50-80bp from the second enzyme cutting site of the plasmid vector after being linearized by the first enzyme cutting site and the second enzyme cutting site to the downstream of the 3' end.
22. The method of claim 20, wherein the target nucleotide sequence is determined by:
1) determining at least one plant endogenous gene on a plant genome that is to be targeted;
2) searching the coding sequence of the plant endogenous gene or the reverse complementary sequence thereof for a PAM (Polyacrylamide) module sequence capable of being recognized by the Cas9 protein, and determining 17-21 nucleotide sequences upstream of the 5 'end of the PAM module sequence as the target nucleotide sequence under the condition of ensuring that 17-21 nucleotide sequences upstream of the 5' end of the PAM module sequence are specific sequences in the genome.
23. The method of claim 20, wherein when the Cas9 protein is the amino acid sequence shown in SEQ ID No.4, the PAM recognition module is one of 5 ' -NGG-3 ', 5 ' -NGA-3 ', 5 ' -GAGN-3 ', 5 ' -AAGN-3 ', the target nucleotide sequence is 17 to 21 nucleotide sequences upstream of the 5 ' end of the PAM module, eliminating nucleotide sequences containing five consecutive ts;
wherein N is one of A, G, C and T.
24. Method according to any of claims 20-23, characterized in that at least one of said I-th elements is obtained by:
I) obtaining at least one oligonucleotide sequence comprising a nucleotide sequence II-1, a nucleotide sequence II-2, and a nucleotide sequence II-3 in that order from a 5 'end to a 3' end; wherein the II-1 nucleotide sequence is identical to a sequence which is 20bp or more upstream of the 5' end from the first enzyme cutting site of the plasmid vector linearized by the first enzyme cutting site and the second enzyme cutting site;
the II-2 nucleotide sequence comprises a target nucleotide sequence identical to a partial nucleotide sequence on the endogenous gene of the plant;
the II-3 nucleotide sequence is consistent with a sequence which is 20bp above the downstream of the 3' end from the second enzyme cutting site of the plasmid vector after being linearized by the first enzyme cutting site and the second enzyme cutting site;
II) obtaining an upstream primer and a downstream primer; wherein,
the upstream primer is consistent with a sequence which is more than 20bp from the first enzyme cutting site of the plasmid vector linearized by the first enzyme cutting site and the second enzyme cutting site to the upstream of the 5' end;
the downstream primer is reversely complementary with a sequence which is more than 20bp from the second enzyme cutting site of the plasmid vector linearized by the first enzyme cutting site and the second enzyme cutting site to the downstream of the 3' end;
III) carrying out PCR amplification by taking the oligonucleotide sequence in the step I) as a template and the upstream primer and the downstream primer in the step II) as a primer pair, thereby obtaining the I element.
25. The method of claim 24, wherein the II-1 nucleotide sequence is identical to a sequence 20-35bp upstream from the 5' end of the first cleavage site of the plasmid vector linearized by the first cleavage site and the second cleavage site;
the II-3 nucleotide sequence is consistent with a sequence which is 20-35bp from the second enzyme cutting site of the plasmid vector after being linearized by the first enzyme cutting site and the second enzyme cutting site to the downstream of the 3' end;
the upstream primer is consistent with a sequence which is 50-80bp from the first enzyme cutting site of the plasmid vector linearized by the first enzyme cutting site and the second enzyme cutting site to the upstream of the 5' end;
and the downstream primer is reversely complementary with a sequence which is 50-80bp from the second enzyme cutting site of the plasmid vector after being linearized by the first enzyme cutting site and the second enzyme cutting site to the downstream of the 3' end.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711294415.4A CN108034671B (en) | 2017-12-08 | 2017-12-08 | Plasmid vector and method for establishing plant population by using same |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711294415.4A CN108034671B (en) | 2017-12-08 | 2017-12-08 | Plasmid vector and method for establishing plant population by using same |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108034671A CN108034671A (en) | 2018-05-15 |
CN108034671B true CN108034671B (en) | 2021-09-28 |
Family
ID=62101465
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711294415.4A Active CN108034671B (en) | 2017-12-08 | 2017-12-08 | Plasmid vector and method for establishing plant population by using same |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108034671B (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109593776B (en) * | 2018-05-16 | 2021-02-26 | 华中农业大学 | Method for quickly and efficiently obtaining non-transgenic directional gene mutant plant and application |
CN108841845A (en) * | 2018-06-21 | 2018-11-20 | 广东石油化工学院 | A kind of CRISPR/Cas9 carrier and its construction method with selection markers |
CN114045303B (en) * | 2018-11-07 | 2023-08-29 | 中国农业科学院植物保护研究所 | Artificial gene editing system for rice |
CN109411022A (en) * | 2018-11-07 | 2019-03-01 | 宁夏农林科学院农业生物技术研究中心(宁夏农业生物技术重点实验室) | A kind of gRNA of structure containing PAM targeting sequence screening method and application based on character microtomy |
CN117402855B (en) * | 2023-12-14 | 2024-03-19 | 中国农业科学院植物保护研究所 | Cas protein, gene editing system and application |
-
2017
- 2017-12-08 CN CN201711294415.4A patent/CN108034671B/en active Active
Non-Patent Citations (2)
Title |
---|
Large chromosomal deletions and heritable small genetic changes induced by CRISPR/Cas9 in rice;Huanbin Zhou等;《Nucleic Acids Research》;20140908;第42卷(第17期);第10903-10914页,特别是第10903页最后1段-第10904页左栏第2段,右栏第4段-第10905页右栏第2段-最后1段-第10906页右栏第3段,图1 * |
水稻靶标基因单碱基定向替换技术的建立;周焕斌等;《中国科学》;20170612;第47卷(第11期);第1177-1185页 * |
Also Published As
Publication number | Publication date |
---|---|
CN108034671A (en) | 2018-05-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107177625B (en) | Artificial vector system for site-directed mutagenesis and site-directed mutagenesis method | |
CN107988229B (en) | Method for obtaining tillering-changed rice by modifying OsTAC1 gene through CRISPR-Cas | |
CN105132451B (en) | A kind of single transcriptional units directed modification skeleton carrier of CRISPR/Cas9 and its application | |
CN108034671B (en) | Plasmid vector and method for establishing plant population by using same | |
Cui et al. | Tnt1 retrotransposon mutagenesis: a tool for soybean functional genomics | |
US20210348179A1 (en) | Compositions and methods for regulating gene expression for targeted mutagenesis | |
CN113801891B (en) | Construction method and application of beet BvCENH3 gene haploid induction line | |
CN111100852B (en) | Directional mutation method of OsALS1 and crop endogenous gene directed evolution method | |
CN104911191B (en) | A kind of sterility changing gene FG3 and its application | |
Li et al. | Rapid generation of selectable marker-free transgenic rice with three target genes by co-transformation and anther culture | |
CN112011547A (en) | Major gene for controlling rape leaf shape and application thereof | |
CN107365772B (en) | Plant pollen specific promoter PSP1 and application thereof | |
CN115927381A (en) | Rape RNA processing factor NCBP gene and application thereof | |
CN112795571B (en) | Herbicide-resistant corn transformant and preparation method thereof | |
CN108341857A (en) | A kind of and rice yield GAP-associated protein GAP and its encoding gene and application | |
CN108660139A (en) | Plant fertility controlling gene NP2 and its coding albumen and application | |
CN112522299A (en) | Method for obtaining rice with increased tillering by using OsTNC1 gene mutation | |
CN106883291A (en) | Plant plant type GAP-associated protein GAP PROG2 and its encoding gene and application | |
CN113493786A (en) | Method for improving rice grain traits by blocking or weakening expression of OsMIR3979 in rice | |
CN115820716B (en) | Method for efficiently obtaining gene editing plant without carrying transgenic element | |
CN112852866B (en) | Method for cultivating plant male sterile line by utilizing mitochondrial gene editing system | |
CN116463348B (en) | Sg RNA for editing corn ZmCENH gene by using CRISPR/Cas9 system and application thereof | |
CN114181943B (en) | Method for creating early maturing corn germplasm and application thereof | |
CN113136397B (en) | Recombinant vector for improving gene editing efficiency of gentiana rigescens and preparation method and application thereof | |
CN110129359B (en) | Method for detecting gene editing event and determining gene editing efficiency and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |