CN114989281A - Gene ZmEIN2-1 for controlling water content of corn kernels and molecular marker thereof - Google Patents
Gene ZmEIN2-1 for controlling water content of corn kernels and molecular marker thereof Download PDFInfo
- Publication number
- CN114989281A CN114989281A CN202210595676.4A CN202210595676A CN114989281A CN 114989281 A CN114989281 A CN 114989281A CN 202210595676 A CN202210595676 A CN 202210595676A CN 114989281 A CN114989281 A CN 114989281A
- Authority
- CN
- China
- Prior art keywords
- seq
- leu
- water content
- corn
- gene
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 240000008042 Zea mays Species 0.000 title claims abstract description 116
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 110
- 235000002017 Zea mays subsp mays Nutrition 0.000 title claims abstract description 99
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 title claims abstract description 85
- 235000005822 corn Nutrition 0.000 title claims abstract description 85
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 title claims abstract description 69
- 239000003147 molecular marker Substances 0.000 title claims description 27
- 230000018044 dehydration Effects 0.000 claims abstract description 62
- 238000006297 dehydration reaction Methods 0.000 claims abstract description 62
- 238000000034 method Methods 0.000 claims abstract description 57
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 43
- 230000014509 gene expression Effects 0.000 claims abstract description 16
- 238000012216 screening Methods 0.000 claims abstract description 7
- 241000196324 Embryophyta Species 0.000 claims description 87
- 239000003550 marker Substances 0.000 claims description 47
- 235000013339 cereals Nutrition 0.000 claims description 36
- 239000000463 material Substances 0.000 claims description 35
- 150000007523 nucleic acids Chemical class 0.000 claims description 28
- 239000002773 nucleotide Substances 0.000 claims description 23
- 125000003729 nucleotide group Chemical group 0.000 claims description 23
- 238000001514 detection method Methods 0.000 claims description 22
- 102000039446 nucleic acids Human genes 0.000 claims description 20
- 108020004707 nucleic acids Proteins 0.000 claims description 20
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 18
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 14
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 claims description 14
- 235000009973 maize Nutrition 0.000 claims description 14
- 238000012408 PCR amplification Methods 0.000 claims description 9
- 230000000295 complement effect Effects 0.000 claims description 9
- 238000012360 testing method Methods 0.000 claims description 9
- 230000000694 effects Effects 0.000 claims description 8
- 108090000848 Ubiquitin Proteins 0.000 claims description 7
- 102000044159 Ubiquitin Human genes 0.000 claims description 7
- 230000002441 reversible effect Effects 0.000 claims description 2
- 230000001276 controlling effect Effects 0.000 abstract description 14
- 230000001105 regulatory effect Effects 0.000 abstract description 2
- 238000010353 genetic engineering Methods 0.000 abstract 1
- 210000004027 cell Anatomy 0.000 description 39
- 230000002068 genetic effect Effects 0.000 description 29
- 235000018102 proteins Nutrition 0.000 description 29
- 108020004414 DNA Proteins 0.000 description 27
- 238000009396 hybridization Methods 0.000 description 24
- 239000000523 sample Substances 0.000 description 19
- 235000007244 Zea mays Nutrition 0.000 description 16
- 230000006798 recombination Effects 0.000 description 16
- 108700028369 Alleles Proteins 0.000 description 15
- 235000001014 amino acid Nutrition 0.000 description 15
- 238000005215 recombination Methods 0.000 description 15
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 14
- 238000013507 mapping Methods 0.000 description 14
- 239000012634 fragment Substances 0.000 description 13
- 238000003752 polymerase chain reaction Methods 0.000 description 13
- 108010047495 alanylglycine Proteins 0.000 description 12
- 229940024606 amino acid Drugs 0.000 description 12
- 150000001413 amino acids Chemical class 0.000 description 12
- 230000002018 overexpression Effects 0.000 description 11
- 238000010362 genome editing Methods 0.000 description 9
- 108091033319 polynucleotide Proteins 0.000 description 9
- 102000040430 polynucleotide Human genes 0.000 description 9
- 239000002157 polynucleotide Substances 0.000 description 9
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- 238000003780 insertion Methods 0.000 description 8
- 230000037431 insertion Effects 0.000 description 8
- 238000005406 washing Methods 0.000 description 8
- 108020004705 Codon Proteins 0.000 description 7
- 238000012098 association analyses Methods 0.000 description 7
- 210000000349 chromosome Anatomy 0.000 description 7
- 238000006467 substitution reaction Methods 0.000 description 7
- 210000001519 tissue Anatomy 0.000 description 7
- 230000009261 transgenic effect Effects 0.000 description 7
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- 230000008859 change Effects 0.000 description 6
- 239000002299 complementary DNA Substances 0.000 description 6
- 238000003306 harvesting Methods 0.000 description 6
- 108090000765 processed proteins & peptides Proteins 0.000 description 6
- 239000013598 vector Substances 0.000 description 6
- 108700024394 Exon Proteins 0.000 description 5
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 5
- 230000003321 amplification Effects 0.000 description 5
- 238000009395 breeding Methods 0.000 description 5
- 230000001488 breeding effect Effects 0.000 description 5
- 238000003199 nucleic acid amplification method Methods 0.000 description 5
- 229920001184 polypeptide Polymers 0.000 description 5
- 102000004196 processed proteins & peptides Human genes 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 108091026890 Coding region Proteins 0.000 description 4
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 230000000875 corresponding effect Effects 0.000 description 4
- 238000012217 deletion Methods 0.000 description 4
- 230000037430 deletion Effects 0.000 description 4
- 210000002257 embryonic structure Anatomy 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 230000010152 pollination Effects 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- 238000004064 recycling Methods 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 3
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 3
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 3
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 3
- VEAIMHJZTIDCIH-KKUMJFAQSA-N Arg-Phe-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEAIMHJZTIDCIH-KKUMJFAQSA-N 0.000 description 3
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 3
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 3
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 3
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 3
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 3
- QQOWCDCBFFBRQH-IXOXFDKPSA-N Cys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N)O QQOWCDCBFFBRQH-IXOXFDKPSA-N 0.000 description 3
- 108010090461 DFG peptide Proteins 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 108091060211 Expressed sequence tag Proteins 0.000 description 3
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 3
- YVYVMJNUENBOOL-KBIXCLLPSA-N Glu-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N YVYVMJNUENBOOL-KBIXCLLPSA-N 0.000 description 3
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 3
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 3
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 3
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 3
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 3
- QGDOOCIPHSSADO-STQMWFEESA-N Gly-Met-Phe Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGDOOCIPHSSADO-STQMWFEESA-N 0.000 description 3
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 3
- XJQDHFMUUBRCGA-KKUMJFAQSA-N His-Asn-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XJQDHFMUUBRCGA-KKUMJFAQSA-N 0.000 description 3
- SWSVTNGMKBDTBM-DCAQKATOSA-N His-Gln-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SWSVTNGMKBDTBM-DCAQKATOSA-N 0.000 description 3
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 3
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 3
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 3
- NUKXXNFEUZGPRO-BJDJZHNGSA-N Ile-Leu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUKXXNFEUZGPRO-BJDJZHNGSA-N 0.000 description 3
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 3
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 3
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 3
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 3
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 3
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 3
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 3
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 3
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 3
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 3
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 3
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 3
- 241000209510 Liliopsida Species 0.000 description 3
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 3
- XDPLZVNMYQOFQZ-BJDJZHNGSA-N Lys-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N XDPLZVNMYQOFQZ-BJDJZHNGSA-N 0.000 description 3
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 3
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 3
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 3
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 3
- NTYQUVLERIHPMU-HRCADAONSA-N Met-Phe-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N NTYQUVLERIHPMU-HRCADAONSA-N 0.000 description 3
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 3
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 3
- SXJGROGVINAYSH-AVGNSLFASA-N Phe-Gln-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SXJGROGVINAYSH-AVGNSLFASA-N 0.000 description 3
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 3
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 3
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 3
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 3
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 3
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 3
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 3
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 3
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 3
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 3
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 3
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 3
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 3
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 3
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 3
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 3
- WMBFONUKQXGLMU-WDSOQIARSA-N Trp-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WMBFONUKQXGLMU-WDSOQIARSA-N 0.000 description 3
- IXTQGBGHWQEEDE-AVGNSLFASA-N Tyr-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IXTQGBGHWQEEDE-AVGNSLFASA-N 0.000 description 3
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 3
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 3
- VSYROIRKNBCULO-BWAGICSOSA-N Tyr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O VSYROIRKNBCULO-BWAGICSOSA-N 0.000 description 3
- JOQSQZFKFYJKKJ-GUBZILKMSA-N Val-Arg-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JOQSQZFKFYJKKJ-GUBZILKMSA-N 0.000 description 3
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 3
- IQQYYFPCWKWUHW-YDHLFZDLSA-N Val-Asn-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N IQQYYFPCWKWUHW-YDHLFZDLSA-N 0.000 description 3
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 3
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 3
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 3
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 3
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 3
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 3
- 230000009418 agronomic effect Effects 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 108010087924 alanylproline Proteins 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 238000010219 correlation analysis Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 3
- 235000013399 edible fruits Nutrition 0.000 description 3
- 230000002349 favourable effect Effects 0.000 description 3
- 238000003208 gene overexpression Methods 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- 108010018006 histidylserine Proteins 0.000 description 3
- 108010091871 leucylmethionine Proteins 0.000 description 3
- 108010054155 lysyllysine Proteins 0.000 description 3
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 239000003068 molecular probe Substances 0.000 description 3
- 230000036961 partial effect Effects 0.000 description 3
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 210000001938 protoplast Anatomy 0.000 description 3
- 238000007894 restriction fragment length polymorphism technique Methods 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 238000005204 segregation Methods 0.000 description 3
- 108010026333 seryl-proline Proteins 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 2
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 2
- 206010020649 Hyperkeratosis Diseases 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 108091027544 Subgenomic mRNA Proteins 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 239000000654 additive Substances 0.000 description 2
- 230000000996 additive effect Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 238000012239 gene modification Methods 0.000 description 2
- 230000005017 genetic modification Effects 0.000 description 2
- 235000013617 genetically modified food Nutrition 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- PHTQWCKDNZKARW-UHFFFAOYSA-N isoamylol Chemical compound CC(C)CCO PHTQWCKDNZKARW-UHFFFAOYSA-N 0.000 description 2
- 238000007834 ligase chain reaction Methods 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 238000007899 nucleic acid hybridization Methods 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 230000000306 recurrent effect Effects 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000010153 self-pollination Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 230000001568 sexual effect Effects 0.000 description 2
- 229910001415 sodium ion Inorganic materials 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 239000007858 starting material Substances 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- 239000001707 (E,7R,11R)-3,7,11,15-tetramethylhexadec-2-en-1-ol Substances 0.000 description 1
- 101150028074 2 gene Proteins 0.000 description 1
- 108020003589 5' Untranslated Regions Proteins 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- IHRGVZXPTIQNIP-NAKRPEOUSA-N Ala-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C)N IHRGVZXPTIQNIP-NAKRPEOUSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 238000010356 CRISPR-Cas9 genome editing Methods 0.000 description 1
- LZZYPRNAOMGNLH-UHFFFAOYSA-M Cetrimonium bromide Chemical compound [Br-].CCCCCCCCCCCCCCCC[N+](C)(C)C LZZYPRNAOMGNLH-UHFFFAOYSA-M 0.000 description 1
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 1
- KLLFLHBKSJAUMZ-ACZMJKKPSA-N Cys-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N KLLFLHBKSJAUMZ-ACZMJKKPSA-N 0.000 description 1
- CUXIOFHFFXNUGG-HTFCKZLJSA-N Cys-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CS)N CUXIOFHFFXNUGG-HTFCKZLJSA-N 0.000 description 1
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 241001057636 Dracaena deremensis Species 0.000 description 1
- VGGSQFUCUMXWEO-UHFFFAOYSA-N Ethene Chemical compound C=C VGGSQFUCUMXWEO-UHFFFAOYSA-N 0.000 description 1
- 239000005977 Ethylene Substances 0.000 description 1
- 206010071602 Genetic polymorphism Diseases 0.000 description 1
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 1
- QZQYITIKPAUDGN-GVXVVHGQSA-N Gln-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QZQYITIKPAUDGN-GVXVVHGQSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- 108010044467 Isoenzymes Proteins 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 238000007476 Maximum Likelihood Methods 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 206010034133 Pathogen resistance Diseases 0.000 description 1
- 108091093037 Peptide nucleic acid Proteins 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- FXEKNHAJIMHRFJ-ULQDDVLXSA-N Phe-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N FXEKNHAJIMHRFJ-ULQDDVLXSA-N 0.000 description 1
- BLUHKGOSFDHHGX-UHFFFAOYSA-N Phytol Natural products CC(C)CCCC(C)CCCC(C)CCCC(C)C=CO BLUHKGOSFDHHGX-UHFFFAOYSA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- HNZBNQYXWOLKBA-UHFFFAOYSA-N Tetrahydrofarnesol Natural products CC(C)CCCC(C)CCCC(C)=CCO HNZBNQYXWOLKBA-UHFFFAOYSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- 241000209140 Triticum Species 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- HHPSUFUXXBOFQY-AQZXSJQPSA-N Trp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O HHPSUFUXXBOFQY-AQZXSJQPSA-N 0.000 description 1
- KEANSLVUGJADPN-LKTVYLICSA-N Tyr-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N KEANSLVUGJADPN-LKTVYLICSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- HQYVQDRYODWONX-DCAQKATOSA-N Val-His-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N HQYVQDRYODWONX-DCAQKATOSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 241000209149 Zea Species 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- BOTWFXYSPFMFNR-OALUTQOASA-N all-rac-phytol Natural products CC(C)CCC[C@H](C)CCC[C@H](C)CCCC(C)=CCO BOTWFXYSPFMFNR-OALUTQOASA-N 0.000 description 1
- 230000006229 amino acid addition Effects 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 230000010310 bacterial transformation Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 238000002306 biochemical method Methods 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical class NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000000368 destabilizing effect Effects 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 210000005069 ears Anatomy 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000011536 extraction buffer Substances 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 229940029575 guanosine Drugs 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 239000010903 husk Substances 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 230000000442 meristematic effect Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 235000019198 oils Nutrition 0.000 description 1
- 230000008723 osmotic stress Effects 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- BOTWFXYSPFMFNR-PYDDKJGSSA-N phytol Chemical compound CC(C)CCC[C@@H](C)CCC[C@@H](C)CCC\C(C)=C\CO BOTWFXYSPFMFNR-PYDDKJGSSA-N 0.000 description 1
- 238000003976 plant breeding Methods 0.000 description 1
- 230000003234 polygenic effect Effects 0.000 description 1
- 102000054765 polymorphisms of proteins Human genes 0.000 description 1
- 230000002028 premature Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 238000000611 regression analysis Methods 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 108010007375 seryl-seryl-seryl-arginine Proteins 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 238000004114 suspension culture Methods 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
- HRXKRNGNAMMEHJ-UHFFFAOYSA-K trisodium citrate Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O HRXKRNGNAMMEHJ-UHFFFAOYSA-K 0.000 description 1
- 229940038773 trisodium citrate Drugs 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6888—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
- C12Q1/6895—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for plants, fungi or algae
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/13—Plant traits
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/156—Polymorphic or mutational markers
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Analytical Chemistry (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Botany (AREA)
- Immunology (AREA)
- Plant Pathology (AREA)
- Cell Biology (AREA)
- Medicinal Chemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Mycology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention relates to a gene ZmEIN2-1 for controlling the water content of corn kernels, related molecular markers and application thereof in screening or improving the water content or dehydration rate traits of the corn kernels, belonging to the field of molecular genetics. The invention provides a sequence of a ZmEIN2-1 gene, and discloses 2 InDel sites which are remarkably related to the water content or dehydration rate of corn kernels in the ZmEIN2-1 gene: InDel _7/0 and InDel _ 303/000. The invention discloses a method for screening the moisture content or dehydration rate of corn kernels by using molecular markers developed based on the InDel _7/0 and InDel _303/000 loci. Furthermore, the invention discloses a method for regulating and controlling the water content or dehydration rate of corn kernels by changing the expression of ZmEIN2-1 protein through a genetic engineering means.
Description
Technical Field
The invention relates to a gene ZmEIN2-1 for controlling the water content of corn kernels, related molecular markers and application thereof in screening or improving the water content or dehydration rate of the corn kernels, belonging to the field of molecular genetics.
Background
The kernel moisture is a key factor influencing the mechanical harvesting quality, safe storage and economic benefit of the corn. The grain water content during harvesting has great influence on corn harvesting, drying, storage, transportation and processing utilization, and the excessively high water content causes economic loss to corn growers and operators, reduces economic benefit, and is easy to cause grain mildew and influence corn quality. In addition, corn kernel recycling has become one of the major factors limiting corn production in our country, and the most critical step in corn kernel recycling is that the water content of corn kernels at the time of recycling cannot reach the standard water content of less than or equal to 25% that can be achieved by the kernel recycling (Wang Z, Wang X, Zhang L, Liu X, Di H, Li T, Jin X. QTL undercut field yield after physical information in mail (Zea Mays L.) [ J ]. Euphytoica, 2012,185(3): clasping 528.). Therefore, the method is very important for breeding corn varieties with low kernel water content during harvesting. In addition, the low grain water content can shorten the growth cycle of the corn, which has great production significance for harvesting before the frost period in high latitude areas in China and for wheat planting in Huang-Huai-Hai areas without being influenced.
Some QTLs for controlling the moisture content and dehydration rate of corn grains are obtained in some current researches and distributed on 10 chromosomes of corn, wherein the QTL located on the No.1 chromosome mainly comprises: q45dGM1-1, qHTGM1-1, qHTGM1-2 and qAUDDC1-1(Zhang J, Zhang F, Tang B, Ding Y, Xia L, Qi J, Mu X, Gu L, Lu D, Chen Y. molecular mapping of qualitative trap location for Grain model and field map analysis in mail (Zea maps L.) [ J ]. physical Plant,2020,169, 1):64-72), mQTL1-1, mQTL1-2, mQTL1-3 and mL 1-4 (DeqY, Doy, Yang M, Wang Q, Shi Q, Zhou Q, Deng F, Qin Z, Qio H. Q, yu H, Liu Y, Deng S, Liu Q, Liu B, Xu M.genetic separation of grain water content and purification rate related to mechanical harvest in mail [ J ]. BMC Plant Biol,2020,20(1): 118). Meanwhile, studies have been conducted to clone ZmGAR2(Li W, Yu Y, Wang L, Luo Y, Peng Y, Xu Y, Liu X, Wu S, Jian L, Xu J, Xiao Y, Yan J. the genetic architecture of the dynamic changes in grain mobility in mail [ J ]. Plant Biotechnol J,2021,19(6): 1195-. Different genes for controlling the character are cloned, more effective gene resources and molecular markers can be provided for molecular breeding, breeding and creating different rapidly dehydrated corn materials.
Therefore, the invention utilizes the corn related population and the linked population to clone and position a new gene ZmEIN2-1 for controlling the water content and the dehydration rate of corn kernels through correlation analysis and by combining with a map, identifies a molecular marker linked with the gene and the marker, can screen the water content or the dehydration rate of the corn kernels, and cultivates a corn variety with low water content and high dehydration speed.
Disclosure of Invention
The invention aims to provide a nucleic acid sequence of a gene ZmEIN2-1 influencing the water content character of corn kernels and an amino acid sequence coded by the same.
The second purpose of the invention is to provide 2 molecular markers closely linked with the moisture content character of corn grains: InDel _7/0 and InDel _ 303/000.
The invention also aims to disclose a method for identifying and screening the water content or dehydration rate character of the corn kernel by using the molecular marker.
The fourth purpose of the invention is to disclose a method for improving the water content or dehydration rate character of corn kernels.
In order to achieve the purpose, the invention adopts the following technical scheme:
the invention provides an application of protein in controlling the water content or dehydration rate character of corn kernels, which is characterized in that: the amino acid sequence of the protein is shown in any one of SEQ ID NO.1, SEQ ID NO.17 or SEQ ID NO. 18.
The invention also provides application of the nucleic acid in controlling the water content or dehydration rate character of corn kernels, and the nucleic acid is characterized in that the nucleic acid encodes the protein.
In some embodiments, the nucleotide sequence or reverse complement of the nucleic acid is as set forth in any one of SEQ ID No.2, SEQ ID No.3, SEQ ID No.4, SEQ ID No.13, SEQ ID No.14, SEQ ID No.15, or SEQ ID No. 16.
The invention also provides a molecular marker, which is characterized in that 7 bases AGTATCT are inserted into the position of 1044-1045 of the sequence shown in SEQ ID NO. 2; or 303 bases at position 2506 and 2808 of the sequence shown in SEQ ID NO.2
CTTGATCACCAATTCGCGAAAGGGCCTCTAGCTGAGTTGGTTAGGTGGTCTGAATAGCACTCCTCAGGTCCTGGGTTCGACTCCCCGTGGGAGCGAATTTCAGGCTGTGGTTAAAAAAATCCCCTCGTCTGTCCCACGCCAAAGCACAGGTCTAAGACTCAGCCCGGTCGTGGTCGTTCTCACATGGGCTTCGATGCCGCTGTGTATGGGTGGGGTAGGGGTTCGGGGGTTTTCTTGACCTGTGTGAGAAGGTATTTTTCTTAATACAATACCCGGGGCTGTCTTACCCCCCGCAGGTCAAGT。
The invention also provides a method for identifying or assisting in identifying the water content or dehydration rate character of corn grains, which is characterized by comprising the following steps of: (1) detecting the molecular marker in the material to be detected; (2) if the detection result is that the marker is included, the material to be detected shows the characteristics of high water content of grains or low dehydration rate; if the detection result is that the marker is not included, the material to be detected shows the character of low water content of the grains or high dehydration rate.
In some embodiments, the method of detecting the molecular marker described above employs PCR amplification.
In some embodiments, the primer pair used for the PCR amplification consists of SEQ ID NO.5/SEQ ID NO.6 or SEQ ID NO.9/SEQ ID NO. 10.
In some embodiments, the PCR amplification product represented by SEQ ID NO.7 or SEQ ID NO.11 comprising the marker described above; the PCR amplification product which does not contain the marker is shown as SEQ ID NO.8 or SEQ ID NO. 12.
The invention also provides a method for screening the corn material with the characteristics of low grain water content or high dehydration rate, which is characterized in that the molecular marker in the material to be detected is detected according to the method, and the material which does not contain the molecular marker is screened.
The invention also provides a method for reducing the water content of corn kernels or increasing the dehydration rate, which is characterized in that the expression and/or activity of the protein is increased in the corn material to be improved, and plants with low water content of corn kernels or high dehydration rate are selected.
In some embodiments, the method of increasing protein expression is by using a high activity promoter to drive expression of a nucleic acid sequence encoding the protein.
In some embodiments, the high activity promoter is a maize ubiquitin promoter.
In some embodiments, the maize ubiquitin promoter sequence is set forth in SEQ ID No. 19.
The invention also provides application of the molecular marker and the method in improving the water content or dehydration rate of corn kernels.
Compared with the prior art, the invention has the beneficial effects that: the ZmEIN2-1 gene and the protein coded by the gene have the function of regulating and controlling the water content or dehydration rate of corn grains, and the function of the gene is not reported in the prior published data. The invention also provides functional molecular markers InDel _7/0 and InDel _303/000 closely linked with ZmEIN2-1 and a detection method of the markers, which can specifically identify genotypes with different grain water contents or dehydration rates from a corn population, and carry out auxiliary identification and improvement on the grain water content or dehydration rate characters of the corn variety, thereby obtaining the corn variety with low water content or high dehydration rate. The water content of corn grains can be reduced and the dehydration rate can be improved by over-expressing the ZmEIN2-1 gene.
Drawings
FIG. 1 is a Manhattan diagram of whole genome association analysis of corn kernel moisture content change index (taking AUDDC _4_1 as an example). The vertical axis represents the p-value of each marker association analysis test, taken as-log 10; the horizontal axis represents the position of the chromosome. Arrows indicate target SNPs.
FIG. 2 is a QTL mapping chart of corn kernel moisture content variation index (taking BLUP _ AUDDC _4_3, 13HN _ AUDDC _5_4 and 14SY _ AUDDC _4_3 as examples). The vertical axis represents the LOD value for each marker association analysis test; the horizontal axis represents the position of the chromosome.
FIG. 3 is a fine mapping of ZmEIN2-1 gene.
FIG. 4 is a schematic diagram of the gene structure, molecular markers and target sites of ZmEIN 2-1. Triangles indicate the positions of the 2 molecular markers and the TE insertion site of the Mu mutant; arrows indicate target sites; ATG: a start codon; TAA: stop codon
FIG. 5 InDel _7/0 molecular marker causes protein structure change
FIG. 6 nucleic acid sequence and encoded protein structure of wild type ZmEIN2-1 gene and edited gene. WT: a wild type; KO: gene editing; target sequences are underlined; "-" indicates a base deletion.
FIG. 7 is a diagram of a ZmEIN2-1 gene overexpression vector.
FIG. 8 shows the results of gene expression levels of two transformation events overexpressing ZmEIN2-1 gene. WT: a wild type; OE: overexpression
Detailed Description
The following definitions and methods are provided to better define the present application and to guide those of ordinary skill in the art in the practice of the present application. Unless otherwise indicated, terms are to be understood in accordance with their ordinary usage by those of ordinary skill in the relevant art. All patent documents, academic papers, industry standards and other publications, etc., cited herein are incorporated by reference in their entirety.
As used herein, "maize" is any maize plant and includes all plant varieties that can be bred with maize, including whole plants, plant cells, plant organs, plant protoplasts, plant cell tissue cultures from which plants can be regenerated, plant calli, intact plant cells in plants or plant parts, such as embryos, pollen, ovules, seeds, leaves, flowers, branches, fruits, stems, roots, root tips, anthers, and the like. Unless otherwise indicated, nucleic acids are written from left to right in the 5 'to 3' direction; amino acid sequences are written from left to right in the amino to carboxy direction. Amino acids may be referred to herein by their commonly known three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical nomenclature Commission. Similarly, nucleotides may be represented by commonly accepted single-letter codes. Numerical ranges include the numbers defining the range. As used herein, "nucleic acid" includes reference to deoxyribonucleotide or ribonucleotide polymers in either single-or double-stranded form, and unless otherwise limited, includes known analogs (e.g., peptide nucleic acids) having the basic properties of natural nucleotides that hybridize to single-stranded nucleic acids in a manner similar to naturally occurring nucleotides. As used herein, the term "encode" or "encoded" when used in the context of a particular nucleic acid means that the nucleic acid contains the necessary information to direct translation of the nucleotide sequence into a particular protein. The information encoding the protein is represented using a codon. As used herein, "full-length sequence" in reference to a particular polynucleotide or protein encoded thereby refers to the entire nucleic acid sequence or the entire amino acid sequence having a native (non-synthetic) endogenous sequence. The full-length polynucleotide encodes the full-length, catalytically active form of the particular protein. The terms "polypeptide," "polypeptide," and "protein" are used interchangeably herein to refer to a polymer of amino acid residues. The term is used for amino acid polymers in which one or more amino acid residues is an artificial chemical analogue of a corresponding naturally occurring amino acid. The term is also used for naturally occurring amino acid polymers. The terms "residue" or "amino acid" are used interchangeably herein to refer to an amino acid that is incorporated into a protein, polypeptide, or peptide (collectively, "protein"). The amino acid can be a naturally occurring amino acid, and unless otherwise limited, can include known analogs of natural amino acids that can function in a similar manner as naturally occurring amino acids.
The term "trait" refers to a physiological, morphological, biochemical or physical characteristic of a plant or a particular plant material or cell. In some cases, this property is visible to the human eye, such as seed or plant size, or can be measured by biochemical techniques, such as detecting the protein, starch or oil content of the seed or leaf, or by observing metabolic or physiological processes, for example by measuring tolerance to water deprivation or specific salt or sugar or nitrogen concentrations, or by observing the expression levels of one or more genes, or by agronomic observations such as osmotic stress tolerance or yield.
By "transgenic" is meant any cell, cell line, callus, tissue, plant part or plant whose genome has been altered by the presence of a heterologous nucleic acid (such as a recombinant DNA construct). The term "transgene" as used herein includes those initial transgenic events as well as those generated by sexual crosses or asexual propagation from the initial transgenic events and does not encompass genomic (chromosomal or extra-chromosomal) alteration by conventional plant breeding methods or by naturally occurring events such as random cross-fertilization, non-recombinant viral infection, non-recombinant bacterial transformation, non-recombinant transposition or spontaneous mutation.
"plant" includes reference to whole plants, plant organs, plant tissues, seeds, and plant cells, and progeny of same. Plant cells include, but are not limited to, cells from seeds, suspension cultures, embryos, meristematic regions, callus tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen, and microspores. "progeny" comprises any subsequent generation of the plant.
In this application, the words "comprise", "comprising" or variations thereof are to be understood as embracing each and every element, number or step recited in addition to those elements, numbers or steps. By "test plant" or "test plant cell" is meant a plant or plant cell in which genetic modification has been effected, or a progeny cell of such a modified plant or cell, which progeny cell comprises the modification. The "control" or "control plant cell" provides a reference point for measuring the phenotypic change of the test plant or plant cell.
Negative or control plants may include, for example: (a) a wild-type plant or cell, i.e., a plant or cell having the same genotype as the starting material for the genetic modification to produce a test plant or cell; (b) plants or plant cells having the same genotype as the starting material but which have been transformed with an empty construct (i.e., a construct that has no known effect on the trait of interest, such as a construct comprising a target gene); (c) a plant or plant cell that is a non-transformed isolate of a subject plant or plant cell; (d) a plant or plant cell that is genetically identical to the subject plant or plant cell but that has not been exposed to conditions or stimuli that induce expression of the gene of interest; or (e) the subject plant or plant cell itself, under conditions in which the gene of interest is not expressed.
Those skilled in the art will readily recognize that advances in the field of molecular biology, such as site-specific and random mutagenesis, polymerase chain reaction methods, and protein engineering techniques, provide a wide range of suitable tools and procedures for engineering or engineering amino acid sequences and potential gene sequences of proteins of agricultural interest.
In some embodiments, changes may be made to the nucleotide sequences of the present application to make conservative amino acid substitutions. The principles and examples of conservative amino acid substitutions are further described below. In certain embodiments, substitutions that do not alter the amino acid sequence of the nucleotide sequences of the present application can be made in accordance with the codon preferences disclosed for monocots, e.g., codons encoding the same amino acid sequence can be substituted with monocot preferred codons without altering the amino acid sequence encoded by the nucleotide sequence. In some embodiments, a portion of the nucleotide sequence in this application is replaced with a different codon that encodes the same amino acid sequence, such that the nucleotide sequence is not altered while the amino acid sequence encoded thereby is not altered. Conservative variants include those sequences that, due to the degeneracy of the genetic code, encode the amino acid sequence of one of the proteins of the embodiments. In some embodiments, a partial nucleotide sequence herein is replaced according to monocot preferred codons. One skilled in the art will recognize that amino acid additions and/or substitutions are generally based on the relative similarity of the amino acid side-chain substituents, e.g., hydrophobicity, charge, size, etc., of the substituents. Exemplary amino acid substituent groups having various of the foregoing properties are known to those skilled in the art and include arginine and lysine; glutamic acid and aspartic acid; serine and threonine; glutamine and asparagine; and valine, leucine and isoleucine. Guidance regarding suitable amino acid substitutions that do not affect the biological activity of the Protein of interest can be found in the model of the Atlas of Protein sequences and structures (Protein Sequence and Structure Atlas) (Natl. biomed. Res. Foundation, Washington, D.C.) (incorporated herein by reference). Conservative substitutions such as exchanging one amino acid for another with similar properties may be made. Identification of sequence identity includes hybridization techniques. For example, all or part of a known nucleotide sequence is used as a probe for selective hybridization to other corresponding nucleotide sequences present in a population of cloned genomic DNA fragments or cDNA fragments (i.e., a genomic library or cDNA library) from a selected organism. The hybridization probes may be genomic DNA fragments, cDNA fragments, RNA fragments, or other oligonucleotides, and may be labeled with a detectable group such as 32P or other detectable marker. Thus, for example, hybridization probes can be prepared by labeling synthetic oligonucleotides based on the sequence of the embodiment. Methods for preparing hybridization probes and constructing cDNA and genomic libraries are generally known in the art. Hybridization of the sequences may be performed under stringent conditions. As used herein, the term "stringent conditions" or "stringent hybridization conditions" refers to conditions under which a probe will hybridize to its target sequence to a detectably greater degree (e.g., at least 2-fold, 5-fold, or 10-fold over background) than to other sequences. Stringent conditions are sequence dependent and differ in different environments. By controlling the stringency of hybridization and/or by controlling the washing conditions, it is possible to identify target sequences which are 100% complementary to the probes (homologous probe method). Alternatively, stringency conditions can be adjusted to allow some sequence mismatches in order to detect lower similarity (heterologous probe methods). Typically, the probe is less than about 1000 or 500 nucleotides in length. Typically, stringent conditions are conditions in which the salt concentration is less than about 1.5M Na ion, typically about 0.01M to 1.0M Na ion concentration (or other salt) at pH 7.0 to 8.3, and the temperature conditions are: when used with short probes (e.g., 10 to 50 nucleotides), at least about 30 ℃; when used with long probes (e.g., greater than 50 nucleotides), at least about 60 ℃. Stringent conditions may also be achieved by the addition of destabilizing agents such as formamide. Exemplary low stringency conditions include hybridization at 37 ℃ using 30% to 35% formamide buffer, 1M NaCl, 1% SDS (sodium dodecyl sulfate), washing at 50 ℃ to 55 ℃ in1 × to 2 × SSC (20 × SSC ═ 3.0M NaCl/0.3M trisodium citrate). Exemplary moderately stringent conditions include hybridization in 40% to 45% formamide, 1.0M NaCl, 1% SDS at 37 ℃ and washing in 0.5X to 1 XSSC at 55 ℃ to 60 ℃. Exemplary high stringency conditions include hybridization in 50% formamide, 1M NaCl, 1% SDS at 37 deg.C, and a final wash in 0.1 XSSC at 60 deg.C to 65 deg.C for at least about 20 minutes. Optionally, the wash buffer may comprise about 0.1% to about 1% SDS. The duration of hybridization is generally less than about 24 hours, and typically from about 4 hours to about 12 hours. Specificity usually depends on the post-hybridization wash, the critical factors being the ionic strength and temperature of the final wash solution. The Tm (thermal melting point) of a DNA-DNA hybrid can be approximated by the formula of Meinkoth and Wahl (1984) anal. biochem.138: 267-284: tm 81.5 ℃ +16.6(logM) +0.41 (% GC) -0.61 (% formamide) -500/L; where M is the molar concentration of monovalent cations,% GC is the percentage of guanosine and cytosine nucleotides in the DNA,% formamide is the percentage formamide of the hybridization solution, and L is the base pair length of the hybrid. The Tm is the temperature (under defined ionic strength and pH) at which 50% of a complementary target sequence hybridizes to a perfectly matched probe. Washing is typically performed at least until equilibrium is reached and a low background level of hybridization is achieved, such as for 2 hours, 1 hour, or 30 minutes. Decrease Tm by about 1 ℃ per 1% mismatch; thus, Tm, hybridization and/or wash conditions can be adjusted to hybridize to sequences of desired identity. For example, if a sequence with > 90% identity is desired, the Tm can be lowered by 10 ℃. Typically, stringent conditions are selected to be about 5 ℃ lower than the Tm for the specific sequence and its complement under defined ionic strength and pH. However, under very stringent conditions, hybridization and/or washing may be performed at 4 ℃ below the Tm; hybridization and/or washing may be performed at 6 ℃ below the Tm under moderately stringent conditions; under low stringency conditions, hybridization and/or washing can be performed at 11 ℃ below the Tm.
In some embodiments, fragments of the nucleotide sequences and the amino acid sequences encoded thereby are also included. As used herein, the term "fragment" refers to a portion of the nucleotide sequence of a polynucleotide or a portion of the amino acid sequence of a polypeptide of an embodiment. Fragments of the nucleotide sequences may encode protein fragments that retain the biological activity of the native or corresponding full-length protein, and thus have protein activity. Mutant proteins include biologically active fragments of the native protein that comprise contiguous amino acid residues that retain the biological activity of the native protein. Some embodiments also include a transformed plant cell or transgenic plant comprising the nucleotide sequence of at least one embodiment. In some embodiments, plants are transformed with an expression vector comprising at least one embodiment of the nucleotide sequence and operably linked thereto a promoter that drives expression in plant cells. Transformed plant cells and transgenic plants refer to plant cells or plants that comprise a heterologous polynucleotide within their genome. Generally, the heterologous polynucleotide is stably integrated within the genome of the transformed plant cell or transgenic plant such that the polynucleotide is transmitted to progeny. The heterologous polynucleotide may be integrated into the genome alone or as part of an expression vector. In some embodiments, the plants to which the present application relates include plant cells, plant protoplasts, plant cell tissue cultures from which plants can be regenerated, plant calli, plant clumps, and plant cells, which are whole plants or parts of plants, such as embryos, pollen, ovules, seeds, leaves, flowers, branches, fruits, nuts, ears, cobs, husks, stalks, roots, root tips, anthers, and the like. The present application also includes plant cells, protoplasts, tissues, calli, embryos, and flowers, stems, fruits, leaves, and roots derived from the transgenic plants of the present application or progeny thereof, and thus comprising at least in part the nucleotide sequences of the present application.
The term "amplification" in the context of nucleic acid amplification is any process in which additional copies of a selected nucleic acid (or a transcribed form thereof) are produced. Typical amplification methods include various polymerase-based replication methods, including Polymerase Chain Reaction (PCR), ligase-mediated methods such as Ligase Chain Reaction (LCR), and RNA polymerase-based amplification (e.g., by transcription) methods.
An allele is "associated with" a trait when it is linked to the trait, and when the allele present is an indication that the desired trait or trait form will occur in a plant containing the allele.
The term "quantitative trait locus" or "QTL" as used herein refers to a polymorphic locus having at least one allele associated with differential expression of a phenotypic trait in at least one genetic background (e.g., in at least one breeding population or progeny). QTLs can function through a monogenic or polygenic mechanism.
The term "QTL mapping" as used herein refers to the mapping of a QTL to a genetic map using methods similar to single gene mapping, and determining the distance (expressed as recombination rate) between the QTL and a genetic marker. According to the number of labels, there are several methods, including single label, double label and multiple label. According to different statistical analysis methods, the method can be divided into variance and mean analysis, regression and correlation analysis, moment estimation, maximum likelihood method and the like. The number of marked intervals can be divided into zero interval mapping, single interval mapping and multi-interval mapping. In addition, there are comprehensive analysis methods combining different methods, such as QTL Complex Interval Mapping (CIM) Multiple Interval Mapping (MIM), multiple QTL mapping, Multiple Trait Mapping (MTM), and the like.
The term "molecular marker" as used herein refers to a specific DNA fragment that reflects some difference in the genome between individual or population groups of an organism.
The term "major gene" as used herein refers to a gene that determines a trait from a single gene, referred to as a major gene, and the term "minor gene" as used herein refers to a gene that has only a partial effect on each of several non-alleles of a phenotype of the same trait, referred to as additive or polygenes. Each gene has only a small portion of the phenotypic effect in the additive genes and is therefore also referred to as a mini-gene.
The term "inbred line" as used herein refers to a line which has regular and consistent agronomic traits and simple genetic basis, obtained by selecting individual plants with good agronomic traits through several generations of continuous elimination of bad panicles under the condition of artificially controlled self-pollination.
The term "backcrossing" as used herein refers to a process in which a progeny and either of two parents are crossed.
The term "cross" or "crossed" as used herein refers to a gamete fusion (e.g., cell, seed, or plant) that produces progeny through pollination. The term includes sexual crosses (pollination of one plant by another) and selfing (self-pollination, e.g., when the pollen and ovule are from the same plant). The term "crossing" refers to the act of fusion of gametes via pollination to produce progeny.
The term "backcrossing" as used herein refers to a process in which progeny of a cross are repeatedly backcrossed to one of the parents. In a backcrossing scheme, the "donor" parent refers to the parent plant having the desired gene or locus to be introgressed. The "recipient" parent (used one or more times) or "recurrent" parent (used two or more times) refers to the parent plant into which the gene or locus has been introgressed. Initial hybridization yields F 1 Generation; then, the term "BC 1 "indicates the second use of recurrent parent," BC 2 "refers to the rotation of the parent for a third use, etc.
The term "closely linked" as used herein means that recombination between two linked loci occurs at a frequency of equal to or less than about 10% (i.e., the frequency of separation on the genetic map does not exceed 10 cM). In other words, closely linked loci co-segregate in at least 90% of the cases. Marker loci are particularly useful in the present invention when they exhibit a significant probability of co-segregation (linkage) with a desired trait (e.g., pathogen resistance). Closely linked loci such as a marker locus and a second locus can exhibit a recombination frequency within the locus of 10% or less, preferably about 9% or less, more preferably about 8% or less, more preferably about 7% or less, more preferably about 6% or less, more preferably about 5% or less, more preferably about 4% or less, more preferably about 3% or less, more preferably about 2% or less. In highly preferred embodiments, the cognate locus exhibits a recombination frequency of about 1% or less, such as about 0.75% or less, more preferably about 0.5% or less, more preferably about 0.25% or less. Two loci that are located on the same chromosome and that are separated by a distance such that recombination between the two loci occurs at a frequency of less than 10% (e.g., about 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.75%, 0.5%, 0.25%, or less) are also said to be "close to" each other. In some cases, two different markers can have the same genetic map coordinates. In that case, the two tags are close enough to each other that recombination between them occurs at a frequency too low to be detected.
Centimorgans ("cM") is a measure of the frequency of recombination. 1cM equals 1% of the probability that a marker at one locus will separate from a marker at a second locus by a single generation hybridization.
A "favorable allele" is an allele at a particular locus that confers or contributes to an agronomically desirable phenotype, such as increased moisture content of corn kernels, and allows for the identification of plants having an agronomically desirable phenotype. A "favorable" allele of a marker is a marker allele that cosegregates with a favorable phenotype.
A "genetic map" is a description of the genetic linkage between loci on one or more chromosomes in a given species, typically depicted in a graphical or tabular format. For each genetic map, the distance between loci is measured by the frequency of recombination between them, and recombination between loci can be detected using a variety of markers. Genetic maps are the product of the mapped population, the type of marker used, and the polymorphic potential of each marker across different populations. The order and genetic distance between loci may differ from one genetic map to another. However, a generic box using common labels can associate information of one map with another map. One of ordinary skill in the art can use a framework of common markers to identify marker locations and loci of interest on the genetic map of each individual.
A "genetic map location" is a location on a genetic map on the same linkage group relative to surrounding genetic markers where a given marker can be found in a given population.
"Gene mapping" is a method of defining linkage relationships of loci by using standard genetic principles of genetic markers, population segregation of markers, and recombination frequency.
"genetic recombination frequency" is the frequency of crossover events (recombination) between two loci. Recombination frequency can be observed after segregation of the marker and/or post-meiotic trait.
The term "genotype" is the genetic makeup of an individual (or group of individuals) at one or more loci, as contrasted with an observable trait (phenotype). The genotype is defined by the alleles of one or more known loci that the individual has inherited from its parent. The term genotype may be used to refer to the genetic makeup of an individual at a single locus, the genetic makeup at multiple loci, or more generally, the term genotype may be used to refer to the genetic makeup of all genes of an individual in their genome.
"germplasm" refers to an individual (e.g., a plant), a group of individuals (e.g., a line, variety, or family of plants), or cloned or derived genetic material from a line, variety, species, or culture. The germplasm may be part of an organism or cell, or may be isolated from an organism or cell. Germplasm generally provides the genetic material with a specific molecular makeup that provides the physical basis for some or all of the genetic traits of an organism or cell culture. As used herein, germplasm includes cells, seeds, or tissues from which new plants can be grown, or plant parts such as leaves, stems, pollen, or cells, which can be cultured into whole plants.
A "marker" is a nucleotide sequence or its encoded product (e.g., a protein) that serves as a reference point. For markers to be used for detecting recombination, they require detection of differences or polymorphisms within the population being monitored. For molecular markers, this means that differences at the DNA level are due to polynucleotide sequence differences (e.g. SSR, RFLP, FLP, and SNP). Genomic variability can be of any origin, such as insertions, deletions, duplications, repetitive elements, point mutations, recombination events, or the presence and sequence of transposable elements. Molecular markers may be derived from genomic or expressed nucleic acids (e.g., ESTs) and may also refer to nucleic acids used as probes or primer pairs capable of amplifying sequence fragments using PCR-based methods.
Markers corresponding to genetic polymorphisms between members of a population can be detected by methods established in the art. These methods include, for example, DNA sequencing, PCR-based sequence-specific amplification methods, restriction fragment length polymorphism detection (RFLP), isozyme marker detection, polynucleotide polymorphism detection by allele-specific hybridization (ASH), amplified variable sequence detection of plant genomes, autonomous sequence replication detection, simple repeat sequence detection (SSR), single nucleotide polymorphism detection (SNP), or amplified fragment length polymorphism detection (AFLP). Established methods are also known for detecting Expressed Sequence Tags (ESTs) and SSR markers derived from EST sequences, as well as Randomly Amplified Polymorphic DNA (RAPD).
A "marker allele" or "allele of a marker locus" can refer to one of a plurality of polymorphic nucleotide sequences located at a marker locus in a population that is polymorphic with respect to the marker locus.
A "marker probe" is a nucleic acid sequence or molecule that can be used to identify the presence or absence of a marker locus by nucleic acid hybridization, e.g., a nucleic acid molecular probe that is complementary to a sequence of a marker locus. Labeled probes comprising 30 or more contiguous nucleotides of a marker locus (all or part of a "marker locus sequence") can be used for nucleic acid hybridization. Alternatively, in some aspects a molecular probe refers to any type of probe that is capable of distinguishing (i.e., genotype) a particular allele present at a marker locus.
As noted above, the term "molecular marker" may be used to refer to a genetic marker, or its encoded product (e.g., a protein) that serves as a point of reference when identifying linked loci. The tag can be derived from a genomic nucleotide sequence or from an expressed nucleotide sequence (e.g., from spliced RNA, cDNA, etc.), or from an encoded polypeptide. The term also refers to nucleic acid sequences that are complementary to or flanked by marker sequences, such as nucleic acids that are used as probes or primer pairs capable of amplifying the marker sequences. A "molecular marker probe" is a nucleic acid sequence or molecule that can be used to identify the presence or absence of a marker locus, e.g., a nucleic acid probe that is complementary to a marker locus sequence. Alternatively, in some aspects a molecular probe refers to any type of probe that is capable of distinguishing (i.e., genotype) a particular allele present at a marker locus. Nucleic acids are "complementary" when they specifically hybridize in solution, for example, according to the Watson-Crick base-pairing rules. Some of the markers described herein are also referred to as hybridization markers when located in regions of indels, such as the non-collinear regions described herein. This is because the insertion region is a polymorphism with respect to a plant having no insertion. Thus, the marker need only indicate the presence or absence of the indel region. Any suitable label detection technique may be used to identify such hybridization labels, e.g., KASP technique, PCR amplification.
The invention locates a gene ZmEIN2-1 which influences the water content and dehydration rate traits of corn kernels from a corn associated population, the gene is positioned at the position of a reference genome Chromosome1:270891072 and 270897163 of B73V 5 version, and the genome sequence is shown as SEQ ID NO. 2. By determining the transcript of the ZmEIN2-1 gene, the nucleotide sequence of the coding region of the gene and the sequence of the encoded protein were determined.
The invention further analyzes the variation sites in the corn material represented by different kernel water contents, finds 2 variation sites (InDel _7/0 and InDel _303/000) linked with characters, wherein InDel _7/0 is positioned at the position of 1045-1051 of a sequence shown in SEQ ID NO.2 and represents the insertion of 7 bases (AGTATCT); InDel _303/000 is located at position 2513-2815 of the sequence shown in SEQ ID NO.2 and shows 303-base insertion deletion.
Based on InDel _7/0 and InDel _303/000 sites, the invention develops a molecular marker detection method, can identify the genotypes of InDel _7/0 and InDel _303/000, and identifies or assists in identifying the water content or dehydration rate character of corn kernels according to the genotype identification result.
The invention provides an embodiment of a method for driving over-expression of ZmEIN2-1 gene by using a maize ubiquitin promoter, reducing the water content of maize grains or improving the dehydration rate.
The following examples are intended to illustrate the invention but are not intended to limit the scope of the invention. Modifications or substitutions to methods, steps or conditions of the present invention may be made without departing from the spirit and scope of the present invention. Unless otherwise indicated, the examples follow conventional experimental conditions, such as the Molecular cloning laboratory Manual of Sambrook et al (Sambrook J & Russell D W, Molecular cloning: a laboratory Manual,2001), or the conditions as recommended by the manufacturer's instructions. Unless otherwise specified, the chemical reagents used in the examples are all conventional commercially available reagents, and the technical means used in the examples are conventional means well known to those skilled in the art.
Example 1 cloning of corn grain moisture Gene
The invention utilizes the association group constructed in the laboratory (the construction method of the association group refers to Yang X, Gao S, Xu S, Zhang Z, Prasanna B.M., Li L, Li J, Yan J.Characterisation of a global dental collection and an item potential evaluation for analysis of complex quantitative traitments in the main [ J]Molecular Breeding,2011,28(4): 511-526.). Through the measurement of the moisture content of the corn kernels in 5 environments, the phenotypic values of the moisture content (GM) of the corn kernels in 5 stages are obtained: respectively 34 days, 40 days, 46 days, 52 days and 58 days after pollination, recording34DAP, 40DAP, 45DAP, 52DAP, 58 DAP. And simultaneously calculating BLUP phenotypic values of 5 environments by using an optimality unbiased estimation method (BLUP), and calculating a grain water content change index (AUDDC) by using the phenotypic values, wherein the evaluation method refers to a method to estimate a rate of dry in main [ J ] of]Crop sci.,2010,50(6): 2347-2354), denoted as AUDDC _2_1, AUDDC _3_2, AUDDC _4_3, AUDDC _5_4, AUDDC _3_1, AUDDC _4_2, AUDDC _5_3, AUDDC _4_1, AUDDC _5_2, AUDDC _5_1 (where the numbers indicate the moisture content of the grains for 5 stages, for example: AUDDC — 2 — 1 is the AUDDC value between the second kernel water content (40DAP) and the first kernel water content (34 DAP). Genotype data using the 1.25M SNP marker of the related population and the population structure and genetic relationship deduced from the genotype (genotype data from Liu H, Luo X, Niu L, Xiao Y, Chen L, Liu J, Wang X, Jin M, Li W, Zhang Q, Yan J.Distancet eLs QT and Non-coding Sequences Play Critical Roles in Regulation Gene Expression and Quantitative transfer Variation in Maize [ J J.J. ]]414-426) using a mixed linear model, performing genome-wide association analysis on the 15 personality, with a threshold P of 2.0 × 10 -6 Under the standard of (3), SNP sites chr1.S _264272396 and BLUP _ GM (BLUP _34_ DAP, BLUP _46_ DAP), BLUP _ AUDDC (BLUP _ AUDDC _4_1, BLUP _ AUDDC _3_1, BLUP _ AUDDC _5_1, BLUP _ AUDDC _4_2, BLUP _ AUDDC _2_1, BLUP _ AUDDC _4_3, BLUP _ AUDDC _3_2, BLUP _ AUDDC _5_2) were identified,
the 14JL _ GM (14JL _40_ DAP, 14JL _46_ DAP) and 14JL _ BLUP (14JL _ AUDDC _3_1, 14JL _ AUDDC _2_1, 14JL _ AUDDC _3_2, 14JL _ AUDDC _4_1, 14JL _ AUDDC _4_2, 14JL _ AUDDC _5_1) traits were significantly correlated (Table 1; FIG. 1). Meanwhile, the RIL group constructed by K22 XDan 340 is utilized by the inventor, the phenotype data collection method and the correlation analysis experimental part are utilized, and then a QTL site is positioned by utilizing a high-density genetic linkage map (Pan Q, Li L, Yang X, Tong H, Xu S, Li Z, Li W, Muehlbauer GJ, Yan J, genome-side recombination dynamics associated with in main [ J ] New Phytol,2016,210(3): 1083) and 1094), the main effect QTL positioned in chromosome 1.09 of No.1 DDC, the characters positioned to the QTL mainly comprise HN-GM (13 HN-52, 13 HN-58 DAP), 13 _ AUDDC (13 DDC-5 _ DDC, 13 _ HN-5 _4), 14-14 DDC (14 HN-14 DDC-14, 14 _ 14 DDC _ 14 DDC _3_ 13 DDC _ 13 DDC _3_ 13 DDC _ 13 DDC _ 13 DDC _3_ 13 _4, 13 _ DDC _3_ 13 _ DDC _3_ DDC _4, 14SY _ AUDDC _4_3, 14SY _ AUDDC _5_1, 14SY _ AUDDC _5_2, 14SY _ AUDDC _5_3, 14SY _ AUDDC _5_4), 14WH _ AUDDC (14WH _ AUDDC _5_2, 14SY _ AUDDC _3_1) and BLUP _ AUDDC (BLUP _ AUDDC _4_3) (table 2; FIG. 2), the SNP associated with the QTL and the association analysis is the same site and is named qDR 1-2. The QTL interval is narrowed to 60kb by map-based cloning, two genes are arranged in the interval (figure 3), and the gene with the most significant SNP in association analysis is taken as a candidate gene and is named as ZmEIN 2-1.
TABLE 1 significant SNP (Chr1. S-264272396) and trait Association analysis results
TABLE 2 QTL (qDR1-1) basic information
Example 2 Gene Structure and functional site analysis
The ZmEIN2-1 gene is numbered Zm00001eb054060 in the B73 reference genome, and is positioned in Chr1:270891072-270897163 for a total of 6092 bp. The sequence is shown as SEQ ID NO. 2. The function of the gene is annotated as ethylene insensitivity, and no research has shown that the function is related to the dehydration rate of corn. The ZmEIN2-1 gene has 3 transcripts in total, and the sequences are shown as SEQ ID NO.3 (transcript T1), SEQ ID NO.13 (transcript T2) and SEQ ID NO.15 (transcript T3). The transcript 1 contains 8 exons, 6 of the exons are coding exons (the structure is shown in figure 4), the sequence of a coding region is shown as SEQ ID NO.4, and the sequence of coded amino acid is shown as SEQ ID NO. 1.
Through sequencing and PCR identification of the whole gene region of different character expression materials in the related population, the gene is found to have 2 variation sites. The first mutation site is located at the position of Chromosome1: 270892115-270892116 (sequence 1044-1045 shown in SEQ ID NO. 2), and a 7bp insertion deletion is formed on the 1 st exon: AGTATCT (position shown in FIG. 4). Insertion of 7 bases into this position leads to premature termination of gene transcription, which is significantly associated with the moisture content trait. Therefore, the variation can be developed into a molecular marker InDel _7/0 for assisting in identifying the moisture content and the dehydration rate of the corn kernel. Wherein, the corn kernel containing 7 basic groups of genotype InDel _7 has higher water content and slower dehydration; corn kernels of genotype InDel _0 without 7 bases have lower moisture and faster dehydration.
The second mutation site was located at the position Chromosome1:270893584 and 270893886(SEQ ID NO.2, SEQ ID NO. 2506 and 2808) and was a 303bp indel between the 4 th and 5 th exons:
CTTGATCACCAATTCGCGAAAGGGCCTCTAGCTGAGTTGGTTAGGTGGTCTGAATAGCACTCCTCAGGTCCTGGGTTCGACTCCCCGTGGGAGCGAATTTCAGGCTGTGGTTAAAAAAATCCCCTCGTCTGTCCCACGCCAAAGCACAGGTCTAAGACTCAGCCCGGTCGTGGTCGTTCTCACATGGGCTTCGATGCCGCTGTGTATGGGTGGGGTAGGGGTTCGGGGGTTTTCTTGACCTGTGTGAGAAGGTATTTTTCTTAATACAATACCCGGGGCTGTCTTACCCCCCGCAGGTCAAGT (position shown in figure 4). This marker was found to be significantly associated with the phenotype BLUP _ AUDDC _4_1 by T-test analysis (Table 3). Therefore, the variation can be developed into a molecular marker InDel _303/000 for assisting in identifying the moisture content and the dehydration rate of the corn kernel. Wherein, the corn kernel containing the genotype InDel _303 with 303 basic groups has higher water content and slower dehydration; the corn kernel of the genotype InDel _000 without 303 basic groups has lower moisture and faster dehydration.
Table 3 molecular marker InDel _303/000 affects corn kernel moisture and its changes (BLUP _ AUDDC _4_1)
The higher the AUDDC _4_1 value, the higher the moisture content of the kernel, the lower the dehydration rate.
Example 3 corn kernel water content related molecular marker detection method
The molecular markers can be identified using PCR methods based on the genomic sequence near the positions InDel _7/0 and InDel _ 303/000. The primer pair used for PCR identification at position InDel _7/0 is shown in SEQ ID NO.5 and SEQ ID NO. 6. The primer pair used for PCR identification at position InDel _303/000 is shown in SEQ ID NO.9 and SEQ ID NO. 10.
The molecular marker detection adopts the following method:
(1) extracting corn genome DNA:
1. about 1.5g of corn leaves are ground in liquid nitrogen and transferred into a 2mL centrifuge tube.
2. Add 750. mu.l CTAB extraction buffer pre-heated to 65 ℃ and mix rapidly. Water bath is carried out in a water bath kettle at 65 ℃ for about 30 minutes, and the centrifuge tube is gently shaken for 2-3 times from time to time in the middle.
3. Taking out the centrifuge tube, adding equal volume of chloroform: isoamyl alcohol (24: 1), shake the tube on a shaker for 10 minutes until the solution separates into layers with the lower layer being dark green and the upper layer being pale yellow.
4. Centrifuge at 12000rpm for 10 minutes at room temperature and transfer the supernatant to a 1.5mL centrifuge tube.
5. To the supernatant was added 2/3 volumes of pre-cooled isopropanol and carefully mixed. Placing the mixture into a freezer with the temperature of-20 ℃ for 30 minutes.
6. Then, the mixture was centrifuged at 12000rpm for 10 minutes at 4 ℃.
7. The supernatant was decanted, and 1mL of 75% ethanol was added and soaked for 5 minutes. The washing was repeated once more. Then the liquid was poured off, the centrifuge tube was left for 30min, dried at room temperature, and 200. mu.l of water was added to dissolve the DNA.
8. The DNA mass was checked with 1% agarose and the DNA concentration was determined. The DNA was stored in a freezer at-20 ℃ until use.
(2) The PCR system and procedure were:
and (3) PCR system:
PCR procedure:
(3) gel imaging
Detection on 1% agarose gel.
The primer pair used for PCR at position InDel _7/0 is shown as SEQ ID NO.5 and SEQ ID NO. 6. Amplifying a 84bp band (shown as SEQ ID NO. 7) by using the material with the genotype of InDel _7, wherein the materials show that the water content of grains is high and the dehydration rate is slow; 77bp bands (shown as SEQ ID NO. 8) are amplified from the material with the genotype of InDel _0, and the material shows that the moisture content of grains is low and the dehydration rate is high.
The primer pair used at position InDel _303/000 is shown in SEQ ID NO.9 and SEQ ID NO. 10. Amplifying a band (the sequence is shown as SEQ ID NO. 11) of 935bp by using the material with the genotype of InDel _303, wherein the materials show that the grains have high water content and slow dehydration rate; the 632bp band (shown as SEQ ID NO. 12) is amplified from the material with the genotype of InDel _000, and the materials show that the moisture content of grains is low and the dehydration rate is high.
The method is used for detecting a plurality of self-bred line materials, and the result shows that the detection of the marker can well distinguish the water content or dehydration rate characters of grains among different materials. The results of the partial material marker tests and the data on the dehydration behavior are shown in Table 4.
TABLE 4 detection results of some materials for markers and dehydration behavior
Example 4 inactivation of ZmEIN2-1 protein, alteration of corn kernel moisture content and dehydration Rate
The ZmEIN2-1 gene has the function of controlling the water content and the dehydration rate of corn kernels, and in order to further verify the regulation and control mode of the gene on the water content and the dehydration rate of the kernels, the gene is knocked out by using a CRISPR-Cas9 gene editing technology. The gene encodes a protein containing an Nramp-like domain predicted by the protein structure (FIG. 6). Two targets, GTAATGTTCCCCAAGTTGAG and GCAGGTTCAGAGAAGATTTC (target positions see fig. 4), were designed at the 5 th and 6 th exons, respectively, with respect to the genomic sequence, sgrnas (sgRNA _1 and sgRNA _2) were synthesized, and a gene editing vector was constructed. Transformation of the maize variety KN5585 with the vector resulted in the successful gene-edited transformant which lost 714 bases and at the same time inserted 76 bases and prematurely terminated gene translation, resulting in the loss of the protein domain, and the edited protein structure is shown in FIG. 6. The gene editing material investigates the property data such as AUDDC and water content change and the like under two environments of Hainan Sansui and Jilin princess mountains, and proves that the water content of corn grains is increased and the dehydration rate is slowed down after the gene editing (tables 5 and 6).
TABLE 5 influence of ZmEIN2-1 Gene editing on corn kernel moisture changes
WT: a wild type; KO: gene editing
TABLE 6 influence of ZmEIN2-1 Gene editing on corn grain moisture content
WT: a wild type; KO: gene editing
In addition, a mutant in which transposon was inserted into the 5' UTR region of EIN2-1 gene was obtained by screening a maize mutant library (FIG. 4). The mutant and the wild material have differences in phenotype, the water content of the mutant grains is high, and dehydration is slow (tables 7 and 8).
TABLE 7 ZmEIN2-1 Gene mutant affecting corn kernel moisture changes
WT: a wild type; MU: mutants
TABLE 8 ZmEIN2-1 Gene mutant affecting corn grain moisture content
WT: a wild type; MU: mutants
Example 5 overexpression of EIN2-1 Gene reduces corn grain Water content and increases dehydration Rate
After the EIN2-1 gene is knocked out, the water content of corn kernels is increased, and dehydration is slowed. Therefore, in order to reduce the water content of the corn kernels and improve the dehydration rate of the corn kernels, the gene can be overexpressed.
Overexpression can select a strong promoter (such as ubiquitin, actin, 35S and the like) to drive the EIN2-1 gene (the genome sequence shown in SEQ ID NO.2, or the cDNA sequence shown in SEQ ID NO.3, SEQ ID NO.13 or SEQ ID NO.15, or the coding region sequence shown in SEQ ID NO.4, SEQ ID NO.14 or SEQ ID NO. 16) to express. In this example, a maize strong promoter ubiquitin (SEQ ID NO.19) was selected to drive a segment of genome sequence (specifically, the sequence at position 210-4530 of SEQ ID NO. 2) containing a coding region to express, and a overexpression vector was constructed (the vector map is shown in FIG. 7). The overexpression vector is transformed into a maize inbred line KN5585, obtained transformed seedlings are detected, two transformation events ZMEIN2-1# OE1 and ZMEIN2-1# OE2 with stable overexpression of the target genes are identified, and the results of gene expression amount are shown in figure 8. After investigating the water content and dehydration rate of the kernels, the water content of the over-expressed corn plant kernels is reduced, and the dehydration rate is increased (tables 9 and 10).
TABLE 9 influence of ZmEIN2-1 gene overexpression on corn kernel moisture change
OE 1: overexpression Material 1; WT 1: wild type control 1; OE 2: overexpression Material 2; WT 2: wild type control 2.
TABLE 10 influence of maize grain water content by ZmEIN2-1 gene overexpression
OE 1: overexpression of 1; WT 1: wild type control 1; OE 2: overexpression Material 2; WT 2: wild type control 2; na: and (4) not detecting.
Although the invention has been described in detail with respect to the general description and the specific embodiments thereof, it will be apparent to those skilled in the art that modifications and improvements can be made based on the invention. Accordingly, it is intended that all such modifications and alterations be included within the scope of this invention as defined in the appended claims.
Sequence listing
<110> university of agriculture in Huazhong
<120> gene ZmEIN2-1 for controlling water content of corn kernels and molecular marker thereof
<130> 1
<160> 19
<170> SIPOSequenceListing 1.0
<210> 1
<211> 476
<212> PRT
<213> Zea mays
<400> 1
Met Asp Gly Val Arg Cys Met Glu Ser Pro Ala Ala Gly Asp Ala Pro
1 5 10 15
His Ser Ser Phe Arg Thr Leu Gly Pro Thr Leu Leu Val Ser Met Gly
20 25 30
Tyr Leu Asp Phe Gly Lys Trp Leu Val Ala Val Glu Ala Gly Ser Arg
35 40 45
Phe Gly Tyr Asp Leu Val Leu Leu Val Leu Leu Phe Asn Leu Ser Ala
50 55 60
Ile Leu Cys Gln Tyr Leu Ser Ser Cys Ile Gly Ile Val Thr Lys Lys
65 70 75 80
Asn Leu Ala Glu Ile Cys His Gln Glu Tyr Asp Gln Lys Ile Cys Val
85 90 95
Val Leu Gly Leu Gln Ala Gly Leu Ser Leu Leu Thr Ser Glu Phe Thr
100 105 110
Met Ile Ala Gly Ile Val Val Gly Phe Asn Leu Val Phe Glu Ser Asn
115 120 125
Asn Pro Ile Thr Val Val Cys Phe Thr Ser Val Val Val Asn Leu Leu
130 135 140
Pro Tyr Thr Leu Ser Leu Leu Asp Lys Arg Lys Ala Gly Met Phe Asn
145 150 155 160
Ala Tyr Ala Ser Gly Phe Thr Leu Val Cys Phe Val Leu Gly Leu Leu
165 170 175
Val Ser His Pro Lys Thr Pro Val Asn Thr Asn Val Met Phe Pro Lys
180 185 190
Leu Ser Gly Glu Ser Ala Tyr Ser Leu Met Val Leu Leu Gly Thr Asn
195 200 205
Ile Ile Val His Asn Phe Tyr Thr His Ser Ser Val Val Gln Val Gln
210 215 220
Arg Arg Phe Gln Gly His Thr Leu Gly Thr Leu Phe His Asp His Leu
225 230 235 240
Phe Ser Val Leu Phe Ala Phe Ser Gly Ile Phe Leu Val Asn Tyr Ile
245 250 255
Leu Met Ser Ser Ala Val Asp Glu Ser Lys Asn Thr Met Ala Ile Asn
260 265 270
Phe Gln Asp Ala Arg Gln Leu Met Asn Gln Val Phe Thr Ser Pro Val
275 280 285
Ala Pro Ile Val Leu Leu Val Val Leu Leu Phe Ser Gly His Ile Val
290 295 300
Ser Met Thr Cys Ile Ile Gly Ser Asp Val Ile Ser Glu Asp Leu Phe
305 310 315 320
Gly Ile Lys Leu Pro Leu Phe Val His His Leu Leu Pro Lys Val Phe
325 330 335
Ala Met Ile Thr Thr Ile Tyr His Ala Lys Val Val Gly Phe Glu Gly
340 345 350
Leu Tyr Gln Leu Leu Met Val Cys Pro Ile Ile Gln Ala Met Leu Leu
355 360 365
Pro Ser Ser Val Ile Pro Val Phe Arg Ile Ser Ser Ser Arg Ser Leu
370 375 380
Met Gly Arg Tyr Arg Ile Ser Arg Cys Val Glu Ile Leu Cys Phe Leu
385 390 395 400
Ala Phe Leu Leu Thr Leu Phe Thr Asn Ile Ile Phe Val Ala Glu Val
405 410 415
Leu Phe Gly Asp Ser Thr Trp Thr Asn Asp Leu Lys Gly Asn Thr Glu
420 425 430
Asn Pro Ile Leu Leu Pro Tyr Ile Val Val Val Leu Ile Ser Cys Gly
435 440 445
Ser Ile Gly Phe Ala Leu Phe Leu Ala Val Thr Pro Leu Lys Ser Ala
450 455 460
Cys Asn Glu Ala Glu Arg Ala Leu Ser Val His Ser
465 470 475
<210> 2
<211> 6091
<212> DNA
<213> Zea mays
<400> 2
tagcttgtaa gtttgcatct gattgcacaa ccgttgttcc cgtgcgtggg ggactgcggc 60
tcgttcttgg atgaggttct gacttctgag gaggatgggg atggctcacg gcgccgtgac 120
gcgacgatcg gcgtcggcag cggcgcgtcc ctgctgtccc tcctgcatgc ttgccaccca 180
gcttcgcctc gccacgcgtc tcgtggacca ttgcgcgcgt ccctcgtgcc acggcgcacc 240
cgcagccatg agtgccagcg ctcccgtctc ccgactcccg agcaaatcgc tcggcgccta 300
ctactagccc gaaacctagg tggtgggaac tcgggatacg aggtcctctt ctccgccgct 360
tgtcctcggc aagcacaaac gcggtaagcc accaaccgcc tttcgatcgc cgggctccat 420
ccctaagcca tctcgccgcc tcgtgagtcg tgagccgtga gccaaccgac ctgccatcat 480
caattagtga cttcctgaac ctgcttcgtg cgtcctacca ctccaaagtt ccaacaatca 540
agagatttaa acaaagcctt ccataacttg ttgccaataa ggtaactgtc actattgtgc 600
tgctcatcct ttccatctaa gagaaaactt tactgtgtcc tgtgcatact cctctttcat 660
ccgtaactct ccgtaaccat ttgcgggaaa taatgtaaag ttcattctcg gcaaagtatt 720
gtggcgtgtg atggatagag cttaaatcag gagcgtcttc atattgttgt aattggctcg 780
ttgcagtgca ggatttggag gccactgggc tttggaaatt caaagtagac tgcttggatc 840
ataacgagag agcacaaagg gcctgggttt tgtaaacagt tggctttcac tagagtcgat 900
ccattccata ctgagctttt gcatatcatc aattcatcac acaatggatg gtgtgcgatg 960
catggagtcc ccagctgctg gtgatgcccc gcatagttct ttccgaaccc ttgggccaac 1020
actcctggtt tcaatggggt atcttgactt cggaaaatgg ttggtggcag tggaagctgg 1080
gtctcggttt ggctatgatc ttgtcctgct cgtgctgctt tttaatctat cagctattct 1140
gtgtcagtac ctctcgagct gtatcggcat tgtcaccaaa aagaatcttg cggaggtaaa 1200
aatttactca gtcctctctt gccattgtaa actgtagaat tttcaccaga actacggtgt 1260
ggagatctca gttgataaga atggtgattc attaggatag ggtgggccca ccataaattt 1320
ctgtacccgt aatattggtg ttaatttgtt ttgcgtttat gagagatgtc tggggaaaaa 1380
aagctcatgg agtcatggca tcctaactcc taagcgttgt agctatgctg tcgtacttct 1440
gttgaaaaaa tgccatcatc gttcaaatga aaacatcaga tatccacatg tgacaacaat 1500
agaagtatcc cttttatttt acattttctt catctatttt caacagtgtc ttgtgcagat 1560
cttgacttct ttcttgttat atttgtacat gaacaaatag atttgccacc aggagtatga 1620
tcagaaaata tgtgttgttc ttggtcttca agcaggactg tccttgctaa cttctgaatt 1680
taccatggta tgtctacaaa tacttcagaa aatccatttt ctgatgatgg ttgatcttgg 1740
acttttatgt gctcatgata catttgcttg caccattcct aattaacaga gcatgtaagc 1800
agctcggtac atttgaaaca cttagggctg gtttggtgac agtcacctaa ccagcactta 1860
acattttcaa gagaaaaaca cttaacatat gatacttttc agattgcagg catcgtagtt 1920
gggttcaacc ttgtatttga aagcaacaat cccatcacag tcgtatgttt tacaagtgtt 1980
gtagttaatc tgctaccata tacactctcc cttctggtaa atttataagt ctctccaaag 2040
aactgtatat aaaagctgac ttgctctaaa acattttata atttattcta gttcctatat 2100
ttgtgctgat tgtaggacaa gaggaaggct ggaatgttta atgcctacgc atctggcttt 2160
acactagttt gttttgtgct tggtttatta gtgagccatc ccaaaacccc tgtcaatacc 2220
aacgtaatgt tccccaagtt gagtggtgaa agtgcttact cactgatggt actacttggc 2280
acaaatataa tagtacacaa cttttatact cattcatcag ttgttcaggt aaatttctga 2340
caattccatt cccatgttta tgcaattagc tttgtcaagc atgtcttcca ttaaaagttt 2400
gtcttatcaa acctgaatgc atctttcatg gcgaaagtga tgttttgtgt taggattatg 2460
tacaaaaaaa tggtttatgt tagggtactg agtgtattgg ttgttcttga tcaccaattc 2520
gcgaaagggc ctctagctga gttggttagg tggtctgaat agcactcctc aggtcctggg 2580
ttcgactccc cgtgggagcg aatttcaggc tgtggttaaa aaaatcccct cgtctgtccc 2640
acgccaaagc acaggtctaa gactcagccc ggtcgtggtc gttctcacat gggcttcgat 2700
gccgctgtgt atgggtgggg taggggttcg ggggttttct tgacctgtgt gagaaggtat 2760
ttttcttaat acaatacccg gggctgtctt accccccgca ggtcaagttt tgatcaccaa 2820
ttcaccatag gggtgctgaa tgcactgagt tgttcttgat tcattctttg ctagtttggt 2880
ctgctttggc ttcatgtatc agtgttcatt cagatatatt atcacttgtg caggttcaga 2940
gaagatttca gggtcatacc cttggtactc tgtttcatga tcaccttttt tctgtactat 3000
ttgcattttc tggtatcttt cttgtgaatt atattctgat gagttcagca gttgatgagt 3060
ccaaaaacac aatggccatt aacttccaag atgctagaca gctaatgaat caggttcatg 3120
tcattgttct cttcttatcc gtcaagtctt ttgtttaatt ggctatttgt tctccgttca 3180
ccttcctatt aatgtaaaga gcagcattcg aattctcaaa ttataagaca tgaccatagg 3240
tttggtgttg tggcatgcta ctatacttgc atgtttggat tttcaacttt gtgaagttat 3300
tgtattataa gtacattacc tattttttgg gttgaccaaa tggtgcacga gtttcaatat 3360
tttactatgg tcaaaaggac ctaattggag tagcagagtt ttaggaaaaa tatagttcta 3420
gtatatttgg ggttcaagtt atagatatta gaaatcctat agtgtgtgtt tgctttgtgg 3480
aagtgcctca tgctagatca ggtcgtgtat catgagttta ttccatgaat ttttgggtgg 3540
aacaaaccca ttcttcatgt gtatacatta gcttgtgaga attgagttgg tgatgcatca 3600
actcattcta ttctacagac caaacaaaaa gtgatcatga tggatcacca tattactcaa 3660
acccaataca cgttgttcta aacaagggct aaggttggga gaaatgatgt gttctaggga 3720
agccactggg gacactctct taggctcatt ccctgaatag ggacataaag actgagttat 3780
aagtaagctg ctgggatgct ctttttctct tcaactttct tgaaaactta gcctccgcat 3840
gaactggcca gcgcatattc aataaaaata gtatcccttt gagcttggag tgtgcattga 3900
aggaggaagc tccttcctga atttcgtctg gattcttgta ttttgcaggt gttcacaagt 3960
cctgtggcgc caattgtatt attagtggtc cttctctttt caggccacat cgtgtcaatg 4020
acatgtatta ttggtagtga tgtaatttca gaggatctct tcggcataaa gctgcctctt 4080
tttgtgcacc atctgttacc taaggttttt gccatgatta ctactatata ccatgcgaag 4140
gttgtgggtt ttgaagggtt atatcagtta ctcatggtct gcccaattat ccaagctatg 4200
ctccttcctt cgtctgttat acctgttttc cgcatttcct cgtcgagatc attgatggga 4260
agataccgga tatctcgatg tgttgaaata ttgtgcttcc tagcatttct tctaacactg 4320
tttacaaata tcatttttgt ggcggaagtt ctttttggtg atagcacttg gacaaatgac 4380
ctgaaaggga acactgaaaa ccctattcta cttccatata tcgtcgtagt cctgatatca 4440
tgtggatcta ttggttttgc actgttcctt gctgttactc cactaaaatc agcatgtaat 4500
gaagctgaga gagcgttgtc tgtgcactca taacagagag aaacgttgga tgccactcgt 4560
cacagcaaaa ccgacttctc tgagaatagg gcacatgaag catatgaaga acagaggtcc 4620
ttagccatcc ttgttccaca ggattcactg aaaggtgata gaaataattt tcgtagcata 4680
tccgtcatct caaattagct tttgatcaca agggcagttt ctgacgctac tctacatgca 4740
aatcatatgt ctagtaatcc agaagttcat ccttagcatt gactggacag agcctatgtc 4800
aactgaaggt tgtgccctta aaattgactg gacagagagc ctatgtcaac tgaaaaaggt 4860
tgtgcaaata gactcggatg cctgcacgga caacgctctg tttgtggaga agtctgaata 4920
tttcttttat caatcactat ttctcttcca tttaaattca caattaacag gatggttctt 4980
atcatgactg tacctctgtt tgtgggagtg gaagtgttgg cttatctacc tattactcaa 5040
agaaacacca ctgtctcacc tgacacctcc ggtgattgct gcaaacagaa tgctttgttg 5100
gttgttgaat gaagcaaatt aatgtggaca gtggtactgc aaatcaggtg gttcctctat 5160
aggatacacc atgtgtggag tctgcaacaa ggttctagtc tcaacttgca catatggtgg 5220
ctcttgcgag ttgtgacatg atcgtctgac atcaaagcgt gaaagacgtt aataaatctg 5280
ttcaggagaa gcagtcatct agcgtggcaa gcttggaatt gacaatgagt gaggtggaca 5340
ctggtagatc tagtagcagg gtcaaaggat gatttctttt accaagtatg cttgcattcg 5400
tgaggtgatc cttttctctt cttacctgtt tttagtaaaa atagaagtag aaactgtaat 5460
cttcccatga gaaagacgag cttcttgaga tctagatcca ctcaatgagt tgtggcctgt 5520
gggttttgga aaggtttttg ttttagtcga gcatcccatc tttgcagctg aatttcacat 5580
tttagtaaac aacagctaat tggaccatag aaggttgatc atattgcatc tctttttttc 5640
gtgaccatgt ctaagcacat ttttcattag gaggaaggag agtttacaac acacatctta 5700
gtaagcatga tccttgggta gtttacatca tatgatgaat ggttacttct agtatttgcg 5760
tatgccttca gtagtacatt ggatatatcc tctatatttg aactttcgat ggaaagaatt 5820
tgtaggccgc ttatatagac cctatttgga accacttata atcgattcaa ttataatcat 5880
cttactgtga ggatcttttc tattctcgat tatgctagtc caaagttata aacggagatc 5940
aaaagaatca gaaataatct gtgccatagt ctcagtggtg aacaatgcag ttgcccagtt 6000
gtcccgttca aattcctgaa ctattccgct cattgtcgcc accagtttgc tgatttcttc 6060
tttgatttcc ttgatgtagg caaatcaatg a 6091
<210> 3
<211> 3093
<212> DNA
<213> Zea mays
<400> 3
ttagcttgta agtttgcatc tgattgcaca accgttgttc ccgtgcgtgg gggactgcgg 60
ctcgttcttg gatgaggttc tgacttctga ggaggatggg gatggctcac ggcgccgtga 120
cgcgacgatc ggcgtcggca gcggcgcgtc cctgctgtcc ctcctgcatg cttgccaccc 180
agcttcgcct cgccacgcgt ctcgtggacc attgcgcgcg tccctcgtgc cacggcgcac 240
ccgcagccat gagtgccagc gctcccgtct cccgactccc gagcaaatcg ctcggcgcct 300
actactagcc cgaaacctag gtggtgggaa ctcgggatac gaggtcctct tctccgccgc 360
ttgtcctcgg caagcacaaa cgcggtaagc caccaaccgc ctttcgatcg ccgggctcca 420
tccctaagcc atctcgccgc ctcgtgagtc gtgagccgtg agccaaccga cctgccatca 480
tcaattagtg acttcctgaa cctgcttcgt gcgtcctacc actccaaagt tccaacaatc 540
aagagattta aacaaagcct tccataactt gttgccaata agtgcaggat ttggaggcca 600
ctgggctttg gaaattcaaa gtagactgct tggatcataa cgagagagca caaagggcct 660
gggttttgta aacagttggc tttcactaga gtcgatccat tccatactga gcttttgcat 720
atcatcaatt catcacacaa tggatggtgt gcgatgcatg gagtccccag ctgctggtga 780
tgccccgcat agttctttcc gaacccttgg gccaacactc ctggtttcaa tggggtatct 840
tgacttcgga aaatggttgg tggcagtgga agctgggtct cggtttggct atgatcttgt 900
cctgctcgtg ctgcttttta atctatcagc tattctgtgt cagtacctct cgagctgtat 960
cggcattgtc accaaaaaga atcttgcgga gatttgccac caggagtatg atcagaaaat 1020
atgtgttgtt cttggtcttc aagcaggact gtccttgcta acttctgaat ttaccatgat 1080
tgcaggcatc gtagttgggt tcaaccttgt atttgaaagc aacaatccca tcacagtcgt 1140
atgttttaca agtgttgtag ttaatctgct accatataca ctctcccttc tggacaagag 1200
gaaggctgga atgtttaatg cctacgcatc tggctttaca ctagtttgtt ttgtgcttgg 1260
tttattagtg agccatccca aaacccctgt caataccaac gtaatgttcc ccaagttgag 1320
tggtgaaagt gcttactcac tgatggtact acttggcaca aatataatag tacacaactt 1380
ttatactcat tcatcagttg ttcaggttca gagaagattt cagggtcata cccttggtac 1440
tctgtttcat gatcaccttt tttctgtact atttgcattt tctggtatct ttcttgtgaa 1500
ttatattctg atgagttcag cagttgatga gtccaaaaac acaatggcca ttaacttcca 1560
agatgctaga cagctaatga atcaggtgtt cacaagtcct gtggcgccaa ttgtattatt 1620
agtggtcctt ctcttttcag gccacatcgt gtcaatgaca tgtattattg gtagtgatgt 1680
aatttcagag gatctcttcg gcataaagct gcctcttttt gtgcaccatc tgttacctaa 1740
ggtttttgcc atgattacta ctatatacca tgcgaaggtt gtgggttttg aagggttata 1800
tcagttactc atggtctgcc caattatcca agctatgctc cttccttcgt ctgttatacc 1860
tgttttccgc atttcctcgt cgagatcatt gatgggaaga taccggatat ctcgatgtgt 1920
tgaaatattg tgcttcctag catttcttct aacactgttt acaaatatca tttttgtggc 1980
ggaagttctt tttggtgata gcacttggac aaatgacctg aaagggaaca ctgaaaaccc 2040
tattctactt ccatatatcg tcgtagtcct gatatcatgt ggatctattg gttttgcact 2100
gttccttgct gttactccac taaaatcagc atgtaatgaa gctgagagag cgttgtctgt 2160
gcactcataa cagagagaaa cgttggatgc cactcgtcac agcaaaaccg acttctctga 2220
gaatagggca catgaagcat atgaagaaca gaggtcctta gccatccttg ttccacagga 2280
ttcactgaaa gtaatccaga agttcatcct tagcattgac tggacagagc ctatgtcaac 2340
tgaaggttgt gcccttaaaa ttgactggac agagagccta tgtcaactga aaaaggttgt 2400
gcaaatagac tcggatgcct gcacggacaa cgctctgttt gtggagaagt ctgaatattt 2460
cttttatcaa tcactatttc tcttccattt aaattcacaa ttaacaggat ggttcttatc 2520
atgactgtac ctctgtttgt gggagtggaa gtgttggctt atctacctat tactcaaaga 2580
aacaccactg tctcacctga cacctccggt gattgctgca aacagaatgc tttgttggtt 2640
gttgaatgaa gcaaattaat gtggacagtg gtactgcaaa tcaggtggtt cctctatagg 2700
atacaccatg tgtggagtct gcaacaaggt tctagtctca acttgcacat atggtggctc 2760
ttgcgagttg tgacatgatc gtctgacatc aaagcgtgaa agacgttaat aaatctgttc 2820
aggagaagca gtcatctagc gtggcaagct tggaattgac aatgagtgag gtggacactg 2880
gtagatctag tagcagggtc aaaggatgat ttcttttacc aagtatgctt gcattcgtga 2940
ggtgatcctt ttctcttctt acctgttttt agtaaaaata gaagtagaaa ctgtaatctt 3000
cccatgagaa agacgagctt cttgagatct agatccactc aatgagttgt ggcctgtggg 3060
ttttggaaag gtttttgttt tagtcgagca tcc 3093
<210> 4
<211> 1431
<212> DNA
<213> Zea mays
<400> 4
atggatggtg tgcgatgcat ggagtcccca gctgctggtg atgccccgca tagttctttc 60
cgaacccttg ggccaacact cctggtttca atggggtatc ttgacttcgg aaaatggttg 120
gtggcagtgg aagctgggtc tcggtttggc tatgatcttg tcctgctcgt gctgcttttt 180
aatctatcag ctattctgtg tcagtacctc tcgagctgta tcggcattgt caccaaaaag 240
aatcttgcgg agatttgcca ccaggagtat gatcagaaaa tatgtgttgt tcttggtctt 300
caagcaggac tgtccttgct aacttctgaa tttaccatga ttgcaggcat cgtagttggg 360
ttcaaccttg tatttgaaag caacaatccc atcacagtcg tatgttttac aagtgttgta 420
gttaatctgc taccatatac actctccctt ctggacaaga ggaaggctgg aatgtttaat 480
gcctacgcat ctggctttac actagtttgt tttgtgcttg gtttattagt gagccatccc 540
aaaacccctg tcaataccaa cgtaatgttc cccaagttga gtggtgaaag tgcttactca 600
ctgatggtac tacttggcac aaatataata gtacacaact tttatactca ttcatcagtt 660
gttcaggttc agagaagatt tcagggtcat acccttggta ctctgtttca tgatcacctt 720
ttttctgtac tatttgcatt ttctggtatc tttcttgtga attatattct gatgagttca 780
gcagttgatg agtccaaaaa cacaatggcc attaacttcc aagatgctag acagctaatg 840
aatcaggtgt tcacaagtcc tgtggcgcca attgtattat tagtggtcct tctcttttca 900
ggccacatcg tgtcaatgac atgtattatt ggtagtgatg taatttcaga ggatctcttc 960
ggcataaagc tgcctctttt tgtgcaccat ctgttaccta aggtttttgc catgattact 1020
actatatacc atgcgaaggt tgtgggtttt gaagggttat atcagttact catggtctgc 1080
ccaattatcc aagctatgct ccttccttcg tctgttatac ctgttttccg catttcctcg 1140
tcgagatcat tgatgggaag ataccggata tctcgatgtg ttgaaatatt gtgcttccta 1200
gcatttcttc taacactgtt tacaaatatc atttttgtgg cggaagttct ttttggtgat 1260
agcacttgga caaatgacct gaaagggaac actgaaaacc ctattctact tccatatatc 1320
gtcgtagtcc tgatatcatg tggatctatt ggttttgcac tgttccttgc tgttactcca 1380
ctaaaatcag catgtaatga agctgagaga gcgttgtctg tgcactcata a 1431
<210> 5
<211> 20
<212> DNA
<213> unknown (Artificial Synthesis)
<400> 5
cgcatagttc tttccgaacc 20
<210> 6
<211> 19
<212> DNA
<213> Unknown (Artificial Synthesis)
<400> 6
caccaaccat tttccgaag 19
<210> 7
<211> 84
<212> DNA
<213> Zea mays
<400> 7
cgcatagttc tttccgaacc cttgggccaa cactcctggt ttcaatgggg tatctagtat 60
cttgacttcg gaaaatggtt ggtg 84
<210> 8
<211> 77
<212> DNA
<213> Zea mays
<400> 8
cgcatagttc tttccgaacc cttgggccaa cactcctggt ttcaatgggg tatcttgact 60
tcggaaaatg gttggtg 77
<210> 9
<211> 20
<212> DNA
<213> unknown (Artificial Synthesis)
<400> 9
gcctacgcat ctggctttac 20
<210> 10
<211> 20
<212> DNA
<213> Unknown (Artificial Synthesis)
<400> 10
ggccattgtg tttttggact 20
<210> 11
<211> 935
<212> DNA
<213> Zea mays
<400> 11
gcctacgcat ctggctttac actagtttgt tttgtgcttg gtttattagt gagccatccc 60
aaaacccctg tcaataccaa cgtaatgttc cccaagttga gtggtgaaag tgcttactca 120
ctgatggtac tacttggcac aaatataata gtacacaact tttatactca ttcatcagtt 180
gttcaggtaa atttctgaca attccattcc catgtttatg caattagctt tgtcaagcat 240
gtcttccatt aaaagtttgt cttatcaaac ctgaatgcat ctttcatggc gaaagtgatg 300
ttttgtgtta ggattatgta caaaaaaatg gtttatgtta gggtactgag tgtattggtt 360
gttcttgatc accaattcgc gaaagggcct ctagctgagt tggttaggtg gtctgaatag 420
cactcctcag gtcctgggtt cgactccccg tgggagcgaa tttcaggctg tggttaaaaa 480
aatcccctcg tctgtcccac gccaaagcac aggtctaaga ctcagcccgg tcgtggtcgt 540
tctcacatgg gcttcgatgc cgctgtgtat gggtggggta ggggttcggg ggttttcttg 600
acctgtgtga gaaggtattt ttcttaatac aatacccggg gctgtcttac cccccgcagg 660
tcaagttttg atcaccaatt caccataggg gtgctgaatg cactgagttg ttcttgattc 720
attctttgct agtttggtct gctttggctt catgtatcag tgttcattca gatatattat 780
cacttgtgca ggttcagaga agatttcagg gtcataccct tggtactctg tttcatgatc 840
accttttttc tgtactattt gcattttctg gtatctttct tgtgaattat attctgatga 900
gttcagcagt tgatgagtcc aaaaacacaa tggcc 935
<210> 12
<211> 632
<212> DNA
<213> Zea mays
<400> 12
gcctacgcat ctggctttac actagtttgt tttgtgcttg gtttattagt gagccatccc 60
aaaacccctg tcaataccaa cgtaatgttc cccaagttga gtggtgaaag tgcttactca 120
ctgatggtac tacttggcac aaatataata gtacacaact tttatactca ttcatcagtt 180
gttcaggtaa atttctgaca attccattcc catgtttatg caattagctt tgtcaagcat 240
gtcttccatt aaaagtttgt cttatcaaac ctgaatgcat ctttcatggc gaaagtgatg 300
ttttgtgtta ggattatgta caaaaaaatg gtttatgtta gggtactgag tgtattggtt 360
gtttttgatc accaattcac cataggggtg ctgaatgcac tgagttgttc ttgattcatt 420
ctttgctagt ttggtctgct ttggcttcat gtatcagtgt tcattcagat atattatcac 480
ttgtgcaggt tcagagaaga tttcagggtc atacccttgg tactctgttt catgatcacc 540
ttttttctgt actatttgca ttttctggta tctttcttgt gaattatatt ctgatgagtt 600
cagcagttga tgagtccaaa aacacaatgg cc 632
<210> 13
<211> 1313
<212> DNA
<213> Zea mays
<400> 13
aaatcgctcg gcgcctacta ctagcccgaa acctaggtgg tgggaactcg ggatacgagg 60
tcctcttctc cgccgcttgt cctcggcaag cacaaacgcg gtaagccacc aaccgccttt 120
cgatcgccgg gctccatccc taagccatct cgccgcctcg tgagtcgtga gccgtgagcc 180
aaccgacctg ccatcatcaa ttagtgactt cctgaacctg cttcgtgcgt cctaccactc 240
caaagttcca acaatcaaga gatttaaaca aagccttcca taacttgttg ccaataagtg 300
caggatttgg aggccactgg gctttggaaa ttcaaagtag actgcttgga tcataacgag 360
agagcacaaa gggcctgggt tttgtaaaca gttggctttc actagagtcg atccattcca 420
tactgagctt ttgcatatca tcaattcatc acacaatgga tggtgtgcga tgcatggagt 480
ccccagctgc tggtgatgcc ccgcatagtt ctttccgaac ccttgggcca acactcctgg 540
tttcaatggg gtatcttgac ttcggaaaat ggttggtggc agtggaagct gggtctcggt 600
ttggctatga tcttgtcctg ctcgtgctgc tttttaatct atcagctatt ctgtgtcagt 660
acctctcgag ctgtatcggc attgtcacca aaaagaatct tgcggagatt tgccaccagg 720
agtatgatca gaaaatatgt gttgttcttg gtcttcaagc aggactgtcc ttgctaactt 780
ctgaatttac catgattgca ggcatcgtag ttgggttcaa ccttgtattt gaaagcaaca 840
atcccatcac agtcgtatgt tttacaagtg ttgtagttaa tctgctacca tatacactct 900
cccttctgga caagaggaag gctggaatgt ttaatgccta cgcatctggc tttacactag 960
tttgttttgt gcttggttta ttagtgagcc atcccaaaac ccctgtcaat accaacgtaa 1020
tgttccccaa gttgagtggt gaaagtgctt actcactgat ggtactactt ggcacaaata 1080
taatagtaca caacttttat actcattcat cagttgttca ggttcagaga agatttcagg 1140
gtcataccct tggtactctg tttcatgatc accttttttc tgtactattt gcattttctg 1200
gtatctttct tgtgaattat attctgatga gttcagcagt tgatgagtcc aaaaacacaa 1260
tggccattaa cttccaagat gctagacagc taatgaatca ggcaaatcaa tga 1313
<210> 14
<211> 858
<212> DNA
<213> Zea mays
<400> 14
atggatggtg tgcgatgcat ggagtcccca gctgctggtg atgccccgca tagttctttc 60
cgaacccttg ggccaacact cctggtttca atggggtatc ttgacttcgg aaaatggttg 120
gtggcagtgg aagctgggtc tcggtttggc tatgatcttg tcctgctcgt gctgcttttt 180
aatctatcag ctattctgtg tcagtacctc tcgagctgta tcggcattgt caccaaaaag 240
aatcttgcgg agatttgcca ccaggagtat gatcagaaaa tatgtgttgt tcttggtctt 300
caagcaggac tgtccttgct aacttctgaa tttaccatga ttgcaggcat cgtagttggg 360
ttcaaccttg tatttgaaag caacaatccc atcacagtcg tatgttttac aagtgttgta 420
gttaatctgc taccatatac actctccctt ctggacaaga ggaaggctgg aatgtttaat 480
gcctacgcat ctggctttac actagtttgt tttgtgcttg gtttattagt gagccatccc 540
aaaacccctg tcaataccaa cgtaatgttc cccaagttga gtggtgaaag tgcttactca 600
ctgatggtac tacttggcac aaatataata gtacacaact tttatactca ttcatcagtt 660
gttcaggttc agagaagatt tcagggtcat acccttggta ctctgtttca tgatcacctt 720
ttttctgtac tatttgcatt ttctggtatc tttcttgtga attatattct gatgagttca 780
gcagttgatg agtccaaaaa cacaatggcc attaacttcc aagatgctag acagctaatg 840
aatcaggcaa atcaatga 858
<210> 15
<211> 1526
<212> DNA
<213> Zea mays
<400> 15
aaatcgctcg gcgcctacta ctagcccgaa acctaggtgg tgggaactcg ggatacgagg 60
tcctcttctc cgccgcttgt cctcggcaag cacaaacgcg gtaagccacc aaccgccttt 120
cgatcgccgg gctccatccc taagccatct cgccgcctcg tgagtcgtga gccgtgagcc 180
aaccgacctg ccatcatcaa ttagtgactt cctgaacctg cttcgtgcgt cctaccactc 240
caaagttcca acaatcaaga gatttaaaca aagccttcca taacttgttg ccaataagtg 300
caggatttgg aggccactgg gctttggaaa ttcaaagtag actgcttgga tcataacgag 360
agagcacaaa gggcctgggt tttgtaaaca gttggctttc actagagtcg atccattcca 420
tactgagctt ttgcatatca tcaattcatc acacaatgga tggtgtgcga tgcatggagt 480
ccccagctgc tggtgatgcc ccgcatagtt ctttccgaac ccttgggcca acactcctgg 540
tttcaatggg gtatcttgac ttcggaaaat ggttggtggc agtggaagct gggtctcggt 600
ttggctatga tcttgtcctg ctcgtgctgc tttttaatct atcagctatt ctgtgtcagt 660
acctctcgag ctgtatcggc attgtcacca aaaagaatct tgcggagatt tgccaccagg 720
agtatgatca gaaaatatgt gttgttcttg gtcttcaagc aggactgtcc ttgctaactt 780
ctgaatttac catgattgca ggcatcgtag ttgggttcaa ccttgtattt gaaagcaaca 840
atcccatcac agtcgtatgt tttacaagtg ttgtagttaa tctgctacca tatacactct 900
cccttctgga caagaggaag gctggaatgt ttaatgccta cgcatctggc tttacactag 960
tttgttttgt gcttggttta ttagtgagcc atcccaaaac ccctgtcaat accaacgtaa 1020
tgttccccaa gttgagtggt gaaagtgctt actcactgat ggtactactt ggcacaaata 1080
taatagtaca caacttttat actcattcat cagttgttca ggttcagaga agatttcagg 1140
gtcataccct tggtactctg tttcatgatc accttttttc tgtactattt gcattttctg 1200
gtatctttct tgtgaattat attctgatga gttcagcagt tgatgagtcc aaaaacacaa 1260
tggccattaa cttccaagat gctagacagc taatgaatca ggttcatgtc attgttctct 1320
tcttatccgt caagtctttt gtttaattgg ctatttgttc tccgttcacc ttcctattaa 1380
tgtaaagagc agcattcgaa ttctcaaatt ataagacatg accataggtt tggtgttgtg 1440
gcatgctact atacttgcat gtttggattt tcaactttgt gaagttattg tattataagt 1500
acattaccta ttttttgggt tgacca 1526
<210> 16
<211> 891
<212> DNA
<213> Zea mays
<400> 16
atggatggtg tgcgatgcat ggagtcccca gctgctggtg atgccccgca tagttctttc 60
cgaacccttg ggccaacact cctggtttca atggggtatc ttgacttcgg aaaatggttg 120
gtggcagtgg aagctgggtc tcggtttggc tatgatcttg tcctgctcgt gctgcttttt 180
aatctatcag ctattctgtg tcagtacctc tcgagctgta tcggcattgt caccaaaaag 240
aatcttgcgg agatttgcca ccaggagtat gatcagaaaa tatgtgttgt tcttggtctt 300
caagcaggac tgtccttgct aacttctgaa tttaccatga ttgcaggcat cgtagttggg 360
ttcaaccttg tatttgaaag caacaatccc atcacagtcg tatgttttac aagtgttgta 420
gttaatctgc taccatatac actctccctt ctggacaaga ggaaggctgg aatgtttaat 480
gcctacgcat ctggctttac actagtttgt tttgtgcttg gtttattagt gagccatccc 540
aaaacccctg tcaataccaa cgtaatgttc cccaagttga gtggtgaaag tgcttactca 600
ctgatggtac tacttggcac aaatataata gtacacaact tttatactca ttcatcagtt 660
gttcaggttc agagaagatt tcagggtcat acccttggta ctctgtttca tgatcacctt 720
ttttctgtac tatttgcatt ttctggtatc tttcttgtga attatattct gatgagttca 780
gcagttgatg agtccaaaaa cacaatggcc attaacttcc aagatgctag acagctaatg 840
aatcaggttc atgtcattgt tctcttctta tccgtcaagt cttttgttta a 891
<210> 17
<211> 285
<212> PRT
<213> Zea mays
<400> 17
Met Asp Gly Val Arg Cys Met Glu Ser Pro Ala Ala Gly Asp Ala Pro
1 5 10 15
His Ser Ser Phe Arg Thr Leu Gly Pro Thr Leu Leu Val Ser Met Gly
20 25 30
Tyr Leu Asp Phe Gly Lys Trp Leu Val Ala Val Glu Ala Gly Ser Arg
35 40 45
Phe Gly Tyr Asp Leu Val Leu Leu Val Leu Leu Phe Asn Leu Ser Ala
50 55 60
Ile Leu Cys Gln Tyr Leu Ser Ser Cys Ile Gly Ile Val Thr Lys Lys
65 70 75 80
Asn Leu Ala Glu Ile Cys His Gln Glu Tyr Asp Gln Lys Ile Cys Val
85 90 95
Val Leu Gly Leu Gln Ala Gly Leu Ser Leu Leu Thr Ser Glu Phe Thr
100 105 110
Met Ile Ala Gly Ile Val Val Gly Phe Asn Leu Val Phe Glu Ser Asn
115 120 125
Asn Pro Ile Thr Val Val Cys Phe Thr Ser Val Val Val Asn Leu Leu
130 135 140
Pro Tyr Thr Leu Ser Leu Leu Asp Lys Arg Lys Ala Gly Met Phe Asn
145 150 155 160
Ala Tyr Ala Ser Gly Phe Thr Leu Val Cys Phe Val Leu Gly Leu Leu
165 170 175
Val Ser His Pro Lys Thr Pro Val Asn Thr Asn Val Met Phe Pro Lys
180 185 190
Leu Ser Gly Glu Ser Ala Tyr Ser Leu Met Val Leu Leu Gly Thr Asn
195 200 205
Ile Ile Val His Asn Phe Tyr Thr His Ser Ser Val Val Gln Val Gln
210 215 220
Arg Arg Phe Gln Gly His Thr Leu Gly Thr Leu Phe His Asp His Leu
225 230 235 240
Phe Ser Val Leu Phe Ala Phe Ser Gly Ile Phe Leu Val Asn Tyr Ile
245 250 255
Leu Met Ser Ser Ala Val Asp Glu Ser Lys Asn Thr Met Ala Ile Asn
260 265 270
Phe Gln Asp Ala Arg Gln Leu Met Asn Gln Ala Asn Gln
275 280 285
<210> 18
<211> 296
<212> PRT
<213> Zea mays
<400> 18
Met Asp Gly Val Arg Cys Met Glu Ser Pro Ala Ala Gly Asp Ala Pro
1 5 10 15
His Ser Ser Phe Arg Thr Leu Gly Pro Thr Leu Leu Val Ser Met Gly
20 25 30
Tyr Leu Asp Phe Gly Lys Trp Leu Val Ala Val Glu Ala Gly Ser Arg
35 40 45
Phe Gly Tyr Asp Leu Val Leu Leu Val Leu Leu Phe Asn Leu Ser Ala
50 55 60
Ile Leu Cys Gln Tyr Leu Ser Ser Cys Ile Gly Ile Val Thr Lys Lys
65 70 75 80
Asn Leu Ala Glu Ile Cys His Gln Glu Tyr Asp Gln Lys Ile Cys Val
85 90 95
Val Leu Gly Leu Gln Ala Gly Leu Ser Leu Leu Thr Ser Glu Phe Thr
100 105 110
Met Ile Ala Gly Ile Val Val Gly Phe Asn Leu Val Phe Glu Ser Asn
115 120 125
Asn Pro Ile Thr Val Val Cys Phe Thr Ser Val Val Val Asn Leu Leu
130 135 140
Pro Tyr Thr Leu Ser Leu Leu Asp Lys Arg Lys Ala Gly Met Phe Asn
145 150 155 160
Ala Tyr Ala Ser Gly Phe Thr Leu Val Cys Phe Val Leu Gly Leu Leu
165 170 175
Val Ser His Pro Lys Thr Pro Val Asn Thr Asn Val Met Phe Pro Lys
180 185 190
Leu Ser Gly Glu Ser Ala Tyr Ser Leu Met Val Leu Leu Gly Thr Asn
195 200 205
Ile Ile Val His Asn Phe Tyr Thr His Ser Ser Val Val Gln Val Gln
210 215 220
Arg Arg Phe Gln Gly His Thr Leu Gly Thr Leu Phe His Asp His Leu
225 230 235 240
Phe Ser Val Leu Phe Ala Phe Ser Gly Ile Phe Leu Val Asn Tyr Ile
245 250 255
Leu Met Ser Ser Ala Val Asp Glu Ser Lys Asn Thr Met Ala Ile Asn
260 265 270
Phe Gln Asp Ala Arg Gln Leu Met Asn Gln Val His Val Ile Val Leu
275 280 285
Phe Leu Ser Val Lys Ser Phe Val
290 295
<210> 19
<211> 1991
<212> DNA
<213> Zea mays
<400> 19
ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60
agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120
tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180
tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240
gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300
ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360
gtttagggtt aatggttttt atagactaat ttttttagta catctatttt attctatttt 420
agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480
taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540
aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600
cgagtctaac ggacaccaac cagcgaacca gcagcgtcgc gtcgggccaa gcgaagcaga 660
cggcacggca tctctgtcgc tgcctctgga cccctctcga gagttccgct ccaccgttgg 720
acttgctccg ctgtcggcat ccagaaattg cgtggcggag cggcagacgt gagccggcac 780
ggcaggcggc ctcctcctcc tctcacggca ccggcagcta cgggggattc ctttcccacc 840
gctccttcgc tttcccttcc tcgcccgccg taataaatag acaccccctc cacaccctct 900
ttccccaacc tcgtgttgtt cggagcgcac acacacacaa ccagatctcc cccaaatcca 960
cccgtcggca cctccgcttc aaggtacgcc gctcgtcctc cccccccccc tctctacctt 1020
ctctagatcg gcgttccggt ccatggttag ggcccggtag ttctacttct gttcatgttt 1080
gtgttagatc cgtgtttgtg ttagatccgt gctgctagcg ttcgtacacg gatgcgacct 1140
gtacgtcaga cacgttctga ttgctaactt gccagtgttt ctctttgggg aatcctggga 1200
tggctctagc cgttccgcag acgggatcga tttcatgatt ttttttgttt cgttgcatag 1260
ggtttggttt gcccttttcc tttatttcaa tatatgccgt gcacttgttt gtcgggtcat 1320
cttttcatgc ttttttttgt cttggttgtg atgatgtggt ctggttgggc ggtcgttcta 1380
gatcggagta gaattctgtt tcaaactacc tggtggattt attaattttg gatctgtatg 1440
tgtgtgccat acatattcat agttacgaat tgaagatgat ggatggaaat atcgatctag 1500
gataggtata catgttgatg cgggttttac tgatgcatat acagagatgc tttttgttcg 1560
cttggttgtg ttgatgtggt gtggttgggc ggtcgttcat tcgttctaga tcggagtaga 1620
atactgtttc aaactacctg gtgtatttat taattttgga actgtatgtg tgtgtcatac 1680
atcttcatag ttacgagttt aagatggatg gaaatatcga tctaggatag gtatacatgt 1740
tgatgtgggt tttactgatg catatacatg atggcatatg cagcatctat tcatatgctc 1800
taaccttgag tacctatcta ttataataaa caagtatgtt ttataattat tttgatcttg 1860
atatacttgg atgatggcat atgcagcagc tatatgtgga tttttttagc cctgccttca 1920
tacgctattt atttgcttgg tactgtttct tttgtcgatg ctcaccctgt tgtttggtgt 1980
tacttctgca g 1991
Claims (7)
1. The application of the protein in controlling the water content or dehydration rate of corn grains is characterized in that: the amino acid sequence of the protein is shown in any one of SEQ ID NO.1, SEQ ID NO.17 or SEQ ID NO. 18.
2. Use of a nucleic acid for controlling a moisture content or dehydration rate trait in corn kernel, wherein the nucleic acid encodes a protein according to claim 1; optionally, the nucleotide sequence or reverse complementary sequence of the nucleic acid is shown as any one of SEQ ID NO.2, SEQ ID NO.3, SEQ ID NO.4, SEQ ID NO.13, SEQ ID NO.14, SEQ ID NO.15 or SEQ ID NO. 16.
3. A molecular marker, which is characterized in that 7 bases AGTATCT are inserted into the position of 1044-1045 of the sequence shown in SEQ ID NO. 2; or 303 bases CTTGATCACCAATTCGCGAAAGGGCCTCTAGCTGAGTTGGTTAGGTGGTCTGAATAGCACTCCTCAGGTCCTGGGTTCGACTCCCCGTGGGAGCGAATTTCAGGCTGTGGTTAAAAAAATCCCCTCGTCTGTCCCACGCCAAAGCACAGGTCTAAGACTCAGCCCGGTCGTGGTCGTTCTCACATGGGCTTCGATGCCGCTGTGTATGGGTGGGGTAGGGGTTCGGGGGTTTTCTTGACCTGTGTGAGAAGGTATTTTTCTTAATACAATACCCGGGGCTGTCTTACCCCCCGCAGGTCAAGT at position 2506 and 2808 of the sequence shown in SEQ ID NO. 2.
4. A method for identifying or assisting in identifying the water content or dehydration rate character of corn kernels is characterized by comprising the following steps: (1) detecting the molecular marker of claim 3 in a test material; (2) if the detection result is that the marker is included, the material to be detected shows the character of high water content of grains or slow dehydration rate; if the detection result is that the marker is not included, the material to be detected shows the characteristics of low water content of grains or high dehydration rate; optionally, the detection method of the molecular marker adopts PCR amplification;
optionally, the primer pair adopted by the PCR amplification consists of SEQ ID NO.5/SEQ ID NO.6 or SEQ ID NO.9/SEQ ID NO. 10;
optionally, the PCR amplification product containing the marker is shown as SEQ ID NO.7 or SEQ ID NO. 11; the expression PCR amplification product not containing the above marker is shown in SEQ ID NO.8 or SEQ ID NO. 12.
5. A method for screening corn materials with the characteristics of low grain water content or high dehydration rate, which is characterized in that the molecular marker in claim 3 in a material to be tested is detected according to the method in claim 4, and materials which do not contain the molecular marker in claim 3 are screened.
6. A method for reducing the moisture content of corn kernels or increasing the rate of dehydration, characterized in that the expression and/or activity of the protein of claim 1 is increased in the corn material to be modified, plants are selected which have a low moisture content of corn kernels or a high rate of dehydration;
optionally, the method of increasing the expression of a protein is to use a highly active promoter to drive expression of a nucleic acid sequence encoding the protein;
optionally, the high-activity promoter is a maize ubiquitin promoter;
optionally, the sequence of the maize ubiquitin promoter is shown in SEQ ID NO. 19.
7. Use of the molecular marker and method of claims 3-6 for improving moisture content or dehydration rate traits of corn kernel.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210595676.4A CN114989281B (en) | 2022-05-30 | 2022-05-30 | Corn kernel water content control gene ZmEIN2-1 and molecular marker thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210595676.4A CN114989281B (en) | 2022-05-30 | 2022-05-30 | Corn kernel water content control gene ZmEIN2-1 and molecular marker thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114989281A true CN114989281A (en) | 2022-09-02 |
CN114989281B CN114989281B (en) | 2024-04-05 |
Family
ID=83028735
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210595676.4A Active CN114989281B (en) | 2022-05-30 | 2022-05-30 | Corn kernel water content control gene ZmEIN2-1 and molecular marker thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114989281B (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111719012A (en) * | 2020-06-29 | 2020-09-29 | 吉林省农业科学院 | dCAPS molecular marker primer pair for identifying dehydration rate genotype of corn kernel and application |
CN112521471A (en) * | 2020-11-27 | 2021-03-19 | 华中农业大学 | Gene and molecular marker for controlling water content of corn kernels and application thereof |
-
2022
- 2022-05-30 CN CN202210595676.4A patent/CN114989281B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111719012A (en) * | 2020-06-29 | 2020-09-29 | 吉林省农业科学院 | dCAPS molecular marker primer pair for identifying dehydration rate genotype of corn kernel and application |
CN112521471A (en) * | 2020-11-27 | 2021-03-19 | 华中农业大学 | Gene and molecular marker for controlling water content of corn kernels and application thereof |
Also Published As
Publication number | Publication date |
---|---|
CN114989281B (en) | 2024-04-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9663794B2 (en) | Heat-resistance rice gene OsZFP, screening marker and separation method thereof | |
CN112375130B (en) | Corn ear length gene and molecular marker and application thereof | |
CN111153974A (en) | Corn disease-resistant gene and molecular marker and application thereof | |
CN108165554B (en) | Corn leaf width control gene ZmNL4 and application thereof | |
US11479784B2 (en) | Modulation of seed vigor | |
CN110862993B (en) | Gene ZKM89 for controlling plant height and ear position height of corn and application thereof | |
CN111235180A (en) | Method for shortening flowering phase of corn | |
CN110903368B (en) | Gene for controlling female character of corn, kit for creating female sterile line of corn, mutant genotype and method | |
CN111172173A (en) | Method for reducing plant height of corn or delaying flowering | |
CN112521471B (en) | Gene and molecular marker for controlling water content of corn kernels and application thereof | |
JP6639382B2 (en) | Brassica plant comprising mutant DA1 allele | |
CN101747420B (en) | Dominant dwarf rice related protein, encoding gene thereof and application | |
CN110862440B (en) | Gene ZKM465 for controlling corn plant height and application thereof | |
CN116445446B (en) | Wild cabbage glycosyltransferase BoUGT C2 gene and application | |
CN108484741B (en) | Protein for controlling grain weight of crop seeds and application thereof | |
CN114989281B (en) | Corn kernel water content control gene ZmEIN2-1 and molecular marker thereof | |
CN114958867A (en) | Corn ear grain weight and yield regulation gene KWE2, and encoding protein, functional marker, expression vector and application thereof | |
CN110862994B (en) | Gene ZKM76 for controlling corn plant height and ear position and application thereof | |
CN110358774B (en) | Gene, protein, gene expression cassette, expression vector, host cell, method and application for controlling rice flowering time | |
CN111172171A (en) | Gene for controlling plant height and flowering phase of corn and application thereof | |
CN112724210A (en) | Plant amyloplast development related protein OsSSG7 and coding gene and application thereof | |
CN110959043A (en) | Method for improving agronomic traits of plants by using BCS1L gene and guide RNA/CAS endonuclease system | |
CN114540375B (en) | Gene and molecular marker for regulating and controlling flowering period and photoperiod adaptability of corn and application of gene and molecular marker | |
CN110229801B (en) | Gene for controlling rice leaf senescence and protein coded by same | |
CN116426562A (en) | Corn kernel type gene and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB03 | Change of inventor or designer information |
Inventor after: Yan Jianbing Inventor after: Li Wenqiang Inventor after: Yu Yanhui Inventor after: Liu Longyu Inventor after: Wu Shenshen Inventor before: Yan Jianbing Inventor before: Li Wenqiang Inventor before: Jin Yanhui Inventor before: Liu Longyu Inventor before: Wu Shenshen |
|
CB03 | Change of inventor or designer information | ||
GR01 | Patent grant | ||
GR01 | Patent grant |