CN114990112A - Prickle-specific promoter - Google Patents
Prickle-specific promoter
- Publication number
- CN114990112A (application CN202110213440.5A)
- Authority
- CN
- China
- Prior art keywords
- promoter
- thorn
- plant
- gene
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
- 239000002773 nucleotide Substances 0.000 claims abstract description 23
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 23
- 238000011161 development Methods 0.000 claims abstract description 13
- 230000001105 regulatory effect Effects 0.000 claims abstract description 13
- 230000014509 gene expression Effects 0.000 claims description 50
- 210000004027 cell Anatomy 0.000 claims description 46
- 239000013604 expression vector Substances 0.000 claims description 11
- 241000588724 Escherichia coli Species 0.000 claims description 4
- 241000589158 Agrobacterium Species 0.000 claims description 2
- 210000001236 prokaryotic cell Anatomy 0.000 claims description 2
- 230000001276 controlling effect Effects 0.000 abstract description 6
- 238000010353 genetic engineering Methods 0.000 abstract description 2
- 108090000623 proteins and genes Proteins 0.000 description 75
- 241000196324 Embryophyta Species 0.000 description 67
- 239000012634 fragment Substances 0.000 description 30
- 235000018102 proteins Nutrition 0.000 description 21
- 102000004169 proteins and genes Human genes 0.000 description 21
- 238000009396 hybridization Methods 0.000 description 20
- 244000050053 Rosa multiflora Species 0.000 description 19
- 235000000656 Rosa multiflora Nutrition 0.000 description 19
- 125000003275 alpha amino acid group Chemical group 0.000 description 18
- 210000001519 tissue Anatomy 0.000 description 18
- 238000000034 method Methods 0.000 description 17
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 14
- 235000001014 amino acid Nutrition 0.000 description 14
- 239000000523 sample Substances 0.000 description 14
- 230000000694 effects Effects 0.000 description 13
- 241000109365 Rosa arkansana Species 0.000 description 12
- 235000005066 Rosa arkansana Nutrition 0.000 description 12
- 150000007523 nucleic acids Chemical class 0.000 description 12
- 229940024606 amino acid Drugs 0.000 description 11
- 150000001413 amino acids Chemical class 0.000 description 11
- 108020004707 nucleic acids Proteins 0.000 description 11
- 102000039446 nucleic acids Human genes 0.000 description 11
- 108020004414 DNA Proteins 0.000 description 9
- 239000002299 complementary DNA Substances 0.000 description 9
- 241000701489 Cauliflower mosaic virus Species 0.000 description 8
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- 108091028043 Nucleic acid sequence Proteins 0.000 description 7
- 108091033319 polynucleotide Proteins 0.000 description 7
- 102000040430 polynucleotide Human genes 0.000 description 7
- 239000002157 polynucleotide Substances 0.000 description 7
- 238000012163 sequencing technique Methods 0.000 description 7
- 238000006467 substitution reaction Methods 0.000 description 7
- 230000009261 transgenic effect Effects 0.000 description 7
- 238000005406 washing Methods 0.000 description 7
- 108020004705 Codon Proteins 0.000 description 6
- 238000010367 cloning Methods 0.000 description 6
- 230000002401 inhibitory effect Effects 0.000 description 6
- 239000013598 vector Substances 0.000 description 6
- 102100038436 Neuronal pentraxin-1 Human genes 0.000 description 5
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 5
- 230000003321 amplification Effects 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 238000003199 nucleic acid amplification method Methods 0.000 description 5
- 108090000765 processed proteins & peptides Proteins 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- 101150084750 1 gene Proteins 0.000 description 4
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 4
- 108060001084 Luciferase Proteins 0.000 description 4
- 239000005089 Luciferase Substances 0.000 description 4
- 108700008625 Reporter Genes Proteins 0.000 description 4
- 235000011449 Rosa Nutrition 0.000 description 4
- 241000220317 Rosa Species 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- 229920001184 polypeptide Polymers 0.000 description 4
- 102000004196 processed proteins & peptides Human genes 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 3
- 241000209510 Liliopsida Species 0.000 description 3
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 3
- 240000008254 Rosa chinensis Species 0.000 description 3
- 235000000664 Rosa chinensis Nutrition 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 210000002257 embryonic structure Anatomy 0.000 description 3
- 230000006872 improvement Effects 0.000 description 3
- 238000003752 polymerase chain reaction Methods 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 238000005070 sampling Methods 0.000 description 3
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 2
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 2
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 2
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 2
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 2
- ULZOQOKFYMXHPZ-AQZXSJQPSA-N Asn-Trp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ULZOQOKFYMXHPZ-AQZXSJQPSA-N 0.000 description 2
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 2
- SXLCDCZHNCLFGZ-BPUTZDHNSA-N Asp-Pro-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SXLCDCZHNCLFGZ-BPUTZDHNSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- XGIAHEUULGOZHH-GUBZILKMSA-N Cys-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N XGIAHEUULGOZHH-GUBZILKMSA-N 0.000 description 2
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 2
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 2
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 2
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 2
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 2
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 2
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 2
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 2
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 2
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 2
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 description 2
- 206010020649 Hyperkeratosis Diseases 0.000 description 2
- 241001084594 Leptostachya Species 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 2
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 2
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 2
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- 241000208125 Nicotiana Species 0.000 description 2
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 2
- 108091034057 RNA (poly(A)) Proteins 0.000 description 2
- 108010052090 Renilla Luciferases Proteins 0.000 description 2
- 108091027981 Response element Proteins 0.000 description 2
- 235000004789 Rosa xanthina Nutrition 0.000 description 2
- 241000109329 Rosa xanthina Species 0.000 description 2
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 2
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 2
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 2
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 2
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 2
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 235000013399 edible fruits Nutrition 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 238000007834 ligase chain reaction Methods 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 230000009456 molecular mechanism Effects 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 210000001938 protoplast Anatomy 0.000 description 2
- 229910001415 sodium ion Inorganic materials 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 238000011895 specific detection Methods 0.000 description 2
- 239000007858 starting material Substances 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- CSAHOYQKNHGDHX-ACZMJKKPSA-N Ala-Gln-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CSAHOYQKNHGDHX-ACZMJKKPSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- 101100025365 Arabidopsis thaliana MYB5 gene Proteins 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 1
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- IQCJOIHDVFJQFV-LKXGYXEUSA-N Asp-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O IQCJOIHDVFJQFV-LKXGYXEUSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 1
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000331 Firefly luciferases Proteins 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- 235000000100 Hibiscus rosa sinensis Nutrition 0.000 description 1
- 241000251511 Holothuroidea Species 0.000 description 1
- 101000597553 Homo sapiens Protein odr-4 homolog Proteins 0.000 description 1
- 101000687474 Homo sapiens Rhombotin-1 Proteins 0.000 description 1
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- KWURTLAFFDOTEQ-GUBZILKMSA-N Leu-Cys-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KWURTLAFFDOTEQ-GUBZILKMSA-N 0.000 description 1
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- IYXDSYWCVVXSKB-CIUDSAMLSA-N Met-Asn-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IYXDSYWCVVXSKB-CIUDSAMLSA-N 0.000 description 1
- RKIIYGUHIQJCBW-SRVKXCTJSA-N Met-His-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RKIIYGUHIQJCBW-SRVKXCTJSA-N 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- 241000207746 Nicotiana benthamiana Species 0.000 description 1
- 108091093037 Peptide nucleic acid Proteins 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- 238000002123 RNA extraction Methods 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108020005091 Replication Origin Proteins 0.000 description 1
- 102100024869 Rhombotin-1 Human genes 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 235000016785 Rosa della China Nutrition 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- GCXFWAZRHBRYEM-NUMRIWBASA-N Thr-Gln-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O GCXFWAZRHBRYEM-NUMRIWBASA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 1
- VDUJEEQMRQCLHB-YTQUADARSA-N Trp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VDUJEEQMRQCLHB-YTQUADARSA-N 0.000 description 1
- XQYHLZNPOTXRMQ-KKUMJFAQSA-N Tyr-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XQYHLZNPOTXRMQ-KKUMJFAQSA-N 0.000 description 1
- IWRMTNJCCMEBEX-AVGNSLFASA-N Tyr-Glu-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O IWRMTNJCCMEBEX-AVGNSLFASA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 230000006229 amino acid addition Effects 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000010310 bacterial transformation Effects 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 238000002306 biochemical method Methods 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical class NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000000368 destabilizing effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 210000005069 ears Anatomy 0.000 description 1
- 210000001339 epidermal cell Anatomy 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 230000004077 genetic alteration Effects 0.000 description 1
- 231100000118 genetic alteration Toxicity 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 229940029575 guanosine Drugs 0.000 description 1
- 238000000265 homogenisation Methods 0.000 description 1
- 239000010903 husk Substances 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 229920005610 lignin Polymers 0.000 description 1
- 238000004020 luminiscence type Methods 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 235000012054 meals Nutrition 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 230000000442 meristematic effect Effects 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 235000019198 oils Nutrition 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 230000008723 osmotic stress Effects 0.000 description 1
- 206010033675 panniculitis Diseases 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 238000003976 plant breeding Methods 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 238000003753 real-time PCR Methods 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 230000001568 sexual effect Effects 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 210000004304 subcutaneous tissue Anatomy 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 238000004114 suspension culture Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000005758 transcription activity Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
- HRXKRNGNAMMEHJ-UHFFFAOYSA-K trisodium citrate Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O HRXKRNGNAMMEHJ-UHFFFAOYSA-K 0.000 description 1
- 229940038773 trisodium citrate Drugs 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
- C12N15/8222—Developmentally regulated expression systems, tissue, organ specific, temporal or spatial regulation
- C12N15/8223—Vegetative tissue-specific promoters
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- Wood Science & Technology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Botany (AREA)
- Plant Pathology (AREA)
- Gastroenterology & Hepatology (AREA)
- Microbiology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Medicinal Chemistry (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
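The present invention relates to a prickle-specific promoter and belongs to the field of plant genetic engineering. The invention discloses a prickle-specific promoter whose nucleotide sequence consists of the sequence shown at positions 1672-1998 of SEQ ID NO:1 and which can be used to regulate prickle development in plants.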
Description
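Technical Field
The present invention relates to a prickle-specific promoter and belongs to the field of plant genetic engineering.
Background
Most plants of the genus Rosa have high ornamental value. However, many of them, such as rugosa rose, Chinese rose and multiflora rose, bear prickles on their stems, which complicates field cultivation and management as well as flower picking and transport. Breeding new prickleless or soft-prickled Rosa varieties would therefore improve their quality, substantially raise production efficiency and markedly lower production costs.
Prickles are widespread in plants. They are outgrowths formed by the epidermal cells and subepidermal tissue of stems and branches, and in origin they share some similarities with trichomes. In addition, the hardening of prickles is closely associated with lignin accumulation. In recent years some progress has been made on the molecular mechanism of prickle formation, and several candidate genes related to prickle development have been isolated, for example CPC, WER and MYB5 (Yuan Xiaoyu. Mining and application of genes related to prickle formation in rose [D]. Yangzhou University, 2019.) and TTG1 (Luan Xiaofang. Cloning and expression analysis of RrTTG1, a transcription factor related to prickle formation in rose [D]. Yangzhou University, 2014.).
A deeper understanding of the molecular mechanism of prickle formation will ultimately help in the artificial improvement of prickle traits. However, suppressing or softening a specific tissue such as the prickle also requires a prickle-specific promoter. It is therefore particularly important to identify promoters with prickle-specific expression.
To solve the above problem, the present invention analyzed the transcriptomes of prickle-bearing and prickle-free stem segments from prickleless and prickly Rosa multiflora and identified genes differentially expressed between these tissues. Expression-pattern validation showed that RmNACx is a gene expressed specifically and predominantly in prickles. The 5' promoter sequence of this gene was then cloned, and the expression activity of the promoter and of several truncated fragments was analyzed. Because all of these promoter fragments are active, the promoter can be used to drive specific expression of a target gene in prickles and to regulate prickle development.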
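Summary of the Invention
A first object of the present invention is to provide a prickle-specific promoter.
A second object of the present invention is to provide a gene expression cassette, an expression vector and a host cell comprising the above prickle-specific promoter.
A third object of the present invention is to disclose the use of the above promoter, gene expression cassette, expression vector and host cell in regulating prickle development.
To achieve these objects, the present invention adopts the following technical solutions:
The present invention provides a promoter, characterized in that the nucleotide sequence of the promoter consists of the sequence shown at positions 1672-1998 of SEQ ID NO:1. A nucleic acid molecule of the promoter that comprises the nucleotides of the sequence shown at positions 1672-1998 of SEQ ID NO:1 has the activity of driving expression of a target gene in prickle tissue.
In some embodiments, the nucleotide sequence of the above promoter is any one of the following:
1) the sequence shown at positions 1672-1998 of SEQ ID NO:1; 2) the sequence shown at positions 1304-1998 of SEQ ID NO:1; 3) the sequence shown at positions 788-1998 of SEQ ID NO:1; 4) the sequence shown at positions 349-1998 of SEQ ID NO:1; 5) the sequence shown in SEQ ID NO:1; 6) the sequence shown in SEQ ID NO:5.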
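The coordinate ranges in the list above are 1-based and inclusive on SEQ ID NO:1 (1998 nt). As a minimal sketch, the following Python helper converts such ranges into fragment lengths and string slices. The labels NP1 to NP5 follow the fragment names used in the Examples; the helper names `slice_1based` and `fragment_lengths` are illustrative and not part of the patent.

```python
# 1-based, inclusive coordinates on SEQ ID NO:1 (1998 nt), as listed in the text.
FRAGMENTS = {
    "NP1": (1672, 1998),
    "NP2": (1304, 1998),
    "NP3": (788, 1998),
    "NP4": (349, 1998),
    "NP5": (1, 1998),
}


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return seq[start..end] using 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return seq[start - 1:end]


def fragment_lengths() -> dict:
    """Length in nucleotides of each listed fragment."""
    return {name: end - start + 1 for name, (start, end) in FRAGMENTS.items()}
```

Under these coordinates the shortest fragment (positions 1672-1998) is 327 nt and the four truncations nest inside the full 1998 nt sequence, all sharing the same 3' end.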
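The present invention also provides a gene expression cassette, characterized in that the expression cassette contains any one of the above promoters. When the gene expression cassette consists of the above promoter linked to a gene or fragment that inhibits prickle development, the cassette can be used to specifically suppress prickle growth, to breed prickleless or soft-prickled plant varieties, or to produce prickleless or soft-prickled stem segments.
The present invention also provides an expression vector, characterized in that the expression vector contains the above expression cassette.
The present invention also provides a host cell, characterized in that the host cell contains the above expression vector.
In some embodiments, the host cell is a prokaryotic cell.
In some embodiments, the host cell is an Escherichia coli or Agrobacterium cell.
The present invention also provides the use of the above promoter, gene expression cassette, expression vector and host cell in regulating prickle development.
Compared with the prior art, the present invention has the following beneficial effects. Through transcriptome sequencing and expression-pattern validation, the invention found that the RmNACx gene is expressed specifically and predominantly in prickles, a property not reported in previously published material. The invention further cloned the 5' promoter sequence of the gene; this sequence is not recorded in public databases and is a new nucleotide sequence. The invention verified the specific expression activity of the promoter and the expression activity of several truncated fragments. Because all of these promoter fragments have prickle-specific expression activity, the promoter can be used to drive specific expression of a target gene in prickles and thereby to regulate prickle development and to breed prickleless or soft-prickled plant varieties or produce prickleless or soft-prickled stem segments.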
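Brief Description of the Drawings
Figure 1. Schematic of the sampling sites for transcriptome sequencing. Left: prickleless Rosa multiflora; right: Rosa multiflora. Boxes mark the sampled regions. IP: prickle-bearing stem segment of Rosa multiflora; INP: prickle-free stem segment of Rosa multiflora; IPL: stem segment of prickleless Rosa multiflora corresponding to IP.
Figure 2. RNA-level expression analysis of the RmNACx and Rm6733.1 genes in different tissues of Rosa multiflora. A: relative expression of RmNACx; B: relative expression of Rm6733.1. The vertical axis shows expression relative to the reference gene; the horizontal axis shows the tissue samples (petal, flower bud, sepal, anther, stigma, apical bud, lateral bud, leaf, stem, prickle, root).
Figure 3. Map of the pGreenII-0800-LUC vector. The elements and abbreviations are as follows:
LB T-DNA repeat: T-DNA left border repeat
CaMV 35S promoter: cauliflower mosaic virus 35S promoter
Rluc: Renilla (sea pansy) luciferase reporter gene
CaMV poly(A) signal: cauliflower mosaic virus 35S polyadenylation signal
luciferase: firefly luciferase reporter gene
CaMV poly(A) signal: cauliflower mosaic virus 35S polyadenylation signal
RB T-DNA repeat: T-DNA right border repeat
ori: E. coli origin of replication
KanR: kanamycin resistance sequence
pSa ori: pSa origin of replication
Figure 4. Transcriptional activity of the RmNACx promoter. A: predicted cis-acting elements on the promoter, their positions, and the positions of the truncated fragments; B: expression cassette used to assay promoter activity. 35S: 35S promoter of cauliflower mosaic virus (CaMV). REN: Renilla luciferase reporter gene, used as an internal control to normalize the LUC reporter signal; C: LUC luciferase activity (quantified as luminescence intensity); D: color image converted from the luminescence signal collected with a CCD detection system after transient transformation of tobacco leaves. Different leaf regions correspond to the promoter fragments NP1 to NP5; 0800-LUC is the empty-vector control.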
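Detailed Description
The following definitions and methods are provided to better define the present application and to guide those of ordinary skill in the art in practicing it. Unless otherwise stated, terms are to be understood according to their conventional usage by those of ordinary skill in the relevant art. All patent documents, academic papers, industry standards and other publications cited herein are incorporated by reference in their entirety.
Unless otherwise indicated, nucleic acids are written left to right in the 5' to 3' direction, and amino acid sequences are written left to right in the amino to carboxyl direction. Amino acids may be referred to herein by their commonly known three-letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature Commission. Likewise, nucleotides may be referred to by their commonly accepted single-letter codes. Numeric ranges are inclusive of the numbers defining the range. As used herein, "nucleic acid" includes reference to a deoxyribonucleotide or ribonucleotide polymer in single- or double-stranded form and, unless otherwise limited, encompasses known analogues (e.g., peptide nucleic acids) that have the essential nature of natural nucleotides in that they hybridize to single-stranded nucleic acids in a manner similar to naturally occurring nucleotides. As used herein, the terms "encoding" or "encoded", when used in the context of a specified nucleic acid, mean that the nucleic acid comprises the information required to direct translation of the nucleotide sequence into a specified protein. The information by which a protein is encoded is specified by the use of codons. As used herein, "full-length sequence" in reference to a specified polynucleotide or its encoded protein means the entire nucleic acid sequence or the entire amino acid sequence of a native (non-synthetic) endogenous sequence. A full-length polynucleotide encodes the full-length, catalytically active form of the specified protein. The terms "polypeptide", "peptide" and "protein" are used interchangeably herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which one or more amino acid residues are artificial chemical analogues of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers. The terms "residue", "amino acid residue" and "amino acid" are used interchangeably herein to refer to an amino acid that is incorporated into a protein, polypeptide or peptide (collectively "protein"). The amino acid may be a naturally occurring amino acid and, unless otherwise limited, may encompass known analogues of natural amino acids that can function in a manner similar to naturally occurring amino acids.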
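The term "trait" refers to a physiological, morphological, biochemical or physical characteristic of a plant or of a particular plant material or cell. In some cases the characteristic is visible to the human eye, such as seed or plant size, or can be measured by biochemical techniques, such as determining the protein, starch or oil content of seeds or leaves, or by observing a metabolic or physiological process, for example by measuring tolerance to water deprivation or to particular salt, sugar or nitrogen concentrations, or by observing the expression level of one or more genes, or by agronomic observations such as osmotic stress tolerance or yield.
"Transgenic" refers to any cell, cell line, callus, tissue, plant part or plant whose genome has been altered by the presence of a heterologous nucleic acid, such as a recombinant DNA construct. As used herein, the term covers the initial transgenic events as well as those produced from the initial transgenic events by sexual crosses or asexual propagation, and does not encompass alterations of the genome (chromosomal or extrachromosomal) by conventional plant breeding methods or by naturally occurring events such as random cross-fertilization, non-recombinant viral infection, non-recombinant bacterial transformation, non-recombinant transposition or spontaneous mutation.
"Plant" includes reference to whole plants, plant organs, plant tissues, seeds and plant cells, and progeny of the same. Plant cells include, without limitation, cells from seeds, suspension cultures, embryos, meristematic regions, callus tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen and microspores. "Progeny" comprises any subsequent generation of a plant.
In the present application, the words "comprise", "include" and their variants are to be understood as encompassing other elements, numbers or steps in addition to those described. A "subject plant" or "subject plant cell" is a plant or plant cell in which a genetic alteration has been effected, or a progeny cell of a plant or cell so altered that comprises the alteration. A "control", "control plant" or "control plant cell" provides a reference point for measuring changes in the phenotype of the subject plant or plant cell.
A negative or control plant may comprise, for example: (a) a wild-type plant or cell, i.e., one of the same genotype as the starting material for the genetic alteration that produced the subject plant or cell; (b) a plant or plant cell of the same genotype as the starting material but transformed with a null construct (i.e., a construct with no known effect on the trait of interest, such as a construct comprising a marker gene); (c) a plant or plant cell that is a non-transformed segregant of the subject plant or plant cell; (d) a plant or plant cell genetically identical to the subject plant or plant cell but not exposed to conditions or stimuli that would induce expression of the gene of interest; or (e) the subject plant or plant cell itself, under conditions in which the gene of interest is not expressed.
Those skilled in the art will readily appreciate that advances in molecular biology, such as site-directed and random mutagenesis, polymerase chain reaction methods and protein engineering techniques, provide a wide range of suitable tools and procedures for modifying or engineering the amino acid sequences and underlying gene sequences of proteins of agricultural interest.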
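In some embodiments, the nucleotide sequences of the present application may be altered to make conservative amino acid substitutions. The principles and examples of conservative amino acid substitutions are described further below. In certain embodiments, the nucleotide sequences of the present application may be substituted in accordance with published monocot codon preferences without changing the amino acid sequence; for example, codons encoding a given amino acid sequence may be replaced with monocot-preferred codons without changing the amino acid sequence encoded by the nucleotide sequence. In some embodiments, part of a nucleotide sequence of the present application is replaced with different codons encoding the same amino acid sequence, so that the nucleotide sequence changes while the encoded amino acid sequence does not. Conservative variants include those sequences that, because of the degeneracy of the genetic code, encode the amino acid sequence of one of the proteins of the embodiments. In some embodiments, part of a nucleotide sequence of the present application is replaced according to monocot-preferred codons. Those skilled in the art will recognize that amino acid additions and/or substitutions are generally based on the relative similarity of the amino acid side-chain substituents, for example their hydrophobicity, charge, size and the like. Exemplary amino acid substitution groups that take the foregoing properties into account are well known to those skilled in the art and include arginine and lysine; glutamic acid and aspartic acid; serine and threonine; glutamine and asparagine; and valine, leucine and isoleucine. Guidance on appropriate amino acid substitutions that do not affect the biological activity of the protein of interest can be found in the model of Dayhoff et al. (1978) Atlas of Protein Sequence and Structure (Natl. Biomed. Res. Found., Washington, D.C.) (incorporated herein by reference). Conservative substitutions, such as exchanging one amino acid for another having similar properties, may be made.
Identification of sequence identity includes hybridization techniques. For example, all or part of a known nucleotide sequence is used as a probe that selectively hybridizes to other corresponding nucleotide sequences present in a population of cloned genomic DNA fragments or cDNA fragments (i.e., genomic or cDNA libraries) from a chosen organism. The hybridization probes may be genomic DNA fragments, cDNA fragments, RNA fragments or other oligonucleotides, and may be labeled with a detectable group such as 32P or with another detectable marker. Thus, for example, hybridization probes can be made by labeling synthetic oligonucleotides based on the sequences of the embodiments. Methods for preparing hybridization probes and for constructing cDNA and genomic libraries are generally known in the art.
Hybridization of such sequences may be carried out under stringent conditions. As used herein, the terms "stringent conditions" or "stringent hybridization conditions" refer to conditions under which a probe will hybridize to its target sequence to a detectably greater degree than to other sequences (e.g., at least 2-fold, 5-fold or 10-fold over background). Stringent conditions are sequence-dependent and differ in different circumstances. By controlling the stringency of the hybridization and/or washing conditions, target sequences that are 100% complementary to the probe can be identified (homologous probing). Alternatively, stringency conditions can be adjusted to allow some mismatching in sequences so that lower degrees of similarity are detected (heterologous probing). Generally, a probe is less than about 1000 or 500 nucleotides in length. Typically, stringent conditions are those in which the salt concentration is less than about 1.5 M Na ion, typically about 0.01 M to 1.0 M Na ion concentration (or other salts), at pH 7.0 to 8.3, and the temperature is at least about 30°C for short probes (e.g., 10 to 50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides). Stringent conditions may also be achieved by adding destabilizing agents such as formamide. Exemplary low stringency conditions include hybridization in a buffer of 30% to 35% formamide, 1 M NaCl, 1% SDS (sodium dodecyl sulfate) at 37°C, and a wash in 1× to 2× SSC (20× SSC = 3.0 M NaCl/0.3 M trisodium citrate) at 50°C to 55°C. Exemplary moderate stringency conditions include hybridization in 40% to 45% formamide, 1.0 M NaCl, 1% SDS at 37°C, and a wash in 0.5× to 1× SSC at 55°C to 60°C. Exemplary high stringency conditions include hybridization in 50% formamide, 1 M NaCl, 1% SDS at 37°C, and a final wash in 0.1× SSC at 60°C to 65°C for at least about 20 minutes. Optionally, wash buffers may comprise about 0.1% to about 1% SDS. The duration of hybridization is generally less than about 24 hours, usually about 4 to about 12 hours.
Specificity is typically a function of the post-hybridization washes, the critical factors being the ionic strength and temperature of the final wash solution. For DNA-DNA hybrids, the Tm (thermal melting point) can be approximated from the equation of Meinkoth and Wahl (1984) Anal. Biochem. 138:267-284: Tm = 81.5°C + 16.6 (log M) + 0.41 (%GC) - 0.61 (% formamide) - 500/L, where M is the molarity of monovalent cations, %GC is the percentage of guanosine and cytosine nucleotides in the DNA, % formamide is the percentage of formamide in the hybridization solution, and L is the length of the hybrid in base pairs. The Tm is the temperature (under defined ionic strength and pH) at which 50% of a complementary target sequence hybridizes to a perfectly matched probe. Washing is typically carried out at least until equilibrium is reached and a low background level of hybridization is achieved, such as for 2 hours, 1 hour or 30 minutes. Tm is reduced by about 1°C for each 1% of mismatching; thus, Tm, hybridization and/or wash conditions can be adjusted to hybridize to sequences of the desired identity. For example, if sequences with ≥90% identity are sought, the Tm can be decreased by 10°C. Generally, stringent conditions are selected to be about 5°C lower than the Tm for the specific sequence and its complement at a defined ionic strength and pH. However, severely stringent conditions can use a hybridization and/or wash at 4°C lower than the Tm; moderately stringent conditions can use a hybridization and/or wash at 6°C lower than the Tm; and low stringency conditions can use a hybridization and/or wash at 11°C lower than the Tm.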
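The Meinkoth and Wahl approximation quoted above can be written as a small function. The sketch below is illustrative only: the function and parameter names are assumptions made for this example, and the logarithm is taken as base 10, as is conventional for this formula.

```python
import math


def tm_meinkoth_wahl(na_molar: float, gc_percent: float,
                     formamide_percent: float, length_bp: int) -> float:
    """Approximate Tm (deg C) of a DNA-DNA hybrid.

    Tm = 81.5 + 16.6*log10(M) + 0.41*(%GC) - 0.61*(%formamide) - 500/L
    """
    if na_molar <= 0 or length_bp <= 0:
        raise ValueError("cation molarity and hybrid length must be positive")
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.61 * formamide_percent
            - 500.0 / length_bp)


def tm_with_mismatch(tm: float, mismatch_percent: float) -> float:
    """Apply the rule of thumb of about 1 deg C lower Tm per 1% mismatch."""
    return tm - mismatch_percent
```

For a 500 bp hybrid with 50% GC in 1 M monovalent cation, the formula gives 101.0°C without formamide and 70.5°C with 50% formamide, which shows why formamide allows stringent hybridization at 37°C.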
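Some embodiments also include fragments of the nucleotide sequences and of the amino acid sequences they encode. As used herein, the term "fragment" refers to a portion of the nucleotide sequence of a polynucleotide or a portion of the amino acid sequence of a polypeptide of the embodiments. Fragments of a nucleotide sequence may encode protein fragments that retain the biological activity of the native or corresponding full-length protein and hence have protein activity. Variant proteins include biologically active fragments of the native protein that comprise contiguous amino acid residues retaining the biological activity of the native protein. Some embodiments also include transformed plant cells or transgenic plants comprising at least one nucleotide sequence of the embodiments. In some embodiments, plants are transformed with an expression vector comprising at least one nucleotide sequence of the embodiments operably linked to a promoter that drives expression in a plant cell. Transformed plant cells and transgenic plants are plant cells or plants comprising a heterologous polynucleotide within their genome. Generally, the heterologous polynucleotide is stably integrated within the genome of the transformed plant cell or transgenic plant so that the polynucleotide is passed on to successive generations. The heterologous polynucleotide may be integrated into the genome alone or as part of an expression vector. In some embodiments, the plants to which the present application relates include plant cells, plant protoplasts, plant cell tissue cultures from which plants can be regenerated, plant calli, plant clumps, and plant cells that are intact in plants or in parts of plants such as embryos, pollen, ovules, seeds, leaves, flowers, branches, fruits, kernels, ears, cobs, husks, stalks, roots, root tips, anthers and the like. The present application also includes plant cells, protoplasts, tissues, calli, embryos, and flowers, stems, fruits, leaves and roots that originate from the transgenic plants of the present application or their progeny and therefore comprise, at least in part, the nucleotide sequences of the present application.
In the context of nucleic acid amplification, the term "amplifying" refers to any process by which additional copies of a selected nucleic acid (or a transcribed form thereof) are produced. Typical amplification methods include various polymerase-based replication methods, including the polymerase chain reaction (PCR), ligase-mediated methods such as the ligase chain reaction (LCR), and RNA polymerase-based amplification methods (e.g., by transcription).
The following examples illustrate the present invention but are not intended to limit its scope. Modifications or substitutions of the methods, steps or conditions of the present invention made without departing from its spirit and essence fall within the scope of the present application. Unless otherwise specified, the examples follow conventional experimental conditions, such as those in the molecular cloning laboratory manual of Sambrook et al. (Sambrook J & Russell D W, Molecular cloning: a laboratory manual, 2001), or the conditions recommended in the manufacturer's instructions. Unless otherwise specified, the chemical reagents used in the examples are conventional commercially available reagents, and the technical means used are conventional means well known to those skilled in the art.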
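Example 1. Transcriptome sequencing of stem segments from prickly and prickleless Rosa multiflora
Rosa multiflora is a diploid deciduous shrub of the genus Rosa that bears prickles on its stems and petioles and is one of the main parents of modern roses. Prickleless Rosa multiflora (Rosa multiflora Thunb. var. inermis) is a variety of Rosa multiflora whose stems are smooth and completely free of prickles. The Rosa multiflora and prickleless Rosa multiflora used in the present invention are common plants grown at Huazhong Agricultural University; they are also widely grown elsewhere and are available to the public.
Total RNA was extracted separately from prickle-bearing stem segments of Rosa multiflora (internode segment with prickle, IP), prickle-free stem segments of Rosa multiflora (internode segment without prickle, INP) and the stem segments of prickleless Rosa multiflora corresponding to IP (internode segment of prickleless, IPL); the sampling sites are shown in Figure 1. After total RNA extraction and DNase I treatment, mRNA was isolated with Oligo(dT) magnetic beads and cDNA libraries were constructed. The quality of the cDNA libraries was checked with an ABI StepOnePlus Real-Time PCR system and an Agilent 2100 Bioanalyzer, and the libraries were sequenced on the Illumina HiSeq 2000 platform. In the RNA-Seq experiment, three biological replicates were set up for each of the 3 sample types, giving 9 samples in total. After sequencing, the data were analyzed to find genes differentially expressed between samples.
In the IP vs INP comparison there were 96 differentially expressed genes, of which 94 were up-regulated and 2 were down-regulated; in IP vs IPL there were 8281 differentially expressed genes, of which 4012 were up-regulated and 4269 were down-regulated; in IPL vs INP there were 8370 differentially expressed genes, of which 4324 were up-regulated and 4046 were down-regulated.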
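The counts reported above can be tabulated and checked for internal consistency (up-regulated plus down-regulated should equal the total for each comparison). This is a small bookkeeping sketch; the dictionary layout and function name are illustrative choices, and the numbers are those stated in the text.

```python
# Differentially expressed gene (DEG) counts as reported in Example 1.
DEG_COUNTS = {
    "IP vs INP": {"total": 96, "up": 94, "down": 2},
    "IP vs IPL": {"total": 8281, "up": 4012, "down": 4269},
    "IPL vs INP": {"total": 8370, "up": 4324, "down": 4046},
}


def inconsistent_comparisons(counts: dict) -> list:
    """Return the comparisons whose up + down does not equal the total."""
    return [name for name, c in counts.items()
            if c["up"] + c["down"] != c["total"]]
```

All three comparisons are self-consistent. The two comparisons against the prickleless variety yield roughly eighty times more differentially expressed genes than IP vs INP, which compares two segments of the same plant.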
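Example 2. Cloning of the RmNACx gene and identification of its expression pattern
Among the above differentially expressed genes, a search was made for genes with high expression in IP tissue and low or no expression in INP and IPL tissue. A gene homologous to the NAC domain-containing protein 104-like gene of Rosa chinensis 'Old Blush' (GenBank: XM_024319124.1) met this criterion, and the present invention names this gene in Rosa multiflora RmNACx.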
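The selection criterion described above (high expression in IP, low or no expression in INP and IPL) can be expressed as a simple filter. The sketch below is hypothetical throughout: the patent does not state numeric cutoffs or expression units, so the `high` and `low` thresholds, the function name and the toy gene names are assumptions made only to show the logic.

```python
def prickle_enriched(expr: dict, high: float = 10.0, low: float = 1.0) -> list:
    """Genes with IP expression >= high and INP/IPL expression <= low.

    expr maps gene name -> {"IP": x, "INP": y, "IPL": z}.
    The cutoffs are illustrative assumptions, not values from the patent.
    """
    return sorted(
        gene for gene, v in expr.items()
        if v["IP"] >= high and v["INP"] <= low and v["IPL"] <= low
    )


# Toy values for illustration only (not measured data).
TOY_EXPRESSION = {
    "geneA": {"IP": 50.0, "INP": 0.2, "IPL": 0.0},    # prickle-enriched
    "geneB": {"IP": 40.0, "INP": 35.0, "IPL": 30.0},  # expressed everywhere
    "geneC": {"IP": 0.5, "INP": 0.3, "IPL": 0.1},     # low everywhere
}
```

With the toy values, only `geneA` passes the filter, mirroring the pattern reported for RmNACx.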
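Primers were designed from the sequence of the homologous NAC domain-containing protein 104-like gene in 'Old Blush' (RmNACx-F: GTCGACTTAGTTTGGAAAACTTATTTCATCA, RmNACx-R: GGTACCATGGCAGATAACCCTTCT). A cDNA fragment of the RmNACx gene was cloned from cDNA prepared from IP tissue (prickle-bearing stem segments of Rosa multiflora) and sequenced, giving the coding sequence of RmNACx shown in SEQ ID NO:3. Further verification showed that the genomic sequence of the RmNACx gene is identical in prickleless and prickly Rosa multiflora and that only its expression differs. The deduced amino acid sequence is shown in SEQ ID NO:4. Comparison showed that neither the nucleotide sequence of SEQ ID NO:3 nor the amino acid sequence of SEQ ID NO:4 is recorded in public databases.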
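How an amino acid sequence is deduced from a coding sequence can be illustrated with a short translation routine using the standard genetic code. This is a generic sketch, not the tool used by the inventors (which the text does not name). The two test strings are the first six codons and the last six codons of SEQ ID NO:3 as given in the sequence listing.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence (one-letter codes) up to the first stop codon."""
    cds = cds.upper().replace(" ", "")
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)
```

The 594 nt coding sequence corresponds to 198 codons, that is 197 residues plus a stop codon, which matches the 197-residue length given for SEQ ID NO:4.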
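In addition, when RmNACx was amplified from cDNA prepared from Rosa multiflora leaf tissue, a gene highly homologous to RmNACx was obtained and named Rm6733.1. Sequencing showed that the coding sequence of Rm6733.1 is as shown in SEQ ID NO:6 and the deduced amino acid sequence is as shown in SEQ ID NO:7. The coding sequences of RmNACx and Rm6733.1 are 93.77% similar, and the deduced amino acid sequences are 90.36% similar.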
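The similarity figures above can be reproduced with a position-by-position identity calculation. The sketch below assumes a gap-free alignment, which is plausible here because both coding sequences are 594 nt and both proteins are 197 residues long; the patent does not state which alignment tool was used, and the function name is illustrative.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of identical positions between two equal-length sequences."""
    if not a or len(a) != len(b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return round(100.0 * matches / len(a), 2)
```

Under the gap-free assumption, 178 identical residues out of 197 gives 90.36% and 557 identical nucleotides out of 594 gives 93.77%, consistent with the values stated in the text.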
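Example 3. Identification of the expression patterns of the RmNACx and Rm6733.1 genes
To compare the expression patterns of RmNACx and Rm6733.1, a specific primer pair was designed for each gene. The specific detection primers for RmNACx were RmNACx-F: GGTTACGAGTAATGGATACTGGAAG and RmNACx-R: GAGGAAGAGGCATAAGTATCAGAAT; those for Rm6733.1 were Rm6733.1-F: AATGAATATAGCGGATACGTGTTAT and Rm6733.1-R: TTTGGCAAACTTATTTCATCAAGAT. UBC from Rosa multiflora was used as the reference gene, with detection primers UBC-F: GCCAGAGATTGCCCATATGTA and UBC-R: TCACAGAGTCCTAGCAGCACA.
Using these specific primer pairs and different tissue samples of Rosa multiflora as templates, the RNA-level expression patterns of RmNACx and Rm6733.1 were determined; the results are shown in Figure 2. RmNACx is expressed specifically and predominantly in prickles: its relative expression is about 0.05 in prickles and below 0.01 in roots, and it is almost undetectable in the other tissues tested. Rm6733.1 is expressed in several tissues (root, stem, leaf, bud, stigma and others), with relatively low expression in prickles. RmNACx therefore shows predominant expression in prickles.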
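Figure 2 reports expression relative to the UBC reference gene, but the text does not say how that value was computed. One common convention is the 2^-ΔCt method, sketched below as an assumption; the function name and the Ct values in the usage note are hypothetical.

```python
def relative_expression(ct_target: float, ct_reference: float) -> float:
    """Expression of a target gene relative to a reference gene (2^-dCt).

    dCt = Ct(target) - Ct(reference). This convention is assumed here;
    the patent does not specify the calculation.
    """
    delta_ct = ct_target - ct_reference
    return 2.0 ** (-delta_ct)
```

For example, a hypothetical target Ct of 25 against a reference Ct of 20 gives 2^-5 = 0.03125, the same order of magnitude as the relative expression of about 0.05 reported for RmNACx in prickles.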
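Example 4. Cloning of the RmNACx promoter and activity assay of promoter fragments
The expression pattern of a gene is determined mainly by its promoter. To isolate the promoter sequence of RmNACx, the present invention designed amplification primers based on the promoter sequence of the 'Old Blush' NAC domain-containing protein 104-like gene (NACx-Pro-F: GGATCCTAGCTTTTGAGCTAGTGGTGTCTCT and NACx-Pro-R: CCCGGGGAATAGTAGCTAGCGTGTAAACGAAG), and cloned and sequenced the RmNACx promoter fragment using DNA extracted from young leaves of Rosa multiflora as the template. The resulting RmNACx promoter sequence of about 2.3 kb is shown in SEQ ID NO:5.
Response elements in the sequence shown in SEQ ID NO:5 were predicted (for the method, see http://bioinformatics.psb.ugent.be/webtools/plantcare/html/), which showed that the promoter contains G-Box, TCT-Motif, ATCT-Motif and MRE response elements. Based on the positions of these elements, the promoter fragments NP1 to NP5 were amplified: NP1 contains only one G-Box element; NP2 contains the MRE and G-Box elements; NP3 contains the ATCT-Motif, MRE and G-Box elements; NP4 contains the TCT-Motif, ATCT-Motif, MRE and G-Box elements; and NP5 contains all elements. Five primer pairs were designed (RmNACx-Pro1-0800-F: ctatagggcgaattgggtaccATATGTCCCACAGTCCCACTCATT, RmNACx-Pro2-0800-F: ctatagggcgaattgggtaccTTATGGTAAATACTATAAAGCAAATCAAACA, RmNACx-Pro3-0800-F: ctatagggcgaattgggtaccTATATTTATGTGGGGGCAAAATCTT, RmNACx-Pro4-0800-F: ctatagggcgaattgggtaccGTTTTACCCTTGTATAATACTCGAAGGA, RmNACx-Pro5-0800-F: ctatagggcgaattgggtaccAAGGCTAGAAGCCAAGAACAAAGC, RmNACx-Pro-0800-R: aagcttatcgataccgtcgacGAATAGTAGCTAGCGTGTAAACGAAGT; the five fragments share the same reverse primer, and lowercase letters indicate the artificially added restriction sites), and five promoter fragments of different lengths were cloned using a plasmid containing the SEQ ID NO:5 fragment as the template. The sequences of NP1 to NP5 are, respectively, the sequence shown at positions 1672-1998 of SEQ ID NO:1, the sequence shown at positions 1304-1998 of SEQ ID NO:1, the sequence shown at positions 788-1998 of SEQ ID NO:1, the sequence shown at positions 349-1998 of SEQ ID NO:1, and the sequence shown in SEQ ID NO:1.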
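The start coordinate of each truncated fragment follows from where the template-binding (uppercase) part of its forward primer anneals. The sketch below strips the lowercase 5' extension and locates the remainder in a template. The function names are illustrative; `EXCERPT` is positions 1661-1700 of SEQ ID NO:1 copied from the sequence listing, used so the example stays short.

```python
def gene_specific_part(primer: str) -> str:
    """Drop the lowercase 5' extension; keep the uppercase template-binding part."""
    return "".join(ch for ch in primer if ch.isupper())


def forward_primer_start(template: str, primer: str, offset: int = 0) -> int:
    """1-based position of the first match of the primer's uppercase part.

    offset is the number of template bases that precede the supplied excerpt.
    """
    index = template.upper().find(gene_specific_part(primer))
    if index < 0:
        raise ValueError("primer does not match the template")
    return offset + index + 1


# Positions 1661-1700 of SEQ ID NO:1, as given in the sequence listing.
EXCERPT = "gtggcacagaaatatgtcccacagtcccactcatttgaca"
NP1_FORWARD = "ctatagggcgaattgggtaccATATGTCCCACAGTCCCACTCATT"
```

Calling `forward_primer_start(EXCERPT, NP1_FORWARD, offset=1660)` places the 5' end of the NP1 forward primer at position 1672 of SEQ ID NO:1, in agreement with the stated NP1 range of 1672-1998.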
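The five RmNACx promoter fragments NP1 to NP5 were each inserted into the multiple cloning site of the pGreenII-0800-LUC vector (vector map in Figure 3; vector sequence in SEQ ID NO:2) and transiently transformed into Nicotiana benthamiana (for the procedure, see: Bian Shuxun, et al. Cloning and functional analysis of apple U6 promoters [J]. Scientia Agricultura Sinica, 2019, 52(23): 4364-4373). The results showed that the NP1 fragment, which contains only the G-Box element, already has some activity, and that transcriptional activity increases as the promoter fragment becomes longer; however, once the RmNACx promoter fragment exceeds 1 kb (NP3 to NP5), promoter activity differs little between fragments (results in Figure 4).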
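In a dual-luciferase assay of this kind, the firefly LUC signal driven by the test promoter is normalized to the Renilla REN signal driven by the 35S promoter on the same vector (see the legend of Figure 4). A minimal sketch of that normalization follows; the function name and the readings in the test are hypothetical, and no values from Figure 4 are reproduced.

```python
def relative_luc_activity(readings: dict) -> dict:
    """Normalize firefly LUC luminescence to the Renilla REN internal control.

    readings maps construct name -> (LUC, REN) raw luminescence values.
    Returns construct name -> LUC/REN ratio.
    """
    ratios = {}
    for construct, (luc, ren) in readings.items():
        if ren <= 0:
            raise ValueError(f"REN signal for {construct} must be positive")
        ratios[construct] = luc / ren
    return ratios
```

Comparing the LUC/REN ratio of each fragment with that of the empty 0800-LUC vector gives the promoter activity attributable to the fragment itself, independent of differences in transformation efficiency between leaf regions.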
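These results show that nucleic acid molecules containing the NP1 fragment (the sequence shown at positions 1672-1998 of SEQ ID NO:1), for example NP1 to NP5 and the sequence shown in SEQ ID NO:5, can drive specific, predominant expression of a target gene in prickles. The promoter can therefore be combined with a target gene related to prickle development to construct a gene expression cassette, a corresponding expression vector or a host cell containing the vector, which can be used to transform prickly or prickleless varieties of Chinese rose, rugosa rose, multiflora rose and the like, so that prickle development in these species is regulated precisely without affecting the normal development of other tissues. When the gene expression cassette consists of the above promoter linked to a gene or fragment that inhibits prickle development, the cassette can be used to specifically suppress prickle growth, to breed prickleless or soft-prickled plant varieties, or to produce prickleless or soft-prickled stem segments.
Although the present invention has been described in detail above by way of a general description and specific embodiments, it will be apparent to those skilled in the art that modifications or improvements can be made on the basis of the invention. Such modifications or improvements made without departing from the spirit of the invention fall within the scope of protection claimed.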
Sequence Listing
<110> Huazhong Agricultural University
<120> A prickle-specific promoter
<130> 1
<160> 7
<170> SIPOSequenceListing 1.0
<210> 1
<211> 1998
<212> DNA
<213> Rosa multiflora Thunb.
<400> 1
aaggctagaa gccaagaaca aagccagagg ttatatatat aaagactagt aattatgagc 60
gagtaattat tggtgtatgg gcacactgga acttaaaatt ggttcacgac caaacccata 120
ttcatgtgtg cgacgtggat ggcaagaggt gcatgcatga tctcagcatg aagtggtgat 180
gatgctgact cacattatca actttttatt tctagggttc ttctaaatag taaatacata 240
tatacagatt gtatcaatgt ggcacagcca ctcactctta ttgcagctgc ttcaagtacc 300
aagtcctttg ttctactcat ttgccttttt gttttctgtt taactcctgt tttacccttg 360
tataatactc gaaggagaaa ttgagaaaaa gtacacccaa caaccccacc cattgtaggc 420
actcttcccc ttccctggtg ccggcctacc ctctcagcaa tgatgttgaa ataatagggt 480
tcaatctaca agaatccgat gcttggaaac tcttacattg gcagttactt cttgaaatta 540
cctttctttt tgtagttgtc gatcattgtc caatgtcact gcctgtatta tctgtataga 600
agttacgatc tcgtcgtttg atgccgaaag tttgaccaat gatccactgc tgagtcatcc 660
ttgcatcgct atttgcttat gcgaatgaag atctaataat ggcacaatca gtgtcagtgg 720
cggacgcagg attatattta tgtgggggca aaatattttt aaaagaaata gaatgcggac 780
gtaagactat atttatgtgg gggcaaaatc tttttgaaag aaatagaatt ttgaaagaaa 840
tagaatgcgg acgcaggatt atatttatgt gggggcaaaa tctttttgaa agaaatagaa 900
tgttccttaa tgaaagatgg gttaatattt acttttaata cttccacaac ttcataataa 960
aaaataaaaa ataaaaatcc aatccataaa tattttgaaa aaaaaaatgc atttatgagc 1020
aaaacataac aactatcaat caatataact ctaattatat acaagaaaaa gaaaaaaaat 1080
tccgaaaaca aaagaagaag agaatcgaca atcaacaagc tgcaaaataa ttaacaatac 1140
atgtaaaata gactagaaag agagtttttc tattcatttc ttttatgttt gtcggccaac 1200
aaacacagtt ttttactaat actatatgta aaagtcttgt attattctag agtagggact 1260
tttattactc ttcaacgggg gcaatattgt tgaaacgatt tcattatggt aaatactata 1320
aagcaaatca aacaaaacca aaaaaatgag tgggagtagc tgccctcagt gggccttagc 1380
tgcgtccgcc actgattgtc attatgctta accgggttga atacgcatga ttaaacaatt 1440
gatcatttac atgtcaactt gtcaaaacac gattaataag tggttgggtt tagatcatca 1500
actcgccaat ttgcaaataa cccgcttaac ctaataccaa cgttaaaatt caagaaatgt 1560
aaacatttat attttttgtt taaagaagtt tctatttttt ttttctgttt tttttttttg 1620
aaacgagaag tttttattct ttattttgta aacaattttg gtggcacaga aatatgtccc 1680
acagtcccac tcatttgaca gcactcagta tctaaacagg tacgcagttg cccgccgcct 1740
tgcatctcag agcactttta ccatcatcat caacgaacca atcaactcct ccatttgtta 1800
tgaaaaccac cttctttttc acgtcactca ttcactcact tatttttttt tttccgttgt 1860
ttttgttctt cacttcatct ctctctctct ctctctctct ctctctctct ctctctctct 1920
ctctctctct ctctcgcttt acagacgcta taaataccat cgtcagctga aacttcgttt 1980
acacgctagc tactattc 1998
<210> 2
<211> 6382
<212> DNA
<213> Escherichia coli
<400> 2
tttttatccc cggaagcctg tggatagagg gtagttatcc acgtgaaacc gctaatgccc 60
cgcaaagcct tgattcacgg ggctttccgg cccgctccaa aaactatcca cgtgaaatcg 120
ctaatcaggg tacgtgaaat cgctaatcgg agtacgtgaa atcgctaata aggtcacgtg 180
aaatcgctaa tcaaaaaggc acgtgagaac gctaatagcc ctttcagatc aacagcttgc 240
aaacacccct cgctccggca agtagttaca gcaagtagta tgttcaatta gcttttcaat 300
tatgaatata tatatcaatt attggtcgcc cttggcttgt ggacaatgcg ctacgcgcac 360
cggctccgcc cgtggacaac cgcaagcggt tgcccaccgt cgagcgccag cgcctttgcc 420
cacaacccgg cggccggccg caacagatcg ttttataaat tttttttttt gaaaaagaaa 480
aagcccgaaa ggcggcaacc tctcgggctt ctggatttcc gatccccgga attagagatc 540
ttggcaggat atattgtggt gtaacgttat cgtaccccta ctccaaaaat gtcaaagata 600
cagtctcaga agaccaaagg gctattgaga cttttcaaca aagggtaatt tcgggaaacc 660
tcctcggatt ccattgccca gctatctgtc acttcatcga aaggacagta gaaaaggaag 720
gtggctccta caaatgccat cattgcgata aaggaaaggc tatcattcaa gatgcctctg 780
ccgacagtgg tcccaaagat ggacccccac ccacgaggag catcgtggaa aaagaagacg 840
ttccaaccac gtcttcaaag caagtggatt gatgtgacat ctccactgac gtaagggatg 900
acgcacaatc ccactatcct tcgcaagacc cttcctctat ataaggaagt tcatttcatt 960
tggagaggac agcccaccac catgacttcg aaagtttatg atccagaaca aaggaaacgg 1020
atgataactg gtccgcagtg gtgggccaga tgtaaacaaa tgaatgttct tgattcattt 1080
attaattatt atgattcaga aaaacatgca gaaaatgctg ttattttttt acatggtaac 1140
gcggcctctt cttatttatg gcgacatgtt gtgccacata ttgagccagt agcgcggtgt 1200
attataccag accttattgg tatgggcaaa tcaggcaaat ctggtaatgg ttcttatagg 1260
ttacttgatc attacaaata tcttactgca tggtttgaac ttcttaattt accaaagaag 1320
atcatttttg tcggccatga ttggggtgct tgtttggcat ttcattatag ctatgagcat 1380
caagataaga tcaaagcaat agttcacgct gaaagtgtag tagatgtgat tgaatcatgg 1440
gatgaatggc ctgatattga agaagatatt gcgttgatca aatctgaaga aggagaaaaa 1500
atggttttgg agaataactt cttcgtggaa accatgttgc catcaaaaat catgagaaag 1560
ttagaaccag aagaatttgc agcatatctt gaaccattca aagagaaagg tgaagttcgt 1620
cgtccaacat tatcatggcc tcgtgaaatc ccgttagtaa aaggtggtaa acctgacgtt 1680
gtacaaattg ttaggaatta taatgcttat ctacgtgcaa gtgatgattt accaaaaatg 1740
tttattgaat cggacccagg attcttttcc aatgctattg ttgaaggtgc caagaagttt 1800
cctaatactg aatttgtcaa agtaaaaggt cttcattttt cgcaagaaga tgcacctgat 1860
gaaatgggaa aatatatcaa atcgttcgtt gagcgagttc tcaaaaatga acaataattc 1920
tagccggtac gctgaaatca ccagtctctc tctacaaatc tatctctctc tattttctcc 1980
ataaataatg tgtgagtagt ttcccgataa gggaaattag ggttcttata gggtttcgct 2040
catgtgttga gcatataaga aacccttagt atgtatttgt atttgtaaaa tacttctatc 2100
aataaaattt ctaattccta aaaccaaaat ccagtactaa aatccagatc gataacatta 2160
acgtttacaa tttccattcg ccattcaggc tgcgcaactg ttgggaaggg cgatcggtgc 2220
gggcctcttc gctattacgc cagctggcga aagggggatg tgctgcaagg cgattaagtt 2280
gggtaacgcc agggttttcc cagtcacgac gttgtaaaac gacggccagt gaattgtaat 2340
acgactcact atagggcgaa ttgggtaccg ggccccccct cgaggtcgac ggtatcgata 2400
agcttgatat cgaattcctg cagcccgggg gatccactag ttctagagcg gccgccaccg 2460
cggtggagat cgaattccat ggaagacgcc aaaaacataa agaaaggccc ggcgccattc 2520
tatccgctgg aagatggaac cgctggagag caactgcata aggctatgaa gagatacgcc 2580
ctggttcctg gaacaattgc ttttacagat gcacatatcg aggtggacat cacttacgct 2640
gagtacttcg aaatgtccgt tcggttggca gaagctatga aacgatatgg gctgaataca 2700
aatcacagaa tcgtcgtatg cagtgaaaac tctcttcaat tctttatgcc ggtgttgggc 2760
gcgttattta tcggagttgc agttgcgccc gcgaacgaca tttataatga acgtgaattg 2820
ctcaacagta tgggcatttc gcagcctacc gtggtgttcg tttccaaaaa ggggttgcaa 2880
aaaattttga acgtgcaaaa aaagctccca atcatccaaa aaattattat catggattct 2940
aaaacggatt accagggatt tcagtcgatg tacacgttcg tcacatctca tctacctccc 3000
ggttttaatg aatacgattt tgtgccagag tccttcgata gggacaagac aattgcactg 3060
atcatgaact cctctggatc tactggtctg cctaaaggtg tcgctctgcc tcatagaact 3120
gcctgcgtga gattctcgca tgccagagat cctatttttg gcaatcaaat cattccggat 3180
actgcgattt taagtgttgt tccattccat cacggttttg gaatgtttac tacactcgga 3240
tatttgatat gtggatttcg agtcgtctta atgtatagat ttgaagaaga gctgtttctg 3300
aggagccttc aggattacaa gattcaaagt gcgctgctgg tgccaaccct attctccttc 3360
ttcgccaaaa gcactctgat tgacaaatac gatttatcta atttacacga aattgcttct 3420
ggtggcgctc ccctctctaa ggaagtcggg gaagcggttg ccaagaggtt ccatctgcca 3480
ggtatcaggc aaggatatgg gctcactgag actacatcag ctattctgat tacacccgag 3540
ggggatgata aaccgggcgc ggtcggtaaa gttgttccat tttttgaagc gaaggttgtg 3600
gatctggata ccgggaaaac gctgggcgtt aatcaaagag gcgaactgtg tgtgagaggt 3660
cctatgatta tgtccggtta tgtaaacaat ccggaagcga ccaacgcctt gattgacaag 3720
gatggatggc tacattctgg agacatagct tactgggacg aagacgaaca cttcttcatc 3780
gttgaccgcc tgaagtctct gattaagtac aaaggctatc aggtggctcc cgctgaattg 3840
gaatccatct tgctccaaca ccccaacatc ttcgacgcag gtgtcgcagg tcttcccgac 3900
gatgacgccg gtgaacttcc cgccgccgtt gttgttttgg agcacggaaa gacgatgacg 3960
gaaaaagaga tcgtggatta cgtcgccagt caagtaacaa ccgcgaaaaa gttgcgcgga 4020
ggagttgtgt ttgtggacga agtaccgaaa ggtcttaccg gaaaactcga cgcaagaaaa 4080
atcagagaga tcctcataaa ggccaagaag ggcggaaaga tcgccgtgta attctagaga 4140
attcgctgaa atcaccagtc tctctctaca aatctatctc tctctatttt ctccataaat 4200
aatgtgtgag tagtttcccg ataagggaaa ttagggttct tatagggttt cgctcatgtg 4260
ttgagcatat aagaaaccct tagtatgtat ttgtatttgt aaaatacttc tatcaataaa 4320
atttctaatt cctaaaacca aaatccagta ctaaaatcca gatccactag ccttgacagg 4380
atatattggc gggtaaacta agtcgctgta tgtgtttgtt tgagatctca tgtgagcaaa 4440
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 4500
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 4560
aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 4620
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc 4680
tcatagctca cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg 4740
tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga 4800
gtccaacccg gtaagacacg acttatcgcc actggcagca gccactggta acaggattag 4860
cagagcgagg tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta 4920
cactagaaga acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaagaag 4980
agttggtagc tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg 5040
caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac 5100
ggggtctgac gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc 5160
aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag 5220
tatatatgtg taacattggt ctagtgatta gaaaaactca tcgagcatca aatgaaactg 5280
caatttattc atatcaggat tatcaatacc atatttttga aaaagccgtt tctgtaatga 5340
aggagaaaac tcaccgaggc agttccatag gatggcaaga tcctggtatc ggtctgcgat 5400
tccgactcgt ccaacatcaa tacaacctat taatttcccc tcgtcaaaaa taaggttatc 5460
aagtgagaaa tcaccatgag tgacgactga atccggtgag aatggcaaaa gtttatgcat 5520
ttctttccag acttgttcaa caggccagcc attacgctcg tcatcaaaat cactcgcatc 5580
aaccaaaccg ttattcattc gtgattgcgc ctgagcgaga cgaaatacgc gatcgctgtt 5640
aaaaggacaa ttacaaacag gaatcgaatg caaccggcgc aggaacactg ccagcgcatc 5700
aacaatattt tcacctgaat caggatattc ttctaatacc tggaatgctg ttttccctgg 5760
gatcgcagtg gtgagtaacc atgcatcatc aggagtacgg ataaaatgct tgatggtcgg 5820
aagaggcata aattccgtca gccagtttag tctgaccatc tcatctgtaa caacattggc 5880
aacgctacct ttgccatgtt tcagaaacaa ctctggcgca tcgggcttcc catacaatcg 5940
gtagattgtc gcacctgatt gcccgacatt atcgcgagcc catttatacc catataaatc 6000
agcatccatg ttggaattta atcgcggcct tgagcaagac gtttcccgtt gaatatggct 6060
cataacaccc cttgtattac tgtttatgta agcagacagt tttattgttc atgatgatat 6120
atttttatct tgtgcaatgt aacatcagag attttgagac acaacgtggc tttgttgaat 6180
aaatcgaact tttgctgagt tgaaggatca gatcacgcat cttcccgaca acgcagaccg 6240
ttccgtggca aagcaaaagt tcaaaatcac caactggtcc acctacaaca aagctctcat 6300
caaccgtggc tccctcactt tctggctgga tgatggggcg attcaggcga tccccatcca 6360
acagcccgcc gtcgagcggg ct 6382
<210> 3
<211> 594
<212> DNA
<213> Artificially synthesized (unknown)
<400> 3
atggcagata acccttctca gctccctcct ggttttcgat ttttcccaac agatgaagag 60
ctcgttgtcc atttccttca acgcaaggca gctcttttac ccctccaccc cgatgtcatt 120
cctgatctcg atctctgtcc atacgatcca tgggagctaa atggtaaggc tctgtgtgag 180
ggtaagcagt ggtacttcta tagcaggagg gcgcaaaatc gggttacgag taatggatac 240
tggaagccat taggcaccga ggaacctgtc atatcttata cttctaacaa caaagttgga 300
atcaaaagat acttcgggtt ctacgtggga gaagctccat tggggatcca aacaaactgg 360
acaatgaatg agtatcgtct ctctgcagat tctgatactt atgcctcttc ctccaccact 420
agatcatcga agagaagagg gcatcggaaa gcagttaatg aatatagtgg atacgtgttg 480
tgtcgagttt atgagtgcga cgaagatgag gacgatgatg ggacagagct ttcatgtttg 540
gacgaggtgt acttatcttt ggatgatctt gatgaaataa gttttccaaa ctaa 594
<210> 4
<211> 197
<212> PRT
<213> Rosa multiflora Thunb.
<400> 4
Met Ala Asp Asn Pro Ser Gln Leu Pro Pro Gly Phe Arg Phe Phe Pro
1 5 10 15
Thr Asp Glu Glu Leu Val Val His Phe Leu Gln Arg Lys Ala Ala Leu
20 25 30
Leu Pro Leu His Pro Asp Val Ile Pro Asp Leu Asp Leu Cys Pro Tyr
35 40 45
Asp Pro Trp Glu Leu Asn Gly Lys Ala Leu Cys Glu Gly Lys Gln Trp
50 55 60
Tyr Phe Tyr Ser Arg Arg Ala Gln Asn Arg Val Thr Ser Asn Gly Tyr
65 70 75 80
Trp Lys Pro Leu Gly Thr Glu Glu Pro Val Ile Ser Tyr Thr Ser Asn
85 90 95
Asn Lys Val Gly Ile Lys Arg Tyr Phe Gly Phe Tyr Val Gly Glu Ala
100 105 110
Pro Leu Gly Ile Gln Thr Asn Trp Thr Met Asn Glu Tyr Arg Leu Ser
115 120 125
Ala Asp Ser Asp Thr Tyr Ala Ser Ser Ser Thr Thr Arg Ser Ser Lys
130 135 140
Arg Arg Gly His Arg Lys Ala Val Asn Glu Tyr Ser Gly Tyr Val Leu
145 150 155 160
Cys Arg Val Tyr Glu Cys Asp Glu Asp Glu Asp Asp Asp Gly Thr Glu
165 170 175
Leu Ser Cys Leu Asp Glu Val Tyr Leu Ser Leu Asp Asp Leu Asp Glu
180 185 190
Ile Ser Phe Pro Asn
195
<210> 5
<211> 2315
<212> DNA
<213> Rosa multiflora Thunb.
<400> 5
tagcttttga gctagtggtg tctcttttga caacaatgtt gaatgaggtt ctaagtatga 60
agcccccacc cccaaaacta aaagaacatt ttgaagtaaa tatatggatt attgagattc 120
atcgcaccta ctttgcatga aggccctagg taggtaggtt gtacatattg tcatctgtgt 180
atacgcatgt gtaagaattc tttatccgac aatttcatta gagaatagat gcagttgtac 240
tttatggagc tcttttgatg ctgattataa catcttccaa attttcctat taaatattga 300
tgtccagtct ttaaaacaag gctagaagcc aagaacaaag ccagaggtta tatatataaa 360
gactagtaat tatgagcgag taattattgg tgtatgggca cactggaact taaaattggt 420
tcacgaccaa acccatattc atgtgtgcga cgtggatggc aagaggtgca tgcatgatct 480
cagcatgaag tggtgatgat gctgactcac attatcaact ttttatttct agggttcttc 540
taaatagtaa atacatatat acagattgta tcaatgtggc acagccactc actcttattg 600
cagctgcttc aagtaccaag tcctttgttc tactcatttg cctttttgtt ttctgtttaa 660
ctcctgtttt acccttgtat aatactcgaa ggagaaattg agaaaaagta cacccaacaa 720
ccccacccat tgtaggcact cttccccttc cctggtgccg gcctaccctc tcagcaatga 780
tgttgaaata atagggttca atctacaaga atccgatgct tggaaactct tacattggca 840
gttacttctt gaaattacct ttctttttgt agttgtcgat cattgtccaa tgtcactgcc 900
tgtattatct gtatagaagt tacgatctcg tcgtttgatg ccgaaagttt gaccaatgat 960
ccactgctga gtcatccttg catcgctatt tgcttatgcg aatgaagatc taataatggc 1020
acaatcagtg tcagtggcgg acgcaggatt atatttatgt gggggcaaaa tatttttaaa 1080
agaaatagaa tgcggacgta agactatatt tatgtggggg caaaatcttt ttgaaagaaa 1140
tagaattttg aaagaaatag aatgcggacg caggattata tttatgtggg ggcaaaatct 1200
ttttgaaaga aatagaatgt tccttaatga aagatgggtt aatatttact tttaatactt 1260
ccacaacttc ataataaaaa ataaaaaata aaaatccaat ccataaatat tttgaaaaaa 1320
aaaatgcatt tatgagcaaa acataacaac tatcaatcaa tataactcta attatataca 1380
agaaaaagaa aaaaaattcc gaaaacaaaa gaagaagaga atcgacaatc aacaagctgc 1440
aaaataatta acaatacatg taaaatagac tagaaagaga gtttttctat tcatttcttt 1500
tatgtttgtc ggccaacaaa cacagttttt tactaatact atatgtaaaa gtcttgtatt 1560
attctagagt agggactttt attactcttc aacgggggca atattgttga aacgatttca 1620
ttatggtaaa tactataaag caaatcaaac aaaaccaaaa aaatgagtgg gagtagctgc 1680
cctcagtggg ccttagctgc gtccgccact gattgtcatt atgcttaacc gggttgaata 1740
cgcatgatta aacaattgat catttacatg tcaacttgtc aaaacacgat taataagtgg 1800
ttgggtttag atcatcaact cgccaatttg caaataaccc gcttaaccta ataccaacgt 1860
taaaattcaa gaaatgtaaa catttatatt ttttgtttaa agaagtttct attttttttt 1920
tctgtttttt ttttttgaaa cgagaagttt ttattcttta ttttgtaaac aattttggtg 1980
gcacagaaat atgtcccaca gtcccactca tttgacagca ctcagtatct aaacaggtac 2040
gcagttgccc gccgccttgc atctcagagc acttttacca tcatcatcaa cgaaccaatc 2100
aactcctcca tttgttatga aaaccacctt ctttttcacg tcactcattc actcacttat 2160
tttttttttt ccgttgtttt tgttcttcac ttcatctctc tctctctctc tctctctctc 2220
tctctctctc tctctctctc tctctctctc tcgctttaca gacgctataa ataccatcgt 2280
cagctgaaac ttcgtttaca cgctagctac tattc 2315
<210> 6
<211> 594
<212> DNA
<213> Rosa multiflora Thunb.
<400> 6
atggcagata acccttttca gctccctcct ggttttcgat ttttcccgac agatgaagag 60
ctcgtagtcc atttccttca acgcaaggca gctctcttac ccctccaccc cgatgtcatt 120
cctgatctcg atctctatcc atacgatcca tgggagctaa atggtaaggc tctatctgag 180
ggtaagcagt ggtacttcta tagcaggagg acgcaaaatc gggttacgag tagtggatat 240
tggaagctgt taggcaccga ggaacctgtc atatcttcta cttctaacaa caaaattgga 300
atcaaaagat acttcgggtt ctacgtggga aaagctcctt cagggatcaa aacaaactgg 360
acaatgcatg agtatcgtct ctcagcagtc tcggatactt gtgcttcttc ttccacccct 420
agatcatcga agagaagagg gcaccggaaa gcagttaatg aatatagcgg atacgtgtta 480
tgtcgagttt atgagcgcgg agaagatgag gacgatgatg ggactgagct ttcatgtttg 540
gacgaggtgt tcttatcttt ggatgatctt gatgaaataa gtttgccaaa ctaa 594
<210> 7
<211> 197
<212> PRT
<213> Rosa multiflora Thunb.
<400> 7
Met Ala Asp Asn Pro Phe Gln Leu Pro Pro Gly Phe Arg Phe Phe Pro
1 5 10 15
Thr Asp Glu Glu Leu Val Val His Phe Leu Gln Arg Lys Ala Ala Leu
20 25 30
Leu Pro Leu His Pro Asp Val Ile Pro Asp Leu Asp Leu Tyr Pro Tyr
35 40 45
Asp Pro Trp Glu Leu Asn Gly Lys Ala Leu Ser Glu Gly Lys Gln Trp
50 55 60
Tyr Phe Tyr Ser Arg Arg Thr Gln Asn Arg Val Thr Ser Ser Gly Tyr
65 70 75 80
Trp Lys Leu Leu Gly Thr Glu Glu Pro Val Ile Ser Ser Thr Ser Asn
85 90 95
Asn Lys Ile Gly Ile Lys Arg Tyr Phe Gly Phe Tyr Val Gly Lys Ala
100 105 110
Pro Ser Gly Ile Lys Thr Asn Trp Thr Met His Glu Tyr Arg Leu Ser
115 120 125
Ala Val Ser Asp Thr Cys Ala Ser Ser Ser Thr Pro Arg Ser Ser Lys
130 135 140
Arg Arg Gly His Arg Lys Ala Val Asn Glu Tyr Ser Gly Tyr Val Leu
145 150 155 160
Cys Arg Val Tyr Glu Arg Gly Glu Asp Glu Asp Asp Asp Gly Thr Glu
165 170 175
Leu Ser Cys Leu Asp Glu Val Phe Leu Ser Leu Asp Asp Leu Asp Glu
180 185 190
Ile Ser Leu Pro Asn
195
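The listing for SEQ ID NO: 6 and SEQ ID NO: 7 can be checked against itself: 594 nucleotides make 198 codons, which is 197 residues plus one stop codon. The following is a minimal sketch of that check, assuming the standard genetic code applies and that each nucleotide line ends with a running position counter. The helper names `parse_listing_line` and `translate` are hypothetical and are not part of the patent.

```python
from itertools import product

# Standard genetic code, with codons enumerated in t, c, a, g order (assumed to apply here).
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def parse_listing_line(line: str) -> str:
    """Join the 10-base groups of one listing line, dropping the trailing position counter."""
    tokens = line.split()
    if tokens and tokens[-1].isdigit():
        tokens = tokens[:-1]
    return "".join(tokens).lower()


def translate(dna: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last nucleotide lines of SEQ ID NO: 6, copied from the listing above.
# Both lines start in frame because 0 and 540 are multiples of 3.
FIRST_LINE = "atggcagata acccttttca gctccctcct ggttttcgat ttttcccgac agatgaagag 60"
LAST_LINE = "gacgaggtgt tcttatcttt ggatgatctt gatgaaataa gtttgccaaa ctaa 594"

first_peptide = translate(parse_listing_line(FIRST_LINE))
last_peptide = translate(parse_listing_line(LAST_LINE))

print(first_peptide)          # compare with residues 1-20 of SEQ ID NO: 7
print(last_peptide)           # compare with the final residues of SEQ ID NO: 7
print(594 == 3 * (197 + 1))   # coding length versus protein length plus stop
```

Under these assumptions the first line translates to a peptide beginning Met Ala Asp Asn Pro Phe, and the last line ends in Ile Ser Leu Pro Asn followed by a stop codon, which agrees with the protein listing.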
Claims (9)
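1. A promoter, characterized in that the nucleotide sequence of the promoter consists of the sequence shown at positions 1672-1998 of SEQ ID NO: 1.
2. The promoter of claim 1, characterized in that the nucleotide sequence of the promoter is any one of the following:
1) the sequence shown at positions 1672-1998 of SEQ ID NO: 1;
2) the sequence shown at positions 1304-1998 of SEQ ID NO: 1;
3) the sequence shown at positions 788-1998 of SEQ ID NO: 1;
4) the sequence shown at positions 349-1998 of SEQ ID NO: 1;
5) the sequence shown in SEQ ID NO: 1;
6) the sequence shown in SEQ ID NO: 5.
3. A gene expression cassette, characterized in that the expression cassette contains the promoter of any one of claim 1 or claim 2.
4. An expression vector, characterized in that the expression vector contains the expression cassette of claim 3.
5. A host cell, characterized in that the host cell contains the expression vector of claim 4.
6. The host cell of claim 5, characterized in that the host cell is a prokaryotic cell.
7. The host cell of claim 6, characterized in that the host cell is an Escherichia coli or Agrobacterium cell.
8. Use of the promoter of claim 1 and claim 2, the expression vector of claim 4, or the host cell of claims 5-7 in regulating prickle development.
9. Use of the gene expression cassette of claim 3 in regulating prickle development.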
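The position ranges recited in claims 1 and 2 can be read as 1-based, inclusive coordinates on SEQ ID NO: 1, all sharing the same end position. The sketch below shows how such ranges map onto string slices and what fragment lengths they imply. It assumes the inclusive reading, and it uses a random placeholder of 1998 bases because SEQ ID NO: 1 itself is not reproduced in this part of the document. The helper `fragment` is hypothetical.

```python
import random

# 1-based inclusive coordinates copied from claims 1 and 2 (all refer to SEQ ID NO: 1).
CLAIMED_RANGES = [(1672, 1998), (1304, 1998), (788, 1998), (349, 1998)]


def fragment(seq: str, start: int, end: int) -> str:
    """Return positions start..end of seq, counted from 1 and including both ends."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"range {start}-{end} is outside a sequence of length {len(seq)}")
    return seq[start - 1:end]


# Placeholder only: random bases standing in for SEQ ID NO: 1. The length of 1998
# is an assumption taken from the shared end coordinate of the claimed ranges.
rng = random.Random(0)
placeholder = "".join(rng.choice("acgt") for _ in range(1998))

fragments = [fragment(placeholder, start, end) for start, end in CLAIMED_RANGES]
lengths = [len(f) for f in fragments]

# Because every range ends at position 1998, each shorter fragment is the
# 3' end of every longer one.
nested = all(longer.endswith(shorter) for shorter, longer in zip(fragments, fragments[1:]))

print(lengths)
print(nested)
```

With the inclusive reading, the four ranges correspond to fragments of 327, 695, 1211 and 1650 bases, each containing the shorter ones at its 3' end.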
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110213440.5A CN114990112B (zh) | 2021-02-26 | 2021-02-26 | A prickle-specific promoter |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110213440.5A CN114990112B (zh) | 2021-02-26 | 2021-02-26 | A prickle-specific promoter |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114990112A (zh) | 2022-09-02 |
CN114990112B (zh) | 2024-03-12 |
Family
ID=83018405
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110213440.5A Active CN114990112B (zh) | A prickle-specific promoter | 2021-02-26 | 2021-02-26 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114990112B (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116790599A (zh) * | 2023-07-21 | 2023-09-22 | Huazhong Agricultural University | Rosa U6 promoter and use thereof |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
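CN1798843A (zh) * | 2003-04-04 | 2006-07-05 | Pioneer Hi-Bred International, Inc. | Modulation of cytokinin activity in plants |
CN101565702A (zh) * | 2008-04-25 | 2009-10-28 | Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences | GIF1 promoter and use thereof |
CN112011566A (zh) * | 2009-08-31 | 2020-12-01 | BASF Plant Science Company GmbH | Regulatory nucleic acid molecules for enhancing seed-specific gene expression in plants to promote enhanced polyunsaturated fatty acid synthesis |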
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
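CN1798843A (zh) * | 2003-04-04 | 2006-07-05 | Pioneer Hi-Bred International, Inc. | Modulation of cytokinin activity in plants |
CN101220357A (zh) * | 2003-04-04 | 2008-07-16 | Pioneer Hi-Bred International, Inc. | Modulation of cytokinin activity in plants |
CN101565702A (zh) * | 2008-04-25 | 2009-10-28 | Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences | GIF1 promoter and use thereof |
CN112011566A (zh) * | 2009-08-31 | 2020-12-01 | BASF Plant Science Company GmbH | Regulatory nucleic acid molecules for enhancing seed-specific gene expression in plants to promote enhanced polyunsaturated fatty acid synthesis |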
Non-Patent Citations (1)
Title |
---|
YU ZHANG et al.: "Nonglandular prickle formation is associated with development and secondary metabolism-related genes in Rosa multiflora", PHYSIOLOGIA PLANTARUM, 31 December 2021 (2021-12-31), pages 1147 - 1162 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116790599A (zh) * | 2023-07-21 | 2023-09-22 | Huazhong Agricultural University | Rosa U6 promoter and use thereof |
CN116790599B (zh) * | 2023-07-21 | 2024-02-27 | Huazhong Agricultural University | Rosa U6 promoter and use thereof |
Also Published As
Publication number | Publication date |
---|---|
CN114990112B (zh) | 2024-03-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
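CN107164347B (zh) | Ideal plant architecture gene NPT1 controlling rice stem thickness, tiller number, grains per panicle, thousand-grain weight and yield, and use thereof | |
CN113881652B (zh) | Novel Cas enzymes and systems, and uses thereof | |
KR20220076467A (ko) | Novel nucleobase editors and methods of using same | |
KR20210139265A (ko) | Adenosine deaminase base editors for modifying a nucleobase in a target sequence and methods of using same | |
KR20210041008A (ko) | Multi-effector nucleobase editors for modifying a nucleic acid target sequence and methods of using same | |
KR20210023833A (ko) | Methods of editing single nucleotide polymorphisms using programmable base editor systems | |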
US6733965B2 (en) | Microsatellite DNA markers and uses thereof | |
JPH1094392A (ja) | Cotton gene | |
KR20220010540A (ko) | Methods of editing single nucleotide polymorphisms using programmable base editor systems | |
AU2020100575A4 (en) | APPLICATION OF GhMAH1 PROTEIN AND CODING GENE IN REGULATING COTTON FIBER LENGTH | |
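CN113337502B (zh) | A gRNA and use thereof | |
WO2023202116A1 (zh) | Cas enzymes and systems, and uses thereof | |
KR20220066289A (ko) | Compositions and methods for editing a mutation to permit transcription or expression | |
CN109819659A (zh) | Constructs and methods for improving abiotic stress tolerance in plants | |
KR102254956B1 (ko) | Molecular marker capable of identifying onions with white bulb skin color and use thereof | |
CN109206496B (zh) | Use of protein GhFLS1 in regulating plant heat tolerance | |
JP2007020466A (ja) | Method for identifying cannabis, Cannabis sativa L. | |
CN114990112B (zh) | A prickle-specific promoter | |
KR102154701B1 (ko) | Novel genetic marker for selecting dwarf watermelon individuals and use thereof | |
CN110408646B (zh) | A selection vector for plant genetic transformation and use thereof | |
CN115261363B (zh) | Method for measuring RNA deaminase activity of APOBEC3A, and APOBEC3A variants with high RNA activity | |
CN110835630B (zh) | A highly efficient sgRNA and use thereof in gene editing | |
KR20230124553A (ko) | Compositions and methods for treating glycogen storage disease type 1a | |
CN110923263B (zh) | Rice β-amylase BA1, its encoding gene, and use thereof | |
CN114245823A (zh) | Abiotic stress tolerant plants and methods |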
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||