CN116904482A - Kinesin gene affecting upland cotton fiber properties and SNP marker thereof - Google Patents
Kinesin gene affecting upland cotton fiber properties and SNP marker thereof Download PDFInfo
- Publication number
- CN116904482A CN116904482A CN202311060948.1A CN202311060948A CN116904482A CN 116904482 A CN116904482 A CN 116904482A CN 202311060948 A CN202311060948 A CN 202311060948A CN 116904482 A CN116904482 A CN 116904482A
- Authority
- CN
- China
- Prior art keywords
- cotton
- kinesin
- gene
- fiber
- snp marker
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 229920000742 Cotton Polymers 0.000 title claims abstract description 40
- 244000299507 Gossypium hirsutum Species 0.000 title claims abstract description 33
- 235000009429 Gossypium barbadense Nutrition 0.000 title claims abstract description 29
- 235000018322 upland cotton Nutrition 0.000 title claims abstract description 29
- 239000003550 marker Substances 0.000 title claims description 15
- 108010063296 Kinesin Proteins 0.000 title abstract description 54
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 41
- 239000000835 fiber Substances 0.000 claims abstract description 29
- 241000219146 Gossypium Species 0.000 claims description 19
- 230000009418 agronomic effect Effects 0.000 claims description 7
- 239000000463 material Substances 0.000 claims description 7
- 238000000034 method Methods 0.000 claims description 7
- 238000001514 detection method Methods 0.000 claims description 2
- 238000011161 development Methods 0.000 abstract description 9
- 238000012098 association analyses Methods 0.000 abstract description 6
- 101001008952 Dictyostelium discoideum Kinesin-related protein 11 Proteins 0.000 abstract description 4
- 101001139068 Dictyostelium discoideum Kinesin-related protein 4 Proteins 0.000 abstract description 4
- 101000914247 Homo sapiens Centromere-associated protein E Proteins 0.000 abstract description 4
- 230000002068 genetic effect Effects 0.000 abstract description 4
- 102100025832 Centromere-associated protein E Human genes 0.000 abstract description 3
- 230000009286 beneficial effect Effects 0.000 abstract description 3
- 238000013507 mapping Methods 0.000 abstract description 3
- 102000010638 Kinesin Human genes 0.000 description 16
- 230000007614 genetic variation Effects 0.000 description 12
- 101001006793 Dictyostelium discoideum Kinesin-related protein 6 Proteins 0.000 description 8
- 235000018102 proteins Nutrition 0.000 description 7
- 102000004169 proteins and genes Human genes 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 6
- 230000018109 developmental process Effects 0.000 description 6
- 241000209094 Oryza Species 0.000 description 5
- 210000004027 cell Anatomy 0.000 description 5
- 241000196324 Embryophyta Species 0.000 description 4
- 235000007164 Oryza sativa Nutrition 0.000 description 4
- 238000000513 principal component analysis Methods 0.000 description 4
- 235000009566 rice Nutrition 0.000 description 4
- ZKHQWZAMYRWXGA-UHFFFAOYSA-N Adenosine triphosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)C(O)C1O ZKHQWZAMYRWXGA-UHFFFAOYSA-N 0.000 description 3
- 102000029749 Microtubule Human genes 0.000 description 3
- 108091022875 Microtubule Proteins 0.000 description 3
- 235000001014 amino acid Nutrition 0.000 description 3
- 150000001413 amino acids Chemical class 0.000 description 3
- 229920002678 cellulose Polymers 0.000 description 3
- 239000001913 cellulose Substances 0.000 description 3
- 238000010219 correlation analysis Methods 0.000 description 3
- 210000004688 microtubule Anatomy 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- ZKHQWZAMYRWXGA-KQYNXXCUSA-J ATP(4-) Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KQYNXXCUSA-J 0.000 description 2
- 241000219194 Arabidopsis Species 0.000 description 2
- 241000219195 Arabidopsis thaliana Species 0.000 description 2
- 235000009432 Gossypium hirsutum Nutrition 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 230000014509 gene expression Effects 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- 230000011278 mitosis Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 108010085238 Actins Proteins 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 101001050566 Dictyostelium discoideum Kinesin-related protein 2 Proteins 0.000 description 1
- 101001139158 Dictyostelium discoideum Kinesin-related protein 3 Proteins 0.000 description 1
- 101001006792 Dictyostelium discoideum Kinesin-related protein 5 Proteins 0.000 description 1
- 101001006786 Dictyostelium discoideum Kinesin-related protein 7 Proteins 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 241000195887 Physcomitrella patens Species 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 240000003768 Solanum lycopersicum Species 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 230000032823 cell division Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 210000004081 cilia Anatomy 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 239000012297 crystallization seed Substances 0.000 description 1
- 230000003436 cytoskeletal effect Effects 0.000 description 1
- 230000008021 deposition Effects 0.000 description 1
- 210000002615 epidermis Anatomy 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 210000002288 golgi apparatus Anatomy 0.000 description 1
- 230000003301 hydrolyzing effect Effects 0.000 description 1
- 210000005061 intracellular organelle Anatomy 0.000 description 1
- 230000033001 locomotion Effects 0.000 description 1
- 235000009973 maize Nutrition 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 230000021121 meiosis Effects 0.000 description 1
- 210000001724 microfibril Anatomy 0.000 description 1
- 210000003632 microfilament Anatomy 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 230000008121 plant development Effects 0.000 description 1
- 230000008635 plant growth Effects 0.000 description 1
- 230000004260 plant-type cell wall biogenesis Effects 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000006574 secondary cell wall biogenesis Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000012256 transgenic experiment Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6888—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
- C12Q1/6895—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for plants, fungi or algae
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/13—Plant traits
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/156—Polymorphic or mutational markers
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Health & Medical Sciences (AREA)
- Analytical Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Biophysics (AREA)
- Botany (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- Genetics & Genomics (AREA)
- Mycology (AREA)
- Physics & Mathematics (AREA)
- Immunology (AREA)
- Microbiology (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The application combines upland cotton (G.hirsutum L.) association mapping to identify Kinesin genes which play an important role in cotton fiber and cotton boll development, and association analysis identifies that one of Kinesin-7 subfamily genes (Ghir_D04G 017880) is obviously related to fiber strength. The Kinesin gene family plays an important role in cotton fiber and cotton boll development, and the identification of the Kinesin gene with functions and related molecular markers is beneficial to the genetic improvement of fiber quality and yield.
Description
Technical Field
The application belongs to the technical field of molecular biology, and particularly relates to a key Kinesin gene affecting upland cotton fiber properties and SNP markers thereof.
Background
Because of the high cellulose content in cotton, it is used as the most important fiber crop planting in the world. Meanwhile, cotton fibers belong to single cells, and are an excellent model for researching cellulose formation. There is a reported class of molecular motile proteins that play a critical role in the directional deposition of cellulose microfibrils and the mechanical strength of secondary walls. This motor protein is designated as Kinesin, which is responsible for intracellular organelle movement, signal transduction, mitosis and meiosis, and cytoskeletal construction. Kinesin kinesins convert chemical energy into mechanical energy by hydrolyzing adenosine triphosphate (adenosine triphosphate, ATP) as the driving force. Typically such kinesins can specifically bind to microtubules and move on linear microtubule rails, thereby completing specific physiological activities; in addition, kinesins have highly conserved kinesin domains that contain a globular domain of 360 residues, known as the catalytic core, and thus genes encoding such proteins are known as the kinesin gene family. The Kinesin gene family can be divided into 14 subfamilies, i.e., kinesin-1 to Kinesin-14 subfamilies, depending on the specific Kinesin domain.
Extensive studies of the Kinesin gene family have been conducted in animals, where the Kinesin-13 subfamily has been demonstrated to play a key role in regulating mitosis, cilia assembly and depolymerization. In plants, moscateli et al (1988) first identified Kinesin motor proteins in tobacco pollen tubes, and subsequently identified that the Kinesin gene functions significantly in maize, arabidopsis, tomato, etc. plants. Of the Kinesin superfamily, the Kinesin-13 subfamily has been shown to be involved in plant growth and development, and in moss (Physcomitrella Patens), kineins-13 and kineins-8 have been shown to affect cell division and cell growth (Leong et al, 2020). In addition, the Kinesin-13 subfamily in Arabidopsis thaliana and rice (Oryza spp.) has also been studied, and researchers found Kinesin-13 protein in the Golgi apparatus of Arabidopsis thaliana, and it was shown by T-DNA mediated gene knockout experiments that Kinesin-13 affects the morphology of leaf epidermis trichomes; furthermore, knock-out and over-expression experiments confirm that Kinesin-13 plays a key role in secondary cell wall synthesis. Meanwhile, in rice, kinesin-13 genes for regulating the grain length of the rice are identified through map cloning, and Wu and the like verify that the Kinesin-13 genes affect the seed length and plant height of the rice through map cloning and transgenic experiments.
Preuss et al found that the proteins associated with the Kinesin driver gene family were involved in the development of cotton fibroblasts. Furthermore, the kinesin gene GHKCH2 was observed to interact with microtubules and microfilaments in developing cotton fibers. Li et al report that although the GHKIS13A1 gene is at 6Y
Relatively low expression levels were maintained in cotton fibers, but overexpression of GHKIS13A1 in arabidopsis resulted in branching of the epidermal hair.
Cotton fibers are an ideal model for studying plant cell elongation and cell wall synthesis. Thus, it is necessary to study the Kinesin gene family throughout the genome and to identify key Kinesin genes involved in the cotton development process. In recent years, a large number of Quantitative Trait Loci (QTLs) have been mined based on high quality cotton reference genome and whole genome association analysis (GWAS). Thus, binding gene families and association analysis are viable strategies for mining functional genes of a gene family of interest.
Disclosure of Invention
Aiming at the bottleneck problems of improving the quality and the yield of the fiber faced by the prior upland cotton, the application aims to: the upland cotton whole genome Kinesin gene is identified by using upland cotton reference genome, upland cotton population phenotype data and resequencing data, and important Kinesin genes which can influence the upland cotton fiber phenotype are mined so as to be applied to genetic improvement of the quality and yield of the upland cotton fiber in the future.
The application aims at providing a Gh_Kinesin7 gene affecting the fiber properties of upland cotton, and the sequence of the gene is shown as SEQ ID NO. 1.
Further, the full-length sequence of the Gh_Kinesin7 gene is shown as SEQ ID NO. 2.
Further, the genes can be used to improve cotton agronomic traits.
Further, the cotton is upland cotton and the agronomic trait is a fiber trait.
The second object of the application is to provide a method for improving the agronomic characters of cotton, which comprises the step of highly expressing the Gh_Kinesin7 gene in cotton varieties, wherein the Gh_Kinesin7 gene sequence is shown as SEQ ID NO. 1.
Further, the cotton variety is upland cotton and the agronomic trait is a fiber trait.
The application further aims to provide a SNP marker related to cotton fiber properties, wherein the polymorphism of the SNP marker is T/C, and the SNP marker is positioned at the 5869bp position of the sequence shown in SEQ ID NO. 2.
Further, the individual fiber properties of genotype TT are significantly stronger than the CC genotype.
The fourth object of the present application is to provide a primer set for detecting the SNP marker, wherein the primer set is as follows: gh_Kin7 pre-primer as shown in SEQ ID NO. 3: GTGCTTGCTTCTGATCCGTC; gh_Kin7 post primer as shown in SEQ ID NO. 4: CTCTCTGCCCCCACCTGAAA.
The fifth object of the present application is to provide a kit comprising the above-mentioned detection primer pair.
The application aims at providing a method for identifying the fiber properties of cotton materials, which comprises the step of detecting the cotton materials by adopting the primer pair and/or the kit, wherein the fiber properties are good when the SNP marker genotype is TT.
The beneficial effects of the application are as follows: the application combines the correlation mapping of upland cotton (G.hirsutum L.) to identify Kinesin genes with important roles in cotton fiber and cotton boll development, and the correlation analysis identifies that one of the Kinesin-7 subfamily genes (Ghir_D04G 017880) is obviously related to fiber strength, the Kinesin gene family has important roles in cotton fiber and cotton boll development, and the identification of the Kinesin-7 genes with functions and related molecular markers based on the genes are beneficial to genetic improvement of fiber quality and yield.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings that are needed in the embodiments will be briefly described below, it being understood that the following drawings only illustrate some embodiments of the present application and therefore should not be considered as limiting the scope, and other related drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a whole genome identification of the Kinesin Gene family in example 1
FIG. 2 is a genome-wide SNPs and population structure analysis of natural populations of upland cotton of 205 materials in example 2
FIG. 3 is 3 fiber traits of the natural population of upland cotton in 8 field test cells in example 2
FIG. 4 is a correlation analysis of upland cotton fiber strength and candidate gene analysis in example 3.
Detailed Description
The technical scheme of the present application will be clearly and completely described in the following in connection with specific embodiments. The following examples are preferred embodiments of the present application, but are not intended to limit the scope of the present application in any way. The substitution of simple parameters in the embodiments of the present application cannot be described in detail in the examples, but the present application is not limited thereto, and any other modifications, substitutions, combinations, and simplifications made without departing from the theoretical spirit and principles of the present application should be considered as equivalent substitution modes, and are included in the scope of the present application.
Unless specifically stated otherwise, the reagents, methods and apparatus employed in the present application are those conventional in the art. Reagents and materials used in the following examples are commercially available unless otherwise specified.
EXAMPLE 1 identification of upland cotton Whole genome Kinesin Gene
A hidden Markov model of the Kinesin protein conserved domain was obtained from the Pfam database (http:// Pfam. Xfam. Org /) (Pfam Number: PF 00225). Upland cotton reference genomes and annotation files were obtained from the cottonGen website (https:// www.cottongen.org /). The Kinesin gene (P < 1E-20) containing a conserved Kinesin protein domain was identified by HMMSearch software (Wheeler and Eddy, 2013), and 170 Kinesin members were identified in upland cotton by looking up the Kinesin gene family conserved domain in the upland cotton genome. And further analyzing the protein sequence of the Kinesin genes by an online tool CDD (http:// www.ncbi.nlm.nih.gov/CDD /), and confirming the existence of the conserved domain of the Kinesin protein genes. In addition, 161 Kinesin genes containing conserved domains were identified in NCBI by conserved domain search screening; two kinesin genes were filtered out, and the evolutionary tree could not be constructed because there was a large genetic difference between the two genes. Thus, 159 Kinesin genes were obtained in upland cotton. Carrying out multi-sequence comparison on the protein sequence of the Kinesin gene by using Mega-X software to obtain a phylogenetic relationship; the phylogenetic tree was then drawn using on-line Itol software (https:// Itol. Embl. De /) (fig. 1).
Example 2 acquisition of upland cotton Natural population genotype and phenotype data
In order to locate the Kinesin functional gene associated with gossypium hirsutum fiber development, candidate gene association analysis was performed on the 3 fiber traits and whole genome genetic variation of the natural population of gossypium hirsutum. The upland cotton natural population comprises 205 materials, and early studies were planted in 4 cotton ecological areas in 2012 and 2013 respectively, so that phenotype data were collected in 8 test cells, and the 8 test points were named: BJ2012 (north region 2012), BJ2013 (north region 2013), NJ2012 (south region 2012), NJ2013 (south region 2013), CJ2012 (Yangtze river basin 2012), CJ2013 (Yangtze river basin 2013), HH2012 (yellow river basin 2012) and HH2013 (yellow river basin 2013). And carrying out association analysis and identification on the genome-wide Kinesin gene by utilizing 3 upland cotton fiber traits, wherein the three fiber traits are respectively Fiber Length (FL), fiber Strength (FS) and coat-Length (LW). Natural population Fiber Length (FL) mean performs best in CJ2012 cells; the Xinjiang area generally exhibits better FL and FS quality than the Yangtze river basin and the yellow river basin; the coefficient of variation of LW is higher than FL and FS (FIG. 2) compared to FL and FS.
The natural population was resequenced approximately 5-fold deep using whole genome resequencing methods at the early stage of the study. Sequencing data was aligned with the TM-1 reference genome using BWA0.7.10 software. Identification of SNP genetic variation in BAM files by GATK 3.1.1 software, genetic variation was filtered by VCFTools software, and 3,203,378 genetic variations were finally identified in total from the resequencing of natural populations of upland cotton (fig. 3 a). The subspecies of the natural population were estimated using Admix 1.3.0, and the principal component analysis of the natural population was calculated using TASSEL 5.0 and high quality SNP. Analysis of the population structure showed that the population could be divided into 4 subspecies (fig. 3 b), with Principal Component Analysis (PCA) substantially consistent with this result (fig. 3 c).
Example 3 analysis of the association of Kinesin Gene with phenotype, the association of a phenotypic trait with Whole genome Kinesin Gene
To study the relationship between the Kinesin gene and fiber development, we performed candidate gene association mapping between actin genes and 3 fiber traits. Genetic variation of all Kinesin genes was extracted using TASSEL 5.0, and 1318 genetic variations were extracted from the whole genome genetic variation of 159 Kinesin genes of upland cotton. Candidate gene association localization was performed on 3 fiber traits and SNPs in 159 Kinesin genes using the MLM (PCA+K) model of TASSEL 5.0 software, and the threshold was set to-log (1/N, N is the number of markers).
Correlation analysis results show that 19 Kinesin genes are obviously related to FL, and the phenotype variation interpretation (PVE) range of the related markers is 5.981% -15.524%; 8 Kinesin genes are obviously related to FS, and PVEs of related markers range from 5.836% to 13.352%; the 3 Kinesin genes were significantly associated with LW, with correlation-tagged PVEs ranging from 6.149% to 16.195% (as shown in table 1).
TABLE 1 analysis of candidate Gene correlation between genetic variation and Whole genome Kinesin Gene
A gene of the Kinesin-7 subfamily (Ghir_D04G 017880) was identified by years of multi-point candidate gene association analysis as being significantly associated with FS, which gene was located between 53421041-53430230bp of the D04 chromosome and was designated Gh_Kinesin7 (Table 1 and FIG. 4 a). Gh_Kinesin7 full length 9190 bp) (shown as SEQ ID NO. 2) comprises 27 exons and 26 introns, the full length of the coding sequence is 3000bp, and 999 amino acids are coded. The conserved domain analysis shows that the Kinesin motor domain of Gh_Kinesin7 is located between 103 and 420 amino acids. 15 genetic variations were identified in total for gh_kinesin7, with 11 genetic variations for gh_kinesin7 being a significant marker for candidate gene association localization and 5 genetic variations in the Kinesin motor domain. Two genetic variations resulted in mutations in the amino acids of the Kinesin motor domain, with the SNP of D04:53423182 inducing aspartic acid to serine and the SNP of D04:53424009 inducing alanine to threonine (FIG. 4 e).
A significantly associated SNP marker (SD 04: 53426909) was located at the 5869bp position of the gh_kinesin7 gene, which could explain the 13.35% variation in fiber strength in NJ2012 (table 4 a), the FL average (average=29.50 mm) for the TT genotype group was significantly higher than the CC genotype group (average=26.91 mm) (P value= 0.00848, fig. 4 b), and only homozygous genotypes were present in the natural population. The following Gh_Kin7 primer tags were designed for this SNP variation:
Gh_Kin7 pre-primer (5 '. Fwdarw.3'): GTGCTTGCTTCTGATCCGTC (SEQ ID NO: 3);
Gh_Kin7 post primer (5 '. Fwdarw.3'): CTCTCTGCCCCCACCTGAAA (SEQ ID NO: 4).
Gh_Kinesin7 was preferentially expressed in 10-DPA fibers, and RT-PCR confirmed the expression of Gh_Kinesin7 in cotton varieties (FIGS. 4 c-d).
All documents mentioned in this disclosure are incorporated by reference in this disclosure as if each were individually incorporated by reference. Further, it will be appreciated that various changes and modifications may be made by those skilled in the art after reading the above teachings, and such equivalents are intended to fall within the scope of the application as defined in the appended claims.
Claims (10)
1. The Gh_Kinesin7 gene affecting the fiber property of upland cotton is characterized in that the sequence of the gene is shown as SEQ ID NO. 1.
2. The gene according to claim 1, wherein the full length sequence of the Gh_Kinesin7 gene is shown in SEQ ID NO. 2.
3. Use of a gene according to any one of claims 1-2 for improving agronomic traits in cotton.
4. The use according to claim 3, wherein the cotton is upland cotton and the agronomic trait is a fiber trait.
5. A method for improving the agronomic characters of cotton is characterized by comprising the step of high expressing Gh_Kinesin7 gene in cotton varieties, wherein the Gh_Kinesin7 gene sequence is shown as SEQ ID NO. 1.
6. A SNP marker related to cotton fiber characteristics is characterized in that polymorphism of the SNP marker is T/C, and the SNP marker is positioned at a 5869bp position of a sequence shown in SEQ ID NO. 2.
7. The SNP marker according to claim 6, wherein the individual fiber property of genotype TT is significantly stronger than that of CC genotype.
8. A primer pair for detecting the SNP marker as set forth in claim 6, characterized in that the primer pair is: gh_Kin7 pre-primer as shown in SEQ ID NO. 3: GTGCTTGCTTCTGATCCGTC; gh_Kin7 post primer as shown in SEQ ID NO. 4: CTCTCTGCCCCCACCTGAAA.
9. A kit comprising the detection primer pair of claim 7.
10. A method for identifying the fiber properties of cotton materials, comprising detecting cotton materials using the primer set of claim 7 and/or the kit of claim 8, wherein the fiber properties are indicated to be good when the SNP marker genotype is TT.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311060948.1A CN116904482A (en) | 2023-08-22 | 2023-08-22 | Kinesin gene affecting upland cotton fiber properties and SNP marker thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311060948.1A CN116904482A (en) | 2023-08-22 | 2023-08-22 | Kinesin gene affecting upland cotton fiber properties and SNP marker thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116904482A true CN116904482A (en) | 2023-10-20 |
Family
ID=88358427
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311060948.1A Pending CN116904482A (en) | 2023-08-22 | 2023-08-22 | Kinesin gene affecting upland cotton fiber properties and SNP marker thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116904482A (en) |
-
2023
- 2023-08-22 CN CN202311060948.1A patent/CN116904482A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP7381133B2 (en) | Method for designing RNA-binding proteins using PPR motif and its use | |
Ma et al. | High-quality genome assembly and resequencing of modern cotton cultivars provide resources for crop improvement | |
Zhou et al. | Characterization of a large sex determination region in Salix purpurea L.(Salicaceae) | |
Iorizzo et al. | De novo assembly and characterization of the carrot transcriptome reveals novel genes, new markers, and genetic diversity | |
Xia et al. | Development of high-density SNP markers and their application in evaluating genetic diversity and population structure in Elaeis guineensis | |
Yang et al. | Target SSR-Seq: a novel SSR genotyping technology associate with perfect SSRs in genetic analysis of cucumber varieties | |
JP2002510506A (en) | Method for obtaining plant having genetic disorder in gene sequence | |
Yan et al. | Simultaneous identification of multiple causal mutations in rice | |
Tobias et al. | Analysis of expressed sequence tags and the identification of associated short tandem repeats in switchgrass | |
Kroc et al. | Development and validation of a gene-targeted dCAPS marker for marker-assisted selection of low-alkaloid content in seeds of narrow-leafed lupin (Lupinus angustifolius L.) | |
Anisimova et al. | Nucleotide sequence polymorphism in the RFL-PPR genes of potato | |
CN116334248A (en) | Liquid chip for local chicken genetic resource protection and variety identification and application thereof | |
CN106636083B (en) | Corn single-plant spike weight major QTL, and obtaining method and application thereof | |
CN116904482A (en) | Kinesin gene affecting upland cotton fiber properties and SNP marker thereof | |
CN116286849A (en) | Gene AhWRI1 for regulating peanut oil content and application thereof | |
CN111876433B (en) | Wheat staged color-changing mutant gene with both marker recognition and ornamental values and application thereof | |
CN109762924B (en) | Molecular marker for salt tolerance in jute and application thereof | |
Grigoreva et al. | Development of SNP Set for the Marker-Assisted Selection of Guar (Cyamopsis tetragonoloba (L.) Taub.) Based on a Custom Reference Genome Assembly | |
Grigoreva et al. | Development of SNP Set for the Marker-Assisted Selection of Guar (Cyamopsis tetragonoloba (L.) Taub.) Based on a Custom Reference Genome Assembly. Plants 2021, 10, 2063 | |
US20230193290A1 (en) | Sec12-like protein gene cpu1 and application thereof in improving soybean phosphorus efficiency | |
KR102564133B1 (en) | Genetic marker for selection of watermelon without lateral branch, tendril and ligule trait and use thereof | |
Song et al. | Combined genome and transcriptome analysis of elite fiber quality in Gossypium barbadense | |
Jang et al. | Genetic analysis and identification of molecular markers associated with lobed leaf shape in watermelons using next generation sequencing | |
Villalba-Bermell et al. | Identification, characterization and transcriptional analysis of long non-coding RNAs in Cucurbits | |
Zhu et al. | Genome-wide identification of the key kinesin genes during fiber and boll development in upland cotton (Gossypium hirsutum L.) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |