CN111223520A - Whole genome selection model for predicting nicotine content in tobacco and application thereof - Google Patents

Whole genome selection model for predicting nicotine content in tobacco and application thereof Download PDF

Info

Publication number
CN111223520A
CN111223520A CN201911141188.0A CN201911141188A CN111223520A CN 111223520 A CN111223520 A CN 111223520A CN 201911141188 A CN201911141188 A CN 201911141188A CN 111223520 A CN111223520 A CN 111223520A
Authority
CN
China
Prior art keywords
tobacco
nicotine content
model
predicting
population
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201911141188.0A
Other languages
Chinese (zh)
Other versions
CN111223520B (en
Inventor
童治军
肖炳光
方敦煌
陈学军
姚恒
焦芳婵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yunnan Academy of Tobacco Agricultural Sciences
Original Assignee
Yunnan Academy of Tobacco Agricultural Sciences
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yunnan Academy of Tobacco Agricultural Sciences filed Critical Yunnan Academy of Tobacco Agricultural Sciences
Priority to CN201911141188.0A priority Critical patent/CN111223520B/en
Publication of CN111223520A publication Critical patent/CN111223520A/en
Application granted granted Critical
Publication of CN111223520B publication Critical patent/CN111223520B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B5/00ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Medical Informatics (AREA)
  • Biophysics (AREA)
  • Theoretical Computer Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Biotechnology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Molecular Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioethics (AREA)
  • Public Health (AREA)
  • Evolutionary Computation (AREA)
  • Epidemiology (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Chemical & Material Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Artificial Intelligence (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Physiology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention discloses a whole genome selection model for predicting nicotine content in tobacco and application thereof, wherein the whole genome selection model for predicting nicotine content in tobacco is Bayes BNICIn order to optimize the prediction accuracy of the model on the nicotine content phenotype value in the tobacco, core parameter values of a candidate prediction model Bayes B, such as the number of molecular markers (n 1), the scale of a training population (n 2), the ratio of the training population to a test population (n 3), the model prediction accuracy value (n 4), and the like, are clearly specified. The application is to use the whole genome to select a model Bayes BNICUse of genotypic data from a population of tobacco to predict its nicotine content. The invention discloses a tobacco nicotine content whole genome selection model Bayes BNICCan be based on tobacco populationThe nicotine content value of each plant in the group is accurately predicted according to the genotype, so that excellent tobacco varieties (lines) with different nicotine content levels in tobacco quality breeding are cultured.

Description

Whole genome selection model for predicting nicotine content in tobacco and application thereof
Technical Field
The invention belongs to the technical field of biology, and particularly relates to a whole genome selection model for predicting nicotine content of tobacco and application thereof.
Background
The nicotine content in tobacco is not only a specific substance of the genus of tobacco, but also one of the key factors for measuring the smoking quality of finished cigarettes. Research shows that nicotine exists in the whole growth period process of tobacco and is distributed in the whole tobacco, and the nicotine content in the cured tobacco directly influences the smoking quality of the final finished product cigarette, and the tobacco belongs to quantitative characters controlled by environment, small effect and numerous gene loci. In the offspring group generated by the hybridization among the tobacco varieties with different nicotine contents or the natural group composed of different tobacco varieties, the nicotine content value in each tobacco plant presents standard normal distribution conforming to quantitative characters (Shaoguang, Luxinu, Jojoba, Liyonping, Sunyuyiyou, Guogui, QTL preliminary analysis of several chemical components of flue-cured tobacco, journal of crops, 2008,34 (10): 1762-. Based on a specific tobacco population (i.e., a population derived from two fixed parent varieties) and by combining genotype data obtained by a small number of molecular markers and phenotypic data of tobacco nicotine content obtained from multiple points over the years, researchers tried to perform preliminary location analysis on the genetic loci controlling the tobacco nicotine content traits and obtained a small number of QTL loci (Children's army, Chen's school) controlling the tobacco nicotine content traitsArmy, the magnificent of the Square Dundon, Zengjian Ming, Wuxing Fu, Shouguang, 231 flue-cured tobacco germplasm resource SSR marker genetic diversity and correlation analysis of the SSR marker genetic diversity with agronomic characters and chemical components, Chinese tobacco science and report, 2017,207; julio E., Denoyes Rothan B., Verrier J.L., Dorlhac de Borne F., Detection of QTLs linked to leaf and mouse properties in Nicotiana atabacum base on a study of 114 recombinantly organized lines, Mol Breeding, 2006,10: 1012-. However, the process of obtaining the tobacco nicotine content data is very long and difficult, and the tobacco nicotine content phenotype data can be obtained after complex and tedious procedures such as harvesting, baking, primary selection, grading, sample preparation, detection and the like are usually required after a long field growth period. The whole process at least takes more than 2 years, and each process is very easily influenced by factors such as environment, technology, instruments, human factors and the like, so that the accuracy of the finally obtained nicotine content phenotype value is low. In addition, there have been a few studies to clone and verify key genes in the nicotine synthesis and metabolic pathways of tobacco, and attempts to apply the genes to new varieties (lines) of tobacco with different nicotine content levels (Li d., Lewis r. s., Jack a. m., Dewey r. e., Bowen s. w., Miller r. d.,CYP82E5v2andCYP82E10gene variants reducing nicotine to nornicot conversion in tobacaco, MolBreding, 2011, 9: 853. ang. 863. Xu S. Q., Brockmoller T., Navarro-Quezada A., KuhlH., Gase K., Ling Z. H., Zhou W., Kreitzer C., Stanke M., Tang H. B., Lyons E., Paney P., Pandey S. P., Timmermann B., Gaquerel E., and Baldwin I.T., Wild tobaco genes derived the evaluation of microbial biosynthesis. So far, genetic analysis research on molecular level, especially on genetic loci/QTL for controlling nicotine content traits of tobacco is still relatively few, and accuracy of results is low, so that breeding work of high-quality new varieties aiming at the nicotine content of tobacco by using biotechnology means such as molecular marker-assisted selection is restricted.
In view of this, the invention utilizes flue-cured tobacco variety YY3 (high nicotine content, 2.6 + -0.11%) and flue-cured tobacco variety K326 (low nicotine content,1.2 ± 0.13%) a population of recombinant inbred lines (RILs, F7) derived on the basis of fixed parents was constructed, and in addition, a natural population containing 347 different tobacco varieties (lines) was also constructed, the two different types representing the whole tobacco population or tobacco varieties (lines); on the other hand, a Bayes B candidate prediction model obtained by primary screening is adopted to further screen, optimize and construct a whole genome selection model Bayes B for predicting the nicotine content in tobaccoNICAnd the breeding work of the tobacco varieties with different nicotine content levels by Marker Assisted Selection (MAS) in the whole genome range is accelerated, so that the high-quality tobacco varieties with ideal nicotine content levels can be scientifically, efficiently, accurately and reliably cultured.
Disclosure of Invention
The first purpose of the invention is to provide a whole genome selection model for predicting the nicotine content of tobacco; the second purpose is to provide the application of the whole genome selection model for predicting the nicotine content of tobacco.
The first purpose of the invention is realized by that the whole genome selection model for predicting the nicotine content of tobacco is Bayes BNICThe whole genome selection model for predicting the nicotine content in tobacco is obtained by establishing core parameter values of the molecular marker number (n 1), the training population scale (n 2), the training population to test population ratio (n 3) and the model prediction precision value (n 4) in the model on the basis of a Bayes B candidate prediction model obtained by primary screening, and the formula is as follows:
Figure 793942DEST_PATH_IMAGE002
or
Figure 179924DEST_PATH_IMAGE004
Note:
Figure 428502DEST_PATH_IMAGE006
represents: will be provided withThe core 4 parameters n1, n2, n3 and n4 are substituted into the Bayes B candidate prediction model in turn.
The second object of the present invention is achieved by the genome wide selection model Bayes BNICThe method is applied to analyzing the genotype data of each plant in tobacco groups or varieties (lines) so as to accurately predict and obtain the nicotine content of tobacco.
In order to scientifically, efficiently and accurately select tobacco varieties with different nicotine content levels and specifically select offspring materials with ideal nicotine content, the invention provides a whole genome selection model Bayes B for predicting the nicotine content of tobaccoNICThe method comprises the steps of respectively acquiring and analyzing genotype data and nicotine content character phenotype data in roasted leaves of recombinant inbred line populations (RILs), natural populations and comprehensive populations mixed with the RILs and the natural populations by utilizing the model, screening, optimizing and verifying core parameters such as molecular marker number (n 1), training population scale (n 2), training population to test population ratio (n 3) and model prediction precision value (n 4) on the basis of a Bayes B candidate prediction model obtained by primary selection, and finally obtaining a whole genome selection model Bayes B for predicting the nicotine content in tobaccoNICThe method can be used for auxiliary selection of the tobacco nicotine content QTL site in the whole genome range so as to improve the efficiency of molecular marker auxiliary selection and the efficiency of ideal nicotine content variety breeding.
On one hand, the invention utilizes flue-cured tobacco variety YY3 (high nicotine content, 2.6 +/-0.11%) and flue-cured tobacco variety K326 (low nicotine content, 1.2 +/-0.13%) to construct a recombinant inbred line (RILs, F7) group, and simultaneously, also constructs a natural group containing 347 different tobacco varieties (lines), and utilizes the two groups to represent all tobacco groups or varieties (lines); on the other hand, a candidate prediction model Bayes B obtained by primary selection is adopted, and a whole genome selection model Bayes B for predicting the nicotine content in tobacco is further screened, optimized and constructedNICAnd accelerating the molecular Marker Assisted Selection (MAS) in the whole genome range to select and breed tobacco varieties with ideal tobacco nicotine content.
The invention relates to a whole genome selection model Bayes B for predicting nicotine content in tobaccoNICHas the characteristics of science, high efficiency, accuracy and low cost, so the model Bayes BNICCan be applied to culturing a new variety (line) of high-quality tobacco with ideal nicotine content.
Drawings
FIG. 1 is a graph based on the distribution of nicotine content phenotype values in cured lamina of a comprehensive population of tobacco;
wherein, the abscissa represents the nicotine content value in the cured leaves of each individual plant (line) in the tobacco comprehensive population; the ordinate represents the number of tobacco plants;
FIG. 2 is a schematic diagram of a candidate prediction model obtained by preliminary screening of a tobacco synthetic population;
wherein the abscissa represents 4 original models provided in the R language package: bayes A, Bayes B, Bayes C, and rrBLUP; the ordinate represents the precision of the original model for predicting the nicotine content phenotype value of the tobacco group to be detected;
FIG. 3 is the number of molecular markers (n 1) vs. Bayes BNICThe influence of the nicotine content prediction precision of the model;
wherein, the abscissa is the number of molecular markers; ordinate is Bayes BNICThe model predicts the nicotine content accurately; 1K, 2K, 4K, 7K, 11K, 16K, 22K, 29K, 37K and All shown on the abscissa in the figure respectively represent the number of SNP markers for genotyping the tobacco composite population (the mixed population of the recombinant inbred line population and the natural population) to be 1000, 2000, 4000, 7000, 11000, 16000, 22000, 29000, 37000 and 50000;
FIG. 4 is training population size (n 2) versus Bayes BNICThe influence of the nicotine content prediction precision of the model;
wherein the abscissa is Bayes BNICThe model predicts the nicotine content accurately; the ordinate is the training population size (the number of tobacco plants contained in the training population);
FIG. 5 is a ratio of training population to test population (n 3) versus Bayes BNICNicotine content prediction accuracy impact of model;
Wherein the abscissa is the ratio of the training population to the test population (i.e., the number of tobacco plants in the training population: the number of tobacco plants in the test population); ordinate is Bayes BNICThe model predicts the nicotine content accurately; the right hand icons 1/5, 1/4, 1/3, 1/2, 1, 2, 3, 4, 5, 6, and 10 in the figure represent training populations, respectively: the ratios of the test populations were 1:5, 1:4, 1:3, 1:2, 1:1, 2:1, 3:1, 4:1, 5:1, 6:1 and 10:1, respectively.
Detailed Description
The present invention is further illustrated by the following examples and the accompanying drawings, but the present invention is not limited thereto in any way, and any modifications or alterations based on the teaching of the present invention are within the scope of the present invention.
The examples do not specify particular techniques or conditions, and are performed according to the techniques or conditions described in the literature in the art or according to the product specifications. The reagents or instruments used are not indicated by the manufacturer, and are all conventional products available by purchase.
The whole genome selection model for predicting the nicotine content in tobacco is Bayes BNICThe model is a specific numerical value of the number (n 1) of core parameter molecular markers, the scale (n 2) of a training population, the proportion (n 3) of the training population to a testing population and the model prediction precision value (n 4) obtained through experimental screening, optimization and verification on the basis of a Bayes B candidate prediction model obtained through preliminary screening.
The model is Bayes BNICThe corresponding 4 core parameter values are respectively:
number of molecular markers (n 1): a, when cost performance is high (lower or less investment is pursued, higher return or profit is pursued), n1=4000 labels; b, high prediction accuracy (in pursuit of obtaining the most accurate tobacco nicotine content values), n1=7000 markers.
Training population size (n 2): a, when the cost performance is high, n2=200 individuals; b, high prediction accuracy, n2=250 individuals.
Training population to test population ratio (n 3): a, n3=5:1 (i.e., the number of tobacco plants in the training population: the number of tobacco plants in the test population =5: 1) for cost performance; b, when the prediction precision is high, n3=10: 1.
Model prediction accuracy value (n 4): a, when the cost performance is high, n4= 0.52; b, when the prediction accuracy is high, n4= 0.56.
The whole genome selection model Bayes B of the inventionNICThe application of analyzing the genotype data of each plant in the tobacco population or variety (line) in the seedling stage (early stage) so as to predict and obtain the nicotine content in the tobacco.
The application of the whole genome selection model for predicting the nicotine content in tobacco is to utilize Bayes BNICAnd (3) analyzing the genotype value of the tobacco group or variety (line) to be detected in the seedling stage (early stage) by the model, and predicting and obtaining the nicotine content of the tobacco to be detected according to the genotype value.
The invention is further illustrated by the following specific examples:
example 1
Whole genome selection model Bayes B for predicting nicotine content in tobaccoNICConstruction and application of
First, experimental material
Constructing a recombinant inbred line (RILs, F7) group by using a flue-cured tobacco variety YY3 (high nicotine content, 2.6 +/-0.11%) and a flue-cured tobacco variety K326 (low nicotine content, 1.2 +/-0.13%), wherein the RILs group comprises 300 strains; in addition, to construct a predictive model with tobacco commonality, a natural population of tobacco was constructed that was distinct from the fixed parental derivative population, which was composed of 347 different tobacco varieties (lines).
Second, obtaining the nicotine content data in the baked leaves of two different types of tobacco groups
Transplanting the test material to a field after the test material is grown, and starting tobacco leaf harvesting, baking, grading, sample preparation, detection and statistics of the nicotine content in the tobacco leaves after the field tobacco plants are mature. From the results of the finally obtained phenotypic data, it is known that: the nicotine content values in the flue-cured tobacco leaves of all the groups are distributed continuously and positively, and belong to typical quantitative characters, and are shown in figure 1.
Third, SNP marker analysis
Extracting tobacco genome DNA: the conventional CTAB method or plant tissue DNA extraction kit can be adopted, and the method can refer to the existing literature or the instruction in the kit. But the extracted tobacco DNA needs to be purified to remove RNA, protein and other organic impurities, so that the requirement of developing the SNP chip is met; if the tobacco sample is subjected to genome re-sequencing to mine the SNP marker, the corresponding tobacco DNA quality needs to be processed according to the requirements of a sequencing company.
Four, whole genome selection model Bayes BNICConstruction and application of
4.1 preliminary screening of Bayes B candidate predictive models
The method comprises the steps of simulating the tobacco nicotine content value of 4 original models (Bayes A, Bayes B, Bayes C and rrBLUP) provided in an R language package by utilizing SNP genotype data of tobacco strains of a tobacco recombinant inbred line (RILs, 300 strains), a tobacco natural population (347 different tobacco varieties) and a tobacco comprehensive population (647 strains) formed by mixing the two and the tobacco plant SNP genotype data, preliminarily screening the original models capable of well predicting the target character phenotype value, and finally screening to obtain Bayes B as a whole genome selection candidate prediction model conforming to the predicted tobacco nicotine content, wherein the figure is 2.
4.2 Bayes BNICConstruction and application of model
Screening, optimizing and verifying a whole genome selection model aiming at tobacco nicotine content prediction is respectively carried out on a recombinant inbred line population containing 300 and 347 single plants (lines), a natural population and a tobacco comprehensive population containing 647 single plants (lines) generated after mixing the natural population and the recombinant inbred line population containing 300 and 347 single plants (lines) by utilizing a Bayes B candidate prediction model obtained by primary screening. The specific method comprises the following steps:
firstly, 50000 high-quality SNP markers uniformly distributed on the whole genome of tobacco are utilized to analyze a tobacco comprehensive group containing 647 single plants (lines) and obtain genotype data; secondly, detecting and counting the nicotine content in the tobacco leaves of each individual plant (line) of the tobacco comprehensive group after baking to obtain the tobacco nicotineContent phenotype data; thirdly, with Bayes B obtained by primary screening as a candidate prediction model, substituting the genotype data and nicotine content phenotype data of each individual plant (line) in the tobacco comprehensive population into the candidate prediction model, and continuously screening, optimizing and verifying the specific numerical values of the number of core parameter molecular markers (n 1), training population scale (n 2), training population to test population ratio (n 3) and model prediction accuracy value (n 4) in the candidate prediction model, thereby constructing a whole genome selection model Bayes B for predicting the nicotine content in tobaccoNIC. That is, 50000 SNP markers were divided into 10 gradients (1000, 2000, 4000, 7000, 11000, 16000, 22000, 29000, 37000, and 50000) for determination of Bayes BNICThe number of molecular markers in the model (n 1) parameter, the results are shown in FIG. 3; the comprehensive population is used as training population according to 50, 100, 150, 200, 250 and 300 single plants (lines), and the recombinant inbred line population (300 lines), the natural population (347 lines) and the comprehensive population (647 lines) are respectively used as training population for determining Bayes BNICTraining population size in the model (n 2), the results of which are detailed in FIG. 4; 647 lines in the synthetic population are determined according to the number of tobacco plants (lines) in the training population: bayes B was determined by 11 gradients of 1:5, 1:4, 1:3, 1:2, 1:1, 2:1, 3:1, 4:1, 5:1, 6:1 and 10:1 of tobacco plants (lines) in the test population, respectivelyNICThe training population to test population ratio in the model (n 3) is shown in FIG. 5. Fourth, newly established Bayes BNICModel calculation is carried out to obtain a predicted value of the nicotine content in the roasted leaves of each individual plant (line) in the tobacco comprehensive group, then the predicted value is compared with the respective real detection value of the nicotine content to obtain prediction accuracy, and the first 2 values with the highest prediction accuracy are selected as a high prediction accuracy value (n 4= 0.56) and a high cost performance prediction accuracy value (n 4= 0.52). Finally, after experimental verification, optimization and practical application, a whole genome selection model Bayes B for predicting the nicotine content of tobacco is constructedNICThe formula is as follows:
Figure DEST_PATH_IMAGE008
or
Figure DEST_PATH_IMAGE010
Note:
Figure DEST_PATH_IMAGE012
represents: the 4 parameters n1, n2, n3 and n4 of the core are substituted into the Bayes B candidate prediction model in sequence.

Claims (7)

1. A whole genome selection model for predicting the nicotine content in tobacco is characterized in that the whole genome selection model for predicting the nicotine content in tobacco is Bayes BNICThe whole genome selection model for predicting the nicotine content in tobacco is obtained by establishing core parameter values of the molecular marker number (n 1), the training population scale (n 2), the training population to test population ratio (n 3) and the model prediction precision value (n 4) in the model on the basis of a Bayes B candidate prediction model obtained by primary screening, and the formula is as follows:
Figure DEST_PATH_IMAGE001
or
Figure 181942DEST_PATH_IMAGE002
Note:
Figure DEST_PATH_IMAGE003
represents: the 4 parameters n1, n2, n3 and n4 of the core are substituted into the Bayes B candidate prediction model in sequence.
2. The genome-wide selection model for predicting tobacco nicotine content of claim 1, wherein the number of molecular markers (n 1) is:
a, cost effective (lower or less investment pursuit, higher return or profit), n1=4000 labels
b, high prediction accuracy (in pursuit of obtaining the most accurate tobacco nicotine content values), n1=7000 markers.
3. The genome-wide selection model for predicting tobacco nicotine content of claim 1, wherein the training population scale (n 2) is:
a, when the cost performance is high, n2=200 individuals;
b, high prediction accuracy, n2=250 individuals.
4. The genome-wide selection model for predicting tobacco nicotine content of claim 1, wherein the ratio of training population to test population (n 3) is:
a, when cost effective, n3=5:1 (i.e., number of plants in training population: number of plants in test population =5: 1);
b, when the prediction precision is high, n3=10: 1.
5. The genome-wide selection model for predicting tobacco nicotine content of claim 1, wherein the model prediction accuracy value (n 4) is:
a, when the cost performance is high, n4= 0.52;
b, when the prediction accuracy is high, n4= 0.56.
6. Use of the whole genome selection model for predicting tobacco nicotine content according to any one of claims 1 to 5, characterized in that the whole genome selection model for predicting tobacco nicotine content is used, Bayes BNICAnd analyzing the genotype data of the whole tobacco or variety (line) and accurately predicting the nicotine content phenotype value in the cured leaf of each tobacco plant in the population in the whole genome range.
7. The use according to claim 6, characterized in that the whole genome selection model Bayes B is usedNICAnd analyzing the genotype data of the tobacco population or the tobacco variety (line), and accurately predicting the nicotine content value after tobacco baking in the whole genome range, thereby realizing that the accurate value of the nicotine content can be obtained in the seedling stage (early stage) of the tobacco.
CN201911141188.0A 2019-11-20 2019-11-20 Whole genome selection model for predicting nicotine content in tobacco and application thereof Active CN111223520B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911141188.0A CN111223520B (en) 2019-11-20 2019-11-20 Whole genome selection model for predicting nicotine content in tobacco and application thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911141188.0A CN111223520B (en) 2019-11-20 2019-11-20 Whole genome selection model for predicting nicotine content in tobacco and application thereof

Publications (2)

Publication Number Publication Date
CN111223520A true CN111223520A (en) 2020-06-02
CN111223520B CN111223520B (en) 2023-09-12

Family

ID=70829015

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911141188.0A Active CN111223520B (en) 2019-11-20 2019-11-20 Whole genome selection model for predicting nicotine content in tobacco and application thereof

Country Status (1)

Country Link
CN (1) CN111223520B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111172294A (en) * 2020-02-25 2020-05-19 云南省烟草农业科学研究院 Codominant SSR marker closely linked with tobacco nicotine synthesis major regulatory gene nic1 and application thereof
CN111518896A (en) * 2020-06-28 2020-08-11 云南中烟工业有限责任公司 Primer group, application, product and method for detecting nicotine dependence related SNP site
CN111883205A (en) * 2020-07-14 2020-11-03 云南省烟草农业科学研究院 Prediction method for selecting harmful components of tobacco based on whole genome and application
CN111887473A (en) * 2020-07-14 2020-11-06 云南省烟草农业科学研究院 Prediction method for smoke harmful component release amount based on whole genome selection and application
WO2023082780A1 (en) * 2021-11-10 2023-05-19 云南省烟草农业科学研究院 Molecular marker nicotine associated snp 1 for identifying content of nicotine in tobacco and kit and application thereof
WO2024056056A1 (en) * 2022-09-15 2024-03-21 中国科学院植物研究所 Prediction device and early warning system for cadmium accumulation traits in rice grains on basis of genomic selection research

Citations (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006002240A2 (en) * 2004-06-19 2006-01-05 Chondrogene, Inc. Computer systems and methods for constructing biological classifiers and uses thereof
US20070269804A1 (en) * 2004-06-19 2007-11-22 Chondrogene, Inc. Computer system and methods for constructing biological classifiers and uses thereof
US20080163824A1 (en) * 2006-09-01 2008-07-10 Innovative Dairy Products Pty Ltd, An Australian Company, Acn 098 382 784 Whole genome based genetic evaluation and selection process
US20110296753A1 (en) * 2010-06-03 2011-12-08 Syngenta Participations Ag Methods and compositions for predicting unobserved phenotypes (pup)
US20120115138A1 (en) * 2009-04-07 2012-05-10 Biocrates Life Sciences Ag Method for in vitro diagnosing a complex disease
US20140220568A1 (en) * 2011-06-24 2014-08-07 Universiteit Gent Means and methods for the determination of prediction models associated with a phenotype
WO2016088949A1 (en) * 2014-12-05 2016-06-09 연세대학교 산학협력단 System for predicting genes associated with plant complex traits by using arabidopsis thaliana gene network
WO2016100061A1 (en) * 2014-12-18 2016-06-23 Pioneer Hi-Bred International, Inc. Improved molecular breeding methods
US20170159045A1 (en) * 2015-12-07 2017-06-08 Zymergen, Inc. Microbial strain improvement by a htp genomic engineering platform
US20170235914A1 (en) * 2011-06-10 2017-08-17 Philip Morris Products S.A. Systems and methods for quantifying the impact of biological perturbations
US20170245446A1 (en) * 2014-10-27 2017-08-31 Pioneer Hi-Bred International, Inc. Improved molecular breeding methods
CN108388765A (en) * 2018-02-12 2018-08-10 中国农业科学院作物科学研究所 Full-length genome selection and use value unbiased esti-mator tool GS1.0 based on the network platform
GB201810897D0 (en) * 2018-07-03 2018-08-15 Chronomics Ltd Phenotype prediction
WO2018234639A1 (en) * 2017-06-22 2018-12-27 Aalto University Foundation Sr. Method and system for selecting a plant variety
CN109360606A (en) * 2018-11-19 2019-02-19 广西壮族自治区农业科学院水稻研究所 A kind of method of low-density SNP genome area Accurate Prediction BSA-seq candidate gene
CA3074655A1 (en) * 2017-09-05 2019-03-14 Adaptive Phage Therapeutics, Inc. Methods to determine the sensitivity profile of a bacterial strain to a therapeutic composition
CN109727641A (en) * 2019-01-22 2019-05-07 袁隆平农业高科技股份有限公司 A kind of full-length genome prediction technique and device
CN110476214A (en) * 2017-03-30 2019-11-19 孟山都技术有限公司 System and method for identifying the Aggregate effect of the genome editor of multiple genome editors and prediction identification

Patent Citations (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070269804A1 (en) * 2004-06-19 2007-11-22 Chondrogene, Inc. Computer system and methods for constructing biological classifiers and uses thereof
WO2006002240A2 (en) * 2004-06-19 2006-01-05 Chondrogene, Inc. Computer systems and methods for constructing biological classifiers and uses thereof
US20080163824A1 (en) * 2006-09-01 2008-07-10 Innovative Dairy Products Pty Ltd, An Australian Company, Acn 098 382 784 Whole genome based genetic evaluation and selection process
US20120115138A1 (en) * 2009-04-07 2012-05-10 Biocrates Life Sciences Ag Method for in vitro diagnosing a complex disease
US20110296753A1 (en) * 2010-06-03 2011-12-08 Syngenta Participations Ag Methods and compositions for predicting unobserved phenotypes (pup)
US20170235914A1 (en) * 2011-06-10 2017-08-17 Philip Morris Products S.A. Systems and methods for quantifying the impact of biological perturbations
US20140220568A1 (en) * 2011-06-24 2014-08-07 Universiteit Gent Means and methods for the determination of prediction models associated with a phenotype
US20170245446A1 (en) * 2014-10-27 2017-08-31 Pioneer Hi-Bred International, Inc. Improved molecular breeding methods
WO2016088949A1 (en) * 2014-12-05 2016-06-09 연세대학교 산학협력단 System for predicting genes associated with plant complex traits by using arabidopsis thaliana gene network
US20170359978A1 (en) * 2014-12-18 2017-12-21 Pioneer Hi-Bred International, Inc. Molecular breeding methods
WO2016100061A1 (en) * 2014-12-18 2016-06-23 Pioneer Hi-Bred International, Inc. Improved molecular breeding methods
US20170159045A1 (en) * 2015-12-07 2017-06-08 Zymergen, Inc. Microbial strain improvement by a htp genomic engineering platform
CN110476214A (en) * 2017-03-30 2019-11-19 孟山都技术有限公司 System and method for identifying the Aggregate effect of the genome editor of multiple genome editors and prediction identification
WO2018234639A1 (en) * 2017-06-22 2018-12-27 Aalto University Foundation Sr. Method and system for selecting a plant variety
CA3074655A1 (en) * 2017-09-05 2019-03-14 Adaptive Phage Therapeutics, Inc. Methods to determine the sensitivity profile of a bacterial strain to a therapeutic composition
CN108388765A (en) * 2018-02-12 2018-08-10 中国农业科学院作物科学研究所 Full-length genome selection and use value unbiased esti-mator tool GS1.0 based on the network platform
GB201810897D0 (en) * 2018-07-03 2018-08-15 Chronomics Ltd Phenotype prediction
CN109360606A (en) * 2018-11-19 2019-02-19 广西壮族自治区农业科学院水稻研究所 A kind of method of low-density SNP genome area Accurate Prediction BSA-seq candidate gene
CN109727641A (en) * 2019-01-22 2019-05-07 袁隆平农业高科技股份有限公司 A kind of full-length genome prediction technique and device

Non-Patent Citations (19)

* Cited by examiner, † Cited by third party
Title
G.SU: "Comparison of genomic predictions using genomic relationship matrices built with different weighting factors to account for locus-specific variances", JOURNAL OF DAIRY SCIENCE, vol. 97, pages 6547 - 6559 *
L A PEIXOTO: "Determination of the optimal number of markers and individuals in a training population necessary for maximum prediction accuracy in F2 populations by using genomic selection models", GENETICS AND MOLECULAR RESEARCH, vol. 15, no. 04 *
刘小刚: "玉米产量相关性状的全基因组选择", 中国博士学位论文全文数据库 (农业科技辑), no. 01, pages 4 - 30 *
刘策,孟焕文,程智慧: ""植物全基因组选择育种技术原理与研究进展"", 《分子植物育种》 *
刘策,孟焕文,程智慧: ""植物全基因组选择育种技术原理与研究进展"", 《分子植物育种》, vol. 18, no. 16, 10 October 2019 (2019-10-10) *
化烨: "基于基因组选择模型预测拟南芥开花时间的研究", 《陇东学院学报》 *
化烨: "基于基因组选择模型预测拟南芥开花时间的研究", 《陇东学院学报》, no. 03, 15 May 2016 (2016-05-15) *
吴桂琴;闫奕源;李花妮;李光奇;韩文朋;赵丽红;: "基因组选择技术在家禽育种中的应用", 中国家禽, no. 09 *
唐金梅等: "全基因组选择在植物育种中的研究进展", 《贵州农业科学》 *
唐金梅等: "全基因组选择在植物育种中的研究进展", 《贵州农业科学》, no. 08, 15 August 2016 (2016-08-15) *
孙鑫等: "烤烟烟碱含量的全基因组关联分析", 《中国烟草学报》 *
孙鑫等: "烤烟烟碱含量的全基因组关联分析", 《中国烟草学报》, no. 01, 25 January 2017 (2017-01-25) *
王欣;孙辉;胡中立;徐辰武;: "基因组选择方法研究进展", 扬州大学学报(农业与生命科学版), no. 01 *
王欣等: "基因组选择方法研究进展", 《扬州大学学报(农业与生命科学版)》 *
王欣等: "基因组选择方法研究进展", 《扬州大学学报(农业与生命科学版)》, no. 01, 27 April 2018 (2018-04-27) *
谈成等: "基因组选择技术在农业动物育种中的应用", 《遗传》 *
谈成等: "基因组选择技术在农业动物育种中的应用", 《遗传》, no. 11, 13 November 2017 (2017-11-13) *
赵雄伟: ""玉米控制重金属镉积累的遗传基础解析"", 《中国博士学位论文全文数据库》 *
赵雄伟: ""玉米控制重金属镉积累的遗传基础解析"", 《中国博士学位论文全文数据库》, no. 03, 1 December 2018 (2018-12-01) *

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111172294A (en) * 2020-02-25 2020-05-19 云南省烟草农业科学研究院 Codominant SSR marker closely linked with tobacco nicotine synthesis major regulatory gene nic1 and application thereof
CN111172294B (en) * 2020-02-25 2023-03-24 云南省烟草农业科学研究院 Co-dominant SSR marker closely linked with tobacco nicotine synthesis major regulatory gene nic1 and application thereof
CN111518896A (en) * 2020-06-28 2020-08-11 云南中烟工业有限责任公司 Primer group, application, product and method for detecting nicotine dependence related SNP site
CN111518896B (en) * 2020-06-28 2023-09-26 云南中烟工业有限责任公司 Primer group, application, product and method for detecting SNP locus related to nicotine dependency
CN111883205A (en) * 2020-07-14 2020-11-03 云南省烟草农业科学研究院 Prediction method for selecting harmful components of tobacco based on whole genome and application
CN111887473A (en) * 2020-07-14 2020-11-06 云南省烟草农业科学研究院 Prediction method for smoke harmful component release amount based on whole genome selection and application
CN111887473B (en) * 2020-07-14 2022-07-01 云南省烟草农业科学研究院 Prediction method for smoke harmful component release amount based on whole genome selection and application
CN111883205B (en) * 2020-07-14 2023-10-20 云南省烟草农业科学研究院 Prediction method for selecting harmful ingredients of tobacco based on whole genome and application
WO2023082780A1 (en) * 2021-11-10 2023-05-19 云南省烟草农业科学研究院 Molecular marker nicotine associated snp 1 for identifying content of nicotine in tobacco and kit and application thereof
WO2024056056A1 (en) * 2022-09-15 2024-03-21 中国科学院植物研究所 Prediction device and early warning system for cadmium accumulation traits in rice grains on basis of genomic selection research

Also Published As

Publication number Publication date
CN111223520B (en) 2023-09-12

Similar Documents

Publication Publication Date Title
CN111223520B (en) Whole genome selection model for predicting nicotine content in tobacco and application thereof
Liu et al. Enhancing grain-yield-related traits by CRISPR–Cas9 promoter editing of maize CLE genes
Li et al. Genomic insights into historical improvement of heterotic groups during modern hybrid maize breeding
Piperidis et al. GISH characterization of Erianthus arundinaceus chromosomes in three generations of sugarcane intergeneric hybrids
JP2019022516A (en) Selection of symbiota by screening multiple host-symbiont associations
Sinha et al. Genomics and breeding innovations for enhancing genetic gain for climate resilience and nutrition traits
CN109830261B (en) Method for screening quantitative trait candidate genes
Kawasaki et al. Genome analysis of Lotus japonicus
CN110782943B (en) Whole genome selection model for predicting plant height of tobacco and application thereof
Shu et al. Genomic selection of seed weight based on low-density SCAR markers in soybean
Pégard et al. Favorable conditions for genomic evaluation to outperform classical pedigree evaluation highlighted by a proof-of-concept study in poplar
CN110853711B (en) Whole genome selection model for predicting fructose content of tobacco and application thereof
Jia et al. QTL mapping and BSA-seq map a major QTL for the node of the first fruiting branch in cotton
Lv et al. Stable QTLs for plant height on chromosome A09 identified from two mapping populations in peanut (Arachis hypogaea L.)
Wang et al. Breeding evaluation and precise mapping of Fhb8 for Fusarium head blight resistance in wheat (Triticum aestivum)
Hirel et al. The molecular genetics of nitrogen use efficiency in crops
Lubanga et al. Genomic selection strategies to increase genetic gain in tea breeding programs
CN110853710B (en) Whole genome selection model for predicting starch content of tobacco and application thereof
CN108441574A (en) A kind of gene finely positioning method being sequenced for Plant Genome based on map based cloning principle and segregating population is inter-subspecies hybrid
CN117248069A (en) Haplotype related to upland cotton plant height and application thereof
Bhat et al. Phenomics: a challenge for crop improvement in genomic era
CN108504769A (en) A kind of InDel molecular labelings and its application with the Chinese cabbage group flowering time close linkage of photoperiod-sensitive
CN111798920B (en) Tobacco economic character phenotype value prediction method based on whole genome selection and application
Lan et al. Application of molecular marker assisted selection in gene pyramiding and selection of new cultivars
Han et al. Resequencing of 558 Chinese mungbean landraces identifies genetic loci associated with key agronomic traits

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant