CN113113086B - Wheat scab resistance dsMQTL analysis method - Google Patents

Wheat scab resistance dsMQTL analysis method Download PDF

Info

Publication number
CN113113086B
CN113113086B CN202110350135.0A CN202110350135A CN113113086B CN 113113086 B CN113113086 B CN 113113086B CN 202110350135 A CN202110350135 A CN 202110350135A CN 113113086 B CN113113086 B CN 113113086B
Authority
CN
China
Prior art keywords
dsmqtl
qtl
wheat scab
interval
software
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110350135.0A
Other languages
Chinese (zh)
Other versions
CN113113086A (en
Inventor
李韬
花辰
温卓
孙政玺
李磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yangzhou University
Original Assignee
Yangzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yangzhou University filed Critical Yangzhou University
Priority to CN202110350135.0A priority Critical patent/CN113113086B/en
Publication of CN113113086A publication Critical patent/CN113113086A/en
Application granted granted Critical
Publication of CN113113086B publication Critical patent/CN113113086B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Landscapes

  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Medical Informatics (AREA)
  • Epidemiology (AREA)
  • Software Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Bioethics (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Public Health (AREA)
  • Biophysics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention discloses a wheat scab resistance dsMQTL analysis method, which comprises the following steps: collecting wheat scab QTL data, taking the average value and standard deviation of the QTL interval as screening standards, removing the QTL with oversized interval, and carrying out primary screening on the QTL; the screened data is imported dsMQTL into software to obtain dsMQTL. The invention omits the step of constructing or selecting genetic map in classical meta-analysis, the mapping result obviously improves the robustness and uniqueness of meta-QTL interval, and the research result is beneficial to the comparison of different experimental results, the fine positioning of subsequent target QTL and the genetic improvement assisted by molecular markers.

Description

Wheat scab resistance dsMQTL analysis method
Technical Field
The invention relates to a wheat scab resistance dsMQTL analysis method, and belongs to the technical field of data analysis.
Background
Since QTL localization was first published using molecular markers, many QTLs associated with different phenotypic traits have been reported on animals and plants. A large portion of QTLs are difficult to reproduce in different genetic contexts and environments.
Classical QTL element analysis methods first require the selection or construction of genetic maps as vectors for the original QTL mapping. Because the sizes, types, molecular marker types and numbers of mapping populations used for different QTL positioning are different, common markers are not available or less common markers exist among genetic maps constructed by different experiments, the traditional meta-analysis method often selects and integrates different genetic maps to increase the marker density, but the integration of different genetic maps is a complex process, and because of different sources of sub-maps, the integration maps used for different meta-analyses are also different, so that the comparability of different research results is poor. More importantly, some QTL result intervals are too large, so that a large number of invalid intervals appear in the meta-analysis results, and the meta-analysis accuracy is seriously influenced. Therefore, the reasonable removal of the large-interval QTL is beneficial to increasing the accuracy and reliability of the analysis result, but the existing meta-analysis method solves the problem.
At present, a meta-analysis method capable of effectively removing negative influence of a large-area QTL and outputting stable and reliable results is lacking.
Disclosure of Invention
The invention aims to overcome the defects of the prior meta-analysis method, and provides the wheat scab-resistant dsMQTL analysis method, so that the output result is stable and reliable, and the comparison of different experimental results is convenient.
In order to solve the technical problems, the invention provides a wheat scab resistance dsMQTL analysis method, which comprises the following steps:
collecting QTL data of wheat scab,
Taking the average value and standard deviation of the QTL interval as screening standards, removing the QTL with oversized interval, and carrying out primary screening on the QTL;
The screened data is imported dsMQTL into software to obtain dsMQTL.
Preferably, the wheat scab QTL data collected includes chromosome, breed name, inoculation method, inoculation environment, number of experiments, number of experimental sites, experimental population size, confidence interval, physical location, peak signature and location, LOD value, and phenotype interpretation rate.
Preferably, the method of preliminary screening is: calculating the average value and standard deviation of the physical interval length of all the QTL, taking the average value and standard deviation of the QTL interval as the standard, and eliminating the QTL with the average value and standard deviation of the QTL interval. Taking the average value and standard deviation of the QTL interval as screening standards, removing the QTL with oversized interval while retaining more original QTL information as much as possible.
Preferably, results dsMQTL are obtained using a sequencing map as QTL mapping vector. The selection or construction step of genetic maps in conventional meta-analysis is not required.
Preferably, the dsMQTL software is based on the python environment, running in the python environment, and is able to fit most QTLs.
Preferably, the dsMQTL software uses least square method to perform interval fitting, and the software includes 4 models of 1-dsMQTL,2-dsMQTL,3-dsMQTL and 4-dsMQTL, and the conditions of 1, 2,3 and 4 dsMQTL on chromosomes are assumed respectively.
Preferably, after fitting and checking the 4 models respectively, selecting a model which is most in line with the QTL reliability accumulation curve, and generating subsequent dsMQTL on the premise of meeting the following two standards: (1) The cumulative curves of the QTL confidence coefficient are clustered into peaks, and a model can be fitted; (2) one dsMQTL at least 3 partially overlapping original QTLs.
The invention has the beneficial effects that:
(1) In the data preparation stage of meta-analysis, the QTL with the oversized interval is removed, so that the condition that an invalid interval appears in the result is avoided, and the interval is further narrowed;
(2) The traditional meta analysis needs a great deal of time to make an integrated map, and the method directly uses a sequencing map, so that the time of researchers is greatly saved.
(3) The invention can obviously improve the robustness and the uniqueness of the dsMQTL interval, and the research result is favorable for comparing different experimental results, the subsequent fine positioning of the target QTL and the genetic improvement assisted by the molecular marker, and is more favorable for targeted screening and mining of the character related functional genes of the dsMQTL interval. In principle, the dsMQTL method allows meta-analysis of any species.
Drawings
FIG. 1 is a graph of the QTL accumulation on the 3B chromosome, with the abscissa representing physical distance, the ordinate representing the confidence of the QTL (the sum of all QTL confidence in that range), each thin line representing the QTL, and the black bold line representing the accumulation of QTL values.
Fig. 2 is a dsMQTL-fit on the 3B chromosome, with the abscissa representing physical distance, the ordinate representing the confidence level (addition of all QTL confidence levels in the range) of dsMQTL, the darker curve being the QTL accumulation curve, the lighter curve being the dsMQTL-fit, and the lighter line on the abscissa being the dsMQTL interval.
Detailed Description
The invention is further described below with reference to the accompanying drawings. The following examples are only for more clearly illustrating the technical aspects of the present invention, and are not intended to limit the scope of the present invention.
Examples: wheat 3B chromosome scab resistance dsMQTL analysis
(1) Wheat scab QTL data collection
The study collects the QTL localization literature reported on wheat 3B chromosome and FHB resistance from 2001 to date. The collated content not only has QTL result information in each document but also contains each experimental condition. The data of the chromosome, the variety name, the inoculation method, the inoculation environment, the experiment times, the experiment place number, the experiment population size, the confidence interval (mark), the physical position, the peak value mark and the position, the LOD value, the phenotype interpretation rate and the like, which are reported in the literature and relate to each QTL, are summarized. If the case of no physical space is given in the article is encountered, it can be obtained by a two-sided tag on URGI (https:// heat-ugi. Versailles. Inra. Fr/Seq-reproducer/Assembles).
(2) QTL screening
The QTL interval in the collected data has large difference, the QTL reference value of the interval which is too large is small, the dsMQTL interval is too large, and the reliability is reduced. We choose QTL interval size mean + standard deviation as the screening criteria. The overall situation is shown by the average of QTL interval sizes, and the degree of dispersion is shown by the standard deviation. The advantage of taking the average value and the standard deviation as the screening standard is that the confidence interval of the high reproducibility QTL can be effectively reduced under the condition that the interval is not lost.
(3) Selection of a sequencing map
The physical map is selected from a sequencing map IWGSC REFSEQ V1.0.0 of the wheat variety Chinese spring.
(4) Performing dsMQTL analysis
For use by researchers, the method developed dsMQTL analysis software based on the python language. In software, we assume that QTLs are normally distributed with the peak as the center, the variance of which is estimated by QTL data with the chromosome, and in order to best fit experimental data in the sense of minimum variance, the method adopts least square method construction modeling. The procedure involved 4 models, 1-dsMQTL,2-dsMQTL,3-dsMQTL,4-dsMQTL, assuming 1, 2,3, 4 dsMQTL on the chromosome, respectively, which fit most of the data well.
The original QTL physical intervals and experimental elements are organized into compatible formats of Python programs. After fitting test is carried out on the 4 models respectively, selecting the model which is most in line with the QTL reliability accumulation curve, and generating subsequent dsMQTL on the premise of meeting the following two standards: (1) The cumulative curves of the QTL confidence coefficient are clustered into peaks, and a model can be fitted; (2) One dsMQTL has at least 3 (partially) overlapping original QTLs.
(6) DsMQTL reliability verification
Fhb1 located on the 3B chromosome short arm is considered the most effective and stable QTL discovered so far. In the current study, researchers have cloned two different Fhb1 candidate genes: one believes that Fhb1 encodes a chimeric lectin protein gene (PFT), and the other two groups reported histidine-rich calbindin (TaHRC) as a candidate gene, but with distinct functions. These two genes are closely linked on the chromosome and therefore, both genes can be used to represent the Fhb1 locus (rather than the Fhb1 gene itself). TaHRC is used herein to represent the Fhb1 site. Therefore, we used Fhb1 as an internal reference to test dsMQTL for reliability. The physical position of the cloned QTL (expressed as a genetic marker) should overlap with the peak position of dsMQTL or be less than the wheat linkage-imbalance decay distance (LD), otherwise dsMQTL is considered unreliable.
The experiment proves that the obtained result is that:
(1) Wheat scab QTL data collection and screening
84 QTLs were retrieved in total by systematic searches from documents on the 3B chromosome reported so far in 2001 that are related to wheat scab resistance QTL mapping. Because of sparse mark density and small population in partial experiments, the positioned QTL interval is too large, and part of QTL interval occupies 1/3-1/2 of the physical whole length of the chromosome, so that the QTL of the large interval has limited information provided in inheritance, has little significance in practical application, and increases the error of element analysis and reduces the positioning accuracy, therefore, the method takes each chromosome as a unit, firstly calculates the average value+standard deviation (Table 1) of the length of all the QTL physical intervals, and eliminates the QTL of the QTL interval with the average value+standard deviation to obtain 72 original QTLs.
Table 1 QTL screening criteria
(2) DsMQTL analysis results
And (3) finishing the collected data into an excel file, and then importing dsMQTL software. dsMQTL software developed by the method needs to be operated in a python3 environment, and after an initial interface appears in the opened software; clicking open first, selecting file position to import file; then clicking Preview to check whether the imported file is correct; then clicking Calculation will pop up the results automatically at this point, and clicking Exportresult can export the results in excel format.
According to the distribution characteristics of the QTL on each chromosome, a cumulative curve is drawn (figure 1), then a model which is best fit with the QTL reliability cumulative curve is selected to obtain dsMQTL (figure 2), 3 dsMQTL (table 2) are obtained on the 3B chromosome, and the redundancy is reduced by about 20 times. The dsMQTL-1 position corresponds to Fhb1.
Table 2 dsMQTL results on 3B chromosome
The Fhb1 candidate gene TaHRC (gene: traesCS B02G 019900) is located at the 8.5Mb position on the short arm of the chromosome 3B of the reference sequence. dsMQTL-1 and Fhb1 were matched in the interval 0.72Mb-13.99Mb with peaks at 7,357,892bp and peaks at TaHRC a distance of only 1.2Mb, 20Mb below the wheat LD attenuation, demonstrating the reliability and accuracy of dsMQTL.
The experimental results prove that the meta-analysis result is accurate and reliable, and the time consumed by researchers is greatly shortened.
The foregoing is merely a preferred embodiment of the present invention, and it should be noted that modifications and variations could be made by those skilled in the art without departing from the technical principles of the present invention, and such modifications and variations should also be regarded as being within the scope of the invention.

Claims (4)

1. The wheat scab resistance dsMQTL analysis method is characterized by comprising the following steps:
collecting QTL data of wheat scab,
Calculating the average value and standard deviation of the physical length of all QTL intervals, and eliminating QTL of the QTL intervals with the average value and standard deviation of the physical length;
importing the screened data into dsMQTL software to obtain dsMQTL;
the dsMQTL software uses a least square method to perform interval fitting, the software comprises 4 models of 1-dsMQTL,2-dsMQTL,3-dsMQTL and 4-dsMQTL, and the conditions of 1,2,3 and 4 dsMQTL on chromosomes are respectively assumed;
After fitting test is carried out on the 4 models respectively, selecting the model which is most in line with the QTL reliability accumulation curve, and generating subsequent dsMQTL on the premise of meeting the following two standards: (1) The cumulative curves of the QTL confidence coefficient are clustered into peaks, and a model can be fitted; (2) one dsMQTL at least 3 partially overlapping original QTLs.
2. The wheat scab resistance dsMQTL assay of claim 1, wherein the collected wheat scab QTL data includes chromosome, breed name, inoculation method, inoculation environment, number of trials, number of experimental sites, experimental population size, confidence interval, physical location, peak signature and location, LOD value and phenotype interpretation rate.
3. The method for analyzing wheat scab resistance dsMQTL according to claim 1, wherein the result dsMQTL is obtained by mapping the vector with a sequencing map as QTL.
4. The wheat scab resistance dsMQTL assay of claim 1, wherein the dsMQTL software is based on a python environment.
CN202110350135.0A 2021-03-31 2021-03-31 Wheat scab resistance dsMQTL analysis method Active CN113113086B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110350135.0A CN113113086B (en) 2021-03-31 2021-03-31 Wheat scab resistance dsMQTL analysis method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110350135.0A CN113113086B (en) 2021-03-31 2021-03-31 Wheat scab resistance dsMQTL analysis method

Publications (2)

Publication Number Publication Date
CN113113086A CN113113086A (en) 2021-07-13
CN113113086B true CN113113086B (en) 2024-04-19

Family

ID=76713274

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110350135.0A Active CN113113086B (en) 2021-03-31 2021-03-31 Wheat scab resistance dsMQTL analysis method

Country Status (1)

Country Link
CN (1) CN113113086B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011050296A1 (en) * 2009-10-22 2011-04-28 Seminis Vegetable Seeds, Inc. Methods and compositions for identifying downy mildew resistant cucumber plants
CN105018478A (en) * 2015-07-09 2015-11-04 中国农业科学院麻类研究所 Molecular marker tightly linked to fiber fineness of ramie, obtaining method and application thereof
CN108060260A (en) * 2018-01-29 2018-05-22 吉林省农业科学院 SNP marker relevant with soya seeds methionine content, section, primer and application
CN109112157A (en) * 2017-06-22 2019-01-01 华中农业大学 The silencer CNV-18bp of panicle development gene and the application in rice yield improvement
CN109913572A (en) * 2019-04-08 2019-06-21 鲁东大学 The molecular labeling of wheat spike length main effect QTL compact linkage and its application
CN112048568A (en) * 2020-10-13 2020-12-08 湖北省农业科学院粮食作物研究所 Acquisition of corn seedling stage stain-resistant major QTL qWT7.02 and development and application of molecular marker primer thereof

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011050296A1 (en) * 2009-10-22 2011-04-28 Seminis Vegetable Seeds, Inc. Methods and compositions for identifying downy mildew resistant cucumber plants
CN105018478A (en) * 2015-07-09 2015-11-04 中国农业科学院麻类研究所 Molecular marker tightly linked to fiber fineness of ramie, obtaining method and application thereof
CN109112157A (en) * 2017-06-22 2019-01-01 华中农业大学 The silencer CNV-18bp of panicle development gene and the application in rice yield improvement
CN108060260A (en) * 2018-01-29 2018-05-22 吉林省农业科学院 SNP marker relevant with soya seeds methionine content, section, primer and application
CN109913572A (en) * 2019-04-08 2019-06-21 鲁东大学 The molecular labeling of wheat spike length main effect QTL compact linkage and its application
CN112048568A (en) * 2020-10-13 2020-12-08 湖北省农业科学院粮食作物研究所 Acquisition of corn seedling stage stain-resistant major QTL qWT7.02 and development and application of molecular marker primer thereof

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
三个小麦赤霉病抗源的抗性 QTL 定位;张旭;《麦类作物学报》;第26卷(第3期);28-33 *
三个小麦赤霉病抗源的抗性QTL 定位;张旭;《麦类作物学报》;第26卷(第3期);28-33 *
基于全基因组遗传标记开展上海 白 猪 ( 上系) 种质特性及其杂种优势预测研究;盛中华;《中国博士学位全文数据库》;44 *
大豆底荚高度 QTL 定位及候选基因挖掘;赵圆圆;《中国油料作物学报》;第42卷(第1期);51-60 *
大豆底荚高度QTL定位及候选基因挖掘;赵圆圆;《中国油料作物学报》;第42卷(第1期);51-60 *

Also Published As

Publication number Publication date
CN113113086A (en) 2021-07-13

Similar Documents

Publication Publication Date Title
Cai et al. Widespread ancient whole‐genome duplications in Malpighiales coincide with Eocene global climatic upheaval
Zhang et al. Phylogenomics from low‐coverage whole‐genome sequencing
Adams et al. Automating image matching, cataloging, and analysis for photo-identification research
CN109739997A (en) Address control methods, apparatus and system
Okello et al. Effective population size dynamics reveal impacts of historic climatic events and recent anthropogenic pressure in African elephants
Elston et al. Query: Estimating" Heritability" of a dichotomous trait
CN108985327A (en) A kind of Terrain Matching self-organizing Optimum Classification method based on factorial analysis
Živković et al. Germ banks affect the inference of past demographic events
CN110147418A (en) It is a kind of to judge whether address standardizes, the method and system of Address Standardization
CN109829804A (en) A kind of tax risk recognition methods towards marker samples missing administrative region
CN113113086B (en) Wheat scab resistance dsMQTL analysis method
CN114580368A (en) Intelligent report collection and load reduction method, device, terminal and storage medium
CN112199376B (en) Standard knowledge base management method and system based on cluster analysis
Raimondeau et al. Lateral gene transfer generates accessory genes that accumulate at different rates within a grass lineage
Guarino et al. Analysis of georeferenced data and the conservation and use of plant genetic resources
CN114003812A (en) Address matching method, system, device and storage medium
CN109492680A (en) Semi-supervised learning method based on adaptive figure
CN108846260B (en) Genetic map construction method and device for genetic segregation population
Moore‐Pollard et al. Compositae‐ParaLoss‐1272: A complementary sunflower‐specific probe set reduces paralogs in phylogenomic analyses of complex systems
CN117746997B (en) Cis-regulation die body identification method based on multi-mode priori information
CN115074459A (en) Method for developing specific fingerprint spectrum of tobacco hybrid
CN112182134B (en) Construction method and device of space-time database of service system
CN115238023B (en) Method for performing data association based on multi-source data
CN115577065B (en) Address resolution method and device
WO2024016389A1 (en) Ubiquitination site identification method, apparatus and system, and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant