CN106755314B - Development method of wheat microsatellite marker locus and length detection method of microsatellite marker in microsatellite marker locus - Google Patents

Development method of wheat microsatellite marker locus and length detection method of microsatellite marker in microsatellite marker locus Download PDF

Info

Publication number
CN106755314B
CN106755314B CN201611032452.3A CN201611032452A CN106755314B CN 106755314 B CN106755314 B CN 106755314B CN 201611032452 A CN201611032452 A CN 201611032452A CN 106755314 B CN106755314 B CN 106755314B
Authority
CN
China
Prior art keywords
microsatellite marker
throughput sequencing
microsatellite
sequences
fragments
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201611032452.3A
Other languages
Chinese (zh)
Other versions
CN106755314A (en
Inventor
方治伟
周俊飞
李论
彭海
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jianghan University
Original Assignee
Jianghan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jianghan University filed Critical Jianghan University
Priority to CN201611032452.3A priority Critical patent/CN106755314B/en
Publication of CN106755314A publication Critical patent/CN106755314A/en
Application granted granted Critical
Publication of CN106755314B publication Critical patent/CN106755314B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Immunology (AREA)
  • Biotechnology (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Analytical Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention discloses a development method of a wheat microsatellite marker locus and a length detection method of a microsatellite marker in the microsatellite marker locus. The development method comprises the following steps: obtaining a mixed sample; extracting the genome of the mixed sample; fragmenting a genome to obtain a genome fragment; respectively hybridizing the probe sets with the genome fragments; purifying the successfully hybridized genome fragments in the plurality of hybridization solutions; mixing a plurality of the purified hybrid genomic fragments, and detecting the purified genomic fragments by using high-throughput sequencing; obtaining an effective said high throughput sequencing fragment; classifying the valid high throughput sequencing fragments. The detection method comprises the following steps: selecting a microsatellite marker locus to be detected; and amplifying the microsatellite marker in the microsatellite marker locus to be detected by using a multiplex amplification primer to obtain the length of the microsatellite marker in the microsatellite marker locus. The method is simple, rapid, comprehensive and accurate.

Description

Development method of wheat microsatellite marker locus and length detection method of microsatellite marker in microsatellite marker locus
Technical Field
The invention relates to the technical field of biology, in particular to a development method of a wheat microsatellite marker locus and a method for detecting the length of a microsatellite marker in the microsatellite marker locus.
Background
The microsatellite marker is also called Short Tandem Repeat (STR) or simple repeat (SSR) and is composed of more than 2 nucleotides which are tandem repeats of a repeat unit. The microsatellite marker loci refer to loci containing microsatellite markers on a genome, the microsatellite marker loci are abundant and uniformly distributed on the genome, and the development of the microsatellite marker loci refers to a process for searching the microsatellite marker loci on the genome. In different samples, the repetition times of the repeat units of the microsatellite markers in the same microsatellite marker locus may be different, and length variation exists among samples, so that the polymorphism of the microsatellite marker locus mainly refers to the length polymorphism of different microsatellite markers of the same microsatellite marker locus. Microsatellite marker detection techniques refer to techniques that detect the length of a microsatellite marker in a microsatellite marker locus. The length polymorphism of the microsatellite markers of different samples can be used for identifying the identity of the samples, so the application of the microsatellite marker technology is very wide, and the microsatellite marker technology comprises biodiversity identification, animal and plant variety fingerprint identity card identification and the like.
The traditional development and detection of wheat microsatellite marker loci comprise the following steps: extracting a genome, fragmenting the genome, connecting joints, amplifying, hybridizing with a simple repetitive sequence, purifying a hybridization product, cloning the hybridization product, converting a cloning product escherichia coli, picking single clones, performing first-generation sequencing on a target site of each single clone, analyzing a sequencing result to obtain a microsatellite marker site, detecting polymorphism of the microsatellite marker site in a plurality of wheat samples, developing the microsatellite marker site with high polymorphism, amplifying one by one and detecting the microsatellite marker in each microsatellite marker site to be detected in each sample to be detected by electrophoresis.
In the process of implementing the invention, the inventor finds that the prior art has at least the following problems:
the development and detection process of the wheat microsatellite marker locus is complex, low in flux and extremely time-consuming and labor-consuming; secondly, the electrophoretic detection of the microsatellite marker locus has low resolution, the detection result is inaccurate, and the accurate result needs to be corrected by a reference sample and the like. Problems derived from this include: the number of the developed microsatellite marker loci is less, usually less than 200, and accounts for about 1% of all the microsatellite marker loci on the genome; the wheat samples for detecting the polymorphism of the microsatellite marker locus are few, usually about tens of wheat samples, so that the polymorphism detection result is inaccurate; the conservation of flanking sequences of the microsatellite marker loci is unknown, so that the universality of primers for amplifying the microsatellite marker loci is influenced; the number of detected microsatellite marker sites is limited, and dozens of microsatellite marker sites are generally detected in a sample to be detected, so that the DNA identity card information of the established sample is incomplete and inaccurate.
Disclosure of Invention
In order to solve the problems in the prior art, the embodiment of the invention provides a development method of a wheat microsatellite marker locus and a method for detecting the length of a microsatellite marker in the microsatellite marker locus. The technical scheme is as follows:
in one aspect, the embodiment of the invention provides a method for developing a wheat microsatellite marker locus, which comprises the following steps:
mixing n wheat samples with polymorphism in equal mass to obtain a mixed sample, wherein n is more than 1;
extracting the genome of the mixed sample;
fragmenting the genome of the mixed sample to obtain a genome fragment;
using a plurality of probes with simple repetitive sequences as probe sets, hybridizing the genomic fragments with each probe in the probe sets respectively to obtain a plurality of hybridization solutions, and purifying the genomic fragments successfully hybridized in the hybridization solutions respectively to obtain a plurality of purified hybrid genomic fragments;
after a plurality of purified hybrid genome segments are mixed in equal mass, detecting the mixed purified hybrid genome segments by using high-throughput sequencing to obtain a first high-throughput sequencing segment;
screening said first high-throughput sequencing fragment for an effective high-throughput sequencing fragment comprising a microsatellite marker within a microsatellite marker locus;
classifying the effective high-throughput sequencing fragments according to homology of sequences on two sides of a microsatellite marker in the effective high-throughput sequencing fragments, wherein the effective high-throughput sequencing fragments of the same class are the effective high-throughput sequencing fragments of the same microsatellite marker locus, if the number of the effective high-throughput sequencing fragments of the same microsatellite marker locus is more than or equal to α 1, one microsatellite marker locus is successfully developed, wherein α 1 is a first judgment threshold and α 1 is more than or equal to (the high-throughput sequencing depth is multiplied by the proportion of the effective high-throughput sequencing fragments/the number of the microsatellite marker loci capable of being detected on a genome) multiplied by probability.
In general, to facilitate purification of the successfully hybridized genomic fragment from the hybridization solution, the probe may be functionally labeled, e.g.
Hybridizing a biotin-labeled probe with a simple repetitive sequence with the genome segment to obtain a hybridization solution;
and purifying the successfully hybridized genome fragment in the hybridization solution by using streptavidin magnetic beads to obtain a purified genome fragment.
In the above step, because the probe has a biotin label, the successfully hybridized genome segment is also labeled with biotin, so that the successfully hybridized genome segment can be purified from the hybridization solution by using streptavidin magnetic beads. The technology of using biotin labeling and streptavidin magnetic bead purification is a well-known technology.
Specifically, α 1 is 20 or more.
Specifically, the microsatellite marker refers to a sequence formed by tandem repeat of a repeating unit consisting of more than or equal to 2 bases.
Specifically, the number of bases of the sequences on both sides of the microsatellite marker in the effective high-throughput sequencing fragment is more than or equal to 1, and the number of bases of the sequences on at least one side of the microsatellite marker in the effective high-throughput sequencing fragment is more than or equal to 10.
Specifically, the method for selecting the n samples having polymorphisms includes: selecting wheat samples with different external forms, wheat samples with different biological classifications, wheat samples marked with different labels or wheat samples of wild resources in different ecological areas.
Specifically, the number of the probes is 12, the repeating unit in the simple repeating sequence of each probe is CT, GA, TG, AC, TA, TGT, CCA, ATC, CCT, AGA, ATG or CAA, the repeating number of the simple repeating sequence of each probe is 6-20, preferably 6-15, for example, the repeating number is 8 or 12.
Specifically, the sequence of the probe is shown as SEQ IN NO 1-SEQ IN NO 12 IN the sequence table.
In another aspect, an embodiment of the present invention provides a method for detecting a length of a microsatellite marker in a microsatellite marker locus successfully developed by the above development method, where the method includes:
selecting a microsatellite marker locus to be detected from the successfully developed microsatellite marker loci;
amplifying the microsatellite marker in the microsatellite marker locus to be detected by using a multiplex amplification primer to obtain an amplification product, carrying out high-throughput sequencing on the amplification product to obtain a second high-throughput sequencing fragment, and analyzing the second high-throughput sequencing fragment to obtain the length of the microsatellite marker in the microsatellite marker locus.
Specifically, the method for selecting the microsatellite marker loci to be detected from the successfully developed microsatellite marker loci comprises the following steps:
selecting the microsatellite marker locus with the standard of the microsatellite marker locus to be detected as the maximum H value, wherein the H value is the polymorphism index of the microsatellite marker locus,
Figure BDA0001153853460000041
wherein i is the ith category when classified according to the length of the microsatellite marker in the effective high-throughput sequencing fragment of the microsatellite marker locus, i is a natural number; ai is the ratio of the number of valid high throughput sequencing fragments of the ith class to the total number of valid high throughput sequencing fragments.
Specifically, the method for preparing the multiplex amplification primer comprises the following steps:
extracting the microsatellite marker from all the effective high-throughput sequencing fragments of the selected microsatellite marker locus to be detected and selecting the longest microsatellite marker as the microsatellite marker of the template sequence of the multiplex amplification primer;
extracting left sequences of the microsatellite markers from all the effective high-throughput sequencing fragments of the selected microsatellite marker loci to be detected, selecting all sequences with the length being more than α 2 bases, selecting the sequences with the highest frequency from all the selected sequences, taking the sequences with the highest frequency as reference sequences, comparing the reference sequences with the left sequences of all the microsatellite markers, and obtaining the coverage multiple and the variation frequency of each base in the sequences with the highest frequency, wherein in the sequences with the highest frequency, the bases with the coverage multiple being less than or equal to 1/α 3 or the variation frequency being more than or equal to α 3 are changed into N and then taken as the left sequences of the template sequences of the multiple amplification primers, wherein N is any one or more than four bases of A, T, C and G, α 2 is a second judgment threshold, α 2 is (the average length of the first high-throughput sequencing fragment-the length of the microsatellite marker loci) 2; α 3 is a third judgment threshold, α 3 is not less than or equal to 365 × the first high-throughput sequencing fragment (the accuracy of the first high-throughput sequencing fragment is obtained by taking the sequence of the multiple amplification primers as the template sequences of the multiple amplification primers);
obtaining a right sequence of the template sequence of the multiplex amplification primer sequence according to a method identical to the left sequence of the template sequence of the multiplex amplification primer;
and sequentially connecting the left sequences of the template sequences of the multiple amplification primers, the microsatellite markers of the template sequences of the multiple amplification primers and the right sequences of the template sequences of the multiple amplification primers to obtain the template sequences of the multiple amplification primers of the microsatellite marker loci, and obtaining the multiple amplification primers by utilizing the template sequences of the multiple amplification primers of the microsatellite marker loci.
Specifically, the method for obtaining the length of the microsatellite marker in the microsatellite marker locus comprises the following steps: obtaining a left border sequence of the second high-throughput sequencing fragment and a right border sequence of the second high-throughput sequencing fragment after removing the microsatellite marker in the second high-throughput sequencing fragment; aligning each of the second high-throughput sequencing fragments to the microsatellite marker locus to be detected by using the left border sequence and the right border sequence; intercepting the microsatellite marker in the second high-throughput sequencing fragment of each microsatellite marker locus to be detected; classifying the obtained microsatellite markers according to length, and calculating the truth degree R of the ith classi=Ni/NmaxWherein i is the ith class, N, when classified by the length of the microsatellite marker in said effective high throughput sequencing fragment of said microsatellite marker locusiIs the second of the ith classHigh throughput sequencing of the number of fragments, Nmax(ii) the maximum of the number of the second high-throughput sequencing fragments for all classes; if the degree of truth Riα 4, the length of the microsatellite marker of the ith class is the length of the microsatellite marker in the microsatellite marker locus, if the true degree R isi<α 4, the length of the i-th class of microsatellite markers is not the length of the microsatellite markers within the microsatellite marker locus, wherein α 4 is the fourth decision threshold and α 4 is 0.15.
Specifically, the method for fragmenting the genome of the mixed sample is mechanical disruption or enzyme digestion.
The technical scheme provided by the embodiment of the invention has the following beneficial effects: the development and detection technology of the wheat microsatellite marker locus provided by the invention is simple, rapid, high-flux, comprehensive and accurate. The time consumption is shortened from 1 to 2 years to 1 to 2 days; the quantity of the developed microsatellite marker loci is improved to be close to 100 percent from about 1 percent of all the microsatellite marker loci in the genome; the number of the wheat samples for testing the polymorphism of the microsatellite marker locus is increased from dozens to no limit, and the accuracy of testing the polymorphism result is greatly improved; the conservativeness of the flanking sequence of the microsatellite marker locus can be obtained, and the universality of a primer for amplifying the microsatellite marker locus is ensured; the multiple microsatellite marker loci are used as one locus for detection, one-by-one detection is not performed, multiple wheat samples to be detected are only detected once, multiple detections are not performed, the workload of microsatellite marker locus detection is greatly reduced, and therefore the number of the detected microsatellite marker loci is almost not limited. The detection result of the microsatellite marker locus is a base, and the accuracy is close to 100%; the detection resolution of the microsatellite marker locus is improved to the highest fraction: a single base; the detection result does not need to be corrected by referring to varieties.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention will be described in further detail below.
The procedures or specifications of the procedures not shown or described in detail in the examples of the present invention are well known to those skilled in the art of molecular biology. Reagents or biological materials not mentioned in the examples of the present invention are commonly available reagents or biological materials, which are well known to those skilled in ordinary molecular biology and commercially available.
Examples
The development method of the wheat microsatellite marker locus comprises the following steps:
and mixing n wheat samples with polymorphism in equal amount to obtain a mixed sample, wherein n is more than 1.
Samples with polymorphisms include: the method comprises the following steps of (1) selecting wheat samples with different external forms (form polymorphism), wheat samples with different biological classifications (such as different varieties or varieties), wheat samples with different markers (such as protein markers) or wild resource wheat samples in different ecological regions, wherein the more wheat samples are selected (the larger the n value is), the more abundant the polymorphism is, and the wider the applicability of the developed microsatellite marker locus is. In this example, species of microsatellite marker loci to be developed are medium wheat 895, Zhouman 26, Bainong 207, Luyuan 502, Zhouman 32, Jimai 22, Shannong 20, Yumai 158, Zhengmai 7698, Yunong 416, Guomai 301, West nong 979, Tianmin 198, Liangxing 99, Pumai 26, Shannong 28, Xinmai 208, Bainong 418, Bingmai 8, Zhengmai 7698, Zhengmai 379, Zhongmai 998, Zangmai 9, Shannong 22, Jindi 828, Liang 66, Xinmai 296, Shixin 633, Bainong AK58, Zhouman 27, Xike 1, and Jiamai 266, and 32 varieties, which are widely used in China and publicly known and obtained on the market. Wherein, the microsatellite marker refers to a sequence formed by the tandem repeat of a repeating unit consisting of more than or equal to 2 bases.
The equal-mass leaves of the 32 wheat varieties are taken and mixed, the genome of the mixed sample is extracted, and the extraction method is carried out according to an operation manual of a novel plant genome extraction kit with the product number DP320 of Tiangen Biochemical technology (Beijing) Co. The wheat sample selected in this example is a leaf, and as a general knowledge, the wheat sample may be taken from a seed or the like.
And fragmenting the genome of the mixed sample to obtain a genome fragment. Specifically, the method for fragmenting the genome of the mixed sample includes: mechanical disruption or enzymatic cleavage. The length of genome fragmentation is controlled within the range of fragment lengths detectable upon high throughput sequencing. In this embodiment, the high throughput sequencing employs a PI chip of a PROTON high throughput sequencer, and the detection length is about 200bp, so the peak value of the length of the obtained genome fragment is also controlled to be about 200bp as much as possible. In this example, an automatic acoustic focusing crusher Covaris S220 (manufactured by Covaris, usa, model number S220) was used to crush the genome of the mixed sample, the crushing method was performed according to the method for obtaining 200bp (peak) target fragment described in the instruction manual of the apparatus, "DNA cutting with S220/E220 Focused-ultrasonic" (version number: 010308Rev G), the genome fragment of the mixed sample was obtained after crushing, and after detecting the genome fragment according to the procedure of its double-stranded DNA using a Q5000 spectrophotometer manufactured by Quawell, usa, the concentration was diluted or concentrated to 100ng/μ L, and the genome fragment was obtained.
A plurality of biotin-labeled probes having a simple repetitive sequence were used as probe sets, and the probe sets were hybridized with the genome fragments to obtain a hybridization solution. The number of bases of the repeating unit in the probe having a simple repeating sequence is 2 or more. Specifically, the repeat unit in the simple repeat sequence of the probe is CT, GA, TG, AC, TA, TGT, CCA, ATC, CCT, AGA, ATG or CAA, and these 12 probes can hybridize to all possible microsatellite markers having repeat units of 2 bases and 3 bases, and thus can be used for the fishing of microsatellite markers in genomic fragments in all species. In the previous experiment, the efficiency of hooking the microsatellite marker by different probe lengths is detected, and the efficiency is higher when the repetition frequency of the simple repeated sequence of the probe is 6-20, and the preferred repetition frequency is 6-15, such as 8 or 12. IN this example, the probe set includes 12 probes, and the sequences of the 12 probes are shown as SEQ ID NO. 1-SEQ ID NO. 12 IN the sequence listing, respectively. The probes are synthesized by Beijing Optimalaceae New Biotechnology Limited and labeled with 5' end biotin. Previous experiments show that the efficiency of respectively fishing the microsatellite markers from the genome fragments by using different probes is better than that of fishing the microsatellite markers from the genome fragments by mixing all the probes, so that the microsatellite markers from the genome fragments are respectively fished by using different probes in the embodiment, specifically, each probe is respectively dissolved into a solution with equimolar concentration (10 pM/mu L) by using enzyme-free water, and 1 mu L of each 12 probes in the probe set are respectively uniformly mixed with the genome fragment of a 5 mu g mixed sample and then hybridized to respectively obtain 12 hybridization solutions. The procedure for hybridization was: 95 ℃ for 10 minutes, 65 ℃ for 10 minutes and 37 ℃ for 10 minutes.
And purifying the successfully hybridized genome fragment in the hybridization solution by using streptavidin magnetic beads to obtain the purified genome fragment. Specifically, the purification process of using streptavidin magnetic beads to purify 12 kinds of hybridization solutions respectively is as follows: the obtained 1 of the 12 hybridization solutions was placed on a magnetic frame (manufactured by Invitrogen, usa) until the hybridization solution was clarified, the solution was aspirated, the magnetic beads were washed with enzyme-free water 2 times, 10 μ L of enzyme-free water was mixed with streptavidin magnetic beads, heated in a PCR instrument at 95 ℃ for 5 minutes, and rapidly placed on the magnetic frame, and the obtained solution was the purified hybrid genome fragment of the first probe. All 12 purified hybridizing genome fragments are sequentially obtained in the same manner as the purified hybridizing genome fragment of the first probe obtained, and mixed together, i.e., finally, purified hybridizing genome fragments of all probes are obtained. In order to successfully purify the hybridized genome segment, in this embodiment, a biotin-labeled probe with a simple repeat sequence is used in combination with streptavidin magnetic beads, and in other embodiments, the hybridization and purification of the genome segment can be performed in other manners.
And detecting the purified hybrid genome fragment by using second-generation high-throughput sequencing to obtain a first high-throughput sequencing fragment. The method comprises the steps of constructing a second-generation high-throughput sequencing library by using a DNA library preparation Kit (manufactured by NEB company, UK, and having a product code of E6270L) and according to an operation manual of the Kit, amplifying ePCR (emulsion PCR) before sequencing by using the obtained second-generation high-throughput sequencing library and a Kit Ion PI Template OT2200Kit v2 (manufactured by Invirripen company, USA, and having a product code of 4485146), and obtaining an ePCR amplification product by using the operation method according to the operation manual of the Kit. High-throughput Sequencing was performed on a Proton second-generation high-throughput sequencer using an ePCR amplification product and a Kit Ion PI Sequencing 200Kit v2 (manufactured by Invirrigen, USA, Cat. No. 4485149), and the operation was performed according to the manual of the Kit. In this example, the high-throughput sequencing amount is set to 10M sequencing fragments (1M ═ 100 ten thousand), the sequencing length is set to 500 cycles, and after the sequencing is finished, the first high-throughput sequencing fragment is obtained.
From the first high-throughput sequencing fragments, effective high-throughput sequencing fragments are screened. The effective high-throughput sequencing fragment comprises microsatellite markers in microsatellite marker sites, the number of bases of sequences on two sides of the microsatellite markers in the effective high-throughput sequencing fragment is more than or equal to 1, and the number of bases of sequences on at least one side of the microsatellite markers in the effective high-throughput sequencing fragment is more than or equal to 10. Analyzing whether each of the first high-throughput sequencing fragments contains a microsatellite marker, and removing the first high-throughput sequencing fragments which do not contain the microsatellite marker. In the reserved first high-throughput sequencing segment, whether the number of bases of sequences on both sides of the microsatellite marker is more than or equal to 1 is analyzed, if so, the microsatellite marker is complete in the first high-throughput sequencing segment, which is necessary because the polymorphism of the microsatellite marker refers to the length polymorphism of the microsatellite marker, and the length polymorphism of the microsatellite marker can be correctly obtained only by ensuring the completeness of the microsatellite marker, so that the subsequent analysis can be correctly carried out. The first high-throughput sequencing fragment with both side sequences of the microsatellite marker being less than 10 bases cannot accurately perform subsequent homology analysis, and errors are introduced due to the excessively short sequence, so that the first high-throughput sequencing fragment with both side sequences of the microsatellite marker being less than 10 bases is further removed. Through the above processes, the first high-pass sequencing fragment which is finally reserved is the effective high-pass sequencing fragment.
When each of the first high-throughput sequencing fragments is analyzed to determine whether it contains a microsatellite marker, analysis software commonly used in the prior art can be used for performing a numerator analysis, or each of the first high-throughput sequencing fragments can be simply and manually determined.
The effective high-throughput sequencing fragments are classified according to the homology of sequences on both sides of a microsatellite marker in an effective high-throughput sequencing fragment, the effective high-throughput sequencing fragments of the same class are effective high-throughput sequencing fragments of the same microsatellite marker site, if the number of effective high-throughput sequencing fragments of the same microsatellite marker site is equal to or greater than 5631, a microsatellite marker site is successfully developed, wherein α is a first determination threshold and α 1 (the high-throughput sequencing depth is equal to the number of effective microsatellite marker sites/number of detectable microsatellite marker sites on the genome) is guaranteed, the specific value of α is adjusted according to the depth of high-throughput sequencing, the microsatellite markers in the effective high-throughput sequencing fragments are removed, the remaining two side sequences are combined into a complete sequence, the pairwise alignment analysis is performed between the combined complete sequences by using the Megablalast software (version 2.2.26), the alignment analysis parameters of the alignment between the combined high-throughput markers are set as 1e-5, the parameters-p is set as 0, the parameters of homology-5000-ten thousand, the homology of the high-throughput marker markers in the high-throughput sequencing fragments, the high-throughput marker fragments are set as the high-throughput marker sites, the high-throughput marker sequences of the high-throughput marker sites, the high-throughput sequencing fragments, the high-throughput marker sites, the high-throughput marker fragments of the high-throughput marker fragments are found in the high-throughput sequencing fragments, the high-throughput marker fragments of the high-throughput sequencing fragments, the high-throughput marker fragments of the high-sequence of the high-throughput marker fragments, the high-throughput marker fragments are found high-sequence of the high-throughput marker fragments, the high-sequence of the high-throughput marker fragments are found high-throughput sequencing fragments, the high-sequence of the high-throughput marker fragments of the high-sequence of the high-throughput marker fragments, the high-throughput marker fragments are found high-sequence of the high-marker fragments, the high-sequence of the high-throughput marker fragments, the high-sequence of the high-throughput marker fragments are found high-sequence of the high-throughput marker fragments, the high-sequence of the high-throughput marker fragments, the high-sequence of the high-throughput marker fragments are found high-throughput marker fragments, the high-throughput marker fragments are found high-sequence, the high-throughput marker fragments, the high-sequence of the high-satellite marker fragments, the high-sequence of the high-satellite marker fragments, the high-sequence of the high-marker fragments, the high-.
Selecting the microsatellite marker locus with the maximum H value as the standard of the microsatellite marker locus to be detected, wherein the H value is the polymorphism index of the microsatellite marker locus,
Figure BDA0001153853460000101
wherein i is a microsatellite marker in an efficient high throughput sequencing fragment at a microsatellite marker siteWhen the length of (a) is classified, the ith class, i is a natural number; ai is the ratio of the number of valid high throughput sequencing fragments of the ith class to the total number of valid high throughput sequencing fragments. Putative microsatellite marker sites as in table 1 were classified by the length of the microsatellite marker in the efficient high throughput sequencing fragment for a total of 3: (TG)20, (TG)21 and (TG)22, so S ═ 3; the total number of effective high throughput sequencing fragments for this microsatellite marker locus was 40, with the number of 1 st microsatellite marker (TG)20 being 3, so a 1-3/40-80%, a 2-32/40-80%, and a 3-5/40-12.50% were also calculated. And substituting the above values into a calculation formula of H to obtain the H value of the microsatellite marker locus to be 0.98.
Calculating the H values of all the successfully developed microsatellite marker sites in this embodiment according to the same calculation method as the assumed microsatellite marker sites in table 1, wherein the H values of all the obtained microsatellite marker sites are arranged from large to small, and the microsatellite marker sites with the top 50 bits in the sequence are selected as the microsatellite marker sites to be detected in the sample to be detected in this embodiment. The parameter 50 is determined according to actual needs, for example, 1 microsatellite marker locus is needed when the purity of wheat is identified, about 50 microsatellite marker loci are generally selected when the wheat fingerprint is constructed, and about 300 microsatellite marker loci are required to be selected to meet the requirements when the substantive derivation relation between varieties is analyzed. The microsatellite marker loci with the largest H value are selected because the microsatellite marker loci have the strongest distinguishing capability, more samples can be distinguished and information can be provided as much as possible by using the fewest microsatellite marker loci, and the distinguishing of the samples is the most core task of the microsatellite marker technology.
Microsatellite markers in all valid high throughput sequencing fragments of the microsatellite marker locus were extracted, as set of 3 (TG)20, 32 (TG)21 and 5 (TG)22 microsatellite markers within the putative microsatellite marker locus listed in Table 1. The sequences to the left of the microsatellite markers in all available high throughput sequencing fragments from which the microsatellite marker locus was extracted constitute the sequences to the left of the microsatellite marker locus, as the sequences to the left of the putative microsatellite marker loci of Table 1 are a collection of 3 (A)2G (A)2, 5 (A)87G (A)3, 27 (A)86G (A)3 and 5 (A)81G (A) 4. In the same manner, the sequences on the right side of the microsatellite marker sites are obtained, and the sequences on the right side of the putative microsatellite markers shown in Table 1 are a set of 3 (A)4G (A)80, 5 (A)3G (A)2, 27 (A)3G (A)81 and 5 2G (A) 85.
The method for detecting the length of the microsatellite marker in the wheat microsatellite marker locus comprises the following steps:
selecting a microsatellite marker locus to be detected from the successfully developed microsatellite marker loci, and designing a multiplex amplification primer for amplifying the microsatellite marker locus to be detected. The selection of microsatellite marker sites and the design of multiplex amplification primers are described below using the putative microsatellite marker sites in Table 1 as an example.
A method of designing multiplex amplification primers for amplifying selected microsatellite marker sites comprises: extracting microsatellite markers from all effective high-throughput sequencing fragments of the selected microsatellite marker loci, and selecting the longest microsatellite marker from the microsatellite markers as a microsatellite marker designed by a multiplex amplification primer; among the putative microsatellite marker sites in Table 1, (TG)22 is the longest microsatellite marker, and thus, (TG)22 is the microsatellite marker of the template sequence designed for the multiplex amplification primers for that microsatellite marker site. The longest microsatellite marker is selected to ensure that the length of the microsatellite locus amplified by the designed multiplex amplification primers does not exceed the amplification capacity of multiplex PCR, thereby reducing data loss in microsatellite detection.
The present invention provides a method for identifying a mutation in a microsatellite marker, wherein a left-side sequence of the microsatellite marker is extracted from all effective high-throughput sequencing fragments of the selected microsatellite marker site, wherein all sequences having a length of α bases are selected from the sequences having a length of the first high-throughput sequencing fragment, wherein the average length of the first high-throughput sequencing fragment is 200bp, the length of the microsatellite marker site of the multiplex amplification primer is 44(TG repeats 22 times, wherein the length is 44%) in the multiplex amplification primer design, wherein the average length of the first high-throughput sequencing fragment is 200bp, the length of the microsatellite marker is 44% in the multiplex amplification primer design, wherein the multiplex amplification primer is 5 (a) in the multiplex amplification primer design is 78, wherein all sequences having a length of greater than α are 5 (a) or 5 (a 87G) (a)3, 27G (a)3G (a) and 5G 81G) 4 in the multiplex primer set of the multiplex primer selected microsatellite marker left-side sequence having a length of greater than 7G 27, wherein the highest frequency of the multiplex amplification is 3, wherein the multiplex primer is selected from the multiplex primer set of the multiplex amplification primer set, wherein the multiplex amplification primer set of the multiplex primers having a 27, wherein the multiplex amplification primer set, the multiplex amplification frequency of the multiplex amplification primer set, the multiplex amplification sequence, the set, the multiplex amplification sequence, the set is no more than the set, the multiplex amplification sequence, the set, the multiplex amplification sequence, the set is No. the multiplex amplification sequence, the set is No. the set, the multiplex amplification sequence, the set is No. the set, the set is No. 3, the set is No. 3, the set is No. 3, the set is No. no more than the set, No. 3, the set is No. 5, No. 3, No. no more than the set, No. no.
Obtaining the right sequence of the template sequence of the multiplex amplification primer by the method which is completely identical with the left sequence of the template sequence of the multiplex amplification primer. In the putative microsatellite marker loci of Table 1, the right sequence of the template sequence of the multiplex amplification primers is (A)2NNN (A) 80. Sequentially connecting the left sequences of the template sequences of the multiple amplification primers, the microsatellite markers of the template sequences of the multiple amplification primers and the right sequences of the template sequences of the multiple amplification primers to obtain the template sequences of the multiple amplification primers of the microsatellite marker loci, and obtaining the multiple amplification primers by utilizing the template sequences of the multiple amplification primers of the microsatellite marker loci. The template sequence of the multiplex amplification primers for the putative microsatellite marker loci in Table 1 is (A)85NNN (A)2(TG)22(A)2NNN (A) 80.
The template sequences of the multiplex amplification primers for the 50 microsatellite marker loci finally selected in this example were obtained according to the same method and parameters as described above.
TABLE 1 first high throughput sequencing fragment of a putative microsatellite marker site
Figure BDA0001153853460000121
Figure BDA0001153853460000131
In the first high throughput sequencing fragment type shown in table 1, the underlined parts represent microsatellite markers, the letters in parentheses represent repeat units of the microsatellite markers, and the numbers after the parentheses represent the number of repetitions of the repeat units.
And (3) designing a multiplex amplification primer for amplifying the selected microsatellite marker locus by using the template sequence of the multiplex amplification primer for all the microsatellite marker loci. The specific method comprises the following steps: the obtained template sequences of the multiple amplification primers of 50 microsatellite marker loci are connected by 100N to construct an artificial reference genome. Logging in multiple PCR primers to design a webpage https:// ampliseq.com/, and selecting DNA Hotspot designs (single-pole) at the option of "Application type". And uploading to construct an artificial reference genome after selecting "Custom" from the option of "Select the genome you with to use". The "DNA Type" option selects "Standard DNA". In the "Add Hotspot" option, the start and end positions of each microsatellite marker in the constructed artificial reference genome are filled in, and finally the "Submit targets" button is clicked to Submit and obtain the sequences of the multiplex amplification primers. In this embodiment, 49 microsatellite marker loci in which the multiplex amplification primer is successfully designed are selected from the 50 microsatellite marker loci, and the 49 microsatellite marker loci are the microsatellite marker loci to be detected. This example employs multiplex PCR technology provided by Saimer Feishale, USA, which can amplify 12000 test regions simultaneously, so the present invention has the ability to detect 12000 microsatellite marker sites at a time, which is 12000 times higher than the detection ability of the traditional microsatellite marker sites.
And amplifying the microsatellite marker in the microsatellite marker locus to be detected by using the multiplex amplification primers to obtain an amplification product, and performing high-throughput sequencing on the amplification product to obtain a second high-throughput sequencing fragment. In this embodiment, the sample to be detected is 100 wheat leaves taken from the wuhan wheat field, 100 wheat leaves are mixed in equal amount to obtain a mixed sample, and the genomic DNA of the mixed sample is extracted by using a plant genomic DNA extraction kit (cat # DP305, manufactured by tiangen biotechnology and technology (beijing) ltd) according to the method provided by the operation manual thereof. The genomic DNA of the mixed sample was amplified using 49 pairs of multiplex amplification primers and library construction Kit 2.0 (manufactured by Life technology, USA, Cat. No. 4475345) designed and according to the Kit's operating manual to construct a high-throughput sequencing library, and ePCR (Emulsion PCR) amplification was performed before sequencing using the obtained high-throughput sequencing library and Kit Ion PI Template OT2200Kit v2 (manufactured by Invirrtigen, Cat. No. 4485146), and the operating method was performed according to the Kit's operating manual to obtain ePCR products. High-throughput Sequencing was performed on a Proton second generation high-throughput sequencer using the ePCR product and a Kit Ion PI Sequencing 200Kit v2 (manufactured by Invirriggen, USA, Cat. No. 4485149), and the procedure was performed according to the manual of the Kit. In this example, the high-throughput sequencing amount is set to 1M sequencing fragment (1M ═ 100 ten thousand), the high-throughput sequencing length is set to 500 cycles, and after the sequencing is finished, the second high-throughput sequencing product is obtained.
The length of the microsatellite marker within the microsatellite marker locus is obtained by analyzing the second high throughput sequencing product. The specific method comprises the following steps: removing the microsatellite markers in the second high-throughput sequencing fragment to obtain a left border sequence of the second high-throughput sequencing fragment and a right border sequence of the second high-throughput sequencing fragment; comparing each of the second high-throughput sequencing fragments to a microsatellite marker locus to be detected by using the left border sequence and the right border sequence; intercepting the microsatellite marker in the second high-throughput sequencing fragment of each microsatellite marker locus to be detected; classifying the obtained microsatellite markers according to length, and calculating the truth degree R of the ith classi=Ni/NmaxWherein N isiNumber of second high throughput sequencing fragments for ith class, NmaxMaximum of the number of second high-throughput sequencing fragments for all classes; if degree of truth Riα 4, the length of the microsatellite marker in the ith category is the length of the microsatellite marker in the microsatellite marker locus, if the truth R is greater than the truth Ri<α 4, the length of the microsatellite marker in the ith category is not the length of the microsatellite marker in the microsatellite marker locus, wherein α 4 isAnd a fourth decision threshold. The polymorphism of the microsatellite marker within the microsatellite marker locus is length polymorphism caused by inconsistency of the number of repetitions of a simple repeat sequence in the microsatellite marker, and therefore, the detection of the microsatellite marker locus mainly refers to the detection of the length of the microsatellite marker within the microsatellite marker locus. Typically, the species is diploid, and if the sample is homozygous, the same microsatellite marker locus will contain only one microsatellite marker allele, and if the sample is heterozygous, the same microsatellite marker locus will have 2 different microsatellite marker alleles. If the sample is polyploid, such as wheat and cotton, the decision criteria should be adjusted accordingly. The microsatellite marker locus may generate sliding when performing multiplex amplification, so that in the second high-throughput sequencing segment, the length of the microsatellite marker generated by sliding is different from the length of the real microsatellite marker in the mixed sample, thereby generating interference noise with a degree of truth RiCan reflect the strength of interference noise, RiFor example, if a microsatellite marker locus is known to have a percentage of interfering microsatellite markers generated by more than 95 slides in 100 detections, then we can determine the value of α to be 390.3, then we have a 95% confidence that we have no guarantee that the microsatellite marker of the i-th classification is a true value, and if we have a lower probability of false determination that a microsatellite marker of the i-th classification is a true value, then we have a smaller probability of false determination that a microsatellite marker of the i-th classification is a true value, but if we have a smaller probability of false determination that a true value is a false value, then we have a smaller probability of false determination that a true value of 464, then we have a smaller probability of false determination that a true value of a microsatellite marker of 464, then we have a true value of 0.6/X is a value of α, if it is a heterozygous determination that 0.6/X is a value for α, where X is a ploidy level of interference of a detected microsatellite marker of more than 95In the present embodiment, because the value of α 4 is determined to be α 4 in a lack of reference data and the sample to be detected is diploid and is a heterozygote, the value of α is 0.6/4 to 0.15, the length difference between the false microsatellite marker and the amplification product of the real microsatellite marker generated by sliding is not large, and the conventional microsatellite marker detection method is mostly electrophoresis, so that the small length difference cannot be distinguished, even if the small length difference can be distinguished, the accurate quantification cannot be realized, and therefore, when the conventional microsatellite marker is detected, the R cannot be calculated or cannot be accurately calculatediCausing a large number of inaccurate and even erroneous conclusions.
In the following, it is assumed that Table 1 is a detected microsatellite marker locus again, and how to detect the microsatellite marker locus to be detected in the mixed sample is described. In the second high-throughput sequencing fragment of the putative microsatellite marker loci in table 1, the truncated microsatellite markers are a set of 3 (TG)20, 32 (TG)21 and 5 (TG)22, the truncated microsatellite markers are classified by repeat unit and are all TGs, and the microsatellite markers of the repeat unit with the highest occurrence frequency are reserved and are a set of 3 (TG)20, 32 (TG)21 and 5 (TG) 22; the remaining microsatellite markers were further classified by length to obtain 3 classes, respectively (TG)20, (TG)21 and (TG) 22. Of these 3 classes, the class that occupied the most the number of second high-throughput sequencing fragments was the 2 nd class (TG)21, Nmax=N232. The number of second high-throughput sequencing fragments occupied by the 1 st class (TG)20 was 3, N1Then, R1 is 3/32<α 4 is 0.15, so it is determined that the 1 st class (TG)20 is not truly present, and is due to sliding2=1,R35/32, it is determined that the 2 nd and 3 rd categories are truly present according to the same criteria. Thus, the length of the microsatellite marker in the microsatellite marker site to be detected in the mixed sample is the length of the microsatellite markers of class 2 and class 3, i.e., the tableThe length of the microsatellite marker within the microsatellite marker locus to be detected which is postulated in 1 is 42bp and 44bp (21 repetitions of the TG in the second category and thus 21X 2bp to 42 bp; 22 repetitions of the TG in the third category and thus 22X 2bp to 44 bp).
The detection was performed again in the same manner and parameters as in the above-mentioned hypothetical example, and the lengths of the microsatellite markers within the 49 microsatellite marker sites to be detected in this example were successfully detected.
The development method and the detection method of the microsatellite marker locus provided by the embodiment of the invention are quick, simple, comprehensive and accurate. The traditional development method of the microsatellite marker locus can only discover about 1 percent of the microsatellite marker loci in a genome and can only verify the polymorphism of the microsatellite marker loci in less than 100 samples due to large workload. For the invention, theoretically, all the microsatellite marker loci on the genome can be found, in the embodiment of the wheat microsatellite development, more than 1 ten thousand microsatellite marker loci are found, and the discovery capability of the microsatellite marker loci is improved by 50 times, and if the high-throughput sequencing quantity (which is easy to achieve) is increased, the discovery capability of the microsatellite marker loci can be improved to 80 times or even close to 100 times, which is easy to achieve. The embodiment of the invention combines the development (discovery) of the microsatellite marker locus and the polymorphism detection into a whole without extra work, but the traditional polymorphism detection work of the microsatellite marker locus is time-consuming and difficult to realize, for example, the polymorphism of 16562 microsatellite marker loci is detected in 32 wheat varieties, which is equivalent to 32 times of 16562 times of 529984 times of PCR amplification and electrophoresis in the traditional detection, and the workload is not imaginable. In addition, the traditional development technology of the microsatellite marker locus has large workload and no capability of detecting a plurality of sequences of the same microsatellite marker locus, so that the conservation of a multiplex amplification primer cannot be analyzed, the universality of the developed microsatellite marker multiplex amplification primer is poor, and the problem is solved by the embodiment of the invention. Taking 49 microsatellite marker loci detected at a time in the method for detecting the length of the microsatellite marker loci in the wheat of the invention as an example, 49 PCR amplifications and electrophoresis are needed for the traditional detection method. For the present invention, even if 1 ten thousand microsatellite marker loci are detected, the workload is not increased, but for the conventional detection method, the workload is increased by 1 ten thousand times. The traditional detection method is to judge the length of the microsatellite marker by electrophoresis, but the electrophoresis has errors, so reference varieties are needed to be compared, the detection workload is increased, moreover, few laboratories can have a set of complete reference varieties, but the embodiment of the invention adopts high-throughput sequencing to obtain a base sequence, and the obtained result is an absolute value, so no errors exist, and therefore, the reference varieties are not needed any more. In addition, different individuals cannot be distinguished by electrophoresis detection, for example, a sample in wheat detection of the invention is a mixture of 100 individuals, and in an electrophoresis result, the proportion of different microsatellite markers of the same microsatellite marker locus cannot be accurately calculated, so that the individual plants cannot be distinguished, and important indexes such as the rate of mixed plants cannot be calculated.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.
Sequence listing
<110> university of Jianghan
<120> development method of wheat microsatellite marker locus and method for detecting length of microsatellite marker in microsatellite marker locus
<160>12
<170>PatentIn version 3.4
<210>1
<211>24
<212>DNA
<213> Artificial sequence
<400>1
ctctctctct ctctctctct ctct 24
<210>2
<211>24
<212>DNA
<213> Artificial sequence
<400>2
gagagagaga gagagagaga gaga 24
<210>3
<211>24
<212>DNA
<213> Artificial sequence
<400>3
tgtgtgtgtg tgtgtgtgtg tgtg 24
<210>4
<211>24
<212>DNA
<213> Artificial sequence
<400>4
acacacacac acacacacac acac 24
<210>5
<211>24
<212>DNA
<213> Artificial sequence
<400>5
tatatatata tatatatata tata 24
<210>6
<211>24
<212>DNA
<213> Artificial sequence
<400>6
tgttgttgtt gttgttgttg ttgt 24
<210>7
<211>24
<212>DNA
<213> Artificial sequence
<400>7
ccaccaccac caccaccacc acca 24
<210>8
<211>24
<212>DNA
<213> Artificial sequence
<400>8
atcatcatca tcatcatcat catc 24
<210>9
<211>24
<212>DNA
<213> Artificial sequence
<400>9
cctcctcctc ctcctcctcc tcct 24
<210>10
<211>24
<212>DNA
<213> Artificial sequence
<400>10
agaagaagaa gaagaagaag aaga 24
<210>11
<211>24
<212>DNA
<213> Artificial sequence
<400>11
atgatgatga tgatgatgat gatg 24
<210>12
<211>24
<212>DNA
<213> Artificial sequence
<400>12
caacaacaac aacaacaaca acaa 24

Claims (5)

1. A wheat microsatellite marker locus development method is characterized by comprising the following steps:
the method for selecting n wheat samples with polymorphism comprises the following steps of mixing n wheat samples with polymorphism in equal mass to obtain a mixed sample, wherein n is greater than 1: selecting wheat samples with different external forms, wheat samples with different biological classifications, wheat samples marked with different labels or wheat samples of wild resources in different ecological areas;
extracting the genome of the mixed sample;
fragmenting the genome of the mixed sample to obtain a genome fragment;
using a plurality of probes with simple repetitive sequences as a probe set, hybridizing each probe in the probe set with the genome fragment respectively to obtain a plurality of hybridization solutions, wherein the number of the probes is 12, the repetitive unit in each simple repetitive sequence of the probe is CT, GA, TG, AC, TA, TGT, CCA, ATC, CCT, AGA, ATG or CAA, the repetition frequency of each simple repetitive sequence of the probe is 6-20, and purifying the successfully hybridized genome fragments in the hybridization solutions respectively to obtain a plurality of purified hybridization genome fragments;
after a plurality of purified hybrid genome segments are mixed in equal mass, detecting the mixed purified hybrid genome segments by using high-throughput sequencing to obtain a first high-throughput sequencing segment;
screening effective high-throughput sequencing fragments from the first high-throughput sequencing fragments, wherein the effective high-throughput sequencing fragments comprise microsatellite markers in microsatellite marker sites, the base numbers of sequences on two sides of the microsatellite markers in the effective high-throughput sequencing fragments are more than or equal to 1, and the base numbers of sequences on at least one side of the microsatellite markers in the effective high-throughput sequencing fragments are more than or equal to 10;
classifying the effective high-throughput sequencing fragments according to homology of sequences on two sides of a microsatellite marker in the effective high-throughput sequencing fragments, wherein the effective high-throughput sequencing fragments of the same class are the effective high-throughput sequencing fragments of the same microsatellite marker locus, if the number of the effective high-throughput sequencing fragments of the same microsatellite marker locus is more than or equal to α 1, one microsatellite marker locus is successfully developed, wherein α 1 is a first judgment threshold and α 1 is more than or equal to (the high-throughput sequencing depth is multiplied by the proportion of the effective high-throughput sequencing fragments/the number of the microsatellite marker loci capable of being detected on a genome) multiplied by probability.
2. The method of claim 1, wherein α 1 is 20 or more.
3. The method of claim 1, wherein the number of repetitions of the simple repeat sequence of each of the probes is 6 to 15.
4. The development method of claim 1, wherein the probe has a sequence as shown in SEQ ID NO 1-SEQ ID NO 12 of the sequence Listing.
5. A method for detecting the length of a microsatellite marker located within a microsatellite marker locus successfully developed by the development method according to any one of claims 1 to 4, wherein said method for detecting comprises:
selecting microsatellite marker loci to be detected from the successfully developed microsatellite marker loci, wherein the method for selecting the microsatellite marker loci to be detected from the successfully developed microsatellite marker loci comprises the following steps:
selecting the microsatellite marker locus with the standard of the microsatellite marker locus to be detected as the maximum H value, wherein the H value is the polymorphism index of the microsatellite marker locus,
Figure FDA0002309692000000021
wherein S is the number of microsatellite marker sites classified according to the length of the microsatellite markers in the effective high-throughput sequencing fragment, i is the ith category when the microsatellite marker sites are classified according to the length of the microsatellite markers in the effective high-throughput sequencing fragment, and i is a natural number; ai is the proportion of the number of the effective high-throughput sequencing fragments of the ith category to the total number of the effective high-throughput sequencing fragments, a microsatellite marker in the microsatellite marker locus to be detected is amplified by utilizing a multiplex amplification primer to obtain an amplification product, the amplification product is subjected to high-throughput sequencing to obtain a second high-throughput sequencing fragment, the length of the microsatellite marker in the microsatellite marker locus is obtained by analyzing the second high-throughput sequencing fragment, and the length of the microsatellite marker in the microsatellite marker locus is obtainedThe method for marking the length of the microsatellite marker comprises the following steps: obtaining a left border sequence of the second high-throughput sequencing fragment and a right border sequence of the second high-throughput sequencing fragment after removing the microsatellite marker in the second high-throughput sequencing fragment; aligning each of the second high-throughput sequencing fragments to the microsatellite marker locus to be detected by using the left border sequence and the right border sequence; intercepting the microsatellite marker in the second high-throughput sequencing fragment of each microsatellite marker locus to be detected; classifying the obtained microsatellite markers according to length, and calculating the truth degree R of the ith classi=Ni/NmaxWherein i is the ith class, N, when classified by the length of the microsatellite marker in said effective high throughput sequencing fragment of said microsatellite marker locusiNumber of said second high-throughput sequencing fragments for said ith class, Nmax(ii) the maximum of the number of the second high-throughput sequencing fragments for all classes; if the degree of truth Riα 4, the length of the microsatellite marker of the ith class is the length of the microsatellite marker in the microsatellite marker locus, if the true degree R isi<α 4, the length of the i-th class of microsatellite markers is not the length of the microsatellite markers within the microsatellite marker locus, wherein α 4 is a fourth decision threshold and α 4 is 0.15;
the method for preparing the multiplex amplification primer comprises the following steps:
extracting the microsatellite marker from all the effective high-throughput sequencing fragments of the selected microsatellite marker locus to be detected and selecting the longest microsatellite marker as the microsatellite marker of the template sequence of the multiplex amplification primer;
extracting left sequences of the microsatellite markers from all the effective high-throughput sequencing fragments of the selected microsatellite marker loci to be detected, selecting all sequences with the length being more than α 2 bases, selecting the sequences with the highest frequency from all the selected sequences, taking the sequences with the highest frequency as reference sequences, comparing the reference sequences with the left sequences of all the microsatellite markers, and obtaining the coverage multiple and the variation frequency of each base in the sequences with the highest frequency, wherein in the sequences with the highest frequency, the bases with the coverage multiple being less than or equal to 1/α 3 or the variation frequency being more than or equal to α 3 are changed into N and then taken as the left sequences of the template sequences of the multiple amplification primers, wherein N is any one or more than four bases of A, T, C and G, α 2 is a second judgment threshold, α 2 is (the average length of the first high-throughput sequencing fragment-the length of the microsatellite marker loci) 2; α 3 is a third judgment threshold, α 3 is not less than or equal to 365 × the first high-throughput sequencing fragment (the accuracy of the first high-throughput sequencing fragment is obtained by taking the sequence of the multiple amplification primers as the template sequences of the multiple amplification primers);
obtaining a right sequence of the template sequence of the multiplex amplification primer sequence according to a method identical to the left sequence of the template sequence of the multiplex amplification primer;
and sequentially connecting the left sequences of the template sequences of the multiple amplification primers, the microsatellite markers of the template sequences of the multiple amplification primers and the right sequences of the template sequences of the multiple amplification primers to obtain the template sequences of the multiple amplification primers of the microsatellite marker loci, and obtaining the multiple amplification primers by utilizing the template sequences of the multiple amplification primers of the microsatellite marker loci.
CN201611032452.3A 2016-11-16 2016-11-16 Development method of wheat microsatellite marker locus and length detection method of microsatellite marker in microsatellite marker locus Active CN106755314B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611032452.3A CN106755314B (en) 2016-11-16 2016-11-16 Development method of wheat microsatellite marker locus and length detection method of microsatellite marker in microsatellite marker locus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611032452.3A CN106755314B (en) 2016-11-16 2016-11-16 Development method of wheat microsatellite marker locus and length detection method of microsatellite marker in microsatellite marker locus

Publications (2)

Publication Number Publication Date
CN106755314A CN106755314A (en) 2017-05-31
CN106755314B true CN106755314B (en) 2020-04-28

Family

ID=58971841

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611032452.3A Active CN106755314B (en) 2016-11-16 2016-11-16 Development method of wheat microsatellite marker locus and length detection method of microsatellite marker in microsatellite marker locus

Country Status (1)

Country Link
CN (1) CN106755314B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101041852A (en) * 2006-03-21 2007-09-26 北京杂交小麦工程技术研究中心 Method for building-up SSR finger-print code of wheat breed
CN101144100A (en) * 2006-09-13 2008-03-19 北京市农林科学院 AFLP-STS primer for establishing wheat fingerprint and method for establishing wheat fingerprint by using SSR mark and AFLP-STS mark

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101041852A (en) * 2006-03-21 2007-09-26 北京杂交小麦工程技术研究中心 Method for building-up SSR finger-print code of wheat breed
CN101144100A (en) * 2006-09-13 2008-03-19 北京市农林科学院 AFLP-STS primer for establishing wheat fingerprint and method for establishing wheat fingerprint by using SSR mark and AFLP-STS mark

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
小麦EST-SSR 标记的开发、染色体定位和遗传作图;陈海梅 等;《科学通报》;20051031;第50卷(第20期);第2208-2216页 *
小麦EST-SSR 标记的开发和遗传作图;潘海涛 等;《中国农业科学》;20101231;第43卷(第3期);第452-461页 *

Also Published As

Publication number Publication date
CN106755314A (en) 2017-05-31

Similar Documents

Publication Publication Date Title
US20230212694A1 (en) Snp marker related to wool traits of fine-wool sheep, and detection primer set, kit, detection method and use thereof
CN115198023B (en) Hainan cattle liquid-phase breeding chip and application thereof
CN107345256A (en) One kind is based on transcript profile sequencing exploitation grass vetch EST SSR primer sets and methods and applications
CN107760789B (en) Genotyping detection kit for parent-child identification and individual identification of yaks
CN109337997B (en) Camellia polymorphism chloroplast genome microsatellite molecular marker primer and method for screening and discriminating kindred species
CN109554486A (en) SNP marker relevant to grass carp character and its application
KR20190135797A (en) Genetic maker for parentage and thereod in Turbot
CN106520958B (en) Method for developing microsatellite marker locus and method for detecting length of microsatellite marker in microsatellite marker locus
CN113564266B (en) SNP typing genetic marker combination, detection kit and application
CN109825565B (en) Cynoglossus semilaevis true and false male fish screening method based on fluorescent molecular marker system
CN109234449A (en) A kind of special codominance KASP molecular labeling of the general 2RL chromosome of rye and its application
CN106520955B (en) Development method of rice microsatellite marker locus and length detection method of microsatellite marker in microsatellite marker locus
CN106520961B (en) Corn microsatellite marker locus development method and length detection method of microsatellite markers in microsatellite marker locus
CN106636362B (en) Soybean microsatellite marker locus development method and microsatellite marker length detection method in microsatellite marker locus
CN106520959B (en) Development method of orchid microsatellite marker locus and method for detecting length of microsatellite marker in microsatellite marker locus
CN106755314B (en) Development method of wheat microsatellite marker locus and length detection method of microsatellite marker in microsatellite marker locus
CN110305974B (en) PCR analysis primer for distinguishing common mouse inbred lines based on detection of five SNP loci and analysis method thereof
CN106520954B (en) Development method of cotton microsatellite marker locus and length detection method of microsatellite marker in microsatellite marker locus
CN106566890B (en) Method for developing rape microsatellite marker locus and method for detecting length of microsatellite marker in microsatellite marker locus
CN106755312B (en) Method for developing potato microsatellite marker locus and method for detecting length of microsatellite marker in microsatellite marker locus
CN106520960B (en) Sesame microsatellite marker locus development method and method for detecting length of microsatellite marker in microsatellite marker locus
CN111763668B (en) Sequencing primer group and PCR-based whole genome sequencing method
CN113699253A (en) Laoshan milk goat low-density liquid-phase SNP chip and application thereof
CN113151501A (en) Method for auxiliary detection of growth traits of cattle WBP1L gene CNV marker and application thereof
CN110423826A (en) A kind of C57BL/6 subbreed mouse KASP genetic detection kit and primer

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant