CN104805192A - Method for testing substantive derivation relation of oilseed rape varieties - Google Patents

Method for testing substantive derivation relation of oilseed rape varieties Download PDF

Info

Publication number
CN104805192A
CN104805192A CN201510150605.3A CN201510150605A CN104805192A CN 104805192 A CN104805192 A CN 104805192A CN 201510150605 A CN201510150605 A CN 201510150605A CN 104805192 A CN104805192 A CN 104805192A
Authority
CN
China
Prior art keywords
measured
rape
test zone
genotype
varieties
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510150605.3A
Other languages
Chinese (zh)
Inventor
方治伟
彭海
陈红
胡长峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Agriculture Ministry Technology Development Center
Jianghan University
Original Assignee
Agriculture Ministry Technology Development Center
Jianghan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Agriculture Ministry Technology Development Center, Jianghan University filed Critical Agriculture Ministry Technology Development Center
Priority to CN201510150605.3A priority Critical patent/CN104805192A/en
Publication of CN104805192A publication Critical patent/CN104805192A/en
Pending legal-status Critical Current

Links

Landscapes

  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention discloses a method for testing the substantive derivation relation of oilseed rape varieties. The method comprises the following steps: respectively amplifying the DNA in two samples by using primers, and respectively obtaining amplification products of the two oilseed rape varieties to be tested in a testing area for establishing high-throughput sequencing libraries of the two oilseed rape varieties to be tested; performing high-throughput sequencing on the two high-throughput sequencing libraries, so as to respectively obtain sequencing fragment groups of the two oilseed rape varieties to be tested; analyzing the two sequencing fragment groups, so as to respectively obtain the genotypes of the two oilseed rape varieties to be tested; comparing the genotypes of the two oilseed rape varieties to be tested, so as to obtain the proportion of differential genotypes between the oilseed rape varieties to be tested; judging the substantive derivation relation of the two oilseed rape varieties to be tested according to the proportion of differential genotypes between the oilseed rape varieties to be tested. The method can be used for accurately, quickly and simply judging the substantive derivation relation between the oilseed rape varieties to be tested.

Description

A kind of method of testing the substantive derived relation of rape variety
Technical field
The present invention relates to biological technical field, particularly a kind of method of testing the substantive derived relation of rape variety.
Background technology
UPOV (International Union for the Protection of New Varieties of Plants: UPOV) pact text in 1991 has done the regulation of principle to Essentially derived variety; namely Essentially derived variety refers to that the B kind obtained by A breed breeding does not have substantial change; B kind is called the Essentially derived variety of A kind, has substantive derived relation between A kind and B kind.The method judging whether to have between two kinds substantive derived relation detects genotypic difference ratio between these two kinds, when this difference ratio exceedes certain value, can think, between two kinds, not there is substantive derived relation, on the contrary, then think, between two kinds, there is substantive derived relation.
The method that current detection substance derives from sexual intercourse is also little, only methodical roughly flow process is: by SSR marker or SNP marker, increase each test zone of rape variety to be measured, the genotype of each test zone obtained is detected again by electrophoresis or generation order-checking, according to genotype, judge the substantive derived relation between rape variety to be measured.
Realizing in process of the present invention, contriver finds that prior art at least exists following problem:
After substantive derived relation needs to detect gene locuss a large amount of between rape variety to be measured, accurately could judge, between two kinds, whether there is substantive derived relation.Detect substantive derivation in the method for sexual intercourse existing, detection site causes the judgement conclusion of substantive derived relation inaccurate less.Simultaneously, existing SSR marker and SNP marker are due to needs amplification and each test zone of independent detection separately, therefore, test zone number is too much, workload will inevitably be caused greatly to increase, therefore, existing method testing number of regions is all within 300, completely can not represent the full gene type of rape variety to be measured, thus cause detected result inaccurate, the judgement conclusion of substantive derived relation is also inaccurate.
Summary of the invention
Detecting the inaccurate problem of substantive derived relation to solve in prior art, embodiments providing a kind of method of testing the substantive derived relation of rape variety.Described technical scheme is as follows:
Embodiments provide a kind of method of testing the substantive derived relation of rape variety, described method comprises:
Obtain the variant sites between different rape variety;
By described variant sites determination test zone;
Respectively two rape varieties to be measured are sampled, extract and obtain the DNA of the sampling sample of two described rape varieties to be measured;
The primer of the described test zone of preparation amplification;
Described primer is utilized to increase to the DNA of two described sampling samples respectively, obtain the amplified production of two described rape varieties to be measured at described test zone respectively, described amplified production is respectively used to the high-throughput sequencing library of the described rape variety to be measured of structure two;
Respectively high-flux sequence is carried out to the described high-throughput sequencing library of two described rape varieties to be measured, obtains the sequenced fragments group of two described rape varieties to be measured respectively;
Analyze the described sequenced fragments group of two described rape varieties to be measured, obtain two rape variety genotype to be measured respectively, described rape variety genotype to be measured is the combination of variation base in described test zone, and genotypic frequency >=30% of described rape variety to be measured;
Relatively two described rape variety genotype to be measured, obtain the ratio of Differential genotype between rape variety to be measured;
According to the ratio of Differential genotype between described rape variety to be measured, judge the substantive derived relation of two described rape varieties to be measured.
Particularly, described test zone does not comprise the amplification generation genotypic region of hybrid strain;
Described hybrid strain genotype refers to frequency >=0.02%, and has insertion or the disappearance of discontinuous base in quantity >=2 of distinguishing base between all genotype of described hybrid strain genotype and described rape variety to be measured or described distinguishing base.
Particularly, the number of described test zone meets the following conditions as BINOMDIST (SD*TN, TN, 0.80*SD, TRUE) >=95%, and wherein, TN is the number of described test zone, and SD is decision threshold; The condition implication that the number of described test zone meets is: when the number of described test zone be TN, described decision threshold be SD and between described rape variety to be measured, the ratio of Differential genotype is 0.80*SD time, judge that the probability guarantee that the ratio of Differential genotype between described rape variety to be measured is less than described decision threshold SD is more than or equal to 95%.
Particularly, to the method that two described rape varieties to be measured are sampled be respectively: the sampling sample obtaining two described rape varieties to be measured after the sample mixing to two described rape varieties difference random selecting more than 100 to be measured.
Particularly, the method for the substantive derived relation judged between described rape variety to be measured is:
When between described rape variety to be measured during the ratio < SD of Differential genotype, two described rape varieties to be measured have substantive derived relation; When between described rape variety to be measured during the ratio >=SD of Differential genotype, two described rape varieties to be measured do not have substantive derived relation, and wherein, SD is decision threshold.
Further, when there is substantive derived relation if judge between two described rape varieties to be measured, probability >=BINOMDIST (SD*TRN, TRN, OD, TRUE) that conclusion is correct; If when judging that two described rape varieties to be measured do not have substantive derived relation, probability >=BINOMDIST ((1-SD) * TRN, TRN, 1-OD, TRUE) that conclusion is correct; Wherein, TRN is the number of the total test zone of two described rape varieties to be measured, OD is the ratio of Differential genotype between described rape variety to be measured, BINOMDIST is excel2010 function, BINOMDIST (SD*TRN, TRN, OD, TRUE) implication is: when the number of described total test zone is TRN, between described rape variety to be measured, the ratio OD of Differential genotype is less than the probability of described decision threshold SD; BINOMDIST ((1-SD) * TRN, TRN, 1-OD, TRUE) implication is: when the number of described total test zone is TRN, between described rape variety to be measured, the ratio OD of Differential genotype is greater than the probability of described decision threshold SD.
Particularly, determine that the method for described test zone is by described variant sites:
Pass through discrimination calculate the value of discrimination, wherein, a is the kind sum be detected in variation window area, bi is i-th kind of genotypic kind number in described variation window area, and bi>1, k comprises the genotypic number being greater than a kind, and described variation window area is centered by each single nucleotide variations site, and the both sides to described single nucleotide variations site respectively extend 1/2 of survey sequence length as the window detected;
Described test zone is the large and equally distributed region of described discrimination on region or nuclear genome that on cytoplasmic skeleton, discrimination is large.
The beneficial effect that the technical scheme that the embodiment of the present invention provides is brought is: the method that the embodiment of the present invention provides is increased and high-flux sequence by multidigit point, ensure the large sample sampling of the test zone of rape variety to be measured, successfully achieve the target accurately judging substantive derived relation between rape variety to be measured, and test is simple, quick.
Embodiment
For making the object, technical solutions and advantages of the present invention clearly, below embodiment of the present invention is described further in detail.
Embodiment measures the substantive derived relation between rape variety " Soviet Union 2051 " and " 430AB "
The rape variety to be measured that the embodiment of the present invention provides is rape variety " Soviet Union 2051 " and " 430AB ", and the two is open, known kind.
One, the variant sites between different rape variety is obtained.
Variant sites between different rape variety can obtain from the documents and materials announced, but the results contrast that the method obtains is fragmentary, in the present embodiment, by the genome sequence of different rape and the genome sequence with reference to rape variety being compared, obtain the variant sites between a large amount of different rape varieties.
Further, the method obtaining the genome sequence of different rape variety is as follows:
The genome sequence of the different rape varieties of the present embodiment shows two kinds of sources, the first is the genomic high-flux sequence sequence to 10 rape varieties such as Huang, and pertinent literature information is as follows: Huang et al.:Identification of genome-wide singlenucleotide polymorphisms in allopolyploid cropBrassica napus.BMC Genomics201314:717.The genome sequence of these 10 rape varieties is published in NCBI Short Read Archive (http://www.ncbi.nlm.nih.gov/sra), and reception number is SRA057227; The second is for having carried out high-flux sequence by the method provided in the above-mentioned article delivered of Huang etc. to " 430AB ", " P65 " and cross-fertilize seed " peaceful assorted No. 9 ".The present embodiment obtains the genomic high-flux sequence sequence of 13 rape varieties altogether.
Further, the genome sequence of different varieties is utilized to obtain variant sites.
Particularly, because the order-checking degree of depth of these 13 rape varieties is not high, only can identify single nucleotide variations (SNP) site, such as the repeat number of other variation type makes a variation, and due to a low credibility, does not identify.Utilize Frederick Sanger comparison software (version number is 0.4) by the genomic high-flux sequence sequence alignment of these 13 rape varieties to rape cell core reference genome (version: Release v1.01, download address: http://www.ncbi.nlm.nih.gov) and tenuigenin with reference on genome, this tenuigenin comprises plastosome with reference to genome and chloroplast(id) reference genome with reference to genome, it is at NCBI (National Center forBiotechnology Information, US National Biotechnology Information center) on reception number be respectively NC_016734.1 and AP006444.1.During contrast, Insert Fragment length is set to 500bp, and other parameter settings are default value.The Ssaha Pileup software package (version number is 0.5) adopted identifies the SNP site of each kind.This SNP site is defined as base pair that difference determines, the insertion of single base or the disappearance of single base.The base pair that this difference is determined refers to and does not comprise the uncertain base pair of difference, the uncertain base pair of difference refers to it is the base pair between some degeneracy base, as R represents A or G, therefore, may there are differences between A and R, also may not there are differences, therefore, between A and R, difference is indefinite, is not SNP mutually.Therefore, the SNP site in the embodiment of the present invention is not for comprise the uncertain base pair of above-mentioned difference.By the definition of above SNP site, the embodiment of the present invention obtains 911346 SNP site altogether between all 13 rape varieties, and wherein 18543 SNP site are positioned on cytoplasmic skeleton, and remaining SNP site is positioned on nuclear genome.Namely the genotype hereinafter mentioned refers to the combination of multiple SNP site in test zone, and nuclear gene type refers to that genotype is positioned on nuclear genome, and plasmagene type refers to that genotype is positioned on cytoplasmic skeleton.Such as, in table 1, the 1st test zone is positioned on nuclear genome, and be nuclear gene type, this test zone has 3 SNP site, and the genotype of this test zone is the combination of these 3 SNP site.
Two, by variant sites determination test zone, concrete grammar is as follows:
Test zone is the large and equally distributed region of SNP site of discrimination on region or nuclear genome that on cytoplasmic skeleton, discrimination is large, wherein, and discrimination wherein, a is the kind sum be detected in variation window area, bi is i-th kind of genotypic kind number in variation window area, and bi>1, k comprises the genotypic number being greater than a kind, variation window area is centered by each single nucleotide variations site (SNP site), and the both sides to single nucleotide variations site respectively extend 1/2 of survey sequence length as the window detected; Test zone is the large and equally distributed region of discrimination on region or nuclear genome that on cytoplasmic skeleton, discrimination is large.The Computing Principle of discrimination is as follows: all interracial number of combinations are wherein, the combination between the different varieties in same gene type is undistinguishable, and its number is so, can not be by the ratio of the breed combination distinguished can by the ratio of breed combination distinguished and discrimination as can be seen here, discrimination is larger, more different varieties can be distinguished, and the test of variation window area to substantive derived relation that discrimination is large is more effective.If the variation window area skewness on nuclear genome, can cause some region adjacent, thus linkage inheritance, information is easily overlapping, therefore, nuclear genome is selected the principle of compositionality of test zone be: the large and SNP site of discrimination is uniformly distributed.Cytoplasmic skeleton without linkage inheritance problem, so, cytoplasmic skeleton only needs the region that selection area calibration is large.
First, centered by each SNP site obtained, respectively extend 99bp and 100bp to the left and right, form the variation window of 200bp.According to 911346 SNP site obtained, 911346 variation windows can be obtained, calculate the discrimination of these variation window areas such as, in the 1st variation window area, detect a=13 kind altogether, total k=2 kind genotype CCA, TTA, their kind number is respectively b1=4 and b2=7, therefore, its implication is: by the 1st variation window area, the breed combination of 65% in 13 kinds can be distinguished, the breed combination of other 35% cannot distinguish, and needs more variation window just can distinguish.After the same method, the discrimination of whole 911346 the variation windows of calculating acquisition is also therefrom chosen and is arranged in 6000 maximum variation windows of nuclear genome discrimination, 100 the make a variation windows maximum with being arranged in cytoplasmic skeleton discrimination.Check 6000 the variation windows being arranged in nuclear genome one by one, each variation window and the next distance made a variation between window, if distance is more than 200K (1K=1000 base), reexamine after then abandoning the less variation window of wherein discrimination, till the adjacent distance looking into variation window is all greater than 200K.The criterion distance of 200K is selected to be because rapeseed gene group size is about 930M (1M=,100 ten thousand bases), the test zone that 2000 are positioned at nuclear genome is selected in by final, average test zone spacing is about 500K, but because seldom there is variant sites in such as kinetochore etc., some specific regions, therefore, mean distance should be less than 500K.By above method, have selected the variation window that 4367 are positioned at nuclear genome, they with obtain to be arranged in together with 100 maximum windows that make a variations of cytoplasmic skeleton discrimination totally 4467 windows that make a variation as the test zone be selected in.Wherein, 100 variation windows that selection area calibration is maximum, be empirical value, this quantity can be modified as the case may be.
Three, respectively to two rape variety sampling to be measured, extract and obtain the DNA of the sampling sample of two rape varieties to be measured, the preparation method of sampling sample is: after the sample mixing to two rape varieties difference random selecting more than 100 to be measured, obtain the sampling sample of two rape varieties to be measured.
In the present embodiment, have chosen 10000 seed germinations of rape variety to be measured " Soviet Union 2051 ", the roughly equal bud mixing of random selecting 8000 sizes is placed in mortar, fully pulverizes after adding liquid nitrogen in mortar.The article No. adopting Beijing Tian Gen biochemical technology company limited to produce is that the plant genome DNA extraction test kit of DP305 extracts and obtains the DNA of rape variety to be measured " Soviet Union 2051 " mixing sample, and DNA extraction method is undertaken by the operational manual of this test kit.Utilize the Qubit that American I nvitrigen company produces dsDNA HS Assay Kit (article No. is Q32852) and specification sheets thereof carry out quantitatively to the DNA obtained, and the DNA dilution that the rape variety to be measured quantitatively " is revived 2051 " is 10.00ng/ μ l.
After the same method, sampling to rape variety to be measured " 430AB " and extract DNA, is 10.00ng/ μ l by the DNA dilution of the rape variety to be measured " 430AB " after quantitatively equally.
Four, the primer in amplification assay region is prepared, specific as follows:
Test zone adopts multiplex PCR (Polymerase Chain Reaction, polymerase chain reaction) technology to detect, and multiple PCR technique refers to and add multiple PCR primer, the multiple sites simultaneously in amplification gene group in same PCR reaction.The key of this technology designs and synthesizes multiple PCR primer, the multiple PCR technique that the present embodiment adopts LifeTechnology company of the U.S. to provide, and it can arrange the heavy PCR primer of as many as 12000.
The number of test zone meets the following conditions: BINOMDIST (SD*TN, TN, 0.80*SD, TRUE) >=95%, and wherein, TN is the number of test zone, and SD is decision threshold; Formula BINOMDIST (SD*TN, TN, 0.80*SD, TRUE) implication is: when the number of test zone be TN, between rape variety to be measured the ratio of Differential genotype be 0.80*SD and decision threshold is SD time, between rape variety to be measured, the ratio of Differential genotype is less than the probability of decision threshold.The implication of this condition is: when when between rape variety to be measured, the ratio of Differential genotype is 80% of judgment threshold, and the judgement rape variety to be measured determined by the number of test zone has accuracy >=95% of substantive derived relation.The judgment threshold of substantive derived relation is Breeding Situation, mark mode according to various countries, requires Stringency and artificially determine.In the present embodiment, SD is defined as 5.0%.The number TN progressively strengthening test zone finds, when TN >=1000, above-mentioned formula is set up, therefore, the number of test zone should >=1000.Concerning existing SSR and SNP test, 200 test zones have calculated that it is enough, if the ratio of Differential genotype is 80% of judgment threshold between rape variety to be measured, its accuracy only >=BINOMDIST (5.0%*200,200,0.80*5.0%, TRUE)=82%, therefore, the method that this enforcement provides, can obtain conclusion more accurately.
Primer acquisition process is as follows: log in LifeTechnology company multiple PCR primer Photographing On-line webpage https: //ampliseq.com/protected/help/pipelineDetails.action, submits relevant information to by its requirement.Wherein, in the present embodiment, " Application type " option selects " DNA Hotspot designs (single-pool) ".If select multi-pool, then multiplex PCR will divide multitube to carry out, cost can increase to some extent, and the primer of single-pool only needs a multiplex PCR, save cost, shortcoming is that some universal test regions design of primers may failure, but alternative universal test region on genome is more, therefore, abandon some alternative universal test regions and do not affect result.Permeate the nucleus of rape variety to be measured reference genome and tenuigenin reference genome a file, and select " Custom " in " Select the genome you wish to use " option after, upload the file of fusion as reference genome during design multiple PCR primer.DNA type option selects " Standard DNA ", in Add Hotspot option, add the positional information of the SNP site in the universal test region needing design, comprise chromosome information, the initiation site of SNP and the end locus of SNP, its certain embodiments is in table 1.Finally click " Submit targets " button to submit to and the multiple PCR primer obtaining design.In the present embodiment, from all 4467 test zones, design and demonstrate 2302 pairs of multiple PCR primers, for corresponding 2302 test zones that increase.The method of checking multiple PCR primer is for pressing method provided by the invention, extract the leaves genomic DNA on same strain rape, and utilize the multiple PCR primer of design to increase to the genomic dna obtained, build storehouse, high-flux sequence analyze sequenced fragments group, remove the corresponding primer of following test zone: the sequenced fragments number of this test zone is less than 1000 or there is hybrid strain genotype, and the primer remained is the multiple PCR primer be proved to be successful.So, test zone does not comprise amplification and produces the genotypic region of hybrid strain, hybrid strain genotype refers to frequency >=0.02%, and has insertion or the disappearance of discontinuous base in quantity >=2 of distinguishing base between all genotype of hybrid strain genotype and rape variety to be measured or distinguishing base.Because genomic DNA source is in same strain rape leaf, can not there is hybrid strain kind, therefore, hybrid strain genotype is the PCR or order-checking Preference mistake that are caused by the special construction of test zone, removes these test zones and avoids this type of system mistake.Regulation test zone is another object not comprising the genotypic test zone of amplification generation hybrid strain: the test zone remained is except being used as the test of the substantive derived relation between rape variety to be measured, the calculating of hybrid strain rate can also be done, achieve the multiple use of same set of test primer.The multiple PCR primer be proved to be successful is supplied to client in fluid form and uses after also being mixed by the said firm.2302 test zones of above-mentioned successful design multiple PCR primer are the test zone finally detected for rape variety to be measured, and wherein, 55 test zones are positioned on cytoplasmic skeleton, remaining 2247 test zones are positioned on nuclear genome.Existing Essentially derived variety judges the test site all do not adopted on cytoplasmic skeleton, and cytoplasmic difference, different varietal character performances can be produced equally, the judgement of Essentially derived variety relation should be used for.
Five, utilize primer to increase to the DNA of two sampling samples respectively, obtain the amplified production of two rape varieties to be measured at test zone respectively, amplified production is respectively used to the high-throughput sequencing library of structure two rape varieties to be measured, and concrete grammar is as follows:
After utilizing library construction Kit 2.0 (produced by LifeTechnology company of the U.S., article No. is 4475345) multiplexed PCR amplification test zone, amplified production is utilized to build high-throughput sequencing library.This test kit comprises following reagent: 5 × Ion AmpliSeq tMhiFi Mix, FuPa reagent, transferring reagent, sequence measuring joints solution and DNA ligase.The method of library construction presses operational manual " the Ion AmpliSeq of this test kit tMlibraryPreparation " (publication number: MAN0006735, version: A.0) carry out.By multiplexed PCR amplification 2302 test zones, the amplification system of multiplex PCR is as follows: 5 × Ion AmpliSeq tMthe DNA10ng of the test zone primer mixed solution 4 μ l of HiFi Mix 4 μ l, preparation, rape variety to be measured " Soviet Union 2051 " and without enzyme water 11 μ l.The amplification program of multiplex PCR is as follows: 99 DEG C, 2 minutes; (99 DEG C, 15 seconds; 60 DEG C, 4 minutes) × 25 circulations; 10 DEG C of insulations.After utilizing FuPa reagent to digest primer unnecessary in multiplexed PCR amplification product, then carry out phosphorylation, concrete grammar is: in the amplified production of multiplex PCR, add 2 μ L FuPa reagent, after mixing, by following program reaction in PCR instrument: 50 DEG C, and 10 minutes; 55 DEG C, 10 minutes; 60 DEG C, 10 minutes; 10 DEG C of preservations, obtain mixture a, and mixture a is containing the amplified production solution through phosphorylation.The amplified production of phosphorylation is connected upper sequence measuring joints, and concrete grammar is: in mixture a, add transferring reagent 4 μ L, sequence measuring joints solution 2 μ L and DNA ligase 2 μ L, after mixing, by following program reaction in PCR instrument: 22 DEG C, and 30 minutes; 72 DEG C, 10 minutes; 10 DEG C of preservations, obtain mixed solution b.10 μ L are dissolved in without in enzyme water after utilizing the ethanol precipitation methods purifying mixed solution b of standard.Utilize the Qubit that American I nvitrigen company produces dsDNA HS Assay Kit (article No. is Q32852) also measures according to its specification sheets, and after obtaining the mass concentration of mixed solution b, mixed solution b after purifying is diluted to 15ng/ml, obtains the high-throughput sequencing library that concentration is about the test zone of 100pM.
After the same method, rape variety to be measured " 430AB " is carried out to the structure of high-throughput sequencing library, obtain the high-throughput sequencing library that concentration is about the test zone of 100pM equally.
Six, carry out high-flux sequence respectively to the high-throughput sequencing library of two rape varieties to be measured, obtain the sequenced fragments group of two rape varieties to be measured respectively, concrete grammar is as follows:
Determine the high-flux sequence degree of depth: the degree of depth >=5000 times of high-flux sequence, i.e. segments >=5000 fragment in average coverage test district, 5000 times is an empirical value, can adjust according to practical situation.Why specify this value, be because the order-checking amount cost of 5000 times not high but be enough to accurately calculate 30% testing gene type frequency, therefore, specify 5000 times of degree of depth as high-flux sequence.
High-throughput sequencing library is utilized to carry out high-flux sequence
Utilize high-throughput sequencing library and test kit Ion PI Template OT2200Kit v2 (invirtrigen company of the U.S. production of all test zones obtained, article No. is 4485146) check order before ePCR (Emulsion PCR, emulsion polymerization enzyme chain reaction) amplification, working method is undertaken by the operational manual of this test kit.(invirtrigen company of the U.S. produces to utilize ePCR product and test kit Ion PI Sequencing 200Kit v2, article No. is 4485149) on Proton bis-generation high-flux sequence instrument, carry out high-flux sequence, working method is undertaken by the operational manual of this test kit.In the present embodiment, high-flux sequence flux is set to 10000 times, average coverage test region.
Pre-treatment is carried out to a large amount sequencing result
By the comparison of high-flux sequence fragment to all 2302 test zones, after removing the sequenced fragments that comparison is unsuccessful and genotype detection is incomplete, remaining all sequenced fragments are called sequenced fragments group.The incomplete sequenced fragments of genotype detection refers to the sequenced fragments that all SNP site shown in " position of SNP on reference genome " in table 1 could not be detected, the reason that genotype detection is incomplete is that sequenced fragments is too short, and the unsuccessful reason of comparison is that sequenced fragments mostly is non-specific amplification product.
Seven, the sequenced fragments group of two rape varieties to be measured is analyzed, obtain two rape variety genotype to be measured respectively, rape variety genotype to be measured is the combination of variation base in test zone, and genotypic frequency >=30% of rape variety to be measured, concrete grammar is as follows:
By the comparison of sequenced fragments group to all test zones, and add up the sequenced fragments number in each test zone, remove the test zone of sequenced fragments number≤1000, remaining test zone is for detecting successful test zone.In the present embodiment, obtain 2117 altogether and detect successful test zone.Comparison is called the sequenced fragments of this test zone to the fragment of test zone, and the base composition extracting in table 1 position shown in " SNP with reference to the position on genome " from sequenced fragments is called the genotype of this sequenced fragments.Genotypic frequency refers in sequenced fragments group, represents the ratio that this genotypic sequenced fragments number accounts for the sequenced fragments sum of this genotype place test zone.Rape variety genotype to be measured is the combination of variation base in test zone, and genotypic frequency >=30% of rape variety to be measured.In general, in the sample extracted, the amount of hybrid is not higher than 10%, order-checking mistake is no more than 1%, the two total is no more than 11%, therefore, for site of isozygotying, rape variety genotype to be measured only has one, and its frequency should be greater than 89%, and for heterozygous sites, rape variety genotype to be measured has 2 kinds, and its ratio should be greater than 45.5%, therefore, specify genotypic frequency >=30% of rape variety to be measured, can get rid of and have hybrid strain and to the genotypic interference of rape variety to be measured because mixing in check order mistake and rape variety to be measured.
Such as, in sequenced fragments group, the sequenced fragments in the 1st order-checking region adds up to 11230 articles, there are TTA, CCA, TTT, TTG ... totally 29 kinds of genotype, represent these genotypic sequenced fragments numbers 10840,281,2,2 respectively ..., these genotypic frequencies are 10840/11230=96.52%, 281/11230=2.50%, 2/11230=0.02%, 2/11230=0.02% ...By the genotypic definition of rape variety to be measured, TTA is rape variety to be measured " Soviet Union 2051 " genotype of the 1st test zone, and other genotype is the genotype that order-checking mistake or hybrid strain cause.By identical method, judge and obtain whole 2117 rape variety to be measured " Soviet Union 2051 " genotype detecting successful test zones.
By with the rape variety to be measured method that " to revive 2051 " identical, 2117 that obtain rape variety to be measured " 430AB " are equally detected successful test zone, and rape variety to be measured " 430AB " is in the genotype of the successful test zone of all detections, partial results is in table 1.The present embodiment does not have completely to list whole rape variety to be measured in all test zone genotype as space is limited, only lists certain embodiments.Equally based on length restriction, also have some areas also only to list part related example in the present embodiment, all the other unlisted data can according to the method completion of the present embodiment.
Table 1 is product to be tested rape kind genotype and relevant information thereof
Eight, compare two rape variety genotype to be measured, obtain the ratio of Differential genotype between rape variety to be measured, concrete grammar is as follows:
If in the test, all rape variety genotype to be measured, all without disappearance, claim this test zone to be the total test zone of rape variety to be measured.In total test zone, if genotype is incomplete same between rape variety to be measured, this genotype is then claimed to be Differential genotype between rape variety to be measured, such as, if the genotype of rape variety A to be measured is AA, AA, AA, AA, AA/CC, AA/CC, the genotype that rape variety B to be measured is corresponding is AA, CC, AA/CC, CC/TT, AA/CC, CC/TT, and the Differential genotype so in these 6 test zones between rape variety to be measured judges as follows: be not, be, be, be, be not and be.It is genetic heterozygosis site that "/" in above genotype represents this test zone, therefore detects latter two genotype before "/".The number of the number/total test zone of ratio=the have test zone of Differential genotype of Differential genotype between rape variety to be measured.
In the present embodiment, the total test zone that rape variety to be measured " 430AB " and rape variety to be measured " revive 2051 " is 2117, in the 1st total test zone, their genotype is respectively CCA and TTA, they are not identical, therefore, are the Differential genotype between rape variety to be measured.According to said method, judge in all 2117 total test zones one by one, whether there are differences genotype, result is: in all total test zones, there are differences genotypic test zone number is 292, so, the ratio OD=292/2117=13.79% of Differential genotype between rape variety to be measured.
According to ratio and the decision threshold of Differential genotype between rape variety to be measured, judge the substantive derived relation of two rape varieties to be measured, judge that the method for the substantive derived relation of two rape varieties to be measured is: when between rape variety to be measured during the ratio < SD of Differential genotype, two rape varieties to be measured have substantive derived relation; When between rape variety to be measured during the ratio >=SD of Differential genotype, two rape varieties to be measured do not have substantive derived relation, and wherein, SD is decision threshold.
In the present embodiment, the ratio=13.79%>SD=5.0% of Differential genotype between rape variety to be measured, therefore judges that rape variety to be measured " 430AB " and rape variety to be measured " revive 2051 ", does not have substantive derived relation.
If when judging that two rape varieties to be measured have substantive derived relation, probability >=BINOMDIST (SD*TRN, TRN, OD, TRUE) that conclusion is correct; If when judging that two rape varieties to be measured do not have substantive derived relation, probability >=BINOMDIST ((1-SD) * TRN, TRN, 1-OD, TRUE) that conclusion is correct; Wherein, TRN is the number of the total test zone of two rape varieties to be measured, SD is decision threshold, OD is the ratio of Differential genotype between rape variety to be measured, and BINOMDIST is excel2010 function, BINOMDIST (SD*TRN, TRN, OD, TRUE) implication be: when the number of total test zone is TRN, between rape variety to be measured, the ratio OD of Differential genotype is less than the probability of decision threshold SD; BINOMDIST ((1-SD) * TRN, TRN, 1-OD, TRUE) implication is: when the number of total test zone is TRN, between rape variety to be measured, the ratio OD of Differential genotype is greater than the probability of decision threshold SD.
In the present embodiment, judge that two rape varieties to be measured have substantive derived relation and the number of total test zone is 2117, therefore, probability >=BINOMDIST ((1-SD) * TRN, TRN, 1-OD that conclusion is correct, TRUE)=BINOMDIST ((1-5.0%) * 2117,2117,1-13.79%, TRUE)=100%.Therefore, in this enforcement, judge that the conclusion between two rape varieties to be measured without substantive derived relation is accurately.
Result verification
At present, the standard method that between the whole world kind that also neither one is generally acknowledged, substantive derived relation is assert, but 1991 of UPOV have such description in text: " Essentially derived variety can by such as natural or bring out saltant type, somaclonal variation type selection; from the selection of the variation strain original variety plant, and backcross or genetic engineering transform obtain ".In the present embodiment, rape variety to be measured " 430AB " and rape variety to be measured there is no above-mentioned seed selection relation between " reviving 2051 ", so, by the description of UPOV text, these two rape varieties to be measured should not have substantive derived relation, show, the conclusion of the present embodiment is correct.
The embodiment of the present invention, by high-flux sequence and the amplification of multidigit point, achieves the large sample sampling in rape variety build-in test region to be measured, ensure that the accuracy that substantive derived relation detects.Meanwhile, utilize multidigit point amplification technique, the testing quality entity of multiple test zone is equivalent to the workload of the test zone that traditional SSR or SNP detects, therefore, the present embodiment is not only accurate, and method is simple, quick.Such as, 2117 test zones are detected if same, utilize traditional SSR or SNP detection method, (SSR detects by electrophoresis method to need to carry out the detection of the amplified production of 2117 pcr amplifications and 2117 times, SNP detects by the method for generation test), its workload, speed and testing cost are all can not be received.In addition, obtain the sequence of each base in test zone in the present invention, resolving power has reached ultimate attainment, and quantity of information is also maximum, that other detection method is all incomparable, such as, the length information of fragment that what SSR obtained is, possible Problems existing is: fragment length difference is too small, may can't detect, even the fragment of equal length, the based composition of its inside also may be different, but SSR can't detect equally.A SNP detects and can only detect a pair SNP site, and in the present embodiment, can detect all SNP of test zone, can also find the variation in test zone outside SNP, and as repeat number, disappearance and insertion etc., these single SNP detect and cannot find.In addition, except the order-checking of an employing generation detects except SNP, also have and adopt the mode of hybridization to detect SNP, in this case, what obtain is hybridization signal, and noise is large and cannot find unknown variation, and a probe only can detect a pair SNP, resolving power and quantity of information are also much smaller than the present invention.
The foregoing is only preferred embodiment of the present invention, not in order to limit the present invention, within the spirit and principles in the present invention all, any amendment done, equivalent replacement, improvement etc., all should be included within protection scope of the present invention.

Claims (7)

1. test a method for the substantive derived relation of rape variety, it is characterized in that, described method comprises:
Obtain the variant sites between different rape variety;
By described variant sites determination test zone;
Respectively two rape varieties to be measured are sampled, extract and obtain the DNA of the sampling sample of two described rape varieties to be measured;
The primer of the described test zone of preparation amplification;
Described primer is utilized to increase to the DNA of two described sampling samples respectively, obtain the amplified production of two described rape varieties to be measured at described test zone respectively, described amplified production is respectively used to the high-throughput sequencing library of the described rape variety to be measured of structure two;
Respectively high-flux sequence is carried out to the described high-throughput sequencing library of two described rape varieties to be measured, obtains the sequenced fragments group of two described rape varieties to be measured respectively;
Analyze the described sequenced fragments group of two described rape varieties to be measured, obtain two rape variety genotype to be measured respectively, described rape variety genotype to be measured is the combination of variation base in described test zone, and genotypic frequency >=30% of described rape variety to be measured;
Relatively two described rape variety genotype to be measured, obtain the ratio of Differential genotype between rape variety to be measured;
According to the ratio of Differential genotype between described rape variety to be measured, judge the substantive derived relation of two described rape varieties to be measured.
2. method according to claim 1, is characterized in that, described test zone does not comprise amplification and produces the genotypic region of hybrid strain;
Described hybrid strain genotype refers to frequency >=0.02%, and has insertion or the disappearance of discontinuous base in quantity >=2 of distinguishing base between all genotype of described hybrid strain genotype and described rape variety to be measured or described distinguishing base.
3. method according to claim 1, is characterized in that, the number of described test zone meets the following conditions as BINOMDIST (SD*TN, TN, 0.80*SD, TRUE) >=95%, wherein, TN is the number of described test zone, and SD is decision threshold; The condition implication that the number of described test zone meets is: when the number of described test zone be TN, described decision threshold be SD and between described rape variety to be measured, the ratio of Differential genotype is 0.80*SD time, judge that the probability guarantee that the ratio of Differential genotype between described rape variety to be measured is less than described decision threshold SD is more than or equal to 95%.
4. method according to claim 1, it is characterized in that, to the method that two described rape varieties to be measured are sampled be respectively: the sampling sample obtaining two described rape varieties to be measured after the sample mixing to two described rape varieties difference random selecting more than 100 to be measured.
5. method according to claim 1, is characterized in that, judges that the method for the substantive derived relation between described rape variety to be measured is:
When between described rape variety to be measured during the ratio < SD of Differential genotype, two described rape varieties to be measured have substantive derived relation; When between described rape variety to be measured during the ratio >=SD of Differential genotype, two described rape varieties to be measured do not have substantive derived relation, and wherein, SD is decision threshold.
6. method according to claim 5, is characterized in that, when there is substantive derived relation if judge between two described rape varieties to be measured, and probability >=BINOMDIST (SD*TRN, TRN, OD, TRUE) that conclusion is correct; If when judging that two described rape varieties to be measured do not have substantive derived relation, probability >=BINOMDIST ((1-SD) * TRN, TRN, 1-OD, TRUE) that conclusion is correct; Wherein, TRN is the number of the total test zone of two described rape varieties to be measured, OD is the ratio of Differential genotype between described rape variety to be measured, BINOMDIST is excel 2010 function, BINOMDIST (SD*TRN, TRN, OD, TRUE) implication is: when the number of described total test zone is TRN, between described rape variety to be measured, the ratio OD of Differential genotype is less than the probability of described decision threshold SD; BINOMDIST ((1-SD) * TRN, TRN, 1-OD, TRUE) implication is: when the number of described total test zone is TRN, between described rape variety to be measured, the ratio OD of Differential genotype is greater than the probability of described decision threshold SD.
7. method according to claim 1, is characterized in that, determines that the method for described test zone is by described variant sites:
Pass through discrimination calculate the value of discrimination, wherein, a is the kind sum be detected in variation window area, bi is i-th kind of genotypic kind number in described variation window area, and bi>1, k comprises the genotypic number being greater than a kind, and described variation window area is centered by each single nucleotide variations site, and the both sides to described single nucleotide variations site respectively extend 1/2 of survey sequence length as the window detected;
Described test zone is the large and equally distributed region of described discrimination on region or nuclear genome that on cytoplasmic skeleton, discrimination is large.
CN201510150605.3A 2015-03-31 2015-03-31 Method for testing substantive derivation relation of oilseed rape varieties Pending CN104805192A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510150605.3A CN104805192A (en) 2015-03-31 2015-03-31 Method for testing substantive derivation relation of oilseed rape varieties

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510150605.3A CN104805192A (en) 2015-03-31 2015-03-31 Method for testing substantive derivation relation of oilseed rape varieties

Publications (1)

Publication Number Publication Date
CN104805192A true CN104805192A (en) 2015-07-29

Family

ID=53690388

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510150605.3A Pending CN104805192A (en) 2015-03-31 2015-03-31 Method for testing substantive derivation relation of oilseed rape varieties

Country Status (1)

Country Link
CN (1) CN104805192A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013053183A1 (en) * 2011-10-14 2013-04-18 深圳华大基因研究院 Method and system for genotyping predetermined region in nucleic acid sample
WO2014048062A1 (en) * 2012-09-28 2014-04-03 未名兴旺系统作物设计前沿实验室(北京)有限公司 Snp loci set and usage method and application thereof
CN104328507A (en) * 2014-10-11 2015-02-04 中国水稻研究所 SNP chip used for identifying rice variety, preparation method and application

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013053183A1 (en) * 2011-10-14 2013-04-18 深圳华大基因研究院 Method and system for genotyping predetermined region in nucleic acid sample
WO2014048062A1 (en) * 2012-09-28 2014-04-03 未名兴旺系统作物设计前沿实验室(北京)有限公司 Snp loci set and usage method and application thereof
CN104328507A (en) * 2014-10-11 2015-02-04 中国水稻研究所 SNP chip used for identifying rice variety, preparation method and application

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
RICHARD JUDSON: "How many SNPs does a genome-wide haplotype map require?", 《PHARMACOGENOMICS》 *
YVES ROUSSELLE, LIZ JONES, STEPHEN SMITH & BARRY NELSON ET AL.: "IDENTIFICATION OF SNP MARKERS TO AID ASSESSMENT OF ESSENTIAL DERIVATION IN MAIZE", 《WORKING GROUP ON BIOCHEMICAL AND MOLECULAR TECHNIQUES AND DNA-PROFILING IN PARTICULAR,FOURTEENTH SESSION》 *
何风华等: "二项分布在遗传学概率计算中的应用", 《生物学通报》 *

Similar Documents

Publication Publication Date Title
CN104263848B (en) A kind of deaf susceptibility gene mutation detection kit and preparation method thereof and application
CN104846076B (en) A method of specificity, consistency and the stability of measurement cross-bred rape new varieties
CN106834507A (en) DMD gene traps probe and its application in DMD detection in Gene Mutation
CN104830975A (en) Novel method for testing corn parent source authenticity and proportion
CN105603074A (en) Qualitative and quantitative microorganism detecting method
CN105603081B (en) Non-diagnosis-purpose qualitative and quantitative detection method for intestinal microorganisms
CN104805186B (en) A kind of method for testing corn variety substance derived relation
CN105603082A (en) Qualitative and quantitative detection method for paddy microorganisms
CN104805193A (en) Method for testing substantive derivation relation of rice varieties
CN105603075A (en) Qualitative and quantitative wheat microorganism detecting method
CN104498591A (en) Liquid chip-based quantitative determination method of Alu gene methylation level
CN104805192A (en) Method for testing substantive derivation relation of oilseed rape varieties
Liu et al. Quality control of next-generation sequencing-based in vitro diagnostic test for onco-relevant mutations using multiplex reference materials in plasma
CN104805195A (en) Novel method for testing rice parental source authenticity and proportion of rice parental source
CN104805188B (en) A kind of method for testing soybean varieties substance derived relation
CN111199776B (en) Method and device for evaluating analysis quality of tumor genome sequencing data and application
CN105603076A (en) Qualitative and quantitative soil microorganism detecting method
CN104805191B (en) A kind of method of the specificity for testing pure lines corn variety, uniformity and stability
CN104805187B (en) A kind of method of the specificity for testing pure lines new soybean varieties, uniformity and stability
CN104805185B (en) A kind of method of test plants kind substance derived relation
CN104878085A (en) New method for testing authenticity and proportion of parental origin of rape
CN104805189B (en) A kind of method of the specificity for determining hybrid plant new varieties, uniformity and stability
CN104805190B (en) A kind of method of the specificity for determining hybrid maize variety, uniformity and stability
CN104805182B (en) A kind of method for the specificity, uniformity and stability for determining new hybrid rice varieties
CN104805184B (en) A kind of method of the specificity for testing pure lines new rice variety, uniformity and stability

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by sipo to initiate substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20150729

RJ01 Rejection of invention patent application after publication