CN109266729B - Large fragment deletion detection method based on genome second-generation sequencing - Google Patents

Large fragment deletion detection method based on genome second-generation sequencing Download PDF

Info

Publication number
CN109266729B
CN109266729B CN201811147424.5A CN201811147424A CN109266729B CN 109266729 B CN109266729 B CN 109266729B CN 201811147424 A CN201811147424 A CN 201811147424A CN 109266729 B CN109266729 B CN 109266729B
Authority
CN
China
Prior art keywords
genome
plant
reads
deletion
sequencing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811147424.5A
Other languages
Chinese (zh)
Other versions
CN109266729A (en
Inventor
袁运栋
王永红
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Genetics and Developmental Biology of CAS
Original Assignee
Institute of Genetics and Developmental Biology of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Genetics and Developmental Biology of CAS filed Critical Institute of Genetics and Developmental Biology of CAS
Priority to CN201811147424.5A priority Critical patent/CN109266729B/en
Publication of CN109266729A publication Critical patent/CN109266729A/en
Application granted granted Critical
Publication of CN109266729B publication Critical patent/CN109266729B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Immunology (AREA)
  • Biotechnology (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Analytical Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention discloses a large fragment deletion detection method based on genome second-generation sequencing. The method provided by the invention is adopted to detect the deletion of large fragments (fragments larger than 100 bp), the comparison speed is high, the working efficiency is improved, and the analysis of the deletion of the large fragments of the whole genome can be completed within about 1 hour. The method lays a foundation for later analysis of candidate genes in the deletion segment and research of a regulation and control network. The invention fills the blank of the technology for detecting large fragment deletion of genome by a second-generation sequencing method.

Description

Large fragment deletion detection method based on genome second-generation sequencing
Technical Field
The invention relates to a large fragment deletion detection method based on genome second-generation sequencing.
Background
In order to quickly obtain the ideal mutant of the plant, a regulation network corresponding to the important phenotype of the plant is researched, and the seed of the plant can be mutagenized by physical or chemical factors and the like, so that the mutation time of the plant is shortened. Chemical mutagenesis, such as the mutagen Ethyl Methanesulfonate (EMS), can cause single base mutations, physical mutagenesis involving various radiation, etc., such as gamma radiation, can cause fragment deletions to plant DNA bases. With the maturity of the second-generation genome sequencing technology and the further reduction of the price, the method for analyzing cloned genes by using mixed packet analysis (BSA) based on the second-generation sequencing technology is easy to implement, the experimental efficiency of the method is greatly improved compared with the traditional map-based cloning technology, the single base mutation comparison technology generated by the second-generation sequencing is mature at present, but the deletion of large fragments of a genome caused by gamma rays is still difficult due to the technical limitation of the second-generation sequencing, the size of the fragments generated by the second-generation sequencing is generally 150bp at present, the analysis of the deletion of the fragments smaller than 150bp by using the traditional analysis software based on the second-generation sequencing is easy, but the analysis of the deletion of the fragments with the length larger than 150bp and even more than dozens of kbp is not easy, and no software can directly analyze the specific position information of the large fragment deletion so far.
Disclosure of Invention
The invention aims to provide a method for detecting large fragment deletion based on genome second-generation sequencing.
The invention firstly protects a method for detecting genome fragment deletion in plant mutants, which comprises the following steps (1) and (2):
(1) carrying out whole genome sequencing on the mutant to be detected;
(2) after the step (1) is completed, the following steps (a) to (d) are carried out on the sequencing result:
(a) cutting off 2/3-length sequences from the 5 '-3' direction of the complete sequences of all reads, aligning the rest parts to a plant reference genome, screening reads which can be aligned to the plant reference genome by 100%, and recording the alignment position of each read;
(b) cutting off 2/3-length sequences from 3 '-5' direction of the complete sequences of all reads obtained by screening in the step (a), aligning the rest parts to a plant reference genome, screening reads which can be aligned to the plant reference genome by 100%, and recording the aligned position of each read;
(c) comparing the complete sequences of all reads obtained by screening in the step (b) to a plant reference genome, removing the reads which can be completely compared to the plant reference genome by 100%, and reserving the rest of the reads;
(d) processing all reads retained in step (c) as follows: subtracting the position of the same read aligned to the genome in the steps (a) and (b), and if the value is greater than 2/3 of the sequencing read length, determining that the fragment is deleted;
the plant reference genome is a reference genome of a plant to which the mutant plant belongs.
The length of the deletion fragment is more than 100 bp.
The genome sequencing can specifically adopt second-generation sequencing. The second generation sequencing may be specifically 454 technology by Roch corporation, Solexa and Hiseq technology by illumina corporation, and Solid technology by ABI corporation.
The step (2) is realized by using sequence alignment software; the sequence alignment software was bowtie 2.
The step (2) further comprises a step (e): and (d) sequencing all sites with fragment deletion according to the size through the analysis of the step (d), and calculating the number of deletion sites of the whole genome.
The invention also provides a system for detecting deletion of a genome fragment in a plant mutant, which comprises a whole genome sequencing instrument, sequence alignment software and an instruction recorded with any one of the methods. The sequence alignment software may specifically be bowtie 2.
The length of the deletion fragment is more than 100 bp.
Any of the above mutants can be obtained by various methods, for example, by purchasing a mutant having the target phenotype from an existing mutant library, or by screening a mutant having the target phenotype by using EMS (ethyl methane sulfonate) mutagenesis, gamma ray mutagenesis, natural mutation, or the like.
The invention also protects the application of any one of the methods in the genome analysis of the plant mutant.
The invention also protects the application of any one of the systems in the genome analysis of the plant mutant.
The invention also protects the application of any one of the methods in the whole genome regulatory network research.
The invention also protects the application of any one of the systems in the whole genome regulatory network research.
Any of the above plant mutants may specifically be a rice mutant.
Any one of the plant reference genomes described above may specifically be a rice nipponica reference genome.
The method provided by the invention is adopted to detect the deletion of large fragments (fragments larger than 100 bp), the comparison speed is high, the working efficiency is improved, and the analysis of the deletion of the large fragments of the whole genome can be completed within about 1 hour. Can analyze the candidate gene in the deletion segment and lay the foundation for the research of the regulation and control network. The invention fills the blank of the technology for detecting large fragment deletion of genome by a second-generation sequencing method.
Drawings
FIG. 1 shows phenotypic observations of rice kittake and mutants.
Fig. 2 shows the result of the deletion detection verification.
Detailed Description
The following examples are given to facilitate a better understanding of the invention, but do not limit the invention. The experimental procedures in the following examples are conventional unless otherwise specified. The test materials used in the following examples were purchased from a conventional biochemical reagent store unless otherwise specified. The quantitative tests in the following examples, all set up three replicates and the results averaged.
Rice kittake (japonica rice): has a multi-tillering phenotype; reference documents: the expression of rice defense related genes under the stress of Laodelphax striatellus [ J ] crop report, 2012,38(9): 1625-; the public is available from the institute of genetics and developmental biology, academy of Chinese sciences.
Example 1 obtaining of mutagenic Material
1. Taking rice kittake seeds as raw materials, and carrying out gamma ray mutagenesis (taking the seeds and carrying out radiation treatment on the seeds by adopting gamma rays with the intensity of 25.00 c/kg).
2. Sowing and cultivating the seeds obtained in the step 1 into plants, and harvesting the seeds after the plants are selfed (M)1Seed generation).
3. Sowing and cultivating the seeds obtained in the step 2 into plants, and harvesting the seeds after the plants are selfed (M)2Seed generation).
4. And (4) sowing the seeds obtained in the step (3) and cultivating the seeds into plants, screening out one mutant strain with the tillering angle of more than 20 from the plants, and naming the mutant strain as a mutant Mu 1.
A photograph of the phenotypes of rice kittake and mutant Mu1 is shown in FIG. 1.
In practice, mutants can be obtained by any conventional method, for example, mutants can be purchased from a conventional mutant library, and desired mutants can be obtained by screening by methods such as EMS (ethyl methane sulfonate) mutagenesis, gamma ray mutagenesis, or natural mutagenesis.
Example 2 detection and analysis of Large fragment deletions
1. The mutant Mu1 obtained in example 1 was subjected to whole genome sequencing using a second generation sequencing technique. The whole genome sequencing may be performed by a second-generation sequencing technique such as 454 by Roch corporation, Solexa by illumina corporation, Hiseq technique, and Solid technique by ABI corporation.
2. After the step 1 is completed, the bowtie2 comparison software is adopted for comparison, and the method comprises the following steps:
(1) cutting off sequences of 2/3 length from 5 '-3' direction of all complete reads obtained by sequencing in the step 1, comparing the rest partial sequences to the Nipponbare reference genome of rice, screening all the reads sequences which can be compared on the genome of the rice by 100%, and recording the comparison position of each read;
(2) comparing all the complete reads obtained by screening in the step (1) from the 3 '-5' direction to 2/3-length sequences, comparing the rest sequences to the Nipponbare reference genome of rice, screening all the reads sequences which can be compared to the genome of the rice by 100%, and recording the comparison position of each read;
(3) comparing the complete sequences of all reads obtained by screening in the step (2) with the rice genome, removing the reads which can be compared with the rice genome by 100%, eliminating false positives, and reserving the rest of the reads;
(4) and (4) performing the following treatment on all reads reserved in the step (3): subtracting the position of the same read aligned to the genome in the steps (1) and (2), and if the value is greater than 2/3 of the sequencing read length, determining that the deletion exists;
(5) through the analysis of the step (4), all sites with deletion can be sorted according to size, and the number of deletion sites of the whole genome can be calculated.
Through the analysis, 39353bp of sequence is deleted in the No. 3 chromosome position 1848K-1888K interval of the mutant Mu 1.
3. To verify the results of steps 1 and 2, the genomic DNA of mutant Mu1 and the genomic DNA of rice kittake were PCR amplified using a primer pair consisting of primers 2f and 2r (designed for the deleted 39353bp fragment), respectively, and the amplified products were analyzed by electrophoresis.
2f:5’-TCATGTACTTTTAGCAGTA-3’;
2r:5’-TTTACGAGTAATATTTCC-3’。
Reaction system of PCR amplification: taq enzyme (Takara) 0.1. mu.l, 10 XBuffer 1. mu.l, dNTP (10 Mm/L): 1. mu.l, template 0.5. mu.l, primer 2f (10. mu.M/L) 0.5. mu.l, primer 2r (10. mu.M/L) 0.5. mu.l, water: 6.4 microliter.
Reaction conditions for PCR amplification: first 94 ℃ for 4 minutes, then 94 ℃ for 1 minute, 50 ℃ for 1 minute, 72 ℃ for 1 minute for 35 cycles, and finally 72 ℃ for 5 minutes.
If a deletion is true, a 150bp fragment can be amplified in the mutant Mu1(mutant), while in the control (rice kittake, WT), the PCR product size is 39353bp due to the absence of the deletion, and a band cannot be observed by electrophoresis due to the fact that the fragment is too large to be amplified. The results are shown in FIG. 2. The result shows that the large fragment deletion detection method based on the second-generation sequencing is correct and reliable.
The large fragment deletion detection is carried out by using commercially available mutants or mutants obtained by other mutagenesis methods, and the large fragment deletion condition of the mutants can be correctly detected through multiple experimental verification, so that the universality of the method is verified.

Claims (7)

1. A method for detecting genome fragment deletion in plant mutants comprises the following steps (1) and (2):
(1) carrying out whole genome sequencing on the mutant to be detected;
(2) after the step (1) is completed, the following steps (a) to (d) are carried out on the sequencing result:
(a) cutting off 2/3-length sequences from the 5 '-3' direction of the complete sequences of all reads, aligning the rest parts to a plant reference genome, screening reads which can be aligned to the plant reference genome by 100%, and recording the alignment position of each read;
(b) cutting off 2/3-length sequences from 3 '-5' direction of the complete sequences of all reads obtained by screening in the step (a), aligning the rest parts to a plant reference genome, screening reads which can be aligned to the plant reference genome by 100%, and recording the aligned position of each read;
(c) comparing the complete sequences of all reads obtained by screening in the step (b) to a plant reference genome, removing the reads which can be completely compared to the plant reference genome by 100%, and reserving the rest of the reads;
(d) processing all reads retained in step (c) as follows: subtracting the position of the same read aligned to the genome in the steps (a) and (b), and if the value is greater than 2/3 of the sequencing read length, determining that the fragment is deleted;
the plant reference genome is a reference genome of a plant to which the mutant plant belongs; the length of the deletion fragment is more than 100 bp;
the genome sequencing adopts second-generation sequencing.
2. The method of claim 1, wherein: the step (2) is realized by using sequence alignment software; the sequence alignment software was bowtie 2.
3. A system for detecting genomic fragment deletions in plant mutants comprising a whole genome sequencing instrument, sequence alignment software and instructions describing the method of claim 1 or 2; the length of the deletion fragment is more than 100 bp.
4. Use of the method of claim 1 or 2 for genomic analysis of plant mutants.
5. Use of the system of claim 3 for the genomic analysis of plant mutants.
6. Use of the method of claim 1 or 2 in genome-wide regulatory network research.
7. Use of the system of claim 3 in genome-wide regulatory network research.
CN201811147424.5A 2018-09-29 2018-09-29 Large fragment deletion detection method based on genome second-generation sequencing Active CN109266729B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811147424.5A CN109266729B (en) 2018-09-29 2018-09-29 Large fragment deletion detection method based on genome second-generation sequencing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811147424.5A CN109266729B (en) 2018-09-29 2018-09-29 Large fragment deletion detection method based on genome second-generation sequencing

Publications (2)

Publication Number Publication Date
CN109266729A CN109266729A (en) 2019-01-25
CN109266729B true CN109266729B (en) 2020-11-27

Family

ID=65194744

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811147424.5A Active CN109266729B (en) 2018-09-29 2018-09-29 Large fragment deletion detection method based on genome second-generation sequencing

Country Status (1)

Country Link
CN (1) CN109266729B (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103617256A (en) * 2013-11-29 2014-03-05 北京诺禾致源生物信息科技有限公司 Method and device for processing file needing mutation detection
CN105483244A (en) * 2015-12-28 2016-04-13 武汉菲沙基因信息有限公司 Super-long genome-based variation detection algorithm and detection system
CN106355045A (en) * 2016-08-30 2017-01-25 天津诺禾致源生物信息科技有限公司 Amplicon next-generation sequencing based small fragment insertion and deletion detection method and device
CN106951731A (en) * 2017-03-28 2017-07-14 上海至本生物科技有限公司 A kind of large fragment insertion or the Forecasting Methodology and system of missing
CN107229839A (en) * 2017-05-25 2017-10-03 西安电子科技大学 A kind of Indel detection methods based on new-generation sequencing data
AU2018203684A1 (en) * 2010-12-30 2018-06-14 Foundation Medicine, Inc. Optimization of multigene analysis of tumor samples
CN108220404A (en) * 2018-02-06 2018-06-29 东莞博奥木华基因科技有限公司 A kind of method and system for identifying DNA large fragment deletions

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140052381A1 (en) * 2012-08-14 2014-02-20 Life Technologies Corporation Systems and Methods for Detecting Homopolymer Insertions/Deletions

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2018203684A1 (en) * 2010-12-30 2018-06-14 Foundation Medicine, Inc. Optimization of multigene analysis of tumor samples
CN103617256A (en) * 2013-11-29 2014-03-05 北京诺禾致源生物信息科技有限公司 Method and device for processing file needing mutation detection
CN105483244A (en) * 2015-12-28 2016-04-13 武汉菲沙基因信息有限公司 Super-long genome-based variation detection algorithm and detection system
CN106355045A (en) * 2016-08-30 2017-01-25 天津诺禾致源生物信息科技有限公司 Amplicon next-generation sequencing based small fragment insertion and deletion detection method and device
CN106951731A (en) * 2017-03-28 2017-07-14 上海至本生物科技有限公司 A kind of large fragment insertion or the Forecasting Methodology and system of missing
CN107229839A (en) * 2017-05-25 2017-10-03 西安电子科技大学 A kind of Indel detection methods based on new-generation sequencing data
CN108220404A (en) * 2018-02-06 2018-06-29 东莞博奥木华基因科技有限公司 A kind of method and system for identifying DNA large fragment deletions

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
"AGE: defining breakpoints of genomic structural variants at single-nucleotide resolution, through optimal alignments with gap excision";Alexej Abyzov et al.;《BIOINFORMATICS》;20110113;第27卷(第5期);第595-603页,参见第595页左栏第2段 *
"Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads";Kai Ye et al.;《BIOINFORMATICS》;20090626;第25卷(第21期);第2865-2871页,参见第2865页摘要、2866页第2.2节、2.3节及图1、第2867页右栏第2.5节 *
"基于二代测序的基因组缺失变异综合检测策略";管瑞;《中国优秀硕士学位论文全文数据库》;20160315(第03期);第1-54页 *

Also Published As

Publication number Publication date
CN109266729A (en) 2019-01-25

Similar Documents

Publication Publication Date Title
Nadeem et al. DNA molecular markers in plant breeding: current status and recent advancements in genomic selection and genome editing
Della Coletta et al. How the pan-genome is changing crop genomics and improvement
Geng et al. Rapid identification of candidate genes for seed weight using the SLAF-Seq method in Brassica napus
Ren et al. Construction of a high-density DArTseq SNP-based genetic map and identification of genomic regions with segregation distortion in a genetic population derived from a cross between feral and cultivated-type watermelon
JP5960917B1 (en) Rice whole genome breeding chip and its application
Li et al. Frequency and type of inheritable mutations induced by γ rays in rice as revealed by whole genome sequencing
US6484105B2 (en) Method for obtaining a plant with a genetic lesion in a gene sequence
Xu et al. Whole-genome resequencing: changing the paradigms of SNP detection, molecular mapping and gene discovery
CN112094937B (en) SNP molecular marker related to pod and seed size on peanut A06 chromosome and application thereof
Sato History and future perspectives of barley genomics
US10106849B2 (en) High throughput method of screening a population for members comprising mutation(s) in a target sequence using alignment-free sequence analysis
Fitzgerald et al. An assessment of heavy ion irradiation mutagenesis for reverse genetics in wheat (Triticum aestivum L.)
Kim et al. Development of a high-throughput SNP marker set by transcriptome sequencing to accelerate genetic background selection in Brassica rapa
Pereira et al. Patterns of DNA methylation changes in elite Eucalyptus clones across contrasting environments
Lyu et al. Pan-genome analysis sheds light on structural variation-based dissection of agronomic traits in melon crops
US20070048768A1 (en) Methods for screening for gene specific hybridization polymorphisms (GSHPs) and their use in genetic mapping and marker development
Lyu et al. TEAseq-based identification of 35,696 Dissociation insertional mutations facilitates functional genomic studies in maize
CN109266729B (en) Large fragment deletion detection method based on genome second-generation sequencing
US20190241981A1 (en) Plant breeding using next generation sequencing
CN111926102A (en) Molecular marker of rice photo-thermo-sensitive male sterility gene pms3 and application thereof
Paterson et al. Structural genomics and genome sequencing
Kim et al. Development of SNP-based molecular markers by re-sequencing strategy in peanut
Taranto et al. An overview of genotyping by sequencing in crop species and its application in pepper
CN107794261B (en) Molecular marker closely linked with major QTL (quantitative trait loci) of rape grain number per pod and application thereof
Liu et al. Construction and characterization of a bacterial artificial chromosome library for Gossypium mustelinum

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant