WO2012083845A1 - Methods for removal of vector fragments in sequencing library and uses thereof - Google Patents

Methods for removal of vector fragments in sequencing library and uses thereof Download PDF

Info

Publication number
WO2012083845A1
WO2012083845A1 PCT/CN2011/084303 CN2011084303W WO2012083845A1 WO 2012083845 A1 WO2012083845 A1 WO 2012083845A1 CN 2011084303 W CN2011084303 W CN 2011084303W WO 2012083845 A1 WO2012083845 A1 WO 2012083845A1
Authority
WO
WIPO (PCT)
Prior art keywords
library
sequencing
sequencing library
probe
vector
Prior art date
Application number
PCT/CN2011/084303
Other languages
French (fr)
Chinese (zh)
Inventor
胡帅星
武靖华
樊帆
陈琳
赵美茹
王俊
汪建
Original Assignee
深圳华大基因科技有限公司
深圳华大基因研究院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳华大基因科技有限公司, 深圳华大基因研究院 filed Critical 深圳华大基因科技有限公司
Publication of WO2012083845A1 publication Critical patent/WO2012083845A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing

Definitions

  • the present invention relates to the field of molecular biology, particularly in the field of genome sequencing, and in particular, to methods for removing carrier fragments in sequencing libraries and uses thereof. More specifically, the present invention provides a method for removing a vector fragment in a sequencing library, a method for sequencing a genomic clone library, and a kit for genome sequencing. Background technique
  • De nove sequencing is also called de novo sequencing. It can sequence the genome of a certain species without any gene sequence information. After sequencing, bioinformatics analysis method can be used to splicing and assembling the sequences obtained by sequencing. A genomic sequence map of the species is obtained, which can be applied to other genomics, genetic studies of the species. Therefore, the e wove sequencing of biological organisms is of great significance.
  • Illumina's DNA sequencing platform specifically, by preparing a sequencing library of the organism and then sequencing it.
  • the sequencing library preparation method based on Illumina's DNA sequencing platform is divided into small-sequence DNA sequencing library preparation and large-segment DNA sequencing library preparation.
  • Fosmid clone library as a template (also sometimes referred to as construction in this paper).
  • the Fosmid clone library is obtained by transferring the total DNA of a certain organism together with the Fosmid vector into a host cell in a recombinant form, and then forming a plurality of clones by cell proliferation.
  • a suitable length of the insertable genomic DNA fragment is about 40 kb.
  • Construction of a Fosmid small fragment sequencing library refers to the preparation of a sequencing library by mixing a certain number of Fosmid clones together as a library template.
  • the Fosmid vector was also disrupted into a small number of small fragments of DNA below lkb due to disruption of the Fosmid clone during the database construction process. Therefore, the sequencing library contains a large number of Fosmid vector fragments. Therefore, when the entire sequencing library is sequenced, a large amount of unnecessary vector sequence data is generated, and the unnecessary sequence data also affects subsequent data analysis, resulting in Sequencing resources are wasted, sequencing costs are high, and efficiency is low.
  • the present invention aims to solve at least one of the technical problems existing in the prior art.
  • the present invention provides methods for removing vector fragments in a sequencing library and uses thereof, so that the Fosmid vector fragment will be removed prior to sequencing the sequencing library using a Solexa DNA sequencer to avoid sequencing the Fosmid vector. Reduce unnecessary data reading and analysis, which can significantly reduce the cost of sequencing and improve sequencing efficiency.
  • the invention provides a method for removing a vector fragment in a sequencing library.
  • the method comprises the steps of:
  • a labeled probe is provided that is capable of hybridizing to a vector fragment.
  • the vector is a Fosmid vector.
  • hybridization refers to two single-stranded nucleic acid molecules having complementary sequences to each other under certain conditions (suitable temperature and ionic strength, etc.) annealed to form a double strand according to the principle of complementary pairing of bases.
  • the process of nucleic acids also known as nucleic acid hybridization.
  • the nucleic acid hybridization can be carried out between Li A-Li A or between Li A-RNA or RNA-RNA, and base pairing can be performed as long as there is a complementary sequence in both of the hybrids.
  • nucleic acid hybridization includes solid-liquid phase hybridization and liquid phase hybridization, wherein liquid phase hybridization is a hybridization reaction carried out in a solution, which means that a nucleic acid molecule to be tested is hybridized with a known nucleic acid molecule (probe) in a solution to form a hybrid. Complex.
  • vector fragment may be the entire Fosmid vector, or a fragment of all vectors formed by disrupting the entire Fosmid vector, whereby the expression “probe” and vector Fragmentation of the fragment "represents that the sequence of the probe can be complementary to the entire Fosmid vector sequence, thereby enabling hybridization with the entire Fosmid vector, or the sequence of the probe can be complementary to the sequence of the fragments of all of the above vectors, thereby enabling fragmentation with all vectors. Hybridization is carried out.
  • the probe is hybridized to the sequencing library such that the probe and the vector fragment form a labeled double-stranded nucleic acid.
  • the hybridized duplex can be isolated and detected using the label on the probe, wherein the label is known in the art and can include, but is not limited to, Radioactive co-location Or a nuclides, biotin, acridinium ester or polyA.
  • the label is known in the art and can include, but is not limited to, Radioactive co-location Or a nuclides, biotin, acridinium ester or polyA.
  • corresponding nucleic acid isolation and detection methods known in the art can be utilized, including but not limited to hydroxyapatite (HAP) or affinity adsorption (see, for example, Henegariu 0 et al. (1999).
  • HAP hydroxyapatite
  • affinity adsorption see, for example, Henegariu 0 et al. (1999).
  • PCR 3 PCR in situ hybridization: a practical approach, Volume 3. Oxford: Oxford University Press, which is incorporated herein by reference in its entirety, the hybridization product is isolated and tested.
  • the expression "probe and vector fragment forms a labeled double-stranded nucleic acid” means that the probe can hybridize with the entire Fosmid vector to form a labeled double-stranded nucleic acid, or it can be a probe and all of the above vectors.
  • the fragment hybridizes to the labeled double-stranded nucleic acid.
  • the probe is subjected to a liquid phase hybridization reaction with a sequencing library.
  • the probe can be hybridized with the sequencing library in a mass ratio of from about 1:1 to about 2:1.
  • it may further comprise adding a linker blocking agent to the sequencing library.
  • the ratio of linker blocking agent to sequencing library may range from about 0.3 pM/ng to 0.8 pM/ng. According to a specific example of the invention, the ratio of the linker blocking agent to the sequencing library is 0.5 pM/ng. According to an embodiment of the present invention, the hybridization reaction may take a time of 1, 4, 16, 24 or more hours.
  • the labeled double-stranded nucleic acid is then removed using the labeled molecular entity on the specific binding probe to remove the vector fragment in the sequencing library.
  • the aforementioned labeled probe may be a biotinylated probe and the molecular entity is avidin.
  • the molecular entity is streptavidin, and according to a specific example of the invention, the molecular entity is formed on the magnetic beads.
  • the inventors have surprisingly found that the method for removing vector fragments in a sequencing library according to an embodiment of the present invention enables efficient removal of vector fragments in a Fosmid small fragment sequencing library for sequencing based on a high throughput sequencing platform such as Illumina Solexa
  • the platform after sequencing the processed sequencing library, can effectively avoid waste of sequencing resources and generate unnecessary sequencing data, ultimately reducing the cost of sequencing and improving the sequencing efficiency.
  • the method for removing a vector fragment in a sequencing library of the present invention may comprise the following steps:
  • the vector is used to construct a genomic clone library, and is preferably a Fosmid vector.
  • the probe is labeled with biotin and the molecular entity that specifically binds to the label is avidin.
  • the molecular entity is streptavidin, and according to one embodiment of the present invention, streptavidin may be formed on the magnetic beads.
  • the probe is subjected to a liquid phase hybridization reaction with a sequencing library.
  • the biotinylated probe can be prepared by the following steps:
  • the PCR product can be fragmented using a Covaris instrument.
  • the PCR product is specifically fragmented into fragments of about 300 bp according to the present invention.
  • the hybridization reaction can be carried out by denaturation of the probe and the sequencing library at 95 °C, followed by annealing at 65 °C.
  • the probe and the sequencing library can be subjected to a hybridization reaction at a mass ratio of about 1:1 to about 2:1.
  • a linker blocker can optionally be added to the sequencing library prior to hybridization of the probe to the sequencing library.
  • the linker blocking agent is a nucleic acid capable of hybridizing to a linker used in the construction of the sequencing library, and preferably, the nucleotide sequence of the linker blocker is completely complementary to the nucleotide sequence of the linker, thereby When the probe is hybridized to the sequencing library, the linker is hybridized to the linker to avoid hybridization of the linker to the probe and pairing between the linkers.
  • the ratio of the linker blocking agent to the library may be from about 0.3 pM/ng to about 0.8 pM/ng, for example 0.5 pM/ng.
  • the amount of the linker blocking agent added is too large, the removal efficiency of the vector fragment in the library is lowered.
  • the amount of the linker blocking agent added is too small, a large amount of non-vector library DNA fragments are captured by the probe. Remove.
  • the hybridization can be carried out for 1, 4, 16, 24 or more hours.
  • the present invention provides a method for sequencing a genomic clone library, comprising the steps of:
  • a sequencing library is constructed using a genomic clone library in which a vector fragment is included in the sequencing library.
  • the genomic clone library is a Fosmid clone library and the vector is a Fosmid vector.
  • the sequencing library can be constructed by: fragmenting the DNA in the genomic clone library to obtain a DNA fragment; ligating the linker to both ends of the DNA fragment to obtain a ligation product; and PCR-amplifying the ligation product , in order to obtain an amplification product; and recovering and purifying the amplification product, the amplification product constituting a sequencing library.
  • the vector fragment in the sequencing library is removed using a method for removing a vector fragment in a sequencing library according to an embodiment of the present invention.
  • the vector fragment in the sequencing library may be removed after the sequencing library is constructed, or during the process of constructing the sequencing library; according to a specific example of the present invention, after the DNA in the genomic clone library is fragmented , removing the vector fragment in the sequencing library.
  • sequencing library after removal of the vector fragment was sequenced.
  • the method of sequencing is not particularly limited. According to one embodiment of the invention, sequencing can be performed using a Solexa sequencer.
  • the method for sequencing a genomic clone library according to an embodiment of the present invention can efficiently sequence a genomic clone library, that is, a Fosmid small fragment sequencing library, since the method can effectively remove Fosmid small fragments.
  • a genomic clone library that is, a Fosmid small fragment sequencing library
  • the method can be effectively applied to the centralized commercial multi-species genome sequencing which is carried out on a large scale at present, thereby saving cost and improving efficiency, and further, capable of performing genomic clone library according to an embodiment of the present invention.
  • the method of sequencing is applied to new areas such as analysis, disease diagnosis, and personalized (individualized) medicine.
  • the method for sequencing a genomic clone library of the present invention may comprise the following steps:
  • the genomic clone library is a Fosmid clone library and the vector is a Fosmid vector.
  • a method for removing a vector fragment in a sequencing library according to the present invention can be utilized to remove a vector fragment in a sequencing library.
  • the construction of the sequencing library can be performed by the following steps:
  • the vector fragment in the sequencing library can be removed after construction of the sequencing library. Root According to some embodiments of the invention, the vector fragments in the sequencing library can be removed during the construction of the sequencing library. According to a specific example of the present invention, the vector fragment can be removed after fragmenting the DNA in the genomic library.
  • the sequencing library can be sequenced using a Solexa sequencer.
  • the present invention provides a kit for genome sequencing, comprising: a vector for constructing a genomic clone library; a labeled probe capable of interacting with the vector or The fragment is hybridized; and a molecular entity capable of specifically binding to the label on the probe.
  • the vector is a Fosmid vector.
  • the probe is a biotinylated probe and the molecular entity is avidin.
  • the molecular entity is streptavidin.
  • the molecular entity is streptavidin formed on the magnetic beads.
  • kits of the invention may also include other reagents required for genomic clone library construction, sequencing, etc., which may further include linkers and linker blocking agents in accordance with embodiments of the present invention.
  • linkers and linker blocking agents in accordance with embodiments of the present invention.
  • a kit according to an embodiment of the present invention can be effectively used for genome sequencing, and in particular, a vector in a kit can be used to efficiently construct a library of genomic clones, thereby utilizing labeled probes and molecules in the kit.
  • the entity is capable of efficiently removing the vector fragments in the library, thereby enabling efficient genome sequencing of the library from which the vector fragments are removed based on a high throughput sequencing platform.
  • the application of the kit of the present invention to genome sequencing can effectively avoid waste of sequencing resources and generate unnecessary sequencing data, thereby reducing the cost of sequencing, improving the sequencing efficiency, and obtaining accurate sequencing results and easy data analysis.
  • the kit for genome sequencing of the present invention may comprise: a vector for constructing a genomic clone library, a labeled probe capable of hybridizing with a vector or a fragment thereof, and capable of specifically binding The labeled molecular entity of the probe.
  • the vector is a Fosmid vector.
  • the probe is labeled with biotin and the molecular entity that specifically binds to the label is avidin, and according to a specific example of the invention, the preferred molecular entity is streptavidin.
  • streptavidin may be formed on the magnetic beads.
  • the kit may also include other reagents including, but not limited to, linkers and linker blocking agents.
  • the probe was prepared by a PCR reaction using a Fosmid vector (the sequence of which is shown in SEQ ID NO: 1) as a template.
  • the reaction system of PCR is as follows:
  • the dNTP mixed solution used is a mixed solution of biotin-dNTP and normal dNTP in a ratio of 15:85, and the final concentration is 1 ⁇ .
  • the forward primer for preparing the probe was: 5'-CCTGGGGTGCCTAATGAGTG-3' (SEQ ID NO: 2).
  • Reverse primer for the preparation of the probe was: 5'-CGTCGTTTTACAACGTCGTGA-3' (SEQ ID NO: 3).
  • the PCR reaction conditions were: 95 ° C, 2 minutes; 12 cycles of 95 ° C, 30 seconds, 65 ° C, 30 seconds, 72 ° C, 8 minutes; 72 ° C, 10 minutes; 4 ° C storage.
  • the PCR product was purified using the DNA clean and ConcentratorTM-25 kit, and then the PCR product was disrupted using a Covaris S2 instrument to obtain a DNA fragment, which was then determined by gel electrophoresis to determine the size of the DNA fragment.
  • the test results showed that the size of the DNA fragment was mainly about 300 bp.
  • the DNA fragment was purified by MinElute PCR Purification Kit and dissolved in 20 ⁇ l elution buffer to obtain a biotin-labeled probe. Then, use Qubit (HS) to quantify the concentration of the probe and set aside.
  • HS Qubit
  • the Fosmid clone library of Puccinia striiformis f.sp 'rid genomic DNA was prepared using the CopyControlTM HTP Fosmid Library Construction Kit (Epicentre, USA) according to the manufacturer's detailed instructions.
  • SC hybridization buffer and SC hybridization component A are derived from the sequence capture hybridization kit.
  • the amount of the library was 120 ng
  • the amount of the probe was 120 ng (quantified by Qubit)
  • a linker blocking agent was added to the hybridization system.
  • the linker blocker was obtained from the tag sequencing primer kit (Illumina) and the mixed Rd2 sequencing primer (Multixing Rd2 Sequencing Primer kit, Illumina), wherein the sequence of the linker blocker is as follows:
  • Linker - sputum 1 5'-AATGATACGGCGACCACCGAGATCTACACTCTTTCCC TACACG ACGCTCTTCCGATCT-3' (SEQ ID NO: 4);
  • Linker - sputum 2 5'-ACAAGCAGAAGACGGCATACGAGATAAGCAATGGTGA CTGGA GTTCAGACGTGTGCTCTTCCGATCT-3' (SEQ ID NO: 5).
  • the amount of both linker blocking agents was 0.06 nM, and the ratio of the amount of the linker to the library was 0.5 pM/ng.
  • hybridization conditions are: denaturation at 95 ° C 10
  • hybridization was carried out at 65 ° C according to the time specified in the table below to obtain a hybridization product, which was used.
  • Library DNA (i.e., vector fragment) capable of hybridizing with the probe in the above hybridization product was captured and isolated using streptavidin magnetic beads (M280 magnetic beads) according to the manufacturer's instructions, and then derived from samples 1 and 2
  • streptavidin magnetic beads M280 magnetic beads
  • library DNA i.e., vector fragments hybridized with the probes from samples 1 and 2, respectively, were collected and labeled 24-1-V and 4- 12-V.
  • the library DNAs obtained above namely 24-l-D, 4- 12-D, 24-1-V and 4- 12-V, were purified by QIAquick PCR Purification Kit according to the manufacturer's instructions.
  • the library DNA obtained above was subjected to PCR amplification using PFX enzyme, wherein the PCR reaction system was as follows:
  • the forward primer for library DNA amplification is: 5'-CAAGCAGAAGACGGCATACGA -3' (SEQ ID NO: 6)
  • reverse primer for library DNA amplification 5'- AATGATACGGCGACCACCGAGATC -3' (SEQ ID NO: 7).
  • the PCR reaction conditions were: 94 ° C, 2 minutes; 12 cycles of 94 ° C, 15 seconds, 58 ° C, 30 seconds, 72 ° C, 30 seconds; 72 ° C, 5 minutes; 4 ° C storage. Thereby, an amplification product was obtained.
  • the amplified product was purified, dissolved in 30 ⁇ l of ultrapure water, and the concentration of the obtained amplification product was measured by Nanodrop, and the results were as follows:
  • the effect of removing the vector fragment in the library by the method for removing the vector fragment in the sequencing library of the present invention is analyzed by Q-PCR.
  • a sample which was not treated by the method of the present invention was selected as a control.
  • the untreated sample and the treated sample were diluted to the same concentration, and Q-PCR was performed to detect the content of the vector fragment in the two samples.
  • the degree of enrichment indicates a multiple of the amount of the carrier fragment in the control sample relative to the amount of the carrier fragment in the treated sample, and therefore, the effect of carrier removal can be obtained from the value of the enrichment degree, for example, when the enrichment degree is N , which indicates that the sample treated by the method of the present invention removes the 1-1/N carrier fragment.
  • the library DNA from which the vector fragment was removed i.e., the above 24-1-1D and 4112-D
  • the library DNA from which the vector fragment was not removed i.e., the sequencing library of the wheat stripe rust DNA
  • the Q-PCR reaction system is as follows:
  • SYBR Premix and Rox Reference Dye II are from SYBR@ Premix Ex TaqTM Among them, the forward primer for Q-PCR is: 5'-TTgTTCCCACgCCTGCTGAgTTGT-3' (SEQ ID NO: 8), and the reverse primer for Q-PCR is: 5'-ATCCCgAATTTgCTCCTCCATCCAC-3' (SEQ ID NO : 9 Q-PCR reaction conditions are: 95 ° C, 30 seconds; 40 cycles of 95 ° C, 15 seconds, 60 ° C, 1 minute.
  • the degree of enrichment of the vector fragment is greater than 5, which indicates that the content of the vector fragment in the sample treated by the method of the present invention is less than 1/5 of that of the control sample, thereby indicating that the method of the present invention removes the initial library. At least 80% of the vector fragments.
  • the probe prepared by the method for preparing a probe used in Example 1 was used.
  • a Fosmid clone library was constructed using blood genomic DNA from a Chinese adult male, and then a multi-sample preparation oligonucleotide kit (Illumina, PE-400-) was used according to the manufacturer's instructions. 1002) A sequencing library was constructed, called a YH Fosmid sequencing library. Then, the probe prepared above and the YH Fosmid sequencing library (hereinafter sometimes referred to simply as "library” in the present example) were prepared and hybridized according to the ratios in the following table:
  • Joint sealer 2 0.025 nM
  • the water was made up to 15 ⁇ wherein, in the above hybrid system, the mass ratio of the library to the probe was 1:2, and the linker blocking agent was the same as the linker of Example 1.
  • the hybridization conditions were: denaturation at 95 ° C for 10 minutes, followed by hybridization at 65 ° C for 24 hours, thereby obtaining a hybridization product, which was used.
  • the library DNA capable of hybridizing with the probe that is, the vector fragment
  • the library DNA from which the vector fragment is removed is collected. It was labeled as YH-DNA-1 and its concentration was determined. The results are as follows:
  • the library DNA (i.e., vector fragment) that hybridizes to the probe is collected and labeled as YH-Vector-1.
  • the obtained library DNA was purified using QIAquick PCR Purification Kit according to the manufacturer's instructions and used.
  • the method of removing the vector fragment in the library in this example was analyzed by Q-PCR.
  • the effect of Q-PCR is as follows:
  • the degree of enrichment of the vector fragment is greater than 5, indicating the sample treated by the method of the present invention.
  • the content of the carrier fragment was less than 1/5 of the control sample.
  • the method of the invention removes at least 80% of the vector fragments in the original library.
  • the method for removing a vector fragment in a sequencing library of the present invention, a method for sequencing a genomic clone library, and a kit for genome sequencing can effectively remove a vector fragment in a Fosmid small fragment sequencing library, thereby enabling It can be applied to the sequencing of genomic clone libraries, which can reduce the cost of sequencing and improve the efficiency of sequencing and data analysis.

Abstract

Methods for removal of vector fragments in sequencing library, methods for sequencing clone library of genomes and kits for sequencing genomes are provided. Wherein methods for removal of vector fragments in sequencing library include: providing labeled probes which are able to hybridize with vector fragments; hybridizing the probes with the sequencing library, so that the probes and the vector fragments form labeled double-stranded nucleic acid; and utilizing labeled molecular entity on specific binding probes to remove labeled double-stranded nucleic acid, accordingly remove vector fragments in sequencing library.

Description

用于除去测序文库中载体片段的方法及其用途 优先权信息  Method for removing carrier fragments in a sequencing library and its use
本申请请求 2010 年 12 月 22 日向中国国家知识产权局提交的、 专利申请号为 201010600214.4的专利申请的优先权和权益, 并且通过参照将其全文并入此处。 技术领域  The present application claims priority to and the benefit of the patent application No. 201010600214.4 filed on Dec. 22, 2010, to the Chinese National Intellectual Property Office, which is hereby incorporated by reference in its entirety. Technical field
本发明涉及分子生物学领域, 特别是基因组测序领域, 具体地, 本发明涉及用于 除去测序文库中载体片段的方法及其用途。 更具体地, 本发明提供了一种用于除去测序 文库中的载体片段的方法、 一种用于对基因组克隆文库进行测序的方法以及一种用于基因 组测序的试剂盒。 背景技术  The present invention relates to the field of molecular biology, particularly in the field of genome sequencing, and in particular, to methods for removing carrier fragments in sequencing libraries and uses thereof. More specifically, the present invention provides a method for removing a vector fragment in a sequencing library, a method for sequencing a genomic clone library, and a kit for genome sequencing. Background technique
De nove测序法也称从头测序法, 其不需要任何基因序列信息即可对某个物种的基 因组进行测序, 在测序后, 用生物信息学的分析方法对测序获得的序列进行拼接、 组 装, 能够获得该物种的基因组序列图谱, 从而能够应用于该物种的其它基因组学、 遗 传学研究。 由此, 对生物进行 )e wove测序意义重大。  De nove sequencing is also called de novo sequencing. It can sequence the genome of a certain species without any gene sequence information. After sequencing, bioinformatics analysis method can be used to splicing and assembling the sequences obtained by sequencing. A genomic sequence map of the species is obtained, which can be applied to other genomics, genetic studies of the species. Therefore, the e wove sequencing of biological organisms is of great significance.
现阶段常利用 Illumina公司的 DNA测序平台对生物进行 De nove测序, 具体地, 主要通过制备该生物的测序文库, 然后对其进行测序实现。 其中, 根据测序片段的大 小, 基于 Illumina公司的 DNA测序平台的测序文库制备方法分为小片段 DNA的测序 文库制备和大片段 DNA 的测序文库制备。 而为了解决非单倍体基因组序列杂合度过 高、 De novo测序序列拼接组装难度大的问题, 目前通常釆用以 Fosmid克隆文库为模 板构建 DNA小片段测序文库 (在本文中有时也称为构建 Fosmid小片段测序文库) 的 方法。 其中, Fosmid克隆文库是通过将某种生物的总 DNA与 Fosmid载体一起以重组 的形式转移到宿主细胞中, 然后通过细胞增殖形成多个克隆的整体而得到的。 对于 Fosmid载体, 可插入的基因组 DNA片段的合适的长度为大约 40 kb。 构建 Fosmid小 片段测序文库, 是指将一定数量的 Fosmid克隆混合到一起作为建库模板进行测序文库 的制备。  At this stage, De nove sequencing of organisms is often carried out using Illumina's DNA sequencing platform, specifically, by preparing a sequencing library of the organism and then sequencing it. Among them, according to the size of the sequencing fragment, the sequencing library preparation method based on Illumina's DNA sequencing platform is divided into small-sequence DNA sequencing library preparation and large-segment DNA sequencing library preparation. In order to solve the problem that the heterozygosity of the non-haploid genome sequence is too high and the De novo sequencing sequence is difficult to assemble and assemble, it is common to construct a DNA small fragment sequencing library using the Fosmid clone library as a template (also sometimes referred to as construction in this paper). Method of Fosmid Small Fragment Sequencing Library). Among them, the Fosmid clone library is obtained by transferring the total DNA of a certain organism together with the Fosmid vector into a host cell in a recombinant form, and then forming a plurality of clones by cell proliferation. For Fosmid vectors, a suitable length of the insertable genomic DNA fragment is about 40 kb. Construction of a Fosmid small fragment sequencing library refers to the preparation of a sequencing library by mixing a certain number of Fosmid clones together as a library template.
然而, 目前构建 Fosmid小片段测序文库的方法仍有待改进。 发明内容 However, the current method for constructing a Fosmid small fragment sequencing library remains to be improved. Summary of the invention
本发明是基于发明人的下列发现而完成的:  The present invention has been completed based on the following findings of the inventors:
当利用 Solexa DNA测序仪对由 Fosmid克隆文库建立的测序文库进行测序时, 由 于在建库过程中对 Fosmid克隆进行了打断处理, Fosmid 载体也被打断成了许多 lkb 以下的小片段 DNA, 因而测序文库中包含有大量的 Fosmid载体片段, 因此, 当对整 个测序文库进行测序时, 会产生大量的不需要的载体序列数据, 并且这些不需要的序 列数据还会影响后续的数据分析, 造成测序资源浪费、 测序成本高、 效率低。  When the sequencing library established by the Fosmid clone library was sequenced using the Solexa DNA sequencer, the Fosmid vector was also disrupted into a small number of small fragments of DNA below lkb due to disruption of the Fosmid clone during the database construction process. Therefore, the sequencing library contains a large number of Fosmid vector fragments. Therefore, when the entire sequencing library is sequenced, a large amount of unnecessary vector sequence data is generated, and the unnecessary sequence data also affects subsequent data analysis, resulting in Sequencing resources are wasted, sequencing costs are high, and efficiency is low.
本发明旨在至少解决现有技术中存在的技术问题之一。 由此, 本发明提供了用于 除去测序文库中的载体片段的方法及其用途, 以便在利用 Solexa DNA测序仪对测序文 库进行测序前, 将 Fosmid载体片段将去除, 以避免对 Fosmid载体进行测序, 减少不 必要的数据读取及分析, 从而能够显著降低测序成本, 提高测序效率。  The present invention aims to solve at least one of the technical problems existing in the prior art. Thus, the present invention provides methods for removing vector fragments in a sequencing library and uses thereof, so that the Fosmid vector fragment will be removed prior to sequencing the sequencing library using a Solexa DNA sequencer to avoid sequencing the Fosmid vector. Reduce unnecessary data reading and analysis, which can significantly reduce the cost of sequencing and improve sequencing efficiency.
根据本发明的一个方面, 本发明提供了一种用于除去测序文库中的载体片段的方法。 根据本发明的实施例, 该方法包括以下步骤:  According to one aspect of the invention, the invention provides a method for removing a vector fragment in a sequencing library. According to an embodiment of the invention, the method comprises the steps of:
首先, 提供经过标记的探针, 该探针能够与载体片段进行杂交。 根据本发明的实施例, 载体是 Fosmid载体。 其中, 在本文中所使用的术语 "杂交", 是指相互间具有互补序列 的两个单链核酸分子在一定条件下 (适宜的温度及离子强度等) 按碱基互补配对原则 退火形成双链核酸的过程, 也称为核酸杂交。 核酸杂交可以在 丽 A-丽 A之间, 也可在 丽 A-RNA或 RNA-RNA之间进行, 只要杂交双方存在互补序列, 可以进行碱基配对即可。 通常, 杂交的双方是待测核酸分子和已知核酸分子, 在杂交体系中已知序列的核酸分 子称作探针( probe )。 此外, 核酸杂交包括固-液相杂交和液相杂交, 其中液相杂交是 在溶液中进行的杂交反应, 其是指待测核酸分子与已知核酸分子 (探针) 在溶液中退 火形成杂交复合物。 在本文中, 所使用的术语 "载体片段" 可以是整个 Fosmid载体, 也 可以是将整个 Fosmid载体打断后形成的所有载体的片段, 由此, 在本文中的表达方式 "探 针能够与载体片段进行杂交"表示探针的序列可以与整个 Fosmid载体序列互补, 从而能够 与整个 Fosmid载体进行杂交, 也可以是探针的序列与上述所有载体的片段的序列互补, 从 而能够与所有载体的片段进行杂交。  First, a labeled probe is provided that is capable of hybridizing to a vector fragment. According to an embodiment of the invention, the vector is a Fosmid vector. Wherein, the term "hybridization" as used herein refers to two single-stranded nucleic acid molecules having complementary sequences to each other under certain conditions (suitable temperature and ionic strength, etc.) annealed to form a double strand according to the principle of complementary pairing of bases. The process of nucleic acids, also known as nucleic acid hybridization. The nucleic acid hybridization can be carried out between Li A-Li A or between Li A-RNA or RNA-RNA, and base pairing can be performed as long as there is a complementary sequence in both of the hybrids. Generally, the two sides of the hybrid are the nucleic acid molecule to be tested and the known nucleic acid molecule, and the nucleic acid molecule of the known sequence in the hybrid system is called a probe. In addition, nucleic acid hybridization includes solid-liquid phase hybridization and liquid phase hybridization, wherein liquid phase hybridization is a hybridization reaction carried out in a solution, which means that a nucleic acid molecule to be tested is hybridized with a known nucleic acid molecule (probe) in a solution to form a hybrid. Complex. As used herein, the term "vector fragment" may be the entire Fosmid vector, or a fragment of all vectors formed by disrupting the entire Fosmid vector, whereby the expression "probe" and vector Fragmentation of the fragment "represents that the sequence of the probe can be complementary to the entire Fosmid vector sequence, thereby enabling hybridization with the entire Fosmid vector, or the sequence of the probe can be complementary to the sequence of the fragments of all of the above vectors, thereby enabling fragmentation with all vectors. Hybridization is carried out.
其次, 将探针与测序文库进行杂交, 以便探针与载体片段形成带标记的双链核酸。 当 利用经过标记的探针进行杂交时, 在杂交反应结束后, 能够利用探针上的标记分离和 检测杂交后的双链, 其中, 标记是本领域内是已知的, 可以包括但不限于放射性同位 素或核素、 生物素、 吖啶翁酯 ( acridinium ester )或 polyA等。 根据探针上的标记的类 型, 可以利用本领域内已知的相应的核酸分离和检测方法, 包括但不限于羟基磷灰石 ( HAP )法或亲和吸附法(例如参见 Henegariu 0等人. (1999). Custom fluorescent-nucleotide synthesis as an alternative method for nucleic acid labeling. Nature Biotechnology. 18:345-348; Ezaki T等人. 1989. Fluorometric Deoxyribonucleic Acid-Deoxyribonucleic Acid Hybridization in Microdilution Wells as an Alternative to Membrane Filter Hybridization in which Radioisotopes Are Used To Determine Genetic Relatedness among Bacterial Strains. Int. J. of Systemic Bacteriology 29 (3): 224-229; Herrington C 等人. 1998. PCR 3: PCR in situ hybridization: a practical approach, Volume 3. Oxford: Oxford University Press , 通过参照将其 全文并入本文), 将杂交产物进行分离和检测。 在本文中所使用的表达方式 "探针与载体 片段形成带标记的双链核酸", 是指探针可以与整个 Fosmid载体杂交形成带标记的双链核 酸, 也可以是探针与上述所有载体的片段杂交带标记的双链核酸。 根据本发明的实施例, 将探针与测序文库进行液相杂交反应。 根据本发明的一些实施例, 可以将探针与测序文库 以大约 1:1至大约 2:1的质量比进行杂交反应。根据本发明的具体示例,在进行杂交反应前, 可以进一步包括向测序文库中添加接头封闭剂。 根据本发明的实施例, 接头封闭剂和测序 文库的比例可以为大约 0.3 pM/ng-0.8 pM/ng。 根据本发明的一个具体示例, 接头封闭剂和 测序文库的比例为 0.5 pM/ng。 才艮据本发明的实施例, 杂交反应的时间可以为 1 , 4, 16, 24 或更多个小时。 Second, the probe is hybridized to the sequencing library such that the probe and the vector fragment form a labeled double-stranded nucleic acid. When hybridization is carried out using a labeled probe, after hybridization is completed, the hybridized duplex can be isolated and detected using the label on the probe, wherein the label is known in the art and can include, but is not limited to, Radioactive co-location Or a nuclides, biotin, acridinium ester or polyA. Depending on the type of label on the probe, corresponding nucleic acid isolation and detection methods known in the art can be utilized, including but not limited to hydroxyapatite (HAP) or affinity adsorption (see, for example, Henegariu 0 et al. (1999). Custom Fluorescent Deoxyribonucleic Acid-Deoxyribonucleic Acid Hybridization in Microdilution Wells as an Alternative to Membrane Filter Hybridization in which Radioisotopes Are Used To Determine Genetic Relatedness among Bacterial Strains. Int. J. of Systemic Bacteriology 29 (3): 224-229; Herrington C et al. 1998. PCR 3: PCR in situ hybridization: a practical approach, Volume 3. Oxford: Oxford University Press, which is incorporated herein by reference in its entirety, the hybridization product is isolated and tested. As used herein, the expression "probe and vector fragment forms a labeled double-stranded nucleic acid" means that the probe can hybridize with the entire Fosmid vector to form a labeled double-stranded nucleic acid, or it can be a probe and all of the above vectors. The fragment hybridizes to the labeled double-stranded nucleic acid. According to an embodiment of the invention, the probe is subjected to a liquid phase hybridization reaction with a sequencing library. According to some embodiments of the invention, the probe can be hybridized with the sequencing library in a mass ratio of from about 1:1 to about 2:1. According to a specific example of the present invention, before the hybridization reaction is carried out, it may further comprise adding a linker blocking agent to the sequencing library. According to an embodiment of the invention, the ratio of linker blocking agent to sequencing library may range from about 0.3 pM/ng to 0.8 pM/ng. According to a specific example of the invention, the ratio of the linker blocking agent to the sequencing library is 0.5 pM/ng. According to an embodiment of the present invention, the hybridization reaction may take a time of 1, 4, 16, 24 or more hours.
然后利用特异性结合探针上的标记的分子实体, 去除带标记的双链核酸, 从而除去测 序文库中的载体片段。 根据本发明的实施例, 前述经过标记的探针可以是经过生物素标记 的探针, 并且分子实体是亲和素。 根据本发明的一些实施例, 分子实体是链霉亲和素, 根 据本发明的具体示例, 分子实体形成于磁珠上。  The labeled double-stranded nucleic acid is then removed using the labeled molecular entity on the specific binding probe to remove the vector fragment in the sequencing library. According to an embodiment of the invention, the aforementioned labeled probe may be a biotinylated probe and the molecular entity is avidin. According to some embodiments of the invention, the molecular entity is streptavidin, and according to a specific example of the invention, the molecular entity is formed on the magnetic beads.
发明人惊奇地发现, 利用根据本发明实施例的用于除去测序文库中的载体片段的方 法, 能够有效地除去 Fosmid小片段测序文库中的载体片段, 从而基于高通量测序平台 例如 Illumina Solexa测序平台, 对处理过的测序文库进行测序后, 能够有效地避免浪 费测序资源和产生不需要的测序数据, 最终降低测序成本, 提高测序效率。  The inventors have surprisingly found that the method for removing vector fragments in a sequencing library according to an embodiment of the present invention enables efficient removal of vector fragments in a Fosmid small fragment sequencing library for sequencing based on a high throughput sequencing platform such as Illumina Solexa The platform, after sequencing the processed sequencing library, can effectively avoid waste of sequencing resources and generate unnecessary sequencing data, ultimately reducing the cost of sequencing and improving the sequencing efficiency.
具体地, 根据本发明的实施例, 本发明的用于除去测序文库中的载体片段的方法可 以包括下列步骤:  Specifically, according to an embodiment of the present invention, the method for removing a vector fragment in a sequencing library of the present invention may comprise the following steps:
1 )制备经标记的探针, 该探针能够与载体或其片段杂交; 2 )将探针与测序文库进行杂交反应, 从而使探针与载体或其片段形成带标记的双链核 酸; 1) preparing a labeled probe that is capable of hybridizing to a vector or a fragment thereof; 2) hybridizing the probe to the sequencing library, thereby forming the labeled double-stranded nucleic acid with the probe or the vector or the fragment thereof;
3 )利用特异性结合探针上的标记的分子实体, 去除带标记的双链核酸, 从而除去测序 文库中的载体片段。  3) removing the labeled double-stranded nucleic acid by using the labeled molecular entity on the specific binding probe to remove the vector fragment in the sequencing library.
根据本发明的实施例, 载体用于构建基因组克隆文库, 并且优选是 Fosmid载体。 根 据本发明的具体示例,探针用生物素进行标记,并且特异性结合标记的分子实体是亲和素。 根据本发明的实施例, 优选分子实体为链霉亲和素, 根据本发明的一个实施例, 链霉亲 和素可以形成于磁珠。 根据本发明的实施例, 将探针与测序文库进行液相杂交反应。  According to an embodiment of the invention, the vector is used to construct a genomic clone library, and is preferably a Fosmid vector. According to a specific example of the invention, the probe is labeled with biotin and the molecular entity that specifically binds to the label is avidin. According to an embodiment of the present invention, preferably the molecular entity is streptavidin, and according to one embodiment of the present invention, streptavidin may be formed on the magnetic beads. According to an embodiment of the invention, the probe is subjected to a liquid phase hybridization reaction with a sequencing library.
根据本发明的实施例, 可以通过下列步骤制备生物素标记的探针:  According to an embodiment of the invention, the biotinylated probe can be prepared by the following steps:
1 )使用生物素化的 dNTP对载体进行 PCR扩增, 并纯化回收 PCR产物;  1) PCR amplification of the vector using biotinylated dNTPs, and purification of the PCR product;
2 )将 PCR产物片段化并回收纯化, 从而获得生物素标记的探针。  2) The PCR product is fragmented and recovered and purified to obtain a biotin-labeled probe.
根据本发明的实施例, 可以利用 Covaris仪将 PCR产物片段化。 根据本发明的具体 以将 PCR产物片段化为大约 300bp的片段。  According to an embodiment of the invention, the PCR product can be fragmented using a Covaris instrument. The PCR product is specifically fragmented into fragments of about 300 bp according to the present invention.
根据本发明的实施例, 可以通过在 95 °C下使探针与测序文库变性, 然后在 65 °C下退 火来进行杂交反应。根据本发明的具体示例,可以将探针与测序文库以大约 1:1至大约 2:1 的质量比进行杂交反应。  According to an embodiment of the present invention, the hybridization reaction can be carried out by denaturation of the probe and the sequencing library at 95 °C, followed by annealing at 65 °C. According to a specific example of the present invention, the probe and the sequencing library can be subjected to a hybridization reaction at a mass ratio of about 1:1 to about 2:1.
根据本发明的实施例, 任选地, 在进行探针与测序文库的杂交之前, 可以向测序文 库中加入接头封闭剂 (block )。 根据本发明的具体示例, 接头封闭剂是能够与测序文库构 建过程中所使用的接头杂交的核酸, 优选, 接头封闭剂的核苷酸序列与接头的核苷酸序列 完全互补, 由此, 在进行探针与测序文库的杂交时, 接头封闭剂与接头杂交, 从而避免发 生接头与探针的杂交以及接头间配对连接。 根据本发明的实施例, 加入的接头封闭剂和文 库的比例可以为大约 0.3pM/ng-0.8pM/ng,例如 0.5pM/ng。当加入的接头封闭剂的量过大时, 文库中的载体片段的去除效率会降低, 然而当加入的接头封闭剂的量过少时, 大量的非载 体的文库 DNA片段会被探针捕获而被去除。 根据本发明的实施例, 杂交可以进行 1 , 4, 16, 24或更多个小时。  According to an embodiment of the invention, a linker blocker can optionally be added to the sequencing library prior to hybridization of the probe to the sequencing library. According to a specific example of the present invention, the linker blocking agent is a nucleic acid capable of hybridizing to a linker used in the construction of the sequencing library, and preferably, the nucleotide sequence of the linker blocker is completely complementary to the nucleotide sequence of the linker, thereby When the probe is hybridized to the sequencing library, the linker is hybridized to the linker to avoid hybridization of the linker to the probe and pairing between the linkers. According to an embodiment of the invention, the ratio of the linker blocking agent to the library may be from about 0.3 pM/ng to about 0.8 pM/ng, for example 0.5 pM/ng. When the amount of the linker blocking agent added is too large, the removal efficiency of the vector fragment in the library is lowered. However, when the amount of the linker blocking agent added is too small, a large amount of non-vector library DNA fragments are captured by the probe. Remove. According to an embodiment of the invention, the hybridization can be carried out for 1, 4, 16, 24 or more hours.
根据本发明的另一方面, 本发明提供了一种用于对基因组克隆文库进行测序的方法, 其特征在于, 包括以下步骤:  According to another aspect of the present invention, the present invention provides a method for sequencing a genomic clone library, comprising the steps of:
首先, 利用基因组克隆文库构建测序文库, 其中测序文库中包含载体片段。 根据本发 明的实施例, 基因组克隆文库是 Fosmid克隆文库, 且载体是 Fosmid载体。 根据本发明的 一些实施例, 可以通过下列步骤构建测序文库: 将基因组克隆文库中的 DNA进行片段化, 以便获得 DNA片段; 将接头连接至 DNA片段的两端, 以便获得连接产物; 将连接产物进 行 PCR扩增, 以便获得扩增产物; 以及回收纯化扩增产物, 该扩增产物构成测序文库。 First, a sequencing library is constructed using a genomic clone library in which a vector fragment is included in the sequencing library. According to an embodiment of the invention, the genomic clone library is a Fosmid clone library and the vector is a Fosmid vector. According to the invention In some embodiments, the sequencing library can be constructed by: fragmenting the DNA in the genomic clone library to obtain a DNA fragment; ligating the linker to both ends of the DNA fragment to obtain a ligation product; and PCR-amplifying the ligation product , in order to obtain an amplification product; and recovering and purifying the amplification product, the amplification product constituting a sequencing library.
接着, 利用根据本发明实施例的用于除去测序文库中的载体片段的方法除去所述测序 文库中的载体片段。 根据本发明的实施例, 可以在构建测序文库后, 或在构建测序文库的 过程中, 除去测序文库中的载体片段; 根据本发明的具体示例, 在将基因组克隆文库中的 DNA进行片段化后, 除去测序文库中的载体片段。  Next, the vector fragment in the sequencing library is removed using a method for removing a vector fragment in a sequencing library according to an embodiment of the present invention. According to an embodiment of the present invention, the vector fragment in the sequencing library may be removed after the sequencing library is constructed, or during the process of constructing the sequencing library; according to a specific example of the present invention, after the DNA in the genomic clone library is fragmented , removing the vector fragment in the sequencing library.
最后, 对去除载体片段后的测序文库进行测序。 根据本发明的实施例, 测序的方法不 受特别限制。 根据本发明的一个实施例, 可以利用 Solexa测序仪进行测序。  Finally, the sequencing library after removal of the vector fragment was sequenced. According to an embodiment of the present invention, the method of sequencing is not particularly limited. According to one embodiment of the invention, sequencing can be performed using a Solexa sequencer.
发明人惊奇地发现, 利用根据本发明实施例的用于对基因组克隆文库进行测序的方 法, 能够有效地对基因组克隆文库即 Fosmid小片段测序文库进行测序, 由于该方法能够 有效地除去 Fosmid小片段测序文库中的载体片段, 因此能够避免测序资源浪费和产生 不需要的测序数据, 从而能够降低测序成本, 提高测序效率, 并且所得测序结果准确, 数据分析容易。 由此, 该方法能够有效地应用于现阶段大规模开展的集中化商业性多物 种基因组测序, 从而能够节省成本, 提高效率, 进一步, 能够将根据本发明实施例的 用于对基因组克隆文库进行测序的方法应用到分析、 疾病诊断以及个性化(个体化)医疗 等新领域。  The inventors have surprisingly found that the method for sequencing a genomic clone library according to an embodiment of the present invention can efficiently sequence a genomic clone library, that is, a Fosmid small fragment sequencing library, since the method can effectively remove Fosmid small fragments. By sequencing the vector fragments in the library, it is possible to avoid waste of sequencing resources and generate unnecessary sequencing data, thereby reducing the cost of sequencing, improving the sequencing efficiency, and obtaining accurate sequencing results and easy data analysis. Thus, the method can be effectively applied to the centralized commercial multi-species genome sequencing which is carried out on a large scale at present, thereby saving cost and improving efficiency, and further, capable of performing genomic clone library according to an embodiment of the present invention. The method of sequencing is applied to new areas such as analysis, disease diagnosis, and personalized (individualized) medicine.
具体地, 根据本发明的实施例, 本发明的用于对基因组克隆文库进行测序的方法 可以包括下列步骤:  Specifically, according to an embodiment of the present invention, the method for sequencing a genomic clone library of the present invention may comprise the following steps:
1 ) 利用基因组克隆文库构建测序文库, 并除去测序文库中的载体片段;  1) constructing a sequencing library using the genomic clone library, and removing the vector fragment in the sequencing library;
2 ) 对去除载体片段后的测序文库进行测序。  2) Sequencing the sequencing library after removing the vector fragment.
根据本发明的实施例, 基因组克隆文库是 Fosmid克隆文库, 并且载体是 Fosmid 载体。 根据本发明的具体示例, 可以利用根据本发明的用于除去测序文库中的载体片段 的方法来去除测序文库中的载体片段。  According to an embodiment of the invention, the genomic clone library is a Fosmid clone library and the vector is a Fosmid vector. According to a specific example of the present invention, a method for removing a vector fragment in a sequencing library according to the present invention can be utilized to remove a vector fragment in a sequencing library.
根据本发明的一些实施例, 测序文库的构建可以通过以下步骤进行:  According to some embodiments of the invention, the construction of the sequencing library can be performed by the following steps:
1 ) 将基因组克隆文库中的 DNA片段化;  1) fragmenting the DNA in the genomic clone library;
2 ) 将接头连接至经片段化的 DNA的两端, 以便获得添加了接头的 DNA;  2) attaching a linker to both ends of the fragmented DNA to obtain DNA to which the linker is added;
3 )将添加了接头的 DNA进行 PCR扩增, 并回收纯化扩增产物, 该扩增产物构成 测序文库。  3) The DNA to which the linker has been added is subjected to PCR amplification, and the purified amplification product is recovered, and the amplified product constitutes a sequencing library.
根据本发明的实施例, 可以在构建测序文库后, 除去测序文库中的载体片段。 根 据本发明的一些实施例, 可以在构建测序文库的过程中, 除去测序文库中的载体片段。 根据本发明的一个具体示例, 可以在片段化基因组克隆文库中的 DNA后, 除去载体片 段。 According to an embodiment of the invention, the vector fragment in the sequencing library can be removed after construction of the sequencing library. Root According to some embodiments of the invention, the vector fragments in the sequencing library can be removed during the construction of the sequencing library. According to a specific example of the present invention, the vector fragment can be removed after fragmenting the DNA in the genomic library.
根据本发明的实施例, 可以使用 Solexa测序仪对测序文库进行测序。  According to an embodiment of the invention, the sequencing library can be sequenced using a Solexa sequencer.
根据本发明的又一方面, 本发明提供了一种用于基因组测序的试剂盒, 其特征在于, 包括: 载体, 其用于构建基因组克隆文库; 经标记的探针, 其能够与载体或其片段进行杂 交; 以及分子实体,其能够特异性结合探针上的标记。根据本发明的实施例,载体是 Fosmid 载体。 根据本发明的一些实施例, 探针是经过生物素标记的探针, 并且分子实体是亲和素。 根据本发明的具体示例, 分子实体是链霉亲和素。 根据本发明的一个实施例, 分子实体为 形成于磁珠上的链霉亲和素。 本领域的技术人员可以理解, 本发明的试剂盒中还可以包括 基因组克隆文库构建、 测序等所需的其它试剂, 根据本发明的实施例, 该试剂盒中可以进 一步包括接头和接头封闭剂。 关于接头封闭剂的特征及作用, 前面已作详细描述, 在此不 再赘述。  According to still another aspect of the present invention, the present invention provides a kit for genome sequencing, comprising: a vector for constructing a genomic clone library; a labeled probe capable of interacting with the vector or The fragment is hybridized; and a molecular entity capable of specifically binding to the label on the probe. According to an embodiment of the invention, the vector is a Fosmid vector. According to some embodiments of the invention, the probe is a biotinylated probe and the molecular entity is avidin. According to a specific example of the invention, the molecular entity is streptavidin. According to one embodiment of the invention, the molecular entity is streptavidin formed on the magnetic beads. It will be appreciated by those skilled in the art that kits of the invention may also include other reagents required for genomic clone library construction, sequencing, etc., which may further include linkers and linker blocking agents in accordance with embodiments of the present invention. The features and functions of the joint sealer have been described in detail above and will not be described again.
发明人发现, 根据本发明实施例的试剂盒能够有效地用于基因组测序, 具体地, 利 用试剂盒中的载体能够有效地构建基因组克隆文库, 进而利用试剂盒中的经标记的探针以 及分子实体能够有效地将文库中的载体片段去除, 从而基于高通量测序平台, 能够有效地 对去除载体片段的文库进行基因组测序。 由此, 将本发明的试剂盒应用于基因组测序, 能 够有效地避免测序资源浪费和产生不需要的测序数据, 从而能够降低测序成本, 提高 测序效率, 并且所得测序结果准确, 数据分析容易。  The inventors have found that a kit according to an embodiment of the present invention can be effectively used for genome sequencing, and in particular, a vector in a kit can be used to efficiently construct a library of genomic clones, thereby utilizing labeled probes and molecules in the kit. The entity is capable of efficiently removing the vector fragments in the library, thereby enabling efficient genome sequencing of the library from which the vector fragments are removed based on a high throughput sequencing platform. Thus, the application of the kit of the present invention to genome sequencing can effectively avoid waste of sequencing resources and generate unnecessary sequencing data, thereby reducing the cost of sequencing, improving the sequencing efficiency, and obtaining accurate sequencing results and easy data analysis.
具体地, 根据本发明的实施例, 本发明的用于基因组测序的试剂盒可以包括: 用于 构建基因组克隆文库的载体, 能够与载体或其片段杂交的经标记的探针以及能够特异性结 合探针的标记的分子实体。  Specifically, according to an embodiment of the present invention, the kit for genome sequencing of the present invention may comprise: a vector for constructing a genomic clone library, a labeled probe capable of hybridizing with a vector or a fragment thereof, and capable of specifically binding The labeled molecular entity of the probe.
根据本发明的实施例, 载体是 Fosmid载体。根据本发明的一些实施例, 探针用生物 素进行标记, 并且特异性结合标记的分子实体是亲和素, 根据本发明的具体示例, 优选分 子实体为链霉亲和素。 根据本发明的一个实施例, 链霉亲和素可以形成于磁珠上。 根据本 发明的实施例, 试剂盒还可以包括其他试剂, 包括但不限于接头和接头封闭剂。  According to an embodiment of the invention, the vector is a Fosmid vector. According to some embodiments of the invention, the probe is labeled with biotin and the molecular entity that specifically binds to the label is avidin, and according to a specific example of the invention, the preferred molecular entity is streptavidin. According to an embodiment of the invention, streptavidin may be formed on the magnetic beads. According to embodiments of the invention, the kit may also include other reagents including, but not limited to, linkers and linker blocking agents.
需要说明的是, 根据本发明实施例的用于除去测序文库中的载体片段的方法及其用途 是本申请的发明人经过艰苦的创造性劳动和优化工作而完成的。  It should be noted that the method for removing a vector fragment in a sequencing library and its use according to an embodiment of the present invention are completed by the inventor of the present application through arduous creative labor and optimization work.
本发明的附加方面和优点将在下面的描述中部分给出, 部分将从下面的描述中变 得明显, 或通过本发明的实践了解到。 具体实施方式 The additional aspects and advantages of the invention will be set forth in part in the description which follows. detailed description
下面将结合实施例对本发明的方案进行解释。 本领域技术人员将会理解, 下面的实施 例仅用于说明本发明, 而不应视为限定本发明的范围。 实施例中未注明具体技术或条件的, 按照本领域内的文献所描述的技术或条件 (例如参考 J.萨姆布鲁克等著,黄培堂等译的《分 子克隆实验指南》, 第三版, 科学出版社)或者按照产品说明书进行。 所用试剂或仪器未注 明生产厂商者, 均为可以通过市购获得的常规产品, 例如可以釆购自 Illumina公司。  The solution of the present invention will be explained below in conjunction with the embodiments. Those skilled in the art will appreciate that the following examples are merely illustrative of the invention and are not to be considered as limiting the scope of the invention. In the examples, the specific techniques or conditions are not indicated, according to the techniques or conditions described in the literature in the field (for example, refer to J. Sambrook et al., Huang Peitang et al., Molecular Cloning Experimental Guide, Third Edition, Science Press) or in accordance with the product manual. The reagents or instruments used are not specified by the manufacturer, and are conventional products that are commercially available, for example, from Illumina.
实施例 1  Example 1
1、 制备探针  1. Prepare the probe
以 Fosmid载体(其序列如 SEQ ID NO: 1所示)为模板, 通过 PCR反应制备探针。 其 中, PCR的反应体系如下:  The probe was prepared by a PCR reaction using a Fosmid vector (the sequence of which is shown in SEQ ID NO: 1) as a template. Among them, the reaction system of PCR is as follows:
Figure imgf000009_0001
Figure imgf000009_0001
注: *表示所使用的 dNTP混合溶液是以 15:85的比例配置的生物素 -dNTP与普通 dNTP的混合溶液, 并且其终浓度是 1 μΜ。  Note: * indicates that the dNTP mixed solution used is a mixed solution of biotin-dNTP and normal dNTP in a ratio of 15:85, and the final concentration is 1 μΜ.
其中, 用于制备探针的正向引物为: 5'-CCTGGGGTGCCTAATGAGTG-3' (SEQ ID NO: 2)。 用于制备探针的反向引物: 5'-CGTCGTTTTACAACGTCGTGA-3' (SEQ ID NO: 3)。  Among them, the forward primer for preparing the probe was: 5'-CCTGGGGTGCCTAATGAGTG-3' (SEQ ID NO: 2). Reverse primer for the preparation of the probe: 5'-CGTCGTTTTACAACGTCGTGA-3' (SEQ ID NO: 3).
PCR反应条件为: 95 °C , 2分钟; 12个循环的 95 °C , 30秒、 65 °C , 30秒、 72 °C , 8 分钟; 72 °C , 10分钟; 4°C保存。  The PCR reaction conditions were: 95 ° C, 2 minutes; 12 cycles of 95 ° C, 30 seconds, 65 ° C, 30 seconds, 72 ° C, 8 minutes; 72 ° C, 10 minutes; 4 ° C storage.
PCR完成后, 利用 DNA clean and Concentrator™-25试剂盒对 PCR产物进行纯化, 然 后利用 Covaris S2仪将 PCR产物进行打断, 以便获得 DNA片段, 然后通过凝胶电泳检测 确定 DNA 片段的大小。 检测结果显示, DNA 片段的大小主要为大约 300bp。 然后利用 MinElute PCR纯化试剂盒纯化回收 DNA片段, 并将其溶于 20μ1洗脱緩冲液中, 从而获得 生物素标记的探针。 然后, 利用 Qubit ( HS )定量探针的浓度, 备用。 After the PCR was completed, the PCR product was purified using the DNA clean and ConcentratorTM-25 kit, and then the PCR product was disrupted using a Covaris S2 instrument to obtain a DNA fragment, which was then determined by gel electrophoresis to determine the size of the DNA fragment. The test results showed that the size of the DNA fragment was mainly about 300 bp. Then use The DNA fragment was purified by MinElute PCR Purification Kit and dissolved in 20 μl elution buffer to obtain a biotin-labeled probe. Then, use Qubit (HS) to quantify the concentration of the probe and set aside.
2、 探针与文库杂交 2. Probe hybridization with library
首先, 利用 CopyControl™HTP Fosmid文库构建试剂盒 (Epicentre, USA) , 按照生产商 的详细说明, 制备小麦条錄菌 (Puccinia striiformis f.sp 'rid)基因组 DNA的 Fosmid克隆文 库。  First, the Fosmid clone library of Puccinia striiformis f.sp 'rid genomic DNA was prepared using the CopyControlTM HTP Fosmid Library Construction Kit (Epicentre, USA) according to the manufacturer's detailed instructions.
接着, 利用 Illumina 公司的混合样品制备寡核苷酸试剂盒 ( Multiplexing sample preparation oligonucleotide kit, PE-400-1002 )构建小麦条锈菌 DNA的测序文库, 备用。  Next, a sequencing library of wheat stripe rust DNA was constructed using Illumina's Multiplexing sample preparation oligonucleotide kit (PE-400-1002).
然后, 将上述探针和小麦条锈菌 DNA的测序文库(在本实施例中有时将其简称为 "文 库")按照下表中的配比制备两份杂交体系, 并分别命名为样品 1和样品 2:  Then, the above-mentioned probe and a sequencing library of wheat stripe rust DNA (sometimes referred to simply as "library" in the present example) were prepared according to the ratios in the following table, and named as sample 1 and Sample 2:
Figure imgf000010_0001
Figure imgf000010_0001
注: SC杂交緩冲液和 SC杂交组件 A来源于序列捕获杂交试剂盒。  Note: SC hybridization buffer and SC hybridization component A are derived from the sequence capture hybridization kit.
其中, 在各杂交体系中, 文库的用量为 120ng , 探针的用量为 120ng (通过 Qubit进行 定量), 并且杂交法体系中添加有接头封闭剂。 该接头封闭剂来自标签测序引物试剂盒 ( Illumina公司)和混合 Rd2测序引物( Multiplexing Rd2 Sequencing Primer试剂盒, Illumina 公司), 其中接头封闭剂的序列如下所示:  Among them, in each hybridization system, the amount of the library was 120 ng, the amount of the probe was 120 ng (quantified by Qubit), and a linker blocking agent was added to the hybridization system. The linker blocker was obtained from the tag sequencing primer kit (Illumina) and the mixed Rd2 sequencing primer (Multixing Rd2 Sequencing Primer kit, Illumina), wherein the sequence of the linker blocker is as follows:
接头 -†闭剂 1 : 5'-AATGATACGGCGACCACCGAGATCTACACTCTTTCCC TACACG ACGCTCTTCCGATCT-3' (SEQ ID NO: 4);  Linker - sputum 1 : 5'-AATGATACGGCGACCACCGAGATCTACACTCTTTCCC TACACG ACGCTCTTCCGATCT-3' (SEQ ID NO: 4);
接头 -†闭剂 2: 5'-ACAAGCAGAAGACGGCATACGAGATAAGCAATGGTGA CTGGA GTTCAGACGTGTGCTCTTCCGATCT-3' (SEQ ID NO: 5)。  Linker - sputum 2: 5'-ACAAGCAGAAGACGGCATACGAGATAAGCAATGGTGA CTGGA GTTCAGACGTGTGCTCTTCCGATCT-3' (SEQ ID NO: 5).
在各杂交体系中, 两种接头封闭剂的用量都是 0.06 nM,从而接头封闭剂与文库的量的 比值为 0.5 pM/ng。  In each hybrid system, the amount of both linker blocking agents was 0.06 nM, and the ratio of the amount of the linker to the library was 0.5 pM/ng.
然后, 将上述制备好的两份杂交体系进行杂交, 其中杂交条件为: 在 95 °C下变性 10 分钟, 然后在 65 °C下按照下表中的指定时间进行杂交, 从而获得杂交产物, 备用。 Then, the two hybrid systems prepared above are hybridized, wherein the hybridization conditions are: denaturation at 95 ° C 10 After the minute, hybridization was carried out at 65 ° C according to the time specified in the table below to obtain a hybridization product, which was used.
Figure imgf000011_0001
Figure imgf000011_0001
3、 载体片段的捕获和分离 3. Capture and separation of vector fragments
根据制造商的说明书, 利用链霉亲和素磁珠( M280磁珠)捕获和分离上述杂交产物中 能够与探针杂交的文库 DNA (即, 载体片段), 然后将来源于样品 1和 2的去除了载体片 段的文库 DNA分别进行收集, 并标记为 24- 1-D和 4- 12-D , 并测定二者的浓度, 结果如下 所示:  Library DNA (i.e., vector fragment) capable of hybridizing with the probe in the above hybridization product was captured and isolated using streptavidin magnetic beads (M280 magnetic beads) according to the manufacturer's instructions, and then derived from samples 1 and 2 The library DNA from which the vector fragment was removed was separately collected and labeled as 24- 1-D and 4- 12-D, and the concentrations of the two were determined. The results are as follows:
Figure imgf000011_0002
Figure imgf000011_0002
此外, 分别收集来自样品 1和 2的与探针杂交的文库 DNA (即, 载体片段), 并标记 为 24-1- V和 4- 12-V。  In addition, library DNA (i.e., vector fragments) hybridized with the probes from samples 1 and 2, respectively, were collected and labeled 24-1-V and 4- 12-V.
根据制造商的说明书, 利用 QIAquick PCR纯化试剂盒分别将上述获得的文库 DNA , 即 24-l-D、 4- 12-D , 24-1-V以及 4- 12-V进行纯化, 备用。  The library DNAs obtained above, namely 24-l-D, 4- 12-D, 24-1-V and 4- 12-V, were purified by QIAquick PCR Purification Kit according to the manufacturer's instructions.
4、 文库 DNA的扩增 4. Library DNA amplification
利用 PFX酶将上述获得的文库 DNA进行 PCR扩增, 其中 PCR反应体系如下:  The library DNA obtained above was subjected to PCR amplification using PFX enzyme, wherein the PCR reaction system was as follows:
Figure imgf000011_0003
Figure imgf000011_0003
其中,用于文库 DNA扩增的正向引物为: 5'- CAAGCAGAAGACGGCATACGA -3' (SEQ ID NO: 6) , 用于文库 DNA扩增的反向引物: 5'- AATGATACGGCGACCACCGAGATC -3' (SEQ ID NO: 7)。 Among them, the forward primer for library DNA amplification is: 5'-CAAGCAGAAGACGGCATACGA -3' (SEQ ID NO: 6), reverse primer for library DNA amplification: 5'- AATGATACGGCGACCACCGAGATC -3' (SEQ ID NO: 7).
PCR反应条件为: 94 °C , 2分钟; 12个循环的 94 °C , 15秒、 58 °C , 30秒、 72 °C , 30 秒; 72 °C , 5分钟; 4°C保存。 由此, 获得扩增产物。  The PCR reaction conditions were: 94 ° C, 2 minutes; 12 cycles of 94 ° C, 15 seconds, 58 ° C, 30 seconds, 72 ° C, 30 seconds; 72 ° C, 5 minutes; 4 ° C storage. Thereby, an amplification product was obtained.
然后, 将扩增产物纯化后溶于 30μ1超纯水中, 并利用 Nanodrop测定获得的扩增产物 的浓度, 结果如下:
Figure imgf000012_0001
Then, the amplified product was purified, dissolved in 30 μl of ultrapure water, and the concentration of the obtained amplification product was measured by Nanodrop, and the results were as follows:
Figure imgf000012_0001
5、 载体去除情况的分析 5. Analysis of carrier removal
在本实施例中,通过 Q-PCR来分析利用本发明的用于除去测序文库中的载体片段方法 去除文库中载体片段的效果。 在分析时, 选择未经本发明的方法处理的样品作为对照。 为 确定去除载体片段的效果, 将未经处理的样品和经处理的样品稀释到同一浓度, 并进行 Q-PCR以检测这两种样品中载体片段的含量。 然后通过下列公式, 利用未经处理的样品和 经处理的样品的 Ct值来计算富集度: 富集度 =E" *^«^ - " «f' >, 其中, E表示扩增效 率, 例如, 当扩增效率为 100%时, E = 2, 表示扩增效率为每个扩增循环使得扩增子的拷贝 数增加 2倍。 富集度表示对照样品中载体片段的量相对于经处理的样品中载体片段的量的 倍数, 因此, 由富集度的值即可得出载体去除的效果, 例如当富集度为 N时, 其表示经过 本发明的方法处理的样品去除了 1 - 1/N的载体片段。 In the present example, the effect of removing the vector fragment in the library by the method for removing the vector fragment in the sequencing library of the present invention is analyzed by Q-PCR. At the time of analysis, a sample which was not treated by the method of the present invention was selected as a control. To determine the effect of removing the vector fragment, the untreated sample and the treated sample were diluted to the same concentration, and Q-PCR was performed to detect the content of the vector fragment in the two samples. The enrichment is then calculated by the following formula using the Ct values of the untreated and treated samples: enrichment = E " *^«^ - "«f'>, where E is the amplification efficiency, For example, when the amplification efficiency is 100%, E = 2, indicating that the amplification efficiency is such that the number of copies of the amplicon is doubled for each amplification cycle. The degree of enrichment indicates a multiple of the amount of the carrier fragment in the control sample relative to the amount of the carrier fragment in the treated sample, and therefore, the effect of carrier removal can be obtained from the value of the enrichment degree, for example, when the enrichment degree is N , which indicates that the sample treated by the method of the present invention removes the 1-1/N carrier fragment.
然后,以去除了载体片段的文库 DNA (即上述 24-1-D和 4-12-D)以及未去除载体片段的 文库 DNA (即小麦条锈菌 DNA的测序文库)为模板, 进行 Q-PCR反应以检测各文库中载体 片段的含量。 其中, Q-PCR反应体系如下:  Then, the library DNA from which the vector fragment was removed (i.e., the above 24-1-1D and 4112-D) and the library DNA from which the vector fragment was not removed (i.e., the sequencing library of the wheat stripe rust DNA) was used as a template for Q- A PCR reaction was performed to detect the content of the vector fragment in each library. Among them, the Q-PCR reaction system is as follows:
Figure imgf000012_0002
Figure imgf000012_0002
注: SYBR Premix和 Rox Reference Dye II来源于 SYBR@ Premix Ex Taq™ 其中,用于 Q-PCR的正向引物为: 5'-TTgTTCCCACgCCTGCTGAgTTGT-3' (SEQ ID NO: 8) , 用于 Q-PCR的反向引物为: 5'-ATCCCgAATTTgCTCCTCCATCCAC-3' (SEQ ID NO: 9 Q-PCR反应条件为: 95 °C , 30秒; 40个循环的 95 °C , 15秒、 60 °C , 1分钟。 Note: SYBR Premix and Rox Reference Dye II are from SYBR@ Premix Ex TaqTM Among them, the forward primer for Q-PCR is: 5'-TTgTTCCCACgCCTGCTGAgTTGT-3' (SEQ ID NO: 8), and the reverse primer for Q-PCR is: 5'-ATCCCgAATTTgCTCCTCCATCCAC-3' (SEQ ID NO : 9 Q-PCR reaction conditions are: 95 ° C, 30 seconds; 40 cycles of 95 ° C, 15 seconds, 60 ° C, 1 minute.
Q-PCR的结果如下:  The results of Q-PCR are as follows:
Figure imgf000013_0001
Figure imgf000013_0001
从上述结果可以看出, 载体片段的富集度都大于 5 , 这表明经本发明的方法处理的样 品中载体片段的含量小于对照样品的 1/5 , 从而表明本发明的方法去除了初始文库中至少 80 %的载体片段。 实施例 2  From the above results, it can be seen that the degree of enrichment of the vector fragment is greater than 5, which indicates that the content of the vector fragment in the sample treated by the method of the present invention is less than 1/5 of that of the control sample, thereby indicating that the method of the present invention removes the initial library. At least 80% of the vector fragments. Example 2
1、 制备探针  1. Prepare the probe
利用实施例 1中所釆用的制备探针方法的制备探针, 备用。  The probe prepared by the method for preparing a probe used in Example 1 was used.
2、 探针与文库杂交  2. Probe hybridization with library
首先, 利用来自一名中国成年男性的血液基因组 DNA, 构建 Fosmid克隆文库, 接着 根据制造商的说明书, 利用混合样品制备寡核苷酸试剂盒( Multiplexing sample preparation oligonucleotide kit, Illumina公司, PE-400-1002 )构建测序文库,称为 YH Fosmid测序文库。 然后, 将上述制备的探针和 YH Fosmid测序文库(在本实施例中有时将其简称为 "文库") 按照下表中的配比制备杂交体系并进行杂交:  First, a Fosmid clone library was constructed using blood genomic DNA from a Chinese adult male, and then a multi-sample preparation oligonucleotide kit (Illumina, PE-400-) was used according to the manufacturer's instructions. 1002) A sequencing library was constructed, called a YH Fosmid sequencing library. Then, the probe prepared above and the YH Fosmid sequencing library (hereinafter sometimes referred to simply as "library" in the present example) were prepared and hybridized according to the ratios in the following table:
文库 50 ng  Library 50 ng
2 x SC杂交緩冲液 7.5 μΐ  2 x SC hybridization buffer 7.5 μΐ
SC杂交组件 A 3 μ1  SC hybridization component A 3 μ1
探针 100 ng  Probe 100 ng
接头封闭剂 1 0.025 nM  Joint sealer 1 0.025 nM
接头封闭剂 2 0.025 nM 水 补足至 15 μΐ 其中, 在上述杂交体系中, 文库与探针的质量比为 1:2, 并且接头封闭剂与实施例 1的 接头封闭剂相同。 杂交条件为: 在 95 °C下进行变性 10分钟, 然后在 65 °C下进行杂交 24小 时, 从而获得杂交产物, 备用。 Joint sealer 2 0.025 nM The water was made up to 15 μΐ wherein, in the above hybrid system, the mass ratio of the library to the probe was 1:2, and the linker blocking agent was the same as the linker of Example 1. The hybridization conditions were: denaturation at 95 ° C for 10 minutes, followed by hybridization at 65 ° C for 24 hours, thereby obtaining a hybridization product, which was used.
3、 载体片段的捕获和分离 3. Capture and separation of vector fragments
利用实施例 1 中釆用的捕获和分离载体片段的方法, 对所得的杂交产物中能够与探针 杂交的文库 DNA, 即载体片段, 进行捕获和分离, 然后收集去除了载体片段的文库 DNA, 并标记为 YH-DNA-1 , 并测定其浓度, 结果如下所示:  Using the method for capturing and isolating the vector fragment used in Example 1, the library DNA capable of hybridizing with the probe, that is, the vector fragment, is captured and separated, and then the library DNA from which the vector fragment is removed is collected. It was labeled as YH-DNA-1 and its concentration was determined. The results are as follows:
Figure imgf000014_0001
Figure imgf000014_0001
收集与探针杂交的文库 DNA (即, 载体片段), 并标记为 YH- Vector- 1。  The library DNA (i.e., vector fragment) that hybridizes to the probe is collected and labeled as YH-Vector-1.
根据制造商的说明书, 利用 QIAquick PCR纯化试剂盒纯化获得的文库 DNA, 备用。  The obtained library DNA was purified using QIAquick PCR Purification Kit according to the manufacturer's instructions and used.
4、利用实施例 1中釆用的将文库 DNA进行 PCR扩增的方法,对上述获得的文库 DNA 进行 PCR扩增, 以便获得扩增产物, 然后将扩增产物纯化后溶于 30μ1超纯水中, 并利用 Nanodrop测定获得的扩增产物的浓度, 结果如下:
Figure imgf000014_0002
4. PCR amplification of the library DNA obtained by PCR amplification of the library DNA used in Example 1 to obtain an amplification product, and then the amplified product is purified and dissolved in 30 μl of ultrapure water. In the middle, and using Nanodrop to determine the concentration of the amplified product obtained, the results are as follows:
Figure imgf000014_0002
5、 载体去除情况的分析 5. Analysis of carrier removal
利用实施例 1 中釆用的对利用本发明的用于除去测序文库中的载体片段方法去除文库 中载体片段的效果进行分析的方法,通过 Q-PCR来分析本实施例中去除文库中载体片段的 效果, Q-PCR的结果如下:  Using the method of analyzing the effect of removing the vector fragment in the library by using the vector fragment method of the present invention for removing the vector fragment in the sequencing library, the method of removing the vector fragment in the library in this example was analyzed by Q-PCR. The effect of Q-PCR is as follows:
Figure imgf000014_0003
Figure imgf000014_0003
从上述结果可以看出, 载体片段的富集度大于 5, 这表明经本发明的方法处理的样品 中载体片段的含量小于对照样品的 1/5。 因此, 本发明的方法去除了初始文库中至少 80 % 的载体片段。 工业实用性 As can be seen from the above results, the degree of enrichment of the vector fragment is greater than 5, indicating the sample treated by the method of the present invention. The content of the carrier fragment was less than 1/5 of the control sample. Thus, the method of the invention removes at least 80% of the vector fragments in the original library. Industrial applicability
本发明的用于除去测序文库中的载体片段的方法、 用于对基因组克隆文库进行测序的 方法以及用于基因组测序的试剂盒, 能够有效地除去 Fosmid小片段测序文库中的载体片 段, 进而能够应用于基因组克隆文库的测序, 从而能够降低测序成本, 提高测序及数据分 析的效率。  The method for removing a vector fragment in a sequencing library of the present invention, a method for sequencing a genomic clone library, and a kit for genome sequencing can effectively remove a vector fragment in a Fosmid small fragment sequencing library, thereby enabling It can be applied to the sequencing of genomic clone libraries, which can reduce the cost of sequencing and improve the efficiency of sequencing and data analysis.
尽管本发明的具体实施方式已经得到详细的描述, 本领域技术人员将会理解。 根据已 经公开的所有教导, 可以对那些细节进行各种修改和替换, 这些改变均在本发明的保护范 围之内。 本发明的全部范围由所附权利要求及其任何等同物给出。  Although specific embodiments of the invention have been described in detail, those skilled in the art will understand. Various modifications and alterations of those details are possible in light of the teachings of the invention. The full scope of the invention is given by the appended claims and any equivalents thereof.
在本说明书的描述中, 参考术语 "一个实施例"、 "一些实施例"、 "示意性实施例"、 "示 例"、 "具体示例"、 或 "一些示例" 等的描述意指结合该实施例或示例描述的具体特征、 结 构、 材料或者特点包含于本发明的至少一个实施例或示例中。 在本说明书中, 对上述术语 的示意性表述不一定指的是相同的实施例或示例。 而且, 描述的具体特征、 结构、 材料或 者特点可以在任何的一个或多个实施例或示例中以合适的方式结合。  In the description of the present specification, the description of the terms "one embodiment", "some embodiments", "illustrative embodiment", "example", "specific example", or "some examples", etc. Particular features, structures, materials or features described in the examples or examples are included in at least one embodiment or example of the invention. In the present specification, the schematic representation of the above terms does not necessarily mean the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in a suitable manner in any one or more embodiments or examples.

Claims

权利要求书 Claim
1、 一种用于除去测序文库中的载体片段的方法, 其包括以下步骤: A method for removing a vector fragment in a sequencing library, comprising the steps of:
提供经过标记的探针, 所述探针能够与所述载体片段进行杂交;  Providing a labeled probe capable of hybridizing to the vector fragment;
将所述探针与所述测序文库进行杂交, 以便所述探针与所述载体片段形成带标记的双 链核酸; 以及  Hybridizing the probe to the sequencing library such that the probe forms a labeled double-stranded nucleic acid with the vector fragment;
利用特异性结合探针上的标记的分子实体, 去除所述带标记的双链核酸, 从而除去所 述测序文库中的载体片段。  The labeled double-stranded nucleic acid is removed using a labeled molecular entity that specifically binds to the probe, thereby removing the vector fragment in the sequencing library.
2、 根据权利要求 1所述的方法, 其特征在于 , 所述载体是 Fosmid载体。  2. Method according to claim 1, characterized in that the carrier is a Fosmid vector.
3、 根据权利要求 1所述的方法, 其特征在于, 所述经过标记的探针是经过生物素标记 的探针, 并且所述分子实体是亲和素。  3. The method of claim 1 wherein the labeled probe is a biotinylated probe and the molecular entity is avidin.
4、 根据权利要求 3所述的方法, 其特征在于 , 所述分子实体是链霉亲和素。  4. The method according to claim 3, wherein the molecular entity is streptavidin.
5、 根据权利要求 4所述的方法, 其特征在于 , 所述分子实体形成于磁珠上。  5. The method according to claim 4, wherein the molecular entity is formed on the magnetic beads.
6、 根据权利要求 1所述的方法, 其特征在于, 将所述探针与所述测序文库进行液相杂 交反应。  6. The method according to claim 1, wherein the probe is subjected to liquid phase hybridization with the sequencing library.
7、 根据权利要求 1所述的方法, 其特征在于 , 将所述探针与所述测序文库以大约 1:1 至大约 2:1的质量比进行杂交反应。  7. The method according to claim 1, wherein the probe is hybridized with the sequencing library at a mass ratio of about 1:1 to about 2:1.
8、 根据权利要求 1所述的方法, 其特征在于, 在进行杂交反应前, 进一步包括向测序 文库中添加接头封闭剂。  8. The method of claim 1, further comprising adding a linker blocking agent to the sequencing library prior to performing the hybridization reaction.
9、 根据权利要求 8所述的方法, 其特征在于, 所述接头封闭剂和所述测序文库的比例 为大约 0.3 pM/ng-0.8 pM/ng。  9. The method of claim 8, wherein the ratio of the linker blocking agent to the sequencing library is between about 0.3 pM/ng and 0.8 pM/ng.
10、 根据权利要求 9所述的方法, 其特征在于, 所述接头封闭剂和所述测序文库的比 例为 0.5 pM/ng。  10. The method according to claim 9, wherein the ratio of the linker blocking agent to the sequencing library is 0.5 pM/ng.
11、 根据权利要求 1所述的方法, 其特征在于, 所述杂交反应的时间为 1 , 4, 16, 24 或更多个小时。  11. The method according to claim 1, wherein the hybridization reaction has a time of 1, 4, 16, 24 or more hours.
12、 一种用于对基因组克隆文库进行测序的方法, 其特征在于, 包括以下步骤: 利用基因组克隆文库构建测序文库, 其中所述测序文库中包含载体片段;  12. A method for sequencing a genomic clone library, comprising the steps of: constructing a sequencing library using a genomic clone library, wherein the sequencing library comprises a vector fragment;
利用根据权利要求 1-11任一项所述的方法除去所述测序文库中的载体片段; 以及 对去除载体片段后的测序文库进行测序。  The vector fragment in the sequencing library is removed using the method according to any one of claims 1-11; and the sequencing library after removal of the vector fragment is sequenced.
13、 根据权利要求 12所述的方法, 其特征在于, 所述基因组克隆文库是 Fosmid克隆 文库, 且所述载体是 Fosmid载体。 13. The method according to claim 12, wherein the genomic clone library is a Fosmid clone Library, and the vector is a Fosmid vector.
14、 根据权利要求 12所述的方法, 其特征在于, 所述构建测序文库是通过下列步骤进 行的:  14. The method according to claim 12, wherein said constructing the sequencing library is performed by the following steps:
将所述基因组克隆文库中的 DNA进行片段化, 以便获得 DNA片段;  Fragmenting the DNA in the genomic clone library to obtain a DNA fragment;
将接头连接至所述 DNA片段的两端, 以便获得连接产物;  A linker is attached to both ends of the DNA fragment to obtain a ligation product;
将所述连接产物进行 PCR扩增, 以便获得扩增产物; 以及  The ligation product is subjected to PCR amplification to obtain an amplification product;
回收纯化所述扩增产物, 所述扩增产物构成所述测序文库。  The amplification product is recovered and purified, and the amplification product constitutes the sequencing library.
15、 根据权利要求 12所述的方法, 其特征在于, 在构建测序文库后, 或在构建测序文 库的过程中, 除去所述测序文库中的载体片段;  15. The method according to claim 12, wherein the vector fragment in the sequencing library is removed after constructing the sequencing library or during the process of constructing the sequencing library;
16、 根据权利要求 12所述的方法, 其特征在于, 在将所述基因组克隆文库中的 DNA 进行片段化后, 除去所述测序文库中的载体片段。  16. The method according to claim 12, wherein after the DNA in the genomic clone library is fragmented, the vector fragment in the sequencing library is removed.
17、 根据权利要求 12所述的方法, 其特征在于, 利用 Solexa测序仪进行测序。  17. Method according to claim 12, characterized in that the sequencing is carried out using a Solexa sequencer.
18、 一种用于基因组测序的试剂盒, 其特征在于, 包括:  18. A kit for genome sequencing, comprising:
载体, 所述载体用于构建基因组克隆文库;  a vector for constructing a genomic clone library;
经标记的探针, 所述探针能够与所述载体片段进行杂交; 以及  a labeled probe capable of hybridizing to the vector fragment;
分子实体, 所述分子实体能够特异性结合所述探针上的标记。  A molecular entity that is capable of specifically binding a label on the probe.
19、 根据权利要求 18所述的试剂盒, 其特征在于, 所述载体是 Fosmid载体。  The kit according to claim 18, wherein the carrier is a Fosmid carrier.
20、根据权利要求 18所述的试剂盒,其特征在于,所述探针是经过生物素标记的探针, 并且所述分子实体是亲和素。  20. A kit according to claim 18 wherein the probe is a biotinylated probe and the molecular entity is avidin.
21、 根据权利要求 20所述的试剂盒, 其特征在于, 所述分子实体是链霉亲和素。 The kit according to claim 20, wherein the molecular entity is streptavidin.
22、 根据权利要求 21所述的试剂盒, 其特征在于, 所述分子实体为形成于磁珠上的链 霉亲和素。 The kit according to claim 21, wherein the molecular entity is streptavidin formed on the magnetic beads.
23、 根据权利要求 18所述的试剂盒, 其特征在于, 进一步包括接头和接头封闭剂。  23. The kit of claim 18, further comprising a linker and a linker blocking agent.
PCT/CN2011/084303 2010-12-22 2011-12-20 Methods for removal of vector fragments in sequencing library and uses thereof WO2012083845A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201010600214.4A CN102559856B (en) 2010-12-22 2010-12-22 Method for deleting vector segments in sequencing library
CN201010600214.4 2010-12-22

Publications (1)

Publication Number Publication Date
WO2012083845A1 true WO2012083845A1 (en) 2012-06-28

Family

ID=46313172

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2011/084303 WO2012083845A1 (en) 2010-12-22 2011-12-20 Methods for removal of vector fragments in sequencing library and uses thereof

Country Status (3)

Country Link
CN (1) CN102559856B (en)
HK (1) HK1169453A1 (en)
WO (1) WO2012083845A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102855398B (en) * 2012-08-28 2016-03-02 中国科学院自动化研究所 The acquisition methods of the potential associated gene of the disease based on Multi-source Information Fusion
WO2015042980A1 (en) * 2013-09-30 2015-04-02 深圳华大基因科技有限公司 Method, system, and computer-readable medium for determining snp information in a predetermined chromosomal region
ITUA20162640A1 (en) * 2016-04-15 2017-10-15 Menarini Silicon Biosystems Spa METHOD AND KIT FOR THE GENERATION OF DNA LIBRARIES FOR PARALLEL MAXIMUM SEQUENCING

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090325239A1 (en) * 2006-01-04 2009-12-31 Si Lok Methods for nucleic acid mapping and identification of fine-structural-variations in nucleic acids

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090325239A1 (en) * 2006-01-04 2009-12-31 Si Lok Methods for nucleic acid mapping and identification of fine-structural-variations in nucleic acids

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
DONG, XIAOYI ET AL.: "Screening, Identifying and Function Analysis of Polyketide Synthase I Cluster from the Environmental Strain X-2 Which Produce Macrolactins", MICROBIOLOGY, vol. 35, no. 9, 2008, pages 1367 - 1372 *
KIM ET AL.: "Cloning, Sequencing, and Characterization of the Pradimicin Biosynthetic Gene Cluster of Actinomadura hibisca P157-2", J. MICROBIOL. BIOTECHNOL, vol. 17, no. 5, 2007, pages 830 - 839 *

Also Published As

Publication number Publication date
CN102559856B (en) 2014-03-12
HK1169453A1 (en) 2013-01-25
CN102559856A (en) 2012-07-11

Similar Documents

Publication Publication Date Title
CN113166797B (en) Nuclease-based RNA depletion
JP7377247B2 (en) Methods for targeted genomic analysis
JP7011392B2 (en) Methods and compositions for DNA profiling
JP7407227B2 (en) Methods and probes for identifying gene alleles
JP6525473B2 (en) Compositions and methods for identifying replicate sequencing leads
EP2906715B1 (en) Compositions, methods, systems and kits for target nucleic acid enrichment
EP2616555B1 (en) Capture probes immobilizable via l-nucleotide tail
EP2619329A1 (en) Direct capture, amplification and sequencing of target dna using immobilized primers
WO2010117817A2 (en) Methods for generating target specific probes for solution based capture
US9677122B2 (en) Integrated capture and amplification of target nucleic acid for sequencing
WO2013079649A1 (en) Method and kit for characterizing rna in a composition
US10590451B2 (en) Methods of constructing a circular template and detecting DNA molecules
WO2012083845A1 (en) Methods for removal of vector fragments in sequencing library and uses thereof
CN114787385A (en) Methods and systems for detecting nucleic acid modifications
Briggs Rapid retrieval of DNA target sequences by primer extension capture
JP2023517571A (en) Novel nucleic acid template structures for sequencing

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11851659

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11851659

Country of ref document: EP

Kind code of ref document: A1