WO2017190487A1 - Primer set for amplifying multiple target dna sequences in sample and use thereof - Google Patents

Primer set for amplifying multiple target dna sequences in sample and use thereof Download PDF

Info

Publication number
WO2017190487A1
WO2017190487A1 PCT/CN2016/106608 CN2016106608W WO2017190487A1 WO 2017190487 A1 WO2017190487 A1 WO 2017190487A1 CN 2016106608 W CN2016106608 W CN 2016106608W WO 2017190487 A1 WO2017190487 A1 WO 2017190487A1
Authority
WO
WIPO (PCT)
Prior art keywords
sequence
specific
primer
target
modification
Prior art date
Application number
PCT/CN2016/106608
Other languages
French (fr)
Chinese (zh)
Inventor
屈武斌
蔡万世
易建明
杭兴宜
Original Assignee
艾吉泰康生物科技(北京)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 艾吉泰康生物科技(北京)有限公司 filed Critical 艾吉泰康生物科技(北京)有限公司
Priority to AU2016405379A priority Critical patent/AU2016405379A1/en
Publication of WO2017190487A1 publication Critical patent/WO2017190487A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Analytical Chemistry (AREA)
  • Microbiology (AREA)
  • Immunology (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Provided are a primer set for amplifying multiple target DNA sequences in a sample and a method for amplifying multiple target DNA sequences in a sample using the primer set. The primer set comprises multiple pairs of upstream and downstream specific primers for each target DNA sequence. Each pair of upstream and downstream specific primers comprises specific sequences for the target sequence, and the specific sequences satisfy with the following conditions: (1) no amplification occurs between the specific sequences and sequences other than the target sequence; (2) no dimer forms between the specific sequences; and (3) no hairpin structure forms. A consensus sequence that is not homologous to the genome is linked to the 5' terminus of the specific sequence. The 3' terminus of the specific sequence has a modification for increasing the steric hindrance, and the modification does not hinder the specific sequence from binding to a perfectly matching template with the specific sequence and elongating, but basically hinders the specific sequence from binding to an imperfect matching template and elongating.

Description

用于扩增样品中多个目标DNA序列的引物组及其应用Primer set for amplifying multiple target DNA sequences in a sample and application thereof 技术领域Technical field
本发明涉及核酸序列的捕获、富集与分析。更具体来说,本发明涉及基于多重PCR的目标序列富集方法。The invention relates to the capture, enrichment and analysis of nucleic acid sequences. More specifically, the present invention relates to a multiplex PCR-based target sequence enrichment method.
背景技术Background technique
全基因组测序可以获得全基因组水平范围的突变、插入、缺失以及结构变异。然而,由于基因组容量巨大,以30×进行测序就会产生接近100G的数据量。而肿瘤等相关的低突变频率测序则需要至少1000×的覆盖度,如果进行全基因组测序,则会产生3000G的数据量。这样规模的数据量除了会对数据的分析工作造成极大的困难,还会显著增加测序的成本,进而制约测序的应用。为了解决这个难题,目标区域捕获技术应运而生。Whole genome sequencing allows for mutations, insertions, deletions, and structural variations at the genome-wide level. However, due to the large genomic capacity, sequencing at 30x yields data volumes approaching 100G. Sequencing of low-mutation frequencies associated with tumors requires at least 1000× coverage, and if genome-wide sequencing is performed, it will produce 3000G of data. In addition to the enormous difficulty of data analysis, the amount of data of this scale will significantly increase the cost of sequencing, which in turn will limit the application of sequencing. In order to solve this problem, the target area capture technology came into being.
目标区域捕获技术是指通过特定的技术手段定向捕获目标区域的核酸序列,然后进行建库测序,以达到在对目标区域进行深度测序的目的的同时使得测序成本大大降低。PCR是一种常见的用于富集目标区域的技术,更为常见的是利用多重PCR技术一次性的捕获多个目标区域。多重PCR适用于热点区域或者长度较小的目标区域的捕获。制约多重PCR技术应用的两个重要因素是非特异扩增和二聚体的产生。Target region capture technology refers to the targeted capture of the nucleic acid sequence of the target region by a specific technical means, and then the library is sequenced to achieve the purpose of deep sequencing of the target region, and the sequencing cost is greatly reduced. PCR is a common technique for enriching target regions, and it is more common to capture multiple target regions at once using multiplex PCR techniques. Multiplex PCR is suitable for capture of hotspots or target areas of smaller length. Two important factors that limit the application of multiplex PCR techniques are non-specific amplification and dimer production.
因此,本领域中需要能有效降低非特异扩增和二聚体产生的多重PCR技术出现。Therefore, there is a need in the art for multiplex PCR techniques that are effective in reducing non-specific amplification and dimer production.
发明内容Summary of the invention
本发明提供了基于多重PCR扩增的目标序列富集方法,所述方法包括:兼容性多重PCR引物的筛选;进行第一轮特异性多重PCR扩增;进行第二轮通用引物扩增富集;回收产物并上机测序。The invention provides a target sequence enrichment method based on multiplex PCR amplification, which comprises: screening of compatible multiplex PCR primers; performing the first round of specific multiplex PCR amplification; performing a second round of universal primer amplification and enrichment The product was recovered and sequenced on the machine.
因此,在第一方面,本发明提供了用于扩增样品中多个目标DNA序列的引物组,所述引物组包括针对每个目标DNA序列的多对上下游特异性引物,其中:Thus, in a first aspect, the invention provides a primer set for amplifying a plurality of target DNA sequences in a sample, the primer set comprising a plurality of pairs of upstream and downstream specific primers for each target DNA sequence, wherein:
a)每对所述上下游特异性引物包括针对目标序列的特异性序列,所有特异性序列之间满足如下条件:(1)每个特异性序列与目标序列之外的序列不发生扩增,(2)特异性序列之间不形成二聚体,(3)特异性序列不形成发卡结构;a) each pair of the upstream and downstream specific primers includes a specific sequence for the target sequence, and the following conditions are satisfied between all the specific sequences: (1) each specific sequence does not undergo amplification with a sequence other than the target sequence, (2) no dimer is formed between the specific sequences, and (3) the specific sequence does not form a hairpin structure;
b)在所述特异性序列的5’端连接有与基因组不同源的通用序列;b) a universal sequence linked to a different source of the genome at the 5' end of the specific sequence;
c)在所述特异性序列的3’端的碱基处有增加空间位阻的修饰,所述修饰不阻断其与所述特异性序列完全匹配的模板的结合与延伸,但基本阻断其与不完全匹配的模板的结合与延伸。c) a modification that increases steric hindrance at the base 3' end of the specific sequence, the modification does not block the binding and extension of the template that exactly matches the specific sequence, but substantially blocks it Combination and extension of templates that do not exactly match.
在一个具体的实施方案中,所述特异性序列之间满足如下条件:(1)所述特异性序列与目标区域的Tm-与非目标区域的Tm≥5℃,优选≥10℃;(2)所述 特异性序列与目标区域的Tm-与其他特异性序列形成二聚体的Tm≥5℃,优选≥10℃;(3)所述特异性序列与目标区域的Tm-形成发卡结构的Tm≥5℃,优选≥10℃,优选Tm的值基于SantaLucia 2007热力学参数表的最邻近法计算。In a specific embodiment, the specific sequence satisfies the following conditions: (1) the specific sequence and the Tm- and non-target regions of the target region have a Tm ≥ 5 ° C, preferably ≥ 10 ° C; ) The specific sequence and the Tm- of the target region form a dimer with other specific sequences with a Tm ≥ 5 ° C, preferably ≥ 10 ° C; (3) the specific sequence and the target region Tm - form a hairpin structure Tm ≥ 5 The value of °C, preferably ≥10 °C, preferably Tm, is calculated based on the nearest neighbor method of the SantaLucia 2007 thermodynamic parameter table.
在本发明中,所述特异性序列的3’端的碱基处修饰包括在所述特异性序列的3’端-2、-3位碱基、核糖或磷酸二酯键上;在优选的实施方案中,所述特异性序列的3’端的5个碱基的GC含量大于50%,即有三个或三个以上的碱基为C或G,所述特异性序列的3’端的碱基处修饰还包括在所述特异性序列的3’端-4位碱基、核糖或磷酸二酯键处。即,在优选的实施方案中,在所述特异性序列的3’端的5个碱基的GC含量大于50%的情况下,即有三个或三个以上的碱基为C或G,所述特异性序列的3’端的碱基处修饰还包括在所述特异性序列的3’端-2、-3、-4处位碱基、核糖或磷酸二酯键处。In the present invention, the modification at the base of the 3' end of the specific sequence is included at the 3' end-2, -3 base, ribose or phosphodiester bond of the specific sequence; in a preferred embodiment In the scheme, the GC content of the 5 bases at the 3' end of the specific sequence is greater than 50%, that is, three or more bases are C or G, and the base of the 3' end of the specific sequence is Modifications are also included at the 3' end-4 base, ribose or phosphodiester bond of the specific sequence. That is, in a preferred embodiment, in the case where the GC content of the 5 bases at the 3' end of the specific sequence is greater than 50%, that is, three or more bases are C or G, Modifications at the base of the 3' end of the specific sequence are also included at the 3' end-2, -3, -4 position base, ribose or phosphodiester bond of the specific sequence.
在一个具体的实施方案中,所述增加空间位阻的修饰选自:脱氧次黄嘌呤(dI)、脱氧尿嘧啶(dU)、5-Methyl dC、2′-O-Me-dC、磷酸基团、硫代基团、地高辛、生物素、AminolinkerC7、BHQ1、BHQ2、Dabcyl、JOE、ROX、FAM、TAMRA、烷基基团、氟代基团、氨基基团和Thiol-C3S-S。In a specific embodiment, the modification to increase steric hindrance is selected from the group consisting of: deoxyhypoxanthine (dI), deoxyuracil (dU), 5-Methyl dC, 2'-O-Me-dC, phosphate Group, thio group, digoxin, biotin, Aminolinker C7, BHQ1, BHQ2, Dabcyl, JOE, ROX, FAM, TAMRA, alkyl group, fluoro group, amino group and Thiol-C3S-S.
在一个具体的实施方案中,所述3’端的碱基处增加空间位阻的修饰包括3’端-1、-2、-3位碱基处的硫代修饰。In a specific embodiment, the modification at the 3' end of the base that increases steric hindrance comprises a thio modification at the 3' end-1, -2, -3 base.
在一个具体的实施方案中,所述特异性序列的3’端的5个碱基的GC含量大于50%,在所述特异性序列的3’端-4位碱基处有硫代修饰。In a specific embodiment, the 5 base of the 3' end of the specific sequence has a GC content greater than 50% and a thio modification at the 3' base to the base of the specific sequence.
在一个具体的实施方案中,所述特异性引物序列有一致的热力学参数,优选Tm标准差≤5℃;更优选Tm标准差≤2℃;最优选Tm标准差≤1℃。Tm标准差是所述所有特异性引物序列与相应目标DNA序列之间的Tm的标准差。In a specific embodiment, the specific primer sequence has a uniform thermodynamic parameter, preferably a Tm standard deviation < 5 °C; more preferably a Tm standard deviation < 2 °C; most preferably a Tm standard deviation < 1 °C. The standard deviation of Tm is the standard deviation of the Tm between all of the specific primer sequences and the corresponding target DNA sequence.
在第二方面,本发明提供了一种扩增样品中多个目标DNA序列的方法,所述方法包括:In a second aspect, the invention provides a method of amplifying a plurality of target DNA sequences in a sample, the method comprising:
a)提供包含目标DNA序列和非目标序列的样本、本发明第一方面的引物组和与所述引物组中特异性引物5’端通用序列互补的通用引物对;a) providing a sample comprising the target DNA sequence and the non-target sequence, the primer set of the first aspect of the invention, and a universal primer pair complementary to the universal sequence of the 5' end of the specific primer in the primer set;
b)以所述引物组中的特异性引物进行PCR反应,扩增所述样本中的目标DNA序列,所述PCR反应的退火温度按照从高到低的阶梯进行,例如在一个退火过程中使用等差的3个温度(例如60℃、59℃、58℃)进行退火;b) performing a PCR reaction with a specific primer in the primer set to amplify a target DNA sequence in the sample, and the annealing temperature of the PCR reaction is performed in a step from high to low, for example, in an annealing process. Annealing at three temperatures of equal difference (eg 60 ° C, 59 ° C, 58 ° C);
c)以所述通用引物对再次扩增步骤b)中的扩增产物,进一步富集所述扩增产物。c) amplifying the amplification product in step b) with the universal primer pair to further enrich the amplification product.
在一个具体的实施方案中,所述方法还包括步骤d)对步骤c)得到扩增产物进行测序。In a specific embodiment, the method further comprises the step of d) sequencing the amplified product obtained in step c).
在一个具体的实施方案中,所述方法还包括步骤b’):回收步骤b)中富集的扩增产物。In a specific embodiment, the method further comprises the step b') of recovering the amplified product enriched in step b).
在一个具体的实施方案中,所述回收是通过采用磁珠对第一轮PCR反应产物进行片段筛选和纯化,去除目标区域外的大片段、基因组DNA、引物二聚体、引物和其他反应成分,得到目标区域序列的PCR产物。In a specific embodiment, the recovery is by fragment screening and purification of the first round of PCR reaction products using magnetic beads to remove large fragments, genomic DNA, primer dimers, primers and other reactive components outside the target region. , to obtain a PCR product of the sequence of the target region.
在一个具体的实施方案中,所述多个退火温度是3个温度,例如60℃、59℃、58℃。In a specific embodiment, the plurality of annealing temperatures are 3 temperatures, such as 60 ° C, 59 ° C, 58 ° C.
具体实施方式detailed description
本发明提供了基于多重PCR扩增的目标序列富集方法,所述方法包括:兼 容性多重PCR引物的筛选;第一轮特异性多重PCR扩增;第二轮通用引物扩增富集;回收产物并上机测序,达到检测的目的。因此,本发明提供了用于扩增样品中多个目标DNA序列的引物组以及使用所述引物组扩增样品中多个目标DNA序列的方法。The invention provides a target sequence enrichment method based on multiplex PCR amplification, the method comprising: Screening of capacitive multiplex PCR primers; first round of specific multiplex PCR amplification; second round of universal primer amplification and enrichment; recovery of products and sequencing on the machine to achieve the purpose of detection. Accordingly, the present invention provides a primer set for amplifying a plurality of target DNA sequences in a sample and a method of amplifying a plurality of target DNA sequences in the sample using the primer set.
在本发明中,本发明的多重PCR引物组中的引物优选具有如下特性:In the present invention, the primers in the multiplex PCR primer set of the present invention preferably have the following characteristics:
1.具有特异性,即在所述多重PCR体系中,同一反应体系内的所有引物,除了目标序列之外,不会扩增其他非目标序列。所述具有特异性的引物的设计方法是:首先对任意一对引物进行全基因组in sillco扩增分析,将预测到的扩增产物与目标扩增产物进行比较,如果预测产物中有非目标产物,并且这些非目标预测产物和目标产物之间的热力学参数相近,那么判定这对引物有非特异扩增;如果这些非目标预测产物和目标产物的热力学参数相差较大,那么可认为不会产生非特异扩增。热力学参数相差的判定的标准为:Tm(与目标产物)-Tm(与非目标产物)≥5℃,优选Tm(与目标产物)-Tm(与非目标产物)≥10℃;另外,不同的热力学计算方法以及参数会对计算结果有影响,本发明中优选基于SantaLucia 2007热力学参数表的最邻近法计算;1. Is specific, that is, in the multiplex PCR system, all primers in the same reaction system do not amplify other non-target sequences except the target sequence. The specific primer is designed by first performing a genome-wide sillco amplification analysis on any pair of primers, and comparing the predicted amplification product with the target amplification product if there is a non-target product in the predicted product. And the thermodynamic parameters between the non-target predicted products and the target products are similar, then the non-specific amplification of the primers is determined; if the thermodynamic parameters of the non-target predicted products and the target products are different, then it is considered that no Non-specific amplification. The criteria for determining the difference in thermodynamic parameters are: Tm (with target product) - Tm (with non-target product) ≥ 5 ° C, preferably Tm (with target product) - Tm (with non-target product) ≥ 10 ° C; in addition, different The thermodynamic calculation method and parameters have an influence on the calculation result, and the nearest neighbor method based on the SantaLucia 2007 thermodynamic parameter table is preferably used in the present invention;
2.无二聚体产生,即在所述多重PCR体系中,同一反应体系内的所有引物两两之间不能形成稳定的二聚体,判定的标准为:Tm(与目标产物)-Tm(二聚体)≥5℃,优选Tm(与目标产物)-Tm(二聚体)≥10℃;另外,不同的热力学计算方法以及参数会对计算结果有影响,本发明中优选基于SantaLucia 2007热力学参数表的最邻近法计算;2. No dimer production, that is, in the multiplex PCR system, stable dimers cannot be formed between all the primers in the same reaction system, and the criterion is: Tm (with target product)-Tm ( Dimer) ≥ 5 ° C, preferably Tm (with target product) - Tm (dimer) ≥ 10 ° C; in addition, different thermodynamic calculation methods and parameters will have an impact on the calculation results, preferably based on SantaLucia 2007 thermodynamics in the present invention The nearest neighbor method of the parameter table is calculated;
3.无发卡结构产生,即在所述多重PCR体系中任何引物自身都不形成稳定的发卡结构,判定的标准为:Tm(与目标产物)-Tm(发卡结构)≥5℃,优选Tm(与目标产物)-Tm(发卡结构)≥10℃;另外,不同的热力学计算方法以及参数会对计算结果有影响,本发明中优选基于SantaLucia 2007热力学参数表的最邻近法计算;3. No hairpin structure is produced, that is, any primer in the multiplex PCR system does not form a stable hairpin structure itself, and the criterion is: Tm (with target product)-Tm (hairpin structure) ≥ 5 ° C, preferably Tm ( And the target product) -Tm (hairpin structure) ≥ 10 ° C; in addition, different thermodynamic calculation methods and parameters have an impact on the calculation results, in the present invention, the nearest neighbor method based on the Santa Lucia 2007 thermodynamic parameter table is preferably calculated;
4.有一致的热力学参数,即在所述多重PCR体系中,同一反应体系内的所有引物具有相同或相似的Tm值,优选Tm标准差≤5℃,更优选Tm标准差≤2℃,最优选Tm标准差≤1℃;4. There are consistent thermodynamic parameters, that is, in the multiplex PCR system, all primers in the same reaction system have the same or similar Tm values, preferably Tm standard deviation ≤ 5 ° C, more preferably Tm standard deviation ≤ 2 ° C, most Preferably, the standard deviation of Tm is ≤ 1 ° C;
5.扩增产物长度在100-300bp的范围,优选150-250bp,最优选180-220bp;5. The length of the amplified product is in the range of 100-300 bp, preferably 150-250 bp, and most preferably 180-220 bp;
6.在所有引物的5’端连接有与基因组不同源的通用序列。6. A universal sequence with a different source than the genome is ligated to the 5' end of all primers.
在本发明的另一个实施方案中,在引物序列3’端增加空间位阻的方法包括:In another embodiment of the invention, the method of increasing steric hindrance at the 3' end of the primer sequence comprises:
1.在所述特异性引物序列的3’端添加能增加空间位阻的修饰,所述修饰不阻断其对与所述特异性引物序列完全匹配的模板的结合与延伸,但基本阻断其对与所述特异性引物序列不完全匹配的模板的结合与延伸,所述修饰如脱氧次黄嘌呤(dI)、脱氧尿嘧啶(dU)、5-Methyl dC、2′-O-Me-dC、磷酸基团、硫代基团、地高辛、生物素、AminolinkerC7、BHQ1、BHQ2、Dabcyl、JOE、ROX、FAM、TAMRA、烷基基团、氟代基团、氨基基团和Thiol-C3 S-S。其中,修饰基团AminolinkerC7、BHQ1、BHQ2、Dabcyl、JOE、ROX、FAM、TAMR和Thiol-C3 S-S为基团名称的简写形式,为引物合成行业公认的名称。1. Adding a modification at the 3' end of the specific primer sequence that increases steric hindrance, which does not block its binding and extension to a template that exactly matches the specific primer sequence, but blocks It binds and extends to a template that does not exactly match the specific primer sequence, such as deoxygenated hypoxanthine (dI), deoxyuracil (dU), 5-Methyl dC, 2'-O-Me- dC, phosphate group, thio group, digoxin, biotin, Aminolinker C7, BHQ1, BHQ2, Dabcyl, JOE, ROX, FAM, TAMRA, alkyl group, fluoro group, amino group and Thiol- C3 SS. Among them, the modifying groups Aminolinker C7, BHQ1, BHQ2, Dabcyl, JOE, ROX, FAM, TAMR and Thiol-C3 S-S are shorthand for the name of the group and are recognized by the industry for primer synthesis.
2.一般在所述特异性引物序列的3’端-2、-3位碱基处增加修饰,优选所述修饰是-1、-2、-3位碱基处增加修饰;2. Generally, the modification is added at the 3' end-2, -3 base of the specific primer sequence, preferably the modification is an increase at -1, -2, -3 bases;
3.当所述特异性引物序列的3’端的5个碱基的GC含量大于50%时,在所述特异性序列的3’端-4位碱基处增加修饰,例如硫代修饰。 3. When the GC content of the 5 bases at the 3' end of the specific primer sequence is greater than 50%, a modification such as a thio modification is added at the 3' base to the base of the specific sequence.
在本发明中,3’端-1、-2、-3位是指3’端往5’数第1、2、3位。以此类推。In the present invention, the 3' end-1, -2, and -3 positions mean that the 3' end is at the 1st, 2nd, and 3rd positions in the 5' number. And so on.
在本发明中,使用如下基团进行碱基修饰是为了不阻断其对与所述特异性引物序列完全匹配的模板的结合与延伸,但基本阻断其对与所述特异性引物不完全匹配的模板的结合与延伸:脱氧次黄嘌呤(dI)、脱氧尿嘧啶(dU)、5-Methyl dC、2′-O-Me-dC、磷酸基团、硫代基团、地高辛、生物素、AminolinkerC7、BHQ1、BHQ2、Dabcyl、JOE、ROX、FAM、TAMRA、烷基基团、氟代基团、氨基基团、Thiol-C3S-S等。所述修饰是在磷酸二酯键上,糖基或碱基上通过化学合成的方法,添加一些化学基团,目的是降低碱基配对的稳定性。在磷酸键上或者糖基或碱基上添加化学基团是本领域中已知的。例如,所述脱氧次黄嘌呤(dI)连接到引物磷酸二酯键上、脱氧尿嘧啶(dU)连接到引物的磷酸二酯键上、甲基基团连接到引物脱氧核糖胞嘧啶的5’(5-Methyl dC)、甲基基团连接到脱氧核糖胞嘧啶的2’(2′-O-Me-dC)、磷酸基团连接到引物3’的磷酸二酯键上、硫代基团连接到引物3’的磷酸二酯键上、地高辛基团连接到引物3’的磷酸二酯键上、生物素基团连接到引物3’的磷酸二酯键上、AminolinkerC7连接到引物3’的磷酸二酯键上、BHQ1连接到引物3’的磷酸二酯键上、BHQ2连接到引物的3’磷酸二酯键上、Dabcyl连接到引物的磷酸二酯键上、JOE连接到引物3’的磷酸二酯键上、ROX连接到引物3’的磷酸二酯键上、FAM连接到引物的磷酸二酯键上、TAMRA连接到引物3’的磷酸二酯键上、烷基基团连接到引物脱氧核糖鸟嘌呤的6’、氟代基团连接到脱氧核糖的2’、氨基基团连接到引物脱氧核糖的2’、巯基基团连接到引物3’磷酸二酯键上(Thiol-C3S-S)。In the present invention, base modification is carried out using a group which does not block its binding and extension to a template which completely matches the specific primer sequence, but substantially blocks its incompleteness with the specific primer. Binding and extension of matched templates: deoxygenated hypoxanthine (dI), deoxyuracil (dU), 5-Methyl dC, 2'-O-Me-dC, phosphate groups, thio groups, digoxin, Biotin, Aminolinker C7, BHQ1, BHQ2, Dabcyl, JOE, ROX, FAM, TAMRA, alkyl group, fluoro group, amino group, Thiol-C3S-S, and the like. The modification is by chemical synthesis on a phosphodiester bond, a sugar group or a base, and some chemical groups are added in order to reduce the stability of base pairing. The addition of a chemical group on a phosphate bond or a glycosyl or base is known in the art. For example, the deoxyhypoxanthine (dI) is attached to the primer phosphodiester bond, the deoxyuracil (dU) is attached to the phosphodiester bond of the primer, and the methyl group is attached to the 5' of the primer deoxyribose cytosine. (5-Methyl dC), a methyl group attached to the 2' (2'-O-Me-dC) of deoxyribose cytosine, a phosphate group attached to the phosphodiester bond of the primer 3', a thio group Attached to the phosphodiester bond of primer 3', the digoxigenin group is attached to the phosphodiester bond of primer 3', the biotin group is attached to the phosphodiester bond of primer 3', and Aminolinker C7 is linked to primer 3 ' on the phosphodiester bond, BHQ1 is attached to the phosphodiester bond of primer 3', BHQ2 is attached to the 3' phosphodiester bond of the primer, Dabcyl is attached to the phosphodiester bond of the primer, and JOE is attached to primer 3 On the phosphodiester bond, ROX is attached to the phosphodiester bond of primer 3', FAM is attached to the phosphodiester bond of the primer, TAMRA is attached to the phosphodiester bond of primer 3', and the alkyl group is attached. To the 6' of the primer deoxyribose guanine, the fluoro group is attached to the 2' of the deoxyribose, and the amino group is attached to the primer The 2', thiol group of deoxyribose is attached to the 3' phosphodiester bond of the primer (Thiol-C3S-S).
在本发明中,序列的Tm的计算不拘泥于具体的方法,各种方法计算的Tm值均可以用于本发明,各种方法得到的Tm值基本不能逆转本发明的效果,只是效果的程度会有差异。虽然SantaLucia 2007热力学参数表的最邻近法可以计算Tm,但其他方法计算的Tm值可以与之相对应,本领域技术人员可以经过简单的试验比较各种方法计算得到的Tm,从而对各种方法计算的Tm值作出适当选择。In the present invention, the calculated T m of the sequence is not held to a particular method, various methods of calculating the Tm value may be used in the present invention, the Tm value obtained by various methods not substantially reverse the effects of the present invention, but the effect of The degree will vary. Although the nearest neighbor method of the SantaLucia 2007 thermodynamic parameter table can calculate Tm, the Tm value calculated by other methods can correspond to it, and those skilled in the art can compare the Tm calculated by various methods through simple experiments, thereby The calculated Tm value is appropriately selected.
根据发明人的经验,对于人基因组编码区而言,超过99%的目标区域均可以设计出适合本发明的引物序列,表明我们前述对引物组的过滤都是合理的。According to the experience of the inventors, for the human genome coding region, more than 99% of the target regions can be designed to be suitable for the primer sequences of the present invention, indicating that our previous filtration of the primer set is reasonable.
在发明中,术语“样品”以其最广泛的意思使用,其意在包括从任何来源,优选从生物来源获得的样本或培养物。生物样品可从动物(包括人)获得,并包括液体、固体、组织和气体。生物样品包括血液制品,例如血浆、血清等等。因此,“核酸样品”包含任何来源的核酸(例如DNA、RNA、cDNA、mRNA、tRNA、miRNA等)。在所述核酸样品是RNA或mRNA的情况下,中步骤c)之前有将所述RNA或mRNA反转录成DNA的步骤。在本申请中,核酸样品优选源自生物来源,例如人或非人细胞、组织等等。术语“非人”系指所有非人动物和实体,包括但不限于,脊椎动物例如啮齿动物、非人灵长动物、绵羊、牛、反刍动物、兔类动物、猪、山羊、马、犬、猫、鸟类等等。非人还包括无脊椎动物和原核生物,例如细菌、植物、酵母、病毒等等。因此,用于本发明的方法和系统的核酸样品为源自任何生物,无论真核或者原核的核酸样品。In the invention, the term "sample" is used in its broadest sense and is intended to include a sample or culture obtained from any source, preferably from a biological source. Biological samples are available from animals, including humans, and include liquids, solids, tissues, and gases. Biological samples include blood products such as plasma, serum, and the like. Thus, a "nucleic acid sample" comprises nucleic acids of any origin (eg, DNA, RNA, cDNA, mRNA, tRNA, miRNA, etc.). In the case where the nucleic acid sample is RNA or mRNA, there is a step of reverse transcription of the RNA or mRNA into DNA prior to step c). In the present application, the nucleic acid sample is preferably derived from a biological source, such as a human or non-human cell, tissue, and the like. The term "non-human" refers to all non-human animals and entities including, but not limited to, vertebrates such as rodents, non-human primates, sheep, cattle, ruminants, rabbits, pigs, goats, horses, dogs, Cats, birds, etc. Non-humans also include invertebrates and prokaryotes, such as bacteria, plants, yeast, viruses, and the like. Thus, nucleic acid samples for use in the methods and systems of the invention are nucleic acid samples derived from any organism, whether eukaryotic or prokaryotic.
某些实施方案中,引物与目标核酸之间的杂交在优选地严格条件下进行,所述严格条件足以支持所述核酸之间的杂交,其中所述核酸包含连接化合物和所述目标核酸样品的互补区域,以提供所述核酸杂交复合物。所述复合物随后通过所述连接化合物捕获,并在足以去除非特应性结合核酸的条件下洗涤,然后所杂交的目标核酸序列从所捕获的核酸复合物中洗脱。 In certain embodiments, the hybridization between the primer and the target nucleic acid is carried out under preferably stringent conditions sufficient to support hybridization between the nucleic acids, wherein the nucleic acid comprises a linking compound and the target nucleic acid sample A complementary region to provide the nucleic acid hybridization complex. The complex is then captured by the linker compound and washed under conditions sufficient to remove the non-atopic binding nucleic acid, and the hybridized target nucleic acid sequence is then eluted from the captured nucleic acid complex.
在某些实施方案中,所述核酸包含化学基团或连接化合物,例如结合部分例如生物素、地高辛等等,其能结合于固体载体。所述固体载体可以包含相应的捕获化合物,例如用于生物素的链霉亲和素或用于地高辛的地高辛抗体。本发明不限于所使用的连接化合物,并且替代的连接化合物等同适用于本发明的方法、诱饵序列和试剂盒。In certain embodiments, the nucleic acid comprises a chemical group or a linking compound, such as a binding moiety such as biotin, digoxin, or the like, which is capable of binding to a solid support. The solid support may comprise a corresponding capture compound, such as streptavidin for biotin or a digoxin antibody for digoxin. The invention is not limited to the linking compounds used, and alternative linking compounds are equally suitable for use in the methods, bait sequences and kits of the invention.
在本发明的实施方案中,所述多个目标核酸分子优选包含一种生物的全基因组或至少一条染色体或一种任意大小分子量的核酸分子。优选地,所述核酸分子的大小至少约200kb、至少约500kb、至少约1Mb、至少约2Mb、或至少约5Mb,更优选大小约100kb至约5Mb、约200kb至约5Mb、约500kb至约5Mb、约1Mb至约2Mb或约2Mb至约5Mb。In an embodiment of the invention, the plurality of target nucleic acid molecules preferably comprise a whole genome of an organism or at least one chromosome or a nucleic acid molecule of any size. Preferably, the nucleic acid molecule is at least about 200 kb in size, at least about 500 kb, at least about 1 Mb, at least about 2 Mb, or at least about 5 Mb, more preferably from about 100 kb to about 5 Mb, from about 200 kb to about 5 Mb, from about 500 kb to about 5 Mb. From about 1 Mb to about 2 Mb or from about 2 Mb to about 5 Mb.
在某些实施方案中,所述目标核酸来自动物、植物或微生物,在优选的实施方案中,所述目标核酸分子选来自人。如果核酸样品的量比较少(例如某些情况下取得的人核酸样品,例如发育中的胎儿的基因组),在实施本发明的方法之前可扩增所述核酸,例如通过全基因组扩增。为进行本发明的方法,预先扩增可能是必须的,例如在法医应用中(例如在法医学中用于遗传特征目的)。In certain embodiments, the target nucleic acid is from an animal, plant or microorganism, and in a preferred embodiment, the target nucleic acid molecule is selected from a human. If the amount of nucleic acid sample is relatively small (e.g., a human nucleic acid sample obtained in some cases, such as the genome of a developing fetus), the nucleic acid can be amplified prior to performing the methods of the invention, such as by whole genome amplification. Pre-amplification may be necessary for performing the methods of the invention, such as in forensic applications (e.g., for use in genetics for forensic purposes).
在某些实施方案中,所述多个目标核酸分子为一组基因组DNA分子。所述诱饵序列可选自例如限定来自多个遗传基因座的多种外显子、内含子或调控序列的多个诱饵序列;限定至少一个单独遗传基因座的全序列的多个诱饵序列,所述基因座大小任意,优选至少1Mb,或至少上述特定大小之一;限定单核苷酸多态性(SNP)的多种诱饵序列;或限定一种阵列的多种诱饵序列,例如设计为捕获至少一条完整染色体的全序列的嵌合阵列。In certain embodiments, the plurality of target nucleic acid molecules are a set of genomic DNA molecules. The bait sequence may be selected, for example, from a plurality of decoy sequences defining a plurality of exons, introns or regulatory sequences from a plurality of genetic loci; a plurality of decoy sequences defining a full sequence of at least one individual genetic locus, Said locus is of any size, preferably at least 1 Mb, or at least one of the above specified sizes; a plurality of decoy sequences defining a single nucleotide polymorphism (SNP); or a plurality of bait sequences defining an array, for example designed as A chimeric array of full sequences of at least one complete chromosome is captured.
在本文中,术语“杂交”系指互补核酸的配对。杂交和杂交强度(例如核酸之间结合的强度)受多种因素的影响,例如核酸之间互补的程度、使用杂交条件的严格程度、所形成杂交体的解链温度(Tm)以及核酸的GC含量值。虽然本发明不受限于具体的杂交条件,但优选使用严格的杂交条件。严格的杂交条件取决于序列并随杂交参数(例如盐浓度、有机物存在等)而变化。通常,“严格的”条件选择为在规定的离子强度和pH下低于特定核酸序列的Tm约5℃到约20℃。优选地,严格的条件为低于结合互补核酸的具体核酸的温度熔点约5℃到10℃。所述Tm是50%核酸(例如目标核酸)与完全配对探针杂交的温度(在规定的离子强度和pH下)。As used herein, the term "hybridization" refers to the pairing of complementary nucleic acids. Hybridization and hybridization strength (eg, the strength of binding between nucleic acids) are affected by a number of factors, such as the degree of complementarity between nucleic acids, the stringency of hybridization conditions used, the melting temperature (Tm) of the formed hybrid, and the GC of the nucleic acid. Content value. Although the invention is not limited to specific hybridization conditions, it is preferred to use stringent hybridization conditions. Stringent hybridization conditions depend on the sequence and vary with hybridization parameters (eg, salt concentration, presence of organics, etc.). Generally, "stringent" conditions are selected to be from about 5 ° C to about 20 ° C below the Tm of the particular nucleic acid sequence at the specified ionic strength and pH. Preferably, stringent conditions are from about 5 ° C to 10 ° C below the temperature melting point of the particular nucleic acid to which the complementary nucleic acid is bound. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the nucleic acid (eg, the target nucleic acid) hybridizes to the fully matched probe.
在本文中,“严格的条件”,例如可为50%甲酰胺,5×SSC(0.75M NaCl,0.075M柠檬酸钠),50mM磷酸钠(pH6.8),0.1%焦磷酸钠,5×Denhardt溶液、超声波处理的鲑鱼精子DNA(50mg/ml),0.1%SDS,以及10%硫酸葡聚糖在42℃下杂交,在42℃以0.2×SSC(氯化钠/柠檬酸钠)和在55℃以50%甲酰胺洗涤,然后在55℃以含有EDTA的0.1×SSC洗涤。例如,预计包含35%甲酰胺、5×SSC和0.1%(w/v)十二烷基硫酸钠(SDS)的缓冲液适合在适度非严格条件下在45℃杂交16-72小时。As used herein, "stringent conditions" may, for example, be 50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt solution, sonicated sperm DNA (50 mg/ml), 0.1% SDS, and 10% dextran sulfate hybridized at 42 ° C at 0.2 ° SSC (sodium chloride / sodium citrate) and at 42 ° C Wash at 50 °C with 50% formamide and then at 55 °C with 0.1 x SSC containing EDTA. For example, buffers containing 35% formamide, 5 x SSC, and 0.1% (w/v) sodium dodecyl sulfate (SDS) are expected to be suitable for hybridization at 45 ° C for 16-72 hours under moderately non-stringent conditions.
在本文中,术语“引物”系指寡核苷酸,无论天然存在经纯化、酶切后得到的或者经合成方法产生的,当置于诱导与核酸链互补的引物延伸产物的合成的条件下(例如在核苷酸和诱导试剂例如DNA聚合酶存在下,并在合适的温度和pH下),能够作为合成的起点。所述引物优选为具有最大扩增效率的单链。优选地,所述引物为寡脱氧核苷酸。所述引物必须足够长以在所述诱导试剂存在下引发延伸产物的合成。所述引物的确切长度取决于很多因素,包括温度、引物来源和所使用方法。 As used herein, the term "primer" refers to an oligonucleotide, whether purified, cleaved or produced synthetically, under conditions which induce the synthesis of a primer extension product complementary to a nucleic acid strand. (for example in the presence of nucleotides and inducing agents such as DNA polymerase, and at suitable temperatures and pH), can serve as a starting point for synthesis. The primer is preferably a single strand having the greatest amplification efficiency. Preferably, the primer is an oligodeoxynucleotide. The primer must be sufficiently long to initiate synthesis of the extension product in the presence of the inducing agent. The exact length of the primer depends on many factors including temperature, source of the primer and the method used.
在本文中,术语“探针序列”系指寡核苷酸(例如核苷酸序列),无论天然存在经纯化、酶切后得到的或者经合成、重组或PCR扩增产生的,能够与另一目标寡核苷酸例如目标核酸序列的至少一部分杂交。探针可为单链或双链。探针可用于特定基因序列的检测、鉴别和分离。As used herein, the term "probe sequence" refers to an oligonucleotide (eg, a nucleotide sequence), whether produced in nature, purified, cleaved, or produced by synthetic, recombinant, or PCR amplification, capable of A target oligonucleotide, such as at least a portion of a target nucleic acid sequence, hybridizes. The probe can be single stranded or double stranded. Probes can be used for the detection, identification and isolation of specific gene sequences.
在本文中,术语“目标核酸分子”是指来自目标基因组区域的分子或序列。预选的探针确定了目标核酸分子的范围。因此,所述“目标”试图与其它核酸序列区分出来。一个“片段”定义为所述目标序列中的一个核酸区域,如作为核酸序列的一个“片段”或一“部分”。As used herein, the term "target nucleic acid molecule" refers to a molecule or sequence from a region of a target genomic region. The preselected probe determines the extent of the target nucleic acid molecule. Thus, the "target" attempts to distinguish it from other nucleic acid sequences. A "fragment" is defined as a nucleic acid region in the sequence of interest, such as a "fragment" or a "portion" of a nucleic acid sequence.
在本文中,术语“分离”当用于涉及核酸时,如用于“分离核酸”时,系指核酸序列从其天然来源通常结合的至少一种其他组分或污染物中被鉴别并分离出来。分离的核酸以不同于其天然存在的形式存在。相反,未分离的核酸例如DNA和RNA的核酸以其天然存在的状态存在。所述分离的核酸、寡核苷酸或多核苷酸可以单链形式或双链形式存在。As used herein, the term "isolated" when used in reference to a nucleic acid, such as when used in "isolated nucleic acid", refers to the identification and isolation of a nucleic acid sequence from at least one other component or contaminant to which it is normally associated. . An isolated nucleic acid exists in a form different from its natural presence. In contrast, nucleic acids of unseparated nucleic acids such as DNA and RNA exist in their naturally occurring state. The isolated nucleic acid, oligonucleotide or polynucleotide may exist in a single stranded form or in a double stranded form.
在本发明的实施方案中,用于在本文所述的方法、引物组和试剂盒中使用的引物包含连接化合物,例如结合部分。结合部分包含任何连接或引入用于随后捕获核酸扩增产物的扩增引物的5’端的部分。结合部分为引入引物序列5’端的任何序列,例如可捕获的6组氨酸(6HIS)序列。例如,包含6HIS序列的引物可被镍捕获,例如在镍包被或包含镍包被珠子、颗粒等的管子、微孔、或纯化柱中,其中所述珠子包装入柱子中,样品装入并通过柱子以捕获复杂度降低的复合物(例如,和随后的目标洗脱)。用于本发明的实施方案的另一种结合部分的实例包括半抗原,例如地高辛,例如其连接到扩增引物的5’端。地高辛可使用地高辛抗体捕获,例如包被或包含抗地高辛抗体的基质。In an embodiment of the invention, the primers for use in the methods, primer sets and kits described herein comprise a linking compound, such as a binding moiety. The binding moiety comprises any portion that joins or introduces the 5&apos; end of the amplification primer for subsequent capture of the nucleic acid amplification product. The binding moiety is any sequence that introduces the 5' end of the primer sequence, such as a captureable 6 histidine (6HIS) sequence. For example, a primer comprising a 6HIS sequence can be captured by nickel, such as in a nickel coated or tube containing nickel coated beads, granules, or the like, in a microwell, or in a purification column, wherein the beads are packed into a column and the sample is loaded and The column is passed through to capture complexes with reduced complexity (eg, and subsequent target elution). An example of another binding moiety for use in embodiments of the invention includes a hapten, such as digoxin, for example, which is ligated to the 5&apos; end of the amplification primer. Digoxin can be captured using a digoxin antibody, such as a substrate coated or containing an anti-digoxigenin antibody.
在某些实施方案中,所述结合部分为生物素,用链霉亲和素包被所述捕获基质,例如珠子如顺磁颗粒,用于从非特异性杂交目标核酸中分离所述扩增产物。例如,当生物素为结合部分时,链霉亲和素(SA)包被的基质,例如SA包被的珠子(例如磁珠/颗粒)用于捕获所述生物素标记的扩增产物。洗涤所述SA结合的复合物,所杂交的目标核酸从所述扩增产物洗脱进行测序。In certain embodiments, the binding moiety is biotin, and the capture matrix, such as beads, such as paramagnetic particles, is coated with streptavidin for isolating the amplification product from a non-specific hybridization target nucleic acid. . For example, when biotin is a binding moiety, a streptavidin (SA) coated matrix, such as SA coated beads (eg, magnetic beads/particles), is used to capture the biotinylated amplification product. The SA-bound complex is washed, and the hybridized target nucleic acid is eluted from the amplification product for sequencing.
可使用无掩膜阵列合成技术在固体载体上并行提供序列中与所述基因组至少一个区域对应的引物序列。替代性地,引物序列可使用标准DNA合成仪连续获得并应用到所述固体载体,或可从有机体获得并固定于所述固体载体。扩增之后,非扩增产物的核酸通过洗涤从所述载体结合的扩增产物中分离。例如在热水中或在包含例如TRIS缓冲液和/或EDTA的核酸洗脱缓冲液中从所述固体载体洗脱,以产生所述目标核酸分子富集的洗脱物。Primer sequences corresponding to at least one region of the genome in the sequence can be provided in parallel on a solid support using a maskless array synthesis technique. Alternatively, the primer sequence can be obtained continuously and applied to the solid support using a standard DNA synthesizer, or can be obtained from an organism and fixed to the solid support. After amplification, the non-amplified product nucleic acid is separated from the vector-bound amplification product by washing. Elution from the solid support, for example, in hot water or in a nucleic acid elution buffer comprising, for example, TRIS buffer and/or EDTA, to produce an eluate enriched for the target nucleic acid molecule.
或者,用于目标分子的引物序列可如上所述在固体载体上合成,作为引物序列集合从所述固体载体释放并扩增。所释放的引物集合可共价或非共价固定于载体,例如玻璃、金属、陶瓷、或聚合珠子或其它固体载体。所述引物可设计为从所述固体载体方便释放,例如在最接近载体的核酸类似物末端或其附近提供酸或碱不稳定的核酸序列,其分别在低或高pH条件下释放所述引物。本领域已知多种可剪切的连接化合物。所述载体可以,例如,以具有液体进口和出口的圆柱提供。本领域熟悉将核酸固定到载体的方法,例如通过将生物素标记的核苷酸结合到所述引物中,并使用链霉亲和素包被所述载体,由此所述包被的载体非共价吸引并固定所述集合中的所述引物。所述样品在杂交条件下通过所述包含引物的载体,由此与所述固定载体杂交的扩增后的目标核酸分子可洗脱,用于之后的分析或其它用途。 Alternatively, primer sequences for the target molecule can be synthesized on a solid support as described above, released and amplified from the solid support as a set of primer sequences. The set of primers released can be fixed to the carrier, such as glass, metal, ceramic, or polymeric beads or other solid support, either covalently or non-covalently. The primer may be designed to be conveniently released from the solid support, for example to provide an acid or base labile nucleic acid sequence at or near the end of the nucleic acid analog closest to the vector, which releases the primer under low or high pH conditions, respectively. . A variety of cleavable linking compounds are known in the art. The carrier can be provided, for example, in a cylinder having a liquid inlet and an outlet. A method of immobilizing a nucleic acid to a vector is known in the art, for example by binding a biotin-labeled nucleotide to the primer and coating the vector with streptavidin, whereby the coated vector is not The primers in the collection are covalently attracted and immobilized. The sample is passed through the primer-containing vector under hybridization conditions, whereby the amplified target nucleic acid molecule that hybridizes to the immobilized vector can be eluted for later analysis or other use.
实施例1:本发明的方法的示例性步骤Example 1: Exemplary Steps of the Method of the Invention
扩增样品中多个目标DNA序列的方法包括以下步骤:A method of amplifying a plurality of target DNA sequences in a sample comprises the steps of:
1)提供包含目标DNA序列和非目标序列的样本、本发明的特异性引物组和与所述特异性引物5’端通用序列互补的通用引物对;1) providing a sample comprising a target DNA sequence and a non-target sequence, a specific primer set of the present invention, and a universal primer pair complementary to a universal sequence at the 5' end of the specific primer;
2)以所述引物组中的特异性引物进行第一轮多重PCR反应,扩增所述样本中的目标DNA序列,所述PCR反应的退火温度按照从高到低的阶梯进行,在本实施例中一个退火过程中使用等差的3个温度(例如60℃、59℃、58℃)进行退火,并且扩增循环数小于10,2) performing a first round of multiplex PCR reaction with a specific primer in the primer set to amplify a target DNA sequence in the sample, and the annealing temperature of the PCR reaction is performed in a step from high to low, in the present embodiment In an example, an annealing process is performed using three equal temperatures (for example, 60 ° C, 59 ° C, and 58 ° C), and the number of amplification cycles is less than 10,
第一轮多重PCR反应扩增多个目标区域DNA序列以如下方式进行:在该轮PCR反应中,多对特异性引物放在同一反应体系中同时扩增多个目标区域序列,其中所有上游引物的3’端含有与目标区域序列互补的特异性序列,5’端含有通用序列1(GSP1:CTTTCCCTACACGACgctcttccgatct(SEQ ID NO.1));所有下游引物的3’端含有与目标区域序列互补的特异性序列,5’端含有通用序列2(GSP2:GGAGTTCAGACGTGTgctcttccgatct(SEQ ID NO.2)),上下游引物的3’端的碱基处均具有3或4个基团修饰,增加扩增的特异性;在本轮多重PCR反应中,反应体系的成分如下:ddH2O、PCR反应缓冲物、底物(dNTP)、多重PCR引物混合物、样本基因组DNA或cDNA和高保真聚合酶。The first round of multiplex PCR reaction amplifies a plurality of target region DNA sequences in such a manner that in the round of PCR reaction, multiple pairs of specific primers are simultaneously placed in the same reaction system to simultaneously amplify a plurality of target region sequences, wherein all upstream primers The 3' end contains a specific sequence complementary to the sequence of the target region, and the 5' end contains the universal sequence 1 (GSP1: CTTTCCCTACACGACgctcttccgatct (SEQ ID NO. 1)); the 3' end of all downstream primers contains a specific sequence complementary to the target region sequence. Sequence, the 5' end contains the universal sequence 2 (GSP2: GGAGTTCAGACGTGTgctcttccgatct (SEQ ID NO. 2)), and the base of the 3' end of the upstream and downstream primers has 3 or 4 group modifications, which increases the specificity of amplification; In this round of multiplex PCR reactions, the components of the reaction system are as follows: ddH 2 O, PCR reaction buffer, substrate (dNTP), multiplex PCR primer mix, sample genomic DNA or cDNA, and high fidelity polymerase.
本轮多重PCR反应步骤包括如下三步:第一步预变性:95℃维持3.5min;第二步扩增:变性步骤在96-98℃下维持20s、梯度退火在66℃下维持1min、在65℃下维持1min和64℃下维持1min,延伸步骤在72℃下维持30s,第二步扩增根据模板的投入量扩增10-22个循环,退火时间根据扩增目标序列的个数而相应改变;第三步扩增在72℃下延伸5min,在本轮多重PCR反应中,采用梯度退火,使每对引物均能与模板高效互补结合,提高扩增效率,本实施例中如在一个退火过程中使用等差的3个温度(例如60℃、59℃、58℃)进行退火。本轮多重PCR反应结束后,得到多个目标区域序列的双链PCR产物。所述双链PCR产物的5’端包括通用序列1(GSP1),3’端包括通用序列2(GSP2)。The current multiplex PCR reaction step includes the following three steps: first step pre-denaturation: 95 ° C for 3.5 min; second step amplification: denaturation step is maintained at 96-98 ° C for 20 s, gradient annealing is maintained at 66 ° C for 1 min, at Maintaining at 65 °C for 1 min and at 64 °C for 1 min, the extension step is maintained at 72 °C for 30 s, and the second step of amplification is amplified for 10-22 cycles according to the amount of template input. The annealing time is based on the number of amplification target sequences. Corresponding changes; the third step of amplification is extended at 72 ° C for 5 min. In this round of multiplex PCR reaction, gradient annealing is used to enable each pair of primers to efficiently and complementarily bind to the template, thereby improving the amplification efficiency, as in the present example. Annealing is performed using an equal temperature of three temperatures (e.g., 60 ° C, 59 ° C, 58 ° C) in one annealing process. After the current round of multiplex PCR reaction, a double-stranded PCR product of multiple target region sequences was obtained. The 5' end of the double stranded PCR product comprises universal sequence 1 (GSP1) and the 3' end includes universal sequence 2 (GSP2).
采用核酸酶,如Exonuclease VII、Exonuclease I、Mung Bean Nuclease、T7 Endonuclease I、Nuclease BAL-31、核酸酶P1、核酸酶S1、绿豆核酸酶等对所述双链PCR产物进行消化。The double-stranded PCR product is digested with a nuclease such as Exonuclease VII, Exonuclease I, Mung Bean Nuclease, T7 Endonuclease I, Nuclease BAL-31, Nuclease P1, Nuclease S1, Mung Bean Nuclease, and the like.
采用Agencourt AMPure磁珠对所述第一轮多重PCR反应的产物进行片段筛选和纯化,去除目标区域外的大片段、基因组DNA、引物二聚体、引物和其他反应成分,得到包含目标区域序列的双链PCR产物;The products of the first round of multiplex PCR reaction were screened and purified by Agencourt AMPure magnetic beads to remove large fragments, genomic DNA, primer dimers, primers and other reaction components outside the target region, and the sequence containing the target region was obtained. Double-stranded PCR product;
3)以通用引物进行第二轮PCR扩增反应,对第一轮多重PCR反应中得到的双链PCR扩增产物进行再次扩增,进一步富集双链PCR所述扩增产物;3) performing a second round of PCR amplification reaction with a universal primer, re-amplifying the double-stranded PCR amplification product obtained in the first round of multiplex PCR reaction, and further enriching the amplification product of double-stranded PCR;
在所述第二轮PCR扩增反应的反应体系中,以第一轮多重PCR反应中纯化得到的包括目标区域序列的双链PCR产物为模板,以通用引物FGSP1(AATGATACGGCGACCACCGAGATCTacactctttccctacacgac,SEQ ID NO.3)和RGSP2(CAAGCAGAAGACGGCATACGAGAT******gtgactggagttcagacgtgt,SEQ ID NO.4)进行PCR扩增,其中FGSP1的3’端为通用序列1(GSP1),5端为通用序列3(AATGATACGGCGACCACCGAGATCT,SEQ ID NO.5);RGSP2的3’端为通用序列2(GSP2),5’端为通用序列4(CAAGCAGAAGACGGCATACGAGAT,SEQ ID NO.6),中间的6个“*”代表Index序列,用于区分不同样本。Index序列的设计原则是长度为6,两两之间 确保至少两个碱基的差异。在所述第二轮PCR反应后,扩增得到的目标区域序列产物的5’端为通用引物序列FGSP1,3’端为通用引物序列RGSP2。In the reaction system of the second round of PCR amplification reaction, the double-stranded PCR product including the target region sequence purified in the first round of multiplex PCR reaction is used as a template, and the universal primer FGSP1 (AATGATACGGCGACCACCGAGATCTacactctttccctacacgac, SEQ ID NO. 3 PCR amplification with RGSP2 (CAAGCAGAAGACGGCATACGAGAT******gtgactggagttcagacgtgt, SEQ ID NO. 4), wherein the 3' end of FGSP1 is universal sequence 1 (GSP1) and the 5th end is universal sequence 3 (AATGATACGGCGACCACCGAGATCT, SEQ ID NO .5); the 3' end of RGSP2 is universal sequence 2 (GSP2), the 5' end is universal sequence 4 (CAAGCAGAAGACGGCATACGAGAT, SEQ ID NO. 6), and the middle 6 "*" represent Index sequence, which is used to distinguish different samples. . The design principle of the Index sequence is length 6, between two Make sure there are at least two base differences. After the second round of PCR reaction, the 5' end of the amplified target region sequence product is the universal primer sequence FGSP1, and the 3' end is the universal primer sequence RGSP2.
采用Agencourt AMPure磁珠对所述第二轮PCR反应的PCR产物进行纯化,去除其他成分,得到含通用引物序列FGSP1和通用引物序列RGSP2的目标区域序列产物;The PCR product of the second round of PCR reaction was purified by Agencourt AMPure magnetic beads to remove other components, and the target region sequence product containing the universal primer sequence FGSP1 and the universal primer sequence RGSP2 was obtained;
4)回收上一步富集的目标区域序列产物,进行上机测序。4) Recovering the sequence product of the target region enriched in the previous step and performing sequencing on the machine.
实施例2Example 2
发明人随机选择人基因组上外显子和内含子上1000个位点用于测试本发明的方法,实验步骤按照实施例1的方法进行。The inventors randomly selected 1000 sites on the exon and intron of the human genome for testing the method of the present invention, and the experimental procedure was carried out in accordance with the method of Example 1.
表1:随机选择的1000个位点的染色体分布Table 1: Chromosome distribution of randomly selected 1000 loci
染色体chromosome 个数Number 染色体chromosome 个数Number
chr1Chr1 7575 chr12Chr12 6565
chr2Chr2 4040 chr13Chr13 2020
chr3Chr3 6060 chr14Chr14 1010
chr4Chr4 105105 chr15Chr15 2525
chr5Chr5 5555 chr16Chr16 3535
chr6Chr6 6565 chr17Chr17 4040
chr7Chr7 3030 chr18Chr18 1010
chr8Chr8 6565 chr19Chr19 2525
chr9Chr9 3030 chr20Chr20 3535
chr10Chr10 6565 chr21Chr21 1515
chr11Chr11 100100 chr22Chr22 3030
在本实施例中,基于SantaLucia 2007热力学参数表的最邻近法计算Tm。以下简要说明引物设计情况。In the present embodiment, Tm is calculated based on the nearest neighbor method of the SantaLucia 2007 thermodynamic parameter table. The following is a brief description of primer design.
(1)扩增产物长度在100-300bp的范围内随机选取,PCR引物分成多组,每组中2℃<Tm标准差≤5℃、1℃<Tm标准差≤2℃、Tm标准差≤1℃;另外设对照组,未考虑Tm标准差,组中存在Tm标准差>5℃的情况;(1) The length of the amplified product is randomly selected within the range of 100-300 bp, and the PCR primers are divided into multiple groups. In each group, 2 °C < Tm standard deviation ≤ 5 ° C, 1 ° C < Tm standard deviation ≤ 2 ° C, Tm standard deviation ≤ 1 ° C; another control group, did not consider the Tm standard deviation, there is a Tm standard deviation > 5 ° C in the group;
(2)选择部分引物(其他引物可以作为对照),在其序列的3’端-1、-2、-3处碱基、核糖或磷酸二酯键上增加修饰,脱氧次黄嘌呤(dI)、脱氧尿嘧啶(dU)、5-Methyl dC、2′-O-Me-dC、磷酸基团、硫代基团、地高辛、生物素、AminolinkerC7、BHQ1、BHQ2、Dabcyl、JOE、ROX、FAM、TAMRA、烷基基团、氟代基团、氨基基团和Thiol-C3S-S;(2) Select some primers (other primers can be used as a control), add modification on the base, ribose or phosphodiester bond at the 3' end-1, -2, -3 of the sequence, deoxygenated hypoxanthine (dI) , deoxyuracil (dU), 5-Methyl dC, 2'-O-Me-dC, phosphate group, thio group, digoxin, biotin, Aminolinker C7, BHQ1, BHQ2, Dabcyl, JOE, ROX, FAM, TAMRA, alkyl group, fluoro group, amino group and Thiol-C3S-S;
(3)当引物序列的3’端的5个碱基的GC含量大于50%时,选取部分引物(其他引物作为对照)在其序列的3’端-4处碱基增加连接到脱氧核糖胞嘧啶的5’的甲基基团修饰(5-Methyl dC)、3’端-4处碱基增加连接到磷酸二酯键上硫代修饰基团、3’端-4处增加连接到脱氧核糖2’的氟代修饰基团、3’端-4处增加连接到磷酸二酯键上的脱氧次黄嘌呤(dI)修饰基团或3’端-4处增加连接到磷酸二酯键上的脱氧尿嘧啶(dU)修饰基团。(3) When the GC content of the 5 bases at the 3' end of the primer sequence is greater than 50%, select a part of the primers (other primers as a control) to increase the base at the 3' end-4 of the sequence to the deoxyribose cytosine. 5' methyl group modification (5-Methyl dC), 3' end-4 base attachment increased to phosphodiester bond thio-modified group, 3' end-4 increased attachment to deoxyribose 2 'The fluoro-modified group, 3' end-4 increases the deoxygenated hypoxanthine (dI) modifying group attached to the phosphodiester bond or the 3' end-4 increases the deoxygenation attached to the phosphodiester bond A uracil (dU) modifying group.
其中,以上这些位置的基团修饰均是由引物合成公司通过有机化学方式合成得到。Among them, the group modification at these positions is synthesized by a primer synthesis company by organic chemical means.
另外,对所有引物还进行如下分组:In addition, all primers are also grouped as follows:
(1)Tm(与目标产物)-Tm(与非目标产物)≤5℃、5℃<Tm(与目标产 物)-Tm(与非目标产物)≤10℃;Tm(与目标产物)-Tm(与非目标产物)>10℃;(1) Tm (with target product) - Tm (with non-target product) ≤ 5 ° C, 5 ° C < Tm (with target production -Tm (with non-target product) ≤ 10 ° C; Tm (with target product) - Tm (with non-target product) > 10 ° C;
(2)Tm(与目标产物)-Tm(二聚体)≤5℃、5℃<Tm(与目标产物)-Tm(二聚体)≤10℃、Tm(与目标产物)-Tm(二聚体)>10℃;(2) Tm (with target product) - Tm (dimer) ≤ 5 ° C, 5 ° C < Tm (with target product) - Tm (dimer) ≤ 10 ° C, Tm (with target product) - Tm (two Polymer) > 10 ° C;
(3)Tm(与目标产物)-Tm(发卡结构)≤5℃、5℃<Tm(与目标产物)-Tm(发卡结构)≤10℃;Tm(与目标产物)-Tm(发卡结构)>10℃。(3) Tm (with target product) - Tm (hairpin structure) ≤ 5 ° C, 5 ° C < Tm (with target product) - Tm (hairpin structure) ≤ 10 ° C; Tm (with target product) - Tm (hairpin structure) >10 °C.
检测结果,以捕获效率和100×覆盖度下的覆盖率来衡量检测结果,检测结果在后文以图表的形式给出,从这些数据图表中可以看出:The test results are measured by the capture efficiency and the coverage under 100× coverage. The test results are given in the form of graphs, which can be seen from these data charts:
1)Tm标准差越小,捕获效率和覆盖率越高,但为了达到良好的捕获效率和100×覆盖率而又保证引物的设计空间,优选Tm标准差≤2℃;1) The smaller the Tm standard deviation, the higher the capture efficiency and coverage, but in order to achieve good capture efficiency and 100× coverage while ensuring the design space of the primer, the Tm standard deviation is preferably ≤ 2 ° C;
2)大部分3’端引物修饰的捕获效率和覆盖率高于不修饰的对照组;2) The capture efficiency and coverage of most 3' primers were higher than those of the unmodified control group;
3)当引物序列的3’端的5个碱基的GC含量大于50%时,引物序列的3’端-4处碱基硫代修饰的捕获效率和覆盖率高于不修饰的对照组;3) When the GC content of the 5 bases at the 3' end of the primer sequence is greater than 50%, the capture efficiency and coverage of the base thio modification at the 3' end-4 of the primer sequence is higher than that of the unmodified control group;
4)Tm(目标产物)-Tm(非目标产物)的差值越大,捕获效率和覆盖率越高,Tm(目标产物)-Tm(非目标产物)>10℃才可达到良好的捕获效率和100×覆盖率;4) The larger the difference between Tm (target product) and Tm (non-target product), the higher the capture efficiency and coverage, and the good capture efficiency can be achieved by Tm (target product)-Tm (non-target product) >10 °C. And 100× coverage;
5)Tm(目标产物)-Tm(二聚体)的差值越大,捕获效率和覆盖率越高,Tm(目标产物)-Tm(二聚体)>10℃才可达到良好的捕获效率和100×覆盖率;5) The larger the difference between Tm (target product) and Tm (dimer), the higher the capture efficiency and coverage, and the good capture efficiency can be achieved by Tm (target product)-Tm (dimer) >10 °C. And 100× coverage;
6)Tm(目标产物)-Tm(发卡结构)的差值越大,捕获效率和覆盖率越高,而且Tm(目标产物)-Tm(发卡结构)>5℃即可达到良好的捕获效率和100×覆盖率,同时保证比>10℃较多的候选引物可以选择。6) The larger the difference between Tm (target product) and Tm (issuing structure), the higher the capture efficiency and coverage, and the good capture efficiency can be achieved by Tm (target product)-Tm (issue structure) > 5 °C. 100× coverage, while ensuring more than >10 °C candidate primers can be selected.
优选地,Tm(目标产物)-Tm(非目标产物)>10℃,Tm(目标产物)-Tm(二聚体)>10℃,Tm(目标产物)-Tm(发卡结构)>5℃。Preferably, Tm (target product)-Tm (non-target product) > 10 ° C, Tm (target product) - Tm (dimer) > 10 ° C, Tm (target product) - Tm (hairpin structure) > 5 ° C.
其中,对照指的是不做任何修饰的正常引物,用于与相应试验组数据进行比较。Among them, the control refers to a normal primer without any modification, and is used for comparison with the data of the corresponding test group.
结果如下:The results are as follows:
表2.组中存在Tm标准差对检测结果的影响Table 2. Effect of Tm standard deviation on the test results in the group
引物特点Primer characteristics 捕获效率Capture efficiency 100x覆盖率100x coverage
Tm标准差≤1℃Tm standard deviation ≤ 1 ° C 80.01%80.01% 84.37%84.37%
1℃<Tm标准差≤2℃1 °C <Tm standard deviation ≤ 2 ° C 76.34%76.34% 78.80%78.80%
2℃<Tm标准差≤5℃2°C<Tm standard deviation ≤5°C 71.24%71.24% 72.02%72.02%
Tm标准差>5℃Tm standard deviation > 5 ° C 60.27%60.27% 64.99%64.99%
表3.引物序列的3’端-1、-2、-3处碱基修饰对检测结果的影响Table 3. Effect of base modification at the 3' end-1, -2, -3 of the primer sequence on the detection results
引物特点Primer characteristics 捕获效率Capture efficiency 100×覆盖率100× coverage
AminolinkerC7AminolinkerC7 93.2593.25 100%100%
脱氧尿嘧啶(dU)Deoxyuracil (dU) 91.32%91.32% 100%100%
5-Methyl dC5-Methyl dC 87.23%87.23% 95.2%95.2%
2′-O-Me-dC2'-O-Me-dC 79.98%79.98% 92.7%92.7%
硫代基团Thio group 83.62%83.62% 92.71%92.71%
磷酸基团Phosphate group 85.66%85.66% 93.37%93.37%
氨基基团Amino group 93.41%93.41% 100%100%
脱氧次黄嘌呤(dI)Deoxygenated hypoxanthine (dI) 95.87%95.87% 100%100%
BHQ1BHQ1 94.31%94.31% 100%100%
生物素Biotin 70.21%70.21% 74.94%74.94%
地高辛Digoxin 60.07%60.07% 64.18%64.18%
BHQ2BHQ2 95.26%95.26% 100%100%
DabcylDabcyl 93.63%93.63% 99.11%99.11%
FAMFAM 88.34%88.34% 93.34%93.34%
JOEJOE 86.73%86.73% 89.14%89.14%
ROXROX 82.62%82.62% 88.25%88.25%
TAMRATAMRA 83.41%83.41% 87.27%87.27%
烷基基团Alkyl group 96.66%96.66% 100%100%
氟代基团Fluoro group 85.26%85.26% 87.31%87.31%
Thiol-C3S-SThiol-C3S-S 92.21%92.21% 100%100%
对照(正常引物)Control (normal primer) 78.05%78.05% 83.36%83.36%
表4.当引物序列的3’端的5个碱基的GC含量大于50%时,引物序列的3’端-4处碱基硫代修饰对检测结果的影响Table 4. Effect of base thio modification at the 3' end-4 of the primer sequence on the detection results when the GC content of the 5 bases at the 3' end of the primer sequence is greater than 50%
引物特点Primer characteristics 捕获效率Capture efficiency 100×覆盖率100× coverage
3’端-4处氟代基团Fluorinated group at the 3' end-4 93.10%93.10% 100%100%
3’端-4处甲基修饰3' end-4 methyl modification 89.23%89.23% 98.87%98.87%
3’端-4处碱基硫代3' end-4 base thio 82.77%82.77% 91.15%91.15%
3’端-4脱氧尿嘧啶(dU)3' end-4 deoxyuracil (dU) 90.92%90.92% 99.57%99.57%
3’端-4脱氧次黄嘌呤(dI)3' end-4 deoxygenated hypoxanthine (dI) 94.03%94.03% 99.91%99.91%
对照(正常引物)Control (normal primer) 79.21%79.21% 84.09%84.09%
表5.引物特异性对检测结果的影响Table 5. Effect of primer specificity on test results
引物特点Primer characteristics 捕获效率Capture efficiency 100×覆盖率100× coverage
Tm(目标产物)-Tm(非目标产物)≤5℃Tm (target product)-Tm (non-target product) ≤ 5 ° C 62.35%62.35% 67.03%67.03%
5℃<Tm(目标产物)-Tm(非目标产物)≤10℃5 ° C < Tm (target product) - Tm (non-target product) ≤ 10 ° C 70.01%70.01% 73.45%73.45%
Tm(目标产物)-Tm(非目标产物)>10℃Tm (target product) - Tm (non-target product) > 10 ° C 78.84%78.84% 82.47%82.47%
表6.引物二聚体对检测结果的影响Table 6. Effect of primer dimer on test results
引物特点Primer characteristics 捕获效率Capture efficiency 100×覆盖率100× coverage
Tm(目标产物)-Tm(二聚体)≤5℃Tm (target product)-Tm (dimer) ≤ 5 ° C 45.21%45.21% 50.74%50.74%
5℃<Tm(目标产物)-Tm(二聚体)≤10℃5 ° C < Tm (target product) - Tm (dimer) ≤ 10 ° C 61.27%61.27% 68.17%68.17%
Tm(目标产物)-Tm(二聚体)>10℃Tm (target product)-Tm (dimer) > 10 ° C 78.64%78.64% 83.55%83.55%
表7.引物发夹结构对检测结果的影响Table 7. Effect of primer hairpin structure on test results
引物特点Primer characteristics 捕获效率Capture efficiency 100×覆盖率100× coverage
Tm(目标产物)-Tm(发卡结构)≤5℃Tm (target product)-Tm (issuing structure) ≤ 5 ° C 64.28%64.28% 68.36%68.36%
5℃<Tm(目标产物)-Tm(发卡结构)≤10℃5 ° C < Tm (target product) - Tm (hairpin structure) ≤ 10 ° C 75.09%75.09% 80.94%80.94%
Tm(目标产物)-Tm(发卡结构)>10℃Tm (target product) - Tm (issuing structure) > 10 ° C 79.98%79.98% 83.49%83.49%

Claims (10)

  1. 用于扩增样品中多个目标DNA序列的引物组,所述引物组包括针对每个目标DNA序列的多对上下游特异性引物,其中:A primer set for amplifying a plurality of target DNA sequences in a sample, the primer set comprising a plurality of pairs of upstream and downstream specific primers for each target DNA sequence, wherein:
    a)每对所述上下游特异性引物包括针对目标序列的特异性序列,所有特异性序列之间满足如下条件:(1)每个所述特异性序列与目标序列之外的序列不发生扩增;(2)所述特异性序列之间不形成二聚体;(3)所述特异性序列不形成发卡结构;a) Each pair of the upstream and downstream specific primers includes a specific sequence for the target sequence, and the following conditions are satisfied between all the specific sequences: (1) each of the specific sequence does not expand beyond the target sequence (2) no dimer is formed between the specific sequences; (3) the specific sequence does not form a hairpin structure;
    b)在所述特异性序列的5’端连接有与基因组不同源的通用序列;b) a universal sequence linked to a different source of the genome at the 5' end of the specific sequence;
    c)在所述特异性序列的3’端的碱基处有增加空间位阻的修饰,所述修饰不阻断其与所述特异性序列完全匹配的模板的结合与延伸,但基本阻断其与不完全匹配的模板的结合与延伸。c) a modification that increases steric hindrance at the base 3' end of the specific sequence, the modification does not block the binding and extension of the template that exactly matches the specific sequence, but substantially blocks it Combination and extension of templates that do not exactly match.
  2. 权利要求1的引物组,所述特异性序列之间满足如下条件:(1)所述特异性序列与目标区域的Tm–与非目标区域的Tm≥5℃,优选≥10℃;(2)所述特异性序列与目标区域的Tm–与其他特异性序列形成二聚体的Tm≥5℃,优选≥10℃;(3)所述特异性序列与目标区域的Tm–形成发卡结构的Tm≥5℃,优选≥10℃,优选的Tm值基于SantaLucia 2007热力学参数表的最邻近法计算。The primer set of claim 1, wherein the specific sequence satisfies the following conditions: (1) the specific sequence and the Tm of the target region - and the non-target region have a Tm ≥ 5 ° C, preferably ≥ 10 ° C; (2) The specific sequence and the Tm of the target region - forming a dimer with other specific sequences with a Tm ≥ 5 ° C, preferably ≥ 10 ° C; (3) the specific sequence and the Tm of the target region - forming a Tk of the hairpin structure ≥ 5 ° C, preferably ≥ 10 ° C, the preferred Tm value is calculated based on the nearest neighbor method of the Santa Lucia 2007 thermodynamic parameter table.
  3. 权利要求1的引物组,所述增加空间位阻的修饰选自:脱氧次黄嘌呤(dI)、脱氧尿嘧啶(dU)、5-Methyl dC、2'-O-Me-dC、磷酸基团、硫代基团、地高辛、生物素、AminolinkerC7、BHQ1、BHQ2、Dabcyl、JOE、ROX、FAM、TAMRA、烷基基团、氟代基团、氨基基团和Thiol-C3S-S。The primer set of claim 1, wherein the modification of increasing steric hindrance is selected from the group consisting of: deoxyhypoxanthine (dI), deoxyuracil (dU), 5-Methyl dC, 2'-O-Me-dC, and a phosphate group. , thio group, digoxin, biotin, Aminolinker C7, BHQ1, BHQ2, Dabcyl, JOE, ROX, FAM, TAMRA, alkyl group, fluoro group, amino group and Thiol-C3S-S.
  4. 权利要求1或3的引物组,所述3’端的碱基处增加空间位阻的修饰包括3’端-1、-2、-3位碱基、核糖或磷酸二酯键上的修饰。The primer set of claim 1 or 3, wherein the modification of the steric hindrance at the base of the 3' end comprises a modification at the 3' end-1, -2, -3 base, ribose or phosphodiester bond.
  5. 权利要求1-4任一项的引物组,所述特异性序列的3’端的5个碱基的GC含量大于50%,在所述特异性序列的3’端-4位碱基处有增加空间位阻的修饰,例如氟代修饰、甲基修饰、脱氧尿嘧啶(dU)、硫代修饰、脱氧次黄嘌呤(dI)、氨基修饰等。The primer set according to any one of claims 1 to 4, wherein the 5 base of the specific sequence has a GC content of more than 50%, and the base sequence has an increase at the 3' end to the base of the specific sequence. Modification of steric hindrance, such as fluoro modification, methyl modification, deoxyuracil (dU), thio modification, deoxyhypoxanthine (dI), amino modification, and the like.
  6. 权利要求1-5任一项的引物组,所述特异性引物序列有一致的热力学参数,优选Tm标准差≤5℃;更优选Tm标准差≤2℃;最优选Tm标准差≤1℃。The primer set according to any one of claims 1 to 5, wherein the specific primer sequence has a uniform thermodynamic parameter, preferably a Tm standard deviation ≤ 5 ° C; more preferably a Tm standard deviation ≤ 2 ° C; most preferably a Tm standard deviation ≤ 1 ° C.
  7. 一种扩增样品中多个目标DNA序列的方法,包括A method of amplifying a plurality of target DNA sequences in a sample, including
    a)提供包含目标DNA序列和非目标序列的样本、上述权利要求任一项的引物组和与所述特异性引物5’端通用序列互补的通用引物对;a) providing a sample comprising a DNA sequence of interest and a non-target sequence, a primer set according to any of the preceding claims, and a universal primer pair complementary to a universal sequence at the 5' end of said specific primer;
    b)以所述引物组中的特异性引物进行多重PCR反应,扩增所述样本中的多个目标DNA序列,所述多重PCR反应的退火温度按照从高到低的阶梯进行,例如在一个退火过程中使用等差的多个温度进行退火;b) performing a multiplex PCR reaction with a specific primer in the primer set to amplify a plurality of target DNA sequences in the sample, and the annealing temperature of the multiplex PCR reaction is performed in steps from high to low, for example, in one Annealing using multiple temperatures of equal difference during annealing;
    c)以所述通用引物对再次扩增步骤b)中的扩增产物,进一步富集所述扩增产物。c) amplifying the amplification product in step b) with the universal primer pair to further enrich the amplification product.
  8. 权利要7的方法,所述方法还包括步骤b’):回收步骤b)中富集的扩增产物。The method of claim 7, further comprising the step b') of recovering the amplified product enriched in step b).
  9. 权利要求8的方法,所述回收是通过采用磁珠对第一轮PCR反应产物进行片段筛选和纯化,去除目标区域外的大片段、基因组DNA、引物二聚体、引物和其他反应成分,得到目标区域序列的PCR产物。The method of claim 8, wherein the recovering is by using a magnetic bead to perform fragment screening and purification on the first round of PCR reaction products, and removing large fragments, genomic DNA, primer dimers, primers and other reaction components outside the target region. PCR product of the sequence of the target region.
  10. 权利要求7-9任一项的方法,所述方法还包括步骤d)对步骤c)得到扩增产物进行测序。 The method of any one of claims 7-9, further comprising the step of d) sequencing the amplified product obtained in step c).
PCT/CN2016/106608 2016-05-06 2016-11-21 Primer set for amplifying multiple target dna sequences in sample and use thereof WO2017190487A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2016405379A AU2016405379A1 (en) 2016-05-06 2016-11-21 Primer set for amplifying multiple target DNA sequences in sample and use thereof

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610293653.2A CN105925678B (en) 2016-05-06 2016-05-06 Primer group for amplifying multiple target DNA sequences in sample and application thereof
CN201610293653.2 2016-05-06

Publications (1)

Publication Number Publication Date
WO2017190487A1 true WO2017190487A1 (en) 2017-11-09

Family

ID=56835122

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/106608 WO2017190487A1 (en) 2016-05-06 2016-11-21 Primer set for amplifying multiple target dna sequences in sample and use thereof

Country Status (3)

Country Link
CN (2) CN105925678B (en)
AU (2) AU2016405379A1 (en)
WO (1) WO2017190487A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110295252A (en) * 2019-05-22 2019-10-01 东莞博盛生物科技有限公司 Improve reaction solution, primer pair, probe and the kit of the detection dog parainfluenza of specificity

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105925678B (en) * 2016-05-06 2021-10-12 艾吉泰康生物科技(北京)有限公司 Primer group for amplifying multiple target DNA sequences in sample and application thereof
CN107254541B (en) * 2017-08-03 2020-05-08 广州万德基因医学科技有限公司 NGS (Next Generation Stratagene) library-building primer pool for amplifying multiple targets in cfDNA (deoxyribonucleic acid) sample and application
CN108251502B (en) * 2017-12-27 2021-12-24 深圳华大基因股份有限公司 Enrichment method of free tumor DNA in peripheral blood, kit and application thereof
CN110714077B (en) * 2019-08-13 2023-10-31 阔然生物医药科技(上海)有限公司 Primer for simultaneously detecting BRCA1/2 exon sequences and chemotherapeutic drug sites and application thereof
CN110734974B (en) * 2019-08-13 2023-10-31 阔然生物医药科技(上海)有限公司 SNP locus combination and detection primer for cancer chemotherapy drugs
CN110592210B (en) * 2019-08-13 2023-10-31 阔然生物医药科技(上海)有限公司 Sequence primer for detecting BRCA1/2 total exon and design method
CN112011599A (en) * 2020-09-07 2020-12-01 苏州贝康医疗器械有限公司 PCR primer set for reducing non-specific amplification and application thereof

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001055454A1 (en) * 2000-01-28 2001-08-02 Althea Technologies, Inc. Methods for analysis of gene expression
CN1718737A (en) * 2004-07-09 2006-01-11 南华大学 PCR method capable of eliminating non specific braid and RT-PCR chip prepared using said method
WO2014110528A1 (en) * 2013-01-13 2014-07-17 Unitaq Bio Methods and compositions for pcr using blocked and universal primers
CN105925678A (en) * 2016-05-06 2016-09-07 艾吉泰康生物科技(北京)有限公司 Primer group used for amplifying plurality of target DNA sequences in sample and applications of primer group

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001055454A1 (en) * 2000-01-28 2001-08-02 Althea Technologies, Inc. Methods for analysis of gene expression
CN1718737A (en) * 2004-07-09 2006-01-11 南华大学 PCR method capable of eliminating non specific braid and RT-PCR chip prepared using said method
WO2014110528A1 (en) * 2013-01-13 2014-07-17 Unitaq Bio Methods and compositions for pcr using blocked and universal primers
CN105925678A (en) * 2016-05-06 2016-09-07 艾吉泰康生物科技(北京)有限公司 Primer group used for amplifying plurality of target DNA sequences in sample and applications of primer group

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
XU, YANG-YAN ET AL.: "A reverse transcriptase-polymerase chain reaction method improving the specific amplification of caveolin-1 gene sequences", CHINESE JOURNAL OF MEDICAL GENETICS, vol. 2, no. 1, 28 February 2006 (2006-02-28), pages 59 - 62 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110295252A (en) * 2019-05-22 2019-10-01 东莞博盛生物科技有限公司 Improve reaction solution, primer pair, probe and the kit of the detection dog parainfluenza of specificity

Also Published As

Publication number Publication date
CN107446995A (en) 2017-12-08
CN107446995B (en) 2020-06-02
AU2016405379A1 (en) 2019-01-03
AU2016102399A4 (en) 2019-05-16
CN105925678B (en) 2021-10-12
CN105925678A (en) 2016-09-07

Similar Documents

Publication Publication Date Title
WO2017190487A1 (en) Primer set for amplifying multiple target dna sequences in sample and use thereof
JP6995751B2 (en) Methods and Compositions for Making and Using Guide Nucleic Acids
US8986958B2 (en) Methods for generating target specific probes for solution based capture
US9790543B2 (en) Methods and systems for solution based sequence enrichment
JP7282692B2 (en) Preparation and Use of Guide Nucleic Acids
CN102732506B (en) Methods and compositions for enriching target polynucleotides or non-target polynucleotides from a mixture of target and non-target polynucleotides
US6297010B1 (en) Method for detecting and identifying mutations
CN105647907B (en) It is a kind of for targeting the preparation method of the modified DNA hybridization probe of hybrid capture
WO2017181670A1 (en) Method for enriching target nucleic acid sequence from nucleic acid sample
JP2000509241A (en) Detection of nucleotide sequences at candidate loci by glycosylase
JP2022516821A (en) Transposome complex with surface-bound complex
EP2494069B1 (en) Method for detecting balanced chromosomal aberrations in a genome
US20100112556A1 (en) Method for sample analysis using q probes
CN106191256B (en) Method for DNA methylation sequencing aiming at target region
US6090548A (en) Method for identifying and/or quantifying expression of nucleic acid molecules in a sample
US20070054272A1 (en) Method of preparing dna fragments by selective fragmentation of nucleic acids and applications thereof
CN115698319A (en) Methods and compositions for preparing nucleic acid libraries
US9315807B1 (en) Genome selection and conversion method
JP5300335B2 (en) Highly efficient method for preparing mismatched base pair-containing nucleic acid fragments
CN117987389A (en) Taq DNA polymerase for reducing KASP genotyping nonspecific amplification and application thereof
CN113454235A (en) Improved nucleic acid target enrichment and related methods

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16901009

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2016405379

Country of ref document: AU

Date of ref document: 20161121

Kind code of ref document: A

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 21.02.2019)

122 Ep: pct application non-entry in european phase

Ref document number: 16901009

Country of ref document: EP

Kind code of ref document: A1