WO2023035889A1 - 基因融合的检测方法和装置 - Google Patents

基因融合的检测方法和装置 Download PDF

Info

Publication number
WO2023035889A1
WO2023035889A1 PCT/CN2022/113135 CN2022113135W WO2023035889A1 WO 2023035889 A1 WO2023035889 A1 WO 2023035889A1 CN 2022113135 W CN2022113135 W CN 2022113135W WO 2023035889 A1 WO2023035889 A1 WO 2023035889A1
Authority
WO
WIPO (PCT)
Prior art keywords
sample
optionally
target region
sequence
gene fusion
Prior art date
Application number
PCT/CN2022/113135
Other languages
English (en)
French (fr)
Inventor
杨志
陈彦梅
孙继国
赵多军
Original Assignee
成都齐碳科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 成都齐碳科技有限公司 filed Critical 成都齐碳科技有限公司
Publication of WO2023035889A1 publication Critical patent/WO2023035889A1/zh

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/156Polymorphic or mutational markers

Definitions

  • the present application belongs to the technical field of gene detection, and relates to a method for detecting gene fusion, and more specifically, the present application relates to a method and device for detecting gene fusion based on nanopore sequencing technology.
  • fusion genes began in the 1960s.
  • the Philadelphia chromosome discovered in patients with chronic myelogenous leukemia opened the prelude to the study of fusion genes and diseases.
  • the occurrence of many diseases is accompanied by the phenomenon of fusion genes.
  • leukemia is often accompanied by
  • fusion genes such as bcr/abl, AML1/ETO, CBF ⁇ /MYH11, PML/RAR ⁇ , etc. Fusion genes are also found in various solid tumors, including EML4-ALK in non-small cell lung cancer, SLC45A3-ELK4 in prostate cancer, rhabdomyosarcoma Among them are PAX3-FOXO1 and so on.
  • Scientific research has found that some fusion genes are involved in the pathogenic process of related diseases. Therefore, the detection of these fusion genes can be used as one of the diagnostic criteria and even as a therapeutic target.
  • Fusion genes are produced by chromosomal rearrangements, including chromosomal translocations, insertions, inversions, and deletions. Fusion gene detection methods include the following four methods: microscope staining observation, chromosome karyotype banding, fluorescence in situ hybridization and high-throughput sequencing methods. The resolution of fusion genes studied by high-throughput sequencing methods is higher than that of the first three methods. However, the current detection of fusion genes is generally based on the analysis of next-generation transcriptome sequencing data. This method designs capture probes in the exon region to enrich the target region, then performs second-generation high-depth sequencing, and judges whether there is a gene fusion through the reads across the breakpoint position of the fusion gene in the sequencing data.
  • fusion gene detection methods in related technologies, it is often impossible to distinguish between these chimeric sequences and For true fusion gene sequences, these randomly generated chimeric reads will also be detected as fusion genes, resulting in false positive results.
  • the fusion breakpoint is relatively fixed, and only involves the fusion between the exons of two genes. The fusion at the transcriptome level can clearly detect that the fusion gene has been expressed, but it cannot distinguish whether the detected fusion gene has mutated at the genome level or the RNA fusion that occurred after the two genes were transcribed separately.
  • the method of designing capture probes in the exon region combined with high-throughput sequencing can detect fusions at the genomic level, since fusions at this level mostly occur in the intron region, some inherent characteristics of the intron region itself , if the gene is long and contains a large number of repetitive sequences, it will affect the precise determination of the fusion breakpoint. For example, the similarity of introns between different genes will affect the accurate alignment of sequences. For example, the high GC area is not conducive to the combination of capture probes and affects the detection sensitivity.
  • nanopore sequencing technology has the advantages of long read length (up to MB level) and fast sequencing speed (400-500bp/s).
  • the inventors of the present application provided a gene fusion detection method based on nanopore sequencing technology, which requires a lower initial sample amount and greatly improved detection sensitivity, so it is particularly It is suitable for various detections in the fields of biological research and clinical medicine, such as disease diagnosis, etc.
  • the embodiment of the present application provides a method for detecting gene fusion, the method comprising the following steps:
  • step (2) Design PCR primers according to the linker sequence used in step (1) and the sequence upstream or downstream of the target region, and perform PCR amplification using the DNA fragment obtained in step (1) as a template to obtain a PCR product;
  • step (3) using the nanopore sequencing method to sequence the PCR product obtained in step (2);
  • step (3) (4) comparing the sequencing result obtained in step (3) with the reference genome sequence to determine whether gene fusion occurs in the target region of the sample to be tested.
  • the length of the DNA fragment is 2kb-15kb; preferably 9kb-11kb.
  • the length of the punctured fragment is designed to be a 2kb-15kb fragment, optionally a 9kb-11kb fragment, which not only improves the detection rate of the fusion gene, but also takes into account the characteristics of PCR amplification. The positive detection rate of gene fusion is guaranteed.
  • the DNA fragment is obtained by randomly breaking the genomic DNA; optionally, the random breaking is by physical method or enzyme cutting method; optionally, the random cleavage is an enzyme cleavage; further optionally, the enzyme cleavage uses Tn5 transposase or MuA transposase.
  • step (1) the end of the DNA fragment is repaired before an adapter is added to the end of the DNA fragment.
  • step (1) when Tn5 transposase is used to break the genomic DNA, the breaking, end repairing and linker addition are completed in one step, and then the DNA is broken down using ddNTP Fragments are 3' hydroxyl end blocked.
  • the random fragmentation of DNA is preferably carried out using transposases, especially Tn5 transposases, which have unexpected advantages over traditionally used restriction endonucleases, for example, the randomness of restriction endonucleases is relatively high. Large, and the length of PCR amplification is limited, so if there is a corresponding enzyme cutting site at a position that is too far upstream from the region to be detected (such as greater than 10kb), it is difficult to amplify such a long fragment by PCR.
  • a library of various types of endonucleases is generally constructed, which involves multi-step purification operations.
  • the initial amount of the sample to be tested is required to be high, reaching several micrograms or even a dozen micrograms, which is difficult to meet clinical testing. demand.
  • the inventors of the present application found that using Tn5 transposase to build a library, using ddNTP to block the 3' hydroxyl end and nested PCR amplification in the library building, effectively improved the amplification specificity of the downstream known fusion genes.
  • the initial sample size used in this method is relatively small, which can reach 10ng.
  • the target region includes a non-coding region; optionally, the target region includes a promoter region and/or an intron region.
  • the target area includes an adjacent unknown area; optionally, the unknown area is located upstream of the target area.
  • step (2) the PCR is nested PCR.
  • nested primers are designed according to the linker sequence used in step (1) and the sequence at 150bp-1000bp upstream or downstream from the target region; optionally, according to the step (1)
  • the 3' end of the outer primer of the nested primer is connected to the 5' end of the inner primer
  • step (2) in the nested PCR, the second round of amplification is performed after the first round of amplification products are fragment sorted; more optionally, the The products of the second round were subjected to fragment sorting to obtain longer PCR products.
  • step (3) before the nanopore sequencing, end repair is performed on the PCR product and a sequencing adapter is ligated.
  • step (4) it is also possible to determine the location where the gene fusion occurs, and/or determine the frequency of the gene fusion to occur.
  • the method is a method for non-disease diagnosis.
  • the embodiment of the present application provides a gene fusion detection device, which includes: a genomic DNA processing module, used to extract the DNA of the sample to be detected, and then break, end repair and add adapters; target region capture The module is used to amplify the DNA fragment of the detection sample using the designed nested PCR primers, and obtain the PCR product containing the target region; the sequencing module is used to sequence the PCR product by nanopore sequencing method, and obtain the sequencing result and a detection module, for comparing the obtained sequencing results with the reference genome sequence to determine whether gene fusion occurs in the target region of the sample to be tested.
  • a genomic DNA processing module used to extract the DNA of the sample to be detected, and then break, end repair and add adapters
  • target region capture The module is used to amplify the DNA fragment of the detection sample using the designed nested PCR primers, and obtain the PCR product containing the target region
  • the sequencing module is used to sequence the PCR product by nanopore sequencing method, and obtain the sequencing result and a detection module, for comparing
  • the sample to be tested is a tissue sample; preferably a blood sample.
  • the sample to be tested is a frozen sample, a puncture sample or an FFPE sample.
  • the embodiment of the present application also provides the use of the method and device, which includes at least one of the following (1)-(3):
  • the target region includes a non-coding region; optionally, the target region includes a promoter region and/or an intron region.
  • the target area includes an adjacent unknown area; optionally, the unknown area is located upstream of the target area.
  • the embodiment of the present application also provides the use of the method and device for diagnosing diseases caused by fusion genes.
  • the disease caused by the fusion gene is selected from leukemia, lymphoma, or non-small cell lung cancer.
  • the embodiment of the present application also provides a kit for diagnosing diseases caused by fusion genes, the kit comprising the reagent combination used in the method described in the present application;
  • the disease caused by the fusion gene is selected from leukemia, lymphoma, or non-small cell lung cancer.
  • the method provided in the examples of this application is based on nanopore sequencing. Compared with conventional second-generation high-throughput sequencing, the sequencing read length is long, no sequence splicing and assembly is required, and the real reaction sequence information can greatly save detection time. , and improve the detection efficiency.
  • the method provided in the examples of this application has more advantages in detecting fusion genes at the DNA level. For fusion breakpoints that occur in non-coding regions such as promoters and intron regions, the location of the breakage can be accurately detected.
  • the method provided in the examples of this application uses nested PCR to obtain the sequence of the target region. Compared with the probe capture method commonly used in the detection of fusion genes by second-generation high-throughput sequencing, the sensitivity has been greatly improved.
  • Fig. 1 shows the gel electrophoresis figure of the fragmented DNA obtained according to Example 1 of the present application
  • Fig. 2 shows the gel electrophoresis figure of the nested PCR product obtained according to Example 1 of the present application
  • Fig. 3 shows the length distribution of the high-quality alignment read length (read) obtained according to Example 1 of the present application
  • Fig. 4 shows the flow chart that detects according to embodiment 2 of the present application
  • Figure 5 shows the gel electrophoresis graph of fragmented DNA and nested PCR products obtained by using Tn5 transposase to break and build a library according to Example 2 of the present application;
  • FIG. 6 shows the position of the EML4-ALK fusion breakpoint obtained from the analysis of the sequencing results in Example 2 of the present application.
  • Example 1 Using the method of the embodiment of the application to detect myc gene fusion
  • the method of the present application was verified using the GM12878 cell line (coriell institute).
  • the conditions for the first round of PCR amplification were 98°C for 3min; 98°C for 10s, 72°C for 5min, 7 cycles; 98°C for 10s, 67°C for 5min, 32 cycles; 67°C for 7min; 12°C for storage.
  • the conditions of the second round of PCR amplification were 98°C for 3min; 98°C for 10s, 72°C for 5min, 5 cycles; 98°C for 10s, 67°C for 5min, 20 cycles; 67°C for 7min; 12°C for storage.
  • a nanopore sequencing library was constructed according to the general library construction process, and then sequenced on the machine. After the sequencing, the data obtained were analyzed, and the results showed that among the 160,000 reads obtained from all the sequencing, more than 140,000 reads were compared to the correct position on the human genome, and the effective data utilization rate exceeded 85%.
  • the length distribution of all correctly aligned high-quality reads was also analyzed, and the results are shown in Figure 3. Most of the high-quality aligned reads were below 3kb in length, and there were 74,425 reads. Among them, only 2 long reads (5-6kb) were detected.
  • the method of this application was verified by using a gDNA standard product (Jingliang Gene Technology) with a tumor structure variation of 5%.
  • the transposase complex was assembled with Tn5 transposase (novoprotein).
  • the fragmented and adapter-ligated DNA fragment contained a 9 bp gap.
  • the sealing system is: Tn5 fragmented DNA, 11.5 ⁇ L; 10 ⁇ Taq buffer, 2 ⁇ L; ddATP, ddTTP, ddCTP, ddGTP (1 ⁇ mol/ ⁇ L) each 1.5 ⁇ L; Taq enzyme (5U/ ⁇ L), 0.5 ⁇ L.
  • the blocked product was purified with 1 ⁇ beads.
  • the blocked product was subjected to two rounds of nested PCR amplification, and the amplification conditions were referred to the conditions described in Example 1.
  • the sequences of the amplification primers are shown in Table 2.
  • the results of the first round of amplification are shown in lanes 2 and 3 in Figure 5 .
  • the results of the second round of amplification are shown in lanes 4 and 5 in Figure 5 .
  • the second-round amplification product was purified with 0.6 ⁇ beads, and then the fragmented DNA template was subjected to end repair and sequencing adapter ligation using a commercial kit (ABclonal).

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Immunology (AREA)
  • Analytical Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Pathology (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Hospice & Palliative Care (AREA)
  • Oncology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

提供了基因融合的检测方法和装置。一种基因融合的检测方法,所述方法包括:(1)取待测样本的基因组DNA片段,将所述DNA片段末端加接头;(2)根据接头序列和位于目标区域上游或下游的序列设计PCR引物,以步骤(1)获得的DNA片段为模板进行PCR扩增,获得PCR产物;(3)使用纳米孔测序的方法,对步骤(2)获得的PCR产物进行测序;(4)将步骤(3)得到的测序结果与参照基因组序列比对,以确定所述待测样本的目标区域是否发生基因融合。还提供了一种基因融合的检测装置。提供的方法能够真实反应长读长序列的信息,针对融合断点位于重复序列区域的融合基因检出更具优势。

Description

基因融合的检测方法和装置
相关申请的交叉引用
本申请要求享有于2021年9月9日提交的名称为“基因融合的检测方法和装置”的中国专利申请202111056962.5的优先权,该申请的全部内容通过引用并入本文中。
技术领域
本申请属于基因检测技术领域,涉及一种基因融合的检测方法,更具体地本申请涉及一种基于纳米孔测序技术进行基因融合检测的方法和装置。
背景技术
融合基因的发现始于20世纪60年代,在慢性粒性白血病病人中发现的费城染色体,拉开了融合基因与疾病的研究序幕,许多疾病的发生都伴随有融合基因现象,例如,白血病常伴随有bcr/abl、AML1/ETO、CBFβ/MYH11、PML/RARα等融合基因,多种实体瘤中也发现融合基因,非小细胞肺癌中有EML4-ALK,前列腺癌中有SLC45A3-ELK4,横纹肌肉瘤中有PAX3-FOXO1等等。科学研究发现,一些融合基因参与了相关疾病的致病过程,因此,这些融合基因的检测可作为诊断标准之一,甚至可将其作为治疗靶点。
融合基因是染色体重排产生的,包括染色体易位、插入、颠倒、缺失。融合基因检测方法包括如下四种:显微镜染色观察、染色体核型显带、荧光原位杂交和高通量测序方法。高通量测序方法研究融合基因的分辨率比前三种更高。然而,当前对融合基因的检测一般基于二代转录组测序数据分析得到。该方法在外显子区设计捕获探针对目标区域进行富集,然后进行二代高深度测序,通过测序数据中跨过融合基因断点位置的reads来判断是否存在基因融合。然而,该方法存在许多不足,首先,其主要通过比对两种类型的读长(reads)来进行。但是由于转录组的复杂性和二代 RNA-seq测序读序读长的限制,短读序比对本身就面临非常大的挑战。也就是说高通量测序读长较短,并且设计的靶向探针只有一部分能够捕获到跨过断点的reads,捕获效率难以保证,因此该种检测的灵敏度相对较低。另一方面,二代测序在文库构建过程中会随机产生一些来自不同基因的片段随机连接在一起形成的嵌合读列,采用相关技术中的融合基因检测方法,经常无法区分这些嵌合序列和真正的融合基因序列,这些随机产生的嵌合读列也会被当做融合基因检测出来,造成结果的假阳性。再一方面,相对于基因组水平,转录组水平的融合,融合断点相对固定,且只涉及两个基因的外显子之间的融合。转录组水平的融合可以明确检测到的融合基因已经发生了表达,但无法区分检测到的融合基因在基因组水平就已发生了变异,还是在两个基因分别转录后才发生的RNA融合。进一步地,在外显子区设计捕获探针结合高通量测序的方法虽然可以检测基因组水平的融合,但由于该种水平的融合多发生在内含子区,内含子区本身固有的一些特性,如基因冗长、含大量重复序列,会影响融合断裂点的精准确定。如不同基因间内含子的相似性,会影响序列的准确比对。如高GC区,不利于捕获探针的结合,影响检测灵敏度。
基于此,当前对检测灵敏度更高的基因融合的检测方法存在需求。
发明内容
本申请的目的是针对相关技术中技术的不足,提供一种基于纳米孔测序技术进行基因融合检测的方法和装置。纳米孔测序技术作为第三代测序技术,具有读长长(可达MB级)和测序速度快(400-500bp/s)的优点。在此基础上,本申请的发明人提供了一种基于纳米孔测序技术的基因融合的检测方法,该方法所需要的样本起始量更低,并且检测灵敏度得到了极大的提高,因此特别适用于生物研究领域和临床医学领域的各种检测,如疾病诊断等。
本申请的目的是通过以下技术方案实现的:
一方面,本申请实施例提供了一种基因融合的检测方法,所述方法包括以下步骤:
(1)取待测样本的基因组DNA片段,将所述DNA片段末端加接头;
(2)根据步骤(1)使用的接头序列和位于目标区域上游或下游的序列设计PCR引物,以步骤(1)获得的DNA片段为模板进行PCR扩增,获得PCR产物;
(3)使用纳米孔测序的方法,对步骤(2)获得的PCR产物进行测序;
(4)将步骤(3)得到的测序结果与参照基因组序列比对,以确定所述待测样本的目标区域是否发生基因融合。
根据本申请一个方面的实施例,在步骤(1)中,所述DNA片段的长度为2kb-15kb;优选9kb-11kb。
发明人发现,在随机打断中将打断片段的长度设计为2kb-15kb的片段,可选地9kb-11kb的片段,既提高了融合基因的检出率,又兼顾PCR扩增的特点,保证了基因融合的阳性检出率。
根据本申请一个方面的实施例,在步骤(1)中,所述DNA片段为将所述基因组DNA随机打断获得的;可选地,所述随机打断采用物理法或酶切法;可选地,所述随机打断为酶切法打断;进一步可选地,所述酶切法打断使用Tn5转座酶或MuA转座酶。
根据本申请一个方面的实施例,在步骤(1)中,在所述DNA片段末端加接头之前进行DNA片段末端修复。
根据本申请一个方面的实施例,在步骤(1)中,当使用Tn5转座酶打断所述基因组DNA时,所述打断、末端修复和加接头一步完成,随后使用ddNTP将所述DNA片段进行3’羟基端封闭。
发明人发现,DNA的随机打断优选使用转座酶进行,尤其优选使用Tn5转座酶,相对于传统使用的限制性内切酶具有出乎意料的优点,例如限制性内切酶随机性较大,而PCR扩增的长度有限,因此如果离待检测区域上游太远(如大于10kb)的位置才有相应的酶切位点,这么长片段的PCR很难扩增完全。同时为了保证检出,一般会构建多种类型内切酶的文库,中间涉及多步纯化操作,因此对待检测样本的起始量要求较高,达到几微克甚至十几微克,很难满足临床检测的需求。而本申请的发明人发现,使用Tn5转座酶的建库方式,建库中使用ddNTP封闭3’羟基端及巢式PCR扩增的方法,有效的提高了下游已知融合基因的扩增特异性及效率,并且该方法使用的建库初始样本量较少,可以达到10ng。
根据本申请一个方面的实施例,其中,在步骤(2)中,所述目标区域包含非编码区;可选地,所述目标区域包含启动子区和/或内含子区域。
根据本申请一个方面的实施例,在步骤(2)中,所述目标区域包含临近的未知区域;可选地,所述未知区域位于所述目标区域的上游区域。
根据本申请一个方面的实施例,在步骤(2)中,所述PCR为巢式PCR。
根据本申请一个方面的实施例,在步骤(2)中,根据步骤(1)使用的接头序列和距离目标区域的上游或下游150bp-1000bp处的序列设计巢式引物;可选地,根据步骤(1)使用的接头序列和距离目标区域的上游或下游150bp-500bp处的序列设计巢式引物;可选地,所述巢式引物的外引物的3’端与内引物的5’端有5-15bp的重叠区;可选地,所述巢式引物的长度为25-30bp;进一步可选地,所述巢式引物的TM值为70℃。
根据本申请一个方面的实施例,在步骤(2)中,在所述巢式PCR中,对第一轮扩增产物进行片段分选后再进行第二轮扩增;更可选地,对第二轮的产物进行片段分选,以获得更长的PCR产物。
根据本申请一个方面的实施例,在步骤(3)中,在所述纳米孔测序之前,对所述PCR产物进行末端修复并连接测序接头。
根据本申请一个方面的实施例,在步骤(4)中,还可以确定所述基因融合发生的位置,和/或确定所述基因融合发生的频率。
根据本申请一个方面的实施例,所述方法为非疾病的诊断用途的方法。
另一个方面,本申请实施例提供了一种基因融合的检测装置,所述装置包括:基因组DNA处理模块,用于提取待检测样本的DNA,然后打断、末端修复并加接头;目标区域捕获模块,用于使用经设计的巢式PCR引物扩增检测样本的DNA片段,并获得包含目标区域的PCR产物;测序模块,用于通过纳米孔测序的方法对PCR产物进行测序,并获得测序结果;检测模块,用于将得到的测序结果与参照基因组序列比对,以确定所述待测样本的目标区域是否发生基因融合。
根据本申请一个方面的实施例,所述待测样本为组织样本;优选为血液样本。
根据本申请一个方面的实施例,所述待测样本为冰冻样本、穿刺样本或FFPE样本。
再一方面,本申请实施例还提供了所述方法和装置的用途,其包括如下(1)-(3)中的至少一种:
(1)检测待测样本的目标区域是否发生基因融合的用途;
(2)检测待测样本的目标区域发生基因融合的频率的用途;
(3)确定待测样本的目标区域发生基因融合的位置的用途。
可选地,所述目标区域包含非编码区;可选地,所述目标区域包含启动子区和/或内含子区域。
可选地,所述目标区域包含临近的未知区域;可选地,所述未知区域位于所述目标区域的上游区域。
再一方面,本申请实施例还提供了所述方法和装置用于诊断融合基因引起的疾病中的用途。
根据本申请一个方面的实施例,所述融合基因引起的疾病选自白血病、淋巴癌、或非小细胞肺癌。
再一方面,本申请实施例还提供了一种用于诊断融合基因引起的疾病的试剂盒,所述试剂盒包含本申请所述的方法中使用的试剂组合;
可选地,所述融合基因引起的疾病选自白血病、淋巴癌、或非小细胞肺癌。
与相关技术中技术相比,本申请的实施例具有以下的有益效果:
1.本申请的实施例提供的方法基于纳米孔测序,相较于常规的二代高通量测序,测序读长长,不需序列拼接组装,真实反应序列信息,从而可以大大节省了检测时间,并提高了检测效率。
2.本申请的实施例提供的方法在DNA水平检出融合基因更具优势,对于融合断点发生在非编码区如启动子、内含子区域的融合,可以精准检出断裂位置。
3.本申请的实施例提供的方法采用巢式PCR获得目标区域的序列,相较于二代高通量测序检测融合基因常采用的探针捕获方法,灵敏度有了极大的提高。
应当理解的是,以上的一般描述和后文的细节描述仅是示例性和解释性的,并不能限制本申请。
附图说明
此处的附图被并入说明书中并构成本说明书的一部分,示出了符合本申请的实施例,并与说明书一起用于解释本申请的原理,并不构成对本申请的不当限定。
图1示出为根据本申请的实施例1获得的片段化DNA的凝胶电泳图;
图2示出为根据本申请的实施例1获得的巢式PCR产物的凝胶电泳图;
图3示出为根据本申请的实施例1获得的高质量比对读长(read)的长度分布情况;
图4示出为根据本申请的实施例2进行检测的流程图;
图5示出为根据本申请的实施例2使用Tn5转座酶打断并建库获得的片段化DNA及巢式PCR产物的凝胶电泳图;
图6示出为根据本申请的实施例2测序结果分析得到的EML4-ALK融合断裂点位置。
具体实施方式
为了使本领域普通人员更好地理解本申请的技术方案,下面将结合附图,对本申请实施例中的技术方案进行清楚、完整地描述。应理解,此处所描述的具体实施例仅意在解释本申请,而不是限定本申请。对于本领域技术人员来说,本申请可以在不需要这些具体细节中的一些细节的情况下实施。下面对实施例的描述仅仅是为了通过示出本申请的示例来提供对本申请更好的理解。
实施例1使用本申请实施例的方法检测myc基因融合
以myc基因为例,利用GM12878细胞系(coriell institute)验证本申请的方法。
1.DNA的片段化:
利用g-tube(Covaris)对1μg提取好的完整基因组DNA进行打断,打断结果如图1所示,其中M为1kb DNA Ladder(天根生化科技(北 京)有限公司),1为打断后的DNA片段,并且从图1可以看出,片段化的DNA主带大于10kb。
2.末端修复,加接头:
利用商品化的末端制备酶(End Prep Enzymes,ABclonal)、受损的DNA修复酶II(Damaged DNA Repair Enzymes II,ABclonal)、T4DNA连接酶(T4DNA Ligase,ABclonal)对片段化后的DNA模板进行末端修复、加A尾和接头连接,纯化后进行巢式PCR扩增,最终扩增的结果如图2所示,其中M为1kb DNA Ladder(天根生化科技(北京)有限公司),1为第二轮巢式PCR产物,并且从图2可以看出,扩增产物为不同大小的DNA片段,最大扩增产物大于10kb。其中,所述PCR使用的引物如表1所示。
表1myc基因巢式PCR所用的引物
Figure PCTCN2022113135-appb-000001
第一轮PCR扩增的条件为98℃3min;98℃10s,72℃5min,7个循环;98℃10s,67℃5min,32个循环;67℃7min;12℃保存。
第二轮PCR扩增的条件为98℃3min;98℃10s,72℃5min,5个循环;98℃10s,67℃5min,20个循环;67℃7min;12℃保存。
3.将产物进行纳米孔测序,分析测序结果
将上述PCR产物纯化后定量,根据通用建库流程构建纳米孔测序文库,然后上机测序。测序结束后对所得数据进行分析,结果显示在所有测序得到16万条reads中,有超过14万条比对到了人基因组上正确的位置,有效数据利用率超过了85%。此外,还分析了所有正确比对的高质量 reads的长度分布情况,结果如图3所示,高质量比对reads长度大部分处于3kb以下,有74425条。其中,长读长reads(5-6kb)只测到2条。
由于没有对扩增产物进行片段分选,且测序平台对短片段序列的测序偏好性,所得的高质量比对reads长度大部分处于3kb以下。如果想要得到更长的reads,可以对第一轮扩增产物进行片段分选后再进行第二轮扩增,并同时对第二轮的产物也进行片段分选,可以尽可能得到更长的reads。
实施例2使用本申请实施例的方法检测EML4-ALK基因融合
以EML4-ALK融合基因为例,利用肿瘤结构变异5%的gDNA标准品(菁良基因科技)验证本申请的方法。
其中,所述方法的流程图如图4所示。
1.转座酶复合物组装
转座酶复合物中接头序列(表2所示)经常规退火后,与Tn5转座酶(novoprotein)组装转座酶复合物。
2.gDNA片段化及接头连接
利用组装好的转座酶复合物对10ng肿瘤结构变异5%gDNA标准品进行打断。打断的同时,片段化的DNA两端连接拼接的接头序列。打断体系如下:
Figure PCTCN2022113135-appb-000002
PCR仪上55℃反应3min后,立即置于冰上2min,1x beads纯化回收DNA片段。纯化回收后的DNA用接头上的引物(AP1)进行扩增,用来表征片段化的DNA大小。结果如图5中泳道1所示。
3.ddNTP封闭
片段化且连接接头的DNA片段中含有9bp的gap。为避免无效的扩增,经一轮PCR延伸步骤,将DNA结构中3’羟基端进行ddNTP封闭。封闭体系为:Tn5片段化的DNA,11.5μL;10×Taq buffer,2μL;ddATP、ddTTP、ddCTP、ddGTP(1μmol/μL)各1.5μL;Taq酶(5U/μL),0.5μL。72℃延伸7min后,1×beads纯化封闭后产物。
4.巢式PCR
封闭后的产物进行两轮巢式PCR扩增,扩增条件参照实施例1中所述条件。扩增引物序列如表2所示。
表2接头序列及EML4-ALK巢式PCR使用的引物
Figure PCTCN2022113135-appb-000003
第一轮扩增的结果如图5中2、3泳道。第二轮扩增结果如图5中4、5泳道。
5.末端修复、连接测序接头及纳米孔测序分析
第二轮扩增产物用0.6×beads纯化后利用商品化试剂盒(ABclonal)对片段化后的DNA模板进行末端修复、测序接头连接。
将连接有测序接头的产物进行纳米孔测序。在3次重复实验中,比对到高质量的参考序列(reference reads)分别有213119条、296043条、107378条,比对到高质量的EML4-ALK融合序列(fusion reads)分别有 9280条、14314条、5121条。计算突变频率分别为4.35%、4.83%、4.77%,这与菁良肿瘤结构变异标准品中突变频率为5%EML4-ALK检出结果一致。用IGV软件分析,找到的EML4-ALK融合断裂点位置与标准品中给定的位置亦一致。结果如图6所示。
讨论:
使用本申请的方法,利用10ng菁良肿瘤结构变异5%标准品基因组进行EML4-ALK融合基因检测,3次重复实验中,检测到的最长读长分别是11999bp、43924bp、11894bp,均达10kb以上。由于测序平台对短读长序列的测序偏好性,所得的高质量比对reads长度以短读长序列居多,大部分处于3kb以下。如果想要得到更长的reads,可以对第一轮扩增产物进行片段分选后再进行第二轮扩增,并同时对第二轮的产物也进行片段分选,可以尽可能得到更长的reads。
以上所述,仅为本申请的具体实施方式,所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,上述描述的系统、模块和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。应理解,本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到各种等效的修改或替换,这些修改或替换都应涵盖在本申请的保护范围之内。

Claims (10)

  1. 一种基因融合的检测方法,所述方法包括以下步骤:
    (1)取待测样本的基因组DNA片段,将所述DNA片段末端加接头;
    (2)根据步骤(1)使用的接头序列和位于目标区域上游或下游的序列设计PCR引物,以步骤(1)获得的DNA片段为模板进行PCR扩增,获得PCR产物;
    (3)使用纳米孔测序的方法,对步骤(2)获得的PCR产物进行测序;
    (4)将步骤(3)得到的测序结果与参照基因组序列比对,以确定所述待测样本的目标区域是否发生基因融合。
  2. 根据权利要求1所述的检测方法,其中,在步骤(1)中,所述DNA片段的长度为2kb-15kb;优选9kb-11kb;
    和/或,在步骤(1)中,所述DNA片段为将所述基因组DNA随机打断获得的;可选地,所述随机打断采用物理法或酶切法;可选地,所述随机打断为酶切法打断;进一步可选地,所述酶切法打断使用Tn5转座酶或MuA转座酶;
    和/或,在步骤(1)中,在所述DNA片段末端加接头之前进行DNA片段末端修复;
    和/或,在步骤(1)中,当使用Tn5转座酶打断所述基因组DNA时,所述打断、末端修复和加接头一步完成,随后使用ddNTP将所述DNA片段进行3’羟基端封闭。
  3. 根据权利要求1或2所述的方法,其中,在步骤(2)中,所述目标区域包含非编码区;可选地,所述目标区域包含启动子区和/或内含子区域;和/或,
    所述目标区域包含临近的未知区域;可选地,所述未知区域位于所述目标区域的上游区域。
  4. 根据权利要求1至3中任一项所述的方法,其中,在步骤(2) 中,所述PCR为巢式PCR;和/或,
    在步骤(2)中,根据步骤(1)使用的接头序列和距离目标区域的上游或下游150bp-1000bp处的序列设计巢式引物;可选地,根据步骤(1)使用的接头序列和距离目标区域的上游或下游150bp-500bp处的序列设计巢式引物;可选地,所述巢式引物的外引物的3’端与内引物的5’端有5-15bp的重叠区;可选地,所述巢式引物的长度为25-30bp;进一步可选地,所述巢式引物的TM值为70℃;
    和/或,在所述巢式PCR中,对第一轮扩增产物进行片段分选后再进行第二轮扩增;可选地,对第二轮的产物进行片段分选,以获得更长的PCR产物。
  5. 根据权利要求1至4中任一项所述的方法,其中,在步骤(3)中,在所述纳米孔测序之前,对所述PCR产物进行末端修复并连接测序接头。
  6. 根据权利要求1至4中任一项所述的方法,其中,在步骤(4)中,确定所述基因融合发生的位置,和/或确定所述基因融合发生的频率。
  7. 一种基因融合的检测装置,所述装置包括:
    基因组DNA处理模块,用于提取待检测样本的DNA,然后打断、末端修复并加接头;
    目标区域捕获模块,用于使用经设计的PCR引物扩增检测样本的DNA片段,并获得包含目标区域的PCR产物;
    测序模块,用于通过纳米孔测序的方法对PCR产物进行测序,并获得测序结果;
    检测模块,用于将得到的测序结果与参照基因组序列比对,以确定所述待测样本的目标区域是否发生基因融合。
  8. 如权利要求1至6中任一项所述的方法或如权利要求7所述的装置,其中所述待测样本为组织样本;可选为血液样本;和/或
    所述待测样本为冰冻样本、穿刺样本或FFPE样本。
  9. 如权利要求1至6中任一项所述的方法或如权利要求7所述的装置的用途,其包括如下(1)-(3)中的至少一种:
    (1)检测待测样本的目标区域是否发生基因融合的用途;
    (2)检测待测样本的目标区域发生基因融合的频率的用途;
    (3)确定待测样本的目标区域发生基因融合的位置的用途;
    可选地,所述目标区域包含非编码区;可选地,所述目标区域包含启动子区和/或内含子区域;
    可选地,所述目标区域包含临近的未知区域;可选地,所述未知区域位于所述目标区域的上游区域。
  10. 一种用于诊断融合基因引起的疾病的试剂盒,所述试剂盒包含权利要求1-6中任一项所述方法中使用的试剂组合;
    可选地,所述融合基因引起的疾病选自白血病、淋巴癌、或非小细胞肺癌。
PCT/CN2022/113135 2021-09-09 2022-08-17 基因融合的检测方法和装置 WO2023035889A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111056962.5A CN113862344A (zh) 2021-09-09 2021-09-09 基因融合的检测方法和装置
CN202111056962.5 2021-09-09

Publications (1)

Publication Number Publication Date
WO2023035889A1 true WO2023035889A1 (zh) 2023-03-16

Family

ID=78995145

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/113135 WO2023035889A1 (zh) 2021-09-09 2022-08-17 基因融合的检测方法和装置

Country Status (2)

Country Link
CN (1) CN113862344A (zh)
WO (1) WO2023035889A1 (zh)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113862344A (zh) * 2021-09-09 2021-12-31 成都齐碳科技有限公司 基因融合的检测方法和装置
CN114005489B (zh) * 2021-12-28 2022-03-22 成都齐碳科技有限公司 基于三代测序数据检测点突变的分析方法和装置
CN114807331B (zh) * 2022-05-12 2024-06-18 中国海洋大学 一种短链dna的纳米孔测序方法
CN115620810B (zh) * 2022-12-19 2023-03-28 北京诺禾致源科技股份有限公司 基于第三代基因测序数据的外源插入信息的检测方法和装置

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160153039A1 (en) * 2012-01-26 2016-06-02 Nugen Technologies, Inc. Compositions and methods for targeted nucleic acid sequence enrichment and high efficiency library generation
CN107190329A (zh) * 2017-05-27 2017-09-22 领星生物科技(上海)有限公司 基于dna的融合基因定量测序建库、检测方法及其应用
US20190309352A1 (en) * 2016-11-16 2019-10-10 Progenity, Inc Multimodal assay for detecting nucleic acid aberrations
CN111020019A (zh) * 2020-03-06 2020-04-17 元码基因科技(北京)股份有限公司 基于纳米孔技术进行基因融合检测的方法
CN113862344A (zh) * 2021-09-09 2021-12-31 成都齐碳科技有限公司 基因融合的检测方法和装置

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117012283A (zh) * 2015-10-10 2023-11-07 夸登特健康公司 无细胞dna分析中基因融合检测的方法和应用
EP3545106B1 (en) * 2017-08-01 2022-01-19 Helitec Limited Methods of enriching and determining target nucleotide sequences
CN107586835B (zh) * 2017-10-19 2020-11-03 东南大学 一种基于单链接头的下一代测序文库的构建方法及其应用
CN110396534A (zh) * 2019-08-12 2019-11-01 华大生物科技(武汉)有限公司 基因文库的构建方法、待测核酸样本基因突变的检测方法及试剂盒
CA3158739A1 (en) * 2019-11-19 2021-05-27 Xiao Bing Wang Method for identifying gene fusions by circle cdna amplification
CN111304285B (zh) * 2019-12-13 2020-09-29 北京先声医学检验实验室有限公司 基于纳米孔测序平台的泌尿宏基因组样本建库和检测方法
CN112195521A (zh) * 2020-09-11 2021-01-08 翌圣生物科技(上海)有限公司 一种基于转座酶的dna/rna共建库方法、试剂盒及应用

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160153039A1 (en) * 2012-01-26 2016-06-02 Nugen Technologies, Inc. Compositions and methods for targeted nucleic acid sequence enrichment and high efficiency library generation
US20190309352A1 (en) * 2016-11-16 2019-10-10 Progenity, Inc Multimodal assay for detecting nucleic acid aberrations
CN107190329A (zh) * 2017-05-27 2017-09-22 领星生物科技(上海)有限公司 基于dna的融合基因定量测序建库、检测方法及其应用
CN111020019A (zh) * 2020-03-06 2020-04-17 元码基因科技(北京)股份有限公司 基于纳米孔技术进行基因融合检测的方法
CN113862344A (zh) * 2021-09-09 2021-12-31 成都齐碳科技有限公司 基因融合的检测方法和装置

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
JECK WILLIAM R., LEE JESSE, ROBINSON HAYLEY, LE LONG P., IAFRATE A. JOHN, NARDI VALENTINA: "A Nanopore Sequencing–Based Assay for Rapid Detection of Gene Fusions", THE JOURNAL OF MOLECULAR DIAGNOSTICS, AMERICAN SOCIETY FOR INVESTIGATIVE PATHOLOGY AND THE ASSOCIATION FOR MOLECULAR PATHOLOGY, vol. 21, no. 1, 1 January 2019 (2019-01-01), pages 58 - 69, XP093045307, ISSN: 1525-1578, DOI: 10.1016/j.jmoldx.2018.08.003 *
ZHANG XUCHAO; ZHANG SHIRLEY; YANG XUENING; YANG JINJI; ZHOU QING; YIN LUCY; AN SHEJUAN; LIN JIAYING; CHEN SHILIANG; XIE ZHI; ZHU M: "Fusion of EML4 and ALK is associated with development of lung adenocarcinomas lacking EGFR and KRAS mutations and is correlated with ALK expression", MOLECULAR CANCER, BIOMED CENTRAL, LONDON, GB, vol. 9, no. 1, 13 July 2010 (2010-07-13), GB , pages 188, XP021077926, ISSN: 1476-4598, DOI: 10.1186/1476-4598-9-188 *

Also Published As

Publication number Publication date
CN113862344A (zh) 2021-12-31

Similar Documents

Publication Publication Date Title
WO2023035889A1 (zh) 基因融合的检测方法和装置
JP7119014B2 (ja) まれな変異およびコピー数多型を検出するためのシステムおよび方法
KR102628878B1 (ko) 메틸롬 분석을 이용한 암 검출 및 분류
JP6585117B2 (ja) 胎児の染色体異数性の診断
KR101858344B1 (ko) 바코드 서열을 포함하는 어댑터를 이용한 차세대 염기서열 분석 방법
WO2014151117A1 (en) Identification and use of circulating nucleic acid tumor markers
CN109161591A (zh) 单基因遗传性肾脏病基因联合筛查方法、试剂盒及其制备方法
US20210156863A1 (en) Cancer detection, classification, prognostication, therapy prediction and therapy monitoring using methylome analysis
US10947599B2 (en) Tumor mutation burden
CN110628880A (zh) 一种同步使用信使rna与基因组dna模板检测基因变异的方法
TW202102687A (zh) 確定循環核酸之線性及環狀形式
JP2023547620A (ja) 分配および塩基変換を使用してdnaを解析するための組成物および方法
CN111575349A (zh) 一种接头序列及其应用
CN114005490B (zh) 基于二代测序技术的循环肿瘤dna融合检测方法
WO2022262831A1 (zh) 用于评估肿瘤的物质及其方法
EP4281583A1 (en) Heatrich-bs: heat enrichment of cpg-rich regions for bisulfite sequencing
CN110564851B (zh) 一组用于非超突变型直肠癌分子分型的基因及其应用
CN112251512A (zh) 用于非小细胞肺癌患者基因检测的目标基因组以及相关的评估方法、用途和试剂盒
US11866765B2 (en) Composition for improving molecular barcoding efficiency and use thereof
Li et al. Molecular characterization and functional roles of circulating cell-free extrachromosomal circular DNA
JP2023524681A (ja) 分配された核酸を使用した配列決定のための方法
WO2023197004A1 (en) Detecting the presence of a tumor based on methylation status of cell-free nucleic acid molecules
CN116200491A (zh) 隆突性皮肤纤维肉瘤诊断及预后相关基因靶向检测试剂盒
JP2023551292A (ja) メチル化されたポリヌクレオチドを富化するための組成物および方法
WO2018148903A1 (zh) 泌尿系统肿瘤的辅助诊断方法

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22866362

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE