WO2022228351A1 - Representation and application of novel high temperature argonaute protein tpsago - Google Patents

Representation and application of novel high temperature argonaute protein tpsago Download PDF

Info

Publication number
WO2022228351A1
WO2022228351A1 PCT/CN2022/088816 CN2022088816W WO2022228351A1 WO 2022228351 A1 WO2022228351 A1 WO 2022228351A1 CN 2022088816 W CN2022088816 W CN 2022088816W WO 2022228351 A1 WO2022228351 A1 WO 2022228351A1
Authority
WO
WIPO (PCT)
Prior art keywords
nucleic acid
target nucleic
gdna
tpsago
amplification
Prior art date
Application number
PCT/CN2022/088816
Other languages
French (fr)
Chinese (zh)
Inventor
冯雁
孙莹璎
郭翔
叶星宇
李忠磊
Original Assignee
上海交通大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 上海交通大学 filed Critical 上海交通大学
Publication of WO2022228351A1 publication Critical patent/WO2022228351A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6816Hybridisation assays characterised by the detection means
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22Ribonucleases RNAses, DNAses
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A50/00TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
    • Y02A50/30Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change

Definitions

  • the invention belongs to the field of biotechnology, and in particular relates to the characterization and application of a novel high-temperature Argonaute protein TpsAgo.
  • the Argonaute (Ago) protein was first mentioned in a study describing mutants of Arabidopsis thaliana.
  • Ago proteins are key players in the eukaryotic RNA interference (RNAi) pathway, which can regulate gene expression post-transcriptionally, thereby defending against host-invading RNA viruses and protecting genome integrity.
  • RNAi RNA interference
  • the reported Ago proteins are mainly divided into two categories: eukaryotic Ago (eAgo) and prokaryotic Ago (pAgo).
  • Prokaryotic Ago can bind to single-stranded guide DNA or RNA, and catalyze the cleavage of target DNA or RNA that is complementary to the guide. Different from the CRISPR/Cas system, prokaryotic Ago does not require a PAM sequence when cutting the target nucleic acid strand. It can bind to the guide DNA or RNA complementary to the target nucleic acid and cut at any complementary position of the target.
  • High-temperature Ago proteins derived from archaea can exert shearing activity above 70 °C.
  • the high-temperature Agos that have been characterized so far can generally be combined with single-stranded guide DNA or RNA to shear target single-stranded DNA under high temperature conditions, and a few can also shear Cut the target single-stranded RNA.
  • the purpose of the present invention is to provide a high-specificity, high-sensitivity low-abundance mutant DNA enrichment and detection method.
  • a nucleic acid cutting system comprising:
  • the temperature of the nucleic acid cutting system is 65-90°C, preferably 70-85°C, more preferably 75-85°C.
  • the programmable endonuclease Argonaute is selected from the following group: TtAgo, TpsAgo, SeAgo, RsAgo, KpAgo, hAgo1, hAgo2, MpAgo, TpAgo, PfAgo, MjAgo, MfAgo, NgAgo, LrAgo, AaAgo, CbAgo, CpAgo, IbAgo, KmAgo.
  • the programmable endonuclease Argonaute is derived from Thermus parvatiensis, and the programmable endonuclease Argonaute is a programmable endonuclease TpsAgo.
  • the TpsAgo includes wild-type and mutant TpsAgo.
  • amino acid sequence of the wild-type programmable endonuclease TpsAgo is shown in NCBI sequence number WP_060384876.1.
  • the 5' end of the guide DNA has a modification selected from the group consisting of 5'-P, 5'-OH, 5'-Biotin, 5'-NH 2 C 6 , 5'- FAM, or 5'-SHC 6 .
  • the guide DNA is a single-stranded DNA molecule with 5'-terminal phosphorylation or 5'-terminal hydroxylation.
  • the guide DNA is a single-stranded DNA molecule phosphorylated at the 5' end.
  • the guide DNA and the reporter nucleic acid have a reverse complementary fragment.
  • the length of the guide DNA is 5-30nt, preferably 10-24nt, more preferably 16-24nt, more preferably 16-21nt.
  • the guide DNA is a single-stranded DNA molecule phosphorylated at the 5' end, and its length is 16nt-21nt.
  • the guide DNA is a 5'-terminal hydroxylated single-stranded DNA molecule with a length of 16nt-24nt.
  • the first nucleotide at the 5' end of the guide DNA is phosphorylated thymine (T), guanine (G), adenine (A) or cytosine (C).
  • the reporter nucleic acid is a single-stranded nucleic acid, including single-stranded DNA (ssDNA) or single-stranded RNA (ssRNA).
  • ssDNA single-stranded DNA
  • ssRNA single-stranded RNA
  • the reporter nucleic acid is single-stranded DNA (ssDNA) or single-stranded RNA (ssRNA).
  • the reporter nucleic acid is a single-stranded DNA.
  • the cleavage when the reporter nucleic acid is cleaved, the cleavage can be detected by electrophoresis.
  • the electrophoresis method is a 16% nucleic acid Urea-PAG electrophoresis detection method.
  • the length of the reporter nucleic acid is 10-100nt, preferably 20-70nt, more preferably 30-60nt, more preferably 40-50nt, and most preferably 45nt.
  • the reporter nucleic acid is a fluorescent reporter nucleic acid
  • the fluorescent reporter nucleic acid has a fluorescent group and/or a quenching group.
  • the fluorescent group and the quenching group are independently located at the 5' end and the 3' end of the fluorescent reporter nucleic acid.
  • the fluorescent group and the quenching group are located on both sides of the complementary regions of the fluorescent reporter nucleic acid and the guide DNA, respectively.
  • the fluorescent group includes: FAM, HEX, CY5, CY3, VIC, JOE, TET, 5-TAMRA, ROX, Texas Red-X, or a combination thereof.
  • the quenching group includes: BHQ, TAMRA, DABCYL, DDQ, or a combination thereof.
  • the fluorescent reporter nucleic acid is a single-stranded DNA molecule with only a fluorescent group, and the fluorescent group is FAM.
  • the nucleic acid cutting system further: (d) divalent metal ions.
  • the divalent metal ion is Mn 2+ or Co 2+ , preferably Mn 2+ .
  • the concentration of divalent metal ions is 80 ⁇ M-3 mM, preferably 100 ⁇ M-2 mM, more preferably 250 ⁇ M-2 mM.
  • the nucleic acid cutting system further comprises: (e) buffer.
  • the concentration of NaCl is 0-4800 mM, preferably 40-1600 mM, more preferably 80-800 mM, more preferably 80-400 mM.
  • the pH value of the buffer solution is 7-9, preferably 8.0.
  • the 2nd, 3rd, 4th, 6th, 8th, 10th, 11th, 13th, 14th or 16th position from the 5' end (more It is preferably the 2nd, 4th, 10th, 11th, 13th or 14th position, more preferably the 2nd, 10th or 13th position) when there is a mismatch with the reporter nucleic acid, it will be significantly reduced.
  • the significantly reducing the cleavage rate of the programmable endonuclease Argonaute refers to: under the same reaction conditions, the cleavage rate of the programmable endonuclease Argonaute is reduced by ⁇ 80% , preferably by ⁇ 85%, more preferably by ⁇ 90%.
  • the TeAgo enzyme is guided to cleave the fluorescent reporter nucleic acid, thereby generating a detectable signal (eg, fluorescence).
  • the concentration of the reporter nucleic acid is 0.4 ⁇ M-4 ⁇ M, preferably 0.6 ⁇ M-2 ⁇ M, more preferably 0.8 ⁇ M-1 ⁇ M, and most preferably 0.8 ⁇ M.
  • the concentration of the programmable endonuclease Ago is 10nM-10 ⁇ M, preferably 50nM-1 ⁇ M, more preferably 100nM-500nM, and most preferably 200nM .
  • the concentration of the guide DNA is 10 nM-10 ⁇ M, preferably 100 nM-3 ⁇ M, more preferably 1 ⁇ M-2.5 ⁇ M, and most preferably 2 ⁇ M.
  • the reporter nucleic acid is plasmid DNA
  • the guide DNA includes forward gDNA and reverse gDNA
  • the forward gDNA and one strand of the plasmid DNA can form a first reverse complementary region, and the reverse gDNA and the other strand of the plasmid DNA can form a second reverse complementary region.
  • the distance between the first reverse complementary region and the second reverse complementary region is ⁇ 200bp, preferably ⁇ 50bp, more preferably ⁇ 1bp .
  • the plasmid DNA is plasmid pUC19.
  • the plasmid DNA is in a supercoiled state.
  • the plasmid DNA is in a supercoiled state, and the GC content of the plasmid DNA (80bp GC content near the cleavage site) is 20%-35%, preferably 25%- 32%, more preferably 29%.
  • the plasmid DNA is in a supercoiled state, and the GC content of the plasmid DNA (the GC content of 80 bp near the cleavage site) is 55%-75%, preferably 62%- 68%, more preferably 65%.
  • a reaction system for enriching low-abundance target nucleic acid is provided, and the reaction system is used to simultaneously perform Helicase-dependent isothermal DNA amplification on a nucleic acid sample ) and nucleic acid cleavage reaction, thereby obtaining amplification-cleavage reaction product;
  • the nucleic acid sample contains a first nucleic acid and a second nucleic acid, wherein the first nucleic acid is the target nucleic acid, and the second nucleic acid is a non-target nucleic acid;
  • the nucleic acid cleavage reaction is used for specifically cleaving non-target nucleic acid, but not cleaving the target nucleic acid;
  • the amplification-cleavage reaction system contains (i) reagents required for isothermal amplification reaction and (ii) the nucleic acid cleavage system according to the first aspect of the present invention.
  • the concentration of the programmable endonuclease Argonaute (Ago) is 20-200nM, preferably 30-150nM, more preferably 40-100nM, the best The ground is 50 nM.
  • the reagents required for the isothermal amplification reaction include tHDA kit (purchased from New England Biolabs).
  • the reagents required for the isothermal amplification reaction further include a pair of amplification primers for the target nucleic acid.
  • the concentration of each primer in the amplification primer pair of the target nucleic acid is 10-300 nM, preferably 50-200 nM, more preferably 100 nM.
  • the concentration of the target nucleic acid is 0.1-100 nM, preferably 0.5-50 nM, more preferably 1 nM.
  • the gDNA includes forward gDNA and reverse gDNA
  • the forward gDNA refers to the gDNA having the same sequence fragment as the target nucleic acid
  • the reverse gDNA refers to the gDNA having the reverse complementary sequence fragment to the target nucleic acid
  • reaction system further includes: divalent metal ions.
  • the divalent metal ion is Mn 2+ .
  • the concentration of divalent metal ions is 50 ⁇ M-2000 ⁇ M, preferably 100 ⁇ M-1000 ⁇ M, more preferably 250 ⁇ M.
  • reaction temperature (reaction procedure) of the reaction system is: 65° C., 1.5-2 h.
  • the target nucleic acid is selected from the following group: wild-type EGFR sequence fragment, EGFR L861Q mutant sequence fragment.
  • nucleotide sequence of the wild-type EGFR sequence fragment is shown in SEQ ID NO:5.
  • nucleotide sequence of the EGFR L861Q mutant sequence fragment is shown in SEQ ID NO:6.
  • nucleotide sequences of the forward gDNA and the reverse gDNA are shown in SEQ ID NOs: 7 and 8, respectively.
  • nucleotide sequences of the amplification primer pair of the EGFR L861Q mutant sequence fragment are shown in SEQ ID NOs: 9 and 10, respectively.
  • a method for enriching low-abundance target nucleic acid comprising the steps of:
  • nucleic acid sample contains a first nucleic acid and a second nucleic acid, wherein the first nucleic acid is the target nucleic acid, and the second nucleic acid is a non-target nucleic acid,
  • the abundance of the target nucleic acid in the nucleic acid sample is F1a;
  • the nucleic acid cleavage reaction is used for specifically cleaving non-target nucleic acid, but not cleaving the target nucleic acid;
  • the amplification-cleavage reaction system contains (i) reagents required for isothermal amplification reaction and (ii) the nucleic acid cleavage system according to the first aspect of the present invention;
  • the abundance of the target nucleic acid in the amplification-cleavage reaction product is F1b
  • the ratio of F1b/F1a is ⁇ 5 (preferably ⁇ 10).
  • the target nucleic acid and the non-target nucleic acid differ by only one base.
  • the ratio of F1b/F1a ⁇ 10 when 1% ⁇ F1a ⁇ 10%, the ratio of F1b/F1a ⁇ 10, when 0.1% ⁇ F1a ⁇ 0.5%, the ratio of F1b/F1a ⁇ 100, when F1a ⁇ 0.1%, The ratio of F1b/F1a is ⁇ 200.
  • the nucleic acid sample includes a nucleic acid sample that is directly thermally lysed, a nucleic acid sample that is directly lysed by protease, an extracted nucleic acid sample, a nucleic acid sample that has been pre-amplified by PCR, or any nucleic acid-containing sample .
  • the target nucleic acid is a nucleotide sequence containing mutations.
  • the mutation is selected from the group consisting of nucleotide insertion, deletion, substitution, or a combination thereof.
  • the non-target nucleic acid (or the second nucleic acid) is a wild-type nucleotide sequence, a high-abundance nucleotide sequence, or a combination thereof.
  • the abundance of the non-target nucleic acid in the nucleic acid sample is F2a.
  • F1a+F2a 100%.
  • the ratio of F2a/F1a is ⁇ 20, preferably ⁇ 50, more preferably ⁇ 100, and most preferably ⁇ 1000 or ⁇ 5000.
  • the abundance of the non-target nucleic acid in the amplification-cleavage reaction product is F2b.
  • F1b+F2b 100%.
  • the F1b/F2b ⁇ 0.5, preferably ⁇ 1, more preferably ⁇ 2, most preferably ⁇ 3 or ⁇ 5.
  • the ratio of F1b/F1a is ⁇ 200, preferably ⁇ 500, more preferably ⁇ 1000, most preferably ⁇ 2000 or ⁇ 5000 or higher.
  • F1a ⁇ 0.5%, preferably ⁇ 0.2%, more preferably ⁇ 0.1%, most preferably ⁇ 0.01%.
  • F1b ⁇ 10%, preferably ⁇ 30%, more preferably ⁇ 50%, and most preferably ⁇ 70%.
  • the "reagents required for isothermal amplification reaction” include: helicase and DNA polymerase.
  • the "reagents required for isothermal amplification reaction” further include: dNTP, Mg 2+ , and isothermal amplification buffer.
  • the gDNA in the nucleic acid cleavage system forms a first complementary binding region with the nucleic acid sequence of the target region of the target nucleic acid (ie, the first nucleic acid); and the gDNA in the nucleic acid cleavage system also interacts with non- The nucleic acid sequence of the targeting region of the target nucleic acid (ie, the second nucleic acid) forms the second complementary binding region.
  • the first complementary binding region contains at least two unmatched base pairs.
  • the second complementary binding region contains 0 or 1 unmatched base pair.
  • the second complementary binding region contains one unmatched base pair.
  • the first complementary binding region contains at least two unmatched base pairs, so that the complex does not cleave the target nucleic acid; and the second complementary binding region contains one unmatched base pair. matched base pairs, causing the complex to cleave the non-target nucleic acid.
  • the targeting region of the target nucleic acid corresponds to the targeting region of the non-target nucleic acid (i.e. the second nucleic acid).
  • the amount of nucleic acid used as a template is 0.1-100 nM.
  • the method further includes:
  • the detection in step (c) includes quantitative detection, qualitative detection, or a combination thereof.
  • the quantitative detection is selected from the following group: TaqMan fluorescence quantitative PCR, Sanger sequencing, q-PCR, ddPCR, chemiluminescence, high-resolution melting curve method, NGS, etc.; From TaqMan real-time PCR, Sanger sequencing.
  • the first nucleic acid includes n different nucleic acid sequences, wherein n is a positive integer ⁇ 1.
  • n is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 , 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46 , 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71 , 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96 , 97, 98, 99, 100 or greater.
  • n is 2-1000, preferably 3-100, more preferably 3-50.
  • the method is non-diagnostic and non-therapeutic.
  • the nucleic acid sample includes nucleic acid from a sample, wherein the sample is selected from the group consisting of blood, cells, serum, saliva, body fluid, plasma, urine, prostatic fluid, bronchial lavage fluid, cerebrospinal fluid, gastric fluid, bile, lymph fluid, peritoneal fluid, feces, etc., or a combination thereof.
  • the low-abundance target nucleic acid is selected from the group consisting of wild-type EGFR sequence fragments and EGFR L861Q mutant sequence fragments.
  • kits for detecting target nucleic acid molecules comprising:
  • the kit includes:
  • the kit also contains:
  • the low-abundance target nucleic acid detection reagents include: primers, probes, and the like.
  • the low-abundance target nucleic acid detection reagents include: primers and probes required for TaqMan fluorescence quantitative PCR, or reagents required for Sanger sequencing.
  • the low-abundance target nucleic acid detection reagent includes the reagent required for Sanger sequencing.
  • the kit also contains:
  • the kit also includes:
  • first container, the second container, the third container, the fourth container, the fifth container and the sixth container may be the same or different containers.
  • a programmable endonuclease Argonaute for preparing a reagent or kit for detecting target molecules, or for preparing a reagent or kit for detecting low-abundance target nucleic acid.
  • the programmable endonuclease Argonaute is derived from Thermus parvatiensis; or a homologous analog thereof with the same or similar functions.
  • the TpsAgo includes wild-type and mutant TpsAgo.
  • the programmable endonuclease Argonaute has an amino acid sequence selected from the group consisting of:
  • the purpose of the present invention is to provide a novel high-temperature nuclease with multiple guide strand and substrate strand cleavage preferences.
  • the present invention provides a high-temperature Argonaute-TpsAgo protein, which has high-temperature nuclease activity, and is derived from a strain of thermophilic bacteria Thermus parvatiensis isolated from hot spring water (90-98°C) in northern India RL is the starting strain, which is a high-temperature Ago protein with a small molecular weight reported so far.
  • the present invention provides a high-temperature Argonaute, a gene of TpsAgo protein, which encodes the above-mentioned high-temperature nuclease TpsAgo protein.
  • the recombinant plasmid pET28a-TpsAgo is constructed after the gene of TpsAgo protein is excavated and sequence compared.
  • the recombinant plasmid is transformed into Escherichia coli (DE3) to realize the heterologous expression of TpsAgo, and then purified by Ni-NTA gravity column The Ago protein produced by the recombinant strain was obtained.
  • the molecular weight of the novel high-temperature Ago protein obtained by the invention is about 76kDa, and the enzyme can simultaneously use 5'-phosphorylated gDNA and 5'-hydroxylated gDNA to mediate the cleavage of single-stranded DNA target nucleic acid, and can also use 5'-phosphorylated gDNA and 5'-hydroxylated gDNA to mediate the cleavage of single-stranded DNA target nucleic acid.
  • - Phosphorylated gDNA mediates cleavage of single-stranded RNA target nucleic acids.
  • the optimum reaction temperature range is between 65°C-90°C; Mn 2+ and Co 2+ can be used as active ions, and 100 ⁇ M Mn 2+ can keep it highly active; the enzyme has a certain tolerance to NaCl concentration, The tolerance range is between 0-3200mM; the enzyme can utilize 16nt-21nt 5'-P gDNA and 16nt-24nt 5'-OH gDNA; the enzyme has a preference for the first base at the 5' end of gDNA, From the shear kinetics results, it can be seen that gDNA whose first hydroxyl group is T and G is preferred; the enzyme can utilize a variety of 5' end modified gDNA, such as 5'-P, 5'-OH, 5'- Biotin, 5'-NH 2 C 6 , 5'-FAM, 5-SHC 6 such as; the enzyme has a low tolerance for single-point mismatch between target and gDNA, which will have a certain degree of SNV gene detection application prospects.
  • the enzyme can cut single-stranded DNA as well as plasmid DNA, and can cut supercoiled DNA into linear DNA at the positions of 29% GC content and 65% GC content of pUC19 plasmid.
  • the enzyme has good discrimination between single-point mismatch and double-point mismatch, and can be combined with isothermal amplification technology to enrich and detect mutant genes such as EGFR L861Q through the coupling reaction of "amplification and shearing".
  • Figure 1 shows the results of phylogenetic analysis (A) and multiple sequence alignment (B) of TpsAgo.
  • Figure 2 shows the results of SDS-PAGE electrophoresis analysis of TpsAgo protein.
  • the lanes from left to right are protein Marker, cell lysis supernatant, cell lysis precipitation, and purified TpsAgo.
  • Figure 3 shows the results of the assay of the cleavage activity of TpsAgo.
  • Figure 4 shows the effect of length of 5'-phosphorylated gDNA (A) and 5'-hydroxylated gDNA (B) on TpsAgo cleavage activity.
  • Figure 5 shows a graph of the results of the optimum temperature range required for the TpsAgo reaction.
  • Figure 6 shows the results of the effect of divalent metal ion type (A) and concentration (B) on TpsAgo cleavage activity.
  • Figure 7 shows the results for the NaCl temperature range that TpsAgo can tolerate.
  • Figure 8 shows the results of the preference of TpsAgo for the first base at the 5' base end of gDNA.
  • Figure 9 shows the results of TpsAgo's high tolerance for gDNA 5' modification.
  • Figure 10 shows the results that TpsAgo can differentiate cleavage against single-point mismatches at different sites between gDNA and Target.
  • Figure 11 shows the results of cleavage of pUC19 plasmids at 29% GC (A, B) and 65% GC (C, D) content positions by TpsAgo.
  • OC stands for open-circle plasmid (one strand of plasmid is broken);
  • LIN stands for linearized plasmid (double strand of plasmid is broken);
  • SC stands for supercoiled plasmid.
  • Figure 12 shows the design principle of forward and reverse gDNAs for TpsAgo enrichment of SNV mutant genes.
  • Figure 13 shows the enrichment sequencing results of TpsAgo combined with isothermal amplification technology for the EGFR L861Q SNV gene (A means no TpsAgo added, B means 50nM TpsAgo added).
  • the present inventors After extensive and in-depth research and extensive screening, the present inventors have developed for the first time a method for enrichment and detection of low-abundance mutant DNA with high sensitivity, good specificity and high throughput. Specifically, the inventors obtained the nuclease TpsAgo through in vitro expression, purification and separation, and obtained its optimal reaction parameters through a large number of groping experiments, thereby providing a TpsAgo-based method for enriching low-abundance target nucleic acids and corresponding detection methods.
  • the invention has the advantages of non-invasiveness, easy operation, rapidity, etc., and can better detect low-abundance mutant genes in human liquid biopsy.
  • the technology of the invention can be widely used in various fields of molecular diagnosis involving nucleic acid detection, such as tumor liquid Biopsy, infectious diseases such as major infectious and pathogenic infectious diseases (viruses, pathogenic bacteria) detection fields.
  • infectious diseases such as major infectious and pathogenic infectious diseases (viruses, pathogenic bacteria) detection fields.
  • infectious diseases such as major infectious and pathogenic infectious diseases (viruses, pathogenic bacteria) detection fields.
  • infectious diseases such as major infectious and pathogenic infectious diseases (viruses, pathogenic bacteria) detection fields.
  • infectious diseases such as major infectious and pathogenic infectious diseases (viruses, pathogenic bacteria) detection fields.
  • the terms "containing” or “including (including)” can be open, semi-closed, and closed. In other words, the term also includes “consisting essentially of” or “consisting of.”
  • Transduction refers to the process of delivering an exogenous polynucleotide into a host cell for transcription and translation to produce a polypeptide product, including the use of plasmid molecules to transfer the exogenous polynucleotide to a host cell.
  • the polynucleotide is introduced into a host cell (eg, E. coli).
  • Gene expression or “expression” refers to the process of transcription, translation and post-translational modification of a gene to produce the RNA or protein product of a gene.
  • Polynucleotide refers to a polymeric form of nucleotides of any length, including deoxynucleotides (DNA), ribonucleotides (RNA), hybrid sequences thereof, and the like. Polynucleotides can include modified nucleotides, such as methylated or capped nucleotides or nucleotide analogs.
  • the term polynucleotide as used herein refers to interchangeable single- and double-stranded molecules. Unless otherwise specified, a polynucleotide in any of the embodiments described herein includes both the double-stranded form and the two complementary single strands known or predicted to make up the double-stranded form.
  • amino acids are within one or more of the following groups: glycine, alanine; and valine, isoleucine, leucine, and proline; aspartic acid, glutamic acid amino acids; asparagine, glutamine; serine, threonine, lysine, arginine and histidine; and/or phenylalanine, tryptophan and tyrosine; methionine and cysteine .
  • the present invention also provides non-conservative amino acid substitutions that allow for amino acid substitutions from different groups.
  • Argonaute protein belongs to the PIWI (P element-induced wimpy testis) protein superfamily, which is defined by the presence of the PIWI domain, widely present in all areas of life, and can bind to siDNA or siRNA guide strand to specifically silence or cut complementary nucleic acids target strand.
  • PIWI P element-induced wimpy testis
  • RNA interference RNA interference
  • eAgos Eukaryotic Argonaute protein
  • RISC multi-protein RNA-induced silencing complex
  • siRNA molecules as guide strands, cleaves complementary target RNAs, and directly silence the translation of target RNAs; or by binding to target RNAs, Other silencing factors are recruited to promote their degradation, thereby indirectly silencing the target RNA.
  • eAgos can regulate gene expression post-transcriptionally, protect their hosts from invading RNA viruses, and maintain genome integrity by reducing the mobility of transposons.
  • Argonaute proteins are also present in prokaryotes. Structural and biochemical studies of some prokaryotic Ago (pAgos) proteins (mainly from thermophilic bacteria and archaea) have shown that they can function as endonucleases in vitro and host defense in vivo. pAgos can bind to the siDNA guide strand to specifically cleave the complementary paired DNA target strand of the guide strand. As of 2018, the reported pAgos are mainly derived from high temperature hosts and are mostly used for genetic testing. The activity is very low at room temperature and cannot be used as a tool for gene editing. Since 2019, some pAgos derived from normal temperature hosts have been reported successively, which can exert DNA-directed DNA shearing activity under normal temperature conditions, and can shear plasmids with low GC content.
  • pAgos prokaryotic Ago
  • programmable endonuclease Thermus parvatiensis As used herein, the terms “programmable endonuclease Thermus parvatiensis”, “nuclease Thermus parvatiensis”, “TpsAgo enzyme” are used interchangeably and refer to the enzymes described in the first aspect of the invention.
  • the wild-type TpsAgo enzyme has the amino acid sequence shown in NCBI SEQ ID NO: WP_060384876.1.
  • the TpsAgo enzymes of the present invention may also comprise mutant forms thereof that retain functional activity.
  • Said mutant form can contain one or more amino acid residues substitution, deletion, change or insertion on the basis of the sequence shown in NCBI sequence number WP_060384876.1, or add 1 to 10 amino acid residues (preferably 1 to 5 amino acid residues, more preferably 1 to 3 amino acid residues), the amino acid sequence obtained; and the amino acid sequence obtained is the same as NCBI sequence number WP_060384876.1
  • the sequence shown has ⁇ 85% (preferably ⁇ 90%, more preferably ⁇ 95%, such as ⁇ 96%, ⁇ 97%, ⁇ 98% or ⁇ 99%) sequence identity; and the amino acid sequence obtained has Same or similar function as wild-type TpsAgo enzyme.
  • the reaction when the TpsAgo-gDNA complex is used to carry out the coupling reaction of "cleaving while amplification", the reaction can be carried out under appropriate conditions using the corresponding cleavage enzyme and the corresponding amplification enzyme, as long as the conditions are The cleavage enzymes and amplification enzymes can perform their corresponding functions.
  • the research of the present invention shows that, for enriching the mutant dsDNA signal through the coupling reaction, some key factors mainly include the following aspects:
  • the initial template concentration in the enrichment reaction system wild type (wild type, wt) and mutant type (mutant type, mut) total concentration (nM ⁇ fM)): preferably 0.1-100nM.
  • 2Initial TpsAgo protein concentration in the enrichment reaction system preferably 20-100nM;
  • the initial concentration of gDNAs in the enrichment reaction system preferably 200-2000nM;
  • a reaction system for enriching low-abundance target nucleic acids A reaction system for enriching low-abundance target nucleic acids
  • reaction system for enriching low-abundance target nucleic acid and “enrichment system of the present invention” are used interchangeably, and refer to the method for enriching low-abundance target nucleic acid described in the second aspect of the present invention. reaction system.
  • a reaction system for enriching low-abundance target nucleic acid is provided.
  • the system is based on the above-mentioned counting principle of the coupling reaction of "amplifying while shearing", and is used for simultaneously performing nucleic acid amplification on a nucleic acid sample.
  • the amplification reaction and the nucleic acid cleavage reaction are carried out to obtain the amplification-cleavage reaction product.
  • the nucleic acid sample contains a first nucleic acid and a second nucleic acid, wherein the first nucleic acid is the target nucleic acid, and the second nucleic acid is a non- Target nucleic acid; the nucleic acid cleavage reaction is used to specifically cut non-target nucleic acid, but not the target nucleic acid.
  • the amplification-cleavage reaction system contains (i) the reagents required for the nucleic acid amplification reaction and (ii) the nucleic acid based on the programmable endonuclease Argonaute (Ago) of the present invention cutting system.
  • the concentration of the low-abundance target nucleic acid is 0.5-5nM, preferably 0.8-2nM, more preferably 1nM.
  • the concentration of the programmable endonuclease Argonaute (Ago) is 20-200 nM, preferably 30-150 nM, more preferably 40-100 nM.
  • the reagents required for the isothermal amplification reaction include One-Step tHDA kit (purchased from NEB Company (New England Biolabs).
  • the reagents required for performing the isothermal amplification reaction also include a pair of amplification primers for the target nucleic acid.
  • concentration of each primer in the amplification primer pair of the target nucleic acid is 10-300 nM, preferably 50-200 nM, more preferably 100 nM.
  • the gDNA includes forward gDNA and reverse gDNA; wherein, the forward gDNA refers to the gDNA with the same sequence fragment as the target nucleic acid, and the reverse gDNA refers to the gDNA with the target nucleic acid. gDNA of the reverse complement fragment.
  • the reaction system further includes divalent metal ions.
  • the divalent metal ion is Mn 2+ .
  • the concentration of the divalent metal ion is 50 ⁇ M-2000 ⁇ M, preferably 100 ⁇ M-1000 ⁇ M, more preferably 250 ⁇ M.
  • reaction temperature (reaction procedure) of the reaction system is: 65° C., 1.5-2 h.
  • enrichment method of the present invention As used herein, the terms “enrichment method of the present invention”, “method for enriching low-abundance target nucleic acid” and “method for enriching nucleic acid of the present invention” are used interchangeably, and all refer to the third aspect of the present invention. Methods for enriching low-abundance target nucleic acids.
  • the present invention provides a method for enriching low-abundance target nucleic acid, comprising the steps of: (a) providing a nucleic acid sample, wherein the nucleic acid sample contains a first nucleic acid and a second nucleic acid, wherein the first nucleic acid is The target nucleic acid, and the second nucleic acid is a non-target nucleic acid, and the abundance of the target nucleic acid in the nucleic acid sample is F1a; (b) the nucleic acid in the nucleic acid sample is used as a template, Perform isothermal amplification and nucleic acid cleavage reaction in an amplification-cleavage reaction system to obtain an amplification-cleavage reaction product; wherein the nucleic acid cleavage reaction is used to specifically cut non-target nucleic acid, but not cut the target nucleic acid and, the amplification-cutting reaction system contains (i) reagents required for isothermal amplification reaction and (ii) the nucle
  • the target nucleic acid and the non-target nucleic acid differ by only one base.
  • the ratio of F1b/F1a ⁇ 10 when 1% ⁇ F1a ⁇ 10%, the ratio of F1b/F1a ⁇ 10, when 0.1% ⁇ F1a ⁇ 0.5%, the ratio of F1b/F1a ⁇ 100, when F1a ⁇ 0.1%, the ratio of F1b/F1a Ratio ⁇ 200.
  • the terms "detection method of the present invention” and “method for detecting low-abundance target nucleic acid” can be used interchangeably, and refer to the method for enriching low-abundance target nucleic acid based on the third aspect of the present invention.
  • a method for the detection of enriched low-abundance target nucleic acids can be used interchangeably, and refer to the method for enriching low-abundance target nucleic acid based on the third aspect of the present invention.
  • the present invention provides a method for detecting low-abundance target nucleic acid, the method is based on the above-mentioned method steps for enriching low-abundance target nucleic acid, further comprising: (c) detecting the amplification-cleavage reaction product , thereby determining the presence and/or quantity of the target nucleic acid.
  • the detection in the step (c) includes quantitative detection, qualitative detection, or a combination thereof.
  • the quantitative detection is selected from the following group: TaqMan fluorescence quantitative PCR, Sanger sequencing, q-PCR, ddPCR, chemiluminescence, high-resolution melting curve method, NGS, etc.; more preferably, selected from TaqMan fluorescence Quantitative PCR, Sanger sequencing.
  • the detection methods of the present invention may be non-diagnostic and non-therapeutic.
  • the nucleic acid sample includes nucleic acid from a sample, wherein the sample is selected from the group consisting of blood, cells, serum, saliva, body fluid, plasma, urine, prostatic fluid, bronchial lavage fluid , cerebrospinal fluid, gastric fluid, bile, lymph fluid, peritoneal fluid and feces, etc. or a combination thereof.
  • the low-abundance target nucleic acid can be an EGFR L861Q mutant sequence fragment.
  • the present invention further provides a kit for detecting target nucleic acid molecules, comprising: (i) the present invention for enriching the low-abundance target nucleic acid reaction system or reagents for preparing the reaction system; (ii) detection reagents for detecting low-abundance target nucleic acids; and (ii) instructions for use, which describe the detection method of the present invention.
  • the kit comprises: (a) a first container and a guide DNA located in the first container; (b) a second container and a programmable endonuclease Argonaute located in the second container (Ago); and (c) a third container and a nucleic acid amplification reaction reagent located in the third container.
  • the kit further contains: (d) a fourth container and a low-abundance target nucleic acid detection reagent located in the fourth container.
  • the low-abundance target nucleic acid detection reagents include: primers, probes, and the like.
  • the low-abundance target nucleic acid detection reagents include: primers and probes required for TaqMan fluorescence quantitative PCR, or reagents required for Sanger sequencing.
  • the kit further contains: (e) a fifth container and a divalent metal ion located in the fifth container.
  • the kit further comprises: (f) a sixth container and a buffer located in the sixth container.
  • the containers described above may be the same or different containers.
  • the TpsAgo protein of the present invention has a good distinction between single-point or double-point mismatches between target DNA and gDNA. Using this property, it can be applied to the enrichment and enrichment of low-abundance mutant genes. In the detection, the detection of early mutated genes in tumors is realized.
  • TpsAgo can use a variety of 5'-modified gDNA to cleave complementary DNA.
  • TpsAgo can tolerate high concentration of NaCl.
  • TpsAgo can cut a variety of target nucleic acids, including single-stranded DNA, single-stranded RNA, and double-stranded DNA.
  • TpsAgo was selected as the candidate enzyme.
  • the amino acid sequence of TpsAgo (WP_060384876.1) and the corresponding gene sequence encoding the protein (NZ_CP014142.1) were obtained. After the gene sequence was synthesized by codon optimization, it was cloned into pET28a expression vector.
  • TpsAgo-pET28a prokaryotic expression plasmid was introduced into E.coli BL21(DE3) to obtain a TpsAgo-pET28a/E.coli BL21(DE3) prokaryotic expression strain.
  • the expression strain E.coli BL21(DE3) containing the recombinant plasmid TpsAgo-pET28a was inoculated into LB medium containing 50 ⁇ g/mL kanamycin, and incubated at 37°C with a shaker at 220 rpm to an OD 600 to 0.6-0.8.
  • the cells were collected by centrifugation, resuspended in a resuspension buffer (containing 20 mM Tris-HCl, about pH 8.0, 1 M NaCl), then disrupted by high pressure, and centrifuged to obtain the supernatant.
  • the protein was affinity purified by Ni-NTA column, and the eluate was concentrated by ultrafiltration, desalted and other steps to obtain the purified protein.
  • the purified protein was stored in a buffer containing 20 mM Tris-HCl, and the protein was assayed by BCA kit, and the assay steps were carried out according to the operating instructions.
  • BSA as a standard, prepare a standard solution, draw a standard curve, and calculate the purified target protein concentration based on this, and store the protein in a -80°C refrigerator for later use.
  • TpsAgo protein was analyzed by SDS-PAGE electrophoresis.
  • TpsAgo was subjected to multiple sequence alignment with some of the characterized Agos.
  • TpsAgo had the smallest number of amino acids and the smallest protein molecular weight. It has been reported in the literature that the targeted cleavage of all catalytically active Ago proteins is mediated by a conserved DEDX (X stands for histidine, aspartate or asparagine) quadruplet. Through sequence alignment, it can be found that there is a DEDD quadruplet in TpsAgo ( Figure 1). Therefore, it is further speculated that it may have nuclease catalytic activity, which requires further in vitro identification and characterization.
  • DEDX histidine, aspartate or asparagine
  • DNA target nucleic acid sequence SEQ ID NO: 1
  • RNA target nucleic acid SEQ ID NO: 2
  • gRNA SEQ ID NO: 4
  • reaction buffer (containing 15mM Tris-HCl pH8.0, 250mM NaCl), add 0.5mM MnCl 2 , 200nM TpsAgo, 2 ⁇ M synthetic gDNA or gRNA and 0.8 ⁇ M 5' fluorescently modified to the reaction buffer
  • Sequence complementary single-stranded DNA or RNA target nucleic acid react at 80°C for 30min, after the reaction, take 6-10 ⁇ L of sample, add loading buffer (containing 95% (deionized) formamide, 0.5mmol/ L EDTA, 0.025% bromophenol blue, 0.025% xylene blue), electrophoresis detection was performed under 16% nucleic acid Urea-PAGE.
  • TpsAgo can use 5'-P and 5'-OH gDNA to cleave complementary single-stranded DNA, and can also use 5'-P gDNA to cleave complementary single-stranded RNA.
  • 11-30nt 5'phosphorylated gDNA and 14-24nt 5'hydroxylated gDNA were designed to explore the effect of different lengths of gDNA on TpsAgo enzyme activity.
  • MnCl 2 with a final concentration of 0.5 mM, TpsAgo with a final concentration of 200 nM, 2 ⁇ M of synthesized gDNA of different lengths, and 0.8 ⁇ M of 60nt sequence complementary single-stranded DNA target nucleic acid were added to the reaction buffer, respectively, and reacted at 80 °C for 30 min.
  • Electrophoretic detection was performed under 16% nucleic acid Urea-PAGE.
  • TpsAgo can use 16-20nt 5'-P gDNA, and can also use 16-22nt 5'-OH gDNA to cut complementary target nucleic acid.
  • TpsAgo The enzymatic activity of TpsAgo was investigated at different temperatures (50°C, 55°C, 60°C, 65°C, 70°C, 75°C, 80°C, 85°C, 90°C, 95°C, 100°C).
  • MnCl 2 with a final concentration of 0.5mM
  • TpsAgo with a final concentration of 200nM
  • 2 ⁇ M synthetic gDNA and 0.8 ⁇ M 60nt sequence complementary single-stranded DNA target nucleic acid and react at different temperatures for 30min, and the reaction products are displayed on 16% nucleic acid Urea-PAGE electrophoresis detection.
  • TpsAgo can use 5'-P gDNA to cleave complementary target nucleic acid in the range of 65°C-90°C.
  • the concentrations of TpsAgo, guide strand and target nucleic acid remained unchanged.
  • CoCl 2 , CuCl 2 , MgCl 2 , MnCl 2 , ZnCl 2 , and CaCl 2 solutions with a final concentration of 0.5 mM were added to the reaction system.
  • the effect of metal ions on enzyme activity was detected by electrophoresis under 16% nucleic acid Urea-PAGE.
  • the reaction system and conditions remained unchanged.
  • MnCl 2 Different concentrations of MnCl 2 were added: 25 ⁇ M, 50 ⁇ M, 100 ⁇ M, 250 ⁇ M, 500 ⁇ M, 1000 ⁇ M, and 2000 ⁇ M to determine the optimal MnCl 2 concentration of TpsAgo mediated by the 5' phosphorylated guide strand.
  • TpsAgo can utilize Mn 2+ and Co 2+ as metal ions to mediate single-stranded DNA-guided DNA cleavage.
  • TpsAgo prefers Mn 2+ , and 100 ⁇ M-2000 ⁇ M Mn 2+ can keep TpsAgo highly active.
  • reaction buffers with a final concentration of 15mM Tris-HCl pH8.0 and different concentrations of NaCl (20mM, 40mM, 80mM, 200mM, 400mM, 800mM, 1600mM, 2400mM, 3200mM, 4000mM, 4800mM).
  • the other reaction systems were unchanged, and the reaction was carried out at 80 °C for 30 min, and electrophoresis was carried out under 16% nucleic acid Urea-PAGE.
  • the 16nt gDNA whose first base at the 5' end of gDNA is A, T, G, and C was designed respectively.
  • the reaction system was unchanged. MnCl 2 , gDNA and target DNA were added. The preference of the first base at the 5' end of gDNA.
  • the reaction was carried out at 80°C for 15 min, and electrophoresis was performed under 16% nucleic acid Urea-PAGE.
  • the reaction system was unchanged, and the reactions were performed for 0, 5 min, 10 min, 15 min, 20 min, 25 min, and 30 min, respectively, and the shearing kinetics of the four kinds of gDNA were determined.
  • TpsAgo can utilize 5'-P, 5'-OH, 5'-Biotin, 5'-NH 2 C 6 , 5'-FAM, 5'-SHC 6 modified gDNA, and prefer 5'-OH and 5'-NH 2 C 6 .
  • the gDNAs of the two strands of the spliced plasmid pUC19 with GC contents of 29% and 65% (within 80 bp upstream and downstream of the cleavage site) were designed and synthesized. The sequences are shown in Figure 11 .
  • Prepare reaction buffer (containing 15mM Tris-HCl pH8.0, 100mM NaCl), add MnCl 2 at a final concentration of 0.5mM, 750nM TpsAgo, 2.5 ⁇ M synthetic forward and reverse gDNA and 300ng-600ng pUC19 plasmid to the reaction buffer , and react at 80°C for 2-4h.
  • Example 6 TpsAgo combined with isothermal amplification technology to achieve enrichment detection of EGFR single nucleotide variation (SNV) gene
  • gDNAs with mismatches with the SNV gene at positions 10-11 were designed, as shown in Figure 12.
  • the two strands of EGFR were used as target DNAs to screen gDNAs that could differentiate between splicing wild-type and SNV genes.
  • the splicing system was the same as above.
  • the wild and mutant templates were amplified by PCR using EGFR L861Q SNV wild and mutant fragments as substrates, respectively. After the PCR product was purified and recovered, the prepared samples were quantified using the Pikogreen dsDNA quantification kit (supersensitive) (compatible with Qubit 3.0) sold by Liji Bio. Templates were configured as 10 nM (final concentration of enrichment system) 10% mut EGFR L861Q samples.
  • the EGFR L861Q SNV gene was enriched using the tHDA kit sold by New England Biolabs in combination with TpsAgo. Take a 50 ⁇ L reaction system as an example:
  • Enriched samples were detected by Sanger sequencing.

Abstract

Provided are a representation and application of a novel high temperature Argonaute protein. Specifically, provided is a nucleic acid cleaving system, which is characterized in that the nucleic acid cleaving system comprises: (a) a guide DNA (gDNA); (b) a programmable endonuclease Argonaute (Ago); and (c) optional reporter nucleic acid, wherein if the reporter nucleic acid is cleaved, the cleavage can be detected. In addition, further provided are a system and a method for enriching and detecting low-abundance target nucleic acids of the programmable endonuclease TpsAgo. The provided detection system and method can specifically target low-abundance nucleic acids for effective enrichment and efficient detection, and the programmable endonuclease TpsAgo has a strong potential for gene manipulation.

Description

一种新型高温Argonaute蛋白TpsAgo表征及应用Characterization and application of a novel high temperature Argonaute protein TpsAgo 技术领域technical field
本发明属于生物技术领域,具体涉及一种新型高温Argonaute蛋白TpsAgo表征及应用。The invention belongs to the field of biotechnology, and in particular relates to the characterization and application of a novel high-temperature Argonaute protein TpsAgo.
背景技术Background technique
Argonaute(Ago)蛋白最早在一项描述拟南芥的突变体的研究中被提及。Ago蛋白是真核RNA干扰(RNAi)途径的关键参与者,可以在转录后调节基因表达,从而防御宿主入侵的RNA病毒并保护基因组完整性。目前已报道的Ago蛋白主要分为真核Ago(eAgo)和原核Ago(pAgo)两类。The Argonaute (Ago) protein was first mentioned in a study describing mutants of Arabidopsis thaliana. Ago proteins are key players in the eukaryotic RNA interference (RNAi) pathway, which can regulate gene expression post-transcriptionally, thereby defending against host-invading RNA viruses and protecting genome integrity. The reported Ago proteins are mainly divided into two categories: eukaryotic Ago (eAgo) and prokaryotic Ago (pAgo).
原核Ago可以结合单链的guide DNA或RNA,催化与guide互补配对的靶标DNA或RNA的剪切。与CRISPR/Cas系统不同,原核Ago在对靶标核酸链进行剪切时,不需要PAM序列,它可以结合与靶标核酸互补的guide DNA或RNA,在靶标的任意互补对应的位置进行剪切。Prokaryotic Ago can bind to single-stranded guide DNA or RNA, and catalyze the cleavage of target DNA or RNA that is complementary to the guide. Different from the CRISPR/Cas system, prokaryotic Ago does not require a PAM sequence when cutting the target nucleic acid strand. It can bind to the guide DNA or RNA complementary to the target nucleic acid and cut at any complementary position of the target.
来源于古菌的高温Ago蛋白可在70℃以上发挥剪切活性,目前已表征的高温Ago一般可以在高温条件下,结合单链guide DNA或RNA,剪切靶标单链DNA,少数也可以剪切靶标单链RNA。High-temperature Ago proteins derived from archaea can exert shearing activity above 70 °C. The high-temperature Agos that have been characterized so far can generally be combined with single-stranded guide DNA or RNA to shear target single-stranded DNA under high temperature conditions, and a few can also shear Cut the target single-stranded RNA.
近年来“液体活检”的概念正在兴起,其基本思想为运用血液等体液样本替代肿瘤组织样本行病理学、分子生物学的检测,通过检测患者体液样本(主要是血液)中的肿瘤循环DNA来获取肿瘤基因突变信息已经成为一种趋势。循环肿瘤DNA虽然是一种很好的肿瘤组织替代样本,但是,由于循环肿瘤DNA含量稀少,检测循环肿瘤DNA需要极灵敏的技术。目前的检测技术主要依赖于二代测序及数字PCR技术,但是他们均在灵敏度、运行成本方面具有一定局限性。In recent years, the concept of "liquid biopsy" is emerging. Obtaining tumor gene mutation information has become a trend. Although circulating tumor DNA is a good surrogate sample of tumor tissue, the detection of circulating tumor DNA requires extremely sensitive techniques due to the scarcity of circulating tumor DNA. The current detection technology mainly relies on next-generation sequencing and digital PCR technology, but they all have certain limitations in terms of sensitivity and operating cost.
因此,本领域迫切需要开发一种高特异性、高灵敏度的低丰度突变DNA的富集及检测方法。Therefore, there is an urgent need in the art to develop a method for enrichment and detection of low-abundance mutant DNA with high specificity and sensitivity.
发明内容SUMMARY OF THE INVENTION
本发明的目的就是提供一种高特异性、高灵敏度的低丰度突变DNA的富集及检测方法。The purpose of the present invention is to provide a high-specificity, high-sensitivity low-abundance mutant DNA enrichment and detection method.
在本发明的第一方面,提供了一种核酸切割体系,所述核酸切割体系包括:In a first aspect of the present invention, a nucleic acid cutting system is provided, the nucleic acid cutting system comprising:
(a)向导DNA(gDNA);(a) guide DNA (gDNA);
(b)可编程核酸内切酶Argonaute(Ago);和(b) the programmable endonuclease Argonaute (Ago); and
(c)任选的报告核酸,其中若所述报告核酸被剪切,所述的剪切是可以被检测出的。(c) An optional reporter nucleic acid, wherein the cleavage is detectable if the reporter nucleic acid is cleaved.
在另一优选例中,所述核酸切割体系的温度为65-90℃,较佳地为70-85℃,更佳地为75-85℃。In another preferred embodiment, the temperature of the nucleic acid cutting system is 65-90°C, preferably 70-85°C, more preferably 75-85°C.
在另一优选例中,所述的可编程核酸内切酶Argonaute选自下组:TtAgo、TpsAgo、SeAgo、RsAgo、KpAgo、hAgo1、hAgo2、MpAgo、TpAgo、PfAgo、MjAgo、MfAgo、NgAgo、LrAgo、AaAgo、CbAgo、CpAgo、IbAgo、KmAgo。In another preferred embodiment, the programmable endonuclease Argonaute is selected from the following group: TtAgo, TpsAgo, SeAgo, RsAgo, KpAgo, hAgo1, hAgo2, MpAgo, TpAgo, PfAgo, MjAgo, MfAgo, NgAgo, LrAgo, AaAgo, CbAgo, CpAgo, IbAgo, KmAgo.
在另一优选例中,所述的可编程核酸内切酶Argonaute来源于嗜热菌(Thermus parvatiensis),所述的可编程核酸内切酶Argonaute是可编程核酸内切酶TpsAgo。In another preferred embodiment, the programmable endonuclease Argonaute is derived from Thermus parvatiensis, and the programmable endonuclease Argonaute is a programmable endonuclease TpsAgo.
在另一优选例中,所述的TpsAgo包括野生型和突变型的TpsAgo。In another preferred embodiment, the TpsAgo includes wild-type and mutant TpsAgo.
在另一优选例中,所述野生型的可编程核酸内切酶TpsAgo的氨基酸序列如NCBI序列号WP_060384876.1所示。In another preferred example, the amino acid sequence of the wild-type programmable endonuclease TpsAgo is shown in NCBI sequence number WP_060384876.1.
在另一优选例中,所述的向导DNA的5’端具有选自下组的修饰:5’-P、5’-OH、5’-Biotin、5’-NH 2C 6、5’-FAM,或5’-SHC 6In another preferred embodiment, the 5' end of the guide DNA has a modification selected from the group consisting of 5'-P, 5'-OH, 5'-Biotin, 5'-NH 2 C 6 , 5'- FAM, or 5'-SHC 6 .
在另一优选例中,所述的向导DNA是5’端磷酸化或5’端羟基化的单链DNA分子。In another preferred embodiment, the guide DNA is a single-stranded DNA molecule with 5'-terminal phosphorylation or 5'-terminal hydroxylation.
在另一优选例中,所述的向导DNA是5’端磷酸化的单链DNA分子。In another preferred embodiment, the guide DNA is a single-stranded DNA molecule phosphorylated at the 5' end.
在另一优选例中,所述的向导DNA与所述报告核酸之间具有反向互补的片段。In another preferred embodiment, the guide DNA and the reporter nucleic acid have a reverse complementary fragment.
在另一优选例中,所述的向导DNA的长度为5-30nt,较佳地10-24nt,更佳地为16-24nt,更佳地为16-21nt。In another preferred embodiment, the length of the guide DNA is 5-30nt, preferably 10-24nt, more preferably 16-24nt, more preferably 16-21nt.
在另一优选例中,所述向导DNA是5’端磷酸化的单链DNA分子,其长度为16nt-21nt。In another preferred embodiment, the guide DNA is a single-stranded DNA molecule phosphorylated at the 5' end, and its length is 16nt-21nt.
在另一优选例中,所述向导DNA是5’端羟基化的单链DNA分子,其长度为16nt-24nt。In another preferred embodiment, the guide DNA is a 5'-terminal hydroxylated single-stranded DNA molecule with a length of 16nt-24nt.
在另一优选例中,所述向导DNA的5’端第一个核苷酸为磷酸化修饰的胸腺嘧啶(T)或鸟嘌呤(G)或腺嘌呤(A)或胞嘧啶(C)。In another preferred example, the first nucleotide at the 5' end of the guide DNA is phosphorylated thymine (T), guanine (G), adenine (A) or cytosine (C).
在另一优选例中,所述的报告核酸是单链核酸,包括单链DNA(ssDNA)或单链RNA(ssRNA)。In another preferred embodiment, the reporter nucleic acid is a single-stranded nucleic acid, including single-stranded DNA (ssDNA) or single-stranded RNA (ssRNA).
在另一优选例中,当所述的向导DNA是5’端磷酸化的单链DNA分子时,所述的报告核酸是单链DNA(ssDNA)或单链RNA(ssRNA)。In another preferred example, when the guide DNA is a single-stranded DNA molecule phosphorylated at the 5' end, the reporter nucleic acid is single-stranded DNA (ssDNA) or single-stranded RNA (ssRNA).
在另一优选例中,当所述的向导DNA是5’端羟基化的单链DNA分子时,所述的报告核酸是单链DNA。In another preferred embodiment, when the guide DNA is a single-stranded DNA molecule hydroxylated at the 5' end, the reporter nucleic acid is a single-stranded DNA.
在另一优选例中,当所述报告核酸被剪切,所述的剪切能够通过电泳法被检测出。In another preferred embodiment, when the reporter nucleic acid is cleaved, the cleavage can be detected by electrophoresis.
在另一优选例中,所述的电泳法是用16%的核酸Urea-PAG电泳检测法。In another preferred embodiment, the electrophoresis method is a 16% nucleic acid Urea-PAG electrophoresis detection method.
在另一优选例中,所述的报告核酸的长度为10-100nt,较佳地20-70nt,更佳地30-60nt,更佳地40-50nt,最佳地45nt。In another preferred embodiment, the length of the reporter nucleic acid is 10-100nt, preferably 20-70nt, more preferably 30-60nt, more preferably 40-50nt, and most preferably 45nt.
在另一优选例中,所述报告核酸是荧光报告核酸,所述荧光报告核酸带有荧光基团和/或淬灭基团。In another preferred example, the reporter nucleic acid is a fluorescent reporter nucleic acid, and the fluorescent reporter nucleic acid has a fluorescent group and/or a quenching group.
在另一优选例中,所述的荧光基团和淬灭基团各自独立地位于所述荧光报告核酸的5’端、3’端。In another preferred embodiment, the fluorescent group and the quenching group are independently located at the 5' end and the 3' end of the fluorescent reporter nucleic acid.
在另一优选例中,所述的荧光基团和淬灭基团分别位于所述荧光报告核酸与所述向导DNA的互补区域的两侧。In another preferred embodiment, the fluorescent group and the quenching group are located on both sides of the complementary regions of the fluorescent reporter nucleic acid and the guide DNA, respectively.
在另一优选例中,所述荧光基团包括:FAM、HEX、CY5、CY3、VIC、JOE、TET、5-TAMRA、ROX、Texas Red-X,或其组合。In another preferred example, the fluorescent group includes: FAM, HEX, CY5, CY3, VIC, JOE, TET, 5-TAMRA, ROX, Texas Red-X, or a combination thereof.
在另一优选例中,所述猝灭基团包括:BHQ、TAMRA、DABCYL、DDQ,或其组合。In another preferred example, the quenching group includes: BHQ, TAMRA, DABCYL, DDQ, or a combination thereof.
在另一优选例中,所述的荧光报告核酸是仅具有荧光基团的单链DNA分子,所述的荧光基团是FAM。In another preferred embodiment, the fluorescent reporter nucleic acid is a single-stranded DNA molecule with only a fluorescent group, and the fluorescent group is FAM.
在另一优选例中,所述的核酸切割体系还:(d)二价金属离子。In another preferred embodiment, the nucleic acid cutting system further: (d) divalent metal ions.
在另一优选例中,所述的二价金属离子为Mn 2+或Co 2+,优选地为Mn 2+In another preferred example, the divalent metal ion is Mn 2+ or Co 2+ , preferably Mn 2+ .
在另一优选例中,所述核酸切割体系中,二价金属离子的浓度为80μM-3mM,较佳地100μM-2mM,更佳地250μM-2mM。In another preferred embodiment, in the nucleic acid cleavage system, the concentration of divalent metal ions is 80 μM-3 mM, preferably 100 μM-2 mM, more preferably 250 μM-2 mM.
在另一优选例中,所述的核酸切割体系还包括:(e)缓冲液。In another preferred embodiment, the nucleic acid cutting system further comprises: (e) buffer.
在另一优选例中,所述缓冲液中,NaCl的浓度为0-4800mM,较佳地为40-1600mM,更佳地为80-800mM,更佳地为80-400mM。In another preferred embodiment, in the buffer, the concentration of NaCl is 0-4800 mM, preferably 40-1600 mM, more preferably 80-800 mM, more preferably 80-400 mM.
在另一优选例中,所述缓冲液的pH值为7-9,较佳地为8.0。In another preferred embodiment, the pH value of the buffer solution is 7-9, preferably 8.0.
在另一优选例中,当在所述gDNA与报告核酸的反向互补区域中,从5’端起第2、3、4、6、8、10、11、13、14或16位(较佳地为第2、4、10、11、13或14位,更佳地为第2、10或13位)中的任一个的碱基存在与所述报告核酸的错配时,会显著降低所述可编程核酸内切酶Argonaute的剪切率。In another preferred embodiment, when in the reverse complementary region of the gDNA and the reporter nucleic acid, the 2nd, 3rd, 4th, 6th, 8th, 10th, 11th, 13th, 14th or 16th position from the 5' end (more It is preferably the 2nd, 4th, 10th, 11th, 13th or 14th position, more preferably the 2nd, 10th or 13th position) when there is a mismatch with the reporter nucleic acid, it will be significantly reduced. The shear rate of the programmable endonuclease Argonaute.
在另一优选例中,所述的显著降低所述可编程核酸内切酶Argonaute的剪切率是指:相同反应条件下,所述可编程核酸内切酶Argonaute的剪切率降低≥80%,较佳地降低≥85%,更佳地降低≥90%。In another preferred example, the significantly reducing the cleavage rate of the programmable endonuclease Argonaute refers to: under the same reaction conditions, the cleavage rate of the programmable endonuclease Argonaute is reduced by ≥80% , preferably by ≥85%, more preferably by ≥90%.
在另一优选例中,所述的向导DNA与所述的荧光报告核酸的序列互补结合后,引导所述TeAgo酶对所述荧光报告核酸进行切割,从而产生可检测的信号(如荧光)。In another preferred embodiment, after the guide DNA is complementary to the sequence of the fluorescent reporter nucleic acid, the TeAgo enzyme is guided to cleave the fluorescent reporter nucleic acid, thereby generating a detectable signal (eg, fluorescence).
在另一优选例中,所述的核酸切割体系中,所述报告核酸的浓度为0.4μM-4μM,较佳地0.6μM-2μM,更佳地0.8μM-1μM,最佳地为0.8μM。In another preferred embodiment, in the nucleic acid cutting system, the concentration of the reporter nucleic acid is 0.4 μM-4 μM, preferably 0.6 μM-2 μM, more preferably 0.8 μM-1 μM, and most preferably 0.8 μM.
在另一优选例中,所述的核酸切割体系中,所述可编程核酸内切酶Ago的浓度为10nM-10μM,较佳地50nM-1μM,更佳地100nM-500nM,最佳地为200nM。In another preferred embodiment, in the nucleic acid cutting system, the concentration of the programmable endonuclease Ago is 10nM-10μM, preferably 50nM-1μM, more preferably 100nM-500nM, and most preferably 200nM .
在另一优选例中,所述的核酸切割体系中,所述向导DNA的浓度为10nM-10μM,较佳地100nM-3μM,更佳地1μM-2.5μM,最佳地为2μM。In another preferred embodiment, in the nucleic acid cutting system, the concentration of the guide DNA is 10 nM-10 μM, preferably 100 nM-3 μM, more preferably 1 μM-2.5 μM, and most preferably 2 μM.
在另一优选例中,所述的报告核酸是质粒DNA,并且所述的向导DNA(gDNA)包括正向gDNA和反向gDNA;In another preferred embodiment, the reporter nucleic acid is plasmid DNA, and the guide DNA (gDNA) includes forward gDNA and reverse gDNA;
其中,所述的正向gDNA与所述质粒DNA的一条链可形成第一反向互补区,所述反向gDNA与所述质粒DNA的另一条链可形成第二反向互补区。Wherein, the forward gDNA and one strand of the plasmid DNA can form a first reverse complementary region, and the reverse gDNA and the other strand of the plasmid DNA can form a second reverse complementary region.
在另一优选例中,在所述质粒DNA中,所述第一反向互补区与所述第二反向互补区的距离为≤200bp,较佳地为≤50bp,更佳地为≤1bp。In another preferred example, in the plasmid DNA, the distance between the first reverse complementary region and the second reverse complementary region is ≤200bp, preferably ≤50bp, more preferably ≤1bp .
在另一优选例中,所述的质粒DNA为质粒pUC19。In another preferred embodiment, the plasmid DNA is plasmid pUC19.
在另一优选例中,所述的质粒DNA是超螺旋状态的。In another preferred embodiment, the plasmid DNA is in a supercoiled state.
在另一优选例中,所述的质粒DNA是超螺旋状态的,并且所述质粒DNA的GC含量(剪切位点附近80bp的GC含量)为20%-35%,较佳地25%-32%,更佳地为29%。In another preferred example, the plasmid DNA is in a supercoiled state, and the GC content of the plasmid DNA (80bp GC content near the cleavage site) is 20%-35%, preferably 25%- 32%, more preferably 29%.
在另一优选例中,所述的质粒DNA是超螺旋状态的,并且所述质粒DNA的GC含量(剪切位点附近80bp的GC含量)为55%-75%,较佳地62%-68%,更佳地为65%。In another preferred example, the plasmid DNA is in a supercoiled state, and the GC content of the plasmid DNA (the GC content of 80 bp near the cleavage site) is 55%-75%, preferably 62%- 68%, more preferably 65%.
在本发明的第二方面,提供了一种富集低丰度目标核酸的反应体系,所述反应体系用于对一核酸样本同时进行依赖解旋酶等温扩增技术(Helicase-dependent isothermal DNA amplification)和核酸切割反应,从而获得扩增-切割反应产物;In a second aspect of the present invention, a reaction system for enriching low-abundance target nucleic acid is provided, and the reaction system is used to simultaneously perform Helicase-dependent isothermal DNA amplification on a nucleic acid sample ) and nucleic acid cleavage reaction, thereby obtaining amplification-cleavage reaction product;
其中,所述的核酸样本含有第一核酸和第二核酸,其中,所述的第一核酸为所述目标核酸,而所述的第二核酸为非目标核酸;Wherein, the nucleic acid sample contains a first nucleic acid and a second nucleic acid, wherein the first nucleic acid is the target nucleic acid, and the second nucleic acid is a non-target nucleic acid;
所述的核酸切割反应用于特异性切割非目标核酸,但不切割所述目的核酸;The nucleic acid cleavage reaction is used for specifically cleaving non-target nucleic acid, but not cleaving the target nucleic acid;
所述的扩增-切割反应体系含有(i)进行等温扩增反应所需的试剂和(ii)如本发明第一方面所述的核酸切割体系。The amplification-cleavage reaction system contains (i) reagents required for isothermal amplification reaction and (ii) the nucleic acid cleavage system according to the first aspect of the present invention.
在另一优选例中,所述反应体系中,所述可编程核酸内切酶Argonaute(Ago)的浓度为20-200nM,较佳地为30-150nM,更佳地为40-100nM,最佳地为50nM。In another preferred example, in the reaction system, the concentration of the programmable endonuclease Argonaute (Ago) is 20-200nM, preferably 30-150nM, more preferably 40-100nM, the best The ground is 50 nM.
在另一优选例中,所述的进行等温扩增反应所需的试剂包括tHDA试剂盒(购自New England Biolabs公司)。In another preferred embodiment, the reagents required for the isothermal amplification reaction include tHDA kit (purchased from New England Biolabs).
在另一优选例中,所述的进行等温扩增反应所需的试剂还包括目标核酸的扩增引物对。In another preferred embodiment, the reagents required for the isothermal amplification reaction further include a pair of amplification primers for the target nucleic acid.
在另一优选例中,所述的目标核酸的扩增引物对中的各引物的浓度为10-300nM,较佳地为50-200nM,更佳地为100nM。In another preferred embodiment, the concentration of each primer in the amplification primer pair of the target nucleic acid is 10-300 nM, preferably 50-200 nM, more preferably 100 nM.
在另一优选例中,所述目标核酸的浓度为0.1-100nM,较佳地为0.5-50nM,更佳地为1nM。In another preferred example, the concentration of the target nucleic acid is 0.1-100 nM, preferably 0.5-50 nM, more preferably 1 nM.
在另一优选例中,所述gDNA包括正向gDNA和反向gDNA;In another preferred embodiment, the gDNA includes forward gDNA and reverse gDNA;
其中,所述正向gDNA是指与目标核酸具有相同序列片段的gDNA,所述反向gDNA是指与目标核酸具有反向互补序列片段的gDNA。Wherein, the forward gDNA refers to the gDNA having the same sequence fragment as the target nucleic acid, and the reverse gDNA refers to the gDNA having the reverse complementary sequence fragment to the target nucleic acid.
在另一优选例中,所述反应体系中还包括:二价金属离子。In another preferred embodiment, the reaction system further includes: divalent metal ions.
在另一优选例中,所述的二价金属离子为Mn 2+In another preferred example, the divalent metal ion is Mn 2+ .
在另一优选例中,所述反应体系中,二价金属离子的浓度为50μM-2000μM,较佳地100μM-1000μM,更佳地为250μM。In another preferred example, in the reaction system, the concentration of divalent metal ions is 50 μM-2000 μM, preferably 100 μM-1000 μM, more preferably 250 μM.
在另一优选例中,所述反应体系的反应温度(反应程序)为:65℃,1.5-2h。In another preferred example, the reaction temperature (reaction procedure) of the reaction system is: 65° C., 1.5-2 h.
在另一优选例中,所述的目标核酸选自下组:野生型EGFR序列片段、EGFR L861Q突变型序列片段。In another preferred embodiment, the target nucleic acid is selected from the following group: wild-type EGFR sequence fragment, EGFR L861Q mutant sequence fragment.
在另一优选例中,所述的野生型EGFR序列片段的核苷酸序列如SEQ ID NO:5 所示。In another preferred embodiment, the nucleotide sequence of the wild-type EGFR sequence fragment is shown in SEQ ID NO:5.
在另一优选例中,所述EGFR L861Q突变型序列片段的核苷酸序列如SEQ ID NO:6所示。In another preferred embodiment, the nucleotide sequence of the EGFR L861Q mutant sequence fragment is shown in SEQ ID NO:6.
在另一优选例中,所述的正向gDNA和反向gDNA的核苷酸序列分别如SEQ ID NO:7和8所示。In another preferred embodiment, the nucleotide sequences of the forward gDNA and the reverse gDNA are shown in SEQ ID NOs: 7 and 8, respectively.
在另一优选例中,所述EGFR L861Q突变型序列片段的扩增引物对的核苷酸序列分别如SEQ ID NO:9和10所示。In another preferred embodiment, the nucleotide sequences of the amplification primer pair of the EGFR L861Q mutant sequence fragment are shown in SEQ ID NOs: 9 and 10, respectively.
在本发明的第三方面,提供了一种富集低丰度目标核酸的方法,包括步骤:In a third aspect of the present invention, a method for enriching low-abundance target nucleic acid is provided, comprising the steps of:
(a)提供一核酸样本,所述的核酸样本含有第一核酸和第二核酸,其中,所述的第一核酸为所述目标核酸,而所述的第二核酸为非目标核酸,(a) providing a nucleic acid sample, the nucleic acid sample contains a first nucleic acid and a second nucleic acid, wherein the first nucleic acid is the target nucleic acid, and the second nucleic acid is a non-target nucleic acid,
并且,所述目标核酸在所述的核酸样本中的丰度为F1a;And, the abundance of the target nucleic acid in the nucleic acid sample is F1a;
(b)对所述核酸样本中的核酸为模板,在扩增-切割反应体系中进行等温扩增和核酸切割反应,从而获得扩增-切割反应产物;(b) using the nucleic acid in the nucleic acid sample as a template, performing isothermal amplification and nucleic acid cleavage reaction in an amplification-cleavage reaction system, thereby obtaining an amplification-cleavage reaction product;
其中,所述的核酸切割反应用于特异性切割非目标核酸,但不切割所述目的核酸;Wherein, the nucleic acid cleavage reaction is used for specifically cleaving non-target nucleic acid, but not cleaving the target nucleic acid;
并且,所述的扩增-切割反应体系含有(i)进行等温扩增反应所需的试剂和(ii)如本发明第一方面所述的核酸切割体系;And, the amplification-cleavage reaction system contains (i) reagents required for isothermal amplification reaction and (ii) the nucleic acid cleavage system according to the first aspect of the present invention;
其中,所述目标核酸在所述的扩增-切割反应产物中的丰度为F1b,Wherein, the abundance of the target nucleic acid in the amplification-cleavage reaction product is F1b,
其中,F1b/F1a的比值≥5(较佳地为≥10)。Wherein, the ratio of F1b/F1a is ≥5 (preferably ≥10).
在另一优选例中,所述的目标核酸和非目标核酸仅相差一个碱基。In another preferred embodiment, the target nucleic acid and the non-target nucleic acid differ by only one base.
在另一优选例中,当1%≤F1a≤10%时,F1b/F1a的比值≥10,当0.1%≤F1a≤0.5%时,F1b/F1a的比值≥100,当F1a≤0.1%时,F1b/F1a的比值≥200。In another preferred example, when 1%≤F1a≤10%, the ratio of F1b/F1a≥10, when 0.1%≤F1a≤0.5%, the ratio of F1b/F1a≥100, when F1a≤0.1%, The ratio of F1b/F1a is ≥200.
在另一优选例中,所述的核酸样本包括直接加热裂解的核酸样本、直接裂解酶蛋白酶处理的核酸样本、经过抽提的核酸样本、经PCR预扩增的核酸样本或任意含核酸的样品。In another preferred embodiment, the nucleic acid sample includes a nucleic acid sample that is directly thermally lysed, a nucleic acid sample that is directly lysed by protease, an extracted nucleic acid sample, a nucleic acid sample that has been pre-amplified by PCR, or any nucleic acid-containing sample .
在另一优选例中,所述的目标核酸为含突变的核苷酸序列。In another preferred embodiment, the target nucleic acid is a nucleotide sequence containing mutations.
在另一优选例中,所述的突变选自下组:核苷酸的插入、缺失、取代、或其组合。In another preferred embodiment, the mutation is selected from the group consisting of nucleotide insertion, deletion, substitution, or a combination thereof.
在另一优选例中,所述的非目标核酸(或第二核酸)为野生型核苷酸序列、高丰度的核苷酸序列、或其组合。In another preferred embodiment, the non-target nucleic acid (or the second nucleic acid) is a wild-type nucleotide sequence, a high-abundance nucleotide sequence, or a combination thereof.
在另一优选例中,所述的非目标核酸在所述的核酸样本中的丰度为F2a。In another preferred embodiment, the abundance of the non-target nucleic acid in the nucleic acid sample is F2a.
在另一优选例中,F1a+F2a=100%。In another preferred example, F1a+F2a=100%.
在另一优选例中,所述的F2a/F1a的比值≥20,较佳地≥50,更佳地≥100,最佳地≥1000或≥5000。In another preferred embodiment, the ratio of F2a/F1a is ≥20, preferably ≥50, more preferably ≥100, and most preferably ≥1000 or ≥5000.
在另一优选例中,所述的非目标核酸在所述的扩增-切割反应产物中的丰度为F2b。In another preferred embodiment, the abundance of the non-target nucleic acid in the amplification-cleavage reaction product is F2b.
在另一优选例中,F1b+F2b=100%。In another preferred example, F1b+F2b=100%.
在另一优选例中,所述的F1b/F2b≥0.5,较佳地≥1,更佳地≥2,最佳地≥3或≥5。In another preferred example, the F1b/F2b ≥ 0.5, preferably ≥ 1, more preferably ≥ 2, most preferably ≥ 3 or ≥ 5.
在另一优选例中,所述的F1b/F1a的比值≥200,较佳地≥500,更佳地≥1000,最佳地≥2000或≥5000或更高。In another preferred embodiment, the ratio of F1b/F1a is ≥200, preferably ≥500, more preferably ≥1000, most preferably ≥2000 or ≥5000 or higher.
在另一优选例中,F1a≤0.5%,较佳地≤0.2%,更佳地≤0.1%,最佳地≤0.01%。In another preferred embodiment, F1a≤0.5%, preferably≤0.2%, more preferably≤0.1%, most preferably≤0.01%.
在另一优选例中,F1b≥10%,较佳地≥30%,更佳地≥50%,最佳地≤70%。In another preferred example, F1b≥10%, preferably ≥30%, more preferably ≥50%, and most preferably ≤70%.
在另一优选例中,所述的“进行等温扩增反应所需的试剂”包括:解旋酶、DNA聚合酶。In another preferred embodiment, the "reagents required for isothermal amplification reaction" include: helicase and DNA polymerase.
在另一优选例中,所述的“进行等温扩增反应所需的试剂”还包括:dNTP、Mg 2+、等温扩增缓冲液。 In another preferred example, the "reagents required for isothermal amplification reaction" further include: dNTP, Mg 2+ , and isothermal amplification buffer.
在另一优选例中,所述核酸切割体系中的gDNA与目标核酸(即第一核酸)的靶定区域的核酸序列形成第一互补结合区;并且所述核酸切割体系中的gDNA还与非目标核酸(即第二核酸)的靶定区域的核酸序列形成第二互补结合区。In another preferred embodiment, the gDNA in the nucleic acid cleavage system forms a first complementary binding region with the nucleic acid sequence of the target region of the target nucleic acid (ie, the first nucleic acid); and the gDNA in the nucleic acid cleavage system also interacts with non- The nucleic acid sequence of the targeting region of the target nucleic acid (ie, the second nucleic acid) forms the second complementary binding region.
在另一优选例中,在第一互补结合区中含有至少2个不匹配的碱基对。In another preferred embodiment, the first complementary binding region contains at least two unmatched base pairs.
在另一优选例中,在第二互补结合区中含有0或1个不匹配的碱基对。In another preferred embodiment, the second complementary binding region contains 0 or 1 unmatched base pair.
在另一优选例中,在第二互补结合区中含有1个不匹配的碱基对。In another preferred embodiment, the second complementary binding region contains one unmatched base pair.
在另一优选例中,在第一互补结合区中含有至少2个不匹配的碱基对,从而导致所述复合物不切割所述目标核酸;而在第二互补结合区中含有1个不匹配的碱基对,从而导致所述复合物切割所述非目标核酸。In another preferred embodiment, the first complementary binding region contains at least two unmatched base pairs, so that the complex does not cleave the target nucleic acid; and the second complementary binding region contains one unmatched base pair. matched base pairs, causing the complex to cleave the non-target nucleic acid.
在另一优选例中,目标核酸(即第一核酸)的靶定区域与非目标核酸(即第二核 酸)的靶定区域是相对应的。In another preferred embodiment, the targeting region of the target nucleic acid (i.e. the first nucleic acid) corresponds to the targeting region of the non-target nucleic acid (i.e. the second nucleic acid).
在另一优选例中,在扩增-切割反应体系中,作为模板的核酸的数量为0.1-100nM。In another preferred example, in the amplification-cleavage reaction system, the amount of nucleic acid used as a template is 0.1-100 nM.
在另一优选例中,所述方法还包括:In another preferred embodiment, the method further includes:
(c)对所述扩增-切割反应产物进行检测,从而测定所述目标核酸的存在与否和/或数量。(c) detecting the amplification-cleavage reaction product to determine the presence and/or amount of the target nucleic acid.
在另一优选例中,步骤(c)中的检测包括定量检测、定性检测、或其组合。In another preferred embodiment, the detection in step (c) includes quantitative detection, qualitative detection, or a combination thereof.
在另一优选例中,所述的定量检测选自下组:TaqMan荧光定量PCR、桑格测序、q-PCR、ddPCR、化学发光法、高分辨率熔解曲线法、NGS等;优选地为选自TaqMan荧光定量PCR、桑格测序。In another preferred embodiment, the quantitative detection is selected from the following group: TaqMan fluorescence quantitative PCR, Sanger sequencing, q-PCR, ddPCR, chemiluminescence, high-resolution melting curve method, NGS, etc.; From TaqMan real-time PCR, Sanger sequencing.
在另一优选例中,所述的第一核酸包括n种不同的核酸序列,其中n为≥1的正整数。In another preferred embodiment, the first nucleic acid includes n different nucleic acid sequences, wherein n is a positive integer ≥1.
在另一优选例中,n为1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40、41、42、43、44、45、46、47、48、49、50、51、52、53、54、55、56、57、58、59、60、61、62、63、64、65、66、67、68、69、70、71、72、73、74、75、76、77、78、79、80、81、82、83、84、85、86、87、88、89、90、91、92、93、94、95、96、97、98、99、100或更大。In another preferred example, n is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 , 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46 , 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71 , 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96 , 97, 98, 99, 100 or greater.
在另一优选例中,n为2-1000,较佳地3-100,更佳地3-50。In another preferred embodiment, n is 2-1000, preferably 3-100, more preferably 3-50.
在另一优选例中,所述方法是非诊断性和非治疗性的。In another preferred embodiment, the method is non-diagnostic and non-therapeutic.
在另一优选例中,所述的核酸样本包括来自试样的核酸,其中所述试样选自下组:血液、细胞、血清、唾液、体液、血浆、尿液、前列腺液、支气管灌洗液、脑脊液、胃液、胆汁、淋巴液、腹腔液及粪便等或其组合。In another preferred embodiment, the nucleic acid sample includes nucleic acid from a sample, wherein the sample is selected from the group consisting of blood, cells, serum, saliva, body fluid, plasma, urine, prostatic fluid, bronchial lavage fluid, cerebrospinal fluid, gastric fluid, bile, lymph fluid, peritoneal fluid, feces, etc., or a combination thereof.
在另一优选例中,所述低丰度目标核酸选自下组:野生型EGFR序列片段、EGFR L861Q突变型序列片段。In another preferred embodiment, the low-abundance target nucleic acid is selected from the group consisting of wild-type EGFR sequence fragments and EGFR L861Q mutant sequence fragments.
在本发明的第四方面,提供了一种用于检测靶标核酸分子的试剂盒,所述试剂盒包括:In a fourth aspect of the present invention, there is provided a kit for detecting target nucleic acid molecules, the kit comprising:
(i)如本发明第二方面所述的富集低丰度目标核酸反应体系或用于配制所述 反应体系的试剂;(i) the enrichment low-abundance target nucleic acid reaction system according to the second aspect of the present invention or a reagent for preparing the reaction system;
(ii)用于检测低丰度目标核酸的检测试剂;和(ii) detection reagents for detecting low-abundance target nucleic acids; and
(ii)使用说明书,所述说明书描述了如本发明第三方面所述的方法。(ii) Instructions for use describing the method according to the third aspect of the invention.
在另一优选例中,所述的试剂盒包括:In another preferred embodiment, the kit includes:
(a)第一容器以及位于所述第一容器的向导DNA;(a) a first container and guide DNA located in said first container;
(b)第二容器以及位于第二容器的可编程核酸内切酶Argonaute(Ago);和(b) a second container and a programmable endonuclease Argonaute (Ago) located in the second container; and
(c)第三容器以及位于第三容器的核酸扩增反应试剂。(c) The third container and the nucleic acid amplification reaction reagent located in the third container.
在另一优选例中,所述的试剂盒还含有:In another preferred embodiment, the kit also contains:
(d)第四容器以及位于第四容器的低丰度目标核酸的检测试剂。(d) a fourth container and a detection reagent for low-abundance target nucleic acid located in the fourth container.
在另一优选例中,所述的低丰度目标核酸的检测试剂包括:引物、探针等。In another preferred example, the low-abundance target nucleic acid detection reagents include: primers, probes, and the like.
在另一优选例中,所述的低丰度目标核酸的检测试剂包括:TaqMan荧光定量PCR所需的引物和探针,或桑格测序所需试剂。In another preferred embodiment, the low-abundance target nucleic acid detection reagents include: primers and probes required for TaqMan fluorescence quantitative PCR, or reagents required for Sanger sequencing.
在另一优选例中,所述的低丰度目标核酸的检测试剂包括桑格测序所需试剂。In another preferred embodiment, the low-abundance target nucleic acid detection reagent includes the reagent required for Sanger sequencing.
在另一优选例中,所述的试剂盒还含有:In another preferred embodiment, the kit also contains:
(e)第五容器以及位于第五容器的二价金属离子。(e) A fifth container and a divalent metal ion located in the fifth container.
在另一优选例中,所述的试剂盒还包括:In another preferred embodiment, the kit also includes:
(f)第六容器以及位于第六容器的缓冲液。(f) A sixth container and buffer located in the sixth container.
在另一优选例中,所述第一容器、第二容器、第三容器、第四容器、第五容器和第六容器可以是相同或不同的容器。In another preferred embodiment, the first container, the second container, the third container, the fourth container, the fifth container and the sixth container may be the same or different containers.
在本发明的第五方面,提供了一种可编程核酸内切酶Argonaute的用途,用于制备检测靶标分子的试剂或试剂盒,或用于制备检测低丰度目标核酸的试剂或试剂盒。In a fifth aspect of the present invention, use of a programmable endonuclease Argonaute is provided, for preparing a reagent or kit for detecting target molecules, or for preparing a reagent or kit for detecting low-abundance target nucleic acid.
在另一优选例中,所述可编程核酸内切酶Argonaute来源于嗜热菌(Thermus parvatiensis);或是其具备相同或相似功能的同源类似物。In another preferred embodiment, the programmable endonuclease Argonaute is derived from Thermus parvatiensis; or a homologous analog thereof with the same or similar functions.
在另一优选例中,所述的TpsAgo包括野生型和突变型的TpsAgo。In another preferred embodiment, the TpsAgo includes wild-type and mutant TpsAgo.
在另一优选例中,所述的可编程核酸内切酶Argonaute具有选自下组的氨基酸序列:In another preferred embodiment, the programmable endonuclease Argonaute has an amino acid sequence selected from the group consisting of:
(i)如NCBI序列号WP_060384876.1所示的氨基酸序列;和(i) the amino acid sequence as set forth in NCBI SEQ ID NO: WP_060384876.1; and
(ii)在如NCBI序列号WP_060384876.1所示序列的基础上,进行一个或多个氨 基酸残基的替换、缺失、改变或插入,或在其N端或C端添加1至10个氨基酸残基(较佳地1至5个氨基酸残基,更佳地1至3个氨基酸残基),从而获得的氨基酸序列;并且所述获得的氨基酸序列与如NCBI序列号WP_060384876.1所示序列具有≥85%(优选地≥90%,更优选地≥95%,例如≥96%、≥97%、≥98%或≥99%)的序列同一性;并且所获得的氨基酸序列具备与(i)相同或相似的功能。(ii) on the basis of the sequence shown in NCBI sequence number WP_060384876.1, one or more amino acid residues are replaced, deleted, changed or inserted, or 1 to 10 amino acid residues are added to its N-terminus or C-terminus. base (preferably 1 to 5 amino acid residues, more preferably 1 to 3 amino acid residues), thereby the obtained amino acid sequence; and the obtained amino acid sequence has the same sequence as shown in NCBI serial number WP_060384876.1 ≥ 85% (preferably ≥ 90%, more preferably ≥ 95%, eg ≥ 96%, ≥ 97%, ≥ 98% or ≥ 99%) sequence identity; and the amino acid sequence obtained possesses the same as (i) same or similar functionality.
本发明的目的是提供一种具有多重引导链和底物链剪切偏好性的新型高温核酸酶。The purpose of the present invention is to provide a novel high-temperature nuclease with multiple guide strand and substrate strand cleavage preferences.
为了实现上述目的,本发明提供了一种高温Argonaute——TpsAgo的蛋白,该蛋白具有高温核酸酶活性,其是以一株分离自印度北部热泉水(90-98℃)的嗜热菌Thermus parvatiensis RL为出发菌株,为目前报道的分子量较小的高温Ago蛋白。In order to achieve the above object, the present invention provides a high-temperature Argonaute-TpsAgo protein, which has high-temperature nuclease activity, and is derived from a strain of thermophilic bacteria Thermus parvatiensis isolated from hot spring water (90-98°C) in northern India RL is the starting strain, which is a high-temperature Ago protein with a small molecular weight reported so far.
另一方面,本发明提供了一种高温Argonaute,TpsAgo蛋白的基因,该基因编码如上所述的高温核酸酶TpsAgo的蛋白。In another aspect, the present invention provides a high-temperature Argonaute, a gene of TpsAgo protein, which encodes the above-mentioned high-temperature nuclease TpsAgo protein.
本发明通过对TpsAgo蛋白的基因进行挖掘、序列比对后,构建了重组质粒pET28a—TpsAgo,该重组质粒转化大肠杆菌(DE3),实现了TpsAgo的异源表达,后经Ni-NTA重力柱纯化得到了重组菌株所产Ago蛋白。In the present invention, the recombinant plasmid pET28a-TpsAgo is constructed after the gene of TpsAgo protein is excavated and sequence compared. The recombinant plasmid is transformed into Escherichia coli (DE3) to realize the heterologous expression of TpsAgo, and then purified by Ni-NTA gravity column The Ago protein produced by the recombinant strain was obtained.
本发明所得到的新型高温Ago蛋白分子量约为76kDa,该酶可同时利用5′-磷酸化的gDNA和5′-羟基化的gDNA介导单链DNA靶标核酸的剪切,也可以利用5′-磷酸化的gDNA介导单链RNA靶标核酸的剪切。最适反应温度范围在65℃-90℃之间;可利用Mn 2+、Co 2+作为活性离子,100μM Mn 2+可使其保持较高活性;该酶对NaCl浓度具有一定耐受性,耐受范围在0-3200mM之间;该酶可利用16nt-21nt的5’-P gDNA以及16nt-24nt的5’-OH gDNA;该酶对gDNA 5’末端第一个碱基具有偏好性,由剪切动力学结果可以看出,更偏好第一个羟基是T和G的gDNA;该酶可利用多种5’末端修饰的gDNA,如5’-P、5’-OH、5’-Biotin、5’-NH 2C 6、5’-FAM、5-SHC 6如;该酶对target与gDNA间的单点错配具有较低的容忍度,这将在SNV基因检测中有一定的应用前景。该酶除了可以剪切单链DNA,还可以剪切质粒DNA,可以在pUC19质粒的29%GC含量和65%GC含量的位置,将超螺旋DNA剪切为线性DNA。该酶具有良好的单点错配及双点错配区分性,可结合等温扩增技术,通过“边扩增边剪切”的偶联反应,对EGFR L861Q等突变基因进行富集并检测。 The molecular weight of the novel high-temperature Ago protein obtained by the invention is about 76kDa, and the enzyme can simultaneously use 5'-phosphorylated gDNA and 5'-hydroxylated gDNA to mediate the cleavage of single-stranded DNA target nucleic acid, and can also use 5'-phosphorylated gDNA and 5'-hydroxylated gDNA to mediate the cleavage of single-stranded DNA target nucleic acid. - Phosphorylated gDNA mediates cleavage of single-stranded RNA target nucleic acids. The optimum reaction temperature range is between 65℃-90℃; Mn 2+ and Co 2+ can be used as active ions, and 100μM Mn 2+ can keep it highly active; the enzyme has a certain tolerance to NaCl concentration, The tolerance range is between 0-3200mM; the enzyme can utilize 16nt-21nt 5'-P gDNA and 16nt-24nt 5'-OH gDNA; the enzyme has a preference for the first base at the 5' end of gDNA, From the shear kinetics results, it can be seen that gDNA whose first hydroxyl group is T and G is preferred; the enzyme can utilize a variety of 5' end modified gDNA, such as 5'-P, 5'-OH, 5'- Biotin, 5'-NH 2 C 6 , 5'-FAM, 5-SHC 6 such as; the enzyme has a low tolerance for single-point mismatch between target and gDNA, which will have a certain degree of SNV gene detection application prospects. The enzyme can cut single-stranded DNA as well as plasmid DNA, and can cut supercoiled DNA into linear DNA at the positions of 29% GC content and 65% GC content of pUC19 plasmid. The enzyme has good discrimination between single-point mismatch and double-point mismatch, and can be combined with isothermal amplification technology to enrich and detect mutant genes such as EGFR L861Q through the coupling reaction of "amplification and shearing".
应理解,在本发明范围内中,本发明的上述各技术特征和在下文(如实施例)中具体描述的各技术特征之间都可以互相组合,从而构成新的或优选的技术方案。限于篇幅,在此不再一一累述。It should be understood that within the scope of the present invention, the above-mentioned technical features of the present invention and the technical features specifically described in the following (eg, the embodiments) can be combined with each other to form new or preferred technical solutions. Due to space limitations, it is not repeated here.
附图说明Description of drawings
图1显示了TpsAgo的系统进化分析(A)以及多重序列比对(B)的结果。Figure 1 shows the results of phylogenetic analysis (A) and multiple sequence alignment (B) of TpsAgo.
图2显示了SDS-PAGE电泳分析TpsAgo蛋白的结果。其中,由左至右的泳道分别为蛋白Marker、菌体裂解上清、菌体裂解沉淀、纯化的TpsAgo。Figure 2 shows the results of SDS-PAGE electrophoresis analysis of TpsAgo protein. Among them, the lanes from left to right are protein Marker, cell lysis supernatant, cell lysis precipitation, and purified TpsAgo.
图3显示了TpsAgo的剪切活性的测定结果。Figure 3 shows the results of the assay of the cleavage activity of TpsAgo.
图4显示了5’-磷酸化gDNA(A)和5’-羟基化gDNA(B)的长度对TpsAgo剪切活性的影响。Figure 4 shows the effect of length of 5'-phosphorylated gDNA (A) and 5'-hydroxylated gDNA (B) on TpsAgo cleavage activity.
图5显示了TpsAgo反应所需的最适温度范围的结果图。Figure 5 shows a graph of the results of the optimum temperature range required for the TpsAgo reaction.
图6显示了二价金属离子类型(A)及浓度(B)对TpsAgo剪切活性的影响的结果。Figure 6 shows the results of the effect of divalent metal ion type (A) and concentration (B) on TpsAgo cleavage activity.
图7显示了TpsAgo所能耐受的NaCl温度范围的结果。Figure 7 shows the results for the NaCl temperature range that TpsAgo can tolerate.
图8显示了TpsAgo对gDNA 5’碱基末端第一个碱基的偏好性的结果。Figure 8 shows the results of the preference of TpsAgo for the first base at the 5' base end of gDNA.
图9显示了TpsAgo对gDNA 5’修饰的高容忍度的结果。Figure 9 shows the results of TpsAgo's high tolerance for gDNA 5' modification.
图10显示了TpsAgo针对gDNA与Target之间不同位点的单点错配可以区分剪切的结果。Figure 10 shows the results that TpsAgo can differentiate cleavage against single-point mismatches at different sites between gDNA and Target.
图11显示了TpsAgo剪切pUC19质粒29%GC(A、B)和65%GC(C、D)含量位置的结果。Figure 11 shows the results of cleavage of pUC19 plasmids at 29% GC (A, B) and 65% GC (C, D) content positions by TpsAgo.
其中,OC代表开环质粒(质粒一条链断开);LIN代表线性化质粒(质粒双链断开);SC代表超螺旋质粒。Among them, OC stands for open-circle plasmid (one strand of plasmid is broken); LIN stands for linearized plasmid (double strand of plasmid is broken); SC stands for supercoiled plasmid.
图12显示了TpsAgo对SNV突变基因富集的正反向gDNAs设计原理。Figure 12 shows the design principle of forward and reverse gDNAs for TpsAgo enrichment of SNV mutant genes.
图13显示了TpsAgo结合等温扩增技术对EGFR L861Q SNV基因的富集测序结果(A表示不加TpsAgo,B表示加入50nM TpsAgo)。Figure 13 shows the enrichment sequencing results of TpsAgo combined with isothermal amplification technology for the EGFR L861Q SNV gene (A means no TpsAgo added, B means 50nM TpsAgo added).
具体实施方式Detailed ways
本发明人经过广泛而深入的研究,经过大量的筛选,首次开发了一种灵敏度高、特异性好、通量高的低丰度突变DNA的富集及检测方法。具体地,发明人通过体外表达和纯化分离获得了核酸酶TpsAgo,并且通过大量的摸索实验, 获得了其最优的反应参数,从而提供了一种基于TpsAgo的富集低丰度目标核酸的方法和相应的检测方法。本发明具有非侵入性、易操作、快速等优势,能更好地进行人液态活检中低丰度突变基因的检测,本发明技术可广泛应用于涉及核酸检测的分子诊断各个领域,如肿瘤液态活检,感染性疾病如重大传染性和病原体感染性疾病(病毒、病原菌)检测领域等领域。在此基础上完成了本发明。After extensive and in-depth research and extensive screening, the present inventors have developed for the first time a method for enrichment and detection of low-abundance mutant DNA with high sensitivity, good specificity and high throughput. Specifically, the inventors obtained the nuclease TpsAgo through in vitro expression, purification and separation, and obtained its optimal reaction parameters through a large number of groping experiments, thereby providing a TpsAgo-based method for enriching low-abundance target nucleic acids and corresponding detection methods. The invention has the advantages of non-invasiveness, easy operation, rapidity, etc., and can better detect low-abundance mutant genes in human liquid biopsy. The technology of the invention can be widely used in various fields of molecular diagnosis involving nucleic acid detection, such as tumor liquid Biopsy, infectious diseases such as major infectious and pathogenic infectious diseases (viruses, pathogenic bacteria) detection fields. The present invention has been completed on this basis.
术语the term
为了更容易理解本发明,以下具体定义了某些技术和科学术语。除非在本文中另有明确定义,本文使用的所有其它技术和科学术语都具有本发明所属领域的一般技术人员通常理解的含义。在描述本发明之前,应当理解本发明不限于所述的具体方法和实验条件,因为这类方法和条件可以变动。还应当理解本文所用的术语其目的仅在于描述具体实施方案,并且不意图是限制性的,本发明的范围将仅由所附的权利要求书限制。For easier understanding of the present invention, certain technical and scientific terms are specifically defined below. Unless explicitly defined otherwise herein, all other technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Before the present invention is described, it is to be understood that this invention is not limited to the specific methods and experimental conditions described, as such methods and conditions may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting, the scope of the invention will be limited only by the appended claims.
除非另外定义,否则本文中所用的全部技术与科学术语均具有如本发明所属领域的普通技术人员通常理解的相同含义。如本文所用,在提到具体列举的数值中使用时,术语“约”意指该值可以从列举的值变动不多于1%。例如,如本文所用,表述“约100”包括99和101和之间的全部值(例如,99.1、99.2、99.3、99.4等)。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. As used herein, when used in reference to a specifically recited value, the term "about" means that the value may vary by no more than 1% from the recited value. For example, as used herein, the expression "about 100" includes all values between 99 and 101 and (eg, 99.1, 99.2, 99.3, 99.4, etc.).
如本文所用,术语“任选”或“任选地”意味着随后所描述的事件或情况可以发生但不是必须发生。As used herein, the terms "optional" or "optionally" mean that the subsequently described event or circumstance can, but need not, occur.
如本文所用,术语“含有”或“包括(包含)”可以使开放式、半封闭式和封闭式的。换言之,所述术语也包括“基本上由…构成”或“由…构成”。As used herein, the terms "containing" or "including (including)" can be open, semi-closed, and closed. In other words, the term also includes "consisting essentially of" or "consisting of."
“转导”、“转染”、“转化”或本文用到的术语指的是将外源多核苷酸传递导至宿主细胞,转录和翻译产生多肽产物的过程,包括利用质粒分子将外源多核苷酸引入宿主细胞(例如大肠杆菌)。"Transduction", "transfection", "transformation" or the terms used herein refer to the process of delivering an exogenous polynucleotide into a host cell for transcription and translation to produce a polypeptide product, including the use of plasmid molecules to transfer the exogenous polynucleotide to a host cell. The polynucleotide is introduced into a host cell (eg, E. coli).
“基因表达”或“表达”指的是基因转录,翻译和翻译后修饰产生基因的RNA或蛋白产物的过程。"Gene expression" or "expression" refers to the process of transcription, translation and post-translational modification of a gene to produce the RNA or protein product of a gene.
“多核苷酸”指的是任意长度的核苷酸的聚合形式,包括脱氧核苷酸(DNA),核糖核苷酸(RNA),其杂合序列和类似物。多核苷酸可包括修饰的核苷酸,比如甲基化或加帽的核苷酸或核苷酸类似物。本文使用的术语多核苷酸指 可互换的单链和双链分子。除非另有说明,本文描述的任意实施例里的多核苷酸包括双链的形式和已知的或可预测的构成双链形式的两条互补的单链。"Polynucleotide" refers to a polymeric form of nucleotides of any length, including deoxynucleotides (DNA), ribonucleotides (RNA), hybrid sequences thereof, and the like. Polynucleotides can include modified nucleotides, such as methylated or capped nucleotides or nucleotide analogs. The term polynucleotide as used herein refers to interchangeable single- and double-stranded molecules. Unless otherwise specified, a polynucleotide in any of the embodiments described herein includes both the double-stranded form and the two complementary single strands known or predicted to make up the double-stranded form.
保守氨基酸的取代是本领域已知的。在一些实施例中,潜在的取代氨基酸在以下组的一个或多个内:甘氨酸,丙氨酸;和缬氨酸,异亮氨酸,亮氨酸和脯氨酸;天冬氨酸,谷氨酸;天冬酰胺,谷氨酰胺;丝氨酸,苏氨酸赖氨酸,精氨酸和组氨酸;和/或苯丙氨酸,色氨酸和酪氨酸;蛋氨酸和半胱氨酸。此外,本发明还提供了允许来自不同基团的氨基酸取代的非保守的氨基酸取代。Conservative amino acid substitutions are known in the art. In some embodiments, potential substituting amino acids are within one or more of the following groups: glycine, alanine; and valine, isoleucine, leucine, and proline; aspartic acid, glutamic acid amino acids; asparagine, glutamine; serine, threonine, lysine, arginine and histidine; and/or phenylalanine, tryptophan and tyrosine; methionine and cysteine . In addition, the present invention also provides non-conservative amino acid substitutions that allow for amino acid substitutions from different groups.
本领域技术人员将容易理解本文所述的所有参数,尺寸,材料和构造的含义。实际参数,尺寸,材料和/或配置取决于使用本发明说明的特定应用。本领域技术人员能够理解,实施例或权利要求仅是通过示例的方式给出的,并且在等效物或权利要求的范围内,本发明的实施例可涵盖的范围不限于具体描述和要求的范围。Those skilled in the art will readily understand the meaning of all parameters, dimensions, materials and configurations described herein. Actual parameters, dimensions, materials and/or configurations depend on the particular application for which the invention is described. Those skilled in the art will appreciate that the embodiments or claims are given by way of example only, and within the scope of equivalents or claims, the scope that the embodiments of the present invention can cover is not limited to what is specifically described and claimed. scope.
本文的定义和使用的所有定义应被理解为超过词典定义或通过引用并入的文档中的定义。Definitions, and all definitions used herein, should be understood to control over dictionary definitions or definitions in documents incorporated by reference.
本文所发明的所有参考文献,专利和专利申请都相对于其所引用的主题通过引用并入,在某些情况下可能包含整个文档。All references, patents and patent applications disclosed herein are incorporated by reference with respect to the subject matter for which they are cited, and in some cases the entire document may be included.
应当理解,对于本文所述的包括一个以上步骤的任何方法,步骤的顺序不一定限于这些实施例中描述的顺序。It should be understood that for any method described herein that includes more than one step, the order of the steps is not necessarily limited to the order described in these examples.
Ago酶Ago enzyme
Argonaute蛋白属于PIWI(P element-induced wimpy testis)蛋白超家族,其由PIWI结构域的存在而界定,广泛存在于生活的所有领域,能够结合siDNA或siRNA指导链来特异性沉默或剪切互补核酸靶标链。研究表明,Ago在生物体细胞免疫防御及代谢调控中发挥重要作用,并可能具有人工基因编辑的应用潜力,因此针对Ago蛋白的功能研究成为生物学研究中新关注点。Argonaute protein belongs to the PIWI (P element-induced wimpy testis) protein superfamily, which is defined by the presence of the PIWI domain, widely present in all areas of life, and can bind to siDNA or siRNA guide strand to specifically silence or cut complementary nucleic acids target strand. Studies have shown that Ago plays an important role in the immune defense and metabolic regulation of organisms, and may have the application potential of artificial gene editing. Therefore, the functional study of Ago protein has become a new focus in biological research.
Ago蛋白最初是在真核生物中发现的,是RNA干扰(RNAi)途径的关键参与者。真核Argonaute蛋白(eAgos)作为多蛋白RNA诱导沉默复合物(RISC)的核心,能够结合siRNA分子作为指导链,剪切互补的靶标RNA,直接沉默靶标RNA的翻译;或通过与靶标RNA结合,募集其他沉默因子来促进其降解,进而间接沉默靶标RNA。因此,eAgos可以在转录后调节基因表达,保护其宿主不受入侵RNA病毒的侵害,并通过降低转座子的流动性保持基因组的完整性。Originally discovered in eukaryotes, Ago proteins are key players in the RNA interference (RNAi) pathway. Eukaryotic Argonaute protein (eAgos), as the core of multi-protein RNA-induced silencing complex (RISC), can bind siRNA molecules as guide strands, cleaves complementary target RNAs, and directly silence the translation of target RNAs; or by binding to target RNAs, Other silencing factors are recruited to promote their degradation, thereby indirectly silencing the target RNA. Thus, eAgos can regulate gene expression post-transcriptionally, protect their hosts from invading RNA viruses, and maintain genome integrity by reducing the mobility of transposons.
Argonaute蛋白还存在于原核生物中。对一些原核Ago(pAgos)蛋白(主要来自嗜热细菌和古菌)的结构和生化研究表明,它们在体外可以发挥核酸内切酶作用,在体内可以发挥宿主防御作用。pAgos可以结合siDNA指导链来特异性剪切和指导链互补配对的DNA靶标链。截至2018年,已报道的pAgos主要来源于高温宿主,多用于基因检测。常温条件下活性很低,无法作为基因编辑的工具。2019年至今,陆续报道了一些来源于常温宿主的pAgos,能在常温条件下发挥DNA指导的DNA剪切活性,并且能够剪切GC含量较低的质粒。Argonaute proteins are also present in prokaryotes. Structural and biochemical studies of some prokaryotic Ago (pAgos) proteins (mainly from thermophilic bacteria and archaea) have shown that they can function as endonucleases in vitro and host defense in vivo. pAgos can bind to the siDNA guide strand to specifically cleave the complementary paired DNA target strand of the guide strand. As of 2018, the reported pAgos are mainly derived from high temperature hosts and are mostly used for genetic testing. The activity is very low at room temperature and cannot be used as a tool for gene editing. Since 2019, some pAgos derived from normal temperature hosts have been reported successively, which can exert DNA-directed DNA shearing activity under normal temperature conditions, and can shear plasmids with low GC content.
如本文所用,术语“可编程核酸内切酶Thermus parvatiensis”、“核酸酶Thermus parvatiensis”、“TpsAgo酶”可互换使用,指本发明第一方面中所述的酶。As used herein, the terms "programmable endonuclease Thermus parvatiensis", "nuclease Thermus parvatiensis", "TpsAgo enzyme" are used interchangeably and refer to the enzymes described in the first aspect of the invention.
野生型的TpsAgo酶具有如NCBI序列号WP_060384876.1所示的氨基酸序列。The wild-type TpsAgo enzyme has the amino acid sequence shown in NCBI SEQ ID NO: WP_060384876.1.
本发明的TpsAgo酶还可包含其保留了功能活性的突变形式。所述的突变形式可含有在如NCBI序列号WP_060384876.1所示序列的基础上,进行一个或多个氨基酸残基的替换、缺失、改变或插入,或在其N端或C端添加1至10个氨基酸残基(较佳地1至5个氨基酸残基,更佳地1至3个氨基酸残基),从而获得的氨基酸序列;并且所述获得的氨基酸序列与如NCBI序列号WP_060384876.1所示序列具有≥85%(优选地≥90%,更优选地≥95%,例如≥96%、≥97%、≥98%或≥99%)的序列同一性;并且所获得的氨基酸序列具备与野生型TpsAgo酶相同或相似的功能。The TpsAgo enzymes of the present invention may also comprise mutant forms thereof that retain functional activity. Said mutant form can contain one or more amino acid residues substitution, deletion, change or insertion on the basis of the sequence shown in NCBI sequence number WP_060384876.1, or add 1 to 10 amino acid residues (preferably 1 to 5 amino acid residues, more preferably 1 to 3 amino acid residues), the amino acid sequence obtained; and the amino acid sequence obtained is the same as NCBI sequence number WP_060384876.1 The sequence shown has ≥85% (preferably ≥90%, more preferably ≥95%, such as ≥96%, ≥97%, ≥98% or ≥99%) sequence identity; and the amino acid sequence obtained has Same or similar function as wild-type TpsAgo enzyme.
“边扩增边剪切”的偶联反应"Amplification and cleavage" coupling reaction
在本发明中,在采用TpsAgo-gDNA复合物进行“边扩增边剪切”的偶联反应时,可以采用相应切割酶和相应扩增酶的合适条件下进行所述反应,只要该条件下所述的切割酶和扩增酶能够发挥其相应功能。In the present invention, when the TpsAgo-gDNA complex is used to carry out the coupling reaction of "cleaving while amplification", the reaction can be carried out under appropriate conditions using the corresponding cleavage enzyme and the corresponding amplification enzyme, as long as the conditions are The cleavage enzymes and amplification enzymes can perform their corresponding functions.
本发明的研究表明,对于通过所述偶联反应来富集突变型dsDNA信号,一些关键因素主要包括以下几个方面:The research of the present invention shows that, for enriching the mutant dsDNA signal through the coupling reaction, some key factors mainly include the following aspects:
①富集反应体系中初始模板浓度:野生型(wild type,wt)和突变型(mutant type,mut)总浓度(nM~fM)):优选为0.1-100nM。①The initial template concentration in the enrichment reaction system: wild type (wild type, wt) and mutant type (mutant type, mut) total concentration (nM~fM)): preferably 0.1-100nM.
②富集反应体系中初始TpsAgo蛋白浓度:优选为20-100nM;②Initial TpsAgo protein concentration in the enrichment reaction system: preferably 20-100nM;
③富集反应体系中初始gDNAs浓度:优选为200-2000nM;③The initial concentration of gDNAs in the enrichment reaction system: preferably 200-2000nM;
④TpsAgo蛋白与gDNAs间摩尔浓度比例:优选为1:5~1:20;④The molar concentration ratio between TpsAgo protein and gDNAs: preferably 1:5~1:20;
富集低丰度目标核酸的反应体系A reaction system for enriching low-abundance target nucleic acids
如本文所用,术语“富集低丰度目标核酸的反应体系”、“本发明富集体系”可互换使用,指本发明第二方面中所述的用于富集低丰度目标核酸的反应体系。As used herein, the terms "reaction system for enriching low-abundance target nucleic acid" and "enrichment system of the present invention" are used interchangeably, and refer to the method for enriching low-abundance target nucleic acid described in the second aspect of the present invention. reaction system.
在本发明中,提供了一种富集低丰度目标核酸的反应体系,该体系基于上述“边扩增边剪切”的偶联反应的计数原理,用于对一核酸样本同时进行核酸扩增反应和核酸切割反应,从而获得扩增-切割反应产物。In the present invention, a reaction system for enriching low-abundance target nucleic acid is provided. The system is based on the above-mentioned counting principle of the coupling reaction of "amplifying while shearing", and is used for simultaneously performing nucleic acid amplification on a nucleic acid sample. The amplification reaction and the nucleic acid cleavage reaction are carried out to obtain the amplification-cleavage reaction product.
在具体的实施方式中,在富集体系中,所述的核酸样本含有第一核酸和第二核酸,其中,所述的第一核酸为所述目标核酸,而所述的第二核酸为非目标核酸;所述的核酸切割反应用于特异性切割非目标核酸,但不切割所述目的核酸。In a specific embodiment, in the enrichment system, the nucleic acid sample contains a first nucleic acid and a second nucleic acid, wherein the first nucleic acid is the target nucleic acid, and the second nucleic acid is a non- Target nucleic acid; the nucleic acid cleavage reaction is used to specifically cut non-target nucleic acid, but not the target nucleic acid.
在本发明的富集体系中,所述的扩增-切割反应体系含有(i)进行核酸扩增反应所需的试剂和(ii)本发明基于可编程核酸内切酶Argonaute(Ago)的核酸切割体系。In the enrichment system of the present invention, the amplification-cleavage reaction system contains (i) the reagents required for the nucleic acid amplification reaction and (ii) the nucleic acid based on the programmable endonuclease Argonaute (Ago) of the present invention cutting system.
在本发明富集体系中,在富集之前,所述低丰度目标核酸的浓度为0.5-5nM,较佳地为0.8-2nM,更佳地为1nM。In the enrichment system of the present invention, before enrichment, the concentration of the low-abundance target nucleic acid is 0.5-5nM, preferably 0.8-2nM, more preferably 1nM.
在另一优选例中,所述反应体系中,所述可编程核酸内切酶Argonaute(Ago)的浓度为20-200nM,较佳地为30-150nM,更佳地为40-100nM。In another preferred example, in the reaction system, the concentration of the programmable endonuclease Argonaute (Ago) is 20-200 nM, preferably 30-150 nM, more preferably 40-100 nM.
在另一优选例中,所述的进行等温扩增反应所需的试剂包括One-Step tHDA试剂盒(购自NEB公司(New England Biolabs)。In another preferred embodiment, the reagents required for the isothermal amplification reaction include One-Step tHDA kit (purchased from NEB Company (New England Biolabs).
另外,所述的进行等温扩增反应所需的试剂还包括目标核酸的扩增引物对。优选地,所述的目标核酸的扩增引物对中的各引物的浓度为10-300nM,较佳地为50-200nM,更佳地为100nM。In addition, the reagents required for performing the isothermal amplification reaction also include a pair of amplification primers for the target nucleic acid. Preferably, the concentration of each primer in the amplification primer pair of the target nucleic acid is 10-300 nM, preferably 50-200 nM, more preferably 100 nM.
在本发明富集体系中,所述gDNA包括正向gDNA和反向gDNA;其中,所述正向gDNA是指与目标核酸具有相同序列片段的gDNA,所述反向gDNA是指与目标核酸具有反向互补序列片段的gDNA。In the enrichment system of the present invention, the gDNA includes forward gDNA and reverse gDNA; wherein, the forward gDNA refers to the gDNA with the same sequence fragment as the target nucleic acid, and the reverse gDNA refers to the gDNA with the target nucleic acid. gDNA of the reverse complement fragment.
在一个优选的实施方式中,所述反应体系中还包括二价金属离子。优选地,所述的二价金属离子为Mn 2+。并且所述二价金属离子的浓度为50μM-2000μM,较佳地100μM-1000μM,更佳地为250μM。 In a preferred embodiment, the reaction system further includes divalent metal ions. Preferably, the divalent metal ion is Mn 2+ . And the concentration of the divalent metal ion is 50 μM-2000 μM, preferably 100 μM-1000 μM, more preferably 250 μM.
在富集目标核酸的过程中,优选地,所述反应体系的反应温度(反应程序)为:65℃,1.5-2h。In the process of enriching the target nucleic acid, preferably, the reaction temperature (reaction procedure) of the reaction system is: 65° C., 1.5-2 h.
本发明富集和检测低丰度目标核酸的方法The method for enriching and detecting low-abundance target nucleic acid of the present invention
如本文所用,术语“本发明富集方法”、“富集低丰度目标核酸的方法”、“本发明富集核酸的方法”可互换使用,均是指本发明第三方面所述的用于富集低丰度目标核酸的方法。As used herein, the terms "enrichment method of the present invention", "method for enriching low-abundance target nucleic acid" and "method for enriching nucleic acid of the present invention" are used interchangeably, and all refer to the third aspect of the present invention. Methods for enriching low-abundance target nucleic acids.
本发明提供了一种富集低丰度目标核酸的方法,包括步骤:(a)提供一核酸样本,所述的核酸样本含有第一核酸和第二核酸,其中,所述的第一核酸为所述目标核酸,而所述的第二核酸为非目标核酸,并且,所述目标核酸在所述的核酸样本中的丰度为F1a;(b)对所述核酸样本中的核酸为模板,在扩增-切割反应体系中进行等温扩增和核酸切割反应,从而获得扩增-切割反应产物;其中,所述的核酸切割反应用于特异性切割非目标核酸,但不切割所述目的核酸;并且,所述的扩增-切割反应体系含有(i)进行等温扩增反应所需的试剂和(ii)本发明的核酸切割体系;其中,所述目标核酸在所述的扩增-切割反应产物中的丰度为F1b;其中,F1b/F1a的比值≥5(较佳地为≥10)。The present invention provides a method for enriching low-abundance target nucleic acid, comprising the steps of: (a) providing a nucleic acid sample, wherein the nucleic acid sample contains a first nucleic acid and a second nucleic acid, wherein the first nucleic acid is The target nucleic acid, and the second nucleic acid is a non-target nucleic acid, and the abundance of the target nucleic acid in the nucleic acid sample is F1a; (b) the nucleic acid in the nucleic acid sample is used as a template, Perform isothermal amplification and nucleic acid cleavage reaction in an amplification-cleavage reaction system to obtain an amplification-cleavage reaction product; wherein the nucleic acid cleavage reaction is used to specifically cut non-target nucleic acid, but not cut the target nucleic acid and, the amplification-cutting reaction system contains (i) reagents required for isothermal amplification reaction and (ii) the nucleic acid cutting system of the present invention; wherein, the target nucleic acid is in the amplification-cutting The abundance in the reaction product is F1b; wherein, the ratio of F1b/F1a is ≥5 (preferably ≥10).
在一个优选的实施方式中,所述的目标核酸和非目标核酸仅相差一个碱基。In a preferred embodiment, the target nucleic acid and the non-target nucleic acid differ by only one base.
优选地,当1%≤F1a≤10%时,F1b/F1a的比值≥10,当0.1%≤F1a≤0.5%时,F1b/F1a的比值≥100,当F1a≤0.1%时,F1b/F1a的比值≥200。Preferably, when 1%≤F1a≤10%, the ratio of F1b/F1a≥10, when 0.1%≤F1a≤0.5%, the ratio of F1b/F1a≥100, when F1a≤0.1%, the ratio of F1b/F1a Ratio≥200.
如本文所用,术语“本发明检测方法”、“检测低丰度目标核酸的方法”可互换使用,是指基于本发明第三方面所述的富集低丰度目标核酸的方法,对被富集的低丰度目标核酸进行检测的方法。As used herein, the terms "detection method of the present invention" and "method for detecting low-abundance target nucleic acid" can be used interchangeably, and refer to the method for enriching low-abundance target nucleic acid based on the third aspect of the present invention. A method for the detection of enriched low-abundance target nucleic acids.
本发明提供了一种检测低丰度目标核酸的方法,所述的方法基于上述富集低丰度目标核酸的方法步骤,进一步地包括:(c)对所述扩增-切割反应产物进行检测,从而测定所述目标核酸的存在与否和/或数量。The present invention provides a method for detecting low-abundance target nucleic acid, the method is based on the above-mentioned method steps for enriching low-abundance target nucleic acid, further comprising: (c) detecting the amplification-cleavage reaction product , thereby determining the presence and/or quantity of the target nucleic acid.
所述步骤(c)中的检测包括定量检测、定性检测、或其组合。The detection in the step (c) includes quantitative detection, qualitative detection, or a combination thereof.
优选地,所述的定量检测选自下组:TaqMan荧光定量PCR、桑格测序、q-PCR、ddPCR、化学发光法、高分辨率熔解曲线法、NGS等;更加优选地,选自TaqMan荧光定量PCR、桑格测序。Preferably, the quantitative detection is selected from the following group: TaqMan fluorescence quantitative PCR, Sanger sequencing, q-PCR, ddPCR, chemiluminescence, high-resolution melting curve method, NGS, etc.; more preferably, selected from TaqMan fluorescence Quantitative PCR, Sanger sequencing.
本发明所述的检测方法是可以是非诊断性和非治疗性的。The detection methods of the present invention may be non-diagnostic and non-therapeutic.
在实际的应用中,所述的核酸样本包括来自试样的核酸,其中所述试样选自 下组:血液、细胞、血清、唾液、体液、血浆、尿液、前列腺液、支气管灌洗液、脑脊液、胃液、胆汁、淋巴液、腹腔液及粪便等或其组合。In practical applications, the nucleic acid sample includes nucleic acid from a sample, wherein the sample is selected from the group consisting of blood, cells, serum, saliva, body fluid, plasma, urine, prostatic fluid, bronchial lavage fluid , cerebrospinal fluid, gastric fluid, bile, lymph fluid, peritoneal fluid and feces, etc. or a combination thereof.
在本发明具体的实施方式中,所述低丰度目标核酸可以是EGFR L861Q突变型序列片段。In a specific embodiment of the present invention, the low-abundance target nucleic acid can be an EGFR L861Q mutant sequence fragment.
试剂盒Reagent test kit
基于本发明的富集和检测低丰度目标核酸的方法,本发明进一步地提供了一种用于检测靶标核酸分子的试剂盒,包括:(i)本发明富集低丰度目标核酸反应体系或用于配制所述反应体系的试剂;(ii)用于检测低丰度目标核酸的检测试剂;和(ii)使用说明书,所述说明书描述了本发明的检测方法。Based on the method for enriching and detecting low-abundance target nucleic acid of the present invention, the present invention further provides a kit for detecting target nucleic acid molecules, comprising: (i) the present invention for enriching the low-abundance target nucleic acid reaction system or reagents for preparing the reaction system; (ii) detection reagents for detecting low-abundance target nucleic acids; and (ii) instructions for use, which describe the detection method of the present invention.
在具体的实施方式中,所述的试剂盒包括:(a)第一容器以及位于所述第一容器的向导DNA;(b)第二容器以及位于第二容器的可编程核酸内切酶Argonaute(Ago);和(c)第三容器以及位于第三容器的核酸扩增反应试剂。In a specific embodiment, the kit comprises: (a) a first container and a guide DNA located in the first container; (b) a second container and a programmable endonuclease Argonaute located in the second container (Ago); and (c) a third container and a nucleic acid amplification reaction reagent located in the third container.
优选地,所述的试剂盒还含有:(d)第四容器以及位于第四容器的低丰度目标核酸的检测试剂。所述的低丰度目标核酸的检测试剂包括:引物、探针等。在一个实施方式中,所述的低丰度目标核酸的检测试剂包括:TaqMan荧光定量PCR所需的引物和探针,或桑格测序所需试剂。Preferably, the kit further contains: (d) a fourth container and a low-abundance target nucleic acid detection reagent located in the fourth container. The low-abundance target nucleic acid detection reagents include: primers, probes, and the like. In one embodiment, the low-abundance target nucleic acid detection reagents include: primers and probes required for TaqMan fluorescence quantitative PCR, or reagents required for Sanger sequencing.
优选地,所述的试剂盒还含有:(e)第五容器以及位于第五容器的二价金属离子。优选地,所述的试剂盒还包括:(f)第六容器以及位于第六容器的缓冲液。Preferably, the kit further contains: (e) a fifth container and a divalent metal ion located in the fifth container. Preferably, the kit further comprises: (f) a sixth container and a buffer located in the sixth container.
在本发明的各个实施方式中,上述各容器可以是相同或不同的容器。In various embodiments of the present invention, the containers described above may be the same or different containers.
本发明的主要优点包括:The main advantages of the present invention include:
1)本发明所述的TpsAgo蛋白对靶标DNA与gDNA间的单点或双点错配具有较好的区分性,利用这种性质,可以将其运用到低丰度突变型基因的富集与检测中,实现对肿瘤早期突变基因的检测。1) The TpsAgo protein of the present invention has a good distinction between single-point or double-point mismatches between target DNA and gDNA. Using this property, it can be applied to the enrichment and enrichment of low-abundance mutant genes. In the detection, the detection of early mutated genes in tumors is realized.
2)TpsAgo可利用多种5’端修饰的gDNA剪切互补DNA。2) TpsAgo can use a variety of 5'-modified gDNA to cleave complementary DNA.
3)TpsAgo可耐受高浓度NaCl。3) TpsAgo can tolerate high concentration of NaCl.
4)TpsAgo可剪切多种靶标核酸,包括单链DNA,单链RNA,双链DNA。4) TpsAgo can cut a variety of target nucleic acids, including single-stranded DNA, single-stranded RNA, and double-stranded DNA.
下面结合具体实施例,进一步阐述本发明。应理解,这些实施例仅用于说 明本发明而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法,通常按照常规条件,例如Sambrook等人,分子克隆:实验室手册(New York:Cold Spring Harbor Laboratory Press,1989)中所述的条件,或按照制造厂商所建议的条件。除非另外说明,否则百分比和份数是重量百分比和重量份数。The present invention will be further described below in conjunction with specific embodiments. It should be understood that these examples are only used to illustrate the present invention and not to limit the scope of the present invention. The experimental method of unreceipted specific conditions in the following examples, usually according to conventional conditions, such as Sambrook et al., molecular cloning: conditions described in laboratory manual (New York:Cold Spring Harbor Laboratory Press, 1989), or according to manufacture conditions recommended by the manufacturer. Percentages and parts are weight percentages and parts unless otherwise specified.
实施例1:TpsAgo基因序列的获得Example 1: Acquisition of TpsAgo Gene Sequence
在数据库中,对已知的PfAgo的氨基酸序列进行相似性检索,选取部分序列一致性较高的氨基酸序列,采用MEGA软件进行分析,构建同源进化树,选取TpsAgo作为候选酶。获得TpsAgo(WP_060384876.1)的氨基酸序列和对应的编码该蛋白的基因序列(NZ_CP014142.1)。将该基因序列经密码子优化合成后,克隆至pET28a表达载体。In the database, the known amino acid sequences of PfAgo were searched for similarity, and some amino acid sequences with high sequence consistency were selected, analyzed by MEGA software, and a homologous evolutionary tree was constructed, and TpsAgo was selected as the candidate enzyme. The amino acid sequence of TpsAgo (WP_060384876.1) and the corresponding gene sequence encoding the protein (NZ_CP014142.1) were obtained. After the gene sequence was synthesized by codon optimization, it was cloned into pET28a expression vector.
实施例2:TpsAgo蛋白异源表达与纯化Example 2: Heterologous expression and purification of TpsAgo protein
将上述TpsAgo-pET28a原核表达质粒导入E.coli BL21(DE3)中,得到TpsAgo-pET28a/E.coli BL21(DE3)原核表达菌株。含重组质粒TpsAgo-pET28a的表达菌株E.coli BL21(DE3)接种于含有50μg/mL卡那霉素的LB培养基中,37℃,220rpm摇床培养到OD 600至0.6-0.8之间,加入终浓度0.4-0.6mM的IPTG,18℃,200rpm摇床继续培养16-20h,诱导TpsAgo蛋白的表达。离心收集菌体,使用重悬缓冲液(含20mM Tris-HCl、pH8.0左右、1M NaCl)重悬菌体,然后高压破碎菌体,离心获得上清。利用Ni-NTA柱亲和纯化蛋白,洗脱液经超滤浓缩、脱盐等步骤得到纯化的蛋白。纯化的蛋白保存于含20mM Tris-HCl的缓冲液中,并通过BCA试剂盒测定蛋白,测定步骤按照操作说明进行。以BSA作为标准品,配置标准溶液,绘制标准曲线,依此计算纯化的目的蛋白浓度,蛋白放于-80℃冰箱保存备用。SDS-PAGE电泳分析TpsAgo蛋白。 The above-mentioned TpsAgo-pET28a prokaryotic expression plasmid was introduced into E.coli BL21(DE3) to obtain a TpsAgo-pET28a/E.coli BL21(DE3) prokaryotic expression strain. The expression strain E.coli BL21(DE3) containing the recombinant plasmid TpsAgo-pET28a was inoculated into LB medium containing 50 μg/mL kanamycin, and incubated at 37°C with a shaker at 220 rpm to an OD 600 to 0.6-0.8. IPTG with a final concentration of 0.4-0.6 mM, 18° C., 200 rpm shaker was continued to culture for 16-20 h to induce the expression of TpsAgo protein. The cells were collected by centrifugation, resuspended in a resuspension buffer (containing 20 mM Tris-HCl, about pH 8.0, 1 M NaCl), then disrupted by high pressure, and centrifuged to obtain the supernatant. The protein was affinity purified by Ni-NTA column, and the eluate was concentrated by ultrafiltration, desalted and other steps to obtain the purified protein. The purified protein was stored in a buffer containing 20 mM Tris-HCl, and the protein was assayed by BCA kit, and the assay steps were carried out according to the operating instructions. Using BSA as a standard, prepare a standard solution, draw a standard curve, and calculate the purified target protein concentration based on this, and store the protein in a -80°C refrigerator for later use. TpsAgo protein was analyzed by SDS-PAGE electrophoresis.
结果如图2所示。结果表明,目的蛋白TpsAgo已得到纯化。The results are shown in Figure 2. The results showed that the target protein TpsAgo had been purified.
实施例3:TpsAgo与其他已知Ago序列比对Example 3: Alignment of TpsAgo with other known Ago sequences
在本实施例中,将TpsAgo与部分已表征的Ago进行了多重序列比对。In this example, TpsAgo was subjected to multiple sequence alignment with some of the characterized Agos.
结果显示,TpsAgo的氨基酸数是最少的,蛋白分子量也最小。据文献报道,所有具有催化活性的Ago蛋白的靶向剪切是通过一个保守的DEDX(X代表组氨酸,天冬氨酸或天冬酰胺)四联体所介导。通过序列比对,可以发现TpsAgo中 存在DEDD四联体(图1)。因此进一步推测其可能具有核酸酶催化活性,需要进一步体外鉴定表征。The results showed that TpsAgo had the smallest number of amino acids and the smallest protein molecular weight. It has been reported in the literature that the targeted cleavage of all catalytically active Ago proteins is mediated by a conserved DEDX (X stands for histidine, aspartate or asparagine) quadruplet. Through sequence alignment, it can be found that there is a DEDD quadruplet in TpsAgo (Figure 1). Therefore, it is further speculated that it may have nuclease catalytic activity, which requires further in vitro identification and characterization.
实施例4:TpsAgo剪切活性测定Example 4: TpsAgo cleavage activity assay
设计带有荧光修饰的45nt单链DNA、RNA靶标核酸以及互补的四种16nt DNA、RNA引导链,并送公司合成。Design 45nt single-stranded DNA with fluorescent modification, RNA target nucleic acid and four complementary 16nt DNA and RNA guide strands, and send them to the company for synthesis.
DNA靶标核酸序列(SEQ ID NO:1):DNA target nucleic acid sequence (SEQ ID NO: 1):
5’-FAM-CGCAGCATGTCAAGATCACAGATTTTGGGCTGGCCAAACTGCTGG-3’5’-FAM-CGCAGCATGTCAAGATCACAGATTTTGGGCTGGCCAAACTGCTGG-3’
RNA靶标核酸(SEQ ID NO:2):RNA target nucleic acid (SEQ ID NO: 2):
5’-FAM-CGCAGCAUGUCAAGAUCACAGAUUUUGGGCUGGCCAAACUGCUGG-3’5’-FAM-CGCAGCAUGUCAAGAUCACAGAUUUUGGGCUGGCCAAACUGCUGG-3’
gDNA(SEQ ID NO:3):gDNA (SEQ ID NO: 3):
5’-HO/P-TAGTTTGGCCAGCCCA-3’5’-HO/P-TAGTTTGGCCAGCCCA-3’
gRNA(SEQ ID NO:4):gRNA (SEQ ID NO: 4):
5’-HO/P-UAGUUUGGCCAGCCCA-3’5’-HO/P-UAGUUUGGCCAGCCCA-3’
配置反应缓冲液(含15mM Tris-HCl pH8.0、250mM NaCl),在反应缓冲液中加入终浓度为0.5mM的MnCl 2、200nM TpsAgo、2μM合成的gDNA或gRNA和0.8μM 5’荧光修饰的序列互补单链DNA或RNA靶标核酸,在80℃反应30min,反应结束后,取6-10μL样品,按1:1比例加入上样缓冲液(含95%(去离子)甲酰胺,0.5mmol/L EDTA,0.025%溴酚蓝,0.025%二甲苯蓝),在16%的核酸Urea-PAGE下进行电泳检测。 Prepare the reaction buffer (containing 15mM Tris-HCl pH8.0, 250mM NaCl), add 0.5mM MnCl 2 , 200nM TpsAgo, 2μM synthetic gDNA or gRNA and 0.8μM 5' fluorescently modified to the reaction buffer Sequence complementary single-stranded DNA or RNA target nucleic acid, react at 80°C for 30min, after the reaction, take 6-10μL of sample, add loading buffer (containing 95% (deionized) formamide, 0.5mmol/ L EDTA, 0.025% bromophenol blue, 0.025% xylene blue), electrophoresis detection was performed under 16% nucleic acid Urea-PAGE.
结果如图3所示。结果表明,TpsAgo可利用5’-P和5’-OH gDNA剪切互补单链DNA,还可利用5’-P gDNA剪切互补单链RNA。The results are shown in Figure 3. The results show that TpsAgo can use 5'-P and 5'-OH gDNA to cleave complementary single-stranded DNA, and can also use 5'-P gDNA to cleave complementary single-stranded RNA.
实施例5:TpsAgo催化特性分析Example 5: Analysis of the catalytic properties of TpsAgo
分别设计11-30nt的5’磷酸化gDNA和14-24nt的5’羟基化gDNA,探究不同长度gDNA对TpsAgo酶活的影响。在反应缓冲液中加入终浓度为0.5mM的MnCl 2,终浓度为200nM的TpsAgo,2μM合成的不同长度的gDNA和0.8μM 60nt序列互补单链DNA靶标核酸,分别在80℃反应30min,反应产物在16%的核酸Urea-PAGE下进行电泳检测。 11-30nt 5'phosphorylated gDNA and 14-24nt 5'hydroxylated gDNA were designed to explore the effect of different lengths of gDNA on TpsAgo enzyme activity. MnCl 2 with a final concentration of 0.5 mM, TpsAgo with a final concentration of 200 nM, 2 μM of synthesized gDNA of different lengths, and 0.8 μM of 60nt sequence complementary single-stranded DNA target nucleic acid were added to the reaction buffer, respectively, and reacted at 80 °C for 30 min. Electrophoretic detection was performed under 16% nucleic acid Urea-PAGE.
结果如图4所示。结果表明,TpsAgo可利用长度为16-20nt 5’-P gDNA, 也可利用长度为16-22nt 5’-OH gDNA剪切互补靶标核酸。The results are shown in Figure 4. The results show that TpsAgo can use 16-20nt 5'-P gDNA, and can also use 16-22nt 5'-OH gDNA to cut complementary target nucleic acid.
分别在不同温度(50℃、55℃、60℃、65℃、70℃、75℃、80℃、85℃、90℃、95℃、100℃)下探究TpsAgo的酶活,在反应缓冲液中加入终浓度为0.5mM的MnCl 2,终浓度为200nM的TpsAgo,2μM合成的gDNA和0.8μM 60nt序列互补单链DNA靶标核酸,在不同温度下反应30min,反应产物在16%的核酸Urea-PAGE下进行电泳检测。 The enzymatic activity of TpsAgo was investigated at different temperatures (50°C, 55°C, 60°C, 65°C, 70°C, 75°C, 80°C, 85°C, 90°C, 95°C, 100°C). Add MnCl 2 with a final concentration of 0.5mM, TpsAgo with a final concentration of 200nM, 2μM synthetic gDNA and 0.8μM 60nt sequence complementary single-stranded DNA target nucleic acid, and react at different temperatures for 30min, and the reaction products are displayed on 16% nucleic acid Urea-PAGE electrophoresis detection.
结果如图5所示。结果表明,TpsAgo可在65℃-90℃的范围内利用5’-P gDNA剪切互补靶标核酸。The results are shown in Figure 5. The results show that TpsAgo can use 5'-P gDNA to cleave complementary target nucleic acid in the range of 65℃-90℃.
TpsAgo、引导链和靶标核酸浓度不变,在反应体系中分别加入终浓度0.5mM的CoCl 2、CuCl 2、MgCl 2、MnCl 2、ZnCl 2、CaCl 2溶液,在反应温度80℃下,反应30min,在16%的核酸Urea-PAGE下进行电泳检测金属离子对酶活力影响。反应体系和条件不变,加入不同MnCl 2浓度:25μM、50μM、100μM、250μM、500μM、1000μM、2000μM,测定在5’磷酸化引导链介导下TpsAgo最适MnCl 2浓度。 The concentrations of TpsAgo, guide strand and target nucleic acid remained unchanged. CoCl 2 , CuCl 2 , MgCl 2 , MnCl 2 , ZnCl 2 , and CaCl 2 solutions with a final concentration of 0.5 mM were added to the reaction system. , the effect of metal ions on enzyme activity was detected by electrophoresis under 16% nucleic acid Urea-PAGE. The reaction system and conditions remained unchanged. Different concentrations of MnCl 2 were added: 25 μM, 50 μM, 100 μM, 250 μM, 500 μM, 1000 μM, and 2000 μM to determine the optimal MnCl 2 concentration of TpsAgo mediated by the 5' phosphorylated guide strand.
结果如图6所示。结果表明,TpsAgo可以利用Mn 2+和Co 2+作为金属离子,去介导单链DNA引导的DNA剪切。其中,TpsAgo更偏好Mn 2+,并且100μM-2000μM的Mn 2+可使TpsAgo保持较高活性。 The results are shown in Figure 6. The results show that TpsAgo can utilize Mn 2+ and Co 2+ as metal ions to mediate single-stranded DNA-guided DNA cleavage. Among them, TpsAgo prefers Mn 2+ , and 100μM-2000μM Mn 2+ can keep TpsAgo highly active.
调整反应缓冲液成分,分别配置终浓度为15mM Tris-HCl pH8.0和不同浓度的NaCl(20mM、40mM、80mM、200mM、400mM、800mM、1600mM、2400mM、3200mM、4000mM、4800mM)的反应缓冲液,其他反应体系不变,80℃反应30min,在16%的核酸Urea-PAGE下进行电泳检测。Adjust the composition of the reaction buffer to prepare reaction buffers with a final concentration of 15mM Tris-HCl pH8.0 and different concentrations of NaCl (20mM, 40mM, 80mM, 200mM, 400mM, 800mM, 1600mM, 2400mM, 3200mM, 4000mM, 4800mM). , the other reaction systems were unchanged, and the reaction was carried out at 80 °C for 30 min, and electrophoresis was carried out under 16% nucleic acid Urea-PAGE.
结果如图7所示。结果表明,TpsAgo可以在NaCl浓度为0-3200mM时发挥剪切活性。The results are shown in Figure 7. The results showed that TpsAgo could exert shearing activity at NaCl concentration of 0-3200 mM.
分别设计gDNA 5’末端第一个碱基是A、T、G、C的16nt gDNA,反应体系不变,加入MnCl 2、gDNA和靶标DNA,测定在5’磷酸化引导链介导下TpsAgo对gDNA 5’末端第一个碱基的偏好性。80℃反应15min,在16%的核酸Urea-PAGE下进行电泳检测。反应体系不变,分别反应0、5min、10min、15min、20min、25min、30min,测定四种gDNA的剪切动力学。 The 16nt gDNA whose first base at the 5' end of gDNA is A, T, G, and C was designed respectively. The reaction system was unchanged. MnCl 2 , gDNA and target DNA were added. The preference of the first base at the 5' end of gDNA. The reaction was carried out at 80°C for 15 min, and electrophoresis was performed under 16% nucleic acid Urea-PAGE. The reaction system was unchanged, and the reactions were performed for 0, 5 min, 10 min, 15 min, 20 min, 25 min, and 30 min, respectively, and the shearing kinetics of the four kinds of gDNA were determined.
结果如图8所示。结果表明,TpsAgo更偏好5’T或5’G。The results are shown in Figure 8. The results show that TpsAgo prefers 5'T or 5'G.
设计合成5’末端不同修饰的gDNA:5’-P、5’-OH、5’-Biotin、5’-NH 2C 6、5’-FAM、5’-SHC 6,反应体系不变,加入MnCl 2、gDNA和靶标DNA,测定在5’不同修饰引导链介导下TpsAgo的剪切效率。80℃反应15min,在16%的核酸Urea-PAGE下进行电泳检测。 Design and synthesize gDNA with different 5' end modifications: 5'-P, 5'-OH, 5'-Biotin, 5'-NH 2 C 6 , 5'-FAM, 5'-SHC 6 , the reaction system is unchanged, add MnCl 2 , gDNA and target DNA, and the cleavage efficiency of TpsAgo mediated by different 5' modified guide strands was determined. The reaction was carried out at 80°C for 15 min, and electrophoresis was performed under 16% nucleic acid Urea-PAGE.
结果如图9所示。结果表明,TpsAgo可以利用5’-P、5’-OH、5’-Biotin、5’-NH 2C 6、5’-FAM、5’-SHC 6修饰的gDNA,且更偏好5’-OH和5’-NH 2C 6The results are shown in Figure 9. The results show that TpsAgo can utilize 5'-P, 5'-OH, 5'-Biotin, 5'-NH 2 C 6 , 5'-FAM, 5'-SHC 6 modified gDNA, and prefer 5'-OH and 5'-NH 2 C 6 .
设计60-90nt范围内的野生型核酸序列以及单碱基突变的突变型核酸序列,以及一系列与target在不同位点(MP2-15)存在单点错配的gDNA,反应体系不变,加入MnCl 2、以及不同位点错配的gDNA和靶标DNA,测定TpsAgo的区分剪切效果。80℃反应15min,在16%的核酸Urea-PAGE下进行电泳检测。 Design wild-type nucleic acid sequences in the range of 60-90nt, mutant nucleic acid sequences with single base mutation, and a series of gDNAs with single-point mismatches with the target at different sites (MP2-15), the reaction system is unchanged, add MnCl 2 , and mismatched gDNA and target DNA at different sites were used to determine the differential cleavage effect of TpsAgo. The reaction was carried out at 80°C for 15 min, and electrophoresis was performed under 16% nucleic acid Urea-PAGE.
结果如图10所示。结果表明,当错配发生在gDNA的2、3、4、6、8、10、11、13、14和16位时,TpsAgo的剪切效率显著降低。The results are shown in Figure 10. The results showed that the cleavage efficiency of TpsAgo was significantly reduced when mismatches occurred at positions 2, 3, 4, 6, 8, 10, 11, 13, 14 and 16 of the gDNA.
设计合成GC含量为29%和65%(剪切位点上下游80bp范围内)的剪切质粒pUC19两条链的gDNAs,序列如图11所示。配置反应缓冲液(含15mM Tris-HCl pH8.0、100mM NaCl),在反应缓冲液中加入终浓度为0.5mM的MnCl 2,750nM TpsAgo,2.5μM合成的正反向gDNA和300ng-600ng pUC19质粒,在80℃反应2-4h。反应结束后,在样品中加入一定量的蛋白酶K和CaCl 2,50-55℃反应1h。加入5×上样缓冲液,在1.2%的琼脂糖胶中进行电泳检测。 The gDNAs of the two strands of the spliced plasmid pUC19 with GC contents of 29% and 65% (within 80 bp upstream and downstream of the cleavage site) were designed and synthesized. The sequences are shown in Figure 11 . Prepare reaction buffer (containing 15mM Tris-HCl pH8.0, 100mM NaCl), add MnCl 2 at a final concentration of 0.5mM, 750nM TpsAgo, 2.5μM synthetic forward and reverse gDNA and 300ng-600ng pUC19 plasmid to the reaction buffer , and react at 80°C for 2-4h. After the reaction, a certain amount of proteinase K and CaCl 2 were added to the sample, and the reaction was carried out at 50-55° C. for 1 h. Add 5× loading buffer and perform electrophoresis detection in 1.2% agarose gel.
结果如图11所示。结果表明,当剪切位点附近GC含量为29%时,TpsAgo可利用一对gDNA将超螺旋质粒全部剪切为线性质粒;当剪切位点附近GC含量为65%时,TpsAgo可利用一对gDNA将超螺旋质粒转化为开环质粒和线性质粒。The results are shown in Figure 11. The results showed that when the GC content near the cleavage site was 29%, TpsAgo could use a pair of gDNA to cut all the supercoiled plasmids into linear plasmids; when the GC content near the cleavage site was 65%, TpsAgo could use a pair of gDNA to cut all the supercoiled plasmids into linear plasmids. Transform supercoiled plasmids into open-circular and linear plasmids for gDNA.
实施例6:TpsAgo结合等温扩增技术,对EGFR单核苷酸变异(SNV)基因实现富集检测Example 6: TpsAgo combined with isothermal amplification technology to achieve enrichment detection of EGFR single nucleotide variation (SNV) gene
以EGFR L861Q SNV突变基因为例,根据序列特征(具体见表1序列),设计在10-11位与SNV基因具有错配的gDNAs,如图12所示。分别以EGFR两条链作为靶标DNA,筛选可以区分剪切野生型和SNV基因的gDNAs,剪切体系同 上。Taking the EGFR L861Q SNV mutant gene as an example, according to the sequence characteristics (see the sequence in Table 1 for details), gDNAs with mismatches with the SNV gene at positions 10-11 were designed, as shown in Figure 12. The two strands of EGFR were used as target DNAs to screen gDNAs that could differentiate between splicing wild-type and SNV genes. The splicing system was the same as above.
分别以EGFR L861Q SNV野生和突变片段为底物,通过PCR对野生和突变模板进行扩增。PCR产物经纯化回收后,采用李记生物公司销售的Pikogreen dsDNA定量试剂盒(超敏)(兼容Qubit 3.0)对配制样品进行定量。将模板配置为10nM(富集体系终浓度)10%mut EGFR L861Q样品。The wild and mutant templates were amplified by PCR using EGFR L861Q SNV wild and mutant fragments as substrates, respectively. After the PCR product was purified and recovered, the prepared samples were quantified using the Pikogreen dsDNA quantification kit (supersensitive) (compatible with Qubit 3.0) sold by Liji Bio. Templates were configured as 10 nM (final concentration of enrichment system) 10% mut EGFR L861Q samples.
筛选好gDNA后,采用New England Biolabs公司销售的tHDA试剂盒,结合TpsAgo对EGFR L861Q SNV基因进行富集。以50μL反应体系为例:After screening the gDNA, the EGFR L861Q SNV gene was enriched using the tHDA kit sold by New England Biolabs in combination with TpsAgo. Take a 50 μL reaction system as an example:
Figure PCTCN2022088816-appb-000001
Figure PCTCN2022088816-appb-000001
其余组分按tHDA试剂盒说明添加。The rest of the components were added according to the instructions of the tHDA kit.
反应程序:65℃,1.5-2hReaction program: 65℃, 1.5-2h
富集样品通过桑格尔测序检测。Enriched samples were detected by Sanger sequencing.
结果如图13所示。结果表明,针对EGFR L858R基因,TpsAgo可以对等位基因突变频率(VAF)为10%的Mut基因进行少许富集。The results are shown in Figure 13. The results showed that for the EGFR L858R gene, TpsAgo could slightly enrich the Mut gene with an allelic mutation frequency (VAF) of 10%.
表1Table 1
Figure PCTCN2022088816-appb-000002
Figure PCTCN2022088816-appb-000002
在本发明提及的所有文献都在本申请中引用作为参考,就如同每一篇文献被单独引用作为参考那样。此外应理解,在阅读了本发明的上述讲授内容之后,本领域技术人员可以对本发明作各种改动或修改,这些等价形式同样落于本申请所附权利要求书所限定的范围。All documents mentioned herein are incorporated by reference in this application as if each document were individually incorporated by reference. In addition, it should be understood that after reading the above teaching content of the present invention, those skilled in the art can make various changes or modifications to the present invention, and these equivalent forms also fall within the scope defined by the appended claims of the present application.

Claims (15)

  1. 一种核酸切割体系,其特征在于,所述核酸切割体系包括:A nucleic acid cutting system, characterized in that the nucleic acid cutting system comprises:
    (a)向导DNA(gDNA);(a) guide DNA (gDNA);
    (b)可编程核酸内切酶Argonaute(Ago);和(b) the programmable endonuclease Argonaute (Ago); and
    (c)任选的报告核酸,其中若所述报告核酸被剪切,所述的剪切是可以被检测出的。(c) An optional reporter nucleic acid, wherein the cleavage is detectable if the reporter nucleic acid is cleaved.
  2. 如权利要求1所述的核酸切割体系,其特征在于,所述核酸切割体系的温度为65-90℃,较佳地为70-85℃,更佳地为75-85℃。The nucleic acid cutting system according to claim 1, wherein the temperature of the nucleic acid cutting system is 65-90°C, preferably 70-85°C, more preferably 75-85°C.
  3. 如权利要求1所述的核酸切割体系,其特征在于,所述的可编程核酸内切酶Argonaute来源于嗜热菌(Thermus parvatiensis),所述的可编程核酸内切酶Argonaute是可编程核酸内切酶TpsAgo。The nucleic acid cutting system of claim 1, wherein the programmable endonuclease Argonaute is derived from Thermus parvatiensis, and the programmable endonuclease Argonaute is a programmable endonuclease Dicer TpsAgo.
  4. 如权利要求1所述的核酸切割体系,其特征在于,所述的核酸切割体系还包括:(d)二价金属离子。The nucleic acid cutting system according to claim 1, wherein the nucleic acid cutting system further comprises: (d) divalent metal ions.
  5. 如权利要求1所述的核酸切割体系,其特征在于,所述的向导DNA的5’端具有选自下组的修饰:5’-P、5’-OH、5’-Biotin、5’-NH 2C 6、5’-FAM,或5’-SHC 6The nucleic acid cutting system of claim 1, wherein the 5' end of the guide DNA has a modification selected from the group consisting of 5'-P, 5'-OH, 5'-Biotin, 5'- NH2C6 , 5'-FAM, or 5' - SHC6 .
  6. 如权利要求1所述的核酸切割体系,其特征在于,所述的核酸切割体系还包括:(e)缓冲液。The nucleic acid cutting system according to claim 1, wherein the nucleic acid cutting system further comprises: (e) a buffer.
  7. 如权利要求6所述的核酸切割体系,其特征在于,所述缓冲液中,NaCl的浓度为0-4800mM,较佳地为40-1600mM,更佳地为80-800mM,更佳地为80-400mM。The nucleic acid cutting system according to claim 6, wherein, in the buffer, the concentration of NaCl is 0-4800mM, preferably 40-1600mM, more preferably 80-800mM, more preferably 80 -400mM.
  8. 如权利要求1所述的核酸切割体系,其特征在于,所述的报告核酸是单链核酸,包括单链DNA(ssDNA)或单链RNA(ssRNA)。The nucleic acid cutting system according to claim 1, wherein the reporter nucleic acid is a single-stranded nucleic acid, including single-stranded DNA (ssDNA) or single-stranded RNA (ssRNA).
  9. 如权利要求1所述的核酸切割体系,其特征在于,所述的报告核酸是质粒DNA,并且所述的向导DNA(gDNA)包括正向gDNA和反向gDNA;The nucleic acid cutting system of claim 1, wherein the reporter nucleic acid is plasmid DNA, and the guide DNA (gDNA) comprises forward gDNA and reverse gDNA;
    其中,所述的正向gDNA与所述质粒DNA的一条链可形成第一反向互补区,所述反向gDNA与所述质粒DNA的另一条链可形成第二反向互补区。Wherein, the forward gDNA and one strand of the plasmid DNA can form a first reverse complementary region, and the reverse gDNA and the other strand of the plasmid DNA can form a second reverse complementary region.
  10. 如权利要求9所述的核酸切割体系,其特征在于,所述的质粒DNA是超螺旋状态的。The nucleic acid cutting system of claim 9, wherein the plasmid DNA is in a supercoiled state.
  11. 一种富集低丰度目标核酸的反应体系,其特征在于,所述反应体系用于对一核酸样本同时进行依赖解旋酶等温扩增技术(Helicase-dependent isothermal DNA amplification)和核酸切割反应,从而获得扩增-切割反应产物;A reaction system for enriching low-abundance target nucleic acid, characterized in that the reaction system is used to simultaneously perform Helicase-dependent isothermal DNA amplification and nucleic acid cleavage reaction on a nucleic acid sample, Thereby the amplification-cleavage reaction product is obtained;
    其中,所述的核酸样本含有第一核酸和第二核酸,其中,所述的第一核酸为所述目标核酸,而所述的第二核酸为非目标核酸;Wherein, the nucleic acid sample contains a first nucleic acid and a second nucleic acid, wherein the first nucleic acid is the target nucleic acid, and the second nucleic acid is a non-target nucleic acid;
    所述的核酸切割反应用于特异性切割非目标核酸,但不切割所述目的核酸;The nucleic acid cleavage reaction is used for specifically cleaving non-target nucleic acid, but not cleaving the target nucleic acid;
    所述的扩增-切割反应体系含有(i)进行等温扩增反应所需的试剂和(ii)如权利要求1所述的核酸切割体系。The amplification-cleavage reaction system contains (i) reagents required for isothermal amplification reaction and (ii) the nucleic acid cleavage system according to claim 1 .
  12. 一种富集低丰度目标核酸的方法,其特征在于,包括步骤:A method for enriching low-abundance target nucleic acid, comprising the steps of:
    (a)提供一核酸样本,所述的核酸样本含有第一核酸和第二核酸,其中,所述的第一核酸为所述目标核酸,而所述的第二核酸为非目标核酸,(a) providing a nucleic acid sample, the nucleic acid sample contains a first nucleic acid and a second nucleic acid, wherein the first nucleic acid is the target nucleic acid, and the second nucleic acid is a non-target nucleic acid,
    并且,所述目标核酸在所述的核酸样本中的丰度为F1a;And, the abundance of the target nucleic acid in the nucleic acid sample is F1a;
    (b)对所述核酸样本中的核酸为模板,在扩增-切割反应体系中进行等温扩增和核酸切割反应,从而获得扩增-切割反应产物;(b) using the nucleic acid in the nucleic acid sample as a template, performing isothermal amplification and nucleic acid cleavage reaction in an amplification-cleavage reaction system, thereby obtaining an amplification-cleavage reaction product;
    其中,所述的核酸切割反应用于特异性切割非目标核酸,但不切割所述目的核酸;Wherein, the nucleic acid cleavage reaction is used for specifically cleaving non-target nucleic acid, but not cleaving the target nucleic acid;
    并且,所述的扩增-切割反应体系含有(i)进行等温扩增反应所需的试剂和(ii)如权利要求1所述的核酸切割体系;And, the amplification-cleavage reaction system contains (i) reagents required for isothermal amplification reaction and (ii) the nucleic acid cleavage system according to claim 1;
    其中,所述目标核酸在所述的扩增-切割反应产物中的丰度为F1b,Wherein, the abundance of the target nucleic acid in the amplification-cleavage reaction product is F1b,
    其中,F1b/F1a的比值≥5(更佳地为≥10)。Wherein, the ratio of F1b/F1a is ≥5 (more preferably ≥10).
  13. 一种用于检测靶标核酸分子的试剂盒,其特征在于,所述试剂盒包括:A test kit for detecting target nucleic acid molecules, characterized in that the test kit comprises:
    (i)如权利要求11所述的富集低丰度目标核酸反应体系或用于配制所述反应体系的试剂;(i) the enrichment low-abundance target nucleic acid reaction system of claim 11 or a reagent for preparing the reaction system;
    (ii)用于检测低丰度目标核酸的检测试剂;和(ii) detection reagents for detecting low-abundance target nucleic acids; and
    (ii)使用说明书,所述说明书描述了如权利要求12所述的方法。(ii) Instructions for use, which instructions describe the method of claim 12.
  14. 一种可编程核酸内切酶Argonaute的用途,其特征在于,用于制备检测靶标分子的试剂或试剂盒,或用于制备检测低丰度目标核酸的试剂或试剂盒。The use of a programmable endonuclease Argonaute is characterized in that it is used for preparing a reagent or kit for detecting target molecules, or for preparing a reagent or kit for detecting low-abundance target nucleic acid.
  15. 如权利要求14所述的用途,其特征在于,所述的可编程核酸内切酶Argonaute具有选自下组的氨基酸序列:The use according to claim 14, wherein the programmable endonuclease Argonaute has an amino acid sequence selected from the group consisting of:
    (i)如NCBI序列号WP_060384876.1所示的氨基酸序列;和(i) the amino acid sequence as set forth in NCBI SEQ ID NO: WP_060384876.1; and
    (ii)在如NCBI序列号WP_060384876.1所示序列的基础上,进行一个或多个氨基酸残基的替换、缺失、改变或插入,或在其N端或C端添加1至10个氨基酸残基(较佳地1至5个氨基酸残基,更佳地1至3个氨基酸残基),从而获得的氨基酸序列;并且所述获得的氨基酸序列与如NCBI序列号WP_060384876.1所示序列具有≥85%(优选地≥90%,更优选地≥95%,例如≥96%、≥97%、≥98%或≥99%)的序列同一性;并且所获得的氨基酸序列具备与(i)相同或相似的功能。(ii) on the basis of the sequence shown in NCBI sequence number WP_060384876.1, one or more amino acid residues are replaced, deleted, changed or inserted, or 1 to 10 amino acid residues are added to its N-terminus or C-terminus. base (preferably 1 to 5 amino acid residues, more preferably 1 to 3 amino acid residues), thereby the obtained amino acid sequence; and the obtained amino acid sequence has the same sequence as shown in NCBI serial number WP_060384876.1 ≥ 85% (preferably ≥ 90%, more preferably ≥ 95%, eg ≥ 96%, ≥ 97%, ≥ 98% or ≥ 99%) sequence identity; and the amino acid sequence obtained possesses the same as (i) same or similar functionality.
PCT/CN2022/088816 2021-04-29 2022-04-24 Representation and application of novel high temperature argonaute protein tpsago WO2022228351A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110474666.0A CN113201578B (en) 2021-04-29 2021-04-29 Novel high-temperature Argonaute protein Tpsago characterization and application
CN202110474666.0 2021-04-29

Publications (1)

Publication Number Publication Date
WO2022228351A1 true WO2022228351A1 (en) 2022-11-03

Family

ID=77029435

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/088816 WO2022228351A1 (en) 2021-04-29 2022-04-24 Representation and application of novel high temperature argonaute protein tpsago

Country Status (2)

Country Link
CN (1) CN113201578B (en)
WO (1) WO2022228351A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113201578B (en) * 2021-04-29 2023-06-23 上海交通大学 Novel high-temperature Argonaute protein Tpsago characterization and application
CN114561374A (en) * 2022-03-11 2022-05-31 上海交通大学 Novel thermophilic endonuclease mutant and preparation method and application thereof
CN116240200A (en) * 2022-07-01 2023-06-09 中国科学院基础医学与肿瘤研究所(筹) Ultrasensitive target nucleic acid enrichment detection method based on programmable nuclease
CN116606839A (en) * 2023-05-09 2023-08-18 湖北大学 Nucleic acid cutting system based on Argonaute protein Tcago and application thereof

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108796036A (en) * 2018-04-03 2018-11-13 上海交通大学 Nucleic acid detection method based on protokaryon Argonaute albumen and its application
CN109880891A (en) * 2019-04-22 2019-06-14 上海交通大学 The detection technique system and application of low abundance DNA mutation are enriched with based on nuclease coupling PCR principle
CN110229799A (en) * 2018-03-06 2019-09-13 北京贝瑞和康生物技术有限公司 Argonaute protein mutant and application thereof
CN111926117A (en) * 2020-08-18 2020-11-13 上海交通大学 SARS-CoV-2 virus nucleic acid isothermal rapid detection kit and detection method
CN113201578A (en) * 2021-04-29 2021-08-03 上海交通大学 Novel high-temperature Argonaute protein TpsAgo characterization and application

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113046355B (en) * 2021-04-20 2023-04-07 上海交通大学 Intermediate-temperature prokaryotic Argonaute protein PbAgo characterization and application
CN113106087B (en) * 2021-04-20 2023-02-24 上海交通大学 Characterization and application of novel high-temperature Argonaute protein

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110229799A (en) * 2018-03-06 2019-09-13 北京贝瑞和康生物技术有限公司 Argonaute protein mutant and application thereof
CN108796036A (en) * 2018-04-03 2018-11-13 上海交通大学 Nucleic acid detection method based on protokaryon Argonaute albumen and its application
CN109880891A (en) * 2019-04-22 2019-06-14 上海交通大学 The detection technique system and application of low abundance DNA mutation are enriched with based on nuclease coupling PCR principle
CN111926117A (en) * 2020-08-18 2020-11-13 上海交通大学 SARS-CoV-2 virus nucleic acid isothermal rapid detection kit and detection method
CN113201578A (en) * 2021-04-29 2021-08-03 上海交通大学 Novel high-temperature Argonaute protein TpsAgo characterization and application

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
WU JIN'EN; YANG JING; CHO WILLIAM C.; ZHENG YADONG: "Argonaute proteins: Structural features, functions and emerging roles", JOURNAL OF ADVANCED RESEARCH, ELSEVIER, AMSTERDAM, NL, vol. 24, 29 April 2020 (2020-04-29), AMSTERDAM, NL , pages 317 - 324, XP086216434, ISSN: 2090-1232, DOI: 10.1016/j.jare.2020.04.017 *

Also Published As

Publication number Publication date
CN113201578A (en) 2021-08-03
CN113201578B (en) 2023-06-23

Similar Documents

Publication Publication Date Title
WO2022228351A1 (en) Representation and application of novel high temperature argonaute protein tpsago
WO2022222920A1 (en) Characterization and application of novel high-temperature argonaute protein
WO2022222973A1 (en) New type medium-temperature prokaryotic argonaute protein pbago characterization and application thereof
JP2003510052A (en) Methods and compositions for improved polynucleotide synthesis
WO2023169228A1 (en) Novel thermophilic endonuclease mutant, and preparation method therefor and application thereof
CN107002067A (en) Utilize the method for heat endurance mispairing endonuclease
EP4159853A1 (en) Genome editing system and method
CN112239754A (en) Isothermal nucleic acid amplification method and application
CN115820604B (en) High-temperature Argonaute protein and application thereof
Nthangeni et al. Development of a versatile cassette for directional genome walking using cassette ligation-mediated PCR and its application in the cloning of complete lipolytic genes from Bacillus species
CN112029744A (en) DNA polymerase and its coding gene, preparation method and PCR application
Dong et al. A single digestion, single-stranded oligonucleotide mediated PCR-independent site-directed mutagenesis method
EP0877084B1 (en) Thermostable diaphorase gene
Kaplan et al. Recombinant production of Thermus aquaticus single-strand binding protein for usage as PCR enhancer
CN110093390B (en) DNA guide-guided RecJ protein with endonuclease activity and application thereof in gene editing
RU2791447C1 (en) DNA CUTTER BASED ON THE ScCas12a PROTEIN FROM THE BACTERIUM SEDIMENTISPHAERA CYANOBACTERIORUM
US11697805B2 (en) High-fidelity polymerase with preference for gapped DNA and use thereof
CN114958808B (en) CRISPR/Cas system for small-sized genome editing and special CasX protein thereof
CN113234795B (en) Method for detecting nucleic acid by using Cas protein
US20230125232A1 (en) Atp-dependent dna ligase
CN116024193A (en) TthAGO-based nucleic acid cleavage system, target nucleic acid molecule detection method and kit
CN116926170A (en) Nucleic acid detection method based on sulfur modified nucleic acid and sulfur modified nucleic acid recognition protein
CN116024192A (en) TbAgo-based nucleic acid cleavage system, and detection method and kit for target nucleic acid molecules
CN116606839A (en) Nucleic acid cutting system based on Argonaute protein Tcago and application thereof
CN116732140A (en) Nucleic acid detection system and application thereof in detecting DNA mutation

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22794821

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 22794821

Country of ref document: EP

Kind code of ref document: A1