WO2023109538A1 - Adaptor used for characterizing polynucleotide and use thereof - Google Patents

Adaptor used for characterizing polynucleotide and use thereof Download PDF

Info

Publication number
WO2023109538A1
WO2023109538A1 PCT/CN2022/136442 CN2022136442W WO2023109538A1 WO 2023109538 A1 WO2023109538 A1 WO 2023109538A1 CN 2022136442 W CN2022136442 W CN 2022136442W WO 2023109538 A1 WO2023109538 A1 WO 2023109538A1
Authority
WO
WIPO (PCT)
Prior art keywords
chain
polynucleotide
strand
optionally
blocking
Prior art date
Application number
PCT/CN2022/136442
Other languages
French (fr)
Chinese (zh)
Inventor
刘先宇
常馨
Original Assignee
成都齐碳科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 成都齐碳科技有限公司 filed Critical 成都齐碳科技有限公司
Publication of WO2023109538A1 publication Critical patent/WO2023109538A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/483Physical analysis of biological material
    • G01N33/487Physical analysis of biological material of liquid biological material
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/92Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving lipids, e.g. cholesterol, lipoproteins, or their receptors

Definitions

  • the application belongs to the field of gene sequencing, and relates to an adapter used in characterizing polynucleotides, and also relates to a method for characterizing polynucleotides using the adapter.
  • Nanopore sequencing technology has the characteristics of long read length, direct reading of modification information and parallel analysis of real-time data production.
  • Nucleic acid-related variations such as shearing and RNA editing
  • modification information including but not limited to methylation, acetylation, etc.
  • the platform supports parallel data production and analysis to realize real-time mutation/modification detection and diagnosis, and the portable design makes it have a wide range of application prospects.
  • Nanopores detect nucleotides that give current changes of known character and duration.
  • the blocking strand of the polynucleotide when no potential is applied, the blocking strand of the polynucleotide is often capable of stalling the helicase, preventing further movement of the helicase across the blocking strand along the target polynucleotide.
  • the helicase-polynucleotide complex when the helicase-polynucleotide complex is in contact with the transmembrane pore and a potential is applied, one or more stalled helicases can be moved across the blocking strand on the polynucleotide and along the polynucleotide to be tested. The sequence is moved to achieve the purpose of sequencing. Therefore, adapters containing enzyme-binding regions and blocking strands are required for nanopore sequencing.
  • nucleic acid adapters such as Y-shaped adapters or hairpin-like adapters (patent CN202111018113.0) are usually used, wherein motor proteins such as helicase are bound to the binding region of the adapters.
  • Adapters in the related art have a phenomenon of adenosine triphosphate (ATP) depletion in practical applications, that is, adapters that have not been sequenced through the holes will also consume a large amount of ATP. Since the reduction of ATP concentration in the sequencing environment will lead to a decrease in sequencing speed, the sequencing time is too short, which in turn affects the output of sequencing data. Therefore, there is currently a need for sequencing methods that can reduce ATP consumption.
  • ATP adenosine triphosphate
  • the purpose of the present application is to provide a new adapter, and the present application also provides a preparation method of the adapter and its use for nanopore sequencing. Using the adapter of the present application greatly reduces the ATP waste in nanopore sequencing.
  • the present application provides an adapter for characterizing a target polynucleotide, the adapter comprising ⁇ LS ⁇ n or ⁇ SL ⁇ n in the direction of the 5' to 3' end,
  • L is a modified double-stranded polynucleotide
  • S is a blocking strand
  • n is a positive integer
  • the L double strand comprises a polynucleotide strand L' connected to S and a complementary strand L" of said L',
  • the L' includes a first segment away from the blocking chain S and a second segment close to the blocking chain S, the first segment includes a modified part, and the second segment includes a motor protein binding active region , the active region is blocked by the complementary chain L";
  • the complementary chain L" comprises a polymer capable of competing with said motor protein for binding to the L' chain.
  • the adapter according to the present application wherein the adapter comprises ⁇ D 1 -LS ⁇ n or ⁇ SLD 1 ⁇ n in the direction from 5' to 3' end, D 1 is the first double-stranded polynucleotide,
  • the adapter comprises ⁇ LSD 2 ⁇ n or ⁇ D 2 -SL ⁇ n in the direction from the 5' to the 3' end, and D 2 is the second double-stranded polynucleotide;
  • n is an integer of 1-20, for example, n may be 1, 2, 3, 4, 5, 6, 7, 8 or more.
  • the modified part of the chain L' makes the binding ability of the first segment to motor protein weaker than that of the second segment, or makes the first segment not binds to motor proteins;
  • the modification part in the chain L' is ribonucleotide (RNA) and/or nucleic acid analogue;
  • the ribonucleotides include 2'-modified ribonucleotides, optionally 2'alkoxy-modified ribonucleotides, and more optionally 2'methoxy-modified ribonucleotides;
  • the nucleic acid analogs include any one or any combination of two or more of peptide nucleic acid (PNA), glycerol nucleic acid (GNA), threose nucleic acid (TNA), locked nucleic acid (LNA), and bridge nucleic acid (BNA).
  • PNA peptide nucleic acid
  • GNA glycerol nucleic acid
  • TAA threose nucleic acid
  • LNA locked nucleic acid
  • BNA bridge nucleic acid
  • the unmodified part of the chain L' is a deoxyribonucleotide
  • the number of modified polynucleotides is 1-20, more preferably 1-15, 1-10 or 2-6, further optionally 3, 4, 5.
  • modification is provided at one end (first section) of the polynucleotide chain L' away from the blocking chain S, and the binding ability of the modified end (first section) to the motor protein is weaker, so that in the preparation process of the adapter In , the motor protein is more likely to be combined into the end (the second segment) of the polynucleotide chain L' close to the blocking chain S, which is more conducive to the subsequent expulsion of the motor protein.
  • the modified part can be an optional modification, as long as the modification can weaken the binding of the modified part to the motor protein.
  • the length of the modification part can be determined according to the length of the polynucleotide chain L'.
  • the remaining parts can be selectively modified , without affecting the technical effect of the present application.
  • the end of the complementary strand L" close to the blocking strand S includes a part with stronger binding force to the polynucleotide strand L' or the second segment; optionally, This part contains PNA or LNA.
  • the main function of the complementary chain L" is to repel the motor protein by complementing the polynucleotide chain L'.
  • the motor protein can be expelled to the blocking strand by complementing the polynucleotide strand L'.
  • the end of the complementary chain L" away from the blocking chain S comprises a first part of click chemistry, an optional click reaction group
  • the D 1 double strand comprises a polynucleotide chain D 1 ′ connected to L’ and its complementary chain D 1 ′′, and one end of the complementary chain D 1 ′′ near the blocking chain S comprises the first click chemistry Two-part, optional click reactive group;
  • the D 2 double strand comprises a polynucleotide strand D 2 ′ connected to the blocking strand S and its complementary strand D 2 ′′, and the complementary strand D 2 ′′ includes a part that does not hybridize with the adapter ;
  • the part that does not hybridize with the adapter is located at one end of the complementary strand D 2 ′′ adjacent to the blocking strand S.
  • This moiety can be made more firmly bound by the introduction of click chemistry groups.
  • the blocking chain has a structure different from that of the polynucleotide and is used to block motor proteins
  • the blocker chain comprises one or more nitroindole, one or more inosine, one or more acridine, one or more 2-aminopurine, one or more 2-6- Diaminopurine, one or more 5-bromo-deoxyuridines, one or more inverted thymidines (inverted dTs), one or more inverted dideoxythymidines (ddTs), one or more dideoxy Cytidines (ddCs), one or more 5-methylcytidylic acid, one or more 5-hydroxymethylcytidine, one or more 2'alkoxy-modified ribonucleotides (optionally 2' methoxy-modified ribonucleotides), one or more isodeoxycytidines (iso-dCs), one or more isodeoxyguanosines (iso-dGs), one or more C3 groups, one or more One or more photocleavable (PC) groups, one or more hex
  • the number of 2' alkoxy-modified ribonucleotides is 1-10, optionally 2-6; optionally,
  • the 2'alkoxy-modified ribonucleotides are uniformly distributed in the blocking chain.
  • one end of the L' far away from the S comprises a leader strand sequence
  • the end of S away from L' is used to connect the target polynucleotide
  • the polynucleotide chain L' connected to S contains a motor protein binding active region at one end close to the blocking chain S, and the active region is blocked by the complementary chain L";
  • the motor protein is a protein capable of binding to a polynucleotide and controlling its movement through a pore; optionally, the motor protein is selected from one of polymerase, exonuclease, helicase and topoisomerase one or more, more optionally, the helicase is selected from one of Hel308 helicase, RecD helicase, Tral helicase, TrwC helicase, XPD helicase and DDA helicase or more.
  • the present application provides a complex comprising the adapter described in the present application, and the motor protein and/or the target polynucleotide, wherein the motor protein is located at the blocking chain .
  • the application provides a method for preparing the complex, comprising:
  • the method includes
  • the present application also provides a method for characterizing a target polynucleotide, the method using the adapter or the complex;
  • the method includes:
  • target polynucleotide is linked to said adapter or said complex
  • the present application also provides a kit for characterizing polynucleotides
  • composition of the kit is any one of the following 1)-4):
  • the polynucleotide chain L' for linking with S, the blocking chain S, and the complementary chain L" of the nucleotide chain L' optionally, also include an independent The packaged motor protein.
  • Y-Top-1 chain (Y1 chain) contains D 1 '-L'-SD 2 '
  • Y-Top-2 chain (Y2 chain) contains D 1 "
  • YB chain contains D 2 "
  • the star signal of the Y-Top-2 chain is a click reaction group, which is modified by dibenzocyclooctyne (DBCO, Dibenzocyclooctyne) in a specific embodiment
  • DBCO dibenzocyclooctyne
  • the PNA-R chain The triangle signal in is the click reaction group, which is N3 in the specific example.
  • the present application provides a novel adapter, which can greatly avoid ATP waste in nanopore sequencing and significantly improve sequencing efficiency.
  • Figure 1 is a schematic diagram of the principle of using the Y-type adapter of the present application to expel the helicase to the blocking chain; wherein, the Y-Top-1 chain (Y1 chain) contains D 1 '-L'-SD 2 ', Y- Top-2 chain (Y2 chain) contains D 1 ′′, and YB chain contains D 2 ′′; first, Y1 chain, Y2 chain, and YB chain are annealed, and then a helicase is added, which binds to the L' chain , and the Y-Top-2 chain contains a chemical click group; adding the PNA-R chain, the PNA-R chain can better combine with the L' in the Y-Top-1 chain (Y1 chain), and The chemical click reaction between the PNA-R strand and the Y-Top-2 strand further stabilizes the binding, thereby expelling the helicase bound at the L' strand to the S region along the 5' to 3' end direction, which The expulsion is driven by the binding force between
  • the star-shaped signal of the Y-Top-2 chain is a click reaction group, which is DBCO modification in a specific embodiment;
  • the triangle signal in the PNA-R chain is a click reaction group, In a specific embodiment, it is N3.
  • Fig. 2 is a quality inspection diagram of the adapter adapter complex before and after loading the fourth-strand PNA according to Example 1 of the present application.
  • Fig. 3 is a graph of ATP/NADH consumption measured after driving enzymes to blocker chains of different structures and sequences according to Example 2 of the present application.
  • Fig. 4 shows the reduction rate of different adapters in actual sequencing according to Example 3 of the present application.
  • polynucleotide includes two or more polynucleotides
  • anchor includes two or more anchors
  • helicase includes two or more Helicase
  • transmembrane pore includes two or more pores
  • the present application provides an adapter for characterizing polynucleotides, which comprises ⁇ LS ⁇ n or ⁇ SL ⁇ n in the direction of the 5' to 3' end, wherein L is a modified double-stranded polynucleotide , S is a blocking chain, and n is an integer; and, the L double-strand includes a polynucleotide chain L' connected to S and a complementary chain L "of the L', and the L' includes a chain far away from the blocking chain S
  • the first segment of the first segment, and the second segment near the blocking chain S the first segment includes a modified part, the second segment includes a motor protein binding active region, and the active region is blocked by the complementary chain L "Closed; complementary strand L" comprises a polymer capable of competing with the motor protein for binding to the L' strand.
  • the adapter comprises ⁇ D 1 -LS ⁇ n or ⁇ SLD 1 ⁇ n
  • D 1 is the first double-stranded polynucleotide
  • the adapter is in The direction from the 5' to the 3' end includes ⁇ LSD 2 ⁇ n or ⁇ D 2 -SL ⁇ n
  • D 2 is the second double-stranded polynucleotide
  • the n is an integer of 1-20, for example, n can be 1, 2, 3, 4, 5, 6, 7, 8, or more.
  • the modified part of the chain L' makes the binding ability of the first segment to motor protein weaker than that of the second segment, or makes the first segment not binds to motor proteins;
  • the modification part in the chain L' is ribonucleotide and/or nucleic acid analog.
  • the ribonucleotides include 2'-modified ribonucleotides, optionally 2'alkoxy-modified ribonucleotides, more optionally 2'methoxy-modified ribonucleotides or preferably, the nucleic acid analogue includes any one or any combination of two or more of peptide nucleic acid, glycerol nucleic acid, threose nucleic acid, locked nucleic acid, and bridge nucleic acid; and/or the motor in the chain L'
  • the protein binding active region is deoxyribonucleotides; and/or the number of ribonucleotides and/or nucleic acid analogs is 1-20, 1-15 or 1-10; optional 2-6, for example 2, 3, 4, 5 or 6.
  • the modification is provided at the end of the polynucleotide chain L' away from the blocking chain S, and the binding ability of the modified end to the motor protein is weaker, making it easier for the motor protein to bind to the polynucleotide chain during the preparation of the adapter L' is close to the end of the blocking chain S, which is more conducive to the subsequent expulsion of the motor protein.
  • the modified part can be an optional modification, as long as the modification can weaken the binding of the modified part to the motor protein.
  • the length of the modification part can be determined according to the length of the polynucleotide chain L'.
  • the remaining parts can be selectively modified without affecting the present application. technical effect.
  • the end of the complementary chain L" close to the blocking chain S contains a part with stronger binding force with the polynucleotide chain L'; optionally, this part contains PNA or LNA.
  • the main function of the complementary chain L" is to repel the motor protein by complementing the polynucleotide chain L'.
  • the motor protein can be expelled to the blocking strand by complementing the polynucleotide strand L'.
  • the end of the complementary chain L" far away from the blocking chain S comprises a first part of click chemistry, optionally a click reaction group;
  • the D 1 double strand comprises a polynucleotide chain D 1 ′ connected to L’ and its complementary chain D 1 ′′, and one end of the complementary chain D 1 ′′ near the blocking chain S comprises the first click chemistry Two-part, optional click reactive group;
  • the D 2 double strand comprises a polynucleotide strand D 2 ′ connected to the blocking strand S and its complementary strand D 2 ′′, and the complementary strand D 2 ′′ includes a part that does not hybridize with the adapter ;
  • the part that does not hybridize with the adapter is located at one end of the complementary strand D 2 ′′ adjacent to the blocking strand S.
  • This moiety can be made more firmly bound by the introduction of click chemistry groups.
  • the adapter according to the present application wherein the blocking chain has a structure different from that of the polynucleotide.
  • the present application provides a complex, which comprises the adapter described in the present application and a motor protein, wherein the motor protein is located at the blocking chain;
  • the motor protein is a protein capable of binding to a polynucleotide and controlling its movement through the pore; optionally an enzyme.
  • the enzyme is selected from one or more of polymerase, exonuclease, helicase and topoisomerase.
  • the helicase is selected from one or more of Hel308 helicase, RecD helicase, Tral helicase, TrwC helicase, XPD helicase and DDA helicase.
  • the application provides the preparation method of the complex, including:
  • the method includes
  • Polynucleotides such as nucleic acids are macromolecules containing two or more nucleotides.
  • a polynucleotide or nucleic acid may comprise any combination of nucleotides. Nucleotides can be naturally occurring or synthetic. One or more nucleotides in a polynucleotide may be oxidized or methylated. One or more nucleotides in a polynucleotide may be damaged.
  • a polynucleotide may comprise pyrimidine dimers. Such dimers are often associated with UV-induced damage and are the leading cause of skin melanoma.
  • One or more nucleotides in a polynucleotide may be modified, for example, with a label or tag. Suitable markers are described below.
  • the nucleotides in a polynucleotide are typically ribonucleotides or deoxyribonucleotides.
  • the polynucleotide may comprise the following nucleosides: adenosine, uridine, guanosine and cytidine.
  • the nucleotides may be deoxyribonucleotides.
  • the polynucleotide may optionally include the following nucleosides: deoxyadenosine (dA), deoxyuridine (dU) and/or thymidine (dT), deoxyguanosine (dG) and deoxycytidine (dC).
  • Nucleotides usually contain monophosphates, diphosphates or triphosphates. Phosphate can be attached on the 5" or 3" side of the nucleotide.
  • Suitable nucleotides include, but are not limited to, adenosine monophosphate (AMP), guanosine monophosphate (GMP), thymidine monophosphate (TMP), uridine monophosphate (UMP), cytidine monophosphate (CMP), Cyclic adenosine monophosphate (cAMP), cyclic guanosine monophosphate (cGMP), deoxyadenosine monophosphate (dAMP), deoxyguanosine monophosphate (dGMP), deoxythymidine monophosphate (dTMP), deoxyuridine monophosphate (dUMP ) and deoxycytidine monophosphate (dCMP).
  • AMP adenosine monophosphate
  • GMP guanosine monophosphate
  • TMP thymidine monophosphate
  • UMP uridine monophosphate
  • CMP cytidine monophosphate
  • cGMP cyclic
  • the nucleotides may optionally be selected from AMP, TMP, GMP, CMP, UMP, dAMP, dTMP, dGMP, dCMP and dUMP.
  • the nucleotides are most preferably selected from dAMP, dTMP, dGMP, dCMP and dUMP.
  • the polynucleotide may optionally comprise the following nucleotides: dAMP, dUMP and/or dTMP and dCMP.
  • nucleotides in the polynucleotide may be linked to each other in any manner. Nucleotides are usually linked by their sugar and phosphate groups, as in nucleic acids. The nucleotides may be linked via their nucleobases, as in pyrimidine dimers.
  • the polynucleotide may be a nucleic acid.
  • the polynucleotide can be any synthetic nucleic acid known in the art, such as peptide nucleic acid (PNA), glycerol nucleic acid (GNA), threose nucleic acid (TNA), locked nucleic acid (LNA), or other nucleic acid with nucleotide side chain of synthetic polymers.
  • PNA peptide nucleic acid
  • GNA glycerol nucleic acid
  • TNA threose nucleic acid
  • LNA locked nucleic acid
  • the PNA backbone consists of repeating N-(2-aminoethyl)-glycine units linked by peptide bonds.
  • the GNA backbone consists of repeating ethylene glycol units linked by phosphodiester bonds.
  • the TNA backbone consists of repeating threose sugars linked together by phosphodiester bonds.
  • the LNA is formed from nu
  • the polynucleotide is most preferably ribonucleic acid (RNA) or deoxyribonucleic acid (DNA).
  • the polynucleotide can be of any length.
  • a polynucleotide can be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400, or at least 500 nucleotides in length.
  • the polynucleotide may be 1000 or more nucleotides, 5000 or more nucleotides in length or 100000 or more nucleotides in length.
  • the helicase may move along all or only part of the target polynucleotide in the methods of the present application. All or part of the target polynucleotide can be characterized using the methods of the present application.
  • a polynucleotide of interest can be single-stranded. At least a portion of the polynucleotide of interest is optionally double-stranded. Helicases typically bind to single-stranded polynucleotides. If at least a portion of the target polynucleotide is double-stranded, the target polynucleotide optionally includes single-stranded regions or non-hybridizing regions. The one or more helicases are capable of binding to the single-stranded region or to one strand of the non-hybridizing region. The target polynucleotide optionally includes one or more single-stranded regions or one or more non-hybridizing regions.
  • the polynucleotide of interest is present in any suitable sample.
  • the application is generally performed on samples known to contain or suspected to contain the polynucleotide of interest.
  • the present application can be performed on a sample to identify one or more target polynucleotides that are known or expected to be present in the sample.
  • the sample can be a biological sample.
  • the present application may be practiced in vitro on samples obtained or extracted from any organism or microorganism.
  • the organisms or microorganisms are generally archaean, prokaryotic or eukaryotic, and generally belong to one of the following five kingdoms: Plantae, Animalia, Fungi, Prokaryotes and Protists.
  • the application is performed in vitro on samples obtained or extracted from any virus.
  • the sample is optionally a liquid sample. Samples typically include bodily fluids of a patient.
  • the sample may be urine, lymph, saliva, mucus or amniotic fluid, but may alternatively be blood, plasma or serum.
  • the sample is of human origin, but may alternatively be from other mammals, such as from a commercially bred animal such as a horse, cow, sheep or pig, or may be a pet such as a cat or dog.
  • samples of plant origin are usually obtained from commercial crops such as cereals, legumes, fruits or vegetables such as wheat, quinoa, barley, oats, canola, corn, soybeans, rice, bananas, apples, tomatoes, potatoes, grapes, Tobacco, beans, lentils, sugar cane, cocoa, cotton.
  • the sample can be a non-biological sample.
  • Non-biological samples can be selected as liquid samples. Examples of non-biological samples include surgical fluids, water such as drinking water, sea water or river water, and reagents for laboratory testing.
  • Samples are often processed prior to testing, such as by centrifugation or filtration through membranes to remove unwanted molecules or cells, such as red blood cells. Testing can be performed immediately after obtaining said sample. Typically the samples may also be stored prior to analysis, optionally below -70°C.
  • the one or more blocker strands are included in the target polynucleotide.
  • the one or more blocking strands are optionally part of the target polynucleotide, eg it/they interrupt the polynucleotide sequence.
  • the one or more blocker strands are optionally not part of one or more block molecules such as speed bumps for hybridization to the polynucleotide of interest.
  • blocker strands in the target polynucleotide such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more blocks chain.
  • blocker strands in the polynucleotide of interest there are 2, 4 or 6 blocker strands in the polynucleotide of interest.
  • blocker strands in different regions of the polynucleotide of interest for example a blocker strand in the leader sequence and a blocker strand in the hairpin loop.
  • the one or more blocking strands each provide an energy barrier that the one or more helicases cannot overcome even in active mode.
  • the one or more blocking strands can be achieved by reducing the pull of the helicase (e.g., by removing bases of nucleotides in the target polynucleotide) or physically blocking the movement of the one or more helicases (eg, using bulky chemical groups) to stall one or more helicases.
  • the one or more blocking strands may comprise any molecule or combination of any molecules that stall one or more helicases.
  • the one or more blocker strands may comprise any molecule or combination of any molecules that prevent movement of the one or more helicases along the target polynucleotide. It directly determines whether, in the absence of a transmembrane pore and an applied potential, one or more helicases lodges at one or more blocking strands. For example, this can be tested as shown in the Examples, eg the ability of the helicase to pass through the blocking strand and displace the complementary strand of DNA can be measured by PAGE.
  • the one or more blocking chains typically comprise linear molecules such as polymers.
  • the one or more blocking strands typically have a different structure than the target polynucleotide.
  • the target polynucleotide is DNA
  • the one or more blocking strands are typically not deoxyribonucleic acid.
  • the polynucleotide of interest is deoxyribonucleic acid (DNA) or ribonucleic acid (RNA)
  • the one or more blocking strands optionally include peptide nucleic acid (PNA), glycerol nucleic acid (GNA), threose nucleic acid ( TNA), locked nucleic acid (LNA) or synthetic polymers with nucleotide side chains.
  • One or more blocker strands optionally include one or more nitroindole, for example one or more 5-nitroindole, one or more inosine, one or more acridine, one or more 2 - aminopurines, one or more 2-6-diaminopurines, one or more 5-bromo-deoxyuridines, one or more reverse thymidines (reverse dTs), one or more reverse deoxythymidines glycosides (ddTs), one or more dideoxycytidines (ddCs), one or more 5-methylcytidines, one or more 5-hydroxymethylcytidines, one or more 2' alkoxy modifications Ribonucleotides (optionally 2'methoxy-modified ribonucleotides), one or more isodeoxycytidines (iso-dCs), one or more isodeoxyguanosines (iso-dGs), one or Multiple iSpC3 groups (i.e.
  • the one or more blocking chains may contain any number of these groups.
  • One or more blocker strands optionally comprise 2, 3, 4, 5, 6, 7, 8 or more iSp9 groups.
  • One or more blocker strands optionally comprise 2, 3, 4, 5 or 6 or more iSpl8 groups.
  • Optional chain-blocking groups are 4 iSp18 groups.
  • the polymer can be optionally a polypeptide or polyethylene glycol (PEG).
  • the polypeptide optionally comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more amino acids.
  • the PEG optionally comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more monomeric units.
  • One or more blocking strands optionally include one or more abasic nucleotides (i.e., nucleotides lacking nucleobases), e.g., 2, 3, 4, 5, 6, 7 , 8, 9, 10, 11, 12 or more abasic nucleotides.
  • a nucleobase can be replaced by -H(idSp) or -OH in an abasic nucleotide.
  • An abasic blocker strand can be inserted into a polynucleotide of interest by removing a nucleobase from one or more adjacent nucleotides.
  • the one or more blocking strands optionally contain one or more chemical groups that physically cause the one or more helicases to stall.
  • the one or more chemical groups may be one or more pendant chemical groups.
  • the one or more chemical groups may be linked to one or more nucleobases in the target polynucleotide.
  • the one or more chemical groups may be attached to the backbone of the polynucleotide of interest. There may be any number, such as 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more of these chemical groups.
  • Suitable groups include, but are not limited to, fluorophores, streptavidin and/or biotin, cholesterol, methylene blue, dinitrophenols (DNPs), digoxigenin and/or anti-digoxigenin and diphenyl Cyclooctyne group.
  • one blocker strand can include a linear molecule as discussed above, and the other blocker strand can include one or more chemical groups that physically cause one or more helicases to stall.
  • a blocking strand may comprise any linear molecule as discussed above and one or more chemical groups that physically cause one or more helicases to stall, such as one or more abasic groups and fluorophores.
  • a suitable blocker chain can be designed according to the type of target polynucleotide and carried out under the method conditions of the present application. Most helicases bind and move along DNA, so can be stalled by anything that is not DNA. Suitable molecules are described above.
  • the number of 2'methoxy-modified ribonucleotides is 1-10, more optionally 2-6; and/or the 2'methoxy-modified ribose Nucleotides are evenly distributed over the blocking strand.
  • the helicase may be or be derived from a Hel308 helicase, a RecD helicase, such as a TraI helicase or a TrwC helicase, an XPD helicase or a Dda helicase.
  • the helicase can be the helicase provided in Example 1, or the helicase disclosed in Chinese patent CN201880095718.X, or other helicases.
  • the polynucleotides of the present application can be linked covalently.
  • free copper click chemistry or copper catalyzed click chemistry can be used.
  • Click chemistry is used in these applications because of its desirable properties and its scope for generating covalent linkages between a variety of building blocks. For example, it is fast, clean and non-toxic, producing only harmless by-products.
  • Click chemistry is a term first introduced by Kolb et al. in 2001 to describe a broader set of robust, selective and modular building blocks that are reliably usable for both small-scale and large-scale applications (Kolb HC Finn, MG, Sharp less KB, click chemistry: diverse chemical function from a few good reactions, Angew. Chem. Int. Ed.
  • reaction must be modular, broad in scope, give very high yields, produce only harmless by-products that can be removed by non-chromatographic methods, and be stereogenic.
  • Required process features include simple reaction conditions (ideally the process should be insensitive to oxygen and water), readily available starting materials and reagents, solvent-free or use, the solvent is mild (e.g. water) or easily removed, and simple product isolation. Purification must be by non-chromatographic methods if necessary, such as crystallization or distillation, and the product must be stable under physiological conditions of".
  • Suitable examples of click chemistry include but are not limited to the following:
  • azides react with alkynes under stress, for example in cyclooctane rings
  • the click chemistry reaction is a Cu(I) catalyzed 1,3 dipolar cycloaddition between alkynes and azides.
  • the first group is an azide group and the second group is an alkyne group.
  • Nucleobases have been synthesized with insertion of azide and alkyne groups in alternative positions (e.g. Kocalka P, El-Sagheer AH, Brown T, Rapid and efficient DNA strand cross-linking by click chemistry, Chembiochem.2008.9(8 ):1280-5).
  • Alkyne groups were commercially available from Berry Associates (Michigan, USA), and azide groups were synthesized by ATDBio or IDT bio.
  • optionally reactive groups are azide and hexyl groups, eg azide N3 and DBCO.
  • the present application also provides a method for characterizing a target polynucleotide, the method using the adapter or the complex;
  • the method includes:
  • target polynucleotide is linked to said adapter or said complex
  • the methods of the present application comprise measuring one or more characteristics of said target polynucleotide.
  • the method may comprise measuring 2, 3, 4, 5 or more characteristics of the target polynucleotides.
  • the one or more characteristics may be selected from (i) the length of the target polynucleotide, (ii) the identity of the target polynucleotide, (iii) the sequence of the target polynucleotide, (iv) the secondary structure of the target polynucleotide; and (v) whether the target polynucleotide is modified. Any combination of (i) to (v) can be measured according to the present application.
  • the length of the polynucleotide can be determined, for example, by determining the number of interactions between the target polynucleotide and the pore, and the duration between interactions between the target polynucleotide and the pore.
  • the identity of the polynucleotides can be determined in a number of ways. Polynucleotide identity may be determined in conjunction with or without determination of the sequence of the polynucleotide of interest. The former is straightforward; the polynucleotide is sequenced and thus identified. The latter can be done in several ways. For example, the presence of a particular motif in a polynucleotide can be determined (without determining the rest of the sequence of the polynucleotide). Alternatively, specific electrical and/or optical signals determined in the method can identify a target polynucleotide from a specific source.
  • the sequence of the polynucleotide can be determined as previously described. Suitable sequencing methods, particularly those using electrical measurements, are described in Stoddart D et al., Proc Natl Acad Sci, 12; 106(19):7702-7, Lieberman KR et al, J Am Chem Soc. 2010; 132 (50):17961-72, and in International Application WO 2000/28312.
  • the secondary structure can be measured in various ways. For example, if the method involves electrical measurements, secondary structure can be measured using a change in residence time or a change in current through the pore. This allows regions of single- and double-stranded polynucleotides to be identified.
  • the presence or absence of any modification can be determined.
  • the method optionally includes determining whether said target polynucleotide is modified by methylation, oxidation, damage, with one or more proteins or with one or more markers, tags or blocker strands. Specific modifications will result in specific interactions with the pore, which can be determined using the methods described below. For example, cytosines can be identified from methylated cytosines based on the current flow through the pore during the interaction of the pore with each nucleotide.
  • the method is usually carried out in the presence of a buffer.
  • the buffer is present in the aqueous solution in the chamber. Any buffer may be used in the methods of the present application.
  • the buffer is phosphate buffered saline.
  • Other suitable buffers are HEPES and Tris-HCl buffers.
  • the method is typically carried out at a pH of 4.0 to 12.0, 4.5 to 10.0, 5.0 to 9.0, 5.5 to 8.8, 6.0 to 8.7, 7.0 to 8.8, or 7.5 to 8.5.
  • the pH used is optionally about 7.5.
  • the method may be performed at 0 to 100°C, 15°C to 95°C, 16°C to 90°C, 17°C to 85°C, 18°C to 80°C, 19°C to 70°C, or 20°C to 60°C.
  • the method is usually carried out at room temperature.
  • the method is optionally performed at a temperature that supports the function of the helicase, eg, about 37°C.
  • the method may be carried out in the presence of free nucleotides or free nucleotide analogs and/or cofactors which assist in the functioning of the helicase.
  • the method can also be carried out in the absence of free nucleotides or free nucleotide analogues and in the absence of cofactors for the helicase.
  • the free nucleotides may be any one or more of the individual nucleotides as discussed above.
  • Free nucleotides include, but are not limited to, adenosine monophosphate (AMP), adenosine diphosphate (ADP), adenosine triphosphate (ATP), guanosine monophosphate (GMP), guanosine diphosphate (GDP), guanosine triphosphate GTP, thymidine monophosphate (TMP), thymidine diphosphate (TDP), thymidine triphosphate (TTP), uridine monophosphate (UMP), uridine diphosphate (UDP), uridine triphosphate ( UTP), cytidine monophosphate (CMP), cytidine diphosphate (CDP), cytidine triphosphate (CTP), cyclic adenosine monophosphate (cAMP), cyclic guanosine monophosphate (cGMP), deoxyadenosine monophosphate ( dAMP), deoxyadenosine diphosphate (DADP), deoxyadenos
  • the free nucleotide may be selected from AMP, TMP, GMP, CMP, UMP, dAMP, dTMP, dGMP or dCMP.
  • the free nucleotide may be adenosine triphosphate (ATP).
  • a helicase cofactor is a factor that enables a helicase or construct to function.
  • the helicase cofactor can optionally be a divalent metal cation.
  • the divalent metal cation may be Mg 2+ , Mn 2+ , Ca 2+ or Co 2+ .
  • the helicase cofactor is most preferably Mg 2+ .
  • reduced coenzyme I is a chemical substance, which is the reduced state of nicotinamide adenine dinucleotide.
  • the present application also provides a kit for characterizing polynucleotides, said kit comprising said adapter or said complex.
  • the kit includes (a) one or more adapters, (b) one or more helicases.
  • the kit may include any of the helicases and wells discussed above.
  • the kit may also include membrane components, such as phospholipids, such as lipid bilayers, required for the formation of amphiphilic layers.
  • membrane components such as phospholipids, such as lipid bilayers, required for the formation of amphiphilic layers.
  • kits of the present application may additionally comprise one or more other reagents or instruments that enable the performance of any of the above-mentioned embodiments.
  • reagents or devices include one or more of the following: a suitable buffer (aqueous solution), means for obtaining a sample from a subject (such as a container or device containing a needle), amplification and/or Or a means for expressing polynucleotides, a membrane as defined above or a pressure clamp or patch clamp device.
  • the reagents may be present in a dry state in the kit such that a fluid sample resuspends the reagents.
  • the kit may also, optionally, include instructions for how to use the kit in the methods of the present application, or details of which patients the methods may be used for.
  • the kit optionally includes the necessary components to facilitate the movement of the helicase (eg ATP and Mg 2+ ).
  • Example 1 Preparation of a Y adapter-enzyme complex that reduces ATP empty consumption
  • SEQ ID NO: 6 GAATACGTTAGCGG, wherein SEQ ID NO: 6 is composed of PNA; PNA represents peptide nucleic acid, a class of DNA analogues that replace sugar and phosphate backbones with polypeptide backbones.
  • the complex is formed by the hybridization of 4 different strands
  • the first strand (Y-Top-1), which in turn contains the leader sequence, i.e. the iSpC3 blocking strand, denoted as 3, is connected to the 5' end of SEQ ID NO: 1, and its 3' end is connected to four i2OMeC and SEQ ID NO:2, the 3' end of SEQ ID NO:2 is connected to the blocking chain (R0-R6 as shown in Table 2) and SEQ ID NO:3, wherein i2OmeC and i2OmeG are 2''-O-methyl RNA, that is, 2'methoxy-modified RNA.
  • the second strand (Y-Top-2), DBCO is connected to the 5' end of SEQ ID NO:4.
  • the third strand (Y-Bottom), the 3' end of SEQ ID NO:4 is linked to SEQ ID NO:7.
  • PNA-R The fourth strand (PNA-R), [GAATACGTTAGCGG]pna-OO-azide (N3), where O is O-liker (also known as AEEA or eg1), was used to increase the solubility of PNA-R.
  • 3333333333333333333333333333333 represents C3 spacer modification.
  • Y-Top-2 DBCO- CGACTTCTACCGTTTGACTCCGC;
  • the final annealing system includes 160mM HEPES 7.0; 200mM NaCl, the final concentration of Y1 is 4-8 ⁇ M, and the Y-type adapter is finally formed.
  • TBE (native) PAGE gels were used to examine the mobility of samples 1 and 2 under the same conditions. After the PNA-R chain is loaded on the adapter, the mobility of sample 2 will decrease, which is slower than that of the control (sample 1) without PNA-R chain. Among them, the comparison results using the blocking chain as R0 are shown in Figure 2 Show.
  • Sample 2 was used on a DNAPac PA200 column with the following elution buffers (buffer A: 20 mM Na-CHES, 250 mM NaCl, 4% (W/V) glycerol, pH 8.6, buffer B: 20 mM Na-CHES, 1 M NaCl , 4% (W/V) glycerol, pH 8.6) for purification, sample 1 was loaded on the column, and the enzyme not bound to DNA was eluted from the column with buffer A. The enzyme-bound Y-adapter complex was then eluted with 10 column volumes of 0-100% buffer E. Then the main elution peaks were collected and their concentrations were measured for the detection of Example 2.
  • buffer A 20 mM Na-CHES, 250 mM NaCl, 4% (W/V) glycerol, pH 8.6
  • buffer B 20 mM Na-CHES, 1 M NaCl , 4% (W/V) glycerol, pH 8.6
  • Embodiment 2 ATPase activity detection
  • the NADH reaction mixture was prepared, and the reaction mixture was mainly prepared according to Table 1 below. After the preparation was completed, the reaction mixture was turned horizontally at room temperature and incubated for 10 minutes.
  • NADH reaction mixture 37.5 ⁇ L (20 nM) Y adapter-enzyme complex (i.e. any one of sample 1 and sample 2 after purification in Example 1 ) was added in the 96-well plate, and then Put it into an ultraviolet-visible spectrophotometer to measure the absorbance value at 380nm, set the temperature at 34°C; detect 200 cycles, each cycle is 5 minutes. The collected data were drawn into a standard curve and the ATP consumption value was obtained through the slope of the standard curve.
  • Example 3 On-machine test of the Y adapter-enzyme complex that reduces ATP empty consumption
  • a library of 10kb in length was prepared by end repair, and the Y adapter-enzyme complex (joint is R4) prepared in Example 1 to reduce ATP vacancy was used to connect the library, that is, the target polynucleotide, to build a library.
  • the acid linking position is the right end of the adapter shown in Figure 1. Take no addition of PNA-R chain as a control (denoted as RC).
  • Sequencing was performed using the nanopore sequencer QNome-9604 of Qitan Technology Co., Ltd., sequencing buffer: final concentration 10mM HEPEs, 100mM MgCl 2 , 375mM KCl, ATP 100mM, pH 7.1, sequencing temperature: 30-40°C.
  • results As shown in Figure 4, the control RC decreased by about 80 bp/s within 16 hours of sequencing; the rate of linker R4 decreased by about 10 bp/s within 16 hours of sequencing.
  • the sequencing rate is positively correlated with the ATP concentration. If the ATP concentration drops significantly during the sequencing process, the sequencing rate will also decrease.
  • B corresponding to A means that B is associated with A, and B can be determined according to A.
  • determining B according to A does not mean determining B only according to A, and B may also be determined according to A and/or other information.

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Organic Chemistry (AREA)
  • Wood Science & Technology (AREA)
  • Genetics & Genomics (AREA)
  • Physics & Mathematics (AREA)
  • Zoology (AREA)
  • Biochemistry (AREA)
  • Immunology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Analytical Chemistry (AREA)
  • Biophysics (AREA)
  • Biotechnology (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Microbiology (AREA)
  • Hematology (AREA)
  • Urology & Nephrology (AREA)
  • Food Science & Technology (AREA)
  • Medicinal Chemistry (AREA)
  • General Physics & Mathematics (AREA)
  • Pathology (AREA)
  • Cell Biology (AREA)
  • Endocrinology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Plant Pathology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Provided are an adaptor used for characterizing a polynucleotide and a use thereof. The adaptor contains {L-S}n or {S-L}n in the 5' to 3' end direction. The L double strand contains polynucleotide strand L' connected to S and complementary strand L" of L', and L' comprises a first segment far away from blocking strand S and a second segment close to blocking strand S. The first segment contains a modified portion. The second segment contains a motor protein binding active region. The active region is closed by complementary strand L". The complementary strand L" contains a polymer that can compete with a motor protein so as to bind to strand L'.

Description

用于表征多核苷酸的衔接体及其用途Adapters for characterizing polynucleotides and uses thereof
相关申请的交叉引用Cross References to Related Applications
本申请要求享有于2021年12月15日提交的名称为“用于表征多核苷酸中使用的衔接体及其用途”的中国专利申请202111534007.8的优先权,该申请的全部内容通过引用并入本文中。This application claims the priority of Chinese patent application 202111534007.8 entitled "Adapters used in characterizing polynucleotides and their uses" filed on December 15, 2021, the entire contents of which are incorporated herein by reference middle.
技术领域technical field
本申请属于基因测序领域,涉及一种表征多核苷酸中使用的衔接体,本申请还涉及使用所述衔接体表征多核苷酸的方法。The application belongs to the field of gene sequencing, and relates to an adapter used in characterizing polynucleotides, and also relates to a method for characterizing polynucleotides using the adapter.
背景技术Background technique
纳米孔测序技术具有长读长、直接读取修饰信息和实时数据生产并行分析的特点,在长片段核酸检测变异(包括但不仅限于点突变、插入缺失、倒位易位、基因融合、RNA异常剪切、RNA编辑等多种核酸相关变异)和修饰信息(包括但不仅限于甲基化、乙酰化等)检测方面比二代测序或其他测序平台有更明显优势。该平台支持数据生产和分析并行的特点实现了实时变异/修饰检出和诊断,加上便携式的设计,使其具有广泛的应用前景。Nanopore sequencing technology has the characteristics of long read length, direct reading of modification information and parallel analysis of real-time data production. Nucleic acid-related variations such as shearing and RNA editing) and modification information (including but not limited to methylation, acetylation, etc.) have obvious advantages over next-generation sequencing or other sequencing platforms. The platform supports parallel data production and analysis to realize real-time mutation/modification detection and diagnosis, and the portable design makes it have a wide range of application prospects.
对纳米孔两侧施加电压后,当分析物(例如多核苷酸、多肽)通过纳米孔时造成电流下降,不同结构的分析物所引起的电流阻断程度不同。当分析物在纳米孔的桶(barrel)中暂时停留一段时间时,电流会发生变化。纳米孔检测核苷酸给出已知特征和持续时间的电流变化。After applying a voltage to both sides of the nanopore, when the analyte (such as polynucleotide, polypeptide) passes through the nanopore, the current will drop, and the degree of current blockage caused by the analyte with different structures is different. When the analyte temporarily stays in the barrel of the nanopore for a period of time, the current changes. Nanopores detect nucleotides that give current changes of known character and duration.
在纳米孔测序技术中,未施加电势时,多核苷酸的阻断链通常能够使解旋酶停滞,防止解旋酶穿过阻断链沿目标多核苷酸进一步移动。但当解旋酶与多核苷酸复合物同跨膜孔接触并施加电势后,可以移动一个或多个停滞的解旋酶穿过多核苷酸上的阻断链,沿着待测多核苷酸序列进行移动, 从而达到测序的目的。因此,纳米孔测序中需要使用包含酶的结合区和阻断链的接头。In nanopore sequencing, when no potential is applied, the blocking strand of the polynucleotide is often capable of stalling the helicase, preventing further movement of the helicase across the blocking strand along the target polynucleotide. However, when the helicase-polynucleotide complex is in contact with the transmembrane pore and a potential is applied, one or more stalled helicases can be moved across the blocking strand on the polynucleotide and along the polynucleotide to be tested. The sequence is moved to achieve the purpose of sequencing. Therefore, adapters containing enzyme-binding regions and blocking strands are required for nanopore sequencing.
在纳米孔测序中,通常使用核酸接头如Y型衔接体或类发夹衔接体(专利CN202111018113.0),其中,马达蛋白如解旋酶结合到接头的结合区。相关技术中的衔接体在实际应用中存在腺嘌呤核苷三磷酸(Adenosine triphosphate,简称ATP)空耗现象,即没有进行过孔测序的接头也会消耗大量ATP。由于在测序环境中ATP浓度的降低会导致测序速度降低,测序时长过短,进而影响测序数据的产出量。因此,当前对能够减少ATP消耗的测序方法存在需求。In nanopore sequencing, nucleic acid adapters such as Y-shaped adapters or hairpin-like adapters (patent CN202111018113.0) are usually used, wherein motor proteins such as helicase are bound to the binding region of the adapters. Adapters in the related art have a phenomenon of adenosine triphosphate (ATP) depletion in practical applications, that is, adapters that have not been sequenced through the holes will also consume a large amount of ATP. Since the reduction of ATP concentration in the sequencing environment will lead to a decrease in sequencing speed, the sequencing time is too short, which in turn affects the output of sequencing data. Therefore, there is currently a need for sequencing methods that can reduce ATP consumption.
发明内容Contents of the invention
针对相关技术中技术的不足,本申请的目的在于提供一种新的衔接体,本申请还提供了所述衔接体的制备方法,及其用于纳米孔测序的用途。使用本申请的衔接体,极大地降低了纳米孔测序中的ATP空耗。In view of the technical deficiencies in the related art, the purpose of the present application is to provide a new adapter, and the present application also provides a preparation method of the adapter and its use for nanopore sequencing. Using the adapter of the present application greatly reduces the ATP waste in nanopore sequencing.
本申请提供了如下的实施例:The application provides the following examples:
一方面,本申请提供了一种用于表征目标多核苷酸的衔接体,所述衔接体在5'到3'端方向包含{L-S} n或{S-L} nIn one aspect, the present application provides an adapter for characterizing a target polynucleotide, the adapter comprising {LS} n or {SL} n in the direction of the 5' to 3' end,
其中,L为修饰的双链多核苷酸,S为阻断链,n为正整数;Wherein, L is a modified double-stranded polynucleotide, S is a blocking strand, and n is a positive integer;
并且,所述L双链包含与S连接的多核苷酸链L'及所述L'的互补链L”,And, the L double strand comprises a polynucleotide strand L' connected to S and a complementary strand L" of said L',
所述L'包括远离阻断链S的第一区段、和靠近阻断链S的第二区段,所述第一区段包含修饰部分,所述第二区段包含马达蛋白结合活性区,该活性区被所述互补链L”封闭;The L' includes a first segment away from the blocking chain S and a second segment close to the blocking chain S, the first segment includes a modified part, and the second segment includes a motor protein binding active region , the active region is blocked by the complementary chain L";
互补链L”包含能够与所述马达蛋白竞争性结合到L'链的聚合物。The complementary chain L" comprises a polymer capable of competing with said motor protein for binding to the L' chain.
根据本申请所述的衔接体,其中,所述衔接体在5'到3'端方向包含{D 1-L-S} n或{S-L-D 1} n,D 1为第一双链多核苷酸, The adapter according to the present application, wherein the adapter comprises {D 1 -LS} n or {SLD 1 } n in the direction from 5' to 3' end, D 1 is the first double-stranded polynucleotide,
和/或,所述衔接体在5'到3'端方向包含{L-S-D 2} n或{D 2-S-L} n,D 2为第二双链多核苷酸; And/or, the adapter comprises {LSD 2 } n or {D 2 -SL} n in the direction from the 5' to the 3' end, and D 2 is the second double-stranded polynucleotide;
可选地,所述n为1-20的整数,例如n可以为1、2、3、4、5、6、7、8或更多个。Optionally, the n is an integer of 1-20, for example, n may be 1, 2, 3, 4, 5, 6, 7, 8 or more.
根据本申请所述的衔接体,其中,所述链L'的修饰部分使所述第一区段与马达蛋白的结合能力弱于所述第二区段,或者使所述第一区段不与马达蛋白结合;According to the adapter described in the present application, wherein, the modified part of the chain L' makes the binding ability of the first segment to motor protein weaker than that of the second segment, or makes the first segment not binds to motor proteins;
可选地,所述链L'中修饰部分为核糖核苷酸(RNA)和/或核酸类似物;Optionally, the modification part in the chain L' is ribonucleotide (RNA) and/or nucleic acid analogue;
所述核糖核苷酸包括2′位修饰的核糖核苷酸,可选2′烷氧基修饰的核糖核苷酸,更可选2′甲氧基修饰的核糖核苷酸;The ribonucleotides include 2'-modified ribonucleotides, optionally 2'alkoxy-modified ribonucleotides, and more optionally 2'methoxy-modified ribonucleotides;
所述核酸类似物包括肽核酸(PNA)、甘油核酸(GNA)、苏糖核酸(TNA)、锁核酸(LNA)、桥核酸(BNA)中的任一种或两种以上任意组合,更可选地,所述链L'的未修饰部分为脱氧核糖核苷酸;The nucleic acid analogs include any one or any combination of two or more of peptide nucleic acid (PNA), glycerol nucleic acid (GNA), threose nucleic acid (TNA), locked nucleic acid (LNA), and bridge nucleic acid (BNA). Optionally, the unmodified part of the chain L' is a deoxyribonucleotide;
可选地,所述修饰的多核苷酸个数为1-20,更可选地为1-15、1-10或2-6,进一步可选地为3个、4个、5个。Optionally, the number of modified polynucleotides is 1-20, more preferably 1-15, 1-10 or 2-6, further optionally 3, 4, 5.
其中,在多核苷酸链L'中远离阻断链S的一端(第一区段)提供修饰,修饰端(第一区段)与马达蛋白的结合能力更弱,使得在衔接体的制备过程中,马达蛋白更容易结合到多核苷酸链L'中靠近阻断链S的一端(第二区段),从而更有利于后续对马达蛋白的驱逐。本领域技术人员可以理解的是,该修饰部分可以是任选的修饰,只要该修饰能够削弱修饰部分与马达蛋白的结合即可。所述修饰部分的长度可以根据多核苷酸链L'的长度确定。不限于任何特定理论,在保证多核苷酸链L'中靠近阻断链S的一端(第二区段)具有足够数目的多核苷酸结合马达蛋白后,其余的部分均可以选择性的被修饰,而不影响本申请的技术效果。Wherein, modification is provided at one end (first section) of the polynucleotide chain L' away from the blocking chain S, and the binding ability of the modified end (first section) to the motor protein is weaker, so that in the preparation process of the adapter In , the motor protein is more likely to be combined into the end (the second segment) of the polynucleotide chain L' close to the blocking chain S, which is more conducive to the subsequent expulsion of the motor protein. Those skilled in the art can understand that the modified part can be an optional modification, as long as the modification can weaken the binding of the modified part to the motor protein. The length of the modification part can be determined according to the length of the polynucleotide chain L'. Without being limited to any particular theory, after ensuring that one end (the second segment) of the polynucleotide chain L' near the blocking chain S has a sufficient number of polynucleotides bound to the motor protein, the remaining parts can be selectively modified , without affecting the technical effect of the present application.
根据本申请所述的衔接体,其中,所述互补链L”靠近阻断链S的一端包含与多核苷酸链L'或所述第二区段结合力更强的部分;可选地,该部分包含PNA或者LNA。According to the adapter described in the present application, wherein, the end of the complementary strand L" close to the blocking strand S includes a part with stronger binding force to the polynucleotide strand L' or the second segment; optionally, This part contains PNA or LNA.
所述互补链L”的主要作用为通过与多核苷酸链L'的互补,行使对马达蛋白的驱逐。本领域技术人员可以理解的是,不限于任何特定理论,只要互补链L”具有更强的结合力。在衔接体的制备中,就可以通过与多核苷酸链L'互补,将马达蛋白驱逐到阻断链。The main function of the complementary chain L" is to repel the motor protein by complementing the polynucleotide chain L'. Those skilled in the art can understand that it is not limited to any specific theory, as long as the complementary chain L" has more Strong binding force. In the preparation of the adapter, the motor protein can be expelled to the blocking strand by complementing the polynucleotide strand L'.
根据本申请所述的衔接体,其中,所述互补链L”远离阻断链S的一端 包含点击化学的第一部分,可选点击反应基团;According to the adapter described in the present application, wherein, the end of the complementary chain L" away from the blocking chain S comprises a first part of click chemistry, an optional click reaction group;
和/或,所述D 1双链包含与L'连接的多核苷酸链D 1'及其互补链D 1”,所述互补链D 1”靠近阻断链S的一端包含点击化学的第二部分,可选点击反应基团; And/or, the D 1 double strand comprises a polynucleotide chain D 1 ′ connected to L’ and its complementary chain D 1 ″, and one end of the complementary chain D 1 ″ near the blocking chain S comprises the first click chemistry Two-part, optional click reactive group;
和/或,所述D 2双链包含与阻断链S连接的多核苷酸链D 2'及其互补链D 2”,所述互补链D 2”包含不与所述衔接体杂交的部分;可选地,所述不与所述衔接体杂交的部分位于互补链D 2”临近阻断链S的一端。 And/or, the D 2 double strand comprises a polynucleotide strand D 2 ′ connected to the blocking strand S and its complementary strand D 2 ″, and the complementary strand D 2 ″ includes a part that does not hybridize with the adapter ; Optionally, the part that does not hybridize with the adapter is located at one end of the complementary strand D 2 ″ adjacent to the blocking strand S.
通过引入点击化学基团,可以使该部分的结合更牢固。This moiety can be made more firmly bound by the introduction of click chemistry groups.
根据本申请所述的衔接体,其中,所述阻断链具有与所述多核苷酸不同的结构,用于阻滞马达蛋白;The adapter according to the present application, wherein the blocking chain has a structure different from that of the polynucleotide and is used to block motor proteins;
可选地,所述阻断链包含一个或多个硝基吲哚、一个或多个肌苷、一个或多个吖啶、一个或多个2-氨基嘌呤、一个或多个2-6-二氨基嘌呤、一个或多个5-溴-脱氧尿嘧啶、一个或多个反向胸苷(反向dTs)、一个或多个反向二脱氧胸苷(ddTs)、一个或多个二脱氧胞苷(ddCs)、一个或多个5-甲基胞苷酸、一个或多个5-羟甲基胞苷、一个或多个2’烷氧基修饰的核糖核苷酸(可选2’甲氧基修饰的核糖核苷酸)、一个或多个异脱氧胞苷(异-dCs)、一个或多个异脱氧鸟苷(异-dGs)、一个或多个C3基团、一个或多个光裂解(PC)的基团、一个或多个己二醇、一个或多个iSp9基团、一个或多个iSp18基团、聚合物或一个或多个硫醇连接。Optionally, the blocker chain comprises one or more nitroindole, one or more inosine, one or more acridine, one or more 2-aminopurine, one or more 2-6- Diaminopurine, one or more 5-bromo-deoxyuridines, one or more inverted thymidines (inverted dTs), one or more inverted dideoxythymidines (ddTs), one or more dideoxy Cytidines (ddCs), one or more 5-methylcytidylic acid, one or more 5-hydroxymethylcytidine, one or more 2'alkoxy-modified ribonucleotides (optionally 2' methoxy-modified ribonucleotides), one or more isodeoxycytidines (iso-dCs), one or more isodeoxyguanosines (iso-dGs), one or more C3 groups, one or more One or more photocleavable (PC) groups, one or more hexanediols, one or more iSp9 groups, one or more iSp18 groups, polymers, or one or more thiol linkages.
在一个可选的实施方案中,2’烷氧基修饰的核糖核苷酸的数量为1-10个,可选为2-6个;可选地,In an optional embodiment, the number of 2' alkoxy-modified ribonucleotides is 1-10, optionally 2-6; optionally,
所述2′烷氧基修饰的核糖核苷酸均匀分布于所述阻断链。The 2'alkoxy-modified ribonucleotides are uniformly distributed in the blocking chain.
根据本申请所述的衔接体,其L'的远离S的一端包含先导链序列;According to the adapter described in the present application, one end of the L' far away from the S comprises a leader strand sequence;
S的远离L'的一端用于连接目标多核苷酸;The end of S away from L' is used to connect the target polynucleotide;
与S连接的多核苷酸链L'在靠近阻断链S的一端包含马达蛋白结合活性区,该活性区被所述互补链L”封闭;The polynucleotide chain L' connected to S contains a motor protein binding active region at one end close to the blocking chain S, and the active region is blocked by the complementary chain L";
所述马达蛋白为能够结合到多核苷酸并且控制其移动穿过孔的蛋白;可选地,所述马达蛋白选自聚合酶、核酸外切酶、解旋酶和拓扑异构酶中的一种或多种,更可选地,所述解旋酶选自Hel308解旋酶、RecD解旋酶、 Tral解旋酶、TrwC解旋酶、XPD解旋酶和DDA解旋酶中的一种或多种。The motor protein is a protein capable of binding to a polynucleotide and controlling its movement through a pore; optionally, the motor protein is selected from one of polymerase, exonuclease, helicase and topoisomerase one or more, more optionally, the helicase is selected from one of Hel308 helicase, RecD helicase, Tral helicase, TrwC helicase, XPD helicase and DDA helicase or more.
另一方面,本申请提供了一种复合物,所述复合物包含本申请所述的衔接体,和所述马达蛋白和/或所述目标多核苷酸,其中所述马达蛋白位于阻断链。In another aspect, the present application provides a complex comprising the adapter described in the present application, and the motor protein and/or the target polynucleotide, wherein the motor protein is located at the blocking chain .
再一方面,本申请提供了所述的复合物的制备方法,包括:In another aspect, the application provides a method for preparing the complex, comprising:
S1:使包含L'-S的Y1链与马达蛋白结合,所述结合的区域位于L'链;S1: binding the Y1 chain containing L'-S to the motor protein, and the binding region is located in the L' chain;
S2:加入包含互补链L”的PNA-R链,得到所述复合物,其中所述马达蛋白被PNA-R链驱赶到阻断链;S2: adding the PNA-R chain comprising the complementary chain L" to obtain the complex, wherein the motor protein is driven to the blocking chain by the PNA-R chain;
可选地,所述方法包括Optionally, the method includes
S101:使包含D 1'-L'-S-D 2'的Y1链、包含D 1”的Y2链和包含D 2”的YB链的退火产物与马达蛋白结合,所述结合区域位于退火产物的L'链处; S101: Binding the annealed product of the Y1 chain comprising D 1 '-L'-SD 2 ', the Y2 chain comprising D 1 ” and the YB chain comprising D 2 ” to the motor protein, the binding region is located at the L of the annealed product 'chain at;
S102:加入包含互补链L”的PNA-R链,得到所述复合物,其中马达蛋白通过PNA-R链驱赶到阻断链。S102: Adding the PNA-R chain containing the complementary chain L" to obtain the complex, wherein the motor protein is driven to the blocking chain by the PNA-R chain.
本申请还提供了一种表征目标多核苷酸的方法,所述方法使用所述衔接体或所述的复合物;The present application also provides a method for characterizing a target polynucleotide, the method using the adapter or the complex;
可选地,所述方法包括:Optionally, the method includes:
(a)使目标多核苷酸穿过跨膜孔移动,(a) moving the polynucleotide of interest across the transmembrane pore,
其中所述目标多核苷酸与所述的衔接体或所述的复合物连接;以及wherein said target polynucleotide is linked to said adapter or said complex; and
(b)随着所述多核苷酸相对于所述孔移动,获取一个或多个电和/或光测量值,其中所述测量值代表所述多核苷酸的一个或多个特征,并由此表征所述目标多核苷酸。(b) taking one or more electrical and/or optical measurements as the polynucleotide moves relative to the pore, wherein the measurements represent one or more characteristics of the polynucleotide and are determined by This characterizes the polynucleotide of interest.
再一方面,本申请还提供了一种用于表征多核苷酸的试剂盒,In another aspect, the present application also provides a kit for characterizing polynucleotides,
所述试剂盒的组成为如下1)-4)中任一种:The composition of the kit is any one of the following 1)-4):
1)包含独立包装的所述的衔接体,可选地,还包含独立包装的所述的马达蛋白;1) comprising the independently packaged adapter, optionally also comprising the independently packaged motor protein;
2)包含所述的复合物;2) comprising said complex;
3)包含所述制备方法得到的复合物;3) comprising the compound obtained by the preparation method;
4)包含分别独立包装的如下组分:4) Contains the following components individually packaged:
如所述的衔接体中的用于与S连接的多核苷酸链L'、所述阻断链S、 和所述核苷酸链L'的互补链L”,可选地,还包含独立包装的所述的马达蛋白。阻断链在纳米孔测序中,现有测序接头存在较为严重的ATP空耗现象,即没有进行过孔测序的接头也会消耗大量ATP。在测序火警中,ATP浓度的降低会导致测序速度降低,测序时长过短,进而影响到测序数据的产出值。本申请的申请人尝试提供一种方法,不需要ATP的消耗,将加载到衔接体上的马达蛋白驱赶到阻断链上,从而避免ATP的空耗。本申请的技术构思结合图1描述如下,图1为使用本申请的Y型衔接体将解旋酶驱逐到阻断链的原理示意图;其中,Y-Top-1链(Y1链)包含D 1'-L'-S-D 2',Y-Top-2链(Y2链)包含D 1”,YB链包含D 2”;首先,将Y1链、Y2链和YB链退火,然后加入解旋酶,所述解旋酶结合到L'链处,并且所述Y-Top-2链包含化学点击基团;加入PNA-R链,所述PNA-R链能够与Y-Top-1链(Y1链)中的L'更好的结合,并且PNA-R链与Y-Top-2链之间的化学点击反应进一步稳固了该结合,从而将结合于L'链处的解旋酶沿着5'到3'端方向驱逐到S区,该驱逐依靠的是双链之间的结合力的驱动,不需要消耗任何ATP,从而降低了实际测序过程中ATP的消耗; As described in the adapter, the polynucleotide chain L' for linking with S, the blocking chain S, and the complementary chain L" of the nucleotide chain L', optionally, also include an independent The packaged motor protein. Block chains In nanopore sequencing, there is a relatively serious phenomenon of ATP emptying in existing sequencing adapters, that is, adapters that have not been sequenced through holes will also consume a large amount of ATP. In sequencing fires, the ATP concentration The reduction will lead to a decrease in sequencing speed, and the sequencing time is too short, which will affect the output value of sequencing data. The applicant of this application tries to provide a method that does not require the consumption of ATP to drive the motor protein loaded on the adapter To the blocking chain, thereby avoiding the empty consumption of ATP.The technical concept of the present application is described as follows in conjunction with Fig. 1, and Fig. 1 is the schematic diagram of the principle of using the Y-type adapter of the present application to expel the helicase to the blocking chain; wherein, Y -Top-1 chain (Y1 chain) contains D 1 '-L'-SD 2 ', Y-Top-2 chain (Y2 chain) contains D 1 ", YB chain contains D 2 "; first, Y1 chain, Y2 chain and YB strand anneal, then add helicase, the helicase binds to the L' strand, and the Y-Top-2 strand contains a chemical click group; add the PNA-R strand, the PNA-R chain can better combine with the L' in the Y-Top-1 chain (Y1 chain), and the chemical click reaction between the PNA-R chain and the Y-Top-2 chain further stabilizes the combination, thereby binding to the The helicase at the L' strand is expelled to the S region along the direction from the 5' to the 3' end. This expulsion is driven by the binding force between the double strands and does not need to consume any ATP, thereby reducing the actual sequencing process. consumption of ATP;
其中,在图1中,所述Y-Top-2链的星型信号为点击反应基团,具体的实施例中为二苯并环辛炔(DBCO,Dibenzocyclooctyne)修饰;所述PNA-R链中的三角形信号为点击反应基团,具体的实施例中为N3。Wherein, in Fig. 1, the star signal of the Y-Top-2 chain is a click reaction group, which is modified by dibenzocyclooctyne (DBCO, Dibenzocyclooctyne) in a specific embodiment; the PNA-R chain The triangle signal in is the click reaction group, which is N3 in the specific example.
与相关技术相比,本申请的技术方案具备以下优点:Compared with related technologies, the technical solution of the present application has the following advantages:
本申请提供了一种新型的衔接体,使用该衔接体能极大避免纳米孔测序中的ATP空耗,显著地提高了测序效率。The present application provides a novel adapter, which can greatly avoid ATP waste in nanopore sequencing and significantly improve sequencing efficiency.
附图说明Description of drawings
为了更清楚地说明本申请实施例的技术方案,下面将对本申请实施例中所需要使用的附图作简单地介绍,显而易见地,下面所描述的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present application, the following will briefly introduce the accompanying drawings that need to be used in the embodiments of the present application. Obviously, the accompanying drawings described below are only some embodiments of the present application. Those of ordinary skill in the art can also obtain other drawings based on these drawings without making creative efforts.
图1为使用本申请的Y型衔接体将解旋酶驱逐到阻断链的原理示意图;其中,Y-Top-1链(Y1链)包含D 1'-L'-S-D 2',Y-Top-2链(Y2链)包含 D 1”,YB链包含D 2”;首先,将Y1链、Y2链和YB链退火,然后加入解旋酶,所述解旋酶结合到L'链处,并且所述Y-Top-2链包含化学点击基团;加入PNA-R链,所述PNA-R链能够与Y-Top-1链(Y1链)中的L'更好的结合,并且PNA-R链与Y-Top-2链之间的化学点击反应进一步稳固了该结合,从而将结合于L'链处的解旋酶沿着5'到3'端方向驱逐到S区,该驱逐依靠的是双链之间的结合力的驱动,不需要消耗任何ATP,从而降低了实际测序过程中ATP的消耗; Figure 1 is a schematic diagram of the principle of using the Y-type adapter of the present application to expel the helicase to the blocking chain; wherein, the Y-Top-1 chain (Y1 chain) contains D 1 '-L'-SD 2 ', Y- Top-2 chain (Y2 chain) contains D 1 ″, and YB chain contains D 2 ″; first, Y1 chain, Y2 chain, and YB chain are annealed, and then a helicase is added, which binds to the L' chain , and the Y-Top-2 chain contains a chemical click group; adding the PNA-R chain, the PNA-R chain can better combine with the L' in the Y-Top-1 chain (Y1 chain), and The chemical click reaction between the PNA-R strand and the Y-Top-2 strand further stabilizes the binding, thereby expelling the helicase bound at the L' strand to the S region along the 5' to 3' end direction, which The expulsion is driven by the binding force between the double strands and does not need to consume any ATP, thus reducing the consumption of ATP in the actual sequencing process;
其中,在图1中,所述Y-Top-2链的星型信号为点击反应基团,具体的实施例中为DBCO修饰;所述PNA-R链中的三角形信号为点击反应基团,具体的实施例中为N3。Wherein, in Fig. 1, the star-shaped signal of the Y-Top-2 chain is a click reaction group, which is DBCO modification in a specific embodiment; the triangle signal in the PNA-R chain is a click reaction group, In a specific embodiment, it is N3.
图2是根据本申请的实施例1,将第四链PNA加载前后,衔接体接头复合物的质检图。Fig. 2 is a quality inspection diagram of the adapter adapter complex before and after loading the fourth-strand PNA according to Example 1 of the present application.
图3是根据本申请的实施例2,将酶驱赶到不同的结构和序列的阻断链后,测定得到的ATP/NADH消耗的曲线图。Fig. 3 is a graph of ATP/NADH consumption measured after driving enzymes to blocker chains of different structures and sequences according to Example 2 of the present application.
图4是根据本申请的实施例3,实际测序中不同衔接体的速率降幅。Fig. 4 shows the reduction rate of different adapters in actual sequencing according to Example 3 of the present application.
具体实施方式Detailed ways
下面将详细描述本申请的各个方面的特征和示例性实施例。在下面的详细描述中,提出了许多具体细节,以便提供对本申请的全面理解。但是,对于本领域技术人员来说很明显的是,本申请可以在不需要这些具体细节中的一些细节的情况下实施。下面对实施例的描述仅仅是为了通过示出本申请的示例来提供对本申请的更好的理解。Features and exemplary embodiments of various aspects of the present application will be described in detail below. In the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the application. It will be apparent, however, to one skilled in the art that the present application may be practiced without some of these specific details. The following description of the embodiments is only to provide a better understanding of the present application by showing examples of the present application.
此外,除非内容另外明确指出,否则用于本说明书和所附权利要求书中的单数形式“一”,“一个”,和“所述”包括复数指代。因此,例如,涉及“多核苷酸”时包括两个或更多个多核苷酸,涉及“锚”时包括两个或更多个锚,涉及“解旋酶”时包括两个或更多个解旋酶,涉及“跨膜孔”时包括两个或更多个孔,等。Furthermore, as used in this specification and the appended claims, the singular forms "a," "an," and "the" include plural referents unless the content clearly dictates otherwise. Thus, for example, reference to "polynucleotide" includes two or more polynucleotides, reference to "anchor" includes two or more anchors, reference to "helicase" includes two or more Helicase, reference to "transmembrane pore" includes two or more pores, etc.
衔接体Adapter
本申请提供了一种用于表征多核苷酸的衔接体,所述衔接体在5'到3' 端方向包含{L-S} n或{S-L} n,其中,L为修饰的双链多核苷酸,S为阻断链,n为整数;并且,所述L双链包含与S连接的多核苷酸链L'及所述L'的互补链L”,所述L'包括远离阻断链S的第一区段、和靠近阻断链S的第二区段,所述第一区段包含修饰部分,所述第二区段包含马达蛋白结合活性区,该活性区被所述互补链L”封闭;互补链L”包含能够与所述马达蛋白竞争性结合到L'链的聚合物。 The present application provides an adapter for characterizing polynucleotides, which comprises {LS} n or {SL} n in the direction of the 5' to 3' end, wherein L is a modified double-stranded polynucleotide , S is a blocking chain, and n is an integer; and, the L double-strand includes a polynucleotide chain L' connected to S and a complementary chain L "of the L', and the L' includes a chain far away from the blocking chain S The first segment of the first segment, and the second segment near the blocking chain S, the first segment includes a modified part, the second segment includes a motor protein binding active region, and the active region is blocked by the complementary chain L "Closed; complementary strand L" comprises a polymer capable of competing with the motor protein for binding to the L' strand.
根据本申请所述的衔接体,其中,所述衔接体包含{D 1-L-S} n或{S-L-D 1} n,D 1为第一双链多核苷酸;和/或,所述衔接体在5'到3'端方向包含{L-S-D 2} n或{D 2-S-L} n,D 2为第二双链多核苷酸;可选地,所述n为1-20的整数,例如n可以为1、2、3、4、5、6、7、8,或更多个。 According to the adapter described in the present application, wherein the adapter comprises {D 1 -LS} n or {SLD 1 } n , D 1 is the first double-stranded polynucleotide; and/or, the adapter is in The direction from the 5' to the 3' end includes {LSD 2 } n or {D 2 -SL} n , D 2 is the second double-stranded polynucleotide; optionally, the n is an integer of 1-20, for example, n can be 1, 2, 3, 4, 5, 6, 7, 8, or more.
根据本申请所述的衔接体,其中,所述链L'的修饰部分使所述第一区段与马达蛋白的结合能力弱于所述第二区段,或者使所述第一区段不与马达蛋白结合;和/或According to the adapter described in the present application, wherein, the modified part of the chain L' makes the binding ability of the first segment to motor protein weaker than that of the second segment, or makes the first segment not binds to motor proteins; and/or
所述链L'中所述修饰部分为核糖核苷酸和/或核酸类似物。The modification part in the chain L' is ribonucleotide and/or nucleic acid analog.
可选地,所述核糖核苷酸包括2′位修饰的核糖核苷酸,可选2′烷氧基修饰的核糖核苷酸,更可选为2′甲氧基修饰的核糖核苷酸;或优先地,所述核酸类似物包括肽核酸、甘油核酸、苏糖核酸、锁核酸、桥核酸中的任一种或两种以上任意组合;和/或所述链L'中所述马达蛋白结合活性区为脱氧核糖核苷酸;和/或所述核糖核苷酸和/或核酸类似物的个数为1-20、1-15或1-10;可选2-6,例如为2个、3个、4个、5个或6个。Optionally, the ribonucleotides include 2'-modified ribonucleotides, optionally 2'alkoxy-modified ribonucleotides, more optionally 2'methoxy-modified ribonucleotides or preferably, the nucleic acid analogue includes any one or any combination of two or more of peptide nucleic acid, glycerol nucleic acid, threose nucleic acid, locked nucleic acid, and bridge nucleic acid; and/or the motor in the chain L' The protein binding active region is deoxyribonucleotides; and/or the number of ribonucleotides and/or nucleic acid analogs is 1-20, 1-15 or 1-10; optional 2-6, for example 2, 3, 4, 5 or 6.
其中,在多核苷酸链L'中远离阻断链S的一端提供修饰,修饰端与马达蛋白的结合能力更弱,使得在衔接体的制备过程中,马达蛋白更容易结合到多核苷酸链L'中靠近阻断链S的一端,从而更有利于后续对马达蛋白的驱逐。本领域技术人员可以理解的是,该修饰部分可以是任选的修饰,只要该修饰能够削弱修饰部分与马达蛋白的结合即可。所述修饰部分的长度可以根据多核苷酸链L'的长度确定。不限于任何特定理论,在保证多核苷酸链L'中靠近阻断链S的一端具有足够数目的多核苷酸结合马达蛋白后,其余的部分均可以选择性的被修饰,而不影响本申请的技术效果。Among them, the modification is provided at the end of the polynucleotide chain L' away from the blocking chain S, and the binding ability of the modified end to the motor protein is weaker, making it easier for the motor protein to bind to the polynucleotide chain during the preparation of the adapter L' is close to the end of the blocking chain S, which is more conducive to the subsequent expulsion of the motor protein. Those skilled in the art can understand that the modified part can be an optional modification, as long as the modification can weaken the binding of the modified part to the motor protein. The length of the modification part can be determined according to the length of the polynucleotide chain L'. Without being limited to any particular theory, after ensuring that one end of the polynucleotide chain L' near the blocking chain S has a sufficient number of polynucleotides bound to the motor protein, the remaining parts can be selectively modified without affecting the present application. technical effect.
根据本申请所述的衔接体,其中,所述互补链L”靠近阻断链S的一端 包含与多核苷酸链L'结合力更强的部分;可选地,该部分包含PNA或者LNA。According to the adapter described in the present application, wherein, the end of the complementary chain L" close to the blocking chain S contains a part with stronger binding force with the polynucleotide chain L'; optionally, this part contains PNA or LNA.
所述互补链L”的主要作用为通过与多核苷酸链L'的互补,行使对马达蛋白的驱逐。本领域技术人员可以理解的是,不限于任何特定理论,只要互补链L”具有更强的结合力。在衔接体的制备中,就可以通过与多核苷酸链L'互补,将马达蛋白驱逐到阻断链。The main function of the complementary chain L" is to repel the motor protein by complementing the polynucleotide chain L'. Those skilled in the art can understand that it is not limited to any specific theory, as long as the complementary chain L" has more Strong binding force. In the preparation of the adapter, the motor protein can be expelled to the blocking strand by complementing the polynucleotide strand L'.
根据本申请所述的衔接体,其中,所述互补链L”远离阻断链S的一端包含点击化学的第一部分,可选点击反应基团;According to the adapter described in the present application, wherein, the end of the complementary chain L" far away from the blocking chain S comprises a first part of click chemistry, optionally a click reaction group;
和/或,所述D 1双链包含与L'连接的多核苷酸链D 1'及其互补链D 1”,所述互补链D 1”靠近阻断链S的一端包含点击化学的第二部分,可选点击反应基团; And/or, the D 1 double strand comprises a polynucleotide chain D 1 ′ connected to L’ and its complementary chain D 1 ″, and one end of the complementary chain D 1 ″ near the blocking chain S comprises the first click chemistry Two-part, optional click reactive group;
和/或,所述D 2双链包含与阻断链S连接的多核苷酸链D 2'及其互补链D 2”,所述互补链D 2”包含不与所述衔接体杂交的部分;可选地,所述不与所述衔接体杂交的部分位于互补链D 2”临近阻断链S的一端。 And/or, the D 2 double strand comprises a polynucleotide strand D 2 ′ connected to the blocking strand S and its complementary strand D 2 ″, and the complementary strand D 2 ″ includes a part that does not hybridize with the adapter ; Optionally, the part that does not hybridize with the adapter is located at one end of the complementary strand D 2 ″ adjacent to the blocking strand S.
通过引入点击化学基团,可以使该部分的结合更牢固。This moiety can be made more firmly bound by the introduction of click chemistry groups.
根据本申请所述的衔接体,其中,所述阻断链具有与所述多核苷酸不同的结构。The adapter according to the present application, wherein the blocking chain has a structure different from that of the polynucleotide.
复合物Complex
本申请提供了一种复合物,所述复合物包含本申请所述的衔接体和马达蛋白,其中所述马达蛋白位于阻断链;The present application provides a complex, which comprises the adapter described in the present application and a motor protein, wherein the motor protein is located at the blocking chain;
可选地,所述马达蛋白为能够结合到多核苷酸并且控制其移动穿过孔的蛋白;可选为酶。例如,所述酶选自聚合酶、核酸外切酶、解旋酶和拓扑异构酶中的一种或多种。例如,所述解旋酶选自Hel308解旋酶、RecD解旋酶、Tral解旋酶、TrwC解旋酶、XPD解旋酶和DDA解旋酶中的一种或多种。Optionally, the motor protein is a protein capable of binding to a polynucleotide and controlling its movement through the pore; optionally an enzyme. For example, the enzyme is selected from one or more of polymerase, exonuclease, helicase and topoisomerase. For example, the helicase is selected from one or more of Hel308 helicase, RecD helicase, Tral helicase, TrwC helicase, XPD helicase and DDA helicase.
本申请提供了所述的复合物的制备方法,包括:The application provides the preparation method of the complex, including:
S1:使包含L'-S的Y1链与马达蛋白结合,所述结合区域位于L'链;S1: binding the Y1 chain containing L'-S to the motor protein, and the binding region is located in the L' chain;
S2:加入包含互补链L”的PNA-R链,得到所述复合物,其中所述马达蛋白被PNA-R链驱赶到阻断链;S2: adding the PNA-R chain comprising the complementary chain L" to obtain the complex, wherein the motor protein is driven to the blocking chain by the PNA-R chain;
可选地,所述方法包括Optionally, the method includes
S101:使包含D 1'-L'-S-D 2'的Y1链、包含D 1”的Y2链和包含D 2”的YB链的退火产物与马达蛋白结合,所述结合区域位于退火产物的L'链处; S101: Binding the annealed product of the Y1 chain comprising D 1 '-L'-SD 2 ', the Y2 chain comprising D 1 ” and the YB chain comprising D 2 ” to the motor protein, the binding region is located at the L of the annealed product 'chain at;
S102:加入包含互补链L”的PNA-R链,得到所述复合物,其中马达蛋白通过PNA-R链驱赶到阻断链。S102: Adding the PNA-R chain containing the complementary chain L" to obtain the complex, wherein the motor protein is driven to the blocking chain by the PNA-R chain.
多核苷酸polynucleotide
多核苷酸如核酸是含有两个或更多个核苷酸的大分子。多核苷酸或核酸可包括任何核苷酸的任意组合。核苷酸可以是天然存在的或人工合成的。多核苷酸中的一个或多个核苷酸可以被氧化或甲基化。多核苷酸中的一个或多个核苷酸的可被损坏。例如,多核苷酸可包含嘧啶二聚体。此类二聚体通常与紫外线导致的损坏相关联,且是皮肤黑素瘤的首要原因。多核苷酸中的一个或多个核苷酸可被修饰,例如用标记物或标签。合适的标记物如下所述。Polynucleotides such as nucleic acids are macromolecules containing two or more nucleotides. A polynucleotide or nucleic acid may comprise any combination of nucleotides. Nucleotides can be naturally occurring or synthetic. One or more nucleotides in a polynucleotide may be oxidized or methylated. One or more nucleotides in a polynucleotide may be damaged. For example, a polynucleotide may comprise pyrimidine dimers. Such dimers are often associated with UV-induced damage and are the leading cause of skin melanoma. One or more nucleotides in a polynucleotide may be modified, for example, with a label or tag. Suitable markers are described below.
多核苷酸中的核苷酸通常为核糖核苷酸或脱氧核糖核苷酸。所述多核苷酸可包含以下核苷:腺苷,尿苷,鸟苷和胞苷。所述核苷酸可选脱氧核糖核苷酸。所述多核苷酸可选包括下列核苷:脱氧腺苷(dA),脱氧尿苷(dU)和/或胸苷(dT),脱氧鸟苷(dG)和脱氧胞苷(dC)。The nucleotides in a polynucleotide are typically ribonucleotides or deoxyribonucleotides. The polynucleotide may comprise the following nucleosides: adenosine, uridine, guanosine and cytidine. The nucleotides may be deoxyribonucleotides. The polynucleotide may optionally include the following nucleosides: deoxyadenosine (dA), deoxyuridine (dU) and/or thymidine (dT), deoxyguanosine (dG) and deoxycytidine (dC).
核苷酸通常含有单磷酸、二磷酸或三磷酸。磷酸盐可连接在核苷酸的5”或3”侧上。Nucleotides usually contain monophosphates, diphosphates or triphosphates. Phosphate can be attached on the 5" or 3" side of the nucleotide.
合适的核苷酸包括但不限于,单磷酸腺苷(AMP)、单磷酸鸟苷(GMP)、单磷酸胸苷(TMP)、单磷酸尿苷(UMP)、单磷酸胞苷(CMP),环磷酸腺苷(cAMP)、环磷酸鸟苷(cGMP)、脱氧单磷酸腺苷(dAMP),脱氧单磷酸鸟苷(dGMP)、脱氧单磷酸胸苷(dTMP)、脱氧单磷酸尿苷(dUMP)和脱氧单磷酸胞苷(dCMP)。所述核苷酸可选选自AMP、TMP、GMP、CMP、UMP、dAMP、dTMP、dGMP、dCMP和dUMP。所述核苷酸最可选自dAMP、dTMP、dGMP、dCMP和dUMP。所述多核苷酸可选包括以下核苷酸:dAMP、dUMP和/或dTMP和dCMP。Suitable nucleotides include, but are not limited to, adenosine monophosphate (AMP), guanosine monophosphate (GMP), thymidine monophosphate (TMP), uridine monophosphate (UMP), cytidine monophosphate (CMP), Cyclic adenosine monophosphate (cAMP), cyclic guanosine monophosphate (cGMP), deoxyadenosine monophosphate (dAMP), deoxyguanosine monophosphate (dGMP), deoxythymidine monophosphate (dTMP), deoxyuridine monophosphate (dUMP ) and deoxycytidine monophosphate (dCMP). The nucleotides may optionally be selected from AMP, TMP, GMP, CMP, UMP, dAMP, dTMP, dGMP, dCMP and dUMP. The nucleotides are most preferably selected from dAMP, dTMP, dGMP, dCMP and dUMP. The polynucleotide may optionally comprise the following nucleotides: dAMP, dUMP and/or dTMP and dCMP.
所述多核苷酸中的核苷酸可以任何方式彼此连接。核苷酸通常被它们的糖和的磷酸基连接,如在核酸中。所述核苷酸可通过它们的核碱基的连接,如嘧啶二聚体中。The nucleotides in the polynucleotide may be linked to each other in any manner. Nucleotides are usually linked by their sugar and phosphate groups, as in nucleic acids. The nucleotides may be linked via their nucleobases, as in pyrimidine dimers.
所述多核苷酸可以是核酸。所述多核苷酸可以是任何本领域已知的合成的核酸,例如肽核酸(PNA)、甘油核酸(GNA)、苏糖核酸(TNA)、锁核酸(LNA),或其它具有核苷酸侧链的合成聚合物。该PNA骨架由通过肽键连接的重复的N-(2-氨基乙基)-甘氨酸单元组成。该GNA骨架由通过磷酸二酯键连接的重复的乙二醇单元组成。该TNA骨架由通过磷酸二酯键连接在一起的重复的苏糖组成。该LNA是由如上所讨论的具有在核糖中连接2′'氧和4”碳的额外的桥的核苷酸形成。The polynucleotide may be a nucleic acid. The polynucleotide can be any synthetic nucleic acid known in the art, such as peptide nucleic acid (PNA), glycerol nucleic acid (GNA), threose nucleic acid (TNA), locked nucleic acid (LNA), or other nucleic acid with nucleotide side chain of synthetic polymers. The PNA backbone consists of repeating N-(2-aminoethyl)-glycine units linked by peptide bonds. The GNA backbone consists of repeating ethylene glycol units linked by phosphodiester bonds. The TNA backbone consists of repeating threose sugars linked together by phosphodiester bonds. The LNA is formed from nucleotides with an additional bridge linking the 2'' oxygen to the 4" carbon in the ribose sugar as discussed above.
该多核苷酸是最可选核糖核酸(RNA)或脱氧核糖核酸(DNA)。The polynucleotide is most preferably ribonucleic acid (RNA) or deoxyribonucleic acid (DNA).
所述多核苷酸可以为任何长度。例如,多核苷酸可以为至少10个,至少50个,至少100,至少150,至少200,至少250,至少300,至少400或至少500个核苷酸长度。所述多核苷酸可以为1000或更多个核苷酸,5000或更多个核苷酸长度或100000或更多个核苷酸长度。The polynucleotide can be of any length. For example, a polynucleotide can be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400, or at least 500 nucleotides in length. The polynucleotide may be 1000 or more nucleotides, 5000 or more nucleotides in length or 100000 or more nucleotides in length.
解旋酶可沿本申请的方法中的全部或仅部分目标多核苷酸移动。全部或部分目标多核苷酸可以使用本申请的方法进行表征。The helicase may move along all or only part of the target polynucleotide in the methods of the present application. All or part of the target polynucleotide can be characterized using the methods of the present application.
目标多核苷酸可以是单链的。所述目标多核苷酸的至少一部分可选为双链。解旋酶通常结合至单链多核苷酸。如果目标多核苷酸的至少一部分是双链的,所述目标多核苷酸可选地包括单链区域或非杂交区域。所述一个或多个解旋酶能够结合到单链区域或非杂交区域的一条链上。所述目标多核苷酸可选地包括一个或多个单链区域或一个或多个非杂交的区域。A polynucleotide of interest can be single-stranded. At least a portion of the polynucleotide of interest is optionally double-stranded. Helicases typically bind to single-stranded polynucleotides. If at least a portion of the target polynucleotide is double-stranded, the target polynucleotide optionally includes single-stranded regions or non-hybridizing regions. The one or more helicases are capable of binding to the single-stranded region or to one strand of the non-hybridizing region. The target polynucleotide optionally includes one or more single-stranded regions or one or more non-hybridizing regions.
样品sample
目标多核苷酸存在于任何合适的样品中。本申请通常在已知含有或怀疑含有目标多核苷酸的样品上实施。替换的,可以对样本实施本申请,以确认所识别的一个或多个目标多核苷酸,所述目标多核苷酸已知或预期存在于所述样本中。The polynucleotide of interest is present in any suitable sample. The application is generally performed on samples known to contain or suspected to contain the polynucleotide of interest. Alternatively, the present application can be performed on a sample to identify one or more target polynucleotides that are known or expected to be present in the sample.
所述样品可以是生物样品。本申请可以针对从任何生物体或微生物中获得或提取的样品在体外实施。所述生物体或微生物通常是古核的 (archaean),原核的或真核的,并且通常属于以下五界中的一个:植物界,动物界,真菌,原核生物和原生生物。本申请针对从任何病毒中获得或提取的样品在体外实施。所述样品可选是液体样品。样品通常包括患者的体液。所述样品可以是尿液,淋巴液,唾液,粘液或羊水,但可选血液,血浆或血清。通常,所述样品是来源于人的,但可替代地可以是来自其他哺乳动物的,如自商业上养殖的动物如马,牛,绵羊或猪,或者可以是宠物如猫或狗。或者,植物来源的样品通常从商业作物获得,如谷类,豆类,水果或蔬菜,例如小麦,藜,大麦,燕麦,芸苔,玉米,大豆,水稻,香蕉,苹果,西红柿,土豆,葡萄,烟草,菜豆,小扁豆,甘蔗,可可,棉花。The sample can be a biological sample. The present application may be practiced in vitro on samples obtained or extracted from any organism or microorganism. The organisms or microorganisms are generally archaean, prokaryotic or eukaryotic, and generally belong to one of the following five kingdoms: Plantae, Animalia, Fungi, Prokaryotes and Protists. The application is performed in vitro on samples obtained or extracted from any virus. The sample is optionally a liquid sample. Samples typically include bodily fluids of a patient. The sample may be urine, lymph, saliva, mucus or amniotic fluid, but may alternatively be blood, plasma or serum. Typically, the sample is of human origin, but may alternatively be from other mammals, such as from a commercially bred animal such as a horse, cow, sheep or pig, or may be a pet such as a cat or dog. Alternatively, samples of plant origin are usually obtained from commercial crops such as cereals, legumes, fruits or vegetables such as wheat, quinoa, barley, oats, canola, corn, soybeans, rice, bananas, apples, tomatoes, potatoes, grapes, Tobacco, beans, lentils, sugar cane, cocoa, cotton.
所述样品可以是非生物样品。非生物样品可选为液体样品。非生物样品的示例包括外科手术液体,水如饮用水,海水或河水,以及用于实验室测试的试剂。The sample can be a non-biological sample. Non-biological samples can be selected as liquid samples. Examples of non-biological samples include surgical fluids, water such as drinking water, sea water or river water, and reagents for laboratory testing.
样品通常在测试前被处理,例如通过离心或通过膜滤除不需要的分子或细胞,例如红血细胞。可以在获取所述样本后立即进行检测。通常也可以在分析前储存所述样本,可选低于-70℃。Samples are often processed prior to testing, such as by centrifugation or filtration through membranes to remove unwanted molecules or cells, such as red blood cells. Testing can be performed immediately after obtaining said sample. Typically the samples may also be stored prior to analysis, optionally below -70°C.
阻断链blocking chain
所述一个或多个阻断链包括在目标多核苷酸中。一个或多个阻断链可选是目标多核苷酸的一部分,例如它/它们中断多核苷酸序列。一个或多个阻断链可选不为一个或多个嵌段分子的一部分,该嵌段分子如与目标多核苷酸杂交的减速带。The one or more blocker strands are included in the target polynucleotide. The one or more blocking strands are optionally part of the target polynucleotide, eg it/they interrupt the polynucleotide sequence. The one or more blocker strands are optionally not part of one or more block molecules such as speed bumps for hybridization to the polynucleotide of interest.
在目标多核苷酸中具有任意数量的阻断链,如1个,2个,3个,4个,5个,6个,7个,8个,9个,10个或更多个阻断链。可选在目标多核苷酸中具有2个,4个或6个阻断链。目标多核苷酸的不同区域中可具有阻断链,例如前导序列中的阻断链和发卡环中的阻断链。Have any number of blocker strands in the target polynucleotide, such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more blocks chain. Optionally there are 2, 4 or 6 blocker strands in the polynucleotide of interest. There may be blocker strands in different regions of the polynucleotide of interest, for example a blocker strand in the leader sequence and a blocker strand in the hairpin loop.
一个或多个阻断链各提供了能量障碍,所述一个或多个解旋酶甚至在活动模式也不能克服该能量障碍。所述一个或多个阻断链可通过减少解旋酶的牵拉(例如通过除去目标多核苷酸中的核苷酸的碱基)或物理性地阻 断一个或多个解旋酶的移动(例如利用庞大的化学基团)来停滞一个或多个解旋酶。The one or more blocking strands each provide an energy barrier that the one or more helicases cannot overcome even in active mode. The one or more blocking strands can be achieved by reducing the pull of the helicase (e.g., by removing bases of nucleotides in the target polynucleotide) or physically blocking the movement of the one or more helicases (eg, using bulky chemical groups) to stall one or more helicases.
所述一个或多个阻断链可包括停滞一个或多个解旋酶的任意分子或任意分子的组合。所述一个或多个阻断链可以包括阻止所述一个或多个解旋酶沿目标多核苷酸移动的任意分子或任意分子的组合。其直接地确定在缺少跨膜孔和施加的电势的条件下,一个或多个解旋酶是否停留在一个或多个阻断链处。例如,这可如实施例中所示进行测试,例如解旋酶穿过阻断链且置换DNA的互补链的能力可以通过PAGE进行测量。The one or more blocking strands may comprise any molecule or combination of any molecules that stall one or more helicases. The one or more blocker strands may comprise any molecule or combination of any molecules that prevent movement of the one or more helicases along the target polynucleotide. It directly determines whether, in the absence of a transmembrane pore and an applied potential, one or more helicases lodges at one or more blocking strands. For example, this can be tested as shown in the Examples, eg the ability of the helicase to pass through the blocking strand and displace the complementary strand of DNA can be measured by PAGE.
一个或多个阻断链通常包括直链分子如聚合物。所述一个或多个阻断链通常具有与目标多核苷酸不同的结构。例如,如果所述目标多核苷酸是DNA,一个或多个阻断链通常不是脱氧核糖核酸。特别是,如果目标多核苷酸是脱氧核糖核酸(DNA)或核糖核酸(RNA),所述一个或多个阻断链可选包括肽核酸(PNA),甘油核酸(GNA),苏糖核酸(TNA),锁核酸(LNA)或具有核苷酸侧链的合成聚合物。The one or more blocking chains typically comprise linear molecules such as polymers. The one or more blocking strands typically have a different structure than the target polynucleotide. For example, if the target polynucleotide is DNA, the one or more blocking strands are typically not deoxyribonucleic acid. In particular, if the polynucleotide of interest is deoxyribonucleic acid (DNA) or ribonucleic acid (RNA), the one or more blocking strands optionally include peptide nucleic acid (PNA), glycerol nucleic acid (GNA), threose nucleic acid ( TNA), locked nucleic acid (LNA) or synthetic polymers with nucleotide side chains.
一个或多个阻断链可选包括一个或多个硝基吲哚,例如一个或多个5-硝基吲哚,一个或多个肌苷,一个或多个吖啶,一个或多个2-氨基嘌呤,一个或多个2-6-二氨基嘌呤,一个或多个5-溴-脱氧尿嘧啶,一个或多个反向胸苷(反向dTs),一个或多个反向脱氧胸苷(ddTs),一个或多个二脱氧胞苷(ddCs),一个或多个5-甲基胞苷,一个或多个5-羟甲基胞苷,一个或多个2’烷氧基修饰的核糖核苷酸(可选2’甲氧基修饰的核糖核苷酸),一个或多个异脱氧胞苷(异-dCs),一个或多个异脱氧鸟苷(异dGs),一个或多个iSpC3基团(即缺少糖和碱基的核苷酸),一个或多个光裂解(PC)基团,一个或多个己二醇基团,一个或多个阻断链9(iSp9)基团,一个或多个阻断链18(iSp18)基团,聚合物或一个或多个硫醇连接。所述一个或多个阻断链可包括这些基团的任意组合。许多这些基团可以购自(Integrated DNA)。One or more blocker strands optionally include one or more nitroindole, for example one or more 5-nitroindole, one or more inosine, one or more acridine, one or more 2 - aminopurines, one or more 2-6-diaminopurines, one or more 5-bromo-deoxyuridines, one or more reverse thymidines (reverse dTs), one or more reverse deoxythymidines glycosides (ddTs), one or more dideoxycytidines (ddCs), one or more 5-methylcytidines, one or more 5-hydroxymethylcytidines, one or more 2' alkoxy modifications Ribonucleotides (optionally 2'methoxy-modified ribonucleotides), one or more isodeoxycytidines (iso-dCs), one or more isodeoxyguanosines (iso-dGs), one or Multiple iSpC3 groups (i.e., nucleotides lacking sugar and base), one or more photocleavage (PC) groups, one or more hexanediol groups, one or more blocker chain 9 (iSp9 ) group, one or more chain-blocking 18 (iSp18) groups, a polymer or one or more thiol linkages. The one or more blocking chains may comprise any combination of these groups. Many of these groups are commercially available from (Integrated DNA).
所述一个或多个阻断链可包含任何数量的这些基团。例如,对于2-氨基嘌呤,2-6-二氨基嘌呤,5-溴脱氧尿苷,反向dTs,ddTs,ddCs,5-甲基胞苷,5-羟甲基胞苷,2’烷氧基修饰的核糖核苷酸(可选2’甲氧基修饰的 核糖核苷酸),异dCs,异dGs,iSpC3基团,PC基团,己二醇基团和硫醇连接,一个或多个阻断链可选包含2个、3个、4个、5个、6个、7个、8个、9个、10个、11个、12个或更多。一个或多个阻断链可选包含2个、3个、4个、5个、6个、7个、8个或更多iSp9基团。一个或多个阻断链可选包含2个、3个、4个、5个或6个或更多iSp18基团。可选的阻断链基团是4个iSp18基团。The one or more blocking chains may contain any number of these groups. For example, for 2-aminopurine, 2-6-diaminopurine, 5-bromodeoxyuridine, reverse dTs, ddTs, ddCs, 5-methylcytidine, 5-hydroxymethylcytidine, 2'alkoxy base-modified ribonucleotides (optionally 2'methoxy-modified ribonucleotides), iso-dCs, iso-dGs, iSpC3 groups, PC groups, hexanediol groups and thiol linkages, one or more Each blocking chain may optionally comprise 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more. One or more blocker strands optionally comprise 2, 3, 4, 5, 6, 7, 8 or more iSp9 groups. One or more blocker strands optionally comprise 2, 3, 4, 5 or 6 or more iSpl8 groups. Optional chain-blocking groups are 4 iSp18 groups.
聚合物可选为多肽或聚乙二醇(PEG)。所述多肽可选地包含2个、3个、4个、5个、6个、7个、8个、9个、10个、11个、12个或更多个氨基酸。所述PEG可选包含2个、3个、4个、5个、6个、7个、8个、9个、10个、11个、12个或更多单体单元。The polymer can be optionally a polypeptide or polyethylene glycol (PEG). The polypeptide optionally comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more amino acids. The PEG optionally comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more monomeric units.
一个或多个阻断链可选包括一个或多个无碱基核苷酸(即缺乏核碱基的核苷酸),例如2个、3个、4个、5个、6个、7个、8个、9个、10个、11个、12个或更多个无碱基的核苷酸。核碱基可以被无碱基核苷酸中的-H(idSp)或-OH置换。无碱基的阻断链可以通过从一个或多个相邻的核苷酸中除去核碱基而被插入到目标多核苷酸中。One or more blocking strands optionally include one or more abasic nucleotides (i.e., nucleotides lacking nucleobases), e.g., 2, 3, 4, 5, 6, 7 , 8, 9, 10, 11, 12 or more abasic nucleotides. A nucleobase can be replaced by -H(idSp) or -OH in an abasic nucleotide. An abasic blocker strand can be inserted into a polynucleotide of interest by removing a nucleobase from one or more adjacent nucleotides.
一个或多个阻断链可选包含一个或多个物理上导致一个或多个解旋酶停滞的化学基团。所述一个或多个化学基团可选为一个或多个侧挂的化学基团。所述一个或多个化学基团可以连接到目标多核苷酸中的一个或更多个核碱基。所述一个或多个化学基团可以连接到目标多核苷酸的骨架。可存在任何数量,如2个、3个、4个、5个、6个、7个、8个、9个、10个、11个、12个或更多个这些化学基团。合适的基团包括但不限于,荧光团,链霉亲和素和/或生物素,胆固醇,亚甲基蓝,二硝基苯酚(DNPs),洋地黄毒苷和/或抗洋地黄毒苷和二苯基环辛炔基团。The one or more blocking strands optionally contain one or more chemical groups that physically cause the one or more helicases to stall. The one or more chemical groups may be one or more pendant chemical groups. The one or more chemical groups may be linked to one or more nucleobases in the target polynucleotide. The one or more chemical groups may be attached to the backbone of the polynucleotide of interest. There may be any number, such as 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more of these chemical groups. Suitable groups include, but are not limited to, fluorophores, streptavidin and/or biotin, cholesterol, methylene blue, dinitrophenols (DNPs), digoxigenin and/or anti-digoxigenin and diphenyl Cyclooctyne group.
目标多核苷酸中的不同阻断链可以包含不同停滞分子。例如,一个阻断链可以包括如上讨论的一个线性分子,另一个阻断链可以包括一个或多个物理上导致一个或多个解旋酶停滞的化学基团。阻断链可包括如上讨论的任何线性分子和一个或多个物理上导致一个或多个解旋酶停滞的化学基团,例如一个或多个无碱基和荧光团。Different blocking strands in the target polynucleotide may contain different arresting molecules. For example, one blocker strand can include a linear molecule as discussed above, and the other blocker strand can include one or more chemical groups that physically cause one or more helicases to stall. A blocking strand may comprise any linear molecule as discussed above and one or more chemical groups that physically cause one or more helicases to stall, such as one or more abasic groups and fluorophores.
合适的阻断链可以根据目标多核苷酸的类型而设计并且在本申请的方法条件下进行实施。大多数解旋酶结合DNA且沿DNA移动,所以可以用任何不为DNA的物质而停滞。合适的分子如上所述。A suitable blocker chain can be designed according to the type of target polynucleotide and carried out under the method conditions of the present application. Most helicases bind and move along DNA, so can be stalled by anything that is not DNA. Suitable molecules are described above.
在一个具体的实施例中,所述2′甲氧基修饰的核糖核苷酸的数量为1-10个,更可选2-6个;和/或所述2′甲氧基修饰的核糖核苷酸均匀分布于所述阻断链。In a specific embodiment, the number of 2'methoxy-modified ribonucleotides is 1-10, more optionally 2-6; and/or the 2'methoxy-modified ribose Nucleotides are evenly distributed over the blocking strand.
解旋酶Helicase
本申请可使用任何解旋酶。解旋酶可以为或衍生自Hel308解旋酶,RecD解旋酶,如TraI解旋酶或TrwC解旋酶,XPD解旋酶或Dda解旋酶。解旋酶可以为实施例1中提供的解旋酶、或中国专利CN201880095718.X中公开的解旋酶,也可以是其他解旋酶。Any helicase can be used herein. The helicase may be or be derived from a Hel308 helicase, a RecD helicase, such as a TraI helicase or a TrwC helicase, an XPD helicase or a Dda helicase. The helicase can be the helicase provided in Example 1, or the helicase disclosed in Chinese patent CN201880095718.X, or other helicases.
点击化学click chemistry
本申请的多核苷酸之间可共价地连接。例如,可使用游离铜点击化学或铜催化的点击化学。由于点击化学令人满意的性质和其对于在多种构建块(building blocks)之间生成共价连接的范围,使得在这些应用中使用点击化学。例如,它是快速的,清洁的并且无毒的,只产生无害的副产物。点击化学是由Kolb等在2001首次介绍的术语,为描述更广泛的一系列强大,有选择性的和模块化的构建块,所述构建块可靠地用于小规模和大规模应用(Kolb HC Finn,MG,Sharp less KB,click chemistry:diverse chemical function from a few good reactions,Angew.Chem.Int.Ed.40(2001)2004-2021)。他们定义了如下一系列严格标准用于点击化学:“反应必须是模块化的,宽的范围,给出非常高的产量,只产生无害的可通过非色谱法去除的副产物,并且是立体定向的(但不必然是对映选择性)。所要求的方法特征包括简单的反应条件(理想的所述方法应对氧气和水不敏感),容易获得的起始物质和试剂,无溶剂或溶剂的使用,所述溶剂是温和的(例 如水)或容易去除的,和简单的产物分离。纯化如果需要必须是通过非色谱法,例如结晶或蒸馏,并且所述产物在生理状态下必须是稳定的”。The polynucleotides of the present application can be linked covalently. For example, free copper click chemistry or copper catalyzed click chemistry can be used. Click chemistry is used in these applications because of its desirable properties and its scope for generating covalent linkages between a variety of building blocks. For example, it is fast, clean and non-toxic, producing only harmless by-products. Click chemistry is a term first introduced by Kolb et al. in 2001 to describe a broader set of robust, selective and modular building blocks that are reliably usable for both small-scale and large-scale applications (Kolb HC Finn, MG, Sharp less KB, click chemistry: diverse chemical function from a few good reactions, Angew. Chem. Int. Ed. 40(2001) 2004-2021). They defined the following set of stringent criteria for click chemistry: "The reaction must be modular, broad in scope, give very high yields, produce only harmless by-products that can be removed by non-chromatographic methods, and be stereogenic. Directional (but not necessarily enantioselective). Required process features include simple reaction conditions (ideally the process should be insensitive to oxygen and water), readily available starting materials and reagents, solvent-free or use, the solvent is mild (e.g. water) or easily removed, and simple product isolation. Purification must be by non-chromatographic methods if necessary, such as crystallization or distillation, and the product must be stable under physiological conditions of".
点击化学的合适例子包括但不限于以下:Suitable examples of click chemistry include but are not limited to the following:
(a)游离铜的变体的1,3偶极环加成反应(1,3-Dipolar Cycloaddition),(a) 1,3-Dipolar Cycloaddition of variants of free copper (1,3-Dipolar Cycloaddition),
其中,叠氮化物与炔烃在应力例如在环辛烷环中反应;where azides react with alkynes under stress, for example in cyclooctane rings;
(b)在一个连接体上的氧亲核试剂与在其他连接体上的环氧化合物或氮杂环丙烷反应性部分的反应;和(b) reaction of an oxygen nucleophile on one linker with an epoxy or aziridine reactive moiety on the other linker; and
(c)施陶丁格连接,其中炔烃部分可被芳基膦取代,导致和叠氮化物的特异性反应,从而得到酰胺键。(c) Staudinger linkage, in which the alkyne moiety can be substituted by an arylphosphine, resulting in specific reaction with azide to give an amide bond.
可选所述点击化学反应是在炔烃和叠氮化物之间Cu(I)催化的1,3偶极环加成反应。在可选实施例中,第一基团是叠氮化物基团,第二基团是炔烃基团。核酸碱基已被合成,在可选位置中插入叠氮化物和炔烃基团(例如Kocalka P,El-Sagheer AH,Brown T,Rapid and efficient DNA strand cross-linking by click chemistry,Chembiochem.2008.9(8):1280-5)。炔烃基团是从Berry Associates(Michigan,USA)商业可获得的,并且叠氮化物基团是通过ATDBio或IDT bio合成的。Optionally the click chemistry reaction is a Cu(I) catalyzed 1,3 dipolar cycloaddition between alkynes and azides. In an alternative embodiment, the first group is an azide group and the second group is an alkyne group. Nucleobases have been synthesized with insertion of azide and alkyne groups in alternative positions (e.g. Kocalka P, El-Sagheer AH, Brown T, Rapid and efficient DNA strand cross-linking by click chemistry, Chembiochem.2008.9(8 ):1280-5). Alkyne groups were commercially available from Berry Associates (Michigan, USA), and azide groups were synthesized by ATDBio or IDT bio.
在本申请的一个具体的实施方案中,可选地反应基团是叠氮化物和己基基团,例如叠氮化物N3和DBCO。In a specific embodiment of the present application, optionally reactive groups are azide and hexyl groups, eg azide N3 and DBCO.
方法method
本申请还提供了一种表征目标多核苷酸的方法,所述方法使用所述衔接体或所述的复合物;The present application also provides a method for characterizing a target polynucleotide, the method using the adapter or the complex;
可选地,所述方法包括:Optionally, the method includes:
(a)使目标多核苷酸穿过跨膜孔移动,(a) moving the polynucleotide of interest across the transmembrane pore,
其中所述目标多核苷酸与所述的衔接体或所述的复合物连接;以及wherein said target polynucleotide is linked to said adapter or said complex; and
(b)随着所述多核苷酸相对于所述孔移动,获取一个或多个电和/或光测量值,其中所述测量值代表所述多核苷酸的一个或多个特征,并由此 表征所述目标多核苷酸。(b) taking one or more electrical and/or optical measurements as the polynucleotide moves relative to the pore, wherein the measurements represent one or more characteristics of the polynucleotide and are determined by This characterizes the polynucleotide of interest.
本申请的方法包括测量所述目标多核苷酸的一个或多个特征。该方法可以包括测量2个,3个,4个,5个或更多个目标多核苷酸的特征。所述一个或多个特征,可以选自(i)目标多核苷酸的长度,(ii)所述目标多核苷酸的同一性,(iii)所述目标多核苷酸的序列,(iv)所述目标多核苷酸的二级结构;以及(v)目标多核苷酸是否是经修饰的。(i)至(v)的任何组合可以根据本申请进行测量。The methods of the present application comprise measuring one or more characteristics of said target polynucleotide. The method may comprise measuring 2, 3, 4, 5 or more characteristics of the target polynucleotides. The one or more characteristics may be selected from (i) the length of the target polynucleotide, (ii) the identity of the target polynucleotide, (iii) the sequence of the target polynucleotide, (iv) the the secondary structure of the target polynucleotide; and (v) whether the target polynucleotide is modified. Any combination of (i) to (v) can be measured according to the present application.
对于(i),多核苷酸的长度例如可以通过确定所述目标多核苷酸和所述孔的相互作用数量,以及目标多核苷酸与所述孔相互作用之间的持续时间来测定。For (i), the length of the polynucleotide can be determined, for example, by determining the number of interactions between the target polynucleotide and the pore, and the duration between interactions between the target polynucleotide and the pore.
对于(ii),所述多核苷酸的同一性可以通过多种方式测定。多核苷酸的同一性可以联合目标多核苷酸的序列的测定,或不联合目标多核苷酸的序列的测定进行测定。前者是直接的;对所述多核苷酸进行测序,并由此进行鉴定。后者可以以几种方式来完成。例如,可以测定多核苷酸中特定模序的存在(而无需测定该多核苷酸的其余序列)。或者,方法中测定的特定的电和/或光信号可鉴定来自特定来源的目标多核苷酸。Regarding (ii), the identity of the polynucleotides can be determined in a number of ways. Polynucleotide identity may be determined in conjunction with or without determination of the sequence of the polynucleotide of interest. The former is straightforward; the polynucleotide is sequenced and thus identified. The latter can be done in several ways. For example, the presence of a particular motif in a polynucleotide can be determined (without determining the rest of the sequence of the polynucleotide). Alternatively, specific electrical and/or optical signals determined in the method can identify a target polynucleotide from a specific source.
对于(iii),多核苷酸的序列可以如前所述确定。合适的测序方法,特别是那些使用电测量的方法,描述于StoddartD et al.,Proc Natl Acad Sci,12;106(19):7702-7,Lieberman KR et al,J Am Chem Soc.2010;132(50):17961-72,和国际申请WO 2000/28312中。For (iii), the sequence of the polynucleotide can be determined as previously described. Suitable sequencing methods, particularly those using electrical measurements, are described in Stoddart D et al., Proc Natl Acad Sci, 12; 106(19):7702-7, Lieberman KR et al, J Am Chem Soc. 2010; 132 (50):17961-72, and in International Application WO 2000/28312.
对于(iv),所述二级结构可以以多种方式测量。例如,如果该方法包含电测量,二级结构可以利用穿过孔的停留时间的改变或电流变化进行测量。这使得单链和双链多核苷酸的区域能够得到识别。As for (iv), the secondary structure can be measured in various ways. For example, if the method involves electrical measurements, secondary structure can be measured using a change in residence time or a change in current through the pore. This allows regions of single- and double-stranded polynucleotides to be identified.
对于(v),可以测定任何存在或不存在修饰。该方法可选包括确定所述目标多核苷酸是否通过甲基化,氧化,损伤,用一个或多个蛋白质或一个或多个标记物,标签或阻断链进行了修饰。特异性修饰将导致与孔的特异性相互作用,其可以使用下面描述的方法来测定。例如,可以基于孔与每个核苷酸的相互作用过程中穿过孔的电流,识别胞嘧啶与甲基化的胞嘧啶。For (v), the presence or absence of any modification can be determined. The method optionally includes determining whether said target polynucleotide is modified by methylation, oxidation, damage, with one or more proteins or with one or more markers, tags or blocker strands. Specific modifications will result in specific interactions with the pore, which can be determined using the methods described below. For example, cytosines can be identified from methylated cytosines based on the current flow through the pore during the interaction of the pore with each nucleotide.
该方法通常在缓冲剂存在下实施。在上面讨论的示例性设备中,缓冲剂存在所述室的水溶液中。本申请的方法可使用任何缓冲剂。通常地,缓冲剂是磷酸盐缓冲液。其他合适的缓冲剂是HEPES和Tris-HCl缓冲剂。该方法通常在pH值为4.0至12.0,4.5至10.0,5.0至9.0,5.5至8.8,6.0至8.7,7.0至8.8,或7.5至8.5下实施。所使用的pH可选约为7.5。The method is usually carried out in the presence of a buffer. In the exemplary devices discussed above, the buffer is present in the aqueous solution in the chamber. Any buffer may be used in the methods of the present application. Typically, the buffer is phosphate buffered saline. Other suitable buffers are HEPES and Tris-HCl buffers. The method is typically carried out at a pH of 4.0 to 12.0, 4.5 to 10.0, 5.0 to 9.0, 5.5 to 8.8, 6.0 to 8.7, 7.0 to 8.8, or 7.5 to 8.5. The pH used is optionally about 7.5.
该方法可在0至100℃,15℃至95℃,16℃至90℃,17℃至85℃,18℃至80℃,19℃至70℃,或20℃至60℃实施。该方法通常在室温下进行。该方法可选地在支持解旋酶功能的温度下,例如约37℃实施。The method may be performed at 0 to 100°C, 15°C to 95°C, 16°C to 90°C, 17°C to 85°C, 18°C to 80°C, 19°C to 70°C, or 20°C to 60°C. The method is usually carried out at room temperature. The method is optionally performed at a temperature that supports the function of the helicase, eg, about 37°C.
该方法可以在游离核苷酸或游离核苷酸类似物和/或辅助解旋酶功能发挥的辅助因子存在下实施。该方法也可以在游离核苷酸或游离核苷酸类似物不存在且解旋酶的辅助因子不存在下实施。所述游离核苷酸可以为如上面讨论的单个核苷酸的任意一个或多个。游离核苷酸包括,但不限于,单磷酸腺苷(AMP),二磷酸腺苷(ADP),三磷酸腺苷(ATP),单磷酸鸟苷(GMP),二磷酸鸟苷(GDP),三磷酸鸟苷(GTP),单磷酸胸苷(TMP),二磷酸胸苷(TDP),三磷酸胸苷(TTP),单磷酸尿苷(UMP),二磷酸尿苷(UDP),三磷酸尿苷(UTP),单磷酸胞苷(CMP),二磷酸胞苷(CDP),三磷酸胞苷(CTP),环磷酸腺苷(cAMP),环单磷酸鸟苷(cGMP),脱氧单磷酸腺苷(dAMP),脱氧二磷酸腺苷(DADP),脱氧三磷酸腺苷(dATP),脱氧单磷酸鸟苷(dGMP),脱氧二磷酸鸟苷(dGDP),脱氧三磷酸鸟苷(dGTP),脱氧单磷酸胸苷(dTMP),脱氧二磷酸胸苷(dTDP),脱氧三磷酸胸苷(dTTP),脱氧二磷酸尿苷(dUMP),脱氧二磷酸尿苷(dUDP),脱氧三磷酸尿苷(dUTP),脱氧单磷酸胞苷(dCMP),脱氧二磷酸胞苷(dCDP)和脱氧三磷酸胞苷(dCTP)。该游离核苷酸可选选自AMP,TMP,GMP,CMP,UMP,dAMP,dTMP,dGMP或dCMP。该游离核苷酸可选三磷酸腺苷(ATP)。解旋酶辅助因子是使解旋酶或构建体发挥功能的因子。解旋酶辅助因子可选为二价金属阳离子。所述二价金属阳离子可选为Mg 2+,Mn 2+,Ca 2+或Co 2+。解旋酶辅助因子最可选Mg 2+The method may be carried out in the presence of free nucleotides or free nucleotide analogs and/or cofactors which assist in the functioning of the helicase. The method can also be carried out in the absence of free nucleotides or free nucleotide analogues and in the absence of cofactors for the helicase. The free nucleotides may be any one or more of the individual nucleotides as discussed above. Free nucleotides include, but are not limited to, adenosine monophosphate (AMP), adenosine diphosphate (ADP), adenosine triphosphate (ATP), guanosine monophosphate (GMP), guanosine diphosphate (GDP), guanosine triphosphate GTP, thymidine monophosphate (TMP), thymidine diphosphate (TDP), thymidine triphosphate (TTP), uridine monophosphate (UMP), uridine diphosphate (UDP), uridine triphosphate ( UTP), cytidine monophosphate (CMP), cytidine diphosphate (CDP), cytidine triphosphate (CTP), cyclic adenosine monophosphate (cAMP), cyclic guanosine monophosphate (cGMP), deoxyadenosine monophosphate ( dAMP), deoxyadenosine diphosphate (DADP), deoxyadenosine triphosphate (dATP), deoxyguanosine monophosphate (dGMP), deoxyguanosine diphosphate (dGDP), deoxyguanosine triphosphate (dGTP), deoxythymidine monophosphate (dTMP), deoxythymidine diphosphate (dTDP), deoxythymidine triphosphate (dTTP), deoxyuridine diphosphate (dUMP), deoxyuridine diphosphate (dUDP), deoxyuridine triphosphate (dUTP), deoxy Cytidine monophosphate (dCMP), deoxycytidine diphosphate (dCDP) and deoxycytidine triphosphate (dCTP). The free nucleotide may be selected from AMP, TMP, GMP, CMP, UMP, dAMP, dTMP, dGMP or dCMP. The free nucleotide may be adenosine triphosphate (ATP). A helicase cofactor is a factor that enables a helicase or construct to function. The helicase cofactor can optionally be a divalent metal cation. The divalent metal cation may be Mg 2+ , Mn 2+ , Ca 2+ or Co 2+ . The helicase cofactor is most preferably Mg 2+ .
需要说明的是:还原型辅酶Ⅰ,NADH(Nicotinamide adenine  dinucleotide)是一种化学物质,是烟酰胺腺嘌呤二核苷酸的还原态。It should be noted that: reduced coenzyme I, NADH (Nicotinamide adenine dinucleotide) is a chemical substance, which is the reduced state of nicotinamide adenine dinucleotide.
试剂盒Reagent test kit
再一方面,本申请还提供了一种用于表征多核苷酸的试剂盒,所述试剂盒包含所述的衔接体或所述的复合物。In another aspect, the present application also provides a kit for characterizing polynucleotides, said kit comprising said adapter or said complex.
所述试剂盒包括(a)一个或多个衔接体,(b)一个或多个解旋酶。所述试剂盒可包括上面讨论的任何解旋酶和孔。The kit includes (a) one or more adapters, (b) one or more helicases. The kit may include any of the helicases and wells discussed above.
所述试剂盒还可以包括膜的成分,如形成两性分子层需要的磷脂如脂质双分子层。The kit may also include membrane components, such as phospholipids, such as lipid bilayers, required for the formation of amphiphilic layers.
本申请的试剂盒可以另外包含使得上述提及的任何实施例能够实施的一个或多个其它试剂或仪器。这类试剂或仪器包括以下试剂或仪器中的一个或多个:合适的缓冲液(水溶液),从受体(subject)中得到样品的工具(例如包含针的容器或仪器),扩增和/或表达多核苷酸的工具,上文定义的膜或压力钳或膜片钳装置。试剂在试剂盒中可以干燥状态存在,使得流体样品重新悬浮试剂。所述试剂盒还可,可选地,包括使本申请的方法中如何使用试剂盒的说明书,或者该方法可用于何种患者的详细信息。所述试剂盒可选地包括促进解旋酶移动的必要的组分(例如ATP和Mg 2+)。 The kits of the present application may additionally comprise one or more other reagents or instruments that enable the performance of any of the above-mentioned embodiments. Such reagents or devices include one or more of the following: a suitable buffer (aqueous solution), means for obtaining a sample from a subject (such as a container or device containing a needle), amplification and/or Or a means for expressing polynucleotides, a membrane as defined above or a pressure clamp or patch clamp device. The reagents may be present in a dry state in the kit such that a fluid sample resuspends the reagents. The kit may also, optionally, include instructions for how to use the kit in the methods of the present application, or details of which patients the methods may be used for. The kit optionally includes the necessary components to facilitate the movement of the helicase (eg ATP and Mg 2+ ).
下列实施例说明本申请。The following examples illustrate the application.
实施例1:降低ATP空耗的Y衔接体-酶复合物的制备Example 1: Preparation of a Y adapter-enzyme complex that reduces ATP empty consumption
SEQ ID NO:1 GCGGAGTCAAACGGTAGAAGTCG;SEQ ID NO: 1 GCGGAGTCAAACGGTAGAAGTCG;
SEQ ID NO:2 TAACGTATTC;SEQ ID NO:2 TAACGTATTC;
SEQ ID NO:3 ACTGCTCATTCGGTCCTGCTGACT;SEQ ID NO:3 ACTGCTCATTCGGTCCTGCTGACT;
SEQ ID NO:4 CGACTTCTACCGTTTGACTCCGC;SEQ ID NO:4 CGACTTCTACCGTTTGACTCCGC;
SEQ ID NO:5 GTCAGCAGGACCGAATGA;SEQ ID NO:5 GTCAGCAGGACCGAATGA;
SEQ ID NO:6 GAATACGTTAGCGG,其中SEQ ID NO:6由PNA组成;PNA代表了肽核酸,一类以多肽骨架取代糖磷酸主链的DNA类似物。SEQ ID NO: 6 GAATACGTTAGCGG, wherein SEQ ID NO: 6 is composed of PNA; PNA represents peptide nucleic acid, a class of DNA analogues that replace sugar and phosphate backbones with polypeptide backbones.
SEQ ID NO:7 GCAGTAGTCCAGCACCGACC;SEQ ID NO:7 GCAGTAGTCCAGCACCGACC;
SEQ ID NO:8SEQ ID NO:8
Figure PCTCN2022136442-appb-000001
Figure PCTCN2022136442-appb-000001
复合物是由4个不同链杂交在一起构成的;The complex is formed by the hybridization of 4 different strands;
第一链(Y-Top-1),依次包含前导序列,即iSpC3阻断链,表示为3,其连接到SEQ ID NO:1的5′端,其3′端依次连接到4个i2OMeC和SEQ ID NO:2,SEQ ID NO:2的3′端连接到阻断链(R0-R6如表2所示)和SEQ ID NO:3,其中i2OmeC和i2OmeG为2′'-O-甲基RNA,即2′甲氧基修饰的RNA。The first strand (Y-Top-1), which in turn contains the leader sequence, i.e. the iSpC3 blocking strand, denoted as 3, is connected to the 5' end of SEQ ID NO: 1, and its 3' end is connected to four i2OMeC and SEQ ID NO:2, the 3' end of SEQ ID NO:2 is connected to the blocking chain (R0-R6 as shown in Table 2) and SEQ ID NO:3, wherein i2OmeC and i2OmeG are 2''-O-methyl RNA, that is, 2'methoxy-modified RNA.
第二链(Y-Top-2),DBCO连接SEQ ID NO:4的5′端。The second strand (Y-Top-2), DBCO is connected to the 5' end of SEQ ID NO:4.
第三链(Y-Bottom),SEQ ID NO:4的3′端链接SEQ ID NO:7。The third strand (Y-Bottom), the 3' end of SEQ ID NO:4 is linked to SEQ ID NO:7.
第四链(PNA-R),[GAATACGTTAGCGG]pna-OO-azide(N3),其中O为O-liker(也称为AEEA或eg1),用于增加PNA-R的溶解性。The fourth strand (PNA-R), [GAATACGTTAGCGG]pna-OO-azide (N3), where O is O-liker (also known as AEEA or eg1), was used to increase the solubility of PNA-R.
Figure PCTCN2022136442-appb-000002
Figure PCTCN2022136442-appb-000002
Figure PCTCN2022136442-appb-000003
其中,333333333333333333333333333333代表了C3 spacer修饰。
Figure PCTCN2022136442-appb-000003
Among them, 3333333333333333333333333333 represents C3 spacer modification.
Y-Top-2:DBCO- CGACTTCTACCGTTTGACTCCGC; Y-Top-2: DBCO- CGACTTCTACCGTTTGACTCCGC;
Figure PCTCN2022136442-appb-000004
Figure PCTCN2022136442-appb-000004
Figure PCTCN2022136442-appb-000005
Figure PCTCN2022136442-appb-000005
将Y1、Y2、YB三条合成单链以1:1.1:1.1的比例进行退火(从95℃缓慢降温到25℃,降温幅度不超过0.1℃/s)。退火终体系包括160mM HEPES 7.0;200mM NaCl,Y1的终浓度为4-8μM,最终形成Y型衔接体。将Y型衔接体(500nM)与6倍物质的量的酶T4 Dda-M1G/E94C/C109A/C136A/A360C(3μM)(序列如SEQ ID NO:8所示),在缓冲液(100mM NaAc(pH 7);1.5mM TMAD)中混合并室温孵育30分钟。该混合物称为样品1。Anneal the three synthetic single strands of Y1, Y2, and YB at a ratio of 1:1.1:1.1 (slowly cool down from 95°C to 25°C, and the temperature drop does not exceed 0.1°C/s). The final annealing system includes 160mM HEPES 7.0; 200mM NaCl, the final concentration of Y1 is 4-8μM, and the Y-type adapter is finally formed. Enzyme T4 Dda-M1G/E94C/C109A/C136A/A360C (3μM) (sequence shown in SEQ ID NO: 8) with Y-type adapter (500nM) and 6 times the amount of substance, in buffer (100mM NaAc ( pH 7); 1.5mM TMAD) and incubated at room temperature for 30 minutes. This mixture is referred to as Sample 1.
在样品1中加入1μM的PNA-R链,并且在室温下孵育30分钟。得到样品2。1 μM of PNA-R chain was added to sample 1 and incubated at room temperature for 30 minutes. Sample 2 is obtained.
TBE(天然的)PAGE凝胶检测样品1和样品2在相同条件下的迁移率。PNA-R链加载到接头上之后的样品2迁移率会下降,较未加PNA-R链的对照(样品1)迁移率较慢,其中,使用阻断链为R0的对比结果如图2所示。TBE (native) PAGE gels were used to examine the mobility of samples 1 and 2 under the same conditions. After the PNA-R chain is loaded on the adapter, the mobility of sample 2 will decrease, which is slower than that of the control (sample 1) without PNA-R chain. Among them, the comparison results using the blocking chain as R0 are shown in Figure 2 Show.
将样品2使用DNAPac PA200柱,使用下列洗脱缓冲液(缓冲液A:20mM Na-CHES,250mM NaCl,4%(W/V)甘油,pH 8.6,缓冲液B:20mM Na-CHES,1M NaCl,4%(W/V)甘油,pH8.6)进行纯化,将样品1装在所述柱子上,并且用缓冲液A将没有结合到DNA上的酶从柱上洗脱掉。然后用10倍柱体积的0-100%缓冲液E将结合了酶的Y型衔接体复合物进行洗脱。然后汇集主洗脱峰,测量其浓度用于实施例2的检测。 Sample 2 was used on a DNAPac PA200 column with the following elution buffers (buffer A: 20 mM Na-CHES, 250 mM NaCl, 4% (W/V) glycerol, pH 8.6, buffer B: 20 mM Na-CHES, 1 M NaCl , 4% (W/V) glycerol, pH 8.6) for purification, sample 1 was loaded on the column, and the enzyme not bound to DNA was eluted from the column with buffer A. The enzyme-bound Y-adapter complex was then eluted with 10 column volumes of 0-100% buffer E. Then the main elution peaks were collected and their concentrations were measured for the detection of Example 2.
实施例2:ATPase活性检测Embodiment 2: ATPase activity detection
首先进行NADH反应混合液的制备,主要按照下表1进行该反应混合液的配制,配制完成后,室温水平翻转,孵育10分钟。Firstly, the NADH reaction mixture was prepared, and the reaction mixture was mainly prepared according to Table 1 below. After the preparation was completed, the reaction mixture was turned horizontally at room temperature and incubated for 10 minutes.
表1 NADH反应混合液的制备Table 1 Preparation of NADH reaction mixture
Figure PCTCN2022136442-appb-000006
Figure PCTCN2022136442-appb-000006
之后,在96孔板内加入112.5μL NADH反应混合液,37.5μL(20nM) Y衔接体-酶复合物(即 实施例1纯化后的样品1和样品2中的任一种),然后将其放入紫外-可见光分光光度计内测量380nn处的吸光值,温度设定为34℃;检测200个循环,每个循环5分钟。收集的数据进行标准曲线绘制并通过标准曲线斜率得出ATP消耗值。 Afterwards, 112.5 μL NADH reaction mixture, 37.5 μL (20 nM) Y adapter-enzyme complex (i.e. any one of sample 1 and sample 2 after purification in Example 1 ) was added in the 96-well plate, and then Put it into an ultraviolet-visible spectrophotometer to measure the absorbance value at 380nm, set the temperature at 34°C; detect 200 cycles, each cycle is 5 minutes. The collected data were drawn into a standard curve and the ATP consumption value was obtained through the slope of the standard curve.
结果如图3和表2(加入复合物10h)所示。以不加入PNA-R为对照(其使用的阻断链与R1相同),并且设置为基线(100%)。ATP消耗百分比如果低于100%说明相较于对照接头具有降低ATP消耗的潜力。所有测试的接头适配体的阻断链序列如下表2所示,将酶驱赶到不同的结构和序列的阻断链后,测定得到的ATP/NADH消耗,具体见图3所示。The results are shown in Figure 3 and Table 2 (complex addition 10h). No addition of PNA-R was used as a control (it used the same blocker chain as R1), and was set as baseline (100%). Percent ATP depletion below 100% indicates potential for reduced ATP depletion compared to control linkers. The blocking chain sequences of all the adapter adapters tested are shown in Table 2 below. After driving the enzyme to blocking chains with different structures and sequences, the ATP/NADH consumption was determined, as shown in Figure 3.
由图3得知:相较于对照R0,衔接体R1-R6的ATP消耗速率均有明显下降。与R1相比,R2-R6的阻断链不同,ATP消耗量不同,其中,阻断链增加1个2′甲氧基修饰的核糖核苷酸如R2,ATP消耗变化不明显,阻断链增加4个2′甲氧基修饰的核糖核苷酸如R3-R6,ATP消耗明显降低。另外,图3中还设置了不含衔接体的酶(与实施例1相同)对照(CK),该酶依赖底物消耗ATP,不含衔接体时由于酶没有结合底物,其ATP消耗较低。It can be seen from Figure 3 that compared with the control R0, the ATP consumption rate of the adapter R1-R6 was significantly decreased. Compared with R1, the blocking chains of R2-R6 are different, and the ATP consumption is different. Among them, the blocking chain adds a 2'methoxy-modified ribonucleotide such as R2, and the ATP consumption does not change significantly. The blocking chain The addition of four 2′methoxy-modified ribonucleotides such as R3-R6 significantly decreased ATP consumption. In addition, an enzyme (same as Example 1) control (CK) without an adapter is also set in Figure 3, and the enzyme relies on the substrate to consume ATP. Low.
表2Table 2
Figure PCTCN2022136442-appb-000007
Figure PCTCN2022136442-appb-000007
实施例3:降低ATP空耗的Y衔接体-酶复合物的上机测试Example 3: On-machine test of the Y adapter-enzyme complex that reduces ATP empty consumption
通过末端修复方式制备长为10kb的文库,并且用实施例1中制备的降低ATP空耗的Y衔接体-酶复合物(接头为R4)与文库即目标多核苷酸进行连接建库,目标多核苷酸连接位置为图1所示衔接体的右端。以不加入PNA-R链为对照(记为RC)。A library of 10kb in length was prepared by end repair, and the Y adapter-enzyme complex (joint is R4) prepared in Example 1 to reduce ATP vacancy was used to connect the library, that is, the target polynucleotide, to build a library. The acid linking position is the right end of the adapter shown in Figure 1. Take no addition of PNA-R chain as a control (denoted as RC).
使用齐碳科技有限公司纳米孔测序仪QNome-9604进行测序,测序缓冲液:终浓度10mM HEPEs,100mM MgCl 2,375mM KCl,ATP 100mM,pH 7.1,测序温度:30-40℃。 Sequencing was performed using the nanopore sequencer QNome-9604 of Qitan Technology Co., Ltd., sequencing buffer: final concentration 10mM HEPEs, 100mM MgCl 2 , 375mM KCl, ATP 100mM, pH 7.1, sequencing temperature: 30-40°C.
结果:如图4所示,对照RC在测序16小时内下降幅度约为80bp/s;接头R4在测序16小时内速率下降幅度约为10bp/s。测序速率和ATP浓度呈正相关,若测序过程中,ATP浓度显著下降,则其测序速率也会下降。Results: As shown in Figure 4, the control RC decreased by about 80 bp/s within 16 hours of sequencing; the rate of linker R4 decreased by about 10 bp/s within 16 hours of sequencing. The sequencing rate is positively correlated with the ATP concentration. If the ATP concentration drops significantly during the sequencing process, the sequencing rate will also decrease.
另外,本文中术语“和/或”,仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。另外,本文中字符“/”,一般表示前后关联对象是一种“或”的关系。In addition, the term "and/or" in this article is only an association relationship describing associated objects, which means that there may be three relationships, for example, A and/or B may mean: A exists alone, A and B exist at the same time, There are three cases of B alone. In addition, the character "/" in this article generally indicates that the contextual objects are an "or" relationship.
应理解,在本申请实施例中,“与A相应的B”表示B与A相关联,根据A可以确定B。但还应理解,根据A确定B并不意味着仅仅根据A确定B,还可以根据A和/或其它信息确定B。It should be understood that in this embodiment of the present application, "B corresponding to A" means that B is associated with A, and B can be determined according to A. However, it should also be understood that determining B according to A does not mean determining B only according to A, and B may also be determined according to A and/or other information.
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到各种等效的修改或替换,这些修改或替换都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以权利要求的保护范围为准。The above is only a specific embodiment of the application, but the scope of protection of the application is not limited thereto. Any person familiar with the technical field can easily think of various equivalents within the scope of the technology disclosed in the application. Modifications or replacements, these modifications or replacements shall be covered within the scope of protection of this application. Therefore, the protection scope of the present application should be based on the protection scope of the claims.

Claims (10)

  1. 一种用于表征目标多核苷酸的衔接体,所述衔接体在5'到3'端方向包含{L-S} n或{S-L} nan adapter for characterizing a polynucleotide of interest comprising {LS} n or {SL} n in the 5' to 3' direction,
    其中,L为修饰的双链多核苷酸,S为阻断链,n为正整数;Wherein, L is a modified double-stranded polynucleotide, S is a blocking strand, and n is a positive integer;
    并且,所述L双链包含与S连接的多核苷酸链L'及所述L'的互补链L”,所述L'包括远离阻断链S的第一区段和靠近阻断链S的第二区段,所述第一区段包含修饰部分,所述第二区段包含马达蛋白结合活性区,该活性区被所述互补链L”封闭;And, the L double strand comprises a polynucleotide strand L' connected to S and a complementary strand L" of said L', said L' comprising a first segment away from the blocking strand S and a segment close to the blocking strand S The second section of said first section comprises a modified part, said second section comprises a motor protein binding active region, and the active region is blocked by said complementary chain L";
    互补链L”包含能够与所述马达蛋白竞争性结合到L'链的聚合物。The complementary chain L" comprises a polymer capable of competing with said motor protein for binding to the L' chain.
  2. 根据权利要求1所述的衔接体,其中,所述衔接体在5'到3'端方向包含{D 1-L-S} n或{S-L-D 1} n,D 1为第一双链多核苷酸; The adapter according to claim 1, wherein the adapter comprises {D 1 -LS} n or {SLD 1 } n in the direction from the 5' to the 3' end, and D 1 is the first double-stranded polynucleotide;
    和/或,所述衔接体在5'到3'端方向包含{L-S-D 2} n或{D 2-S-L} n,D 2为第二双链多核苷酸; And/or, the adapter comprises {LSD 2 } n or {D 2 -SL} n in the direction from the 5' to the 3' end, and D 2 is the second double-stranded polynucleotide;
    可选地,所述n为1-20的整数。Optionally, the n is an integer of 1-20.
  3. 根据权利要求1或2所述的衔接体,其中,所述链L'的修饰部分使所述第一区段与马达蛋白的结合能力弱于所述第二区段,或者使所述第一区段不与马达蛋白结合;和/或The adapter according to claim 1 or 2, wherein the modified part of the chain L' makes the binding ability of the first segment to motor protein weaker than that of the second segment, or makes the first segment The segment does not bind to the motor protein; and/or
    所述链L'中所述修饰部分为核糖核苷酸和/或核酸类似物;The modification part in the chain L' is a ribonucleotide and/or a nucleic acid analogue;
    可选地,所述核糖核苷酸包括2′位修饰的核糖核苷酸,可选2′烷氧基修饰的核糖核苷酸,更可选为2′甲氧基修饰的核糖核苷酸;或Optionally, the ribonucleotides include 2'-modified ribonucleotides, optionally 2'alkoxy-modified ribonucleotides, more optionally 2'methoxy-modified ribonucleotides ;or
    可选地,所述核酸类似物包括肽核酸、甘油核酸、苏糖核酸、锁核酸、桥核酸中的任一种或两种以上任意组合;和/或Optionally, the nucleic acid analogs include any one or any combination of two or more of peptide nucleic acid, glycerol nucleic acid, threose nucleic acid, locked nucleic acid, and bridge nucleic acid; and/or
    所述链L'中所述马达蛋白结合活性区为脱氧核糖核苷酸;和/或The motor protein binding active region in the chain L' is a deoxyribonucleotide; and/or
    所述核糖核苷酸和/或核酸类似物的个数为1-20、1-15或1-10;可选为2-6。The number of ribonucleotides and/or nucleic acid analogues is 1-20, 1-15 or 1-10; optionally 2-6.
  4. 根据权利要求1至3中任一项所述的衔接体,其中,所述互补链L”靠近阻断链S的一端包含与多核苷酸链L'或所述第二区段结合力更强的部分;可选地,该部分包含PNA或者LNA;The adapter according to any one of claims 1 to 3, wherein the end of the complementary strand L" close to the blocking strand S contains a stronger binding force with the polynucleotide strand L' or the second segment part; optionally, the part includes PNA or LNA;
    和/或,所述互补链L”远离阻断链S的一端包含点击化学的第一部分, 可选为点击反应基团;And/or, the end of the complementary chain L" away from the blocking chain S contains the first part of click chemistry, which can be optionally a click reaction group;
    和/或,所述D 1双链包含与L'连接的多核苷酸链D 1'及其互补链D 1”,所述互补链D 1”靠近阻断链S的一端包含点击化学的第二部分,可选为点击反应基团; And/or, the D 1 double strand comprises a polynucleotide chain D 1 ′ connected to L’ and its complementary chain D 1 ″, and one end of the complementary chain D 1 ″ near the blocking chain S comprises the first click chemistry Two parts, which can be optionally click-reactive groups;
    和/或,所述D 2双链包含与阻断链S连接的多核苷酸链D 2'及其互补链D 2”,所述互补链D 2”包含不与所述衔接体杂交的部分;可选地,所述不与所述衔接体杂交的部分位于互补链D 2”临近阻断链S的一端。 And/or, the D 2 double strand comprises a polynucleotide strand D 2 ′ connected to the blocking strand S and its complementary strand D 2 ″, and the complementary strand D 2 ″ includes a part that does not hybridize with the adapter ; Optionally, the part that does not hybridize with the adapter is located at one end of the complementary strand D 2 ″ adjacent to the blocking strand S.
  5. 根据权利要求1至4中任一项所述的衔接体,其中,所述阻断链具有与所述多核苷酸不同的结构,用于阻滞马达蛋白;The adapter according to any one of claims 1 to 4, wherein the blocking strand has a structure different from that of the polynucleotide for blocking a motor protein;
    可选地,所述阻断链包含一个或多个硝基吲哚、一个或多个肌苷、一个或多个吖啶、一个或多个2-氨基嘌呤、一个或多个2-6-二氨基嘌呤、一个或多个5-溴-脱氧尿嘧啶、一个或多个反向胸苷、一个或多个反向二脱氧胸苷、一个或多个二脱氧胞苷、一个或多个5-甲基胞苷酸、一个或多个5-羟甲基胞苷、一个或多个2’烷氧基修饰的核糖核苷酸可选2’甲氧基修饰的核糖核苷酸、一个或多个异脱氧胞苷、一个或多个异脱氧鸟苷、一个或多个C3基团、一个或多个光裂解(PC)的基团、一个或多个己二醇、一个或多个iSp9基团、一个或多个iSp18基团、聚合物或一个或多个硫醇连接。Optionally, the blocker chain comprises one or more nitroindole, one or more inosine, one or more acridine, one or more 2-aminopurine, one or more 2-6- Diaminopurine, one or more 5-bromo-deoxyuridine, one or more reverse thymidine, one or more reverse dideoxythymidine, one or more dideoxycytidine, one or more 5 -methylcytidylic acid, one or more 5-hydroxymethylcytidines, one or more 2'alkoxy-modified ribonucleotides, optional 2'methoxy-modified ribonucleotides, one or Multiple isodeoxycytidines, one or more isodeoxyguanosines, one or more C3 groups, one or more photocleavable (PC) groups, one or more hexanediols, one or more iSp9 group, one or more iSp18 groups, a polymer or one or more thiol linkages.
  6. 根据权利要求1至5中任一项所述的衔接体,其中,L'的远离S的一端包含先导链序列;The adapter according to any one of claims 1 to 5, wherein the end of L' away from S comprises a leader strand sequence;
    S的远离L'的一端用于连接目标多核苷酸;The end of S away from L' is used to connect the target polynucleotide;
    所述马达蛋白为能够结合到多核苷酸并且控制其移动穿过孔的蛋白;可选地,所述马达蛋白选自聚合酶、核酸外切酶、解旋酶和拓扑异构酶中的一种或多种,更可选地,所述解旋酶选自Hel308解旋酶、RecD解旋酶、Tral解旋酶、TrwC解旋酶、XPD解旋酶和DDA解旋酶中的一种或多种。The motor protein is a protein capable of binding to a polynucleotide and controlling its movement through a pore; optionally, the motor protein is selected from one of polymerase, exonuclease, helicase and topoisomerase one or more, more optionally, the helicase is selected from one of Hel308 helicase, RecD helicase, Tral helicase, TrwC helicase, XPD helicase and DDA helicase or more.
  7. 一种复合物,所述复合物包含权利要求1至6中任一项所述的衔接体,和所述马达蛋白和/或所述目标多核苷酸;A complex comprising the adapter of any one of claims 1 to 6, and the motor protein and/or the target polynucleotide;
    可选地,所述马达蛋白位于阻断链。Optionally, the motor protein is located on the blocking chain.
  8. 如权利要求7所述的复合物的制备方法,其中,包括:The preparation method of compound as claimed in claim 7, wherein, comprises:
    S1:使包含L'-S的Y1链与马达蛋白结合,所述结合区域位于L'链;S1: binding the Y1 chain containing L'-S to the motor protein, and the binding region is located in the L' chain;
    S2:加入包含互补链L”的PNA-R链,得到所述复合物,其中所述马达蛋白被PNA-R链驱赶到阻断链;S2: adding the PNA-R chain comprising the complementary chain L" to obtain the complex, wherein the motor protein is driven to the blocking chain by the PNA-R chain;
    可选地,所述方法包括Optionally, the method includes
    S101:使包含D 1'-L'-S-D 2'的Y1链、包含D 1”的Y2链和包含D 2”的YB链的退火产物与马达蛋白结合,所述结合区域位于退火产物的L'链处; S101: Binding the annealed product of the Y1 chain comprising D 1 '-L'-SD 2 ', the Y2 chain comprising D 1 ” and the YB chain comprising D 2 ” to the motor protein, the binding region is located at the L of the annealed product 'chain at;
    S102:加入包含互补链L”的PNA-R链,得到所述复合物,其中马达蛋白通过PNA-R链驱赶到阻断链。S102: Adding the PNA-R chain containing the complementary chain L" to obtain the complex, wherein the motor protein is driven to the blocking chain by the PNA-R chain.
  9. 一种表征目标多核苷酸的方法,所述方法使用如权利要求1至6中任一项所述的衔接体或如权利要求7或8所述的复合物;A method of characterizing a polynucleotide of interest using the adapter of any one of claims 1 to 6 or the complex of claims 7 or 8;
    可选地,所述方法包括:Optionally, the method includes:
    (a)使目标多核苷酸穿过跨膜孔移动,(a) moving the polynucleotide of interest across the transmembrane pore,
    其中所述目标多核苷酸与如权利要求1至6中任一项所述的衔接体或如权利要求7或8所述的复合物连接;以及Wherein said target polynucleotide is connected with the adapter according to any one of claims 1 to 6 or the complex according to claim 7 or 8; and
    (b)随着所述多核苷酸相对于所述孔移动,获取一个或多个电和/或光测量值,其中所述测量值代表所述多核苷酸的一个或多个特征,并由此表征所述目标多核苷酸。(b) taking one or more electrical and/or optical measurements as the polynucleotide moves relative to the pore, wherein the measurements represent one or more characteristics of the polynucleotide and are determined by This characterizes the polynucleotide of interest.
  10. 一种用于表征多核苷酸的试剂盒,所述试剂盒的组成为如下1)-4)中任一种:A kit for characterizing polynucleotides, the kit is composed of any one of the following 1)-4):
    1)包含独立包装的如权利要求1至6中任一项所述的衔接体,可选地,还包含独立包装的如权利要求1至6中任一项所述的马达蛋白;1) comprising the independently packaged adapter according to any one of claims 1 to 6, optionally also comprising the independently packaged motor protein according to any one of claims 1 to 6;
    2)包含如权利要求7所述的复合物;2) comprising the complex as claimed in claim 7;
    3)包含如权利要求8所述制备方法得到的复合物;3) comprising the compound obtained by the preparation method as claimed in claim 8;
    4)包含分别独立包装的如下组分:4) Contains the following components individually packaged:
    如权利要求1至6中任一项所述的衔接体中的用于与S连接的多核苷酸链L'、所述阻断链S、和所述核苷酸链L'的互补链L”,可选地,还包含独立包装的如权利要求1至6中任一项所述的马达蛋白。The polynucleotide strand L', the blocking strand S, and the complementary strand L of the nucleotide strand L' for being connected to S in the adapter according to any one of claims 1 to 6 ", optionally, also comprising independently packaged motor protein as described in any one of claims 1 to 6.
PCT/CN2022/136442 2021-12-15 2022-12-05 Adaptor used for characterizing polynucleotide and use thereof WO2023109538A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111534007.8A CN114262735A (en) 2021-12-15 2021-12-15 Adaptors for characterising polynucleotides and uses thereof
CN202111534007.8 2021-12-15

Publications (1)

Publication Number Publication Date
WO2023109538A1 true WO2023109538A1 (en) 2023-06-22

Family

ID=80827301

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/136442 WO2023109538A1 (en) 2021-12-15 2022-12-05 Adaptor used for characterizing polynucleotide and use thereof

Country Status (2)

Country Link
CN (1) CN114262735A (en)
WO (1) WO2023109538A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114134142B (en) * 2021-11-23 2024-05-28 成都齐碳科技有限公司 Adaptors, complexes, single-stranded molecules, kits, methods and uses
CN114262735A (en) * 2021-12-15 2022-04-01 成都齐碳科技有限公司 Adaptors for characterising polynucleotides and uses thereof
CN116200477A (en) * 2022-12-13 2023-06-02 成都齐碳科技有限公司 Adaptors, constructs, methods and uses comprising a combination blocker

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101346474A (en) * 2005-12-22 2009-01-14 凯津公司 Alternative nucleotides for improved targeted nucleotide exchange
CN103827320A (en) * 2011-07-25 2014-05-28 牛津纳米孔技术有限公司 Hairpin loop method for double strand polynucleotide sequencing using transmembrane pores
CN106103741A (en) * 2014-01-22 2016-11-09 牛津纳米孔技术公司 The method that one or more polynucleotide associated proteins are connected to target polynucleotide
CN110088299A (en) * 2016-09-29 2019-08-02 牛津纳米孔技术公司 The nucleic acid detection method instructed by nano-pore
CN111849965A (en) * 2019-04-26 2020-10-30 新英格兰生物实验室公司 Polynucleotide adaptor design for reduced bias
WO2020234612A1 (en) * 2019-05-22 2020-11-26 Oxford Nanopore Technologies Limited Method
CN114262735A (en) * 2021-12-15 2022-04-01 成都齐碳科技有限公司 Adaptors for characterising polynucleotides and uses thereof

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014135838A1 (en) * 2013-03-08 2014-09-12 Oxford Nanopore Technologies Limited Enzyme stalling method
GB201403096D0 (en) * 2014-02-21 2014-04-09 Oxford Nanopore Tech Ltd Sample preparation method
CN113462764B (en) * 2021-09-01 2021-12-17 北京齐碳科技有限公司 Hairpin-like adaptors, constructs and methods for characterizing double-stranded target polynucleotides

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101346474A (en) * 2005-12-22 2009-01-14 凯津公司 Alternative nucleotides for improved targeted nucleotide exchange
CN103827320A (en) * 2011-07-25 2014-05-28 牛津纳米孔技术有限公司 Hairpin loop method for double strand polynucleotide sequencing using transmembrane pores
CN106103741A (en) * 2014-01-22 2016-11-09 牛津纳米孔技术公司 The method that one or more polynucleotide associated proteins are connected to target polynucleotide
CN110088299A (en) * 2016-09-29 2019-08-02 牛津纳米孔技术公司 The nucleic acid detection method instructed by nano-pore
CN111849965A (en) * 2019-04-26 2020-10-30 新英格兰生物实验室公司 Polynucleotide adaptor design for reduced bias
WO2020234612A1 (en) * 2019-05-22 2020-11-26 Oxford Nanopore Technologies Limited Method
CN114262735A (en) * 2021-12-15 2022-04-01 成都齐碳科技有限公司 Adaptors for characterising polynucleotides and uses thereof

Also Published As

Publication number Publication date
CN114262735A (en) 2022-04-01

Similar Documents

Publication Publication Date Title
WO2023109538A1 (en) Adaptor used for characterizing polynucleotide and use thereof
US11390904B2 (en) Nanopore-based method and double stranded nucleic acid construct therefor
CN110088299B (en) Nucleic acid detection method guided by nanopores
KR102628035B1 (en) Single cell whole genome library for methylation sequencing
EP2895618B1 (en) Sample preparation method
EP3033435B1 (en) Method for fragmenting nucleic acid by means of transposase
EP3126516B1 (en) Method of target molecule characterisation using a molecular pore
JP6298404B2 (en) Hairpin loop method for double-stranded polynucleotide sequencing using transmembrane pores
CN109196116B (en) Method for characterizing target polynucleotide
JP2002539801A (en) Detection of mutations in genes by specific LNA primers
JP2021523704A (en) Method
CN114854826A (en) Sequences, linkers comprising sequences and uses thereof
JP2022137019A (en) Methods for detecting cystic fibrosis mutations using mitra tip extraction
BRPI0807826A2 (en) "GENERATION OF NUCLEIC ACID MOLECULES".
US10982253B2 (en) Nucleic acid catenane with a linking duplex biosensor for detection of a microorganism target
WO2019090621A1 (en) Hooked probe, method for ligating nucleic acid and method for constructing sequencing library
US20060252058A1 (en) Chip for detection of nucleic acid
KR20190137718A (en) Method for Detecting Target Nucleic Acid Using Rolling Circle Amplification and Composition for Detecting Target Nucleic Acid
US20210388427A1 (en) Liquid sample workflow for nanopore sequencing
CN115874291A (en) Method for marking and simultaneously detecting DNA (deoxyribonucleic acid) and RNA (ribonucleic acid) molecules in sample
CN112041461A (en) Methods for attaching adaptors to single-stranded regions of double-stranded polynucleotides
WO2023116575A1 (en) Adapter for characterizing target polynucleotide, method, and use thereof
CN114921533A (en) Methods and adaptors for characterising a target polynucleotide
CN115175985A (en) Method for extracting single-stranded DNA and RNA from untreated biological sample and sequencing
JP2002335962A (en) Device and method for concentrating gene

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22906288

Country of ref document: EP

Kind code of ref document: A1