WO2020124319A1 - Fusion protein and application thereof - Google Patents

Fusion protein and application thereof Download PDF

Info

Publication number
WO2020124319A1
WO2020124319A1 PCT/CN2018/121545 CN2018121545W WO2020124319A1 WO 2020124319 A1 WO2020124319 A1 WO 2020124319A1 CN 2018121545 W CN2018121545 W CN 2018121545W WO 2020124319 A1 WO2020124319 A1 WO 2020124319A1
Authority
WO
WIPO (PCT)
Prior art keywords
fusion protein
fragment
construct
present
nucleic acid
Prior art date
Application number
PCT/CN2018/121545
Other languages
French (fr)
Chinese (zh)
Inventor
王�琦
郭斐
郭苗苗
李敬
郑越
董宇亮
章文蔚
徐崇钧
Original Assignee
深圳华大生命科学研究院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳华大生命科学研究院 filed Critical 深圳华大生命科学研究院
Priority to PCT/CN2018/121545 priority Critical patent/WO2020124319A1/en
Priority to CN201880099747.3A priority patent/CN113166270A/en
Publication of WO2020124319A1 publication Critical patent/WO2020124319A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K19/00Hybrid peptides, i.e. peptides covalently bound to nucleic acids, or non-covalently bound protein-protein complexes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes

Definitions

  • the invention relates to the field of biology.
  • the invention relates to fusion proteins and their applications. More specifically, the present invention relates to fusion proteins and methods and applications for obtaining them, isolated nucleic acid molecules, constructs, and recombinant cells.
  • DNA ligase is a kind of important molecular biology tool enzyme that can catalyze the formation of phosphodiester bond between two DNA or RNA fragments.
  • the ligation reaction includes gap ligation, cohesive end ligation, TA ligation, blunt end ligation, and branch ligation. Branch), where the TA connection and the flat end connection are widely used in the library building connection, but the efficiency is low.
  • the sources of DNA ligase are diverse and cover almost all species.
  • T4 DNA ligase from T4 bacteriophage has become a unique and significant blunt end or TA end ligation activity for molecular cloning, nucleic acid modification, sequencing and library construction. There is no substitute for an enzyme.
  • junction connection based on blunt ends or TA ends is an indispensable key step in NGS database construction, and its efficiency directly affects the coverage and efficiency of subsequent sequencing results.
  • the reaction system of the linker connection reaction also has a greater impact on the upstream and downstream operations, and it can be determined whether additional nucleic acid purification steps need to be added.
  • the present invention aims to solve at least one of the problems in the prior art at least to a certain extent.
  • the present invention proposes a fusion protein, which can effectively improve the efficiency of linker connection, and is particularly suitable for the linker connection of blunt ends/TA ends, which is convenient for subsequent library construction and sequencing.
  • it simplifies the experimental process and avoids the losses caused by excessive purification.
  • the density and viscosity of the sequencing library construction system are smaller, which is more suitable for the automation platform.
  • T4 DNA ligase alone is based on blunt-end/TA-end linker connection efficiency is low, usually need to add high-density and high-viscosity auxiliary additives (such as 10-30% polyethylene glycol) to achieve Increased connection efficiency.
  • high-viscosity and high-density auxiliary additives will affect the accuracy of subsequent library construction and sequencing, and are not suitable for automated platforms.
  • the inventors found that by combining T4 DNA ligase with a protein having a DNA binding domain (eg Sso 7d binding protein, NF ⁇ B p50 protein), the protein having a DNA binding domain can fix the linker in order to T4 DNA ligase is fully functional and can significantly improve the efficiency of linker ligation in the construction of sequencing libraries. It is especially suitable for ligation of blunt-end/TA-end linkers, which facilitates subsequent sequencing, simplifies the experimental process, and avoids losses caused by excessive purification.
  • a protein having a DNA binding domain eg Sso 7d binding protein, NF ⁇ B p50 protein
  • the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
  • the invention proposes a fusion protein.
  • the fusion protein includes: a first fragment having T4 DNA ligase activity; and a second fragment having one end of the first fragment connected to one end of the second fragment , The second fragment has a DNA binding domain.
  • the protein with a DNA binding domain can fix the linker, so that T4 DNA ligase can fully play its role
  • the efficiency of linker connection can be significantly improved, especially suitable for the connection of blunt end/TA end connection, which facilitates subsequent sequencing, simplifies the experimental process, and avoids the loss caused by excessive purification.
  • the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
  • the fusion protein may also have the following additional technical features:
  • the second fragment has Sso 7d DNA binding protein activity or NF ⁇ B p50 protein activity.
  • the first fragment has the amino acid sequence shown in SEQ ID NO: 1.
  • the second fragment has the amino acid sequence shown in SEQ ID NO: 2 or the amino acid sequence shown in SEQ ID NO: 3.
  • the fusion protein further includes purified fragments and/or screening fragments.
  • the purified fragment is selected from at least one of His and Flag tags.
  • the screening fragment has kanamycin resistance and/or ampicillin resistance.
  • the fusion protein has the amino acid sequence shown in SEQ ID NO: 4 or 5.
  • the invention proposes an isolated nucleic acid molecule.
  • the isolated nucleic acid molecule encodes the aforementioned fusion protein.
  • the fusion protein encoded by the isolated nucleic acid molecule can significantly improve the efficiency of linker connection in the construction of a sequencing library, and is particularly suitable for the linker connection of blunt ends/TA ends, without the need to add high viscosity and high density auxiliary additives such as PEG, which is convenient for subsequent sequencing , Simplify the experimental process and avoid the loss caused by excessive purification.
  • the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
  • the isolated nucleic acid molecule has the nucleotide sequence shown in SEQ ID NO: 6 or 7.
  • the invention proposes a construct.
  • the construct contains the aforementioned isolated nucleic acid molecule.
  • the fusion protein expressed by the construct of the present invention can significantly improve the efficiency of linker connection in the construction of a sequencing library, and is particularly suitable for linker connection of blunt ends/TA ends. There is no need to add high viscosity and high density auxiliary additives such as PEG, which is convenient for subsequent sequencing. Simplify the experimental process and avoid the loss caused by excessive purification.
  • the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
  • the construct is selected from at least one of plasmids, bacteriophages, artificial chromosomes, cosmids, and viruses.
  • the construct is selected from plasmids.
  • the construct is selected from pET28a plasmid.
  • the construct further includes a promoter operably linked to the nucleotide sequence shown in SEQ ID NO: 6 or 7.
  • the invention provides a recombinant cell.
  • the recombinant cell contains the aforementioned isolated nucleic acid molecule or the aforementioned construct.
  • the fusion protein expressed by the recombinant cell of the present invention can significantly improve the efficiency of linker connection in the construction of a sequencing library, and is particularly suitable for the linker connection of blunt ends/TA ends, without the need to add high viscosity and high density auxiliary additives such as PEG, which is convenient for subsequent sequencing. Simplify the experimental process and avoid the loss caused by excessive purification.
  • the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
  • the present invention provides a method for obtaining the aforementioned fusion protein.
  • the recombinant cells described above are cultured under conditions suitable for the expression of the fusion protein, so as to obtain the fusion protein.
  • the fusion protein obtained by the method of the present invention can significantly improve the efficiency of linker connection in constructing a sequencing library, and is particularly suitable for linker connection of blunt end/TA end connection, without the need to add high viscosity and high density auxiliary additives such as PEG, which is convenient for subsequent sequencing , Simplify the experimental process and avoid the loss caused by excessive purification.
  • the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
  • the present invention proposes the application of the aforementioned fusion protein in constructing a sequencing library.
  • the fusion protein of the present invention can significantly improve the efficiency of linker connection in the construction of a sequencing library, and is particularly suitable for linker connection of blunt ends/TA ends.
  • high viscosity and high density auxiliary additives such as PEG, which facilitates subsequent sequencing and simplifies the experimental process. Avoid losses caused by excessive purification.
  • PEG and other high-viscosity and high-density additives the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
  • FIG. 1 shows an electrophoresis diagram of purification results of recombinant T4 DNA ligase fusion protein G1 according to an embodiment of the present invention
  • FIG. 2 shows an electrophoresis diagram of the purification result of recombinant T4 DNA ligase fusion protein G2 according to an embodiment of the present invention
  • FIG. 3 shows an electrophoresis diagram of the test result of the ligation efficiency of the blunt end of the recombinant T4 DNA ligase fusion protein G1 according to an embodiment of the present invention
  • FIG. 4 shows an electrophoresis diagram of the test result of the ligation efficiency of the blunt end of the recombinant T4 DNA ligase fusion protein G2 according to an embodiment of the present invention.
  • FIG. 5 shows an analysis diagram of the connection efficiency of the NGS library construction connector according to an embodiment of the present invention.
  • the present invention proposes fusion proteins, isolated nucleic acid molecules, constructs, recombinant cells, methods for obtaining fusion proteins, and applications of fusion proteins in constructing sequencing libraries, which will be described in detail below.
  • the invention proposes a fusion protein.
  • the fusion protein includes: a first fragment having T4 DNA ligase activity; and a second fragment, one end of the first fragment is connected to one end of the second fragment, the second fragment With DNA binding domain.
  • the protein with a DNA binding domain can fix the linker, so that T4 DNA ligase can fully play its role
  • the efficiency of linker connection can be significantly improved, especially suitable for the connection of blunt end/TA end connection, which facilitates subsequent sequencing, simplifies the experimental process, and avoids the loss caused by excessive purification.
  • the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
  • amino acid fragments are generally divided into N-terminal and C-terminal.
  • the connection method of the first fragment and the second fragment is not strictly limited in the present invention, as long as it is the N-terminal of one fragment and the C of another fragment The end can be connected.
  • the second fragment has Sso 7d DNA binding protein activity or NF ⁇ B p50 protein activity.
  • the Sso 7d binding protein is derived from the hyperthermophilic archaebacterium Sulfolobus solfataricus, with a molecular weight of approximately 7.0 kDa, and a denaturation temperature near neutral pH close to 100°C. It has non-sequence-specific DNA binding activity and binds to the minor groove of double-stranded DNA.
  • NF ⁇ B p50 is a eukaryotic transcription factor derived from the eukaryotic Homo sapiens, with a molecular weight of approximately 36.5 kDa and a DNA binding domain, which can enhance the expression of specific anti-apoptotic genes.
  • the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
  • the first fragment has the amino acid sequence shown in SEQ ID NO:1.
  • T4 DNA ligase is a commonly used DNA ligase in library construction, especially suitable for blunt end or TA end ligation.
  • the second fragment has the amino acid sequence shown in SEQ ID NO: 2 or the amino acid sequence shown in SEQ ID NO: 3.
  • the fusion protein formed by Sso 7d binding protein (SEQ ID NO: 2) or NF ⁇ B p50 protein (SEQ ID NO: 3) and T4 DNA ligase can significantly improve the efficiency of linker connection, especially suitable for blunt end/TA end connection
  • the connection facilitates subsequent sequencing, simplifies the experimental process, and avoids the loss caused by excessive purification.
  • high viscosity and high density additives such as PEG, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
  • the fusion protein further includes purified fragments and/or screening fragments. In order to achieve the purpose of screening and separation.
  • the purified fragment is selected from at least one of His and Flag tags. This makes it easy to screen for positive transformants.
  • the screening fragment has kanamycin resistance and/or ampicillin resistance.
  • the efficiency of screening recombinant cells that receive foreign constructs can be further improved.
  • the fusion protein has the amino acid sequence shown in SEQ ID NO: 4 or 5.
  • the fusion protein formed by T4 DNA ligase and Sso 7d DNA binding protein (SEQ ID NO: 4) or the fusion protein formed by T4 DNA ligase and NF ⁇ B p50 protein (SEQ ID NO: 5) can significantly improve the joint connection efficiency It is especially suitable for the connection of blunt end/TA end connection, which facilitates subsequent sequencing, simplifies the experimental process, and avoids the loss caused by excessive purification.
  • the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
  • the invention proposes an isolated nucleic acid molecule.
  • the isolated nucleic acid molecule encodes the aforementioned fusion protein.
  • the fusion protein encoded by the isolated nucleic acid molecule can significantly improve the efficiency of linker connection in the construction of a sequencing library, and is particularly suitable for the linker connection of blunt ends/TA ends, without the need to add high viscosity and high density auxiliary additives such as PEG, which is convenient for subsequent sequencing , Simplify the experimental process and avoid the loss caused by excessive purification.
  • the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
  • the isolated nucleic acid molecule has the nucleotide sequence shown in SEQ ID NO: 6 or 7.
  • the nucleotide sequence shown in SEQ ID NO: 6 encodes the amino acid sequence shown in SEQ ID NO: 4
  • the nucleotide sequence shown in SEQ ID NO: 7 encodes the amino acid sequence shown in SEQ ID NO: 5.
  • the invention proposes a construct.
  • the construct contains the aforementioned isolated nucleic acid molecule.
  • the fusion protein expressed by the construct of the present invention can significantly improve the efficiency of linker connection in the construction of a sequencing library, and is particularly suitable for linker connection of blunt ends/TA ends. There is no need to add high viscosity and high density auxiliary additives such as PEG, which is convenient for subsequent sequencing. Simplify the experimental process and avoid the loss caused by excessive purification.
  • the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
  • the construct is selected from at least one of plasmids, bacteriophages, artificial chromosomes, cosmids and viruses, preferably plasmids, more preferably pET28a plasmids. This can effectively improve the efficiency of genetic transformation using the construct of the present invention.
  • the construct of the present invention may also include other elements, so as to impart additional beneficial effects to the construct.
  • the construct further includes a promoter operably linked to the nucleotide sequence shown in SEQ ID NO: 6 or 7.
  • promoter used in the present invention refers to a nucleic acid sequence capable of directing the transcription of a nucleic acid molecule operably linked thereto.
  • operably used in the present invention refers to a functional connection between a nucleic acid expression control sequence such as a promoter, a signal sequence, an enhancer, etc. and a target nucleic acid sequence, wherein when a suitable molecule such as a transcription activation molecule is When the expression control sequence is combined, the expression control sequence affects the transcription and/or translation of the nucleic acid corresponding to the target nucleic acid sequence.
  • a specific promoter can be directly introduced into the host cell through the construct, and the promoter can be used to initiate the transcription and expression of the nucleotide sequence shown in SEQ ID NO: 6 or 7, which can improve The expression efficiency of T4 DNA ligase gene, Ssco7d binding protein gene/NF ⁇ B p50 protein in the obtained recombinant cells.
  • the type of the promoter is not particularly limited, as long as the gene carried by the construct can be efficiently expressed in the host animal cell.
  • the invention provides a recombinant cell.
  • the recombinant cell contains the aforementioned isolated nucleic acid molecule or the aforementioned construct.
  • the fusion protein expressed by the recombinant cell of the present invention can significantly improve the efficiency of linker connection in the construction of a sequencing library, and is particularly suitable for the linker connection of blunt ends/TA ends, without the need to add high viscosity and high density auxiliary additives such as PEG, which is convenient for subsequent sequencing. Simplify the experimental process and avoid the loss caused by excessive purification.
  • the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
  • the present invention provides a method for obtaining the aforementioned fusion protein.
  • the recombinant cells described above are cultured under conditions suitable for the expression of the fusion protein, so as to obtain the fusion protein.
  • the fusion protein obtained by the method of the present invention can significantly improve the efficiency of linker connection in constructing a sequencing library, and is particularly suitable for linker connection of blunt end/TA end connection, without the need to add high viscosity and high density auxiliary additives such as PEG, which is convenient for subsequent sequencing , Simplify the experimental process and avoid the loss caused by excessive purification.
  • the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
  • the present invention proposes the application of the aforementioned fusion protein in constructing a sequencing library.
  • the fusion protein of the present invention can significantly improve the efficiency of linker connection in the construction of a sequencing library, and is particularly suitable for linker connection of blunt ends/TA ends.
  • high viscosity and high density auxiliary additives such as PEG, which facilitates subsequent sequencing and simplifies the experimental process. Avoid losses caused by excessive purification.
  • high viscosity and high density additives such as PEG, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
  • T4 DNA ligase fusion protein G1 is composed of T4 DNA ligase and Sso 7d binding protein derived from the superthermophilic archaea Sulfolobus solfataricus.
  • the pET 28a vector was digested with NcoI and HindIII, and the gene sequence encoded by T4 DNA ligase and the gene sequence of Sso 7d were inserted into the cloning area of the vector pET 28a.
  • the 6 Hiss at the C-terminus of the amino acid sequence of the recombinant T4 DNA ligase fusion protein were used as purification tags.
  • the screening tag was kanamycin, and the constructed vector was named pET28a-G1.
  • T4 DNA ligase fusion protein G2 is composed of T4 DNA ligase and eukaryotic transcription factor NF ⁇ B p50 derived from the eukaryote Homo sapiens.
  • NF ⁇ B p50 eukaryotic transcription factor NF ⁇ B p50 derived from the eukaryote Homo sapiens.
  • NdeI and SacI the gene sequence encoded by T4 DNA ligase and the gene sequence of p50 were inserted into the cloning area of the vector pET28a.
  • the 6 Hiss at the N-terminus of the amino acid sequence of the recombinant T4 DNA ligase fusion protein were used as purification tags.
  • the selection tag was kanamycin, and the constructed vector was named pET28a-G2.
  • LB liquid medium tryptone 10g/L, yeast extract 5g/L, NaCl 10g/L.
  • the recombinant expression vector pET28a-G1 or pET28a-G2 was transformed into E. coli expression strain Ecoli.BL21 (DE3), and the bacterial solution was evenly spread on a 50 ⁇ g/mL kanamycin plate, and cultured at 37°C overnight. Single colonies were picked and cultured in 5ml LB medium (containing 50 ⁇ g/mL kanamycin), 37°C, 200rpm, and cultured overnight. The bacterial solution obtained above was inoculated at 1:100 in 50 mL LB (containing 50 ⁇ g/mL kanamycin) for cultivation, 37° C., 200 rpm, 4 h.
  • Buffer A equilibration buffer 50mM Tris-HCl + 500mM NaCl + 10mM imidazole, pH 7.0.
  • Buffer B elution buffer 50mM Tris-HCl + 500mM NaCl + 500mM imidazole, pH 7.0.
  • Buffer Balance buffer 50mM Tris-HCl + 100mM NaCl, pH 7.0.
  • Buffer D elution buffer 50mM Tris-HCl+1M NaCl, pH 7.0.
  • Buffer E dilution 50mM Tris-HCl, pH 7.0.
  • Buffer F 2 ⁇ Storage solution 20mM Tris-HCl+100mM KCl+0.2mM EDTA, pH 7.0.
  • the blunt-end ligation efficiency of the fusion protein was mainly detected, and the 570bp blunt-end fragment was used as the reaction substrate.
  • the substrate was obtained by PCR, and the upstream primer was designed with a phosphate group at the 5'end, and the downstream primer was a common primer. After the PCR, the PCR product was recovered using Cycle Pure Kit (D6493, Omega).
  • the reaction system for the ligation efficiency test was 10 ⁇ L, containing substrate 120 ng, fusion protein G1 or G2 2 ⁇ M, 1 ⁇ BGI T4 DNA Ligase reaction buffer (50 mM Tris-HCl, 10 mM MgCl 2 , 5 mM DTT, 1 mM ATP, pH 7.5 25 °C ).
  • the above prepared reaction mixture was mixed well, placed on a PCR instrument for reaction, incubated at 16°C for 20 min, and heat-inactivated at 65°C for 15 min. After the reaction was completed, 1 ⁇ L of Proteinase K and 1 ⁇ L of 1% SDS were added thereto, and after thorough mixing, the agarose gel test was performed. The experimental results are shown in Figures 3 and 4. Compared with T4 DNA ligase, the fusion proteins G1 and G2 consume more substrate, higher product volume, and better ligation efficiency.
  • Example 5 The fusion protein is used to construct a linker in the NGS library to improve the conversion rate of the library
  • the conversion rate of the library that is, the efficiency of joining the DNA fragments at both ends of the linker. Only DNA that has been successfully ligated at both ends can be effectively amplified and eventually become a sequencing template.
  • a single-band PCR product (220bp) is used in a tube reaction to sequentially perform end repair, A addition, and joint addition reactions.
  • the reaction product is purified by magnetic beads and then detected by a 2100 bioanalyzer to analyze the reaction substrate. 1.
  • the peak concentration of the single-ended connector product and the double-ended connector product and then calculate the double-ended connector connection efficiency.
  • NGS terminal repair reaction the system is as follows:
  • NGS connection joint reaction the system is as follows:
  • Step 1 reaction product 50 ⁇ L Library adapter (25 ⁇ M) 2 ⁇ L 10X T4 PNK Buffer 3 ⁇ L ATP (100mM) 0.8 ⁇ L T4DNALigase(600U/ ⁇ L) 1.6 ⁇ L Nuclease Free Water up to 80 ⁇ L Total 80 ⁇ L
  • NGS terminal repair reaction the system is as follows:
  • E.coli gDNA interrupts double-selection products ⁇ 50ng 10X PNK reaction Buffer 5 ⁇ L 5:1 dATP: dNTP 0.6 ⁇ L T4PNK(10U/ ⁇ L) 0.6 ⁇ L Klenowfragment(5U/ ⁇ L) 0.1 ⁇ L rTaq(5U/ ⁇ L) 0.2 ⁇ L T4 DNA polymerase (3U/ ⁇ L) 2 ⁇ L Nuclease Free Water up to 50 ⁇ L Total 50 ⁇ L
  • the experimental groups are set as follows:
  • NGS library amplification the system is as follows:
  • the product was purified with 1 volume of Ampure XP beads, and the purified product was quantified by Qubit dsDNA HS Assay.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Molecular Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Engineering & Computer Science (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Microbiology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Peptides Or Proteins (AREA)

Abstract

Provided are a fusion protein, a method for obtaining the same, an application thereof, an isolated nucleic acid molecule, a construct, and a recombinant cell. The fusion protein comprises: a first segment having T4 DNA ligase activity; and a second segment having an end connected to an end of the first segment, where the second segment has a DNA-binding domain.

Description

融合蛋白及其应用Fusion protein and its application 技术领域Technical field
本发明涉及生物领域。具体地,本发明涉及融合蛋白及其应用。更具体地,本发明涉及融合蛋白及其获得方法和应用、分离的核酸分子、构建体以及重组细胞。The invention relates to the field of biology. In particular, the invention relates to fusion proteins and their applications. More specifically, the present invention relates to fusion proteins and methods and applications for obtaining them, isolated nucleic acid molecules, constructs, and recombinant cells.
背景技术Background technique
DNA连接酶是一类可以催化两个DNA或者RNA片段之间形成磷酸二酯键的重要分子生物学工具酶。根据连接酶催化的核酸底物构象的不同,连接反应包括缺口连接(Nick ligation)、粘性末端连接(Cohesive end ligation)、TA连接(TA ligation)、平末端连接(Blunt end ligation)及分支连接(Branch ligation),其中TA连接和平末端连接在建库接头连接中应用较广,但是效率较低。DNA连接酶的来源多种多样,几乎覆盖了所有物种,其中来源于T4噬菌体的T4 DNA连接酶以其独特而显著的平末端或者TA末端连接活性成为了分子克隆、核酸修饰、测序建库中无可替代的一种酶。DNA ligase is a kind of important molecular biology tool enzyme that can catalyze the formation of phosphodiester bond between two DNA or RNA fragments. Depending on the conformation of the nucleic acid substrate catalyzed by the ligase, the ligation reaction includes gap ligation, cohesive end ligation, TA ligation, blunt end ligation, and branch ligation. Branch), where the TA connection and the flat end connection are widely used in the library building connection, but the efficiency is low. The sources of DNA ligase are diverse and cover almost all species. Among them, T4 DNA ligase from T4 bacteriophage has become a unique and significant blunt end or TA end ligation activity for molecular cloning, nucleic acid modification, sequencing and library construction. There is no substitute for an enzyme.
基于平末端或者TA末端的接头连接是NGS建库中不可缺少的关键步骤,其效率直接影响了后续测序结果的覆盖度和效率。同时,接头连接反应的反应体系也对上下游的操作有较大影响,可以决定是否需要添加额外的核酸纯化步骤。Junction connection based on blunt ends or TA ends is an indispensable key step in NGS database construction, and its efficiency directly affects the coverage and efficiency of subsequent sequencing results. At the same time, the reaction system of the linker connection reaction also has a greater impact on the upstream and downstream operations, and it can be determined whether additional nucleic acid purification steps need to be added.
然而,目前的DNA连接酶仍有待改进。However, the current DNA ligase still needs to be improved.
发明内容Summary of the invention
本发明旨在至少在一定程度上解决现有技术中存在的问题的至少之一。为此,本发明提出了一种融合蛋白,其可以有效地提高接头连接效率,尤其适用于平末端/TA末端连接的接头连接,便于后续建库及测序。同时,简化了实验流程,避免过度纯化造成的损失。此外,测序文库构建体系的密度及粘度较小,更适合于自动化平台。The present invention aims to solve at least one of the problems in the prior art at least to a certain extent. To this end, the present invention proposes a fusion protein, which can effectively improve the efficiency of linker connection, and is particularly suitable for the linker connection of blunt ends/TA ends, which is convenient for subsequent library construction and sequencing. At the same time, it simplifies the experimental process and avoids the losses caused by excessive purification. In addition, the density and viscosity of the sequencing library construction system are smaller, which is more suitable for the automation platform.
需要说明的是,本发明是基于发明人的下列发现而完成的:It should be noted that the present invention was completed based on the inventor's following findings:
在NGS建库中,单独采用T4 DNA连接酶基于平末端/TA末端连接的接头连接效率低,通常需要通过添加高密度高粘度的辅助添加剂(例如10~30%的聚乙二醇)来实现连接效率的提高。但是,这种高粘度高密度的辅助添加剂会影响后续建库及测序的准确性,也不适合自动化平台。In the NGS library construction, the use of T4 DNA ligase alone is based on blunt-end/TA-end linker connection efficiency is low, usually need to add high-density and high-viscosity auxiliary additives (such as 10-30% polyethylene glycol) to achieve Increased connection efficiency. However, such high-viscosity and high-density auxiliary additives will affect the accuracy of subsequent library construction and sequencing, and are not suitable for automated platforms.
有鉴于此,发明人发现,通过将T4 DNA连接酶与具有DNA结合结构域的蛋白(例如Sso 7d结合蛋白、NFκB p50蛋白)形成融合蛋白,其中具有DNA结合结构域的蛋白可以 固定接头,以便于T4 DNA连接酶充分发挥作用,在构建测序文库中能够显著提高接头连接效率,尤其适用于平末端/TA末端连接的接头连接,便于后续测序,简化实验流程,避免过度纯化造成的损失。此外,由于无需添加PEG等高粘度高密度添加剂,因此测序文库构建体系的密度及粘度较小,提高了后续建库及测序的准确性和特异性,更适合于自动化平台。In view of this, the inventors found that by combining T4 DNA ligase with a protein having a DNA binding domain (eg Sso 7d binding protein, NFκB p50 protein), the protein having a DNA binding domain can fix the linker in order to T4 DNA ligase is fully functional and can significantly improve the efficiency of linker ligation in the construction of sequencing libraries. It is especially suitable for ligation of blunt-end/TA-end linkers, which facilitates subsequent sequencing, simplifies the experimental process, and avoids losses caused by excessive purification. In addition, because there is no need to add high viscosity and high density additives such as PEG, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
为此,在本发明的一个方面,本发明提出了一种融合蛋白。根据本发明的实施例,所述融合蛋白包括:第一片段,所述第一片段具有T4 DNA连接酶活性;以及第二片段,所述第一片段的一端与所述第二片段的一端相连,所述第二片段具有DNA结合结构域。通过将T4 DNA连接酶与具有DNA结合结构域的蛋白(例如Sso 7d结合蛋白、NFκB p50蛋白)形成融合蛋白,其中具有DNA结合结构域的蛋白可以固定接头,以便于T4 DNA连接酶充分发挥作用,在构建测序文库中能够显著提高接头连接效率,尤其适用于平末端/TA末端连接的接头连接,便于后续测序,简化实验流程,避免过度纯化造成的损失。此外,由于无需添加PEG等高粘度高密度添加剂,因此测序文库构建体系的密度及粘度较小,提高了后续建库及测序的准确性和特异性,更适合于自动化平台。To this end, in one aspect of the invention, the invention proposes a fusion protein. According to an embodiment of the present invention, the fusion protein includes: a first fragment having T4 DNA ligase activity; and a second fragment having one end of the first fragment connected to one end of the second fragment , The second fragment has a DNA binding domain. By combining T4 DNA ligase with a protein with a DNA binding domain (such as Sso 7d binding protein, NFκB p50 protein), the protein with a DNA binding domain can fix the linker, so that T4 DNA ligase can fully play its role In the construction of sequencing libraries, the efficiency of linker connection can be significantly improved, especially suitable for the connection of blunt end/TA end connection, which facilitates subsequent sequencing, simplifies the experimental process, and avoids the loss caused by excessive purification. In addition, because there is no need to add high viscosity and high density additives such as PEG, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
根据本发明的实施例,所述融合蛋白还可以具有下列附加技术特征:According to an embodiment of the present invention, the fusion protein may also have the following additional technical features:
根据本发明的实施例,所述第二片段具有Sso 7d DNA结合蛋白活性或者NFκB p50蛋白活性。According to an embodiment of the present invention, the second fragment has Sso 7d DNA binding protein activity or NFκB p50 protein activity.
根据本发明的实施例,所述第一片段具有SEQ ID NO:1所示的氨基酸序列。According to an embodiment of the present invention, the first fragment has the amino acid sequence shown in SEQ ID NO: 1.
根据本发明的实施例,所述第二片段具有SEQ ID NO:2所示的氨基酸序列或者SEQ ID NO:3所示的氨基酸序列。According to an embodiment of the present invention, the second fragment has the amino acid sequence shown in SEQ ID NO: 2 or the amino acid sequence shown in SEQ ID NO: 3.
根据本发明的实施例,所述融合蛋白进一步包括纯化片段和/或筛选片段。According to an embodiment of the present invention, the fusion protein further includes purified fragments and/or screening fragments.
根据本发明的实施例,所述纯化片段选自His和Flag标签的至少一种。According to an embodiment of the present invention, the purified fragment is selected from at least one of His and Flag tags.
根据本发明的实施例,所述筛选片段具有卡那霉素抗性和/或氨苄青霉素抗性。According to an embodiment of the present invention, the screening fragment has kanamycin resistance and/or ampicillin resistance.
根据本发明的实施例,所述融合蛋白具有SEQ ID NO:4或5所示的氨基酸序列。According to an embodiment of the present invention, the fusion protein has the amino acid sequence shown in SEQ ID NO: 4 or 5.
在本发明的又一方面,本发明提出了一种分离的核酸分子。根据本发明的实施例,所述分离的核酸分子编码前面所述的融合蛋白。利用该分离的核酸分子编码的融合蛋白在构建测序文库中能够显著提高接头连接效率,尤其适用于平末端/TA末端连接的接头连接,无需添加PEG等高粘度高密度的辅助添加剂,便于后续测序,简化实验流程,避免过度纯化造成的损失。此外,由于无需添加PEG等高粘度高密度添加剂,因此测序文库构建体系的密度及粘度较小,提高了后续建库及测序的准确性和特异性,更适合于自动化平台。In yet another aspect of the invention, the invention proposes an isolated nucleic acid molecule. According to an embodiment of the present invention, the isolated nucleic acid molecule encodes the aforementioned fusion protein. The fusion protein encoded by the isolated nucleic acid molecule can significantly improve the efficiency of linker connection in the construction of a sequencing library, and is particularly suitable for the linker connection of blunt ends/TA ends, without the need to add high viscosity and high density auxiliary additives such as PEG, which is convenient for subsequent sequencing , Simplify the experimental process and avoid the loss caused by excessive purification. In addition, because there is no need to add high viscosity and high density additives such as PEG, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
根据本发明的实施例,所述分离的核酸分子具有SEQ ID NO:6或7所示的核苷酸序 列。According to an embodiment of the present invention, the isolated nucleic acid molecule has the nucleotide sequence shown in SEQ ID NO: 6 or 7.
在本发明的又一方面,本发明提出了一种构建体。根据本发明的实施例,所述构建体含有前面所述的分离的核酸分子。本发明的构建体表达的融合蛋白在构建测序文库中能够显著提高接头连接效率,尤其适用于平末端/TA末端连接的接头连接,无需添加PEG等高粘度高密度的辅助添加剂,便于后续测序,简化实验流程,避免过度纯化造成的损失。此外,由于无需添加PEG等高粘度高密度添加剂,因此测序文库构建体系的密度及粘度较小,提高了后续建库及测序的准确性和特异性,更适合于自动化平台。In yet another aspect of the invention, the invention proposes a construct. According to an embodiment of the present invention, the construct contains the aforementioned isolated nucleic acid molecule. The fusion protein expressed by the construct of the present invention can significantly improve the efficiency of linker connection in the construction of a sequencing library, and is particularly suitable for linker connection of blunt ends/TA ends. There is no need to add high viscosity and high density auxiliary additives such as PEG, which is convenient for subsequent sequencing. Simplify the experimental process and avoid the loss caused by excessive purification. In addition, because there is no need to add high viscosity and high density additives such as PEG, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
根据本发明的实施例,所述构建体选自质粒、噬菌体、人工染色体、粘粒以及病毒的至少一种。According to an embodiment of the present invention, the construct is selected from at least one of plasmids, bacteriophages, artificial chromosomes, cosmids, and viruses.
根据本发明的实施例,所述构建体选自质粒。According to an embodiment of the invention, the construct is selected from plasmids.
根据本发明的实施例,所述构建体选自pET 28a质粒。According to an embodiment of the present invention, the construct is selected from pET28a plasmid.
根据本发明的实施例,所述构建体进一步包括启动子,所述启动子与SEQ ID NO:6或7所示的核苷酸序列可操纵地连接。According to an embodiment of the present invention, the construct further includes a promoter operably linked to the nucleotide sequence shown in SEQ ID NO: 6 or 7.
在本发明的又一方面,本发明提出了一种重组细胞。根据本发明的实施例,所述重组细胞含有前面所述的分离的核酸分子或者前面所述的构建体。本发明的重组细胞表达的融合蛋白在构建测序文库中能够显著提高接头连接效率,尤其适用于平末端/TA末端连接的接头连接,无需添加PEG等高粘度高密度的辅助添加剂,便于后续测序,简化实验流程,避免过度纯化造成的损失。此外,由于无需添加PEG等高粘度高密度添加剂,因此测序文库构建体系的密度及粘度较小,提高了后续建库及测序的准确性和特异性,更适合于自动化平台。In yet another aspect of the invention, the invention provides a recombinant cell. According to an embodiment of the present invention, the recombinant cell contains the aforementioned isolated nucleic acid molecule or the aforementioned construct. The fusion protein expressed by the recombinant cell of the present invention can significantly improve the efficiency of linker connection in the construction of a sequencing library, and is particularly suitable for the linker connection of blunt ends/TA ends, without the need to add high viscosity and high density auxiliary additives such as PEG, which is convenient for subsequent sequencing. Simplify the experimental process and avoid the loss caused by excessive purification. In addition, because there is no need to add high viscosity and high density additives such as PEG, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
在本发明的又一方面,本发明提出了一种获得前面所述融合蛋白的方法。根据本发明的实施例,在适于所述融合蛋白表达的条件下,培养前面所述的重组细胞,以便获得所述融合蛋白。利用本发明的方法所获得的融合蛋白在构建测序文库中能够显著提高接头连接效率,尤其适用于平末端/TA末端连接的接头连接,无需添加PEG等高粘度高密度的辅助添加剂,便于后续测序,简化实验流程,避免过度纯化造成的损失。此外,由于无需添加PEG等高粘度高密度添加剂,因此测序文库构建体系的密度及粘度较小,提高了后续建库及测序的准确性和特异性,更适合于自动化平台。In yet another aspect of the present invention, the present invention provides a method for obtaining the aforementioned fusion protein. According to an embodiment of the present invention, the recombinant cells described above are cultured under conditions suitable for the expression of the fusion protein, so as to obtain the fusion protein. The fusion protein obtained by the method of the present invention can significantly improve the efficiency of linker connection in constructing a sequencing library, and is particularly suitable for linker connection of blunt end/TA end connection, without the need to add high viscosity and high density auxiliary additives such as PEG, which is convenient for subsequent sequencing , Simplify the experimental process and avoid the loss caused by excessive purification. In addition, because there is no need to add high viscosity and high density additives such as PEG, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
在本发明的又一方面,本发明提出了前面所述融合蛋白在构建测序文库中的应用。本发明的融合蛋白在构建测序文库中能够显著提高接头连接效率,尤其适用于平末端/TA末端连接的接头连接,无需添加PEG等高粘度高密度的辅助添加剂,便于后续测序,简化实验流程,避免过度纯化造成的损失。此外,由于无需添加PEG等高粘度高密度添加剂,因 此测序文库构建体系的密度及粘度较小,提高了后续建库及测序的准确性和特异性,更适合于自动化平台。In another aspect of the present invention, the present invention proposes the application of the aforementioned fusion protein in constructing a sequencing library. The fusion protein of the present invention can significantly improve the efficiency of linker connection in the construction of a sequencing library, and is particularly suitable for linker connection of blunt ends/TA ends. There is no need to add high viscosity and high density auxiliary additives such as PEG, which facilitates subsequent sequencing and simplifies the experimental process. Avoid losses caused by excessive purification. In addition, since there is no need to add PEG and other high-viscosity and high-density additives, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
本发明的附加方面和优点将在下面的描述中部分给出,部分将从下面的描述中变得明显,或通过本发明的实践了解到。Additional aspects and advantages of the present invention will be partially given in the following description, and some will become apparent from the following description, or be learned through the practice of the present invention.
附图说明BRIEF DESCRIPTION
本发明的上述和/或附加的方面和优点从结合下面附图对实施例的描述中将变得明显和容易理解,其中:The above and/or additional aspects and advantages of the present invention will become apparent and easily understood from the description of the embodiments in conjunction with the following drawings, in which:
图1显示了根据本发明一个实施例的重组型T4 DNA连接酶融合蛋白G1纯化结果电泳图;FIG. 1 shows an electrophoresis diagram of purification results of recombinant T4 DNA ligase fusion protein G1 according to an embodiment of the present invention;
图2显示了根据本发明一个实施例的重组型T4 DNA连接酶融合蛋白G2纯化结果电泳图;2 shows an electrophoresis diagram of the purification result of recombinant T4 DNA ligase fusion protein G2 according to an embodiment of the present invention;
图3显示了根据本发明一个实施例的重组型T4 DNA连接酶融合蛋白G1平末端连接效率测试结果电泳图;3 shows an electrophoresis diagram of the test result of the ligation efficiency of the blunt end of the recombinant T4 DNA ligase fusion protein G1 according to an embodiment of the present invention;
图4显示了根据本发明一个实施例的重组型T4 DNA连接酶融合蛋白G2平末端连接效率测试结果电泳图;以及4 shows an electrophoresis diagram of the test result of the ligation efficiency of the blunt end of the recombinant T4 DNA ligase fusion protein G2 according to an embodiment of the present invention; and
图5显示了根据本发明一个实施例的NGS文库构建接头连接效率分析图。FIG. 5 shows an analysis diagram of the connection efficiency of the NGS library construction connector according to an embodiment of the present invention.
具体实施方式detailed description
下面详细描述本发明的实施例。下面描述的实施例是示例性的,仅用于解释本发明,而不能理解为对本发明的限制。The embodiments of the present invention are described in detail below. The embodiments described below are exemplary and are only used to explain the present invention, and cannot be construed as limiting the present invention.
本发明提出了融合蛋白、分离的核酸分子、构建体、重组细胞、获得融合蛋白的方法和融合蛋白在构建测序文库中的应用,下面将分别对其进行详细描述。The present invention proposes fusion proteins, isolated nucleic acid molecules, constructs, recombinant cells, methods for obtaining fusion proteins, and applications of fusion proteins in constructing sequencing libraries, which will be described in detail below.
融合蛋白Fusion protein
在本发明的一个方面,本发明提出了一种融合蛋白。根据本发明的实施例,融合蛋白包括:第一片段,该第一片段具有T4 DNA连接酶活性;以及第二片段,该第一片段的一端与所述第二片段的一端相连,第二片段具有DNA结合结构域。通过将T4 DNA连接酶与具有DNA结合结构域的蛋白(例如Sso 7d结合蛋白、NFκB p50蛋白)形成融合蛋白,其中具有DNA结合结构域的蛋白可以固定接头,以便于T4 DNA连接酶充分发挥作用,在构建测序文库中能够显著提高接头连接效率,尤其适用于平末端/TA末端连接的接头连接, 便于后续测序,简化实验流程,避免过度纯化造成的损失。此外,由于无需添加PEG等高粘度高密度添加剂,因此测序文库构建体系的密度及粘度较小,提高了后续建库及测序的准确性和特异性,更适合于自动化平台。In one aspect of the invention, the invention proposes a fusion protein. According to an embodiment of the present invention, the fusion protein includes: a first fragment having T4 DNA ligase activity; and a second fragment, one end of the first fragment is connected to one end of the second fragment, the second fragment With DNA binding domain. By combining T4 DNA ligase with a protein with a DNA binding domain (such as Sso 7d binding protein, NFκB p50 protein), the protein with a DNA binding domain can fix the linker, so that T4 DNA ligase can fully play its role In the construction of sequencing libraries, the efficiency of linker connection can be significantly improved, especially suitable for the connection of blunt end/TA end connection, which facilitates subsequent sequencing, simplifies the experimental process, and avoids the loss caused by excessive purification. In addition, because there is no need to add high viscosity and high density additives such as PEG, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
本领域技术人员能够理解的是,氨基酸片段通常分为N端和C端,本发明对于第一片段和第二片段的连接方式不作严格限定,只要是一个片段的N端与另一个片段的C端相连即可。Those skilled in the art can understand that amino acid fragments are generally divided into N-terminal and C-terminal. The connection method of the first fragment and the second fragment is not strictly limited in the present invention, as long as it is the N-terminal of one fragment and the C of another fragment The end can be connected.
根据本发明的实施例,所述第二片段具有Sso 7d DNA结合蛋白活性或者NFκB p50蛋白活性。According to an embodiment of the present invention, the second fragment has Sso 7d DNA binding protein activity or NFκB p50 protein activity.
Sso 7d结合蛋白是来自超嗜热古细菌Sulfolobus solfataricus,分子量约7.0kDa,在中性pH附近的变性温度接近100℃,具有非序列特异性的DNA结合活性,与双链DNA的小沟结合。The Sso 7d binding protein is derived from the hyperthermophilic archaebacterium Sulfolobus solfataricus, with a molecular weight of approximately 7.0 kDa, and a denaturation temperature near neutral pH close to 100°C. It has non-sequence-specific DNA binding activity and binds to the minor groove of double-stranded DNA.
NFκB p50是来源于真核生物Homo sapiens的真核转录因子,分子量约为36.5kDa,具有DNA结合结构域,能够增强特异性抗凋亡基因的表达。NFκB p50 is a eukaryotic transcription factor derived from the eukaryotic Homo sapiens, with a molecular weight of approximately 36.5 kDa and a DNA binding domain, which can enhance the expression of specific anti-apoptotic genes.
发明人发现,通过将T4 DNA连接酶与Sso 7d结合蛋白或者NFκB p50蛋白形成融合蛋白,其中Sso 7d结合蛋白或者NFκB p50蛋白可以固定接头,以便于T4 DNA连接酶充分发挥作用,在构建测序文库中能够显著提高接头连接效率,尤其适用于平末端/TA末端连接的接头连接,便于后续测序,简化实验流程,避免过度纯化造成的损失。此外,由于无需添加PEG等高粘度高密度添加剂,因此测序文库构建体系的密度及粘度较小,提高了后续建库及测序的准确性和特异性,更适合于自动化平台。The inventor found that by combining T4 DNA ligase with Sso 7d binding protein or NFκB p50 protein to form a fusion protein, where Sso 7d binding protein or NFκB p50 protein can fix the linker, so that T4 DNA ligase can fully play its role in constructing a sequencing library Can significantly improve the efficiency of joint connection, especially suitable for the connection of blunt end/TA end connection, which facilitates subsequent sequencing, simplifies the experimental process, and avoids the loss caused by excessive purification. In addition, because there is no need to add high viscosity and high density additives such as PEG, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
根据本发明的实施例,第一片段具有SEQ ID NO:1所示的氨基酸序列。T4 DNA连接酶是建库中常用的DNA连接酶,尤其适用于平末端或者TA末端连接。According to an embodiment of the present invention, the first fragment has the amino acid sequence shown in SEQ ID NO:1. T4 DNA ligase is a commonly used DNA ligase in library construction, especially suitable for blunt end or TA end ligation.
Figure PCTCN2018121545-appb-000001
Figure PCTCN2018121545-appb-000001
根据本发明的实施例,第二片段具有SEQ ID NO:2所示的氨基酸序列或者SEQ ID NO: 3所示的氨基酸序列。Sso 7d结合蛋白(SEQ ID NO:2)或者NFκB p50蛋白(SEQ ID NO:3)与T4 DNA连接酶形成的融合蛋白均能够显著提高接头连接效率,尤其适用于平末端/TA末端连接的接头连接,便于后续测序,简化实验流程,避免过度纯化造成的损失。此外,由于无需添加PEG等高粘度高密度添加剂,因此测序文库构建体系的密度及粘度较小,提高了后续建库及测序的准确性和特异性,更适合于自动化平台。According to an embodiment of the present invention, the second fragment has the amino acid sequence shown in SEQ ID NO: 2 or the amino acid sequence shown in SEQ ID NO: 3. The fusion protein formed by Sso 7d binding protein (SEQ ID NO: 2) or NFκB p50 protein (SEQ ID NO: 3) and T4 DNA ligase can significantly improve the efficiency of linker connection, especially suitable for blunt end/TA end connection The connection facilitates subsequent sequencing, simplifies the experimental process, and avoids the loss caused by excessive purification. In addition, because there is no need to add high viscosity and high density additives such as PEG, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
Figure PCTCN2018121545-appb-000002
Figure PCTCN2018121545-appb-000002
根据本发明的实施例,融合蛋白进一步包括纯化片段和/或筛选片段。由此,以便达到筛选和分离目的。According to an embodiment of the present invention, the fusion protein further includes purified fragments and/or screening fragments. In order to achieve the purpose of screening and separation.
根据本发明的实施例,纯化片段选自His和Flag标签的至少一种。由此,能够方便地筛选阳性转化子。According to an embodiment of the present invention, the purified fragment is selected from at least one of His and Flag tags. This makes it easy to screen for positive transformants.
根据本发明的实施例,筛选片段具有卡那霉素抗性和/或氨苄青霉素抗性。由此,能够进一步提高筛选接受外源构建体的重组细胞的效率。According to an embodiment of the present invention, the screening fragment has kanamycin resistance and/or ampicillin resistance. As a result, the efficiency of screening recombinant cells that receive foreign constructs can be further improved.
根据本发明的实施例,融合蛋白具有SEQ ID NO:4或5所示的氨基酸序列。由T4 DNA连接酶和Sso 7d DNA结合蛋白形成的融合蛋白(SEQ ID NO:4)或者由T4 DNA连接酶和NFκB p50蛋白形成的融合蛋白(SEQ ID NO:5)均可以显著提高接头连接效率,尤其适用于平末端/TA末端连接的接头连接,便于后续测序,简化实验流程,避免过度纯化造成的损失。此外,由于无需添加PEG等高粘度高密度添加剂,因此测序文库构建体系的密度及粘度较小,提高了后续建库及测序的准确性和特异性,更适合于自动化平台。According to an embodiment of the present invention, the fusion protein has the amino acid sequence shown in SEQ ID NO: 4 or 5. The fusion protein formed by T4 DNA ligase and Sso 7d DNA binding protein (SEQ ID NO: 4) or the fusion protein formed by T4 DNA ligase and NFκB p50 protein (SEQ ID NO: 5) can significantly improve the joint connection efficiency It is especially suitable for the connection of blunt end/TA end connection, which facilitates subsequent sequencing, simplifies the experimental process, and avoids the loss caused by excessive purification. In addition, because there is no need to add high viscosity and high density additives such as PEG, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
Figure PCTCN2018121545-appb-000003
Figure PCTCN2018121545-appb-000003
Figure PCTCN2018121545-appb-000004
Figure PCTCN2018121545-appb-000004
分离的核酸分子Isolated nucleic acid molecule
在本发明的又一方面,本发明提出了一种分离的核酸分子。根据本发明的实施例,该分离的核酸分子编码前面所述的融合蛋白。利用该分离的核酸分子编码的融合蛋白在构建测序文库中能够显著提高接头连接效率,尤其适用于平末端/TA末端连接的接头连接,无需添加PEG等高粘度高密度的辅助添加剂,便于后续测序,简化实验流程,避免过度纯化造成的损失。此外,由于无需添加PEG等高粘度高密度添加剂,因此测序文库构建体系的密度及粘度较小,提高了后续建库及测序的准确性和特异性,更适合于自动化平台。In yet another aspect of the invention, the invention proposes an isolated nucleic acid molecule. According to an embodiment of the present invention, the isolated nucleic acid molecule encodes the aforementioned fusion protein. The fusion protein encoded by the isolated nucleic acid molecule can significantly improve the efficiency of linker connection in the construction of a sequencing library, and is particularly suitable for the linker connection of blunt ends/TA ends, without the need to add high viscosity and high density auxiliary additives such as PEG, which is convenient for subsequent sequencing , Simplify the experimental process and avoid the loss caused by excessive purification. In addition, because there is no need to add high viscosity and high density additives such as PEG, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
根据本发明的实施例,分离的核酸分子具有SEQ ID NO:6或7所示的核苷酸序列。其中,SEQ ID NO:6所示的核苷酸序列编码SEQ ID NO:4所示的氨基酸序列,SEQ ID NO:7所示的核苷酸序列编码SEQ ID NO:5所示的氨基酸序列。According to an embodiment of the present invention, the isolated nucleic acid molecule has the nucleotide sequence shown in SEQ ID NO: 6 or 7. Among them, the nucleotide sequence shown in SEQ ID NO: 6 encodes the amino acid sequence shown in SEQ ID NO: 4, and the nucleotide sequence shown in SEQ ID NO: 7 encodes the amino acid sequence shown in SEQ ID NO: 5.
Figure PCTCN2018121545-appb-000005
Figure PCTCN2018121545-appb-000005
Figure PCTCN2018121545-appb-000006
Figure PCTCN2018121545-appb-000006
Figure PCTCN2018121545-appb-000007
Figure PCTCN2018121545-appb-000007
Figure PCTCN2018121545-appb-000008
Figure PCTCN2018121545-appb-000008
T4 DNA连接酶基因序列:T4 DNA ligase gene sequence:
Figure PCTCN2018121545-appb-000009
Figure PCTCN2018121545-appb-000009
Figure PCTCN2018121545-appb-000010
Figure PCTCN2018121545-appb-000010
Sso 7d蛋白基因序列:Sso 7d protein gene sequence:
Figure PCTCN2018121545-appb-000011
Figure PCTCN2018121545-appb-000011
NFκB p50蛋白基因序列:NFκB p50 protein gene sequence:
Figure PCTCN2018121545-appb-000012
Figure PCTCN2018121545-appb-000012
构建体Construct
在本发明的又一方面,本发明提出了一种构建体。根据本发明的实施例,所述构建体含有前面所述的分离的核酸分子。本发明的构建体表达的融合蛋白在构建测序文库中能够显著提高接头连接效率,尤其适用于平末端/TA末端连接的接头连接,无需添加PEG等高粘度高密度的辅助添加剂,便于后续测序,简化实验流程,避免过度纯化造成的损失。此外,由于无需添加PEG等高粘度高密度添加剂,因此测序文库构建体系的密度及粘度较小,提高了后续建库及测序的准确性和特异性,更适合于自动化平台。In yet another aspect of the invention, the invention proposes a construct. According to an embodiment of the present invention, the construct contains the aforementioned isolated nucleic acid molecule. The fusion protein expressed by the construct of the present invention can significantly improve the efficiency of linker connection in the construction of a sequencing library, and is particularly suitable for linker connection of blunt ends/TA ends. There is no need to add high viscosity and high density auxiliary additives such as PEG, which is convenient for subsequent sequencing. Simplify the experimental process and avoid the loss caused by excessive purification. In addition, because there is no need to add high viscosity and high density additives such as PEG, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
根据本发明的实施例,所述构建体选自质粒、噬菌体、人工染色体、粘粒以及病毒的至少一种,优选质粒,更优选pET 28a质粒。由此,能够有效提高利用本发明的构建体进行遗传转化的效率。According to an embodiment of the present invention, the construct is selected from at least one of plasmids, bacteriophages, artificial chromosomes, cosmids and viruses, preferably plasmids, more preferably pET28a plasmids. This can effectively improve the efficiency of genetic transformation using the construct of the present invention.
需要说明的是,本发明的构建体中还可以包含其他的元件,从而赋予构建体额外的有益效果。根据本发明的实施例,所述构建体进一步包括启动子,该启动子与SEQ ID NO:6或7所示的核苷酸序列可操纵地连接。It should be noted that the construct of the present invention may also include other elements, so as to impart additional beneficial effects to the construct. According to an embodiment of the present invention, the construct further includes a promoter operably linked to the nucleotide sequence shown in SEQ ID NO: 6 or 7.
在本发明中所使用的术语“启动子”指的是这样一种核酸序列,其能够指导与其可操纵地连接的核酸分子的转录。在本发明中所使用的术语“可操纵地”指的是核酸表达控制序列例如启动子、信号序列、增强子等与目标核酸序列之间的功能连接,其中当合适的分子例如转录活化分子与表达控制序列结合时,表达控制序列影响着对应于目标核酸序列的核酸的转录和/或翻译。由此,可以直接通过构建体在宿主细胞中引入特定的启动子,并且该启动子可以用于启动SEQ ID NO:6或7所示的核苷酸序列的转录和表达,由此,可以提高所获得的重组细胞中T4 DNA连接酶基因、Ssco7d结合蛋白基因/NFκB p50蛋白的表达效率。根据本发明的实施例,在本发明的构建体中,所述启动子的种类不受特别限制,只要能够使构建体所携带的基因能够在宿主动物细胞中高效表达即可。The term "promoter" used in the present invention refers to a nucleic acid sequence capable of directing the transcription of a nucleic acid molecule operably linked thereto. The term "operably" used in the present invention refers to a functional connection between a nucleic acid expression control sequence such as a promoter, a signal sequence, an enhancer, etc. and a target nucleic acid sequence, wherein when a suitable molecule such as a transcription activation molecule is When the expression control sequence is combined, the expression control sequence affects the transcription and/or translation of the nucleic acid corresponding to the target nucleic acid sequence. Thereby, a specific promoter can be directly introduced into the host cell through the construct, and the promoter can be used to initiate the transcription and expression of the nucleotide sequence shown in SEQ ID NO: 6 or 7, which can improve The expression efficiency of T4 DNA ligase gene, Ssco7d binding protein gene/NFκB p50 protein in the obtained recombinant cells. According to the embodiments of the present invention, in the construct of the present invention, the type of the promoter is not particularly limited, as long as the gene carried by the construct can be efficiently expressed in the host animal cell.
重组细胞Recombinant cells
在本发明的又一方面,本发明提出了一种重组细胞。根据本发明的实施例,所述重组细胞含有前面所述的分离的核酸分子或者前面所述的构建体。本发明的重组细胞表达的融合蛋白在构建测序文库中能够显著提高接头连接效率,尤其适用于平末端/TA末端连接的接头连接,无需添加PEG等高粘度高密度的辅助添加剂,便于后续测序,简化实验流程,避免过度纯化造成的损失。此外,由于无需添加PEG等高粘度高密度添加剂,因此测序文库构建体系的密度及粘度较小,提高了后续建库及测序的准确性和特异性,更适合于自动化平台。In yet another aspect of the invention, the invention provides a recombinant cell. According to an embodiment of the present invention, the recombinant cell contains the aforementioned isolated nucleic acid molecule or the aforementioned construct. The fusion protein expressed by the recombinant cell of the present invention can significantly improve the efficiency of linker connection in the construction of a sequencing library, and is particularly suitable for the linker connection of blunt ends/TA ends, without the need to add high viscosity and high density auxiliary additives such as PEG, which is convenient for subsequent sequencing. Simplify the experimental process and avoid the loss caused by excessive purification. In addition, because there is no need to add high viscosity and high density additives such as PEG, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
获得融合蛋白的方法Method for obtaining fusion protein
在本发明的又一方面,本发明提出了一种获得前面所述融合蛋白的方法。根据本发明的实施例,在适于所述融合蛋白表达的条件下,培养前面所述的重组细胞,以便获得所述融合蛋白。利用本发明的方法所获得的融合蛋白在构建测序文库中能够显著提高接头连接效率,尤其适用于平末端/TA末端连接的接头连接,无需添加PEG等高粘度高密度的辅助添加剂,便于后续测序,简化实验流程,避免过度纯化造成的损失。此外,由于无需添加PEG等高粘度高密度添加剂,因此测序文库构建体系的密度及粘度较小,提高了后续建库及测序的准确性和特异性,更适合于自动化平台。In yet another aspect of the present invention, the present invention provides a method for obtaining the aforementioned fusion protein. According to an embodiment of the present invention, the recombinant cells described above are cultured under conditions suitable for the expression of the fusion protein, so as to obtain the fusion protein. The fusion protein obtained by the method of the present invention can significantly improve the efficiency of linker connection in constructing a sequencing library, and is particularly suitable for linker connection of blunt end/TA end connection, without the need to add high viscosity and high density auxiliary additives such as PEG, which is convenient for subsequent sequencing , Simplify the experimental process and avoid the loss caused by excessive purification. In addition, because there is no need to add high viscosity and high density additives such as PEG, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
融合蛋白在构建测序文库中的应用Application of fusion protein in constructing sequencing library
在本发明的又一方面,本发明提出了前面所述融合蛋白在构建测序文库中的应用。本发明的融合蛋白在构建测序文库中能够显著提高接头连接效率,尤其适用于平末端/TA末端连接的接头连接,无需添加PEG等高粘度高密度的辅助添加剂,便于后续测序,简化实验流程,避免过度纯化造成的损失。此外,由于无需添加PEG等高粘度高密度添加剂,因此测序文库构建体系的密度及粘度较小,提高了后续建库及测序的准确性和特异性,更适合于自动化平台。In another aspect of the present invention, the present invention proposes the application of the aforementioned fusion protein in constructing a sequencing library. The fusion protein of the present invention can significantly improve the efficiency of linker connection in the construction of a sequencing library, and is particularly suitable for linker connection of blunt ends/TA ends. There is no need to add high viscosity and high density auxiliary additives such as PEG, which facilitates subsequent sequencing and simplifies the experimental process. Avoid losses caused by excessive purification. In addition, because there is no need to add high viscosity and high density additives such as PEG, the density and viscosity of the sequencing library construction system are small, which improves the accuracy and specificity of subsequent library construction and sequencing, and is more suitable for automated platforms.
下面将结合实施例对本发明的方案进行解释。本领域技术人员将会理解,下面的实施例仅用于说明本发明,而不应视为限定本发明的范围。实施例中未注明具体技术或条件的,按照本领域内的文献所描述的技术或条件或者按照产品说明书进行。所用试剂或仪器未注明生产厂商者,均为可以通过市购获得的常规产品。The solution of the present invention will be explained below in conjunction with examples. Those skilled in the art will understand that the following embodiments are only for illustrating the present invention and should not be considered as limiting the scope of the present invention. If no specific technology or conditions are indicated in the examples, the technology or conditions described in the literature in the art or the product specification shall be followed. The reagents or instruments used do not indicate the manufacturer, are all conventional products that are commercially available.
实施例1 重组型T4 DNA连接酶融合蛋白表达载体的构建Example 1 Construction of recombinant T4 DNA ligase fusion protein expression vector
T4 DNA连接酶融合蛋白G1是由T4 DNA连接酶和来源于超嗜热古细菌Sulfolobus solfataricus的Sso 7d结合蛋白组成。通过In-fusion的方法,采用NcoI和HindIII酶切pET 28a载体后,将T4 DNA连接酶编码的基因序列与Sso 7d的基因序列共同插入到载体pET 28a的克隆区。将重组型T4 DNA连接酶融合蛋白氨基酸序列的C端6个His作为纯化标签,其中筛选标签为卡那霉素,将构建好的载体命名为pET 28a-G1。T4 DNA ligase fusion protein G1 is composed of T4 DNA ligase and Sso 7d binding protein derived from the superthermophilic archaea Sulfolobus solfataricus. Through the In-fusion method, the pET 28a vector was digested with NcoI and HindIII, and the gene sequence encoded by T4 DNA ligase and the gene sequence of Sso 7d were inserted into the cloning area of the vector pET 28a. The 6 Hiss at the C-terminus of the amino acid sequence of the recombinant T4 DNA ligase fusion protein were used as purification tags. The screening tag was kanamycin, and the constructed vector was named pET28a-G1.
T4 DNA连接酶融合蛋白G2是由T4 DNA连接酶和来源于真核生物Homo sapiens的真核转录因子NFκB p50组成。通过In-fusion的方法,采用NdeI和SacI酶切pET 28a载体后,将T4 DNA连接酶编码的基因序列与p50的基因序列共同插入到载体pET 28a的克隆区。将 重组型T4 DNA连接酶融合蛋白氨基酸序列的N端6个His作为纯化标签,其中筛选标签为卡那霉素,将构建好的载体命名为pET 28a-G2。T4 DNA ligase fusion protein G2 is composed of T4 DNA ligase and eukaryotic transcription factor NFκB p50 derived from the eukaryote Homo sapiens. Through the method of In-fusion, after cutting the pET28a vector with NdeI and SacI, the gene sequence encoded by T4 DNA ligase and the gene sequence of p50 were inserted into the cloning area of the vector pET28a. The 6 Hiss at the N-terminus of the amino acid sequence of the recombinant T4 DNA ligase fusion protein were used as purification tags. The selection tag was kanamycin, and the constructed vector was named pET28a-G2.
实施例2 重组型T4 DNA连接酶融合蛋白菌株的培养和诱导Example 2 Cultivation and induction of recombinant T4 DNA ligase fusion protein strain
LB液体培养基:胰蛋白胨10g/L,酵母提取物5g/L,NaCl 10g/L。LB liquid medium: tryptone 10g/L, yeast extract 5g/L, NaCl 10g/L.
将重组表达载体pET 28a-G1或pET 28a-G2转化到大肠杆菌表达菌株Ecoli.BL21(DE3)中,将菌液均匀涂抹在50μg/mL卡那霉素的平板上,37℃过夜培养。挑取单菌落,于5ml LB培养基(含有50μg/mL卡那霉素)培养,37℃,200rpm,过夜培养。将上述所得菌液,按1:100接种于50mL LB(含有50μg/mL卡那霉素)中培养,37℃,200rpm,4h。将扩大培养的菌液,按1:100接种于2L LB(含有50μg/mL卡那霉素)中培养,37℃,200rpm,待OD600值达0.6-0.8左右,加入IPTG至终浓度为0.4mM,25℃,200rpm,培养过夜,约16-18h。将长好的菌体于8000rpm离心收集,菌体冻存于-20℃待用。The recombinant expression vector pET28a-G1 or pET28a-G2 was transformed into E. coli expression strain Ecoli.BL21 (DE3), and the bacterial solution was evenly spread on a 50μg/mL kanamycin plate, and cultured at 37°C overnight. Single colonies were picked and cultured in 5ml LB medium (containing 50μg/mL kanamycin), 37°C, 200rpm, and cultured overnight. The bacterial solution obtained above was inoculated at 1:100 in 50 mL LB (containing 50 μg/mL kanamycin) for cultivation, 37° C., 200 rpm, 4 h. Inoculate the expanded bacterial solution at 1:100 in 2L LB (containing 50μg/mL kanamycin) and cultivate at 37°C, 200rpm. When the OD600 value reaches about 0.6-0.8, add IPTG to a final concentration of 0.4mM , 25 ℃, 200rpm, culture overnight, about 16-18h. The grown bacteria were collected by centrifugation at 8000 rpm, and the bacteria were frozen and stored at -20°C until use.
实施例3 重组型T4 DNA连接酶融合蛋白提取及纯化Example 3 Recombinant T4 DNA ligase fusion protein extraction and purification
纯化Buffer配制:Purified Buffer preparation:
1、Ni柱亲和层析1. Ni column affinity chromatography
Buffer A平衡缓冲液:50mM Tris-HCl+500mM NaCl+10mM咪唑,pH 7.0。Buffer A equilibration buffer: 50mM Tris-HCl + 500mM NaCl + 10mM imidazole, pH 7.0.
Buffer B洗脱缓冲液:50mM Tris-HCl+500mM NaCl+500mM咪唑,pH 7.0。Buffer B elution buffer: 50mM Tris-HCl + 500mM NaCl + 500mM imidazole, pH 7.0.
2、离子交换层析2. Ion exchange chromatography
Buffer C平衡缓冲液:50mM Tris-HCl+100mM NaCl,pH 7.0。Buffer Balance buffer: 50mM Tris-HCl + 100mM NaCl, pH 7.0.
Buffer D洗脱缓冲液:50mM Tris-HCl+1M NaCl,pH 7.0。Buffer D elution buffer: 50mM Tris-HCl+1M NaCl, pH 7.0.
3、蛋白样品稀释液3. Protein sample diluent
Buffer E稀释液:50mM Tris-HCl,pH 7.0。Buffer E dilution: 50mM Tris-HCl, pH 7.0.
4、蛋白样品2×储存液4. Protein sample 2× storage solution
Buffer F 2×储存液20mM Tris-HCl+100mM KCl+0.2mM EDTA,pH 7.0。Buffer F 2×Storage solution 20mM Tris-HCl+100mM KCl+0.2mM EDTA, pH 7.0.
按1g菌体加15mL亲和A液的比例重悬菌体,超声破碎细胞,直至菌体溶液至澄清。将破碎后的菌体12000rpm,4℃离心30min,取上清,0.22μm滤膜过滤后于4℃储存。Resuspend the cells at a rate of 1 g of bacteria plus 15 mL of affinity A solution, and ultrasonically break the cells until the bacterial solution is clear. The crushed cells were centrifuged at 12000 rpm and 4°C for 30 min, and the supernatant was taken. The 0.22 μm filter membrane was filtered and stored at 4°C.
将Ni柱亲合层析柱水洗5CV,Buffer B清洗5CV,Buffer A进行平衡10CV后,进行上样。上样完成后,平衡15CV,使用Buffer B进行线性洗脱(0-100%Buffer B,10CV),当UV值大于100mAU以上收集蛋白。Wash the Ni column affinity chromatography column with 5CV, Buffer B to wash 5CV, and Buffer A to equilibrate to 10CV before loading. After loading the sample, equilibrate 15CV, use Buffer B for linear elution (0-100% Buffer B, 10CV), and collect protein when the UV value is greater than 100mAU.
将Ni柱收集到的蛋白用Buffer E稀释5倍,将Q阴离子交换柱水洗5CV,Buffer C平衡5CV,蛋白样品上样,当UV值上升开始收样。将SP阳离子交换柱使用Buffer C平衡5CV, 将上步得到的蛋白样品上样,上样完成后,用Buffer C平衡15CV后,用洗脱缓冲液Buffer D线性洗脱(0-50%Buffer D,20CV),收集蛋白。收集蛋白进行过夜透析,透析液为2×的储存Buffer。蛋白终浓度为2mg/mL,甘油浓度为50%。如图1和图2所示,SDS-PAGE结果显示融合蛋白的纯化效果很好,纯度合格。Dilute the protein collected by the Ni column with Buffer E 5 times, wash the Q anion exchange column with 5CV, Buffer C equilibrate 5CV, and load the protein sample. When the UV value rises, the sample will be collected. Use SP cation exchange column to equilibrate 5CV with Buffer C. Load the protein sample obtained in the previous step. After loading, equilibrate 15CV with Buffer C and linearly elute with Buffer D (0-50% Buffer D , 20CV), collect protein. The protein was collected for dialysis overnight, and the dialysate was 2× storage buffer. The final protein concentration is 2 mg/mL and the glycerol concentration is 50%. As shown in Figure 1 and Figure 2, SDS-PAGE results show that the fusion protein purification effect is very good, the purity is qualified.
实施例4 重组型T4 DNA连接酶融合蛋白连接效率测试Example 4 Recombinant T4 DNA ligase fusion protein ligation efficiency test
本实验主要检测融合蛋白的平末端连接效率,以570bp平末端片段为反应底物。底物由PCR得来,并且设计上游引物5’端带有磷酸基团,下游引物为普通引物。PCR结束后,使用Cycle Pure Kit(D6493,Omega)回收PCR产物。连接效率测试的反应体系为10μL,包含底物120ng,融合蛋白G1或G2 2μM,1×BGI T4 DNA Ligase反应缓冲液(50mM Tris-HCl,10mM MgCl 2,5mM DTT,1mM ATP,pH 7.5 25℃)。将上述配制完成的反应混合液混匀,置于PCR仪上进行反应,16℃孵育20min,65℃热失活15min。反应完成后,向其中加入1μL Proteinase K和1μL 1%SDS,充分混匀后进行琼脂糖凝胶检测。实验结果如图3和4所示,相比于T4 DNA连接酶,融合蛋白G1和G2的底物消耗更多,产物量更高,连接效率更好。 In this experiment, the blunt-end ligation efficiency of the fusion protein was mainly detected, and the 570bp blunt-end fragment was used as the reaction substrate. The substrate was obtained by PCR, and the upstream primer was designed with a phosphate group at the 5'end, and the downstream primer was a common primer. After the PCR, the PCR product was recovered using Cycle Pure Kit (D6493, Omega). The reaction system for the ligation efficiency test was 10 μL, containing substrate 120 ng, fusion protein G1 or G2 2 μM, 1×BGI T4 DNA Ligase reaction buffer (50 mM Tris-HCl, 10 mM MgCl 2 , 5 mM DTT, 1 mM ATP, pH 7.5 25 ℃ ). The above prepared reaction mixture was mixed well, placed on a PCR instrument for reaction, incubated at 16°C for 20 min, and heat-inactivated at 65°C for 15 min. After the reaction was completed, 1 μL of Proteinase K and 1 μL of 1% SDS were added thereto, and after thorough mixing, the agarose gel test was performed. The experimental results are shown in Figures 3 and 4. Compared with T4 DNA ligase, the fusion proteins G1 and G2 consume more substrate, higher product volume, and better ligation efficiency.
实施例5 融合蛋白用于NGS文库构建连接接头,提高文库转化率Example 5 The fusion protein is used to construct a linker in the NGS library to improve the conversion rate of the library
文库转化率,即DNA片段双端接头连接效率。只有双端连接成功的DNA,才能够被有效扩增最终成为测序模板。本实施例采用单一条带PCR产物(220bp)在一管反应中顺次进行末端修复、加A、加接头等反应,反应产物经磁珠纯化后用2100生物分析仪进行检测,分析反应底物、单端接头产物、双端接头产物峰的浓度,进而计算双端接头连接效率。The conversion rate of the library, that is, the efficiency of joining the DNA fragments at both ends of the linker. Only DNA that has been successfully ligated at both ends can be effectively amplified and eventually become a sequencing template. In this embodiment, a single-band PCR product (220bp) is used in a tube reaction to sequentially perform end repair, A addition, and joint addition reactions. The reaction product is purified by magnetic beads and then detected by a 2100 bioanalyzer to analyze the reaction substrate. 1. The peak concentration of the single-ended connector product and the double-ended connector product, and then calculate the double-ended connector connection efficiency.
1、NGS末端修复反应,体系如下:1. NGS terminal repair reaction, the system is as follows:
组分Component 体积volume
220bp DNA片段220bp DNA fragment ~50ng~50ng
10X PNK reaction Buffer10X PNK reaction Buffer 5μL5μL
5:1 dATP:dNTP5:1 dATP: dNTP 0.6μL0.6μL
T4 PNK(10U/μL)T4PNK(10U/μL) 0.6μL0.6μL
Klenow fragment(5U/μL)Klenowfragment(5U/μL) 0.1μL0.1μL
rTaq(5U/μL)rTaq(5U/μL) 0.2μL0.2μL
T4 DNA polymerase(3U/μL)T4 DNA polymerase (3U/μL) 2μL2μL
Nuclease Free WaterNuclease Free Water up to 50μLup to 50μL
TotalTotal 50μL50μL
于冰上配制上述反应体系,Vortex混匀离心后,于PCR仪上执行以下程序:Prepare the above reaction system on ice. After mixing and centrifuging Vortex, perform the following procedure on the PCR machine:
温度temperature 时间time
热盖Hot cover onon
37℃37℃ 30min30min
65℃65℃ 15min15min
4℃4℃ HoldHold
2、NGS连接接头反应,体系如下:2. NGS connection joint reaction, the system is as follows:
组分 Component 体积volume
Step 1反应产物Step 1 reaction product 50μL50μL
文库接头(25μM)Library adapter (25μM) 2μL2μL
10X T4 PNK Buffer10X T4 PNK Buffer 3μL3μL
ATP(100mM)ATP (100mM) 0.8μL0.8μL
T4 DNA Ligase(600U/μL)T4DNALigase(600U/μL) 1.6μL1.6μL
Nuclease Free WaterNuclease Free Water up to 80μLup to 80μL
TotalTotal 80μL80μL
于冰上配制上述反应体系,23℃反应1h,每管加入2μl Proteinase K(20mg/mL)终止反应,用AMPure XP beads纯化后取1μL 2100检测,结果见附图5,其中(A)为电泳图;(B)为连接效率图;(C)为2100检测峰图,其中0:底物峰,1:单端连接峰,2:双端连接峰。Prepare the above reaction system on ice, and react at 23℃ for 1h. Add 2μl of Proteinase K (20mg/mL) to each tube to stop the reaction. After purification with AMPure XP XP, take 1μL of 2100 for testing. The results are shown in Figure 5, where (A) is electrophoresis Figure; (B) is the connection efficiency diagram; (C) is the 2100 detection peak diagram, where 0: substrate peak, 1: single-ended connection peak, 2: double-ended connection peak.
各实验组设置及连接效率测定:The setting of each experimental group and the determination of connection efficiency:
Figure PCTCN2018121545-appb-000013
Figure PCTCN2018121545-appb-000013
Figure PCTCN2018121545-appb-000014
Figure PCTCN2018121545-appb-000014
实施例6融合蛋白用于NGS文库构建Example 6 Fusion protein for NGS library construction
50ng经超声打断双选后的E.coli gDNA用于NGS文库构建,测试融合蛋白G1用于NGS文库构建的产量。50ng of E.coli gDNA after double selection by ultrasound interruption was used for NGS library construction, and the yield of fusion protein G1 for NGS library construction was tested.
1、NGS末端修复反应,体系如下:1. NGS terminal repair reaction, the system is as follows:
组分Component 体积volume
E.coli gDNA打断双选产物E.coli gDNA interrupts double-selection products ~50ng~50ng
10X PNK reaction Buffer10X PNK reaction Buffer 5μL5μL
5:1 dATP:dNTP5:1 dATP: dNTP 0.6μL0.6μL
T4 PNK(10U/μL)T4PNK(10U/μL) 0.6μL0.6μL
Klenow fragment(5U/μL)Klenowfragment(5U/μL) 0.1μL0.1μL
rTaq(5U/μL)rTaq(5U/μL) 0.2μL0.2μL
T4 DNA polymerase(3U/μL)T4 DNA polymerase (3U/μL) 2μL2μL
Nuclease Free WaterNuclease Free Water up to 50μLup to 50μL
TotalTotal 50μL50μL
于冰上配制上述反应体系,Vortex混匀离心后,于PCR仪上执行以下程序:Prepare the above reaction system on ice. After mixing and centrifuging Vortex, perform the following procedure on the PCR machine:
温度temperature 时间time
热盖Hot cover onon
37℃37℃ 30min30min
65℃65℃ 15min15min
4℃4℃ HoldHold
2、NGS连接接头反应体系:2. NGS connection joint reaction system:
Figure PCTCN2018121545-appb-000015
Figure PCTCN2018121545-appb-000015
Figure PCTCN2018121545-appb-000016
Figure PCTCN2018121545-appb-000016
于冰上配制上述反应体系,23℃反应1h,每管加入2ul Proteinase K(20mg/mL)终止反应,用AMPure XP beads纯化后,洗脱溶于32μL TE Buffer中。Prepare the above reaction system on ice, react at 23℃ for 1h, add 2ul of Proteinase K (20mg/mL) to each tube to stop the reaction, purify with AMPure XPbeads, elute and dissolve in 32μL TE Buffer.
各实验组设置如下:The experimental groups are set as follows:
Figure PCTCN2018121545-appb-000017
Figure PCTCN2018121545-appb-000017
3、NGS文库扩增,体系如下:3. NGS library amplification, the system is as follows:
组分Component 体积volume
Step 2纯化产物Step 2 Purified product 15μL15μL
2X KAPA HiFi PCR Mix2X KAPA HiFi PCR Mix 25μL25μL
Primer F(20uM)Primer F(20uM) 1.5μL1.5μL
Primer R(20uM)Primer R(20uM) 1.5μL1.5μL
Nuclease Free WaterNuclease Free Water up to 50μLup to 50μL
TotalTotal 50μL50μL
于冰上配制以上反应体系,Vortex混匀离心后,于PCR仪上执行以下程序:Prepare the above reaction system on ice. After mixing and centrifuging Vortex, perform the following procedure on the PCR machine:
Figure PCTCN2018121545-appb-000018
Figure PCTCN2018121545-appb-000018
Figure PCTCN2018121545-appb-000019
Figure PCTCN2018121545-appb-000019
反应结束后用1倍体积Ampure XP beads纯化,纯化后产物用Qubit dsDNA HS Assay定量。After the reaction, the product was purified with 1 volume of Ampure XP beads, and the purified product was quantified by Qubit dsDNA HS Assay.
各实验组设置及文库产量对比:Comparison of experimental group settings and library output:
Figure PCTCN2018121545-appb-000020
Figure PCTCN2018121545-appb-000020
在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不必须针对的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任一个或多个实施例或示例中以合适的方式结合。此外,在不相互矛盾的情况下,本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。In the description of this specification, the description referring to the terms "one embodiment", "some embodiments", "examples", "specific examples", or "some examples" means specific features described in conjunction with the embodiment or examples , Structure, material or characteristic is included in at least one embodiment or example of the present invention. In this specification, the schematic representation of the above terms does not necessarily refer to the same embodiment or example. Moreover, the specific features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. In addition, without contradicting each other, those skilled in the art may combine and combine different embodiments or examples and features of different embodiments or examples described in this specification.
尽管上面已经示出和描述了本发明的实施例,可以理解的是,上述实施例是示例性的,不能理解为对本发明的限制,本领域的普通技术人员在本发明的范围内可以对上述实施例进行变化、修改、替换和变型。Although the embodiments of the present invention have been shown and described above, it should be understood that the above-mentioned embodiments are exemplary and cannot be construed as limitations to the present invention, and those of ordinary skill in the art can understand the above within the scope of the present invention. The embodiments are changed, modified, replaced, and modified.

Claims (18)

  1. 一种融合蛋白,其特征在于,包括:A fusion protein, characterized in that it includes:
    第一片段,所述第一片段具有T4DNA连接酶活性;以及A first fragment having T4DNA ligase activity; and
    第二片段,所述第一片段的一端与所述第二片段的一端相连,所述第二片段具有DNA结合结构域。A second fragment, one end of the first fragment is connected to one end of the second fragment, and the second fragment has a DNA binding domain.
  2. 根据权利要求1所述的融合蛋白,其特征在于,所述第二片段具有Sso 7d DNA结合蛋白活性或者NFκB p50蛋白活性。The fusion protein according to claim 1, wherein the second fragment has Sso 7d DNA binding protein activity or NFκB p50 protein activity.
  3. 根据权利要求1所述的融合蛋白,其特征在于,所述第一片段具有SEQ ID NO:1所示的氨基酸序列。The fusion protein according to claim 1, wherein the first fragment has the amino acid sequence shown in SEQ ID NO: 1.
  4. 根据权利要求1所述的融合蛋白,其特征在于,所述第二片段具有SEQ ID NO:2所示的氨基酸序列或者SEQ ID NO:3所示的氨基酸序列。The fusion protein according to claim 1, wherein the second fragment has the amino acid sequence shown in SEQ ID NO: 2 or the amino acid sequence shown in SEQ ID NO: 3.
  5. 根据权利要求1所述的融合蛋白,其特征在于,进一步包括纯化片段和/或筛选片段。The fusion protein according to claim 1, further comprising purified fragments and/or screening fragments.
  6. 根据权利要求5所述的融合蛋白,其特征在于,所述纯化片段选自His和Flag标签的至少一种。The fusion protein according to claim 5, wherein the purified fragment is selected from at least one of His and Flag tags.
  7. 根据权利要求5所述的融合蛋白,其特征在于,所述筛选片段具有卡那霉素抗性和/或氨苄青霉素抗性。The fusion protein according to claim 5, wherein the screening fragment has kanamycin resistance and/or ampicillin resistance.
  8. 根据权利要求1所述的融合蛋白,其特征在于,具有SEQ ID NO:4或5所示的氨基酸序列。The fusion protein according to claim 1, which has the amino acid sequence shown in SEQ ID NO: 4 or 5.
  9. 一种分离的核酸分子,其特征在于,所述分离的核酸分子编码权利要求1~8任一项所述的融合蛋白。An isolated nucleic acid molecule, characterized in that the isolated nucleic acid molecule encodes the fusion protein according to any one of claims 1-8.
  10. 根据权利要求9所述的分离的核酸分子,其特征在于,所述分离的核酸分子具有SEQ ID NO:6或7所示的核苷酸序列。The isolated nucleic acid molecule according to claim 9, wherein the isolated nucleic acid molecule has a nucleotide sequence shown in SEQ ID NO: 6 or 7.
  11. 一种构建体,其特征在于,含有权利要求9或10所述的分离的核酸分子。A construct characterized by containing the isolated nucleic acid molecule of claim 9 or 10.
  12. 根据权利要求11所述的构建体,其特征在于,所述构建体选自质粒、噬菌体、人工染色体、粘粒以及病毒的至少一种。The construct according to claim 11, wherein the construct is selected from at least one of plasmids, bacteriophages, artificial chromosomes, cosmids, and viruses.
  13. 根据权利要求11所述的构建体,其特征在于,所述构建体选自质粒。The construct according to claim 11, wherein the construct is selected from plasmids.
  14. 根据权利要求11所述的构建体,其特征在于,所述构建体选自pET 28a质粒。The construct according to claim 11, wherein the construct is selected from pET28a plasmid.
  15. 根据权利要求11所述的构建体,其特征在于,所述构建体进一步包括启动子,所述启动子与SEQ ID NO:6或7所示的核苷酸序列可操纵地连接。The construct according to claim 11, wherein the construct further comprises a promoter operably linked to the nucleotide sequence shown in SEQ ID NO: 6 or 7.
  16. 一种重组细胞,其特征在于,含有权利要求9或10所述的分离的核酸分子或者权 利要求11~15任一项所述的构建体。A recombinant cell characterized by comprising the isolated nucleic acid molecule according to claim 9 or 10 or the construct according to any one of claims 11 to 15.
  17. 一种获得权利要求1~8任一项所述融合蛋白的方法,其特征在于,在适于所述融合蛋白表达的条件下,培养权利要求16所述的重组细胞,以便获得所述融合蛋白。A method for obtaining the fusion protein according to any one of claims 1 to 8, characterized in that, under conditions suitable for expression of the fusion protein, the recombinant cell according to claim 16 is cultured to obtain the fusion protein .
  18. 权利要求1~8任一项所述融合蛋白在构建测序文库中的应用。Use of the fusion protein according to any one of claims 1 to 8 in the construction of a sequencing library.
PCT/CN2018/121545 2018-12-17 2018-12-17 Fusion protein and application thereof WO2020124319A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2018/121545 WO2020124319A1 (en) 2018-12-17 2018-12-17 Fusion protein and application thereof
CN201880099747.3A CN113166270A (en) 2018-12-17 2018-12-17 Fusion protein and application thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/121545 WO2020124319A1 (en) 2018-12-17 2018-12-17 Fusion protein and application thereof

Publications (1)

Publication Number Publication Date
WO2020124319A1 true WO2020124319A1 (en) 2020-06-25

Family

ID=71100173

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/121545 WO2020124319A1 (en) 2018-12-17 2018-12-17 Fusion protein and application thereof

Country Status (2)

Country Link
CN (1) CN113166270A (en)
WO (1) WO2020124319A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113774032A (en) * 2021-11-12 2021-12-10 翌圣生物科技(上海)股份有限公司 Recombinant T4 ligase mutant, coding DNA and NGS library construction method
CN116004681A (en) * 2022-08-15 2023-04-25 南京诺唯赞生物科技股份有限公司 Method and kit for improving carrier connection efficiency in TOPO cloning

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115896047B (en) * 2022-12-12 2023-06-16 南京诺唯赞生物科技股份有限公司 Recombinant T4DNA ligase mutant, fusion protein and application thereof

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102597006A (en) * 2009-09-16 2012-07-18 梅西大学 Fusion polypeptides and uses thereof
CN108129571A (en) * 2017-12-25 2018-06-08 上海捷瑞生物工程有限公司 Taq DNA ligase fusion proteins
EP3360958A1 (en) * 2013-03-15 2018-08-15 Theranos, Inc. Thermostable blunt-end ligase and methods of use

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102597006A (en) * 2009-09-16 2012-07-18 梅西大学 Fusion polypeptides and uses thereof
EP3360958A1 (en) * 2013-03-15 2018-08-15 Theranos, Inc. Thermostable blunt-end ligase and methods of use
CN108129571A (en) * 2017-12-25 2018-06-08 上海捷瑞生物工程有限公司 Taq DNA ligase fusion proteins

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
TANABE, M.: "From Structure-Function Analyses to Protein Engineering for Practical Applications of DNA Ligase", ARCHAEA, 31 December 2015 (2015-12-31), XP055721523 *
WANG, Y.: "A novel strategy to engineer DNA polymerases for enhanced pro- cessivity and improved performance in vitro.", NUCLEIC ACIDS RESEARCH, 31 December 2004 (2004-12-31), XP002303419 *
WILSON, R. H.: "Engineered DNA ligases with improved activities in vitro", PROTEIN ENGINEERING, DESIGN & SELECTION, 10 June 2013 (2013-06-10), XP055297919 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113774032A (en) * 2021-11-12 2021-12-10 翌圣生物科技(上海)股份有限公司 Recombinant T4 ligase mutant, coding DNA and NGS library construction method
CN116004681A (en) * 2022-08-15 2023-04-25 南京诺唯赞生物科技股份有限公司 Method and kit for improving carrier connection efficiency in TOPO cloning
CN116004681B (en) * 2022-08-15 2023-08-29 南京诺唯赞生物科技股份有限公司 Method and kit for improving carrier connection efficiency in TOPO cloning

Also Published As

Publication number Publication date
CN113166270A (en) 2021-07-23

Similar Documents

Publication Publication Date Title
WO2020124319A1 (en) Fusion protein and application thereof
US10883091B2 (en) DNA polymerase variant and application thereof
WO2019120193A1 (en) Split single-base gene editing systems and application thereof
Korepanova et al. Expression of membrane proteins from Mycobacterium tuberculosis in Escherichia coli as fusions with maltose binding protein
WO2021184766A1 (en) Gene expression component, cloning vector constructed thereby and application
CN113195521A (en) Mtu Delta I-CM intein variants and uses thereof
Frank et al. A strategy for the expression of recombinant proteins traditionally hard to purify
CN113136374A (en) Preparation and application of recombinant mutant Tn5 transposase
CN113774039B (en) Recombinant DNA polymerase and application thereof
CN107629129B (en) Method for producing and purifying polypeptides
EP1981978B1 (en) Affinity polypeptide for purification of recombinant proteins
US20050214899A1 (en) Removal of N-terminal methionine from proteins by engineered methionine aminopeptidase
CN112481286A (en) Amino acid sequence for improving heterologous expression efficiency of recombinant milk protein
CN112391367A (en) Preparation method of Cas9 protein for gene editing of human primary cells
Teubl et al. Tethered MNase Structure Probing as Versatile Technique for Analyzing RNPs Using Tagging Cassettes for Homologous Recombination in Saccharomyces cerevisiae
EP3950708A1 (en) Method for expressing and purifying protein by using csq-tag
WO2023115517A1 (en) Dna polymerase mutant and use thereof
CN115838437A (en) Human NT-proBNP fusion protein and preparation method and application thereof
KR102185312B1 (en) A composition for solubilization of target proteins and use thereof
US11214791B2 (en) Engineered FHA domains
WO2004106527B1 (en) Archaeon expression system
CN115974981A (en) DNA binding domain, fusion type phi29DNA polymerase, and preparation method and application thereof
Listwan et al. Modification of recombinatorial cloning for small affinity tag fusion protein construct generation
CN114230644A (en) GP32 protein mutant, recombinant vector, and construction method and application thereof
KR20070035499A (en) Process for producing polypeptide

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18943958

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18943958

Country of ref document: EP

Kind code of ref document: A1