WO2021190600A1 - Automated gene assembly system and method - Google Patents

Automated gene assembly system and method Download PDF

Info

Publication number
WO2021190600A1
WO2021190600A1 PCT/CN2021/083012 CN2021083012W WO2021190600A1 WO 2021190600 A1 WO2021190600 A1 WO 2021190600A1 CN 2021083012 W CN2021083012 W CN 2021083012W WO 2021190600 A1 WO2021190600 A1 WO 2021190600A1
Authority
WO
WIPO (PCT)
Prior art keywords
sequence
gene
reactions
fragment
fragments
Prior art date
Application number
PCT/CN2021/083012
Other languages
French (fr)
Chinese (zh)
Inventor
马小舒
李一凡
翟春华
邓栋臣
王嫚
马艳秋
Original Assignee
南京金斯瑞生物科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 南京金斯瑞生物科技有限公司 filed Critical 南京金斯瑞生物科技有限公司
Publication of WO2021190600A1 publication Critical patent/WO2021190600A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression

Definitions

  • the invention relates to the field of gene synthesis, in particular to the synthesis of gene mutation libraries or gene combinatorial libraries and their automated execution.
  • Synthetic biology is based on the idea of engineering design, constructing standardized components and modules, transforming existing natural systems or synthesizing new artificial life systems from scratch. People use gene recombination technology and gene positioning editing to realize special programming of life systems and perform special functions; modularize metabolic pathways, optimize the combination and collocation of components, and realize the synthesis of chemicals in the best mode. Synthetic biology has made significant progress in the energy, chemical, and pharmaceutical industries.
  • High-throughput, high-efficiency, and low-cost DNA synthesis is still a problem facing the nucleic acid synthesis industry.
  • Large-scale DNA synthesis projects not only require a high-throughput platform, but also a highly automated production line from primer synthesis to product inspection. Therefore, we are committed to improving the degree of automation of the production line to solve the current problems.
  • the invention can shorten the synthesis period, greatly reduce the cost (especially the labor cost), and the correct rate and the success rate of the one-time product can reach more than 95%.
  • the present invention relates to a method for constructing a mutant gene library, wherein each mutant gene in the mutant gene library contains a mutation relative to a reference sequence, and the method includes:
  • the full-length sequence is divided into one or more constant sequence fragments and one or more variable sequence fragments, where the constant sequence fragments contain the same segments as the reference sequence or its complementary sequence. Sequences, variable sequence fragments contain mutations compared to the corresponding segments of the reference sequence or its complementary sequence;
  • mutant gene library was constructed, wherein the method was carried out in batches in a multi-compartment container.
  • the present invention relates to a method for constructing a gene combinatorial library, wherein various gene combinations in the gene combinatorial library have sequence-unique segments and multi-sequence segments relative to each other, and the method includes:
  • the full-length sequence is divided into one or more constant sequence fragments and one or more variable sequence fragments, where the constant sequence fragments correspond to the segments with unique sequences in the gene combination library,
  • the variable sequence segment corresponds to a segment with multiple selection sequences in the gene combinatorial library;
  • a gene combinatorial library is constructed, wherein the method is carried out in batches in a multi-compartment container.
  • the present invention relates to a method for automatically preparing a batch reaction system, which includes:
  • the reagent transfer relationship table is generated as follows:
  • the method is carried out in a multi-compartment container.
  • the method can be used to construct mutant gene libraries and gene combinatorial libraries.
  • the implementation of the present invention can utilize various automation platforms, including, for example, an automated synthesis platform, an automated pipetting platform, and an automated reaction platform.
  • the implementation of the present invention can utilize various seamless splicing technologies, including, for example, the Gibson method, the Golden-Gate technology, and the Genbuilder technology.
  • the implementation of the present invention can also use conventional molecular cloning techniques, such as site-directed mutagenesis, mutagenesis PCR, overlap PCR and restriction endonucleases.
  • the method of the present invention can perform at least 10 reactions, at least 25 reactions, at least 50 reactions, at least 100 reactions, at least 250 reactions, at least 500 reactions, at least 750 reactions, at least 1000 reactions, at least 1250 reactions at a time. Reactions, at least 1500 reactions, at least 1750 reactions, at least 2000 reactions, at least 2250 reactions, at least 2500 reactions, or at least 2750 reactions.
  • Figure 1A shows a schematic diagram of the structure of the pUC57 plasmid described in Example 1.
  • Figure 1B shows a schematic diagram of the structure of the pUC57-BsmBI plasmid described in Example 1.
  • Fig. 2A shows a schematic diagram of the target sequence and the A, B, and C fragments of the mutant gene bank described in Example 2.
  • FIG. 2B shows a schematic diagram of the assembly of the mutant gene library described in Example 2.
  • Figure 3 shows a schematic diagram of the distribution of PCR reaction plates and reagent plates for constructing the mutant gene library described in Example 2.
  • FIG. 4 shows a flowchart of the reagent transfer relationship table of the PCR system for constructing the mutant gene library described in Example 2.
  • Figure 5 shows the reagent transfer relationship table of the PCR system for constructing the mutant gene library described in Example 2.
  • Figure 6 shows the electrophoresis photographs of the PCR products for constructing the mutant gene library described in Example 2.
  • FIG. 7 shows a schematic diagram of the distribution of reaction plates and reagent plates of the splicing reaction for constructing the mutant gene library described in Example 2.
  • FIG. 8 shows a flowchart of the reagent transfer relationship table of the splicing reaction system for constructing the mutant gene library described in Example 2.
  • FIG. 9 shows the reagent transfer relationship table of the splicing reaction system for constructing the mutant gene library described in Example 2.
  • Figure 10 shows the electrophoresis photographs of the colony PCR test for constructing the mutant gene library described in Example 2.
  • " ⁇ " represents the strip of the second reinspection.
  • FIG. 11 shows a schematic diagram of the structure of the U692AEH070 intermediate carrier described in Example 3.
  • Figure 12 shows a schematic diagram of fragment fusion for constructing a gene combinatorial library described in Example 3.
  • Figure 13 shows a schematic diagram of the assembly of the gene combinatorial library constructed in Example 3.
  • FIG. 14 shows a flowchart of the reagent transfer relationship table of PCR generated by fragments of the gene combinatorial library constructed in Example 3.
  • FIG. 15 shows the reagent distribution table and reagent transfer relationship table of the fragment generation PCR for constructing the gene combinatorial library described in Example 3.
  • al means fragment 4
  • bl means fragment 6
  • bll means fragment 7
  • C means fragment 8
  • F means forward primer
  • R means reverse primer
  • 113al-F means amplified fragment 4
  • a forward primer required, 113al-R indicates a reverse primer required to amplify fragment 4
  • "113” is the set number
  • “item” indicates the template required to amplify the target fragment, and the following number is the number .
  • Figure 16 shows the electrophoresis photographs of PCR products generated from the fragments of the gene combinatorial library constructed in Example 3. " ⁇ " represents the strips that are supplemented twice.
  • FIG. 17 shows a flow chart of the reagent transfer relationship table of fragment fusion PCR for constructing a gene combinatorial library described in Example 3.
  • Figure 18 shows the reagent distribution table and reagent transfer relationship table of the fragment fusion PCR for constructing the gene combinatorial library described in Example 3.
  • A represents the fragment amplified by the fusion of fragment a1 and fragment all
  • B is the fragment amplified by the fusion of fragments bl and b11
  • C refers to fragment 8
  • F refers to forward primer
  • R refers to reverse primer;
  • 113al-F Indicates a forward primer needed for fusion amplified fragment 113A
  • 113all-R indicates a reverse primer needed for fusion amplified fragment 113A
  • 113al and 113all are templates for fusion amplified fragment 113A
  • "113" is the set number .
  • FIG. 19 shows the electrophoresis photograph of the PCR product of the fragment fusion for constructing the gene combinatorial library described in Example 3.
  • FIG. 20 shows a flow chart of the reagent transfer relationship table of the splicing reaction system for constructing the gene combinatorial library described in Example 3.
  • FIG. 20 shows a flow chart of the reagent transfer relationship table of the splicing reaction system for constructing the gene combinatorial library described in Example 3.
  • Figure 21 shows the reagent distribution table and reagent transfer relationship table of the splicing reaction system for constructing the gene combinatorial library described in Example 3.
  • A represents the fragment after fusion and amplification of fragment a1 and fragment all
  • B represents the fragment after fusion and amplification of fragment bl and b11
  • C represents fragment 8
  • A, B and C are the three fragments required to assemble a full length
  • 113A, 113B, and 113C are used to splice three fragments of the full-length sequence of No. 113
  • "113" is the set number.
  • FIG. 22 shows the electrophoresis photographs of colony PCR test for constructing the gene combinatorial library described in Example 3.
  • FIG. " ⁇ " represents the strip of the second reinspection.
  • Figure 23 shows a schematic diagram of the numbering of the wells of a 96-well plate.
  • the present invention provides a method for constructing a mutant gene library, wherein each mutant gene in the mutant gene library contains a mutation relative to a reference sequence, and the method includes:
  • the full-length sequence is divided into one or more constant sequence fragments and one or more variable sequence fragments, where the constant sequence fragments contain the same segments as the reference sequence or its complementary sequence. Sequences, variable sequence fragments contain mutations compared to the corresponding segments of the reference sequence or its complementary sequence;
  • mutant gene library was constructed, wherein the method was carried out in batches in a multi-compartment container.
  • the present invention provides a method for constructing a gene combinatorial library, wherein various gene combinations in the gene combinatorial library have sequence-unique segments and multi-sequence segments relative to each other, and the method includes:
  • the full-length sequence is divided into one or more constant sequence fragments and one or more variable sequence fragments, where the constant sequence fragments correspond to the segments with unique sequences in the gene combination library,
  • the variable sequence segment corresponds to a segment with multiple selection sequences in the gene combinatorial library;
  • a gene combinatorial library is constructed, wherein the method is carried out in batches in a multi-compartment container.
  • the present invention provides a method for automatically preparing a batch reaction system, which includes:
  • the reagent transfer relationship table is generated as follows:
  • the method is carried out in a multi-compartment container.
  • the present invention provides a method for constructing a mutant gene library, wherein each mutant gene in the mutant gene library contains a mutation relative to a reference sequence, and the method includes:
  • the full-length sequence is divided into one or more constant sequence fragments and one or more variable sequence fragments, where the constant sequence fragments contain the same segments as the reference sequence or its complementary sequence. Sequences, variable sequence fragments contain mutations compared to the corresponding segments of the reference sequence or its complementary sequence;
  • the present invention provides a method for constructing a gene combinatorial library, wherein various gene combinations in the gene combinatorial library have sequence-unique segments and multi-sequence segments relative to each other, and the method includes:
  • the full-length sequence is divided into one or more constant sequence fragments and one or more variable sequence fragments, where the constant sequence fragments correspond to the segments with unique sequences in the gene combination library,
  • the variable sequence segment corresponds to a segment with multiple selection sequences in the gene combinatorial library;
  • the multi-compartment container is a multi-well plate, such as a 96-well plate or a 384-well plate.
  • one or more multiwell plates are used at a time, such as 1-27, 4-25, or 9-16 96-well plates or 384-well plates, such as 1, 2, 3, 4, 5, 6, 8 , 9, 10, 12, 15, 16, 18, 20, 24, 25 or 27 96-well plates.
  • reagents common to multiple reactions and/or reagents common to all reactions are combined in one or more wells on the reagent plate.
  • the common reagent may occupy only one hole on the reagent plate.
  • the common reagent can also occupy multiple holes on the reagent plate, for example, 2-12, 2-8, or 2-4 holes, for example, 2, 4, 6, 8 or 12 holes.
  • the number of wells occupied by public reagents on the reagent plate depends on many factors, such as the number of pipetting at one time for automatic pipetting (e.g.
  • the reagent can be molecular biology, especially reagents used in nucleic acid chemical synthesis, such as primers (oligonucleotides), templates (such as plasmids, genomes), buffers, restriction endonucleases, polymerases, ligases, dNTP mixture and so on.
  • Common reagents can be multiple reactions in one run, or even reagents common to all reactions in one run, such as common primers, common templates, buffers, restriction endonucleases, polymerases, ligases, dNTP mixtures, etc.
  • reagents common to multiple reactions and/or reagents common to all reactions are added manually.
  • manual addition is performed using a multichannel pipette or high-throughput dispenser.
  • At least 10 reactions, at least 25 reactions, at least 50 reactions, at least 96 reactions, at least 100 reactions, at least 192 reactions, at least 250 reactions, at least 288 reactions are performed at a time.
  • the upper limit of the number of reactions performed at one time depends on the capacity of automated workstations (e.g., automated pipetting workstations, automated synthesis workstations, automated reaction workstations).
  • step (4) uses seamless splicing technology, including but not limited to Golden-Gate method, Gibson method or Genbuilder method.
  • seamless splicing technology including but not limited to Golden-Gate method, Gibson method or Genbuilder method.
  • Those skilled in the art can determine the upper limit of the number of fragments that can be spliced in one reaction according to the seamless splicing technology specifically adopted.
  • 2-9, 2-7, or 3-5 fragments are spliced at a time, for example, 2, 3, 4, 5, 6, 7, 8, or 9 fragments.
  • step (4) utilizes conventional molecular biology techniques, including but not limited to restriction endonucleases.
  • restriction endonucleases Those skilled in the art can determine the upper limit of the number of fragments that can be spliced in one reaction according to the specific restriction endonuclease used. Generally, 2-9, 2-7, or 3-5 fragments are spliced at a time, for example, 2, 3, 4, 5, 6, 7, 8, or 9 fragments.
  • the mutation may be the substitution, addition or deletion of one or more nucleotide residues. In one embodiment, the plurality of nucleotide residues may be continuous or discontinuous. In one embodiment of the method of constructing a mutant gene library, the mutation may be the substitution, addition or deletion of one or more encoded amino acid residues. In one embodiment, the plurality of encoded amino acid residues may be continuous or discontinuous. In one embodiment, mutations are relative to each member of the mutation gene library. In one embodiment, the mutation is a member of the mutation gene library in terms of the reference sequence. The reference sequence may or may not be included in the mutant gene library.
  • the full-length target sequence can be a single gene sequence composed of a sequence unique segment and a sequence multiple-selection segment, or it can be a sequence unique gene and sequence multiple-selection sequence.
  • the multi-gene sequence composed of the genes of the above can even be a multi-gene sequence composed of a unique segment of the sequence and a multi-selected segment of the sequence.
  • the polygene may be polycistronic, or a cascade or pathway (such as a metabolic pathway, a synthetic pathway, a signal transduction pathway), or even a complete genome (such as a genome of a lower organism (such as a virus)).
  • the target sequence into segments (for example, a constant sequence segment and a variable sequence segment).
  • the variable sequence segment is a segment that contains a mutation.
  • the variable sequence segment is a sequence multiple-selected segment.
  • Those skilled in the art can divide each segment by comparing all target sequences. Methods and tools (such as software) for sequence alignment are well known in the art. The length of each fragment depends on many factors, such as the full length of the target sequence, the optimal synthesis length of the synthesis system, the position of the mutation on the target sequence, and the distance between adjacent mutations.
  • each fragment may contain sequences that are not on the target sequence, such as restriction sites (such as type II restriction endonucleases, especially Type IIs restriction endonuclease), homology arms, tags, etc.
  • restriction sites such as type II restriction endonucleases, especially Type IIs restriction endonuclease
  • the full length of the target sequence can be, for example, about 1-100,000, 10-10,000, 100-8,000, 150-8,000, 200-5,000, 250-1,000 or 400-600, for example about 100, 150, 250, 500, 750, 1,000 , 1,250, 1,500, 1,750, 2,000, 2,500, 3,000, 4,000, 5,000, 8,000, 10,000, 12,500, 15,000, 17,500, 20,000, 25,000, 50,000, 75,000 or 100,000 (including this number and ⁇ 10%).
  • each fragment (such as a variable sequence fragment or a constant sequence fragment) can be, for example, about 1-5,500, 150-5,000, 200-1,000, 250-500, or 400-600, such as about 50, 100, 150, 200, 250, 500, 750, 1,000 or 5,000 (including this number and ⁇ 10%).
  • the full length of the target sequence can be, for example, about 1-10,000, 100-8,000, 150-8,000, 150-5,000, 200-5,000, 250-1,000, or 400-600, such as about 100, 150, 250, 500 , 750, 1,000, 1,250, 1,500, 1,750, 2,000, 2,500, 5,000, 8,000 or 10,000 (including this number and ⁇ 10%).
  • the length of each fragment (such as a variable sequence fragment or a constant sequence fragment) may be, for example, about 1-5,500, 150-5,000, 200-1,000, 250-750, or 400-600, such as about 50, 100, 150, 200, 250, 500, 750, 1,000 or 5,000 (including this number and ⁇ 10%).
  • the full length of the target sequence can be, for example, about 1-100,000, 10-10,000, 100-10,000, 200-8,000, or 500-8,000, such as about 100, 250, 500, 750, 1,000, 1,250, 1,500, 1,750 , 2,000, 2,500, 3,000, 4,000, 5,000, 8,000, 10,000, 12,500, 15,000, 17,500, 20,000, 25,000, 50,000, 75,000 or 100,000 (including this number and ⁇ 10%).
  • each fragment (such as a variable sequence fragment or a constant sequence fragment) may be, for example, about 1-5,500, 100-5,000, 150-5,000, 250-1,000, or 250-500, such as about 50, 100, 150, 200, 250, 500, 1,000 or 5,000 (including this number and ⁇ 10%).
  • the reagent transfer relationship table can be performed using any tool (such as software), especially statistical software, such as Excel.
  • the common reagents can be combined through the deduplication module of Excel to save the number of wells occupied by the common reagents, thereby saving the number of plates and increasing the number of reactions performed in one run.
  • Various reagents (especially primers and templates) can be represented in various ways, such as name, code, sequence, number, synthetic order number.
  • the information of the reagent transfer relationship table in the previous step can be used to generate the reagent transfer relationship table in the subsequent step.
  • the data comparison, search, positioning, etc. involved in this process can be performed using the vlookup module.
  • the reagent transfer relationship table can be output in various ways, such as CSV and other formats that can be recognized by the machine.
  • This example describes the construction of exemplary plasmids that can be used for seamless splicing (e.g., recognition sites for type IIS restriction enzymes (e.g., BsmB I) with a pair of head-to-head orientation) and blue-white spot screening functions (e.g., LacZ gene).
  • the plasmid pUC57 (GenBank: Y14837.1) was transformed to obtain the plasmid pUC57-BsmBI.
  • the plasmid pUC57 contains the LacZ gene and two BsmB I restriction sites in a tail-to-tail orientation (that is, the recognition site is outside and the cleavage site is inside) on the same side. Based on the BsmB I restriction cloning procedure, the LacZ gene and the two BsmB I recognition sites are all retained. Therefore, pUC57 is not suitable as a seamless splicing vector.
  • the transformed plasmid pUC57-BsmBI contains the LacZ gene and a pair of BsmB I restriction sites in the head-to-head orientation (that is, the recognition site is inside, and the cleavage site is outside) of the LacZ gene. Based on the BsmB I restriction cloning procedure, both the LacZ gene and a pair of BsmB I recognition sites were excised. Therefore, pUC57-BsmBI is suitable as a seamless splicing vector.
  • pUC57-BsmBI-f agaggcctgcatgcaagcttggcgtaatcatggtcatagctgttcgtctctcctgtgtgaaattgttatccgc (SEQ ID NO: 1);
  • pUC57-BsmBI-r ggttatcaagtgagaaatcaccatgagtgacgactgaatcggtttcttagacgtcaggtggc (SEQ ID NO: 2);
  • pUC57-LacZ-f gattcagtcgtcactcatggtgatttctcacttgataaccttcggtgatgacggtgaaaac (SEQ ID NO: 3);
  • pUC57-LacZ-r gctatgaccatgattacgccaagctt (SEQ ID NO: 4).
  • the target sequence is 819 bp in length, in which the 720 bp core segment contains mutations relative to the starting sequence, and the 72 bp upstream segment and 27 bp downstream segment contain no mutations relative to the starting sequence.
  • the 720bp core segment can be divided into 24 continuous core sub-segments in units of 30 bp, and each target sequence contains mutations in a corresponding core sub-segment relative to the starting sequence, and in other core sub-segments. Does not contain mutations (as shown in Figure 2A).
  • the middle B fragment is 80 bp long, contains a 30 bp moving window in the middle, corresponds to one of the above core sub-segments, and the flanking sequences on both sides total 50bp, providing a seamlessly spliced recognition site and protective base (for example, the same or complementary to the corresponding part of the starting sequence);
  • the upstream A segment complements the upstream sequence that the corresponding B segment lacks relative to the starting sequence;
  • the downstream C The fragments complement the corresponding downstream sequences that the B fragment lacks relative to the starting sequence; similar to the B fragment, the A fragment and the C fragment also contain flanking sequences, providing a seamlessly spliced recognition site and protective bases (for example, with the starting sequence
  • the corresponding parts are the same or complementary; for example, overlap with the corresponding B fragment or plasmid).
  • this step generates a total of 72 fragments from A1 to A24, B1 to B24, and C1 to C24. These fragments can be generated by PCR.
  • the forward primer used for PCR amplification of the fragments A1 to A24 is a common primer
  • the reverse primer used for PCR amplification of the C1 to C24 fragments is a common primer.
  • PCR templates there are 2 kinds of PCR templates and 98 kinds of PCR primers, resulting in 72 kinds of PCR products.
  • a 96-well plate is used, at least 2 reagent plates (labeled Source1 and Source2) and 1 reaction plate (labeled Destination1) are required.
  • the reagent plate indicates the position of the primer and template, and the reaction plate indicates the position of the product.
  • the position of the product on the reaction plate is determined first, and then the position of the primer and template on the reagent plate is determined according to the position of the product on the reaction plate (as shown in Figure 3).
  • the forward primers used for PCR amplification of fragments A1 to A24 are common primers, marked with GG-AF; the reverse primers used for PCR amplification of fragments A1 to A24 are marked with GG-AR1 to GG-AR24; used for PCR
  • the forward and reverse primers used to amplify fragments B1 to B24 are labeled GG-BF1 to GG-BF24 and GG-BR1 to GG-BR24, respectively; the forward primers used for PCR amplification of fragments C1 to C24 are labeled GG-BF1 to GG-BF24, respectively.
  • CF1 to GG-CF24 are labeled; the reverse primers used for PCR amplification of C1 to C24 fragments are common primers and are labeled GG-CR.
  • the starting sequence is used as a common template for PCR to generate fragments of A1 to A24 and C1 to C24, marked with A/C-T; the mutant sequence containing all possible mutations is used as a common template for PCR to generate fragments of B1 to B24, marked with B-T.
  • Common reagents can occupy one or more wells.
  • GG-BF1 to GG-BF24 position is transferred to the corresponding Destination1 plate GG-B1 to GG-B24 position
  • the reverse primer of fragment B is transferred from GG-BR1 to GG-BR24 position of Source1 plate to the corresponding Destination1 plate GG-B1 to GG-B24 position
  • transfer the forward primer of fragment C from GG-CF1 to GG-CF24 position of Source1 plate to GG-C1 to GG-C24 position of corresponding Destination1 plate
  • transfer the common positive of fragment A Transfer the primers from the GG-AF position of the Source2 plate to the GG-A1 to GG-A24 positions of the Destination1 plate, and transfer the common reverse primer of fragment C from the GG-CR position of the Source2 plate to the GG-C1 to GG of the Destination1 plate -C24 location.
  • a reagent transfer relationship table is generated.
  • the generated reagent transfer relationship table is shown in Figure 5.
  • the primers are synthesized in the corresponding wells of a 96-well deep-well plate (Shanghai Best Biotechnology Co., Ltd., PCR-96M2-HS-C) according to the reagent position table. After synthesis, the template is added to the corresponding wells of the 96-well deep-well plate according to the reagent position table.
  • the Tecan workstation automatically transfers primers and templates according to the transfer relationship table, from the reagent plate to the reaction plate.
  • Dispensing can be manually dispensed with an eight-channel pipette or automatically dispensed with a high-throughput microdispenser (Preddator, Model S4).
  • Fragments A, B, and C were generated by PCR as described below.
  • the expected size of the PCR product is shown in Table 1.
  • the gel electrophoresis photograph of the recovered PCR product is shown in Figure 6.
  • this step inserts the set of A, B, and C fragments into the plasmid. These fragments can be inserted into the plasmid pUC57-BsmBI by seamless splicing (see Example 1).
  • Fragment B the length is 80bp
  • the dosage is about 40ng.
  • Fragment A and Fragment C have different lengths, the dosage is about 80ng if the length is less than 550bp, and the dosage is about 120ng if the length is greater than 550bp.
  • the plasmid pUC57-BsmBI has a length of 2683bp and a dosage of about 140ng.
  • the reagent position table of the splicing reaction can follow the reaction position table of the previous PCR (marked by Source11-Destination1), and the plasmid (marked by GG-V) is added.
  • segment A is transferred from GG-A1 to GG-A24 of Source11-Destination1 to GoldenGate-1 to GoldenGate-24 of the corresponding Destination11 board
  • segment B is transferred from GG-B1 to GG- of Source11-Destination1.
  • a reagent transfer relationship table is generated.
  • the generated reagent transfer relationship table is shown in FIG. 9.
  • the Tecan workstation automatically transfers inserts and plasmids according to the transfer relationship table, from the reagent plate to the reaction plate.
  • Dispensing can be manually dispensed with an eight-channel pipette or automatically dispensed with a high-throughput micro-dispenser. When the reaction solution is viscous, manual aliquoting can be carried out to reduce errors.
  • the splicing product was transformed into E. coli Top10 competent cells (Nanjing GenScript Biotechnology Co., Ltd.), and blue-white plate screening was performed (results not shown).
  • Each splicing reaction uses Qpix (Molec ⁇ Lar Devices, Qpix 420) to pick 4 single colonies to perform PCR inspection on the inserts (as shown in Figure 10); the colonies that are correct by PCR are submitted for sequencing inspection (data not shown).
  • the test results are shown in Table 2. In short, the one-time success rate of splicing reaches 100%, and the one-time correct rate reaches more than 95%.
  • the target sequence is composed of 9 parts, which are represented by fragments 1 to 9 in the 5'to 3'direction.
  • the length of fragment 4 is 500 bp, and there are 4 kinds of sequences to choose;
  • the length of fragment 5 is 301-1500 bp, there are 16 kinds of sequences to choose;
  • the length of fragment 8 is 2050 bp, there are 4 kinds of sequences to choose;
  • the length of fragment 6 is 711 bp, the sequence Unique;
  • Fragment 7 has a length of 400bp with unique sequence; Fragments 1, 2, 3 and 9 have unique sequences.
  • fragments 1, 2, 3 and 9 at both ends are unique, so these four fragments can be integrated into the vector pUC57-KanR (SEQ ID NO: 5) constructed based on pUC57 to generate the intermediate vector U692AEH070-2 ( Figure 11) Shown).
  • fragment 4 and fragment 5 are fused into fragment A
  • fragment 6 and fragment 7 are fused into fragment B
  • fragment 8 is used as fragment C alone (as shown in FIG. 12).
  • Both ends of fragment B and fragments A and C each have 60 bp homology arms for seamless splicing.
  • insert the corresponding set of A, B, and C fragments into the intermediate vector (as shown in Figure 13).
  • the four fragments, fragment 1, fragment 2, fragment 3, and fragment 9 were integrated into pUC57-KanR, and the Not I and Asc I restriction sites were introduced for subsequent cloning to generate intermediate vector U692AEH070 -2.
  • the intermediate vector was linearized by Not I and Asc I restriction digestion, purified, and ready for use (concentration of about 20 ng/ ⁇ L). Both ends of the intermediate vector have 40 bp homology arms with fragments A and C for seamless splicing.
  • this step generates fragment 4, fragment 5, fragment 6, fragment 7, and fragment 8. These fragments can be generated by PCR.
  • the primers used for PCR amplification of fragment 4, fragment 5, fragment 6, fragment 7, and fragment 8 were generated by conventional methods (Nanjing GenScript Biotechnology Co., Ltd.), and homology arms were introduced on both sides.
  • the reagent transfer relationship table is generated as described below.
  • the number of types of segment 4 is 4, the number of types of segment 5 is 16, the number of types of segment 6 is 1, the number of types of segment 7 is 1, and the number of types of segment 8 is 4. There are 26 kinds in total.
  • the total number of types of fragments corresponds to the total number of types of templates.
  • step 3 According to the number of times required by each template in step 3, the volume required by each template is obtained.
  • step 13 Find the position of the corresponding template in step 2 according to the template name in step 11, establish the corresponding relationship between the position of the PCR product, the position and volume of the template, and obtain the template transfer relationship table.
  • the Tecan workstation automatically transfers primers and templates according to the reagent transfer relationship table, from the reagent plate to the reaction plate.
  • Dispensing can be manually dispensed with an eight-channel pipette or automatically dispensed with a high-throughput micro-dispenser.
  • the gel electrophoresis photograph of the recovered PCR product is shown in FIG. 16.
  • fragment 4 and fragment 5 are fused into fragment A
  • fragment 6 and fragment 7 are fused into fragment B.
  • the fusion of the fragments can be performed by PCR.
  • the reagent transfer relationship table is generated as described below.
  • step 8 Find the position of the corresponding primer in step 3 according to the primer name in step 7, establish the corresponding relationship between the position of the fusion product, the position and volume of the primer, and obtain the transfer relationship table of the primer.
  • step 9 Find the position of the corresponding template in step 1 according to the template name in step 7, establish the corresponding relationship between the position of the fusion product, the position and volume of the template, and obtain the transfer relationship table of the template.
  • the Tecan workstation automatically transfers primers and templates according to the reagent transfer relationship table, from the reagent plate to the reaction plate.
  • Dispensing can be manually dispensed with an eight-channel pipette or automatically dispensed with a high-throughput micro-dispenser.
  • the expected length of fragment A is 801-2000 bp, and the expected length of fragment B is 1111 bp.
  • the gel electrophoresis photograph of the recovered PCR product is shown in FIG. 19.
  • the required fragment A, fragment B, fragment C, etc. are transferred from the reagent plate to the reaction plate.
  • a reagent transfer relationship table is generated as shown in FIG. 20.
  • the reagent distribution table and the reagent transfer relationship table are shown in FIG. 21.
  • the Tecan workstation automatically transfers fragment A, fragment B, and fragment C according to the transfer relationship table, and transfers from the reagent plate to the reaction plate.
  • Dispensing can be manually dispensed with an eight-channel pipette or automatically dispensed with a high-throughput micro-dispenser.
  • the spliced product was transformed into E. coli Top10 competent cells (Nanjing GenScript Biotechnology Co., Ltd.), and screened with antibiotics (such as kanamycin) on a plate (results not shown). Pick 6 single colonies for each splicing reaction to perform PCR inspection on the inserts (as shown in Figure 22); the colonies that were correctly tested by PCR were submitted for sequencing inspection (data not shown). The test results are shown in Table 3. In short, the one-time success rate of splicing reaches 100%, and the one-time correct rate reaches more than 95%.
  • serial number PCR positive rate Sequencing accuracy rate serial number PCR positive rate Sequencing accuracy rate 1 2/6 + 26 4/6 + 2 4/6 + 27 3/6 + 3 3/6 + 28 3/6 + 4 3/6 + 29 3/6 + 5 3/6 + 30 5/6 + 6 6/6 + 31 5/6 + 7 3/6 + 32 6/6 + 8 5/6 + 33 1/6 + 9 3/6 + 34 3/6 + 10 6/6 + 35 3/6 + 11 4/6 + 36 6/6 + 12 3/6 + 37 1/6 + 13 1/6 + 38 6/6 + 14 3/6 + 39 2/6 + 15 2/6 + 40 5/6 + 16 2/6 + 41 5/6 + 17 1/6 + 42 6/6 + 18 2/6 + 43 4/6 + 19 4/6 + 44 6/6 + 20 5/6 + 45 2/6 + twenty one 6/6 + 46 3/6 + twenty two 6/6 + 47 6/6 + twenty three 2/6 + 48 3/6 + twenty four 6/6 + 49 6/6 + 25 3/6 + 50 3/6 +

Landscapes

  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Biomedical Technology (AREA)
  • Wood Science & Technology (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Microbiology (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Provided is a mutant gene pool constructing method, comprising: for each type of mutant gene, segmenting a full-length sequence into one or more constant sequence fragments (containing a sequence identical to a corresponding section of a reference sequence or a complementary sequence thereof) and one or more variable sequence fragments (containing a mutant compared with the corresponding section of the reference sequence or the complementary sequence thereof); respectively generating constant sequence fragments of each mutant gene and respectively generating variable sequence fragments of each mutant gene; for each mutant gene, preparing a reaction system containing a set of constant sequence fragments and variable sequence fragments; and for each mutant gene, generating a full-length mutant gene insertion vector, and constructing a mutant gene pool. The operations are carried out in batches in a multi-compartment container.

Description

一种基因自动化组装系统和方法Gene automatic assembly system and method 技术领域Technical field
本发明涉及基因合成领域,特别是基因突变库或基因组合库的合成及其自动化执行。The invention relates to the field of gene synthesis, in particular to the synthesis of gene mutation libraries or gene combinatorial libraries and their automated execution.
背景技术Background technique
合成生物学是以工程化设计思路,构建标准化的元器件和模块,改造已存在的天然系统或者从头合成全新的人工生命体系。人们利用基因重组技术和基因定位编辑来实现对生命系统的特殊编程并执行特殊的功能;模块化处理代谢途径,优化元器件间的组合搭配,以最优的模式来实现化学品的合成。目前合成生物学已在能源、化工、医药等行业取得了重大进展。Synthetic biology is based on the idea of engineering design, constructing standardized components and modules, transforming existing natural systems or synthesizing new artificial life systems from scratch. People use gene recombination technology and gene positioning editing to realize special programming of life systems and perform special functions; modularize metabolic pathways, optimize the combination and collocation of components, and realize the synthesis of chemicals in the best mode. Synthetic biology has made significant progress in the energy, chemical, and pharmaceutical industries.
为了优化所需的功能,人们会同时构建数百甚至数千条代谢合成通路,然后筛选性能最好的一条用于下游实验,这意味着,合成生物学进入了高通量筛选时代。然而,由于生物系统的复杂性和它们之间许多未知的相互作用,必须进行许多轮的筛选和测试,这是迈向设计、构建和测试自动化循环的重要一步。In order to optimize the required functions, people will construct hundreds or even thousands of metabolic synthesis pathways at the same time, and then screen the best-performing one for downstream experiments, which means that synthetic biology has entered the era of high-throughput screening. However, due to the complexity of biological systems and the many unknown interactions between them, many rounds of screening and testing must be carried out. This is an important step towards the design, construction, and test automation cycle.
据发明人所知,目前市场上没有供应商提供这种类型的服务,下游用户要么以低通量的方式构建基因文库,要么花费大量资金来维持他们的生产线。长期以来,它一直受到高昂的实验成本、时间成本及人工成本的阻碍。As far as the inventor knows, there are currently no suppliers on the market that provide this type of service. Downstream users either construct gene libraries in a low-throughput manner or spend a lot of money to maintain their production lines. For a long time, it has been hindered by high experimental cost, time cost and labor cost.
高通量DNA的组装方法已成为合成生物学应用的必备工具,例如代谢途径优化、微生物工程和合成基因组工程等,已经越来越多地应用于各种生物技术中。在过去的数十年里,大量的DNA组装方法被开发出来,例如南京金斯瑞生物科技公司的Genbuilder组装,Golden-Gate组装,Gibson拼接等。High-throughput DNA assembly methods have become an indispensable tool for synthetic biology applications, such as metabolic pathway optimization, microbial engineering and synthetic genome engineering, etc., which have been increasingly used in various biotechnologies. In the past few decades, a large number of DNA assembly methods have been developed, such as Genbuilder assembly, Golden-Gate assembly, Gibson assembly, etc. of Nanjing GenScript Biotechnology Company.
发明概述Summary of the invention
高通量、高效率、低成本的DNA合成仍然是核酸合成产业面临的难题。大规模的DNA合成项目不仅需要一个高通量的平台,更需要一条从引物合成到产物检验的自动化程度高的生产线来实现。因此,我们致力于提高生产线的自动化程度来解决现在面临的问题。本发明可以缩短合成周期,大大降低成本(尤其是人工成本),而且一次产品正确率与成功率均可达到95%以上。High-throughput, high-efficiency, and low-cost DNA synthesis is still a problem facing the nucleic acid synthesis industry. Large-scale DNA synthesis projects not only require a high-throughput platform, but also a highly automated production line from primer synthesis to product inspection. Therefore, we are committed to improving the degree of automation of the production line to solve the current problems. The invention can shorten the synthesis period, greatly reduce the cost (especially the labor cost), and the correct rate and the success rate of the one-time product can reach more than 95%.
在一个方面,本发明涉及一种构建突变基因库的方法,其中突变基因库中的每一种突变基因相对于参照序列包含突变,所述方法包括:In one aspect, the present invention relates to a method for constructing a mutant gene library, wherein each mutant gene in the mutant gene library contains a mutation relative to a reference sequence, and the method includes:
(1)针对每一种突变基因将全长序列分割成一个或多个恒定序列片段和一个或多个可变序列片段,其中恒定序列片段包含与参照序列或其互补序列的对应区段相同的序列,可变序列片段包含与参照序列或其互补序列的对应区段相比的突变;(1) For each mutant gene, the full-length sequence is divided into one or more constant sequence fragments and one or more variable sequence fragments, where the constant sequence fragments contain the same segments as the reference sequence or its complementary sequence. Sequences, variable sequence fragments contain mutations compared to the corresponding segments of the reference sequence or its complementary sequence;
(2)分别生成每一种突变基因的各个恒定序列片段和分别生成每一种突变基因的各个可变序列片段;(2) Generate each constant sequence fragment of each mutant gene and each variable sequence fragment of each mutant gene separately;
(3)针对每一种突变基因配制包含成套的恒定序列片段和可变序列片段的反应体系;和(3) Formulating a reaction system containing a complete set of constant sequence fragments and variable sequence fragments for each mutant gene; and
(4)针对每一种突变基因生成全长突变基因,任选地插入载体,(4) Generate a full-length mutant gene for each mutant gene, and optionally insert it into a vector,
由此构建突变基因库,其中所述方法是在多隔室容器中批量进行的。Thus, a mutant gene library was constructed, wherein the method was carried out in batches in a multi-compartment container.
在另一个方面,本发明涉及一种构建基因组合库的方法,其中基因组合库中的各种基因组合相对于彼此具有序列唯一的区段和序列多选的区段,所述方法包括:In another aspect, the present invention relates to a method for constructing a gene combinatorial library, wherein various gene combinations in the gene combinatorial library have sequence-unique segments and multi-sequence segments relative to each other, and the method includes:
(1)针对每一种基因组合将全长序列分割成一个或多个恒定序列片段和一个或多个可变序列片段,其中恒定序列片段对应于在基因组合库中具有唯一序列的区段,可变序列片段对应于在基因组合库中具有多选序列的区段;(1) For each gene combination, the full-length sequence is divided into one or more constant sequence fragments and one or more variable sequence fragments, where the constant sequence fragments correspond to the segments with unique sequences in the gene combination library, The variable sequence segment corresponds to a segment with multiple selection sequences in the gene combinatorial library;
(2)生成各个恒定序列片段和分别生成每一种基因组合的各个可变序列片段;(2) Generate each constant sequence fragment and each variable sequence fragment for each gene combination;
(3)针对每一种基因组合配制包含成套的恒定序列片段和可变序列片段的反应体系;和(3) Formulating a reaction system containing a complete set of constant sequence fragments and variable sequence fragments for each gene combination; and
(4)针对每一种基因组合生成全长基因组合,任选地插入载体,(4) Generate a full-length gene combination for each gene combination, optionally insert a vector,
由此构建基因组合库,其中所述方法是在多隔室容器中批量进行的。Thus, a gene combinatorial library is constructed, wherein the method is carried out in batches in a multi-compartment container.
在又一个方面,本发明涉及一种自动配制批量反应体系的方法,其包括:In yet another aspect, the present invention relates to a method for automatically preparing a batch reaction system, which includes:
生成试剂转移关系表;和Generate a reagent transfer relationship table; and
将试剂转移关系表上传自动移液装置,由自动移液装置按照试剂转移关系表自动将试剂自装有试剂的容器转移至用于进行反应的容器,Upload the reagent transfer relationship table to the automatic pipetting device, and the automatic pipetting device will automatically transfer the reagent from the container containing the reagent to the container for reaction according to the reagent transfer relationship table.
其中试剂转移关系表是如下生成的:The reagent transfer relationship table is generated as follows:
(1)列出要进行的批量反应,列出每一个反应需要的试剂及其体积;(1) List the batch reactions to be carried out, list the reagents and their volumes required for each reaction;
(2)确定每一个反应就反应容器而言的位置;(2) Determine the position of each reaction in terms of the reaction vessel;
(3)确定每一个反应需要的每一种试剂就试剂容器而言的位置;(3) Determine the position of each reagent required for each reaction in terms of the reagent container;
(4)确定每一个反应需要的每一种试剂的转移起点和终点,(4) Determine the transfer starting point and end point of each reagent required for each reaction,
其中所述方法是在多隔室容器中进行的。The method is carried out in a multi-compartment container.
所述方法可以用于构建突变基因库和基因组合库。The method can be used to construct mutant gene libraries and gene combinatorial libraries.
本发明的实施可以利用各种自动化平台,包括例如自动化合成平台、自动化移液平台和自动化反应平台。The implementation of the present invention can utilize various automation platforms, including, for example, an automated synthesis platform, an automated pipetting platform, and an automated reaction platform.
本发明的实施可以利用各种无缝拼接技术,包括例如Gibson方法、Golden-Gate技术和Genbuilder技术。本发明的实施也可以利用常规的分子克隆技术,例如定点突变、诱变PCR、交叠PCR和限制性核酸内切酶。The implementation of the present invention can utilize various seamless splicing technologies, including, for example, the Gibson method, the Golden-Gate technology, and the Genbuilder technology. The implementation of the present invention can also use conventional molecular cloning techniques, such as site-directed mutagenesis, mutagenesis PCR, overlap PCR and restriction endonucleases.
本发明的方法可以一次进行至少10个反应、至少25个反应、至少50个反应、至少100个反应、至少250个反应、至少500个反应、至少750个反应、至少1000个反应、至少1250个反应、至少1500个反应、至少1750个反应、至少2000个反应、至少2250个反应、至少2500个反应、或至少2750个反应。The method of the present invention can perform at least 10 reactions, at least 25 reactions, at least 50 reactions, at least 100 reactions, at least 250 reactions, at least 500 reactions, at least 750 reactions, at least 1000 reactions, at least 1250 reactions at a time. Reactions, at least 1500 reactions, at least 1750 reactions, at least 2000 reactions, at least 2250 reactions, at least 2500 reactions, or at least 2750 reactions.
附图说明Description of the drawings
图1A显示实施例1所述pUC57质粒的结构示意图。Figure 1A shows a schematic diagram of the structure of the pUC57 plasmid described in Example 1.
图1B显示实施例1所述pUC57-BsmBI质粒的结构示意图。Figure 1B shows a schematic diagram of the structure of the pUC57-BsmBI plasmid described in Example 1.
图2A显示实施例2所述突变基因库目标序列和A、B、C片段的示意图。Fig. 2A shows a schematic diagram of the target sequence and the A, B, and C fragments of the mutant gene bank described in Example 2.
图2B显示实施例2所述构建突变基因库的组装示意图。FIG. 2B shows a schematic diagram of the assembly of the mutant gene library described in Example 2. FIG.
图3显示实施例2所述构建突变基因库的PCR的反应板和试剂板的分布示意图。Figure 3 shows a schematic diagram of the distribution of PCR reaction plates and reagent plates for constructing the mutant gene library described in Example 2.
图4显示实施例2所述构建突变基因库的PCR体系的试剂转移关系表的流程图。FIG. 4 shows a flowchart of the reagent transfer relationship table of the PCR system for constructing the mutant gene library described in Example 2. FIG.
图5显示实施例2所述构建突变基因库的PCR体系的试剂转移关系表。Figure 5 shows the reagent transfer relationship table of the PCR system for constructing the mutant gene library described in Example 2.
图6显示实施例2所述构建突变基因库的PCR产物的电泳照片。Figure 6 shows the electrophoresis photographs of the PCR products for constructing the mutant gene library described in Example 2.
图7显示实施例2所述构建突变基因库的拼接反应的反应板和试剂板的分布示意图。FIG. 7 shows a schematic diagram of the distribution of reaction plates and reagent plates of the splicing reaction for constructing the mutant gene library described in Example 2. FIG.
图8显示实施例2所述构建突变基因库的拼接反应体系的试剂转移关系表的流程图。FIG. 8 shows a flowchart of the reagent transfer relationship table of the splicing reaction system for constructing the mutant gene library described in Example 2. FIG.
图9显示实施例2所述构建突变基因库的拼接反应体系的试剂转移关系表。FIG. 9 shows the reagent transfer relationship table of the splicing reaction system for constructing the mutant gene library described in Example 2. FIG.
图10显示实施例2所述构建突变基因库的菌落PCR检验的电泳照片。“Δ”表示二次补检的条带。Figure 10 shows the electrophoresis photographs of the colony PCR test for constructing the mutant gene library described in Example 2. "Δ" represents the strip of the second reinspection.
图11显示实施例3所述U692AEH070中间载体的结构示意图。11 shows a schematic diagram of the structure of the U692AEH070 intermediate carrier described in Example 3.
图12显示实施例3所述构建基因组合库的片段融合的示意图Figure 12 shows a schematic diagram of fragment fusion for constructing a gene combinatorial library described in Example 3
图13显示实施例3所述构建基因组合库的组装示意图。Figure 13 shows a schematic diagram of the assembly of the gene combinatorial library constructed in Example 3.
图14显示实施例3所述构建基因组合库的片段生成PCR的试剂转移关系表的流程图。FIG. 14 shows a flowchart of the reagent transfer relationship table of PCR generated by fragments of the gene combinatorial library constructed in Example 3. FIG.
图15显示实施例3所述构建基因组合库的片段生成PCR的试剂分布表和试剂转移关系表。其中,al表示片段4,all表示片段5,bl表示片段6,bll表示片段7,C表示片段8,F表示正向引物,R表示反向引物;如“113al-F”表示扩增片段4所需的一个正向引物,113al-R表示扩增片段4所需的一个反向引物,“113”是设置的编号;“item”表示扩增目的片段所需的模板,后面的数字是编号。FIG. 15 shows the reagent distribution table and reagent transfer relationship table of the fragment generation PCR for constructing the gene combinatorial library described in Example 3. Among them, al means fragment 4, all means fragment 5, bl means fragment 6, bll means fragment 7, C means fragment 8, F means forward primer, R means reverse primer; for example, "113al-F" means amplified fragment 4 A forward primer required, 113al-R indicates a reverse primer required to amplify fragment 4, "113" is the set number; "item" indicates the template required to amplify the target fragment, and the following number is the number .
图16显示实施例3所述构建基因组合库的片段生成PCR产物的电泳照片。“Δ”表示二次补扩的条带。Figure 16 shows the electrophoresis photographs of PCR products generated from the fragments of the gene combinatorial library constructed in Example 3. "Δ" represents the strips that are supplemented twice.
图17显示实施例3所述构建基因组合库的片段融合PCR的试剂转移关系表的流程图。FIG. 17 shows a flow chart of the reagent transfer relationship table of fragment fusion PCR for constructing a gene combinatorial library described in Example 3. FIG.
图18显示实施例3所述构建基因组合库的片段融合PCR的试剂分布表和试剂转移关系表。其中,A表示片段al和片段all融合扩增后的片段,B表示片段bl和bll融合扩增后的片段,C表示片段8,F表示正向引物,R表示反向引物;如113al-F表示融合扩增片段113A所需的一个正向引物;113all-R表示融合扩增片段113A所需的一个反向引物;113al和113all是融合扩增片段113A的模板,“113”是设置的编号。Figure 18 shows the reagent distribution table and reagent transfer relationship table of the fragment fusion PCR for constructing the gene combinatorial library described in Example 3. Among them, A represents the fragment amplified by the fusion of fragment a1 and fragment all, B is the fragment amplified by the fusion of fragments bl and b11, C refers to fragment 8, F refers to forward primer, R refers to reverse primer; such as 113al-F Indicates a forward primer needed for fusion amplified fragment 113A; 113all-R indicates a reverse primer needed for fusion amplified fragment 113A; 113al and 113all are templates for fusion amplified fragment 113A, "113" is the set number .
图19显示实施例3所述构建基因组合库的片段融合PCR产物的电泳照片。FIG. 19 shows the electrophoresis photograph of the PCR product of the fragment fusion for constructing the gene combinatorial library described in Example 3. FIG.
图20显示实施例3所述构建基因组合库的拼接反应体系的试剂转移关系表的流程图。FIG. 20 shows a flow chart of the reagent transfer relationship table of the splicing reaction system for constructing the gene combinatorial library described in Example 3. FIG.
图21显示实施例3所述构建基因组合库的拼接反应体系的试剂分布表和试剂转移关系表。A表示片段al和片段all融合扩增后的片段,B表示片段bl和bll融合扩增后的片段,C表示片段8,A、B和C是拼接一个全长所需的三个片段;如113A、113B和113C是用于拼接113号全长序列的三个片段,“113”是设置的编号。Figure 21 shows the reagent distribution table and reagent transfer relationship table of the splicing reaction system for constructing the gene combinatorial library described in Example 3. A represents the fragment after fusion and amplification of fragment a1 and fragment all, B represents the fragment after fusion and amplification of fragment bl and b11, C represents fragment 8, A, B and C are the three fragments required to assemble a full length; 113A, 113B, and 113C are used to splice three fragments of the full-length sequence of No. 113, and "113" is the set number.
图22显示实施例3所述构建基因组合库的菌落PCR检验的电泳照片。“Δ”表示二次补检的条带。FIG. 22 shows the electrophoresis photographs of colony PCR test for constructing the gene combinatorial library described in Example 3. FIG. "Δ" represents the strip of the second reinspection.
图23显示96孔板的孔的编号的示意图。Figure 23 shows a schematic diagram of the numbering of the wells of a 96-well plate.
发明详述Detailed description of the invention
在第一个方面,本发明提供一种构建突变基因库的方法,其中突变基因库中的每一种突变基因相对于参照序列包含突变,所述方法包括:In the first aspect, the present invention provides a method for constructing a mutant gene library, wherein each mutant gene in the mutant gene library contains a mutation relative to a reference sequence, and the method includes:
(1)针对每一种突变基因将全长序列分割成一个或多个恒定序列片段和一个或多个可变序列片段,其中恒定序列片段包含与参照序列或其互补序列的对应区段相同的序列,可变序列片段包含与参照序列或其互补序列的对应区段相比的突变;(1) For each mutant gene, the full-length sequence is divided into one or more constant sequence fragments and one or more variable sequence fragments, where the constant sequence fragments contain the same segments as the reference sequence or its complementary sequence. Sequences, variable sequence fragments contain mutations compared to the corresponding segments of the reference sequence or its complementary sequence;
(2)分别生成每一种突变基因的各个恒定序列片段和分别生成每一种突变基因的各个可变序列片段;(2) Generate each constant sequence fragment of each mutant gene and each variable sequence fragment of each mutant gene separately;
(3)针对每一种突变基因配制包含成套的恒定序列片段和可变序列片段的反应体系;和(3) Formulating a reaction system containing a complete set of constant sequence fragments and variable sequence fragments for each mutant gene; and
(4)针对每一种突变基因生成全长突变基因,任选地插入载体,(4) Generate a full-length mutant gene for each mutant gene, and optionally insert it into a vector,
由此构建突变基因库,其中所述方法是在多隔室容器中批量进行的。Thus, a mutant gene library was constructed, wherein the method was carried out in batches in a multi-compartment container.
在第二个方面,本发明提供一种构建基因组合库的方法,其中基因组合库中的各种基因组合相对于彼此具有序列唯一的区段和序列多选的区段,所述方法包括:In a second aspect, the present invention provides a method for constructing a gene combinatorial library, wherein various gene combinations in the gene combinatorial library have sequence-unique segments and multi-sequence segments relative to each other, and the method includes:
(1)针对每一种基因组合将全长序列分割成一个或多个恒定序列片段和一个或多个可变序列片段,其中恒定序列片段对应于在基因组合库中具有唯一序列的区段,可变序列片段对应于在基因组合库中具有多选序列的区段;(1) For each gene combination, the full-length sequence is divided into one or more constant sequence fragments and one or more variable sequence fragments, where the constant sequence fragments correspond to the segments with unique sequences in the gene combination library, The variable sequence segment corresponds to a segment with multiple selection sequences in the gene combinatorial library;
(2)生成各个恒定序列片段和分别生成每一种基因组合的各个可变序列片段;(2) Generate each constant sequence fragment and each variable sequence fragment for each gene combination;
(3)针对每一种基因组合配制包含成套的恒定序列片段和可变序列片段的反应体系;和(3) Formulating a reaction system containing a complete set of constant sequence fragments and variable sequence fragments for each gene combination; and
(4)针对每一种基因组合生成全长基因组合,任选地插入载体,(4) Generate a full-length gene combination for each gene combination, optionally insert a vector,
由此构建基因组合库,其中所述方法是在多隔室容器中批量进行的。Thus, a gene combinatorial library is constructed, wherein the method is carried out in batches in a multi-compartment container.
在第三个方面,本发明提供一种自动配制批量反应体系的方法,其包括:In a third aspect, the present invention provides a method for automatically preparing a batch reaction system, which includes:
生成试剂转移关系表;和Generate a reagent transfer relationship table; and
将试剂转移关系表上传自动移液装置,由自动移液装置按照试剂转移关系表自动将试剂自装有试剂的容器转移至用于进行反应的容器,Upload the reagent transfer relationship table to the automatic pipetting device, and the automatic pipetting device will automatically transfer the reagent from the container containing the reagent to the container for reaction according to the reagent transfer relationship table.
其中试剂转移关系表是如下生成的:The reagent transfer relationship table is generated as follows:
(1)列出要进行的批量反应,列出每一个反应需要的试剂及其体积;(1) List the batch reactions to be carried out, list the reagents and their volumes required for each reaction;
(2)确定每一个反应就反应容器而言的位置;(2) Determine the position of each reaction in terms of the reaction vessel;
(3)确定每一个反应需要的每一种试剂就试剂容器而言的位置;(3) Determine the position of each reagent required for each reaction in terms of the reagent container;
(4)确定每一个反应需要的每一种试剂的转移起点和终点,(4) Determine the transfer starting point and end point of each reagent required for each reaction,
其中所述方法是在多隔室容器中进行的。The method is carried out in a multi-compartment container.
在第四个方面,本发明提供一种构建突变基因库的方法,其中突变基因库中的每一种突变基因相对于参照序列包含突变,所述方法包括:In a fourth aspect, the present invention provides a method for constructing a mutant gene library, wherein each mutant gene in the mutant gene library contains a mutation relative to a reference sequence, and the method includes:
(1)针对每一种突变基因将全长序列分割成一个或多个恒定序列片段和一个或多个可变序列片段,其中恒定序列片段包含与参照序列或其互补序列的对应区段相同的序列,可变序列片段包含与参照序列或其互补序列的对应区段相比的突变;(1) For each mutant gene, the full-length sequence is divided into one or more constant sequence fragments and one or more variable sequence fragments, where the constant sequence fragments contain the same segments as the reference sequence or its complementary sequence. Sequences, variable sequence fragments contain mutations compared to the corresponding segments of the reference sequence or its complementary sequence;
(2)分别生成每一种突变基因的各个恒定序列片段和分别生成每一种突变基因的各个可变序列片段;(2) Generate each constant sequence fragment of each mutant gene and each variable sequence fragment of each mutant gene separately;
(3)针对每一种突变基因通过第三个方面的方法配制包含成套的恒定序列片段和可变序列片段的反应体系;和(3) Prepare a reaction system containing a complete set of constant sequence fragments and variable sequence fragments for each mutant gene by the third aspect of the method; and
(4)针对每一种突变基因生成全长突变基因,任选地插入载体,(4) Generate a full-length mutant gene for each mutant gene, and optionally insert it into a vector,
由此构建突变基因库。Thus, a mutant gene library was constructed.
在第五个方面,本发明提供一种构建基因组合库的方法,其中基因组合库中的各种基因组合相对于彼此具有序列唯一的区段和序列多选的区段,所述方法包括:In a fifth aspect, the present invention provides a method for constructing a gene combinatorial library, wherein various gene combinations in the gene combinatorial library have sequence-unique segments and multi-sequence segments relative to each other, and the method includes:
(1)针对每一种基因组合将全长序列分割成一个或多个恒定序列片段和一个或多个可变序列片段,其中恒定序列片段对应于在基因组合库中具有唯一序列的区段,可变序列片段对应于在基因组合库中具有多选序列的区段;(1) For each gene combination, the full-length sequence is divided into one or more constant sequence fragments and one or more variable sequence fragments, where the constant sequence fragments correspond to the segments with unique sequences in the gene combination library, The variable sequence segment corresponds to a segment with multiple selection sequences in the gene combinatorial library;
(2)生成各个恒定序列片段和分别生成每一种基因组合的各个可变序列片段;(2) Generate each constant sequence fragment and each variable sequence fragment for each gene combination;
(3)针对每一种基因组合通过第三个方面的方法配制包含成套的恒定序列片段和可变序列片段的反应体系;和(3) Prepare a reaction system containing a complete set of constant sequence fragments and variable sequence fragments for each gene combination by the third aspect of the method; and
(4)针对每一种基因组合生成全长基因组合,任选地插入载体,(4) Generate a full-length gene combination for each gene combination, optionally insert a vector,
由此构建基因组合库。Thus, a gene combinatorial library was constructed.
在本发明任何方面的一个实施方案中,所述多隔室容器是多孔板,例如96孔板或384孔板。在一个实施方案中,一次使用一块或多块多孔板,例如1-27、4-25或9-16块96孔板或384孔板,例如1、2、3、4、5、6、8、9、10、12、15、16、18、20、24、25或27块96孔板。In one embodiment of any aspect of the invention, the multi-compartment container is a multi-well plate, such as a 96-well plate or a 384-well plate. In one embodiment, one or more multiwell plates are used at a time, such as 1-27, 4-25, or 9-16 96-well plates or 384-well plates, such as 1, 2, 3, 4, 5, 6, 8 , 9, 10, 12, 15, 16, 18, 20, 24, 25 or 27 96-well plates.
在本发明任何方面的一个实施方案中,多个反应公用的试剂和/或所有反应公用的试剂是合并在试剂板上的一个或多个孔中的。在合并公用试剂的情况中,公用试剂可以只占用试剂板上的一个孔。或者,公用试剂也可以占用试剂板上的多个孔,例如2-12个、2-8个或2-4个孔,例如2、4、6、8或12个孔。公用试剂在试剂板上占用的孔的数目取决于多个因素,例如自动移液一次移液的数目(例如自动化移液工作站一次移液的数目)、手工移液一次移液的数目(例如多道移液器或高通量分装器一次移液的数目)。试剂可以是分子生物学,特别是核酸化学合成中使用的试剂,例如引物(寡核苷酸)、模板(例如质粒、基因组)、缓冲液、限制性核酸内切酶、聚合酶、连接酶、dNTP混合物等。公用试剂可以是一次运行中的多个反应,甚至一次运行中的所有反应公用的试剂,例如公用引物、公用模板、缓冲液、限制性核酸内切酶、聚合酶、连接酶、dNTP混合物等。在一个实施方案中,多个反应公用的试剂和/或所有反应公用的试剂是手工添加的。在一个实施方案中,手工添加是使用多道移液器或高通量分装器进行的。In one embodiment of any aspect of the present invention, reagents common to multiple reactions and/or reagents common to all reactions are combined in one or more wells on the reagent plate. In the case of combining common reagents, the common reagent may occupy only one hole on the reagent plate. Alternatively, the common reagent can also occupy multiple holes on the reagent plate, for example, 2-12, 2-8, or 2-4 holes, for example, 2, 4, 6, 8 or 12 holes. The number of wells occupied by public reagents on the reagent plate depends on many factors, such as the number of pipetting at one time for automatic pipetting (e.g. the number of pipetting at one time in an automated pipetting workstation), the number of pipetting at one time for manual pipetting (e.g., more The number of pipettes per pipette or high-throughput dispenser). The reagent can be molecular biology, especially reagents used in nucleic acid chemical synthesis, such as primers (oligonucleotides), templates (such as plasmids, genomes), buffers, restriction endonucleases, polymerases, ligases, dNTP mixture and so on. Common reagents can be multiple reactions in one run, or even reagents common to all reactions in one run, such as common primers, common templates, buffers, restriction endonucleases, polymerases, ligases, dNTP mixtures, etc. In one embodiment, reagents common to multiple reactions and/or reagents common to all reactions are added manually. In one embodiment, manual addition is performed using a multichannel pipette or high-throughput dispenser.
在本发明任何方面的一个实施方案中,一次进行至少10个反应、至少25个反应、至少50个反应、至少96个反应、至少100个反应、至少192个反应、至少250个反应、至少288个反应、至少384个反应、至少480个反应、至少500个反应、至少576个反应、至少750个反应、至少1,000个反应、至少1,250个反应、至少1,500个反应、至少1,750个反应、至少2,000个反应、至少2,250个反应、至少2,500个反应或至少2,750个反应。一次进行的反应数目的上限取决于自动化工作站(例如自动化移液工作站、自动化合成工作站、自动化 反应工作站)的容量。以实施例中使用的Tecan EVO20018为例,工作站上可以放置27块96孔板,总共提供96 x 27=2592个孔,包括提供试剂的孔和进行反应的孔。以需要一种模板和两种引物的常规PCR(3个试剂孔和1个反应孔)为例,一次可以进行最多648个反应。In one embodiment of any aspect of the present invention, at least 10 reactions, at least 25 reactions, at least 50 reactions, at least 96 reactions, at least 100 reactions, at least 192 reactions, at least 250 reactions, at least 288 reactions are performed at a time. Reactions, at least 384 reactions, at least 480 reactions, at least 500 reactions, at least 576 reactions, at least 750 reactions, at least 1,000 reactions, at least 1,250 reactions, at least 1,500 reactions, at least 1,750 reactions, at least 2,000 Reactions, at least 2,250 reactions, at least 2,500 reactions, or at least 2,750 reactions. The upper limit of the number of reactions performed at one time depends on the capacity of automated workstations (e.g., automated pipetting workstations, automated synthesis workstations, automated reaction workstations). Taking the Tecan EVO20018 used in the examples as an example, 27 96-well plates can be placed on the workstation, providing a total of 96 x 27 = 2592 wells, including wells for providing reagents and wells for performing reactions. Taking the conventional PCR (3 reagent wells and 1 reaction well) that requires one template and two primers as an example, a maximum of 648 reactions can be performed at a time.
在本发明任何方面的一个实施方案中,步骤(4)利用无缝拼接技术,包括但不限于Golden-Gate法、Gibson法或Genbuilder法。本领域技术人员能够根据具体采用的无缝拼接技术确定一次反应拼接的片段数目的上限。通常,一次拼接2-9个、2-7个或3-5个片段,例如2、3、4、5、6、7、8或9个片段。In an embodiment of any aspect of the present invention, step (4) uses seamless splicing technology, including but not limited to Golden-Gate method, Gibson method or Genbuilder method. Those skilled in the art can determine the upper limit of the number of fragments that can be spliced in one reaction according to the seamless splicing technology specifically adopted. Generally, 2-9, 2-7, or 3-5 fragments are spliced at a time, for example, 2, 3, 4, 5, 6, 7, 8, or 9 fragments.
在本发明任何方面的一个实施方案中,步骤(4)利用常规分子生物学技术,包括但不限于限制性核酸内切酶。本领域技术人员能够根据具体采用的限制性核酸内切酶确定一次反应拼接的片段数目的上限。通常,一次拼接2-9个、2-7个或3-5个片段,例如2、3、4、5、6、7、8或9个片段。In one embodiment of any aspect of the present invention, step (4) utilizes conventional molecular biology techniques, including but not limited to restriction endonucleases. Those skilled in the art can determine the upper limit of the number of fragments that can be spliced in one reaction according to the specific restriction endonuclease used. Generally, 2-9, 2-7, or 3-5 fragments are spliced at a time, for example, 2, 3, 4, 5, 6, 7, 8, or 9 fragments.
在构建突变基因库的方法的一个实施方案中,突变可以是一个或多个核苷酸残基的替代、添加或删除。在一个实施方案中,所述多个核苷酸残基可以是连续的或不连续的。在构建突变基因库的方法的一个实施方案中,突变可以是一个或多个所编码氨基酸残基的替代、添加或删除。在一个实施方案中,所述多个所编码氨基酸残基可以是连续的或不连续的。在一个实施方案中,突变是突变基因库的各个成员彼此而言的。在一个实施方案中,突变是突变基因库的成员就参考序列而言的。参考序列可以包含在突变基因库中,也可以不包含在突变基因库中。In one embodiment of the method of constructing a mutant gene library, the mutation may be the substitution, addition or deletion of one or more nucleotide residues. In one embodiment, the plurality of nucleotide residues may be continuous or discontinuous. In one embodiment of the method of constructing a mutant gene library, the mutation may be the substitution, addition or deletion of one or more encoded amino acid residues. In one embodiment, the plurality of encoded amino acid residues may be continuous or discontinuous. In one embodiment, mutations are relative to each member of the mutation gene library. In one embodiment, the mutation is a member of the mutation gene library in terms of the reference sequence. The reference sequence may or may not be included in the mutant gene library.
在构建基因组合库的方法的一个实施方案中,全长目标序列可以是由序列唯一的区段和序列多选的区段构成的单基因序列,也可以是由序列唯一的基因和序列多选的基因构成的多基因序列,甚至可以是由序列唯一的区段和序列多选的区段构成的多基因序列。所述多基因可以是多顺反子,或者是一种级联或途径(例如代谢途径、合成途径、信号传导途径),甚至是完整基因组(例如低等生物(例如病毒)的基因组)。In one embodiment of the method of constructing a gene combinatorial library, the full-length target sequence can be a single gene sequence composed of a sequence unique segment and a sequence multiple-selection segment, or it can be a sequence unique gene and sequence multiple-selection sequence. The multi-gene sequence composed of the genes of the above can even be a multi-gene sequence composed of a unique segment of the sequence and a multi-selected segment of the sequence. The polygene may be polycistronic, or a cascade or pathway (such as a metabolic pathway, a synthetic pathway, a signal transduction pathway), or even a complete genome (such as a genome of a lower organism (such as a virus)).
本领域技术人员能够将目标序列合理划分成各个区段(例如恒定序列区段和可变序列区段)。在突变基因库的实施方案中,可变序列区段是包含突变的区段。在基因组合库的实施方案中,可变序列区段是序列多选的区段。本领域技术人员可以通过比对所有目标序列来进行各个区段的划分。进行序列比对的方法和工具(例如软件)是本领域公知的。各个片段的长度取决于多个因素,例如目标序列的全长、合成系统的最佳合成长度、突变在目标序列上的位置和相邻突变的距离。根据采用的具体方法(特别是各个片段的拼接方法以及插入载体的方法),各个片段之间和/或片段与载体之间可以有重叠序列(即相同序列或互补序列)。根据采用的具体方法(特别是各个片段的拼接方法以及插入载体的方法),各个片段可以包含目标序列上没有的序列,例如限制性位点(例如II型限制性核酸内切酶的,特别是IIs型限制性核酸内切酶的)、同源臂、标签等。Those skilled in the art can reasonably divide the target sequence into segments (for example, a constant sequence segment and a variable sequence segment). In the embodiment of the mutant gene library, the variable sequence segment is a segment that contains a mutation. In the embodiment of the gene combinatorial library, the variable sequence segment is a sequence multiple-selected segment. Those skilled in the art can divide each segment by comparing all target sequences. Methods and tools (such as software) for sequence alignment are well known in the art. The length of each fragment depends on many factors, such as the full length of the target sequence, the optimal synthesis length of the synthesis system, the position of the mutation on the target sequence, and the distance between adjacent mutations. According to the specific method used (especially the method of assembling each fragment and the method of inserting the vector), there may be overlapping sequences (ie, the same sequence or complementary sequence) between the various fragments and/or between the fragments and the vector. According to the specific method used (especially the method of assembling each fragment and the method of inserting the vector), each fragment may contain sequences that are not on the target sequence, such as restriction sites (such as type II restriction endonucleases, especially Type IIs restriction endonuclease), homology arms, tags, etc.
目标序列的全长可以是例如约1-100,000、10-10,000、100-8,000、150-8,000、200-5,000、250-1,000或400-600,例如约100、150、250、500、750、1,000、1,250、1,500、1,750、2,000、2,500、3,000、4,000、5,000、8,000、10,000、12,500、15,000、17,500、20,000、 25,000、50,000、75,000或100,000(包括本数及±10%)。每个片段(例如可变序列片段或恒定序列片段)的长度可以是例如约1-5,500、150-5,000、200-1,000、250-500或400-600,例如约50、100、150、200、250、500、750、1,000或5,000(包括本数及±10%)。The full length of the target sequence can be, for example, about 1-100,000, 10-10,000, 100-8,000, 150-8,000, 200-5,000, 250-1,000 or 400-600, for example about 100, 150, 250, 500, 750, 1,000 , 1,250, 1,500, 1,750, 2,000, 2,500, 3,000, 4,000, 5,000, 8,000, 10,000, 12,500, 15,000, 17,500, 20,000, 25,000, 50,000, 75,000 or 100,000 (including this number and ±10%). The length of each fragment (such as a variable sequence fragment or a constant sequence fragment) can be, for example, about 1-5,500, 150-5,000, 200-1,000, 250-500, or 400-600, such as about 50, 100, 150, 200, 250, 500, 750, 1,000 or 5,000 (including this number and ±10%).
对于单基因,目标序列的全长可以是例如约1-10,000、100-8,000、150-8,000、150-5,000、200-5,000、250-1,000或400-600,例如约100、150、250、500、750、1,000、1,250、1,500、1,750、2,000、2,500、5,000、8,000或10,000(包括本数及±10%)。每个片段(例如可变序列片段或恒定序列片段)的长度可以是例如约1-5,500、150-5,000、200-1,000、250-750或400-600,例如约50、100、150、200、250、500、750、1,000或5,000(包括本数及±10%)。For a single gene, the full length of the target sequence can be, for example, about 1-10,000, 100-8,000, 150-8,000, 150-5,000, 200-5,000, 250-1,000, or 400-600, such as about 100, 150, 250, 500 , 750, 1,000, 1,250, 1,500, 1,750, 2,000, 2,500, 5,000, 8,000 or 10,000 (including this number and ±10%). The length of each fragment (such as a variable sequence fragment or a constant sequence fragment) may be, for example, about 1-5,500, 150-5,000, 200-1,000, 250-750, or 400-600, such as about 50, 100, 150, 200, 250, 500, 750, 1,000 or 5,000 (including this number and ±10%).
对于多基因,目标序列的全长可以是例如约1-100,000、10-10,000、100-10,000、200-8,000或500-8,000,例如约100、250、500、750、1,000、1,250、1,500、1,750、2,000、2,500、3,000、4,000、5,000、8,000、10,000、12,500、15,000、17,500、20,000、25,000、50,000、75,000或100,000(包括本数及±10%)。每个片段(例如可变序列片段或恒定序列片段)的长度可以是例如约1-5,500、100-5,000、150-5,000、250-1,000或250-500,例如约50、100、150、200、250、500、1,000或5,000(包括本数及±10%)。For polygenes, the full length of the target sequence can be, for example, about 1-100,000, 10-10,000, 100-10,000, 200-8,000, or 500-8,000, such as about 100, 250, 500, 750, 1,000, 1,250, 1,500, 1,750 , 2,000, 2,500, 3,000, 4,000, 5,000, 8,000, 10,000, 12,500, 15,000, 17,500, 20,000, 25,000, 50,000, 75,000 or 100,000 (including this number and ±10%). The length of each fragment (such as a variable sequence fragment or a constant sequence fragment) may be, for example, about 1-5,500, 100-5,000, 150-5,000, 250-1,000, or 250-500, such as about 50, 100, 150, 200, 250, 500, 1,000 or 5,000 (including this number and ±10%).
试剂转移关系表可以使用任何工具(例如软件)来进行,特别是统计学软件,例如Excel。可以列出每个反应需要的试剂及其体积(每个反应的体积可以是不一样的);放置试剂的位置(例如板的代码和孔的代码);进行反应的位置(例如板的代码和孔的代码)。可以查找重复使用的试剂,即公用试剂,包括多个反应公用的试剂,甚至所有反应公用的试剂。可以利用Excel自带的countif模块或者数据透视表来进行相关数据统计,统计出公用试剂。可以统计公用试剂的总需求量。可以通过Excel的删除重复项模块来合并公用试剂,节省公用试剂占用的孔的数目,从而节省板的数目,提高一次运行进行的反应数。可以用各种方式来代表各种试剂(特别是引物和模板),例如名称、代码、序列、编号、合成单号。在多步骤方法中,可以利用在先步骤的试剂转移关系表的信息来生成在后步骤的试剂转移关系表。这中间涉及到的数据比对、查找、定位等可以利用vlookup模块来进行。可以以各种方式输出试剂转移关系表,例如CSV等机器可以识别的格式。The reagent transfer relationship table can be performed using any tool (such as software), especially statistical software, such as Excel. You can list the reagents and their volumes required for each reaction (the volume of each reaction can be different); the location of the reagents (such as the code of the plate and the code of the well); the location of the reaction (such as the code of the plate and the code of the well); Hole code). You can search for reusable reagents, that is, common reagents, including reagents common to multiple reactions, and even reagents common to all reactions. You can use Excel's own countif module or pivot table to carry out relevant data statistics and calculate public reagents. The total demand for public reagents can be counted. The common reagents can be combined through the deduplication module of Excel to save the number of wells occupied by the common reagents, thereby saving the number of plates and increasing the number of reactions performed in one run. Various reagents (especially primers and templates) can be represented in various ways, such as name, code, sequence, number, synthetic order number. In the multi-step method, the information of the reagent transfer relationship table in the previous step can be used to generate the reagent transfer relationship table in the subsequent step. The data comparison, search, positioning, etc. involved in this process can be performed using the vlookup module. The reagent transfer relationship table can be output in various ways, such as CSV and other formats that can be recognized by the machine.
实施例Example
实施例1:无缝拼接载体的构建Example 1: Construction of seamless splicing vector
1.前言1 Introduction
此实施例描述可用于无缝拼接(例如具有一对头-头取向的IIS型限制酶(例如BsmB I)识别位点)、具有蓝白斑筛选功能(例如具有LacZ基因)的例示性质粒的构建。简言之,改造质粒pUC57(GenBank:Y14837.1),得到质粒pUC57-BsmBI。This example describes the construction of exemplary plasmids that can be used for seamless splicing (e.g., recognition sites for type IIS restriction enzymes (e.g., BsmB I) with a pair of head-to-head orientation) and blue-white spot screening functions (e.g., LacZ gene). In short, the plasmid pUC57 (GenBank: Y14837.1) was transformed to obtain the plasmid pUC57-BsmBI.
如图1A所示,质粒pUC57包含LacZ基因及位于其同侧的尾-尾取向(即识别位点在外,切割位点在内)的两个BsmB I限制性位点。基于BsmB I限制性的克隆规程后,LacZ基因和两个BsmB I识别位点均被保留。因而,pUC57不适合作为无缝拼接载体。As shown in Figure 1A, the plasmid pUC57 contains the LacZ gene and two BsmB I restriction sites in a tail-to-tail orientation (that is, the recognition site is outside and the cleavage site is inside) on the same side. Based on the BsmB I restriction cloning procedure, the LacZ gene and the two BsmB I recognition sites are all retained. Therefore, pUC57 is not suitable as a seamless splicing vector.
如图1B所示,改造后的质粒pUC57-BsmBI包含LacZ基因及位于其异侧的头-头取向(即识别位点在内,切割位点在外)的一对BsmB I限制性位点。基于BsmB I限制性的克隆 规程后,LacZ基因和一对BsmB I识别位点均被切除。因而,pUC57-BsmBI适合作为无缝拼接载体。As shown in Fig. 1B, the transformed plasmid pUC57-BsmBI contains the LacZ gene and a pair of BsmB I restriction sites in the head-to-head orientation (that is, the recognition site is inside, and the cleavage site is outside) of the LacZ gene. Based on the BsmB I restriction cloning procedure, both the LacZ gene and a pair of BsmB I recognition sites were excised. Therefore, pUC57-BsmBI is suitable as a seamless splicing vector.
2.方法2. Method
基于质粒pUC57(包括LacZ基因)的序列和BsmB I识别位点的序列,设计并化学合成(南京金斯瑞生物科技有限公司)以下四种引物:Based on the sequence of the plasmid pUC57 (including the LacZ gene) and the sequence of the BsmB I recognition site, the following four primers were designed and chemically synthesized (Nanjing GenScript Biotechnology Co., Ltd.):
pUC57-BsmBI-f:agaggcctgcatgcaagcttggcgtaatcatggtcatagctgttcgtctctcctgtgtgaaattgttatccgc(SEQ ID NO:1);pUC57-BsmBI-f: agaggcctgcatgcaagcttggcgtaatcatggtcatagctgttcgtctctcctgtgtgaaattgttatccgc (SEQ ID NO: 1);
pUC57-BsmBI-r:ggttatcaagtgagaaatcaccatgagtgacgactgaatcggtttcttagacgtcaggtggc(SEQ ID NO:2);pUC57-BsmBI-r: ggttatcaagtgagaaatcaccatgagtgacgactgaatcggtttcttagacgtcaggtggc (SEQ ID NO: 2);
pUC57-LacZ-f:gattcagtcgtcactcatggtgatttctcacttgataaccttcggtgatgacggtgaaaac(SEQ ID NO:3);pUC57-LacZ-f: gattcagtcgtcactcatggtgatttctcacttgataaccttcggtgatgacggtgaaaac (SEQ ID NO: 3);
pUC57-LacZ-r:gctatgaccatgattacgccaagctt(SEQ ID NO:4)。pUC57-LacZ-r: gctatgaccatgattacgccaagctt (SEQ ID NO: 4).
以质粒pUC57为模板,用引物pUC57-BsmBI-f和pUC57-BsmBI-r扩增pUC57质粒骨架部分(产物长度2245bp),用引物pUC57-LacZ-f和pUC57-LacZ-r扩增LacZ基因插入物部分(产物长度528bp)。两种PCR产物两端各有40bp同源臂。Using plasmid pUC57 as a template, using primers pUC57-BsmBI-f and pUC57-BsmBI-r to amplify the pUC57 plasmid backbone (product length 2245bp), using primers pUC57-LacZ-f and pUC57-LacZ-r to amplify the LacZ gene insert Part (product length 528bp). The two PCR products have 40 bp homology arms at each end.
基于PrimeSTAR GXL DNA Polymerase(Takara生物技术有限公司,R050A)的PCR体系:PCR system based on PrimeSTAR GXL DNA Polymerase (Takara Biotechnology Co., Ltd., R050A):
Figure PCTCN2021083012-appb-000001
Figure PCTCN2021083012-appb-000001
PCR程序:PCR program:
Figure PCTCN2021083012-appb-000002
Figure PCTCN2021083012-appb-000002
使用GenBuilder Plus(南京金斯瑞生物科技有限公司,IM00712)拼接上述两种PCR产物,得到质粒pUC57-BsmBI。Use GenBuilder Plus (Nanjing GenScript Biotechnology Co., Ltd., IM00712) to splice the above two PCR products to obtain the plasmid pUC57-BsmBI.
拼接反应体系:Splicing reaction system:
Figure PCTCN2021083012-appb-000003
Figure PCTCN2021083012-appb-000003
Figure PCTCN2021083012-appb-000004
Figure PCTCN2021083012-appb-000004
拼接反应条件:Splicing reaction conditions:
Figure PCTCN2021083012-appb-000005
Figure PCTCN2021083012-appb-000005
实施例2:突变基因库的构建Example 2: Construction of mutant gene library
任务:Task:
在载体中插入一个核酸序列集合,共24个成员(即目标序列)。目标序列全长819bp,其中720bp核心区段相对于起始序列包含突变,72bp上游区段和27bp下游区段相对于起始序列不包含突变。而且,720bp核心区段可以以30bp为单位分割成24个连续核心亚区段,每一种目标序列相对于起始序列在相应的一个核心亚区段中包含突变,在其它核心亚区段中不包含突变(如图2A所示)。Insert a collection of nucleic acid sequences into the vector, a total of 24 members (ie target sequences). The target sequence is 819 bp in length, in which the 720 bp core segment contains mutations relative to the starting sequence, and the 72 bp upstream segment and 27 bp downstream segment contain no mutations relative to the starting sequence. Moreover, the 720bp core segment can be divided into 24 continuous core sub-segments in units of 30 bp, and each target sequence contains mutations in a corresponding core sub-segment relative to the starting sequence, and in other core sub-segments. Does not contain mutations (as shown in Figure 2A).
基本设计:basic design:
通过PCR,生成A、B、C三个片段集合,各24种序列,其中:中间的B片段长80bp,包含居中的一段30bp移动窗口,与一个上述核心亚区段对应,两侧侧翼序列合计50bp,提供无缝拼接的识别位点和保护碱基(例如与起始序列的对应部分相同或互补);上游的A片段补足相应的B片段相对于起始序列缺乏的上游序列;下游的C片段补足相应的B片段相对于起始序列缺乏的下游序列;与B片段类似,A片段和C片段也包含侧翼序列,提供无缝拼接的识别位点和保护碱基(例如与起始序列的对应部分相同或互补;例如与相应的B片段或质粒交叠)。通过无缝拼接,将相应的一套A、B、C片段插入载体(如图2B所示)。By PCR, three sets of fragments A, B, and C are generated, each with 24 kinds of sequences, of which: the middle B fragment is 80 bp long, contains a 30 bp moving window in the middle, corresponds to one of the above core sub-segments, and the flanking sequences on both sides total 50bp, providing a seamlessly spliced recognition site and protective base (for example, the same or complementary to the corresponding part of the starting sequence); the upstream A segment complements the upstream sequence that the corresponding B segment lacks relative to the starting sequence; the downstream C The fragments complement the corresponding downstream sequences that the B fragment lacks relative to the starting sequence; similar to the B fragment, the A fragment and the C fragment also contain flanking sequences, providing a seamlessly spliced recognition site and protective bases (for example, with the starting sequence The corresponding parts are the same or complementary; for example, overlap with the corresponding B fragment or plasmid). Through seamless splicing, the corresponding set of fragments A, B, and C is inserted into the vector (as shown in Figure 2B).
方法:method:
(1)片段的生成(1) Fragment generation
如上所述,这个步骤生成A1至A24、B1至B24、C1至C24共72种片段。可以通过PCR来生成这些片段。As mentioned above, this step generates a total of 72 fragments from A1 to A24, B1 to B24, and C1 to C24. These fragments can be generated by PCR.
(1.1)模板(1.1) Template
合成(南京金斯瑞生物科技有限公司)不含任何突变的起始序列作为PCR生成A1至A24、C1至C24片段的公用模板,设计并合成(南京金斯瑞生物科技有限公司)包含所有突变的突变序列作为PCR生成B1至B24片段的公用模板。Synthesized (Nanjing GenScript Biotechnology Co., Ltd.) The starting sequence without any mutations was used as a common template for PCR to generate fragments A1 to A24, C1 to C24, and designed and synthesized (Nanjing GenScript Biotechnology Co., Ltd.) including all mutations The mutant sequence is used as a common template for PCR to generate fragments B1 to B24.
(1.2)引物(1.2) Primer
设计并合成(南京金斯瑞生物科技有限公司)用于PCR扩增A1至A24、B1至B24、C1至C24片段的引物,在每种片段的两端各引入一个BsmB I识别位点(尾-尾取向)。其中,用于PCR扩增A1至A24片段的正向引物为公用引物,用于PCR扩增C1至C24片段的反向引物为公用引物。Designed and synthesized (Nanjing GenScript Biotechnology Co., Ltd.) primers for PCR amplification of fragments A1 to A24, B1 to B24, and C1 to C24, and introduced a BsmB I recognition site (tail -Tail orientation). Wherein, the forward primer used for PCR amplification of the fragments A1 to A24 is a common primer, and the reverse primer used for PCR amplification of the C1 to C24 fragments is a common primer.
(1.3)编写试剂位置表和反应位置表(1.3) Compile reagent position table and reaction position table
如上所述,共有2种PCR模板和98种PCR引物,产生72种PCR产物。如果使用96孔板的话,至少需要2块试剂板(以Source1和Source2标记)和1块反应板(以Destination1标记)。其中,试剂板标示引物和模板的位置,反应板标示产物的位置。通常,先确定反应板上产物的位置,再根据反应板上产物的位置,确定试剂板上引物和模板的位置(如图3所示)。As mentioned above, there are 2 kinds of PCR templates and 98 kinds of PCR primers, resulting in 72 kinds of PCR products. If a 96-well plate is used, at least 2 reagent plates (labeled Source1 and Source2) and 1 reaction plate (labeled Destination1) are required. Among them, the reagent plate indicates the position of the primer and template, and the reaction plate indicates the position of the product. Usually, the position of the product on the reaction plate is determined first, and then the position of the primer and template on the reagent plate is determined according to the position of the product on the reaction plate (as shown in Figure 3).
用于PCR扩增A1至A24片段的正向引物为公用引物,以GG-AF标示;用于PCR扩增A1至A24片段的反向引物分别以GG-AR1至GG-AR24标示;用于PCR扩增B1至B24片段的正向引物和反向引物分别以GG-BF1至GG-BF24和GG-BR1至GG-BR24标示;用于PCR扩增C1至C24片段的正向引物分别以GG-CF1至GG-CF24标示;用于PCR扩增C1至C24片段的反向引物为公用引物,以GG-CR标示。The forward primers used for PCR amplification of fragments A1 to A24 are common primers, marked with GG-AF; the reverse primers used for PCR amplification of fragments A1 to A24 are marked with GG-AR1 to GG-AR24; used for PCR The forward and reverse primers used to amplify fragments B1 to B24 are labeled GG-BF1 to GG-BF24 and GG-BR1 to GG-BR24, respectively; the forward primers used for PCR amplification of fragments C1 to C24 are labeled GG-BF1 to GG-BF24, respectively. CF1 to GG-CF24 are labeled; the reverse primers used for PCR amplification of C1 to C24 fragments are common primers and are labeled GG-CR.
起始序列作为PCR生成A1至A24和C1至C24片段的公用模板,以A/C-T标示;包含所有可能突变的突变序列作为PCR生成B1至B24片段的公用模板,以B-T标示。The starting sequence is used as a common template for PCR to generate fragments of A1 to A24 and C1 to C24, marked with A/C-T; the mutant sequence containing all possible mutations is used as a common template for PCR to generate fragments of B1 to B24, marked with B-T.
公用试剂(包括引物和模板)可以占用一个或多个孔。Common reagents (including primers and templates) can occupy one or more wells.
(1.4)编写试剂转移关系表(1.4) Compile the reagent transfer relationship table
为了进行PCR,需要将引物、模板等从试剂板转移至反应板。确定需要转移的试剂的源位置和目标位置,及需要转移的体积。具体而言,将片段A的反向引物自Source1板的GG-AR1至GG-AR24位置转移至对应的Destination1板的GG-A1至GG-A24位置,将片段B的正向引物自Source1板的GG-BF1至GG-BF24位置转移至对应的Destination1板的GG-B1至GG-B24位置,将片段B的反向引物自Source1板的GG-BR1至GG-BR24位置转移至对应的Destination1板的GG-B1至GG-B24位置,将片段C的正向引物自Source1板的GG-CF1至GG-CF24位置转移至对应的Destination1板的GG-C1至GG-C24位置,将片段A的共用正向引物自Source2板的GG-AF位置转移至Destination1板的GG-A1至GG-A24位置,将片段C的共用反向引物自Source2板的GG-CR位置转移至Destination1板的GG-C1至GG-C24位置。将片段A1至A24和C1至C24的公用模板自Source2板的A/C-T位置转移至对应的Destination1板的GG-A1至GG-A24和GG-C1至GG-C24位置,将片段B1至B24的公用模板自Source2板的B-T位置转移至对应的Destination1板的GG-B1至GG-B24位置。In order to perform PCR, it is necessary to transfer primers, templates, etc. from the reagent plate to the reaction plate. Determine the source and target locations of the reagents that need to be transferred, and the volume that needs to be transferred. Specifically, transfer the reverse primer of fragment A from GG-AR1 to GG-AR24 on the Source1 plate to the corresponding GG-A1 to GG-A24 on the Destination1 plate, and transfer the forward primer of fragment B from the GG-AR1 to GG-AR24 of the Source1 plate. GG-BF1 to GG-BF24 position is transferred to the corresponding Destination1 plate GG-B1 to GG-B24 position, the reverse primer of fragment B is transferred from GG-BR1 to GG-BR24 position of Source1 plate to the corresponding Destination1 plate GG-B1 to GG-B24 position, transfer the forward primer of fragment C from GG-CF1 to GG-CF24 position of Source1 plate to GG-C1 to GG-C24 position of corresponding Destination1 plate, and transfer the common positive of fragment A Transfer the primers from the GG-AF position of the Source2 plate to the GG-A1 to GG-A24 positions of the Destination1 plate, and transfer the common reverse primer of fragment C from the GG-CR position of the Source2 plate to the GG-C1 to GG of the Destination1 plate -C24 location. Transfer the common templates of the fragments A1 to A24 and C1 to C24 from the A/CT position of the Source2 plate to the GG-A1 to GG-A24 and GG-C1 to GG-C24 positions of the corresponding Destination1 plate. The common template is transferred from the BT position on the Source2 board to the GG-B1 to GG-B24 positions on the corresponding Destination1 board.
如图4所示,生成试剂转移关系表。生成的试剂转移关系表如图5所示。As shown in Figure 4, a reagent transfer relationship table is generated. The generated reagent transfer relationship table is shown in Figure 5.
(1.5)PCR体系配制和运行(1.5) PCR system preparation and operation
如上所述,引物是根据试剂位置表在96孔深孔板(上海百赛生物技术有限公司,PCR-96M2-HS-C)的相应孔中合成的。模板在合成后根据试剂位置表添加到96孔深孔板的相应孔中。As mentioned above, the primers are synthesized in the corresponding wells of a 96-well deep-well plate (Shanghai Best Biotechnology Co., Ltd., PCR-96M2-HS-C) according to the reagent position table. After synthesis, the template is added to the corresponding wells of the 96-well deep-well plate according to the reagent position table.
将转移关系表上传Tecan工作站(Tecan,型号EVO20018)。由Tecan工作站按照转移关系表自动进行引物和模板的转移,从试剂板转移至反应板。Upload the transfer relationship table to the Tecan workstation (Tecan, model EVO20018). The Tecan workstation automatically transfers primers and templates according to the transfer relationship table, from the reagent plate to the reaction plate.
配制反应工作液,分装至已有引物和模板的反应板孔中。分装可用八通道移液器手动分装或者采用高通量微量分装器(Preddator,型号S4)进行自动分装。Prepare the reaction working solution and distribute it to the wells of the reaction plate with primers and templates. Dispensing can be manually dispensed with an eight-channel pipette or automatically dispensed with a high-throughput microdispenser (Preddator, Model S4).
如下所述通过PCR生成A、B、C片段。Fragments A, B, and C were generated by PCR as described below.
基于Phusion High-Fidelity DNA Polymerase(NEB生物技术有限公司,M0530L)的片段A和C的PCR体系:PCR system based on Fragments A and C of Phusion High-Fidelity DNA Polymerase (NEB Biotechnology Co., Ltd., M0530L):
Figure PCTCN2021083012-appb-000006
Figure PCTCN2021083012-appb-000006
片段A和C的PCR程序:PCR program for fragments A and C:
Figure PCTCN2021083012-appb-000007
Figure PCTCN2021083012-appb-000007
基于Phusion High-Fidelity DNA Polymerase(NEB生物技术有限公司,M0530L)的片段B的PCR体系:PCR system based on Fragment B of Phusion High-Fidelity DNA Polymerase (NEB Biotechnology Co., Ltd., M0530L):
Figure PCTCN2021083012-appb-000008
Figure PCTCN2021083012-appb-000008
片段B的PCR程序:PCR program for fragment B:
Figure PCTCN2021083012-appb-000009
Figure PCTCN2021083012-appb-000009
Figure PCTCN2021083012-appb-000010
Figure PCTCN2021083012-appb-000010
PCR产物的预期大小如表1所示。回收的PCR产物的凝胶电泳照片如图6所示。The expected size of the PCR product is shown in Table 1. The gel electrophoresis photograph of the recovered PCR product is shown in Figure 6.
表1Table 1
Figure PCTCN2021083012-appb-000011
Figure PCTCN2021083012-appb-000011
(2)片段的拼接(2) Fragment splicing
如上所述,这个步骤将成套的A、B、C片段插入质粒。可以通过无缝拼接将这些片段插入质粒pUC57-BsmBI(见实施例1)。As mentioned above, this step inserts the set of A, B, and C fragments into the plasmid. These fragments can be inserted into the plasmid pUC57-BsmBI by seamless splicing (see Example 1).
(2.1)插入物和质粒的用量(2.1) The amount of insert and plasmid
片段B,长度均为80bp,用量为约40ng。片段A和片段C,具有不同长度,长度小于550bp的用量为约80ng,长度大于550bp的用量为约120ng。质粒pUC57-BsmBI,长度为2683bp,用量为约140ng。Fragment B, the length is 80bp, the dosage is about 40ng. Fragment A and Fragment C have different lengths, the dosage is about 80ng if the length is less than 550bp, and the dosage is about 120ng if the length is greater than 550bp. The plasmid pUC57-BsmBI has a length of 2683bp and a dosage of about 140ng.
(2.2)编写试剂位置表和反应位置表(2.2) Compile reagent position table and reaction position table
如上所述,共有72种插入片段(即上一步得到的72种PCR产物)和1种质粒(即pUC57-BsmBI),产生24种拼接产物。如果使用96孔板的话,至少需要1块试剂板(以Source11标示)和1块反应板(以Destination11标示)。通常,先确定反应板上产物的位置,再根据反应板上产物的位置,确定试剂板上插入片段和质粒的位置(如图7所示)。As mentioned above, there are 72 kinds of insert fragments (that is, 72 kinds of PCR products obtained in the previous step) and 1 kind of plasmid (that is, pUC57-BsmBI), resulting in 24 kinds of splicing products. If a 96-well plate is used, at least one reagent plate (labeled Source11) and one reaction plate (labeled Destination11) are required. Usually, the position of the product on the reaction plate is determined first, and then the position of the inserted fragment and plasmid on the reagent plate is determined according to the position of the product on the reaction plate (as shown in Figure 7).
拼接反应的试剂位置表可以沿用上一步PCR的反应位置表(以Source11-Destination1标示),其中增加质粒(以GG-V标示)。The reagent position table of the splicing reaction can follow the reaction position table of the previous PCR (marked by Source11-Destination1), and the plasmid (marked by GG-V) is added.
(2.4)编写试剂转移关系表(2.4) Compile the reagent transfer relationship table
为了进行拼接,需要将插入物、质粒等从试剂板转移至反应板。确定需要转移的试剂的源位置和目标位置,及需要转移的体积。具体而言,将片段A自Source11-Destination1的GG-A1至GG-A24位置转移至对应的Destination11板的GoldenGate-1至GoldenGate-24位置,将片段B自Source11-Destination1的GG-B1至GG-B24位置转移至对应的Destination11板的GoldenGate-1至GoldenGate-24位置,将片段C自Source11-Destination1的GG-B1~GG-B24转移至对应的Destination11板的GoldenGate-1至GoldenGate-24位置,将质粒GG-V转移至对应的Destination11板的GoldenGate-1至GoldenGate-24位置。For splicing, it is necessary to transfer inserts, plasmids, etc. from the reagent plate to the reaction plate. Determine the source and target locations of the reagents that need to be transferred, and the volume that needs to be transferred. Specifically, segment A is transferred from GG-A1 to GG-A24 of Source11-Destination1 to GoldenGate-1 to GoldenGate-24 of the corresponding Destination11 board, and segment B is transferred from GG-B1 to GG- of Source11-Destination1. Move the B24 position to the GoldenGate-1 to GoldenGate-24 positions of the corresponding Destination11 board, and transfer the fragment C from GG-B1~GG-B24 of Source11-Destination1 to the GoldenGate-1 to GoldenGate-24 positions of the corresponding Destination11 board. The plasmid GG-V was transferred to the positions of GoldenGate-1 to GoldenGate-24 on the corresponding Destination11 plate.
如图8所示,生成试剂转移关系表。生成的试剂转移关系表如图9所示。As shown in Figure 8, a reagent transfer relationship table is generated. The generated reagent transfer relationship table is shown in FIG. 9.
(2.5)拼接体系配制和运行(2.5) Preparation and operation of splicing system
将转移关系表上传Tecan工作站。由Tecan工作站按照转移关系表自动进行插入物和质粒的转移,从试剂板转移至反应板。Upload the transfer relationship table to the Tecan workstation. The Tecan workstation automatically transfers inserts and plasmids according to the transfer relationship table, from the reagent plate to the reaction plate.
配制反应工作液,分装至已有插入物和质粒的反应板孔中。质粒也可以配制在工作液中。分装可用八通道移液器手动分装或者采用高通量微量分装器进行自动分装。在反应液粘稠的情况下可进行手动分装,以降低误差。Prepare the reaction working solution and distribute it to the wells of the reaction plate with inserts and plasmids. Plasmids can also be formulated in working solutions. Dispensing can be manually dispensed with an eight-channel pipette or automatically dispensed with a high-throughput micro-dispenser. When the reaction solution is viscous, manual aliquoting can be carried out to reduce errors.
基于BsmB I限制酶(NEB生物技术有限公司,R0580L)和T4 DNA Ligase(NEB生物技术有限公司,M0202M)的拼接反应体系:Splicing reaction system based on BsmB I restriction enzyme (NEB Biotechnology Co., Ltd., R0580L) and T4 DNA Ligase (NEB Biotechnology Co., Ltd., M0202M):
Figure PCTCN2021083012-appb-000012
Figure PCTCN2021083012-appb-000012
拼接反应程序:Splicing reaction procedure:
Figure PCTCN2021083012-appb-000013
Figure PCTCN2021083012-appb-000013
(3)检验(3) Inspection
按照常规方法,将拼接产物转化入大肠杆菌Top10感受态细胞(南京金斯瑞生物科技有限公司),进行蓝白斑平板筛选(结果未显示)。每个拼接反应使用Qpix(MolecμLar Devices,Qpix 420)挑取4个单菌落对插入物进行PCR检验(如图10所示);将PCR检验正确的菌落提交测序检验(数据未显示)。检验结果如表2所示。总之,拼接的一次成功率达到100%,一次正确率达到95%以上。According to a conventional method, the splicing product was transformed into E. coli Top10 competent cells (Nanjing GenScript Biotechnology Co., Ltd.), and blue-white plate screening was performed (results not shown). Each splicing reaction uses Qpix (MolecμLar Devices, Qpix 420) to pick 4 single colonies to perform PCR inspection on the inserts (as shown in Figure 10); the colonies that are correct by PCR are submitted for sequencing inspection (data not shown). The test results are shown in Table 2. In short, the one-time success rate of splicing reaches 100%, and the one-time correct rate reaches more than 95%.
表2Table 2
编号serial number PCR阳性率PCR positive rate 测序正确率Sequencing accuracy rate
11 3/43/4 ++
22 3/43/4 ++
33 4/44/4 ++
44 2/42/4 ++
55 4/44/4 ++
66 4/44/4 ++
77 4/44/4 ++
88 2/42/4 ++
99 3/43/4 ++
1010 3/43/4 ++
1111 4/44/4 ++
1212 4/44/4 ++
1313 3/43/4 ++
1414 4/44/4 ++
1515 4/44/4 ++
1616 4/44/4 ++
1717 3/43/4 ++
1818 4/44/4 ++
1919 4/44/4 ++
2020 1/41/4 ++
21twenty one 2/42/4 ++
22twenty two 3/43/4 ++
23twenty three 4/44/4 ++
24twenty four 2/42/4 ++
实施例3:基因组合库的构建Example 3: Construction of gene combinatorial library
任务:Task:
在载体中插入一个核酸序列集合,共50个成员(即目标序列)。目标序列由9个部分构成,按照5’至3’方向以片段1至9表示。其中,片段4长度为500bp,有4种序列可选;片段5长度为301-1500bp,有16种序列可选;片段8长度为2050bp,有4种序列可选;片段6长度为711bp,序列唯一;片段7长度为400bp,序列唯一;片段1、2、3和9序列唯一。9个片段总共有50种组合,即目标序列。Insert a collection of nucleic acid sequences into the vector, a total of 50 members (ie target sequences). The target sequence is composed of 9 parts, which are represented by fragments 1 to 9 in the 5'to 3'direction. Among them, the length of fragment 4 is 500 bp, and there are 4 kinds of sequences to choose; the length of fragment 5 is 301-1500 bp, there are 16 kinds of sequences to choose; the length of fragment 8 is 2050 bp, there are 4 kinds of sequences to choose; the length of fragment 6 is 711 bp, the sequence Unique; Fragment 7 has a length of 400bp with unique sequence; Fragments 1, 2, 3 and 9 have unique sequences. There are a total of 50 combinations of 9 fragments, that is, the target sequence.
基本设计:basic design:
位于两端的片段1、2、3和9序列唯一,因而可以先将这四个片段整合入基于pUC57构建的载体pUC57-KanR(SEQ ID NO:5),产生中间载体U692AEH070-2(如图11所示)。为了缩小片段间的长度差异,根据序列长度,将片段4与片段5融合成片段A,将片段6与片段7融合成片段B,将片段8单独作为片段C(如图12所示)。片段B两端与片段A和C各有60bp的同源臂,用于进行无缝拼接。最后,将相应的一套A、B、C片段插入中间载体(如图13所示)。The sequences of fragments 1, 2, 3 and 9 at both ends are unique, so these four fragments can be integrated into the vector pUC57-KanR (SEQ ID NO: 5) constructed based on pUC57 to generate the intermediate vector U692AEH070-2 (Figure 11) Shown). In order to reduce the length difference between the fragments, according to the sequence length, fragment 4 and fragment 5 are fused into fragment A, fragment 6 and fragment 7 are fused into fragment B, and fragment 8 is used as fragment C alone (as shown in FIG. 12). Both ends of fragment B and fragments A and C each have 60 bp homology arms for seamless splicing. Finally, insert the corresponding set of A, B, and C fragments into the intermediate vector (as shown in Figure 13).
(1)中间载体的构建(1) Construction of intermediate vector
如上所述,通过常规方法,将片段1、片段2、片段3、片段9这四个片段整合入pUC57-KanR,同时引入Not I和Asc I限制性位点供后续克隆用,产生中间载体U692AEH070-2。将中间载体通过Not I和Asc I限制性消化而线性化,纯化,备用(浓度为约20ng/μL)。中间载体的片段两端与片段A和C各有40bp的同源臂,用于进行无缝拼接。As mentioned above, through conventional methods, the four fragments, fragment 1, fragment 2, fragment 3, and fragment 9 were integrated into pUC57-KanR, and the Not I and Asc I restriction sites were introduced for subsequent cloning to generate intermediate vector U692AEH070 -2. The intermediate vector was linearized by Not I and Asc I restriction digestion, purified, and ready for use (concentration of about 20 ng/μL). Both ends of the intermediate vector have 40 bp homology arms with fragments A and C for seamless splicing.
(2)片段的生成(2) Fragment generation
如上所述,这个步骤生成片段4、片段5、片段6、片段7、片段8。可以通过PCR来生成这些片段。As mentioned above, this step generates fragment 4, fragment 5, fragment 6, fragment 7, and fragment 8. These fragments can be generated by PCR.
通过常规方法生成(南京金斯瑞生物科技有限公司)包含片段4任一序列、片段5任一序列、片段6唯一序列、片段7唯一序列、片段8任一序列之一的质粒(共26种)作为模板。Generated by conventional methods (Nanjing GenScript Biotechnology Co., Ltd.) A plasmid containing any sequence of fragment 4, any sequence of fragment 5, unique sequence of fragment 6, unique sequence of fragment 7, and any sequence of fragment 8 (a total of 26 types) ) As a template.
通过常规方法生成(南京金斯瑞生物科技有限公司)用于PCR扩增片段4、片段5、片段6、片段7、片段8的引物,两侧引入同源臂。The primers used for PCR amplification of fragment 4, fragment 5, fragment 6, fragment 7, and fragment 8 were generated by conventional methods (Nanjing GenScript Biotechnology Co., Ltd.), and homology arms were introduced on both sides.
如图14所示,具体如下所述,生成试剂转移关系表。As shown in FIG. 14, the reagent transfer relationship table is generated as described below.
关于模板About templates
1、根据50条目标序列的片段4、5、6、7、8序列信息确认各个片段的种类数。例如,片段4的种类数为4,片段5的种类数为16,片段6的种类数为1,片段7的种类数为1,片段8的种类数为4。总共26种。片段的总种类数对应于模板的总种类数。1. Confirm the number of types of fragments based on the sequence information of fragments 4, 5, 6, 7, and 8 of the 50 target sequences. For example, the number of types of segment 4 is 4, the number of types of segment 5 is 16, the number of types of segment 6 is 1, the number of types of segment 7 is 1, and the number of types of segment 8 is 4. There are 26 kinds in total. The total number of types of fragments corresponds to the total number of types of templates.
2、给出上述26种模板的位置(试剂板号和孔位)。2. Give the positions of the above 26 templates (reagent plate numbers and well positions).
3、根据反应需要,统计每种模板需要的次数。3. Count the number of times required for each template according to the needs of the reaction.
4、根据步骤3的每种模板需要的次数得到每种模板需要的体积。4. According to the number of times required by each template in step 3, the volume required by each template is obtained.
5、建立模板位置与体积的对应关系。5. Establish the corresponding relationship between template position and volume.
关于引物About primers
6、从所有的引物中去除序列重复的引物。6. Remove sequence repeat primers from all primers.
7、给出剩余引物的位置(试剂板号和孔位)。7. Give the position of the remaining primers (reagent plate number and well position).
8、根据反应需要,统计每种引物需要的次数。8. According to the needs of the reaction, count the number of times required for each primer.
9、根据步骤8的每种引物需要的次数得到每种引物需要的体积。9. Obtain the required volume of each primer according to the number of times required for each primer in step 8.
10、建立引物位置与体积的对应关系。10. Establish the corresponding relationship between primer position and volume.
关于模板/引物位置与产物位置的对应关系About the correspondence between template/primer position and product position
11、给出每种PCR产物的位置(反应板号和孔位),并建立PCR产物的位置、产物名称、所需引物名称、所需模板名称的对应关系。11. Give the position of each PCR product (reaction plate number and well position), and establish the corresponding relationship between the position of the PCR product, the name of the product, the name of the required primer, and the name of the required template.
12、根据步骤11中的引物名称查找步骤7中相应引物的位置,建立PCR产物的位置与引物位置、体积的对应关系,得到引物的转移关系表。12. Find the position of the corresponding primer in step 7 according to the name of the primer in step 11, establish the corresponding relationship between the position of the PCR product, the position and volume of the primer, and obtain the transfer relationship table of the primer.
13、根据步骤11中的模板名称查找步骤2中相应模板的位置,建立PCR产物的位置与模板位置、体积的对应关系,得到模板的转移关系表。13. Find the position of the corresponding template in step 2 according to the template name in step 11, establish the corresponding relationship between the position of the PCR product, the position and volume of the template, and obtain the template transfer relationship table.
试剂分布图和试剂转移关系表如图15所示。The reagent distribution diagram and the reagent transfer relationship table are shown in Figure 15.
将试剂转移关系表上传Tecan工作站。由Tecan工作站按照试剂转移关系表自动进行引物和模板的转移,自试剂板转移至反应板。Upload the reagent transfer relationship table to the Tecan workstation. The Tecan workstation automatically transfers primers and templates according to the reagent transfer relationship table, from the reagent plate to the reaction plate.
配制反应工作液,分装至已有引物和模板的反应板孔中。分装可用八通道移液器手动分装或者采用高通量微量分装器进行自动分装。Prepare the reaction working solution and distribute it to the wells of the reaction plate with primers and templates. Dispensing can be manually dispensed with an eight-channel pipette or automatically dispensed with a high-throughput micro-dispenser.
基于KODFX DNA Polymerase(TOYOBO,KFX-101)的PCR体系:PCR system based on KODFX DNA Polymerase (TOYOBO, KFX-101):
Figure PCTCN2021083012-appb-000014
Figure PCTCN2021083012-appb-000014
PCR程序:PCR program:
Figure PCTCN2021083012-appb-000015
Figure PCTCN2021083012-appb-000015
Figure PCTCN2021083012-appb-000016
Figure PCTCN2021083012-appb-000016
回收的PCR产物的凝胶电泳照片如图16所示。The gel electrophoresis photograph of the recovered PCR product is shown in FIG. 16.
(3)片段的融合(3) Fragment fusion
如上所述,这个步骤将片段4与片段5融合成片段A,将片段6与片段7融合成片段B。可以通过PCR进行片段的融合。As mentioned above, in this step, fragment 4 and fragment 5 are fused into fragment A, and fragment 6 and fragment 7 are fused into fragment B. The fusion of the fragments can be performed by PCR.
如图17所示,具体如下所述,生成试剂转移关系表。As shown in FIG. 17, the reagent transfer relationship table is generated as described below.
关于模板About templates
1、根据上述PCR产物的回收位置,确定模板的位置(试剂板号和孔位)。1. Determine the position of the template (reagent plate number and well position) according to the recovery position of the above PCR product.
关于引物About primers
2、从所有的引物中去除序列重复的引物。2. Remove sequence repeat primers from all primers.
3、给出剩余引物的位置(试剂板号和孔位)。3. Give the position of the remaining primers (reagent plate number and well position).
4、根据反应需要,统计每种引物需要的次数。4. Count the number of times required for each primer according to the needs of the reaction.
5、根据步骤4的每种引物需要的次数得到每种引物需要的体积。5. Obtain the required volume of each primer according to the number of times required for each primer in step 4.
6、建立引物位置与体积的对应关系。6. Establish the corresponding relationship between primer position and volume.
关于模板/引物位置与产物位置的对应关系About the correspondence between template/primer position and product position
7、给出每种融合产物的位置,并建立融合产物(片段A或片段B)的位置、产物名称、所需引物名称、所需模板名称(片段4和5或者片段6和7)的对应关系。7. Give the position of each fusion product, and establish the corresponding position of the fusion product (fragment A or fragment B), product name, desired primer name, desired template name ( fragment 4 and 5 or fragment 6 and 7) relation.
8、根据步骤7中的引物名称查找步骤3中相应引物的位置,建立融合产物的位置与引物位置、体积的对应关系,得到引物的转移关系表。8. Find the position of the corresponding primer in step 3 according to the primer name in step 7, establish the corresponding relationship between the position of the fusion product, the position and volume of the primer, and obtain the transfer relationship table of the primer.
9、根据步骤7中的模板名称查找步骤1中相应模板的位置,建立融合产物的位置与模板位置、体积的对应关系,得到模板的转移关系表。9. Find the position of the corresponding template in step 1 according to the template name in step 7, establish the corresponding relationship between the position of the fusion product, the position and volume of the template, and obtain the transfer relationship table of the template.
试剂分布图和试剂转移关系表如图18所示。The reagent distribution diagram and the reagent transfer relationship table are shown in Figure 18.
将试剂转移关系表上传Tecan工作站。由Tecan工作站按照试剂转移关系表自动进行引物和模板的转移,自试剂板转移至反应板。Upload the reagent transfer relationship table to the Tecan workstation. The Tecan workstation automatically transfers primers and templates according to the reagent transfer relationship table, from the reagent plate to the reaction plate.
配制反应工作液,分装至已有引物和模板的反应板孔中。分装可用八通道移液器手动分装或者采用高通量微量分装器进行自动分装。Prepare the reaction working solution and distribute it to the wells of the reaction plate with primers and templates. Dispensing can be manually dispensed with an eight-channel pipette or automatically dispensed with a high-throughput micro-dispenser.
基于KOD-FX DNA Polymerase(TOYOBO,KFX-101)的PCR体系:PCR system based on KOD-FX DNA Polymerase (TOYOBO, KFX-101):
Figure PCTCN2021083012-appb-000017
Figure PCTCN2021083012-appb-000017
Figure PCTCN2021083012-appb-000018
Figure PCTCN2021083012-appb-000018
PCR程序:PCR program:
Figure PCTCN2021083012-appb-000019
Figure PCTCN2021083012-appb-000019
片段A预期长度为801-2000bp,片段B预期长度为1111bp。回收的PCR产物的凝胶电泳照片如图19所示。The expected length of fragment A is 801-2000 bp, and the expected length of fragment B is 1111 bp. The gel electrophoresis photograph of the recovered PCR product is shown in FIG. 19.
(4)最终拼接(4) Final splicing
使用GenBuilder Plus(南京金斯瑞生物科技有限公司,IM00712),将成套的A、B、C片段插入中间质粒。Use GenBuilder Plus (Nanjing GenScript Biotechnology Co., Ltd., IM00712) to insert the complete set of A, B, and C fragments into the intermediate plasmid.
为了进行最终的拼接,将所需要的片段A、片段B、片段C等从试剂板转移至反应板。如图20所示生成试剂转移关系表。试剂分布表和试剂转移关系表如图21所示。For the final splicing, the required fragment A, fragment B, fragment C, etc. are transferred from the reagent plate to the reaction plate. A reagent transfer relationship table is generated as shown in FIG. 20. The reagent distribution table and the reagent transfer relationship table are shown in FIG. 21.
将转移关系表上传Tecan工作站。由Tecan工作站按照转移关系表自动进行片段A、片段B、片段C的转移,从试剂板转移至反应板。Upload the transfer relationship table to the Tecan workstation. The Tecan workstation automatically transfers fragment A, fragment B, and fragment C according to the transfer relationship table, and transfers from the reagent plate to the reaction plate.
将质粒片段和GenBuilder Plus 2x Master Mix分装至已有片段A、片段B、片段C的反应板孔中。分装可用八通道移液器手动分装或者采用高通量微量分装器进行自动分装。Aliquot the plasmid fragments and GenBuilder Plus 2x Master Mix into the wells of the existing Fragment A, Fragment B, and Fragment C. Dispensing can be manually dispensed with an eight-channel pipette or automatically dispensed with a high-throughput micro-dispenser.
拼接反应体系:Splicing reaction system:
Figure PCTCN2021083012-appb-000020
Figure PCTCN2021083012-appb-000020
拼接反应条件:Splicing reaction conditions:
Figure PCTCN2021083012-appb-000021
Figure PCTCN2021083012-appb-000021
(5)检验(5) Inspection
按照常规方法,将拼接产物转化入大肠杆菌Top10感受态细胞(南京金斯瑞生物科技有限公司),进行抗生素(例如卡那霉素)平板筛选(结果未显示)。每个拼接反应挑取6个单菌落对插入物进行PCR检验(如图22所示);将PCR检验正确的菌落提交测序检验(数据未显示)。检验结果如表3所示。总之,拼接的一次成功率达到100%,一次正确率达到95%以上。According to a conventional method, the spliced product was transformed into E. coli Top10 competent cells (Nanjing GenScript Biotechnology Co., Ltd.), and screened with antibiotics (such as kanamycin) on a plate (results not shown). Pick 6 single colonies for each splicing reaction to perform PCR inspection on the inserts (as shown in Figure 22); the colonies that were correctly tested by PCR were submitted for sequencing inspection (data not shown). The test results are shown in Table 3. In short, the one-time success rate of splicing reaches 100%, and the one-time correct rate reaches more than 95%.
表3table 3
编号serial number PCR阳性率PCR positive rate 测序正确率Sequencing accuracy rate 编号serial number PCR阳性率PCR positive rate 测序正确率Sequencing accuracy rate
11 2/62/6 ++ 2626 4/64/6 ++
22 4/64/6 ++ 2727 3/63/6 ++
33 3/63/6 ++ 2828 3/63/6 ++
44 3/63/6 ++ 2929 3/63/6 ++
55 3/63/6 ++ 3030 5/65/6 ++
66 6/66/6 ++ 3131 5/65/6 ++
77 3/63/6 ++ 3232 6/66/6 ++
88 5/65/6 ++ 3333 1/61/6 ++
99 3/63/6 ++ 3434 3/63/6 ++
1010 6/66/6 ++ 3535 3/63/6 ++
1111 4/64/6 ++ 3636 6/66/6 ++
1212 3/63/6 ++ 3737 1/61/6 ++
1313 1/61/6 ++ 3838 6/66/6 ++
1414 3/63/6 ++ 3939 2/62/6 ++
1515 2/62/6 ++ 4040 5/65/6 ++
1616 2/62/6 ++ 4141 5/65/6 ++
1717 1/61/6 ++ 4242 6/66/6 ++
1818 2/62/6 ++ 4343 4/64/6 ++
1919 4/64/6 ++ 4444 6/66/6 ++
2020 5/65/6 ++ 4545 2/62/6 ++
21twenty one 6/66/6 ++ 4646 3/63/6 ++
22twenty two 6/66/6 ++ 4747 6/66/6 ++
23twenty three 2/62/6 ++ 4848 3/63/6 ++
24twenty four 6/66/6 ++ 4949 6/66/6 ++
2525 3/63/6 ++ 5050 3/63/6 ++
96孔板的孔编号如图23所示。The well numbers of the 96-well plate are shown in Figure 23.
序列sequence
Figure PCTCN2021083012-appb-000022
Figure PCTCN2021083012-appb-000022
Figure PCTCN2021083012-appb-000023
Figure PCTCN2021083012-appb-000023

Claims (13)

  1. 一种构建突变基因库的方法,其中突变基因库中的每一种突变基因相对于参照序列包含突变,所述方法包括:A method for constructing a mutant gene library, wherein each mutant gene in the mutant gene library contains a mutation relative to a reference sequence, the method comprising:
    (1)针对每一种突变基因将全长序列分割成一个或多个恒定序列片段和一个或多个可变序列片段,其中恒定序列片段包含与参照序列或其互补序列的对应区段相同的序列,可变序列片段包含与参照序列或其互补序列的对应区段相比的突变;(1) For each mutant gene, the full-length sequence is divided into one or more constant sequence fragments and one or more variable sequence fragments, where the constant sequence fragments contain the same segments as the reference sequence or its complementary sequence. Sequences, variable sequence fragments contain mutations compared to the corresponding segments of the reference sequence or its complementary sequence;
    (2)分别生成每一种突变基因的各个恒定序列片段和分别生成每一种突变基因的各个可变序列片段;(2) Generate each constant sequence fragment of each mutant gene and each variable sequence fragment of each mutant gene separately;
    (3)针对每一种突变基因配制包含成套的恒定序列片段和可变序列片段的反应体系;和(3) Formulating a reaction system containing a complete set of constant sequence fragments and variable sequence fragments for each mutant gene; and
    (4)针对每一种突变基因生成全长突变基因,任选地插入载体,(4) Generate a full-length mutant gene for each mutant gene, and optionally insert it into a vector,
    由此构建突变基因库,其中所述方法是在多隔室容器中批量进行的。Thus, a mutant gene library was constructed, wherein the method was carried out in batches in a multi-compartment container.
  2. 一种构建基因组合库的方法,其中基因组合库中的各种基因组合相对于彼此具有序列唯一的区段和序列多选的区段,所述方法包括:A method for constructing a gene combination library, wherein various gene combinations in the gene combination library have segments with unique sequences and segments with multiple selections of sequences relative to each other, and the method includes:
    (1)针对每一种基因组合将全长序列分割成一个或多个恒定序列片段和一个或多个可变序列片段,其中恒定序列片段对应于在基因组合库中具有唯一序列的区段,可变序列片段对应于在基因组合库中具有多选序列的区段;(1) For each gene combination, the full-length sequence is divided into one or more constant sequence fragments and one or more variable sequence fragments, where the constant sequence fragments correspond to the segments with unique sequences in the gene combination library, The variable sequence segment corresponds to a segment with multiple selection sequences in the gene combinatorial library;
    (2)生成各个恒定序列片段和分别生成每一种基因组合的各个可变序列片段;(2) Generate each constant sequence fragment and each variable sequence fragment for each gene combination;
    (3)针对每一种基因组合配制包含成套的恒定序列片段和可变序列片段的反应体系;和(3) Formulating a reaction system containing a complete set of constant sequence fragments and variable sequence fragments for each gene combination; and
    (4)针对每一种基因组合生成全长基因组合,任选地插入载体,(4) Generate a full-length gene combination for each gene combination, optionally insert a vector,
    由此构建基因组合库,其中所述方法是在多隔室容器中批量进行的。Thus, a gene combinatorial library is constructed, wherein the method is carried out in batches in a multi-compartment container.
  3. 一种自动配制批量反应体系的方法,其包括:A method for automatically preparing a batch reaction system, which includes:
    生成试剂转移关系表;和Generate a reagent transfer relationship table; and
    将试剂转移关系表上传自动移液装置,由自动移液装置按照试剂转移关系表自动将试剂自装有试剂的容器转移至用于进行反应的容器,Upload the reagent transfer relationship table to the automatic pipetting device, and the automatic pipetting device will automatically transfer the reagent from the container containing the reagent to the container for reaction according to the reagent transfer relationship table.
    其中试剂转移关系表是如下生成的:The reagent transfer relationship table is generated as follows:
    (1)列出要进行的批量反应,列出每一个反应需要的试剂及其体积;(1) List the batch reactions to be carried out, list the reagents and their volumes required for each reaction;
    (2)确定每一个反应就反应容器而言的位置;(2) Determine the position of each reaction in terms of the reaction vessel;
    (3)确定每一个反应需要的每一种试剂就试剂容器而言的位置;(3) Determine the position of each reagent required for each reaction in terms of the reagent container;
    (4)确定每一个反应需要的每一种试剂的转移起点和终点,(4) Determine the transfer starting point and end point of each reagent required for each reaction,
    其中所述方法是在多隔室容器中进行的。The method is carried out in a multi-compartment container.
  4. 一种构建突变基因库的方法,其中突变基因库中的每一种突变基因相对于参照序列包含突变,所述方法包括:A method for constructing a mutant gene library, wherein each mutant gene in the mutant gene library contains a mutation relative to a reference sequence, the method comprising:
    (1)针对每一种突变基因将全长序列分割成一个或多个恒定序列片段和一个或多个可变序列片段,其中恒定序列片段包含与参照序列或其互补序列的对应区段相同的序列,可变序列片段包含与参照序列或其互补序列的对应区段相比的突变;(1) For each mutant gene, the full-length sequence is divided into one or more constant sequence fragments and one or more variable sequence fragments, where the constant sequence fragments contain the same segments as the reference sequence or its complementary sequence. Sequences, variable sequence fragments contain mutations compared to the corresponding segments of the reference sequence or its complementary sequence;
    (2)分别生成每一种突变基因的各个恒定序列片段和分别生成每一种突变基因的各个可变序列片段;(2) Generate each constant sequence fragment of each mutant gene and each variable sequence fragment of each mutant gene separately;
    (3)针对每一种突变基因通过权利要求3的方法配制包含成套的恒定序列片段和可变序列片段的反应体系;和(3) Formulating a reaction system containing a set of constant sequence fragments and variable sequence fragments by the method of claim 3 for each mutant gene; and
    (4)针对每一种突变基因生成全长突变基因,任选地插入载体,(4) Generate a full-length mutant gene for each mutant gene, and optionally insert it into a vector,
    由此构建突变基因库。Thus, a mutant gene library was constructed.
  5. 一种构建基因组合库的方法,其中基因组合库中的各种基因组合相对于彼此具有序列唯一的区段和序列多选的区段,所述方法包括:A method for constructing a gene combination library, wherein various gene combinations in the gene combination library have segments with unique sequences and segments with multiple selections of sequences relative to each other, and the method includes:
    (1)针对每一种基因组合将全长序列分割成一个或多个恒定序列片段和一个或多个可变序列片段,其中恒定序列片段对应于在基因组合库中具有唯一序列的区段,可变序列片段对应于在基因组合库中具有多选序列的区段;(1) For each gene combination, the full-length sequence is divided into one or more constant sequence fragments and one or more variable sequence fragments, where the constant sequence fragments correspond to the segments with unique sequences in the gene combination library, The variable sequence segment corresponds to a segment with multiple selection sequences in the gene combinatorial library;
    (2)生成各个恒定序列片段和分别生成每一种基因组合的各个可变序列片段;(2) Generate each constant sequence fragment and each variable sequence fragment for each gene combination;
    (3)针对每一种基因组合通过权利要求3的方法配制包含成套的恒定序列片段和可变序列片段的反应体系;和(3) Formulating a reaction system comprising a set of constant sequence fragments and variable sequence fragments by the method of claim 3 for each gene combination; and
    (4)针对每一种基因组合生成全长基因组合,任选地插入载体,(4) Generate a full-length gene combination for each gene combination, optionally insert a vector,
    由此构建基因组合库。Thus, a gene combinatorial library was constructed.
  6. 权利要求1-5任一项的方法,其中所述多隔室容器是多孔板,例如96孔板或384孔板。The method of any one of claims 1-5, wherein the multi-compartment container is a multi-well plate, such as a 96-well plate or a 384-well plate.
  7. 权利要求1-6任一项的方法,其中多个反应公用的试剂是合并在试剂板上的一个或多个孔中的。The method according to any one of claims 1 to 6, wherein the reagents common to a plurality of reactions are combined in one or more wells on the reagent plate.
  8. 权利要求1-6任一项的方法,其中多个反应公用的试剂和/或所有反应公用的试剂是手工添加的。The method according to any one of claims 1 to 6, wherein the reagents common to multiple reactions and/or the reagents common to all reactions are added manually.
  9. 权利要求8的方法,其中手工添加是使用多道移液器进行的。The method of claim 8, wherein the manual addition is performed using a multichannel pipette.
  10. 权利要求1-9任一项的方法,其中一次进行至少10个反应、至少25个反应、至少50个反应、至少100个反应、至少250个反应、至少500个反应、至少750个反应、至少1000个反应、至少1250个反应、至少1500个反应、至少1750个反应、至少2000个反应、至少2250个反应、至少2500个反应、或至少2750个反应。The method of any one of claims 1-9, wherein at least 10 reactions, at least 25 reactions, at least 50 reactions, at least 100 reactions, at least 250 reactions, at least 500 reactions, at least 750 reactions, at least 1000 reactions, at least 1250 reactions, at least 1500 reactions, at least 1750 reactions, at least 2000 reactions, at least 2250 reactions, at least 2500 reactions, or at least 2750 reactions.
  11. 权利要求1、2、4和5任一项的方法,其中步骤(4)利用无缝拼接技术。The method of any one of claims 1, 2, 4, and 5, wherein step (4) uses a seamless splicing technique.
  12. 权利要求11的方法,其中所述无缝拼接技术选自Golden-gate法、Gibson法或Genbuilder法。The method of claim 11, wherein the seamless splicing technique is selected from the group consisting of Golden-gate method, Gibson method or Genbuilder method.
  13. 权利要求1或4的方法,其中所述突变基因库包含所述参照序列。The method of claim 1 or 4, wherein the mutant gene library comprises the reference sequence.
PCT/CN2021/083012 2020-03-25 2021-03-25 Automated gene assembly system and method WO2021190600A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010219277.9 2020-03-25
CN202010219277 2020-03-25

Publications (1)

Publication Number Publication Date
WO2021190600A1 true WO2021190600A1 (en) 2021-09-30

Family

ID=77891568

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/083012 WO2021190600A1 (en) 2020-03-25 2021-03-25 Automated gene assembly system and method

Country Status (1)

Country Link
WO (1) WO2021190600A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116962685A (en) * 2023-09-21 2023-10-27 杭州爱芯元智科技有限公司 Video encoding method, video encoding device, electronic equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104593354A (en) * 2015-01-14 2015-05-06 江南大学 Method for rapid directed evolution of DNA on basis of in vitro combined assembly
CN104789556A (en) * 2015-04-29 2015-07-22 江南大学 Method of carrying out rapid and efficient DNA combination and evolution based on synthesis of single-chain DNA library
CN104805508A (en) * 2015-04-29 2015-07-29 江南大学 Method for evolving metabolic pathways based on synthetic single-stranded DNA library
CN109750032A (en) * 2019-02-28 2019-05-14 尚科生物医药(上海)有限公司 A method of building gene multipoint mutation and evolution

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104593354A (en) * 2015-01-14 2015-05-06 江南大学 Method for rapid directed evolution of DNA on basis of in vitro combined assembly
CN104789556A (en) * 2015-04-29 2015-07-22 江南大学 Method of carrying out rapid and efficient DNA combination and evolution based on synthesis of single-chain DNA library
CN104805508A (en) * 2015-04-29 2015-07-29 江南大学 Method for evolving metabolic pathways based on synthetic single-stranded DNA library
CN109750032A (en) * 2019-02-28 2019-05-14 尚科生物医药(上海)有限公司 A method of building gene multipoint mutation and evolution

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
FANG, BAISHAN : "In Vitro Directed Evolution of Enzymes (I) Mutant Gene Library Construction Technology and Its New Development", JOURNAL OF HUAQIAO UNIVERSITY (NATURAL SCIENCE), vol. 25, no. 4, 20 October 2004 (2004-10-20), CN, pages 337 - 342, XP009531046, ISSN: 1000-5013 *
HEYDENREICH FRANZISKA M., MILJUŠ TAMARA, JAUSSI ROLF, BENOIT ROGER, MILIĆ DALIBOR, VEPRINTSEV DMITRY B.: "High-throughput mutagenesis using a two-fragment PCR approach", SCIENTIFIC REPORTS, vol. 7, no. 1, 1 December 2017 (2017-12-01), XP055853145, DOI: 10.1038/s41598-017-07010-4 *
MA, YUCHENG ET AL.: "Advances in the Construction of Mutant Gene Libraries for Directed Evolutionary Research", SCIENCE & TECHNOLOGY INFORMATION, no. 10, 31 December 2012 (2012-12-31), pages 98 - 99, XP055853131, ISSN: 1001-9960 *
PÜLLMANN PASCAL, ULPINNIS CHRIS, MARILLONNET SYLVESTRE, GRUETZNER RAMONA, NEUMANN STEFFEN, WEISSENBORN MARTIN J.: "Golden Mutagenesis: An efficient multi-site-saturation mutagenesis approach by Golden Gate cloning with automated primer design", SCIENTIFIC REPORTS, vol. 9, no. 1, 29 July 2019 (2019-07-29), US, pages 1 - 11, XP055821396, ISSN: 2045-2322, DOI: 10.1038/s41598-019-47376-1 *
ZENG FANLI, ZHANG SUHUA, HAO ZHIMIN, DUAN SHIXIN, MENG YANAN, LI PAN, DONG JINGAO, LIN YIBIN: "Efficient strategy for introducing large and multiple changes in plasmid DNA", SCIENTIFIC REPORTS, vol. 8, no. 1, 1 December 2018 (2018-12-01), XP055853136, DOI: 10.1038/s41598-018-20169-8 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116962685A (en) * 2023-09-21 2023-10-27 杭州爱芯元智科技有限公司 Video encoding method, video encoding device, electronic equipment and storage medium
CN116962685B (en) * 2023-09-21 2024-01-30 杭州爱芯元智科技有限公司 Video encoding method, video encoding device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
US11530446B2 (en) Methods and compositions for DNA profiling
Rohland et al. Cost-effective, high-throughput DNA sequencing libraries for multiplexed target capture
Taylor et al. Sequence verification as quality-control step for production of cDNA microarrays
CN105734048A (en) PCR-free sequencing library preparation method for genome DNA
CN107236729A (en) The method and kit of a kind of rapid build target nucleic acid sequencing library that enrichment is captured based on probe
CN108251504A (en) A kind of method and kit of supper-fast structure genomic DNA sequencing library
US20230151421A1 (en) Method for determining cell clonality
CN102409049A (en) DNA(deoxyribonucleic acid) index library building method based on PCR (polymerase chain reaction)
WO2023284768A1 (en) Fusion primer direct amplification method-based human mitochondrial whole genome high-throughput sequencing kit
CN108517567B (en) Adaptor, primer group, kit and library construction method for cfDNA library construction
WO2021190600A1 (en) Automated gene assembly system and method
CN104532360A (en) Whole-genome methylation sequencing library and construction method thereof
CN102409043B (en) Method for constructing high-flux and low-cost Fosmid library, label and label joint used in method
WO2020118596A1 (en) Tag sequence detection method
CN110656157A (en) Quality control product for tracing high-throughput sequencing sample and design and use method thereof
Robinson et al. Multifragment DNA assembly of biochemical pathways via automated ligase cycling reaction
CN106795650B (en) PF quick database building method and application thereof
CN105002570B (en) A kind of method for once preparing multiple double end sequencing libraries of DNA large fragments insertion
Farias-Hesson et al. Semi-automated library preparation for high-throughput DNA sequencing platforms
CN108932401B (en) Identification method of sequencing sample and application thereof
CN108866155B (en) Preparation method of next generation sequencing library
CN116622836A (en) Primer probe combination for detecting beta-thalassemia gene mutation and application thereof
CN107002150B (en) High-throughput detection method for DNA synthesis product
CN112251491B (en) cDNA library construction method of capillary 96-well plate
Sternlieb et al. Efficient Generation of Genome-wide Libraries for Protein–ligand Screens Using Gibson Assembly

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21776580

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21776580

Country of ref document: EP

Kind code of ref document: A1