WO2023093007A1 - Site for stably expressing protein in cho cell gene nw_003614889.1, and use thereof - Google Patents

Site for stably expressing protein in cho cell gene nw_003614889.1, and use thereof Download PDF

Info

Publication number
WO2023093007A1
WO2023093007A1 PCT/CN2022/099466 CN2022099466W WO2023093007A1 WO 2023093007 A1 WO2023093007 A1 WO 2023093007A1 CN 2022099466 W CN2022099466 W CN 2022099466W WO 2023093007 A1 WO2023093007 A1 WO 2023093007A1
Authority
WO
WIPO (PCT)
Prior art keywords
protein
site
gene
cho
expression
Prior art date
Application number
PCT/CN2022/099466
Other languages
French (fr)
Chinese (zh)
Inventor
陈蕴
金坚
瞿丽丽
丁学峰
李华钟
蔡燕飞
朱景宇
杨兆琪
鲁晨
戴云峰
Original Assignee
江南大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 江南大学 filed Critical 江南大学
Publication of WO2023093007A1 publication Critical patent/WO2023093007A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N5/00Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
    • C12N5/06Animal cells or tissues; Human cells or tissues
    • C12N5/0602Vertebrate cells
    • C12N5/0681Cells of the genital tract; Non-germinal cells from gonads
    • C12N5/0682Cells of the female genital tract, e.g. endometrium; Non-germinal cells from ovaries, e.g. ovarian follicle cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2510/00Genetically modified cells
    • C12N2510/02Cells for production
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/10Plasmid DNA
    • C12N2800/106Plasmid DNA for vertebrates
    • C12N2800/107Plasmid DNA for vertebrates for mammalian
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/60Vectors containing traps for, e.g. exons, promoters

Definitions

  • the invention relates to a site for stably expressing protein in CHO cell gene NW_003614889.1 and an application thereof, belonging to the field of gene technology.
  • CHO cells Chinese hamster ovary (CHO) cells were established in Dr. Theodore T. Puck's laboratory in 1957. They are immortalized non-secreting cells that rarely secrete endogenous proteins; Post-translational modifications closer to human natural proteins, less susceptible to human virus infection, and large-scale culture in serum-free medium with defined chemical components, etc., are widely used in the field of biopharmaceuticals, and more than 70% of proteins are produced drug.
  • mammalian cells CHO cells have a long culture cycle and high culture costs.
  • the demand for recombinant products such as monoclonal antibodies continues to increase. The continuous growth of demand means that the specific productivity still needs to be optimized.
  • the expression level can be increased by increasing the copy number of the gene, developing a new strong promoter, finding a suitable enhancer, etc.
  • the expression level of most CHO cells is unstable during the long-term culture process, which directly affects the supervision of the product by the regulatory authorities. Approval and listing issues. Therefore, it is very important to construct a CHO expression strain that stably and highly expresses the target gene for the development and marketing of protein drugs.
  • Random integration is the most mature traditional method for constructing protein expression systems, but obtaining stable high-expression cell lines through random integration requires multiple screenings, which is time-consuming and expensive.
  • the loss of expression stability of the cells in the later stage of culture is completely unpredictable: this instability may not occur at all; it may also gradually appear after numerous cell divisions; It is also possible that apparent instability occurs after only a few generations of cell division.
  • the stability issue will not only affect the time to market of the product, but will also conflict with drug regulatory management.
  • the information of random integration sites is not clear, and the site effect of exogenous gene integration will also lead to a significant decrease in the expression level of the target gene. According to existing literature reports, this instability of recombinant CHO cell lines occurs in all recombinant CHO cell lines, making the problem of unstable expression an extremely common problem.
  • the present invention provides a site for stably expressing proteins in the genome of CHO cells.
  • the screening process and time in the cell construction process reduce the cost of research and development.
  • the first object of the present invention is to provide a stable protein expression site in the CHO cell genome, and the stable protein expression site is located in the 103331-103531 base of the CHO cell gene NW_003614889.1.
  • nucleotide sequence of bases 103331-103531 of the CHO cell gene NW_003614889.1 is shown in SEQ ID NO.1.
  • the site for stably expressing the protein can be recognized by the 5'NNNNNNNNNNNNNNNNNNNGG 3' sequence of the CRISPR/Cas9 technology.
  • the 5'NNNNNNNNNNNNNNNNNNNNNNNNNNNNNGG 3' sequence in the embodiment of the present invention selects the following 9 sequences: 5'-ATCTTACTTTGCAACTACCAAGG-3', 5'-CACTTTTATCTAACACTGGCCAGG-3', 5'-AGCAACACTTTTATCTAACACTGG-3', 5'-GCACTCTGTGTGGAGCAAAGAGG-3', 5'-GCAAATGAGTAAAGTTCTCCTGG-3', 5'-TGCAGTGATAGCACTCTGTGTGG-3', 5'-AACAGGATGAAAATGTATGGAGG-3', 5'-TTCATCCTGTTCTCTGTTCTGGG-3', 5'-TTTCATCCTGTTCTCTGTTCTGG- 3'.
  • the above-mentioned 9 groups of sequences are distributed within the 103331-103531 bases of the CHO cell gene NW_003614889.1 and cover most of the upper, middle and downstream sequences within the 200 bases of the present invention, indicating that the 200 bases of the present invention All bases can be used as sites for stably expressing proteins.
  • the present invention does not limit the above nine groups of sequences.
  • the above nine groups of sequences are only used as the preferred technical means to introduce the coding gene of the target protein into the stable expression site.
  • the purpose of stably expressing the protein of the present invention can be achieved.
  • the protein is a protein with a molecular weight less than 160KDa.
  • the protein is one of polypeptides, functional proteins, antibodies, and fusion proteins.
  • the second object of the present invention is to provide the application of the stable protein expression sites in the CHO cell genome in the stable expression of foreign proteins in CHO cells.
  • the application specifically involves constructing a gene encoding an exogenous protein or polypeptide at the site of stably expressing the protein in the CHO cell gene NW_003614889.1.
  • the third object of the present invention is to provide an expression vector for expressing protein in CHO cells, the coding gene of the protein is located between the 5' homology arm and the 3' homology arm of the expression vector region, the 5' homology arm and the 3' homology arm are sequences with a length of 600 bp upstream and downstream of the stable protein expression site respectively.
  • the expression vector is a vector suitable for expression in CHO cells.
  • the expression vector also includes a promoter sequence located upstream of the gene encoding the protein, and the promoter controls the expression of the protein.
  • the promoters are: CMV (strong mammalian expression promoter derived from human cytomegalovirus), EF-1a (strong mammalian expression promoter derived from human elongation factor 1 ⁇ ), SV40 (simian vacuolar virus 40 derived mammalian expression promoter), PGK1 (mammalian promoter derived from phosphoglycerate kinase gene), UBC (mammalian promoter derived from human ubiquitin C gene), humanbeta actin ( ⁇ -actin gene derived mammalian promoter), CAG (strong hybrid mammalian promoter), etc.
  • an expression vector for expressing a protein in CHO cells comprising the following steps: inserting the gene encoding the protein into the region between the 5'arm and the 3'arm of the plasmid, making the protein encoding The gene is located downstream of the promoter and is regulated by the promoter to obtain the expression vector for expressing the protein in CHO cells.
  • the fourth object of the present invention is to provide a CHO recombinant cell with site-specific integration and stable expression of protein.
  • the CHO recombinant cell is obtained by using the expression vector for expressing protein in CHO cells and the sgRNA corresponding to the target sequence. Plasmids and Cas9 plasmids were transformed into CHO cells.
  • the target sequence is preferably 5'-ATCTTACTTTGCAACTACCAAGG-3', 5'-CACTTTTATCTAACACTGGCCAGG-3', 5'-AGCAACACTTTTATCTAACACTGG-3', 5'-GCACTCTGTGTGGAGCAAAGAGG-3', 5'-GCAAATGAGTAAAGTTCTCCTGG-3', 5'-TGCAGTGATAGCACTCTGTGTGG-3', 5'-AACAGGATGAAAATGTATGGAGG-3', 5'-TTCATCCTGTTCTCTGTTCTGGG-3' or 5'-TTTCATCCTGTTCTCTGTTCTGG-3'.
  • a method for constructing CHO recombinant cells comprising the steps of:
  • the plasmids are respectively the expression vector for expressing protein in CHO cells, the sgRNA plasmid and the Cas9 plasmid corresponding to the target sequence;
  • the stable expression site obtained by the present invention is located within 100 bp upstream and downstream of base 103431 of CHO cell gene NW_003614889.1, that is, base 103331-103531, and can integrate foreign protein genes and perform stable expression.
  • the present invention integrates the target gene into the stable expression region through fixed-point integration, which solves the problem of unclear integration sites caused by random integration; the present invention uses base 103431 of the stable expression site NW_003614889.1 in the CHO genome
  • the site-specific integration of exogenous genes within the range of 103331-103531 bases upstream and downstream of the base overcomes the expression instability caused by the position effect and the repeated and tedious cell line screening process, reducing the original screening time of 6-12 months to 1-3 months, effectively shortening the R&D time for constructing stable expression cell lines and reducing costs.
  • Fig. 1 is a schematic diagram of fixed-point integration of the present invention
  • Figure 2 shows the expression of EGFP at different passages in cells constructed with different target sequences.
  • Figure 3 shows the expression of anti-EGFR at different passages in cells constructed with different target sequences.
  • the method for measuring the average fluorescence intensity of cells culture the cells until the confluence reaches about 90%, digest the cells with 0.25% trypsin, and terminate the digestion with a complete medium equal to the amount of trypsin, and collect the cells in a sterile centrifuge tube at 1000rpm/ Centrifuge for 5 min, discard the supernatant and resuspend with PBS, pass through a cell strainer and collect in a flow loading tube, and use a blank CHO-K1 cell as a control to analyze the fluorescence intensity of the cells with a flow cytometer.
  • the CHO-K1-2d9 cells expressing the Zsgreen1 reporter gene screened by high-throughput flow cytometry were cultured in an adherent culture until they were in good condition, and the CHO cells at this time were regarded as the 0th generation, and then cultured continuously for 20 days.
  • the expression of Zsgreen1 protein in cells at passages 0, 10, and 20 was observed under an inverted fluorescence microscope, and the average fluorescence intensity of cells at passages 0, 10, and 20 was detected by BD flow cytometry.
  • CHO-K1-2d9 cells could still express 100% Zsgreen1 protein in the adherent culture state, and Zsgreen1 protein between different passages
  • the expression levels are basically the same, and there is a strong green fluorescent signal.
  • the CHO-K1-2d9 cells that passed the verification of anchorage stability were subjected to suspension acclimation, and the CHO-K1-2d9 cells that were successfully acclimatized in suspension were continuously passaged for 60 passages, and the success of acclimation in suspension was regarded as the 0th passage.
  • the expression of Zsgreen1 protein in the cells at passages 0, 10, 20, 30, 40, 50, and 60 was observed under an inverted fluorescence microscope, and at the same time, the cells at passages 0, 10, 20, 30, 40, and 50 were detected by flow cytometry. Mean fluorescence intensity of cells at passage 60.
  • CHO-K1-2d9 cells could still express 100% Zsgreen1 protein in suspension culture state, and Zsgreen1 protein expression between different passages The levels are basically the same, and there is a strong green fluorescent signal. It shows that CHO-K1-2d9 cells can stably express the Zsgreen1 reporter gene, and at the same time, it shows that the integrated site of the lentivirus carrying the Zsgreen1 reporter gene is a stable expression site.
  • Collect CHO-K1-2d9 cells use the DNA extraction kit to extract the genome, use DraI, SspI, HpaI three restriction endonucleases to digest the genome at 37°C for 16-18h, and the digestion system is as follows:
  • the digested product was purified and recovered using a PCR purification kit, and the chromosome walking adapter GenomeWalker Adaptor was connected to both ends of the purified digested fragment.
  • the connection system was as follows:
  • a round of PCR reaction system is as follows:
  • the second round of PCR reaction system is as follows:
  • the second round of PCR products were subjected to agarose gel electrophoresis and gel recovery sequencing, and the sequencing was performed according to the kit Lenti-X Integration Site Analysis Kit (Clontech: 631263).
  • the lentivirus integration site information can be obtained by comparing the sequencing results with the CHO cell genome on NCBI.
  • the lentivirus integration site in CHO-K1-2d9 cells is located at base 103431 of the CHO cell genome NW_003614889.1.
  • Sequences in which the predicted editing efficiency was higher than 0.56 were selected as target sequences.
  • Target sequence 5'-ATCTTACTTTGCAACTACCAAGG-3'(SEQ ID NO.2), score 0.65
  • Target sequence 5'-CACTTTATCTAACACTGGCCAGG-3'(SEQ ID NO.3), score 0.62
  • Target sequence 5'-AGCAACACTTTTATCTAACACTGG-3'(SEQ ID NO.4), score 0.59
  • Target sequence 5'-GCACTCTGTGTGGAGCAAAGAGG-3'(SEQ ID NO.5), score 0.83
  • Target sequence 5'-GCAAATGAGTAAAGTTCTCCTGG-3'(SEQ ID NO.6), score 0.57
  • Target sequence 5'-TGCAGTGATAGCACTCTGTGTGG-3'(SEQ ID NO.7), score 0.68
  • Target sequence 5'-AACAGGATGAAAATGTATGGAGG-3'(SEQ ID NO.8), score 0.77
  • Target sequence 5'-TTCATCCTGTTCTCTGTTCTGGG-3'(SEQ ID NO.9), score 0.66
  • Target sequence 5'-TTTCATCCTGTTCTCTGTTCTGG-3'(SEQ ID NO.10), score 0.69
  • CRISPR/Cas9-mediated genome editing technology Using CRISPR/Cas9-mediated genome editing technology and homologous recombination, the green fluorescent protein gene (EGFP, 26.7KDa) will be integrated at the target site.
  • CRISPR/Cas9-mediated homologous recombination technology needs to construct sgRNA plasmid and Donor Plasmid, the construction process is as follows:
  • sgRNA-R1 5'TAAAACCCTTGGTAGTTGCAAAGTAAGAT 3'(SEQ ID NO.12)
  • sgRNA-R2 5'TAAAACCCTGGCCAGTGTTAGATAAAGTG 3'(SEQ ID NO.14)
  • sgRNA-R4 5'TAAAACCCCTCTTTGCTCCACACAGAGTGC 3'(SEQ ID NO.18)
  • sgRNA-R5 5'TAAAACCCAGGAGAACTTTACTCATTTGC 3'(SEQ ID NO.20)
  • sgRNA-R6 5'TAAAACCCACACAGAGTGCTATCACTGCA 3'(SEQ ID NO.22)
  • sgRNA-R8 5'TAAAACCCCAGAACAGAGAACAGGATGAA 3'(SEQ ID NO.26)
  • the PSK-u6-gRNA plasmid is digested with BBsI enzyme, and the vector after digestion is gel recovered;
  • Donor plasma The information of Donor plasma is shown in Figure 1, which was obtained by transforming the existing plasmid vector expressing EGFP. 5'arm and 3'arm are the upstream and downstream homology arms of the target sites recognized by each pair of sgRNAs, with a length of 600bp, and GOI is the integrated target gene.
  • the EGFP sequence of the target gene is carried by the original plasmid.
  • CRISPR/Cas9-mediated genome site-directed editing technology Using CRISPR/Cas9-mediated genome site-directed editing technology and homologous homology, the humanized antibody gene (anti-EGFR, 150KDa) expressing epidermal growth factor receptor was site-directedly integrated at the target site.
  • CRISPR/Cas9-mediated homologous recombination technology needs to construct sgRNA plasmid and Donor Plasmid, the construction process is as follows:
  • sgRNA-R1 5'TAAAACCCTTGGTAGTTGCAAAGTAAGAT 3'
  • sgRNA-R2 5'TAAAACCCTGGCCAGTGTTAGATAAAGTG 3'
  • sgRNA-R3 5'TAAAACCCAGTGTTAGATAAAGTGTTGCT 3'
  • sgRNA-R4 5'TAAAACCCCTCTTTGCTCCACACAGAGTGC 3'
  • sgRNA-R5 5'TAAAACCCAGGAGAACTTTACTCATTTGC 3'
  • sgRNA-R6 5'TAAAACCCACACAGAGTGCTATCACTGCA 3'
  • the PSK-u6-gRNA plasmid is digested with BBsI enzyme, and the vector after digestion is gel recovered;
  • Donor plasma information is shown in Figure 1. 5'arm and 3'arm are the upstream and downstream homology arms of the target site, respectively, with a length of 600bp, and GOI is the integrated target gene.
  • Detection method the cell lines obtained in Example 4 were continuously passaged for 60 passages, and the cells were collected for 15 passages per passage, and the cell fluorescence was detected with a flow cytometer and the intensity was detected, as shown in Figure 2.
  • the fluctuation range of green fluorescence intensity between the 0th passage and the 60th passage was not more than 30%.
  • Example 5 The cell line obtained in Example 5 was continuously passaged for 60 generations under serum-free culture conditions, and the cell fermentation supernatant of this generation was collected every 15 generations, and the anti-EGFR content in the fermentation broth was detected by ELISA kit , the analysis of the detection results showed that the cells constructed according to different target sequences in Example 5 had a stable ability to express anti-EGFR at different passages, as shown in FIG. 3 .
  • the 9 sets of target sequences screened in Example 3 of the present invention cover most of the upper, middle and downstream sequences within the 200 bp base of the present invention, and the 103331-103531 base range in the CHO cell gene NW_003614889.1 of the present invention is all A stable expression cell line with site-specific integration can be successfully constructed, and all of them can stably express the target protein.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Biomedical Technology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Medicinal Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Toxicology (AREA)
  • Reproductive Health (AREA)
  • Cell Biology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Provided are a site for stably expressing a protein in the CHO cell gene NW_003614889.1, and the use thereof, wherein the obtained site for stable expression is located within a range of 100 bp upstream and downstream of the 103431st base of the CHO cell gene NW_003614889.1, that is, an exogenous protein gene can be integrated at the 103331st-103531st bases and stably expressed. In the present invention, a target gene is integrated to the above-mentioned region for stable expression by means of site-specific integration, which solves the problem of unclear integration sites caused by random integration; and by means of the site-specific integration of an exogenous gene at the site for stable expression in a CHO genome which is located within a range of the 103331st-103531st bases upstream and downstream of the 103431st base of gene NW_003614889.1, the problems of expression instability caused by the position effect and a repetitive and tedious cell strain screening process are overcome.

Description

一种CHO细胞基因NW_003614889.1内稳定表达蛋白质的位点及其应用A site for stably expressing protein in CHO cell gene NW_003614889.1 and its application 技术领域technical field
本发明涉及一种CHO细胞基因NW_003614889.1内稳定表达蛋白质的位点及其应用,属于基因技术领域。The invention relates to a site for stably expressing protein in CHO cell gene NW_003614889.1 and an application thereof, belonging to the field of gene technology.
背景技术Background technique
中国仓鼠卵巢(Chinese hamster ovary,CHO)细胞是1957年在Dr.Theodore T.Puck的实验室中建立的,是一种永生化的非分泌型细胞,很少分泌内源蛋白;同时因其具有与人类天然蛋白更接近的翻译后修饰、不易受人类病毒感染、能在化学成分明确的无血清培养基中大规模培养等优点,被广泛应用于生物制药领域,生产了超过70%的蛋白类药物。然而CHO细胞作为哺乳动物细胞,培养周期长,培养成本高,而同时单克隆抗体等重组产品的需求不断增加,持续增长的需求意味着比生产率仍需要优化。虽然通过增加基因的拷贝数、发展新的强启动子、寻找合适的增强子等方式可以提高表达量,但多数CHO细胞在长期培养过程中的表达水平不稳定,直接影响到监管部门对产品的审批与上市问题。因此构建稳定且高表达目的基因的CHO表达株对于蛋白类药物的研发与上市非常重要。Chinese hamster ovary (CHO) cells were established in Dr. Theodore T. Puck's laboratory in 1957. They are immortalized non-secreting cells that rarely secrete endogenous proteins; Post-translational modifications closer to human natural proteins, less susceptible to human virus infection, and large-scale culture in serum-free medium with defined chemical components, etc., are widely used in the field of biopharmaceuticals, and more than 70% of proteins are produced drug. However, as mammalian cells, CHO cells have a long culture cycle and high culture costs. At the same time, the demand for recombinant products such as monoclonal antibodies continues to increase. The continuous growth of demand means that the specific productivity still needs to be optimized. Although the expression level can be increased by increasing the copy number of the gene, developing a new strong promoter, finding a suitable enhancer, etc., the expression level of most CHO cells is unstable during the long-term culture process, which directly affects the supervision of the product by the regulatory authorities. Approval and listing issues. Therefore, it is very important to construct a CHO expression strain that stably and highly expresses the target gene for the development and marketing of protein drugs.
目前有两种构建稳定且高表达细胞株的策略。一种是传统的随机整合与高通量筛选相结合的方法,另一种是利用基因编辑技术与同源定向修复相结合的方法将目的基因定点整合到预定的染色体位点。但是,随机整合由于整合位点的不确定性,以及位置效应的存在,构建的细胞基因型差异巨大,且在长期传代的过程中容易发生表达不稳定的问题,导致后期的筛选过程非常漫长,在工业环境中,通过随机整合方式产生重组细胞系的整个过程通常需要6-12个月;与随机整合相比,定点整合利用基因编辑技术,尤其是近年来应用广泛的CRISPR/Cas9介导的基因组定点编辑技术,大大减少了研发时间和成本,且由于序列已知,定点整合后的信息比随机整合后的信息更加清晰明确。There are currently two strategies for constructing stable and high-expressing cell lines. One is the combination of traditional random integration and high-throughput screening, and the other is the combination of gene editing technology and homology-directed repair to integrate the target gene into a predetermined chromosomal site. However, due to the uncertainty of the integration site and the existence of position effects in random integration, the genotypes of the constructed cells are very different, and expression instability is prone to occur in the process of long-term passage, resulting in a very long screening process in the later stage. In an industrial setting, the entire process of generating recombinant cell lines by random integration usually takes 6-12 months; compared with random integration, site-directed integration utilizes gene editing technology, especially CRISPR/Cas9-mediated, which has been widely used in recent years Genome fixed-point editing technology greatly reduces the time and cost of research and development, and because the sequence is known, the information after fixed-point integration is clearer than that after random integration.
随机整合是构建蛋白表达系统的最为成熟的传统方法,但通过随机整合方式获得稳定高表达细胞株需要经过多重筛选,耗时长,成本高。同时,对于随机整合获得的细胞系,在培养后期,对于细胞丧失表达稳定性完全无法预测:这种不稳定性可能完全不会发生;也可能在细胞出现了无数次的分裂之后,逐步显现;也可能在细胞仅仅分裂几代后,就出现了明显 的不稳定性。而稳定性问题不仅仅只是会影响到产品上市时间,更会与药品法规管理产生一定冲突。另外随机整合位点的信息不明确,外源基因整合的位点效应也会导致目的基因的表达水平显著下降。根据已有的文献报道,这种重组CHO细胞系的不稳定性出现在了所有的重组CHO细胞系中,使不稳定表达的问题成为了一个极其普遍的问题。Random integration is the most mature traditional method for constructing protein expression systems, but obtaining stable high-expression cell lines through random integration requires multiple screenings, which is time-consuming and expensive. At the same time, for the cell lines obtained by random integration, the loss of expression stability of the cells in the later stage of culture is completely unpredictable: this instability may not occur at all; it may also gradually appear after numerous cell divisions; It is also possible that apparent instability occurs after only a few generations of cell division. The stability issue will not only affect the time to market of the product, but will also conflict with drug regulatory management. In addition, the information of random integration sites is not clear, and the site effect of exogenous gene integration will also lead to a significant decrease in the expression level of the target gene. According to existing literature reports, this instability of recombinant CHO cell lines occurs in all recombinant CHO cell lines, making the problem of unstable expression an extremely common problem.
发明内容Contents of the invention
为解决上述技术问题,本发明提供一种CHO细胞基因组内稳定表达蛋白质的位点,该位点信息明确,在该位点能够实现蛋白基因的定点整合,且能稳定表达蛋白质,并能大量缩短细胞构建过程中的筛选流程与时间,降低研发成本。In order to solve the above technical problems, the present invention provides a site for stably expressing proteins in the genome of CHO cells. The screening process and time in the cell construction process reduce the cost of research and development.
本发明的第一个目的是提供一种CHO细胞基因组内稳定表达蛋白质的位点,所述的稳定表达蛋白质的位点位于CHO细胞基因NW_003614889.1的第103331-103531碱基内。The first object of the present invention is to provide a stable protein expression site in the CHO cell genome, and the stable protein expression site is located in the 103331-103531 base of the CHO cell gene NW_003614889.1.
进一步地,所述的CHO细胞基因NW_003614889.1的第103331-103531碱基的核苷酸序列如SEQ ID NO.1所示。Further, the nucleotide sequence of bases 103331-103531 of the CHO cell gene NW_003614889.1 is shown in SEQ ID NO.1.
进一步地,在采用CRISPR/Cas9技术定点转入目的蛋白质的编码基因时,所述的稳定表达蛋白质的位点可被CRISPR/Cas9技术的5'NNNNNNNNNNNNNNNNNNNNNGG 3'序列识别。Further, when the CRISPR/Cas9 technology is used to site-directly transfer the coding gene of the target protein, the site for stably expressing the protein can be recognized by the 5'NNNNNNNNNNNNNNNNNNNGG 3' sequence of the CRISPR/Cas9 technology.
进一步地,所述的5'NNNNNNNNNNNNNNNNNNNNNGG 3'序列在本发明实施例中,选取了以下9组序列:5’-ATCTTACTTTGCAACTACCAAGG-3’、5’-CACTTTATCTAACACTGGCCAGG-3’、5’-AGCAACACTTTATCTAACACTGG-3’、5’-GCACTCTGTGTGGAGCAAAGAGG-3’、5’-GCAAATGAGTAAAGTTCTCCTGG-3’、5’-TGCAGTGATAGCACTCTGTGTGG-3’、5’-AACAGGATGAAAATGTATGGAGG-3’、5’-TTCATCCTGTTCTCTGTTCTGGG-3’、5’-TTTCATCCTGTTCTCTGTTCTGG-3’。Further, the 5'NNNNNNNNNNNNNNNNNNNNNNNGG 3' sequence in the embodiment of the present invention selects the following 9 sequences: 5'-ATCTTACTTTGCAACTACCAAGG-3', 5'-CACTTTTATCTAACACTGGCCAGG-3', 5'-AGCAACACTTTTATCTAACACTGG-3', 5'-GCACTCTGTGTGGAGCAAAGAGG-3', 5'-GCAAATGAGTAAAGTTCTCCTGG-3', 5'-TGCAGTGATAGCACTCTGTGTGG-3', 5'-AACAGGATGAAAATGTATGGAGG-3', 5'-TTCATCCTGTTCTCTGTTCTGGG-3', 5'-TTTCATCCTGTTCTCTGTTCTGG- 3'.
在本发明中,上述9组序列在CHO细胞基因NW_003614889.1的第103331-103531碱基内分布覆盖了本发明200个碱基内的上、中、下游大部分序列,说明本发明的200个碱基都可以作为稳定表达蛋白质的位点。In the present invention, the above-mentioned 9 groups of sequences are distributed within the 103331-103531 bases of the CHO cell gene NW_003614889.1 and cover most of the upper, middle and downstream sequences within the 200 bases of the present invention, indicating that the 200 bases of the present invention All bases can be used as sites for stably expressing proteins.
本发明并不限定上述9组序列,上述9组序列只是作为其中优选的技术手段来将目的蛋白质的编码基因导入到稳定表达位点,在采用其他序列,甚至采用其他导入目的基因的手段,仍然可以实现本发明稳定表达蛋白质的目的。The present invention does not limit the above nine groups of sequences. The above nine groups of sequences are only used as the preferred technical means to introduce the coding gene of the target protein into the stable expression site. The purpose of stably expressing the protein of the present invention can be achieved.
进一步地,所述的蛋白质为分子量小于160KDa的蛋白质。Further, the protein is a protein with a molecular weight less than 160KDa.
进一步地,所述的蛋白质为多肽、功能性蛋白质、抗体、融合蛋白质中的一种。Further, the protein is one of polypeptides, functional proteins, antibodies, and fusion proteins.
本发明的第二个目的是提供所述的CHO细胞基因组内稳定表达蛋白质的位点在 CHO细胞稳定表达外源蛋白质中的应用。The second object of the present invention is to provide the application of the stable protein expression sites in the CHO cell genome in the stable expression of foreign proteins in CHO cells.
进一步地,所述的应用具体是将外源蛋白质或多肽的编码基因构建在所述的CHO细胞基因NW_003614889.1内稳定表达蛋白质的位点上。Further, the application specifically involves constructing a gene encoding an exogenous protein or polypeptide at the site of stably expressing the protein in the CHO cell gene NW_003614889.1.
本发明的第三个目的是提供一种用于在CHO细胞中表达蛋白质的表达载体,所述的蛋白质的编码基因位于所述的表达载体上5’同源臂和3’同源臂中间的区域,所述的5’同源臂和3’同源臂分别为所述的稳定表达蛋白质的位点上下游长度为600bp的序列。The third object of the present invention is to provide an expression vector for expressing protein in CHO cells, the coding gene of the protein is located between the 5' homology arm and the 3' homology arm of the expression vector region, the 5' homology arm and the 3' homology arm are sequences with a length of 600 bp upstream and downstream of the stable protein expression site respectively.
在本发明中,所述的表达载体为适用于CHO细胞表达的载体。In the present invention, the expression vector is a vector suitable for expression in CHO cells.
进一步地,所述的表达载体上还包括位于所述的蛋白质的编码基因上游的启动子序列,所述的启动子控制所述的蛋白质的表达。Further, the expression vector also includes a promoter sequence located upstream of the gene encoding the protein, and the promoter controls the expression of the protein.
进一步地,所述的启动子为:CMV(人类巨细胞病毒来源的强哺乳动物表达启动子)、EF-1a(人延长因子1α来源的强哺乳动物表达启动子)、SV40(猿猴空泡病毒40来源的哺乳动物表达启动子)、PGK1(磷酸甘油酸酯激酶基因来源的哺乳动物启动子)、UBC(人类泛素C基因来源的哺乳动物启动子)、humanbeta actin(β-肌动蛋白基因来源的哺乳动物启动子)、CAG(强杂交哺乳动物启动子)等中的一种。Further, the promoters are: CMV (strong mammalian expression promoter derived from human cytomegalovirus), EF-1a (strong mammalian expression promoter derived from human elongation factor 1α), SV40 (simian vacuolar virus 40 derived mammalian expression promoter), PGK1 (mammalian promoter derived from phosphoglycerate kinase gene), UBC (mammalian promoter derived from human ubiquitin C gene), humanbeta actin (β-actin gene derived mammalian promoter), CAG (strong hybrid mammalian promoter), etc.
在本发明中,还包括用于在CHO细胞中表达蛋白质的表达载体的构建方法,包括如下步骤:将蛋白质的编码基因插入到质粒5’arm和3’arm中间的区域,使该蛋白质的编码基因位于启动子的下游并受启动子调控,得到所述的用于在CHO细胞中表达蛋白质的表达载体。In the present invention, it also includes a method for constructing an expression vector for expressing a protein in CHO cells, comprising the following steps: inserting the gene encoding the protein into the region between the 5'arm and the 3'arm of the plasmid, making the protein encoding The gene is located downstream of the promoter and is regulated by the promoter to obtain the expression vector for expressing the protein in CHO cells.
本发明的第四个目的是提供一种定点整合、稳定表达蛋白质的CHO重组细胞,所述的CHO重组细胞通过将所述的用于在CHO细胞中表达蛋白质的表达载体、靶序列对应的sgRNA质粒以及Cas9质粒转入CHO细胞得到。The fourth object of the present invention is to provide a CHO recombinant cell with site-specific integration and stable expression of protein. The CHO recombinant cell is obtained by using the expression vector for expressing protein in CHO cells and the sgRNA corresponding to the target sequence. Plasmids and Cas9 plasmids were transformed into CHO cells.
进一步地,所述的靶序列优选为5’-ATCTTACTTTGCAACTACCAAGG-3’、5’-CACTTTATCTAACACTGGCCAGG-3’、5’-AGCAACACTTTATCTAACACTGG-3’、5’-GCACTCTGTGTGGAGCAAAGAGG-3’、5’-GCAAATGAGTAAAGTTCTCCTGG-3’、5’-TGCAGTGATAGCACTCTGTGTGG-3’、5’-AACAGGATGAAAATGTATGGAGG-3’、5’-TTCATCCTGTTCTCTGTTCTGGG-3’或5’-TTTCATCCTGTTCTCTGTTCTGG-3’。Further, the target sequence is preferably 5'-ATCTTACTTTGCAACTACCAAGG-3', 5'-CACTTTTATCTAACACTGGCCAGG-3', 5'-AGCAACACTTTTATCTAACACTGG-3', 5'-GCACTCTGTGTGGAGCAAAGAGG-3', 5'-GCAAATGAGTAAAGTTCTCCTGG-3', 5'-TGCAGTGATAGCACTCTGTGTGG-3', 5'-AACAGGATGAAAATGTATGGAGG-3', 5'-TTCATCCTGTTCTCTGTTCTGGG-3' or 5'-TTTCATCCTGTTCTCTGTTCTGG-3'.
在本发明中,还提供利用构建CHO重组细胞的方法,包括如下步骤:In the present invention, a method for constructing CHO recombinant cells is also provided, comprising the steps of:
(1)利用脂质体转染的方式将质粒载体转染进CHO细胞,得重组CHO细胞池;(1) Transfect the plasmid vector into CHO cells by liposome transfection to obtain a recombinant CHO cell pool;
其中,所述的质粒分别为所述的用于在CHO细胞中表达蛋白质的表达载体、所述的靶序列对应的sgRNA质粒以及Cas9质粒;Wherein, the plasmids are respectively the expression vector for expressing protein in CHO cells, the sgRNA plasmid and the Cas9 plasmid corresponding to the target sequence;
(2)筛选重组细胞池,得到表达外源蛋白质的CHO重组细胞;(2) screening the recombinant cell pool to obtain CHO recombinant cells expressing foreign proteins;
(3)贴壁培养CHO重组细胞,检测蛋白质的表达水平,以及对高表达贴壁CHO 重组细胞进行悬浮驯化;(3) Adhesive culture of CHO recombinant cells, detection of protein expression levels, and suspension acclimatization of highly expressed adherent CHO recombinant cells;
(4)悬浮驯化的CHO重组细胞进行培养及稳定性验证,并检测蛋白质的表达水平。(4) Suspension and acclimatization of CHO recombinant cells for culture and stability verification, and detection of protein expression levels.
本发明的有益效果是:The beneficial effects of the present invention are:
本发明获得的稳定表达位点位于CHO细胞基因NW_003614889.1的第103431碱基处上下游100bp范围内,即第103331-103531碱基处,可整合外源蛋白质基因并进行稳定表达。本发明通过定点整合的方式将目的基因定点整合到稳定表达区域,解决了随机整合所带来的整合位点不明确的问题;本发明通过CHO基因组内稳定表达位点NW_003614889.1的第103431碱基处上下游103331-103531碱基范围内定点整合外源基因克服了位置效应所带来的表达不稳定性以及反复繁琐的细胞株筛选过程,将原本的6-12个月的筛选时间减少至1-3个月,有效的缩短构建稳定表达细胞系的研发时间,降低成本。The stable expression site obtained by the present invention is located within 100 bp upstream and downstream of base 103431 of CHO cell gene NW_003614889.1, that is, base 103331-103531, and can integrate foreign protein genes and perform stable expression. The present invention integrates the target gene into the stable expression region through fixed-point integration, which solves the problem of unclear integration sites caused by random integration; the present invention uses base 103431 of the stable expression site NW_003614889.1 in the CHO genome The site-specific integration of exogenous genes within the range of 103331-103531 bases upstream and downstream of the base overcomes the expression instability caused by the position effect and the repeated and tedious cell line screening process, reducing the original screening time of 6-12 months to 1-3 months, effectively shortening the R&D time for constructing stable expression cell lines and reducing costs.
附图说明:Description of drawings:
图1为本发明的定点整合示意图;Fig. 1 is a schematic diagram of fixed-point integration of the present invention;
图2为不同靶序列所构建细胞在不同代次时表达EGFP的情况。Figure 2 shows the expression of EGFP at different passages in cells constructed with different target sequences.
图3为不同靶序列所构建细胞在不同代次时表达anti-EGFR的情况。Figure 3 shows the expression of anti-EGFR at different passages in cells constructed with different target sequences.
具体实施方式Detailed ways
下面结合具体实施例对本发明作进一步说明,以使本领域的技术人员可以更好地理解本发明并能予以实施,但所举实施例不作为对本发明的限定。The present invention will be further described below in conjunction with specific examples, so that those skilled in the art can better understand the present invention and implement it, but the given examples are not intended to limit the present invention.
涉及的检测方法:Involved detection methods:
细胞平均荧光强度测定方法:将细胞培养至汇合度达90%左右,用0.25%胰蛋白酶消化细胞,并用与胰蛋白酶等量的完全培养基终止消化,将细胞收集于无菌离心管中1000rpm/min离心5min,弃上清并用PBS重悬,过细胞滤网收集于流式上样管,以空白CHO-K1细胞为对照用流式细胞仪分析细胞的荧光强度。The method for measuring the average fluorescence intensity of cells: culture the cells until the confluence reaches about 90%, digest the cells with 0.25% trypsin, and terminate the digestion with a complete medium equal to the amount of trypsin, and collect the cells in a sterile centrifuge tube at 1000rpm/ Centrifuge for 5 min, discard the supernatant and resuspend with PBS, pass through a cell strainer and collect in a flow loading tube, and use a blank CHO-K1 cell as a control to analyze the fluorescence intensity of the cells with a flow cytometer.
实施例1:稳定表达位点的筛选Example 1: Screening of stable expression sites
将利用流式细胞仪高通量筛选出的表达Zsgreen1报告基因的CHO-K1-2d9细胞在贴壁培养状态下培养至状态良好,将此时的CHO细胞视为第0代,再连续培养20个代次,并在倒置荧光显微镜下观察第0、10、20代时细胞表达Zsgreen1蛋白的情况,同时利用BD流式细胞仪检测第0、10、20代时细胞的平均荧光强度。The CHO-K1-2d9 cells expressing the Zsgreen1 reporter gene screened by high-throughput flow cytometry were cultured in an adherent culture until they were in good condition, and the CHO cells at this time were regarded as the 0th generation, and then cultured continuously for 20 days. The expression of Zsgreen1 protein in cells at passages 0, 10, and 20 was observed under an inverted fluorescence microscope, and the average fluorescence intensity of cells at passages 0, 10, and 20 was detected by BD flow cytometry.
通过倒置荧光显微镜观察以及流式细胞仪检测,在经过连续传代20个代次后,CHO-K1-2d9细胞在贴壁培养状态下仍然能够百分百表达Zsgreen1蛋白,且不同代次间Zsgreen1蛋白表达水平基本一致,并有较强绿色荧光信号。Observation by inverted fluorescence microscope and detection by flow cytometry showed that after 20 passages, CHO-K1-2d9 cells could still express 100% Zsgreen1 protein in the adherent culture state, and Zsgreen1 protein between different passages The expression levels are basically the same, and there is a strong green fluorescent signal.
对通过贴壁稳定性验证的CHO-K1-2d9细胞进行悬浮驯化,并对悬浮驯化成功的CHO-K1-2d9细胞进行连续传代60个代次,以悬浮驯化成功时作为第0代。并在倒置荧光显微镜下观察第0、10、20、30、40、50、60代时细胞表达Zsgreen1蛋白的情况,同时利用流式细胞仪检测第0、10、20、30、40、50、60代时细胞的平均荧光强度。The CHO-K1-2d9 cells that passed the verification of anchorage stability were subjected to suspension acclimation, and the CHO-K1-2d9 cells that were successfully acclimatized in suspension were continuously passaged for 60 passages, and the success of acclimation in suspension was regarded as the 0th passage. The expression of Zsgreen1 protein in the cells at passages 0, 10, 20, 30, 40, 50, and 60 was observed under an inverted fluorescence microscope, and at the same time, the cells at passages 0, 10, 20, 30, 40, and 50 were detected by flow cytometry. Mean fluorescence intensity of cells at passage 60.
通过倒置荧光显微镜观察以及流式细胞仪检测,在经过连续传代60个代次后,CHO-K1-2d9细胞在悬浮培养状态下仍然能够百分百表达Zsgreen1蛋白,且不同代次间Zsgreen1蛋白表达水平基本一致,并有较强绿色荧光信号。表明CHO-K1-2d9细胞能够稳定表达Zsgreen1报告基因,同时表明携带Zsgreen1报告基因的慢病毒所整合的位点为稳定表达位点。Observation by inverted fluorescence microscope and detection by flow cytometry showed that after 60 passages, CHO-K1-2d9 cells could still express 100% Zsgreen1 protein in suspension culture state, and Zsgreen1 protein expression between different passages The levels are basically the same, and there is a strong green fluorescent signal. It shows that CHO-K1-2d9 cells can stably express the Zsgreen1 reporter gene, and at the same time, it shows that the integrated site of the lentivirus carrying the Zsgreen1 reporter gene is a stable expression site.
实施例2:慢病毒整合位点分析Example 2: Analysis of Lentiviral Integration Sites
利用染色体步移技术相关Lenti-X Integration Site Analysis Kit(Clontech:631263)分析慢病毒载体在CHO-K1-2d9细胞中的整合位点,具体步骤如下:Use the Lenti-X Integration Site Analysis Kit (Clontech: 631263) related to chromosome walking technology to analyze the integration site of the lentiviral vector in CHO-K1-2d9 cells. The specific steps are as follows:
(1)、慢病毒整合文库的构建(1), construction of lentiviral integration library
收集CHO-K1-2d9细胞,利用DNA提取试剂盒抽提基因组,使用DraI、SspI、HpaI三种限制性内切酶分别对基因组于37℃酶切16-18h,酶切体系如下:Collect CHO-K1-2d9 cells, use the DNA extraction kit to extract the genome, use DraI, SspI, HpaI three restriction endonucleases to digest the genome at 37°C for 16-18h, and the digestion system is as follows:
Figure PCTCN2022099466-appb-000001
Figure PCTCN2022099466-appb-000001
将酶切后产物利用PCR纯化试剂盒进行纯化回收,并将染色体步移接头GenomeWalker Adaptor连接至纯化后酶切片段两端,连接体系如下:The digested product was purified and recovered using a PCR purification kit, and the chromosome walking adapter GenomeWalker Adaptor was connected to both ends of the purified digested fragment. The connection system was as follows:
Figure PCTCN2022099466-appb-000002
Figure PCTCN2022099466-appb-000002
在16℃下孵育过夜,70℃孵育5分钟,终止反应后,向体系中加入32μl TE(10/1,pH 7.5),获得三个慢病毒整合文库。Incubate overnight at 16°C and incubate at 70°C for 5 minutes. After terminating the reaction, add 32 μl TE (10/1, pH 7.5) to the system to obtain three lentiviral integration libraries.
(2)、慢病毒整合文库的PCR扩增(2), PCR amplification of lentiviral integration library
对实施例2中(1)步骤所得三个慢病毒整合文库分别进行两轮巢式PCR。利用实施例2中(1)步骤中所连步移接头的接头引物AP1、AP2,慢病毒序列特异引物LSP1、LSP2将LTR区域同临近的CHO-K1细胞基因组区域扩增出来Two rounds of nested PCR were respectively performed on the three lentiviral integration libraries obtained in step (1) of Example 2. Using the adapter primers AP1 and AP2 of the walking adapter connected in the step (1) of Example 2, and the lentiviral sequence-specific primers LSP1 and LSP2, the LTR region and the adjacent CHO-K1 cell genome region were amplified
一轮PCR反应体系如下:A round of PCR reaction system is as follows:
Figure PCTCN2022099466-appb-000003
Figure PCTCN2022099466-appb-000003
反应程序如下:The reaction procedure is as follows:
Figure PCTCN2022099466-appb-000004
Figure PCTCN2022099466-appb-000004
取1μl一轮PCR产物用deionized H2O稀释至50μl。Take 1 μl of a round of PCR product and dilute to 50 μl with deionized H2O.
二轮PCR反应体系如下:The second round of PCR reaction system is as follows:
Figure PCTCN2022099466-appb-000005
Figure PCTCN2022099466-appb-000005
Figure PCTCN2022099466-appb-000006
Figure PCTCN2022099466-appb-000006
反应程序如下:The reaction procedure is as follows:
Figure PCTCN2022099466-appb-000007
Figure PCTCN2022099466-appb-000007
(3)、测序与分析(3), sequencing and analysis
对二轮PCR产物进行琼脂糖凝胶电泳并进行胶回收测序,测序按照试剂盒Lenti-X Integration Site Analysis Kit(Clontech:631263)进行操作。将测序结果在NCBI上与CHO细胞基因组进行比对,即可获得慢病毒整合位点信息,CHO-K1-2d9细胞中慢病毒整合位点位于CHO细胞基因组NW_003614889.1第103431碱基处。The second round of PCR products were subjected to agarose gel electrophoresis and gel recovery sequencing, and the sequencing was performed according to the kit Lenti-X Integration Site Analysis Kit (Clontech: 631263). The lentivirus integration site information can be obtained by comparing the sequencing results with the CHO cell genome on NCBI. The lentivirus integration site in CHO-K1-2d9 cells is located at base 103431 of the CHO cell genome NW_003614889.1.
实施例3:靶序列选择Example 3: Target Sequence Selection
根据就近原则,使用CCTOP CRISPR/Cas9在线预测系统对位点NW_003614889.1第103431碱基处上下游各100bp的序列:TGCAGTGATAGCACTCTGTGTGGAGCAAAGAGGAAGAGATGGAGCAACACTTTATCTAACACTGGCCAGGAGAACTTTACTCATTTGCTATCTTACTTTGCAACTACCAAGGTCTCTCTTGTTTTCTGTTTTCTCACGTAGTCTACTTGAGTGAGTCATAGTTAACCTCCATACATTTTCATCCTGTTCTCTGTTCTGGG(SEQ ID NO.1)进行预测,并挑选出编辑效率较高的靶序列。According to the principle of proximity, use the CCTOP CRISPR/Cas9 online prediction system to align the sequence of 100 bp upstream and downstream at base 103431 of NW_003614889.1: TGCAGTGATAGCACTCTGTGTGGAGCAAAGAGGAAGAGATGGAGCAACACTTTTATCTAACACTGGCCAGGAGAACTTTACTCATTTGCTATCTTACTTTGCAACTACCAAGGTCTCTCTT GTTTTCTGTTTTTCTCACGTAGTCTACTTGAGTGAGTCATAGTTAACCTCCATACATTTTCATCCTGTTCTCTGTTCTGGG (SEQ ID NO.1) is predicted and the target with higher editing efficiency is selected sequence.
相关参数设置如下:The relevant parameters are set as follows:
1)、NGG后20bp序列内允许前13bp的最大错配碱基数为1;1) The maximum number of base mismatches allowed in the first 13 bp within the 20 bp sequence after NGG is 1;
2)、NGG后所有20bp的错配碱基数为4。2) The number of all 20bp mismatched bases after NGG is 4.
CCTOP CRISPR/Cas9在线预测系统会对识别出的5'NNNNNNNNNNNNNNNNNNNNNGG 3'靶序列进行编辑效率打分,LOW efficacy(score<0.56);MEDIUM efficacy(0.56<=s core<=0.74);HIGH efficacy(score>0.74)。The CCTOP CRISPR/Cas9 online prediction system will score the editing efficiency of the identified 5'NNNNNNNNNNNNNNNNNNNNNGG 3' target sequence, LOW efficiency(score<0.56); MEDIUM efficiency(0.56<=s core<=0.74); HIGH efficiency(score> 0.74).
选择其中预测编辑效率高于0.56的序列作为靶序列。Sequences in which the predicted editing efficiency was higher than 0.56 were selected as target sequences.
靶序列5’-ATCTTACTTTGCAACTACCAAGG-3’(SEQ ID NO.2),score=0.65Target sequence 5'-ATCTTACTTTGCAACTACCAAGG-3'(SEQ ID NO.2), score=0.65
靶序列5’-CACTTTATCTAACACTGGCCAGG-3’(SEQ ID NO.3),score=0.62Target sequence 5'-CACTTTATCTAACACTGGCCAGG-3'(SEQ ID NO.3), score=0.62
靶序列5’-AGCAACACTTTATCTAACACTGG-3’(SEQ ID NO.4),score=0.59Target sequence 5'-AGCAACACTTTTATCTAACACTGG-3'(SEQ ID NO.4), score=0.59
靶序列5’-GCACTCTGTGTGGAGCAAAGAGG-3’(SEQ ID NO.5),score=0.83Target sequence 5'-GCACTCTGTGTGGAGCAAAGAGG-3'(SEQ ID NO.5), score=0.83
靶序列5’-GCAAATGAGTAAAGTTCTCCTGG-3’(SEQ ID NO.6),score=0.57Target sequence 5'-GCAAATGAGTAAAGTTCTCCTGG-3'(SEQ ID NO.6), score=0.57
靶序列5’-TGCAGTGATAGCACTCTGTGTGG-3’(SEQ ID NO.7),score=0.68Target sequence 5'-TGCAGTGATAGCACTCTGTGTGG-3'(SEQ ID NO.7), score=0.68
靶序列5’-AACAGGATGAAAATGTATGGAGG-3’(SEQ ID NO.8),score=0.77Target sequence 5'-AACAGGATGAAAATGTATGGAGG-3'(SEQ ID NO.8), score=0.77
靶序列5’-TTCATCCTGTTCTCTGTTCTGGG-3’(SEQ ID NO.9),score=0.66Target sequence 5'-TTCATCCTGTTCTCTGTTCTGGG-3'(SEQ ID NO.9), score=0.66
靶序列5’-TTTCATCCTGTTCTCTGTTCTGG-3’(SEQ ID NO.10),score=0.69Target sequence 5'-TTTCATCCTGTTCTCTGTTCTGG-3'(SEQ ID NO.10), score=0.69
实施例4:定点整合EGFPExample 4: Fixed-point integration of EGFP
利用CRISPR/Cas9介导的基因组定点编辑技术以及同源重组将将绿色荧光蛋白基因(EGFP,26.7KDa)定点整合在靶位点。CRISPR/Cas9介导的同源重组技术需要构建sgRNA质粒以及Donor Plasmid,构建过程如下:Using CRISPR/Cas9-mediated genome editing technology and homologous recombination, the green fluorescent protein gene (EGFP, 26.7KDa) will be integrated at the target site. CRISPR/Cas9-mediated homologous recombination technology needs to construct sgRNA plasmid and Donor Plasmid, the construction process is as follows:
1、sgRNA质粒构建1. sgRNA plasmid construction
1)、根据实施例3选择的靶序列,合成寡核苷酸链1), according to the target sequence selected in embodiment 3, synthetic oligonucleotide chain
sgRNA-F1 5'TTTGATCTTACTTTGCAACTACCAGT 3'(SEQ ID NO.11)sgRNA-F1 5'TTTGATCTTACTTTGCAACTACCAGT 3'(SEQ ID NO.11)
sgRNA-R1 5'TAAAACCCTTGGTAGTTGCAAAGTAAGAT 3'(SEQ ID NO.12)sgRNA-R1 5'TAAAACCCTTGGTAGTTGCAAAGTAAGAT 3'(SEQ ID NO.12)
sgRNA-F2 5'TTTGCACTTTATCTAACACTGGCCGT 3'(SEQ ID NO.13)sgRNA-F2 5'TTTGCACTTTTATCTAACACTGGCCGT 3'(SEQ ID NO.13)
sgRNA-R2 5'TAAAACCCTGGCCAGTGTTAGATAAAGTG 3'(SEQ ID NO.14)sgRNA-R2 5'TAAAACCCTGGCCAGTGTTAGATAAAGTG 3'(SEQ ID NO.14)
sgRNA-F3 5'TTTGAGCAACACTTTATCTAACACGT 3'(SEQ ID NO.15)sgRNA-F3 5'TTTGAGCAACACTTTTATCTAACACGT 3'(SEQ ID NO.15)
sgRNA-R3 5'TAAAACCCAGTGTTAGATAAAGTGTTGCT 3'(SEQ ID NO.16)sgRNA-R3 5'TAAAACCCAGTGTTAGATAAAGTGTTGCT 3'(SEQ ID NO.16)
sgRNA-F4 5'TTTGGCACTCTGTGTGGAGCAAAGGT 3'(SEQ ID NO.17)sgRNA-F4 5'TTTGGCACTCTGTGTGGAGCAAAGGT 3'(SEQ ID NO.17)
sgRNA-R4 5'TAAAACCCTCTTTGCTCCACACAGAGTGC 3'(SEQ ID NO.18)sgRNA-R4 5'TAAAACCCCTCTTTGCTCCACACAGAGTGC 3'(SEQ ID NO.18)
sgRNA-F5 5'TTTGGCAAATGAGTAAAGTTCTCCGT 3'(SEQ ID NO.19)sgRNA-F5 5'TTTGGCAAATGAGTAAAGTTCTCCGT 3' (SEQ ID NO.19)
sgRNA-R5 5'TAAAACCCAGGAGAACTTTACTCATTTGC 3'(SEQ ID NO.20)sgRNA-R5 5'TAAAACCCAGGAGAACTTTACTCATTTGC 3'(SEQ ID NO.20)
sgRNA-F6 5'TTTGTGCAGTGATAGCACTCTGTGGT 3'(SEQ ID NO.21)sgRNA-F6 5'TTTGTGCAGTGATAGCACTCTGTGGT 3'(SEQ ID NO.21)
sgRNA-R6 5'TAAAACCCACACAGAGTGCTATCACTGCA 3'(SEQ ID NO.22)sgRNA-R6 5'TAAAACCCACACAGAGTGCTATCACTGCA 3'(SEQ ID NO.22)
sgRNA-F7 5'TTTGAACAGGATGAAAATGTATGGGT 3'(SEQ ID NO.23)sgRNA-F7 5'TTTGAACAGGATGAAAATGTATGGGT 3' (SEQ ID NO.23)
sgRNA-R7 5'TAAAACCCTCCATACATTTTCATCCTGTT 3'(SEQ ID NO.24)sgRNA-R7 5'TAAAACCCTCCATACATTTTCATCCTGTT 3'(SEQ ID NO.24)
sgRNA-F8 5'TTTGTTCATCCTGTTCTCTGTTCTGT 3'(SEQ ID NO.25)sgRNA-F8 5'TTTGTTCATCCTGTTCTCTGTTCTGT 3' (SEQ ID NO.25)
sgRNA-R8 5'TAAAACCCCAGAACAGAGAACAGGATGAA 3'(SEQ ID NO.26)sgRNA-R8 5'TAAAACCCCAGAACAGAGAACAGGATGAA 3'(SEQ ID NO.26)
sgRNA-F9 5'TTTGTTTCATCCTGTTCTCTGTTCGT 3'(SEQ ID NO.27)sgRNA-F9 5'TTTGTTTCATCCCTGTTCTCTGTTCGT 3' (SEQ ID NO.27)
sgRNA-R9 5'TAAAACCCAGAACAGAGAACAGGATGAAA 3'(SEQ ID NO.28)sgRNA-R9 5'TAAAACCCAGAACAGAGAACAGGATGAAA 3'(SEQ ID NO.28)
2)、将合成的寡核苷酸链(1-9对)分别进行退火连接2), annealing the synthesized oligonucleotide chains (1-9 pairs) respectively
Figure PCTCN2022099466-appb-000008
Figure PCTCN2022099466-appb-000008
金属浴95℃5min,自然降至室温;Metal bath at 95°C for 5 minutes, then naturally lower to room temperature;
3)、用BBsI酶对PSK-u6-gRNA质粒进行酶切,对酶切后的载体进行胶回收;3), the PSK-u6-gRNA plasmid is digested with BBsI enzyme, and the vector after digestion is gel recovered;
4)、将回收后的质粒载体与退火后的寡核苷酸链进行连接4), Ligate the recovered plasmid vector with the annealed oligonucleotide chain
Figure PCTCN2022099466-appb-000009
Figure PCTCN2022099466-appb-000009
22℃连接1h或4℃过夜连接;Ligate at 22°C for 1 hour or overnight at 4°C;
5)、转化至DH5α感受态;5), transformed into DH5α competent;
6)、挑选阳性克隆利用通用引物M13fwd测序;6), select positive clones and use universal primer M13fwd for sequencing;
7)、扩培阳性克隆菌株并提质粒。7) Expand and cultivate positive cloned strains and extract plasmids.
2、Donor plasmid构建:Donor plasmid信息如图1所示,在已有表达EGFP的质粒载体上进行改造所得。5’arm与3’arm分别为每对sgRNA所识别的靶位点的上下游同源臂,长度为600bp,GOI为整合的目的基因。2. Construction of Donor plasma: The information of Donor plasma is shown in Figure 1, which was obtained by transforming the existing plasmid vector expressing EGFP. 5'arm and 3'arm are the upstream and downstream homology arms of the target sites recognized by each pair of sgRNAs, with a length of 600bp, and GOI is the integrated target gene.
1)、通过引物设计与PCR扩增获得带有质粒同源片段的位点上下游600bp长度的5’arm与3’arm;1) Through primer design and PCR amplification, the 5'arm and 3'arm with a length of 600 bp upstream and downstream of the homologous fragment of the plasmid were obtained;
2)、分别利用双酶切与胶回收切除Donor plasmid原有同源臂;2), using double enzyme digestion and gel recovery to remove the original homology arm of Donor plasmamid;
3)、通过同源重组的方法分别连接靶位点对应的5’arm与3’arm;3) Connect the 5'arm and 3'arm corresponding to the target site by means of homologous recombination;
4)、目的基因EGFP序列为原有质粒自带。4) The EGFP sequence of the target gene is carried by the original plasmid.
3、将所构建的sgRNA质粒、Donor plasmid与Cas9-DTU质粒(丹麦科技大学Dr.Helene F Kildegaard捐赠)通过Lipofectamine 3000转染试剂以1.8:1.8:1的质量比共转染37℃、5%CO2条件下培养的CHO-K1细胞,同时设置空白对照组。转染24h后,使用10μg/ml 嘌呤霉素进行压力筛选至对照组细胞全部死亡,扩培筛选后细胞池,并利用BD流式细胞仪分选出只发绿色荧光且不发红色荧光的单克隆细胞。3. Co-transfect the constructed sgRNA plasmid, Donor plasmid and Cas9-DTU plasmid (donated by Dr.Helene F Kildegaard, Technical University of Denmark) with Lipofectamine 3000 transfection reagent at a mass ratio of 1.8:1.8:1 at 37°C, 5% CHO-K1 cells cultured under CO2 conditions, and a blank control group was set at the same time. After 24 hours of transfection, 10 μg/ml puromycin was used to perform pressure selection until all the cells in the control group died, and the cell pool after screening was expanded, and the cells that only emitted green fluorescence and did not emit red fluorescence were sorted out by BD flow cytometry. Clone cells.
4、将克隆细胞株扩培后,取一部分提取基因组,通过5’junction PCR,3’Junction PCR及out-out PCR进行鉴定,如图1所示。4. After expanding the cloned cell line, take a part to extract the genome, and identify it by 5'junction PCR, 3'Junction PCR and out-out PCR, as shown in Figure 1.
5、保留阳性克隆细胞株。5. Keep the positive clonal cell lines.
实施例5:定点整合anti-EGFRExample 5: fixed-point integration of anti-EGFR
利用CRISPR/Cas9介导的基因组定点编辑技术以及同源同组将表达表皮生长因子受体人源化抗体基因(anti-EGFR,150KDa)定点整合在靶位点。CRISPR/Cas9介导的同源重组技术需要构建sgRNA质粒以及Donor Plasmid,构建过程如下:Using CRISPR/Cas9-mediated genome site-directed editing technology and homologous homology, the humanized antibody gene (anti-EGFR, 150KDa) expressing epidermal growth factor receptor was site-directedly integrated at the target site. CRISPR/Cas9-mediated homologous recombination technology needs to construct sgRNA plasmid and Donor Plasmid, the construction process is as follows:
1、sgRNA质粒构建:1. sgRNA plasmid construction:
1)、根据实施例3选择的靶序列,合成寡核苷酸链1), according to the target sequence selected in embodiment 3, synthetic oligonucleotide chain
sgRNA-F1 5'TTTGATCTTACTTTGCAACTACCAGT 3'sgRNA-F1 5'TTTGATCTTACTTTGCAACTACCAGT 3'
sgRNA-R1 5'TAAAACCCTTGGTAGTTGCAAAGTAAGAT 3'sgRNA-R1 5'TAAAACCCTTGGTAGTTGCAAAGTAAGAT 3'
sgRNA-F2 5'TTTGCACTTTATCTAACACTGGCCGT 3'sgRNA-F2 5'TTTGCACTTTTATCTAACACTGGCCGT 3'
sgRNA-R2 5'TAAAACCCTGGCCAGTGTTAGATAAAGTG 3'sgRNA-R2 5'TAAAACCCTGGCCAGTGTTAGATAAAGTG 3'
sgRNA-F3 5'TTTGAGCAACACTTTATCTAACACGT 3'sgRNA-F3 5'TTTGAGCAACACTTTTATCTAACACGT 3'
sgRNA-R3 5'TAAAACCCAGTGTTAGATAAAGTGTTGCT 3'sgRNA-R3 5'TAAAACCCAGTGTTAGATAAAGTGTTGCT 3'
sgRNA-F4 5'TTTGGCACTCTGTGTGGAGCAAAGGT 3'sgRNA-F4 5'TTTGGCACTCTGTGTGGAGCAAAGGT 3'
sgRNA-R4 5'TAAAACCCTCTTTGCTCCACACAGAGTGC 3'sgRNA-R4 5'TAAAACCCCTCTTTGCTCCACACAGAGTGC 3'
sgRNA-F5 5'TTTGGCAAATGAGTAAAGTTCTCCGT 3'sgRNA-F5 5'TTTGGCAAATGAGTAAAGTTCTCCGT 3'
sgRNA-R5 5'TAAAACCCAGGAGAACTTTACTCATTTGC 3'sgRNA-R5 5'TAAAACCCAGGAGAACTTTACTCATTTGC 3'
sgRNA-F6 5'TTTGTGCAGTGATAGCACTCTGTGGT 3'sgRNA-F6 5'TTTGTGCAGTGATAGCACTCTGTGGT 3'
sgRNA-R6 5'TAAAACCCACACAGAGTGCTATCACTGCA 3'sgRNA-R6 5'TAAAACCCACACAGAGTGCTATCACTGCA 3'
sgRNA-F7 5'TTTGAACAGGATGAAAATGTATGGGT 3'sgRNA-F7 5'TTTGAACAGGATGAAAATGTATGGGT 3'
sgRNA-R7 5'TAAAACCCTCCATACATTTTCATCCTGTT 3'sgRNA-R7 5'TAAAACCCTCCATACATTTTCATCCTGTT 3'
sgRNA-F8 5'TTTGTTCATCCTGTTCTCTGTTCTGT 3'sgRNA-F8 5' TTTGTTCATCCTGTTCTCTGTTCTGT 3'
sgRNA-R8 5'TAAAACCCCAGAACAGAGAACAGGATGAA 3'sgRNA-R8 5'TAAAACCCCAGAACAGAGAACAGGATGAA 3'
sgRNA-F9 5'TTTGTTTCATCCTGTTCTCTGTTCGT 3'sgRNA-F9 5'TTTGTTTCATCCCTGTTCTCTGTTCGT 3'
sgRNA-R9 5'TAAAACCCAGAACAGAGAACAGGATGAAA 3sgRNA-R9 5'TAAAACCCAGAACAGAGAACAGGATGAAA 3
2)、将合成的寡核苷酸链(1-9对)分别进行退火连接2), annealing the synthesized oligonucleotide chains (1-9 pairs) respectively
Figure PCTCN2022099466-appb-000010
Figure PCTCN2022099466-appb-000010
Figure PCTCN2022099466-appb-000011
Figure PCTCN2022099466-appb-000011
金属浴95℃ 5min,自然降至室温;Metal bath at 95°C for 5 minutes, then naturally lower to room temperature;
3)、用BBsI酶对PSK-u6-gRNA质粒进行酶切,对酶切后的载体进行胶回收;3), the PSK-u6-gRNA plasmid is digested with BBsI enzyme, and the vector after digestion is gel recovered;
4)、将回收后的质粒载体与退火后的寡核苷酸链进行连接4), Ligate the recovered plasmid vector with the annealed oligonucleotide chain
Figure PCTCN2022099466-appb-000012
Figure PCTCN2022099466-appb-000012
22℃连接1h或4℃过夜连接;Ligate at 22°C for 1 hour or overnight at 4°C;
5)、转化至DH5α感受态;5), transformed into DH5α competent;
6)、挑选阳性克隆利用通用引物M13fwd测序;6), select positive clones and use universal primer M13fwd for sequencing;
7)、扩培阳性克隆菌株并提质粒;7), expanding positive clones and extracting plasmids;
2、Donor Plasmid构建:Donor plasmid信息如图1所示。5’arm与3’arm分别为靶位点的上下游同源臂,长度为600bp,GOI为整合的目的基因。2. Donor plasma construction: Donor plasma information is shown in Figure 1. 5'arm and 3'arm are the upstream and downstream homology arms of the target site, respectively, with a length of 600bp, and GOI is the integrated target gene.
1)、通过引物设计与PCR扩增获得带有质粒同源片段的位点上下游600bp长度的5’arm与3’arm;1) Through primer design and PCR amplification, the 5'arm and 3'arm with a length of 600 bp upstream and downstream of the homologous fragment of the plasmid were obtained;
2)、分别利用双酶切与胶回收切除Donor plasmid原有同源臂;2), using double enzyme digestion and gel recovery to remove the original homology arm of Donor plasmamid;
3)、通过同源重组的方法分别连接靶位点对应的5’arm与3’arm;3) Connect the 5'arm and 3'arm corresponding to the target site by means of homologous recombination;
4)、通过PCR扩增获得目的基因anti-EGFR,利用酶切链接连接至质粒载体。4) Obtain the target gene anti-EGFR by PCR amplification, and connect it to the plasmid vector by restriction restriction linking.
3、将所构建的sgRNA质粒、Donor plasmid与Cas9-DTU质粒(丹麦科技大学Dr.Helene F Kildegaard捐赠)通过Lipofectamine 3000转染试剂以1.8:1.8:1的质量比共转染37℃、5%CO 2条件下培养的CHO-K1细胞,同时设置空白对照组。转染24h后,使用10μg/ml嘌呤霉素进行压力筛选至对照组细胞全部死亡,扩培筛选后细胞池,并利用BD流式细胞仪分选出不发任何荧光的单克隆细胞。 3. Co-transfect the constructed sgRNA plasmid, Donor plasmid and Cas9-DTU plasmid (donated by Dr. Helene F Kildegaard, Technical University of Denmark) with Lipofectamine 3000 transfection reagent at a mass ratio of 1.8:1.8:1 at 37°C, 5% CHO-K1 cells cultured under CO 2 conditions, and a blank control group was set at the same time. After 24 hours of transfection, 10 μg/ml puromycin was used for pressure selection until all cells in the control group died, and the cell pool after screening was expanded, and monoclonal cells without any fluorescence were sorted out by BD flow cytometry.
4、将克隆细胞株扩培后,取一部分提取基因组,通过5’junction PCR,3’Junction PCR及out-out PCR进行鉴定,如图1所示。4. After expanding the cloned cell line, take a part to extract the genome, and identify it by 5'junction PCR, 3'Junction PCR and out-out PCR, as shown in Figure 1.
5、保留阳性克隆细胞株。5. Keep the positive clonal cell lines.
测试例:Test case:
1、用BD流式细胞仪检测实施例4所构建细胞株的绿色荧光强度1. Detect the green fluorescence intensity of the cell line constructed in Example 4 with a BD flow cytometer
检测方法:将实施例4所得细胞株均连续传代60个代次,每传代15个代次收集细胞,用流式细胞仪检测细胞荧光并检测强度,如图2所示,检测结果显示实施例4中根据不同靶序列所构建细胞在连续传代60个代次后均仍有98%以上细胞表达绿色荧光蛋白,且第0代与第60代间绿色荧光强度波动幅度均不超过30%。Detection method: the cell lines obtained in Example 4 were continuously passaged for 60 passages, and the cells were collected for 15 passages per passage, and the cell fluorescence was detected with a flow cytometer and the intensity was detected, as shown in Figure 2. In 4, more than 98% of the cells constructed according to different target sequences still expressed green fluorescent protein after 60 consecutive passages, and the fluctuation range of green fluorescence intensity between the 0th passage and the 60th passage was not more than 30%.
2、ELISA试剂盒检测EGFR抗体表达情况2. ELISA kit to detect the expression of EGFR antibody
检测方法:将实施例5所得细胞株在无血清培养条件下均连续传代60个代次,每隔15代收集该代次下细胞发酵上清,利用ELISA试剂盒检测发酵液中anti-EGFR含量,检测结果分析显示,实施例5中根据不同靶序列所构建细胞在不同代次时表达anti-EGFR的能力稳定,如图3所示。Detection method: The cell line obtained in Example 5 was continuously passaged for 60 generations under serum-free culture conditions, and the cell fermentation supernatant of this generation was collected every 15 generations, and the anti-EGFR content in the fermentation broth was detected by ELISA kit , the analysis of the detection results showed that the cells constructed according to different target sequences in Example 5 had a stable ability to express anti-EGFR at different passages, as shown in FIG. 3 .
在本发明实施例3中筛选得到的9组靶序列覆盖了本发明200bp碱基内的上、中、下游大部分序列,本发明CHO细胞基因NW_003614889.1内的第103331-103531碱基范围均能成功构建出定点整合的稳定表达细胞系,且均能稳定表达出目的蛋白。The 9 sets of target sequences screened in Example 3 of the present invention cover most of the upper, middle and downstream sequences within the 200 bp base of the present invention, and the 103331-103531 base range in the CHO cell gene NW_003614889.1 of the present invention is all A stable expression cell line with site-specific integration can be successfully constructed, and all of them can stably express the target protein.
以上所述实施例仅是为充分说明本发明而所举的较佳的实施例,本发明的保护范围不限于此。本技术领域的技术人员在本发明基础上所作的等同替代或变换,均在本发明的保护范围之内。本发明的保护范围以权利要求书为准。The above-mentioned embodiments are only preferred embodiments for fully illustrating the present invention, and the protection scope of the present invention is not limited thereto. Equivalent substitutions or transformations made by those skilled in the art on the basis of the present invention are all within the protection scope of the present invention. The protection scope of the present invention shall be determined by the claims.

Claims (10)

  1. 一种CHO细胞基因NW_003614889.1内稳定表达蛋白质的位点,其特征在于,所述的稳定表达蛋白质的位点位于CHO细胞基因NW_003614889.1的第103331-103531碱基内。A site for stably expressing protein in CHO cell gene NW_003614889.1, characterized in that the site for stably expressing protein is located within base 103331-103531 of CHO cell gene NW_003614889.1.
  2. 根据权利要求1所述的CHO细胞基因NW_003614889.1内稳定表达蛋白质的位点,其特征在于,所述的CHO细胞基因NW_003614889.1的第103331-103531碱基的核苷酸序列如SEQ ID NO.1所示。According to the site of stably expressing protein in the CHO cell gene NW_003614889.1 according to claim 1, it is characterized in that, the nucleotide sequence of the 103331-103531 base of the CHO cell gene NW_003614889.1 is as SEQ ID NO .1 shown.
  3. 根据权利要求1所述的CHO细胞基因NW_003614889.1内稳定表达蛋白质的位点,其特征在于,所述的稳定表达蛋白质的位点可被CRISPR/Cas9技术以5'NNNNNNNNNNNNNNNNNNNNNGG 3'作为靶序列识别。According to the site of stably expressing protein in CHO cell gene NW_003614889.1 according to claim 1, it is characterized in that, the site of stably expressing protein can be recognized by CRISPR/Cas9 technology with 5'NNNNNNNNNNNNNNNNNNNNNNNGG 3' as the target sequence .
  4. 根据权利要求1所述的CHO细胞基因NW_003614889.1内稳定表达蛋白质的位点,其特征在于,所述的蛋白质为分子量小于160KDa的蛋白质或多肽。The site for stably expressing protein in the CHO cell gene NW_003614889.1 according to claim 1, wherein the protein is a protein or polypeptide with a molecular weight less than 160KDa.
  5. 权利要求1~4任一项所述的CHO细胞基因NW_003614889.1内稳定表达蛋白质的位点在CHO细胞稳定表达外源蛋白质或多肽中的应用。The application of the stable expression protein site in the CHO cell gene NW_003614889.1 described in any one of claims 1 to 4 in the stable expression of foreign proteins or polypeptides in CHO cells.
  6. 根据权利要求5所述的应用,其特征在于,所述的应用具体是将外源蛋白质或多肽的编码基因构建在所述的CHO细胞基因NW_003614889.1内稳定表达蛋白质的位点上。The application according to claim 5, characterized in that, the application is specifically constructing the gene encoding the exogenous protein or polypeptide at the site of stably expressing the protein in the CHO cell gene NW_003614889.1.
  7. 一种用于在CHO细胞中表达蛋白质的表达载体,其特征在于,所述的蛋白质的编码基因位于所述的表达载体上5’同源臂和3’同源臂中间的区域,所述的5’同源臂和3’同源臂分别为权利要求1所述的稳定表达蛋白质的位点上下游长度为600bp的序列。An expression vector for expressing proteins in CHO cells, characterized in that the coding gene of the protein is located in the region between the 5' homology arm and the 3' homology arm of the expression vector, and the The 5' homology arm and the 3' homology arm are sequences with a length of 600 bp upstream and downstream of the site of stably expressing the protein according to claim 1, respectively.
  8. 根据权利要求7所述的表达载体,其特征在于,所述的表达载体上还包括位于所述的蛋白质的编码基因上游的启动子序列,所述的启动子控制所述的蛋白质的表达。The expression vector according to claim 7, characterized in that, the expression vector further comprises a promoter sequence located upstream of the gene encoding the protein, and the promoter controls the expression of the protein.
  9. 根据权利要求8所述的表达载体,其特征在于,所述的启动子为:人类巨细胞病毒来源的强哺乳动物表达启动子、人延长因子1α来源的强哺乳动物表达启动子、猿猴空泡病毒40来源的哺乳动物表达启动子、磷酸甘油酸酯激酶基因来源的哺乳动物启动子、人类泛素C基因来源的哺乳动物启动子、β-肌动蛋白基因来源的哺乳动物启动子、强杂交哺乳动物启动子中的一种。The expression vector according to claim 8, wherein the promoters are: a strong mammalian expression promoter derived from human cytomegalovirus, a strong mammalian expression promoter derived from human elongation factor 1α, and a simian vacuolar Mammalian expression promoter derived from virus 40, mammalian promoter derived from phosphoglycerate kinase gene, mammalian promoter derived from human ubiquitin C gene, mammalian promoter derived from β-actin gene, strong hybridization One of the mammalian promoters.
  10. 一种定点整合表达蛋白质的CHO重组细胞,其特征在于,所述的CHO重组细胞通过将权利要求7所述的用于在CHO细胞中表达蛋白质的表达载体、靶序列对应的sgRNA质粒以及Cas9质粒转入CHO细胞得到。A CHO recombinant cell for site-specific integration and expression of protein, characterized in that, the CHO recombinant cell is characterized in that the expression vector for expressing protein in CHO cells according to claim 7, the sgRNA plasmid corresponding to the target sequence and the Cas9 plasmid obtained by transferring into CHO cells.
PCT/CN2022/099466 2021-11-23 2022-06-17 Site for stably expressing protein in cho cell gene nw_003614889.1, and use thereof WO2023093007A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111395511.4A CN113969284B (en) 2021-11-23 2021-11-23 Site for stably expressing protein in CHO cell gene NW _003614889.1 and application thereof
CN202111395511.4 2021-11-23

Publications (1)

Publication Number Publication Date
WO2023093007A1 true WO2023093007A1 (en) 2023-06-01

Family

ID=79590155

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/099466 WO2023093007A1 (en) 2021-11-23 2022-06-17 Site for stably expressing protein in cho cell gene nw_003614889.1, and use thereof

Country Status (2)

Country Link
CN (1) CN113969284B (en)
WO (1) WO2023093007A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113969284B (en) * 2021-11-23 2022-07-12 江南大学 Site for stably expressing protein in CHO cell gene NW _003614889.1 and application thereof
CN113969283B (en) * 2021-11-23 2022-07-12 江南大学 Site for stably expressing protein in CHO cell gene NW _003613756.1 and application thereof

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007081336A1 (en) * 2006-01-13 2007-07-19 Five Prime Therapeutics, Inc. Mammalian vectors for high-level expression of recombinant proteins
CN102016025A (en) * 2008-02-27 2011-04-13 国立大学法人北海道大学 Expression vector for mass production of foreign gene-derived protein using animal cell and use thereof
CN109321604A (en) * 2018-10-30 2019-02-12 江南大学 NW_006882077-1 stablizes the application of expression protein in a kind of CHO cell genome
WO2021214173A2 (en) * 2020-04-22 2021-10-28 Lek Pharmaceuticals D.D. Super-enhancers for recombinant gene expression in cho cells
CN113969284A (en) * 2021-11-23 2022-01-25 江南大学 Site for stably expressing protein in CHO cell gene NW _003614889.1 and application thereof

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108441478A (en) * 2017-02-16 2018-08-24 成都贝爱特生物科技有限公司 A kind of genetically engineered cell system receiving fixed point integration of foreign gene
CN107723276B (en) * 2017-11-02 2021-08-13 上海交通大学 Construction method and kit for cell strain of stable high-expression target product
CN109295093B (en) * 2018-10-30 2021-08-03 江南大学 Application of NW _006882456-1 stable expression protein in CHO cell genome
CN109136193B (en) * 2018-10-30 2021-07-06 江南大学 Application of NW _006884764-1 stable expression protein in CHO cell genome
CN109295092B (en) * 2018-10-30 2021-06-29 江南大学 Application of NW _003613638-1 stable expression protein in CHO cell genome
CN109337927B (en) * 2018-10-30 2021-07-06 江南大学 Application of NW _006880285-1 stable expression protein in CHO cell genome
CN109207432B (en) * 2018-10-30 2021-08-03 江南大学 Application of NW _006883358-1 stable expression protein in CHO cell genome
CN110982790B (en) * 2019-12-26 2022-07-19 海珂分子(北京)科技有限责任公司 Cell strain for protein display and expression and preparation method and application thereof
CN114085841B (en) * 2021-11-23 2022-07-15 江南大学 Site for stably expressing protein in CHO cell gene NW _003614092.1 and application thereof
CN113969283B (en) * 2021-11-23 2022-07-12 江南大学 Site for stably expressing protein in CHO cell gene NW _003613756.1 and application thereof
CN114058625B (en) * 2021-11-25 2022-07-15 江南大学 Site for stably expressing protein in CHO cell gene NW _003613781.1 and application thereof

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007081336A1 (en) * 2006-01-13 2007-07-19 Five Prime Therapeutics, Inc. Mammalian vectors for high-level expression of recombinant proteins
CN102016025A (en) * 2008-02-27 2011-04-13 国立大学法人北海道大学 Expression vector for mass production of foreign gene-derived protein using animal cell and use thereof
CN109321604A (en) * 2018-10-30 2019-02-12 江南大学 NW_006882077-1 stablizes the application of expression protein in a kind of CHO cell genome
WO2021214173A2 (en) * 2020-04-22 2021-10-28 Lek Pharmaceuticals D.D. Super-enhancers for recombinant gene expression in cho cells
CN113969284A (en) * 2021-11-23 2022-01-25 江南大学 Site for stably expressing protein in CHO cell gene NW _003614889.1 and application thereof

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
HU WANWAN, DING XUEFENG;CAI YANFEI;CHEN YUN;DUAN ZUOYING;JIN JIAN;LI HUAZHONG: "Site-specific integration and stable expression of exogenous protein at a novel site on CHO cell chromosome", JOURNAL OF CHINA PHARMACEUTICAL UNIVERSITY, NAJING, CN, vol. 52, no. 4, 12 April 2021 (2021-04-12), CN , pages 487 - 495, XP093068728, ISSN: 1000-5048, DOI: 10.11665/j.issn.1000-5048.20210412 *

Also Published As

Publication number Publication date
CN113969284B (en) 2022-07-12
CN113969284A (en) 2022-01-25

Similar Documents

Publication Publication Date Title
WO2023093006A1 (en) Site capable of stably expressing protein in cho cell gene nw_003614092.1 and use thereof
WO2023093008A1 (en) Site for stably expressing protein in cho cell gene nw_003613781.1, and use thereof
WO2023093007A1 (en) Site for stably expressing protein in cho cell gene nw_003614889.1, and use thereof
WO2023093005A1 (en) Site capable of stably expressing protein in cho cell gene nw_003613756.1 and use thereof
WO2020088301A1 (en) Application of nw_006884764-1 in cho cell genome to stable expression of protein
CN107893073B (en) Method for screening glutamine synthetase defect type HEK293 cell strain
US11732276B2 (en) Use of genomic NW_006882077.1 in CHO cell for stably expressing a protein
WO2020088300A1 (en) Use of nw_003613638-1 in cho cell genome for stable protein expression
EP3670650A1 (en) Cell strain for reducing production of reproducible adenovirus, and construction method and use
WO2020088299A1 (en) Application of nw_006883358.1 in cho cell genome to stable expression of protein
WO2020088302A1 (en) Use of nw_006882456-1 in cho cell genome in stable expression of protein
US11692204B2 (en) Use of genomic NW_006880285.1 in CHO cell for stably expressing a protein
CN105695494B (en) Three-cistron expression vector, preparation method and application
CN113355360A (en) Construction method of GS gene knockout CHO-K1 cell strain and suspension cell monoclonality
EA037273B1 (en) Expression cassette
CN112175984A (en) Molecular cloning method based on synthetic gene and saccharomyces cerevisiae homologous recombination mechanism
CN110982790B (en) Cell strain for protein display and expression and preparation method and application thereof
US20180251790A1 (en) Method for Cell Line Development
WO2023246626A1 (en) Cell with targeted integration, method for preparing same, and method for producing target gene expression product
WO2023050169A1 (en) Method for achieving tag-to-taa conversion on genome with high throughput
CN110734929B (en) Transposon-mediated high-efficiency non-viral eukaryotic cell stable transfection method
CN116574735B (en) Polynucleotide and application thereof
WO2023050158A1 (en) Method for achieving multi-base editing
CN117683813A (en) EE0.6 site knockout vector and NC_006127.4 site knockout vector in DF-1 cells
EP4179085A1 (en) Guide rna for hsv-1 gene editing and method thereof

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22897104

Country of ref document: EP

Kind code of ref document: A1