CN114058625B - Site for stably expressing protein in CHO cell gene NW _003613781.1 and application thereof - Google Patents

Site for stably expressing protein in CHO cell gene NW _003613781.1 and application thereof Download PDF

Info

Publication number
CN114058625B
CN114058625B CN202111415460.7A CN202111415460A CN114058625B CN 114058625 B CN114058625 B CN 114058625B CN 202111415460 A CN202111415460 A CN 202111415460A CN 114058625 B CN114058625 B CN 114058625B
Authority
CN
China
Prior art keywords
protein
site
gene
expression
cho
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202111415460.7A
Other languages
Chinese (zh)
Other versions
CN114058625A (en
Inventor
陈蕴
金坚
瞿丽丽
丁学峰
李华钟
蔡燕飞
杨兆琪
朱景宇
鲁晨
刘辰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangnan University
Original Assignee
Jiangnan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangnan University filed Critical Jiangnan University
Priority to CN202111415460.7A priority Critical patent/CN114058625B/en
Publication of CN114058625A publication Critical patent/CN114058625A/en
Priority to PCT/CN2022/099467 priority patent/WO2023093008A1/en
Application granted granted Critical
Publication of CN114058625B publication Critical patent/CN114058625B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/113Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/87Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
    • C12N15/90Stable introduction of foreign DNA into chromosome
    • C12N15/902Stable introduction of foreign DNA into chromosome using homologous recombination
    • C12N15/907Stable introduction of foreign DNA into chromosome using homologous recombination in mammalian cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N5/00Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
    • C12N5/06Animal cells or tissues; Human cells or tissues
    • C12N5/0602Vertebrate cells
    • C12N5/0681Cells of the genital tract; Non-germinal cells from gonads
    • C12N5/0682Cells of the female genital tract, e.g. endometrium; Non-germinal cells from ovaries, e.g. ovarian follicle cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22Ribonucleases RNAses, DNAses
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2310/00Structure or type of the nucleic acid
    • C12N2310/10Type of nucleic acid
    • C12N2310/20Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2510/00Genetically modified cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/10Plasmid DNA
    • C12N2800/106Plasmid DNA for vertebrates
    • C12N2800/107Plasmid DNA for vertebrates for mammalian

Abstract

The invention discloses a site for stably expressing protein in a CHO cell gene NW _003613781.1 and application thereof, wherein the stable expression site obtained by the invention is positioned at the base in the range of 100bp upstream and downstream of the 1497187 th base of the CHO cell gene NW _003613781.1, namely the 1497087 + 1497287 th base, and can integrate a foreign protein gene and carry out stable expression. The invention integrates the target gene into the stable expression region in a fixed-point integration way, thus solving the problem of undefined integration site caused by random integration; the invention overcomes the expression instability caused by position effect and the repeated and fussy cell strain screening process by site-specific integration of exogenous genes in the base range of 1497087-1497287 at the 1497187 th base of a stable expression site NW-003613781.1 in a CHO genome.

Description

Site for stably expressing protein in CHO cell gene NW _003613781.1 and application thereof
Technical Field
The invention relates to a site for stably expressing protein in a CHO cell gene NW _003613781.1 and application thereof, belonging to the technical field of genes.
Background
Chinese Hamster Ovary (CHO) cells, established in the laboratory of dr. theodore t. puck in 1957, are immortalized, non-secretory cells, secreting little endogenous protein; meanwhile, the protein has the advantages of closer posttranslational modification to human natural protein, difficult infection by human viruses, large-scale culture in serum-free culture medium with definite chemical components and the like, is widely applied to the field of biological pharmacy, and produces more than 70 percent of protein medicines. However, CHO cells have long culture period and high culture cost as mammalian cells, and meanwhile, the demand for recombinant products such as monoclonal antibodies is increasing, and the continuously increasing demand means that the specific productivity needs to be optimized. Although the expression quantity can be improved by increasing the copy number of genes, developing a new strong promoter, searching a proper enhancer and the like, the expression level of most CHO cells in the long-term culture process is unstable, and the problems of product approval and product marketing of a supervision department are directly influenced. Therefore, the construction of a stable and high-expression CHO expression strain of the target gene is very important for the research, development and marketing of protein medicines.
There are two strategies for constructing stable and highly expressed cell lines. One is the traditional method of random integration combined with high-throughput screening, and the other is the method of combining gene editing technology and homology-directed repair to integrate the target gene into the predetermined chromosomal site in a targeted manner. However, due to uncertainty of integration sites and existence of position effects in random integration, the constructed cell genotypes have huge differences and the problem of unstable expression is easy to occur in the long-term passage process, so that the later screening process is very long, and the whole process of generating a recombinant cell line by a random integration mode in an industrial environment usually needs 6-12 months; compared with random integration, the site-directed integration utilizes a gene editing technology, particularly a genome site-directed editing technology widely applied in recent years and mediated by CRISPR/Cas9, greatly reduces the development time and cost, and because the sequence is known, the information after the site-directed integration is clearer and more definite than the information after the random integration.
Random integration is the most mature traditional method for constructing a protein expression system, but multiple screening is needed to obtain a stable high-expression cell strain in a random integration mode, so that the time consumption is long, and the cost is high. Meanwhile, for the cell line obtained by random integration, the loss of expression stability of the cells in the later period of culture cannot be predicted at all: such instability may not occur at all; it is also possible that the cells develop after an infinite number of divisions; it is also possible that after the cell has divided for only a few passages, a significant instability occurs. The stability problem not only affects the time of the product on the market, but also conflicts with the drug regulation management. In addition, the information of random integration sites is not clear, and the expression level of the target gene is obviously reduced due to the site effect of exogenous gene integration. According to the existing literature reports, the instability of the recombinant CHO cell line appears in all recombinant CHO cell lines, and the problem of unstable expression becomes an extremely common problem.
Disclosure of Invention
In order to solve the technical problems, the invention provides a site for stably expressing protein in a CHO cell genome, the site has clear information, the site can realize the site-specific integration of protein genes, can stably express the protein, can greatly shorten the screening process and time in the cell construction process, and reduces the research and development cost.
The first objective of the invention is to provide a site for stably expressing a protein in the CHO cell genome, wherein the site for stably expressing the protein is located within the 1497087-1497287 bases of the CHO cell gene NW _ 003613781.1.
Further, the nucleotide sequence of bases 1497087-1497287 of the CHO cell gene NW _003613781.1 is shown as SEQ ID NO. 1.
Further, when the CRISPR/Cas9 technology is used for site-specific transfer of the coding gene of the target protein, the site for stably expressing the protein can be recognized by the 5'NNNNNNNNNNNNNNNNNNNNNGG 3' sequence of the CRISPR/Cas9 technology.
In the present invention, this site is located at the third intron of the Pdgfc gene on the CHO cell genome NW _ 003613781.1.
Further, in the embodiment of the present invention, the 5'NNNNNNNNNNNNNNNNNNNNNGG 3' sequence is selected from the following 8 groups of sequences: 5'-GAGGATCCAACACTTTCCACTGG-3', 5'-CTATCCATTATTCACTATTTCGG-3', 5'-AATACCGAAATAGTGAATAATGG-3', 5'-CACATGCCAGTGGAAAGTGTTGG-3', 5'-GCACTATATGCACATGCCAGTGG-3', 5'-TAGTGCGCATACATTTGTATAGG-3', 5'-AAGATAGGAAGGATTTTGTTTGG-3', 5'-AGTGTTGGATCCTCTGAATCTGG-3'.
In the present invention, the 8 sequences cover most of the sequences at the upper, middle and lower reaches within 200 bases of the present invention within 1497087-1497287 bases of the CHO cell gene NW _003613781.1, indicating that 200 bases of the present invention can be used as sites for stably expressing proteins.
The present invention is not limited to the above 8 sequences, but the above 8 sequences are only preferred technical means for introducing the gene encoding the target protein into the stable expression site, and the purpose of stably expressing the protein of the present invention can be achieved by using other sequences or even other means for introducing the target gene.
Further, the protein is a protein with a molecular weight of less than 160 KDa.
Further, the protein is one of polypeptide, functional protein, antibody and fusion protein.
The second purpose of the invention is to provide the application of the site for stably expressing the protein in the CHO cell genome in the stable expression of the foreign protein in the CHO cell.
Furthermore, the application specifically comprises constructing a foreign protein or polypeptide encoding gene at a site where the protein is stably expressed in the CHO cell gene NW _ 003613781.1.
The third purpose of the invention is to provide an expression vector for expressing protein in CHO cells, wherein the coding gene of the protein is positioned in the middle region of a 5 'homologous arm and a 3' homologous arm on the expression vector, and the 5 'homologous arm and the 3' homologous arm are respectively sequences with the upstream and downstream length of 600bp of the site for stably expressing the protein.
In the invention, the expression vector is a vector suitable for CHO cell expression.
Furthermore, the expression vector also comprises a promoter sequence positioned at the upstream of the coding gene of the protein, and the promoter controls the expression of the protein.
Further, the promoter is: CMV (a strong mammalian expression promoter derived from human cytomegalovirus), EF-1a (a strong mammalian expression promoter derived from human elongation factor 1 α), SV40 (a mammalian expression promoter derived from simian vacuolating virus 40), PGK1 (a mammalian promoter derived from phosphoglycerate kinase gene), UBC (a mammalian promoter derived from human ubiquitin C gene), human beta actin (a mammalian promoter derived from beta-actin gene), CAG (a strong hybrid mammalian promoter), and the like.
In the present invention, there is also included a method of constructing an expression vector for expressing a protein in CHO cells, comprising the steps of: inserting the coding gene of the protein into the region between 5 'arm and 3' arm of the plasmid, and enabling the coding gene of the protein to be positioned at the downstream of the promoter and to be controlled by the promoter to obtain the expression vector for expressing the protein in the CHO cell.
The fourth purpose of the invention is to provide a CHO recombinant cell which can stably express protein in a fixed-point integration way, wherein the CHO recombinant cell is obtained by transferring the expression vector for expressing the protein in the CHO cell, sgRNA plasmid corresponding to a target sequence and Cas9 plasmid into the CHO cell.
Further, the target sequence is preferably 5'-GAGGATCCAACACTTTCCACTGG-3', 5'-CTATCCATTATTCACTATTTCGG-3', 5'-AATACCGAAATAGTGAATAATGG-3', 5'-CACATGCCAGTGGAAAGTGTTGG-3', 5'-GCACTATATGCACATGCCAGTGG-3', 5'-TAGTGCGCATACATTTGTATAGG-3', 5'-AAGATAGGAAGGATTTTGTTTGG-3' or 5'-AGTGTTGGATCCTCTGAATCTGG-3'.
In the present invention, there is also provided a method for constructing a CHO recombinant cell, comprising the steps of:
(1) transfecting the plasmid vector into a CHO cell by a liposome transfection mode to obtain a recombinant CHO cell pool;
wherein, the plasmids are the expression vector for expressing the protein in the CHO cell, the sgRNA plasmid corresponding to the target sequence and the Cas9 plasmid respectively;
(2) screening the recombinant cell pool to obtain a CHO recombinant cell expressing the foreign protein;
(3) culturing the CHO recombinant cells in an adherent manner, detecting the expression level of the protein, and carrying out suspension domestication on the high expression adherent CHO recombinant cells;
(4) and (3) culturing and verifying stability of the suspension domesticated CHO recombinant cells, and detecting the expression level of the protein.
The invention has the beneficial effects that:
the stable expression site obtained by the invention is located in the range of 100bp upstream and downstream of the 1497187 th base of the CHO cell gene NW _003613781.1, namely the 1497087-1497287 th base, and can integrate the foreign protein gene and carry out stable expression. The invention integrates the target gene into the stable expression region in a fixed-point integration way, thus solving the problem of unclear integration site caused by random integration; according to the invention, the exogenous gene is integrated at a fixed point in the range of the base 1497087-1497287 at the position of 1497187 base of the stable expression site NW _003613781.1 in the CHO genome, so that the expression instability and the repeated and complicated cell strain screening process caused by the position effect are overcome, the original screening time of 6-12 months is reduced to 1-3 months, the research and development time for constructing the stable expression cell line is effectively shortened, and the cost is reduced.
Description of the drawings:
FIG. 1 is a schematic diagram of a fixed point integration according to the present invention;
FIG. 2 shows the EGFP expression of cells constructed with different target sequences in different generations.
FIG. 3 shows the expression of IFN-. beta.HSA in cells constructed with different target sequences in different generations.
Detailed Description
The present invention is further described below with reference to specific examples so that those skilled in the art can better understand the present invention and can practice the present invention, but the examples are not intended to limit the present invention.
The related detection method comprises the following steps:
the method for measuring the average fluorescence intensity of the cells comprises the following steps: culturing the cells until the confluence reaches about 90%, digesting the cells by using 0.25% trypsin, terminating the digestion by using a complete culture medium with the same amount as the trypsin, collecting the cells in a sterile centrifuge tube, centrifuging for 5min at 1000rpm/min, discarding the supernatant, resuspending the cells by using PBS, collecting the cells in a flow-type sample tube through a cell filter screen, and analyzing the fluorescence intensity of the cells by using a flow cytometer by using a blank CHO-K1 cell as a control.
Example 1: screening for Stable expression sites
The CHO-K1-1b7 cells which are screened by a flow cytometer in high flux and express Zsgreen1 reporter genes are cultured to be in a good state under an adherent culture state, the CHO cells at the time are regarded as 0 generation, the CHO cells are continuously cultured for 20 generations, the conditions that the cells express Zsgreen1 protein at 0, 10 and 20 generations are observed under an inverted fluorescence microscope, and meanwhile, the average fluorescence intensity of the cells at 0, 10 and 20 generations is detected by a BD flow cytometer.
Through observation by an inverted fluorescence microscope and detection by a flow cytometer, after 20 generations of continuous passage, the CHO-K1-1b7 cells can still express Zsgreen1 protein in percentage under the adherent culture state, and the expression levels of the Zsgreen1 protein among different generations are basically consistent and have stronger green fluorescence signals.
Suspension domestication was performed on CHO-K1-1b7 cells verified for adherence stability, and 60 serial passages were performed on CHO-K1-1b7 cells successfully subjected to suspension domestication, and the cells were used as the 0 th passage when suspension domestication was successful. And observing the expression of the Zsgreen1 protein in the cells at 0, 10, 20, 30, 40, 50 and 60 generations under an inverted fluorescence microscope, and simultaneously detecting the average fluorescence intensity of the cells at 0, 10, 20, 30, 40, 50 and 60 generations by using a flow cytometer.
Through observation of an inverted fluorescence microscope and detection of a flow cytometer, after 60 generations of continuous passage, the CHO-K1-1b7 cells can still express Zsgreen1 protein in percentage under the suspension culture state, and the expression levels of the Zsgreen1 protein among different generations are basically consistent and have strong green fluorescence signals. The CHO-K1-1b7 cell is shown to be capable of stably expressing the Zsgreen1 reporter gene, and simultaneously, the integrated site of the lentivirus carrying the Zsgreen1 reporter gene is shown to be a stable expression site.
Example 2: lentiviral integration site analysis
The Integration Site of the lentiviral vector in CHO-K1-1b7 cells was analyzed using the Lenti-X Integration Site Analysis Kit (Clontech:631263) related to chromosome walking technology, and the specific steps were as follows:
(1) construction of lentivirus integration library
Collecting CHO-K1-1b7 cells, extracting a genome by using a DNA extraction kit, and performing enzyme digestion on the genome for 16-18h at 37 ℃ by using three restriction enzymes of DraI, SspI and HpaI respectively, wherein the enzyme digestion system is as follows:
Figure BDA0003375170250000051
purifying and recovering the product after enzyme digestion by using a PCR purification kit, and connecting chromosome walking joints genome Walker adapter to two ends of the purified enzyme digestion fragment, wherein the connection system is as follows:
Figure BDA0003375170250000052
after incubation overnight at 16 ℃ and 5 minutes at 70 ℃ and reaction termination, 32. mu.l TE (10/1, pH 7.5) was added to the system to obtain three lentivirus integration libraries.
(2) PCR amplification of lentivirus integration libraries
Two rounds of nested PCR were performed on each of the three lentivirus integration libraries obtained in step (1) of example 2. Using the adaptor primers AP1 and AP2 and the lentiviral sequence-specific primers LSP1 and LSP2 of the adaptor ligated in step (1) in example 2, LTR region was amplified from the adjacent genomic region of CHO-K1 cells
The one-round PCR reaction system is as follows:
Figure BDA0003375170250000061
the reaction procedure was as follows:
Figure BDA0003375170250000062
mu.l of one round of PCR product was diluted to 50. mu.l with deinized H2O.
The two-round PCR reaction system is as follows:
Figure BDA0003375170250000063
Figure BDA0003375170250000071
the reaction procedure was as follows:
Figure BDA0003375170250000072
(3) sequencing and analysis
The two rounds of PCR products were subjected to agarose gel electrophoresis and gel recovery sequencing, which was performed according to the Lenti-X Integration Site Analysis Kit (Clontech: 631263). The sequencing result is compared with the CHO cell genome at NCBI to obtain the information of the slow virus integration site, and the slow virus integration site in CHO-K1-1b7 cell is positioned at 1497187 th base of CHO cell genome NW _ 003613781.1.
Example 3: target sequence selection
According to the principle of being close, a CCTOP CRISPR/Cas9 online prediction system is used for predicting sequences of 100bp upstream and downstream of 1497187 base of a position NW _ 003613781.1: AAAATGTGTATAACTCCAGATTCAGAGGATCCAACACTTTCCACTGGCATGTGCATATAGTGCGCATACATTTGTATAGGCTAAAAACTCATAATATTATACAATAACAATAATAATGATAAATCTGTAATAACAATCTCTATCCATTATTCACTATTTCGGTATTTGCCAAACCAAACAAAATCCTTCCTATCTTTTCT (SEQ ID NO.1), and selecting the target sequence with higher editing efficiency.
The relevant parameters are set as follows:
1) the maximum number of mismatched bases of the first 13bp allowed in the 20bp sequence after NGG is 1;
2) and the number of mismatched bases of all 20bp after NGG is 4.
The CCTOP CRISPR/Cas9 online prediction system scores editing efficiency of the identified 5'NNNNNNNNNNNNNNNNNNNNNGG 3' target sequence, LOW efficacy (score < 0.56); MEDIUM efficacy (0.56 ═ score ≦ 0.74); HIGH efficacy (score > 0.74).
Selecting as the target sequence a sequence in which the predicted editing efficiency is higher than 0.56.
Target sequence 5'-GAGGATCCAACACTTTCCACTGG-3' (SEQ ID NO.2), score 0.68
Target sequence 5'-CTATCCATTATTCACTATTTCGG-3' (SEQ ID No.3), score 0.62
Target sequence 5'-AATACCGAAATAGTGAATAATGG-3' (SEQ ID No.4), score 0.67
Target sequence 5'-CACATGCCAGTGGAAAGTGTTGG-3' (SEQ ID No.5), score 0.66
Target sequence 5'-GCACTATATGCACATGCCAGTGG-3' (SEQ ID No.6), score 0.85
Target sequence 5'-TAGTGCGCATACATTTGTATAGG-3' (SEQ ID NO.7), score 0.67
Target sequence 5'-AAGATAGGAAGGATTTTGTTTGG-3' (SEQ ID NO.8), score 0.64
Target sequence 5'-AGTGTTGGATCCTCTGAATCTGG-3' (SEQ ID No.9), score 0.71.
Example 4: site-specific integration of EGFP
The CRISPR/Cas9 mediated genome site-directed editing technology and homologous recombination are utilized to integrate the green fluorescent protein gene (EGFP, 26.7KDa) at the target site in a site-directed manner. The CRISPR/Cas 9-mediated homologous recombination technology requires the construction of sgRNA Plasmid and Donor Plasmid, and the construction process is as follows:
1. sgRNA plasmid construction
1) Synthesis of an oligonucleotide chain according to the target sequence selected in example 3
sgRNA-F1 5'TTTGGAGGATCCAACACTTTCCACGT 3'(SEQ ID NO.10)
sgRNA-R1 5'TAAAAC GTGGAAAGTGTTGGATCCTC 3'(SEQ ID NO.11)
sgRNA-F2 5'TTTGCTATCCATTATTCACTATTTGT 3'(SEQ ID NO.12)
sgRNA-R2 5'TAAAAC AAATAGTGAATAATGGATAG 3'(SEQ ID NO.13)
sgRNA-F3 5'TTTGAATACCGAAATAGTGAATAAGT 3'(SEQ ID NO.14)
sgRNA-R3 5'TAAAAC TTATTCACTATTTCGGTATT 3'(SEQ ID NO.15)
sgRNA-F4 5'TTTGCACATGCCAGTGGAAAGTGTGT 3'(SEQ ID NO.16)
sgRNA-R4 5'TAAAAC ACACTTTCCACTGGCATGTG 3'(SEQ ID NO.17)
sgRNA-F5 5'TTTGGCACTATATGCACATGCCAGGT 3'(SEQ ID NO.18)
sgRNA-R5 5'TAAAAC CTGGCATGTGCATATAGTGC 3'(SEQ ID NO.19)
sgRNA-F6 5'TTTGTAGTGCGCATACATTTGTATGT 3'(SEQ ID NO.20)
sgRNA-R6 5'TAAAAC ATACAAATGTATGCGCACTA 3'(SEQ ID NO.21)
sgRNA-F7 5'TTTGAAGATAGGAAGGATTTTGTTGT 3'(SEQ ID NO.22)
sgRNA-R7 5'TAAAAC AACAAAATCCTTCCTATCTT 3'(SEQ ID NO.23)
sgRNA-F8 5'TTTGAGTGTTGGATCCTCTGAATCGT 3'(SEQ ID NO.24)
sgRNA-R8 5'TAAAACGATTCAGAGGATCCAACACT 3'(SEQ ID NO.25)
2) Separately annealing and connecting the synthesized oligonucleotide chains (1-8 pairs)
Figure BDA0003375170250000091
Performing metal bath at 95 ℃ for 5min, and naturally cooling to room temperature;
3) carrying out enzyme digestion on the PSK-u6-gRNA plasmid by BBsI enzyme, and carrying out gel recovery on the vector subjected to enzyme digestion;
4) connecting the recovered plasmid vector with the annealed oligonucleotide chain
Figure BDA0003375170250000092
Ligation was performed at 22 ℃ for 1h or at 4 ℃ overnight;
5) transformation to DH5 α competence;
6) selecting positive clones, and sequencing by using a universal primer M13 fwd;
7) and expanding and culturing the positive clone bacterial strain and extracting plasmid.
2. Construction of Donor plasmid: the Donor plasmid information is shown in FIG. 1, and is obtained by modifying a plasmid vector expressing EGFP. The 5 'arm and the 3' arm are respectively an upstream homologous arm and a downstream homologous arm of the target site recognized by each pair of sgRNAs, the length is 600bp, and the GOI is an integrated target gene.
1) Obtaining 5 'arm and 3' arm with the length of 600bp upstream and downstream of the site with the plasmid homologous fragment through primer design and PCR amplification;
2) respectively utilizing double enzyme digestion and glue recovery to cut out the original homology arm of the Donor plasma;
3) respectively connecting 5 'arm and 3' arm corresponding to the target site by a homologous recombination method;
4) and the EGFP sequence of the target gene is carried by the original plasmid.
3. The constructed sgRNA plasmid, Donor plasmid and Cas9-DTU plasmid (donated by dr. helene F Kildegaard, denmark science and technology university) were transfected by Lipofectamine 3000 transfection reagent at a ratio of 1.8: 1.8:1 mass ratio to CO-transfect CHO-K1 cells cultured at 37 ℃ under 5% CO2, and set blank control group. After 24h of transfection, pressure screening was performed using puromycin at 10. mu.g/ml until all the control cells were dead, the post-screening cell pool was expanded, and monoclonal cells emitting only green fluorescence and not emitting red fluorescence were sorted out using a BD flow cytometer.
4. After the cloning cell strain is amplified, a part of the amplified cell strain is extracted to obtain a genome, and the genome is identified by 5 'Junction PCR, 3' Junction PCR and out-out PCR, as shown in FIG. 1.
5. The positive clone cell line was retained.
Example 5: site-directed integration of IFN beta HSA
The gene (IFN beta-HSA, 89KDa) for expressing the interferon-beta-human serum albumin fusion protein is integrated at a target site in a fixed point way by using a CRISPR/Cas9 mediated genome fixed point editing technology and a homologous group. The CRISPR/Cas 9-mediated homologous recombination technology requires the construction of sgRNA Plasmid and Donor Plasmid, and the construction process is as follows:
1. sgRNA plasmid construction:
1) the oligonucleotide chain was synthesized based on the target sequence selected in example 3
sgRNA-F1 5'TTTGGAGGATCCAACACTTTCCACGT 3'
sgRNA-R1 5'TAAAAC GTGGAAAGTGTTGGATCCTC 3'
sgRNA-F2 5'TTTGCTATCCATTATTCACTATTTGT 3'
sgRNA-R2 5'TAAAAC AAATAGTGAATAATGGATAG 3'
sgRNA-F3 5'TTTGAATACCGAAATAGTGAATAAGT 3'
sgRNA-R3 5'TAAAAC TTATTCACTATTTCGGTATT 3'
sgRNA-F4 5'TTTGCACATGCCAGTGGAAAGTGTGT 3'
sgRNA-R4 5'TAAAAC ACACTTTCCACTGGCATGTG 3'
sgRNA-F5 5'TTTGGCACTATATGCACATGCCAGGT 3'
sgRNA-R5 5'TAAAAC CTGGCATGTGCATATAGTGC 3'
sgRNA-F6 5'TTTGTAGTGCGCATACATTTGTATGT 3'
sgRNA-R6 5'TAAAAC ATACAAATGTATGCGCACTA 3'
sgRNA-F7 5'TTTGAAGATAGGAAGGATTTTGTTGT 3'
sgRNA-R7 5'TAAAAC AACAAAATCCTTCCTATCTT 3'
sgRNA-F8 5'TTTGAGTGTTGGATCCTCTGAATCGT 3'
sgRNA-R8 5'TAAAACGATTCAGAGGATCCAACACT 3'
2) Separately annealing and connecting the synthesized oligonucleotide chains (1-8 pairs)
Figure BDA0003375170250000101
Performing metal bath at 95 ℃ for 5min, and naturally cooling to room temperature;
3) carrying out enzyme digestion on the PSK-u6-gRNA plasmid by BBsI enzyme, and carrying out gel recovery on the vector subjected to enzyme digestion;
4) connecting the recovered plasmid vector with the annealed oligonucleotide chain
Figure BDA0003375170250000111
Ligation was performed at 22 ℃ for 1h or at 4 ℃ overnight;
5) transformation to DH5 α competence;
6) selecting positive clones, and sequencing by using a universal primer M13 fwd;
7) expanding and culturing positive clone strains and improving quality;
2. construction of Donor Plasmid: the Donor plasma information is shown in fig. 1. The 5 'arm and the 3' arm are respectively an upstream homologous arm and a downstream homologous arm of the target site, the length is 600bp, and the GOI is an integrated target gene.
1) Obtaining 5 'arm and 3' arm with 600bp length upstream and downstream of the site with the plasmid homologous fragment through primer design and PCR amplification;
2) respectively utilizing double enzyme digestion and glue recovery to cut out the original homology arm of the Donor plasmid;
3) respectively connecting 5 'arm and 3' arm corresponding to the target site by a homologous recombination method;
4) and obtaining the target gene IFN beta-HSA through PCR amplification, and connecting the target gene IFN beta-HSA to a plasmid vector by utilizing enzyme digestion linkage.
3. The constructed sgRNA plasmid, Donor plasmid and Cas9-DTU plasmid (donated by dr. helene F Kildegaard, denmark science and technology university) were transfected with Lipofectamine 3000 transfection reagent at 1.8: 1.8:1 mass ratio cotransfected at 37 ℃ with 5% CO2CHO-K1 cells cultured under the conditions and a blank control was set. After 24h of transfection, pressure screening was performed using puromycin 10. mu.g/ml until all cells in the control group died, the cell pool after screening was expanded and the cells that did not fluoresce were sorted out using a BD flow cytometerCloning the cells.
4. After the cloning cell strain is expanded, a part of the extracted genome is identified by 5 'Junction PCR, 3' Junction PCR and out-out PCR, as shown in FIG. 1.
5. The positive clone cell line was retained.
Test example:
1. the green fluorescence intensity of the cell line constructed in example 4 was measured by a BD flow cytometer
The detection method comprises the following steps: the cell strains obtained in example 4 are continuously passaged for 60 generations, cells are collected for 10 generations, and the fluorescence and the intensity of the cells are detected by a flow cytometer, as shown in fig. 2, the detection result shows that more than 98% of the cells constructed according to different target sequences in example 4 still express green fluorescent protein after the cells are continuously passaged for 60 generations, and the fluctuation range of the green fluorescent intensity between the 0 th generation and the 60 th generation does not exceed 30%.
2. Urine microalbumin assay kit for detecting the expression of IFN beta-HSA in the cell line constructed in example 5
The detection method comprises the following steps: continuously passaging the cell strain obtained in example 5 under serum-free culture condition for 60 generations, collecting cell fermentation supernatant of the cell at every 10 generations, and detecting the HSA content in the fermentation liquor by using urine microalbumin assay kit according to the following formula
Figure BDA0003375170250000121
The expression level of IFN beta-HSA is calculated, and the analysis of the detection result shows that the cells constructed according to different target sequences in example 5 have stable ability of expressing IFN beta-HAS at different generations, as shown in FIG. 3.
The 8 sets of target sequences screened in the example 3 of the present invention cover most of the sequences in the 200bp base region, the middle and the downstream of the present invention, and the 1497087-containing 1497287 base region in the CHO cell gene NW _003613781.1 of the present invention can successfully construct a site-specific integration stable expression cell line and can stably express the target protein.
The above-mentioned embodiments are merely preferred embodiments for fully illustrating the present invention, and the scope of the present invention is not limited thereto. The equivalent substitutions or changes made by the person skilled in the art on the basis of the present invention are all within the protection scope of the present invention. The protection scope of the invention is subject to the claims.
Figure BDA0003375170250000131
Figure BDA0003375170250000141
Figure BDA0003375170250000151
Figure BDA0003375170250000161
Figure BDA0003375170250000171
Figure BDA0003375170250000181
Figure BDA0003375170250000191
Figure BDA0003375170250000201
Sequence listing
<110> university in south of the Yangtze river
<120> site for stably expressing protein in CHO cell gene NW _003613781.1 and use thereof
<160> 25
<170> SIPOSequenceListing 1.0
<210> 1
<211> 200
<212> DNA
<213> (Artificial sequence)
<400> 1
aaaatgtgta taactccaga ttcagaggat ccaacacttt ccactggcat gtgcatatag 60
tgcgcataca tttgtatagg ctaaaaactc ataatattat acaataacaa taataatgat 120
aaatctgtaa taacaatctc tatccattat tcactatttc ggtatttgcc aaaccaaaca 180
aaatccttcc tatcttttct 200
<210> 2
<211> 23
<212> DNA
<213> (Artificial sequence)
<400> 2
gaggatccaa cactttccac tgg 23
<210> 3
<211> 23
<212> DNA
<213> (Artificial sequence)
<400> 3
ctatccatta ttcactattt cgg 23
<210> 4
<211> 23
<212> DNA
<213> (Artificial sequence)
<400> 4
aataccgaaa tagtgaataa tgg 23
<210> 5
<211> 23
<212> DNA
<213> (Artificial sequence)
<400> 5
cacatgccag tggaaagtgt tgg 23
<210> 6
<211> 23
<212> DNA
<213> (Artificial sequence)
<400> 6
gcactatatg cacatgccag tgg 23
<210> 7
<211> 23
<212> DNA
<213> (Artificial sequence)
<400> 7
tagtgcgcat acatttgtat agg 23
<210> 8
<211> 23
<212> DNA
<213> (Artificial sequence)
<400> 8
aagataggaa ggattttgtt tgg 23
<210> 9
<211> 23
<212> DNA
<213> (Artificial sequence)
<400> 9
agtgttggat cctctgaatc tgg 23
<210> 10
<211> 26
<212> DNA
<213> (Artificial sequence)
<400> 10
tttggaggat ccaacacttt ccacgt 26
<210> 11
<211> 26
<212> DNA
<213> (Artificial sequence)
<400> 11
taaaacgtgg aaagtgttgg atcctc 26
<210> 12
<211> 26
<212> DNA
<213> (Artificial sequence)
<400> 12
tttgctatcc attattcact atttgt 26
<210> 13
<211> 26
<212> DNA
<213> (Artificial sequence)
<400> 13
taaaacaaat agtgaataat ggatag 26
<210> 14
<211> 26
<212> DNA
<213> (Artificial sequence)
<400> 14
tttgaatacc gaaatagtga ataagt 26
<210> 15
<211> 26
<212> DNA
<213> (Artificial sequence)
<400> 15
taaaacttat tcactatttc ggtatt 26
<210> 16
<211> 26
<212> DNA
<213> (Artificial sequence)
<400> 16
tttgcacatg ccagtggaaa gtgtgt 26
<210> 17
<211> 26
<212> DNA
<213> (Artificial sequence)
<400> 17
taaaacacac tttccactgg catgtg 26
<210> 18
<211> 26
<212> DNA
<213> (Artificial sequence)
<400> 18
tttggcacta tatgcacatg ccaggt 26
<210> 19
<211> 26
<212> DNA
<213> (Artificial sequence)
<400> 19
taaaacctgg catgtgcata tagtgc 26
<210> 20
<211> 26
<212> DNA
<213> (Artificial sequence)
<400> 20
tttgtagtgc gcatacattt gtatgt 26
<210> 21
<211> 26
<212> DNA
<213> (Artificial sequence)
<400> 21
taaaacatac aaatgtatgc gcacta 26
<210> 22
<211> 26
<212> DNA
<213> (Artificial sequence)
<400> 22
tttgaagata ggaaggattt tgttgt 26
<210> 23
<211> 26
<212> DNA
<213> (Artificial sequence)
<400> 23
taaaacaaca aaatccttcc tatctt 26
<210> 24
<211> 26
<212> DNA
<213> (Artificial sequence)
<400> 24
tttgagtgtt ggatcctctg aatcgt 26
<210> 25
<211> 26
<212> DNA
<213> (Artificial sequence)
<400> 25
taaaacgatt cagaggatcc aacact 26

Claims (7)

1. The application of a site for stably expressing a protein in a CHO cell gene NW _003613781.1 in stably expressing a foreign protein or polypeptide in a CHO cell, wherein the application is characterized in that a gene encoding the foreign protein or polypeptide is integrated at the site for stably expressing the protein in the CHO cell gene NW _ 003613781.1; the site for stable protein expression was located within bases 1497087-1497287 of the CHO cell gene NW _ 003613781.1.
2. The use according to claim 1, wherein the site of stable protein expression is recognized by CRISPR/Cas9 technology with 5'NNNNNNNNNNNNNNNNNNNNNGG 3' as target sequence.
3. The use according to claim 1, wherein the protein is a protein having a molecular weight of less than 160 KDa.
4. An expression vector for expressing a protein in CHO cells, wherein a coding gene of the protein is located in a region between a 5 'homologous arm and a 3' homologous arm on the expression vector, and the 5 'homologous arm and the 3' homologous arm are sequences with the upstream and downstream lengths of a site for stably expressing the protein being 600bp respectively; the site for stable protein expression was located within bases 1497087-1497287 of the CHO cell gene NW _ 003613781.1.
5. The expression vector of claim 4, further comprising a promoter sequence upstream of the gene encoding the protein, wherein the promoter controls the expression of the protein.
6. The expression vector of claim 5, wherein the promoter is: one of a human cytomegalovirus-derived strong mammalian expression promoter, a human elongation factor 1 α -derived strong mammalian expression promoter, a simian vacuolating virus 40-derived mammalian expression promoter, a phosphoglycerate kinase gene-derived mammalian promoter, a human ubiquitin C gene-derived mammalian promoter, a β -actin gene-derived mammalian promoter, and a strong hybrid mammalian promoter.
7. A CHO recombinant cell for expressing a protein in a site-specific integration manner, which is obtained by transferring the expression vector for expressing the protein in the CHO cell according to claim 4, a sgRNA plasmid corresponding to a target sequence and a Cas9 plasmid into the CHO cell.
CN202111415460.7A 2021-11-25 2021-11-25 Site for stably expressing protein in CHO cell gene NW _003613781.1 and application thereof Active CN114058625B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202111415460.7A CN114058625B (en) 2021-11-25 2021-11-25 Site for stably expressing protein in CHO cell gene NW _003613781.1 and application thereof
PCT/CN2022/099467 WO2023093008A1 (en) 2021-11-25 2022-06-17 Site for stably expressing protein in cho cell gene nw_003613781.1, and use thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111415460.7A CN114058625B (en) 2021-11-25 2021-11-25 Site for stably expressing protein in CHO cell gene NW _003613781.1 and application thereof

Publications (2)

Publication Number Publication Date
CN114058625A CN114058625A (en) 2022-02-18
CN114058625B true CN114058625B (en) 2022-07-15

Family

ID=80276484

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111415460.7A Active CN114058625B (en) 2021-11-25 2021-11-25 Site for stably expressing protein in CHO cell gene NW _003613781.1 and application thereof

Country Status (2)

Country Link
CN (1) CN114058625B (en)
WO (1) WO2023093008A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113969284B (en) * 2021-11-23 2022-07-12 江南大学 Site for stably expressing protein in CHO cell gene NW _003614889.1 and application thereof
CN113969283B (en) * 2021-11-23 2022-07-12 江南大学 Site for stably expressing protein in CHO cell gene NW _003613756.1 and application thereof
CN114058625B (en) * 2021-11-25 2022-07-15 江南大学 Site for stably expressing protein in CHO cell gene NW _003613781.1 and application thereof

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104818253B (en) * 2015-03-24 2018-03-09 成都贝爱特生物科技有限公司 Target transformation of site-directed integration Chinese hamster ovary celI system and application thereof
SG11202000966XA (en) * 2017-08-11 2020-02-27 Boehringer Ingelheim Int Integration sites in CHO cells
CN109295092B (en) * 2018-10-30 2021-06-29 江南大学 Application of NW _003613638-1 stable expression protein in CHO cell genome
CN109321604B (en) * 2018-10-30 2021-07-06 江南大学 Application of NW _006882077-1 stable expression protein in CHO cell genome
CN109136193B (en) * 2018-10-30 2021-07-06 江南大学 Application of NW _006884764-1 stable expression protein in CHO cell genome
CN109337927B (en) * 2018-10-30 2021-07-06 江南大学 Application of NW _006880285-1 stable expression protein in CHO cell genome
CN114058625B (en) * 2021-11-25 2022-07-15 江南大学 Site for stably expressing protein in CHO cell gene NW _003613781.1 and application thereof

Also Published As

Publication number Publication date
CN114058625A (en) 2022-02-18
WO2023093008A1 (en) 2023-06-01

Similar Documents

Publication Publication Date Title
CN114058625B (en) Site for stably expressing protein in CHO cell gene NW _003613781.1 and application thereof
CN114085841B (en) Site for stably expressing protein in CHO cell gene NW _003614092.1 and application thereof
CN106191116B (en) Foreign gene based on CRISPR/Cas9 knocks in integration system and its method for building up and application
WO2017190664A1 (en) Use of chemosynthetic crrna and modified crrna in crispr/cpf1 gene editing systems
CN113969284B (en) Site for stably expressing protein in CHO cell gene NW _003614889.1 and application thereof
CN107893073B (en) Method for screening glutamine synthetase defect type HEK293 cell strain
CN113969283B (en) Site for stably expressing protein in CHO cell gene NW _003613756.1 and application thereof
CN109136193B (en) Application of NW _006884764-1 stable expression protein in CHO cell genome
US11732276B2 (en) Use of genomic NW_006882077.1 in CHO cell for stably expressing a protein
WO2019120193A1 (en) Split single-base gene editing systems and application thereof
CN109295092B (en) Application of NW _003613638-1 stable expression protein in CHO cell genome
CN111254164A (en) Method for rapidly establishing CRISPR gene editing liver cancer cell strain and cell strain
US11692204B2 (en) Use of genomic NW_006880285.1 in CHO cell for stably expressing a protein
CN109295093B (en) Application of NW _006882456-1 stable expression protein in CHO cell genome
CN109207432B (en) Application of NW _006883358-1 stable expression protein in CHO cell genome
KR101947869B1 (en) Exploring hotspot method for expression of recombinant protein in cell line using next-generation sequencing
CN109402096A (en) A kind of AID enzyme mutant and its application
CN113528506B (en) DNA inversion system and application thereof and target DNA inversion method
JP3415840B2 (en) Method for producing foreign protein
CN113234759A (en) Preparation method of NAMPT gene modified human umbilical cord mesenchymal stem cell exosome
CN114540308A (en) Cell line for stably expressing orthogonal aminoacyl tRNA synthetase/tRNA and construction method
CN117683813A (en) EE0.6 site knockout vector and NC_006127.4 site knockout vector in DF-1 cells
JP2022543522A (en) Cell lines for high-level production of protein-based pharmaceuticals
CN116875597A (en) Vector for correcting LMNA cardiomyopathy pathogenic mutation and application thereof

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant