CN113584078B - CRISPR system for double-target gene editing and application thereof in construction of depressive pig nuclear transfer donor cells - Google Patents

CRISPR system for double-target gene editing and application thereof in construction of depressive pig nuclear transfer donor cells Download PDF

Info

Publication number
CN113584078B
CN113584078B CN202110065686.2A CN202110065686A CN113584078B CN 113584078 B CN113584078 B CN 113584078B CN 202110065686 A CN202110065686 A CN 202110065686A CN 113584078 B CN113584078 B CN 113584078B
Authority
CN
China
Prior art keywords
seq
bdnf
gene
pig
expression vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110065686.2A
Other languages
Chinese (zh)
Other versions
CN113584078A (en
Inventor
牛冬
汪滔
马翔
曾为俊
刘璐
王磊
程锐
赵泽英
陶裴裴
黄彩云
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Qizhen Genetic Engineering Co Ltd
Original Assignee
Nanjing Qizhen Genetic Engineering Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Qizhen Genetic Engineering Co Ltd filed Critical Nanjing Qizhen Genetic Engineering Co Ltd
Publication of CN113584078A publication Critical patent/CN113584078A/en
Application granted granted Critical
Publication of CN113584078B publication Critical patent/CN113584078B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/475Growth factors; Growth regulators
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N5/00Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
    • C12N5/06Animal cells or tissues; Human cells or tissues
    • C12N5/0602Vertebrate cells
    • C12N5/0652Cells of skeletal and connective tissues; Mesenchyme
    • C12N5/0656Adult fibroblasts
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22Ribonucleases RNAses, DNAses
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2510/00Genetically modified cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/10Plasmid DNA
    • C12N2800/106Plasmid DNA for vertebrates
    • C12N2800/107Plasmid DNA for vertebrates for mammalian
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A50/00TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
    • Y02A50/30Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change

Abstract

The invention discloses a CRISPR/Cas9 system for double-target gene editing and application thereof in constructing depressive pig nuclear transfer donor cells. A CRISPR/Cas9 system for pig BDNF and SLC6A4 gene editing, comprising a Cas9 expression vector, a gRNA expression vector for a pig BDNF gene, and a gRNA expression vector for a pig SLC6A4 gene; the whole sequence of the Cas9 expression vector plasmid is shown as SEQ ID NO. 2. The efficient expression vector of Cas9 modified by combining the screened gRNA is used for gene editing, and the editing efficiency is obviously improved compared with that of the original vector.

Description

CRISPR system for double-target gene editing and application thereof in construction of depressive pig nuclear transfer donor cells
Technical Field
The invention belongs to the technical field of biology, and particularly relates to a CRISPR/Cas9 system for BDNF and SLC6A4 gene editing and application thereof.
Background
Depression is the most common depressive disorder, characterized by a marked and persistent fall in mood, the major clinical feature, being the major type of mood disorder. Currently, there are over 2.64 billion depressed patients worldwide, and depression has become the second most common disorder in the world. In China, the number of patients with depression reaches 9000 ten thousand, but medical treatment and prevention of depression in China is still in the situation of low recognition rate, and the onset of depression starts to be in a trend of low age.
The cause of depression is not clear, but it is confirmed that many factors such as genetics, psychological and social environments are involved in the pathogenesis of depression. Genetically, depression is a disease that is highly heterogeneous and is affected by multiple genes, the effect of which on depression accounts for 40-50% of the risk of developing depression. Among them, brain-derived neurotrophic factor (BDNF) is a neuropsychiatric disease-related marker which has been widely studied, plays an important role in the pathophysiology of depression, and can cause depression when BDNF mRNA and related proteins in the hippocampus are decreased in level and neurons are damaged. In addition, depression is closely related to the 5-hydroxytryptamine (5-HIT) content encoded by the SLC6A4 gene in the central nervous system. Abnormalities in both BDNF and SLC6A4 have been widely recognized as being closely related to the occurrence of human depression, and mutation of either gene alone does not result in a major depressive phenotype, but when both genes are mutated at the same time, results in a more major depressive phenotype. Therefore, there is an urgent need to develop an animal model of depression based on BDNF and SLC6A4 gene mutations to solve the pathogenesis puzzle of depression as soon as possible and lay a foundation for further treatment of the disease. The animal model commonly used at present is a mouse model, but the mouse has huge differences from human in the aspects of body type, organ size, physiology, pathology and the like, and can not truly simulate normal physiological and pathological states of human beings. Pigs are major meat animals for a long time, have the size and physiological functions similar to those of human beings, are easy to breed and raise on a large scale, have lower requirements on ethical morals, animal protection and the like, and are ideal human disease model animals.
Gene editing is a biotechnology that has been greatly developed in recent years, and includes editing technologies from gene editing based on homologous recombination to ZFN, TALEN, CRISPR/Cas9 based on nucleases, and the CRISPR/Cas9 technology is currently the most advanced gene editing technology. Currently, gene editing techniques are increasingly applied to the production of animal models.
According to the invention, a CRISPR/Cas9 gene editing technology is adopted to knock out BDNF and SLC6A4 genes of pigs, and the BDNF and SLC6A4 gene double-knocked-out pig recombinant cells are prepared, so that a solid foundation is laid for further producing depression cloned pigs by a somatic cell cloning technology, and a powerful experimental tool is provided for researches such as depression treatment.
Disclosure of Invention
The object of the present invention is to address the above-mentioned deficiencies of the prior art, providing a CRISPR/Cas9 system for BDNF and SLC6A4 gene editing.
It is another object of the present invention to provide grnas and expression vectors thereof for BDNF and SLC6A4 gene editing.
The invention also aims to provide application of the CRISPR/Cas9 system in construction of BDNF and SLC6A4 gene mutated porcine recombinant cells.
The aim of the invention can be achieved by the following technical scheme:
A CRISPR/Cas9 system for pig BDNF and SLC6A4 gene editing, comprising a Cas9 expression vector, a gRNA expression vector for a pig BDNF gene, and a gRNA expression vector for a pig SLC6A4 gene; the Cas9 expression vector is a pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO vector with a plasmid full sequence shown as SEQ ID NO. 2.
To increase the gene editing capacity of the Cas9 Plasmid, we modified pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO (abbreviated as Plasmid pKG-GE 3) on the basis of the vector pX330-U6-Chimeric_BB-CBh-hSpCas9 (abbreviated as PX 330) purchased from adedge (Plasmid #42230,from Zhang Feng lab). The map of PX330 is shown in fig. 1, modified as follows:
1) Removing redundant invalid sequences in the gRNA skeleton of the original vector;
2) Modifying a promoter: the original promoter (chicken beta-actin promoter) is modified into EF1a promoter with higher expression activity, so that the protein expression capacity of the Cas9 gene is increased;
3) Increasing the nuclear localization signal: adding a nuclear localization signal coding sequence (NLS) at the N end and the C end of the Cas9, and increasing the nuclear localization capability of the Cas 9;
4) Adding double screening markers: the original vector does not have any screening mark, is not beneficial to screening and enrichment of positive transformed cells, and P2A-EGFP-T2A-PURO is inserted into the C end of Cas9, so that the fluorescence and resistance screening capability of the vector are endowed;
5) Inserting WPRE, 3' LTR and other sequences for regulating gene expression: the WPRE, 3' LTR and other sequences are inserted into the gene frame at last, so that the protein translation capacity of the Cas9 gene can be enhanced.
The modified vector pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO (called pKG-GE3 for short) is shown in figure 2, and the plasmid full sequence is shown in SEQ ID NO. 2; the main elements of pKG-GE3 are:
1) gRNA expression element: u6gRNA scaffold;
2) Promoter: EF1a promoter and CMV enhancer;
3) Cas9 gene comprising multiple NLSs: cas9 gene containing N-terminal and C-terminal polynuclear localization signals (NLS);
4) Screening marker genes: fluorescent and resistant double selectable marker element P2A-EGFP-T2A-PURO;
5) Element for enhancing translation: WPRE and 3' LTR enhance the translation efficiency of Cas9 and selectable marker genes;
6) Transcription termination signal: bGH polyA signal;
7) A carrier skeleton: including Amp resistance elements and ori replicons, and the like.
The plasmid pKG-GE3 has a specific fusion gene; the specific fusion gene codes for a specific fusion protein;
the specific fusion protein sequentially comprises the following elements from the N end to the C end: two Nuclear Localization Signals (NLS), cas9 protein, two nuclear localization signals, self-cleaving polypeptide P2A, fluorescent reporter protein, self-cleaving polypeptide T2A, resistance selection marker protein;
In the plasmid pKG-GE3, the EF1a promoter is used for promoting the expression of the specific fusion gene;
in plasmid pKG-GE3, the specific fusion gene has downstream a WPRE sequence element, a 3' LTR sequence element and a bGH poly (A) signal sequence element.
The plasmid pKG-GE3 has the following elements in this order: CMV enhancer, EF1a promoter, the specific fusion gene, WPRE sequence element, 3' LTR sequence element, bGH poly (A) signal sequence element.
In the specific fusion protein, two nuclear localization signals at the upstream of the Cas9 protein are SV40 nuclear localization signals, and two nuclear localization signals at the downstream of the Cas9 protein are nucleoplasin nuclear localization signals.
In the specific fusion protein, the fluorescent reporter protein can be EGFP protein.
In the specific fusion protein, the resistance screening marker protein can be a Puromycin protein.
The amino acid sequence of the self-cleaving polypeptide P2A is "ATNFSLLKQAGDVEENPGP" (the cleavage site where self-cleavage occurs is between the first amino acid residue and the second amino acid residue from the C-terminus).
The amino acid sequence of the self-cleaving polypeptide T2A is "EGRGSLLTCGDVEENPGP" (the cleavage site where self-cleavage occurs is between the first amino acid residue and the second amino acid residue from the C-terminus).
The specific fusion gene is specifically shown as 911-6706 nucleotides in SEQ ID NO. 2.
The CMV enhancer is shown as nucleotide 395-680 in SEQ ID NO. 2.
The EF1a promoter is shown as 682-890 nucleotides in SEQ ID NO. 2.
The WPRE sequence element is shown as nucleotide 6722-7310 of SEQ ID NO. 2.
The 3' LTR sequence element is shown as 7382-7615 nucleotides in SEQ ID NO. 2.
The bGH poly (A) signal sequence element is shown as 7647-7871 nucleotides in SEQ ID NO. 2.
As one preferable choice of the invention, the vector skeleton of the gRNA expression vector for the pig BDNF gene and the vector skeleton of the gRNA expression vector for the pig SLC6A4 gene are both pKG-U6gRNA, and the plasmid full sequence is shown as SEQ ID NO. 3.
As a preferable mode of the invention, the gRNA expression vector aiming at the BDNF gene of the pig expresses gRNA shown in SEQ ID NO.22, and the target point is shown in SEQ ID NO. 18.
As a further preferred aspect of the present invention, the gRNA expression vector for the pig BDNF gene is obtained by annealing single-stranded DNA shown in SEQ ID NO.26 and SEQ ID NO.27 to form double-stranded insert vector backbone pKG-U6 gRNA.
As a preferable mode of the invention, the gRNA expression vector aiming at the pig SLC6A4 gene expresses gRNA shown in SEQ ID NO.43, and the target point is shown in SEQ ID NO. 39.
As a further preferred aspect of the present invention, the gRNA expression vector for the SLC6A4 gene of swine is obtained by annealing single-stranded DNA shown in SEQ ID NO.48 and SEQ ID NO.49 to form a double-stranded insert vector backbone pKG-U6 gRNA.
The CRISPR/Cas9 system is applied to construction of BDNF and SLC6A4 gene mutated porcine recombinant cells.
The recombinant cell is obtained by verifying the CRISPR/Cas9 system co-transfected pig primary fibroblast used for pig BDNF and SLC6A4 gene editing.
The recombinant cell is applied to construction of BDNF and SLC6A4 gene knockout cloned pigs; preferably in constructing BDNF and SLC6A4 gene knockout depressed cloned pigs.
A gRNA expression vector for pig BDNF gene expresses gRNA shown in SEQ ID NO.22, the vector skeleton of the expression vector is pKG-U6gRNA, and the plasmid full sequence is shown in SEQ ID NO. 3.
A gRNA expression vector for the SLC6A4 gene of pig expresses gRNA shown in SEQ ID NO.43, the vector skeleton of the expression vector is pKG-U6gRNA, and the plasmid full sequence is shown in SEQ ID NO. 3.
Compared with the prior art, the invention has at least the following beneficial effects:
(1) The subject (pig) of the invention has better applicability than other animals (rats, mice, primates).
Rodents such as rats and mice have great differences from humans in terms of body type, organ size, physiology, pathology and the like, and cannot truly simulate normal physiological and pathological states of humans. Studies have shown that more than 95% of drugs that are validated in mice are ineffective in human clinical trials. In the case of large animals, primates are animals with the closest relationship to humans, but are small in size, late in sexual maturity (mating begins at 6-7 years old), and single animals, the population expansion rate is extremely slow, and the raising cost is also high. In addition, primate cloning is inefficient, difficult and costly.
The pig is an animal which has the closest relationship with human except primate, and has the similar body shape, weight, organ size and the like as human, and is very similar to human in terms of anatomy, physiology, immunology, nutrition metabolism, disease pathogenesis and the like. Meanwhile, the pigs are early in sexual maturity (4-6 months), have high fertility and have more piglets, and can form a larger group within 2-3 years. In addition, the cloning technology of pigs is very mature, and the cloning and feeding costs are much lower than those of primates.
(2) According to the invention, four gRNAs are designed aiming at pig BDNF and SLC6A4 genes, and the high-efficiency gRNAs are screened from the gRNAs and then the pre-set targets are knocked out, so that the workload of later-stage identification and screening can be effectively reduced, and the PCR product sequencing can be directly used for detecting the gene editing efficiency.
(3) The efficient expression vector of Cas9 modified by the invention is adopted for gene editing, and the editing efficiency is improved by more than 100% compared with the original vector.
(4) The efficient expression vector of the Cas9 modified by the invention is adopted for gene editing, and the genotype [ homozygous mutation (including mutation of the same bi-allele variation and mutation of different bi-allele variation), heterozygous mutation or wild type ] of the obtained cells can be analyzed according to the sequencing result of the target gene PCR product, wherein the probability of obtaining the homozygous mutation is 20% -30%, which is greatly superior to the probability (lower than 5%) of obtaining the homozygous mutation in a model preparation method (namely fertilized ovum injection gene editing material) using embryo injection technology.
(5) The cloning of somatic cell nuclear transfer animal by using the obtained mutant type monoclonal cell clone strain can directly obtain cloned pigs containing target gene mutation, and the mutation can be inherited stably.
The method of microinjection of gene editing material into fertilized ovum and embryo transplantation adopted in mouse model production is not suitable for large animal (such as pig) model production with long gestation period because the probability of directly obtaining homozygous mutant offspring is very low (less than 5%), and the offspring need to be hybridized and bred. Therefore, the method for editing and screening the positive editing single cell clone in vitro by the primary cells with high technical difficulty and high challenge is adopted, and the corresponding disease model pig is directly obtained by the somatic cell nuclear transfer animal cloning technology in the later period, so that the manufacturing period of the model pig can be greatly shortened, and the manpower, material resources and financial resources are saved.
The invention lays a solid foundation for obtaining the depression-suppressing pig model by a gene editing means, is helpful for researching and revealing the pathogenesis of depression caused by BDNF and SLC6A4 gene mutation, can be used for carrying out research such as drug screening, drug effect detection, disease pathology, gene therapy, cell therapy and the like, can provide effective experimental data for further clinical application, and also provides a powerful experimental means for successfully treating the depression caused by BDNF and SLC6A4 gene mutation in human beings. The invention has great application value for researching and developing depression drugs and revealing pathogenesis of the depression.
Drawings
FIG. 1 is a schematic diagram of the structure of plasmid pX 330.
FIG. 2 is a schematic structural diagram of plasmid pU6gRNACas 9.
FIG. 3 is a structural map of pU6gRNA-eEF1a Cas9 vector.
FIG. 4 is a pU6gRNA-eEF1a Cas9+nNLS vector map.
FIG. 5 is a schematic diagram of the structure of plasmid pKG-GE 3.
FIG. 6 is a schematic diagram of the structure of plasmid pKG-U6 gRNA.
FIG. 7 is a schematic representation of the insertion of a DNA molecule of about 20bp (target sequence binding region for transcription to form gRNA) into plasmid pKG-U6 gRNA.
FIG. 8 shows the sequencing results when the plasmid proportioning was optimized.
FIG. 9 shows the sequencing results when the effects of plasmid pX330 and plasmid pKG-GE3 are compared.
FIG. 10 is an electrophoresis chart of example 3 after PCR amplification using 18 pig genomic DNAs as templates.
FIG. 11 is a plot of sequencing peaks in step four of example 3.
FIG. 12 is an electrophoresis chart of example 4 after PCR amplification using 18 pig genomic DNAs as templates.
FIG. 13 is a plot of sequencing peaks in step four of example 4.
FIG. 14 is an electrophoresis chart of PCR products of BDNF gene-edited single cell clones obtained in example 5.
FIG. 15 is an electrophoretogram of the PCR product of the single cell clone of the SLC6A4 gene obtained in example 5.
FIG. 16 shows the results of forward sequencing of the single cell clone numbered BDNF-1 compared to the wild-type.
FIG. 17 shows the results of forward sequencing of a single cell clone numbered BDNF-3 compared to wild-type.
FIG. 18 is a forward sequencing of a single cell clone numbered BDNF-4 aligned with wild-type.
FIG. 19 is a forward sequencing of a single cell clone numbered BDNF-12 aligned with wild-type.
FIG. 20 is a result of forward sequencing of a single cell clone numbered SLC6A4-4 in comparison to wild type.
FIG. 21 is a forward sequencing of a single cell clone numbered SLC6A4-3 in comparison to wild type.
FIG. 22 shows the results of forward sequencing of single cell clones numbered SLC6A4-8 in comparison to wild type.
FIG. 23 shows the results of forward sequencing of single cell clones numbered SLC6A4-12 in comparison to wild type.
Detailed Description
The following detailed description of the invention is provided in connection with the accompanying drawings that are presented to illustrate the invention and not to limit the scope thereof. The examples provided below are intended as guidelines for further modifications by one of ordinary skill in the art and are not to be construed as limiting the invention in any way.
The experimental methods in the following examples, unless otherwise specified, are conventional methods, and are carried out according to techniques or conditions described in the literature in the field or according to the product specifications. Materials, reagents and the like used in the examples described below are commercially available unless otherwise specified. The recombinant plasmids constructed in the examples were all subjected to sequencing verification. Complete culture solution (% by volume): 15% fetal bovine serum (Gibco) +83% DMEM medium (Gibco) +1% Penicillin-Streptomycin (Gibco) +1% HEPES (Solarbio). Cell culture conditions: 37 ℃,5% CO 2 、5%O 2 Is a constant temperature incubator.
A method of preparing porcine primary fibroblasts: (1) taking 0.5g of pig ear tissue, removing hair, soaking in 75% alcohol for 30-40s, washing with PBS buffer solution containing 5% (volume ratio) Penicillin-Streptomycin (Gibco) for 5 times, and washing with PBS buffer solution for one time; (2) shearing the tissue with scissors, digesting with 5mL 1% collagenase solution (Sigma) at 37deg.C for 1h, centrifuging 500g for 5min, and discarding the supernatant; (3) the pellet was resuspended in 1mL of complete medium, then plated into 10-diameter cell culture dishes containing 10mL of complete medium and capped with 0.2% gelatin (VWR) and cultured to about 60% of the cell growth bottom; (4) after step (3) is completed, cells are digested and harvested with trypsin and then resuspended in complete medium.
Example 1 preparation of plasmids
1.1 preparation of plasmid pU6gRNA eEF1a-mNLS-hSpCas9-EGFP-PURO (abbreviated as plasmid pKG-GE 3)
The original plasmid pX330-U6-Chimeric_BB-CBh-hSpCas9 (abbreviated as plasmid pX 330) has a sequence shown in SEQ ID NO. 1. The schematic structure of plasmid pX330 is shown in fig. 1. In SEQ ID No.1, nucleotides 440 to 725 constitute the CMV enhancer, nucleotides 727 to 1208 constitute the chicken beta-actin promoter, nucleotides 1304 to 1324 encode the SV40 Nuclear Localization Signal (NLS), nucleotides 1325 to 5449 encode the Cas9 protein, and nucleotides 5450 to 5497 encode the nucleoplasin Nuclear Localization Signal (NLS).
Plasmid pU6gRNA eEF1a-mNLS-hSpCas9-EGFP-PURO (FIG. 5), abbreviated as plasmid pKG-GE3, and the nucleotide is shown as SEQ ID NO. 2. Compared with plasmid pX330, plasmid pKG-GE3 was mainly modified as follows: (1) removing residual gRNA backbone sequences (GTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTTTT), reducing interference; (2) the original chicken beta-actin promoter is modified into an EF1a promoter with higher expression activity, so that the protein expression capacity of the Cas9 gene is increased; (3) adding nuclear localization signal coding genes (NLS) at the upstream and downstream of the Cas9 gene, and increasing the nuclear localization capability of the Cas9 protein; (4) the original plasmid has no eukaryotic cell screening mark, is not beneficial to screening and enrichment of positive transformed cells, and is sequentially inserted with P2A-EGFP-T2A-PURO coding genes at the downstream of Cas9 genes, so that the carrier fluorescence and eukaryotic cell resistance screening capability are endowed; (5) the insertion of the WPRE element and the 3' ltr sequence element enhances the protein translation capacity of the Cas9 gene.
The construction method of the pKG-GE3 plasmid is as follows:
(1) Removal of superfluous and ineffective sequences from gRNA backbone
Plasmid pX330 was digested with BbsI and XbaI, the vector fragment (about 8313 bp) was recovered, the insert 175bp (SEQ ID NO. 4) was synthesized by a multi-fragment recombination method, and the pU6gRNACas9 vector was obtained by recombination with the recovered vector fragment (FIG. 2).
(2) Modified promoter and enhancer
The constructed pU6gRNACas9 vector was subjected to removal of the promoter (chicken beta-actin promoter) and enhancer sequence (CMV enhancer) by using XbaI and AgeI endonucleases, about 7650bp of the linear vector sequence was recovered, 554bp of the sequence (SEQ ID NO. 5) containing the CMV enhancer and EF1a promoter was synthesized by using a multi-fragment recombination method, and pU6gRNA-eEF1a Cas9 vector was obtained by recombination with the vector pU6gRNACas9 after cleavage (FIG. 3).
(3) N-terminal increased NLS sequence of Cas9 gene
And (3) carrying out enzyme digestion on the constructed vector pU6gRNA-eEF1a Cas9 by using AgeI and BglII, recovering 7786bp vector sequence, supplementing the sequence added with NLS to enzyme digestion sites, namely synthesizing 447bp Cas9 coding sequence (SEQ ID NO. 6) comprising 2 nuclear localization signals and partial excision by utilizing a multi-fragment recombination method, and recombining to obtain the pU6gRNA-eEF1a Cas9+nNLS vector (figure 4).
(4) Adding NLS, P2A-EGFP-T2A-PURO and WPRE-3' LTR-bGH polyA signals into C end of Cas9 gene
The vector constructed above is named pU6gRNA-eEF1a Cas9+nNLS, fseI and SbfI are used for digestion, 7781bp of vector sequence is recovered, and 2727bp of vector sequence is synthesized by a multi-fragment recombination method
The fragment of NLS-P2A-EGFP-T2A-PURO-WPRE-3' LTR-bGH polyA signals (SEQ ID NO. 7) is recombined with the vector fragment to obtain a vector pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO, called pKG-GE3 for short, the plasmid map is shown in FIG. 5, and the nucleotide sequence (SEQ ID NO. 2).
In SEQ ID NO.2, nucleotides 395 to 680 constitute the CMV enhancer, nucleotides 682 to 890 constitute the EF1a promoter, nucleotides 986 to 1006 encode the Nuclear Localization Signal (NLS), nucleotides 1016 to 1036 encode the Nuclear Localization Signal (NLS), nucleotides 1037 to 5161 encode the Cas9 protein, nucleotides 5162 to 5209 encode the Nuclear Localization Signal (NLS), nucleotides 5219 to 5266 encode the Nuclear Localization Signal (NLS), nucleotides 5276 to 5332 encode the self-cleaving polypeptide P2A (the amino acid sequence of the self-cleaving polypeptide P2A is "ATNFSLLKQAGDVEENPGP", the cleavage site where self-cleavage occurs is between the first amino acid residue and the second amino acid residue from the C-terminus), nucleotide numbers 5333-6046 encode EGFP protein, nucleotide numbers 6056-6109 encode self-cleaving polypeptide T2A (the amino acid sequence of self-cleaving polypeptide T2A is EGRGSLLTCGDVEENPGP, the cleavage site where self-cleavage occurs is between the first amino acid residue and the second amino acid residue from the C-terminus), nucleotide numbers 6110-6703 encode Puromycin protein (called Puro protein for short), nucleotide numbers 6722-7310 constitute WPRE sequence element, nucleotide numbers 7382-7615 constitute 3' LTR sequence element, and nucleotide numbers 7647-7871 constitute bGH poly (A) signal sequence element. In SEQ ID No.2, the 911-6706 genes form fusion genes, expressing fusion proteins. Due to the presence of self-cleaving polypeptide P2A and self-cleaving polypeptide T2A, the fusion protein spontaneously forms three proteins: proteins with Cas9 protein, proteins with EGFP protein, and proteins with Puro protein.
1.2 construction of pKG-U6gRNA vector
The pUC57 vector is obtained by connecting a pKG-U6gRNA insertion sequence (a DNA fragment containing a U6 promoter, a BbsI restriction enzyme site and an sgRNA framework sequence, the sequence is shown as SEQ ID NO. 8) through an EcoRV restriction enzyme site, reversely inserting the DNA fragment into the pUC57 vector to obtain a complete sequence (SEQ ID NO. 3) of the pKG-U6gRNA vector, wherein nucleotides 2280 to 2539 form an hU6 promoter, and nucleotides 2558 to 2637 are used for transcription to form a gRNA framework. In use, a DNA molecule of about 20bp (target sequence binding region for transcription to form gRNA) is inserted into plasmid pKG-U6gRNA (FIG. 7) to form a recombinant plasmid, which is transcribed in cells to give gRNA. The constructed pKG-U6gRNA vector is shown in FIG. 6.
Example 2 comparison of the effects of plasmid pX330 and plasmid pKG-GE3
Selecting a high-efficiency gRNA target located in the RAG1 gene:
target for RAG1-gRNA 4: 5'-AGTTATGGCAGAACTCAGTG-3' (SEQ ID NO. 9).
Primers used to amplify and detect fragments containing the target were as follows:
RAG1-nF126:5’-CCCCATCCAAAGTTTTTAAAGGA-3’(SEQ ID NO.10);
RAG1-nR525:5’-TGTGGCAGATGTCACAGTTTAGG-3’(SEQ ID NO.11)
porcine primary fibroblasts were prepared from ear tissue of a junior river-flavored pig (female, blood group AO).
1. Preparation of recombinant plasmids
Plasmid pKG-U6gRNA was taken and digested with restriction enzyme BbsI, and the vector backbone (about 3kb linear fragment) was recovered. RAG1-4S and RAG1-4A were synthesized separately, and then mixed and annealed to give a double-stranded DNA molecule having cohesive ends. The double-stranded DNA molecule having a cohesive end and the vector backbone were ligated to obtain plasmid pKG-U6gRNA (RAG 1-gRNA 4).
RAG1-4S:5’-caccgAGTTATGGCAGAACTCAGTG-3’(SEQ ID NO.12);
RAG1-4A:5’-aaacCACTGAGTTCTGCCATAACTc-3’(SEQ ID NO.13)。
RAG1-4S and RAG1-4A are single stranded DNA molecules.
2. Plasmid proportioning optimization
A first group: the plasmid pKG-U6gRNA (RAG 1-gRNA 4) and plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.44. Mu.g plasmid pKG-U6gRNA (RAG 1-gRNA 4): 1.56. Mu.g of plasmid pKG-GE3. Namely, the molar ratio of the plasmid pKG-U6gRNA (RAG 1-gRNA 4) to the plasmid pKG-GE3 is as follows: 1:1.
second group: the plasmid pKG-U6gRNA (RAG 1-gRNA 4) and plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.72. Mu.g plasmid pKG-U6gRNA (RAG 1-gRNA 4): 1.28. Mu.g of plasmid pKG-GE3. Namely, the molar ratio of the plasmid pKG-U6gRNA (RAG 1-gRNA 4) to the plasmid pKG-GE3 is as follows: 2:1.
third group: the plasmid pKG-U6gRNA (RAG 1-gRNA 4) and plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (RAG 1-gRNA 4): 1.08 μg of plasmid pKG-GE3. Namely, the molar ratio of the plasmid pKG-U6gRNA (RAG 1-gRNA 4) to the plasmid pKG-GE3 is as follows: 3:1.
fourth group: plasmid pKG-U6gRNA (RAG 1-gRNA 4) was transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: mu.g of plasmid pKG-U6gRNA (RAG 1-gRNA 4).
Co-transfection was performed by electric shock transfection using a mammalian nuclear transfection kit (Neon kit, thermofiser) and a Neon TM transfection system electrotransfection apparatus (parameters set to 1450V, 10ms, 3 pulses).
2. After the step 1 is completed, the culture is carried out for 16 to 18 hours by adopting the complete culture solution, and then the culture is carried out by replacing the new complete culture solution. The total incubation time was 48 hours.
3. After step 2 is completed, cells are digested and collected by trypsin, genomic DNA is extracted, PCR amplification is performed by using a primer pair consisting of RAG1-nF126 and RAG1-nR525, and then electrophoresis is performed.
The band of interest was recovered after electrophoresis and sequenced, and the sequencing results are shown in FIG. 8.
The efficiency of editing of different targets was obtained by analyzing the sequencing peak plots using the synthetic ICE tool. The gene editing efficiency of the first group to the third group was 9%, 53%, 66% in this order. The fourth group did not undergo gene editing. The results showed that the third set of edits were most efficient, determining the optimum amount of single gRNA plasmid to Cas9 plasmid as a molar ratio of 3:1, the actual amount of plasmid was 0.92. Mu.g: 1.08 μg. 3. Comparison of the effects of plasmid pX330 and plasmid pKG-GE3
1. Co-transfection
RAG1-B group: plasmid pKG-U6gRNA (RAG 1-gRNA 4) was transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (RAG 1-gRNA 4).
RAG1-330 group: plasmid pKG-U6gRNA (RAG 1-gRNA 4) and plasmid pX330 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (RAG 1-gRNA 4): 1.08 μg of plasmid pX330.
RAG1-KG group: the plasmid pKG-U6gRNA (RAG 1-gRNA 4) and plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (RAG 1-gRNA 4): 1.08 μg of plasmid pKG-GE3.
Co-transfection was performed by electric shock transfection using a mammalian nuclear transfection kit (Neon kit, thermofiser) and a Neon TM transfection system electrotransfection apparatus (parameters set to 1450V, 10ms, 3 pulses).
2. After the step 1 is completed, the culture is carried out for 16 to 18 hours by adopting the complete culture solution, and then the culture is carried out by replacing the new complete culture solution. The total incubation time was 48 hours.
3. After step 2 is completed, cells are digested and collected by trypsin, genomic DNA is extracted, PCR amplification is carried out by using a primer pair consisting of RAG1-nF126 and RAG1-nR525, and the products are sequenced.
The efficiency of editing of different targets was obtained by analyzing the sequencing peak plots using the synthetic ICE tool. No gene editing occurred in RAG1-B groups. The editing efficiency of RAG1-330 groups and RAG1-KG groups is 28% and 68% in sequence. Exemplary peak diagrams of sequencing results are shown in FIG. 9. The results show that the use of plasmid pKG-GE3 results in a significant increase in gene editing efficiency compared to the use of plasmid pX330.
Example 3 target screening for BDNF Gene knockout
Pig BDNF gene information: encoding brain derived neurotrophic factor protein; is located on chromosome 2 of pig; geneID is 397495,Sus scrofa. The protein encoded by the porcine BDNF gene is shown as GENBANK ACCESSION NO. XP_005654741.1 (linear CON 12-JAN-2018). In the genome DNA, the pig BDNF gene has 6 exons, wherein the 6 th exon and 400bp sequences on the upstream and downstream of the 6 th exon are shown as SEQ ID NO.14, and the coded protein fragment is shown as SEQ ID NO. 15.
1. BDNF gene knockout preset target point and adjacent genome sequence conservation analysis
18 junior from Jiang fragrant pigs, of which 10 females (named 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, respectively) and 8 males (named A, B, C, D, E, F, G, H, respectively) were female.
PCR amplification was performed using 18 pig genomic DNAs as templates, respectively, and primer pairs (the target sequences of the primer pairs include exon 6 of the pig BDNF gene) were then subjected to electrophoresis. And (3) recovering PCR amplified products, sequencing, and comparing the sequencing results with BDNF and SLC6A4 gene sequences in a public database for analysis. Based on the results of the alignment, primers for detecting the mutation (the primers themselves avoid possible mutation sites) were designed. The primers designed for mutation detection were: BDNF-E6-F/BDNF-E-R. The electrophoresis diagram of 18 pig genome DNA amplified by the primer pair composed of BDNF-E6-F/BDNF-E6-R is shown in figure 10.
BDNF-E6-F:5’-ACGTGATCGAGGAGCTGTTG-3’(SEQ ID NO.16);
BDNF-E6-R:5’-TTGCTATCCATGGTGAGGGC-3’(SEQ ID NO.17)。
2. Screening target
A plurality of targets are initially screened by screening NGG (avoiding possible mutation sites), and 4 targets are further screened from the targets through preliminary experiments.
The 4 targets were as follows:
sgRNA BDNF-E6-g1 target point: 5'-AGACCAAGTGCAATCCTATG-3' (SEQ ID NO. 18);
sgRNA BDNF-E6-g2 target point: 5'-CATGTCGGGTGGCACGGTCA-3' (SEQ ID NO. 19);
sgRNA BDNF-E6-g3 target point: 5'-GGCGGATAAAAAGACGGCAG-3' (SEQ ID NO. 20);
sgRNA BDNF-E6-g4 target point: 5'-GAAAAAGTCCCCGTCTCGAA-3' (SEQ ID NO. 21).
3. Preparation of recombinant plasmids
Plasmid pKG-U6gRNA was taken and digested with restriction enzyme BbsI, and the vector backbone (about 3kb linear fragment) was recovered.
BDNF-E6-gRNA1-S and BDNF-E6-gRNA1-A were synthesized separately, and then mixed and annealed to give double-stranded DNA molecules with cohesive ends. The double-stranded DNA molecule having a cohesive end was ligated to the vector backbone to give plasmid pKG-U6gRNA (BDNF-E6-g 1). Plasmid pKG-U6gRNA (BDNF-E6-g 1) expresses the sgRNA shown in SEQ ID NO.22 BDNF-E6-g1
BDNF-E6-gRNA2-S and BDNF-E6-gRNA2-A were synthesized separately, and then mixed and annealed to give double-stranded DNA molecules with cohesive ends. The double-stranded DNA molecule with cohesive ends and the vector backbone were ligated to give plasmid pKG-U6gRNA (BDN) F-E6-g 2). Plasmid pKG-U6gRNA (BDNF-E6-g 2) expresses the sgRNA shown in SEQ ID NO.23 BDNF-E6-g2
BDNF-E6-gRNA3-S and BDNF-E6-gRNA3-A were synthesized separately, and then mixed and annealed to give double-stranded DNA molecules with cohesive ends. The double-stranded DNA molecule having a cohesive end was ligated to the vector backbone to give plasmid pKG-U6gRNA (BDNF-E6-g 3). Plasmid pKG-U6gRNA (BDNF-E6-g 3) expresses the sgRNA shown in SEQ ID NO.24 BDNF-E6-g3
BDNF-E6-gRNA4-S and BDNF-E6-gRNA4-A were synthesized separately, and then mixed and annealed to give double-stranded DNA molecules with cohesive ends. The double-stranded DNA molecule having a cohesive end was ligated to the vector backbone to give plasmid pKG-U6gRNA (BDNF-E6-g 4). Plasmid pKG-U6gRNA (BDNF-E6-g 4) expresses the sgRNA shown in SEQ ID NO.25 BDNF-E6-g1
BDNF-E6-gRNA1-S:5’-caccgAGACCAAGTGCAATCCTATG-3’(SEQ ID NO.26);
BDNF-E6-gRNA1-A:5’-aaacCATAGGATTGCACTTGGTCTc-3’(SEQ ID NO.27);
BDNF-E6-gRNA2-S:5’-caccgCATGTCGGGTGGCACGGTCA-3’(SEQ ID NO.28);
BDNF-E6-gRNA2-A:5’-aaacTGACCGTGCCACCCGACATGc-3’(SEQ ID NO.29);
BDNF-E6-gRNA3-S:5’-caccGGCGGATAAAAAGACGGCAG-3’(SEQ ID NO.30);
BDNF-E6-gRNA3-A:5’-aaacCTGCCGTCTTTTTATCCGCC-3’(SEQ ID NO.31);
BDNF-E6-gRNA4-S:5’-caccGAAAAAGTCCCCGTCTCGAA-3’(SEQ ID NO.32);
BDNF-E6-gRNA4-A:5’-aaacTTCGAGACGGGGACTTTTTC-3’(SEQ ID NO.33)。
BDNF-E6-gRNA1-S, BDNF-E6-gRNA1-A, BDNF-E6-gRNA2-S, BDNF-E6-gRNA2-A, BDNF-E6-gRNA3-S, BDNF-E6-gRNA3-A, BDNF-E6-gRNA4-S, BDNF-E6-gRNA4-A are all single stranded DNA molecules.
4. Editing efficiency comparison of different targets
Porcine primary fibroblasts were prepared from ear tissue of a junior river-flavored pig (female, blood group AO).
1. Co-transfection
A first group: plasmid pKG-U6gRNA (BDNF-E6-g 1) and plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (BDNF-E6-g 1): 1.08 μg of plasmid pKG-GE3.
Second group: plasmid pKG-U6gRNA (BDNF-E6-g 2) and plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (BDNF-E6-g 2): 1.08 μg of plasmid pKG-GE3.
Third group: plasmid pKG-U6gRNA (BDNF-E6-g 3) and plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (BDNF-E6-g 3): 1.08 μg of plasmid pKG-GE3.
Fourth group: plasmid pKG-U6gRNA (BDNF-E6-g 4) and plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (BDNF-E6-g 4): 1.08 μg of plasmid pKG-GE3.
Fifth group: pig primary fibroblasts were not subjected to any transfection procedure.
Co-transfection was performed by electric shock transfection using a mammalian nuclear transfection kit (Neon kit, thermofiser) and a Neon TM transfection system electrotransfection apparatus (parameters set to 1450V, 10ms, 3 pulses).
2. After the step 1 is completed, the culture is carried out for 16 to 18 hours by adopting the complete culture solution, and then the culture is carried out by replacing the new complete culture solution. The total incubation time was 48 hours.
3. After step 2 was completed, cells were digested with trypsin and collected, then lysed and genomic DNA was extracted, and PCR amplification was performed using primer pairs consisting of BDNF-E6-F and BDNF-E6-R, followed by electrophoresis. The target fragment was recovered and sequenced, and the sequencing peaks are shown in FIG. 11. Analysis of the sequencing peak map using the synthetic ICE tool resulted in gene editing efficiencies for different targets. The gene editing efficiency of the first group to the fourth group was 36%, 7%, 1% in this order. The fifth group did not undergo gene editing. The results showed that the first set of editing was most efficient, sgRNA BDNF-E6-g1 Is the most targetedAnd (5) optimizing target points.
Example 4 target screening for SLC6A4 Gene knockout
Pig SLC6A4 gene information: encoding solute carrier family membrane 4 protein; is located on chromosome 12 of pig; geneID is 100517307,Sus scrofa. The protein encoded by the SLC6A4 gene of the pig is shown as GENBANK ACCESSION No. XP_020923179.1 (linear CON 12-JAN-2018). In the genome DNA, the pig SLC6A4 gene has 16 exons, wherein the 4 th exon and 400bp sequences on the upper and lower sides thereof are shown as SEQ ID NO.34, and the encoded protein fragment is shown as SEQ ID NO. 35.
1. SLC6A4R gene knockout preset target point and adjacent genome sequence conservation analysis
18 junior from Jiang fragrant pigs, of which 10 females (named 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, respectively) and 8 males (named A, B, C, D, E, F, G, H, respectively) were female.
PCR amplification was performed using 18 pig genomic DNAs as templates, respectively, using primer pairs (the target sequences of the primer pairs include exon 4 of the pig SLC6A4 gene), followed by electrophoresis. And (3) recovering PCR amplified products, sequencing, and comparing the sequencing results with SLC6A4 and SLC6A4 gene sequences in a public database for analysis. Based on the results of the alignment, primers for detecting the mutation (the primers themselves avoid possible mutation sites) were designed. The primers designed for mutation detection were:
SLC6A4-E4-F/SLC6A4-E-R. The electrophoresis pattern of 18 pig genome DNA amplified by PCR using primer pair composed of SLC6A4-E4-F/SLC6A4-E4-R is shown in FIG. 12.
SLC6A4-E4-F:5’-GCGTTTACCTTGTTGGCACT-3’(SEQ ID NO.36);
SLC6A4-E4-R:5’-GAGCATCCAGGTGACGTTGT-3’(SEQ ID NO.37)。
2. Screening target
A plurality of targets are initially screened by screening NGG (avoiding possible mutation sites), and 4 targets are further screened from the targets through preliminary experiments.
The 4 targets were as follows:
sgRNA SLC6A4-E4-g1 target point: 5' -GTAGAAGGCGATGACGCAGA-3’(SEQ ID NO.38);
sgRNA SLC6A4-E4-g2 Target point: 5'-CATGATGGTGTTGTAGTAGG-3' (SEQ ID NO. 39);
sgRNA SLC6A4-E4-g3 target point: 5'-CTCCTACTACAACACCATCA-3' (SEQ ID NO. 40);
sgRNA SLC6A4-E4-g4 target point: 5'-CTACAACACCATCATGGCCT-3' (SEQ ID NO. 41).
3. Preparation of recombinant plasmids
Plasmid pKG-U6gRNA was taken and digested with restriction enzyme BbsI, and the vector backbone (about 3kb linear fragment) was recovered.
SLC6A4-E4-gRNA1-S and SLC6A4-E4-gRNA1-A were synthesized separately, and then mixed and annealed to give double-stranded DNA molecules with cohesive ends. The double-stranded DNA molecule having a cohesive end was ligated to the vector backbone to give plasmid pKG-U6gRNA (SLC 6A4-E4-g 1). Plasmid pKG-U6gRNA (SLC 6A4-E4-g 1) expresses the sgRNA shown in SEQ ID NO.42 SLC6A4-E4-g1
SEQ ID NO.42:
GUAGAAGGCGAUGACGCAGAguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
SLC6A4-E4-gRNA2-S and SLC6A4-E4-gRNA2-A were synthesized separately, and then mixed and annealed to give double-stranded DNA molecules with cohesive ends. The double-stranded DNA molecule having a cohesive end was ligated to the vector backbone to give plasmid pKG-U6gRNA (SLC 6A4-E4-g 2). Plasmid pKG-U6gRNA (SLC 6A4-E4-g 2) expresses the sgRNA shown in SEQ ID NO.43 SLC6A4-E4-g2
SEQ ID NO.43
CAUGAUGGUGUUGUAGUAGGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
SLC6A4-E4-gRNA3-S and SLC6A4-E4-gRNA3-A were synthesized separately, and then mixed and annealed to give double-stranded DNA molecules with cohesive ends. The double-stranded DNA molecule having a cohesive end was ligated to the vector backbone to give plasmid pKG-U6gRNA (SLC 6A4-E4-g 3). Plasmid pKG-U6gRNA (SLC 6A4-E4-g 3) expresses the sgRNA shown in SEQ ID NO.44 SLC6A4-E4-g3
SEQ ID NO.44:
CUCCUACUACAACACCAUCAguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
SLC6A4-E4-gRNA4-S and SLC6A4-E4-gRNA4-A were synthesized separately, and then mixed and annealed to give double-stranded DNA molecules with cohesive ends. The double-stranded DNA molecule having a cohesive end was ligated to the vector backbone to give plasmid pKG-U6gRNA (SLC 6A4-E4-g 4). Plasmid pKG-U6gRNA (SLC 6A4-E4-g 4) expresses the sgRNA shown in SEQ ID NO.45 SLC6A4-E4-g4
SEQ ID NO.45:
CUACAACACCAUCAUGGCCUguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
SLC6A4-E4-gRNA1-S:5’-caccGTAGAAGGCGATGACGCAGA-3’(SEQ ID NO.46);
SLC6A4-E4-gRNA1-A:5’-aaacTCTGCGTCATCGCCTTCTAC-3’(SEQ ID NO.47);
SLC6A4-E4-gRNA2-S:5’-caccgCATGATGGTGTTGTAGTAGG-3’(SEQ ID NO.48);
SLC6A4-E4-gRNA2-A:5’-aaacCCTACTACAACACCATCATGc-3’(SEQ ID NO.49);
SLC6A4-E4-gRNA3-S:5’-caccgCTCCTACTACAACACCATCA-3’(SEQ ID NO.50);
SLC6A4-E4-gRNA3-A:5’-aaacTGATGGTGTTGTAGTAGGAGc-3’(SEQ ID NO.51);
SLC6A4-E4-gRNA4-S:5’-caccgCTACAACACCATCATGGCCT-3’(SEQ ID NO.52);
SLC6A4-E4-gRNA4-A:5’-aaacAGGCCATGATGGTGTTGTAGc-3’(SEQ ID NO.53)。
SLC6A4-E4-gRNA1-S, SLC A4-E4-gRNA1-A, SLC A4-E4-gRNA2-S, SLC A4-E4-gRNA2-A, SLC A4-E4-gRNA3-S, SLC A4-E4-gRNA3-A, SLC A4-E4-gRNA4-S, SLC A4-E4-gRNA4-A are single stranded DNA molecules.
4. Editing efficiency comparison of different targets
Porcine primary fibroblasts were prepared from ear tissue of a junior river-flavored pig (female, blood group AO).
1. Co-transfection
A first group: plasmid pKG-U6gRNA (SLC 6A4-E4-g 1) and plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (SLC 6A4-E4-g 1): 1.08 μg of plasmid pKG-GE3.
Second group: plasmid pKG-U6gRNA (SLC 6A4-E4-g 2) and plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.92. Mu.g plasmid pKG-U6gRNA (SLC 6A4-E4-g 2): 1.08 μg of plasmid pKG-GE3.
Third group: plasmid pKG-U6gRNA (SLC 6A4-E4-g 3) and plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (SLC 6A4-E4-g 3): 1.08 μg of plasmid pKG-GE3.
Fourth group: plasmid pKG-U6gRNA (SLC 6A4-E4-g 4) and plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.92. Mu.g plasmid pKG-U6gRNA (SLC 6A4-E4-g 4): 1.08 μg of plasmid pKG-GE3.
Fifth group: pig primary fibroblasts were not subjected to any transfection procedure.
Co-transfection was performed by electric shock transfection using a mammalian nuclear transfection kit (Neon kit, thermofiser) and a Neon TM transfection system electrotransfection apparatus (parameters set to 1450V, 10ms, 3 pulses).
2. After the step 1 is completed, the culture is carried out for 16 to 18 hours by adopting the complete culture solution, and then the culture is carried out by replacing the new complete culture solution. The total incubation time was 48 hours.
3. After step 2 is completed, cells are digested and collected by trypsin, then the cells are lysed and genomic DNA is extracted, PCR amplification is performed by using a primer pair consisting of SLC6A4-E4-F and SLC6A4-E4-R, and then electrophoresis is performed. The target fragment was recovered and sequenced, and the sequencing peaks are shown in FIG. 13. Analysis of the sequencing peak map using the synthetic ICE tool resulted in gene editing efficiencies for different targets. The gene editing efficiency of the first group to the fourth group was 20%, 30%, 16%, 1% in this order. The fifth group did not undergo gene editing. The results showed that the second set of editing was most efficient, sgRNA SLC6A4-E4-g2 The target point of (2) is the optimal target point.
EXAMPLE 5 preparation of BDNF and SLC6A4 Gene editing Single cell clones
Porcine primary fibroblasts were prepared from ear tissue of a junior river-flavored pig (female, blood group AO).
1. Co-transfection
The plasmid pKG-U6gRNA (BDNF-E6-g 1), pKG-U6gRNA (SLC 6A4-E4-g 2) and plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.47. Mu.g of plasmid pKG-U6gRNA (BDNF-E6-g 2): 0.47. Mu.g of plasmid pKG-U6gRNA (SLC 6A4-E4-g 1): 1.06 μg of plasmid pKG-GE3.
Co-transfection was performed by electric shock transfection using a mammalian nuclear transfection kit (Neon kit, thermofiser) and a Neon TM transfection system electrotransfection apparatus (parameters set to 1450V, 10ms, 3 pulses).
2. After the step 1 is completed, the culture is carried out for 16 to 18 hours by adopting the complete culture solution, and then the culture is carried out by replacing the new complete culture solution. The total incubation time was 48 hours.
3. After step 2 was completed, the cells were digested with trypsin and collected, washed with complete broth, resuspended with complete broth, and then each single cell was individually picked into different wells of a 96-well plate (1 cell per well, 100 μl of complete broth per well) and cultured for 2 weeks (new complete broth was changed every 2-3 days).
4. After completion of step 3, cells were digested with trypsin and collected (about 2/3 of the resulting cells per well were inoculated into 6-well plates with complete culture medium, and the remaining 1/3 were collected in 1.5mL centrifuge tubes for subsequent genotyping detection).
5. The 6-well plate of step 4 was used to culture cells to 80% confluence, trypsinized and the cells were collected and frozen using cell frozen stock (90% complete medium+10% dmso, volume ratio).
6. Taking the centrifuge tube in the step 4, taking cells, extracting genome DNA, respectively adopting primer pairs consisting of BDNF-E6-F/BDNF-E6-R and SLC6A4-E4-F/SLC6A4-E4-R to carry out PCR amplification, and then carrying out electrophoresis. Porcine primary fibroblasts were used as wild-type controls. The electrophoresis patterns are shown in fig. 14 and 15, respectively. The lane numbers in fig. 14 are consistent with the cell numbers in table 1, and the lane numbers in fig. 15 are consistent with the cell numbers in table 2.
7. After step 6 is completed, the PCR amplification product is recovered and sequenced.
The sequencing result of the primary fibroblast of the pig is only one, and the genotype of the primary fibroblast is wild type. If there are two types of sequencing results of a single cell clone, one is consistent with the sequencing results of the primary fibroblast of the pig, the other is mutated (the mutation comprises deletion, insertion or substitution of one or more nucleotides) compared with the sequencing results of the primary fibroblast of the pig, and the genotype of the single cell clone is heterozygous mutant; if the sequencing result of a single cell clone is two, the single cell clone has mutation (the mutation comprises deletion, insertion or replacement of one or more nucleotides) compared with the sequencing result of a primary fibroblast of a pig, and the genotype of the single cell clone is homozygotic mutation type with different variants of double alleles; if the sequencing result of a single cell clone is one and a mutation (mutation includes deletion, insertion or substitution of one or more nucleotides) is made compared with the sequencing result of a porcine primary fibroblast, the genotype of the single cell clone is a homozygotic mutant of the same variation of the double allele; if the sequencing result of a single cell clone is one and is consistent with the sequencing result of porcine primary fibroblasts, the genotype of the single cell clone is wild type.
The results of the BDNF gene editing are shown in Table 1. The genotypes of the single cell clones numbered 12, 15 and 19 were homozygotic mutants of the same variation of the double alleles. The genotype of the single cell clone numbered 4 was homozygously mutated versions of the different variants of the bi-allele. The genotypes of the single cell clones numbered 3, 5, 8, 13, 16, 18 are heterozygous mutants. The rate of gene editing single cell clones was 50%.
Exemplary sequencing alignment results are shown in FIGS. 16-19. FIG. 16 is a result of forward sequencing of a single cell clone numbered BDNF-1 aligned with the wild type, and judged to be the wild type. FIG. 17 is a result of forward sequencing of a single cell clone numbered BDNF-3 aligned with wild-type, judged as heterozygous mutant. FIG. 18 is a result of forward sequencing of a single cell clone numbered BDNF-4 aligned with wild-type, and a homozygous mutant version with different variants of the biallelic gene was determined. FIG. 19 is a result of forward sequencing of a single cell clone numbered BDNF-12 aligned with wild-type, and a homozygous mutant version of the same variation as the biallelic was determined.
TABLE 1 genotype of BDNF Gene knockout Single cell clones
Figure BDA0002904074420000181
Figure BDA0002904074420000191
The results of editing the SLC6A4 gene are shown in Table 2. The genotypes of the single cell clones numbered 2, 7 and 12 were homozygotic mutants of the same variation of the double alleles. The genotypes of the single cell clones No. 8 and 16 are homozygotic mutants of different variants of the double alleles. The genotypes of the single cell clones numbered 1, 3, 10, 13, 18 are heterozygous mutants. The rate of gene editing single cell clones was 50%.
Exemplary sequencing alignment results are shown in FIGS. 20-23. FIG. 20 is a result of forward sequencing of a single cell clone numbered SLC6A4-4 aligned with wild type, and judged to be wild type. FIG. 21 is a result of forward sequencing of a single cell clone numbered SLC6A4-3 aligned with wild type, judged as heterozygous mutant. FIG. 22 shows the results of forward sequencing of single cell clones numbered SLC6A4-8 aligned with wild type, and the homozygous mutant versions were determined for the different variants of the biallelic. FIG. 23 is a result of forward sequencing of a single cell clone numbered SLC6A4-12 aligned with wild type, and a homozygous mutant version of the same variation was determined for the biallelic.
TABLE 2 genotype of SLC6A4 Gene knockout Single cell clone
Figure BDA0002904074420000192
Figure BDA0002904074420000201
8. After the step 7 is completed, screening of double-gene combined knockout monoclonal strains is carried out. Since the homozygous knockout of the BDNF gene is lethal, single cell clones need to be selected that have the heterozygous knockout of the BDNF gene and the homozygous knockout of the SLC6A4 gene. By analysis, the single cell clones numbered 8 and 16 were single cell clones with the BDNF gene heterozygous knockdown and the SLC6A4 gene homozygous knockdown.
The present invention is described in detail above. It will be apparent to those skilled in the art that the present invention can be practiced in a wide range of equivalent parameters, concentrations, and conditions without departing from the spirit and scope of the invention and without undue experimentation. While the invention has been described with respect to specific embodiments, it will be appreciated that the invention may be further modified. In general, this application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the present disclosure as come within known or customary practice within the art to which the invention pertains. The application of some of the basic features may be done in accordance with the scope of the claims that follow.
Sequence listing
<110> Nanjing Kidney Gene engineering Co., ltd
<120> CRISPR system for double-target gene editing and application thereof in construction of depressive pig nuclear transfer donor cells
<160> 53
<170> SIPOSequenceListing 1.0
<210> 1
<211> 8484
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 1
gagggcctat ttcccatgat tccttcatat ttgcatatac gatacaaggc tgttagagag 60
ataattggaa ttaatttgac tgtaaacaca aagatattag tacaaaatac gtgacgtaga 120
aagtaataat ttcttgggta gtttgcagtt ttaaaattat gttttaaaat ggactatcat 180
atgcttaccg taacttgaaa gtatttcgat ttcttggctt tatatatctt gtggaaagga 240
cgaaacaccg ggtcttcgag aagacctgtt ttagagctag aaatagcaag ttaaaataag 300
gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttg ttttagagct 360
agaaatagca agttaaaata aggctagtcc gtttttagcg cgtgcgccaa ttctgcagac 420
aaatggctct agaggtaccc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 480
ccaacgaccc ccgcccattg acgtcaatag taacgccaat agggactttc cattgacgtc 540
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 600
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tgtgcccagt 660
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 720
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 780
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 840
ggggggggcg gggcgagggg cggggcgggg cgaggcggag aggtgcggcg gcagccaatc 900
agagcggcgc gctccgaaag tttcctttta tggcgaggcg gcggcggcgg cggccctata 960
aaaagcgaag cgcgcggcgg gcgggagtcg ctgcgcgctg ccttcgcccc gtgccccgct 1020
ccgccgccgc ctcgcgccgc ccgccccggc tctgactgac cgcgttactc ccacaggtga 1080
gcgggcggga cggcccttct cctccgggct gtaattagct gagcaagagg taagggttta 1140
agggatggtt ggttggtggg gtattaatgt ttaattacct ggagcacctg cctgaaatca 1200
ctttttttca ggttggaccg gtgccaccat ggactataag gaccacgacg gagactacaa 1260
ggatcatgat attgattaca aagacgatga cgataagatg gccccaaaga agaagcggaa 1320
ggtcggtatc cacggagtcc cagcagccga caagaagtac agcatcggcc tggacatcgg 1380
caccaactct gtgggctggg ccgtgatcac cgacgagtac aaggtgccca gcaagaaatt 1440
caaggtgctg ggcaacaccg accggcacag catcaagaag aacctgatcg gagccctgct 1500
gttcgacagc ggcgaaacag ccgaggccac ccggctgaag agaaccgcca gaagaagata 1560
caccagacgg aagaaccgga tctgctatct gcaagagatc ttcagcaacg agatggccaa 1620
ggtggacgac agcttcttcc acagactgga agagtccttc ctggtggaag aggataagaa 1680
gcacgagcgg caccccatct tcggcaacat cgtggacgag gtggcctacc acgagaagta 1740
ccccaccatc taccacctga gaaagaaact ggtggacagc accgacaagg ccgacctgcg 1800
gctgatctat ctggccctgg cccacatgat caagttccgg ggccacttcc tgatcgaggg 1860
cgacctgaac cccgacaaca gcgacgtgga caagctgttc atccagctgg tgcagaccta 1920
caaccagctg ttcgaggaaa accccatcaa cgccagcggc gtggacgcca aggccatcct 1980
gtctgccaga ctgagcaaga gcagacggct ggaaaatctg atcgcccagc tgcccggcga 2040
gaagaagaat ggcctgttcg gaaacctgat tgccctgagc ctgggcctga cccccaactt 2100
caagagcaac ttcgacctgg ccgaggatgc caaactgcag ctgagcaagg acacctacga 2160
cgacgacctg gacaacctgc tggcccagat cggcgaccag tacgccgacc tgtttctggc 2220
cgccaagaac ctgtccgacg ccatcctgct gagcgacatc ctgagagtga acaccgagat 2280
caccaaggcc cccctgagcg cctctatgat caagagatac gacgagcacc accaggacct 2340
gaccctgctg aaagctctcg tgcggcagca gctgcctgag aagtacaaag agattttctt 2400
cgaccagagc aagaacggct acgccggcta cattgacggc ggagccagcc aggaagagtt 2460
ctacaagttc atcaagccca tcctggaaaa gatggacggc accgaggaac tgctcgtgaa 2520
gctgaacaga gaggacctgc tgcggaagca gcggaccttc gacaacggca gcatccccca 2580
ccagatccac ctgggagagc tgcacgccat tctgcggcgg caggaagatt tttacccatt 2640
cctgaaggac aaccgggaaa agatcgagaa gatcctgacc ttccgcatcc cctactacgt 2700
gggccctctg gccaggggaa acagcagatt cgcctggatg accagaaaga gcgaggaaac 2760
catcaccccc tggaacttcg aggaagtggt ggacaagggc gcttccgccc agagcttcat 2820
cgagcggatg accaacttcg ataagaacct gcccaacgag aaggtgctgc ccaagcacag 2880
cctgctgtac gagtacttca ccgtgtataa cgagctgacc aaagtgaaat acgtgaccga 2940
gggaatgaga aagcccgcct tcctgagcgg cgagcagaaa aaggccatcg tggacctgct 3000
gttcaagacc aaccggaaag tgaccgtgaa gcagctgaaa gaggactact tcaagaaaat 3060
cgagtgcttc gactccgtgg aaatctccgg cgtggaagat cggttcaacg cctccctggg 3120
cacataccac gatctgctga aaattatcaa ggacaaggac ttcctggaca atgaggaaaa 3180
cgaggacatt ctggaagata tcgtgctgac cctgacactg tttgaggaca gagagatgat 3240
cgaggaacgg ctgaaaacct atgcccacct gttcgacgac aaagtgatga agcagctgaa 3300
gcggcggaga tacaccggct ggggcaggct gagccggaag ctgatcaacg gcatccggga 3360
caagcagtcc ggcaagacaa tcctggattt cctgaagtcc gacggcttcg ccaacagaaa 3420
cttcatgcag ctgatccacg acgacagcct gacctttaaa gaggacatcc agaaagccca 3480
ggtgtccggc cagggcgata gcctgcacga gcacattgcc aatctggccg gcagccccgc 3540
cattaagaag ggcatcctgc agacagtgaa ggtggtggac gagctcgtga aagtgatggg 3600
ccggcacaag cccgagaaca tcgtgatcga aatggccaga gagaaccaga ccacccagaa 3660
gggacagaag aacagccgcg agagaatgaa gcggatcgaa gagggcatca aagagctggg 3720
cagccagatc ctgaaagaac accccgtgga aaacacccag ctgcagaacg agaagctgta 3780
cctgtactac ctgcagaatg ggcgggatat gtacgtggac caggaactgg acatcaaccg 3840
gctgtccgac tacgatgtgg accatatcgt gcctcagagc tttctgaagg acgactccat 3900
cgacaacaag gtgctgacca gaagcgacaa gaaccggggc aagagcgaca acgtgccctc 3960
cgaagaggtc gtgaagaaga tgaagaacta ctggcggcag ctgctgaacg ccaagctgat 4020
tacccagaga aagttcgaca atctgaccaa ggccgagaga ggcggcctga gcgaactgga 4080
taaggccggc ttcatcaaga gacagctggt ggaaacccgg cagatcacaa agcacgtggc 4140
acagatcctg gactcccgga tgaacactaa gtacgacgag aatgacaagc tgatccggga 4200
agtgaaagtg atcaccctga agtccaagct ggtgtccgat ttccggaagg atttccagtt 4260
ttacaaagtg cgcgagatca acaactacca ccacgcccac gacgcctacc tgaacgccgt 4320
cgtgggaacc gccctgatca aaaagtaccc taagctggaa agcgagttcg tgtacggcga 4380
ctacaaggtg tacgacgtgc ggaagatgat cgccaagagc gagcaggaaa tcggcaaggc 4440
taccgccaag tacttcttct acagcaacat catgaacttt ttcaagaccg agattaccct 4500
ggccaacggc gagatccgga agcggcctct gatcgagaca aacggcgaaa ccggggagat 4560
cgtgtgggat aagggccggg attttgccac cgtgcggaaa gtgctgagca tgccccaagt 4620
gaatatcgtg aaaaagaccg aggtgcagac aggcggcttc agcaaagagt ctatcctgcc 4680
caagaggaac agcgataagc tgatcgccag aaagaaggac tgggacccta agaagtacgg 4740
cggcttcgac agccccaccg tggcctattc tgtgctggtg gtggccaaag tggaaaaggg 4800
caagtccaag aaactgaaga gtgtgaaaga gctgctgggg atcaccatca tggaaagaag 4860
cagcttcgag aagaatccca tcgactttct ggaagccaag ggctacaaag aagtgaaaaa 4920
ggacctgatc atcaagctgc ctaagtactc cctgttcgag ctggaaaacg gccggaagag 4980
aatgctggcc tctgccggcg aactgcagaa gggaaacgaa ctggccctgc cctccaaata 5040
tgtgaacttc ctgtacctgg ccagccacta tgagaagctg aagggctccc ccgaggataa 5100
tgagcagaaa cagctgtttg tggaacagca caagcactac ctggacgaga tcatcgagca 5160
gatcagcgag ttctccaaga gagtgatcct ggccgacgct aatctggaca aagtgctgtc 5220
cgcctacaac aagcaccggg ataagcccat cagagagcag gccgagaata tcatccacct 5280
gtttaccctg accaatctgg gagcccctgc cgccttcaag tactttgaca ccaccatcga 5340
ccggaagagg tacaccagca ccaaagaggt gctggacgcc accctgatcc accagagcat 5400
caccggcctg tacgagacac ggatcgacct gtctcagctg ggaggcgaca aaaggccggc 5460
ggccacgaaa aaggccggcc aggcaaaaaa gaaaaagtaa gaattcctag agctcgctga 5520
tcagcctcga ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct 5580
tccttgaccc tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca 5640
tcgcattgtc tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag 5700
ggggaggatt gggaagagaa tagcaggcat gctggggagc ggccgcagga acccctagtg 5760
atggagttgg ccactccctc tctgcgcgct cgctcgctca ctgaggccgg gcgaccaaag 5820
gtcgcccgac gcccgggctt tgcccgggcg gcctcagtga gcgagcgagc gcgcagctgc 5880
ctgcaggggc gcctgatgcg gtattttctc cttacgcatc tgtgcggtat ttcacaccgc 5940
atacgtcaaa gcaaccatag tacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg 6000
tggttacgcg cagcgtgacc gctacacttg ccagcgcctt agcgcccgct cctttcgctt 6060
tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc 6120
tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgatttgg 6180
gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg 6240
agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc aactctatct 6300
cgggctattc ttttgattta taagggattt tgccgatttc ggtctattgg ttaaaaaatg 6360
agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaattttat 6420
ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc 6480
caacacccgc tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag 6540
ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg 6600
cgagacgaaa gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg 6660
tttcttagac gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat 6720
ttttctaaat acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc 6780
aataatattg aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct 6840
tttttgcggc attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag 6900
atgctgaaga tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta 6960
agatccttga gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc 7020
tgctatgtgg cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca 7080
tacactattc tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg 7140
atggcatgac agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg 7200
ccaacttact tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca 7260
tgggggatca tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa 7320
acgacgagcg tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa 7380
ctggcgaact acttactcta gcttcccggc aacaattaat agactggatg gaggcggata 7440
aagttgcagg accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat 7500
ctggagccgg tgagcgtgga agccgcggta tcattgcagc actggggcca gatggtaagc 7560
cctcccgtat cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata 7620
gacagatcgc tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt 7680
actcatatat actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga 7740
agatcctttt tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag 7800
cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa 7860
tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag 7920
agctaccaac tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg 7980
ttcttctagt gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat 8040
acctcgctct gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta 8100
ccgggttgga ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg 8160
gttcgtgcac acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc 8220
gtgagctatg agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa 8280
gcggcagggt cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc 8340
tttatagtcc tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt 8400
caggggggcg gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct 8460
tttgctggcc ttttgctcac atgt 8484
<210> 2
<211> 10476
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 2
gagggcctat ttcccatgat tccttcatat ttgcatatac gatacaaggc tgttagagag 60
ataattggaa ttaatttgac tgtaaacaca aagatattag tacaaaatac gtgacgtaga 120
aagtaataat ttcttgggta gtttgcagtt ttaaaattat gttttaaaat ggactatcat 180
atgcttaccg taacttgaaa gtatttcgat ttcttggctt tatatatctt gtggaaagga 240
cgaaacaccg ggtcttcgag aagacctgtt ttagagctag aaatagcaag ttaaaataag 300
gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttc tagcgcgtgc 360
gccaattctg cagacaaatg gctctagagg tacccgttac ataacttacg gtaaatggcc 420
cgcctggctg accgcccaac gacccccgcc cattgacgtc aatagtaacg ccaataggga 480
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 540
aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct 600
ggcattgtgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat 660
tagtcatcgc tattaccatg ggggcagagc gcacatcgcc cacagtcccc gagaagttgg 720
ggggaggggt cggcaattga tccggtgcct agagaaggtg gcgcggggta aactgggaaa 780
gtgatgtcgt gtactggctc cgcctttttc ccgagggtgg gggagaaccg tatataagtg 840
cagtagtcgc cgtgaacgtt ctttttcgca acgggtttgc cgccagaaca caggttggac 900
cggtgccacc atggactata aggaccacga cggagactac aaggatcatg atattgatta 960
caaagacgat gacgataaga tggcccccaa aaagaaacga aaggtgggtg ggtccccaaa 1020
gaagaagcgg aaggtcggta tccacggagt cccagcagcc gacaagaagt acagcatcgg 1080
cctggacatc ggcaccaact ctgtgggctg ggccgtgatc accgacgagt acaaggtgcc 1140
cagcaagaaa ttcaaggtgc tgggcaacac cgaccggcac agcatcaaga agaacctgat 1200
cggagccctg ctgttcgaca gcggcgaaac agccgaggcc acccggctga agagaaccgc 1260
cagaagaaga tacaccagac ggaagaaccg gatctgctat ctgcaagaga tcttcagcaa 1320
cgagatggcc aaggtggacg acagcttctt ccacagactg gaagagtcct tcctggtgga 1380
agaggataag aagcacgagc ggcaccccat cttcggcaac atcgtggacg aggtggccta 1440
ccacgagaag taccccacca tctaccacct gagaaagaaa ctggtggaca gcaccgacaa 1500
ggccgacctg cggctgatct atctggccct ggcccacatg atcaagttcc ggggccactt 1560
cctgatcgag ggcgacctga accccgacaa cagcgacgtg gacaagctgt tcatccagct 1620
ggtgcagacc tacaaccagc tgttcgagga aaaccccatc aacgccagcg gcgtggacgc 1680
caaggccatc ctgtctgcca gactgagcaa gagcagacgg ctggaaaatc tgatcgccca 1740
gctgcccggc gagaagaaga atggcctgtt cggaaacctg attgccctga gcctgggcct 1800
gacccccaac ttcaagagca acttcgacct ggccgaggat gccaaactgc agctgagcaa 1860
ggacacctac gacgacgacc tggacaacct gctggcccag atcggcgacc agtacgccga 1920
cctgtttctg gccgccaaga acctgtccga cgccatcctg ctgagcgaca tcctgagagt 1980
gaacaccgag atcaccaagg cccccctgag cgcctctatg atcaagagat acgacgagca 2040
ccaccaggac ctgaccctgc tgaaagctct cgtgcggcag cagctgcctg agaagtacaa 2100
agagattttc ttcgaccaga gcaagaacgg ctacgccggc tacattgacg gcggagccag 2160
ccaggaagag ttctacaagt tcatcaagcc catcctggaa aagatggacg gcaccgagga 2220
actgctcgtg aagctgaaca gagaggacct gctgcggaag cagcggacct tcgacaacgg 2280
cagcatcccc caccagatcc acctgggaga gctgcacgcc attctgcggc ggcaggaaga 2340
tttttaccca ttcctgaagg acaaccggga aaagatcgag aagatcctga ccttccgcat 2400
cccctactac gtgggccctc tggccagggg aaacagcaga ttcgcctgga tgaccagaaa 2460
gagcgaggaa accatcaccc cctggaactt cgaggaagtg gtggacaagg gcgcttccgc 2520
ccagagcttc atcgagcgga tgaccaactt cgataagaac ctgcccaacg agaaggtgct 2580
gcccaagcac agcctgctgt acgagtactt caccgtgtat aacgagctga ccaaagtgaa 2640
atacgtgacc gagggaatga gaaagcccgc cttcctgagc ggcgagcaga aaaaggccat 2700
cgtggacctg ctgttcaaga ccaaccggaa agtgaccgtg aagcagctga aagaggacta 2760
cttcaagaaa atcgagtgct tcgactccgt ggaaatctcc ggcgtggaag atcggttcaa 2820
cgcctccctg ggcacatacc acgatctgct gaaaattatc aaggacaagg acttcctgga 2880
caatgaggaa aacgaggaca ttctggaaga tatcgtgctg accctgacac tgtttgagga 2940
cagagagatg atcgaggaac ggctgaaaac ctatgcccac ctgttcgacg acaaagtgat 3000
gaagcagctg aagcggcgga gatacaccgg ctggggcagg ctgagccgga agctgatcaa 3060
cggcatccgg gacaagcagt ccggcaagac aatcctggat ttcctgaagt ccgacggctt 3120
cgccaacaga aacttcatgc agctgatcca cgacgacagc ctgaccttta aagaggacat 3180
ccagaaagcc caggtgtccg gccagggcga tagcctgcac gagcacattg ccaatctggc 3240
cggcagcccc gccattaaga agggcatcct gcagacagtg aaggtggtgg acgagctcgt 3300
gaaagtgatg ggccggcaca agcccgagaa catcgtgatc gaaatggcca gagagaacca 3360
gaccacccag aagggacaga agaacagccg cgagagaatg aagcggatcg aagagggcat 3420
caaagagctg ggcagccaga tcctgaaaga acaccccgtg gaaaacaccc agctgcagaa 3480
cgagaagctg tacctgtact acctgcagaa tgggcgggat atgtacgtgg accaggaact 3540
ggacatcaac cggctgtccg actacgatgt ggaccatatc gtgcctcaga gctttctgaa 3600
ggacgactcc atcgacaaca aggtgctgac cagaagcgac aagaaccggg gcaagagcga 3660
caacgtgccc tccgaagagg tcgtgaagaa gatgaagaac tactggcggc agctgctgaa 3720
cgccaagctg attacccaga gaaagttcga caatctgacc aaggccgaga gaggcggcct 3780
gagcgaactg gataaggccg gcttcatcaa gagacagctg gtggaaaccc ggcagatcac 3840
aaagcacgtg gcacagatcc tggactcccg gatgaacact aagtacgacg agaatgacaa 3900
gctgatccgg gaagtgaaag tgatcaccct gaagtccaag ctggtgtccg atttccggaa 3960
ggatttccag ttttacaaag tgcgcgagat caacaactac caccacgccc acgacgccta 4020
cctgaacgcc gtcgtgggaa ccgccctgat caaaaagtac cctaagctgg aaagcgagtt 4080
cgtgtacggc gactacaagg tgtacgacgt gcggaagatg atcgccaaga gcgagcagga 4140
aatcggcaag gctaccgcca agtacttctt ctacagcaac atcatgaact ttttcaagac 4200
cgagattacc ctggccaacg gcgagatccg gaagcggcct ctgatcgaga caaacggcga 4260
aaccggggag atcgtgtggg ataagggccg ggattttgcc accgtgcgga aagtgctgag 4320
catgccccaa gtgaatatcg tgaaaaagac cgaggtgcag acaggcggct tcagcaaaga 4380
gtctatcctg cccaagagga acagcgataa gctgatcgcc agaaagaagg actgggaccc 4440
taagaagtac ggcggcttcg acagccccac cgtggcctat tctgtgctgg tggtggccaa 4500
agtggaaaag ggcaagtcca agaaactgaa gagtgtgaaa gagctgctgg ggatcaccat 4560
catggaaaga agcagcttcg agaagaatcc catcgacttt ctggaagcca agggctacaa 4620
agaagtgaaa aaggacctga tcatcaagct gcctaagtac tccctgttcg agctggaaaa 4680
cggccggaag agaatgctgg cctctgccgg cgaactgcag aagggaaacg aactggccct 4740
gccctccaaa tatgtgaact tcctgtacct ggccagccac tatgagaagc tgaagggctc 4800
ccccgaggat aatgagcaga aacagctgtt tgtggaacag cacaagcact acctggacga 4860
gatcatcgag cagatcagcg agttctccaa gagagtgatc ctggccgacg ctaatctgga 4920
caaagtgctg tccgcctaca acaagcaccg ggataagccc atcagagagc aggccgagaa 4980
tatcatccac ctgtttaccc tgaccaatct gggagcccct gccgccttca agtactttga 5040
caccaccatc gaccggaaga ggtacaccag caccaaagag gtgctggacg ccaccctgat 5100
ccaccagagc atcaccggcc tgtacgagac acggatcgac ctgtctcagc tgggaggcga 5160
caaaaggccg gcggccacga aaaaggccgg ccaggcaaaa aagaaaaagg gcggctccaa 5220
gcggcctgcc gcgacgaaga aagcgggaca ggccaagaaa aagaaaggat ccggcgcaac 5280
aaacttctct ctgctgaaac aagccggaga tgtcgaagag aatcctggac cggtgagcaa 5340
gggcgaggag ctgttcaccg gggtggtgcc catcctggtc gagctggacg gcgacgtaaa 5400
cggccacaag ttcagcgtgt ccggcgaggg cgagggcgat gccacctacg gcaagctgac 5460
cctgaagttc atctgcacca ccggcaagct gcccgtgccc tggcccaccc tcgtgaccac 5520
cctgacctac ggcgtgcagt gcttcagccg ctaccccgac cacatgaagc agcacgactt 5580
cttcaagtcc gccatgcccg aaggctacgt ccaggagcgc accatcttct tcaaggacga 5640
cggcaactac aagacccgcg ccgaggtgaa gttcgagggc gacaccctgg tgaaccgcat 5700
cgagctgaag ggcatcgact tcaaggagga cggcaacatc ctggggcaca agctggagta 5760
caactacaac agccacaacg tctatatcat ggccgacaag cagaagaacg gcatcaaggt 5820
gaacttcaag atccgccaca acatcgagga cggcagcgtg cagctcgccg accactacca 5880
gcagaacacc cccatcggcg acggccccgt gctgctgccc gacaaccact acctgagcac 5940
ccagtccgcc ctgagcaaag accccaacga gaagcgcgat cacatggtcc tgctggagtt 6000
cgtgaccgcc gccgggatca ctctcggcat ggacgagctg tacaagggct ccggcgaggg 6060
caggggaagt cttctaacat gcggggacgt ggaggaaaat cccggcccaa ccgagtacaa 6120
gcccacggtg cgcctcgcca cccgcgacga cgtccccagg gccgtacgca ccctcgccgc 6180
cgcgttcgcc gactaccccg ccacgcgcca caccgtcgat ccggaccgcc acatcgagcg 6240
ggtcaccgag ctgcaagaac tcttcctcac gcgcgtcggg ctcgacatcg gcaaggtgtg 6300
ggtcgcggac gacggcgccg cggtggcggt ctggaccacg ccggagagcg tcgaagcggg 6360
ggcggtgttc gccgagatcg gcccgcgcat ggccgagttg agcggttccc ggctggccgc 6420
gcagcaacag atggaaggcc tcctggcgcc gcaccggccc aaggagcccg cgtggttcct 6480
ggccaccgtc ggagtctcgc ccgaccacca gggcaagggt ctgggcagcg ccgtcgtgct 6540
ccccggagtg gaggcggccg agcgcgccgg ggtgcccgcc ttcctggaga cctccgcgcc 6600
ccgcaacctc cccttctacg agcggctcgg cttcaccgtc accgccgacg tcgaggtgcc 6660
cgaaggaccg cgcacctggt gcatgacccg caagcccggt gcctgaacgc gttaagtcga 6720
caatcaacct ctggattaca aaatttgtga aagattgact ggtattctta actatgttgc 6780
tccttttacg ctatgtggat acgctgcttt aatgcctttg tatcatgcta ttgcttcccg 6840
tatggctttc attttctcct ccttgtataa atcctggttg ctgtctcttt atgaggagtt 6900
gtggcccgtt gtcaggcaac gtggcgtggt gtgcactgtg tttgctgacg caacccccac 6960
tggttggggc attgccacca cctgtcagct cctttccggg actttcgctt tccccctccc 7020
tattgccacg gcggaactca tcgccgcctg ccttgcccgc tgctggacag gggctcggct 7080
gttgggcact gacaattccg tggtgttgtc ggggaaatca tcgtcctttc cttggctgct 7140
cgcctgtgtt gccacctgga ttctgcgcgg gacgtccttc tgctacgtcc cttcggccct 7200
caatccagcg gaccttcctt cccgcggcct gctgccggct ctgcggcctc ttccgcgtct 7260
tcgccttcgc cctcagacga gtcggatctc cctttgggcc gcctccccgc gtcgacttta 7320
agaccaatga cttacaaggc agctgtagat cttagccact ttttaaaaga aaagggggga 7380
ctggaagggc taattcactc ccaacgaaga caagatctgc tttttgcttg tactgggtct 7440
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 7500
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 7560
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagggcc 7620
cgtttaaacc cgctgatcag cctcgactgt gccttctagt tgccagccat ctgttgtttg 7680
cccctccccc gtgccttcct tgaccctgga aggtgccact cccactgtcc tttcctaata 7740
aaatgaggaa attgcatcgc attgtctgag taggtgtcat tctattctgg ggggtggggt 7800
ggggcaggac agcaaggggg aggattggga agacaatagc aggcatgctg gggatgcggt 7860
gggctctatg gcctgcaggg gcgcctgatg cggtattttc tccttacgca tctgtgcggt 7920
atttcacacc gcatacgtca aagcaaccat agtacgcgcc ctgtagcggc gcattaagcg 7980
cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ttagcgcccg 8040
ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 8100
taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 8160
aacttgattt gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 8220
ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 8280
tcaactctat ctcgggctat tcttttgatt tataagggat tttgccgatt tcggtctatt 8340
ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt 8400
ttacaatttt atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagc 8460
cccgacaccc gccaacaccc gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg 8520
cttacagaca agctgtgacc gtctccggga gctgcatgtg tcagaggttt tcaccgtcat 8580
caccgaaacg cgcgagacga aagggcctcg tgatacgcct atttttatag gttaatgtca 8640
tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc 8700
ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct 8760
gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg 8820
cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg 8880
tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc 8940
tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca 9000
cttttaaagt tctgctatgt ggcgcggtat tatcccgtat tgacgccggg caagagcaac 9060
tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa 9120
agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg 9180
ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt 9240
ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg 9300
aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca acaacgttgc 9360
gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga 9420
tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta 9480
ttgctgataa atctggagcc ggtgagcgtg gaagccgcgg tatcattgca gcactggggc 9540
cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg 9600
atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt 9660
cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa 9720
ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 9780
cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt 9840
ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt 9900
tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga 9960
taccaaatac tgttcttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag 10020
caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata 10080
agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg 10140
gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga 10200
gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca 10260
ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa 10320
acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt 10380
tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac 10440
ggttcctggc cttttgctgg ccttttgctc acatgt 10476
<210> 3
<211> 3120
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 3
gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60
cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120
tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180
aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240
ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300
ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360
tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420
tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480
actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540
gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600
acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660
gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720
acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780
gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840
ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900
gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960
cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020
agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080
catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140
tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200
cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260
gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320
taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc 1380
ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440
tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500
ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560
cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620
agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680
gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740
atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800
gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860
gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920
ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980
cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040
cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100
acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160
cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220
accatgatta cgccaagctt gcatgcaggc ctctgcagtc gacgggcccg ggatccgatg 2280
ataaacatgt gagggcctat ttcccatgat tccttcatat ttgcatatac gatacaaggc 2340
tgttagagag ataattggaa ttaatttgac tgtaaacaca aagatattag tacaaaatac 2400
gtgacgtaga aagtaataat ttcttgggta gtttgcagtt ttaaaattat gttttaaaat 2460
ggactatcat atgcttaccg taacttgaaa gtatttcgat ttcttggctt tatatatctt 2520
gtggaaagga cgaaacaccg ggtcttcgag aagacctgtt ttagagctag aaatagcaag 2580
ttaaaataag gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttc 2640
tagcgcgtgc gccaattctg cagacaaatg gctctagagg tacccataga tctagatgca 2700
ttcgcgaggt accgagctcg aattcactgg ccgtcgtttt acaacgtcgt gactgggaaa 2760
accctggcgt tacccaactt aatcgccttg cagcacatcc ccctttcgcc agctggcgta 2820
atagcgaaga ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat 2880
ggcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt 2940
gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa 3000
cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg 3060
tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga 3120
<210> 4
<211> 175
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 4
tgtggaaagg acgaaacacc gggtcttcga gaagacctgt tttagagcta gaaatagcaa 60
gttaaaataa ggctagtccg ttatcaactt gaaaaagtgg caccgagtcg gtgctttttt 120
ctagcgcgtg cgccaattct gcagacaaat ggctctagag gtacccgtta cataa 175
<210> 5
<211> 554
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 5
tctgcagaca aatggctcta gaggtacccg ttacataact tacggtaaat ggcccgcctg 60
gctgaccgcc caacgacccc cgcccattga cgtcaatagt aacgccaata gggactttcc 120
attgacgtca atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt 180
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 240
gtgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 300
tcgctattac catgggggca gagcgcacat cgcccacagt ccccgagaag ttggggggag 360
gggtcggcaa ttgatccggt gcctagagaa ggtggcgcgg ggtaaactgg gaaagtgatg 420
tcgtgtactg gctccgcctt tttcccgagg gtgggggaga accgtatata agtgcagtag 480
tcgccgtgaa cgttcttttt cgcaacgggt ttgccgccag aacacaggtt ggaccggtgc 540
caccatggac tata 554
<210> 6
<211> 447
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 6
ccagaacaca ggttggaccg gtgccaccat ggactataag gaccacgacg gagactacaa 60
ggatcatgat attgattaca aagacgatga cgataagatg gcccccaaaa agaaacgaaa 120
ggtgggtggg tccccaaaga agaagcggaa ggtcggtatc cacggagtcc cagcagccga 180
caagaagtac agcatcggcc tggacatcgg caccaactct gtgggctggg ccgtgatcac 240
cgacgagtac aaggtgccca gcaagaaatt caaggtgctg ggcaacaccg accggcacag 300
catcaagaag aacctgatcg gagccctgct gttcgacagc ggcgaaacag ccgaggccac 360
ccggctgaag agaaccgcca gaagaagata caccagacgg aagaaccgga tctgctatct 420
gcaagagatc ttcagcaacg agatggc 447
<210> 7
<211> 2727
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 7
cggcggccac gaaaaaggcc ggccaggcaa aaaagaaaaa gggcggctcc aagcggcctg 60
ccgcgacgaa gaaagcggga caggccaaga aaaagaaagg atccggcgca acaaacttct 120
ctctgctgaa acaagccgga gatgtcgaag agaatcctgg accggtgagc aagggcgagg 180
agctgttcac cggggtggtg cccatcctgg tcgagctgga cggcgacgta aacggccaca 240
agttcagcgt gtccggcgag ggcgagggcg atgccaccta cggcaagctg accctgaagt 300
tcatctgcac caccggcaag ctgcccgtgc cctggcccac cctcgtgacc accctgacct 360
acggcgtgca gtgcttcagc cgctaccccg accacatgaa gcagcacgac ttcttcaagt 420
ccgccatgcc cgaaggctac gtccaggagc gcaccatctt cttcaaggac gacggcaact 480
acaagacccg cgccgaggtg aagttcgagg gcgacaccct ggtgaaccgc atcgagctga 540
agggcatcga cttcaaggag gacggcaaca tcctggggca caagctggag tacaactaca 600
acagccacaa cgtctatatc atggccgaca agcagaagaa cggcatcaag gtgaacttca 660
agatccgcca caacatcgag gacggcagcg tgcagctcgc cgaccactac cagcagaaca 720
cccccatcgg cgacggcccc gtgctgctgc ccgacaacca ctacctgagc acccagtccg 780
ccctgagcaa agaccccaac gagaagcgcg atcacatggt cctgctggag ttcgtgaccg 840
ccgccgggat cactctcggc atggacgagc tgtacaaggg ctccggcgag ggcaggggaa 900
gtcttctaac atgcggggac gtggaggaaa atcccggccc aaccgagtac aagcccacgg 960
tgcgcctcgc cacccgcgac gacgtcccca gggccgtacg caccctcgcc gccgcgttcg 1020
ccgactaccc cgccacgcgc cacaccgtcg atccggaccg ccacatcgag cgggtcaccg 1080
agctgcaaga actcttcctc acgcgcgtcg ggctcgacat cggcaaggtg tgggtcgcgg 1140
acgacggcgc cgcggtggcg gtctggacca cgccggagag cgtcgaagcg ggggcggtgt 1200
tcgccgagat cggcccgcgc atggccgagt tgagcggttc ccggctggcc gcgcagcaac 1260
agatggaagg cctcctggcg ccgcaccggc ccaaggagcc cgcgtggttc ctggccaccg 1320
tcggagtctc gcccgaccac cagggcaagg gtctgggcag cgccgtcgtg ctccccggag 1380
tggaggcggc cgagcgcgcc ggggtgcccg ccttcctgga gacctccgcg ccccgcaacc 1440
tccccttcta cgagcggctc ggcttcaccg tcaccgccga cgtcgaggtg cccgaaggac 1500
cgcgcacctg gtgcatgacc cgcaagcccg gtgcctgaac gcgttaagtc gacaatcaac 1560
ctctggatta caaaatttgt gaaagattga ctggtattct taactatgtt gctcctttta 1620
cgctatgtgg atacgctgct ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt 1680
tcattttctc ctccttgtat aaatcctggt tgctgtctct ttatgaggag ttgtggcccg 1740
ttgtcaggca acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc actggttggg 1800
gcattgccac cacctgtcag ctcctttccg ggactttcgc tttccccctc cctattgcca 1860
cggcggaact catcgccgcc tgccttgccc gctgctggac aggggctcgg ctgttgggca 1920
ctgacaattc cgtggtgttg tcggggaaat catcgtcctt tccttggctg ctcgcctgtg 1980
ttgccacctg gattctgcgc gggacgtcct tctgctacgt cccttcggcc ctcaatccag 2040
cggaccttcc ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc 2100
gccctcagac gagtcggatc tccctttggg ccgcctcccc gcgtcgactt taagaccaat 2160
gacttacaag gcagctgtag atcttagcca ctttttaaaa gaaaaggggg gactggaagg 2220
gctaattcac tcccaacgaa gacaagatct gctttttgct tgtactgggt ctctctggtt 2280
agaccagatc tgagcctggg agctctctgg ctaactaggg aacccactgc ttaagcctca 2340
ataaagcttg ccttgagtgc ttcaagtagt gtgtgcccgt ctgttgtgtg actctggtaa 2400
ctagagatcc ctcagaccct tttagtcagt gtggaaaatc tctagcaggg cccgtttaaa 2460
cccgctgatc agcctcgact gtgccttcta gttgccagcc atctgttgtt tgcccctccc 2520
ccgtgccttc cttgaccctg gaaggtgcca ctcccactgt cctttcctaa taaaatgagg 2580
aaattgcatc gcattgtctg agtaggtgtc attctattct ggggggtggg gtggggcagg 2640
acagcaaggg ggaggattgg gaagacaata gcaggcatgc tggggatgcg gtgggctcta 2700
tggcctgcag gggcgcctga tgcggta 2727
<210> 8
<211> 410
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 8
gataaacatg tgagggccta tttcccatga ttccttcata tttgcatata cgatacaagg 60
ctgttagaga gataattgga attaatttga ctgtaaacac aaagatatta gtacaaaata 120
cgtgacgtag aaagtaataa tttcttgggt agtttgcagt tttaaaatta tgttttaaaa 180
tggactatca tatgcttacc gtaacttgaa agtatttcga tttcttggct ttatatatct 240
tgtggaaagg acgaaacacc gggtcttcga gaagacctgt tttagagcta gaaatagcaa 300
gttaaaataa ggctagtccg ttatcaactt gaaaaagtgg caccgagtcg gtgctttttt 360
ctagcgcgtg cgccaattct gcagacaaat ggctctagag gtacccatag 410
<210> 9
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 9
agttatggca gaactcagtg 20
<210> 10
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 10
ccccatccaa agtttttaaa gga 23
<210> 11
<211> 23
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 11
tgtggcagat gtcacagttt agg 23
<210> 12
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 12
caccgagtta tggcagaact cagtg 25
<210> 13
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 13
aaaccactga gttctgccat aactc 25
<210> 14
<211> 1659
<212> DNA
<213> pig (Sus scrofa)
<400> 14
tctgatgttg cgtttcctgt gacagcctga gcagagagca ttaaaaatta aacttacaaa 60
gctgctcaag tgggcggaag gaggcaaagt tgaagccgca gttttttttt ccacttgctc 120
aggagccatc tgatctcggg cttatacgct agagcttggg gccaaggctg cacgtctctg 180
gtttatgtta aagcccatct agcacactac tgacattaac ctgtgtctgg tgcagctgga 240
gtttctcacc aatgcgtcag aaaccttgac cctgcagaat ggcctggaac tgcagtcaga 300
gggctgcctc gtccccttgg gaaagcaagc aagcaagccc taaacagtct tgtgtcctgt 360
ttctgctttc tccctacagt tccaccaggt gagaagagtg atgaccatcc ttttccttac 420
tatggttatt tcatacttcg gttgcatgaa ggctgccccc atgaaagaag ccaacgtccg 480
aggacaaggc agcttggcct acccaggtgt gcggacccat gggactctgg agagcgtgaa 540
tgggcccaag gcaggttcaa gaggcctgac atcgtcgtca tcgtcgtcgt tggcggacac 600
ttttgaacac gtgatcgagg agctgttgga cgaggaccag aaagttcggc ccaatgagga 660
aaacaataag gacgcggaca tgtatacgtc ccgagtcatg ctcagcagtc aagtgccttt 720
ggagcctcct cttctctttc tgctggagga atacaaaaat tacctggatg ctgcaaacat 780
gtccatgagg gtccggcgcc actcggaccc cgcccgccgc ggggagctga gcgtgtgcga 840
cagcattagc gagtgggtga cggcggcgga taaaaagacg gcagtggaca tgtcgggtgg 900
cacggtcacg gtcctcgaaa aagtccccgt ctcgaaaggc caactgaagc agtacttcta 960
cgagaccaag tgcaatccta tggggtacac aaaggagggc tgcaggggca tagacaagag 1020
gcactggaac tcccagtgcc gaactaccca gtcgtatgtg cgggccctca ccatggatag 1080
caaaaagcga attggctggc ggttcataag gatagacact tcctgtgtat gtactttgac 1140
cattaagagg ggaagatagt ggctttatgt tgtatagatt atattgagac aaaaattatc 1200
tatttgtata tatacataac agggtaaatt attcagttaa gaaaaaaaat aattttatga 1260
actgcatgta taaatgaagt ttatacagta cagtggttct acaatctatt tattggacat 1320
ttccatgacc agagggaaac agtcattttt tgcgcacaac tttaaaaaaa aagtctgcat 1380
tacattcctc gataatgttg tggtttgttg ccgttgccaa gaattgaaaa acgttaaaga 1440
aaaaaaaagt ttaaaaaata ctactaataa attgcatgct gctttaattg tgaattgata 1500
acaaactgtc ctcttacaga aaacagacat aaaaaaaaag cgaaagaaaa aaaaaaaaag 1560
cctaacaaaa atttgaacca aaacattccg tttacatttt agacagtaag tatctttgtt 1620
cttgttggta ctctgtttta ctgcttttaa cttctgata 1659
<210> 15
<211> 260
<212> PRT
<213> pig (Sus scrofa)
<400> 15
Met Phe His Gln Val Arg Arg Val Met Thr Ile Leu Phe Leu Thr Met
1 5 10 15
Val Ile Ser Tyr Phe Gly Cys Met Lys Ala Ala Pro Met Lys Glu Ala
20 25 30
Asn Val Arg Gly Gln Gly Ser Leu Ala Tyr Pro Gly Val Arg Thr His
35 40 45
Gly Thr Leu Glu Ser Val Asn Gly Pro Lys Ala Gly Ser Arg Gly Leu
50 55 60
Thr Ser Ser Ser Ser Ser Ser Leu Ala Asp Thr Phe Glu His Val Ile
65 70 75 80
Glu Glu Leu Leu Asp Glu Asp Gln Lys Val Arg Pro Asn Glu Glu Asn
85 90 95
Asn Lys Asp Ala Asp Met Tyr Thr Ser Arg Val Met Leu Ser Ser Gln
100 105 110
Val Pro Leu Glu Pro Pro Leu Leu Phe Leu Leu Glu Glu Tyr Lys Asn
115 120 125
Tyr Leu Asp Ala Ala Asn Met Ser Met Arg Val Arg Arg His Ser Asp
130 135 140
Pro Ala Arg Arg Gly Glu Leu Ser Val Cys Asp Ser Ile Ser Glu Trp
145 150 155 160
Val Thr Ala Ala Asp Lys Lys Thr Ala Val Asp Met Ser Gly Gly Thr
165 170 175
Val Thr Val Leu Glu Lys Val Pro Val Ser Lys Gly Gln Leu Lys Gln
180 185 190
Tyr Phe Tyr Glu Thr Lys Cys Asn Pro Met Gly Tyr Thr Lys Glu Gly
195 200 205
Cys Arg Gly Ile Asp Lys Arg His Trp Asn Ser Gln Cys Arg Thr Thr
210 215 220
Gln Ser Tyr Val Arg Ala Leu Thr Met Asp Ser Lys Lys Arg Ile Gly
225 230 235 240
Trp Arg Phe Ile Arg Ile Asp Thr Ser Cys Val Cys Thr Leu Thr Ile
245 250 255
Lys Arg Gly Arg
260
<210> 16
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 16
acgtgatcga ggagctgttg 20
<210> 17
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 17
ttgctatcca tggtgagggc 20
<210> 18
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 18
agaccaagtg caatcctatg 20
<210> 19
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 19
catgtcgggt ggcacggtca 20
<210> 20
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 20
ggcggataaa aagacggcag 20
<210> 21
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 21
gaaaaagtcc ccgtctcgaa 20
<210> 22
<211> 100
<212> RNA
<213> Artificial sequence (Artificial Sequence)
<400> 22
agaccaagug caauccuaug guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 23
<211> 100
<212> RNA
<213> Artificial sequence (Artificial Sequence)
<400> 23
caugucgggu ggcacgguca guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 24
<211> 100
<212> RNA
<213> Artificial sequence (Artificial Sequence)
<400> 24
ggcggauaaa aagacggcag guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 25
<211> 100
<212> RNA
<213> Artificial sequence (Artificial Sequence)
<400> 25
gaaaaagucc ccgucucgaa guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 26
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 26
caccgagacc aagtgcaatc ctatg 25
<210> 27
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 27
aaaccatagg attgcacttg gtctc 25
<210> 28
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 28
caccgcatgt cgggtggcac ggtca 25
<210> 29
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 29
aaactgaccg tgccacccga catgc 25
<210> 30
<211> 24
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 30
caccggcgga taaaaagacg gcag 24
<210> 31
<211> 24
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 31
aaacctgccg tctttttatc cgcc 24
<210> 32
<211> 24
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 32
caccgaaaaa gtccccgtct cgaa 24
<210> 33
<211> 24
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 33
aaacttcgag acggggactt tttc 24
<210> 34
<211> 1020
<212> DNA
<213> pig (Sus scrofa)
<400> 34
cagcatggtc tatcccaaga gactggacac agtttcctga gctgtccagc aggacctcac 60
tgcttatgca ttctaaatgg aagagtttgc atctcccaac cccgaactcc cctcctcccc 120
ttggcaacca caagtctgtt ctccattctg agtctgtttc tgttttgtag ctaggttcat 180
ctgtgcccta ttttagacgc cacattttct aaaacatgta accttgcaaa tacaaataag 240
gcgtttacct tgttggcact ttcccaccgt tcctggcgtc tttgggggac agggtgaatg 300
ggagcccccc tcttccccct cacccgggct gcactgcctc cgatgccctg tttccctgtc 360
aggctgccct taacccggcc ccctgccccc tctcgcccag ggatcggctt cgccatctgc 420
gtcatcgcct tctacatcgc ctcctactac aacaccatca tggcctgggc cctctactac 480
ctcatctcct ctttcacgga ccagctgccc tggaccagct gcaagaactc ctggaacacc 540
ggcaactgca ccaactactt ctccgaggac aacgtcacct ggatgctcca ctcaacgtcc 600
cctgcagaag aattttacac gtaagtgcat gtaagtgagg gggtggcctg ttaggggcag 660
accacacccc tggggtttgg cttcttagga gggaggcggg ggcccggggc tcttagctct 720
tggctgatgg gttttgctgt ggggaggggg gtgaatgtga gggagggagt tgggagactc 780
taccgtgagg gtccgcagtc tggggctctg tgccgtcccc ttcttttctg atgcgcccag 840
gccagtccct gccttcacat tccaaaggta cccccctgct ttggaagcac ccttcctcct 900
gggccctcct ctcagccagt tgtcatggga gcagagcctc tctctacttc aatagtttga 960
aatgactcaa aaagtttctg gtttcatgtt ttcacttcac acgtgagctc tatcctgtga 1020
<210> 35
<211> 667
<212> PRT
<213> pig (Sus scrofa)
<400> 35
Met Asn Glu Leu Ala Thr Pro Leu Ile Lys Ser Ala Lys Asp Arg His
1 5 10 15
Arg Thr Glu Leu Gln Asn Ser Gly Gln Ser Thr Ala His Val Cys Gln
20 25 30
Arg Pro Phe Gly Arg Met Glu Thr Thr Pro Leu Asn Ser Gln Arg Glu
35 40 45
Leu Ser Ala Tyr Lys Asp Gly Asp Asp Cys Gln Glu Asn Gly Val Leu
50 55 60
Arg Lys Gly Leu Pro Ala Pro Gly Asp Lys Ala Glu Ser Ser His Ile
65 70 75 80
Ser Asn Gly Tyr Ser Ala Val Pro Ser Pro Gly Ala Gly Asp Asp Thr
85 90 95
Gln Asn Ser Ile Pro Ala Ala Thr Thr Ala Leu Val Ala Glu Val His
100 105 110
Pro Gly Glu Arg Glu Thr Trp Gly Lys Lys Val Asp Phe Leu Leu Ser
115 120 125
Val Ile Gly Tyr Ala Val Asp Leu Gly Asn Val Trp Arg Phe Pro Tyr
130 135 140
Ile Cys Tyr Gln Asn Gly Gly Gly Ala Phe Leu Leu Pro Tyr Thr Ile
145 150 155 160
Met Ala Ile Phe Gly Gly Ile Pro Leu Phe Tyr Met Glu Leu Ala Leu
165 170 175
Gly Gln Tyr His Arg Asn Gly Cys Ile Ser Ile Trp Arg Lys Ile Cys
180 185 190
Pro Ile Phe Lys Gly Ile Gly Phe Ala Ile Cys Val Ile Ala Phe Tyr
195 200 205
Ile Ala Ser Tyr Tyr Asn Thr Ile Met Ala Trp Ala Leu Tyr Tyr Leu
210 215 220
Ile Ser Ser Phe Thr Asp Gln Leu Pro Trp Thr Ser Cys Lys Asn Ser
225 230 235 240
Trp Asn Thr Gly Asn Cys Thr Asn Tyr Phe Ser Glu Asp Asn Val Thr
245 250 255
Trp Met Leu His Ser Thr Ser Pro Ala Glu Glu Phe Tyr Thr Arg His
260 265 270
Val Leu Gln Ile His Arg Ser Lys Gly Leu Gln Asp Leu Gly Gly Ile
275 280 285
Ser Trp Gln Leu Ala Leu Cys Ile Met Leu Ile Phe Thr Ile Ile Tyr
290 295 300
Phe Ser Ile Trp Lys Gly Val Lys Thr Ser Gly Lys Val Val Trp Val
305 310 315 320
Thr Ala Thr Phe Pro Tyr Ile Ile Leu Ser Ile Leu Leu Val Arg Gly
325 330 335
Ala Thr Leu Pro Gly Ala Trp Arg Gly Val Leu Phe Tyr Leu Lys Pro
340 345 350
Asn Trp Gln Lys Leu Leu Glu Thr Gly Val Trp Val Asp Ala Ala Ala
355 360 365
Gln Ile Phe Phe Ser Leu Gly Pro Gly Phe Gly Val Leu Leu Ala Phe
370 375 380
Ala Ser Tyr Asn Lys Phe Asn Asn Asn Cys Tyr Gln Asp Ala Leu Val
385 390 395 400
Thr Ser Val Val Asn Cys Met Thr Ser Phe Val Ser Gly Phe Val Ile
405 410 415
Phe Thr Val Leu Gly Tyr Met Ala Glu Met Arg Asn Glu Asp Val Ser
420 425 430
Glu Val Ala Lys Asp Ala Gly Pro Ser Leu Leu Phe Ile Thr Tyr Ala
435 440 445
Glu Ala Ile Ala Asn Met Pro Ala Ser Thr Phe Phe Ala Ile Ile Phe
450 455 460
Phe Leu Met Leu Ile Thr Leu Gly Leu Asp Ser Thr Phe Ala Gly Leu
465 470 475 480
Glu Gly Val Ile Thr Ala Val Leu Asp Glu Phe Pro His Phe Trp Ser
485 490 495
Lys Arg Arg Glu Trp Leu Ala Leu Gly Val Val Ile Thr Cys Phe Leu
500 505 510
Gly Ser Leu Ile Thr Leu Thr Phe Gly Gly Ala Tyr Val Val Lys Leu
515 520 525
Leu Glu Glu Phe Ala Thr Gly Pro Ala Val Leu Thr Val Ala Leu Ile
530 535 540
Glu Ala Val Ala Val Phe Trp Phe Tyr Gly Ile Thr Gln Phe Cys Ser
545 550 555 560
Asp Val Lys Glu Met Leu Gly Phe Ser Pro Gly Trp Phe Trp Arg Ile
565 570 575
Cys Trp Val Ala Ile Ser Pro Leu Phe Leu Val Phe Ile Ile Cys Ser
580 585 590
Phe Leu Met Ser Pro Pro Gln Leu Trp Leu Phe Gln Tyr Asn Tyr Pro
595 600 605
Gln Trp Ser Ile Ile Leu Gly Tyr Cys Ile Gly Thr Ser Ser Phe Ile
610 615 620
Cys Ile Pro Thr Tyr Ile Thr Tyr Arg Leu Ile Ile Thr Pro Gly Thr
625 630 635 640
Leu Lys Glu Arg Ile Val Lys Gly Ile Thr Pro Glu Thr Pro Thr Glu
645 650 655
Ile Pro Cys Gly Asp Ile Arg Leu Asn Ala Val
660 665
<210> 36
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 36
gcgtttacct tgttggcact 20
<210> 37
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 37
gagcatccag gtgacgttgt 20
<210> 38
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 38
gtagaaggcg atgacgcaga 20
<210> 39
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 39
catgatggtg ttgtagtagg 20
<210> 40
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 40
ctcctactac aacaccatca 20
<210> 41
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 41
ctacaacacc atcatggcct 20
<210> 42
<211> 100
<212> RNA
<213> Artificial sequence (Artificial Sequence)
<400> 42
guagaaggcg augacgcaga guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 43
<211> 100
<212> RNA
<213> Artificial sequence (Artificial Sequence)
<400> 43
caugauggug uuguaguagg guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 44
<211> 100
<212> RNA
<213> Artificial sequence (Artificial Sequence)
<400> 44
cuccuacuac aacaccauca guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 45
<211> 100
<212> RNA
<213> Artificial sequence (Artificial Sequence)
<400> 45
cuacaacacc aucauggccu guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 46
<211> 24
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 46
caccgtagaa ggcgatgacg caga 24
<210> 47
<211> 24
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 47
aaactctgcg tcatcgcctt ctac 24
<210> 48
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 48
caccgcatga tggtgttgta gtagg 25
<210> 49
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 49
aaaccctact acaacaccat catgc 25
<210> 50
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 50
caccgctcct actacaacac catca 25
<210> 51
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 51
aaactgatgg tgttgtagta ggagc 25
<210> 52
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 52
caccgctaca acaccatcat ggcct 25
<210> 53
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 53
aaacaggcca tgatggtgtt gtagc 25

Claims (3)

1. A CRISPR/Cas9 system for pig BDNF and SLC6A4 gene editing, characterized by comprising a Cas9 expression vector, a gRNA expression vector for pig BDNF gene and a gRNA expression vector for pig SLC6A4 gene; the Cas9 expression vector is a pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO vector with a plasmid full sequence shown as SEQ ID NO. 2; the vector skeletons of the gRNA expression vector aiming at the pig BDNF gene and the gRNA expression vector aiming at the pig SLC6A4 gene are pKG-U6gRNA, and the plasmid full sequence is shown as SEQ ID NO. 3; the gRNA expressed by SEQ ID NO.22 is expressed by a gRNA expression vector aiming at the pig BDNF gene, and the target point is shown as SEQ ID NO. 18; the gRNA expressed by SEQ ID NO.43 is expressed by a gRNA expression vector aiming at the SLC6A4 gene of the pig, and the target point is shown as SEQ ID NO. 39; the gRNA expression vector for the pig BDNF gene is obtained by double-stranded insert vector skeleton pKG-U6gRNA formed by annealing single-stranded DNA shown in SEQ ID NO.26 and SEQ ID NO. 27; the gRNA expression vector for the SLC6A4 gene of the pig is obtained by double-stranded insert vector skeleton pKG-U6gRNA formed by annealing single-stranded DNA shown in SEQ ID NO.48 and SEQ ID NO. 49; gRNA expression vector for swine BDNF gene: gRNA expression vector for the porcine SLC6A4 gene: the mass ratio of Cas9 expression vector is 0.47:0.47:1.06.
2. Use of the CRISPR/Cas9 system of claim 1 in constructing BDNF and SLC6A4 gene mutated porcine recombinant cells.
3. A recombinant cell obtained by the validation of a CRISPR/Cas9 system co-transfected porcine primary fibroblast cell according to claim 1.
CN202110065686.2A 2021-01-08 2021-01-19 CRISPR system for double-target gene editing and application thereof in construction of depressive pig nuclear transfer donor cells Active CN113584078B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110025593 2021-01-08
CN2021100255937 2021-01-08

Publications (2)

Publication Number Publication Date
CN113584078A CN113584078A (en) 2021-11-02
CN113584078B true CN113584078B (en) 2023-06-20

Family

ID=78238093

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110065686.2A Active CN113584078B (en) 2021-01-08 2021-01-19 CRISPR system for double-target gene editing and application thereof in construction of depressive pig nuclear transfer donor cells

Country Status (1)

Country Link
CN (1) CN113584078B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106191113A (en) * 2016-07-29 2016-12-07 中国农业大学 A kind of preparation method of MC3R gene knock-out pig
WO2019236082A1 (en) * 2018-06-06 2019-12-12 The Regents Of The University Of California Neural stem cell compositions and methods to treat neurodegenerative disorders
CN111484994A (en) * 2019-01-29 2020-08-04 四川大学华西医院 Method for specifically knocking out pig Fah and Rag2 double genes by CRISPR-Cas9

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106191113A (en) * 2016-07-29 2016-12-07 中国农业大学 A kind of preparation method of MC3R gene knock-out pig
WO2019236082A1 (en) * 2018-06-06 2019-12-12 The Regents Of The University Of California Neural stem cell compositions and methods to treat neurodegenerative disorders
CN111484994A (en) * 2019-01-29 2020-08-04 四川大学华西医院 Method for specifically knocking out pig Fah and Rag2 double genes by CRISPR-Cas9

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
CRISPR/Cas9基因编辑系统中两种gRNA活性检测方法比较;刘燕等;心肺血管病杂志;35(07);第563-568页 *
Dennis L. Murphy et al..Targeting the murine serotonin transporter: insights into human neurobiology.NATURE REVIEWS.2008,9第85-96页. *
Loss of Brain-Derived Neurotrophic Factor Gene Allele Exacerbates Brain Monoamine Deficiencies and Increases Stress Abnormalities of Serotonin Transporter Knockout Mice;Renee F. Ren-Patterson et al.;Journal of Neuroscience Research;79;第756-771页 *

Also Published As

Publication number Publication date
CN113584078A (en) 2021-11-02

Similar Documents

Publication Publication Date Title
CN112779292B (en) Method for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage and rapid growth and capable of resisting blue ear diseases and serial diarrhea diseases and application of donor cells
CN112779291B (en) Method for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage, fast growth, high reproductive capacity and resistance to series epidemic diseases and application thereof
CN112522260B (en) CRISPR system and application thereof in preparing TTN gene mutation dilated cardiomyopathy clone pig nuclear donor cells
CN112877362A (en) Gene editing system for constructing high-quality porcine nuclear transplantation donor cells with high fertility and capability of resisting porcine reproductive and respiratory syndrome and serial diarrhea diseases and application of gene editing system
CN113046388B (en) CRISPR system for constructing atherosclerosis pig nuclear transfer donor cells with double genes in combined knockout mode and application of CRISPR system
CN112522264B (en) CRISPR/Cas9 system causing congenital deafness and application thereof in preparation of model pig nuclear donor cells
CN112522313B (en) CRISPR/Cas9 system for constructing depression cloned pig nuclear donor cells with TPH2 gene mutation
CN112522261B (en) CRISPR system for preparing LMNA gene mutation dilated cardiomyopathy clone pig nuclear donor cell and application thereof
CN114958762B (en) Method for constructing nerve tissue specific overexpression humanized SNCA parkinsonism model pig and application
CN113584078B (en) CRISPR system for double-target gene editing and application thereof in construction of depressive pig nuclear transfer donor cells
CN112522311B (en) CRISPR system for ADCY3 gene editing and application thereof in construction of obese pig nuclear transfer donor cells
CN112608941B (en) CRISPR system for constructing obese pig nuclear transplantation donor cells with MC4R gene mutation and application of CRISPR system
CN112813101B (en) Gene editing system for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage and rapid growth and application thereof
CN112795566B (en) OPG gene editing system for constructing osteoporosis clone pig nuclear donor cell line and application thereof
CN112899306B (en) CRISPR system and application thereof in construction of GABRG2 gene mutation cloned pig nuclear donor cells
CN112575033B (en) CRISPR system and application thereof in construction of SCN1A gene mutated epileptic encephalopathy clone pig nuclear donor cell
CN112680453B (en) CRISPR system and application thereof in construction of STXBP1 mutant epileptic encephalopathy clone pig nuclear donor cell
CN112680444B (en) CRISPR system for OCA2 gene mutation and application thereof in construction of albino clone pig nuclear donor cells
CN112522258B (en) Recombinant cell with IL2RG gene and ADA gene knocked out in combined mode and application of recombinant cell in preparation of immunodeficiency pig model
CN112522202B (en) Method for preparing ADDI four-gene combined knockout severe immunodeficiency swine-derived recombinant cell and special kit thereof
CN112522309B (en) Severe immunodeficiency pig source recombinant cell, preparation method and kit thereof
CN112522256B (en) CRISPR/Cas9 system and application thereof in construction of dystrophin gene-deficient porcine recombinant cells
CN112522257B (en) System for preparing severe immunodeficiency pig source recombinant cells with RRIP four genes knocked out in combined mode
CN112522255B (en) CRISPR/Cas9 system and application thereof in construction of porcine recombinant cell with insulin receptor substrate gene defect
CN112877359A (en) CRISPR/cas system and application thereof in constructing INHA (INHA-mutated high-fertility porcine nuclear transfer donor cells)

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant