CN112779292B - Method for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage and rapid growth and capable of resisting blue ear diseases and serial diarrhea diseases and application of donor cells - Google Patents

Method for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage and rapid growth and capable of resisting blue ear diseases and serial diarrhea diseases and application of donor cells Download PDF

Info

Publication number
CN112779292B
CN112779292B CN202110197015.1A CN202110197015A CN112779292B CN 112779292 B CN112779292 B CN 112779292B CN 202110197015 A CN202110197015 A CN 202110197015A CN 112779292 B CN112779292 B CN 112779292B
Authority
CN
China
Prior art keywords
gene
grna
leu
seq
ser
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110197015.1A
Other languages
Chinese (zh)
Other versions
CN112779292A (en
Inventor
牛冬
汪滔
马翔
曾为俊
刘璐
王磊
程锐
赵泽英
段星
陶裴裴
黄彩云
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Qizhen Genetic Engineering Co Ltd
Original Assignee
Nanjing Qizhen Genetic Engineering Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Qizhen Genetic Engineering Co Ltd filed Critical Nanjing Qizhen Genetic Engineering Co Ltd
Priority to CN202110197015.1A priority Critical patent/CN112779292B/en
Publication of CN112779292A publication Critical patent/CN112779292A/en
Application granted granted Critical
Publication of CN112779292B publication Critical patent/CN112779292B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • C12N15/8509Vectors or expression systems specially adapted for eukaryotic hosts for animal cells for producing genetically modified animals, e.g. transgenic
    • AHUMAN NECESSITIES
    • A01AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
    • A01KANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
    • A01K67/00Rearing or breeding animals, not otherwise provided for; New breeds of animals
    • A01K67/027New breeds of vertebrates
    • A01K67/0275Genetically modified vertebrates, e.g. transgenic
    • A01K67/0276Knockout animals
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/65Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression using markers
    • AHUMAN NECESSITIES
    • A01AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
    • A01KANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
    • A01K2227/00Animals characterised by species
    • A01K2227/10Mammal
    • A01K2227/108Swine
    • AHUMAN NECESSITIES
    • A01AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
    • A01KANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
    • A01K2267/00Animals characterised by purpose
    • A01K2267/02Animal zootechnically ameliorated
    • AHUMAN NECESSITIES
    • A01AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
    • A01KANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
    • A01K2267/00Animals characterised by purpose
    • A01K2267/02Animal zootechnically ameliorated
    • A01K2267/025Animal producing cells or organs for transplantation
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2310/00Structure or type of the nucleic acid
    • C12N2310/10Type of nucleic acid
    • C12N2310/20Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]

Abstract

The invention discloses a method for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage, fast growth and resistance to blue ear diseases and series diarrhea diseases and application thereof. A CRISPR/Cas9 system for pig MSTN-SST-CD163-pAPN four-gene editing comprises a Cas9 expression vector, a gRNA expression vector aiming at pig MSTN gene, SST gene, CD163 gene and pAPN gene; the full sequence of the plasmid of the Cas9 expression vector is shown as SEQ ID NO. 2. The invention designs corresponding gRNA expression vectors aiming at different target points of MSTN, SST, CD163 and pAPN genes respectively, and obtains gRNA with higher editing efficiency and the expression vector thereof by screening. The modified Cas9 high-efficiency expression vector is matched for gene editing, and the editing efficiency is obviously improved compared with that of the original vector.

Description

Method for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage and rapid growth and capable of resisting blue ear diseases and serial diarrhea diseases and application of donor cells
Technical Field
The invention belongs to the technical field of biology, and relates to a method for constructing high-quality porcine nuclear transfer donor cells with high lean meat percentage and fast growth and resistance to porcine reproductive and respiratory syndrome and serial diarrhea diseases and application thereof, in particular to a CRISPR/Cas9 system for MSTN, SST, CD163 and pAPN four-gene editing and application thereof in constructing high-quality porcine nuclear transfer donor cells with high lean meat percentage and fast growth and resistance to porcine reproductive and respiratory syndrome and serial diarrhea diseases.
Background
The pig is one of domestic animals domesticated in the earliest time in China, and is an important meat animal for human beings in the historical long river. Chinese people like to eat pork, which is related to the diet culture of thousands of years. Since 2000 years, pork accounts for over 70% of the meat consumption in China, and is the most important meat consumed in China. At present, the price of the lean meat in cities in China is nearly 1 time higher than that of fat meat, lean meat type pigs with the lean meat percentage of more than 60% are favored, and the lean meat percentage character has a space for further improving. On the other hand, the weight of the fattened pigs in China is about 120kg in slaughter at present, the pigs only need 180 days from birth to slaughter, and if the growth speed of the pigs can be increased, the slaughter time of the fattened pigs can be shortened. Porcine reproductive and respiratory syndrome (porcine reproductive and respiratory syndrome), transmissible gastroenteritis, epidemic diarrhea and the like are the most common infectious diseases of live pigs under the intensive breeding production condition, and the fatality rate of the diseases is extremely high, which causes great economic loss to the pig raising production. Therefore, the method for breeding the high-quality pig strain with higher lean meat percentage, higher growth speed and resistance to the blue ear disease and the serial diarrhea diseases can bring more economic benefits for the pig industry.
Myostatin (MSTN) belongs to TGF- β superfamily, is a negative regulator of skeletal muscle growth, and is involved in regulation of muscle fiber proliferation and hypertrophy, and its deletion or mutation can cause double muscle phenomenon. The discovery of the gene has great significance for the animal husbandry and the medical field. Somatostatin (SST) is a protein that inhibits the release of hormones such as pituitary growth hormone, thyroid stimulating hormone, adrenocorticotropic hormone, and the like, and is capable of inhibiting growth by inhibiting a range of growth-related hormones. The existing research shows that the inhibition of the expression of MSTN can increase muscle deposition and the lean meat percentage of pigs to a certain degree, while the inhibition of the expression of SST can accelerate the growth speed of the pigs to a certain degree and reduce the feed conversion ratio. CD163 has been shown to be a receptor for Porcine Reproductive and Respiratory Syndrome Virus (PRRSV), and the major interaction with PRRSV is Ligand-binding pocket (LBP) in the SRCR5 domain of the CD163 receptor, whose coding region is located in exon 7 of porcine CD163, and since other parts of the SRCR5 domain still have important biological functions, editing only exon 7 of CD163 maximizes the biological functions of CD163 under conditions that disrupt the viral interaction region. Porcine aminopeptidase N (pAPN) has been shown to be a specific receptor for porcine transmissible gastroenteritis virus (TGEV) and has also been shown to be one of the receptors for Porcine Epidemic Diarrhea Virus (PEDV) and porcine delta coronavirus (PDCoV), which is abundantly expressed on the brush border of the porcine small intestine. The pAPN gene is damaged to inactivate the encoded receptor protein, TGEV cannot infect live pigs, and the infection of PEDV and PDCoV viruses to the pigs is greatly reduced. TGEV and PEDV belong to the genus alphacoronavirus, while PDCoV belongs to the genus delta coronavirus, pAPN being a common receptor for these three coronaviruses.
Gene editing is a biotechnology that has been greatly developed in recent years, and includes editing technologies from homologous recombination-based gene editing to nuclease-based ZFNs, TALENs, CRISPR/Cas9, and the like, wherein CRISPR/Cas9 technology is currently the most advanced gene editing technology. Therefore, the invention adopts CRISPR/cas9 technology to carry out mutation of four genes of MSTN, SST, CD163 and pAPN, obtains single cell clone with the combined knockout of the four genes of MSTN, SST, CD163 and pAPN, and lays a foundation for later breeding high-production-performance high-quality disease-resistant pig breeds with high lean meat percentage, fast growth, blue ear disease resistance and series diarrhea resistance through somatic cell nuclear transfer animal cloning technology. The loss of functions of MSTN, SST, CD163 and pAPN can effectively improve the lean meat percentage and the growth speed of pigs and increase the economic benefit of the pig industry on one hand; on the other hand, the pig feed additive can effectively improve the resistance of pigs to the blue ear disease and the serial diarrhea, reduce the death rate of live pigs and reduce the loss of the pig industry.
Disclosure of Invention
The invention aims to provide a CRISPR/Cas9 system for porcine MSTN-SST-CD163-pAPN four-gene editing.
It is another object of the invention to provide an application of the CRISPR/Cas9 system.
Still another object of the present invention is to provide a recombinant cell and use thereof.
The purpose of the invention can be realized by the following technical scheme:
a CRISPR/Cas9 system for editing four genes of pig MSTN-SST-CD163-pAPN comprises a Cas9 expression vector, a gRNA expression vector aiming at pig MSTN gene, a gRNA expression vector aiming at pig SST gene, a gRNA expression vector aiming at pig CD163 gene and a gRNA expression vector aiming at pig pAPN gene.
Preferably, the whole plasmid sequence of the Cas9 expression vector (named as pKG-GE 3) is shown in SEQ ID NO. 2. The Cas9 vector comprises nucleotide sequences for encoding Cas9 protein, EGFP and Puro resistance protein, wherein the Cas9 vector further comprises an EF1a promoter, a WPRE element and a 3' LTR sequence element, and preferably, the nucleotide sequences of the Cas vector are as follows from 5' to 3 ': a CMV enhancer, an EF1a promoter, a nuclear localization signal, a nucleotide sequence encoding a Cas9 protein, a nuclear localization signal, a nucleotide sequence encoding a self-splicing polypeptide P2A, a nucleotide sequence encoding an EGFP, a nucleotide sequence encoding a self-cleaving polypeptide T2A, a nucleotide sequence encoding a Puro resistance protein, a WPRE sequence element, a 3' ltr sequence element, and a polyA signal sequence element.
In order to increase the gene editing capacity of the Cas9 vector, the invention obtains pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO (namely the pKG-GE3 vector) by modifying pX330-U6-Chimeric _ BB-CBh-hSpCas9 (PX 330 for short) on the basis of the addge (Plasmid #42230, from Zhang Feng lab) vector. The map of PX330 is shown in FIG. 1, and the modification mode is as follows:
1) Removing redundant invalid sequences in the original vector gRNA framework;
2) Modifying a promoter: the original promoter (chicken beta-actin promoter) is transformed into an EF1a promoter with higher expression activity, so that the protein expression capacity of the Cas9 gene is increased;
3) Increase of nuclear localization signal: a nuclear localization signal coding sequence (NLS) is added at the N end and the C end of the Cas9, and the nuclear localization capability of the Cas9 is improved;
4) Adding double screening marks: the original vector does not have any screening marker, is not beneficial to screening and enriching of positive transformed cells, and is inserted with P2A-EGFP-T2A-PURO at the C end of Cas9 to endow the vector with fluorescence and resistance screening capability;
5) Inserting WPRE and 3' LTR and other regulatory gene expression sequences: WPRE, 3' LTR and other sequences are inserted into the last reading frame of the gene, so that the protein translation capability of the Cas9 gene can be enhanced.
The modified vector pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO (pKG-GE 3 for short) and the modified site are shown in figure 2, and the whole sequence of the plasmid is shown in SEQ ID NO:2 is shown in the specification; the main elements of pKG-GE3 are:
1) gRNA expression elements: u6gRNA scaffold;
2) A promoter: the EF1a promoter and CMV enhancer;
3) Cas9 gene containing multiple NLS: a Cas9 gene containing N-terminal and C-terminal multinuclear localization signals (NLS);
4) Screening for marker genes: a fluorescent and resistant double-screening marker element P2A-EGFP-T2A-PURO;
5) Elements that enhance translation: WPRE and 3' LTR enhance the translation efficiency of Cas9 and the screening marker gene;
6) Transcription termination signal: a bGHpolyA signal;
7) Carrier skeleton: including Amp resistance elements and ori replicons and the like.
The plasmid pKG-GE3 has a specific fusion gene; the specific fusion gene encodes a specific fusion protein;
the specific fusion protein sequentially comprises the following elements from N end to C end: two Nuclear Localization Signals (NLS), cas9 protein, two nuclear localization signals, self-splicing polypeptide P2A, fluorescent reporter protein, self-cleavage polypeptide T2A and resistance screening marker protein;
in the plasmid pKG-GE3, the expression of the specific fusion gene is started by the EF1a promoter;
the plasmid pKG-GE3 has a WPRE sequence element, a 3' LTR sequence element and a bGH poly (A) signal sequence element downstream of the specific fusion gene.
The plasmid pKG-GE3 comprises the following elements in this order: CMV enhancer, EF1a promoter, the specific fusion gene, WPRE sequence element, 3' LTR sequence element, bGH poly (A) signal sequence element.
In the specific fusion protein, two nuclear localization signals at the upstream of the Cas9 protein are SV40 nuclear localization signals, and two nuclear localization signals at the downstream of the Cas9 protein are nucleoplamin nuclear localization signals.
In the specific fusion protein, the fluorescent reporter protein can be EGFP protein.
In the specific fusion protein, the resistance screening marker protein can be Puromycin resistance protein.
The amino acid sequence of self-cleaving polypeptide P2A is "ATNFSLLKQAGDVEENPGP" (the cleavage site that occurs from cleavage is between the first amino acid residue and the second amino acid residue from the C-terminus).
The amino acid sequence of the self-cleaving polypeptide T2A is "EGRGSLLTCGDVEENPGP" (the cleavage site that occurs from the cleavage is between the first and second amino acid residues from the C-terminus).
The specific fusion gene is specifically shown in SEQ ID NO:2 from nucleotide 911 to nucleotide 6706.
The CMV enhancer is as set forth in SEQ ID NO:2 from nucleotide 395 to nucleotide 680.
The EF1a promoter is shown as SEQ ID NO:2 from nucleotide 682 to nucleotide 890.
The WPRE sequence element is shown as SEQ ID NO:2 from 6722 to 7310.
3' LTR sequence element is shown as SEQ ID NO:2 from nucleotide 7382 to nucleotide 7615.
bGH poly (a) signal sequence element is as set forth in SEQ ID NO:2, nucleotides 7647-7871.
Preferably, the pKG-GE3 plasmid is a circular plasmid.
As a preferred choice of the invention, in the CRISPR/Cas9 system, vector skeletons of a gRNA expression vector for a pig MSTN gene, a gRNA expression vector for a pig SST gene, a gRNA expression vector for a pig CD163 gene and a gRNA expression vector for a pig pAPN gene are all pKG-U6 gRNAs, and the whole sequence of the plasmid is shown in SEQ ID No. 3.
As further optimization of the invention, the gRNA shown in SEQ ID No.18 is expressed by a gRNA expression vector of the pig MSTN gene, and a target point of the gRNA is shown in SEQ ID No. 15; a gRNA expression vector aiming at a pig SST gene expresses a gRNA shown as SEQ ID No.24, and the target point of the gRNA is shown as SEQ ID No. 21; a gRNA expression vector aiming at the pig CD163 gene expresses a gRNA shown in SEQ ID NO.30, and the target point of the gRNA is shown in SEQ ID NO. 27; (ii) a The gRNA expression vector for the porcine pAPN gene expresses gRNA shown in SEQ ID NO.36, and the target point is shown in SEQ ID NO. 33.
As a further preferred mode of the invention, the gRNA expression vector for the pig MSTN gene is obtained by inserting a double chain formed by annealing single-stranded DNA shown in SEQ ID NO.16 and SEQ ID NO.17 into a vector skeleton pKG-U6 gRNA; the gRNA expression vector aiming at the pig SST gene is obtained by annealing single-stranded DNA shown in SEQ ID NO.22 and SEQ ID NO.23 to form a double-stranded vector skeleton pKG-U6 gRNA; the gRNA expression vector for the pig CD163 gene is obtained by annealing single-stranded DNA shown in SEQ ID NO.28 and SEQ ID NO.29 to form a double-stranded vector skeleton pKG-U6 gRNA; the gRNA expression vector for the pAPN gene of the pig is obtained by inserting a double chain formed by annealing single-chain DNA shown in SEQ ID NO.34 and SEQ ID NO.35 into a vector skeleton pKG-U6 gRNA.
The CRISPR/Cas9 system disclosed by the invention is applied to construction of the pig recombinant cell with MSTN, SST, CD163 and pAPN four-gene mutation.
A recombinant cell is obtained by carrying out cotransfection on a primary pig fibroblast by using the CRISPR/Cas9 system disclosed by the invention through verification.
The recombinant cell is applied to construction of cloned pigs with four knockout genes of MSTN, SST, CD163 and pAPN.
Compared with the prior art, the invention has at least the following beneficial effects:
(1) The subject of the invention (pig) has better applicability than other animals (rats, mice, primates).
Rodents such as rats and mice have great differences from humans in body types, organ sizes, physiology, pathology and the like, and cannot truly simulate normal physiological and pathological states of humans. Studies have shown that over 95% of drugs validated to be effective in large mice are not effective in human clinical trials. In large animals, primates are animals that have a close relationship with humans, but are small in size, late in sexual maturity (mating starts at age 6-7), and are single-birth animals, and the population propagation speed is extremely slow, and the raising cost is high. In addition, the cloning efficiency of the primate is low, the difficulty is high and the cost is high.
However, pigs, which are animals related to humans other than primates, do not have the above-mentioned disadvantages, and have body types, body weights, organ sizes, and the like similar to those of humans, and are very similar to those of humans in terms of anatomy, physiology, immunology, nutritional metabolism, disease pathogenesis, and the like. Meanwhile, the pigs have early sexual maturity (4-6 months), high reproductive capacity and multiple piglets, and can form a large group within 2-3 years. In addition, the cloning technology of the pig is very mature, and the cloning and feeding cost is much lower than that of a primate. Pigs are therefore very suitable animals as models for human diseases.
(2) Experiments prove that compared with a pX330 vector before modification, the modified pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO vector has the advantages that a stronger promoter is replaced, an element for enhancing protein translation is added, the Cas9 expression is improved, the number of nuclear localization signals is increased, the nuclear localization capability of the Cas9 protein is improved, and the gene editing efficiency is higher. The invention also adds fluorescent mark and resistance mark in the carrier, which is more convenient to be applied to the screening and enrichment of the positive transformation cell of the carrier. The editing efficiency of the Cas9 high-efficiency expression vector jointly modified by the gRNA screened by the invention is improved by more than 100 percent compared with the original vector.
(3) The invention designs corresponding expression vectors aiming at different target gRNAs of MSTN gene, SST gene, CD163 gene and pAPN gene respectively, and obtains gRNAs with higher editing efficiency and expression vectors thereof by screening. The modified Cas9 efficient expression vector is matched for gene editing, the genotype of the obtained cells can be analyzed through the sequencing result of the target gene PCR product, the probability of obtaining single target gene mutation is 15-29%, and the probability is greatly superior to the probability of obtaining mutation in a gene editing delivery method (namely injecting a gene editing material into fertilized eggs) by using an embryo injection technology.
(4) The obtained mutant unicellular clone strain is used for somatic cell nuclear transfer animal cloning to directly obtain the cloned pig containing target gene mutation, and the mutation can be stably inherited.
The invention adopts the method of primary cell in vitro editing with great technical difficulty and high challenge and screening positive editing single cell clone, and directly obtains the corresponding gene editing pig through somatic cell nuclear transfer animal cloning technology at the later stage, thereby greatly shortening the manufacturing period of the gene editing pig and saving manpower, material resources and financial resources.
Drawings
FIG. 1 is a schematic diagram of the structure of plasmid pX 330.
Fig. 2 is a schematic structural diagram of plasmid pU6gRNACas 9.
FIG. 3 is a structural map of pU6gRNA-eEF1a Cas9 vector.
FIG. 4 is a pU6gRNA-eEF1a Cas9+ nNLS vector map.
FIG. 5 is a schematic diagram of the structure of plasmid pKG-GE3.
FIG. 6 is a schematic structural diagram of plasmid pKG-U6 gRNA.
FIG. 7 is a schematic diagram showing the insertion of a DNA molecule of about 20bp (a target sequence binding region for transcription to form a gRNA) into a plasmid pKG-U6 gRNA.
FIG. 8 is a graph of the sequencing peaks of step 2.3.3 in example 2.
FIG. 9 is a graph of the sequencing peaks of step 2.4.3 in example 2.
FIG. 10 is an electrophoretogram obtained after PCR amplification of 18 pig genomic DNAs as templates in step 3.1 of example 3 using a primer set consisting of MSTN-JDF102/MSTN-JDR429.
FIG. 11 is a graph of the sequencing peaks in step 3.4 of example 3.
FIG. 12 is an electrophoretogram obtained after PCR amplification of 18 pig genomic DNAs as templates and primer pairs SST-JDF290/SST-JDR689 in step 4.1 of example 4.
FIG. 13 is a graph of the sequencing peaks in step 4.4 of example 4.
FIG. 14 is an electrophoretogram obtained after PCR amplification of 18 pig genomic DNAs as templates using a primer set consisting of CD163-JDF121/CD163-JDR518, step 5.1 of example 5.
FIG. 15 is a graph of the sequencing peaks in step 5.4 of example 5.
FIG. 16 is an electrophoretogram obtained after PCR amplification of 18 pig genomic DNAs as templates using a primer set consisting of APN-JDF94/APN-JDR656 in step 6.1 of example 6.
FIG. 17 is a plot of the sequencing peaks in step 6.4 of example 6.
FIG. 18 is an electrophoretogram obtained after PCR amplification using a primer pair consisting of MSTN-JDF102/MSTN-JDR429 and genomic DNA as a template in step 7.4.4 of example 7.
FIG. 19 is an electrophoretogram after PCR amplification using a primer pair composed of SST-JDF290/SST-JDR689 with genomic DNA as a template in step 7.4.4 of example 7.
FIG. 20 is an electrophoretogram obtained after PCR amplification using a primer pair consisting of CD163-JDF121/CD163-JDR518 using genomic DNA as a template in step 7.4.4 of example 7.
FIG. 21 is an electrophoretogram obtained after PCR amplification of a primer pair consisting of APN-JDF94/APN-JDR656 using genomic DNA as a template in step 7.4.4 of example 7.
FIG. 22 is an exemplary sequencing peak plot for the determination of the target gene as wild-type at step 7.4.5.1 in example 7.
FIG. 23 is a diagram showing exemplary sequencing peaks for determining that the target I gene is a heterozygous mutant type at step 7.4.5.1 in example 7.
FIG. 24 is a graph of exemplary sequencing peaks for homozygous mutants determined to have biallelic identity variations in the target gene at step 7.4.5.1 of example 7.
FIG. 25 is a graph of exemplary sequencing peaks for homozygous mutants determined to have biallelic variant in step 7.4.5.1 of example 7.
FIG. 26 is an exemplary sequencing peak plot for the determination of the target gene as wild-type at step 7.4.5.2 in example 7.
FIG. 27 is a diagram of exemplary sequencing peaks for the determination of the target gene as being a hybrid mutant at step 7.4.5.2 in example 7.
FIG. 28 is a diagram of exemplary sequencing peaks for determining homozygous mutant type of the target gene for biallelic identity variation at 7.4.5.2 in example 7.
FIG. 29 is a diagram of exemplary sequencing peaks for homozygous mutant type in example 7 at 7.4.5.2 for determination of biallelic variant of the target gene.
FIG. 30 is an exemplary sequencing peak plot for the determination of the target gene as wild-type at step 7.4.5.3 in example 7.
FIG. 31 is a diagram of exemplary sequencing peaks for determining that the target gene is a hybrid mutant type at step 7.4.5.3 in example 7.
FIG. 32 is a diagram of exemplary sequencing peaks for step 7.4.5.3 of example 7 to determine whether the target gene is homozygous mutant with biallelic variation.
FIG. 33 is a diagram of exemplary sequencing peaks for homozygous mutant type in example 7 at 7.4.5.3 for determination of biallelic variant of the target gene.
FIG. 34 is an exemplary sequencing peak plot for the determination of the target gene as wild-type at step 7.4.5.4 in example 7.
FIG. 35 is a diagram of exemplary sequencing peaks for the determination of the target gene as being a hybrid mutant at step 7.4.5.4 in example 7.
FIG. 36 is a diagram of exemplary sequencing peaks for determining homozygous mutant type of the target gene for biallelic identity variation at 7.4.5.4 in example 7.
FIG. 37 is a diagram of exemplary sequencing peaks for homozygous mutant type in example 7 at 7.4.5.4 for determination of biallelic variant of the target gene.
Detailed Description
Example 1 construction of plasmids
1.1 construction of plasmid pU6gRNA eEF1a-mNLS-hSpCas9-EGFP-PURO (plasmid pKG-GE3 for short)
The original plasmid pX330-U6-Chimeric _ BB-CBh-hSpCas9 (plasmid pX330 for short) has the sequence shown in SEQ ID NO. 1. The structure of plasmid pX330 is schematically shown in FIG. 1. In SEQ ID NO.1, nucleotides 440 to 725 constitute the CMV enhancer, nucleotides 727 to 1208 constitute the chicken β -actin promoter, nucleotides 1304 to 1324 encode the SV40 Nuclear Localization Signal (NLS), nucleotides 1325 to 5449 encode the Cas9 protein, and nucleotides 5450 to 5497 encode the nucleoplasmin Nuclear Localization Signal (NLS).
The plasmid pU6gRNA eEF1a-mNLS-hSpCas9-EGFP-PURO (figure 5) is called plasmid pKG-GE3 for short, and the nucleotide is shown in SEQ ID NO. 2. Compared with plasmid pX330, plasmid pKG-GE3 was mainly modified as follows: (1) removing residual gRNA framework sequences (GTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTTTT) and reducing interference; (2) the original chicken beta-actin promoter is transformed into an EF1a promoter with higher expression activity, so that the protein expression capacity of the Cas9 gene is increased; (3) a nuclear localization signal coding gene (NLS) is added at the upstream and the downstream of the Cas9 gene, so that the nuclear localization capacity of the Cas9 protein is increased; (4) the original plasmid does not have any eukaryotic cell screening marker, is not beneficial to screening and enriching of positive transformed cells, and is sequentially inserted with a P2A-EGFP-T2A-PURO coding gene at the downstream of a Cas9 gene to endow a carrier with fluorescence and eukaryotic cell resistance screening capacity; (5) WPRE element and 3' LTR sequence element are inserted to enhance protein translation capability of Cas9 gene.
The construction method of pKG-GE3 plasmid is as follows:
(1) Removal of redundant null sequences in the gRNA backbone
Plasmid pX330 was digested with BbsI and XbaI, the vector fragment (about 8313 bp) was recovered, an insert 175bp (SEQ ID NO. 4) was synthesized by a multi-fragment recombination method, and the recovered vector fragment was recombined to obtain the pU6gRNAcas9 vector (FIG. 2).
(2) Engineering promoters and enhancers
For the constructed pU6gRNAcas9 vector, xbaI and AgeI endonuclease are used to remove promoter (chicken beta-actin promoter) and enhancer sequence (CMV enhancer), linear vector sequence is recovered about 7650bp, 554bp sequence containing CMV enhancer and EF1a promoter (SEQ ID NO. 5) is synthesized by a multi-fragment recombination method, and the sequence is recombined with the vector pU6gRNAcas9 after enzyme digestion to obtain pU6gRNA-eEF1a Cas9 vector (figure 3).
(3) Cas9 gene N-terminal increasing NLS sequence
The constructed vector pU6gRNA-eEF1a Cas9 is subjected to enzyme digestion by AgeI and BglII, a 7786bp vector sequence is recovered, and a sequence with increased NLS is supplemented to an enzyme digestion site, namely a 447bp Cas9 coding sequence (SEQ ID NO. 6) comprising 2 nuclear localization signals and partial excision is synthesized by a multi-fragment recombination method, and a pU6gRNA-eEF1a Cas9+ nNLS vector is obtained by recombination (figure 4).
(4) Adding NLS, P2A-EGFP-T2A-PURO and WPRE-3' LTR-bGH polyA signal into the C end of Cas9 gene
The constructed vector is named as pU6gRNA-eEF1a Cas9+ nNLS, enzyme digestion is carried out by using FseI and SbfI, the vector sequence 7781bp is recovered, 2727bp fragment (SEQ ID NO. 7) comprising NLS-P2A-EGFP-T2A-PURO-WPRE-3 LTR-bGH polyA signal is synthesized by a multi-fragment recombination method, recombination is carried out with the vector fragment, and the vector pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO, which is called as pKG-GE3 for short, is obtained, wherein the plasmid map is shown in figure 5, and the nucleotide sequence (SEQ ID NO. 2) is shown in the figure.
In SEQ ID NO.2, nucleotides 395 to 680 constitute a CMV enhancer, nucleotides 682 to 890 constitute an EF1a promoter, nucleotides 986 to 1006 encode a Nuclear Localization Signal (NLS), nucleotides 1016 to 1036 encode a Nuclear Localization Signal (NLS), nucleotides 1037 to 5161 encode a Cas9 protein, nucleotides 5162 to 5209 encode a Nuclear Localization Signal (NLS), nucleotides 5219 to 5266 encode a Nuclear Localization Signal (NLS), nucleotides 5276 to 5332 encode a self-cleaving polypeptide P2A (the amino acid sequence of the self-cleaving polypeptide P2A is "ATNFSLLKQAGDVEENPGP", the cleavage site occurring from the cleavage site is between the first amino acid residue and the second amino acid residue from the C-terminus), nucleotides 5333 to 6046 encode an EGFP protein, nucleotides 6056 to 539 encode a self-cleaving polypeptide T2A (the amino acid sequence of the self-cleaving polypeptide T2A is "EGRGSLLTCGDVEENPGP", the cleavage site occurring from the C-terminus and the cleavage site between the first amino acid residue from the C-terminus and the second amino acid residue 677647), nucleotides 677647 to 6747 encode a self-cleaving polypeptide T2A (the nucleotide sequence of the RG 6773, the nucleotide sequence of the polypeptide B761 6773 to 6747) and the nucleotide sequence of the polypeptide B761. In SEQ ID NO.2, 911-6706 forms a fusion gene to express a fusion protein. Due to the presence of the self-cleaving polypeptide P2A and the self-cleaving polypeptide T2A, the fusion protein spontaneously forms the following three proteins: proteins with Cas9 protein, proteins with EGFP protein and proteins with Puro protein.
1.2 construction of pKG-U6gRNA vector
A pUC57 vector is sourced, a pKG-U6gRNA insertion sequence (a DNA fragment containing a U6 promoter, a BbsI enzyme cutting site and a sgRNA framework sequence, the sequence is shown in SEQ ID No. 8) is connected through an EcoRV enzyme cutting site, the pKG-U6gRNA insertion sequence is reversely inserted into a pUC57 vector to obtain a pKG-U6gRNA vector complete sequence (SEQ ID No. 3), in the SEQ ID No.3, nucleotides 2280-2539 form an hU6 promoter, and nucleotides 2558-2637 are used for transcription to form a gRNA framework. In use, a DNA molecule of about 20bp (target sequence binding region for transcription to form a gRNA) (fig. 7) is inserted into a plasmid pKG-U6gRNA (fig. 6) to form a recombinant plasmid, and the recombinant plasmid is transcribed in a cell to obtain a gRNA.
Example 2 plasmid proportioning optimization and comparison of the Effect of plasmid pX330 and plasmid pKG-GE3
2.1 target gRNA design and construction
2.1.1 target gRNA design of RAG1 Gene Using Benchling
RAG1-g4:AGTTATGGCAGAACTCAGTG(SEQ ID NO.9)
The synthesis of the insertion sequence complementary DNA Oligo for the RAG1 gene target is as follows:
RAG1-gRNA4S:caccgAGTTATGGCAGAACTCAGTG(SEQ ID NO.10)
RAG1-gRNA4A:aaacCACTGAGTTCTGCCATAACTc(SEQ ID NO.11)
RAG1-gRNA4S, RAG-gRNA 4A are all single-stranded DNA molecules.
2.1.2 primers designed for amplification and detection of fragments containing the RAG1 gRNA target
RAG1-nF126:CCCCATCCAAAGTTTTTAAAGGA
RAG1-nR525:TGTGGCAGATGTCACAGTTTAGG
2.1.3 Construction and cloning of gRNA recombinant vector
1) Digesting 1ug pKG-U6gRNA plasmid with restriction enzyme BbsI;
2) Separating the digested pKG-U6gRNA plasmid by agarose gel (agarose gel concentration is 1%, namely 1g of agarose gel is added into 100mL of electrophoresis buffer solution), and purifying and recovering the digested product by a gel recovery kit (Vazyme);
3) 2 complementary DNA oligos synthesized from the target of 2.1.1 are annealed to form a DNA double strand complementary to the cleaved sticky end of pKG-U6gRNA vector BbsI, as shown in FIG. 7:
95 deg.C, 5min and then reducing to 25 deg.C at a rate of 5 deg.C/min;
4) The ligation reaction was initiated as follows: reacting at room temperature for 10min
Figure BDA0002947302830000061
Figure BDA0002947302830000071
5) Transformation of
The procedure was followed in accordance with the instructions for competent cells (Vazyme).
2.1.4 gRNA vector construction
1) The synthesized RAG1-gRNA4S and RAG1-gRNA4A were mixed and annealed to obtain a double-stranded DNA molecule having a cohesive end. The double-stranded DNA molecule having the cohesive ends was ligated to a vector backbone to obtain a plasmid pKG-U6gRNA (RAG 1-gRNA 4). Plasmid pKG-U6gRNA (RAG 1-gRNA 4) will express the RAG1-gRNA4 shown in SEQ ID No. 12.
2.1.5 gRNA vector identification
Picking a single clone from an LB flat plate, placing the single clone into an LB culture solution added with corresponding antibiotics, culturing the single clone in a constant temperature shaking table at 37 ℃ for 12-16h, sending the small upgraded grains to a general company for sequencing, and confirming that the RAG1-gRNA4 vector is successfully constructed through sequence comparison.
2.2 preparation of Primary pig fibroblasts
2.2.1 taking 0.5g of ear tissue of the newborn juniperus communis, removing hair and bone tissue, and soaking in 75% of alcohol for 30-40s;
2.2.2 washing 5 times with PBS containing 5%P/S (Gibco Penicillin-Streptomyces) and once with PBS without P/S;
wherein the PBS formula of 5%P/S is: 5%P/S (Gibco Penicillin-Streptomyces) +95% PBS,5%, 95% by volume.
2.2.3 cutting the tissue with scissors, adding 5mL of 0.1% collagenase (Sigma) solution, digesting with a shaker at 37 ℃ for 1h;
2.2.4 500g centrifugation for 5min, supernatant removed, pellet resuspended in 1mL complete medium, plated into 10cm cell culture dish containing 10mL complete medium and sealed with 0.2% gelatin (VWR).
Wherein, the formula of the complete cell culture medium is as follows: 15% fetal bovine serum (Gibco) +83% DMEM Medium
(Gibco) +1%P/S (Gibco penillilin-Streptomyces) +1% HEPES (Solambio), 15%, 83%, 1% in volume percentage.
2.2.5 in a constant temperature incubator at 37 ℃,5% CO2 (vol.%), 5% O2 (vol.%);
2.2.6 cells were cultured until they grew to about 60% of the bottom of the dish, 0.25% (Gibco) trypsin was used to digest the cells, complete medium was then added to stop digestion, the cell suspension was transferred to a 15mL centrifuge tube, 400g was centrifuged for 4min, the supernatant was discarded, and cell pellet was obtained for further cell transfection experiments
2.3 plasmid proportioning optimization
2.3.1 Co-transfection grouping
A first group: plasmid pKG-U6gRNA (RAG 1-gRNA 4) and plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: 0.44 μ g plasmid pKG-U6gRNA (RAG 1-gRNA 4): 1.56. Mu.g of plasmid pKG-GE3. Namely, the molar ratio of the plasmid pKG-U6gRNA (RAG 1-gRNA 4) to the plasmid pKG-GE3 is 1:1.
second group: plasmid pKG-U6gRNA (RAG 1-gRNA 4) and plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: 0.72 μ g plasmid pKG-U6gRNA (RAG 1-gRNA 4): 1.28. Mu.g of plasmid pKG-GE3. Namely, the molar ratio of the plasmid pKG-U6gRNA (RAG 1-gRNA 4) to the plasmid pKG-GE3 is 2:1.
third group: the plasmid pKG-U6gRNA (RAG 1-gRNA 4) and the plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: 0.92 μ g plasmid pKG-U6gRNA (RAG 1-gRNA 4): 1.08. Mu.g of plasmid pKG-GE3. Namely, the molar ratio of the plasmid pKG-U6gRNA (RAG 1-gRNA 4) to the plasmid pKG-GE3 is 3:1.
and a fourth group: plasmid pKG-U6gRNA (RAG 1-gRNA 4) was transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: mu.g of plasmid pKG-U6gRNA (RAG 1-gRNA 4).
2.3.2 Co-transfection procedure
Transfection experiments were performed using a mammalian fibroblast cell nuclear transfection kit (Neon kit) with a Neon TM transfection system electrotransfer.
1) Preparing an electrotransformation DNA solution according to the above groups, and deliberately not generating bubbles in the process of uniformly mixing;
2) Washing the cell precipitate prepared in the 2.2.6 step by using 1ml of PBS phosphate buffer (Solarbio), transferring the cell precipitate into a 1.5ml centrifuge tube, centrifuging the cell precipitate for 6min at 600g, discarding the supernatant, and resuspending the cells by using 11 mu L of electric transfer basic solution Opti-MEM, wherein bubbles are prevented from being generated in the process of resuspension;
3) Sucking 10 mu L of cell suspension, adding the cell suspension into the electrotransformation DNA solution obtained in the step 1), and uniformly mixing, wherein no bubbles are generated in the uniformly mixing process;
4) Placing an electric rotating cup with the reagent cassette in a cup groove of a Neon (TM) transformation system electric rotating instrument, and adding 3mL of Buffer E;
5) Sucking 10 μ L of the mixture obtained in step 3) with an electric rotary gun, inserting into a click cup, selecting an electric rotary program (1450V 10ms 3pulse), transferring the electric rotary gun mixture into 6-well plates in a clean bench immediately after electric shock transfection, wherein each well contains 3mL of complete culture solution (15% fetal bovine serum (Gibco) +83 DMEM medium (Gibco) +1%P/S (Gibco Penicillin-Streptomycin) +1 HEPES (Solarbio));
6) Mixing, and culturing in constant temperature incubator at 37 deg.C, 5% CO2, 5% O2;
7) After 12-18h of electrotransformation, the solution was changed, and 36-48h were digested with 0.25% (Gibco) trypsin and the cells were collected in a 1.5mL centrifuge tube.
2.3.3 Gene editing efficiency analysis
Extracting the cellular genomic DNA collected in 2.3.2, performing PCR amplification by using a primer pair consisting of RAG1-nF126 and RAG1-nR525, and sequencing the product. The sequencing result utilizes a webpage version synthgo ICE tool to analyze the sequencing peak map to obtain that the editing efficiency of the first group, the second group and the third group is 9%, 53% and 66% in sequence, and an exemplary peak map of the sequencing result is shown in figure 8. Analyzing and determining that the gene editing efficiency of the third group is highest, namely determining that the optimal dosage of the gRNA plasmid and the Cas9 plasmid is 3:1, the actual amount of plasmid is 0.92. Mu.g: 1.08. Mu.g.
2.4 comparison of the Effect of plasmid pX330 and plasmid pKG-GE3
2.4.1 Co-transfection grouping
RAG1-330 group: plasmid pKG-U6gRNA (RAG 1-gRNA 4) and plasmid pX330 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: 0.92 μ g plasmid pKG-U6gRNA (RAG 1-gRNA 4): mu.g of plasmid pX330, wherein the molar ratio of pKG-U6gRNA (RAG 1-gRNA 4) to pX330 is 3:1.
Group RAG 1-KG: plasmid pKG-U6gRNA (RAG 1-gRNA 4) and plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: 0.92 μ g plasmid pKG-U6gRNA (RAG 1-gRNA 4): 1.08. Mu.g of plasmid pKG-GE3, wherein the molar ratio of pKG-U6gRNA (RAG 1-gRNA 4) to pKG-GE3 is 3:1.
RAG1-B group: plasmid pKG-U6gRNA (RAG 1-gRNA 4) was transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (RAG 1-gRNA 4).
2.4.2 Co-transfection procedure
As in this example 2.3.2.
2.4.3 Gene editing efficiency analysis
Extracting the cellular genomic DNA collected in 2.4.2, performing PCR amplification by using a primer pair consisting of RAG1-nF126 and RAG1-nR525, and sequencing the product. The sequencing result utilizes a webpage version synthgo ICE tool to analyze a sequencing peak map to obtain that the editing efficiency of a RAG1-330 group and a RAG1-KG group is respectively 28% and 68%, an exemplary peak map of the sequencing result is shown in figure 9, and the result shows that compared with the plasmid pX330, the gene editing efficiency is obviously improved by adopting the plasmid pKG-GE3.
Example 3 screening of efficient MSTN Gene gRNA target
Pig MSTN gene information: encoding a myostatin protein; is located on pig chromosome 15; geneID is 399534, sus scrofa. The protein encoded by the pig MSTN gene is shown as GENBANK ACCESSION NO. NP-999600.2 (linear CON 12-JAN-2018), and the amino acid sequence is shown as SEQ ID NO. 13. In the genome DNA, the pig MSTN gene has 3 exons, wherein the 1 st exon and the downstream 200bp sequences thereof are shown as SEQ ID NO. 14.
3.1 Conservation analysis of MSTN gene knockout preset target and adjacent genome sequence
18 newborn Jiangxiang pigs, 10 females (named 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, respectively) and 8 males (named A, B, C, D, E, F, G, H, respectively).
The genomic DNA of 18 pigs was used as templates, PCR amplification was performed using primer pairs (the target sequence of the primer pair includes exon 1 of the pig MSTN gene), and electrophoresis was performed. And recovering PCR amplification products, sequencing, and comparing and analyzing the sequencing result with the MSTN gene sequence in the public database. Based on the alignment, primers for detecting mutations were designed (the primers themselves avoid potential mutation sites). Primers designed to detect mutations were: MSTN-JDF102/MSTN-JDR429. The electrophoretogram of 18 porcine genomic DNAs after PCR amplification using the primer set MSTN-JDF102/MSTN-JDR429 is shown in FIG. 10.
MSTN-JDF102:5’-AAAAGAGGGGCTGTGTAATGC-3’;
MSTN-JDR429:5’-AAACACTGGAACAACAGTCAGC-3’。
3.2 target gRNA design and construction
And primarily screening a plurality of targets by screening NGG (avoiding possible mutation sites), and further screening 4 targets from the NGG through a preliminary experiment.
The 4 targets were as follows:
sgRNA MSTN-E1-g1 and (3) target point: 5'-TGATGATTATCACGCTACGA-3';
sgRNA MSTN-E1-g2 and (3) target spot: 5'-TGATCAATCAGTTCCCGGAG-3' (SEQ ID NO. 15);
sgRNA MSTN-E1-g3 and (3) target point: 5'-GTGATAATCATCATCTTCCA-3';
sgRNA MSTN-E1-g4 and (3) target point: 5'-TTTCCAGGCGAAGTTTACTG-3'.
The synthetic MSTN gene has 4 targets of the complementary DNA Oligo with the insert sequence as follows:
MSTN-E1-gRNA1-S:5’-caccgTGATGATTATCACGCTACGA-3’;
MSTN-E1-gRNA1-A:5’-aaacTCGTAGCGTGATAATCATCAc-3’;
MSTN-E1-gRNA2-S:5’-caccgTGATCAATCAGTTCCCGGAG-3’(SEQ ID NO.16);
MSTN-E1-gRNA2-A:5’-aaacCTCCGGGAACTGATTGATCAc-3’(SEQ ID NO.17);
MSTN-E1-gRNA3-S:5’-caccGTGATAATCATCATCTTCCA-3’;
MSTN-E1-gRNA3-A:5’-aaacTGGAAGATGATGATTATCAC-3’;
MSTN-E1-gRNA4-S:5’-caccgTTTCCAGGCGAAGTTTACTG-3’;
MSTN-E1-gRNA4-A:5’-aaacCAGTAAACTTCGCCTGGAAAc-3’。
MSTN-E1-gRNA1-S, MSTN-E1-gRNA1-A, MSTN-E1-gRNA2-S, MSTN-E1-gRNA2-A, MSTN-E1-gRNA3-S, MSTN-E1-gRNA3-A, MSTN-E1-gRNA4-S, MSTN-E1-gRNA4-A are single-stranded DNA molecules.
3.3 preparation of gRNA recombinant plasmids
The plasmid pKG-U6gRNA was digested with the restriction enzyme BbsI, and the vector backbone (approximately 3kb linear large fragment) was recovered.
Respectively synthesizing MSTN-E1-gRNA1-S and MSTN-E1-gRNA1-A, then mixed and annealed to obtain double-stranded DNA molecules with cohesive ends. The double-stranded DNA molecule having a cohesive end was ligated to a vector backbone to obtain a plasmid pKG-U6gRNA (MSTN-E1-g 1). sgRNA is expressed by plasmid pKG-U6gRNA (MSTN-E1-g 1) MSTN-E1-g1 ,sgRNA MSTN-E1-g1 The sequence is shown as follows:
UGAUGAUUAUCACGCUACGAguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
MSTN-E1-gRNA2-S and MSTN-E1-gRNA2-A are synthesized respectively, and then mixed and annealed to obtain double-stranded DNA molecules with sticky ends. The double-stranded DNA molecule with the cohesive end is connected with a vector framework to obtain a plasmid pKG-U6gRNA (MSTN-E1-g 2). Plasmid pKG-U6gRNA (MSTN-E1-g 2) expresses sgRNA shown in SEQ ID No.18 MSTN-E1-g2
UGAUCAAUCAGUUCCCGGAGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu(SEQ ID NO.18)。
MSTN-E1-gRNA3-S and MSTN-E1-gRNA3-A are respectively synthesized, and then are mixed and annealed to obtain double-stranded DNA molecules with sticky ends. The double-stranded DNA molecule with the cohesive end is connected with a vector framework to obtain a plasmid pKG-U6gRNA (MSTN-E1-g 3). Plasmid pKG-U6gRNA (MSTN-E1-g 3) expresses sgRNA MSTN-E1-g3 ,sgRNA MSTN-E1-g3 The sequence is shown as follows:
GUGAUAAUCAUCAUCUUCCAguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
MSTN-E1-gRNA4-S and MSTN-E1-gRNA4-A are respectively synthesized, and then are mixed and annealed to obtain double-stranded DNA molecules with sticky ends. The double-stranded DNA molecule with the cohesive end is connected with a vector framework to obtain a plasmid pKG-U6gRNA (MSTN-E1-g 4). Plasmid pKG-U6gRNA (MSTN-E1-g 4) expresses sgRNA MSTN-E1-g4 ,sgRNA MSTN-E1-g4 The sequence is shown below:
UUUCCAGGCGAAGUUUACUGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
3.4 Comparison of editing efficiency of gRNAs of different target spots of MSTN gene
Porcine primary fibroblasts were prepared from ear tissue of newborn Jiangxiang pigs (female, blood group AO).
1. Cotransfection
A first group: the plasmid pKG-U6gRNA (MSTN-E1-g 1) and the plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (MSTN-E1-g 1): 1.08. Mu.g of plasmid pKG-GE3. Wherein the molar ratio of pKG-U6gRNA (MSTN-E1-g 1) to pKG-GE3 is 3:1.
Second group: the plasmid pKG-U6gRNA (MSTN-E1-g 2) and the plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (MSTN-E1-g 2): 1.08. Mu.g of plasmid pKG-GE3. Wherein the molar ratio of pKG-U6gRNA (MSTN-E1-g 1) to pKG-GE3 is 3:1.
Third group: the plasmid pKG-U6gRNA (MSTN-E1-g 3) and the plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (MSTN-E1-g 3): 1.08. Mu.g of plasmid pKG-GE3. Wherein the molar ratio of pKG-U6gRNA (MSTN-E1-g 1) to pKG-GE3 is 3:1.
And a fourth group: the plasmid pKG-U6gRNA (MSTN-E1-g 4) and the plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (MSTN-E1-g 4): 1.08. Mu.g of plasmid pKG-GE3. Wherein the molar ratio of pKG-U6gRNA (MSTN-E1-g 1) to pKG-GE3 is 3:1.
A fifth group: the pig primary fibroblast is subjected to electrotransfection operation without adding plasmid under the same electrotransformation parameters.
Co-transfection was performed by electroporation using a mammalian nuclear transfection kit (Neon kit, thermofeisher) and a Neon TM transfection system electrotransformation apparatus (parameters set at 1450V, 10ms, 3 pulses).
2. After step 1, the culture is carried out for 16 to 18 hours by using the complete culture solution, and then the culture is carried out by replacing the complete culture solution with a new one. The total time of incubation was 48 hours.
3. After completion of step 2, cells were digested and collected with trypsin, then lysed and genomic DNA extracted using MSTN-JDF102 and MSThe primer pair consisting of TN-JDR429 was subjected to PCR amplification and then to 1% agarose gel electrophoresis. The target fragment was recovered and sequenced, and the peak pattern of the sequencing is shown in FIG. 11. Analyzing the sequencing peak map by using a syntheo ICE tool to obtain the gene editing efficiency of different targets. The gene editing efficiencies of the first group to the fourth group were 3%, 32%, 15%, and 19% in this order. No gene editing occurred in the fifth group. The result shows that the editing efficiency of the second group is highest, and sgRNA MSTN-E1-g2 The target point of (2) is the optimal target point.
Example 4 screening of efficient SST Gene gRNA target
Porcine SST gene information: encoding a somastatin protein; is located on pig chromosome 13; geneID is 39386, sus scrofa. The protein coded by the pig SST gene is shown in GENBANK ACCESSION NO. NP _001009583.1 (linear CON 12-JAN-2018), and the amino acid sequence is shown in SEQ ID NO. 19. In the genome DNA, the porcine SST gene has 2 exons, wherein the 1 st exon and the 400bp sequences of the upstream exon and the downstream exon are shown as SEQ ID NO. 20.
4.1 Conservation analysis of preset target point of SST gene knockout and adjacent genome sequence
18 newborn Jiangxiang pigs, 10 females (named 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, respectively) and 8 males (named A, B, C, D, E, F, G, H, respectively).
The genomic DNA of 18 pigs were used as templates, respectively, and PCR amplification was performed using primer pairs (the target sequence of the primer pair includes exon 1 of the porcine SST gene), followed by electrophoresis. And recovering PCR amplification products, sequencing, and comparing and analyzing the sequencing result with an SST gene sequence in a public database. Based on the alignment, primers for detecting mutations were designed (the primers themselves avoid potential mutation sites). Primers designed to detect mutations were: SST-JDF290/SST-JDR689. The electrophoretogram of 18 porcine genomic DNAs after PCR amplification using the primer set SST-JDF290/SST-JDR689 is shown in FIG. 12.
SST-JDF290:5’-CACGAGGGTAATGGTGCGTA-3’;
SST-JDR689:5’-GGTTAGGGGATTCGCGAGAG-3’。
4.2 target gRNA design and construction
And primarily screening a plurality of targets by screening NGG (avoiding possible mutation sites), and further screening 4 targets from the NGG through a preliminary experiment.
The 4 targets were as follows:
sgRNA SST-E1-g1 and (3) target point: 5'-TCCATCGTCCTGGCTCTGGG-3' (SEQ ID NO. 21);
the insert complementary DNA Oligo of the synthesized SST gene with 4 targets is as follows:
SST-E1-gRNA1-S:5’-caccgTCCATCGTCCTGGCTCTGGG-3’(SEQ ID NO.22);
SST-E1-gRNA1-A:5’-aaacCCCAGAGCCAGGACGATGGAc-3’(SEQ ID NO.23);
SST-E1-gRNA2-S:5’-caccGGGACTTCTGCAGAAACTGA-3’;
SST-E1-gRNA2-A:5’-aaacTCAGTTTCTGCAGAAGTCCC-3’;
SST-E1-gRNA3-S:5’-caccgTGACGGAGTCGGGGATCCGA-3’;
SST-E1-gRNA3-A:5’-aaacTCGGATCCCCGACTCCGTCAc-3’;
SST-E1-gRNA4-S:5’-caccgTGCAGAAACTGACGGAGTCG-3’;
SST-E1-gRNA4-A:5’-aaacCGACTCCGTCAGTTTCTGCAc-3’。
SST-E1-gRNA1-S, SST-E1-gRNA1-A, SST-E1-gRNA2-S, SST-E1-gRNA2-A, SST-E1-gRNA3-S, SST-E1-gRNA3-A, SST-E1-gRNA4-S, SST-E1-gRNA4-A are single-stranded DNA molecules.
4.3 preparation of gRNA recombinant plasmids
The plasmid pKG-U6gRNA was digested with the restriction enzyme BbsI, and the vector backbone (approximately 3kb linear large fragment) was recovered.
SST-E1-gRNA1-S and SST-E1-gRNA1-A are synthesized respectively, and then mixed and annealed to obtain double-stranded DNA molecules with sticky ends. The double-stranded DNA molecule with the cohesive end is connected with a vector framework to obtain a plasmid pKG-U6gRNA (SST-E1-g 1). Plasmid pKG-U6gRNA (SST-E1-g 1) expresses sgRNA shown in SEQ ID No.24 SST-E1-g1
UCCAUCGUCCUGGCUCUGGGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu(SEQ ID NO.24)。
SST-E1-gRNA2-S and SST-E1-gRNA2-A are synthesized respectively, and then mixed and annealed to obtain double-stranded DNA molecules with sticky ends. The double-stranded DNA molecule with the cohesive end is connected with a vector framework to obtain a plasmid pKG-U6gRNA (SST-E1-g 2). Plasmid pKG-U6gRNA (SST-E1-g 2) expresses sgRNA SST-E1-g2 ,sgRNA SST-E1-g2 The sequence is shown below:
GGGACUUCUGCAGAAACUGAguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
SST-E1-gRNA3-S and SST-E1-gRNA3-A are synthesized respectively, and then mixed and annealed to obtain double-stranded DNA molecules with sticky ends. The double-stranded DNA molecule with cohesive ends is connected with a vector framework to obtain a plasmid pKG-U6gRNA (SST-E1-g 3). Plasmid pKG-U6gRNA (SST-E1-g 3) expresses sgRNA SST-E1-g3 ,sgRNA SST-E1-g3 The sequence is shown as follows:
UGACGGAGUCGGGGAUCCGAguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
SST-E1-gRNA4-S and SST-E1-gRNA4-A are synthesized respectively, and then mixed and annealed to obtain double-stranded DNA molecules with sticky ends. The double-stranded DNA molecule with cohesive ends is connected with a vector framework to obtain a plasmid pKG-U6gRNA (SST-E1-g 4). Plasmid pKG-U6gRNA (SST-E1-g 4) expresses sgRNA SST-E1-g4 ,sgRNA SST-E1-g4 The sequence is shown below:
UGCAGAAACUGACGGAGUCGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
4.4 Comparison of editing efficiency of gRNAs of different target points of SST gene
Porcine primary fibroblasts were prepared from ear tissue of newborn Jiangxiang pigs (female, blood group AO).
1. Cotransfection
A first group: the plasmid pKG-U6gRNA (SST-E1-g 1) and the plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (SST-E1-g 1): 1.08. Mu.g of plasmid pKG-GE3. Wherein the molar ratio of pKG-U6gRNA (SST-E1-g 1) to pKG-GE3 is 3:1.
Second group: the plasmid pKG-U6gRNA (SST-E1-g 2) and the plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (SST-E1-g 2): 1.08. Mu.g of plasmid pKG-GE3. Wherein the molar ratio of pKG-U6gRNA (SST-E1-g 1) to pKG-GE3 is 3:1.
Third group: the plasmid pKG-U6gRNA (SST-E1-g 3) and the plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (SST-E1-g 3): 1.08. Mu.g of plasmid pKG-GE3. Wherein the molar ratio of pKG-U6gRNA (SST-E1-g 1) to pKG-GE3 is 3:1.
And a fourth group: the plasmid pKG-U6gRNA (SST-E1-g 4) and the plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (SST-E1-g 4): 1.08. Mu.g of plasmid pKG-GE3. Wherein the molar ratio of pKG-U6gRNA (SST-E1-g 1) to pKG-GE3 is 3:1.
And a fifth group: the pig primary fibroblast is subjected to electrotransfection operation without adding plasmid under the same electrotransformation parameters.
Co-transfection was performed by electroporation using a mammalian nuclear transfection kit (Neon kit, thermofeisher) and a Neon TM transfection system electrotransformation apparatus (parameters set at 1450V, 10ms, 3 pulses).
2. After step 1, the culture is carried out for 16 to 18 hours by using the complete culture solution, and then the culture is carried out by replacing the complete culture solution with a new one. The total time of incubation was 48 hours.
3. After completion of step 2, cells were digested and collected with trypsin, then lysed and genomic DNA extracted, PCR amplified with a primer pair consisting of SST-JDF290 and SST-JDR689, and then subjected to 1% agarose gel electrophoresis. The target fragment was recovered and sequenced, and the peak pattern of the sequencing is shown in FIG. 13. Analyzing the sequencing peak map by using a syntheo ICE tool to obtain the gene editing efficiency of different targets. The gene editing efficiencies of the first group to the fourth group were 47%, 26%, 22%, and 11% in this order. No gene editing occurred in the fifth group. The result shows that the editing efficiency of the first group is highest, and the sgRNA SST-E1-g1 The target point of (2) is the optimal target point.
Example 5 screening of efficient CD163 Gene gRNA target
Porcine CD163 gene information: encoding a CD163 molecule protein; is located on pig chromosome 5; geneID is 3976, sus scrofa. The protein encoded by the pig CD163 gene is shown in GENBANK ACCESSION NO. XP _020946779.1 (linear CON 12-JAN-2018), and the amino acid sequence is shown in SEQ ID NO. 25. In the genome DNA, the porcine CD163 gene has 18 exons, wherein the 7 th exon and the 400bp sequences of the exon and the upstream and the downstream thereof are shown as SEQ ID NO. 26.
5.1 Conservation analysis of CD163 gene knockout preset target and adjacent genome sequence
18 newborn Jiangxiang pigs, 10 females (named 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, respectively) and 8 males (named A, B, C, D, E, F, G, H, respectively).
The genomic DNA of 18 pigs were used as templates, respectively, and PCR amplification was performed using a primer pair (the target sequence of the primer pair includes exon 7 of the porcine CD163 gene), followed by electrophoresis. And recovering PCR amplification products, sequencing, and comparing and analyzing the sequencing result with the CD163 gene sequence in a public database. Based on the alignment, primers for detecting mutations were designed (the primers themselves avoid potential mutation sites). Primers designed to detect mutations were: CD163-JDF121/CD163-JDR518. The electropherogram of 18 porcine genomic DNAs amplified by PCR using the primer set consisting of CD163-JDF121/CD163-JDR518 is shown in FIG. 14.
CD163-JDF121:5’-GAATCGGCTAAGCCCACTGTA-3’;
CD163-JDR518:5’-ACTGGGCAGAGTGAAAGGTG-3’。
5.2 target gRNA design and construction
And primarily screening a plurality of targets by screening NGG (avoiding possible mutation sites), and further screening 4 targets from the NGG through a preliminary experiment.
The 4 targets were as follows:
sgRNA CD163-E7-g1 and (3) target point: 5'-GGAACTACAGTGCGGCACTG-3';
sgRNA CD163-E7-g2 and (3) target spot: 5'-GGTCGTGTTGAAGTACAACA-3' (SEQ ID NO: S) ID NO.27);
sgRNA CD163-E7-g3 And (3) target point: 5'-GTACAACATGGAGACACGTG-3';
sgRNA CD163-E7-g4 and (3) target spot: 5'-ACTGTGGTTTCCCTCCTGGG-3'.
The synthetic CD163 gene has the following complementary DNA Oligo with the insert sequence of 4 targets:
CD163-E7-gRNA1-S:5’-caccGGAACTACAGTGCGGCACTG-3’;
CD163-E7-gRNA1-A:5’-aaacCAGTGCCGCACTGTAGTTCC-3’;
CD163-E7-gRNA2-S:5’-caccGGTCGTGTTGAAGTACAACA-3’(SEQ ID NO.28);
CD163-E7-gRNA2-A:5’-aaacTGTTGTACTTCAACACGACC-3’(SEQ ID NO.29);
CD163-E7-gRNA3-S:5’-caccGTACAACATGGAGACACGTG-3’;
CD163-E7-gRNA3-A:5’-aaacCACGTGTCTCCATGTTGTAC-3’;
CD163-E7-gRNA4-S:5’-caccgACTGTGGTTTCCCTCCTGGG-3’;
CD163-E7-gRNA4-A:5’-aaacCCCAGGAGGGAAACCACAGTc-3’。
CD163-E7-gRNA1-S, CD163-E7-gRNA1-A, CD163-E7-gRNA2-S, CD163-E7-gRNA2-A, CD163-E7-gRNA3-S, CD163-E7-gRNA3-A, CD163-E7-gRNA4-S, CD163-E7-gRNA4-A are all single-stranded DNA molecules.
5.3 preparation of gRNA recombinant plasmid
The plasmid pKG-U6gRNA was digested with the restriction enzyme BbsI, and the vector backbone (approximately 3kb linear large fragment) was recovered.
CD163-E7-gRNA1-S and CD163-E7-gRNA1-A were synthesized separately, and then mixed and annealed to obtain a double-stranded DNA molecule having a cohesive end. The double-stranded DNA molecule having a cohesive end was ligated to a vector backbone to obtain a plasmid pKG-U6gRNA (CD 163-E7-g 1). Plasmid pKG-U6gRNA (CD 163-E7-g 1) expresses sgRNA CD163-E7-g1 ,sgRNA CD163-E7-g1 The sequence is shown as follows:
GGAACUACAGUGCGGCACUGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
CD163-E7-gRNA2-S and CD163-E7-gRNA2-A were synthesized separately, and then mixed and annealed to obtain a double-stranded DNA molecule having a cohesive end. The double-stranded DNA molecule having a cohesive end was ligated to the vector backbone to obtain plasmid pKG-U6gRNA (CD 163-E7-g 2). Plasmid pKG-U6gRNA (CD 163-E7-g 2) expresses sgRNA shown in SEQ ID No.30 CD163-E7-g2
GGUCGUGUUGAAGUACAACAguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu(SEQ ID NO.30)。
CD163-E7-gRNA3-S and CD163-E7-gRNA3-A were synthesized separately, and then mixed and annealed to obtain a double-stranded DNA molecule having a cohesive end. The double-stranded DNA molecule having a cohesive end was ligated to the vector backbone to obtain plasmid pKG-U6gRNA (CD 163-E7-g 3). Plasmid pKG-U6gRNA (CD 163-E7-g 3) expresses sgRNA CD163-E7-g3 ,sgRNA CD163-E7-g3 The sequence is shown as follows:
GUACAACAUGGAGACACGUGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
CD163-E7-gRNA4-S and CD163-E7-gRNA4-A were synthesized separately, and then mixed and annealed to obtain a double-stranded DNA molecule having a cohesive end. The double-stranded DNA molecule having a cohesive end was ligated to a vector backbone to obtain a plasmid pKG-U6gRNA (CD 163-E7-g 4). Plasmid pKG-U6gRNA (CD 163-E7-g 4) expresses sgRNA CD163-E7-g4 ,sgRNA CD163-E7-g4 The sequence is shown as follows:
ACUGUGGUUUCCCUCCUGGGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
5.4 Comparison of editing efficiency of gRNA of different target points of CD163 gene
Porcine primary fibroblasts were prepared from ear tissue of a newborn Jiangxiang pig (female, blood group AO).
1. Cotransfection
A first group: the plasmid pKG-U6gRNA (CD 163-E7-g 1) and the plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (CD 163-E7-g 1): 1.08. Mu.g of plasmid pKG-GE3. Wherein the molar ratio of pKG-U6gRNA (CD 163-E7-g 1) to pKG-GE3 is 3:1.
Second group: the plasmid pKG-U6gRNA (CD 163-E7-g 2) and the plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (CD 163-E7-g 2): 1.08. Mu.g of plasmid pKG-GE3. Wherein the molar ratio of pKG-U6gRNA (CD 163-E7-g 2) to pKG-GE3 is 3:1.
Third group: the plasmid pKG-U6gRNA (CD 163-E7-g 3) and the plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (CD 163-E7-g 3): 1.08. Mu.g of plasmid pKG-GE3. Wherein the molar ratio of pKG-U6gRNA (CD 163-E7-g 3) to pKG-GE3 is 3:1.
And a fourth group: the plasmid pKG-U6gRNA (CD 163-E7-g 4) and the plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (CD 163-E7-g 4): 1.08. Mu.g of plasmid pKG-GE3. Wherein the molar ratio of pKG-U6gRNA (CD 163-E7-g 4) to pKG-GE3 is 3:1.
And a fifth group: the pig primary fibroblast is subjected to electrotransfection operation without adding plasmid under the same electrotransformation parameters.
Co-transfection was performed by electroporation using a mammalian nuclear transfection kit (Neon kit, thermofeisher) and a Neon TM transfection system electrotransformation apparatus (parameters set at 1450V, 10ms, 3 pulses).
2. After step 1, the culture is carried out for 16 to 18 hours by using the complete culture solution, and then the culture is carried out by replacing the complete culture solution with a new one. The total time of incubation was 48 hours.
3. After completion of step 2, cells were digested with trypsin and collected, then lysed and genomic DNA extracted, amplified by PCR using a primer pair consisting of CD163-JDF121 and CD163-JDR518, and then subjected to 1% agarose gel electrophoresis. The target fragment was recovered and sequenced, and the peak pattern of the sequencing is shown in FIG. 15. Analyzing the sequencing peak map by using a syntheo ICE tool to obtain the gene editing efficiency of different targets. The gene editing efficiency of the first group to the fourth group was 16%, 34%, 10% in this order. No gene editing occurred in the fifth group. The result shows that the editing efficiency of the second group is highest, and sgRNA CD163-E7-g2 Target point of (2) is the optimal targetAnd (4) point.
Example 6 screening of efficient pAPN Gene gRNA targets
Pig APN gene information: coding alanyl aminopeptidase, membrane protein; is located on pig chromosome 7; geneID 393978, sus scrofa. The protein encoded by the pig CD163 gene is shown in GENBANK ACCESSION NO. NP _999442.1 (linear CON 12-JAN-2018), and the amino acid sequence is shown in SEQ ID NO. 31. In the genome DNA, the pig APN gene has 2 exons, wherein the 2 nd exon and 100bp sequences of the upstream exon and the downstream exon are shown as SEQ ID NO. 32.
6.1 pAPN gene knockout preset target and adjacent genome sequence conservation analysis
18 newborn Jiangxiang pigs, 10 females (named 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, respectively) and 8 males (named A, B, C, D, E, F, G, H, respectively).
The genomic DNA of 18 pigs is taken as a template, PCR amplification is carried out by using a primer pair (the target sequence of the primer pair comprises the 2 nd exon of the pig APN gene), and then electrophoresis is carried out. And recovering PCR amplification products, sequencing, and comparing and analyzing the sequencing result with the pAPN gene sequence in the public database. Based on the alignment, primers for detecting mutations were designed (the primers themselves avoid potential mutation sites). Primers designed to detect mutations were: APN-JDF94/APN-JDR656. The electropherogram of 18 pigs after PCR amplification of genomic DNA using the primer set APN-JDF94/APN-JDR656 is shown in FIG. 16.
APN-JDF94:5’-GAACCGGAGCAGTGTCTCTA-3’;
APN-JDR656:5’-CCCCTGGGTGGTGTAGTTGA-3’。
6.2 target gRNA design and construction
And (3) primarily screening a plurality of targets by screening NGG (avoiding possible mutation sites), and further screening 4 targets from the NGG through a preliminary experiment.
The 4 targets were as follows:
sgRNA APN-E2-g1 and (3) target point: 5'-GAGGATGCCCAGGATGCCCA-3' (SEQ ID No. 33);
sgRNA APN-E2-g2 and (3) target point: 5'-CCTGGGCATCCTCCTCGGCG-3';
sgRNA APN-E2-g3 And (3) target point: 5'-CACAGACAGAGCGATGATGG-3';
sgRNA APN-E2-g4 and (3) target point: 5'-GCTCTGGTCCAAGGTGATGG-3'.
The synthetic APN gene has the following complementary DNA Oligo with the insert sequences of 4 targets:
APN-E2-gRNA1-S:5’-caccGAGGATGCCCAGGATGCCCA-3’(SEQ ID NO.34);
APN-E2-gRNA1-A:5’-aaacTGGGCATCCTGGGCATCCTC-3’(SEQ ID NO.35);
APN-E2-gRNA2-S:5’-caccgCCTGGGCATCCTCCTCGGCG-3’;
APN-E2-gRNA2-A:5’-aaacCGCCGAGGAGGATGCCCAGGc-3’;
APN-E2-gRNA3-S:5’-caccgCACAGACAGAGCGATGATGG-3’;
APN-E2-gRNA3-A:5’-aaacCCATCATCGCTCTGTCTGTGc-3’;
APN-E2-gRNA4-S:5’-caccGCTCTGGTCCAAGGTGATGG-3’;
APN-E2-gRNA4-A:5’-aaacCCATCACCTTGGACCAGAGC-3’。
APN-E2-gRNA1-S, APN-E2-gRNA1-A, APN-E2-gRNA2-S, APN-E2-gRNA2-A, APN-E2-gRNA3-S, APN-E2-gRNA3-A, APN-E2-gRNA4-S, APN-E2-gRNA4-A are single-stranded DNA molecules.
6.3 preparation of gRNA recombinant plasmid
The plasmid pKG-U6gRNA was digested with the restriction enzyme BbsI, and the vector backbone (approximately 3kb linear large fragment) was recovered.
APN-E2-gRNA1-S and APN-E2-gRNA1-A are synthesized respectively, and then mixed and annealed to obtain double-stranded DNA molecules with sticky ends. The double-stranded DNA molecule having a cohesive end was ligated to a vector backbone to obtain a plasmid pKG-U6gRNA (APN-E2-g 1). Plasmid pKG-U6gRNA (APN-E2-g 1) expresses sgRNA shown in SEQ ID No.36 APN-E2-g1
GAGGAUGCCCAGGAUGCCCAguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu(SEQ ID NO.36)
Separately synthesizing APN-E2-gRNA2-S and APN-E2-gRNA2-A, and mixingAnnealing is performed to obtain a double-stranded DNA molecule having a cohesive end. A double-stranded DNA molecule having a cohesive end was ligated to a vector backbone to obtain a plasmid pKG-U6gRNA (APN-E2-g 2). Plasmid pKG-U6gRNA (APN-E2-g 2) expresses sgRNA APN-E2-g2 ,sgRNA APN-E2-g2 The sequence is shown as follows:
CCUGGGCAUCCUCCUCGGCGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
respectively synthesizing APN-E2-gRNA3-S and APN-E2-gRNA3-A, then mixing and annealing to obtain the double-stranded DNA molecule with a sticky end. The double-stranded DNA molecule having a cohesive end was ligated to a vector backbone to obtain a plasmid pKG-U6gRNA (APN-E2-g 3). Plasmid pKG-U6gRNA (APN-E2-g 3) expresses sgRNA APN-E2-g3 ,sgRNA APN-E2-g3 The sequence is shown as follows:
CACAGACAGAGCGAUGAUGGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
APN-E2-gRNA4-S and APN-E2-gRNA4-A are synthesized respectively, and then mixed and annealed to obtain double-stranded DNA molecules with sticky ends. The double-stranded DNA molecule having a cohesive end was ligated to a vector backbone to obtain a plasmid pKG-U6gRNA (APN-E2-g 4). Plasmid pKG-U6gRNA (APN-E2-g 4) expresses sgRNA APN-E2-g4 ,sgRNA APN-E2-g4 The sequence is shown as follows:
GCUCUGGUCCAAGGUGAUGGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
6.4 Comparison of editing efficiency of different target points gRNA of pAPN gene
Porcine primary fibroblasts were prepared from ear tissue of newborn Jiangxiang pigs (female, blood group AO).
1. Cotransfection
A first group: the plasmid pKG-U6gRNA (APN-E2-g 1) and the plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 ten thousand porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (APN-E2-g 1): 1.08. Mu.g of plasmid pKG-GE3. Wherein the molar ratio of pKG-U6gRNA (APN-E2-g 1) to pKG-GE3 is 3:1.
Second group: the plasmid pKG-U6gRNA (APN-E2-g 2) and the plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (APN-E2-g 2): 1.08. Mu.g of plasmid pKG-GE3. Wherein the molar ratio of pKG-U6gRNA (APN-E2-g 2) to pKG-GE3 is 3:1.
Third group: the plasmid pKG-U6gRNA (APN-E2-g 3) and the plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (APN-E2-g 3): 1.08. Mu.g of plasmid pKG-GE3. Wherein the molar ratio of pKG-U6gRNA (APN-E2-g 3) to pKG-GE3 is 3:1.
And a fourth group: the plasmid pKG-U6gRNA (APN-E2-g 4) and the plasmid pKG-GE3 were co-transfected into porcine primary fibroblasts. Proportioning: about 20 million porcine primary fibroblasts: 0.92. Mu.g of plasmid pKG-U6gRNA (APN-E2-g 4): 1.08. Mu.g of plasmid pKG-GE3. Wherein the molar ratio of pKG-U6gRNA (APN-E2-g 4) to pKG-GE3 is 3:1.
And a fifth group: the pig primary fibroblast is subjected to electrotransfection operation without adding plasmid under the same electrotransformation parameters.
Co-transfection was performed by electroporation using a mammalian nuclear transfection kit (Neon kit, thermofeisher) and a Neon TM transfection system electrotransformation apparatus (parameters set at 1450V, 10ms, 3 pulses).
2. After step 1, the culture is carried out for 16 to 18 hours by using the complete culture solution, and then the culture is carried out by replacing the complete culture solution with a new one. The total time of incubation was 48 hours.
3. After completion of step 2, cells were digested with trypsin and collected, then lysed and genomic DNA extracted, PCR amplified using primer pairs consisting of APN-JDF94 and APN-JDR656, and then subjected to 1% agarose gel electrophoresis. The target fragment was recovered and sequenced, and the peak pattern of the sequencing is shown in FIG. 17. Analyzing the sequencing peak map by using a syntheo ICE tool to obtain the gene editing efficiency of different targets. The gene editing efficiency of the first group to the fourth group was 52%, 15%, 16%, 5% in this order. No gene editing occurred in the fifth group. The result shows that the editing efficiency of the first group is highest, and the sgRNA APN-E2-g1 The target point of (2) is the optimal target point.
Example 7 construction of a Single cell clone from Jiangxiang pig with combined knockout of four genes MSTN, SST, CD163 and pAPN
7.1 preparation of Primary pig fibroblasts
The same as 2.2 in example 2.
7.2 Co-transfection of porcine primary fibroblasts with the constructed pKG-U6gRNA (MSTN-E1-g 2) plasmid, pKG-U6gRNA (SST-E1-g 1) plasmid, pKG-U6gRNA (CD 163-E7-g 2) plasmid, pKG-U6gRNA (APN-E2-g 1) plasmid, pKG-GE3 plasmid
7.2.1 plasmids pKG-U6gRNA (MSTN-E1-g 2), pKG-U6gRNA (SST-E1-g 1), pKG-U6gRNA (CD 163-E7-g 2), pKG-U6gRNA (APN-E2-g 1), pKG-GE3 were transfected into porcine primary fibroblasts.
Proportioning: about 20 million porcine primary fibroblasts: 0.295. Mu.g of plasmid pKG-U6gRNA (MSTN-E1-g 2): 0.295. Mu.g of plasmid pKG-U6gRNA (SST-E1-g 1): 0.295. Mu.g of plasmid pKG-U6gRNA (CD 163-E7-g 2): 0.295. Mu.g of plasmid pKG-U6gRNA (APN-E2-g 1): 1.320. Mu.g of plasmid pKG-GE3, the molar ratio of the plasmids being: (3/4): (3/4): (3/4): (3/4): 1, i.e. total gRNA: pKG-GE3 is 3:1..
7.2.2 Co-transfection procedure
The cells were digested as in 2.3.2 of example 2, but without 0.25% (Gibco) trypsin and collected in a 1.5mL centrifuge tube.
7.3 Screening of MSTN, SST, CD163 and pAPN four-gene combined knockout single cell clone strain
7.3.1 the population cells obtained in step 7.2 after 48h electroporation were digested with trypsin, neutralized with complete medium, centrifuged at 500g for 5min, the supernatant removed, the pellet resuspended in 200. Mu.L complete medium and diluted appropriately, and the single cells picked up by a pipette were transferred to a 96-well plate containing 100. Mu.L complete medium per well, one cell per well;
7.3.2 37℃,5%CO 2 、5%O 2 the cell culture medium is changed every 2 to 3 days, the growth condition of cells in each hole is observed by a microscope during the cell culture medium changing process, and the hole without cells and non-single cell clone is removed;
7.3.3 when the wells of the 96-well plate were filled with cells, trypsinized and the cells were collected, 2/3 of the cells were plated into a 6-well plate containing complete medium, and the remaining 1/3 of the cells were collected in a 1.5mL centrifuge tube for subsequent genotyping;
7.3.4 cells were digested and harvested with 0.25% (Gibco) trypsin when the 6-well plates were up to 80% confluency, and frozen using cell freezing medium (90% complete medium +10% DMSO, vol.).
7.4 MSTN, SST, CD163 and pAPN four-gene combined knockout single cell clone identification
7.4.1 to the cells collected in step 7.3.3 in a 1.5mL centrifuge tube, 10. Mu.L of KAPA2G lysate was added to lyse the cells, resulting in a lysate of cells that released genomic DNA.
The KAPA2G lysate preparation system is as follows:
10X extract Buffer 1μL
Enzyme 0.2μL
ddH2O 8.8μL
preserving cell lysate at-20 ℃ after the reaction is finished at 75 ℃ for 15min to 95 ℃ for 5min to 4 ℃;
7.4.2 the target gene mutation condition of the single-cell clone is detected by respectively carrying out PCR amplification on MSTN, SST, CD163 and pAPN gene target region by using the cell lysate as a DNA template and adopting the primer pair (MSTN-JDF 102/MSTN-JDR 429) aiming at MSTN gene E1, the primer pair (SST-JDF 290/SST-JDR 689) aiming at SST gene E1, the primer pair (CD 163-JDF121/CD163-JDR 518) aiming at CD163 gene E7 and the primer pair (APN-JDF 94/APN-JDR 656) aiming at pAPN gene E2. The length of a target PCR product of the MSTN gene is 328bp, the length of a target PCR product of the SST gene is 400bp, the length of a target PCR product of the CD163 gene is 398bp, and the length of a target PCR product of the pAPN gene is 563bp;
7.4.3 amplifying MSTN, SST, CD163 and pAPN gene target region by using PCR conventional reaction;
7.4.4 the PCR reaction products were electrophoresed, and the electrophoresis results are shown in FIG. 18 (MSTN), FIG. 19 (SST), FIG. 20 (CD 163) and FIG. 21 (pAPN), respectively, and the lane numbers are the same as the single cell clone numbers. The PCR amplification product was recovered and sequenced.
7.4.5 compares the sequencing result with the MSTN gene, SST gene, CD163 gene and pAPN gene target point information, thereby judging whether the single cell clone is subjected to combined knockout of the MSTN, SST, CD163 and pAPN four genes.
7.4.5.1 for the MSTN gene, the genotypes of the single-cell clones numbered 2, 23, 46, 69, 78 are homozygous mutants with the same variation on both alleles; the genotypes of the single-cell clones numbered 16, 27 and 57 are homozygous mutants of different variation of double alleles; the genotypes of the single cell clones numbered 13, 29, 45, 73 are heterozygous mutants; the genotypes of the single cell clones with other numbers are wild types; the rate of resulting MSTN gene-edited single cell clones was 15%.
Exemplary sequencing alignments are shown in fig. 22-25, where fig. 22 is an alignment of clone No.1 with a wild-type reference sequence, and is judged wild-type; FIG. 23 shows the result of alignment of the sequencing result of clone No.29 with the wild-type reference sequence, which was judged as the heterozygous mutant; FIG. 24 shows the result of comparison of the sequencing result of clone No.2 with the wild-type reference sequence, and it was judged as a homozygous mutant having the same variation in biallelic genes; FIG. 25 shows the result of alignment of the sequencing result of clone No.27 with the wild-type reference sequence, and it was judged as a homozygous mutant type having a biallelic different variation.
The genotype of each single cell clone was shown in table 1 by analysis of specific sequences:
TABLE 1 identification of MSTN Gene knockout Single cell clone genotypes
Figure BDA0002947302830000161
Figure BDA0002947302830000171
Figure BDA0002947302830000181
7.4.5.2 for the SST gene, the genotypes of single cell clones numbered 7, 16, 23, 33, 54, 57, 62, 67, 69, 70 are homozygous mutants of biallelic identical variations; the genotypes of the single-cell clones numbered 14, 43, 48, 78, 79 are homozygous mutants of biallelic different variations; the genotypes of the single cell clones numbered 5, 24, 47, 58, 61, 74 are heterozygous mutants; the genotypes of the single cell clones with other numbers are wild types; the rate of single cell clones resulting from editing of the SST gene was 26%.
Exemplary sequencing alignments are shown in fig. 26-29, where fig. 26 is an alignment of clone No.1 with a wild-type reference sequence, and is judged wild-type; FIG. 27 shows the result of alignment of the sequencing result of clone No.5 with the wild-type reference sequence, and it was judged as the heterozygous mutant; FIG. 28 shows the result of alignment of the sequencing result of clone No.7 with the wild-type reference sequence, and it was judged as a homozygous mutant having the same variation in biallelic genes; FIG. 29 shows the results of alignment of the sequencing result of clone No.14 with the wild-type reference sequence, and it was judged as a homozygous mutant type having different biallelic variations.
Through the analysis of specific sequences, the clone genotypes of each single cell are shown in the table 2:
TABLE 2 identification of single cell clone genotypes by SST gene knockout
Figure BDA0002947302830000182
Figure BDA0002947302830000191
Figure BDA0002947302830000201
Figure BDA0002947302830000211
7.4.5.3 for the CD163 gene, the genotypes of the single cell clones numbered 11, 16, 22, 35, 57, 78 are homozygous mutants with the same variation on both alleles; the genotypes of the single-cell clones numbered 34, 66 and 69 are homozygous mutants of different variation of double alleles; the genotypes of the single cell clones numbered 23, 44, 60, 77 are heterozygous mutants; the genotypes of the other numbered single cell clones are wild types; the rate of single cell clones resulting from CD163 gene editing was 16%.
Exemplary sequencing alignments are shown in fig. 30-33, wherein fig. 30 is an alignment of the sequencing of clone No.1 with a wild-type reference sequence, and is judged wild-type; FIG. 31 shows the result of alignment of the sequencing result of clone No.23 with the wild-type reference sequence, and it was judged as the heterozygous mutant; FIG. 32 shows the result of alignment of the sequencing result of clone No.16 with the wild-type reference sequence, and it was judged as a homozygous mutant having the same variation in biallelic genes; FIG. 33 shows the results of alignment of the clone No. 69 with the wild-type reference sequence, and the results were judged as homozygous mutants having different biallelic variations.
Through analysis of specific sequences, the genotype of each single cell clone is shown in table 3:
TABLE 3 CD163 Gene knockout Single cell clone genotype identification
Figure BDA0002947302830000212
Figure BDA0002947302830000221
Figure BDA0002947302830000231
Figure BDA0002947302830000241
7.4.5.4 for the pAPN gene, the genotype of the single cell clones numbered 4, 7, 16, 20, 38, 46, 55, 57, 60, 69, 73, 79 are homozygous mutants with the same variation in both alleles; the genotypes of the single-cell clones numbered 33, 74 and 78 are homozygous mutants of different variation of biallelic genes; the genotypes of the single cell clones numbered 2, 10, 14, 26, 29, 40, 45, 61 are heterozygous mutants; the genotypes of the single cell clones with other numbers are wild types; the rate of resulting pAPN gene-edited single cell clones was 29%.
Exemplary sequencing alignments are shown in fig. 34-37, where fig. 34 is an alignment of clone No.1 with a wild-type reference sequence, and is judged wild-type; FIG. 35 shows the result of alignment of the sequencing result of clone No.29 with the wild-type reference sequence, which was judged as the heterozygous mutant; FIG. 36 shows the result of alignment of the sequencing result of clone No. 38 with the wild-type reference sequence, and it was judged as a homozygous mutant having the same variation in biallelic genes; FIG. 37 shows the results of alignment of clone number 74 with the wild-type reference sequence, and the results were judged as homozygous mutants of different biallelic variations.
Through the analysis of specific sequences, the genotype of each single-cell clone is shown in table 4:
TABLE 4 identification of pAPN gene knockout single cell clone genotype
Figure BDA0002947302830000242
Figure BDA0002947302830000251
Figure BDA0002947302830000261
7.4.6 through analysis, the single cell clones with the numbers of 16, 57, 69 and 78 are the single cell clones with homozygous knockout of the MSTN gene, the SST gene, the CD163 gene and the pAPN gene, and the homozygous knockout rate of the four genes is 5%.
The heterozygous mutation and the homozygous mutation single-cell clone can be used for cloning and producing the gene editing pig.
The present invention has been described in detail above. It will be apparent to those skilled in the art that the invention can be practiced in a wide range of equivalent parameters, concentrations, and conditions without departing from the spirit and scope of the invention and without undue experimentation. While the invention has been described with reference to specific examples, it will be appreciated that the invention may be further modified. In general, this application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the present disclosure as come within known or customary practice within the art to which the invention pertains. The use of some of the essential features is possible within the scope of the claims attached below.
Sequence listing
<110> Nanjing King Gene engineering Co., ltd
<120> method for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage, fast growth and resistance to blue-ear disease and serial diarrhea diseases and application thereof
<160> 36
<170> SIPOSequenceListing 1.0
<210> 1
<211> 8484
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 1
gagggcctat ttcccatgat tccttcatat ttgcatatac gatacaaggc tgttagagag 60
ataattggaa ttaatttgac tgtaaacaca aagatattag tacaaaatac gtgacgtaga 120
aagtaataat ttcttgggta gtttgcagtt ttaaaattat gttttaaaat ggactatcat 180
atgcttaccg taacttgaaa gtatttcgat ttcttggctt tatatatctt gtggaaagga 240
cgaaacaccg ggtcttcgag aagacctgtt ttagagctag aaatagcaag ttaaaataag 300
gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttg ttttagagct 360
agaaatagca agttaaaata aggctagtcc gtttttagcg cgtgcgccaa ttctgcagac 420
aaatggctct agaggtaccc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 480
ccaacgaccc ccgcccattg acgtcaatag taacgccaat agggactttc cattgacgtc 540
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 600
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tgtgcccagt 660
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 720
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 780
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 840
ggggggggcg gggcgagggg cggggcgggg cgaggcggag aggtgcggcg gcagccaatc 900
agagcggcgc gctccgaaag tttcctttta tggcgaggcg gcggcggcgg cggccctata 960
aaaagcgaag cgcgcggcgg gcgggagtcg ctgcgcgctg ccttcgcccc gtgccccgct 1020
ccgccgccgc ctcgcgccgc ccgccccggc tctgactgac cgcgttactc ccacaggtga 1080
gcgggcggga cggcccttct cctccgggct gtaattagct gagcaagagg taagggttta 1140
agggatggtt ggttggtggg gtattaatgt ttaattacct ggagcacctg cctgaaatca 1200
ctttttttca ggttggaccg gtgccaccat ggactataag gaccacgacg gagactacaa 1260
ggatcatgat attgattaca aagacgatga cgataagatg gccccaaaga agaagcggaa 1320
ggtcggtatc cacggagtcc cagcagccga caagaagtac agcatcggcc tggacatcgg 1380
caccaactct gtgggctggg ccgtgatcac cgacgagtac aaggtgccca gcaagaaatt 1440
caaggtgctg ggcaacaccg accggcacag catcaagaag aacctgatcg gagccctgct 1500
gttcgacagc ggcgaaacag ccgaggccac ccggctgaag agaaccgcca gaagaagata 1560
caccagacgg aagaaccgga tctgctatct gcaagagatc ttcagcaacg agatggccaa 1620
ggtggacgac agcttcttcc acagactgga agagtccttc ctggtggaag aggataagaa 1680
gcacgagcgg caccccatct tcggcaacat cgtggacgag gtggcctacc acgagaagta 1740
ccccaccatc taccacctga gaaagaaact ggtggacagc accgacaagg ccgacctgcg 1800
gctgatctat ctggccctgg cccacatgat caagttccgg ggccacttcc tgatcgaggg 1860
cgacctgaac cccgacaaca gcgacgtgga caagctgttc atccagctgg tgcagaccta 1920
caaccagctg ttcgaggaaa accccatcaa cgccagcggc gtggacgcca aggccatcct 1980
gtctgccaga ctgagcaaga gcagacggct ggaaaatctg atcgcccagc tgcccggcga 2040
gaagaagaat ggcctgttcg gaaacctgat tgccctgagc ctgggcctga cccccaactt 2100
caagagcaac ttcgacctgg ccgaggatgc caaactgcag ctgagcaagg acacctacga 2160
cgacgacctg gacaacctgc tggcccagat cggcgaccag tacgccgacc tgtttctggc 2220
cgccaagaac ctgtccgacg ccatcctgct gagcgacatc ctgagagtga acaccgagat 2280
caccaaggcc cccctgagcg cctctatgat caagagatac gacgagcacc accaggacct 2340
gaccctgctg aaagctctcg tgcggcagca gctgcctgag aagtacaaag agattttctt 2400
cgaccagagc aagaacggct acgccggcta cattgacggc ggagccagcc aggaagagtt 2460
ctacaagttc atcaagccca tcctggaaaa gatggacggc accgaggaac tgctcgtgaa 2520
gctgaacaga gaggacctgc tgcggaagca gcggaccttc gacaacggca gcatccccca 2580
ccagatccac ctgggagagc tgcacgccat tctgcggcgg caggaagatt tttacccatt 2640
cctgaaggac aaccgggaaa agatcgagaa gatcctgacc ttccgcatcc cctactacgt 2700
gggccctctg gccaggggaa acagcagatt cgcctggatg accagaaaga gcgaggaaac 2760
catcaccccc tggaacttcg aggaagtggt ggacaagggc gcttccgccc agagcttcat 2820
cgagcggatg accaacttcg ataagaacct gcccaacgag aaggtgctgc ccaagcacag 2880
cctgctgtac gagtacttca ccgtgtataa cgagctgacc aaagtgaaat acgtgaccga 2940
gggaatgaga aagcccgcct tcctgagcgg cgagcagaaa aaggccatcg tggacctgct 3000
gttcaagacc aaccggaaag tgaccgtgaa gcagctgaaa gaggactact tcaagaaaat 3060
cgagtgcttc gactccgtgg aaatctccgg cgtggaagat cggttcaacg cctccctggg 3120
cacataccac gatctgctga aaattatcaa ggacaaggac ttcctggaca atgaggaaaa 3180
cgaggacatt ctggaagata tcgtgctgac cctgacactg tttgaggaca gagagatgat 3240
cgaggaacgg ctgaaaacct atgcccacct gttcgacgac aaagtgatga agcagctgaa 3300
gcggcggaga tacaccggct ggggcaggct gagccggaag ctgatcaacg gcatccggga 3360
caagcagtcc ggcaagacaa tcctggattt cctgaagtcc gacggcttcg ccaacagaaa 3420
cttcatgcag ctgatccacg acgacagcct gacctttaaa gaggacatcc agaaagccca 3480
ggtgtccggc cagggcgata gcctgcacga gcacattgcc aatctggccg gcagccccgc 3540
cattaagaag ggcatcctgc agacagtgaa ggtggtggac gagctcgtga aagtgatggg 3600
ccggcacaag cccgagaaca tcgtgatcga aatggccaga gagaaccaga ccacccagaa 3660
gggacagaag aacagccgcg agagaatgaa gcggatcgaa gagggcatca aagagctggg 3720
cagccagatc ctgaaagaac accccgtgga aaacacccag ctgcagaacg agaagctgta 3780
cctgtactac ctgcagaatg ggcgggatat gtacgtggac caggaactgg acatcaaccg 3840
gctgtccgac tacgatgtgg accatatcgt gcctcagagc tttctgaagg acgactccat 3900
cgacaacaag gtgctgacca gaagcgacaa gaaccggggc aagagcgaca acgtgccctc 3960
cgaagaggtc gtgaagaaga tgaagaacta ctggcggcag ctgctgaacg ccaagctgat 4020
tacccagaga aagttcgaca atctgaccaa ggccgagaga ggcggcctga gcgaactgga 4080
taaggccggc ttcatcaaga gacagctggt ggaaacccgg cagatcacaa agcacgtggc 4140
acagatcctg gactcccgga tgaacactaa gtacgacgag aatgacaagc tgatccggga 4200
agtgaaagtg atcaccctga agtccaagct ggtgtccgat ttccggaagg atttccagtt 4260
ttacaaagtg cgcgagatca acaactacca ccacgcccac gacgcctacc tgaacgccgt 4320
cgtgggaacc gccctgatca aaaagtaccc taagctggaa agcgagttcg tgtacggcga 4380
ctacaaggtg tacgacgtgc ggaagatgat cgccaagagc gagcaggaaa tcggcaaggc 4440
taccgccaag tacttcttct acagcaacat catgaacttt ttcaagaccg agattaccct 4500
ggccaacggc gagatccgga agcggcctct gatcgagaca aacggcgaaa ccggggagat 4560
cgtgtgggat aagggccggg attttgccac cgtgcggaaa gtgctgagca tgccccaagt 4620
gaatatcgtg aaaaagaccg aggtgcagac aggcggcttc agcaaagagt ctatcctgcc 4680
caagaggaac agcgataagc tgatcgccag aaagaaggac tgggacccta agaagtacgg 4740
cggcttcgac agccccaccg tggcctattc tgtgctggtg gtggccaaag tggaaaaggg 4800
caagtccaag aaactgaaga gtgtgaaaga gctgctgggg atcaccatca tggaaagaag 4860
cagcttcgag aagaatccca tcgactttct ggaagccaag ggctacaaag aagtgaaaaa 4920
ggacctgatc atcaagctgc ctaagtactc cctgttcgag ctggaaaacg gccggaagag 4980
aatgctggcc tctgccggcg aactgcagaa gggaaacgaa ctggccctgc cctccaaata 5040
tgtgaacttc ctgtacctgg ccagccacta tgagaagctg aagggctccc ccgaggataa 5100
tgagcagaaa cagctgtttg tggaacagca caagcactac ctggacgaga tcatcgagca 5160
gatcagcgag ttctccaaga gagtgatcct ggccgacgct aatctggaca aagtgctgtc 5220
cgcctacaac aagcaccggg ataagcccat cagagagcag gccgagaata tcatccacct 5280
gtttaccctg accaatctgg gagcccctgc cgccttcaag tactttgaca ccaccatcga 5340
ccggaagagg tacaccagca ccaaagaggt gctggacgcc accctgatcc accagagcat 5400
caccggcctg tacgagacac ggatcgacct gtctcagctg ggaggcgaca aaaggccggc 5460
ggccacgaaa aaggccggcc aggcaaaaaa gaaaaagtaa gaattcctag agctcgctga 5520
tcagcctcga ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct 5580
tccttgaccc tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca 5640
tcgcattgtc tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag 5700
ggggaggatt gggaagagaa tagcaggcat gctggggagc ggccgcagga acccctagtg 5760
atggagttgg ccactccctc tctgcgcgct cgctcgctca ctgaggccgg gcgaccaaag 5820
gtcgcccgac gcccgggctt tgcccgggcg gcctcagtga gcgagcgagc gcgcagctgc 5880
ctgcaggggc gcctgatgcg gtattttctc cttacgcatc tgtgcggtat ttcacaccgc 5940
atacgtcaaa gcaaccatag tacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg 6000
tggttacgcg cagcgtgacc gctacacttg ccagcgcctt agcgcccgct cctttcgctt 6060
tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc 6120
tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgatttgg 6180
gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg 6240
agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc aactctatct 6300
cgggctattc ttttgattta taagggattt tgccgatttc ggtctattgg ttaaaaaatg 6360
agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaattttat 6420
ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc 6480
caacacccgc tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag 6540
ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg 6600
cgagacgaaa gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg 6660
tttcttagac gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat 6720
ttttctaaat acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc 6780
aataatattg aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct 6840
tttttgcggc attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag 6900
atgctgaaga tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta 6960
agatccttga gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc 7020
tgctatgtgg cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca 7080
tacactattc tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg 7140
atggcatgac agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg 7200
ccaacttact tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca 7260
tgggggatca tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa 7320
acgacgagcg tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa 7380
ctggcgaact acttactcta gcttcccggc aacaattaat agactggatg gaggcggata 7440
aagttgcagg accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat 7500
ctggagccgg tgagcgtgga agccgcggta tcattgcagc actggggcca gatggtaagc 7560
cctcccgtat cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata 7620
gacagatcgc tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt 7680
actcatatat actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga 7740
agatcctttt tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag 7800
cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa 7860
tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag 7920
agctaccaac tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg 7980
ttcttctagt gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat 8040
acctcgctct gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta 8100
ccgggttgga ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg 8160
gttcgtgcac acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc 8220
gtgagctatg agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa 8280
gcggcagggt cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc 8340
tttatagtcc tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt 8400
caggggggcg gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct 8460
tttgctggcc ttttgctcac atgt 8484
<210> 2
<211> 10476
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 2
gagggcctat ttcccatgat tccttcatat ttgcatatac gatacaaggc tgttagagag 60
ataattggaa ttaatttgac tgtaaacaca aagatattag tacaaaatac gtgacgtaga 120
aagtaataat ttcttgggta gtttgcagtt ttaaaattat gttttaaaat ggactatcat 180
atgcttaccg taacttgaaa gtatttcgat ttcttggctt tatatatctt gtggaaagga 240
cgaaacaccg ggtcttcgag aagacctgtt ttagagctag aaatagcaag ttaaaataag 300
gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttc tagcgcgtgc 360
gccaattctg cagacaaatg gctctagagg tacccgttac ataacttacg gtaaatggcc 420
cgcctggctg accgcccaac gacccccgcc cattgacgtc aatagtaacg ccaataggga 480
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 540
aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct 600
ggcattgtgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat 660
tagtcatcgc tattaccatg ggggcagagc gcacatcgcc cacagtcccc gagaagttgg 720
ggggaggggt cggcaattga tccggtgcct agagaaggtg gcgcggggta aactgggaaa 780
gtgatgtcgt gtactggctc cgcctttttc ccgagggtgg gggagaaccg tatataagtg 840
cagtagtcgc cgtgaacgtt ctttttcgca acgggtttgc cgccagaaca caggttggac 900
cggtgccacc atggactata aggaccacga cggagactac aaggatcatg atattgatta 960
caaagacgat gacgataaga tggcccccaa aaagaaacga aaggtgggtg ggtccccaaa 1020
gaagaagcgg aaggtcggta tccacggagt cccagcagcc gacaagaagt acagcatcgg 1080
cctggacatc ggcaccaact ctgtgggctg ggccgtgatc accgacgagt acaaggtgcc 1140
cagcaagaaa ttcaaggtgc tgggcaacac cgaccggcac agcatcaaga agaacctgat 1200
cggagccctg ctgttcgaca gcggcgaaac agccgaggcc acccggctga agagaaccgc 1260
cagaagaaga tacaccagac ggaagaaccg gatctgctat ctgcaagaga tcttcagcaa 1320
cgagatggcc aaggtggacg acagcttctt ccacagactg gaagagtcct tcctggtgga 1380
agaggataag aagcacgagc ggcaccccat cttcggcaac atcgtggacg aggtggccta 1440
ccacgagaag taccccacca tctaccacct gagaaagaaa ctggtggaca gcaccgacaa 1500
ggccgacctg cggctgatct atctggccct ggcccacatg atcaagttcc ggggccactt 1560
cctgatcgag ggcgacctga accccgacaa cagcgacgtg gacaagctgt tcatccagct 1620
ggtgcagacc tacaaccagc tgttcgagga aaaccccatc aacgccagcg gcgtggacgc 1680
caaggccatc ctgtctgcca gactgagcaa gagcagacgg ctggaaaatc tgatcgccca 1740
gctgcccggc gagaagaaga atggcctgtt cggaaacctg attgccctga gcctgggcct 1800
gacccccaac ttcaagagca acttcgacct ggccgaggat gccaaactgc agctgagcaa 1860
ggacacctac gacgacgacc tggacaacct gctggcccag atcggcgacc agtacgccga 1920
cctgtttctg gccgccaaga acctgtccga cgccatcctg ctgagcgaca tcctgagagt 1980
gaacaccgag atcaccaagg cccccctgag cgcctctatg atcaagagat acgacgagca 2040
ccaccaggac ctgaccctgc tgaaagctct cgtgcggcag cagctgcctg agaagtacaa 2100
agagattttc ttcgaccaga gcaagaacgg ctacgccggc tacattgacg gcggagccag 2160
ccaggaagag ttctacaagt tcatcaagcc catcctggaa aagatggacg gcaccgagga 2220
actgctcgtg aagctgaaca gagaggacct gctgcggaag cagcggacct tcgacaacgg 2280
cagcatcccc caccagatcc acctgggaga gctgcacgcc attctgcggc ggcaggaaga 2340
tttttaccca ttcctgaagg acaaccggga aaagatcgag aagatcctga ccttccgcat 2400
cccctactac gtgggccctc tggccagggg aaacagcaga ttcgcctgga tgaccagaaa 2460
gagcgaggaa accatcaccc cctggaactt cgaggaagtg gtggacaagg gcgcttccgc 2520
ccagagcttc atcgagcgga tgaccaactt cgataagaac ctgcccaacg agaaggtgct 2580
gcccaagcac agcctgctgt acgagtactt caccgtgtat aacgagctga ccaaagtgaa 2640
atacgtgacc gagggaatga gaaagcccgc cttcctgagc ggcgagcaga aaaaggccat 2700
cgtggacctg ctgttcaaga ccaaccggaa agtgaccgtg aagcagctga aagaggacta 2760
cttcaagaaa atcgagtgct tcgactccgt ggaaatctcc ggcgtggaag atcggttcaa 2820
cgcctccctg ggcacatacc acgatctgct gaaaattatc aaggacaagg acttcctgga 2880
caatgaggaa aacgaggaca ttctggaaga tatcgtgctg accctgacac tgtttgagga 2940
cagagagatg atcgaggaac ggctgaaaac ctatgcccac ctgttcgacg acaaagtgat 3000
gaagcagctg aagcggcgga gatacaccgg ctggggcagg ctgagccgga agctgatcaa 3060
cggcatccgg gacaagcagt ccggcaagac aatcctggat ttcctgaagt ccgacggctt 3120
cgccaacaga aacttcatgc agctgatcca cgacgacagc ctgaccttta aagaggacat 3180
ccagaaagcc caggtgtccg gccagggcga tagcctgcac gagcacattg ccaatctggc 3240
cggcagcccc gccattaaga agggcatcct gcagacagtg aaggtggtgg acgagctcgt 3300
gaaagtgatg ggccggcaca agcccgagaa catcgtgatc gaaatggcca gagagaacca 3360
gaccacccag aagggacaga agaacagccg cgagagaatg aagcggatcg aagagggcat 3420
caaagagctg ggcagccaga tcctgaaaga acaccccgtg gaaaacaccc agctgcagaa 3480
cgagaagctg tacctgtact acctgcagaa tgggcgggat atgtacgtgg accaggaact 3540
ggacatcaac cggctgtccg actacgatgt ggaccatatc gtgcctcaga gctttctgaa 3600
ggacgactcc atcgacaaca aggtgctgac cagaagcgac aagaaccggg gcaagagcga 3660
caacgtgccc tccgaagagg tcgtgaagaa gatgaagaac tactggcggc agctgctgaa 3720
cgccaagctg attacccaga gaaagttcga caatctgacc aaggccgaga gaggcggcct 3780
gagcgaactg gataaggccg gcttcatcaa gagacagctg gtggaaaccc ggcagatcac 3840
aaagcacgtg gcacagatcc tggactcccg gatgaacact aagtacgacg agaatgacaa 3900
gctgatccgg gaagtgaaag tgatcaccct gaagtccaag ctggtgtccg atttccggaa 3960
ggatttccag ttttacaaag tgcgcgagat caacaactac caccacgccc acgacgccta 4020
cctgaacgcc gtcgtgggaa ccgccctgat caaaaagtac cctaagctgg aaagcgagtt 4080
cgtgtacggc gactacaagg tgtacgacgt gcggaagatg atcgccaaga gcgagcagga 4140
aatcggcaag gctaccgcca agtacttctt ctacagcaac atcatgaact ttttcaagac 4200
cgagattacc ctggccaacg gcgagatccg gaagcggcct ctgatcgaga caaacggcga 4260
aaccggggag atcgtgtggg ataagggccg ggattttgcc accgtgcgga aagtgctgag 4320
catgccccaa gtgaatatcg tgaaaaagac cgaggtgcag acaggcggct tcagcaaaga 4380
gtctatcctg cccaagagga acagcgataa gctgatcgcc agaaagaagg actgggaccc 4440
taagaagtac ggcggcttcg acagccccac cgtggcctat tctgtgctgg tggtggccaa 4500
agtggaaaag ggcaagtcca agaaactgaa gagtgtgaaa gagctgctgg ggatcaccat 4560
catggaaaga agcagcttcg agaagaatcc catcgacttt ctggaagcca agggctacaa 4620
agaagtgaaa aaggacctga tcatcaagct gcctaagtac tccctgttcg agctggaaaa 4680
cggccggaag agaatgctgg cctctgccgg cgaactgcag aagggaaacg aactggccct 4740
gccctccaaa tatgtgaact tcctgtacct ggccagccac tatgagaagc tgaagggctc 4800
ccccgaggat aatgagcaga aacagctgtt tgtggaacag cacaagcact acctggacga 4860
gatcatcgag cagatcagcg agttctccaa gagagtgatc ctggccgacg ctaatctgga 4920
caaagtgctg tccgcctaca acaagcaccg ggataagccc atcagagagc aggccgagaa 4980
tatcatccac ctgtttaccc tgaccaatct gggagcccct gccgccttca agtactttga 5040
caccaccatc gaccggaaga ggtacaccag caccaaagag gtgctggacg ccaccctgat 5100
ccaccagagc atcaccggcc tgtacgagac acggatcgac ctgtctcagc tgggaggcga 5160
caaaaggccg gcggccacga aaaaggccgg ccaggcaaaa aagaaaaagg gcggctccaa 5220
gcggcctgcc gcgacgaaga aagcgggaca ggccaagaaa aagaaaggat ccggcgcaac 5280
aaacttctct ctgctgaaac aagccggaga tgtcgaagag aatcctggac cggtgagcaa 5340
gggcgaggag ctgttcaccg gggtggtgcc catcctggtc gagctggacg gcgacgtaaa 5400
cggccacaag ttcagcgtgt ccggcgaggg cgagggcgat gccacctacg gcaagctgac 5460
cctgaagttc atctgcacca ccggcaagct gcccgtgccc tggcccaccc tcgtgaccac 5520
cctgacctac ggcgtgcagt gcttcagccg ctaccccgac cacatgaagc agcacgactt 5580
cttcaagtcc gccatgcccg aaggctacgt ccaggagcgc accatcttct tcaaggacga 5640
cggcaactac aagacccgcg ccgaggtgaa gttcgagggc gacaccctgg tgaaccgcat 5700
cgagctgaag ggcatcgact tcaaggagga cggcaacatc ctggggcaca agctggagta 5760
caactacaac agccacaacg tctatatcat ggccgacaag cagaagaacg gcatcaaggt 5820
gaacttcaag atccgccaca acatcgagga cggcagcgtg cagctcgccg accactacca 5880
gcagaacacc cccatcggcg acggccccgt gctgctgccc gacaaccact acctgagcac 5940
ccagtccgcc ctgagcaaag accccaacga gaagcgcgat cacatggtcc tgctggagtt 6000
cgtgaccgcc gccgggatca ctctcggcat ggacgagctg tacaagggct ccggcgaggg 6060
caggggaagt cttctaacat gcggggacgt ggaggaaaat cccggcccaa ccgagtacaa 6120
gcccacggtg cgcctcgcca cccgcgacga cgtccccagg gccgtacgca ccctcgccgc 6180
cgcgttcgcc gactaccccg ccacgcgcca caccgtcgat ccggaccgcc acatcgagcg 6240
ggtcaccgag ctgcaagaac tcttcctcac gcgcgtcggg ctcgacatcg gcaaggtgtg 6300
ggtcgcggac gacggcgccg cggtggcggt ctggaccacg ccggagagcg tcgaagcggg 6360
ggcggtgttc gccgagatcg gcccgcgcat ggccgagttg agcggttccc ggctggccgc 6420
gcagcaacag atggaaggcc tcctggcgcc gcaccggccc aaggagcccg cgtggttcct 6480
ggccaccgtc ggagtctcgc ccgaccacca gggcaagggt ctgggcagcg ccgtcgtgct 6540
ccccggagtg gaggcggccg agcgcgccgg ggtgcccgcc ttcctggaga cctccgcgcc 6600
ccgcaacctc cccttctacg agcggctcgg cttcaccgtc accgccgacg tcgaggtgcc 6660
cgaaggaccg cgcacctggt gcatgacccg caagcccggt gcctgaacgc gttaagtcga 6720
caatcaacct ctggattaca aaatttgtga aagattgact ggtattctta actatgttgc 6780
tccttttacg ctatgtggat acgctgcttt aatgcctttg tatcatgcta ttgcttcccg 6840
tatggctttc attttctcct ccttgtataa atcctggttg ctgtctcttt atgaggagtt 6900
gtggcccgtt gtcaggcaac gtggcgtggt gtgcactgtg tttgctgacg caacccccac 6960
tggttggggc attgccacca cctgtcagct cctttccggg actttcgctt tccccctccc 7020
tattgccacg gcggaactca tcgccgcctg ccttgcccgc tgctggacag gggctcggct 7080
gttgggcact gacaattccg tggtgttgtc ggggaaatca tcgtcctttc cttggctgct 7140
cgcctgtgtt gccacctgga ttctgcgcgg gacgtccttc tgctacgtcc cttcggccct 7200
caatccagcg gaccttcctt cccgcggcct gctgccggct ctgcggcctc ttccgcgtct 7260
tcgccttcgc cctcagacga gtcggatctc cctttgggcc gcctccccgc gtcgacttta 7320
agaccaatga cttacaaggc agctgtagat cttagccact ttttaaaaga aaagggggga 7380
ctggaagggc taattcactc ccaacgaaga caagatctgc tttttgcttg tactgggtct 7440
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 7500
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 7560
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagggcc 7620
cgtttaaacc cgctgatcag cctcgactgt gccttctagt tgccagccat ctgttgtttg 7680
cccctccccc gtgccttcct tgaccctgga aggtgccact cccactgtcc tttcctaata 7740
aaatgaggaa attgcatcgc attgtctgag taggtgtcat tctattctgg ggggtggggt 7800
ggggcaggac agcaaggggg aggattggga agacaatagc aggcatgctg gggatgcggt 7860
gggctctatg gcctgcaggg gcgcctgatg cggtattttc tccttacgca tctgtgcggt 7920
atttcacacc gcatacgtca aagcaaccat agtacgcgcc ctgtagcggc gcattaagcg 7980
cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ttagcgcccg 8040
ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 8100
taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 8160
aacttgattt gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 8220
ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 8280
tcaactctat ctcgggctat tcttttgatt tataagggat tttgccgatt tcggtctatt 8340
ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt 8400
ttacaatttt atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagc 8460
cccgacaccc gccaacaccc gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg 8520
cttacagaca agctgtgacc gtctccggga gctgcatgtg tcagaggttt tcaccgtcat 8580
caccgaaacg cgcgagacga aagggcctcg tgatacgcct atttttatag gttaatgtca 8640
tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc 8700
ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct 8760
gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg 8820
cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg 8880
tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc 8940
tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca 9000
cttttaaagt tctgctatgt ggcgcggtat tatcccgtat tgacgccggg caagagcaac 9060
tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa 9120
agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg 9180
ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt 9240
ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg 9300
aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca acaacgttgc 9360
gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga 9420
tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta 9480
ttgctgataa atctggagcc ggtgagcgtg gaagccgcgg tatcattgca gcactggggc 9540
cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg 9600
atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt 9660
cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa 9720
ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 9780
cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt 9840
ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt 9900
tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga 9960
taccaaatac tgttcttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag 10020
caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata 10080
agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg 10140
gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga 10200
gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca 10260
ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa 10320
acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt 10380
tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac 10440
ggttcctggc cttttgctgg ccttttgctc acatgt 10476
<210> 3
<211> 3120
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 3
gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60
cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120
tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180
aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240
ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300
ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360
tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420
tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480
actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540
gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600
acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660
gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720
acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780
gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840
ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900
gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960
cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020
agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080
catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140
tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200
cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260
gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320
taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc 1380
ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440
tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500
ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560
cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620
agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680
gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740
atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800
gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860
gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920
ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980
cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040
cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100
acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160
cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220
accatgatta cgccaagctt gcatgcaggc ctctgcagtc gacgggcccg ggatccgatg 2280
ataaacatgt gagggcctat ttcccatgat tccttcatat ttgcatatac gatacaaggc 2340
tgttagagag ataattggaa ttaatttgac tgtaaacaca aagatattag tacaaaatac 2400
gtgacgtaga aagtaataat ttcttgggta gtttgcagtt ttaaaattat gttttaaaat 2460
ggactatcat atgcttaccg taacttgaaa gtatttcgat ttcttggctt tatatatctt 2520
gtggaaagga cgaaacaccg ggtcttcgag aagacctgtt ttagagctag aaatagcaag 2580
ttaaaataag gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttc 2640
tagcgcgtgc gccaattctg cagacaaatg gctctagagg tacccataga tctagatgca 2700
ttcgcgaggt accgagctcg aattcactgg ccgtcgtttt acaacgtcgt gactgggaaa 2760
accctggcgt tacccaactt aatcgccttg cagcacatcc ccctttcgcc agctggcgta 2820
atagcgaaga ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat 2880
ggcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt 2940
gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa 3000
cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg 3060
tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga 3120
<210> 4
<211> 175
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 4
tgtggaaagg acgaaacacc gggtcttcga gaagacctgt tttagagcta gaaatagcaa 60
gttaaaataa ggctagtccg ttatcaactt gaaaaagtgg caccgagtcg gtgctttttt 120
ctagcgcgtg cgccaattct gcagacaaat ggctctagag gtacccgtta cataa 175
<210> 5
<211> 554
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 5
tctgcagaca aatggctcta gaggtacccg ttacataact tacggtaaat ggcccgcctg 60
gctgaccgcc caacgacccc cgcccattga cgtcaatagt aacgccaata gggactttcc 120
attgacgtca atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt 180
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 240
gtgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 300
tcgctattac catgggggca gagcgcacat cgcccacagt ccccgagaag ttggggggag 360
gggtcggcaa ttgatccggt gcctagagaa ggtggcgcgg ggtaaactgg gaaagtgatg 420
tcgtgtactg gctccgcctt tttcccgagg gtgggggaga accgtatata agtgcagtag 480
tcgccgtgaa cgttcttttt cgcaacgggt ttgccgccag aacacaggtt ggaccggtgc 540
caccatggac tata 554
<210> 6
<211> 447
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 6
ccagaacaca ggttggaccg gtgccaccat ggactataag gaccacgacg gagactacaa 60
ggatcatgat attgattaca aagacgatga cgataagatg gcccccaaaa agaaacgaaa 120
ggtgggtggg tccccaaaga agaagcggaa ggtcggtatc cacggagtcc cagcagccga 180
caagaagtac agcatcggcc tggacatcgg caccaactct gtgggctggg ccgtgatcac 240
cgacgagtac aaggtgccca gcaagaaatt caaggtgctg ggcaacaccg accggcacag 300
catcaagaag aacctgatcg gagccctgct gttcgacagc ggcgaaacag ccgaggccac 360
ccggctgaag agaaccgcca gaagaagata caccagacgg aagaaccgga tctgctatct 420
gcaagagatc ttcagcaacg agatggc 447
<210> 7
<211> 2727
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 7
cggcggccac gaaaaaggcc ggccaggcaa aaaagaaaaa gggcggctcc aagcggcctg 60
ccgcgacgaa gaaagcggga caggccaaga aaaagaaagg atccggcgca acaaacttct 120
ctctgctgaa acaagccgga gatgtcgaag agaatcctgg accggtgagc aagggcgagg 180
agctgttcac cggggtggtg cccatcctgg tcgagctgga cggcgacgta aacggccaca 240
agttcagcgt gtccggcgag ggcgagggcg atgccaccta cggcaagctg accctgaagt 300
tcatctgcac caccggcaag ctgcccgtgc cctggcccac cctcgtgacc accctgacct 360
acggcgtgca gtgcttcagc cgctaccccg accacatgaa gcagcacgac ttcttcaagt 420
ccgccatgcc cgaaggctac gtccaggagc gcaccatctt cttcaaggac gacggcaact 480
acaagacccg cgccgaggtg aagttcgagg gcgacaccct ggtgaaccgc atcgagctga 540
agggcatcga cttcaaggag gacggcaaca tcctggggca caagctggag tacaactaca 600
acagccacaa cgtctatatc atggccgaca agcagaagaa cggcatcaag gtgaacttca 660
agatccgcca caacatcgag gacggcagcg tgcagctcgc cgaccactac cagcagaaca 720
cccccatcgg cgacggcccc gtgctgctgc ccgacaacca ctacctgagc acccagtccg 780
ccctgagcaa agaccccaac gagaagcgcg atcacatggt cctgctggag ttcgtgaccg 840
ccgccgggat cactctcggc atggacgagc tgtacaaggg ctccggcgag ggcaggggaa 900
gtcttctaac atgcggggac gtggaggaaa atcccggccc aaccgagtac aagcccacgg 960
tgcgcctcgc cacccgcgac gacgtcccca gggccgtacg caccctcgcc gccgcgttcg 1020
ccgactaccc cgccacgcgc cacaccgtcg atccggaccg ccacatcgag cgggtcaccg 1080
agctgcaaga actcttcctc acgcgcgtcg ggctcgacat cggcaaggtg tgggtcgcgg 1140
acgacggcgc cgcggtggcg gtctggacca cgccggagag cgtcgaagcg ggggcggtgt 1200
tcgccgagat cggcccgcgc atggccgagt tgagcggttc ccggctggcc gcgcagcaac 1260
agatggaagg cctcctggcg ccgcaccggc ccaaggagcc cgcgtggttc ctggccaccg 1320
tcggagtctc gcccgaccac cagggcaagg gtctgggcag cgccgtcgtg ctccccggag 1380
tggaggcggc cgagcgcgcc ggggtgcccg ccttcctgga gacctccgcg ccccgcaacc 1440
tccccttcta cgagcggctc ggcttcaccg tcaccgccga cgtcgaggtg cccgaaggac 1500
cgcgcacctg gtgcatgacc cgcaagcccg gtgcctgaac gcgttaagtc gacaatcaac 1560
ctctggatta caaaatttgt gaaagattga ctggtattct taactatgtt gctcctttta 1620
cgctatgtgg atacgctgct ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt 1680
tcattttctc ctccttgtat aaatcctggt tgctgtctct ttatgaggag ttgtggcccg 1740
ttgtcaggca acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc actggttggg 1800
gcattgccac cacctgtcag ctcctttccg ggactttcgc tttccccctc cctattgcca 1860
cggcggaact catcgccgcc tgccttgccc gctgctggac aggggctcgg ctgttgggca 1920
ctgacaattc cgtggtgttg tcggggaaat catcgtcctt tccttggctg ctcgcctgtg 1980
ttgccacctg gattctgcgc gggacgtcct tctgctacgt cccttcggcc ctcaatccag 2040
cggaccttcc ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc 2100
gccctcagac gagtcggatc tccctttggg ccgcctcccc gcgtcgactt taagaccaat 2160
gacttacaag gcagctgtag atcttagcca ctttttaaaa gaaaaggggg gactggaagg 2220
gctaattcac tcccaacgaa gacaagatct gctttttgct tgtactgggt ctctctggtt 2280
agaccagatc tgagcctggg agctctctgg ctaactaggg aacccactgc ttaagcctca 2340
ataaagcttg ccttgagtgc ttcaagtagt gtgtgcccgt ctgttgtgtg actctggtaa 2400
ctagagatcc ctcagaccct tttagtcagt gtggaaaatc tctagcaggg cccgtttaaa 2460
cccgctgatc agcctcgact gtgccttcta gttgccagcc atctgttgtt tgcccctccc 2520
ccgtgccttc cttgaccctg gaaggtgcca ctcccactgt cctttcctaa taaaatgagg 2580
aaattgcatc gcattgtctg agtaggtgtc attctattct ggggggtggg gtggggcagg 2640
acagcaaggg ggaggattgg gaagacaata gcaggcatgc tggggatgcg gtgggctcta 2700
tggcctgcag gggcgcctga tgcggta 2727
<210> 8
<211> 410
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 8
gataaacatg tgagggccta tttcccatga ttccttcata tttgcatata cgatacaagg 60
ctgttagaga gataattgga attaatttga ctgtaaacac aaagatatta gtacaaaata 120
cgtgacgtag aaagtaataa tttcttgggt agtttgcagt tttaaaatta tgttttaaaa 180
tggactatca tatgcttacc gtaacttgaa agtatttcga tttcttggct ttatatatct 240
tgtggaaagg acgaaacacc gggtcttcga gaagacctgt tttagagcta gaaatagcaa 300
gttaaaataa ggctagtccg ttatcaactt gaaaaagtgg caccgagtcg gtgctttttt 360
ctagcgcgtg cgccaattct gcagacaaat ggctctagag gtacccatag 410
<210> 9
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 9
agttatggca gaactcagtg 20
<210> 10
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 10
caccgagtta tggcagaact cagtg 25
<210> 11
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 11
aaaccactga gttctgccat aactc 25
<210> 12
<211> 100
<212> RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 12
aguuauggca gaacucagug guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 13
<211> 375
<212> PRT
<213> pig (Sus scrofa)
<400> 13
Met Gln Lys Leu Gln Ile Tyr Val Tyr Ile Tyr Leu Phe Met Leu Ile
1 5 10 15
Val Ala Gly Pro Val Asp Leu Asn Glu Asn Ser Glu Gln Lys Glu Asn
20 25 30
Val Glu Lys Glu Gly Leu Cys Asn Ala Cys Met Trp Arg Gln Asn Thr
35 40 45
Lys Ser Ser Arg Leu Glu Ala Ile Lys Ile Gln Ile Leu Ser Lys Leu
50 55 60
Arg Leu Glu Thr Ala Pro Asn Ile Ser Lys Asp Ala Ile Arg Gln Leu
65 70 75 80
Leu Pro Lys Ala Pro Pro Leu Arg Glu Leu Ile Asp Gln Tyr Asp Val
85 90 95
Gln Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His
100 105 110
Ala Thr Thr Glu Thr Ile Ile Thr Met Pro Thr Glu Ser Asp Leu Leu
115 120 125
Met Gln Val Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser
130 135 140
Lys Ile Gln Tyr Asn Lys Val Val Lys Ala Gln Leu Trp Ile Tyr Leu
145 150 155 160
Arg Pro Val Lys Thr Pro Thr Thr Val Phe Val Gln Ile Leu Arg Leu
165 170 175
Ile Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly Ile Arg Ser Leu
180 185 190
Lys Leu Asp Met Asn Pro Gly Thr Gly Ile Trp Gln Ser Ile Asp Val
195 200 205
Lys Thr Val Leu Gln Asn Trp Leu Lys Gln Pro Glu Ser Asn Leu Gly
210 215 220
Ile Glu Ile Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr
225 230 235 240
Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys
245 250 255
Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys
260 265 270
Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val
275 280 285
Asp Phe Glu Ala Phe Gly Trp Asp Trp Ile Ile Ala Pro Lys Arg Tyr
290 295 300
Lys Ala Ser Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gln Lys
305 310 315 320
Tyr Pro His Thr His Leu Val His Gln Ala Asn Pro Arg Gly Ser Ala
325 330 335
Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro Ile Asn Met Leu Tyr
340 345 350
Phe Asn Gly Lys Glu Gln Ile Ile Tyr Gly Lys Ile Pro Ala Met Val
355 360 365
Val Asp Arg Cys Gly Cys Ser
370 375
<210> 14
<211> 573
<212> DNA
<213> pig (Sus scrofa)
<400> 14
atgcaaaaac tgcaaatcta tgtttatatt tacctgttta tgctgattgt tgctggtccc 60
gtggatctga atgagaacag cgagcaaaag gaaaatgtgg aaaaagaggg gctgtgtaat 120
gcatgtatgt ggagacaaaa cactaaatct tcaagactag aagccataaa aattcaaatc 180
ctcagtaaac ttcgcctgga aacagctcct aacattagca aagatgctat aagacaactt 240
ttgcccaaag ctcctccact ccgggaactg attgatcagt acgatgtcca gagagatgac 300
agcagtgatg gctccttgga agatgatgat tatcacgcta cgacggaaac gatcattacc 360
atgcctacag agtgtaagta gtcctattag tgtatatcaa caattctgct gactgttgtt 420
ccagtgttta tgagaaacag atctattttc aggctctttt aacaagctgt tggcttgtac 480
gtaagtagga gggaaaagag tttctttttt caagatttca tgagaaataa actaatgaga 540
ctgaaagctg ctgtattatt gttttcctta gct 573
<210> 15
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 15
tgatcaatca gttcccggag 20
<210> 16
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 16
caccgtgatc aatcagttcc cggag 25
<210> 17
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 17
aaacctccgg gaactgattg atcac 25
<210> 18
<211> 100
<212> RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 18
ugaucaauca guucccggag guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 19
<211> 116
<212> PRT
<213> pig (Sus scrofa)
<400> 19
Met Leu Ser Cys Arg Leu Gln Cys Ala Leu Ala Ala Leu Ser Ile Val
1 5 10 15
Leu Ala Leu Gly Gly Val Thr Gly Ala Pro Ser Asp Pro Arg Leu Arg
20 25 30
Gln Phe Leu Gln Lys Ser Leu Ala Ala Ala Ala Gly Lys Gln Glu Leu
35 40 45
Ala Lys Tyr Phe Leu Ala Glu Leu Leu Ser Glu Pro Asn Gln Thr Glu
50 55 60
Asn Asp Ala Leu Glu Pro Glu Asp Leu Ser Gln Ala Ala Glu Gln Asp
65 70 75 80
Glu Met Arg Leu Glu Leu Gln Arg Ser Ala Asn Ser Asn Pro Ala Met
85 90 95
Ala Pro Arg Glu Arg Lys Ala Gly Cys Lys Asn Phe Phe Trp Lys Thr
100 105 110
Phe Thr Ser Cys
115
<210> 20
<211> 938
<212> DNA
<213> pig (Sus scrofa)
<400> 20
taactggtgt gcacatgtgt gagtgaaatt atggaatgtg tatgtgcata gcactgagtg 60
aatataaaaa gattgtgtag atggtgtggc acgtggggga attgtgtggg cctgtgtgca 120
ggatttattt atttcttaat aagctacttt tgattgtgta gagcctcctc tcacttcggt 180
gattgatttc acgagggtaa tggtgcgtaa aagcgctggt gagatctggg ggcgcctcct 240
agtctgacgt cagagagaga gtttaaaaag ggggagacgg tggcgagcgc acaagccgct 300
tcaggagtcg cgaggttcag agccgtcgct gctgcctgca aatcgactcc tagagtttga 360
ccaaccgcgc tctagctcgg cttctctggc cgctgccgag atgctgtcct gccgcctcca 420
gtgcgcgctg gccgcgctct ccatcgtcct ggctctgggc ggtgtcactg gcgcgccctc 480
ggatccccga ctccgtcagt ttctgcagaa gtccctggct gctgccgctg ggaagcaggt 540
aaggagactc cctcgacgcc ttctttcccc tctcgcgaat cccctaacct taccttagcc 600
ttgcccctcc tcccttgggt ggacttagga ggtggtccca aagagtatcg gtgcttttct 660
gggtccctta ggcaccaaat ctctcaggaa aactttcaaa gtccagaatt cctttttacc 720
tctttgtttt ttccctcttt gatcagcgca gtaggtcaca gttcaggtga gttctttggc 780
tttcaagaaa attctaagat ctggggaact gagctcgagg ggatgatggc atctatccgc 840
ggtgctgacc atgggaggtg ctgacccagg tgctgaaagc gcggacctct gaagcttcct 900
aagcagtacc tcccacccat gcagcagggc tgggggct 938
<210> 21
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 21
tccatcgtcc tggctctggg 20
<210> 22
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 22
caccgtccat cgtcctggct ctggg 25
<210> 23
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 23
aaaccccaga gccaggacga tggac 25
<210> 24
<211> 100
<212> RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 24
uccaucgucc uggcucuggg guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 25
<211> 1113
<212> PRT
<213> pig (Sus scrofa)
<400> 25
Met Asp Lys Leu Arg Met Val Leu His Glu Asn Ser Gly Ser Ala Asp
1 5 10 15
Phe Arg Arg Cys Ser Ala His Leu Ser Ser Phe Thr Phe Ala Val Val
20 25 30
Ala Val Leu Ser Ala Cys Leu Val Thr Ser Ser Leu Gly Gly Lys Asp
35 40 45
Lys Glu Leu Arg Leu Thr Gly Gly Glu Asn Lys Cys Ser Gly Arg Val
50 55 60
Glu Val Lys Val Gln Glu Glu Trp Gly Thr Val Cys Asn Asn Gly Trp
65 70 75 80
Asp Met Asp Val Val Ser Val Val Cys Arg Gln Leu Gly Cys Pro Thr
85 90 95
Ala Ile Lys Ala Thr Gly Trp Ala Asn Phe Ser Ala Gly Ser Gly Arg
100 105 110
Ile Trp Met Asp His Val Ser Cys Arg Gly Asn Glu Ser Ala Leu Trp
115 120 125
Asp Cys Lys His Asp Gly Trp Gly Lys His Asn Cys Thr His Gln Gln
130 135 140
Asp Ala Gly Val Thr Cys Ser Asp Gly Ser Asp Leu Glu Met Arg Leu
145 150 155 160
Val Asn Gly Gly Asn Arg Cys Leu Gly Arg Ile Glu Val Lys Phe Gln
165 170 175
Gly Arg Trp Gly Thr Val Cys Asp Asp Asn Phe Asn Ile Asn His Ala
180 185 190
Ser Val Val Cys Lys Gln Leu Glu Cys Gly Ser Ala Val Ser Phe Ser
195 200 205
Gly Ser Ala Asn Phe Gly Glu Gly Ser Gly Pro Ile Trp Phe Asp Asp
210 215 220
Leu Val Cys Asn Gly Asn Glu Ser Ala Leu Trp Asn Cys Lys His Glu
225 230 235 240
Gly Trp Gly Lys His Asn Cys Asp His Ala Glu Asp Ala Gly Val Ile
245 250 255
Cys Leu Asn Gly Ala Asp Leu Lys Leu Arg Val Val Asp Gly Val Thr
260 265 270
Glu Cys Ser Gly Arg Leu Glu Val Lys Phe Gln Gly Glu Trp Gly Thr
275 280 285
Ile Cys Asp Asp Gly Trp Asp Ser Asp Asp Ala Ala Val Ala Cys Lys
290 295 300
Gln Leu Gly Cys Pro Thr Ala Val Thr Ala Ile Gly Arg Val Asn Ala
305 310 315 320
Ser Glu Gly Thr Gly His Ile Trp Leu Asp Ser Val Ser Cys His Gly
325 330 335
His Glu Ser Ala Leu Trp Gln Cys Arg His His Glu Trp Gly Lys His
340 345 350
Tyr Cys Asn His Asn Glu Asp Ala Gly Val Thr Cys Ser Asp Gly Ser
355 360 365
Asp Leu Glu Leu Arg Leu Lys Gly Gly Gly Ser His Cys Ala Gly Thr
370 375 380
Val Glu Val Glu Ile Gln Lys Leu Val Gly Lys Val Cys Asp Arg Ser
385 390 395 400
Trp Gly Leu Lys Glu Ala Asp Val Val Cys Arg Gln Leu Gly Cys Gly
405 410 415
Ser Ala Leu Lys Thr Ser Tyr Gln Val Tyr Ser Lys Thr Lys Ala Thr
420 425 430
Asn Thr Trp Leu Phe Val Ser Ser Cys Asn Gly Asn Glu Thr Ser Leu
435 440 445
Trp Asp Cys Lys Asn Trp Gln Trp Gly Gly Leu Ser Cys Asp His Tyr
450 455 460
Asp Glu Ala Lys Ile Thr Cys Ser Ala His Arg Lys Pro Arg Leu Val
465 470 475 480
Gly Gly Asp Ile Pro Cys Ser Gly Arg Val Glu Val Gln His Gly Asp
485 490 495
Thr Trp Gly Thr Val Cys Asp Ser Asp Phe Ser Leu Glu Ala Ala Ser
500 505 510
Val Leu Cys Arg Glu Leu Gln Cys Gly Thr Val Val Ser Leu Leu Gly
515 520 525
Gly Ala His Phe Gly Glu Gly Ser Gly Gln Ile Trp Ala Glu Glu Phe
530 535 540
Gln Cys Glu Gly His Glu Ser His Leu Ser Leu Cys Pro Val Ala Pro
545 550 555 560
Arg Pro Asp Gly Thr Cys Ser His Ser Arg Asp Val Gly Val Val Cys
565 570 575
Ser Arg Tyr Thr Gln Ile Arg Leu Val Asn Gly Lys Thr Pro Cys Glu
580 585 590
Gly Arg Val Glu Leu Asn Ile Leu Gly Ser Trp Gly Ser Leu Cys Asn
595 600 605
Ser His Trp Asp Met Glu Asp Ala His Val Leu Cys Gln Gln Leu Lys
610 615 620
Cys Gly Val Ala Leu Ser Ile Pro Gly Gly Ala Pro Phe Gly Lys Gly
625 630 635 640
Ser Glu Gln Val Trp Arg His Met Phe His Cys Thr Gly Thr Glu Lys
645 650 655
His Met Gly Asp Cys Ser Val Thr Ala Leu Gly Ala Ser Leu Cys Ser
660 665 670
Ser Gly Gln Val Ala Ser Val Ile Cys Ser Gly Asn Gln Ser Gln Thr
675 680 685
Leu Ser Pro Cys Asn Ser Ser Ser Ser Asp Pro Ser Ser Ser Ile Ile
690 695 700
Ser Glu Glu Asn Gly Val Ala Cys Ile Gly Ser Gly Gln Leu Arg Leu
705 710 715 720
Val Asp Gly Gly Gly Arg Cys Ala Gly Arg Val Glu Val Tyr His Glu
725 730 735
Gly Ser Trp Gly Thr Ile Cys Asp Asp Ser Trp Asp Leu Asn Asp Ala
740 745 750
His Val Val Cys Lys Gln Leu Ser Cys Gly Trp Ala Ile Asn Ala Thr
755 760 765
Gly Ser Ala His Phe Gly Glu Gly Thr Gly Pro Ile Trp Leu Asp Glu
770 775 780
Ile Asn Cys Asn Gly Lys Glu Ser His Ile Trp Gln Cys His Ser His
785 790 795 800
Gly Trp Gly Arg His Asn Cys Arg His Lys Glu Asp Ala Gly Val Ile
805 810 815
Cys Ser Glu Phe Met Ser Leu Arg Leu Ile Ser Glu Asn Ser Arg Glu
820 825 830
Thr Cys Ala Gly Arg Leu Glu Val Phe Tyr Asn Gly Ala Trp Gly Ser
835 840 845
Val Gly Lys Asn Ser Met Ser Pro Ala Thr Val Gly Val Val Cys Arg
850 855 860
Gln Leu Gly Cys Ala Asp Arg Gly Asp Ile Ser Pro Ala Ser Ser Asp
865 870 875 880
Lys Thr Val Ser Arg His Met Trp Val Asp Asn Val Gln Cys Pro Lys
885 890 895
Gly Pro Asp Thr Leu Trp Gln Cys Pro Ser Ser Pro Trp Lys Lys Arg
900 905 910
Leu Ala Ser Pro Ser Glu Glu Thr Trp Ile Thr Cys Ala Asn Lys Ile
915 920 925
Arg Leu Gln Glu Gly Asn Thr Asn Cys Ser Gly Arg Val Glu Ile Trp
930 935 940
Tyr Gly Gly Ser Trp Gly Thr Val Cys Asp Asp Ser Trp Asp Leu Glu
945 950 955 960
Asp Ala Gln Val Val Cys Arg Gln Leu Gly Cys Gly Ser Ala Leu Glu
965 970 975
Ala Gly Lys Glu Ala Ala Phe Gly Gln Gly Thr Gly Pro Ile Trp Leu
980 985 990
Asn Glu Val Lys Cys Lys Gly Asn Glu Thr Ser Leu Trp Asp Cys Pro
995 1000 1005
Ala Arg Ser Trp Gly His Ser Asp Cys Gly His Lys Glu Asp Ala Ala
1010 1015 1020
Val Thr Cys Ser Glu Ile Ala Lys Ser Arg Glu Ser Leu His Ala Thr
1025 1030 1035 1040
Gly Arg Ser Ser Phe Val Ala Leu Ala Ile Phe Gly Val Ile Leu Leu
1045 1050 1055
Ala Cys Leu Ile Ala Phe Leu Ile Trp Thr Gln Lys Arg Arg Gln Arg
1060 1065 1070
Gln Arg Leu Ser Val Phe Ser Gly Gly Glu Asn Ser Val His Gln Ile
1075 1080 1085
Gln Tyr Arg Glu Met Asn Ser Cys Leu Lys Ala Asp Glu Thr Asp Met
1090 1095 1100
Leu Asn Pro Ser Glu Asn Ser Asn Glu
1105 1110
<210> 26
<211> 1115
<212> DNA
<213> pig (Sus scrofa)
<400> 26
tggcaaagat tgtctttaaa atctgagctc catgtcttct gctttatttc tggtgtgcct 60
ttgactccag attacagtaa atggaggact gagtataggg ctaaaaagta gagagaatgg 120
atgcatatta tctgtggtct ccaatgtgat gaatgaagta ggcaaatact caaaggaaag 180
agaaagcatg ctccaagaat tatgggttcc agaaggcaaa gtcccagaat tgtctccagg 240
gaaggacagg gaggtctaga atcggctaag cccactgtag gcagaaaaac caagaggcat 300
gaatggcttc cctttctcac ttttcactct ctggcttact cctatcatga aggaaaatat 360
tggaatcata ttctccctca ccgaaatgct attttttcag cccacaggaa acccaggctg 420
gttggagggg acattccctg ctctggtcgt gttgaagtac aacatggaga cacgtggggc 480
accgtctgtg attctgactt ctctctggag gcggccagcg tgctgtgcag ggaactacag 540
tgcggcactg tggtttccct cctgggggga gctcactttg gagaaggaag tggacagatc 600
tgggctgaag aattccagtg tgaggggcac gagtcccacc tttcactctg cccagtagca 660
ccccgccctg acgggacatg tagccacagc agggacgtcg gcgtagtctg ctcaagtgag 720
acccagggaa tgtgttcact ttgttcccat gccatgaaga gggtagggtt aggtagtcac 780
agacatcttt ttaaagccct gtctccttcc aggatacaca caaatccgct tggtgaatgg 840
caagacccca tgtgaaggaa gagtggagct caacattctt gggtcctggg ggtccctctg 900
caactctcac tgggacatgg aagatgccca tgttttatgc cagcagctta aatgtggagt 960
tgccctttct atcccgggag gagcaccttt tgggaaagga agtgagcagg tctggaggca 1020
catgtttcac tgcactggga ctgagaagca catgggagat tgttccgtca ctgctctggg 1080
cgcatcactc tgttcttcag ggcaagtggc ctctg 1115
<210> 27
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 27
ggtcgtgttg aagtacaaca 20
<210> 28
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 28
caccggtcgt gttgaagtac aaca 24
<210> 29
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 29
aaactgttgt acttcaacac gacc 24
<210> 30
<211> 100
<212> RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 30
ggucguguug aaguacaaca guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 31
<211> 963
<212> PRT
<213> pig (Sus scrofa)
<400> 31
Met Ala Lys Gly Phe Tyr Ile Ser Lys Ala Leu Gly Ile Leu Gly Ile
1 5 10 15
Leu Leu Gly Val Ala Ala Val Ala Thr Ile Ile Ala Leu Ser Val Val
20 25 30
Tyr Ala Gln Glu Lys Asn Lys Asn Ala Glu His Val Pro Gln Ala Pro
35 40 45
Thr Ser Pro Thr Ile Thr Thr Thr Ala Ala Ile Thr Leu Asp Gln Ser
50 55 60
Lys Pro Trp Asn Arg Tyr Arg Leu Pro Thr Thr Leu Leu Pro Asp Ser
65 70 75 80
Tyr Phe Val Thr Leu Arg Pro Tyr Leu Thr Pro Asn Ala Asp Gly Leu
85 90 95
Tyr Ile Phe Lys Gly Lys Ser Ile Val Arg Leu Leu Cys Gln Glu Pro
100 105 110
Thr Asp Val Ile Ile Ile His Ser Lys Lys Leu Asn Tyr Thr Thr Gln
115 120 125
Gly His Met Val Val Leu Arg Gly Val Gly Asp Ser Gln Val Pro Glu
130 135 140
Ile Asp Arg Thr Glu Leu Val Glu Leu Thr Glu Tyr Leu Val Val His
145 150 155 160
Leu Lys Gly Ser Leu Gln Pro Gly His Met Tyr Glu Met Glu Ser Glu
165 170 175
Phe Gln Gly Glu Leu Ala Asp Asp Leu Ala Gly Phe Tyr Arg Ser Glu
180 185 190
Tyr Met Glu Gly Asn Val Lys Lys Val Leu Ala Thr Thr Gln Met Gln
195 200 205
Ser Thr Asp Ala Arg Lys Ser Phe Pro Cys Phe Asp Glu Pro Ala Met
210 215 220
Lys Ala Thr Phe Asn Ile Thr Leu Ile His Pro Asn Asn Leu Thr Ala
225 230 235 240
Leu Ser Asn Met Pro Pro Lys Gly Ser Ser Thr Pro Leu Ala Glu Asp
245 250 255
Pro Asn Trp Ser Val Thr Glu Phe Glu Thr Thr Pro Val Met Ser Thr
260 265 270
Tyr Leu Leu Ala Tyr Ile Val Ser Glu Phe Gln Ser Val Asn Glu Thr
275 280 285
Ala Gln Asn Gly Val Leu Ile Arg Ile Trp Ala Arg Pro Asn Ala Ile
290 295 300
Ala Glu Gly His Gly Met Tyr Ala Leu Asn Val Thr Gly Pro Ile Leu
305 310 315 320
Asn Phe Phe Ala Asn His Tyr Asn Thr Ser Tyr Pro Leu Pro Lys Ser
325 330 335
Asp Gln Ile Ala Leu Pro Asp Phe Asn Ala Gly Ala Met Glu Asn Trp
340 345 350
Gly Leu Val Thr Tyr Arg Glu Asn Ala Leu Leu Phe Asp Pro Gln Ser
355 360 365
Ser Ser Ile Ser Asn Lys Glu Arg Val Val Thr Val Ile Ala His Glu
370 375 380
Leu Ala His Gln Trp Phe Gly Asn Leu Val Thr Leu Ala Trp Trp Asn
385 390 395 400
Asp Leu Trp Leu Asn Glu Gly Phe Ala Ser Tyr Val Glu Tyr Leu Gly
405 410 415
Ala Asp His Ala Glu Pro Thr Trp Asn Leu Lys Asp Leu Ile Val Pro
420 425 430
Gly Asp Val Tyr Arg Val Met Ala Val Asp Ala Leu Ala Ser Ser His
435 440 445
Pro Leu Thr Thr Pro Ala Glu Glu Val Asn Thr Pro Ala Gln Ile Ser
450 455 460
Glu Met Phe Asp Ser Ile Ser Tyr Ser Lys Gly Ala Ser Val Ile Arg
465 470 475 480
Met Leu Ser Asn Phe Leu Thr Glu Asp Leu Phe Lys Glu Gly Leu Ala
485 490 495
Ser Tyr Leu His Ala Phe Ala Tyr Gln Asn Thr Thr Tyr Leu Asp Leu
500 505 510
Trp Glu His Leu Gln Lys Ala Val Asp Ala Gln Thr Ser Ile Arg Leu
515 520 525
Pro Asp Thr Val Arg Ala Ile Met Asp Arg Trp Thr Leu Gln Met Gly
530 535 540
Phe Pro Val Ile Thr Val Asp Thr Lys Thr Gly Asn Ile Ser Gln Lys
545 550 555 560
His Phe Leu Leu Asp Ser Glu Ser Asn Val Thr Arg Ser Ser Ala Phe
565 570 575
Asp Tyr Leu Trp Ile Val Pro Ile Ser Ser Ile Lys Asn Gly Val Met
580 585 590
Gln Asp His Tyr Trp Leu Arg Asp Val Ser Gln Ala Gln Asn Asp Leu
595 600 605
Phe Lys Thr Ala Ser Asp Asp Trp Val Leu Leu Asn Val Asn Val Thr
610 615 620
Gly Tyr Phe Gln Val Asn Tyr Asp Glu Asp Asn Trp Arg Met Ile Gln
625 630 635 640
His Gln Leu Gln Thr Asn Leu Ser Val Ile Pro Val Ile Asn Arg Ala
645 650 655
Gln Val Ile Tyr Asp Ser Phe Asn Leu Ala Thr Ala His Met Val Pro
660 665 670
Val Thr Leu Ala Leu Asp Asn Thr Leu Phe Leu Asn Gly Glu Lys Glu
675 680 685
Tyr Met Pro Trp Gln Ala Ala Leu Ser Ser Leu Ser Tyr Phe Ser Leu
690 695 700
Met Phe Asp Arg Ser Glu Val Tyr Gly Pro Met Lys Lys Tyr Leu Arg
705 710 715 720
Lys Gln Val Glu Pro Leu Phe Gln His Phe Glu Thr Leu Thr Lys Asn
725 730 735
Trp Thr Glu Arg Pro Glu Asn Leu Met Asp Gln Tyr Ser Glu Ile Asn
740 745 750
Ala Ile Ser Thr Ala Cys Ser Asn Gly Leu Pro Gln Cys Glu Asn Leu
755 760 765
Ala Lys Thr Leu Phe Asp Gln Trp Met Ser Asp Pro Glu Asn Asn Pro
770 775 780
Ile His Pro Asn Leu Arg Ser Thr Ile Tyr Cys Asn Ala Ile Ala Gln
785 790 795 800
Gly Gly Gln Asp Gln Trp Asp Phe Ala Trp Gly Gln Leu Gln Gln Ala
805 810 815
Gln Leu Val Asn Glu Ala Asp Lys Leu Arg Ser Ala Leu Ala Cys Ser
820 825 830
Asn Glu Val Trp Leu Leu Asn Arg Tyr Leu Gly Tyr Thr Leu Asn Pro
835 840 845
Asp Leu Ile Arg Lys Gln Asp Ala Thr Ser Thr Ile Asn Ser Ile Ala
850 855 860
Ser Asn Val Ile Gly Gln Pro Leu Ala Trp Asp Phe Val Gln Ser Asn
865 870 875 880
Trp Lys Lys Leu Phe Gln Asp Tyr Gly Gly Gly Ser Phe Ser Phe Ser
885 890 895
Asn Leu Ile Gln Gly Val Thr Arg Arg Phe Ser Ser Glu Phe Glu Leu
900 905 910
Gln Gln Leu Glu Gln Phe Lys Lys Asn Asn Met Asp Val Gly Phe Gly
915 920 925
Ser Gly Thr Arg Ala Leu Glu Gln Ala Leu Glu Lys Thr Lys Ala Asn
930 935 940
Ile Lys Trp Val Lys Glu Asn Lys Glu Val Val Leu Asn Trp Phe Ile
945 950 955 960
Glu His Ser
<210> 32
<211> 1203
<212> DNA
<213> pig (Sus scrofa)
<400> 32
ctgccacctg cccttcagcc cttggtgggc tcccaggctc ctgcagcctg taaccagacc 60
ctgtttgctc ccagcaggca cccctgagcc gcactccgca cgctgttcct gaatctcccc 120
tccagaaccg gagcagtgtc tctacccagt tcagtgacct tcgtctgtct gagccctggt 180
taatttttgc ccagtctgca ggctgtgggg ctcctcccct tcagggatat aagcctggtc 240
cgaagctgcc ctgtcccctg cccgtcctga gcctccccga gctcccttct caccctcacc 300
atggccaagg gattctacat ttccaaggcc ctgggcatcc tgggcatcct cctcggcgtg 360
gcggccgtgg ccaccatcat cgctctgtct gtggtgtacg cccaggagaa gaacaagaat 420
gccgagcatg tcccccaggc ccccacgtcg cccaccatca ccaccacagc cgccatcacc 480
ttggaccaga gcaagccgtg gaaccggtac cgcctaccca caacgctgtt gcctgattcc 540
tacaacgtga cgctgagacc ctacctcact cccaacgcgg atggcctgta catcttcaag 600
ggcaaaagca tcgtccgctt catctgccag gagcccaccg atgtcatcat catccatagc 660
aagaagctca actacaccac ccaggggcac atggtggtcc tgcggggcgt gggggactcc 720
caggtcccag agatcgacag gactgagctg gtagagctca ctgagtacct ggtggtccac 780
ctcaagggct cgctgcagcc cggccacatg tacgagatgg agagtgaatt ccagggggaa 840
cttgccgacg acctggcagg cttctaccgc agcgagtaca tggagggcaa cgtcaaaaag 900
taagtcaggt gggggcacac cctagatgct gaggcagagc tggatcctgg gggccaagga 960
agggcttgga ttcgggacct tggaaccttc tggagacttt ggctggcccg tcgctccatc 1020
cgcagctctg gtagagaagc tatctagaca atcagccctt tcccggagag cccccctaac 1080
cttagggagt caggggtgag tgatccaagt gcccccttgg gtagaaagga aaacaggctc 1140
tgaggacaga aatttgccca aggtctccca gctaattcag gggtggagcc tgcccggact 1200
ttg 1203
<210> 33
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 33
gaggatgccc aggatgccca 20
<210> 34
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 34
caccgaggat gcccaggatg ccca 24
<210> 35
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 35
aaactgggca tcctgggcat cctc 24
<210> 36
<211> 100
<212> RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 36
gaggaugccc aggaugccca guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100

Claims (4)

1. A CRISPR/Cas9 system for editing four genes of pig MSTN, SST, CD163 and pAPN is characterized by comprising a Cas9 expression vector, a gRNA expression vector aiming at pig MSTN gene, a gRNA expression vector aiming at pig SST gene, a gRNA expression vector aiming at pig CD163 gene and a gRNA expression vector aiming at pig pAPN gene; the complete sequence of the Cas9 expression vector plasmid is shown as SEQ ID NO. 2; the gRNA expression vector for the pig MSTN gene expresses gRNA shown by SEQ ID NO.18, the target point of the gRNA is shown by SEQ ID NO.15, and the expression vector is obtained by inserting double chains formed by annealing single-chain DNA shown by SEQ ID NO.16 and SEQ ID NO.17 into a vector skeleton pKG-U6 gRNA; the gRNA expression vector for the pig SST gene expresses gRNA shown by SEQ ID No.24, the target point of the gRNA is shown by SEQ ID No.21, and the expression vector is obtained by inserting double chains formed by annealing single-chain DNA shown by SEQ ID No.22 and SEQ ID No.23 into a vector skeleton pKG-U6 gRNA; the gRNA expression vector for the pig CD163 gene expresses gRNA shown in SEQ ID No.30, the target point of the gRNA is shown in SEQ ID No.27, and the expression vector is obtained by inserting a double chain formed by annealing single-chain DNA shown in SEQ ID No.28 and SEQ ID No.29 into a vector skeleton pKG-U6 gRNA; the gRNA expression vector for the pAPN gene of the pig expresses gRNA shown by SEQ ID NO.36, the target point of the gRNA is shown by SEQ ID NO.33, and the expression vector is obtained by inserting double chains formed by annealing single-chain DNA shown by SEQ ID NO.34 and SEQ ID NO.35 into a vector skeleton pKG-U6 gRNA; the plasmid complete sequence of vector framework pKG-U6gRNA is shown in SEQ ID NO. 3.
2. Use of the CRISPR/Cas9 system of claim 1 to construct recombinant cells of pigs knocked out for four genes of MSTN, SST, CD163 and pAPN.
3. A porcine recombinant cell with MSTN, SST, CD163 and pAPN gene knockout, which is obtained by verifying the primary porcine fibroblast cotransfected by the CRISPR/Cas9 system of claim 1.
4. Use of the recombinant cell of claim 3 in the construction of cloned pigs with four knockout genes MSTN, SST, CD163 and pAPN.
CN202110197015.1A 2021-02-22 2021-02-22 Method for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage and rapid growth and capable of resisting blue ear diseases and serial diarrhea diseases and application of donor cells Active CN112779292B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110197015.1A CN112779292B (en) 2021-02-22 2021-02-22 Method for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage and rapid growth and capable of resisting blue ear diseases and serial diarrhea diseases and application of donor cells

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110197015.1A CN112779292B (en) 2021-02-22 2021-02-22 Method for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage and rapid growth and capable of resisting blue ear diseases and serial diarrhea diseases and application of donor cells

Publications (2)

Publication Number Publication Date
CN112779292A CN112779292A (en) 2021-05-11
CN112779292B true CN112779292B (en) 2023-03-10

Family

ID=75761730

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110197015.1A Active CN112779292B (en) 2021-02-22 2021-02-22 Method for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage and rapid growth and capable of resisting blue ear diseases and serial diarrhea diseases and application of donor cells

Country Status (1)

Country Link
CN (1) CN112779292B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113403337A (en) * 2021-05-13 2021-09-17 温氏食品集团股份有限公司 Carrier system, method for preparing pig fibroblast and gene editing pig
CN113604502A (en) * 2021-08-26 2021-11-05 中国农业科学院北京畜牧兽医研究所 Gene editing system of pAPN gene 16 th exon and application thereof
CN114774468B (en) * 2022-04-20 2022-12-20 温氏食品集团股份有限公司 Allele molecular marker and anti-blue-ear-disease pig group construction method
CN116855539A (en) * 2023-07-18 2023-10-10 中农种源(深圳)科技有限公司 Pig breeding method capable of simultaneously knocking out CD163, pAPN and MSTN genes and improving disease resistance and quality

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105950625A (en) * 2016-06-03 2016-09-21 中国农业科学院北京畜牧兽医研究所 sgRNA pair for conducting specific recognition on pig MSTN gene promoter and encoding DNA and application thereof
CN107177595A (en) * 2017-06-07 2017-09-19 浙江大学 Targeting sgRNA, modification carrier for pig CD163 gene editings and its preparation method and application
CN109628494A (en) * 2019-01-23 2019-04-16 华南农业大学 Coronavirus resistance clone pig and preparation method thereof
CN111778252A (en) * 2020-07-17 2020-10-16 湖北省农业科学院畜牧兽医研究所 SgRNA for targeted knockout of SST gene, CRISPR/Cas9 system and application thereof

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11240997B2 (en) * 2019-04-09 2022-02-08 Shandong Landsee Genetics Co., Ltd. Method for preparing porcine fibroblasts with both CD163 gene and CD13 gene being knocked-out

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105950625A (en) * 2016-06-03 2016-09-21 中国农业科学院北京畜牧兽医研究所 sgRNA pair for conducting specific recognition on pig MSTN gene promoter and encoding DNA and application thereof
CN107177595A (en) * 2017-06-07 2017-09-19 浙江大学 Targeting sgRNA, modification carrier for pig CD163 gene editings and its preparation method and application
CN109628494A (en) * 2019-01-23 2019-04-16 华南农业大学 Coronavirus resistance clone pig and preparation method thereof
CN111778252A (en) * 2020-07-17 2020-10-16 湖北省农业科学院畜牧兽医研究所 SgRNA for targeted knockout of SST gene, CRISPR/Cas9 system and application thereof

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
A Non-integrating Lentiviral Approach Overcomes Cas9-Induced Immune Rejection to Establish an Immunocompetent Metastatic Renal Cancer Model;Hu等;《Mol Ther Methods Clin Dev》;20180630;第9卷;第204页图1 *
analysis of an aggrecan knockout cell line.《Bone》.2014,第69卷 *
CRISPR/Cas9基因编辑技术在养猪业中的应用;陈辉;《猪业科学》;20181231;摘要,第119页左栏第2段,图1 *
Yang等.CRISPR/Cas9 mediated generation of stable chondrocyte cell lines with targeted gene knockouts *

Also Published As

Publication number Publication date
CN112779292A (en) 2021-05-11

Similar Documents

Publication Publication Date Title
CN112779292B (en) Method for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage and rapid growth and capable of resisting blue ear diseases and serial diarrhea diseases and application of donor cells
CN112779291B (en) Method for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage, fast growth, high reproductive capacity and resistance to series epidemic diseases and application thereof
CN112522260B (en) CRISPR system and application thereof in preparing TTN gene mutation dilated cardiomyopathy clone pig nuclear donor cells
CN112877362A (en) Gene editing system for constructing high-quality porcine nuclear transplantation donor cells with high fertility and capability of resisting porcine reproductive and respiratory syndrome and serial diarrhea diseases and application of gene editing system
CN113046388B (en) CRISPR system for constructing atherosclerosis pig nuclear transfer donor cells with double genes in combined knockout mode and application of CRISPR system
CN112522264B (en) CRISPR/Cas9 system causing congenital deafness and application thereof in preparation of model pig nuclear donor cells
CN112522313B (en) CRISPR/Cas9 system for constructing depression cloned pig nuclear donor cells with TPH2 gene mutation
CN112522261B (en) CRISPR system for preparing LMNA gene mutation dilated cardiomyopathy clone pig nuclear donor cell and application thereof
CN114958762B (en) Method for constructing nerve tissue specific overexpression humanized SNCA parkinsonism model pig and application
CN112813101B (en) Gene editing system for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage and rapid growth and application thereof
CN112608941B (en) CRISPR system for constructing obese pig nuclear transplantation donor cells with MC4R gene mutation and application of CRISPR system
CN112795566B (en) OPG gene editing system for constructing osteoporosis clone pig nuclear donor cell line and application thereof
CN112522311B (en) CRISPR system for ADCY3 gene editing and application thereof in construction of obese pig nuclear transfer donor cells
CN113584078B (en) CRISPR system for double-target gene editing and application thereof in construction of depressive pig nuclear transfer donor cells
CN112899306B (en) CRISPR system and application thereof in construction of GABRG2 gene mutation cloned pig nuclear donor cells
CN112575033B (en) CRISPR system and application thereof in construction of SCN1A gene mutated epileptic encephalopathy clone pig nuclear donor cell
CN112680444B (en) CRISPR system for OCA2 gene mutation and application thereof in construction of albino clone pig nuclear donor cells
CN112522258B (en) Recombinant cell with IL2RG gene and ADA gene knocked out in combined mode and application of recombinant cell in preparation of immunodeficiency pig model
CN112522202B (en) Method for preparing ADDI four-gene combined knockout severe immunodeficiency swine-derived recombinant cell and special kit thereof
CN112522309B (en) Severe immunodeficiency pig source recombinant cell, preparation method and kit thereof
CN112680453B (en) CRISPR system and application thereof in construction of STXBP1 mutant epileptic encephalopathy clone pig nuclear donor cell
CN112522257B (en) System for preparing severe immunodeficiency pig source recombinant cells with RRIP four genes knocked out in combined mode
CN112522256B (en) CRISPR/Cas9 system and application thereof in construction of dystrophin gene-deficient porcine recombinant cells
CN112522255B (en) CRISPR/Cas9 system and application thereof in construction of porcine recombinant cell with insulin receptor substrate gene defect
CN112877363A (en) Gene editing system for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage, fast growth and high reproductive capacity and application thereof

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20221107

Address after: 211306 3rd floor, Nanjing Health Industry Research Institute, building 6, No. 17, Huashan Road, economic development zone, Gaochun District, Nanjing City, Jiangsu Province

Applicant after: Nanjing Qizhen Genetic Engineering Co.,Ltd.

Address before: Room 5681, 5th floor, No.20 xidoumen Road, Gudang street, Xihu District, Hangzhou City, Zhejiang Province, 310058

Applicant before: Hangzhou hexinyuan Biotechnology Co.,Ltd.

GR01 Patent grant
GR01 Patent grant