KR100880509B1 - A Novel vector and expression cell line for mass production of recombinant protein and a process of producing recombinant protein using same - Google Patents

A Novel vector and expression cell line for mass production of recombinant protein and a process of producing recombinant protein using same Download PDF

Info

Publication number
KR100880509B1
KR100880509B1 KR1020060100507A KR20060100507A KR100880509B1 KR 100880509 B1 KR100880509 B1 KR 100880509B1 KR 1020060100507 A KR1020060100507 A KR 1020060100507A KR 20060100507 A KR20060100507 A KR 20060100507A KR 100880509 B1 KR100880509 B1 KR 100880509B1
Authority
KR
South Korea
Prior art keywords
cell line
recombinant protein
expression vector
expression
gene
Prior art date
Application number
KR1020060100507A
Other languages
Korean (ko)
Other versions
KR20080034364A (en
Inventor
최인영
김창환
이현지
박성희
권세창
이관순
Original Assignee
한미약품 주식회사
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to KR1020060100507A priority Critical patent/KR100880509B1/en
Application filed by 한미약품 주식회사 filed Critical 한미약품 주식회사
Priority to CN2007800384033A priority patent/CN101605899B/en
Priority to EP07833373A priority patent/EP2087118B1/en
Priority to US12/444,567 priority patent/US8957196B2/en
Priority to PCT/KR2007/005064 priority patent/WO2008048037A1/en
Priority to AT07833373T priority patent/ATE512229T1/en
Priority to ES07833373T priority patent/ES2366061T3/en
Priority to JP2009533241A priority patent/JP5076084B2/en
Publication of KR20080034364A publication Critical patent/KR20080034364A/en
Application granted granted Critical
Publication of KR100880509B1 publication Critical patent/KR100880509B1/en
Priority to HK10101256.5A priority patent/HK1137481A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/0012Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7)
    • C12N9/0026Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7) acting on CH-NH groups of donors (1.5)
    • C12N9/0028Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7) acting on CH-NH groups of donors (1.5) with NAD or NADP as acceptor (1.5.1)
    • C12N9/003Dihydrofolate reductase [DHFR] (1.5.1.3)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2510/00Genetically modified cells

Landscapes

  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biomedical Technology (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Plant Pathology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Medicinal Chemistry (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Peptides Or Proteins (AREA)
  • Enzymes And Modification Thereof (AREA)

Abstract

Disclosed herein is an inducible high-expression cassette comprising a dihydrofolate reductase (DHFR) promoter from which GC-rich repeat sequences are partially or entirely removed, the cassette capable of more effectively improving a gene amplification system. Also disclosed are an expression vector comprising the inducible expression cassette and optionally a gene encoding a recombinant protein of interest, an animal cell line transformed with the expression vector, and a method of mass producing and purifying a recombinant protein by culturing the transformant. The present invention enables the shortening of the time required to establish a cell line producing a recombinant protein of interest at high levels using a low concentration of a DHFR inhibitor, thereby allowing more effective production of the recombinant protein.

Description

재조합 단백질의 대량 생산을 위한 신규한 벡터, 발현 세포주 및 이를 이용한 재조합 단백질의 생산 방법{A Novel vector and expression cell line for mass production of recombinant protein and a process of producing recombinant protein using same}A novel vector and expression cell line for mass production of recombinant protein and a method for producing recombinant protein using the same

도1은 GC-rich 서열이 순차적으로 제거된 발현벡터 X1GC/dhfr, X3GC/dhfr 및 X6GC/dhfr의 클로닝 과정 모식도이다.1 is a schematic diagram of a cloning process of expression vectors X1GC / dhfr, X3GC / dhfr and X6GC / dhfr from which GC-rich sequences are sequentially removed.

도2는 GC-rich 서열이 완전히 제거된 발현벡터 X0GC/dhfr의 클로닝 과정 모식도이다.2 is a schematic diagram of the cloning process of the expression vector X0GC / dhfr from which the GC-rich sequence was completely removed.

도3은 상기 도1과 도2의 발현벡터에 gEPO 유전자를 클로닝하는 과정에 대한 모식도이다.Figure 3 is a schematic diagram of a process for cloning the gEPO gene in the expression vector of Figures 1 and 2.

도4 내지 도7은 본 발명의 발현벡터로 형질전환된 각각의 세포주들의 EPO 발현량을 이라이자(ELISA)로 측정한 것이다.4 to 7 are measured by ELISA of the amount of EPO expression of each cell line transformed with the expression vector of the present invention.

도8은 증폭된 디하이드로폴레이트 유전자의 발현량을 웨스턴 블로팅으로 나타낸 결과이다.8 shows the result of Western blotting of the expression level of the amplified dihydrofolate gene.

도9는 실시예 6에서 선별된 단클론 세포주 X0GC/GEPO9647(DXB11)를 대량 배양하여 총 9회에 걸쳐 인간 에리트로포이에틴 발현 상등액을 회수하여 간접 이라이 자법으로 발현량을 측정한 결과이다.9 is a result of measuring the expression level by indirect ERISA method by recovering the human erythropoietin expression supernatant over a total of nine times by culturing the monoclonal cell line X0GC / GEPO9647 (DXB11) selected in Example 6.

도10은 실시예 8에서 정제된 재조합 에리트로포이에틴의 SDS-PAGE 및 웨스턴 블로팅 분석 결과이다.Figure 10 shows the results of SDS-PAGE and Western blotting of the recombinant erythropoietin purified in Example 8.

도11는 실시예 4에서 정제된 재조합 에리트로포이에틴의 등전 집속 분석 결과이다.11 is an isoelectric focusing analysis result of the recombinant erythropoietin purified in Example 4.

도12은 무혈청 배지에 완전 적응된 X1GC/GEPO9629(DG44), X0GC/GEPO9603(DG44)를 배양하여 발현된 에리트로포이에틴의 등전집속 분석 결과이다.12 is an isoelectric focusing result of erythropoietin expressed by culturing X1GC / GEPO9629 (DG44) and X0GC / GEPO9603 (DG44) fully adapted to serum-free medium.

본 발명은 재조합 단백질을 대량 생산하는 벡터 및 이의 발현 세포주, 그리고 이를 이용한 재조합 단백질을 생산 및 정제하는 방법에 관한 것이다. 보다 상세하게는 디하이드로폴레이트 환원효소(Dihyrofolate redutase)의 전사조절 부위인 프로모터를 인위적으로 약화시켜 유전자의 증폭 효율을 크게 개선시킬 수 있는 벡터, 이를 이용하여 형질전환된 동물 세포주를 제작하고, 이러한 동물 세포주를 이용하여 단백질을 발현시키고 발현된 단백질 중에서 고당쇄화형만을 정제하는 방법에 관한 것이다.The present invention relates to a vector for mass production of a recombinant protein and an expression cell line thereof, and a method for producing and purifying the recombinant protein using the same. More specifically, a vector, which can greatly improve the amplification efficiency of a gene by artificially weakening a promoter, a transcriptional regulatory region of Dihyrofolate redutase, can be used to prepare a transformed animal cell line. The present invention relates to a method of expressing a protein using an animal cell line and purifying only a high glycated chain form from the expressed protein.

재조합 단백질을 대량으로 생산하기 위해서는 다양한 벡터와 숙주들이 이용 되고 있으며 일반적으로 대장균을 이용한 방법 등이 널리 사용되고 있으나, 당쇄화가 필수적이거나 복잡한 구조를 갖는 단백질의 경우에는 이용이 제한적인 단점을 지니고 있다. 이러한 문제점을 해결하기 위해서 동물세포주를 이용하거나, 효모균, 형질전환 동물, 형질전환 식물 등을 이용하기도 한다. Various vectors and hosts have been used to produce a large amount of recombinant protein, and generally, a method using Escherichia coli is widely used. However, in the case of a protein having glycosylation or a complex structure, its use is limited. To solve these problems, animal cell lines may be used, or yeast, transgenic animals, or transgenic plants may be used.

효모균의 경우 대량 생산이 용이한 장점을 지니고 있으나 당쇄화의 정도와 패턴이 인간과는 매우 상이하여 많은 면역원성을 야기하는 것으로 알려져 있다(Hermeling et al,. Pharm. Res. 21(6) : 897-903 (2004)). 형질전환동물의 경우에는 동물 자체의 관리 및 병원균에 대한 오염 가능성에 관한 문제로 아직 상용화에 이르지는 못하였다. Although yeast has the advantage of easy mass production, it is known that the degree and pattern of glycosylation is very different from humans, causing many immunogenicity (Hermeling et al, Pharm. Res. 21 (6): 897). -903 (2004)). In the case of transgenic animals, it has not yet been commercialized due to problems related to the management of the animal itself and the possibility of contamination with pathogens.

반면에 동물세포주를 이용하는 방법은 생산 단가가 높고 세포주 제작에 많은 시간과 비용이 들지만, 생산되는 재조합 단백질이 인간의 형태와 매우 유사하며 안정적인 생산과 관리가 가능하기 때문에 재조합 단백질의 생산에 가장 널리 이용되고 있다. 그러나, 동물세포주에서 재조합 단백질을 생산하기 위해 형질전환을 시도함에 있어 대부분 발현량이 적어 높은 생산성을 나타내기가 어렵다. 따라서, 이러한 문제들을 해결하기 위해 세포주 내 유전자 증폭법을 이용하는데, 가장 대표적인 것이 디하이드로폴레이트 환원효소 또는 글루타민 합성효소를 이용한 유전자 증폭법으로서, 재조합 단백질의 생산량을 크게 증가시킬 수 있어 널리 이용되고 있다. 이러한 유전자 증폭 기술은 생산성을 개선시킬 수 있는 장점이 있지만 고농도의 메토트렉세이트(이하, MTX라 약칭)를 사용하기 위하여 여러 단계의 유전자 증폭 과정을 거침으로써 제작에 많은 시간이 소요되고, 세포주의 장기간 계대 배양에 의해 유전자가 소실되고 발현량이 안정되지 못하는 등의 문제점을 지니고 있는 것으로 알려져 있다.On the other hand, the method of using animal cell line is the most widely used in the production of recombinant protein because the production cost is high and it takes much time and cost to manufacture the cell line, because the recombinant protein produced is very similar to human form and stable production and management are possible. It is becoming. However, in attempting transformation to produce recombinant protein in animal cell lines, it is difficult to show high productivity due to the small amount of expression. Therefore, in order to solve these problems, a gene amplification method using a cell line is used. The most representative one is a gene amplification method using dihydrofolate reductase or glutamine synthetase, which can greatly increase the yield of recombinant protein and is widely used. have. This gene amplification technology has the advantage of improving productivity, but it takes a lot of time to manufacture by going through several stages of gene amplification to use high concentration of methotrexate (hereinafter, abbreviated as MTX), and long-term passage of cell lines This is known to have problems such as loss of genes and unstable expression.

이러한 단점을 개선하기 위하여 디하이드로폴레이트 유전자 프로모터로서 SV40 프로모터의 일부 부위를 제거하여 만든 프로모터를 사용하거나(대한민국 등록 특허 제0162021호), 염기서열에 돌연변이를 도입하는 방법으로 CMV(Cytomegalo virus) 프로모터의 메틸화 DNA 결합 단백질에 대한 친화성을 조절하는 방법(대한민국 등록 특허 제0493703호) 등이 보고되어 있다. 경우에 따라, 디하이드로폴레이트 유전자의 단독 프로모터를 사용하지 않고 면역글로불린 중쇄결합단백질의 5‘-비코딩부위를 IRES(Internal ribosome entry site)로 사용하여 재조합 단백질의 전사조절인자에 의해서 조절되게 하는 방법(대한민국 등록 특허 제0184778호) 등이 보고되어 있다.In order to remedy this drawback, as a dihydrofolate gene promoter, a promoter made by removing a part of the SV40 promoter may be used (Korean Patent No. 0162021), or a mutation may be introduced into a sequencing Cytomegalo virus (CMV) promoter. And a method for controlling affinity for methylated DNA binding protein (Korean Patent No. 0493703) and the like have been reported. Occasionally, the 5'-noncoding region of an immunoglobulin heavy chain binding protein is used as an internal ribosome entry site (IRES) without the use of a single promoter of the dihydrofolate gene, allowing it to be regulated by transcriptional regulators of recombinant proteins. Method (Korean Registered Patent No. 0184778) and the like have been reported.

대한민국 등록 특허 제0162021호의 경우, SV40 전사조절인자의 128 - 270 부위를 제거하여 프로모터의 활성을 조절하고자 하였으나, 제거된 서열의 역할이 명확하지 않으며 나머지 잔존하는 서열의 역할에 대해서도 언급 하지 않고 있다. 특히 대조군의 설정이 없는 조건에서 20nM 이상의 농도에서 증폭이 더 이상 이루어지지 않음을 보여주지 않고 있기 때문에 기존의 디하이드로폴레이트 프로모터 활성에 어떠한 변화가 일어났음은 물론, 낮은 농도의 MTX를 사용하여 최적의 발현을 나타내고 있는가에 대한 근거를 제시하지 못하고 있다. 대한민국 등록 특허 제0493703호의 경우 CMV 프로모터의 메틸화 DNA 결합단백질에 대한 결합 서열을 변이시켜서 효율적으로 유전자를 증폭시키고자 하였으나, 증폭이 발현에 미치는 효과는 여타 방법에 비해 미약하다. 또한 대한민국 등록특허 제0184778호와 같이 단독의 프로모터를 사용하지 않고 IRES(Internal ribosome entry site) 서열을 이용하여 발현하는 경우에도, 유전자 증폭에 이용되는 MTX의 농도가 수 μM에 이를 뿐이고, 발현양의 증가도 30배 수준 정도에 머물렀음을 알 수 있다. 이와 같이 일반적인 프로모터의 변형만으로는 디하이드로폴레이트 유전자의 증폭을 미루어 예상할 수 없으며, 그 적용가능성은 실질적으로 다양한 종류의 프로모터를 사용하여 유전자 증폭을 실시하였을 경우에만 판단할 수 있다고 할 수 있다. In the case of Korean Patent Registration No. 0162021, the activity of the promoter was controlled by removing 128-270 sites of the SV40 transcriptional regulator, but the role of the removed sequence is not clear and does not mention the role of the remaining sequence. In particular, since no amplification is no longer performed at concentrations of 20 nM or more under the condition of no control setting, any change in the existing dihydrofolate promoter activity has occurred, and it is optimal to use a low concentration of MTX. Evidence does not provide evidence of the expression of. Korean Patent No. 0493703 attempts to efficiently amplify a gene by changing the binding sequence for the methylated DNA binding protein of the CMV promoter, but the effect of the amplification on the expression is weak compared to other methods. In addition, even when expressed using an internal ribosome entry site (IRS) sequence without using a single promoter as in Korean Patent No. 0148778, the concentration of MTX used for gene amplification is only a few μM, The increase also remained at about 30 times. As described above, the modification of the general promoter cannot be expected due to the amplification of the dihydrofolate gene, and the applicability thereof can be judged only when the gene amplification is performed using various kinds of promoters.

본 발명의 재조합 단백질로서 예시하고 있는 인간 에리트로포이에틴은 순수 분자량은 약 18000 Da 정도이나 당쇄화 되는 경우에는 약 34000 Da 정도의 분자량을 갖는다. 이들은 빈혈이나 대기 중의 산소분압이 낮거나 또는 출혈 등이 발생하는 경우 신장에서 합성되어 적혈구의 생산을 촉진하고 항상성을 유지하는 역할을 한다. 정상인의 생체 내에서는 약 10 내지 20 mIU/mL의 농도로 존재하며 신장에 이상이 발생하는 경우 극심한 빈혈을 일으킨다(Jacobson, et al., Nature, 179: 633-634 (1957)). 따라서, 에리트로포이에틴은 만성신부전증 및 다양한 원인에 의한 빈혈 등의 치료제로 사용되고 있다. 과거에는 동물의 혈장이나, 정상인보다 높은 농도의 에리트로포이에틴이 생성되는 재생불량성 빈혈 등의 질환이 있는 환자의 혈액이나 뇨 등에서 분리하여 사용하였으나 이들은 안정성이 낮고 양이 부족하며, 정상인의 뇨에서 나오는 에리트로포이에틴의 경우에는 그 농도가 낮을 뿐 아니라 에리트로포이에틴 활성 저해제가 있어 이를 제거하기 위한 고순도의 정제가 요구된다는 단점이 있었다(미합중국 특허 제4397840, 4303650, 3865810 참조). 이러한 방법으로는 순수한 고수율의 에리트로포이에틴을 얻기 어렵기 때문에 유전자 재조합법을 이용한 방법들이 개발되었다. 그러나, 에리트로포이에틴은 생체 내 활성에 당쇄화가 필수적이어서, 에리트로포이에틴 유전자를 클로닝하여 대장균이나 효모에서 발현시키는 경우에는 당쇄화가 제대로 일어나지 않아 에리트로포이에틴 생활성이 나타나지 않는 문제점이 있다. 따라서, 에리트로포이에틴의 생산에는 재조합 동물세포주를 이용한 방법이 필수적으로 요구된다. 재조합 동물세포주를 이용한 방법이 필수적으로 요구된다. 재조합 동물세포주를 이용하여 에리트로포이에틴을 생산하는 경우에는 상기에 언급한 디하이드로폴레이트 환원효소를 사용한 유전자 증폭 기술이 일반적으로 사용된다(Malik et al. DNA and Cell Bio., 6:453-459 (1992)).The human erythropoietin exemplified as the recombinant protein of the present invention has a molecular weight of about 18000 Da when the pure molecular weight is about 18000 Da, or about 34000 Da when glycosylated. They are synthesized in the kidneys when anemia, low oxygen partial pressure in the air, or bleeding occurs, thereby promoting red blood cell production and maintaining homeostasis. It is present at a concentration of about 10-20 mIU / mL in normal humans and causes severe anemia when abnormalities occur in the kidneys (Jacobson, et al., Nature, 179: 633-634 (1957)). Therefore, erythropoietin has been used as a therapeutic agent for chronic kidney failure and anemia due to various causes. In the past, they were separated from the blood or urine of patients with diseases such as plasma of animals or aplastic anemia that produces higher concentrations of erythropoietin than normal ones, but they are low in stability and lacking in quantity. In the case of erythropoietin, the concentration of the erythropoietin is low and there is an erythropoietin activity inhibitor, which requires a high-purity purification to remove it (see US Patent No. 4397840, 4303650, 3865810). Since it is difficult to obtain pure high yield of erythropoietin by this method, methods using genetic recombination have been developed. However, erythropoietin has a problem in that glycosylation is essential for in vivo activity, and thus, when the erythropoietin gene is cloned and expressed in Escherichia coli or yeast, glycosylation does not occur properly and erythropoietin bioactivity does not appear. Therefore, production of erythropoietin requires a method using a recombinant animal cell line. Methods using recombinant animal cell lines are indispensable. In the production of erythropoietin using recombinant animal cell lines, gene amplification techniques using the above-mentioned dihydrofolate reductase are generally used (Malik et al. DNA and Cell Bio., 6: 453-459). (1992).

이러한 사실을 기초로, 본 발명자는 디하이드로폴레이트 구조 유전자와 연결된 프로모터 부위에 존재하는 6개의 GC-rich 반복서열의 역할이 전사활성에 중요하다는 Michel Fromm과 Paul Berg의 결과(J. Mol. Appl. Genet. 1983. 2(1):127-135)를 참조하여, 순차적으로 GC-rich 반복서열을 제거함으로써 보다 효율적으로 유전자 증폭과 발현이 이루어지는 동물세포용 발현 벡터를 제작하여 본 발명을 완성하였다.Based on this fact, the inventors of Michel Fromm and Paul Berg (J. Mol. Appl) found that the role of six GC-rich repeats in the promoter region linked to the dihydrofolate structural gene is important for transcriptional activity. Genet. 1983. 2 (1): 127-135), by sequential removal of the GC-rich repeat sequence to produce an animal cell expression vector that gene amplification and expression more efficiently to complete the present invention. .

따라서, 본 발명의 목적은 동물세포 내에서 재조합 단백질을 생산함에 있어 보다 효과적으로 유전자를 증폭시킬 수 있는 발현 벡터를 제공하는 것이다. Accordingly, an object of the present invention is to provide an expression vector capable of amplifying genes more effectively in producing recombinant proteins in animal cells.

본 발명의 다른 목적은 상기 발현 벡터에 의해 형질전환된 동물세포주를 제공하는 것이다. Another object of the present invention is to provide an animal cell line transformed with the expression vector.

본 발명의 또 다른 목적은 상기 동물세포주를 사용하여 재조합 단백질을 대량 생산하는 방법을 제공하는 데 있다. Another object of the present invention to provide a method for mass production of recombinant protein using the animal cell line.

상기 목적을 달성하기 위하여, 본 발명은 디하이드로폴레이트 환원효소의 GC-리치(rich) 반복서열 결실 프로모터를 포함하는 동물세포주용 발현 벡터를 제공한다.In order to achieve the above object, the present invention provides an expression vector for an animal cell line comprising a GC-rich repeat deletion promoter of dihydrofolate reductase.

상기 언급한 바와 같이, 동물 세포주에서의 재조합 단백질의 고발현을 유도하기 위한 한 방법으로서 디하이드로폴레이트 환원효소에 의한 유전자 증폭법이 널리 사용되어 왔으나, 종래 기술에 따르면 디하이드로폴레이트 환원효소 저해제를 고농도로 오랜 기간 사용함에 따른 세포주 안정화 및 시간과 비용상의 문제가 있었던 바, 본 발명은 이러한 디하이드로폴레이트 환원효소의 프로모터 중 GC-리치 반복서열을 일부 또는 전부 결실시켜 변이시킨 디하이드로폴레이트 환원효소 유전자를 이용한 발현 벡터가 보다 짧은 시간에 보다 낮은 디하이드로폴레이트 환원효소 저해제를 사용하는 경우에도 보다 높은 효율로 디하이드로폴레이트 환원효소 및/또는 상기 벡터에 포함된 목적 재조합 유전자의 발현을 유도할 수 있음을 발견하였다. As mentioned above, gene amplification by dihydrofolate reductase has been widely used as a method for inducing high expression of recombinant proteins in animal cell lines, but according to the prior art, dihydrofolate reductase inhibitors. Cell line stabilization and time and cost problems caused by using a high concentration for a long time, the present invention is a dihydrofolate mutated by deleting part or all of the GC-rich repeat sequence of the promoter of such dihydrofolate reductase The expression vector using the reductase gene can express the expression of the dihydrofolate reductase and / or the target recombinant gene included in the vector with higher efficiency even when a lower dihydrofolate reductase inhibitor is used in a shorter time. It was found to be inducible.

여기서, GC-리치 반복서열이란, 디하이드로폴레이트 환원효소의 전사조절인자인 프로모터에 포함된 CCGCCC 반복서열을 의미하는 것으로, 이 반복서열을 전부 또는 일부를 결실, 변이 등에 의한 방법으로 인위적으로 결손시키면 디하이드로폴레이트 환원효소의 발현은 최소한으로 이루어지게 된다. 이 때, 발현이 최소로 유지된 상태에서 디하이드로폴레이트 저해제를 첨가하는 경우, 세포는 생존을 위하여 더 많은 수의 디하이드로폴레이트 환원효소 유전자를 증폭하게 되고, 따라서, 상기 디하이드로폴레이트 환원효소 유전자를 포함하는 발현 벡터 내에 포함된 목적 재조합 유전자 또한 동시에 증폭되어 고발현되는 것으로 관찰되었다. Here, the GC-rich repeat sequence refers to a CCGCCC repeat sequence included in a promoter which is a transcription regulator of dihydrofolate reductase, and artificially deleted all or part of the repeat sequence by a method such as deletion or mutation. This results in minimal expression of dihydrofolate reductase. At this time, when the dihydrofolate inhibitor is added while the expression is kept to a minimum, the cells amplify a larger number of dihydrofolate reductase genes for survival, and thus, the dihydrofolate reduction It was observed that the target recombinant gene contained in the expression vector containing the enzyme gene was also amplified and highly expressed at the same time.

따라서, 본 발명의 구체적인 일 양태에서는, CCGCCC 반복서열이 하나 이상 제거된 프로모터를 포함하는 디하드로폴레이트 환원효소 유전자의 염기서열을 포함하는 고발현 유도 카세트를 제공한다. 바람직하게, 상기 고발현 유도 카세트는 6 개 이하의 CCGCCC 반복서열을 포함하는 디하이드로폴레이트 환원효소의 프로모터를 포함하며, 더욱 바람직하게는 3 개 이하의 CCGCCCC 반복 서열을 포함하는 프로모터를 포함하고, 특히 바람직하게는 1 개 이하의 CCGCCC 반복 서열을 포함하는 프로모터를 포함하며, 더욱 특히 바람직하게는 CCGCCC 반복 서열 전부가 제거된 프로모터를 포함한다.Accordingly, in one specific aspect of the present invention, there is provided a high expression induction cassette comprising a nucleotide sequence of the dihydrofolate reductase gene comprising a promoter from which one or more CCGCCC repeats have been removed. Preferably, the high expression induction cassette comprises a promoter of dihydrofolate reductase comprising 6 or less CCGCCC repeats, more preferably a promoter comprising 3 or less CCGCCCC repeats, Particularly preferably it comprises a promoter comprising at most one CCGCCC repeat sequence, more particularly preferably a promoter from which all of the CCGCCC repeat sequence has been removed.

이들 CCGCCC 반복서열의 제거는, 당 업계에 널리 알려져 있는 유전자 재조합 기술에 따른 염기서열의 치환이나 결실 등의 방법에 의해 이루어질 수 있으며, 본 발명의 구체적 실시예에서는 CCGCCC 반복서열을 포함하는 염기서열의 일부를 결실시키는 방법에 의하여 프로모터 내 GC-리치 서열의 일부 또는 전부를 제거하였다. Removal of these CCGCCC repeat sequences may be performed by a method such as substitution or deletion of nucleotide sequences according to gene recombination techniques well known in the art, and in a specific embodiment of the present invention, Some or all of the GC-rich sequences in the promoter were removed by a method of deleting some.

본 발명의 또 다른 양태에서는, 상기 고발현 유도 카세트를 포함하는 발현 벡터를 제공한다. In another aspect of the present invention, an expression vector comprising the high expression induction cassette is provided.

용어 ‘벡터’는 숙주 세포에서 목적 유전자를 발현시키기 위한 수단으로 플라스미드 벡터, 코즈미드 벡터, 박테리오파아지 벡터 및 아데노바이러스 벡터, 레트로바이러스 벡터, 아데노-연관 바이러스 벡터 같은 바이러스 벡터 등을 포함하며, 바람직하게는 플라스미드 벡터이다. The term 'vector' includes means for expressing a gene of interest in a host cell, including plasmid vectors, cozmid vectors, bacteriophage vectors and adenovirus vectors, retroviral vectors, viral vectors such as adeno-associated viral vectors, and the like. Is a plasmid vector.

상기 발현 벡터는 바람직하게, 목적하는 재조합 단백질을 암호화하는 유전자를 더 포함할 수 있으며, 이러한 발현벡터를 발현시킴으로써 목적하는 재조합 단백질을 고효율로 발현시킬 수 있다. Preferably, the expression vector may further include a gene encoding a recombinant protein of interest, and by expressing such expression vector, the recombinant protein of interest may be expressed with high efficiency.

상기 목적 재조합 단백질이란, 일반적으로 생리활성 폴리펩타이드를 포함하는 개념으로서, 호르몬, 사이토카인, 인터루킨, 인터루킨 결합 단백질, 효소, 항체, 성장인자, 전사조절인자, 혈액인자, 백신, 구조단백질, 리간드 단백질 또는 수용체, 세포표면항원, 수용체 길항물질과 같은 다양한 단백질 및 이들의 유도체 및 유사체들을 예시할 수 있으며, 구체적으로는 인간 성장 호르몬, 인터페론류 및 인터페론 수용체류, 콜로니 자극인자, 인터루킨류, 에리스로포이에틴, 인슐린, 안지오텐신, 골 형성 성장인자, B 세포인자, T 세포인자, 신경 성장인자류, 세포 표면항원, 단일클론항체 및 바이러스 유래 백신 항원 등을 포함하며 이들에 제한되는 것이 아니다. 당업자는 현재의 기술 수준에서 디하이드로 폴레이트 환원효소에 의한 증폭 기술을 적용할 수 있는 재조합 단백질을 용이하게 선택할 수 있다. 본 발명의 바람직한 실시 양태에서, 상기 재조합 단백질은 인간 에리트로포이에틴이다. The target recombinant protein generally includes a physiologically active polypeptide, and includes hormones, cytokines, interleukins, interleukin binding proteins, enzymes, antibodies, growth factors, transcription regulators, blood factors, vaccines, structural proteins, and ligand proteins. Or various proteins such as receptors, cell surface antigens, receptor antagonists, and derivatives and analogues thereof, specifically human growth hormone, interferon and interferon receptors, colony stimulating factors, interleukins, erythropoietin, insulin , Angiotensin, bone formation growth factor, B cell factor, T cell factor, nerve growth factor, cell surface antigen, monoclonal antibody and virus derived vaccine antigen and the like. One skilled in the art can readily select recombinant proteins to which amplification techniques by dihydrofolate reductase can be applied at the current state of the art. In a preferred embodiment of the invention, said recombinant protein is human erythropoietin.

상기 목적 재조합 단백질은 상기 디하이드로폴레이트 환원효소 유전자의 프로모터에 의해 발현되거나 또는 별도의 프로모터에 의해 발현이 조절될 수 있다. 바람직하게는, 상기 목적 재조합 단백질은 별도의 프로모터에 의해 발현이 조절된다. 이러한 프로모터로는 당 업계에 널리 알려진 것으로서, 예를 들어 사이토메갈로바이러스(CMV) 프로모터, LTR 프로모터, EFα 프로모터, SV40 프로모터 및 TK 프로모터로 구성되는 군으로부터 당업자가 용이하게 선택하여 사용할 수 있을 것이며, 이들에 제한되는 것은 아니다.The recombinant protein of interest may be expressed by a promoter of the dihydrofolate reductase gene or the expression may be controlled by a separate promoter. Preferably, the recombinant protein of interest is regulated by a separate promoter. Such promoters are well known in the art and can be easily selected and used by those skilled in the art, for example, from the group consisting of cytomegalovirus (CMV) promoter, LTR promoter, EFα promoter, SV40 promoter and TK promoter. It is not limited to.

본 발명의 발현 벡터는 동물세포주에서의 고발현을 유도하기 위한 것으로서, 바람직하게 동물세포에서의 영구적인 발현을 위한 선택 표지 인자로 사용되는 동물세포주용 저항성 유전자를 더 포함할 수 있다. 상기 동물세포주용 저항성 유전자로는 통상적으로 당업계에서 사용하는 동물세포주용 저항성 유전자인 네오마이신 저항성 유전자, 제오신 저항성 유전자, 하이그로마이신 저항성 유전자 및 블라스시스틴 저항성 유전자 등이 있으며, 이들에 한정되는 것은 아니다. The expression vector of the present invention is for inducing high expression in animal cell lines, and may further comprise a resistance gene for animal cell lines, which is preferably used as a selection marker for permanent expression in animal cells. The animal cell line resistance genes include neomycin resistance genes, zeocin resistance genes, hygromycin resistance genes, and blocystine resistance genes, which are commonly used in the art, such as animal cell line resistance genes, and the like. no.

이 외에도, 본 발명의 발현 벡터는 또한 일반적인 벡터의 구성요소들, 예를 들어 복제원점 및 다중 아데닐레이션 신호와 기타 전사 조절인자 등을 더 포함할 수 있으며, 이들에 제한되지 않는다.In addition, the expression vector of the present invention may further include, but is not limited to, components of a general vector, such as origin of replication and multiple adenylation signals and other transcriptional regulators.

본 발명의 또 다른 양태에서는, 본 발명에 따른 상기 발현 벡터로 형질전환된 세포주를 제공한다.In another aspect of the invention, there is provided a cell line transformed with the expression vector according to the invention.

구체적인 한 양태로서, 본 발명은 디하이드로폴레이트 환원효소의 프로모터 중 CCGCCC 반복서열을 하나만 가지는 고발현 유도 카세트 및 CCGCCC 반복서열을 전혀 포함하지 않는 고발현 유도 카세트를 각각 제조하였으며, 이들 발현카세트를 포함하는 발현벡터에 의해 형질전환된 E.coli 세포주를 제공한다. 이들 세포주는 2006년 10월 2일자로 대한민국 대전시 유성구에 소재하는 생명공학연구원 내 유전자은행에 수탁번호 KCTC 10991 BP 및 KCTC 10992 BP 로서 각각 기탁하였다. 이들 세포주는, 원하는 목적 재조합 단백질의 고발현을 유도하기 위하여, 상기 세포주로부터 상기한 고발현 유도 카세트를 포함하는 발현벡터를 분리하여 유전자 재조합 기술 등에 의한 클로닝 방법으로, 목적 재조합 단백질을 암호화하는 유전자를 더 포함하는 발현벡터의 제작에 이용될 수 있다. In a specific embodiment, the present invention prepared a high expression induction cassette having only one CCGCCC repeat sequence and a high expression induction cassette containing no CCGCCC repeat sequence in the promoter of dihydrofolate reductase, respectively, and include these expression cassettes. To provide an E. coli cell line transformed with an expression vector. These cell lines were deposited on October 2, 2006 as Accession Nos. KCTC 10991 BP and KCTC 10992 BP, respectively, to the Gene Bank in the Biotechnology Research Institute, Yuseong-gu, Daejeon, Korea. In order to induce high expression of the desired recombinant protein, these cell lines are separated from the expression vector containing the high expression induction cassette from the cell line and cloned by a gene recombination technique or the like, thereby encoding a gene encoding the target recombinant protein. It can be used for the production of an expression vector further comprising.

본 발명의 또 다른 구체적인 양태에서는, 상기 발현 벡터는 목적하는 재조합 단백질을 암호화하는 유전자를 더 포함하는 것으로서, 이러한 발현 벡터에 의해 형질전환된 세포주 또한 제공한다. In another specific embodiment of the present invention, the expression vector further comprises a gene encoding a recombinant protein of interest, and also provides a cell line transformed by such expression vector.

바람직한 한 양태로서, 이들 목적하는 재조합 단백질은 동물세포주에서의 발현이 요구되는 것으로서, 이러한 목적에 비추어, 본 발명에서 사용 가능한 바람직한 동물세포주로는 중국 햄스터 난소(CHO : Chinese hamster ovarian cacer)세포주, 원숭이 신장 세포 7(COS7 : Monkey kidney cells) 세포주, NSO 세포주, SP2/0 세포주, W138, 어린 햄스터 신장(BHK : Baby hamster kidney) 세포주, MDCK, 골수종 세포주, HuT 78 세포 및 293 세포 등을 포함할 수 있으며, 이들에 제한되지 않는다. 당해 기술 분야에서 통상의 지식을 가지는 기술자라면 본 발명의 디하이드로폴레이트 환원 효소를 이용한 증폭 기술에 적용가능한 적절한 동물세포주를 용이 하게 선택할 수 있을 것이다. In a preferred embodiment, these desired recombinant proteins require expression in animal cell lines, and in view of these purposes, preferred animal cell lines usable in the present invention include Chinese hamster ovarian cacer (CHO) cell lines, monkeys. Renal cells 7 (COS7: Monkey kidney cells) cell line, NSO cell line, SP2 / 0 cell line, W138, baby hamster kidney (BHK) cell line, MDCK, myeloma cell line, HuT 78 cells and 293 cells, etc. Are not limited thereto. Those skilled in the art will be able to readily select suitable animal cell lines applicable to the amplification technique using the dihydrofolate reductase of the present invention.

본 발명의 구체적인 양태에서는, 상기 동물세포주로서 CHO 세포주를 사용하였으며, 보다 구체적으로는 디하이드로폴레이트 환원효소가 결핍된 중국 햄스터 난소 세포주(CHO/dhfr-)를 사용하였다. 즉, 디하이드로폴레이트 환원효소가 결핍된 CHO 세포주에 인간 재조합 에리트로포이에틴을 암호화하는 유전자를 포함하는 본 발명에 따른 발현 벡터를 형질전환하여, 100nM 이하의, 보다 바람직하게는 50nM 이하의 낮은 메토트렉세이트 농도에서도 충분한 수의 유전자가 증폭되어 생산성이 검증된 동물 세포주를 제공한다. 후술하는 실시예에 자세히 기술하는 바와 같은 이들 세포주는 2006년 10월 2일에, 상기한 대전시 유성구 소재 생명공학연구원 내 유전자 은행에 수탁번호 KCTC 10993BP, KCTC 10994BP 및 KCTC 10995BP로서 기탁하였다.In a specific embodiment of the present invention, a CHO cell line was used as the animal cell line, and more specifically, a Chinese hamster ovary cell line (CHO / dhfr-) deficient in dihydrofolate reductase was used. That is, a CHO cell line deficient in dihydrofolate reductase is transformed into an expression vector according to the present invention comprising a gene encoding human recombinant erythropoietin, so as to have a low methotrexate of 100 nM or less, more preferably 50 nM or less. Sufficient genes are amplified at concentrations to provide animal cell lines with proven productivity. These cell lines, as described in detail in Examples below, were deposited on October 2, 2006 as Accession Nos. KCTC 10993BP, KCTC 10994BP and KCTC 10995BP to the Gene Bank in Biotechnology Research Institute, Yuseong-gu, Daejeon, above.

본 발명의 또 다른 양태에서는, 본 발명에 따른 GC-리치 서열이 일부 또는 전부 제거된 프로모터를 포함하는 디하이드로폴레이트 환원효소를 암호화하는 유전자 및 목적 재조합 단백질을 암호화하는 유전자를 포함하는 본 발명의 동물세포용 발현 벡터로 동물 세포주를 형질전환하는 단계; 및 상기 형질전환된 동물 세포주를 배양하는 단계를 포함하는 재조합 단백질의 제조 방법을 제공한다. In another embodiment of the present invention, a gene encoding a dihydrofolate reductase comprising a promoter in which the GC-rich sequence according to the present invention is partially or wholly removed, and a gene encoding a recombinant protein of interest. Transforming the animal cell line with an expression vector for the animal cell; And it provides a method for producing a recombinant protein comprising the step of culturing the transformed animal cell line.

본 발명에서 동물세포주로의 “형질전환”은 핵산을 유기체, 세포, 조직 또는 기관에 도입하는 어떤 방법도 포함되며, 당 분야에서 공지된 바와 같이 사용하는 동물세포주에 따라 적합한 표준 기술을 선택하여 수행할 수 있다. 세포벽이 없 는 포유동물 세포의 경우, 인산칼슘 침전법을 사용할 수 있다(Graham et al, 1978, Virology, 52 : 456-457). 포유동물 숙주 세포로의 형질전환의 일반적인 방법 및 특징은 미합중국 특허 제4,399,216호에 기재되어 있다. 구체적으로 본 발명에서는 CHO 세포에 리포펙타민을 이용하여 재조합 단백질을 발현하는 벡터를 세포 안으로 형질전환하였다. 또한, 본 발명에서 동물세포주의 배양은 당업계에 알려진 적당한 배지와 배양조건에 따라 이루어질 수 있다. 이러한 배양과정은 당업자라면 선택되는 동물세포주에 따라 용이하게 조정하여 사용할 수 있다. 세포의 성장방식에 따라 현탁배양과 부착배양을, 배양방법에 따라 회분식과 유가식 및 연속배양식의 방법으로 구분된다. 배양에 사용되는 배지는 특정한 세포주의 요구조건을 적절하게 만족시켜야 한다.In the present invention, "transformation" into an animal cell line includes any method of introducing a nucleic acid into an organism, cell, tissue, or organ, and is performed by selecting a suitable standard technique according to the animal cell line to be used as known in the art. can do. For mammalian cells without cell walls, calcium phosphate precipitation can be used (Graham et al, 1978, Virology, 52: 456-457). General methods and features of transformation into mammalian host cells are described in US Pat. No. 4,399,216. Specifically, in the present invention, a vector expressing a recombinant protein was transformed into cells using lipofectamine in CHO cells. In addition, the culture of the animal cell line in the present invention can be made according to the appropriate medium and culture conditions known in the art. This culture process can be used by those skilled in the art can be easily adjusted according to the animal cell line selected. Depending on the cell growth method, suspension culture and adhesion culture are classified into batch, fed-batch and continuous culture according to the culture method. The medium used for culturing must adequately meet the requirements of the particular cell line.

동물세포 배양에 있어 상기 배지는 다양한 탄소원, 질소원 및 미량원소 성분을 포함한다. 사용될 수 있는 탄소원의 예에는 포도당, 자당, 유당, 과당, 말토즈, 전분, 셀룰로즈와 같은 탄수화물, 대두유, 해바라기유, 피마자유 및 코코넛유와 같은 지방, 팔미트산, 스테아린산 및 리놀레산과 같은 지방산, 글리세롤 및 에탄올과 같은 알코올 및 아세트산과 같은 유기산이 포함된다, 이들 탄소원은 단독 또는 조합되어 사용될 수 있다. 사용될 수 있는 질소원의 예에는 펩톤, 효모 추출물, 육즙, 맥아 추출물, 옥수수 침지액(CSL) 및 대두밀과 같은 유기 질소원 및 요소, 황산암모늄, 염화암모늄, 인산암모늄, 탄산암모늄 및 질산암모늄과 같은 무기질소원이 포함된다. 이들 질소원은 단독 또는 조합되어 사용될 수 있다. 그 외에, 아미노산, 비타민 및 적절한 전구체 등이 포함될 수 있다. In animal cell culture, the medium contains various carbon sources, nitrogen sources and trace element components. Examples of carbon sources that can be used include glucose, sucrose, lactose, fructose, maltose, starch, carbohydrates such as cellulose, fats such as soybean oil, sunflower oil, castor oil and coconut oil, fatty acids such as palmitic acid, stearic acid and linoleic acid, Alcohols such as glycerol and ethanol and organic acids such as acetic acid. These carbon sources may be used alone or in combination. Examples of nitrogen sources that can be used include organic nitrogen sources such as peptone, yeast extract, gravy, malt extract, corn steep liquor (CSL) and soybean wheat, mineral nitrogen sources such as urea, ammonium sulfate, ammonium chloride, ammonium phosphate, ammonium carbonate and ammonium nitrate This includes. These nitrogen sources may be used alone or in combination. In addition, amino acids, vitamins and suitable precursors may be included.

또한 상기 배지에는 메톡트렉세이트와 같은 디하이드로폴레이트 환원효소 저해제가 첨가될 수 있다. 이는 상기한 바와 같이, 본 발명의 바람직한 실시예에 따른 단백질 재조합 방법은 디하이드로폴레이트 환원효소가 결실된 동물세포주에 본 발명에 따른 발현 벡터를 형질전환하고, 재조합 유전자를 증폭하기 위해 디하이드로폴레이트 환원효소 저해제를 첨가하여 벡터 내의 디하이드로폴레이트 환원효소가 증폭되어 선택될 수 있도록 하는 시스템을 단기간에 효율적으로 구현하는데 그 목적이 있기 때문이다. In addition, a dihydrofolate reductase inhibitor such as methoxtrexate may be added to the medium. As described above, the protein recombination method according to a preferred embodiment of the present invention transforms the expression vector according to the present invention to an animal cell line in which the dihydrofolate reductase is deleted, and dihydropol to amplify the recombinant gene. This is because a system for adding a rate reductase inhibitor to allow a dihydrofolate reductase in a vector to be amplified and selected can be efficiently implemented in a short time.

본 발명의 바람직한 구현에 따르면, 디하이드로폴레이트 저해제의 경우 세포주의 안정성과 경제성을 고려하여 가급적 낮은 농도로 짧게 사용하는 것이 바람직하다. 즉, 낮은 농도로 디하이드로폴레이트 환원효소 저해제를 사용함으로써 대량 생산을 하는 경우의 안정성과 짧은 생산 세포주 개발 기간을 가능하게 한다. 구체적으로, 본 발명은 디하이드로폴레이트 환원효소가 결핍된 중국 햄스터 난소 세포주에 상기의 재조합 단백질 발현벡터를 형질전환하여 100nM 이하의, 바람직하게는 50nM 이하 농도의 메토트렉세이트를 사용하는 재조합 단백질의 제조방법을 제공한다.According to a preferred embodiment of the present invention, the dihydrofolate inhibitor is preferably used as short as possible in consideration of the stability and economic efficiency of the cell line. In other words, the use of dihydrofolate reductase inhibitors at low concentrations enables stability in mass production and shorter development cell line development periods. Specifically, the present invention is a method for producing a recombinant protein using methotrexate at a concentration of 100 nM or less, preferably 50 nM or less by transforming the recombinant protein expression vector into a Chinese hamster ovary cell line deficient in dihydrofolate reductase. To provide.

또한, 본 발명의 추가적인 한 양태로서, 상기에 기술한 세포주로부터 에리트로포이에틴을 생산하는 경우, 생산된 에리트로포이에틴 중 시알산을 다량 함유하여 물성이 뛰어난 고시알산 함유 에리트로포이에틴을 대량으로 정제하는 단계를 추가적으로 포함할 수 있다. In addition, as an additional aspect of the present invention, when producing erythropoietin from the cell line described above, a large amount of sialic acid in the produced erythropoietin to purify the high sialic acid-containing erythropoietin excellent in physical properties It may further comprise a step.

본 발명의 일 실시예에서는 디하이드로폴레이트 환원효소의 프로모터 중 GC-rich 서열을 인위적으로 결손시켜 발현이 최소한으로 이루어 질 수 있게 한 후 디하이드로폴레이트 환원효소 저해제를 첨가하여 유전자를 증폭시켰다. 유전자 증폭이 이루어진 세포주들 중 한 개의 세포로부터 기원한 클론을 얻기 위하여 한계 희석을 실시하여 단클론 세포주를 획득하고, 이를 대량 배양하여 무혈청 배지에서 재조합 인간 에리트로포이에틴을 생산하였다. 이 때, 저염 농도에서 용출되는 에리트로포이에틴은 시알산 함유량이 높은 분획이며, 고염 농도에서 용출되는 에리트로포이에틴은 시알산 함유량이 낮은 분획이 됨을 이용하여, 칼럼 볼륨 농도 구배를 적용하여 에리트로포이에틴을 용출시켜 고시알산 함유 에리트로포이에틴을 정제하여 활성을 확인하였다.In an embodiment of the present invention, the gene is amplified by adding a dihydrofolate reductase inhibitor after artificially deleting the GC-rich sequence in the promoter of the dihydrofolate reductase to minimize expression. To obtain clones originating from one of the cell lines subjected to gene amplification, limiting dilution was performed to obtain monoclonal cell lines, which were mass cultured to produce recombinant human erythropoietin in serum-free medium. At this time, the erythropoietin eluted at low salt concentration is a fraction having a high sialic acid content, and the erythropoietin eluted at a high salt concentration is a fraction having a low sialic acid content. Was eluted to purify erythropoietin containing gosialic acid, and activity was confirmed.

이하 실시예에 의거하여 본 발명을 보다 상세히 설명한다. 본 실시예는 발명을 구체적으로 예시하기 위한 것일 뿐으로, 본 발명이 하기의 실시예에 국한되거나 제한되는 것은 아니다.The present invention will be described in more detail based on the following examples. This embodiment is only for illustrating the invention in detail, the present invention is not limited to the following examples.

실시예Example 1. One.

실시예Example 1.  One. GCGC -- richrich 반복서열이 순차적으로 제거된 발현벡터의 제조 Preparation of Expression Vectors with Repeated Sequences Removed

GC-rich 반복서열이 결실된 프로모터에 의해 발현되는 디하이드로폴레이트 환원효소 유전자 카세트 구조를 만들기 위하여 pSV2-dhfr(ATCC # 37146) 플라스미 드로부터 SmaI 제한 효소 인식 부위를 갖는 프라이머 dhfr 01(5'-GCG CCC GGG ATG GTT CGA CCA TTG AAC TGC-3')과 BstBI 제한효소 인식부위를 갖는 프라이머 dhfr-02 (5'-CAC TTA GAA CCT GTT AGT CTT TCT TCT CGT AGA C-3')을 이용하여 PCR을 실시하여 디하이드로 폴레이트 환원 효소 유전자를 증폭하고 얻어진 약 200 bp의 증폭 산물을 1 % 아가로스 겔에서 전기영동 한 후 QIAGEN사의 gel extraction 키트(cat # 28706)를 사용하여 회수하였다. 회수한 유전자 절편은 Qiagen 사의 pDRIVE 벡터에 직접 삽입하여 클로닝 하고 염기서열 분석을 실시하여 오류가 없음을 확인하였다. 염기서열이 확인된 pDRIVE-dhfr 플라스미드로부터 디하이드로 폴레이트 환원 효소 유전자를 제한효소 SmaI/BstBI으로 절단하고 1.5 % 아가로스 겔에서 전기 영동한 후 약 200 bp의 유전자 절편을 QIAGEN사의 gel extration 키트(cat # 28706)를 사용하여 회수하였다. 이러한 방법으로 얻어진 디하이드로폴레이트 환원 효소 유전자 절편은 Invitrogen 사의 pcDNA 3.1 벡터의 네오마이신 유전자 서열을 동일한 제한 효소로 절단하여 치환하였다. 이렇게 클로닝된 pcDNA3.1-dhfr 플라스미드를 주형으로 하여, BamHI 제한효소 인식부위를 가지며 디하이드로폴레이트 환원효소 프로모터의 각기 다른 GC-rich 반복서열에 상보적인 프라이머 x 1GC (5'-TCA GGA TCC ATT CTC CGC CCC ATG GCT GAC TAA-3'), x 3GC (5'-CAT GGA TCC TAA CTC CGC CCA GTT CCG CCC ATT CT-3'), x 6GC (5'-CAT GGA TCC CAT AGT CCC GCC CCT AAC TCC GCC C-3')와, 역시 BamHI제한효소 인식부위를 가지며 디하이드로폴레이트 환원효소와 구조적으로 연결된 폴리아데닐레이트화 신호서열과 상보적인 프라이머 BISVpAR(5'-TCA GGA TCC CAG ACA TGA TAA GAT ACA TTG ATG -3')를 이용한 PCR을 실 시하여 GC-rich 반복 서열이 순차적으로 일부 손실된 각기 다른 세가지 크기의 유전자 카세트를 얻었다. 이 유전자 카세트를 BglII 제한효소로 절단한 invitrogen사의 pcDNA3.1벡터에 삽입하여 클로닝 한 후, 제한효소 지도법을 이용하여 벡터의 CMV 프로모터와 동일한 방향성을 지니는 클론을 선별하였다. 이를 통하여 X1GC/dhfr, X3GC/dhfr 및 X6GC/dhfr 플라스미드를 제조하였다. 각각의 플라스미드는 인간 genomic DNA 유래의 에리트로포이에틴 유전자를 클로닝 하는데 사용하였다. 또한 GC-rich 반복서열이 완전히 제거된 대조군 발현벡터를 제작하기 위하여 다음과 같이 클로닝을 실시하였다.Primer dhfr 01 (5 ') with a SmaI restriction enzyme recognition site from the pSV2-dhfr (ATCC # 37146) plasmid to create a dihydrofolate reductase gene cassette structure expressed by a promoter deleted with a GC-rich repeat sequence -GCG CCC GGG ATG GTT CGA CCA TTG AAC TGC-3 ') and primer dhfr-02 (5'-CAC TTA GAA CCT GTT AGT CTT TCT TCT CGT AGA C-3') with BstBI restriction enzyme recognition site PCR was performed to amplify the dihydrofolate reductase gene, and the obtained amplification product of about 200 bp was electrophoresed on a 1% agarose gel and recovered using QIAGEN's gel extraction kit (cat # 28706). The recovered gene fragment was directly inserted into the pDRIVE vector of Qiagen, cloned, and subjected to sequencing to confirm that there was no error. Dihydrofolate reductase gene was digested with restriction enzyme SmaI / BstBI from the nucleotide sequence pDRIVE-dhfr plasmid, and electrophoresed on 1.5% agarose gel. # 28706). The dihydrofolate reductase gene fragment obtained by this method was replaced by cutting the neomycin gene sequence of Invitrogen's pcDNA 3.1 vector with the same restriction enzyme. Using the cloned pcDNA3.1-dhfr plasmid as a template, the primer x 1GC (5'-TCA GGA TCC ATT) having a BamHI restriction enzyme recognition site and complementary to different GC-rich repeats of the dihydrofolate reductase promoter CTC CGC CCC ATG GCT GAC TAA-3 '), x 3GC (5'-CAT GGA TCC TAA CTC CGC CCA GTT CCG CCC ATT CT-3'), x 6GC (5'-CAT GGA TCC CAT AGT CCC GCC CCT AAC TCC GCC C-3 ') and a primer BISVpAR (5'-TCA GGA TCC CAG ACA TGA TAA GAT), which also has a BamHI restriction enzyme recognition site and is complementary to the polyadenylation signal sequence structurally linked with dihydrofolate reductase PCR using ACA TTG ATG-3 ') yielded three different size gene cassettes with partial loss of GC-rich repeat sequences. The gene cassette was inserted and cloned into the invitrogen pcDNA3.1 vector digested with BglII restriction enzyme, and clones having the same orientation as the CMV promoter of the vector were selected using restriction enzyme mapping. Through this, X1GC / dhfr, X3GC / dhfr and X6GC / dhfr plasmids were prepared. Each plasmid was used to clone the erythropoietin gene from human genomic DNA. In addition, cloning was performed as follows to prepare a control expression vector from which the GC-rich repeat sequence was completely removed.

디하이드로폴레이트 환원효소 유전자와 SV40 바이러스 폴리아데닐레이트 서열을 얻기 위하여 BamHI 제한효소 인식부위를 포함하는 프라이머 X0GC (5'-CGA TGG ATC CGA CAT GAT AAG ATA CAT TGA T-3')와 X0GCRR (5'-CGT TGG ATC CAC AGC TCA GGG CTG CGA TTT C-3')를 제작하였다. pSV2-dhfr 플라스미드를 주형으로 하여 PCR을 수행하여 SV40 바이러스 폴리아데닐레이트 서열로부터 디하이드로폴레이트 환원효소 유전자의 5' 비해독 부위에 이르는 서열 1.5 kb의 증폭산물을 얻었다. 이 유전자절편을 1% 아가로스 겔에서 전기영동한 후 gel extraction 키트 (QIAGEN, cat #28706)를 사용하여 회수하여 invitrogen사의 pcDNA3.1을 제한효소 BglII로 절단한 위치에 삽입하였다. 제한효소 지도법을 이용하여 디하이드로폴레이트 환원효소 유전자가 CMV 프로모터와 반대 방향으로 위치한 클론을 선별하였다. 몇 가지 제한효소 인식부위를 제거하기 위하여 벡터의 다클로닝 부위를 제한효소 NdeI과 DraIII로 절단하고, invitrogen사의 pRcCMV 벡터를 같은 방법으로 절단한 절편으로 치환하였다. 이를 통하여 X0GC/dhfr 벡터를 제작하였다.Primers X0GC (5'-CGA TGG ATC CGA CAT GAT AAG ATA CAT TGA T-3 ') and X0GCRR (5 '-CGT TGG ATC CAC AGC TCA GGG CTG CGA TTT C-3'). PCR was performed using the pSV2-dhfr plasmid as a template to obtain an amplification product of sequence 1.5 kb from the SV40 virus polyadenylate sequence to the 5 'non-toxic site of the dihydrofolate reductase gene. The gene fragment was electrophoresed on a 1% agarose gel and then recovered using a gel extraction kit (QIAGEN, cat # 28706) and inserted at the position where the invitrogen pcDNA3.1 was cut with restriction enzyme BglII. Restriction maps were used to select clones in which the dihydrofolate reductase gene was located in the opposite direction to the CMV promoter. In order to remove several restriction enzyme recognition sites, the polycloning site of the vector was digested with restriction enzymes NdeI and DraIII, and the pRcCMV vector of invitrogen was replaced with a fragment cut in the same manner. X0GC / dhfr vector was produced through this.

이상의 클로닝 모식도를 도 1과 도 2에 정리하였다.The above cloning schematic is summarized in FIG. 1 and FIG.

실시예Example 2.  2. gEPOgEPO 유전자의  Gene 클로닝Cloning

pCI-neo/gEPO 플라스미드를 XhoI/EcoRI 제한효소로 절단하고, DNA polymerase I Klenow fragment로 접착말단을 평활 말단으로 만든 후, 0.7% 아가로스 겔에서 전기 영동하여 인간 에리트로포이에틴 게놈유전자에 해당하는 약 2.2Kb 의 DNA절편을 gel extration 키트(QIAGEN사, cat #28706)를 사용하여 회수하였다. Invitrogen 사의 pRcCMV 벡터를 EcoRV로 절단하고 QIAGEN사의 PCR purification 키트(cat#28106)로 회수한 후, EcoRV 절단부위에 상기 에리트로포이에틴 게놈유전자를 접합시켜 클로닝하였다. 제한효소 지도법을 이용하여 벡터의 에리트로포이에틴 게놈유전자가 CMV 프로모터와 같은 방향성을 지니는 클론을 선별하였다. 실시예 1.의 X0GC/dhfr, x1GC/dhfr, x3GC/dhfr, x6GC/dhfr 벡터를 BamHI/XhoI 제한효소로 절단하고 1% 아가로스 겔에서 전기 영동한 후 gel extration 키트(QIAGEN사, cat #28706)를 사용하여 회수하였다. 같은 방법으로 pRcCMV벡터에 클로닝한 에리트로포이에틴 게놈유전자 절편을 회수하여, 절단한 x0GC/dhfr, x1GC/dhfr, x3GC/dhfr 및 x6GC/dhfr 벡터의 멀티클로닝 사이트의 BamHI/XhoI 위치에 삽입하여 서브클로닝 하였다. 얻어진 클론들의 염기서열을 분석하여 오류가 없음을 확인하고 이를 각각 X0GC/GEPO, X1GC/GEPO, X3GC/GEPO 및 X6GC/GEPO로 명명하였다. 이상의 클로닝 과정 을 도 3에 정리하였으며, 각각의 최종 서열을 서열목록에 표시하였다. The pCI-neo / gEPO plasmid was digested with XhoI / EcoRI restriction enzyme, the end of adhesion was blunted with DNA polymerase I Klenow fragment, and then electrophoresed on a 0.7% agarose gel to correspond to the human erythropoietin genome. 2.2Kb DNA fragments were recovered using a gel extration kit (QIAGEN, cat # 28706). The Invitrogen pRcCMV vector was digested with EcoRV and recovered with QIAGEN's PCR purification kit (cat # 28106), and then cloned by attaching the erythropoietin genomic gene to the EcoRV cleavage site. Restriction maps were used to select clones in which the erythropoietin genomic gene of the vector has the same orientation as the CMV promoter. Example 1 X0GC / dhfr, x1GC / dhfr, x3GC / dhfr, x6GC / dhfr vectors were digested with BamHI / XhoI restriction enzymes and subjected to electrophoresis on 1% agarose gels followed by gel extration kit (QIAGEN, cat # 28706). ) Was recovered using. In the same manner, the erythropoietin genomic fragment cloned into the pRcCMV vector was recovered and inserted into the BamHI / XhoI position of the multicloning sites of the cut x0GC / dhfr, x1GC / dhfr, x3GC / dhfr and x6GC / dhfr vectors. It was. The nucleotide sequences of the clones obtained were confirmed to be free of errors and named as X0GC / GEPO, X1GC / GEPO, X3GC / GEPO, and X6GC / GEPO, respectively. The above cloning process is summarized in FIG. 3, and each final sequence is shown in the sequence list.

실시예Example 3  3 디하이드로폴레이트Dihydrofolate 유전자 결실 중국햄스터 난소 세포주의 형질전환 Transformation of Chinese Hamster Ovary Cell Line

디하이드로폴레이트 유전자가 결손된 중국 햄스터 난소 세포주 CHO/DXB11 스트레인과 CHO/DG44 스트레인을 10 % 소태아 혈청(fetal bovine serum, Welgene, Cat # S101-01)과 1% 페니실린-스트렙토마이신(Gibco, Cat # 15140-122)이 포함된 DMEM/F12 배지(Welgene, Cat # LM002-04)에 접종하여 37 ℃, 5 % CO2 항온기에서 계대배양 하였다. 상기 제조된 X0GC/GEPO, X1GC/GEPO, X3GC/GEPO 및 X6GC/GEPO 플라스미드로 각각의 디하이드로폴레이트 유전자 결손 중국 햄스터 난소 세포주를 형질 전환하기 위해 직경 6 cm 세포 배양 접시에 1×106 세포를 접종하고 37 ℃, 5 % CO2 항온기에서 24 시간 배양한 후, Gibco사의 Opti-MEM 배지(Gibco., Cat # 31985-070)로 2 회 세척하였다. 10 μg의 각각의 플라스미드가 들어있는 3 개의 1 mL의 Opti-MEM에 LipofectamineTM Reagent(Invitrogen, Cat # 18324-020) 1 mL을 혼합하여 상온에서 20분간 정치 반응시킨 후, 준비된 디하이드폴레이트 유전자 결손 중국 햄스터 난소 세포에 골고루 점적하고, 18시간 동안 37 ℃, 5 % CO2 항온기에서 배양한 후, 다시 10 % 소태아 혈청과 1 % 페니실린-스트렙토마이신이 포함된 DMEM/F12 배양배지로 교환하여 48시간 동안 배양하였다. 형질전환된 세포주를 선별하기 위해 10 % 투석 소태아 혈청(dialyzed fetal bovine serum Welgene, Cat #XXXX)과 1% 페니실린-스트렙토마이신, 그리고 800 μg/mL의 geneticin(Mediatech, Cat # 61-234 RG)이 들어있는 α-MEM(Welgene, Cat # LM008-02) 선별 배지에서 0.5 % Trypsin-EDTA (Gibco., Cat # 15400-054) 처리와 원심분리를 통해 얻어진 세포주를 모두 T 25 세포 배양 용기로 옮겨 접종한 후, 37 ℃, 5 % CO2 항온기에서 배양하고, 형질 전환되어 제네티신(geneticin)으로 선별된 세포주가 배양 용기에 90% 이상 되도록 배양하였다. 동일한 농도의 제네티신과 배양 조건에서 세포주를 선별하였으며, GC 반복서열이 더 많이 제거된 세포주들에서 선별효과가 빠르게 나타남을 관찰할 수 있었다. 디하이드로폴레이트 유전자 결손 햄스터 난소세포주의 스트레인간의 유의적 차이는 발견되지 않았다. Chinese hamster ovary cell lines lacking the dihydrofolate gene, CHO / DXB11 strain and CHO / DG44 strain, were treated with 10% fetal bovine serum (Welgene, Cat # S101-01) and 1% penicillin-streptomycin (Gibco, Inoculated in DMEM / F12 medium (Welgene, Cat # LM002-04) containing Cat # 15140-122) was passaged at 37 ℃, 5% CO 2 thermostat. 1 × 10 6 cells were placed in a 6 cm diameter cell culture dish to transform each dehydrofolate gene deficient Chinese hamster ovary cell line with the prepared X0GC / GEPO, X1GC / GEPO, X3GC / GEPO and X6GC / GEPO plasmids. Inoculated and incubated for 24 hours at 37 ℃, 5% CO 2 incubator, washed twice with Gibco's Opti-MEM medium (Gibco., Cat # 31985-070). After mixing 1 mL of Lipofectamine TM Reagent (Invitrogen, Cat # 18324-020) in 3 1 mL Opti-MEM containing 10 μg of each plasmid, the reaction was allowed to stand at room temperature for 20 minutes, and then the prepared dihydrofolate gene. Evenly distributed in the defective Chinese hamster ovary cells, incubated for 18 hours at 37 ℃, 5% CO 2 incubator, then exchanged with DMEM / F12 culture medium containing 10% fetal bovine serum and 1% penicillin-streptomycin Incubate for 48 hours. 10% dialyzed fetal bovine serum Welgene (Cat #XXXX), 1% penicillin-streptomycin, and 800 μg / mL geneticin (Mediatech, Cat # 61-234 RG) for selection of transformed cell lines. Cell lines obtained by treatment with 0.5% Trypsin-EDTA (Gibco., Cat # 15400-054) and centrifugation in α-MEM (Welgene, Cat # LM008-02) screening medium were transferred to T 25 cell culture vessel. After inoculation, the cells were cultured at 37 ° C. in a 5% CO 2 incubator, and cultured so that 90% or more of the cell lines transformed and selected with geneticin were cultured. Cell lines were selected at the same concentrations of geneticin and culture conditions, and it was observed that the screening effect appeared rapidly in cell lines with more GC repeats removed. No significant differences were found between strains of the dihydrofolate gene-deficient hamster ovary cell line.

실시예Example 4. 재조합 세포주의 선별과 에리트로포이에틴 유전자의 증폭 4. Selection of Recombinant Cell Lines and Amplification of Erythropoietin Gene

상기에서 형질전환되어 제네티신(geneticin)으로 선별된 각각 세포주(CHO/DXB11 스트레인 또는 CHO/DG44 strain)의 에리트로포이에틴 발현양을 증가시키기 위해 메토트렉세이트 (MTX, Sigma, Cat # M-8407)가 20 nM의 농도로 첨가된 선별 배지(상기의 선별배지와 동일)에 2×104세포/웰 을 24-well 배양 용기에 접종한 후, 37 ℃, 5 % CO2 항온기에서 2 주간 배양하였다. 에리트로포이에틴의 발현 양이 높은 세포주를 선별하기 위해 24 웰 배양 용기에서 웰의 바닥이 100 % 덮이도록 배양된 세포주를 PBS(Welgene, Cat # LB 001-02)로 2 회 세척한 후, 에리트로포이 에틴 생산 배지인 CHO-A-SFM(Gibco. Cat # 05-5072EF)을 200 μL/웰이 되도록 첨가하고 24시간 동안 배양한 배양 상층액을 회수하여 간접 이라이자(EPO ELISA kit, R&D, Cat # DEP00)방법으로 발현양을 측정하였다. 메토트렉세이트 농도를 30 nM으로 올린 후 동일한 선별배지에서 재차 2주간 배양을 실시하고 각각의 세포주들의 에리트로포이에틴 발현 양을 이라이자(ELISA ; Enzyme linked immunosorbent assay)키트 (R & D systems cat # DEP00)로 측정하여 비교하였다. 이라이자로 측정된 발현량은 도 4 ~ 도 7에서 보여지듯이 전반적으로 X0GC/GEPO와 X1GC/GEPO가 동일농도의 MTX 조건에서 가장 높은 발현량을 보이고 있음을 알 수 있으며, 이에 이어서 X3GC/GEPO와 X6GC/GEPO의 순이었다. 이러한 사실은 디하이드로폴레이트 유전자의 GC 반복서열을 많이 제거 할수록 동일 농도의 메토트렉세이트 존재시 유전자 증폭이 많이 일어나는 경우를 보여주고 있다. 또한 GC반복서열이 완전히 제거된 X0GC/GEPO에서도 효율적인 유전자 증폭이 일어남을 확인할 수 있었다. 따라서, GC 반복서열은 최소로 유지될 때 그 증폭 효과가 최대로 유지된다는 것을 확인할 수 있었다. 또한 최대로 유전자 발현이 일어나는 메토트렉세이트의 농도를 확인하고자, 선별배지내 메토트렉세이트의 농도를 추가로 40 nM에서 60 nM 까지 증가시켜 보았으나 유의할 수준의 발현량 증가나 산성이소머의 함량변화는 일어나지 않음을 확인하였다. 본 결과를 바탕으로 실시예 6에서와 같이 X0GC/GEPO 및 X1GC/GEPO로 형질전환된 두 개의 세포주 스트레인에 대하여 한계 희석을 실시하였다. Methotrexate (MTX, Sigma, Cat # M-8407) was used to increase the expression level of erythropoietin of each cell line (CHO / DXB11 strain or CHO / DG44 strain) transformed and selected as geneticin. 2 × 10 4 cells / well were inoculated into a 24-well culture vessel in a selection medium (same as the above selection medium) added at a concentration of 20 nM, and then cultured in a 37 ° C., 5% CO 2 incubator for 2 weeks. In order to select a cell line with high expression amount of erythropoietin, the cell line incubated so that the bottom of the well is 100% covered in a 24-well culture vessel was washed twice with PBS (Welgene, Cat # LB 001-02), and then erythropoie CHO-A-SFM (Gibco. Cat # 05-5072EF), an ethine production medium, was added to 200 μL / well, and the culture supernatant incubated for 24 hours was recovered to recover the indirect Eliza (EPO ELISA kit, R & D, Cat # DEP00). Expression level was measured by the method. After raising methotrexate concentration to 30 nM, the cells were cultured again for two weeks in the same selection medium, and the amount of erythropoietin expression of each cell line was measured by ELISA (Ezyme linked immunosorbent assay) kit (R & D systems cat # DEP00). By comparison. As shown in FIGS. 4-7, X0GC / GEPO and X1GC / GEPO showed the highest expression levels at the same concentration of MTX as shown in FIGS. 4 to 7, followed by X3GC / GEPO and X6GC. Followed by / GEPO. This fact indicates that the more amplification of the GC sequence of the dihydrofolate gene, the more amplification of the gene occurs in the presence of the same concentration of methotrexate. In addition, efficient gene amplification also occurred in X0GC / GEPO where the GC repeat sequence was completely removed. Therefore, it was confirmed that the amplification effect is maintained at the maximum when the GC repeat sequence is kept at the minimum. In addition, in order to confirm the concentration of methotrexate in which gene expression occurs to the maximum, we tried to increase the concentration of methotrexate in the selection medium from 40 nM to 60 nM, but there was no significant increase in expression level or change in acid isomer content. Confirmed. Based on the results, limit dilution was performed on two cell line strains transformed with X0GC / GEPO and X1GC / GEPO as in Example 6.

실시예Example 5.  5. 디하이드로폴레이트Dihydrofolate 유전자의 증폭 확인 Confirmation of gene amplification

실시예 4에서 얻어진 각각의 세포를 D-PBS용액으로 1회 세척하고 trypsin-EDTA를 처리하여 배양플라스크로부터 회수한 후 1,000 rpm에서 3 분간 원심분리하여 세포를 수득하였다. 이를 다시 D-PBS로 1회 세척하고 헤마사이토미터 (hemacytometer, incyto사)로 세포수를 세어 3 x 106개의 세포를 분리하고, 원심분리하여 세포침전을 얻었다. 얻어진 세포들은 각각 1.5 mL의 세포 용해 완충액(PBS, 5mM EDTA, 1% NP-40)으로 현탁한 후 4 ℃에서 30 분간 용해시킨 후 12,000 rpm으로 10 분간 원심 분리하여 상등액을 수득하였다. Each cell obtained in Example 4 was washed once with D-PBS solution, treated with trypsin-EDTA, recovered from the culture flask, and centrifuged at 1,000 rpm for 3 minutes to obtain cells. This was washed once again with D-PBS and counted with a hematocytometer (hemacytometer, incyto) to count 3 x 10 6 cells, followed by centrifugation to obtain cell precipitation. The obtained cells were each suspended in 1.5 mL of cell lysis buffer (PBS, 5 mM EDTA, 1% NP-40), lysed at 4 ° C. for 30 minutes, and centrifuged at 12,000 rpm for 10 minutes to obtain a supernatant.

12.5 % SDS 폴리아크릴아미드겔을 제작하고, 각각의 세포용해액과 음성대조군인 CHO/dhfr- 세포 용해액을 20 uL씩 취하여 샘플 완충액 10 uL와 혼합하여 100 ℃에서 5 분간 전처리하여 겔에 로딩하였다. 양성 대조군으로는 20 ng내지 100 ng의 디하이드로폴레이트 환원효소(Dihydrofolate Reductase, Sigma사, cat# D6566)를 샘플 완충액과 혼합하여 겔에 로딩하였다. 로딩한 겔을 30 mA의 전류로 전기 영동한 후, 겔을 분리하여 세미포어 웨스턴 블로팅 유닛(SEMI-PHOR, Hoefer사)에 PVDF여지와 3MM지와 포개어 장착하고 90 mA에서 1시간 동안 단백질을 PVDF 여지에 이동 시켰다. 이후 단백질이 이동된 PVDF 여지를 분리하여, 0.5% 탈지유가 포함된 TBS-T 용액 10 mL에 항-디하이드로폴레이트 환원효소 마우스 항체(BD Biosciences사, cat # 610697)를 1 : 500의 비율로 첨가한 용액에서 1시간 동안 교반하며 반응시켰다. TBS-T 용액으로 10 분씩 5 회 세척한 후, 0.5 % 탈지유가 포함된 TBS-T 용액 10ml에 호스 래디쉬 퍼록시다제(horseradish peroxidase)가 표지된 항-마우스 항체(Amersham Biosciences사, cat# RPN2108 에 포함)를 1 : 3000의 비율로 첨가한 용액에서 1시간 동안 교반하며 반응시켰다. 다시 TBS-T용액으로 10 분씩 5 회 세척한 후, ECL 웨스턴 블로팅 분석 시약 (ECL Western Blotting Analysis System, Amersham Biosciences사, cat # RPN2108)의 1제와 2제를 1:1로 혼합하여 PVDF 여지 위에 뿌려 1분간 방치하여 반응시켰다. 반응 직후 엑스레이 필름으로 감광하고 현상하여 판독하였다. 웨스턴블로팅 결과는 도 9에 나타내었다. 도 9의 결과로부터 알 수 있듯이, 형질전환에 의해 dhfr 유전자의 발현량이 증가한 것을 알 수 있었다. 동일한 세포 개수에서 X0GC나 X1GC/GEPO에서의 dhfr 발현이 좀 더 높음을 알 수 있었다.12.5% SDS polyacrylamide gel was prepared, and each cell lysate and negative control CHO / dhfr-cell lysate were taken in 20 uL each, mixed with 10 uL of sample buffer and pre-treated at 100 ° C. for 5 minutes to be loaded onto the gel. . As a positive control, 20 ng to 100 ng of dihydrofolate reductase (Sigma, cat # D6566) was mixed with sample buffer and loaded onto the gel. After the loaded gel was electrophoresed at a current of 30 mA, the gel was separated and superimposed with a PVDF filter and 3MM paper in a semi-pore western blotting unit (SEMI-PHOR, Hoefer Co., Ltd.). Moved to PVDF room. Then, the protein-transferred PVDF space was separated, and anti-dihydrofolate reductase mouse antibody (BD Biosciences, cat # 610697) was 1: 500 in 10 mL of TBS-T solution containing 0.5% skim milk. The reaction was stirred for 1 hour in the added solution. After washing 10 times with TBS-T solution 5 times, anti-mouse antibody labeled with horseradish peroxidase in 10 ml of TBS-T solution containing 0.5% skim milk (Amersham Biosciences, cat # RPN2108) Included in the ratio of 1: 3000, and reacted with stirring for 1 hour. After washing 5 times with TBS-T solution for 5 minutes each, one and two agents of ECL Western Blotting Analysis System (ECL Western Blotting Analysis System, Amersham Biosciences, cat # RPN2108) were mixed in a 1: 1 ratio to allow for PVDF. Sprinkle on top and allowed to react for 1 minute. Immediately after the reaction, the photosensitive film was developed with an X-ray film and developed. Western blotting results are shown in FIG. 9. As can be seen from the results of FIG. 9, it was found that the expression level of the dhfr gene was increased by transformation. In the same cell number, dhfr expression was higher in X0GC or X1GC / GEPO.

실시예Example 6.  6. 단클론Monoclonal 세포주의 분리 및 선택 Isolation and Selection of Cell Lines

실시예 4에서 가장 발현 양이 높았던 웰의 X0GC/GEPO와 X1GC/GEPO 세포주를 6-웰 배양 용기로 옮겨 배양한 후, 단클론을 분리하기 위해 96-웰 배양 용기에 웰 당 0.5 세포가 되도록 메토트렉세이트가 첨가된 선별 배지로 각각의 세포주를 한계 희석하여 접종하였다. 37 ℃, 5 % CO2 항온기에서 약 2 ~ 3 주간 배양하며 단일 콜로니가 생성된 웰만을 선별하고 24-웰 배양 용기로 옮겨 배양하여 실시예 4와 같이 간접 이라이자 방법으로 에리트로포이에틴의 발현 양을 측정하였다. The X0GC / GEPO and X1GC / GEPO cell lines of the wells, which had the highest expression amount in Example 4, were transferred to 6-well culture vessels and incubated, and methotrexate was added to the 96-well culture vessels at 0.5 cells per well to separate monoclonal cells. Each cell line was inoculated at the limiting dilution with the added selection medium. Incubate at 37 ° C. in a 5% CO 2 incubator for about 2 to 3 weeks, select only the wells from which single colonies have been produced, transfer them to 24-well culture vessels, and incubate the erythropoietin expression in an indirect IRIS method as in Example 4. Measured.

이중 높은 발현량을 보이면서 등전집속 이형체중 산성 이소머의 비율도 높은 단클론 X0GC/GEPO9647(DXB11) 클론과 X1GC/GEPO9629(DG44) 및 X0GC/GEPO9603 clone 을 최종 단클론 세포주로 분리 및 선택하고, 시간에 따른 단클론들의 발현량과 등전집속 이형체 패턴을 분석하였다. Of these, monoclonal X0GC / GEPO9647 (DXB11) clones and X1GC / GEPO9629 (DG44) and X0GC / GEPO9603 clones with high expression and high ratios of acid isomers in isoelectrically focused isomers were isolated and selected as the final monoclonal cell lines. The expression levels and isoelectric focusing isoforms of the monoclones were analyzed.

X0GC/GEPO9647(DXB11)의 경우 세포주당 발현량은 약 80 μg/10E6cell/day로 측정 되었다. 이와 같이 본 실시예에서 사용된 X0GC/GEPO나 X1GC/GEPO의 경우는 단 2 단계의 유전자 증폭만으로도 고발현 세포주를 획득할 수 있는 방법을 가능하게 하여 낮은 passage만으로도 단백질 생산용 단클론 생산 세포주를 제작하는 것이 가능함을 보여주고 있다. In the case of X0GC / GEPO9647 (DXB11), the expression level per cell line was about 80 μg / 10E6 cells / day. As described above, in the case of X0GC / GEPO or X1GC / GEPO used in this example, a method capable of obtaining a high expression cell line by only two stages of amplification of the gene enables production of a monoclonal cell line for protein production with only a low passage. It is showing that it is possible.

실시예Example 7  7 단클론Monoclonal 세포주의 대량 배양 Mass culture of cell lines

인간 에리트로포이에틴을 대량 생산하기 위해서 실시예 6에서 선별된 단클론 세포주들 중 X0GC/GEPO9647(DXB11)를 T175 배양 용기에서 계대 배양을 단계적으로 실시하여 총 120 개의 T175 flask까지 amplification을 실시하여 1 개의 Cell Factory(Nunc, Cat # 170009)당 약 2.5 X 108 개의 세포주를 접종하여, 총 8개의 cell factory를 접종 한 후 37 ℃, 5 % CO2 항온기에서 약 48 시간 동안 배양하였다. 인산 완충액으로 Cell Factory 1 개당 1 리터씩 2 번 수세한 후, 0.3 mM sodium butyrate(Sigma, Cat. B-5887)가 첨가된 에리트로포이에틴 대량 생산 배지인 무혈청 CHO-A-SFM(Gibco Fomula # 05-5072EF)을 1 리터씩 넣어주고 33 ℃, 5 % CO2 항온기에서 배양하며 2일마다 총 9회 동안 인간 에리트로포이에틴 발현 상등액을 회수하였다. 회수된 발현 상등액은 원심분리기와 pore size 0.2 μm 여과지를 이용하여 세포 잔존물등의 고형 부유물등을 제거하였다. 회수된 발현 상등액의 일부를 취하여 간접 이라이자 방법으로 발현량을 측정 하고 등전집속을 실시하여 산성의 등전집속 이형체의 함유량등을 분석하였다. 도 10의 결과에서와 같이 대량 배양에서도 40 - 50 mg/L 수준의 발현량을 유지하고 있음을 알 수 있었으며, 고시알산함유량의 척도가 될 수 있는 6번에서 8번 이소머의 함유량이 따로 정제과정을 거치지 않았음에도 높다는 것을 알 수 있었다. In order to mass-produce human erythropoietin, amplification of X0GC / GEPO9647 (DXB11) among the monoclonal cell lines selected in Example 6 in a T175 culture vessel in stages was carried out in steps of amplification up to 120 T175 flasks in total. Approximately 2.5 X 10 8 cell lines were inoculated per factory (Nunc, Cat # 170009), and after inoculating a total of eight cell factories, the cells were incubated at 37 ° C. and 5% CO 2 incubator for about 48 hours. After washing twice with 1 liter per Cell Factory in phosphate buffer, serum-free CHO-A-SFM (Gibco Fomula #), a medium for the production of erythropoietin with 0.3 mM sodium butyrate (Sigma, Cat. B-5887), was added. 05-5072EF) in 1 liter each, 33 ℃, 5% CO 2 Human erythropoietin-expressing supernatants were harvested for 9 times every 2 days while incubating in a thermostat. The recovered supernatant was removed using a centrifuge and a pore size 0.2 μm filter paper to remove solid suspensions such as cell residues. A portion of the recovered supernatant was taken, and the expression level was measured by indirect ELISA method, and isoelectric focusing was performed to analyze the contents of acidic isoelectric focusing variants. As shown in the results of FIG. 10, it can be seen that the expression level of 40-50 mg / L is maintained even in a large amount of culture, and the contents of isomers 6 to 8 which can be used as a measure of high sialic acid content are separately purified. Although it did not go through the process was found to be high.

실시예Example 8  8 고시알산Gosialic acid 함유 인간 에리트로포이에틴의 분리 정제 Purification of Containing Human Erythropoietin

실시예 7에서 생산된 세포 배양액내에 존재하는 부유물들을 제거하기 위하여 Beckman사의 XL-90 원심분리기를 이용하여 JLA-8.1000 로터에서 7,000 rpm으로 원심 분리 후 상등액을 회수 한 후 0.2 um의 여과지로 여과한 후 초여과지막(ultrafiltration membrane)을 이용하여 10분의 1 볼륨으로 농축을 실시하였다. 농축액에 동량의 20 mM Sodium Phosphate pH 7.4 버퍼를 섞었다. Amersham 사의 Blue FF 칼럼을 20 mM Sodium Phosphate pH 7.4 버퍼로 평형화 시키고 위의 에리트로포이에틴 함유액을 적용한 후, 20 mM Sodium Phosphate pH 7.4 버퍼로 칼럼의 5 배 볼륨으로 씻어 준 후, 2 M NaCl, 20 mM Sodium Phosphate pH 7.4 버퍼를 B 버퍼로 하여 칼럼 볼륨 2 배의 농도 구배를 적용하여 에리트로포이에틴을 용출시켰다.In order to remove the suspended solids in the cell culture produced in Example 7, using a Beckman XL-90 centrifuge, centrifugation at 7,000 rpm in a JLA-8.1000 rotor, the supernatant was recovered and filtered through a 0.2 um filter paper. The ultrafiltration membrane (ultrafiltration membrane) was used to concentrate to 1/10 volume. To the concentrate was mixed the same amount of 20 mM Sodium Phosphate pH 7.4 buffer. Amersham Blue FF column was equilibrated with 20 mM Sodium Phosphate pH 7.4 buffer, the above erythropoietin-containing solution was applied, washed with 5 mM volume of column with 20 mM Sodium Phosphate pH 7.4 buffer, 2 M NaCl, 20 Erithropoietin was eluted by applying a concentration gradient of 2 times the column volume using mM Sodium Phosphate pH 7.4 buffer as the B buffer.

에리트로포이에틴 분획이 주요 분획이 되는 부분을 모아 Sephardex G25 칼럼을 사용하여 에리트로포이에틴 함유 용액의 20 mM Sodium Phosphate pH 5.4 버퍼로 탈염 및 용액의 pH를 바꾸어 주었다. 이 에리트로포이에틴 함유 용액을 20 mM Sodium Phosphate pH 5.4로 평형화된 Amersham사의 SP HP 컬럼에 적용하고 20 mM Sodium Phosphate pH 5.4 버퍼로 5배의 칼럼 볼륨만큼 닦아 준 후 1 M NaCl, 20 mM Sodium Phosphate pH 7.4 버퍼를 B 버퍼로 하여 12배의 칼럼 볼륨 농도 구배를 적용하여 에리트로포이에틴을 용출시켰다. 이 때, 저염 농도에서 용출되는 에리트로포이에틴은 시알산 함유량이 높은 분획이며, 고염 농도에서 용출되는 에리트로포이에틴은 시알산 함유량이 낮은 분획이 되는데, 전체 용출양 중 고염 농도 부위의 에리트로포이에틴 30 %에 해당하는 부위는 제외하고 나머지 부분만을 취했다. SP HP 컬럼에서 정제한 에리트로포이에틴 분획은 다시 10 mM Tris, pH 7.5 버퍼로 평형화된 Sephardex G25 컬럼을 사용하여 염을 제거하고 버퍼를 바꾸어 준다. 이것을 10 mM Tris pH 7.5 버퍼로 평형화된 Amersham사의 Source 15 Q 컬럼에 적용하고 0.25 M NaCl, 10 mM Tris, pH 7.5 버퍼를 B로 하여 6배의 칼럼볼륨 농도 구배를 적용하여 에리트로포이에틴을 용출시켰다. 에리트로포이에틴이 용출되는 부분을 15개 이상 분획하여, 각 분획을 SDS-PAGE, 등전집속과 모세관 전기영동(CZE; capillary zone electrophoresis)를 이용하여 전하 이성체 분포를 분석하여, 불순물이 없고, 시알산 함유량이 10개 붙은 에리트로포이에틴이 가장 많은 분획부터 고염 농도 분획을 모아 취하였다. 이상의 정제과정을 실시한 결과 시알산 함유량이 10 몰 이상인 재조합 인간 에리트로포이에틴의 생산성은 약 20 μg/mL에 달하는 것으로 조사되었다. The erythropoietin fraction was the main fraction, and the desalting and pH of the solution were changed to 20 mM Sodium Phosphate pH 5.4 buffer of the erythropoietin-containing solution using Sephardex G25 column. This erythropoietin-containing solution was applied to Amersham's SP HP column equilibrated with 20 mM Sodium Phosphate pH 5.4 and wiped by 5 times column volume with 20 mM Sodium Phosphate pH 5.4 buffer, followed by 1 M NaCl, 20 mM Sodium Phosphate pH. 7.4 erythropoietin was eluted by applying a 12-fold column volume concentration gradient with buffer as B buffer. At this time, the erythropoietin eluted at a low salt concentration is a fraction with high sialic acid content, and the erythropoietin eluted at a high salt concentration is a fraction with a low sialic acid content. Only the remaining part was taken except for the part corresponding to%. The erythropoietin fraction purified on the SP HP column was then removed using a Sephardex G25 column equilibrated with 10 mM Tris, pH 7.5 buffer to remove salts and change the buffer. This was applied to Amersham's Source 15 Q column equilibrated with 10 mM Tris pH 7.5 buffer and erythropoietin was eluted by applying a six-fold column volume concentration gradient with 0.25 M NaCl, 10 mM Tris, pH 7.5 buffer to B. . Fractions of erythropoietin eluted at least 15, each fraction was analyzed by the charge isomer distribution using SDS-PAGE, isoelectric focusing and capillary zone electrophoresis (CZE), free from impurities The highest salt concentration fraction was collected from the fraction containing the most erythropoietin with 10 acid contents. As a result of the above purification process, the productivity of recombinant human erythropoietin with a sialic acid content of 10 mol or more reached about 20 μg / mL.

실시예Example 9  9 인간에리트로포이에틴의Of human erythropoietin 특성 확인 Property check

SDSSDS -- PAGEPAGE  And 웨스턴Weston 블로팅Blotting 분석  analysis

상기한 실시예 8에서 정제된 재조합 에리트로포이에틴을 두 개의 12 % SDS-PAGE서 전기영동을 실시하고, 한 개의 겔은 코마시 브릴리안트 블루 염색액에서 염색하고 탈색을 실시하여 관찰하였다. 나머지 하나의 겔은 세미 드라이 전기 전이기를 이용하여 PVDF 막 (Roche, cat # )에 흡착 시키고 항 인간 EPO항체 (R & D systems cat # AB-286-NA)를 1 : 5,000으로 처리한 후 세척을 실시하고 알칼라인 포스파타아제가 접합된 항 마우스 토끼 항체(Amersham cat # NA934V)를 처리하였다. 0.5 % Tween 20이 포함된 인산완충액으로 충분히 세척을 실시한 후 Amersham사의 발색시약(cat # RPN2108)으로 처리한 후 암실에서 필름 감광을 실시하고, 자동 현상기를 이용하여 현상을 실시하였다. 분석 결과 SDS-PAGE의 밴드가 에리트로포이에틴임을 확인할 수 있었으며, 당쇄화의 정도에 따라서 크기의 차이를 나타내는 것을 볼 수 있었다 (도 10 참조)The recombinant erythropoietin purified in Example 8 described above was subjected to electrophoresis on two 12% SDS-PAGE, and one gel was observed by staining and decolorizing in Comash Brilliant Blue staining solution. The other gel was adsorbed onto the PVDF membrane (Roche, cat #) using a semi-dry electric transition machine and treated with anti-human EPO antibody (R & D systems cat # AB-286-NA) at 1: 5,000 and washed. And anti-mouse rabbit antibody conjugated with alkaline phosphatase (Amersham cat # NA934V). After washing sufficiently with phosphate buffer containing 0.5% Tween 20, it was treated with Amersham's coloring reagent (cat # RPN2108), and then subjected to photosensitive film in the dark, and developed using an automatic developing machine. As a result, it was confirmed that the band of erythropoietin was SDS-PAGE, and the size difference was shown depending on the degree of glycosylation (see FIG. 10).

등전Isoelectric 집속Focus 분석  analysis

세포배양 농축액과 상기한 실시예 4에서 정제된 재조합 에리트로포이에틴을 시료 완충액(Invitrogen cat # LC5371)과 1 : 1로 섞은 후 등전집속 겔 (Invitrogen cat # EC6655B)에 BRP (BRP cat # E1515000) 표준품 및 등전집속 기준물질과 함께 로딩하고 저전압과 고전압에서 차례로 전기영동을 실시하였다. 전기영동이 완료된 등전집속 겔을 12 % TCA와 3 % sulfosalisylic acid가 포함된 고정용 액에서 30 분간 흔들면서 고정을 실시하고, 코마시 브릴리언트 블루 R250(Amresco cat # 6104-59-2) 염색액에서 염색을 실시한 후 각각의 등전점에 해당하는 에리트로포이에틴을 관찰하였다. (도 11 참조) The cell culture concentrate and the recombinant erythropoietin purified in Example 4 above were mixed with the sample buffer (Invitrogen cat # LC5371) in 1: 1 and then BRP (BRP cat # E1515000) in an isoelectric focusing gel (Invitrogen cat # EC6655B). They were loaded with standard and isoelectric focusing materials and electrophoresed in turn at low and high voltages. After electrophoresis, the isoelectric focusing gel was immobilized by shaking for 30 minutes in a fixed solution containing 12% TCA and 3% sulfosalisylic acid, and stained with Coomas Brilliant Blue R250 (Amresco cat # 6104-59-2). After staining at erythropoietin corresponding to each isoelectric point was observed. (See Figure 11)

TFTF -1 세포주를 이용한 With -1 cell line 역가와Activity and 활성 분석  Activity analysis

10 % 소태아 혈청과 10 uM β-mercaptoethanol, 20 ug/mL 트랜스페린, 12 ng/mL의 GM-CSF등이 함유된 RPMI 1640 (Welgene cat # LM 011-03)에서 자란 TF-1 (ATCC cat # CRL-2003)을 1,000 rpm에서 5 분간 원심 분리하여 회수한다. 회수된 TF-1 세포주는 인산완충액으로 2 회 세척을 실시하고, 2 % 소 태아 혈청 및 100 ug/mL 트랜스페린, 2 mg/mL protease-free 소 혈청, 1 % 페니실린/스트렙토마이신이 들어 있는 어세이 배지로 1회 세척을 실시하였다. 96 웰 플레이트에 50 uL의 어세이 배지를 첨가하고 시작하는 웰에 25 uL의 시료와 표준품을 넣어서 넣어 최종 농도가 1 ug/mL이 되도록 한 후 3 배씩 연속적으로 희석을 실시하였다. 각 웰에 어세이 배지로 세척을 실시한 TF-1 세포주를 2 X 104/50 uL/웰이 되게 더하였다. 37 ℃ 5 % CO2 배양기에서 약 72 시간동안 반응을 시킨 후 Promega사의 cell titer one solution 키트(cat # G4102)의 발색 용액 각 웰당 20 uL씩 첨가하였다. 4 시간 동안 37 ℃ 5 % CO2 배양기에서 발색을 실시 후 잘 섞어서 490 nm 파장에서의 흡광도를 Molecular Dynamics 사의 ELISA 리더에서 측정하였다. 본 시험을 통하여 X0GC/GEPO 유래의 재조합 에리트로포이에틴 활성이 표준품과의 차이가 없음을 확인 할 수 있었다. TF-1 (ATCC cat #) grown on RPMI 1640 (Welgene cat # LM 011-03) containing 10% fetal bovine serum, 10 uM β-mercaptoethanol, 20 ug / mL transferrin, and 12 ng / mL GM-CSF CRL-2003) is recovered by centrifugation at 1,000 rpm for 5 minutes. The recovered TF-1 cell line was washed twice with phosphate buffer and assayed with 2% fetal bovine serum and 100 ug / mL transferrin, 2 mg / mL protease-free bovine serum, 1% penicillin / streptomycin. One wash was performed with the medium. 50 uL of assay medium was added to a 96 well plate, and 25 uL of sample and standard product were added to the starting well to make a final concentration of 1 ug / mL, and then serially diluted three times. The TF-1 cell line subjected to washing with assay medium to each well, 2 X 10 4/50 was further causes the uL / well. After reacting for about 72 hours in a 37 ° C. 5% CO 2 incubator, 20 uL of each color well of Promega's cell titer one solution kit (cat # G4102) was added. After 4 hours of color development at 37 ° C. in a 5% CO 2 incubator, the mixture was mixed well and the absorbance at 490 nm was measured in an ELISA reader manufactured by Molecular Dynamics. Through this test, it was confirmed that the recombinant erythropoietin activity derived from X0GC / GEPO was not different from the standard product.

실시예Example 10  10 단클론Monoclonal 세포주의  Cell line 무혈청배지Serum-free medium adaptationadaptation

X1GC/GEPO9629(DG44)와 X0GC/GEPO9603(DG44) 스트레인을 10% 투석 소태아 혈청(dialyzed fetal bovine serum)과 1% 페니실린-스트렙토마이신, 800 ug/mL의 geneticin (Mediatech, Cat # 61-234 RG)이 들어있는 a-MEM(Welgene, Cat # LM008-02)선별 배지를 사용하여 2개의 T175 배양 용기에서 바닥에 90% 이상 배양되도록 하였다. 0.5% 트립신(Gibco., Cat # 15400-054) 처리 후, 원심 분리하여 얻은 세포 침전 덩어리를 10 ml의 PBS (Welgene, Cat # LM001-01)로 현탁하고, 재차 원심분리를 실시한 후 바닥의 세포덩어리를 8 mM 글루타민이 첨가된 JRH 사의 EX-CELL CD CHO (cat # 14360) 배양 배지로 현탁시켰다. 무혈청배지로 현탁된 세포배양액을 T175 배양용기에 넣어서 3일간 CO2 배양기에서 정치 배양을 실시하였다. 배양 후 약 3일 후에 바닥에 붙지 않고 떠서 자라는 세포들을 원심분리 한 후 새로운 T25 배양용기로 옮긴 후 다시 9 일간 정치 배양을 실시하였다. 9일 후 배양용기에서 떨어진 세포들을 다시 회수하여 세포수 및 성장곡선, 생존율 등을 측정하였다. 동일한 작업을 반복하여 세포분열이 24시간 내외로 이루어지고 생존률이 80 % 이상이 될 때가지 반복 실시하였다. 무혈청 배지로 완전 적응된 X0GC/GEPO9603(DG44) 세포주 및 X1GC/GEPO9629(DG44)의 발현 양상을 확인하기 위하여 500 mL spinner 배양용기(Bellco)에 1.0 X 10E5 cells/mL의 농도로 200 mL 배양배지에서 50 rpm으로 교반 배양을 실시하였다. 배양 7일 후 100 mL의 배양배지를 새 배양 배지로 바꾸고 배양온도를 33 ℃로 낮추어 저온발현을 실시하였다. 24시간 간격으로 배양 상등액을 취하여 등전집속을 실시하여 발현되는 에리트로포이에틴의 산성 이소머 함량을 확인하였다. 도 12에서 보이는 것과 같이 무혈청 배지에 완전 적응된 X0GC/GEPO9603(DG44) 및 X1GC/GEPO9629(DG44)의 경우 기존 cell factory에서 생산된 에리트로포이에틴과 대등한 수준의 등전집속 프로파일을 보이고 있음을 확인 할 수 있었다. X1GC / GEPO9629 (DG44) and X0GC / GEPO9603 (DG44) strains with 10% dialyzed fetal bovine serum, 1% penicillin-streptomycin, 800 ug / mL of geneticin (Mediatech, Cat # 61-234 RG A-MEM (Welgene, Cat # LM008-02) selection medium containing) was used to allow more than 90% culture on the bottom in two T175 culture vessels. After treatment with 0.5% trypsin (Gibco., Cat # 15400-054), the cell precipitated mass obtained by centrifugation was suspended with 10 ml of PBS (Welgene, Cat # LM001-01), followed by centrifugation again, and then the cells at the bottom. The mass was suspended in EXH-CELL CD CHO (cat # 14360) culture medium from JRH Company with 8 mM glutamine. Cell culture medium suspended in serum-free medium was placed in a T175 culture vessel and CO 2 Stationary culture was performed in the incubator. After about 3 days of incubation, the floating cells that did not adhere to the bottom were centrifuged, transferred to a new T25 culture vessel, and then stationed again for 9 days. Nine days later, the cells were removed from the culture vessel again, and the number of cells, growth curve, and survival rate were measured. The same operation was repeated until cell division was performed within 24 hours and the survival rate was 80% or more. To confirm the expression pattern of X0GC / GEPO9603 (DG44) cell line and X1GC / GEPO9629 (DG44) fully adapted with serum-free medium, 200 mL culture medium at a concentration of 1.0 X 10E5 cells / mL in 500 mL spinner culture vessel (Bellco). Stirring culture was performed at 50 rpm. After 7 days of culture, 100 mL of the culture medium was changed to a new culture medium, and the low temperature expression was performed by lowering the culture temperature to 33 ° C. Culture supernatants were taken at 24 hour intervals to perform isoelectric focusing to determine the acid isomer content of erythropoietin expressed. As shown in FIG. 12, X0GC / GEPO9603 (DG44) and X1GC / GEPO9629 (DG44) fully adapted to serum-free medium showed an isoelectric focusing profile comparable to that of erythropoietin produced in the existing cell factory. Could confirm.

본 발명에 따른 디하이드로 폴레이트 환원 효소의 GC-rich 서열 제거된 벡터 및 세포주를 사용하는 경우, 인간 에리트로포이에틴을 생산하는 세포주를 선별하는데 저농도의 디하이드로폴레이트 환원효소 저해제를 단기간 사용함으로 인해 고농도로 에리트로포이에틴을 생산하는 세포주의 개발기간을 단축시킬 수 있으며, 상기 세포주를 사용하여 고효율로 원하는 재조합 단백질을 대량생산할 수 있어, 기존의 동물 세포 발현벡터에서 사용하는 방법 중의 하나인 유전자 증폭 기술을 보다 효율적으로 개선시키는 효과가 있다.In the case of using the GC-rich sequence-deleted vector and cell line of the dihydrofolate reductase according to the present invention, short-term use of a low concentration of the dihydrofolate reductase inhibitor to select a cell line producing human erythropoietin It is possible to shorten the development period of the cell line producing erythropoietin at high concentration, and to use the cell line to mass produce the desired recombinant protein with high efficiency, gene amplification technology which is one of the methods used in the existing animal cell expression vector. There is an effect to improve more efficiently.

<110> Hanmi Pharm. Co., Ltd. <120> A Novel vector and expression cell line for mass production of recombinant protein and a process of producing recombinant protein using same <160> 18 <170> KopatentIn 1.71 <210> 1 <211> 9227 <212> DNA <213> Artificial Sequence <220> <223> X0GC/GEPO vector <400> 1 gacggatcgg gagatccgac atgataagat acattgatga gtttggacaa accacaacta 60 gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga tgctattgct ttatttgtaa 120 ccattataag ctgcaataaa caagttaaca acaacaattg cattcatttt atgtttcagg 180 ttcaggggga ggtgtgggag gttttttaaa gcaagtaaaa cctctacaaa tgtggtatgg 240 ctgattatga tctctagtca aggcactata catcaaatat tccttattaa cccctttaca 300 aattaaaaag ctaaaggtac acaatttttg agcatagtta ttaatagcag acactctatg 360 cctgtgtgga gtaagaaaaa acagtatgtt atgattataa ctgttatgcc tacttataaa 420 ggttacagaa tatttttcca taattttctt gtatagcagt gcagcttttt cctttgtggt 480 gtaaatagca aagcaagcaa gagttctatt actaaacaca gcatgactca aaaaacttag 540 caattctgaa ggaaagtcct tggggtcttc tacctttctc ttcttttttg gaggagtaga 600 atgttgagag tcagcagtag cctcatcatc actagatggc atttcttctg agcaaaacag 660 gttttcctca ttaaaggcat tccaccactg ctcccattca tcagttccat aggttggaat 720 ctaaaataca caaacaatta gaatcagtag tttaacacat tatacactta aaaattttat 780 atttacctta gagctttaaa tctctgtagg tagtttgtcc aattatgtca caccacagaa 840 gtaaggttcc ttcacaaaga tccaaagcca gcaaaagtcc catggtctta taaaaatgca 900 tagctttagg aggggagcag agaacttgaa agcatcttcc tgttagtctt tcttctcgta 960 gacttcaaac ttatacttga tgcctttttc ctcctggacc tcagagagga cgcctgggta 1020 ttctgggaga agtttatatt tccccaaatc aatttctggg aaaaacgtgt cactttcaaa 1080 ttcctgcatg atccttgtca caaagagtct gaggtggcct ggttgattca tggcttcctg 1140 gtaaacagaa ctgcctccga ctatccaaac catgtctact ttacttgcca attccggttg 1200 ttcaataagt cttaaggcat catccaaact tttggcaaga aaatgagctc ctcgtggtgg 1260 ttctttgagt tctctactga gaactatatt aattctgtcc tttaaaggtc gattcttctc 1320 aggaatggag aaccaggttt tcctacccat aatcaccaga ttctgtttac cttccactga 1380 agaggttgtg gtcattcttt ggaagtactt gaactcgttc ctgagcggag gccagggtag 1440 gtctccgttc ttgccaatcc ccatattttg ggacacggcg acgatgcagt tcaatggtcg 1500 aaccatgatg gcagcgggga taaaatccta ccagccttca cgctaggatt gccgtcaagt 1560 ttggcgcgaa atcgcagccc tgagctgtgg atctcccgat cccctatggt gcactctcag 1620 tacaatctgc tctgatgccg catagttaag ccagtatctg ctccctgctt gtgtgttgga 1680 ggtcgctgag tagtgcgcga gcaaaattta agctacaaca aggcaaggct tgaccgacaa 1740 ttgcatgaag aatctgctta gggttaggcg ttttgcgctg cttcgcgatg tacgggccag 1800 atatacgcgt tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt 1860 agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg 1920 ctgaccgccc aacgaccccc gcccattgac gtcaataatg acgtatgttc ccatagtaac 1980 gccaataggg actttccatt gacgtcaatg ggtggagtat ttacggtaaa ctgcccactt 2040 ggcagtacat caagtgtatc atatgccaag tacgccccct attgacgtca atgacggtaa 2100 atggcccgcc tggcattatg cccagtacat gaccttatgg gactttccta cttggcagta 2160 catctacgta ttagtcatcg ctattaccat ggtgatgcgg ttttggcagt acatcaatgg 2220 gcgtggatag cggtttgact cacggggatt tccaagtctc caccccattg acgtcaatgg 2280 gagtttgttt tggcaccaaa atcaacggga ctttccaaaa tgtcgtaaca actccgcccc 2340 attgacgcaa atgggcggta ggcgtgtacg gtgggaggtc tatataagca gagctctctg 2400 gctaactaga gaacccactg cttactggct tatcgaaatt aatacgactc actataggga 2460 gacccaagct tggtaccgag ctcggatcca ctagtaacgg ccgccagtgt gctggaattc 2520 tgcagattcg aggtcgacgg tatcgataag cttccaagcg cggagatggg ggtgcacggt 2580 gagtactcgc gggctgggcg ctcccgcccg cccgggtccc tgtttgagcg gggatttagc 2640 gccccggcta ttggccagga ggtggctggg ttcaaggacc ggcgacttgt caaggacccc 2700 ggaaggggga ggggggtggg gcagcctcca cgtgccagcg gggacttggg ggagtccttg 2760 gggatggcaa aaacctgacc tgtgaagggg acacagtttg ggggttgagg ggaagaaggt 2820 ttgggggttc tgctgtgcca gtggagagga agctgataag ctgataacct gggcgctgga 2880 gccaccactt atctgccaga ggggaagcct ctgtcacacc aggattgaag tttggccgga 2940 gaagtggatg ctggtagctg ggggtggggt gtgcacacgg cagcaggatt gaatgaaggc 3000 cagggaggca gcacctgagt gcttgcatgg ttggggacag gaaggacgag ctggggcaga 3060 gacgtgggga tgaaggaagc tgtccttcca cagccaccct tctccctccc cgcctgactc 3120 tcagcctggc tatctgttct agaatgtcct gcctggctgt ggcttctcct gtccctgctg 3180 tcgctccctc tgggcctccc agtcctgggc gccccaccac gcctcatctg tgacagccga 3240 gtcctggaga ggtacctctt ggaggccaag gaggccgaga atatcacggt gagacccctt 3300 ccccagcaca ttccacagaa ctcacgctca gggcttcagg gaactcctcc cagatccagg 3360 aacctggcac ttggtttagg gtggagttgg gaagctagac actgcccccc tacataagaa 3420 taagtctggt ggccccaaac catacctgga aactaggcaa ggagcaaagc cagcagatcc 3480 tacggcctgt gggccagggc cagagccttc agggaccctt gactccccgg gctgtgtgca 3540 tttcagacgg gctgtgctga acactgcagc ttgaatgaga atatcactgt cccagacacc 3600 aaagttaatt tctatgcctg gaagaggatg gaggtgagtt cctttttttt tttttttcct 3660 ttcttttgga gaatctcatt tgcgagcctg attttggatg aaagggagaa tgatcgaggg 3720 aaaggtaaaa tggagcagca gagatgaggc tgcctgggcg cagaggctca cgtctataat 3780 cccaggctga gatggccgag atgggagaat tgcttgagcc ctggagtttc agaccaacct 3840 aggcagcata gtgagatccc ccatctctac aaacatttaa aaaaattagt caggtgaagt 3900 ggtgcatggt ggtagtccca gatatttgga aggctgaggc gggaggatcg cttgagccca 3960 ggaatttgag gctgcagtga gctgtgatca caccactgca ctccagcctc agtgacagag 4020 tgaggccctg tctcaaaaaa gaaaagaaaa aagaaaaata atgagggctg tatggaatac 4080 attcattatt cattcactca ctcactcact cattcattca ttcattcatt caacaagtct 4140 tattgcatac cttctgtttg ctcagcttgg tgcttggggc tgctgagggg caggagggag 4200 agggtgacat gggtcagctg actcccagag tccactccct gtaggtcggg cagcaggccg 4260 tagaagtctg gcagggcctg gccctgctgt cggaagctgt cctgcggggc caggccctgt 4320 tggtcaactc ttcccagccg tgggagcccc tgcagctgca tgtggataaa gccgtcagtg 4380 gccttcgcag cctcaccact ctgcttcggg ctctgggagc ccaggtgagt aggagcggac 4440 acttctgctt gccctttctg taagaagggg agaagggtct tgctaaggag tacaggaact 4500 gtccgtattc cttccctttc tgtggcactg cagcgacctc ctgttttctc cttggcagaa 4560 ggaagccatc tcccctccag atgcggcctc agctgctcca ctccgaacaa tcactgctga 4620 cactttccgc aaactcttcc gagtctactc caatttcctc cggggaaagc tgaagctgta 4680 cacaggggag gcctgcagga caggggacag atgaccagct tgatatcgaa ttatccatca 4740 cactggcggc cgctcgagca tgcatctaga gggccctatt ctatagtgtc acctaaatgc 4800 tagagctcgc tgatcagcct cgactgtgcc ttctagttgc cagccatctg ttgtttgccc 4860 ctcccccgtg ccttccttga ccctggaagg tgccactccc actgtccttt cctaataaaa 4920 tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct attctggggg gtggggtggg 4980 gcaggacagc aagggggagg attgggaaga caatagcagg catgctgggg atgcggtggg 5040 ctctatggct tctgaggcgg aaagaaccag ctggggctct agggggtatc cccacgcgcc 5100 ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga ccgctacact 5160 tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg ccacgttcgc 5220 cggctttccc cgtcaagctc taaatcgggg gctcccttta gggttccgat ttagtgcttt 5280 acggcacctc gaccccaaaa aacttgatta gggtgatggt tcacgtagtg ggccatcgcc 5340 ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata gtggactctt 5400 gttccaaact ggaacaacac tcaaccctat ctcggtctat tcttttgatt tataagggat 5460 tttgccgatt tcggcctatt ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa 5520 ttaattctgt ggaatgtgtg tcagttaggg tgtggaaagt ccccaggctc cccagcaggc 5580 agaagtatgc aaagcatgca tctcaattag tcagcaacca ggtgtggaaa gtccccaggc 5640 tccccagcag gcagaagtat gcaaagcatg catctcaatt agtcagcaac catagtcccg 5700 cccctaactc cgcccatccc gcccctaact ccgcccagtt ccgcccattc tccgccccat 5760 ggctgactaa ttttttttat ttatgcagag gccgaggccg cctctgcctc tgagctattc 5820 cagaagtagt gaggaggctt ttttggaggc ctaggctttt gcaaaaagct cccgggagct 5880 tgtatatcca ttttcggatc tgatcaagag acaggatgag gatcgtttcg catgattgaa 5940 caagatggat tgcacgcagg ttctccggcc gcttgggtgg agaggctatt cggctatgac 6000 tgggcacaac agacaatcgg ctgctctgat gccgccgtgt tccggctgtc agcgcagggg 6060 cgcccggttc tttttgtcaa gaccgacctg tccggtgccc tgaatgaact gcaggacgag 6120 gcagcgcggc tatcgtggct ggccacgacg ggcgttcctt gcgcagctgt gctcgacgtt 6180 gtcactgaag cgggaaggga ctggctgcta ttgggcgaag tgccggggca ggatctcctg 6240 tcatctcacc ttgctcctgc cgagaaagta tccatcatgg ctgatgcaat gcggcggctg 6300 catacgcttg atccggctac ctgcccattc gaccaccaag cgaaacatcg catcgagcga 6360 gcacgtactc ggatggaagc cggtcttgtc gatcaggatg atctggacga agagcatcag 6420 gggctcgcgc cagccgaact gttcgccagg ctcaaggcgc gcatgcccga cggcgaggat 6480 ctcgtcgtga cccatggcga tgcctgcttg ccgaatatca tggtggaaaa tggccgcttt 6540 tctggattca tcgactgtgg ccggctgggt gtggcggacc gctatcagga catagcgttg 6600 gctacccgtg atattgctga agagcttggc ggcgaatggg ctgaccgctt cctcgtgctt 6660 tacggtatcg ccgctcccga ttcgcagcgc atcgccttct atcgccttct tgacgagttc 6720 ttctgagcgg gactctgggg ttcgaaatga ccgaccaagc gacgcccaac ctgccatcac 6780 gagatttcga ttccaccgcc gccttctatg aaaggttggg cttcggaatc gttttccggg 6840 acgccggctg gatgatcctc cagcgcgggg atctcatgct ggagttcttc gcccacccca 6900 acttgtttat tgcagcttat aatggttaca aataaagcaa tagcatcaca aatttcacaa 6960 ataaagcatt tttttcactg cattctagtt gtggtttgtc caaactcatc aatgtatctt 7020 atcatgtctg tataccgtcg acctctagct agagcttggc gtaatcatgg tcatagctgt 7080 ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa 7140 agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac 7200 tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg 7260 cggggagagg cggtttgcgt attgggcgct cttccgcttc ctcgctcact gactcgctgc 7320 gctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat 7380 ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca 7440 ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc 7500 atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc 7560 aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg 7620 gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta 7680 ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg 7740 ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac 7800 acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag 7860 gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga agaacagtat 7920 ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat 7980 ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc 8040 gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt 8100 ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct 8160 agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaaactt 8220 ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc tgtctatttc 8280 gttcatccat agttgcctga ctccccgtcg tgtagataac tacgatacgg gagggcttac 8340 catctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct ccagatttat 8400 cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca actttatccg 8460 cctccatcca gtctattaat tgttgccggg aagctagagt aagtagttcg ccagttaata 8520 gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg tcgtttggta 8580 tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc cccatgttgt 8640 gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag 8700 tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg ccatccgtaa 8760 gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc 8820 gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat agcagaactt 8880 taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg atcttaccgc 8940 tgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcttca gcatctttta 9000 ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca aaaaagggaa 9060 taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat tattgaagca 9120 tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag aaaaataaac 9180 aaataggggt tccgcgcaca tttccccgaa aagtgccacc tgacgtc 9227 <210> 2 <211> 8635 <212> DNA <213> Artificial Sequence <220> <223> X1GC/GEPO vector <400> 2 gacggatcgg gagatccatt ctccgcccca tggctgacta atttttttta tttatgcaga 60 ggccgaggcc gcctctgcct ctgagctatt ccagaagtag tgaggaggct tttttggagg 120 cctaggcttt tgcaaaaagc tcccgggatg gttcgaccat tgaactgcat cgtcgccgtg 180 tcccaaaata tggggattgg caagaacgga aacctaccct ggcctccgct caggaacgag 240 ttcaagtact tccaaagaat gaccacaacc tcttcagtgg aaggtaaaca gaatctggtg 300 attatgggta ggaaaacctg gttctccatt cctgagaaga atcgaccttt aaaggacaga 360 attaatatag ttctcagtag agaactcaaa gaaccaccac gaggagctca ttttcttgcc 420 aaaagtttgg atgatgcctt aagacttatt gaacaaccgg aattggcaag taaagtagac 480 atggtttgga tagtcggagg cagttctgtt taccaggaag ccatgaatca accaggccac 540 ctcagactct ttgtgacaag gatcatgcag gaatttgaaa gtgacacgtt tttcccagaa 600 attgatttgg ggaaatataa acttctccca gaatacccag gcgtcctctc tgaggtccag 660 gaggaaaaag gcatcaagta taagtttgaa gtctacgaga agaaagacta acaggttcga 720 aatgaccgac caagcgacgc ccaacctgcc atcacgagat ttcgattcca ccgccgcctt 780 ctatgaaagg ttgggcttcg gaatcgtttt ccgggacgcc ggctggatga tcctccagcg 840 cggggatctc atgctggagt tcttcgccca ccccaacttg tttattgcag cttataatgg 900 ttacaaataa agcaatagca tcacaaattt cacaaataaa gcattttttt cactgcattc 960 tagttgtggt ttgtccaaac tcatcaatgt atcttatcat gtctgggatc tcccgatccc 1020 ctatggtgca ctctcagtac aatctgctct gatgccgcat agttaagcca gtatctgctc 1080 cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca aaatttaagc tacaacaagg 1140 caaggcttga ccgacaattg catgaagaat ctgcttaggg ttaggcgttt tgcgctgctt 1200 cgcgatgtac gggccagata tacgcgttga cattgattat tgactagtta ttaatagtaa 1260 tcaattacgg ggtcattagt tcatagccca tatatggagt tccgcgttac ataacttacg 1320 gtaaatggcc cgcctggctg accgcccaac gacccccgcc cattgacgtc aataatgacg 1380 tatgttccca tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta 1440 cggtaaactg cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt 1500 gacgtcaatg acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac 1560 tttcctactt ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt 1620 tggcagtaca tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac 1680 cccattgacg tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt 1740 cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc gtgtacggtg ggaggtctat 1800 ataagcagag ctctctggct aactagagaa cccactgctt actggcttat cgaaattaat 1860 acgactcact atagggagac ccaagctggc tagcgtttaa acttaagctt ggtaccgagc 1920 tcggatccac tagtaacggc cgccagtgtg ctggaattct gcagattcga ggtcgacggt 1980 atcgataagc ttccaagcgc ggagatgggg gtgcacggtg agtactcgcg ggctgggcgc 2040 tcccgcccgc ccgggtccct gtttgagcgg ggatttagcg ccccggctat tggccaggag 2100 gtggctgggt tcaaggaccg gcgacttgtc aaggaccccg gaagggggag gggggtgggg 2160 cagcctccac gtgccagcgg ggacttgggg gagtccttgg ggatggcaaa aacctgacct 2220 gtgaagggga cacagtttgg gggttgaggg gaagaaggtt tgggggttct gctgtgccag 2280 tggagaggaa gctgataagc tgataacctg ggcgctggag ccaccactta tctgccagag 2340 gggaagcctc tgtcacacca ggattgaagt ttggccggag aagtggatgc tggtagctgg 2400 gggtggggtg tgcacacggc agcaggattg aatgaaggcc agggaggcag cacctgagtg 2460 cttgcatggt tggggacagg aaggacgagc tggggcagag acgtggggat gaaggaagct 2520 gtccttccac agccaccctt ctccctcccc gcctgactct cagcctggct atctgttcta 2580 gaatgtcctg cctggctgtg gcttctcctg tccctgctgt cgctccctct gggcctccca 2640 gtcctgggcg ccccaccacg cctcatctgt gacagccgag tcctggagag gtacctcttg 2700 gaggccaagg aggccgagaa tatcacggtg agaccccttc cccagcacat tccacagaac 2760 tcacgctcag ggcttcaggg aactcctccc agatccagga acctggcact tggtttaggg 2820 tggagttggg aagctagaca ctgcccccct acataagaat aagtctggtg gccccaaacc 2880 atacctggaa actaggcaag gagcaaagcc agcagatcct acggcctgtg ggccagggcc 2940 agagccttca gggacccttg actccccggg ctgtgtgcat ttcagacggg ctgtgctgaa 3000 cactgcagct tgaatgagaa tatcactgtc ccagacacca aagttaattt ctatgcctgg 3060 aagaggatgg aggtgagttc cttttttttt ttttttcctt tcttttggag aatctcattt 3120 gcgagcctga ttttggatga aagggagaat gatcgaggga aaggtaaaat ggagcagcag 3180 agatgaggct gcctgggcgc agaggctcac gtctataatc ccaggctgag atggccgaga 3240 tgggagaatt gcttgagccc tggagtttca gaccaaccta ggcagcatag tgagatcccc 3300 catctctaca aacatttaaa aaaattagtc aggtgaagtg gtgcatggtg gtagtcccag 3360 atatttggaa ggctgaggcg ggaggatcgc ttgagcccag gaatttgagg ctgcagtgag 3420 ctgtgatcac accactgcac tccagcctca gtgacagagt gaggccctgt ctcaaaaaag 3480 aaaagaaaaa agaaaaataa tgagggctgt atggaataca ttcattattc attcactcac 3540 tcactcactc attcattcat tcattcattc aacaagtctt attgcatacc ttctgtttgc 3600 tcagcttggt gcttggggct gctgaggggc aggagggaga gggtgacatg ggtcagctga 3660 ctcccagagt ccactccctg taggtcgggc agcaggccgt agaagtctgg cagggcctgg 3720 ccctgctgtc ggaagctgtc ctgcggggcc aggccctgtt ggtcaactct tcccagccgt 3780 gggagcccct gcagctgcat gtggataaag ccgtcagtgg ccttcgcagc ctcaccactc 3840 tgcttcgggc tctgggagcc caggtgagta ggagcggaca cttctgcttg ccctttctgt 3900 aagaagggga gaagggtctt gctaaggagt acaggaactg tccgtattcc ttccctttct 3960 gtggcactgc agcgacctcc tgttttctcc ttggcagaag gaagccatct cccctccaga 4020 tgcggcctca gctgctccac tccgaacaat cactgctgac actttccgca aactcttccg 4080 agtctactcc aatttcctcc ggggaaagct gaagctgtac acaggggagg cctgcaggac 4140 aggggacaga tgaccagctt gatatcgaat tatccatcac actggcggcc gctcgagtct 4200 agagggcccg tttaaacccg ctgatcagcc tcgactgtgc cttctagttg ccagccatct 4260 gttgtttgcc cctcccccgt gccttccttg accctggaag gtgccactcc cactgtcctt 4320 tcctaataaa atgaggaaat tgcatcgcat tgtctgagta ggtgtcattc tattctgggg 4380 ggtggggtgg ggcaggacag caagggggag gattgggaag acaatagcag gcatgctggg 4440 gatgcggtgg gctctatggc ttctgaggcg gaaagaacca gctggggctc tagggggtat 4500 ccccacgcgc cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg 4560 accgctacac ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc 4620 gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga 4680 tttagtgctt tacggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt 4740 gggccatcgc cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat 4800 agtggactct tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat 4860 ttataaggga ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa 4920 tttaacgcga attaattctg tggaatgtgt gtcagttagg gtgtggaaag tccccaggct 4980 ccccagcagg cagaagtatg caaagcatgc atctcaatta gtcagcaacc aggtgtggaa 5040 agtccccagg ctccccagca ggcagaagta tgcaaagcat gcatctcaat tagtcagcaa 5100 ccatagtccc gcccctaact ccgcccatcc cgcccctaac tccgcccagt tccgcccatt 5160 ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc gcctctgcct 5220 ctgagctatt ccagaagtag tgaggaggct tttttggagg cctaggcttt tgcaaaaagc 5280 tcccgggagc ttgtatatcc attttcggat ctgatcaaga gacaggatga ggatcgtttc 5340 gcatgattga acaagatgga ttgcacgcag gttctccggc cgcttgggtg gagaggctat 5400 tcggctatga ctgggcacaa cagacaatcg gctgctctga tgccgccgtg ttccggctgt 5460 cagcgcaggg gcgcccggtt ctttttgtca agaccgacct gtccggtgcc ctgaatgaac 5520 tgcaggacga ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct tgcgcagctg 5580 tgctcgacgt tgtcactgaa gcgggaaggg actggctgct attgggcgaa gtgccggggc 5640 aggatctcct gtcatctcac cttgctcctg ccgagaaagt atccatcatg gctgatgcaa 5700 tgcggcggct gcatacgctt gatccggcta cctgcccatt cgaccaccaa gcgaaacatc 5760 gcatcgagcg agcacgtact cggatggaag ccggtcttgt cgatcaggat gatctggacg 5820 aagagcatca ggggctcgcg ccagccgaac tgttcgccag gctcaaggcg cgcatgcccg 5880 acggcgagga tctcgtcgtg acccatggcg atgcctgctt gccgaatatc atggtggaaa 5940 atggccgctt ttctggattc atcgactgtg gccggctggg tgtggcggac cgctatcagg 6000 acatagcgtt ggctacccgt gatattgctg aagagcttgg cggcgaatgg gctgaccgct 6060 tcctcgtgct ttacggtatc gccgctcccg attcgcagcg catcgccttc tatcgccttc 6120 ttgacgagtt cttctgagcg ggactctggg gttcgaaatg accgaccaag cgacgcccaa 6180 cctgccatca cgagatttcg attccaccgc cgccttctat gaaaggttgg gcttcggaat 6240 cgttttccgg gacgccggct ggatgatcct ccagcgcggg gatctcatgc tggagttctt 6300 cgcccacccc aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac 6360 aaatttcaca aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat 6420 caatgtatct tatcatgtct gtataccgtc gacctctagc tagagcttgg cgtaatcatg 6480 gtcatagctg tttcctgtgt gaaattgtta tccgctcaca attccacaca acatacgagc 6540 cggaagcata aagtgtaaag cctggggtgc ctaatgagtg agctaactca cattaattgc 6600 gttgcgctca ctgcccgctt tccagtcggg aaacctgtcg tgccagctgc attaatgaat 6660 cggccaacgc gcggggagag gcggtttgcg tattgggcgc tcttccgctt cctcgctcac 6720 tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt 6780 aatacggtta tccacagaat caggggataa cgcaggaaag aacatgtgag caaaaggcca 6840 gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg tttttccata ggctccgccc 6900 ccctgacgag catcacaaaa atcgacgctc aagtcagagg tggcgaaacc cgacaggact 6960 ataaagatac caggcgtttc cccctggaag ctccctcgtg cgctctcctg ttccgaccct 7020 gccgcttacc ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcatag 7080 ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca 7140 cgaacccccc gttcagcccg accgctgcgc cttatccggt aactatcgtc ttgagtccaa 7200 cccggtaaga cacgacttat cgccactggc agcagccact ggtaacagga ttagcagagc 7260 gaggtatgta ggcggtgcta cagagttctt gaagtggtgg cctaactacg gctacactag 7320 aagaacagta tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg 7380 tagctcttga tccggcaaac aaaccaccgc tggtagcggt ttttttgttt gcaagcagca 7440 gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga 7500 cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat 7560 cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga 7620 gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg 7680 tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga 7740 gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc 7800 agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac 7860 tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc 7920 agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc 7980 gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc 8040 catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt 8100 ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc 8160 atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg 8220 tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag 8280 cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat 8340 cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc 8400 atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa 8460 aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta 8520 ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 8580 aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtc 8635 <210> 3 <211> 8657 <212> DNA <213> Artificial Sequence <220> <223> X3GC/GEPO vector <400> 3 gacggatcgg gagatcccta actccgccca gttccgccca ttctccgccc catggctgac 60 taattttttt tatttatgca gaggccgagg ccgcctctgc ctctgagcta ttccagaagt 120 agtgaggagg cttttttgga ggcctaggct tttgcaaaaa gctcccggga tggttcgacc 180 attgaactgc atcgtcgccg tgtcccaaaa tatggggatt ggcaagaacg gaaacctacc 240 ctggcctccg ctcaggaacg agttcaagta cttccaaaga atgaccacaa cctcttcagt 300 ggaaggtaaa cagaatctgg tgattatggg taggaaaacc tggttctcca ttcctgagaa 360 gaatcgacct ttaaaggaca gaattaatat agttctcagt agagaactca aagaaccacc 420 acgaggagct cattttcttg ccaaaagttt ggatgatgcc ttaagactta ttgaacaacc 480 ggaattggca agtaaagtag acatggtttg gatagtcgga ggcagttctg tttaccagga 540 agccatgaat caaccaggcc acctcagact ctttgtgaca aggatcatgc aggaatttga 600 aagtgacacg tttttcccag aaattgattt ggggaaatat aaacttctcc cagaataccc 660 aggcgtcctc tctgaggtcc aggaggaaaa aggcatcaag tataagtttg aagtctacga 720 gaagaaagac taacaggttc gaaatgaccg accaagcgac gcccaacctg ccatcacgag 780 atttcgattc caccgccgcc ttctatgaaa ggttgggctt cggaatcgtt ttccgggacg 840 ccggctggat gatcctccag cgcggggatc tcatgctgga gttcttcgcc caccccaact 900 tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata 960 aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttatc 1020 atgtctggga tctcccgatc ccctatggtg cactctcagt acaatctgct ctgatgccgc 1080 atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt agtgcgcgag 1140 caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga atctgcttag 1200 ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt gacattgatt 1260 attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc catatatgga 1320 gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca acgacccccg 1380 cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga ctttccattg 1440 acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc aagtgtatca 1500 tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc 1560 ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat tagtcatcgc 1620 tattaccatg gtgatgcggt tttggcagta catcaatggg cgtggatagc ggtttgactc 1680 acggggattt ccaagtctcc accccattga cgtcaatggg agtttgtttt ggcaccaaaa 1740 tcaacgggac tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa tgggcggtag 1800 gcgtgtacgg tgggaggtct atataagcag agctctctgg ctaactagag aacccactgc 1860 ttactggctt atcgaaatta atacgactca ctatagggag acccaagctg gctagcgttt 1920 aaacttaagc ttggtaccga gctcggatcc actagtaacg gccgccagtg tgctggaatt 1980 ctgcagattc gaggtcgacg gtatcgataa gcttccaggc gcggagatgg gggtgcacgg 2040 tgagtactcg cgggctgggc gctcccgccc gcccgggtcc ctgtttgagc ggggatttag 2100 cgccccggct attggccagg aggtggctgg gttcaaggac cggcgacttg tcaaggaccc 2160 cggaaggggg aggggggtgg ggcagcctcc acgtgccagc ggggacttgg gggagtcctt 2220 ggggatggca aaaacctgac ctgtgaaggg gacacagttt gggggttgag gggaagaagg 2280 tttgggggtt ctgctgtgcc agtggagagg aagctgataa gctgataacc tgggcgctgg 2340 agccaccact tatctgccag aggggaagcc tctgtcacac caggattgaa gtttggccgg 2400 agaagtggat gctggtagct gggggtgggg tgtgcacacg gcagcaggat tgaatgaagg 2460 ccagggaggc agcacctgag tgcttgcatg gttggggaca ggaaggacga gctggggcag 2520 agacgtgggg atgaaggaag ctgtccttcc acagccaccc ttctccctcc ccgcctgact 2580 ctcagcctgg ctatctgttc tagaatgtcc tgcctggctg tggcttctcc tgtccctgct 2640 gtcgctccct ctgggcctcc cagtcctggg cgccccacca cgcctcatct gtgacagccg 2700 agtcctggag aggtacctct tggaggccaa ggaggccgag aatatcacgg tgagacccct 2760 tccccagcac attccacaga actcacgctc agggcttcag ggaactcctc ccagatccag 2820 gaacctggca cttggtttag ggtggagttg ggaagctaga cactgccccc ctacataaga 2880 ataagtctgg tggccccaaa ccatacctgg aaactaggca aggagcaaag ccagcagatc 2940 ctacggcctg tgggccaggg ccagagcctt cagggaccct tgactccccg ggctgtgtgc 3000 atttcagacg ggctgtgctg aacactgcag cttgaatgag aatatcactg tcccagacac 3060 caaagttaat ttctatgcct ggaagaggat ggaggtgagt tccttttttt ttttttttcc 3120 tttcttttgg agaatctcat ttgcgagcct gattttggat gaaagggaga atgatcgagg 3180 gaaaggtaaa atggagcagc agagatgagg ctgcctgggc gcagaggctc acgtctataa 3240 tcccaggctg agatggccga gatgggagaa ttgcttgagc cctggagttt cagaccaacc 3300 taggcagcat agtgagatcc cccatctcta caaacattta aaaaaattag tcaggtgaag 3360 tggtgcatgg tggtagtccc agatatttgg aaggctgagg cgggaggatc gcttgagccc 3420 aggaatttga ggctgcagtg agctgtgatc acaccactgc actccagcct cagtgacaga 3480 gtgaggccct gtctcaaaaa agaaaagaaa aaagaaaaat aatgagggct gtatggaata 3540 cattcattat tcattcactc actcactcac tcattcattc attcattcat tcaacaagtc 3600 ttattgcata ccttctgttt gctcagcttg gtgcttgggg ctgctgaggg gcaggaggga 3660 gagggtgaca tgggtcagct gactcccaga gtccactccc tgtaggtcgg gcagcaggcc 3720 gtagaagtct ggcagggcct ggccctgctg tcggaagctg tcctgcgggg ccaggccctg 3780 ttggtcaact cttcccagcc gtgggagccc ctgcagctgc atgtggataa agccgtcagt 3840 ggccttcgca gcctcaccac tctgcttcgg gctctgggag cccaggtgag taggagcgga 3900 cacttctgct tgccctttct gtaagaaggg gagaagggtc ttgctaagga gtacaggaac 3960 tgtccgtatt ccttcccttt ctgtggcact gcagcgacct cctgttttct ccttggcaga 4020 aggaagccat ctcccctcca gatgcggcct cagctgctcc actccgaaca atcactgctg 4080 acactttccg caaactcttc cgagtctact ccaatttcct ccggggaaag ctgaagctgt 4140 acacagggga ggcctgcagg acaggggaca gatgaccagc ttgatatcga attatccatc 4200 acactggcgg ccgctcgagt ctagagggcc cgtttaaacc cgctgatcag cctcgactgt 4260 gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga 4320 aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag 4380 taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga 4440 agacaatagc aggcatgctg gggatgcggt gggctctatg gcttctgagg cggaaagaac 4500 cagctggggc tctagggggt atccccacgc gccctgtagc ggcgcattaa gcgcggcggg 4560 tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt 4620 cgctttcttc ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg 4680 ggggctccct ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga 4740 ttagggtgat ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac 4800 gttggagtcc acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc 4860 tatctcggtc tattcttttg atttataagg gattttgccg atttcggcct attggttaaa 4920 aaatgagctg atttaacaaa aatttaacgc gaattaattc tgtggaatgt gtgtcagtta 4980 gggtgtggaa agtccccagg ctccccagca ggcagaagta tgcaaagcat gcatctcaat 5040 tagtcagcaa ccaggtgtgg aaagtcccca ggctccccag caggcagaag tatgcaaagc 5100 atgcatctca attagtcagc aaccatagtc ccgcccctaa ctccgcccat cccgccccta 5160 actccgccca gttccgccca ttctccgccc catggctgac taattttttt tatttatgca 5220 gaggccgagg ccgcctctgc ctctgagcta ttccagaagt agtgaggagg cttttttgga 5280 ggcctaggct tttgcaaaaa gctcccggga gcttgtatat ccattttcgg atctgatcaa 5340 gagacaggat gaggatcgtt tcgcatgatt gaacaagatg gattgcacgc aggttctccg 5400 gccgcttggg tggagaggct attcggctat gactgggcac aacagacaat cggctgctct 5460 gatgccgccg tgttccggct gtcagcgcag gggcgcccgg ttctttttgt caagaccgac 5520 ctgtccggtg ccctgaatga actgcaggac gaggcagcgc ggctatcgtg gctggccacg 5580 acgggcgttc cttgcgcagc tgtgctcgac gttgtcactg aagcgggaag ggactggctg 5640 ctattgggcg aagtgccggg gcaggatctc ctgtcatctc accttgctcc tgccgagaaa 5700 gtatccatca tggctgatgc aatgcggcgg ctgcatacgc ttgatccggc tacctgccca 5760 ttcgaccacc aagcgaaaca tcgcatcgag cgagcacgta ctcggatgga agccggtctt 5820 gtcgatcagg atgatctgga cgaagagcat caggggctcg cgccagccga actgttcgcc 5880 aggctcaagg cgcgcatgcc cgacggcgag gatctcgtcg tgacccatgg cgatgcctgc 5940 ttgccgaata tcatggtgga aaatggccgc ttttctggat tcatcgactg tggccggctg 6000 ggtgtggcgg accgctatca ggacatagcg ttggctaccc gtgatattgc tgaagagctt 6060 ggcggcgaat gggctgaccg cttcctcgtg ctttacggta tcgccgctcc cgattcgcag 6120 cgcatcgcct tctatcgcct tcttgacgag ttcttctgag cgggactctg gggttcgaaa 6180 tgaccgacca agcgacgccc aacctgccat cacgagattt cgattccacc gccgccttct 6240 atgaaaggtt gggcttcgga atcgttttcc gggacgccgg ctggatgatc ctccagcgcg 6300 gggatctcat gctggagttc ttcgcccacc ccaacttgtt tattgcagct tataatggtt 6360 acaaataaag caatagcatc acaaatttca caaataaagc atttttttca ctgcattcta 6420 gttgtggttt gtccaaactc atcaatgtat cttatcatgt ctgtataccg tcgacctcta 6480 gctagagctt ggcgtaatca tggtcatagc tgtttcctgt gtgaaattgt tatccgctca 6540 caattccaca caacatacga gccggaagca taaagtgtaa agcctggggt gcctaatgag 6600 tgagctaact cacattaatt gcgttgcgct cactgcccgc tttccagtcg ggaaacctgt 6660 cgtgccagct gcattaatga atcggccaac gcgcggggag aggcggtttg cgtattgggc 6720 gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg 6780 tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa 6840 agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg 6900 cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga 6960 ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg 7020 tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg 7080 gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc 7140 gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg 7200 gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca 7260 ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt 7320 ggcctaacta cggctacact agaagaacag tatttggtat ctgcgctctg ctgaagccag 7380 ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg 7440 gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt 7500 tgatcttttc tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg 7560 tcatgagatt atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta 7620 aatcaatcta aagtatatat gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg 7680 aggcacctat ctcagcgatc tgtctatttc gttcatccat agttgcctga ctccccgtcg 7740 tgtagataac tacgatacgg gagggcttac catctggccc cagtgctgca atgataccgc 7800 gagacccacg ctcaccggct ccagatttat cagcaataaa ccagccagcc ggaagggccg 7860 agcgcagaag tggtcctgca actttatccg cctccatcca gtctattaat tgttgccggg 7920 aagctagagt aagtagttcg ccagttaata gtttgcgcaa cgttgttgcc attgctacag 7980 gcatcgtggt gtcacgctcg tcgtttggta tggcttcatt cagctccggt tcccaacgat 8040 caaggcgagt tacatgatcc cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc 8100 cgatcgttgt cagaagtaag ttggccgcag tgttatcact catggttatg gcagcactgc 8160 ataattctct tactgtcatg ccatccgtaa gatgcttttc tgtgactggt gagtactcaa 8220 ccaagtcatt ctgagaatag tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac 8280 gggataatac cgcgccacat agcagaactt taaaagtgct catcattgga aaacgttctt 8340 cggggcgaaa actctcaagg atcttaccgc tgttgagatc cagttcgatg taacccactc 8400 gtgcacccaa ctgatcttca gcatctttta ctttcaccag cgtttctggg tgagcaaaaa 8460 caggaaggca aaatgccgca aaaaagggaa taagggcgac acggaaatgt tgaatactca 8520 tactcttcct ttttcaatat tattgaagca tttatcaggg ttattgtctc atgagcggat 8580 acatatttga atgtatttag aaaaataaac aaataggggt tccgcgcaca tttccccgaa 8640 aagtgccacc tgacgtc 8657 <210> 4 <211> 8691 <212> DNA <213> Artificial Sequence <220> <223> x6GC/GEPO vector <400> 4 gacggatcgg gagatcccat agtcccgccc ctaactccgc ccatcccgcc cctaactccg 60 cccagttccg cccattctcc gccccatggc tgactaattt tttttattta tgcagaggcc 120 gaggccgcct ctgcctctga gctattccag aagtagtgag gaggcttttt tggaggccta 180 ggcttttgca aaaagctccc gggatggttc gaccattgaa ctgcatcgtc gccgtgtccc 240 aaaatatggg gattggcaag aacggaaacc taccctggcc tccgctcagg aacgagttca 300 agtacttcca aagaatgacc acaacctctt cagtggaagg taaacagaat ctggtgatta 360 tgggtaggaa aacctggttc tccattcctg agaagaatcg acctttaaag gacagaatta 420 atatagttct cagtagagaa ctcaaagaac caccacgagg agctcatttt cttgccaaaa 480 gtttggatga tgccttaaga cttattgaac aaccggaatt ggcaagtaaa gtagacatgg 540 tttggatagt cggaggcagt tctgtttacc aggaagccat gaatcaacca ggccacctca 600 gactctttgt gacaaggatc atgcaggaat ttgaaagtga cacgtttttc ccagaaattg 660 atttggggaa atataaactt ctcccagaat acccaggcgt cctctctgag gtccaggagg 720 aaaaaggcat caagtataag tttgaagtct acgagaagaa agactaacag gttcgaaatg 780 accgaccaag cgacgcccaa cctgccatca cgagatttcg attccaccgc cgccttctat 840 gaaaggttgg gcttcggaat cgttttccgg gacgccggct ggatgatcct ccagcgcggg 900 gatctcatgc tggagttctt cgcccacccc aacttgttta ttgcagctta taatggttac 960 aaataaagca atagcatcac aaatttcaca aataaagcat ttttttcact gcattctagt 1020 tgtggtttgt ccaaactcat caatgtatct tatcatgtct gggatctccc gatcccctat 1080 ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagtat ctgctccctg 1140 cttgtgtgtt ggaggtcgct gagtagtgcg cgagcaaaat ttaagctaca acaaggcaag 1200 gcttgaccga caattgcatg aagaatctgc ttagggttag gcgttttgcg ctgcttcgcg 1260 atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa tagtaatcaa 1320 ttacggggtc attagttcat agcccatata tggagttccg cgttacataa cttacggtaa 1380 atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata atgacgtatg 1440 ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggag tatttacggt 1500 aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc cctattgacg 1560 tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta tgggactttc 1620 ctacttggca gtacatctac gtattagtca tcgctattac catggtgatg cggttttggc 1680 agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt ctccacccca 1740 ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca aaatgtcgta 1800 acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag gtctatataa 1860 gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa attaatacga 1920 ctcactatag ggagacccaa gctggctagc gtttaaactt aagcttggta ccgagctcgg 1980 atccactagt aacggccgcc agtgtgctgg aattctgcag attcgaggtc gacggtatcg 2040 ataagcttcc aagcgcggag atgggggtgc acggtgagta ctcgcgggct gggcgctccc 2100 gcccgcccgg gtccctgttt gagcggggat ttagcgcccc ggctattggc caggaggtgg 2160 ctgggttcaa ggaccggcga cttgtcaagg accccggaag ggggaggggg gtggggcagc 2220 ctccacgtgc cagcggggac ttgggggagt ccttggggat ggcaaaaacc tgacctgtga 2280 aggggacaca gtttgggggt tgaggggaag aaggtttggg ggttctgctg tgccagtgga 2340 gaggaagctg ataagctgat aacctgggcg ctggagccac cacttatctg ccagagggga 2400 agcctctgtc acaccaggat tgaagtttgg ccggagaagt ggatgctggt agctgggggt 2460 ggggtgtgca cacggcagca ggattgaatg aaggccaggg aggcagcacc tgagtgcttg 2520 catggttggg gacaggaagg acgagctggg gcagagacgt ggggatgaag gaagctgtcc 2580 ttccacagcc acccttctcc ctccccgcct gactctcagc ctggctatct gttctagaat 2640 gtcctgcctg gctgtggctt ctcctgtccc tgctgtcgct ccctctgggc ctcccagtcc 2700 tgggcgcccc accacgcctc atctgtgaca gccgagtcct ggagaggtac ctcttggagg 2760 ccaaggaggc cgagaatatc acggtgagac cccttcccca gcacattcca cagaactcac 2820 gctcagggct tcagggaact cctcccagat ccaggaacct ggcacttggt ttagggtgga 2880 gttgggaagc tagacactgc ccccctacat aagaataagt ctggtggccc caaaccatac 2940 ctggaaacta ggcaaggagc aaagccagca gatcctacgg cctgtgggcc agggccagag 3000 ccttcaggga cccttgactc cccgggctgt gtgcatttca gacgggctgt gctgaacact 3060 gcagcttgaa tgagaatatc actgtcccag acaccaaagt taatttctat gcctggaaga 3120 ggatggaggt gagttccttt tttttttttt ttcctttctt ttggagaatc tcatttgcga 3180 gcctgatttt ggatgaaagg gagaatgatc gagggaaagg taaaatggag cagcagagat 3240 gaggctgcct gggcgcagag gctcacgtct ataatcccag gctgagatgg ccgagatggg 3300 agaattgctt gagccctgga gtttcagacc aacctaggca gcatagtgag atcccccatc 3360 tctacaaaca tttaaaaaaa ttagtcaggt gaagtggtgc atggtggtag tcccagatat 3420 ttggaaggct gaggcgggag gatcgcttga gcccaggaat ttgaggctgc agtgagctgt 3480 gatcacacca ctgcactcca gcctcagtga cagagtgagg ccctgtctca aaaaagaaaa 3540 gaaaaaagaa aaataatgag ggctgtatgg aatacattca ttattcattc actcactcac 3600 tcactcattc attcattcat tcattcaaca agtcttattg cataccttct gtttgctcag 3660 cttggtgctt ggggctgctg aggggcagga gggagagggt gacatgggtc agctgactcc 3720 cagagtccac tccctgtagg tcgggcagca ggccgtagaa gtctggcagg gcctggccct 3780 gctgtcggaa gctgtcctgc ggggccaggc cctgttggtc aactcttccc agccgtggga 3840 gcccctgcag ctgcatgtgg ataaagccgt cagtggcctt cgcagcctca ccactctgct 3900 tcgggctctg ggagcccagg tgagtaggag cggacacttc tgcttgccct ttctgtaaga 3960 aggggagaag ggtcttgcta aggagtacag gaactgtccg tattccttcc ctttctgtgg 4020 cactgcagcg acctcctgtt ttctccttgg cagaaggaag ccatctcccc tccagatgcg 4080 gcctcagctg ctccactccg aacaatcact gctgacactt tccgcaaact cttccgagtc 4140 tactccaatt tcctccgggg aaagctgaag ctgtacacag gggaggcctg caggacaggg 4200 gacagatgac cagcttgata tcgaattatc catcacactg gcggccgctc gagtctagag 4260 ggcccgttta aacccgctga tcagcctcga ctgtgccttc tagttgccag ccatctgttg 4320 tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc cactcccact gtcctttcct 4380 aataaaatga ggaaattgca tcgcattgtc tgagtaggtg tcattctatt ctggggggtg 4440 gggtggggca ggacagcaag ggggaggatt gggaagacaa tagcaggcat gctggggatg 4500 cggtgggctc tatggcttct gaggcggaaa gaaccagctg gggctctagg gggtatcccc 4560 acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg 4620 ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca 4680 cgttcgccgg ctttccccgt caagctctaa atcgggggct ccctttaggg ttccgattta 4740 gtgctttacg gcacctcgac cccaaaaaac ttgattaggg tgatggttca cgtagtgggc 4800 catcgccctg atagacggtt tttcgccctt tgacgttgga gtccacgttc tttaatagtg 4860 gactcttgtt ccaaactgga acaacactca accctatctc ggtctattct tttgatttat 4920 aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa caaaaattta 4980 acgcgaatta attctgtgga atgtgtgtca gttagggtgt ggaaagtccc caggctcccc 5040 agcaggcaga agtatgcaaa gcatgcatct caattagtca gcaaccaggt gtggaaagtc 5100 cccaggctcc ccagcaggca gaagtatgca aagcatgcat ctcaattagt cagcaaccat 5160 agtcccgccc ctaactccgc ccatcccgcc cctaactccg cccagttccg cccattctcc 5220 gccccatggc tgactaattt tttttattta tgcagaggcc gaggccgcct ctgcctctga 5280 gctattccag aagtagtgag gaggcttttt tggaggccta ggcttttgca aaaagctccc 5340 gggagcttgt atatccattt tcggatctga tcaagagaca ggatgaggat cgtttcgcat 5400 gattgaacaa gatggattgc acgcaggttc tccggccgct tgggtggaga ggctattcgg 5460 ctatgactgg gcacaacaga caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc 5520 gcaggggcgc ccggttcttt ttgtcaagac cgacctgtcc ggtgccctga atgaactgca 5580 ggacgaggca gcgcggctat cgtggctggc cacgacgggc gttccttgcg cagctgtgct 5640 cgacgttgtc actgaagcgg gaagggactg gctgctattg ggcgaagtgc cggggcagga 5700 tctcctgtca tctcaccttg ctcctgccga gaaagtatcc atcatggctg atgcaatgcg 5760 gcggctgcat acgcttgatc cggctacctg cccattcgac caccaagcga aacatcgcat 5820 cgagcgagca cgtactcgga tggaagccgg tcttgtcgat caggatgatc tggacgaaga 5880 gcatcagggg ctcgcgccag ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg 5940 cgaggatctc gtcgtgaccc atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg 6000 ccgcttttct ggattcatcg actgtggccg gctgggtgtg gcggaccgct atcaggacat 6060 agcgttggct acccgtgata ttgctgaaga gcttggcggc gaatgggctg accgcttcct 6120 cgtgctttac ggtatcgccg ctcccgattc gcagcgcatc gccttctatc gccttcttga 6180 cgagttcttc tgagcgggac tctggggttc gaaatgaccg accaagcgac gcccaacctg 6240 ccatcacgag atttcgattc caccgccgcc ttctatgaaa ggttgggctt cggaatcgtt 6300 ttccgggacg ccggctggat gatcctccag cgcggggatc tcatgctgga gttcttcgcc 6360 caccccaact tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat 6420 ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat 6480 gtatcttatc atgtctgtat accgtcgacc tctagctaga gcttggcgta atcatggtca 6540 tagctgtttc ctgtgtgaaa ttgttatccg ctcacaattc cacacaacat acgagccgga 6600 agcataaagt gtaaagcctg gggtgcctaa tgagtgagct aactcacatt aattgcgttg 6660 cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc 6720 caacgcgcgg ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc gctcactgac 6780 tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata 6840 cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa 6900 aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct 6960 gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa 7020 agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg 7080 cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca 7140 cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa 7200 ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg 7260 gtaagacacg acttatcgcc actggcagca gccactggta acaggattag cagagcgagg 7320 tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga 7380 acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc 7440 tcttgatccg gcaaacaaac caccgctggt agcggttttt ttgtttgcaa gcagcagatt 7500 acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct 7560 cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc 7620 acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa 7680 acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta 7740 tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc 7800 ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat 7860 ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta 7920 tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt 7980 aatagtttgc gcaacgttgt tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt 8040 ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg 8100 ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc 8160 gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc 8220 gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg 8280 cggcgaccga gttgctcttg cccggcgtca atacgggata ataccgcgcc acatagcaga 8340 actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta 8400 ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct 8460 tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag 8520 ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca atattattga 8580 agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat 8640 aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt c 8691 <210> 5 <211> 7022 <212> DNA <213> Artificial Sequence <220> <223> X0GC/dhfr vector <400> 5 gacggatcgg gagatccgac atgataagat acattgatga gtttggacaa accacaacta 60 gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga tgctattgct ttatttgtaa 120 ccattataag ctgcaataaa caagttaaca acaacaattg cattcatttt atgtttcagg 180 ttcaggggga ggtgtgggag gttttttaaa gcaagtaaaa cctctacaaa tgtggtatgg 240 ctgattatga tctctagtca aggcactata catcaaatat tccttattaa cccctttaca 300 aattaaaaag ctaaaggtac acaatttttg agcatagtta ttaatagcag acactctatg 360 cctgtgtgga gtaagaaaaa acagtatgtt atgattataa ctgttatgcc tacttataaa 420 ggttacagaa tatttttcca taattttctt gtatagcagt gcagcttttt cctttgtggt 480 gtaaatagca aagcaagcaa gagttctatt actaaacaca gcatgactca aaaaacttag 540 caattctgaa ggaaagtcct tggggtcttc tacctttctc ttcttttttg gaggagtaga 600 atgttgagag tcagcagtag cctcatcatc actagatggc atttcttctg agcaaaacag 660 gttttcctca ttaaaggcat tccaccactg ctcccattca tcagttccat aggttggaat 720 ctaaaataca caaacaatta gaatcagtag tttaacacat tatacactta aaaattttat 780 atttacctta gagctttaaa tctctgtagg tagtttgtcc aattatgtca caccacagaa 840 gtaaggttcc ttcacaaaga tccaaagcca gcaaaagtcc catggtctta taaaaatgca 900 tagctttagg aggggagcag agaacttgaa agcatcttcc tgttagtctt tcttctcgta 960 gacttcaaac ttatacttga tgcctttttc ctcctggacc tcagagagga cgcctgggta 1020 ttctgggaga agtttatatt tccccaaatc aatttctggg aaaaacgtgt cactttcaaa 1080 ttcctgcatg atccttgtca caaagagtct gaggtggcct ggttgattca tggcttcctg 1140 gtaaacagaa ctgcctccga ctatccaaac catgtctact ttacttgcca attccggttg 1200 ttcaataagt cttaaggcat catccaaact tttggcaaga aaatgagctc ctcgtggtgg 1260 ttctttgagt tctctactga gaactatatt aattctgtcc tttaaaggtc gattcttctc 1320 aggaatggag aaccaggttt tcctacccat aatcaccaga ttctgtttac cttccactga 1380 agaggttgtg gtcattcttt ggaagtactt gaactcgttc ctgagcggag gccagggtag 1440 gtctccgttc ttgccaatcc ccatattttg ggacacggcg acgatgcagt tcaatggtcg 1500 aaccatgatg gcagcgggga taaaatccta ccagccttca cgctaggatt gccgtcaagt 1560 ttggcgcgaa atcgcagccc tgagctgtgg atctcccgat cccctatggt gcactctcag 1620 tacaatctgc tctgatgccg catagttaag ccagtatctg ctccctgctt gtgtgttgga 1680 ggtcgctgag tagtgcgcga gcaaaattta agctacaaca aggcaaggct tgaccgacaa 1740 ttgcatgaag aatctgctta gggttaggcg ttttgcgctg cttcgcgatg tacgggccag 1800 atatacgcgt tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt 1860 agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg 1920 ctgaccgccc aacgaccccc gcccattgac gtcaataatg acgtatgttc ccatagtaac 1980 gccaataggg actttccatt gacgtcaatg ggtggagtat ttacggtaaa ctgcccactt 2040 ggcagtacat caagtgtatc atatgccaag tacgccccct attgacgtca atgacggtaa 2100 atggcccgcc tggcattatg cccagtacat gaccttatgg gactttccta cttggcagta 2160 catctacgta ttagtcatcg ctattaccat ggtgatgcgg ttttggcagt acatcaatgg 2220 gcgtggatag cggtttgact cacggggatt tccaagtctc caccccattg acgtcaatgg 2280 gagtttgttt tggcaccaaa atcaacggga ctttccaaaa tgtcgtaaca actccgcccc 2340 attgacgcaa atgggcggta ggcgtgtacg gtgggaggtc tatataagca gagctctctg 2400 gctaactaga gaacccactg cttactggct tatcgaaatt aatacgactc actataggga 2460 gacccaagct tggtaccgag ctcggatcca ctagtaacgg ccgccagtgt gctggaattc 2520 tgcagatatc catcacactg gcggccgctc gagcatgcat ctagagggcc ctattctata 2580 gtgtcaccta aatgctagag ctcgctgatc agcctcgact gtgccttcta gttgccagcc 2640 atctgttgtt tgcccctccc ccgtgccttc cttgaccctg gaaggtgcca ctcccactgt 2700 cctttcctaa taaaatgagg aaattgcatc gcattgtctg agtaggtgtc attctattct 2760 ggggggtggg gtggggcagg acagcaaggg ggaggattgg gaagacaata gcaggcatgc 2820 tggggatgcg gtgggctcta tggcttctga ggcggaaaga accagctggg gctctagggg 2880 gtatccccac gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag 2940 cgtgaccgct acacttgcca gcgccctagc gcccgctcct ttcgctttct tcccttcctt 3000 tctcgccacg ttcgccggct ttccccgtca agctctaaat cgggggctcc ctttagggtt 3060 ccgatttagt gctttacggc acctcgaccc caaaaaactt gattagggtg atggttcacg 3120 tagtgggcca tcgccctgat agacggtttt tcgccctttg acgttggagt ccacgttctt 3180 taatagtgga ctcttgttcc aaactggaac aacactcaac cctatctcgg tctattcttt 3240 tgatttataa gggattttgc cgatttcggc ctattggtta aaaaatgagc tgatttaaca 3300 aaaatttaac gcgaattaat tctgtggaat gtgtgtcagt tagggtgtgg aaagtcccca 3360 ggctccccag caggcagaag tatgcaaagc atgcatctca attagtcagc aaccaggtgt 3420 ggaaagtccc caggctcccc agcaggcaga agtatgcaaa gcatgcatct caattagtca 3480 gcaaccatag tcccgcccct aactccgccc atcccgcccc taactccgcc cagttccgcc 3540 cattctccgc cccatggctg actaattttt tttatttatg cagaggccga ggccgcctct 3600 gcctctgagc tattccagaa gtagtgagga ggcttttttg gaggcctagg cttttgcaaa 3660 aagctcccgg gagcttgtat atccattttc ggatctgatc aagagacagg atgaggatcg 3720 tttcgcatga ttgaacaaga tggattgcac gcaggttctc cggccgcttg ggtggagagg 3780 ctattcggct atgactgggc acaacagaca atcggctgct ctgatgccgc cgtgttccgg 3840 ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg acctgtccgg tgccctgaat 3900 gaactgcagg acgaggcagc gcggctatcg tggctggcca cgacgggcgt tccttgcgca 3960 gctgtgctcg acgttgtcac tgaagcggga agggactggc tgctattggg cgaagtgccg 4020 gggcaggatc tcctgtcatc tcaccttgct cctgccgaga aagtatccat catggctgat 4080 gcaatgcggc ggctgcatac gcttgatccg gctacctgcc cattcgacca ccaagcgaaa 4140 catcgcatcg agcgagcacg tactcggatg gaagccggtc ttgtcgatca ggatgatctg 4200 gacgaagagc atcaggggct cgcgccagcc gaactgttcg ccaggctcaa ggcgcgcatg 4260 cccgacggcg aggatctcgt cgtgacccat ggcgatgcct gcttgccgaa tatcatggtg 4320 gaaaatggcc gcttttctgg attcatcgac tgtggccggc tgggtgtggc ggaccgctat 4380 caggacatag cgttggctac ccgtgatatt gctgaagagc ttggcggcga atgggctgac 4440 cgcttcctcg tgctttacgg tatcgccgct cccgattcgc agcgcatcgc cttctatcgc 4500 cttcttgacg agttcttctg agcgggactc tggggttcga aatgaccgac caagcgacgc 4560 ccaacctgcc atcacgagat ttcgattcca ccgccgcctt ctatgaaagg ttgggcttcg 4620 gaatcgtttt ccgggacgcc ggctggatga tcctccagcg cggggatctc atgctggagt 4680 tcttcgccca ccccaacttg tttattgcag cttataatgg ttacaaataa agcaatagca 4740 tcacaaattt cacaaataaa gcattttttt cactgcattc tagttgtggt ttgtccaaac 4800 tcatcaatgt atcttatcat gtctgtatac cgtcgacctc tagctagagc ttggcgtaat 4860 catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 4920 gagccggaag cataaagtgt aaagcctggg gtgcctaatg agtgagctaa ctcacattaa 4980 ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 5040 gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 5100 tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 5160 cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 5220 gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 5280 gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 5340 gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 5400 ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 5460 atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 5520 tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 5580 ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 5640 gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 5700 ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 5760 ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 5820 agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 5880 ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 5940 aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 6000 tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 6060 cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 6120 tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 6180 cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 6240 ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 6300 gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 6360 gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 6420 gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 6480 gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 6540 tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 6600 aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 6660 cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 6720 caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 6780 cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 6840 ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 6900 aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 6960 tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 7020 tc 7022 <210> 6 <211> 6422 <212> DNA <213> Artificial Sequence <220> <223> X1GC/dhfr vector <400> 6 gacggatcgg gagatccatt ctccgcccca tggctgacta atttttttta tttatgcaga 60 ggccgaggcc gcctctgcct ctgagctatt ccagaagtag tgaggaggct tttttggagg 120 cctaggcttt tgcaaaaagc tcccgggatg gttcgaccat tgaactgcat cgtcgccgtg 180 tcccaaaata tggggattgg caagaacgga aacctaccct ggcctccgct caggaacgag 240 ttcaagtact tccaaagaat gaccacaacc tcttcagtgg aaggtaaaca gaatctggtg 300 attatgggta ggaaaacctg gttctccatt cctgagaaga atcgaccttt aaaggacaga 360 attaatatag ttctcagtag agaactcaaa gaaccaccac gaggagctca ttttcttgcc 420 aaaagtttgg atgatgcctt aagacttatt gaacaaccgg aattggcaag taaagtagac 480 atggtttgga tagtcggagg cagttctgtt taccaggaag ccatgaatca accaggccac 540 ctcagactct ttgtgacaag gatcatgcag gaatttgaaa gtgacacgtt tttcccagaa 600 attgatttgg ggaaatataa acttctccca gaatacccag gcgtcctctc tgaggtccag 660 gaggaaaaag gcatcaagta taagtttgaa gtctacgaga agaaagacta acaggttcga 720 aatgaccgac caagcgacgc ccaacctgcc atcacgagat ttcgattcca ccgccgcctt 780 ctatgaaagg ttgggcttcg gaatcgtttt ccgggacgcc ggctggatga tcctccagcg 840 cggggatctc atgctggagt tcttcgccca ccccaacttg tttattgcag cttataatgg 900 ttacaaataa agcaatagca tcacaaattt cacaaataaa gcattttttt cactgcattc 960 tagttgtggt ttgtccaaac tcatcaatgt atcttatcat gtctgggatc tcccgatccc 1020 ctatggtgca ctctcagtac aatctgctct gatgccgcat agttaagcca gtatctgctc 1080 cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca aaatttaagc tacaacaagg 1140 caaggcttga ccgacaattg catgaagaat ctgcttaggg ttaggcgttt tgcgctgctt 1200 cgcgatgtac gggccagata tacgcgttga cattgattat tgactagtta ttaatagtaa 1260 tcaattacgg ggtcattagt tcatagccca tatatggagt tccgcgttac ataacttacg 1320 gtaaatggcc cgcctggctg accgcccaac gacccccgcc cattgacgtc aataatgacg 1380 tatgttccca tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta 1440 cggtaaactg cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt 1500 gacgtcaatg acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac 1560 tttcctactt ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt 1620 tggcagtaca tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac 1680 cccattgacg tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt 1740 cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc gtgtacggtg ggaggtctat 1800 ataagcagag ctctctggct aactagagaa cccactgctt actggcttat cgaaattaat 1860 acgactcact atagggagac ccaagctggc tagcgtttaa acttaagctt ggtaccgagc 1920 tcggatccac tagtccagtg tggtggaatt ctgcagatat ccagcacagt ggcggccgct 1980 cgagtctaga gggcccgttt aaacccgctg atcagcctcg actgtgcctt ctagttgcca 2040 gccatctgtt gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg ccactcccac 2100 tgtcctttcc taataaaatg aggaaattgc atcgcattgt ctgagtaggt gtcattctat 2160 tctggggggt ggggtggggc aggacagcaa gggggaggat tgggaagaca atagcaggca 2220 tgctggggat gcggtgggct ctatggcttc tgaggcggaa agaaccagct ggggctctag 2280 ggggtatccc cacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 2340 cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 2400 ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 2460 gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 2520 acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 2580 ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 2640 ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 2700 acaaaaattt aacgcgaatt aattctgtgg aatgtgtgtc agttagggtg tggaaagtcc 2760 ccaggctccc cagcaggcag aagtatgcaa agcatgcatc tcaattagtc agcaaccagg 2820 tgtggaaagt ccccaggctc cccagcaggc agaagtatgc aaagcatgca tctcaattag 2880 tcagcaacca tagtcccgcc cctaactccg cccatcccgc ccctaactcc gcccagttcc 2940 gcccattctc cgccccatgg ctgactaatt ttttttattt atgcagaggc cgaggccgcc 3000 tctgcctctg agctattcca gaagtagtga ggaggctttt ttggaggcct aggcttttgc 3060 aaaaagctcc cgggagcttg tatatccatt ttcggatctg atcaagagac aggatgagga 3120 tcgtttcgca tgattgaaca agatggattg cacgcaggtt ctccggccgc ttgggtggag 3180 aggctattcg gctatgactg ggcacaacag acaatcggct gctctgatgc cgccgtgttc 3240 cggctgtcag cgcaggggcg cccggttctt tttgtcaaga ccgacctgtc cggtgccctg 3300 aatgaactgc aggacgaggc agcgcggcta tcgtggctgg ccacgacggg cgttccttgc 3360 gcagctgtgc tcgacgttgt cactgaagcg ggaagggact ggctgctatt gggcgaagtg 3420 ccggggcagg atctcctgtc atctcacctt gctcctgccg agaaagtatc catcatggct 3480 gatgcaatgc ggcggctgca tacgcttgat ccggctacct gcccattcga ccaccaagcg 3540 aaacatcgca tcgagcgagc acgtactcgg atggaagccg gtcttgtcga tcaggatgat 3600 ctggacgaag agcatcaggg gctcgcgcca gccgaactgt tcgccaggct caaggcgcgc 3660 atgcccgacg gcgaggatct cgtcgtgacc catggcgatg cctgcttgcc gaatatcatg 3720 gtggaaaatg gccgcttttc tggattcatc gactgtggcc ggctgggtgt ggcggaccgc 3780 tatcaggaca tagcgttggc tacccgtgat attgctgaag agcttggcgg cgaatgggct 3840 gaccgcttcc tcgtgcttta cggtatcgcc gctcccgatt cgcagcgcat cgccttctat 3900 cgccttcttg acgagttctt ctgagcggga ctctggggtt cgaaatgacc gaccaagcga 3960 cgcccaacct gccatcacga gatttcgatt ccaccgccgc cttctatgaa aggttgggct 4020 tcggaatcgt tttccgggac gccggctgga tgatcctcca gcgcggggat ctcatgctgg 4080 agttcttcgc ccaccccaac ttgtttattg cagcttataa tggttacaaa taaagcaata 4140 gcatcacaaa tttcacaaat aaagcatttt tttcactgca ttctagttgt ggtttgtcca 4200 aactcatcaa tgtatcttat catgtctgta taccgtcgac ctctagctag agcttggcgt 4260 aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca 4320 tacgagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagc taactcacat 4380 taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt 4440 aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct 4500 cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa 4560 aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa 4620 aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc 4680 tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga 4740 caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc 4800 cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt 4860 ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct 4920 gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg 4980 agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta 5040 gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct 5100 acactagaag aacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa 5160 gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtttt tttgtttgca 5220 agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 5280 ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 5340 aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 5400 tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 5460 cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 5520 tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 5580 cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 5640 ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 5700 gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 5760 gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 5820 gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 5880 gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 5940 tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 6000 aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 6060 cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 6120 caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 6180 cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 6240 ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 6300 aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 6360 tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 6420 tc 6422 <210> 7 <211> 1572 <212> DNA <213> Artificial Sequence <220> <223> x0GC/dhfr cassette <400> 7 gacatgataa gatacattga tgagtttgga caaaccacaa ctagaatgca gtgaaaaaaa 60 tgctttattt gtgaaatttg tgatgctatt gctttatttg taaccattat aagctgcaat 120 aaacaagtta acaacaacaa ttgcattcat tttatgtttc aggttcaggg ggaggtgtgg 180 gaggtttttt aaagcaagta aaacctctac aaatgtggta tggctgatta tgatctctag 240 tcaaggcact atacatcaaa tattccttat taaccccttt acaaattaaa aagctaaagg 300 tacacaattt ttgagcatag ttattaatag cagacactct atgcctgtgt ggagtaagaa 360 aaaacagtat gttatgatta taactgttat gcctacttat aaaggttaca gaatattttt 420 ccataatttt cttgtatagc agtgcagctt tttcctttgt ggtgtaaata gcaaagcaag 480 caagagttct attactaaac acagcatgac tcaaaaaact tagcaattct gaaggaaagt 540 ccttggggtc ttctaccttt ctcttctttt ttggaggagt agaatgttga gagtcagcag 600 tagcctcatc atcactagat ggcatttctt ctgagcaaaa caggttttcc tcattaaagg 660 cattccacca ctgctcccat tcatcagttc cataggttgg aatctaaaat acacaaacaa 720 ttagaatcag tagtttaaca cattatacac ttaaaaattt tatatttacc ttagagcttt 780 aaatctctgt aggtagtttg tccaattatg tcacaccaca gaagtaaggt tccttcacaa 840 agatccaaag ccagcaaaag tcccatggtc ttataaaaat gcatagcttt aggaggggag 900 cagagaactt gaaagcatct tcctgttagt ctttcttctc gtagacttca aacttatact 960 tgatgccttt ttcctcctgg acctcagaga ggacgcctgg gtattctggg agaagtttat 1020 atttccccaa atcaatttct gggaaaaacg tgtcactttc aaattcctgc atgatccttg 1080 tcacaaagag tctgaggtgg cctggttgat tcatggcttc ctggtaaaca gaactgcctc 1140 cgactatcca aaccatgtct actttacttg ccaattccgg ttgttcaata agtcttaagg 1200 catcatccaa acttttggca agaaaatgag ctcctcgtgg tggttctttg agttctctac 1260 tgagaactat attaattctg tcctttaaag gtcgattctt ctcaggaatg gagaaccagg 1320 ttttcctacc cataatcacc agattctgtt taccttccac tgaagaggtt gtggtcattc 1380 tttggaagta cttgaactcg ttcctgagcg gaggccaggg taggtctccg ttcttgccaa 1440 tccccatatt ttgggacacg gcgacgatgc agttcaatgg tcgaaccatg atggcagcgg 1500 ggataaaatc ctaccagcct tcacgctagg attgccgtca agtttggcgc gaaatcgcag 1560 ccctgagctg tg 1572 <210> 8 <211> 988 <212> DNA <213> Artificial Sequence <220> <223> x1GC dhfr cassette <400> 8 attctccgcc ccatggctga ctaatttttt ttatttatgc agaggccgag gccgcctctg 60 cctctgagct attccagaag tagtgaggag gcttttttgg aggcctaggc ttttgcaaaa 120 agctcccggg atggttcgac cattgaactg catcgtcgcc gtgtcccaaa atatggggat 180 tggcaagaac ggaaacctac cctggcctcc gctcaggaac gagttcaagt acttccaaag 240 aatgaccaca acctcttcag tggaaggtaa acagaatctg gtgattatgg gtaggaaaac 300 ctggttctcc attcctgaga agaatcgacc tttaaaggac agaattaata tagttctcag 360 tagagaactc aaagaaccac cacgaggagc tcattttctt gccaaaagtt tggatgatgc 420 cttaagactt attgaacaac cggaattggc aagtaaagta gacatggttt ggatagtcgg 480 aggcagttct gtttaccagg aagccatgaa tcaaccaggc cacctcagac tctttgtgac 540 aaggatcatg caggaatttg aaagtgacac gtttttccca gaaattgatt tggggaaata 600 taaacttctc ccagaatacc caggcgtcct ctctgaggtc caggaggaaa aaggcatcaa 660 gtataagttt gaagtctacg agaagaaaga ctaacaggtt cgaaatgacc gaccaagcga 720 cgcccaacct gccatcacga gatttcgatt ccaccgccgc cttctatgaa aggttgggct 780 tcggaatcgt tttccgggac gccggctgga tgatcctcca gcgcggggat ctcatgctgg 840 agttcttcgc ccaccccaac ttgtttattg cagcttataa tggttacaaa taaagcaata 900 gcatcacaaa tttcacaaat aaagcatttt tttcactgca ttctagttgt ggtttgtcca 960 aactcatcaa tgtatcttat catgtctg 988 <210> 9 <211> 1010 <212> DNA <213> Artificial Sequence <220> <223> x3GC dhfr cassette <400> 9 ctaactccgc ccagttccgc ccattctccg ccccatggct gactaatttt ttttatttat 60 gcagaggccg aggccgcctc tgcctctgag ctattccaga agtagtgagg aggctttttt 120 ggaggcctag gcttttgcaa aaagctcccg ggatggttcg accattgaac tgcatcgtcg 180 ccgtgtccca aaatatgggg attggcaaga acggaaacct accctggcct ccgctcagga 240 acgagttcaa gtacttccaa agaatgacca caacctcttc agtggaaggt aaacagaatc 300 tggtgattat gggtaggaaa acctggttct ccattcctga gaagaatcga cctttaaagg 360 acagaattaa tatagttctc agtagagaac tcaaagaacc accacgagga gctcattttc 420 ttgccaaaag tttggatgat gccttaagac ttattgaaca accggaattg gcaagtaaag 480 tagacatggt ttggatagtc ggaggcagtt ctgtttacca ggaagccatg aatcaaccag 540 gccacctcag actctttgtg acaaggatca tgcaggaatt tgaaagtgac acgtttttcc 600 cagaaattga tttggggaaa tataaacttc tcccagaata cccaggcgtc ctctctgagg 660 tccaggagga aaaaggcatc aagtataagt ttgaagtcta cgagaagaaa gactaacagg 720 ttcgaaatga ccgaccaagc gacgcccaac ctgccatcac gagatttcga ttccaccgcc 780 gccttctatg aaaggttggg cttcggaatc gttttccggg acgccggctg gatgatcctc 840 cagcgcgggg atctcatgct ggagttcttc gcccacccca acttgtttat tgcagcttat 900 aatggttaca aataaagcaa tagcatcaca aatttcacaa ataaagcatt tttttcactg 960 cattctagtt gtggtttgtc caaactcatc aatgtatctt atcatgtctg 1010 <210> 10 <211> 1044 <212> DNA <213> Artificial Sequence <220> <223> x6GC dhfr cassette <400> 10 catagtcccg cccctaactc cgcccatccc gcccctaact ccgcccagtt ccgcccattc 60 tccgccccat ggctgactaa ttttttttat ttatgcagag gccgaggccg cctctgcctc 120 tgagctattc cagaagtagt gaggaggctt ttttggaggc ctaggctttt gcaaaaagct 180 cccgggatgg ttcgaccatt gaactgcatc gtcgccgtgt cccaaaatat ggggattggc 240 aagaacggaa acctaccctg gcctccgctc aggaacgagt tcaagtactt ccaaagaatg 300 accacaacct cttcagtgga aggtaaacag aatctggtga ttatgggtag gaaaacctgg 360 ttctccattc ctgagaagaa tcgaccttta aaggacagaa ttaatatagt tctcagtaga 420 gaactcaaag aaccaccacg aggagctcat tttcttgcca aaagtttgga tgatgcctta 480 agacttattg aacaaccgga attggcaagt aaagtagaca tggtttggat agtcggaggc 540 agttctgttt accaggaagc catgaatcaa ccaggccacc tcagactctt tgtgacaagg 600 atcatgcagg aatttgaaag tgacacgttt ttcccagaaa ttgatttggg gaaatataaa 660 cttctcccag aatacccagg cgtcctctct gaggtccagg aggaaaaagg catcaagtat 720 aagtttgaag tctacgagaa gaaagactaa caggttcgaa atgaccgacc aagcgacgcc 780 caacctgcca tcacgagatt tcgattccac cgccgccttc tatgaaaggt tgggcttcgg 840 aatcgttttc cgggacgccg gctggatgat cctccagcgc ggggatctca tgctggagtt 900 cttcgcccac cccaacttgt ttattgcagc ttataatggt tacaaataaa gcaatagcat 960 cacaaatttc acaaataaag catttttttc actgcattct agttgtggtt tgtccaaact 1020 catcaatgta tcttatcatg tctg 1044 <210> 11 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> dhfr 01 primer <400> 11 gcgcccggga tggttcgacc attgaactgc 30 <210> 12 <211> 34 <212> DNA <213> Artificial Sequence <220> <223> dhfr-02 primer <400> 12 cacttagaac ctgttagtct ttcttctcgt agac 34 <210> 13 <211> 33 <212> DNA <213> Artificial Sequence <220> <223> x 1GC primer <400> 13 tcaggatcca ttctccgccc catggctgac taa 33 <210> 14 <211> 35 <212> DNA <213> Artificial Sequence <220> <223> x 3GC primer <400> 14 catggatcct aactccgccc agttccgccc attct 35 <210> 15 <211> 34 <212> DNA <213> Artificial Sequence <220> <223> x6GC primer <400> 15 catggatccc atagtcccgc ccctaactcc gccc 34 <210> 16 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> X0GC primer <400> 16 cgatggatcc gacatgataa gatacattga t 31 <210> 17 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> X0GCRR primer <400> 17 cgttggatcc acagctcagg gctgcgattt c 31 <210> 18 <211> 33 <212> DNA <213> Artificial Sequence <220> <223> BISVpAR primer <400> 18 tcaggatccc agacatgata agatacattg atg 33 <110> Hanmi Pharm. Co., Ltd. <120> A Novel vector and expression cell line for mass production of          recombinant protein and a process of producing recombinant          protein using same <160> 18 <170> KopatentIn 1.71 <210> 1 <211> 9227 <212> DNA <213> Artificial Sequence <220> <223> X0GC / GEPO vector <400> 1 gacggatcgg gagatccgac atgataagat acattgatga gtttggacaa accacaacta 60 gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga tgctattgct ttatttgtaa 120 ccattataag ctgcaataaa caagttaaca acaacaattg cattcatttt atgtttcagg 180 ttcaggggga ggtgtgggag gttttttaaa gcaagtaaaa cctctacaaa tgtggtatgg 240 ctgattatga tctctagtca aggcactata catcaaatat tccttattaa cccctttaca 300 aattaaaaag ctaaaggtac acaatttttg agcatagtta ttaatagcag acactctatg 360 cctgtgtgga gtaagaaaaa acagtatgtt atgattataa ctgttatgcc tacttataaa 420 ggttacagaa tatttttcca taattttctt gtatagcagt gcagcttttt cctttgtggt 480 gtaaatagca aagcaagcaa gagttctatt actaaacaca gcatgactca aaaaacttag 540 caattctgaa ggaaagtcct tggggtcttc tacctttctc ttcttttttg gaggagtaga 600 atgttgagag tcagcagtag cctcatcatc actagatggc atttcttctg agcaaaacag 660 gttttcctca ttaaaggcat tccaccactg ctcccattca tcagttccat aggttggaat 720 ctaaaataca caaacaatta gaatcagtag tttaacacat tatacactta aaaattttat 780 atttacctta gagctttaaa tctctgtagg tagtttgtcc aattatgtca caccacagaa 840 gtaaggttcc ttcacaaaga tccaaagcca gcaaaagtcc catggtctta taaaaatgca 900 tagctttagg aggggagcag agaacttgaa agcatcttcc tgttagtctt tcttctcgta 960 gacttcaaac ttatacttga tgcctttttc ctcctggacc tcagagagga cgcctgggta 1020 ttctgggaga agtttatatt tccccaaatc aatttctggg aaaaacgtgt cactttcaaa 1080 ttcctgcatg atccttgtca caaagagtct gaggtggcct ggttgattca tggcttcctg 1140 gtaaacagaa ctgcctccga ctatccaaac catgtctact ttacttgcca attccggttg 1200 ttcaataagt cttaaggcat catccaaact tttggcaaga aaatgagctc ctcgtggtgg 1260 ttctttgagt tctctactga gaactatatt aattctgtcc tttaaaggtc gattcttctc 1320 aggaatggag aaccaggttt tcctacccat aatcaccaga ttctgtttac cttccactga 1380 agaggttgtg gtcattcttt ggaagtactt gaactcgttc ctgagcggag gccagggtag 1440 gtctccgttc ttgccaatcc ccatattttg ggacacggcg acgatgcagt tcaatggtcg 1500 aaccatgatg gcagcgggga taaaatccta ccagccttca cgctaggatt gccgtcaagt 1560 ttggcgcgaa atcgcagccc tgagctgtgg atctcccgat cccctatggt gcactctcag 1620 tacaatctgc tctgatgccg catagttaag ccagtatctg ctccctgctt gtgtgttgga 1680 ggtcgctgag tagtgcgcga gcaaaattta agctacaaca aggcaaggct tgaccgacaa 1740 ttgcatgaag aatctgctta gggttaggcg ttttgcgctg cttcgcgatg tacgggccag 1800 atatacgcgt tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt 1860 agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg 1920 ctgaccgccc aacgaccccc gcccattgac gtcaataatg acgtatgttc ccatagtaac 1980 gccaataggg actttccatt gacgtcaatg ggtggagtat ttacggtaaa ctgcccactt 2040 ggcagtacat caagtgtatc atatgccaag tacgccccct attgacgtca atgacggtaa 2100 atggcccgcc tggcattatg cccagtacat gaccttatgg gactttccta cttggcagta 2160 catctacgta ttagtcatcg ctattaccat ggtgatgcgg ttttggcagt acatcaatgg 2220 gcgtggatag cggtttgact cacggggatt tccaagtctc caccccattg acgtcaatgg 2280 gagtttgttt tggcaccaaa atcaacggga ctttccaaaa tgtcgtaaca actccgcccc 2340 attgacgcaa atgggcggta ggcgtgtacg gtgggaggtc tatataagca gagctctctg 2400 gctaactaga gaacccactg cttactggct tatcgaaatt aatacgactc actataggga 2460 gacccaagct tggtaccgag ctcggatcca ctagtaacgg ccgccagtgt gctggaattc 2520 tgcagattcg aggtcgacgg tatcgataag cttccaagcg cggagatggg ggtgcacggt 2580 gagtactcgc gggctgggcg ctcccgcccg cccgggtccc tgtttgagcg gggatttagc 2640 gccccggcta ttggccagga ggtggctggg ttcaaggacc ggcgacttgt caaggacccc 2700 ggaaggggga ggggggtggg gcagcctcca cgtgccagcg gggacttggg ggagtccttg 2760 gggatggcaa aaacctgacc tgtgaagggg acacagtttg ggggttgagg ggaagaaggt 2820 ttgggggttc tgctgtgcca gtggagagga agctgataag ctgataacct gggcgctgga 2880 gccaccactt atctgccaga ggggaagcct ctgtcacacc aggattgaag tttggccgga 2940 gaagtggatg ctggtagctg ggggtggggt gtgcacacgg cagcaggatt gaatgaaggc 3000 cagggaggca gcacctgagt gcttgcatgg ttggggacag gaaggacgag ctggggcaga 3060 gacgtgggga tgaaggaagc tgtccttcca cagccaccct tctccctccc cgcctgactc 3120 tcagcctggc tatctgttct agaatgtcct gcctggctgt ggcttctcct gtccctgctg 3180 tcgctccctc tgggcctccc agtcctgggc gccccaccac gcctcatctg tgacagccga 3240 gtcctggaga ggtacctctt ggaggccaag gaggccgaga atatcacggt gagacccctt 3300 ccccagcaca ttccacagaa ctcacgctca gggcttcagg gaactcctcc cagatccagg 3360 aacctggcac ttggtttagg gtggagttgg gaagctagac actgcccccc tacataagaa 3420 taagtctggt ggccccaaac catacctgga aactaggcaa ggagcaaagc cagcagatcc 3480 tacggcctgt gggccagggc cagagccttc agggaccctt gactccccgg gctgtgtgca 3540 tttcagacgg gctgtgctga acactgcagc ttgaatgaga atatcactgt cccagacacc 3600 aaagttaatt tctatgcctg gaagaggatg gaggtgagtt cctttttttt tttttttcct 3660 ttcttttgga gaatctcatt tgcgagcctg attttggatg aaagggagaa tgatcgaggg 3720 aaaggtaaaa tggagcagca gagatgaggc tgcctgggcg cagaggctca cgtctataat 3780 cccaggctga gatggccgag atgggagaat tgcttgagcc ctggagtttc agaccaacct 3840 aggcagcata gtgagatccc ccatctctac aaacatttaa aaaaattagt caggtgaagt 3900 ggtgcatggt ggtagtccca gatatttgga aggctgaggc gggaggatcg cttgagccca 3960 ggaatttgag gctgcagtga gctgtgatca caccactgca ctccagcctc agtgacagag 4020 tgaggccctg tctcaaaaaa gaaaagaaaa aagaaaaata atgagggctg tatggaatac 4080 attcattatt cattcactca ctcactcact cattcattca ttcattcatt caacaagtct 4140 tattgcatac cttctgtttg ctcagcttgg tgcttggggc tgctgagggg caggagggag 4200 agggtgacat gggtcagctg actcccagag tccactccct gtaggtcggg cagcaggccg 4260 tagaagtctg gcagggcctg gccctgctgt cggaagctgt cctgcggggc caggccctgt 4320 tggtcaactc ttcccagccg tgggagcccc tgcagctgca tgtggataaa gccgtcagtg 4380 gccttcgcag cctcaccact ctgcttcggg ctctgggagc ccaggtgagt aggagcggac 4440 acttctgctt gccctttctg taagaagggg agaagggtct tgctaaggag tacaggaact 4500 gtccgtattc cttccctttc tgtggcactg cagcgacctc ctgttttctc cttggcagaa 4560 ggaagccatc tcccctccag atgcggcctc agctgctcca ctccgaacaa tcactgctga 4620 cactttccgc aaactcttcc gagtctactc caatttcctc cggggaaagc tgaagctgta 4680 cacaggggag gcctgcagga caggggacag atgaccagct tgatatcgaa ttatccatca 4740 cactggcggc cgctcgagca tgcatctaga gggccctatt ctatagtgtc acctaaatgc 4800 tagagctcgc tgatcagcct cgactgtgcc ttctagttgc cagccatctg ttgtttgccc 4860 ctcccccgtg ccttccttga ccctggaagg tgccactccc actgtccttt cctaataaaa 4920 tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct attctggggg gtggggtggg 4980 gcaggacagc aagggggagg attgggaaga caatagcagg catgctgggg atgcggtggg 5040 ctctatggct tctgaggcgg aaagaaccag ctggggctct agggggtatc cccacgcgcc 5100 ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga ccgctacact 5160 tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg ccacgttcgc 5220 cggctttccc cgtcaagctc taaatcgggg gctcccttta gggttccgat ttagtgcttt 5280 acggcacctc gaccccaaaa aacttgatta gggtgatggt tcacgtagtg ggccatcgcc 5340 ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata gtggactctt 5400 gttccaaact ggaacaacac tcaaccctat ctcggtctat tcttttgatt tataagggat 5460 tttgccgatt tcggcctatt ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa 5520 ttaattctgt ggaatgtgtg tcagttaggg tgtggaaagt ccccaggctc cccagcaggc 5580 agaagtatgc aaagcatgca tctcaattag tcagcaacca ggtgtggaaa gtccccaggc 5640 tccccagcag gcagaagtat gcaaagcatg catctcaatt agtcagcaac catagtcccg 5700 cccctaactc cgcccatccc gcccctaact ccgcccagtt ccgcccattc tccgccccat 5760 ggctgactaa ttttttttat ttatgcagag gccgaggccg cctctgcctc tgagctattc 5820 cagaagtagt gaggaggctt ttttggaggc ctaggctttt gcaaaaagct cccgggagct 5880 tgtatatcca ttttcggatc tgatcaagag acaggatgag gatcgtttcg catgattgaa 5940 caagatggat tgcacgcagg ttctccggcc gcttgggtgg agaggctatt cggctatgac 6000 tgggcacaac agacaatcgg ctgctctgat gccgccgtgt tccggctgtc agcgcagggg 6060 cgcccggttc tttttgtcaa gaccgacctg tccggtgccc tgaatgaact gcaggacgag 6120 gcagcgcggc tatcgtggct ggccacgacg ggcgttcctt gcgcagctgt gctcgacgtt 6180 gtcactgaag cgggaaggga ctggctgcta ttgggcgaag tgccggggca ggatctcctg 6240 tcatctcacc ttgctcctgc cgagaaagta tccatcatgg ctgatgcaat gcggcggctg 6300 catacgcttg atccggctac ctgcccattc gaccaccaag cgaaacatcg catcgagcga 6360 gcacgtactc ggatggaagc cggtcttgtc gatcaggatg atctggacga agagcatcag 6420 gggctcgcgc cagccgaact gttcgccagg ctcaaggcgc gcatgcccga cggcgaggat 6480 ctcgtcgtga cccatggcga tgcctgcttg ccgaatatca tggtggaaaa tggccgcttt 6540 tctggattca tcgactgtgg ccggctgggt gtggcggacc gctatcagga catagcgttg 6600 gctacccgtg atattgctga agagcttggc ggcgaatggg ctgaccgctt cctcgtgctt 6660 tacggtatcg ccgctcccga ttcgcagcgc atcgccttct atcgccttct tgacgagttc 6720 ttctgagcgg gactctgggg ttcgaaatga ccgaccaagc gacgcccaac ctgccatcac 6780 gagatttcga ttccaccgcc gccttctatg aaaggttggg cttcggaatc gttttccggg 6840 acgccggctg gatgatcctc cagcgcgggg atctcatgct ggagttcttc gcccacccca 6900 acttgtttat tgcagcttat aatggttaca aataaagcaa tagcatcaca aatttcacaa 6960 ataaagcatt tttttcactg cattctagtt gtggtttgtc caaactcatc aatgtatctt 7020 atcatgtctg tataccgtcg acctctagct agagcttggc gtaatcatgg tcatagctgt 7080 ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa 7140 agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac 7200 tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg 7260 cggggagagg cggtttgcgt attgggcgct cttccgcttc ctcgctcact gactcgctgc 7320 gctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat 7380 ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca 7440 ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc 7500 atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc 7560 aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg 7620 gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta 7680 ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg 7740 ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac 7800 acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag 7860 gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga agaacagtat 7920 ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat 7980 ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc 8040 gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt 8100 ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct 8160 agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaaactt 8220 ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc tgtctatttc 8280 gttcatccat agttgcctga ctccccgtcg tgtagataac tacgatacgg gagggcttac 8340 catctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct ccagatttat 8400 cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca actttatccg 8460 cctccatcca gtctattaat tgttgccggg aagctagagt aagtagttcg ccagttaata 8520 gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg tcgtttggta 8580 tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc cccatgttgt 8640 gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag 8700 tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg ccatccgtaa 8760 gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc 8820 gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat agcagaactt 8880 taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg atcttaccgc 8940 tgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcttca gcatctttta 9000 ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca aaaaagggaa 9060 taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat tattgaagca 9120 tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag aaaaataaac 9180 aaataggggt tccgcgcaca tttccccgaa aagtgccacc tgacgtc 9227 <210> 2 <211> 8635 <212> DNA <213> Artificial Sequence <220> <223> X1GC / GEPO vector <400> 2 gacggatcgg gagatccatt ctccgcccca tggctgacta atttttttta tttatgcaga 60 ggccgaggcc gcctctgcct ctgagctatt ccagaagtag tgaggaggct tttttggagg 120 cctaggcttt tgcaaaaagc tcccgggatg gttcgaccat tgaactgcat cgtcgccgtg 180 tcccaaaata tggggattgg caagaacgga aacctaccct ggcctccgct caggaacgag 240 ttcaagtact tccaaagaat gaccacaacc tcttcagtgg aaggtaaaca gaatctggtg 300 attatgggta ggaaaacctg gttctccatt cctgagaaga atcgaccttt aaaggacaga 360 attaatatag ttctcagtag agaactcaaa gaaccaccac gaggagctca ttttcttgcc 420 aaaagtttgg atgatgcctt aagacttatt gaacaaccgg aattggcaag taaagtagac 480 atggtttgga tagtcggagg cagttctgtt taccaggaag ccatgaatca accaggccac 540 ctcagactct ttgtgacaag gatcatgcag gaatttgaaa gtgacacgtt tttcccagaa 600 attgatttgg ggaaatataa acttctccca gaatacccag gcgtcctctc tgaggtccag 660 gaggaaaaag gcatcaagta taagtttgaa gtctacgaga agaaagacta acaggttcga 720 aatgaccgac caagcgacgc ccaacctgcc atcacgagat ttcgattcca ccgccgcctt 780 ctatgaaagg ttgggcttcg gaatcgtttt ccgggacgcc ggctggatga tcctccagcg 840 cggggatctc atgctggagt tcttcgccca ccccaacttg tttattgcag cttataatgg 900 ttacaaataa agcaatagca tcacaaattt cacaaataaa gcattttttt cactgcattc 960 tagttgtggt ttgtccaaac tcatcaatgt atcttatcat gtctgggatc tcccgatccc 1020 ctatggtgca ctctcagtac aatctgctct gatgccgcat agttaagcca gtatctgctc 1080 cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca aaatttaagc tacaacaagg 1140 caaggcttga ccgacaattg catgaagaat ctgcttaggg ttaggcgttt tgcgctgctt 1200 cgcgatgtac gggccagata tacgcgttga cattgattat tgactagtta ttaatagtaa 1260 tcaattacgg ggtcattagt tcatagccca tatatggagt tccgcgttac ataacttacg 1320 gtaaatggcc cgcctggctg accgcccaac gacccccgcc cattgacgtc aataatgacg 1380 tatgttccca tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta 1440 cggtaaactg cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt 1500 gacgtcaatg acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac 1560 tttcctactt ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt 1620 tggcagtaca tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac 1680 cccattgacg tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt 1740 cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc gtgtacggtg ggaggtctat 1800 ataagcagag ctctctggct aactagagaa cccactgctt actggcttat cgaaattaat 1860 acgactcact atagggagac ccaagctggc tagcgtttaa acttaagctt ggtaccgagc 1920 tcggatccac tagtaacggc cgccagtgtg ctggaattct gcagattcga ggtcgacggt 1980 atcgataagc ttccaagcgc ggagatgggg gtgcacggtg agtactcgcg ggctgggcgc 2040 tcccgcccgc ccgggtccct gtttgagcgg ggatttagcg ccccggctat tggccaggag 2100 gtggctgggt tcaaggaccg gcgacttgtc aaggaccccg gaagggggag gggggtgggg 2160 cagcctccac gtgccagcgg ggacttgggg gagtccttgg ggatggcaaa aacctgacct 2220 gtgaagggga cacagtttgg gggttgaggg gaagaaggtt tgggggttct gctgtgccag 2280 tggagaggaa gctgataagc tgataacctg ggcgctggag ccaccactta tctgccagag 2340 gggaagcctc tgtcacacca ggattgaagt ttggccggag aagtggatgc tggtagctgg 2400 gggtggggtg tgcacacggc agcaggattg aatgaaggcc agggaggcag cacctgagtg 2460 cttgcatggt tggggacagg aaggacgagc tggggcagag acgtggggat gaaggaagct 2520 gtccttccac agccaccctt ctccctcccc gcctgactct cagcctggct atctgttcta 2580 gaatgtcctg cctggctgtg gcttctcctg tccctgctgt cgctccctct gggcctccca 2640 gtcctgggcg ccccaccacg cctcatctgt gacagccgag tcctggagag gtacctcttg 2700 gaggccaagg aggccgagaa tatcacggtg agaccccttc cccagcacat tccacagaac 2760 tcacgctcag ggcttcaggg aactcctccc agatccagga acctggcact tggtttaggg 2820 tggagttggg aagctagaca ctgcccccct acataagaat aagtctggtg gccccaaacc 2880 atacctggaa actaggcaag gagcaaagcc agcagatcct acggcctgtg ggccagggcc 2940 agagccttca gggacccttg actccccggg ctgtgtgcat ttcagacggg ctgtgctgaa 3000 cactgcagct tgaatgagaa tatcactgtc ccagacacca aagttaattt ctatgcctgg 3060 aagaggatgg aggtgagttc cttttttttt ttttttcctt tcttttggag aatctcattt 3120 gcgagcctga ttttggatga aagggagaat gatcgaggga aaggtaaaat ggagcagcag 3180 agatgaggct gcctgggcgc agaggctcac gtctataatc ccaggctgag atggccgaga 3240 tgggagaatt gcttgagccc tggagtttca gaccaaccta ggcagcatag tgagatcccc 3300 catctctaca aacatttaaa aaaattagtc aggtgaagtg gtgcatggtg gtagtcccag 3360 atatttggaa ggctgaggcg ggaggatcgc ttgagcccag gaatttgagg ctgcagtgag 3420 ctgtgatcac accactgcac tccagcctca gtgacagagt gaggccctgt ctcaaaaaag 3480 aaaagaaaaa agaaaaataa tgagggctgt atggaataca ttcattattc attcactcac 3540 tcactcactc attcattcat tcattcattc aacaagtctt attgcatacc ttctgtttgc 3600 tcagcttggt gcttggggct gctgaggggc aggagggaga gggtgacatg ggtcagctga 3660 ctcccagagt ccactccctg taggtcgggc agcaggccgt agaagtctgg cagggcctgg 3720 ccctgctgtc ggaagctgtc ctgcggggcc aggccctgtt ggtcaactct tcccagccgt 3780 gggagcccct gcagctgcat gtggataaag ccgtcagtgg ccttcgcagc ctcaccactc 3840 tgcttcgggc tctgggagcc caggtgagta ggagcggaca cttctgcttg ccctttctgt 3900 aagaagggga gaagggtctt gctaaggagt acaggaactg tccgtattcc ttccctttct 3960 gtggcactgc agcgacctcc tgttttctcc ttggcagaag gaagccatct cccctccaga 4020 tgcggcctca gctgctccac tccgaacaat cactgctgac actttccgca aactcttccg 4080 agtctactcc aatttcctcc ggggaaagct gaagctgtac acaggggagg cctgcaggac 4140 aggggacaga tgaccagctt gatatcgaat tatccatcac actggcggcc gctcgagtct 4200 agagggcccg tttaaacccg ctgatcagcc tcgactgtgc cttctagttg ccagccatct 4260 gttgtttgcc cctcccccgt gccttccttg accctggaag gtgccactcc cactgtcctt 4320 tcctaataaa atgaggaaat tgcatcgcat tgtctgagta ggtgtcattc tattctgggg 4380 ggtggggtgg ggcaggacag caagggggag gattgggaag acaatagcag gcatgctggg 4440 gatgcggtgg gctctatggc ttctgaggcg gaaagaacca gctggggctc tagggggtat 4500 ccccacgcgc cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg 4560 accgctacac ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc 4620 gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga 4680 tttagtgctt tacggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt 4740 gggccatcgc cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat 4800 agtggactct tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat 4860 ttataaggga ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa 4920 tttaacgcga attaattctg tggaatgtgt gtcagttagg gtgtggaaag tccccaggct 4980 ccccagcagg cagaagtatg caaagcatgc atctcaatta gtcagcaacc aggtgtggaa 5040 agtccccagg ctccccagca ggcagaagta tgcaaagcat gcatctcaat tagtcagcaa 5100 ccatagtccc gcccctaact ccgcccatcc cgcccctaac tccgcccagt tccgcccatt 5160 ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc gcctctgcct 5220 ctgagctatt ccagaagtag tgaggaggct tttttggagg cctaggcttt tgcaaaaagc 5280 tcccgggagc ttgtatatcc attttcggat ctgatcaaga gacaggatga ggatcgtttc 5340 gcatgattga acaagatgga ttgcacgcag gttctccggc cgcttgggtg gagaggctat 5400 tcggctatga ctgggcacaa cagacaatcg gctgctctga tgccgccgtg ttccggctgt 5460 cagcgcaggg gcgcccggtt ctttttgtca agaccgacct gtccggtgcc ctgaatgaac 5520 tgcaggacga ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct tgcgcagctg 5580 tgctcgacgt tgtcactgaa gcgggaaggg actggctgct attgggcgaa gtgccggggc 5640 aggatctcct gtcatctcac cttgctcctg ccgagaaagt atccatcatg gctgatgcaa 5700 tgcggcggct gcatacgctt gatccggcta cctgcccatt cgaccaccaa gcgaaacatc 5760 gcatcgagcg agcacgtact cggatggaag ccggtcttgt cgatcaggat gatctggacg 5820 aagagcatca ggggctcgcg ccagccgaac tgttcgccag gctcaaggcg cgcatgcccg 5880 acggcgagga tctcgtcgtg acccatggcg atgcctgctt gccgaatatc atggtggaaa 5940 atggccgctt ttctggattc atcgactgtg gccggctggg tgtggcggac cgctatcagg 6000 acatagcgtt ggctacccgt gatattgctg aagagcttgg cggcgaatgg gctgaccgct 6060 tcctcgtgct ttacggtatc gccgctcccg attcgcagcg catcgccttc tatcgccttc 6120 ttgacgagtt cttctgagcg ggactctggg gttcgaaatg accgaccaag cgacgcccaa 6180 cctgccatca cgagatttcg attccaccgc cgccttctat gaaaggttgg gcttcggaat 6240 cgttttccgg gacgccggct ggatgatcct ccagcgcggg gatctcatgc tggagttctt 6300 cgcccacccc aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac 6360 aaatttcaca aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat 6420 caatgtatct tatcatgtct gtataccgtc gacctctagc tagagcttgg cgtaatcatg 6480 gtcatagctg tttcctgtgt gaaattgtta tccgctcaca attccacaca acatacgagc 6540 cggaagcata aagtgtaaag cctggggtgc ctaatgagtg agctaactca cattaattgc 6600 gttgcgctca ctgcccgctt tccagtcggg aaacctgtcg tgccagctgc attaatgaat 6660 cggccaacgc gcggggagag gcggtttgcg tattgggcgc tcttccgctt cctcgctcac 6720 tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt 6780 aatacggtta tccacagaat caggggataa cgcaggaaag aacatgtgag caaaaggcca 6840 gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg tttttccata ggctccgccc 6900 ccctgacgag catcacaaaa atcgacgctc aagtcagagg tggcgaaacc cgacaggact 6960 ataaagatac caggcgtttc cccctggaag ctccctcgtg cgctctcctg ttccgaccct 7020 gccgcttacc ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcatag 7080 ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca 7140 cgaacccccc gttcagcccg accgctgcgc cttatccggt aactatcgtc ttgagtccaa 7200 cccggtaaga cacgacttat cgccactggc agcagccact ggtaacagga ttagcagagc 7260 gaggtatgta ggcggtgcta cagagttctt gaagtggtgg cctaactacg gctacactag 7320 aagaacagta tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg 7380 tagctcttga tccggcaaac aaaccaccgc tggtagcggt ttttttgttt gcaagcagca 7440 gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga 7500 cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat 7560 cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga 7620 gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg 7680 tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga 7740 gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc 7800 agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac 7860 tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc 7920 agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc 7980 gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc 8040 catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt 8100 ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc 8160 atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg 8220 tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag 8280 cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat 8340 cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc 8400 atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa 8460 aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta 8520 ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 8580 aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtc 8635 <210> 3 <211> 8657 <212> DNA <213> Artificial Sequence <220> <223> X3GC / GEPO vector <400> 3 gacggatcgg gagatcccta actccgccca gttccgccca ttctccgccc catggctgac 60 taattttttt tatttatgca gaggccgagg ccgcctctgc ctctgagcta ttccagaagt 120 agtgaggagg cttttttgga ggcctaggct tttgcaaaaa gctcccggga tggttcgacc 180 attgaactgc atcgtcgccg tgtcccaaaa tatggggatt ggcaagaacg gaaacctacc 240 ctggcctccg ctcaggaacg agttcaagta cttccaaaga atgaccacaa cctcttcagt 300 ggaaggtaaa cagaatctgg tgattatggg taggaaaacc tggttctcca ttcctgagaa 360 gaatcgacct ttaaaggaca gaattaatat agttctcagt agagaactca aagaaccacc 420 acgaggagct cattttcttg ccaaaagttt ggatgatgcc ttaagactta ttgaacaacc 480 ggaattggca agtaaagtag acatggtttg gatagtcgga ggcagttctg tttaccagga 540 agccatgaat caaccaggcc acctcagact ctttgtgaca aggatcatgc aggaatttga 600 aagtgacacg tttttcccag aaattgattt ggggaaatat aaacttctcc cagaataccc 660 aggcgtcctc tctgaggtcc aggaggaaaa aggcatcaag tataagtttg aagtctacga 720 gaagaaagac taacaggttc gaaatgaccg accaagcgac gcccaacctg ccatcacgag 780 atttcgattc caccgccgcc ttctatgaaa ggttgggctt cggaatcgtt ttccgggacg 840 ccggctggat gatcctccag cgcggggatc tcatgctgga gttcttcgcc caccccaact 900 tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata 960 aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttatc 1020 atgtctggga tctcccgatc ccctatggtg cactctcagt acaatctgct ctgatgccgc 1080 atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt agtgcgcgag 1140 caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga atctgcttag 1200 ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt gacattgatt 1260 attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc catatatgga 1320 gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca acgacccccg 1380 cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga ctttccattg 1440 acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc aagtgtatca 1500 tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc 1560 ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat tagtcatcgc 1620 tattaccatg gtgatgcggt tttggcagta catcaatggg cgtggatagc ggtttgactc 1680 acggggattt ccaagtctcc accccattga cgtcaatggg agtttgtttt ggcaccaaaa 1740 tcaacgggac tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa tgggcggtag 1800 gcgtgtacgg tgggaggtct atataagcag agctctctgg ctaactagag aacccactgc 1860 ttactggctt atcgaaatta atacgactca ctatagggag acccaagctg gctagcgttt 1920 aaacttaagc ttggtaccga gctcggatcc actagtaacg gccgccagtg tgctggaatt 1980 ctgcagattc gaggtcgacg gtatcgataa gcttccaggc gcggagatgg gggtgcacgg 2040 tgagtactcg cgggctgggc gctcccgccc gcccgggtcc ctgtttgagc ggggatttag 2100 cgccccggct attggccagg aggtggctgg gttcaaggac cggcgacttg tcaaggaccc 2160 cggaaggggg aggggggtgg ggcagcctcc acgtgccagc ggggacttgg gggagtcctt 2220 ggggatggca aaaacctgac ctgtgaaggg gacacagttt gggggttgag gggaagaagg 2280 tttgggggtt ctgctgtgcc agtggagagg aagctgataa gctgataacc tgggcgctgg 2340 agccaccact tatctgccag aggggaagcc tctgtcacac caggattgaa gtttggccgg 2400 agaagtggat gctggtagct gggggtgggg tgtgcacacg gcagcaggat tgaatgaagg 2460 ccagggaggc agcacctgag tgcttgcatg gttggggaca ggaaggacga gctggggcag 2520 agacgtgggg atgaaggaag ctgtccttcc acagccaccc ttctccctcc ccgcctgact 2580 ctcagcctgg ctatctgttc tagaatgtcc tgcctggctg tggcttctcc tgtccctgct 2640 gtcgctccct ctgggcctcc cagtcctggg cgccccacca cgcctcatct gtgacagccg 2700 agtcctggag aggtacctct tggaggccaa ggaggccgag aatatcacgg tgagacccct 2760 tccccagcac attccacaga actcacgctc agggcttcag ggaactcctc ccagatccag 2820 gaacctggca cttggtttag ggtggagttg ggaagctaga cactgccccc ctacataaga 2880 ataagtctgg tggccccaaa ccatacctgg aaactaggca aggagcaaag ccagcagatc 2940 ctacggcctg tgggccaggg ccagagcctt cagggaccct tgactccccg ggctgtgtgc 3000 atttcagacg ggctgtgctg aacactgcag cttgaatgag aatatcactg tcccagacac 3060 caaagttaat ttctatgcct ggaagaggat ggaggtgagt tccttttttt ttttttttcc 3120 tttcttttgg agaatctcat ttgcgagcct gattttggat gaaagggaga atgatcgagg 3180 gaaaggtaaa atggagcagc agagatgagg ctgcctgggc gcagaggctc acgtctataa 3240 tcccaggctg agatggccga gatgggagaa ttgcttgagc cctggagttt cagaccaacc 3300 taggcagcat agtgagatcc cccatctcta caaacattta aaaaaattag tcaggtgaag 3360 tggtgcatgg tggtagtccc agatatttgg aaggctgagg cgggaggatc gcttgagccc 3420 aggaatttga ggctgcagtg agctgtgatc acaccactgc actccagcct cagtgacaga 3480 gtgaggccct gtctcaaaaa agaaaagaaa aaagaaaaat aatgagggct gtatggaata 3540 cattcattat tcattcactc actcactcac tcattcattc attcattcat tcaacaagtc 3600 ttattgcata ccttctgttt gctcagcttg gtgcttgggg ctgctgaggg gcaggaggga 3660 gagggtgaca tgggtcagct gactcccaga gtccactccc tgtaggtcgg gcagcaggcc 3720 gtagaagtct ggcagggcct ggccctgctg tcggaagctg tcctgcgggg ccaggccctg 3780 ttggtcaact cttcccagcc gtgggagccc ctgcagctgc atgtggataa agccgtcagt 3840 ggccttcgca gcctcaccac tctgcttcgg gctctgggag cccaggtgag taggagcgga 3900 cacttctgct tgccctttct gtaagaaggg gagaagggtc ttgctaagga gtacaggaac 3960 tgtccgtatt ccttcccttt ctgtggcact gcagcgacct cctgttttct ccttggcaga 4020 aggaagccat ctcccctcca gatgcggcct cagctgctcc actccgaaca atcactgctg 4080 acactttccg caaactcttc cgagtctact ccaatttcct ccggggaaag ctgaagctgt 4140 acacagggga ggcctgcagg acaggggaca gatgaccagc ttgatatcga attatccatc 4200 acactggcgg ccgctcgagt ctagagggcc cgtttaaacc cgctgatcag cctcgactgt 4260 gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga 4320 aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag 4380 taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga 4440 agacaatagc aggcatgctg gggatgcggt gggctctatg gcttctgagg cggaaagaac 4500 cagctggggc tctagggggt atccccacgc gccctgtagc ggcgcattaa gcgcggcggg 4560 tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt 4620 cgctttcttc ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg 4680 ggggctccct ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga 4740 ttagggtgat ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac 4800 gttggagtcc acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc 4860 tatctcggtc tattcttttg atttataagg gattttgccg atttcggcct attggttaaa 4920 aaatgagctg atttaacaaa aatttaacgc gaattaattc tgtggaatgt gtgtcagtta 4980 gggtgtggaa agtccccagg ctccccagca ggcagaagta tgcaaagcat gcatctcaat 5040 tagtcagcaa ccaggtgtgg aaagtcccca ggctccccag caggcagaag tatgcaaagc 5100 atgcatctca attagtcagc aaccatagtc ccgcccctaa ctccgcccat cccgccccta 5160 actccgccca gttccgccca ttctccgccc catggctgac taattttttt tatttatgca 5220 gaggccgagg ccgcctctgc ctctgagcta ttccagaagt agtgaggagg cttttttgga 5280 ggcctaggct tttgcaaaaa gctcccggga gcttgtatat ccattttcgg atctgatcaa 5340 gagacaggat gaggatcgtt tcgcatgatt gaacaagatg gattgcacgc aggttctccg 5400 gccgcttggg tggagaggct attcggctat gactgggcac aacagacaat cggctgctct 5460 gatgccgccg tgttccggct gtcagcgcag gggcgcccgg ttctttttgt caagaccgac 5520 ctgtccggtg ccctgaatga actgcaggac gaggcagcgc ggctatcgtg gctggccacg 5580 acgggcgttc cttgcgcagc tgtgctcgac gttgtcactg aagcgggaag ggactggctg 5640 ctattgggcg aagtgccggg gcaggatctc ctgtcatctc accttgctcc tgccgagaaa 5700 gtatccatca tggctgatgc aatgcggcgg ctgcatacgc ttgatccggc tacctgccca 5760 ttcgaccacc aagcgaaaca tcgcatcgag cgagcacgta ctcggatgga agccggtctt 5820 gtcgatcagg atgatctgga cgaagagcat caggggctcg cgccagccga actgttcgcc 5880 aggctcaagg cgcgcatgcc cgacggcgag gatctcgtcg tgacccatgg cgatgcctgc 5940 ttgccgaata tcatggtgga aaatggccgc ttttctggat tcatcgactg tggccggctg 6000 ggtgtggcgg accgctatca ggacatagcg ttggctaccc gtgatattgc tgaagagctt 6060 ggcggcgaat gggctgaccg cttcctcgtg ctttacggta tcgccgctcc cgattcgcag 6120 cgcatcgcct tctatcgcct tcttgacgag ttcttctgag cgggactctg gggttcgaaa 6180 tgaccgacca agcgacgccc aacctgccat cacgagattt cgattccacc gccgccttct 6240 atgaaaggtt gggcttcgga atcgttttcc gggacgccgg ctggatgatc ctccagcgcg 6300 gggatctcat gctggagttc ttcgcccacc ccaacttgtt tattgcagct tataatggtt 6360 acaaataaag caatagcatc acaaatttca caaataaagc atttttttca ctgcattcta 6420 gttgtggttt gtccaaactc atcaatgtat cttatcatgt ctgtataccg tcgacctcta 6480 gctagagctt ggcgtaatca tggtcatagc tgtttcctgt gtgaaattgt tatccgctca 6540 caattccaca caacatacga gccggaagca taaagtgtaa agcctggggt gcctaatgag 6600 tgagctaact cacattaatt gcgttgcgct cactgcccgc tttccagtcg ggaaacctgt 6660 cgtgccagct gcattaatga atcggccaac gcgcggggag aggcggtttg cgtattgggc 6720 gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg 6780 tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa 6840 agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg 6900 cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga 6960 ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg 7020 tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg 7080 gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc 7140 gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg 7200 gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca 7260 ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt 7320 ggcctaacta cggctacact agaagaacag tatttggtat ctgcgctctg ctgaagccag 7380 ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg 7440 gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt 7500 tgatcttttc tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg 7560 tcatgagatt atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta 7620 aatcaatcta aagtatatat gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg 7680 aggcacctat ctcagcgatc tgtctatttc gttcatccat agttgcctga ctccccgtcg 7740 tgtagataac tacgatacgg gagggcttac catctggccc cagtgctgca atgataccgc 7800 gagacccacg ctcaccggct ccagatttat cagcaataaa ccagccagcc ggaagggccg 7860 agcgcagaag tggtcctgca actttatccg cctccatcca gtctattaat tgttgccggg 7920 aagctagagt aagtagttcg ccagttaata gtttgcgcaa cgttgttgcc attgctacag 7980 gcatcgtggt gtcacgctcg tcgtttggta tggcttcatt cagctccggt tcccaacgat 8040 caaggcgagt tacatgatcc cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc 8100 cgatcgttgt cagaagtaag ttggccgcag tgttatcact catggttatg gcagcactgc 8160 ataattctct tactgtcatg ccatccgtaa gatgcttttc tgtgactggt gagtactcaa 8220 ccaagtcatt ctgagaatag tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac 8280 gggataatac cgcgccacat agcagaactt taaaagtgct catcattgga aaacgttctt 8340 cggggcgaaa actctcaagg atcttaccgc tgttgagatc cagttcgatg taacccactc 8400 gtgcacccaa ctgatcttca gcatctttta ctttcaccag cgtttctggg tgagcaaaaa 8460 caggaaggca aaatgccgca aaaaagggaa taagggcgac acggaaatgt tgaatactca 8520 tactcttcct ttttcaatat tattgaagca tttatcaggg ttattgtctc atgagcggat 8580 acatatttga atgtatttag aaaaataaac aaataggggt tccgcgcaca tttccccgaa 8640 aagtgccacc tgacgtc 8657 <210> 4 <211> 8691 <212> DNA <213> Artificial Sequence <220> <223> x6GC / GEPO vector <400> 4 gacggatcgg gagatcccat agtcccgccc ctaactccgc ccatcccgcc cctaactccg 60 cccagttccg cccattctcc gccccatggc tgactaattt tttttattta tgcagaggcc 120 gaggccgcct ctgcctctga gctattccag aagtagtgag gaggcttttt tggaggccta 180 ggcttttgca aaaagctccc gggatggttc gaccattgaa ctgcatcgtc gccgtgtccc 240 aaaatatggg gattggcaag aacggaaacc taccctggcc tccgctcagg aacgagttca 300 agtacttcca aagaatgacc acaacctctt cagtggaagg taaacagaat ctggtgatta 360 tgggtaggaa aacctggttc tccattcctg agaagaatcg acctttaaag gacagaatta 420 atatagttct cagtagagaa ctcaaagaac caccacgagg agctcatttt cttgccaaaa 480 gtttggatga tgccttaaga cttattgaac aaccggaatt ggcaagtaaa gtagacatgg 540 tttggatagt cggaggcagt tctgtttacc aggaagccat gaatcaacca ggccacctca 600 gactctttgt gacaaggatc atgcaggaat ttgaaagtga cacgtttttc ccagaaattg 660 atttggggaa atataaactt ctcccagaat acccaggcgt cctctctgag gtccaggagg 720 aaaaaggcat caagtataag tttgaagtct acgagaagaa agactaacag gttcgaaatg 780 accgaccaag cgacgcccaa cctgccatca cgagatttcg attccaccgc cgccttctat 840 gaaaggttgg gcttcggaat cgttttccgg gacgccggct ggatgatcct ccagcgcggg 900 gatctcatgc tggagttctt cgcccacccc aacttgttta ttgcagctta taatggttac 960 aaataaagca atagcatcac aaatttcaca aataaagcat ttttttcact gcattctagt 1020 tgtggtttgt ccaaactcat caatgtatct tatcatgtct gggatctccc gatcccctat 1080 ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagtat ctgctccctg 1140 cttgtgtgtt ggaggtcgct gagtagtgcg cgagcaaaat ttaagctaca acaaggcaag 1200 gcttgaccga caattgcatg aagaatctgc ttagggttag gcgttttgcg ctgcttcgcg 1260 atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa tagtaatcaa 1320 ttacggggtc attagttcat agcccatata tggagttccg cgttacataa cttacggtaa 1380 atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata atgacgtatg 1440 ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggag tatttacggt 1500 aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc cctattgacg 1560 tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta tgggactttc 1620 ctacttggca gtacatctac gtattagtca tcgctattac catggtgatg cggttttggc 1680 agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt ctccacccca 1740 ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca aaatgtcgta 1800 acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag gtctatataa 1860 gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa attaatacga 1920 ctcactatag ggagacccaa gctggctagc gtttaaactt aagcttggta ccgagctcgg 1980 atccactagt aacggccgcc agtgtgctgg aattctgcag attcgaggtc gacggtatcg 2040 ataagcttcc aagcgcggag atgggggtgc acggtgagta ctcgcgggct gggcgctccc 2100 gcccgcccgg gtccctgttt gagcggggat ttagcgcccc ggctattggc caggaggtgg 2160 ctgggttcaa ggaccggcga cttgtcaagg accccggaag ggggaggggg gtggggcagc 2220 ctccacgtgc cagcggggac ttgggggagt ccttggggat ggcaaaaacc tgacctgtga 2280 aggggacaca gtttgggggt tgaggggaag aaggtttggg ggttctgctg tgccagtgga 2340 gaggaagctg ataagctgat aacctgggcg ctggagccac cacttatctg ccagagggga 2400 agcctctgtc acaccaggat tgaagtttgg ccggagaagt ggatgctggt agctgggggt 2460 ggggtgtgca cacggcagca ggattgaatg aaggccaggg aggcagcacc tgagtgcttg 2520 catggttggg gacaggaagg acgagctggg gcagagacgt ggggatgaag gaagctgtcc 2580 ttccacagcc acccttctcc ctccccgcct gactctcagc ctggctatct gttctagaat 2640 gtcctgcctg gctgtggctt ctcctgtccc tgctgtcgct ccctctgggc ctcccagtcc 2700 tgggcgcccc accacgcctc atctgtgaca gccgagtcct ggagaggtac ctcttggagg 2760 ccaaggaggc cgagaatatc acggtgagac cccttcccca gcacattcca cagaactcac 2820 gctcagggct tcagggaact cctcccagat ccaggaacct ggcacttggt ttagggtgga 2880 gttgggaagc tagacactgc ccccctacat aagaataagt ctggtggccc caaaccatac 2940 ctggaaacta ggcaaggagc aaagccagca gatcctacgg cctgtgggcc agggccagag 3000 ccttcaggga cccttgactc cccgggctgt gtgcatttca gacgggctgt gctgaacact 3060 gcagcttgaa tgagaatatc actgtcccag acaccaaagt taatttctat gcctggaaga 3120 ggatggaggt gagttccttt tttttttttt ttcctttctt ttggagaatc tcatttgcga 3180 gcctgatttt ggatgaaagg gagaatgatc gagggaaagg taaaatggag cagcagagat 3240 gaggctgcct gggcgcagag gctcacgtct ataatcccag gctgagatgg ccgagatggg 3300 agaattgctt gagccctgga gtttcagacc aacctaggca gcatagtgag atcccccatc 3360 tctacaaaca tttaaaaaaa ttagtcaggt gaagtggtgc atggtggtag tcccagatat 3420 ttggaaggct gaggcgggag gatcgcttga gcccaggaat ttgaggctgc agtgagctgt 3480 gatcacacca ctgcactcca gcctcagtga cagagtgagg ccctgtctca aaaaagaaaa 3540 gaaaaaagaa aaataatgag ggctgtatgg aatacattca ttattcattc actcactcac 3600 tcactcattc attcattcat tcattcaaca agtcttattg cataccttct gtttgctcag 3660 cttggtgctt ggggctgctg aggggcagga gggagagggt gacatgggtc agctgactcc 3720 cagagtccac tccctgtagg tcgggcagca ggccgtagaa gtctggcagg gcctggccct 3780 gctgtcggaa gctgtcctgc ggggccaggc cctgttggtc aactcttccc agccgtggga 3840 gcccctgcag ctgcatgtgg ataaagccgt cagtggcctt cgcagcctca ccactctgct 3900 tcgggctctg ggagcccagg tgagtaggag cggacacttc tgcttgccct ttctgtaaga 3960 aggggagaag ggtcttgcta aggagtacag gaactgtccg tattccttcc ctttctgtgg 4020 cactgcagcg acctcctgtt ttctccttgg cagaaggaag ccatctcccc tccagatgcg 4080 gcctcagctg ctccactccg aacaatcact gctgacactt tccgcaaact cttccgagtc 4140 tactccaatt tcctccgggg aaagctgaag ctgtacacag gggaggcctg caggacaggg 4200 gacagatgac cagcttgata tcgaattatc catcacactg gcggccgctc gagtctagag 4260 ggcccgttta aacccgctga tcagcctcga ctgtgccttc tagttgccag ccatctgttg 4320 tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc cactcccact gtcctttcct 4380 aataaaatga ggaaattgca tcgcattgtc tgagtaggtg tcattctatt ctggggggtg 4440 gggtggggca ggacagcaag ggggaggatt gggaagacaa tagcaggcat gctggggatg 4500 cggtgggctc tatggcttct gaggcggaaa gaaccagctg gggctctagg gggtatcccc 4560 acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg 4620 ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca 4680 cgttcgccgg ctttccccgt caagctctaa atcgggggct ccctttaggg ttccgattta 4740 gtgctttacg gcacctcgac cccaaaaaac ttgattaggg tgatggttca cgtagtgggc 4800 catcgccctg atagacggtt tttcgccctt tgacgttgga gtccacgttc tttaatagtg 4860 gactcttgtt ccaaactgga acaacactca accctatctc ggtctattct tttgatttat 4920 aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa caaaaattta 4980 acgcgaatta attctgtgga atgtgtgtca gttagggtgt ggaaagtccc caggctcccc 5040 agcaggcaga agtatgcaaa gcatgcatct caattagtca gcaaccaggt gtggaaagtc 5100 cccaggctcc ccagcaggca gaagtatgca aagcatgcat ctcaattagt cagcaaccat 5160 agtcccgccc ctaactccgc ccatcccgcc cctaactccg cccagttccg cccattctcc 5220 gccccatggc tgactaattt tttttattta tgcagaggcc gaggccgcct ctgcctctga 5280 gctattccag aagtagtgag gaggcttttt tggaggccta ggcttttgca aaaagctccc 5340 gggagcttgt atatccattt tcggatctga tcaagagaca ggatgaggat cgtttcgcat 5400 gattgaacaa gatggattgc acgcaggttc tccggccgct tgggtggaga ggctattcgg 5460 ctatgactgg gcacaacaga caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc 5520 gcaggggcgc ccggttcttt ttgtcaagac cgacctgtcc ggtgccctga atgaactgca 5580 ggacgaggca gcgcggctat cgtggctggc cacgacgggc gttccttgcg cagctgtgct 5640 cgacgttgtc actgaagcgg gaagggactg gctgctattg ggcgaagtgc cggggcagga 5700 tctcctgtca tctcaccttg ctcctgccga gaaagtatcc atcatggctg atgcaatgcg 5760 gcggctgcat acgcttgatc cggctacctg cccattcgac caccaagcga aacatcgcat 5820 cgagcgagca cgtactcgga tggaagccgg tcttgtcgat caggatgatc tggacgaaga 5880 gcatcagggg ctcgcgccag ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg 5940 cgaggatctc gtcgtgaccc atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg 6000 ccgcttttct ggattcatcg actgtggccg gctgggtgtg gcggaccgct atcaggacat 6060 agcgttggct acccgtgata ttgctgaaga gcttggcggc gaatgggctg accgcttcct 6120 cgtgctttac ggtatcgccg ctcccgattc gcagcgcatc gccttctatc gccttcttga 6180 cgagttcttc tgagcgggac tctggggttc gaaatgaccg accaagcgac gcccaacctg 6240 ccatcacgag atttcgattc caccgccgcc ttctatgaaa ggttgggctt cggaatcgtt 6300 ttccgggacg ccggctggat gatcctccag cgcggggatc tcatgctgga gttcttcgcc 6360 caccccaact tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat 6420 ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat 6480 gtatcttatc atgtctgtat accgtcgacc tctagctaga gcttggcgta atcatggtca 6540 tagctgtttc ctgtgtgaaa ttgttatccg ctcacaattc cacacaacat acgagccgga 6600 agcataaagt gtaaagcctg gggtgcctaa tgagtgagct aactcacatt aattgcgttg 6660 cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc 6720 caacgcgcgg ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc gctcactgac 6780 tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata 6840 cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa 6900 aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct 6960 gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa 7020 agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg 7080 cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca 7140 cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa 7200 ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg 7260 gtaagacacg acttatcgcc actggcagca gccactggta acaggattag cagagcgagg 7320 tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga 7380 acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc 7440 tcttgatccg gcaaacaaac caccgctggt agcggttttt ttgtttgcaa gcagcagatt 7500 acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct 7560 cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc 7620 acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa 7680 acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta 7740 tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc 7800 ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat 7860 ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta 7920 tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt 7980 aatagtttgc gcaacgttgt tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt 8040 ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg 8100 ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc 8160 gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc 8220 gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg 8280 cggcgaccga gttgctcttg cccggcgtca atacgggata ataccgcgcc acatagcaga 8340 actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta 8400 ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct 8460 tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag 8520 ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca atattattga 8580 agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat 8640 aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt c 8691 <210> 5 <211> 7022 <212> DNA <213> Artificial Sequence <220> <223> X0GC / dhfr vector <400> 5 gacggatcgg gagatccgac atgataagat acattgatga gtttggacaa accacaacta 60 gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga tgctattgct ttatttgtaa 120 ccattataag ctgcaataaa caagttaaca acaacaattg cattcatttt atgtttcagg 180 ttcaggggga ggtgtgggag gttttttaaa gcaagtaaaa cctctacaaa tgtggtatgg 240 ctgattatga tctctagtca aggcactata catcaaatat tccttattaa cccctttaca 300 aattaaaaag ctaaaggtac acaatttttg agcatagtta ttaatagcag acactctatg 360 cctgtgtgga gtaagaaaaa acagtatgtt atgattataa ctgttatgcc tacttataaa 420 ggttacagaa tatttttcca taattttctt gtatagcagt gcagcttttt cctttgtggt 480 gtaaatagca aagcaagcaa gagttctatt actaaacaca gcatgactca aaaaacttag 540 caattctgaa ggaaagtcct tggggtcttc tacctttctc ttcttttttg gaggagtaga 600 atgttgagag tcagcagtag cctcatcatc actagatggc atttcttctg agcaaaacag 660 gttttcctca ttaaaggcat tccaccactg ctcccattca tcagttccat aggttggaat 720 ctaaaataca caaacaatta gaatcagtag tttaacacat tatacactta aaaattttat 780 atttacctta gagctttaaa tctctgtagg tagtttgtcc aattatgtca caccacagaa 840 gtaaggttcc ttcacaaaga tccaaagcca gcaaaagtcc catggtctta taaaaatgca 900 tagctttagg aggggagcag agaacttgaa agcatcttcc tgttagtctt tcttctcgta 960 gacttcaaac ttatacttga tgcctttttc ctcctggacc tcagagagga cgcctgggta 1020 ttctgggaga agtttatatt tccccaaatc aatttctggg aaaaacgtgt cactttcaaa 1080 ttcctgcatg atccttgtca caaagagtct gaggtggcct ggttgattca tggcttcctg 1140 gtaaacagaa ctgcctccga ctatccaaac catgtctact ttacttgcca attccggttg 1200 ttcaataagt cttaaggcat catccaaact tttggcaaga aaatgagctc ctcgtggtgg 1260 ttctttgagt tctctactga gaactatatt aattctgtcc tttaaaggtc gattcttctc 1320 aggaatggag aaccaggttt tcctacccat aatcaccaga ttctgtttac cttccactga 1380 agaggttgtg gtcattcttt ggaagtactt gaactcgttc ctgagcggag gccagggtag 1440 gtctccgttc ttgccaatcc ccatattttg ggacacggcg acgatgcagt tcaatggtcg 1500 aaccatgatg gcagcgggga taaaatccta ccagccttca cgctaggatt gccgtcaagt 1560 ttggcgcgaa atcgcagccc tgagctgtgg atctcccgat cccctatggt gcactctcag 1620 tacaatctgc tctgatgccg catagttaag ccagtatctg ctccctgctt gtgtgttgga 1680 ggtcgctgag tagtgcgcga gcaaaattta agctacaaca aggcaaggct tgaccgacaa 1740 ttgcatgaag aatctgctta gggttaggcg ttttgcgctg cttcgcgatg tacgggccag 1800 atatacgcgt tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt 1860 agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg 1920 ctgaccgccc aacgaccccc gcccattgac gtcaataatg acgtatgttc ccatagtaac 1980 gccaataggg actttccatt gacgtcaatg ggtggagtat ttacggtaaa ctgcccactt 2040 ggcagtacat caagtgtatc atatgccaag tacgccccct attgacgtca atgacggtaa 2100 atggcccgcc tggcattatg cccagtacat gaccttatgg gactttccta cttggcagta 2160 catctacgta ttagtcatcg ctattaccat ggtgatgcgg ttttggcagt acatcaatgg 2220 gcgtggatag cggtttgact cacggggatt tccaagtctc caccccattg acgtcaatgg 2280 gagtttgttt tggcaccaaa atcaacggga ctttccaaaa tgtcgtaaca actccgcccc 2340 attgacgcaa atgggcggta ggcgtgtacg gtgggaggtc tatataagca gagctctctg 2400 gctaactaga gaacccactg cttactggct tatcgaaatt aatacgactc actataggga 2460 gacccaagct tggtaccgag ctcggatcca ctagtaacgg ccgccagtgt gctggaattc 2520 tgcagatatc catcacactg gcggccgctc gagcatgcat ctagagggcc ctattctata 2580 gtgtcaccta aatgctagag ctcgctgatc agcctcgact gtgccttcta gttgccagcc 2640 atctgttgtt tgcccctccc ccgtgccttc cttgaccctg gaaggtgcca ctcccactgt 2700 cctttcctaa taaaatgagg aaattgcatc gcattgtctg agtaggtgtc attctattct 2760 ggggggtggg gtggggcagg acagcaaggg ggaggattgg gaagacaata gcaggcatgc 2820 tggggatgcg gtgggctcta tggcttctga ggcggaaaga accagctggg gctctagggg 2880 gtatccccac gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag 2940 cgtgaccgct acacttgcca gcgccctagc gcccgctcct ttcgctttct tcccttcctt 3000 tctcgccacg ttcgccggct ttccccgtca agctctaaat cgggggctcc ctttagggtt 3060 ccgatttagt gctttacggc acctcgaccc caaaaaactt gattagggtg atggttcacg 3120 tagtgggcca tcgccctgat agacggtttt tcgccctttg acgttggagt ccacgttctt 3180 taatagtgga ctcttgttcc aaactggaac aacactcaac cctatctcgg tctattcttt 3240 tgatttataa gggattttgc cgatttcggc ctattggtta aaaaatgagc tgatttaaca 3300 aaaatttaac gcgaattaat tctgtggaat gtgtgtcagt tagggtgtgg aaagtcccca 3360 ggctccccag caggcagaag tatgcaaagc atgcatctca attagtcagc aaccaggtgt 3420 ggaaagtccc caggctcccc agcaggcaga agtatgcaaa gcatgcatct caattagtca 3480 gcaaccatag tcccgcccct aactccgccc atcccgcccc taactccgcc cagttccgcc 3540 cattctccgc cccatggctg actaattttt tttatttatg cagaggccga ggccgcctct 3600 gcctctgagc tattccagaa gtagtgagga ggcttttttg gaggcctagg cttttgcaaa 3660 aagctcccgg gagcttgtat atccattttc ggatctgatc aagagacagg atgaggatcg 3720 tttcgcatga ttgaacaaga tggattgcac gcaggttctc cggccgcttg ggtggagagg 3780 ctattcggct atgactgggc acaacagaca atcggctgct ctgatgccgc cgtgttccgg 3840 ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg acctgtccgg tgccctgaat 3900 gaactgcagg acgaggcagc gcggctatcg tggctggcca cgacgggcgt tccttgcgca 3960 gctgtgctcg acgttgtcac tgaagcggga agggactggc tgctattggg cgaagtgccg 4020 gggcaggatc tcctgtcatc tcaccttgct cctgccgaga aagtatccat catggctgat 4080 gcaatgcggc ggctgcatac gcttgatccg gctacctgcc cattcgacca ccaagcgaaa 4140 catcgcatcg agcgagcacg tactcggatg gaagccggtc ttgtcgatca ggatgatctg 4200 gacgaagagc atcaggggct cgcgccagcc gaactgttcg ccaggctcaa ggcgcgcatg 4260 cccgacggcg aggatctcgt cgtgacccat ggcgatgcct gcttgccgaa tatcatggtg 4320 gaaaatggcc gcttttctgg attcatcgac tgtggccggc tgggtgtggc ggaccgctat 4380 caggacatag cgttggctac ccgtgatatt gctgaagagc ttggcggcga atgggctgac 4440 cgcttcctcg tgctttacgg tatcgccgct cccgattcgc agcgcatcgc cttctatcgc 4500 cttcttgacg agttcttctg agcgggactc tggggttcga aatgaccgac caagcgacgc 4560 ccaacctgcc atcacgagat ttcgattcca ccgccgcctt ctatgaaagg ttgggcttcg 4620 gaatcgtttt ccgggacgcc ggctggatga tcctccagcg cggggatctc atgctggagt 4680 tcttcgccca ccccaacttg tttattgcag cttataatgg ttacaaataa agcaatagca 4740 tcacaaattt cacaaataaa gcattttttt cactgcattc tagttgtggt ttgtccaaac 4800 tcatcaatgt atcttatcat gtctgtatac cgtcgacctc tagctagagc ttggcgtaat 4860 catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 4920 gagccggaag cataaagtgt aaagcctggg gtgcctaatg agtgagctaa ctcacattaa 4980 ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 5040 gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 5100 tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 5160 cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 5220 gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 5280 gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 5340 gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 5400 ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 5460 atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 5520 tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 5580 ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 5640 gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 5700 ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 5760 ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 5820 agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 5880 ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 5940 aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 6000 tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 6060 cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 6120 tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 6180 cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 6240 ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 6300 gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 6360 gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 6420 gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 6480 gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 6540 tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 6600 aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 6660 cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 6720 caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 6780 cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 6840 ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 6900 aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 6960 tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 7020 tc 7022 <210> 6 <211> 6422 <212> DNA <213> Artificial Sequence <220> <223> X1GC / dhfr vector <400> 6 gacggatcgg gagatccatt ctccgcccca tggctgacta atttttttta tttatgcaga 60 ggccgaggcc gcctctgcct ctgagctatt ccagaagtag tgaggaggct tttttggagg 120 cctaggcttt tgcaaaaagc tcccgggatg gttcgaccat tgaactgcat cgtcgccgtg 180 tcccaaaata tggggattgg caagaacgga aacctaccct ggcctccgct caggaacgag 240 ttcaagtact tccaaagaat gaccacaacc tcttcagtgg aaggtaaaca gaatctggtg 300 attatgggta ggaaaacctg gttctccatt cctgagaaga atcgaccttt aaaggacaga 360 attaatatag ttctcagtag agaactcaaa gaaccaccac gaggagctca ttttcttgcc 420 aaaagtttgg atgatgcctt aagacttatt gaacaaccgg aattggcaag taaagtagac 480 atggtttgga tagtcggagg cagttctgtt taccaggaag ccatgaatca accaggccac 540 ctcagactct ttgtgacaag gatcatgcag gaatttgaaa gtgacacgtt tttcccagaa 600 attgatttgg ggaaatataa acttctccca gaatacccag gcgtcctctc tgaggtccag 660 gaggaaaaag gcatcaagta taagtttgaa gtctacgaga agaaagacta acaggttcga 720 aatgaccgac caagcgacgc ccaacctgcc atcacgagat ttcgattcca ccgccgcctt 780 ctatgaaagg ttgggcttcg gaatcgtttt ccgggacgcc ggctggatga tcctccagcg 840 cggggatctc atgctggagt tcttcgccca ccccaacttg tttattgcag cttataatgg 900 ttacaaataa agcaatagca tcacaaattt cacaaataaa gcattttttt cactgcattc 960 tagttgtggt ttgtccaaac tcatcaatgt atcttatcat gtctgggatc tcccgatccc 1020 ctatggtgca ctctcagtac aatctgctct gatgccgcat agttaagcca gtatctgctc 1080 cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca aaatttaagc tacaacaagg 1140 caaggcttga ccgacaattg catgaagaat ctgcttaggg ttaggcgttt tgcgctgctt 1200 cgcgatgtac gggccagata tacgcgttga cattgattat tgactagtta ttaatagtaa 1260 tcaattacgg ggtcattagt tcatagccca tatatggagt tccgcgttac ataacttacg 1320 gtaaatggcc cgcctggctg accgcccaac gacccccgcc cattgacgtc aataatgacg 1380 tatgttccca tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta 1440 cggtaaactg cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt 1500 gacgtcaatg acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac 1560 tttcctactt ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt 1620 tggcagtaca tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac 1680 cccattgacg tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt 1740 cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc gtgtacggtg ggaggtctat 1800 ataagcagag ctctctggct aactagagaa cccactgctt actggcttat cgaaattaat 1860 acgactcact atagggagac ccaagctggc tagcgtttaa acttaagctt ggtaccgagc 1920 tcggatccac tagtccagtg tggtggaatt ctgcagatat ccagcacagt ggcggccgct 1980 cgagtctaga gggcccgttt aaacccgctg atcagcctcg actgtgcctt ctagttgcca 2040 gccatctgtt gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg ccactcccac 2100 tgtcctttcc taataaaatg aggaaattgc atcgcattgt ctgagtaggt gtcattctat 2160 tctggggggt ggggtggggc aggacagcaa gggggaggat tgggaagaca atagcaggca 2220 tgctggggat gcggtgggct ctatggcttc tgaggcggaa agaaccagct ggggctctag 2280 ggggtatccc cacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 2340 cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 2400 ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 2460 gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 2520 acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 2580 ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 2640 ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 2700 acaaaaattt aacgcgaatt aattctgtgg aatgtgtgtc agttagggtg tggaaagtcc 2760 ccaggctccc cagcaggcag aagtatgcaa agcatgcatc tcaattagtc agcaaccagg 2820 tgtggaaagt ccccaggctc cccagcaggc agaagtatgc aaagcatgca tctcaattag 2880 tcagcaacca tagtcccgcc cctaactccg cccatcccgc ccctaactcc gcccagttcc 2940 gcccattctc cgccccatgg ctgactaatt ttttttattt atgcagaggc cgaggccgcc 3000 tctgcctctg agctattcca gaagtagtga ggaggctttt ttggaggcct aggcttttgc 3060 aaaaagctcc cgggagcttg tatatccatt ttcggatctg atcaagagac aggatgagga 3120 tcgtttcgca tgattgaaca agatggattg cacgcaggtt ctccggccgc ttgggtggag 3180 aggctattcg gctatgactg ggcacaacag acaatcggct gctctgatgc cgccgtgttc 3240 cggctgtcag cgcaggggcg cccggttctt tttgtcaaga ccgacctgtc cggtgccctg 3300 aatgaactgc aggacgaggc agcgcggcta tcgtggctgg ccacgacggg cgttccttgc 3360 gcagctgtgc tcgacgttgt cactgaagcg ggaagggact ggctgctatt gggcgaagtg 3420 ccggggcagg atctcctgtc atctcacctt gctcctgccg agaaagtatc catcatggct 3480 gatgcaatgc ggcggctgca tacgcttgat ccggctacct gcccattcga ccaccaagcg 3540 aaacatcgca tcgagcgagc acgtactcgg atggaagccg gtcttgtcga tcaggatgat 3600 ctggacgaag agcatcaggg gctcgcgcca gccgaactgt tcgccaggct caaggcgcgc 3660 atgcccgacg gcgaggatct cgtcgtgacc catggcgatg cctgcttgcc gaatatcatg 3720 gtggaaaatg gccgcttttc tggattcatc gactgtggcc ggctgggtgt ggcggaccgc 3780 tatcaggaca tagcgttggc tacccgtgat attgctgaag agcttggcgg cgaatgggct 3840 gaccgcttcc tcgtgcttta cggtatcgcc gctcccgatt cgcagcgcat cgccttctat 3900 cgccttcttg acgagttctt ctgagcggga ctctggggtt cgaaatgacc gaccaagcga 3960 cgcccaacct gccatcacga gatttcgatt ccaccgccgc cttctatgaa aggttgggct 4020 tcggaatcgt tttccgggac gccggctgga tgatcctcca gcgcggggat ctcatgctgg 4080 agttcttcgc ccaccccaac ttgtttattg cagcttataa tggttacaaa taaagcaata 4140 gcatcacaaa tttcacaaat aaagcatttt tttcactgca ttctagttgt ggtttgtcca 4200 aactcatcaa tgtatcttat catgtctgta taccgtcgac ctctagctag agcttggcgt 4260 aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca 4320 tacgagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagc taactcacat 4380 taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt 4440 aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct 4500 cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa 4560 aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa 4620 aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc 4680 tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga 4740 caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc 4800 cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt 4860 ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct 4920 gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg 4980 agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta 5040 gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct 5100 acactagaag aacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa 5160 gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtttt tttgtttgca 5220 agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 5280 ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 5340 aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 5400 tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 5460 cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 5520 tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 5580 cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 5640 ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 5700 gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 5760 gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 5820 gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 5880 gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 5940 tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 6000 aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 6060 cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 6120 caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 6180 cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 6240 ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 6300 aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 6360 tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 6420 tc 6422 <210> 7 <211> 1572 <212> DNA <213> Artificial Sequence <220> <223> x0GC / dhfr cassette <400> 7 gacatgataa gatacattga tgagtttgga caaaccacaa ctagaatgca gtgaaaaaaa 60 tgctttattt gtgaaatttg tgatgctatt gctttatttg taaccattat aagctgcaat 120 aaacaagtta acaacaacaa ttgcattcat tttatgtttc aggttcaggg ggaggtgtgg 180 gaggtttttt aaagcaagta aaacctctac aaatgtggta tggctgatta tgatctctag 240 tcaaggcact atacatcaaa tattccttat taaccccttt acaaattaaa aagctaaagg 300 tacacaattt ttgagcatag ttattaatag cagacactct atgcctgtgt ggagtaagaa 360 aaaacagtat gttatgatta taactgttat gcctacttat aaaggttaca gaatattttt 420 ccataatttt cttgtatagc agtgcagctt tttcctttgt ggtgtaaata gcaaagcaag 480 caagagttct attactaaac acagcatgac tcaaaaaact tagcaattct gaaggaaagt 540 ccttggggtc ttctaccttt ctcttctttt ttggaggagt agaatgttga gagtcagcag 600 tagcctcatc atcactagat ggcatttctt ctgagcaaaa caggttttcc tcattaaagg 660 cattccacca ctgctcccat tcatcagttc cataggttgg aatctaaaat acacaaacaa 720 ttagaatcag tagtttaaca cattatacac ttaaaaattt tatatttacc ttagagcttt 780 aaatctctgt aggtagtttg tccaattatg tcacaccaca gaagtaaggt tccttcacaa 840 agatccaaag ccagcaaaag tcccatggtc ttataaaaat gcatagcttt aggaggggag 900 cagagaactt gaaagcatct tcctgttagt ctttcttctc gtagacttca aacttatact 960 tgatgccttt ttcctcctgg acctcagaga ggacgcctgg gtattctggg agaagtttat 1020 atttccccaa atcaatttct gggaaaaacg tgtcactttc aaattcctgc atgatccttg 1080 tcacaaagag tctgaggtgg cctggttgat tcatggcttc ctggtaaaca gaactgcctc 1140 cgactatcca aaccatgtct actttacttg ccaattccgg ttgttcaata agtcttaagg 1200 catcatccaa acttttggca agaaaatgag ctcctcgtgg tggttctttg agttctctac 1260 tgagaactat attaattctg tcctttaaag gtcgattctt ctcaggaatg gagaaccagg 1320 ttttcctacc cataatcacc agattctgtt taccttccac tgaagaggtt gtggtcattc 1380 tttggaagta cttgaactcg ttcctgagcg gaggccaggg taggtctccg ttcttgccaa 1440 tccccatatt ttgggacacg gcgacgatgc agttcaatgg tcgaaccatg atggcagcgg 1500 ggataaaatc ctaccagcct tcacgctagg attgccgtca agtttggcgc gaaatcgcag 1560 ccctgagctg tg 1572 <210> 8 <211> 988 <212> DNA <213> Artificial Sequence <220> <223> x1GC dhfr cassette <400> 8 attctccgcc ccatggctga ctaatttttt ttatttatgc agaggccgag gccgcctctg 60 cctctgagct attccagaag tagtgaggag gcttttttgg aggcctaggc ttttgcaaaa 120 agctcccggg atggttcgac cattgaactg catcgtcgcc gtgtcccaaa atatggggat 180 tggcaagaac ggaaacctac cctggcctcc gctcaggaac gagttcaagt acttccaaag 240 aatgaccaca acctcttcag tggaaggtaa acagaatctg gtgattatgg gtaggaaaac 300 ctggttctcc attcctgaga agaatcgacc tttaaaggac agaattaata tagttctcag 360 tagagaactc aaagaaccac cacgaggagc tcattttctt gccaaaagtt tggatgatgc 420 cttaagactt attgaacaac cggaattggc aagtaaagta gacatggttt ggatagtcgg 480 aggcagttct gtttaccagg aagccatgaa tcaaccaggc cacctcagac tctttgtgac 540 aaggatcatg caggaatttg aaagtgacac gtttttccca gaaattgatt tggggaaata 600 taaacttctc ccagaatacc caggcgtcct ctctgaggtc caggaggaaa aaggcatcaa 660 gtataagttt gaagtctacg agaagaaaga ctaacaggtt cgaaatgacc gaccaagcga 720 cgcccaacct gccatcacga gatttcgatt ccaccgccgc cttctatgaa aggttgggct 780 tcggaatcgt tttccgggac gccggctgga tgatcctcca gcgcggggat ctcatgctgg 840 agttcttcgc ccaccccaac ttgtttattg cagcttataa tggttacaaa taaagcaata 900 gcatcacaaa tttcacaaat aaagcatttt tttcactgca ttctagttgt ggtttgtcca 960 aactcatcaa tgtatcttat catgtctg 988 <210> 9 <211> 1010 <212> DNA <213> Artificial Sequence <220> X3GC dhfr cassette <400> 9 ctaactccgc ccagttccgc ccattctccg ccccatggct gactaatttt ttttatttat 60 gcagaggccg aggccgcctc tgcctctgag ctattccaga agtagtgagg aggctttttt 120 ggaggcctag gcttttgcaa aaagctcccg ggatggttcg accattgaac tgcatcgtcg 180 ccgtgtccca aaatatgggg attggcaaga acggaaacct accctggcct ccgctcagga 240 acgagttcaa gtacttccaa agaatgacca caacctcttc agtggaaggt aaacagaatc 300 tggtgattat gggtaggaaa acctggttct ccattcctga gaagaatcga cctttaaagg 360 acagaattaa tatagttctc agtagagaac tcaaagaacc accacgagga gctcattttc 420 ttgccaaaag tttggatgat gccttaagac ttattgaaca accggaattg gcaagtaaag 480 tagacatggt ttggatagtc ggaggcagtt ctgtttacca ggaagccatg aatcaaccag 540 gccacctcag actctttgtg acaaggatca tgcaggaatt tgaaagtgac acgtttttcc 600 cagaaattga tttggggaaa tataaacttc tcccagaata cccaggcgtc ctctctgagg 660 tccaggagga aaaaggcatc aagtataagt ttgaagtcta cgagaagaaa gactaacagg 720 ttcgaaatga ccgaccaagc gacgcccaac ctgccatcac gagatttcga ttccaccgcc 780 gccttctatg aaaggttggg cttcggaatc gttttccggg acgccggctg gatgatcctc 840 cagcgcgggg atctcatgct ggagttcttc gcccacccca acttgtttat tgcagcttat 900 aatggttaca aataaagcaa tagcatcaca aatttcacaa ataaagcatt tttttcactg 960 cattctagtt gtggtttgtc caaactcatc aatgtatctt atcatgtctg 1010 <210> 10 <211> 1044 <212> DNA <213> Artificial Sequence <220> <223> x6GC dhfr cassette <400> 10 catagtcccg cccctaactc cgcccatccc gcccctaact ccgcccagtt ccgcccattc 60 tccgccccat ggctgactaa ttttttttat ttatgcagag gccgaggccg cctctgcctc 120 tgagctattc cagaagtagt gaggaggctt ttttggaggc ctaggctttt gcaaaaagct 180 cccgggatgg ttcgaccatt gaactgcatc gtcgccgtgt cccaaaatat ggggattggc 240 aagaacggaa acctaccctg gcctccgctc aggaacgagt tcaagtactt ccaaagaatg 300 accacaacct cttcagtgga aggtaaacag aatctggtga ttatgggtag gaaaacctgg 360 ttctccattc ctgagaagaa tcgaccttta aaggacagaa ttaatatagt tctcagtaga 420 gaactcaaag aaccaccacg aggagctcat tttcttgcca aaagtttgga tgatgcctta 480 agacttattg aacaaccgga attggcaagt aaagtagaca tggtttggat agtcggaggc 540 agttctgttt accaggaagc catgaatcaa ccaggccacc tcagactctt tgtgacaagg 600 atcatgcagg aatttgaaag tgacacgttt ttcccagaaa ttgatttggg gaaatataaa 660 cttctcccag aatacccagg cgtcctctct gaggtccagg aggaaaaagg catcaagtat 720 aagtttgaag tctacgagaa gaaagactaa caggttcgaa atgaccgacc aagcgacgcc 780 caacctgcca tcacgagatt tcgattccac cgccgccttc tatgaaaggt tgggcttcgg 840 aatcgttttc cgggacgccg gctggatgat cctccagcgc ggggatctca tgctggagtt 900 cttcgcccac cccaacttgt ttattgcagc ttataatggt tacaaataaa gcaatagcat 960 cacaaatttc acaaataaag catttttttc actgcattct agttgtggtt tgtccaaact 1020 catcaatgta tcttatcatg tctg 1044 <210> 11 <211> 30 <212> DNA <213> Artificial Sequence <220> 223 dhfr 01 primer <400> 11 gcgcccggga tggttcgacc attgaactgc 30 <210> 12 <211> 34 <212> DNA <213> Artificial Sequence <220> <223> dhfr-02 primer <400> 12 cacttagaac ctgttagtct ttcttctcgt agac 34 <210> 13 <211> 33 <212> DNA <213> Artificial Sequence <220> <223> x 1GC primer <400> 13 tcaggatcca ttctccgccc catggctgac taa 33 <210> 14 <211> 35 <212> DNA <213> Artificial Sequence <220> <223> x 3GC primer <400> 14 catggatcct aactccgccc agttccgccc attct 35 <210> 15 <211> 34 <212> DNA <213> Artificial Sequence <220> <223> x6GC primer <400> 15 catggatccc atagtcccgc ccctaactcc gccc 34 <210> 16 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> X0GC primer <400> 16 cgatggatcc gacatgataa gatacattga t 31 <210> 17 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> X0GCRR primer <400> 17 cgttggatcc acagctcagg gctgcgattt c 31 <210> 18 <211> 33 <212> DNA <213> Artificial Sequence <220> <223> BISVpAR primer <400> 18 tcaggatccc agacatgata agatacattg atg 33  

Claims (20)

서열번호 10으로 정의되는 염기서열의 8번 내지 67번 염기 사이의 GC-풍부 영역(GC-Rich region)에서 하나 이상의 CCGCCC 반복서열이 제거된 염기서열을 갖는 고발현 유도 카세트.A high expression induction cassette having a nucleotide sequence from which at least one CCGCCC repeat sequence is removed from a GC-rich region between bases 8 to 67 of the nucleotide sequence defined by SEQ ID NO. 제1항에 있어서, 상기 GC-풍부 영역이 1개 이하의 CCGCCC 반복서열을 포함하는 것인 고발현 유도 카세트.The high expression induction cassette of claim 1, wherein the GC-rich region comprises up to one CCGCCC repeat sequence. 서열번호 7 내지 10으로 구성되는 군으로부터 선택되는 염기서열을 가지는 고발현 유도 카세트.A high expression induction cassette having a nucleotide sequence selected from the group consisting of SEQ ID NOs: 7 to 10. 제1항 내지 제3항 중 어느 한 항의 고발현 유도 카세트를 포함하는 발현 벡터. An expression vector comprising the high expression induction cassette of any one of claims 1 to 3. 제4항에 있어서, 목적 재조합 단백질을 암호화하는 유전자가 추가적으로 고발현 유도 카세트에 작동적으로 연결된 발현 벡터.The expression vector of claim 4, wherein the gene encoding the recombinant protein of interest is further operably linked to a high expression induction cassette. 제5항에 있어서, 상기 목적 재조합 단백질은 인간 에리트로포이에틴인 것을 특징으로 하는 발현 벡터.The expression vector according to claim 5, wherein the recombinant protein of interest is human erythropoietin. 제6항에 있어서, 서열번호 1 내지 4로 구성되는 군으로부터 선택되는 염기서열을 가지는 발현 벡터. The expression vector according to claim 6, which has a nucleotide sequence selected from the group consisting of SEQ ID NOs: 1 to 4. 제4항에 있어서, 서열번호 5 또는 6의 염기서열을 가지는 발현 벡터.The expression vector according to claim 4, which has a nucleotide sequence of SEQ ID NO: 5 or 6. 제8항의 발현벡터에 의해 형질전환된 세포주. A cell line transformed with the expression vector of claim 8. 제9항에 있어서, 수탁번호 KCTC10991BP 또는 KCTC10992BP 인 세포주. The cell line according to claim 9, which is accession number KCTC10991BP or KCTC10992BP. 제5항의 발현 벡터에 의해 형질 전환된 세포주.A cell line transformed with the expression vector of claim 5. 제11항에 있어서, 상기 세포주가 CHO 세포주임을 특징으로 하는 세포주.12. The cell line of claim 11, wherein said cell line is a CHO cell line. 제12항에 있어서, 상기 CHO 세포주가 디하이드로폴레이트 환원효소 유전자가 결핍된 것임을 특징으로 하는 세포주.13. The cell line according to claim 12, wherein the CHO cell line is deficient in the dihydrofolate reductase gene. 제12항에 있어서, 수탁번호 KCTC10993BP, KCTC10994BP 또는 KCTC10995BP인 세포주.The cell line according to claim 12, which is accession number KCTC10993BP, KCTC10994BP or KCTC10995BP. (a) 제1항 내지 제3항 중 어느 한 항의 고발현 유도 카세트를 포함하는 발현 벡터로 동물 세포주를 형질전환하는 단계; 및(a) transforming an animal cell line with an expression vector comprising the high expression induction cassette of any one of claims 1 to 3; And (b) 디하이드로폴레이트 환원효소 저해제의 존재 하에 상기 형질전환된 동물 세포주를 배양하는 단계를 포함하는 목적 재조합 단백질의 제조 방법.(b) culturing the transformed animal cell line in the presence of a dihydrofolate reductase inhibitor. 제15항에 있어서, 목적 재조합 단백질은 인간 에리트로포이에틴인 방법.The method of claim 15, wherein the recombinant protein of interest is human erythropoietin. 제16항에 있어서, 고시알산 함유 에리트로포이에틴을 정제하는 단계를 더 포함하는 방법.The method of claim 16, further comprising purifying the sialic acid-containing erythropoietin. 제15항에 있어서, 발현 벡터가 도1에 기재된 발현 벡터인 방법.The method of claim 15, wherein the expression vector is the expression vector described in FIG. 1. 제15항에 있어서, 단계 (a)의 동물 세포주가 디하이드로폴레이트 환원효소 결핍 CHO 세포주임을 특징으로 하는 방법.The method of claim 15, wherein the animal cell line of step (a) is a dihydrofolate reductase deficient CHO cell line. 제15항에 있어서, 단계 (b)의 형질전환된 동물 세포주가 수탁번호 KCTC10993BP, KCTC10994BP 또는 KCTC10995BP인 동물 세포주인 방법.The method of claim 15, wherein the transformed animal cell line of step (b) is an animal cell line with accession number KCTC10993BP, KCTC10994BP or KCTC10995BP.
KR1020060100507A 2006-10-16 2006-10-16 A Novel vector and expression cell line for mass production of recombinant protein and a process of producing recombinant protein using same KR100880509B1 (en)

Priority Applications (9)

Application Number Priority Date Filing Date Title
KR1020060100507A KR100880509B1 (en) 2006-10-16 2006-10-16 A Novel vector and expression cell line for mass production of recombinant protein and a process of producing recombinant protein using same
EP07833373A EP2087118B1 (en) 2006-10-16 2007-10-16 A novel vector and expression cell line for mass production of recombinant protein and a process of producing recombinant protein using same
US12/444,567 US8957196B2 (en) 2006-10-16 2007-10-16 Vector and expression cell line for mass production of recombinant protein and a process of producing recombinant protein using same
PCT/KR2007/005064 WO2008048037A1 (en) 2006-10-16 2007-10-16 A novel vector and expression cell line for mass production of recombinant protein and a process of producing recombinant protein using same
CN2007800384033A CN101605899B (en) 2006-10-16 2007-10-16 A novel vector and expression cell line for mass production of recombinant protein and a process of producing recombinant protein using same
AT07833373T ATE512229T1 (en) 2006-10-16 2007-10-16 NEW VECTOR AND EXPRESSION CELL LINE FOR THE MASS PRODUCTION OF RECOMBINANT PROTEIN AND METHOD FOR THE PRODUCTION OF RECOMBINANT PROTEIN THEREFROM
ES07833373T ES2366061T3 (en) 2006-10-16 2007-10-16 A NEW VECTOR AND CELLULAR EXPRESSION LINE FOR MASS PRODUCTION OF A RECOMBINATING PROTEIN AND A PROCESS OF RECOMBINATING PROTEIN PRODUCTION USING THE SAME.
JP2009533241A JP5076084B2 (en) 2006-10-16 2007-10-16 Novel vector and expression cell line for mass production of recombinant protein, and method for producing recombinant protein using the same
HK10101256.5A HK1137481A1 (en) 2006-10-16 2010-02-04 A novel vector and expression cell line for mass production of recombinant protein and a process of producing recombinant protein using same

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020060100507A KR100880509B1 (en) 2006-10-16 2006-10-16 A Novel vector and expression cell line for mass production of recombinant protein and a process of producing recombinant protein using same

Publications (2)

Publication Number Publication Date
KR20080034364A KR20080034364A (en) 2008-04-21
KR100880509B1 true KR100880509B1 (en) 2009-01-28

Family

ID=39314214

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020060100507A KR100880509B1 (en) 2006-10-16 2006-10-16 A Novel vector and expression cell line for mass production of recombinant protein and a process of producing recombinant protein using same

Country Status (9)

Country Link
US (1) US8957196B2 (en)
EP (1) EP2087118B1 (en)
JP (1) JP5076084B2 (en)
KR (1) KR100880509B1 (en)
CN (1) CN101605899B (en)
AT (1) ATE512229T1 (en)
ES (1) ES2366061T3 (en)
HK (1) HK1137481A1 (en)
WO (1) WO2008048037A1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012057527A2 (en) 2010-10-26 2012-05-03 Hanmi Holdings Co., Ltd. Method for mass production of factor vii/viia
WO2013051900A2 (en) 2011-10-06 2013-04-11 Hanmi Science Co., Ltd. Blood coagulation factor ⅶ and ⅶa derivatives, conjugates and complexes comprising the same, and use thereof
WO2015030479A1 (en) 2013-08-30 2015-03-05 한미약품 주식회사 Method for mass producing human blood coagulation factor vii derivative
EP3348274A1 (en) 2013-01-31 2018-07-18 Hanmi Pharm. Co., Ltd. A method of virus inactivation in composition comprising factor vii
US11254724B2 (en) 2014-12-30 2022-02-22 Hanmi Pharm. Co., Ltd. Glucagon derivatives
US11872283B2 (en) 2011-06-17 2024-01-16 Hanmi Science Co., Ltd Conjugate comprising oxyntomodulin and an immunoglobulin fragment, and use thereof

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101030978B1 (en) * 2008-07-10 2011-04-28 (주) 에이프로젠 Recombinant Expression Vectors for Animal Cells
KR101681513B1 (en) * 2010-02-04 2016-12-02 한미사이언스 주식회사 Method for mass production of human follicle stimulating hormone
CN104093736B (en) * 2012-01-31 2016-10-19 坎瓦克斯生物技术公司 There is the GPCR of the cell surface expression of improvement
US11046975B2 (en) * 2013-10-07 2021-06-29 Prestige Biopharma Pte. Ltd. Bicistronic expression vector for antibody expression and method for producing antibody using same
RU2548806C1 (en) * 2014-01-20 2015-04-20 Общество с ограниченной ответственностью "РД-БИОТЕХ" Synthetic dna encoding human erythropoietin, comprising its vector, method of production of erythropoietin producer strain, erythropoietin producer strain
KR20160113268A (en) * 2014-01-28 2016-09-28 베이징 한미 파마슈티컬 컴퍼니 리미티드 Bifunctional fusion protein, preparation method therefor, and use thereof
CN106496302B (en) * 2015-09-08 2021-12-10 三生国健药业(上海)股份有限公司 Method for purifying protein by ion exchange chromatography
CN108753819B (en) * 2018-05-31 2022-05-20 暨南大学 Eukaryotic expression vector, eukaryotic expression system, preparation methods and applications of eukaryotic expression vector and eukaryotic expression system and GDF11 protein
EP4276111A1 (en) 2021-01-08 2023-11-15 Beijing Hanmi Pharmaceutical Co., Ltd. Antibody specifically binding to 4-1bb and antigen-binding fragment of antibody
US20240052037A1 (en) 2021-01-08 2024-02-15 Beijing Hanmi Pharmaceutical Co., Ltd. Anti-pd-l1/anti-cd47 natural antibody structure-like heterodimeric form bispecific antibody and preparation thereof
JP2024503395A (en) 2021-01-08 2024-01-25 北京韓美薬品有限公司 Antibodies that specifically bind to PD-L1 and antigen-binding fragments thereof

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR970006499A (en) * 1995-07-27 1997-02-21 김은영 Method for preparing erythropoietin
KR19990000214A (en) * 1997-06-03 1999-01-15 박원훈 Cell line producing erythropoietin (EPO), its preparation method and method of producing EPO using the same
KR20010022107A (en) * 1997-07-23 2001-03-15 로셰 디아그노스틱스 게엠베하 Production of erythropoietin by endogenous gene activation

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4399216A (en) 1980-02-25 1983-08-16 The Trustees Of Columbia University Processes for inserting DNA into eucaryotic cells and for producing proteinaceous materials

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR970006499A (en) * 1995-07-27 1997-02-21 김은영 Method for preparing erythropoietin
KR19990000214A (en) * 1997-06-03 1999-01-15 박원훈 Cell line producing erythropoietin (EPO), its preparation method and method of producing EPO using the same
KR20010022107A (en) * 1997-07-23 2001-03-15 로셰 디아그노스틱스 게엠베하 Production of erythropoietin by endogenous gene activation

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Nucleic Acids Res, 23(15),pp.3041-3049, (1995. 8.)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012057527A2 (en) 2010-10-26 2012-05-03 Hanmi Holdings Co., Ltd. Method for mass production of factor vii/viia
KR101380728B1 (en) * 2010-10-26 2014-04-03 한미사이언스 주식회사 Method for mass production of Factor VII/VIIa
US9243237B2 (en) 2010-10-26 2016-01-26 Hanmi Science Co., Ltd Method for mass production of factor VII/VIIA
US11872283B2 (en) 2011-06-17 2024-01-16 Hanmi Science Co., Ltd Conjugate comprising oxyntomodulin and an immunoglobulin fragment, and use thereof
WO2013051900A2 (en) 2011-10-06 2013-04-11 Hanmi Science Co., Ltd. Blood coagulation factor ⅶ and ⅶa derivatives, conjugates and complexes comprising the same, and use thereof
EP3417881A1 (en) 2011-10-06 2018-12-26 Hanmi Science Co., Ltd. Blood coagulation factor vii and viia derivatives, conjugates and complexes comprising the same, and use thereof
EP3348274A1 (en) 2013-01-31 2018-07-18 Hanmi Pharm. Co., Ltd. A method of virus inactivation in composition comprising factor vii
WO2015030479A1 (en) 2013-08-30 2015-03-05 한미약품 주식회사 Method for mass producing human blood coagulation factor vii derivative
US11254724B2 (en) 2014-12-30 2022-02-22 Hanmi Pharm. Co., Ltd. Glucagon derivatives

Also Published As

Publication number Publication date
ATE512229T1 (en) 2011-06-15
EP2087118A4 (en) 2010-03-03
WO2008048037A1 (en) 2008-04-24
US20100120089A1 (en) 2010-05-13
US8957196B2 (en) 2015-02-17
KR20080034364A (en) 2008-04-21
JP2010506586A (en) 2010-03-04
EP2087118B1 (en) 2011-06-08
CN101605899B (en) 2013-07-31
CN101605899A (en) 2009-12-16
HK1137481A1 (en) 2010-07-30
ES2366061T3 (en) 2011-10-14
JP5076084B2 (en) 2012-11-21
EP2087118A1 (en) 2009-08-12

Similar Documents

Publication Publication Date Title
KR100880509B1 (en) A Novel vector and expression cell line for mass production of recombinant protein and a process of producing recombinant protein using same
US11965012B2 (en) Compositions and methods for TCR reprogramming using fusion proteins
AU775988B2 (en) Ligand activated transcriptional regulator proteins
CN110023500A (en) The attenuation glutamine synthelase alternatively marked
KR20210108423A (en) Adeno-associated virus (AAV) producer cell lines and related methods
WO2004067743A1 (en) Vaccine production using transposon based vectors
KR20150126978A (en) Psma antibody-drug conjugates
JP2003534775A (en) Methods for destabilizing proteins and uses thereof
US20040235011A1 (en) Production of multimeric proteins
KR20230019063A (en) Triple function adeno-associated virus (AAV) vectors for the treatment of C9ORF72 associated diseases
AU2016378480A1 (en) Endothelium-specific nucleic acid regulatory elements and methods and use thereof
CN112877292A (en) Human antibody producing cell
CN114807140B (en) Myogenic cell blood glucose responsive SIA expression promoter, recombinant vector, construction method and application thereof
KR20240021906A (en) Expression vectors, bacterial sequence-free vectors, and methods of making and using the same
KR20230142712A (en) Adeno-Associated Virus (AAV) Production
KR20230117327A (en) An expression vector comprising a soluble alkaline phosphatase construct and a polynucleotide encoding the soluble alkaline phosphatase construct.
KR20240022571A (en) Systems, methods and components for RNA-guided effector recruitment
KR102393402B1 (en) Composition for the preventing or treating cancer comprising an expression vector for dual genes
CN117881788A (en) Expression vectors, bacterial sequence-free vectors, and methods of making and using the same

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
E90F Notification of reason for final refusal
E902 Notification of reason for refusal
E701 Decision to grant or registration of patent right
GRNT Written decision to grant
FPAY Annual fee payment

Payment date: 20120313

Year of fee payment: 5

FPAY Annual fee payment

Payment date: 20131231

Year of fee payment: 6

FPAY Annual fee payment

Payment date: 20141215

Year of fee payment: 7

FPAY Annual fee payment

Payment date: 20161216

Year of fee payment: 9

FPAY Annual fee payment

Payment date: 20180104

Year of fee payment: 10

FPAY Annual fee payment

Payment date: 20181218

Year of fee payment: 11

FPAY Annual fee payment

Payment date: 20191226

Year of fee payment: 12