KR20070053798A - CAEV-based vector systems

Publication number
KR20070053798A
Authority
KR
South Korea
Prior art keywords
vector
less
caev
polynucleotide
coding sequence
Prior art date
Application number
KR1020077007725A
Other languages
Korean (ko)
Other versions
KR100884214B1 (en)
Inventor
김연수
이수경
김종필
Original Assignee
주식회사 마크로젠
Priority date
Filing date
Publication date
Application filed by 주식회사 마크로젠
Priority to KR1020077007725A
Publication of KR20070053798A
Application granted
Publication of KR100884214B1

Classifications

    • C: CHEMISTRY; METALLURGY
    • C12: BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N: MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00: Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09: Recombinant DNA-technology
    • C12N15/63: Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/64: General methods for preparing the vector, for introducing it into the cell or for selecting the vector-containing host
    • C12N15/66: General methods for inserting a gene into a vector to form a recombinant vector using cleavage and ligation; Use of non-functional linkers or adaptors, e.g. linkers containing the sequence for a restriction endonuclease

Landscapes

  • Genetics & Genomics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Wood Science & Technology (AREA)
  • Biomedical Technology (AREA)
  • Organic Chemistry (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Molecular Biology (AREA)
  • Microbiology (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Cell Biology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)

Abstract

The present invention relates to caprine arthritis encephalitis virus (CAEV)-based vectors and vector systems useful for delivering nucleic acids to non-dividing and dividing cells. Methods of delivering nucleic acids to non-dividing and dividing cells using the vector systems are also disclosed.

Description

CAEV-BASED VECTOR SYSTEMS

The present invention relates to lentiviral vectors useful for polynucleotide delivery and, more specifically, to caprine arthritis encephalitis virus (CAEV)-based vectors useful for delivering polynucleotides to non-dividing and dividing cells.

Lentiviruses are a subgroup of retroviruses that can infect non-dividing cells as well as dividing cells. Lentivirus-derived vectors are ideal tools for delivering foreign genes to target cells because they stably integrate genes into the genomes of dividing and non-dividing cells and thereby mediate long-term gene expression (Gilbert and Wong-Staal, 2001; Mitrophanous et al., 1999; Naldini et al., 1996; Sauter and Gasmi, 2001).

Lentiviruses have been isolated from several vertebrate species, including primates (e.g., the human and simian immunodeficiency viruses HIV-1, HIV-2, and SIV) as well as non-primates (e.g., feline immunodeficiency virus (FIV), bovine immunodeficiency virus (BIV), equine infectious anemia virus (EIAV), caprine arthritis encephalitis virus (CAEV), and visna virus). Of these, HIV and SIV are currently the best known. However, because such vectors could recombine into infectious and pathogenic forms, using these systems in humans raises serious safety concerns. It is therefore preferable to use non-primate lentiviruses in gene therapy.

Among non-primate lentiviral vectors, those derived from FIV (Curran and Nolan, 2002) and EIAV [US 2001/0044149] have been studied most extensively, and little progress has been made on other non-primate vectors.

Like all lentiviruses, CAEV can infect and replicate in terminally differentiated and non-dividing cells as well as in dividing cells. Several biological features of CAEV make it an attractive candidate for development as a gene transfer/therapy vector. First, the natural host of CAEV is the goat, and no case of human infection by CAEV has been reported. Second, among lentiviruses, the CAEV genome is phylogenetically the most distant from HIV-1. Third, the genomic organization of CAEV is relatively simple compared with other lentiviruses: the CAEV genome contains three structural genes (gag, pol, env) and three regulatory/accessory genes (vif, tat, and rev).

Despite these advantages, efforts to develop a CAEV-based delivery system have not succeeded. They have yielded only unsafe and inefficient recombinant viral vector production systems, which make the use of a CAEV-based gene delivery system impractical.

In 1998, L. Mselli-Lakhal et al. reported a first-generation CAEV-based vector system, but its viral titer (i.e., 10-187 TU/ml) was below an effective level. The authors attributed this inefficiency to insufficient accumulation of genomic RNA in the cytoplasm and to the low packaging efficiency of the vector RNA. A further problem with that work is that it used infectious wild-type virus ("helper virus") as the packaging system, which leaves it with little practical value for human application.

Accordingly, there is a need for a safe and effective CAEV-based lentiviral vector system that can mediate gene transfer to a wide range of dividing and non-dividing cells.

Summary of the Invention

The present invention relates broadly to the production of CAEV-based lentiviral vector particles useful for delivering foreign polynucleotides to target cells. Such vector particles can be used for antiviral, antitumor, and/or gene therapy.

The present invention provides a transfer vector for use in the CAEV-based vector production system described above, the transfer vector comprising:

(a) a CAEV packaging sequence consisting essentially of

(i) the untranslated region between the CAEV 5' LTR and the CAEV gag-coding sequence, and

(ii) nucleotides 1 to X of the CAEV gag-coding sequence, linked to the 3' end of the untranslated region, where X is less than 613; and

(b) cis-acting elements required for polyadenylation, RNA transport, reverse transcription, and integration, operably associated with the packaging sequence.
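The two-part packaging sequence defined above (the 5' untranslated region followed by nucleotides 1 to X of the gag-coding sequence, with X below 613) can be illustrated with a minimal Python sketch. The function name and the placeholder sequences are hypothetical and chosen only for illustration; they are not taken from the sequence listing.

```python
def build_packaging_sequence(utr: str, gag_cds: str, x: int) -> str:
    """Join the 5' untranslated region to nucleotides 1..x of the gag-coding sequence.

    The bound x < 613 comes from the text; everything else is illustrative.
    """
    if not 1 <= x < 613:
        raise ValueError("X must be at least 1 and less than 613")
    if x > len(gag_cds):
        raise ValueError("gag-coding sequence is shorter than X")
    # Nucleotides 1..x (1-based, inclusive) are the first x characters.
    return utr + gag_cds[:x]


# Placeholder sequences (hypothetical, not real CAEV sequence).
example_utr = "GGTACC"
example_gag = "ATG" + "ACGT" * 200

packaging = build_packaging_sequence(example_utr, example_gag, 327)
print(len(packaging))  # length of the UTR plus 327 gag nucleotides
```

The sketch only concatenates strings. It does not model the cis-acting elements of part (b), which sit elsewhere in the transfer vector.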

In one embodiment of the invention, X is selected from the group consisting of 60, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 525, 550, 575, and 600.

In another embodiment of the invention, X is selected from the group consisting of:

(a) X is greater than 25 and less than 600,
(b) X is greater than 25 and less than 500,
(c) X is greater than 25 and less than 400,
(d) X is greater than 25 and less than 300,
(e) X is greater than 25 and less than 200,
(f) X is greater than 50 and less than 600,
(g) X is greater than 50 and less than 500,
(h) X is greater than 50 and less than 400,
(i) X is greater than 50 and less than 300,
(j) X is greater than 50 and less than 200,
(k) X is greater than 75 and less than 600,
(l) X is greater than 75 and less than 500,
(m) X is greater than 75 and less than 400,
(n) X is greater than 75 and less than 300,
(o) X is greater than 75 and less than 200,
(p) X is greater than 100 and less than 600,
(q) X is greater than 100 and less than 500,
(r) X is greater than 100 and less than 400,
(s) X is greater than 100 and less than 300,
(t) X is greater than 100 and less than 200,
(u) X is greater than 125 and less than 600,
(v) X is greater than 125 and less than 500,
(w) X is greater than 125 and less than 400,
(x) X is greater than 125 and less than 300,
(y) X is greater than 125 and less than 200,
(z) X is greater than 150 and less than 600,
(aa) X is greater than 150 and less than 500,
(bb) X is greater than 150 and less than 400,
(cc) X is greater than 150 and less than 300,
(dd) X is greater than 150 and less than 200,
(ee) X is greater than 200 and less than 600,
(ff) X is greater than 200 and less than 500,
(gg) X is greater than 200 and less than 400,
(hh) X is greater than 200 and less than 300,
(ii) X is greater than 200 and less than 200,
(jj) X is greater than 250 and less than 600,
(kk) X is greater than 250 and less than 500,
(ll) X is greater than 250 and less than 400, and
(mm) X is greater than 250 and less than 300.
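The group of ranges (a) through (mm) above is regular enough to tabulate programmatically. The following Python sketch transcribes the lower and upper bounds as printed and reports which ranges contain a given X; the names `RANGES` and `matching_ranges` are hypothetical. Item (ii) is transcribed exactly as printed (greater than 200 and less than 200), and as written it admits no value.

```python
from itertools import product

# Bounds transcribed from items (a)-(mm); each pair is (exclusive lower, exclusive upper).
RANGES = (
    list(product((25, 50, 75, 100, 125, 150), (600, 500, 400, 300, 200)))  # (a)-(dd)
    + [(200, upper) for upper in (600, 500, 400, 300)]                      # (ee)-(hh)
    + [(200, 200)]                                                          # (ii), as printed
    + [(250, upper) for upper in (600, 500, 400, 300)]                      # (jj)-(mm)
)


def matching_ranges(x: int) -> list[tuple[int, int]]:
    """Return every listed range whose open interval contains x."""
    return [(lo, hi) for lo, hi in RANGES if lo < x < hi]


# Example: count how many listed ranges contain X = 327.
print(len(matching_ranges(327)))
```

Both bounds are treated as exclusive because the text says "greater than" and "less than" throughout.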

In another embodiment, X is greater than 40 and less than 613.

In another embodiment, X is greater than 57 and less than 613.

In yet another embodiment, X is about 327.

In one embodiment of the invention, the start codon of the gag-coding sequence is mutated to prevent translation of the gag protein. In another embodiment, the start codon is mutated to TAG.

In another embodiment of the transfer vector of the invention, an ATG codon located X base pairs downstream of the start codon ATG of the gag-coding sequence is also mutated to prevent translation of the gag protein, where X is less than 30. In another embodiment, X is about 21.
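The codon changes described above can be sketched as a simple string operation: replace the ATG at the start of the gag fragment, and a second ATG a fixed distance downstream, with TAG (a stop codon). The helper name and the toy sequence below are hypothetical; only the ATG-to-TAG change and the offset of about 21 come from the text.

```python
def mutate_atg_to_tag(seq: str, offset: int) -> str:
    """Replace the ATG that begins at 0-based `offset` with TAG."""
    if seq[offset:offset + 3] != "ATG":
        raise ValueError(f"no ATG codon at offset {offset}")
    return seq[:offset] + "TAG" + seq[offset + 3:]


# Toy gag fragment (hypothetical): start ATG, 18 filler nucleotides,
# then a second ATG beginning 21 bp downstream of the first.
toy_gag = "ATG" + "GCT" * 6 + "ATG" + "GCT" * 4

mutated = mutate_atg_to_tag(toy_gag, 0)    # start codon
mutated = mutate_atg_to_tag(mutated, 21)   # downstream ATG, X = 21

print(mutated[:3], mutated[21:24])  # both positions now read TAG
```

The check inside the helper guards against applying the change at a position that does not actually hold an ATG, which is the easiest mistake to make when mixing 0-based and 1-based coordinates.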

The transfer vector of the invention may further comprise an RRE region.

In another embodiment of the invention, the transfer vector comprises a CAEV 3' LTR from which the U3 region has been deleted.

The transfer vector of the invention may further comprise a heterologous promoter. In one embodiment of the invention, the heterologous promoter is the human cytomegalovirus major immediate early promoter (HCMV MIEP). In another embodiment, the transfer vector is pCAH/SINd1 (SEQ ID NO: 68).

The transfer vector of the invention may further comprise a transcription cassette comprising a heterologous polynucleotide of interest operably linked to a heterologous promoter (e.g., the human cytomegalovirus major immediate early promoter, HCMV MIEP, or the murine cytomegalovirus major immediate early promoter, MCMV MIEP). Such a transfer vector allows the polynucleotide of interest to be incorporated into virus particles, and thereby provides a means of increasing the number of infected host cells that contain the polynucleotide.

The present invention also provides a CAEV-based lentiviral vector system that produces CAEV-based, replication-defective vector particles useful for delivering foreign polynucleotides to mammalian cells. The vector particles can infect and transduce mammalian cells. The vector system comprises the transfer vector described above and a packaging vector system, the packaging vector system comprising a first polynucleotide that comprises a CAEV gag-pol-coding sequence and an RRE, and a second polynucleotide that comprises a viral envelope-coding sequence.

In one embodiment, the second polynucleotide comprises a non-CAEV env-coding sequence. In one embodiment, the second polynucleotide comprises a VSV-G- or GaLV-coding sequence.

In another embodiment, the CAEV vector system comprises a third polynucleotide sequence comprising a rev-coding sequence.

In another embodiment, the CAEV vector system comprises a fourth polynucleotide sequence comprising a vif-coding sequence.

In another embodiment, the first polynucleotide of each CAEV vector system described above further comprises a heterologous regulatory sequence operably linked to the CAEV gag-pol-coding sequence.

In another embodiment, the second polynucleotide of the CAEV vector system described above further comprises a heterologous regulatory sequence operably linked to the viral envelope-coding sequence.

In another embodiment, the third polynucleotide further comprises a heterologous regulatory sequence operably linked to the rev-coding sequence.

In another embodiment, the fourth polynucleotide further comprises a heterologous regulatory sequence operably linked to the vif-coding sequence.

In one embodiment of the invention, the CAEV vector system comprises a packaging vector system that lacks a competent CAEV packaging sequence. In another embodiment, the packaging vector system lacks the 5' terminal region between the splice donor site of the CAEV genome and the gag start codon.

In one embodiment, the CAEV vector system comprises a first vector comprising the first polynucleotide and a second vector comprising the second polynucleotide. In another embodiment, the vector system comprises a first vector comprising the first polynucleotide, a second vector comprising the second polynucleotide, and a third vector comprising the third polynucleotide. In another embodiment, the vector system comprises a first vector comprising the first polynucleotide, a second vector comprising the second polynucleotide, a third vector comprising the third polynucleotide, and a fourth vector comprising the fourth polynucleotide. The third vector may be pHYK/rev (SEQ ID NO: 75), and the fourth vector may be pHYK/vif (SEQ ID NO: 76).

In another embodiment, the vector system comprises a first vector comprising the first, third, and fourth polynucleotides, and a second vector comprising the second polynucleotide.

In one embodiment, the first vector of the CAEV vector system comprises a CAEV gag-coding sequence and an RRE operably linked to a heterologous promoter. The promoter may be the MCMV MIEP. In another embodiment, the CAEV vector system comprises the first vector pMGP/RRE (SEQ ID NO: 77).

In one embodiment, the second vector of the CAEV vector system comprises a VSV-G-coding sequence operably linked to a heterologous promoter. The promoter may be the HCMV MIEP. The second vector may further comprise a beta-globin intron. In another embodiment, the CAEV vector system comprises the second vector pHGVSV-G (SEQ ID NO: 74).

In one embodiment, the second vector of the CAEV vector system comprises a GaLV env-coding sequence operably linked to a heterologous promoter. The promoter may be the MCMV MIEP. The second vector may further comprise a eukaryotic elongation factor-1 alpha intron. In another embodiment, the CAEV vector system comprises the second vector pMYKEF-1/env (SEQ ID NO: 72).
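The two-, three-, and four-vector layouts described above all distribute the same set of packaging functions across different numbers of plasmids. A minimal Python sketch of that bookkeeping follows. The plasmid names are those given in the text; the component labels ("gag-pol", "RRE", "env", "rev", "vif"), the `Plasmid` class, and the helper function are hypothetical conveniences, and the split into required and optional components is an assumption read from the embodiments above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plasmid:
    name: str
    components: frozenset  # labels of the coding/regulatory elements it carries


# Assumption: gag-pol, RRE and env are always present; rev and vif are optional.
REQUIRED = {"gag-pol", "RRE", "env"}


def missing_components(plasmids: list[Plasmid]) -> set[str]:
    """Return the required packaging components not supplied by any plasmid."""
    present = set().union(*(p.components for p in plasmids))
    return REQUIRED - present


# Four-vector layout using the plasmid names from the text.
four_vector = [
    Plasmid("pMGP/RRE", frozenset({"gag-pol", "RRE"})),
    Plasmid("pHGVSV-G", frozenset({"env"})),
    Plasmid("pHYK/rev", frozenset({"rev"})),
    Plasmid("pHYK/vif", frozenset({"vif"})),
]

print(missing_components(four_vector))      # nothing missing
print(missing_components(four_vector[:1]))  # env is not yet supplied
```

The same check applies unchanged to the two-vector layout in which a single first vector carries the first, third, and fourth polynucleotides.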

Another aspect of the invention is a method of producing CAEV-based lentiviral vector particles useful for infecting mammalian cells. The method comprises (a) transfecting cells with the vector system described above under conditions suitable for producing CAEV-based particles, where the vector particles are infection- and transduction-competent and replication-defective, and (b) recovering the vector particles.

The present invention also provides a composition comprising CAEV-based lentiviral vector particles and, optionally, a carrier, where the vector particles are produced by the method described above.

The present invention also provides a kit comprising the transfer vector or the CAEV-based lentiviral vector system described above.

The present invention also provides a packaging cell comprising a CAEV gag-pol-coding sequence and an RRE and, optionally, a viral env-coding sequence. The packaging cell may further comprise a rev-coding and/or vif-coding sequence. The cell is useful for packaging the RNA form of the transfer vector into vector particles that are infection- and transduction-competent and replication-defective.

In one embodiment, the vector system comprises a cell comprising the first polynucleotide described above. The vector system may further comprise the third and/or fourth polynucleotide described above.

In another embodiment, the vector system comprises a cell comprising the first and second polynucleotides described above. The vector system may further comprise the third and/or fourth polynucleotide described above.

In another embodiment, the vector system comprises a cell comprising a first vector that comprises a CAEV gag-pol-coding sequence and an RRE. The first vector may further comprise a rev-coding and/or vif-coding sequence. Alternatively, the cell may comprise a first vector comprising a CAEV gag-pol-coding sequence and an RRE, a second vector comprising a rev-coding sequence, and/or a third vector comprising a vif-coding sequence.

In some embodiments, the vector system comprises a cell comprising a first vector that comprises a CAEV gag-pol-coding sequence and an RRE, and a second vector that comprises a viral env-coding sequence. The first vector may further comprise a rev-coding and/or vif-coding sequence. Alternatively, the cell may comprise a first vector comprising a CAEV gag-pol-coding sequence and an RRE, a second vector comprising a viral env-coding sequence, and optionally a third vector comprising a rev-coding sequence and/or a fourth vector comprising a vif-coding sequence.

Another aspect of the invention is a method of delivering a polynucleotide or polypeptide to a mammalian cell, or of replicating a polynucleotide encoding the polypeptide. The method comprises contacting the mammalian cell with the vector particles described above under conditions that allow the polynucleotide to integrate into the genome of the mammalian cell and, optionally, allow the polypeptide encoded by the polynucleotide to be expressed. The mammalian cell may be a dividing cell, a non-dividing cell, or a CD34+ stem cell. The method may further comprise isolating the cell from a mammal before contacting the cell with the vector particles. The method may further comprise propagating the cell in culture after contacting it with the vector particles. The method may further comprise reintroducing the cell into the mammal before or after propagating the contacted cell.

The present invention also provides a method of delivering a polypeptide to a vertebrate, comprising administering to the vertebrate CAEV-based lentiviral vector particles that comprise a heterologous polynucleotide of interest and are produced by the method described above, such that the polypeptide encoded by the delivered polynucleotide is expressed in the vertebrate in an amount sufficient to detect or elicit a biological response in the vertebrate.

The present invention also provides a vector comprising a CAEV packaging sequence consisting essentially of (a) the untranslated region between the CAEV 5' LTR and the CAEV gag-coding sequence, and (b) nucleotides 1 to X of the CAEV gag-coding sequence, linked to the 3' end of the untranslated region, where X is less than 613.

As presented herein, the inventors have found that the production of these CAEV-based lentiviral vector particles shows improved efficiency and safety in lentiviral vector design compared with existing CAEV-based vector particles. The improved efficiency is achieved by identifying the optimal lengths of the untranslated region between the 5' LTR and the gag start codon and of the gag-coding region, which together provide an efficient packaging sequence that enables efficient encapsidation and thereby increases viral titer. Viral titer is further improved by using strong heterologous promoters in the design of the packaging plasmids. The improved safety is achieved by constructing tat-independent transfer vectors and a plasmid-based packaging system.

도 1은 CAEV 프로바이러스 유전체 구조에 대한 개략도이다.1 is a schematic of the CAEV provirus genome structure.

도 2a는 플라스미드 pMGP/RRE(서열번호: 77)의 개략도이다. pMGP/RRE(서열번호: 77)는 CAEV gag-pol 코딩 영역(bp 709-5,243)의 상류에 위치하는 MCMV MIEP 영역(bp 1-660), RRE 영역(bp 5,426-5,627 또는 bp 5,368-5,669), 및 소 성장호르몬(BGH) 폴리아데닐화 신호(bp 5,751-5,984)를 포함하는 9,446 bp의 플라스미드이다. 상기 벡터는 또한 네오마이신 내성 유전자 코딩 영역(bp 8,151-7,155), SV40 복제 기점(origin of replicaiton)(bp 8,509-8,152), Col E1 복제 기점(bp 6,115-6,698), 및 암피실린 내성 유전자 영역(bp 9,362-8,528)을 포함한다.2A is a schematic of plasmid pMGP / RRE (SEQ ID NO: 77). pMGP / RRE (SEQ ID NO: 77) is the MCMV MIEP region (bp 1-660), RRE region (bp 5,426-5,627 or bp 5,368-5,669) located upstream of the CAEV gag-pol coding region (bp 709-5,243). , And a 9,446 bp plasmid containing bovine growth hormone (BGH) polyadenylation signal (bp 5,751-5,984). The vector also contains neomycin resistance gene coding region (bp 8,151-7,155), SV40 origin of replicaiton (bp 8,509-8,152), Col E1 origin of replication (bp 6,115-6,698), and ampicillin resistance gene region (bp 9,362-8,528).

도 2b는 플라스미드 pMGP/REV/RRE의 개략도이다. pMGP/REV/RRE는 CAEV gag-pol 코딩 영역(bp 726-5,258)의 상류에 위치하는 MCMV MIEP 영역(bp 1-660에 위치) 및 CAEV의 주 스플라이싱 제공 위치(bp 688-704), rev 코딩 영역의 제1 엑 손(exon)(bp 5,383-5,494), RRE 영역(bp 5,540-5,841), rev 코딩 영역의 제2 엑손(bp 5,888-6,177), 및 소 성장호르몬(BGH) 폴리아데닐화 신호(bp 6,229-6,462)를 포함하는 9,924 bp의 플라스미드이다. 상기 벡터는 또한 네오마이신 내성 유전자 코딩 영역(bp 7,633-8,629), SV40 복제 기점(bp 8,987-8,630), Col E1 복제 기점(bp 6,593-7,176), 및 암피실린 내성 유전자 영역(bp 9,840-9,006)을 포함한다.2B is a schematic of plasmid pMGP / REV / RRE. pMGP / REV / RRE is the MCMV MIEP region (located at bp 1-660) upstream of the CAEV gag-pol coding region (bp 726-5,258) and the primary splicing provision location of CAEV (bp 688-704), first exon of the rev coding region (bp 5,383-5,494), RRE region (bp 5,540-5,841), second exon of the rev coding region (bp 5,888-6,177), and bovine growth hormone (BGH) polyade 9,924 bp plasmid containing a nilation signal (bp 6,229-6,462). The vector also contains neomycin resistance gene coding regions (bp 7,633-8,629), SV40 origin of replication (bp 8,987-8,630), Col E1 origin of replication (bp 6,593-7,176), and ampicillin resistance gene region (bp 9,840-9,006). Include.

FIG. 3A is a schematic diagram of plasmid pCAH/SINd (SEQ ID NO: 73). pCAH/SINd (SEQ ID NO: 73) is a 3,566 bp plasmid comprising the HCMV MIEP (bp 1-588), the R-U5 sequence region of the CAEV 5'LTR (bp 611-772), the RRE region (bp 796-1,154), and the U3-deleted CAEV 3'LTR region (bp 1,275-1,458). The vector also contains the Col E1 origin of replication (bp 1,863-2,466) and the kanamycin resistance gene coding region (bp 2,698-3,510).

FIG. 3B is a schematic diagram of plasmid pCAH/SINd0 (SEQ ID NO: 67). pCAH/SINd0 (SEQ ID NO: 67) is a 3,911 bp plasmid comprising the HCMV MIEP (bp 1-588), the R-U5 sequence region of the CAEV 5'LTR (bp 611-772), the remaining untranslated sequence containing the primer binding site (PBS) (bp 773-789), the RRE region (bp 1,141-1,499), and the U3-deleted CAEV 3'LTR region (bp 1,620-1,803). The vector also contains the Col E1 origin of replication (bp 2,208-2,791) and the kanamycin resistance gene coding region (bp 3,043-3,855).

FIG. 3C is a schematic diagram of plasmid pCAH/SINd1 (SEQ ID NO: 68). pCAH/SINd1 (SEQ ID NO: 68) is a 4,238 bp plasmid comprising the HCMV MIEP promoter (bp 1-588); the R-U5 sequence region of the CAEV 5'LTR (bp 610-772); the remaining untranslated region containing the PBS site (bp 773-789); a 327 bp fragment of the gag gene (bp 1,121-1,448) carrying ATG-to-TAG point mutations at the start ATG codon (bp 1,121-1,123) and at an ATG codon located downstream of the start ATG codon (bp 1,142-1,144); the RRE region (bp 1,468-1,826); and the U3-deleted CAEV 3'LTR region (bp 1,947-2,130). The vector also contains the Col E1 origin of replication (bp 2,535-3,118) and the kanamycin resistance gene region (bp 3,370-4,182).
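The gag fragment described above carries ATG-to-TAG point mutations at two codons that lie 21 bp apart (bp 1,121 and bp 1,142). The following Python sketch illustrates that kind of substitution on a toy sequence. The sequence is invented for illustration and is not CAEV sequence, and the helper name `mutate_codon` is hypothetical; the codons are placed at bp 4 and bp 25 only to keep the same 21 bp spacing.

```python
def mutate_codon(seq, start_bp, expected="ATG", replacement="TAG"):
    """Replace the codon beginning at 1-based position start_bp.

    Raises ValueError if the codon found there is not the expected one,
    which guards against off-by-one coordinate errors.
    """
    i = start_bp - 1  # convert 1-based bp coordinate to 0-based index
    codon = seq[i:i + 3]
    if codon != expected:
        raise ValueError(f"expected {expected} at bp {start_bp}, found {codon!r}")
    return seq[:i] + replacement + seq[i + 3:]


# Toy sequence (NOT CAEV sequence): a start ATG at bp 4-6 and a second
# ATG at bp 25-27, 21 bp further downstream.
toy = "GCC" + "ATG" + "GCTAAAGGTCCAGAAGTT" + "ATG" + "CCGTAA"

mutated = mutate_codon(mutate_codon(toy, 4), 25)

if __name__ == "__main__":
    print(toy)
    print(mutated)
```

Because each substitution replaces three bases with three bases, the downstream coordinates of the plasmid map are unaffected.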

FIG. 3D is a schematic diagram of plasmid pCAH/SINd2 (SEQ ID NO: 69). pCAH/SINd2 (SEQ ID NO: 69) is a 4,523 bp plasmid comprising the HCMV MIEP (bp 1-588); the R-U5 sequence region of the CAEV 5'LTR (bp 610-772); the remaining untranslated region containing the PBS site (bp 773-789); a 612 bp fragment of the gag gene (bp 1,121-1,733) carrying point mutations at the start ATG codon (bp 1,121-1,123) and at an ATG codon located downstream of the start ATG codon (bp 1,142-1,144); the RRE region (bp 1,753-2,111); and the U3-deleted CAEV 3'LTR region (bp 2,232-2,415). The vector also contains the Col E1 origin of replication (bp 2,820-3,403) and the kanamycin resistance gene coding region (bp 3,655-4,467).

FIG. 3E is a schematic diagram of plasmid pCAH/SINd3 (SEQ ID NO: 70). pCAH/SINd3 (SEQ ID NO: 70) is a 4,819 bp plasmid comprising the HCMV MIEP (bp 1-588); the R-U5 sequence region of the CAEV 5'LTR (bp 610-772); the remaining untranslated region containing the PBS site (bp 773-789); a 908 bp fragment of the gag gene (bp 1,121-2,029) carrying point mutations at the start ATG codon (bp 1,121-1,123) and at an ATG codon located downstream of the start ATG codon (bp 1,142-1,144); the RRE region (bp 2,049-2,407); and the U3-deleted CAEV 3'LTR region (bp 2,549-2,711). The vector also contains the Col E1 origin of replication (bp 3,116-3,699) and the kanamycin resistance gene coding region (bp 3,951-4,763).

FIG. 3F is a schematic diagram of plasmid pCAH/SINd4 (SEQ ID NO: 71). pCAH/SINd4 (SEQ ID NO: 71) is a 5,112 bp plasmid comprising the HCMV MIEP (bp 1-588); the R-U5 sequence region of the CAEV 5'LTR (bp 610-772); the remaining untranslated region containing the PBS site (bp 773-1,120); a 1,198 bp fragment of the gag gene (bp 1,121-2,319) carrying point mutations at the start ATG codon (bp 1,121-1,123) and at an ATG codon located downstream of the start ATG codon (bp 1,142-1,144); the RRE region (bp 2,342-2,700); and the U3-deleted CAEV 3'LTR region (bp 2,842-3,004). The vector also contains the Col E1 origin of replication (bp 3,409-3,992) and the kanamycin resistance gene coding region (bp 4,244-5,056).

FIG. 3G is a schematic diagram of plasmid pCAH/SINd1/hlacZ (SEQ ID NO: 79). pCAH/SINd1/hlacZ (SEQ ID NO: 79) is an 8,127 bp plasmid derived from pCAH/SINd1 (SEQ ID NO: 68) that expresses the lacZ reporter gene. The vector comprises two HCMV MIEP promoter regions (located at bp 1-588 and bp 1,866-2,460, respectively); the R-U5 sequence region of the CAEV 5'LTR (bp 610-772); the remaining untranslated region containing the PBS site (bp 773-789); a 325 bp fragment of the gag gene (bp 1,121-1,446) carrying point mutations at the start ATG codon (bp 1,121-1,123) and at an ATG codon located downstream of the start ATG codon (bp 1,142-1,144); the lacZ gene coding sequence (bp 2,541-5,711); and the U3-deleted CAEV 3'LTR region (bp 5,782-6,019). The vector also contains the Col E1 origin of replication (bp 6,424-7,007) and the kanamycin resistance gene coding region (bp 7,259-8,071).

FIG. 3H is a schematic diagram of plasmid pCAH/SINd60/hlacZ (SEQ ID NO: 78). pCAH/SINd60/hlacZ (SEQ ID NO: 78) is a 7,856 bp plasmid comprising two HCMV MIEP promoter regions (located at bp 1-588 and bp 1,595-2,189, respectively); the R-U5 sequence region of the CAEV 5'LTR (bp 610-772); the remaining untranslated region containing the PBS site (bp 773-789); a 60 bp fragment of the gag gene (bp 1,121-1,181) carrying point mutations at the start ATG codon (bp 1,121-1,123) and at an ATG codon located downstream of the start ATG codon (bp 1,142-1,144); the RRE region (bp 1,195-1,565); the lacZ gene coding sequence (bp 2,270-5,440); and the U3-deleted CAEV 3'LTR region (bp 5,511-5,748). The vector also contains the Col E1 origin of replication (bp 6,153-6,736) and the kanamycin resistance gene coding region (bp 6,988-7,800).

FIG. 4 is a schematic diagram of plasmid pHYK/vif (SEQ ID NO: 76). pHYK/vif (SEQ ID NO: 76) is a 5,729 bp plasmid comprising the HCMV MIEP (bp 1-596), the vif gene coding region (bp 691-1,380), the BGH polyadenylation signal (bp 1,467-1,695), the Col E1 origin of replication (bp 1,826-2,409), the neomycin resistance gene coding region (bp 3,862-2,866), and the ampicillin resistance gene coding region (bp 5,270-4,239).

FIG. 5 is a schematic diagram of plasmid pHYK/rev (SEQ ID NO: 75). pHYK/rev (SEQ ID NO: 75) is a 5,419 bp plasmid comprising the HCMV MIEP (bp 1-596), the rev gene coding region (bp 672-1,073), the BGH polyadenylation signal (bp 1,157-1,385), the Col E1 origin of replication (bp 1,516-2,099), the neomycin resistance gene coding region (bp 3,552-2,556), and the ampicillin resistance gene coding region (bp 4,960-3,929).

FIG. 6A is a schematic diagram of plasmid pHGVSV-G (SEQ ID NO: 74). pHGVSV-G (SEQ ID NO: 74) is a 7,623 bp plasmid comprising the HCMV MIEP (bp 1-596), the β-globin intron region (bp 714-1,599), the VSV-G coding region (bp 1,632-3,312), the BGH polyadenylation signal (bp 3,361-3,589), the Col E1 origin of replication (bp 3,720-4,303), the neomycin resistance gene coding region (bp 5,756-4,760), the ampicillin resistance gene coding region (bp 7,164-6,133), and the F1 origin of replication (bp 7,165-7,621).

FIG. 6B is a schematic diagram of plasmid pMYKEF1/env (SEQ ID NO: 72). pMYKEF1/env (SEQ ID NO: 72) is a 7,579 bp plasmid comprising the MCMV MIEP (bp 1-665), the human EF1-α intron region (bp 668-1,618), the GaLV env coding region (bp 1,699-3,701), the BGH polyadenylation signal (bp 3,885-4,118), the Col E1 origin of replication (bp 4,349-4,832), the neomycin resistance gene coding region (bp 6,290-5,284), and the ampicillin resistance gene coding region (bp 7,496-6,666).

FIG. 7 is a photograph showing the relative amounts of transfer vector RNA transcribed from gene transfer vectors transfected into human 293T target cells.

FIG. 8 is a pair of photographs showing human 293T target cells transduced with CAEV (A) and MuLV (B) vectors.

FIG. 9 is a photograph showing the relative amounts of transfer vector RNA expressed in transfected 293T cells (lanes 1, 2 and 3) or encapsidated in and released from 293T packaging cells (lanes 4, 5 and 6).

FIG. 10 is a photograph showing the relative amounts of transfer vector RNA encapsidated in and released from human 293T packaging cells.

FIG. 11 is a photograph showing the relative amounts of integrated retroviral cDNA after infection with, and reverse transcription of, lentiviral vectors pseudotyped with the VSV-G or GaLV envelope protein.

FIG. 12 is a photograph showing the relative amounts of viral vector cDNA integrated into the chromosomes of infected host cells.

FIG. 13 is a pair of graphs showing the results of FACS analysis of (A) control cells and (B) G1-arrested cells.

FIG. 14 is a pair of graphs showing (A) the number of transduced cells and (B) the relative transduction efficiency of HIV-1-, CAEV-, and MuLV-derived viral vectors in dividing and non-dividing cells.

The present invention relates, in particular, to CAEV-based lentiviral vector systems and to methods of using these vectors to deliver a polypeptide of interest to dividing and non-dividing cells.

CAEV Genome

Wild-type CAEV has a dimeric RNA genome (single-stranded, positive polarity) that replicates through a double-stranded DNA intermediate and is packaged into a spherical enveloped virion containing a nucleoprotein core. The genome contains three genes encoding the structural and enzymatic proteins Gag, Pol and Env, and a long terminal repeat (LTR) at each end of the integrated viral genome. The genome also encodes three regulatory proteins: Vif, Tat and Rev.

The gag gene encodes the internal structural proteins, the pol gene encodes the viral replication enzymes, and the env gene encodes the envelope glycoprotein that mediates attachment of the virus to the cell surface. The Vif protein is involved in viral infectivity, and the Tat protein is involved in transactivation of the 5'LTR. The Rev protein and its target sequence, the RRE (Rev response element), are involved in the stability of viral RNA, the regulation of viral RNA splicing, and the transport of large RNAs (unspliced or singly spliced) from the nucleus to the cytoplasm. The proviral LTR sequence comprises the U3 region (a unique sequence element located downstream of the structural proteins), the R region (a short repeat at each end of the genome), and the U5 region (a unique sequence element immediately following the R sequence). The U3 region of the 5'LTR contains the viral promoter and enhancer. The 3' end of the genome contains a polyadenylation signal in the 3'LTR.

The wild-type CAEV genome also contains several cis-acting elements: the att (attachment) sites at the LTR termini for proviral integration; promoter elements in the 5'LTR that regulate transcription initiation of the integrated provirus; the PBS (primer binding site) located downstream of the 5'LTR; the 5' splice donor site; the packaging sequence (used herein interchangeably with packaging site or packaging signal); the ppt (polypurine tract) site located near the 3'LTR; and the polyadenylation signal in the 3'LTR.

As used herein, the term "cis" refers to genes present on the same chromosome or linear stretch of nucleic acid. Accordingly, the term "cis-defect" refers to a defect found on a linear sequence of nucleic acid. The term "cis-acting" refers to the controlling effect of a regulatory gene on a gene present on the same chromosome or linear stretch of nucleic acid. For example, a promoter that affects the synthesis of downstream mRNA is a cis-acting regulatory element.

Full-length genomic sequences are known for two isolates of CAEV and are deposited in the National Center for Biotechnology Information (NCBI) database as NC_001463 (SEQ ID NO: 1) and AF322109 (SEQ ID NO: 2) (Saltarelli et al., 1990, and Gjerset, B.J. et al., unpublished). The nucleic acids of the claimed invention are not limited to a particular isolate of CAEV and may be any sequence that retains the known function of the corresponding genomic sequence. For example, it is known in the art that natural mutations arise in gene sequences during viral replication, giving rise to similar nucleic acid sequences that encode proteins with similar functions.

A sequence alignment of the NC_001463 (SEQ ID NO: 1) and AF322109 (SEQ ID NO: 2) genomic sequences is shown in Table 1. As Table 1 shows, the sequences share substantial nucleic acid homology, but differences at the nucleic acid level are evident. The variability of the CAEV gag region shown in Table 2 (SEQ ID NOs: 3-6) is particularly significant. Sequence alignments of the NC_001463 5'LTR, pol, rev and vif genes with the corresponding genes from AF322109 can be found in Tables 3-6 (SEQ ID NOs: 7-14). In addition, many partial sequences of the CAEV genome are known and deposited. For example, accession numbers AY081139, AY101347, AY101348, AY047362, AF402668, AF402667, AF402666, AF402665, AF402664, AJ305042, AJ305041, and AJ305040 all provide gag gene sequences from Brazilian isolates of CAEV. Accession numbers AF015181, L78453, L78451, L78450, L78447 and L78446 also contain gag gene sequences from various CAEV isolates. Accession numbers X64828 and M63106 contain rev gene sequences from various CAEV isolates. Accession numbers AF015182, AJ305053, K03327, L78448, L78452 and U35814 contain pol gene sequences from various CAEV isolates.

Sequence alignments between the gag gene of NC_001463 (SEQ ID NOs: 15, 17) and the gag genes from AF015181 (SEQ ID NOs: 16, 17), AF402664 (SEQ ID NOs: 20, 26), AF402665 (SEQ ID NOs: 21, 27), AF402666 (SEQ ID NOs: 22, 28), AF402667 (SEQ ID NOs: 23, 29) and AF402668 (SEQ ID NOs: 24, 30) are found in Table 8. Sequence alignments between the gag gene of NC_001463 (SEQ ID NOs: 31, 35) and the gag genes from AJ305040 (SEQ ID NOs: 32, 36), AJ305041 (SEQ ID NOs: 33, 37) and AJ305042 (SEQ ID NOs: 34, 38) are found in Table 9. The sequence alignment between the gag gene of NC_001463 (SEQ ID NOs: 39, 41) and the gag gene from AY047362 (SEQ ID NOs: 40, 42) is found in Table 10. The sequence alignment between the gag gene of NC_001463 (SEQ ID NOs: 43, 45) and the gag gene from AY081139 (SEQ ID NOs: 44, 46) is found in Table 11. Sequence alignments between the gag gene of NC_001463 (SEQ ID NOs: 47, 50) and the gag genes from AY101347 (SEQ ID NOs: 48, 51) and AY101348 (SEQ ID NOs: 49, 52) are found in Table 12. Sequence alignments between the gag gene of NC_001463 (SEQ ID NOs: 53, 59) and the gag genes from L78446 (SEQ ID NOs: 54, 60), L78447 (SEQ ID NOs: 55, 61), L78450 (SEQ ID NOs: 56, 62), L78451 (SEQ ID NOs: 57, 63) and L78453 (SEQ ID NOs: 58, 64) are found in Table 13.

The alignments were performed with Vector NTI (Informax, USA) using the following parameters:

For pairwise alignment: gap opening penalty: 15; gap extension penalty: 6.6

For multiple alignment: gap opening penalty: 15; gap extension penalty: 6.6; gap separation penalty range: 8
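To illustrate how a gap opening penalty of 15 and a gap extension penalty of 6.6 combine, the following Python sketch scores a single gap under a conventional affine scheme: a fixed opening penalty plus an extension penalty for each gapped position. This is an assumption made for illustration. The source does not state the exact formula used by Vector NTI, and some programs charge the extension penalty only from the second gapped position onward. The function name `affine_gap_penalty` is hypothetical.

```python
# Penalty values quoted in the text; the scoring formula is an assumption.
GAP_OPEN = 15.0
GAP_EXTEND = 6.6


def affine_gap_penalty(gap_length, gap_open=GAP_OPEN, gap_extend=GAP_EXTEND):
    """Penalty for one contiguous gap of gap_length positions.

    Assumed convention: opening penalty once, plus the extension penalty
    for every gapped position. Other conventions exist.
    """
    if gap_length <= 0:
        return 0.0
    return gap_open + gap_extend * gap_length


if __name__ == "__main__":
    for n in (1, 3, 10):
        print(f"gap of {n:2d}: penalty {affine_gap_penalty(n):.1f}")
```

Under this convention one long gap costs less than several short gaps of the same total length, which is why affine penalties tend to keep insertions and deletions contiguous.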

Table 14 summarizes the percent homology values for the sequence alignments of the gag gene sequences presented above. Table 15 summarizes the percent homology for the whole-genome alignment of NC_001463 (SEQ ID NO: 1) and AF322109 (SEQ ID NO: 2) and for the alignments of the gag, 5'LTR, pol, rev and vif regions. Because the genomic sequences of two CAEV isolates, as well as many partial sequences from various CAEV isolates, are known and corresponding sequences can readily be identified, undue experimentation is not required to practice the claimed invention using various CAEV sequences.
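A percent value of the kind summarized in Tables 14 and 15 can be computed from a pairwise alignment by counting identical columns. The following Python sketch shows one common definition, identical positions divided by aligned columns. The source does not state how Vector NTI calculates its homology percentages, so this definition is an assumption, and the sequences are toy examples rather than CAEV sequence.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns in which both sequences share a base.

    Columns where both sequences have a gap are ignored; a column with a
    gap in only one sequence counts as a mismatch (assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a, aligned_b)
        if not (a == gap and b == gap)
    ]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Toy aligned sequences, not CAEV sequence.
    print(f"{percent_identity('ATGC-A', 'ATGGTA'):.1f}%")
```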

CAEV Vectors of the Invention

The vectors of the present invention provide a means of replicating and expressing polynucleotides or genes, independently of the host cell nucleus, in host cells from a broad phylogenetic range. Such vector-mediated incorporation of heterologous nucleic acid into a host cell is referred to as transfection or infection of the host cell, where infection means the use of viral particles and transfection means the use of naked nucleic acid molecules.

The term "gene" refers to a DNA sequence comprising the control and coding sequences necessary for the production of a polypeptide or precursor. The terms "polynucleotide" and "nucleic acid molecule", used interchangeably herein, refer to polymers of nucleotides of any length (two or more) and include both DNA and RNA. The nucleotides may be deoxyribonucleotides, ribonucleotides, nucleotide analogs (for example, with modified phosphate groups, bases or sugars), or any substrate that can be incorporated into a polymer by a suitable enzyme such as a DNA polymerase or an RNA polymerase. A polypeptide may be encoded by the full-length coding sequence or by any portion of the coding sequence long enough that the desired activity of the polypeptide is retained.

The term "wild-type" refers to a gene or gene product that has the characteristics of that gene or gene product when isolated from a naturally occurring source. A wild-type gene is the form most frequently observed in a population and is therefore arbitrarily designated the "normal" or "wild-type" form of the gene. In contrast, the term "mutated" or "mutant" refers to a gene or gene product that displays modifications in sequence or functional properties (that is, altered characteristics) when compared with the wild-type gene or gene product. Naturally occurring mutants can be isolated; they are identified by the fact that they have altered characteristics when compared with the wild-type gene or gene product.

As used in this specification and the claims, the singular forms "a", "an" and "the" include plural referents unless the context clearly dictates otherwise. Thus, for example, a reference to "a polynucleotide" includes a plurality of polynucleotides, and a reference to "a stem cell" includes a plurality of cells.

As used herein, the term "retrovirus" refers to RNA viruses that use reverse transcriptase during their replication cycle. The retroviral genomic RNA is converted into double-stranded DNA by reverse transcriptase. This double-stranded DNA form of the virus can integrate into the chromosome of the infected cell; once integrated, it is called a "provirus". The provirus serves as a template for RNA polymerase II and directs the expression of RNA molecules that encode the structural proteins and enzymes needed to produce new viral particles.
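The conversion of genomic RNA into double-stranded DNA described above can be pictured at the level of base pairing with a short Python sketch. It is a deliberate simplification: it shows only which bases pair with which, and ignores the tRNA primer, the strand transfers and the generation of the LTRs that occur during real reverse transcription. The function names are hypothetical and the input is a toy sequence.

```python
# Watson-Crick partners used when a DNA strand is copied from an RNA template.
_RNA_TO_DNA_COMPLEMENT = {"A": "T", "U": "A", "G": "C", "C": "G"}


def first_strand_cdna(rna):
    """Minus-strand DNA complementary to the RNA, written 5' to 3'."""
    return "".join(_RNA_TO_DNA_COMPLEMENT[base] for base in reversed(rna.upper()))


def plus_strand_dna(rna):
    """Plus-strand DNA: same sense as the genomic RNA, with T in place of U."""
    return rna.upper().replace("U", "T")


if __name__ == "__main__":
    toy_rna = "AUGGC"  # toy sequence, not CAEV sequence
    print("RNA (+):", toy_rna)
    print("DNA (-):", first_strand_cdna(toy_rna))
    print("DNA (+):", plus_strand_dna(toy_rna))
```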

As used herein, the term "lentivirus" refers to a group (or genus) of retroviruses that cause slowly progressing disease. Viruses in this group include human immunodeficiency virus (HIV); visna-maedi virus, which causes encephalitis (visna) or pneumonia (maedi) in sheep; caprine arthritis-encephalitis virus (CAEV); equine infectious anemia virus (EIAV); feline immunodeficiency virus (FIV); bovine immunodeficiency virus (BIV); and simian immunodeficiency virus (SIV). Diseases caused by these viruses are characterized by a long incubation period and a protracted course. In general, these viruses latently infect monocytes and macrophages, from which they spread to other cells.

As used herein, the term "vector" refers to nucleic acid molecules that transfer polynucleotide (for example, DNA) segments from one cell to another. The term "vehicle" is sometimes used interchangeably with "vector". Any form of vehicle or vector falls within this definition. For example, vectors include, but are not limited to, viral particles, plasmids and transposons.

The basic techniques for constructing the vectors of the present invention are well known to those of ordinary skill in the art and can be found in references such as Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed. (Cold Spring Harbor, N.Y., 1989). A variety of methods are available for joining DNA fragments, and the choice among them is readily made by the skilled artisan according to the nature of the termini of the DNA fragments.

Suitable polyadenylation sequences of the present invention include, but are not limited to, the bovine growth hormone (BGH) polyadenylation signal (Pfarr et al., 1986), the SV40 early region polyadenylation site (Hall et al., 1983), the SV40 late region polyadenylation site (Carswell and Alwine, 1989), β-globin polyA, and herpes simplex virus thymidine kinase polyA.

Promoters of the present invention may include promoters of mammalian or viral origin and will be sufficient to direct the transcription of a distally located sequence (that is, a sequence linked to the 5' end of the promoter sequence) in a cell. The promoter region may also include control elements that enhance or repress transcription. Promoters suitable for the present invention include, but are not limited to, the human or murine cytomegalovirus major immediate early promoter (HCMV MIEP or MCMV MIEP), elongation factor 1 alpha (ef-1α), and the Rous sarcoma virus long terminal repeat promoter (pRSV). Intron sequences may also be combined with the promoter. Intron sequences include, but are not limited to, the ef-1α intron and the β-globin intron. Inducible expression systems may also be used. Examples of inducible systems include, but are not limited to, the ecdysone-inducible mammalian expression system (Invitrogen, CA, USA) and the Tet-On and Tet-Off gene expression systems (Clontech, CA, USA). Cell- or tissue-specific promoters may be used for targeted expression of gene sequences in specific cell populations.

Enhancer sequences upstream of the promoter, or terminator sequences downstream of the coding region, may optionally be included in the vectors of the invention to facilitate expression. The vectors of the invention may also contain additional nucleic acid sequences, such as an intron sequence, a localization sequence, or a signal sequence, so that the cell efficiently and effectively processes the protein expressed by the nucleic acid of the vector. Examples of intron sequences include the β-globin intron (Kim et al., 2002) and the human EF-1α intron (Kim et al., 2002). Such additional sequences are inserted into the vector so that they are operably linked to the promoter sequence where transcription is desired, or additionally to the initiation and processing sequences where translation and processing are desired. Alternatively, the inserted sequences may be placed at any position in the vector.

The term "operably linked" means a linkage between a gene sequence and a promoter or other regulatory or processing sequence such that transcription of the gene sequence is directed by an operably linked promoter sequence, translation of the gene sequence is directed by an operably linked translational regulatory sequence, and post-translational processing of the gene sequence is directed by an operably linked processing sequence.

The term "SIN vector" means a self-inactivating vector having a truncated U3 region in the 3' LTR. During reverse transcription, the truncated U3 is copied to the 5' LTR, resulting in the loss of the LTR's transcriptional capacity and of its inhibitory effect on the internal promoter.

The packaging sequence of the transfer vector consists essentially of (i) the untranslated region between the CAEV 5' LTR and the CAEV gag-coding sequence, and (ii) nucleotides 1 to X of the CAEV gag-coding sequence, linked to the 3' end of the untranslated region, where X is less than 613. In one embodiment, X is selected from the group consisting of 60, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 525, 550, 575, and 600.

In another embodiment of the invention, X is selected from the group consisting of:

(a) X is greater than 25 and less than 600,

(b) X is greater than 25 and less than 500,

(c) X is greater than 25 and less than 400,

(d) X is greater than 25 and less than 300,

(e) X is greater than 25 and less than 200,

(f) X is greater than 50 and less than 600,

(g) X is greater than 50 and less than 500,

(h) X is greater than 50 and less than 400,

(i) X is greater than 50 and less than 300,

(j) X is greater than 50 and less than 200,

(k) X is greater than 75 and less than 600,

(l) X is greater than 75 and less than 500,

(m) X is greater than 75 and less than 400,

(n) X is greater than 75 and less than 300,

(o) X is greater than 75 and less than 200,

(p) X is greater than 100 and less than 600,

(q) X is greater than 100 and less than 500,

(r) X is greater than 100 and less than 400,

(s) X is greater than 100 and less than 300,

(t) X is greater than 100 and less than 200,

(u) X is greater than 125 and less than 600,

(v) X is greater than 125 and less than 500,

(w) X is greater than 125 and less than 400,

(x) X is greater than 125 and less than 300,

(y) X is greater than 125 and less than 200,

(z) X is greater than 150 and less than 600,

(aa) X is greater than 150 and less than 500,

(bb) X is greater than 150 and less than 400,

(cc) X is greater than 150 and less than 300,

(dd) X is greater than 150 and less than 200,

(ee) X is greater than 200 and less than 600,

(ff) X is greater than 200 and less than 500,

(gg) X is greater than 200 and less than 400,

(hh) X is greater than 200 and less than 300,

(ii) X is greater than 200 and less than 200,

(jj) X is greater than 250 and less than 600,

(kk) X is greater than 250 and less than 500,

(ll) X is greater than 250 and less than 400, and

(mm) X is greater than 250 and less than 300.
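
The ranges (a) to (mm) follow a regular pattern of lower and upper bounds, which can be tabulated as in the following minimal Python sketch. The names `LOWER_BOUNDS`, `UPPER_BOUNDS`, `build_ranges`, and `is_satisfiable` are illustrative assumptions and do not appear in the specification. The sketch also reports any range that no integer X can satisfy; item (ii), as written, is such a range, and the sketch does not guess at an intended bound.

```python
from itertools import product
import string


def range_labels():
    """Yield labels a, b, ..., z, aa, bb, ... in the style of the list."""
    repeat = 1
    while True:
        for letter in string.ascii_lowercase:
            yield letter * repeat
        repeat += 1


# Bounds as they appear in items (a) to (ii); names are hypothetical.
LOWER_BOUNDS = [25, 50, 75, 100, 125, 150, 200]
UPPER_BOUNDS = [600, 500, 400, 300, 200]


def build_ranges():
    """Map each label to its (lower, upper) pair, both bounds exclusive."""
    pairs = list(product(LOWER_BOUNDS, UPPER_BOUNDS))
    # Items (jj) to (mm): lower bound 250 with four upper bounds.
    pairs += [(250, upper) for upper in (600, 500, 400, 300)]
    return {label: pair for label, pair in zip(range_labels(), pairs)}


def is_satisfiable(lower, upper):
    """True if at least one integer X satisfies lower < X < upper."""
    return upper - lower >= 2


RANGES = build_ranges()
EMPTY_RANGES = [
    label for label, (lower, upper) in RANGES.items()
    if not is_satisfiable(lower, upper)
]

if __name__ == "__main__":
    print(len(RANGES), "ranges; unsatisfiable as written:", EMPTY_RANGES)
```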

In another embodiment, X is greater than 40 and less than 613. In yet another embodiment, X is about 327. In one embodiment of the transfer vector, the codon that initiates gag translation is mutated (e.g., ATG replaced with TAG, TTG, CTG, or ATT) or deleted. The term "codon" means a sequence of three nucleotides in a DNA or messenger RNA molecule that specifies the incorporation of a particular amino acid into a growing polypeptide chain. The transfer vector further comprises a heterologous promoter and one or more cis-acting sequences.
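
By way of illustration only, the assembly of a packaging sequence from the 5' untranslated region and the first X nucleotides of the gag-coding sequence, with optional replacement of the initiation codon, might be sketched as follows. The function name `build_packaging_sequence` and the toy sequences are hypothetical placeholders; they are not CAEV sequences and are not taken from the sequence listing.

```python
GAG_FRAGMENT_LIMIT = 613  # X must be less than 613
ALLOWED_START_REPLACEMENTS = {"TAG", "TTG", "CTG", "ATT"}


def build_packaging_sequence(utr, gag_cds, x, start_replacement=None):
    """Join the untranslated region to nucleotides 1..X of the gag CDS.

    If start_replacement is given, the ATG initiation codon at the start
    of the gag fragment is replaced with it (e.g., ATG -> TAG).
    """
    if not 1 <= x < GAG_FRAGMENT_LIMIT:
        raise ValueError("X must be at least 1 and less than 613")
    if x > len(gag_cds):
        raise ValueError("X exceeds the length of the supplied gag sequence")
    fragment = gag_cds[:x].upper()
    if start_replacement is not None:
        if start_replacement not in ALLOWED_START_REPLACEMENTS:
            raise ValueError("replacement codon is not one of those listed")
        if fragment[:3] != "ATG":
            raise ValueError("gag fragment does not begin with ATG")
        fragment = start_replacement + fragment[3:]
    return utr.upper() + fragment


# Toy inputs for demonstration only (hypothetical, not CAEV sequences).
TOY_UTR = "GGTACC"
TOY_GAG = "ATG" + "GCA" * 110  # 333 nt placeholder coding sequence

if __name__ == "__main__":
    seq = build_packaging_sequence(TOY_UTR, TOY_GAG, 327, "TAG")
    print(len(seq), seq[:12])
```

The length check is kept separate from the codon replacement so that the same function covers both the embodiments that retain the initiation codon and those that mutate it.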

As used herein, the term "packaging signal" or "packaging sequence" means a sequence located adjacent to the 5' LTR of the CAEV genome that is required for encapsidation of viral RNA into the viral capsid or particle. Many retroviral vectors use the minimal packaging signal (also called the psi [ψ] sequence) required for encapsidation of the viral genome. Thus, as used herein, the terms "packaging sequence", "packaging signal", "psi", and the symbol "ψ" refer to the non-coding sequence required for encapsidation of CAEV RNA strands during viral particle formation.

In another embodiment of the invention, the transfer vector also comprises a transcription cassette. As used herein, the term "transcription cassette" means a particular group of genetic elements, generally a fragment or segment of nucleic acid comprising a polynucleotide that is operably linked to a heterologous promoter and expresses a polypeptide of interest. The cassette can be removed from, or inserted into, a vector or plasmid as a single unit.

An example of a transfer vector of the invention is shown in FIG. 3C. FIG. 3C shows plasmid pCAH/SINd1 (SEQ ID NO: 68), a 4,238 bp plasmid comprising the HCMV MIEP promoter, the R-U5 sequence region of the CAEV 5' LTR, the remaining untranslated sequence including the PBS site, a 327 bp fragment of the gag gene carrying an ATG→TAG double point mutation, the RRE region, and a U3-deleted CAEV 3' LTR region. The vector also includes a Col E1 origin of replication (bp 2535-3118) and a kanamycin resistance gene region (bp 3370-4182). Other examples of transfer vectors are shown in FIGS. 3A to 3H.
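
The two coordinate ranges stated for pCAH/SINd1 can be recorded and checked against the stated plasmid length, as in the following sketch. It assumes the coordinates are 1-based and inclusive, which the specification does not state; the `Feature` class and `validate` function are illustrative names only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    name: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed inclusive

    @property
    def length(self):
        return self.end - self.start + 1


PLASMID_LENGTH = 4238  # pCAH/SINd1, as stated in the text

FEATURES = [
    Feature("Col E1 origin of replication", 2535, 3118),
    Feature("kanamycin resistance gene region", 3370, 4182),
]


def validate(features, plasmid_length):
    """Check that features lie within the plasmid and do not overlap."""
    ordered = sorted(features, key=lambda f: f.start)
    for feature in ordered:
        if not 1 <= feature.start <= feature.end <= plasmid_length:
            raise ValueError(f"{feature.name} is out of bounds")
    for left, right in zip(ordered, ordered[1:]):
        if right.start <= left.end:
            raise ValueError(f"{left.name} overlaps {right.name}")
    return ordered


if __name__ == "__main__":
    for feature in validate(FEATURES, PLASMID_LENGTH):
        print(feature.name, feature.length, "bp")
```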

The present invention provides a CAEV vector system comprising the transfer vector described above and a packaging vector system. The packaging vector system comprises first and second polynucleotide vector sequences. The first polynucleotide comprises CAEV gag-pol and RRE coding sequences, and the second polynucleotide comprises a viral envelope coding sequence. In one embodiment, the second polynucleotide encodes a non-CAEV envelope.

As used herein, the phrase "structural gene" means a polynucleotide sequence encoding a protein required for encapsidation (e.g., packaging) of the viral genome, and includes gag, pol, and env.

An example of a first packaging vector of the invention is shown in FIG. 2A. FIG. 2A shows plasmid pMGP/RRE (SEQ ID NO: 77). The plasmid has 9,446 base pairs (bp) and comprises the MCMV MIEP region, the CAEV gag-pol coding region, the RRE region, and the bovine growth hormone (BGH) polyadenylation signal. The vector also includes a neomycin resistance gene coding region, an SV40 origin of replication, a Col E1 origin of replication, and an ampicillin resistance gene region.

The host range of cells that the viral vectors of the invention can infect can be altered by using an envelope gene from another closely related virus. In other words, the host range of the CAEV vectors of the invention can be broadened by taking advantage of the ability of the envelope proteins of certain viruses to participate in the encapsidation of other viruses. Examples of retrovirus-derived env genes include, but are not limited to, the following: the G-protein of vesicular stomatitis virus (VSV-G), gibbon ape leukemia virus (GaLV), Rous sarcoma virus (RSV), Moloney murine leukemia virus (MoMuLV), mouse mammary tumor virus (MMTV), and human immunodeficiency virus (HIV). All of these viral envelope proteins efficiently form pseudotyped virions with the genome and matrix components of other viruses. As used herein, the term "pseudotype" means a viral particle that contains the nucleic acid of one virus and the envelope protein of another virus. In general, VSV-G or GaLV pseudotyped vectors have a very broad host range and can be pelleted to high titers by ultracentrifugation while still retaining high levels of infectivity (Burns et al., 1993).

Examples of a second packaging vector of the invention are shown in FIGS. 6A and 6B. FIG. 6A shows plasmid pHGVSV-G (SEQ ID NO: 74), a 7,623 bp plasmid comprising the HCMV MIEP, a β-globin intron region, the VSV-G coding region, the BGH polyadenylation signal, a Col E1 origin of replication, a neomycin resistance gene coding region, an ampicillin resistance gene coding region, and an F1 origin of replication. FIG. 6B shows plasmid pMYKEF1/env (SEQ ID NO: 72), a 7,579 bp plasmid comprising the MCMV MIEP, a human EF1-α intron region, the GaLV env coding region, the BGH polyadenylation signal, a Col E1 origin of replication, a neomycin resistance gene coding region, and an ampicillin resistance gene coding region.

In another embodiment of the invention, the packaging vector comprises a third polynucleotide encoding Rev. In infected cells, Rev binds to the Rev-response element (RRE) in viral transcripts, giving rise late in replication to the singly spliced or unspliced transcripts characteristic of the viral structural proteins. Rev thus mediates temporal regulation of viral gene expression. Because the splicing machinery of mammalian cells is coupled to the transport of mRNA from its site of synthesis in the nucleus to the cytoplasm, Rev also affects the transport of RRE-containing viral transcripts.

An example of a third packaging vector of the invention is shown in FIG. 5. FIG. 5 shows plasmid pHYK/rev (SEQ ID NO: 75), a 5,419 bp plasmid comprising the HCMV MIEP, the rev gene coding region, the BGH polyadenylation signal, a Col E1 origin of replication, a neomycin resistance gene coding region, and an ampicillin resistance gene coding region.

In yet another embodiment of the invention, the packaging vector comprises a fourth polynucleotide encoding Vif. Inclusion of Vif may be essential for virion infectivity and packaging, depending on the packaging cell line chosen.

An example of a fourth packaging vector of the invention is shown in FIG. 4. pHYK/vif (SEQ ID NO: 76) is a 5,729 bp plasmid comprising the HCMV MIEP, the vif gene coding region, the BGH polyadenylation signal, a Col E1 origin of replication, a neomycin resistance gene coding region, and an ampicillin resistance gene coding region.

When retroviral vector DNA is transfected into a cell, it may or may not integrate into the chromosomal DNA, and is then transcribed to produce full-length retroviral vector RNA containing the ψ sequence. Under these conditions, only the vector RNA is packaged into the viral capsid structures. These complete, but replication-defective, viral particles can then be used to deliver the retroviral vector to target cells with relatively high efficiency.

As used herein, the term "replication-defective" means a virus that is not capable of complete, effective replication, such that infectious virions are not produced (e.g., replication-defective lentiviral progeny). The term "replication-competent" means a wild-type virus or mutant virus that is capable of replication, such that viral replication can produce infectious virions (e.g., replication-competent lentiviral progeny).

Packaging may be inducible as well as non-inducible. In inducible packaging cells and packaging cell lines, CAEV particles are produced in response to at least one inducer. In a preferred embodiment using an inducible cell line, the inducer is Tat. In non-inducible packaging cell lines and packaging cells, no inducer is required for lentiviral particle production.

CAEV vector sequences

Functionally equivalent sequences of the invention also include various fragments of the CAEV genome that retain substantially the same function as the respective native sequence. Such fragments may comprise at least about 10 or 15 contiguous nucleotides, at least about 20 contiguous nucleotides, or at least about 24, 50, 60, 80, 100, 120, 140, 160, 180, 200, 220, 240, 260, 280, 300, 340, 360, 380, or more contiguous nucleotides, up to the full length, of the elements of a particular gene of interest. Such fragments may be obtained by using restriction enzymes to cleave the native viral genome; by synthesizing a nucleotide sequence from the native nucleotide sequence of the viral genome; or through the use of PCR technology (see in particular Mullis and Faloona, 1987, and Erlich, 1989). In addition, variants of the various vector components, such as those resulting from site-directed mutagenesis, are encompassed by the methods of the invention. Methods for determining functional equivalence are routine in the art.

"Variants" include substantially similar sequences. Thus, for nucleotide sequences or amino acid sequences, variants include sequences that are functionally equivalent to the various components of the viral vector system. Variant nucleotide sequences also include synthetically derived nucleotide sequences, such as those generated by site-directed mutagenesis, that still retain the function of the native sequence. In general, nucleotide sequence variants or amino acid sequence variants of the invention will have at least 70%, generally 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence homology to their respective native sequence.

Variants of the invention include polynucleotides (e.g., vectors) comprising, consisting essentially of, or consisting of sequences having at least 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% homology to the sequences of the vectors disclosed herein (SEQ ID NOs: 67-79).
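
As a rough illustration of how a candidate variant could be compared against the percentage thresholds recited above, the following sketch computes a percent identity over two pre-aligned sequences of equal length, with "-" marking a gap. Taking the full alignment length as the denominator is an assumption made for this sketch, since the specification does not fix one, and the function names are illustrative only.

```python
IDENTITY_THRESHOLDS = (70, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99)


def percent_identity(aligned_variant, aligned_reference):
    """Percent of alignment columns holding the same residue in both rows.

    Both inputs must already be aligned (equal length, '-' for gaps).
    Gap columns count toward the denominator but never as matches.
    """
    if len(aligned_variant) != len(aligned_reference):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_variant:
        raise ValueError("alignment is empty")
    pairs = zip(aligned_variant.upper(), aligned_reference.upper())
    matches = sum(1 for v, r in pairs if v == r and v != "-")
    return 100.0 * matches / len(aligned_variant)


def highest_threshold_met(identity):
    """Return the largest recited threshold that identity reaches, or None."""
    met = [t for t in IDENTITY_THRESHOLDS if identity >= t]
    return max(met) if met else None


if __name__ == "__main__":
    identity = percent_identity("ACG-ACGTAC", "ACGTACGTAC")
    print(identity, highest_threshold_met(identity))
```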

Those skilled in the art will recognize that many conservative variations of the disclosed nucleic acid constructs yield functionally identical constructs. A conservative variation of a particular nucleic acid sequence is a nucleic acid that encodes an identical or essentially identical amino acid sequence or, where the nucleic acid does not encode an amino acid sequence, an essentially identical sequence. Because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids encode any given polypeptide. For example, "silent substitutions" (i.e., substitutions in a nucleic acid sequence that do not alter the encoded polypeptide) are an implied feature of every nucleic acid sequence that encodes an amino acid sequence. Similarly, "conservative amino acid substitutions", in which one or a few amino acids in the amino acid sequence of a packaging or packageable construct are replaced with different amino acids having highly similar properties, are also readily identified as being highly similar to a disclosed construct. For example, the codons CGU, CGC, CGA, CGG, AGA, and AGG all encode the amino acid arginine. Thus, at every position where an arginine is specified by a codon, the codon can be altered to any of the corresponding codons described without altering the encoded polypeptide. Such nucleic acid variations are "silent variations", which are one species of "conservatively modified variations". Every nucleic acid sequence herein that encodes a polypeptide also describes every possible silent variation. Those skilled in the art will recognize that each codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine) can be modified by standard techniques to yield a functionally identical molecule. Accordingly, each "silent variation" of a nucleic acid that encodes a polypeptide is implicit in any described sequence. Furthermore, those skilled in the art will recognize that individual substitutions, deletions, or additions that alter, add, or delete a single amino acid or a small percentage of amino acids (typically less than 5%, more typically less than 1%) in an encoded sequence are "conservatively modified variations" where the alterations result in the substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are well known in the art. Each of the following six groups contains amino acids that are conservative substitutions for one another:

1) Alanine (A), Serine (S), Threonine (T);

2) Aspartic acid (D), Glutamic acid (E);

3) Asparagine (N), Glutamine (Q);

4) Arginine (R), Lysine (K);

5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and

6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W).

See also Creighton (1984) Proteins, W. H. Freeman and Company. Finally, the addition of sequences that do not alter the activity of a nucleic acid molecule, such as a non-functional sequence, is a conservative modification of the basic nucleic acid. Such conservatively substituted variations of each disclosed sequence are a feature of the present invention.
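
The arginine codon example and the six substitution groups listed above can be expressed as simple lookups, as in the following minimal sketch. It covers only the six arginine codons and the six groups named in the text; it is not a full genetic code table, and the function names are illustrative assumptions.

```python
# The six arginine codons named in the text (RNA alphabet).
ARGININE_CODONS = {"CGU", "CGC", "CGA", "CGG", "AGA", "AGG"}

# The six conservative substitution groups, by one-letter code.
CONSERVATIVE_GROUPS = [
    set("AST"),   # alanine, serine, threonine
    set("DE"),    # aspartic acid, glutamic acid
    set("NQ"),    # asparagine, glutamine
    set("RK"),    # arginine, lysine
    set("ILMV"),  # isoleucine, leucine, methionine, valine
    set("FYW"),   # phenylalanine, tyrosine, tryptophan
]


def _as_rna(codon):
    """Normalize a codon to upper-case RNA letters (T becomes U)."""
    return codon.upper().replace("T", "U")


def is_silent_arginine_swap(codon_a, codon_b):
    """True if both codons encode arginine, so the swap is silent."""
    return _as_rna(codon_a) in ARGININE_CODONS and _as_rna(codon_b) in ARGININE_CODONS


def same_conservative_group(residue_a, residue_b):
    """True if both residues fall in one of the six listed groups."""
    a, b = residue_a.upper(), residue_b.upper()
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


if __name__ == "__main__":
    print(is_silent_arginine_swap("CGU", "AGG"))
    print(same_conservative_group("I", "V"))
```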

With respect to the amino acid sequences of the various full-length or mature polypeptides used in the vector system of the invention, variants include polypeptides derived from the native polypeptide by deletion (so-called truncation) or addition of one or more amino acids at the N-terminal and/or C-terminal end of the native polypeptide; by deletion or addition of one or more amino acids at one or more sites in the native polypeptide; or by substitution of one or more amino acids at one or more sites in the native polypeptide. Such variants may result, for example, from genetic polymorphism or from human manipulation. Methods for such manipulations are generally known in the art.

Those skilled in the art will recognize many ways of generating alterations in a given nucleic acid construct. Such well-known methods include site-directed mutagenesis, PCR amplification using degenerate oligonucleotides, exposure of cells containing the nucleic acid to mutagenic agents or radiation, chemical synthesis of a desired oligonucleotide (e.g., in conjunction with ligation and/or cloning to generate large nucleic acids), and other well-known techniques (see Gillam and Smith, 1979; Roberts, Cheetham, and Rees, 1987; and Sambrook, Innis, Ausubel, Berger, Needham VanDevanter, and Mullis, all supra).

Variants of a native nucleic acid or native polypeptide have substantial homology to the native sequence or native polypeptide. Variants may differ by as few as 1 to 10 amino acid residues, such as 6 to 10, as few as 5, or as few as 4, 3, 2, or even 1 amino acid residue. Variants of a nucleic acid sequence may differ by as few as 1 to 30 nucleotides, such as 6 to 20, as few as 5, or as few as 4, 3, 2, or even 1 nucleotide.

"Sequence homology" means that the same nucleotides or amino acid residues are found within the variant sequence and a reference sequence when a specified, contiguous segment of the nucleotide sequence or amino acid sequence of the variant is aligned and compared to the nucleotide sequence or amino acid sequence of the reference sequence. Methods for sequence alignment and for determining homology between sequences are well known in the art. With respect to optimal alignment of two nucleotide sequences, the contiguous segment of the variant nucleotide sequence may have additional nucleotides or deleted nucleotides with respect to the reference nucleotide sequence. Likewise, for purposes of optimal alignment of two amino acid sequences, the contiguous segment of the variant amino acid sequence may have additional amino acid residues or deleted amino acid residues with respect to the reference amino acid sequence. The contiguous segment used for comparison to the reference nucleotide sequence or reference amino acid sequence will comprise at least 20 contiguous nucleotides or amino acid residues, and may comprise 30, 40, 50, 100, or more nucleotides or amino acid residues. Corrections for increased sequence homology associated with the inclusion of gaps in the variant's nucleotide sequence or amino acid sequence can be made by assigning gap penalties.

The percent homology between two sequences can be determined using a mathematical algorithm. For example, the percent homology of amino acid sequences can be determined using the Smith-Waterman homology search algorithm with an affine 6 gap search, a gap open penalty of 12, a gap extension penalty of 2, and BLOSUM matrix 62. The percent homology of nucleotide sequences can be determined using the Smith-Waterman homology search algorithm with a gap open penalty of 25 and a gap extension penalty of 5. Such determinations of sequence homology can be performed using, for example, the DeCypher Hardware Accelerator from TimeLogic Version G. The Smith-Waterman homology search algorithm is taught by Smith and Waterman, which is cited herein.

Alternatively, the alignment program GCG Gap (Wisconsin Genetic Computing Group, Suite Version 10.1) can be used with default parameters. The GCG Gap program applies the Needleman and Wunsch algorithm and can be used for nucleotide sequence alignment with a gap open penalty of 3 and a gap extension penalty of 1.

Another preferred, non-limiting example of a mathematical algorithm used for the comparison of two sequences is the algorithm of Karlin and Altschul (1990), as modified by Karlin and Altschul (1993). This algorithm is incorporated into the NBLAST and XBLAST programs of Altschul et al. (1990). BLAST nucleotide searches can be performed with the NBLAST program, score = 100, wordlength = 12, to obtain nucleotide sequences with sufficient sequence homology. BLAST protein searches can be performed with the XBLAST program, score = 50, wordlength = 3, to obtain amino acid sequences with sufficient sequence homology. To obtain gapped alignments for comparison purposes, Gapped BLAST can be used as described in Altschul et al. (1997). Alternatively, PSI-BLAST can be used to perform an iterated search that detects distant relationships between molecules (see Altschul et al., 1997, above). When using the BLAST, Gapped BLAST, and PSI-BLAST programs, the default parameters of the respective programs (e.g., XBLAST and NBLAST) can be used (see http://www.ncbi.nlm.nih.gov).

Another non-limiting example of a mathematical algorithm used for the comparison of sequences is the algorithm of Myers and Miller (1988) CABIOS 4:11-17. This algorithm is incorporated into the ALIGN program (version 2.0), which is part of the GCG sequence alignment software package. When the ALIGN program is used for comparing amino acid sequences, a PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be used. The percent homology of amino acid sequences can also be determined using VectorNTI (Informax, USA).

One of skill in the art can select a desired nucleic acid of the invention based on the sequences provided and on what is generally known in the art regarding CAEV. The life cycle, genomic organization, developmental regulation, and associated molecular biology of lentiviruses have been the focus of intensive research over the past decade. The specific effects of many mutations in many lentiviral genomes are known. In addition, the nucleic acid sequence diversity of several CAEV strains is known. Furthermore, general knowledge regarding the properties of proteins and nucleic acids allows one of skill in the art to select appropriate sequences with activity similar or equivalent to that of the nucleic acids and polypeptides disclosed in the Sequence Listing herein.

Finally, most modifications to nucleic acids are evaluated by routine screening techniques in suitable assays for the desired characteristic. For example, changes in the immunological character of an encoded polypeptide can be detected by an appropriate immunological assay. Modifications of other properties, such as nucleic acid hybridization to a complementary nucleic acid, redox or thermal stability of the encoded protein, hydrophobicity, susceptibility to proteolysis, or the tendency to aggregate, are all assayed according to standard techniques.

Polynucleotide of interest

As will be appreciated by one of skill in the art, the nucleotide sequence of the inserted polynucleotide of interest can be any nucleotide sequence. For example, the polynucleotide sequence can be a reporter gene sequence or a selectable marker gene sequence. A reporter gene sequence, as used herein, is any gene sequence that, when expressed, results in the production of a protein whose presence or activity can be monitored. Representative examples of suitable reporter genes include galactokinase, β-galactosidase, chloramphenicol acetyltransferase, β-lactamase, green fluorescent protein, and enhanced green fluorescent protein. Alternatively, the reporter gene can be any gene sequence whose expression produces a gene product that affects cell physiology. The polynucleotide sequences of the invention may comprise one or more gene sequences that already possess one or more promoters, initiation sequences, or processing sequences.

A selectable marker gene sequence is any gene sequence capable of expressing a protein whose presence permits the selective propagation of a cell that contains it. Examples of selectable marker genes include gene sequences that confer host resistance to antibiotics (e.g., puromycin, hygromycin, neomycin, zeocin, and the like), that confer host resistance to amino acid analogs, or that permit the growth of bacteria on additional carbon sources or under otherwise impermissible culture conditions.

The reporter or selectable marker gene sequence is sufficient to permit recognition or selection of the vector in normal cells. In one embodiment of the invention, the reporter gene sequence encodes an enzyme or other protein that is normally absent from mammalian cells, so that its presence can definitively establish the presence of the vector in such cells.

The transfer vectors of the invention can additionally be used to incorporate heterologous nucleic acids or polynucleotides into viral particles, thereby providing a means of increasing the number of infected host cells that contain the heterologous nucleic acid. Incorporation of the heterologous polynucleotide facilitates replication of the heterologous nucleic acid within the viral particle and the subsequent production of heterologous protein. A heterologous protein is defined herein as a protein, or fragment thereof, of which all or a portion is not expressed by the host cell. A nucleic acid or gene sequence is said to be heterologous if it is not naturally present in the wild type of the viral vector used to deliver the gene into the cell. The term heterologous nucleic acid sequence or polynucleotide sequence, as used herein, refers to a nucleic acid molecule, preferably DNA.

The polynucleotide sequence or heterologous polynucleotide sequence may also comprise the coding sequence of a desired product, such as a suitable biologically active protein or polypeptide, an immunogenic or antigenic protein or polypeptide, or a therapeutically active protein or polypeptide. The polypeptide may supplement deficient or nonexistent expression of an endogenous protein in a host cell. Such gene sequences may be derived from a variety of sources, including DNA, cDNA, synthetic DNA, RNA, or combinations thereof. Such gene sequences may also comprise genomic DNA, which may or may not include naturally occurring introns. Moreover, such genomic DNA may be obtained in association with promoter sequences or polyadenylation sequences. The gene sequences of the invention are preferably cDNA. Genomic DNA or cDNA may be obtained in any of several ways. Genomic DNA can be extracted and purified from suitable cells by methods known in the art. Alternatively, mRNA can be isolated from a cell and used to prepare cDNA by reverse transcription or other means. The polynucleotide sequence may also comprise a sequence complementary to an RNA sequence, such as an antisense RNA sequence; such an antisense sequence can be administered to an individual to inhibit expression of the complementary polynucleotide in the cells of the individual.

Expression of the heterologous gene may provide an immunogenic or antigenic protein or polypeptide that elicits an antibody response. The antibodies thus raised can be collected from a body fluid of the animal, such as blood, serum, or ascites.

The heterologous gene can also be any nucleic acid of interest that can be transcribed. Generally, the foreign gene encodes a polypeptide. Preferably, the polypeptide has some therapeutic benefit. The polypeptide may supplement deficient or nonexistent expression of an endogenous protein in a host cell. The polypeptide may confer new properties on the host cell, as a chimeric signaling receptor does (see U.S. Pat. No. 5,359,046). One of skill in the art can determine the suitability of a foreign gene using the techniques taught herein and known in the art. For example, the skilled artisan would know whether a foreign gene is of a suitable size for encapsidation and whether the foreign gene product is expressed properly.

The particular heterologous protein that can be employed is not critical to the present invention.

Specific examples of such heterologous proteins that can be used in the present invention include dystrophin (Hoffman, Brown, and Kunkel, 1987), coagulation factor VIII (Wion et al., 1985), cystic fibrosis transmembrane regulator protein (CFTR) (Anderson et al., 1991; Crawford, 1991), ornithine transcarbamylase (OTC) (Murakami et al., 1988), and α1-antitrypsin (Fagerhol and Cox, 1981).

Genes encoding many heterologous proteins are known in the art and can be cloned from genomic or cDNA libraries (Sambrook et al., supra). Examples of such genes include the dystrophin gene (Lee et al., 1991), the coagulation factor VIII gene (Toole et al., 1984), the CFTR gene (Rommens et al., 1989; Riordan, 1989), the OTC gene (Horwich et al., 1984), and the α1-antitrypsin gene (Lemarchand et al., 1992).

In addition, genes encoding heterologous proteins such as Rb for the treatment of vascular proliferative disorders such as atherosclerosis (Chang et al., 1995), and p53 for the treatment of cancer (Wills et al., 1994; Clayman, 1995) and HIV disease (Bridges and Sarver, 1995), can be used in the present invention.

The vector need not always encode a functional heterologous gene product; it may also encode a partial gene product that acts as an inhibitor of a eukaryotic enzyme (Warne, Viciana, and Downward, 1993; Wang, 1991).

It may be desirable to modulate the expression of a gene regulatory molecule in a cell by introducing a molecule by the method of the invention. The term "modulate" means suppressing the expression of a gene when it is over-expressed, or augmenting its expression when it is under-expressed. Where a cell proliferative disorder is associated with the expression of a gene, nucleic acid sequences that interfere with expression of the gene at the translational level can be used. For example, antisense nucleic acids, ribozymes, or triplex agents can be used to block transcription or translation of a specific mRNA, either by masking the mRNA with an antisense nucleic acid or triplex agent or by cleaving it with a ribozyme.

Antisense nucleic acids are DNA or RNA molecules that are complementary to at least a portion of a specific mRNA molecule. In the cell, the antisense nucleic acid hybridizes to the corresponding mRNA to form a double-stranded molecule. Because the cell does not translate double-stranded mRNA, the antisense nucleic acid interferes with translation of the mRNA. Antisense oligomers of about 15 or more nucleotides are preferred because they are easily synthesized and are less likely than larger molecules to cause problems when introduced into the target cell. The use of antisense methods to inhibit the in vitro translation of genes is known in the art (Marcus-Sekura, 1988).
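The complementarity described above can be sketched in Python as follows. The example derives a DNA antisense oligomer of 15 nucleotides as the reverse complement of a chosen window of an mRNA. The function name `antisense_oligo`, the toy mRNA sequence, and the choice of a DNA rather than an RNA oligomer are assumptions made for illustration; the sketch does not evaluate whether a given oligomer would be effective.

```python
# RNA base in the target -> complementary DNA base in the antisense oligomer.
RNA_TO_ANTISENSE_DNA = {"A": "T", "U": "A", "G": "C", "C": "G"}


def antisense_oligo(mrna: str, start: int, length: int = 15) -> str:
    """Return the DNA oligomer (written 5'->3') complementary to
    mrna[start:start + length], where mrna is written 5'->3'."""
    if start < 0:
        raise ValueError("start must be non-negative")
    target = mrna.upper()[start:start + length]
    if len(target) < length:
        raise ValueError("target window runs past the end of the mRNA")
    # Complement each base and reverse, so the result also reads 5'->3'.
    return "".join(RNA_TO_ANTISENSE_DNA[base] for base in reversed(target))


if __name__ == "__main__":
    mrna = "AUGGCCAAGUUCGACUAA"  # hypothetical toy sequence
    print(antisense_oligo(mrna, start=0))  # 15-mer against the 5' end
```

Reversing the target before complementing keeps both strands in the conventional 5' to 3' orientation, which is how oligomer sequences are normally written when ordered for synthesis.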

Antisense nucleic acids can be used to block the expression of a mutant protein or a dominantly active gene product, such as the amyloid precursor protein that accumulates in Alzheimer's disease. Such methods are also useful for the treatment of Huntington's disease, hereditary Parkinson's disease, and other diseases. Antisense nucleic acids are also useful for inhibiting the expression of proteins associated with toxicity.

Oligonucleotides can be used to stall transcription through a mechanism known as the triplex strategy, in which the oligomer winds around double-helical DNA to form a three-stranded helix. Triplex compounds can therefore be designed to recognize a unique site on a chosen gene (Maher, Wold, and Dervan, 1991; Helene, 1991).

Ribozymes are RNA molecules that can specifically cleave other single-stranded RNA in a manner analogous to DNA restriction endonucleases. By modifying the nucleic acid sequences that encode these RNAs, it is possible to engineer molecules that recognize and cleave specific nucleotide sequences in an RNA molecule (Cech, 1988). A major advantage of this approach is that only mRNAs with particular sequences are inactivated.

It may be desirable to transfer a nucleic acid encoding a biological response modifier. This category includes immunopotentiating agents, for example nucleic acids encoding a number of the cytokines classified as "interleukins", such as interleukins 1 through 12. Also included in this category, although not necessarily working according to the same mechanisms, are interferons, in particular gamma interferon (γ-IFN), tumor necrosis factor (TNF), and granulocyte-macrophage colony stimulating factor (GM-CSF). It may be desirable to deliver such nucleic acids to bone marrow cells or macrophages to treat inborn enzyme deficiencies or immune defects. Nucleic acids encoding growth factors, toxic peptides, ligands, receptors, or other physiologically important proteins can also be introduced into specific non-dividing cells.

Thus, the recombinant CAEV vector system of the invention can be used to treat HIV-infected cells (e.g., T cells or macrophages) with an anti-HIV molecule. In addition, respiratory epithelium can be infected with a recombinant lentivirus of the invention carrying the gene for the cystic fibrosis transmembrane conductance regulator (CFTR), for example for the treatment of cystic fibrosis.

Thus, the recombinant CAEV vector system of the invention can be used to treat a variety of human diseases. Representative examples of human diseases to which the CAEV vector system is applicable include, but are not limited to, Alzheimer's disease, Parkinson's disease, amyotrophic lateral sclerosis, Huntington's disease, beta-thalassemia, retinitis pigmentosa, mucopolysaccharide diseases, leukodystrophy diseases, X-linked SCID, phenylketonuria, tyrosinemia, hemophilia A and B, Wilson's disease, LDL receptor deficiency, human immunodeficiency, and Duchenne's dystrophy.

CAEV Vector Particles

In the methods of the invention, infectious, replication-defective CAEV vector particles can be prepared according to the methods disclosed herein in combination with techniques known in the art. The method comprises transfecting lentivirus-permissive cells with the vector expression system of the invention, producing CAEV-derived particles in the transfected cells, and collecting the viral particles from the cells.

The term "transfection" refers to the introduction of foreign DNA into eukaryotic cells. Transfection can be accomplished by a variety of methods known in the art, including, but not limited to, calcium phosphate-DNA co-precipitation, DEAE-dextran-mediated transfection, polybrene-mediated transfection, electroporation, microinjection, liposome fusion, lipofectamine, and protoplast fusion.

As used herein, the term "transduction" refers to the delivery of a gene using a viral or retroviral vector particle by means of infection rather than by transfection. In some embodiments, the retroviral vector is transduced. Thus, a "transduced gene" is a gene that has been introduced into a cell through lentiviral or vector infection and provirus integration. In certain embodiments, CAEV viral vector particles transduce genes into "target cells" or host cells.

The step of promoting the production of infectious viral particles in the cells can also be carried out using conventional techniques, such as standard cell culture growth techniques.

The step of collecting the infectious viral particles can be carried out using conventional techniques. For example, the infectious particles can be collected from the cell culture supernatant, as is known in the art. Optionally, the collected viral particles can be purified as needed. Suitable purification techniques are known to those of skill in the art.

If desired by one of skill in the art, CAEV stock solutions can be prepared using the vectors and methods of the invention. Methods of preparing viral stock solutions are known in the art and are described, for example, in Soneoka et al. (1995) and Landau and Littman (1992). In the method of producing a stock solution of the invention, lentivirus-permissive cells are transfected with the vector system of the invention. The cells are then grown under suitable cell culture conditions, and the CAEV particles are collected from the cell culture as described above. Suitable permissive cell lines include, but are not limited to, the human cell lines 293, 293T, and HeLa, the monkey cell line Vero, and the goat cell lines GSM and Ch1Es.

The vectors of the invention are also useful for preparing stable packaging cells (i.e., cells that stably express CAEV structural proteins but cannot by themselves generate infectious viral particles) and virus producing cells (VPCs). Methods for preparing packaging cells that express retroviral proteins are known in the art and are exemplified by the methods set forth in, for example, U.S. Pat. No. 4,650,764 to Temin et al., which is incorporated herein in its entirety. Within the scope of the invention, a packaging cell comprises a lentivirus-permissive host cell containing a CAEV nucleic acid sequence from at least one CAEV packaging vector described herein, wherein the nucleic acid sequence is packaging-signal defective, so that the cell can produce at least one CAEV structural protein but cannot produce replication-competent infectious virus.

A packaging cell can be made by transfecting a CAEV-permissive host cell (e.g., human embryonic kidney 293 or 293T cells) with a suitable CAEV nucleic acid sequence according to known methods, as described above. The resulting packaging cell can thus express and produce at least one CAEV structural protein. However, the packaging cell is still unable to produce recombinant CAEV virus. The packaging cell may then be transfected with a further nucleic acid sequence, namely a transfer vector comprising the heterologous gene of interest and an appropriate packaging signal. Once transfected with the additional sequence or sequences, the packaging cell can be used to provide stocks of CAEV virus that contain the heterologous gene but are themselves replication-incompetent. The resulting virus producing cell (VPC) can thus produce infectious viral particles containing the heterologous gene of interest.

Gene transfer and therapy

Many human genetic diseases that result from a lesion in a single gene are prime candidates for gene therapy. As used herein, the term "gene therapy" or "gene transfer" is defined as the insertion of a gene into cells for the purpose of clinical treatment. Many applications of gene therapy, particularly via stem cell gene insertion, are known and have been extensively reviewed. Target cells for ex vivo or in vivo therapeutic gene transfer include, but are not limited to, hematopoietic stem cells, lymphocytes, vascular endothelial cells, respiratory epithelial cells, keratinocytes, skeletal and muscle cells, liver cells, neurons, and cancer cells.

The gene transfer technology of the invention can also be used to identify peptide processing steps and the functional domains of various proteins. Cloned cDNA or genomic sequences for a protein can be introduced into different target cells, ex vivo or in vivo, in order to study cell-specific differences in processing and cellular fate. By placing the coding sequence under the control of a strong promoter, a substantial amount of the desired protein can be made. Furthermore, the specific residues involved in protein processing, intracellular sorting, or biological activity can be determined by mutational change of discrete residues in the coding sequence.

The gene transfer technology of the invention can also be applied to provide a means of controlling the expression of a protein and of assessing its capacity to modulate cellular events. Some functions of proteins, such as their role in differentiation, can be studied in tissue culture, whereas others will require reintroduction into in vivo systems at different times in development in order to monitor changes in the relevant properties.

Gene transfer provides a means of studying the nucleic acid sequences and cellular factors that regulate the expression of specific genes. One example of such a study fuses the regulatory elements under investigation to a reporter gene and then assays expression of the reporter gene.

Gene transfer also has substantial practical use in understanding disease states and in providing therapy for them. There are a number of inherited diseases in which the defective genes are known and have been cloned. In some cases, the function of these cloned genes is known. In general, such disease states fall into two classes: deficiency states, usually of enzymes, which are generally inherited in a recessive manner; and unbalanced states, at least sometimes involving regulatory or structural proteins, which are inherited in a dominant manner. For deficiency state diseases, gene transfer can be used to bring a normal gene into affected tissues for replacement therapy, as well as to create animal models of the disease using antisense mutations. For unbalanced disease states, gene transfer can be used to create the disease state in a model system, which can then be used in efforts to counteract the disease state. Thus, the methods of the invention permit the treatment of genetic diseases. As used herein, a disease state is treated by partially or wholly remedying the deficiency or imbalance that causes the disease or makes it more severe. Site-specific integration of nucleic acid sequences to cause mutations or to correct defects can also be used.

The methods of the present invention may also be useful for transplantation of neuronal cells, glial cells, fibroblasts, or mesenchymal cells, or for "grafting," which involves ex vivo transplantation of cells infected with the recombinant lentivirus of the invention, in vivo infection into the central nervous system or the ventricular cavities, or subdural infection onto the surface of the host brain. Such grafting methods are known to those skilled in the art and are described in Neural Grafting in the Mammalian CNS, Bjorklund & Stenevi, eds. (1985).

For diseases caused by deficiency of a protein product, gene transfer can be used to introduce a normal gene into the affected tissue for replacement therapy, as well as to create animal models of the disease using antisense mutations. For example, a nucleic acid encoding factor VIII or factor IX can be incorporated into CAEV particles for infection of muscle, spleen, or liver cells.

Many applications of gene therapy, in particular gene therapy by gene insertion into stem cells, are known and have been extensively reviewed. The term "stem cell" as used herein includes, but is not limited to, hematopoietic stem cells, neural stem cells, mesenchymal (in particular, muscle) stem cells, and liver stem cells. Stem cells are capable of repopulating tissues in vivo. Hematopoietic stem cells are progenitor cells derived from human hematopoietic cells at an early stage of development.

Gene therapy using hematopoietic stem cells is also useful for treating genetic abnormalities in lymphoid or myeloid cells that generally result in production of a defective protein or in abnormal levels of gene expression.

For many of these diseases, introduction of a normal copy or a functional homolog of the defective gene, and production of even small amounts of the missing gene product, has a beneficial effect. At the same time, overexpression of the gene product is not expected to have deleterious effects. Non-limiting examples of diseases for which gene transfer into hematopoietic stem cells is highly useful are given below. Such diseases generally include bone marrow disorders, red blood cell defects, metabolic disorders, and the like. Hematopoietic stem cell gene therapy is beneficial for treating genetic disorders of blood cells in which a globin gene or a coagulation factor gene (e.g., the factor IX and factor X genes) is defective, such as α- and β-thalassemia, sickle cell anemia, and hemophilia A and B. Another preferred example is the treatment of severe combined immunodeficiency disease (SCID), in which patients lack the enzyme adenosine deaminase (ADA), which helps eliminate certain byproducts that are toxic to T and B lymphocytes, and are therefore defenseless against infection. Such patients are ideal candidates for gene therapy in which the ADA gene is introduced into their hematopoietic stem cells rather than into their lymphocytes, as was done in the past. Other diseases include chronic granulomatosis, in which neutrophils express a defective cytochrome b, and Gaucher disease, which results from abnormal production of glucocerebrosidase in macrophages.

In addition, neurodegenerative diseases such as Parkinson's disease are good targets for gene therapy in which the GDNF (glial cell line-derived neurotrophic factor) gene is introduced into the striatum and the substantia nigra (Kordower et al., 2000).

Gene therapy also encompasses methods of treating various forms of cancer. A CAEV vector may contain, for example, a gene encoding a toxin or an apoptosis inducer that is effective in specifically killing cancer cells. Tumor cell-specific killing can be achieved by introducing a suicide gene into cancerous hematopoietic cells under conditions in which only the tumor cells express the suicide gene. The suicide gene product generally confers lethal sensitivity on the cell by converting a nontoxic drug into a toxic derivative. For example, the enzyme cytosine deaminase converts the nontoxic substance 5-fluorocytosine into the toxic derivative 5-fluorouracil (Mullen, Kilstrup, and Blaese, 1992). Tumor-specific lymphocytes can be genetically modified, for example, to deliver a gene product with anti-tumor activity site-specifically to the tumor and thereby avoid the toxicity associated with systemic delivery of such gene products. Gene therapy approaches can also be applied to bone marrow cells that are resistant to the toxic effects of chemotherapy.

Gene therapy can also be used to prevent or treat viral infections such as HIV and HTLV-1 infection. For example, hematopoietic stem cells can be genetically modified to confer resistance to HIV infection. One approach specifically inhibits viral gene expression by using antisense RNA or by disrupting existing viral regulatory pathways. Antisense RNAs complementary to retroviral RNA have been shown to inhibit the replication of several retroviruses (To, Booth, and Neiman, 1986), including HIV (Rhodes and James, 1991) and HTLV-1 (von Ruden and Gilboa, 1989).

Another area in which gene therapy in hematopoietic stem cells can be used is the alleviation of autoimmune disease. The therapeutic gene may encode, for example, a B cell or T cell signaling molecule capable of reconstituting the normal apoptotic signals that lead to the death and elimination of autoreactive cells.

Ex vivo cell transformation for diagnostics, research, or gene therapy (e.g., via re-infusion of the transformed cells into the host) is known to those skilled in the art. In one embodiment of the invention, cells are isolated from the subject organism, transfected with a vector of the invention encoding a polypeptide of interest, and re-infused into the subject organism (e.g., a patient).

Various cell types suitable for ex vivo transformation are known to those skilled in the art. Particularly preferred cells are the stem cells described above (see, e.g., Freshney (1994) Culture of Animal Cells, a Manual of Basic Technique, third edition, Wiley-Liss, New York, and the references cited therein for a discussion of how to isolate and culture cells from patients). Transformed cells are cultured by methods known in the art (see Kuchler (1977) Biochemical Methods in Cell Culture and Virology, Kuchler, R. J., Dowden, Hutchinson and Ross, Inc., and Atlas (1993) CRC Handbook of Microbiological Media (Parks ed.) CRC Press, Boca Raton, Fl.). Mammalian cell systems will often be in the form of cell monolayers, although mammalian cell suspensions may also be used. Alternatively, the cells may be derived from those stored in a cell bank (e.g., a blood bank). Preferred examples of mammalian cell lines include the HEC-1-B cell line, VERO and HeLa cells, Chinese hamster ovary (CHO) cell lines, and the WI-38, BHK, Cos-7, and MDCK cell lines (see, e.g., Freshney, supra).

T cells or B cells may also be used in some ex vivo gene transfer procedures. Several techniques for isolating T and B cells are known. Expression of surface markers facilitates the isolation and purification of these cells.

In summary, the viral vectors of the present invention can be used to stably transduce dividing or non-dividing cells and to stably express a heterologous gene. Using such a vector system, it is possible to introduce into dividing or non-dividing cells genes encoding proteins that can affect the physiology of those cells. Thus, the vectors of the present invention are useful in gene therapy for disease or for experimental modification of cell physiology.

Kits

Another object of the present invention is to provide a kit or drug delivery system comprising the vectors for use in the methods described herein. All of the essential materials and reagents required for administration of the targeted retroviral particles may be included in the kit (e.g., packaging cell constructs or cell lines). The components of the kit may be provided in a variety of formulations. One or more CAEV particles may be formulated with one or more agents (e.g., a chemotherapeutic agent) into a single pharmaceutically acceptable composition or into separate pharmaceutically acceptable compositions.

The components of such kits or drug delivery systems may also be provided in dried or lyophilized form. When reagents or components are provided in dried form, they are generally reconstituted by adding a suitable solvent, which may be provided in a separate container means. The kits of the invention also include instructions regarding dosage and/or administration information for the targeted CAEV particles. The kits or drug delivery systems of the invention will also typically include a means for holding the vials in close confinement for commercial sale, such as injection-molded or blow-molded plastic containers in which the vials are retained. Irrespective of the number or type of containers, the kits may also comprise, or be packaged with, an instrument that assists with the injection, administration, or placement of the final composition within the body of the subject. Such an instrument may be an applicator, an inhaler, a syringe, a pipette, forceps, a measuring spoon, an eye dropper, or any such medically approved delivery vehicle.

The following examples illustrate various aspects of the invention but do not limit the methods of the invention.

Examples

The following examples present specific embodiments and aspects of the invention and do not limit the scope of the invention.

The following examples demonstrate the finding that the recombinant CAEV-based lentiviral vector system of the present invention is as effective in expression as the known HIV-1-based lentiviral systems. The examples show that the levels of genomic RNA transcription, encapsidation, translation, reverse transcription, and integration in the CAEV-based vector particle production system of the invention are comparable to those of the HIV-1-based lentiviral vector system (Naldini et al., 1996), which has long been recognized as a highly efficient gene transfer system.

The present invention is the first report of the construction of a high-titer CAEV-based vector system that is based on co-transfection of at least three plasmids and that requires expression of the gag-pol and env genes and, optionally, the rev gene.

Materials and Methods

Plasmid Construction

Parent plasmids. The parent plasmids from which the CAEV vectors of the present invention were derived are plasmid pWTE-BM and plasmid pCAEV-LTR, provided by Dr. Marie Suzan (Institut National de la Sante et de la Recherche Medicale, INSERM, France). Plasmid pWTE-BM contains the full-length genomic CAEV cDNA, except for a 0.4 kb Hind III fragment comprising the env, rev, and U3 regions, together with a stuffer fragment of 1337 base pairs. Plasmid pCAEV-LTR contains the 0.4 kb Hind III fragment that is absent from pWTE-BM (Saltarelli et al., 1990; Saltarelli, 1993). Neither vector can produce wild-type virus.

CAEV gag-pol expression vectors (pMGP/RRE (SEQ ID NO: 77) and pMGP/REV/RRE). Plasmid pMGP/RRE (SEQ ID NO: 77) is a gag-pol expression plasmid derived from pWTE-BM (shown in FIG. 2A). It contains the strong, heterologous MCMV major immediate-early promoter (MCMV MIEP), the gag-pol gene, and the rev response element (RRE). It also encodes a neomycin resistance gene as an antibiotic selection marker. To construct this plasmid, the gag-pol gene fragment from pWTE-BM (nucleotides 512 to 5046 of the CAEV genome) was subcloned into the pGL2-Basic cloning vector (Promega, WI, USA) through several PCR and subcloning steps using standard methods. The MCMV MIEP fragment was excised from plasmid pMYK (Kim et al., 2002) and inserted upstream of the gag gene, and the RRE region (nucleotides 7824 to 8183 or nucleotides 7849 to 8150 of the CAEV genome) was inserted downstream of the pol gene. pMGP/REV/RRE is a second gag-pol expression plasmid, which additionally contains the CAEV rev gene (shown in FIG. 2B). In addition, the major splice donor site of CAEV (nucleotides 330 to 346 of the CAEV genome) was inserted downstream of the MCMV promoter.

Transfer vectors (pCAH/SINd series). The plasmids of the pCAH/SINd series (shown in FIGS. 3A-3H) (SEQ ID NOs: 67-71, 73, 78, and 79) were constructed to identify the optimal packaging sequence for the design of the transfer vectors of the present invention. Each plasmid in the series was designed to contain the 5' untranslated region and a different length of the beginning of the gag-coding region, so that the effects of the various lengths could be compared side by side. To address safety concerns in particular, these plasmids were designed as self-inactivating (SIN) vectors in which the U3 region of the 3' LTR is deleted. The U3 region of the 5' LTR was replaced with HCMV MIEP so that vector RNA is expressed at high levels from the transfer vector in the absence of the trans-acting factor tat. In addition, all known cis-acting sequence elements required for polyadenylation, RNA transport, reverse transcription, and integration were included in this series of transfer vectors.

The plasmids of the pCAH/SINd series (SEQ ID NOs: 67-71, 73, 78, and 79) were constructed as follows. pCAH/SINd (a PBS-deficient negative control vector) (SEQ ID NO: 73) (FIG. 3A) was designed to contain only the 5' untranslated sequence within the 5' LTR (the R and U5 regions) (nucleotides 1 to 163 of the CAEV genome). pCAH/SINd0 (SEQ ID NO: 67) (FIG. 3C) was designed to contain the entire 5' untranslated region (nucleotides 1 to 511 of the CAEV genome). pCAH/SINd1 (SEQ ID NO: 68) (FIG. 3C) was designed to contain the entire 5' untranslated region and a 327 bp fragment of the gag gene carrying point mutations (nucleotides 1 to 839 of the CAEV genome). pCAH/SINd2 (SEQ ID NO: 69) (FIG. 3D) was designed to contain the entire 5' untranslated region and a 612 bp fragment of the gag gene carrying point mutations (nucleotides 1 to 1124 of the CAEV genome). Plasmid pCAH/SINd3 (SEQ ID NO: 70) (FIG. 3E) was designed to contain the entire 5' untranslated region and a 908 bp fragment of the gag gene carrying point mutations (nucleotides 1 to 1420 of the CAEV genome). Plasmid pCAH/SINd4 (SEQ ID NO: 71) (FIG. 3F) was designed to contain the entire 5' untranslated region and a 1,198 bp fragment of the gag gene carrying point mutations (nucleotides 1 to 1710 of the CAEV genome). pCAH/SINd1/hlacZ (SEQ ID NO: 78) (FIG. 3G) was constructed by inserting an expression cassette consisting of HCMV MIEP and the lacZ gene into pCAH/SINd1 (SEQ ID NO: 68). Plasmid pCAH/SINd60/hlacZ (SEQ ID NO: 78) (FIG. 3H) has the same design as pCAH/SINd1 (SEQ ID NO: 68) except for the length of the gag sequence: it contains the first 60 bp of the gag gene, carrying point mutations (nucleotides 1 to 569 of CAEV).

CAEV vif expression vector (pHYK/vif) (SEQ ID NO: 76). The vif gene (nucleotides 5006 to 5695 of the CAEV genome), which is known to be required for rapid and efficient viral replication, was cloned into the eukaryotic expression vector pHYK (Kim et al., 2002) (FIG. 4).

CAEV rev expression vector (pHYK/rev) (SEQ ID NO: 75). The rev gene, which regulates viral gene expression at the post-transcriptional level through its interaction with the RRE, consists of two exons (the first exon is nucleotides 6,012 to 6,123 of the CAEV genome, and the second exon is nucleotides 8,514 to 8,803 of the CAEV genome). The Rev/RRE system promotes nuclear export of unspliced RNA and is known to be essential for lentiviral replication. The full-length cDNA of the rev gene was synthesized by RT-PCR and subcloned into the pHYK vector (FIG. 5).
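The two-exon layout of rev described above can be illustrated with a short sketch. It assumes the exon coordinates are 1-based and inclusive, and it uses a placeholder string of `N` characters rather than any real CAEV sequence; the function names are hypothetical and introduced only for this illustration.

```python
# Sketch only: joins exons given as 1-based, inclusive genome coordinates.
# The coordinates come from the text; the "genome" is a placeholder, not CAEV sequence.

REV_EXONS = [(6012, 6123), (8514, 8803)]  # (start, end) of rev exon 1 and exon 2


def spliced_length(exons):
    """Total length of the joined exons, counting both end coordinates."""
    return sum(end - start + 1 for start, end in exons)


def join_exons(genome, exons):
    """Concatenate exon sequences taken from a genome string."""
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return "".join(genome[start - 1:end] for start, end in exons)


placeholder_genome = "N" * 9000  # long enough to cover the stated coordinates
rev_cdna = join_exons(placeholder_genome, REV_EXONS)
print(spliced_length(REV_EXONS), len(rev_cdna))
```

Under the inclusive-coordinate assumption, the two exons are 112 and 290 nucleotides long, so the joined sequence is 402 nucleotides, a whole number of codons.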

Viral envelope gene expression vectors. The envelope gene expression vectors used herein are plasmid pHGVSV-G (SEQ ID NO: 74) and plasmid pMYKEF1/env (SEQ ID NO: 72) (FIGS. 6A and 6B). Plasmid pHGVSV-G (SEQ ID NO: 74) was designed to express the vesicular stomatitis virus G (VSV-G) glycoprotein and contains, as its promoter, HCMV MIEP with a β-globin intron. pMYKEF-1/env (SEQ ID NO: 72) was designed to express the gibbon ape leukemia virus (GaLV) envelope protein and contains, as its promoter, MCMV MIEP with eukaryotic elongation factor-1α.

MuLV- and HIV-1-based plasmids. As control vector systems, the vectors pMFG/lac/Zpuro and pHR/lacZ were used; these are lacZ-containing retroviral vectors derived from murine leukemia virus (MuLV) (Kim et al., 1997) and human immunodeficiency virus type 1 (HIV-1) (Naldini et al., 1996), respectively. pEQPAM3 (Persons et al., 1998) and pCMVΔR8-2 were used as the packaging plasmids for the MuLV and HIV-1 vector systems, respectively. The HIV-1 packaging plasmid pCMVΔR8-2 is identical to pCMVΔR9 (Naldini et al., 1996) except that it encodes the HIV-1 vpu gene and that a 1.3-kb BglII fragment is deleted from the env gene.

Vector Particle Production

Pseudotyped CAEV-based lentiviral vector particles were produced by liposome-mediated transient transfection of three or more plasmids into 293T cells that had been seeded at 5×10⁵ cells per 6-well culture dish one day before transfection. Three-plasmid co-transfection was performed with the gag-pol expression plasmid, the transfer vector plasmid, and the env-encoding plasmid at a molar ratio of 1:1:1. Four-plasmid co-transfection was performed with the gag-pol expression plasmid, the transfer vector plasmid, the env-encoding plasmid, and the rev expression plasmid at a molar ratio of 3:3:3:1. Five-plasmid co-transfection was performed with the gag-pol expression plasmid, the transfer vector plasmid, the env-encoding plasmid, the rev expression plasmid, and the vif expression plasmid at a molar ratio of 3:3:3:1:1. Culture supernatant containing the viral vector particles was harvested after 48 hours, passed through a 0.45 μm membrane filter (Nalgene, NY, USA), and either used immediately or stored in a -70℃ deep freezer.
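Because the ratios above are molar, the mass of each plasmid added depends on its size. The following sketch shows one way to split a total DNA amount by molar ratio. The plasmid sizes and the 4 μg total are hypothetical values chosen for illustration and are not taken from this description; the function name is likewise an assumption.

```python
# Sketch only: split a total DNA mass among plasmids according to a molar ratio.
# Mass is proportional to (molar share x plasmid length), assuming double-stranded
# DNA of uniform average mass per base pair.


def dna_mass_per_plasmid(total_ug, plasmids):
    """plasmids maps name -> (molar_ratio, size_bp); returns name -> mass in ug."""
    weights = {name: ratio * size_bp for name, (ratio, size_bp) in plasmids.items()}
    total_weight = sum(weights.values())
    return {name: total_ug * w / total_weight for name, w in weights.items()}


# Hypothetical sizes (bp) for a four-plasmid transfection at 3:3:3:1.
four_plasmid_mix = {
    "gag_pol": (3, 10000),
    "transfer": (3, 8000),
    "env": (3, 6000),
    "rev": (1, 5000),
}

for name, ug in dna_mass_per_plasmid(4.0, four_plasmid_mix).items():
    print(f"{name}: {ug:.2f} ug")
```

With equal molar shares, a larger plasmid receives proportionally more mass, which is why a 1:1:1 molar mix is generally not a 1:1:1 mass mix.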

In Vivo Transduction

Transduction was performed by adding viral vector particles to 293T cells in the presence of 8 ㎍/㎖ polybrene for 4 hours and then adding fresh medium. After 48 hours, the cells were fixed with a solution of 1% formaldehyde and 0.2% glutaraldehyde and stained at 37℃ for 12 hours with a solution containing 300 ㎍ of 5-bromo-4-chloro-3-indolyl β-D-galactoside (X-Gal, Promega, WI, USA), 4 mM potassium ferrocyanide, 4 mM potassium ferricyanide, and 2 mM MgCl₂, after which β-galactosidase (β-gal) expression was analyzed. The titer can be determined by counting the blue foci and is expressed as LacZ-forming units per ㎖ (LFU/ml).
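As a rough illustration of how a focus count becomes a titer, the sketch below applies the usual back-calculation from counted foci, dilution, and inoculum volume. The dilution factor and inoculum volume are assumptions added for the example; this description states only that blue foci are counted and expressed as LFU/ml.

```python
# Sketch only: convert a count of blue (X-Gal positive) foci into LFU/ml.
# dilution_factor and inoculum_ml are illustrative parameters, not values from the text.


def titer_lfu_per_ml(blue_foci, dilution_factor, inoculum_ml):
    """LacZ-forming units per ml of undiluted vector supernatant."""
    if inoculum_ml <= 0:
        raise ValueError("inoculum volume must be positive")
    return blue_foci * dilution_factor / inoculum_ml


# Example: 50 foci counted after applying 0.5 ml of a 100-fold dilution.
print(titer_lfu_per_ml(50, 100, 0.5))
```

In practice several dilutions would normally be counted and averaged, which this sketch does not attempt.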

RT-PCR Analysis

Total RNA was extracted from cultured cells or cell supernatants using TRIzol LS Reagent (GIBCO BRL, CA, USA). The total RNA was treated with RNase-free DNase I (1 unit/mg of DNA, 37℃ for 20 minutes; Promega, WI, USA) to remove DNA contaminants. The DNase I reaction was stopped by adding the RQ1 DNase stop solution supplied with the enzyme, and the RNA was purified using the RNeasy mini kit (Qiagen, Germany). The purified RNA was reverse transcribed into cDNA in a reverse transcription (RT) reaction (37℃ for 90 minutes). Specifically, the RT reaction was carried out in the presence of MuLV reverse transcriptase, an oligo-dT primer or a C-terminus-specific primer, and a dNTP mixture. PCR amplification was carried out with specific primers for semi-quantitative analysis of the template DNA. Specifically, PCR product DNA was synthesized from cDNA or chromosomal DNA in the presence of thermostable Ex Taq polymerase, sequence-specific DNA primers, and a dNTP mixture.

Southern Blot Analysis

Genomic DNA was prepared, using the DNeasy Tissue Kit (Qiagen, Germany), from cells transduced with either pseudotyped HIV-1 or pseudotyped CAEV vector particles and from mock-transduced control cells. Ten ㎍ of genomic DNA from cells transduced with the HIV-1 vector was digested with BamH I and Kpn I. Ten ㎍ of each genomic DNA from cells transduced with the CAEV vectors was digested with EcoR I and Ssp I. The digested genomic DNA was separated by electrophoresis on a 0.7% agarose gel and transferred to a positively charged nylon membrane (Roche, Germany). A DIG-labeled probe was prepared by PCR using lacZ gene-specific primers (forward primer: CTGGCGTAATAGCGAAGAGG (SEQ ID NO: 65); reverse primer: AACTCGCCGCACATCTGAAC (SEQ ID NO: 66)), and Southern hybridization was performed according to the DIG application manual (Roche, Germany).
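For readers who want a quick sanity check of the two lacZ primers listed above, the following sketch computes their length, GC fraction, and a rough melting temperature. The Wallace rule used here (2 ℃ per A/T, 4 ℃ per G/C) is a coarse rule of thumb for short oligonucleotides and is an addition for illustration; this description does not state how the primers were designed or what annealing temperature was used.

```python
# Sketch only: basic properties of the lacZ probe primers (SEQ ID NOs: 65 and 66).
# wallace_tm is a rough estimate and is not a method taken from this description.

FORWARD = "CTGGCGTAATAGCGAAGAGG"  # SEQ ID NO: 65
REVERSE = "AACTCGCCGCACATCTGAAC"  # SEQ ID NO: 66


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq):
    """Wallace rule estimate in degrees C: 2 per A/T plus 4 per G/C."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    return 2 * at + 4 * gc


for name, primer in (("forward", FORWARD), ("reverse", REVERSE)):
    print(name, len(primer), gc_fraction(primer), wallace_tm(primer))
```

Both primers are 20 nucleotides long with the same GC content, so the rule gives them the same estimate.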

Growth Arrest of Cells and FACS Analysis of Growth-Arrested Cells

293T cells were growth-arrested by treatment with aphidicolin (25 μg/ml; Sigma, USA) and then transduced with CAEV viral vector particles. As positive or negative controls, cells were transduced in parallel with either an HIV-1 vector or a MuLV retroviral vector. Two days after transduction, the cells were stained with X-gal to assess β-gal activity. In aphidicolin-treated cells, aphidicolin was present before or after infection.

Growth arrest of the cells was confirmed by FACS analysis. Aphidicolin-treated cells and untreated control cells were washed with PBS, fixed overnight at -20 °C in 70% ethanol, and treated with propidium iodide (100 μg/ml; Sigma, USA) and RNase A (100 μg/ml; Qiagen, Germany) for 1 hour at room temperature. The cells were analyzed by FACS (Becton Dickinson, San Jose, CA), and the percentage of total live cells in the G1, S and G2/M phases of the cell cycle was calculated.
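The percentage calculation described above can be sketched as follows. This is a minimal illustration that assumes gated event counts per phase are already available; the function name `phase_percentages` and the counts in the usage example are hypothetical placeholders, not data from the source.

```python
# Illustrative only: convert gated FACS event counts into cell-cycle percentages.
# The counts used in the example are placeholders, not measured values.


def phase_percentages(counts: dict) -> dict:
    """Return the percentage of live-cell events falling in each cell-cycle phase."""
    total = sum(counts.values())
    if total <= 0:
        raise ValueError("at least one event is required")
    return {phase: 100.0 * n / total for phase, n in counts.items()}


if __name__ == "__main__":
    placeholder_counts = {"G1": 7000, "S": 2000, "G2/M": 1000}  # hypothetical
    print(phase_percentages(placeholder_counts))
```

Comparing the resulting distributions for treated and untreated cells is one way to express growth arrest numerically; the gating strategy itself is not specified in the text.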

Example 1: Production of CAEV-Based Lentiviral Vector Particles

Replication-defective lentiviral vector particles were produced by transient co-transfection of human 293T cells using a minimal 3-plasmid system consisting of a CAEV gag-pol expression plasmid, a CAEV env expression plasmid, and a transfer vector plasmid. In the 4-plasmid system, a CAEV rev expression plasmid is added; in the 5-plasmid system, a CAEV vif expression plasmid is added as well. For efficient packaging, the transfer vector was designed to include the first part of the gag-coding sequence, in which the start ATG codon and downstream ATG codons were mutated (ATG to TAG) to prevent expression of the gag protein. The RRE was included to enhance packaging efficiency, and in the 4- and 5-plasmid systems rev was expressed from a vector to support CAEV mRNA export. A β-galactosidase gene driven by an internal HCMV-MIEP promoter was inserted into the transfer vector plasmid as a reporter gene. The U3 region of the 5' LTR was replaced with HCMV-MIEP, a strong viral promoter, so that the vector genome is tat-independent.
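The ATG-to-TAG substitution described above can be illustrated in silico. The sketch below is an assumption-laden toy: the sequence is invented for the example and is not the CAEV gag sequence, and the function name `mutate_start_codons` is hypothetical. It only shows the idea of replacing selected ATG codons with TAG while leaving the rest of the sequence, and its length, unchanged.

```python
# Illustrative only: replace selected ATG codons with TAG in a DNA string.
# TOY_SEQUENCE is an invented example, not a sequence from the source.


def mutate_start_codons(seq: str, atg_positions: list) -> str:
    """Replace the ATG at each 0-based position with TAG; fail if no ATG is there."""
    bases = list(seq.upper())
    for pos in atg_positions:
        if "".join(bases[pos:pos + 3]) != "ATG":
            raise ValueError(f"no ATG codon at position {pos}")
        bases[pos:pos + 3] = "TAG"
    return "".join(bases)


if __name__ == "__main__":
    TOY_SEQUENCE = "GGATGCCCAAAATGTTT"  # hypothetical; ATG at positions 2 and 11
    print(mutate_start_codons(TOY_SEQUENCE, [2, 11]))
```

Because the substitution preserves length, any RNA structure elements in the retained gag region stay at the same coordinates, while translation initiation at the mutated sites is prevented.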

Transfer vector RNA transcription levels. The level of genomic RNA transcription from the transfer vector is one of the important factors in achieving high-titer production of recombinant viral vectors from packaging cells. In the present invention, HCMV enhancer/promoter elements were used to construct an HCMV/CAEV hybrid LTR promoter system for safe and efficient transcription of transfer vector RNA. To test the transcription levels of the pCAH/SINd series of transfer vector plasmids (SEQ ID NOs: 67-71, 73, 78, and 79), which contain the hybrid LTR promoter, each transfer vector plasmid was introduced into human T cells by liposome-mediated transfection together with the packaging plasmids (pMGP/RRE (SEQ ID NO: 77), pHYK/rev (SEQ ID NO: 75), pHYK/vif (SEQ ID NO: 76), and pHGVSV-G (SEQ ID NO: 74) or pMYKEF1/env (SEQ ID NO: 72)). After 48 hours of culture, total RNA was purified from the transfected cells and analyzed by reverse transcription polymerase chain reaction (RT-PCR) to measure vector RNA transcripts. The PCR primer set for the CAEV transfer vectors (the RRE primer set) was designed to synthesize a 348-bp PCR product encoding part of the RRE region. A second PCR primer set (the lacZ primer set), for the HIV-1 transfer vector pHRlacZ (Naldini et al., 1996), was designed to synthesize a 645-bp PCR product from the lacZ gene. As shown in FIG. 7, the CAEV transfer vectors of the present invention produced RNA transcripts at levels comparable to those of the HIV-1-based lentiviral transfer vector.

Formation and release of vector particles. To test the formation and release of mature, infectious viral vector particles, CAEV vector particles were produced by liposome-mediated co-transfection of human 293T cells (DuBridge et al., 1987) with the pMGP/RRE (SEQ ID NO: 77) gag-pol expression plasmid, the pHGVSV-G (SEQ ID NO: 74) env expression plasmid, the pHYK/rev (SEQ ID NO: 75) rev expression plasmid, the pHYK/vif (SEQ ID NO: 76) vif expression plasmid, and the pCAH/SINd60/hlacZ (SEQ ID NO: 78) transfer vector plasmid. Forty-eight hours after transfection, the culture supernatant was collected from the transfected cells and applied to fresh human 293T cells in the presence of 8 μg/ml polybrene for infection. The results showed that the 5-plasmid system of the present invention can produce viral vector particle titers comparable to those of the MuLV-based retroviral vector system (pEQPAM3, pMFG/lacZ/puro, pHGVSV-G (SEQ ID NO: 74)) (Ory, Neugeboren, and Mulligan, 1996; Persons et al., 1998) (see FIG. 8).

Example 2: Effect of Rev and Vif Expression on Vector Particle Production

To determine the effect of expression of the CAEV regulatory genes rev and vif on vector particle production, three vector particle production systems were tested in parallel for their efficiency: (1) a 3-plasmid system lacking both rev- and vif-coding sequences (pCAH/SIN; pMGP/RRE (SEQ ID NO: 77); pHGVSV-G (SEQ ID NO: 74) or pMYKEF1/env (SEQ ID NO: 72)); (2) a 4-plasmid system lacking the vif-coding sequence (pCAH/SIN; pMGP/RRE (SEQ ID NO: 77); pHGVSV-G (SEQ ID NO: 74) or pMYKEF1/env (SEQ ID NO: 72); pHYK/rev (SEQ ID NO: 75)); and (3) a 5-plasmid system containing both rev- and vif-coding sequences (pCAH/SIN; pMGP/RRE (SEQ ID NO: 77); pHGVSV-G (SEQ ID NO: 74) or pMYKEF1/env (SEQ ID NO: 72); pHYK/rev (SEQ ID NO: 75); pHYK/vif (SEQ ID NO: 76)). The plasmids of each system were transfected into 293T cells. Two days after transfection, transfer vector RNA and virion RNA were extracted from the transfected cells and from their culture medium, respectively, and used as RT-PCR templates with the lacZ primer set to detect the transfer vector RNA genome.

As shown in FIG. 9, although the expression level of transfer vector RNA in the packaging cells was independent of rev or vif gene expression (lanes 1, 2 and 3 in FIG. 9), the amount of transfer vector RNA encapsidated in the absence of rev (lane 4 in FIG. 9) was much lower than in its presence (lane 5 in FIG. 9). Surprisingly, however, the vector particle titer measured by RT-PCR of RNA encapsidated in the presence of vif (lane 6 in FIG. 9) was lower than that obtained without CAEV vif (lane 5 in FIG. 9). These results indicate that neither CAEV rev nor vif is required for vector particle production, but that rev is desirable for efficient vector particle production.

The present results concerning vif expression are inconsistent with those reported by Harmache et al. (Harmache et al., 1995; Harmache et al., 1996), who found that the vif gene is essential for efficient replication of CAEV in goat synovial membrane cells and affects the late steps of the viral replication cycle (e.g., RNA encapsidation and release of viral particles from the host cell). A plausible explanation for this discrepancy is that human 293T cells, rather than goat cells, were used to produce the recombinant CAEV vector particles. This interpretation supports the hypothesis proposed by Seroude et al. (Seroude et al., 2002) that vif and the virus-producing cell can modulate the action of vif on viral infectivity.

Example 3: Identification of the Optimal Packaging Signal Sequence

To identify the optimal packaging signal sequence for encapsidation of CAEV transfer vector RNA, a series of plasmids containing different portions of the CAEV gag-coding region together with the untranslated region between the 5' LTR and the gag start codon were compared for vector particle production efficiency as follows. Human 293T cells were co-transfected with the pMGP/RRE (SEQ ID NO: 77) gag-pol expression plasmid, the pHGVSV-G (SEQ ID NO: 74) env expression plasmid, the pHYK/rev (SEQ ID NO: 75) rev expression plasmid, the pHYK/vif (SEQ ID NO: 76) vif expression plasmid, and a plasmid of the pCAH/SINd transfer vector series (SEQ ID NOs: 67-71, 73, 78, and 79). As a negative control, the CAEV transfer vector pCAM/lacZ(L) was transfected without the packaging plasmids. Two days after transfection, virion RNA was extracted from the culture medium of the transfected cells and used as an RT-PCR template with either the RRE primer set, to detect RNA genomes of the CAEV transfer vector series, or the lacZ primer set, to detect the HIV-1 transfer vector RNA genome. As shown in FIG. 10, a strong PCR product signal, indicating efficient release of viral particles containing viral RNA, was detected in the culture medium collected from virus-producing 293T cells transfected with pCAH/SINd1 (SEQ ID NO: 68), which contains the full-length 5' LTR as well as the first 327 bp of the gag region (lane 3 in FIG. 10). This signal was comparable to that obtained with the HIV-1 vector positive control, meaning that the amount of encapsidated CAEV transfer vector RNA of the present invention is comparable to that of the HIV-1-based transfer vector (lane 8 in FIG. 10). The packaging efficiency of CAEV transfer vectors carrying the first 612 bp or more of the gag-coding region was markedly reduced (lanes 4, 5 and 6). No PCR product signal was detected when transfer vectors lacking gag-coding sequence were used (lanes 1 and 2 in FIG. 10). The negative control was transfected with the transfer vector alone, and the HIV-1 vector positive control was transfected with pCMVΔR8-2, pHR/lacZ and pHGVSV-G (SEQ ID NO: 74) (lanes 7 and 8 in FIG. 10).

In conclusion, transfer vector RNA was efficiently encapsidated in packaging cells when the transfer vector contained the entire untranslated region between the 5' LTR and the gag start codon together with less than about 600 bp of the N-terminal gag-coding sequence. These results indicate that, in RNA encapsidation, the secondary structure of the RNA within the packaging sequence plays a more important role than its primary structure.

Example 4: Pseudotyping of CAEV Vector Virions

To determine whether recombinant CAEV vector virions can be pseudotyped with the GaLV glycoprotein as well as the VSV-G glycoprotein, human 293T cells were co-transfected with the GaLV expression vector pMYKEF1/env (SEQ ID NO: 72) or the VSV-G expression vector pHGVSV-G (SEQ ID NO: 74), together with the transfer vector plasmid and the packaging plasmids. Forty-two hours after transfection, the culture supernatant containing pseudotyped virion particles released from the transfected cells was collected, clarified by filtration through a 0.45 μm membrane, and used to infect human 293T target cells. One day after infection, genomic DNA was purified using a genomic DNA isolation kit (Qiagen, HL, Germany), and PCR was performed to detect integrated proviral cDNA. As expected, the CAEV vector (lane 1 in FIG. 11) was efficiently pseudotyped with the VSV-G protein in comparison with the MuLV-based (lane 3 in FIG. 11) and HIV-1-based (lane 4 in FIG. 11) vectors. In addition, unlike the HIV-1 lentiviral vector system, the CAEV vector of the present invention was also successfully pseudotyped with the GaLV envelope (lane 2 in FIG. 11). This ability of the CAEV vector to be pseudotyped with the GaLV envelope may offer a significant advantage in developing clinical-grade lentiviral vector systems. The MuLV vector control (transfected with pEQPAM3, pMFG/lacZ/puro and pHGVSV-G (SEQ ID NO: 74)) and the HIV-1 vector control (transfected with pCMVΔR8-2, pHR/lacZ and pHGVSV-G (SEQ ID NO: 74)) are shown in lanes 3 and 4, respectively.

Example 5: Production of CAEV Packaging Cell Lines

The pMGP/RRE (SEQ ID NO: 77) and pHYK/rev (SEQ ID NO: 75) vectors encode the neo^r gene for selection in eukaryotic cells. For efficient selection after co-transfection with the gag-pol and rev expression vectors, an alternative CAEV gag-pol expression vector can be constructed by replacing the neo^r gene with another antibiotic resistance gene, such as the bacterial gpt gene. Alternatively, a packaging plasmid system encoding the gag, pol and rev genes can be used. To determine whether stable 293T cells expressing the CAEV packaging proteins can be generated, antibiotic-resistant colonies are selected in selection medium. Production of recombinant CAEV vectors from the stable 293T cells demonstrates the ease with which stable packaging cell lines for CAEV vector production can be generated.

Example 6: Integration of CAEV-Based Vector cDNA into the Host Chromosome

To test integration of CAEV vector cDNA after transduction, CAEV vector particles were produced by liposome-mediated co-transfection of human 293T cells with the pMGP/REV/RRE gag-pol expression plasmid, the pHGVSV-G (SEQ ID NO: 74) env expression plasmid, and the pCAH/SINd1/hlacZ (SEQ ID NO: 79) transfer vector plasmid. As a positive control, HIV-1 vector particles were produced by co-transfecting 293T cells with the pCMVΔR8.2 gag-pol expression plasmid, the pHGVSV-G (SEQ ID NO: 74) env expression plasmid, and the pHR/lacZ transfer vector. As a negative control, only the pCAH/SINd1/hlacZ (SEQ ID NO: 79) transfer vector plasmid was transfected. Forty-eight hours after transfection, the culture supernatant was collected from each set of transfected cells and applied to fresh 293T cells in the presence of 8 μg/ml polybrene for infection. After a further 48 hours, genomic DNA was prepared from each set of transduced cells, digested with restriction enzymes, and subjected to Southern blot analysis. The Dig-labeled lacZ probe detected a 3.15-kb BamH I-Kpn I fragment for the HIV-1-based transfer vector and a 1.35-kb Hind III-Ssp I fragment for the CAEV-based transfer vector and the negative control. As positive controls, 0.3 ng and 3 ng of the Hind III-Ssp I DNA fragment of the pCAH/SINd1/hlacZ (SEQ ID NO: 79) transfer vector plasmid were used. As shown in FIG. 12, the CAEV-based transfer vector of the present invention integrated at a level comparable to that of the HIV-1-based lentiviral transfer vector.
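The expected fragment sizes in a double digest such as the Hind III-Ssp I digest above can be predicted from a sequence by locating the recognition sites. The sketch below uses the standard recognition sequences and top-strand cut offsets for the enzymes named in this document; the input sequence is an invented toy, not a vector sequence from the source, and the helper names are hypothetical. It treats the DNA as linear and reports top-strand fragment lengths only.

```python
# Illustrative only: predict top-strand fragment lengths for a linear DNA digest.
# Each entry maps an enzyme to (recognition site, cut offset within the site).
SITES = {
    "BamHI": ("GGATCC", 1),    # G^GATCC
    "KpnI": ("GGTACC", 5),     # GGTAC^C
    "HindIII": ("AAGCTT", 1),  # A^AGCTT
    "SspI": ("AATATT", 3),     # AAT^ATT (blunt)
    "EcoRI": ("GAATTC", 1),    # G^AATTC
}


def cut_positions(seq: str, enzyme: str) -> list:
    """Return 0-based top-strand cut positions for one enzyme."""
    site, offset = SITES[enzyme]
    seq = seq.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start + offset)
        start = seq.find(site, start + 1)
    return positions


def fragment_lengths(seq: str, enzymes: list) -> list:
    """Return fragment lengths, in order, after digesting with all listed enzymes."""
    cuts = sorted({p for e in enzymes for p in cut_positions(seq, e)})
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    # Hypothetical 52-nt toy sequence with one HindIII site and one SspI site.
    toy = "C" * 10 + "AAGCTT" + "G" * 20 + "AATATT" + "C" * 10
    print(fragment_lengths(toy, ["HindIII", "SspI"]))
```

Applied to a real provirus sequence, the internal fragment bounded by the two sites is the band the lacZ probe would be expected to detect, provided the probe region lies between them.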

Example 7: Gene Transfer into Non-Dividing Cells

293T cells were treated with aphidicolin, a DNA synthesis inhibitor, seeded into 6-well culture plates, and then transduced with CAEV vector particles encoding the lacZ marker gene. As controls, cells were infected in parallel with a lacZ-expressing MuLV retroviral vector and an HIV-1 lentiviral vector. Forty-eight hours after infection, expression of the transduced lacZ gene was measured by X-gal staining to assess transduction efficiency. As shown in FIG. 14, the MuLV-derived vector efficiently infected cells that had not been treated with the DNA synthesis inhibitor. However, when the cells were arrested in the cell cycle by treatment with the DNA synthesis inhibitor, its transduction efficiency dropped markedly. In contrast, the CAEV-based vector efficiently transduced non-dividing as well as dividing human cells, at a level comparable to that of the HIV-1-based vector.

Example 8: In Vivo Transduction of Muscle Cells

In this example, the pCAH/SINd1/hlacZ (SEQ ID NO: 79) CAEV vector is used to transduce muscle cells in vivo. The hind limbs of mice (Beige strain) were injected intramuscularly with 100 μl of the CAEV vector in the presence of 4 μg/ml polybrene. Two days later, the mice were sacrificed, and the infected tissue was prepared for frozen sectioning and β-galactosidase analysis. As expected, the pCAH/SINd1/hlacZ (SEQ ID NO: 79) CAEV vector effectively transduces muscle cells in vivo.

The detailed description above, including the specific embodiments and examples, is intended to illustrate the invention, not to limit it. Many other variations and modifications may be made within the spirit and scope of the invention. The sequences deposited in the NCBI database, the patents, and the patent applications cited herein are incorporated into this description by reference in their entirety.

Cited References

Altschul, S. F., Gish, W., Miller, W., Myers, E. W., and Lipman, D. J. (1990). Basic local alignment search tool. J Mol Biol 215(3), 403-10.

Altschul, S. F., Madden, T. L., Schaffer, A. A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D. J. (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25(17), 3389-402.

Anderson, M. P., Rich, D. P., Gregory, R. J., Smith, A. E., and Welsh, M. J. (1991). Generation of cAMP-activated chloride currents by expression of CFTR. Science 251(4994), 679-82.

Bridges, S. H., and Sarver, N. (1995). Gene therapy and immune restoration for HIV disease. Lancet 345(8947), 427-32.

Burns, J. C., Friedmann, T., Driever, W., Burrascano, M., and Yee, J. K. (1993). Vesicular stomatitis virus G glycoprotein pseudotyped retroviral vectors: concentration to very high titer and efficient gene transfer into mammalian and nonmammalian cells. Proc Natl Acad Sci U S A 90(17), 8033-7.

Carswell, S., and Alwine, J. C. (1989). Efficiency of utilization of the simian virus 40 late polyadenylation site: effects of upstream sequences. Mol Cell Biol 9(10), 4248-58.

Cech, T. R. (1988). Ribozymes and their medical implications. JAMA 260(20), 3030-4.

Chang, M. W., Barr, E., Seltzer, J., Jiang, Y. Q., Nabel, G. J., Nabel, E. G., Parmacek, M. S., and Leiden, J. M. (1995). Cytostatic gene therapy for vascular proliferative disorders with a constitutively active form of the retinoblastoma gene product. Science 267(5197), 518-22.

Curran, M. A., and Nolan, G. P. (2002). Nonprimate lentiviral vectors. Curr Top Microbiol Immunol 261, 75-105.

Crawford, I., Maloney, P. C., Zeitlin, P. L., Guggino, W. B., Hyde, S. C., Turley, H., Gatter, K. C., Harris, A., and Higgins, C. F. (1991). Immunocytochemical localization of the cystic fibrosis gene product CFTR. Proc Natl Acad Sci U S A 88(20), 9262-6.

DuBridge, R. B., Tang, P., Hsia, H. C., Leong, P. M., Miller, J. H., and Calos, M. P. (1987). Analysis of mutation in human cells by using an Epstein-Barr virus shuttle system. Mol Cell Biol 7(1), 379-87.

Erlich, H. A. (1989). Polymerase chain reaction. J Clin Immunol 9(6), 437-47.

Fagerhol, M. K., and Cox, D. W. (1981). The Pi polymorphism: genetic, biochemical, and clinical aspects of human alpha 1-antitrypsin. Adv Hum Genet 11, 1-62, 371-2.

Gilbert, J. R., and Wong-Staal, F. (2001). HIV-2 and SIV vector systems. Somat Cell Mol Genet 26(1-6), 83-98.Gilbert, JR, and Wong-Staal, F. (2001). HIV-2 and SIV vector systems. Somat Cell Mol Genet 26 (1-6) , 83-98.

Gillam, S., and Smith, M. (1979). Site-specific mutagenesis using synthetic oligodeoxyribonucleotide primers: I. Optimum conditions and minimum ologodeoxyribonucleotide length. Gene 8(1), 81-97.Gillam, S., and Smith, M. (1979). Site-specific mutagenesis using synthetic oligodeoxyribonucleotide primers: I. Optimum conditions and minimum ologodeoxyribonucleotide length. Gene 8 (1) , 81-97.

Hall, C. V., Jacob, P. E., Ringold, G. M., and Lee, F. (1983). Expression and regulation of Escherichia coli lacZ gene fusions in mammalian cells. J Mol Appl Genet 2(1), 101-9.Hall, CV, Jacob, PE, Ringold, GM, and Lee, F. (1983). Expression and regulation of Escherichia coli lacZ gene fusions in mammalian cells. J Mol Appl Genet 2 (1) , 101-9.

Harmache, A., Bouyac, M., Audoly, G., Hieblot, C., Peveri, P., Vigne, R., and Suzan, M. (1995). The vif gene is essential for efficient replication of caprine arthritis encephalitis virus in goat synovial membrane cells and affects the late steps of the virus replication cycle. J Virol 69(6), 3247-57.Harmache, A., Bouyac, M., Audoly, G., Hieblot, C., Peveri, P., Vigne, R., and Suzan, M. (1995). The vif gene is essential for efficient replication of caprine arthritis encephalitis virus in goat synovial membrane cells and affects the late steps of the virus replication cycle. J Virol 69 (6) , 3247-57.

Harmache, A., Russo, P., Guiguen, F., Vitu, C., Vignoni, M., Bouyac, M., Hieblot, C., Pepin, M., Vigne, R., and Suzan, M. (1996). Requirement of caprine arthritis encephalitis virus vif gene for in vivo replication. Virology 224(1), 246-55.Harmache, A., Russo, P., Guiguen, F., Vitu, C., Vignoni, M., Bouyac, M., Hieblot, C., Pepin, M., Vigne, R., and Suzan, M. (1996). Requirement of caprine arthritis encephalitis virus vif gene for in vivo replication. Virology 224 (1) , 246-55.

Helene, C. (1991). The anti-gene strategy: control of gene expression by triplex-forming-oligonucleotides. Anticancer Drug Des 6(6), 569-84.Helene, C. (1991). The anti-gene strategy: control of gene expression by triplex-forming-oligonucleotides. Anticancer Drug Des 6 (6) , 569-84.

Hoffman, E. P., Brown, R. H., Jr., and Kunkel, L. M. (1987). Dystrophin: the protein product of the Duchenne muscular dystrophy locus. Cell 51(6), 919-28.Hoffman, EP, Brown, RH, Jr., and Kunkel, LM (1987). Dystrophin: the protein product of the Duchenne muscular dystrophy locus. Cell 51 (6) , 919-28.

Horwich, A. L., Fenton, W. A., Williams, K. R., Kalousek, F., Kraus, J. P., Doolittle, R. F., Konigsberg, W., and Rosenberg, L. E. (1984). Structure and expression of a complementary DNA for the nuclear coded precursor of human mitochondrial ornithine transcarbamylase. Science 224(4653), 1068-74.Horwich, AL, Fenton, WA, Williams, KR, Kalousek, F., Kraus, JP, Doolittle, RF, Konigsberg, W., and Rosenberg, LE (1984). Structure and expression of a complementary DNA for the nuclear coded precursor of human mitochondrial ornithine transcarbamylase. Science 224 (4653) , 1068-74.

Karlin, S., and Altschul, S. F. (1990). Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. Proc Natl Acad Sci U S A 87(6), 2264-8.Karlin, S., and Altschul, SF (1990). Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. Proc Natl Acad Sci USA 87 (6) , 2264-8.

Karlin, S., and Altschul, S. F. (1993). Applications and statistics for multiple high-scoring segments in molecular sequences. Proc Natl Acad Sci U S A 90(12), 5873-7.

Kim, S. J., Sadelain, M., Choi, K. H., Kim, H. K., Lee, J. S., and Chung, H. Y. (1997). Tetracycline-mediated suppression of gene expression with a new dicistronic retroviral vector. Mol Cells 7(4), 514-20.

Kim, S. Y., Lee, J. H., Shin, H. S., Kang, H. J., and Kim, Y. S. (2002). The human elongation factor 1 alpha (EF-1 alpha) first intron highly enhances expression of foreign genes from the murine cytomegalovirus promoter. J Biotechnol 93(2), 183-7.

Kordower, J. H., Emborg, M. E., Bloch, J., Ma, S. Y., Chu, Y., Leventhal, L., McBride, J., Chen, E. Y., Palfi, S., Roitberg, B. Z., Brown, W. D., Holden, J. E., Pyzalski, R., Taylor, M. D., Carvey, P., Ling, Z., Trono, D., Hantraye, P., Deglon, N., and Aebischer, P. (2000). Neurodegeneration prevented by lentiviral vector delivery of GDNF in primate models of Parkinson's disease. Science 290(5492), 767-73.

Landau, N. R., and Littman, D. R. (1992). Packaging system for rapid production of murine leukemia virus vectors with variable tropism. J Virol 66(8), 5110-3.

Lee, C. C., Pearlman, J. A., Chamberlain, J. S., and Caskey, C. T. (1991). Expression of recombinant dystrophin and its localization to the cell membrane. Nature 349(6307), 334-6.

Lemarchand, P., Jaffe, H. A., Danel, C., Cid, M. C., Kleinman, H. K., Stratford-Perricaudet, L. D., Perricaudet, M., Pavirani, A., Lecocq, J. P., and Crystal, R. G. (1992). Adenovirus-mediated transfer of a recombinant human alpha 1-antitrypsin cDNA to human endothelial cells. Proc Natl Acad Sci U S A 89(14), 6482-6.

Maher, L. J., 3rd, Wold, B., and Dervan, P. B. (1991). Oligonucleotide-directed DNA triple-helix formation: an approach to artificial repressors. Antisense Res Dev 1(3), 277-81.

Marcus-Sekura, C. J. (1988). Techniques for using antisense oligodeoxyribonucleotides to study gene expression. Anal Biochem 172(2), 289-95.

Miller, A. D. (1992). Human gene therapy comes of age. Nature 357(6378), 455-60.

Mitrophanous, K., Yoon, S., Rohll, J., Patil, D., Wilkes, F., Kim, V., Kingsman, S., Kingsman, A., and Mazarakis, N. (1999). Stable gene transfer to the nervous system using a non-primate lentiviral vector. Gene Ther 6(11), 1808-18.

Mselli-Lakhal, L., Favier, C., Da Silva Teixeira, M. F., Chettab, K., Legras, C., Ronfort, C., Verdier, G., Mornex, J. F., and Chebloune, Y. (1998). Defective RNA packaging is responsible for low transduction efficiency of CAEV-based vectors. Arch Virol 143(4), 681-95.

Mullen, C. A., Kilstrup, M., and Blaese, R. M. (1992). Transfer of the bacterial gene for cytosine deaminase to mammalian cells confers lethal sensitivity to 5-fluorocytosine: a negative selection system. Proc Natl Acad Sci U S A 89(1), 33-7.

Mulligan, R. C. (1993). The basic science of gene therapy. Science 260(5110), 926-32.

Mullis, K. B., and Faloona, F. A. (1987). Specific synthesis of DNA in vitro via a polymerase-catalyzed chain reaction. Methods Enzymol 155, 335-50.

Murakami, K., Amaya, Y., Takiguchi, M., Ebina, Y., and Mori, M. (1988). Reconstitution of mitochondrial protein transport with purified ornithine carbamoyltransferase precursor expressed in Escherichia coli. J Biol Chem 263(34), 18437-42.

Naldini, L., Blomer, U., Gallay, P., Ory, D., Mulligan, R., Gage, F. H., Verma, I. M., and Trono, D. (1996). In vivo gene delivery and stable transduction of nondividing cells by a lentiviral vector. Science 272(5259), 263-7.

Ory, D. S., Neugeboren, B. A., and Mulligan, R. C. (1996). A stable human-derived packaging cell line for production of high titer retrovirus/vesicular stomatitis virus G pseudotypes. Proc Natl Acad Sci U S A 93(21), 11400-6.

Persons, D. A., Mehaffey, M. G., Kaleko, M., Nienhuis, A. W., and Vanin, E. F. (1998). An improved method for generating retroviral producer clones for vectors lacking a selectable marker gene. Blood Cells Mol Dis 24(2), 167-82.

Pfarr, D. S., Rieser, L. A., Woychik, R. P., Rottman, F. M., Rosenberg, M., and Reff, M. E. (1986). Differential effects of polyadenylation regions on gene expression in mammalian cells. DNA 5(2), 115-22.

Rhodes, A., and James, W. (1991). Inhibition of heterologous strains of HIV by antisense RNA. AIDS 5(2), 145-51.

Riordan, J. R., Rommens, J. M., Kerem, B., Alon, N., Rozmahel, R., Grzelczak, Z., Zielenski, J., Lok, S., Plavsic, N., Chou, J. L., et al. (1989). Identification of the cystic fibrosis gene: cloning and characterization of complementary DNA. Science 245(4922), 1066-73.

Roberts, S., Cheetham, J. C., and Rees, A. R. (1987). Generation of an antibody with enhanced affinity and specificity for its antigen by protein engineering. Nature 328(6132), 731-4.

Rommens, J. M., Iannuzzi, M. C., Kerem, B., Drumm, M. L., Melmer, G., Dean, M., Rozmahel, R., Cole, J. L., Kennedy, D., Hidaka, N., et al. (1989). Identification of the cystic fibrosis gene: chromosome walking and jumping. Science 245(4922), 1059-65.

Saltarelli, M., Querat, G., Konings, D. A., Vigne, R., and Clements, J. E. (1990). Nucleotide sequence and transcriptional analysis of molecular clones of CAEV which generate infectious virus. Virology 179(1), 347-64.

Saltarelli, M. J., Schoborg, R., Gdovin, S. L., and Clements, J. E. (1993). The CAEV tat gene trans-activates the viral LTR and is necessary for efficient viral replication. Virology 197(1), 35-44.

Saltarelli, M. J., Schoborg, R., Pavlakis, G. N., and Clements, J. E. (1994). Identification of the caprine arthritis encephalitis virus Rev protein and its cis-acting Rev-responsive element. Virology 199(1), 47-55.

Sauter, S. L., and Gasmi, M. (2001). FIV vector systems. Somat Cell Mol Genet 26(1-6), 99-129.

Seroude, V., Audoly, G., Gluschankof, P., and Suzan, M. (2002). Viral and cellular specificities of caprine arthritis encephalitis virus Vif protein. Virology 292(1), 156-61.

Smith, T. F., Waterman, M. S., and Fitch, W. M. (1981). Comparative biosequence metrics. J Mol Evol 18(1), 38-46.

Soneoka, Y., Cannon, P. M., Ramsdale, E. E., Griffiths, J. C., Romano, G., Kingsman, S. M., and Kingsman, A. J. (1995). A transient three-plasmid expression system for the production of high titer retroviral vectors. Nucleic Acids Res 23(4), 628-33.

To, R. Y., Booth, S. C., and Neiman, P. E. (1986). Inhibition of retroviral replication by anti-sense RNA. Mol Cell Biol 6(12), 4758-62.

Toole, J. J., Knopf, J. L., Wozney, J. M., Sultzman, L. A., Buecker, J. L., Pittman, D. D., Kaufman, R. J., Brown, E., Shoemaker, C., Orr, E. C., et al. (1984). Molecular cloning of a cDNA encoding human antihaemophilic factor. Nature 312(5992), 342-7.

von Ruden, T., and Gilboa, E. (1989). Inhibition of human T-cell leukemia virus type I replication in primary human T cells that express antisense RNA. J Virol 63(2), 677-82.

Wang, C. C. (1991). A novel suicide inhibitor strategy for antiparasitic drug development. J Cell Biochem 45(1), 49-53.

Warne, P. H., Viciana, P. R., and Downward, J. (1993). Direct interaction of Ras and the amino-terminal region of Raf-1 in vitro. Nature 364(6435), 352-5.

Weintraub, H. M. (1990). Antisense RNA and DNA. Sci Am 262(1), 40-6.

Wills, K. N., Maneval, D. C., Menzel, P., Harris, M. P., Sutjipto, S., Vaillancourt, M. T., Huang, W. M., Johnson, D. E., Anderson, S. C., Wen, S. F., et al. (1994). Development and characterization of recombinant adenoviruses encoding human p53 for gene therapy of cancer. Hum Gene Ther 5(9), 1079-88.

Wion, K. L., Kelly, D., Summerfield, J. A., Tuddenham, E. G., and Lawn, R. M. (1985). Distribution of factor VIII mRNA and antigen in human liver and other tissues. Nature 317(6039), 726-9.

Figures 112007026147270-PCT00001 through 112007026147270-PCT00075

SEQUENCE LISTING <110> MACROGEN CO., LTD <120> CAEV-BASED VECTOR SYSTEMS <160> 79 <170> KopatentIn 1.71 <210> 1 <211> 9189 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 1 gagttctagg agagtccctc ctagtctctc ctctccgagg aggtaccgag acctcaaaat 60 aaaggagtga ttgccttact gccgagtgga gagtgattac tgagcggccg gtgtatcggg 120 agtcgtccct taatctgtgc aataccagag cggctctcgc agctggcgcc caacgtgggg 180 cccgaggaga agaaaagaaa gcggccctga gaactcggct tctgaaaaag aggaagagga 240 caagttgcta tagcaacaag agagaagaag tagagcaaag gtccagtggc tcggaaaaag 300 aggaactgaa acttcgggga cgcctgaagg agtaaggtaa gtgactctgc tgtacgcggg 360 gcgaggcaga ggtttccttc taaattgaaa gagaagtgtt gctgcgagag gtcttggtgg 420 tcgagaatcc tgtacaaaaa aaaggaggga tctcggtcag gaccaggacc cctgggagta 480 atacaacagc aacaccgtaa gaaaatccgc catggtgagt ctagatagag acatggcgag 540 gcaagtctcc ggggggaaaa gagattatcc tgagctcgaa aaatgtatca agcatgcatg 600 caagataaaa gttcgactca gaggggagca cttgacagaa ggaaattgtt tatggtgcct 660 taaaacatta gattacatgt ttgaggacca taaagaggaa ccttggacaa aagtaaaatt 720 taggacaata tggcagaagg tgaagaatct aactcctgag gagagtaaca aaaaagactt 780 tatgtctttg caggccacat tagcgggtct aatgtgttgc caaatgggga tgagacctga 840 gacattgcaa gatgcaatgg ctacagtaat catgaaagat gggttactgg aacaagagga 900 aaagaaggaa gacaaaagag aaaaggaaga gagtgtcttc ccaatagtag tgcaagcagc 960 aggagggaga agctggaaag cagtagattc tgtaatgttc cagcaactgc aaacagtagc 1020 aatgcagcat ggcctcgtgt ctgaggactt tgaaaggcag ttggcatatt atgctactac 1080 ctggacaagt aaagacatac tagaagtatt ggccatgatg cctggaaata gagctcaaaa 1140 ggagttaatt caagggaaat taaatgaaga agcagaaagg tggagaagga ataatccacc 1200 acctccagca ggaggaggat taacagtgga tcaaattatg ggggtaggac aaacaaatca 1260 agcagcagca caagctaaca tggatcaggc aaggcaaata tgcctgcaat gggtaataaa 1320 tgcattaaga gcagtaagac atatggcgca caggccaggg aatccaatgc tagtaaagca 1380 aaaaacgaat gagccatatg aagattttgc agcaagactg ctagaagcaa tagatgcaga 1440 gccagttaca cagcctataa aagattatct aaagctaaca ctatcttata caaatgcatc 1500 agcagattgt cagaagcaaa tggatagaac actaggacaa agagtacaac 
aagctagtgt 1560 agaagaaaaa atgcaagcat gtagagatgt gggatcagaa gggttcaaaa tgcaattgtt 1620 agcacaagca ttaaggccag gaaaaggaaa agggaatgga cagccacaaa ggtgttacaa 1680 ctgtggaaaa ccgggacatc aagcaaggca atgtagacaa ggaatcatat gtcacaactg 1740 tggaaagaga ggacatatgc aaaaagaatg cagaggaaag agagacataa ggggaaaaca 1800 gcagggaaac gggaggaggg ggatacgtgt ggtgccgtcc gctcctccta tggaataact 1860 tcagcaccac ctatggttca ggtccgcata ggttcccagc agaggaactt gttatttgat 1920 accggggcgg accgaactat agttagatgg catgagggct cgggaaaccc agccggaagg 1980 ataaaactgc aaggaatagg aggaatagta gaaggagaaa aatggaataa tgtagaatta 2040 gaatataaag gagaaacaag aaagggaaca atagtagtgt taccacaaag tccagtagaa 2100 gtattaggac gagataacat ggcccgattt ggaataaaga taataatggc aaatttagag 2160 gaaaaaagaa tcccaattac aaaagtaaaa ttgaaagagg gatgtacggg tccacatgtc 2220 ccacaatggc cattaacaga agagaaatta aaaggtctaa cagaaatcat agataaatta 2280 gtggaagaag gaaaactagg aaaggcaccc ccacattgga catgtaatac tccaatcttt 2340 tgcataaaaa agaaatcagg gaagtggaga atgttaatag atttcagaga attgaacaaa 2400 cagacagaag atttaacaga agcgcagtta ggactcccgc atccgggagg actacaaaag 2460 aaaaaacatg ttacaatatt ggacatagga gatgcatatt ttactatacc cctatatgaa 2520 ccatatcgag agtacacatg ttttactcta ttaagtccta ataatctagg accatgtaaa 2580 agatactatt ggaaagtgct gccacaaggt tggaaattga gtccatctgt atatcaattt 2640 actatgcagg agatcttaga ggattggata cagcagcatc cagaaattca atttggcata 2700 tatatggatg atatttacat aggaagtgat ttagaaatta aaaagcatag agaaatagtg 2760 aaagatttag ccaattatat tgcccaatat ggattcactc tgccagaaga gaagagacaa 2820 aagggatatc cagcaaaatg gctaggattt gaactacacc cgcagacctg gaaatttcag 2880 aagcatacat tacctgaatt aacaaaggga acaataacat taaataaatt acagaaatta 2940 gtaggagaat tagtatggag acaatccata attgggaaaa gcattcctaa cattctgaaa 3000 ttaatggaag gagatagaga attacaaagt gaaagaaaaa ttgaagaagt acatgtgaaa 3060 gaatgggaag catgtaggaa aaaattagaa gaaatggaag gaaattatta taataaagac 3120 aaagatgtct atggacaatt ggcttgggga gacaaagcta tagaatatat agtgtatcag 3180 gagaaaggga aaccattatg ggtaaatgtg gttcacaata taaagaacct aagcatcccg 
3240 caacaggtta ttaaagcagc gcaaaaatta acccaagaag tcatcattag gacaggaaaa 3300 ataccatgga tattgttgcc agggaaagaa gaagattgga gactagaatt gcaattaggg 3360 aacatcacat ggatgccaaa attttggtcc tgttatcgag gacatacaag atggagaaaa 3420 agaaatataa tagaagaagt agtagaaggg cctacatatt atacagatgg aggaaaaaag 3480 aataaagtag gaagtctagg gttcatagta tcaacagggg aaaaatttag aaagcatgaa 3540 gagggcacaa accagcaact agaattaaga gccatagagg aagctctaaa acaagggcct 3600 caaacaatga atttagtaac agatagtaga tatgcatttg aatttttatt aagaaattgg 3660 gatgaagaag taataaagaa tccaattcaa gcaagaatta tggaaattgc ccacaagaaa 3720 gataggatag gagtgcattg ggtgccagga cataaaggga ttccccaaaa tgaagaaata 3780 gacaaatata tttcggaaat atttcttgca aaagaaggag aaggaattct cccaaaaaga 3840 gaagaggatg cagggtatga tttaatatgc ccagaagagg ttaccataga gccaggacaa 3900 gtgaaatgca tccccataga gctaagatta aatttaaaga aatcacaatg ggctatgatt 3960 gctacaaaaa gcagcatggc tgccaaagga gtgttcacac aaggaggaat catagactca 4020 ggatatcagg gacaaataca ggtaataatg tataatagca ataaaatagc agtagtcata 4080 ccccaaggga gaaaatttgc acaattaata ttaatggata aaaagcatgg aaaattggaa 4140 ccctgggggg aaagcagaaa aacagaaagg ggagaaaaag gatttgggtc tacaggaatg 4200 tattggatag aaaatattcc tctggcagag gaagaccaca caaaatggca tcaagatgcc 4260 cgatcattgc atctagaatt tgaaattcca agaacagcag cagaagacat agtaaatcaa 4320 tgtgaaatat gcaaagaagc gaggacacct gcagtaatta gaggcggaaa caaaaggggg 4380 gtaaatcatt ggcaagtgga ttatacccat tatgaaaata tcatactatt agtatgggta 4440 gaaacaaatt caggactaat atatgcagaa aaagtaaaag gagaatcagg gcaagaattc 4500 agaataaaag tgatgcattg gtatgcatta tttggtccag agtcattgca gtcagacaat 4560 ggacctgcat ttgcagcaga gcccacacag ctgttaatgc aatacctagg agtaaaacac 4620 acaacaggca taccttggaa tccacagtct caggctatag tagaaagggc acatcaacta 4680 ttgaaaagca ctttaaagaa gttccagcca caatttgtcg ctgtagaatc agccatagca 4740 gcagccctag tcgccataaa tataaaaaga aagggtgggc tggggacaag ccctatggat 4800 atttttatat ataataaaga acagaaaaga ataaataata aatataataa aaattctcaa 4860 aaaattcaat tctgttatta cagaataagg aaaagaggac atcaggagag tggaaaggac 4920 
caacccaggt actgtggaaa ggggaaggag ccaattgtgg taaaggatat agaaagtgaa 4980 aagtatttag taatacctta caaagatgca aaattcatcc cgccaccaac aaaagaaaag 5040 gaataaaaaa cctggaccag aattaccctt agcactatgg atacatatag cagaaagcat 5100 taatggggat agctcatggt acataacaat gagactgcaa cagatgatgt ggggaaaaag 5160 aggaaataag ttacaatata agaatgaaga cagggaatat gaaaattggg aaattacatc 5220 atggggatgg aaaatgcacc taaggagagt gaaacaatgg atacaagaca acaggagagg 5280 aagcccatgg cagtacaaag taggaggaac atggaaaagt ataggagtgt ggttcctgca 5340 agcaggagat tacagaaagg tagacaggca cttctggtgg gcatggagga tactgatatg 5400 ttcctgcagg aaagaaaagt ttgatataag agaatttatg agaggaagac atagatggga 5460 tttgtgcaaa tcctgtgctc aaggagaagt agtaaagcat actagaacaa aaagtctgga 5520 aagactagta ctgctacaga tggtagaaca gcatgtgttt caagtattgc cattgtggag 5580 agccaggaga agtagtacaa cagatttccc atggtgcagg gacacaacgg gatacacgca 5640 tgcgtggtct gtccaggagt gctggttgat ggaatatctc ttagaggatg agtgaagaac 5700 tgcctcaaag aagggagaca catccagaag aacttgtaag gaacgtacgg gaaagagaaa 5760 gggatacatg gcaatggaca agcatcagag tacctgcgga aatactgcaa agatggcttg 5820 ctatgcttag gtcaggcaga aatagaaaga aagtgtatag agaaatgcaa aaatggatgt 5880 ggatacatcc caaggcgcct gtgattaggg cctgtggatg cagactatgt aacccggggt 5940 ggggaacata atcaagggaa taataaatgc aaataaatgt aactaacaag tagcaaaagt 6000 gtctgtgtta gatggatgct ggggccagat acatgcgctt aactgggaag gaaaactggg 6060 ttgaagtaac catggacgga gagaaggaaa ggaaaagaga aggtttcact gcgggacagc 6120 aaggtaagta tcaaccccag gtaagtaagc aaatagggaa cagaaatact aacccatgct 6180 ttgcctataa agggatattc ctatggagga tatcactaac aatgtggata ttgctaggga 6240 taaatatgtg tgtcagtgca gaggattaca taacactaat atcagatccc tatgggttct 6300 cacccataaa aaatgtgtct ggggtaccag tgacttgtgt aacaaaagaa ttcgcaaaat 6360 ggggatgtca accactagga gcgtaccctg atccagaaat agaatacaga aatgtgagtc 6420 aggaagtagt gaaagaagta tatcaagaga attggccatg gaatacatat cattggcctc 6480 tctggcaaat ggagaatgtt aggtactggt taaaagaaaa tatgcaagaa aatcaacaga 6540 gaaaaaataa tacaaaagag ggtatagagg aattattagc aggaactata aggggaagat 6600 tctgtgtacc 
atacccattt gccttgttaa aatgcacaaa gtggtgctgg tatacagcgg 6660 ccataaacaa cgagtcagga aaagcaggaa aaataaaaat aaattgcaca gaagcaagag 6720 cagtctcctg tacagaggac atgccattag cctcaataca aagagcatat tgggatgaga 6780 aagacagaga gagcatggcc tttatgaata tcaaagcatg tgatagcaac ctaaggtgtc 6840 agaaaagacc tggagggtgt atggaaggat accctatccc agtaggagca gaaataatcc 6900 ctgaaagtat gaaataccta aggggagcaa agagtcagta tgggggaata aaagataaga 6960 atggagaatt aaaattacca ttaacattaa gagtgtgggt aaaattagca aatgtgtcag 7020 aatgggtaaa tgggacaccc ccggattggc aagacagaat taacggatcc aaaggaataa 7080 atgggacgct ctggggagag cttaacagta tgcatcacct aggatttgcc cttagccaga 7140 acggcaaatg gtgtaactac accggggaaa taaaattagg gcaagaaaca ttccaatatc 7200 attacaagcc aaactggaac tgtaccggga attggacgca atatccggtg tggcaagtga 7260 ttagaaacct ggatatggtg gaacatatga caggagaatg tgtgcagaga ccacaaaggc 7320 acaatataac agtaggaaat ggaaccataa cagggaattg cagtacaaca aactgggatg 7380 gatgtaattg ctcacgatca ggaaactacc tatataacag ctctgaggga ggattgttat 7440 taattctgtg cagacaaaac agcaccctaa caaggatcct gggaacaaat acaaattgga 7500 caactatgtg gggaatatac aaaaattgtt caggatgcga gaatgcaaca ttagacaaca 7560 caggagaagg aaccttagga ggtgtagcta ataagaactg tagcttgcct cataaaaatg 7620 agagcaacaa gtggacttgt gccccaagac aaagagatgg aaaaacagat tcgctataca 7680 tagcaggagg aaaaaagttt tggacacgaa ttaaggccca attcagctgt gaaagtaaca 7740 taggacaatt agatggaatg ttgcatcagc aaatactatt gcaaaaatat caagtaatta 7800 aggtaagagc ttatacatat ggggtgatag aaatgccaga aaactatgca aaaacaagaa 7860 tcataaacag gaaaaaaaga gaactcagcc acaagaggaa gaagagaggc gttggcttgg 7920 tcattatgct agttatcatg gcaatagtag ctgccgcagg ggcttctctg ggagtcgcaa 7980 acgcgattca gcagtcttac actaaggcag ctgtccagac ccttgctaat gcaactgctg 8040 cacagcagga tgtgttagag gcaacctatg ccatggtaca gcatgtggct aaaggcgtac 8100 gaatcttgga agctcgagtg gctcgagtgg aagctatcac agatagaata atgctatacc 8160 aagaattgga ttgttggcac tatcatcaat actgtataac ctctacaaaa acagaagtag 8220 caaaatatat caattggacg aggtttaagg ataattgcac atggcagcag tgggagagag 8280 gattacaggg gtatgataca 
aacttaacaa tactgttaaa ggaatcagca gcaatgacac 8340 aactagcaga agagcaagca aggaggatac cagaagtatg ggaaagttta aaagacgtct 8400 ttgattggtc aggatggttc tcatggctaa agtatattcc tattatagta gtaggattat 8460 taggatgcat tctgataaga gctgtgatat gtgtatgtca acctcttgtg cagatataca 8520 gaactctaag taccccgaca taccaacggg tcacagtcat catggaaaca agagcagacg 8580 tcgcaggaga aaatcaggat tttggcgatg gcttagagga atcagacaac agcgaaacaa 8640 gcgaaagagt gacagtacag aaagcttgga gccgtgcctg ggagctttgg cagaactcac 8700 cctggaagga gccatggaaa aggggcctgc tgaggctgct cgtccttccg ctgacgatgg 8760 gaatctggat aaatggatgg cttggagaac accacaaaaa taaaaaaaga aagggtgact 8820 gtgagacatg ggctaaagag gactaataac aagctaggcc aaattcctgt aaatcacttg 8880 gggggttata agaaaagcaa gttcactatg acaaagcaaa atgtaaaggc caaattcctg 8940 taaatcactt ggggggttat aagaaaagca agttcactat gacaaagcaa aatgtaaccg 9000 caagtgctga cagatgtaac agctgacata tcagctgatg cttgctcatg ctgacactgt 9060 agctctgagc tgtatataag gagaagcttg ctgcttgcac ttcagagttc taggagagtc 9120 cctcctagtc tctcctctcc gaggaggtac cgagacctca aaataaagga gtgattgcct 9180 tactgccga 9189 <210> 2 <211> 8919 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 2 gtgagtgctc tgaggagctc gaaggaaaga gtcctcagcc tctcctctcc gaggagcttc 60 ggctcataat aaaggagtgc ttgcttcaac agaactgagc tggtcgtggt tattatcggg 120 gaccgaagtc ccgtgcaaca ccggggcggt tctcgcagct ggcgcccaac gtggggctcg 180 agtagcttga gaagctcgac tgagatctga atccaagagc gacatcagac agcaagaaat 240 gagagtaatg agaccgcgag ctctgctgct gtaaaaaaga ggaagtagcg ggttgccgag 300 gcaactgctc agaagaacca ggggaaaggg cttccagcaa cctcaaaaga ggaaccgaga 360 cttcggggac gcctgaagta aggtaagtga ctctgctgta cgcggggcga ggcataggag 420 atccttctat tctaggaaga gaagcgctgt tctgggaggt cttggcgacc gagaatcttg 480 ttaaataagc caggatctcg atcaggacca agacccctca ggagagggta tagacagcgt 540 ggtaagaaat ccgccgtggt gagtctagat agagacatgg tgaggcaggc ctccggaagg 600 ggaaaggagt accccgagct aaaagaatgt ctgaaaaagg catgcaaaat aaaagtaagg 660 gctggggggg agcgcctgac agaaggaaat tgtctctggt gtataaaaac actagagtgt 720 atgtatgagg attgtaggga 
ggaaccttgg accccagaaa aatgtaaaca attatggaaa 780 aagttgaagc aggtagagcc tgaggagagt agcaaagcag actataactc gttaaaagca 840 accttggcgg ggatagtctg tgtgcaaatg ggaatgcagc ccgagacact gcaggatgcg 900 atagcaacct taaacatgag agatgaagta aaaggaaagg aaaagccatc agaagaaaag 960 aagggaatat atcccatatt agtgcaggca ggaggaggaa gagcatggag agcggtagag 1020 cctgctacct ttcagcagct ccaaacagtg gcaatgcagc atggactagt atcagaagaa 1080 tttgaaaggc agctagcata ctatgccacc acatggacaa gcaaggatat cttagaagta 1140 ttagccatga tgccaggaaa tagagcgcaa aaagaactaa tacaaggaaa gttaaatgag 1200 gaagcagaga gatggagaag gcagaatcca caacctgcgg gcgggttaac cgtggatcag 1260 ataatggggg taggacaaac gaatcaggca gcggcacagg ctaatatgga tcaagcaaga 1320 caaatatgcc tacaatgggt tataacagca ataagaggag ttaggcatat ggcccataga 1380 ccaggaaatc ccatgctggt aagacaaaaa ccaaatgaga actatgaaga gtttgccgca 1440 aggttgttag aagcagtgga tgcagaaccc gttacccaac ctataaaaga atatttaaag 1500 gtaactctgt cttacacaaa tgcaaattcg gaatgtcaaa aacatatgga cagagtgttg 1560 gggcaaagag tacagcaggc ctcaatagaa gaaaaaatgc aggcatgcag ggacatcggg 1620 ggaacagcat atcagatgca gttgcttgca caagccctcc gtggcggaaa agaagatggg 1680 aaaaaatctg tagggaagtg ttataactgt ggaaggcccg gacacagagc aaaagaatgc 1740 agacaaggca ttatatgtca caactgtgga aaaagagggc atatacagaa aaactgcaaa 1800 cagaaaagaa gaaaggagca gggaaacatg aggagggggc tacgtgtggt gccgtccgca 1860 ccccctatgg agtaacgcaa gcaccactaa tagttagggt acaaataggg aatcaggaga 1920 aacaattatt atttgacaca ggggcagata aaacgatagt aagaatgcat gatggaacag 1980 ggattccaaa cggaagaata aaattacaag ggataggagg aatagtagaa ggagaaaaat 2040 ggaataaagt acccatgaca tataagggag aaacatcctg cccaagcttg gttgtgctaa 2100 gagatagccc agtagaagta ttgggaagag ataacatgga agcattcggc gtaaccctaa 2160 taatggcaaa tttagaagat aagaaaattc ccacaatacc agtagaattg aaagaaggat 2220 gtaaagggcc acatgtgccc cagtggccat taacagcaga gaaattacaa ggactaacag 2280 gaatagtaga aaaattacta caggaaggaa aattggcaga ggccccagag ggatggacgt 2340 ggaacacgcc catcttctgc ataaaaaaga agtcaggaaa atggagaatg ttaatagatt 2400 ttagggaatt aaataagcaa acagcagatt 
tagcagaagc gcagctagga ctgccacacc 2460 caggagggtt gcaaaggaaa aagaatgtaa caattctgga cataggagat gcatatttca 2520 caattccctt atacgagccc tatcagaaat atacatgctt cacactccta agtcctaaca 2580 atttgggacc atgtaaaagg tattattgga aagtattacc ccagggatgg aaattgagcc 2640 cagctgtata tcaattcacc atgcaaaggt tgttaaaagg atggatacaa cagcataaaa 2700 acatacaatt tggaatatat atggatgata tctatattgg aagtgatcta acgatagccc 2760 aacataggaa gataatagaa gaattagcct catttataga acaatttggg tttacattac 2820 cagaagataa gagacaagag ggctatccag caaaatggct aggattcgag ctacatccag 2880 aaaaatggaa atatcaaaag cataaattgc cggaattaca agagggggta ataaccctga 2940 acaaattaca gaagatagta ggggaattag tgtggagaca atccttgata ggaaagagca 3000 tccccaatat cataaaatta atggaaggag atcgcgcatt acaaagtgaa aggaaaatag 3060 aaagaataca tgtacaagaa tgggaagcat gtcaaaagaa attagatgaa atggtaggaa 3120 attattacag agaagaagaa gatatctatg gacaaataac ttggggggat aaggcaataa 3180 aatacatagt attccaaagg aaaggggaac ccctatgggt aaatgtagta catgacataa 3240 aaaatttgag tctcccacag caagtgataa aagcagcaca gaaattaacc caggaagtaa 3300 tcataagaac aggaaaaatc ccatggctgc tactaccagg aagagaagaa gactggagat 3360 tagaactgca ggtagggaac atcacgtgga tgccatcatt ttggtcatgt tatcgaggag 3420 cacccaagtg gaaaagaagg aacatagtgg cagcagtggt agatggaccg acatattata 3480 cagatggggg aaagaaaaac gcacagggaa gctttggctt catctcccca acaggagaaa 3540 agttcagaag gcatgaagat ggaactaatc aggtattaga attaagggca atagaagatc 3600 catgtaaaca aggacctgaa agcatgaaca ttgtaactga cagcaggtat gcttatgaat 3660 tcatgctccg aaactgggat gaacaggtca taagaaaccc cattcaggca agaatcatgg 3720 cagaagtgca caagaaaaag caggtaggaa tacactgggt gccagggcat aaaggaatac 3780 ctcagaatga agagatagac cagtacatat cagaagtatt cttagcacga gaaggaacag 3840 ggatatgtga aaaaaggaag gaagatgctg gatatgattt attatgcccg catgaggtaa 3900 tacttaaacc ccaagaagta aaacggatcc caatagacct aaaattaaaa ttgaaagaaa 3960 agcaatgggc catgataagt gggaaaagta gcgttgcagc aaaaggaata tttgtacaag 4020 gaggcataat agattcaggg tatcagggac aagtacaagt catcctatat aatagtaata 4080 agatagaggt caaaatacca caaggcagga aatttgccca 
attaatatta atgaacttac 4140 aacatgaaga attagaagaa tggggaaagg aaagaaaaac agaaagagga acaaaaggat 4200 ttgggtctac aggagcattt tggatagaga atattcccca agcagaggaa gaacattaca 4260 aatggcatca agatgctaga tctctgcagc tagaattcaa gatacctaga gcagcagcag 4320 aagacattat acagcactgt gaggtatgtc aagaaggcaa acccgcagcg atcacgagag 4380 ggggaaataa aagaggaata gatcattggc aggtagacta tacacattac aaagaacaca 4440 taatattagt atgggtagag actaattcag gattaatatt tgcagagaaa gtaaaaggag 4500 aatcaggaca agaatttagg atgcagacat tgaaatggta tgctttgttt caaccaaaat 4560 cagtgcaatc agataatggg acagccttca cagctgaggc tacgcagcat ctaatgaagt 4620 atttagggat tcagcacact acgggtattc cgtggaaccc ccagtcacaa agtttagtag 4680 aaagagctca tcaaacatta aaacacatgt tagaaaaatt agaaccacaa tttgtggccc 4740 tacagtctgc catcgcagcc actctagttg cgctcaatat aaaaagaaag ggtggactag 4800 gggcaagccc tatggatatt tacatatata ataaggagca acaaagacaa caagataata 4860 gtaataaatt aattcagaaa aaattttgtt attacaggat cagaaaaaga ggccatccag 4920 gagagtggaa cggcccaact gaggtactgt gggaagggga aggagccata gtagttaaag 4980 acaaagaaag tgatagatat ctagtcatcc catataaaga tgcaaaattt attccgccac 5040 cgtcggaaca gaagggatag aagaataggt ccagaattgc ctttatcttt atggacttat 5100 acagcataca gcataaataa agatcccgca tggtatacaa ccctaagact gcagcaaatg 5160 atgtggcata ggaggggaaa taaattgaca tatgtcaggg aaaatgcaca gtacgaggag 5220 tgggaaatga cctcgtatga gtggaggata agaatgagaa gggacaaaac aaaaagtcat 5280 ccaagagggc atacttcgcc atggcaatat cggagacagg atggatggaa ggatgtggga 5340 acgtggttcc tacagccagg ggactataga aaggcggatc agcagttctg gttcgcttgg 5400 agaatagtgt cgtgttcatg taaaaaggaa ggatttaaca taagagaatt tatgctaggt 5460 acccatagat gggatttgtg taagtcgtgt tgccagggtg aagtagtaaa gagaacacaa 5520 ccctacacct tgcaaaggct cacgtggctt aaattaacag aagaccatgt atttcaagta 5580 atgcccttgt ggagagctcg caaagggatt accatagact ttccctggtg cagggacaca 5640 aaaggattcc tggagccgtg gacaacgcaa gagtgttggc aaatagagta tcccttggag 5700 gatgagtgag gaaaccccag caggaagaga accgactgca gaggaaatat ttgagcaaga 5760 agcagaaagt tggaagagaa caagcgtgcg agtcccaaat gacatattac 
aaagatggct 5820 agcaatgctt aggcaaagag gaaatagaaa gaaagtgctt agggaaatgc aaaaatgggc 5880 atggaggaat cccacggcgc gggtgattcg gccgtgtgga tgtcggctat gtaaccccgg 5940 ctgggggagt aattaatcat aataaagcaa attgtaacat gctgtgtcag gtgtcttgca 6000 ggaatggcgg agataagaaa agaagcaaag gagccactaa tccagggtaa gtataaaaaa 6060 caggtaagta gaataactat agttatatta ctaacagtaa gagcagcact aggagcagaa 6120 tacataacca taatatcaga cccatatggg ttctctcccg tgagaaatgt gtcaggagta 6180 cctgtaactt gtgtgacaaa agaatttagt aagtggggat gtcagccaat aggagcctac 6240 ccagacccag acttagaata cagaaatata agtaaagaaa tattagagga agtatatcaa 6300 caagactggc cgtggaatac ttatcattgg ccattatggc aaatggataa tgtagtacaa 6360 tgggcaaggc aaaatttaca ggataaccgc aaggaaaaaa gggacctggc agacctatta 6420 gcaggaaaaa taaggggaag attctgtgta ccctacccat ttgcgctcct ggagtgcatg 6480 gaatggtgct ggtgggttaa gaacactaat gcaggggggt atggagaagc agacataaga 6540 ataaattgct caagggcaag agcagtgagc tgcacaagtg aaatgccctt agcatcccta 6600 cagagggtat attgggaaaa ggaggaacga aaaaacatgg agaaaatgac catcaaacct 6660 tgcaataaaa atttggaatg caagaacaga aggggatgcg cagaagggta tccagtacct 6720 cccaaggcag agttattccc tccagcgttt caggatttac agccaaaagg gtacgcatat 6780 ggggcactta gagggaacag caaatttcca caaagagtgt cgctaagaac atgggtgaaa 6840 atagctaacc tgacaggatg ggaaaaagga aagccagcag aatggtggaa taccagccaa 6900 caggttcatt ggtttgatac cacgccacaa tatcatttag gatatgtatt atcccgagcg 6960 cctgagaaca ggagttgtaa tttcacaggg gaaatacgaa tagggcaaca tcagtttgag 7020 tataattaca ccctgacaaa gaattgcaca aaggagaagt ggaaagagta ccccatgtgg 7080 catgtctgga ggcatttaga tcaaaatgag cacttatcta gcatatgttt caaaagaccg 7140 agaagaaatg caacacaaat agggaacagt acactgcaag ggcaatgtaa tagaagtaat 7200 tggacaggat gccactgcaa tgagacaggg ataaacacaa catggagaat aaatggcaca 7260 aagggagctt atctcttaaa tagcactaat ggaaacatca tggtcttgtt atgctggaac 7320 acaacagtgg caggggtata tgagagtcag ctaaagtgga atgagagtct taaagacgga 7380 gactatgggc tctgttttaa ttcaacaaac aggaattgta ctagaaatgg agctcggcac 7440 tatgtaaaca agagagtgat aaaaaacgac acagcagatc ataattgtga tagcagcata 
7500 tcagcaatag atggaatggt acatcaacaa atattactgc aaaggtatca agtaattaga 7560 gtaagagctt acacatacgg agtgattgat atgccagaca attatgagac cctaccagga 7620 aggagaagga gagatctcgc aaaggccagg aaaaagaggg gcgtgggcct ggtcatcatg 7680 ttagctatca tggccatagt ggctgctgca ggagcatctc tgggagtcgc gaacgcgatt 7740 cagcagtcct acaccaggga cgctgtccag actcttgcta acgcgactgc tgtgcaacag 7800 caggtgttag aggcgtccta tgccatgata cagcatgtgg ctaagggaat acgcatcctt 7860 gaagcacgcg tggcgagaat ggaagttatg atggatagaa tgatgttata tcaggaagta 7920 gactgctggc attatcacca atattgtgta acctctacaa gagcagacat agtgaattac 7980 attaattgga caaggtttaa agataattgc acatggcaag agtgggaaag ggagataagt 8040 gcgcatgaag gaaacatcac tatattactc aaagaatcag caaggataac acaattagca 8100 caacaaaagg tacaaagaat accagatgtg tggacagcac taagggagtc actaggatgg 8160 acacaatggc tggcttggat aaaatacctt cccataatag tagtagggat attaggatgc 8220 ataatcataa gaataatgtt gtgtgtagta caaccagttc ttcagattta cagaaccttg 8280 actcagacca ggtatcaaca agtcaacttg gtgatggaga cccgggtgca actagaagaa 8340 gaagaagaag aagacggaag ggatggtgga gatggctcag agagatgcag cgatcccgac 8400 aacaaaggaa ttatgaacgc ctggaggaga gcttgggtga cttggagaaa ctcaccttgg 8460 cagaacacat ggaagaatgt ggtggtggcg ccgttggtga ttccgctgac aatcagaatt 8520 tggctccttg gagagaatgg agagaacccc taaaagaaaa ataaaaaggg tggactgtga 8580 ggactgtgag gcctaggagc gagatagaaa cttataggcc tctcttcccg gaaagctaac 8640 tcactgtgag aggaatagca agtcacagtg acactgctaa ttgtacccgc aaccctgaga 8700 tcatgcaaac cacaatcctg agattatgct gacatgtgta acagctgatg cctcagctga 8760 tgcttgctca tgctgacaat gtaactagga gctctatata aacagagccc tagagcttgc 8820 tacttcagag tgctctgagg agctcgaagg aaagagtcct cagcctctcc tctccgagga 8880 gcttcggctc ataataaagg agtgcttgct tcaacagaa 8919 <210> 3 <211> 720 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 3 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa 
agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 <210> 4 <211> 720 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 4 atggtgaggc aggcctccgg aaggggaaag gagtaccccg agctaaaaga atgtctgaaa 60 aaggcatgca aaataaaagt aagggctggg ggggagcgcc tgacagaagg aaattgtctc 120 tggtgtataa aaacactaga gtgtatgtat gaggattgta gggaggaacc ttggacccca 180 gaaaaatgta aacaattatg gaaaaagttg aagcaggtag agcctgagga gagtagcaaa 240 gcagactata actcgttaaa agcaaccttg gcggggatag tctgtgtgca aatgggaatg 300 cagcccgaga cactgcagga tgcgatagca accttaaaca tgagagatga agtaaaagga 360 aaggaaaagc catcagaaga aaagaaggga atatatccca tattagtgca ggcaggagga 420 ggaagagcat ggagagcggt agagcctgct acctttcagc agctccaaac agtggcaatg 480 cagcatggac tagtatcaga agaatttgaa aggcagctag catactatgc caccacatgg 540 acaagcaagg atatcttaga agtattagcc atgatgccag gaaatagagc gcaaaaagaa 600 ctaatacaag gaaagttaaa tgaggaagca gagagatgga gaaggcagaa tccacaacct 660 gcgggcgggt taaccgtgga tcagataatg ggggtaggac aaacgaatca ggcagcggca 720 <210> 5 <211> 1347 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 5 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag 
atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 780 aggcaaatat gcctgcaatg ggtaataaat gcattaagag cagtaagaca tatggcgcac 840 aggccaggga atccaatgct agtaaagcaa aaaacgaatg agccatatga agattttgca 900 gcaagactgc tagaagcaat agatgcagag ccagttacac agcctataaa agattatcta 960 aagctaacac tatcttatac aaatgcatca gcagattgtc agaagcaaat ggatagaaca 1020 ctaggacaaa gagtacaaca agctagtgta gaagaaaaaa tgcaagcatg tagagatgtg 1080 ggatcagaag ggttcaaaat gcaattgtta gcacaagcat taaggccagg aaaaggaaaa 1140 gggaatggac agccacaaag gtgttacaac tgtggaaaac cgggacatca agcaaggcaa 1200 tgtagacaag gaatcatatg tcacaactgt ggaaagagag gacatatgca aaaagaatgc 1260 agaggaaaga gagacataag gggaaaacag cagggaaacg ggaggagggg gatacgtgtg 1320 gtgccgtccg ctcctcctat ggaataa 1347 <210> 6 <211> 1299 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 6 atggtgaggc aggcctccgg aaggggaaag gagtaccccg agctaaaaga atgtctgaaa 60 aaggcatgca aaataaaagt aagggctggg ggggagcgcc tgacagaagg aaattgtctc 120 tggtgtataa aaacactaga gtgtatgtat gaggattgta gggaggaacc ttggacccca 180 gaaaaatgta aacaattatg gaaaaagttg aagcaggtag agcctgagga gagtagcaaa 240 gcagactata actcgttaaa agcaaccttg gcggggatag tctgtgtgca aatgggaatg 300 cagcccgaga cactgcagga tgcgatagca accttaaaca tgagagatga agtaaaagga 360 aaggaaaagc catcagaaga aaagaaggga atatatccca tattagtgca ggcaggagga 420 ggaagagcat ggagagcggt agagcctgct acctttcagc agctccaaac agtggcaatg 480 cagcatggac tagtatcaga agaatttgaa aggcagctag catactatgc caccacatgg 540 acaagcaagg atatcttaga agtattagcc atgatgccag gaaatagagc gcaaaaagaa 600 ctaatacaag gaaagttaaa tgaggaagca 
gagagatgga gaaggcagaa tccacaacct 660 gcgggcgggt taaccgtgga tcagataatg ggggtaggac aaacgaatca ggcagcggca 720 caggctaata tggatcaagc aagacaaata tgcctacaat gggttataac agcaataaga 780 ggagttaggc atatggccca tagaccagga aatcccatgc tggtaagaca aaaaccaaat 840 gagaactatg aagagtttgc cgcaaggttg ttagaagcag tggatgcaga acccgttacc 900 caacctataa aagaatattt aaaggtaact ctgtcttaca caaatgcaaa ttcggaatgt 960 caaaaacata tggacagagt gttggggcaa agagtacagc aggcctcaat agaagaaaaa 1020 atgcaggcat gcagggacat cgggggaaca gcatatcaga tgcagttgct tgcacaagcc 1080 ctccgtggcg gaaaagaaga tgggaaaaaa tctgtaggga agtgttataa ctgtggaagg 1140 cccggacaca gagcaaaaga atgcagacaa ggcattatat gtcacaactg tggaaaaaga 1200 gggcatatac agaaaaactg caaacagaaa agaagaaagg agcagggaaa catgaggagg 1260 gggctacgtg tggtgccgtc cgcaccccct atggagtaa 1299 <210> 7 <211> 511 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 7 gagttctagg agagtccctc ctagtctctc ctctccgagg aggtaccgag acctcaaaat 60 aaaggagtga ttgccttact gccgagtgga gagtgattac tgagcggccg gtgtatcggg 120 agtcgtccct taatctgtgc aataccagag cggctctcgc agctggcgcc caacgtgggg 180 cccgaggaga agaaaagaaa gcggccctga gaactcggct tctgaaaaag aggaagagga 240 caagttgcta tagcaacaag agagaagaag tagagcaaag gtccagtggc tcggaaaaag 300 aggaactgaa acttcgggga cgcctgaagg agtaaggtaa gtgactctgc tgtacgcggg 360 gcgaggcaga ggtttccttc taaattgaaa gagaagtgtt gctgcgagag gtcttggtgg 420 tcgagaatcc tgtacaaaaa aaaggaggga tctcggtcag gaccaggacc cctgggagta 480 atacaacagc aacaccgtaa gaaaatccgc c 511 <210> 8 <211> 576 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 8 gtgagtgctc tgaggagctc gaaggaaaga gtcctcagcc tctcctctcc gaggagcttc 60 ggctcataat aaaggagtgc ttgcttcaac agaactgagc tggtcgtggt tattatcggg 120 gaccgaagtc ccgtgcaaca ccggggcggt tctcgcagct ggcgcccaac gtggggctcg 180 agtagcttga gaagctcgac tgagatctga atccaagagc gacatcagac agcaagaaat 240 gagagtaatg agaccgcgag ctctgctgct gtaaaaaaga ggaagtagcg ggttgccgag 300 gcaactgctc agaagaacca ggggaaaggg cttccagcaa cctcaaaaga ggaaccgaga 360 cttcggggac gcctgaagta aggtaagtga 
ctctgctgta cgcggggcga ggcataggag 420 atccttctat tctaggaaga gaagcgctgt tctgggaggt cttggcgacc gagaatcttg 480 ttaaataagc caggatctcg atcaggacca agacccctca ggagagggta tagacagcgt 540 ggtaagaaat ccgccgtggt gagtctagat agagac 576 <210> 9 <211> 3318 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 9 atgtcacaac tgtggaaaga gaggacatat gcaaaaagaa tgcagaggaa agagagacat 60 aaggggaaaa cagcagggaa acgggaggag ggggatacgt gtggtgccgt ccgctcctcc 120 tatggaataa cttcagcacc acctatggtt caggtccgca taggttccca gcagaggaac 180 ttgttatttg ataccggggc ggaccgaact atagttagat ggcatgaggg ctcgggaaac 240 ccagccggaa ggataaaact gcaaggaata ggaggaatag tagaaggaga aaaatggaat 300 aatgtagaat tagaatataa aggagaaaca agaaagggaa caatagtagt gttaccacaa 360 agtccagtag aagtattagg acgagataac atggcccgat ttggaataaa gataataatg 420 gcaaatttag aggaaaaaag aatcccaatt acaaaagtaa aattgaaaga gggatgtacg 480 ggtccacatg tcccacaatg gccattaaca gaagagaaat taaaaggtct aacagaaatc 540 atagataaat tagtggaaga aggaaaacta ggaaaggcac ccccacattg gacatgtaat 600 actccaatct tttgcataaa aaagaaatca gggaagtgga gaatgttaat agatttcaga 660 gaattgaaca aacagacaga agatttaaca gaagcgcagt taggactccc gcatccggga 720 ggactacaaa agaaaaaaca tgttacaata ttggacatag gagatgcata ttttactata 780 cccctatatg aaccatatcg agagtacaca tgttttactc tattaagtcc taataatcta 840 ggaccatgta aaagatacta ttggaaagtg ctgccacaag gttggaaatt gagtccatct 900 gtatatcaat ttactatgca ggagatctta gaggattgga tacagcagca tccagaaatt 960 caatttggca tatatatgga tgatatttac ataggaagtg atttagaaat taaaaagcat 1020 agagaaatag tgaaagattt agccaattat attgcccaat atggattcac tctgccagaa 1080 gagaagagac aaaagggata tccagcaaaa tggctaggat ttgaactaca cccgcagacc 1140 tggaaatttc agaagcatac attacctgaa ttaacaaagg gaacaataac attaaataaa 1200 ttacagaaat tagtaggaga attagtatgg agacaatcca taattgggaa aagcattcct 1260 aacattctga aattaatgga aggagataga gaattacaaa gtgaaagaaa aattgaagaa 1320 gtacatgtga aagaatggga agcatgtagg aaaaaattag aagaaatgga aggaaattat 1380 tataataaag acaaagatgt ctatggacaa ttggcttggg gagacaaagc tatagaatat 1440 atagtgtatc 
aggagaaagg gaaaccatta tgggtaaatg tggttcacaa tataaagaac 1500 ctaagcatcc cgcaacaggt tattaaagca gcgcaaaaat taacccaaga agtcatcatt 1560 aggacaggaa aaataccatg gatattgttg ccagggaaag aagaagattg gagactagaa 1620 ttgcaattag ggaacatcac atggatgcca aaattttggt cctgttatcg aggacataca 1680 agatggagaa aaagaaatat aatagaagaa gtagtagaag ggcctacata ttatacagat 1740 ggaggaaaaa agaataaagt aggaagtcta gggttcatag tatcaacagg ggaaaaattt 1800 agaaagcatg aagagggcac aaaccagcaa ctagaattaa gagccataga ggaagctcta 1860 aaacaagggc ctcaaacaat gaatttagta acagatagta gatatgcatt tgaattttta 1920 ttaagaaatt gggatgaaga agtaataaag aatccaattc aagcaagaat tatggaaatt 1980 gcccacaaga aagataggat aggagtgcat tgggtgccag gacataaagg gattccccaa 2040 aatgaagaaa tagacaaata tatttcggaa atatttcttg caaaagaagg agaaggaatt 2100 ctcccaaaaa gagaagagga tgcagggtat gatttaatat gcccagaaga ggttaccata 2160 gagccaggac aagtgaaatg catccccata gagctaagat taaatttaaa gaaatcacaa 2220 tgggctatga ttgctacaaa aagcagcatg gctgccaaag gagtgttcac acaaggagga 2280 atcatagact caggatatca gggacaaata caggtaataa tgtataatag caataaaata 2340 gcagtagtca taccccaagg gagaaaattt gcacaattaa tattaatgga taaaaagcat 2400 ggaaaattgg aaccctgggg ggaaagcaga aaaacagaaa ggggagaaaa aggatttggg 2460 tctacaggaa tgtattggat agaaaatatt cctctggcag aggaagacca cacaaaatgg 2520 catcaagatg cccgatcatt gcatctagaa tttgaaattc caagaacagc agcagaagac 2580 atagtaaatc aatgtgaaat atgcaaagaa gcgaggacac ctgcagtaat tagaggcgga 2640 aacaaaaggg gggtaaatca ttggcaagtg gattataccc attatgaaaa tatcatacta 2700 ttagtatggg tagaaacaaa ttcaggacta atatatgcag aaaaagtaaa aggagaatca 2760 gggcaagaat tcagaataaa agtgatgcat tggtatgcat tatttggtcc agagtcattg 2820 cagtcagaca atggacctgc atttgcagca gagcccacac agctgttaat gcaataccta 2880 ggagtaaaac acacaacagg cataccttgg aatccacagt ctcaggctat agtagaaagg 2940 gcacatcaac tattgaaaag cactttaaag aagttccagc cacaatttgt cgctgtagaa 3000 tcagccatag cagcagccct agtcgccata aatataaaaa gaaagggtgg gctggggaca 3060 agccctatgg atatttttat atataataaa gaacagaaaa gaataaataa taaatataat 3120 aaaaattctc aaaaaattca 
attctgttat tacagaataa ggaaaagagg acatcaggag 3180 agtggaaagg accaacccag gtactgtgga aaggggaagg agccaattgt ggtaaaggat 3240 atagaaagtg aaaagtattt agtaatacct tacaaagatg caaaattcat cccgccacca 3300 acaaaagaaa aggaataa 3318 <210> 10 <211> 3324 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 10 atgcagacaa ggcattatat gtcacaactg tggaaaaaga gggcatatac agaaaaactg 60 caaacagaaa agaagaaagg agcagggaaa catgaggagg gggctacgtg tggtgccgtc 120 cgcaccccct atggagtaac gcaagcacca ctaatagtta gggtacaaat agggaatcag 180 gagaaacaat tattatttga cacaggggca gataaaacga tagtaagaat gcatgatgga 240 acagggattc caaacggaag aataaaatta caagggatag gaggaatagt agaaggagaa 300 aaatggaata aagtacccat gacatataag ggagaaacat cctgcccaag cttggttgtg 360 ctaagagata gcccagtaga agtattggga agagataaca tggaagcatt cggcgtaacc 420 ctaataatgg caaatttaga agataagaaa attcccacaa taccagtaga attgaaagaa 480 ggatgtaaag ggccacatgt gccccagtgg ccattaacag cagagaaatt acaaggacta 540 acaggaatag tagaaaaatt actacaggaa ggaaaattgg cagaggcccc agagggatgg 600 acgtggaaca cgcccatctt ctgcataaaa aagaagtcag gaaaatggag aatgttaata 660 gattttaggg aattaaataa gcaaacagca gatttagcag aagcgcagct aggactgcca 720 cacccaggag ggttgcaaag gaaaaagaat gtaacaattc tggacatagg agatgcatat 780 ttcacaattc ccttatacga gccctatcag aaatatacat gcttcacact cctaagtcct 840 aacaatttgg gaccatgtaa aaggtattat tggaaagtat taccccaggg atggaaattg 900 agcccagctg tatatcaatt caccatgcaa aggttgttaa aaggatggat acaacagcat 960 aaaaacatac aatttggaat atatatggat gatatctata ttggaagtga tctaacgata 1020 gcccaacata ggaagataat agaagaatta gcctcattta tagaacaatt tgggtttaca 1080 ttaccagaag ataagagaca agagggctat ccagcaaaat ggctaggatt cgagctacat 1140 ccagaaaaat ggaaatatca aaagcataaa ttgccggaat tacaagaggg ggtaataacc 1200 ctgaacaaat tacagaagat agtaggggaa ttagtgtgga gacaatcctt gataggaaag 1260 agcatcccca atatcataaa attaatggaa ggagatcgcg cattacaaag tgaaaggaaa 1320 atagaaagaa tacatgtaca agaatgggaa gcatgtcaaa agaaattaga tgaaatggta 1380 ggaaattatt acagagaaga agaagatatc tatggacaaa taacttgggg ggataaggca 1440 ataaaataca 
tagtattcca aaggaaaggg gaacccctat gggtaaatgt agtacatgac 1500 ataaaaaatt tgagtctccc acagcaagtg ataaaagcag cacagaaatt aacccaggaa 1560 gtaatcataa gaacaggaaa aatcccatgg ctgctactac caggaagaga agaagactgg 1620 agattagaac tgcaggtagg gaacatcacg tggatgccat cattttggtc atgttatcga 1680 ggagcaccca agtggaaaag aaggaacata gtggcagcag tggtagatgg accgacatat 1740 tatacagatg ggggaaagaa aaacgcacag ggaagctttg gcttcatctc cccaacagga 1800 gaaaagttca gaaggcatga agatggaact aatcaggtat tagaattaag ggcaatagaa 1860 gatccatgta aacaaggacc tgaaagcatg aacattgtaa ctgacagcag gtatgcttat 1920 gaattcatgc tccgaaactg ggatgaacag gtcataagaa accccattca ggcaagaatc 1980 atggcagaag tgcacaagaa aaagcaggta ggaatacact gggtgccagg gcataaagga 2040 atacctcaga atgaagagat agaccagtac atatcagaag tattcttagc acgagaagga 2100 acagggatat gtgaaaaaag gaaggaagat gctggatatg atttattatg cccgcatgag 2160 gtaatactta aaccccaaga agtaaaacgg atcccaatag acctaaaatt aaaattgaaa 2220 gaaaagcaat gggccatgat aagtgggaaa agtagcgttg cagcaaaagg aatatttgta 2280 caaggaggca taatagattc agggtatcag ggacaagtac aagtcatcct atataatagt 2340 aataagatag aggtcaaaat accacaaggc aggaaatttg cccaattaat attaatgaac 2400 ttacaacatg aagaattaga agaatgggga aaggaaagaa aaacagaaag aggaacaaaa 2460 ggatttgggt ctacaggagc attttggata gagaatattc cccaagcaga ggaagaacat 2520 tacaaatggc atcaagatgc tagatctctg cagctagaat tcaagatacc tagagcagca 2580 gcagaagaca ttatacagca ctgtgaggta tgtcaagaag gcaaacccgc agcgatcacg 2640 agagggggaa ataaaagagg aatagatcat tggcaggtag actatacaca ttacaaagaa 2700 cacataatat tagtatgggt agagactaat tcaggattaa tatttgcaga gaaagtaaaa 2760 ggagaatcag gacaagaatt taggatgcag acattgaaat ggtatgcttt gtttcaacca 2820 aaatcagtgc aatcagataa tgggacagcc ttcacagctg aggctacgca gcatctaatg 2880 aagtatttag ggattcagca cactacgggt attccgtgga acccccagtc acaaagttta 2940 gtagaaagag ctcatcaaac attaaaacac atgttagaaa aattagaacc acaatttgtg 3000 gccctacagt ctgccatcgc agccactcta gttgcgctca atataaaaag aaagggtgga 3060 ctaggggcaa gccctatgga tatttacata tataataagg agcaacaaag acaacaagat 3120 aatagtaata aattaattca 
gaaaaaattt tgttattaca ggatcagaaa aagaggccat 3180 ccaggagagt ggaacggccc aactgaggta ctgtgggaag gggaaggagc catagtagtt 3240 aaagacaaag aaagtgatag atatctagtc atcccatata aagatgcaaa atttattccg 3300 ccaccgtcgg aacagaaggg atag 3324 <210> 11 <211> 402 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 11 atggatgctg gggccagata catgcgctta actgggaagg aaaactgggt tgaagtaacc 60 atggacggag agaaggaaag gaaaagagaa ggtttcactg cgggacagca agatatacag 120 aactctaagt accccgacat accaacgggt cacagtcatc atggaaacaa gagcagacgt 180 cgcaggagaa aatcaggatt ttggcgatgg cttagaggaa tcagacaaca gcgaaacaag 240 cgaaagagtg acagtacaga aagcttggag ccgtgcctgg gagctttggc agaactcacc 300 ctggaaggag ccatggaaaa ggggcctgct gaggctgctc gtccttccgc tgacgatggg 360 aatctggata aatggatggc ttggagaaca ccacaaaaat aa 402 <210> 12 <211> 321 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 12 atggcggaga taagaaaaga agcaaaggag ccactaatcc aggaccaggt atcaacaagt 60 caacttggtg atggagaccc gggtgcaact agaagaagaa gaagaagaag acggaaggga 120 tggtggagat ggctcagaga gatgcagcga tcccgacaac aaaggaatta tgaacgcctg 180 gaggagagct tgggtgactt ggagaaactc accttggcag aacacatgga agaatgtggt 240 ggtggcgccg ttggtgattc cgctgacaat cagaatttgg ctccttggag agaatggaga 300 gaacccctaa aagaaaaata a 321 <210> 13 <211> 690 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 13 atgcaaaatt catcccgcca ccaacaaaag aaaaggaata aaaaacctgg accagaatta 60 cccttagcac tatggataca tatagcagaa agcattaatg gggatagctc atggtacata 120 acaatgagac tgcaacagat gatgtgggga aaaagaggaa ataagttaca atataagaat 180 gaagacaggg aatatgaaaa ttgggaaatt acatcatggg gatggaaaat gcacctaagg 240 agagtgaaac aatggataca agacaacagg agaggaagcc catggcagta caaagtagga 300 ggaacatgga aaagtatagg agtgtggttc ctgcaagcag gagattacag aaaggtagac 360 aggcacttct ggtgggcatg gaggatactg atatgttcct gcaggaaaga aaagtttgat 420 ataagagaat ttatgagagg aagacataga tgggatttgt gcaaatcctg tgctcaagga 480 gaagtagtaa agcatactag aacaaaaagt ctggaaagac tagtactgct acagatggta 540 gaacagcatg tgtttcaagt attgccattg tggagagcca ggagaagtag 
tacaacagat 600 ttcccatggt gcagggacac aacgggatac acgcatgcgt ggtctgtcca ggagtgctgg 660 ttgatggaat atctcttaga ggatgagtga 690 <210> 14 <211> 490 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 14 atgcaaaatt tattccgcca ccgtcggaac agaagggata gaagaatagg caaaaagtca 60 tccaagaggg catacttcgc catggcaata tcggagacag gatggatgga aggatgtggg 120 aacgtggttc ctacagccag gggactatag aaaggcggat cagcagttct ggttcgcttg 180 gagaatagtg tcgtgttcat gtaaaaagga aggatttaac ataagagaat ttatgctagg 240 tacccataga tgggatttgt gtaagtcgtg ttgccagggt gaagtagtaa agagaacaca 300 accctacacc ttgcaaaggc tcacgtggct taaattaaca gaagaccatg tatttcaagt 360 aatgcccttg tggagagctc gcaaagggat taccatagac tttccctggt gcagggacac 420 aaaaggattc ctggagccgt ggacaacgca agagtgttgg caaatagagt atcccttgga 480 ggatgagtga 490 <210> 15 <211> 720 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 15 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 <210> 16 <211> 591 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 16 gctgtagact ctgtaatgtt ccaacaaatg caaacagtag caatgcagca tggcctcgtg 60 tccgaggatt ttgaaagaca gttagcatat tatgctacta cctggacaag taaagacata 120 ctagaagtat tggccatgat gcctgggaat agggctcaga aagaacttat tcaagggaaa 180 ttgaatgaag 
aagcagacag gtggagaagg aacaatccac caggaggatt aacagtggat 240 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 300 agacaaatat gcctacaatg ggtaataaac gccttaagag cagtaaggca tatggctcat 360 aggccaggga atccaatgct agtaaagcaa aaaacaaatg agccatatga agaatttgca 420 gcaagactgc tagaagcaat agatgcagaa gcggttacac agcccataaa agagtatcta 480 aagctaacat tatcctatac aaatgcagcc tcagattgtc aaaagcaaat ggagagagtg 540 ctaggacaaa gagtacaaca ggctagtgta gaaaaaaaaa tgcaagcatg t 591 <210> 17 <211> 1347 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 17 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 780 aggcaaatat gcctgcaatg ggtaataaat gcattaagag cagtaagaca tatggcgcac 840 aggccaggga atccaatgct agtaaagcaa aaaacgaatg agccatatga agattttgca 900 gcaagactgc tagaagcaat agatgcagag ccagttacac agcctataaa agattatcta 960 aagctaacac tatcttatac aaatgcatca gcagattgtc agaagcaaat ggatagaaca 1020 ctaggacaaa gagtacaaca agctagtgta gaagaaaaaa tgcaagcatg tagagatgtg 1080 ggatcagaag ggttcaaaat gcaattgtta gcacaagcat taaggccagg aaaaggaaaa 1140 gggaatggac agccacaaag gtgttacaac tgtggaaaac cgggacatca agcaaggcaa 1200 tgtagacaag gaatcatatg tcacaactgt ggaaagagag 
gacatatgca aaaagaatgc 1260 agaggaaaga gagacataag gggaaaacag cagggaaacg ggaggagggg gatacgtgtg 1320 gtgccgtccg ctcctcctat ggaataa 1347 <210> 18 <211> 591 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 18 gctgtagact ctgtaatgtt ccaacaaatg caaacagtag caatgcagca tggcctcgtg 60 tccgaggatt ttgaaagaca gttagcatat tatgctacta cctggacaag taaagacata 120 ctagaagtat tggccatgat gcctgggaat agggctcaga aagaacttat tcaagggaaa 180 ttgaatgaag aagcagacag gtggagaagg aacaatccac caggaggatt aacagtggat 240 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 300 agacaaatat gcctacaatg ggtaataaac gccttaagag cagtaaggca tatggctcat 360 aggccaggga atccaatgct agtaaagcaa aaaacaaatg agccatatga agaatttgca 420 gcaagactgc tagaagcaat agatgcagaa gcggttacac agcccataaa agagtatcta 480 aagctaacat tatcctatac aaatgcagcc tcagattgtc aaaagcaaat ggagagagtg 540 ctaggacaaa gagtacaaca ggctagtgta gaaaaaaaaa tgcaagcatg t 591 <210> 19 <211> 720 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 19 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 <210> 20 <211> 662 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 20 tcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg 
gcctcgtgtc cgaggatttt gaaaggcagt tagtatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaatag 180 agctcaaaaa gagttaattc aagggaaatt gaatgaggaa gcagaaaggt ggagaagaaa 240 taatccacca cctcaagcag gcggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcaa gcagcggcac aggctaacat ggatcaggca agacaaatat gtctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aaaacaaatg agtcatatga agattttgcc gcaagactgc tagaagcaat 480 agatgcagaa ccagttacac agcaaataaa agaatattta aagttaacat tatcttacac 540 aaatgcatcc tcagactgtc agaaacagat ggatagagta ctaggacaga gagtgcaaca 600 agctagtgtg gaagaaaaaa tgcaagcatg cagagatgtg ggatcagaag gattcagaat 660 gc 662 <210> 21 <211> 662 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 21 gcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tagcatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaatag 180 agctcaaaaa gagttaattc aagggaaatt gaatgaggaa gcagaaaggt ggagaaggaa 240 taatccacca cctcaagcag gcggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcaa gcagcggcac aggctaacat ggatcaggca agacaaatat gcctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aagacaaatg agtcatatga agattttgcc gcaagactgc tagaagcaat 480 agatgcagaa ccagttacac agcaaataaa agaatattta aagttaacat tatcttacac 540 aaatgcatcc tcagactgtc aaaaacaaat ggatagaata ctaggacaga gagtgcaaca 600 agctagtgtg gaagaaaaaa tgcaagcatg cagagatgtg ggatcagaag ggttcagaat 660 gc 662 <210> 22 <211> 651 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 22 gcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tggcatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaacag 180 agctcaaaaa gagttaattc aggggaaatt gaataaggaa gcagaaaggt ggagaaggaa 240 taatccacca cctcaagcac aaggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcag gcagcggcac 
aggctaacat ggatcaggca agacaaatat gcctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aagacaaatg agtcatatga agattttgcc gcaagactgc tagaagcaat 480 agatgcagaa ccagttacac agcaaataaa agaatattta aagttaacat tatcttacac 540 aaatgcatcc tcagactgtc agaaacaaat ggatagagta ctaggacaga gagtgcaaca 600 agctagtgtg gaagaaaaaa tgcaagcatg cagagatgtg ggatcagaag g 651 <210> 23 <211> 520 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 23 gcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tagcatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaatag 180 agctcaaaaa gagttaattc aagggaaatt gaatgaggaa gcagaaaggt ggagaaggaa 240 taatccacca cctcaagcag gcggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcaa gcagcggcac aggctaacat ggatcaggca agacaaatat gcctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aagacaaatg agtcatatga agattttgcc gcaagactgc tagaagcaat 480 agatgcagaa ccagttacac agcaaataaa gaatatttaa 520 <210> 24 <211> 663 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 24 gcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tagcatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaatag 180 agctcaaaaa gagttaattc aagggaaatt gaatgaggaa gcagaaaggt ggagaaggaa 240 taatccacca cctcaagcag gcggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcaa gcagcggcac aggctaacat ggatcaggca agacaaatat gcctgcaatg 360 ggtaataaca gcgactaaga gcagtgagac atatggctca caaaccaggg aatccaatgc 420 tagtaaagca aaagacaaat gagtcatatg aaaaattttc agcaagactc ctagaagcaa 480 tagatgcaga accagttaca cagcctataa aagaatattt aaagttaaca ttatcttaca 540 caaatgcatc ctcagactgt caaaaacaaa tggatagagt actaggacag agagtgcaac 600 aagctagtgt ggaagaaaaa atgcaagcat gcagagatgt gggatcagaa ggattcagaa 660 tgc 663 <210> 25 <211> 1347 <212> DNA <213> Caprine 
arthritis-encephalitis virus <400> 25 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 780 aggcaaatat gcctgcaatg ggtaataaat gcattaagag cagtaagaca tatggcgcac 840 aggccaggga atccaatgct agtaaagcaa aaaacgaatg agccatatga agattttgca 900 gcaagactgc tagaagcaat agatgcagag ccagttacac agcctataaa agattatcta 960 aagctaacac tatcttatac aaatgcatca gcagattgtc agaagcaaat ggatagaaca 1020 ctaggacaaa gagtacaaca agctagtgta gaagaaaaaa tgcaagcatg tagagatgtg 1080 ggatcagaag ggttcaaaat gcaattgtta gcacaagcat taaggccagg aaaaggaaaa 1140 gggaatggac agccacaaag gtgttacaac tgtggaaaac cgggacatca agcaaggcaa 1200 tgtagacaag gaatcatatg tcacaactgt ggaaagagag gacatatgca aaaagaatgc 1260 agaggaaaga gagacataag gggaaaacag cagggaaacg ggaggagggg gatacgtgtg 1320 gtgccgtccg ctcctcctat ggaataa 1347 <210> 26 <211> 662 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 26 tcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tagtatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaatag 180 agctcaaaaa gagttaattc aagggaaatt gaatgaggaa gcagaaaggt ggagaagaaa 240 taatccacca cctcaagcag 
gcggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcaa gcagcggcac aggctaacat ggatcaggca agacaaatat gtctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aaaacaaatg agtcatatga agattttgcc gcaagactgc tagaagcaat 480 agatgcagaa ccagttacac agcaaataaa agaatattta aagttaacat tatcttacac 540 aaatgcatcc tcagactgtc agaaacagat ggatagagta ctaggacaga gagtgcaaca 600 agctagtgtg gaagaaaaaa tgcaagcatg cagagatgtg ggatcagaag gattcagaat 660 gc 662 <210> 27 <211> 662 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 27 gcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tagcatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaatag 180 agctcaaaaa gagttaattc aagggaaatt gaatgaggaa gcagaaaggt ggagaaggaa 240 taatccacca cctcaagcag gcggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcaa gcagcggcac aggctaacat ggatcaggca agacaaatat gcctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aagacaaatg agtcatatga agattttgcc gcaagactgc tagaagcaat 480 agatgcagaa ccagttacac agcaaataaa agaatattta aagttaacat tatcttacac 540 aaatgcatcc tcagactgtc aaaaacaaat ggatagaata ctaggacaga gagtgcaaca 600 agctagtgtg gaagaaaaaa tgcaagcatg cagagatgtg ggatcagaag ggttcagaat 660 gc 662 <210> 28 <211> 651 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 28 gcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tggcatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaacag 180 agctcaaaaa gagttaattc aggggaaatt gaataaggaa gcagaaaggt ggagaaggaa 240 taatccacca cctcaagcac aaggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcag gcagcggcac aggctaacat ggatcaggca agacaaatat gcctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aagacaaatg agtcatatga agattttgcc gcaagactgc tagaagcaat 480 agatgcagaa ccagttacac 
agcaaataaa agaatattta aagttaacat tatcttacac 540 aaatgcatcc tcagactgtc agaaacaaat ggatagagta ctaggacaga gagtgcaaca 600 agctagtgtg gaagaaaaaa tgcaagcatg cagagatgtg ggatcagaag g 651 <210> 29 <211> 520 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 29 gcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tagcatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaatag 180 agctcaaaaa gagttaattc aagggaaatt gaatgaggaa gcagaaaggt ggagaaggaa 240 taatccacca cctcaagcag gcggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcaa gcagcggcac aggctaacat ggatcaggca agacaaatat gcctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aagacaaatg agtcatatga agattttgcc gcaagactgc tagaagcaat 480 agatgcagaa ccagttacac agcaaataaa gaatatttaa 520 <210> 30 <211> 662 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 30 gcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tagcatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaatag 180 agctcaaaaa gagttaattc aagggaaatt gaatgaggaa gcagaaaggt ggagaaggaa 240 taatccacca cctcaagcag gcggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcaa gcagcggcac aggctaacat ggatcaggca agacaaatat gcctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aagacaaatg agtcatatga aaaattttca gcaagactcc tagaagcaat 480 agatgcagaa ccagttacac agcctataaa agaatattta aagttaacat tatcttacac 540 aaatgcatcc tcagactgtc aaaaacaaat ggatagagta ctaggacaga gagtgcaaca 600 agctagtgtg gaagaaaaaa tgcaagcatg cagagatgtg ggatcagaag gattcagaat 660 gc 662 <210> 31 <211> 720 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 31 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt 
tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 <210> 32 <211> 597 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 32 gcagtcgatg ctgtaatgtt ccagcaaatg caaacagtag ccatgcagca tggtcttgtg 60 tctgaggact ttgaaaggca gttagcatat tgtgctacta cctggacaag taaagatata 120 ttagaagtat tggccatgat gcctggaaat agagctcaaa aagagttaat tcaaggaaaa 180 ttaaacgagg aagcagaaag gtggagaagg aataatccac cgcctccaca aggaggggga 240 ttaacagtgg atcaaattat ggggatagga caaacaaatc aagcagcagc acaagctaac 300 atggatcagg caagacacat atgcctgcaa tgggtaataa cagcattaag agcagtaaga 360 catatggctc acagaccagg gaatccaatg ctcgtaaaac aaaaaacaaa tgagccatat 420 gaagagtttg cagcaaaact attagaagca atagatgcag aaccagtaac acagcccata 480 aaagactatc taaagttaac attatcttat acaaatgcgt cctcagactg tcaaaagcaa 540 atggatagag tgctgggaca aagagtgcaa caagctagtg tagacgagaa aatgcaa 597 <210> 33 <211> 597 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 33 gcagtagact cagtaatgtt ccagcaactg caaacagtag caatgcagca tggcctcgtg 60 tccgaggatt ttgaaaggca gttggcatat tatgctacta cctggacgag taaagacata 120 ctagaagtat tggccatgat gcctggaaac agagctcaaa aggagttaat tcaagggaaa 180 ttaaatgaag aggcagaaag gtggagaaga cataatccac cccctccggc gggaggagga 240 ttaacagtgg atcaaattat gggggtagga caaacaaatc aagcagcagc acaagctaac 300 atggatcagg caagacaaat atgcctgcaa tgggtaataa cagcattaag agcagtgagg 360 tatatgactc acaaaccagg gaatccaatg ctagtaaaac aaaaaacaaa tgaagcatat 420 
gaagagttta cagcgagact gctagaagca atagatgcag agccagtaac acagcccaca 480 aaagaatatc taaaactaac attatcttat acaaatgcat cctcagactg tcaaaagcaa 540 atggatagag tactaggaca aagagtgcaa caagctagtg tagaagaaaa aatgcaa 597 <210> 34 <211> 597 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 34 gcagtcgatg ctgtaatgtt ccagcaaatg caaacagtag ccatgcagca tggtcttgtg 60 tctgaggact ttgaaaggca gttagcatat tatgctacta cctggacaag taaagatata 120 ttagaagtat tggccatgat gcctggaaat agagctcaaa aagagttaat tcaaggaaaa 180 ttaaatgagg aagcagaaag gtggagaagg aataatccac cgcctccaca gggaggggga 240 ttaacagtgg atcaaattat ggggatagga caaacaaatc aagcagcagc acaagctaac 300 atggatcagg caagacacat atgcctgcaa tgggtaataa cagcattaag agcagtaaga 360 catatggctc acagaccagg gaatccaatg ctcgtaaaac aaaaaacaaa tgagccatat 420 gaagagtttg cagcaaaact attagaagca atagatgcag aaccagtaac acagctcata 480 aaagactatc taaagttaac attatcttat acaaatgcgt cctcagactg tcaaaagcaa 540 atggatagag tgctgggaca aagagtgcaa caagctagtg tagacgagaa gatgcaa 597 <210> 35 <211> 1347 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 35 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 780 aggcaaatat gcctgcaatg 
ggtaataaat gcattaagag cagtaagaca tatggcgcac 840 aggccaggga atccaatgct agtaaagcaa aaaacgaatg agccatatga agattttgca 900 gcaagactgc tagaagcaat agatgcagag ccagttacac agcctataaa agattatcta 960 aagctaacac tatcttatac aaatgcatca gcagattgtc agaagcaaat ggatagaaca 1020 ctaggacaaa gagtacaaca agctagtgta gaagaaaaaa tgcaagcatg tagagatgtg 1080 ggatcagaag ggttcaaaat gcaattgtta gcacaagcat taaggccagg aaaaggaaaa 1140 gggaatggac agccacaaag gtgttacaac tgtggaaaac cgggacatca agcaaggcaa 1200 tgtagacaag gaatcatatg tcacaactgt ggaaagagag gacatatgca aaaagaatgc 1260 agaggaaaga gagacataag gggaaaacag cagggaaacg ggaggagggg gatacgtgtg 1320 gtgccgtccg ctcctcctat ggaataa 1347 <210> 36 <211> 597 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 36 gcagtcgatg ctgtaatgtt ccagcaaatg caaacagtag ccatgcagca tggtcttgtg 60 tctgaggact ttgaaaggca gttagcatat tgtgctacta cctggacaag taaagatata 120 ttagaagtat tggccatgat gcctggaaat agagctcaaa aagagttaat tcaaggaaaa 180 ttaaacgagg aagcagaaag gtggagaagg aataatccac cgcctccaca aggaggggga 240 ttaacagtgg atcaaattat ggggatagga caaacaaatc aagcagcagc acaagctaac 300 atggatcagg caagacacat atgcctgcaa tgggtaataa cagcattaag agcagtaaga 360 catatggctc acagaccagg gaatccaatg ctcgtaaaac aaaaaacaaa tgagccatat 420 gaagagtttg cagcaaaact attagaagca atagatgcag aaccagtaac acagcccata 480 aaagactatc taaagttaac attatcttat acaaatgcgt cctcagactg tcaaaagcaa 540 atggatagag tgctgggaca aagagtgcaa caagctagtg tagacgagaa aatgcaa 597 <210> 37 <211> 597 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 37 gcagtagact cagtaatgtt ccagcaactg caaacagtag caatgcagca tggcctcgtg 60 tccgaggatt ttgaaaggca gttggcatat tatgctacta cctggacgag taaagacata 120 ctagaagtat tggccatgat gcctggaaac agagctcaaa aggagttaat tcaagggaaa 180 ttaaatgaag aggcagaaag gtggagaaga cataatccac cccctccggc gggaggagga 240 ttaacagtgg atcaaattat gggggtagga caaacaaatc aagcagcagc acaagctaac 300 atggatcagg caagacaaat atgcctgcaa tgggtaataa cagcattaag agcagtgagg 360 tatatgactc acaaaccagg gaatccaatg ctagtaaaac aaaaaacaaa tgaagcatat 420 
gaagagttta cagcgagact gctagaagca atagatgcag agccagtaac acagcccaca 480 aaagaatatc taaaactaac attatcttat acaaatgcat cctcagactg tcaaaagcaa 540 atggatagag tactaggaca aagagtgcaa caagctagtg tagaagaaaa aatgcaa 597 <210> 38 <211> 597 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 38 gcagtcgatg ctgtaatgtt ccagcaaatg caaacagtag ccatgcagca tggtcttgtg 60 tctgaggact ttgaaaggca gttagcatat tatgctacta cctggacaag taaagatata 120 ttagaagtat tggccatgat gcctggaaat agagctcaaa aagagttaat tcaaggaaaa 180 ttaaatgagg aagcagaaag gtggagaagg aataatccac cgcctccaca gggaggggga 240 ttaacagtgg atcaaattat ggggatagga caaacaaatc aagcagcagc acaagctaac 300 atggatcagg caagacacat atgcctgcaa tgggtaataa cagcattaag agcagtaaga 360 catatggctc acagaccagg gaatccaatg ctcgtaaaac aaaaaacaaa tgagccatat 420 gaagagtttg cagcaaaact attagaagca atagatgcag aaccagtaac acagctcata 480 aaagactatc taaagttaac attatcttat acaaatgcgt cctcagactg tcaaaagcaa 540 atggatagag tgctgggaca aagagtgcaa caagctagtg tagacgagaa gatgcaa 597 <210> 39 <211> 720 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 39 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 <210> 40 <211> 524 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 40 taaagatata 
ttagaagtat tggccatgat gcctggaaat agagctcaaa aagagttaat 60 tcaagggaaa ttgaatgaag aagcagaaag gtggagaagg aataatccac cacctcaagc 120 aggcggagga ttaacagtgg atcaaattat gggggtagga caaacaaatc aagcagcggc 180 acaggctaac atggatcagg caagacaaat atgcctgcaa tgggtaataa cagcactaag 240 agcagtgaga catatggctc acaaaccagg gaatccgatg ctagtaaagc aaaaaacaaa 300 tgagtcatat gaagattttg ccgcaagact gctagaagca atagatgcag aaccagttac 360 aaagcaaata aaagaatatt taaagttaac attatcttac acaaatgcat cctcagactg 420 taagaaacag atggatagag tactaggaca gagagtgcaa caagctagtg tggaagaaaa 480 aatgcaagca tgcagagatg tgggatcaga aggattcaga atgc 524 <210> 41 <211> 1347 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 41 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 780 aggcaaatat gcctgcaatg ggtaataaat gcattaagag cagtaagaca tatggcgcac 840 aggccaggga atccaatgct agtaaagcaa aaaacgaatg agccatatga agattttgca 900 gcaagactgc tagaagcaat agatgcagag ccagttacac agcctataaa agattatcta 960 aagctaacac tatcttatac aaatgcatca gcagattgtc agaagcaaat ggatagaaca 1020 ctaggacaaa gagtacaaca agctagtgta gaagaaaaaa tgcaagcatg tagagatgtg 1080 ggatcagaag ggttcaaaat gcaattgtta gcacaagcat taaggccagg 
aaaaggaaaa 1140 gggaatggac agccacaaag gtgttacaac tgtggaaaac cgggacatca agcaaggcaa 1200 tgtagacaag gaatcatatg tcacaactgt ggaaagagag gacatatgca aaaagaatgc 1260 agaggaaaga gagacataag gggaaaacag cagggaaacg ggaggagggg gatacgtgtg 1320 gtgccgtccg ctcctcctat ggaataa 1347 <210> 42 <211> 524 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 42 taaagatata ttagaagtat tggccatgat gcctggaaat agagctcaaa aagagttaat 60 tcaagggaaa ttgaatgaag aagcagaaag gtggagaagg aataatccac cacctcaagc 120 aggcggagga ttaacagtgg atcaaattat gggggtagga caaacaaatc aagcagcggc 180 acaggctaac atggatcagg caagacaaat atgcctgcaa tgggtaataa cagcactaag 240 agcagtgaga catatggctc acaaaccagg gaatccgatg ctagtaaagc aaaaaacaaa 300 tgagtcatat gaagattttg ccgcaagact gctagaagca atagatgcag aaccagttac 360 aaagcaaata aaagaatatt taaagttaac attatcttac acaaatgcat cctcagactg 420 taagaaacag atggatagag tactaggaca gagagtgcaa caagctagtg tggaagaaaa 480 aatgcaagca tgcagagatg tgggatcaga aggattcaga atgc 524 <210> 43 <211> 720 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 43 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 <210> 44 <211> 593 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 44 tgccgtagac tctgtgatgt tccaccagct 
gcatacagta gcaatgccgc atggcctcgt 60 gtctgaggac tttgaaaggc agttggcata ttatgctact acctggacaa gtaaagatat 120 actggaagta ttggccatga tgcctgggaa tagagctcaa aaagaattaa ttcaaggaaa 180 attaaatgaa gaagcagaaa ggtggagaag gaataatcca ccacctcaag caggcggagg 240 attaacagtg gatcaaatta tgggggtagg acaaacaaat caagcagctg cacaagctaa 300 catggatcag gcaagacaaa tatgcctgca atgggtaata tcagccttaa gagcagtgag 360 acatatgtct cataaaccag ggaatccgct gctagtaaag caaaaaacaa atgagtcata 420 tgaagatttt gcagctagac tgctagaagc aatagatcca gccccagtag cacatcctat 480 aaaagattat ttaaagttaa cactatctta tacgaatgca tcatcagatt gtcaaaagca 540 aatgggtaga atgctaggat cgagagtcca tcaagccagt gtgggccaaa aaa 593 <210> 45 <211> 1347 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 45 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 780 aggcaaatat gcctgcaatg ggtaataaat gcattaagag cagtaagaca tatggcgcac 840 aggccaggga atccaatgct agtaaagcaa aaaacgaatg agccatatga agattttgca 900 gcaagactgc tagaagcaat agatgcagag ccagttacac agcctataaa agattatcta 960 aagctaacac tatcttatac aaatgcatca gcagattgtc agaagcaaat ggatagaaca 1020 ctaggacaaa gagtacaaca agctagtgta gaagaaaaaa tgcaagcatg tagagatgtg 
1080 ggatcagaag ggttcaaaat gcaattgtta gcacaagcat taaggccagg aaaaggaaaa 1140 gggaatggac agccacaaag gtgttacaac tgtggaaaac cgggacatca agcaaggcaa 1200 tgtagacaag gaatcatatg tcacaactgt ggaaagagag gacatatgca aaaagaatgc 1260 agaggaaaga gagacataag gggaaaacag cagggaaacg ggaggagggg gatacgtgtg 1320 gtgccgtccg ctcctcctat ggaataa 1347 <210> 46 <211> 593 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 46 tgccgtagac tctgtgatgt tccaccagct gcatacagta gcaatgccgc atggcctcgt 60 gtctgaggac tttgaaaggc agttggcata ttatgctact acctggacaa gtaaagatat 120 actggaagta ttggccatga tgcctgggaa tagagctcaa aaagaattaa ttcaaggaaa 180 attaaatgaa gaagcagaaa ggtggagaag gaataatcca ccacctcaag caggcggagg 240 attaacagtg gatcaaatta tgggggtagg acaaacaaat caagcagctg cacaagctaa 300 catggatcag gcaagacaaa tatgcctgca atgggtaata tcagccttaa gagcagtgag 360 acatatgtct cataaaccag ggaatccgct gctagtaaag caaaaaacaa atgagtcata 420 tgaagatttt gcagctagac tgctagaagc aatagatcca gccccagtag cacatcctat 480 aaaagattat ttaaagttaa cactatctta tacgaatgca tcatcagatt gtcaaaagca 540 aatgggtaga atgctaggat cgagagtcca tcaagccagt gtgggccaaa aaa 593 <210> 47 <211> 720 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 47 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag 
gaggaggatt aacagtggat 720 <210> 48 <211> 593 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 48 agcagtagat tctgtaatgt tccagcaact gcaaacagta gcaatgcagc atggactcgt 60 gtatgaagac tttgaaaggc tgtcggcata ttatgctact acctggacaa gtaaagatat 120 actggaagta ttggccatga tgcctgggaa tagagctcaa aaagaattaa ttcaaggaaa 180 attaaatgaa gaagcagaaa ggtggagaag gaataatcca ccacctcaag caggcggagg 240 attaacagtg gatcaaatta tgggggtagg acaaacaaat caagcagctg cacaagctaa 300 catggatcag gcaagacaaa tatgcctgca atgggtaata tcagccttaa gagcagtgag 360 acatatgtct cataaaccag ggaatccgct gctagtaaag caaaaaacaa atgagtcata 420 tgaagatttt gcagcaagac tgctagaagc aatagatgca gagccagtag cacatcctat 480 aaaagaatac ttaaagttaa cactatctta tacgaatgca tcatcagatt gtcaaaagca 540 aatggataga atgctggaat caagagtaca acaagctagt gtagaacaaa aaa 593 <210> 49 <211> 593 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 49 agccgtagat tctgtaatgt tccagcagct gcaaacagta gcaatgcagc atggcctcgt 60 gtcagaggac tttgaaaggc ttccagcata tcatgctact acctgggcaa gtaaagatat 120 cttagaagta ctggccatga tgcctggaaa tagagctcaa aaagagttaa ttcaagggaa 180 attaaatgaa gaagcagaga ggtggagaag gaataatcca ccacctccag caggaggagg 240 gttaacagtg gatcaaatta tgggagtagg acaaacaaat caggcagcgg cacaagcaaa 300 catggatcag gcaagacaaa tatgcctaca atgggtgata tcagcactaa gagcagtaag 360 gcatatggct cacaagccag ggaatccaat gttagtaaag caaaaagcaa atgagccata 420 tgaagaattt gcagcaaggc tgctggaagc aatagatgcc gagccagtta atcagcccat 480 aaaagaatat ctaaaactaa cgttgtctta tacgaatgca tcctcagatt gtcagaagca 540 aatggataga acactaggac aaagagtcaa acaagctagt gtagaacaaa aaa 593 <210> 50 <211> 1347 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 50 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 
300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 780 aggcaaatat gcctgcaatg ggtaataaat gcattaagag cagtaagaca tatggcgcac 840 aggccaggga atccaatgct agtaaagcaa aaaacgaatg agccatatga agattttgca 900 gcaagactgc tagaagcaat agatgcagag ccagttacac agcctataaa agattatcta 960 aagctaacac tatcttatac aaatgcatca gcagattgtc agaagcaaat ggatagaaca 1020 ctaggacaaa gagtacaaca agctagtgta gaagaaaaaa tgcaagcatg tagagatgtg 1080 ggatcagaag ggttcaaaat gcaattgtta gcacaagcat taaggccagg aaaaggaaaa 1140 gggaatggac agccacaaag gtgttacaac tgtggaaaac cgggacatca agcaaggcaa 1200 tgtagacaag gaatcatatg tcacaactgt ggaaagagag gacatatgca aaaagaatgc 1260 agaggaaaga gagacataag gggaaaacag cagggaaacg ggaggagggg gatacgtgtg 1320 gtgccgtccg ctcctcctat ggaataa 1347 <210> 51 <211> 593 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 51 agcagtagat tctgtaatgt tccagcaact gcaaacagta gcaatgcagc atggactcgt 60 gtatgaagac tttgaaaggc tgtcggcata ttatgctact acctggacaa gtaaagatat 120 actggaagta ttggccatga tgcctgggaa tagagctcaa aaagaattaa ttcaaggaaa 180 attaaatgaa gaagcagaaa ggtggagaag gaataatcca ccacctcaag caggcggagg 240 attaacagtg gatcaaatta tgggggtagg acaaacaaat caagcagctg cacaagctaa 300 catggatcag gcaagacaaa tatgcctgca atgggtaata tcagccttaa gagcagtgag 360 acatatgtct cataaaccag ggaatccgct gctagtaaag caaaaaacaa atgagtcata 420 tgaagatttt gcagcaagac tgctagaagc aatagatgca gagccagtag cacatcctat 480 aaaagaatat ctaaaactaa cgttgtctta tacgaatgca tcctcagatt gtcagaagca 540 aatggataga acactaggac aaagagtcaa acaagctagt gtagaacaaa 
aaa 593 <210> 52 <211> 593 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 52 agccgtagat tctgtaatgt tccagcagct gcaaacagta gcaatgcagc atggcctcgt 60 gtcagaggac tttgaaaggc ttccagcata tcatgctact acctgggcaa gtaaagatat 120 cttagaagta ctggccatga tgcctggaaa tagagctcaa aaagagttaa ttcaagggaa 180 attaaatgaa gaagcagaga ggtggagaag gaataatcca ccacctccag caggaggagg 240 gttaacagtg gatcaaatta tgggagtagg acaaacaaat caggcagcgg cacaagcaaa 300 catggatcag gcaagacaaa tatgcctaca atgggtgata tcagcactaa gagcagtaag 360 gcatatggct cacaagccag ggaatccaat gttagtaaag caaaaagcaa atgagccata 420 tgaagaattt gcagcaaggc tgctggaagc aatagatgcc gagccagtta atcagcccat 480 aaaagaatat ctaaaactaa cgttgtctta tacgaatgca tcctcagatt gtcagaagca 540 aatggataga acactaggac aaagagtcaa acaagctagt gtagaacaaa aaa 593 <210> 53 <211> 720 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 53 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt attacatgtt tgaggaccat aaagaggaac 180 cttggacaaa aaaacattag agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 <210> 54 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 54 cagcatggcc tcgtgtccga ggactttgaa aggcagttgg catattatgc tactacctgg 60 acaagtaagg acatattaga agtattggcc atgatgccag gaaatagagc tcaaaaggag 120 ctaattcaa 129 <210> 55 <211> 129 <212> DNA <213> Caprine 
arthritis-encephalitis virus <400> 55 cagcatggaa tagtatcaga agagtttgag aggcaactgt cttattatgc taccacttgg 60 acaagcaagg atatcttaga ggtactagcc atgatgcctg gcaatagagc attaaaagag 120 ctaatacaa 129 <210> 56 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 56 caacatggga tagtatcaga ggaatttgag agacaaatgt cttattatgc taccacatgg 60 acaagtaagg atattttaga agtactagca atgatgcccg ggaacagagc attaaaggag 120 ctgatacaa 129 <210> 57 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 57 cagcatggac tagtatcaga agaatttgaa aggcagctag catactatgc cacaacgtgg 60 acaagcaaag acatactaga ggtgttagcc atgatgccag ggaatagagc acaaaaagaa 120 ctaatacaa 129 <210> 58 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 58 cagcatggac ttgtgtccga agattttgag aggcaattgg catattatgc tacaacctgg 60 actagtgaag atatattaga agtattggct atgatgcctg ggaatagagc acagaaagaa 120 ttaatacaa 129 <210> 59 <211> 1347 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 59 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 780 aggcaaatat gcctgcaatg ggtaataaat gcattaagag cagtaagaca tatggcgcac 840 aggccaggga atccaatgct agtaaagcaa aaaacgaatg 
agccatatga agattttgca 900 gcaagactgc tagaagcaat agatgcagag ccagttacac agcctataaa agattatcta 960 aagctaacac tatcttatac aaatgcatca gcagattgtc agaagcaaat ggatagaaca 1020 ctaggacaaa gagtacaaca agctagtgta gaagaaaaaa tgcaagcatg tagagatgtg 1080 ggatcagaag ggttcaaaat gcaattgtta gcacaagcat taaggccagg aaaaggaaaa 1140 gggaatggac agccacaaag gtgttacaac tgtggaaaac cgggacatca agcaaggcaa 1200 tgtagacaag gaatcatatg tcacaactgt ggaaagagag gacatatgca aaaagaatgc 1260 agaggaaaga gagacataag gggaaaacag cagggaaacg ggaggagggg gatacgtgtg 1320 gtgccgtccg ctcctcctat ggaataa 1347 <210> 60 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 60 cagcatggcc tcgtgtccga ggactttgaa aggcagttgg catattatgc tactacctgg 60 acaagtaagg acatattaga agtattggcc atgatgccag gaaatagagc tcaaaaggag 120 ctaattcaa 129 <210> 61 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 61 cagcatggaa tagtatcaga agagtttgag aggcaactgt cttattatgc taccacttgg 60 acaagcaagg atatcttaga ggtactagcc atgatgcctg gcaatagagc attaaaagag 120 ctaatacaa 129 <210> 62 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 62 caacatggga tagtatcaga ggaatttgag agacaaatgt cttattatgc taccacatgg 60 acaagtaagg atattttaga agtactagca atgatgcccg ggaacagagc attaaaggag 120 ctgatacaa 129 <210> 63 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 63 cagcatggac tagtatcaga agaatttgaa aggcagctag catactatgc cacaacgtgg 60 acaagcaaag acatactaga ggtgttagcc atgatgccag ggaatagagc acaaaaagaa 120 ctaatacaa 129 <210> 64 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 64 cagcatggac ttgtgtccga agattttgag aggcaattgg catattatgc tacaacctgg 60 actagtgaag atatattaga agtattggct atgatgcctg ggaatagagc acagaaagaa 120 ttaatacaa 129 <210> 65 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> Primer used to construct dig-labeled probe <400> 65 ctggcgtaat agcgaagagg 20 <210> 66 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> Primer used to construct dig-labeled probe <400> 66 aactcgccgc 
acatctgaac 20 <210> 67 <211> 3911 <212> DNA <213> Artificial <220> <223> pCAH/SINd0 <400> 67 atccagcaca gtggcggccg ctagcacaaa aataaaaaaa gaaagggtgg actgtgagac 60 atgggctaaa gaggagcggc cgctcgagtc tagaactagt ggatcagctt tgctgcttgc 120 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 180 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 240 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagcc gacctcgagg 300 gggggcccta ttctatagtg tcacctaaat gctagagctc gctgatcagc ctcgactgtg 360 ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 420 ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 480 aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 540 gacaatagca ggcatgctgg ggatgcggtg ggctctatgg cttctgaggc ggaaagaacc 600 agtggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc 660 aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag 720 gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 780 gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt 840 tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct 900 ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg 960 ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct 1020 tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat 1080 tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg 1140 ctacactaga aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa 1200 aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt 1260 ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc 1320 tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt 1380 atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta 1440 aagtatatat gagtaacctg atcaggactc ttccttttca tgaacaataa aactgtctgc 1500 ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa cgtcttgctc 1560 taggccgcga ttaaattcca acatggatgc tgatttatat gggtataaat gggctcgcga 1620 
taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg atgcgccaga 1680 gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg agatggtcag 1740 actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta tccgtactcc 1800 tgatgatgca tggttactca ccactgcgat ccccgggaaa acagcattcc aggtattaga 1860 agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc tgcgccggtt 1920 gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc gtctcgctca 1980 ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg acgagcgtaa 2040 tggctggcct gttgaacaag tctggaaaga aatgcataaa cttttgccat tctcaccgga 2100 ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg aggggaaatt 2160 aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg atcttgccat 2220 cctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt ttcaaaaata 2280 tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg atgagttttt 2340 ctaagaattc gcgcaattaa ccctcactaa agggaacaaa agctgggtac cgggcccgtt 2400 gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 2460 catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 2520 acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 2580 ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 2640 aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct 2700 ggcattatgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat 2760 tagtcatcgc tattaccatg gtgatgcggt tttggcagta catcaatggg cgtggatagc 2820 ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg agtttgtttt 2880 ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa 2940 tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcaagct tgctgcttgc 3000 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 3060 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 3120 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagct ggcgcccaac 3180 gtggggcccg aggagaagaa aagaaagcgg ccctgagaac tcggcttctg aaaaagagga 3240 agaggacaag ttgctatagc aacaagagag aagaagtaga gcaaaggtcc agtggctcgg 3300 aaaaagagga 
actgaaactt cggggacgcc tgaaggagta aggtaagtga ctctgctgta 3360 cgcggggcga ggcagaggtt tccttctaaa ttgaaagaga agtgttgctg cgagaggtct 3420 tggtggtcga gaatcctgta caaaaaaaag gagggatctc ggtcaggacc aggacccctg 3480 ggagtaatac aacagcaaca ccgtaagaaa atccgcctag ggaattcgat tctagaggtg 3540 atagaaatgc cagaaaacta tgcaaaaaca agaatcataa acaggaaaaa aagagaactc 3600 agccacaaga ggaagaagag aggcgttggc ttggtcatta tgctagttat catggcaata 3660 gtagctgccg caggggcttc tctgggagtc gcaaacgcga ttcagcagtc ttacactaag 3720 gcagctgtcc agacccttgc taatgcaact gctgcacagc aggatgtgtt agaggcaacc 3780 tatgccatgg tacagcatgt ggctaaaggc gtacgaatct tggaagctcg agtggctcga 3840 gtggaagcta tcacagatag aataatgcta taccaagaat tggattgttg gcactaggat 3900 ccatcgccac c 3911 <210> 68 <211> 4238 <212> DNA <213> Artificial <220> <223> pCAH/SINd1 <400> 68 atccagcaca gtggcggccg ctagcacaaa aataaaaaaa gaaagggtgg actgtgagac 60 atgggctaaa gaggagcggc cgctcgagtc tagaactagt ggatcagctt tgctgcttgc 120 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 180 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 240 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagcc gacctcgagg 300 gggggcccta ttctatagtg tcacctaaat gctagagctc gctgatcagc ctcgactgtg 360 ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 420 ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 480 aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 540 gacaatagca ggcatgctgg ggatgcggtg ggctctatgg cttctgaggc ggaaagaacc 600 agtggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc 660 aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag 720 gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 780 gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt 840 tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct 900 ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg 960 ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct 1020 tgagtccaac 
ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat 1080 tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg 1140 ctacactaga aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa 1200 aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt 1260 ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc 1320 tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt 1380 atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta 1440 aagtatatat gagtaacctg atcaggactc ttccttttca tgaacaataa aactgtctgc 1500 ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa cgtcttgctc 1560 taggccgcga ttaaattcca acatggatgc tgatttatat gggtataaat gggctcgcga 1620 taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg atgcgccaga 1680 gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg agatggtcag 1740 actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta tccgtactcc 1800 tgatgatgca tggttactca ccactgcgat ccccgggaaa acagcattcc aggtattaga 1860 agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc tgcgccggtt 1920 gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc gtctcgctca 1980 ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg acgagcgtaa 2040 tggctggcct gttgaacaag tctggaaaga aatgcataaa cttttgccat tctcaccgga 2100 ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg aggggaaatt 2160 aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg atcttgccat 2220 cctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt ttcaaaaata 2280 tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg atgagttttt 2340 ctaagaattc gcgcaattaa ccctcactaa agggaacaaa agctgggtac cgggcccgtt 2400 gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 2460 catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 2520 acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 2580 ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 2640 aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct 2700 ggcattatgc ccagtacatg 
accttatggg actttcctac ttggcagtac atctacgtat 2760 tagtcatcgc tattaccatg gtgatgcggt tttggcagta catcaatggg cgtggatagc 2820 ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg agtttgtttt 2880 ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa 2940 tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcaagct tgctgcttgc 3000 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 3060 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 3120 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagct ggcgcccaac 3180 gtggggcccg aggagaagaa aagaaagcgg ccctgagaac tcggcttctg aaaaagagga 3240 agaggacaag ttgctatagc aacaagagag aagaagtaga gcaaaggtcc agtggctcgg 3300 aaaaagagga actgaaactt cggggacgcc tgaaggagta aggtaagtga ctctgctgta 3360 cgcggggcga ggcagaggtt tccttctaaa ttgaaagaga agtgttgctg cgagaggtct 3420 tggtggtcga gaatcctgta caaaaaaaag gagggatctc ggtcaggacc aggacccctg 3480 ggagtaatac aacagcaaca ccgtaagaaa atccgcctag gtgagtctag atagagacta 3540 ggcgaggcaa gtctccgggg ggaaaagaga ttatcctgag ctcgaaaaat gtatcaagca 3600 tgcatgcaag ataaaagttc gactcagagg ggagcacttg acagaaggaa attgtttatg 3660 gtgccttaaa acattagatt acatgtttga ggaccataaa gaggaacctt ggacaaaagt 3720 aaaatttagg acaatatggc agaaggtgaa gaatctaact cctgaggaga gtaacaaaaa 3780 agactttatg tctttgcagg ccacattagc gggtctaatg tgttgccaaa tggggatgag 3840 acctgcagga attcgattct agaggtgata gaaatgccag aaaactatgc aaaaacaaga 3900 atcataaaca ggaaaaaaag agaactcagc cacaagagga agaagagagg cgttggcttg 3960 gtcattatgc tagttatcat ggcaatagta gctgccgcag gggcttctct gggagtcgca 4020 aacgcgattc agcagtctta cactaaggca gctgtccaga cccttgctaa tgcaactgct 4080 gcacagcagg atgtgttaga ggcaacctat gccatggtac agcatgtggc taaaggcgta 4140 cgaatcttgg aagctcgagt ggctcgagtg gaagctatca cagatagaat aatgctatac 4200 caagaattgg attgttggca ctaggatcca tcgccacc 4238 <210> 69 <211> 4523 <212> DNA <213> Artificial <220> <223> pCAH/SINd2 <400> 69 atccagcaca gtggcggccg ctagcacaaa aataaaaaaa gaaagggtgg actgtgagac 60 atgggctaaa gaggagcggc cgctcgagtc tagaactagt 
ggatcagctt tgctgcttgc 120 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 180 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 240 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagcc gacctcgagg 300 gggggcccta ttctatagtg tcacctaaat gctagagctc gctgatcagc ctcgactgtg 360 ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 420 ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 480 aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 540 gacaatagca ggcatgctgg ggatgcggtg ggctctatgg cttctgaggc ggaaagaacc 600 agtggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc 660 aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag 720 gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 780 gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt 840 tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct 900 ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg 960 ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct 1020 tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat 1080 tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg 1140 ctacactaga aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa 1200 aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt 1260 ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc 1320 tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt 1380 atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta 1440 aagtatatat gagtaacctg atcaggactc ttccttttca tgaacaataa aactgtctgc 1500 ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa cgtcttgctc 1560 taggccgcga ttaaattcca acatggatgc tgatttatat gggtataaat gggctcgcga 1620 taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg atgcgccaga 1680 gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg agatggtcag 1740 actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta tccgtactcc 1800 
tgatgatgca tggttactca ccactgcgat ccccgggaaa acagcattcc aggtattaga 1860 agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc tgcgccggtt 1920 gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc gtctcgctca 1980 ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg acgagcgtaa 2040 tggctggcct gttgaacaag tctggaaaga aatgcataaa cttttgccat tctcaccgga 2100 ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg aggggaaatt 2160 aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg atcttgccat 2220 cctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt ttcaaaaata 2280 tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg atgagttttt 2340 ctaagaattc gcgcaattaa ccctcactaa agggaacaaa agctgggtac cgggcccgtt 2400 gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 2460 catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 2520 acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 2580 ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 2640 aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct 2700 ggcattatgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat 2760 tagtcatcgc tattaccatg gtgatgcggt tttggcagta catcaatggg cgtggatagc 2820 ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg agtttgtttt 2880 ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa 2940 tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcaagct tgctgcttgc 3000 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 3060 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 3120 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagct ggcgcccaac 3180 gtggggcccg aggagaagaa aagaaagcgg ccctgagaac tcggcttctg aaaaagagga 3240 agaggacaag ttgctatagc aacaagagag aagaagtaga gcaaaggtcc agtggctcgg 3300 aaaaagagga actgaaactt cggggacgcc tgaaggagta aggtaagtga ctctgctgta 3360 cgcggggcga ggcagaggtt tccttctaaa ttgaaagaga agtgttgctg cgagaggtct 3420 tggtggtcga gaatcctgta caaaaaaaag gagggatctc ggtcaggacc aggacccctg 3480 ggagtaatac 
aacagcaaca ccgtaagaaa atccgcctag gtgagtctag atagagacta 3540 ggcgaggcaa gtctccgggg ggaaaagaga ttatcctgag ctcgaaaaat gtatcaagca 3600 tgcatgcaag ataaaagttc gactcagagg ggagcacttg acagaaggaa attgtttatg 3660 gtgccttaaa acattagatt acatgtttga ggaccataaa gaggaacctt ggacaaaagt 3720 aaaatttagg acaatatggc agaaggtgaa gaatctaact cctgaggaga gtaacaaaaa 3780 agactttatg tctttgcagg ccacattagc gggtctaatg tgttgccaaa tggggatgag 3840 acctgagaca ttgcaagatg caatggctac agtaatcatg aaagatgggt tactggaaca 3900 agaggaaaag aaggaagaca aaagagaaaa ggaagagagt gtcttcccaa tagtagtgca 3960 agcagcagga gggagaagct ggaaagcagt agattctgta atgttccagc aactgcaaac 4020 agtagcaatg cagcatggcc tcgtgtctga ggactttgaa aggcagttgg catattatgc 4080 tactacctgg acaagtaaag acatactaga agtattggcc atgatgcctg caggaattcg 4140 attctagagg tgatagaaat gccagaaaac tatgcaaaaa caagaatcat aaacaggaaa 4200 aaaagagaac tcagccacaa gaggaagaag agaggcgttg gcttggtcat tatgctagtt 4260 atcatggcaa tagtagctgc cgcaggggct tctctgggag tcgcaaacgc gattcagcag 4320 tcttacacta aggcagctgt ccagaccctt gctaatgcaa ctgctgcaca gcaggatgtg 4380 ttagaggcaa cctatgccat ggtacagcat gtggctaaag gcgtacgaat cttggaagct 4440 cgagtggctc gagtggaagc tatcacagat agaataatgc tataccaaga attggattgt 4500 tggcactagg atccatcgcc acc 4523 <210> 70 <211> 4819 <212> DNA <213> Artificial <220> <223> pCAH/SINd3 <400> 70 atccagcaca gtggcggccg ctagcacaaa aataaaaaaa gaaagggtgg actgtgagac 60 atgggctaaa gaggagcggc cgctcgagtc tagaactagt ggatcagctt tgctgcttgc 120 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 180 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 240 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagcc gacctcgagg 300 gggggcccta ttctatagtg tcacctaaat gctagagctc gctgatcagc ctcgactgtg 360 ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 420 ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 480 aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 540 gacaatagca ggcatgctgg ggatgcggtg ggctctatgg cttctgaggc 
ggaaagaacc 600 agtggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc 660 aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag 720 gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 780 gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt 840 tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct 900 ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg 960 ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct 1020 tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat 1080 tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg 1140 ctacactaga aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa 1200 aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt 1260 ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc 1320 tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt 1380 atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta 1440 aagtatatat gagtaacctg atcaggactc ttccttttca tgaacaataa aactgtctgc 1500 ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa cgtcttgctc 1560 taggccgcga ttaaattcca acatggatgc tgatttatat gggtataaat gggctcgcga 1620 taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg atgcgccaga 1680 gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg agatggtcag 1740 actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta tccgtactcc 1800 tgatgatgca tggttactca ccactgcgat ccccgggaaa acagcattcc aggtattaga 1860 agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc tgcgccggtt 1920 gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc gtctcgctca 1980 ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg acgagcgtaa 2040 tggctggcct gttgaacaag tctggaaaga aatgcataaa cttttgccat tctcaccgga 2100 ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg aggggaaatt 2160 aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg atcttgccat 2220 cctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt ttcaaaaata 2280 
tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg atgagttttt 2340 ctaagaattc gcgcaattaa ccctcactaa agggaacaaa agctgggtac cgggcccgtt 2400 gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 2460 catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 2520 acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 2580 ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 2640 aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct 2700 ggcattatgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat 2760 tagtcatcgc tattaccatg gtgatgcggt tttggcagta catcaatggg cgtggatagc 2820 ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg agtttgtttt 2880 ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa 2940 tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcaagct tgctgcttgc 3000 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 3060 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 3120 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagct ggcgcccaac 3180 gtggggcccg aggagaagaa aagaaagcgg ccctgagaac tcggcttctg aaaaagagga 3240 agaggacaag ttgctatagc aacaagagag aagaagtaga gcaaaggtcc agtggctcgg 3300 aaaaagagga actgaaactt cggggacgcc tgaaggagta aggtaagtga ctctgctgta 3360 cgcggggcga ggcagaggtt tccttctaaa ttgaaagaga agtgttgctg cgagaggtct 3420 tggtggtcga gaatcctgta caaaaaaaag gagggatctc ggtcaggacc aggacccctg 3480 ggagtaatac aacagcaaca ccgtaagaaa atccgcctag gtgagtctag atagagacta 3540 ggcgaggcaa gtctccgggg ggaaaagaga ttatcctgag ctcgaaaaat gtatcaagca 3600 tgcatgcaag ataaaagttc gactcagagg ggagcacttg acagaaggaa attgtttatg 3660 gtgccttaaa acattagatt acatgtttga ggaccataaa gaggaacctt ggacaaaagt 3720 aaaatttagg acaatatggc agaaggtgaa gaatctaact cctgaggaga gtaacaaaaa 3780 agactttatg tctttgcagg ccacattagc gggtctaatg tgttgccaaa tggggatgag 3840 acctgagaca ttgcaagatg caatggctac agtaatcatg aaagatgggt tactggaaca 3900 agaggaaaag aaggaagaca aaagagaaaa ggaagagagt gtcttcccaa tagtagtgca 3960 agcagcagga 
gggagaagct ggaaagcagt agattctgta atgttccagc aactgcaaac 4020 agtagcaatg cagcatggcc tcgtgtctga ggactttgaa aggcagttgg catattatgc 4080 tactacctgg acaagtaaag acatactaga agtattggcc atgatgcctg gaaatagagc 4140 tcaaaaggag ttaattcaag ggaaattaaa tgaagaagca gaaaggtgga gaaggaataa 4200 tccaccacct ccagcaggag gaggattaac agtggatcaa attatggggg taggacaaac 4260 aaatcaagca gcagcacaag ctaacatgga tcaggcaagg caaatatgcc tgcaatgggt 4320 aataaatgca ttaagagcag taagacatat ggcgcacagg ccagggaatc caatgctagt 4380 aaagcaaaaa acgaatgagc catatgaaga ttttgcagca agactgcagg aattcgattc 4440 tagaggtgat agaaatgcca gaaaactatg caaaaacaag aatcataaac aggaaaaaaa 4500 gagaactcag ccacaagagg aagaagagag gcgttggctt ggtcattatg ctagttatca 4560 tggcaatagt agctgccgca ggggcttctc tgggagtcgc aaacgcgatt cagcagtctt 4620 acactaaggc agctgtccag acccttgcta atgcaactgc tgcacagcag gatgtgttag 4680 aggcaaccta tgccatggta cagcatgtgg ctaaaggcgt acgaatcttg gaagctcgag 4740 tggctcgagt ggaagctatc acagatagaa taatgctata ccaagaattg gattgttggc 4800 actaggatcc atcgccacc 4819 <210> 71 <211> 5112 <212> DNA <213> Artificial <220> <223> pCAH/SINd4 <400> 71 atccagcaca gtggcggccg ctagcacaaa aataaaaaaa gaaagggtgg actgtgagac 60 atgggctaaa gaggagcggc cgctcgagtc tagaactagt ggatcagctt tgctgcttgc 120 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 180 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 240 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagcc gacctcgagg 300 gggggcccta ttctatagtg tcacctaaat gctagagctc gctgatcagc ctcgactgtg 360 ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 420 ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 480 aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 540 gacaatagca ggcatgctgg ggatgcggtg ggctctatgg cttctgaggc ggaaagaacc 600 agtggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc 660 aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag 720 gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 780 
gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt 840 tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct 900 ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg 960 ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct 1020 tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat 1080 tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg 1140 ctacactaga aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa 1200 aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt 1260 ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc 1320 tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt 1380 atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta 1440 aagtatatat gagtaacctg atcaggactc ttccttttca tgaacaataa aactgtctgc 1500 ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa cgtcttgctc 1560 taggccgcga ttaaattcca acatggatgc tgatttatat gggtataaat gggctcgcga 1620 taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg atgcgccaga 1680 gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg agatggtcag 1740 actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta tccgtactcc 1800 tgatgatgca tggttactca ccactgcgat ccccgggaaa acagcattcc aggtattaga 1860 agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc tgcgccggtt 1920 gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc gtctcgctca 1980 ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg acgagcgtaa 2040 tggctggcct gttgaacaag tctggaaaga aatgcataaa cttttgccat tctcaccgga 2100 ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg aggggaaatt 2160 aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg atcttgccat 2220 cctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt ttcaaaaata 2280 tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg atgagttttt 2340 ctaagaattc gcgcaattaa ccctcactaa agggaacaaa agctgggtac cgggcccgtt 2400 gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 2460 catatatgga 
gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 2520 acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 2580 ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 2640 aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct 2700 ggcattatgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat 2760 tagtcatcgc tattaccatg gtgatgcggt tttggcagta catcaatggg cgtggatagc 2820 ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg agtttgtttt 2880 ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa 2940 tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcaagct tgctgcttgc 3000 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 3060 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 3120 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagct ggcgcccaac 3180 gtggggcccg aggagaagaa aagaaagcgg ccctgagaac tcggcttctg aaaaagagga 3240 agaggacaag ttgctatagc aacaagagag aagaagtaga gcaaaggtcc agtggctcgg 3300 aaaaagagga actgaaactt cggggacgcc tgaaggagta aggtaagtga ctctgctgta 3360 cgcggggcga ggcagaggtt tccttctaaa ttgaaagaga agtgttgctg cgagaggtct 3420 tggtggtcga gaatcctgta caaaaaaaag gagggatctc ggtcaggacc aggacccctg 3480 ggagtaatac aacagcaaca ccgtaagaaa atccgcctag gtgagtctag atagagacta 3540 ggcgaggcaa gtctccgggg ggaaaagaga ttatcctgag ctcgaaaaat gtatcaagca 3600 tgcatgcaag ataaaagttc gactcagagg ggagcacttg acagaaggaa attgtttatg 3660 gtgccttaaa acattagatt acatgtttga ggaccataaa gaggaacctt ggacaaaagt 3720 aaaatttagg acaatatggc agaaggtgaa gaatctaact cctgaggaga gtaacaaaaa 3780 agactttatg tctttgcagg ccacattagc gggtctaatg tgttgccaaa tggggatgag 3840 acctgagaca ttgcaagatg caatggctac agtaatcatg aaagatgggt tactggaaca 3900 agaggaaaag aaggaagaca aaagagaaaa ggaagagagt gtcttcccaa tagtagtgca 3960 agcagcagga gggagaagct ggaaagcagt agattctgta atgttccagc aactgcaaac 4020 agtagcaatg cagcatggcc tcgtgtctga ggactttgaa aggcagttgg catattatgc 4080 tactacctgg acaagtaaag acatactaga agtattggcc atgatgcctg gaaatagagc 4140 tcaaaaggag ttaattcaag 
ggaaattaaa tgaagaagca gaaaggtgga gaaggaataa 4200 tccaccacct ccagcaggag gaggattaac agtggatcaa attatggggg taggacaaac 4260 aaatcaagca gcagcacaag ctaacatgga tcaggcaagg caaatatgcc tgcaatgggt 4320 aataaatgca ttaagagcag taagacatat ggcgcacagg ccagggaatc caatgctagt 4380 aaagcaaaaa acgaatgagc catatgaaga ttttgcagca agactgctag aagcaataga 4440 tgcagagcca gttacacagc ctataaaaga ttatctaaag ctaacactat cttatacaaa 4500 tgcatcagca gattgtcaga agcaaatgga tagaacacta ggacaaagag tacaacaagc 4560 tagtgtagaa gaaaaaatgc aagcatgtag agatgtggga tcagaagggt tcaaaatgca 4620 attgttagca caagcattaa ggccaggaaa aggaaaaggg aatggacagc cacaaaggtg 4680 ttacaactgt ggaaaaccgg gacatcaagc aaggcactgc aggaattcga ttctagaggt 4740 gatagaaatg ccagaaaact atgcaaaaac aagaatcata aacaggaaaa aaagagaact 4800 cagccacaag aggaagaaga gaggcgttgg cttggtcatt atgctagtta tcatggcaat 4860 agtagctgcc gcaggggctt ctctgggagt cgcaaacgcg attcagcagt cttacactaa 4920 ggcagctgtc cagacccttg ctaatgcaac tgctgcacag caggatgtgt tagaggcaac 4980 ctatgccatg gtacagcatg tggctaaagg cgtacgaatc ttggaagctc gagtggctcg 5040 agtggaagct atcacagata gaataatgct ataccaagaa ttggattgtt ggcactagga 5100 tccatcgcca cc 5112 <210> 72 <211> 7579 <212> DNA <213> Artificial <220> <223> pMYKEF1/env <220> <221> misc_feature <222> (994)..(994) <223> n is a, c, g, or t <400> 72 aacaggaaag ttccattgga gccaagtaca ttgagtcaat agggactttc caatgggttt 60 tgcccagtac ataaggtcaa tgggaggtaa gccaatgggt ttttcccatt actggcacgt 120 atactgagtc attagggact ttccaatggg ttttgcccag tacataaggt caataggggt 180 gaatcaacag gaaagtccca ttggagccaa gtacactgag tcaataggga ctttccattg 240 ggttttgccc agtacaaaag gtcaataggg ggtgagtcaa tgggtttttc ccattattgg 300 cacgtacata aggtcaatag gggtgagtca ttgggttttt ccagccaatt taattaaaac 360 gccatgtact ttcccaccat tgacgtcaat gggctattga aactaatgca acgtgacctt 420 taaacggtac tttcccatag ctgattaatg ggaaagtacc gttctcgagc caatacacgt 480 caatgggaag tgaaagggca gccaaaacgt aacaccgccc cggttttccc tggaaattcc 540 atattggcac gcattctatt ggctgagctg cgttcacgtg ggtataagag gcgcgaccag 600 cgtcggtacc 
gtcgcagtct tcggtctgac caccgtagaa cgcagagctc ctcgctgcag 660 gcatgcaagc ttggtaagtg ccgtgtgtgg ttcccgcggg cctggcctct ttacgggtta 720 tggcccttgc gtgccttgaa ttacttccac gcccctggct gcagtacgtg attcttgatc 780 ccgagcttcg ggttggaagt gggtgggaga gttcaaggcc ttgcgcttaa ggagcccctt 840 cgccttttgc ttgagttgag gcctggcctg ggcgctgggg ccgccgcgtg caaatctggt 900 ggcaccttcg cgcctgtctc gctgctttcg ataagtctct agccatttaa aatttttgat 960 gacctgctgc gacgcttttt ttctggcaag atantcttgt aaatgcgggc caagatctgc 1020 acactggtat ttcggttttt ggggccgcgg gcggctacgg ggcccgtgcg tcccagcgca 1080 catgttcggc gaggaggggc ctgcgagcgc ggccaccgag aatcggacgg gggtagtctc 1140 aagctggccg gcctgctctg gtgcctggcc tcgcgccgcc gtgtatcgcc ccgccctggg 1200 cggcaaggct ggcccggtcg gcaccagttg cgtgagcgga aagatggccg cttcccggcc 1260 ctgctgcagg gagctcaaaa tggaggacgc ggcgctcggg agagcgggcg ggtgagtcac 1320 ccacacaaag gaaaagggcc tttccgtcct cagccgtcgc ttcatgtgac tccacggagt 1380 accgggcgcc gtccaggcac ctcgattagt tctcgagctt ttggagtacg tcgtctttag 1440 gttgggggga ggggttttat gcgatggagt ttccccacac tgagtgggtg gagactgaag 1500 ttaggccagc ttggcacttg atgtaattct ccttggaatt tgcccttttt gagtttggat 1560 cttggttcat tctcaagcct cagacagtgg ttcaaagttt ttttcttcca tttcagggat 1620 ccactagtaa cggccgccag tgtgctggaa ttcgatcata cctggtgttg ctgactaccc 1680 cgaccgcggt aaaagtcgat ggtattgctg cctgggtcca tgcttctcac ctcaaacctg 1740 caccaccttc ggcaccagat gagtcctggg agctggaaaa gactgatcat cctcttaagc 1800 tgcgtattcg gcggcggcgg gacgagtctg caaaataaga acccccacca gcccatgacc 1860 ctcacttggc aggtactgtc ccaaactgga gacgttgtct gggatacaaa ggcagtccag 1920 cccccttgga cttggtggcc cacacttaaa cctgatgtat gtgccttggc ggctagtctt 1980 gagtcctggg atatcccggg aaccgatgtc tcgtcctcta aacgagtcag acctccggac 2040 tcagactata ctgccgctta taagcaaatc acctggggag ccatagggtg cagctaccct 2100 cgggctagga ctagaatggc aagctctacc ttctacgtat gtccccggga tggccggacc 2160 ctttcagaag ctagaaggtg cggggggcta gaatccctat actgtaaaga atgggattgt 2220 gagaccacgg ggaccggtta ttggctatct aaatcctcaa aagacctcat aactgtaaaa 2280 tgggaccaaa atagcgaatg 
gactcaaaaa tttcaacagt gtcaccagac cggctggtgt 2340 aaccccctta aaatagattt cacagacaaa ggaaaattat ccaaggactg gataacggga 2400 aaaacctggg gattaagatt ctatgtgtct ggacatccag gcgtacagtt caccattcgc 2460 ttaaaaatca ccaacatgcc agctgtggca gtaggtcctg acctcgtcct tgtggaacaa 2520 ggacctccta gaacgtccct cgctctccca cctcctcttc ccccaaggga agcgccaccg 2580 ccatctctcc ccgactctaa ctccacagcc ctggcgacta gtgcacaaac tcccacggtg 2640 agaaaaacaa ttgttaccct aaacactccg cctcccacca caggcgacag actttttgat 2700 cttgtgcagg gggccttcct aaccttaaat gctaccaacc caggggccac tgagtcttgc 2760 tggctttgtt tggccatggg ccccccttat tatgaagcaa tagcctcatc aggagaggtc 2820 gcctactcca ccgaccttga ccggtgccgc tgggggaccc aaggaaagct caccctcact 2880 gaggtctcag gacacgggtt gtgcatagga aaggtgccct ttacccatca gcatctctgc 2940 aatcagaccc tatccatcaa ttcctccgga gaccatcagt atctgctccc ctccaaccat 3000 agctggtggg cttgcagcac tggcctcacc ccttgcctct ccacctcagt ttttaatcag 3060 actagagatt tctgtatcca ggtccagctg attcctcgca tctattacta tcctgaagaa 3120 gttttgttac aggcctatga caattctcac cccaggacta aaagagaggc tgtctcactt 3180 accctagctg ttttactggg gttgggaatc acggcgggaa taggtactgg ttcaactgcc 3240 ttaattaaag gacctataga cctccagcaa ggcctgacaa gcctccagat cgccatagat 3300 gctgacctcc gggccctcca agactcagtc agcaagttag aggactcact gacttccctg 3360 tccgaggtag tgctccaaaa taggagaggc cttgacttgc tgtttctaaa agaaggtggc 3420 ctctgtgcgg ccctaaagga agagtgctgt ttttacatag accactcagg tgcagtacgg 3480 gactccatga aaaaactcaa agaaaaactg gataaaagac agttagagcg ccagaaaagc 3540 caaaactggt atgaaggatg gttcaataac tccccttggt tcactaccct gctatcaacc 3600 atcgctgggc ccctattact cctccttctg ttgctcatcc tcgggccatg catcatcaat 3660 aagttagttc aattcatcaa tgataggata agtgcagtta aaattctggt ccttagacaa 3720 aaatatcagg ccctagagaa cgaaggtaac ctttaatttt gctctaagat tagagctatt 3780 cacaagagaa atggggatca ctagtgaatt ctgcagatat ccatcacact ggcggccgct 3840 cgagcatgca tctagagggc cctattctat agtgtcacct aaatgctaga gctcgctgat 3900 cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 3960 ccttgaccct ggaaggtgcc actcccactg 
tcctttccta ataaaatgag gaaattgcat 4020 cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 4080 gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg 4140 aggcggaaag aaccagtggc ggtaatacgg ttatccacag aatcagggga taacgcagga 4200 aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg 4260 gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag 4320 aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc 4380 gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg 4440 ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt 4500 cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc 4560 ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc 4620 actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg 4680 tggcctaact acggctacac tagaaggaca gtatttggta tctgcgctct gctgaagcca 4740 gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc 4800 ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat 4860 cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt 4920 ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt 4980 tttaaatcaa tctaaagtat atatgagtaa cctgaggcta tggcagggcc tgccgccccg 5040 acgttggctg cgagccctgg gccttcaccc gaacttgggg ggtggggtgg ggaaaaggaa 5100 gaaacgcggg cgtattggcc ccaatggggt ctcggtgggg tatcgacaga gtgccagccc 5160 tgggaccgaa ccccgcgttt atgaacaaac gacccaacac cgtgcgtttt attctgtctt 5220 tttattgccg tcatagcgcg ggttccttcc ggtattgtct ccttccgtgt ttcagttagc 5280 ctccccctag ggtgggcgaa gaactccagc atgagatccc cgcgctggag gatcatccag 5340 ccggcgtccc ggaaaacgat tccgaagccc aacctttcat agaaggcggc ggtggaatcg 5400 aaatctcgtg atggcaggtt gggcgtcgct tggtcggtca tttcgaaccc cagagtcccg 5460 ctcagaagaa ctcgtcaaga aggcgataga aggcgatgcg ctgcgaatcg ggagcggcga 5520 taccgtaaag cacgaggaag cggtcagccc attcgccgcc aagctcttca gcaatatcac 5580 gggtagccaa cgctatgtcc tgatagcggt ccgccacacc cagccggcca cagtcgatga 5640 atccagaaaa gcggccattt tccaccatga tattcggcaa 
gcaggcatcg ccatgggtca 5700 cgacgagatc ctcgccgtcg ggcatgctcg ccttgagcct ggcgaacagt tcggctggcg 5760 cgagcccctg atgctcttga tcatcctgat cgacaagacc ggcttccatc cgagtacgtg 5820 ctcgctcgat gcgatgtttc gcttggtggt cgaatgggca ggtagccgga tcaagcgtat 5880 gcagccgccg cattgcatca gccatgatgg atactttctc ggcaggagca aggtgagatg 5940 acaggagatc ctgccccggc acttcgccca atagcagcca gtcccttccc gcttcagtga 6000 caacgtcgag cacagctgcg caaggaacgc ccgtcgtggc cagccacgat agccgcgctg 6060 cctcgtcttg cagttcattc agggcaccgg acaggtcggt cttgacaaaa agaaccgggc 6120 gcccctgcgc tgacagccgg aacacggcgg catcagagca gccgattgtc tgttgtgccc 6180 agtcatagcc gaatagcctc tccacccaag cggccggaga acctgcgtgc aatccatctt 6240 gttcaatcat gcgaaacgat cctcatcctg tctcttgatc gatctttgca aaagcctagg 6300 cctccaaaaa agcctcctca ctacttctgg aatagctcag aggccgaggc ggcctcggcc 6360 tctgcataaa taaaaaaaat tagtcagcca tggggcggag aatgggcgga actgggcgga 6420 gttaggggcg ggatgggcgg agttaggggc gggactatgg ttgctgacta attgagatgc 6480 atgctttgca tacttctgcc tgctggggag cctggggact ttccacacct ggttgctgac 6540 taattgagat gcatgctttg catacttctg cctgctgggg agcctgggga ctttccacac 6600 cctaactgac acacattcca cagctggttc tttccgcctc aggactcttc ctttttcaat 6660 aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt 6720 gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc 6780 gtgtagataa ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg 6840 cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc 6900 gagcgcagaa gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg 6960 gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca 7020 ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga 7080 tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct 7140 ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg 7200 cataattctc ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca 7260 accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata 7320 cgggataata ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg 
aaaacgttct 7380 tcggggcgaa aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact 7440 cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa 7500 acaggaaggc aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc 7560 atactcttcc tttttcaat 7579 <210> 73 <211> 3566 <212> DNA <213> Artificial <220> <223> pCAH/SINd <400> 73 atccagcaca gtggcggccg ctagcacaaa aataaaaaaa gaaagggtgg actgtgagac 60 atgggctaaa gaggagcggc cgctcgagtc tagaactagt ggatcagctt tgctgcttgc 120 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 180 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 240 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagcc gacctcgagg 300 gggggcccta ttctatagtg tcacctaaat gctagagctc gctgatcagc ctcgactgtg 360 ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 420 ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 480 aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 540 gacaatagca ggcatgctgg ggatgcggtg ggctctatgg cttctgaggc ggaaagaacc 600 agtggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc 660 aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag 720 gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 780 gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt 840 tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct 900 ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg 960 ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct 1020 tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat 1080 tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg 1140 ctacactaga aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa 1200 aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt 1260 ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc 1320 tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt 1380 atcaaaaagg atcttcacct agatcctttt aaattaaaaa 
tgaagtttta aatcaatcta 1440 aagtatatat gagtaacctg atcaggactc ttccttttca tgaacaataa aactgtctgc 1500 ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa cgtcttgctc 1560 taggccgcga ttaaattcca acatggatgc tgatttatat gggtataaat gggctcgcga 1620 taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg atgcgccaga 1680 gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg agatggtcag 1740 actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta tccgtactcc 1800 tgatgatgca tggttactca ccactgcgat ccccgggaaa acagcattcc aggtattaga 1860 agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc tgcgccggtt 1920 gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc gtctcgctca 1980 ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg acgagcgtaa 2040 tggctggcct gttgaacaag tctggaaaga aatgcataaa cttttgccat tctcaccgga 2100 ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg aggggaaatt 2160 aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg atcttgccat 2220 cctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt ttcaaaaata 2280 tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg atgagttttt 2340 ctaagaattc gcgcaattaa ccctcactaa agggaacaaa agctgggtac cgggcccgtt 2400 gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 2460 catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 2520 acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 2580 ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 2640 aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct 2700 ggcattatgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat 2760 tagtcatcgc tattaccatg gtgatgcggt tttggcagta catcaatggg cgtggatagc 2820 ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg agtttgtttt 2880 ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa 2940 tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcaagct ttgctgcttg 3000 cacttcagag ttctaggaga gtccctccta gtctctcctc tccgaggagg taccgagacc 3060 tcaaaataaa ggagtgattg ccttactgcc gagtggagag tgattactga 
gcggccggtg 3120 tatcgggagt cgtcccttaa tctgtgcaat accagagcgg ctctcgcagc tggcgggaat 3180 tcgattctag aggtgataga aatgccagaa aactatgcaa aaacaagaat cataaacagg 3240 aaaaaaagag aactcagcca caagaggaag aagagaggcg ttggcttggt cattatgcta 3300 gttatcatgg caatagtagc tgccgcaggg gcttctctgg gagtcgcaaa cgcgattcag 3360 cagtcttaca ctaaggcagc tgtccagacc cttgctaatg caactgctgc acagcaggat 3420 gtgttagagg caacctatgc catggtacag catgtggcta aaggcgtacg aatcttggaa 3480 gctcgagtgg ctcgagtgga agctatcaca gatagaataa tgctatacca agaattggat 3540 tgttggcact aggatccatc gccacc 3566 <210> 74 <211> 7623 <212> DNA <213> Artificial <220> <223> pHGVSV-G <400> 74 gcgcgcgttg acattgatta ttgactagtt attaatagta atcaattacg gggtcattag 60 ttcatagccc atatatggag ttccgcgtta cataacttac ggtaaatggc ccgcctggct 120 gaccgcccaa cgacccccgc ccattgacgt caataatgac gtatgttccc atagtaacgc 180 caatagggac tttccattga cgtcaatggg tggactattt acggtaaact gcccacttgg 240 cagtacatca agtgtatcat atgccaagta cgccccctat tgacgtcaat gacggtaaat 300 ggcccgcctg gcattatgcc cagtacatga ccttatggga ctttcctact tggcagtaca 360 tctacgtatt agtcatcgct attaccatgg tgatgcggtt ttggcagtac atcaatgggc 420 gtggatagcg gtttgactca cggggatttc caagtctcca ccccattgac gtcaatggga 480 gtttgttttg gcaccaaaat caacgggact ttccaaaatg tcgtaacaac tccgccccat 540 tgacgcaaat gggcggtagg cgtgtacggt gggaggtcta tataagcaga gctctctggc 600 taactagaga acccactgct tactggctta tcgaaattaa tacgactcac tatagggaga 660 cccaagcttg gtaccgagct cggatccact agtaacggcc gccagtgtgc tggaattcga 720 tgatcctgag aacttcaggg tgagtctatg ggacccttga tgttttcttt ccccttcttt 780 tctatggtta agttcatgtc ataggaaggg gagaagtaac agggtacagt ttagaatggg 840 aaacagacga atgattgcat cagtgtggaa gtctcaggat cgttttagtt tcttttattt 900 gctgttcata acaattgttt tcttttgttt aattcttgct ttcttttttt ttcttctccg 960 caatttttac tattatactt aatgccttaa cattgtgtat aacaaaagga aatatctctg 1020 agatacatta agtaacttaa aaaaaaactt tacacagtct gcctagtaca ttactatttg 1080 gaatatatgt gtgcttattt gcatattcat aatctcccta ctttattttc ttttattttt 1140 aattgataca taatcattat acatatttat gggttaaagt 
gtaatgtttt aatatgtgta 1200 cacatattga ccaaatcagg gtaattttgc atttgtaatt ttaaaaaatg ctttcttctt 1260 ttaatatact tttttgttta tcttatttct aatactttcc ctaatctctt tctttcaggg 1320 caataatgat acaatgtatc atgcctcttt gcaccattct aaagaataac agtgataatt 1380 tctgggttaa ggcaatagca atatttctgc atataaatat ttctgcatat aaattgtaac 1440 tgatgtaaga ggtttcatat tgctaatagc agctacaatc cagctaccat taggcccttt 1500 tgctaatcat gttcatacct cttatcttcc tcccacagct cctgggcaac gtgctggtct 1560 gtgtgctggc ccatcacttt ggcaaagaat cactagtgaa ttctgcagat atccatcaca 1620 ctggcggccg ctcgaggaat tctgacacta tgaagtgcct tttgtactta gcctttttat 1680 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga 1740 aaaatgttcc ttctaattac cattattgcc cgtcaagctc agatttaaat tggcataatg 1800 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 1860 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac 1920 cgaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa 1980 gcattgaaca aacgaaacaa ggaacttggc tgaatccagg cttccctcct caaagttgtg 2040 gatatgcaac tgtgacggat gccgaagcag tgattgtcca ggtgactcct caccatgtgc 2100 tggttgatga atacacagga gaatgggttg attcacagtt catcaacgga aaatgcagca 2160 attacatatg ccccactgtc cataactcta caacctggca ttctgactat aaggtcaaag 2220 ggctatgtga ttctaacctc atttccatgg acatcacctt cttctcagag gacggagagc 2280 tatcatccct gggaaaggag ggcacagggt tcagaagtaa ctactttgct tatgaaactg 2340 gaggcaaggc ctgcaaaatg caatactgca agcattgggg agtcagactc ccatcaggtg 2400 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag 2460 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtcta attcaggacg 2520 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 2580 ttccaatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggtcctg 2640 ctttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agagtcgata 2700 ttgctgctcc aatcctctca agaatggtcg gaatgatcag tggaactacc acagaaaggg 2760 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga 2820 ggaccagttc aggatataag tttcctttat acatgattgg acatggtatg 
ttggactccg 2880 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcaa gacgctgctt 2940 cgcaacttcc tgatgatgag agtttatttt ttggtgatac tgggctatcc aaaaatccaa 3000 tcgagcttgt agaaggttgg ttcagtagtt ggaaaagctc tattgcctct tttttcttta 3060 tcatagggtt aatcattgga ctattcttgg ttctccgagt tggtatccat ctttgcatta 3120 aattaaagca caccaagaaa agacagattt atacagacat agagatgaac cgacttggaa 3180 agtaactcaa atcctgcaca acagattctt catgtttgga ccaaatcaac ttgtgatacc 3240 atgctcaaag aggcctcaat tatatttgag tttttaattt ttatgaaaaa aaaaaaaaaa 3300 aacggaattc ctcgagcatg catctagagg gccctattct atagtgtcac ctaaatgcta 3360 gagctcgctg atcagcctcg actgtgcctt ctagttgcca gccatctgtt gtttgcccct 3420 cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc taataaaatg 3480 aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt ggggtggggc 3540 aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggat gcggtgggct 3600 ctatggcttc tgaggcggaa agaaccagtg gcggtaatac ggttatccac agaatcaggg 3660 gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag 3720 gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga 3780 cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct 3840 ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc 3900 tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg 3960 gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc 4020 tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca 4080 ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag 4140 ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct 4200 ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc 4260 accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga 4320 tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca 4380 cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat 4440 taaaaatgaa gttttaaatc aatctaaagt atatatgagt aacctgaggc tatggcaggg 4500 cctgccgccc cgacgttggc tgcgagccct gggccttcac ccgaacttgg ggggtggggt 
4560 ggggaaaagg aagaaacgcg ggcgtattgg ccccaatggg gtctcggtgg ggtatcgaca 4620 gagtgccagc cctgggaccg aaccccgcgt ttatgaacaa acgacccaac accgtgcgtt 4680 ttattctgtc tttttattgc cgtcatagcg cgggttcctt ccggtattgt ctccttccgt 4740 gtttcagtta gcctccccct agggtgggcg aagaactcca gcatgagatc cccgcgctgg 4800 aggatcatcc agccggcgtc ccggaaaacg attccgaagc ccaacctttc atagaaggcg 4860 gcggtggaat cgaaatctcg tgatggcagg ttgggcgtcg cttggtcggt catttcgaac 4920 cccagagtcc cgctcagaag aactcgtcaa gaaggcgata gaaggcgatg cgctgcgaat 4980 cgggagcggc gataccgtaa agcacgagga agcggtcagc ccattcgccg ccaagctctt 5040 cagcaatatc acgggtagcc aacgctatgt cctgatagcg gtccgccaca cccagccggc 5100 cacagtcgat gaatccagaa aagcggccat tttccaccat gatattcggc aagcaggcat 5160 cgccatgggt cacgacgaga tcctcgccgt cgggcatgct cgccttgagc ctggcgaaca 5220 gttcggctgg cgcgagcccc tgatgctctt gatcatcctg atcgacaaga ccggcttcca 5280 tccgagtacg tgctcgctcg atgcgatgtt tcgcttggtg gtcgaatggg caggtagccg 5340 gatcaagcgt atgcagccgc cgcattgcat cagccatgat ggatactttc tcggcaggag 5400 caaggtgaga tgacaggaga tcctgccccg gcacttcgcc caatagcagc cagtcccttc 5460 ccgcttcagt gacaacgtcg agcacagctg cgcaaggaac gcccgtcgtg gccagccacg 5520 atagccgcgc tgcctcgtct tgcagttcat tcagggcacc ggacaggtcg gtcttgacaa 5580 aaagaaccgg gcgcccctgc gctgacagcc ggaacacggc ggcatcagag cagccgattg 5640 tctgttgtgc ccagtcatag ccgaatagcc tctccaccca agcggccgga gaacctgcgt 5700 gcaatccatc ttgttcaatc atgcgaaacg atcctcatcc tgtctcttga tcgatctttg 5760 caaaagccta ggcctccaaa aaagcctcct cactacttct ggaatagctc agaggccgag 5820 gcggcctcgg cctctgcata aataaaaaaa attagtcagc catggggcgg agaatgggcg 5880 gaactgggcg gagttagggg cgggatgggc ggagttaggg gcgggactat ggttgctgac 5940 taattgagat gcatgctttg catacttctg cctgctgggg agcctgggga ctttccacac 6000 ctggttgctg actaattgag atgcatgctt tgcatacttc tgcctgctgg ggagcctggg 6060 gactttccac accctaactg acacacattc cacagctggt tctttccgcc tcaggactct 6120 tcctttttca ataaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa 6180 tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc 6240 
tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct 6300 gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca 6360 gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt 6420 aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt 6480 gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc 6540 ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc 6600 tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt 6660 atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact 6720 ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc 6780 ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt 6840 ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg 6900 atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct 6960 gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa 7020 tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt 7080 ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc 7140 acatttcccc gaaaagtgcc acctgacgcg ccctgtagcg gcgcattaag cgcggcgggt 7200 gtggtggtta cgcgcagcgt gaccgctaca cttgccagcg ccctagcgcc cgctcctttc 7260 gctttcttcc cttcctttct cgccacgttc gccggctttc cccgtcaagc tctaaatcgg 7320 gggctccctt tagggttccg atttagtgct ttacggcacc tcgaccccaa aaaacttgat 7380 tagggtgatg gttcacgtag tgggccatcg ccctgataga cggtttttcg ccctttgacg 7440 ttggagtcca cgttctttaa tagtggactc ttgttccaaa ctggaacaac actcaaccct 7500 atctcggtct attcttttga tttataaggg attttgccga tttcggccta ttggttaaaa 7560 aatgagctga tttaacaaaa atttaacgcg aattttaaca aaatattaac gcttacaatt 7620 tac 7623 <210> 75 <211> 5419 <212> DNA <213> Artificial <220> <223> pHYK/rev <400> 75 gcgcgcgttg acattgatta ttgactagtt attaatagta atcaattacg gggtcattag 60 ttcatagccc atatatggag ttccgcgtta cataacttac ggtaaatggc ccgcctggct 120 gaccgcccaa cgacccccgc ccattgacgt caataatgac gtatgttccc atagtaacgc 180 caatagggac tttccattga cgtcaatggg tggactattt acggtaaact gcccacttgg 240 
cagtacatca agtgtatcat atgccaagta cgccccctat tgacgtcaat gacggtaaat 300 ggcccgcctg gcattatgcc cagtacatga ccttatggga ctttcctact tggcagtaca 360 tctacgtatt agtcatcgct attaccatgg tgatgcggtt ttggcagtac atcaatgggc 420 gtggatagcg gtttgactca cggggatttc caagtctcca ccccattgac gtcaatggga 480 gtttgttttg gcaccaaaat caacgggact ttccaaaatg tcgtaacaac tccgccccat 540 tgacgcaaat gggcggtagg cgtgtacggt gggaggtcta tataagcaga gctctctggc 600 taactagaga acccactgct tactggctta tcgaaattaa tacgactcac tatagggaga 660 cccaagctta gatggatgct ggggccagat acatgcgctt aactgggaag gaaaactggg 720 ttgaagtaac catggacgga gagaaggaaa ggaaaagaga aggtttcact gcgggacagc 780 aagatataca gaactctaag taccccgaca taccaacggg tcacagtcat catggaaaca 840 agagcagacg tcgcaggaga aaatcaggat tttggcgatg gcttagagga atcagacaac 900 agcgaaacaa gcgaaagagt gacagtacag aaagcttgga gccgtgcctg ggagctttgg 960 cagaactcac cctggaagga gccatggaaa aggggcctgc tgaggctgct cgtccttccg 1020 ctgacgatgg gaatctggat aaatggatgg cttggagaac accacaaaaa taagaattct 1080 gcagatatcc atcacactgg cggccgctcg agcatgcatc tagagggccc tattctatag 1140 tgtcacctaa atgctagagc tcgctgatca gcctcgactg tgccttctag ttgccagcca 1200 tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg aaggtgccac tcccactgtc 1260 ctttcctaat aaaatgagga aattgcatcg cattgtctga gtaggtgtca ttctattctg 1320 gggggtgggg tggggcagga cagcaagggg gaggattggg aagacaatag caggcatgct 1380 ggggatgcgg tgggctctat ggcttctgag gcggaaagaa ccagtggcgg taatacggtt 1440 atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc 1500 caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga 1560 gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata 1620 ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac 1680 cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg 1740 taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc 1800 cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag 1860 acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt 1920 aggcggtgct acagagttct 
tgaagtggtg gcctaactac ggctacacta gaaggacagt 1980 atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg 2040 atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac 2100 gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca 2160 gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac 2220 ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaacc 2280 tgaggctatg gcagggcctg ccgccccgac gttggctgcg agccctgggc cttcacccga 2340 acttgggggg tggggtgggg aaaaggaaga aacgcgggcg tattggcccc aatggggtct 2400 cggtggggta tcgacagagt gccagccctg ggaccgaacc ccgcgtttat gaacaaacga 2460 cccaacaccg tgcgttttat tctgtctttt tattgccgtc atagcgcggg ttccttccgg 2520 tattgtctcc ttccgtgttt cagttagcct ccccctaggg tgggcgaaga actccagcat 2580 gagatccccg cgctggagga tcatccagcc ggcgtcccgg aaaacgattc cgaagcccaa 2640 cctttcatag aaggcggcgg tggaatcgaa atctcgtgat ggcaggttgg gcgtcgcttg 2700 gtcggtcatt tcgaacccca gagtcccgct cagaagaact cgtcaagaag gcgatagaag 2760 gcgatgcgct gcgaatcggg agcggcgata ccgtaaagca cgaggaagcg gtcagcccat 2820 tcgccgccaa gctcttcagc aatatcacgg gtagccaacg ctatgtcctg atagcggtcc 2880 gccacaccca gccggccaca gtcgatgaat ccagaaaagc ggccattttc caccatgata 2940 ttcggcaagc aggcatcgcc atgggtcacg acgagatcct cgccgtcggg catgctcgcc 3000 ttgagcctgg cgaacagttc ggctggcgcg agcccctgat gctcttgatc atcctgatcg 3060 acaagaccgg cttccatccg agtacgtgct cgctcgatgc gatgtttcgc ttggtggtcg 3120 aatgggcagg tagccggatc aagcgtatgc agccgccgca ttgcatcagc catgatggat 3180 actttctcgg caggagcaag gtgagatgac aggagatcct gccccggcac ttcgcccaat 3240 agcagccagt cccttcccgc ttcagtgaca acgtcgagca cagctgcgca aggaacgccc 3300 gtcgtggcca gccacgatag ccgcgctgcc tcgtcttgca gttcattcag ggcaccggac 3360 aggtcggtct tgacaaaaag aaccgggcgc ccctgcgctg acagccggaa cacggcggca 3420 tcagagcagc cgattgtctg ttgtgcccag tcatagccga atagcctctc cacccaagcg 3480 gccggagaac ctgcgtgcaa tccatcttgt tcaatcatgc gaaacgatcc tcatcctgtc 3540 tcttgatcga tctttgcaaa agcctaggcc tccaaaaaag cctcctcact acttctggaa 3600 tagctcagag gccgaggcgg cctcggcctc 
tgcataaata aaaaaaatta gtcagccatg 3660 gggcggagaa tgggcggaac tgggcggagt taggggcggg atgggcggag ttaggggcgg 3720 gactatggtt gctgactaat tgagatgcat gctttgcata cttctgcctg ctggggagcc 3780 tggggacttt ccacacctgg ttgctgacta attgagatgc atgctttgca tacttctgcc 3840 tgctggggag cctggggact ttccacaccc taactgacac acattccaca gctggttctt 3900 tccgcctcag gactcttcct ttttcaataa atcaatctaa agtatatatg agtaaacttg 3960 gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg 4020 ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc 4080 atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc 4140 agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc 4200 ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag 4260 tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat 4320 ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg 4380 caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt 4440 gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag 4500 atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg 4560 accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt 4620 aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct 4680 gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac 4740 tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat 4800 aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat 4860 ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca 4920 aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgcgccct gtagcggcgc 4980 attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct 5040 agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg 5100 tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga 5160 ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct gatagacggt 5220 ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt tccaaactgg 5280 aacaacactc aaccctatct cggtctattc ttttgattta 
taagggattt tgccgatttc 5340 ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt ttaacaaaat 5400 attaacgctt acaatttac 5419 <210> 76 <211> 5729 <212> DNA <213> Artificial <220> <223> pHYK/vif <400> 76 gcgcgcgttg acattgatta ttgactagtt attaatagta atcaattacg gggtcattag 60 ttcatagccc atatatggag ttccgcgtta cataacttac ggtaaatggc ccgcctggct 120 gaccgcccaa cgacccccgc ccattgacgt caataatgac gtatgttccc atagtaacgc 180 caatagggac tttccattga cgtcaatggg tggactattt acggtaaact gcccacttgg 240 cagtacatca agtgtatcat atgccaagta cgccccctat tgacgtcaat gacggtaaat 300 ggcccgcctg gcattatgcc cagtacatga ccttatggga ctttcctact tggcagtaca 360 tctacgtatt agtcatcgct attaccatgg tgatgcggtt ttggcagtac atcaatgggc 420 gtggatagcg gtttgactca cggggatttc caagtctcca ccccattgac gtcaatggga 480 gtttgttttg gcaccaaaat caacgggact ttccaaaatg tcgtaacaac tccgccccat 540 tgacgcaaat gggcggtagg cgtgtacggt gggaggtcta tataagcaga gctctctggc 600 taactagaga acccactgct tactggctta tcgaaattaa tacgactcac tatagggaga 660 cccaagcttg gtaccgagct cggatccgcg atgcaaaatt catcccgcca ccaacaaaag 720 aaaaggaata aaaaacctgg accagaatta cccttagcac tatggatcca tatagcagaa 780 agcattaatg gggatagctc atggtacata acaatgagac tgcaacagat gatgtgggga 840 aaaagaggaa ataagttaca atataagaat gaagacaggg aatatgaaaa ttgggaaatt 900 acatcatggg gatggaaaat gcacctaagg agagtgaaac aatggataca agacaacagg 960 agaggaagcc catggcagta caaagtagga ggaacatgga aaagtatagg agtgtggttc 1020 ctgcaagcag gagattacag aaaggtagac aggcacttct ggtgggcatg gaggatactg 1080 atatgttcct gcaggaaaga aaagtttgat ataagagaat ttatgagagg aagacataga 1140 tgggatttgt gcaaatcctg tgctcaagga gaagtagtaa agcatactag aacaaaaagt 1200 ctggaaagac tagtactgct acagatggta gaacagcatg tgtttcaagt attgccattg 1260 tggagagcca ggagaagtag tacaacagat ttcccatggt gcagggacac aacgggatac 1320 acgcatgcgt ggtctgtcca ggagtgctgg ttgatggaat atctcttaga ggatgagtga 1380 ccggaattct gcagatatcc atcacactgg cggccgctcg agcatgcatc tagagggccc 1440 tattctatag tgtcacctaa atgctagagc tcgctgatca gcctcgactg tgccttctag 1500 ttgccagcca tctgttgttt gcccctcccc 
cgtgccttcc ttgaccctgg aaggtgccac 1560 tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga gtaggtgtca 1620 ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg aagacaatag 1680 caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa ccagtggcgg 1740 taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc 1800 agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc 1860 cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac 1920 tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc 1980 tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata 2040 gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc 2100 acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca 2160 acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag 2220 cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta 2280 gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg 2340 gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc 2400 agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt 2460 ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa 2520 ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat 2580 atgagtaacc tgaggctatg gcagggcctg ccgccccgac gttggctgcg agccctgggc 2640 cttcacccga acttgggggg tggggtgggg aaaaggaaga aacgcgggcg tattggcccc 2700 aatggggtct cggtggggta tcgacagagt gccagccctg ggaccgaacc ccgcgtttat 2760 gaacaaacga cccaacaccg tgcgttttat tctgtctttt tattgccgtc atagcgcggg 2820 ttccttccgg tattgtctcc ttccgtgttt cagttagcct ccccctaggg tgggcgaaga 2880 actccagcat gagatccccg cgctggagga tcatccagcc ggcgtcccgg aaaacgattc 2940 cgaagcccaa cctttcatag aaggcggcgg tggaatcgaa atctcgtgat ggcaggttgg 3000 gcgtcgcttg gtcggtcatt tcgaacccca gagtcccgct cagaagaact cgtcaagaag 3060 gcgatagaag gcgatgcgct gcgaatcggg agcggcgata ccgtaaagca cgaggaagcg 3120 gtcagcccat tcgccgccaa gctcttcagc aatatcacgg gtagccaacg ctatgtcctg 3180 atagcggtcc gccacaccca gccggccaca gtcgatgaat 
ccagaaaagc ggccattttc 3240 caccatgata ttcggcaagc aggcatcgcc atgggtcacg acgagatcct cgccgtcggg 3300 catgctcgcc ttgagcctgg cgaacagttc ggctggcgcg agcccctgat gctcttgatc 3360 atcctgatcg acaagaccgg cttccatccg agtacgtgct cgctcgatgc gatgtttcgc 3420 ttggtggtcg aatgggcagg tagccggatc aagcgtatgc agccgccgca ttgcatcagc 3480 catgatggat actttctcgg caggagcaag gtgagatgac aggagatcct gccccggcac 3540 ttcgcccaat agcagccagt cccttcccgc ttcagtgaca acgtcgagca cagctgcgca 3600 aggaacgccc gtcgtggcca gccacgatag ccgcgctgcc tcgtcttgca gttcattcag 3660 ggcaccggac aggtcggtct tgacaaaaag aaccgggcgc ccctgcgctg acagccggaa 3720 cacggcggca tcagagcagc cgattgtctg ttgtgcccag tcatagccga atagcctctc 3780 cacccaagcg gccggagaac ctgcgtgcaa tccatcttgt tcaatcatgc gaaacgatcc 3840 tcatcctgtc tcttgatcga tctttgcaaa agcctaggcc tccaaaaaag cctcctcact 3900 acttctggaa tagctcagag gccgaggcgg cctcggcctc tgcataaata aaaaaaatta 3960 gtcagccatg gggcggagaa tgggcggaac tgggcggagt taggggcggg atgggcggag 4020 ttaggggcgg gactatggtt gctgactaat tgagatgcat gctttgcata cttctgcctg 4080 ctggggagcc tggggacttt ccacacctgg ttgctgacta attgagatgc atgctttgca 4140 tacttctgcc tgctggggag cctggggact ttccacaccc taactgacac acattccaca 4200 gctggttctt tccgcctcag gactcttcct ttttcaataa atcaatctaa agtatatatg 4260 agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct 4320 gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg 4380 agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc 4440 cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa 4500 ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc 4560 cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt 4620 cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc 4680 ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt 4740 tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc 4800 catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt 4860 gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc 
gcgccacata 4920 gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga 4980 tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag 5040 catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa 5100 aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt 5160 attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga 5220 aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgcgccct 5280 gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg 5340 ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg 5400 gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac 5460 ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct 5520 gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt 5580 tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt 5640 tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt 5700 ttaacaaaat attaacgctt acaatttac 5729 <210> 77 <211> 9446 <212> DNA <213> Artificial <220> <223> pMGP/RRE <400> 77 aacaggaaag ttccattgga gccaagtaca ttgagtcaat agggactttc caatgggttt 60 tgcccagtac ataaggtcaa tgggaggtaa gccaatgggt ttttcccatt actggcacgt 120 atactgagtc attagggact ttccaatggg ttttgcccag tacataaggt caataggggt 180 gaatcaacag gaaagtccca ttggagccaa gtacactgag tcaataggga ctttccattg 240 ggttttgccc agtacaaaag gtcaataggg ggtgagtcaa tgggtttttc ccattattgg 300 cacgtacata aggtcaatag gggtgagtca ttgggttttt ccagccaatt taattaaaac 360 gccatgtact ttcccaccat tgacgtcaat gggctattga aactaatgca acgtgacctt 420 taaacggtac tttcccatag ctgattaatg ggaaagtacc gttctcgagc caatacacgt 480 caatgggaag tgaaagggca gccaaaacgt aacaccgccc cggttttccc tggaaattcc 540 atattggcac gcattctatt ggctgagctg cgttcacgtg ggtataagag gcgcgaccag 600 cgtcggtacc gtcgcagtct tcggtctgac caccgtagaa cgcagagctc ctcgctgcag 660 gcaagcttgg taccgagctc ggatcccggg gaggtaccaa aatccgccat ggtgagtcta 720 gatagagaca tggcgaggca agtctccggg gggaaaagag attatcctga gctcgaaaaa 780 tgtatcaagc atgcatgcaa gataaaagtt 
cgactcagag gggagcactt gacagaagga 840 aattgtttat ggtgccttaa aacattagat tacatgtttg aggaccataa agaggaacct 900 tggacaaaag taaaatttag gacaatatgg cagaaggtga agaatctaac tcctgaggag 960 agtaacaaaa aagactttat gtctttgcag gccacattag cgggtctaat gtgttgccaa 1020 atggggatga gacctgagac attgcaagat gcaatggcta cagtaatcat gaaagatggg 1080 ttactggaac aagaggaaaa gaaggaagac aaaagagaaa aggaagagag tgtcttccca 1140 atagtagtgc aagcagcagg agggagaagc tggaaagcag tagattctgt aatgttccag 1200 caactgcaaa cagtagcaat gcagcatggc ctcgtgtctg aggactttga aaggcagttg 1260 gcatattatg ctactacctg gacaagtaaa gacatactag aagtattggc catgatgcct 1320 ggaaatagag ctcaaaagga gttaattcaa gggaaattaa atgaagaagc agaaaggtgg 1380 agaaggaata atccaccacc tccagcagga ggaggattaa cagtggatca aattatgggg 1440 gtaggacaaa caaatcaagc agcagcacaa gctaacatgg atcaggcaag gcaaatatgc 1500 ctgcaatggg taataaatgc attaagagca gtaagacata tggcgcacag gccagggaat 1560 ccaatgctag taaagcaaaa aacgaatgag ccatatgaag attttgcagc aagactgcta 1620 gaagcaatag atgcagagcc agttacacag cctataaaag attatctaaa gctaacacta 1680 tcttatacaa atgcatcagc agattgtcag aagcaaatgg atagaacact aggacaaaga 1740 gtacaacaag ctagtgtaga agaaaaaatg caagcatgta gagatgtggg atcagaaggg 1800 ttcaaaatgc aattgttagc acaagcatta aggccaggaa aaggaaaagg gaatggacag 1860 ccacaaaggt gttacaactg tggaaaaccg ggacatcaag caaggcaatg tagacaagga 1920 atcatatgtc acaactgtgg aaagagagga catatgcaaa aagaatgcag aggaaagaga 1980 gacataaggg gaaaacagca gggaaacggg aggaggggga tacgtgtggt gccgtccgct 2040 cctcctatgg aataacttca gcaccaccta tggttcaggt ccgcataggt tcccagcaga 2100 ggaacttgtt atttgatacc ggggcggacc gaactatagt tagatggcat gagggctcgg 2160 gaaacccagc cggaaggata aaactgcaag gaataggagg aatagtagaa ggagaaaaat 2220 ggaataatgt agaattagaa tataaaggag aaacaagaaa gggaacaata gtagtgttac 2280 cacaaagtcc agtagaagta ttaggacgag ataacatggc ccgatttgga ataaagataa 2340 taatggcaaa tttagaggaa aaaagaatcc caattacaaa agtaaaattg aaagagggat 2400 gtacgggtcc acatgtccca caatggccat taacagaaga gaaattaaaa ggtctaacag 2460 aaatcataga taaattagtg gaagaaggaa aactaggaaa 
ggcaccccca cattggacat 2520 gtaatactcc aatcttttgc ataaaaaaga aatcagggaa gtggagaatg ttaatagatt 2580 tcagagaatt gaacaaacag acagaagatt taacagaagc gcagttagga ctcccgcatc 2640 cgggaggact acaaaagaaa aaacatgtta caatattgga cataggagat gcatatttta 2700 ctatacccct atatgaacca tatcgagagt acacatgttt tactctatta agtcctaata 2760 atctaggacc atgtaaaaga tactattgga aagtgctgcc acaaggttgg aaattgagtc 2820 catctgtata tcaatttact atgcaggaga tcttagagga ttggatacag cagcatccag 2880 aaattcaatt tggcatatat atggatgata tttacatagg aagtgattta gaaattaaaa 2940 agcatagaga aatagtgaaa gatttagcca attatattgc ccaatatgga ttcactctgc 3000 cagaagagaa gagacaaaag ggatatccag caaaatggct aggatttgaa ctacacccgc 3060 agacctggaa atttcagaag catacattac ctgaattaac aaagggaaca ataacattaa 3120 ataaattaca gaaattagta ggagaattag tatggagaca atccataatt gggaaaagca 3180 ttcctaacat tctgaaatta atggaaggag atagagaatt acaaagtgaa agaaaaattg 3240 aagaagtaca tgtgaaagaa tgggaagcat gtaggaaaaa attagaagaa atggaaggaa 3300 attattataa taaagacaaa gatgtctatg gacaattggc ttggggagac aaagctatag 3360 aatatatagt gtatcaggag aaagggaaac cattatgggt aaatgtggtt cacaatataa 3420 agaacctaag catcccgcaa caggttatta aagcagcgca aaaattaacc caagaagtca 3480 tcattaggac aggaaaaata ccatggatat tgttgccagg gaaagaagaa gattggagac 3540 tagaattgca attagggaac atcacatgga tgccaaaatt ttggtcctgt tatcgaggac 3600 atacaagatg gagaaaaaga aatataatag aagaagtagt agaagggcct acatattata 3660 cagatggagg aaaaaagaat aaagtaggaa gtctagggtt catagtatca acaggggaaa 3720 aatttagaaa gcatgaagag ggcacaaacc agcaactaga attaagagcc atagaggaag 3780 ctctaaaaca agggcctcaa acaatgaatt tagtaacaga tagtagatat gcatttgaat 3840 ttttattaag aaattgggat gaagaagtaa taaagaatcc aattcaagca agaattatgg 3900 aaattgccca caagaaagat aggataggag tgcattgggt gccaggacat aaagggattc 3960 cccaaaatga agaaatagac aaatatattt cggaaatatt tcttgcaaaa gaaggagaag 4020 gaattctccc aaaaagagaa gaggatgcag ggtatgattt aatatgccca gaagaggtta 4080 ccatagagcc aggacaagtg aaatgcatcc ccatagagct aagattaaat ttaaagaaat 4140 cacaatgggc tatgattgct acaaaaagca gcatggctgc caaaggagtg 
ttcacacaag 4200 gaggaatcat agactcagga tatcagggac aaatacaggt aataatgtat aatagcaata 4260 aaatagcagt agtcataccc caagggagaa aatttgcaca attaatatta atggataaaa 4320 agcatgaaaa attggaaccc tggggggaaa gcagaaaaac agaaagggga gaaaaaggat 4380 ttgggtctac aggaatgtat tggatagaaa atattcctct ggcagaggaa gaccacacaa 4440 aatggcatca agatgcccga tcattgcatc tagaatttga aattccaaga acagcagcag 4500 aagacatagt aaatcaatgt gaaatatgca aagaaggcag gacacctgca gtaattagag 4560 gcggaaacaa aaggggggta gatcattggc aagtggatta tacccattat gaaaatatca 4620 tactattagt atgggtagaa acaaattcag gactaatata tgcagaaaaa gtaaaaggag 4680 aatcagggca agaattcaga ataaaagtga tgcaatggta tgcattattt ggtccagagt 4740 cattgcagtc agacaatgga cctgcatttg cagcagagcc cacacagctg ttaatgcaat 4800 acctaggagt aaaacacaca acaggcatac cttggaatcc acagtctcag gctatagtag 4860 aaagggcaca tcaactattg aaaagcactt taaagaagtt ccagccacaa tttgtcgctg 4920 tagaatcagc catagcagca gccctagtcg ccataaatat aaaaagaaag ggtgggctgg 4980 ggacaagccc tatggatatt tttatatata ataaagaaca gaaaagaata aataataaat 5040 ataataaaaa ttctcaaaaa attcaattct gttattacag aataaggaaa agaggacatc 5100 caggagagtg gaaaggacca acccaggtac tgtggaaagg ggaaggagca attgtggtaa 5160 aggatataga aagtgaaaag tatttagtaa taccttacaa agatgcaaaa ttcatcccgc 5220 caccaacaaa agaaaaggaa taaaaaacct ggaccagaat tacccttagc actatggatc 5280 cactagtaac ggccgccagt gtgctggaat tctgcagata tccatcacac tggcggccgg 5340 gctgcaggaa ttcgatagaa aagatatcaa aaacaagaat cataaacagg aaaaaaagag 5400 aactcagcca caagaggaag aagagaggcg ttggcttggt cattatgcta gttatcatgg 5460 caatagtagc tgccgcaggg gcttctctgg gagtcgcaaa cgcgattcag cagtcttaca 5520 ctaaggcagc tgtccagacc cttgctaatg caactgctgc acagcaggat gtgttagagg 5580 caacctatgc catggtacag catgtggcta aaggcgtacg aatcttggaa gctcgagtgg 5640 ctcgagtgga agctatcaca gatagaatag cggccgccca tcaagcttat cgataccgtc 5700 ggccgctcga gcatgcatct agagggccct attctatagt gtcacctaaa tgctagagct 5760 cgctgatcag cctcgactgt gccttctagt tgccagccat ctgttgtttg cccctccccc 5820 gtgccttcct tgaccctgga aggtgccact cccactgtcc tttcctaata aaatgaggaa 
5880 attgcatcgc attgtctgag taggtgtcat tctattctgg ggggtggggt ggggcaggac 5940 agcaaggggg aggattggga agacaatagc aggcatgctg gggatgcggt gggctctatg 6000 gcttctgagg cggaaagaac cagtggcggt aatacggtta tccacagaat caggggataa 6060 cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc 6120 gttgctggcg tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc 6180 aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag 6240 ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct 6300 cccttcggga agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta 6360 ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc 6420 cttatccggt aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc 6480 agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt 6540 gaagtggtgg cctaactacg gctacactag aaggacagta tttggtatct gcgctctgct 6600 gaagccagtt accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc 6660 tggtagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca 6720 agaagatcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta 6780 agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa 6840 atgaagtttt aaatcaatct aaagtatata tgagtaacct gaggctatgg cagggcctgc 6900 cgccccgacg ttggctgcga gccctgggcc ttcacccgaa cttggggggt ggggtgggga 6960 aaaggaagaa acgcgggcgt attggcccca atggggtctc ggtggggtat cgacagagtg 7020 ccagccctgg gaccgaaccc cgcgtttatg aacaaacgac ccaacaccgt gcgttttatt 7080 ctgtcttttt attgccgtca tagcgcgggt tccttccggt attgtctcct tccgtgtttc 7140 agttagcctc cccctagggt gggcgaagaa ctccagcatg agatccccgc gctggaggat 7200 catccagccg gcgtcccgga aaacgattcc gaagcccaac ctttcataga aggcggcggt 7260 ggaatcgaaa tctcgtgatg gcaggttggg cgtcgcttgg tcggtcattt cgaaccccag 7320 agtcccgctc agaagaactc gtcaagaagg cgatagaagg cgatgcgctg cgaatcggga 7380 gcggcgatac cgtaaagcac gaggaagcgg tcagcccatt cgccgccaag ctcttcagca 7440 atatcacggg tagccaacgc tatgtcctga tagcggtccg ccacacccag ccggccacag 7500 tcgatgaatc cagaaaagcg gccattttcc accatgatat tcggcaagca ggcatcgcca 7560 
tgggtcacga cgagatcctc gccgtcgggc atgctcgcct tgagcctggc gaacagttcg 7620 gctggcgcga gcccctgatg ctcttgatca tcctgatcga caagaccggc ttccatccga 7680 gtacgtgctc gctcgatgcg atgtttcgct tggtggtcga atgggcaggt agccggatca 7740 agcgtatgca gccgccgcat tgcatcagcc atgatggata ctttctcggc aggagcaagg 7800 tgagatgaca ggagatcctg ccccggcact tcgcccaata gcagccagtc ccttcccgct 7860 tcagtgacaa cgtcgagcac agctgcgcaa ggaacgcccg tcgtggccag ccacgatagc 7920 cgcgctgcct cgtcttgcag ttcattcagg gcaccggaca ggtcggtctt gacaaaaaga 7980 accgggcgcc cctgcgctga cagccggaac acggcggcat cagagcagcc gattgtctgt 8040 tgtgcccagt catagccgaa tagcctctcc acccaagcgg ccggagaacc tgcgtgcaat 8100 ccatcttgtt caatcatgcg aaacgatcct catcctgtct cttgatcgat ctttgcaaaa 8160 gcctaggcct ccaaaaaagc ctcctcacta cttctggaat agctcagagg ccgaggcggc 8220 ctcggcctct gcataaataa aaaaaattag tcagccatgg ggcggagaat gggcggaact 8280 gggcggagtt aggggcggga tgggcggagt taggggcggg actatggttg ctgactaatt 8340 gagatgcatg ctttgcatac ttctgcctgc tggggagcct ggggactttc cacacctggt 8400 tgctgactaa ttgagatgca tgctttgcat acttctgcct gctggggagc ctggggactt 8460 tccacaccct aactgacaca cattccacag ctggttcttt ccgcctcagg actcttcctt 8520 tttcaataaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt 8580 aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact 8640 ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat 8700 gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg 8760 aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg 8820 ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat 8880 tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc 8940 ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt 9000 cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc 9060 agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga 9120 gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc 9180 gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa 9240 acgttcttcg 
gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta 9300 acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg 9360 agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg 9420 aatactcata ctcttccttt ttcaat 9446 <210> 78 <211> 7856 <212> DNA <213> Artificial <220> <223> pCAH/SINd60/hlacZ <400> 78 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta tcatatgcca agtacgcccc ctattgacgt caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttat gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacatcaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa caactccgcc ccattgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcaa gcttgctgct 600 tgcacttcag agttctagga gagtccctcc tagtctctcc tctccgagga ggtaccgaga 660 cctcaaaata aaggagtgat tgccttactg ccgagtggag agtgattact gagcggccgg 720 tgtatcggga gtcgtccctt aatctgtgca ataccagagc ggctctcgca gctggcgccc 780 aacgtggggc ccgaggagaa gaaaagaaag cggccctgag aactcggctt ctgaaaaaga 840 ggaagaggac aagttgctat agcaacaaga gagaagaagt agagcaaagg tccagtggct 900 cggaaaaaga ggaactgaaa cttcggggac gcctgaagga gtaaggtaag tgactctgct 960 gtacgcgggg cgaggcagag gtttccttct aaattgaaag agaagtgttg ctgcgagagg 1020 tcttggtggt cgagaatcct gtacaaaaaa aaggagggat ctcggtcagg accaggaccc 1080 ctgggagtaa tacaacagca acaccgtaag aaaatccgcc taggtgagtc tagatagaga 1140 ctaggcgagg caagtctccg gggggaaaag agattatcct gcaggaattc gattctagag 1200 gtgatagaaa tgccagaaaa ctatgcaaaa acaagaatca taaacaggaa aaaaagagaa 1260 ctcagccaca agaggaagaa gagaggcgtt ggcttggtca ttatgctagt tatcatggca 1320 atagtagctg ccgcaggggc ttctctggga gtcgcaaacg cgattcagca gtcttacact 1380 aaggcagctg tccagaccct tgctaatgca actgctgcac agcaggatgt 
gttagaggca 1440 acctatgcca tggtacagca tgtggctaaa ggcgtacgaa tcttggaagc tcgagtggct 1500 cgagtggaag ctatcacaga tagaataatg ctataccaag aattggattg ttggcactag 1560 gatccatcag ccaccattaa cgcttacaat ttacgcgcgc gttgacattg attattgact 1620 agttattaat agtaatcaat tacggggtca ttagttcata gcccatatat ggagttccgc 1680 gttacataac ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg 1740 acgtcaataa tgacgtatgt tcccatagta acgccaatag ggactttcca ttgacgtcaa 1800 tgggtggact atttacggta aactgcccac ttggcagtac atcaagtgta tcatatgcca 1860 agtacgcccc ctattgacgt caatgacggt aaatggcccg cctggcatta tgcccagtac 1920 atgaccttat gggactttcc tacttggcag tacatctacg tattagtcat cgctattacc 1980 atggtgatgc ggttttggca gtacatcaat gggcgtggat agcggtttga ctcacgggga 2040 tttccaagtc tccaccccat tgacgtcaat gggagtttgt tttggcacca aaatcaacgg 2100 gactttccaa aatgtcgtaa caactccgcc ccattgacgc aaatgggcgg taggcgtgta 2160 cggtgggagg tctatataag cagagctctc tggctaacta gagaacccac tgcttactgg 2220 cttatcgaaa ttaatacgac tcactatagg gagacccaag ctgcttacca tggggggttc 2280 tcatcatcat catcatcatg gtatggctag catgactggt ggacagcaaa tgggtcggga 2340 tctgtacgac gatgacgata aggtacctaa ggatcagctt ggagttgatc ccgtcgtttt 2400 acaacgtcgt gactgggaaa accctggcgt tacccaactt aatcgccttg cagcacatcc 2460 ccctttcgcc agctggcgta atagcgaaga ggcccgcacc gatcgccctt cccaacagtt 2520 gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg gcaccagaag cggtgccgga 2580 aagctggctg gagtgcgatc ttcctgaggc cgatactgtc gtcgtcccct caaactggca 2640 gatgcacggt tacgatgcgc ccatctacac caacgtaacc tatcccatta cggtcaatcc 2700 gccgtttgtt cccacggaga atccgacggg ttgttactcg ctcacattta atgttgatga 2760 aagctggcta caggaaggcc agacgcgaat tatttttgat ggcgttaact cggcgtttca 2820 tctgtggtgc aacgggcgct gggtcggtta cggccaggac agtcgtttgc cgtctgaatt 2880 tgacctgagc gcatttttac gcgccggaga aaaccgcctc gcggtgatgg tgctgcgttg 2940 gagtgacggc agttatctgg aagatcagga tatgtggcgg atgagcggca ttttccgtga 3000 cgtctcgttg ctgcataaac cgactacaca aatcagcgat ttccatgttg ccactcgctt 3060 taatgatgat ttcagccgcg ctgtactgga ggctgaagtt cagatgtgcg gcgagttgcg 
3120 tgactaccta cgggtaacag tttctttatg gcagggtgaa acgcaggtcg ccagcggcac 3180 cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt tatgccgatc gcgtcacact 3240 acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa atcccgaatc tctatcgtgc 3300 ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa gcagaagcct gcgatgtcgg 3360 tttccgcgag gtgcggattg aaaatggtct gctgctgctg aacggcaagc cgttgctgat 3420 tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt caggtcatgg atgagcagac 3480 gatggtgcag gatatcctgc tgatgaagca gaacaacttt aacgccgtgc gctgttcgca 3540 ttatccgaac catccgctgt ggtacacgct gtgcgaccgc tacggcctgt atgtggtgga 3600 tgaagccaat attgaaaccc acggcatggt gccaatgaat cgtctgaccg atgatccgcg 3660 ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg cagcgcgatc gtaatcaccc 3720 gagtgtgatc atctggtcgc tggggaatga atcaggccac ggcgctaatc acgacgcgct 3780 gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg cagtatgaag gcggcggagc 3840 cgacaccacg gccaccgata ttatttgccc gatgtacgcg cgcgtggatg aagaccagcc 3900 cttcccggct gtgccgaaat ggtccatcaa aaaatggctt tcgctacctg gagagacgcg 3960 cccgctgatc ctttgcgaat acgcccacgc gatgggtaac agtcttggcg gtttcgctaa 4020 atactggcag gcgtttcgtc agtatccccg tttacagggc ggcttcgtct gggactgggt 4080 ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg tggtcggctt acggcggtga 4140 ttttggcgat acgccgaacg atcgccagtt ctgtatgaac ggtctggtct ttgccgaccg 4200 cacgccgcat ccagcgctga cggaagcaaa acaccagcag cagtttttcc agttccgttt 4260 atccgggcaa accatcgaag tgaccagcga atacctgttc cgtcatagcg ataacgagct 4320 cctgcactgg atggtggcgc tggatggtaa gccgctggca agcggtgaag tgcctctgga 4380 tgtcgctcca caaggtaaac agttgattga actgcctgaa ctaccgcagc cggagagcgc 4440 cgggcaactc tggctcacag tacgcgtagt gcaaccgaac gcgaccgcat ggtcagaagc 4500 cgggcacatc agcgcctggc agcagtggcg tctggcggaa aacctcagtg tgacgctccc 4560 cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa atggattttt gcatcgagct 4620 gggtaataag cgttggcaat ttaaccgcca gtcaggcttt ctttcacaga tgtggattgg 4680 cgataaaaaa caactgctga cgccgctgcg cgatcagttc acccgtgcac cgctggataa 4740 cgacattggc gtaagtgaag cgacccgcat tgaccctaac gcctgggtcg aacgctggaa 4800 
ggcggcgggc cattaccagg ccgaagcagc gttgttgcag tgcacggcag atacacttgc 4860 tgatgcggtg ctgattacga ccgctcacgc gtggcagcat caggggaaaa ccttatttat 4920 cagccggaaa acctaccgga ttgatggtag tggtcaaatg gcgattaccg ttgatgttga 4980 agtggcgagc gatacaccgc atccggcgcg gattggcctg aactgccagc tggcgcaggt 5040 agcagagcgg gtaaactggc tcggattagg gccgcaagaa aactatcccg accgccttac 5100 tgccgcctgt tttgaccgct gggatctgcc attgtcagac atgtataccc cgtacgtctt 5160 cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg aattatggcc cacaccagtg 5220 gcgcggcgac ttccagttca acatcagccg ctacagtcaa cagcaactga tggaaaccag 5280 ccatcgccat ctgctgcacg cggaagaagg cacatggctg aatatcgacg gtttccatat 5340 ggggattggt ggcgacgact cctggagccc gtcagtatcg gcggaattac agctgagcgc 5400 cggtcgctac cattaccagt tggtctggtg tcaaaaataa taaagccgaa ttctgcagat 5460 atccagcaca gtggcggccg ctagcacaaa aataaaaaaa gaaagggtga ctgtgagaca 5520 tgggctaaag aggagcggcc gctcgagtct agaactagtg gatcagcttt gctgcttgca 5580 cttcagagtt ctaggagagt ccctcctagt ctctcctctc cgaggaggta ccgagacctc 5640 aaaataaagg agtgattgcc ttactgccga gtggagagtg attactgagc ggccggtgta 5700 tcgggagtcg tcccttaatc tgtgcaatac cagagcggct ctcgcagccg acctcgaggg 5760 ggggccctat tctatagtgt cacctaaatg ctagagctcg ctgatcagcc tcgactgtgc 5820 cttctagttg ccagccatct gttgtttgcc cctcccccgt gccttccttg accctggaag 5880 gtgccactcc cactgtcctt tcctaataaa atgaggaaat tgcatcgcat tgtctgagta 5940 ggtgtcattc tattctgggg ggtggggtgg ggcaggacag caagggggag gattgggaag 6000 acaatagcag gcatgctggg gatgcggtgg gctctatggc ttctgaggcg gaaagaacca 6060 gtggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca 6120 aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg 6180 ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg 6240 acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt 6300 ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt 6360 tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc 6420 tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt 6480 gagtccaacc 
cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt 6540 agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc 6600 tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa 6660 agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt 6720 tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct 6780 acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta 6840 tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa 6900 agtatatatg agtaacctga tcaggactct tccttttcat gaacaataaa actgtctgct 6960 tacataaaca gtaatacaag gggtgttatg agccatattc aacgggaaac gtcttgctct 7020 aggccgcgat taaattccaa catggatgct gatttatatg ggtataaatg ggctcgcgat 7080 aatgtcgggc aatcaggtgc gacaatctat cgattgtatg ggaagcccga tgcgccagag 7140 ttgtttctga aacatggcaa aggtagcgtt gccaatgatg ttacagatga gatggtcaga 7200 ctaaactggc tgacggaatt tatgcctctt ccgaccatca agcattttat ccgtactcct 7260 gatgatgcat ggttactcac cactgcgatc cccgggaaaa cagcattcca ggtattagaa 7320 gaatatcctg attcaggtga aaatattgtt gatgcgctgg cagtgttcct gcgccggttg 7380 cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc gcgtatttcg tctcgctcag 7440 gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg attttgatga cgagcgtaat 7500 ggctggcctg ttgaacaagt ctggaaagaa atgcataaac ttttgccatt ctcaccggat 7560 tcagtcgtca ctcatggtga tttctcactt gataacctta tttttgacga ggggaaatta 7620 ataggttgta ttgatgttgg acgagtcgga atcgcagacc gataccagga tcttgccatc 7680 ctatggaact gcctcggtga gttttctcct tcattacaga aacggctttt tcaaaaatat 7740 ggtattgata atcctgatat gaataaattg cagtttcatt tgatgctcga tgagtttttc 7800 taagaattcg cgcaattaac cctcactaaa gggaacaaaa gctgggtacc gggccc 7856 <210> 79 <211> 8127 <212> DNA <213> Artificial <220> <223> pCAH/SINd1/hlacZ <400> 79 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta 
tcatatgcca agtacgcccc ctattgacgt caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttat gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacatcaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa caactccgcc ccattgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcaa gcttgctgct 600 tgcacttcag agttctagga gagtccctcc tagtctctcc tctccgagga ggtaccgaga 660 cctcaaaata aaggagtgat tgccttactg ccgagtggag agtgattact gagcggccgg 720 tgtatcggga gtcgtccctt aatctgtgca ataccagagc ggctctcgca gctggcgccc 780 aacgtggggc ccgaggagaa gaaaagaaag cggccctgag aactcggctt ctgaaaaaga 840 ggaagaggac aagttgctat agcaacaaga gagaagaagt agagcaaagg tccagtggct 900 cggaaaaaga ggaactgaaa cttcggggac gcctgaagga gtaaggtaag tgactctgct 960 gtacgcgggg cgaggcagag gtttccttct aaattgaaag agaagtgttg ctgcgagagg 1020 tcttggtggt cgagaatcct gtacaaaaaa aaggagggat ctcggtcagg accaggaccc 1080 ctgggagtaa tacaacagca acaccgtaag aaaatccgcc taggtgagtc tagatagaga 1140 ctaggcgagg caagtctccg gggggaaaag agattatcct gagctcgaaa aatgtatcaa 1200 gcatgcatgc aagataaaag ttcgactcag aggggagcac ttgacagaag gaaattgttt 1260 atggtgcctt aaaacattag attacatgtt tgaggaccat aaagaggaac cttggacaaa 1320 agtaaaattt aggacaatat ggcagaaggt gaagaatcta actcctgagg agagtaacaa 1380 aaaagacttt atgtctttgc aggccacatt agcgggtcta atgtgttgcc aaatggggat 1440 gagaccgggc tgcaggaatt cgattctaga ggtgatagaa atgccagaaa actatgcaaa 1500 aacaagaatc ataaacagga aaaaaagaga actcagccac aagaggaaga agagaggcgt 1560 tggcttggtc attatgctag ttatcatggc aatagtagct gccgcagggg cttctctggg 1620 agtcgcaaac gcgattcagc agtcttacac taaggcagct gtccagaccc ttgctaatgc 1680 aactgctgca cagcaggatg tgttagaggc aacctatgcc atggtacagc atgtggctaa 1740 aggcgtacga atcttggaag ctcgagtggc tcgagtggaa gctatcacag atagaataat 1800 gctataccaa gaattggatt gttggcacta ggatccatca gccaccatta acgcttacaa 1860 tttacgcgcg cgttgacatt gattattgac tagttattaa tagtaatcaa ttacggggtc 1920 attagttcat agcccatata tggagttccg 
cgttacataa cttacggtaa atggcccgcc 1980 tggctgaccg cccaacgacc cccgcccatt gacgtcaata atgacgtatg ttcccatagt 2040 aacgccaata gggactttcc attgacgtca atgggtggac tatttacggt aaactgccca 2100 cttggcagta catcaagtgt atcatatgcc aagtacgccc cctattgacg tcaatgacgg 2160 taaatggccc gcctggcatt atgcccagta catgacctta tgggactttc ctacttggca 2220 gtacatctac gtattagtca tcgctattac catggtgatg cggttttggc agtacatcaa 2280 tgggcgtgga tagcggtttg actcacgggg atttccaagt ctccacccca ttgacgtcaa 2340 tgggagtttg ttttggcacc aaaatcaacg ggactttcca aaatgtcgta acaactccgc 2400 cccattgacg caaatgggcg gtaggcgtgt acggtgggag gtctatataa gcagagctct 2460 ctggctaact agagaaccca ctgcttactg gcttatcgaa attaatacga ctcactatag 2520 ggagacccaa gctttaagct taccatgggg ggttctcatc atcatcatca tcatggtatg 2580 gcatgactgg tggacagcaa atgggtcggg atctgtacga cgatgacgat aaggtaccta 2640 aggatcagct tggagttgat cccgtcgttt tacaacgtcg tgactgggaa aaccctggcg 2700 ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt aatagcgaag 2760 aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa tggcgctttg 2820 cctggtttcc ggcaccagaa gcggtgccgg aaagctggct ggagtgcgat cttcctgagg 2880 ccgatactgt cgtcgtcccc tcaaactggc agatgcacgg ttacgatgcg cccatctaca 2940 ccaacgtaac ctatcccatt acggtcaatc cgccgtttgt tcccacggag aatccgacgg 3000 gttgttactc gctcacattt aatgttgatg aaagctggct acaggaaggc cagacgcgaa 3060 ttatttttga tggcgttaac tcggcgtttc atctgtggtg caacgggcgc tgggtcggtt 3120 acggccagga cagtcgtttg ccgtctgaat ttgacctgag cgcattttta cgcgccggag 3180 aaaaccgcct cgcggtgatg gtgctgcgtt ggagtgacgg cagttatctg gaagatcagg 3240 atatgtggcg gatgagcggc attttccgtg acgtctcgtt gctgcataaa ccgactacac 3300 aaatcagcga tttccatgtt gccactcgct ttaatgatga tttcagccgc gctgtactgg 3360 aggctgaagt tcagatgtgc ggcgagttgc gtgactacct acgggtaaca gtttctttat 3420 ggcagggtga aacgcaggtc gccagcggca ccgcgccttt cggcggtgaa attatcgatg 3480 agcgtggtgg ttatgccgat cgcgtcacac tacgtctgaa cgtcgaaaac ccgaaactgt 3540 ggagcgccga aatcccgaat ctctatcgtg cggtggttga actgcacacc gccgacggca 3600 cgctgattga agcagaagcc tgcgatgtcg gtttccgcga 
ggtgcggatt gaaaatggtc 3660 tgctgctgct gaacggcaag ccgttgctga ttcgaggcgt taaccgtcac gagcatcatc 3720 ctctgcatgg tcaggtcatg gatgagcaga cgatggtgca ggatatcctg ctgatgaagc 3780 agaacaactt taacgccgtg cgctgttcgc attatccgaa ccatccgctg tggtacacgc 3840 tgtgcgaccg ctacggcctg tatgtggtgg atgaagccaa tattgaaacc cacggcatgg 3900 tgccaatgaa tcgtctgacc gatgatccgc gctggctacc ggcgatgagc gaacgcgtaa 3960 cgcgaatggt gcagcgcgat cgtaatcacc cgagtgtgat catctggtcg ctggggaatg 4020 aatcaggcca cggcgctaat cacgacgcgc tgtatcgctg gatcaaatct gtcgatcctt 4080 cccgcccggt gcagtatgaa ggcggcggag ccgacaccac ggccaccgat attatttgcc 4140 cgatgtacgc gcgcgtggat gaagaccagc ccttcccggc tgtgccgaaa tggtccatca 4200 aaaaatggct ttcgctacct ggagagacgc gcccgctgat cctttgcgaa tacgcccacg 4260 cgatgggtaa cagtcttggc ggtttcgcta aatactggca ggcgtttcgt cagtatcccc 4320 gtttacaggg cggcttcgtc tgggactggg tggatcagtc gctgattaaa tatgatgaaa 4380 acggcaaccc gtggtcggct tacggcggtg attttggcga tacgccgaac gatcgccagt 4440 tctgtatgaa cggtctggtc tttgccgacc gcacgccgca tccagcgctg acggaagcaa 4500 aacaccagca gcagtttttc cagttccgtt tatccgggca aaccatcgaa gtgaccagcg 4560 aatacctgtt ccgtcatagc gataacgagc tcctgcactg gatggtggcg ctggatggta 4620 agccgctggc aagcggtgaa gtgcctctgg atgtcgctcc acaaggtaaa cagttgattg 4680 aactgcctga actaccgcag ccggagagcg ccgggcaact ctggctcaca gtacgcgtag 4740 tgcaaccgaa cgcgaccgca tggtcagaag ccgggcacat cagcgcctgg cagcagtggc 4800 gtctggcgga aaacctcagt gtgacgctcc ccgccgcgtc ccacgccatc ccgcatctga 4860 ccaccagcga aatggatttt tgcatcgagc tgggtaataa gcgttggcaa tttaaccgcc 4920 agtcaggctt tctttcacag atgtggattg gcgataaaaa acaactgctg acgccgctgc 4980 gcgatcagtt cacccgtgca ccgctggata acgacattgg cgtaagtgaa gcgacccgca 5040 ttgaccctaa cgcctgggtc gaacgctgga aggcggcggg ccattaccag gccgaagcag 5100 cgttgttgca gtgcacggca gatacacttg ctgatgcggt gctgattacg accgctcacg 5160 cgtggcagca tcaggggaaa accttattta tcagccggaa aacctaccgg attgatggta 5220 gtggtcaaat ggcgattacc gttgatgttg aagtggcgag cgatacaccg catccggcgc 5280 ggattggcct gaactgccag ctggcgcagg tagcagagcg ggtaaactgg 
ctcggattag 5340 ggccgcaaga aaactatccc gaccgcctta ctgccgcctg ttttgaccgc tgggatctgc 5400 cattgtcaga catgtatacc ccgtacgtct tcccgagcga aaacggtctg cgctgcggga 5460 cgcgcgaatt gaattatggc ccacaccagt ggcgcggcga cttccagttc aacatcagcc 5520 gctacagtca acagcaactg atggaaacca gccatcgcca tctgctgcac gcggaagaag 5580 gcacatggct gaatatcgac ggtttccata tggggattgg tggcgacgac tcctggagcc 5640 cgtcagtatc ggcggaatta cagctgagcg ccggtcgcta ccattaccag ttggtctggt 5700 gtcaaaaata ataaagccga attctgcaga tatccagcac agtggcggcc gctagcacaa 5760 aaataaaaaa agaaagggtg actgtgagac atgggctaaa gaggagcggc cgctcgagtc 5820 tagaactagt ggatcagctt tgctgcttgc acttcagagt tctaggagag tccctcctag 5880 tctctcctct ccgaggaggt accgagacct caaaataaag gagtgattgc cttactgccg 5940 agtggagagt gattactgag cggccggtgt atcgggagtc gtcccttaat ctgtgcaata 6000 ccagagcggc tctcgcagcc gacctcgagg gggggcccta ttctatagtg tcacctaaat 6060 gctagagctc gctgatcagc ctcgactgtg ccttctagtt gccagccatc tgttgtttgc 6120 ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct ttcctaataa 6180 aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg gggtggggtg 6240 gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg ggatgcggtg 6300 ggctctatgg cttctgaggc ggaaagaacc agtggcggta atacggttat ccacagaatc 6360 aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa 6420 aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc atcacaaaaa 6480 tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc 6540 ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg gatacctgtc 6600 cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta ggtatctcag 6660 ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga 6720 ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac acgacttatc 6780 gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag gcggtgctac 6840 agagttcttg aagtggtggc ctaactacgg ctacactaga aggacagtat ttggtatctg 6900 cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat ccggcaaaca 6960 aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa 
7020 aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt ggaacgaaaa 7080 ctcacgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct agatcctttt 7140 aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaacctg atcaggactc 7200 ttccttttca tgaacaataa aactgtctgc ttacataaac agtaatacaa ggggtgttat 7260 gagccatatt caacgggaaa cgtcttgctc taggccgcga ttaaattcca acatggatgc 7320 tgatttatat gggtataaat gggctcgcga taatgtcggg caatcaggtg cgacaatcta 7380 tcgattgtat gggaagcccg atgcgccaga gttgtttctg aaacatggca aaggtagcgt 7440 tgccaatgat gttacagatg agatggtcag actaaactgg ctgacggaat ttatgcctct 7500 tccgaccatc aagcatttta tccgtactcc tgatgatgca tggttactca ccactgcgat 7560 ccccgggaaa acagcattcc aggtattaga agaatatcct gattcaggtg aaaatattgt 7620 tgatgcgctg gcagtgttcc tgcgccggtt gcattcgatt cctgtttgta attgtccttt 7680 taacagcgat cgcgtatttc gtctcgctca ggcgcaatca cgaatgaata acggtttggt 7740 tgatgcgagt gattttgatg acgagcgtaa tggctggcct gttgaacaag tctggaaaga 7800 aatgcataaa cttttgccat tctcaccgga ttcagtcgtc actcatggtg atttctcact 7860 tgataacctt atttttgacg aggggaaatt aataggttgt attgatgttg gacgagtcgg 7920 aatcgcagac cgataccagg atcttgccat cctatggaac tgcctcggtg agttttctcc 7980 ttcattacag aaacggcttt ttcaaaaata tggtattgat aatcctgata tgaataaatt 8040 gcagtttcat ttgatgctcg atgagttttt ctaagaattc gcgcaattaa ccctcactaa 8100 agggaacaaa agctgggtac cgggccc 8127                           SEQUENCE LISTING <110> MACROGEN CO., LTD   <120> CAEV-BASED VECTOR SYSTEMS <160> 79 <170> KopatentIn 1.71 <210> 1 <211> 9189 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 1 gagttctagg agagtccctc ctagtctctc ctctccgagg aggtaccgag acctcaaaat 60 aaaggagtga ttgccttact gccgagtgga gagtgattac tgagcggccg gtgtatcggg 120 agtcgtccct taatctgtgc aataccagag cggctctcgc agctggcgcc caacgtgggg 180 cccgaggaga agaaaagaaa gcggccctga gaactcggct tctgaaaaag aggaagagga 240 caagttgcta tagcaacaag agagaagaag tagagcaaag gtccagtggc tcggaaaaag 300 aggaactgaa acttcgggga cgcctgaagg agtaaggtaa gtgactctgc tgtacgcggg 360 gcgaggcaga ggtttccttc taaattgaaa gagaagtgtt 
gctgcgagag gtcttggtgg 420 tcgagaatcc tgtacaaaaa aaaggaggga tctcggtcag gaccaggacc cctgggagta 480 atacaacagc aacaccgtaa gaaaatccgc catggtgagt ctagatagag acatggcgag 540 gcaagtctcc ggggggaaaa gagattatcc tgagctcgaa aaatgtatca agcatgcatg 600 caagataaaa gttcgactca gaggggagca cttgacagaa ggaaattgtt tatggtgcct 660 taaaacatta gattacatgt ttgaggacca taaagaggaa ccttggacaa aagtaaaatt 720 taggacaata tggcagaagg tgaagaatct aactcctgag gagagtaaca aaaaagactt 780 tatgtctttg caggccacat tagcgggtct aatgtgttgc caaatgggga tgagacctga 840 gacattgcaa gatgcaatgg ctacagtaat catgaaagat gggttactgg aacaagagga 900 aaagaaggaa gacaaaagag aaaaggaaga gagtgtcttc ccaatagtag tgcaagcagc 960 aggagggaga agctggaaag cagtagattc tgtaatgttc cagcaactgc aaacagtagc 1020 aatgcagcat ggcctcgtgt ctgaggactt tgaaaggcag ttggcatatt atgctactac 1080 ctggacaagt aaagacatac tagaagtatt ggccatgatg cctggaaata gagctcaaaa 1140 ggagttaatt caagggaaat taaatgaaga agcagaaagg tggagaagga ataatccacc 1200 acctccagca ggaggaggat taacagtgga tcaaattatg ggggtaggac aaacaaatca 1260 agcagcagca caagctaaca tggatcaggc aaggcaaata tgcctgcaat gggtaataaa 1320 tgcattaaga gcagtaagac atatggcgca caggccaggg aatccaatgc tagtaaagca 1380 aaaaacgaat gagccatatg aagattttgc agcaagactg ctagaagcaa tagatgcaga 1440 gccagttaca cagcctataa aagattatct aaagctaaca ctatcttata caaatgcatc 1500 agcagattgt cagaagcaaa tggatagaac actaggacaa agagtacaac aagctagtgt 1560 agaagaaaaa atgcaagcat gtagagatgt gggatcagaa gggttcaaaa tgcaattgtt 1620 agcacaagca ttaaggccag gaaaaggaaa agggaatgga cagccacaaa ggtgttacaa 1680 ctgtggaaaa ccgggacatc aagcaaggca atgtagacaa ggaatcatat gtcacaactg 1740 tggaaagaga ggacatatgc aaaaagaatg cagaggaaag agagacataa ggggaaaaca 1800 gcagggaaac gggaggaggg ggatacgtgt ggtgccgtcc gctcctccta tggaataact 1860 tcagcaccac ctatggttca ggtccgcata ggttcccagc agaggaactt gttatttgat 1920 accggggcgg accgaactat agttagatgg catgagggct cgggaaaccc agccggaagg 1980 ataaaactgc aaggaatagg aggaatagta gaaggagaaa aatggaataa tgtagaatta 2040 gaatataaag gagaaacaag aaagggaaca atagtagtgt taccacaaag tccagtagaa 
2100 gtattaggac gagataacat ggcccgattt ggaataaaga taataatggc aaatttagag 2160 gaaaaaagaa tcccaattac aaaagtaaaa ttgaaagagg gatgtacggg tccacatgtc 2220 ccacaatggc cattaacaga agagaaatta aaaggtctaa cagaaatcat agataaatta 2280 gtggaagaag gaaaactagg aaaggcaccc ccacattgga catgtaatac tccaatcttt 2340 tgcataaaaa agaaatcagg gaagtggaga atgttaatag atttcagaga attgaacaaa 2400 cagacagaag atttaacaga agcgcagtta ggactcccgc atccgggagg actacaaaag 2460 aaaaaacatg ttacaatatt ggacatagga gatgcatatt ttactatacc cctatatgaa 2520 ccatatcgag agtacacatg ttttactcta ttaagtccta ataatctagg accatgtaaa 2580 agatactatt ggaaagtgct gccacaaggt tggaaattga gtccatctgt atatcaattt 2640 actatgcagg agatcttaga ggattggata cagcagcatc cagaaattca atttggcata 2700 tatatggatg atatttacat aggaagtgat ttagaaatta aaaagcatag agaaatagtg 2760 aaagatttag ccaattatat tgcccaatat ggattcactc tgccagaaga gaagagacaa 2820 aagggatatc cagcaaaatg gctaggattt gaactacacc cgcagacctg gaaatttcag 2880 aagcatacat tacctgaatt aacaaaggga acaataacat taaataaatt acagaaatta 2940 gtaggagaat tagtatggag acaatccata attgggaaaa gcattcctaa cattctgaaa 3000 ttaatggaag gagatagaga attacaaagt gaaagaaaaa ttgaagaagt acatgtgaaa 3060 gaatgggaag catgtaggaa aaaattagaa gaaatggaag gaaattatta taataaagac 3120 aaagatgtct atggacaatt ggcttgggga gacaaagcta tagaatatat agtgtatcag 3180 gagaaaggga aaccattatg ggtaaatgtg gttcacaata taaagaacct aagcatcccg 3240 caacaggtta ttaaagcagc gcaaaaatta acccaagaag tcatcattag gacaggaaaa 3300 ataccatgga tattgttgcc agggaaagaa gaagattgga gactagaatt gcaattaggg 3360 aacatcacat ggatgccaaa attttggtcc tgttatcgag gacatacaag atggagaaaa 3420 agaaatataa tagaagaagt agtagaaggg cctacatatt atacagatgg aggaaaaaag 3480 aataaagtag gaagtctagg gttcatagta tcaacagggg aaaaatttag aaagcatgaa 3540 gagggcacaa accagcaact agaattaaga gccatagagg aagctctaaa acaagggcct 3600 caaacaatga atttagtaac agatagtaga tatgcatttg aatttttatt aagaaattgg 3660 gatgaagaag taataaagaa tccaattcaa gcaagaatta tggaaattgc ccacaagaaa 3720 gataggatag gagtgcattg ggtgccagga cataaaggga ttccccaaaa tgaagaaata 3780 
gacaaatata tttcggaaat atttcttgca aaagaaggag aaggaattct cccaaaaaga 3840 gaagaggatg cagggtatga tttaatatgc ccagaagagg ttaccataga gccaggacaa 3900 gtgaaatgca tccccataga gctaagatta aatttaaaga aatcacaatg ggctatgatt 3960 gctacaaaaa gcagcatggc tgccaaagga gtgttcacac aaggaggaat catagactca 4020 ggatatcagg gacaaataca ggtaataatg tataatagca ataaaatagc agtagtcata 4080 ccccaaggga gaaaatttgc acaattaata ttaatggata aaaagcatgg aaaattggaa 4140 ccctgggggg aaagcagaaa aacagaaagg ggagaaaaag gatttgggtc tacaggaatg 4200 tattggatag aaaatattcc tctggcagag gaagaccaca caaaatggca tcaagatgcc 4260 cgatcattgc atctagaatt tgaaattcca agaacagcag cagaagacat agtaaatcaa 4320 tgtgaaatat gcaaagaagc gaggacacct gcagtaatta gaggcggaaa caaaaggggg 4380 gtaaatcatt ggcaagtgga ttatacccat tatgaaaata tcatactatt agtatgggta 4440 gaaacaaatt caggactaat atatgcagaa aaagtaaaag gagaatcagg gcaagaattc 4500 agaataaaag tgatgcattg gtatgcatta tttggtccag agtcattgca gtcagacaat 4560 ggacctgcat ttgcagcaga gcccacacag ctgttaatgc aatacctagg agtaaaacac 4620 acaacaggca taccttggaa tccacagtct caggctatag tagaaagggc acatcaacta 4680 ttgaaaagca ctttaaagaa gttccagcca caatttgtcg ctgtagaatc agccatagca 4740 gcagccctag tcgccataaa tataaaaaga aagggtgggc tggggacaag ccctatggat 4800 atttttatat ataataaaga acagaaaaga ataaataata aatataataa aaattctcaa 4860 aaaattcaat tctgttatta cagaataagg aaaagaggac atcaggagag tggaaaggac 4920 caacccaggt actgtggaaa ggggaaggag ccaattgtgg taaaggatat agaaagtgaa 4980 aagtatttag taatacctta caaagatgca aaattcatcc cgccaccaac aaaagaaaag 5040 gaataaaaaa cctggaccag aattaccctt agcactatgg atacatatag cagaaagcat 5100 taatggggat agctcatggt acataacaat gagactgcaa cagatgatgt ggggaaaaag 5160 aggaaataag ttacaatata agaatgaaga cagggaatat gaaaattggg aaattacatc 5220 atggggatgg aaaatgcacc taaggagagt gaaacaatgg atacaagaca acaggagagg 5280 aagcccatgg cagtacaaag taggaggaac atggaaaagt ataggagtgt ggttcctgca 5340 agcaggagat tacagaaagg tagacaggca cttctggtgg gcatggagga tactgatatg 5400 ttcctgcagg aaagaaaagt ttgatataag agaatttatg agaggaagac atagatggga 5460 tttgtgcaaa 
tcctgtgctc aaggagaagt agtaaagcat actagaacaa aaagtctgga 5520 aagactagta ctgctacaga tggtagaaca gcatgtgttt caagtattgc cattgtggag 5580 agccaggaga agtagtacaa cagatttccc atggtgcagg gacacaacgg gatacacgca 5640 tgcgtggtct gtccaggagt gctggttgat ggaatatctc ttagaggatg agtgaagaac 5700 tgcctcaaag aagggagaca catccagaag aacttgtaag gaacgtacgg gaaagagaaa 5760 gggatacatg gcaatggaca agcatcagag tacctgcgga aatactgcaa agatggcttg 5820 ctatgcttag gtcaggcaga aatagaaaga aagtgtatag agaaatgcaa aaatggatgt 5880 ggatacatcc caaggcgcct gtgattaggg cctgtggatg cagactatgt aacccggggt 5940 ggggaacata atcaagggaa taataaatgc aaataaatgt aactaacaag tagcaaaagt 6000 gtctgtgtta gatggatgct ggggccagat acatgcgctt aactgggaag gaaaactggg 6060 ttgaagtaac catggacgga gagaaggaaa ggaaaagaga aggtttcact gcgggacagc 6120 aaggtaagta tcaaccccag gtaagtaagc aaatagggaa cagaaatact aacccatgct 6180 ttgcctataa agggatattc ctatggagga tatcactaac aatgtggata ttgctaggga 6240 taaatatgtg tgtcagtgca gaggattaca taacactaat atcagatccc tatgggttct 6300 cacccataaa aaatgtgtct ggggtaccag tgacttgtgt aacaaaagaa ttcgcaaaat 6360 ggggatgtca accactagga gcgtaccctg atccagaaat agaatacaga aatgtgagtc 6420 aggaagtagt gaaagaagta tatcaagaga attggccatg gaatacatat cattggcctc 6480 tctggcaaat ggagaatgtt aggtactggt taaaagaaaa tatgcaagaa aatcaacaga 6540 gaaaaaataa tacaaaagag ggtatagagg aattattagc aggaactata aggggaagat 6600 tctgtgtacc atacccattt gccttgttaa aatgcacaaa gtggtgctgg tatacagcgg 6660 ccataaacaa cgagtcagga aaagcaggaa aaataaaaat aaattgcaca gaagcaagag 6720 cagtctcctg tacagaggac atgccattag cctcaataca aagagcatat tgggatgaga 6780 aagacagaga gagcatggcc tttatgaata tcaaagcatg tgatagcaac ctaaggtgtc 6840 agaaaagacc tggagggtgt atggaaggat accctatccc agtaggagca gaaataatcc 6900 ctgaaagtat gaaataccta aggggagcaa agagtcagta tgggggaata aaagataaga 6960 atggagaatt aaaattacca ttaacattaa gagtgtgggt aaaattagca aatgtgtcag 7020 aatgggtaaa tgggacaccc ccggattggc aagacagaat taacggatcc aaaggaataa 7080 atgggacgct ctggggagag cttaacagta tgcatcacct aggatttgcc cttagccaga 7140 acggcaaatg gtgtaactac 
accggggaaa taaaattagg gcaagaaaca ttccaatatc 7200 attacaagcc aaactggaac tgtaccggga attggacgca atatccggtg tggcaagtga 7260 ttagaaacct ggatatggtg gaacatatga caggagaatg tgtgcagaga ccacaaaggc 7320 acaatataac agtaggaaat ggaaccataa cagggaattg cagtacaaca aactgggatg 7380 gatgtaattg ctcacgatca ggaaactacc tatataacag ctctgaggga ggattgttat 7440 taattctgtg cagacaaaac agcaccctaa caaggatcct gggaacaaat acaaattgga 7500 caactatgtg gggaatatac aaaaattgtt caggatgcga gaatgcaaca ttagacaaca 7560 caggagaagg aaccttagga ggtgtagcta ataagaactg tagcttgcct cataaaaatg 7620 agagcaacaa gtggacttgt gccccaagac aaagagatgg aaaaacagat tcgctataca 7680 tagcaggagg aaaaaagttt tggacacgaa ttaaggccca attcagctgt gaaagtaaca 7740 taggacaatt agatggaatg ttgcatcagc aaatactatt gcaaaaatat caagtaatta 7800 aggtaagagc ttatacatat ggggtgatag aaatgccaga aaactatgca aaaacaagaa 7860 tcataaacag gaaaaaaaga gaactcagcc acaagaggaa gaagagaggc gttggcttgg 7920 tcattatgct agttatcatg gcaatagtag ctgccgcagg ggcttctctg ggagtcgcaa 7980 acgcgattca gcagtcttac actaaggcag ctgtccagac ccttgctaat gcaactgctg 8040 cacagcagga tgtgttagag gcaacctatg ccatggtaca gcatgtggct aaaggcgtac 8100 gaatcttgga agctcgagtg gctcgagtgg aagctatcac agatagaata atgctatacc 8160 aagaattgga ttgttggcac tatcatcaat actgtataac ctctacaaaa acagaagtag 8220 caaaatatat caattggacg aggtttaagg ataattgcac atggcagcag tgggagagag 8280 gattacaggg gtatgataca aacttaacaa tactgttaaa ggaatcagca gcaatgacac 8340 aactagcaga agagcaagca aggaggatac cagaagtatg ggaaagttta aaagacgtct 8400 ttgattggtc aggatggttc tcatggctaa agtatattcc tattatagta gtaggattat 8460 taggatgcat tctgataaga gctgtgatat gtgtatgtca acctcttgtg cagatataca 8520 gaactctaag taccccgaca taccaacggg tcacagtcat catggaaaca agagcagacg 8580 tcgcaggaga aaatcaggat tttggcgatg gcttagagga atcagacaac agcgaaacaa 8640 gcgaaagagt gacagtacag aaagcttgga gccgtgcctg ggagctttgg cagaactcac 8700 cctggaagga gccatggaaa aggggcctgc tgaggctgct cgtccttccg ctgacgatgg 8760 gaatctggat aaatggatgg cttggagaac accacaaaaa taaaaaaaga aagggtgact 8820 gtgagacatg ggctaaagag gactaataac 
aagctaggcc aaattcctgt aaatcacttg 8880 gggggttata agaaaagcaa gttcactatg acaaagcaaa atgtaaaggc caaattcctg 8940 taaatcactt ggggggttat aagaaaagca agttcactat gacaaagcaa aatgtaaccg 9000 caagtgctga cagatgtaac agctgacata tcagctgatg cttgctcatg ctgacactgt 9060 agctctgagc tgtatataag gagaagcttg ctgcttgcac ttcagagttc taggagagtc 9120 cctcctagtc tctcctctcc gaggaggtac cgagacctca aaataaagga gtgattgcct 9180 tactgccga 9189 <210> 2 <211> 8919 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 2 gtgagtgctc tgaggagctc gaaggaaaga gtcctcagcc tctcctctcc gaggagcttc 60 ggctcataat aaaggagtgc ttgcttcaac agaactgagc tggtcgtggt tattatcggg 120 gaccgaagtc ccgtgcaaca ccggggcggt tctcgcagct ggcgcccaac gtggggctcg 180 agtagcttga gaagctcgac tgagatctga atccaagagc gacatcagac agcaagaaat 240 gagagtaatg agaccgcgag ctctgctgct gtaaaaaaga ggaagtagcg ggttgccgag 300 gcaactgctc agaagaacca ggggaaaggg cttccagcaa cctcaaaaga ggaaccgaga 360 cttcggggac gcctgaagta aggtaagtga ctctgctgta cgcggggcga ggcataggag 420 atccttctat tctaggaaga gaagcgctgt tctgggaggt cttggcgacc gagaatcttg 480 ttaaataagc caggatctcg atcaggacca agacccctca ggagagggta tagacagcgt 540 ggtaagaaat ccgccgtggt gagtctagat agagacatgg tgaggcaggc ctccggaagg 600 ggaaaggagt accccgagct aaaagaatgt ctgaaaaagg catgcaaaat aaaagtaagg 660 gctggggggg agcgcctgac agaaggaaat tgtctctggt gtataaaaac actagagtgt 720 atgtatgagg attgtaggga ggaaccttgg accccagaaa aatgtaaaca attatggaaa 780 aagttgaagc aggtagagcc tgaggagagt agcaaagcag actataactc gttaaaagca 840 accttggcgg ggatagtctg tgtgcaaatg ggaatgcagc ccgagacact gcaggatgcg 900 atagcaacct taaacatgag agatgaagta aaaggaaagg aaaagccatc agaagaaaag 960 aagggaatat atcccatatt agtgcaggca ggaggaggaa gagcatggag agcggtagag 1020 cctgctacct ttcagcagct ccaaacagtg gcaatgcagc atggactagt atcagaagaa 1080 tttgaaaggc agctagcata ctatgccacc acatggacaa gcaaggatat cttagaagta 1140 ttagccatga tgccaggaaa tagagcgcaa aaagaactaa tacaaggaaa gttaaatgag 1200 gaagcagaga gatggagaag gcagaatcca caacctgcgg gcgggttaac cgtggatcag 1260 ataatggggg taggacaaac gaatcaggca 
gcggcacagg ctaatatgga tcaagcaaga 1320 caaatatgcc tacaatgggt tataacagca ataagaggag ttaggcatat ggcccataga 1380 ccaggaaatc ccatgctggt aagacaaaaa ccaaatgaga actatgaaga gtttgccgca 1440 aggttgttag aagcagtgga tgcagaaccc gttacccaac ctataaaaga atatttaaag 1500 gtaactctgt cttacacaaa tgcaaattcg gaatgtcaaa aacatatgga cagagtgttg 1560 gggcaaagag tacagcaggc ctcaatagaa gaaaaaatgc aggcatgcag ggacatcggg 1620 ggaacagcat atcagatgca gttgcttgca caagccctcc gtggcggaaa agaagatggg 1680 aaaaaatctg tagggaagtg ttataactgt ggaaggcccg gacacagagc aaaagaatgc 1740 agacaaggca ttatatgtca caactgtgga aaaagagggc atatacagaa aaactgcaaa 1800 cagaaaagaa gaaaggagca gggaaacatg aggagggggc tacgtgtggt gccgtccgca 1860 ccccctatgg agtaacgcaa gcaccactaa tagttagggt acaaataggg aatcaggaga 1920 aacaattatt atttgacaca ggggcagata aaacgatagt aagaatgcat gatggaacag 1980 ggattccaaa cggaagaata aaattacaag ggataggagg aatagtagaa ggagaaaaat 2040 ggaataaagt acccatgaca tataagggag aaacatcctg cccaagcttg gttgtgctaa 2100 gagatagccc agtagaagta ttgggaagag ataacatgga agcattcggc gtaaccctaa 2160 taatggcaaa tttagaagat aagaaaattc ccacaatacc agtagaattg aaagaaggat 2220 gtaaagggcc acatgtgccc cagtggccat taacagcaga gaaattacaa ggactaacag 2280 gaatagtaga aaaattacta caggaaggaa aattggcaga ggccccagag ggatggacgt 2340 ggaacacgcc catcttctgc ataaaaaaga agtcaggaaa atggagaatg ttaatagatt 2400 ttagggaatt aaataagcaa acagcagatt tagcagaagc gcagctagga ctgccacacc 2460 caggagggtt gcaaaggaaa aagaatgtaa caattctgga cataggagat gcatatttca 2520 caattccctt atacgagccc tatcagaaat atacatgctt cacactccta agtcctaaca 2580 atttgggacc atgtaaaagg tattattgga aagtattacc ccagggatgg aaattgagcc 2640 cagctgtata tcaattcacc atgcaaaggt tgttaaaagg atggatacaa cagcataaaa 2700 acatacaatt tggaatatat atggatgata tctatattgg aagtgatcta acgatagccc 2760 aacataggaa gataatagaa gaattagcct catttataga acaatttggg tttacattac 2820 cagaagataa gagacaagag ggctatccag caaaatggct aggattcgag ctacatccag 2880 aaaaatggaa atatcaaaag cataaattgc cggaattaca agagggggta ataaccctga 2940 acaaattaca gaagatagta ggggaattag tgtggagaca 
atccttgata ggaaagagca 3000 tccccaatat cataaaatta atggaaggag atcgcgcatt acaaagtgaa aggaaaatag 3060 aaagaataca tgtacaagaa tgggaagcat gtcaaaagaa attagatgaa atggtaggaa 3120 attattacag agaagaagaa gatatctatg gacaaataac ttggggggat aaggcaataa 3180 aatacatagt attccaaagg aaaggggaac ccctatgggt aaatgtagta catgacataa 3240 aaaatttgag tctcccacag caagtgataa aagcagcaca gaaattaacc caggaagtaa 3300 tcataagaac aggaaaaatc ccatggctgc tactaccagg aagagaagaa gactggagat 3360 tagaactgca ggtagggaac atcacgtgga tgccatcatt ttggtcatgt tatcgaggag 3420 cacccaagtg gaaaagaagg aacatagtgg cagcagtggt agatggaccg acatattata 3480 cagatggggg aaagaaaaac gcacagggaa gctttggctt catctcccca acaggagaaa 3540 agttcagaag gcatgaagat ggaactaatc aggtattaga attaagggca atagaagatc 3600 catgtaaaca aggacctgaa agcatgaaca ttgtaactga cagcaggtat gcttatgaat 3660 tcatgctccg aaactgggat gaacaggtca taagaaaccc cattcaggca agaatcatgg 3720 cagaagtgca caagaaaaag caggtaggaa tacactgggt gccagggcat aaaggaatac 3780 ctcagaatga agagatagac cagtacatat cagaagtatt cttagcacga gaaggaacag 3840 ggatatgtga aaaaaggaag gaagatgctg gatatgattt attatgcccg catgaggtaa 3900 tacttaaacc ccaagaagta aaacggatcc caatagacct aaaattaaaa ttgaaagaaa 3960 agcaatgggc catgataagt gggaaaagta gcgttgcagc aaaaggaata tttgtacaag 4020 gaggcataat agattcaggg tatcagggac aagtacaagt catcctatat aatagtaata 4080 agatagaggt caaaatacca caaggcagga aatttgccca attaatatta atgaacttac 4140 aacatgaaga attagaagaa tggggaaagg aaagaaaaac agaaagagga acaaaaggat 4200 ttgggtctac aggagcattt tggatagaga atattcccca agcagaggaa gaacattaca 4260 aatggcatca agatgctaga tctctgcagc tagaattcaa gatacctaga gcagcagcag 4320 aagacattat acagcactgt gaggtatgtc aagaaggcaa acccgcagcg atcacgagag 4380 ggggaaataa aagaggaata gatcattggc aggtagacta tacacattac aaagaacaca 4440 taatattagt atgggtagag actaattcag gattaatatt tgcagagaaa gtaaaaggag 4500 aatcaggaca agaatttagg atgcagacat tgaaatggta tgctttgttt caaccaaaat 4560 cagtgcaatc agataatggg acagccttca cagctgaggc tacgcagcat ctaatgaagt 4620 atttagggat tcagcacact acgggtattc cgtggaaccc ccagtcacaa 
agtttagtag 4680 aaagagctca tcaaacatta aaacacatgt tagaaaaatt agaaccacaa tttgtggccc 4740 tacagtctgc catcgcagcc actctagttg cgctcaatat aaaaagaaag ggtggactag 4800 gggcaagccc tatggatatt tacatatata ataaggagca acaaagacaa caagataata 4860 gtaataaatt aattcagaaa aaattttgtt attacaggat cagaaaaaga ggccatccag 4920 gagagtggaa cggcccaact gaggtactgt gggaagggga aggagccata gtagttaaag 4980 acaaagaaag tgatagatat ctagtcatcc catataaaga tgcaaaattt attccgccac 5040 cgtcggaaca gaagggatag aagaataggt ccagaattgc ctttatcttt atggacttat 5100 acagcataca gcataaataa agatcccgca tggtatacaa ccctaagact gcagcaaatg 5160 atgtggcata ggaggggaaa taaattgaca tatgtcaggg aaaatgcaca gtacgaggag 5220 tgggaaatga cctcgtatga gtggaggata agaatgagaa gggacaaaac aaaaagtcat 5280 ccaagagggc atacttcgcc atggcaatat cggagacagg atggatggaa ggatgtggga 5340 acgtggttcc tacagccagg ggactataga aaggcggatc agcagttctg gttcgcttgg 5400 agaatagtgt cgtgttcatg taaaaaggaa ggatttaaca taagagaatt tatgctaggt 5460 acccatagat gggatttgtg taagtcgtgt tgccagggtg aagtagtaaa gagaacacaa 5520 ccctacacct tgcaaaggct cacgtggctt aaattaacag aagaccatgt atttcaagta 5580 atgcccttgt ggagagctcg caaagggatt accatagact ttccctggtg cagggacaca 5640 aaaggattcc tggagccgtg gacaacgcaa gagtgttggc aaatagagta tcccttggag 5700 gatgagtgag gaaaccccag caggaagaga accgactgca gaggaaatat ttgagcaaga 5760 agcagaaagt tggaagagaa caagcgtgcg agtcccaaat gacatattac aaagatggct 5820 agcaatgctt aggcaaagag gaaatagaaa gaaagtgctt agggaaatgc aaaaatgggc 5880 atggaggaat cccacggcgc gggtgattcg gccgtgtgga tgtcggctat gtaaccccgg 5940 ctgggggagt aattaatcat aataaagcaa attgtaacat gctgtgtcag gtgtcttgca 6000 ggaatggcgg agataagaaa agaagcaaag gagccactaa tccagggtaa gtataaaaaa 6060 caggtaagta gaataactat agttatatta ctaacagtaa gagcagcact aggagcagaa 6120 tacataacca taatatcaga cccatatggg ttctctcccg tgagaaatgt gtcaggagta 6180 cctgtaactt gtgtgacaaa agaatttagt aagtggggat gtcagccaat aggagcctac 6240 ccagacccag acttagaata cagaaatata agtaaagaaa tattagagga agtatatcaa 6300 caagactggc cgtggaatac ttatcattgg ccattatggc aaatggataa tgtagtacaa 
6360 tgggcaaggc aaaatttaca ggataaccgc aaggaaaaaa gggacctggc agacctatta 6420 gcaggaaaaa taaggggaag attctgtgta ccctacccat ttgcgctcct ggagtgcatg 6480 gaatggtgct ggtgggttaa gaacactaat gcaggggggt atggagaagc agacataaga 6540 ataaattgct caagggcaag agcagtgagc tgcacaagtg aaatgccctt agcatcccta 6600 cagagggtat attgggaaaa ggaggaacga aaaaacatgg agaaaatgac catcaaacct 6660 tgcaataaaa atttggaatg caagaacaga aggggatgcg cagaagggta tccagtacct 6720 cccaaggcag agttattccc tccagcgttt caggatttac agccaaaagg gtacgcatat 6780 ggggcactta gagggaacag caaatttcca caaagagtgt cgctaagaac atgggtgaaa 6840 atagctaacc tgacaggatg ggaaaaagga aagccagcag aatggtggaa taccagccaa 6900 caggttcatt ggtttgatac cacgccacaa tatcatttag gatatgtatt atcccgagcg 6960 cctgagaaca ggagttgtaa tttcacaggg gaaatacgaa tagggcaaca tcagtttgag 7020 tataattaca ccctgacaaa gaattgcaca aaggagaagt ggaaagagta ccccatgtgg 7080 catgtctgga ggcatttaga tcaaaatgag cacttatcta gcatatgttt caaaagaccg 7140 agaagaaatg caacacaaat agggaacagt acactgcaag ggcaatgtaa tagaagtaat 7200 tggacaggat gccactgcaa tgagacaggg ataaacacaa catggagaat aaatggcaca 7260 aagggagctt atctcttaaa tagcactaat ggaaacatca tggtcttgtt atgctggaac 7320 acaacagtgg caggggtata tgagagtcag ctaaagtgga atgagagtct taaagacgga 7380 gactatgggc tctgttttaa ttcaacaaac aggaattgta ctagaaatgg agctcggcac 7440 tatgtaaaca agagagtgat aaaaaacgac acagcagatc ataattgtga tagcagcata 7500 tcagcaatag atggaatggt acatcaacaa atattactgc aaaggtatca agtaattaga 7560 gtaagagctt acacatacgg agtgattgat atgccagaca attatgagac cctaccagga 7620 aggagaagga gagatctcgc aaaggccagg aaaaagaggg gcgtgggcct ggtcatcatg 7680 ttagctatca tggccatagt ggctgctgca ggagcatctc tgggagtcgc gaacgcgatt 7740 cagcagtcct acaccaggga cgctgtccag actcttgcta acgcgactgc tgtgcaacag 7800 caggtgttag aggcgtccta tgccatgata cagcatgtgg ctaagggaat acgcatcctt 7860 gaagcacgcg tggcgagaat ggaagttatg atggatagaa tgatgttata tcaggaagta 7920 gactgctggc attatcacca atattgtgta acctctacaa gagcagacat agtgaattac 7980 attaattgga caaggtttaa agataattgc acatggcaag agtgggaaag ggagataagt 8040 
gcgcatgaag gaaacatcac tatattactc aaagaatcag caaggataac acaattagca 8100 caacaaaagg tacaaagaat accagatgtg tggacagcac taagggagtc actaggatgg 8160 acacaatggc tggcttggat aaaatacctt cccataatag tagtagggat attaggatgc 8220 ataatcataa gaataatgtt gtgtgtagta caaccagttc ttcagattta cagaaccttg 8280 actcagacca ggtatcaaca agtcaacttg gtgatggaga cccgggtgca actagaagaa 8340 gaagaagaag aagacggaag ggatggtgga gatggctcag agagatgcag cgatcccgac 8400 aacaaaggaa ttatgaacgc ctggaggaga gcttgggtga cttggagaaa ctcaccttgg 8460 cagaacacat ggaagaatgt ggtggtggcg ccgttggtga ttccgctgac aatcagaatt 8520 tggctccttg gagagaatgg agagaacccc taaaagaaaa ataaaaaggg tggactgtga 8580 ggactgtgag gcctaggagc gagatagaaa cttataggcc tctcttcccg gaaagctaac 8640 tcactgtgag aggaatagca agtcacagtg acactgctaa ttgtacccgc aaccctgaga 8700 tcatgcaaac cacaatcctg agattatgct gacatgtgta acagctgatg cctcagctga 8760 tgcttgctca tgctgacaat gtaactagga gctctatata aacagagccc tagagcttgc 8820 tacttcagag tgctctgagg agctcgaagg aaagagtcct cagcctctcc tctccgagga 8880 gcttcggctc ataataaagg agtgcttgct tcaacagaa 8919 <210> 3 <211> 720 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 3 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 <210> 4 <211> 720 <212> DNA <213> 
Caprine arthritis-encephalitis virus <400> 4 atggtgaggc aggcctccgg aaggggaaag gagtaccccg agctaaaaga atgtctgaaa 60 aaggcatgca aaataaaagt aagggctggg ggggagcgcc tgacagaagg aaattgtctc 120 tggtgtataa aaacactaga gtgtatgtat gaggattgta gggaggaacc ttggacccca 180 gaaaaatgta aacaattatg gaaaaagttg aagcaggtag agcctgagga gagtagcaaa 240 gcagactata actcgttaaa agcaaccttg gcggggatag tctgtgtgca aatgggaatg 300 cagcccgaga cactgcagga tgcgatagca accttaaaca tgagagatga agtaaaagga 360 aaggaaaagc catcagaaga aaagaaggga atatatccca tattagtgca ggcaggagga 420 ggaagagcat ggagagcggt agagcctgct acctttcagc agctccaaac agtggcaatg 480 cagcatggac tagtatcaga agaatttgaa aggcagctag catactatgc caccacatgg 540 acaagcaagg atatcttaga agtattagcc atgatgccag gaaatagagc gcaaaaagaa 600 ctaatacaag gaaagttaaa tgaggaagca gagagatgga gaaggcagaa tccacaacct 660 gcgggcgggt taaccgtgga tcagataatg ggggtaggac aaacgaatca ggcagcggca 720 <210> 5 <211> 1347 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 5 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 780 aggcaaatat gcctgcaatg ggtaataaat gcattaagag cagtaagaca tatggcgcac 840 aggccaggga atccaatgct agtaaagcaa aaaacgaatg agccatatga 
agattttgca 900 gcaagactgc tagaagcaat agatgcagag ccagttacac agcctataaa agattatcta 960 aagctaacac tatcttatac aaatgcatca gcagattgtc agaagcaaat ggatagaaca 1020 ctaggacaaa gagtacaaca agctagtgta gaagaaaaaa tgcaagcatg tagagatgtg 1080 ggatcagaag ggttcaaaat gcaattgtta gcacaagcat taaggccagg aaaaggaaaa 1140 gggaatggac agccacaaag gtgttacaac tgtggaaaac cgggacatca agcaaggcaa 1200 tgtagacaag gaatcatatg tcacaactgt ggaaagagag gacatatgca aaaagaatgc 1260 agaggaaaga gagacataag gggaaaacag cagggaaacg ggaggagggg gatacgtgtg 1320 gtgccgtccg ctcctcctat ggaataa 1347 <210> 6 <211> 1299 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 6 atggtgaggc aggcctccgg aaggggaaag gagtaccccg agctaaaaga atgtctgaaa 60 aaggcatgca aaataaaagt aagggctggg ggggagcgcc tgacagaagg aaattgtctc 120 tggtgtataa aaacactaga gtgtatgtat gaggattgta gggaggaacc ttggacccca 180 gaaaaatgta aacaattatg gaaaaagttg aagcaggtag agcctgagga gagtagcaaa 240 gcagactata actcgttaaa agcaaccttg gcggggatag tctgtgtgca aatgggaatg 300 cagcccgaga cactgcagga tgcgatagca accttaaaca tgagagatga agtaaaagga 360 aaggaaaagc catcagaaga aaagaaggga atatatccca tattagtgca ggcaggagga 420 ggaagagcat ggagagcggt agagcctgct acctttcagc agctccaaac agtggcaatg 480 cagcatggac tagtatcaga agaatttgaa aggcagctag catactatgc caccacatgg 540 acaagcaagg atatcttaga agtattagcc atgatgccag gaaatagagc gcaaaaagaa 600 ctaatacaag gaaagttaaa tgaggaagca gagagatgga gaaggcagaa tccacaacct 660 gcgggcgggt taaccgtgga tcagataatg ggggtaggac aaacgaatca ggcagcggca 720 caggctaata tggatcaagc aagacaaata tgcctacaat gggttataac agcaataaga 780 ggagttaggc atatggccca tagaccagga aatcccatgc tggtaagaca aaaaccaaat 840 gagaactatg aagagtttgc cgcaaggttg ttagaagcag tggatgcaga acccgttacc 900 caacctataa aagaatattt aaaggtaact ctgtcttaca caaatgcaaa ttcggaatgt 960 caaaaacata tggacagagt gttggggcaa agagtacagc aggcctcaat agaagaaaaa 1020 atgcaggcat gcagggacat cgggggaaca gcatatcaga tgcagttgct tgcacaagcc 1080 ctccgtggcg gaaaagaaga tgggaaaaaa tctgtaggga agtgttataa ctgtggaagg 1140 cccggacaca gagcaaaaga atgcagacaa 
ggcattatat gtcacaactg tggaaaaaga 1200 gggcatatac agaaaaactg caaacagaaa agaagaaagg agcagggaaa catgaggagg 1260 gggctacgtg tggtgccgtc cgcaccccct atggagtaa 1299 <210> 7 <211> 511 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 7 gagttctagg agagtccctc ctagtctctc ctctccgagg aggtaccgag acctcaaaat 60 aaaggagtga ttgccttact gccgagtgga gagtgattac tgagcggccg gtgtatcggg 120 agtcgtccct taatctgtgc aataccagag cggctctcgc agctggcgcc caacgtgggg 180 cccgaggaga agaaaagaaa gcggccctga gaactcggct tctgaaaaag aggaagagga 240 caagttgcta tagcaacaag agagaagaag tagagcaaag gtccagtggc tcggaaaaag 300 aggaactgaa acttcgggga cgcctgaagg agtaaggtaa gtgactctgc tgtacgcggg 360 gcgaggcaga ggtttccttc taaattgaaa gagaagtgtt gctgcgagag gtcttggtgg 420 tcgagaatcc tgtacaaaaa aaaggaggga tctcggtcag gaccaggacc cctgggagta 480 atacaacagc aacaccgtaa gaaaatccgc c 511 <210> 8 <211> 576 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 8 gtgagtgctc tgaggagctc gaaggaaaga gtcctcagcc tctcctctcc gaggagcttc 60 ggctcataat aaaggagtgc ttgcttcaac agaactgagc tggtcgtggt tattatcggg 120 gaccgaagtc ccgtgcaaca ccggggcggt tctcgcagct ggcgcccaac gtggggctcg 180 agtagcttga gaagctcgac tgagatctga atccaagagc gacatcagac agcaagaaat 240 gagagtaatg agaccgcgag ctctgctgct gtaaaaaaga ggaagtagcg ggttgccgag 300 gcaactgctc agaagaacca ggggaaaggg cttccagcaa cctcaaaaga ggaaccgaga 360 cttcggggac gcctgaagta aggtaagtga ctctgctgta cgcggggcga ggcataggag 420 atccttctat tctaggaaga gaagcgctgt tctgggaggt cttggcgacc gagaatcttg 480 ttaaataagc caggatctcg atcaggacca agacccctca ggagagggta tagacagcgt 540 ggtaagaaat ccgccgtggt gagtctagat agagac 576 <210> 9 <211> 3318 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 9 atgtcacaac tgtggaaaga gaggacatat gcaaaaagaa tgcagaggaa agagagacat 60 aaggggaaaa cagcagggaa acgggaggag ggggatacgt gtggtgccgt ccgctcctcc 120 tatggaataa cttcagcacc acctatggtt caggtccgca taggttccca gcagaggaac 180 ttgttatttg ataccggggc ggaccgaact atagttagat ggcatgaggg ctcgggaaac 240 ccagccggaa ggataaaact gcaaggaata ggaggaatag tagaaggaga 
aaaatggaat 300 aatgtagaat tagaatataa aggagaaaca agaaagggaa caatagtagt gttaccacaa 360 agtccagtag aagtattagg acgagataac atggcccgat ttggaataaa gataataatg 420 gcaaatttag aggaaaaaag aatcccaatt acaaaagtaa aattgaaaga gggatgtacg 480 ggtccacatg tcccacaatg gccattaaca gaagagaaat taaaaggtct aacagaaatc 540 atagataaat tagtggaaga aggaaaacta ggaaaggcac ccccacattg gacatgtaat 600 actccaatct tttgcataaa aaagaaatca gggaagtgga gaatgttaat agatttcaga 660 gaattgaaca aacagacaga agatttaaca gaagcgcagt taggactccc gcatccggga 720 ggactacaaa agaaaaaaca tgttacaata ttggacatag gagatgcata ttttactata 780 cccctatatg aaccatatcg agagtacaca tgttttactc tattaagtcc taataatcta 840 ggaccatgta aaagatacta ttggaaagtg ctgccacaag gttggaaatt gagtccatct 900 gtatatcaat ttactatgca ggagatctta gaggattgga tacagcagca tccagaaatt 960 caatttggca tatatatgga tgatatttac ataggaagtg atttagaaat taaaaagcat 1020 agagaaatag tgaaagattt agccaattat attgcccaat atggattcac tctgccagaa 1080 gagaagagac aaaagggata tccagcaaaa tggctaggat ttgaactaca cccgcagacc 1140 tggaaatttc agaagcatac attacctgaa ttaacaaagg gaacaataac attaaataaa 1200 ttacagaaat tagtaggaga attagtatgg agacaatcca taattgggaa aagcattcct 1260 aacattctga aattaatgga aggagataga gaattacaaa gtgaaagaaa aattgaagaa 1320 gtacatgtga aagaatggga agcatgtagg aaaaaattag aagaaatgga aggaaattat 1380 tataataaag acaaagatgt ctatggacaa ttggcttggg gagacaaagc tatagaatat 1440 atagtgtatc aggagaaagg gaaaccatta tgggtaaatg tggttcacaa tataaagaac 1500 ctaagcatcc cgcaacaggt tattaaagca gcgcaaaaat taacccaaga agtcatcatt 1560 aggacaggaa aaataccatg gatattgttg ccagggaaag aagaagattg gagactagaa 1620 ttgcaattag ggaacatcac atggatgcca aaattttggt cctgttatcg aggacataca 1680 agatggagaa aaagaaatat aatagaagaa gtagtagaag ggcctacata ttatacagat 1740 ggaggaaaaa agaataaagt aggaagtcta gggttcatag tatcaacagg ggaaaaattt 1800 agaaagcatg aagagggcac aaaccagcaa ctagaattaa gagccataga ggaagctcta 1860 aaacaagggc ctcaaacaat gaatttagta acagatagta gatatgcatt tgaattttta 1920 ttaagaaatt gggatgaaga agtaataaag aatccaattc aagcaagaat tatggaaatt 1980 
gcccacaaga aagataggat aggagtgcat tgggtgccag gacataaagg gattccccaa 2040 aatgaagaaa tagacaaata tatttcggaa atatttcttg caaaagaagg agaaggaatt 2100 ctcccaaaaa gagaagagga tgcagggtat gatttaatat gcccagaaga ggttaccata 2160 gagccaggac aagtgaaatg catccccata gagctaagat taaatttaaa gaaatcacaa 2220 tgggctatga ttgctacaaa aagcagcatg gctgccaaag gagtgttcac acaaggagga 2280 atcatagact caggatatca gggacaaata caggtaataa tgtataatag caataaaata 2340 gcagtagtca taccccaagg gagaaaattt gcacaattaa tattaatgga taaaaagcat 2400 ggaaaattgg aaccctgggg ggaaagcaga aaaacagaaa ggggagaaaa aggatttggg 2460 tctacaggaa tgtattggat agaaaatatt cctctggcag aggaagacca cacaaaatgg 2520 catcaagatg cccgatcatt gcatctagaa tttgaaattc caagaacagc agcagaagac 2580 atagtaaatc aatgtgaaat atgcaaagaa gcgaggacac ctgcagtaat tagaggcgga 2640 aacaaaaggg gggtaaatca ttggcaagtg gattataccc attatgaaaa tatcatacta 2700 ttagtatggg tagaaacaaa ttcaggacta atatatgcag aaaaagtaaa aggagaatca 2760 gggcaagaat tcagaataaa agtgatgcat tggtatgcat tatttggtcc agagtcattg 2820 cagtcagaca atggacctgc atttgcagca gagcccacac agctgttaat gcaataccta 2880 ggagtaaaac acacaacagg cataccttgg aatccacagt ctcaggctat agtagaaagg 2940 gcacatcaac tattgaaaag cactttaaag aagttccagc cacaatttgt cgctgtagaa 3000 tcagccatag cagcagccct agtcgccata aatataaaaa gaaagggtgg gctggggaca 3060 agccctatgg atatttttat atataataaa gaacagaaaa gaataaataa taaatataat 3120 aaaaattctc aaaaaattca attctgttat tacagaataa ggaaaagagg acatcaggag 3180 agtggaaagg accaacccag gtactgtgga aaggggaagg agccaattgt ggtaaaggat 3240 atagaaagtg aaaagtattt agtaatacct tacaaagatg caaaattcat cccgccacca 3300 acaaaagaaa aggaataa 3318 <210> 10 <211> 3324 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 10 atgcagacaa ggcattatat gtcacaactg tggaaaaaga gggcatatac agaaaaactg 60 caaacagaaa agaagaaagg agcagggaaa catgaggagg gggctacgtg tggtgccgtc 120 cgcaccccct atggagtaac gcaagcacca ctaatagtta gggtacaaat agggaatcag 180 gagaaacaat tattatttga cacaggggca gataaaacga tagtaagaat gcatgatgga 240 acagggattc caaacggaag aataaaatta caagggatag 
gaggaatagt agaaggagaa 300 aaatggaata aagtacccat gacatataag ggagaaacat cctgcccaag cttggttgtg 360 ctaagagata gcccagtaga agtattggga agagataaca tggaagcatt cggcgtaacc 420 ctaataatgg caaatttaga agataagaaa attcccacaa taccagtaga attgaaagaa 480 ggatgtaaag ggccacatgt gccccagtgg ccattaacag cagagaaatt acaaggacta 540 acaggaatag tagaaaaatt actacaggaa ggaaaattgg cagaggcccc agagggatgg 600 acgtggaaca cgcccatctt ctgcataaaa aagaagtcag gaaaatggag aatgttaata 660 gattttaggg aattaaataa gcaaacagca gatttagcag aagcgcagct aggactgcca 720 cacccaggag ggttgcaaag gaaaaagaat gtaacaattc tggacatagg agatgcatat 780 ttcacaattc ccttatacga gccctatcag aaatatacat gcttcacact cctaagtcct 840 aacaatttgg gaccatgtaa aaggtattat tggaaagtat taccccaggg atggaaattg 900 agcccagctg tatatcaatt caccatgcaa aggttgttaa aaggatggat acaacagcat 960 aaaaacatac aatttggaat atatatggat gatatctata ttggaagtga tctaacgata 1020 gcccaacata ggaagataat agaagaatta gcctcattta tagaacaatt tgggtttaca 1080 ttaccagaag ataagagaca agagggctat ccagcaaaat ggctaggatt cgagctacat 1140 ccagaaaaat ggaaatatca aaagcataaa ttgccggaat tacaagaggg ggtaataacc 1200 ctgaacaaat tacagaagat agtaggggaa ttagtgtgga gacaatcctt gataggaaag 1260 agcatcccca atatcataaa attaatggaa ggagatcgcg cattacaaag tgaaaggaaa 1320 atagaaagaa tacatgtaca agaatgggaa gcatgtcaaa agaaattaga tgaaatggta 1380 ggaaattatt acagagaaga agaagatatc tatggacaaa taacttgggg ggataaggca 1440 ataaaataca tagtattcca aaggaaaggg gaacccctat gggtaaatgt agtacatgac 1500 ataaaaaatt tgagtctccc acagcaagtg ataaaagcag cacagaaatt aacccaggaa 1560 gtaatcataa gaacaggaaa aatcccatgg ctgctactac caggaagaga agaagactgg 1620 agattagaac tgcaggtagg gaacatcacg tggatgccat cattttggtc atgttatcga 1680 ggagcaccca agtggaaaag aaggaacata gtggcagcag tggtagatgg accgacatat 1740 tatacagatg ggggaaagaa aaacgcacag ggaagctttg gcttcatctc cccaacagga 1800 gaaaagttca gaaggcatga agatggaact aatcaggtat tagaattaag ggcaatagaa 1860 gatccatgta aacaaggacc tgaaagcatg aacattgtaa ctgacagcag gtatgcttat 1920 gaattcatgc tccgaaactg ggatgaacag gtcataagaa accccattca ggcaagaatc 
1980 atggcagaag tgcacaagaa aaagcaggta ggaatacact gggtgccagg gcataaagga 2040 atacctcaga atgaagagat agaccagtac atatcagaag tattcttagc acgagaagga 2100 acagggatat gtgaaaaaag gaaggaagat gctggatatg atttattatg cccgcatgag 2160 gtaatactta aaccccaaga agtaaaacgg atcccaatag acctaaaatt aaaattgaaa 2220 gaaaagcaat gggccatgat aagtgggaaa agtagcgttg cagcaaaagg aatatttgta 2280 caaggaggca taatagattc agggtatcag ggacaagtac aagtcatcct atataatagt 2340 aataagatag aggtcaaaat accacaaggc aggaaatttg cccaattaat attaatgaac 2400 ttacaacatg aagaattaga agaatgggga aaggaaagaa aaacagaaag aggaacaaaa 2460 ggatttgggt ctacaggagc attttggata gagaatattc cccaagcaga ggaagaacat 2520 tacaaatggc atcaagatgc tagatctctg cagctagaat tcaagatacc tagagcagca 2580 gcagaagaca ttatacagca ctgtgaggta tgtcaagaag gcaaacccgc agcgatcacg 2640 agagggggaa ataaaagagg aatagatcat tggcaggtag actatacaca ttacaaagaa 2700 cacataatat tagtatgggt agagactaat tcaggattaa tatttgcaga gaaagtaaaa 2760 ggagaatcag gacaagaatt taggatgcag acattgaaat ggtatgcttt gtttcaacca 2820 aaatcagtgc aatcagataa tgggacagcc ttcacagctg aggctacgca gcatctaatg 2880 aagtatttag ggattcagca cactacgggt attccgtgga acccccagtc acaaagttta 2940 gtagaaagag ctcatcaaac attaaaacac atgttagaaa aattagaacc acaatttgtg 3000 gccctacagt ctgccatcgc agccactcta gttgcgctca atataaaaag aaagggtgga 3060 ctaggggcaa gccctatgga tatttacata tataataagg agcaacaaag acaacaagat 3120 aatagtaata aattaattca gaaaaaattt tgttattaca ggatcagaaa aagaggccat 3180 ccaggagagt ggaacggccc aactgaggta ctgtgggaag gggaaggagc catagtagtt 3240 aaagacaaag aaagtgatag atatctagtc atcccatata aagatgcaaa atttattccg 3300 ccaccgtcgg aacagaaggg atag 3324 <210> 11 <211> 402 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 11 atggatgctg gggccagata catgcgctta actgggaagg aaaactgggt tgaagtaacc 60 atggacggag agaaggaaag gaaaagagaa ggtttcactg cgggacagca agatatacag 120 aactctaagt accccgacat accaacgggt cacagtcatc atggaaacaa gagcagacgt 180 cgcaggagaa aatcaggatt ttggcgatgg cttagaggaa tcagacaaca gcgaaacaag 240 cgaaagagtg acagtacaga aagcttggag 
ccgtgcctgg gagctttggc agaactcacc 300 ctggaaggag ccatggaaaa ggggcctgct gaggctgctc gtccttccgc tgacgatggg 360 aatctggata aatggatggc ttggagaaca ccacaaaaat aa 402 <210> 12 <211> 321 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 12 atggcggaga taagaaaaga agcaaaggag ccactaatcc aggaccaggt atcaacaagt 60 caacttggtg atggagaccc gggtgcaact agaagaagaa gaagaagaag acggaaggga 120 tggtggagat ggctcagaga gatgcagcga tcccgacaac aaaggaatta tgaacgcctg 180 gaggagagct tgggtgactt ggagaaactc accttggcag aacacatgga agaatgtggt 240 ggtggcgccg ttggtgattc cgctgacaat cagaatttgg ctccttggag agaatggaga 300 gaacccctaa aagaaaaata a 321 <210> 13 <211> 690 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 13 atgcaaaatt catcccgcca ccaacaaaag aaaaggaata aaaaacctgg accagaatta 60 cccttagcac tatggataca tatagcagaa agcattaatg gggatagctc atggtacata 120 acaatgagac tgcaacagat gatgtgggga aaaagaggaa ataagttaca atataagaat 180 gaagacaggg aatatgaaaa ttgggaaatt acatcatggg gatggaaaat gcacctaagg 240 agagtgaaac aatggataca agacaacagg agaggaagcc catggcagta caaagtagga 300 ggaacatgga aaagtatagg agtgtggttc ctgcaagcag gagattacag aaaggtagac 360 aggcacttct ggtgggcatg gaggatactg atatgttcct gcaggaaaga aaagtttgat 420 ataagagaat ttatgagagg aagacataga tgggatttgt gcaaatcctg tgctcaagga 480 gaagtagtaa agcatactag aacaaaaagt ctggaaagac tagtactgct acagatggta 540 gaacagcatg tgtttcaagt attgccattg tggagagcca ggagaagtag tacaacagat 600 ttcccatggt gcagggacac aacgggatac acgcatgcgt ggtctgtcca ggagtgctgg 660 ttgatggaat atctcttaga ggatgagtga 690 <210> 14 <211> 490 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 14 atgcaaaatt tattccgcca ccgtcggaac agaagggata gaagaatagg caaaaagtca 60 tccaagaggg catacttcgc catggcaata tcggagacag gatggatgga aggatgtggg 120 aacgtggttc ctacagccag gggactatag aaaggcggat cagcagttct ggttcgcttg 180 gagaatagtg tcgtgttcat gtaaaaagga aggatttaac ataagagaat ttatgctagg 240 tacccataga tgggatttgt gtaagtcgtg ttgccagggt gaagtagtaa agagaacaca 300 accctacacc ttgcaaaggc tcacgtggct taaattaaca gaagaccatg tatttcaagt 
360 aatgcccttg tggagagctc gcaaagggat taccatagac tttccctggt gcagggacac 420 aaaaggattc ctggagccgt ggacaacgca agagtgttgg caaatagagt atcccttgga 480 ggatgagtga 490 <210> 15 <211> 720 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 15 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 <210> 16 <211> 591 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 16 gctgtagact ctgtaatgtt ccaacaaatg caaacagtag caatgcagca tggcctcgtg 60 tccgaggatt ttgaaagaca gttagcatat tatgctacta cctggacaag taaagacata 120 ctagaagtat tggccatgat gcctgggaat agggctcaga aagaacttat tcaagggaaa 180 ttgaatgaag aagcagacag gtggagaagg aacaatccac caggaggatt aacagtggat 240 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 300 agacaaatat gcctacaatg ggtaataaac gccttaagag cagtaaggca tatggctcat 360 aggccaggga atccaatgct agtaaagcaa aaaacaaatg agccatatga agaatttgca 420 gcaagactgc tagaagcaat agatgcagaa gcggttacac agcccataaa agagtatcta 480 aagctaacat tatcctatac aaatgcagcc tcagattgtc aaaagcaaat ggagagagtg 540 ctaggacaaa gagtacaaca ggctagtgta gaaaaaaaaa tgcaagcatg t 591 <210> 17 <211> 1347 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 17 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 
60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 780 aggcaaatat gcctgcaatg ggtaataaat gcattaagag cagtaagaca tatggcgcac 840 aggccaggga atccaatgct agtaaagcaa aaaacgaatg agccatatga agattttgca 900 gcaagactgc tagaagcaat agatgcagag ccagttacac agcctataaa agattatcta 960 aagctaacac tatcttatac aaatgcatca gcagattgtc agaagcaaat ggatagaaca 1020 ctaggacaaa gagtacaaca agctagtgta gaagaaaaaa tgcaagcatg tagagatgtg 1080 ggatcagaag ggttcaaaat gcaattgtta gcacaagcat taaggccagg aaaaggaaaa 1140 gggaatggac agccacaaag gtgttacaac tgtggaaaac cgggacatca agcaaggcaa 1200 tgtagacaag gaatcatatg tcacaactgt ggaaagagag gacatatgca aaaagaatgc 1260 agaggaaaga gagacataag gggaaaacag cagggaaacg ggaggagggg gatacgtgtg 1320 gtgccgtccg ctcctcctat ggaataa 1347 <210> 18 <211> 591 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 18 gctgtagact ctgtaatgtt ccaacaaatg caaacagtag caatgcagca tggcctcgtg 60 tccgaggatt ttgaaagaca gttagcatat tatgctacta cctggacaag taaagacata 120 ctagaagtat tggccatgat gcctgggaat agggctcaga aagaacttat tcaagggaaa 180 ttgaatgaag aagcagacag gtggagaagg aacaatccac caggaggatt aacagtggat 240 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 300 agacaaatat gcctacaatg ggtaataaac gccttaagag cagtaaggca 
tatggctcat 360 aggccaggga atccaatgct agtaaagcaa aaaacaaatg agccatatga agaatttgca 420 gcaagactgc tagaagcaat agatgcagaa gcggttacac agcccataaa agagtatcta 480 aagctaacat tatcctatac aaatgcagcc tcagattgtc aaaagcaaat ggagagagtg 540 ctaggacaaa gagtacaaca ggctagtgta gaaaaaaaaa tgcaagcatg t 591 <210> 19 <211> 720 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 19 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 <210> 20 <211> 662 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 20 tcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tagtatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaatag 180 agctcaaaaa gagttaattc aagggaaatt gaatgaggaa gcagaaaggt ggagaagaaa 240 taatccacca cctcaagcag gcggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcaa gcagcggcac aggctaacat ggatcaggca agacaaatat gtctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aaaacaaatg agtcatatga agattttgcc gcaagactgc tagaagcaat 480 agatgcagaa ccagttacac agcaaataaa agaatattta aagttaacat tatcttacac 540 aaatgcatcc tcagactgtc agaaacagat ggatagagta ctaggacaga gagtgcaaca 600 agctagtgtg 
gaagaaaaaa tgcaagcatg cagagatgtg ggatcagaag gattcagaat 660 gc 662 <210> 21 <211> 662 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 21 gcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tagcatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaatag 180 agctcaaaaa gagttaattc aagggaaatt gaatgaggaa gcagaaaggt ggagaaggaa 240 taatccacca cctcaagcag gcggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcaa gcagcggcac aggctaacat ggatcaggca agacaaatat gcctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aagacaaatg agtcatatga agattttgcc gcaagactgc tagaagcaat 480 agatgcagaa ccagttacac agcaaataaa agaatattta aagttaacat tatcttacac 540 aaatgcatcc tcagactgtc aaaaacaaat ggatagaata ctaggacaga gagtgcaaca 600 agctagtgtg gaagaaaaaa tgcaagcatg cagagatgtg ggatcagaag ggttcagaat 660 gc 662 <210> 22 <211> 651 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 22 gcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tggcatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaacag 180 agctcaaaaa gagttaattc aggggaaatt gaataaggaa gcagaaaggt ggagaaggaa 240 taatccacca cctcaagcac aaggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcag gcagcggcac aggctaacat ggatcaggca agacaaatat gcctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aagacaaatg agtcatatga agattttgcc gcaagactgc tagaagcaat 480 agatgcagaa ccagttacac agcaaataaa agaatattta aagttaacat tatcttacac 540 aaatgcatcc tcagactgtc agaaacaaat ggatagagta ctaggacaga gagtgcaaca 600 agctagtgtg gaagaaaaaa tgcaagcatg cagagatgtg ggatcagaag g 651 <210> 23 <211> 520 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 23 gcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tagcatatta 120 tgctactacc 
tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaatag 180 agctcaaaaa gagttaattc aagggaaatt gaatgaggaa gcagaaaggt ggagaaggaa 240 taatccacca cctcaagcag gcggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcaa gcagcggcac aggctaacat ggatcaggca agacaaatat gcctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aagacaaatg agtcatatga agattttgcc gcaagactgc tagaagcaat 480 agatgcagaa ccagttacac agcaaataaa gaatatttaa 520 <210> 24 <211> 663 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 24 gcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tagcatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaatag 180 agctcaaaaa gagttaattc aagggaaatt gaatgaggaa gcagaaaggt ggagaaggaa 240 taatccacca cctcaagcag gcggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcaa gcagcggcac aggctaacat ggatcaggca agacaaatat gcctgcaatg 360 ggtaataaca gcgactaaga gcagtgagac atatggctca caaaccaggg aatccaatgc 420 tagtaaagca aaagacaaat gagtcatatg aaaaattttc agcaagactc ctagaagcaa 480 tagatgcaga accagttaca cagcctataa aagaatattt aaagttaaca ttatcttaca 540 caaatgcatc ctcagactgt caaaaacaaa tggatagagt actaggacag agagtgcaac 600 aagctagtgt ggaagaaaaa atgcaagcat gcagagatgt gggatcagaa ggattcagaa 660 tgc 663 <210> 25 <211> 1347 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 25 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg 
gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 780 aggcaaatat gcctgcaatg ggtaataaat gcattaagag cagtaagaca tatggcgcac 840 aggccaggga atccaatgct agtaaagcaa aaaacgaatg agccatatga agattttgca 900 gcaagactgc tagaagcaat agatgcagag ccagttacac agcctataaa agattatcta 960 aagctaacac tatcttatac aaatgcatca gcagattgtc agaagcaaat ggatagaaca 1020 ctaggacaaa gagtacaaca agctagtgta gaagaaaaaa tgcaagcatg tagagatgtg 1080 ggatcagaag ggttcaaaat gcaattgtta gcacaagcat taaggccagg aaaaggaaaa 1140 gggaatggac agccacaaag gtgttacaac tgtggaaaac cgggacatca agcaaggcaa 1200 tgtagacaag gaatcatatg tcacaactgt ggaaagagag gacatatgca aaaagaatgc 1260 agaggaaaga gagacataag gggaaaacag cagggaaacg ggaggagggg gatacgtgtg 1320 gtgccgtccg ctcctcctat ggaataa 1347 <210> 26 <211> 662 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 26 tcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tagtatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaatag 180 agctcaaaaa gagttaattc aagggaaatt gaatgaggaa gcagaaaggt ggagaagaaa 240 taatccacca cctcaagcag gcggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcaa gcagcggcac aggctaacat ggatcaggca agacaaatat gtctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aaaacaaatg agtcatatga agattttgcc gcaagactgc tagaagcaat 480 agatgcagaa ccagttacac agcaaataaa agaatattta aagttaacat tatcttacac 540 aaatgcatcc tcagactgtc agaaacagat ggatagagta ctaggacaga gagtgcaaca 600 agctagtgtg gaagaaaaaa tgcaagcatg cagagatgtg ggatcagaag gattcagaat 660 gc 662 <210> 27 <211> 662 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 27 gcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca 
atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tagcatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaatag 180 agctcaaaaa gagttaattc aagggaaatt gaatgaggaa gcagaaaggt ggagaaggaa 240 taatccacca cctcaagcag gcggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcaa gcagcggcac aggctaacat ggatcaggca agacaaatat gcctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aagacaaatg agtcatatga agattttgcc gcaagactgc tagaagcaat 480 agatgcagaa ccagttacac agcaaataaa agaatattta aagttaacat tatcttacac 540 aaatgcatcc tcagactgtc aaaaacaaat ggatagaata ctaggacaga gagtgcaaca 600 agctagtgtg gaagaaaaaa tgcaagcatg cagagatgtg ggatcagaag ggttcagaat 660 gc 662 <210> 28 <211> 651 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 28 gcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tggcatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaacag 180 agctcaaaaa gagttaattc aggggaaatt gaataaggaa gcagaaaggt ggagaaggaa 240 taatccacca cctcaagcac aaggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcag gcagcggcac aggctaacat ggatcaggca agacaaatat gcctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aagacaaatg agtcatatga agattttgcc gcaagactgc tagaagcaat 480 agatgcagaa ccagttacac agcaaataaa agaatattta aagttaacat tatcttacac 540 aaatgcatcc tcagactgtc agaaacaaat ggatagagta ctaggacaga gagtgcaaca 600 agctagtgtg gaagaaaaaa tgcaagcatg cagagatgtg ggatcagaag g 651 <210> 29 <211> 520 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 29 gcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tagcatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaatag 180 agctcaaaaa gagttaattc aagggaaatt gaatgaggaa gcagaaaggt ggagaaggaa 240 taatccacca cctcaagcag gcggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcaa gcagcggcac aggctaacat 
ggatcaggca agacaaatat gcctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aagacaaatg agtcatatga agattttgcc gcaagactgc tagaagcaat 480 agatgcagaa ccagttacac agcaaataaa gaatatttaa 520 <210> 30 <211> 662 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 30 gcaagcagca ggagggagaa gctggaaagc agtagactca gtgatgttcc agcaactgca 60 aaatgtagca atgcagcatg gcctcgtgtc cgaggatttt gaaaggcagt tagcatatta 120 tgctactacc tggacaagta aagatatatt agaagtattg gccatgatgc ctggaaatag 180 agctcaaaaa gagttaattc aagggaaatt gaatgaggaa gcagaaaggt ggagaaggaa 240 taatccacca cctcaagcag gcggaggatt aacagtggat caaattatgg gggtaggaca 300 aacaaatcaa gcagcggcac aggctaacat ggatcaggca agacaaatat gcctgcaatg 360 ggtaataaca gcactaagag cagtgagaca tatggctcac aaaccaggga atccaatgct 420 agtaaagcaa aagacaaatg agtcatatga aaaattttca gcaagactcc tagaagcaat 480 agatgcagaa ccagttacac agcctataaa agaatattta aagttaacat tatcttacac 540 aaatgcatcc tcagactgtc aaaaacaaat ggatagagta ctaggacaga gagtgcaaca 600 agctagtgtg gaagaaaaaa tgcaagcatg cagagatgtg ggatcagaag gattcagaat 660 gc 662 <210> 31 <211> 720 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 31 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 
720 <210> 32 <211> 597 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 32 gcagtcgatg ctgtaatgtt ccagcaaatg caaacagtag ccatgcagca tggtcttgtg 60 tctgaggact ttgaaaggca gttagcatat tgtgctacta cctggacaag taaagatata 120 ttagaagtat tggccatgat gcctggaaat agagctcaaa aagagttaat tcaaggaaaa 180 ttaaacgagg aagcagaaag gtggagaagg aataatccac cgcctccaca aggaggggga 240 ttaacagtgg atcaaattat ggggatagga caaacaaatc aagcagcagc acaagctaac 300 atggatcagg caagacacat atgcctgcaa tgggtaataa cagcattaag agcagtaaga 360 catatggctc acagaccagg gaatccaatg ctcgtaaaac aaaaaacaaa tgagccatat 420 gaagagtttg cagcaaaact attagaagca atagatgcag aaccagtaac acagcccata 480 aaagactatc taaagttaac attatcttat acaaatgcgt cctcagactg tcaaaagcaa 540 atggatagag tgctgggaca aagagtgcaa caagctagtg tagacgagaa aatgcaa 597 <210> 33 <211> 597 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 33 gcagtagact cagtaatgtt ccagcaactg caaacagtag caatgcagca tggcctcgtg 60 tccgaggatt ttgaaaggca gttggcatat tatgctacta cctggacgag taaagacata 120 ctagaagtat tggccatgat gcctggaaac agagctcaaa aggagttaat tcaagggaaa 180 ttaaatgaag aggcagaaag gtggagaaga cataatccac cccctccggc gggaggagga 240 ttaacagtgg atcaaattat gggggtagga caaacaaatc aagcagcagc acaagctaac 300 atggatcagg caagacaaat atgcctgcaa tgggtaataa cagcattaag agcagtgagg 360 tatatgactc acaaaccagg gaatccaatg ctagtaaaac aaaaaacaaa tgaagcatat 420 gaagagttta cagcgagact gctagaagca atagatgcag agccagtaac acagcccaca 480 aaagaatatc taaaactaac attatcttat acaaatgcat cctcagactg tcaaaagcaa 540 atggatagag tactaggaca aagagtgcaa caagctagtg tagaagaaaa aatgcaa 597 <210> 34 <211> 597 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 34 gcagtcgatg ctgtaatgtt ccagcaaatg caaacagtag ccatgcagca tggtcttgtg 60 tctgaggact ttgaaaggca gttagcatat tatgctacta cctggacaag taaagatata 120 ttagaagtat tggccatgat gcctggaaat agagctcaaa aagagttaat tcaaggaaaa 180 ttaaatgagg aagcagaaag gtggagaagg aataatccac cgcctccaca gggaggggga 240 ttaacagtgg atcaaattat ggggatagga caaacaaatc aagcagcagc acaagctaac 300 atggatcagg 
caagacacat atgcctgcaa tgggtaataa cagcattaag agcagtaaga 360 catatggctc acagaccagg gaatccaatg ctcgtaaaac aaaaaacaaa tgagccatat 420 gaagagtttg cagcaaaact attagaagca atagatgcag aaccagtaac acagctcata 480 aaagactatc taaagttaac attatcttat acaaatgcgt cctcagactg tcaaaagcaa 540 atggatagag tgctgggaca aagagtgcaa caagctagtg tagacgagaa gatgcaa 597 <210> 35 <211> 1347 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 35 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 780 aggcaaatat gcctgcaatg ggtaataaat gcattaagag cagtaagaca tatggcgcac 840 aggccaggga atccaatgct agtaaagcaa aaaacgaatg agccatatga agattttgca 900 gcaagactgc tagaagcaat agatgcagag ccagttacac agcctataaa agattatcta 960 aagctaacac tatcttatac aaatgcatca gcagattgtc agaagcaaat ggatagaaca 1020 ctaggacaaa gagtacaaca agctagtgta gaagaaaaaa tgcaagcatg tagagatgtg 1080 ggatcagaag ggttcaaaat gcaattgtta gcacaagcat taaggccagg aaaaggaaaa 1140 gggaatggac agccacaaag gtgttacaac tgtggaaaac cgggacatca agcaaggcaa 1200 tgtagacaag gaatcatatg tcacaactgt ggaaagagag gacatatgca aaaagaatgc 1260 agaggaaaga gagacataag gggaaaacag cagggaaacg ggaggagggg gatacgtgtg 1320 gtgccgtccg ctcctcctat ggaataa 1347 
<210> 36 <211> 597 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 36 gcagtcgatg ctgtaatgtt ccagcaaatg caaacagtag ccatgcagca tggtcttgtg 60 tctgaggact ttgaaaggca gttagcatat tgtgctacta cctggacaag taaagatata 120 ttagaagtat tggccatgat gcctggaaat agagctcaaa aagagttaat tcaaggaaaa 180 ttaaacgagg aagcagaaag gtggagaagg aataatccac cgcctccaca aggaggggga 240 ttaacagtgg atcaaattat ggggatagga caaacaaatc aagcagcagc acaagctaac 300 atggatcagg caagacacat atgcctgcaa tgggtaataa cagcattaag agcagtaaga 360 catatggctc acagaccagg gaatccaatg ctcgtaaaac aaaaaacaaa tgagccatat 420 gaagagtttg cagcaaaact attagaagca atagatgcag aaccagtaac acagcccata 480 aaagactatc taaagttaac attatcttat acaaatgcgt cctcagactg tcaaaagcaa 540 atggatagag tgctgggaca aagagtgcaa caagctagtg tagacgagaa aatgcaa 597 <210> 37 <211> 597 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 37 gcagtagact cagtaatgtt ccagcaactg caaacagtag caatgcagca tggcctcgtg 60 tccgaggatt ttgaaaggca gttggcatat tatgctacta cctggacgag taaagacata 120 ctagaagtat tggccatgat gcctggaaac agagctcaaa aggagttaat tcaagggaaa 180 ttaaatgaag aggcagaaag gtggagaaga cataatccac cccctccggc gggaggagga 240 ttaacagtgg atcaaattat gggggtagga caaacaaatc aagcagcagc acaagctaac 300 atggatcagg caagacaaat atgcctgcaa tgggtaataa cagcattaag agcagtgagg 360 tatatgactc acaaaccagg gaatccaatg ctagtaaaac aaaaaacaaa tgaagcatat 420 gaagagttta cagcgagact gctagaagca atagatgcag agccagtaac acagcccaca 480 aaagaatatc taaaactaac attatcttat acaaatgcat cctcagactg tcaaaagcaa 540 atggatagag tactaggaca aagagtgcaa caagctagtg tagaagaaaa aatgcaa 597 <210> 38 <211> 597 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 38 gcagtcgatg ctgtaatgtt ccagcaaatg caaacagtag ccatgcagca tggtcttgtg 60 tctgaggact ttgaaaggca gttagcatat tatgctacta cctggacaag taaagatata 120 ttagaagtat tggccatgat gcctggaaat agagctcaaa aagagttaat tcaaggaaaa 180 ttaaatgagg aagcagaaag gtggagaagg aataatccac cgcctccaca gggaggggga 240 ttaacagtgg atcaaattat ggggatagga caaacaaatc aagcagcagc acaagctaac 300 atggatcagg 
caagacacat atgcctgcaa tgggtaataa cagcattaag agcagtaaga 360 catatggctc acagaccagg gaatccaatg ctcgtaaaac aaaaaacaaa tgagccatat 420 gaagagtttg cagcaaaact attagaagca atagatgcag aaccagtaac acagctcata 480 aaagactatc taaagttaac attatcttat acaaatgcgt cctcagactg tcaaaagcaa 540 atggatagag tgctgggaca aagagtgcaa caagctagtg tagacgagaa gatgcaa 597 <210> 39 <211> 720 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 39 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 <210> 40 <211> 524 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 40 taaagatata ttagaagtat tggccatgat gcctggaaat agagctcaaa aagagttaat 60 tcaagggaaa ttgaatgaag aagcagaaag gtggagaagg aataatccac cacctcaagc 120 aggcggagga ttaacagtgg atcaaattat gggggtagga caaacaaatc aagcagcggc 180 acaggctaac atggatcagg caagacaaat atgcctgcaa tgggtaataa cagcactaag 240 agcagtgaga catatggctc acaaaccagg gaatccgatg ctagtaaagc aaaaaacaaa 300 tgagtcatat gaagattttg ccgcaagact gctagaagca atagatgcag aaccagttac 360 aaagcaaata aaagaatatt taaagttaac attatcttac acaaatgcat cctcagactg 420 taagaaacag atggatagag tactaggaca gagagtgcaa caagctagtg tggaagaaaa 480 aatgcaagca tgcagagatg tgggatcaga aggattcaga atgc 524 <210> 41 <211> 1347 <212> DNA <213> Caprine 
arthritis-encephalitis virus <400> 41 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 780 aggcaaatat gcctgcaatg ggtaataaat gcattaagag cagtaagaca tatggcgcac 840 aggccaggga atccaatgct agtaaagcaa aaaacgaatg agccatatga agattttgca 900 gcaagactgc tagaagcaat agatgcagag ccagttacac agcctataaa agattatcta 960 aagctaacac tatcttatac aaatgcatca gcagattgtc agaagcaaat ggatagaaca 1020 ctaggacaaa gagtacaaca agctagtgta gaagaaaaaa tgcaagcatg tagagatgtg 1080 ggatcagaag ggttcaaaat gcaattgtta gcacaagcat taaggccagg aaaaggaaaa 1140 gggaatggac agccacaaag gtgttacaac tgtggaaaac cgggacatca agcaaggcaa 1200 tgtagacaag gaatcatatg tcacaactgt ggaaagagag gacatatgca aaaagaatgc 1260 agaggaaaga gagacataag gggaaaacag cagggaaacg ggaggagggg gatacgtgtg 1320 gtgccgtccg ctcctcctat ggaataa 1347 <210> 42 <211> 524 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 42 taaagatata ttagaagtat tggccatgat gcctggaaat agagctcaaa aagagttaat 60 tcaagggaaa ttgaatgaag aagcagaaag gtggagaagg aataatccac cacctcaagc 120 aggcggagga ttaacagtgg atcaaattat gggggtagga caaacaaatc aagcagcggc 180 acaggctaac atggatcagg caagacaaat atgcctgcaa tgggtaataa cagcactaag 240 agcagtgaga catatggctc 
acaaaccagg gaatccgatg ctagtaaagc aaaaaacaaa 300 tgagtcatat gaagattttg ccgcaagact gctagaagca atagatgcag aaccagttac 360 aaagcaaata aaagaatatt taaagttaac attatcttac acaaatgcat cctcagactg 420 taagaaacag atggatagag tactaggaca gagagtgcaa caagctagtg tggaagaaaa 480 aatgcaagca tgcagagatg tgggatcaga aggattcaga atgc 524 <210> 43 <211> 720 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 43 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 <210> 44 <211> 593 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 44 tgccgtagac tctgtgatgt tccaccagct gcatacagta gcaatgccgc atggcctcgt 60 gtctgaggac tttgaaaggc agttggcata ttatgctact acctggacaa gtaaagatat 120 actggaagta ttggccatga tgcctgggaa tagagctcaa aaagaattaa ttcaaggaaa 180 attaaatgaa gaagcagaaa ggtggagaag gaataatcca ccacctcaag caggcggagg 240 attaacagtg gatcaaatta tgggggtagg acaaacaaat caagcagctg cacaagctaa 300 catggatcag gcaagacaaa tatgcctgca atgggtaata tcagccttaa gagcagtgag 360 acatatgtct cataaaccag ggaatccgct gctagtaaag caaaaaacaa atgagtcata 420 tgaagatttt gcagctagac tgctagaagc aatagatcca gccccagtag cacatcctat 480 aaaagattat ttaaagttaa cactatctta tacgaatgca tcatcagatt gtcaaaagca 540 aatgggtaga atgctaggat cgagagtcca tcaagccagt gtgggccaaa aaa 
593 <210> 45 <211> 1347 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 45 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 780 aggcaaatat gcctgcaatg ggtaataaat gcattaagag cagtaagaca tatggcgcac 840 aggccaggga atccaatgct agtaaagcaa aaaacgaatg agccatatga agattttgca 900 gcaagactgc tagaagcaat agatgcagag ccagttacac agcctataaa agattatcta 960 aagctaacac tatcttatac aaatgcatca gcagattgtc agaagcaaat ggatagaaca 1020 ctaggacaaa gagtacaaca agctagtgta gaagaaaaaa tgcaagcatg tagagatgtg 1080 ggatcagaag ggttcaaaat gcaattgtta gcacaagcat taaggccagg aaaaggaaaa 1140 gggaatggac agccacaaag gtgttacaac tgtggaaaac cgggacatca agcaaggcaa 1200 tgtagacaag gaatcatatg tcacaactgt ggaaagagag gacatatgca aaaagaatgc 1260 agaggaaaga gagacataag gggaaaacag cagggaaacg ggaggagggg gatacgtgtg 1320 gtgccgtccg ctcctcctat ggaataa 1347 <210> 46 <211> 593 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 46 tgccgtagac tctgtgatgt tccaccagct gcatacagta gcaatgccgc atggcctcgt 60 gtctgaggac tttgaaaggc agttggcata ttatgctact acctggacaa gtaaagatat 120 actggaagta ttggccatga tgcctgggaa tagagctcaa aaagaattaa ttcaaggaaa 180 attaaatgaa gaagcagaaa ggtggagaag gaataatcca 
ccacctcaag caggcggagg 240 attaacagtg gatcaaatta tgggggtagg acaaacaaat caagcagctg cacaagctaa 300 catggatcag gcaagacaaa tatgcctgca atgggtaata tcagccttaa gagcagtgag 360 acatatgtct cataaaccag ggaatccgct gctagtaaag caaaaaacaa atgagtcata 420 tgaagatttt gcagctagac tgctagaagc aatagatcca gccccagtag cacatcctat 480 aaaagattat ttaaagttaa cactatctta tacgaatgca tcatcagatt gtcaaaagca 540 aatgggtaga atgctaggat cgagagtcca tcaagccagt gtgggccaaa aaa 593 <210> 47 <211> 720 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 47 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 <210> 48 <211> 593 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 48 agcagtagat tctgtaatgt tccagcaact gcaaacagta gcaatgcagc atggactcgt 60 gtatgaagac tttgaaaggc tgtcggcata ttatgctact acctggacaa gtaaagatat 120 actggaagta ttggccatga tgcctgggaa tagagctcaa aaagaattaa ttcaaggaaa 180 attaaatgaa gaagcagaaa ggtggagaag gaataatcca ccacctcaag caggcggagg 240 attaacagtg gatcaaatta tgggggtagg acaaacaaat caagcagctg cacaagctaa 300 catggatcag gcaagacaaa tatgcctgca atgggtaata tcagccttaa gagcagtgag 360 acatatgtct cataaaccag ggaatccgct gctagtaaag caaaaaacaa atgagtcata 420 tgaagatttt gcagcaagac tgctagaagc aatagatgca gagccagtag cacatcctat 480 
aaaagaatac ttaaagttaa cactatctta tacgaatgca tcatcagatt gtcaaaagca 540 aatggataga atgctggaat caagagtaca acaagctagt gtagaacaaa aaa 593 <210> 49 <211> 593 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 49 agccgtagat tctgtaatgt tccagcagct gcaaacagta gcaatgcagc atggcctcgt 60 gtcagaggac tttgaaaggc ttccagcata tcatgctact acctgggcaa gtaaagatat 120 cttagaagta ctggccatga tgcctggaaa tagagctcaa aaagagttaa ttcaagggaa 180 attaaatgaa gaagcagaga ggtggagaag gaataatcca ccacctccag caggaggagg 240 gttaacagtg gatcaaatta tgggagtagg acaaacaaat caggcagcgg cacaagcaaa 300 catggatcag gcaagacaaa tatgcctaca atgggtgata tcagcactaa gagcagtaag 360 gcatatggct cacaagccag ggaatccaat gttagtaaag caaaaagcaa atgagccata 420 tgaagaattt gcagcaaggc tgctggaagc aatagatgcc gagccagtta atcagcccat 480 aaaagaatat ctaaaactaa cgttgtctta tacgaatgca tcctcagatt gtcagaagca 540 aatggataga acactaggac aaagagtcaa acaagctagt gtagaacaaa aaa 593 <210> 50 <211> 1347 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 50 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 780 aggcaaatat gcctgcaatg ggtaataaat gcattaagag cagtaagaca tatggcgcac 840 aggccaggga atccaatgct agtaaagcaa 
aaaacgaatg agccatatga agattttgca 900 gcaagactgc tagaagcaat agatgcagag ccagttacac agcctataaa agattatcta 960 aagctaacac tatcttatac aaatgcatca gcagattgtc agaagcaaat ggatagaaca 1020 ctaggacaaa gagtacaaca agctagtgta gaagaaaaaa tgcaagcatg tagagatgtg 1080 ggatcagaag ggttcaaaat gcaattgtta gcacaagcat taaggccagg aaaaggaaaa 1140 gggaatggac agccacaaag gtgttacaac tgtggaaaac cgggacatca agcaaggcaa 1200 tgtagacaag gaatcatatg tcacaactgt ggaaagagag gacatatgca aaaagaatgc 1260 agaggaaaga gagacataag gggaaaacag cagggaaacg ggaggagggg gatacgtgtg 1320 gtgccgtccg ctcctcctat ggaataa 1347 <210> 51 <211> 593 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 51 agcagtagat tctgtaatgt tccagcaact gcaaacagta gcaatgcagc atggactcgt 60 gtatgaagac tttgaaaggc tgtcggcata ttatgctact acctggacaa gtaaagatat 120 actggaagta ttggccatga tgcctgggaa tagagctcaa aaagaattaa ttcaaggaaa 180 attaaatgaa gaagcagaaa ggtggagaag gaataatcca ccacctcaag caggcggagg 240 attaacagtg gatcaaatta tgggggtagg acaaacaaat caagcagctg cacaagctaa 300 catggatcag gcaagacaaa tatgcctgca atgggtaata tcagccttaa gagcagtgag 360 acatatgtct cataaaccag ggaatccgct gctagtaaag caaaaaacaa atgagtcata 420 tgaagatttt gcagcaagac tgctagaagc aatagatgca gagccagtag cacatcctat 480 aaaagaatat ctaaaactaa cgttgtctta tacgaatgca tcctcagatt gtcagaagca 540 aatggataga acactaggac aaagagtcaa acaagctagt gtagaacaaa aaa 593 <210> 52 <211> 593 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 52 agccgtagat tctgtaatgt tccagcagct gcaaacagta gcaatgcagc atggcctcgt 60 gtcagaggac tttgaaaggc ttccagcata tcatgctact acctgggcaa gtaaagatat 120 cttagaagta ctggccatga tgcctggaaa tagagctcaa aaagagttaa ttcaagggaa 180 attaaatgaa gaagcagaga ggtggagaag gaataatcca ccacctccag caggaggagg 240 gttaacagtg gatcaaatta tgggagtagg acaaacaaat caggcagcgg cacaagcaaa 300 catggatcag gcaagacaaa tatgcctaca atgggtgata tcagcactaa gagcagtaag 360 gcatatggct cacaagccag ggaatccaat gttagtaaag caaaaagcaa atgagccata 420 tgaagaattt gcagcaaggc tgctggaagc aatagatgcc gagccagtta atcagcccat 480 aaaagaatat 
ctaaaactaa cgttgtctta tacgaatgca tcctcagatt gtcagaagca 540 aatggataga acactaggac aaagagtcaa acaagctagt gtagaacaaa aaa 593 <210> 53 <211> 720 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 53 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt attacatgtt tgaggaccat aaagaggaac 180 cttggacaaa aaaacattag agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 <210> 54 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 54 cagcatggcc tcgtgtccga ggactttgaa aggcagttgg catattatgc tactacctgg 60 acaagtaagg acatattaga agtattggcc atgatgccag gaaatagagc tcaaaaggag 120 ctaattcaa 129 <210> 55 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 55 cagcatggaa tagtatcaga agagtttgag aggcaactgt cttattatgc taccacttgg 60 acaagcaagg atatcttaga ggtactagcc atgatgcctg gcaatagagc attaaaagag 120 ctaatacaa 129 <210> 56 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 56 caacatggga tagtatcaga ggaatttgag agacaaatgt cttattatgc taccacatgg 60 acaagtaagg atattttaga agtactagca atgatgcccg ggaacagagc attaaaggag 120 ctgatacaa 129 <210> 57 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 57 cagcatggac tagtatcaga agaatttgaa aggcagctag catactatgc cacaacgtgg 60 acaagcaaag acatactaga ggtgttagcc atgatgccag ggaatagagc acaaaaagaa 120 ctaatacaa 129 <210> 58 <211> 129 
<212> DNA <213> Caprine arthritis-encephalitis virus <400> 58 cagcatggac ttgtgtccga agattttgag aggcaattgg catattatgc tacaacctgg 60 actagtgaag atatattaga agtattggct atgatgcctg ggaatagagc acagaaagaa 120 ttaatacaa 129 <210> 59 <211> 1347 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 59 atggtgagtc tagatagaga catggcgagg caagtctccg gggggaaaag agattatcct 60 gagctcgaaa aatgtatcaa gcatgcatgc aagataaaag ttcgactcag aggggagcac 120 ttgacagaag gaaattgttt atggtgcctt aaaacattag attacatgtt tgaggaccat 180 aaagaggaac cttggacaaa agtaaaattt aggacaatat ggcagaaggt gaagaatcta 240 actcctgagg agagtaacaa aaaagacttt atgtctttgc aggccacatt agcgggtcta 300 atgtgttgcc aaatggggat gagacctgag acattgcaag atgcaatggc tacagtaatc 360 atgaaagatg ggttactgga acaagaggaa aagaaggaag acaaaagaga aaaggaagag 420 agtgtcttcc caatagtagt gcaagcagca ggagggagaa gctggaaagc agtagattct 480 gtaatgttcc agcaactgca aacagtagca atgcagcatg gcctcgtgtc tgaggacttt 540 gaaaggcagt tggcatatta tgctactacc tggacaagta aagacatact agaagtattg 600 gccatgatgc ctggaaatag agctcaaaag gagttaattc aagggaaatt aaatgaagaa 660 gcagaaaggt ggagaaggaa taatccacca cctccagcag gaggaggatt aacagtggat 720 caaattatgg gggtaggaca aacaaatcaa gcagcagcac aagctaacat ggatcaggca 780 aggcaaatat gcctgcaatg ggtaataaat gcattaagag cagtaagaca tatggcgcac 840 aggccaggga atccaatgct agtaaagcaa aaaacgaatg agccatatga agattttgca 900 gcaagactgc tagaagcaat agatgcagag ccagttacac agcctataaa agattatcta 960 aagctaacac tatcttatac aaatgcatca gcagattgtc agaagcaaat ggatagaaca 1020 ctaggacaaa gagtacaaca agctagtgta gaagaaaaaa tgcaagcatg tagagatgtg 1080 ggatcagaag ggttcaaaat gcaattgtta gcacaagcat taaggccagg aaaaggaaaa 1140 gggaatggac agccacaaag gtgttacaac tgtggaaaac cgggacatca agcaaggcaa 1200 tgtagacaag gaatcatatg tcacaactgt ggaaagagag gacatatgca aaaagaatgc 1260 agaggaaaga gagacataag gggaaaacag cagggaaacg ggaggagggg gatacgtgtg 1320 gtgccgtccg ctcctcctat ggaataa 1347 <210> 60 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 60 cagcatggcc tcgtgtccga ggactttgaa 
aggcagttgg catattatgc tactacctgg 60 acaagtaagg acatattaga agtattggcc atgatgccag gaaatagagc tcaaaaggag 120 ctaattcaa 129 <210> 61 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 61 cagcatggaa tagtatcaga agagtttgag aggcaactgt cttattatgc taccacttgg 60 acaagcaagg atatcttaga ggtactagcc atgatgcctg gcaatagagc attaaaagag 120 ctaatacaa 129 <210> 62 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 62 caacatggga tagtatcaga ggaatttgag agacaaatgt cttattatgc taccacatgg 60 acaagtaagg atattttaga agtactagca atgatgcccg ggaacagagc attaaaggag 120 ctgatacaa 129 <210> 63 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 63 cagcatggac tagtatcaga agaatttgaa aggcagctag catactatgc cacaacgtgg 60 acaagcaaag acatactaga ggtgttagcc atgatgccag ggaatagagc acaaaaagaa 120 ctaatacaa 129 <210> 64 <211> 129 <212> DNA <213> Caprine arthritis-encephalitis virus <400> 64 cagcatggac ttgtgtccga agattttgag aggcaattgg catattatgc tacaacctgg 60 actagtgaag atatattaga agtattggct atgatgcctg ggaatagagc acagaaagaa 120 ttaatacaa 129 <210> 65 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> Primer used to construct dig-labeled probe <400> 65 ctggcgtaat agcgaagagg 20 <210> 66 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> Primer used to construct dig-labeled probe <400> 66 aactcgccgc acatctgaac 20 <210> 67 <211> 3911 <212> DNA <213> Artificial <220> <223> pCAH / SINd0 <400> 67 atccagcaca gtggcggccg ctagcacaaa aataaaaaaa gaaagggtgg actgtgagac 60 atgggctaaa gaggagcggc cgctcgagtc tagaactagt ggatcagctt tgctgcttgc 120 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 180 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 240 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagcc gacctcgagg 300 gggggcccta ttctatagtg tcacctaaat gctagagctc gctgatcagc ctcgactgtg 360 ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 420 ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 480 aggtgtcatt 
ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 540 gacaatagca ggcatgctgg ggatgcggtg ggctctatgg cttctgaggc ggaaagaacc 600 agtggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc 660 aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag 720 gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 780 gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt 840 tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct 900 ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg 960 ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct 1020 tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat 1080 tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg 1140 ctacactaga aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa 1200 aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt 1260 ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc 1320 tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt 1380 atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta 1440 aagtatatat gagtaacctg atcaggactc ttccttttca tgaacaataa aactgtctgc 1500 ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa cgtcttgctc 1560 taggccgcga ttaaattcca acatggatgc tgatttatat gggtataaat gggctcgcga 1620 taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg atgcgccaga 1680 gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg agatggtcag 1740 actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta tccgtactcc 1800 tgatgatgca tggttactca ccactgcgat ccccgggaaa acagcattcc aggtattaga 1860 agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc tgcgccggtt 1920 gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc gtctcgctca 1980 ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg acgagcgtaa 2040 tggctggcct gttgaacaag tctggaaaga aatgcataaa cttttgccat tctcaccgga 2100 ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg aggggaaatt 2160 aataggttgt attgatgttg 
gacgagtcgg aatcgcagac cgataccagg atcttgccat 2220 cctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt ttcaaaaata 2280 tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg atgagttttt 2340 ctaagaattc gcgcaattaa ccctcactaa agggaacaaa agctgggtac cgggcccgtt 2400 gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 2460 catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 2520 acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 2580 ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 2640 aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct 2700 ggcattatgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat 2760 tagtcatcgc tattaccatg gtgatgcggt tttggcagta catcaatggg cgtggatagc 2820 ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg agtttgtttt 2880 ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa 2940 tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcaagct tgctgcttgc 3000 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 3060 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 3120 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagct ggcgcccaac 3180 gtggggcccg aggagaagaa aagaaagcgg ccctgagaac tcggcttctg aaaaagagga 3240 agaggacaag ttgctatagc aacaagagag aagaagtaga gcaaaggtcc agtggctcgg 3300 aaaaagagga actgaaactt cggggacgcc tgaaggagta aggtaagtga ctctgctgta 3360 cgcggggcga ggcagaggtt tccttctaaa ttgaaagaga agtgttgctg cgagaggtct 3420 tggtggtcga gaatcctgta caaaaaaaag gagggatctc ggtcaggacc aggacccctg 3480 ggagtaatac aacagcaaca ccgtaagaaa atccgcctag ggaattcgat tctagaggtg 3540 atagaaatgc cagaaaacta tgcaaaaaca agaatcataa acaggaaaaa aagagaactc 3600 agccacaaga ggaagaagag aggcgttggc ttggtcatta tgctagttat catggcaata 3660 gtagctgccg caggggcttc tctgggagtc gcaaacgcga ttcagcagtc ttacactaag 3720 gcagctgtcc agacccttgc taatgcaact gctgcacagc aggatgtgtt agaggcaacc 3780 tatgccatgg tacagcatgt ggctaaaggc gtacgaatct tggaagctcg agtggctcga 3840 gtggaagcta tcacagatag aataatgcta 
taccaagaat tggattgttg gcactaggat 3900 ccatcgccac c 3911 <210> 68 <211> 4238 <212> DNA <213> Artificial <220> <223> pCAH / SINd1 <400> 68 atccagcaca gtggcggccg ctagcacaaa aataaaaaaa gaaagggtgg actgtgagac 60 atgggctaaa gaggagcggc cgctcgagtc tagaactagt ggatcagctt tgctgcttgc 120 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 180 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 240 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagcc gacctcgagg 300 gggggcccta ttctatagtg tcacctaaat gctagagctc gctgatcagc ctcgactgtg 360 ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 420 ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 480 aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 540 gacaatagca ggcatgctgg ggatgcggtg ggctctatgg cttctgaggc ggaaagaacc 600 agtggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc 660 aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag 720 gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 780 gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt 840 tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct 900 ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg 960 ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct 1020 tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat 1080 tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg 1140 ctacactaga aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa 1200 aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt 1260 ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc 1320 tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt 1380 atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta 1440 aagtatatat gagtaacctg atcaggactc ttccttttca tgaacaataa aactgtctgc 1500 ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa cgtcttgctc 1560 taggccgcga ttaaattcca acatggatgc 
tgatttatat gggtataaat gggctcgcga 1620 taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg atgcgccaga 1680 gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg agatggtcag 1740 actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta tccgtactcc 1800 tgatgatgca tggttactca ccactgcgat ccccgggaaa acagcattcc aggtattaga 1860 agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc tgcgccggtt 1920 gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc gtctcgctca 1980 ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg acgagcgtaa 2040 tggctggcct gttgaacaag tctggaaaga aatgcataaa cttttgccat tctcaccgga 2100 ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg aggggaaatt 2160 aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg atcttgccat 2220 cctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt ttcaaaaata 2280 tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg atgagttttt 2340 ctaagaattc gcgcaattaa ccctcactaa agggaacaaa agctgggtac cgggcccgtt 2400 gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 2460 catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 2520 acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 2580 ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 2640 aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct 2700 ggcattatgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat 2760 tagtcatcgc tattaccatg gtgatgcggt tttggcagta catcaatggg cgtggatagc 2820 ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg agtttgtttt 2880 ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa 2940 tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcaagct tgctgcttgc 3000 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 3060 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 3120 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagct ggcgcccaac 3180 gtggggcccg aggagaagaa aagaaagcgg ccctgagaac tcggcttctg aaaaagagga 3240 agaggacaag ttgctatagc aacaagagag aagaagtaga 
gcaaaggtcc agtggctcgg 3300 aaaaagagga actgaaactt cggggacgcc tgaaggagta aggtaagtga ctctgctgta 3360 cgcggggcga ggcagaggtt tccttctaaa ttgaaagaga agtgttgctg cgagaggtct 3420 tggtggtcga gaatcctgta caaaaaaaag gagggatctc ggtcaggacc aggacccctg 3480 ggagtaatac aacagcaaca ccgtaagaaa atccgcctag gtgagtctag atagagacta 3540 ggcgaggcaa gtctccgggg ggaaaagaga ttatcctgag ctcgaaaaat gtatcaagca 3600 tgcatgcaag ataaaagttc gactcagagg ggagcacttg acagaaggaa attgtttatg 3660 gtgccttaaa acattagatt acatgtttga ggaccataaa gaggaacctt ggacaaaagt 3720 aaaatttagg acaatatggc agaaggtgaa gaatctaact cctgaggaga gtaacaaaaa 3780 agactttatg tctttgcagg ccacattagc gggtctaatg tgttgccaaa tggggatgag 3840 acctgcagga attcgattct agaggtgata gaaatgccag aaaactatgc aaaaacaaga 3900 atcataaaca ggaaaaaaag agaactcagc cacaagagga agaagagagg cgttggcttg 3960 gtcattatgc tagttatcat ggcaatagta gctgccgcag gggcttctct gggagtcgca 4020 aacgcgattc agcagtctta cactaaggca gctgtccaga cccttgctaa tgcaactgct 4080 gcacagcagg atgtgttaga ggcaacctat gccatggtac agcatgtggc taaaggcgta 4140 cgaatcttgg aagctcgagt ggctcgagtg gaagctatca cagatagaat aatgctatac 4200 caagaattgg attgttggca ctaggatcca tcgccacc 4238 <210> 69 <211> 4523 <212> DNA <213> Artificial <220> <223> pCAH / SINd2 <400> 69 atccagcaca gtggcggccg ctagcacaaa aataaaaaaa gaaagggtgg actgtgagac 60 atgggctaaa gaggagcggc cgctcgagtc tagaactagt ggatcagctt tgctgcttgc 120 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 180 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 240 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagcc gacctcgagg 300 gggggcccta ttctatagtg tcacctaaat gctagagctc gctgatcagc ctcgactgtg 360 ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 420 ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 480 aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 540 gacaatagca ggcatgctgg ggatgcggtg ggctctatgg cttctgaggc ggaaagaacc 600 agtggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc 660 aaaaggccag 
caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag 720 gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 780 gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt 840 tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct 900 ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg 960 ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct 1020 tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat 1080 tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg 1140 ctacactaga aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa 1200 aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt 1260 ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc 1320 tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt 1380 atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta 1440 aagtatatat gagtaacctg atcaggactc ttccttttca tgaacaataa aactgtctgc 1500 ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa cgtcttgctc 1560 taggccgcga ttaaattcca acatggatgc tgatttatat gggtataaat gggctcgcga 1620 taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg atgcgccaga 1680 gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg agatggtcag 1740 actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta tccgtactcc 1800 tgatgatgca tggttactca ccactgcgat ccccgggaaa acagcattcc aggtattaga 1860 agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc tgcgccggtt 1920 gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc gtctcgctca 1980 ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg acgagcgtaa 2040 tggctggcct gttgaacaag tctggaaaga aatgcataaa cttttgccat tctcaccgga 2100 ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg aggggaaatt 2160 aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg atcttgccat 2220 cctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt ttcaaaaata 2280 tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg atgagttttt 2340 ctaagaattc gcgcaattaa 
ccctcactaa agggaacaaa agctgggtac cgggcccgtt 2400 gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 2460 catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 2520 acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 2580 ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 2640 aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct 2700 ggcattatgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat 2760 tagtcatcgc tattaccatg gtgatgcggt tttggcagta catcaatggg cgtggatagc 2820 ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg agtttgtttt 2880 ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa 2940 tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcaagct tgctgcttgc 3000 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 3060 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 3120 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagct ggcgcccaac 3180 gtggggcccg aggagaagaa aagaaagcgg ccctgagaac tcggcttctg aaaaagagga 3240 agaggacaag ttgctatagc aacaagagag aagaagtaga gcaaaggtcc agtggctcgg 3300 aaaaagagga actgaaactt cggggacgcc tgaaggagta aggtaagtga ctctgctgta 3360 cgcggggcga ggcagaggtt tccttctaaa ttgaaagaga agtgttgctg cgagaggtct 3420 tggtggtcga gaatcctgta caaaaaaaag gagggatctc ggtcaggacc aggacccctg 3480 ggagtaatac aacagcaaca ccgtaagaaa atccgcctag gtgagtctag atagagacta 3540 ggcgaggcaa gtctccgggg ggaaaagaga ttatcctgag ctcgaaaaat gtatcaagca 3600 tgcatgcaag ataaaagttc gactcagagg ggagcacttg acagaaggaa attgtttatg 3660 gtgccttaaa acattagatt acatgtttga ggaccataaa gaggaacctt ggacaaaagt 3720 aaaatttagg acaatatggc agaaggtgaa gaatctaact cctgaggaga gtaacaaaaa 3780 agactttatg tctttgcagg ccacattagc gggtctaatg tgttgccaaa tggggatgag 3840 acctgagaca ttgcaagatg caatggctac agtaatcatg aaagatgggt tactggaaca 3900 agaggaaaag aaggaagaca aaagagaaaa ggaagagagt gtcttcccaa tagtagtgca 3960 agcagcagga gggagaagct ggaaagcagt agattctgta atgttccagc aactgcaaac 4020 agtagcaatg cagcatggcc tcgtgtctga 
ggactttgaa aggcagttgg catattatgc 4080 tactacctgg acaagtaaag acatactaga agtattggcc atgatgcctg caggaattcg 4140 attctagagg tgatagaaat gccagaaaac tatgcaaaaa caagaatcat aaacaggaaa 4200 aaaagagaac tcagccacaa gaggaagaag agaggcgttg gcttggtcat tatgctagtt 4260 atcatggcaa tagtagctgc cgcaggggct tctctgggag tcgcaaacgc gattcagcag 4320 tcttacacta aggcagctgt ccagaccctt gctaatgcaa ctgctgcaca gcaggatgtg 4380 ttagaggcaa cctatgccat ggtacagcat gtggctaaag gcgtacgaat cttggaagct 4440 cgagtggctc gagtggaagc tatcacagat agaataatgc tataccaaga attggattgt 4500 tggcactagg atccatcgcc acc 4523 <210> 70 <211> 4819 <212> DNA <213> Artificial <220> <223> pCAH / SINd3 <400> 70 atccagcaca gtggcggccg ctagcacaaa aataaaaaaa gaaagggtgg actgtgagac 60 atgggctaaa gaggagcggc cgctcgagtc tagaactagt ggatcagctt tgctgcttgc 120 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 180 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 240 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagcc gacctcgagg 300 gggggcccta ttctatagtg tcacctaaat gctagagctc gctgatcagc ctcgactgtg 360 ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 420 ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 480 aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 540 gacaatagca ggcatgctgg ggatgcggtg ggctctatgg cttctgaggc ggaaagaacc 600 agtggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc 660 aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag 720 gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 780 gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt 840 tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct 900 ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg 960 ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct 1020 tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat 1080 tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg 1140 ctacactaga 
aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa 1200 aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt 1260 ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc 1320 tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt 1380 atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta 1440 aagtatatat gagtaacctg atcaggactc ttccttttca tgaacaataa aactgtctgc 1500 ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa cgtcttgctc 1560 taggccgcga ttaaattcca acatggatgc tgatttatat gggtataaat gggctcgcga 1620 taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg atgcgccaga 1680 gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg agatggtcag 1740 actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta tccgtactcc 1800 tgatgatgca tggttactca ccactgcgat ccccgggaaa acagcattcc aggtattaga 1860 agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc tgcgccggtt 1920 gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc gtctcgctca 1980 ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg acgagcgtaa 2040 tggctggcct gttgaacaag tctggaaaga aatgcataaa cttttgccat tctcaccgga 2100 ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg aggggaaatt 2160 aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg atcttgccat 2220 cctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt ttcaaaaata 2280 tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg atgagttttt 2340 ctaagaattc gcgcaattaa ccctcactaa agggaacaaa agctgggtac cgggcccgtt 2400 gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 2460 catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 2520 acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 2580 ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 2640 aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct 2700 ggcattatgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat 2760 tagtcatcgc tattaccatg gtgatgcggt tttggcagta catcaatggg cgtggatagc 2820 ggtttgactc acggggattt 
ccaagtctcc accccattga cgtcaatggg agtttgtttt 2880 ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa 2940 tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcaagct tgctgcttgc 3000 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 3060 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 3120 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagct ggcgcccaac 3180 gtggggcccg aggagaagaa aagaaagcgg ccctgagaac tcggcttctg aaaaagagga 3240 agaggacaag ttgctatagc aacaagagag aagaagtaga gcaaaggtcc agtggctcgg 3300 aaaaagagga actgaaactt cggggacgcc tgaaggagta aggtaagtga ctctgctgta 3360 cgcggggcga ggcagaggtt tccttctaaa ttgaaagaga agtgttgctg cgagaggtct 3420 tggtggtcga gaatcctgta caaaaaaaag gagggatctc ggtcaggacc aggacccctg 3480 ggagtaatac aacagcaaca ccgtaagaaa atccgcctag gtgagtctag atagagacta 3540 ggcgaggcaa gtctccgggg ggaaaagaga ttatcctgag ctcgaaaaat gtatcaagca 3600 tgcatgcaag ataaaagttc gactcagagg ggagcacttg acagaaggaa attgtttatg 3660 gtgccttaaa acattagatt acatgtttga ggaccataaa gaggaacctt ggacaaaagt 3720 aaaatttagg acaatatggc agaaggtgaa gaatctaact cctgaggaga gtaacaaaaa 3780 agactttatg tctttgcagg ccacattagc gggtctaatg tgttgccaaa tggggatgag 3840 acctgagaca ttgcaagatg caatggctac agtaatcatg aaagatgggt tactggaaca 3900 agaggaaaag aaggaagaca aaagagaaaa ggaagagagt gtcttcccaa tagtagtgca 3960 agcagcagga gggagaagct ggaaagcagt agattctgta atgttccagc aactgcaaac 4020 agtagcaatg cagcatggcc tcgtgtctga ggactttgaa aggcagttgg catattatgc 4080 tactacctgg acaagtaaag acatactaga agtattggcc atgatgcctg gaaatagagc 4140 tcaaaaggag ttaattcaag ggaaattaaa tgaagaagca gaaaggtgga gaaggaataa 4200 tccaccacct ccagcaggag gaggattaac agtggatcaa attatggggg taggacaaac 4260 aaatcaagca gcagcacaag ctaacatgga tcaggcaagg caaatatgcc tgcaatgggt 4320 aataaatgca ttaagagcag taagacatat ggcgcacagg ccagggaatc caatgctagt 4380 aaagcaaaaa acgaatgagc catatgaaga ttttgcagca agactgcagg aattcgattc 4440 tagaggtgat agaaatgcca gaaaactatg caaaaacaag aatcataaac aggaaaaaaa 4500 gagaactcag ccacaagagg aagaagagag 
gcgttggctt ggtcattatg ctagttatca 4560 tggcaatagt agctgccgca ggggcttctc tgggagtcgc aaacgcgatt cagcagtctt 4620 acactaaggc agctgtccag acccttgcta atgcaactgc tgcacagcag gatgtgttag 4680 aggcaaccta tgccatggta cagcatgtgg ctaaaggcgt acgaatcttg gaagctcgag 4740 tggctcgagt ggaagctatc acagatagaa taatgctata ccaagaattg gattgttggc 4800 actaggatcc atcgccacc 4819 <210> 71 <211> 5112 <212> DNA <213> Artificial <220> <223> pCAH / SINd4 <400> 71 atccagcaca gtggcggccg ctagcacaaa aataaaaaaa gaaagggtgg actgtgagac 60 atgggctaaa gaggagcggc cgctcgagtc tagaactagt ggatcagctt tgctgcttgc 120 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 180 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 240 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagcc gacctcgagg 300 gggggcccta ttctatagtg tcacctaaat gctagagctc gctgatcagc ctcgactgtg 360 ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 420 ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 480 aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 540 gacaatagca ggcatgctgg ggatgcggtg ggctctatgg cttctgaggc ggaaagaacc 600 agtggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc 660 aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag 720 gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 780 gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt 840 tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct 900 ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg 960 ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct 1020 tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat 1080 tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg 1140 ctacactaga aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa 1200 aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt 1260 ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc 1320 tacggggtct gacgctcagt 
ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt 1380 atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta 1440 aagtatatat gagtaacctg atcaggactc ttccttttca tgaacaataa aactgtctgc 1500 ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa cgtcttgctc 1560 taggccgcga ttaaattcca acatggatgc tgatttatat gggtataaat gggctcgcga 1620 taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg atgcgccaga 1680 gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg agatggtcag 1740 actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta tccgtactcc 1800 tgatgatgca tggttactca ccactgcgat ccccgggaaa acagcattcc aggtattaga 1860 agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc tgcgccggtt 1920 gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc gtctcgctca 1980 ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg acgagcgtaa 2040 tggctggcct gttgaacaag tctggaaaga aatgcataaa cttttgccat tctcaccgga 2100 ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg aggggaaatt 2160 aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg atcttgccat 2220 cctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt ttcaaaaata 2280 tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg atgagttttt 2340 ctaagaattc gcgcaattaa ccctcactaa agggaacaaa agctgggtac cgggcccgtt 2400 gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 2460 catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 2520 acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 2580 ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 2640 aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct 2700 ggcattatgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat 2760 tagtcatcgc tattaccatg gtgatgcggt tttggcagta catcaatggg cgtggatagc 2820 ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg agtttgtttt 2880 ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa 2940 tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcaagct tgctgcttgc 3000 acttcagagt tctaggagag tccctcctag 
tctctcctct ccgaggaggt accgagacct 3060 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 3120 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagct ggcgcccaac 3180 gtggggcccg aggagaagaa aagaaagcgg ccctgagaac tcggcttctg aaaaagagga 3240 agaggacaag ttgctatagc aacaagagag aagaagtaga gcaaaggtcc agtggctcgg 3300 aaaaagagga actgaaactt cggggacgcc tgaaggagta aggtaagtga ctctgctgta 3360 cgcggggcga ggcagaggtt tccttctaaa ttgaaagaga agtgttgctg cgagaggtct 3420 tggtggtcga gaatcctgta caaaaaaaag gagggatctc ggtcaggacc aggacccctg 3480 ggagtaatac aacagcaaca ccgtaagaaa atccgcctag gtgagtctag atagagacta 3540 ggcgaggcaa gtctccgggg ggaaaagaga ttatcctgag ctcgaaaaat gtatcaagca 3600 tgcatgcaag ataaaagttc gactcagagg ggagcacttg acagaaggaa attgtttatg 3660 gtgccttaaa acattagatt acatgtttga ggaccataaa gaggaacctt ggacaaaagt 3720 aaaatttagg acaatatggc agaaggtgaa gaatctaact cctgaggaga gtaacaaaaa 3780 agactttatg tctttgcagg ccacattagc gggtctaatg tgttgccaaa tggggatgag 3840 acctgagaca ttgcaagatg caatggctac agtaatcatg aaagatgggt tactggaaca 3900 agaggaaaag aaggaagaca aaagagaaaa ggaagagagt gtcttcccaa tagtagtgca 3960 agcagcagga gggagaagct ggaaagcagt agattctgta atgttccagc aactgcaaac 4020 agtagcaatg cagcatggcc tcgtgtctga ggactttgaa aggcagttgg catattatgc 4080 tactacctgg acaagtaaag acatactaga agtattggcc atgatgcctg gaaatagagc 4140 tcaaaaggag ttaattcaag ggaaattaaa tgaagaagca gaaaggtgga gaaggaataa 4200 tccaccacct ccagcaggag gaggattaac agtggatcaa attatggggg taggacaaac 4260 aaatcaagca gcagcacaag ctaacatgga tcaggcaagg caaatatgcc tgcaatgggt 4320 aataaatgca ttaagagcag taagacatat ggcgcacagg ccagggaatc caatgctagt 4380 aaagcaaaaa acgaatgagc catatgaaga ttttgcagca agactgctag aagcaataga 4440 tgcagagcca gttacacagc ctataaaaga ttatctaaag ctaacactat cttatacaaa 4500 tgcatcagca gattgtcaga agcaaatgga tagaacacta ggacaaagag tacaacaagc 4560 tagtgtagaa gaaaaaatgc aagcatgtag agatgtggga tcagaagggt tcaaaatgca 4620 attgttagca caagcattaa ggccaggaaa aggaaaaggg aatggacagc cacaaaggtg 4680 ttacaactgt ggaaaaccgg gacatcaagc aaggcactgc 
aggaattcga ttctagaggt 4740 gatagaaatg ccagaaaact atgcaaaaac aagaatcata aacaggaaaa aaagagaact 4800 cagccacaag aggaagaaga gaggcgttgg cttggtcatt atgctagtta tcatggcaat 4860 agtagctgcc gcaggggctt ctctgggagt cgcaaacgcg attcagcagt cttacactaa 4920 ggcagctgtc cagacccttg ctaatgcaac tgctgcacag caggatgtgt tagaggcaac 4980 ctatgccatg gtacagcatg tggctaaagg cgtacgaatc ttggaagctc gagtggctcg 5040 agtggaagct atcacagata gaataatgct ataccaagaa ttggattgtt ggcactagga 5100 tccatcgcca cc 5112 <210> 72 <211> 7579 <212> DNA <213> Artificial <220> <223> pMYKEF1 / env <220> <221> misc_feature (994). (994) N is a, c, g, or t <400> 72 aacaggaaag ttccattgga gccaagtaca ttgagtcaat agggactttc caatgggttt 60 tgcccagtac ataaggtcaa tgggaggtaa gccaatgggt ttttcccatt actggcacgt 120 atactgagtc attagggact ttccaatggg ttttgcccag tacataaggt caataggggt 180 gaatcaacag gaaagtccca ttggagccaa gtacactgag tcaataggga ctttccattg 240 ggttttgccc agtacaaaag gtcaataggg ggtgagtcaa tgggtttttc ccattattgg 300 cacgtacata aggtcaatag gggtgagtca ttgggttttt ccagccaatt taattaaaac 360 gccatgtact ttcccaccat tgacgtcaat gggctattga aactaatgca acgtgacctt 420 taaacggtac tttcccatag ctgattaatg ggaaagtacc gttctcgagc caatacacgt 480 caatgggaag tgaaagggca gccaaaacgt aacaccgccc cggttttccc tggaaattcc 540 atattggcac gcattctatt ggctgagctg cgttcacgtg ggtataagag gcgcgaccag 600 cgtcggtacc gtcgcagtct tcggtctgac caccgtagaa cgcagagctc ctcgctgcag 660 gcatgcaagc ttggtaagtg ccgtgtgtgg ttcccgcggg cctggcctct ttacgggtta 720 tggcccttgc gtgccttgaa ttacttccac gcccctggct gcagtacgtg attcttgatc 780 ccgagcttcg ggttggaagt gggtgggaga gttcaaggcc ttgcgcttaa ggagcccctt 840 cgccttttgc ttgagttgag gcctggcctg ggcgctgggg ccgccgcgtg caaatctggt 900 ggcaccttcg cgcctgtctc gctgctttcg ataagtctct agccatttaa aatttttgat 960 gacctgctgc gacgcttttt ttctggcaag atantcttgt aaatgcgggc caagatctgc 1020 acactggtat ttcggttttt ggggccgcgg gcggctacgg ggcccgtgcg tcccagcgca 1080 catgttcggc gaggaggggc ctgcgagcgc ggccaccgag aatcggacgg gggtagtctc 1140 aagctggccg gcctgctctg gtgcctggcc tcgcgccgcc gtgtatcgcc 
ccgccctggg 1200 cggcaaggct ggcccggtcg gcaccagttg cgtgagcgga aagatggccg cttcccggcc 1260 ctgctgcagg gagctcaaaa tggaggacgc ggcgctcggg agagcgggcg ggtgagtcac 1320 ccacacaaag gaaaagggcc tttccgtcct cagccgtcgc ttcatgtgac tccacggagt 1380 accgggcgcc gtccaggcac ctcgattagt tctcgagctt ttggagtacg tcgtctttag 1440 gttgggggga ggggttttat gcgatggagt ttccccacac tgagtgggtg gagactgaag 1500 ttaggccagc ttggcacttg atgtaattct ccttggaatt tgcccttttt gagtttggat 1560 cttggttcat tctcaagcct cagacagtgg ttcaaagttt ttttcttcca tttcagggat 1620 ccactagtaa cggccgccag tgtgctggaa ttcgatcata cctggtgttg ctgactaccc 1680 cgaccgcggt aaaagtcgat ggtattgctg cctgggtcca tgcttctcac ctcaaacctg 1740 caccaccttc ggcaccagat gagtcctggg agctggaaaa gactgatcat cctcttaagc 1800 tgcgtattcg gcggcggcgg gacgagtctg caaaataaga acccccacca gcccatgacc 1860 ctcacttggc aggtactgtc ccaaactgga gacgttgtct gggatacaaa ggcagtccag 1920 cccccttgga cttggtggcc cacacttaaa cctgatgtat gtgccttggc ggctagtctt 1980 gagtcctggg atatcccggg aaccgatgtc tcgtcctcta aacgagtcag acctccggac 2040 tcagactata ctgccgctta taagcaaatc acctggggag ccatagggtg cagctaccct 2100 cgggctagga ctagaatggc aagctctacc ttctacgtat gtccccggga tggccggacc 2160 ctttcagaag ctagaaggtg cggggggcta gaatccctat actgtaaaga atgggattgt 2220 gagaccacgg ggaccggtta ttggctatct aaatcctcaa aagacctcat aactgtaaaa 2280 tgggaccaaa atagcgaatg gactcaaaaa tttcaacagt gtcaccagac cggctggtgt 2340 aaccccctta aaatagattt cacagacaaa ggaaaattat ccaaggactg gataacggga 2400 aaaacctggg gattaagatt ctatgtgtct ggacatccag gcgtacagtt caccattcgc 2460 ttaaaaatca ccaacatgcc agctgtggca gtaggtcctg acctcgtcct tgtggaacaa 2520 ggacctccta gaacgtccct cgctctccca cctcctcttc ccccaaggga agcgccaccg 2580 ccatctctcc ccgactctaa ctccacagcc ctggcgacta gtgcacaaa tcccacggtg 2640 agaaaaacaa ttgttaccct aaacactccg cctcccacca caggcgacag actttttgat 2700 cttgtgcagg gggccttcct aaccttaaat gctaccaacc caggggccac tgagtcttgc 2760 tggctttgtt tggccatggg ccccccttat tatgaagcaa tagcctcatc aggagaggtc 2820 gcctactcca ccgaccttga ccggtgccgc tgggggaccc aaggaaagct caccctcact 
2880 gaggtctcag gacacgggtt gtgcatagga aaggtgccct ttacccatca gcatctctgc 2940 aatcagaccc tatccatcaa ttcctccgga gaccatcagt atctgctccc ctccaaccat 3000 agctggtggg cttgcagcac tggcctcacc ccttgcctct ccacctcagt ttttaatcag 3060 actagagatt tctgtatcca ggtccagctg attcctcgca tctattacta tcctgaagaa 3120 gttttgttac aggcctatga caattctcac cccaggacta aaagagaggc tgtctcactt 3180 accctagctg ttttactggg gttgggaatc acggcgggaa taggtactgg ttcaactgcc 3240 ttaattaaag gacctataga cctccagcaa ggcctgacaa gcctccagat cgccatagat 3300 gctgacctcc gggccctcca agactcagtc agcaagttag aggactcact gacttccctg 3360 tccgaggtag tgctccaaaa taggagaggc cttgacttgc tgtttctaaa agaaggtggc 3420 ctctgtgcgg ccctaaagga agagtgctgt ttttacatag accactcagg tgcagtacgg 3480 gactccatga aaaaactcaa agaaaaactg gataaaagac agttagagcg ccagaaaagc 3540 caaaactggt atgaaggatg gttcaataac tccccttggt tcactaccct gctatcaacc 3600 atcgctgggc ccctattact cctccttctg ttgctcatcc tcgggccatg catcatcaat 3660 aagttagttc aattcatcaa tgataggata agtgcagtta aaattctggt ccttagacaa 3720 aaatatcagg ccctagagaa cgaaggtaac ctttaatttt gctctaagat tagagctatt 3780 cacaagagaa atggggatca ctagtgaatt ctgcagatat ccatcacact ggcggccgct 3840 cgagcatgca tctagagggc cctattctat agtgtcacct aaatgctaga gctcgctgat 3900 cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 3960 ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 4020 cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 4080 gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg 4140 aggcggaaag aaccagtggc ggtaatacgg ttatccacag aatcagggga taacgcagga 4200 aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg 4260 gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag 4320 aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc 4380 gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg 4440 ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt 4500 cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc 4560 
ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc 4620 actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg 4680 tggcctaact acggctacac tagaaggaca gtatttggta tctgcgctct gctgaagcca 4740 gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc 4800 ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat 4860 cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt 4920 ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt 4980 tttaaatcaa tctaaagtat atatgagtaa cctgaggcta tggcagggcc tgccgccccg 5040 acgttggctg cgagccctgg gccttcaccc gaacttgggg ggtggggtgg ggaaaaggaa 5100 gaaacgcggg cgtattggcc ccaatggggt ctcggtgggg tatcgacaga gtgccagccc 5160 tgggaccgaa ccccgcgttt atgaacaaac gacccaacac cgtgcgtttt attctgtctt 5220 tttattgccg tcatagcgcg ggttccttcc ggtattgtct ccttccgtgt ttcagttagc 5280 ctccccctag ggtgggcgaa gaactccagc atgagatccc cgcgctggag gatcatccag 5340 ccggcgtccc ggaaaacgat tccgaagccc aacctttcat agaaggcggc ggtggaatcg 5400 aaatctcgtg atggcaggtt gggcgtcgct tggtcggtca tttcgaaccc cagagtcccg 5460 ctcagaagaa ctcgtcaaga aggcgataga aggcgatgcg ctgcgaatcg ggagcggcga 5520 taccgtaaag cacgaggaag cggtcagccc attcgccgcc aagctcttca gcaatatcac 5580 gggtagccaa cgctatgtcc tgatagcggt ccgccacacc cagccggcca cagtcgatga 5640 atccagaaaa gcggccattt tccaccatga tattcggcaa gcaggcatcg ccatgggtca 5700 cgacgagatc ctcgccgtcg ggcatgctcg ccttgagcct ggcgaacagt tcggctggcg 5760 cgagcccctg atgctcttga tcatcctgat cgacaagacc ggcttccatc cgagtacgtg 5820 ctcgctcgat gcgatgtttc gcttggtggt cgaatgggca ggtagccgga tcaagcgtat 5880 gcagccgccg cattgcatca gccatgatgg atactttctc ggcaggagca aggtgagatg 5940 acaggagatc ctgccccggc acttcgccca atagcagcca gtcccttccc gcttcagtga 6000 caacgtcgag cacagctgcg caaggaacgc ccgtcgtggc cagccacgat agccgcgctg 6060 cctcgtcttg cagttcattc agggcaccgg acaggtcggt cttgacaaaa agaaccgggc 6120 gcccctgcgc tgacagccgg aacacggcgg catcagagca gccgattgtc tgttgtgccc 6180 agtcatagcc gaatagcctc tccacccaag cggccggaga acctgcgtgc aatccatctt 6240 gttcaatcat 
gcgaaacgat cctcatcctg tctcttgatc gatctttgca aaagcctagg 6300 cctccaaaaa agcctcctca ctacttctgg aatagctcag aggccgaggc ggcctcggcc 6360 tctgcataaa taaaaaaaat tagtcagcca tggggcggag aatgggcgga actgggcgga 6420 gttaggggcg ggatgggcgg agttaggggc gggactatgg ttgctgacta attgagatgc 6480 atgctttgca tacttctgcc tgctggggag cctggggact ttccacacct ggttgctgac 6540 taattgagat gcatgctttg catacttctg cctgctgggg agcctgggga ctttccacac 6600 cctaactgac acacattcca cagctggttc tttccgcctc aggactcttc ctttttcaat 6660 aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt 6720 gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc 6780 gtgtagataa ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg 6840 cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc 6900 gagcgcagaa gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg 6960 gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca 7020 ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga 7080 tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct 7140 ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg 7200 cataattctc ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca 7260 accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata 7320 cgggataata ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct 7380 tcggggcgaa aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact 7440 cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa 7500 acaggaaggc aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc 7560 atactcttcc tttttcaat 7579 <210> 73 <211> 3566 <212> DNA <213> Artificial <220> <223> pCAH / SINd <400> 73 atccagcaca gtggcggccg ctagcacaaa aataaaaaaa gaaagggtgg actgtgagac 60 atgggctaaa gaggagcggc cgctcgagtc tagaactagt ggatcagctt tgctgcttgc 120 acttcagagt tctaggagag tccctcctag tctctcctct ccgaggaggt accgagacct 180 caaaataaag gagtgattgc cttactgccg agtggagagt gattactgag cggccggtgt 240 atcgggagtc gtcccttaat ctgtgcaata ccagagcggc tctcgcagcc 
gacctcgagg 300 gggggcccta ttctatagtg tcacctaaat gctagagctc gctgatcagc ctcgactgtg 360 ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 420 ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 480 aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 540 gacaatagca ggcatgctgg ggatgcggtg ggctctatgg cttctgaggc ggaaagaacc 600 agtggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc 660 aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag 720 gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 780 gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt 840 tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct 900 ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg 960 ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct 1020 tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat 1080 tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg 1140 ctacactaga aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa 1200 aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt 1260 ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc 1320 tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt 1380 atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta 1440 aagtatatat gagtaacctg atcaggactc ttccttttca tgaacaataa aactgtctgc 1500 ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa cgtcttgctc 1560 taggccgcga ttaaattcca acatggatgc tgatttatat gggtataaat gggctcgcga 1620 taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg atgcgccaga 1680 gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg agatggtcag 1740 actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta tccgtactcc 1800 tgatgatgca tggttactca ccactgcgat ccccgggaaa acagcattcc aggtattaga 1860 agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc tgcgccggtt 1920 gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc gtctcgctca 1980 
ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg acgagcgtaa 2040 tggctggcct gttgaacaag tctggaaaga aatgcataaa cttttgccat tctcaccgga 2100 ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg aggggaaatt 2160 aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg atcttgccat 2220 cctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt ttcaaaaata 2280 tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg atgagttttt 2340 ctaagaattc gcgcaattaa ccctcactaa agggaacaaa agctgggtac cgggcccgtt 2400 gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 2460 catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 2520 acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 2580 ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 2640 aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct 2700 ggcattatgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat 2760 tagtcatcgc tattaccatg gtgatgcggt tttggcagta catcaatggg cgtggatagc 2820 ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg agtttgtttt 2880 ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa 2940 tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcaagct ttgctgcttg 3000 cacttcagag ttctaggaga gtccctccta gtctctcctc tccgaggagg taccgagacc 3060 tcaaaataaa ggagtgattg ccttactgcc gagtggagag tgattactga gcggccggtg 3120 tatcgggagt cgtcccttaa tctgtgcaat accagagcgg ctctcgcagc tggcgggaat 3180 tcgattctag aggtgataga aatgccagaa aactatgcaa aaacaagaat cataaacagg 3240 aaaaaaagag aactcagcca caagaggaag aagagaggcg ttggcttggt cattatgcta 3300 gttatcatgg caatagtagc tgccgcaggg gcttctctgg gagtcgcaaa cgcgattcag 3360 cagtcttaca ctaaggcagc tgtccagacc cttgctaatg caactgctgc acagcaggat 3420 gtgttagagg caacctatgc catggtacag catgtggcta aaggcgtacg aatcttggaa 3480 gctcgagtgg ctcgagtgga agctatcaca gatagaataa tgctatacca agaattggat 3540 tgttggcact aggatccatc gccacc 3566 <210> 74 <211> 7623 <212> DNA <213> Artificial <220> <223> pHGVSV-G <400> 74 gcgcgcgttg acattgatta ttgactagtt 
attaatagta atcaattacg gggtcattag 60 ttcatagccc atatatggag ttccgcgtta cataacttac ggtaaatggc ccgcctggct 120 gaccgcccaa cgacccccgc ccattgacgt caataatgac gtatgttccc atagtaacgc 180 caatagggac tttccattga cgtcaatggg tggactattt acggtaaact gcccacttgg 240 cagtacatca agtgtatcat atgccaagta cgccccctat tgacgtcaat gacggtaaat 300 ggcccgcctg gcattatgcc cagtacatga ccttatggga ctttcctact tggcagtaca 360 tctacgtatt agtcatcgct attaccatgg tgatgcggtt ttggcagtac atcaatgggc 420 gtggatagcg gtttgactca cggggatttc caagtctcca ccccattgac gtcaatggga 480 gtttgttttg gcaccaaaat caacgggact ttccaaaatg tcgtaacaac tccgccccat 540 tgacgcaaat gggcggtagg cgtgtacggt gggaggtcta tataagcaga gctctctggc 600 taactagaga acccactgct tactggctta tcgaaattaa tacgactcac tatagggaga 660 cccaagcttg gtaccgagct cggatccact agtaacggcc gccagtgtgc tggaattcga 720 tgatcctgag aacttcaggg tgagtctatg ggacccttga tgttttcttt ccccttcttt 780 tctatggtta agttcatgtc ataggaaggg gagaagtaac agggtacagt ttagaatggg 840 aaacagacga atgattgcat cagtgtggaa gtctcaggat cgttttagtt tcttttattt 900 gctgttcata acaattgttt tcttttgttt aattcttgct ttcttttttt ttcttctccg 960 caatttttac tattatactt aatgccttaa cattgtgtat aacaaaagga aatatctctg 1020 agatacatta agtaacttaa aaaaaaactt tacacagtct gcctagtaca ttactatttg 1080 gaatatatgt gtgcttattt gcatattcat aatctcccta ctttattttc ttttattttt 1140 aattgataca taatcattat acatatttat gggttaaagt gtaatgtttt aatatgtgta 1200 cacatattga ccaaatcagg gtaattttgc atttgtaatt ttaaaaaatg ctttcttctt 1260 ttaatatact tttttgttta tcttatttct aatactttcc ctaatctctt tctttcaggg 1320 caataatgat acaatgtatc atgcctcttt gcaccattct aaagaataac agtgataatt 1380 tctgggttaa ggcaatagca atatttctgc atataaatat ttctgcatat aaattgtaac 1440 tgatgtaaga ggtttcatat tgctaatagc agctacaatc cagctaccat taggcccttt 1500 tgctaatcat gttcatacct cttatcttcc tcccacagct cctgggcaac gtgctggtct 1560 gtgtgctggc ccatcacttt ggcaaagaat cactagtgaa ttctgcagat atccatcaca 1620 ctggcggccg ctcgaggaat tctgacacta tgaagtgcct tttgtactta gcctttttat 1680 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa 
ggaaactgga 1740 aaaatgttcc ttctaattac cattattgcc cgtcaagctc agatttaaat tggcataatg 1800 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 1860 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac 1920 cgaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa 1980 gcattgaaca aacgaaacaa ggaacttggc tgaatccagg cttccctcct caaagttgtg 2040 gatatgcaac tgtgacggat gccgaagcag tgattgtcca ggtgactcct caccatgtgc 2100 tggttgatga atacacagga gaatgggttg attcacagtt catcaacgga aaatgcagca 2160 attacatatg ccccactgtc cataactcta caacctggca ttctgactat aaggtcaaag 2220 ggctatgtga ttctaacctc atttccatgg acatcacctt cttctcagag gacggagagc 2280 tatcatccct gggaaaggag ggcacagggt tcagaagtaa ctactttgct tatgaaactg 2340 gaggcaaggc ctgcaaaatg caatactgca agcattgggg agtcagactc ccatcaggtg 2400 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag 2460 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtcta attcaggacg 2520 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 2580 ttccaatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggtcctg 2640 ctttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agagtcgata 2700 ttgctgctcc aatcctctca agaatggtcg gaatgatcag tggaactacc acagaaaggg 2760 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga 2820 ggaccagttc aggatataag tttcctttat acatgattgg acatggtatg ttggactccg 2880 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcaa gacgctgctt 2940 cgcaacttcc tgatgatgag agtttatttt ttggtgatac tgggctatcc aaaaatccaa 3000 tcgagcttgt agaaggttgg ttcagtagtt ggaaaagctc tattgcctct tttttcttta 3060 tcatagggtt aatcattgga ctattcttgg ttctccgagt tggtatccat ctttgcatta 3120 aattaaagca caccaagaaa agacagattt atacagacat agagatgaac cgacttggaa 3180 agtaactcaa atcctgcaca acagattctt catgtttgga ccaaatcaac ttgtgatacc 3240 atgctcaaag aggcctcaat tatatttgag tttttaattt ttatgaaaaa aaaaaaaaaa 3300 aacggaattc ctcgagcatg catctagagg gccctattct atagtgtcac ctaaatgcta 3360 gagctcgctg atcagcctcg actgtgcctt ctagttgcca gccatctgtt gtttgcccct 
3420 cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc taataaaatg 3480 aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt ggggtggggc 3540 aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggat gcggtgggct 3600 ctatggcttc tgaggcggaa agaaccagtg gcggtaatac ggttatccac agaatcaggg 3660 gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag 3720 gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga 3780 cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct 3840 ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc 3900 tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg 3960 gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc 4020 tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca 4080 ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag 4140 ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct 4200 ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc 4260 accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga 4320 tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca 4380 cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat 4440 taaaaatgaa gttttaaatc aatctaaagt atatatgagt aacctgaggc tatggcaggg 4500 cctgccgccc cgacgttggc tgcgagccct gggccttcac ccgaacttgg ggggtggggt 4560 ggggaaaagg aagaaacgcg ggcgtattgg ccccaatggg gtctcggtgg ggtatcgaca 4620 gagtgccagc cctgggaccg aaccccgcgt ttatgaacaa acgacccaac accgtgcgtt 4680 ttattctgtc tttttattgc cgtcatagcg cgggttcctt ccggtattgt ctccttccgt 4740 gtttcagtta gcctccccct agggtgggcg aagaactcca gcatgagatc cccgcgctgg 4800 aggatcatcc agccggcgtc ccggaaaacg attccgaagc ccaacctttc atagaaggcg 4860 gcggtggaat cgaaatctcg tgatggcagg ttgggcgtcg cttggtcggt catttcgaac 4920 cccagagtcc cgctcagaag aactcgtcaa gaaggcgata gaaggcgatg cgctgcgaat 4980 cgggagcggc gataccgtaa agcacgagga agcggtcagc ccattcgccg ccaagctctt 5040 cagcaatatc acgggtagcc aacgctatgt cctgatagcg gtccgccaca cccagccggc 5100 
cacagtcgat gaatccagaa aagcggccat tttccaccat gatattcggc aagcaggcat 5160 cgccatgggt cacgacgaga tcctcgccgt cgggcatgct cgccttgagc ctggcgaaca 5220 gttcggctgg cgcgagcccc tgatgctctt gatcatcctg atcgacaaga ccggcttcca 5280 tccgagtacg tgctcgctcg atgcgatgtt tcgcttggtg gtcgaatggg caggtagccg 5340 gatcaagcgt atgcagccgc cgcattgcat cagccatgat ggatactttc tcggcaggag 5400 caaggtgaga tgacaggaga tcctgccccg gcacttcgcc caatagcagc cagtcccttc 5460 ccgcttcagt gacaacgtcg agcacagctg cgcaaggaac gcccgtcgtg gccagccacg 5520 atagccgcgc tgcctcgtct tgcagttcat tcagggcacc ggacaggtcg gtcttgacaa 5580 aaagaaccgg gcgcccctgc gctgacagcc ggaacacggc ggcatcagag cagccgattg 5640 tctgttgtgc ccagtcatag ccgaatagcc tctccaccca agcggccgga gaacctgcgt 5700 gcaatccatc ttgttcaatc atgcgaaacg atcctcatcc tgtctcttga tcgatctttg 5760 caaaagccta ggcctccaaa aaagcctcct cactacttct ggaatagctc agaggccgag 5820 gcggcctcgg cctctgcata aataaaaaaa attagtcagc catggggcgg agaatgggcg 5880 gaactgggcg gagttagggg cgggatgggc ggagttaggg gcgggactat ggttgctgac 5940 taattgagat gcatgctttg catacttctg cctgctgggg agcctgggga ctttccacac 6000 ctggttgctg actaattgag atgcatgctt tgcatacttc tgcctgctgg ggagcctggg 6060 gactttccac accctaactg acacacattc cacagctggt tctttccgcc tcaggactct 6120 tcctttttca ataaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa 6180 tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc 6240 tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct 6300 gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca 6360 gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt 6420 aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt 6480 gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc 6540 ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc 6600 tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt 6660 atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact 6720 ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc 6780 ccggcgtcaa 
tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt 6840 ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg 6900 atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct 6960 gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa 7020 tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt 7080 ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc 7140 acatttcccc gaaaagtgcc acctgacgcg ccctgtagcg gcgcattaag cgcggcgggt 7200 gtggtggtta cgcgcagcgt gaccgctaca cttgccagcg ccctagcgcc cgctcctttc 7260 gctttcttcc cttcctttct cgccacgttc gccggctttc cccgtcaagc tctaaatcgg 7320 gggctccctt tagggttccg atttagtgct ttacggcacc tcgaccccaa aaaacttgat 7380 tagggtgatg gttcacgtag tgggccatcg ccctgataga cggtttttcg ccctttgacg 7440 ttggagtcca cgttctttaa tagtggactc ttgttccaaa ctggaacaac actcaaccct 7500 atctcggtct attcttttga tttataaggg attttgccga tttcggccta ttggttaaaa 7560 aatgagctga tttaacaaaa atttaacgcg aattttaaca aaatattaac gcttacaatt 7620 tac 7623 <210> 75 <211> 5419 <212> DNA <213> Artificial <220> <223> pHYK / rev <400> 75 gcgcgcgttg acattgatta ttgactagtt attaatagta atcaattacg gggtcattag 60 ttcatagccc atatatggag ttccgcgtta cataacttac ggtaaatggc ccgcctggct 120 gaccgcccaa cgacccccgc ccattgacgt caataatgac gtatgttccc atagtaacgc 180 caatagggac tttccattga cgtcaatggg tggactattt acggtaaact gcccacttgg 240 cagtacatca agtgtatcat atgccaagta cgccccctat tgacgtcaat gacggtaaat 300 ggcccgcctg gcattatgcc cagtacatga ccttatggga ctttcctact tggcagtaca 360 tctacgtatt agtcatcgct attaccatgg tgatgcggtt ttggcagtac atcaatgggc 420 gtggatagcg gtttgactca cggggatttc caagtctcca ccccattgac gtcaatggga 480 gtttgttttg gcaccaaaat caacgggact ttccaaaatg tcgtaacaac tccgccccat 540 tgacgcaaat gggcggtagg cgtgtacggt gggaggtcta tataagcaga gctctctggc 600 taactagaga acccactgct tactggctta tcgaaattaa tacgactcac tatagggaga 660 cccaagctta gatggatgct ggggccagat acatgcgctt aactgggaag gaaaactggg 720 ttgaagtaac catggacgga gagaaggaaa ggaaaagaga aggtttcact gcgggacagc 780 aagatataca 
gaactctaag taccccgaca taccaacggg tcacagtcat catggaaaca 840 agagcagacg tcgcaggaga aaatcaggat tttggcgatg gcttagagga atcagacaac 900 agcgaaacaa gcgaaagagt gacagtacag aaagcttgga gccgtgcctg ggagctttgg 960 cagaactcac cctggaagga gccatggaaa aggggcctgc tgaggctgct cgtccttccg 1020 ctgacgatgg gaatctggat aaatggatgg cttggagaac accacaaaaa taagaattct 1080 gcagatatcc atcacactgg cggccgctcg agcatgcatc tagagggccc tattctatag 1140 tgtcacctaa atgctagagc tcgctgatca gcctcgactg tgccttctag ttgccagcca 1200 tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg aaggtgccac tcccactgtc 1260 ctttcctaat aaaatgagga aattgcatcg cattgtctga gtaggtgtca ttctattctg 1320 gggggtgggg tggggcagga cagcaagggg gaggattggg aagacaatag caggcatgct 1380 ggggatgcgg tgggctctat ggcttctgag gcggaaagaa ccagtggcgg taatacggtt 1440 atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc 1500 caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga 1560 gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata 1620 ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac 1680 cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg 1740 taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc 1800 cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag 1860 acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt 1920 aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta gaaggacagt 1980 atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg 2040 atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac 2100 gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca 2160 gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac 2220 ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaacc 2280 tgaggctatg gcagggcctg ccgccccgac gttggctgcg agccctgggc cttcacccga 2340 acttgggggg tggggtgggg aaaaggaaga aacgcgggcg tattggcccc aatggggtct 2400 cggtggggta tcgacagagt gccagccctg ggaccgaacc ccgcgtttat gaacaaacga 2460 cccaacaccg tgcgttttat 
tctgtctttt tattgccgtc atagcgcggg ttccttccgg 2520 tattgtctcc ttccgtgttt cagttagcct ccccctaggg tgggcgaaga actccagcat 2580 gagatccccg cgctggagga tcatccagcc ggcgtcccgg aaaacgattc cgaagcccaa 2640 cctttcatag aaggcggcgg tggaatcgaa atctcgtgat ggcaggttgg gcgtcgcttg 2700 gtcggtcatt tcgaacccca gagtcccgct cagaagaact cgtcaagaag gcgatagaag 2760 gcgatgcgct gcgaatcggg agcggcgata ccgtaaagca cgaggaagcg gtcagcccat 2820 tcgccgccaa gctcttcagc aatatcacgg gtagccaacg ctatgtcctg atagcggtcc 2880 gccacaccca gccggccaca gtcgatgaat ccagaaaagc ggccattttc caccatgata 2940 ttcggcaagc aggcatcgcc atgggtcacg acgagatcct cgccgtcggg catgctcgcc 3000 ttgagcctgg cgaacagttc ggctggcgcg agcccctgat gctcttgatc atcctgatcg 3060 acaagaccgg cttccatccg agtacgtgct cgctcgatgc gatgtttcgc ttggtggtcg 3120 aatgggcagg tagccggatc aagcgtatgc agccgccgca ttgcatcagc catgatggat 3180 actttctcgg caggagcaag gtgagatgac aggagatcct gccccggcac ttcgcccaat 3240 agcagccagt cccttcccgc ttcagtgaca acgtcgagca cagctgcgca aggaacgccc 3300 gtcgtggcca gccacgatag ccgcgctgcc tcgtcttgca gttcattcag ggcaccggac 3360 aggtcggtct tgacaaaaag aaccgggcgc ccctgcgctg acagccggaa cacggcggca 3420 tcagagcagc cgattgtctg ttgtgcccag tcatagccga atagcctctc cacccaagcg 3480 gccggagaac ctgcgtgcaa tccatcttgt tcaatcatgc gaaacgatcc tcatcctgtc 3540 tcttgatcga tctttgcaaa agcctaggcc tccaaaaaag cctcctcact acttctggaa 3600 tagctcagag gccgaggcgg cctcggcctc tgcataaata aaaaaaatta gtcagccatg 3660 gggcggagaa tgggcggaac tgggcggagt taggggcggg atgggcggag ttaggggcgg 3720 gactatggtt gctgactaat tgagatgcat gctttgcata cttctgcctg ctggggagcc 3780 tggggacttt ccacacctgg ttgctgacta attgagatgc atgctttgca tacttctgcc 3840 tgctggggag cctggggact ttccacaccc taactgacac acattccaca gctggttctt 3900 tccgcctcag gactcttcct ttttcaataa atcaatctaa agtatatatg agtaaacttg 3960 gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg 4020 ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc 4080 atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc 4140 agcaataaac cagccagccg gaagggccga 
gcgcagaagt ggtcctgcaa ctttatccgc 4200 ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag 4260 tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat 4320 ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg 4380 caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt 4440 gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag 4500 atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg 4560 accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt 4620 aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct 4680 gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac 4740 tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat 4800 aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat 4860 ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca 4920 aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgcgccct gtagcggcgc 4980 attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct 5040 agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg 5100 tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga 5160 ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct gatagacggt 5220 ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt tccaaactgg 5280 aacaacactc aaccctatct cggtctattc ttttgattta taagggattt tgccgatttc 5340 ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt ttaacaaaat 5400 attaacgctt acaatttac 5419 <210> 76 <211> 5729 <212> DNA <213> Artificial <220> PH223 / vif <400> 76 gcgcgcgttg acattgatta ttgactagtt attaatagta atcaattacg gggtcattag 60 ttcatagccc atatatggag ttccgcgtta cataacttac ggtaaatggc ccgcctggct 120 gaccgcccaa cgacccccgc ccattgacgt caataatgac gtatgttccc atagtaacgc 180 caatagggac tttccattga cgtcaatggg tggactattt acggtaaact gcccacttgg 240 cagtacatca agtgtatcat atgccaagta cgccccctat tgacgtcaat gacggtaaat 300 ggcccgcctg gcattatgcc cagtacatga ccttatggga ctttcctact tggcagtaca 360 tctacgtatt agtcatcgct 
attaccatgg tgatgcggtt ttggcagtac atcaatgggc 420 gtggatagcg gtttgactca cggggatttc caagtctcca ccccattgac gtcaatggga 480 gtttgttttg gcaccaaaat caacgggact ttccaaaatg tcgtaacaac tccgccccat 540 tgacgcaaat gggcggtagg cgtgtacggt gggaggtcta tataagcaga gctctctggc 600 taactagaga acccactgct tactggctta tcgaaattaa tacgactcac tatagggaga 660 cccaagcttg gtaccgagct cggatccgcg atgcaaaatt catcccgcca ccaacaaaag 720 aaaaggaata aaaaacctgg accagaatta cccttagcac tatggatcca tatagcagaa 780 agcattaatg gggatagctc atggtacata acaatgagac tgcaacagat gatgtgggga 840 aaaagaggaa ataagttaca atataagaat gaagacaggg aatatgaaaa ttgggaaatt 900 acatcatggg gatggaaaat gcacctaagg agagtgaaac aatggataca agacaacagg 960 agaggaagcc catggcagta caaagtagga ggaacatgga aaagtatagg agtgtggttc 1020 ctgcaagcag gagattacag aaaggtagac aggcacttct ggtgggcatg gaggatactg 1080 atatgttcct gcaggaaaga aaagtttgat ataagagaat ttatgagagg aagacataga 1140 tgggatttgt gcaaatcctg tgctcaagga gaagtagtaa agcatactag aacaaaaagt 1200 ctggaaagac tagtactgct acagatggta gaacagcatg tgtttcaagt attgccattg 1260 tggagagcca ggagaagtag tacaacagat ttcccatggt gcagggacac aacgggatac 1320 acgcatgcgt ggtctgtcca ggagtgctgg ttgatggaat atctcttaga ggatgagtga 1380 ccggaattct gcagatatcc atcacactgg cggccgctcg agcatgcatc tagagggccc 1440 tattctatag tgtcacctaa atgctagagc tcgctgatca gcctcgactg tgccttctag 1500 ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg aaggtgccac 1560 tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga gtaggtgtca 1620 ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg aagacaatag 1680 caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa ccagtggcgg 1740 taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc 1800 agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc 1860 cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac 1920 tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc 1980 tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata 2040 gctcacgctg taggtatctc agttcggtgt aggtcgttcg 
ctccaagctg ggctgtgtgc 2100 acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca 2160 acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag 2220 cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta 2280 gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg 2340 gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc 2400 agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt 2460 ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa 2520 ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat 2580 atgagtaacc tgaggctatg gcagggcctg ccgccccgac gttggctgcg agccctgggc 2640 cttcacccga acttgggggg tggggtgggg aaaaggaaga aacgcgggcg tattggcccc 2700 aatggggtct cggtggggta tcgacagagt gccagccctg ggaccgaacc ccgcgtttat 2760 gaacaaacga cccaacaccg tgcgttttat tctgtctttt tattgccgtc atagcgcggg 2820 ttccttccgg tattgtctcc ttccgtgttt cagttagcct ccccctaggg tgggcgaaga 2880 actccagcat gagatccccg cgctggagga tcatccagcc ggcgtcccgg aaaacgattc 2940 cgaagcccaa cctttcatag aaggcggcgg tggaatcgaa atctcgtgat ggcaggttgg 3000 gcgtcgcttg gtcggtcatt tcgaacccca gagtcccgct cagaagaact cgtcaagaag 3060 gcgatagaag gcgatgcgct gcgaatcggg agcggcgata ccgtaaagca cgaggaagcg 3120 gtcagcccat tcgccgccaa gctcttcagc aatatcacgg gtagccaacg ctatgtcctg 3180 atagcggtcc gccacaccca gccggccaca gtcgatgaat ccagaaaagc ggccattttc 3240 caccatgata ttcggcaagc aggcatcgcc atgggtcacg acgagatcct cgccgtcggg 3300 catgctcgcc ttgagcctgg cgaacagttc ggctggcgcg agcccctgat gctcttgatc 3360 atcctgatcg acaagaccgg cttccatccg agtacgtgct cgctcgatgc gatgtttcgc 3420 ttggtggtcg aatgggcagg tagccggatc aagcgtatgc agccgccgca ttgcatcagc 3480 catgatggat actttctcgg caggagcaag gtgagatgac aggagatcct gccccggcac 3540 ttcgcccaat agcagccagt cccttcccgc ttcagtgaca acgtcgagca cagctgcgca 3600 aggaacgccc gtcgtggcca gccacgatag ccgcgctgcc tcgtcttgca gttcattcag 3660 ggcaccggac aggtcggtct tgacaaaaag aaccgggcgc ccctgcgctg acagccggaa 3720 cacggcggca tcagagcagc cgattgtctg ttgtgcccag tcatagccga 
atagcctctc 3780 cacccaagcg gccggagaac ctgcgtgcaa tccatcttgt tcaatcatgc gaaacgatcc 3840 tcatcctgtc tcttgatcga tctttgcaaa agcctaggcc tccaaaaaag cctcctcact 3900 acttctggaa tagctcagag gccgaggcgg cctcggcctc tgcataaata aaaaaaatta 3960 gtcagccatg gggcggagaa tgggcggaac tgggcggagt taggggcggg atgggcggag 4020 ttaggggcgg gactatggtt gctgactaat tgagatgcat gctttgcata cttctgcctg 4080 ctggggagcc tggggacttt ccacacctgg ttgctgacta attgagatgc atgctttgca 4140 tacttctgcc tgctggggag cctggggact ttccacaccc taactgacac acattccaca 4200 gctggttctt tccgcctcag gactcttcct ttttcaataa atcaatctaa agtatatatg 4260 agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct 4320 gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg 4380 agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc 4440 cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa 4500 ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc 4560 cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt 4620 cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc 4680 ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt 4740 tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc 4800 catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt 4860 gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata 4920 gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga 4980 tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag 5040 catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa 5100 aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt 5160 attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga 5220 aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgcgccct 5280 gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg 5340 ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg 5400 gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac 
5460 ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct 5520 gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt 5580 tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt 5640 tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt 5700 ttaacaaaat attaacgctt acaatttac 5729 <210> 77 <211> 9446 <212> DNA <213> Artificial <220> <223> pMGP / RRE <400> 77 aacaggaaag ttccattgga gccaagtaca ttgagtcaat agggactttc caatgggttt 60 tgcccagtac ataaggtcaa tgggaggtaa gccaatgggt ttttcccatt actggcacgt 120 atactgagtc attagggact ttccaatggg ttttgcccag tacataaggt caataggggt 180 gaatcaacag gaaagtccca ttggagccaa gtacactgag tcaataggga ctttccattg 240 ggttttgccc agtacaaaag gtcaataggg ggtgagtcaa tgggtttttc ccattattgg 300 cacgtacata aggtcaatag gggtgagtca ttgggttttt ccagccaatt taattaaaac 360 gccatgtact ttcccaccat tgacgtcaat gggctattga aactaatgca acgtgacctt 420 taaacggtac tttcccatag ctgattaatg ggaaagtacc gttctcgagc caatacacgt 480 caatgggaag tgaaagggca gccaaaacgt aacaccgccc cggttttccc tggaaattcc 540 atattggcac gcattctatt ggctgagctg cgttcacgtg ggtataagag gcgcgaccag 600 cgtcggtacc gtcgcagtct tcggtctgac caccgtagaa cgcagagctc ctcgctgcag 660 gcaagcttgg taccgagctc ggatcccggg gaggtaccaa aatccgccat ggtgagtcta 720 gatagagaca tggcgaggca agtctccggg gggaaaagag attatcctga gctcgaaaaa 780 tgtatcaagc atgcatgcaa gataaaagtt cgactcagag gggagcactt gacagaagga 840 aattgtttat ggtgccttaa aacattagat tacatgtttg aggaccataa agaggaacct 900 tggacaaaag taaaatttag gacaatatgg cagaaggtga agaatctaac tcctgaggag 960 agtaacaaaa aagactttat gtctttgcag gccacattag cgggtctaat gtgttgccaa 1020 atggggatga gacctgagac attgcaagat gcaatggcta cagtaatcat gaaagatggg 1080 ttactggaac aagaggaaaa gaaggaagac aaaagagaaa aggaagagag tgtcttccca 1140 atagtagtgc aagcagcagg agggagaagc tggaaagcag tagattctgt aatgttccag 1200 caactgcaaa cagtagcaat gcagcatggc ctcgtgtctg aggactttga aaggcagttg 1260 gcatattatg ctactacctg gacaagtaaa gacatactag aagtattggc catgatgcct 1320 ggaaatagag ctcaaaagga gttaattcaa gggaaattaa 
atgaagaagc agaaaggtgg 1380 agaaggaata atccaccacc tccagcagga ggaggattaa cagtggatca aattatgggg 1440 gtaggacaaa caaatcaagc agcagcacaa gctaacatgg atcaggcaag gcaaatatgc 1500 ctgcaatggg taataaatgc attaagagca gtaagacata tggcgcacag gccagggaat 1560 ccaatgctag taaagcaaaa aacgaatgag ccatatgaag attttgcagc aagactgcta 1620 gaagcaatag atgcagagcc agttacacag cctataaaag attatctaaa gctaacacta 1680 tcttatacaa atgcatcagc agattgtcag aagcaaatgg atagaacact aggacaaaga 1740 gtacaacaag ctagtgtaga agaaaaaatg caagcatgta gagatgtggg atcagaaggg 1800 ttcaaaatgc aattgttagc acaagcatta aggccaggaa aaggaaaagg gaatggacag 1860 ccacaaaggt gttacaactg tggaaaaccg ggacatcaag caaggcaatg tagacaagga 1920 atcatatgtc acaactgtgg aaagagagga catatgcaaa aagaatgcag aggaaagaga 1980 gacataaggg gaaaacagca gggaaacggg aggaggggga tacgtgtggt gccgtccgct 2040 cctcctatgg aataacttca gcaccaccta tggttcaggt ccgcataggt tcccagcaga 2100 ggaacttgtt atttgatacc ggggcggacc gaactatagt tagatggcat gagggctcgg 2160 gaaacccagc cggaaggata aaactgcaag gaataggagg aatagtagaa ggagaaaaat 2220 ggaataatgt agaattagaa tataaaggag aaacaagaaa gggaacaata gtagtgttac 2280 cacaaagtcc agtagaagta ttaggacgag ataacatggc ccgatttgga ataaagataa 2340 taatggcaaa tttagaggaa aaaagaatcc caattacaaa agtaaaattg aaagagggat 2400 gtacgggtcc acatgtccca caatggccat taacagaaga gaaattaaaa ggtctaacag 2460 aaatcataga taaattagtg gaagaaggaa aactaggaaa ggcaccccca cattggacat 2520 gtaatactcc aatcttttgc ataaaaaaga aatcagggaa gtggagaatg ttaatagatt 2580 tcagagaatt gaacaaacag acagaagatt taacagaagc gcagttagga ctcccgcatc 2640 cgggaggact acaaaagaaa aaacatgtta caatattgga cataggagat gcatatttta 2700 ctatacccct atatgaacca tatcgagagt acacatgttt tactctatta agtcctaata 2760 atctaggacc atgtaaaaga tactattgga aagtgctgcc acaaggttgg aaattgagtc 2820 catctgtata tcaatttact atgcaggaga tcttagagga ttggatacag cagcatccag 2880 aaattcaatt tggcatatat atggatgata tttacatagg aagtgattta gaaattaaaa 2940 agcatagaga aatagtgaaa gatttagcca attatattgc ccaatatgga ttcactctgc 3000 cagaagagaa gagacaaaag ggatatccag caaaatggct aggatttgaa 
ctacacccgc 3060 agacctggaa atttcagaag catacattac ctgaattaac aaagggaaca ataacattaa 3120 ataaattaca gaaattagta ggagaattag tatggagaca atccataatt gggaaaagca 3180 ttcctaacat tctgaaatta atggaaggag atagagaatt acaaagtgaa agaaaaattg 3240 aagaagtaca tgtgaaagaa tgggaagcat gtaggaaaaa attagaagaa atggaaggaa 3300 attattataa taaagacaaa gatgtctatg gacaattggc ttggggagac aaagctatag 3360 aatatatagt gtatcaggag aaagggaaac cattatgggt aaatgtggtt cacaatataa 3420 agaacctaag catcccgcaa caggttatta aagcagcgca aaaattaacc caagaagtca 3480 tcattaggac aggaaaaata ccatggatat tgttgccagg gaaagaagaa gattggagac 3540 tagaattgca attagggaac atcacatgga tgccaaaatt ttggtcctgt tatcgaggac 3600 atacaagatg gagaaaaaga aatataatag aagaagtagt agaagggcct acatattata 3660 cagatggagg aaaaaagaat aaagtaggaa gtctagggtt catagtatca acaggggaaa 3720 aatttagaaa gcatgaagag ggcacaaacc agcaactaga attaagagcc atagaggaag 3780 ctctaaaaca agggcctcaa acaatgaatt tagtaacaga tagtagatat gcatttgaat 3840 ttttattaag aaattgggat gaagaagtaa taaagaatcc aattcaagca agaattatgg 3900 aaattgccca caagaaagat aggataggag tgcattgggt gccaggacat aaagggattc 3960 cccaaaatga agaaatagac aaatatattt cggaaatatt tcttgcaaaa gaaggagaag 4020 gaattctccc aaaaagagaa gaggatgcag ggtatgattt aatatgccca gaagaggtta 4080 ccatagagcc aggacaagtg aaatgcatcc ccatagagct aagattaaat ttaaagaaat 4140 cacaatgggc tatgattgct acaaaaagca gcatggctgc caaaggagtg ttcacacaag 4200 gaggaatcat agactcagga tatcagggac aaatacaggt aataatgtat aatagcaata 4260 aaatagcagt agtcataccc caagggagaa aatttgcaca attaatatta atggataaaa 4320 agcatgaaaa attggaaccc tggggggaaa gcagaaaaac agaaagggga gaaaaaggat 4380 ttgggtctac aggaatgtat tggatagaaa atattcctct ggcagaggaa gaccacacaa 4440 aatggcatca agatgcccga tcattgcatc tagaatttga aattccaaga acagcagcag 4500 aagacatagt aaatcaatgt gaaatatgca aagaaggcag gacacctgca gtaattagag 4560 gcggaaacaa aaggggggta gatcattggc aagtggatta tacccattat gaaaatatca 4620 tactattagt atgggtagaa acaaattcag gactaatata tgcagaaaaa gtaaaaggag 4680 aatcagggca agaattcaga ataaaagtga tgcaatggta tgcattattt ggtccagagt 
4740 cattgcagtc agacaatgga cctgcatttg cagcagagcc cacacagctg ttaatgcaat 4800 acctaggagt aaaacacaca acaggcatac cttggaatcc acagtctcag gctatagtag 4860 aaagggcaca tcaactattg aaaagcactt taaagaagtt ccagccacaa tttgtcgctg 4920 tagaatcagc catagcagca gccctagtcg ccataaatat aaaaagaaag ggtgggctgg 4980 ggacaagccc tatggatatt tttatatata ataaagaaca gaaaagaata aataataaat 5040 ataataaaaa ttctcaaaaa attcaattct gttattacag aataaggaaa agaggacatc 5100 caggagagtg gaaaggacca acccaggtac tgtggaaagg ggaaggagca attgtggtaa 5160 aggatataga aagtgaaaag tatttagtaa taccttacaa agatgcaaaa ttcatcccgc 5220 caccaacaaa agaaaaggaa taaaaaacct ggaccagaat tacccttagc actatggatc 5280 cactagtaac ggccgccagt gtgctggaat tctgcagata tccatcacac tggcggccgg 5340 gctgcaggaa ttcgatagaa aagatatcaa aaacaagaat cataaacagg aaaaaaagag 5400 aactcagcca caagaggaag aagagaggcg ttggcttggt cattatgcta gttatcatgg 5460 caatagtagc tgccgcaggg gcttctctgg gagtcgcaaa cgcgattcag cagtcttaca 5520 ctaaggcagc tgtccagacc cttgctaatg caactgctgc acagcaggat gtgttagagg 5580 caacctatgc catggtacag catgtggcta aaggcgtacg aatcttggaa gctcgagtgg 5640 ctcgagtgga agctatcaca gatagaatag cggccgccca tcaagcttat cgataccgtc 5700 ggccgctcga gcatgcatct agagggccct attctatagt gtcacctaaa tgctagagct 5760 cgctgatcag cctcgactgt gccttctagt tgccagccat ctgttgtttg cccctccccc 5820 gtgccttcct tgaccctgga aggtgccact cccactgtcc tttcctaata aaatgaggaa 5880 attgcatcgc attgtctgag taggtgtcat tctattctgg ggggtggggt ggggcaggac 5940 agcaaggggg aggattggga agacaatagc aggcatgctg gggatgcggt gggctctatg 6000 gcttctgagg cggaaagaac cagtggcggt aatacggtta tccacagaat caggggataa 6060 cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc 6120 gttgctggcg tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc 6180 aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag 6240 ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct 6300 cccttcggga agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta 6360 ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc 6420 
cttatccggt aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc 6480 agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt 6540 gaagtggtgg cctaactacg gctacactag aaggacagta tttggtatct gcgctctgct 6600 gaagccagtt accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc 6660 tggtagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca 6720 agaagatcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta 6780 agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa 6840 atgaagtttt aaatcaatct aaagtatata tgagtaacct gaggctatgg cagggcctgc 6900 cgccccgacg ttggctgcga gccctgggcc ttcacccgaa cttggggggt ggggtgggga 6960 aaaggaagaa acgcgggcgt attggcccca atggggtctc ggtggggtat cgacagagtg 7020 ccagccctgg gaccgaaccc cgcgtttatg aacaaacgac ccaacaccgt gcgttttatt 7080 ctgtcttttt attgccgtca tagcgcgggt tccttccggt attgtctcct tccgtgtttc 7140 agttagcctc cccctagggt gggcgaagaa ctccagcatg agatccccgc gctggaggat 7200 catccagccg gcgtcccgga aaacgattcc gaagcccaac ctttcataga aggcggcggt 7260 ggaatcgaaa tctcgtgatg gcaggttggg cgtcgcttgg tcggtcattt cgaaccccag 7320 agtcccgctc agaagaactc gtcaagaagg cgatagaagg cgatgcgctg cgaatcggga 7380 gcggcgatac cgtaaagcac gaggaagcgg tcagcccatt cgccgccaag ctcttcagca 7440 atatcacggg tagccaacgc tatgtcctga tagcggtccg ccacacccag ccggccacag 7500 tcgatgaatc cagaaaagcg gccattttcc accatgatat tcggcaagca ggcatcgcca 7560 tgggtcacga cgagatcctc gccgtcgggc atgctcgcct tgagcctggc gaacagttcg 7620 gctggcgcga gcccctgatg ctcttgatca tcctgatcga caagaccggc ttccatccga 7680 gtacgtgctc gctcgatgcg atgtttcgct tggtggtcga atgggcaggt agccggatca 7740 agcgtatgca gccgccgcat tgcatcagcc atgatggata ctttctcggc aggagcaagg 7800 tgagatgaca ggagatcctg ccccggcact tcgcccaata gcagccagtc ccttcccgct 7860 tcagtgacaa cgtcgagcac agctgcgcaa ggaacgcccg tcgtggccag ccacgatagc 7920 cgcgctgcct cgtcttgcag ttcattcagg gcaccggaca ggtcggtctt gacaaaaaga 7980 accgggcgcc cctgcgctga cagccggaac acggcggcat cagagcagcc gattgtctgt 8040 tgtgcccagt catagccgaa tagcctctcc acccaagcgg ccggagaacc tgcgtgcaat 8100 ccatcttgtt 
caatcatgcg aaacgatcct catcctgtct cttgatcgat ctttgcaaaa 8160 gcctaggcct ccaaaaaagc ctcctcacta cttctggaat agctcagagg ccgaggcggc 8220 ctcggcctct gcataaataa aaaaaattag tcagccatgg ggcggagaat gggcggaact 8280 gggcggagtt aggggcggga tgggcggagt taggggcggg actatggttg ctgactaatt 8340 gagatgcatg ctttgcatac ttctgcctgc tggggagcct ggggactttc cacacctggt 8400 tgctgactaa ttgagatgca tgctttgcat acttctgcct gctggggagc ctggggactt 8460 tccacaccct aactgacaca cattccacag ctggttcttt ccgcctcagg actcttcctt 8520 tttcaataaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt 8580 aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact 8640 ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat 8700 gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg 8760 aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg 8820 ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat 8880 tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc 8940 ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt 9000 cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc 9060 agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga 9120 gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc 9180 gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa 9240 acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta 9300 acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg 9360 agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg 9420 aatactcata ctcttccttt ttcaat 9446 <210> 78 <211> 7856 <212> DNA <213> Artificial <220> <223> pCAH / SINd60 / hlacZ <400> 78 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta tcatatgcca agtacgcccc ctattgacgt 
caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttat gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacatcaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa caactccgcc ccattgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcaa gcttgctgct 600 tgcacttcag agttctagga gagtccctcc tagtctctcc tctccgagga ggtaccgaga 660 cctcaaaata aaggagtgat tgccttactg ccgagtggag agtgattact gagcggccgg 720 tgtatcggga gtcgtccctt aatctgtgca ataccagagc ggctctcgca gctggcgccc 780 aacgtggggc ccgaggagaa gaaaagaaag cggccctgag aactcggctt ctgaaaaaga 840 ggaagaggac aagttgctat agcaacaaga gagaagaagt agagcaaagg tccagtggct 900 cggaaaaaga ggaactgaaa cttcggggac gcctgaagga gtaaggtaag tgactctgct 960 gtacgcgggg cgaggcagag gtttccttct aaattgaaag agaagtgttg ctgcgagagg 1020 tcttggtggt cgagaatcct gtacaaaaaa aaggagggat ctcggtcagg accaggaccc 1080 ctgggagtaa tacaacagca acaccgtaag aaaatccgcc taggtgagtc tagatagaga 1140 ctaggcgagg caagtctccg gggggaaaag agattatcct gcaggaattc gattctagag 1200 gtgatagaaa tgccagaaaa ctatgcaaaa acaagaatca taaacaggaa aaaaagagaa 1260 ctcagccaca agaggaagaa gagaggcgtt ggcttggtca ttatgctagt tatcatggca 1320 atagtagctg ccgcaggggc ttctctggga gtcgcaaacg cgattcagca gtcttacact 1380 aaggcagctg tccagaccct tgctaatgca actgctgcac agcaggatgt gttagaggca 1440 acctatgcca tggtacagca tgtggctaaa ggcgtacgaa tcttggaagc tcgagtggct 1500 cgagtggaag ctatcacaga tagaataatg ctataccaag aattggattg ttggcactag 1560 gatccatcag ccaccattaa cgcttacaat ttacgcgcgc gttgacattg attattgact 1620 agttattaat agtaatcaat tacggggtca ttagttcata gcccatatat ggagttccgc 1680 gttacataac ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg 1740 acgtcaataa tgacgtatgt tcccatagta acgccaatag ggactttcca ttgacgtcaa 1800 tgggtggact atttacggta aactgcccac ttggcagtac atcaagtgta tcatatgcca 1860 agtacgcccc ctattgacgt caatgacggt aaatggcccg cctggcatta tgcccagtac 1920 atgaccttat gggactttcc tacttggcag tacatctacg tattagtcat cgctattacc 
1980 atggtgatgc ggttttggca gtacatcaat gggcgtggat agcggtttga ctcacgggga 2040 tttccaagtc tccaccccat tgacgtcaat gggagtttgt tttggcacca aaatcaacgg 2100 gactttccaa aatgtcgtaa caactccgcc ccattgacgc aaatgggcgg taggcgtgta 2160 cggtgggagg tctatataag cagagctctc tggctaacta gagaacccac tgcttactgg 2220 cttatcgaaa ttaatacgac tcactatagg gagacccaag ctgcttacca tggggggttc 2280 tcatcatcat catcatcatg gtatggctag catgactggt ggacagcaaa tgggtcggga 2340 tctgtacgac gatgacgata aggtacctaa ggatcagctt ggagttgatc ccgtcgtttt 2400 acaacgtcgt gactgggaaa accctggcgt tacccaactt aatcgccttg cagcacatcc 2460 ccctttcgcc agctggcgta atagcgaaga ggcccgcacc gatcgccctt cccaacagtt 2520 gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg gcaccagaag cggtgccgga 2580 aagctggctg gagtgcgatc ttcctgaggc cgatactgtc gtcgtcccct caaactggca 2640 gatgcacggt tacgatgcgc ccatctacac caacgtaacc tatcccatta cggtcaatcc 2700 gccgtttgtt cccacggaga atccgacggg ttgttactcg ctcacattta atgttgatga 2760 aagctggcta caggaaggcc agacgcgaat tatttttgat ggcgttaact cggcgtttca 2820 tctgtggtgc aacgggcgct gggtcggtta cggccaggac agtcgtttgc cgtctgaatt 2880 tgacctgagc gcatttttac gcgccggaga aaaccgcctc gcggtgatgg tgctgcgttg 2940 gagtgacggc agttatctgg aagatcagga tatgtggcgg atgagcggca ttttccgtga 3000 cgtctcgttg ctgcataaac cgactacaca aatcagcgat ttccatgttg ccactcgctt 3060 taatgatgat ttcagccgcg ctgtactgga ggctgaagtt cagatgtgcg gcgagttgcg 3120 tgactaccta cgggtaacag tttctttatg gcagggtgaa acgcaggtcg ccagcggcac 3180 cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt tatgccgatc gcgtcacact 3240 acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa atcccgaatc tctatcgtgc 3300 ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa gcagaagcct gcgatgtcgg 3360 tttccgcgag gtgcggattg aaaatggtct gctgctgctg aacggcaagc cgttgctgat 3420 tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt caggtcatgg atgagcagac 3480 gatggtgcag gatatcctgc tgatgaagca gaacaacttt aacgccgtgc gctgttcgca 3540 ttatccgaac catccgctgt ggtacacgct gtgcgaccgc tacggcctgt atgtggtgga 3600 tgaagccaat attgaaaccc acggcatggt gccaatgaat cgtctgaccg atgatccgcg 3660 
ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg cagcgcgatc gtaatcaccc 3720 gagtgtgatc atctggtcgc tggggaatga atcaggccac ggcgctaatc acgacgcgct 3780 gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg cagtatgaag gcggcggagc 3840 cgacaccacg gccaccgata ttatttgccc gatgtacgcg cgcgtggatg aagaccagcc 3900 cttcccggct gtgccgaaat ggtccatcaa aaaatggctt tcgctacctg gagagacgcg 3960 cccgctgatc ctttgcgaat acgcccacgc gatgggtaac agtcttggcg gtttcgctaa 4020 atactggcag gcgtttcgtc agtatccccg tttacagggc ggcttcgtct gggactgggt 4080 ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg tggtcggctt acggcggtga 4140 ttttggcgat acgccgaacg atcgccagtt ctgtatgaac ggtctggtct ttgccgaccg 4200 cacgccgcat ccagcgctga cggaagcaaa acaccagcag cagtttttcc agttccgttt 4260 atccgggcaa accatcgaag tgaccagcga atacctgttc cgtcatagcg ataacgagct 4320 cctgcactgg atggtggcgc tggatggtaa gccgctggca agcggtgaag tgcctctgga 4380 tgtcgctcca caaggtaaac agttgattga actgcctgaa ctaccgcagc cggagagcgc 4440 cgggcaactc tggctcacag tacgcgtagt gcaaccgaac gcgaccgcat ggtcagaagc 4500 cgggcacatc agcgcctggc agcagtggcg tctggcggaa aacctcagtg tgacgctccc 4560 cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa atggattttt gcatcgagct 4620 gggtaataag cgttggcaat ttaaccgcca gtcaggcttt ctttcacaga tgtggattgg 4680 cgataaaaaa caactgctga cgccgctgcg cgatcagttc acccgtgcac cgctggataa 4740 cgacattggc gtaagtgaag cgacccgcat tgaccctaac gcctgggtcg aacgctggaa 4800 ggcggcgggc cattaccagg ccgaagcagc gttgttgcag tgcacggcag atacacttgc 4860 tgatgcggtg ctgattacga ccgctcacgc gtggcagcat caggggaaaa ccttatttat 4920 cagccggaaa acctaccgga ttgatggtag tggtcaaatg gcgattaccg ttgatgttga 4980 agtggcgagc gatacaccgc atccggcgcg gattggcctg aactgccagc tggcgcaggt 5040 agcagagcgg gtaaactggc tcggattagg gccgcaagaa aactatcccg accgccttac 5100 tgccgcctgt tttgaccgct gggatctgcc attgtcagac atgtataccc cgtacgtctt 5160 cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg aattatggcc cacaccagtg 5220 gcgcggcgac ttccagttca acatcagccg ctacagtcaa cagcaactga tggaaaccag 5280 ccatcgccat ctgctgcacg cggaagaagg cacatggctg aatatcgacg gtttccatat 5340 ggggattggt 
ggcgacgact cctggagccc gtcagtatcg gcggaattac agctgagcgc 5400 cggtcgctac cattaccagt tggtctggtg tcaaaaataa taaagccgaa ttctgcagat 5460 atccagcaca gtggcggccg ctagcacaaa aataaaaaaa gaaagggtga ctgtgagaca 5520 tgggctaaag aggagcggcc gctcgagtct agaactagtg gatcagcttt gctgcttgca 5580 cttcagagtt ctaggagagt ccctcctagt ctctcctctc cgaggaggta ccgagacctc 5640 aaaataaagg agtgattgcc ttactgccga gtggagagtg attactgagc ggccggtgta 5700 tcgggagtcg tcccttaatc tgtgcaatac cagagcggct ctcgcagccg acctcgaggg 5760 ggggccctat tctatagtgt cacctaaatg ctagagctcg ctgatcagcc tcgactgtgc 5820 cttctagttg ccagccatct gttgtttgcc cctcccccgt gccttccttg accctggaag 5880 gtgccactcc cactgtcctt tcctaataaa atgaggaaat tgcatcgcat tgtctgagta 5940 ggtgtcattc tattctgggg ggtggggtgg ggcaggacag caagggggag gattgggaag 6000 acaatagcag gcatgctggg gatgcggtgg gctctatggc ttctgaggcg gaaagaacca 6060 gtggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca 6120 aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg 6180 ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg 6240 acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt 6300 ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt 6360 tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc 6420 tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt 6480 gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt 6540 agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc 6600 tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa 6660 agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt 6720 tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct 6780 acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta 6840 tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa 6900 agtatatatg agtaacctga tcaggactct tccttttcat gaacaataaa actgtctgct 6960 tacataaaca gtaatacaag gggtgttatg agccatattc aacgggaaac gtcttgctct 7020 aggccgcgat taaattccaa 
catggatgct gatttatatg ggtataaatg ggctcgcgat 7080 aatgtcgggc aatcaggtgc gacaatctat cgattgtatg ggaagcccga tgcgccagag 7140 ttgtttctga aacatggcaa aggtagcgtt gccaatgatg ttacagatga gatggtcaga 7200 ctaaactggc tgacggaatt tatgcctctt ccgaccatca agcattttat ccgtactcct 7260 gatgatgcat ggttactcac cactgcgatc cccgggaaaa cagcattcca ggtattagaa 7320 gaatatcctg attcaggtga aaatattgtt gatgcgctgg cagtgttcct gcgccggttg 7380 cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc gcgtatttcg tctcgctcag 7440 gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg attttgatga cgagcgtaat 7500 ggctggcctg ttgaacaagt ctggaaagaa atgcataaac ttttgccatt ctcaccggat 7560 tcagtcgtca ctcatggtga tttctcactt gataacctta tttttgacga ggggaaatta 7620 ataggttgta ttgatgttgg acgagtcgga atcgcagacc gataccagga tcttgccatc 7680 ctatggaact gcctcggtga gttttctcct tcattacaga aacggctttt tcaaaaatat 7740 ggtattgata atcctgatat gaataaattg cagtttcatt tgatgctcga tgagtttttc 7800 taagaattcg cgcaattaac cctcactaaa gggaacaaaa gctgggtacc gggccc 7856 <210> 79 <211> 8127 <212> DNA <213> Artificial <220> <223> pCAH / SINd1 / hlacZ <400> 79 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta tcatatgcca agtacgcccc ctattgacgt caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttat gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacatcaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa caactccgcc ccattgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcaa gcttgctgct 600 tgcacttcag agttctagga gagtccctcc tagtctctcc tctccgagga ggtaccgaga 660 cctcaaaata aaggagtgat tgccttactg ccgagtggag agtgattact gagcggccgg 720 tgtatcggga gtcgtccctt aatctgtgca ataccagagc ggctctcgca gctggcgccc 780 aacgtggggc ccgaggagaa gaaaagaaag 
cggccctgag aactcggctt ctgaaaaaga 840 ggaagaggac aagttgctat agcaacaaga gagaagaagt agagcaaagg tccagtggct 900 cggaaaaaga ggaactgaaa cttcggggac gcctgaagga gtaaggtaag tgactctgct 960 gtacgcgggg cgaggcagag gtttccttct aaattgaaag agaagtgttg ctgcgagagg 1020 tcttggtggt cgagaatcct gtacaaaaaa aaggagggat ctcggtcagg accaggaccc 1080 ctgggagtaa tacaacagca acaccgtaag aaaatccgcc taggtgagtc tagatagaga 1140 ctaggcgagg caagtctccg gggggaaaag agattatcct gagctcgaaa aatgtatcaa 1200 gcatgcatgc aagataaaag ttcgactcag aggggagcac ttgacagaag gaaattgttt 1260 atggtgcctt aaaacattag attacatgtt tgaggaccat aaagaggaac cttggacaaa 1320 agtaaaattt aggacaatat ggcagaaggt gaagaatcta actcctgagg agagtaacaa 1380 aaaagacttt atgtctttgc aggccacatt agcgggtcta atgtgttgcc aaatggggat 1440 gagaccgggc tgcaggaatt cgattctaga ggtgatagaa atgccagaaa actatgcaaa 1500 aacaagaatc ataaacagga aaaaaagaga actcagccac aagaggaaga agagaggcgt 1560 tggcttggtc attatgctag ttatcatggc aatagtagct gccgcagggg cttctctggg 1620 agtcgcaaac gcgattcagc agtcttacac taaggcagct gtccagaccc ttgctaatgc 1680 aactgctgca cagcaggatg tgttagaggc aacctatgcc atggtacagc atgtggctaa 1740 aggcgtacga atcttggaag ctcgagtggc tcgagtggaa gctatcacag atagaataat 1800 gctataccaa gaattggatt gttggcacta ggatccatca gccaccatta acgcttacaa 1860 tttacgcgcg cgttgacatt gattattgac tagttattaa tagtaatcaa ttacggggtc 1920 attagttcat agcccatata tggagttccg cgttacataa cttacggtaa atggcccgcc 1980 tggctgaccg cccaacgacc cccgcccatt gacgtcaata atgacgtatg ttcccatagt 2040 aacgccaata gggactttcc attgacgtca atgggtggac tatttacggt aaactgccca 2100 cttggcagta catcaagtgt atcatatgcc aagtacgccc cctattgacg tcaatgacgg 2160 taaatggccc gcctggcatt atgcccagta catgacctta tgggactttc ctacttggca 2220 gtacatctac gtattagtca tcgctattac catggtgatg cggttttggc agtacatcaa 2280 tgggcgtgga tagcggtttg actcacgggg atttccaagt ctccacccca ttgacgtcaa 2340 tgggagtttg ttttggcacc aaaatcaacg ggactttcca aaatgtcgta acaactccgc 2400 cccattgacg caaatgggcg gtaggcgtgt acggtgggag gtctatataa gcagagctct 2460 ctggctaact agagaaccca ctgcttactg gcttatcgaa 
attaatacga ctcactatag 2520 ggagacccaa gctttaagct taccatgggg ggttctcatc atcatcatca tcatggtatg 2580 gcatgactgg tggacagcaa atgggtcggg atctgtacga cgatgacgat aaggtaccta 2640 aggatcagct tggagttgat cccgtcgttt tacaacgtcg tgactgggaa aaccctggcg 2700 ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt aatagcgaag 2760 aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa tggcgctttg 2820 cctggtttcc ggcaccagaa gcggtgccgg aaagctggct ggagtgcgat cttcctgagg 2880 ccgatactgt cgtcgtcccc tcaaactggc agatgcacgg ttacgatgcg cccatctaca 2940 ccaacgtaac ctatcccatt acggtcaatc cgccgtttgt tcccacggag aatccgacgg 3000 gttgttactc gctcacattt aatgttgatg aaagctggct acaggaaggc cagacgcgaa 3060 ttatttttga tggcgttaac tcggcgtttc atctgtggtg caacgggcgc tgggtcggtt 3120 acggccagga cagtcgtttg ccgtctgaat ttgacctgag cgcattttta cgcgccggag 3180 aaaaccgcct cgcggtgatg gtgctgcgtt ggagtgacgg cagttatctg gaagatcagg 3240 atatgtggcg gatgagcggc attttccgtg acgtctcgtt gctgcataaa ccgactacac 3300 aaatcagcga tttccatgtt gccactcgct ttaatgatga tttcagccgc gctgtactgg 3360 aggctgaagt tcagatgtgc ggcgagttgc gtgactacct acgggtaaca gtttctttat 3420 ggcagggtga aacgcaggtc gccagcggca ccgcgccttt cggcggtgaa attatcgatg 3480 agcgtggtgg ttatgccgat cgcgtcacac tacgtctgaa cgtcgaaaac ccgaaactgt 3540 ggagcgccga aatcccgaat ctctatcgtg cggtggttga actgcacacc gccgacggca 3600 cgctgattga agcagaagcc tgcgatgtcg gtttccgcga ggtgcggatt gaaaatggtc 3660 tgctgctgct gaacggcaag ccgttgctga ttcgaggcgt taaccgtcac gagcatcatc 3720 ctctgcatgg tcaggtcatg gatgagcaga cgatggtgca ggatatcctg ctgatgaagc 3780 agaacaactt taacgccgtg cgctgttcgc attatccgaa ccatccgctg tggtacacgc 3840 tgtgcgaccg ctacggcctg tatgtggtgg atgaagccaa tattgaaacc cacggcatgg 3900 tgccaatgaa tcgtctgacc gatgatccgc gctggctacc ggcgatgagc gaacgcgtaa 3960 cgcgaatggt gcagcgcgat cgtaatcacc cgagtgtgat catctggtcg ctggggaatg 4020 aatcaggcca cggcgctaat cacgacgcgc tgtatcgctg gatcaaatct gtcgatcctt 4080 cccgcccggt gcagtatgaa ggcggcggag ccgacaccac ggccaccgat attatttgcc 4140 cgatgtacgc gcgcgtggat gaagaccagc ccttcccggc tgtgccgaaa 
tggtccatca 4200 aaaaatggct ttcgctacct ggagagacgc gcccgctgat cctttgcgaa tacgcccacg 4260 cgatgggtaa cagtcttggc ggtttcgcta aatactggca ggcgtttcgt cagtatcccc 4320 gtttacaggg cggcttcgtc tgggactggg tggatcagtc gctgattaaa tatgatgaaa 4380 acggcaaccc gtggtcggct tacggcggtg attttggcga tacgccgaac gatcgccagt 4440 tctgtatgaa cggtctggtc tttgccgacc gcacgccgca tccagcgctg acggaagcaa 4500 aacaccagca gcagtttttc cagttccgtt tatccgggca aaccatcgaa gtgaccagcg 4560 aatacctgtt ccgtcatagc gataacgagc tcctgcactg gatggtggcg ctggatggta 4620 agccgctggc aagcggtgaa gtgcctctgg atgtcgctcc acaaggtaaa cagttgattg 4680 aactgcctga actaccgcag ccggagagcg ccgggcaact ctggctcaca gtacgcgtag 4740 tgcaaccgaa cgcgaccgca tggtcagaag ccgggcacat cagcgcctgg cagcagtggc 4800 gtctggcgga aaacctcagt gtgacgctcc ccgccgcgtc ccacgccatc ccgcatctga 4860 ccaccagcga aatggatttt tgcatcgagc tgggtaataa gcgttggcaa tttaaccgcc 4920 agtcaggctt tctttcacag atgtggattg gcgataaaaa acaactgctg acgccgctgc 4980 gcgatcagtt cacccgtgca ccgctggata acgacattgg cgtaagtgaa gcgacccgca 5040 ttgaccctaa cgcctgggtc gaacgctgga aggcggcggg ccattaccag gccgaagcag 5100 cgttgttgca gtgcacggca gatacacttg ctgatgcggt gctgattacg accgctcacg 5160 cgtggcagca tcaggggaaa accttattta tcagccggaa aacctaccgg attgatggta 5220 gtggtcaaat ggcgattacc gttgatgttg aagtggcgag cgatacaccg catccggcgc 5280 ggattggcct gaactgccag ctggcgcagg tagcagagcg ggtaaactgg ctcggattag 5340 ggccgcaaga aaactatccc gaccgcctta ctgccgcctg ttttgaccgc tgggatctgc 5400 cattgtcaga catgtatacc ccgtacgtct tcccgagcga aaacggtctg cgctgcggga 5460 cgcgcgaatt gaattatggc ccacaccagt ggcgcggcga cttccagttc aacatcagcc 5520 gctacagtca acagcaactg atggaaacca gccatcgcca tctgctgcac gcggaagaag 5580 gcacatggct gaatatcgac ggtttccata tggggattgg tggcgacgac tcctggagcc 5640 cgtcagtatc ggcggaatta cagctgagcg ccggtcgcta ccattaccag ttggtctggt 5700 gtcaaaaata ataaagccga attctgcaga tatccagcac agtggcggcc gctagcacaa 5760 aaataaaaaa agaaagggtg actgtgagac atgggctaaa gaggagcggc cgctcgagtc 5820 tagaactagt ggatcagctt tgctgcttgc acttcagagt tctaggagag tccctcctag 
5880 tctctcctct ccgaggaggt accgagacct caaaataaag gagtgattgc cttactgccg 5940 agtggagagt gattactgag cggccggtgt atcgggagtc gtcccttaat ctgtgcaata 6000 ccagagcggc tctcgcagcc gacctcgagg gggggcccta ttctatagtg tcacctaaat 6060 gctagagctc gctgatcagc ctcgactgtg ccttctagtt gccagccatc tgttgtttgc 6120 ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct ttcctaataa 6180 aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg gggtggggtg 6240 gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg ggatgcggtg 6300 ggctctatgg cttctgaggc ggaaagaacc agtggcggta atacggttat ccacagaatc 6360 aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa 6420 aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc atcacaaaaa 6480 tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc 6540 ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg gatacctgtc 6600 cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta ggtatctcag 6660 ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga 6720 ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac acgacttatc 6780 gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag gcggtgctac 6840 agagttcttg aagtggtggc ctaactacgg ctacactaga aggacagtat ttggtatctg 6900 cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat ccggcaaaca 6960 aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa 7020 aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt ggaacgaaaa 7080 ctcacgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct agatcctttt 7140 aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaacctg atcaggactc 7200 ttccttttca tgaacaataa aactgtctgc ttacataaac agtaatacaa ggggtgttat 7260 gagccatatt caacgggaaa cgtcttgctc taggccgcga ttaaattcca acatggatgc 7320 tgatttatat gggtataaat gggctcgcga taatgtcggg caatcaggtg cgacaatcta 7380 tcgattgtat gggaagcccg atgcgccaga gttgtttctg aaacatggca aaggtagcgt 7440 tgccaatgat gttacagatg agatggtcag actaaactgg ctgacggaat ttatgcctct 7500 tccgaccatc aagcatttta tccgtactcc tgatgatgca tggttactca ccactgcgat 7560 
ccccgggaaa acagcattcc aggtattaga agaatatcct gattcaggtg aaaatattgt 7620 tgatgcgctg gcagtgttcc tgcgccggtt gcattcgatt cctgtttgta attgtccttt 7680 taacagcgat cgcgtatttc gtctcgctca ggcgcaatca cgaatgaata acggtttggt 7740 tgatgcgagt gattttgatg acgagcgtaa tggctggcct gttgaacaag tctggaaaga 7800 aatgcataaa cttttgccat tctcaccgga ttcagtcgtc actcatggtg atttctcact 7860 tgataacctt atttttgacg aggggaaatt aataggttgt attgatgttg gacgagtcgg 7920 aatcgcagac cgataccagg atcttgccat cctatggaac tgcctcggtg agttttctcc 7980 ttcattacag aaacggcttt ttcaaaaata tggtattgat aatcctgata tgaataaatt 8040 gcagtttcat ttgatgctcg atgagttttt ctaagaattc gcgcaattaa ccctcactaa 8100 agggaacaaa agctgggtac cgggccc 8127  
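The sequence listing above is flattened: each record carries numeric identifiers (`<210>` sequence number, `<211>` declared length, `<400>` start of residues), and the residues are interleaved with running position counts. The following is a minimal sketch, not part of the patent, of one way to recover each sequence and check it against its declared length. The function name `parse_flat_listing` and the demo string are hypothetical, and the sketch assumes lowercase residues limited to `a`, `c`, `g`, `t`, `n`.

```python
import re

# One record: <210> id, <211> declared length, optional fields, <400> id, residues.
# The residue body runs until the next <210> or the end of the text.
RECORD = re.compile(
    r"<210>\s*(\d+)\s*<211>\s*(\d+)(.*?)<400>\s*\d+(.*?)(?=<210>|\Z)",
    re.S,
)

def parse_flat_listing(text):
    """Return {seq_id: (declared_length, sequence)} for a flattened listing.

    Whitespace-separated tokens made only of a/c/g/t/n are kept as residues;
    position counters and any other tokens are dropped (an assumption of this sketch).
    """
    records = {}
    for match in RECORD.finditer(text):
        seq_id = int(match.group(1))
        declared = int(match.group(2))
        tokens = match.group(4).split()
        sequence = "".join(t for t in tokens if re.fullmatch(r"[acgtn]+", t))
        records[seq_id] = (declared, sequence)
    return records

def length_mismatches(records):
    """List the sequence ids whose recovered length differs from <211>."""
    return [sid for sid, (declared, seq) in records.items() if len(seq) != declared]

if __name__ == "__main__":
    # Hypothetical two-record listing in the same flattened layout.
    demo = (
        "<210> 1 <211> 25 <212> DNA <213> Artificial <220> <223> demo "
        "<400> 1 acgtacgtac gtacgtacgt 20 acgta 25 "
        "<210> 2 <211> 12 <212> DNA <400> 2 ttttccccgg gg 12"
    )
    parsed = parse_flat_listing(demo)
    print({sid: len(seq) for sid, (_, seq) in parsed.items()})
    print(length_mismatches(parsed))
```

A record that spans a page or chunk boundary will report a length mismatch, which is a convenient signal that the extraction was truncated rather than that the sequence is wrong.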

Claims (78)

1. A transfer vector comprising:
a) a caprine arthritis encephalitis virus (CAEV) packaging sequence consisting essentially of (i) the untranslated region between the CAEV 5' LTR and the CAEV gag-coding sequence, and (ii) nucleotides 1 to X of the CAEV gag-coding sequence linked to the 3' end of the untranslated region, wherein X is less than 613; and
b) cis-acting elements required for polyadenylation, RNA transport, reverse transcription, and integration, operably associated with the packaging sequence.

2. The transfer vector of claim 1, wherein X is selected from the group consisting of 60, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 525, 550, 575 and 600.
3. The transfer vector of claim 1, wherein X is selected from the group consisting of:
(1) X is greater than 25 and less than 600,
(2) X is greater than 25 and less than 500,
(3) X is greater than 25 and less than 400,
(4) X is greater than 25 and less than 300,
(5) X is greater than 25 and less than 200,
(6) X is greater than 50 and less than 600,
(7) X is greater than 50 and less than 500,
(8) X is greater than 50 and less than 400,
(9) X is greater than 50 and less than 300,
(10) X is greater than 50 and less than 200,
(11) X is greater than 75 and less than 600,
(12) X is greater than 75 and less than 500,
(13) X is greater than 75 and less than 400,
(14) X is greater than 75 and less than 300,
(15) X is greater than 75 and less than 200,
(16) X is greater than 100 and less than 600,
(17) X is greater than 100 and less than 500,
(18) X is greater than 100 and less than 400,
(19) X is greater than 100 and less than 300,
(20) X is greater than 100 and less than 200,
(21) X is greater than 125 and less than 600,
(22) X is greater than 125 and less than 500,
(23) X is greater than 125 and less than 400,
(24) X is greater than 125 and less than 300,
(25) X is greater than 125 and less than 200,
(26) X is greater than 150 and less than 600,
(27) X is greater than 150 and less than 500,
(28) X is greater than 150 and less than 400,
(29) X is greater than 150 and less than 300,
(30) X is greater than 150 and less than 200,
(31) X is greater than 200 and less than 600,
(32) X is greater than 200 and less than 500,
(33) X is greater than 200 and less than 400,
(34) X is greater than 200 and less than 300,
(35) X is greater than 200 and less than 200,
(36) X is greater than 250 and less than 600,
(37) X is greater than 250 and less than 500,
(38) X is greater than 250 and less than 400, and
(39) X is greater than 250 and less than 300.

4. The transfer vector of claim 1, wherein X is greater than 40 and less than 613.

5. The transfer vector of claim 1, wherein X is greater than 57 and less than 613.

6. The transfer vector of claim 1, wherein X is about 327.

7. The transfer vector of claim 1, wherein the start codon of the gag-coding sequence is mutated to prevent translation of the gag protein.

8. The transfer vector of claim 7, wherein the start codon is mutated to TAG.

9. The transfer vector of claim 7, wherein an ATG codon of the gag-coding sequence located x base pairs downstream of the start codon is mutated to prevent translation of the gag protein, wherein x is less than 30.

10. The transfer vector of claim 9, wherein x is about 21.

11. The transfer vector of claim 1, further comprising an RRE region.
The transfer vector of claim 1, further comprising a CAEV 3' LTR from which the U3 region is deleted.
The transfer vector of claim 1, further comprising a heterologous promoter.
The transfer vector of claim 13, wherein the heterologous promoter is the human cytomegalovirus major immediate-early promoter (HCMV MIEP).
The transfer vector of claim 1, wherein the vector has the structure of pCAH/SINd1 shown in FIG. 3c.
The transfer vector of claim 1, further comprising a transcription cassette.
The transfer vector of claim 16, wherein the transcription cassette comprises a polynucleotide of interest operably linked to a heterologous promoter.
A vector system comprising: the transfer vector of claim 1; and a packaging vector system comprising a first polynucleotide comprising a CAEV gag-pol-coding sequence and an RRE, and a second polynucleotide comprising a viral envelope-coding sequence.
The vector system of claim 18, wherein the transfer vector further comprises a transcription cassette.
The vector system of claim 19, wherein the transcription cassette comprises a polynucleotide of interest operably linked to a heterologous promoter.
The vector system of claim 18, wherein the viral envelope-coding sequence is a non-CAEV envelope-coding sequence.
The vector system of claim 21, wherein the non-CAEV envelope-coding sequence is a VSV-G-coding sequence or a GaLV-coding sequence.
The vector system of claim 22, wherein the envelope-coding sequence is a VSV-G-coding sequence.
The vector system of claim 22, wherein the envelope-coding sequence is a GaLV-coding sequence.
The vector system of claim 18, wherein the vector system further comprises a third polynucleotide sequence comprising a rev-coding sequence.
The vector system of claim 18 or 25, wherein the vector system further comprises a fourth polynucleotide sequence comprising a vif-coding sequence.
The vector system of any one of claims 18, 25, or 26, wherein the first polynucleotide further comprises a heterologous regulatory sequence operably linked to the CAEV gag-pol-coding sequence.
The vector system of any one of claims 18, 21, or 22, wherein the second polynucleotide further comprises a heterologous regulatory sequence operably linked to the viral envelope-coding sequence.
The vector system of claim 18, wherein the first polynucleotide further comprises a heterologous regulatory sequence operably linked to the CAEV gag-pol-coding sequence, and the second polynucleotide further comprises a heterologous regulatory sequence operably linked to the viral envelope-coding sequence.
The vector system of claim 25, wherein the third polynucleotide further comprises a heterologous regulatory sequence operably linked to the rev-coding sequence.
The vector system of claim 26, wherein the fourth polynucleotide further comprises a heterologous regulatory sequence operably linked to the vif-coding sequence.
The vector system of claim 27, 28, 29, 30, or 31, wherein the heterologous regulatory sequence is a promoter.
The vector system of claim 18, wherein the packaging vector system does not have a competent CAEV packaging sequence.
The vector system of claim 18, wherein the packaging vector system lacks the 5' terminal region between the splice donor site of the CAEV genome and the gag start codon.
The vector system of claim 18, comprising a first vector comprising the first polynucleotide and a second vector comprising the second polynucleotide.
The vector system of claim 25, comprising a first vector comprising the first polynucleotide, a second vector comprising the second polynucleotide, and a third vector comprising the third polynucleotide.
The vector system of claim 26, comprising a first vector comprising the first polynucleotide, a second vector comprising the second polynucleotide, a third vector comprising the third polynucleotide, and a fourth vector comprising the fourth polynucleotide.
The vector system of claim 25, comprising a first vector comprising the first polynucleotide and the third polynucleotide, and a second vector comprising the second polynucleotide.
The vector system of claim 26, comprising a first vector comprising the first polynucleotide and the fourth polynucleotide, and a second vector comprising the second polynucleotide.
The vector system of claim 32, wherein the CAEV gag-pol-coding sequence is operably linked to an MCMV MIEP promoter.
The vector system of claim 40, wherein the first vector has the structure of pMGP/RRE shown in FIG. 2a.
The vector system of claim 35, wherein the viral envelope-coding sequence is a VSV-G-coding sequence operably linked to an HCMV MIEP, and wherein the second vector further comprises a beta-globin intron.
The vector system of claim 42, wherein the second vector has the structure of pHGVSV-G shown in FIG. 6a.
The vector system of claim 35, wherein the viral envelope-coding sequence is a GaLV-coding sequence operably linked to an MCMV MIEP, and wherein the second vector further comprises a eukaryotic elongation factor-1 alpha intron.
The vector system of claim 44, wherein the second vector has the structure of pMYKEF-1/env shown in FIG. 6b.
The vector system of claim 36, wherein the third vector has the structure of pHYK/rev shown in FIG. 5.
The vector system of claim 37, wherein the fourth vector has the structure of pHYK/vif shown in FIG. 4.
A method for producing CAEV-based vector particles, comprising: 1) transfecting cells with the vector system of claim 20; 2) culturing the cells under conditions that permit production of CAEV-based lentiviral vector particles, wherein the vector particles are infectious, transduction competent, and replication defective; and 3) recovering the vector particles.
Vector particles produced by the method of claim 48.
A composition comprising the vector particle of claim 49 and a carrier.
A kit comprising the vector system of any one of claims 18 to 47.
A method of delivering a polypeptide to a mammalian cell, comprising contacting the mammalian cell with the vector particle of claim 49.
The method of claim 52, wherein the cell is isolated from a mammal before the cell is contacted with the vector particle.
The method of claim 52, wherein the mammalian cell is a dividing cell.
The method of claim 52, wherein the mammalian cell is a non-dividing cell.
The method of claim 52, wherein the mammalian cell is a CD34+ stem cell.
A method comprising administering the vector particle of claim 49 to a vertebrate.
A kit comprising the transfer vector of any one of claims 1 to 17.
The vector system of any one of claims 18 to 47, further comprising a cell comprising the first polynucleotide.
The vector system of any one of claims 28, 30, 32, 36, or 38, further comprising a cell comprising the first and third polynucleotides.
The vector system of any one of claims 26 to 28, 31 to 32, 37, 39, 41, 43, or 45, further comprising a cell comprising the first and fourth polynucleotides.
The vector system of any one of claims 26 to 28, 31 to 32, 37, 39, 41, 43, or 45, further comprising a cell comprising the first, third, and fourth polynucleotides.
The vector system of any one of claims 18 to 24, 26, 29, 33 to 35, 40, 42, 44, or 46, further comprising a cell comprising the first and second polynucleotides.
The vector system of any one of claims 25 to 28, 30, 32, 36, or 38, further comprising a cell comprising the first, second, and third polynucleotides.
The vector system of any one of claims 26 to 28, 31 to 32, 37, 39, 41, 43, or 45, further comprising a cell comprising the first, second, and fourth polynucleotides.
The vector system of any one of claims 26 to 28, 31 to 32, 37, 39, 41, 43, or 45, further comprising a cell comprising the first, second, third, and fourth polynucleotides.
The vector system of any one of claims 35 to 47, further comprising a cell comprising the first and second vectors.
The vector system of any one of claims 36 to 46, further comprising a cell comprising the first, second, and third vectors.
The vector system of any one of claims 36 to 47, further comprising a cell comprising the first, second, third, and fourth vectors.
The vector system of any one of claims 35 to 47, further comprising a cell comprising the first vector.
A method for producing CAEV-based vector particles, comprising: 1) preparing a cell comprising a CAEV gag-pol-coding sequence and an RRE; 2) transfecting the cell with the transfer vector of claim 17; 3) culturing the cell under conditions that permit production of CAEV-based lentiviral vector particles, wherein the vector particles are infectious, transduction competent, and replication defective; and 4) recovering the vector particles.
A vector comprising a CAEV packaging sequence consisting essentially of: 1) the untranslated region between the CAEV 5' LTR and the CAEV gag-coding sequence; and 2) nucleotides 1 to X of the CAEV gag-coding sequence, linked to the 3' end of the untranslated region, wherein X is less than 613.
The transfer vector of claim 15, wherein the vector has at least 70% homology with SEQ ID NO: 68.
The vector system of claim 41, wherein the first vector has at least 70% homology with SEQ ID NO: 77.
The vector system of claim 43, wherein the second vector has at least 70% homology with SEQ ID NO: 74.
The vector system of claim 45, wherein the second vector has at least 70% homology with SEQ ID NO: 72.
The vector system of claim 46, wherein the third vector has at least 70% homology with SEQ ID NO: 75.
The vector system of claim 47, wherein the fourth vector has at least 70% homology with SEQ ID NO: 76.
KR1020077007725A 2007-04-04 2004-09-07 Caev-based vector systems KR100884214B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020077007725A KR100884214B1 (en) 2007-04-04 2004-09-07 Caev-based vector systems

Publications (2)

Publication Number Publication Date
KR20070053798A (en) 2007-05-25
KR100884214B1 (en) 2009-02-19

Family

ID=38275843

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020077007725A KR100884214B1 (en) 2007-04-04 2004-09-07 Caev-based vector systems

Country Status (1)

Country Link
KR (1) KR100884214B1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2019503179A (en) * 2016-01-27 2019-02-07 ジャスト バイオセラピューティックス,インコーポレイテッド Hybrid promoter and use thereof
US11028410B2 (en) 2016-01-27 2021-06-08 Just-Evotec Biologics, Inc. Hybrid promoter and uses thereof
US11098310B2 (en) 2016-01-27 2021-08-24 Just-Evotec Biologics, Inc. Expression from transposon-based vectors and uses
US11261462B2 (en) 2016-01-27 2022-03-01 Just-Evotec Biologics, Inc. Inducible expression from transposon-based vectors and uses
US11685933B2 (en) 2016-01-27 2023-06-27 Just-Evotec Biologics, Inc. Inducible expression from transposon-based vectors and uses
US11692193B2 (en) 2016-01-27 2023-07-04 Just-Evotec Biologies, Inc. Expression from transposon-based vectors and uses

Also Published As

Publication number Publication date
KR100884214B1 (en) 2009-02-19

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
E701 Decision to grant or registration of patent right
GRNT Written decision to grant
FPAY Annual fee payment (Payment date: 20130208; Year of fee payment: 5)
FPAY Annual fee payment (Payment date: 20140210; Year of fee payment: 6)
FPAY Annual fee payment (Payment date: 20150211; Year of fee payment: 7)
FPAY Annual fee payment (Payment date: 20160205; Year of fee payment: 8)
FPAY Annual fee payment (Payment date: 20170210; Year of fee payment: 9)
FPAY Annual fee payment (Payment date: 20180209; Year of fee payment: 10)
FPAY Annual fee payment (Payment date: 20190207; Year of fee payment: 11)