KR20150108945A - 유인원 e 아데노바이러스 sadv-39, -25.2, -26, -30, -37, 및 -38 - Google Patents
유인원 e 아데노바이러스 sadv-39, -25.2, -26, -30, -37, 및 -38 Download PDFInfo
- Publication number
- KR20150108945A KR20150108945A KR1020157025105A KR20157025105A KR20150108945A KR 20150108945 A KR20150108945 A KR 20150108945A KR 1020157025105 A KR1020157025105 A KR 1020157025105A KR 20157025105 A KR20157025105 A KR 20157025105A KR 20150108945 A KR20150108945 A KR 20150108945A
- Authority
- KR
- South Korea
- Prior art keywords
- leu
- ala
- arg
- ser
- pro
- Prior art date
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
- C12N15/861—Adenoviral vectors
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P37/00—Drugs for immunological or allergic disorders
- A61P37/02—Immunomodulators
- A61P37/04—Immunostimulants
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
- C07K14/01—DNA viruses
- C07K14/075—Adenoviridae
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N7/00—Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/525—Virus
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/525—Virus
- A61K2039/5256—Virus expressing foreign proteins
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10021—Viruses as such, e.g. new isolates, mutants or their genomic sequences
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10321—Viruses as such, e.g. new isolates, mutants or their genomic sequences
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10322—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10341—Use of virus, viral particle or viral elements as a vector
- C12N2710/10343—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10361—Methods of inactivation or attenuation
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/16011—Orthomyxoviridae
- C12N2760/16111—Influenzavirus A, i.e. influenza A virus
- C12N2760/16122—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Virology (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Microbiology (AREA)
- Immunology (AREA)
- Gastroenterology & Hepatology (AREA)
- Physics & Mathematics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Plant Pathology (AREA)
- Veterinary Medicine (AREA)
- Pharmacology & Pharmacy (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Epidemiology (AREA)
- Oncology (AREA)
- Communicable Diseases (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicines Containing Material From Animals Or Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
재조합 벡터는 조절 서열의 제어 하에서 유인원 아데노바이러스 SAdV-39, -25.2, -26, -30, -37, 및 -38 서열 및 이종성 유전자를 포함한다. 유인원 아데노바이러스 SAdV-39, -25.2, -26, -30, -37, 및 -383 유전자(들)를 발현시키는 셀 라인이 또한 개시된다. 벡터 및 셀 라인을 사용하는 방법이 제공된다.
Description
CD로 제출된 자료의 참고로써 포함
출원인은 본원에서 제공되는 CD의 서열목록 자료를 참고로써 포함한다. 이 CD는 2개로 공급되며 단지 컴퓨터로 판독가능한 형태로 "서열목록"을 함유한다. 이들 디스크를 각각 "카피 1" 및 "카피 2"의 라벨을 붙인다. 이들 디스크의 파일을 "UPN-U4623 PCT sequence listing.txt"의 라벨을 붙인다.
아데노바이러스는 약 36 킬로베이스(kb)의 게놈 크기를 가지는 이중-나선 DNA 바이러스이며, 이는 다양한 표적 조직에서 고성능 유전자 전이 및 거대 이식 유전자 수용력을 달성하는 그것의 능력에 기인하여 유전자 전달 용도를 위해 널리 사용되었다. 전통적으로 아데노바이러스의 E1 유전자는 결실되고, 선택 프로모터, 관심 유전자의 cDNA 서열 및 폴리 A 시그널로 구성되는 이식 유전자 카세트로 대체되는데, 이는 복제 결함 재조합 바이러스를 초래한다.
아데노바이러스는 3개의 주요 단백질, 헥손(II), 펜톤 염기(III) 및 혹모양 섬유(IV)와 다수의 다른 부수적 단백질, VI, VIII, IX, IIIa 및 IVa2로 구성되는 다면체형의 캡시드를 가지는 특징적인 형태를 가진다[W.C. Russell, J. Gen Virol., 81 :2573-2604 (2000년 11월)]. 바이러스 게놈은 역위 말단 반복(ITR)을 가지는 5' 말단에 공유적으로 부착되는 말단 단백질을 가지는 선형의, 이중-나선 DNA이다. 바이러스 DNA는 고염기성 단백질 VII 및 소펩티드 pX(이전에 뮤로 언급됨)와 상세하게 관련된다. 다른 단백질 V는 이 DNA-단백질 복합체와 함께 패키징되고 단백질 VI를 통해 캡시드에 구조적 연결을 제공한다. 바이러스는 또한 성숙한 감염 바이러스를 만들기 위해 일부 구조적 단백질을 처리하는데 필요한 바이러스-암호화된 프로테아제를 함유한다.
분류체계는 인간, 유인원, 소, 말, 돼지, 양, 개 및 주머니쥐 아데노바이러스를 포함하는 포유류아데노바이러스 과를 위해 개발되었다. 이 분류체계는 적혈구를 교착시키기 위해 과 내의 아데노바이러스 서열의 다른 능력에 기초하여 개발되었다. 결과는 현재 아군 A, B, C, D, E 및 F로서 언급되는 6개의 아군이었다. B.N Fields et al, (Lippincott Raven Publishers, Philadelphia, 1996)에 의해 편집된 FIELD'S VIROLOGY, 6th Ed.의 T. Shenk et al, Adenovihdae : The Viruses and their Replication", Ch. 67, p. 111-2112 참조.
재조합 아데노바이러스는 숙주 세포에 이종성 분자의 전달에 대해 설명되었다. 두 침팬지 아데노바이러스의 게놈을 설명하는 미국 특허 6,083,716 참조. 유인원 아데노바이러스, C5, C6 및 C7은 백신 벡터로서 유용한 미국 특허 7,247,472호에서 설명되었다. 다른 침팬지 아데노바이러스는 아데노바이러스 백신 담체를 제조하는데 유용한 WO 2005/1071093에서 설명되었다.
당업계에서 필요로 되는 것은 표적에 분자를 효과적으로 전달하고 모집단에서 선택된 아데노바이러스 항원형에 기존 면역의 효과를 최소화하는 벡터이다.
아과 E 내의 5개의 신규한 유인원 아데노바이러스의 분리된 핵산 서열 및 아미노산 서열 및 이들 서열을 함유하는 벡터가 본원에서 제공된다. 또한 본 발명의 벡터 및 세포를 사용하는 다수의 방법이 제공된다. 이들 아데노바이러스는 SAdV-39, SAdV-25.2, SAdV-26, SAdV-30, SAdV-37 및 SAdV-38를 포함한다.
본원에 기술되는 방법은 본 발명의 벡터를 투여함으로써 포유동물 환자에 하나 이상의 선택된 이종성 유전자(들)을 전달하는 단계를 수반한다. 백신접종을 위해 본원에서 기술되는 조성물의 사용은 보호성 면역 반응의 유발을 위한 선택된 항원의 제시를 허용한다. 이들 유인원 아데노바이러스에 기초한 벡터는 또한 시험관내 이종성 유전자 생성물을 만들기 위해 사용될 수 있다. 이러한 유전자 생성물은 그 자체가 본원에 기술되는 것과 같은 다양한 목적을 위해 유용하다.
본 발명의 이들 및 다른 구체예 및 이점은 하기에서 더욱 상세하게 기술된다.
모두 침팬지 배설물로부터 분리된 유인원 아데노바이러스 39, SAdV-25.2, SAdV-26, SAdV-30, SAdV-37 및 SAdV-38로부터의 신규 핵산 및 아미노산 서열이 제공된다.
또한 재조합 단백질 또는 단편 또는 다른 시약의 시험관 내 생성에서 사용을 위한 이들 벡터를 생성하기 위한 신규 아데노바이러스 벡터 및 팩키징 셀 라인이 제공된다. 더 나아가 치료적 또는 백신 목적을 위한 이종성 분자를 전달하는데 사용을 위한 조성물이 제공된다. 이러한 치료적 또는 백신 조성물은 삽입된 이종성 분자를 전달하는 아데노바이러스 벡터를 함유한다. 게다가, 신규의 SAdV 서열은 재조합 아데노-관련 바이러스(AAV) 벡터의 생성을 위해 필요로 되는 필수적인 헬퍼 기능을 제공하는데 유용하다. 따라서, 이러한 생성 방법에서 이들 서열을 사용하는 헬퍼 구조체, 방법 및 셀 라인이 제공된다.
핵산 또는 그것의 단편을 말할 때, 용어 "실질적인 상동성" 또는 "실질적인 유사성"은, 다른 핵산(또는 그것의 상보적 가닥)과 함께 적절한 뉴클레오티드 삽입 또는 결실에 의해 최상으로 배열될 때, 배열된 서열의 적어도 약 95 내지 99%로 뉴클레오티드 서열 동일성이 있음을 나타낸다.
아미노산 또는 그것의 단편을 말할 때, 용어 "실질적인 상동성" 또는 "실질적인 유사성"은, 다른 아미노산(또는 그것의 상보적 가닥)과 함께 적절한 아미노산 삽입 또는 결실에 의해 최상으로 배열될 때, 배열된 서열의 적어도 약 95 내지 99%로 아미노산 서열 동일성이 있음을 나타낸다. 바람직하게는, 상동성은 길이에 있어서 적어도 8개의 아미노산, 또는 더 바람직하게는 적어도 15개의 아미노산인 전장 서열, 또는 그것의 단백질, 또는 그것의 단편에 있다. 적절한 단편의 예는 본원에서 기술된다.
핵산 서열에 있어서 용어 "백분율 서열 동일성" 또는 "동일한"은 최대 대응에 대해 배열될 때 동일한 두 서열의 잔기를 말한다. 한 서열과 다른 서열을 배열하는데 갭이 필요로 될 때, 스코어링의 정도는 갭에 대한 불이익 없이 더 긴 서열에 대해 계산된다. 폴리뉴클레오티드 또는 암호화된 폴리펩티드의 기능성을 보존하는 서열은 이에 의해 더욱 밀접하게 동일하다. 서열 길이 동일성 비교는 게놈의 전장(예를 들어, 약 36 kbp), 유전자, 단백질, 서브유닛, 또는 효소의 오픈리딩프레임의 전장[예를 들어, 아데노바이러스 코딩 영역을 제공하는 표]에 걸쳐 있을 수 있고, 또는 적어도 약 500 내지 5000개의 뉴클레오티드의 단편이 요망된다. 그러나, 예를 들어, 적어도 약 9개의 뉴클레오티드, 보통 적어도 약 20 내지 24개의 뉴클레오티드, 적어도 약 28 내지 32개의 뉴클레오티드, 적어도 약 36개 또는 그 이상의 뉴클레오티드를 가지는 더 작은 단편들 사이의 동일성이 또한 요망될 수 있다. 유사하게, "백분율 서열 동일성"은 단백질, 또는 그것의 단편의 전장에 걸쳐서 아미노산 서열에 대해 용이하게 결정될 수 있다. 적절하게, 단편은 길이에 있어서 적어도 8개의 아미노산이며, 약 700개까지의 아미노산이 있을 수 있다. 적절한 단편의 예는 본원에서 기술된다.
동일성은 디폴트 세팅에서 정의되는 바와 같은 이러한 알고리즘 및 컴퓨터 프로그램을 사용하여 용이하게 결정된다. 바람직하게는, 이러한 동일성은 단백질, 효소, 서브유닛의 전장에 걸쳐, 또는 길이에 있어서 적어도 약 8개의 단편에 걸쳐서 있다. 그러나, 동일성은 더 짧은 영역에 기초할 수 있으며, 동일성 유전자 생성물이 배치되는 사용에 적합하다.
본원에서 기술되는 바와 같은, 배열은 인터넷의 웹 서버를 통해 접근가능한 "Clustal W"와 같은 다양한 일반 공중에게 또는 상업적으로 이용가능한 Multiple Sequence Alignment 프로그램을 사용하여 수행된다. 또 다르게는, 벡터 NTI® 유틸리티[InVitrogen]가 또한 사용된다. 상기 기술된 프로그램에 함유된 것들을 포함하는 뉴클레오티드 서열 동일성을 측정하는데 사용될 수 있는 당업계에 공지된 다수의 알고리즘이 있다. 다른 예에서, 폴리뉴클레오티드 서열은 Fasta, GCG Version 6.1의 프로그램을 사용하여 비교될 수 있다. Fasta는 질의와 검색 서열 사이의 최상의 중첩 영역의 배열 및 백분율 서열 동일성을 제공한다. 예를 들어, 핵산 서열 사이의 백분율 서열 동일성은 참고로써 본원에 포함되는 GCG Version 6.1에서 제공되는 바와 같은 Fasta와 그것의 디폴트 매개변수(워드 크기 6 및 스코어링 매트릭스에 대한 NOPAM 인자)를 사용하여 결정될 수 있다. 유사하게 프로그램은 아미노산 배열을 수행하기 위해 이용가능하다. 일반적으로, 당업자가 필요하다면 이들 세팅을 변경할 수 있지만, 이들 프로그램은 디폴트 세팅에서 사용된다. 또 다르게는, 당업자는 기준 알고리즘 및 프로그램에 의해 제공되는 동일성 또는 배열의 최소한의 수준을 제공하는 다른 알고리즘 또는 컴퓨터 프로그램을 이용할 수 있다.
폴리뉴클레오티드에 사용되는 "재조합"은, 폴리뉴클레오티드가 클로닝, 제한 또는 연결 단계, 및 천연에서 발견되는 폴리뉴클레오티드와 별개인 구조체를 초래하는 다른 과정의 다양한 조합의 생성물이라는 것을 의미한다. 재조합 바이러스는 재조합 폴리뉴클레오티드를 포함하는 바이러스 입자이다. 용어는 각각 본래의 폴리뉴클레오티드 구조체의 복제물 및 본래의 바이러스 구조체의 자손을 포함한다.
"이종성"은 비교되는 독립체의 나머지로부터 유전자형으로 완전한 독립체에서 유래됨을 의미한다. 예를 들어, 플라스미드에 유전공학 기술에 의해 도입된 폴리뉴클레오티드 또는 다른 종으로부터 유래된 벡터는 이종성 폴리뉴클레오티드이다. 원래의 코딩 서열로부터 제거되고 천연에서는 연결된 것으로 발견되지 않는 코딩 서열에 작동가능하게 연결된 프로모터는 이종성 프로모터이다. 바이러스 또는 바이러스 벡터의 게놈으로 클로닝된 자리-특이적 재조합 자리(바이러스의 게놈은 천연에서는 그것을 함유하지 않는다)는 이종성 재조합 자리이다. 재조합 효소에 대한 서열을 암호화하는 폴리뉴클레오티드가 재조합효소를 정상적으로 발현하지 않는 세포를 유전적으로 변경하기 위해 사용될 때, 폴리뉴클레오티드와 재조합효소는 둘 다 세포에 이종성이다.
본 명세서 및 청구항을 통해 사용되는, 용어 "포함하다" 및 그것의 변형인 "포함하는"은 다른 성분, 요소, 완전체, 단계 등에 포괄적이다. 용어 "구성된다" 또는 "구성되는"은 다른 성분, 구성요소, 정수, 단계 등에 배타적이다.
I. 유인원 아데노바이러스 서열
본 발명은 각각이 천연에서 연관된 다른 물질로부터 분리된 유인원 아데노바이러스 39 (SAdV-39), SAdV-25.2, -26, -30, -37 또는 -38의 핵산 서열 및 아미노산 서열을 제공한다.
A. 핵산 서열
본원에서 제공되는 SAdV-39 핵산 서열은 SEQ 1D NO: 1의 뉴클레오티드 1 내지 36553을 포함한다. 본원에서 제공되는 SAdV-25.2 핵산 서열은 SEQ ID NO: 130의 뉴클레오티드 1 내지 36629를 포함한다. 본원에 제공되는 SAdV-26 핵산 서열은 SEQ ID NO: 162의 뉴클레오티드 1 내지 36628을 포함한다. SAdV-30 핵산 서열은 SEQ ID NO: 98의 뉴클레오티드 1 내지 36621을 포함한다. 본원에 제공되는 SAdV-37 핵산 서열은 SEQ ID NO: 33의 뉴클레오티드 1 내지 36634를 포함한다. 본원에 제공되는 SAdV-38 핵산 서열은 SEQ ID NO: 65의 1 내지 36494을 포함한다.
본원에 참고로써 포함되는 서열목록을 참조. 한 구체예에서, 본 발명의 핵산 서열은 각각 SEQ ID NO: 1, 130, 162, 98, 130, 또는 65의 서열에 상보적인 가닥뿐만 아니라 하기 서열의 서열의 대응하는 RNA 및 cDNA 서열 및 그것의 상보적 가닥을 더 포함한다. 다른 구체예에서, 핵산 서열은 서열목록과 98.5% 이상 동일한, 바람직하게는 약 99% 동일한 서열을 더 포함한다. 또한 한 구체예에서, SEQ ID NO: 1, 130, 162, 98, 130, 또는 65 및 그것의 상보적 가닥에서 제공된 서열의 천연 변이체 및 공학적 변형이 포함된다. 이러한 변형은, 예를 들어, 당업계에 알려진 표지, 메틸화, 및 하나 이상의 자연적으로 발생하는 뉴클레오티드의 축퇴 뉴클레오티드로의 치환을 포함한다.
[표 1]
한 구체예에서, SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38의 서열의 단편, 및 그것의 상보적 가닥, 그것에 상보적인 cDNA 및 RNA가 제공된다. 적당한 단편은 길이에 있어 적어도 15개의 뉴클레오티드이며, 기능적 단편, 즉, 생물학적 관심이 있는 단편을 포함한다. 예를 들어, 기능적 단편은 요망되는 아데노바이러스 생성물을 발현시킬 수 있고 또는 재조합 바이러스 벡터의 생성에 유용할 수 있다. 이러한 단편은 유전자 서열 및 본원의 표에 열거되는 단편을 포함한다. 표는 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 서열의 전사체 영역 및 오픈리딩 프레임을 제공한다. 특정 유전자에 대해, 전사체 및 오픈리딩프레임(ORF)은 SEQ ID NO: 1, 130, 162, 98, 130, 또는 65에서 존재하는 상보적인 가닥에 위치된다. 예를 들어, E2b, E4 및 E2a 참조. 암호화된 단백질의 계산된 분자량이 또한 나타난다. SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38의 E1a 오픈리딩프레임 및 E2b 오픈리딩프레임은 내부 스플라이스 자리를 함유한다는 것에 주의한다. 이들 스플라이스 자리는 상기 표에서 기록된다.
SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 아데노바이러스 핵산 서열은 치료제로서 및 다양한 벡터 시스템 및 숙주 세포의 구성에서 유용하다. 본원에서 사용되는, 벡터는 네이키드 DNA, 플라스미드, 바이러스, 코스미드 또는 에피솜을 포함하는 어떤 적당한 핵산 분자를 포함한다. 이들 서열 및 생성물은 단독으로 또는 다른 아데노바이러스 서열 또는 분획과 조합하여, 또는 다른 아데노바이러스 또는 비-아데노바이러스 서열로부터의 요소와 조합하여 사용될 수 있다. SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 서열은 안티센스 전달 벡터, 유전자 치료 벡터, 또는 백신 벡터로서 또한 유용하다. 따라서, 추가로 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 서열을 함유하는 핵산 분자, 유전자 전달 벡터, 및 숙주 세포가 제공된다.
예를 들어, 본 발명은 본 발명의 유인원 Ad ITR 서열을 함유하는 핵산 분자를 포함한다. 다른 예에서, 본 발명은 원하는 Ad 유전자 생성물을 암호화하는 본 발명의 유인원 Ad 서열을 함유하는 핵산 분자를 제공한다. 본 발명의 서열을 사용하여 구성되는 또 다른 핵산 분자는 본원에 제공되는 정보의 관점에서 당업자에게 용이하게 명백할 것이다.
한 구체예에서, 본원에서 확인되는 유인원 Ad 유전자 영역은 세포에 이종성 분자의 전달을 위한 다양한 벡터에서 사용될 수 있다. 예를 들어, 벡터는 패키징 숙주 세포에서 바이러스 벡터를 발생시키는 목적을 위해 아데노바이러스 캡시드 단백질(또는 그것의 단편)의 발현에 대해 발생된다. 이러한 벡터는 트랜스 발현을 위해 설계될 수 있다. 또 다르게는, 이러한 벡터는 원하는 아데노바이러스 기능을 발현시키는 서열, 예를 들어, 하나 이상의 E1a, E1b, 말단 반복 서열, E2a, E2b, E4, E4ORF6 영역을 안정하게 함유하는 세포를 제공하기 위해 설계된다.
게다가, 아데노바이러스 유전자 서열 및 그것의 단편은 헬퍼-의존 바이러스(예를 들어, 필수 기능이 결핍된 아데노바이러스 벡터, 또는 아데노-관련 바이러스(AAV))의 생성에 필요한 헬퍼 기능을 제공하는데 유용하다. 이러한 생성 방법에 대해, SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38서열은 인간 Ad에 기술된 것과 유사한 방법인 그러한 방법으로 이용될 수 있다. 그러나, SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 사이의 서열, 서열과 인간 Ad의 그것들의 차이점 때문에, SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 서열의 사용은 rAAV 생성 동안 감염성 아데노바이러스 오염물질을 생성할 수 있는 인간 Ad E1 기능을 전달하는 숙주 세포, 예를 들어, 293 세포에서 헬퍼 기능을 가지는 상동 재조합의 가능성을 크게 최소화하거나 제거한다.
아데노바이러스 헬퍼 기능을 사용하는 rAAV를 생성하는 방법은 인간 아데노바이러스 항원형과 함께 문헌에서 길이로 기술되었다. 예를 들어, 미국 특허 6,258,595 및 그것에 인용된 참고문헌을 참조. 또한, 미국 특허 5,871,982; WO 99/14354; WO 99/15685; WO 99/47691 참조. 이들 방법은 또한 비-인간 영장류 AAV 항원형을 포함하는 비-인간 항원형 AAV의 생성에 사용될 수 있다. 필요한 헬퍼 기능(예를 들어, E1a, E1b, E2a 및/또는 E4ORF6)을 제공하는 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 서열은 필요한 아데노바이러스 기능을 제공하는데 특히 유용할 수 있는 한편, 어떤 다른 아데노바이러스와 재조합의 가능성을 최소화 또는 제거하는 것은 전형적으로 인간 기원의 rAAV-패키징 세포에서 존재한다. 따라서, SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38의 선택된 유전자 또는 오픈리딩프레임은 이들 rAAV 생성 방법에 사용될 수 있다.
또 다르게는, SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38의 서열을 기초로 한 재조합 벡터는 이들 방법에 사용될 수 있다. 이러한 재조합 아데노바이러스 유인원 벡터는, 그것의 발현을 제어하는 조절 서열의 제어하에서 예를 들어, 침팬지 Ad 서열이 예를 들어, AAV 3' 및/또는 5' ITRs 및 이식 유전자로 구성되는 rAAV 발현 카세트 옆에 배치되는 하이브리드 침팬지 Ad/AAV를 포함할 수 있다. 당업자는 또 다른 유인원 아데노바이러스 벡터 및/또는 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 유전자 서열이 아데노바이러스 헬퍼에 의존하여 rAAV 및 다른 바이러스의 생성에 유용할 것임을 인식할 것이다.
또 다른 구체예에서, 핵산 분자는 숙주 세포에서 선택된 아데노바이러스 유전자 생성물의 전달 및 발현을 위해 설계되어 원하는 생리적 효과를 이룬다. 예를 들어, SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 E1a 단백질을 암호화하는 서열을 함유하는 핵산 분자는 암 치료제로서 사용을 위해 피험자에게 전달될 수 있다. 선택적으로, 이러한 분자는 지질-계 담체에서 제형화되고, 바람직하게는 암세포를 표적화한다. 이러한 제형은 다른 암 치료제(예를 들어, 시스플라틴, 탁솔 등)와 조합될 수 있다. 본원에 제공되는 아데노바이러스 서열에 대한 또 다른 사용은 당업자에게 용이하게 명백할 것이다.
게다가, 당업자는 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 서열이 치료 및 면역 분자의 시험관 내, 생체 밖 또는 생체 내 전달을 위해 다양한 바이러스 및 비-바이러스 벡터 시스템에 대한 사용에 용이하게 적용될 수 있다는 것을 용이하게 이해할 것이다. 예를 들어, SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 유인원 Ad 서열은 다양한 rAd 및 비-rAd 벡터 시스템에 이용될 수 있다. 이러한 벡터 시스템은, 예를 들어, 플라스미드, 렌티바이러스, 레트로바이러스, 수두바이러스, 우두 바이러스, 및 특히 아데노-연관 바이러스 시스템을 포함할 수 있다. 이러한 벡터 시스템의 선택은 본 발명의 제한이 아니다.
본 발명은 추가로 본 발명의 유인원 및 유인원-유래 단백질의 생성에 유용한 분자를 제공한다. 본 발명의 유인원 Ad DNA 서열을 포함하는 폴리뉴클레오티드를 전달하는 이러한 분자는 네이키드 DNA, 플라스미드, 바이러스 또는 다른 유전적 구성요소의 형태일 수 있다.
B. SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 아데노바이러스 단백질
본원에 기술되는 아데노바이러스 핵산에 의해 암호화되는 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 아데노바이러스의 유전자 생성물, 예컨대, 단백질, 효소 및 그것의 단편이 제공된다. 더 나아가, 다른 방법에 의해 발생되는 이들 핵산 서열에 의해 암호화된 아미노산 서열을 가지는 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 단백질, 효소, 및 그것의 단편이 포함된다. 이러한 단백질은 상기 표에서 확인되는 오픈리딩프레임에 의해 암호화되는 것, 서열목록에서 제공되는 SEQ ID NO에 대한 하기 표에서 나타내는 단백질, 및 단백질 및 폴리펩티드의 단편을 포함한다.
따라서, 한 양태에서, 실질적으로 순수한, 즉, 다른 바이러스 및 단백질성 단백질이 없는 독특한 유인원 아데노바이러스 단백질이 제공된다. 바람직하게는, 이들 단백질은 적어도 10% 상동성, 더 바람직하게는 60% 상동성, 및 가장 바람직하게는 95% 상동성이다.
한 구체예에서, 독특한 유인원-유래 캡시드 단백질이 제공된다. 본원에서 사용된 바와 같은, 유인원-유래 캡시드 단백질은, 제한 없이, 키메라 캡시드 단백질, 융합 단백질, 인공 캡시드 단백질, 합성 캡시드 단백질, 및 재조합 캡시드 단백질을 포함하여, 이들 단백질의 의미에 대한 제한 없이, 상기 정의한 바와 같은 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 캡시드 단백질 또는 그것의 단편을 함유하는 어떤 아데노바이러스 캡시드 단백질을 포함한다.
적당하게, 이들 유인원-유래 캡시드 단백질은 다른 아데노바이러스 항원형의 캡시드 영역 또는 그것의 단편, 또는 본원에서 기술되는 바와 같은 변형된 유인원 캡시드 단백질 또는 단편과 조합하여 하나 이상의 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 영역 또는 그것의 단편(예를 들어, 헥손, 펜톤, 섬유 또는 그것의 단편)을 함유한다. 본원에서 사용되는 바와 같은 "변형된 굴성과 연관된 캡시드 단백질의 변형"은 변경된 캡시드 단백질, 즉, 펜톤, 헥손 또는 섬유 단백질 영역, 또는 그것의 단편, 예로써, 섬유 영역의 혹(knob) 도메인, 또는 이를 암호화하는 폴리뉴클레오티드를 포함하는데, 특이성은 변경된다. 유인원-유래 캡시드는 인간 또는 비-인간 기원일 수 있는 하나 이상의 본 발명 또는 다른 Ad 항원형과 함께 구성될 수 있다. 이러한 Ad는 ATCC, 상업적 및 학업적 공급원을 포함하는 다양한 공급원으로부터 획득될 수 있고, 또는 Ad의 서열은 GenBank 또는 다른 적당한 공급원으로부터 획득될 수 있다.
SAdV-39 [SEQ ID NO: 6], SAdV-25.2 [SEQ ID NO: 135], -26 [SEQ ID NO: 167], -30 [SEQ ID NO: 103], -37 [SEQ ID NO: 38] 또는 -38 [SEQ ID NO: 70]의 펜톤 단백질의 아미노산 서열이 제공된다. 적절하게는, 이들 펜톤 단백질, 또는 그것의 독특한 단편은 다양한 목적을 위해 이용될 수 있다. 적절한 단편의 예는 각각 상기 제공된 아미노산 넘버링 및 각각 SEQ ID NO:6, 103, 135, 38, 70, 167 또는 70에 기초한 약 50, 100, 150, 또는 200개의 아미노산의 N-말단 및/또는 C-말단의 절단(truncation)을 가지는 펜톤을 포함한다. 다른 적당한 단편은 더 짧은 내부의, C-말단의 또는 N-말단의 단편을 포함한다. 추가로, 펜톤 단백질은 당업자에게 공지된 다양한 목적을 위해 변형될 수 있다.
또한 SAdV-39[SEQ ID NO: 11], SAdV-25.2 [SEQ ID NO: 140], -26 [SEQ ID NO: 172], -30 [SEQ ID NO: 108], -37 [SEQ ID NO: 43] 또는 -38 [SEQ ID NO: 75]의 헥손 단백질의 아미노산 서열이 제공된다. 적절하게는, 이들 헥손 단백질, 또는 그것의 독특한 단편은 다양한 목적을 위해 이용될 수 있다. 적절한 단편의 예는 상기 제공된 아미노산 넘버링 및 각각 SEQ ID NO: 11, 140, 172, 108, 43, 또는 75에 기초한 약 50, 100, 150, 200, 300, 400, 또는 500개의 아미노산의 N-말단 및/또는 C-말단의 절단을 가지는 헥손을 포함한다. 다른 적당한 단편은 더 짧은 내부의, C-말단, 또는 N-말단의 단편을 포함한다. 예를 들어, 한 적당한 단편은 헥손 단백질, 지정된 DE1 및 FG1, 또는 그것의 고도가변 영역의 루프 영역(도메인)이다. 이러한 단편은 각각 SEQ ID NO: 11, 140, 172, 108, 43 또는 75에 대하여 유인원 헥손 단백질의 아미노산 잔기 약 125 내지 443; 약 138 내지 441, 또는 더 적은 단편을 걸치는 영역, 예로써, 약 잔기 138 내지 잔기 163; 약 170 내지 약 176; 약 195 내지 약 203; 약 233 내지 약 246; 약 253 내지 약 374; 약 287 내지 약 297; 및 약 404 내지 약 430을 걸치는 것을 포함한다. 다른 적당한 단편은 당업자에 의해 용이하게 확인될 수 있다. 추가로, 헥손 단백질은 당업계에 공지된 다양한 목적을 위해 변형될 수 있다. 헥손 단백질이 아데노바이러스의 항원형에 대한 결정요인이기 때문에, 이러한 인공 헥손 단백질은 인공 항원형을 가지는 아데노바이러스를 초래할 수 있다. 다른 인공 캡시드 단백질은 또한 침팬지 Ad 펜톤 서열 및/또는 본 발명의 섬유 서열 및/또는 그것의 단편을 사용하여 구성될 수 있다.
한 구체예에서, SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 헥손 단백질의 서열을 이용하는 변경된 헥손 단백질을 가지는 아데노바이러스가 발생될 수 있다. 헥손 단백질을 변경하는 한 적절한 방법은 참고로써 포함되는 미국 특허 5,922,315호에서 기술된다. 이 방법에서, 아데노바이러스 헥손의 적어도 하나의 루프 영역은 다른 아데노바이러스 항원형의 적어도 하나의 루프 영역으로 변경된다. 따라서, 이러한 변경된 아데노바이러스 헥손 단백질의 적어도 하나의 루프 영역은 SAdV-39의 유인원 Ad 헥손 루프 영역이다. 한 구체예에서, SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 헥손 단백질의 루프 영역은 다른 아데노바이러스 항원형으로부터 루프 영역으로써 대체된다. 다른 구체예에서, SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 헥손의 루프 영역은 다른 아데노바이러스 항원형의 루프 영역을 대체하기 위해 사용된다. 적절한 아데노바이러스 항원형은 본원에서 기술되는 바와 같은 인간과 비-인간 항원형 중으로부터 용이하게 선택될 수 있다. 적당한 항원형의 선택은 본 발명에서 제한되지 않는다. SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 헥손 단백질 서열에 대한 또 다른 사용은 당업자에게 용이하게 명백할 것이다.
SAdV-39 [SEQ ID NO:22], SAdV-25.2 [SEQ ID NO: 151], -26 [SEQ ID NO: 183], -30 [SEQ ID NO: 118], -37 [SEQ ID NO: 54] 또는 -38 [SEQ ID NO: 85]의 섬유 단백질의 아미노산 서열이 제공된다. 적절하게는, 이 섬유 단백질, 또는 그것의 독특한 단편은 다양한 목적을 위해 이용될 수 있다. 한 적절한 단편은 SEQ ID NO: 22, 151, 183, 119, 54 또는 85 내에 위치되는 섬유 혹이다. 다른 적절한 단편의 예는 SEQ ID NO: 22, 151, 183, 119, 54 또는 85에서 제공되는 아미노산 넘버링에 기초하여 약 50, 100, 150, 또는 200 아미노산의 N-말단의 및/또는 C-말단의 절단을 가지는 섬유를 포함한다. 또 다른 적절한 단편은 내부 단편을 포함한다. 추가로, 섬유 단백질은 당업자에게 공지된 다양한 기술을 사용하여 변형될 수 있다.
SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38의 단백질의 독특한 단편은 길이에 있어서 적어도 8개의 아미노산이다. 그러나, 다른 원하는 길이의 단편이 용이하게 이용될 수 있다. 게다가, 변형은 SSAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 유전자 생성물의 수율 및/또는 발현을 향상시키기 위해 도입될 수 있고, 예를 들어, SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 유전자 생성물의 모두 또는 단편이 향상을 위해 융합 파트너와 융합되는(직접 또는 링커를 통해) 융합 분자의 구성이 본원에서 제공된다. 다른 적절한 변형은, 제한 없이, 보통 절단되는 전- 또는 후-단백질을 제거하기 위해 및 성숙 단백질 또는 효소 및/또는 비밀 유전자 생성물을 제공하기 위한 코딩 영역의 돌연변이를 제공하기 위해 코딩 영역(예를 들어, 단백질 또는 효소)의 절단을 포함한다. 또 다른 변형은 당업자에게 용이하게 명백할 것이다. 더 나아가 본원에 제공된 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 단백질과 적어도 약 99% 동일성을 가지는 단백질이 포함된다.
본원에서 기술되는 바와 같은, SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38의 아데노바이러스 캡시드 단백질을 함유하는 본 발명의 벡터는 중화벡터가 다른 Ad 항원형계 벡터 뿐만 아니라 다른 바이러스 벡터의 유효성을 감소시키는 용도에서의 사용에 특히 적합하다. rAd 벡터는 반복 유전자 치료 또는 부스팅 면역 반응(백신 타이터)을 위한 재투여에서 특히 유리하다.
특정 환경 하에서, 항체를 발생시키기 위한 하나 이상의 SAdV39, SAdV-25.2, -26, -30, -37 또는 -38 유전자 생성물(예를 들어, 캡시드 단백질 또는 그것의 단편)을 사용하는 것이 바람직할 수 있다. 본원에서 사용되는 용어 "항체"는 에피토프에 특이적으로 결합할 수 있는 면역글로불린 분자를 말한다. 항체는, 예를 들어, 고친화도 폴리클로날 항체, 모노클로날 항체, 합성 항체, 키메라 항체, 재조합 항체 및 인간화된 항체를 포함하는 다양한 형태로 존재할 수 있다. 이러한 항체는 면역글로불린 분류 IgG, IgM, IgA, IgD 및 IgE로부터 기원한다.
이러한 항체는 당업계에 알려진 어떤 다수의 방법을 사용하여 발생될 수 있다. 적절한 항체는 잘-알려진 전통적인 기술, 예를 들어, Kohler 및 Milstein, 및 그것의 많은 공지된 변형에 의해 발생될 수 있다. 유사하게, 바람직한 고역가 항체는 이들 항원에서 개발된 모노클로날 또는 폴리클로날 항체에 대한 공지된 재조합 기술을 적용함으로써 발생될 수 있다[예를 들어, PCT 특허 출원 No. PCT/GB85/00392; 영국 특허 출원 공개 번호 GB2188638A; Amit et al., 1986 Science, 233:747-753; Queen et al., 1989 Proc . Nat'l . Acad . Sci . USA, 86: 10029-10033; PCT 특허 출원 번호 PCT/WO9007861; 및 Riechmann et al., Nature, 332:323-327 (1988); Huse et al, 1988a Science, 246: 1275-1281 참조]. 또 다르게는, 항체는 본 발명의 항원에 동물 또는 인간 항체의 상보성 결정 영역을 조작함으로써 생성될 수 있다. 예를 들어, E. Mark and Padlin, "Humanization of Monoclonal Antibodies", Chapter 4, The Handbook of Experimental Pharmacology, Vol. 113, The Pharmacology of Monoclonal Antibodies, Springer-Verlag (June, 1994); Harlow et al., 1999, Using Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, NY; Harlow et al., 1989, Antibodies: A Laboratory Manual, Cold Spring Harbor, New York; Houston et al., 1988, Proc . Natl. Acad . Sci USA 85:5879-5883; 및 Bird et al., 1988, Science 242:423-426 참조. 추가로 본 발명에 의해 항-유전자형 항체(Ab2) 및 항-항-유전자형 항체(Ab3)가 제공된다. 예를 들어, M. Wettendorff et al., "Modulation of anti-tumor immunity by anti-idiotypic antibodies." In Idiotypic Network and Diseases, ed. by J. Cerny and J. Hiernaux, 1990 J. Am. Soc . Microbiol ., Washington DC: pp. 203-229]. 이들 항-유전자형 및 항-항-유전자형 항체는 당업계에 공지된 기술을 사용하여 생성된다. 이들 항체는 진단적 및 임상적 방법 및 키트를 포함하는 다양한 목적을 위해 사용될 수 있다.
특정 환경 하에서, SAdV39, SAdV-25.2, -26, -30, -37 또는 -38 유전자 생성물, 항체 또는 본 발명의 다른 구조체에 검출가능한 표지 또는 태그를 도입하는 것이 바람직할 수 있다. 본원에서 사용되는 바와 같은, 검출가능한 표지는 단독으로 또는 다른 분자와 상호작용하여, 검출가능한 신호를 제공할 수 있는 분자이다. 가장 바람직하게는, 표지는, 예를 들어, 면역 조직 화학 분석 또는 면역 형광 현미경검사에 의해 시각적으로 검출가능하다. 예를 들어, 적당한 표지는 플루오르세인 이소티오시아네이트 (FITC), 피코에리트린 (PE), 알로피코시아닌(APC), 코리포스핀-O (CPO) 또는 탠덤 염료, PE-시아닌-5 (PC5), 및 PE-텍사스 레드(ECD)를 포함한다. 모든 이들 형광 염료는 상업적으로 이용가능하고, 그것들의 사용은 당업계에 공지되어 있다. 다른 유용한 표지는 콜로이드 골드 표지를 포함한다. 또 다른 유용한 표지는 방사성 화합물 또는 원소를 포함한다. 추가적으로, 표지는 분석에서 측색 신호를 나타내기 위해 작동하는 다양한 효소 시스템을 포함하며, 예를 들어, 글루코오스 옥시다아제(기질로서 글루코오스를 사용)는 페록시다아제 및 테트라메틸 벤지딘(TMB)과 같은 수소 도너의 존재하에서 푸른색으로서 보이는 산화된 TMB를 생성하는 생성물로서 과산화물을 방출한다. 다른 예는 ATP, 글루코오스, 및 NAD+와 반응하는 글루코오스-6-포스페이트 탈수소효소와 함께 양고추냉이과산화효소 (HRP), 알칼리 포스파타아제 (AP), 및 헥소키나아제를 포함하여, 특히 340 nm 파장에서 증가된 흡광도로서 검출되는 NADH를 얻는다.
본원에서 기술되는 방법에서 이용되는 다른 표지 시스템은 다른 수단, 예를 들어, 주입된 염료가 적용가능한 분석에서 결과 복합체의 존재하에서 시각적 신호 표시를 제공하기 위한 표적 서열과 콘쥬게이트를 형성하는 효소 대신에 사용되는 착색 라텍스 마이크로입자[Bangs Laboratories, Indiana]에 의해 검출가능하다.
원하는 분자와 표지를 커플링 또는 결합하는 방법은 마찬가지로 통상적이며 당업자에게 공지되어 있다. 표지 부착의 공지된 방법이 기술된다[예를 들어, Handbook of Fluorescent probes and Research Chemicals, 6th Ed., R.P.M. Haugland, Molecular Probes, Inc., Eugene, OR, 1996; Pierce Catalog and Handbook, Life Science and Analytical Research Products, Pierce Chemical Company, Rockford, IL, 1994/1995 참조]. 따라서, 표지 및 커플링 방법의 선택은 본 발명을 제한하지 않는다.
SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38의 서열, 단백질, 및 단편은 재조합 생성물, 화학적 합성, 또는 다른 합성 수단을 포함하는 임의의 적절한 수단에 의해 생성될 수 있다. 적절한 생성 기술은 당업자에게 잘 공지되어 있다. 예를 들어, Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press (Cold Spring Harbor, NY) 참조. 또 다르게는, 펩티드는 또한 잘 공지된 고체 상 펩티드 합성 방법(Merrifield, J. Am. Chem . Soc, 85:2149 (1962); Stewart and Young, Solid Phase Peptide Synthesis (Freeman, San Francisco, 1969) pp. 27-62)에 의해 합성될 수 있다. 이들 및 다른 적절한 생성 방법은 당업자의 지식 내이며, 본 발명의 범위를 제한하지 않는다.
게다가, 당업자는 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 서열이 치료 및 면역 분자의 시험관 내, 생체 밖 또는 생체 내 전달을 위한 다양한 바이러스 및 비-바이러스 벡터 시스템을 위한 사용에 용이하게 적용될 수 있다는 것을 용이하게 이해할 것이다. 예를 들어, 한 구체예에서, 유인원 Ad 캡시드 단백질 및 본원에서 기술되는 다른 유인원 아데노바이러스 단백질은 비-바이러스, 유전자의 단백질계 전달, 단백질 및 기타 바람직한 진단적, 치료적 및 면역적 분자에 대해 사용된다. 한 이러한 구체예에서, 본 발명의 단백질은 아데노바이러스에 대한 수용체와 함께 세포를 표적화하기 위한 분자에 직접 또는 간접적으로 연결된다. 바람직하게는, 헥손, 펜톤, 섬유 또는 세포 표면 수용체를 위한 리간드를 가지는 그것의 단편과 같은 캡시드 단백질이 이러한 표적을 위해 선택된다. 전달에 적당한 분자는 본원에서 기술되는 치료적 분자와 그것의 유전자 생성물 중에서 선택된다. 지질, 폴리Lys 등을 포함하는 다양한 링커가 링커로서 이용될 수 있다. 예를 들어, 유인원 펜톤 단백질은 Medina-Kauwe LK, et al, Gene Ther . 2001년 5월; 8(10):795-803 및 Medina-Kauwe LK, et al, Gene Ther . 2001년 12월; 8(23): 1753-1761에서 기술되는 것과 유사한 방법으로 유인원 펜톤 서열을 사용하여 융합 단백질의 생성에 의한 목적을 위해 용이하게 이용될 수 있다. 또 다르게는, 유인원 Ad 단백질 IX의 아미노산 서열은 미국 특허 출원 20010047081에 기술되는 바와 같은 세포 표면 수용체에 벡터를 표적화하기 위해 이용될 수 있다. 적당한 리간드는 CD40 항원, RGD-함유 또는 폴리리신-함유 서열 등을 포함한다. 예를 들어, 헥손 단백질 및/또는 섬유 단백질을 포함하는 또 다른 유인원 Ad 단백질은 이들 및 유사한 목적을 위해 사용될 수 있다.
또 다른 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 아데노바이러스 단백질은 당업자에게 용이하게 명백할 다양한 목적을 위하여 단독으로, 또는 다른 아데노바이러스 단백질과 조합하여 사용될 수 있다. 게다가, SAdV 아데노바이러스 단백질의 또 다른 사용은 당업자에게 용이하게 명백할 것이다.
II. 재조합 아데노바이러스 벡터
본원에 기술되는 조성물은 치료 또는 백신 목적을 위해 세포에 이종성 분자를 전달하는 벡터를 포함한다. 본원에서 사용되는, 벡터는 제한 없이, 네이키드 DNA, 파지, 트랜스포존, 코스미드, 에피솜, 플라스미드, 또는 바이러스를 포함하는 어떤 유전적 요소를 포함할 수 있다. 이러한 벡터는 SAdV39, SAdV-25.2, -26, -30, -37 또는 -38 및 미니유전자의 유인원 아데노바이러스를 함유한다. "미니유전자" 또는 "발현 카세트"는 숙주 세포에서 유전자 생성물의 번역, 전사 및/또는 발현을 작동하는데 필요한 선택된 이종성 유전자 및 다른 조절 요소의 조합을 의미한다.
전형적으로, SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 -유래된 아데노바이러스 벡터는 설계되어 선택된 아데노바이러스 유전자에 고유한 영역에서 미니유전자는 다른 아데노바이러스 서열을 함유하는 핵산 분자에 위치된다. 미니유전자는 원한다면, 영역의 기능을 방해하기 위해 존재하는 유전자 영역에 도입될 수 있다. 또 다르게는, 미니유전자는 부분적으로 또는 완전히 결실된 아데노바이러스 유전자의 자리에 삽입될 수 있다. 예를 들어, 미니유전자는 특히 선택될 수 있는 기능적 E1 결실 또는 기능적 E3 결실의 자리와 같은 자리에서 위치될 수 있다. 용어 "기능적으로 결실된" 또는 "기능적 결실"은 유전자 영역의 충분한 양이 예를 들어, 돌연변이 또는 변형에 의해 제거 또는 다르게는 손상되어, 유전자 영역은 유전자 발현의 기능적 생성물을 더 이상 생성할 수 없음을 의미한다. 원한다면, 전체 유전자 영역이 제거될 수도 있다. 유전자 파괴 또는 결실을 위한 다른 적절한 자리는 본 출원의 어디에서나 논의된다.
예를 들어, 재조합 바이러스의 발생에 유용한 생성 벡터에 대해, 벡터는 미니유전자 및 아데노바이러스 게놈의 5' 말단 또는 아데노바이러스 게놈의 3' 말단 중 하나, 또는 아데노바이러스 게놈의 5'과 3' 둘 다를 함유할 수 있다. 아데노바이러스 게놈의 5' 말단은 패키징 및 복제에 필요한 5' 시스-구성요소; 즉, 5' 역위 말단 반복 (ITR) 서열(복제의 기원으로서 작용) 및 본래의 5' 패키징 인핸서 도메인(E1 프로모터를 위한 패키징 선형 Ad 게놈 및 인핸서 요소에 필요한 서열을 함유)을 함유한다. 아데노바이러스 게놈의 3' 말단은 패키징 및 단백질 막화(encapsidation)에 필요한 3' 시스-구성요소(ITR을 포함)를 포함한다. 적절하게는, 재조합 아데노바이러스는 5' 및 3' 아데노바이러스 시스-구성요소를 함유하며, 미니유전자는 5' 및 3' 아데노바이러스 서열 사이에 위치된다. SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 기초 아데노바이러스 벡터는 또한 추가 아데노바이러스 서열을 함유할 수 있다.
적절하게는, 이들 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 기초 아데노바이러스 벡터는 본 발명의 아데노바이러스 게놈으로부터 유래된 하나 이상의 아데노바이러스 구성요소를 함유할 수 있다. 한 구체예에서, 벡터는 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38로부터의 아데노바이러스 ITR 및 동일한 아데노바이러스 항원형으로부터의 추가 아데노바이러스 서열을 함유한다. 다른 구체예에서, 벡터는 ITR을 제공하는 것보다 다른 아데노바이러스 항원형으로부터 유래되는 아데노바이러스 서열을 함유한다.
본원에서 정의되는 바와 같이, 슈도타입화된(pseudotyped) 아데노바이러스는 아데노바이러스의 캡시드 단백질이 ITR을 제공하는 아데노바이러스 보다 다른 아데노바이러스로부터 오는 아데노바이러스를 말한다.
추가로, 키메라 또는 하이브리드 아데노바이러스는 당업자에게 공지된 기술을 사용하여 본원에 기술된 아데노바이러스를 사용하여 구성될 수 있다. 예를 들어, 미국 특허 US 7,291,498호 참조.
ITR의 아데노바이러스 공급원 및 벡터에 존재하는 어떤 다른 아데노바이러스 서열의 공급원은 본 발명을 제한하지 않는다. 다양한 아데노바이러스 균주가 American Type Culture Collection, Manassas, Virginia로부터 이용가능하고, 또는 다양한 상업적 및 기관의 공급원으로부터 이용가능하다. 추가로, 많은 이러한 균주의 서열은 예를 들어, PubMed 및 GenBank를 포함하는 다양한 데이터베이스로부터 이용가능하다. 다른 유인원 또는 인간 아데노바이러스로부터 제조된 상동 아데노바이러스 벡터는 공개된 문헌에서 기술된다[예를 들어, 미국 특허 5,240,846호 참조]. 다수의 아데노바이러스 종류의 DNA 서열은 타입 Ad5[GenBank 등록 번호 M73260]를 포함하여, GenBank로부터 이용가능하다. 아데노바이러스 서열은 항원형 2, 3, 4, 7, 12 및 40과 같은 어떤 공지된 아데노바이러스 항원형으로부터 얻을 수 있고, 또한 어떤 본원에서 확인되는 인간형을 포함한다. 유사하게 비-인간 동물(예를 들어, 유인원)을 감염시키는 것으로 알려진 아데노바이러스는 또한 본 발명의 벡터 구조체에서 사용될 수 있다. 예를 들어, 미국 특허 6,083,716호 참조.
바이러스 서열, 헬퍼 바이러스(필요하다면), 및 재조합 바이러스 입자, 및 다른 벡터 성분 및 본원에 기술되는 벡터의 구조체에서 사용되는 서열은 상기 기술된 바와 같이 획득된다. 본 발명의 SAdV39 , SAdV-25.2, -26, -30, -37 또는 -38 유인원 아데노바이러스 서열의 DNA 서열은 벡터 및 이러한 벡터의 제조에 유용한 셀 라인을 구성하기 위해 사용된다.
서열 결실, 삽입, 및 다른 돌연변이를 포함하는 본 발명의 벡터를 형성하는 핵산 서열의 변형은 표준 분자 생물학적 기술을 사용하여 발생될 수 있고, 본 구체예의 범주 내이다.
A. "미니유전자"
이식 유전자의 선택, "미니유전자"의 클로닝 및 구성 및 바이러스 벡터에 그것의 삽입을 위해 사용되는 방법은 본원에서 제공되는 교시가 주어지는 당업계의 기술 내이다.
1. 이식 유전자
이식 유전자는 관심의 폴리펩티드, 단백질, 또는 다른 생성물을 암호화하는 이식 유전자 옆에 위치하는 벡터 서열에 이종성인 핵산 서열이다. 핵산 코딩 서열은 숙주 세포에서 이식 유전자 전사, 번역 및/또는 발현을 허용하는 방식으로 조절 성분에 작동가능하게 연결된다.
이식 유전자 서열의 조성은 결과 벡터가 위치될 곳에서 사용에 의존할 것이다. 예를 들어, 한 종류의 이식 유전자 서열은 발현이 검출가능한 신호를 생성할 때 리포터 서열을 포함한다. 이러한 리포터 서열은, 제한 없이, DNA 서열 암호화 β-락타마아제, β-갈락토시다아제(LacZ), 알칼린 포스파타아제, 티미딘 키나아제, 녹색 형광 단백질(GFP), 클로람페니콜 아세틸트랜스페라아제(CAT), 루시페라아제, 예를들어, CD2, CD4, CD8를 포함하는 막 결합 단백질, 인플루엔자 헤마그글루티닌 단백질, 및 당업계에 잘 공지된 다른 것을 그것과 관련된 고친화도 항체에서 포함하며, 또는 통상적인 수단, 및 특히 헤마그글루티닌 또는 Myc로부터 항원 태그 도메인에 적절하게 융합된 막 결합 단백질을 포함하는 융합 단백질에 의해 생성될 수 있다. 이들 코딩 서열은, 그것의 발현을 작동시키는 조절 요소와 결합될 때, 효소, 방사선 촬영, 측색, 형광 또는 다른 분광기 분석, 형광 활성화 세포 정렬 분석 및 효소면역분석(ELISA), 방사면역측정법(RIA) 및 면역 조직 화학을 포함하는 면역 분석을 포함하는 통상적인 수단에 의해 검출가능한 신호를 제공한다. 예를 들어, 마커 서열은 LacZ 유전자이며, 신호를 전달하는 벡터의 존재는 베타-갈락토시다아제 활성에 대한 분석에 의해 검출된다. 이식 유전자가 GFP 또는 루시페라아제인 경우, 신호를 전달하는 벡터는 광도계에서 색 또는 광 생성에 의해 시각적으로 측정될 수있다.
한 구체예에서, 이식 유전자는 단백질, 펩티드, RNA, 효소, 또는 촉매적 RNA와 같은 생물 및 의학에서 유용한 생성물을 암호화하는 비-마커 서열이다. 바람직한 RNA 분자는 tRNA, dsRNA, 리보솜 RNA, 촉매적 RNA, 및 안티센스 RNA를 포함한다. 유용한 RNA 서열의 한 예는 처치 동물에서 표적 핵산 서열의 발현을 끝내는 서열이다.
이식 유전자는 암 치료제 또는 백신으로서, 면역 반응의 유발, 및/또는 예방 백신 목적을 위한 예를 들어, 유전적 결함의 치료에 사용될 수 있다. 본원에서 사용되는 바와 같은, 면역 반응의 유발은 분자에서 T 세포 및/또는 체액성 면역반응을 유발하는 분자의 능력(예를 들어, 유전자 생성물)을 말한다. 본 발명은 추가로 예를 들어, 멀티-서브유닛 단백질에 의해 야기되는 질환을 고치거나 또는 완화하기 위해 다양한 이식 유전자를 사용하는 것을 포함한다. 특정 상황에서, 다른 이식 유전자는 단백질의 각 서브유닛을 암호화하고, 또는 다른 펩티드 또는 단백질을 암호화하기 위해 사용될 수 있다. 이는 단백질 서브유닛을 암호화하는 DNA의 크기가 클 때, 예를 들어, 면역글로불린, 혈소판-유래 성장인자, 또는 디스트로핀 단백질에 대해 바람직하다. 멀티-서브유닛 단백질을 생성하기 위한 세포를 위해, 세포는 각각의 다른 서브유닛을 함유하는 재조합 바이러스로 감염된다. 또 다르게는, 단백질의 다른 서브유닛은 동일한 이식 유전자에 의해 암호화될 수 있다. 이 경우에, 단일 이식 유전자는 내부 리보자임 유입 자리(IRES)에 의해 분리된 각 서브유닛에 대한 DNA와 함께, 각각의 서브유닛을 암호화하는 DNA를 포함한다. 이는 각각의 서브유닛을 암호화하는 DNA의 자리가 작을 때, 예를 들어, 서브유닛 및 IRES를 암호화하는 DNA의 전체 크기가 5 킬로베이스 미만일 때, 바람직하다. IRES에 대한 대안으로서, DNA는 번역-후 사건에서 자기-절단하는 2A 펩티드를 암호화하는 서열에 의해 분리될 수 있다. 예를 들어, ML. Donnelly, et al., J. Gen. Virol ., 78(Pt 1): 13-21 (1997년 1월); Furler, S., et al, Gene Ther ., 8(11):864-873 (2001년 6월); Klump H., et al, Gene Ther ., 8(10):811-817(2001년 5월) 참조. 이 2A 펩티드는 IRES보다 상당히 더 작으며, 공간이 제한 인자일 때 사용에 적합하도록 만든다. 그러나, 선택된 이식 유전자가 어떤 생물학적으로 활성인 생성물 또는 다른 생성물, 예를 들어, 연구에 바람직한 생성물을 암호화할 수도 있다.
적당한 이식 유전자는 당업자에 의해 용이하게 선택될 수 있다. 이식 유전자의 선택은 이 구체예를 제한하는 것으로 고려되지 않는다.
2. 조절 요소
미니유전자에 대해 상기 확인된 주요 요소에 더하여, 벡터는 또한 플라스미드 벡터로 트랜스펙팅된 또는 본 발명에 의해 생성되는 바이러스로 감염된 세포에서 그것의 전사, 번역 및/또는 발현을 허용하는 방식으로 이식 유전자에 작동가능하게 연결되는 필요한 통상적인 조절 요소를 포함한다. 본원에 사용되는 바와 같은, "작동가능하게 연결된" 서열은 관심의 유전자와 인접하는 발현 조절 서열 및 트랜스에서 또는 관심의 유전자를 조절하기 위한 거리에서 작용하는 발현 조절 서열을 둘 다 포함한다.
발현 조절 서열은 적절한 전사, 개시, 종결, 프로모터 및 인핸서 서열; 스플라이싱 및 폴리아데닐화(폴리A) 신호와 같은 효율적인 RNA 처리 신호; 세포질 mRNA를 안정화하는 서열; 번역 효율을 향상시키는 서열(즉, Kozak 일치 서열); 단백질 안정성을 향상시키는 서열; 및 필요하다면, 암호화된 생성물의 분비를 향상시키는 서열을 포함한다.
원래의, 구성의, 유도의 및/또는 조직-특이적인 프로모터를 포함하는 매우 다수의 발현 조절 서열은 당업계에 공지되어 있으며 이용될 수 있다. 구성 프로모터의 예는, 제한없이 로우스육종바이러스 (RSV) LTR 프로모터(선택적으로 RSV 인핸서와 함께), 시토메갈로 바이러스(CMV) 프로모터(선택적으로 CMV 인핸서와 함께)[예를 들어, Boshart et al, Cell, 41:521-530 (1985) 참조], SV40 프로모터, 디히드로폴레이트 환원효소 프로모터, β-액틴 프로모터, 포스포글리세롤 키나아제(PGK) 프로모터, 및 EF1α 프로모터[Invitrogen]를 포함한다.
유도성 프로모터는 유전자 발현의 조절을 허용하고 외인성으로 공급된 화합물, 온도와 같은 환경적 인자, 또는 특이적인 생리적 상태의 존재, 예를 들어, 급성 병기, 세포의 특정 분화 상태, 또는 단지 세포를 복제하는 것에 의해 조절될 수 있다. 유도성 프로모터 및 유도성 시스템은, 제한 없이, Invitrogen, Clontech 및 Ariad를 포함하는 다양한 상업적 공급원으로부터 이용가능하다. 많은 다른 시스템이 기술되었고 당업자에 의해 용이하게 선택될 수 있다. 예를 들어, 유도성 프로모터는 아연-유도성 양 메탈로티오닌(MT) 프로모터 및 덱사메타손(Dex)-유도성 마우스 유방 종양 바이러스 (MMTV) 프로모터를 포함한다. 다른 유도성 시스템은 T7 폴리머라아제 프로모터 시스템[WO 98/10088]; 엑디손 곤충 프로모터 [No et al, Proc. Natl . Acad . Sci . USA, 93:3346-3351 (1996)], 테트라사이클린-억제성 시스템[Gossen et al, Proc . Natl . Acad . Sci . USA, 89:5547-5551 (1992)], 테트라사이클린-유도성 시스템[Gossen et al, Science, 268:1766-1769 (1995), 또한 Harvey et al, Curr . Opin . Chem . Biol ., 2:512-518 (1998) 참조]을 포함한다. 다른 시스템은 카스트라디올(castradiol), 디페놀 무리슬레론(diphenol murislerone)을 사용하는 FK506 다이머, VP16 또는 p65, RU486-유도성 시스템[Wang et al, Nat. Biotech., 15:239-243 (1997) 및 Wang et al, Gene Ther ., 4:432-441 (1997)] 및 라파마이신-유도성 시스템[Magari et al, J. Clin . Invest., 100:2865-2872 (1997)]을 포함한다. 일부 유도성 프로모터의 유효성은 시간에 따라 증가한다. 이러한 경우에, 탠덤에서 다양한 억제물질을 삽입함으로써 이러한 시스템, 예를 들어, IRES에 의해 TetR에 연결된 TetR의 효율성을 향상시킬 수 있다. 또 다르게는, 원하는 기능에 대한 스크리닝 전에 적어도 3일을 기다릴 수 있다. 이 시스템의 효율성을 향상시키기 위해 공지된 수단에 의해 원하는 단백질의 발현을 향상시킬 수 있다. 예를 들어, Woodchuck Hepatitis Virus Posttranscriptional Regulatory Element (WPRE)를 사용한다.
다른 구체예에서, 이식 유전자에 대한 원래의 프로모터가 사용될 것이다. 본래 프로모터는 이식 유전자의 발현이 본래 발현을 모방하는 것으로 소망될 때 바람직할 수 있다. 원래 프로모터는 이식 유전자의 발현이 일시적으로 또는 발달적으로, 또는 조직-특이적 방법으로, 또는 특이적 전사 자극에 대한 반응으로 조절되어야 할 때, 사용될 수 있다. 추가 구체예에서, 인핸서 요소, 폴리아데닐화 자리 또는 Kozak 일치 서열과 같은 다른 본래 발현 조절 요소는 또한 본래 발현을 모방하도록 사용될 수 있다.
이식 유전자의 다른 구체예는 조직-특이적 프로모터에 작동가능하게 연결된 이식 유전자를 포함한다. 예를 들어, 골격근에서 발현이 소망된다면, 근육에서 활성인 프로모터가 사용되어야 한다. 이들은 골격의 β-액틴, 미오신 경사슬 2A, 디스트로핀, 근육 크레아틴 키나아제를 암호화하는 유전자로부터의 프로모터뿐만 아니라 자연적으로 발생하는 프로모터보다 더 높은 활성을 가지는 합성 근육 프로모터를 포함한다(Li et al., Nat. Biotech., 17:241-245 (1999)). 조직-특이적인 프로모터의 예는 간(알부민, Miyatake et al, J. Virol , 71 :5124-32 (1997); B형 간염바이러스 코어 프로모터, Sandig et al, Gene Ther ., 3: 1002-9 (1996); 알파-태아 단백질(AFP), Arbuthnot et al., Hum. Gene Ther ., 7: 1503-14 (1996)), 뼈 오스테오칼신(Stein et al, Mol . Biol . Rep., 24:185-96 (1997)); 뼈 시알로단백질(Chen et al, J. Bone Miner. Res., 11:654-64 (1996)), 림프구 (CD2, Hansal et al, J. Immunol, 161:1063-8 (1998); 면역글로불린 중사슬; T 세포 수용체 사슬), 뉴런-특이적 에놀라아제(NSE) 프로모터와 같은 신경세포(Andersen et al, Cell. Mol. Neurobiol, 13:503-15 (1993)), 신경미세섬유 경-사슬 유전자(Piccioli et al, Proc . Natl . Acad . Sci USA, 88:561 1-5 (1991)), 및 특히 뉴런-특이적 vgf 유전자(Piccioli et al, Neuron, 15:373-84 (1995))에 대해 알려져 있다.
선택적으로, 치료적으로 유용한 또는 면역성 생성물을 암호화하는 이식 유전자를 전달하는 벡터는 또한 선택가능한 마커를 포함할 수 있고, 또는 리포터 유전자는 특히 제네티신, 하이그로미신 또는 퓨리마이신 저항을 암호화하는 서열을 포함할 수 있다. 이러한 선택가능한 리포터 또는 마커 유전자(바람직하게는 바이러스 입자안으로 패키징되는 바이러스 게놈 밖에 위치됨)는 암피실린 저항과 같은 박테리아 세포에서 플라스미드의 존재를 표시하는데 사용될 수 있다. 벡터의 다른 성분은 복제의 기원을 포함할 수 있다. 이들 및 다른 프로모터 및 벡터 요소의 선택은 통상적이며 많은 이러한 서열이 이용가능하다[예를 들어, Sambrook et al, 및 그것에 인용된 참고문헌 참조].
이들 벡터는 당업자에게 공지된 기술과 함께, 본원에 제공된 기술 및 서열을 사용하여 발생된다. 이러한 기술은 문헌[Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, NY]에서 기술되는 것과 같은 cDNA의 통상적인 클로닝 기술, 아데노바이러스 게놈의 중복 올리고뉴클레오티드 서열의 사용, 폴리머라아제 연쇄 반응, 및 원하는 뉴클레오티드 서열을 제공하는 어떤 적당한 방법을 포함한다.
III. 바이러스 벡터의 생성
한 구체예에서, 유인원 아데노바이러스 플라스미드(또는 다른 벡터)는 아데노바이러스 벡터를 만드는데 사용된다. 한 구체예에서, 아데노바이러스 벡터는 복제-결함의 아데노바이러스 입자이다. 한 구체예에서, 아데노바이러스 입자는 E1a 및/또는 E1b 유전자에서 결실에 의한 복제-결함이 제공된다. 또 다르게는, 아데노바이러스는, 선택적으로 E1a 및/또는 E1b 유전자를 보유하는 동안 다른 수단에 의한 복제-결함이 제공된다. 아데노바이러스 벡터는 또한 아데노바이러스 게놈에서 다른 돌연변이, 예를 들어, 다른 유전자에서 온도-민감 돌연변이 또는 결실을 함유할 수 있다. 다른 구체예에서, 아데노바이러스 벡터에서 무결함 E1a 및/또는 E1b 영역을 보유하는 것이 바람직하다. 이러한 무결함 E1 영역은 아데노바이러스 게놈에서 그것의 본래 위치에서 위치될 수도 있고 또는 본래 아데노바이러스 게놈에서 결실 자리(예를 들어, E3 영역)에 위치될 수도 있다.
인간(또는 다른 포유동물) 세포에 유전자의 전달을 위해 유용한 유인원 아데노바이러스 벡터의 구성에서, 아데노바이러스 핵산 서열의 범위는 벡터에서 사용될 수 있다. 예를 들어, 모든 또는 일부의 아데노바이러스 지연 초기 유전자 E3은 재조합 바이러스의 부분을 형성하는 유인원 아데노바이러스 서열로부터 제거될 수 있다. 유인원 E3의 기능은 재조합 바이러스 입자의 기능 및 생성에 무관한 것으로 믿어진다. 유인원 아데노바이러스 벡터는 또한 E4 유전자의 적어도 ORF6 영역의 결실을 가지도록, 더 바람직하게는 이 영역, 전체 E4 영역 기능의 불필요한 중복 때문에 구성될 수 있다. 본 발명의 또 다른 벡터는 지연된 초기 유전자 E2a에서 결실을 함유한다. 결실은 또한 유인원 아데노바이러스 게놈의 L5를 통해 어떤 말기 유전자 L1에서 만들어질 수 있다. 유사하게, 중간 유전자 IX 및 IVa2의 결실은 일부 목적에 유용할 수 있다. 다른 결실은 다른 구조적 또는 비-구조적 아데노바이러스 유전자에서 만들어질 수 있다. 상기 논의된 결실은 개개로 사용될 수 있고, 즉, 본원에 기술되는 바와 같은 사용을 위한 아데노바이러스 서열은 단지 단일 영역에서 결실을 함유할 수 있다. 또 다르게는, 전체 유전자 또는 그것의 생물학적 활성을 파괴하는데 효과적인 그것의 부분의 결실은 어떤 조합으로 사용될 수 있다. 예를 들어, 한 예시적인 벡터에서, 아데노바이러스 서열은 E1 유전자 및 E4 유전자, 또는 E1, E2a 및 E3 유전자, 또는 E1 및 E3 유전자, 또는 E3 등의 결실과 함께 또는 결실 없이, E1, E2a 및 E4 유전자의 결실을 가질 수 있다. 상기 논의한 바와 같이, 이러한 결실은 원하는 결과를 이루기 위해 온도-민감 돌연변이와 같은 다른 돌연변이와 조합하여 사용될 수 있다.
어떤 필수 아데노바이러스 서열을 결핍하는 아데노바이러스 벡터(예를 들어, E1a, E1b, E2a, E2b, E4 ORF6, L1, L2, L3, L4 및 L5)는 아데노바이러스 입자의 바이러스 전염력 및 증식에 필요로 되는 비교대상 외 아데노바이러스 유전자 생성물의 존재하에서 배양될 수 있다. 이들 헬퍼 기능은 하나 이상의 헬퍼 구조체(예를 들어, 플라스미드 또는 바이러스) 또는 패키징 숙주 세포의 존재하에서 아데노바이러스 벡터를 배양함으로써 제공될 수 있다. 예를 들어, 1996년 5월 9일 공개되고, 본원에 참고로써 포함된 국제 특허 출원 WO96/13597의 "최소의" 인간 Ad 벡터의 제조에 대해 설명된 기술을 참조.
1. 헬퍼 바이러스
따라서, 미니유전자을 전달하는데 사용되는 바이러스 벡터의 유인원 아데노바이러스 유전자 함량에 의존하여, 헬퍼 아데노바이러스 또는 비-복제 바이러스 단편이 미니유전자를 함유하는 감염 재조합 바이러스 입자를 생성하는데 필요한 충분한 유인원 아데노바이러스 유전자 서열을 제공하기 위해 필요할 수 있다. 유용한 헬퍼 바이러스는 아데노바이러스 벡터 구조체에서 존재하지 않는 및/또는 벡터가 트랜스펙팅되는 패키징 셀 라인에 의해 발현되지 않는 선택된 아데노바이러스 유전자 서열을 함유한다. 한 구체예에서, 헬퍼 바이러스는 복제-결함이며, 상기 기술된 서열에 더하여 다양한 아데노바이러스 유전자를 함유한다. 이러한 헬퍼 바이러스는 E1-발현 셀 라인과 조합하여 바람직하게 사용된다.
헬퍼 바이러스는 또한 Wu et al, J. Biol . Chem ., 264:16985-16987 (1989); K. J. Fisher 및 J. M. Wilson, Biochem. J., 299:49 (1994년 4월 1일)에서 기술된 바와 같은 폴리-양이온 콘쥬게이트로 형성될 수 있다. 헬퍼 바이러스는 선택적으로 제 2 리포터 미니유전자를 함유할 수 있다. 다수의 이러한 리포터 유전자는 당업계에 공지되어 있다. 아데노바이러스 벡터에서 이식 유전자와 다른 헬퍼 바이러스 상의 리포터 유전자의 존재는 독립적으로 모니터링되는 Ad 벡터와 헬퍼 바이러스 둘 다를 허용한다. 이런 제 2 리포터는 정제 시 결과 재조합 바이러스와 헬퍼 바이러스 사이의 분리를 가능하게 하는데 사용된다.
2. 상보성 셀 라인
상기 기술된 어떤 유전자에서 결실된 재조합 유인원 아데노바이러스(Ad)를 발생시키기 위해, 바이러스의 복제 및 전염력에 필수적이라면, 결실된 유전자 영역의 기능은 헬퍼 바이러스 또는 셀 라인, 즉, 상보성 또는 패키징 셀 라인에 의해 재조합 바이러스에 공급되어야 한다. 많은 환경에서, 인간 E1을 발현시키는 셀 라인은 침팬지 Ad 벡터를 서로 보완하기 위해 사용될 수 있다. 본 발명의 침팬지 Ad 서열과 현재 이용가능한 패키징 세포에서 발견되는 인간 AdE1 서열 사이의 다양성에 기인하여, 현재 인간 E1-함유 세포의 사용이 복제 및 생성 과정 동안 복제-가능 아데노바이러스의 생성을 방지하기 때문에 이는 특히 유리하다. 그러나, 특정 환경에서, E1 유전자 생성물을 발현시키고 E1-결핍 유인원 아데노바이러스의 생성에 이용될 수 있는 셀 라인을 이용하는 것이 바람직할 것이다. 이러한 셀 라인은 기술되었다. 예를 들어, 미국 특허 6,083,716호 참조.
원한다면, 선택된 모 셀 라인에서 발현을 위한 프로모터의 전사 조절 하에서 SAdV28로부터 아데노바이러스 E1 유전자를 최소한으로 발현시키는 패키징 세포 또는 셀 라인을 발생시키기 위해 본원에 제공되는 서열을 이용할 수 있다. 유도성 또는 구성적 프로모터는 이 목적을 위해 사용될 수 있다. 이러한 프로모터의 예는 본 명세서의 어디에서나 상세하게 설명된다. 모 세포는 어떤 요망되는 SAdV28 유전자를 발현시키는 신규 셀 라인의 생성을 위해 선택된다. 제한 없이, 이러한 모 셀 라인은 특히 HeLa [ATCC Accession No. CCL 2], A549 [ATCC Accession No. CCL 185], HEK 293, KB [CCL 17], Detroit [예를 들어, Detroit 510, CCL 72] 및 WI-38 [CCL 75] 세포일 수 있다. 이들 셀 라인은 모두 American Type Culture Collection, 10801 University Boulevard, Manassas, Virginia 20110-2209로부터 이용가능하다. 다른 적당한 모 셀 라인은 다른 공급원으로부터 획득될 수 있다.
이러한 E1-발현 셀 라인은 재조합 유인원 아데노바이러스 E1 결실 벡터의 생성에서 유용하다. 추가적으로, 또 다르게는, 하나 이상의 유인원 아데노바이러스 유전자 생성물, 예를 들어, E1a, E1b, E2a, 및/또는 E4 ORF6을 발현시키는 셀 라인은 재조합 유인원 바이러스 벡터의 생성에서 사용되는 바와 같은 본질적으로 동일한 과정을 사용하여 구성될 수 있다. 이러한 셀 라인은 그런 생성물을 암호화하는 필수적 유전자에서 결실된 아데노바이러스를 서로보완하기 위해, 또는 헬퍼-의존 바이러스(예를 들어, 아데노-관련 바이러스)의 패키징에 필요한 헬퍼 기능을 제공하기 위해 이용될 수 있다. 숙주 세포의 제조는 선택된 DNA 서열의 조합과 같은 기술을 수반한다. 이 조합은 통상적인 기술을 이용하여 수행될 수 있다. 이러한 기술은 폴리머라아제 연쇄 반응, 합성 방법, 및 원하는 뉴클레오티드 서열을 제공하는 어떤 다른 적당한 방법과 조합된, 잘 공지되어 있고 상기 인용한 Sambrook et al.에서 기술되는 cDNA 및 게놈 클로닝, 아데노바이러스 게놈의 중복 올리고뉴클레오티드 서열의 사용을 포함한다.
또 다른 대안으로, 필수적인 아데노바이러스 유전자 생성물이 아데노바이러스 벡터 및/또는 헬퍼 바이러스에 의해 트랜스에서 제공된다. 이러한 예에서, 적절한 숙주 세포는 원핵(예를 들어, 박테리아) 세포를 포함하는 어떤 생물학적 유기체, 및 곤충 세포, 효모 세포 및 포유동물 세포를 포함하는 진핵세포로부터 선택될 수 있다. 특히 바람직한 숙주 세포는, 제한 없이, A549, WEHI, 3T3, 10T1/2, HEK 293 세포 또는 PERC6 (이들 둘 다 기능적 아데노바이러스 E1을 발현시킨다) [Fallaux, FJ et al, (1998), Hum Gene Ther, 9:1909-1917], Saos, C2C12, L 세포, HT1080, HepG2 및 일차 섬유아세포, 인간, 원숭이, 마우스, 래트, 토끼 및 햄스터를 포함하는 포유동물로부터 유래된 간세포 및 근원세포와 같은 세포를 포함하는 어떤 포유동물 종 중에서 선택된다. 세포를 제공하는 포유동물 종의 선택은 본 발명을 제한하지 않으며; 포유동물 세포, 즉, 섬유아세포, 간세포, 종양 세포 등의 종류도 아니다.
3. 셀 라인의 바이러스 입자 및 트랜스펙션의 조합
일반적으로, 트랜스펙션에 의해 미니유전자를 포함하는 벡터를 전달할 때, 벡터는 약 1 x 104 세포 내지 약 1 x 1013 세포, 및 바람직하게는 약 105 세포에서 약 5 μg 내지 약 100 μg DNA, 및 바람직하게는 약 10 내지 약 50 μg DNA의 양으로 전달된다. 그러나, 선택된 벡터, 전달 방법 및 선택된 숙주 세포로서 고려하여, 숙주 세포에서 벡터 DNA의 상대적 양은 조절될 수 있다.
벡터는 네이키드 DNA, 플라스미드, 파지, 트랜스포존, 코스미드, 에피솜, 바이러스 등을 포함하여 당업계에 알려진 또는 상기 기재된 어떤 벡터일 수 있다. 벡터의 숙주 세포에 도입은 트랜스펙션, 및 감염을 포함하는 당업계에 공지된 또는 상기 기재된 바와 같은 어떤 수단에 의해 달성될 수 있다. 하나 이상의 아데노바이러스 유전자는 숙주 세포의 게놈에 안정적으로 통합되고, 에피솜으로서 안정적으로 발현되고, 또는 일시적으로 발현될 수 있다. 유전자 생성물은 모두 에피솜에서 일시적으로 발현되거나 안정적으로 통합될 수 있고, 또는 유전자 생성물의 일부는 안정적으로 발현되는 반면, 나머지는 일시적으로 발현될 수도 있다. 추가로, 각각의 아데노바이러스 유전자의 프로모터는 구성적 프로모터, 유도성 프로모터 또는 본래 아데노바이러스 프로모터로부터 독립적으로 선택될 수 있다. 프로모터는 예를 들어, 유기체 또는 세포의 특이적 생리학적 상태에 의해(즉, 분화상태에 의해 또는 복제 또는 정지 세포(quiescent cell)에서) 또는 외인성으로-첨가된 인자에 의해 조절될 수 있다.
숙주 세포에 분자(플라스미드 또는 바이러스)의 도입은 또한 당업자에게 공지되고, 본 명세서를 통해 논의되는 바와 같은 기술을 사용하여 수행될 수 있다. 바람직한 구체예에서, 표준 트랜스펙션 기술, 예를 들어, CaPO4 트랜스펙션 또는 전기천공법이 사용된다.
재조합 바이러스 입자를 생성하기 위해 아데노바이러스의 선택된 DNA 서열뿐만 아니라 이식 유전자 및 다른 벡터 요소의 다양한 중간체 플라스미드에의 조합, 및 플라스미드 및 벡터의 사용은 통상적인 기술을 사용하여 모두 달성된다. 이러한 기술은 문헌[Sambrook et al, 상기 인용]에서 기술되는 것과 같은 cDNA의 통상적인 클로닝 기술, 아데노바이러스 게놈의 중복 올리고뉴클레오티드 서열의 사용, 폴리머라아제 연쇄 반응, 및 원하는 뉴클레오티드 서열을 제공하는 어떤 적당한 방법을 포함한다. 표준 트랜스펙션 및 공동-트랜스펙션 기술, 예를 들어, CaPO4 침지 기법이 사용된다. 사용되는 다른 통상적인 방법은 바이러스 게놈의 상동 재조합, 한천중층에서 바이러스의 플라크, 신호 생성을 측정하는 방법 등을 포함한다.
예를 들어, 원하는 미니유전자-함유 바이러스 벡터의 구성 및 조합에 따라서, 벡터는 헬퍼 바이러스의 존재하에서 패키징 셀 라인 안으로 시험관 내에서 트랜스펙팅된다. 상동 재조합은 헬퍼와 벡터 서열 사이에서 발생하며, 이는 비리온 캡시드로 복제되고 패키징되는 벡터에서 아데노바이러스-이식 유전자 서열을 허용하여, 재조합 바이러스 벡터 입자를 초래한다. 이러한 바이러스 입자를 생성하는 현재의 방법은 트랜스펙션에 기초한다. 그러나, 본 발명은 이러한 방법에 제한되지 않는다.
결과 재조합 유인원 아데노바이러스는 선택된 이식 유전자가 선택된 세포로 이동하는데 유용하다. 패키징 셀 라인에서 성장한 재조합 바이러스에 의한 생체내 실험에서, 본 발명의 E1-결실 재조합 유인원 아데노바이러스 벡터는 이식 유전자를 비-유인원, 바람직하게는 인간, 세포에 이동시키는데 유용함을 증명한다.
IV. 재조합 아데노바이러스 벡터의 사용
재조합 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 기초 벡터는 시험관내, 생체 밖, 및 생체 내 인간 또는 비-유인원 수의과 환자에서 유전자 전달에 유용하다.
본원에 기술되는 재조합 아데노바이러스 벡터는 시험관내 이종성 유전자에 의해 암호화되는 생성물의 생성을 위한 발현 벡터로서 사용될 수 있다. 예를 들어, E1 결실의 위치로 삽입되는 유전자를 함유하는 재조합 아데노바이러스는 상기 기술한 바와 같은 E1-발현 셀 라인에 트랜스펙팅될 수 있다. 또 다르게는, 복제-가능 아데노바이러스는 다른 선택된 셀 라인에서 사용될 수 있다. 트랜스펙팅된 세포는 그 후 통상적인 방법으로 배양되고, 프로모터로부터 유전자 생성물을 발현시키기 위한 재조합 아데노바이러스를 허용한다. 유전자 생성물은 그 후 배양물로부터 단백질 분리 및 회수의 공지된 통상적인 방법에 의해 배양물 배지로부터 회수될 수 있다.
SAdV39, SAdV-25.2, -26, -30, -37 또는 -38-유래 재조합 유인원 아데노바이러스 벡터는 생체내 또는 생체밖에서 조차 선택되는 숙주 세포에 선택된 이식 유전자를 전달할 수 있는 효율적인 유전자 전달 비히클을 제공하며, 유기체는 하나 이상의 AAV 항원형에서 중화 항체를 가진다. 한 구체예에서, rAAV 및 세포는 생체밖에서 혼합되고; 감염 세포는 통상적인 방법을 사용하여 배양되며; 형질도입된 세포는 환자에 재주입된다. 이들 조성물은 치료적 목적 및 보호 면역을 유발하는 것을 포함하는 면역을 위한 유전자 전달에 특히 적합하다.
더 흔하게는, SAdV39, SAdV-25.2, -26, -30, -37 또는 -38 재조합 아데노바이러스 벡터는 하기 기술되는 바와 같은 치료 또는 면역 분자의 전달을 위해 이용될 것이다. 본 발명의 재조합 아데노바이러스 벡터는 재조합 아데노바이러스 벡터의 반복 전달을 수반하는 요법에서 사용에 특히 적합하다는 것이 두 용도에 대해 용이하게 이해될 것이다. 이러한 요법은 전형적으로 바이러스 캡시드가 변형되는 일련의 바이러스 벡터의 전달을 수반한다. 바이러스 캡시드는 각각의 이후의 투여를 위해, 또는 특정 항원형 캡시드의 미리-선택된 수(예를 들어, 1, 2, 3, 4 또는 그 이상)의 투여 후 변형될 수 있다. 따라서, 요법은 제 1 유인원 캡시드와 함께 rAd의 전달, 제 2 유인원 캡시드와 함께 rAd의 전달, 및 제 3 유인원 캡시드와 함께 전달을 수반할 수 있다. 본 발명의 Ad 캡시드를 단독으로, 다른 것과 조합하여, 또는 다른 아데노바이러스와 조합하여(바람직하게는 면역적으로 비-교차반응임) 사용하는 다양한 다른 요법은 당업자에게 명백할 것이다. 선택적으로, 이러한 요법은 다른 비-인간 영장류 아데노바이러스, 인간 아데노바이러스, 또는 본원에 기술되는 것과 같은 인공 서열의 캡시드와 함께 rAd의 투여를 수반할 수 있다. 요법의 각 단계는 단일 Ad 캡시드로 일련의 주입(또는 다른 전달 경로) 후 다른 Ad 공급원으로부터 일련의 다른 캡시드의 투여를 수반할 수 있다. 또 다르게는, SAdV39, SAdV-25.2, -26, -30, -37 또는 -38 벡터는 다른 바이러스 시스템, 비-바이러스 전달 시스템, 단백질, 펩티드, 및 다른 생물학적으로 활성인 분자를 포함하는 다른 비-아데노바이러스-매개 전달 시스템을 수반하는 요법에서 이용될 수 있다.
하기의 섹션은 본 발명의 아데노바이러스 벡터를 통해 전달될 수 있는 예시적인 분자에 초점을 맞출 것이다.
A. 치료 분자의 Ad-매개 전달
한 구체예에서, 상기-기술된 재조합 벡터는 유전자 치료를 위해 공개된 방법에 따라서 인간에 투여된다. 선택된 이식 유전자를 함유하는 유인원 바이러스 벡터는 환자에 투여될 수 있으며, 바람직하게는 생물학적으로 양립가능한 용액 또는 약학적으로 허용가능한 전달 비히클에서 현탁된다. 적당한 비히클은 멸균 식염수를 포함한다. 약학적으로 허용가능한 담체로 공지되고 당업자에게 잘 알려진 다른 수성 및 비-수성 등장 멸균 주사 용액 및 수성 및 비-수성 멸균 현탁액은 본 목적을 위해 사용될 수 있다.
유인원 아데노바이러스 벡터는 표적 세포를 형질도입하고 유전자 전달 및 발현의 충분한 수준을 제공하는데 충분한 양으로 투여되어, 지나친 불리함 없이 또는 의학적으로 허용가능한 생리적인 효과와 함께 치료적 이점을 제공하며, 이는 의학 분야의 당업자에 의해 결정될 수 있다. 투여의 통상적인 및 약학적으로 허용가능한 경로는, 제한되는 것은 아니지만, 망막에 직접적인 전달 및 다른 안구 전달 방법, 간에 직접적인 전달, 흡입, 비강내, 정맥내, 근육내, 기관내, 피하, 피내, 직장, 경구 및 다른 비경구 투여 경로를 포함한다. 투여 경로는, 원한다면, 이식 유전자 또는 질환에 따라서 조합 또는 조절될 수 있다. 투여 경로는 주로 치료되는 질환의 특성에 의존할 것이다.
바이러스 벡터의 투약은 치료되는 질환, 환자의 연령, 체중 및 건강상태와 같은 요인에 주로 의존할 것이고, 따라서 환자들 사이에서 다양할 수 있다. 예를 들어, 바이러스 벡터의 치료적으로 유효한 성인 인간 또는 수의과 투약량은 일반적으로 약 1 x 106 내지 약 1 x 1015 입자, 약 1 x 1011 내지 1 x 1013 입자, 또는 약 1 x 109 내지 1 x 1012 입자 바이러스의 농도를 함유하는 담체의 약 100 μL 내지 약 100 mL의 범위에 있다. 투약량은 동물의 크기 및 투여 경로에 의존하는 범위에 있을 것이다. 예를 들어, 근육내 주사에 대해 적당한 인간 또는 수의적 투약량(약 80 kg 동물)은 단일 자리에 대해 mL 당, 약 1 x 109 내지 약 5 x 1012 입자의 범위에 있다. 선택적으로, 투여의 다양한 자리는 전달될 수 있다. 다른 예에서, 적당한 인간 또는 수의적 투여는 경구 제형에 대해 약 1 x 1011 내지 약 1 x 1015 입자의 범위에 있을 수 있다. 당업자는 투여 경로, 및 재조합 벡터가 사용되기 위한 치료 또는 백신 용도에 따라서 이들 용량을 조절할 수 있다. 이식 유전자의 발현 수준, 또는 면역원, 순환 항체의 수준은 투약량 투여의 빈도를 결정하기 위해 모니터링 될 수 있다. 투여 빈도의 시간을 결정하기 위한 또 다른 방법은 당업자에게 용이하게 명백할 것이다.
선택적 방법 단계는 바이러스 벡터의 투여와 동시에, 또는 전 또는 후에 적당한 양의 짧은 작동 면역 조절자의 환자에서 공동-투여를 수반한다. 선택된 면역 조절자는 본 발명의 재조합 벡터에 대해 관련된 중화 항체의 형성을 억제할 수 있는 또는 벡터의 T 림프구 (CTL) 제거를 억제할 수 있는 약제로서 본원에 정의된다. 면역 조절자는 T 헬퍼 서브셋(TH1 또는 TH2)과 B 세포 사이에서 상호작용을 방해하여 중화 항체 형성을 억제할 수 있다. 또 다르게는, 면역 조절자는 TH1 세포와 CTL 사이의 상호작용을 억제하여 벡터의 CTL 제거의 발생을 감소시킬 수 있다. 다양한 유용한 면역 조절자 및 그것의 사용을 위한 투약량은, 예를 들어, Yang et al., J. Virol., 70(9) (Sept., 1996); 1996년 5월 2일 공개된 국제 특허 출원 번호 WO96/12406; 및 본원에 모두 참고로써 포함되는 국제 특허출원 번호 PCT/US96/03035에서 개시된다.
1. 치료 이식 유전자
이식 유전자에 의해 암호화되는 유용한 치료적 생성물은, 제한 없이, 인슐린, 글루카곤, 성장 호르몬(GH), 파라티로이드 호르몬(PTH), 성장 호르몬 방출 인자(GRF), 여포 자극 호르몬(FSH), 황체 형성 호르몬(LH), 인간 융모성 고나도트로핀(hCG), 혈관내피성장인자(VEGF), 엔지오포이에틴, 엔지오스태틴, 백혈구조혈성장인자 (GCSF), 에리스로포이에틴(EPO), 결합조직 성장인자(CTGF), 염기성 섬유아세포 성장인자 (bFGF), 산성 섬유아세포 성장인자(aFGF), 상피세포성장인자(EGF), 형질전환 성장인자 (TGF), 혈소판 유래 성장인자 (PDGF), 인슐린 성장 인자 1 및 II (IGF-I 및 IGF-II), TGF, 액티빈, 인히빈을 포함하는 형질전환 성장 인자 수퍼패밀리의 어떤 하나, 또는 어떤 뼈 형성 단백질(BMP) BMPs 1-15, 성장 인자의 헤레귤인/뉴레귤린/ARIA/neu 분화 인자(NDF) 패밀리 중 어떤 하나, 신경 성장인자(NGF), 뇌-유래 신경 친화성 인자(BDNF), 뉴로트로핀 NT-3 및 NT-4/5, 섬모 향신경성 인자(CNTF), 신경아교세포계 유래 신경영양 인자(GDNF), 뉴투린, 애그린, 세마포린/콜랩신의 패밀리 중 어떤 하나, 네트린-1 및 네트린-2, 간세포성장인자(HGF), 에프린, 노긴, 소닉 헤지호그 및 티로신 히드록실라아제를 포함하는 호르몬 및 성장 및 분화 인자를 포함한다.
다른 유용한 이식 유전자 생성물은, 제한 없이, 사이토카인 및 림포카인, 예로써, 트롬보포이에틴(TPO), IL-25를 통한 인터류킨(IL) IL-1(예를 들어, IL-2, IL-4, IL-12 및 IL-18을 포함), 단핵세포 화학유인물질 단백질, 백혈병 억제 인자, 과립성 백혈구 - 대식세포 집락 자극인자, 파스(Fas) 리간드, 종양 괴사 인자 및, 인터페론, 및 줄기 세포 인자, flk-2/flt3 리간드를 포함하는 면역 체계를 조절하는 단백질을 포함한다. 면역 체계에 의해 생성되는 유전자 생성물은 또한 본 발명에 유용하다. 이들은, 제한 없이, 면역글로불린 IgG, IgM, IgA, IgD 및 IgE, 키메라 면역글로불린, 인간화된 항체, 단일쇄 항체, T 세포 수용체, 키메라 T 세포 수용체, 단일쇄 T 세포 수용체, 클래스 I 및 클래스 II MHC 분자, 및 공학변형된 면역글로불린 및 MHC 분자를 포함한다. 유용한 유전자 생성물은 또한 상보적 조절 단백질, 막 보조 단백질(MCP), 붕괴 촉진인자(DAF), CR1, CF2 및 CD59를 포함한다.
또 다른 유용한 유전자 생성물은 호르몬, 성장 인자, 사이토카인, 림포카인, 조절 단백질 및 면역 체계 단백질에 대한 수용체 중 어떤 하나를 포함한다. 본 발명은 저밀도 리포단백질(LDL) 수용체, 고밀도 리포단백질(HDL) 수용체, 매우 낮은 밀도 리포단백질(VLDL) 수용체, 및 스캐빈저 수용체를 포함하는 콜레스테롤 조절을 위한 수용체를 포함한다. 본 발명은 또한 글루코코르티코이드 수용체 및 에스트로겐 수용체, 비타민 D 수용체 및 다른 핵 수용체를 포함하는 스테로이드 호르몬 수용체 수퍼패밀리의 멤버와 같은 유전자 생성물을 포함한다. 게다가, 유용한 유전자 생성물은 전사 인자, 예컨대, jun , fos, max, mad, 혈청반응인자(SRF), AP-1, AP2, myb, MyoD 및 마이오제닌, ETS-박스 함유 단백질, TFE3, E2F, ATF1, ATF2, ATF3, ATF4, ZF5, NFAT, CREB, HNF-4, C/EBP, SP1, CCAAT-박스 결합 단백질, 인터페론 조절 인자 (IRF-1), 윌름 종양 단백질, ETS-결합 단백질, STAT, GATA-박스 결합 단백질, 예컨대, GATA-3, 및 날개달린(winged) 나선형 단백질의 포크헤드(forkhead) 패밀리를 포함한다.
다른 유용한 생성물은, 카르바모일 합성효소 I, 오르니틴 트랜스카르바밀라아제, 아르기노숙시네이트 합성효소, 아르기노숙시네이트 리아제, 아르기나아제, 푸마릴아세트아세테이트 가수분해효소, 페닐알라닌 가수분해효소, 알파-1 안티트립신, 글루코오스-6-포스페이트, 포르포빌리노겐 디아미나아제, 인자 VIII, 인자 IX, 시스타티온 베타-합성효소, 가지사슬 케토산 데카르복실라아제, 알부민, 이소발레릴-coA 탈수소효소, 프로피오닐 CoA 카르복실라아제, 메틸 말로닐 CoA 무타아제, 글루타릴 CoA 탈수소효소, 인슐린, 베타-글루코시다아제, 파이루베이트 카르복실레이트, 간 포스포릴라아제, 포스포릴라아제 키나아제, 글리신 데카르복실라아제, H-단백질, T-단백질, 낭포성 섬유증 막단백질 조절자(CFTR) 서열, 및 디스트로핀 cDNA 서열을 포함한다.
다른 유용한 유전자 생성물은 비-천연적으로 발생하는 폴리펩티드, 예컨대, 삽입, 결실 또는 아미노산 치환을 함유하는 비-천연적으로 발생하는 아미노산 서열을 가지는 키메라 또는 하이브리드 폴리펩티드를 포함한다. 예를 들어, 단일-사슬 공학변형된 면역글로불린은 특정 면역타협 환자에서 유용할 수 있다. 비-천연적으로 발생하는 유전자 서열의 다른 종류는 표적의 과발현을 감소시키는데 사용될 수 있는 안티센스 분자 및 촉매적 핵산, 예를 들어 리보자임을 포함한다.
유전자 발현의 감소 및/또는 조절은 암 및 건선과 같이 이상증식 세포를 특징으로 하는 이상증식 질환의 치료에 특히 바람직하다. 표적 폴리펩티드는 정상 세포와 비교하여 이상증식 세포에서 배타적으로 또는 더 높은 수준으로 생성되는 폴리펩티드를 포함한다. 표적 항원은 myb, myc, fyn, 및 전좌 유전자 bcr/abl, ras, src, P53, neu, trk 및 EGRF과 같은 종양유전자에 의해 암호화되는 폴리펩티드를 포함한다. 표적 항원으로서 종양유전자 생성물에 더하여, 항-암 치료를 위한 표적 폴리펩티드 및 보호 요법은 B 세포 림프종에 의해 만들어지는 항체의 가변 영역 및 T 세포 림프종의 T 세포 수용체의 가변 영역을 포함하며, 일부 구체예에서, 이는 또한 자가면역 질병에 대한 표적 항원으로서 사용된다. 다른 종양-관련 폴리펩티드는 모노클로날 항체 17-1A 및 폴레이트 결합 폴리펩티드에 의해 인식되는 폴리펩티드를 포함하는 종양 세포에서 더 높은 수준으로 발견되는 폴리펩티드와 같은 표적 폴리펩티드로서 사용될 수 있다.
다른 적당한 치료 폴리펩티드 및 단백질은 세포 수용체 및 자기-관련 항체를 생성하는 세포를 포함하는 자가면역과 관련된 표적에 대하여 광범위 기초 보호 면역 반응을 부여함으로써 자가면역 질병 및 장애를 겪고 있는 개체를 치료하는데 유용할 수 있는 것들을 포함한다. T 세포 매개 자가면역 질병은 류마티스 관절염(RA), 다발성경화증(MS), 쇼그렌증후군, 유육종증, 인슐린 의존성 당뇨병(IDDM), 자가면역성 갑상샘염, 반응성 관절염, 강직성 척추염, 경피증, 다발성 근염, 건선, 혈관염, 베게너 육아종증, 크론병 및 궤양성 대장염을 포함한다. 각각의 이들 질병은 내인성 항원에 결합하고 자가면역 질병에 관련되는 면역 캐스캐이드를 시작하는 T 세포 수용체(TCR)를 특징으로 한다.
본 발명의 유인원 아데노바이러스 벡터는 특히 이식 유전자의 다양한 아데노바이러스-매개 전달이 요망되는 치료 요법에서, 예를 들어, 동일한 이식 유전자의 회복을 수반하는 요법에서 또는 다른 이식 유전자의 전달을 수반하는 요법과 조합하여 적합하다. 이러한 요법은 SAdV39, SAdV-25.2, -26, -30, -37 또는 -38 유인원 아데노바이러스 벡터의 투여 후, 동일한 항원형 아데노바이러스로부터의 벡터와 함께 재-투여를 수반할 수 있다. 특히 바람직한 요법은 SAdV39, SAdV-25.2, -26, -30, -37 또는 -38 유인원 아데노바이러스 벡터의 투여를 수반하며, 제 1 투여로 전달되는 벡터의 아데노바이러스 캡시드 서열의 공급원은 하나 이상의 이후의 투여에서 이용되는 바이러스 벡터의 아데노바이러스 캡시드 서열의 공급원과 다르다. 예를 들어, 치료 요법은 SAdV39, SAdV-25.2, -26, -30, -37 또는 -38 벡터의 투여 및 동일 또는 다른 항원형의 하나 이상의 아데노바이러스 벡터에 의한 반복 투여를 수반한다. 다른 예에서, 치료 요법은 아데노바이러스 벡터의 투여 후 제 1 전달 아데노바이러스 벡터에서 캡시드의 공급원과 다른 캡시드를 가지는 SAdV39, SAdV-25.2, -26, -30, -37 또는 -38 벡터에 의한 반복 투여, 및 선택적으로 투여 단계 전 벡터의 아데노바이러스 캡시드의 공급원과 동일한 또는, 바람직하게는 다른, 벡터에 의한 투여를 수반한다. 이들 요법은 SAdV39, SAdV-25.2, -26, -30, -37 또는 -38 유인원 서열을 사용하여 구성되는 아데노바이러스 벡터의 전달에 제한되지 않는다. 오히려, 이들 요법은, 제한 없이, 하나 이상의 SAdV39, SAdV-25.2, -26, -30, -37 또는 -38 벡터와 조합하여, 다른 유인원 아데노바이러스 서열(예를 들어, Pan9 또는 C68, C1, 등), 다른 비-인간 영장류 아데노바이러스 서열, 또는 인간 아데노바이러스 서열을 포함하는 다른 아데노바이러스 서열을 용이하게 이용할 수 있다. 이러한 유인원, 다른 비-인간 영장류 및 인간 아데노바이러스 항원형의 예는 본 문서에서 어디에서나 논의된다. 추가로, 이 치료 요법은 비-아데노바이러스 벡터, 비-바이러스 벡터, 및/또는 다양한 다른 치료적으로 유용한 화합물 또는 분자와 조합하여 SAdV39, SAdV-25.2, -26, -30, -37 또는 -38 아데노바이러스 벡터의 자발적 또는 순차적 전달을 수반할 수 있다. 본 발명은 이들 치료 요법에 제한되지 않으며, 다양한 것들이 당업자에게 용이하게 명확할 것이다.
B. 면역성 이식 유전자의 Ad-매개 전달
재조합 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 벡터는 또한 면역성 조성물로서 사용될 수 있다. 본원에 사용된 바와 같이, 면역성 조성물은 체액(예를 들어, 항체) 또는 세포(예를 들어, 세포독성 T 세포) 반응이 포유동물, 및 바람직하게는 영장류에 전달 후 면역성 조성물에 의해 전달되는 이식 유전자 생성물에 고정되는 조성물이다. 재조합 유인원 Ad는 원하는 면역원을 암호화하는 그것의 아데노바이러스 서열 결실 유전자 중 어떤 것을 함유할 수 있다. 유인원 아데노바이러스는 인간 기원의 아데노바이러스와 비교하여 다른 동물 종에서 살아있는 재조합 바이러스 백신으로서 사용에 더 적합할 가능성이 있지만, 이러한 사용에 제한되는 것은 아니다. 재조합 아데노바이러스는 면역 반응의 유발에 결정적이고 병원체의 확산을 제한할 수 있다는 것이 확인된 항원(들)에 대한 어떤 병원체 및 cDNA에 대해 이용가능한 어떤 병원체에 대하여 예방 또는 치료 백신으로서 사용될 수 있다.
이러한 백신(또는 다른 면역원) 조성물은 상기 기술한 바와 간은 적당한 전달 비히클에서 제형화된다. 일반적으로 면역성 조성물에 대한 용량은 치료 조성물에 대해 상기 정의한 범위에 있다. 선택 유전자의 면역의 수준은, 만약에 있다면, 부스터에 대한 필요를 결정하기 위해 모니터링될 수 있다. 혈청에서 항체 타이터의 평가에 따라서, 선택적인 부스터 면역이 요망될 수 있다.
선택적으로, 본 발명의 백신 조성물은, 예를 들어, 보조제, 안정화제, pH 조절제, 보존제 등을 포함하는 다른 성분을 함유하기 위해 제형화될 수 있다. 이러한 성분은 백신 업계에서 당업자에게 잘 공지되어 있다. 적당한 보조제의 예는, 제한없이, 리포좀, 알륨, 모노포스포릴 지질 A, 및 어떤 생물학적으로 활성인 인자, 예로써, 사이토카인, 인터류킨, 케모킨, 리간드 및 최상으로는 그것의 조합을 포함한다. 특정의 이들 생물학적으로 활성인 인자는 생체내, 예를 들어, 플라스미드 또는 바이러스 벡터를 통해 발현될 수 있다. 예를 들어, 이러한 보조제는 항원만을 암호화하는 DNA 백신과 함께 프라이밍 시 발생되는 면역 반응과 비교하여, 항원-특이적 면역 반응을 향상시키기 위해 항원을 암호화하는 프라이밍 DNA 백신과 함께 투여될 수 있다.
재조합 아데노바이러스는 "면역원 양", 즉, 원하는 세포를 트랜스펙팅하기 위해 투여되는 경로에서 효과적이고, 면역 반응을 유발하기 위해 선택되는 유전자의 발현의 충분한 수준을 제공하는 양으로 투여된다. 보호 면역이 제공되는 경우, 재조합 아데노바이러스는 감염 및/또는 재발을 예방하는데 유용한 백신 조성물이 되는 것으로 고려된다.
또 다르게는, 또는 추가로, 본 발명의 벡터는 선택된 면역원에 대한 면역 반응을 유발하는 펩티드, 폴리펩티드 또는 단백질을 암호화하는 이식 유전자를 함유한다. 재조합 SAdV 벡터는 벡터에 의해 발현되는 삽입된 이종성 항원 단백질에서 세포 용해 T 세포 및 항체를 유발할 때 매우 효율적인 것으로 기대된다.
예를 들어, 면역원은 다양한 바이러스 과로부터 선택될 수 있다. 면역 반응이 바람직한 바이러스 패밀리의 예는, 보통 감기의 약 50%의 경우를 초래하는 리노바이러스 속; 폴리오바이러스, 콕사키 바이러스, 에코바이러스 및 A형 간염 바이러스와 같은 인간 엔테로바이러스를 포함하는 엔테로바이러스 속; 및 주로 비-인간 동물에서 발 및 구강 질병을 초래하는 압소바이러스 속을 포함하는 피코르나바이러스 과를 포함한다. 바이러스의 피코르나바이러스 과 내에서, 표적 항원은 VP1, VP2, VP3, VP4, 및 VPG를 포함한다. 다른 바이러스 과는 바이러스의 노워크(Norwalk) 군을 포함하고, 유행성 위장염의 중요한 감염인자인 칼시바이러스 과를 포함한다. 인간 및 비-인간 동물에서 면역 반응을 유발하기 위한 표적 항원에서 사용에 바람직한 또 다른 바이러스 과는 토가바이러스 과이며, 이는 신드비스 바이러스, 로스리버 바이러스 및 베네수엘라, 동부형&서부형 마뇌염, 및 루벨라 바이러스를 포함하는 루비바이러스를 포함하는 알파바이러스 속을 포함한다. 플라비비리다에과는 뎅기열, 황열, 일본 뇌염, 세인트루이스 뇌막염 및 진드기매개 바이러스를 포함한다. 다른 표적 항원은 C형 간염 또는 코로나바이러스과로부터 발생될 수 있으며, 이는 다수의 비-인간 바이러스, 예컨대, 전염성 기관지염(가금류), 돼지 전염성 위장염 바이러스(돼지), 돼지 혈구응집성 뇌척수염 바이러스(돼지), 고양이 전염성 복막염 바이러스(고양이), 고양이 장 코노나바이러스(고양이), 개 코로나바이러스(개), 및 인간 호흡 코로나바이러스를 포함하며, 이는 보통의 감기 및/또는 비-A, B 또는 C형 간염을 야기할 수 있다. 코로나바이러스과 내에서, 표적 항원은 E1 (또한 M 또는 매트릭스 단백질로 불림), E2 (또한 S 또는 스파이크 단백질로 불림), E3 (또한 HE 또는 헤마그글루틴-엘터로오스로 불림) 글리코단백질(모든 코로나바이러스에 존재하지 않음) 또는 N(뉴클레오캡시드)을 포함한다. 또 다른 항원은 베시큘로바이러스속(예를 들어, 소수포형 구내염 바이러스), 및 리사 바이러스속(예를 들어, 광견병)을 포함하는 랩도 바이러스과를 표적으로 할 수 있다.
랩도바이러스 과 내에서, 적당한 항원은 G 단백질 또는 N 단백질로부터 유래될 수 있다. 마르부르크 및 에볼라 바이러스와 같은 출혈열 바이러스를 포함하는 필로바이러스 과는 항원의 적당한 공급원일 수 있다. 파라믹소바이러스 과는 파라인플루엔자 바이러스 타입 1, 파라인플루엔자 바이러스 타입 3, 소 파라인플루엔자 바이러스 타입 3, 루불라바이러스(멈프스 바이러스), 파라인플루엔자 바이러스 타입 2, 파라인플루엔자 바이러스 타입 4, 뉴캐슬병 바이러스(닭), 우역, 홍역 및 개디스템퍼를 포함하는 모르비리바이러스, 호흡기 세포융합 바이러스를 포함하는 뉴모바이러스를 포함한다. 인플루엔자 바이러스는 오소믹소바이러스 과 내로 분류되며 적당한 항원(예를 들어, HA 단백질, N1 단백질)의 공급원이다. 분야바이러스 과는 분야바이러스 속(캘리포니아뇌염, La Crosse), 플레보바이러스(리프트 밸리열), 한타바이러스 (퓨어말라(puremala)는 헤마하긴(hemahagin) 열 바이러스이다), 나이로바이러스(진드기증(Nairobi sheep disease)) 및 다양한 미지정 분야바이러스를 포함한다. 아레나바이러스 과는 LCM 및 라사열 바이러스에 대한 항원의 공급원을 제공한다. 레오 바이러스 과는 레오바이러스, 로타바이러스(어린이에게서 급성위장염을 야기한다), 오르비바이러스, 및 컬티바이러스(콜로라도진드기열, 레봄보(Lebombo) (인간), 말 뇌증, 청설병) 속을 포함한다.
레트로바이러스 과는 고양이 백혈병 바이러스, HTLVI 및 HTLVII, 렌티바이러스(인간 면역결핍 바이러스(HIV), 유인원 면역결핍 바이러스(SIV), 고양이 면역부전 바이러스(FIV), 말 전염성 빈혈 바이러스 및 스푸마바이러스를 포함)로서 인간 및 수의과 질병을 포함하는 옹코리비리날(oncorivirinal) 아과를 포함한다. 렌티바이러스 중에서, 많은 적당한 항원이 기술되었고 용이하게 선택될 수 있다. 적당한 HIV 및 SIV 항원의 예는, 제한 없이, gag, pol, Vif, Vpx, VPR, Env, Tat, Nef, 및 Rev 단백질뿐만 아니라 그것의 다양한 단편을 포함한다. 예를 들어, Env 단백질의 적당한 단편은 gp120, gp160, gp41과 같은 어떤 그것의 서브유닛, 또는 그것의 더 작은 단편, 예를 들어, 길이에 있어 적어도 약 8개의 아미노산을 포함할 수 있다. 유사하게, tat 단백질의 단편이 선택될 수 있다. [미국 특허 5,891,994호 및 미국 특허 6,193,981호 참조] 또한, D.H. Barouch et al, J. Virol., 75(5):2462-2467 (2001년 3월), 및 R.R. Amara, et al, Science, 292:69-74 (2001년 4월 6일)에서 기술되는 HIV 및 SIV 단백질 참조. 다른 예에서, HIV 및/또는 SIV 면역성 단백질 또는 펩티드는 융합 단백질 또는 다른 면역성 분자를 형성하기 위해 사용될 수 있다. 예를 들어, 2001년 8월 2일 공개된 WO 01/54719, 및 1999년 4월 8일 공개된 WO 99/16884에서 기술되는 HIV-1 Tat 및/또는 Nef 융합 단백질 및 면역 요법을 참조. 본 발명은 HIV 및/또는 SIV 면역원성 단백질 또는 본원에 기술되는 펩디드로 제한되지 않는다. 게다가, 이들 단백질에서 다양한 변형이 기술되었고, 또는 당업자에 의해 용이하게 만들어질 수 있었다. 예를 들어, 미국 특허 5,972,596에서 기술되는 변형된 구역 단백질(gag protein)을 참조. 추가로, 어떤 요망되는 HIV 및/또는 SIV 면역원은 단독으로 또는 조합하여 전달될 수 있다. 이러한 조합은 단일 벡터 또는 다중 벡터로부터 발현을 포함할 수 있다. 선택적으로, 다른 조합은 단백질 형태에서 하나 이상의 면역원의 전달과 함께 하나 이상의 발현된 면역원의 전달을 수반할 수 있다. 이러한 조합은 하기에서 더욱 상세하게 논의된다.
파포바바이러스 과는 폴리오마바이러스 아과(BKU 및 JCU 바이러스) 및 파필로마바이러스 아과(암 또는 유두종의 악성 진행과 관련)를 포함한다. 아데노바이러스 과는 호흡기 질병 및/또는 장염을 야기하는 바이러스(EX, AD7, ARD, O. B.)를 포함한다. 파보바이러스 과 고양이 파보 바이러스(고양이 장염), 고양이 범백혈구감소증 바이러스, 개 파보바이러스 및 돼지 파보바이러스. 헤르페스바이러스 과는 심플렉스바이러스 속 (HSVI, HSVII), 바리셀로바이러스 (가성광견병, 바리셀라-조스터 바이러스)를 포함하는 알파헤르페스바이러스 아과 및 시토메갈로 바이러스(HCMV, 무로메갈로바이러스)를 포함하는 베타헤르페스바이러스 아과 및 림포크립토바이러스 속, EBV (버킷 임파종(Burkitts lymphoma)), 전염성비기관염, 마렉병 바이러스, 및 라디노바이러스를 포함하는 감마헤르페스바이러스 아과를 포함한다. 수두 바이러스과는 오르토폭스바이러스(바리올라 (두창) 및 백시니아 (우두)), 파라폭스바이러스, 아비폭스바이러스, 카프리폭스바이러스, 레포리폭스바이러스, 수이폭스바이러스 속을 포함하는 초르도폭스바이러스아과, 및 엔토모폭스바이러스 아과를 포함한다. 헤파드나바이러스 과는 B형 간염 바이러스를 포함한다. 적당한 항원의 공급원일 수 있는 한 미분류 바이러스는 델타감염 바이러스이다. 또 다른 바이러스 공급원은 조류 전염성 훼브리셔스낭병 바이러스 및 돼지 호흡기 생식기 증후군 바이러스를 포함할 수 있다. 알파바이러스 과는 말동맥염바이러스 및 다양한 뇌염바이러스를 포함한다.
다른 병원체에 대한 인간 또는 비-인간 동물을 면역화하는데 유용한 면역원은, 예를 들어, 인간 및 비-인간 척추동물을 감염시키는 박테리아, 진균, 기생충미생물 또는 다세포 기생충, 또는 암 세포 또는 종양 세포를 포함한다. 박테리아 병원체의 예는 폐렴쌍구균; 포도상구균; 및 연쇄상구균을 포함하는 병원체의 그램양성 구균을 포함한다. 병원체의 그램-음성 구균은 뇌척수막염균; 임균을 포함한다. 병원체의 장 그램-음성 간균은 장내세균(enterobacteriaceae); 슈도모나스, 아시네토박테리아 및 에이케넬라; 멜리오이도시스; 살모넬라; 시겔라; 헤모필루스; 모락셀라; H. 듀크레이(무른 궤양을 야기함); 브루셀라균; 프란시셀라 툴라렌시스균(툴라레미아를 야기); 예르시니아(파스튜렐라); 모닐리포르미스사슬막대균 및 나선균을 포함하고; 그램-양성 간균은 리스테리아모노사이토제네스; 돈단독균(erysipelothrix rhusiopathiae); 코리네박테리움 디프테리아(디프테리아); 콜레라; 탄저균 (탄저병); 도노반증(서혜육아종); 및 바르토넬라증을 포함한다. 병원성 혐기성 세균에 의해 야기되는 질병은 파상풍; 보툴리즘; 다른 클로스트리디아; 결핵; 나병; 및 다른 마이코박테리아를 포함한다. 병원성 스피로헤타병은 매독; 트레포네마병: 매종, 핀타 및 풍토병성 매독; 및 렙토스피라병을 포함한다. 더 고등의 병원체 박테리아 및 병원성 진균에 의해 야기되는 다른 감염은 방선균증; 노카르디아증; 효모균증, 분아진균증, 히스토플라스마증 및 콕시디오이데스 진균증; 칸디다증, 아스페르길루스증, 및 뮤코르 진균증; 스포로트릭스증; 파라콕시디오이드마이세스증, 페트리엘리듐증, 토룰롭시스증, 균종 및 색소진균증; 및 피부사상균증을 포함한다. 리케차감염은 발진티푸스, 로키산 홍반열, Q열, 및 리켓치아폭스를 포함한다. 마이코플라스마 및 클라미디아 감염의 예는: 마이코플라즈마 뉴모니아; 서혜 림프 육아종; 앵무새병; 및 주산기 클라미디아 감염을 포함한다. 병원성 진핵생물은 병원성 원생동물 및 장내 기생충을 포함하고, 이에 의해 생성되는 감염은: 아메바성 감염; 말라리아; 리슈만편모충증; 트리파노소마증; 톡소플라스마증; 폐포자충(Pneumocystis carinii); 트리칸스(Trichans); 톡소포자충(Toxoplasma gondii) ; 바베스열원충증; 지알디아증; 선모충병; 필라리아병; 주혈흡충병; 선충; 흡충 또는 요행; 및 촌충류(촌충) 감염을 포함한다.
다수의 이들 유기체 및/또는 이에 의해 생성되는 독소는 생물학적 공격에서 사용을 위한 가능성을 가지는 약제로서 질병 대책 센터(Centers for Disease Control)[(CDC), Department of Heath and Human Services, USA]에 의해 확인되었다. 예를 들어, 일부의 이들 생물학적 약제는 탄저균 (탄저병), 클로스트리디움 보툴리늄 및 그것의 독소(보툴리즘), 페스트균(Yersinia pestis)(흑사병), 대두창(두창), 프란키셀라 툴라렌시스(Francisella tularensis)(툴라레미아), 및 바이러스성 출혈열[필로바이러스(예를 들어, Ebola, Marburg], 및 아레나바이러스[예를 들어, Lassa, Machupo])를 포함하며, 이들 모두는 현재 카테고리 A 약제로서 분류되며; 콕시엘라 부르네티(Q 열); 브루셀라 종(브루셀라병), 비저균(Burkholderia mallei)(마비저), 부르코홀데리아 슈도 말레이(Burkholderia pseudomallei)(유비저), 피마자 및 그것의 독소(리신 독소), 클로스트리듐 균(clostridium perfringen) 및 그것의 독소(엡실론 독소), 포도상구균 종 및 그것의 독소(엔테로톡시 B), 클라미디아 시타시(앵무새병), 물의 안전성 위협(예를 들어, 비브리오콜레라, 크립토스포리듐 파르붐), 발진티푸스(리케챠 포와제키(Rickettsia powazekii)), 및 바이러스성뇌염(알파바이러스, 예를 들어, 베네수엘라마뇌염; 동부형마 뇌막염; 서부형 마뇌염)를 포함하고; 이들 모두는 카테고리 B 약제로서 분류되고; 니판 바이러스 및 한타바이러스를 포함하고, 이것은 카테고리 C 약제로서 분류된다. 게다가, 이렇게 분류 또는 다르게 분류되는 다른 유기체는 장래의 목적을 위해 확인 및/또는 사용될 수 있다. 본원에서 기술되는 바이러스 벡터 및 다른 구조체는 이들 유기체, 바이러스, 그것의 독소 또는 다른 부산물로부터, 이들 생물학적 약제에 의한 감염 또는 다른 역반응을 예방 및/또는 치료할 항원을 전달하는데 유용하다는 것이 이해될 것이다.
T 세포의 가변 영역에 대해 면역원을 전달하는 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 벡터의 투여는 이들 T 세포를 제거하기 위해 CTL을 포함하는 면역반응을 일으키는 것으로 예상된다. RA에서, 질병에 수반되는 TCR의 몇몇의 특정 가변 영역은 특성이 기술되었다. 이들 TCR은 V-3, V-14, V-17 및 Vα-17을 포함한다. 따라서, 적어도 하나의 이들 폴리펩티드를 암호화하는 핵산 서열의 전달은 RA에 수반된 T 세포를 표적화할 면역반응을 유발할 것이다. MS에서, 질병에 수반된 TCR의 몇몇 특정 가변 영역은 특성이 기술되었다. 이들 TCR은 V-7 및 Vα-10을 포함한다. 따라서, 적어도 하나의 이들 폴리펩티드를 암호화하는 핵산 서열의 전달은 MS에 수반되는 T세포를 표적화할 면역반응을 유발할 것이다. 경피증에서, 질병에 수반된 TCR의 몇몇 특정 가변 영역은 특성이 기술되었다. 이들 TCR은 V-6, V-8, V-14 및 Vα-16, Vα-3C, Vα-7, Vα-14, Vα-15, Vα-16, Vα-28 및 Vα-12을 포함한다. 따라서, 적어도 하나의 이들 폴리펩티드를 암호화하는 재조합 유인원 아데노바이러스의 전달은 경피증에 수반된 T 세포를 표적화할 면역 반응을 일으킬 것이다.
C. Ad-매개 전달 방법
선택된 유전자의 치료 수준, 또는 면역의 수준은, 만약에 있다면, 부스터에 대한 필요를 결정하기 위해 모니터링될 수 있다. 혈청에서 CD8+ T 세포 반응, 또는 선택적으로 항체 타이터의 평가에 따라서, 선택적인 부스터 면역화가 요망될 수 있다. 선택적으로, 재조합 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 벡터는 단일 투여에서 또는 예를 들어, 다른 활성 성분을 수반하는 요법 또는 치료의 과정과 조합하는 다양한 요법 또는 프라임-부스트 요법에서 전달될 수 있다. 다양한 이러한 요법은 당업계에서 기술되었고 용이하게 선택될 수 있다.
예를 들어, 프라임-부스트 요법은 일차 면역 체계로 DNA(예를 들어, 플라스미드) 기초 벡터를, 이차의 부스터로 이러한 항원을 암호화하는 서열을 전달하는 단백질 또는 재조합 바이러스와 같은 일반적인 항원의 투여를 수반할 수 있다. 예를 들어, 참고로써 포함되는 2000년 3월 2일 공개된 WO 00/11140 참조. 또 다르게는, 면역 요법은 항원, 또는 단백질을 전달하는 벡터(바이러스 또는 DNA-기초)에 대한 면역 반응을 촉진하기 위해 재조합 SAdV-39, SAdV-25.2, -30, -37 또는 -38 벡터의 투여를 수반할 수 있다. 또 다른 대안으로, 면역 요법은 단백질의 투여 후 항원을 암호화하는 벡터와 함께 부스터를 수반한다.
한 구체예에서, 상기 항원을 전달하는 플라스미드 DNA 벡터를 전달한 후 재조합 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 벡터로 부스팅함으로써 선택된 항원에서 면역반응을 프라이밍하고 부스팅하는 방법이 기술된다. 한 구체예에서, 프라임-부스트 요법은 프라임 및/또는 부스트 비히클로부터 멀티단백질의 발현을 수반한다. 예를 들어, HIV 및 SIV에 대한 면역 반응을 발생시키는데 유용한 단백질 서브유닛의 발현에 대한 멀티단백질 요법을 기술하는 R. R. Amara, Science, 292:69-74 (2001년 4월 6일) 참조. 예를 들어, DNA 프라임은 단일 전사로부터 Gag, Pol, Vif, VPX 및 Vpr 및 Env, Tat, 및 Rev를 전달할 수 있다. 또 다르게는, SIV Gag, Pol 및 HIV-1 Env는 재조합 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 아데노바이러스 구조체에서 전달된다. 또 다른 요법은 WO 99/16884 및 WO 01/54719에서 기술된다.
그러나, 프라임-부스트 요법은 HIV에 대한 면역 또는 이들 항원의 전달에 제한되지 않는다. 예를 들어, 프라이밍은 제 1 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 벡터에 의한 전달단계 후 제 2 Ad 벡터로, 또는 단백질 형태에서 항원 그 자체를 함유하는 조성물과 함께 부스팅하는 단계를 수반할 수 있다. 한 예에서, 프라임-부스트 요법은 항원이 유래된 바이러스, 박테리아 또는 다른 유기체에 대한 보호 면역 반응을 제공할 수 있다. 다른 구체예에서, 프라임-부스트 요법은 치료제가 투여되는 질환의 존재의 검출을 위한 통상적인 분석을 사용하여 측정될 수 있는 치료효과를 제공한다.
프라이밍 조성물은 요망되는 면역 반응이 표적화되는 항원에 따라서 용량 의존적 방법으로 다양한 자리에 투여될 수 있다. 주사(들)의 양 또는 위치 또는 약학적 담체는 제한되지 않는다. 오히려, 요법은 이들 각각이 매 시간마다, 매일, 주마다 또는 매월 또는 매년마다 투여되는 단일 용량 또는 투약량을 포함할 수 있는 프라이밍 및/또는 부스팅 단계를 수반할 수 있다. 예로서, 포유동물은 담체에서 약 10 μg 내지 약 50 μg의 플라스미드를 함유하는 하나 이상의 용량을 수용할 수 있다. DNA 조성물의 바람직한 양은 약 1 μg 내지 약 10,000 μg의 DNA 벡터의 범위에 있다. 투약량은 피험자 체중 당 1 μg 내지 1000 μg DNA로 다양할 것이다. 전달의 양 또는 자리는 포유동물의 동일성 및 질환에 기초하여 바람직하게 선택된다.
포유동물에 대한 항원의 전달에 적당한 벡터의 투약 단위는 본원에서 기술된다. 벡터는 등장 식염수; 등장 염 용액 또는 이러한 투여에서 당업자에게 명백할 다른 제형과 같은 약학적으로 또는 생리학적으로 허용가능한 담체로 현탁 또는 용해됨으로써 투여를 위해 제조된다. 적절한 담체는 당업자에게 명백할 것이고 투여 경로의 상당 부분에 의존할 것이다. 본원에 기술되는 조성물은 상기 기술된 경로에 따라서, 서방성 제형으로 생체분해가능한 생체적합성 폴리머를 사용하여, 또는 미셀, 겔 및 리포좀을 사용하는 현장 전달에 의해 포유동물에 투여될 수 있다. 선택적으로, 프라이밍 단계는 또한 본원에 기술되는 바와 같은 프라이밍 조성물, 적당한 양의 보조제와 함께 투여하는 단계를 포함한다.
바람직하게는, 부스팅 조성물은 포유동물 피험자에 대해 프라이밍 조성물을 투여 후 약 2 내지 약 27주에 투여된다. 부스팅 조성물의 투여는 프라이밍 DNA 백신에 의해 투여되는 동일한 항원을 함유하는 또는 전달할 수 있는 부스팅 조성물의 유효량을 사용하여 수행된다. 부스팅 조성물은 동일한 바이러스 공급원(예를 들어, 본 발명의 아데노바이러스 서열) 또는 다른 공급원으로부터 유래된 재조합 바이러스 벡터로 구성될 수 있다. 또 다르게는, "부스팅 조성물"은 프라이밍 DNA 백신에서, 그러나 조성물이 숙주에서 면역 반응을 유발하는 단백질 또는 펩티드의 형태로 암호화되는 바와 같은 동일한 항원을 함유하는 조성물일 수 있다. 다른 구체예에서, 부스팅 조성물은 포유동물 세포에서 그것의 발현을 지시하는 조절 서열, 예를 들어, 잘-공지된 박테리아 또는 바이러스 벡터와 같은 벡터의 제어하에서 항원을 암호화하는 DNA 서열을 함유한다. 부스팅 조성물의 일차적 요건은 조성물의 항원이 프라이밍 조성물에 의해 암호화되는 동일 항원, 또는 교차-반응 항원이다.
다른 구체예에서, SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 벡터는 또한 다양한 다른 면역 및 치료 요법에서 사용을 위해 적합하게 된다. 이러한 요법은 다른 항원형 캡시드의 Ad 벡터와 함께 동시에 또는 순차적으로 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 벡터의 전달, SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 벡터가 동시에 또는 순차적으로 비-Ad 벡터와 함께 전달되는 요법, SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 벡터가 동시에 또는 순차적으로 단백질, 펩티드 및/또는 다른 생물학적으로 유용한 치료 또는 면역원성 화합물과 함께 전달되는 요법을 수반할 수 있다. 이러한 사용은 당업자에게 용이하게 명백할 것이다.
또 다른 구체예에서, 본 발명은 이들 바이러스의 캡시드(선택적으로 무결함 또는 재조합 바이러스 입자 또는 중공 캡시드)의 사용이 아데노바이러스 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38을 피험자에 전달함으로써 다른 활성 약제에서 면역조절 효과 반응을 유발하기 위해, 또는 세포독성 T 세포 반응을 향상 또는 보조하기 위해 사용된다는 것을 제공한다. SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 캡시드는 단독으로 또는 그것에 면역 반응을 향상시키기 위해 활성 약제와 요법을 조합하여 전달될 수 있다. 유리하게는, 요망되는 효과는 아군 E 아데노바이러스에 의해 숙주를 감염시키지 않고 수행될 수 있다. 다른 양태에서, 피험자에 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 캡시드를 전달하는 단계를 포함하는 그것이 필요한 피험자에서 인터페론 알파 생성을 유발하는 방법이 제공된다. 또 다른 양태에서, 배양물에서 하나 이상의 시토킨(예를 들어, IFN-α)/케모킨)을 생성하기 위한 방법이 제공된다. 이 방법은 특히 알파 인터페론을 포함하는 사이토카인/케모킨을 생성하는데 적당한 조건하에서 수지상 세포를 함유하는 배양물 및 본원에 기술되는 SAdV-39, SAdV-25.2, -26, -30, -37 또는 -38 캡시드를 배양하는 단계를 수반한다.
이렇게 생성된 사이토카인은 다양한 용도에서 유용하다. 예를 들어, IFNα의 경우에, 박테리아에서 생성된 IFNα의 단지 하나 또는 두 가지의 서브타입을 함유하는 상업적으로 이용가능한 재조합적으로 생성된 IFNα에 대한 이점을 제공할 것으로 믿어지기 때문에, 본원에 기술되는 생성은 특히 바람직하다. 대조적으로, 본 방법은 천연 인간 IFNα의 다양한 서브타입을 생성하는 것으로 예상되며, 더 넓은 작용의 스펙트럼을 초래하는 것으로 기대된다. 각 서브타입은 특이적 생물학적 활성을 사용하는 것으로 믿어진다. 추가로, 본원에서 제공되는 방법에 의해 생성되는 천연 인터페론은 환자의 자연적으로 생성된 인터페론과 면역학적으로 구별되지 않을 것이며, 이에 의해 보통 재조합적으로 생성된 인터페론에 대해 중화항체의 형성에 의해 야기되는 피험자의 면역 시스템에 의해 거부되는 약물의 위험을 감소시킨다.
하기 예는 SAdV-39, SAdV-25.2, -26, 30, -37 또는 -38의 클로닝 및 대표적인 재조합 SAdV-39, SAdV-25.2, -26, 30, -37 또는 -38 벡터의 구성을 예시한다. 이들 예는 단지 예시적인 것이며, 본 발명의 범주를 제한하는 것은 아니다.
실시예 1 - 유인원 아데노바이러스의 분리.
University of Louisiana New Iberia Research Center, 4401 W. Admiral Doyle Drive, New Iberia, Louisiana, USA에서 침팬지 집단, 및 Michael E. Keeling Center for Comparative Medicine and Research, University of Texas M. D. Anderson Cancer Center, Bastrop, Texas, USA에서 침팬지 집단으로부터 채변 샘플을 얻었다. 행크스 평형화 염 용액의 현탁액에서 침팬지 채변 샘플로부터의 상청액을 0.2 미크론 실린지 필터를 통해 멸균 여과하였다. 100 μl의 각각의 여과된 샘플을 인간 셀 라인 A549 배양물에 접종하였다. 이들 세포를 10% FBS, 1% Penn-Strep 및 50μg/ml 겐타마이신과 함께 Ham's F12에서 성장시켰다. 배양물에서 약 1 내지 2주 후, 시각적 세포변성 효과(CPE)는 몇몇의 접종물과 함께 세포 배양물에서 명백하였다. 아데노바이러스를 아데노바이러스 정제를 위한 표준 공개된 염화세슘 기울기 기술을 사용하여 A549 세포에서 배양물로부터 정제하였다. 정제한 아데노바이러스로부터 DNA를 분리하였고 Qiagen Genomic services, Hilden, Germany에 의해 완전히 서열화하였다.
바이러스 DNA 서열의 계통발생적 분석에 기초하여, 아데노바이러스 지정된 유인원 아데노바이러스 25.2 (SAdV-25.2), 유인원 아데노바이러스 26 (SAdV-26), 유인원 아데노바이러스 30 (SAdV-30), 유인원 아데노바이러스 37 (SAdV-37), 유인원 아데노바이러스 38, (SAdV-38) 및 유인원 아데노바이러스 39 (SAdV-39)를 인간 아군 E로서 동일 아군 내가 되도록 결정하였다.
서열 분석은 SAdV-26의 헥손에 서열 분석은 가장 가까운 헥손 매치는 침팬지 아데노바이러스 6 (98.4 퍼센트 (%) 동일성)이고 가장 가까운 섬유소 매치는 인간 아데노바이러스 4(93% 동일성)라는 것을 나타내었다.
서열 분석은 SAdV25.2 바이러스의 가장 가까운 게놈의 매치는 유인원(침팬지) 아데노바이러스 25 [Genbank 등록 번호 AC000011]라는 것을 나타내었다. SAdV25는 이전에 C68 또는 Pan9 [미국 특허 6083716호]로 명명되었다. 핵산 수중에서, 벡터 NTI-AlignX에 의해 결정되는 바와 같이 SAdV25.2 및 SAdV25 사이에 94% 동일성이 있다. 헥손(아미노산) 수준에서, SAdV-25.2는 2개의 보존적 아미노산 변화와 2개의 비-보존적 변화를 가지는 유인원 아데노바이러스 25와 99% 동일성을 가진다. 하기 표는 SEQ ID NO: 140에서 본원에서 제공되는 SAdV25.2의 헥손에 관하여 SAdV25 헥손 서열과 비교하여, SAdV25.2에서 아미노산 변화를 나타낸다. 두 서열의 넘버링은 동일하다.
벡터를 만들기 위해 사용되는 방법은 전체 E1-결핍 아데노바이러스 벡터의 박테리아 플라스미드 분자 클론을 우선 만든 다음 E1 보완 셀 라인 HEK 293에 플라스미드 DNA의 트랜스펙션을 하여 바이러스 벡터를 구제한다.
E1-결핍 아데노바이러스 벡터의 분자 클론을 만들기 위해, 희소-절단 제한 효소 I-CeuI 및 PI-SceI에 대한 인식 자리가 E1 결핍 대신에 삽입된 곳에서 E1-결핍 아데노바이러스의 플라스미드 분자 클론을 우선 만들었다. I-CeuI 및 PI-SceI 옆에 위치하고, 이들 제한 효소를 사용하여 절단한 발현 카세트를 E1-결핍 아데노바이러스 플라스미드 클론에 연결하였다. E1 결핍 대신에 요망되는 발현 카세트를 포함하는 플라스미드 아데노바이러스 분자 클론을 HEK 293 세포에 트랜스펙팅하여 재조합 아데노바이러스 벡터를 구제하였다. 트랜스펙션 다음의 구제는 제한 효소 분해에 의해 플라스미드로부터 선형의 아데노바이러스 게놈을 우선 방출함으로써 가능하게 된다는 것을 발견하였다.
실시예 2 - 표준 분자 생물학 기술을 사용하는 SAdV -39, SAdV -25.2, SAdV -26, SAdV-30, SAdV-37, 또는 SAdV-38에 기초한 E1-결핍 플라스미드 분자 클론의 구성.
A. SAdV-39의 벡터 구성
SAdV-39 (아군 E)를 사용하는 E1 결핍 벡터를 설명한 바와 같이 제조하였다.
1. pSR3의 구성:
SwaI 자리 옆에 위치하는 SmaI, HindII, EcoRV 자리를 함유하는 링커를 다음과 같이 EcoRI 및 NdeI에 의해 pBR322 절단으로 클로닝하였다.
올리고머 SEQ ID NO: 196: SV25 Top: AATTATTTAAATCCCGGGTATCAA-GCTTGATAGATATCATTTAAAT 및 SEQ ID NO: 197: SV25 Bot TAATTTAAATGATATCTATCAAGCTTGATACCCGGGATTTAAAT를 함께 어닐링하여 링커를 만들었다.
2. HindIII 자리(7152)에서 SAdV-39 바이러스 왼쪽 말단의 클로닝
바이러스 DNA를 HindIII로 분해하였고 7152 bp 왼쪽 말단 단편을 SmaI 및 HindIII로 분해한 pSR3에 클로닝하여 pSR3 C39 LE를 수득하였다.
3. I-CeuI 및 PI-SceI 자리의 E1 기능적 결실 및 삽입:
플라스미드 pC39LE는 SnaBI 및 NdeI (Klenow 채워짐) 사이에서 결실되어 E1a 및 대부분의 E1b 코딩 영역을 결실하였고; 그 자리에서, DNA 단편(I-CeuI 및 PI-Scel에 대한 pBleuSK I-PI 포함 자리로부터 EcoRV 단편)을 연결하여 pC39LEIP를 수득하였다.
4. NheI 자리 (35779)로부터 SAdV-39 바이러스 오른쪽 말단의 클로닝.
SAdV-39 바이러스 DNA를 NheI로 분해하였고, 775 bp 오른쪽 말단 단편을 EcoRV 및 NheI 사이의 pC39LEIP에 클로닝하여 pC39LE IP RE를 수득하였다.
5. SAdV-39 바이러스 NheI (3033 - 35779) 단편의 클로닝
플라스미드 pC37-LE-IP-RE를 HindIII로 분해하였고, 32746 bp 바이러스 NheI 단편을 연결하였다. 정확한 방향성(orientaion)을 가지는 클론을 pC39 IP로 불렀다.
B. 표준 분자 생물학 기술을 사용하는 SAdV -25.2에 기초한 E1-결실 플라스미드 분자 클론의 구성.
SAdV-25.2 (아군 E)를 사용하는 E1 결실 벡터를 설명한 바와 같이 제조하였다.
1. pSR6의 구성:
PacI 자리 측면에 위치되는 SmaI, AscI, AvrII, EcoRV 자리를 함유하는 링커를 하기와 같이 EcoRI 및 NdeI에 의한 pBR322 절단으로 클로닝한다.
올리고머 SEQ ID NO: 198: pSR6 top: AATTTTAATTAACCCGGGTATCGGC- GCGCCTTAACCTAGGGATAGATATCTTAATTAA 및 SEQ ID NO: 199: pSR6 bot: TATTAATTAAGATATCTATCCCTAGGTTAAGGCGCGCCGATACCCGGGTTAATTAA를 어닐링하여 링커를 만들었다.
2. AscI 자리(7959)에 바이러스 왼쪽 말단의 클로닝
바이러스 DNA를 AscI로 분해하였고, 7959 bp 왼쪽 단편을 SmaI 및 AscI로 분해한 pSR6로 클로닝하여 pSR5 C25.2 LE를 수득하였다.
3. I-CeuI 및 PI-SceI 자리의 E1 기능적 결실 및 삽입:
플라스미드 pSR5 C25.2LE를 SnaBI + NdeI로 분해하였고; NdeI 자리를 Klenow로 채웠다. pBleuSK I-PI로부터의 EcoRV 단편을 연결하여 pSR5 C25.2 LE IP를 만들었다.
4. XbaI 자리(30071)로부터 바이러스 오른쪽 말단의 클로닝:
플라스미드 pSR5 C25.2 LE IP를 XbaI + EcoRV로 분해하였다. SAdV-25.2 DNA로부터 6559 bp 오른쪽 말단(XbaI 분해) 단편을 연결하여 pAdC12-LE-IP-RE를 만들었다.
5. 바이러스 중간 XbaI 단편(6037 - 30071)의 클로닝
플라스미드 pAdC12-LE-IP-RE를 XbaI로 분해하였다. SAdV-25.2 DNA로부터 24034 bp 단편을 연결하여 pAdC25.2 IP를 만들었다.
C. 표준 분자 생물학 기술을 사용하는 SAdV -26에 기초한 E1-결핍 플라스미드 분자 클론의 구성
SAdV-26 (아군 E)을 사용하는 E1 결핍 벡터를 설명한 바와 같이 제조하였다.
1. pSR5의 구성:
SwaI 측면에 위치하는 SmaI, ClaI, XbaI, SpeI, EcoRV 자리를 함유하는 링커를 EcoRI 및 NdeI에 의한 pBR322 절단에 클로닝한다.
2. XbaI 자리 (6029) 에서 바이러스 왼쪽 말단의 클로닝
바이러스 DNA를 XbaI로 분해하고 6 kb 단편(왼쪽 및 오른쪽 말단)을 겔 정제하였고 SmaI 및 XbaI로 분해된 pSR5에 연결하였다.
3. I-CeuI 및 PI-SceI 자리의 E1 기능적 결실 및 삽입:
플라스미드 pSR5-C12-LE를 SnaBI + NdeI로 분해하였고; NdeI 자리를 Klenow로 채웠다. pBleuSK I-PI로부터의 EcoRV 단편을 연결하여 pAdC 12-LE-IP를 만들었다.
4. XbaI 자리(30158)로부터의 바이러스 오른쪽 말단의 클로닝:
플라스미드 pAdC12-LE-IP를 XbaI + EcoRV로 분해하였다. SAdV-26 DNA로부터의 6471 bp 오른쪽 말단 (XbaI 분해) 단편을 연결하여 PAdC12-LE-IP-RE를 만들었다.
5. 바이러스 중간 XbaI 단편(6029 - 30158)의 클로닝
플라스미드 pAdC12-LE-IP-RE를 XbaI + EcoRV로 분해하였다. SAdV-26 DNA로부터의 24129 bp 단편을 연결하여 pC26 IP를 만들었다.
D. SAdV-30의 벡터 구성
SAdV-30 (아군 E)을 사용하는 E1 결실 벡터를 설명한 바와 같이 제조하였다.
1. pSR3의 구성:
SwaI 자리 옆에 위치하는 SmaI, HindII,, EcoRV 자리를 함유하는 링커를 하기와 같이 EcoRI 및 NdeI에 의해 pBR322 절단으로 클로닝하였다.
올리고머 SEQ ID NO: 196: SV25 Top:
AATTATTTAAATCCCGGGTATCAAGCTTGATAGATATCATTTAAAT 및 SEQ ID NO: 197: SV25 Bot: TAATTTAAATGATATCTATCAAGCTTGATACCCGGGATTTAAAT를 함께 어닐링하여 링커를 만들었다.
2. HindIII 자리 (7146)에서 바이러스 왼쪽 말단의 클로닝
바이러스 DNA를 HindIII로 분해하였고 7146 bp 왼쪽 말단 단편을 SmaI 및 HindIII으로 분해하여 pSR3에 클로닝하여 pSR3 C30 LE를 수득하였다.
3. I-CeuI 및 PI-SceI 자리의 E1 기능적 결실 및 삽입:
플라스미드 pSR3C30 LE를 SnaBI + NdeI로 분해하였고; NdeI 자리를 Klenow으로 채웠다. pBleuSK I-PI로부터의 EcoRV 단편을 연결하여 pC30 LE IP를 만들었다. 내부 EcoRI 자리(왼쪽 ITR의 시작으로부터 위치 1040 bp에서)를 EcoRI로 pC30 LE IP를 분해함으로써 파괴하였고, Klenow 폴리머라아제로 돌출부분을 채우고 재-연결하였다. 이것으로 플라스미드 pC30 LE IP (EcoRI del)를 수득하였다.
4. HindIII 자리(33048)로부터 바이러스 오른쪽 말단의 클로닝:
플라스미드 pC30 LE IP (EcoRI del)를 HindIII + EcoRV로 분해하였다. SAdV-30 DNA로부터의 3574 bp 오른쪽 말단(HindIII 분해) 단편을 연결하여 PC30-LE-IP-RE를 만들었다.
5. EcoRI (33631) 단편에서 바이러스 중간 XbaI (6035)의 클로닝
플라스미드 pC30-LE-IP-RE를 XbaI + HindIII로 분해하였다. SAdV-30 DNA로부터의 27596 bp 단편을 연결하여 pC30 IP를 만들었다.
E. SAdV-37의 벡터 구성
SAdV-37 (아군 E)를 사용하는 E1 결실 벡터를 설명한 바와 같이 제조하였다.
1. pSR3의 구성:
SwaI 옆에 위치하는 SmaI, HindII, EcoRV 자리를 함유하는 링커를 하기와 같이 EcoRI 및 NdeI에 의한 pBR322 절단으로 클로닝하였다. 올리고머: SEQ ID NO: 196: SV25 Top: AATTATTTAAATCCCGGGTATCAAGCTTGATAGATATCATTTAAAT 및 SEQ ID NO: 197: SV25 Bot: TAATTTAAATGATATCTATCAAGCTTGATACCCGGGATTTAAAT를 함께 어닐링하여 링커를 만들었다.
2. HindIII 자리 (7147)에서 SAdV-37 바이러스 왼쪽 말단의 클로닝
바이러스 DNA를 HindIII로 분해하였고 7147 bp 왼쪽 말단 단편을 SmaI 및 HindIII으로 분해한 pSR3에 클로닝하여 pSR3 C37 LE를 수득하였다.
3. I-CeuI 및 PI-SceI 자리의 E1 기능적 결실 및 삽입:
플라스미드 pSR3 C37LE를 SnaBI 및 NdeI(Klenow를 채움) 사이에서 결실하여 E1a 및 대부분의 E1b 코딩 영역을 결실하였고; 그 자리에서 DNA 단편(pBleuSK I-PI로부터의 EcoRV 단편은 I-Ceul 및 PI-SceI에 대한 자리를 포함한다)을 연결하여 pSR3 C37 LE IP를 수득하였다.
4. HindIII 자리(33048)로부터의 SAdV-37 바이러스 오른쪽 말단의 클로닝.
플라스미드 pC37 LE IP를 HindIII + EcoRV로 분해하였다. SAdV-37 DNA로부터의 3575 bp 오른쪽 말단(HindIII 분해) 단편을 연결하여 pC37-LE-IP-RE를 만들었다.
5. SAdV-37 바이러스 HindIII (23522 - 33060) 단편의 클로닝
플라스미드 pC37-LE-IP-RE를 HindIII로 분해하였고 9538 bp 바이러스 HindIII 단편을 연결하였다. 정확한 방향성을 가지는 클론을 pC37 del Xba Pac로 불렀다.
6. SAdV-37 바이러스 XbaI (6036) PacI(30181) 단편의 클로닝
플라스미드 pC37 del Xba Pac를 XbaI 및 PacI로 분해하였고 24145 bp 바이러스 XbaI-PacI 단편을 연결하여 pC37IP를 수득하였다.
F. E1-결실 아데노바이러스 벡터의 구성
E1 결실 대신에 I-CeuI 및 PI-SceI 인식 자리를 포함하는 DNA 부분을 삽입하기 위해, 플라스미드 pBleuSK I-PI를 사용하였다. 플라스미드 pBleuSK I-PI는 pBluescript II SK(+) (Stratagene)의 EcoRV 자리에 삽입되는 654 bp 단편을 함유한다. 654 bp 절편은 희소-절단 제한 효소 I-CeuI 및 PI-SceI에 대한 인식 자리를 포함한다. E1 결실 대신에 I-CeuI 및 PI-SceI 인식 자리를 포함하는 DNA 부분을 삽입하기 위해, pBleuSK I-PI를 EcoRV로 분해하였고 654 bp 단편을 아데노바이러스 게놈 E1 결실의 위치에 연결하였다. 삽입한 DNA의 서열은 하기에서 EcoRV 인식 자리 옆에 위치됨을 나타낸다. I-CeuI 및 PI-SceI에 대한 인식 서열을 밑줄친다.
인플루엔자 바이러스 뉴클레오단백질을 발현시키는 E1-결실 아데노바이러스 벡터를 구성하기 위해서, H1N1 인플루엔자 A 바이러스 NP를 암호화하는 뉴클레오티드 서열(A/Puerto Rico/8/34/Mount Sinai, GenBank 등록번호 AF389119.1)은 최적화된 코돈이었고, 완전히 합성하였다(Celtek Genes, Nashville, TN). 인간 사이토메갈로바이러스 초기 프로모터, 합성 인트론(플라스미드 pCI (Promega, Madison, Wisconsin)로부터 얻음), 코돈 최적화 인플루엔자 A NP 코딩 서열 및 소 성장 호르몬 폴리아데닐화 신호로 구성되는 발현 카세트를 구성하였다. 플라스미드 pShuttle CMV PI FluA NP는 상기 기술된 발현 카세트를 포함하며, 이는 각각 희소-절단 제한 효소 I-CeuI 및 PI-SceI (New England Biolabs)에 대한 인식 자리 옆에 위치한다. E1-결핍 아데노바이러스 벡터의 분자 클론을 만들기 위해서, E1-결핍 아데노바이러스의 플라스미드 분자 클론을 본 실시예의 앞 부분에서 설명한 바와 같이 만들었고, 희소-절단 제한 효소 I-CeuI 및 PI-SceI에 대한 인식 자리를 E1 결실 대신 삽입하였다. E1-결실 아데노바이러스 플라스미드를 그 후 I-CeuI 및 PI-SceI로 분해하였고 발현 카세트(동일 효소에 의해 분해)를 연결하였다. 결과 아데노바이러스 플라스미드 분자 클론을 HEK 293 세포에 트랜스펙팅하여 재조합 아데노바이러스 벡터를 구하였다. 트랜스펙션 후 구제는 제한 효소 분해에 의해 플라스미드로부터 선형 아데노바이러스 게놈을 우선 방출함으로써 가능하게 됨을 발견하였다.
실시예 3 - 교차-중화 항체의 평가
야생형 SAdV-39, SAdV-25.2, SAdV-26, SAdV-30, SAdV-37 및 SAdV-38을 직접 면역 형광법에 의해 모니터링되는 감염 억제 중화 항체 분석을 사용하여 인간 아데노바이러스 5(아종 C) 및 침팬지 아데노바이러스 7(SAdV-24), 및 인간 풀링된 IgG와 비교하여 교차-중화 활성에 대해 평가하였다. 일반적 인간 집단이 노출된 다수의 항원에 대한 항체를 함유하기 때문에, 인간 풀링된 IgG[Hu Pooled IgG]를 상업적으로 구입하고, 면역타협 환자에서 투여를 위해 승인한다. 인간 풀링된 IgG에 대한 유인원 아데노바이러스에서 중화 항체의 존재 또는 부존재는 일반적 모집단에서 이들 아데노바이러스에 대한 항체의 보급의 반영이다.
분석을 하기와 같이 수행하였다. 앞서 HAdV-5 또는 SAdV-24로 주사한 토끼로부터의 혈청 샘플을 35분 동안 56℃에서 가열하여 불활성화하였다. 야생형 아데노바이러스(108 입자/웰)를 무혈청 둘베코 변형 이글 배지(DMEM)에서 희석하였고, 37℃에서 1시간 동안 DMEM에서 가열-불활성화된 혈청의 2-배 연속 희석으로 배양하였다. 이후에, 혈청-아데노바이러스 혼합물을 105 단일층 A549 세포와 함께 웰 내의 슬라이드에 첨가하였다. 1시간 후, 각 웰의 세포를 100 μl의 20% 소 태아혈청(FBS)-DMEM으로 보충하였고, 5% CO2로 37℃에서 22시간 동안 배양하였다. 다음에, 세포를 PBS로 2회 헹구고 DAPI로 염색하였고, 염소에 FITC로 표지된, 광범위하게 교차 반응성인 항체(Virostat)를 파라포름알데히드(4%, 30 분)에서 고정 및 0.2% Triton (4℃, 20 분)에서 침투 후 HAdV-5에 대해 길렀다. 감염의 수준을 현미경관찰 하에서 FITC 양성 세포의 수를 카운팅함으로써 결정하였다. NAB 타이터를 나이브(naive) 혈청 대조군과 비교하여 50% 이상으로써 아데노바이러스 감염을 억제한 가장 높은 혈청 희석으로서 기록한다. < 1/20의 타이터 값이 나타나면, 중화 항체 농도는 검출의 제한, 즉 1/20 이하이다.
이들 데이터는 일반 모집단에서 이들 아데노바이러스에 대해 최소한의 면역반응성이 있음을 나타낸다. 이들 데이터는 HAdV-5 및 SAdV-24와 교차-반응하지 않는 앞의 표에 있는 유인원 아데노바이러스가 아데노바이러스의 순차적 전달을 수반하는 요법, 예를 들어, 프라임-부스트 또는 암 치료법에 유용할 수 있음을 추가로 나타낸다.
실시예 4 - 사이토카인 유도
형질세포양 수지상세포를 인간 말초혈액 단핵구(PBMC)로부터 분리하였고, 96웰 플레이트로 배지에서 배양하였고 아데노바이러스로 감염시켰다. 48시간 후 세포를 스핀다운하고 상청액을 수집하였고 인터페론 α의 존재하에서 분석하였다.
더 구체적으로, PBMC를 펜실베니아 유니버시티에서 CFAR(Center For AIDS Research) 면역학 코어로부터 획득하였다. 3억개의 이들 세포를 그 후 키트와 함께 제공된 설명서에 따라서 Miltenyi Biotec제의 "인간 형질세포양 수지상세포 분리 키트"를 사용하여 형질세포양 수지상세포(pDCs)를 분리하기 위해 사용하였다. 이 키트를 사용하는 분리는 모든 다른 세포 종류를 제거하는 것을, 그러나 pDC는 PBMC로부터 제거하는 것을 기초로 하였다.
최종 세포 수는 보통 도너로부터 도너까지 다양하지만, 4십만 내지 7십만개의 세포의 범위에 있다. 따라서 발생된 데이터(하기 논의)는 다중 도너로부터의 세포의 분석에서 비롯된다. 그렇지만 놀랍게도, 인터페론 또는 다른 사이토카인 방출에 기초한 아군의 분리는 다양한 도너로부터 세포를 분리할 때조차 유지된다.
세포를 L-글루타민, 10% 소 태아 혈청(Mediatech), 1OmM 헤페스 완충제 용액(Invitrogen), 항생물질(페니실린, 스트렙토마이신 및 겐타마이신-Mediatech 제) 및 인간-인터류킨 3 (20ng/mL - R&D)으로 보충한 RPMI-1640 배지(Mediatech)에서 배양하였다. 야생형 아데노바이러스를 10,000 (세포 당 10,000개의 바이러스 입자, 106 세포/ml의 농도로)의 감염다중도(MOI)에서 세포에 직접 첨가하였다. 48 시간 후, 세포를 스핀 다운하였고, 상청액을 인터페론의 존재하에서 분석하였다. 사이토카인을 제조업자로부터 추천된 프로토콜을 사용하여 PBL 생물의학 연구소로부터 효소-결합면역흡착분석법(ELISA) 키트를 사용하여 분석하였다.
본 연구는 아군 C 아데노바이러스가 IFNα의 검출가능하지 않은 양을 만든다는 것을 나타내었다(본 분석은 1250 pg/mL의 검출 제한을 가진다). 반대로, 아군 E 아데노바이러스의 모든 시험 멤버는 IFNα를 생성하였고, 일반적으로 아군 B 아데노바이러스와 비교하여 상당히 우수한 IFNα를 생성하였다.
다양한 다른 사이토카인을 또한 아데노바이러스의 스크리닝에서 검출하였다. 그러나, 일반적으로, 아군 E 아데노바이러스는 아군 C 아데노바이러스보다 상당히 더 높은 수준의 IL-6, RANTES, MIP-1α, TNF-α, IL-8, 및 IP-10을 생성하였다. 아군 B 아데노바이러스는 또한 IFNα, IL-6, RANTES, 및 MIP1α의 유도에서 아군 C 아데노바이러스를 능가하였다.
상당한 세포 용혈이 이 연구에서 관찰되지 않았기 때문에, 이는 감염과 상관없이, 바이러스 복제의 어떤 상당한 양의 존재하에서 사이토카인이 아군 E 아데노바이러스와 세포를 접촉함으로써 생성된다는 것을 제안한다.
다른 연구에서(제시하지 않음), 세포를 중공 C7 캡시드 단백질(Ad 아군 E) 또는 UV-불활성 아데노바이러스 C7 바이러스 벡터(UV 불활성화는 교차-연결을 야기하며, 아데노바이러스 유전자 발현을 제거한다) 중 하나와 함께 상기 기술된 바와 같이 배양하였다. 이들 연구에서, 동일 또는 더 높은 수준의 IFNα가 무결함 C7과 비교하여 중공 캡시드와 불활성 바이러스 벡터 둘 다에 대해 관찰되었다.
본 발명자들은 PBMC, PBL, 및 수지상 세포와 같은 사이토카인-생성 세포 또는 케모킨-생성 세포를 아군 E 아데노바이러스의 멤버로부터 캡시드에 노출시키는 것은 사이토카인을 유도하고, 특히 IFNα, 또는 케모킨은 다른 아데노바이러스 아군에 의한 것보다 상당히 더 높은 양으로 유도된다는 것을 발견하였다. 따라서, 이 아군의 멤버는 배양물에서 알파 인터페론, 더 적은 양으로, 다수의 다른 사이토카인/케모킨을 유발하는데 유용하다.
재조합적으로 생성된 IFNα에 유리하게 되는 것으로 믿어지는 바와 같이, IFNα의 경우에, 생성 방법은 특히 바람직하다. 대조적으로, 본원에서 제공되는 방법은 더 넓은 작용의 스펙트럼을 초래하는 것으로 기대되는 천연 인간 IFNα의 다수의 서브타입을 생성하는 것으로 예상된다. 각 서브타입은 생물학적 활성을 사용하는 것으로 믿어진다. 추가로, 본원에서 제공되는 방법에 의해 생성되는 천연 인터페론은 환자의 자연적으로 생성된 인터페론과 면역학적으로 구별되지 않을 것임이 예상되며, 이에 의해 약물이, 보통 재조합적으로 생성된 인터페론에 대해 중화항체의 형성에 의해 야기되는 환자의 면역 체계에 의해 거부되는 위험을 감소시킨다.
아군 E 아데노바이러스에 의해 생성되는 다른 사이토카인은 인터류킨 (IL)-6, IL-8, IP-1O, 대식세포 염증 단백질-1 알파(MIP-1α), RANTES, 및 종양 괴사 인자 알파를 포함한다. 배양물로부터 사이토카인/케모킨을 정제하는 방법 및 이들 사이토카인/케모킨의 치료적 또는 보조적 사용은 본 문헌에서 기술되었다. 추가로, 상업적으로 이용가능한 컬럼 또는 키트는 본 발명에 따라서 제조되는 사이토카인/케모킨의 정제를 위해 사용될 수 있다. 본 발명을 사용하여 생성되는 사이토카인/케모킨은 다양한 증상에서 사용을 위해 제형으로 될 수 있다.
예를 들어, 본원에서 기술되는 사이토카인은, 인터페론 알파(IFNα), 종양 괴사 인자 알파(TNFα), IP-10 (인터페론 감마 유도성 단백질), 인터류킨-6 (IL-6), 및 IL-8을 포함한다. IFNα는 인플루엔자, 간염(예를 들어, B형 간염 및 C형 간염을 포함), 및 다양한 신생물, 예를 들어, 신장 (신장암), 흑색종, 악성 종양, 다발성 골수종, 유암종, 림프종 및 백혈병(예를 들어, 만성 골수성 백혈병 및 모양 세포성 백혈병)의 치료에 유용한 것으로 기술되었다. 본원에서 기술되는 바와 같이 생성된 IFNα 서브타입의 혼합물은 공지된 기술을 사용하여 정제될 수 있다. 예를 들어, 모노클로날 항체 및 컬럼 정제의 사용을 기술하는 WO 2006/085092 참조. 다른 기술은 본 문헌에서 설명하였다.
본원에서 설명하는 바와 같이 생성한 IFNα는 공지된 방법을 사용하여 정제될 수 있다. 예를 들어, 미국 특허 4,680,260호, 미국 특허 4,732,683호, 및 G. Allen, Biochem J., 207:397-408 (1982)을 참조. TNFα를 예를 들어, 건선 및 류마티스 관절염을 포함하는 자가면역 질환의 치료에 유용한 것으로 설명하였다. IP-10, 인터페론 감마 유도성 단백질은 혈관생성의 강력한 억제제로서 사용될 수 있고, 강력한 흉선-의존 항-종양 효과를 가지는 것으로 사용될 수 있다.
따라서, 또 다른 양태에서, 사이토카인을 생성하기에 적당한 조건하에서 수지상 세포 및 아군 E 아데노바이러스 캡시드를 함유하는 배양물을 배양함으로써 IFNα를 생성하는 방법이 제공된다.
한 구체예에서, 혈액은 건강한 도너(바람직하게는 인간)로부터 회수하고 말초혈액 백혈구 (PBL) 또는 말초혈액 단핵세포(PBMC)를 공지 기술을 사용하여 제조한다. 한 구체예에서, PBL을 본 발명의 방법에 따라서 사이토카인-생성 세포로서 사용한다. 다른 구체예에서, PBMC를 사이토카인-생성 세포로서 사용한다. 다른 구체예에서, 형질세포양 수지상세포를 공지 기술, 예를 들어, Miltenyi Biotec GmbH (독일)에 의한 상업적으로 이용가능한 키트 "인간 형질세포양 수지상세포 분리 키트"를 사용하여 PBL 또는 PBMC로부터 분리한다. 선택 세포를 적절한 배지 및 아데노바이러스 아군 E 캡시드 단백질과 함께 현탁으로 배양한다. 적절한 배지는 당업자에 의해 용이하게 결정될 수 있다. 그러나, 한 구체예에서, 배지는 RPMI-1640 배지이다. 또 다르게는, 다른 배지는 용이하게 선택될 수 있다.
세포를 적당한 용기, 예를 들어, 마이크로타이터 웰, 플라스크, 또는 더 큰 용기에서 배양할 수 있다. 한 구체예에서, 세포의 농도는 약 1백만 개 세포/mL 배양물 배지이다. 그러나, 다른 적당한 세포 농도는 당업자에 의해 용이하게 결정될 수 있다.
유리하게는, 본 발명은 프라이머로서 인터페론의 사용을 필요로 하지 않는다. 그러나, 원한다면, 배지는 세포 성장을 자극하기 위해서 적절한 사이토카인, IL-3을 포함할 수 있다. 한 적당한 농도는 약 20 ng/mL이다. 그러나, 다른 농도가 사용될 수 있다.
한 구체예에서, 아데노바이러스 캡시드 단백질은 세포를 함유하는 배양물에 포함된다. 아데노바이러스 캡시드 단백질은 본원에서 설명되는 어떤 형태(예를 들어, 중공 캡시드 입자, Ad 아군 E 캡시드 등을 포함하는 바이러스 입자)로 배양물에 전달될 수 있다. 전형적으로 캡시드 단백질은 적당한 담체, 예를 들어, 배양물 배지, 식염수 등에서 현택될 것이다.
적절하게는, 아데노바이러스 아군 E 캡시드는 세포 당 약 100 내지 100,000개의 아데노바이러스 아군 E 입자의 양으로 배양물에 첨가된다. 혼합물을 그 후, 예를 들어, 약 28℃ 내지 약 40℃의 범위, 약 35℃ 내지 약 37℃의 범위, 또는 약 37℃에서 배양한다.
전형적으로, 대략 12 내지 96 시간, 또는 약 48 시간 후, 세포를 스핀다운하고, 상청액을 수집한다. 적절하게는, 이것을 세포 용해를 피하는 조건하에서 수행하고, 이에 의해 상청액에서 세포 파편의 존재를 감소 또는 제거한다. 원심분리는 세포로부터 사이토카인의 분리를 허용하며, 이에 의해 천연 그대로의 분리된 사이토카인을 제공한다. 사이징(sizing) 컬럼, 다른 공지의 컬럼 및 방법은 아데노바이러스 및 아데노바이러스 캡시드 등으로부터 사이토카인의 추가 정제를 위해 이용가능하다.
이렇게 정제한 이들 사이토카인을 제형 및 다양한 용도를 위해 이용가능하다.
본원에서 설명한 바와 같이, 그리고 이론에 의해 뒷받침 없이, 아데노바이러스 아군 E의 면역 향상 및/또는 사이토카인 생성 능력은 아데노바이러스 입자의 전염력 또는 복제 능력에 관계없이 세포와 아데노바이러스 캡시드 사이의 접촉에 기초하여 나타난다. 따라서, 한 구체예에서, 중공 아데노바이러스 아군 E 입자(즉, 어떤 아데노바이러스 또는 유전자 이식 생성물을 발현시키는 그것에서 DNA 패키징을 가지지 않는 아데노바이러스 캡시드)는 세포에 전달된다. 다른 구체예에서, 비-전염성 야생성 아군 E 입자 또는 아데노바이러스 아군 E 캡시드(입자)에 패키징된 재조합 아데노바이러스 벡터가 사용된다. 이러한 바이러스 입자를 불활성화하는데 적절한 기술은 당업계에 공지되어 있으며 제한 없이, 예를 들어, UV 조사를 포함할 수 있다(발현을 방지하는 게놈의 DNA를 효과적으로 교차-연결).
상기 인용된 모든 문헌은 참고로써 본원에 포함된다. 수많은 변형 및 변경이 상기 확인된 설명의 범주에 포함되며 당업자에게 명백한 것으로 기대된다. 이러한 조성물 및 공정에 대한 변형 및 변경, 예로써 다른 미니유전자의 선택 또는 벡터 또는 면역 조절자의 선택 또는 투약량은 첨부되는 청구항의 범주 내인 것으로 믿어진다.
SEQUENCE LISTING
<110> The Trustees of the University of Pennsylvania
Roy, Soumitra
Wilson, James M.
Vandenberghe, Luc
<120> Simian Subfamily E Adenoviruses SAdV-39, -25.2, -26, -30, -37,
and -38 and Uses Thereof
<130> UPN-U4623PCT
<150> US 61/004,532
<151> 2007-11-28
<150> US 61/004,507
<151> 2007-11-28
<150> US 61/004,541
<151> 2007-11-28
<150> US 61/004,461
<151> 2007-11-28
<150> US 61/004,499
<151> 2007-11-28
<150> US 61/004,464
<151> 2007-11-28
<160> 200
<170> PatentIn version 3.5
<210> 1
<211> 36553
<212> DNA
<213> Simian adenovirus 39
<220>
<221> repeat_region
<222> (1)..(126)
<223> label=ITR
<220>
<221> CDS
<222> (1905)..(3416)
<223> label=Elb\55K
<220>
<221> CDS
<222> (3504)..(3929)
<223> label=pIX
<220>
<221> misc_feature
<222> (3994)..(5615)
<223> complement (3994..5324, 5603..5615) label=IVa2
<220>
<221> misc_feature
<222> (5097)..(13870)
<223> complement (5097..8672, 13862..13870) label=pol
<220>
<221> misc_feature
<222> (8474)..(13870)
<223> complement (8474..10408, 13862..13870) label=pTP
<220>
<221> CDS
<222> (10859)..(12037)
<223> label=52K
<220>
<221> CDS
<222> (12064)..(13833)
<223> label=pIIIa
<220>
<221> CDS
<222> (13915)..(15510)
<223> label=penton
<220>
<221> CDS
<222> (15517)..(16095)
<223> label=pVII
<220>
<221> CDS
<222> (16140)..(17180)
<223> label=V
<220>
<221> CDS
<222> (17208)..(17438)
<223> label=pX
<220>
<221> CDS
<222> (17473)..(18249)
<223> label=pVI
<220>
<221> CDS
<222> (18359)..(21178)
<223> label=hexon
<220>
<221> CDS
<222> (21202)..(21825)
<223> label=protease
<220>
<221> misc_feature
<222> (21910)..(23445)
<223> complement label=DBP
<220>
<221> CDS
<222> (23468)..(25870)
<223> label=100K
<220>
<221> CDS
<222> (26484)..(27164)
<223> label=pVIII
<220>
<221> CDS
<222> (27168)..(27485)
<223> label=E3\12.5K
<220>
<221> CDS
<222> (28056)..(28583)
<223> label=E3\gp19K
<220>
<221> CDS
<222> (28616)..(29233)
<223> label=E3\CR1\beta
<220>
<221> CDS
<222> (29249)..(29863)
<223> label=E3\CR1\gamma
<220>
<221> CDS
<222> (29881)..(30765)
<223> label=E3\CR1\delta
<220>
<221> CDS
<222> (30776)..(31048)
<223> label=E3\RID\alpha
<220>
<221> CDS
<222> (31057)..(31482)
<223> label=E3\RID\beta
<220>
<221> CDS
<222> (31997)..(33463)
<223> label=fiber
<220>
<221> misc_feature
<222> (33567)..(334708)
<223> complement (33567..33815, 34529..34708) label=E4\orf6/7
<220>
<221> misc_feature
<222> (33815)..(34708)
<223> complement label=E4\orf6
<220>
<221> misc_feature
<222> (34617)..(34979)
<223> complement label=E4\orf4
<220>
<221> misc_feature
<222> (34992)..(35342)
<223> complement label=E4\orf3
<220>
<221> misc_feature
<222> (35342)..(35728)
<223> complement label=E4\orf2
<220>
<221> misc_feature
<222> (35772)..(36143)
<223> complement label=E4\orf1
<220>
<221> repeat_region
<222> (36428)..(36553)
<223> complement label=ITR
<400> 1
catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg aggcgtttga 60
atttggggat gcggggcggt gattggcgga gagaagggcg accgttaggg gcggggcggg 120
tgacgttttg atgacgtgtt tgtgaggcgg agccggtttg caagttctcg tgggaaaagt 180
gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240
ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300
aatgaggaag tgaaaatctg agtaatttcg cgtttatggc agggaggagt atttgccgag 360
ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420
ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggt gtcagctgat cgccagggta 480
tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540
cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600
gtaatgtttt cctggctact gggaacgaga ttctggaact ggtggtggac gccatgatgg 660
gtgacgaccc tccggagccc cctaccccat ttgaggcgcc ttcgctgtac gatttgtatg 720
atctggaggt ggatgtgccc gagaacgacc ccaacgagga ggcggtgaat gatttgttta 780
gcgatgccgc gctgctggct gccgagcagg ctaatatgga ctctggctca gacagcgatt 840
cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900
aagagctcga cctgcgatgt tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960
aggagacgat tcgagctgcg gcgaaccagg gagtgaaagc tgcgggcgag agctttagcc 1020
tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080
ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140
acagtaagtg tgattaactt tagctggggg ggcagagggt gactgggtgc tgactggttt 1200
atttatgtat atgtttttta tgtgtaggtc ccgtctctga cgcagatgag acccccactt 1260
cagagtgcat ttcatcaccc ccagaaattg gcgaggaacc gcccgaagat attattcata 1320
gaccagttgc agtgagagtc accgggcgga gagcagctgt ggagagtttg gatgacttgc 1380
tacagggtgg ggatgaacct ttggacttgt gtacccggaa acgccccagg cactaagtgc 1440
cacacatgtg tgtttactta aggtgatgtc agtatttata gggtgtggag tgcaataaaa 1500
atatgtgttg actttaagtg cgtgttttat gactcagggg tggggactgt gggtatataa 1560
gcaggtgcag acctgtgtgg tcagttcaga gcaggactca tggagatctg gacggtcttg 1620
gaagactttc accagactag acagctgcta gagaactcat cggaggaagt ctcctacctg 1680
tggagattct gcttcggtgg gcctctagct aagctagtct atagggccaa acaggattat 1740
aaggatcaat ttgaggatat tttgagagag tgtcctggta tttttgactc tctcaacttg 1800
ggccatcagt ctcactttaa ccagagtatt ctgagagccc ttgacttttc tactcctggc 1860
agaactaccg ccgcggtagc cttttttgcc tttatccttg acaa atg gag tca aga 1916
Met Glu Ser Arg
1
aac cca ttt cag cag gga tta ccg tct gga ctg ctt agc agt agc ttt 1964
Asn Pro Phe Gln Gln Gly Leu Pro Ser Gly Leu Leu Ser Ser Ser Phe
5 10 15 20
gtg gag aac atg gag gtg cca gcg cct gaa tgc aat ctc cgg cta ctt 2012
Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn Leu Arg Leu Leu
25 30 35
gcc agt aca gcc ggt aga cac gct gag gat cct gag tct cca gtc acc 2060
Ala Ser Thr Ala Gly Arg His Ala Glu Asp Pro Glu Ser Pro Val Thr
40 45 50
cca gga aca cca acg ccg cca gca gcc gca gca gga gca gca gca aga 2108
Pro Gly Thr Pro Thr Pro Pro Ala Ala Ala Ala Gly Ala Ala Ala Arg
55 60 65
gga gga gga gga ccg aga aga gaa ccc gag agc cgg tct gga ccc tcc 2156
Gly Gly Gly Gly Pro Arg Arg Glu Pro Glu Ser Arg Ser Gly Pro Ser
70 75 80
ggt ggc gga gga gga gga gta gct gac ttg ttt ccc gag ctg cgc cgg 2204
Gly Gly Gly Gly Gly Gly Val Ala Asp Leu Phe Pro Glu Leu Arg Arg
85 90 95 100
gtg ctg act agg tct tcc agt gga cgg gag agg ggg att aag cgg gag 2252
Val Leu Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly Ile Lys Arg Glu
105 110 115
agg cat gag gag act agc cac aga acc gaa ctg act gtc agt ctg atg 2300
Arg His Glu Glu Thr Ser His Arg Thr Glu Leu Thr Val Ser Leu Met
120 125 130
agc cgc agg cgc cca gaa tcg gtg tgg tgg cat gag gtg cag tcg cag 2348
Ser Arg Arg Arg Pro Glu Ser Val Trp Trp His Glu Val Gln Ser Gln
135 140 145
ggg ata gat gag gtc tca gtg atg cat gag aaa tat tcc cta gaa caa 2396
Gly Ile Asp Glu Val Ser Val Met His Glu Lys Tyr Ser Leu Glu Gln
150 155 160
gtc aag act tgt tgg ttg gag ccc gag gat gat tgg gag gta gcc atc 2444
Val Lys Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp Glu Val Ala Ile
165 170 175 180
agg aat tat gcc aag ctg gct ctg aag cca gac aag aag tac aag att 2492
Arg Asn Tyr Ala Lys Leu Ala Leu Lys Pro Asp Lys Lys Tyr Lys Ile
185 190 195
acc aaa ctg att aat atc aga aat tcc tgc tac att tca ggg aat ggg 2540
Thr Lys Leu Ile Asn Ile Arg Asn Ser Cys Tyr Ile Ser Gly Asn Gly
200 205 210
gcc gag gtg gag atc agt acc cag gag agg gcg gcc ttc aga tgt tgt 2588
Ala Glu Val Glu Ile Ser Thr Gln Glu Arg Ala Ala Phe Arg Cys Cys
215 220 225
atg atg aat atg tac ccg ggg gtg gtg ggc atg gag gga gtc acc ttt 2636
Met Met Asn Met Tyr Pro Gly Val Val Gly Met Glu Gly Val Thr Phe
230 235 240
atg aac acg agg ttc agg ggt gat ggg tat aat ggg gtg gtc ttt atg 2684
Met Asn Thr Arg Phe Arg Gly Asp Gly Tyr Asn Gly Val Val Phe Met
245 250 255 260
gcc aac acc aag ctg aca gtg cac gga tgc tcc ttc ttt ggc ttc aat 2732
Ala Asn Thr Lys Leu Thr Val His Gly Cys Ser Phe Phe Gly Phe Asn
265 270 275
aac atg tgc atc gag gcc tgg ggc agt gtt tca gtg agg gga tgc agt 2780
Asn Met Cys Ile Glu Ala Trp Gly Ser Val Ser Val Arg Gly Cys Ser
280 285 290
ttt tca gcc aac tgg atg ggg atc gtg ggc agg acc aag agt gtg ctg 2828
Phe Ser Ala Asn Trp Met Gly Ile Val Gly Arg Thr Lys Ser Val Leu
295 300 305
tct gtg aag aaa tgc ttg ttc gag agg tgc cac ctg ggg gtg atg agc 2876
Ser Val Lys Lys Cys Leu Phe Glu Arg Cys His Leu Gly Val Met Ser
310 315 320
gag ggc gaa gcc aga atc cgc cac tgc gcc tct acc gag acg ggc tgc 2924
Glu Gly Glu Ala Arg Ile Arg His Cys Ala Ser Thr Glu Thr Gly Cys
325 330 335 340
ttt gtg ctg tgc aag ggc aat gct aag atc aag cat aat atg atc tgt 2972
Phe Val Leu Cys Lys Gly Asn Ala Lys Ile Lys His Asn Met Ile Cys
345 350 355
gga gcc tcg gac gag cgc ggc tac cag atg ctg acc tgc gcc ggt ggg 3020
Gly Ala Ser Asp Glu Arg Gly Tyr Gln Met Leu Thr Cys Ala Gly Gly
360 365 370
aac agc cat atg cta gcc acc gtg cat gtg gcc tcc cat gct cgc aag 3068
Asn Ser His Met Leu Ala Thr Val His Val Ala Ser His Ala Arg Lys
375 380 385
ccc tgg ccc gag ttc gag cac aat gtc atg acc agg tgc aat atg cat 3116
Pro Trp Pro Glu Phe Glu His Asn Val Met Thr Arg Cys Asn Met His
390 395 400
ctg ggg tcc cgc cga ggc atg ttc atg ccc tac cag tgc aac ctg aat 3164
Leu Gly Ser Arg Arg Gly Met Phe Met Pro Tyr Gln Cys Asn Leu Asn
405 410 415 420
tat gtg aag gtg ctg ctg gag ccc gat gcc atg tcc aga gtg agc ctg 3212
Tyr Val Lys Val Leu Leu Glu Pro Asp Ala Met Ser Arg Val Ser Leu
425 430 435
acg ggg gtg ttt gac atg aat gtg gat gtg tgg aag att ctg aga tat 3260
Thr Gly Val Phe Asp Met Asn Val Asp Val Trp Lys Ile Leu Arg Tyr
440 445 450
gat gaa tcc aag acc agg tgc cga gcc tgc gag tgc gga ggg aag cat 3308
Asp Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys Gly Gly Lys His
455 460 465
gcc agg ttc cag ccc gtg tgt gtg gat gtg acg gag gac ctg cga ccc 3356
Ala Arg Phe Gln Pro Val Cys Val Asp Val Thr Glu Asp Leu Arg Pro
470 475 480
gat cat ttg gtg ttg tcc tgc acc ggg acg gag ttc ggt tcc agc ggg 3404
Asp His Leu Val Leu Ser Cys Thr Gly Thr Glu Phe Gly Ser Ser Gly
485 490 495 500
gaa gaa tct gac tagagtgagt agtgttctgg ggcgggggag gacctgcatg 3456
Glu Glu Ser Asp
agggccagaa tgactgaaat ctgtgctttt ctgtgtgttg cagcagc atg agc gga 3512
Met Ser Gly
505
agc ggc tcc ttt gag gga ggg gta ttc agc cct tat ctg acg ggg cgt 3560
Ser Gly Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu Thr Gly Arg
510 515 520
ctc ccc tcc tgg gcg gga gtg cgt cag aat gtg atg gga tcc acg gtg 3608
Leu Pro Ser Trp Ala Gly Val Arg Gln Asn Val Met Gly Ser Thr Val
525 530 535
gac ggc cgg ccc gtg cag ccc gcg aac tct tca acc ctg acc tat gca 3656
Asp Gly Arg Pro Val Gln Pro Ala Asn Ser Ser Thr Leu Thr Tyr Ala
540 545 550 555
acc ctg agc tct tcg tcg gtg gac gca gct gcc gcc gca gct gct gca 3704
Thr Leu Ser Ser Ser Ser Val Asp Ala Ala Ala Ala Ala Ala Ala Ala
560 565 570
tct gcc gcc agc gcc gtg cgc gga atg gcc atg ggc gcc ggc tac tac 3752
Ser Ala Ala Ser Ala Val Arg Gly Met Ala Met Gly Ala Gly Tyr Tyr
575 580 585
ggc act ctg gtg gcc aac tcg agt tcc acc aat aat ccc gcc agc ctg 3800
Gly Thr Leu Val Ala Asn Ser Ser Ser Thr Asn Asn Pro Ala Ser Leu
590 595 600
aac gag gag aag ctg ctg ctg ctg atg gca cag ctc gag gcc ttg acc 3848
Asn Glu Glu Lys Leu Leu Leu Leu Met Ala Gln Leu Glu Ala Leu Thr
605 610 615
cag cgc ctg ggc gag ctg acc cag cag gtg gct cag ctg cag gag cag 3896
Gln Arg Leu Gly Glu Leu Thr Gln Gln Val Ala Gln Leu Gln Glu Gln
620 625 630 635
acg cgg gct gcg gtt gcc acg gtg aaa tcc aaa taaaaaatga ttcaataaat 3949
Thr Arg Ala Ala Val Ala Thr Val Lys Ser Lys
640 645
aaacggagac ggttgttgat tttaacacag agtctgaatc tttatttgat ttttcgcgcg 4009
cggtaggccc tggaccaccg gtctcgatca ttgagcaccc ggtggatctt ttccaggacc 4069
cggtagaggt gggcttggat gttgaggtac atgggcatga gcccgtcccg ggggtggagg 4129
tagctccatt gcagggcctc gtgctcgggg gtggtgttgt aaatcaccca gtcatagcag 4189
gggcgcaggg cgtggtgttg cacaatatct ttgaggagga gactgatggc cacgggcagc 4249
cctttggtgt aggtgtttac aaatctgtta agctgggagg gatgcatgcg gggggagatg 4309
aggtgcatct tggcctggat cttgagattg gcgatgttgc cgcccagatc ccgcctgggg 4369
ttcatgttgt gcaggaccac cagcacggtg tatccggtgc acttggggaa tttatcatgc 4429
aacttggaag ggaaggcgtg aaagaatttg gcgacgccct tgtgcccgcc caggttttcc 4489
atgcactcat ccatgatgat ggcgatgggc ccgtgggcgg cggcctgggc aaagacgttt 4549
cgggggtcgg acacatcata gttgtggtcc tgggtgagat catcataggc cattttaatg 4609
aatttggggc ggagggtgcc cgattggggg acaaaggtac cctcgatccc gggggcgtag 4669
ttcccctcac agatctgcat ctcccaggct ttgagctcgg agggggggat catgtccacc 4729
tgcggggcga taaagaacac ggtttccggg gcgggggaga tgagctgggc cgaaagcaag 4789
ttgcggagca gctgtgactt gccgcagccg gtggggccgt agatgacccc gatgaccggc 4849
tgcaggtggt agttgaggga gagacagctg ccgtcctccc ggaggagggg ggccacctcg 4909
ttcatcatct cgcgcacatg catgttctcg cgcaccagtt ccgccaggag gcgctctccc 4969
cccagggata ggagctcctg gagcgaggcg aagtttttca gcggcttgag tccgtcggcc 5029
atgggcattt tggagagggt ctgttgcaag agttccaagc ggtcccagag ctcggtgatg 5089
tgctctacgg catctcgatc cagcagacct cctcgtttcg cgggttggga cgactgcggg 5149
agtagggcac cagacgatgg gcgtccagcg cagccagggt ccggtccttc cagggtcgca 5209
gcgtccgcgt cagggtggtc tccgtcacgg tgaaggggtg cgcgccgggc tgggcgcttg 5269
cgagggtgcg cttcaggctc atccggctgg tcgaaaaccg ctcccgatcg gcgccctgcg 5329
cgtcggccag gtagcaattg accatgagtt cgtagttgag cgcctcggcc gcgtggcctt 5389
tggcgcggag cttacctttg gaagtctgcc cgcaggcggg acagaggagg gacttgaggg 5449
cgtagagctt gggggcgagg aagacggact cgggggcgta ggcgtccgcg ccgcagtggg 5509
cgcagacggt ctcgcactcc acaagccagg tgaggtcggg ctggtcgggg tcaaaaacca 5569
gtttcccgcc gttctttttg atgcgtttct tacctttggt ctccatgagc tcgtgtcccc 5629
gctgggtgac aaagaggctg tccgtgtccc cgtagaccga ctttatgggc cggtcctcga 5689
gcggtgtgcc gcggtcctcc tcgtagagga accccgccca ctccgagacg aaagcccggg 5749
tccaggccag cacgaaggag gccacgtggg acgggtagcg gtcgttgtcc accagcgggt 5809
ccaccttttc cagggtatgc aaacacatgt ccccctcgtc cacatccagg aaggtgattg 5869
gcttgtaagt gtaggccacg tgaccggggg tcccagccgg gggggtataa aagggggcgg 5929
gcccctgctc gtcctcactg tcttccggat cgctgtccag gagcgccagc tgttggggta 5989
ggtattccct ctcgaaggcg ggcatgacct cggcactcag gttgtcagtt tctagaaacg 6049
aggaggattt gatattgacg gtgccgttgg agacgccttt catgagcccc tcgtccatct 6109
ggtcagaaaa gacgatcttt ttgttgtcga gcttggtggc gaaggagccg tagagggcgt 6169
tggagagcag cttggcgatg gagcgcatgg tctggttctt ttccttgtcg gcgcgctcct 6229
tggcggcgat gttgagctgc acgtactcgc gcgccacgca cttccattcg gggaatacgg 6289
tggtgagctc gtcgggcacg attctgaccc gccagccgcg gttgtgcagg gtgatgaggt 6349
ccacgctggt ggccacctcg ccgcgcaggg gctcgttggt ccagcagagg cgcccgccct 6409
tgcgcgagca gaaggggggc agcgggtcca gcatgagctc gtcggggggg tcggcgtcca 6469
cggtgaagat gccgggcagg agctcggggt cgaagtagct gatgcaggtg cccagatcgt 6529
ccagacttgc ttgccagtcg cgcacggcca gcgcgcgctc gtaggggctg aggggcgtgc 6589
cccagggcat ggggtgcgtg agcgcggagg cgtacatgcc gcagatgtcg tagacgtaga 6649
ggggctcctc gaggacgccg atgtaggtgg ggtagcagcg ccccccgcgg atgctggcgc 6709
gcacgtagtc gtacagctcg tgcgagggcg cgaggagccc cgtgccgaga ttggagcgct 6769
gcggcttttc ggcgcggtag acgatctggc ggaagatggc gtgggagttg gaggagatgg 6829
tgggcctctg gaagatgttg aagtgggcgt ggggcagtcc gaccgagtcc ctgatgaagt 6889
gggcgtagga gtcctgcagc ttggcgacga gctcggcggt gacgaggacg tccagggcgc 6949
agtagtcgag ggtctcttgg atgatgtcgt acttgagctg gcccttctgc ttccacagct 7009
cgcggttgag aaggaactct tcgcggtcct tccagtactc ttcgaggggg aacccgtcct 7069
gatcggcacg gtaagagccc accatgtaga actggttgac ggccttgtag gcgcagcagc 7129
ccttctccac ggggagggcg taagcttgcg cggccttgcg cagggaggtg tgggtgaggg 7189
cgaaggtgtc gcgcaccatg accttgagga actggtgctt gaagtcgagg tcgtcgcagc 7249
cgccctgctc ccagagctgg aagtccgtgc gcttcttgta ggcggggttg ggcaaagcga 7309
aagtaacatc gttgaagagg atcttgcccg cgcggggcat gaagttgcga gtgatgcgga 7369
aaggctgggg cacatcggcc cggttgttga tgacctgggc ggcgaggacg atctcgtcga 7429
agccgttgat gttgtgcccg acgatgtaga gttccacgaa tcgcgggcgg cccttgacgt 7489
ggggcagctt cttgagctcg tcgtaggtga gctcggcggg gtcgctgagc ccgtgctgct 7549
cgagggccca gtcggcgacg tgggggttgg cgcggaggaa ggaagtccag agatccacag 7609
ccagggcggt ctgcaagcgg tcccggtact gacggaactg ctggcccacg gccatttttt 7669
cgggggtgac gcagtagaag gtgcgggggt cgccgtgcca gcggtcccac ttgagctgga 7729
gggcgaggtc gtgggcgagc tcgacgagcg gcgggtcccc ggagagtttc atgaccagca 7789
tgaaggggac gagctgcttg ccgaaggacc ccatccaggt gtaggtttcc acatcgtagg 7849
tgaggaagag cctttcggtg cgaggatgcg agccgatggg gaagaactgg atctcctgcc 7909
accagttgga ggaatggctg ttgatgtgat ggaagtagaa atgccgacgg cgcgccgagc 7969
actcgtgctt gtgtttatac aagcgtccgc agtgctcgca acgctgcacg ggatgcacgt 8029
gctgcacgag ctgtacctgg gttcctttga cgaggaattt cagtgggcag tggagcgctg 8089
gcggctgcat ctggtgctgt actacgtcct ggccatcggc gtggccatcg tctgcctcga 8149
tggtggtcat gctgacgagc ccgcgcggga ggcaggtcca gacctcggct cggacgggtc 8209
ggagagcgag gacgagggcg cgcaggccgg agctgtccag ggtcctgaga cgctgcggag 8269
tcaggtcagt gggcagcggc ggcgcgcggt tgacttgcag gagcttttcc agggcgcgcg 8329
ggaggtccag atggtacttg atctccacgg cgccgttggt ggcgacgtcc acggcttgca 8389
gggtgccgtg cccctggggc gccaccaccg tgccccgttt cttcttgggc gctggcggcg 8449
ttggcgctgc ttccatgtcg gtcagaagcg gcggcgagga cgcgcgccgg gcggcagggg 8509
cggctcgggg cccggaggca ggggcggcag gggcacgtcg gcgccgcgcg cgggcaggtt 8569
ctggtactgc gcccggagaa gactggcgtg agcgacgacg cgacggttga cgtcctggat 8629
ctgacgcctc tgggtgaagg ccacgggacc cgtgagtttg aacctgaaag agagttcgac 8689
agaatcaatc tcggtatcgt tgacggcggc ctgccgcagg atctcttgca cgtcgcccga 8749
gttgtcctgg taggcgatct cggtcatgaa ctgctcgatc tcctcctcct gaaggtctcc 8809
gcggccggcg cgctcgacgg tggccgcgag gtcgttggag atgcggccca tgagctgcga 8869
gaaggcgttc atgccggcct cgttccagac gcggctgtag accacggctc cgtcggggtc 8929
gcgcgcgcgc atgaccacct gggcgaggtt gagctcgacg tggcgcgtga agaccgcgta 8989
gttgcagagg cgctggtaga ggtagttgag cgtggtggcg atgtgctcgg tgacgaagaa 9049
gtacatgatc cagcggcgga gcggcatctc gctgacgtcg cccagggctt ccaagcgctc 9109
catggcctcg tagaagtcca cggcgaagtt gaaaaactgg gagttgcgcg ccgagacggt 9169
caactcctcc tccagaagac ggatgagctc ggcgatggtg gcgcgcacct cgcgctcgaa 9229
ggccccgggg ggctcctctt cttccatctc ctcctcctct tcctcctcca ctaacatctc 9289
ttctacttcc tcctcaggag gcggcggcgg gggaggggcc ctgcgtcgcc ggcggcgcac 9349
gggcagacgg tcgatgaagc gctcgatggt ctccccgcgc cggcgacgca tggtctcggt 9409
gacggcgcgc ccgtcctcgc ggggccgcag cgtgaagacg ccgccgcgca tctccaggtg 9469
gccgccgggg gggcctccgt tgggcaggga gagggcgctg acgatgcatc ttatcaattg 9529
gcccgtaggg actccgcgca aggacctgag cgtctcgaga tccacgggat ccgaaaaccg 9589
ctgaacgaag gcttcgagcc agtcgcagtc gcaaggtagg ctgagcacgg tttcttctgg 9649
cgggtctggc tggggagcgg ggcgggcgat gctgctggtg atgaagttga aataggcggt 9709
tctgagacgg cggatggtgg cgaggagcac caggtccttg ggcccggctt gctggatgcg 9769
cagacggtcg gccatgcccc aggcgtggtc ctgacacctg gcgaggtcct tgtagtagtc 9829
ctgcatgagc cgctccacgg gcacctcctc ctcgcccgcg cggccgtgca tgcgcgtgag 9889
cccgaacccg cgctgcggct ggacgagcgc caggtcggcg acgacgcgct cggcgaggat 9949
ggcctgctgg atctgggtga gggtggtctg gaagtcgtcg aagtcgacga agcggtggta 10009
ggctccggtg ttgatggtgt aggagcagtt ggccatgacg gaccagttga cggtctggtg 10069
gccggggcgc acgagctcgt ggtacttgag gcgcgagtag gcgcgcgtgt cgaagatgta 10129
gtcgttgcag gtgcgcacga ggtactggta tccgacgagg aagtgcggcg gcggctggcg 10189
gtagagcggc catcgctcgg tggcgggggc gccgggcgcg aggtcctcga gcatgaggcg 10249
gtggtagccg tagatgtacc tggacatcca ggtgatgccg gcggcggtgg tggaggcgcg 10309
cgggaactcg cggacgcggt tccagatgtt gcgcagcggc aggaagtagt tcatggtggc 10369
cgcggtctgg cccgtgaggc gcgcgcagtc gtggatgctc tagacatacg ggcaaaaacg 10429
aaagcggtca gcggctcgac tccgtggcct ggaggctaag cgaacgggtt gggctgcgcg 10489
tgtaccccgg ttcgaatctc gaatcaggct ggagccgcag ctaacgtggt actggcactc 10549
ccgtctcgac ccaagcctgc taacgaaacc tccaggatac ggaggcgggt cgttttttgg 10609
ccttggccgc tggtcatgaa aaactagtaa gcgcggaaag cggccgtccg cgatggctcg 10669
ctgccgtagt ctggagaaag aatcgccagg gttgcgttgc ggtgtgcccc ggttcgagcc 10729
tcagcgctcg gcgccggccg gattccgcgg ctaacgtggg cgtggctgcc ccgtcgtttc 10789
caagacccct tagccagccg acttctccag ttacggagcg agcccctctt tttcttgtgt 10849
ttttgccag atg cat ccc gta ctg cgg cag atg cgc ccc cac cct cca cca 10900
Met His Pro Val Leu Arg Gln Met Arg Pro His Pro Pro Pro
650 655 660
caa ccg ccc cta ccg ccg cag cag cag caa cag ccg gcg ctt ctg ccc 10948
Gln Pro Pro Leu Pro Pro Gln Gln Gln Gln Gln Pro Ala Leu Leu Pro
665 670 675
ccg ccc cag cag cag cag cag cca gcc act acc gcg gcg gcc gcc gtg 10996
Pro Pro Gln Gln Gln Gln Gln Pro Ala Thr Thr Ala Ala Ala Ala Val
680 685 690
agc gga gcc ggc gtt cag tat gac ctg gcc ttg gaa gag ggc gag ggg 11044
Ser Gly Ala Gly Val Gln Tyr Asp Leu Ala Leu Glu Glu Gly Glu Gly
695 700 705
ctg gcg cgg ctg ggg gcg tcg tcg ccg gag cgg cac ccg cgc gtg cag 11092
Leu Ala Arg Leu Gly Ala Ser Ser Pro Glu Arg His Pro Arg Val Gln
710 715 720
atg aaa agg gac gct cgc gag gcc tac gtg ccc aag cag aac ctg ttc 11140
Met Lys Arg Asp Ala Arg Glu Ala Tyr Val Pro Lys Gln Asn Leu Phe
725 730 735 740
aga gac agg agc ggc gag gag ccc gag gag atg cgc gcg gcc cgg ttc 11188
Arg Asp Arg Ser Gly Glu Glu Pro Glu Glu Met Arg Ala Ala Arg Phe
745 750 755
cac gcg ggg cgg gag ctg cgg cgc ggc ctg gac cga aag agg gtg ctg 11236
His Ala Gly Arg Glu Leu Arg Arg Gly Leu Asp Arg Lys Arg Val Leu
760 765 770
agg gac gag gat ttc gag gcg gac gag ctg acg ggg atc agc ccc gcg 11284
Arg Asp Glu Asp Phe Glu Ala Asp Glu Leu Thr Gly Ile Ser Pro Ala
775 780 785
cgc gcg cac gtg gcc gcg gcc aac ctg gtc acg gcg tac gag cag acc 11332
Arg Ala His Val Ala Ala Ala Asn Leu Val Thr Ala Tyr Glu Gln Thr
790 795 800
gtg aag gag gag agc aac ttc caa aaa tcc ttc aac aac cac gtg cgc 11380
Val Lys Glu Glu Ser Asn Phe Gln Lys Ser Phe Asn Asn His Val Arg
805 810 815 820
acc ctg atc gcg cgc gag gag gtg acc ctg ggc ctg atg cac ctg tgg 11428
Thr Leu Ile Ala Arg Glu Glu Val Thr Leu Gly Leu Met His Leu Trp
825 830 835
gac ctg ctg gag gcc atc gtg cag aac ccc acc agc aag ccg ctg acg 11476
Asp Leu Leu Glu Ala Ile Val Gln Asn Pro Thr Ser Lys Pro Leu Thr
840 845 850
gcg cag ctg ttc ctg gtg gtg cag cat agt cgg gac aac gag gcg ttc 11524
Ala Gln Leu Phe Leu Val Val Gln His Ser Arg Asp Asn Glu Ala Phe
855 860 865
agg gag gcg ctg ctg aat atc acc gag ccc gag ggc cgc tgg ctc ctg 11572
Arg Glu Ala Leu Leu Asn Ile Thr Glu Pro Glu Gly Arg Trp Leu Leu
870 875 880
gac ctg gtg aac att ctg cag agc atc gtg gtg cag gag cgc ggg ctg 11620
Asp Leu Val Asn Ile Leu Gln Ser Ile Val Val Gln Glu Arg Gly Leu
885 890 895 900
ccg ctg tcc gag aag ctg gcg gcc atc aac ttc tcg gtg ctg agt ctg 11668
Pro Leu Ser Glu Lys Leu Ala Ala Ile Asn Phe Ser Val Leu Ser Leu
905 910 915
ggc aag tac tac gct agg aag atc tac aag acc ccg tac gtg ccc ata 11716
Gly Lys Tyr Tyr Ala Arg Lys Ile Tyr Lys Thr Pro Tyr Val Pro Ile
920 925 930
gac aag gag gtg aag atc gac ggg ttt tac atg cgc atg acc ctg aaa 11764
Asp Lys Glu Val Lys Ile Asp Gly Phe Tyr Met Arg Met Thr Leu Lys
935 940 945
gtg ctg acc ctg agc gac gat ctg ggg gtg tac cgc aac gac agg atg 11812
Val Leu Thr Leu Ser Asp Asp Leu Gly Val Tyr Arg Asn Asp Arg Met
950 955 960
cac cgc gcg gtg agc gcc agc agg cgg cgc gag ctg agc gac cag gag 11860
His Arg Ala Val Ser Ala Ser Arg Arg Arg Glu Leu Ser Asp Gln Glu
965 970 975 980
ctg atg cac agc ctg cag cgg gcc ctg acc ggg gcc ggg acc gag ggg 11908
Leu Met His Ser Leu Gln Arg Ala Leu Thr Gly Ala Gly Thr Glu Gly
985 990 995
gag agc tac ttt gac atg ggc gcg gac ctg cac tgg cag ccc agc 11953
Glu Ser Tyr Phe Asp Met Gly Ala Asp Leu His Trp Gln Pro Ser
1000 1005 1010
cgc cgg gcc ttg gaa gcg gcg gca gga ccc tac gta gaa gag gtg 11998
Arg Arg Ala Leu Glu Ala Ala Ala Gly Pro Tyr Val Glu Glu Val
1015 1020 1025
gac gat gag gtg gac gag gag ggc gag tac ctg gaa gac tgatggcgcg 12047
Asp Asp Glu Val Asp Glu Glu Gly Glu Tyr Leu Glu Asp
1030 1035
accgtatttt tgctag atg caa caa cag cca cct cct gat ccc gcg atg 12096
Met Gln Gln Gln Pro Pro Pro Asp Pro Ala Met
1040 1045 1050
cgg gcg gcg ctg cag agc cag ccg tcc ggc att aac tcc tcg gac 12141
Arg Ala Ala Leu Gln Ser Gln Pro Ser Gly Ile Asn Ser Ser Asp
1055 1060 1065
gat tgg acc cag gcc atg caa cgc atc atg gcg ctg acg acc cgc 12186
Asp Trp Thr Gln Ala Met Gln Arg Ile Met Ala Leu Thr Thr Arg
1070 1075 1080
aac ccc gaa gcc ttt aga cag cag ccc cag gcc aac cgg ctc tcg 12231
Asn Pro Glu Ala Phe Arg Gln Gln Pro Gln Ala Asn Arg Leu Ser
1085 1090 1095
gcc atc ctg gag gcc gtg gtg ccc tcg cgc tcc aac ccc acg cac 12276
Ala Ile Leu Glu Ala Val Val Pro Ser Arg Ser Asn Pro Thr His
1100 1105 1110
gag aag gtc ctg gcc atc gtg aac gcg ctg gtg gag aac aag gcc 12321
Glu Lys Val Leu Ala Ile Val Asn Ala Leu Val Glu Asn Lys Ala
1115 1120 1125
atc cgc ggc gac gag gcc ggc ctg gtg tac aac gcg ctg ctg gag 12366
Ile Arg Gly Asp Glu Ala Gly Leu Val Tyr Asn Ala Leu Leu Glu
1130 1135 1140
cgc gtg gcc cgc tac aac agc acc aac gtg cag acc aac ctg gac 12411
Arg Val Ala Arg Tyr Asn Ser Thr Asn Val Gln Thr Asn Leu Asp
1145 1150 1155
cgc atg gtg acc gac gtg cgc gag gcc gtg gcc cag cgc gag cgg 12456
Arg Met Val Thr Asp Val Arg Glu Ala Val Ala Gln Arg Glu Arg
1160 1165 1170
ttc cac cgc gag tcc aac ctg gga tcc ctg gtg gcg ctg aac gcc 12501
Phe His Arg Glu Ser Asn Leu Gly Ser Leu Val Ala Leu Asn Ala
1175 1180 1185
ttc ctc agc acc cag ccc gcc aac gtg ccc cgg ggc cag gag gac 12546
Phe Leu Ser Thr Gln Pro Ala Asn Val Pro Arg Gly Gln Glu Asp
1190 1195 1200
tac acc aac ttc atc agc gcc ctg cgc ctg atg gtg acc gag gtg 12591
Tyr Thr Asn Phe Ile Ser Ala Leu Arg Leu Met Val Thr Glu Val
1205 1210 1215
ccc cag agc gag gtg tac cag tcc ggg ccg gac tac ttc ttc cag 12636
Pro Gln Ser Glu Val Tyr Gln Ser Gly Pro Asp Tyr Phe Phe Gln
1220 1225 1230
acc agt cgc cag ggc ttg cag acc gtg aac ctg agc cag gcg ttc 12681
Thr Ser Arg Gln Gly Leu Gln Thr Val Asn Leu Ser Gln Ala Phe
1235 1240 1245
aag aac ttg cag ggc ctg tgg ggc gtg cag gcc ccg gtc ggg gac 12726
Lys Asn Leu Gln Gly Leu Trp Gly Val Gln Ala Pro Val Gly Asp
1250 1255 1260
cgc gcg acg gtg tcg agc ctg ctg acg ccg aac tcg cgc ctg ctg 12771
Arg Ala Thr Val Ser Ser Leu Leu Thr Pro Asn Ser Arg Leu Leu
1265 1270 1275
ctg ctg ctg gtg gcc ccc ttc acg gac agc ggc agc atc aac cgc 12816
Leu Leu Leu Val Ala Pro Phe Thr Asp Ser Gly Ser Ile Asn Arg
1280 1285 1290
aac tcg tac ctg ggc tac ctg att aac ctg tac cgc gag gcc atc 12861
Asn Ser Tyr Leu Gly Tyr Leu Ile Asn Leu Tyr Arg Glu Ala Ile
1295 1300 1305
ggc cag gcg cac gtg gac gag cag acc tac cag gag atc acc cac 12906
Gly Gln Ala His Val Asp Glu Gln Thr Tyr Gln Glu Ile Thr His
1310 1315 1320
gtg agc cgc gcc ctg ggc cag gac gac ccg ggc aat ctg gaa gcc 12951
Val Ser Arg Ala Leu Gly Gln Asp Asp Pro Gly Asn Leu Glu Ala
1325 1330 1335
acc ctg aac ttt ttg ctg acc aac cgg tcg cag aaa atc ccg ccc 12996
Thr Leu Asn Phe Leu Leu Thr Asn Arg Ser Gln Lys Ile Pro Pro
1340 1345 1350
cag tac gcc ctc agc gcc gag gag gag cgc att ctg cga tac gtg 13041
Gln Tyr Ala Leu Ser Ala Glu Glu Glu Arg Ile Leu Arg Tyr Val
1355 1360 1365
cag cag agc gtg ggc ctg ttc ctg atg cag gag ggg gcc acc ccc 13086
Gln Gln Ser Val Gly Leu Phe Leu Met Gln Glu Gly Ala Thr Pro
1370 1375 1380
agc gcc gcg ctc gac atg acc gcg cgc aac atg gag ccc agc atg 13131
Ser Ala Ala Leu Asp Met Thr Ala Arg Asn Met Glu Pro Ser Met
1385 1390 1395
tac gcc agc aac cgc ccg ttc atc aat aaa ctg atg gac tac ttg 13176
Tyr Ala Ser Asn Arg Pro Phe Ile Asn Lys Leu Met Asp Tyr Leu
1400 1405 1410
cat cgg gcg gcc gcc atg aac tcg gac tat ttc acc aac gcc atc 13221
His Arg Ala Ala Ala Met Asn Ser Asp Tyr Phe Thr Asn Ala Ile
1415 1420 1425
ctg aat ccc cac tgg ctc ccg ccg ccg ggg ttt tac acg ggc gag 13266
Leu Asn Pro His Trp Leu Pro Pro Pro Gly Phe Tyr Thr Gly Glu
1430 1435 1440
tac gac atg ccc gac ccc aat gac ggg ttc ctg tgg gac gat gtg 13311
Tyr Asp Met Pro Asp Pro Asn Asp Gly Phe Leu Trp Asp Asp Val
1445 1450 1455
gac agc agc gtg ttc tcc ccc cga ccg ggt gct aac gag cgc ccc 13356
Asp Ser Ser Val Phe Ser Pro Arg Pro Gly Ala Asn Glu Arg Pro
1460 1465 1470
ttg tgg aag aag gaa ggc agc gac cga cgc ccg tcc tcg gcg ctg 13401
Leu Trp Lys Lys Glu Gly Ser Asp Arg Arg Pro Ser Ser Ala Leu
1475 1480 1485
tcc ggc cgc gag ggt gct gcc gcg gcg gtg ccc gag gcc gcc agt 13446
Ser Gly Arg Glu Gly Ala Ala Ala Ala Val Pro Glu Ala Ala Ser
1490 1495 1500
cct ttc ccg agc ttg ccc ttc tcg ctg aac agt att cgc agc agc 13491
Pro Phe Pro Ser Leu Pro Phe Ser Leu Asn Ser Ile Arg Ser Ser
1505 1510 1515
gag ctg ggc agg atc acg cgc ccg cgt ttg ctg ggc gag gag gag 13536
Glu Leu Gly Arg Ile Thr Arg Pro Arg Leu Leu Gly Glu Glu Glu
1520 1525 1530
tac ttg aat gac tcg ctg ttg aga ccc gag cgg gag aag aac ttc 13581
Tyr Leu Asn Asp Ser Leu Leu Arg Pro Glu Arg Glu Lys Asn Phe
1535 1540 1545
ccc aat aac ggg ata gag agc ctg gtg gac aag atg agc cgc tgg 13626
Pro Asn Asn Gly Ile Glu Ser Leu Val Asp Lys Met Ser Arg Trp
1550 1555 1560
aag acg tac gcg cag gag cac agg gac gat ccg tcg cag ggg gcc 13671
Lys Thr Tyr Ala Gln Glu His Arg Asp Asp Pro Ser Gln Gly Ala
1565 1570 1575
acg agc cgg ggc agc gcc gcc cgt aaa cgc cgg tgg cac gac agg 13716
Thr Ser Arg Gly Ser Ala Ala Arg Lys Arg Arg Trp His Asp Arg
1580 1585 1590
cag cgg gga ctg atg tgg gac gat gag gat tcc gcc gac gac agc 13761
Gln Arg Gly Leu Met Trp Asp Asp Glu Asp Ser Ala Asp Asp Ser
1595 1600 1605
agc gtg ttg gac ttg ggt ggg agt ggt ggt aac ccg ttc gct cac 13806
Ser Val Leu Asp Leu Gly Gly Ser Gly Gly Asn Pro Phe Ala His
1610 1615 1620
ctg cgc ccc cgc atc ggg cgc atg atg taagaaaccg aaaataaatg 13853
Leu Arg Pro Arg Ile Gly Arg Met Met
1625
atactcacca aggccatggc gaccagcgtg cgttcgtttc ttctctgttg ttgtatctag 13913
t atg atg agg cgt gcg tac ccg gag ggt cct cct ccc tcg tac gag 13959
Met Met Arg Arg Ala Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu
1630 1635 1640
agc gtg atg cag cag gcg atg gcg gcg gcg gcg atg cag ccc ccg 14004
Ser Val Met Gln Gln Ala Met Ala Ala Ala Ala Met Gln Pro Pro
1645 1650 1655
ctg gag gct cct tac gtg ccc ccg cgg tac ctg gcg cct acg gag 14049
Leu Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu
1660 1665 1670
ggg cgg aac agc att cgt tac tcg gag ctg gca ccc ttg tac gat 14094
Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp
1675 1680 1685
acc acc cgg ttg tac ctg gtg gac aac aag tcg gcg gac atc gcc 14139
Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala
1690 1695 1700
tcg ctg aac tac cag aac gac cac agc aac ttc ctg acc acc gtg 14184
Ser Leu Asn Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val
1705 1710 1715
gtg cag aac aat gac ttc acc ccc acg gag gcc agc acc cag acc 14229
Val Gln Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gln Thr
1720 1725 1730
atc aac ttt gac gag cgc tcg cgg tgg ggc ggc cag ctg aaa acc 14274
Ile Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly Gln Leu Lys Thr
1735 1740 1745
atc atg cac acc aac atg ccc aac gtg aac gag ttc atg tac agc 14319
Ile Met His Thr Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser
1750 1755 1760
aac aag ttc aag gcg cgg gtc atg gtc tcc cgc aag acc ccc aac 14364
Asn Lys Phe Lys Ala Arg Val Met Val Ser Arg Lys Thr Pro Asn
1765 1770 1775
ggg gtg gga gag gat tat gat ggt agt cag gat gag ctg aaa tac 14409
Gly Val Gly Glu Asp Tyr Asp Gly Ser Gln Asp Glu Leu Lys Tyr
1780 1785 1790
gaa tgg gtg gag ttt gag ctg ccc gaa ggc aac ttc tcg gtg acc 14454
Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser Val Thr
1795 1800 1805
atg acc atc gac ctg atg aac aac gcc atc atc gac aat tac ttg 14499
Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile Asp Asn Tyr Leu
1810 1815 1820
gcg gtg ggg cgg cag aac ggg gtc ctg gag agc gat atc ggc gtg 14544
Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp Ile Gly Val
1825 1830 1835
aag ttc gac act agg aac ttc agg ctg ggg tgg gac ccc gtg acc 14589
Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr
1840 1845 1850
gag ctg gtc atg ccc ggg gtg tac acc aac gag gcc ttc cat ccc 14634
Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro
1855 1860 1865
gat att gtc ttg ctg ccc ggc tgc ggg gtg gac ttc acc gag agc 14679
Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser
1870 1875 1880
cgc ctc agc aac ctg ctg ggc att cgc aag agg cag cca ttc cag 14724
Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln
1885 1890 1895
gag ggt ttc cag atc atg tac gag gat ctg gag ggg ggc aac atc 14769
Glu Gly Phe Gln Ile Met Tyr Glu Asp Leu Glu Gly Gly Asn Ile
1900 1905 1910
ccc gca ctc ctg gat gtc gac gcc tat gag aaa agc aag gag gaa 14814
Pro Ala Leu Leu Asp Val Asp Ala Tyr Glu Lys Ser Lys Glu Glu
1915 1920 1925
gca gca gct gag gca acc gca gcc gta gcc acc gcc tct acc gag 14859
Ala Ala Ala Glu Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu
1930 1935 1940
gtc agg ggc gat aat ttt gcc agc cct gca gca gtg gca gcg gcc 14904
Val Arg Gly Asp Asn Phe Ala Ser Pro Ala Ala Val Ala Ala Ala
1945 1950 1955
gag gcg gct gaa acc gaa agt aag ata gtc att cag ccg gtg gag 14949
Glu Ala Ala Glu Thr Glu Ser Lys Ile Val Ile Gln Pro Val Glu
1960 1965 1970
aag gat agc aag aac agg agc tac aac gta cta ccg gac aag ata 14994
Lys Asp Ser Lys Asn Arg Ser Tyr Asn Val Leu Pro Asp Lys Ile
1975 1980 1985
aac acc gcc tac cgc agc tgg tac ctg gcc tac aac tat ggc gac 15039
Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp
1990 1995 2000
ccc gag aag ggc gtg cgc tcc tgg acg ctg ctc acc acc tcg gac 15084
Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr Ser Asp
2005 2010 2015
gtc acc tgc ggc gtg gag caa gtc tac tgg tcg ctg ccc gac atg 15129
Val Thr Cys Gly Val Glu Gln Val Tyr Trp Ser Leu Pro Asp Met
2020 2025 2030
atg caa gac ccg gtc acc ttc cgc tcc acg cgt caa gtt agc aac 15174
Met Gln Asp Pro Val Thr Phe Arg Ser Thr Arg Gln Val Ser Asn
2035 2040 2045
tac ccg gtg gtg ggc gcc gag ctc ctg ccc gtc tac tcc aag agc 15219
Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser
2050 2055 2060
ttc ttc aac gag cag gcc gtc tac tcg cag cag ctg cgc gcc ttc 15264
Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln Leu Arg Ala Phe
2065 2070 2075
acc tcg ctc acg cac gtc ttc aac cgc ttt ccc gag aac cag atc 15309
Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gln Ile
2080 2085 2090
ctc gtc cgc ccg ccc gcg ccc acc att acc acc gtc agt gaa aac 15354
Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn
2095 2100 2105
gtt cct gct ctc aca gat cac ggg acc ctg ccg ctg cgc agc agt 15399
Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser
2110 2115 2120
atc cgg gga gtc cag cgc gtg acc gtt act gac gcc aga cgc cgc 15444
Ile Arg Gly Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg
2125 2130 2135
acc tgc ccc tac gtc tac aag gcc ctg ggc ata gtc gcg ccg cgc 15489
Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Ile Val Ala Pro Arg
2140 2145 2150
gtc ctc tcg agc cgc acc ttc taaaaa atg tcc att ctc atc tcg ccc 15537
Val Leu Ser Ser Arg Thr Phe Met Ser Ile Leu Ile Ser Pro
2155 2160 2165
agt aat aac acc ggt tgg ggc ctg cgc gcg ccc agc aag atg tac 15582
Ser Asn Asn Thr Gly Trp Gly Leu Arg Ala Pro Ser Lys Met Tyr
2170 2175 2180
gga ggc gct cgc caa cgc tcc acg caa cac ccc gtg cgc gtg cgc 15627
Gly Gly Ala Arg Gln Arg Ser Thr Gln His Pro Val Arg Val Arg
2185 2190 2195
ggg cac ttc cgc gct ccc tgg ggc gcc ctc aag ggc cgc gtg cgc 15672
Gly His Phe Arg Ala Pro Trp Gly Ala Leu Lys Gly Arg Val Arg
2200 2205 2210
tcg cgc acc acc gtc gac gac gtg atc gac cag gtg gtg gcc gac 15717
Ser Arg Thr Thr Val Asp Asp Val Ile Asp Gln Val Val Ala Asp
2215 2220 2225
gcg cgc aac tac acg ccc gcc gcc gcg ccc gcc tcc acc gtg gac 15762
Ala Arg Asn Tyr Thr Pro Ala Ala Ala Pro Ala Ser Thr Val Asp
2230 2235 2240
gcc gtc atc gac agc gtg gtg gcc gac gcg cgc cgg tac gcc cgc 15807
Ala Val Ile Asp Ser Val Val Ala Asp Ala Arg Arg Tyr Ala Arg
2245 2250 2255
gcc aag agc cgg cgg cgg cgc atc gcc cgg cgg cac cgg agc acc 15852
Ala Lys Ser Arg Arg Arg Arg Ile Ala Arg Arg His Arg Ser Thr
2260 2265 2270
ccc gcc atg cgc gcg gcg cga gcc ttg ctg cgc agg gcc agg cgc 15897
Pro Ala Met Arg Ala Ala Arg Ala Leu Leu Arg Arg Ala Arg Arg
2275 2280 2285
acg gga cgc agg gcc atg ctc agg gcg gcc aga cgc gcg gct tca 15942
Thr Gly Arg Arg Ala Met Leu Arg Ala Ala Arg Arg Ala Ala Ser
2290 2295 2300
ggc gcc agc gcc ggc agg acc cgg aga cgc gcg gcc acg gcg gcg 15987
Gly Ala Ser Ala Gly Arg Thr Arg Arg Arg Ala Ala Thr Ala Ala
2305 2310 2315
gca gcg gcc atc gcc agc atg tcc cgc ccg cgg cga ggg aac gtg 16032
Ala Ala Ala Ile Ala Ser Met Ser Arg Pro Arg Arg Gly Asn Val
2320 2325 2330
tac tgg gtg cgc gac gcc gcc acc ggt gtg cgc gtg ccc gtg cgc 16077
Tyr Trp Val Arg Asp Ala Ala Thr Gly Val Arg Val Pro Val Arg
2335 2340 2345
acc cgc ccc cct cgc act tgaagatgtt cacttcgcga tgttgatgtg 16125
Thr Arg Pro Pro Arg Thr
2350
tcccagcggc gagg atg tcc aag cgc aaa ttc aag gaa gag atg ctc 16172
Met Ser Lys Arg Lys Phe Lys Glu Glu Met Leu
2355 2360 2365
cag gtc atc gcg cct gag atc tac ggc ccc gcg gcg gcg gtg aag 16217
Gln Val Ile Ala Pro Glu Ile Tyr Gly Pro Ala Ala Ala Val Lys
2370 2375 2380
gag gaa aga aag ccc cgc aaa ctg aag cgg gtc aaa aag gac aaa 16262
Glu Glu Arg Lys Pro Arg Lys Leu Lys Arg Val Lys Lys Asp Lys
2385 2390 2395
aag gag gag gaa gat gac gga ctg gtg gag ttt gtg cgc gag ttc 16307
Lys Glu Glu Glu Asp Asp Gly Leu Val Glu Phe Val Arg Glu Phe
2400 2405 2410
gcc ccc cgg cgg cgc gtg cag tgg cgc ggg cgg aag gtg cag ccg 16352
Ala Pro Arg Arg Arg Val Gln Trp Arg Gly Arg Lys Val Gln Pro
2415 2420 2425
gtg ctg aga ccc ggc acc acc gtg gtc ttc acg ccc ggc gag cgc 16397
Val Leu Arg Pro Gly Thr Thr Val Val Phe Thr Pro Gly Glu Arg
2430 2435 2440
tcc ggc acc gct tcc aag cgc tcc tac gac gag gtg tac ggg gat 16442
Ser Gly Thr Ala Ser Lys Arg Ser Tyr Asp Glu Val Tyr Gly Asp
2445 2450 2455
gat gat att ctg gag cag gcg gcc gag cgc ctg ggc gag ttt gct 16487
Asp Asp Ile Leu Glu Gln Ala Ala Glu Arg Leu Gly Glu Phe Ala
2460 2465 2470
tac ggc aag cgc agc cgc tcc gcg ctg aag gaa gag gcg gtg tcc 16532
Tyr Gly Lys Arg Ser Arg Ser Ala Leu Lys Glu Glu Ala Val Ser
2475 2480 2485
atc ccg ctg gac cac ggc aac ccc acg ccg agc ctc aag ccc gtg 16577
Ile Pro Leu Asp His Gly Asn Pro Thr Pro Ser Leu Lys Pro Val
2490 2495 2500
acc ctg cag cag gtg ctg ccg agc gcg gcg ccg cga agg ggg ttc 16622
Thr Leu Gln Gln Val Leu Pro Ser Ala Ala Pro Arg Arg Gly Phe
2505 2510 2515
aag cgc gag ggc gag gat ctg tat ccc acc atg cag ctg atg gtg 16667
Lys Arg Glu Gly Glu Asp Leu Tyr Pro Thr Met Gln Leu Met Val
2520 2525 2530
ccc aaa cgc cag aag ctg gaa gac gtg ctg gaa acc atg aag gtg 16712
Pro Lys Arg Gln Lys Leu Glu Asp Val Leu Glu Thr Met Lys Val
2535 2540 2545
gac ccg gac gtg cag ccc gag gtc aag gtg cgg ccc atc aag cag 16757
Asp Pro Asp Val Gln Pro Glu Val Lys Val Arg Pro Ile Lys Gln
2550 2555 2560
gtg gcc ccg ggt ctg ggc gtg cag acc gtg gac atc aag atc ccc 16802
Val Ala Pro Gly Leu Gly Val Gln Thr Val Asp Ile Lys Ile Pro
2565 2570 2575
acg gag ccc atg gaa acg cag acc gag ccc atg atc aag ccc agt 16847
Thr Glu Pro Met Glu Thr Gln Thr Glu Pro Met Ile Lys Pro Ser
2580 2585 2590
acc agc acc atg gag gtg cag acg gat ccc tgg atg cca gcc gcc 16892
Thr Ser Thr Met Glu Val Gln Thr Asp Pro Trp Met Pro Ala Ala
2595 2600 2605
ccc acc agc agc cga aga ccc cgg cgc aag tac ggc gcg gcc agc 16937
Pro Thr Ser Ser Arg Arg Pro Arg Arg Lys Tyr Gly Ala Ala Ser
2610 2615 2620
ctg ctg atg ccc aac tac gcg ctg cat cct tcc atc atc ccc acg 16982
Leu Leu Met Pro Asn Tyr Ala Leu His Pro Ser Ile Ile Pro Thr
2625 2630 2635
ccg ggc tac cgc ggc acg cgc ttc tac cgc ggt cat aca acc agc 17027
Pro Gly Tyr Arg Gly Thr Arg Phe Tyr Arg Gly His Thr Thr Ser
2640 2645 2650
tcc cgc cgc cgc aag acc acc acc cgc cgc cgt cgt cgc agc cgc 17072
Ser Arg Arg Arg Lys Thr Thr Thr Arg Arg Arg Arg Arg Ser Arg
2655 2660 2665
cgc agc agc acc gcg act tcc gcc gcc gcc ctg gtg cgg aga gtg 17117
Arg Ser Ser Thr Ala Thr Ser Ala Ala Ala Leu Val Arg Arg Val
2670 2675 2680
tac cgc agc ggg cgc gag cct ctg acc ctg ccg cgc gcg cgc tac 17162
Tyr Arg Ser Gly Arg Glu Pro Leu Thr Leu Pro Arg Ala Arg Tyr
2685 2690 2695
cac ccg agc atc gcc att taactctgcc gtcgcctcct tgcagat atg gcc 17213
His Pro Ser Ile Ala Ile Met Ala
2700
ctc aca tgc cgc ctc cgc gtc ccc att acg ggc tac cga gga aga 17258
Leu Thr Cys Arg Leu Arg Val Pro Ile Thr Gly Tyr Arg Gly Arg
2705 2710 2715
aag ccg cgc cgt aga agg ctg gcg ggg aac ggg atg cgt cgc cac 17303
Lys Pro Arg Arg Arg Arg Leu Ala Gly Asn Gly Met Arg Arg His
2720 2725 2730
cac cac cgg cgg cgg cgc gcc atc agc aag cgg ttg ggg gga ggc 17348
His His Arg Arg Arg Arg Ala Ile Ser Lys Arg Leu Gly Gly Gly
2735 2740 2745
ttc ctg ccc gcg ctg atc ccc atc atc gcc gcg gcg atc ggg gcg 17393
Phe Leu Pro Ala Leu Ile Pro Ile Ile Ala Ala Ala Ile Gly Ala
2750 2755 2760
atc ccc ggc att gct tcc gtg gcg gtg cag gcc tct cag cgc cac 17438
Ile Pro Gly Ile Ala Ser Val Ala Val Gln Ala Ser Gln Arg His
2765 2770 2775
tgagacacac ttggaaacat cttgtaataa acca atg gac tct gac gct cct 17490
Met Asp Ser Asp Ala Pro
2780
ggt cct gtg atg tgt ttt cgt aga cag atg gaa gac atc aat ttt 17535
Gly Pro Val Met Cys Phe Arg Arg Gln Met Glu Asp Ile Asn Phe
2785 2790 2795
tcg tcc ctg gct ccg cga cac ggc acg cgg ccg ttc atg ggc acc 17580
Ser Ser Leu Ala Pro Arg His Gly Thr Arg Pro Phe Met Gly Thr
2800 2805 2810
tgg agc gac atc ggc acc agc caa ctg aac ggg ggc gcc ttc aat 17625
Trp Ser Asp Ile Gly Thr Ser Gln Leu Asn Gly Gly Ala Phe Asn
2815 2820 2825
tgg agc agt ctc tgg agc ggg ctt aag aat ttc ggg tcc acg ctt 17670
Trp Ser Ser Leu Trp Ser Gly Leu Lys Asn Phe Gly Ser Thr Leu
2830 2835 2840
aaa acc tat ggc agc aag gcg tgg aac agc acc aca ggg cag gcg 17715
Lys Thr Tyr Gly Ser Lys Ala Trp Asn Ser Thr Thr Gly Gln Ala
2845 2850 2855
ctg agg gat aag ctg aaa gag cag aac ttc cag cag aag gtg gtc 17760
Leu Arg Asp Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys Val Val
2860 2865 2870
gat ggg ctc gcc tcg ggc atc aac ggg gtg gtg gac ctg gcc aac 17805
Asp Gly Leu Ala Ser Gly Ile Asn Gly Val Val Asp Leu Ala Asn
2875 2880 2885
cag gcc gtg cag cgg cag atc aac agc cgc ctg gac ccg gtg ccg 17850
Gln Ala Val Gln Arg Gln Ile Asn Ser Arg Leu Asp Pro Val Pro
2890 2895 2900
ccc gcc ggc tcc gtg gag atg ccg cag gtg gag gag gag ctg cct 17895
Pro Ala Gly Ser Val Glu Met Pro Gln Val Glu Glu Glu Leu Pro
2905 2910 2915
ccc ctg gac aag cgg ggc gag aag cga ccc cgc ccc gac gcg gag 17940
Pro Leu Asp Lys Arg Gly Glu Lys Arg Pro Arg Pro Asp Ala Glu
2920 2925 2930
gag acg ctg ctg acg cac acg gac gag ccg ccc ccg tac gag gag 17985
Glu Thr Leu Leu Thr His Thr Asp Glu Pro Pro Pro Tyr Glu Glu
2935 2940 2945
gcg gtg aaa ctg ggt ctg ccc acc acg cgg ccc atc gcg ccc ctg 18030
Ala Val Lys Leu Gly Leu Pro Thr Thr Arg Pro Ile Ala Pro Leu
2950 2955 2960
gcc acc ggg gtg ctg aaa ccc gaa agt aat aag ccc gcg acc ctg 18075
Ala Thr Gly Val Leu Lys Pro Glu Ser Asn Lys Pro Ala Thr Leu
2965 2970 2975
gac ttg cct cct ccc cag cct tct cgc ccc tcc aca gtg gct aag 18120
Asp Leu Pro Pro Pro Gln Pro Ser Arg Pro Ser Thr Val Ala Lys
2980 2985 2990
ccc ctg ccg ccg gtg gcc gtg gcc cgc gcg cga ccc ggg ggc acc 18165
Pro Leu Pro Pro Val Ala Val Ala Arg Ala Arg Pro Gly Gly Thr
2995 3000 3005
gcc cgc cct cat gcg aac tgg cag agc act ctg aac agc atc gtg 18210
Ala Arg Pro His Ala Asn Trp Gln Ser Thr Leu Asn Ser Ile Val
3010 3015 3020
ggt ctg gga gtg cag agt gtg aag cgc cgc cgc tgc tat taaacctacc 18259
Gly Leu Gly Val Gln Ser Val Lys Arg Arg Arg Cys Tyr
3025 3030 3035
gtagcgctta acttgcttgt ctgtgtgtgt atgtattatg tcgccgccgc cgctgtccac 18319
cagaaggagg agtgaagagg cgcgtcgccg agttgcaag atg gcc acc cca tcg 18373
Met Ala Thr Pro Ser
3040
atg ctg ccc cag tgg gcg tac ata cac atc gcc gga cag gac gct 18418
Met Leu Pro Gln Trp Ala Tyr Ile His Ile Ala Gly Gln Asp Ala
3045 3050 3055
tcg gag tac ctg agt ccg ggt ctg gtg cag ttc gcc cgc gcc aca 18463
Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala Arg Ala Thr
3060 3065 3070
gac acc tac ttc agt ctg ggg aac aag ttt agg aac ccc acg gtg 18508
Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr Val
3075 3080 3085
gcg ccc acg cac gat gtg acc acc gac cgc agc cag cgg ctg acg 18553
Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu Thr
3090 3095 3100
ctg cgc ttc gtg ccc gtg gac cgc gag gac aac acc tac tcg tac 18598
Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr
3105 3110 3115
aaa gtg cgc tac acg ctg gcc gtg ggc gac aac cgc gtg ctg gac 18643
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp
3120 3125 3130
atg gcc agc acc tac ttt gac atc cgc ggc gtg ctg gac cgg ggc 18688
Met Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly
3135 3140 3145
cct agc ttc aaa ccc tac tcc ggc acc gcc tac aac agc ctg gct 18733
Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala
3150 3155 3160
ccc aag gga gcg ccc aat tcc agc cag tgg gag caa aaa aag act 18778
Pro Lys Gly Ala Pro Asn Ser Ser Gln Trp Glu Gln Lys Lys Thr
3165 3170 3175
ggc aat aat gcc aat gga gat acg gag aat gtc act tat ggt gta 18823
Gly Asn Asn Ala Asn Gly Asp Thr Glu Asn Val Thr Tyr Gly Val
3180 3185 3190
gct gcc atg gga gga att gac atc gat aaa aat ggc ctt caa att 18868
Ala Ala Met Gly Gly Ile Asp Ile Asp Lys Asn Gly Leu Gln Ile
3195 3200 3205
gga acc gat gac acc aaa gat aac gat aat gac att tat gca gac 18913
Gly Thr Asp Asp Thr Lys Asp Asn Asp Asn Asp Ile Tyr Ala Asp
3210 3215 3220
aaa aca tat cag cct gag ccg caa ata gga gag gaa aac tgg caa 18958
Lys Thr Tyr Gln Pro Glu Pro Gln Ile Gly Glu Glu Asn Trp Gln
3225 3230 3235
gaa aca tat tcc tac tat gga ggt aga gct ctt aaa aaa gat acc 19003
Glu Thr Tyr Ser Tyr Tyr Gly Gly Arg Ala Leu Lys Lys Asp Thr
3240 3245 3250
aaa atg aag cca tgc tat ggc tca ttt gcc aga cct acc aat gtg 19048
Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Arg Pro Thr Asn Val
3255 3260 3265
aaa gga gga cag gca aaa ata aaa aca gat gga gat gtt aag tca 19093
Lys Gly Gly Gln Ala Lys Ile Lys Thr Asp Gly Asp Val Lys Ser
3270 3275 3280
ttt gac ata gac cta gcc ttc ttt gat att ccc aat tct ggc gcg 19138
Phe Asp Ile Asp Leu Ala Phe Phe Asp Ile Pro Asn Ser Gly Ala
3285 3290 3295
gga aat ggc aca aat gtt aac aat tat cca gat atg gtt atg tat 19183
Gly Asn Gly Thr Asn Val Asn Asn Tyr Pro Asp Met Val Met Tyr
3300 3305 3310
aca gaa aat gta aat ctg gaa acc cca gat act cat att gtg tac 19228
Thr Glu Asn Val Asn Leu Glu Thr Pro Asp Thr His Ile Val Tyr
3315 3320 3325
aaa cca gga act tca gat gac agc tca aag gtc aac ttg tgt cag 19273
Lys Pro Gly Thr Ser Asp Asp Ser Ser Lys Val Asn Leu Cys Gln
3330 3335 3340
caa tcc atg cct aac aga ccc aat tat att ggc ttc aga gac aat 19318
Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn
3345 3350 3355
ttt att ggg ctt atg tac tac aac agc act ggc aat atg ggt gtg 19363
Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val
3360 3365 3370
ctg gct ggt cag gcc tct caa ctg aat gcc gtg gtg gac ttg caa 19408
Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln
3375 3380 3385
gac aga aac aca gag ctg tcc tac cag ctc ttg ctt gac tct ctg 19453
Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu
3390 3395 3400
ggt gac aga acc agg tat ttc agt atg tgg aat cag gcg gtg gac 19498
Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp
3405 3410 3415
agt tat gat cct gat gtg cgc att att gaa aac cat ggt gtg gag 19543
Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu
3420 3425 3430
gat gaa ttg cca aac tat tgc ttc ccc ttg gat gga gca ggc acc 19588
Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Ala Gly Thr
3435 3440 3445
aat tcg gtt tac caa ggt gtt aaa cca aaa act gac aat ggc aac 19633
Asn Ser Val Tyr Gln Gly Val Lys Pro Lys Thr Asp Asn Gly Asn
3450 3455 3460
gat cag tgg gaa aca gat tcc aca gtt tca agt cac aat cag ata 19678
Asp Gln Trp Glu Thr Asp Ser Thr Val Ser Ser His Asn Gln Ile
3465 3470 3475
tgc aaa ggc aat atc tat gcc atg gag atc aac ctc cag gcc aac 19723
Cys Lys Gly Asn Ile Tyr Ala Met Glu Ile Asn Leu Gln Ala Asn
3480 3485 3490
ctg tgg aga agt ttt ctc tac tcg aac gtg gcc ctg tac ctg ccc 19768
Leu Trp Arg Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro
3495 3500 3505
gat tct tac aag tac acg ccg gcc aac atc acc ctg ccc acc aac 19813
Asp Ser Tyr Lys Tyr Thr Pro Ala Asn Ile Thr Leu Pro Thr Asn
3510 3515 3520
acc aac acc tac gat tac atg aac ggg aga gtg gtg cct ccc tcg 19858
Thr Asn Thr Tyr Asp Tyr Met Asn Gly Arg Val Val Pro Pro Ser
3525 3530 3535
ctg gtg gac gcc tac atc aac atc ggg gcg cgc tgg tcg ctg gac 19903
Leu Val Asp Ala Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp
3540 3545 3550
ccc atg gac aac gtg aat ccc ttc aac cac cac cgc aac gcg ggc 19948
Pro Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly
3555 3560 3565
ctg cgc tac cgc tcc atg ctc ctg ggc aac ggg cgc tac gtg ccc 19993
Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro
3570 3575 3580
ttc cac atc cag gtg ccc cag aaa ttt ttc gcc att aag agc ctc 20038
Phe His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Ser Leu
3585 3590 3595
ctg ctc ctg ccc ggg tcc tac acc tac gag tgg aac ttc cgc aag 20083
Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys
3600 3605 3610
gac gtc aac atg atc ctg cag agc tcc ctc ggc aac gac ctg cgc 20128
Asp Val Asn Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg
3615 3620 3625
acg gac ggg gcc tcc atc tcc ttc acc agc atc aac ctc tac gcc 20173
Thr Asp Gly Ala Ser Ile Ser Phe Thr Ser Ile Asn Leu Tyr Ala
3630 3635 3640
acc ttc ttc ccc atg gcg cac aac acg gct tcc acg ctc gag gcc 20218
Thr Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala
3645 3650 3655
atg ctg cgc aac gac acc aac gac cag tcc ttc aac gac tac ctc 20263
Met Leu Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu
3660 3665 3670
tcg gcg gcc aac atg ctc tac ccc atc ccg gcc aac gcc acc aac 20308
Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn
3675 3680 3685
gtg ccc atc tcc atc ccc tcg cgc aac tgg gcc gcc ttc cgc ggc 20353
Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly
3690 3695 3700
tgg tcc ttc acg cgc ctc aag acc aag gag acg ccc tcg ctg ggc 20398
Trp Ser Phe Thr Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu Gly
3705 3710 3715
tcc ggg ttc gac ccc tac ttc gtc tac tcg ggc tcc atc ccc tac 20443
Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr
3720 3725 3730
ctc gac ggc acc ttc tac ctc aac cac acc ttc aag aag gtc tcc 20488
Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser
3735 3740 3745
atc acc ttc gac tcc tcc gtc agc tgg ccc ggc aac gac cgg ctc 20533
Ile Thr Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu
3750 3755 3760
ctg acg ccc aac gag ttc gaa atc aag cgc acc gtc gac ggc gag 20578
Leu Thr Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu
3765 3770 3775
ggc tac aac gtg gcc cag tgc aac atg acc aag gac tgg ttc ctg 20623
Gly Tyr Asn Val Ala Gln Cys Asn Met Thr Lys Asp Trp Phe Leu
3780 3785 3790
gtc cag atg ctg gcc cac tac aac atc ggc tac cag ggc ttc tac 20668
Val Gln Met Leu Ala His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr
3795 3800 3805
gtg ccc gag ggc tac aag gac cgc atg tac tcc ttc ttc cgc aac 20713
Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn
3810 3815 3820
ttc cag ccc atg agc cgc cag gtg gtg gac gag gtc aac tac aag 20758
Phe Gln Pro Met Ser Arg Gln Val Val Asp Glu Val Asn Tyr Lys
3825 3830 3835
gac tac cag gcc gtc acc ctg gcc tac cag cac aac aac tcg ggc 20803
Asp Tyr Gln Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser Gly
3840 3845 3850
ttc gtc ggc tac ctc gcg ccc acc atg tgc cag ggc cag ccc tac 20848
Phe Val Gly Tyr Leu Ala Pro Thr Met Cys Gln Gly Gln Pro Tyr
3855 3860 3865
ccc gcc aac tac ccg tac ccg ctc atc ggc aag agc gcc gtc acc 20893
Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Thr
3870 3875 3880
agc gtc acc cag aaa aag ttc ctc tgc gac agg gtc atg tgg cgc 20938
Ser Val Thr Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg
3885 3890 3895
atc ccc ttc tcc agc aac ttc atg tcc atg ggc gcg ctc acc gac 20983
Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp
3900 3905 3910
ctc ggc cag aac atg ctc tat gcc aac tcc gcc cac gcg cta gac 21028
Leu Gly Gln Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp
3915 3920 3925
atg aat ttc gaa gtc gac ccc atg gat gag tcc acc ctt ctc tat 21073
Met Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr
3930 3935 3940
gtt gtc ttc gaa gtc ttc gac gtc gtc cga gtg cac cag ccc cac 21118
Val Val Phe Glu Val Phe Asp Val Val Arg Val His Gln Pro His
3945 3950 3955
cgc ggc gtc atc gag gcc gtc tac ctg cgc acc ccc ttc tcg gcc 21163
Arg Gly Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala
3960 3965 3970
ggt aac gcc acc acc taagctcttg cttcttgcaa gct atg gct gag ccc 21213
Gly Asn Ala Thr Thr Met Ala Glu Pro
3975 3980
acg ggc tcc ggc gag cag gag ctc agg gcc atc atc cgc gac ctg 21258
Thr Gly Ser Gly Glu Gln Glu Leu Arg Ala Ile Ile Arg Asp Leu
3985 3990 3995
ggc tgc ggg ccc tac ttc ctg ggc acc ttc gat aag cgc ttc ccg 21303
Gly Cys Gly Pro Tyr Phe Leu Gly Thr Phe Asp Lys Arg Phe Pro
4000 4005 4010
gga ttc atg gcc ccg cac aag ctg gcc tgc gcc atc gtc aac acg 21348
Gly Phe Met Ala Pro His Lys Leu Ala Cys Ala Ile Val Asn Thr
4015 4020 4025
gcc ggc cgc gag acc ggg ggc gag cac tgg ctg gcc ttc gcc tgg 21393
Ala Gly Arg Glu Thr Gly Gly Glu His Trp Leu Ala Phe Ala Trp
4030 4035 4040
aac ccg cgc tcg aac acc tgc tac ctc ttc gac ccc ttc ggg ttc 21438
Asn Pro Arg Ser Asn Thr Cys Tyr Leu Phe Asp Pro Phe Gly Phe
4045 4050 4055
tcg gac gag cgc ctc aag cag atc tac cag ttc gag tac gag ggc 21483
Ser Asp Glu Arg Leu Lys Gln Ile Tyr Gln Phe Glu Tyr Glu Gly
4060 4065 4070
ctg ctg cgc cgc agc gcc ctg gcc acc gag gac cgc tgc gtc acc 21528
Leu Leu Arg Arg Ser Ala Leu Ala Thr Glu Asp Arg Cys Val Thr
4075 4080 4085
ctg gaa aag tcc acc cag acc gtg cag ggt ccg cgc tcg gcc gcc 21573
Leu Glu Lys Ser Thr Gln Thr Val Gln Gly Pro Arg Ser Ala Ala
4090 4095 4100
tgc ggg ctc ttc tgc tgc atg ttc ctg cac gcc ttc gtg cac tgg 21618
Cys Gly Leu Phe Cys Cys Met Phe Leu His Ala Phe Val His Trp
4105 4110 4115
ccc gac cgc ccc atg gac aag aac ccc acc atg aac tta ctg acg 21663
Pro Asp Arg Pro Met Asp Lys Asn Pro Thr Met Asn Leu Leu Thr
4120 4125 4130
ggg gtg ccc aac ggc atg ctc cag tcg ccc cag gtg gaa ccc acc 21708
Gly Val Pro Asn Gly Met Leu Gln Ser Pro Gln Val Glu Pro Thr
4135 4140 4145
ctg cgc cgc aac cag gag gcg ctc tac cgc ttc ctc aac gcc cac 21753
Leu Arg Arg Asn Gln Glu Ala Leu Tyr Arg Phe Leu Asn Ala His
4150 4155 4160
tcc gcc tac ttt cgc tcc cac cgc gcg cgc atc gag aag gcc acc 21798
Ser Ala Tyr Phe Arg Ser His Arg Ala Arg Ile Glu Lys Ala Thr
4165 4170 4175
gcc ttc gac cgc atg aat caa gac atg taaaccgtgt gtgtatgtga 21845
Ala Phe Asp Arg Met Asn Gln Asp Met
4180 4185
atgctttatt cataataaac agcacatgtt tatgccacct tctctgaggc tctgacttta 21905
tttagaaatc gaaggggttc tgccggctct cggcgtgccc cgcgggcagg gatacgttgc 21965
ggaactggta cttgggcagc cacttgaact cggggatcag cagcttcggc acggggaggt 22025
cggggaacga gtcgctccac agcttgcgcg tgagttgcag ggcgcccagc aggtcgggcg 22085
cggagatctt gaaatcacag ttgggacccg cgttctgcgc gcgagagttg cggtacacgg 22145
ggttgcagca ctggaacacc atcagggccg ggtgcttcac gctcgccagc accgtcgcgt 22205
cggtgatgcc ctccatgtcc agatcctcgg cgttggccat cccgaagggg gtcatcttgc 22265
aggtctgccg ccccatgctg ggcacgcagc cgggcttgtg gttgcaatcg cagtgcaggg 22325
ggatcagcat catctgggcc tgttcggagc tcatgcccgg gtacatggcc ttcatgaaag 22385
cctccagctg gcggaaggcc tgctgcgcct tgccgccctc ggtgaagaag accccgcagg 22445
acttgctaga gaactggttg gtggcgcagc ccgcgtcgtg cacgcagcag cgcgcgtcgt 22505
tgttggccag ctgcaccacg ctgcgccccc agcggttctg ggtgatcttg gcccggtcgg 22565
ggttctcctt cagcgcgcgc tgcccgttct cgctcgccac atccatctcg atcgtgtgct 22625
ccttctggat catcacggtc ccgtgcaggc accgcagctt gccctcggcc tcggtgcagc 22685
cgtgcagcca cagcgcgcag ccggtgctct cccagttctt gtgggcgatc tgggagtgcg 22745
agtgcacgaa gccctgcagg aagcggccca tcatcgtggt cagggtcttg ttgctggtga 22805
aggtcagcgg gatgccgcgg tgctcctcgt tcacatacat gtggcagatg cggcggtaca 22865
cctcgccctg ctcgggcatc agctggaagg cggacttcag gtcgctctcc acgcggtacc 22925
ggtccatcag cagcgtcatg acttccatgc ccttctccca ggccgaaacg atcggcaggc 22985
tcagggggtt cttcaccgcc attgtcatct tagtcgccgc cgccgaggtc agggggtcgt 23045
tctcgtccag ggtctcaaac actcgcttgc cgtccttctc ggtgatgcgc acggggggga 23105
aggcgaagcc cacggccgcc agctcctcct cggcctgcct ttcgtcctcg ctgtcctggc 23165
tgatgtcttg caaaggcaca tgcttggtct tgcggggttt ctttttgggc ggcagaggcg 23225
gcggcgatgt gctgggcgag cgcgagttct cgctcaccac gactatttct tctccttggc 23285
cgtcgtccga gaccacgcgg cggtaggcat gcctcttctg gggcagaggc ggaggcgacg 23345
ggctctcgcg gttcggcggg cggctggcag agccccttcc gcgttcgggg gtgcgctcct 23405
ggcggcgctg ctctgactga cttcctccgc ggccggccat tgtgttctcc tagggagcaa 23465
gc atg gag act cag cca tcg tcg cca aca tcg cca tct gcc ccc gcc 23512
Met Glu Thr Gln Pro Ser Ser Pro Thr Ser Pro Ser Ala Pro Ala
4190 4195 4200
gcc acc gcc gac gag aac cag cag cag cag aat gaa agc tta acc 23557
Ala Thr Ala Asp Glu Asn Gln Gln Gln Gln Asn Glu Ser Leu Thr
4205 4210 4215
gcc ccg ccg ccc agc ccc acc tcc gac gcc gcg gcc cca gac atg 23602
Ala Pro Pro Pro Ser Pro Thr Ser Asp Ala Ala Ala Pro Asp Met
4220 4225 4230
caa gag atg gag gaa tcc atc gag att gac ctg ggc tac gtg acg 23647
Gln Glu Met Glu Glu Ser Ile Glu Ile Asp Leu Gly Tyr Val Thr
4235 4240 4245
ccc gcg gag cac gag gag gag ctg gca gcg cgc ttt tca gcc ccg 23692
Pro Ala Glu His Glu Glu Glu Leu Ala Ala Arg Phe Ser Ala Pro
4250 4255 4260
gaa gag aac cac caa gag cag cca gag cag gaa gca gag agc gag 23737
Glu Glu Asn His Gln Glu Gln Pro Glu Gln Glu Ala Glu Ser Glu
4265 4270 4275
cag agc cag gct ggg ctc gag cat ggc gac tac ctg agc ggg gca 23782
Gln Ser Gln Ala Gly Leu Glu His Gly Asp Tyr Leu Ser Gly Ala
4280 4285 4290
gag gac gtg ctc atc aag cat ctg gcc cgc caa tgc atc atc gtc 23827
Glu Asp Val Leu Ile Lys His Leu Ala Arg Gln Cys Ile Ile Val
4295 4300 4305
aag gat gcg ctg ctc gac cgc gcc gag gtg ccc ctc agc gtg gcg 23872
Lys Asp Ala Leu Leu Asp Arg Ala Glu Val Pro Leu Ser Val Ala
4310 4315 4320
gag ctc agc cgc gcc tac gag cgc aac ctc ttc tcg ccg cgc gtg 23917
Glu Leu Ser Arg Ala Tyr Glu Arg Asn Leu Phe Ser Pro Arg Val
4325 4330 4335
ccc ccc aag cgc cag ccc aac ggc acc tgc gag ccc aac ccg cgc 23962
Pro Pro Lys Arg Gln Pro Asn Gly Thr Cys Glu Pro Asn Pro Arg
4340 4345 4350
ctc aac ttc tac ccg gtc ttc gcg gtg ccc gag gcc ctg gcc acc 24007
Leu Asn Phe Tyr Pro Val Phe Ala Val Pro Glu Ala Leu Ala Thr
4355 4360 4365
tac cac ctc ttt ttc aag aac caa agg atc ccc gtc tcc tgc cgc 24052
Tyr His Leu Phe Phe Lys Asn Gln Arg Ile Pro Val Ser Cys Arg
4370 4375 4380
gcc aac cgc acc cgc gcc gac gcc ctg ctc aac ctg ggc ccc ggc 24097
Ala Asn Arg Thr Arg Ala Asp Ala Leu Leu Asn Leu Gly Pro Gly
4385 4390 4395
gcc cgc cta cct gat atc gcc tcc ttg gaa gag gtt ccc aag atc 24142
Ala Arg Leu Pro Asp Ile Ala Ser Leu Glu Glu Val Pro Lys Ile
4400 4405 4410
ttc gag ggt ctg ggc agc gac gag act cgg gcc gcg aac gct ctg 24187
Phe Glu Gly Leu Gly Ser Asp Glu Thr Arg Ala Ala Asn Ala Leu
4415 4420 4425
caa gga agc gga gag gag cat gag cac cac agc gcc ctg gtg gag 24232
Gln Gly Ser Gly Glu Glu His Glu His His Ser Ala Leu Val Glu
4430 4435 4440
ttg gaa ggc gac aac gcg cgc ctg gcg gtc ctc aag cgc acg gtc 24277
Leu Glu Gly Asp Asn Ala Arg Leu Ala Val Leu Lys Arg Thr Val
4445 4450 4455
gag ctg acc cac ttc gcc tac ccg gcg ctc aac ctg ccc ccc aag 24322
Glu Leu Thr His Phe Ala Tyr Pro Ala Leu Asn Leu Pro Pro Lys
4460 4465 4470
gtc atg agc gcc gtc atg gac cag gtg ctc atc aag cgc gcc tcg 24367
Val Met Ser Ala Val Met Asp Gln Val Leu Ile Lys Arg Ala Ser
4475 4480 4485
ccc ctc tcg gag gag gag atg cag gac ccc gag agc tcg gac gag 24412
Pro Leu Ser Glu Glu Glu Met Gln Asp Pro Glu Ser Ser Asp Glu
4490 4495 4500
ggc aag ccc gtg gtc agc gac gag cag ctg gcg cgc tgg ctg gga 24457
Gly Lys Pro Val Val Ser Asp Glu Gln Leu Ala Arg Trp Leu Gly
4505 4510 4515
gcg agt agc acc ccc cag agc ctg gaa gag cgg cgc aag ctc atg 24502
Ala Ser Ser Thr Pro Gln Ser Leu Glu Glu Arg Arg Lys Leu Met
4520 4525 4530
atg gcc gtg gtc ctg gtg acc gtg gag ctg gag tgt ctg cgc cgc 24547
Met Ala Val Val Leu Val Thr Val Glu Leu Glu Cys Leu Arg Arg
4535 4540 4545
ttc ttc gcc gac gcg gag acc ctg cgc aag gtc gag gag aac ctg 24592
Phe Phe Ala Asp Ala Glu Thr Leu Arg Lys Val Glu Glu Asn Leu
4550 4555 4560
cac tac ctc ttc agg cac ggg ttc gtg cgc cag gcc tgc aag atc 24637
His Tyr Leu Phe Arg His Gly Phe Val Arg Gln Ala Cys Lys Ile
4565 4570 4575
tcc aac gtg gag ctg acc aac ctg gtc tcc tac atg ggc atc ctg 24682
Ser Asn Val Glu Leu Thr Asn Leu Val Ser Tyr Met Gly Ile Leu
4580 4585 4590
cac gag aac cgc ctg ggg cag aac gtg ctg cac acc acc ctg cgc 24727
His Glu Asn Arg Leu Gly Gln Asn Val Leu His Thr Thr Leu Arg
4595 4600 4605
ggg gag gcc cgc cgc gac tac atc cgc gac tgc gtc tac ctg tac 24772
Gly Glu Ala Arg Arg Asp Tyr Ile Arg Asp Cys Val Tyr Leu Tyr
4610 4615 4620
ctc tgc cac acc tgg cag acg ggc atg ggc gtg tgg cag cag tgc 24817
Leu Cys His Thr Trp Gln Thr Gly Met Gly Val Trp Gln Gln Cys
4625 4630 4635
ctg gag gag cag aac ctg aaa gag ctc tgc aag ctc ctg cag aag 24862
Leu Glu Glu Gln Asn Leu Lys Glu Leu Cys Lys Leu Leu Gln Lys
4640 4645 4650
aac ctg aag gcc ctg tgg acc ggg ttc gac gag cgc acc acc gcc 24907
Asn Leu Lys Ala Leu Trp Thr Gly Phe Asp Glu Arg Thr Thr Ala
4655 4660 4665
tcg gac ctg gcc gac ctc atc ttc ccc gag cgc ctg cgg ctg acg 24952
Ser Asp Leu Ala Asp Leu Ile Phe Pro Glu Arg Leu Arg Leu Thr
4670 4675 4680
ctg cgc aac ggg ctg ccc gac ttt atg agc caa agc atg ttg caa 24997
Leu Arg Asn Gly Leu Pro Asp Phe Met Ser Gln Ser Met Leu Gln
4685 4690 4695
aac ttt cgc tct ttc atc ctc gaa cgc tcc ggg atc ctg ccc gcc 25042
Asn Phe Arg Ser Phe Ile Leu Glu Arg Ser Gly Ile Leu Pro Ala
4700 4705 4710
acc tgc tcc gcg ctg ccc tcg gac ttc gtg ccg ctg acc ttc cgc 25087
Thr Cys Ser Ala Leu Pro Ser Asp Phe Val Pro Leu Thr Phe Arg
4715 4720 4725
gag tgc ccc ccg ccg ctc tgg agc cac tgc tac ctg ctg cgt ctg 25132
Glu Cys Pro Pro Pro Leu Trp Ser His Cys Tyr Leu Leu Arg Leu
4730 4735 4740
gcc aac tac ctg gcc tac cac tcg gac gtg atc gag gac gtc agc 25177
Ala Asn Tyr Leu Ala Tyr His Ser Asp Val Ile Glu Asp Val Ser
4745 4750 4755
ggc gag ggt ctg ctc gag tgc cac tgt cgc tgc aac ctc tgc acg 25222
Gly Glu Gly Leu Leu Glu Cys His Cys Arg Cys Asn Leu Cys Thr
4760 4765 4770
ccg cac cgc tcc ctg gcc tgc aac ccc cag ctg ctg agc gag acc 25267
Pro His Arg Ser Leu Ala Cys Asn Pro Gln Leu Leu Ser Glu Thr
4775 4780 4785
cag atc atc ggc acc ttc gag ttg caa ggc ccc ggc gag gag ggc 25312
Gln Ile Ile Gly Thr Phe Glu Leu Gln Gly Pro Gly Glu Glu Gly
4790 4795 4800
aag ggg ggt ctg aaa ctc acc ccg ggg ctg tgg acc tcg gcc tac 25357
Lys Gly Gly Leu Lys Leu Thr Pro Gly Leu Trp Thr Ser Ala Tyr
4805 4810 4815
ttg cgc aag ttc gtg ccc gag gac tac cat ccc ttc gag atc agg 25402
Leu Arg Lys Phe Val Pro Glu Asp Tyr His Pro Phe Glu Ile Arg
4820 4825 4830
ttc tac gag gac caa tcc cag ccg ccc aag gcc gag ctg tcg gcc 25447
Phe Tyr Glu Asp Gln Ser Gln Pro Pro Lys Ala Glu Leu Ser Ala
4835 4840 4845
tgc gtc atc acc cag ggg gcc atc ctg gcc caa ttg caa gcc atc 25492
Cys Val Ile Thr Gln Gly Ala Ile Leu Ala Gln Leu Gln Ala Ile
4850 4855 4860
cag aaa tcc cgc caa gaa ttt ctg ctg aaa aag ggc cac ggg gtc 25537
Gln Lys Ser Arg Gln Glu Phe Leu Leu Lys Lys Gly His Gly Val
4865 4870 4875
tac ttg gac ccc cag acc gga gag gag ctc aac ccc agc ttc ccc 25582
Tyr Leu Asp Pro Gln Thr Gly Glu Glu Leu Asn Pro Ser Phe Pro
4880 4885 4890
cag gat gcc ccg agg aag cag caa gaa gct gaa agt gga gct gcc 25627
Gln Asp Ala Pro Arg Lys Gln Gln Glu Ala Glu Ser Gly Ala Ala
4895 4900 4905
gcc gga gga ttt gga gga aga ctg gga gag cag tca ggc aga gga 25672
Ala Gly Gly Phe Gly Gly Arg Leu Gly Glu Gln Ser Gly Arg Gly
4910 4915 4920
gga gga gat gga aga ctg gga cag cac tca ggc aga gga gga cag 25717
Gly Gly Asp Gly Arg Leu Gly Gln His Ser Gly Arg Gly Gly Gln
4925 4930 4935
cct gca aga cag tct gga gga aga cga ggt gga gga gga ggc aga 25762
Pro Ala Arg Gln Ser Gly Gly Arg Arg Gly Gly Gly Gly Gly Arg
4940 4945 4950
gga aga agc agc cgc cgc cag acc gtc gtc ctc ggc gga gga gga 25807
Gly Arg Ser Ser Arg Arg Gln Thr Val Val Leu Gly Gly Gly Gly
4955 4960 4965
gaa agc aag cag cac gga tac cat ctc cgc tcc ggg tcg ggg tcg 25852
Glu Ser Lys Gln His Gly Tyr His Leu Arg Ser Gly Ser Gly Ser
4970 4975 4980
cgg cgg ccg ggc cca cag tagatgggac gagaccgggc gcttcccgaa 25900
Arg Arg Pro Gly Pro Gln
4985
ccccaccacc cagaccggta agaaggagcg gcagggatac aagtcctggc gggggcacaa 25960
aaacgccatc gtctcctgct tgcaagcctg cgggggcaac atctccttca cccggcgcta 26020
cctgctcttc caccgcgggg tgaacttccc ccgcaacatc ttgcattact accgtcacct 26080
ccacagcccc tactactgtt tccaagaaga ggcagaaacc cagcagcagc agcagaaaac 26140
cagcagcagc tagaaaatcc acggcggcag gtggactgag gatcgcggcg aacgagccgg 26200
cgcagacccg ggagctgagg aaccggatct ttcccaccct ctatgccatc ttccagcaga 26260
gtcgggggca ggagcaggaa ctgaaagtca agaaccgttc tctgcgctcg ctcacccgca 26320
gttgtctgta tcacaagagc gaagaccaac ttcagcgcac tctcgaggac gccgaggctc 26380
tcttcaacaa gtactgcgcg ctcactctta aagagtagcc cgcgcccgcc cagccgcaga 26440
aaaaggcggg aattacgtca cctgtgccct tcgcccgacc atc atg agc aaa gag 26495
Met Ser Lys Glu
4990
att ccc acg cct tac atg tgg agc tac cag ccc cag atg ggc ctg 26540
Ile Pro Thr Pro Tyr Met Trp Ser Tyr Gln Pro Gln Met Gly Leu
4995 5000 5005
gcc gcc ggc gcc gcc cag gac tac tcc acc cgc atg aat tgg ctc 26585
Ala Ala Gly Ala Ala Gln Asp Tyr Ser Thr Arg Met Asn Trp Leu
5010 5015 5020
agc gcc ggg ccc gcg atg atc tca cgg gtg aat gac atc cgc gcc 26630
Ser Ala Gly Pro Ala Met Ile Ser Arg Val Asn Asp Ile Arg Ala
5025 5030 5035
cac cga aac cag ata ctc cta gaa cag tca gcg ctc acc gcc acg 26675
His Arg Asn Gln Ile Leu Leu Glu Gln Ser Ala Leu Thr Ala Thr
5040 5045 5050
ccc cgc aat cac ctc aac ccg cgt aat tgg ccc gcc gcc ctg gtg 26720
Pro Arg Asn His Leu Asn Pro Arg Asn Trp Pro Ala Ala Leu Val
5055 5060 5065
tac cag gaa att ccc cag ccc acg acc gta cta ctt ccg cga gac 26765
Tyr Gln Glu Ile Pro Gln Pro Thr Thr Val Leu Leu Pro Arg Asp
5070 5075 5080
gcc cag gcc gaa gtc cag ctg act aac tca ggt gtc cag ctg gcg 26810
Ala Gln Ala Glu Val Gln Leu Thr Asn Ser Gly Val Gln Leu Ala
5085 5090 5095
ggc ggc gcc acc ctg tgt cgt cac cgc ccc gct cag ggt ata aag 26855
Gly Gly Ala Thr Leu Cys Arg His Arg Pro Ala Gln Gly Ile Lys
5100 5105 5110
cgg ctg gtg atc cgg ggc aga ggc aca cag ctc aac gac gag gtg 26900
Arg Leu Val Ile Arg Gly Arg Gly Thr Gln Leu Asn Asp Glu Val
5115 5120 5125
gtg agc tct tcg ctg ggt ctg cga cct gac gga gtc ttc caa ctc 26945
Val Ser Ser Ser Leu Gly Leu Arg Pro Asp Gly Val Phe Gln Leu
5130 5135 5140
gcc gga tcg ggg aga tct tcc ttc acg cct cgt cag gcc gtg ctg 26990
Ala Gly Ser Gly Arg Ser Ser Phe Thr Pro Arg Gln Ala Val Leu
5145 5150 5155
act ttg gag agt tcg tcc tcg cag ccc cgc tcg ggc ggc atc ggc 27035
Thr Leu Glu Ser Ser Ser Ser Gln Pro Arg Ser Gly Gly Ile Gly
5160 5165 5170
act ctc cag ttc gtg gag gag ttc act ccc tcg gtc tac ttc aac 27080
Thr Leu Gln Phe Val Glu Glu Phe Thr Pro Ser Val Tyr Phe Asn
5175 5180 5185
ccc ttc tcc ggc tcc ccc ggg cac tac ccg gac gag ttc atc ccg 27125
Pro Phe Ser Gly Ser Pro Gly His Tyr Pro Asp Glu Phe Ile Pro
5190 5195 5200
aac ttt gac gcc atc agc gag tcg gtg gac ggc tac gat tga atg 27170
Asn Phe Asp Ala Ile Ser Glu Ser Val Asp Gly Tyr Asp Met
5205 5210
tcc cat ggt ggc gcg gct gac ata gct cgg ctt cga cac ctg gac 27215
Ser His Gly Gly Ala Ala Asp Ile Ala Arg Leu Arg His Leu Asp
5215 5220 5225
cac tgc cgc cgc ttt cgc tgc ttc gct cgg gac ctc gcc gag ttc 27260
His Cys Arg Arg Phe Arg Cys Phe Ala Arg Asp Leu Ala Glu Phe
5230 5235 5240
acc tac ttt gag ctg ccc gag gag cat cct cag ggc ccg gcc cac 27305
Thr Tyr Phe Glu Leu Pro Glu Glu His Pro Gln Gly Pro Ala His
5245 5250 5255
gga gtg cgg atc gtc gtc gaa ggg ggc cta gac tcc cac ctg ctt 27350
Gly Val Arg Ile Val Val Glu Gly Gly Leu Asp Ser His Leu Leu
5260 5265 5270
cgg atc ttc agc cag cgc ccg atc ctg gtc gag cgc caa cag ggc 27395
Arg Ile Phe Ser Gln Arg Pro Ile Leu Val Glu Arg Gln Gln Gly
5275 5280 5285
aac acc ctc ctg acc ctc tac tgc atc tgc gac cac ccc ggc ctg 27440
Asn Thr Leu Leu Thr Leu Tyr Cys Ile Cys Asp His Pro Gly Leu
5290 5295 5300
cat gaa agt ctt tgt tgt ctg ctg tgt act gag tat aat aaa agc 27485
His Glu Ser Leu Cys Cys Leu Leu Cys Thr Glu Tyr Asn Lys Ser
5305 5310 5315
tgagatcagc gactactccg gactcaactg tggtgtttct gcatccatca accagtctct 27545
aaccttcacc gggaacgaga ccgagctcca gctccagtgt aagccccaca agaagtacct 27605
cacctggctg taccagggct ccccgatcgc cgttgttaac cactgcgacg acgacggagt 27665
cctgctgaac ggccccgcca accttacttt ttccacccgc agaagcaagc tactgctctt 27725
ccgacccttc ctccccggca cctatcagtg cgtctcggga ccctgccatc acaccttcca 27785
cctgatcccg aataccacct cttccccagc gccgctcccc actaacaacc aaactaacca 27845
ccaccgctac cgacgcgacc tcgttgaatc taataccacc cacaccggag gtgagctcca 27905
aggtcgcaaa ccctctggga tttattacgg cccctgggag gtggtggggt taatagcttt 27965
aggcttagta gcgggtgggc ttttggctct ctgctaccta tacatccctt gctgttctta 28025
cttagtggtg ctgtgttgct ggtttaagaa atg ggg cag atc acc cta gtg 28076
Met Gly Gln Ile Thr Leu Val
5320 5325
agc tgc ggt gtg ctg gtg gcg gtg gtg ctt tcg att gtg gga ctg 28121
Ser Cys Gly Val Leu Val Ala Val Val Leu Ser Ile Val Gly Leu
5330 5335 5340
ggc ggc gcg gct gta gtg aag gag aag gcc gat ccc tgc ttg cat 28166
Gly Gly Ala Ala Val Val Lys Glu Lys Ala Asp Pro Cys Leu His
5345 5350 5355
ttc aat ccc gac aaa tgc cag ctg agt ttt cag ccc gat ggc aat 28211
Phe Asn Pro Asp Lys Cys Gln Leu Ser Phe Gln Pro Asp Gly Asn
5360 5365 5370
cgg tgc gcg gtg ctg atc aag tgc gga tgg gaa tgc gag aac gtg 28256
Arg Cys Ala Val Leu Ile Lys Cys Gly Trp Glu Cys Glu Asn Val
5375 5380 5385
aga atc gag tac aat aac aag act cgg aac aat act ctc gcg tcc 28301
Arg Ile Glu Tyr Asn Asn Lys Thr Arg Asn Asn Thr Leu Ala Ser
5390 5395 5400
gtg tgg cag ccc ggg gac ccc gag tgg tac acc gtc tct gtc ccc 28346
Val Trp Gln Pro Gly Asp Pro Glu Trp Tyr Thr Val Ser Val Pro
5405 5410 5415
ggt gct gac ggc tcc ccg cgc acc gtg aat aat act ttc att ttt 28391
Gly Ala Asp Gly Ser Pro Arg Thr Val Asn Asn Thr Phe Ile Phe
5420 5425 5430
gcg cac atg tgc gac acg gtc atg tgg atg agc aag cag tac gat 28436
Ala His Met Cys Asp Thr Val Met Trp Met Ser Lys Gln Tyr Asp
5435 5440 5445
atg tgg ccc ccc acg aag gag aac atc gtg gtc ttc tcc atc gct 28481
Met Trp Pro Pro Thr Lys Glu Asn Ile Val Val Phe Ser Ile Ala
5450 5455 5460
tac agc gtg tgc acg gcg cta atc acc gct atc gtg tgc ctg agc 28526
Tyr Ser Val Cys Thr Ala Leu Ile Thr Ala Ile Val Cys Leu Ser
5465 5470 5475
att cac atg ctc atc gct att cgc ccc aga aat aat gcc gaa aaa 28571
Ile His Met Leu Ile Ala Ile Arg Pro Arg Asn Asn Ala Glu Lys
5480 5485 5490
gaa aaa cag cca taacacgttt tttcacacac ctttttcaga cc atg gcc tct 28624
Glu Lys Gln Pro Met Ala Ser
5495
gtt aaa ttt ttg ctt tta ttt gcc agt ctc att acc gtc att cat 28669
Val Lys Phe Leu Leu Leu Phe Ala Ser Leu Ile Thr Val Ile His
5500 5505 5510
gga atg agt aat gag aaa att act att tac act ggc act aat cac 28714
Gly Met Ser Asn Glu Lys Ile Thr Ile Tyr Thr Gly Thr Asn His
5515 5520 5525
aca ttg aaa ggt cca gaa aaa gcc aca gaa gtt tca tgg tat tgt 28759
Thr Leu Lys Gly Pro Glu Lys Ala Thr Glu Val Ser Trp Tyr Cys
5530 5535 5540
tat ttt aat gaa tca gat gta tct act gaa ctc tgt gga aac aag 28804
Tyr Phe Asn Glu Ser Asp Val Ser Thr Glu Leu Cys Gly Asn Lys
5545 5550 5555
aac aaa aaa aat gag agc att act ctc atc aag ttt caa tgt gga 28849
Asn Lys Lys Asn Glu Ser Ile Thr Leu Ile Lys Phe Gln Cys Gly
5560 5565 5570
tct gac tta acc cta att aac atc act aga gac tat gta ggt atg 28894
Ser Asp Leu Thr Leu Ile Asn Ile Thr Arg Asp Tyr Val Gly Met
5575 5580 5585
tat tat gga act aca gca ggc att tca gac atg gaa ttt tat caa 28939
Tyr Tyr Gly Thr Thr Ala Gly Ile Ser Asp Met Glu Phe Tyr Gln
5590 5595 5600
gtt tct gtg tct gaa ccc acc acg cct aga atg acc aca acc aca 28984
Val Ser Val Ser Glu Pro Thr Thr Pro Arg Met Thr Thr Thr Thr
5605 5610 5615
aaa act aca cct gtt acc act atg cag ctc act acc aat ggc ttt 29029
Lys Thr Thr Pro Val Thr Thr Met Gln Leu Thr Thr Asn Gly Phe
5620 5625 5630
ctt gcc atg ctt caa gtg gct gaa aat agc acc agc att caa ccc 29074
Leu Ala Met Leu Gln Val Ala Glu Asn Ser Thr Ser Ile Gln Pro
5635 5640 5645
acc cca ccc agt gag gaa att ccc aga tcc atg att ggc att att 29119
Thr Pro Pro Ser Glu Glu Ile Pro Arg Ser Met Ile Gly Ile Ile
5650 5655 5660
gtt gct gta gtg gtg tgc atg ttg atc atc gcc ttg tgc atg gtg 29164
Val Ala Val Val Val Cys Met Leu Ile Ile Ala Leu Cys Met Val
5665 5670 5675
tac tat gcc ttc tgc tac aga aag cac aga ctg aac gac aag ctg 29209
Tyr Tyr Ala Phe Cys Tyr Arg Lys His Arg Leu Asn Asp Lys Leu
5680 5685 5690
gaa cac tta cta agt gtt gaa ttt taatttttta gaacc atg aag atc 29257
Glu His Leu Leu Ser Val Glu Phe Met Lys Ile
5695 5700
cta ggc ctt tta att ttt tct atc att acc tct gct cta tgc aat 29302
Leu Gly Leu Leu Ile Phe Ser Ile Ile Thr Ser Ala Leu Cys Asn
5705 5710 5715
tct gac aat gag gac gtt act gtc gtt gtc gga acc aat tat aca 29347
Ser Asp Asn Glu Asp Val Thr Val Val Val Gly Thr Asn Tyr Thr
5720 5725 5730
ctg aaa ggt cca gcg aag ggt atg ctt tcg tgg tat tgc tat ttt 29392
Leu Lys Gly Pro Ala Lys Gly Met Leu Ser Trp Tyr Cys Tyr Phe
5735 5740 5745
gga tct gac act aca gaa act gaa tta tgc aat ctt aag aat ggc 29437
Gly Ser Asp Thr Thr Glu Thr Glu Leu Cys Asn Leu Lys Asn Gly
5750 5755 5760
aaa att caa aat tct aaa att aac aat tat ata tgc aat ggt act 29482
Lys Ile Gln Asn Ser Lys Ile Asn Asn Tyr Ile Cys Asn Gly Thr
5765 5770 5775
gat ctg ata ctc ctc aat atc acg aaa tca tat gct ggc agt tac 29527
Asp Leu Ile Leu Leu Asn Ile Thr Lys Ser Tyr Ala Gly Ser Tyr
5780 5785 5790
acc tgc cct gga gat gat gct gac agt atg att ttt tac aaa gta 29572
Thr Cys Pro Gly Asp Asp Ala Asp Ser Met Ile Phe Tyr Lys Val
5795 5800 5805
act gtt gtt gat cct act act cca cct ccg ccc acc acc aca act 29617
Thr Val Val Asp Pro Thr Thr Pro Pro Pro Pro Thr Thr Thr Thr
5810 5815 5820
act cac acc aca cac ata gaa caa acc aca gca gag gca gca gga 29662
Thr His Thr Thr His Ile Glu Gln Thr Thr Ala Glu Ala Ala Gly
5825 5830 5835
gag tta gcc ttg cag gtt cag gaa gat tcc ctt atg gct aat acc 29707
Glu Leu Ala Leu Gln Val Gln Glu Asp Ser Leu Met Ala Asn Thr
5840 5845 5850
cct aca ccc gat cat cgg tgt ccg ggg ctg ctc gtc agc ggc att 29752
Pro Thr Pro Asp His Arg Cys Pro Gly Leu Leu Val Ser Gly Ile
5855 5860 5865
gtc ggt gtg ctt tcg gga tta gca gtc ata atc atc tgc atg ttc 29797
Val Gly Val Leu Ser Gly Leu Ala Val Ile Ile Ile Cys Met Phe
5870 5875 5880
att ttt gct tgc tgc tat aga agg ctt tac cga caa aaa tca gac 29842
Ile Phe Ala Cys Cys Tyr Arg Arg Leu Tyr Arg Gln Lys Ser Asp
5885 5890 5895
cca ctg ctg aac ctc tat gtt taattttttc cagagcc atg aag gca gtt 29892
Pro Leu Leu Asn Leu Tyr Val Met Lys Ala Val
5900 5905 5910
agc gct cta gtt ttt tgt tct ttg att ggc att gtt ttt agt gct 29937
Ser Ala Leu Val Phe Cys Ser Leu Ile Gly Ile Val Phe Ser Ala
5915 5920 5925
ggg ttt ttg aaa aat ctt acc att tat gaa ggt gag aat gcc act 29982
Gly Phe Leu Lys Asn Leu Thr Ile Tyr Glu Gly Glu Asn Ala Thr
5930 5935 5940
cta gtg ggc atc agt ggt caa aat gtc agc tgg cta aaa tac cat 30027
Leu Val Gly Ile Ser Gly Gln Asn Val Ser Trp Leu Lys Tyr His
5945 5950 5955
cta gat ggg tgg aaa gac att tgc gat tgg aat gtc act gtg tat 30072
Leu Asp Gly Trp Lys Asp Ile Cys Asp Trp Asn Val Thr Val Tyr
5960 5965 5970
aca tgt aat gga gtt aac ctc acc att act aat gcc acc caa gat 30117
Thr Cys Asn Gly Val Asn Leu Thr Ile Thr Asn Ala Thr Gln Asp
5975 5980 5985
cag aat ggt agg ttt aag ggc cag agt ttc act aga aat aat ggg 30162
Gln Asn Gly Arg Phe Lys Gly Gln Ser Phe Thr Arg Asn Asn Gly
5990 5995 6000
tat gaa tcc cat aac atg ttt atc tat gac gtc act gtc atc aga 30207
Tyr Glu Ser His Asn Met Phe Ile Tyr Asp Val Thr Val Ile Arg
6005 6010 6015
aac gag act gcc acc aca cag atg ccc act aca cac agt tct acc 30252
Asn Glu Thr Ala Thr Thr Gln Met Pro Thr Thr His Ser Ser Thr
6020 6025 6030
act act agc atg caa acc aca cag aca acc act ttt tat aca tca 30297
Thr Thr Ser Met Gln Thr Thr Gln Thr Thr Thr Phe Tyr Thr Ser
6035 6040 6045
act cag cat atc acc act aca gca gca aag cca agt agc gca gcg 30342
Thr Gln His Ile Thr Thr Thr Ala Ala Lys Pro Ser Ser Ala Ala
6050 6055 6060
cct cag cca cag gct ttg gct ttg aaa gct gca caa cct agt aca 30387
Pro Gln Pro Gln Ala Leu Ala Leu Lys Ala Ala Gln Pro Ser Thr
6065 6070 6075
act act agg acc aat gag cag act act gat ttt ttg tcc act gtc 30432
Thr Thr Arg Thr Asn Glu Gln Thr Thr Asp Phe Leu Ser Thr Val
6080 6085 6090
gag agc cac acc aca gct acc tcg agt gcc ttc tct agc acc gcc 30477
Glu Ser His Thr Thr Ala Thr Ser Ser Ala Phe Ser Ser Thr Ala
6095 6100 6105
aat ctc tcc tcg ctt tcc tct aca cca atc agt ccc gct act act 30522
Asn Leu Ser Ser Leu Ser Ser Thr Pro Ile Ser Pro Ala Thr Thr
6110 6115 6120
cct acc ccc gct ctt ctc ccc act ccc ctg aag caa act gag aac 30567
Pro Thr Pro Ala Leu Leu Pro Thr Pro Leu Lys Gln Thr Glu Asn
6125 6130 6135
agc ggc atg caa tgg cag atc acc ctg ctc att gtg atc ggg ttg 30612
Ser Gly Met Gln Trp Gln Ile Thr Leu Leu Ile Val Ile Gly Leu
6140 6145 6150
gtc atc ctg gcc gtg ttg ctc tac tac atc ttc tgc cgc cgc att 30657
Val Ile Leu Ala Val Leu Leu Tyr Tyr Ile Phe Cys Arg Arg Ile
6155 6160 6165
ccc aac gcg cac cgc aag ccg gtc tac aag ccc atc gtt atc ggg 30702
Pro Asn Ala His Arg Lys Pro Val Tyr Lys Pro Ile Val Ile Gly
6170 6175 6180
cag ccg gag ccg ctt cag gtg gaa ggg ggt cta agg aat ctt ctc 30747
Gln Pro Glu Pro Leu Gln Val Glu Gly Gly Leu Arg Asn Leu Leu
6185 6190 6195
ttc tct ttt aca gta tgg tgattgaact atg att cct aga caa ttc ttg 30796
Phe Ser Phe Thr Val Trp Met Ile Pro Arg Gln Phe Leu
6200 6205
atc act att ctt atc tgc ctc ctc caa gtc tgt gcc acc ctc gct 30841
Ile Thr Ile Leu Ile Cys Leu Leu Gln Val Cys Ala Thr Leu Ala
6210 6215 6220
cta gtg gcc aac gcc agt cca gac tgt att ggg ccc ttc gcc tcc 30886
Leu Val Ala Asn Ala Ser Pro Asp Cys Ile Gly Pro Phe Ala Ser
6225 6230 6235
tac gtg ctc ttt gcc ttc atc acc tgc atc tgc tgc tgt agc ata 30931
Tyr Val Leu Phe Ala Phe Ile Thr Cys Ile Cys Cys Cys Ser Ile
6240 6245 6250
gtc tgc ctg ctt atc acc ttc ttc cag ttc att gac tgg atc ttt 30976
Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Ile Asp Trp Ile Phe
6255 6260 6265
gtg cgc atc gcc tac ctg cgc cac cac ccc cag tac cgc gac cag 31021
Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg Asp Gln
6270 6275 6280
caa gtg gcg cga ctg ctc agg ctc ctc tgataagc atg cgg gct ctg 31068
Gln Val Ala Arg Leu Leu Arg Leu Leu Met Arg Ala Leu
6285 6290 6295
cta ctt ctc gcg ctt ctg ctg tta gtg ctc ccc cgt ccc gtc gac 31113
Leu Leu Leu Ala Leu Leu Leu Leu Val Leu Pro Arg Pro Val Asp
6300 6305 6310
ccc cgg ccc ccc act cag tcc ccc gag gag gtc cgc aaa tgc aaa 31158
Pro Arg Pro Pro Thr Gln Ser Pro Glu Glu Val Arg Lys Cys Lys
6315 6320 6325
ttc caa gaa ccc tgg aaa ttc ctc aaa tgc tac cgc caa aaa tca 31203
Phe Gln Glu Pro Trp Lys Phe Leu Lys Cys Tyr Arg Gln Lys Ser
6330 6335 6340
gac atg cat ccc agc tgg atc atg atc att ggg atc gtg aac att 31248
Asp Met His Pro Ser Trp Ile Met Ile Ile Gly Ile Val Asn Ile
6345 6350 6355
ctg gcc tgc acc ctc atc tcc ttt gtg att tac ccc tgc ttt gac 31293
Leu Ala Cys Thr Leu Ile Ser Phe Val Ile Tyr Pro Cys Phe Asp
6360 6365 6370
ttt ggt tgg aac tcg cca gag gcg ctc tat ctc ccg cct gaa cct 31338
Phe Gly Trp Asn Ser Pro Glu Ala Leu Tyr Leu Pro Pro Glu Pro
6375 6380 6385
gac aca cca cca cag caa cct cag gca cat gca cta cca cca cag 31383
Asp Thr Pro Pro Gln Gln Pro Gln Ala His Ala Leu Pro Pro Gln
6390 6395 6400
cct agg cca caa tac atg ccc ata tta gac tat gag gcc gag cca 31428
Pro Arg Pro Gln Tyr Met Pro Ile Leu Asp Tyr Glu Ala Glu Pro
6405 6410 6415
cag cga ccc atg ctc ccc gct att agt tac ttc aat cta acc ggc 31473
Gln Arg Pro Met Leu Pro Ala Ile Ser Tyr Phe Asn Leu Thr Gly
6420 6425 6430
gga gat gac tgacccactg gccaacaaca acgtcaacga ccttctcctg 31522
Gly Asp Asp
gacatggacg gccgcgcctc ggagcagcga ctcgcccaac tccgcatccg ccagcagcag 31582
gagagagccg tcaaggagct gcaggatgcg gtggccatcc accagtgcaa gaaaggcatc 31642
ttctgcctgg tgaagcaggc caagatcacc ttcgaggtga cttccaccga ccatcgcctc 31702
tcctacgagc tcctgcagca acgccagaag ttcacctgcc tggtcggagt caaccccatc 31762
gtcatcaccc agcagtctgg cgataccaag ggttgcatcc actgctcctg cgactccccc 31822
gagtgcgttc acaccatgat caagaccctc tgcggcctcc gcgacctcct ccccatgaac 31882
taatcaacta acccctaccc catttaccca gatacaataa agattaaaga gatgatgatt 31942
tgaattgatc aataaagaat cacttacttg aaatctgaaa ccaggtctct gtcc atg 31999
Met
6435
ttt tct gtc agc agc act tca ctc ccc tct tcc cag ctc tgg tac 32044
Phe Ser Val Ser Ser Thr Ser Leu Pro Ser Ser Gln Leu Trp Tyr
6440 6445 6450
tgc aga ccc cgg cgg gct gca aac ttc ctc cac act ctg aag ggg 32089
Cys Arg Pro Arg Arg Ala Ala Asn Phe Leu His Thr Leu Lys Gly
6455 6460 6465
atg tca aat tcc tcc tgt ccc tca atc ttc att ttt atc ttc tat 32134
Met Ser Asn Ser Ser Cys Pro Ser Ile Phe Ile Phe Ile Phe Tyr
6470 6475 6480
cag atg tcc aaa aag cgc gcg cgg gtg gat gat ggc ttc gac ccc 32179
Gln Met Ser Lys Lys Arg Ala Arg Val Asp Asp Gly Phe Asp Pro
6485 6490 6495
gtg tac ccc tac gat gca gac aac gca ccg act gtg ccc ttc atc 32224
Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile
6500 6505 6510
aac cct ccc ttc gtc tct tca gat gga ttc caa gaa aag ccc ctg 32269
Asn Pro Pro Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu
6515 6520 6525
ggg gtg ttg tcc ctg cgc ctg gcc gac ccc gtc acc acc aag aac 32314
Gly Val Leu Ser Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn
6530 6535 6540
ggg gct gtc acc ctc aag ctg ggg gag ggg gtg gac ctc gac gag 32359
Gly Ala Val Thr Leu Lys Leu Gly Glu Gly Val Asp Leu Asp Glu
6545 6550 6555
tcg gga aaa ctc atc tcc aaa aat gcc acc aag gcc act gcc cct 32404
Ser Gly Lys Leu Ile Ser Lys Asn Ala Thr Lys Ala Thr Ala Pro
6560 6565 6570
ctc agt att tcc aac aac acc att tcc ctt aac ctg gat acc cct 32449
Leu Ser Ile Ser Asn Asn Thr Ile Ser Leu Asn Leu Asp Thr Pro
6575 6580 6585
ttt tat acc aaa gat gga aaa tta acc atg cag gta act gca cca 32494
Phe Tyr Thr Lys Asp Gly Lys Leu Thr Met Gln Val Thr Ala Pro
6590 6595 6600
tta aag tta gca aac aca gcc ata cta aac act cta gct atg gcc 32539
Leu Lys Leu Ala Asn Thr Ala Ile Leu Asn Thr Leu Ala Met Ala
6605 6610 6615
tat gga aat ggt tta ggt cta agc aac aat gct ctc act gtt cag 32584
Tyr Gly Asn Gly Leu Gly Leu Ser Asn Asn Ala Leu Thr Val Gln
6620 6625 6630
gta aca tct cca ctc acc ttt gat aat agc aaa gtc aag att aac 32629
Val Thr Ser Pro Leu Thr Phe Asp Asn Ser Lys Val Lys Ile Asn
6635 6640 6645
cta ggt aat gga cca cta atg gta tct gcc aac aag ctt tca atc 32674
Leu Gly Asn Gly Pro Leu Met Val Ser Ala Asn Lys Leu Ser Ile
6650 6655 6660
aac tgc tta cgg ggc ctg tat gtt gcc cct aat aat acc gga cta 32719
Asn Cys Leu Arg Gly Leu Tyr Val Ala Pro Asn Asn Thr Gly Leu
6665 6670 6675
gaa acc aac ata agc tgg gca aac gca atg cgc ttt gag ggt aac 32764
Glu Thr Asn Ile Ser Trp Ala Asn Ala Met Arg Phe Glu Gly Asn
6680 6685 6690
gca atg gct gtt aat ata gac aca aat aaa ggc cta caa ttt ggc 32809
Ala Met Ala Val Asn Ile Asp Thr Asn Lys Gly Leu Gln Phe Gly
6695 6700 6705
act act agt aca gaa aca ggt gtc acc aat gct tac ccc ata caa 32854
Thr Thr Ser Thr Glu Thr Gly Val Thr Asn Ala Tyr Pro Ile Gln
6710 6715 6720
gtc aaa ctt ggc gca ggc ctt gca ttt gat agc aca ggg gct att 32899
Val Lys Leu Gly Ala Gly Leu Ala Phe Asp Ser Thr Gly Ala Ile
6725 6730 6735
gtt gct tgg aac aaa gaa aat gac agc ctc act ttg tgg act aca 32944
Val Ala Trp Asn Lys Glu Asn Asp Ser Leu Thr Leu Trp Thr Thr
6740 6745 6750
cca gac ccc tct cca aat tgt aaa ata gct tct gaa aaa gat gca 32989
Pro Asp Pro Ser Pro Asn Cys Lys Ile Ala Ser Glu Lys Asp Ala
6755 6760 6765
aaa ctc aca ctt tgc ttg aca aag tgt ggt agt caa atc cta ggc 33034
Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln Ile Leu Gly
6770 6775 6780
act gtc tcc cta tta gca gtc agt ggc agc ttg gct cct atc aca 33079
Thr Val Ser Leu Leu Ala Val Ser Gly Ser Leu Ala Pro Ile Thr
6785 6790 6795
ggg gct gtt agt act gca ctt gta tca ctc aaa ttc aat gcc aat 33124
Gly Ala Val Ser Thr Ala Leu Val Ser Leu Lys Phe Asn Ala Asn
6800 6805 6810
gga gcc ctt ttg gac aaa tca act ctg aac aaa gaa tac tgg aac 33169
Gly Ala Leu Leu Asp Lys Ser Thr Leu Asn Lys Glu Tyr Trp Asn
6815 6820 6825
tac aga caa gga gat cta att cca ggt aca cca tat aca cat gct 33214
Tyr Arg Gln Gly Asp Leu Ile Pro Gly Thr Pro Tyr Thr His Ala
6830 6835 6840
gtg ggt ttc atg cct aac aaa aaa gcc tac cct aaa aac aca act 33259
Val Gly Phe Met Pro Asn Lys Lys Ala Tyr Pro Lys Asn Thr Thr
6845 6850 6855
gca gct tcc aag agc cac att gtg ggt gat gtt tat tta gat gga 33304
Ala Ala Ser Lys Ser His Ile Val Gly Asp Val Tyr Leu Asp Gly
6860 6865 6870
gat gca gat aag cct tta tct ctt atc atc act ttc aat gaa act 33349
Asp Ala Asp Lys Pro Leu Ser Leu Ile Ile Thr Phe Asn Glu Thr
6875 6880 6885
gat gat gaa acc tgt gat tac tgc atc aac ttt caa tgg aaa tgg 33394
Asp Asp Glu Thr Cys Asp Tyr Cys Ile Asn Phe Gln Trp Lys Trp
6890 6895 6900
agt gct gaa caa tat aaa gaa aag aca ctc gca acc agt tca ttc 33439
Ser Ala Glu Gln Tyr Lys Glu Lys Thr Leu Ala Thr Ser Ser Phe
6905 6910 6915
acc ttc tca tac atc gcc caa gaa taaacccgcc ctgcatgcca ccccttgtcc 33493
Thr Phe Ser Tyr Ile Ala Gln Glu
6920
cactgctcta caatggaaaa ctctgaagca gaaaaataaa gttcaagtgt tttattgatt 33553
caacagtttt tcacagaatt cgagtagtta ttttccctcc tccctcccaa ctcatggaat 33613
acaccaccct ctccccacgc acagccttaa acatctgaat gccattggta atggacatgg 33673
ttttggtttc cacattccac acagtttcag agcgagccag tctcgggtcg gtcaaggaga 33733
tgaaaccctc cgggcactcc tgcatctgca cctcaaagtt caacagctga gggctgtcct 33793
cggtggtcgg gatcacggtt atctggaaga agagcgatga gagtcataat ccgcgaatgg 33853
gatcgggcgg ttgtgttgca tcaggccccg cagcagtcgc tgtctgcgcc gctccgtcaa 33913
gctgctgctc aaggggaccg ggtccaggga ctccctacgc atgatgccga tggccctgag 33973
catcagtcgc ctggtgcggc gggcgcagca gcggatgcgg atctcgctca ggtcggagca 34033
gtacgtgcag cacagcacca ccaagttgtt caacagtcca tagttcaacg tgctccagcc 34093
aaaactcatc tgtggaacta tgctgcccac atgtccatcg taccagatcc tgatgtaaat 34153
caggtggcgc cccctccaga acacactgcc catgtacatg atctccttgg gcatatgcag 34213
gttcaccacc tcccggtacc acatcacccg ctggttgaac atgcagccct ggataattct 34273
gcggaaccag atggccagca ccgccccgcc cgccatgcag cgcagggacc ccgggtcctg 34333
acagtggcag tggaggaccc accgctcgcg gccgtggatc aactgggagc tgaacaggtc 34393
tatgttggca cagcacaggc acacgctcat gcatgtcttc agcactctca gttcctcggg 34453
ggtcaggacc atgtcccagg gcacggggaa ctcttgcagg acagtgaacc cggcagaaca 34513
gggcagccct cgcacacaac ttacattgtg catggacagg gtatcgcaat caggcagcac 34573
cgggtgatcc tccaccagag aagcgcgggt ctcggtctcc tcacagcgtg gtaagggggc 34633
cggccgatac gggtgatggc gggacgcggc tgatcgtgtt cgcgaccgtg tcatgatgca 34693
gttgctttcg gacattttcg tacttgctgt agcagaacct ggtccgggcg ctgcacaccg 34753
atcgccggcg gcggtcccgg cgcttggaac gctcggtgtt gaagttgtaa aacagccact 34813
ctctcagacc gtgcagcaga tctagggcct caggagtgat gaagatccca tcatgcctga 34873
tggctctaat cacatcgacc accgtggaat gggccagacc cagccagatg atgcaatttt 34933
gttgggtttc ggtgacggcg ggggagggaa gaacaggaag aaccatgatt aacttttaat 34993
ccaaacggtc tcggagcact tcaaaatgaa gatcgcggag atggcacctc tcgcccccgc 35053
tgtgttggtg gaaaataaca gccaggtcaa aggtgatacg gttctcgaga tgttccacgg 35113
tggcttccag caaagcctcc acgcgcacat ccagaaacaa gacaatagcg aaagcgggag 35173
ggttctctaa ttcctcaatc atcatgttac actcctgcac catccccaga taattttcat 35233
ttttccagcc ttgaatgatt cgaactagtt cctgaggtaa atccaagcca gccatgataa 35293
agagctcgcg cagagcgccc tccaccggca ttcttaagca caccctcata attccaagat 35353
attctgctcc tggttcacct gcagcagatt gacaagcgga atatcaaaat ctctgccgcg 35413
atccctaagc tcctccctca gcaataactg taagtactct ttcatatcct ctccgaaatt 35473
tttagccata ggaccgccag gaatgagatt aggacaagcc acattacaga taaaccgaag 35533
tcccccccag tgagcattgc caaatgtaag attgaaataa gcatgctggc tagacccggt 35593
gatatcttcc agataactgg acagaaaatc gcccaggcaa tttttaagaa aatcaacaaa 35653
agaaaaatct tccaggtgca cgtttagggc ctcgggaaca acgatggagt aagtgcaagg 35713
ggtgcgttcc agcatggtta gttagctgat ctgtaaaaaa acaaaaaata aaacattaaa 35773
ccatgctagc ctggcgaaca ggtgggtaaa tcgttctctc cagcaccagg caggccacgg 35833
ggtctccggc gcgaccctcg taaaaattgt cgctatgatt gaaaaccatc acagagagac 35893
gttcccggtg gccggcgtga atgattcgac aagatgaata cacccccgga acattggcgt 35953
ccgcgagtga aaaaaagcgg ccgaggaagc aataaggcac tacaatgctc agtctcaagt 36013
ccagcaaagc gatgccatgc ggatgaagca caaaattctc aggtgcgtac aaaatgtaat 36073
tactcccctc ctgcacaggc agcaaagccc cagatccctc cagatacaca tacaaagcct 36133
cagcgtccat agcttaccga gcagcagcac acaacaggcg caagagtcag agaaaggctg 36193
agctctaacc tgtcccccgc tctctgctca atatatagcc cagatctaca ctgacgtaaa 36253
ggccaaagtc taaaaatacc cgccaaataa tcacacacgc ccagcacacg cccagaaacc 36313
ggtgacacac tcaaaaaaat acgcgcactt cctcaaacgc ccaaactgcc gtcatttccg 36373
ggttcccacg ctacgtcatc agaattcgac tttcaaatcc gtcgaccgtt aaacacgtca 36433
ctcgccccgc ccctaacggt cgcccttctc tccgccaatc accgccccgc atccccaaat 36493
tcaaacgcct catttgcata ttaacgcgca ccaaaagttt gaggtatatt attgatgatg 36553
<210> 2
<211> 504
<212> PRT
<213> Simian adenovirus 39
<400> 2
Met Glu Ser Arg Asn Pro Phe Gln Gln Gly Leu Pro Ser Gly Leu Leu
1 5 10 15
Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn
20 25 30
Leu Arg Leu Leu Ala Ser Thr Ala Gly Arg His Ala Glu Asp Pro Glu
35 40 45
Ser Pro Val Thr Pro Gly Thr Pro Thr Pro Pro Ala Ala Ala Ala Gly
50 55 60
Ala Ala Ala Arg Gly Gly Gly Gly Pro Arg Arg Glu Pro Glu Ser Arg
65 70 75 80
Ser Gly Pro Ser Gly Gly Gly Gly Gly Gly Val Ala Asp Leu Phe Pro
85 90 95
Glu Leu Arg Arg Val Leu Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly
100 105 110
Ile Lys Arg Glu Arg His Glu Glu Thr Ser His Arg Thr Glu Leu Thr
115 120 125
Val Ser Leu Met Ser Arg Arg Arg Pro Glu Ser Val Trp Trp His Glu
130 135 140
Val Gln Ser Gln Gly Ile Asp Glu Val Ser Val Met His Glu Lys Tyr
145 150 155 160
Ser Leu Glu Gln Val Lys Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp
165 170 175
Glu Val Ala Ile Arg Asn Tyr Ala Lys Leu Ala Leu Lys Pro Asp Lys
180 185 190
Lys Tyr Lys Ile Thr Lys Leu Ile Asn Ile Arg Asn Ser Cys Tyr Ile
195 200 205
Ser Gly Asn Gly Ala Glu Val Glu Ile Ser Thr Gln Glu Arg Ala Ala
210 215 220
Phe Arg Cys Cys Met Met Asn Met Tyr Pro Gly Val Val Gly Met Glu
225 230 235 240
Gly Val Thr Phe Met Asn Thr Arg Phe Arg Gly Asp Gly Tyr Asn Gly
245 250 255
Val Val Phe Met Ala Asn Thr Lys Leu Thr Val His Gly Cys Ser Phe
260 265 270
Phe Gly Phe Asn Asn Met Cys Ile Glu Ala Trp Gly Ser Val Ser Val
275 280 285
Arg Gly Cys Ser Phe Ser Ala Asn Trp Met Gly Ile Val Gly Arg Thr
290 295 300
Lys Ser Val Leu Ser Val Lys Lys Cys Leu Phe Glu Arg Cys His Leu
305 310 315 320
Gly Val Met Ser Glu Gly Glu Ala Arg Ile Arg His Cys Ala Ser Thr
325 330 335
Glu Thr Gly Cys Phe Val Leu Cys Lys Gly Asn Ala Lys Ile Lys His
340 345 350
Asn Met Ile Cys Gly Ala Ser Asp Glu Arg Gly Tyr Gln Met Leu Thr
355 360 365
Cys Ala Gly Gly Asn Ser His Met Leu Ala Thr Val His Val Ala Ser
370 375 380
His Ala Arg Lys Pro Trp Pro Glu Phe Glu His Asn Val Met Thr Arg
385 390 395 400
Cys Asn Met His Leu Gly Ser Arg Arg Gly Met Phe Met Pro Tyr Gln
405 410 415
Cys Asn Leu Asn Tyr Val Lys Val Leu Leu Glu Pro Asp Ala Met Ser
420 425 430
Arg Val Ser Leu Thr Gly Val Phe Asp Met Asn Val Asp Val Trp Lys
435 440 445
Ile Leu Arg Tyr Asp Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys
450 455 460
Gly Gly Lys His Ala Arg Phe Gln Pro Val Cys Val Asp Val Thr Glu
465 470 475 480
Asp Leu Arg Pro Asp His Leu Val Leu Ser Cys Thr Gly Thr Glu Phe
485 490 495
Gly Ser Ser Gly Glu Glu Ser Asp
500
<210> 3
<211> 142
<212> PRT
<213> Simian adenovirus 39
<400> 3
Met Ser Gly Ser Gly Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu
1 5 10 15
Thr Gly Arg Leu Pro Ser Trp Ala Gly Val Arg Gln Asn Val Met Gly
20 25 30
Ser Thr Val Asp Gly Arg Pro Val Gln Pro Ala Asn Ser Ser Thr Leu
35 40 45
Thr Tyr Ala Thr Leu Ser Ser Ser Ser Val Asp Ala Ala Ala Ala Ala
50 55 60
Ala Ala Ala Ser Ala Ala Ser Ala Val Arg Gly Met Ala Met Gly Ala
65 70 75 80
Gly Tyr Tyr Gly Thr Leu Val Ala Asn Ser Ser Ser Thr Asn Asn Pro
85 90 95
Ala Ser Leu Asn Glu Glu Lys Leu Leu Leu Leu Met Ala Gln Leu Glu
100 105 110
Ala Leu Thr Gln Arg Leu Gly Glu Leu Thr Gln Gln Val Ala Gln Leu
115 120 125
Gln Glu Gln Thr Arg Ala Ala Val Ala Thr Val Lys Ser Lys
130 135 140
<210> 4
<211> 393
<212> PRT
<213> Simian adenovirus 39
<400> 4
Met His Pro Val Leu Arg Gln Met Arg Pro His Pro Pro Pro Gln Pro
1 5 10 15
Pro Leu Pro Pro Gln Gln Gln Gln Gln Pro Ala Leu Leu Pro Pro Pro
20 25 30
Gln Gln Gln Gln Gln Pro Ala Thr Thr Ala Ala Ala Ala Val Ser Gly
35 40 45
Ala Gly Val Gln Tyr Asp Leu Ala Leu Glu Glu Gly Glu Gly Leu Ala
50 55 60
Arg Leu Gly Ala Ser Ser Pro Glu Arg His Pro Arg Val Gln Met Lys
65 70 75 80
Arg Asp Ala Arg Glu Ala Tyr Val Pro Lys Gln Asn Leu Phe Arg Asp
85 90 95
Arg Ser Gly Glu Glu Pro Glu Glu Met Arg Ala Ala Arg Phe His Ala
100 105 110
Gly Arg Glu Leu Arg Arg Gly Leu Asp Arg Lys Arg Val Leu Arg Asp
115 120 125
Glu Asp Phe Glu Ala Asp Glu Leu Thr Gly Ile Ser Pro Ala Arg Ala
130 135 140
His Val Ala Ala Ala Asn Leu Val Thr Ala Tyr Glu Gln Thr Val Lys
145 150 155 160
Glu Glu Ser Asn Phe Gln Lys Ser Phe Asn Asn His Val Arg Thr Leu
165 170 175
Ile Ala Arg Glu Glu Val Thr Leu Gly Leu Met His Leu Trp Asp Leu
180 185 190
Leu Glu Ala Ile Val Gln Asn Pro Thr Ser Lys Pro Leu Thr Ala Gln
195 200 205
Leu Phe Leu Val Val Gln His Ser Arg Asp Asn Glu Ala Phe Arg Glu
210 215 220
Ala Leu Leu Asn Ile Thr Glu Pro Glu Gly Arg Trp Leu Leu Asp Leu
225 230 235 240
Val Asn Ile Leu Gln Ser Ile Val Val Gln Glu Arg Gly Leu Pro Leu
245 250 255
Ser Glu Lys Leu Ala Ala Ile Asn Phe Ser Val Leu Ser Leu Gly Lys
260 265 270
Tyr Tyr Ala Arg Lys Ile Tyr Lys Thr Pro Tyr Val Pro Ile Asp Lys
275 280 285
Glu Val Lys Ile Asp Gly Phe Tyr Met Arg Met Thr Leu Lys Val Leu
290 295 300
Thr Leu Ser Asp Asp Leu Gly Val Tyr Arg Asn Asp Arg Met His Arg
305 310 315 320
Ala Val Ser Ala Ser Arg Arg Arg Glu Leu Ser Asp Gln Glu Leu Met
325 330 335
His Ser Leu Gln Arg Ala Leu Thr Gly Ala Gly Thr Glu Gly Glu Ser
340 345 350
Tyr Phe Asp Met Gly Ala Asp Leu His Trp Gln Pro Ser Arg Arg Ala
355 360 365
Leu Glu Ala Ala Ala Gly Pro Tyr Val Glu Glu Val Asp Asp Glu Val
370 375 380
Asp Glu Glu Gly Glu Tyr Leu Glu Asp
385 390
<210> 5
<211> 590
<212> PRT
<213> Simian adenovirus 39
<400> 5
Met Gln Gln Gln Pro Pro Pro Asp Pro Ala Met Arg Ala Ala Leu Gln
1 5 10 15
Ser Gln Pro Ser Gly Ile Asn Ser Ser Asp Asp Trp Thr Gln Ala Met
20 25 30
Gln Arg Ile Met Ala Leu Thr Thr Arg Asn Pro Glu Ala Phe Arg Gln
35 40 45
Gln Pro Gln Ala Asn Arg Leu Ser Ala Ile Leu Glu Ala Val Val Pro
50 55 60
Ser Arg Ser Asn Pro Thr His Glu Lys Val Leu Ala Ile Val Asn Ala
65 70 75 80
Leu Val Glu Asn Lys Ala Ile Arg Gly Asp Glu Ala Gly Leu Val Tyr
85 90 95
Asn Ala Leu Leu Glu Arg Val Ala Arg Tyr Asn Ser Thr Asn Val Gln
100 105 110
Thr Asn Leu Asp Arg Met Val Thr Asp Val Arg Glu Ala Val Ala Gln
115 120 125
Arg Glu Arg Phe His Arg Glu Ser Asn Leu Gly Ser Leu Val Ala Leu
130 135 140
Asn Ala Phe Leu Ser Thr Gln Pro Ala Asn Val Pro Arg Gly Gln Glu
145 150 155 160
Asp Tyr Thr Asn Phe Ile Ser Ala Leu Arg Leu Met Val Thr Glu Val
165 170 175
Pro Gln Ser Glu Val Tyr Gln Ser Gly Pro Asp Tyr Phe Phe Gln Thr
180 185 190
Ser Arg Gln Gly Leu Gln Thr Val Asn Leu Ser Gln Ala Phe Lys Asn
195 200 205
Leu Gln Gly Leu Trp Gly Val Gln Ala Pro Val Gly Asp Arg Ala Thr
210 215 220
Val Ser Ser Leu Leu Thr Pro Asn Ser Arg Leu Leu Leu Leu Leu Val
225 230 235 240
Ala Pro Phe Thr Asp Ser Gly Ser Ile Asn Arg Asn Ser Tyr Leu Gly
245 250 255
Tyr Leu Ile Asn Leu Tyr Arg Glu Ala Ile Gly Gln Ala His Val Asp
260 265 270
Glu Gln Thr Tyr Gln Glu Ile Thr His Val Ser Arg Ala Leu Gly Gln
275 280 285
Asp Asp Pro Gly Asn Leu Glu Ala Thr Leu Asn Phe Leu Leu Thr Asn
290 295 300
Arg Ser Gln Lys Ile Pro Pro Gln Tyr Ala Leu Ser Ala Glu Glu Glu
305 310 315 320
Arg Ile Leu Arg Tyr Val Gln Gln Ser Val Gly Leu Phe Leu Met Gln
325 330 335
Glu Gly Ala Thr Pro Ser Ala Ala Leu Asp Met Thr Ala Arg Asn Met
340 345 350
Glu Pro Ser Met Tyr Ala Ser Asn Arg Pro Phe Ile Asn Lys Leu Met
355 360 365
Asp Tyr Leu His Arg Ala Ala Ala Met Asn Ser Asp Tyr Phe Thr Asn
370 375 380
Ala Ile Leu Asn Pro His Trp Leu Pro Pro Pro Gly Phe Tyr Thr Gly
385 390 395 400
Glu Tyr Asp Met Pro Asp Pro Asn Asp Gly Phe Leu Trp Asp Asp Val
405 410 415
Asp Ser Ser Val Phe Ser Pro Arg Pro Gly Ala Asn Glu Arg Pro Leu
420 425 430
Trp Lys Lys Glu Gly Ser Asp Arg Arg Pro Ser Ser Ala Leu Ser Gly
435 440 445
Arg Glu Gly Ala Ala Ala Ala Val Pro Glu Ala Ala Ser Pro Phe Pro
450 455 460
Ser Leu Pro Phe Ser Leu Asn Ser Ile Arg Ser Ser Glu Leu Gly Arg
465 470 475 480
Ile Thr Arg Pro Arg Leu Leu Gly Glu Glu Glu Tyr Leu Asn Asp Ser
485 490 495
Leu Leu Arg Pro Glu Arg Glu Lys Asn Phe Pro Asn Asn Gly Ile Glu
500 505 510
Ser Leu Val Asp Lys Met Ser Arg Trp Lys Thr Tyr Ala Gln Glu His
515 520 525
Arg Asp Asp Pro Ser Gln Gly Ala Thr Ser Arg Gly Ser Ala Ala Arg
530 535 540
Lys Arg Arg Trp His Asp Arg Gln Arg Gly Leu Met Trp Asp Asp Glu
545 550 555 560
Asp Ser Ala Asp Asp Ser Ser Val Leu Asp Leu Gly Gly Ser Gly Gly
565 570 575
Asn Pro Phe Ala His Leu Arg Pro Arg Ile Gly Arg Met Met
580 585 590
<210> 6
<211> 532
<212> PRT
<213> Simian adenovirus 39
<400> 6
Met Met Arg Arg Ala Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser
1 5 10 15
Val Met Gln Gln Ala Met Ala Ala Ala Ala Met Gln Pro Pro Leu Glu
20 25 30
Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn
35 40 45
Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu
50 55 60
Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln
65 70 75 80
Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp Phe
85 90 95
Thr Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser
100 105 110
Arg Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn Met Pro Asn
115 120 125
Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val
130 135 140
Ser Arg Lys Thr Pro Asn Gly Val Gly Glu Asp Tyr Asp Gly Ser Gln
145 150 155 160
Asp Glu Leu Lys Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn
165 170 175
Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile Asp
180 185 190
Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp Ile
195 200 205
Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val
210 215 220
Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro
225 230 235 240
Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg
245 250 255
Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu Gly
260 265 270
Phe Gln Ile Met Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala Leu
275 280 285
Leu Asp Val Asp Ala Tyr Glu Lys Ser Lys Glu Glu Ala Ala Ala Glu
290 295 300
Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp Asn
305 310 315 320
Phe Ala Ser Pro Ala Ala Val Ala Ala Ala Glu Ala Ala Glu Thr Glu
325 330 335
Ser Lys Ile Val Ile Gln Pro Val Glu Lys Asp Ser Lys Asn Arg Ser
340 345 350
Tyr Asn Val Leu Pro Asp Lys Ile Asn Thr Ala Tyr Arg Ser Trp Tyr
355 360 365
Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr
370 375 380
Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val Tyr Trp
385 390 395 400
Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser Thr Arg
405 410 415
Gln Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr
420 425 430
Ser Lys Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln Leu Arg
435 440 445
Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gln
450 455 460
Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn
465 470 475 480
Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile
485 490 495
Arg Gly Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys
500 505 510
Pro Tyr Val Tyr Lys Ala Leu Gly Ile Val Ala Pro Arg Val Leu Ser
515 520 525
Ser Arg Thr Phe
530
<210> 7
<211> 193
<212> PRT
<213> Simian adenovirus 39
<400> 7
Met Ser Ile Leu Ile Ser Pro Ser Asn Asn Thr Gly Trp Gly Leu Arg
1 5 10 15
Ala Pro Ser Lys Met Tyr Gly Gly Ala Arg Gln Arg Ser Thr Gln His
20 25 30
Pro Val Arg Val Arg Gly His Phe Arg Ala Pro Trp Gly Ala Leu Lys
35 40 45
Gly Arg Val Arg Ser Arg Thr Thr Val Asp Asp Val Ile Asp Gln Val
50 55 60
Val Ala Asp Ala Arg Asn Tyr Thr Pro Ala Ala Ala Pro Ala Ser Thr
65 70 75 80
Val Asp Ala Val Ile Asp Ser Val Val Ala Asp Ala Arg Arg Tyr Ala
85 90 95
Arg Ala Lys Ser Arg Arg Arg Arg Ile Ala Arg Arg His Arg Ser Thr
100 105 110
Pro Ala Met Arg Ala Ala Arg Ala Leu Leu Arg Arg Ala Arg Arg Thr
115 120 125
Gly Arg Arg Ala Met Leu Arg Ala Ala Arg Arg Ala Ala Ser Gly Ala
130 135 140
Ser Ala Gly Arg Thr Arg Arg Arg Ala Ala Thr Ala Ala Ala Ala Ala
145 150 155 160
Ile Ala Ser Met Ser Arg Pro Arg Arg Gly Asn Val Tyr Trp Val Arg
165 170 175
Asp Ala Ala Thr Gly Val Arg Val Pro Val Arg Thr Arg Pro Pro Arg
180 185 190
Thr
<210> 8
<211> 347
<212> PRT
<213> Simian adenovirus 39
<400> 8
Met Ser Lys Arg Lys Phe Lys Glu Glu Met Leu Gln Val Ile Ala Pro
1 5 10 15
Glu Ile Tyr Gly Pro Ala Ala Ala Val Lys Glu Glu Arg Lys Pro Arg
20 25 30
Lys Leu Lys Arg Val Lys Lys Asp Lys Lys Glu Glu Glu Asp Asp Gly
35 40 45
Leu Val Glu Phe Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gln Trp
50 55 60
Arg Gly Arg Lys Val Gln Pro Val Leu Arg Pro Gly Thr Thr Val Val
65 70 75 80
Phe Thr Pro Gly Glu Arg Ser Gly Thr Ala Ser Lys Arg Ser Tyr Asp
85 90 95
Glu Val Tyr Gly Asp Asp Asp Ile Leu Glu Gln Ala Ala Glu Arg Leu
100 105 110
Gly Glu Phe Ala Tyr Gly Lys Arg Ser Arg Ser Ala Leu Lys Glu Glu
115 120 125
Ala Val Ser Ile Pro Leu Asp His Gly Asn Pro Thr Pro Ser Leu Lys
130 135 140
Pro Val Thr Leu Gln Gln Val Leu Pro Ser Ala Ala Pro Arg Arg Gly
145 150 155 160
Phe Lys Arg Glu Gly Glu Asp Leu Tyr Pro Thr Met Gln Leu Met Val
165 170 175
Pro Lys Arg Gln Lys Leu Glu Asp Val Leu Glu Thr Met Lys Val Asp
180 185 190
Pro Asp Val Gln Pro Glu Val Lys Val Arg Pro Ile Lys Gln Val Ala
195 200 205
Pro Gly Leu Gly Val Gln Thr Val Asp Ile Lys Ile Pro Thr Glu Pro
210 215 220
Met Glu Thr Gln Thr Glu Pro Met Ile Lys Pro Ser Thr Ser Thr Met
225 230 235 240
Glu Val Gln Thr Asp Pro Trp Met Pro Ala Ala Pro Thr Ser Ser Arg
245 250 255
Arg Pro Arg Arg Lys Tyr Gly Ala Ala Ser Leu Leu Met Pro Asn Tyr
260 265 270
Ala Leu His Pro Ser Ile Ile Pro Thr Pro Gly Tyr Arg Gly Thr Arg
275 280 285
Phe Tyr Arg Gly His Thr Thr Ser Ser Arg Arg Arg Lys Thr Thr Thr
290 295 300
Arg Arg Arg Arg Arg Ser Arg Arg Ser Ser Thr Ala Thr Ser Ala Ala
305 310 315 320
Ala Leu Val Arg Arg Val Tyr Arg Ser Gly Arg Glu Pro Leu Thr Leu
325 330 335
Pro Arg Ala Arg Tyr His Pro Ser Ile Ala Ile
340 345
<210> 9
<211> 77
<212> PRT
<213> Simian adenovirus 39
<400> 9
Met Ala Leu Thr Cys Arg Leu Arg Val Pro Ile Thr Gly Tyr Arg Gly
1 5 10 15
Arg Lys Pro Arg Arg Arg Arg Leu Ala Gly Asn Gly Met Arg Arg His
20 25 30
His His Arg Arg Arg Arg Ala Ile Ser Lys Arg Leu Gly Gly Gly Phe
35 40 45
Leu Pro Ala Leu Ile Pro Ile Ile Ala Ala Ala Ile Gly Ala Ile Pro
50 55 60
Gly Ile Ala Ser Val Ala Val Gln Ala Ser Gln Arg His
65 70 75
<210> 10
<211> 259
<212> PRT
<213> Simian adenovirus 39
<400> 10
Met Asp Ser Asp Ala Pro Gly Pro Val Met Cys Phe Arg Arg Gln Met
1 5 10 15
Glu Asp Ile Asn Phe Ser Ser Leu Ala Pro Arg His Gly Thr Arg Pro
20 25 30
Phe Met Gly Thr Trp Ser Asp Ile Gly Thr Ser Gln Leu Asn Gly Gly
35 40 45
Ala Phe Asn Trp Ser Ser Leu Trp Ser Gly Leu Lys Asn Phe Gly Ser
50 55 60
Thr Leu Lys Thr Tyr Gly Ser Lys Ala Trp Asn Ser Thr Thr Gly Gln
65 70 75 80
Ala Leu Arg Asp Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys Val Val
85 90 95
Asp Gly Leu Ala Ser Gly Ile Asn Gly Val Val Asp Leu Ala Asn Gln
100 105 110
Ala Val Gln Arg Gln Ile Asn Ser Arg Leu Asp Pro Val Pro Pro Ala
115 120 125
Gly Ser Val Glu Met Pro Gln Val Glu Glu Glu Leu Pro Pro Leu Asp
130 135 140
Lys Arg Gly Glu Lys Arg Pro Arg Pro Asp Ala Glu Glu Thr Leu Leu
145 150 155 160
Thr His Thr Asp Glu Pro Pro Pro Tyr Glu Glu Ala Val Lys Leu Gly
165 170 175
Leu Pro Thr Thr Arg Pro Ile Ala Pro Leu Ala Thr Gly Val Leu Lys
180 185 190
Pro Glu Ser Asn Lys Pro Ala Thr Leu Asp Leu Pro Pro Pro Gln Pro
195 200 205
Ser Arg Pro Ser Thr Val Ala Lys Pro Leu Pro Pro Val Ala Val Ala
210 215 220
Arg Ala Arg Pro Gly Gly Thr Ala Arg Pro His Ala Asn Trp Gln Ser
225 230 235 240
Thr Leu Asn Ser Ile Val Gly Leu Gly Val Gln Ser Val Lys Arg Arg
245 250 255
Arg Cys Tyr
<210> 11
<211> 940
<212> PRT
<213> Simian adenovirus 39
<400> 11
Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Ile His Ile Ala
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr
65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Ser Ser Gln Trp Glu Gln Lys Lys Thr Gly Asn Asn Ala
130 135 140
Asn Gly Asp Thr Glu Asn Val Thr Tyr Gly Val Ala Ala Met Gly Gly
145 150 155 160
Ile Asp Ile Asp Lys Asn Gly Leu Gln Ile Gly Thr Asp Asp Thr Lys
165 170 175
Asp Asn Asp Asn Asp Ile Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro
180 185 190
Gln Ile Gly Glu Glu Asn Trp Gln Glu Thr Tyr Ser Tyr Tyr Gly Gly
195 200 205
Arg Ala Leu Lys Lys Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe
210 215 220
Ala Arg Pro Thr Asn Val Lys Gly Gly Gln Ala Lys Ile Lys Thr Asp
225 230 235 240
Gly Asp Val Lys Ser Phe Asp Ile Asp Leu Ala Phe Phe Asp Ile Pro
245 250 255
Asn Ser Gly Ala Gly Asn Gly Thr Asn Val Asn Asn Tyr Pro Asp Met
260 265 270
Val Met Tyr Thr Glu Asn Val Asn Leu Glu Thr Pro Asp Thr His Ile
275 280 285
Val Tyr Lys Pro Gly Thr Ser Asp Asp Ser Ser Lys Val Asn Leu Cys
290 295 300
Gln Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn
305 310 315 320
Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu
325 330 335
Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg
340 345 350
Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg
355 360 365
Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro
370 375 380
Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn
385 390 395 400
Tyr Cys Phe Pro Leu Asp Gly Ala Gly Thr Asn Ser Val Tyr Gln Gly
405 410 415
Val Lys Pro Lys Thr Asp Asn Gly Asn Asp Gln Trp Glu Thr Asp Ser
420 425 430
Thr Val Ser Ser His Asn Gln Ile Cys Lys Gly Asn Ile Tyr Ala Met
435 440 445
Glu Ile Asn Leu Gln Ala Asn Leu Trp Arg Ser Phe Leu Tyr Ser Asn
450 455 460
Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro Ala Asn Ile
465 470 475 480
Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly Arg Val
485 490 495
Val Pro Pro Ser Leu Val Asp Ala Tyr Ile Asn Ile Gly Ala Arg Trp
500 505 510
Ser Leu Asp Pro Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn
515 520 525
Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val
530 535 540
Pro Phe His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Ser Leu
545 550 555 560
Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp
565 570 575
Val Asn Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp
580 585 590
Gly Ala Ser Ile Ser Phe Thr Ser Ile Asn Leu Tyr Ala Thr Phe Phe
595 600 605
Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn
610 615 620
Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met
625 630 635 640
Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser Ile Pro
645 650 655
Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys
660 665 670
Thr Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val
675 680 685
Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His
690 695 700
Thr Phe Lys Lys Val Ser Ile Thr Phe Asp Ser Ser Val Ser Trp Pro
705 710 715 720
Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys Arg Thr
725 730 735
Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr Lys Asp
740 745 750
Trp Phe Leu Val Gln Met Leu Ala His Tyr Asn Ile Gly Tyr Gln Gly
755 760 765
Phe Tyr Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg
770 775 780
Asn Phe Gln Pro Met Ser Arg Gln Val Val Asp Glu Val Asn Tyr Lys
785 790 795 800
Asp Tyr Gln Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser Gly Phe
805 810 815
Val Gly Tyr Leu Ala Pro Thr Met Cys Gln Gly Gln Pro Tyr Pro Ala
820 825 830
Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Thr Ser Val Thr
835 840 845
Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser
850 855 860
Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn Met
865 870 875 880
Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe Glu Val Asp
885 890 895
Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe Glu Val Phe Asp
900 905 910
Val Val Arg Val His Gln Pro His Arg Gly Val Ile Glu Ala Val Tyr
915 920 925
Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
930 935 940
<210> 12
<211> 208
<212> PRT
<213> Simian adenovirus 39
<400> 12
Met Ala Glu Pro Thr Gly Ser Gly Glu Gln Glu Leu Arg Ala Ile Ile
1 5 10 15
Arg Asp Leu Gly Cys Gly Pro Tyr Phe Leu Gly Thr Phe Asp Lys Arg
20 25 30
Phe Pro Gly Phe Met Ala Pro His Lys Leu Ala Cys Ala Ile Val Asn
35 40 45
Thr Ala Gly Arg Glu Thr Gly Gly Glu His Trp Leu Ala Phe Ala Trp
50 55 60
Asn Pro Arg Ser Asn Thr Cys Tyr Leu Phe Asp Pro Phe Gly Phe Ser
65 70 75 80
Asp Glu Arg Leu Lys Gln Ile Tyr Gln Phe Glu Tyr Glu Gly Leu Leu
85 90 95
Arg Arg Ser Ala Leu Ala Thr Glu Asp Arg Cys Val Thr Leu Glu Lys
100 105 110
Ser Thr Gln Thr Val Gln Gly Pro Arg Ser Ala Ala Cys Gly Leu Phe
115 120 125
Cys Cys Met Phe Leu His Ala Phe Val His Trp Pro Asp Arg Pro Met
130 135 140
Asp Lys Asn Pro Thr Met Asn Leu Leu Thr Gly Val Pro Asn Gly Met
145 150 155 160
Leu Gln Ser Pro Gln Val Glu Pro Thr Leu Arg Arg Asn Gln Glu Ala
165 170 175
Leu Tyr Arg Phe Leu Asn Ala His Ser Ala Tyr Phe Arg Ser His Arg
180 185 190
Ala Arg Ile Glu Lys Ala Thr Ala Phe Asp Arg Met Asn Gln Asp Met
195 200 205
<210> 13
<211> 801
<212> PRT
<213> Simian adenovirus 39
<400> 13
Met Glu Thr Gln Pro Ser Ser Pro Thr Ser Pro Ser Ala Pro Ala Ala
1 5 10 15
Thr Ala Asp Glu Asn Gln Gln Gln Gln Asn Glu Ser Leu Thr Ala Pro
20 25 30
Pro Pro Ser Pro Thr Ser Asp Ala Ala Ala Pro Asp Met Gln Glu Met
35 40 45
Glu Glu Ser Ile Glu Ile Asp Leu Gly Tyr Val Thr Pro Ala Glu His
50 55 60
Glu Glu Glu Leu Ala Ala Arg Phe Ser Ala Pro Glu Glu Asn His Gln
65 70 75 80
Glu Gln Pro Glu Gln Glu Ala Glu Ser Glu Gln Ser Gln Ala Gly Leu
85 90 95
Glu His Gly Asp Tyr Leu Ser Gly Ala Glu Asp Val Leu Ile Lys His
100 105 110
Leu Ala Arg Gln Cys Ile Ile Val Lys Asp Ala Leu Leu Asp Arg Ala
115 120 125
Glu Val Pro Leu Ser Val Ala Glu Leu Ser Arg Ala Tyr Glu Arg Asn
130 135 140
Leu Phe Ser Pro Arg Val Pro Pro Lys Arg Gln Pro Asn Gly Thr Cys
145 150 155 160
Glu Pro Asn Pro Arg Leu Asn Phe Tyr Pro Val Phe Ala Val Pro Glu
165 170 175
Ala Leu Ala Thr Tyr His Leu Phe Phe Lys Asn Gln Arg Ile Pro Val
180 185 190
Ser Cys Arg Ala Asn Arg Thr Arg Ala Asp Ala Leu Leu Asn Leu Gly
195 200 205
Pro Gly Ala Arg Leu Pro Asp Ile Ala Ser Leu Glu Glu Val Pro Lys
210 215 220
Ile Phe Glu Gly Leu Gly Ser Asp Glu Thr Arg Ala Ala Asn Ala Leu
225 230 235 240
Gln Gly Ser Gly Glu Glu His Glu His His Ser Ala Leu Val Glu Leu
245 250 255
Glu Gly Asp Asn Ala Arg Leu Ala Val Leu Lys Arg Thr Val Glu Leu
260 265 270
Thr His Phe Ala Tyr Pro Ala Leu Asn Leu Pro Pro Lys Val Met Ser
275 280 285
Ala Val Met Asp Gln Val Leu Ile Lys Arg Ala Ser Pro Leu Ser Glu
290 295 300
Glu Glu Met Gln Asp Pro Glu Ser Ser Asp Glu Gly Lys Pro Val Val
305 310 315 320
Ser Asp Glu Gln Leu Ala Arg Trp Leu Gly Ala Ser Ser Thr Pro Gln
325 330 335
Ser Leu Glu Glu Arg Arg Lys Leu Met Met Ala Val Val Leu Val Thr
340 345 350
Val Glu Leu Glu Cys Leu Arg Arg Phe Phe Ala Asp Ala Glu Thr Leu
355 360 365
Arg Lys Val Glu Glu Asn Leu His Tyr Leu Phe Arg His Gly Phe Val
370 375 380
Arg Gln Ala Cys Lys Ile Ser Asn Val Glu Leu Thr Asn Leu Val Ser
385 390 395 400
Tyr Met Gly Ile Leu His Glu Asn Arg Leu Gly Gln Asn Val Leu His
405 410 415
Thr Thr Leu Arg Gly Glu Ala Arg Arg Asp Tyr Ile Arg Asp Cys Val
420 425 430
Tyr Leu Tyr Leu Cys His Thr Trp Gln Thr Gly Met Gly Val Trp Gln
435 440 445
Gln Cys Leu Glu Glu Gln Asn Leu Lys Glu Leu Cys Lys Leu Leu Gln
450 455 460
Lys Asn Leu Lys Ala Leu Trp Thr Gly Phe Asp Glu Arg Thr Thr Ala
465 470 475 480
Ser Asp Leu Ala Asp Leu Ile Phe Pro Glu Arg Leu Arg Leu Thr Leu
485 490 495
Arg Asn Gly Leu Pro Asp Phe Met Ser Gln Ser Met Leu Gln Asn Phe
500 505 510
Arg Ser Phe Ile Leu Glu Arg Ser Gly Ile Leu Pro Ala Thr Cys Ser
515 520 525
Ala Leu Pro Ser Asp Phe Val Pro Leu Thr Phe Arg Glu Cys Pro Pro
530 535 540
Pro Leu Trp Ser His Cys Tyr Leu Leu Arg Leu Ala Asn Tyr Leu Ala
545 550 555 560
Tyr His Ser Asp Val Ile Glu Asp Val Ser Gly Glu Gly Leu Leu Glu
565 570 575
Cys His Cys Arg Cys Asn Leu Cys Thr Pro His Arg Ser Leu Ala Cys
580 585 590
Asn Pro Gln Leu Leu Ser Glu Thr Gln Ile Ile Gly Thr Phe Glu Leu
595 600 605
Gln Gly Pro Gly Glu Glu Gly Lys Gly Gly Leu Lys Leu Thr Pro Gly
610 615 620
Leu Trp Thr Ser Ala Tyr Leu Arg Lys Phe Val Pro Glu Asp Tyr His
625 630 635 640
Pro Phe Glu Ile Arg Phe Tyr Glu Asp Gln Ser Gln Pro Pro Lys Ala
645 650 655
Glu Leu Ser Ala Cys Val Ile Thr Gln Gly Ala Ile Leu Ala Gln Leu
660 665 670
Gln Ala Ile Gln Lys Ser Arg Gln Glu Phe Leu Leu Lys Lys Gly His
675 680 685
Gly Val Tyr Leu Asp Pro Gln Thr Gly Glu Glu Leu Asn Pro Ser Phe
690 695 700
Pro Gln Asp Ala Pro Arg Lys Gln Gln Glu Ala Glu Ser Gly Ala Ala
705 710 715 720
Ala Gly Gly Phe Gly Gly Arg Leu Gly Glu Gln Ser Gly Arg Gly Gly
725 730 735
Gly Asp Gly Arg Leu Gly Gln His Ser Gly Arg Gly Gly Gln Pro Ala
740 745 750
Arg Gln Ser Gly Gly Arg Arg Gly Gly Gly Gly Gly Arg Gly Arg Ser
755 760 765
Ser Arg Arg Gln Thr Val Val Leu Gly Gly Gly Gly Glu Ser Lys Gln
770 775 780
His Gly Tyr His Leu Arg Ser Gly Ser Gly Ser Arg Arg Pro Gly Pro
785 790 795 800
Gln
<210> 14
<211> 227
<212> PRT
<213> Simian adenovirus 39
<400> 14
Met Ser Lys Glu Ile Pro Thr Pro Tyr Met Trp Ser Tyr Gln Pro Gln
1 5 10 15
Met Gly Leu Ala Ala Gly Ala Ala Gln Asp Tyr Ser Thr Arg Met Asn
20 25 30
Trp Leu Ser Ala Gly Pro Ala Met Ile Ser Arg Val Asn Asp Ile Arg
35 40 45
Ala His Arg Asn Gln Ile Leu Leu Glu Gln Ser Ala Leu Thr Ala Thr
50 55 60
Pro Arg Asn His Leu Asn Pro Arg Asn Trp Pro Ala Ala Leu Val Tyr
65 70 75 80
Gln Glu Ile Pro Gln Pro Thr Thr Val Leu Leu Pro Arg Asp Ala Gln
85 90 95
Ala Glu Val Gln Leu Thr Asn Ser Gly Val Gln Leu Ala Gly Gly Ala
100 105 110
Thr Leu Cys Arg His Arg Pro Ala Gln Gly Ile Lys Arg Leu Val Ile
115 120 125
Arg Gly Arg Gly Thr Gln Leu Asn Asp Glu Val Val Ser Ser Ser Leu
130 135 140
Gly Leu Arg Pro Asp Gly Val Phe Gln Leu Ala Gly Ser Gly Arg Ser
145 150 155 160
Ser Phe Thr Pro Arg Gln Ala Val Leu Thr Leu Glu Ser Ser Ser Ser
165 170 175
Gln Pro Arg Ser Gly Gly Ile Gly Thr Leu Gln Phe Val Glu Glu Phe
180 185 190
Thr Pro Ser Val Tyr Phe Asn Pro Phe Ser Gly Ser Pro Gly His Tyr
195 200 205
Pro Asp Glu Phe Ile Pro Asn Phe Asp Ala Ile Ser Glu Ser Val Asp
210 215 220
Gly Tyr Asp
225
<210> 15
<211> 106
<212> PRT
<213> Simian adenovirus 39
<400> 15
Met Ser His Gly Gly Ala Ala Asp Ile Ala Arg Leu Arg His Leu Asp
1 5 10 15
His Cys Arg Arg Phe Arg Cys Phe Ala Arg Asp Leu Ala Glu Phe Thr
20 25 30
Tyr Phe Glu Leu Pro Glu Glu His Pro Gln Gly Pro Ala His Gly Val
35 40 45
Arg Ile Val Val Glu Gly Gly Leu Asp Ser His Leu Leu Arg Ile Phe
50 55 60
Ser Gln Arg Pro Ile Leu Val Glu Arg Gln Gln Gly Asn Thr Leu Leu
65 70 75 80
Thr Leu Tyr Cys Ile Cys Asp His Pro Gly Leu His Glu Ser Leu Cys
85 90 95
Cys Leu Leu Cys Thr Glu Tyr Asn Lys Ser
100 105
<210> 16
<211> 176
<212> PRT
<213> Simian adenovirus 39
<400> 16
Met Gly Gln Ile Thr Leu Val Ser Cys Gly Val Leu Val Ala Val Val
1 5 10 15
Leu Ser Ile Val Gly Leu Gly Gly Ala Ala Val Val Lys Glu Lys Ala
20 25 30
Asp Pro Cys Leu His Phe Asn Pro Asp Lys Cys Gln Leu Ser Phe Gln
35 40 45
Pro Asp Gly Asn Arg Cys Ala Val Leu Ile Lys Cys Gly Trp Glu Cys
50 55 60
Glu Asn Val Arg Ile Glu Tyr Asn Asn Lys Thr Arg Asn Asn Thr Leu
65 70 75 80
Ala Ser Val Trp Gln Pro Gly Asp Pro Glu Trp Tyr Thr Val Ser Val
85 90 95
Pro Gly Ala Asp Gly Ser Pro Arg Thr Val Asn Asn Thr Phe Ile Phe
100 105 110
Ala His Met Cys Asp Thr Val Met Trp Met Ser Lys Gln Tyr Asp Met
115 120 125
Trp Pro Pro Thr Lys Glu Asn Ile Val Val Phe Ser Ile Ala Tyr Ser
130 135 140
Val Cys Thr Ala Leu Ile Thr Ala Ile Val Cys Leu Ser Ile His Met
145 150 155 160
Leu Ile Ala Ile Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys Gln Pro
165 170 175
<210> 17
<211> 206
<212> PRT
<213> Simian adenovirus 39
<400> 17
Met Ala Ser Val Lys Phe Leu Leu Leu Phe Ala Ser Leu Ile Thr Val
1 5 10 15
Ile His Gly Met Ser Asn Glu Lys Ile Thr Ile Tyr Thr Gly Thr Asn
20 25 30
His Thr Leu Lys Gly Pro Glu Lys Ala Thr Glu Val Ser Trp Tyr Cys
35 40 45
Tyr Phe Asn Glu Ser Asp Val Ser Thr Glu Leu Cys Gly Asn Lys Asn
50 55 60
Lys Lys Asn Glu Ser Ile Thr Leu Ile Lys Phe Gln Cys Gly Ser Asp
65 70 75 80
Leu Thr Leu Ile Asn Ile Thr Arg Asp Tyr Val Gly Met Tyr Tyr Gly
85 90 95
Thr Thr Ala Gly Ile Ser Asp Met Glu Phe Tyr Gln Val Ser Val Ser
100 105 110
Glu Pro Thr Thr Pro Arg Met Thr Thr Thr Thr Lys Thr Thr Pro Val
115 120 125
Thr Thr Met Gln Leu Thr Thr Asn Gly Phe Leu Ala Met Leu Gln Val
130 135 140
Ala Glu Asn Ser Thr Ser Ile Gln Pro Thr Pro Pro Ser Glu Glu Ile
145 150 155 160
Pro Arg Ser Met Ile Gly Ile Ile Val Ala Val Val Val Cys Met Leu
165 170 175
Ile Ile Ala Leu Cys Met Val Tyr Tyr Ala Phe Cys Tyr Arg Lys His
180 185 190
Arg Leu Asn Asp Lys Leu Glu His Leu Leu Ser Val Glu Phe
195 200 205
<210> 18
<211> 205
<212> PRT
<213> Simian adenovirus 39
<400> 18
Met Lys Ile Leu Gly Leu Leu Ile Phe Ser Ile Ile Thr Ser Ala Leu
1 5 10 15
Cys Asn Ser Asp Asn Glu Asp Val Thr Val Val Val Gly Thr Asn Tyr
20 25 30
Thr Leu Lys Gly Pro Ala Lys Gly Met Leu Ser Trp Tyr Cys Tyr Phe
35 40 45
Gly Ser Asp Thr Thr Glu Thr Glu Leu Cys Asn Leu Lys Asn Gly Lys
50 55 60
Ile Gln Asn Ser Lys Ile Asn Asn Tyr Ile Cys Asn Gly Thr Asp Leu
65 70 75 80
Ile Leu Leu Asn Ile Thr Lys Ser Tyr Ala Gly Ser Tyr Thr Cys Pro
85 90 95
Gly Asp Asp Ala Asp Ser Met Ile Phe Tyr Lys Val Thr Val Val Asp
100 105 110
Pro Thr Thr Pro Pro Pro Pro Thr Thr Thr Thr Thr His Thr Thr His
115 120 125
Ile Glu Gln Thr Thr Ala Glu Ala Ala Gly Glu Leu Ala Leu Gln Val
130 135 140
Gln Glu Asp Ser Leu Met Ala Asn Thr Pro Thr Pro Asp His Arg Cys
145 150 155 160
Pro Gly Leu Leu Val Ser Gly Ile Val Gly Val Leu Ser Gly Leu Ala
165 170 175
Val Ile Ile Ile Cys Met Phe Ile Phe Ala Cys Cys Tyr Arg Arg Leu
180 185 190
Tyr Arg Gln Lys Ser Asp Pro Leu Leu Asn Leu Tyr Val
195 200 205
<210> 19
<211> 295
<212> PRT
<213> Simian adenovirus 39
<400> 19
Met Lys Ala Val Ser Ala Leu Val Phe Cys Ser Leu Ile Gly Ile Val
1 5 10 15
Phe Ser Ala Gly Phe Leu Lys Asn Leu Thr Ile Tyr Glu Gly Glu Asn
20 25 30
Ala Thr Leu Val Gly Ile Ser Gly Gln Asn Val Ser Trp Leu Lys Tyr
35 40 45
His Leu Asp Gly Trp Lys Asp Ile Cys Asp Trp Asn Val Thr Val Tyr
50 55 60
Thr Cys Asn Gly Val Asn Leu Thr Ile Thr Asn Ala Thr Gln Asp Gln
65 70 75 80
Asn Gly Arg Phe Lys Gly Gln Ser Phe Thr Arg Asn Asn Gly Tyr Glu
85 90 95
Ser His Asn Met Phe Ile Tyr Asp Val Thr Val Ile Arg Asn Glu Thr
100 105 110
Ala Thr Thr Gln Met Pro Thr Thr His Ser Ser Thr Thr Thr Ser Met
115 120 125
Gln Thr Thr Gln Thr Thr Thr Phe Tyr Thr Ser Thr Gln His Ile Thr
130 135 140
Thr Thr Ala Ala Lys Pro Ser Ser Ala Ala Pro Gln Pro Gln Ala Leu
145 150 155 160
Ala Leu Lys Ala Ala Gln Pro Ser Thr Thr Thr Arg Thr Asn Glu Gln
165 170 175
Thr Thr Asp Phe Leu Ser Thr Val Glu Ser His Thr Thr Ala Thr Ser
180 185 190
Ser Ala Phe Ser Ser Thr Ala Asn Leu Ser Ser Leu Ser Ser Thr Pro
195 200 205
Ile Ser Pro Ala Thr Thr Pro Thr Pro Ala Leu Leu Pro Thr Pro Leu
210 215 220
Lys Gln Thr Glu Asn Ser Gly Met Gln Trp Gln Ile Thr Leu Leu Ile
225 230 235 240
Val Ile Gly Leu Val Ile Leu Ala Val Leu Leu Tyr Tyr Ile Phe Cys
245 250 255
Arg Arg Ile Pro Asn Ala His Arg Lys Pro Val Tyr Lys Pro Ile Val
260 265 270
Ile Gly Gln Pro Glu Pro Leu Gln Val Glu Gly Gly Leu Arg Asn Leu
275 280 285
Leu Phe Ser Phe Thr Val Trp
290 295
<210> 20
<211> 91
<212> PRT
<213> Simian adenovirus 39
<400> 20
Met Ile Pro Arg Gln Phe Leu Ile Thr Ile Leu Ile Cys Leu Leu Gln
1 5 10 15
Val Cys Ala Thr Leu Ala Leu Val Ala Asn Ala Ser Pro Asp Cys Ile
20 25 30
Gly Pro Phe Ala Ser Tyr Val Leu Phe Ala Phe Ile Thr Cys Ile Cys
35 40 45
Cys Cys Ser Ile Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Ile Asp
50 55 60
Trp Ile Phe Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg
65 70 75 80
Asp Gln Gln Val Ala Arg Leu Leu Arg Leu Leu
85 90
<210> 21
<211> 142
<212> PRT
<213> Simian adenovirus 39
<400> 21
Met Arg Ala Leu Leu Leu Leu Ala Leu Leu Leu Leu Val Leu Pro Arg
1 5 10 15
Pro Val Asp Pro Arg Pro Pro Thr Gln Ser Pro Glu Glu Val Arg Lys
20 25 30
Cys Lys Phe Gln Glu Pro Trp Lys Phe Leu Lys Cys Tyr Arg Gln Lys
35 40 45
Ser Asp Met His Pro Ser Trp Ile Met Ile Ile Gly Ile Val Asn Ile
50 55 60
Leu Ala Cys Thr Leu Ile Ser Phe Val Ile Tyr Pro Cys Phe Asp Phe
65 70 75 80
Gly Trp Asn Ser Pro Glu Ala Leu Tyr Leu Pro Pro Glu Pro Asp Thr
85 90 95
Pro Pro Gln Gln Pro Gln Ala His Ala Leu Pro Pro Gln Pro Arg Pro
100 105 110
Gln Tyr Met Pro Ile Leu Asp Tyr Glu Ala Glu Pro Gln Arg Pro Met
115 120 125
Leu Pro Ala Ile Ser Tyr Phe Asn Leu Thr Gly Gly Asp Asp
130 135 140
<210> 22
<211> 489
<212> PRT
<213> Simian adenovirus 39
<400> 22
Met Phe Ser Val Ser Ser Thr Ser Leu Pro Ser Ser Gln Leu Trp Tyr
1 5 10 15
Cys Arg Pro Arg Arg Ala Ala Asn Phe Leu His Thr Leu Lys Gly Met
20 25 30
Ser Asn Ser Ser Cys Pro Ser Ile Phe Ile Phe Ile Phe Tyr Gln Met
35 40 45
Ser Lys Lys Arg Ala Arg Val Asp Asp Gly Phe Asp Pro Val Tyr Pro
50 55 60
Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro Phe
65 70 75 80
Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser Leu
85 90 95
Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Ala Val Thr Leu Lys
100 105 110
Leu Gly Glu Gly Val Asp Leu Asp Glu Ser Gly Lys Leu Ile Ser Lys
115 120 125
Asn Ala Thr Lys Ala Thr Ala Pro Leu Ser Ile Ser Asn Asn Thr Ile
130 135 140
Ser Leu Asn Leu Asp Thr Pro Phe Tyr Thr Lys Asp Gly Lys Leu Thr
145 150 155 160
Met Gln Val Thr Ala Pro Leu Lys Leu Ala Asn Thr Ala Ile Leu Asn
165 170 175
Thr Leu Ala Met Ala Tyr Gly Asn Gly Leu Gly Leu Ser Asn Asn Ala
180 185 190
Leu Thr Val Gln Val Thr Ser Pro Leu Thr Phe Asp Asn Ser Lys Val
195 200 205
Lys Ile Asn Leu Gly Asn Gly Pro Leu Met Val Ser Ala Asn Lys Leu
210 215 220
Ser Ile Asn Cys Leu Arg Gly Leu Tyr Val Ala Pro Asn Asn Thr Gly
225 230 235 240
Leu Glu Thr Asn Ile Ser Trp Ala Asn Ala Met Arg Phe Glu Gly Asn
245 250 255
Ala Met Ala Val Asn Ile Asp Thr Asn Lys Gly Leu Gln Phe Gly Thr
260 265 270
Thr Ser Thr Glu Thr Gly Val Thr Asn Ala Tyr Pro Ile Gln Val Lys
275 280 285
Leu Gly Ala Gly Leu Ala Phe Asp Ser Thr Gly Ala Ile Val Ala Trp
290 295 300
Asn Lys Glu Asn Asp Ser Leu Thr Leu Trp Thr Thr Pro Asp Pro Ser
305 310 315 320
Pro Asn Cys Lys Ile Ala Ser Glu Lys Asp Ala Lys Leu Thr Leu Cys
325 330 335
Leu Thr Lys Cys Gly Ser Gln Ile Leu Gly Thr Val Ser Leu Leu Ala
340 345 350
Val Ser Gly Ser Leu Ala Pro Ile Thr Gly Ala Val Ser Thr Ala Leu
355 360 365
Val Ser Leu Lys Phe Asn Ala Asn Gly Ala Leu Leu Asp Lys Ser Thr
370 375 380
Leu Asn Lys Glu Tyr Trp Asn Tyr Arg Gln Gly Asp Leu Ile Pro Gly
385 390 395 400
Thr Pro Tyr Thr His Ala Val Gly Phe Met Pro Asn Lys Lys Ala Tyr
405 410 415
Pro Lys Asn Thr Thr Ala Ala Ser Lys Ser His Ile Val Gly Asp Val
420 425 430
Tyr Leu Asp Gly Asp Ala Asp Lys Pro Leu Ser Leu Ile Ile Thr Phe
435 440 445
Asn Glu Thr Asp Asp Glu Thr Cys Asp Tyr Cys Ile Asn Phe Gln Trp
450 455 460
Lys Trp Ser Ala Glu Gln Tyr Lys Glu Lys Thr Leu Ala Thr Ser Ser
465 470 475 480
Phe Thr Phe Ser Tyr Ile Ala Gln Glu
485
<210> 23
<211> 590
<212> DNA
<213> Simian adenovirus 39
<220>
<221> CDS
<222> (10)..(585)
<223> label=Elb\19K
<400> 23
gcaggactc atg gag atc tgg acg gtc ttg gaa gac ttt cac cag act aga 51
Met Glu Ile Trp Thr Val Leu Glu Asp Phe His Gln Thr Arg
1 5 10
cag ctg cta gag aac tca tcg gag gaa gtc tcc tac ctg tgg aga ttc 99
Gln Leu Leu Glu Asn Ser Ser Glu Glu Val Ser Tyr Leu Trp Arg Phe
15 20 25 30
tgc ttc ggt ggg cct cta gct aag cta gtc tat agg gcc aaa cag gat 147
Cys Phe Gly Gly Pro Leu Ala Lys Leu Val Tyr Arg Ala Lys Gln Asp
35 40 45
tat aag gat caa ttt gag gat att ttg aga gag tgt cct ggt att ttt 195
Tyr Lys Asp Gln Phe Glu Asp Ile Leu Arg Glu Cys Pro Gly Ile Phe
50 55 60
gac tct ctc aac ttg ggc cat cag tct cac ttt aac cag agt att ctg 243
Asp Ser Leu Asn Leu Gly His Gln Ser His Phe Asn Gln Ser Ile Leu
65 70 75
aga gcc ctt gac ttt tct act cct ggc aga act acc gcc gcg gta gcc 291
Arg Ala Leu Asp Phe Ser Thr Pro Gly Arg Thr Thr Ala Ala Val Ala
80 85 90
ttt ttt gcc ttt atc ctt gac aaa tgg agt caa gaa acc cat ttc agc 339
Phe Phe Ala Phe Ile Leu Asp Lys Trp Ser Gln Glu Thr His Phe Ser
95 100 105 110
agg gat tac cgt ctg gac tgc tta gca gta gct ttg tgg aga aca tgg 387
Arg Asp Tyr Arg Leu Asp Cys Leu Ala Val Ala Leu Trp Arg Thr Trp
115 120 125
agg tgc cag cgc ctg aat gca atc tcc ggc tac ttg cca gta cag ccg 435
Arg Cys Gln Arg Leu Asn Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro
130 135 140
gta gac acg ctg agg atc ctg agt ctc cag tca ccc cag gaa cac caa 483
Val Asp Thr Leu Arg Ile Leu Ser Leu Gln Ser Pro Gln Glu His Gln
145 150 155
cgc cgc cag cag ccg cag cag gag cag cag caa gag gag gag gag gac 531
Arg Arg Gln Gln Pro Gln Gln Glu Gln Gln Gln Glu Glu Glu Glu Asp
160 165 170
cga gaa gag aac ccg aga gcc ggt ctg gac cct ccg gtg gcg gag gag 579
Arg Glu Glu Asn Pro Arg Ala Gly Leu Asp Pro Pro Val Ala Glu Glu
175 180 185 190
gag gag tagct 590
Glu Glu
<210> 24
<211> 192
<212> PRT
<213> Simian adenovirus 39
<400> 24
Met Glu Ile Trp Thr Val Leu Glu Asp Phe His Gln Thr Arg Gln Leu
1 5 10 15
Leu Glu Asn Ser Ser Glu Glu Val Ser Tyr Leu Trp Arg Phe Cys Phe
20 25 30
Gly Gly Pro Leu Ala Lys Leu Val Tyr Arg Ala Lys Gln Asp Tyr Lys
35 40 45
Asp Gln Phe Glu Asp Ile Leu Arg Glu Cys Pro Gly Ile Phe Asp Ser
50 55 60
Leu Asn Leu Gly His Gln Ser His Phe Asn Gln Ser Ile Leu Arg Ala
65 70 75 80
Leu Asp Phe Ser Thr Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe
85 90 95
Ala Phe Ile Leu Asp Lys Trp Ser Gln Glu Thr His Phe Ser Arg Asp
100 105 110
Tyr Arg Leu Asp Cys Leu Ala Val Ala Leu Trp Arg Thr Trp Arg Cys
115 120 125
Gln Arg Leu Asn Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro Val Asp
130 135 140
Thr Leu Arg Ile Leu Ser Leu Gln Ser Pro Gln Glu His Gln Arg Arg
145 150 155 160
Gln Gln Pro Gln Gln Glu Gln Gln Gln Glu Glu Glu Glu Asp Arg Glu
165 170 175
Glu Asn Pro Arg Ala Gly Leu Asp Pro Pro Val Ala Glu Glu Glu Glu
180 185 190
<210> 25
<211> 6310
<212> DNA
<213> Simian adenovirus 39
<220>
<221> CDS
<222> (7)..(570)
<223> label=22K
<220>
<221> CDS
<222> (1862)..(2491)
<223> label=E3\CR1\alpha
<220>
<221> CDS
<222> (5898)..(6302)
<223> label=E3\14.7K
<400> 25
cccagg atg ccc cga gga agc agc aag aag ctg aaa gtg gag ctg ccg 48
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro
1 5 10
ccg gag gat ttg gag gaa gac tgg gag agc agt cag gca gag gag gag 96
Pro Glu Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu Glu Glu
15 20 25 30
gag atg gaa gac tgg gac agc act cag gca gag gag gac agc ctg caa 144
Glu Met Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln
35 40 45
gac agt ctg gag gaa gac gag gtg gag gag gag gca gag gaa gaa gca 192
Asp Ser Leu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu Ala
50 55 60
gcc gcc gcc aga ccg tcg tcc tcg gcg gag gag gag aaa gca agc agc 240
Ala Ala Ala Arg Pro Ser Ser Ser Ala Glu Glu Glu Lys Ala Ser Ser
65 70 75
acg gat acc atc tcc gct ccg ggt cgg ggt cgc ggc ggc cgg gcc cac 288
Thr Asp Thr Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His
80 85 90
agt aga tgg gac gag acc ggg cgc ttc ccg aac ccc acc acc cag acc 336
Ser Arg Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr
95 100 105 110
ggt aag aag gag cgg cag gga tac aag tcc tgg cgg ggg cac aaa aac 384
Gly Lys Lys Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly His Lys Asn
115 120 125
gcc atc gtc tcc tgc ttg caa gcc tgc ggg ggc aac atc tcc ttc acc 432
Ala Ile Val Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile Ser Phe Thr
130 135 140
cgg cgc tac ctg ctc ttc cac cgc ggg gtg aac ttc ccc cgc aac atc 480
Arg Arg Tyr Leu Leu Phe His Arg Gly Val Asn Phe Pro Arg Asn Ile
145 150 155
ttg cat tac tac cgt cac ctc cac agc ccc tac tac tgt ttc caa gaa 528
Leu His Tyr Tyr Arg His Leu His Ser Pro Tyr Tyr Cys Phe Gln Glu
160 165 170
gag gca gaa acc cag cag cag cag cag aaa acc agc agc agc 570
Glu Ala Glu Thr Gln Gln Gln Gln Gln Lys Thr Ser Ser Ser
175 180 185
tagaaaatcc acggcggcag gtggactgag gatcgcggcg aacgagccgg cgcagacccg 630
ggagctgagg aaccggatct ttcccaccct ctatgccatc ttccagcaga gtcgggggca 690
ggagcaggaa ctgaaagtca agaaccgttc tctgcgctcg ctcacccgca gttgtctgta 750
tcacaagagc gaagaccaac ttcagcgcac tctcgaggac gccgaggctc tcttcaacaa 810
gtactgcgcg ctcactctta aagagtagcc cgcgcccgcc cagccgcaga aaaaggcggg 870
aattacgtca cctgtgccct tcgcccgacc atcatgagca aagagattcc cacgccttac 930
atgtggagct accagcccca gatgggcctg gccgccggcg ccgcccagga ctactccacc 990
cgcatgaatt ggctcagcgc cgggcccgcg atgatctcac gggtgaatga catccgcgcc 1050
caccgaaacc agatactcct agaacagtca gcgctcaccg ccacgccccg caatcacctc 1110
aacccgcgta attggcccgc cgccctggtg taccaggaaa ttccccagcc cacgaccgta 1170
ctacttccgc gagacgccca ggccgaagtc cagctgacta actcaggtgt ccagctggcg 1230
ggcggcgcca ccctgtgtcg tcaccgcccc gctcagggta taaagcggct ggtgatccgg 1290
ggcagaggca cacagctcaa cgacgaggtg gtgagctctt cgctgggtct gcgacctgac 1350
ggagtcttcc aactcgccgg atcggggaga tcttccttca cgcctcgtca ggccgtgctg 1410
actttggaga gttcgtcctc gcagccccgc tcgggcggca tcggcactct ccagttcgtg 1470
gaggagttca ctccctcggt ctacttcaac cccttctccg gctcccccgg gcactacccg 1530
gacgagttca tcccgaactt tgacgccatc agcgagtcgg tggacggcta cgattgaatg 1590
tcccatggtg gcgcggctga catagctcgg cttcgacacc tggaccactg ccgccgcttt 1650
cgctgcttcg ctcgggacct cgccgagttc acctactttg agctgcccga ggagcatcct 1710
cagggcccgg cccacggagt gcggatcgtc gtcgaagggg gcctagactc ccacctgctt 1770
cggatcttca gccagcgccc gatcctggtc gagcgccaac agggcaacac cctcctgacc 1830
ctctactgca tctgcgacca ccccggcctg c atg aaa gtc ttt gtt gtc tgc 1882
Met Lys Val Phe Val Val Cys
190 195
tgt gta ctg agt ata ata aaa gct gag atc agc gac tac tcc gga ctc 1930
Cys Val Leu Ser Ile Ile Lys Ala Glu Ile Ser Asp Tyr Ser Gly Leu
200 205 210
aac tgt ggt gtt tct gca tcc atc aac cag tct cta acc ttc acc ggg 1978
Asn Cys Gly Val Ser Ala Ser Ile Asn Gln Ser Leu Thr Phe Thr Gly
215 220 225
aac gag acc gag ctc cag ctc cag tgt aag ccc cac aag aag tac ctc 2026
Asn Glu Thr Glu Leu Gln Leu Gln Cys Lys Pro His Lys Lys Tyr Leu
230 235 240
acc tgg ctg tac cag ggc tcc ccg atc gcc gtt gtt aac cac tgc gac 2074
Thr Trp Leu Tyr Gln Gly Ser Pro Ile Ala Val Val Asn His Cys Asp
245 250 255
gac gac gga gtc ctg ctg aac ggc ccc gcc aac ctt act ttt tcc acc 2122
Asp Asp Gly Val Leu Leu Asn Gly Pro Ala Asn Leu Thr Phe Ser Thr
260 265 270 275
cgc aga agc aag cta ctg ctc ttc cga ccc ttc ctc ccc ggc acc tat 2170
Arg Arg Ser Lys Leu Leu Leu Phe Arg Pro Phe Leu Pro Gly Thr Tyr
280 285 290
cag tgc gtc tcg gga ccc tgc cat cac acc ttc cac ctg atc ccg aat 2218
Gln Cys Val Ser Gly Pro Cys His His Thr Phe His Leu Ile Pro Asn
295 300 305
acc acc tct tcc cca gcg ccg ctc ccc act aac aac caa act aac cac 2266
Thr Thr Ser Ser Pro Ala Pro Leu Pro Thr Asn Asn Gln Thr Asn His
310 315 320
cac cgc tac cga cgc gac ctc gtt gaa tct aat acc acc cac acc gga 2314
His Arg Tyr Arg Arg Asp Leu Val Glu Ser Asn Thr Thr His Thr Gly
325 330 335
ggt gag ctc caa ggt cgc aaa ccc tct ggg att tat tac ggc ccc tgg 2362
Gly Glu Leu Gln Gly Arg Lys Pro Ser Gly Ile Tyr Tyr Gly Pro Trp
340 345 350 355
gag gtg gtg ggg tta ata gct tta ggc tta gta gcg ggt ggg ctt ttg 2410
Glu Val Val Gly Leu Ile Ala Leu Gly Leu Val Ala Gly Gly Leu Leu
360 365 370
gct ctc tgc tac cta tac atc cct tgc tgt tct tac tta gtg gtg ctg 2458
Ala Leu Cys Tyr Leu Tyr Ile Pro Cys Cys Ser Tyr Leu Val Val Leu
375 380 385
tgt tgc tgg ttt aag aaa tgg ggc aga tca ccc tagtgagctg cggtgtgctg 2511
Cys Cys Trp Phe Lys Lys Trp Gly Arg Ser Pro
390 395
gtggcggtgg tgctttcgat tgtgggactg ggcggcgcgg ctgtagtgaa ggagaaggcc 2571
gatccctgct tgcatttcaa tcccgacaaa tgccagctga gttttcagcc cgatggcaat 2631
cggtgcgcgg tgctgatcaa gtgcggatgg gaatgcgaga acgtgagaat cgagtacaat 2691
aacaagactc ggaacaatac tctcgcgtcc gtgtggcagc ccggggaccc cgagtggtac 2751
accgtctctg tccccggtgc tgacggctcc ccgcgcaccg tgaataatac tttcattttt 2811
gcgcacatgt gcgacacggt catgtggatg agcaagcagt acgatatgtg gccccccacg 2871
aaggagaaca tcgtggtctt ctccatcgct tacagcgtgt gcacggcgct aatcaccgct 2931
atcgtgtgcc tgagcattca catgctcatc gctattcgcc ccagaaataa tgccgaaaaa 2991
gaaaaacagc cataacacgt tttttcacac acctttttca gaccatggcc tctgttaaat 3051
ttttgctttt atttgccagt ctcattaccg tcattcatgg aatgagtaat gagaaaatta 3111
ctatttacac tggcactaat cacacattga aaggtccaga aaaagccaca gaagtttcat 3171
ggtattgtta ttttaatgaa tcagatgtat ctactgaact ctgtggaaac aagaacaaaa 3231
aaaatgagag cattactctc atcaagtttc aatgtggatc tgacttaacc ctaattaaca 3291
tcactagaga ctatgtaggt atgtattatg gaactacagc aggcatttca gacatggaat 3351
tttatcaagt ttctgtgtct gaacccacca cgcctagaat gaccacaacc acaaaaacta 3411
cacctgttac cactatgcag ctcactacca atggctttct tgccatgctt caagtggctg 3471
aaaatagcac cagcattcaa cccaccccac ccagtgagga aattcccaga tccatgattg 3531
gcattattgt tgctgtagtg gtgtgcatgt tgatcatcgc cttgtgcatg gtgtactatg 3591
ccttctgcta cagaaagcac agactgaacg acaagctgga acacttacta agtgttgaat 3651
tttaattttt tagaaccatg aagatcctag gccttttaat tttttctatc attacctctg 3711
ctctatgcaa ttctgacaat gaggacgtta ctgtcgttgt cggaaccaat tatacactga 3771
aaggtccagc gaagggtatg ctttcgtggt attgctattt tggatctgac actacagaaa 3831
ctgaattatg caatcttaag aatggcaaaa ttcaaaattc taaaattaac aattatatat 3891
gcaatggtac tgatctgata ctcctcaata tcacgaaatc atatgctggc agttacacct 3951
gccctggaga tgatgctgac agtatgattt tttacaaagt aactgttgtt gatcctacta 4011
ctccacctcc gcccaccacc acaactactc acaccacaca catagaacaa accacagcag 4071
aggcagcagg agagttagcc ttgcaggttc aggaagattc ccttatggct aataccccta 4131
cacccgatca tcggtgtccg gggctgctcg tcagcggcat tgtcggtgtg ctttcgggat 4191
tagcagtcat aatcatctgc atgttcattt ttgcttgctg ctatagaagg ctttaccgac 4251
aaaaatcaga cccactgctg aacctctatg tttaattttt tccagagcca tgaaggcagt 4311
tagcgctcta gttttttgtt ctttgattgg cattgttttt agtgctgggt ttttgaaaaa 4371
tcttaccatt tatgaaggtg agaatgccac tctagtgggc atcagtggtc aaaatgtcag 4431
ctggctaaaa taccatctag atgggtggaa agacatttgc gattggaatg tcactgtgta 4491
tacatgtaat ggagttaacc tcaccattac taatgccacc caagatcaga atggtaggtt 4551
taagggccag agtttcacta gaaataatgg gtatgaatcc cataacatgt ttatctatga 4611
cgtcactgtc atcagaaacg agactgccac cacacagatg cccactacac acagttctac 4671
cactactagc atgcaaacca cacagacaac cactttttat acatcaactc agcatatcac 4731
cactacagca gcaaagccaa gtagcgcagc gcctcagcca caggctttgg ctttgaaagc 4791
tgcacaacct agtacaacta ctaggaccaa tgagcagact actgattttt tgtccactgt 4851
cgagagccac accacagcta cctcgagtgc cttctctagc accgccaatc tctcctcgct 4911
ttcctctaca ccaatcagtc ccgctactac tcctaccccc gctcttctcc ccactcccct 4971
gaagcaaact gagaacagcg gcatgcaatg gcagatcacc ctgctcattg tgatcgggtt 5031
ggtcatcctg gccgtgttgc tctactacat cttctgccgc cgcattccca acgcgcaccg 5091
caagccggtc tacaagccca tcgttatcgg gcagccggag ccgcttcagg tggaaggggg 5151
tctaaggaat cttctcttct cttttacagt atggtgattg aactatgatt cctagacaat 5211
tcttgatcac tattcttatc tgcctcctcc aagtctgtgc caccctcgct ctagtggcca 5271
acgccagtcc agactgtatt gggcccttcg cctcctacgt gctctttgcc ttcatcacct 5331
gcatctgctg ctgtagcata gtctgcctgc ttatcacctt cttccagttc attgactgga 5391
tctttgtgcg catcgcctac ctgcgccacc acccccagta ccgcgaccag caagtggcgc 5451
gactgctcag gctcctctga taagcatgcg ggctctgcta cttctcgcgc ttctgctgtt 5511
agtgctcccc cgtcccgtcg acccccggcc ccccactcag tcccccgagg aggtccgcaa 5571
atgcaaattc caagaaccct ggaaattcct caaatgctac cgccaaaaat cagacatgca 5631
tcccagctgg atcatgatca ttgggatcgt gaacattctg gcctgcaccc tcatctcctt 5691
tgtgatttac ccctgctttg actttggttg gaactcgcca gaggcgctct atctcccgcc 5751
tgaacctgac acaccaccac agcaacctca ggcacatgca ctaccaccac agcctaggcc 5811
acaatacatg cccatattag actatgaggc cgagccacag cgacccatgc tccccgctat 5871
tagttacttc aatctaaccg gcggag atg act gac cca ctg gcc aac aac aac 5924
Met Thr Asp Pro Leu Ala Asn Asn Asn
400 405
gtc aac gac ctt ctc ctg gac atg gac ggc cgc gcc tcg gag cag cga 5972
Val Asn Asp Leu Leu Leu Asp Met Asp Gly Arg Ala Ser Glu Gln Arg
410 415 420
ctc gcc caa ctc cgc atc cgc cag cag cag gag aga gcc gtc aag gag 6020
Leu Ala Gln Leu Arg Ile Arg Gln Gln Gln Glu Arg Ala Val Lys Glu
425 430 435
ctg cag gat gcg gtg gcc atc cac cag tgc aag aaa ggc atc ttc tgc 6068
Leu Gln Asp Ala Val Ala Ile His Gln Cys Lys Lys Gly Ile Phe Cys
440 445 450 455
ctg gtg aag cag gcc aag atc acc ttc gag gtg act tcc acc gac cat 6116
Leu Val Lys Gln Ala Lys Ile Thr Phe Glu Val Thr Ser Thr Asp His
460 465 470
cgc ctc tcc tac gag ctc ctg cag caa cgc cag aag ttc acc tgc ctg 6164
Arg Leu Ser Tyr Glu Leu Leu Gln Gln Arg Gln Lys Phe Thr Cys Leu
475 480 485
gtc gga gtc aac ccc atc gtc atc acc cag cag tct ggc gat acc aag 6212
Val Gly Val Asn Pro Ile Val Ile Thr Gln Gln Ser Gly Asp Thr Lys
490 495 500
ggt tgc atc cac tgc tcc tgc gac tcc ccc gag tgc gtt cac acc atg 6260
Gly Cys Ile His Cys Ser Cys Asp Ser Pro Glu Cys Val His Thr Met
505 510 515
atc aag acc ctc tgc ggc ctc cgc gac ctc ctc ccc atg aac taatcaac 6310
Ile Lys Thr Leu Cys Gly Leu Arg Asp Leu Leu Pro Met Asn
520 525 530
<210> 26
<211> 188
<212> PRT
<213> Simian adenovirus 39
<400> 26
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro Pro Glu
1 5 10 15
Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu Glu Glu Glu Met
20 25 30
Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp Ser
35 40 45
Leu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu Ala Ala Ala
50 55 60
Ala Arg Pro Ser Ser Ser Ala Glu Glu Glu Lys Ala Ser Ser Thr Asp
65 70 75 80
Thr Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser Arg
85 90 95
Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Gly Lys
100 105 110
Lys Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly His Lys Asn Ala Ile
115 120 125
Val Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile Ser Phe Thr Arg Arg
130 135 140
Tyr Leu Leu Phe His Arg Gly Val Asn Phe Pro Arg Asn Ile Leu His
145 150 155 160
Tyr Tyr Arg His Leu His Ser Pro Tyr Tyr Cys Phe Gln Glu Glu Ala
165 170 175
Glu Thr Gln Gln Gln Gln Gln Lys Thr Ser Ser Ser
180 185
<210> 27
<211> 210
<212> PRT
<213> Simian adenovirus 39
<400> 27
Met Lys Val Phe Val Val Cys Cys Val Leu Ser Ile Ile Lys Ala Glu
1 5 10 15
Ile Ser Asp Tyr Ser Gly Leu Asn Cys Gly Val Ser Ala Ser Ile Asn
20 25 30
Gln Ser Leu Thr Phe Thr Gly Asn Glu Thr Glu Leu Gln Leu Gln Cys
35 40 45
Lys Pro His Lys Lys Tyr Leu Thr Trp Leu Tyr Gln Gly Ser Pro Ile
50 55 60
Ala Val Val Asn His Cys Asp Asp Asp Gly Val Leu Leu Asn Gly Pro
65 70 75 80
Ala Asn Leu Thr Phe Ser Thr Arg Arg Ser Lys Leu Leu Leu Phe Arg
85 90 95
Pro Phe Leu Pro Gly Thr Tyr Gln Cys Val Ser Gly Pro Cys His His
100 105 110
Thr Phe His Leu Ile Pro Asn Thr Thr Ser Ser Pro Ala Pro Leu Pro
115 120 125
Thr Asn Asn Gln Thr Asn His His Arg Tyr Arg Arg Asp Leu Val Glu
130 135 140
Ser Asn Thr Thr His Thr Gly Gly Glu Leu Gln Gly Arg Lys Pro Ser
145 150 155 160
Gly Ile Tyr Tyr Gly Pro Trp Glu Val Val Gly Leu Ile Ala Leu Gly
165 170 175
Leu Val Ala Gly Gly Leu Leu Ala Leu Cys Tyr Leu Tyr Ile Pro Cys
180 185 190
Cys Ser Tyr Leu Val Val Leu Cys Cys Trp Phe Lys Lys Trp Gly Arg
195 200 205
Ser Pro
210
<210> 28
<211> 135
<212> PRT
<213> Simian adenovirus 39
<400> 28
Met Thr Asp Pro Leu Ala Asn Asn Asn Val Asn Asp Leu Leu Leu Asp
1 5 10 15
Met Asp Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln Leu Arg Ile Arg
20 25 30
Gln Gln Gln Glu Arg Ala Val Lys Glu Leu Gln Asp Ala Val Ala Ile
35 40 45
His Gln Cys Lys Lys Gly Ile Phe Cys Leu Val Lys Gln Ala Lys Ile
50 55 60
Thr Phe Glu Val Thr Ser Thr Asp His Arg Leu Ser Tyr Glu Leu Leu
65 70 75 80
Gln Gln Arg Gln Lys Phe Thr Cys Leu Val Gly Val Asn Pro Ile Val
85 90 95
Ile Thr Gln Gln Ser Gly Asp Thr Lys Gly Cys Ile His Cys Ser Cys
100 105 110
Asp Ser Pro Glu Cys Val His Thr Met Ile Lys Thr Leu Cys Gly Leu
115 120 125
Arg Asp Leu Leu Pro Met Asn
130 135
<210> 29
<211> 870
<212> DNA
<213> Simian adenovirus 39
<220>
<221> CDS
<222> (6)..(573)
<223> label=Ela
<220>
<221> CDS
<222> (658)..(863)
<223> label=Ela
<400> 29
gaaag atg agg cac ctg aga gac ctg ccc ggt aat gtt ttc ctg gct act 50
Met Arg His Leu Arg Asp Leu Pro Gly Asn Val Phe Leu Ala Thr
1 5 10 15
ggg aac gag att ctg gaa ctg gtg gtg gac gcc atg atg ggt gac gac 98
Gly Asn Glu Ile Leu Glu Leu Val Val Asp Ala Met Met Gly Asp Asp
20 25 30
cct ccg gag ccc cct acc cca ttt gag gcg cct tcg ctg tac gat ttg 146
Pro Pro Glu Pro Pro Thr Pro Phe Glu Ala Pro Ser Leu Tyr Asp Leu
35 40 45
tat gat ctg gag gtg gat gtg ccc gag aac gac ccc aac gag gag gcg 194
Tyr Asp Leu Glu Val Asp Val Pro Glu Asn Asp Pro Asn Glu Glu Ala
50 55 60
gtg aat gat ttg ttt agc gat gcc gcg ctg ctg gct gcc gag cag gct 242
Val Asn Asp Leu Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Gln Ala
65 70 75
aat atg gac tct ggc tca gac agc gat tcc tct ctc cat acc ccg aga 290
Asn Met Asp Ser Gly Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg
80 85 90 95
ccc ggc aga ggt gag aaa aag atc ccc gag ctt aaa ggg gaa gag ctc 338
Pro Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu Leu
100 105 110
gac ctg cga tgt tat gag gaa tgc ttg cct ccg agc gat gat gag gag 386
Asp Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu Glu
115 120 125
gac gag gag acg att cga gct gcg gcg aac cag gga gtg aaa gct gcg 434
Asp Glu Glu Thr Ile Arg Ala Ala Ala Asn Gln Gly Val Lys Ala Ala
130 135 140
ggc gag agc ttt agc ctg gac tgt cct act ctg ccc gga cac ggc tgt 482
Gly Glu Ser Phe Ser Leu Asp Cys Pro Thr Leu Pro Gly His Gly Cys
145 150 155
aag tct tgt gaa ttt cat cgc atg aat act gga gat aag aat gtg atg 530
Lys Ser Cys Glu Phe His Arg Met Asn Thr Gly Asp Lys Asn Val Met
160 165 170 175
tgt gcc ctg tgc tat atg aga gct tac aac cat tgt gtt tac a 573
Cys Ala Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr
180 185
gtaagtgtga ttaactttag ctgggggggc agagggtgac tgggtgctga ctggtttatt 633
tatgtatatg ttttttatgt gtag gt ccc gtc tct gac gca gat gag acc 683
Ser Pro Val Ser Asp Ala Asp Glu Thr
195
ccc act tca gag tgc att tca tca ccc cca gaa att ggc gag gaa ccg 731
Pro Thr Ser Glu Cys Ile Ser Ser Pro Pro Glu Ile Gly Glu Glu Pro
200 205 210
ccc gaa gat att att cat aga cca gtt gca gtg aga gtc acc ggg cgg 779
Pro Glu Asp Ile Ile His Arg Pro Val Ala Val Arg Val Thr Gly Arg
215 220 225 230
aga gca gct gtg gag agt ttg gat gac ttg cta cag ggt ggg gat gaa 827
Arg Ala Ala Val Glu Ser Leu Asp Asp Leu Leu Gln Gly Gly Asp Glu
235 240 245
cct ttg gac ttg tgt acc cgg aaa cgc ccc agg cac taagtgc 870
Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro Arg His
250 255
<210> 30
<211> 258
<212> PRT
<213> Simian adenovirus 39
<400> 30
Met Arg His Leu Arg Asp Leu Pro Gly Asn Val Phe Leu Ala Thr Gly
1 5 10 15
Asn Glu Ile Leu Glu Leu Val Val Asp Ala Met Met Gly Asp Asp Pro
20 25 30
Pro Glu Pro Pro Thr Pro Phe Glu Ala Pro Ser Leu Tyr Asp Leu Tyr
35 40 45
Asp Leu Glu Val Asp Val Pro Glu Asn Asp Pro Asn Glu Glu Ala Val
50 55 60
Asn Asp Leu Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Gln Ala Asn
65 70 75 80
Met Asp Ser Gly Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg Pro
85 90 95
Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu Leu Asp
100 105 110
Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu Glu Asp
115 120 125
Glu Glu Thr Ile Arg Ala Ala Ala Asn Gln Gly Val Lys Ala Ala Gly
130 135 140
Glu Ser Phe Ser Leu Asp Cys Pro Thr Leu Pro Gly His Gly Cys Lys
145 150 155 160
Ser Cys Glu Phe His Arg Met Asn Thr Gly Asp Lys Asn Val Met Cys
165 170 175
Ala Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr Ser Pro Val
180 185 190
Ser Asp Ala Asp Glu Thr Pro Thr Ser Glu Cys Ile Ser Ser Pro Pro
195 200 205
Glu Ile Gly Glu Glu Pro Pro Glu Asp Ile Ile His Arg Pro Val Ala
210 215 220
Val Arg Val Thr Gly Arg Arg Ala Ala Val Glu Ser Leu Asp Asp Leu
225 230 235 240
Leu Gln Gly Gly Asp Glu Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro
245 250 255
Arg His
<210> 31
<211> 840
<212> DNA
<213> Simian adenovirus 39
<220>
<221> CDS
<222> (7)..(337)
<223> label=33K
<220>
<221> CDS
<222> (507)..(835)
<223> label=33K
<400> 31
cccagg atg ccc cga gga agc agc aag aag ctg aaa gtg gag ctg ccg 48
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro
1 5 10
ccg gag gat ttg gag gaa gac tgg gag agc agt cag gca gag gag gag 96
Pro Glu Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu Glu Glu
15 20 25 30
gag atg gaa gac tgg gac agc act cag gca gag gag gac agc ctg caa 144
Glu Met Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln
35 40 45
gac agt ctg gag gaa gac gag gtg gag gag gag gca gag gaa gaa gca 192
Asp Ser Leu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu Ala
50 55 60
gcc gcc gcc aga ccg tcg tcc tcg gcg gag gag gag aaa gca agc agc 240
Ala Ala Ala Arg Pro Ser Ser Ser Ala Glu Glu Glu Lys Ala Ser Ser
65 70 75
acg gat acc atc tcc gct ccg ggt cgg ggt cgc ggc ggc cgg gcc cac 288
Thr Asp Thr Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His
80 85 90
agt aga tgg gac gag acc ggg cgc ttc ccg aac ccc acc acc cag acc g 337
Ser Arg Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr
95 100 105 110
gtaagaagga gcggcaggga tacaagtcct ggcgggggca caaaaacgcc atcgtctcct 397
gcttgcaagc ctgcgggggc aacatctcct tcacccggcg ctacctgctc ttccaccgcg 457
gggtgaactt cccccgcaac atcttgcatt actaccgtca cctccacag cc cct act 514
Ala Pro Thr
act gtt tcc aag aag agg cag aaa ccc agc agc agc agc aga aaa cca 562
Thr Val Ser Lys Lys Arg Gln Lys Pro Ser Ser Ser Ser Arg Lys Pro
115 120 125
gca gca gct aga aaa tcc acg gcg gca ggt gga ctg agg atc gcg gcg 610
Ala Ala Ala Arg Lys Ser Thr Ala Ala Gly Gly Leu Arg Ile Ala Ala
130 135 140 145
aac gag ccg gcg cag acc cgg gag ctg agg aac cgg atc ttt ccc acc 658
Asn Glu Pro Ala Gln Thr Arg Glu Leu Arg Asn Arg Ile Phe Pro Thr
150 155 160
ctc tat gcc atc ttc cag cag agt cgg ggg cag gag cag gaa ctg aaa 706
Leu Tyr Ala Ile Phe Gln Gln Ser Arg Gly Gln Glu Gln Glu Leu Lys
165 170 175
gtc aag aac cgt tct ctg cgc tcg ctc acc cgc agt tgt ctg tat cac 754
Val Lys Asn Arg Ser Leu Arg Ser Leu Thr Arg Ser Cys Leu Tyr His
180 185 190
aag agc gaa gac caa ctt cag cgc act ctc gag gac gcc gag gct ctc 802
Lys Ser Glu Asp Gln Leu Gln Arg Thr Leu Glu Asp Ala Glu Ala Leu
195 200 205
ttc aac aag tac tgc gcg ctc act ctt aaa gag tagcc 840
Phe Asn Lys Tyr Cys Ala Leu Thr Leu Lys Glu
210 215 220
<210> 32
<211> 220
<212> PRT
<213> Simian adenovirus 39
<400> 32
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro Pro Glu
1 5 10 15
Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu Glu Glu Glu Met
20 25 30
Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp Ser
35 40 45
Leu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu Ala Ala Ala
50 55 60
Ala Arg Pro Ser Ser Ser Ala Glu Glu Glu Lys Ala Ser Ser Thr Asp
65 70 75 80
Thr Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser Arg
85 90 95
Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Ala Pro
100 105 110
Thr Thr Val Ser Lys Lys Arg Gln Lys Pro Ser Ser Ser Ser Arg Lys
115 120 125
Pro Ala Ala Ala Arg Lys Ser Thr Ala Ala Gly Gly Leu Arg Ile Ala
130 135 140
Ala Asn Glu Pro Ala Gln Thr Arg Glu Leu Arg Asn Arg Ile Phe Pro
145 150 155 160
Thr Leu Tyr Ala Ile Phe Gln Gln Ser Arg Gly Gln Glu Gln Glu Leu
165 170 175
Lys Val Lys Asn Arg Ser Leu Arg Ser Leu Thr Arg Ser Cys Leu Tyr
180 185 190
His Lys Ser Glu Asp Gln Leu Gln Arg Thr Leu Glu Asp Ala Glu Ala
195 200 205
Leu Phe Asn Lys Tyr Cys Ala Leu Thr Leu Lys Glu
210 215 220
<210> 33
<211> 36634
<212> DNA
<213> Simian adenovirus 37
<220>
<221> repeat_region
<222> (1)..(127)
<223> label=ITR
<220>
<221> CDS
<222> (1907)..(3415)
<223> label=Elb\55K
<220>
<221> CDS
<222> (3453)..(3923)
<223> label=pIX
<220>
<221> misc_feature
<222> (3989)..(5610)
<223> complement (3989..5319, 5598..5610) label=IVa2
<220>
<221> misc_feature
<222> (5092)..(13844)
<223> complement (5092..8664, 13836..13844) label=pol
<220>
<221> misc_feature
<222> (8467)..(13844)
<223> complement (8467..10400, 13836..13844) label=pTP
<220>
<221> CDS
<222> (10827)..(12002)
<223> label=55K
<220>
<221> CDS
<222> (12029)..(13807)
<223> label=pIIIa
<220>
<221> CDS
<222> (13889)..(15514)
<223> label=penton
<220>
<221> CDS
<222> (15521)..(16099)
<223> label=pVII
<220>
<221> CDS
<222> (16144)..(17181)
<223> label=V
<220>
<221> CDS
<222> (17209)..(17439)
<223> label=pX
<220>
<221> CDS
<222> (17512)..(18234)
<223> label=pVI
<220>
<221> CDS
<222> (18328)..(21153)
<223> label=hexon
<220>
<221> CDS
<222> (21172)..(21798)
<223> label=protease
<220>
<221> misc_feature
<222> (21883)..(23418)
<223> complement label=DBP
<220>
<221> CDS
<222> (23441)..(25840)
<223> label=100K
<220>
<221> CDS
<222> (26463)..(27143)
<223> label=pVIII
<220>
<221> CDS
<222> (27147)..(27464)
<223> label=E3\12.5K
<220>
<221> CDS
<222> (28029)..(28556)
<223> label=E3\gp\19K
<220>
<221> CDS
<222> (28593)..(29276)
<223> label=E3\CR1\beta
<220>
<221> CDS
<222> (29292)..(29900)
<223> label=E3\CR1\gamma
<220>
<221> CDS
<222> (29918)..(30781)
<223> label=E3\CR1\delta
<220>
<221> CDS
<222> (30792)..(31064)
<223> label=E3\RID\alpha
<220>
<221> CDS
<222> (31073)..(31504)
<223> label=E3\RID\beta
<220>
<221> CDS
<222> (32201)..(33535)
<223> label=fiber
<220>
<221> misc_feature
<222> (33640)..(34784)
<223> complement (33640..33888, 34602..34784) label=E4\orf6/7
<220>
<221> misc_feature
<222> (33888)..(34784)
<223> complement label=E4\orf6
<220>
<221> misc_feature
<222> (34763)..(35055)
<223> complement label=E4\orf4
<220>
<221> misc_feature
<222> (35067)..(35417)
<223> complement label=E4\orf3
<220>
<221> misc_feature
<222> (35417)..(35803)
<223> complement label=E4\orf2
<220>
<221> misc_feature
<222> (35856)..(36227)
<223> complement label=E4\orf1
<220>
<221> repeat_region
<222> (36508)..(36634)
<223> complement label=ITR
<400> 33
catcatcaaa taatatacct caaacttttt gtgcgcgtta atatgcaaat gaggcgtttg 60
aatttgggga gggaggaagg tgattggctg cgacgaaggc gaccgttagg ggcggggcgg 120
gtgacgtttt gatgacgtgg ccatgaggcg gagccggttt gcaagttctc gtgggaaaag 180
tgacgtcaaa cgaggtgtgg tttgaacacg gaaatactca attttcccgc gctctctgac 240
aggaaatgag gtgtttctgg gcggatgcaa gtgaaaacgg gccattttcg cgcgaaaact 300
gaatgaggaa gtgaaaatct gagtaattcc gcgtttatgg cagggaggag tatttgccga 360
gggccgagta gactttgacc gattacgtgg gggtttcgat taccgtattt ttcacctaaa 420
tttccgcgta cggtgtcaaa gtccggtgtt tttacgtagg tgtcagctga tcgccagggt 480
atttaaacct gcgctctcta gtcaagaggc cactcttgag tgccagcgag tagagttttc 540
tcctccgcgc cgcgagtcag atctacactt tgaaagatga ggcacctgag agacctgccc 600
ggtaatgttt tcctggctac tgggaacgag attctggaac tggtggtgga cgccatgatg 660
ggtgacgacc ctcccgagcc ccctacccca tttgaggcgc cttcgctgta cgatttgtat 720
gatctggagg tggatgtgtc cgagaacgac cccaacgagg aggcggtgaa tgatttgttt 780
agcgatgccg cgctgctggc tgccgagcag gctaatacgg actctggctc agacagcgat 840
tcctctctcc ataccccgag acccggcaga ggtgagaaaa agatccccga gcttaaaggg 900
gaagagctcg acctgcgctg ctatgaggaa tgcttgcctc cgagcgatga tgaggaggac 960
gaggaggcga ttcgagctgc agcgagcgag ggagtgaaag ttgcgggcga gagctttagc 1020
ctggactgtc ctactctgcc cggacacggc tgtaagtctt gtgaatttca tcgcatgaat 1080
actggagata agaatgtgat gtgtgccctg tgctatatga gagcttacaa ccattgtgtt 1140
tacagtaagt gtgattaact ttagttggga aaggcagagg gtgactgggt gctgactggt 1200
ttatttatgt atatgttttt tatgtgtagg tcccgtctct gacgcagatg agacccccac 1260
ttcagagtgc atttcatcac ccccagaaat tggcgaggaa ccgcccgaag atattattca 1320
tagaccagtt gcagtgagag tcaccgggcg gagagcagct gtggagagtt tggatgactt 1380
gctacagggt ggggatgaac ctttggactt gtgtacccgg aaacgcccca ggcactaagt 1440
gccacacatg tgtgtttact taaggtgatg tcagtattta tagggtgtgg agtgcaataa 1500
aaatatgtgt tgactttaag tgcgtgtttt atgactcagg ggtggggact gtgggtatat 1560
aagcaggtgc agacctgtgt ggtcagttca gagcaggact tatggagatc tggacagtct 1620
tggaagactt tcaccagact agacagctgc tagagaactc atcggaggga gtctcttacc 1680
tgtggagatt ctgcttcgct gggcctctag ctaagctagt ctatagggcc aagcaggatt 1740
atagggaaca atttgaggat attttgagag agtgtcctgg tatttttgac tctctcaact 1800
tgggccatca gtctcacttt aaccagagta ttctgagagc ccttgacttt tctactcctg 1860
gcagaactac cgccgcggta gccttttttg cctttatcct tgacaa atg gag tca 1915
Met Glu Ser
1
aga aac cca ttt cag cag gga tta ccg tct gga ctg ctt agc agt agc 1963
Arg Asn Pro Phe Gln Gln Gly Leu Pro Ser Gly Leu Leu Ser Ser Ser
5 10 15
ttt gtg gag aac atg gag gtg cca gcg cct gaa tgc aat ctc cgg cta 2011
Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn Leu Arg Leu
20 25 30 35
ctt gcc agt aca gcc ggt aga cac gct gag gat cct gag tct cca gtc 2059
Leu Ala Ser Thr Ala Gly Arg His Ala Glu Asp Pro Glu Ser Pro Val
40 45 50
acc cca gga aca cca acg ccg cca gca gcc gca gca gga gca gca gca 2107
Thr Pro Gly Thr Pro Thr Pro Pro Ala Ala Ala Ala Gly Ala Ala Ala
55 60 65
aga gga gga gga ccg aga aga gaa ccc gag agc cgg tct gga ccc tcc 2155
Arg Gly Gly Gly Pro Arg Arg Glu Pro Glu Ser Arg Ser Gly Pro Ser
70 75 80
ggt ggc gga gga gga gga gta gct gac ttg ttt ccc gag ctg cgc cgg 2203
Gly Gly Gly Gly Gly Gly Val Ala Asp Leu Phe Pro Glu Leu Arg Arg
85 90 95
gtg ctg act agg tct tcc agt gga cgg gag agg ggg att aag cgg gag 2251
Val Leu Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly Ile Lys Arg Glu
100 105 110 115
agg cat gag gag act agc cac aga act gaa ctg act gtc agt ctg atg 2299
Arg His Glu Glu Thr Ser His Arg Thr Glu Leu Thr Val Ser Leu Met
120 125 130
agc cgc agg cgc cca gaa tcg gtg tgg tgg cat gag gtt cag tcg cag 2347
Ser Arg Arg Arg Pro Glu Ser Val Trp Trp His Glu Val Gln Ser Gln
135 140 145
ggg gta gat gag gtc tcg gtg atg cat gag aaa tat tcc cta gaa caa 2395
Gly Val Asp Glu Val Ser Val Met His Glu Lys Tyr Ser Leu Glu Gln
150 155 160
gtc aag act tgt tgg ttg gag ccc gag gat gat tgg gag gta gcc atc 2443
Val Lys Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp Glu Val Ala Ile
165 170 175
agg aat tat gcc aag ctg gct ctg agg cca gac aag aag tac aag att 2491
Arg Asn Tyr Ala Lys Leu Ala Leu Arg Pro Asp Lys Lys Tyr Lys Ile
180 185 190 195
acc aaa ctg att aat atc aga aat tcc tgc tac att tca ggg aat ggg 2539
Thr Lys Leu Ile Asn Ile Arg Asn Ser Cys Tyr Ile Ser Gly Asn Gly
200 205 210
gcc gag gtg gag atc agt acc cag gag agg gtg gct ttc aga tgc tgc 2587
Ala Glu Val Glu Ile Ser Thr Gln Glu Arg Val Ala Phe Arg Cys Cys
215 220 225
atg atg aat atg tac ccg ggg gtg gtg ggc atg gag gga gtc acc ttt 2635
Met Met Asn Met Tyr Pro Gly Val Val Gly Met Glu Gly Val Thr Phe
230 235 240
atg aac gcg agg ttc agg ggt gat ggg tat aat ggg gtg gtc ttt atg 2683
Met Asn Ala Arg Phe Arg Gly Asp Gly Tyr Asn Gly Val Val Phe Met
245 250 255
gcc aac acc aag ctg aca gtg cac gga tgc tcc ttc ttt ggc ttc aat 2731
Ala Asn Thr Lys Leu Thr Val His Gly Cys Ser Phe Phe Gly Phe Asn
260 265 270 275
aac atg tgc atc gag gcc tgg ggc agt gtt tca gtg agg gga tgc agc 2779
Asn Met Cys Ile Glu Ala Trp Gly Ser Val Ser Val Arg Gly Cys Ser
280 285 290
ttt tca gcc aac tgg atg ggg gtc gtg ggc aga acc aag agc aag gtg 2827
Phe Ser Ala Asn Trp Met Gly Val Val Gly Arg Thr Lys Ser Lys Val
295 300 305
tca gtg aag aaa tgc ctg ttc gag agg tgc cac ctg ggg gtg atg agc 2875
Ser Val Lys Lys Cys Leu Phe Glu Arg Cys His Leu Gly Val Met Ser
310 315 320
gag ggc gaa gcc aaa gtc aaa cac tgc gcc tct acc gag acg ggc tgc 2923
Glu Gly Glu Ala Lys Val Lys His Cys Ala Ser Thr Glu Thr Gly Cys
325 330 335
ttt gtg tgt atc aag ggc aat gcc caa gtc aag cat aac atg atc tgt 2971
Phe Val Cys Ile Lys Gly Asn Ala Gln Val Lys His Asn Met Ile Cys
340 345 350 355
ggg gcc tcg gat gag cgc ggc tac cag atg ctg acc tgc gcc ggt ggg 3019
Gly Ala Ser Asp Glu Arg Gly Tyr Gln Met Leu Thr Cys Ala Gly Gly
360 365 370
aac agc cat atg ctg gcc acc gtg cat gtg gcc tcg cac ccc cgc aag 3067
Asn Ser His Met Leu Ala Thr Val His Val Ala Ser His Pro Arg Lys
375 380 385
aca tgg ccc gag ttc gag cac aac gtc atg acc cgc tgc aat gtg cac 3115
Thr Trp Pro Glu Phe Glu His Asn Val Met Thr Arg Cys Asn Val His
390 395 400
ctg ggc tcc cgc cga ggc atg ttc atg cca tac cag tgc aac atg caa 3163
Leu Gly Ser Arg Arg Gly Met Phe Met Pro Tyr Gln Cys Asn Met Gln
405 410 415
ttt gtg aag gtg ctg ctg gag ccc gat gcc atg tcc aga gtg agc ctg 3211
Phe Val Lys Val Leu Leu Glu Pro Asp Ala Met Ser Arg Val Ser Leu
420 425 430 435
gcg ggg gtg ttt gac atg aat gtg gag ctg tgg aaa att ctg aga tat 3259
Ala Gly Val Phe Asp Met Asn Val Glu Leu Trp Lys Ile Leu Arg Tyr
440 445 450
gat gaa tcc aag acc agg tgc cgg gcc tgc gaa tgc gga ggc aag cac 3307
Asp Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys Gly Gly Lys His
455 460 465
gcc agg ctt cag ccc gtg tgt gtg gag gtg acg gag gac ctg cga ccc 3355
Ala Arg Leu Gln Pro Val Cys Val Glu Val Thr Glu Asp Leu Arg Pro
470 475 480
gat cat ttg gtg ttg tcc tgc aac ggg acg gag ttc ggc tcc agc ggg 3403
Asp His Leu Val Leu Ser Cys Asn Gly Thr Glu Phe Gly Ser Ser Gly
485 490 495
gaa gaa tct gac tagagtgagt agtgtttggg ggcgggtggg agcctgc atg agg 3458
Glu Glu Ser Asp Met Arg
500 505
ggc aga atg act aaa atc tgt gtt ttt ctg tgc agc agc atg agc gga 3506
Gly Arg Met Thr Lys Ile Cys Val Phe Leu Cys Ser Ser Met Ser Gly
510 515 520
agc gcc tcc ttt gag gga ggg gta ttc agc cct tat ctg acg ggg cgt 3554
Ser Ala Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu Thr Gly Arg
525 530 535
ctc ccc tcc tgg gcg gga gtg cgt cag aat gtg atg gga tct acg gtg 3602
Leu Pro Ser Trp Ala Gly Val Arg Gln Asn Val Met Gly Ser Thr Val
540 545 550
gac ggc cgg ccc gtg cag ccc gcg aac tct tca acc ctg acc tac gcg 3650
Asp Gly Arg Pro Val Gln Pro Ala Asn Ser Ser Thr Leu Thr Tyr Ala
555 560 565
acc ctg agc tcc tcg tcc gtg gac gca gct gcc gcc gca gct gct gct 3698
Thr Leu Ser Ser Ser Ser Val Asp Ala Ala Ala Ala Ala Ala Ala Ala
570 575 580 585
tcc gcc gcc agc gcc gtg cgc gga atg gcc ctg ggc gcc ggc tac tac 3746
Ser Ala Ala Ser Ala Val Arg Gly Met Ala Leu Gly Ala Gly Tyr Tyr
590 595 600
agc tct ctg gtg gcc aac tcg agt tcc acc aat aat ccc gcc agc ctg 3794
Ser Ser Leu Val Ala Asn Ser Ser Ser Thr Asn Asn Pro Ala Ser Leu
605 610 615
aac gag gag aag ctg ctg ctg ctg atg gcc cag ctc gag gcc ctg acc 3842
Asn Glu Glu Lys Leu Leu Leu Leu Met Ala Gln Leu Glu Ala Leu Thr
620 625 630
cag cgc ctg ggc gag ctg acc cag cag gtg gct cag ctg cag gcg gag 3890
Gln Arg Leu Gly Glu Leu Thr Gln Gln Val Ala Gln Leu Gln Ala Glu
635 640 645
acg cgg gcc gcg gtt gcc acg gtg aaa acc aaa taaaaaatga atcaataaat 3943
Thr Arg Ala Ala Val Ala Thr Val Lys Thr Lys
650 655 660
aaacggagac ggttgttgat tttaacacag agtcttgaat ctttatttga tttttcgcgc 4003
gcggtaggcc ctggaccacc ggtctcgatc attgagcacc cggtggatct tttccaggac 4063
ccggtagagg tgggcttgga tgttgaggta catgggcatg agcccgtccc gggggtggag 4123
gtagctccat tgcagggcct cgtgctcggg ggtggtgttg taaatcaccc agtcatagca 4183
ggggcgcagt gcgtggtgct gcacgatgtc cttgaggagg agactgatgg ccacgggcag 4243
ccccttggtg taggtgttga cgaacctgtt gagctgggag ggatgcatgc ggggggagat 4303
gagatgcatc ttggcctgga tcttgagatt ggcgatgttc ccgcccagat cccgccgggg 4363
gttcatgttg tgcaggacca ccagcacggt gtatccggtg cacttgggga atttgtcatg 4423
caacttggaa gggaaggcgt gaaagaattt ggagacgccc ttgtgaccgc ccaggttttc 4483
catgcactca tccatgatga tggcgatggg cccgtgggcg gcggcctggg caaagacgtt 4543
tcgggggtcg gacacatcgt agttgtggtc ctgggtgagc tcgtcatagg ccattttaat 4603
gaatttgggg cggagggtgc ccgactgggg gacgaaggtg ccttcgatcc cgggggcgta 4663
gttgccctcg cagatctgca tctcccaggc cttgagctcg gaggggggga tcatgtccac 4723
ctgcggggcg atgaaaaaaa cggtttccgg ggcgggggag atgagctgcg ccgaaagcag 4783
gttccggagc agctgggact tgccgcagcc ggtggggccg tagatgaccc cgatgaccgg 4843
ctgcaggtgg tagttgaggg agagacagct gccgtcctcg cgtaggaggg gggccacctc 4903
gttcatcatc tcgcgcacat gcatgttctc gcgcacgagt tccgccagga ggcgctcgcc 4963
ccccagcgag aggagctctt gcagcgaggc gaagtttttc agcggcttga gcccgtcggc 5023
catgggcatt ttggagaggg tctgttgcaa gagttccaga cggtcccaga gctcggtgat 5083
gtgctctacg gcatctcgat ccagcagacc tcctcgtttc gcgggttggg acgactgcgg 5143
gagtagggca ccagacgatg ggcgtccagc gcagccaggg tccggtcctt ccagggtcgc 5203
agcgtccgcg tcagcgtggt ctccgtcacg gtgaaggggt gcgcgccggg ctgggcgctt 5263
gcgagggtgc gcttcaggct catccggctg gtcgagaacc gctcccgatc ggcgccctgc 5323
gcgtcggcca ggtagcaatt gaccatgagt tcgtagttga gcgcctcggc cgcgtggcct 5383
ttggcgcgga gcttaccttt ggaagtctgc ccgcaggcgg gacagaggag ggacttgagg 5443
gcgtagagct tgggggcgag gaagacggac tcgggggcgt aggcgtccgc gccgcagtgg 5503
gcgcagacgg tctcgcactc cacaagccag gtgaggtcgg ggcggtcggg gtcaaaaacg 5563
aggtttcctc cgtgcttttt gatgcgtttc ttacctctgg tctccatgag ctcgtgtccc 5623
cgctgggtga caaagaggct gtccgtgtcc ccgtagaccg actttatggg ccggtcctcg 5683
agcggggtgc cgcggtcctc gtcgtagagg aaccccgccc actccgagac gaaggcccgg 5743
gtccaggcca gcacgaagga ggccacgtgg gaggggtagc ggtcgttgtc caccagcggg 5803
tccaccttct ccagggtatg caagcacatg tccccctcgt ccacatccag gaaggtgatt 5863
ggcttgtaag tgtaggccac gtgaccgggg gtcccggccg ggggggtata aaagggggcg 5923
ggcccctgct cgtcctcact gtcttccgga tcgctgtcca ggagcgccag ctgttggggt 5983
aggtattccc tctcgaaggc gggcatgacc tcggcactca ggttgtcagt ttctagaaac 6043
gaggaggatt tgatattgac ggtgccgttg gagacgcctt tcatgagccc ctcgtccatc 6103
tggtcagaaa agacgatctt tttgttgtcg agcttggtgg cgaaggagcc gtagagggcg 6163
ttggagagca gcttggcgat ggagcgcatg gtctggttct tttccttgtc ggcgcgctcc 6223
ttggcggcga tgttgagctg cacgtactcg cgcgccacgc acttccattc ggggaagacg 6283
gtggtgagct cgtcgggcac gattctgacc cgccagccgc ggttgtgcag ggtgatgagg 6343
tccacgctgg tggccacctc gccgcgcagg ggctcgttgg tccagcagag gcgcccgccc 6403
ttgcgcgagc agaagggggg cagcgggtcc agcatgagct cgtcgggggg gtcggcgtcc 6463
acggtgaaga tgccgggcag gagctcgggg tcgaagtagc tgatgcaggt gcccagatcg 6523
tccagcgccg cttgccagtc gcgcacggcc agcgcgcgct cgtaggggct gaggggcgtg 6583
ccccagggca tggggtgcgt gagcgcggag gcgtacatgc cgcagatgtc gtagacgtag 6643
aggggctcct cgaggacgcc gatgtaggtg gggtagcagc gccccccgcg gatgctggcg 6703
cgcacgtagt cgtacagctc gtgcgagggc gcgaggagcc ccgcgccgag gttggagcgc 6763
tgcggctttt cggcgcggta gacgatctgg cggaagatgg cgtgggagtt ggaggagatg 6823
gtgggcctct ggaagatgtt gaagtgggcg tggggcaggc cgaccgagtc cctgatgaag 6883
tgggcgtagg agtcctgcag cttggcgacg agctcggcgg tgacgaggac gtccagggcg 6943
cagtagtcga gggtctcttg gatgatgtcg tacttgagct ggcccttctg cttccacagc 7003
tcgcggttga gaaggaactc ttcgcggtcc ttccagtact cttcgagggg gaacccgtcc 7063
tgatcggcac ggtaagagcc caccatgtag aactggttga cggccttgta ggcgcagcag 7123
cccttctcca cggggagggc ataagcttgc gcggccttgc gcagggaggt gtgggtgagg 7183
gcgaaggtgt cgcgcaccat gaccttgagg aactggtgct tgaagtcgag gtcgtcgcag 7243
ccgccctgct cccagagttg gaagtccgtg cgcttcttgt aggcggggtt gggcaaagcg 7303
aaagtaacat cgttgaagag gatcttgccc gcgcggggca tgaagttgcg agtgatgcgg 7363
aaaggctggg gcacctcggc ccggttgttg atgacctggg cggcgaggac gatctcgtcg 7423
aagccgttga tgttgtgccc gacgatgtag agttccacga atcgcgggcg gcccttgacg 7483
tggggcagct tcttgagctc gtcgtaggtg agctcggcgg ggtcgctgag gccgtgctgc 7543
tcaagggccc agtcggcgac gtgggggttg gcgctgagga aggaagtcca gagatccacg 7603
gccagggcgg tttgcaagcg gtcccggtac tgacggaact gctggcccac ggccattttt 7663
tcgggggtga tgcagtagaa ggtgcggggg tcgccgtgcc agcggtccca cttgagctgg 7723
agggcgaggt cgtgggcgag ctcgacgagc ggcgggtccc cggagagttt catgaccagc 7783
atgaagggga cgagctgctt gccgaaggac cccatccagg tgtaggtttc cacatcgtag 7843
gtgaggaaga gcctttcggt gcgaggatgc gagccgatgg ggaagaactg gatctcctgc 7903
caccagttgg aggaatggct gttgatgtga tggaagtaga aatgccgacg gcgcgccgag 7963
cactcgtgct tgtgtttata caagcgtccg cagtgctcgc aacgctgcac gggatgcacg 8023
tgctgcacga gctgtacctg ggttcctttg acgaggaatt tcagtgggca gtggagcgct 8083
ggcggctgca tctggtgctg tactacgtcc tggccatcgg cgtggccatc gtctgcctcg 8143
atggtggtca tgctgacgag cccgcgcggg aggcaggtcc agacctcggc tcggacgggt 8203
cggagagcga ggacgagggc gcgcaggccg gagctgtcca gggtcctgag acgctgcgga 8263
gtcaggtcag tgggcagcgg cggcgcgcgg ttgacttgca ggagcttttc cagggcgcgc 8323
gggaggtcca gatggtactt gatctccacg gcgccgttgg tggcgacgtc cacggcttgc 8383
agggtcccgt gcccctgggg cgccaccacc gtgccccgtt tcttcttggg cgctggcgtt 8443
ggcgctgctt ccatgtcggt cagaagcggc ggcgaggacg cgcgccgggc ggcaggggcg 8503
gctcggggcc cggaggcagg ggcggcaggg gcacgtcggc gccgcgcgcg ggcaggttct 8563
ggtactgcgc ccggagaaga ctggcgtgag cgacgacgcg acggttgacg tcctggatct 8623
gacgcctctg ggtgaaggcc acgggacccg tgagtttgaa cctgaaagag agttcgacag 8683
aatcaatctc ggtatcgttg acggcggcct gccgcaggat ctcttgcacg tcgcccgagt 8743
tgtcctggta ggcgatctcg gtcatgaact gctcgatctc ctcctcctga aggtctccgc 8803
ggccggcgcg ctcgacggtg gccgcgaggt cgttggagat gcggcccatg agctgcgaga 8863
aggcgttcat gccggcttcg ttccagacgc ggctgtagac cacggctccg tcggggtcgc 8923
gcgcgcgcat gaccacctgg gcgaggttga gctcgacgtg gcgcgtgaag accgcgtagt 8983
tgcagaggcg ctggtagagg tagttgagcg tggtggcgat gtgctcggtg acgaagaagt 9043
acatgatcca gcggcggagc ggcatctcgc tgacgtcgcc cagggcttcc aaacgttcca 9103
tggcctcgta aaagtccacg gcgaagttga aaaactggga gttgcgcgcc gagacggtca 9163
actcctcctc cagaagacgg atgagctcgg cgatggtggc gcgcacctcg cgctcgaagg 9223
cccccgggag ttcctccact tcctcttctt cttcctcctc cactaacatc tcttctactt 9283
cctcctcagg cggcagtggt ggcgggggag ggggcctgcg tcgccggcgg cgcacgggca 9343
gacggtcgat gaagcgctcg atggtctcgc cgcgccggcg tcgcatggtc tcggtgacgg 9403
cgcgcccgtc ctcgcggggc cgcagcgtga agacgccgcc gcgcatctcc aggtggccgg 9463
gggggtcccc gttgggcagg gagagggcgc tgacgatgca tcttatcaat tgccccgtag 9523
ggactccgcg caaggacctg agcgtctcga gatccacggg atctgaaaac cgttgaacga 9583
aggcttcgag ccagtcgcag tcgcaaggta ggctgagcac ggtttcttct ggcgggtcat 9643
gttggttgga gggagcgggg cgggcgatgc tgctggtgat gaagttgaaa taggcggttc 9703
tgagacggcg gatggtggcg aggagcacca ggtccttggg cccggcttgc tggatgcgca 9763
gacggtcggc catgccccag gcgtggtcct gacacctggc gaggtccttg tagtagtcct 9823
gcatgagccg ctccacgggc acctcctcct cgcccgcgcg gccgtgcatg cgcgtgagcc 9883
cgaacccgcg ctgcggctgg acgagcgcca ggtcggcgac gacgcgctcg gcgaggatgg 9943
cctgctggat ctgggtgagg gtggtctgga agtcgtcgaa gtcgacgaag cggtggtagg 10003
ctccggtgtt gatggtgtag gagcagttgg ccatgacgga ccagttgacg gtctggtggc 10063
ccggacgcac gagctcgtgg tacttgaggc gcgagtaggc gcgcgtgtcg aagatgtagt 10123
cgttgcaggt gcgcaccagg tactggtagc cgatgaggaa gtgcggcggc ggctggcggt 10183
agagcggcca tcgctcggtg gcgggggcgc cgggcgcgag gtcctcgagc atgaggcggt 10243
ggtagccgta gatgtacctg gacatccagg tgatgccggc ggcggtggtg gaggcgcgcg 10303
ggaactcgcg gacgcggttc cagatgttgc gcagcggcag gaagtagttc atggtgggca 10363
cggtctggcc cgtgaggcgc gcgcagtcgt ggatgctcta tacgggcaaa aacgaaagcg 10423
gtcagcggct cgactccgtg gcctggaggc taagcgaacg ggttgggctg cgcgtgtacc 10483
ccggttcgaa tctcgaatca ggctggagcc gcagctaacg tggtactggc actcccgtct 10543
cgacccaagc ctgcaccaac cctccaggat acggaggcgg gtcgttttgc aatttttttc 10603
ggaggccgga gactagtaag cgcggaaagc ggccgaccgc gatggctcgc tgccgtagtc 10663
tggagaagaa tcgccagggt tgcgttgcgg tgtgccccgg ttcgaggccg gccggattcc 10723
gcggctaacg agggcgtggc tgccccgtcg tttccaagac ccctagccag ccgacttctc 10783
cagttacgga gcgagcccct cttttgtttt ttgtttttgc cag atg cat ccc gta 10838
Met His Pro Val
ctg cgg cag atg cgc ccc cac cac cct cca ccg caa caa cag ccc cct 10886
Leu Arg Gln Met Arg Pro His His Pro Pro Pro Gln Gln Gln Pro Pro
665 670 675 680
cca cag ccg gcg ctt ctg ccc ccg ccc cag cag caa ctt cca gcc acg 10934
Pro Gln Pro Ala Leu Leu Pro Pro Pro Gln Gln Gln Leu Pro Ala Thr
685 690 695
acc gcc gcg gcc gcc gtg agc ggg gct gga cag agt tat gac cac cag 10982
Thr Ala Ala Ala Ala Val Ser Gly Ala Gly Gln Ser Tyr Asp His Gln
700 705 710
ctg gcc ttg gaa gag ggc gag ggg ctg gcg cgc ctg ggg gcg tcg tcg 11030
Leu Ala Leu Glu Glu Gly Glu Gly Leu Ala Arg Leu Gly Ala Ser Ser
715 720 725
ccg gag cgg cac ccg cgc gtg cag atg aaa agg gac gct cgc gag gcc 11078
Pro Glu Arg His Pro Arg Val Gln Met Lys Arg Asp Ala Arg Glu Ala
730 735 740
tac gtg ccc aag cag aac ctg ttc aga gac agg agc ggc gag gag ccc 11126
Tyr Val Pro Lys Gln Asn Leu Phe Arg Asp Arg Ser Gly Glu Glu Pro
745 750 755 760
gag gag atg cgc gcg gcc cgg ttc cac gcg ggg cgg gag ctg cgg cgc 11174
Glu Glu Met Arg Ala Ala Arg Phe His Ala Gly Arg Glu Leu Arg Arg
765 770 775
ggc ctg gac cga aag agg gtg ctg agg gac gag gat ttc gag gcg gac 11222
Gly Leu Asp Arg Lys Arg Val Leu Arg Asp Glu Asp Phe Glu Ala Asp
780 785 790
gag ctg acg ggg atc agc ccc gcg cgc gcg cac gtg gcc gcg gcc aac 11270
Glu Leu Thr Gly Ile Ser Pro Ala Arg Ala His Val Ala Ala Ala Asn
795 800 805
ctg gtc acg gcg tac gag cag acc gtg aag gag gag agc aac ttc caa 11318
Leu Val Thr Ala Tyr Glu Gln Thr Val Lys Glu Glu Ser Asn Phe Gln
810 815 820
aaa tcc ttc aac aac cac gtg cgc acg ctg atc gcg cgc gag gag gtg 11366
Lys Ser Phe Asn Asn His Val Arg Thr Leu Ile Ala Arg Glu Glu Val
825 830 835 840
acc ctg ggc ctg atg cac ctg tgg gac ctg ctg gag gcc atc gtg cag 11414
Thr Leu Gly Leu Met His Leu Trp Asp Leu Leu Glu Ala Ile Val Gln
845 850 855
aac ccc acg agc aag ccg ctg acg gcg cag ctg ttc ctg gtg gtg cag 11462
Asn Pro Thr Ser Lys Pro Leu Thr Ala Gln Leu Phe Leu Val Val Gln
860 865 870
cac agt cgg gac aac gag acg ttc agg gag gcg ctg ctg aat atc acc 11510
His Ser Arg Asp Asn Glu Thr Phe Arg Glu Ala Leu Leu Asn Ile Thr
875 880 885
gag ccc gag ggc cgc tgg ctc ctg gac ctg gtg aac att ctg cag agc 11558
Glu Pro Glu Gly Arg Trp Leu Leu Asp Leu Val Asn Ile Leu Gln Ser
890 895 900
atc gtg gtg cag gag cgc ggg ctg ccg ctg tcc gag aag ctg gcg gcc 11606
Ile Val Val Gln Glu Arg Gly Leu Pro Leu Ser Glu Lys Leu Ala Ala
905 910 915 920
atc aac ttc tcg gtg ctg agt ttg ggc aag tac tac gct agg aag atc 11654
Ile Asn Phe Ser Val Leu Ser Leu Gly Lys Tyr Tyr Ala Arg Lys Ile
925 930 935
tac aag acc ccg tac gtg ccc ata gac aag gag gtg aag atc gac ggg 11702
Tyr Lys Thr Pro Tyr Val Pro Ile Asp Lys Glu Val Lys Ile Asp Gly
940 945 950
ttt tac atg cgc atg acc ctg aaa gtg ctg acc ctg agc gac gat ctg 11750
Phe Tyr Met Arg Met Thr Leu Lys Val Leu Thr Leu Ser Asp Asp Leu
955 960 965
ggg gtg tac cgc aac gac agg atg cac cgt gcg gtg agc gcc agc cgc 11798
Gly Val Tyr Arg Asn Asp Arg Met His Arg Ala Val Ser Ala Ser Arg
970 975 980
cgg cgc gag ctg agc gac cag gag ctg atg cac agc ctg cag cgg gcc 11846
Arg Arg Glu Leu Ser Asp Gln Glu Leu Met His Ser Leu Gln Arg Ala
985 990 995 1000
ctg acc ggg gcc ggg acc gag ggg gag agc tac ttt gac atg ggc 11891
Leu Thr Gly Ala Gly Thr Glu Gly Glu Ser Tyr Phe Asp Met Gly
1005 1010 1015
gcg gac ctg cgc tgg cag ccc agc cgc cgg gcc ttg gaa gct gcc 11936
Ala Asp Leu Arg Trp Gln Pro Ser Arg Arg Ala Leu Glu Ala Ala
1020 1025 1030
ggc ggt tcc ccc tac gta gaa gag gtg gac gat gag gtg gac gag 11981
Gly Gly Ser Pro Tyr Val Glu Glu Val Asp Asp Glu Val Asp Glu
1035 1040 1045
gag ggc gag tac ctg gaa gac tgatggcgcg accgtatttt tgctag atg caa 12034
Glu Gly Glu Tyr Leu Glu Asp Met Gln
1050
caa cag cca cct cct gat ccc gcg atg cgg gcg gcg ctg cag agc 12079
Gln Gln Pro Pro Pro Asp Pro Ala Met Arg Ala Ala Leu Gln Ser
1055 1060 1065
cag ccg tcc ggc att aac tcc tcg gac gat tgg acc cag gcc atg 12124
Gln Pro Ser Gly Ile Asn Ser Ser Asp Asp Trp Thr Gln Ala Met
1070 1075 1080
caa cgc atc atg gcg ctg acg acc cgc aac ccc gaa gcc ttt aga 12169
Gln Arg Ile Met Ala Leu Thr Thr Arg Asn Pro Glu Ala Phe Arg
1085 1090 1095
cag cag ccc cag gcc aac cgg ctc tcg gcc atc ctg gag gcc gtg 12214
Gln Gln Pro Gln Ala Asn Arg Leu Ser Ala Ile Leu Glu Ala Val
1100 1105 1110
gtg ccc tcg cgc tcc aac ccc acg cac gag aag gtc ctg gcc atc 12259
Val Pro Ser Arg Ser Asn Pro Thr His Glu Lys Val Leu Ala Ile
1115 1120 1125
gtg aac gcg ctg gtg gag aac aag gcc atc cgc ggc gac gag gcc 12304
Val Asn Ala Leu Val Glu Asn Lys Ala Ile Arg Gly Asp Glu Ala
1130 1135 1140
ggc ctg gtg tac aac gcg ctg ctg gag cgc gtg gcc cgc tac aac 12349
Gly Leu Val Tyr Asn Ala Leu Leu Glu Arg Val Ala Arg Tyr Asn
1145 1150 1155
agc acc aac gtg cag acc aac ctg gac cgc atg gtg acc gac gtg 12394
Ser Thr Asn Val Gln Thr Asn Leu Asp Arg Met Val Thr Asp Val
1160 1165 1170
cgc gag gcc gtg gcc cag cgc gag cgg ttc cac cgc gag tcc aac 12439
Arg Glu Ala Val Ala Gln Arg Glu Arg Phe His Arg Glu Ser Asn
1175 1180 1185
ctg gga tcc atg gtg gcg ctg aac gcc ttc ctc agc acc cag ccc 12484
Leu Gly Ser Met Val Ala Leu Asn Ala Phe Leu Ser Thr Gln Pro
1190 1195 1200
gcc aac gtg ccc cgg ggc cag gag gac tac acc aac ttc atc agc 12529
Ala Asn Val Pro Arg Gly Gln Glu Asp Tyr Thr Asn Phe Ile Ser
1205 1210 1215
gcc ctg cgc ctg atg gtg acc gag gtg ccc cag agc gag gtg tac 12574
Ala Leu Arg Leu Met Val Thr Glu Val Pro Gln Ser Glu Val Tyr
1220 1225 1230
cag tcc ggg ccg gac tac ttc ttc cag acc agt cgc cag ggc ttg 12619
Gln Ser Gly Pro Asp Tyr Phe Phe Gln Thr Ser Arg Gln Gly Leu
1235 1240 1245
cag acc gtg aac ctg agc cag gct ttc aag aac ttg cag gga ttg 12664
Gln Thr Val Asn Leu Ser Gln Ala Phe Lys Asn Leu Gln Gly Leu
1250 1255 1260
tgg ggc gtg cag gcc ccg gtc ggg gac cgc gcg acg gtg tcg agc 12709
Trp Gly Val Gln Ala Pro Val Gly Asp Arg Ala Thr Val Ser Ser
1265 1270 1275
ctg ctg acg ccg aac tcg cgc ctg ctg ctg ctg ctg gtg gcc ccc 12754
Leu Leu Thr Pro Asn Ser Arg Leu Leu Leu Leu Leu Val Ala Pro
1280 1285 1290
ttc acg gac agc ggc agc atc aac cgc aac tcg tac ctg ggc tac 12799
Phe Thr Asp Ser Gly Ser Ile Asn Arg Asn Ser Tyr Leu Gly Tyr
1295 1300 1305
ctg att aac ctg tac cgc gag gcc atc ggc cag gcg cac gtg gac 12844
Leu Ile Asn Leu Tyr Arg Glu Ala Ile Gly Gln Ala His Val Asp
1310 1315 1320
gag cag acc tac cag gag atc acc cac gtg agc cgc gcc ctg ggc 12889
Glu Gln Thr Tyr Gln Glu Ile Thr His Val Ser Arg Ala Leu Gly
1325 1330 1335
cag gac gac ccg ggc aac ctg gaa gcc acc ctg aac ttt ttg ctg 12934
Gln Asp Asp Pro Gly Asn Leu Glu Ala Thr Leu Asn Phe Leu Leu
1340 1345 1350
acc aac cgg tcg cag aag atc ccg ccc cag tac gcg ctc agc acc 12979
Thr Asn Arg Ser Gln Lys Ile Pro Pro Gln Tyr Ala Leu Ser Thr
1355 1360 1365
gag gag gag cgc atc ctg cgt tac gtg cag cag agc gtg ggc ctg 13024
Glu Glu Glu Arg Ile Leu Arg Tyr Val Gln Gln Ser Val Gly Leu
1370 1375 1380
ttc ctg atg cag gag ggg gcc acc ccc agc gcc gcg ctc gac atg 13069
Phe Leu Met Gln Glu Gly Ala Thr Pro Ser Ala Ala Leu Asp Met
1385 1390 1395
acc gcg cgc aac atg gag ccc agc atg tac gcc agc aac cgc ccg 13114
Thr Ala Arg Asn Met Glu Pro Ser Met Tyr Ala Ser Asn Arg Pro
1400 1405 1410
ttc atc aat aaa ctg atg gac tac ttg cat cgg gcg gcc gcc atg 13159
Phe Ile Asn Lys Leu Met Asp Tyr Leu His Arg Ala Ala Ala Met
1415 1420 1425
aac tct gac tat ttc acc aac gcc atc ctg aat ccc cac tgg ctc 13204
Asn Ser Asp Tyr Phe Thr Asn Ala Ile Leu Asn Pro His Trp Leu
1430 1435 1440
ccg ccg cct ggg ttc tac acg ggc gag tac gac atg ccc gac ccc 13249
Pro Pro Pro Gly Phe Tyr Thr Gly Glu Tyr Asp Met Pro Asp Pro
1445 1450 1455
aat gac ggg ttc ctg tgg gac gat gtg gac agc agc gtg ttc tcc 13294
Asn Asp Gly Phe Leu Trp Asp Asp Val Asp Ser Ser Val Phe Ser
1460 1465 1470
ccc cga ccg ggt gct aac gag cgc ccc ttg tgg aag aag gaa ggc 13339
Pro Arg Pro Gly Ala Asn Glu Arg Pro Leu Trp Lys Lys Glu Gly
1475 1480 1485
agc gac cga cgc ccg tcc tcg gcg ctg tcc ggc cgc gag ggt gct 13384
Ser Asp Arg Arg Pro Ser Ser Ala Leu Ser Gly Arg Glu Gly Ala
1490 1495 1500
gcc gcg gcg gtg ccc gag gcc gcc agt cct ttt cct agc ttg ccc 13429
Ala Ala Ala Val Pro Glu Ala Ala Ser Pro Phe Pro Ser Leu Pro
1505 1510 1515
ttc tcg ctg aac agt atc cgc agc agc gag ctg ggg agg atc acg 13474
Phe Ser Leu Asn Ser Ile Arg Ser Ser Glu Leu Gly Arg Ile Thr
1520 1525 1530
cgc ccg cgc ttg ctg ggc gag gag gag tac ttg aat gac tcg ctg 13519
Arg Pro Arg Leu Leu Gly Glu Glu Glu Tyr Leu Asn Asp Ser Leu
1535 1540 1545
ttg aga ccc gag cgg gag aag aac ttc ccc aat aac ggg ata gag 13564
Leu Arg Pro Glu Arg Glu Lys Asn Phe Pro Asn Asn Gly Ile Glu
1550 1555 1560
agc ctg gtg gac aag atg agc cgc tgg aag acg tat gcg cag gag 13609
Ser Leu Val Asp Lys Met Ser Arg Trp Lys Thr Tyr Ala Gln Glu
1565 1570 1575
cac agg gac gat ccc cgg gcg tcg cag ggg gcc acg agc cgg ggc 13654
His Arg Asp Asp Pro Arg Ala Ser Gln Gly Ala Thr Ser Arg Gly
1580 1585 1590
agc gcc gcc cgt aaa cgc cgg tgg cac gac agg cag cgg gga ctg 13699
Ser Ala Ala Arg Lys Arg Arg Trp His Asp Arg Gln Arg Gly Leu
1595 1600 1605
atg tgg gac gat gag gat tcc gcc gac gac agc agc gtg ttg gac 13744
Met Trp Asp Asp Glu Asp Ser Ala Asp Asp Ser Ser Val Leu Asp
1610 1615 1620
ttg ggt ggg agt ggt ggt ggt aac ccg ttc gct cac ctg cgc ccc 13789
Leu Gly Gly Ser Gly Gly Gly Asn Pro Phe Ala His Leu Arg Pro
1625 1630 1635
cgc atc ggg cgc atg atg taagaaaccg aaaataaatg atactcacca 13837
Arg Ile Gly Arg Met Met
1640 1645
aggccatggc gaccagcgtg cgttcgtttc ttctctgttg ttgtatctag t atg atg 13894
Met Met
agg cgt gcg tac ccg gag ggt cct cct ccc tcg tac gag agc gtg 13939
Arg Arg Ala Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser Val
1650 1655 1660
atg cag cag gcg atg gcg gcg gcg gcg gcg atg cag ccc ccg ctg 13984
Met Gln Gln Ala Met Ala Ala Ala Ala Ala Met Gln Pro Pro Leu
1665 1670 1675
gag gct cct tac gtg ccc ccg cgg tac ctg gcg cct acg gag ggg 14029
Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly
1680 1685 1690
cgg aac agc att cgt tac tcg gag ctg gca ccc ttg tac gat acc 14074
Arg Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr
1695 1700 1705
acc cgg ttg tac ctg gtg gac aac aag tcg gcg gac atc gcc tcg 14119
Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser
1710 1715 1720
ctg aac tac cag aac gac cac agc aac ttc ctg acc acc gtg gtg 14164
Leu Asn Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val
1725 1730 1735
cag aac aat gac ttc acc ccc acg gag gcc agc acc cag acc atc 14209
Gln Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gln Thr Ile
1740 1745 1750
aac ttt gac gag cgc tcg cgg tgg ggc ggc cag ctg aaa acc atc 14254
Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile
1755 1760 1765
atg cac acc aac atg ccc aac gtg aac gag ttc atg tac agc aac 14299
Met His Thr Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser Asn
1770 1775 1780
aag ttc aag gcg cgg gtc atg gtc tcc cgc aag acc ccc aac ggg 14344
Lys Phe Lys Ala Arg Val Met Val Ser Arg Lys Thr Pro Asn Gly
1785 1790 1795
gtc aag gta gat gat acg tat gat ggt agt cag gat gag cta aaa 14389
Val Lys Val Asp Asp Thr Tyr Asp Gly Ser Gln Asp Glu Leu Lys
1800 1805 1810
tac gag tgg gtg gag ttt gag ctg ccc gaa ggc aac ttc tcg gtg 14434
Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser Val
1815 1820 1825
acc atg acc atc gac ctg atg aac aac gcc atc atc gac aat tac 14479
Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile Asp Asn Tyr
1830 1835 1840
ttg gcg gtg ggg cgg cag aac ggg gtg ttg gag agc gac atc ggc 14524
Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp Ile Gly
1845 1850 1855
gtg aag ttc gac act agg aac ttc agg ctg ggc tgg gac ccc gtg 14569
Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val
1860 1865 1870
acc gag ctg gtc atg ccc ggg gtg tac acc aac gag gcc ttc cat 14614
Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His
1875 1880 1885
ccc gat att gtc ttg ctg ccc ggc tgc ggg gtg gac ttc acc gag 14659
Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu
1890 1895 1900
agc cgc ctc agc aac ctg ctg ggc att cgc aaa agg cag ccc ttc 14704
Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe
1905 1910 1915
cag gag ggc ttc cag atc atg tac gag gat ctg gag ggg ggc aac 14749
Gln Glu Gly Phe Gln Ile Met Tyr Glu Asp Leu Glu Gly Gly Asn
1920 1925 1930
atc ccc gcc ctc ctg gat gtc gag gcc tat gag gac agt aag aaa 14794
Ile Pro Ala Leu Leu Asp Val Glu Ala Tyr Glu Asp Ser Lys Lys
1935 1940 1945
aaa gca gaa gcc gag gcg act gca gcc gtg gct acc gcc gcg acc 14839
Lys Ala Glu Ala Glu Ala Thr Ala Ala Val Ala Thr Ala Ala Thr
1950 1955 1960
aat gca gat gcc aat gtg act aga ggc gat aca ttc gcc act cag 14884
Asn Ala Asp Ala Asn Val Thr Arg Gly Asp Thr Phe Ala Thr Gln
1965 1970 1975
gcg gag gaa gca gcc gcc cta gcg gtc gcc gat gat agt gaa agt 14929
Ala Glu Glu Ala Ala Ala Leu Ala Val Ala Asp Asp Ser Glu Ser
1980 1985 1990
aag ata gtc att cag ccg gtg aag aag gat agc aag aac agg agc 14974
Lys Ile Val Ile Gln Pro Val Lys Lys Asp Ser Lys Asn Arg Ser
1995 2000 2005
tac aac gtg ctg ccg gac gag gta aac acc gcc tac cgc agc tgg 15019
Tyr Asn Val Leu Pro Asp Glu Val Asn Thr Ala Tyr Arg Ser Trp
2010 2015 2020
tac ctg gcc tac aac tat ggc gac ccc gag aag ggc gtg cgc tcc 15064
Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser
2025 2030 2035
tgg acg ctg ctc acc acc tcg gac gtc acc tgc ggc gtg gag caa 15109
Trp Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln
2040 2045 2050
gtc tac tgg tcg ctg ccc gac atg atg caa gac ccg gtc acc ttc 15154
Val Tyr Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe
2055 2060 2065
cgc tcc acg cgt caa gtt agc aac tac ccg gtg gtg ggc gcc gag 15199
Arg Ser Thr Arg Gln Val Ser Asn Tyr Pro Val Val Gly Ala Glu
2070 2075 2080
ctc ctg ccc gtc tac tcc aag agc ttc ttc aac gag cag gcc gtc 15244
Leu Leu Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gln Ala Val
2085 2090 2095
tac tcg cag cag ctg cgc gcc ttc acc tcg ctc acg cac gtc ttc 15289
Tyr Ser Gln Gln Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe
2100 2105 2110
aac cgc ttc ccc gag aac cag atc ctc gtc cgc ccg ccc gcg ccc 15334
Asn Arg Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro
2115 2120 2125
acc att acc acc gtc agt gaa aac gtt cct gct ctc aca gat cac 15379
Thr Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His
2130 2135 2140
ggg acc ctg ccg ctg cgc agc agt atc cgg gga gtc cag cgc gtg 15424
Gly Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly Val Gln Arg Val
2145 2150 2155
acc gtt act gac gcc aga cgc cgc acc tgc ccc tac gtc tac aag 15469
Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys
2160 2165 2170
gcc ctg ggc ata gtc gcg ccg cgc gtc ctc tcg agc cgc acc ttc 15514
Ala Leu Gly Ile Val Ala Pro Arg Val Leu Ser Ser Arg Thr Phe
2175 2180 2185
taaaaa atg tcc att ctc atc tcg ccc agt aat aac acc ggt tgg ggc 15562
Met Ser Ile Leu Ile Ser Pro Ser Asn Asn Thr Gly Trp Gly
2190 2195 2200
ctg cgc gcg ccc agc aag atg tac gga ggc gct cgc caa cgc tcc 15607
Leu Arg Ala Pro Ser Lys Met Tyr Gly Gly Ala Arg Gln Arg Ser
2205 2210 2215
acg caa cac ccc gtg cgc gtg cgc ggg cac ttc cgc gct ccc tgg 15652
Thr Gln His Pro Val Arg Val Arg Gly His Phe Arg Ala Pro Trp
2220 2225 2230
ggc gcc ctc aag ggc cgc gtg cgg tcg cgc acc acc gtc gac gac 15697
Gly Ala Leu Lys Gly Arg Val Arg Ser Arg Thr Thr Val Asp Asp
2235 2240 2245
gtg atc gac cag gtg gtg gcc gac gcg cgc aac tac acc ccc gcc 15742
Val Ile Asp Gln Val Val Ala Asp Ala Arg Asn Tyr Thr Pro Ala
2250 2255 2260
gcc gcg ccc gtc tcc acc gtg gac gcc gtc atc gac agc gtg gtg 15787
Ala Ala Pro Val Ser Thr Val Asp Ala Val Ile Asp Ser Val Val
2265 2270 2275
gcc gac gcg cgc cgg tac gcc cgc gcc aag agc cgg cgg cgg cgc 15832
Ala Asp Ala Arg Arg Tyr Ala Arg Ala Lys Ser Arg Arg Arg Arg
2280 2285 2290
atc gcc cgg cgg cac cgg agc acc ccc gcc atg cgc gcg gcg cga 15877
Ile Ala Arg Arg His Arg Ser Thr Pro Ala Met Arg Ala Ala Arg
2295 2300 2305
gcc ttg ctg cgc agg gcc agg cgc acg gga cgc agg gcc atg ctc 15922
Ala Leu Leu Arg Arg Ala Arg Arg Thr Gly Arg Arg Ala Met Leu
2310 2315 2320
agg gcg gcc aga cgc gcg gct tca ggc gcc agc gcc ggc agg acc 15967
Arg Ala Ala Arg Arg Ala Ala Ser Gly Ala Ser Ala Gly Arg Thr
2325 2330 2335
cgg aga cgc gcg gcc acg gcg gcg gca gcg gcc atc gcc agc atg 16012
Arg Arg Arg Ala Ala Thr Ala Ala Ala Ala Ala Ile Ala Ser Met
2340 2345 2350
tcc cgc ccg cgg cga ggg aac gtg tac tgg gtg cgc gac gcc gcc 16057
Ser Arg Pro Arg Arg Gly Asn Val Tyr Trp Val Arg Asp Ala Ala
2355 2360 2365
acc ggt gtg cgc gtg ccc gtg cgc acc cgc ccc cct cgc act 16099
Thr Gly Val Arg Val Pro Val Arg Thr Arg Pro Pro Arg Thr
2370 2375 2380
tgaagatgtt cacttcgcga tgttgatgtg tcccagcggc gagg atg tcc aag cgc 16155
Met Ser Lys Arg
aaa ttc aag gaa gag atg ctc cag gtc atc gcg cct gag atc tac 16200
Lys Phe Lys Glu Glu Met Leu Gln Val Ile Ala Pro Glu Ile Tyr
2385 2390 2395
ggc ccc gcg gcg gtg gtg aag gag gaa aga aag ccc cgc aaa atc 16245
Gly Pro Ala Ala Val Val Lys Glu Glu Arg Lys Pro Arg Lys Ile
2400 2405 2410
aag cgg gtc aaa aag gac aaa aag gaa gaa gat gtg gac gat atg 16290
Lys Arg Val Lys Lys Asp Lys Lys Glu Glu Asp Val Asp Asp Met
2415 2420 2425
gtg gag ttt gtg cgc gag ttc gcc ccc cgg cgg cgc gtg cag tgg 16335
Val Glu Phe Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gln Trp
2430 2435 2440
cgc ggg cgg aag gtg cgc ccg gtg ctg aga ccc ggc acc acc gtg 16380
Arg Gly Arg Lys Val Arg Pro Val Leu Arg Pro Gly Thr Thr Val
2445 2450 2455
gtc ttc acg ccc ggc gag cgc tcc ggc acc gct tcc aag cgc tcc 16425
Val Phe Thr Pro Gly Glu Arg Ser Gly Thr Ala Ser Lys Arg Ser
2460 2465 2470
tac gac gag gtg tac ggg gat gat gat att ctg gag cag gcg gcc 16470
Tyr Asp Glu Val Tyr Gly Asp Asp Asp Ile Leu Glu Gln Ala Ala
2475 2480 2485
gag cgc ctg ggc gag ttt gct tac ggc aag cgc agc cgc ccc gcg 16515
Glu Arg Leu Gly Glu Phe Ala Tyr Gly Lys Arg Ser Arg Pro Ala
2490 2495 2500
ccc ttg aaa gag gag gcg gtg tcc atc ccg ctg gac cac ggc aac 16560
Pro Leu Lys Glu Glu Ala Val Ser Ile Pro Leu Asp His Gly Asn
2505 2510 2515
ccc acg ccg agc ctg aag ccg gtg acc ctg cag cag gtg ctg ccg 16605
Pro Thr Pro Ser Leu Lys Pro Val Thr Leu Gln Gln Val Leu Pro
2520 2525 2530
agc gcg gcg ccg cgc cgg ggc ttc aag cgc gag ggc ggc gag gat 16650
Ser Ala Ala Pro Arg Arg Gly Phe Lys Arg Glu Gly Gly Glu Asp
2535 2540 2545
ctg tac ccg acc atg cag ctg atg gtg ccc aag cgc cag aag ctg 16695
Leu Tyr Pro Thr Met Gln Leu Met Val Pro Lys Arg Gln Lys Leu
2550 2555 2560
gag gac gtg ctg gag cac atg aag gtg gac ccc gag gtg cag ccc 16740
Glu Asp Val Leu Glu His Met Lys Val Asp Pro Glu Val Gln Pro
2565 2570 2575
gag gtc aag gtg cgg ccc atc aag cag gtg gcc ccg ggc ctg ggc 16785
Glu Val Lys Val Arg Pro Ile Lys Gln Val Ala Pro Gly Leu Gly
2580 2585 2590
gtg cag acc gtg gac atc aag atc ccc acg gag ccc atg gaa acg 16830
Val Gln Thr Val Asp Ile Lys Ile Pro Thr Glu Pro Met Glu Thr
2595 2600 2605
cag acc gag ccc gtg aag ccc agc acc agc acc atg gag gtg cag 16875
Gln Thr Glu Pro Val Lys Pro Ser Thr Ser Thr Met Glu Val Gln
2610 2615 2620
acg gat ccc tgg atg ccg gcg ccg gct tcc acc acc act cgc cga 16920
Thr Asp Pro Trp Met Pro Ala Pro Ala Ser Thr Thr Thr Arg Arg
2625 2630 2635
aga cgc aag tac ggc gcg gcc agc ctg ctg atg ccc aac tac gcg 16965
Arg Arg Lys Tyr Gly Ala Ala Ser Leu Leu Met Pro Asn Tyr Ala
2640 2645 2650
ctg cat cct tcc atc atc ccc acg ccg ggc tac cgc ggc acg cgc 17010
Leu His Pro Ser Ile Ile Pro Thr Pro Gly Tyr Arg Gly Thr Arg
2655 2660 2665
ttc tac cgc ggc tac agc agc cgc cgc aag acc acc acc cgc cgc 17055
Phe Tyr Arg Gly Tyr Ser Ser Arg Arg Lys Thr Thr Thr Arg Arg
2670 2675 2680
cgc cgt cgc cgc acc cgc cgc agc acc acc gcg act tcc gcc gcc 17100
Arg Arg Arg Arg Thr Arg Arg Ser Thr Thr Ala Thr Ser Ala Ala
2685 2690 2695
gcc ttg gtg cgg aga gtg tac cgc agc ggg cgt gag cct ctg acc 17145
Ala Leu Val Arg Arg Val Tyr Arg Ser Gly Arg Glu Pro Leu Thr
2700 2705 2710
ctg ccg cgc gcg cgc tac cac ccg agc atc gcc att taactctgcc 17191
Leu Pro Arg Ala Arg Tyr His Pro Ser Ile Ala Ile
2715 2720 2725
gtcgcctcct tgcagat atg gcc ctc aca tgc cgc ctc cgc gtc ccc att 17241
Met Ala Leu Thr Cys Arg Leu Arg Val Pro Ile
2730 2735
acg ggc tac cga gga aga aag ccg cgc cgt aga agg ctg acg ggg 17286
Thr Gly Tyr Arg Gly Arg Lys Pro Arg Arg Arg Arg Leu Thr Gly
2740 2745 2750
aac ggg ctg cgt cgc cat cac cac cgg cgg cgg cgc gcc atc agc 17331
Asn Gly Leu Arg Arg His His His Arg Arg Arg Arg Ala Ile Ser
2755 2760 2765
aag cgg ttg ggg gga ggc ttc ctg ccc gcg ctg atc ccc atc atc 17376
Lys Arg Leu Gly Gly Gly Phe Leu Pro Ala Leu Ile Pro Ile Ile
2770 2775 2780
gcc gcg gcg atc ggg gcg atc ccc ggc ata gct tcc gtg gcg gtg 17421
Ala Ala Ala Ile Gly Ala Ile Pro Gly Ile Ala Ser Val Ala Val
2785 2790 2795
cag gcc tct cag cgc cac tgagacacag cttggaaaat ttgtaataaa 17469
Gln Ala Ser Gln Arg His
2800
aaaatggact gacgctcctg gtcctgtgat gtgtgttttt ag atg gaa gac atc 17523
Met Glu Asp Ile
2805
aat ttt tcg tcc ctg gca ccg cga cac ggc acg cgg ccg ttt atg 17568
Asn Phe Ser Ser Leu Ala Pro Arg His Gly Thr Arg Pro Phe Met
2810 2815 2820
ggc acc tgg agc gac atc ggc aac agc caa ctg aac ggg ggc gcc 17613
Gly Thr Trp Ser Asp Ile Gly Asn Ser Gln Leu Asn Gly Gly Ala
2825 2830 2835
ttc aat tgg agc agt ctc tgg agc ggg ctt aag aat ttc ggg tcc 17658
Phe Asn Trp Ser Ser Leu Trp Ser Gly Leu Lys Asn Phe Gly Ser
2840 2845 2850
acg ctc aaa acc tat ggc aac aag gcg tgg aac agc agc aca ggg 17703
Thr Leu Lys Thr Tyr Gly Asn Lys Ala Trp Asn Ser Ser Thr Gly
2855 2860 2865
cag gcg ctg agg gaa aag ctg aaa gag cag aac ttc cag cag aag 17748
Gln Ala Leu Arg Glu Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys
2870 2875 2880
gtg gtc gat ggc ctg gcc tcg ggc atc aac ggg gtg gtg gac ctg 17793
Val Val Asp Gly Leu Ala Ser Gly Ile Asn Gly Val Val Asp Leu
2885 2890 2895
gcc aac cag gcc gtg cag aaa cag atc aac agc cgc ctg gac gcg 17838
Ala Asn Gln Ala Val Gln Lys Gln Ile Asn Ser Arg Leu Asp Ala
2900 2905 2910
gtc ccg ccc gcg ggg tcc gtg gag atg ccc cag gtg gag gag gag 17883
Val Pro Pro Ala Gly Ser Val Glu Met Pro Gln Val Glu Glu Glu
2915 2920 2925
ctg cct ccc ctg gac aag cgc ggc gac aag cga ccg cgt ccc gac 17928
Leu Pro Pro Leu Asp Lys Arg Gly Asp Lys Arg Pro Arg Pro Asp
2930 2935 2940
gcg gag gag acg ctg ctg acg cac acg gac gag ccg ccc ccg tac 17973
Ala Glu Glu Thr Leu Leu Thr His Thr Asp Glu Pro Pro Pro Tyr
2945 2950 2955
gag gag gcg gtg aaa ctg ggt ctg ccc acc acg cgg ccc gtg gcg 18018
Glu Glu Ala Val Lys Leu Gly Leu Pro Thr Thr Arg Pro Val Ala
2960 2965 2970
cct ctg gcc acc ggg gtg ctg aaa ccc agc agc agc agc agc cag 18063
Pro Leu Ala Thr Gly Val Leu Lys Pro Ser Ser Ser Ser Ser Gln
2975 2980 2985
ccc gcg acc ctg gac ttg cct cca cct cgc ccc tcc aca gtg gct 18108
Pro Ala Thr Leu Asp Leu Pro Pro Pro Arg Pro Ser Thr Val Ala
2990 2995 3000
aag ccc ctg ccg ccg gtg gcc gtc gcg tcg cgc gcc ctc cga ggc 18153
Lys Pro Leu Pro Pro Val Ala Val Ala Ser Arg Ala Leu Arg Gly
3005 3010 3015
cgc ccc cag gcg aac tgg cag agc act ctg aac agc atc gtg ggt 18198
Arg Pro Gln Ala Asn Trp Gln Ser Thr Leu Asn Ser Ile Val Gly
3020 3025 3030
ctg gga gtg cag agt gtg aag cgc cgc cgc tgc tat taaaagacac 18244
Leu Gly Val Gln Ser Val Lys Arg Arg Arg Cys Tyr
3035 3040
tgtagcgctt aacttgcttg tctgtgtgta tgtatgtccg ccgaccagaa ggaggaggaa 18304
gaggcgcgtc gccgagttgc aag atg gcc acc cca tcg atg ctg ccc cag 18354
Met Ala Thr Pro Ser Met Leu Pro Gln
3045 3050
tgg gcg tac atg cac atc gcc gga cag gac gct tcg gag tac ctg 18399
Trp Ala Tyr Met His Ile Ala Gly Gln Asp Ala Ser Glu Tyr Leu
3055 3060 3065
agt ccg ggt ctg gtg cag ttc gcc cgc gcc aca gac acc tac ttc 18444
Ser Pro Gly Leu Val Gln Phe Ala Arg Ala Thr Asp Thr Tyr Phe
3070 3075 3080
agt ctg ggg aac aag ttt agg aac ccc acg gtg gcg ccc acg cac 18489
Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His
3085 3090 3095
gat gtg acc acc gac cgc agc cag cgg ctg acg ctg cgc ttc gtg 18534
Asp Val Thr Thr Asp Arg Ser Gln Arg Leu Thr Leu Arg Phe Val
3100 3105 3110
ccc gtg gac cgc gag gac aac acc tac tcg tac aaa gtg cgc tac 18579
Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg Tyr
3115 3120 3125
acg ctg gcc gtg ggc gac aac cgc gtg ctg gac atg gcc agc acc 18624
Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr
3130 3135 3140
tac ttt gac atc cgc ggc gtg ctg gat cgg ggc ccc agc ttc aaa 18669
Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser Phe Lys
3145 3150 3155
ccc tac tcc ggc acc gcc tac aac agc ctg gct ccc aag gga gct 18714
Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly Ala
3160 3165 3170
ccc aat tcc agc cag tgg gag caa gca aaa aca ggc aat ggg gga 18759
Pro Asn Ser Ser Gln Trp Glu Gln Ala Lys Thr Gly Asn Gly Gly
3175 3180 3185
act atg gaa aca cac aca tat ggt gtg gcc cca atg ggc gga gag 18804
Thr Met Glu Thr His Thr Tyr Gly Val Ala Pro Met Gly Gly Glu
3190 3195 3200
aat att aca aaa gat ggt ctt caa att gga act gac gtt aca gcg 18849
Asn Ile Thr Lys Asp Gly Leu Gln Ile Gly Thr Asp Val Thr Ala
3205 3210 3215
aat cag aat aaa cca att tat gcc gac aaa aca ttt caa cca gaa 18894
Asn Gln Asn Lys Pro Ile Tyr Ala Asp Lys Thr Phe Gln Pro Glu
3220 3225 3230
ccg caa gta gga gaa gaa aat tgg caa gaa act gaa aac ttt tat 18939
Pro Gln Val Gly Glu Glu Asn Trp Gln Glu Thr Glu Asn Phe Tyr
3235 3240 3245
ggc ggt aga gct ctt aaa aaa gac aca aac atg aaa cct tgc tat 18984
Gly Gly Arg Ala Leu Lys Lys Asp Thr Asn Met Lys Pro Cys Tyr
3250 3255 3260
ggc tcc tat gct aga ccc acc aat gaa aaa gga ggt caa gct aaa 19029
Gly Ser Tyr Ala Arg Pro Thr Asn Glu Lys Gly Gly Gln Ala Lys
3265 3270 3275
ctt aaa gtt gga gat gat gga gtt cca acc aaa gaa ttc gac ata 19074
Leu Lys Val Gly Asp Asp Gly Val Pro Thr Lys Glu Phe Asp Ile
3280 3285 3290
gac ctg gct ttc ttt gat act ccc ggt ggc acc gtg aac ggt caa 19119
Asp Leu Ala Phe Phe Asp Thr Pro Gly Gly Thr Val Asn Gly Gln
3295 3300 3305
gac gag tat aaa gca gac att gtc atg tat acc gaa aac acg tat 19164
Asp Glu Tyr Lys Ala Asp Ile Val Met Tyr Thr Glu Asn Thr Tyr
3310 3315 3320
ctg gaa act cca gac acg cat gtg gta tac aaa cca ggc aag gat 19209
Leu Glu Thr Pro Asp Thr His Val Val Tyr Lys Pro Gly Lys Asp
3325 3330 3335
gat gca agt tct gaa att aac ctg gtt cag cag tct atg ccc aac 19254
Asp Ala Ser Ser Glu Ile Asn Leu Val Gln Gln Ser Met Pro Asn
3340 3345 3350
aga ccc aac tac att ggg ttc agg gac aac ttt atc ggt ctt atg 19299
Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met
3355 3360 3365
tac tac aac agt act ggc aat atg ggt gtg ctt gct ggt cag gcc 19344
Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala
3370 3375 3380
tcc cag ctg aat gct gtg gtt gac ttg caa gac aga aac aca gag 19389
Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu
3385 3390 3395
ctg tcc tac cag ctc ttg ctt gac tct ttg ggt gac aga acc agg 19434
Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg
3400 3405 3410
tat ttc agt atg tgg aat cag gcg gtg gac agt tat gat cct gat 19479
Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp
3415 3420 3425
gtg cgc att att gaa aac cat ggt gtg gaa gat gaa ctt ccc aac 19524
Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn
3430 3435 3440
tat tgc ttc ccc ctg gat ggg tct ggc act aac gcc gct tac caa 19569
Tyr Cys Phe Pro Leu Asp Gly Ser Gly Thr Asn Ala Ala Tyr Gln
3445 3450 3455
ggt gtg aaa gta aaa aat ggt caa gat ggt gat gtt gag agc gaa 19614
Gly Val Lys Val Lys Asn Gly Gln Asp Gly Asp Val Glu Ser Glu
3460 3465 3470
tgg gaa aaa gat gat act gtc gca gct caa aat caa tta tgt aaa 19659
Trp Glu Lys Asp Asp Thr Val Ala Ala Gln Asn Gln Leu Cys Lys
3475 3480 3485
ggt aac att ttt gcc atg gag atc aat ctc cag gct aac ctg tgg 19704
Gly Asn Ile Phe Ala Met Glu Ile Asn Leu Gln Ala Asn Leu Trp
3490 3495 3500
aga agt ttc ctc tac tcg aac gtg gcc ctg tac ctg ccc gac tcc 19749
Arg Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro Asp Ser
3505 3510 3515
tac aag tac acg ccg acc aac gtc acg ctg ccg acc aac acc aac 19794
Tyr Lys Tyr Thr Pro Thr Asn Val Thr Leu Pro Thr Asn Thr Asn
3520 3525 3530
acc tac gat tac atg aac ggg aga gtg aca cct ccc tcg ctg gta 19839
Thr Tyr Asp Tyr Met Asn Gly Arg Val Thr Pro Pro Ser Leu Val
3535 3540 3545
gac gcc tac atc aac atc ggg gcg cgc tgg tcg ctg gac ccc atg 19884
Asp Ala Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met
3550 3555 3560
gac aac gtc aat ccc ttc aac cac cat cgc aac gcg ggg ctg cgc 19929
Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg
3565 3570 3575
tac cgc tcc atg ctc ctg ggc aac ggg cgc tac gtg ccc ttc cac 19974
Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His
3580 3585 3590
atc cag gtg ccc cag aaa ttt ttc gcc att aag agc ctc ctg ctc 20019
Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu
3595 3600 3605
ctg ccc ggg tcc tac acc tac gag tgg aac ttc cgc aag gac gtc 20064
Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val
3610 3615 3620
aac atg atc ctg cag agc tcc ctc ggc aac gac ctg cgc acg gac 20109
Asn Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp
3625 3630 3635
ggg gcc tcc atc tcc ttc acc agc atc aac ctc tac gca acc ttc 20154
Gly Ala Ser Ile Ser Phe Thr Ser Ile Asn Leu Tyr Ala Thr Phe
3640 3645 3650
ttc ccc atg gcg cac aac acg gct tcc acg ctc gag gcc atg ctg 20199
Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu
3655 3660 3665
cgc aac gac acc aac gac cag tcc ttc aac gac tac ctc tcg gcg 20244
Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala
3670 3675 3680
gcc aac atg ctc tac ccc atc ccg gcc aac gcc acc aac gtg ccc 20289
Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro
3685 3690 3695
atc tcc atc ccc tcg cgc aac tgg gcc gcc ttc cgc ggc tgg tcc 20334
Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser
3700 3705 3710
ttc acg cgc ctc aag acc aag gag acg ccc tcg ctg ggc tcc ggg 20379
Phe Thr Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu Gly Ser Gly
3715 3720 3725
ttc gac ccc tac ttc gtc tac tcg ggc tcc atc ccc tac ctc gac 20424
Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp
3730 3735 3740
ggc acc ttc tac ctc aac cac acc ttc aag aag gtc tcc atc acc 20469
Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Thr
3745 3750 3755
ttc gac tcc tcc gtc agc tgg ccc ggc aac gac cgg ctc ctg acg 20514
Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr
3760 3765 3770
ccc aac gag ttc gaa atc aag cgc acc gtc gac ggc gag ggc tac 20559
Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr
3775 3780 3785
aac gtg gcc cag tgc aac atg acc aag gac tgg ttc ctg gtc cag 20604
Asn Val Ala Gln Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gln
3790 3795 3800
atg ctg gcc cac tac aac atc ggc tac cag ggc ttc tac gtg ccc 20649
Met Leu Ala His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro
3805 3810 3815
gag ggc tac aag gac cgc atg tac tcc ttc ttc cgc aac ttc cag 20694
Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln
3820 3825 3830
ccc atg agc cgc cag gtg gtg gac gag gtc aac tac aag gac tac 20739
Pro Met Ser Arg Gln Val Val Asp Glu Val Asn Tyr Lys Asp Tyr
3835 3840 3845
cag gcc gtc acc ctg gcc tac cag cac aac aac tcg ggc ttc gtc 20784
Gln Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser Gly Phe Val
3850 3855 3860
ggc tac ctc gcg ccc acc atg cgc cag ggc cag ccc tac ccc gcc 20829
Gly Tyr Leu Ala Pro Thr Met Arg Gln Gly Gln Pro Tyr Pro Ala
3865 3870 3875
aac tac ccc tac ccg ctc atc ggc aag agc gcc gtc acc agc gtc 20874
Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Thr Ser Val
3880 3885 3890
acc cag aaa aag ttc ctc tgc gac agg gtc atg tgg cgc atc ccc 20919
Thr Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro
3895 3900 3905
ttc tcc agc aac ttc atg tcc atg ggc gcg ctc acc gac ctc ggc 20964
Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly
3910 3915 3920
cag aac atg ctc tat gcc aac tcc gcc cac gcg cta gac atg aat 21009
Gln Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn
3925 3930 3935
ttc gaa gtc gac ccc atg gat gag tcc acc ctt ctc tat gtt gtc 21054
Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val
3940 3945 3950
ttc gaa gtc ttc gac gtc gtc cga gtg cac cag ccc cac cgc ggc 21099
Phe Glu Val Phe Asp Val Val Arg Val His Gln Pro His Arg Gly
3955 3960 3965
gtc atc gag gcc gtc tac ctg cgc acc ccc ttc tcg gcc ggt aac 21144
Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn
3970 3975 3980
gcc acc acc taagctcttg cttcttgc atg atg gct gag ccc acg ggc tcc 21195
Ala Thr Thr Met Met Ala Glu Pro Thr Gly Ser
3985 3990
ggc gag cag gag ctc agg gcc atc atc cgc gac ctg ggc tgc ggg 21240
Gly Glu Gln Glu Leu Arg Ala Ile Ile Arg Asp Leu Gly Cys Gly
3995 4000 4005
ccc tac ttc ctg ggc acc ttc gat aag cgc ttc ccg gga ttc atg 21285
Pro Tyr Phe Leu Gly Thr Phe Asp Lys Arg Phe Pro Gly Phe Met
4010 4015 4020
gcc ccg cac aag ctg gcc tgc gcc atc gtc aac acg gcc ggc cgc 21330
Ala Pro His Lys Leu Ala Cys Ala Ile Val Asn Thr Ala Gly Arg
4025 4030 4035
gag acc ggg ggc gag cac tgg ctg gcc ttc gcc tgg aac ccg cgc 21375
Glu Thr Gly Gly Glu His Trp Leu Ala Phe Ala Trp Asn Pro Arg
4040 4045 4050
tcg aac acc tgc tac ctc ttc gac ccc ttc ggg ttc tcg gac gag 21420
Ser Asn Thr Cys Tyr Leu Phe Asp Pro Phe Gly Phe Ser Asp Glu
4055 4060 4065
cgc ctc aag cag atc tac cag ttc gag tac gag ggc ctg ctg cgc 21465
Arg Leu Lys Gln Ile Tyr Gln Phe Glu Tyr Glu Gly Leu Leu Arg
4070 4075 4080
cgt agc gcc ctg gcc acc gag gac cgc tgc gtc acc ctg gaa aag 21510
Arg Ser Ala Leu Ala Thr Glu Asp Arg Cys Val Thr Leu Glu Lys
4085 4090 4095
tcc acc cag acc gtg cag ggt ccg cgc tcg gcc gcc tgc ggg ctc 21555
Ser Thr Gln Thr Val Gln Gly Pro Arg Ser Ala Ala Cys Gly Leu
4100 4105 4110
ttc tgc tgc atg ttc ctg cac gcc ttc gtg cac tgg ccc gac cgc 21600
Phe Cys Cys Met Phe Leu His Ala Phe Val His Trp Pro Asp Arg
4115 4120 4125
ccc atg gac aag aac ccc acc atg aac tta ctg acg ggg gtg ccc 21645
Pro Met Asp Lys Asn Pro Thr Met Asn Leu Leu Thr Gly Val Pro
4130 4135 4140
aac ggc atg ctc cag tcg ccc cag gtg gaa ccc acc ctg cgc cgc 21690
Asn Gly Met Leu Gln Ser Pro Gln Val Glu Pro Thr Leu Arg Arg
4145 4150 4155
aac cag gag gcg ctc tac cgc ttc ctc aac gcc cac tcc gcc tac 21735
Asn Gln Glu Ala Leu Tyr Arg Phe Leu Asn Ala His Ser Ala Tyr
4160 4165 4170
ttt cgc tcc cac cgc gcg cgc atc gag aag gcc acc gcc ttc gac 21780
Phe Arg Ser His Arg Ala Arg Ile Glu Lys Ala Thr Ala Phe Asp
4175 4180 4185
cgc atg aat caa gac atg taaaccgtgt gtgtatgtga atgctttatt 21828
Arg Met Asn Gln Asp Met
4190 4195
cataataaac agcacatgtt tatgccacct tctctgaggc tctgacttta tttagaaatc 21888
gaaggggttc tgccggctct cggcgtgccc cgcgggcagg gatacgttgc ggaactggta 21948
cttgggcagc cacttgaact cggggatcag cagcttcggc acggggaggt cggggaacga 22008
gtcgctccac agcttgcgcg tgagttgcag ggcgcccagc aggtcgggcg cggagatctt 22068
gaaatcacag ttgggacccg cgttctgcgc gcgagagttg cggtacacgg ggttgcagca 22128
ctggaacacc atcagggccg ggtgcttcac gctcgccagc accgtcgcgt cggtgatgcc 22188
ctccacgtcc agatcctcgg cgttggccat cccgaagggg gtcatcttgc aggtctgccg 22248
ccccatgctg ggcacgcagc cgggcttgtg gttgcaatcg cagtgcaggg ggatcagcat 22308
catctgggcc tgttcggagc tcatgcccgg gtacatggcc ttcatgaaag cctccagctg 22368
gcggaaggcc tgctgcgcct tgccgccctc ggtgaagaag accccgcagg acttgctaga 22428
gaactggttg gtagcgcagc ccgcgtcgtg cacgcagcag cgcgcgtcgt tgttggccag 22488
ctgcaccacg ctgcgccccc agcggttctg ggtgatcttg gcccggtcgg ggttctcctt 22548
cagcgcgcgc tgcccgttct cgctcgccac atccatctcg atcgtgtgct ccttctggat 22608
catcacggtc ccgtgcaggc accgcagctt gccctcggcc tcggtgcagc cgtgcagcca 22668
cagcgcgcag ccggtgctct cccagttctt gtgggcgatc tgggagtgcg agtgcacgaa 22728
gccctgcagg aagcggccca tcatcgtggt cagggtcttg ttgctggtga aggtcagcgg 22788
gatgccgcgg tgctcctcgt tcacatacat gtggcagatg cggcggtaca cctcgccctg 22848
ctcgggcatc agctggaagg cggacttcag gtcgctctcc acgcggtacc ggtccatcag 22908
cagcgtcatg acttccatgc ccttctccca ggccgaaacg atcggcaggc tcagggggtt 22968
cttcaccgcc attgtcatct tagtcgccgc cgccgaggtc agggggtcgt tctcgtccag 23028
ggtctcaaac actcgcttgc cgtccttctc ggtgatgcgc acggggggga aggcgaagcc 23088
cacggccgcc agctcctcct cggcctgcct ttcgtcctcg ctgtcctggc tgatgtcttg 23148
caaaggcaca tgcttggtct tgcggggttt ctttttgggc ggcagaggcg gcggcgatgt 23208
gctgggcgag cgcgagttct cgctcaccac gactatttct tctccttggc cgtcgtccga 23268
gaccacgcgg cggtaggcat gcctcttctg gggcagaggc ggaggcgacg ggctctcgcg 23328
gttcggcggg cggctggcag agccccttcc gcgttcgggg gtgcgctcct ggcggcgctg 23388
ctctgactga cttcctccgc ggccggccat tgtgttctcc tagggagcaa gc atg gag 23446
Met Glu
act cag cca tcg tcg cca aca tcg cca tct gcc ccc gcc gcc acc 23491
Thr Gln Pro Ser Ser Pro Thr Ser Pro Ser Ala Pro Ala Ala Thr
4200 4205 4210
gcc gac gag aac cag cag cag cag aat gaa agc tta acc gcc ccg 23536
Ala Asp Glu Asn Gln Gln Gln Gln Asn Glu Ser Leu Thr Ala Pro
4215 4220 4225
ccg ccc agc ccc acc tcc gac gcc gcg gcc cca gac atg caa gag 23581
Pro Pro Ser Pro Thr Ser Asp Ala Ala Ala Pro Asp Met Gln Glu
4230 4235 4240
atg gag gaa tcc atc gag att gac ctg ggc tac gtg acg ccc gcg 23626
Met Glu Glu Ser Ile Glu Ile Asp Leu Gly Tyr Val Thr Pro Ala
4245 4250 4255
gag cac gag gag gag ctg gca gcg cgc ttt tca gcc ccg gaa gaa 23671
Glu His Glu Glu Glu Leu Ala Ala Arg Phe Ser Ala Pro Glu Glu
4260 4265 4270
aac cac caa gag cag cca gag cag gaa gca gag agc gag cag aac 23716
Asn His Gln Glu Gln Pro Glu Gln Glu Ala Glu Ser Glu Gln Asn
4275 4280 4285
cag gct ggg ctc gag cat ggc gac tac ctg agc ggg gca gag gac 23761
Gln Ala Gly Leu Glu His Gly Asp Tyr Leu Ser Gly Ala Glu Asp
4290 4295 4300
gtg ctc atc aag cat ctg gcc cgc caa tgc atc atc gtc aag gac 23806
Val Leu Ile Lys His Leu Ala Arg Gln Cys Ile Ile Val Lys Asp
4305 4310 4315
gcg ctg ctc gac cgc gcc gag gtg ccc ctc agc gtg gcg gag ctc 23851
Ala Leu Leu Asp Arg Ala Glu Val Pro Leu Ser Val Ala Glu Leu
4320 4325 4330
agc cgc gcc tac gag cgc aac ctc ttc tcg cca cgc gtg ccc ccc 23896
Ser Arg Ala Tyr Glu Arg Asn Leu Phe Ser Pro Arg Val Pro Pro
4335 4340 4345
aag cgc cag ccc aac ggc acc tgt gag ccc aac ccg cgc ctc aac 23941
Lys Arg Gln Pro Asn Gly Thr Cys Glu Pro Asn Pro Arg Leu Asn
4350 4355 4360
ttc tac ccg gtc ttc gcg gtg ccc gag gcc ctg gcc acc tac cac 23986
Phe Tyr Pro Val Phe Ala Val Pro Glu Ala Leu Ala Thr Tyr His
4365 4370 4375
ctc ttt ttc aag aac caa agg atc ccc gtc tcc tgc cgc gcc aac 24031
Leu Phe Phe Lys Asn Gln Arg Ile Pro Val Ser Cys Arg Ala Asn
4380 4385 4390
cgc acc cgc gcc gac gcc ctg ctc aac ctg ggc ccc ggc gcc cgc 24076
Arg Thr Arg Ala Asp Ala Leu Leu Asn Leu Gly Pro Gly Ala Arg
4395 4400 4405
cta cct gat atc acc tcc ttg gaa gag gtt ccc aag atc ttt gag 24121
Leu Pro Asp Ile Thr Ser Leu Glu Glu Val Pro Lys Ile Phe Glu
4410 4415 4420
ggt ctg ggc agc gac gag act cgg gcc gcg aac gct ctg caa gga 24166
Gly Leu Gly Ser Asp Glu Thr Arg Ala Ala Asn Ala Leu Gln Gly
4425 4430 4435
agc gga gag gag cat gag cac cac agc gcc ctg gtg gag ttg gaa 24211
Ser Gly Glu Glu His Glu His His Ser Ala Leu Val Glu Leu Glu
4440 4445 4450
ggc gac aac gcg cgc ctg gcg gtc ctc aag cgc acg gtc gag ctg 24256
Gly Asp Asn Ala Arg Leu Ala Val Leu Lys Arg Thr Val Glu Leu
4455 4460 4465
acc cac ttc gcc tac ccg gcg ctc aac ctg ccc ccc aag gtc atg 24301
Thr His Phe Ala Tyr Pro Ala Leu Asn Leu Pro Pro Lys Val Met
4470 4475 4480
agc gcc gtc atg gac cag gtg ctc atc aag cgc gcc tcg ccc ctc 24346
Ser Ala Val Met Asp Gln Val Leu Ile Lys Arg Ala Ser Pro Leu
4485 4490 4495
tcg gag gag gag atg cag gac ccc gag agc tcg gac gag ggc aag 24391
Ser Glu Glu Glu Met Gln Asp Pro Glu Ser Ser Asp Glu Gly Lys
4500 4505 4510
ccc gtg gtc agc gac gag cag ctg gcg cgc tgg ctg gga gcg agt 24436
Pro Val Val Ser Asp Glu Gln Leu Ala Arg Trp Leu Gly Ala Ser
4515 4520 4525
agc acc ccc cag agc ctg gaa gag cgg cgc aag ctc atg atg gcc 24481
Ser Thr Pro Gln Ser Leu Glu Glu Arg Arg Lys Leu Met Met Ala
4530 4535 4540
gtg gtc ctg gtg acc gtg gag ctg gag tgt ctg cgc cgc ttc ttt 24526
Val Val Leu Val Thr Val Glu Leu Glu Cys Leu Arg Arg Phe Phe
4545 4550 4555
gcc gac gcg gag acc ctg cgc aag gtc gag gag aac ctg cac tac 24571
Ala Asp Ala Glu Thr Leu Arg Lys Val Glu Glu Asn Leu His Tyr
4560 4565 4570
ctc ttc agg cac ggg ttc gtg cgc cag gcc tgc aag atc tcc aac 24616
Leu Phe Arg His Gly Phe Val Arg Gln Ala Cys Lys Ile Ser Asn
4575 4580 4585
gtg gag ctg acc aac ctg gtc tcc tac atg ggc atc ctg cac gag 24661
Val Glu Leu Thr Asn Leu Val Ser Tyr Met Gly Ile Leu His Glu
4590 4595 4600
aac cgc ctg ggg cag aac gtg ctg cac acc acc ctg cgc ggg gag 24706
Asn Arg Leu Gly Gln Asn Val Leu His Thr Thr Leu Arg Gly Glu
4605 4610 4615
gcc cgc cgc gac tac atc cgc gac tgc gtc tac ctg tac ctc tgc 24751
Ala Arg Arg Asp Tyr Ile Arg Asp Cys Val Tyr Leu Tyr Leu Cys
4620 4625 4630
cac acc tgg cag acg ggc atg ggc gtg tgg cag cag tgc ctg gag 24796
His Thr Trp Gln Thr Gly Met Gly Val Trp Gln Gln Cys Leu Glu
4635 4640 4645
gag cag aac ctg aaa gag ctc tgc aag ctc ctg cag aag aac ctg 24841
Glu Gln Asn Leu Lys Glu Leu Cys Lys Leu Leu Gln Lys Asn Leu
4650 4655 4660
aag gcc ctg tgg acc ggg ttc gac gag cgc acc acc gcc tcg gac 24886
Lys Ala Leu Trp Thr Gly Phe Asp Glu Arg Thr Thr Ala Ser Asp
4665 4670 4675
ctg gcc gac ctc atc ttc ccc gag cgc ctg cgg ctg acg ctg cgc 24931
Leu Ala Asp Leu Ile Phe Pro Glu Arg Leu Arg Leu Thr Leu Arg
4680 4685 4690
aac ggg ctg ccc gac ttt atg agc caa agc atg ttg caa aac ttt 24976
Asn Gly Leu Pro Asp Phe Met Ser Gln Ser Met Leu Gln Asn Phe
4695 4700 4705
cgc tct ttc atc ctc gaa cgc tcc ggg atc ctg ccc gcc acc tgc 25021
Arg Ser Phe Ile Leu Glu Arg Ser Gly Ile Leu Pro Ala Thr Cys
4710 4715 4720
tcc gcg ctg ccc tcg gac ttc gtg ccg ctg acc ttc cgc gag tgc 25066
Ser Ala Leu Pro Ser Asp Phe Val Pro Leu Thr Phe Arg Glu Cys
4725 4730 4735
ccc ccg ccg ctc tgg agc cac tgc tac ttg ctg cgc ctg gcc aac 25111
Pro Pro Pro Leu Trp Ser His Cys Tyr Leu Leu Arg Leu Ala Asn
4740 4745 4750
tac ctg gcc tac cac tcg gac gtg atc gag gac gtc agc ggc gag 25156
Tyr Leu Ala Tyr His Ser Asp Val Ile Glu Asp Val Ser Gly Glu
4755 4760 4765
ggt ctg ctc gag tgc cac tgt cgc tgc aac ctc tgc acg ccg cac 25201
Gly Leu Leu Glu Cys His Cys Arg Cys Asn Leu Cys Thr Pro His
4770 4775 4780
cgc tcc ctg gcc tgc aac ccc cag ctg ctg agc gag acc cag atc 25246
Arg Ser Leu Ala Cys Asn Pro Gln Leu Leu Ser Glu Thr Gln Ile
4785 4790 4795
atc ggc acc ttc gag ttg caa ggc ccc ggc gag gag ggc aag ggg 25291
Ile Gly Thr Phe Glu Leu Gln Gly Pro Gly Glu Glu Gly Lys Gly
4800 4805 4810
ggt ctg aaa ctc acc ccg ggg ctg tgg acc tcg gcc tac ttg cgc 25336
Gly Leu Lys Leu Thr Pro Gly Leu Trp Thr Ser Ala Tyr Leu Arg
4815 4820 4825
aag ttc gtg ccc gag gac tac cat ccc ttc gag atc agg ttc tac 25381
Lys Phe Val Pro Glu Asp Tyr His Pro Phe Glu Ile Arg Phe Tyr
4830 4835 4840
gag gac caa tcc cag ccg ccc aag gcc gag ctg tcg gcc tgc gtc 25426
Glu Asp Gln Ser Gln Pro Pro Lys Ala Glu Leu Ser Ala Cys Val
4845 4850 4855
atc acc cag ggg gcc atc ctg gcc caa ttg caa gcc atc cag aaa 25471
Ile Thr Gln Gly Ala Ile Leu Ala Gln Leu Gln Ala Ile Gln Lys
4860 4865 4870
tcc cgc caa gaa ttt ctg ctg aaa aag ggc cac ggg gtc tac ttg 25516
Ser Arg Gln Glu Phe Leu Leu Lys Lys Gly His Gly Val Tyr Leu
4875 4880 4885
gac ccc cag acc gga gag gag ctc aac ccc agc ttc ccc cag gat 25561
Asp Pro Gln Thr Gly Glu Glu Leu Asn Pro Ser Phe Pro Gln Asp
4890 4895 4900
gcc ccg agg aag cag caa gaa gct gaa agt gga gct gcc gcc gcc 25606
Ala Pro Arg Lys Gln Gln Glu Ala Glu Ser Gly Ala Ala Ala Ala
4905 4910 4915
gcc gga gga ttt gga gga aga ctg gga gag cag tca ggc aga gga 25651
Ala Gly Gly Phe Gly Gly Arg Leu Gly Glu Gln Ser Gly Arg Gly
4920 4925 4930
gat gga aga ctg gga cag cac tca ggc aga gga gga cag cct gca 25696
Asp Gly Arg Leu Gly Gln His Ser Gly Arg Gly Gly Gln Pro Ala
4935 4940 4945
aga cag tct gga gga gga aga cga ggt gga gga gga ggc aga gga 25741
Arg Gln Ser Gly Gly Gly Arg Arg Gly Gly Gly Gly Gly Arg Gly
4950 4955 4960
aga agc agc cgc cgc cag acc gtc gtc ctc ggc gga gaa agc aag 25786
Arg Ser Ser Arg Arg Gln Thr Val Val Leu Gly Gly Glu Ser Lys
4965 4970 4975
cag cac gga tac cat ctc cgc tcc ggg tcg ggg tcg cgg cgg ccg 25831
Gln His Gly Tyr His Leu Arg Ser Gly Ser Gly Ser Arg Arg Pro
4980 4985 4990
ggc cca cag taggtgggac gagaccgggc gcttcccgaa ccccaccacc 25880
Gly Pro Gln
4995
cagaccggta agaaggagcg gcagggatac aagtcctggc gggggcacaa aaacgccatc 25940
gtctcctgct tgcaagcctg cgggggcaac atctccttca cccggcgcta cctgctcttc 26000
caccgcgggg tgaacttccc ccgcaacatc ttgcattact accgtcacct ccacagcccc 26060
tactactgtt tccaagaaga ggcagaaacc cagcagcagc agaaaaccag cggcagcagc 26120
agctagaaaa tccacagcgg cggcaggtgg actgaggatc gcggcgaacg agccggcgca 26180
gacccgggag ctgaggaacc ggatctttcc caccctctat gccatcttcc agcagagtcg 26240
ggggcaggag caggaactga aagtcaagaa ccgttctctg cgctcgctca cccgcagttg 26300
tctgtatcac aagagcgaag accaacttca gcgcactctc gaggacgccg aggctctctt 26360
caacaagtac tgcgcgctca ctcttaaaga gtagcccgcg cccgcccaca cacggaaaaa 26420
ggcgggaatt acgtcaccac ctgcgccctt cgcccgacca tc atg agc aaa gag 26474
Met Ser Lys Glu
att ccc acg cct tac atg tgg agc tac cag ccc cag atg ggc ctg 26519
Ile Pro Thr Pro Tyr Met Trp Ser Tyr Gln Pro Gln Met Gly Leu
5000 5005 5010
gcc gcc ggc gcc gcc cag gac tac tcc acc cgc atg aac tgg ctc 26564
Ala Ala Gly Ala Ala Gln Asp Tyr Ser Thr Arg Met Asn Trp Leu
5015 5020 5025
agt gcc ggg ccc gcg atg atc tca cgg gtg aat gac atc cgc gcc 26609
Ser Ala Gly Pro Ala Met Ile Ser Arg Val Asn Asp Ile Arg Ala
5030 5035 5040
cgc cga aac cag ata ctc cta gaa cag tca gcg atc acc gcc acg 26654
Arg Arg Asn Gln Ile Leu Leu Glu Gln Ser Ala Ile Thr Ala Thr
5045 5050 5055
ccc cgc cat cac ctt aat ccg cgt aat tgg ccc gcc gcc ctg gtg 26699
Pro Arg His His Leu Asn Pro Arg Asn Trp Pro Ala Ala Leu Val
5060 5065 5070
tac cag gaa att ccc cag ccc acg acc gta cta ctt ccg cga gac 26744
Tyr Gln Glu Ile Pro Gln Pro Thr Thr Val Leu Leu Pro Arg Asp
5075 5080 5085
gcc cag gcc gaa gtc cag ctg act aac tca ggt gtc cag ctg gcc 26789
Ala Gln Ala Glu Val Gln Leu Thr Asn Ser Gly Val Gln Leu Ala
5090 5095 5100
ggc ggc gcc gcc ctg tgt cgt cac cgc ccc gct cag ggt ata aag 26834
Gly Gly Ala Ala Leu Cys Arg His Arg Pro Ala Gln Gly Ile Lys
5105 5110 5115
cgg ctg gtg atc cga ggc aga ggc aca cag ctc aac gac gag gtg 26879
Arg Leu Val Ile Arg Gly Arg Gly Thr Gln Leu Asn Asp Glu Val
5120 5125 5130
gtg agc tct tcg ctg ggt ctg cga cct gac gga gtc ttc caa ctc 26924
Val Ser Ser Ser Leu Gly Leu Arg Pro Asp Gly Val Phe Gln Leu
5135 5140 5145
gcc gga tcg ggg aga tct tcc ttc acg cct cgt cag gcc gtc ctg 26969
Ala Gly Ser Gly Arg Ser Ser Phe Thr Pro Arg Gln Ala Val Leu
5150 5155 5160
act ttg gag agt tcg tcc tcg cag ccc cgc tcg ggc ggc atc ggc 27014
Thr Leu Glu Ser Ser Ser Ser Gln Pro Arg Ser Gly Gly Ile Gly
5165 5170 5175
act ctc cag ttc gtg gag gag ttc act ccc tcg gtc tac ttc aac 27059
Thr Leu Gln Phe Val Glu Glu Phe Thr Pro Ser Val Tyr Phe Asn
5180 5185 5190
ccc ttc tcc ggc tcc ccc ggc cac tac ccg gac gag ttc atc ccg 27104
Pro Phe Ser Gly Ser Pro Gly His Tyr Pro Asp Glu Phe Ile Pro
5195 5200 5205
aac ttc gac gcc atc agc gag tcg gtg gac ggc tac gat tga atg 27149
Asn Phe Asp Ala Ile Ser Glu Ser Val Asp Gly Tyr Asp Met
5210 5215 5220
tcc cat ggt ggc gca gct gac cta gct cgg ctt cga cac ctg gac 27194
Ser His Gly Gly Ala Ala Asp Leu Ala Arg Leu Arg His Leu Asp
5225 5230 5235
cac tgc cgc cgc ttc cgc tgc ttc gct cgg gat ctc gcc gag ttt 27239
His Cys Arg Arg Phe Arg Cys Phe Ala Arg Asp Leu Ala Glu Phe
5240 5245 5250
gcc tac ttt gag ctg ccc gag gag cac cct cag ggc cca gcc cac 27284
Ala Tyr Phe Glu Leu Pro Glu Glu His Pro Gln Gly Pro Ala His
5255 5260 5265
gga gtg cgg atc atc gtc gaa ggg ggc ctc gac tcc cac ctg ctt 27329
Gly Val Arg Ile Ile Val Glu Gly Gly Leu Asp Ser His Leu Leu
5270 5275 5280
cgg atc ttc agc cag cga ccg atc ctg gtc gag cgc gaa caa gga 27374
Arg Ile Phe Ser Gln Arg Pro Ile Leu Val Glu Arg Glu Gln Gly
5285 5290 5295
cag acc cgt ctg acc ctg tac tgc atc tgc aac cac ccc ggc ctg 27419
Gln Thr Arg Leu Thr Leu Tyr Cys Ile Cys Asn His Pro Gly Leu
5300 5305 5310
cat gaa agt ctt tgt tgt ctg ctg tgt act gag tat aat aaa agc 27464
His Glu Ser Leu Cys Cys Leu Leu Cys Thr Glu Tyr Asn Lys Ser
5315 5320 5325
tgagatcagc gactactccg gactcgattg tggtgttcct gctatcaacc ggtccctgtt 27524
cttcaccggg aacgaaaccg agctccagct ccagtgtaag ccccacaaga agtacctcac 27584
ctggctgttc cagggctccc ccatcgccgt tgtcaaccac tgcgacaacg acggagtcct 27644
gctgagcggc cctgccaacc ttactttttc cacccgcaga agcaagctcc agctcttcca 27704
acccttcctc cccgggacct atcagtgcgt ctcgggaccc tgccatcaca ccttccacct 27764
gatcccgaat accacagcgc cgctccccgc tactaacaac caaactaacc tccaccaacg 27824
ccaccgtcgc gacctttcct ctgaatctaa taccactacc ggaggtgagc tccgaggtcg 27884
accaacctct gggatttact acggcccctg ggaggtggtg gggttaatag cgctaggcct 27944
agttgtgggt gggcttttgg ctctctgcta cctatacctc ccttgctgtt cgtacttagt 28004
ggtgctgtgt tgctggttta agaa atg ggg cag atc acc cta gtg agc tgc 28055
Met Gly Gln Ile Thr Leu Val Ser Cys
5330 5335
ggt gtg ctg gtg gcg gtg ctt tcg att gtg gga ctg ggc ggc gcg 28100
Gly Val Leu Val Ala Val Leu Ser Ile Val Gly Leu Gly Gly Ala
5340 5345 5350
gct gta gtg aag gag gag aag gcc gat ccc tgc ttg cat ttc aat 28145
Ala Val Val Lys Glu Glu Lys Ala Asp Pro Cys Leu His Phe Asn
5355 5360 5365
ccc gac aaa tgc cag ctg agt ttt cag ccc gat ggc aat cgg tgc 28190
Pro Asp Lys Cys Gln Leu Ser Phe Gln Pro Asp Gly Asn Arg Cys
5370 5375 5380
gcg gtg ctg atc aag tgc gga tgg gaa tgc gag aac gtg aga atc 28235
Ala Val Leu Ile Lys Cys Gly Trp Glu Cys Glu Asn Val Arg Ile
5385 5390 5395
gag tac aat aac aag act cgg aac aat act ctc gcg tcc gtg tgg 28280
Glu Tyr Asn Asn Lys Thr Arg Asn Asn Thr Leu Ala Ser Val Trp
5400 5405 5410
cag ccc ggg gac ccc gag tgg tac acc gtc tct gtc ccc ggt gct 28325
Gln Pro Gly Asp Pro Glu Trp Tyr Thr Val Ser Val Pro Gly Ala
5415 5420 5425
gac ggc tcc ccg cgc acc gtg aat aat act ttc att ttt gcg cac 28370
Asp Gly Ser Pro Arg Thr Val Asn Asn Thr Phe Ile Phe Ala His
5430 5435 5440
atg tgc aac acg gtc atg tgg atg agc aag cag tac gat atg tgg 28415
Met Cys Asn Thr Val Met Trp Met Ser Lys Gln Tyr Asp Met Trp
5445 5450 5455
ccc ccc acg aag gag aac atc gtg gtc ttc tcc atc gct tac agc 28460
Pro Pro Thr Lys Glu Asn Ile Val Val Phe Ser Ile Ala Tyr Ser
5460 5465 5470
ctg tgc acg gcg cta atc acc gct atc gtg tgc ctg agc att cac 28505
Leu Cys Thr Ala Leu Ile Thr Ala Ile Val Cys Leu Ser Ile His
5475 5480 5485
atg ctc atc gct att cgc ccc aga aat aat gcc gag aaa gag aaa 28550
Met Leu Ile Ala Ile Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys
5490 5495 5500
cag cca taacacgttt tttcacacac cttgttttta cagaca atg cgt ctg tta 28604
Gln Pro Met Arg Leu Leu
5505
aat ttt tta aac att gtg ctc agt att gct tat gcc tct ggt tat 28649
Asn Phe Leu Asn Ile Val Leu Ser Ile Ala Tyr Ala Ser Gly Tyr
5510 5515 5520
gca aac ata cag aaa acc ctt tat gta gga tct gat ggt aca cta 28694
Ala Asn Ile Gln Lys Thr Leu Tyr Val Gly Ser Asp Gly Thr Leu
5525 5530 5535
gag ggt acc caa tca caa gcc aag gtt gca tgg tat ttt tat aga 28739
Glu Gly Thr Gln Ser Gln Ala Lys Val Ala Trp Tyr Phe Tyr Arg
5540 5545 5550
acc aac act gat cca gtt aaa ctt tgt aag ggt gaa ttg ccg cgt 28784
Thr Asn Thr Asp Pro Val Lys Leu Cys Lys Gly Glu Leu Pro Arg
5555 5560 5565
aca cat aaa act cca ctt aca ttt agt tgc agc aat aat aat ctt 28829
Thr His Lys Thr Pro Leu Thr Phe Ser Cys Ser Asn Asn Asn Leu
5570 5575 5580
aca ctt ttt tca att aca aaa caa tat act ggt act tat tac agt 28874
Thr Leu Phe Ser Ile Thr Lys Gln Tyr Thr Gly Thr Tyr Tyr Ser
5585 5590 5595
aca aac ttt cat aca gga caa gat aaa tat tat act gtt aag gta 28919
Thr Asn Phe His Thr Gly Gln Asp Lys Tyr Tyr Thr Val Lys Val
5600 5605 5610
gaa aat cct acc act cct aga act acc acc acc acc acc act act 28964
Glu Asn Pro Thr Thr Pro Arg Thr Thr Thr Thr Thr Thr Thr Thr
5615 5620 5625
gca aag ccc act gtg aaa act aca act agg acc acc aca act aca 29009
Ala Lys Pro Thr Val Lys Thr Thr Thr Arg Thr Thr Thr Thr Thr
5630 5635 5640
gaa acc acc acc agc aca aca ctt gct gca act aca cac aca cac 29054
Glu Thr Thr Thr Ser Thr Thr Leu Ala Ala Thr Thr His Thr His
5645 5650 5655
act aag cta acc tta cag acc act aat gat ttg atc gcc ctg ctg 29099
Thr Lys Leu Thr Leu Gln Thr Thr Asn Asp Leu Ile Ala Leu Leu
5660 5665 5670
caa aag ggg gat aac agc acc act tcc aat gag gag ata ccc aaa 29144
Gln Lys Gly Asp Asn Ser Thr Thr Ser Asn Glu Glu Ile Pro Lys
5675 5680 5685
tcc atg att ggc att att gtt gct gta gtg gtg tgc atg ttg atc 29189
Ser Met Ile Gly Ile Ile Val Ala Val Val Val Cys Met Leu Ile
5690 5695 5700
atc gcc ttg tgc atg gtg tac tat gcc ttc tgc tac aga aag cac 29234
Ile Ala Leu Cys Met Val Tyr Tyr Ala Phe Cys Tyr Arg Lys His
5705 5710 5715
aga ctg aac gac aag ctg gaa cac tta cta agt gtt gaa ttt 29276
Arg Leu Asn Asp Lys Leu Glu His Leu Leu Ser Val Glu Phe
5720 5725 5730
taatttttta gaacc atg aag atc cta ggc ctt ttt agt ttt tct atc 29324
Met Lys Ile Leu Gly Leu Phe Ser Phe Ser Ile
5735 5740
att acc tct gct ctt tgt gaa tca gtg gat aga gat gtt act att 29369
Ile Thr Ser Ala Leu Cys Glu Ser Val Asp Arg Asp Val Thr Ile
5745 5750 5755
acc act ggt tct aat tat aca ctg aaa ggg cca ccc tca ggt atg 29414
Thr Thr Gly Ser Asn Tyr Thr Leu Lys Gly Pro Pro Ser Gly Met
5760 5765 5770
ctt tcg tgg tat tgc tat ttt gga act gac act gat caa act gaa 29459
Leu Ser Trp Tyr Cys Tyr Phe Gly Thr Asp Thr Asp Gln Thr Glu
5775 5780 5785
tta tgc aat ttt caa aaa ggc aaa acc tca aac tct aaa atc tct 29504
Leu Cys Asn Phe Gln Lys Gly Lys Thr Ser Asn Ser Lys Ile Ser
5790 5795 5800
aat tat caa tgc aat ggc act gat ctg ata cta ctc aat gtc acg 29549
Asn Tyr Gln Cys Asn Gly Thr Asp Leu Ile Leu Leu Asn Val Thr
5805 5810 5815
aaa gca tat ggt ggc agt tat tat tgc cct gga caa aac act gaa 29594
Lys Ala Tyr Gly Gly Ser Tyr Tyr Cys Pro Gly Gln Asn Thr Glu
5820 5825 5830
gaa atg att ttt tac aaa gtg gaa gtg gtt gat ccc act aca cca 29639
Glu Met Ile Phe Tyr Lys Val Glu Val Val Asp Pro Thr Thr Pro
5835 5840 5845
ccc acc acc aca act att cat acc aca cac aca gaa caa aca cca 29684
Pro Thr Thr Thr Thr Ile His Thr Thr His Thr Glu Gln Thr Pro
5850 5855 5860
gag gca aca gaa gca gag ttg gcc ttc cag gtt cac gga gat tcc 29729
Glu Ala Thr Glu Ala Glu Leu Ala Phe Gln Val His Gly Asp Ser
5865 5870 5875
ttt gct gtc aat acc cct aca ccc gat cag cgg tgt ccg ggg ccg 29774
Phe Ala Val Asn Thr Pro Thr Pro Asp Gln Arg Cys Pro Gly Pro
5880 5885 5890
cta gtc agc ggc att gtc ggt gtg ctt tcg gga tta gca gtc ata 29819
Leu Val Ser Gly Ile Val Gly Val Leu Ser Gly Leu Ala Val Ile
5895 5900 5905
atc atc tgc atg ttc att ttt gct tgc tgc tat aga agg ctt tac 29864
Ile Ile Cys Met Phe Ile Phe Ala Cys Cys Tyr Arg Arg Leu Tyr
5910 5915 5920
cga caa aaa tca gac cca ctg ctg aac ctc tat gtt taattttttc 29910
Arg Gln Lys Ser Asp Pro Leu Leu Asn Leu Tyr Val
5925 5930 5935
cagagcc atg aag gca gtt agc gct cta gtt ttt tgt tct ttg att ggc 29959
Met Lys Ala Val Ser Ala Leu Val Phe Cys Ser Leu Ile Gly
5940 5945
att gtt ttt aat agt aaa att acc aaa gtt agc ttt att aaa cat 30004
Ile Val Phe Asn Ser Lys Ile Thr Lys Val Ser Phe Ile Lys His
5950 5955 5960
gtt aat gta act gaa gga gat aac atc aca cta gca ggt gta gaa 30049
Val Asn Val Thr Glu Gly Asp Asn Ile Thr Leu Ala Gly Val Glu
5965 5970 5975
ggt gct caa aac acc acc tgg aca aaa tac cat cta gga tgg aga 30094
Gly Ala Gln Asn Thr Thr Trp Thr Lys Tyr His Leu Gly Trp Arg
5980 5985 5990
gat att tgc acc tgg aat gta act tat tat tgc ata gga att aat 30139
Asp Ile Cys Thr Trp Asn Val Thr Tyr Tyr Cys Ile Gly Ile Asn
5995 6000 6005
ctt acc att gtt aac gct aac caa tct cag aat ggg tta att aaa 30184
Leu Thr Ile Val Asn Ala Asn Gln Ser Gln Asn Gly Leu Ile Lys
6010 6015 6020
gga cag agt gtt agt gtg acc agt gat ggg tac tat acc cag cat 30229
Gly Gln Ser Val Ser Val Thr Ser Asp Gly Tyr Tyr Thr Gln His
6025 6030 6035
agt ttt aac tac aac att act gtc ata cca ctg cct acg cct agc 30274
Ser Phe Asn Tyr Asn Ile Thr Val Ile Pro Leu Pro Thr Pro Ser
6040 6045 6050
cca cct agc act acc aca cag aca acc aca tac agt aca tca aat 30319
Pro Pro Ser Thr Thr Thr Gln Thr Thr Thr Tyr Ser Thr Ser Asn
6055 6060 6065
cag cct acc acc act aca gca gca gag gtt gcc agc tcg tct ggg 30364
Gln Pro Thr Thr Thr Thr Ala Ala Glu Val Ala Ser Ser Ser Gly
6070 6075 6080
gtc cga gtg gca ttt ttg atg ttg gcc cca tct agc agt ccc act 30409
Val Arg Val Ala Phe Leu Met Leu Ala Pro Ser Ser Ser Pro Thr
6085 6090 6095
gct agt acc aat gag cag act act gaa ttt ttg tcc act gtc gag 30454
Ala Ser Thr Asn Glu Gln Thr Thr Glu Phe Leu Ser Thr Val Glu
6100 6105 6110
agc cac acc aca gct acc tcc agt gcc ttc tct agc acc gcc aat 30499
Ser His Thr Thr Ala Thr Ser Ser Ala Phe Ser Ser Thr Ala Asn
6115 6120 6125
ctc tcc tcg ctt tcc tct aca cca atc agc ccc gct act act cct 30544
Leu Ser Ser Leu Ser Ser Thr Pro Ile Ser Pro Ala Thr Thr Pro
6130 6135 6140
agc ccc gct cct ctt ccc act ccc ctg aag caa aca gac ggc ggc 30589
Ser Pro Ala Pro Leu Pro Thr Pro Leu Lys Gln Thr Asp Gly Gly
6145 6150 6155
atg caa tgg cag atc acc ctg ctc att gtg atc ggg ttg gtc atc 30634
Met Gln Trp Gln Ile Thr Leu Leu Ile Val Ile Gly Leu Val Ile
6160 6165 6170
ctg gcc gtg ttg ctc tac tac atc ttc tgc cgc cgc att ccc aac 30679
Leu Ala Val Leu Leu Tyr Tyr Ile Phe Cys Arg Arg Ile Pro Asn
6175 6180 6185
gcg cac cgc aag ccg gcc tac aag ccc atc gtt atc ggg cag ccg 30724
Ala His Arg Lys Pro Ala Tyr Lys Pro Ile Val Ile Gly Gln Pro
6190 6195 6200
gag ccg ctt cag gtg gaa ggg ggt cta agg aat ctt ctc ttc tct 30769
Glu Pro Leu Gln Val Glu Gly Gly Leu Arg Asn Leu Leu Phe Ser
6205 6210 6215
ttt aca gta tgg tgattgaact atg att cct aga caa ttc ttg atc act 30818
Phe Thr Val Trp Met Ile Pro Arg Gln Phe Leu Ile Thr
6220 6225 6230
att ctt atc tgc ctc ctc caa gtc tgt gcc acc ctc gct ctg gtg 30863
Ile Leu Ile Cys Leu Leu Gln Val Cys Ala Thr Leu Ala Leu Val
6235 6240 6245
gcc aac gcc agt cca gac tgt att ggg ccc ttc gcc tcc tac gtg 30908
Ala Asn Ala Ser Pro Asp Cys Ile Gly Pro Phe Ala Ser Tyr Val
6250 6255 6260
ctc ttt gcc ttc atc acc tgc atc tgc tgc tgt agc ata gtc tgc 30953
Leu Phe Ala Phe Ile Thr Cys Ile Cys Cys Cys Ser Ile Val Cys
6265 6270 6275
ctg ctt atc acc ttc ttc cag ttc att gac tgg atc ttt gtg cgc 30998
Leu Leu Ile Thr Phe Phe Gln Phe Ile Asp Trp Ile Phe Val Arg
6280 6285 6290
atc gcc tac ctg cgc cac cac ccc cag tac cgc gac cag cga gtg 31043
Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg Asp Gln Arg Val
6295 6300 6305
gcg cag ctg ctc agg ctc ctc tgataagc atg cgg gct ctg cta ctt 31090
Ala Gln Leu Leu Arg Leu Leu Met Arg Ala Leu Leu Leu
6310 6315 6320
ctc gcg ctt ctg ctg tta gtg ctc ccc cgt ccc gtt gac ccc cgg 31135
Leu Ala Leu Leu Leu Leu Val Leu Pro Arg Pro Val Asp Pro Arg
6325 6330 6335
ccc ccc act cag tcc ccc gag gag gtc cgc aaa tgc aaa ttc caa 31180
Pro Pro Thr Gln Ser Pro Glu Glu Val Arg Lys Cys Lys Phe Gln
6340 6345 6350
gaa ccc tgg aaa ttc ctc aaa tgc tac cgc caa aaa tca gac atg 31225
Glu Pro Trp Lys Phe Leu Lys Cys Tyr Arg Gln Lys Ser Asp Met
6355 6360 6365
cat ccc agc tgg atc atg atc att ggg atc gtg aac att ctg gcc 31270
His Pro Ser Trp Ile Met Ile Ile Gly Ile Val Asn Ile Leu Ala
6370 6375 6380
tgc acc ctc atc tcc ttt gtg att tac ccc tgc ttt gac ttt ggt 31315
Cys Thr Leu Ile Ser Phe Val Ile Tyr Pro Cys Phe Asp Phe Gly
6385 6390 6395
tgg aac tcg cca gag gcg ctc tat ctc ccg cct gaa cct gac aca 31360
Trp Asn Ser Pro Glu Ala Leu Tyr Leu Pro Pro Glu Pro Asp Thr
6400 6405 6410
cca cca cag caa cct cag gca cac gca cta cca cca cca cca cag 31405
Pro Pro Gln Gln Pro Gln Ala His Ala Leu Pro Pro Pro Pro Gln
6415 6420 6425
cct agg cca caa tac atg ccc ata tta gac tat gag gcc gag cca 31450
Pro Arg Pro Gln Tyr Met Pro Ile Leu Asp Tyr Glu Ala Glu Pro
6430 6435 6440
cag cga ccc atg ctc ccc gct att agt tac ttc aat cta acc ggc 31495
Gln Arg Pro Met Leu Pro Ala Ile Ser Tyr Phe Asn Leu Thr Gly
6445 6450 6455
gga gat gac tgacccactg gccaacaaca acgtcaacga ccttctcctg 31544
Gly Asp Asp
gacatggacg gccgcgcctc ggagcagcga ctcgcccaac ttcgcattcg ccagcagcag 31604
gagagagccg tcaaggagct gcaggacggc atagccatcc accagtgcaa gaaaggcatc 31664
ttctgcctgg tgaaacaggc caagatctcc tacgaggtca cccagaccga ccatcgcctc 31724
tcctacgagc tcctgcagca gcgccagaag ttcacctgcc tggtcggagt caaccccatc 31784
gtcatcaccc agcagtcggg cgataccaag gggtgcatcc actgctcctg cgactccccc 31844
gactgcgtcc acactctgat caagaccctc tgcggcctcc gcgacctcct ccccatgaac 31904
taatcacccc cttatccagt gaaataaaga tcatattgat gattaaataa aaaaaataat 31964
catttgattt gaaataaaga tacaatcata ttgatgattt gagtttaata aaaataaaga 32024
atcacttact tgaaatctga taccaggtct ctgtccatgt tttctgccaa caccacttca 32084
ctcccctctt cccagctctg gtactgcagg ccccggcggg ctgcaaactt cctccacacc 32144
ctgaagggga tgtcaaattc ctcctgtccc tcaatcttca ttttatcttc tatcag atg 32203
Met
tcc aaa aag cgc gtc cgg gtg gat gat gac ttc gac ccc gtc tac 32248
Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr
6460 6465 6470
ccc tac gat gca gac aac gca ccg acc gtg ccc ttc atc aac ccc 32293
Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro
6475 6480 6485
ccc ttc gtc tct tca gat gga ttc caa gag aag ccc ctg ggg gtg 32338
Pro Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val
6490 6495 6500
ctg tcc ctg cgt ctg gcc gat ccc gtc acc acc aag aac ggg gaa 32383
Leu Ser Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu
6505 6510 6515
atc acc ctc aag ctg gga gat ggg gtg gac ctc gac gac tcg gga 32428
Ile Thr Leu Lys Leu Gly Asp Gly Val Asp Leu Asp Asp Ser Gly
6520 6525 6530
aaa ctc atc tcc aac acg gcc acc aag gcc gcc gcc cct ctc agt 32473
Lys Leu Ile Ser Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser
6535 6540 6545
ttt tcc aac aac acc att tcc ctt aac atg gat acc cct ctt tac 32518
Phe Ser Asn Asn Thr Ile Ser Leu Asn Met Asp Thr Pro Leu Tyr
6550 6555 6560
aac aac aat gga aag cta ggt atg aag gta acc gca cca tta aag 32563
Asn Asn Asn Gly Lys Leu Gly Met Lys Val Thr Ala Pro Leu Lys
6565 6570 6575
ata tta gac aca gat cta cta aaa aca ctt gtt gtt gct tat ggg 32608
Ile Leu Asp Thr Asp Leu Leu Lys Thr Leu Val Val Ala Tyr Gly
6580 6585 6590
cag gga tta gga aca aac acc aat ggt gct ctt gtt gcc caa cta 32653
Gln Gly Leu Gly Thr Asn Thr Asn Gly Ala Leu Val Ala Gln Leu
6595 6600 6605
gca tac cca ctt gtt ttt aat acc gct agc aaa att gcc ctt aat 32698
Ala Tyr Pro Leu Val Phe Asn Thr Ala Ser Lys Ile Ala Leu Asn
6610 6615 6620
tta ggc aat gga cca tta aaa gtg gat gca aat aga ctg aac att 32743
Leu Gly Asn Gly Pro Leu Lys Val Asp Ala Asn Arg Leu Asn Ile
6625 6630 6635
aat tgc aaa aga ggt atc tat gtc act acc aca aaa gat gca ctg 32788
Asn Cys Lys Arg Gly Ile Tyr Val Thr Thr Thr Lys Asp Ala Leu
6640 6645 6650
gag att aat atc agt tgg gca aat gct atg aca ttt ata gga aat 32833
Glu Ile Asn Ile Ser Trp Ala Asn Ala Met Thr Phe Ile Gly Asn
6655 6660 6665
gcc att ggt gtc aat att gac aca aaa aaa ggc cta cag ttc ggc 32878
Ala Ile Gly Val Asn Ile Asp Thr Lys Lys Gly Leu Gln Phe Gly
6670 6675 6680
act tca agc act gaa aca gat gtt aaa aat gct ttt cca ctc caa 32923
Thr Ser Ser Thr Glu Thr Asp Val Lys Asn Ala Phe Pro Leu Gln
6685 6690 6695
gta aaa ctt gga gct ggt ctt aca ttt gac agc aca ggt gcc att 32968
Val Lys Leu Gly Ala Gly Leu Thr Phe Asp Ser Thr Gly Ala Ile
6700 6705 6710
gtt gct tgg aac aaa gaa gat gac aaa ctt aca ctg tgg acc aca 33013
Val Ala Trp Asn Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr Thr
6715 6720 6725
gcc gat cca tct cca aac tgt cac ata tat tct gca aag gat gct 33058
Ala Asp Pro Ser Pro Asn Cys His Ile Tyr Ser Ala Lys Asp Ala
6730 6735 6740
aag ctt aca ctc tgc ttg aca aag tgt ggt agt cag ata ctg ggc 33103
Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln Ile Leu Gly
6745 6750 6755
act gtt tct ctc ata gct gtt gat act ggt agc tta aat cca ata 33148
Thr Val Ser Leu Ile Ala Val Asp Thr Gly Ser Leu Asn Pro Ile
6760 6765 6770
aca gga aaa gta acc act gct ctt gtt tca ctt aaa ttc gat gcc 33193
Thr Gly Lys Val Thr Thr Ala Leu Val Ser Leu Lys Phe Asp Ala
6775 6780 6785
aat gga gtt ttg caa gcc agt tca aca cta gat aaa gaa tat tgg 33238
Asn Gly Val Leu Gln Ala Ser Ser Thr Leu Asp Lys Glu Tyr Trp
6790 6795 6800
aat ttc aga aaa gga gat gtg aca cct gct gac ccc tac act aat 33283
Asn Phe Arg Lys Gly Asp Val Thr Pro Ala Asp Pro Tyr Thr Asn
6805 6810 6815
gct ata ggc ttt atg ccc aac ctt aat gca tac cca aaa aac aca 33328
Ala Ile Gly Phe Met Pro Asn Leu Asn Ala Tyr Pro Lys Asn Thr
6820 6825 6830
aac gca gct gca aaa agt cac att gtt gga aaa gta tac cta cat 33373
Asn Ala Ala Ala Lys Ser His Ile Val Gly Lys Val Tyr Leu His
6835 6840 6845
ggg gat gta agc aag cca cta gac ttg ata att aca ttt aat gaa 33418
Gly Asp Val Ser Lys Pro Leu Asp Leu Ile Ile Thr Phe Asn Glu
6850 6855 6860
acc agt gat gaa tcc tgt act tat tgc att aac ttt cag tgg cag 33463
Thr Ser Asp Glu Ser Cys Thr Tyr Cys Ile Asn Phe Gln Trp Gln
6865 6870 6875
tgg gga act gac caa tat aaa gat gaa aca ctt gca gtc agt tca 33508
Trp Gly Thr Asp Gln Tyr Lys Asp Glu Thr Leu Ala Val Ser Ser
6880 6885 6890
ttc acc ttc tca tac att gct aaa gaa taacatccac cctgcatgcc 33555
Phe Thr Phe Ser Tyr Ile Ala Lys Glu
6895 6900
aacccatttc cctctatcta tacatggaaa actctgaagc agaaaaaata aagttcaagt 33615
gttttattga ttcaacagtt tttacagaat tcgagtagtt attttccctc caccctccca 33675
actcatggaa tacaccatcc tctccccacg cacagcctta aacatctgaa tgccattggt 33735
aatggacatg gttttggcct ccacattcca cacagtttca gagcgagcca gtctcgggtc 33795
ggtcagggag atgaaaccct ccgggcactc ctgcatctgc acctcacagt tcaacagctg 33855
agggctgtcc tcggtggtcg ggatcacggt tatctggaag aagagcgatg agaatcataa 33915
tccgcgaacg ggatcgggcg gttgtggcgc atcaggcccc gcagcagtcg ctgtctgcgc 33975
cgctccgtca agctgctgct caaggggtcc gggtccaggg actccctgcg catgatgccg 34035
atggccctga gcatcagtcg cctggtgcgg cgggcgcagc agcggatgcg gatctcactc 34095
aggtcggagc agtacgtgca gcacagcacc accaagttgt tcaacagtcc atagttcaac 34155
acgctccagc caaaactcat ctgtggaact atgctgccca cgtgtccatc gtaccagatc 34215
ctgatgtaaa tcaggtggcg ccccctccag aacacactgc ccatgtacat gatctccttg 34275
ggcatatgca ggttcaccac ctcccggtac cacatcaccc gctggttgaa catgcagccc 34335
tggataattc tgcggaacca gatggccagc accgccccgc ccgccatgca gcgcagggac 34395
cccgggtcct gacagtggca gtggaggacc caccgctcgc ggccgtggat caactgggag 34455
ctgaacaggt ctatgttggc acagcacagg cacacgctca tgcatgtctt cagcactctc 34515
agttcctcgg gggtcaggac catgtcccag ggcacgggga actcttgcag gacagtgaac 34575
ccggcagaac agggcagccc tcgcacacaa cttacattgt gcatggacag ggtatcgcaa 34635
tcaggcagca ccggatgatc ctccaccaga gaagcgcggg tctcggtctc ctcacaacga 34695
ggtaaggggg ccggcggttg gtacggatga tggcgggatg acgctaatcg tgttctggat 34755
cgtgtcatga tggagcttct tcctgacatc ttcgtatttc atgtagcaga acctggtccg 34815
ggcactgcac accgctcgcc ggcgacggtc tcggcgcttc gagcgctcgg tgttgaagtt 34875
gtaaaacagc cactccctca gagcgtgcag tatctcttga gcctcttggg tgatgaaaat 34935
cccatccgcc ctgatggctc tgatcacatc gaccacggtg gaatgggcca gacccagcca 34995
gatgatgcaa ttttgttggg tttcggtgac ggcgggggag ggaagaacag gaagaaccat 35055
gattaacttt attccaaacg gtctcggagc acttcaaaat gcaggtcgcg gagatggcac 35115
ctctcgcccc cactgtgttg atggaaaata acagccaggt caaaggtgac acggttctcg 35175
agatgttcca cggtggcttc cagcaaagcc tccacgcgca catccagaaa caagaggaca 35235
gcgaaagcgg gagcgttctc taattcctca atcatcatat tacactcctg caccatcccc 35295
agataatttt catttttcca gccttgaatg atttgaacta gttcctgagg taaatccaag 35355
ccagccatga taaaaagctc gcgcagagcg ccctccaccg gcattcttaa gcacaccctc 35415
ataattccaa gagattctgc tcctggttca cctgcagcag attaacaagg ggaatatcaa 35475
aatctctgcc gcgatctcta agctcctccc tcagcaataa ctgcaagtac tctttcatat 35535
cttctccgaa atttttagcc atagggccgc caggaatgag agcagggcaa gccacattac 35595
agataaagcg aagtcctccc cagtgagcat tgccaaatgt aagattgaaa taagcatgct 35655
ggctagaccc ggtgatatct tccagataac tggacagaaa atcaggcaag caatttttaa 35715
gaaaatcaac aaaagaaaag tcgtccaggt gcaagtttag agcctcagga acaacgatgg 35775
aataagtgca aggagtgcgt tccagcatgg ttagtgtttt tttggtgatc tgtagaacaa 35835
aaaataaaca tgcaatatta aaccatgcta gcctggcgaa caggtgggta aatcactctt 35895
tccagcacca ggcaggctac ggggtctccg gcgcgaccct cgtagaagct gtcgccatga 35955
ttgaaaagca tcaccgaaag actttcccgg tggccggcat ggatgattcg cgaagacgcg 36015
tacactccgg gaacattggc atccgtgagt gaaaaaaatc gccccaagaa gccccgaggc 36075
actacaatgc tcaaccttaa ttccagcaga gcgaccccat gcggatgaag cacaaaattg 36135
gtaggtgcgt aaaaaatgta attactcccc tcctgcacag gcagcaaagc ccccgctccc 36195
tccagaaaca catacaaagc ctcagcgtcc atagcttacc gagcacggca ggcgcaagat 36255
tcagagaaaa ggctgagctc taacctgact gcccgctcct gagctcaata tatagcccta 36315
acctacactg acgtaaaggc caaagtctaa aaatacccgc caaaatgaca cacacgccca 36375
gcacacgccc agaaaccggt gacacactca aaaaaatacg tgcgcttcct caaacgccca 36435
aaccggcgtc atttccgggt tcccacgcta cgtcaccgct cagcgacttt caaatttcgt 36495
cgaccgttaa acacgtcact cgccccgccc ctaacggtcg ccctcctctc ggccaatcac 36555
agccccgcat ccccaaattc aaacgcctca tttgcatatt aacgcgcaca aaaagtttga 36615
ggtatattat ttgatgatg 36634
<210> 34
<211> 503
<212> PRT
<213> Simian adenovirus 37
<400> 34
Met Glu Ser Arg Asn Pro Phe Gln Gln Gly Leu Pro Ser Gly Leu Leu
1 5 10 15
Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn
20 25 30
Leu Arg Leu Leu Ala Ser Thr Ala Gly Arg His Ala Glu Asp Pro Glu
35 40 45
Ser Pro Val Thr Pro Gly Thr Pro Thr Pro Pro Ala Ala Ala Ala Gly
50 55 60
Ala Ala Ala Arg Gly Gly Gly Pro Arg Arg Glu Pro Glu Ser Arg Ser
65 70 75 80
Gly Pro Ser Gly Gly Gly Gly Gly Gly Val Ala Asp Leu Phe Pro Glu
85 90 95
Leu Arg Arg Val Leu Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly Ile
100 105 110
Lys Arg Glu Arg His Glu Glu Thr Ser His Arg Thr Glu Leu Thr Val
115 120 125
Ser Leu Met Ser Arg Arg Arg Pro Glu Ser Val Trp Trp His Glu Val
130 135 140
Gln Ser Gln Gly Val Asp Glu Val Ser Val Met His Glu Lys Tyr Ser
145 150 155 160
Leu Glu Gln Val Lys Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp Glu
165 170 175
Val Ala Ile Arg Asn Tyr Ala Lys Leu Ala Leu Arg Pro Asp Lys Lys
180 185 190
Tyr Lys Ile Thr Lys Leu Ile Asn Ile Arg Asn Ser Cys Tyr Ile Ser
195 200 205
Gly Asn Gly Ala Glu Val Glu Ile Ser Thr Gln Glu Arg Val Ala Phe
210 215 220
Arg Cys Cys Met Met Asn Met Tyr Pro Gly Val Val Gly Met Glu Gly
225 230 235 240
Val Thr Phe Met Asn Ala Arg Phe Arg Gly Asp Gly Tyr Asn Gly Val
245 250 255
Val Phe Met Ala Asn Thr Lys Leu Thr Val His Gly Cys Ser Phe Phe
260 265 270
Gly Phe Asn Asn Met Cys Ile Glu Ala Trp Gly Ser Val Ser Val Arg
275 280 285
Gly Cys Ser Phe Ser Ala Asn Trp Met Gly Val Val Gly Arg Thr Lys
290 295 300
Ser Lys Val Ser Val Lys Lys Cys Leu Phe Glu Arg Cys His Leu Gly
305 310 315 320
Val Met Ser Glu Gly Glu Ala Lys Val Lys His Cys Ala Ser Thr Glu
325 330 335
Thr Gly Cys Phe Val Cys Ile Lys Gly Asn Ala Gln Val Lys His Asn
340 345 350
Met Ile Cys Gly Ala Ser Asp Glu Arg Gly Tyr Gln Met Leu Thr Cys
355 360 365
Ala Gly Gly Asn Ser His Met Leu Ala Thr Val His Val Ala Ser His
370 375 380
Pro Arg Lys Thr Trp Pro Glu Phe Glu His Asn Val Met Thr Arg Cys
385 390 395 400
Asn Val His Leu Gly Ser Arg Arg Gly Met Phe Met Pro Tyr Gln Cys
405 410 415
Asn Met Gln Phe Val Lys Val Leu Leu Glu Pro Asp Ala Met Ser Arg
420 425 430
Val Ser Leu Ala Gly Val Phe Asp Met Asn Val Glu Leu Trp Lys Ile
435 440 445
Leu Arg Tyr Asp Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys Gly
450 455 460
Gly Lys His Ala Arg Leu Gln Pro Val Cys Val Glu Val Thr Glu Asp
465 470 475 480
Leu Arg Pro Asp His Leu Val Leu Ser Cys Asn Gly Thr Glu Phe Gly
485 490 495
Ser Ser Gly Glu Glu Ser Asp
500
<210> 35
<211> 157
<212> PRT
<213> Simian adenovirus 37
<400> 35
Met Arg Gly Arg Met Thr Lys Ile Cys Val Phe Leu Cys Ser Ser Met
1 5 10 15
Ser Gly Ser Ala Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu Thr
20 25 30
Gly Arg Leu Pro Ser Trp Ala Gly Val Arg Gln Asn Val Met Gly Ser
35 40 45
Thr Val Asp Gly Arg Pro Val Gln Pro Ala Asn Ser Ser Thr Leu Thr
50 55 60
Tyr Ala Thr Leu Ser Ser Ser Ser Val Asp Ala Ala Ala Ala Ala Ala
65 70 75 80
Ala Ala Ser Ala Ala Ser Ala Val Arg Gly Met Ala Leu Gly Ala Gly
85 90 95
Tyr Tyr Ser Ser Leu Val Ala Asn Ser Ser Ser Thr Asn Asn Pro Ala
100 105 110
Ser Leu Asn Glu Glu Lys Leu Leu Leu Leu Met Ala Gln Leu Glu Ala
115 120 125
Leu Thr Gln Arg Leu Gly Glu Leu Thr Gln Gln Val Ala Gln Leu Gln
130 135 140
Ala Glu Thr Arg Ala Ala Val Ala Thr Val Lys Thr Lys
145 150 155
<210> 36
<211> 392
<212> PRT
<213> Simian adenovirus 37
<400> 36
Met His Pro Val Leu Arg Gln Met Arg Pro His His Pro Pro Pro Gln
1 5 10 15
Gln Gln Pro Pro Pro Gln Pro Ala Leu Leu Pro Pro Pro Gln Gln Gln
20 25 30
Leu Pro Ala Thr Thr Ala Ala Ala Ala Val Ser Gly Ala Gly Gln Ser
35 40 45
Tyr Asp His Gln Leu Ala Leu Glu Glu Gly Glu Gly Leu Ala Arg Leu
50 55 60
Gly Ala Ser Ser Pro Glu Arg His Pro Arg Val Gln Met Lys Arg Asp
65 70 75 80
Ala Arg Glu Ala Tyr Val Pro Lys Gln Asn Leu Phe Arg Asp Arg Ser
85 90 95
Gly Glu Glu Pro Glu Glu Met Arg Ala Ala Arg Phe His Ala Gly Arg
100 105 110
Glu Leu Arg Arg Gly Leu Asp Arg Lys Arg Val Leu Arg Asp Glu Asp
115 120 125
Phe Glu Ala Asp Glu Leu Thr Gly Ile Ser Pro Ala Arg Ala His Val
130 135 140
Ala Ala Ala Asn Leu Val Thr Ala Tyr Glu Gln Thr Val Lys Glu Glu
145 150 155 160
Ser Asn Phe Gln Lys Ser Phe Asn Asn His Val Arg Thr Leu Ile Ala
165 170 175
Arg Glu Glu Val Thr Leu Gly Leu Met His Leu Trp Asp Leu Leu Glu
180 185 190
Ala Ile Val Gln Asn Pro Thr Ser Lys Pro Leu Thr Ala Gln Leu Phe
195 200 205
Leu Val Val Gln His Ser Arg Asp Asn Glu Thr Phe Arg Glu Ala Leu
210 215 220
Leu Asn Ile Thr Glu Pro Glu Gly Arg Trp Leu Leu Asp Leu Val Asn
225 230 235 240
Ile Leu Gln Ser Ile Val Val Gln Glu Arg Gly Leu Pro Leu Ser Glu
245 250 255
Lys Leu Ala Ala Ile Asn Phe Ser Val Leu Ser Leu Gly Lys Tyr Tyr
260 265 270
Ala Arg Lys Ile Tyr Lys Thr Pro Tyr Val Pro Ile Asp Lys Glu Val
275 280 285
Lys Ile Asp Gly Phe Tyr Met Arg Met Thr Leu Lys Val Leu Thr Leu
290 295 300
Ser Asp Asp Leu Gly Val Tyr Arg Asn Asp Arg Met His Arg Ala Val
305 310 315 320
Ser Ala Ser Arg Arg Arg Glu Leu Ser Asp Gln Glu Leu Met His Ser
325 330 335
Leu Gln Arg Ala Leu Thr Gly Ala Gly Thr Glu Gly Glu Ser Tyr Phe
340 345 350
Asp Met Gly Ala Asp Leu Arg Trp Gln Pro Ser Arg Arg Ala Leu Glu
355 360 365
Ala Ala Gly Gly Ser Pro Tyr Val Glu Glu Val Asp Asp Glu Val Asp
370 375 380
Glu Glu Gly Glu Tyr Leu Glu Asp
385 390
<210> 37
<211> 593
<212> PRT
<213> Simian adenovirus 37
<400> 37
Met Gln Gln Gln Pro Pro Pro Asp Pro Ala Met Arg Ala Ala Leu Gln
1 5 10 15
Ser Gln Pro Ser Gly Ile Asn Ser Ser Asp Asp Trp Thr Gln Ala Met
20 25 30
Gln Arg Ile Met Ala Leu Thr Thr Arg Asn Pro Glu Ala Phe Arg Gln
35 40 45
Gln Pro Gln Ala Asn Arg Leu Ser Ala Ile Leu Glu Ala Val Val Pro
50 55 60
Ser Arg Ser Asn Pro Thr His Glu Lys Val Leu Ala Ile Val Asn Ala
65 70 75 80
Leu Val Glu Asn Lys Ala Ile Arg Gly Asp Glu Ala Gly Leu Val Tyr
85 90 95
Asn Ala Leu Leu Glu Arg Val Ala Arg Tyr Asn Ser Thr Asn Val Gln
100 105 110
Thr Asn Leu Asp Arg Met Val Thr Asp Val Arg Glu Ala Val Ala Gln
115 120 125
Arg Glu Arg Phe His Arg Glu Ser Asn Leu Gly Ser Met Val Ala Leu
130 135 140
Asn Ala Phe Leu Ser Thr Gln Pro Ala Asn Val Pro Arg Gly Gln Glu
145 150 155 160
Asp Tyr Thr Asn Phe Ile Ser Ala Leu Arg Leu Met Val Thr Glu Val
165 170 175
Pro Gln Ser Glu Val Tyr Gln Ser Gly Pro Asp Tyr Phe Phe Gln Thr
180 185 190
Ser Arg Gln Gly Leu Gln Thr Val Asn Leu Ser Gln Ala Phe Lys Asn
195 200 205
Leu Gln Gly Leu Trp Gly Val Gln Ala Pro Val Gly Asp Arg Ala Thr
210 215 220
Val Ser Ser Leu Leu Thr Pro Asn Ser Arg Leu Leu Leu Leu Leu Val
225 230 235 240
Ala Pro Phe Thr Asp Ser Gly Ser Ile Asn Arg Asn Ser Tyr Leu Gly
245 250 255
Tyr Leu Ile Asn Leu Tyr Arg Glu Ala Ile Gly Gln Ala His Val Asp
260 265 270
Glu Gln Thr Tyr Gln Glu Ile Thr His Val Ser Arg Ala Leu Gly Gln
275 280 285
Asp Asp Pro Gly Asn Leu Glu Ala Thr Leu Asn Phe Leu Leu Thr Asn
290 295 300
Arg Ser Gln Lys Ile Pro Pro Gln Tyr Ala Leu Ser Thr Glu Glu Glu
305 310 315 320
Arg Ile Leu Arg Tyr Val Gln Gln Ser Val Gly Leu Phe Leu Met Gln
325 330 335
Glu Gly Ala Thr Pro Ser Ala Ala Leu Asp Met Thr Ala Arg Asn Met
340 345 350
Glu Pro Ser Met Tyr Ala Ser Asn Arg Pro Phe Ile Asn Lys Leu Met
355 360 365
Asp Tyr Leu His Arg Ala Ala Ala Met Asn Ser Asp Tyr Phe Thr Asn
370 375 380
Ala Ile Leu Asn Pro His Trp Leu Pro Pro Pro Gly Phe Tyr Thr Gly
385 390 395 400
Glu Tyr Asp Met Pro Asp Pro Asn Asp Gly Phe Leu Trp Asp Asp Val
405 410 415
Asp Ser Ser Val Phe Ser Pro Arg Pro Gly Ala Asn Glu Arg Pro Leu
420 425 430
Trp Lys Lys Glu Gly Ser Asp Arg Arg Pro Ser Ser Ala Leu Ser Gly
435 440 445
Arg Glu Gly Ala Ala Ala Ala Val Pro Glu Ala Ala Ser Pro Phe Pro
450 455 460
Ser Leu Pro Phe Ser Leu Asn Ser Ile Arg Ser Ser Glu Leu Gly Arg
465 470 475 480
Ile Thr Arg Pro Arg Leu Leu Gly Glu Glu Glu Tyr Leu Asn Asp Ser
485 490 495
Leu Leu Arg Pro Glu Arg Glu Lys Asn Phe Pro Asn Asn Gly Ile Glu
500 505 510
Ser Leu Val Asp Lys Met Ser Arg Trp Lys Thr Tyr Ala Gln Glu His
515 520 525
Arg Asp Asp Pro Arg Ala Ser Gln Gly Ala Thr Ser Arg Gly Ser Ala
530 535 540
Ala Arg Lys Arg Arg Trp His Asp Arg Gln Arg Gly Leu Met Trp Asp
545 550 555 560
Asp Glu Asp Ser Ala Asp Asp Ser Ser Val Leu Asp Leu Gly Gly Ser
565 570 575
Gly Gly Gly Asn Pro Phe Ala His Leu Arg Pro Arg Ile Gly Arg Met
580 585 590
Met
<210> 38
<211> 542
<212> PRT
<213> Simian adenovirus 37
<400> 38
Met Met Arg Arg Ala Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser
1 5 10 15
Val Met Gln Gln Ala Met Ala Ala Ala Ala Ala Met Gln Pro Pro Leu
20 25 30
Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg
35 40 45
Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg
50 55 60
Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr
65 70 75 80
Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp
85 90 95
Phe Thr Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg
100 105 110
Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn Met Pro
115 120 125
Asn Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met
130 135 140
Val Ser Arg Lys Thr Pro Asn Gly Val Lys Val Asp Asp Thr Tyr Asp
145 150 155 160
Gly Ser Gln Asp Glu Leu Lys Tyr Glu Trp Val Glu Phe Glu Leu Pro
165 170 175
Glu Gly Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala
180 185 190
Ile Ile Asp Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu
195 200 205
Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp
210 215 220
Asp Pro Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala
225 230 235 240
Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr
245 250 255
Glu Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe
260 265 270
Gln Glu Gly Phe Gln Ile Met Tyr Glu Asp Leu Glu Gly Gly Asn Ile
275 280 285
Pro Ala Leu Leu Asp Val Glu Ala Tyr Glu Asp Ser Lys Lys Lys Ala
290 295 300
Glu Ala Glu Ala Thr Ala Ala Val Ala Thr Ala Ala Thr Asn Ala Asp
305 310 315 320
Ala Asn Val Thr Arg Gly Asp Thr Phe Ala Thr Gln Ala Glu Glu Ala
325 330 335
Ala Ala Leu Ala Val Ala Asp Asp Ser Glu Ser Lys Ile Val Ile Gln
340 345 350
Pro Val Lys Lys Asp Ser Lys Asn Arg Ser Tyr Asn Val Leu Pro Asp
355 360 365
Glu Val Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly
370 375 380
Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr Ser Asp
385 390 395 400
Val Thr Cys Gly Val Glu Gln Val Tyr Trp Ser Leu Pro Asp Met Met
405 410 415
Gln Asp Pro Val Thr Phe Arg Ser Thr Arg Gln Val Ser Asn Tyr Pro
420 425 430
Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser Phe Phe Asn
435 440 445
Glu Gln Ala Val Tyr Ser Gln Gln Leu Arg Ala Phe Thr Ser Leu Thr
450 455 460
His Val Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro
465 470 475 480
Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp
485 490 495
His Gly Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly Val Gln Arg Val
500 505 510
Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala
515 520 525
Leu Gly Ile Val Ala Pro Arg Val Leu Ser Ser Arg Thr Phe
530 535 540
<210> 39
<211> 193
<212> PRT
<213> Simian adenovirus 37
<400> 39
Met Ser Ile Leu Ile Ser Pro Ser Asn Asn Thr Gly Trp Gly Leu Arg
1 5 10 15
Ala Pro Ser Lys Met Tyr Gly Gly Ala Arg Gln Arg Ser Thr Gln His
20 25 30
Pro Val Arg Val Arg Gly His Phe Arg Ala Pro Trp Gly Ala Leu Lys
35 40 45
Gly Arg Val Arg Ser Arg Thr Thr Val Asp Asp Val Ile Asp Gln Val
50 55 60
Val Ala Asp Ala Arg Asn Tyr Thr Pro Ala Ala Ala Pro Val Ser Thr
65 70 75 80
Val Asp Ala Val Ile Asp Ser Val Val Ala Asp Ala Arg Arg Tyr Ala
85 90 95
Arg Ala Lys Ser Arg Arg Arg Arg Ile Ala Arg Arg His Arg Ser Thr
100 105 110
Pro Ala Met Arg Ala Ala Arg Ala Leu Leu Arg Arg Ala Arg Arg Thr
115 120 125
Gly Arg Arg Ala Met Leu Arg Ala Ala Arg Arg Ala Ala Ser Gly Ala
130 135 140
Ser Ala Gly Arg Thr Arg Arg Arg Ala Ala Thr Ala Ala Ala Ala Ala
145 150 155 160
Ile Ala Ser Met Ser Arg Pro Arg Arg Gly Asn Val Tyr Trp Val Arg
165 170 175
Asp Ala Ala Thr Gly Val Arg Val Pro Val Arg Thr Arg Pro Pro Arg
180 185 190
Thr
<210> 40
<211> 346
<212> PRT
<213> Simian adenovirus 37
<400> 40
Met Ser Lys Arg Lys Phe Lys Glu Glu Met Leu Gln Val Ile Ala Pro
1 5 10 15
Glu Ile Tyr Gly Pro Ala Ala Val Val Lys Glu Glu Arg Lys Pro Arg
20 25 30
Lys Ile Lys Arg Val Lys Lys Asp Lys Lys Glu Glu Asp Val Asp Asp
35 40 45
Met Val Glu Phe Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gln Trp
50 55 60
Arg Gly Arg Lys Val Arg Pro Val Leu Arg Pro Gly Thr Thr Val Val
65 70 75 80
Phe Thr Pro Gly Glu Arg Ser Gly Thr Ala Ser Lys Arg Ser Tyr Asp
85 90 95
Glu Val Tyr Gly Asp Asp Asp Ile Leu Glu Gln Ala Ala Glu Arg Leu
100 105 110
Gly Glu Phe Ala Tyr Gly Lys Arg Ser Arg Pro Ala Pro Leu Lys Glu
115 120 125
Glu Ala Val Ser Ile Pro Leu Asp His Gly Asn Pro Thr Pro Ser Leu
130 135 140
Lys Pro Val Thr Leu Gln Gln Val Leu Pro Ser Ala Ala Pro Arg Arg
145 150 155 160
Gly Phe Lys Arg Glu Gly Gly Glu Asp Leu Tyr Pro Thr Met Gln Leu
165 170 175
Met Val Pro Lys Arg Gln Lys Leu Glu Asp Val Leu Glu His Met Lys
180 185 190
Val Asp Pro Glu Val Gln Pro Glu Val Lys Val Arg Pro Ile Lys Gln
195 200 205
Val Ala Pro Gly Leu Gly Val Gln Thr Val Asp Ile Lys Ile Pro Thr
210 215 220
Glu Pro Met Glu Thr Gln Thr Glu Pro Val Lys Pro Ser Thr Ser Thr
225 230 235 240
Met Glu Val Gln Thr Asp Pro Trp Met Pro Ala Pro Ala Ser Thr Thr
245 250 255
Thr Arg Arg Arg Arg Lys Tyr Gly Ala Ala Ser Leu Leu Met Pro Asn
260 265 270
Tyr Ala Leu His Pro Ser Ile Ile Pro Thr Pro Gly Tyr Arg Gly Thr
275 280 285
Arg Phe Tyr Arg Gly Tyr Ser Ser Arg Arg Lys Thr Thr Thr Arg Arg
290 295 300
Arg Arg Arg Arg Thr Arg Arg Ser Thr Thr Ala Thr Ser Ala Ala Ala
305 310 315 320
Leu Val Arg Arg Val Tyr Arg Ser Gly Arg Glu Pro Leu Thr Leu Pro
325 330 335
Arg Ala Arg Tyr His Pro Ser Ile Ala Ile
340 345
<210> 41
<211> 77
<212> PRT
<213> Simian adenovirus 37
<400> 41
Met Ala Leu Thr Cys Arg Leu Arg Val Pro Ile Thr Gly Tyr Arg Gly
1 5 10 15
Arg Lys Pro Arg Arg Arg Arg Leu Thr Gly Asn Gly Leu Arg Arg His
20 25 30
His His Arg Arg Arg Arg Ala Ile Ser Lys Arg Leu Gly Gly Gly Phe
35 40 45
Leu Pro Ala Leu Ile Pro Ile Ile Ala Ala Ala Ile Gly Ala Ile Pro
50 55 60
Gly Ile Ala Ser Val Ala Val Gln Ala Ser Gln Arg His
65 70 75
<210> 42
<211> 241
<212> PRT
<213> Simian adenovirus 37
<400> 42
Met Glu Asp Ile Asn Phe Ser Ser Leu Ala Pro Arg His Gly Thr Arg
1 5 10 15
Pro Phe Met Gly Thr Trp Ser Asp Ile Gly Asn Ser Gln Leu Asn Gly
20 25 30
Gly Ala Phe Asn Trp Ser Ser Leu Trp Ser Gly Leu Lys Asn Phe Gly
35 40 45
Ser Thr Leu Lys Thr Tyr Gly Asn Lys Ala Trp Asn Ser Ser Thr Gly
50 55 60
Gln Ala Leu Arg Glu Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys Val
65 70 75 80
Val Asp Gly Leu Ala Ser Gly Ile Asn Gly Val Val Asp Leu Ala Asn
85 90 95
Gln Ala Val Gln Lys Gln Ile Asn Ser Arg Leu Asp Ala Val Pro Pro
100 105 110
Ala Gly Ser Val Glu Met Pro Gln Val Glu Glu Glu Leu Pro Pro Leu
115 120 125
Asp Lys Arg Gly Asp Lys Arg Pro Arg Pro Asp Ala Glu Glu Thr Leu
130 135 140
Leu Thr His Thr Asp Glu Pro Pro Pro Tyr Glu Glu Ala Val Lys Leu
145 150 155 160
Gly Leu Pro Thr Thr Arg Pro Val Ala Pro Leu Ala Thr Gly Val Leu
165 170 175
Lys Pro Ser Ser Ser Ser Ser Gln Pro Ala Thr Leu Asp Leu Pro Pro
180 185 190
Pro Arg Pro Ser Thr Val Ala Lys Pro Leu Pro Pro Val Ala Val Ala
195 200 205
Ser Arg Ala Leu Arg Gly Arg Pro Gln Ala Asn Trp Gln Ser Thr Leu
210 215 220
Asn Ser Ile Val Gly Leu Gly Val Gln Ser Val Lys Arg Arg Arg Cys
225 230 235 240
Tyr
<210> 43
<211> 942
<212> PRT
<213> Simian adenovirus 37
<400> 43
Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr
65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Ser Ser Gln Trp Glu Gln Ala Lys Thr Gly Asn Gly Gly
130 135 140
Thr Met Glu Thr His Thr Tyr Gly Val Ala Pro Met Gly Gly Glu Asn
145 150 155 160
Ile Thr Lys Asp Gly Leu Gln Ile Gly Thr Asp Val Thr Ala Asn Gln
165 170 175
Asn Lys Pro Ile Tyr Ala Asp Lys Thr Phe Gln Pro Glu Pro Gln Val
180 185 190
Gly Glu Glu Asn Trp Gln Glu Thr Glu Asn Phe Tyr Gly Gly Arg Ala
195 200 205
Leu Lys Lys Asp Thr Asn Met Lys Pro Cys Tyr Gly Ser Tyr Ala Arg
210 215 220
Pro Thr Asn Glu Lys Gly Gly Gln Ala Lys Leu Lys Val Gly Asp Asp
225 230 235 240
Gly Val Pro Thr Lys Glu Phe Asp Ile Asp Leu Ala Phe Phe Asp Thr
245 250 255
Pro Gly Gly Thr Val Asn Gly Gln Asp Glu Tyr Lys Ala Asp Ile Val
260 265 270
Met Tyr Thr Glu Asn Thr Tyr Leu Glu Thr Pro Asp Thr His Val Val
275 280 285
Tyr Lys Pro Gly Lys Asp Asp Ala Ser Ser Glu Ile Asn Leu Val Gln
290 295 300
Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe
305 310 315 320
Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala
325 330 335
Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn
340 345 350
Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr
355 360 365
Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp
370 375 380
Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr
385 390 395 400
Cys Phe Pro Leu Asp Gly Ser Gly Thr Asn Ala Ala Tyr Gln Gly Val
405 410 415
Lys Val Lys Asn Gly Gln Asp Gly Asp Val Glu Ser Glu Trp Glu Lys
420 425 430
Asp Asp Thr Val Ala Ala Gln Asn Gln Leu Cys Lys Gly Asn Ile Phe
435 440 445
Ala Met Glu Ile Asn Leu Gln Ala Asn Leu Trp Arg Ser Phe Leu Tyr
450 455 460
Ser Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro Thr
465 470 475 480
Asn Val Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly
485 490 495
Arg Val Thr Pro Pro Ser Leu Val Asp Ala Tyr Ile Asn Ile Gly Ala
500 505 510
Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn Pro Phe Asn His His
515 520 525
Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg
530 535 540
Tyr Val Pro Phe His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys
545 550 555 560
Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg
565 570 575
Lys Asp Val Asn Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg
580 585 590
Thr Asp Gly Ala Ser Ile Ser Phe Thr Ser Ile Asn Leu Tyr Ala Thr
595 600 605
Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu
610 615 620
Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala
625 630 635 640
Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser
645 650 655
Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg
660 665 670
Leu Lys Thr Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr
675 680 685
Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu
690 695 700
Asn His Thr Phe Lys Lys Val Ser Ile Thr Phe Asp Ser Ser Val Ser
705 710 715 720
Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys
725 730 735
Arg Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr
740 745 750
Lys Asp Trp Phe Leu Val Gln Met Leu Ala His Tyr Asn Ile Gly Tyr
755 760 765
Gln Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe
770 775 780
Phe Arg Asn Phe Gln Pro Met Ser Arg Gln Val Val Asp Glu Val Asn
785 790 795 800
Tyr Lys Asp Tyr Gln Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser
805 810 815
Gly Phe Val Gly Tyr Leu Ala Pro Thr Met Arg Gln Gly Gln Pro Tyr
820 825 830
Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Thr Ser
835 840 845
Val Thr Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro
850 855 860
Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln
865 870 875 880
Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe Glu
885 890 895
Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe Glu Val
900 905 910
Phe Asp Val Val Arg Val His Gln Pro His Arg Gly Val Ile Glu Ala
915 920 925
Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
930 935 940
<210> 44
<211> 209
<212> PRT
<213> Simian adenovirus 37
<400> 44
Met Met Ala Glu Pro Thr Gly Ser Gly Glu Gln Glu Leu Arg Ala Ile
1 5 10 15
Ile Arg Asp Leu Gly Cys Gly Pro Tyr Phe Leu Gly Thr Phe Asp Lys
20 25 30
Arg Phe Pro Gly Phe Met Ala Pro His Lys Leu Ala Cys Ala Ile Val
35 40 45
Asn Thr Ala Gly Arg Glu Thr Gly Gly Glu His Trp Leu Ala Phe Ala
50 55 60
Trp Asn Pro Arg Ser Asn Thr Cys Tyr Leu Phe Asp Pro Phe Gly Phe
65 70 75 80
Ser Asp Glu Arg Leu Lys Gln Ile Tyr Gln Phe Glu Tyr Glu Gly Leu
85 90 95
Leu Arg Arg Ser Ala Leu Ala Thr Glu Asp Arg Cys Val Thr Leu Glu
100 105 110
Lys Ser Thr Gln Thr Val Gln Gly Pro Arg Ser Ala Ala Cys Gly Leu
115 120 125
Phe Cys Cys Met Phe Leu His Ala Phe Val His Trp Pro Asp Arg Pro
130 135 140
Met Asp Lys Asn Pro Thr Met Asn Leu Leu Thr Gly Val Pro Asn Gly
145 150 155 160
Met Leu Gln Ser Pro Gln Val Glu Pro Thr Leu Arg Arg Asn Gln Glu
165 170 175
Ala Leu Tyr Arg Phe Leu Asn Ala His Ser Ala Tyr Phe Arg Ser His
180 185 190
Arg Ala Arg Ile Glu Lys Ala Thr Ala Phe Asp Arg Met Asn Gln Asp
195 200 205
Met
<210> 45
<211> 800
<212> PRT
<213> Simian adenovirus 37
<400> 45
Met Glu Thr Gln Pro Ser Ser Pro Thr Ser Pro Ser Ala Pro Ala Ala
1 5 10 15
Thr Ala Asp Glu Asn Gln Gln Gln Gln Asn Glu Ser Leu Thr Ala Pro
20 25 30
Pro Pro Ser Pro Thr Ser Asp Ala Ala Ala Pro Asp Met Gln Glu Met
35 40 45
Glu Glu Ser Ile Glu Ile Asp Leu Gly Tyr Val Thr Pro Ala Glu His
50 55 60
Glu Glu Glu Leu Ala Ala Arg Phe Ser Ala Pro Glu Glu Asn His Gln
65 70 75 80
Glu Gln Pro Glu Gln Glu Ala Glu Ser Glu Gln Asn Gln Ala Gly Leu
85 90 95
Glu His Gly Asp Tyr Leu Ser Gly Ala Glu Asp Val Leu Ile Lys His
100 105 110
Leu Ala Arg Gln Cys Ile Ile Val Lys Asp Ala Leu Leu Asp Arg Ala
115 120 125
Glu Val Pro Leu Ser Val Ala Glu Leu Ser Arg Ala Tyr Glu Arg Asn
130 135 140
Leu Phe Ser Pro Arg Val Pro Pro Lys Arg Gln Pro Asn Gly Thr Cys
145 150 155 160
Glu Pro Asn Pro Arg Leu Asn Phe Tyr Pro Val Phe Ala Val Pro Glu
165 170 175
Ala Leu Ala Thr Tyr His Leu Phe Phe Lys Asn Gln Arg Ile Pro Val
180 185 190
Ser Cys Arg Ala Asn Arg Thr Arg Ala Asp Ala Leu Leu Asn Leu Gly
195 200 205
Pro Gly Ala Arg Leu Pro Asp Ile Thr Ser Leu Glu Glu Val Pro Lys
210 215 220
Ile Phe Glu Gly Leu Gly Ser Asp Glu Thr Arg Ala Ala Asn Ala Leu
225 230 235 240
Gln Gly Ser Gly Glu Glu His Glu His His Ser Ala Leu Val Glu Leu
245 250 255
Glu Gly Asp Asn Ala Arg Leu Ala Val Leu Lys Arg Thr Val Glu Leu
260 265 270
Thr His Phe Ala Tyr Pro Ala Leu Asn Leu Pro Pro Lys Val Met Ser
275 280 285
Ala Val Met Asp Gln Val Leu Ile Lys Arg Ala Ser Pro Leu Ser Glu
290 295 300
Glu Glu Met Gln Asp Pro Glu Ser Ser Asp Glu Gly Lys Pro Val Val
305 310 315 320
Ser Asp Glu Gln Leu Ala Arg Trp Leu Gly Ala Ser Ser Thr Pro Gln
325 330 335
Ser Leu Glu Glu Arg Arg Lys Leu Met Met Ala Val Val Leu Val Thr
340 345 350
Val Glu Leu Glu Cys Leu Arg Arg Phe Phe Ala Asp Ala Glu Thr Leu
355 360 365
Arg Lys Val Glu Glu Asn Leu His Tyr Leu Phe Arg His Gly Phe Val
370 375 380
Arg Gln Ala Cys Lys Ile Ser Asn Val Glu Leu Thr Asn Leu Val Ser
385 390 395 400
Tyr Met Gly Ile Leu His Glu Asn Arg Leu Gly Gln Asn Val Leu His
405 410 415
Thr Thr Leu Arg Gly Glu Ala Arg Arg Asp Tyr Ile Arg Asp Cys Val
420 425 430
Tyr Leu Tyr Leu Cys His Thr Trp Gln Thr Gly Met Gly Val Trp Gln
435 440 445
Gln Cys Leu Glu Glu Gln Asn Leu Lys Glu Leu Cys Lys Leu Leu Gln
450 455 460
Lys Asn Leu Lys Ala Leu Trp Thr Gly Phe Asp Glu Arg Thr Thr Ala
465 470 475 480
Ser Asp Leu Ala Asp Leu Ile Phe Pro Glu Arg Leu Arg Leu Thr Leu
485 490 495
Arg Asn Gly Leu Pro Asp Phe Met Ser Gln Ser Met Leu Gln Asn Phe
500 505 510
Arg Ser Phe Ile Leu Glu Arg Ser Gly Ile Leu Pro Ala Thr Cys Ser
515 520 525
Ala Leu Pro Ser Asp Phe Val Pro Leu Thr Phe Arg Glu Cys Pro Pro
530 535 540
Pro Leu Trp Ser His Cys Tyr Leu Leu Arg Leu Ala Asn Tyr Leu Ala
545 550 555 560
Tyr His Ser Asp Val Ile Glu Asp Val Ser Gly Glu Gly Leu Leu Glu
565 570 575
Cys His Cys Arg Cys Asn Leu Cys Thr Pro His Arg Ser Leu Ala Cys
580 585 590
Asn Pro Gln Leu Leu Ser Glu Thr Gln Ile Ile Gly Thr Phe Glu Leu
595 600 605
Gln Gly Pro Gly Glu Glu Gly Lys Gly Gly Leu Lys Leu Thr Pro Gly
610 615 620
Leu Trp Thr Ser Ala Tyr Leu Arg Lys Phe Val Pro Glu Asp Tyr His
625 630 635 640
Pro Phe Glu Ile Arg Phe Tyr Glu Asp Gln Ser Gln Pro Pro Lys Ala
645 650 655
Glu Leu Ser Ala Cys Val Ile Thr Gln Gly Ala Ile Leu Ala Gln Leu
660 665 670
Gln Ala Ile Gln Lys Ser Arg Gln Glu Phe Leu Leu Lys Lys Gly His
675 680 685
Gly Val Tyr Leu Asp Pro Gln Thr Gly Glu Glu Leu Asn Pro Ser Phe
690 695 700
Pro Gln Asp Ala Pro Arg Lys Gln Gln Glu Ala Glu Ser Gly Ala Ala
705 710 715 720
Ala Ala Ala Gly Gly Phe Gly Gly Arg Leu Gly Glu Gln Ser Gly Arg
725 730 735
Gly Asp Gly Arg Leu Gly Gln His Ser Gly Arg Gly Gly Gln Pro Ala
740 745 750
Arg Gln Ser Gly Gly Gly Arg Arg Gly Gly Gly Gly Gly Arg Gly Arg
755 760 765
Ser Ser Arg Arg Gln Thr Val Val Leu Gly Gly Glu Ser Lys Gln His
770 775 780
Gly Tyr His Leu Arg Ser Gly Ser Gly Ser Arg Arg Pro Gly Pro Gln
785 790 795 800
<210> 46
<211> 227
<212> PRT
<213> Simian adenovirus 37
<400> 46
Met Ser Lys Glu Ile Pro Thr Pro Tyr Met Trp Ser Tyr Gln Pro Gln
1 5 10 15
Met Gly Leu Ala Ala Gly Ala Ala Gln Asp Tyr Ser Thr Arg Met Asn
20 25 30
Trp Leu Ser Ala Gly Pro Ala Met Ile Ser Arg Val Asn Asp Ile Arg
35 40 45
Ala Arg Arg Asn Gln Ile Leu Leu Glu Gln Ser Ala Ile Thr Ala Thr
50 55 60
Pro Arg His His Leu Asn Pro Arg Asn Trp Pro Ala Ala Leu Val Tyr
65 70 75 80
Gln Glu Ile Pro Gln Pro Thr Thr Val Leu Leu Pro Arg Asp Ala Gln
85 90 95
Ala Glu Val Gln Leu Thr Asn Ser Gly Val Gln Leu Ala Gly Gly Ala
100 105 110
Ala Leu Cys Arg His Arg Pro Ala Gln Gly Ile Lys Arg Leu Val Ile
115 120 125
Arg Gly Arg Gly Thr Gln Leu Asn Asp Glu Val Val Ser Ser Ser Leu
130 135 140
Gly Leu Arg Pro Asp Gly Val Phe Gln Leu Ala Gly Ser Gly Arg Ser
145 150 155 160
Ser Phe Thr Pro Arg Gln Ala Val Leu Thr Leu Glu Ser Ser Ser Ser
165 170 175
Gln Pro Arg Ser Gly Gly Ile Gly Thr Leu Gln Phe Val Glu Glu Phe
180 185 190
Thr Pro Ser Val Tyr Phe Asn Pro Phe Ser Gly Ser Pro Gly His Tyr
195 200 205
Pro Asp Glu Phe Ile Pro Asn Phe Asp Ala Ile Ser Glu Ser Val Asp
210 215 220
Gly Tyr Asp
225
<210> 47
<211> 106
<212> PRT
<213> Simian adenovirus 37
<400> 47
Met Ser His Gly Gly Ala Ala Asp Leu Ala Arg Leu Arg His Leu Asp
1 5 10 15
His Cys Arg Arg Phe Arg Cys Phe Ala Arg Asp Leu Ala Glu Phe Ala
20 25 30
Tyr Phe Glu Leu Pro Glu Glu His Pro Gln Gly Pro Ala His Gly Val
35 40 45
Arg Ile Ile Val Glu Gly Gly Leu Asp Ser His Leu Leu Arg Ile Phe
50 55 60
Ser Gln Arg Pro Ile Leu Val Glu Arg Glu Gln Gly Gln Thr Arg Leu
65 70 75 80
Thr Leu Tyr Cys Ile Cys Asn His Pro Gly Leu His Glu Ser Leu Cys
85 90 95
Cys Leu Leu Cys Thr Glu Tyr Asn Lys Ser
100 105
<210> 48
<211> 176
<212> PRT
<213> Simian adenovirus 37
<400> 48
Met Gly Gln Ile Thr Leu Val Ser Cys Gly Val Leu Val Ala Val Leu
1 5 10 15
Ser Ile Val Gly Leu Gly Gly Ala Ala Val Val Lys Glu Glu Lys Ala
20 25 30
Asp Pro Cys Leu His Phe Asn Pro Asp Lys Cys Gln Leu Ser Phe Gln
35 40 45
Pro Asp Gly Asn Arg Cys Ala Val Leu Ile Lys Cys Gly Trp Glu Cys
50 55 60
Glu Asn Val Arg Ile Glu Tyr Asn Asn Lys Thr Arg Asn Asn Thr Leu
65 70 75 80
Ala Ser Val Trp Gln Pro Gly Asp Pro Glu Trp Tyr Thr Val Ser Val
85 90 95
Pro Gly Ala Asp Gly Ser Pro Arg Thr Val Asn Asn Thr Phe Ile Phe
100 105 110
Ala His Met Cys Asn Thr Val Met Trp Met Ser Lys Gln Tyr Asp Met
115 120 125
Trp Pro Pro Thr Lys Glu Asn Ile Val Val Phe Ser Ile Ala Tyr Ser
130 135 140
Leu Cys Thr Ala Leu Ile Thr Ala Ile Val Cys Leu Ser Ile His Met
145 150 155 160
Leu Ile Ala Ile Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys Gln Pro
165 170 175
<210> 49
<211> 228
<212> PRT
<213> Simian adenovirus 37
<400> 49
Met Arg Leu Leu Asn Phe Leu Asn Ile Val Leu Ser Ile Ala Tyr Ala
1 5 10 15
Ser Gly Tyr Ala Asn Ile Gln Lys Thr Leu Tyr Val Gly Ser Asp Gly
20 25 30
Thr Leu Glu Gly Thr Gln Ser Gln Ala Lys Val Ala Trp Tyr Phe Tyr
35 40 45
Arg Thr Asn Thr Asp Pro Val Lys Leu Cys Lys Gly Glu Leu Pro Arg
50 55 60
Thr His Lys Thr Pro Leu Thr Phe Ser Cys Ser Asn Asn Asn Leu Thr
65 70 75 80
Leu Phe Ser Ile Thr Lys Gln Tyr Thr Gly Thr Tyr Tyr Ser Thr Asn
85 90 95
Phe His Thr Gly Gln Asp Lys Tyr Tyr Thr Val Lys Val Glu Asn Pro
100 105 110
Thr Thr Pro Arg Thr Thr Thr Thr Thr Thr Thr Thr Ala Lys Pro Thr
115 120 125
Val Lys Thr Thr Thr Arg Thr Thr Thr Thr Thr Glu Thr Thr Thr Ser
130 135 140
Thr Thr Leu Ala Ala Thr Thr His Thr His Thr Lys Leu Thr Leu Gln
145 150 155 160
Thr Thr Asn Asp Leu Ile Ala Leu Leu Gln Lys Gly Asp Asn Ser Thr
165 170 175
Thr Ser Asn Glu Glu Ile Pro Lys Ser Met Ile Gly Ile Ile Val Ala
180 185 190
Val Val Val Cys Met Leu Ile Ile Ala Leu Cys Met Val Tyr Tyr Ala
195 200 205
Phe Cys Tyr Arg Lys His Arg Leu Asn Asp Lys Leu Glu His Leu Leu
210 215 220
Ser Val Glu Phe
225
<210> 50
<211> 203
<212> PRT
<213> Simian adenovirus 37
<400> 50
Met Lys Ile Leu Gly Leu Phe Ser Phe Ser Ile Ile Thr Ser Ala Leu
1 5 10 15
Cys Glu Ser Val Asp Arg Asp Val Thr Ile Thr Thr Gly Ser Asn Tyr
20 25 30
Thr Leu Lys Gly Pro Pro Ser Gly Met Leu Ser Trp Tyr Cys Tyr Phe
35 40 45
Gly Thr Asp Thr Asp Gln Thr Glu Leu Cys Asn Phe Gln Lys Gly Lys
50 55 60
Thr Ser Asn Ser Lys Ile Ser Asn Tyr Gln Cys Asn Gly Thr Asp Leu
65 70 75 80
Ile Leu Leu Asn Val Thr Lys Ala Tyr Gly Gly Ser Tyr Tyr Cys Pro
85 90 95
Gly Gln Asn Thr Glu Glu Met Ile Phe Tyr Lys Val Glu Val Val Asp
100 105 110
Pro Thr Thr Pro Pro Thr Thr Thr Thr Ile His Thr Thr His Thr Glu
115 120 125
Gln Thr Pro Glu Ala Thr Glu Ala Glu Leu Ala Phe Gln Val His Gly
130 135 140
Asp Ser Phe Ala Val Asn Thr Pro Thr Pro Asp Gln Arg Cys Pro Gly
145 150 155 160
Pro Leu Val Ser Gly Ile Val Gly Val Leu Ser Gly Leu Ala Val Ile
165 170 175
Ile Ile Cys Met Phe Ile Phe Ala Cys Cys Tyr Arg Arg Leu Tyr Arg
180 185 190
Gln Lys Ser Asp Pro Leu Leu Asn Leu Tyr Val
195 200
<210> 51
<211> 288
<212> PRT
<213> Simian adenovirus 37
<400> 51
Met Lys Ala Val Ser Ala Leu Val Phe Cys Ser Leu Ile Gly Ile Val
1 5 10 15
Phe Asn Ser Lys Ile Thr Lys Val Ser Phe Ile Lys His Val Asn Val
20 25 30
Thr Glu Gly Asp Asn Ile Thr Leu Ala Gly Val Glu Gly Ala Gln Asn
35 40 45
Thr Thr Trp Thr Lys Tyr His Leu Gly Trp Arg Asp Ile Cys Thr Trp
50 55 60
Asn Val Thr Tyr Tyr Cys Ile Gly Ile Asn Leu Thr Ile Val Asn Ala
65 70 75 80
Asn Gln Ser Gln Asn Gly Leu Ile Lys Gly Gln Ser Val Ser Val Thr
85 90 95
Ser Asp Gly Tyr Tyr Thr Gln His Ser Phe Asn Tyr Asn Ile Thr Val
100 105 110
Ile Pro Leu Pro Thr Pro Ser Pro Pro Ser Thr Thr Thr Gln Thr Thr
115 120 125
Thr Tyr Ser Thr Ser Asn Gln Pro Thr Thr Thr Thr Ala Ala Glu Val
130 135 140
Ala Ser Ser Ser Gly Val Arg Val Ala Phe Leu Met Leu Ala Pro Ser
145 150 155 160
Ser Ser Pro Thr Ala Ser Thr Asn Glu Gln Thr Thr Glu Phe Leu Ser
165 170 175
Thr Val Glu Ser His Thr Thr Ala Thr Ser Ser Ala Phe Ser Ser Thr
180 185 190
Ala Asn Leu Ser Ser Leu Ser Ser Thr Pro Ile Ser Pro Ala Thr Thr
195 200 205
Pro Ser Pro Ala Pro Leu Pro Thr Pro Leu Lys Gln Thr Asp Gly Gly
210 215 220
Met Gln Trp Gln Ile Thr Leu Leu Ile Val Ile Gly Leu Val Ile Leu
225 230 235 240
Ala Val Leu Leu Tyr Tyr Ile Phe Cys Arg Arg Ile Pro Asn Ala His
245 250 255
Arg Lys Pro Ala Tyr Lys Pro Ile Val Ile Gly Gln Pro Glu Pro Leu
260 265 270
Gln Val Glu Gly Gly Leu Arg Asn Leu Leu Phe Ser Phe Thr Val Trp
275 280 285
<210> 52
<211> 91
<212> PRT
<213> Simian adenovirus 37
<400> 52
Met Ile Pro Arg Gln Phe Leu Ile Thr Ile Leu Ile Cys Leu Leu Gln
1 5 10 15
Val Cys Ala Thr Leu Ala Leu Val Ala Asn Ala Ser Pro Asp Cys Ile
20 25 30
Gly Pro Phe Ala Ser Tyr Val Leu Phe Ala Phe Ile Thr Cys Ile Cys
35 40 45
Cys Cys Ser Ile Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Ile Asp
50 55 60
Trp Ile Phe Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg
65 70 75 80
Asp Gln Arg Val Ala Gln Leu Leu Arg Leu Leu
85 90
<210> 53
<211> 144
<212> PRT
<213> Simian adenovirus 37
<400> 53
Met Arg Ala Leu Leu Leu Leu Ala Leu Leu Leu Leu Val Leu Pro Arg
1 5 10 15
Pro Val Asp Pro Arg Pro Pro Thr Gln Ser Pro Glu Glu Val Arg Lys
20 25 30
Cys Lys Phe Gln Glu Pro Trp Lys Phe Leu Lys Cys Tyr Arg Gln Lys
35 40 45
Ser Asp Met His Pro Ser Trp Ile Met Ile Ile Gly Ile Val Asn Ile
50 55 60
Leu Ala Cys Thr Leu Ile Ser Phe Val Ile Tyr Pro Cys Phe Asp Phe
65 70 75 80
Gly Trp Asn Ser Pro Glu Ala Leu Tyr Leu Pro Pro Glu Pro Asp Thr
85 90 95
Pro Pro Gln Gln Pro Gln Ala His Ala Leu Pro Pro Pro Pro Gln Pro
100 105 110
Arg Pro Gln Tyr Met Pro Ile Leu Asp Tyr Glu Ala Glu Pro Gln Arg
115 120 125
Pro Met Leu Pro Ala Ile Ser Tyr Phe Asn Leu Thr Gly Gly Asp Asp
130 135 140
<210> 54
<211> 445
<212> PRT
<213> Simian adenovirus 37
<400> 54
Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr
1 5 10 15
Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro
20 25 30
Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser
35 40 45
Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu
50 55 60
Lys Leu Gly Asp Gly Val Asp Leu Asp Asp Ser Gly Lys Leu Ile Ser
65 70 75 80
Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr
85 90 95
Ile Ser Leu Asn Met Asp Thr Pro Leu Tyr Asn Asn Asn Gly Lys Leu
100 105 110
Gly Met Lys Val Thr Ala Pro Leu Lys Ile Leu Asp Thr Asp Leu Leu
115 120 125
Lys Thr Leu Val Val Ala Tyr Gly Gln Gly Leu Gly Thr Asn Thr Asn
130 135 140
Gly Ala Leu Val Ala Gln Leu Ala Tyr Pro Leu Val Phe Asn Thr Ala
145 150 155 160
Ser Lys Ile Ala Leu Asn Leu Gly Asn Gly Pro Leu Lys Val Asp Ala
165 170 175
Asn Arg Leu Asn Ile Asn Cys Lys Arg Gly Ile Tyr Val Thr Thr Thr
180 185 190
Lys Asp Ala Leu Glu Ile Asn Ile Ser Trp Ala Asn Ala Met Thr Phe
195 200 205
Ile Gly Asn Ala Ile Gly Val Asn Ile Asp Thr Lys Lys Gly Leu Gln
210 215 220
Phe Gly Thr Ser Ser Thr Glu Thr Asp Val Lys Asn Ala Phe Pro Leu
225 230 235 240
Gln Val Lys Leu Gly Ala Gly Leu Thr Phe Asp Ser Thr Gly Ala Ile
245 250 255
Val Ala Trp Asn Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr Thr Ala
260 265 270
Asp Pro Ser Pro Asn Cys His Ile Tyr Ser Ala Lys Asp Ala Lys Leu
275 280 285
Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln Ile Leu Gly Thr Val Ser
290 295 300
Leu Ile Ala Val Asp Thr Gly Ser Leu Asn Pro Ile Thr Gly Lys Val
305 310 315 320
Thr Thr Ala Leu Val Ser Leu Lys Phe Asp Ala Asn Gly Val Leu Gln
325 330 335
Ala Ser Ser Thr Leu Asp Lys Glu Tyr Trp Asn Phe Arg Lys Gly Asp
340 345 350
Val Thr Pro Ala Asp Pro Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn
355 360 365
Leu Asn Ala Tyr Pro Lys Asn Thr Asn Ala Ala Ala Lys Ser His Ile
370 375 380
Val Gly Lys Val Tyr Leu His Gly Asp Val Ser Lys Pro Leu Asp Leu
385 390 395 400
Ile Ile Thr Phe Asn Glu Thr Ser Asp Glu Ser Cys Thr Tyr Cys Ile
405 410 415
Asn Phe Gln Trp Gln Trp Gly Thr Asp Gln Tyr Lys Asp Glu Thr Leu
420 425 430
Ala Val Ser Ser Phe Thr Phe Ser Tyr Ile Ala Lys Glu
435 440 445
<210> 55
<211> 580
<212> DNA
<213> Simian adenovirus 37
<220>
<221> CDS
<222> (2)..(574)
<223> label=Elb\19K
<400> 55
t atg gag atc tgg aca gtc ttg gaa gac ttt cac cag act aga cag ctg 49
Met Glu Ile Trp Thr Val Leu Glu Asp Phe His Gln Thr Arg Gln Leu
1 5 10 15
cta gag aac tca tcg gag gga gtc tct tac ctg tgg aga ttc tgc ttc 97
Leu Glu Asn Ser Ser Glu Gly Val Ser Tyr Leu Trp Arg Phe Cys Phe
20 25 30
gct ggg cct cta gct aag cta gtc tat agg gcc aag cag gat tat agg 145
Ala Gly Pro Leu Ala Lys Leu Val Tyr Arg Ala Lys Gln Asp Tyr Arg
35 40 45
gaa caa ttt gag gat att ttg aga gag tgt cct ggt att ttt gac tct 193
Glu Gln Phe Glu Asp Ile Leu Arg Glu Cys Pro Gly Ile Phe Asp Ser
50 55 60
ctc aac ttg ggc cat cag tct cac ttt aac cag agt att ctg aga gcc 241
Leu Asn Leu Gly His Gln Ser His Phe Asn Gln Ser Ile Leu Arg Ala
65 70 75 80
ctt gac ttt tct act cct ggc aga act acc gcc gcg gta gcc ttt ttt 289
Leu Asp Phe Ser Thr Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe
85 90 95
gcc ttt atc ctt gac aaa tgg agt caa gaa acc cat ttc agc agg gat 337
Ala Phe Ile Leu Asp Lys Trp Ser Gln Glu Thr His Phe Ser Arg Asp
100 105 110
tac cgt ctg gac tgc tta gca gta gct ttg tgg aga aca tgg agg tgc 385
Tyr Arg Leu Asp Cys Leu Ala Val Ala Leu Trp Arg Thr Trp Arg Cys
115 120 125
cag cgc ctg aat gca atc tcc ggc tac ttg cca gta cag ccg gta gac 433
Gln Arg Leu Asn Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro Val Asp
130 135 140
acg ctg agg atc ctg agt ctc cag tca ccc cag gaa cac caa cgc cgc 481
Thr Leu Arg Ile Leu Ser Leu Gln Ser Pro Gln Glu His Gln Arg Arg
145 150 155 160
cag cag ccg cag cag gag cag cag caa gag gag gag gac cga gaa gag 529
Gln Gln Pro Gln Gln Glu Gln Gln Gln Glu Glu Glu Asp Arg Glu Glu
165 170 175
aac ccg aga gcc ggt ctg gac cct ccg gtg gcg gag gag gag gag 574
Asn Pro Arg Ala Gly Leu Asp Pro Pro Val Ala Glu Glu Glu Glu
180 185 190
tagctg 580
<210> 56
<211> 191
<212> PRT
<213> Simian adenovirus 37
<400> 56
Met Glu Ile Trp Thr Val Leu Glu Asp Phe His Gln Thr Arg Gln Leu
1 5 10 15
Leu Glu Asn Ser Ser Glu Gly Val Ser Tyr Leu Trp Arg Phe Cys Phe
20 25 30
Ala Gly Pro Leu Ala Lys Leu Val Tyr Arg Ala Lys Gln Asp Tyr Arg
35 40 45
Glu Gln Phe Glu Asp Ile Leu Arg Glu Cys Pro Gly Ile Phe Asp Ser
50 55 60
Leu Asn Leu Gly His Gln Ser His Phe Asn Gln Ser Ile Leu Arg Ala
65 70 75 80
Leu Asp Phe Ser Thr Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe
85 90 95
Ala Phe Ile Leu Asp Lys Trp Ser Gln Glu Thr His Phe Ser Arg Asp
100 105 110
Tyr Arg Leu Asp Cys Leu Ala Val Ala Leu Trp Arg Thr Trp Arg Cys
115 120 125
Gln Arg Leu Asn Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro Val Asp
130 135 140
Thr Leu Arg Ile Leu Ser Leu Gln Ser Pro Gln Glu His Gln Arg Arg
145 150 155 160
Gln Gln Pro Gln Gln Glu Gln Gln Gln Glu Glu Glu Asp Arg Glu Glu
165 170 175
Asn Pro Arg Ala Gly Leu Asp Pro Pro Val Ala Glu Glu Glu Glu
180 185 190
<210> 57
<211> 6360
<212> DNA
<213> Simian adenovirus 37
<220>
<221> CDS
<222> (10)..(573)
<223> label=22K
<220>
<221> CDS
<222> (1871)..(2494)
<223> label=E3\CR1\alpha
<220>
<221> CDS
<222> (5950)..(6354)
<223> label=E3\14.7K
<400> 57
tcccccagg atg ccc cga gga agc agc aag aag ctg aaa gtg gag ctg ccg 51
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro
1 5 10
ccg ccg ccg gag gat ttg gag gaa gac tgg gag agc agt cag gca gag 99
Pro Pro Pro Glu Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu
15 20 25 30
gag atg gaa gac tgg gac agc act cag gca gag gag gac agc ctg caa 147
Glu Met Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln
35 40 45
gac agt ctg gag gag gaa gac gag gtg gag gag gag gca gag gaa gaa 195
Asp Ser Leu Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu
50 55 60
gca gcc gcc gcc aga ccg tcg tcc tcg gcg gag aaa gca agc agc acg 243
Ala Ala Ala Ala Arg Pro Ser Ser Ser Ala Glu Lys Ala Ser Ser Thr
65 70 75
gat acc atc tcc gct ccg ggt cgg ggt cgc ggc ggc cgg gcc cac agt 291
Asp Thr Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser
80 85 90
agg tgg gac gag acc ggg cgc ttc ccg aac ccc acc acc cag acc ggt 339
Arg Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Gly
95 100 105 110
aag aag gag cgg cag gga tac aag tcc tgg cgg ggg cac aaa aac gcc 387
Lys Lys Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly His Lys Asn Ala
115 120 125
atc gtc tcc tgc ttg caa gcc tgc ggg ggc aac atc tcc ttc acc cgg 435
Ile Val Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile Ser Phe Thr Arg
130 135 140
cgc tac ctg ctc ttc cac cgc ggg gtg aac ttc ccc cgc aac atc ttg 483
Arg Tyr Leu Leu Phe His Arg Gly Val Asn Phe Pro Arg Asn Ile Leu
145 150 155
cat tac tac cgt cac ctc cac agc ccc tac tac tgt ttc caa gaa gag 531
His Tyr Tyr Arg His Leu His Ser Pro Tyr Tyr Cys Phe Gln Glu Glu
160 165 170
gca gaa acc cag cag cag cag aaa acc agc ggc agc agc agc 573
Ala Glu Thr Gln Gln Gln Gln Lys Thr Ser Gly Ser Ser Ser
175 180 185
tagaaaatcc acagcggcgg caggtggact gaggatcgcg gcgaacgagc cggcgcagac 633
ccgggagctg aggaaccgga tctttcccac cctctatgcc atcttccagc agagtcgggg 693
gcaggagcag gaactgaaag tcaagaaccg ttctctgcgc tcgctcaccc gcagttgtct 753
gtatcacaag agcgaagacc aacttcagcg cactctcgag gacgccgagg ctctcttcaa 813
caagtactgc gcgctcactc ttaaagagta gcccgcgccc gcccacacac ggaaaaaggc 873
gggaattacg tcaccacctg cgcccttcgc ccgaccatca tgagcaaaga gattcccacg 933
ccttacatgt ggagctacca gccccagatg ggcctggccg ccggcgccgc ccaggactac 993
tccacccgca tgaactggct cagtgccggg cccgcgatga tctcacgggt gaatgacatc 1053
cgcgcccgcc gaaaccagat actcctagaa cagtcagcga tcaccgccac gccccgccat 1113
caccttaatc cgcgtaattg gcccgccgcc ctggtgtacc aggaaattcc ccagcccacg 1173
accgtactac ttccgcgaga cgcccaggcc gaagtccagc tgactaactc aggtgtccag 1233
ctggccggcg gcgccgccct gtgtcgtcac cgccccgctc agggtataaa gcggctggtg 1293
atccgaggca gaggcacaca gctcaacgac gaggtggtga gctcttcgct gggtctgcga 1353
cctgacggag tcttccaact cgccggatcg gggagatctt ccttcacgcc tcgtcaggcc 1413
gtcctgactt tggagagttc gtcctcgcag ccccgctcgg gcggcatcgg cactctccag 1473
ttcgtggagg agttcactcc ctcggtctac ttcaacccct tctccggctc ccccggccac 1533
tacccggacg agttcatccc gaacttcgac gccatcagcg agtcggtgga cggctacgat 1593
tgaatgtccc atggtggcgc agctgaccta gctcggcttc gacacctgga ccactgccgc 1653
cgcttccgct gcttcgctcg ggatctcgcc gagtttgcct actttgagct gcccgaggag 1713
caccctcagg gcccagccca cggagtgcgg atcatcgtcg aagggggcct cgactcccac 1773
ctgcttcgga tcttcagcca gcgaccgatc ctggtcgagc gcgaacaagg acagacccgt 1833
ctgaccctgt actgcatctg caaccacccc ggcctgc atg aaa gtc ttt gtt gtc 1888
Met Lys Val Phe Val Val
190
tgc tgt gta ctg agt ata ata aaa gct gag atc agc gac tac tcc gga 1936
Cys Cys Val Leu Ser Ile Ile Lys Ala Glu Ile Ser Asp Tyr Ser Gly
195 200 205 210
ctc gat tgt ggt gtt cct gct atc aac cgg tcc ctg ttc ttc acc ggg 1984
Leu Asp Cys Gly Val Pro Ala Ile Asn Arg Ser Leu Phe Phe Thr Gly
215 220 225
aac gaa acc gag ctc cag ctc cag tgt aag ccc cac aag aag tac ctc 2032
Asn Glu Thr Glu Leu Gln Leu Gln Cys Lys Pro His Lys Lys Tyr Leu
230 235 240
acc tgg ctg ttc cag ggc tcc ccc atc gcc gtt gtc aac cac tgc gac 2080
Thr Trp Leu Phe Gln Gly Ser Pro Ile Ala Val Val Asn His Cys Asp
245 250 255
aac gac gga gtc ctg ctg agc ggc cct gcc aac ctt act ttt tcc acc 2128
Asn Asp Gly Val Leu Leu Ser Gly Pro Ala Asn Leu Thr Phe Ser Thr
260 265 270
cgc aga agc aag ctc cag ctc ttc caa ccc ttc ctc ccc ggg acc tat 2176
Arg Arg Ser Lys Leu Gln Leu Phe Gln Pro Phe Leu Pro Gly Thr Tyr
275 280 285 290
cag tgc gtc tcg gga ccc tgc cat cac acc ttc cac ctg atc ccg aat 2224
Gln Cys Val Ser Gly Pro Cys His His Thr Phe His Leu Ile Pro Asn
295 300 305
acc aca gcg ccg ctc ccc gct act aac aac caa act aac ctc cac caa 2272
Thr Thr Ala Pro Leu Pro Ala Thr Asn Asn Gln Thr Asn Leu His Gln
310 315 320
cgc cac cgt cgc gac ctt tcc tct gaa tct aat acc act acc gga ggt 2320
Arg His Arg Arg Asp Leu Ser Ser Glu Ser Asn Thr Thr Thr Gly Gly
325 330 335
gag ctc cga ggt cga cca acc tct ggg att tac tac ggc ccc tgg gag 2368
Glu Leu Arg Gly Arg Pro Thr Ser Gly Ile Tyr Tyr Gly Pro Trp Glu
340 345 350
gtg gtg ggg tta ata gcg cta ggc cta gtt gtg ggt ggg ctt ttg gct 2416
Val Val Gly Leu Ile Ala Leu Gly Leu Val Val Gly Gly Leu Leu Ala
355 360 365 370
ctc tgc tac cta tac ctc cct tgc tgt tcg tac tta gtg gtg ctg tgt 2464
Leu Cys Tyr Leu Tyr Leu Pro Cys Cys Ser Tyr Leu Val Val Leu Cys
375 380 385
tgc tgg ttt aag aaa tgg ggc aga tca ccc tagtgagctg cggtgtgctg 2514
Cys Trp Phe Lys Lys Trp Gly Arg Ser Pro
390 395
gtggcggtgc tttcgattgt gggactgggc ggcgcggctg tagtgaagga ggagaaggcc 2574
gatccctgct tgcatttcaa tcccgacaaa tgccagctga gttttcagcc cgatggcaat 2634
cggtgcgcgg tgctgatcaa gtgcggatgg gaatgcgaga acgtgagaat cgagtacaat 2694
aacaagactc ggaacaatac tctcgcgtcc gtgtggcagc ccggggaccc cgagtggtac 2754
accgtctctg tccccggtgc tgacggctcc ccgcgcaccg tgaataatac tttcattttt 2814
gcgcacatgt gcaacacggt catgtggatg agcaagcagt acgatatgtg gccccccacg 2874
aaggagaaca tcgtggtctt ctccatcgct tacagcctgt gcacggcgct aatcaccgct 2934
atcgtgtgcc tgagcattca catgctcatc gctattcgcc ccagaaataa tgccgagaaa 2994
gagaaacagc cataacacgt tttttcacac accttgtttt tacagacaat gcgtctgtta 3054
aattttttaa acattgtgct cagtattgct tatgcctctg gttatgcaaa catacagaaa 3114
accctttatg taggatctga tggtacacta gagggtaccc aatcacaagc caaggttgca 3174
tggtattttt atagaaccaa cactgatcca gttaaacttt gtaagggtga attgccgcgt 3234
acacataaaa ctccacttac atttagttgc agcaataata atcttacact tttttcaatt 3294
acaaaacaat atactggtac ttattacagt acaaactttc atacaggaca agataaatat 3354
tatactgtta aggtagaaaa tcctaccact cctagaacta ccaccaccac caccactact 3414
gcaaagccca ctgtgaaaac tacaactagg accaccacaa ctacagaaac caccaccagc 3474
acaacacttg ctgcaactac acacacacac actaagctaa ccttacagac cactaatgat 3534
ttgatcgccc tgctgcaaaa gggggataac agcaccactt ccaatgagga gatacccaaa 3594
tccatgattg gcattattgt tgctgtagtg gtgtgcatgt tgatcatcgc cttgtgcatg 3654
gtgtactatg ccttctgcta cagaaagcac agactgaacg acaagctgga acacttacta 3714
agtgttgaat tttaattttt tagaaccatg aagatcctag gcctttttag tttttctatc 3774
attacctctg ctctttgtga atcagtggat agagatgtta ctattaccac tggttctaat 3834
tatacactga aagggccacc ctcaggtatg ctttcgtggt attgctattt tggaactgac 3894
actgatcaaa ctgaattatg caattttcaa aaaggcaaaa cctcaaactc taaaatctct 3954
aattatcaat gcaatggcac tgatctgata ctactcaatg tcacgaaagc atatggtggc 4014
agttattatt gccctggaca aaacactgaa gaaatgattt tttacaaagt ggaagtggtt 4074
gatcccacta caccacccac caccacaact attcatacca cacacacaga acaaacacca 4134
gaggcaacag aagcagagtt ggccttccag gttcacggag attcctttgc tgtcaatacc 4194
cctacacccg atcagcggtg tccggggccg ctagtcagcg gcattgtcgg tgtgctttcg 4254
ggattagcag tcataatcat ctgcatgttc atttttgctt gctgctatag aaggctttac 4314
cgacaaaaat cagacccact gctgaacctc tatgtttaat tttttccaga gccatgaagg 4374
cagttagcgc tctagttttt tgttctttga ttggcattgt ttttaatagt aaaattacca 4434
aagttagctt tattaaacat gttaatgtaa ctgaaggaga taacatcaca ctagcaggtg 4494
tagaaggtgc tcaaaacacc acctggacaa aataccatct aggatggaga gatatttgca 4554
cctggaatgt aacttattat tgcataggaa ttaatcttac cattgttaac gctaaccaat 4614
ctcagaatgg gttaattaaa ggacagagtg ttagtgtgac cagtgatggg tactataccc 4674
agcatagttt taactacaac attactgtca taccactgcc tacgcctagc ccacctagca 4734
ctaccacaca gacaaccaca tacagtacat caaatcagcc taccaccact acagcagcag 4794
aggttgccag ctcgtctggg gtccgagtgg catttttgat gttggcccca tctagcagtc 4854
ccactgctag taccaatgag cagactactg aatttttgtc cactgtcgag agccacacca 4914
cagctacctc cagtgccttc tctagcaccg ccaatctctc ctcgctttcc tctacaccaa 4974
tcagccccgc tactactcct agccccgctc ctcttcccac tcccctgaag caaacagacg 5034
gcggcatgca atggcagatc accctgctca ttgtgatcgg gttggtcatc ctggccgtgt 5094
tgctctacta catcttctgc cgccgcattc ccaacgcgca ccgcaagccg gcctacaagc 5154
ccatcgttat cgggcagccg gagccgcttc aggtggaagg gggtctaagg aatcttctct 5214
tctcttttac agtatggtga ttgaactatg attcctagac aattcttgat cactattctt 5274
atctgcctcc tccaagtctg tgccaccctc gctctggtgg ccaacgccag tccagactgt 5334
attgggccct tcgcctccta cgtgctcttt gccttcatca cctgcatctg ctgctgtagc 5394
atagtctgcc tgcttatcac cttcttccag ttcattgact ggatctttgt gcgcatcgcc 5454
tacctgcgcc accaccccca gtaccgcgac cagcgagtgg cgcagctgct caggctcctc 5514
tgataagcat gcgggctctg ctacttctcg cgcttctgct gttagtgctc ccccgtcccg 5574
ttgacccccg gccccccact cagtcccccg aggaggtccg caaatgcaaa ttccaagaac 5634
cctggaaatt cctcaaatgc taccgccaaa aatcagacat gcatcccagc tggatcatga 5694
tcattgggat cgtgaacatt ctggcctgca ccctcatctc ctttgtgatt tacccctgct 5754
ttgactttgg ttggaactcg ccagaggcgc tctatctccc gcctgaacct gacacaccac 5814
cacagcaacc tcaggcacac gcactaccac caccaccaca gcctaggcca caatacatgc 5874
ccatattaga ctatgaggcc gagccacagc gacccatgct ccccgctatt agttacttca 5934
atctaaccgg cggag atg act gac cca ctg gcc aac aac aac gtc aac gac 5985
Met Thr Asp Pro Leu Ala Asn Asn Asn Val Asn Asp
400 405
ctt ctc ctg gac atg gac ggc cgc gcc tcg gag cag cga ctc gcc caa 6033
Leu Leu Leu Asp Met Asp Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln
410 415 420
ctt cgc att cgc cag cag cag gag aga gcc gtc aag gag ctg cag gac 6081
Leu Arg Ile Arg Gln Gln Gln Glu Arg Ala Val Lys Glu Leu Gln Asp
425 430 435 440
ggc ata gcc atc cac cag tgc aag aaa ggc atc ttc tgc ctg gtg aaa 6129
Gly Ile Ala Ile His Gln Cys Lys Lys Gly Ile Phe Cys Leu Val Lys
445 450 455
cag gcc aag atc tcc tac gag gtc acc cag acc gac cat cgc ctc tcc 6177
Gln Ala Lys Ile Ser Tyr Glu Val Thr Gln Thr Asp His Arg Leu Ser
460 465 470
tac gag ctc ctg cag cag cgc cag aag ttc acc tgc ctg gtc gga gtc 6225
Tyr Glu Leu Leu Gln Gln Arg Gln Lys Phe Thr Cys Leu Val Gly Val
475 480 485
aac ccc atc gtc atc acc cag cag tcg ggc gat acc aag ggg tgc atc 6273
Asn Pro Ile Val Ile Thr Gln Gln Ser Gly Asp Thr Lys Gly Cys Ile
490 495 500
cac tgc tcc tgc gac tcc ccc gac tgc gtc cac act ctg atc aag acc 6321
His Cys Ser Cys Asp Ser Pro Asp Cys Val His Thr Leu Ile Lys Thr
505 510 515 520
ctc tgc ggc ctc cgc gac ctc ctc ccc atg aac taatca 6360
Leu Cys Gly Leu Arg Asp Leu Leu Pro Met Asn
525 530
<210> 58
<211> 188
<212> PRT
<213> Simian adenovirus 37
<400> 58
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro Pro Pro
1 5 10 15
Pro Glu Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu Glu Met
20 25 30
Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp Ser
35 40 45
Leu Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu Ala Ala
50 55 60
Ala Ala Arg Pro Ser Ser Ser Ala Glu Lys Ala Ser Ser Thr Asp Thr
65 70 75 80
Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser Arg Trp
85 90 95
Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Gly Lys Lys
100 105 110
Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly His Lys Asn Ala Ile Val
115 120 125
Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile Ser Phe Thr Arg Arg Tyr
130 135 140
Leu Leu Phe His Arg Gly Val Asn Phe Pro Arg Asn Ile Leu His Tyr
145 150 155 160
Tyr Arg His Leu His Ser Pro Tyr Tyr Cys Phe Gln Glu Glu Ala Glu
165 170 175
Thr Gln Gln Gln Gln Lys Thr Ser Gly Ser Ser Ser
180 185
<210> 59
<211> 208
<212> PRT
<213> Simian adenovirus 37
<400> 59
Met Lys Val Phe Val Val Cys Cys Val Leu Ser Ile Ile Lys Ala Glu
1 5 10 15
Ile Ser Asp Tyr Ser Gly Leu Asp Cys Gly Val Pro Ala Ile Asn Arg
20 25 30
Ser Leu Phe Phe Thr Gly Asn Glu Thr Glu Leu Gln Leu Gln Cys Lys
35 40 45
Pro His Lys Lys Tyr Leu Thr Trp Leu Phe Gln Gly Ser Pro Ile Ala
50 55 60
Val Val Asn His Cys Asp Asn Asp Gly Val Leu Leu Ser Gly Pro Ala
65 70 75 80
Asn Leu Thr Phe Ser Thr Arg Arg Ser Lys Leu Gln Leu Phe Gln Pro
85 90 95
Phe Leu Pro Gly Thr Tyr Gln Cys Val Ser Gly Pro Cys His His Thr
100 105 110
Phe His Leu Ile Pro Asn Thr Thr Ala Pro Leu Pro Ala Thr Asn Asn
115 120 125
Gln Thr Asn Leu His Gln Arg His Arg Arg Asp Leu Ser Ser Glu Ser
130 135 140
Asn Thr Thr Thr Gly Gly Glu Leu Arg Gly Arg Pro Thr Ser Gly Ile
145 150 155 160
Tyr Tyr Gly Pro Trp Glu Val Val Gly Leu Ile Ala Leu Gly Leu Val
165 170 175
Val Gly Gly Leu Leu Ala Leu Cys Tyr Leu Tyr Leu Pro Cys Cys Ser
180 185 190
Tyr Leu Val Val Leu Cys Cys Trp Phe Lys Lys Trp Gly Arg Ser Pro
195 200 205
<210> 60
<211> 135
<212> PRT
<213> Simian adenovirus 37
<400> 60
Met Thr Asp Pro Leu Ala Asn Asn Asn Val Asn Asp Leu Leu Leu Asp
1 5 10 15
Met Asp Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln Leu Arg Ile Arg
20 25 30
Gln Gln Gln Glu Arg Ala Val Lys Glu Leu Gln Asp Gly Ile Ala Ile
35 40 45
His Gln Cys Lys Lys Gly Ile Phe Cys Leu Val Lys Gln Ala Lys Ile
50 55 60
Ser Tyr Glu Val Thr Gln Thr Asp His Arg Leu Ser Tyr Glu Leu Leu
65 70 75 80
Gln Gln Arg Gln Lys Phe Thr Cys Leu Val Gly Val Asn Pro Ile Val
85 90 95
Ile Thr Gln Gln Ser Gly Asp Thr Lys Gly Cys Ile His Cys Ser Cys
100 105 110
Asp Ser Pro Asp Cys Val His Thr Leu Ile Lys Thr Leu Cys Gly Leu
115 120 125
Arg Asp Leu Leu Pro Met Asn
130 135
<210> 61
<211> 870
<212> DNA
<213> Simian adenovirus 37
<220>
<221> CDS
<222> (7)..(574)
<223> label=Ela
<220>
<221> CDS
<222> (660)..(865)
<223> label=Ela
<400> 61
tgaaag atg agg cac ctg aga gac ctg ccc ggt aat gtt ttc ctg gct 48
Met Arg His Leu Arg Asp Leu Pro Gly Asn Val Phe Leu Ala
1 5 10
act ggg aac gag att ctg gaa ctg gtg gtg gac gcc atg atg ggt gac 96
Thr Gly Asn Glu Ile Leu Glu Leu Val Val Asp Ala Met Met Gly Asp
15 20 25 30
gac cct ccc gag ccc cct acc cca ttt gag gcg cct tcg ctg tac gat 144
Asp Pro Pro Glu Pro Pro Thr Pro Phe Glu Ala Pro Ser Leu Tyr Asp
35 40 45
ttg tat gat ctg gag gtg gat gtg tcc gag aac gac ccc aac gag gag 192
Leu Tyr Asp Leu Glu Val Asp Val Ser Glu Asn Asp Pro Asn Glu Glu
50 55 60
gcg gtg aat gat ttg ttt agc gat gcc gcg ctg ctg gct gcc gag cag 240
Ala Val Asn Asp Leu Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Gln
65 70 75
gct aat acg gac tct ggc tca gac agc gat tcc tct ctc cat acc ccg 288
Ala Asn Thr Asp Ser Gly Ser Asp Ser Asp Ser Ser Leu His Thr Pro
80 85 90
aga ccc ggc aga ggt gag aaa aag atc ccc gag ctt aaa ggg gaa gag 336
Arg Pro Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu
95 100 105 110
ctc gac ctg cgc tgc tat gag gaa tgc ttg cct ccg agc gat gat gag 384
Leu Asp Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu
115 120 125
gag gac gag gag gcg att cga gct gca gcg agc gag gga gtg aaa gtt 432
Glu Asp Glu Glu Ala Ile Arg Ala Ala Ala Ser Glu Gly Val Lys Val
130 135 140
gcg ggc gag agc ttt agc ctg gac tgt cct act ctg ccc gga cac ggc 480
Ala Gly Glu Ser Phe Ser Leu Asp Cys Pro Thr Leu Pro Gly His Gly
145 150 155
tgt aag tct tgt gaa ttt cat cgc atg aat act gga gat aag aat gtg 528
Cys Lys Ser Cys Glu Phe His Arg Met Asn Thr Gly Asp Lys Asn Val
160 165 170
atg tgt gcc ctg tgc tat atg aga gct tac aac cat tgt gtt tac a 574
Met Cys Ala Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr
175 180 185
gtaagtgtga ttaactttag ttgggaaagg cagagggtga ctgggtgctg actggtttat 634
ttatgtatat gttttttatg tgtag gt ccc gtc tct gac gca gat gag acc 685
Ser Pro Val Ser Asp Ala Asp Glu Thr
195
ccc act tca gag tgc att tca tca ccc cca gaa att ggc gag gaa ccg 733
Pro Thr Ser Glu Cys Ile Ser Ser Pro Pro Glu Ile Gly Glu Glu Pro
200 205 210
ccc gaa gat att att cat aga cca gtt gca gtg aga gtc acc ggg cgg 781
Pro Glu Asp Ile Ile His Arg Pro Val Ala Val Arg Val Thr Gly Arg
215 220 225 230
aga gca gct gtg gag agt ttg gat gac ttg cta cag ggt ggg gat gaa 829
Arg Ala Ala Val Glu Ser Leu Asp Asp Leu Leu Gln Gly Gly Asp Glu
235 240 245
cct ttg gac ttg tgt acc cgg aaa cgc ccc agg cac taagt 870
Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro Arg His
250 255
<210> 62
<211> 258
<212> PRT
<213> Simian adenovirus 37
<400> 62
Met Arg His Leu Arg Asp Leu Pro Gly Asn Val Phe Leu Ala Thr Gly
1 5 10 15
Asn Glu Ile Leu Glu Leu Val Val Asp Ala Met Met Gly Asp Asp Pro
20 25 30
Pro Glu Pro Pro Thr Pro Phe Glu Ala Pro Ser Leu Tyr Asp Leu Tyr
35 40 45
Asp Leu Glu Val Asp Val Ser Glu Asn Asp Pro Asn Glu Glu Ala Val
50 55 60
Asn Asp Leu Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Gln Ala Asn
65 70 75 80
Thr Asp Ser Gly Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg Pro
85 90 95
Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu Leu Asp
100 105 110
Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu Glu Asp
115 120 125
Glu Glu Ala Ile Arg Ala Ala Ala Ser Glu Gly Val Lys Val Ala Gly
130 135 140
Glu Ser Phe Ser Leu Asp Cys Pro Thr Leu Pro Gly His Gly Cys Lys
145 150 155 160
Ser Cys Glu Phe His Arg Met Asn Thr Gly Asp Lys Asn Val Met Cys
165 170 175
Ala Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr Ser Pro Val
180 185 190
Ser Asp Ala Asp Glu Thr Pro Thr Ser Glu Cys Ile Ser Ser Pro Pro
195 200 205
Glu Ile Gly Glu Glu Pro Pro Glu Asp Ile Ile His Arg Pro Val Ala
210 215 220
Val Arg Val Thr Gly Arg Arg Ala Ala Val Glu Ser Leu Asp Asp Leu
225 230 235 240
Leu Gln Gly Gly Asp Glu Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro
245 250 255
Arg His
<210> 63
<211> 850
<212> DNA
<213> Simian adenovirus 37
<220>
<221> CDS
<222> (10)..(337)
<223> label=33K
<220>
<221> CDS
<222> (507)..(841)
<223> label=33K
<400> 63
tcccccagg atg ccc cga gga agc agc aag aag ctg aaa gtg gag ctg ccg 51
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro
1 5 10
ccg ccg ccg gag gat ttg gag gaa gac tgg gag agc agt cag gca gag 99
Pro Pro Pro Glu Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu
15 20 25 30
gag atg gaa gac tgg gac agc act cag gca gag gag gac agc ctg caa 147
Glu Met Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln
35 40 45
gac agt ctg gag gag gaa gac gag gtg gag gag gag gca gag gaa gaa 195
Asp Ser Leu Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu
50 55 60
gca gcc gcc gcc aga ccg tcg tcc tcg gcg gag aaa gca agc agc acg 243
Ala Ala Ala Ala Arg Pro Ser Ser Ser Ala Glu Lys Ala Ser Ser Thr
65 70 75
gat acc atc tcc gct ccg ggt cgg ggt cgc ggc ggc cgg gcc cac agt 291
Asp Thr Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser
80 85 90
agg tgg gac gag acc ggg cgc ttc ccg aac ccc acc acc cag acc g 337
Arg Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr
95 100 105
gtaagaagga gcggcaggga tacaagtcct ggcgggggca caaaaacgcc atcgtctcct 397
gcttgcaagc ctgcgggggc aacatctcct tcacccggcg ctacctgctc ttccaccgcg 457
gggtgaactt cccccgcaac atcttgcatt actaccgtca cctccacag cc cct act 514
Ala Pro Thr
act gtt tcc aag aag agg cag aaa ccc agc agc agc aga aaa cca gcg 562
Thr Val Ser Lys Lys Arg Gln Lys Pro Ser Ser Ser Arg Lys Pro Ala
115 120 125
gca gca gca gct aga aaa tcc aca gcg gcg gca ggt gga ctg agg atc 610
Ala Ala Ala Ala Arg Lys Ser Thr Ala Ala Ala Gly Gly Leu Arg Ile
130 135 140
gcg gcg aac gag ccg gcg cag acc cgg gag ctg agg aac cgg atc ttt 658
Ala Ala Asn Glu Pro Ala Gln Thr Arg Glu Leu Arg Asn Arg Ile Phe
145 150 155 160
ccc acc ctc tat gcc atc ttc cag cag agt cgg ggg cag gag cag gaa 706
Pro Thr Leu Tyr Ala Ile Phe Gln Gln Ser Arg Gly Gln Glu Gln Glu
165 170 175
ctg aaa gtc aag aac cgt tct ctg cgc tcg ctc acc cgc agt tgt ctg 754
Leu Lys Val Lys Asn Arg Ser Leu Arg Ser Leu Thr Arg Ser Cys Leu
180 185 190
tat cac aag agc gaa gac caa ctt cag cgc act ctc gag gac gcc gag 802
Tyr His Lys Ser Glu Asp Gln Leu Gln Arg Thr Leu Glu Asp Ala Glu
195 200 205
gct ctc ttc aac aag tac tgc gcg ctc act ctt aaa gag tagcccgcg 850
Ala Leu Phe Asn Lys Tyr Cys Ala Leu Thr Leu Lys Glu
210 215 220
<210> 64
<211> 221
<212> PRT
<213> Simian adenovirus 37
<400> 64
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro Pro Pro
1 5 10 15
Pro Glu Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu Glu Met
20 25 30
Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp Ser
35 40 45
Leu Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu Ala Ala
50 55 60
Ala Ala Arg Pro Ser Ser Ser Ala Glu Lys Ala Ser Ser Thr Asp Thr
65 70 75 80
Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser Arg Trp
85 90 95
Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Ala Pro Thr
100 105 110
Thr Val Ser Lys Lys Arg Gln Lys Pro Ser Ser Ser Arg Lys Pro Ala
115 120 125
Ala Ala Ala Ala Arg Lys Ser Thr Ala Ala Ala Gly Gly Leu Arg Ile
130 135 140
Ala Ala Asn Glu Pro Ala Gln Thr Arg Glu Leu Arg Asn Arg Ile Phe
145 150 155 160
Pro Thr Leu Tyr Ala Ile Phe Gln Gln Ser Arg Gly Gln Glu Gln Glu
165 170 175
Leu Lys Val Lys Asn Arg Ser Leu Arg Ser Leu Thr Arg Ser Cys Leu
180 185 190
Tyr His Lys Ser Glu Asp Gln Leu Gln Arg Thr Leu Glu Asp Ala Glu
195 200 205
Ala Leu Phe Asn Lys Tyr Cys Ala Leu Thr Leu Lys Glu
210 215 220
<210> 65
<211> 36494
<212> DNA
<213> Simian adenovirus 38
<220>
<221> repeat_region
<222> (1)..(130)
<223> label=ITR
<220>
<221> CDS
<222> (1906)..(3417)
<223> label=Elb\55K
<220>
<221> CDS
<222> (3505)..(3930)
<223> label=pIX
<220>
<221> misc_feature
<222> (3996)..(5617)
<223> complement (3996..5326, 5605..5617) label=IVa2
<220>
<221> misc_feature
<222> (5099)..(13873)
<223> complement (5099..8671, 13865..13873) label=pol
<220>
<221> misc_feature
<222> (8473)..(13873)
<223> complement (8473..10410, 13865..13873) label=pTP
<220>
<221> misc_feature
<222> (10444)..(10608)
<223> label=VA\I
<220>
<221> misc_feature
<222> (10673)..(10843)
<223> label=VA\II
<220>
<221> CDS
<222> (10871)..(12046)
<223> label=52\55K
<220>
<221> CDS
<222> (12073)..(13830)
<223> label=pIIIa
<220>
<221> CDS
<222> (13913)..(15529)
<223> label=penton
<220>
<221> CDS
<222> (15536)..(16117)
<223> label=pVII
<220>
<221> CDS
<222> (16165)..(17211)
<223> label=V
<220>
<221> CDS
<222> (17239)..(17469)
<223> label=pX
<220>
<221> CDS
<222> (17542)..(18261)
<223> label=pVI
<220>
<221> CDS
<222> (18357)..(21146)
<223> label=hexon
<220>
<221> CDS
<222> (21171)..(21791)
<223> label=protease
<220>
<221> misc_feature
<222> (21869)..(23404)
<223> complement label=DBP
<220>
<221> CDS
<222> (23430)..(25814)
<223> label=100K
<220>
<221> CDS
<222> (26452)..(27132)
<223> label=pVIII
<220>
<221> CDS
<222> (27136)..(27453)
<223> label=E3\12.5K
<220>
<221> CDS
<222> (28581)..(29198)
<223> label=E3\CR1\beta
<220>
<221> CDS
<222> (29214)..(29822)
<223> label=E3\CR1\gamma
<220>
<221> CDS
<222> (29840)..(30703)
<223> label=E3\CR1\delta
<220>
<221> CDS
<222> (30714)..(30986)
<223> label=E3\RID\alpha
<220>
<221> CDS
<222> (31419)..(31823)
<223> label=E3\14.7K
<220>
<221> CDS
<222> (32096)..(33370)
<223> label=fiber
<220>
<221> misc_feature
<222> (33485)..(33676)
<223> complement label=E4\orf7
<220>
<221> misc_feature
<222> (33485)..(34635)
<223> complement (33485..33733, 34465..34635) label=E4\orf6/7
<220>
<221> misc_feature
<222> (33733)..(34635)
<223> complement label=E4\orf6
<220>
<221> misc_feature
<222> (34544)..(34906)
<223> complement label=E4\orf4
<220>
<221> misc_feature
<222> (34919)..(35269)
<223> complement label=E4\orf3
<220>
<221> misc_feature
<222> (35269)..(35655)
<223> complement label=E4\orf2
<220>
<221> misc_feature
<222> (35699)..(36070)
<223> complement label=E4\orf1
<220>
<221> repeat_region
<222> (36365)..(36494)
<223> complement label=ITR
<400> 65
catcatcaaa taatatacct taaacttttg gtgcgcgtta atatgcaaat gaggtatttg 60
aatttgggga tgcggggcgg tgattggctg cgggagcggc gaccgttagg ggcggggcgg 120
gtgacgtttt gatgacgtgt ttgtgaggcg gagccggttt gcaagttctc gtgggaaaag 180
tgacgtcaaa cgaggtgtgg tttgaacacg gaaatactca attttcccgc gctctctgac 240
aggaaatgag gtgtttctgg gcggatgcaa gtgaaaacgg gccattttcg cgcgaaaact 300
gaatgaggaa gtgaaaatct gagtaatttc gcgtttatgg cagggaggag tatttgccga 360
gggccgagta gactttgacc gattacgtgg gggtttcgat taccgtattt ttcacctaaa 420
tttccgcgta cggtgtcaaa gtccggtgtt tttacgtagg tgtcagctga tcgccagggt 480
atttaaacct gcgctctcta gtcaagaggc cactcttgag tgccagcgag tagagttttc 540
tcctccgcgc cgcgagtcag atctacactt tgaaagatga ggcacctgag agacctgccc 600
ggtaatgttt tcctggctac tgggaacgag attctggaac tggtggtgga cgccatgatg 660
ggtgacgacc ctccggagcc ccctacccca tttgaggcgc cttcgctgta cgatttgtat 720
gatctggagg tggatgtgcc cgagaacgac cccaacgagg aggcggtgaa tgatttgttt 780
agcgatgccg cgctgctggc tgccgagcag gctaatacgg actctggctc agacagcgat 840
tcctctctcc ataccccgag acccggcaga ggtgagaaaa agatccccga gcttaaaggg 900
gaagagctcg acctgcgctg ctatgaggaa tgcttgcctc cgagcgatga tgaggaggac 960
gaggaggcga ttcgagctgc agcgaaccag ggagtgaaag cggcgggcga gggctttagc 1020
ctggactgtc ctactctgcc cggacacggc tgtaagtctt gtgaatttca tcgcatgaat 1080
actggagata agaatgtgat gtgtgccctg tgctatatga gagcttacaa ccattgtgtt 1140
tacagtaagt gtgattaact ttagttggga aggcagaggg tgactgggtg ctgactggtt 1200
tatttatgta tatgtttttt tatgtgtagg tcccgtctct gacgtagatg agacccccac 1260
ttcagagtgc atttcatcac ccccagaaat tggcgaggaa ccgcccgaag atattattca 1320
tagaccagtt gcagtgagag tcaccgggcg gagagcagct gtggagagtt tggatgactt 1380
gctacagggt ggggatgaac ctttggactt gtgtacccgg aaacgcccca ggcactaagt 1440
gccacacatg tgtgtttact taaggtgatg tcagtattta tagggtgtgg agtgcaataa 1500
aatccgtgtt gactttaagt gcgtgtttta tgactcaggg gtggggactg tgggtatata 1560
agcaggtgca gacctgtgtg gtcagttcag agcaggactc atggagatct ggacggtctt 1620
ggaagacttt catcagacta gacagctgct agagaactca tcggaggaag tctcttacct 1680
gtggagattt tgcttcggtg gggctctagc taagctagtc tatagggcca aacaggatta 1740
taaggatcaa tttgaggata ttttgagaga gtgtcctggt atttttgact ctctcaactt 1800
gggccatcag tctcacttta accagagtat tctgagagcc cttgactttt ctactcctgg 1860
cagaactacc gccgcggtag ccttttttgc ctttatcctt gacaa atg gag tca aga 1917
Met Glu Ser Arg
1
aac cca ttt cag cag gga tta ccg tct gga ctg ctt agc agt agc ttt 1965
Asn Pro Phe Gln Gln Gly Leu Pro Ser Gly Leu Leu Ser Ser Ser Phe
5 10 15 20
gtg gag aac atg gag gtg cca gcg cct gaa tgc aat ctc cgg cta ctt 2013
Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn Leu Arg Leu Leu
25 30 35
gcc agt aca gcc ggt aga cac gct gag gat cct gag tct cca gtc acc 2061
Ala Ser Thr Ala Gly Arg His Ala Glu Asp Pro Glu Ser Pro Val Thr
40 45 50
cca gga aca cca acg ccg cca gca gcc gca gca gga gca gca gca aga 2109
Pro Gly Thr Pro Thr Pro Pro Ala Ala Ala Ala Gly Ala Ala Ala Arg
55 60 65
gga gga gga gga ccg aga aga gaa ccc gag agc cgg tct gga ccc tcc 2157
Gly Gly Gly Gly Pro Arg Arg Glu Pro Glu Ser Arg Ser Gly Pro Ser
70 75 80
ggt ggc gga gga gga gga gta gct gac ttg ttt ccc gag ctg cgc cgg 2205
Gly Gly Gly Gly Gly Gly Val Ala Asp Leu Phe Pro Glu Leu Arg Arg
85 90 95 100
gtg ctg act agg tct tcc agt gga cgg gag agg ggg att aag cgg gag 2253
Val Leu Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly Ile Lys Arg Glu
105 110 115
agg cat gag gag act agc cac aga act gaa ctg act gtc agt ctg atg 2301
Arg His Glu Glu Thr Ser His Arg Thr Glu Leu Thr Val Ser Leu Met
120 125 130
agc cgc agg cgc cca gaa tcg gtg tgg tgg cat gag gtg cag tcg cag 2349
Ser Arg Arg Arg Pro Glu Ser Val Trp Trp His Glu Val Gln Ser Gln
135 140 145
ggg ata gat gag gtc tca gtg atg cat gag aaa tat tcc cta gaa caa 2397
Gly Ile Asp Glu Val Ser Val Met His Glu Lys Tyr Ser Leu Glu Gln
150 155 160
gtc aag act tgt tgg ttg gag ccc gag gat gat tgg gag gta gcc atc 2445
Val Lys Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp Glu Val Ala Ile
165 170 175 180
agg aat tat gcc aag ctg gct ctg aag cca gac aag aag tac aag att 2493
Arg Asn Tyr Ala Lys Leu Ala Leu Lys Pro Asp Lys Lys Tyr Lys Ile
185 190 195
acc aaa ctg att aat atc aga aat tcc tgc tac att tca ggg aat ggg 2541
Thr Lys Leu Ile Asn Ile Arg Asn Ser Cys Tyr Ile Ser Gly Asn Gly
200 205 210
gcc gag gtg gag atc agt acc cag gag agg gcg gcc ttc aga tgt tgt 2589
Ala Glu Val Glu Ile Ser Thr Gln Glu Arg Ala Ala Phe Arg Cys Cys
215 220 225
atg atg aat atg tac ccg ggg gtg gtg ggc atg gag gga gtc acc ttt 2637
Met Met Asn Met Tyr Pro Gly Val Val Gly Met Glu Gly Val Thr Phe
230 235 240
atg aac acg agg ttc agg ggt gat ggg tat aat ggg gtg gtc ttt atg 2685
Met Asn Thr Arg Phe Arg Gly Asp Gly Tyr Asn Gly Val Val Phe Met
245 250 255 260
gcc aac acc aag ttg aca gtg cac gga tgc tcc ttc ttt ggc ttc aat 2733
Ala Asn Thr Lys Leu Thr Val His Gly Cys Ser Phe Phe Gly Phe Asn
265 270 275
aac atg tgc atc gag gcc tgg ggc agt gtt tca gtg agg gga tgc agc 2781
Asn Met Cys Ile Glu Ala Trp Gly Ser Val Ser Val Arg Gly Cys Ser
280 285 290
ttt tca gcc aac tgg atg ggg gtc gtg ggc aga acc aag agc gtg gtt 2829
Phe Ser Ala Asn Trp Met Gly Val Val Gly Arg Thr Lys Ser Val Val
295 300 305
tca gtg aag aaa tgc ctg ttc gag agg tgc cac ctg ggg gtg atg agc 2877
Ser Val Lys Lys Cys Leu Phe Glu Arg Cys His Leu Gly Val Met Ser
310 315 320
gag ggc gaa gcc aaa gtc aaa cac tgc gcc tct act gag acg ggc tgc 2925
Glu Gly Glu Ala Lys Val Lys His Cys Ala Ser Thr Glu Thr Gly Cys
325 330 335 340
ttt gtg ctg atc aag ggc aat gcc caa gtc aag cat aac atg atc tgt 2973
Phe Val Leu Ile Lys Gly Asn Ala Gln Val Lys His Asn Met Ile Cys
345 350 355
ggg gcc tcg gat gag cgc ggc tac cag atg ctg acc tgc gcc ggt ggg 3021
Gly Ala Ser Asp Glu Arg Gly Tyr Gln Met Leu Thr Cys Ala Gly Gly
360 365 370
aac agc cat atg ctg gcc acc gtg cat gtg gcc tcg cac ccc cgc aag 3069
Asn Ser His Met Leu Ala Thr Val His Val Ala Ser His Pro Arg Lys
375 380 385
aca tgg ccc gag ttc gag cac aac gtc atg acc cgc tgc aat gtg cac 3117
Thr Trp Pro Glu Phe Glu His Asn Val Met Thr Arg Cys Asn Val His
390 395 400
ctg ggg tcc cgc cga ggc atg ttc atg ccc tac cag tgc aac atg caa 3165
Leu Gly Ser Arg Arg Gly Met Phe Met Pro Tyr Gln Cys Asn Met Gln
405 410 415 420
ttt gtg aag gtg ctg ctg gag ccc gat gcc atg tcc aga gtg agc ctg 3213
Phe Val Lys Val Leu Leu Glu Pro Asp Ala Met Ser Arg Val Ser Leu
425 430 435
acg ggg gtg ttt gac atg aat gtg gag ctg tgg aaa att ctg aga tat 3261
Thr Gly Val Phe Asp Met Asn Val Glu Leu Trp Lys Ile Leu Arg Tyr
440 445 450
gat gaa tcc aag acc agg tgc cgg gcc tgc gaa tgc gga ggc aag cac 3309
Asp Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys Gly Gly Lys His
455 460 465
gcc agg ctt cag ccc gtg tgt gtg gag gtg acg gag gac ctg cga ccc 3357
Ala Arg Leu Gln Pro Val Cys Val Glu Val Thr Glu Asp Leu Arg Pro
470 475 480
gat cat ttg gtg ttg tcc tgc aac ggg acg gag ttc ggc tcc agc ggg 3405
Asp His Leu Val Leu Ser Cys Asn Gly Thr Glu Phe Gly Ser Ser Gly
485 490 495 500
gaa gaa tct gac tagagtgagt agtgtttggg ggaggtggag ggcttgtatg 3457
Glu Glu Ser Asp
aggggcagaa tgactaaaat ctgtgttttt ctgtgtgttg cagcagc atg agc gga 3513
Met Ser Gly
505
agc gcc tcc ttt gag gga ggg gta ttc agc ccg tat ctg acg ggg cgt 3561
Ser Ala Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu Thr Gly Arg
510 515 520
ctc ccc tcc tgg gct gga gtg cgt cag aat gtg atg gga tcc acg gtg 3609
Leu Pro Ser Trp Ala Gly Val Arg Gln Asn Val Met Gly Ser Thr Val
525 530 535
gac ggc cgg ccc gtg cag ccc gcg aac tct tca acc ctg acc tac gcg 3657
Asp Gly Arg Pro Val Gln Pro Ala Asn Ser Ser Thr Leu Thr Tyr Ala
540 545 550 555
acc ctg agc tcc tcg tcc gtg gac gca gct gcc gcc gca gct gct gct 3705
Thr Leu Ser Ser Ser Ser Val Asp Ala Ala Ala Ala Ala Ala Ala Ala
560 565 570
tcc gcc gcc agc gcc gtg cgc gga atg gcc ctg ggc gcc ggc tac tac 3753
Ser Ala Ala Ser Ala Val Arg Gly Met Ala Leu Gly Ala Gly Tyr Tyr
575 580 585
agc tct ctg gtg gcc aac tcg agt tcc acc aat aat ccc gcc agc ctg 3801
Ser Ser Leu Val Ala Asn Ser Ser Ser Thr Asn Asn Pro Ala Ser Leu
590 595 600
aac gag gag aag ctg ctg ctg ctg atg gcc cag ctc gag gcc ctg acc 3849
Asn Glu Glu Lys Leu Leu Leu Leu Met Ala Gln Leu Glu Ala Leu Thr
605 610 615
cag cgc ctg ggc gag ctg acc cag cag gtg gct cag ctg cag gcg gag 3897
Gln Arg Leu Gly Glu Leu Thr Gln Gln Val Ala Gln Leu Gln Ala Glu
620 625 630 635
acg cgg gcc gcg gtt gcc acg gtg aaa acc aaa taaaaaatga atcaataaat 3950
Thr Arg Ala Ala Val Ala Thr Val Lys Thr Lys
640 645
aaacggagac ggttgttgat tttaacacag agtcttgaat ctttatttga tttttcgcgc 4010
gcggtaggcc ctggaccacc ggtctcgatc attgagcacc cggtggatct tttccaggac 4070
ccggtagagg tgggcttgga tgttgaggta catgggcatg agcccgtccc gggggtggag 4130
gtagctccat tgcagggcct cgtgctcggg ggtggtgttg taaatcaccc agtcatagca 4190
ggggcgcagt gcgtggtgct gcacgatgtc cttgaggagg agactgacgg ccacgggcag 4250
ccccttggtg taggtgttga cgaacctgtt gagctgggag ggatgcatgc ggggggagat 4310
gagatgcatc ttggcctgga tcttgagatt ggcgatgttc ccgcccagat cccgccgggg 4370
gttcatgttg tgcaggacca ccagcacggt gtatccggtg cacttgggga atttgtcatg 4430
caacttggaa gggaaggcgt gaaagaattt ggagacgccc ttgtggccgc ccaggttttc 4490
catgcactca tccatgatga tggcgatggg cccgtgggcg gcggcctggg caaagacgtt 4550
tcgggggtcg gacacatcgt agttgtggtc ctgggtgagc tcgtcatagg ccattttaat 4610
gaatttgggg cggagggtgc ccgactgggg gacgaaggtg ccctcgatcc cgggggcgta 4670
gttgccctcg cagatctgca tctcccaggc cttgagctcg gaggggggga tcatgtccac 4730
ctgcggggcg atgaaaaaaa cggtttccgg ggcgggggag atgagctggg ccgaaagcag 4790
gttccggagc agctgggact tgccgcagcc ggtgggaccg tagatgaccc cgatgaccgg 4850
ctgcaggtgg tagttgaggg agagacagct gccgtcctcg cggaggaggg gggccacctc 4910
gttcatcatc tcgcgcacat gcatgttctc gcgcacgagt tccgccagga ggcgctcgcc 4970
ccccagcgag aggagctctt gcagcgaggc gaagtttttc agcggcttga gcccgtcggc 5030
catgggcatt ttggagaggg tctgttgcaa gagttccaga cggtcccaga gctcggtgat 5090
gtgctctagg gcatctcgat ccagcagacc tcctcgtttc gcgggttggg gcgactgcgg 5150
gagtagggca ccaggcgatg ggcgtccagc gaggccaggg tccggtcctt ccagggccgt 5210
agggtccgcg tcagcgtggt ctccgtcacg gtgaaggggt gcgcgccggg ctgggcgctt 5270
gcgagggtgc gcttcaggct catccggctg gtcgagaacc gctcccggtc ggcgccctgc 5330
gcgtcggcca ggtagcaatt gagcatgagt tcgtagttga gcgcctcggc cgcgtggccc 5390
ttggcgcgga gcttaccttt ggaagtgtgt ccgcagacgg gacagaggag ggacttgagg 5450
gcgtagagct tgggggcgag gaagacggac tcgggggcgt aggcgtccgc gccgcagctg 5510
gcgcagacgg tctcgcactc cacgagccag gtgaggtcgg ggcggtcggg gtcaaaaacg 5570
aggtttcctc cgtgcttttt gatgcgtttc ttacctctgg tctccatgag ctcgtgtccc 5630
cgctgggtga caaagaggct gtccgtgtcc ccgtagaccg actttatggg ccggtcctcg 5690
agcggggtgc cgcggtcctc gtcgtagagg aaccccgccc actccgagac gaaggcccgg 5750
gtccaggcca gcacgaagga ggccacgtgg gaggggtagc ggtcgttgtc caccagcggg 5810
tccaccttct ccagggtatg caagcacatg tccccctcgt ccacatccag gaaggtgatt 5870
ggcttgtaag tgtaggccac gtgaccgggg gtcccggccg ggggggtata aaagggggcg 5930
ggcccctgct cgtcctcact gtcttccgga tcgctgtcca ggagcgccag ctgttggggt 5990
aggtattccc tctcgaaggc gggcatgacc tcggcactca ggttgtcagt ttctagaaac 6050
gaggaggatt tgatattgac ggtgccgttg gagacgcctt tcatgagccc ctcgtccatc 6110
tggtcagaaa agacgatctt tttgttgtcg agcttggtgg cgaaggagcc gtagagggca 6170
ttggagagga gcttggcgat ggagcgcatg gtctggttct tttccttgtc ggcgcgctcc 6230
ttggcggcga tgttgagctg cacgtactcg cgcgccacgc acttccattc ggggaagacg 6290
gtggtgagct cgtcgggcac gattctgacc cgccagccgc ggttgtgcag ggtgatgagg 6350
tccacgctgg tggccacctc gccgcgcagg ggctcgttgg tccagcagag gcgcccgccc 6410
ttgcgcgagc agaagggggg cagcgggtcc agcatgagct cgtcgggggg gtcggcgtcc 6470
acggtgaaga tgccgggcag gagctcgggg tcgaagtagc tgatgcaggt gcccagatcg 6530
tccagcgccg cttgccagtc gcgcacggcc agcgcgcgct cgtaggggct gaggggcgtg 6590
ccccagggca tggggtgcgt gagcgcggag gcgtacatgc cgcagatgtc gtagacgtag 6650
aggggctcct cgaggacgcc gatgtaggtg gggtagcagc gccccccgcg gatgctggcg 6710
cgcacgtagt cgtacagctc gtgcgagggc gcgaggagcc ccgtgccgag attggagcgc 6770
tgcggctttt cggcgcggta gacgatctgg cggaagatgg cgtgggagtt ggaggagatg 6830
gtgggcctct ggaagatgtt gaagtgggcg tggggcaggc cgaccgagtc cctgatgaag 6890
tgggcgtagg agtcctgcag cttggcgacg agctcggcgg tgacgaggac gtccagggcg 6950
cagtagtcga gggtctcttg gatgatgtcg tacttgagct ggcccttctg cttccacagc 7010
tcgcggttga gaaggaactc ttcgcggtcc ttccagtact cttcgagggg gaacccgtcc 7070
tgatcggcac ggtaagagcc caccatgtag aactggttga cggccttgta ggcgcagcag 7130
cccttctcca cggggagggc ataagcttgc gcggccttgc gcagggaggt gtgggtgagg 7190
gcgaaggtgt cgcgcaccat gaccttgagg aactggtgct tgaagtcgag gtcgtcgcag 7250
ccgccctgct cccagagttg gaagtccgtg cgcttcttgt aggcggggtt gggcaaagcg 7310
aaagtaacat cgttgaagag gatcttgccc gcgcggggca tgaagttgcg agtgatgcgg 7370
aaaggctggg gcacctcggc ccggttgttg atgacctggg cggcgaggac gatctcgtcg 7430
aagccgttga tgttgtgccc gacgatgtag agttccacga atcgcgggcg gcccttgacg 7490
tggggcagct tcttgagctc gtcgtaggtg agctcggcgg ggtcgctgag gccgtgctgc 7550
tcgagggccc agtcggcgac gtgggggttg gcgctgagga aggaagtcca gagatccacg 7610
gccagggcgg tctgcaagcg gtcccggtat tgacggaact gctggcccac ggccattttt 7670
tcgggggtga cgcagtagaa ggtgcggggg tcgccgtgcc agcggtccca cttgagctgg 7730
agggcgaggt cgtgggcgag ctcgacgagc ggcgggtccc cggagagttt catgaccagc 7790
atgaagggga cgagctgctt gccgaaggac cccatccagg tgtaggtttc cacatcgtag 7850
gtgaggaaga gcctttcggt gcgaggatgc gagccgatgg ggaagaactg gatctcctgc 7910
caccagttgg aggaatggct gttgatgtga tggaagtaga aatgccgacg gcgcgccgag 7970
cactcgtgct tgtgtttata caagcgtccg cagtgctcgc aacgctgcac gggatgcacg 8030
tgctgcacga gctgtacctg agttcctttg acgaggaatt tcagtgggca gtggagcgct 8090
ggcggctgca tctggtgctg tactacgtcc tggccatcgg cgtggccatc gtctgcctcg 8150
atggtggtca tgctgacgag gccgcgcggg aggcaggtcc agacctcggc tcggacgggt 8210
cggagagcga ggacgagggc gcgcaggccg gagctgtcca gggtcctgag acgctgcgga 8270
gtcaggtcag tgggcagcgg cggcgcgcgg ttgacttgca ggagcttttc cagggcgcgc 8330
gggaggtcca gatggtactt gatctccacg gcgccgttgg tggcgacgtc cacggcttgc 8390
agggtcccgt gcccctgggg cgccaccacc gtgccccgtt tcttcttggg cgctggcgtt 8450
ggcgctgctt ccatgtcggt cagaagcggc ggcgaggacg cgcgccgggc ggcaggggcg 8510
gctcggggcc cggaggcagg ggcggcaggg gcacgtcggc gccgcgcgcg ggcaggttct 8570
ggtactgcgc ccggagaaga ctggcgtgag cgacgacgcg acggttgacg tcctggatct 8630
gacgcctctg ggtgaaggcc acgggacccg tgagtttgaa cctgaaagag agttcgacag 8690
aatcaatctc ggtatcgttg acggcggcct gccgcaggat ctcttgcacg tcgcccgagt 8750
tgtcctggta ggcgatctcg gtcatgaact gctcgatctc ctcctcctga aggtctccgc 8810
ggccggcgcg ctcgacggtg gccgcgaggt cgttggagat gcggcccatg agctgcgaga 8870
aggcgttcat gccggcctcg ttccagacgc ggctgtagac cacggatccg tcggggtcgc 8930
gcgcgcgcat gaccacctgg gcgaggttga gctccacgtg gcgcgtgaag accgcgtagt 8990
tgcagaggcg ctggtagagg tagttgagcg tggtggcgat gtgctcggtg acgaagaagt 9050
acatgatcca gcggcggagc ggcatctcgc tgacgtcgcc cagggcttcc aagcgctcca 9110
tggcctcgta gaagtccacg gcgaagttga aaaactggga gttgcgcgcc gagacggtca 9170
actcctcctc cagaagacgg atgagctcgg cgatggtggc gcgcacttcg cgctcgaagg 9230
ccccgggggg ctcctcctct tccatctcct cctcttcctc ctcctccact aacatctctt 9290
ctacttcctc ctcaggaggc ggcggcgggg gaggggccct gcgtcgccgg cggcgcacgg 9350
gcagacggtc gatgaagcgc tcgatggtct ccccgcgccg gcgacgcatg gtctcggtga 9410
cggcgcgccc gtcctcgcgg ggccgcagcg tgaagacgcc gccgcgcatc tccaggtggc 9470
cgccgggggg gtctccgttg ggcagggaga gggcgctgac gatgcatctt atcaattggc 9530
ccgtagggac tccgcgcaag gacctgagcg tctcgagatc cacgggatcc gaaaaccgct 9590
gaacgaaggc ttcgagccag tcgcagtcgc aaggtaggct gagcccggtt tcttgttctt 9650
cgggtatttg gtcgggaggc gggcgggcga tgctgctggt gatgaagttg aagtaggcgg 9710
tcctgagacg gcggatggtg gcgaggagca ccaggtcctt gggcccggct tgctggatgc 9770
gcagacggtc ggccatgccc caggcgtggt cctgacacct ggcgaggtcc ttgtagtagt 9830
cctgcatgag ccgctccacg ggcacctcct cctcgcccgc gcggccgtgc atgcgcgtga 9890
gcccgaaccc gcgctgcggc tggacgagcg ccaggtcggc gacgacgcgc tcggcgagga 9950
tggcctgctg gatctgggtg agggtggtct ggaagtcgtc gaagtcgacg aagcggtggt 10010
aggctccggt gttgatggtg taggagcagt tggccatgac ggaccagttg acggtctggt 10070
ggccggggcg cacgagctcg tggtacttga ggcgcgagta ggcgcgcgtg tcgaagatgt 10130
agtcgttgca ggtgcgcacg aggtactggt atccgacgag gaagtgcggc ggcggctggc 10190
ggtagagcgg ccatcgctcg gtggcggggg cgccgggcgc gaggtcttcg agcatgaggc 10250
ggtggtagcc gtagatgtac ctggacatcc aggtgatgcc ggcggcggtg gtggaggcgc 10310
gcgggaactc gcggacgcgg ttccagatgt tgcgcagcgg caggaagtag ttcatggtgg 10370
ccgcggtctg gcccgtgagg cgcgcgcagt cgtggatgct ctagacatac gggcaaaaac 10430
gaaagcggtc agcggctcga ctccgtggcc tggaggctaa gcgaacgggt tgggctgcgc 10490
gtgtaccccg gttcgagtcc ctgctcgaat caggctggag ccgcagctaa cgtggtactg 10550
gcactcccgt ctcgacccaa gcctgctaac gaaacctcca ggatacggag gcgggtcgtt 10610
ttggcgtttt ttcgtcaggc cggaaatgaa actagtaagc gcggaaagcg gccgcccgca 10670
atggctcgct gccgtagtct ggagaaagaa tcgccagggt tgcgttgcgg tgtgccccgg 10730
ttcgagactc agcgctcggc gccggccgga ttccgcggct aacgtgggcg tggctgcccc 10790
gtcgtttcca agacccctta gccagccgac ttctccagtt acggagcgag cccctctttt 10850
ttttcttgtg tttttgccag atg cat ccc gta ctg cgg cag atg cgc ccc cac 10903
Met His Pro Val Leu Arg Gln Met Arg Pro His
650 655
cct cca cca caa ccg ccc cta ccg cag cag cag caa cag ccg gcg ctt 10951
Pro Pro Pro Gln Pro Pro Leu Pro Gln Gln Gln Gln Gln Pro Ala Leu
660 665 670
ctg ccc ccg ccc cag cag cag cag cag cca gcc act acc gcg gcg gcc 10999
Leu Pro Pro Pro Gln Gln Gln Gln Gln Pro Ala Thr Thr Ala Ala Ala
675 680 685
gcc gtg agc gga gcc ggc gtt cag tat gac ctg gcc ttg gaa gag ggc 11047
Ala Val Ser Gly Ala Gly Val Gln Tyr Asp Leu Ala Leu Glu Glu Gly
690 695 700 705
gag ggg ctg gcg cgg ctg ggg gcg tcg tcg ccg gag cgg cac ccg cgc 11095
Glu Gly Leu Ala Arg Leu Gly Ala Ser Ser Pro Glu Arg His Pro Arg
710 715 720
gtg cag atg aaa agg gac gct cgc gag gcc tac gtg ccc aag cag aac 11143
Val Gln Met Lys Arg Asp Ala Arg Glu Ala Tyr Val Pro Lys Gln Asn
725 730 735
ctg ttc aga gac agg agc ggc gag gag ccc gag gag atg cgc gcc tcc 11191
Leu Phe Arg Asp Arg Ser Gly Glu Glu Pro Glu Glu Met Arg Ala Ser
740 745 750
cgc ttc cac gcg ggg cgg gag ctg cgg cgc ggc ctg gac cga aag cgg 11239
Arg Phe His Ala Gly Arg Glu Leu Arg Arg Gly Leu Asp Arg Lys Arg
755 760 765
gtg ctg agg gac gag gat ttc gag gcg gac gag ctg acg ggg atc agc 11287
Val Leu Arg Asp Glu Asp Phe Glu Ala Asp Glu Leu Thr Gly Ile Ser
770 775 780 785
ccc gcg cgc gcg cac gtg gcc gcg gcc aac ctg gtc acg gcg tac gag 11335
Pro Ala Arg Ala His Val Ala Ala Ala Asn Leu Val Thr Ala Tyr Glu
790 795 800
cag acc gtg aag gag gag agc aac ttt caa aaa tcc ttc aac aac cac 11383
Gln Thr Val Lys Glu Glu Ser Asn Phe Gln Lys Ser Phe Asn Asn His
805 810 815
gtg cgc acc ttg atc gcg cgc gag gag gtg acc ctg ggc ctg atg cat 11431
Val Arg Thr Leu Ile Ala Arg Glu Glu Val Thr Leu Gly Leu Met His
820 825 830
ctg tgg gac ctg ctg gag gcc atc gtg cag aac ccc acg agc aag ccg 11479
Leu Trp Asp Leu Leu Glu Ala Ile Val Gln Asn Pro Thr Ser Lys Pro
835 840 845
ctg acg gcg cag ctg ttt ctg gtg gtg cag cac agt cgg gac aac gag 11527
Leu Thr Ala Gln Leu Phe Leu Val Val Gln His Ser Arg Asp Asn Glu
850 855 860 865
acg ttc agg gag gcg ctg ctg aat atc acc gag ccc gag ggc cgc tgg 11575
Thr Phe Arg Glu Ala Leu Leu Asn Ile Thr Glu Pro Glu Gly Arg Trp
870 875 880
ctc ctg gac ctg gtg aac att ctg cag agc atc gtg gtg cag gag cgc 11623
Leu Leu Asp Leu Val Asn Ile Leu Gln Ser Ile Val Val Gln Glu Arg
885 890 895
ggg ctg ccg ctg tcc gag aag ctg gcg gcc atc aac ttc tcg gtg ctg 11671
Gly Leu Pro Leu Ser Glu Lys Leu Ala Ala Ile Asn Phe Ser Val Leu
900 905 910
agc ctg ggc aag tac tac gct agg aag atc tac aag acc ccg tac gtg 11719
Ser Leu Gly Lys Tyr Tyr Ala Arg Lys Ile Tyr Lys Thr Pro Tyr Val
915 920 925
ccc ata gac aag gag gtg aag atc gat ggg ttt tac atg cgc atg acc 11767
Pro Ile Asp Lys Glu Val Lys Ile Asp Gly Phe Tyr Met Arg Met Thr
930 935 940 945
ctg aaa gtg ctg acc ctg agc gac gat ctg ggg gtg tac cgc aac gac 11815
Leu Lys Val Leu Thr Leu Ser Asp Asp Leu Gly Val Tyr Arg Asn Asp
950 955 960
agg atg cac cgc gcg gtg agc gcc agc cgc cgg cgc gag ctg agc gac 11863
Arg Met His Arg Ala Val Ser Ala Ser Arg Arg Arg Glu Leu Ser Asp
965 970 975
cag gag ctg atg cac agc ctg cag cgg gcc ctg acc ggg gcc ggg acc 11911
Gln Glu Leu Met His Ser Leu Gln Arg Ala Leu Thr Gly Ala Gly Thr
980 985 990
gag ggg gag agc tac ttt gac atg ggc gcg gac ctg cgc tgg cag ccc 11959
Glu Gly Glu Ser Tyr Phe Asp Met Gly Ala Asp Leu Arg Trp Gln Pro
995 1000 1005
agc cgc cgg gcc ttg gaa gct gcc ggc ggc gtg ccc tac gtg gag 12004
Ser Arg Arg Ala Leu Glu Ala Ala Gly Gly Val Pro Tyr Val Glu
1010 1015 1020
gag gtg gac gat gag gag gag gag ggc gag tac ctg gaa gac 12046
Glu Val Asp Asp Glu Glu Glu Glu Gly Glu Tyr Leu Glu Asp
1025 1030 1035
tgatggcgcg accgtatttt tgctag atg cag caa cag cca ccg cct cct 12096
Met Gln Gln Gln Pro Pro Pro Pro
1040 1045
gat ccc gcg atg cgg gcg gcg ctg cag agc cag ccg tcc ggc att 12141
Asp Pro Ala Met Arg Ala Ala Leu Gln Ser Gln Pro Ser Gly Ile
1050 1055 1060
aac tcc tcg gac gat tgg acc cag gcc atg caa cgc atc atg gcg 12186
Asn Ser Ser Asp Asp Trp Thr Gln Ala Met Gln Arg Ile Met Ala
1065 1070 1075
ctg acg acc cgc aat ccc gaa gcc ttt aga cag cag cct cag gcc 12231
Leu Thr Thr Arg Asn Pro Glu Ala Phe Arg Gln Gln Pro Gln Ala
1080 1085 1090
aac cgg ctc tcg gcc atc ctg gag gcc gtg gtg ccc tcg cgc tcg 12276
Asn Arg Leu Ser Ala Ile Leu Glu Ala Val Val Pro Ser Arg Ser
1095 1100 1105
aac ccc acg cac gag aag gtg ctg gcc atc gtg aac gcg ctg gtg 12321
Asn Pro Thr His Glu Lys Val Leu Ala Ile Val Asn Ala Leu Val
1110 1115 1120
gag aac aag gcc atc cgc ggc gac gag gcc ggg ctg gtg tac aac 12366
Glu Asn Lys Ala Ile Arg Gly Asp Glu Ala Gly Leu Val Tyr Asn
1125 1130 1135
gcg ctg ctg gag cgc gtg gcc cgc tac aac agc acc aac gtg cag 12411
Ala Leu Leu Glu Arg Val Ala Arg Tyr Asn Ser Thr Asn Val Gln
1140 1145 1150
acg aac ctg gac cgc atg gtg acc gac gtg cgc gag gcg gtg tcg 12456
Thr Asn Leu Asp Arg Met Val Thr Asp Val Arg Glu Ala Val Ser
1155 1160 1165
cag cgc gag cgg ttc cac cgc gag tcg aac ctg ggc tcc atg gtg 12501
Gln Arg Glu Arg Phe His Arg Glu Ser Asn Leu Gly Ser Met Val
1170 1175 1180
gcg ctg aac gcc ttc ctg agc acg cag ccc gcc aac gtg ccc cgg 12546
Ala Leu Asn Ala Phe Leu Ser Thr Gln Pro Ala Asn Val Pro Arg
1185 1190 1195
ggc cag gag gac tac acc aac ttc atc agc gcg ctg cgg ctg atg 12591
Gly Gln Glu Asp Tyr Thr Asn Phe Ile Ser Ala Leu Arg Leu Met
1200 1205 1210
gtg gcc gag gtg ccc cag agc gag gtg tac cag tcg ggg ccg gac 12636
Val Ala Glu Val Pro Gln Ser Glu Val Tyr Gln Ser Gly Pro Asp
1215 1220 1225
tac ttc ttc cag acc agt cgc cag ggc ttg cag acc gtg aac ctg 12681
Tyr Phe Phe Gln Thr Ser Arg Gln Gly Leu Gln Thr Val Asn Leu
1230 1235 1240
agc cag gct ttc aag aac ttg cag gga ctg tgg ggc gtg cag gcc 12726
Ser Gln Ala Phe Lys Asn Leu Gln Gly Leu Trp Gly Val Gln Ala
1245 1250 1255
ccg gtc ggg gac cgc gcg acg gtg tcg agc ctg ctg acg ccg aac 12771
Pro Val Gly Asp Arg Ala Thr Val Ser Ser Leu Leu Thr Pro Asn
1260 1265 1270
tcg cgc ctg ctg ctg ctg ctg gtg gcg ccc ttc acg gac agc ggc 12816
Ser Arg Leu Leu Leu Leu Leu Val Ala Pro Phe Thr Asp Ser Gly
1275 1280 1285
agc gtg agc cgc gac tcg tac ctg ggc tac ctg ctt aac ctg tac 12861
Ser Val Ser Arg Asp Ser Tyr Leu Gly Tyr Leu Leu Asn Leu Tyr
1290 1295 1300
cgc gag gcc atc ggg cag gcg cac gtg gac gag cag acc tac cag 12906
Arg Glu Ala Ile Gly Gln Ala His Val Asp Glu Gln Thr Tyr Gln
1305 1310 1315
gag atc acc cac gtg agc cgc gcg ctg ggc cag gag gac ccg ggc 12951
Glu Ile Thr His Val Ser Arg Ala Leu Gly Gln Glu Asp Pro Gly
1320 1325 1330
aac ctg gag gcc acc ctg aac ttc ctg ctg acc aac cgg tcg cag 12996
Asn Leu Glu Ala Thr Leu Asn Phe Leu Leu Thr Asn Arg Ser Gln
1335 1340 1345
aag atc ccg ccc cag tac gcg ctg agc acc gag gag gag cgc atc 13041
Lys Ile Pro Pro Gln Tyr Ala Leu Ser Thr Glu Glu Glu Arg Ile
1350 1355 1360
ctg cgc tac gtg cag cag agc gtg ggg ctg ttc ctg atg cag gag 13086
Leu Arg Tyr Val Gln Gln Ser Val Gly Leu Phe Leu Met Gln Glu
1365 1370 1375
ggg gcc acg ccc agc gcc gcg ctc gac atg acc gcg cgc aac atg 13131
Gly Ala Thr Pro Ser Ala Ala Leu Asp Met Thr Ala Arg Asn Met
1380 1385 1390
gag ccc agc atg tac gcc cgc aac cgc ccg ttc atc aat aag ctg 13176
Glu Pro Ser Met Tyr Ala Arg Asn Arg Pro Phe Ile Asn Lys Leu
1395 1400 1405
atg gac tac ttg cat cgg gcg gcc gcc atg aac tcg gac tac ttt 13221
Met Asp Tyr Leu His Arg Ala Ala Ala Met Asn Ser Asp Tyr Phe
1410 1415 1420
acc aac gcc atc ttg aac ccg cac tgg ctc ccg ccg ccc ggg ttc 13266
Thr Asn Ala Ile Leu Asn Pro His Trp Leu Pro Pro Pro Gly Phe
1425 1430 1435
tac acg ggc gag tac gac atg ccc gac ccc aac gac ggg ttc ctg 13311
Tyr Thr Gly Glu Tyr Asp Met Pro Asp Pro Asn Asp Gly Phe Leu
1440 1445 1450
tgg gac gac gtg gac agc agc gtg ttc tcg ccg cgc ccc acc acc 13356
Trp Asp Asp Val Asp Ser Ser Val Phe Ser Pro Arg Pro Thr Thr
1455 1460 1465
acc gtg tgg aag aaa gag ggc ggg gac cgg cgg ccg tcc tcg gcg 13401
Thr Val Trp Lys Lys Glu Gly Gly Asp Arg Arg Pro Ser Ser Ala
1470 1475 1480
ctg tcc ggt cgc gcg ggt gct gcc gcg gcg gtg ccc gag gcc gcc 13446
Leu Ser Gly Arg Ala Gly Ala Ala Ala Ala Val Pro Glu Ala Ala
1485 1490 1495
agc ccc ttc ccg agc ctg ccc ttt tcg ctg aac agc gtg cgc agc 13491
Ser Pro Phe Pro Ser Leu Pro Phe Ser Leu Asn Ser Val Arg Ser
1500 1505 1510
agc gag ctg ggt cgg ctg acg cgg ccg cgc ctg ctg ggc gag gag 13536
Ser Glu Leu Gly Arg Leu Thr Arg Pro Arg Leu Leu Gly Glu Glu
1515 1520 1525
gag tac ctg aac gac tcc ttg ttg agg ccc gag cgc gag aaa aac 13581
Glu Tyr Leu Asn Asp Ser Leu Leu Arg Pro Glu Arg Glu Lys Asn
1530 1535 1540
ttc ccc aat aac ggg ata gag agc ctg gtg gac aag atg agc cgc 13626
Phe Pro Asn Asn Gly Ile Glu Ser Leu Val Asp Lys Met Ser Arg
1545 1550 1555
tgg aag acg tac gcg cac gag cac agg gac gag ccc cga gct agc 13671
Trp Lys Thr Tyr Ala His Glu His Arg Asp Glu Pro Arg Ala Ser
1560 1565 1570
agc agc acc ggc gcc cgt aga cgc cag cgg cac gac agg cag cgg 13716
Ser Ser Thr Gly Ala Arg Arg Arg Gln Arg His Asp Arg Gln Arg
1575 1580 1585
gga ctg gtg tgg gac gat gag gat tcc gcc gac gac agc agc gtg 13761
Gly Leu Val Trp Asp Asp Glu Asp Ser Ala Asp Asp Ser Ser Val
1590 1595 1600
ttg gac ttg ggt ggg agt ggt ggt ggt aac ccg ttc gct cac ctg 13806
Leu Asp Leu Gly Gly Ser Gly Gly Gly Asn Pro Phe Ala His Leu
1605 1610 1615
cgc ccc cgt atc ggg cgc ctg atg taagaatctg aaaaaataaa aaaacggtac 13860
Arg Pro Arg Ile Gly Arg Leu Met
1620
tcaccaaggc catggcgacc agcgtgcgtt cttctctgtt gtttgtagta gt atg 13915
Met
1625
atg agg cgc gtg tac ccg gag ggt cct cct ccc tcg tac gag agc 13960
Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser
1630 1635 1640
gtg atg cag cag gcg gtg gcg gcg gcg atg cag ccc ccg ctg gag 14005
Val Met Gln Gln Ala Val Ala Ala Ala Met Gln Pro Pro Leu Glu
1645 1650 1655
gcg cct tac gtg ccc ccg cgg tac ctg gcg cct acg gag ggg cgg 14050
Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg
1660 1665 1670
aac agc att cgt tac tcg gag ctg gca ccc ttg tac gat acc acc 14095
Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr
1675 1680 1685
cgg ttg tac ctg gtg gac aac aag tcg gcg gac atc gcc tcg ctg 14140
Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu
1690 1695 1700
aac tac cag aac gac cac agc aac ttc ctg acc acc gtg gtg cag 14185
Asn Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln
1705 1710 1715
aac aac gat ttc acc ccc acg gag gcc agc acc cag acc atc aac 14230
Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn
1720 1725 1730
ttt gac gag cgc tcg cgg tgg ggc ggc cag ctg aaa acc atc atg 14275
Phe Asp Glu Arg Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile Met
1735 1740 1745
cac acc aac atg ccc aac gtg aac gag ttc atg tac agc aac aag 14320
His Thr Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser Asn Lys
1750 1755 1760
ttc aag gcg cgg gtg atg gtc tcg cgc aag acc ccc aac ggg gtc 14365
Phe Lys Ala Arg Val Met Val Ser Arg Lys Thr Pro Asn Gly Val
1765 1770 1775
aca gta aca gat ggt agt cag gac gag ctg acc tac gag tgg gtg 14410
Thr Val Thr Asp Gly Ser Gln Asp Glu Leu Thr Tyr Glu Trp Val
1780 1785 1790
gag ttt gag ctg ccc gag ggc aac ttc tcg gtg acc atg acc atc 14455
Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser Val Thr Met Thr Ile
1795 1800 1805
gat ctg atg aac aac gcc atc atc gac aac tac ttg gcg gtg gga 14500
Asp Leu Met Asn Asn Ala Ile Ile Asp Asn Tyr Leu Ala Val Gly
1810 1815 1820
cgg cag aac ggg gtg ctg gag agc gac atc ggc gtg aag ttc gac 14545
Arg Gln Asn Gly Val Leu Glu Ser Asp Ile Gly Val Lys Phe Asp
1825 1830 1835
acg cgc aac ttc cgg ctg ggc tgg gac ccc gtg acc gag ctg gtg 14590
Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr Glu Leu Val
1840 1845 1850
atg ccg ggc gtg tac acc aac gag gcc ttc cac ccc gac atc gtc 14635
Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp Ile Val
1855 1860 1865
ctg ctg ccc ggc tgc ggc gtg gac ttc acc gag agc cgc ctc agc 14680
Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg Leu Ser
1870 1875 1880
aac ctg ctg ggc atc cgc aag cgg cag ccc ttc cag gag ggc ttc 14725
Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu Gly Phe
1885 1890 1895
cag atc ctg tac gag gac ctg gag ggg ggc aac atc ccc gcg ctg 14770
Gln Ile Leu Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala Leu
1900 1905 1910
ctg gac gtc gaa gcc tac gag aaa agc aag gag gag gcc gcc gca 14815
Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Glu Ala Ala Ala
1915 1920 1925
gcg gcg acc gcg gcc gtg gct act gct gcg acc acc gat gca gat 14860
Ala Ala Thr Ala Ala Val Ala Thr Ala Ala Thr Thr Asp Ala Asp
1930 1935 1940
gca gct act act acc agg ggc gat aca ttc gcc acc cag gcg gag 14905
Ala Ala Thr Thr Thr Arg Gly Asp Thr Phe Ala Thr Gln Ala Glu
1945 1950 1955
gaa gca gcc gcc cta gcg gcg acc gat gat agt gaa agt aag ata 14950
Glu Ala Ala Ala Leu Ala Ala Thr Asp Asp Ser Glu Ser Lys Ile
1960 1965 1970
gtc atc aag ccg gtg gag aag gac agc aag gac agg agc tac aac 14995
Val Ile Lys Pro Val Glu Lys Asp Ser Lys Asp Arg Ser Tyr Asn
1975 1980 1985
gtt cta tcg gat gga aag aac acc gcc tac cgc agc tgg tac ctg 15040
Val Leu Ser Asp Gly Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu
1990 1995 2000
gcc tac aac tac ggc gac cct gag aag ggc gtg cgc tcc tgg acg 15085
Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr
2005 2010 2015
ctg ctc acc acc tcg gac gtc acc tgc ggc gtg gag caa gtc tac 15130
Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val Tyr
2020 2025 2030
tgg tcg ctg ccc gac atg atg caa gac ccg gtc acc ttc cgc tcc 15175
Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser
2035 2040 2045
acg cgt caa gtt agc aac tac ccg gtg gtg ggc gcc gag ctc ctg 15220
Thr Arg Gln Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu
2050 2055 2060
ccc gtc tac tcc aag agc ttc ttc aac gag cag gcc gtc tac tcg 15265
Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser
2065 2070 2075
cag cag ctg cgc gcc ttc acc tcg ctc acg cac gtc ttc aac cgc 15310
Gln Gln Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg
2080 2085 2090
ttc ccc gag aac cag atc ctc gtc cgc ccg ccc gcg ccc acc att 15355
Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile
2095 2100 2105
acc acc gtc agt gaa aac gtt cct gct ctc aca gat cac ggg acc 15400
Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr
2110 2115 2120
ctg ccg ctg cgc agc agt atc cgg gga gtc cag cgc gtg acc gtc 15445
Leu Pro Leu Arg Ser Ser Ile Arg Gly Val Gln Arg Val Thr Val
2125 2130 2135
act gac gcc aga cgc cgc acc tgc ccc tac gtc tac aag gcc ctg 15490
Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu
2140 2145 2150
ggc gta gtc gcg ccg cgc gtc ctc tcg agc cgc acc ttc taaaaa atg 15538
Gly Val Val Ala Pro Arg Val Leu Ser Ser Arg Thr Phe Met
2155 2160
tcc att ctc atc tcg ccc agt aat aac acc ggt tgg ggc ctg cgc 15583
Ser Ile Leu Ile Ser Pro Ser Asn Asn Thr Gly Trp Gly Leu Arg
2165 2170 2175
gcg ccc agc aag atg tac gga ggc gct cgc caa cgc tcc acg caa 15628
Ala Pro Ser Lys Met Tyr Gly Gly Ala Arg Gln Arg Ser Thr Gln
2180 2185 2190
cac ccc gtg cgc gtg cgc ggg cac ttc cgc gct ccc tgg ggc gcc 15673
His Pro Val Arg Val Arg Gly His Phe Arg Ala Pro Trp Gly Ala
2195 2200 2205
ctc aag ggc cgc gtg cgc tcg cgc acc acc gtc gac gac gtg atc 15718
Leu Lys Gly Arg Val Arg Ser Arg Thr Thr Val Asp Asp Val Ile
2210 2215 2220
gac cag gtg gtg gcc gac gcg cgc aac tac acg ccc gcc gcc gcg 15763
Asp Gln Val Val Ala Asp Ala Arg Asn Tyr Thr Pro Ala Ala Ala
2225 2230 2235
ccc gcc tcc acc gtg gac gcc gtc atc gac agc gtg gtg gcc gac 15808
Pro Ala Ser Thr Val Asp Ala Val Ile Asp Ser Val Val Ala Asp
2240 2245 2250
gcg cgc cgg tac gcc cgc gcc aag agc cgg cgg cgg cgc atc gcc 15853
Ala Arg Arg Tyr Ala Arg Ala Lys Ser Arg Arg Arg Arg Ile Ala
2255 2260 2265
cgg cgg cac cgg agc acc ccc gcc atg cgc gcg gcg cga gcc ttg 15898
Arg Arg His Arg Ser Thr Pro Ala Met Arg Ala Ala Arg Ala Leu
2270 2275 2280
ctg cgc agg gcc agg cgc acg gga cgc agg gcc atg ctc agg gcg 15943
Leu Arg Arg Ala Arg Arg Thr Gly Arg Arg Ala Met Leu Arg Ala
2285 2290 2295
gcc aga cgc gcg gcc tcc ggc agc agc agc gcc ggc agg acc cgc 15988
Ala Arg Arg Ala Ala Ser Gly Ser Ser Ser Ala Gly Arg Thr Arg
2300 2305 2310
aga cgc gcg gcc acg gcg gcg gcg gcg gcc atc gcc agc atg tcc 16033
Arg Arg Ala Ala Thr Ala Ala Ala Ala Ala Ile Ala Ser Met Ser
2315 2320 2325
cgc ccg cgg cgc ggc aac gtg tac tgg gtg cgc gac gcc gcc acc 16078
Arg Pro Arg Arg Gly Asn Val Tyr Trp Val Arg Asp Ala Ala Thr
2330 2335 2340
ggt gtg cgc gtg ccc gtg cgc acc cgc ccc cct cgc act tgaagatgct 16127
Gly Val Arg Val Pro Val Arg Thr Arg Pro Pro Arg Thr
2345 2350 2355
gacttcgcga tgttgatgtg tcccagcggc gaggagg atg tcc aag cgc aaa tac 16182
Met Ser Lys Arg Lys Tyr
2360
aag gaa gag atg ctc cag gtc atc gcg cct gag atc tac ggc ccc 16227
Lys Glu Glu Met Leu Gln Val Ile Ala Pro Glu Ile Tyr Gly Pro
2365 2370 2375
gcg gcg gcg gtg aag gag gaa aga aag ccc cgc aag ctg aag cgg 16272
Ala Ala Ala Val Lys Glu Glu Arg Lys Pro Arg Lys Leu Lys Arg
2380 2385 2390
gtc aaa aag gac aaa aag gag gag gaa gat gtg aac gga ctg gtg 16317
Val Lys Lys Asp Lys Lys Glu Glu Glu Asp Val Asn Gly Leu Val
2395 2400 2405
gag ttt gtg cgc gag ttc gcc ccc cgg cgg cgc gtg cag tgg cgc 16362
Glu Phe Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gln Trp Arg
2410 2415 2420
ggg cgg aaa gtg aaa ccg gtg ctg cgg ccc ggc acc acg gtg gtc 16407
Gly Arg Lys Val Lys Pro Val Leu Arg Pro Gly Thr Thr Val Val
2425 2430 2435
ttc acg ccc ggc gag cgt tcc ggc tcc gcc tcc aag cgc tcc tac 16452
Phe Thr Pro Gly Glu Arg Ser Gly Ser Ala Ser Lys Arg Ser Tyr
2440 2445 2450
gac gag gtg tac ggg gac gag gac atc ctc gag cag gcg gtc gag 16497
Asp Glu Val Tyr Gly Asp Glu Asp Ile Leu Glu Gln Ala Val Glu
2455 2460 2465
cgt ctg ggc gag ttt gcg tac ggc aag cgc agc cgc ccc gcg ccc 16542
Arg Leu Gly Glu Phe Ala Tyr Gly Lys Arg Ser Arg Pro Ala Pro
2470 2475 2480
ttg aaa gag gag gcg gtg tcc atc ccg ctg gac cac ggc aac ccc 16587
Leu Lys Glu Glu Ala Val Ser Ile Pro Leu Asp His Gly Asn Pro
2485 2490 2495
acg ccg agc ctg aag ccg gtg acc ctg cag cag gtg ctg cct ggt 16632
Thr Pro Ser Leu Lys Pro Val Thr Leu Gln Gln Val Leu Pro Gly
2500 2505 2510
gcg gcg ccg cgc cgg ggc ttc aaa cgc gag ggc ggc gag gat ctg 16677
Ala Ala Pro Arg Arg Gly Phe Lys Arg Glu Gly Gly Glu Asp Leu
2515 2520 2525
tac ccg acc atg cag ctg atg gtg ccc aag cgc cag aag ctg gag 16722
Tyr Pro Thr Met Gln Leu Met Val Pro Lys Arg Gln Lys Leu Glu
2530 2535 2540
gac gtg ctg gag cac atg aag gtg gac ccc gag gtg cag ccc gag 16767
Asp Val Leu Glu His Met Lys Val Asp Pro Glu Val Gln Pro Glu
2545 2550 2555
gtc aag gtg cgg ccc atc aag cag gtg gct ccg ggc ctg ggc gtg 16812
Val Lys Val Arg Pro Ile Lys Gln Val Ala Pro Gly Leu Gly Val
2560 2565 2570
cag acc gtg gac atc aag atc ccc acg gag ccc atg gaa acg cag 16857
Gln Thr Val Asp Ile Lys Ile Pro Thr Glu Pro Met Glu Thr Gln
2575 2580 2585
acc gag ccc gtg aag ccc agc acc agc acc atg gag gtg cag acg 16902
Thr Glu Pro Val Lys Pro Ser Thr Ser Thr Met Glu Val Gln Thr
2590 2595 2600
gat ccc tgg atg ccg gcg ccg gct tcc acc acc acc acc cgc cga 16947
Asp Pro Trp Met Pro Ala Pro Ala Ser Thr Thr Thr Thr Arg Arg
2605 2610 2615
aga cgc aag tac ggc gcg gcc agc ctg ctg atg ccc aac tac gcg 16992
Arg Arg Lys Tyr Gly Ala Ala Ser Leu Leu Met Pro Asn Tyr Ala
2620 2625 2630
ctg cat cct tcc atc atc ccc acg ccg ggc tac cgc ggc acg cgc 17037
Leu His Pro Ser Ile Ile Pro Thr Pro Gly Tyr Arg Gly Thr Arg
2635 2640 2645
ttc tac cgc ggc tac acc agc agc cgc cgc cgc aag acc acc acc 17082
Phe Tyr Arg Gly Tyr Thr Ser Ser Arg Arg Arg Lys Thr Thr Thr
2650 2655 2660
cgc cgc cgt cgt cgc agc cgc cgc agc agc acc gcg act tcc gcc 17127
Arg Arg Arg Arg Arg Ser Arg Arg Ser Ser Thr Ala Thr Ser Ala
2665 2670 2675
gcc gcc ctg gtg cgg aga gtg tac cgc agc ggg cgc gag cct ctg 17172
Ala Ala Leu Val Arg Arg Val Tyr Arg Ser Gly Arg Glu Pro Leu
2680 2685 2690
acc ctg ccg cgc gcg cgc tac cac ccg agc atc gcc att taactctgcc 17221
Thr Leu Pro Arg Ala Arg Tyr His Pro Ser Ile Ala Ile
2695 2700 2705
gtcgcctcct tgcagat atg gcc ctc aca tgc cgc ctc cgc gtc ccc att 17271
Met Ala Leu Thr Cys Arg Leu Arg Val Pro Ile
2710 2715
acg ggc tac cga gga aga aag ccg cgc cgt aga agg ctg acg ggg 17316
Thr Gly Tyr Arg Gly Arg Lys Pro Arg Arg Arg Arg Leu Thr Gly
2720 2725 2730
aac ggg ctg cgt cgc cat cac cac cgg cgg cgg cgc gcc atc agc 17361
Asn Gly Leu Arg Arg His His His Arg Arg Arg Arg Ala Ile Ser
2735 2740 2745
aag cgg ttg ggg gga ggc ttc ctg ccc gcg ctg atc ccc atc atc 17406
Lys Arg Leu Gly Gly Gly Phe Leu Pro Ala Leu Ile Pro Ile Ile
2750 2755 2760
gcc gcg gcg atc ggg gcg atc ccc ggc ata gct tcc gtg gcg gtg 17451
Ala Ala Ala Ile Gly Ala Ile Pro Gly Ile Ala Ser Val Ala Val
2765 2770 2775
cag gcc tct cag cgc cac tgagacacag cttggaaaat ttgtaataaa 17499
Gln Ala Ser Gln Arg His
2780
aaaatggact gacgctcctg gtcctgtgat gtgtgttttt ag atg gaa gac atc 17553
Met Glu Asp Ile
2785
aat ttt tcg tcc ctg gca ccg cga cac ggc acg cgg ccg ttt atg 17598
Asn Phe Ser Ser Leu Ala Pro Arg His Gly Thr Arg Pro Phe Met
2790 2795 2800
ggc acc tgg agc gac atc ggc aac agc caa ctg aac ggg ggc gcc 17643
Gly Thr Trp Ser Asp Ile Gly Asn Ser Gln Leu Asn Gly Gly Ala
2805 2810 2815
ttc aat tgg agc agt ctc tgg agc ggg ctt aag aat ttc ggg tcc 17688
Phe Asn Trp Ser Ser Leu Trp Ser Gly Leu Lys Asn Phe Gly Ser
2820 2825 2830
acg ctc aaa acc tat ggc aac aag gcg tgg aac agc agc aca ggg 17733
Thr Leu Lys Thr Tyr Gly Asn Lys Ala Trp Asn Ser Ser Thr Gly
2835 2840 2845
cag gcg ctg agg gaa aag ctg aaa gag cag aac ttc cag cag aag 17778
Gln Ala Leu Arg Glu Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys
2850 2855 2860
gtg gtc gat ggc ctg gcc tcg ggc atc aac ggg gtg gtg gac ctg 17823
Val Val Asp Gly Leu Ala Ser Gly Ile Asn Gly Val Val Asp Leu
2865 2870 2875
gcc aac cag gcc gtg cag aaa cag atc aac agc cgc ctg gac gcg 17868
Ala Asn Gln Ala Val Gln Lys Gln Ile Asn Ser Arg Leu Asp Ala
2880 2885 2890
gtc ccg ccc gcg ggg tcc gtg gag atg ccc cag gtg gag gag gag 17913
Val Pro Pro Ala Gly Ser Val Glu Met Pro Gln Val Glu Glu Glu
2895 2900 2905
ctg cct ccc ctg gac aag cgc ggc gac aag cga ccg cgt ccc gac 17958
Leu Pro Pro Leu Asp Lys Arg Gly Asp Lys Arg Pro Arg Pro Asp
2910 2915 2920
gcg gag gag acg ctg ctg acg cac acg gac gag ccg ccc ccg tac 18003
Ala Glu Glu Thr Leu Leu Thr His Thr Asp Glu Pro Pro Pro Tyr
2925 2930 2935
gag gag gcg gtg aaa ctg ggt ctg ccc acc acg cgg ccc gtg gcg 18048
Glu Glu Ala Val Lys Leu Gly Leu Pro Thr Thr Arg Pro Val Ala
2940 2945 2950
cct ctg gcc acc ggg gtg ctg aaa ccc agc agc agc agc cag ccc 18093
Pro Leu Ala Thr Gly Val Leu Lys Pro Ser Ser Ser Ser Gln Pro
2955 2960 2965
gcg acc ctg gac ttg cct ccg cct cgc ccc tcc aca gtg gct aag 18138
Ala Thr Leu Asp Leu Pro Pro Pro Arg Pro Ser Thr Val Ala Lys
2970 2975 2980
ccc ctg ccg ccg gtg gcc gtc gcg tcg cgc gcc ccc cga ggc cgc 18183
Pro Leu Pro Pro Val Ala Val Ala Ser Arg Ala Pro Arg Gly Arg
2985 2990 2995
ccc cag gcg aac tgg cag agc act ctg aac agc atc gtg ggt ctg 18228
Pro Gln Ala Asn Trp Gln Ser Thr Leu Asn Ser Ile Val Gly Leu
3000 3005 3010
gga gtg cag agt gtg aag cgc cgc cgc tgc tat taaaagacac 18271
Gly Val Gln Ser Val Lys Arg Arg Arg Cys Tyr
3015 3020
tgtagcgctt aacttgcttg tctgtgtgta tatgtatgtc cgccgaccag aaggaggagg 18331
aagaggcgcg tcgccgagtt gcaag atg gcc acc cca tcg atg ctg ccc cag 18383
Met Ala Thr Pro Ser Met Leu Pro Gln
3025 3030
tgg gcg tac atg cac atc gcc gga cag gac gct tcg gag tac ctg 18428
Trp Ala Tyr Met His Ile Ala Gly Gln Asp Ala Ser Glu Tyr Leu
3035 3040 3045
agt ccg ggt ctg gtg cag ttc gcc cgc gcc aca gac acc tac ttc 18473
Ser Pro Gly Leu Val Gln Phe Ala Arg Ala Thr Asp Thr Tyr Phe
3050 3055 3060
agt ctg ggg aac aag ttt agg aac ccc acg gtg gcg ccc acg cac 18518
Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His
3065 3070 3075
gat gtg acc acc gac cgc agc cag cgg ctg acg ctg cgc ttc gtg 18563
Asp Val Thr Thr Asp Arg Ser Gln Arg Leu Thr Leu Arg Phe Val
3080 3085 3090
ccc gtg gac cgc gag gac aac acc tac tcg tac aaa gtg cgc tac 18608
Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg Tyr
3095 3100 3105
acg ctg gcc gtg ggc gac aac cgc gtg ctg gac atg gcc agc acc 18653
Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr
3110 3115 3120
tac ttt gac atc cgc ggc gtg ctg gat cgg ggc ccc agc ttc aaa 18698
Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser Phe Lys
3125 3130 3135
ccc tac tcc ggc acc gcc tac aac agc ctg gct ccc aag gga gcg 18743
Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly Ala
3140 3145 3150
ccc aac act tgc cag tgg aca tat act gat aac caa act gag aaa 18788
Pro Asn Thr Cys Gln Trp Thr Tyr Thr Asp Asn Gln Thr Glu Lys
3155 3160 3165
aca gcc aca tat gga aat gca ccc gta gag ggc att aac att aca 18833
Thr Ala Thr Tyr Gly Asn Ala Pro Val Glu Gly Ile Asn Ile Thr
3170 3175 3180
aaa gat ggc att caa ctt gga act gac agc gat ggt cag gca atc 18878
Lys Asp Gly Ile Gln Leu Gly Thr Asp Ser Asp Gly Gln Ala Ile
3185 3190 3195
tat gca gac gaa act tat cag ccc gaa cct cag gtg gga gat cct 18923
Tyr Ala Asp Glu Thr Tyr Gln Pro Glu Pro Gln Val Gly Asp Pro
3200 3205 3210
gaa tgg cat gat acc aca ggt aca gaa gaa aaa tat gga ggc aga 18968
Glu Trp His Asp Thr Thr Gly Thr Glu Glu Lys Tyr Gly Gly Arg
3215 3220 3225
gcg ctt aaa cct gcc acc gac atg aaa cct tgc tat ggc tct ttt 19013
Ala Leu Lys Pro Ala Thr Asp Met Lys Pro Cys Tyr Gly Ser Phe
3230 3235 3240
gcc aag cca act aat gtt aag gga ggt cag gcc aaa agc aga aca 19058
Ala Lys Pro Thr Asn Val Lys Gly Gly Gln Ala Lys Ser Arg Thr
3245 3250 3255
aaa act gat gga aca act gag cct gat att gac atg gcc ttt ttt 19103
Lys Thr Asp Gly Thr Thr Glu Pro Asp Ile Asp Met Ala Phe Phe
3260 3265 3270
gat ggt aga aat gca aca aca gct ggt ttg act cca gaa att gtt 19148
Asp Gly Arg Asn Ala Thr Thr Ala Gly Leu Thr Pro Glu Ile Val
3275 3280 3285
ttg tat act gaa aat gtg gat ctg gaa act cca gat acc cat att 19193
Leu Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His Ile
3290 3295 3300
gta tac aag gca ggc aca gat gac agc agc tct tca att aat ttg 19238
Val Tyr Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser Ile Asn Leu
3305 3310 3315
ggt cag cag tcc atg ccc aac aga ccc aac tac att ggg ttt aga 19283
Gly Gln Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg
3320 3325 3330
gac aac ttc att ggg ctc atg tac tac aac agc act ggc aat atg 19328
Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met
3335 3340 3345
ggc gta ctg gct ggt cag gcc tcc cag ctg aat gct gtg gtg gac 19373
Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp
3350 3355 3360
ttg cag gac aga aac acc gaa ctg tcc tac cag ctc ttg ctt gac 19418
Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp
3365 3370 3375
tct ctg ggt gac aga acc agg tat ttc agt atg tgg aat cag gcg 19463
Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala
3380 3385 3390
gtg gac agt tat gac ccc gat gtg cgc att att gaa aat cac ggt 19508
Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly
3395 3400 3405
gtg gag gat gaa cta ccc aac tat tgc ttc ccc ctg aat gct gtg 19553
Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asn Ala Val
3410 3415 3420
ggt aga aca gat agt tat cag gga att aag ccc aat gga ggc gat 19598
Gly Arg Thr Asp Ser Tyr Gln Gly Ile Lys Pro Asn Gly Gly Asp
3425 3430 3435
cca gct aca tgg gcc aaa gat gaa agc gtc aat gat tct aat gaa 19643
Pro Ala Thr Trp Ala Lys Asp Glu Ser Val Asn Asp Ser Asn Glu
3440 3445 3450
ttg ggc aag ggc aat cct ttc gcc atg gag atc aac atc cag gcc 19688
Leu Gly Lys Gly Asn Pro Phe Ala Met Glu Ile Asn Ile Gln Ala
3455 3460 3465
aac ctg tgg cgg aac ttc ctc tac gcg aac gtg gcc ctg tac ctg 19733
Asn Leu Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu
3470 3475 3480
ccc gac tcc tac aag tac acg ccg gcc aac atc acg ctg ccc acc 19778
Pro Asp Ser Tyr Lys Tyr Thr Pro Ala Asn Ile Thr Leu Pro Thr
3485 3490 3495
aac acc aac acc tac gat tac atg aac ggc cgc gtg gtg gcg ccc 19823
Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro
3500 3505 3510
tcg ctg gtg gac gcc tac atc aac atc ggg gcg cgc tgg tcg ctg 19868
Ser Leu Val Asp Ala Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu
3515 3520 3525
gac ccc atg gac aac gtc aac ccc ttc aac cac cac cgc aac gcg 19913
Asp Pro Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala
3530 3535 3540
ggc ctg cgc tac cgc tcc atg ctc ctg ggc aac ggg cgc tac gtg 19958
Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val
3545 3550 3555
ccc ttc cac atc cag gtg ccc caa aag ttc ttc gcc atc aag agc 20003
Pro Phe His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Ser
3560 3565 3570
ctc ctg ctc ctg ccc ggg tcc tac acc tac gag tgg aac ttc cgc 20048
Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg
3575 3580 3585
aag gac gtc aac atg atc ctg cag agc tcc ctc ggc aac gac ctg 20093
Lys Asp Val Asn Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu
3590 3595 3600
cgc acg gac ggg gcc tcc atc gcc ttc acc agc atc aac ctc tac 20138
Arg Thr Asp Gly Ala Ser Ile Ala Phe Thr Ser Ile Asn Leu Tyr
3605 3610 3615
gcc acc ttc ttc ccc atg gcg cac aac acc gcc tcc acg ctc gag 20183
Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu
3620 3625 3630
gcc atg ctg cgc aac gac acc aac gac cag tcc ttc aac gac tac 20228
Ala Met Leu Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr
3635 3640 3645
ctc tcg gcg gcc aac atg ctc tac ccc atc ccg gcc aac gcc acc 20273
Leu Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr
3650 3655 3660
aac gtg ccc atc tcc atc ccc tcg cgc aac tgg gcc gcc ttc cgc 20318
Asn Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg
3665 3670 3675
ggc tgg tcc ttc acg cgc ctc aag acc cgc gag acg ccc tcg ctc 20363
Gly Trp Ser Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu
3680 3685 3690
ggc tcc ggg ttc gac ccc tac ttc gtc tac tcg ggc tcc atc ccc 20408
Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro
3695 3700 3705
tac ctc gac ggc acc ttc tac ctc aac cac acc ttc aag aag gtc 20453
Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val
3710 3715 3720
tcc atc acc ttc gac tcc tcc gtc agc tgg ccc ggc aac gac cgc 20498
Ser Ile Thr Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg
3725 3730 3735
ctc ctg acg ccc aac gag ttc gaa atc aag cgc acc gtc gac gga 20543
Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly
3740 3745 3750
gag ggg tac aac gtg gcc cag tgc aac atg acc aag gac tgg ttc 20588
Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr Lys Asp Trp Phe
3755 3760 3765
ctg gtc cag atg ctg gcc cac tac aac atc ggc tac cag ggc ttc 20633
Leu Val Gln Met Leu Ala His Tyr Asn Ile Gly Tyr Gln Gly Phe
3770 3775 3780
tac gtg ccc gag ggc tac aag gac cgc atg tac tcc ttc ttc cgc 20678
Tyr Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg
3785 3790 3795
aac ttc cag ccc atg agc cgc cag gtc gtg gac gag gtc aac tac 20723
Asn Phe Gln Pro Met Ser Arg Gln Val Val Asp Glu Val Asn Tyr
3800 3805 3810
aag gac tac cag gcc gtc acc ctg acc tac cag cat aac aac tcg 20768
Lys Asp Tyr Gln Ala Val Thr Leu Thr Tyr Gln His Asn Asn Ser
3815 3820 3825
ggc ttc gtc ggc tac ctc gcg ccc acc atg cgc cag ggt cag ccc 20813
Gly Phe Val Gly Tyr Leu Ala Pro Thr Met Arg Gln Gly Gln Pro
3830 3835 3840
tac ccc gcc aac tac ccc tac ccg ctc atc ggc aag agc gcc gtc 20858
Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val
3845 3850 3855
gcc agc gtc acc cag aaa aag ttc ctc tgc gac cgg gtc atg tgg 20903
Ala Ser Val Thr Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp
3860 3865 3870
cgc atc ccc ttc tcc agc aac ttc atg tcc atg ggc gcg ctc acc 20948
Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr
3875 3880 3885
gac ctc ggc cag aac atg ctc tac gcc aac tcc gcc cac gcg cta 20993
Asp Leu Gly Gln Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu
3890 3895 3900
gac atg aat ttc gaa gtc gac ccc atg gat gag tcc acc ctt ctc 21038
Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu
3905 3910 3915
tat gtt gtc ttc gaa gtc ttc gac gtc gtc cga gtg cac cag ccc 21083
Tyr Val Val Phe Glu Val Phe Asp Val Val Arg Val His Gln Pro
3920 3925 3930
cac cgc ggc gtc atc gag gcc gtc tac ctg cgc acg ccc ttc tcg 21128
His Arg Gly Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser
3935 3940 3945
gcc ggc aac gcc acc acc taagccgctc ttgcttcttg caag atg acg gcc 21179
Ala Gly Asn Ala Thr Thr Met Thr Ala
3950 3955
tgt ggc tcc ggc gag cag gag ctc agg gcc atc ctc cgc gac ctg 21224
Cys Gly Ser Gly Glu Gln Glu Leu Arg Ala Ile Leu Arg Asp Leu
3960 3965 3970
ggc tgc ggg ccc tgc ttc ctg ggc acc ttc gac aag cgc ttc ccg 21269
Gly Cys Gly Pro Cys Phe Leu Gly Thr Phe Asp Lys Arg Phe Pro
3975 3980 3985
gga ttc atg gcc ccg cac aag ctg gcc tgc gcc atc gtc aac acg 21314
Gly Phe Met Ala Pro His Lys Leu Ala Cys Ala Ile Val Asn Thr
3990 3995 4000
gcc ggc cgc gag acc ggg ggc gag cac tgg ctg gcc ttc gcc tgg 21359
Ala Gly Arg Glu Thr Gly Gly Glu His Trp Leu Ala Phe Ala Trp
4005 4010 4015
aac ccg cgc tcc cac acc tgc tac ctc ttc gac ccc ttc ggg ttc 21404
Asn Pro Arg Ser His Thr Cys Tyr Leu Phe Asp Pro Phe Gly Phe
4020 4025 4030
tcg gac gag cgc ctc aag cag atc tac cag ttc gag tac gag ggc 21449
Ser Asp Glu Arg Leu Lys Gln Ile Tyr Gln Phe Glu Tyr Glu Gly
4035 4040 4045
ctg ctg cgc cgc agc gcc ctg gcc acc gag gac cgc tgc gtc acc 21494
Leu Leu Arg Arg Ser Ala Leu Ala Thr Glu Asp Arg Cys Val Thr
4050 4055 4060
ctg gaa aag tcc acc cag acc gtg cag ggt ccg cgc tcg gcc gcc 21539
Leu Glu Lys Ser Thr Gln Thr Val Gln Gly Pro Arg Ser Ala Ala
4065 4070 4075
tgc ggg ctc ttc tgc tgc atg ttc ctg cac gcc ttc gtg cac tgg 21584
Cys Gly Leu Phe Cys Cys Met Phe Leu His Ala Phe Val His Trp
4080 4085 4090
ccc gac cgc ccc atg gac aag aac ccc acc atg aac ttg ctg acg 21629
Pro Asp Arg Pro Met Asp Lys Asn Pro Thr Met Asn Leu Leu Thr
4095 4100 4105
ggg gtg ccc aac ggc atg ctc cag tcg ccc cag gtg gaa ccc acc 21674
Gly Val Pro Asn Gly Met Leu Gln Ser Pro Gln Val Glu Pro Thr
4110 4115 4120
ctg cgc cgc aac cag gag gcg ctc tac cgc ttc ctc aat gcc cac 21719
Leu Arg Arg Asn Gln Glu Ala Leu Tyr Arg Phe Leu Asn Ala His
4125 4130 4135
tcc gcc tac ttt cgc tcc cac cgc gcg cgc atc gag aag gcc acc 21764
Ser Ala Tyr Phe Arg Ser His Arg Ala Arg Ile Glu Lys Ala Thr
4140 4145 4150
gcc ttc gac cgc atg aat caa gac atg taaaccgtgt gtatgtgaat 21811
Ala Phe Asp Arg Met Asn Gln Asp Met
4155 4160
gctttattca taataaacag cacatgttta tgccacctga ggctctgact ttatttagaa 21871
atcgaagggg ttctgccggc tctcggcatg gcccgcgggc agggatacgt tgcggaactg 21931
gtacttgggc agccacttga actcggggat cagcagcttc ggcacgggga ggtcggggaa 21991
cgagtcgctc cacagcttgc gcgtgagttg cagggcgccc agcaggtcgg gcgcggagat 22051
cttgaaatcg cagttgggac ccgcgttctg cgcgcgagag ttgcggtaca cggggttgca 22111
gcactggaac accattaggg ccgggtgctt cacgctcgcc agcaccgtcg cgtcggtgat 22171
gccctccacg tccagatcct cggcgttggc catcccgaag ggggtcatct tgcaggtctg 22231
ccgccccatg ctgggcacgc agccgggctt gtggttgcaa tcgcagtgca ggggaatcag 22291
catcatctgg gcctgctcgg agctcatgcc cgggtacatg gccttcatga aagcctccag 22351
ctggcggaag gcctgctgcg ccttgccgcc ctcggtgaag aagaccccgc aggacttgct 22411
agagaactgg ttggtggcgc agcccgcgtc gtgcacgcag cagcgcgcgt cgttgttggc 22471
cagctgcacc acgctgcgcc cccagcggtt ctgggtgatc ttggcccggt cggggttctc 22531
cttcagcgcg cgctgcccgt tctcgctcgc cacatccatc tcgatcgtgt gctccttctg 22591
gatcatcacg gtcccgtgca ggcaccgcag cttgccctcg gcctcggtgc atccgtgcag 22651
ccacagcgcg cagccggtgc tctcccagtt cttgtgggcg atctgggagt gcgagtgcac 22711
gaagccctgc aggaagcggc ccatcatcgt ggtcagggtc ttgttgctgg tgaaggtcag 22771
cgggatgccg cggtgctcct cgttcacata caggtggcag atgcggcggt acacctcgcc 22831
ctgctcgggc atcagctgga aggcggactt caggtcgctc tccacgcggt accgctccat 22891
cagcagcgtc atcacttcca tgcccttctc ccaggccgaa acgatcggca ggctcagggg 22951
gttcttcacc gttgtcatct tagtcgccgc cgccgaggtc agggggtcgt tctcgtccag 23011
ggtctcaaac actcgcttgc cgtccttctc ggtgatgcgc acggggggga aggcgaagcc 23071
cacggccgcc agctcctcct cggcctgcct ttcgtcctcg ctgtcctggc tgatgtcttg 23131
caaaggcaca tgcttggtct tgcggggttt ctttttgggc ggcagaggcg gcggcggaga 23191
cgtgctgggc gagcgcgagt tctcgctcac cacgactatt tcttctcctt ggccgtcgtc 23251
cgagaccacg cggcggtagg catgcctctt ctggggcaga ggcggaggcg acgggctctc 23311
gcggttcggc gggcggctgg cagagcccct tccgcgttcg ggggtgcgct cctggcggcg 23371
ctgctctgac tgacttcctc cgcggccggc cattatgttc tcctagggag caacaagc 23429
atg gag act cag cca tcg tcg cca aca tcg cca tct gcc ccc gcc 23474
Met Glu Thr Gln Pro Ser Ser Pro Thr Ser Pro Ser Ala Pro Ala
4165 4170 4175
gac gag aac cag cag cag aat gaa agc tta acc gcc ccg ccg ccc 23519
Asp Glu Asn Gln Gln Gln Asn Glu Ser Leu Thr Ala Pro Pro Pro
4180 4185 4190
agc ccc acc tcc gac gcg gcc cca gac atg caa gag atg gag gaa 23564
Ser Pro Thr Ser Asp Ala Ala Pro Asp Met Gln Glu Met Glu Glu
4195 4200 4205
tcc atc gag att gac ctg ggc tac gtg acg ccc gcg gag cac gag 23609
Ser Ile Glu Ile Asp Leu Gly Tyr Val Thr Pro Ala Glu His Glu
4210 4215 4220
gag gag ctg gca gcg cgc ttt tca gcc ccg gaa gag aac cac caa 23654
Glu Glu Leu Ala Ala Arg Phe Ser Ala Pro Glu Glu Asn His Gln
4225 4230 4235
gag cag cca gag cag gaa gca gag agc gag cag cag cag gct ggg 23699
Glu Gln Pro Glu Gln Glu Ala Glu Ser Glu Gln Gln Gln Ala Gly
4240 4245 4250
ctc gag cat ggc gac tac ctg agc ggg gca gag gac gtg ctc atc 23744
Leu Glu His Gly Asp Tyr Leu Ser Gly Ala Glu Asp Val Leu Ile
4255 4260 4265
aag cat ctg gcc cgc caa tgc atc atc gtc aag gac gcg ctg ctc 23789
Lys His Leu Ala Arg Gln Cys Ile Ile Val Lys Asp Ala Leu Leu
4270 4275 4280
gac cgc gcc gag gtg ccc ctc agc gtg gcg gag ctc agc cgc gcc 23834
Asp Arg Ala Glu Val Pro Leu Ser Val Ala Glu Leu Ser Arg Ala
4285 4290 4295
tac gag cgc aac ctc ttc tcg ccg cgc gtg ccc ccc aag cgc cag 23879
Tyr Glu Arg Asn Leu Phe Ser Pro Arg Val Pro Pro Lys Arg Gln
4300 4305 4310
ccc aac ggc acc tgc gag ccc aac ccg cgc ctt aac ttc tac ccg 23924
Pro Asn Gly Thr Cys Glu Pro Asn Pro Arg Leu Asn Phe Tyr Pro
4315 4320 4325
gtc ttc gcg gtg ccc gag gcc ctg gcc acc tac cac atc ttt ttc 23969
Val Phe Ala Val Pro Glu Ala Leu Ala Thr Tyr His Ile Phe Phe
4330 4335 4340
aag aac caa aag atc ccc gtc tcc tgc cgc gcc aac cgc acc cgc 24014
Lys Asn Gln Lys Ile Pro Val Ser Cys Arg Ala Asn Arg Thr Arg
4345 4350 4355
gcc gac gcc ctt ttc aac ctg ggc ccc ggc gcc cgc cta cct gat 24059
Ala Asp Ala Leu Phe Asn Leu Gly Pro Gly Ala Arg Leu Pro Asp
4360 4365 4370
atc gcc tcc ttg gaa gag gtt ccc aag atc ttc gag ggt ctg ggc 24104
Ile Ala Ser Leu Glu Glu Val Pro Lys Ile Phe Glu Gly Leu Gly
4375 4380 4385
agc gac gag act cgg gcc gcg aac gct ctg caa gga aat gaa gag 24149
Ser Asp Glu Thr Arg Ala Ala Asn Ala Leu Gln Gly Asn Glu Glu
4390 4395 4400
cat gag cac cac agc gcc ctg gtc gag ttg gaa ggc gac aac gcg 24194
His Glu His His Ser Ala Leu Val Glu Leu Glu Gly Asp Asn Ala
4405 4410 4415
cgc ctg gcg gtc ctc aag cgc acg gtc gag ctg acc cac ttc gcc 24239
Arg Leu Ala Val Leu Lys Arg Thr Val Glu Leu Thr His Phe Ala
4420 4425 4430
tac ccg gcg ctc aac ctg ccc ccc aag gtc atg aac gcc gtc atg 24284
Tyr Pro Ala Leu Asn Leu Pro Pro Lys Val Met Asn Ala Val Met
4435 4440 4445
gac cag gtg ctc atc aag cgc gcc tcg ccc ctc tcg gag gag gag 24329
Asp Gln Val Leu Ile Lys Arg Ala Ser Pro Leu Ser Glu Glu Glu
4450 4455 4460
atg cag gac ccc gag agc tcg gac gag ggc aag ccc gtg gtc agc 24374
Met Gln Asp Pro Glu Ser Ser Asp Glu Gly Lys Pro Val Val Ser
4465 4470 4475
gac gag cag ctg gcg cgc tgg ctg gga gcg agt agc acc ccc cag 24419
Asp Glu Gln Leu Ala Arg Trp Leu Gly Ala Ser Ser Thr Pro Gln
4480 4485 4490
agc ctg gaa gag cgg cgc aag ctc atg atg gcc gtg gtc ctg gtg 24464
Ser Leu Glu Glu Arg Arg Lys Leu Met Met Ala Val Val Leu Val
4495 4500 4505
acc gtg gag ctg gag tgt ctg cgc cgc ttc ttt gcc gac gcg gag 24509
Thr Val Glu Leu Glu Cys Leu Arg Arg Phe Phe Ala Asp Ala Glu
4510 4515 4520
acc ctg cgc aag gtc gag gag aac ctg cac tac ctc ttc agg cac 24554
Thr Leu Arg Lys Val Glu Glu Asn Leu His Tyr Leu Phe Arg His
4525 4530 4535
ggg ttc gtg cgc cag gcc tgc aag atc tcc aac gtg gag ctg acc 24599
Gly Phe Val Arg Gln Ala Cys Lys Ile Ser Asn Val Glu Leu Thr
4540 4545 4550
aac ctg gtc tcc tac atg ggc atc ctg cac gag aac cgc ctg ggg 24644
Asn Leu Val Ser Tyr Met Gly Ile Leu His Glu Asn Arg Leu Gly
4555 4560 4565
caa aac gtg ctg cac acc acc ctg cgc ggg gag gcc cgc cgc gac 24689
Gln Asn Val Leu His Thr Thr Leu Arg Gly Glu Ala Arg Arg Asp
4570 4575 4580
tac atc cgc gac tgc gtc tac ctg tac ctc tgc cac acc tgg cag 24734
Tyr Ile Arg Asp Cys Val Tyr Leu Tyr Leu Cys His Thr Trp Gln
4585 4590 4595
acg ggc atg ggc gtg tgg cag cag tgc ctg gag gag cag aac ctg 24779
Thr Gly Met Gly Val Trp Gln Gln Cys Leu Glu Glu Gln Asn Leu
4600 4605 4610
aaa gag ctc tgc aag ctc ctg cag aag aac ctg aag gcc ctg tgg 24824
Lys Glu Leu Cys Lys Leu Leu Gln Lys Asn Leu Lys Ala Leu Trp
4615 4620 4625
acc ggg ttc gac gag cgc acc acc gcc gcg gac ctg gcc gac ctc 24869
Thr Gly Phe Asp Glu Arg Thr Thr Ala Ala Asp Leu Ala Asp Leu
4630 4635 4640
atc ttc ccc gag cgc ctg cgg cta acg ctg cgc aac ggg ctg ccc 24914
Ile Phe Pro Glu Arg Leu Arg Leu Thr Leu Arg Asn Gly Leu Pro
4645 4650 4655
gac ttt atg agc caa agc atg ttg caa aac ttt cgc tct ttc atc 24959
Asp Phe Met Ser Gln Ser Met Leu Gln Asn Phe Arg Ser Phe Ile
4660 4665 4670
ctc gaa cgc tcc ggg atc ctg ccc gcc acc tgc tcc gca ctg ccc 25004
Leu Glu Arg Ser Gly Ile Leu Pro Ala Thr Cys Ser Ala Leu Pro
4675 4680 4685
tcg gac ttc gtg ccg ctg acc ttc cgc gag tgc ccc ccg ccg ctc 25049
Ser Asp Phe Val Pro Leu Thr Phe Arg Glu Cys Pro Pro Pro Leu
4690 4695 4700
tgg agc cac tgc tac ttg ctg cgc ctg gct aac tac ctg gcc tac 25094
Trp Ser His Cys Tyr Leu Leu Arg Leu Ala Asn Tyr Leu Ala Tyr
4705 4710 4715
cac tcg gac gtg atc gag gac gtc agc ggc gag ggt ctg ctc gag 25139
His Ser Asp Val Ile Glu Asp Val Ser Gly Glu Gly Leu Leu Glu
4720 4725 4730
tgc cac tgc cgc tgc aac ctc tgc acg ccg cac cgc tcc ctg gcc 25184
Cys His Cys Arg Cys Asn Leu Cys Thr Pro His Arg Ser Leu Ala
4735 4740 4745
tgc aac ccc cag ctg ctg agc gag acc cag atc atc ggc acc ttc 25229
Cys Asn Pro Gln Leu Leu Ser Glu Thr Gln Ile Ile Gly Thr Phe
4750 4755 4760
gag ttg caa ggc ccc ggc gac agc gag ggc aag ggg ggt ctg aaa 25274
Glu Leu Gln Gly Pro Gly Asp Ser Glu Gly Lys Gly Gly Leu Lys
4765 4770 4775
ctc acc cct ggg ctg tgg acc tcg gcc tac ttg cgc aag ttc gtg 25319
Leu Thr Pro Gly Leu Trp Thr Ser Ala Tyr Leu Arg Lys Phe Val
4780 4785 4790
ccc gag gac tac cat ccc ttc gag atc agg ttc tac gag gac caa 25364
Pro Glu Asp Tyr His Pro Phe Glu Ile Arg Phe Tyr Glu Asp Gln
4795 4800 4805
tcc cag ccg ccc aag gcc gag ctg tcg gcc tgc gtc atc acc cag 25409
Ser Gln Pro Pro Lys Ala Glu Leu Ser Ala Cys Val Ile Thr Gln
4810 4815 4820
ggg gcc atc ctg gcc caa ttg caa gcc atc cag aaa tcc cgc caa 25454
Gly Ala Ile Leu Ala Gln Leu Gln Ala Ile Gln Lys Ser Arg Gln
4825 4830 4835
gaa ttt ctg ctg aaa aag ggc cac ggg gtc tac ctg gac ccc cag 25499
Glu Phe Leu Leu Lys Lys Gly His Gly Val Tyr Leu Asp Pro Gln
4840 4845 4850
acc gga gag gag ctc aac ccc agc ttc ccc cag gat gcc ccg agg 25544
Thr Gly Glu Glu Leu Asn Pro Ser Phe Pro Gln Asp Ala Pro Arg
4855 4860 4865
aag cag caa gaa gct gaa agt gga gct gcc gcc gga gga ttt gga 25589
Lys Gln Gln Glu Ala Glu Ser Gly Ala Ala Ala Gly Gly Phe Gly
4870 4875 4880
gga aga ctg gga gag cag tca ggc aga gga gga gat gga aga ctg 25634
Gly Arg Leu Gly Glu Gln Ser Gly Arg Gly Gly Asp Gly Arg Leu
4885 4890 4895
gga cag cac tca ggc aga gga gga cag cct gca aga cag tct gga 25679
Gly Gln His Ser Gly Arg Gly Gly Gln Pro Ala Arg Gln Ser Gly
4900 4905 4910
gga gga aga cga ggt gga gga gga ggc aga gga aga agc agc cgc 25724
Gly Gly Arg Arg Gly Gly Gly Gly Gly Arg Gly Arg Ser Ser Arg
4915 4920 4925
cgc cag acc gtc gtc ctc ggc gga gga gaa agc aag cag cac gga 25769
Arg Gln Thr Val Val Leu Gly Gly Gly Glu Ser Lys Gln His Gly
4930 4935 4940
tac cat ctc cgc tcc ggg tcg ggg tcg cgg cgg tcg ggc cca cag 25814
Tyr His Leu Arg Ser Gly Ser Gly Ser Arg Arg Ser Gly Pro Gln
4945 4950 4955
tagatgggac gagaccgggc gcttcccgaa ccccaccacc cagaccggta agaaggagcg 25874
gcagggatac aagtcctggc gggggcacaa aaacgccatc gtctcctgct tgcaagcctg 25934
cgggggcaac atctccttca cccggcgcta cctgctcttc caccgcgggg tgaacttccc 25994
ccgcaacatc ttgcattact accgtcacct ccacagcccc tactactgtt tccaagaaga 26054
ggcagaaacc cagcagcagc agaaaaccag cggcagcagc agcagcagct agaaaatcca 26114
cagcggcggc ggcggcaggt ggactgagga tcgcggcgaa cgagccggcg cagacccggg 26174
agctgaggaa ccggatcttt cccaccctct atgccatctt ccagcagagt cgggggcagg 26234
agcaggaact gaaagtcaag aaccgttctc tgcgctcgct cacccgcagt tgtctgtatc 26294
acaagagcga agaccaactt cagcgcactc tcgaggacgc cgaggctctc ttcaacaagt 26354
actgcgcgct cactcttaaa gagtagcccg cgcccgccca cacacggaaa aaggcgggaa 26414
ttacgtcacc acctgcgccc ttcgcccgac catcatc atg agc aaa gag att ccc 26469
Met Ser Lys Glu Ile Pro
4960
acg cct tac atg tgg agc tac cag ccc cag atg ggc ctg gcc gcc 26514
Thr Pro Tyr Met Trp Ser Tyr Gln Pro Gln Met Gly Leu Ala Ala
4965 4970 4975
ggc gcc gcc cag gac tac tcc acc cgc atg aac tgg ctc agt gcc 26559
Gly Ala Ala Gln Asp Tyr Ser Thr Arg Met Asn Trp Leu Ser Ala
4980 4985 4990
ggg ccc gcg atg atc tca cgg gtg aat gac atc cgc gcc cac cga 26604
Gly Pro Ala Met Ile Ser Arg Val Asn Asp Ile Arg Ala His Arg
4995 5000 5005
aac cag ata ctc cta gaa cag tca gcg atc acc gcc acg ccc cgc 26649
Asn Gln Ile Leu Leu Glu Gln Ser Ala Ile Thr Ala Thr Pro Arg
5010 5015 5020
cat cac ctt aat ccg cgt aat tgg ccc gcc gcc ctg gtg tac cag 26694
His His Leu Asn Pro Arg Asn Trp Pro Ala Ala Leu Val Tyr Gln
5025 5030 5035
gaa att ccc cag ccc acg acc gta cta ctt ccg cga gac gcc cag 26739
Glu Ile Pro Gln Pro Thr Thr Val Leu Leu Pro Arg Asp Ala Gln
5040 5045 5050
gcc gaa gtc cag ctg act aac tca ggt gtc cag ctg gcc ggc ggc 26784
Ala Glu Val Gln Leu Thr Asn Ser Gly Val Gln Leu Ala Gly Gly
5055 5060 5065
gcc gcc ctg tgt cgt cac cgc ccc gct cag ggt ata aag cgg ctg 26829
Ala Ala Leu Cys Arg His Arg Pro Ala Gln Gly Ile Lys Arg Leu
5070 5075 5080
gtg atc cga ggc aga ggc aca cag ctc aac gac gag gtg gtg agc 26874
Val Ile Arg Gly Arg Gly Thr Gln Leu Asn Asp Glu Val Val Ser
5085 5090 5095
tct tcg ctg ggt ctg cga cct gac gga gtc ttc caa ctc gcc gga 26919
Ser Ser Leu Gly Leu Arg Pro Asp Gly Val Phe Gln Leu Ala Gly
5100 5105 5110
tcg ggg aga tct tcc ttc acg cct cgt cag gcc gtc ctg act ttg 26964
Ser Gly Arg Ser Ser Phe Thr Pro Arg Gln Ala Val Leu Thr Leu
5115 5120 5125
gag agt tcg tcc tcg cag ccc cgc tcg ggc ggc atc ggc act ctc 27009
Glu Ser Ser Ser Ser Gln Pro Arg Ser Gly Gly Ile Gly Thr Leu
5130 5135 5140
cag ttc gtg gag gag ttc act ccc tcg gtc tac ttc aac ccc ttc 27054
Gln Phe Val Glu Glu Phe Thr Pro Ser Val Tyr Phe Asn Pro Phe
5145 5150 5155
tcc ggc tcc ccc ggc cac tac ccg gac gag ttc atc ccg aac ttc 27099
Ser Gly Ser Pro Gly His Tyr Pro Asp Glu Phe Ile Pro Asn Phe
5160 5165 5170
gac gcc atc agc gag tcg gtg gac ggc tac gat tga atg tcc cat 27144
Asp Ala Ile Ser Glu Ser Val Asp Gly Tyr Asp Met Ser His
5175 5180 5185
ggt ggc gcg gct gac cta gct cgg ctt cga cac ctg gac cac tgc 27189
Gly Gly Ala Ala Asp Leu Ala Arg Leu Arg His Leu Asp His Cys
5190 5195 5200
cgc cgc ttc cgc tgc ttc gct cgg gat ctc gcc gag ttt gcc tac 27234
Arg Arg Phe Arg Cys Phe Ala Arg Asp Leu Ala Glu Phe Ala Tyr
5205 5210 5215
ttt gag ctg ccc gag gag cac cct cag ggc ccg gcc cac gga gtg 27279
Phe Glu Leu Pro Glu Glu His Pro Gln Gly Pro Ala His Gly Val
5220 5225 5230
cgg atc gtc gtc gaa ggg ggc ctc gac tcc cac ctg ctt cgg atc 27324
Arg Ile Val Val Glu Gly Gly Leu Asp Ser His Leu Leu Arg Ile
5235 5240 5245
ttc agc cag cga ccg atc ctg gtc gag cgc gag caa gga cag acc 27369
Phe Ser Gln Arg Pro Ile Leu Val Glu Arg Glu Gln Gly Gln Thr
5250 5255 5260
cgt ctg acc ctg tac tgc atc tgc aac cac ccc ggc ctg cat gaa 27414
Arg Leu Thr Leu Tyr Cys Ile Cys Asn His Pro Gly Leu His Glu
5265 5270 5275
agt ctt tgt tgt ctg ctg tgt act gag tat aat aaa agc tgagatcagc 27463
Ser Leu Cys Cys Leu Leu Cys Thr Glu Tyr Asn Lys Ser
5280 5285
gactactccg gactcgattg tggtgttcct gctatcaacc ggtccctgtt cttcaccggg 27523
aacgagaccg agctccagct ccagtgtaag ccccacaaga agtatctcac ctggctgttc 27583
cagggctccc cgatcgccgt tgtcaaccac tgcgacaacg acggagtcct gctgagcggc 27643
cccgccaacc ttactttttc cacccgcaga agcaagctcc agctcttcca acccttcctc 27703
cccgggacct atcagtgcct ctcgggaccc tgccatcaca ccttccacct gatcccgaat 27763
accacagcgg cgctccccgc tactaacaac caaactaccc accaacgcca ccgtcgcgac 27823
ctttcctctg aatctaatac taccacccac accggaggtg agctccgagg tcaaccaacc 27883
tctgggattt actacggccc ctgggaggtg gtggggttaa tagcgctagg cctagttgcg 27943
ggtgggcttt tggctctctg ctacctatac ctcccttgct gttcttactt agtggtgctg 28003
tgttgctggt ttaagaaatg gggaagatca ccctagtgag ctgcggtgcg ctggtggcgg 28063
tggtgctttc gattgtggga ctgggcggcg cggctgtagt gaaggagaag gccgatccct 28123
gcttgcattt caatcccgac aaatgccagc tgagttttca gcccgatggc aatcggtgcg 28183
cggtgctgat caagtgcgga tgggaatgcg agaacgtgag aatcgagtac aataacaaga 28243
ctcggaacaa tactctcgcg tccgtgtggc agcccgggga ccccgagtgg tacaccgtct 28303
ctgtccccgg tgctgacggc tccccgcgca ccgtgaacaa tactttcatt tttgcacaca 28363
tgtgcgacac ggtcatgtgg atgagcaagc agtacgatat gtggcccccc acgaaggaga 28423
acatcgtggt cttctccatc gcttacagcc tgtgcacggc gctaatcacc gctatcgtgt 28483
gcctgagcat tcacatgctc atcgctattc gccccagaaa taatgccgaa aaagagaaac 28543
agccataaca cgttttttca cacacctttt tcagacc atg gcc tct gtt aaa ttt 28598
Met Ala Ser Val Lys Phe
5290
ttt gct tta ttt gcc agt ctc att acc gtc att cat gga atg agt 28643
Phe Ala Leu Phe Ala Ser Leu Ile Thr Val Ile His Gly Met Ser
5295 5300 5305
aat gag aaa att act att tac act ggc act aat cac aca ttg aaa 28688
Asn Glu Lys Ile Thr Ile Tyr Thr Gly Thr Asn His Thr Leu Lys
5310 5315 5320
ggt cca gaa aaa gcc aca gaa gtt tca tgg tat tgt tat ttt aat 28733
Gly Pro Glu Lys Ala Thr Glu Val Ser Trp Tyr Cys Tyr Phe Asn
5325 5330 5335
gaa tca gat gta gct act gaa ctc tgt gga aac aac aac aaa aaa 28778
Glu Ser Asp Val Ala Thr Glu Leu Cys Gly Asn Asn Asn Lys Lys
5340 5345 5350
aat gag agc att act ctc atc aag ttt caa tgt gga tct gac tta 28823
Asn Glu Ser Ile Thr Leu Ile Lys Phe Gln Cys Gly Ser Asp Leu
5355 5360 5365
acc ctc att aac atc act aga gac tat gta ggt atg tat tat gga 28868
Thr Leu Ile Asn Ile Thr Arg Asp Tyr Val Gly Met Tyr Tyr Gly
5370 5375 5380
act aca gca ggc att tcg gac atg gaa ttt tat caa gtt tct gtg 28913
Thr Thr Ala Gly Ile Ser Asp Met Glu Phe Tyr Gln Val Ser Val
5385 5390 5395
tct gaa ccc acc acg cct aga atg acc aca acc aca aaa act aca 28958
Ser Glu Pro Thr Thr Pro Arg Met Thr Thr Thr Thr Lys Thr Thr
5400 5405 5410
cct act acc acc aca cag ctc act acc aat ggc ttt ttt gcc atg 29003
Pro Thr Thr Thr Thr Gln Leu Thr Thr Asn Gly Phe Phe Ala Met
5415 5420 5425
ctt caa gtg gct gaa aat agc acc agc att caa ccc acc cca ccc 29048
Leu Gln Val Ala Glu Asn Ser Thr Ser Ile Gln Pro Thr Pro Pro
5430 5435 5440
agt gag gaa att ccc aaa tcc atg att ggc att att gtt gct gta 29093
Ser Glu Glu Ile Pro Lys Ser Met Ile Gly Ile Ile Val Ala Val
5445 5450 5455
gtg gtg tgc atg ttg atc atc gcc ttg tgc atg gtg tac tat gcc 29138
Val Val Cys Met Leu Ile Ile Ala Leu Cys Met Val Tyr Tyr Ala
5460 5465 5470
ttc tgc tac aga aag cac aga ctg aac gac aag ctg gaa cac tta 29183
Phe Cys Tyr Arg Lys His Arg Leu Asn Asp Lys Leu Glu His Leu
5475 5480 5485
cta agt gtt gaa ttt taatttttta gaacc atg aag atc cta tgc ctt 29231
Leu Ser Val Glu Phe Met Lys Ile Leu Cys Leu
5490 5495 5500
tta gtt ttt tat atc att acc tct gct ctt tgt gaa tca gtg gat 29276
Leu Val Phe Tyr Ile Ile Thr Ser Ala Leu Cys Glu Ser Val Asp
5505 5510 5515
aaa gat gtt act att acc act ggt tct aat tat aca ctg aaa gga 29321
Lys Asp Val Thr Ile Thr Thr Gly Ser Asn Tyr Thr Leu Lys Gly
5520 5525 5530
cca ccc tca ggt atg ctt tcg tgg tat tgc tat ttt gga act gac 29366
Pro Pro Ser Gly Met Leu Ser Trp Tyr Cys Tyr Phe Gly Thr Asp
5535 5540 5545
act gat caa act gaa tta tgc aat ttt caa aaa ggc aaa acc tca 29411
Thr Asp Gln Thr Glu Leu Cys Asn Phe Gln Lys Gly Lys Thr Ser
5550 5555 5560
aac tct aaa atc tct aat tat caa tgc aat ggc act gat ctg ata 29456
Asn Ser Lys Ile Ser Asn Tyr Gln Cys Asn Gly Thr Asp Leu Ile
5565 5570 5575
cta ctc aat gtc acg aaa gca tat ggt ggc agt tat tca tgc cct 29501
Leu Leu Asn Val Thr Lys Ala Tyr Gly Gly Ser Tyr Ser Cys Pro
5580 5585 5590
gga caa aac act gaa gaa atg att ttt tac aaa gtg gaa gtg gtt 29546
Gly Gln Asn Thr Glu Glu Met Ile Phe Tyr Lys Val Glu Val Val
5595 5600 5605
gat ccc act act cca cct cca ccc gcc aca act act cac acc aca 29591
Asp Pro Thr Thr Pro Pro Pro Pro Ala Thr Thr Thr His Thr Thr
5610 5615 5620
cac aca gaa caa agc aca gca gag gca gca aag tta gcc ttg cag 29636
His Thr Glu Gln Ser Thr Ala Glu Ala Ala Lys Leu Ala Leu Gln
5625 5630 5635
gtc caa gac agt tca ttt gtt ggc att acc cct aca cct gat cag 29681
Val Gln Asp Ser Ser Phe Val Gly Ile Thr Pro Thr Pro Asp Gln
5640 5645 5650
cgg tgt ccg ggg ctg ctc gtc agc ggc att gtc ggt gtg ctt tcg 29726
Arg Cys Pro Gly Leu Leu Val Ser Gly Ile Val Gly Val Leu Ser
5655 5660 5665
gga tta gca gtc ata atc atc tgc atg ttc att ttt gct tgc tgc 29771
Gly Leu Ala Val Ile Ile Ile Cys Met Phe Ile Phe Ala Cys Cys
5670 5675 5680
tat aga agg ctt tac cga caa aaa tca gac cca ctg ctg aac ctc 29816
Tyr Arg Arg Leu Tyr Arg Gln Lys Ser Asp Pro Leu Leu Asn Leu
5685 5690 5695
tat gtt taattttttc cagagcc atg aag gca gtt agc gct cta gtt ttt 29866
Tyr Val Met Lys Ala Val Ser Ala Leu Val Phe
5700 5705
tgt tct ttg att ggc att gtt ttt aat agt aaa att acc aga gtt 29911
Cys Ser Leu Ile Gly Ile Val Phe Asn Ser Lys Ile Thr Arg Val
5710 5715 5720
agc ttt att aaa cat gtt aat gta act gaa gga gat aac atc aca 29956
Ser Phe Ile Lys His Val Asn Val Thr Glu Gly Asp Asn Ile Thr
5725 5730 5735
cta gca ggt gta gaa ggt gct caa aac acc acc tgg aca aaa tac 30001
Leu Ala Gly Val Glu Gly Ala Gln Asn Thr Thr Trp Thr Lys Tyr
5740 5745 5750
cat cta gga tgg aga gat att tgc acc tgg aat gta act tat tat 30046
His Leu Gly Trp Arg Asp Ile Cys Thr Trp Asn Val Thr Tyr Tyr
5755 5760 5765
tgc ata gga gtt aat ctt acc att gtt aac gct aac caa tct cag 30091
Cys Ile Gly Val Asn Leu Thr Ile Val Asn Ala Asn Gln Ser Gln
5770 5775 5780
aat ggg tta att aaa gga cag agt gtt agt gtg acc agt gat ggg 30136
Asn Gly Leu Ile Lys Gly Gln Ser Val Ser Val Thr Ser Asp Gly
5785 5790 5795
tac tat acc cag cat agt ttt aac tac aac att act gtc ata cca 30181
Tyr Tyr Thr Gln His Ser Phe Asn Tyr Asn Ile Thr Val Ile Pro
5800 5805 5810
ctg cct acg cct agc cca cct agc act acc gca cag aca acc aca 30226
Leu Pro Thr Pro Ser Pro Pro Ser Thr Thr Ala Gln Thr Thr Thr
5815 5820 5825
tac agt aca tca aat cag cct acc acc act aca gca gca gag gtt 30271
Tyr Ser Thr Ser Asn Gln Pro Thr Thr Thr Thr Ala Ala Glu Val
5830 5835 5840
gcc agc tcg tct ggg gtc cga gtg gca ttt ttg atg ttg gcc cca 30316
Ala Ser Ser Ser Gly Val Arg Val Ala Phe Leu Met Leu Ala Pro
5845 5850 5855
tct agc agt ccc act gct agt acc aat gag cag act act gaa ttt 30361
Ser Ser Ser Pro Thr Ala Ser Thr Asn Glu Gln Thr Thr Glu Phe
5860 5865 5870
ttg tcc act gtc gag agc cac acc aca gct acc tcc agt gcc ttc 30406
Leu Ser Thr Val Glu Ser His Thr Thr Ala Thr Ser Ser Ala Phe
5875 5880 5885
tct agc acc gcc aat ctc tcc tcg ctt tcc tct aca cca atc agt 30451
Ser Ser Thr Ala Asn Leu Ser Ser Leu Ser Ser Thr Pro Ile Ser
5890 5895 5900
ccc gct act act cct agc ccc gct cct ctt ccc act ccc ctg aag 30496
Pro Ala Thr Thr Pro Ser Pro Ala Pro Leu Pro Thr Pro Leu Lys
5905 5910 5915
caa aca gac ggc ggc atg caa tgg cag atc acc ctg ctc att gtg 30541
Gln Thr Asp Gly Gly Met Gln Trp Gln Ile Thr Leu Leu Ile Val
5920 5925 5930
atc ggg ttg gtc atc ctg gcc gtg ttg cta tac tac atc ttc tgc 30586
Ile Gly Leu Val Ile Leu Ala Val Leu Leu Tyr Tyr Ile Phe Cys
5935 5940 5945
cgc cgc att ccc aac gcg cac cgc aag ccg gcc tac aag ccc atc 30631
Arg Arg Ile Pro Asn Ala His Arg Lys Pro Ala Tyr Lys Pro Ile
5950 5955 5960
gtt atc ggg cag ccg gag ccg ctt cag gtg gaa ggg ggt cta agg 30676
Val Ile Gly Gln Pro Glu Pro Leu Gln Val Glu Gly Gly Leu Arg
5965 5970 5975
aat ctt ctc ttc tct ttt aca gta tgg tgattgaact atg att cct aga 30725
Asn Leu Leu Phe Ser Phe Thr Val Trp Met Ile Pro Arg
5980 5985
caa ttc ttg atc act att ctt atc tgc ctc ctc caa gtc tgt gcc 30770
Gln Phe Leu Ile Thr Ile Leu Ile Cys Leu Leu Gln Val Cys Ala
5990 5995 6000
acc ctc gct ctg gtg gcc aac gcc agt cca gac tgt att gga ccc 30815
Thr Leu Ala Leu Val Ala Asn Ala Ser Pro Asp Cys Ile Gly Pro
6005 6010 6015
ttc gcc tcc tac gtg ctc ttt gcc ttc gtc acc tgc atc tgc tgc 30860
Phe Ala Ser Tyr Val Leu Phe Ala Phe Val Thr Cys Ile Cys Cys
6020 6025 6030
tgt agc ata gtc tgc ctg ctt atc acc ttc ttc cag ttc att gac 30905
Cys Ser Ile Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Ile Asp
6035 6040 6045
tgg atc ttt gtg cgc atc gcc tac ctg cgc cac cac ccc cag tac 30950
Trp Ile Phe Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr
6050 6055 6060
cgc gac cag cga gtg gcg cag ctg ctc agg ctc ctc tgataagcat 30996
Arg Asp Gln Arg Val Ala Gln Leu Leu Arg Leu Leu
6065 6070 6075
gcgggctctg ctacttctcg cgcttctgct gttagtgctc ccccgtcccg ttgacccccg 31056
gccccccact cagtcccccg aggaggtccg caaatgcaaa ttccaagaac cctggaaatt 31116
cctcaaatgc taccgccaaa aatcagacat gcatcccagc tggatcatga tcattgggat 31176
cgtgaacatt ctggcctgca ccctcatctc ctttgtgatt tacccctgct ttgactttgg 31236
ttggaactcg ccagaggcgc tctatctccc gcctgaacct gacacaccac cacagcaacc 31296
tcaggcacac gcactaccac caccacagcc taggccacaa tacatgccca tattagacta 31356
tgaggccgag ccacagcgac ccatgctccc cgctattagt tacttcaatc taaccggcgg 31416
ag atg act gac cca ctg gcc aac aac aac gtc aac gac ctt ctc ctg 31463
Met Thr Asp Pro Leu Ala Asn Asn Asn Val Asn Asp Leu Leu Leu
6080 6085 6090
gac atg gac ggc cgc gcc tcg gag cag cga ctc gcc caa ctt cgc 31508
Asp Met Asp Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln Leu Arg
6095 6100 6105
att cgc cag cag cag gag aga gcc gtc aag gag ctg cag gac ggc 31553
Ile Arg Gln Gln Gln Glu Arg Ala Val Lys Glu Leu Gln Asp Gly
6110 6115 6120
ata gcc atc cac cag tgc aag aaa ggc atc ttc tgc ctg gtg aaa 31598
Ile Ala Ile His Gln Cys Lys Lys Gly Ile Phe Cys Leu Val Lys
6125 6130 6135
cag gcc aag atc tcc tac gag gtc acc cag acc gac cat cgc ctc 31643
Gln Ala Lys Ile Ser Tyr Glu Val Thr Gln Thr Asp His Arg Leu
6140 6145 6150
tcc tac gag ctc ctg cag cag cgc cag aag ttc acc tgc ctg gtc 31688
Ser Tyr Glu Leu Leu Gln Gln Arg Gln Lys Phe Thr Cys Leu Val
6155 6160 6165
gga gtc aac ccc atc gtc atc acc cag cag tcg ggc gat acc aag 31733
Gly Val Asn Pro Ile Val Ile Thr Gln Gln Ser Gly Asp Thr Lys
6170 6175 6180
ggg tgc atc cac tgc tcc tgc gac tcc ccc gac tgc gtc cac act 31778
Gly Cys Ile His Cys Ser Cys Asp Ser Pro Asp Cys Val His Thr
6185 6190 6195
ctg atc aag acc ctc tgc ggc ctc cgc gac ctc ctc ccc atg aac 31823
Leu Ile Lys Thr Leu Cys Gly Leu Arg Asp Leu Leu Pro Met Asn
6200 6205 6210
taatcaccca cttatccagt gaaataaaaa aataatcatt tgatttgaaa taaagataca 31883
atcatattga tgatttgagt ttaacaaaaa taaagaatca cttacttgaa atctgatacc 31943
aggtctctgt ccatattttc tgccaacacc acctcactcc cctcttccca gctctggtac 32003
tgcaggcccc ggcgggctgc aaacttcctc cacacgctga aggggatgtc aaattcctcc 32063
tgcccctcaa tcttcatttt atcttctatc ag atg tcc aaa aag cgc gtc cgg 32116
Met Ser Lys Lys Arg Val Arg
6215
gtg gat gat gac ttc gac ccc gtc tac ccc tac gat gca gac aac 32161
Val Asp Asp Asp Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn
6220 6225 6230
gca ccg acc gtg ccc ttc atc aac ccc ccc ttc gtc tct tca gat 32206
Ala Pro Thr Val Pro Phe Ile Asn Pro Pro Phe Val Ser Ser Asp
6235 6240 6245
gga ttc caa gag aag ccc ctg ggg gtg ttg tcc ctg cga ctg gcc 32251
Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala
6250 6255 6260
gac ccc gtc acc acc aag aac ggg gaa atc acc ctc aag ctg gga 32296
Asp Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu Lys Leu Gly
6265 6270 6275
gag ggg gtg gac ctc gac tcc tcg gga aaa ctc atc tcc aac acg 32341
Glu Gly Val Asp Leu Asp Ser Ser Gly Lys Leu Ile Ser Asn Thr
6280 6285 6290
gcc gcc aag gcc gct gcc cct ctc agt ttt tcc aac aac acc att 32386
Ala Ala Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr Ile
6295 6300 6305
tcc ctt aac atg gat cac ccc ttt tac act aaa gat gga aaa tta 32431
Ser Leu Asn Met Asp His Pro Phe Tyr Thr Lys Asp Gly Lys Leu
6310 6315 6320
gcc tta caa gtt tct cca cca tta aat ata ctg aga aca agc att 32476
Ala Leu Gln Val Ser Pro Pro Leu Asn Ile Leu Arg Thr Ser Ile
6325 6330 6335
cta aac aca cta gct tta ggt ttt gga tca ggt tta gga ctc cgt 32521
Leu Asn Thr Leu Ala Leu Gly Phe Gly Ser Gly Leu Gly Leu Arg
6340 6345 6350
ggc tct gcc ttg gca gta cag tta gtc tct cca ctt aca ttt gat 32566
Gly Ser Ala Leu Ala Val Gln Leu Val Ser Pro Leu Thr Phe Asp
6355 6360 6365
act gat gga aac ata aag ctt acc tta gac aga ggt ttg cat gtt 32611
Thr Asp Gly Asn Ile Lys Leu Thr Leu Asp Arg Gly Leu His Val
6370 6375 6380
aca aca gga gat gca att gaa agc aac ata agc tgg gct aaa ggt 32656
Thr Thr Gly Asp Ala Ile Glu Ser Asn Ile Ser Trp Ala Lys Gly
6385 6390 6395
tta aaa ttt gaa gat gga gcc ata gca acc aac att gga aat ggg 32701
Leu Lys Phe Glu Asp Gly Ala Ile Ala Thr Asn Ile Gly Asn Gly
6400 6405 6410
tta gag ttt gga agc agt agt aca gaa aca ggt gtc gat gat gct 32746
Leu Glu Phe Gly Ser Ser Ser Thr Glu Thr Gly Val Asp Asp Ala
6415 6420 6425
tac cca atc caa gtt aaa ctt gga tct ggc ctt agc ttt gac agt 32791
Tyr Pro Ile Gln Val Lys Leu Gly Ser Gly Leu Ser Phe Asp Ser
6430 6435 6440
aca gga gcc ata atg gct ggt aac aaa gaa gac gat aaa ctc act 32836
Thr Gly Ala Ile Met Ala Gly Asn Lys Glu Asp Asp Lys Leu Thr
6445 6450 6455
ttg tgg aca aca cct gat cca tca cca aac tgt caa ata ctc gca 32881
Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gln Ile Leu Ala
6460 6465 6470
gaa aat gat gca aaa cta aca ctt tgc ttg act aaa tgt ggt agt 32926
Glu Asn Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser
6475 6480 6485
caa ata ctg gcc act gtg tca gtc tta gtt gta gga agt gga aac 32971
Gln Ile Leu Ala Thr Val Ser Val Leu Val Val Gly Ser Gly Asn
6490 6495 6500
cta aac ccc att act ggc acc gta agc agt gct cag gtg ttt cta 33016
Leu Asn Pro Ile Thr Gly Thr Val Ser Ser Ala Gln Val Phe Leu
6505 6510 6515
cgt ttt gat gca aac ggt gtt ctt tta aca gaa cat tct aca cta 33061
Arg Phe Asp Ala Asn Gly Val Leu Leu Thr Glu His Ser Thr Leu
6520 6525 6530
aaa aaa tac tgg ggg tat agg cag gga gat agc ata gat ggc act 33106
Lys Lys Tyr Trp Gly Tyr Arg Gln Gly Asp Ser Ile Asp Gly Thr
6535 6540 6545
cca tat gcc aat gct gta gga ttc atg ccc aat tta aaa gct tat 33151
Pro Tyr Ala Asn Ala Val Gly Phe Met Pro Asn Leu Lys Ala Tyr
6550 6555 6560
cca aag tca caa agt tct act act aaa aat aat ata gta ggg caa 33196
Pro Lys Ser Gln Ser Ser Thr Thr Lys Asn Asn Ile Val Gly Gln
6565 6570 6575
gta tac atg aat gga gat gtt tca aaa cct atg ctt ctc act ata 33241
Val Tyr Met Asn Gly Asp Val Ser Lys Pro Met Leu Leu Thr Ile
6580 6585 6590
acc ctc aat ggt act gat gac agc aac agt aca tat tca atg tca 33286
Thr Leu Asn Gly Thr Asp Asp Ser Asn Ser Thr Tyr Ser Met Ser
6595 6600 6605
ttt tca tac acc tgg act aat gga agc tat gtt gga gca aca ttt 33331
Phe Ser Tyr Thr Trp Thr Asn Gly Ser Tyr Val Gly Ala Thr Phe
6610 6615 6620
gga gct aac tct tat acc ttc tcc tac atc gcc caa gaa tgaatactgt 33380
Gly Ala Asn Ser Tyr Thr Phe Ser Tyr Ile Ala Gln Glu
6625 6630 6635
atcccaccct gcatgcccaa ccctccccca cctctgtcta tatggaaaac tctgaaacac 33440
aaaataaaat aaagttcaag tgttttattg attcaacagt tttacaggat tcgagcagtt 33500
atttttcctc caccctccca ggacatggaa tacaccaccc tctccccccg cacagccttg 33560
aacatctgaa tgccattggt gatggacatg cttttggtct ccacgttcca cacagtttca 33620
gagcgagcca gtctcgggtc ggtcagggag atgaaaccct ccgggcactc ccgcatctgc 33680
acctcacagc tcaacagctg aggattgtcc tcggtggtcg ggatcacggt tatctggaag 33740
aagcagaaga gcggcggtgg gaatcatagt ccgcgaacgg gatcggccgg tggtgtcgca 33800
tcaggccccg cagcagtcgc tgccgccgcc gctccgtcaa gctgctgctc agggggtccg 33860
ggtccaggga ctccctcagc atgatgccca cggccctcag catcagtcgt ctggtgcggc 33920
gggcgcagca gcgcatgcgg atctcgctca cgtcgctgca gtacgtgcaa cacaggacca 33980
ccaggttgtt caacagtcca tagttcaaca cgctccagcc gaaactcatc gcgggaagga 34040
tgctacccac gtggccgtcg taccagatcc tcaggtaaat caagtggcgc cccctccaga 34100
acacgctgcc catgtacatg atctccttgg gcatgtggcg gttcaccacc tcccggtacc 34160
acatcaccct ctggttgaac atgcagcccc ggatgatcct gcggaaccac agggccagca 34220
ccgccccgcc cgccatgcag cgaagagacc ccgggtcccg gcaatggcaa tggaggaccc 34280
accgctcgta cccgtggatc atctgggagc tgaacaagtc tatgttggca cagcacaggc 34340
acacgctcat gcatctcttc agcactctca gctcctcggg ggtcaaaacc atatcccagg 34400
gcacgggaaa ctcttgcagg acagcgaagc ccgcagaaca gggcaatcct cgcacataac 34460
ttacattgtg catggacagg gtatcgcaat caggcagcac cgggtgatcc tccaccagag 34520
aagcgcgggt ctcggtctcc tcacagcgtg gtaagggggc cggccgatac gggtgatggc 34580
gggacgcggc tgatcgtgtt cgcgaccgtg tcatgatgca gttgctttcg gacattttcg 34640
tacttgctga agcagaacct ggtccgggcg ctgcacaccg atcgccggcg gcggtctcgg 34700
cgcttggaac gctcggtgtt gaagttgtaa aacagccact ctctcagacc gtgcagcaga 34760
tctagggcct caggagtgat gaagatccca tcatgcctga tggctctgat cacatcgacc 34820
accgtggaat gggccagacc cagccagatg atgcaatttt gttgggtttc ggtgacggcg 34880
ggggagggaa gaacaggaag aaccatgatt aacttttaat ccaaacggtc tcggagcact 34940
tcaaaatgaa ggtcgcggag atggcacctc tcgcccccgc tgtgttggtg gaaaataaca 35000
gccaggtcaa aggtgatacg gttctcgaga tgttccacgg tggcttccag caaagcctcc 35060
acgcgcacat ccagaaacaa gacaatagcg aaagcgggag ggttctctaa ttcctcaatc 35120
atcatgttac actcctgcac catccccaga taattttcat ttttccagcc ttgaatgatt 35180
cgaactagtt cctgaggtaa atccaagcca gccatgataa agagctcgcg cagagcgccc 35240
tccaccggca ttcttaagca caccctcata attccaagat attctgctcc tggttcacct 35300
gcagcagatt gacaagcggg atatcaaaat ctctgccgcg atccctgagc tcctccctca 35360
gcaataactg taagtactct ttcatatcct ctccgaaatt tttagccata ggacccccag 35420
gaataagaga agggcaagcc acattacaga taaaccgaag tcccccccag tgagcattgc 35480
caaatgtaag attgaaataa gcatgctggc tagacccggt gatatcttcc agataactgg 35540
acagaaaatc gggcaagcaa tttttaagaa aatcaacaaa agaaaaatct tccaggtgca 35600
cgtttagggc ctcgggaaca acgatggagt aagtgcaagg ggtgcgttcc agcatggtta 35660
gttagctgat ctgtaaaaaa acaaaaaata aaacattaaa ccatgctagc ctggcgaaca 35720
ggtgggtaaa tcgttctctc cagcaccagg caggccacgg ggtctccggc gcgaccctcg 35780
taaaaattgt cgctatgatt gaaaaccatc acagagagac gttcccggtg gccggcgtga 35840
atgattcgag aagaagcata cacccccgga acattggagt ccgtgagtga aaaaaagcgg 35900
ccgaggaagc aatgaggcac tacaacgctc actctcaagt ccagcaaagc gatgccatgc 35960
ggatgaagca caaaattttc aggtgcgtaa aaaatgtaat tactcccctc ctgcacaggc 36020
agcgaagctc ccgatccctc cagatacaca tacaaagcct cagcgtccat agcttaccga 36080
gcggcagcag cagcggcaca caacaggcgc aagagtcaga gaaaagactg agctctaacc 36140
tgtccgcccg ctctctgctc aatatatagc cccagatcta cactgacgta aaggccaaag 36200
tctaaaaata cccgccaaat aatcacacac gcccagcaca cgcccagaaa ccggtgacac 36260
actcaaaaaa atacgcgcac ttcctcaaac gcccaaactg ccgtcatttc cgggttccca 36320
cgctacgtca tcaaaacacg actttcaaat tccgtcgacc gttaaaaacg tcacccgccc 36380
cgcccctaac ggtcgccgct cccgcagcca atcaccgccc cgcatcccca aattcaaata 36440
cctcatttgc atattaacgc gcaccaaaag tttaaggtat attatttgat gatg 36494
<210> 66
<211> 504
<212> PRT
<213> Simian adenovirus 38
<400> 66
Met Glu Ser Arg Asn Pro Phe Gln Gln Gly Leu Pro Ser Gly Leu Leu
1 5 10 15
Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn
20 25 30
Leu Arg Leu Leu Ala Ser Thr Ala Gly Arg His Ala Glu Asp Pro Glu
35 40 45
Ser Pro Val Thr Pro Gly Thr Pro Thr Pro Pro Ala Ala Ala Ala Gly
50 55 60
Ala Ala Ala Arg Gly Gly Gly Gly Pro Arg Arg Glu Pro Glu Ser Arg
65 70 75 80
Ser Gly Pro Ser Gly Gly Gly Gly Gly Gly Val Ala Asp Leu Phe Pro
85 90 95
Glu Leu Arg Arg Val Leu Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly
100 105 110
Ile Lys Arg Glu Arg His Glu Glu Thr Ser His Arg Thr Glu Leu Thr
115 120 125
Val Ser Leu Met Ser Arg Arg Arg Pro Glu Ser Val Trp Trp His Glu
130 135 140
Val Gln Ser Gln Gly Ile Asp Glu Val Ser Val Met His Glu Lys Tyr
145 150 155 160
Ser Leu Glu Gln Val Lys Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp
165 170 175
Glu Val Ala Ile Arg Asn Tyr Ala Lys Leu Ala Leu Lys Pro Asp Lys
180 185 190
Lys Tyr Lys Ile Thr Lys Leu Ile Asn Ile Arg Asn Ser Cys Tyr Ile
195 200 205
Ser Gly Asn Gly Ala Glu Val Glu Ile Ser Thr Gln Glu Arg Ala Ala
210 215 220
Phe Arg Cys Cys Met Met Asn Met Tyr Pro Gly Val Val Gly Met Glu
225 230 235 240
Gly Val Thr Phe Met Asn Thr Arg Phe Arg Gly Asp Gly Tyr Asn Gly
245 250 255
Val Val Phe Met Ala Asn Thr Lys Leu Thr Val His Gly Cys Ser Phe
260 265 270
Phe Gly Phe Asn Asn Met Cys Ile Glu Ala Trp Gly Ser Val Ser Val
275 280 285
Arg Gly Cys Ser Phe Ser Ala Asn Trp Met Gly Val Val Gly Arg Thr
290 295 300
Lys Ser Val Val Ser Val Lys Lys Cys Leu Phe Glu Arg Cys His Leu
305 310 315 320
Gly Val Met Ser Glu Gly Glu Ala Lys Val Lys His Cys Ala Ser Thr
325 330 335
Glu Thr Gly Cys Phe Val Leu Ile Lys Gly Asn Ala Gln Val Lys His
340 345 350
Asn Met Ile Cys Gly Ala Ser Asp Glu Arg Gly Tyr Gln Met Leu Thr
355 360 365
Cys Ala Gly Gly Asn Ser His Met Leu Ala Thr Val His Val Ala Ser
370 375 380
His Pro Arg Lys Thr Trp Pro Glu Phe Glu His Asn Val Met Thr Arg
385 390 395 400
Cys Asn Val His Leu Gly Ser Arg Arg Gly Met Phe Met Pro Tyr Gln
405 410 415
Cys Asn Met Gln Phe Val Lys Val Leu Leu Glu Pro Asp Ala Met Ser
420 425 430
Arg Val Ser Leu Thr Gly Val Phe Asp Met Asn Val Glu Leu Trp Lys
435 440 445
Ile Leu Arg Tyr Asp Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys
450 455 460
Gly Gly Lys His Ala Arg Leu Gln Pro Val Cys Val Glu Val Thr Glu
465 470 475 480
Asp Leu Arg Pro Asp His Leu Val Leu Ser Cys Asn Gly Thr Glu Phe
485 490 495
Gly Ser Ser Gly Glu Glu Ser Asp
500
<210> 67
<211> 142
<212> PRT
<213> Simian adenovirus 38
<400> 67
Met Ser Gly Ser Ala Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu
1 5 10 15
Thr Gly Arg Leu Pro Ser Trp Ala Gly Val Arg Gln Asn Val Met Gly
20 25 30
Ser Thr Val Asp Gly Arg Pro Val Gln Pro Ala Asn Ser Ser Thr Leu
35 40 45
Thr Tyr Ala Thr Leu Ser Ser Ser Ser Val Asp Ala Ala Ala Ala Ala
50 55 60
Ala Ala Ala Ser Ala Ala Ser Ala Val Arg Gly Met Ala Leu Gly Ala
65 70 75 80
Gly Tyr Tyr Ser Ser Leu Val Ala Asn Ser Ser Ser Thr Asn Asn Pro
85 90 95
Ala Ser Leu Asn Glu Glu Lys Leu Leu Leu Leu Met Ala Gln Leu Glu
100 105 110
Ala Leu Thr Gln Arg Leu Gly Glu Leu Thr Gln Gln Val Ala Gln Leu
115 120 125
Gln Ala Glu Thr Arg Ala Ala Val Ala Thr Val Lys Thr Lys
130 135 140
<210> 68
<211> 392
<212> PRT
<213> Simian adenovirus 38
<400> 68
Met His Pro Val Leu Arg Gln Met Arg Pro His Pro Pro Pro Gln Pro
1 5 10 15
Pro Leu Pro Gln Gln Gln Gln Gln Pro Ala Leu Leu Pro Pro Pro Gln
20 25 30
Gln Gln Gln Gln Pro Ala Thr Thr Ala Ala Ala Ala Val Ser Gly Ala
35 40 45
Gly Val Gln Tyr Asp Leu Ala Leu Glu Glu Gly Glu Gly Leu Ala Arg
50 55 60
Leu Gly Ala Ser Ser Pro Glu Arg His Pro Arg Val Gln Met Lys Arg
65 70 75 80
Asp Ala Arg Glu Ala Tyr Val Pro Lys Gln Asn Leu Phe Arg Asp Arg
85 90 95
Ser Gly Glu Glu Pro Glu Glu Met Arg Ala Ser Arg Phe His Ala Gly
100 105 110
Arg Glu Leu Arg Arg Gly Leu Asp Arg Lys Arg Val Leu Arg Asp Glu
115 120 125
Asp Phe Glu Ala Asp Glu Leu Thr Gly Ile Ser Pro Ala Arg Ala His
130 135 140
Val Ala Ala Ala Asn Leu Val Thr Ala Tyr Glu Gln Thr Val Lys Glu
145 150 155 160
Glu Ser Asn Phe Gln Lys Ser Phe Asn Asn His Val Arg Thr Leu Ile
165 170 175
Ala Arg Glu Glu Val Thr Leu Gly Leu Met His Leu Trp Asp Leu Leu
180 185 190
Glu Ala Ile Val Gln Asn Pro Thr Ser Lys Pro Leu Thr Ala Gln Leu
195 200 205
Phe Leu Val Val Gln His Ser Arg Asp Asn Glu Thr Phe Arg Glu Ala
210 215 220
Leu Leu Asn Ile Thr Glu Pro Glu Gly Arg Trp Leu Leu Asp Leu Val
225 230 235 240
Asn Ile Leu Gln Ser Ile Val Val Gln Glu Arg Gly Leu Pro Leu Ser
245 250 255
Glu Lys Leu Ala Ala Ile Asn Phe Ser Val Leu Ser Leu Gly Lys Tyr
260 265 270
Tyr Ala Arg Lys Ile Tyr Lys Thr Pro Tyr Val Pro Ile Asp Lys Glu
275 280 285
Val Lys Ile Asp Gly Phe Tyr Met Arg Met Thr Leu Lys Val Leu Thr
290 295 300
Leu Ser Asp Asp Leu Gly Val Tyr Arg Asn Asp Arg Met His Arg Ala
305 310 315 320
Val Ser Ala Ser Arg Arg Arg Glu Leu Ser Asp Gln Glu Leu Met His
325 330 335
Ser Leu Gln Arg Ala Leu Thr Gly Ala Gly Thr Glu Gly Glu Ser Tyr
340 345 350
Phe Asp Met Gly Ala Asp Leu Arg Trp Gln Pro Ser Arg Arg Ala Leu
355 360 365
Glu Ala Ala Gly Gly Val Pro Tyr Val Glu Glu Val Asp Asp Glu Glu
370 375 380
Glu Glu Gly Glu Tyr Leu Glu Asp
385 390
<210> 69
<211> 586
<212> PRT
<213> Simian adenovirus 38
<400> 69
Met Gln Gln Gln Pro Pro Pro Pro Asp Pro Ala Met Arg Ala Ala Leu
1 5 10 15
Gln Ser Gln Pro Ser Gly Ile Asn Ser Ser Asp Asp Trp Thr Gln Ala
20 25 30
Met Gln Arg Ile Met Ala Leu Thr Thr Arg Asn Pro Glu Ala Phe Arg
35 40 45
Gln Gln Pro Gln Ala Asn Arg Leu Ser Ala Ile Leu Glu Ala Val Val
50 55 60
Pro Ser Arg Ser Asn Pro Thr His Glu Lys Val Leu Ala Ile Val Asn
65 70 75 80
Ala Leu Val Glu Asn Lys Ala Ile Arg Gly Asp Glu Ala Gly Leu Val
85 90 95
Tyr Asn Ala Leu Leu Glu Arg Val Ala Arg Tyr Asn Ser Thr Asn Val
100 105 110
Gln Thr Asn Leu Asp Arg Met Val Thr Asp Val Arg Glu Ala Val Ser
115 120 125
Gln Arg Glu Arg Phe His Arg Glu Ser Asn Leu Gly Ser Met Val Ala
130 135 140
Leu Asn Ala Phe Leu Ser Thr Gln Pro Ala Asn Val Pro Arg Gly Gln
145 150 155 160
Glu Asp Tyr Thr Asn Phe Ile Ser Ala Leu Arg Leu Met Val Ala Glu
165 170 175
Val Pro Gln Ser Glu Val Tyr Gln Ser Gly Pro Asp Tyr Phe Phe Gln
180 185 190
Thr Ser Arg Gln Gly Leu Gln Thr Val Asn Leu Ser Gln Ala Phe Lys
195 200 205
Asn Leu Gln Gly Leu Trp Gly Val Gln Ala Pro Val Gly Asp Arg Ala
210 215 220
Thr Val Ser Ser Leu Leu Thr Pro Asn Ser Arg Leu Leu Leu Leu Leu
225 230 235 240
Val Ala Pro Phe Thr Asp Ser Gly Ser Val Ser Arg Asp Ser Tyr Leu
245 250 255
Gly Tyr Leu Leu Asn Leu Tyr Arg Glu Ala Ile Gly Gln Ala His Val
260 265 270
Asp Glu Gln Thr Tyr Gln Glu Ile Thr His Val Ser Arg Ala Leu Gly
275 280 285
Gln Glu Asp Pro Gly Asn Leu Glu Ala Thr Leu Asn Phe Leu Leu Thr
290 295 300
Asn Arg Ser Gln Lys Ile Pro Pro Gln Tyr Ala Leu Ser Thr Glu Glu
305 310 315 320
Glu Arg Ile Leu Arg Tyr Val Gln Gln Ser Val Gly Leu Phe Leu Met
325 330 335
Gln Glu Gly Ala Thr Pro Ser Ala Ala Leu Asp Met Thr Ala Arg Asn
340 345 350
Met Glu Pro Ser Met Tyr Ala Arg Asn Arg Pro Phe Ile Asn Lys Leu
355 360 365
Met Asp Tyr Leu His Arg Ala Ala Ala Met Asn Ser Asp Tyr Phe Thr
370 375 380
Asn Ala Ile Leu Asn Pro His Trp Leu Pro Pro Pro Gly Phe Tyr Thr
385 390 395 400
Gly Glu Tyr Asp Met Pro Asp Pro Asn Asp Gly Phe Leu Trp Asp Asp
405 410 415
Val Asp Ser Ser Val Phe Ser Pro Arg Pro Thr Thr Thr Val Trp Lys
420 425 430
Lys Glu Gly Gly Asp Arg Arg Pro Ser Ser Ala Leu Ser Gly Arg Ala
435 440 445
Gly Ala Ala Ala Ala Val Pro Glu Ala Ala Ser Pro Phe Pro Ser Leu
450 455 460
Pro Phe Ser Leu Asn Ser Val Arg Ser Ser Glu Leu Gly Arg Leu Thr
465 470 475 480
Arg Pro Arg Leu Leu Gly Glu Glu Glu Tyr Leu Asn Asp Ser Leu Leu
485 490 495
Arg Pro Glu Arg Glu Lys Asn Phe Pro Asn Asn Gly Ile Glu Ser Leu
500 505 510
Val Asp Lys Met Ser Arg Trp Lys Thr Tyr Ala His Glu His Arg Asp
515 520 525
Glu Pro Arg Ala Ser Ser Ser Thr Gly Ala Arg Arg Arg Gln Arg His
530 535 540
Asp Arg Gln Arg Gly Leu Val Trp Asp Asp Glu Asp Ser Ala Asp Asp
545 550 555 560
Ser Ser Val Leu Asp Leu Gly Gly Ser Gly Gly Gly Asn Pro Phe Ala
565 570 575
His Leu Arg Pro Arg Ile Gly Arg Leu Met
580 585
<210> 70
<211> 539
<212> PRT
<213> Simian adenovirus 38
<400> 70
Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser
1 5 10 15
Val Met Gln Gln Ala Val Ala Ala Ala Met Gln Pro Pro Leu Glu Ala
20 25 30
Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser
35 40 45
Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr
50 55 60
Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn
65 70 75 80
Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp Phe Thr
85 90 95
Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg
100 105 110
Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn Met Pro Asn Val
115 120 125
Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser
130 135 140
Arg Lys Thr Pro Asn Gly Val Thr Val Thr Asp Gly Ser Gln Asp Glu
145 150 155 160
Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser
165 170 175
Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile Asp Asn Tyr
180 185 190
Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp Ile Gly Val
195 200 205
Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr Glu
210 215 220
Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp Ile
225 230 235 240
Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg Leu Ser
245 250 255
Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu Gly Phe Gln
260 265 270
Ile Leu Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp
275 280 285
Val Glu Ala Tyr Glu Lys Ser Lys Glu Glu Ala Ala Ala Ala Ala Thr
290 295 300
Ala Ala Val Ala Thr Ala Ala Thr Thr Asp Ala Asp Ala Ala Thr Thr
305 310 315 320
Thr Arg Gly Asp Thr Phe Ala Thr Gln Ala Glu Glu Ala Ala Ala Leu
325 330 335
Ala Ala Thr Asp Asp Ser Glu Ser Lys Ile Val Ile Lys Pro Val Glu
340 345 350
Lys Asp Ser Lys Asp Arg Ser Tyr Asn Val Leu Ser Asp Gly Lys Asn
355 360 365
Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu
370 375 380
Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr Ser Asp Val Thr Cys
385 390 395 400
Gly Val Glu Gln Val Tyr Trp Ser Leu Pro Asp Met Met Gln Asp Pro
405 410 415
Val Thr Phe Arg Ser Thr Arg Gln Val Ser Asn Tyr Pro Val Val Gly
420 425 430
Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gln Ala
435 440 445
Val Tyr Ser Gln Gln Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe
450 455 460
Asn Arg Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr
465 470 475 480
Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr
485 490 495
Leu Pro Leu Arg Ser Ser Ile Arg Gly Val Gln Arg Val Thr Val Thr
500 505 510
Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val
515 520 525
Val Ala Pro Arg Val Leu Ser Ser Arg Thr Phe
530 535
<210> 71
<211> 194
<212> PRT
<213> Simian adenovirus 38
<400> 71
Met Ser Ile Leu Ile Ser Pro Ser Asn Asn Thr Gly Trp Gly Leu Arg
1 5 10 15
Ala Pro Ser Lys Met Tyr Gly Gly Ala Arg Gln Arg Ser Thr Gln His
20 25 30
Pro Val Arg Val Arg Gly His Phe Arg Ala Pro Trp Gly Ala Leu Lys
35 40 45
Gly Arg Val Arg Ser Arg Thr Thr Val Asp Asp Val Ile Asp Gln Val
50 55 60
Val Ala Asp Ala Arg Asn Tyr Thr Pro Ala Ala Ala Pro Ala Ser Thr
65 70 75 80
Val Asp Ala Val Ile Asp Ser Val Val Ala Asp Ala Arg Arg Tyr Ala
85 90 95
Arg Ala Lys Ser Arg Arg Arg Arg Ile Ala Arg Arg His Arg Ser Thr
100 105 110
Pro Ala Met Arg Ala Ala Arg Ala Leu Leu Arg Arg Ala Arg Arg Thr
115 120 125
Gly Arg Arg Ala Met Leu Arg Ala Ala Arg Arg Ala Ala Ser Gly Ser
130 135 140
Ser Ser Ala Gly Arg Thr Arg Arg Arg Ala Ala Thr Ala Ala Ala Ala
145 150 155 160
Ala Ile Ala Ser Met Ser Arg Pro Arg Arg Gly Asn Val Tyr Trp Val
165 170 175
Arg Asp Ala Ala Thr Gly Val Arg Val Pro Val Arg Thr Arg Pro Pro
180 185 190
Arg Thr
<210> 72
<211> 349
<212> PRT
<213> Simian adenovirus 38
<400> 72
Met Ser Lys Arg Lys Tyr Lys Glu Glu Met Leu Gln Val Ile Ala Pro
1 5 10 15
Glu Ile Tyr Gly Pro Ala Ala Ala Val Lys Glu Glu Arg Lys Pro Arg
20 25 30
Lys Leu Lys Arg Val Lys Lys Asp Lys Lys Glu Glu Glu Asp Val Asn
35 40 45
Gly Leu Val Glu Phe Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gln
50 55 60
Trp Arg Gly Arg Lys Val Lys Pro Val Leu Arg Pro Gly Thr Thr Val
65 70 75 80
Val Phe Thr Pro Gly Glu Arg Ser Gly Ser Ala Ser Lys Arg Ser Tyr
85 90 95
Asp Glu Val Tyr Gly Asp Glu Asp Ile Leu Glu Gln Ala Val Glu Arg
100 105 110
Leu Gly Glu Phe Ala Tyr Gly Lys Arg Ser Arg Pro Ala Pro Leu Lys
115 120 125
Glu Glu Ala Val Ser Ile Pro Leu Asp His Gly Asn Pro Thr Pro Ser
130 135 140
Leu Lys Pro Val Thr Leu Gln Gln Val Leu Pro Gly Ala Ala Pro Arg
145 150 155 160
Arg Gly Phe Lys Arg Glu Gly Gly Glu Asp Leu Tyr Pro Thr Met Gln
165 170 175
Leu Met Val Pro Lys Arg Gln Lys Leu Glu Asp Val Leu Glu His Met
180 185 190
Lys Val Asp Pro Glu Val Gln Pro Glu Val Lys Val Arg Pro Ile Lys
195 200 205
Gln Val Ala Pro Gly Leu Gly Val Gln Thr Val Asp Ile Lys Ile Pro
210 215 220
Thr Glu Pro Met Glu Thr Gln Thr Glu Pro Val Lys Pro Ser Thr Ser
225 230 235 240
Thr Met Glu Val Gln Thr Asp Pro Trp Met Pro Ala Pro Ala Ser Thr
245 250 255
Thr Thr Thr Arg Arg Arg Arg Lys Tyr Gly Ala Ala Ser Leu Leu Met
260 265 270
Pro Asn Tyr Ala Leu His Pro Ser Ile Ile Pro Thr Pro Gly Tyr Arg
275 280 285
Gly Thr Arg Phe Tyr Arg Gly Tyr Thr Ser Ser Arg Arg Arg Lys Thr
290 295 300
Thr Thr Arg Arg Arg Arg Arg Ser Arg Arg Ser Ser Thr Ala Thr Ser
305 310 315 320
Ala Ala Ala Leu Val Arg Arg Val Tyr Arg Ser Gly Arg Glu Pro Leu
325 330 335
Thr Leu Pro Arg Ala Arg Tyr His Pro Ser Ile Ala Ile
340 345
<210> 73
<211> 77
<212> PRT
<213> Simian adenovirus 38
<400> 73
Met Ala Leu Thr Cys Arg Leu Arg Val Pro Ile Thr Gly Tyr Arg Gly
1 5 10 15
Arg Lys Pro Arg Arg Arg Arg Leu Thr Gly Asn Gly Leu Arg Arg His
20 25 30
His His Arg Arg Arg Arg Ala Ile Ser Lys Arg Leu Gly Gly Gly Phe
35 40 45
Leu Pro Ala Leu Ile Pro Ile Ile Ala Ala Ala Ile Gly Ala Ile Pro
50 55 60
Gly Ile Ala Ser Val Ala Val Gln Ala Ser Gln Arg His
65 70 75
<210> 74
<211> 240
<212> PRT
<213> Simian adenovirus 38
<400> 74
Met Glu Asp Ile Asn Phe Ser Ser Leu Ala Pro Arg His Gly Thr Arg
1 5 10 15
Pro Phe Met Gly Thr Trp Ser Asp Ile Gly Asn Ser Gln Leu Asn Gly
20 25 30
Gly Ala Phe Asn Trp Ser Ser Leu Trp Ser Gly Leu Lys Asn Phe Gly
35 40 45
Ser Thr Leu Lys Thr Tyr Gly Asn Lys Ala Trp Asn Ser Ser Thr Gly
50 55 60
Gln Ala Leu Arg Glu Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys Val
65 70 75 80
Val Asp Gly Leu Ala Ser Gly Ile Asn Gly Val Val Asp Leu Ala Asn
85 90 95
Gln Ala Val Gln Lys Gln Ile Asn Ser Arg Leu Asp Ala Val Pro Pro
100 105 110
Ala Gly Ser Val Glu Met Pro Gln Val Glu Glu Glu Leu Pro Pro Leu
115 120 125
Asp Lys Arg Gly Asp Lys Arg Pro Arg Pro Asp Ala Glu Glu Thr Leu
130 135 140
Leu Thr His Thr Asp Glu Pro Pro Pro Tyr Glu Glu Ala Val Lys Leu
145 150 155 160
Gly Leu Pro Thr Thr Arg Pro Val Ala Pro Leu Ala Thr Gly Val Leu
165 170 175
Lys Pro Ser Ser Ser Ser Gln Pro Ala Thr Leu Asp Leu Pro Pro Pro
180 185 190
Arg Pro Ser Thr Val Ala Lys Pro Leu Pro Pro Val Ala Val Ala Ser
195 200 205
Arg Ala Pro Arg Gly Arg Pro Gln Ala Asn Trp Gln Ser Thr Leu Asn
210 215 220
Ser Ile Val Gly Leu Gly Val Gln Ser Val Lys Arg Arg Arg Cys Tyr
225 230 235 240
<210> 75
<211> 930
<212> PRT
<213> Simian adenovirus 38
<400> 75
Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr
65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Thr Cys Gln Trp Thr Tyr Thr Asp Asn Gln Thr Glu Lys
130 135 140
Thr Ala Thr Tyr Gly Asn Ala Pro Val Glu Gly Ile Asn Ile Thr Lys
145 150 155 160
Asp Gly Ile Gln Leu Gly Thr Asp Ser Asp Gly Gln Ala Ile Tyr Ala
165 170 175
Asp Glu Thr Tyr Gln Pro Glu Pro Gln Val Gly Asp Pro Glu Trp His
180 185 190
Asp Thr Thr Gly Thr Glu Glu Lys Tyr Gly Gly Arg Ala Leu Lys Pro
195 200 205
Ala Thr Asp Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn
210 215 220
Val Lys Gly Gly Gln Ala Lys Ser Arg Thr Lys Thr Asp Gly Thr Thr
225 230 235 240
Glu Pro Asp Ile Asp Met Ala Phe Phe Asp Gly Arg Asn Ala Thr Thr
245 250 255
Ala Gly Leu Thr Pro Glu Ile Val Leu Tyr Thr Glu Asn Val Asp Leu
260 265 270
Glu Thr Pro Asp Thr His Ile Val Tyr Lys Ala Gly Thr Asp Asp Ser
275 280 285
Ser Ser Ser Ile Asn Leu Gly Gln Gln Ser Met Pro Asn Arg Pro Asn
290 295 300
Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser
305 310 315 320
Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala
325 330 335
Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu
340 345 350
Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln
355 360 365
Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly
370 375 380
Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asn Ala Val Gly
385 390 395 400
Arg Thr Asp Ser Tyr Gln Gly Ile Lys Pro Asn Gly Gly Asp Pro Ala
405 410 415
Thr Trp Ala Lys Asp Glu Ser Val Asn Asp Ser Asn Glu Leu Gly Lys
420 425 430
Gly Asn Pro Phe Ala Met Glu Ile Asn Ile Gln Ala Asn Leu Trp Arg
435 440 445
Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys
450 455 460
Tyr Thr Pro Ala Asn Ile Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp
465 470 475 480
Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp Ala Tyr Ile
485 490 495
Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn Pro
500 505 510
Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu
515 520 525
Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro Gln Lys Phe
530 535 540
Phe Ala Ile Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu
545 550 555 560
Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gln Ser Ser Leu Gly
565 570 575
Asn Asp Leu Arg Thr Asp Gly Ala Ser Ile Ala Phe Thr Ser Ile Asn
580 585 590
Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu
595 600 605
Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr
610 615 620
Leu Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn
625 630 635 640
Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp
645 650 655
Ser Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser Gly
660 665 670
Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly
675 680 685
Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Thr Phe Asp
690 695 700
Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu
705 710 715 720
Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gln
725 730 735
Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gln Met Leu Ala His Tyr
740 745 750
Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp Arg
755 760 765
Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg Gln Val Val
770 775 780
Asp Glu Val Asn Tyr Lys Asp Tyr Gln Ala Val Thr Leu Thr Tyr Gln
785 790 795 800
His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr Met Arg Gln
805 810 815
Gly Gln Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser
820 825 830
Ala Val Ala Ser Val Thr Gln Lys Lys Phe Leu Cys Asp Arg Val Met
835 840 845
Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr
850 855 860
Asp Leu Gly Gln Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp
865 870 875 880
Met Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val
885 890 895
Val Phe Glu Val Phe Asp Val Val Arg Val His Gln Pro His Arg Gly
900 905 910
Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala
915 920 925
Thr Thr
930
<210> 76
<211> 207
<212> PRT
<213> Simian adenovirus 38
<400> 76
Met Thr Ala Cys Gly Ser Gly Glu Gln Glu Leu Arg Ala Ile Leu Arg
1 5 10 15
Asp Leu Gly Cys Gly Pro Cys Phe Leu Gly Thr Phe Asp Lys Arg Phe
20 25 30
Pro Gly Phe Met Ala Pro His Lys Leu Ala Cys Ala Ile Val Asn Thr
35 40 45
Ala Gly Arg Glu Thr Gly Gly Glu His Trp Leu Ala Phe Ala Trp Asn
50 55 60
Pro Arg Ser His Thr Cys Tyr Leu Phe Asp Pro Phe Gly Phe Ser Asp
65 70 75 80
Glu Arg Leu Lys Gln Ile Tyr Gln Phe Glu Tyr Glu Gly Leu Leu Arg
85 90 95
Arg Ser Ala Leu Ala Thr Glu Asp Arg Cys Val Thr Leu Glu Lys Ser
100 105 110
Thr Gln Thr Val Gln Gly Pro Arg Ser Ala Ala Cys Gly Leu Phe Cys
115 120 125
Cys Met Phe Leu His Ala Phe Val His Trp Pro Asp Arg Pro Met Asp
130 135 140
Lys Asn Pro Thr Met Asn Leu Leu Thr Gly Val Pro Asn Gly Met Leu
145 150 155 160
Gln Ser Pro Gln Val Glu Pro Thr Leu Arg Arg Asn Gln Glu Ala Leu
165 170 175
Tyr Arg Phe Leu Asn Ala His Ser Ala Tyr Phe Arg Ser His Arg Ala
180 185 190
Arg Ile Glu Lys Ala Thr Ala Phe Asp Arg Met Asn Gln Asp Met
195 200 205
<210> 77
<211> 795
<212> PRT
<213> Simian adenovirus 38
<400> 77
Met Glu Thr Gln Pro Ser Ser Pro Thr Ser Pro Ser Ala Pro Ala Asp
1 5 10 15
Glu Asn Gln Gln Gln Asn Glu Ser Leu Thr Ala Pro Pro Pro Ser Pro
20 25 30
Thr Ser Asp Ala Ala Pro Asp Met Gln Glu Met Glu Glu Ser Ile Glu
35 40 45
Ile Asp Leu Gly Tyr Val Thr Pro Ala Glu His Glu Glu Glu Leu Ala
50 55 60
Ala Arg Phe Ser Ala Pro Glu Glu Asn His Gln Glu Gln Pro Glu Gln
65 70 75 80
Glu Ala Glu Ser Glu Gln Gln Gln Ala Gly Leu Glu His Gly Asp Tyr
85 90 95
Leu Ser Gly Ala Glu Asp Val Leu Ile Lys His Leu Ala Arg Gln Cys
100 105 110
Ile Ile Val Lys Asp Ala Leu Leu Asp Arg Ala Glu Val Pro Leu Ser
115 120 125
Val Ala Glu Leu Ser Arg Ala Tyr Glu Arg Asn Leu Phe Ser Pro Arg
130 135 140
Val Pro Pro Lys Arg Gln Pro Asn Gly Thr Cys Glu Pro Asn Pro Arg
145 150 155 160
Leu Asn Phe Tyr Pro Val Phe Ala Val Pro Glu Ala Leu Ala Thr Tyr
165 170 175
His Ile Phe Phe Lys Asn Gln Lys Ile Pro Val Ser Cys Arg Ala Asn
180 185 190
Arg Thr Arg Ala Asp Ala Leu Phe Asn Leu Gly Pro Gly Ala Arg Leu
195 200 205
Pro Asp Ile Ala Ser Leu Glu Glu Val Pro Lys Ile Phe Glu Gly Leu
210 215 220
Gly Ser Asp Glu Thr Arg Ala Ala Asn Ala Leu Gln Gly Asn Glu Glu
225 230 235 240
His Glu His His Ser Ala Leu Val Glu Leu Glu Gly Asp Asn Ala Arg
245 250 255
Leu Ala Val Leu Lys Arg Thr Val Glu Leu Thr His Phe Ala Tyr Pro
260 265 270
Ala Leu Asn Leu Pro Pro Lys Val Met Asn Ala Val Met Asp Gln Val
275 280 285
Leu Ile Lys Arg Ala Ser Pro Leu Ser Glu Glu Glu Met Gln Asp Pro
290 295 300
Glu Ser Ser Asp Glu Gly Lys Pro Val Val Ser Asp Glu Gln Leu Ala
305 310 315 320
Arg Trp Leu Gly Ala Ser Ser Thr Pro Gln Ser Leu Glu Glu Arg Arg
325 330 335
Lys Leu Met Met Ala Val Val Leu Val Thr Val Glu Leu Glu Cys Leu
340 345 350
Arg Arg Phe Phe Ala Asp Ala Glu Thr Leu Arg Lys Val Glu Glu Asn
355 360 365
Leu His Tyr Leu Phe Arg His Gly Phe Val Arg Gln Ala Cys Lys Ile
370 375 380
Ser Asn Val Glu Leu Thr Asn Leu Val Ser Tyr Met Gly Ile Leu His
385 390 395 400
Glu Asn Arg Leu Gly Gln Asn Val Leu His Thr Thr Leu Arg Gly Glu
405 410 415
Ala Arg Arg Asp Tyr Ile Arg Asp Cys Val Tyr Leu Tyr Leu Cys His
420 425 430
Thr Trp Gln Thr Gly Met Gly Val Trp Gln Gln Cys Leu Glu Glu Gln
435 440 445
Asn Leu Lys Glu Leu Cys Lys Leu Leu Gln Lys Asn Leu Lys Ala Leu
450 455 460
Trp Thr Gly Phe Asp Glu Arg Thr Thr Ala Ala Asp Leu Ala Asp Leu
465 470 475 480
Ile Phe Pro Glu Arg Leu Arg Leu Thr Leu Arg Asn Gly Leu Pro Asp
485 490 495
Phe Met Ser Gln Ser Met Leu Gln Asn Phe Arg Ser Phe Ile Leu Glu
500 505 510
Arg Ser Gly Ile Leu Pro Ala Thr Cys Ser Ala Leu Pro Ser Asp Phe
515 520 525
Val Pro Leu Thr Phe Arg Glu Cys Pro Pro Pro Leu Trp Ser His Cys
530 535 540
Tyr Leu Leu Arg Leu Ala Asn Tyr Leu Ala Tyr His Ser Asp Val Ile
545 550 555 560
Glu Asp Val Ser Gly Glu Gly Leu Leu Glu Cys His Cys Arg Cys Asn
565 570 575
Leu Cys Thr Pro His Arg Ser Leu Ala Cys Asn Pro Gln Leu Leu Ser
580 585 590
Glu Thr Gln Ile Ile Gly Thr Phe Glu Leu Gln Gly Pro Gly Asp Ser
595 600 605
Glu Gly Lys Gly Gly Leu Lys Leu Thr Pro Gly Leu Trp Thr Ser Ala
610 615 620
Tyr Leu Arg Lys Phe Val Pro Glu Asp Tyr His Pro Phe Glu Ile Arg
625 630 635 640
Phe Tyr Glu Asp Gln Ser Gln Pro Pro Lys Ala Glu Leu Ser Ala Cys
645 650 655
Val Ile Thr Gln Gly Ala Ile Leu Ala Gln Leu Gln Ala Ile Gln Lys
660 665 670
Ser Arg Gln Glu Phe Leu Leu Lys Lys Gly His Gly Val Tyr Leu Asp
675 680 685
Pro Gln Thr Gly Glu Glu Leu Asn Pro Ser Phe Pro Gln Asp Ala Pro
690 695 700
Arg Lys Gln Gln Glu Ala Glu Ser Gly Ala Ala Ala Gly Gly Phe Gly
705 710 715 720
Gly Arg Leu Gly Glu Gln Ser Gly Arg Gly Gly Asp Gly Arg Leu Gly
725 730 735
Gln His Ser Gly Arg Gly Gly Gln Pro Ala Arg Gln Ser Gly Gly Gly
740 745 750
Arg Arg Gly Gly Gly Gly Gly Arg Gly Arg Ser Ser Arg Arg Gln Thr
755 760 765
Val Val Leu Gly Gly Gly Glu Ser Lys Gln His Gly Tyr His Leu Arg
770 775 780
Ser Gly Ser Gly Ser Arg Arg Ser Gly Pro Gln
785 790 795
<210> 78
<211> 227
<212> PRT
<213> Simian adenovirus 38
<400> 78
Met Ser Lys Glu Ile Pro Thr Pro Tyr Met Trp Ser Tyr Gln Pro Gln
1 5 10 15
Met Gly Leu Ala Ala Gly Ala Ala Gln Asp Tyr Ser Thr Arg Met Asn
20 25 30
Trp Leu Ser Ala Gly Pro Ala Met Ile Ser Arg Val Asn Asp Ile Arg
35 40 45
Ala His Arg Asn Gln Ile Leu Leu Glu Gln Ser Ala Ile Thr Ala Thr
50 55 60
Pro Arg His His Leu Asn Pro Arg Asn Trp Pro Ala Ala Leu Val Tyr
65 70 75 80
Gln Glu Ile Pro Gln Pro Thr Thr Val Leu Leu Pro Arg Asp Ala Gln
85 90 95
Ala Glu Val Gln Leu Thr Asn Ser Gly Val Gln Leu Ala Gly Gly Ala
100 105 110
Ala Leu Cys Arg His Arg Pro Ala Gln Gly Ile Lys Arg Leu Val Ile
115 120 125
Arg Gly Arg Gly Thr Gln Leu Asn Asp Glu Val Val Ser Ser Ser Leu
130 135 140
Gly Leu Arg Pro Asp Gly Val Phe Gln Leu Ala Gly Ser Gly Arg Ser
145 150 155 160
Ser Phe Thr Pro Arg Gln Ala Val Leu Thr Leu Glu Ser Ser Ser Ser
165 170 175
Gln Pro Arg Ser Gly Gly Ile Gly Thr Leu Gln Phe Val Glu Glu Phe
180 185 190
Thr Pro Ser Val Tyr Phe Asn Pro Phe Ser Gly Ser Pro Gly His Tyr
195 200 205
Pro Asp Glu Phe Ile Pro Asn Phe Asp Ala Ile Ser Glu Ser Val Asp
210 215 220
Gly Tyr Asp
225
<210> 79
<211> 106
<212> PRT
<213> Simian adenovirus 38
<400> 79
Met Ser His Gly Gly Ala Ala Asp Leu Ala Arg Leu Arg His Leu Asp
1 5 10 15
His Cys Arg Arg Phe Arg Cys Phe Ala Arg Asp Leu Ala Glu Phe Ala
20 25 30
Tyr Phe Glu Leu Pro Glu Glu His Pro Gln Gly Pro Ala His Gly Val
35 40 45
Arg Ile Val Val Glu Gly Gly Leu Asp Ser His Leu Leu Arg Ile Phe
50 55 60
Ser Gln Arg Pro Ile Leu Val Glu Arg Glu Gln Gly Gln Thr Arg Leu
65 70 75 80
Thr Leu Tyr Cys Ile Cys Asn His Pro Gly Leu His Glu Ser Leu Cys
85 90 95
Cys Leu Leu Cys Thr Glu Tyr Asn Lys Ser
100 105
<210> 80
<211> 206
<212> PRT
<213> Simian adenovirus 38
<400> 80
Met Ala Ser Val Lys Phe Phe Ala Leu Phe Ala Ser Leu Ile Thr Val
1 5 10 15
Ile His Gly Met Ser Asn Glu Lys Ile Thr Ile Tyr Thr Gly Thr Asn
20 25 30
His Thr Leu Lys Gly Pro Glu Lys Ala Thr Glu Val Ser Trp Tyr Cys
35 40 45
Tyr Phe Asn Glu Ser Asp Val Ala Thr Glu Leu Cys Gly Asn Asn Asn
50 55 60
Lys Lys Asn Glu Ser Ile Thr Leu Ile Lys Phe Gln Cys Gly Ser Asp
65 70 75 80
Leu Thr Leu Ile Asn Ile Thr Arg Asp Tyr Val Gly Met Tyr Tyr Gly
85 90 95
Thr Thr Ala Gly Ile Ser Asp Met Glu Phe Tyr Gln Val Ser Val Ser
100 105 110
Glu Pro Thr Thr Pro Arg Met Thr Thr Thr Thr Lys Thr Thr Pro Thr
115 120 125
Thr Thr Thr Gln Leu Thr Thr Asn Gly Phe Phe Ala Met Leu Gln Val
130 135 140
Ala Glu Asn Ser Thr Ser Ile Gln Pro Thr Pro Pro Ser Glu Glu Ile
145 150 155 160
Pro Lys Ser Met Ile Gly Ile Ile Val Ala Val Val Val Cys Met Leu
165 170 175
Ile Ile Ala Leu Cys Met Val Tyr Tyr Ala Phe Cys Tyr Arg Lys His
180 185 190
Arg Leu Asn Asp Lys Leu Glu His Leu Leu Ser Val Glu Phe
195 200 205
<210> 81
<211> 203
<212> PRT
<213> Simian adenovirus 38
<400> 81
Met Lys Ile Leu Cys Leu Leu Val Phe Tyr Ile Ile Thr Ser Ala Leu
1 5 10 15
Cys Glu Ser Val Asp Lys Asp Val Thr Ile Thr Thr Gly Ser Asn Tyr
20 25 30
Thr Leu Lys Gly Pro Pro Ser Gly Met Leu Ser Trp Tyr Cys Tyr Phe
35 40 45
Gly Thr Asp Thr Asp Gln Thr Glu Leu Cys Asn Phe Gln Lys Gly Lys
50 55 60
Thr Ser Asn Ser Lys Ile Ser Asn Tyr Gln Cys Asn Gly Thr Asp Leu
65 70 75 80
Ile Leu Leu Asn Val Thr Lys Ala Tyr Gly Gly Ser Tyr Ser Cys Pro
85 90 95
Gly Gln Asn Thr Glu Glu Met Ile Phe Tyr Lys Val Glu Val Val Asp
100 105 110
Pro Thr Thr Pro Pro Pro Pro Ala Thr Thr Thr His Thr Thr His Thr
115 120 125
Glu Gln Ser Thr Ala Glu Ala Ala Lys Leu Ala Leu Gln Val Gln Asp
130 135 140
Ser Ser Phe Val Gly Ile Thr Pro Thr Pro Asp Gln Arg Cys Pro Gly
145 150 155 160
Leu Leu Val Ser Gly Ile Val Gly Val Leu Ser Gly Leu Ala Val Ile
165 170 175
Ile Ile Cys Met Phe Ile Phe Ala Cys Cys Tyr Arg Arg Leu Tyr Arg
180 185 190
Gln Lys Ser Asp Pro Leu Leu Asn Leu Tyr Val
195 200
<210> 82
<211> 288
<212> PRT
<213> Simian adenovirus 38
<400> 82
Met Lys Ala Val Ser Ala Leu Val Phe Cys Ser Leu Ile Gly Ile Val
1 5 10 15
Phe Asn Ser Lys Ile Thr Arg Val Ser Phe Ile Lys His Val Asn Val
20 25 30
Thr Glu Gly Asp Asn Ile Thr Leu Ala Gly Val Glu Gly Ala Gln Asn
35 40 45
Thr Thr Trp Thr Lys Tyr His Leu Gly Trp Arg Asp Ile Cys Thr Trp
50 55 60
Asn Val Thr Tyr Tyr Cys Ile Gly Val Asn Leu Thr Ile Val Asn Ala
65 70 75 80
Asn Gln Ser Gln Asn Gly Leu Ile Lys Gly Gln Ser Val Ser Val Thr
85 90 95
Ser Asp Gly Tyr Tyr Thr Gln His Ser Phe Asn Tyr Asn Ile Thr Val
100 105 110
Ile Pro Leu Pro Thr Pro Ser Pro Pro Ser Thr Thr Ala Gln Thr Thr
115 120 125
Thr Tyr Ser Thr Ser Asn Gln Pro Thr Thr Thr Thr Ala Ala Glu Val
130 135 140
Ala Ser Ser Ser Gly Val Arg Val Ala Phe Leu Met Leu Ala Pro Ser
145 150 155 160
Ser Ser Pro Thr Ala Ser Thr Asn Glu Gln Thr Thr Glu Phe Leu Ser
165 170 175
Thr Val Glu Ser His Thr Thr Ala Thr Ser Ser Ala Phe Ser Ser Thr
180 185 190
Ala Asn Leu Ser Ser Leu Ser Ser Thr Pro Ile Ser Pro Ala Thr Thr
195 200 205
Pro Ser Pro Ala Pro Leu Pro Thr Pro Leu Lys Gln Thr Asp Gly Gly
210 215 220
Met Gln Trp Gln Ile Thr Leu Leu Ile Val Ile Gly Leu Val Ile Leu
225 230 235 240
Ala Val Leu Leu Tyr Tyr Ile Phe Cys Arg Arg Ile Pro Asn Ala His
245 250 255
Arg Lys Pro Ala Tyr Lys Pro Ile Val Ile Gly Gln Pro Glu Pro Leu
260 265 270
Gln Val Glu Gly Gly Leu Arg Asn Leu Leu Phe Ser Phe Thr Val Trp
275 280 285
<210> 83
<211> 91
<212> PRT
<213> Simian adenovirus 38
<400> 83
Met Ile Pro Arg Gln Phe Leu Ile Thr Ile Leu Ile Cys Leu Leu Gln
1 5 10 15
Val Cys Ala Thr Leu Ala Leu Val Ala Asn Ala Ser Pro Asp Cys Ile
20 25 30
Gly Pro Phe Ala Ser Tyr Val Leu Phe Ala Phe Val Thr Cys Ile Cys
35 40 45
Cys Cys Ser Ile Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Ile Asp
50 55 60
Trp Ile Phe Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg
65 70 75 80
Asp Gln Arg Val Ala Gln Leu Leu Arg Leu Leu
85 90
<210> 84
<211> 135
<212> PRT
<213> Simian adenovirus 38
<400> 84
Met Thr Asp Pro Leu Ala Asn Asn Asn Val Asn Asp Leu Leu Leu Asp
1 5 10 15
Met Asp Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln Leu Arg Ile Arg
20 25 30
Gln Gln Gln Glu Arg Ala Val Lys Glu Leu Gln Asp Gly Ile Ala Ile
35 40 45
His Gln Cys Lys Lys Gly Ile Phe Cys Leu Val Lys Gln Ala Lys Ile
50 55 60
Ser Tyr Glu Val Thr Gln Thr Asp His Arg Leu Ser Tyr Glu Leu Leu
65 70 75 80
Gln Gln Arg Gln Lys Phe Thr Cys Leu Val Gly Val Asn Pro Ile Val
85 90 95
Ile Thr Gln Gln Ser Gly Asp Thr Lys Gly Cys Ile His Cys Ser Cys
100 105 110
Asp Ser Pro Asp Cys Val His Thr Leu Ile Lys Thr Leu Cys Gly Leu
115 120 125
Arg Asp Leu Leu Pro Met Asn
130 135
<210> 85
<211> 425
<212> PRT
<213> Simian adenovirus 38
<400> 85
Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr
1 5 10 15
Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro
20 25 30
Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser
35 40 45
Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu
50 55 60
Lys Leu Gly Glu Gly Val Asp Leu Asp Ser Ser Gly Lys Leu Ile Ser
65 70 75 80
Asn Thr Ala Ala Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr
85 90 95
Ile Ser Leu Asn Met Asp His Pro Phe Tyr Thr Lys Asp Gly Lys Leu
100 105 110
Ala Leu Gln Val Ser Pro Pro Leu Asn Ile Leu Arg Thr Ser Ile Leu
115 120 125
Asn Thr Leu Ala Leu Gly Phe Gly Ser Gly Leu Gly Leu Arg Gly Ser
130 135 140
Ala Leu Ala Val Gln Leu Val Ser Pro Leu Thr Phe Asp Thr Asp Gly
145 150 155 160
Asn Ile Lys Leu Thr Leu Asp Arg Gly Leu His Val Thr Thr Gly Asp
165 170 175
Ala Ile Glu Ser Asn Ile Ser Trp Ala Lys Gly Leu Lys Phe Glu Asp
180 185 190
Gly Ala Ile Ala Thr Asn Ile Gly Asn Gly Leu Glu Phe Gly Ser Ser
195 200 205
Ser Thr Glu Thr Gly Val Asp Asp Ala Tyr Pro Ile Gln Val Lys Leu
210 215 220
Gly Ser Gly Leu Ser Phe Asp Ser Thr Gly Ala Ile Met Ala Gly Asn
225 230 235 240
Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro
245 250 255
Asn Cys Gln Ile Leu Ala Glu Asn Asp Ala Lys Leu Thr Leu Cys Leu
260 265 270
Thr Lys Cys Gly Ser Gln Ile Leu Ala Thr Val Ser Val Leu Val Val
275 280 285
Gly Ser Gly Asn Leu Asn Pro Ile Thr Gly Thr Val Ser Ser Ala Gln
290 295 300
Val Phe Leu Arg Phe Asp Ala Asn Gly Val Leu Leu Thr Glu His Ser
305 310 315 320
Thr Leu Lys Lys Tyr Trp Gly Tyr Arg Gln Gly Asp Ser Ile Asp Gly
325 330 335
Thr Pro Tyr Ala Asn Ala Val Gly Phe Met Pro Asn Leu Lys Ala Tyr
340 345 350
Pro Lys Ser Gln Ser Ser Thr Thr Lys Asn Asn Ile Val Gly Gln Val
355 360 365
Tyr Met Asn Gly Asp Val Ser Lys Pro Met Leu Leu Thr Ile Thr Leu
370 375 380
Asn Gly Thr Asp Asp Ser Asn Ser Thr Tyr Ser Met Ser Phe Ser Tyr
385 390 395 400
Thr Trp Thr Asn Gly Ser Tyr Val Gly Ala Thr Phe Gly Ala Asn Ser
405 410 415
Tyr Thr Phe Ser Tyr Ile Ala Gln Glu
420 425
<210> 86
<211> 580
<212> DNA
<213> Simian adenovirus 38
<220>
<221> CDS
<222> (1)..(576)
<223> label=Elb\19K
<400> 86
atg gag atc tgg acg gtc ttg gaa gac ttt cat cag act aga cag ctg 48
Met Glu Ile Trp Thr Val Leu Glu Asp Phe His Gln Thr Arg Gln Leu
1 5 10 15
cta gag aac tca tcg gag gaa gtc tct tac ctg tgg aga ttt tgc ttc 96
Leu Glu Asn Ser Ser Glu Glu Val Ser Tyr Leu Trp Arg Phe Cys Phe
20 25 30
ggt ggg gct cta gct aag cta gtc tat agg gcc aaa cag gat tat aag 144
Gly Gly Ala Leu Ala Lys Leu Val Tyr Arg Ala Lys Gln Asp Tyr Lys
35 40 45
gat caa ttt gag gat att ttg aga gag tgt cct ggt att ttt gac tct 192
Asp Gln Phe Glu Asp Ile Leu Arg Glu Cys Pro Gly Ile Phe Asp Ser
50 55 60
ctc aac ttg ggc cat cag tct cac ttt aac cag agt att ctg aga gcc 240
Leu Asn Leu Gly His Gln Ser His Phe Asn Gln Ser Ile Leu Arg Ala
65 70 75 80
ctt gac ttt tct act cct ggc aga act acc gcc gcg gta gcc ttt ttt 288
Leu Asp Phe Ser Thr Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe
85 90 95
gcc ttt atc ctt gac aaa tgg agt caa gaa acc cat ttc agc agg gat 336
Ala Phe Ile Leu Asp Lys Trp Ser Gln Glu Thr His Phe Ser Arg Asp
100 105 110
tac cgt ctg gac tgc tta gca gta gct ttg tgg aga aca tgg agg tgc 384
Tyr Arg Leu Asp Cys Leu Ala Val Ala Leu Trp Arg Thr Trp Arg Cys
115 120 125
cag cgc ctg aat gca atc tcc ggc tac ttg cca gta cag ccg gta gac 432
Gln Arg Leu Asn Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro Val Asp
130 135 140
acg ctg agg atc ctg agt ctc cag tca ccc cag gaa cac caa cgc cgc 480
Thr Leu Arg Ile Leu Ser Leu Gln Ser Pro Gln Glu His Gln Arg Arg
145 150 155 160
cag cag ccg cag cag gag cag cag caa gag gag gag gag gac cga gaa 528
Gln Gln Pro Gln Gln Glu Gln Gln Gln Glu Glu Glu Glu Asp Arg Glu
165 170 175
gag aac ccg aga gcc ggt ctg gac cct ccg gtg gcg gag gag gag gag 576
Glu Asn Pro Arg Ala Gly Leu Asp Pro Pro Val Ala Glu Glu Glu Glu
180 185 190
tagc 580
<210> 87
<211> 192
<212> PRT
<213> Simian adenovirus 38
<400> 87
Met Glu Ile Trp Thr Val Leu Glu Asp Phe His Gln Thr Arg Gln Leu
1 5 10 15
Leu Glu Asn Ser Ser Glu Glu Val Ser Tyr Leu Trp Arg Phe Cys Phe
20 25 30
Gly Gly Ala Leu Ala Lys Leu Val Tyr Arg Ala Lys Gln Asp Tyr Lys
35 40 45
Asp Gln Phe Glu Asp Ile Leu Arg Glu Cys Pro Gly Ile Phe Asp Ser
50 55 60
Leu Asn Leu Gly His Gln Ser His Phe Asn Gln Ser Ile Leu Arg Ala
65 70 75 80
Leu Asp Phe Ser Thr Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe
85 90 95
Ala Phe Ile Leu Asp Lys Trp Ser Gln Glu Thr His Phe Ser Arg Asp
100 105 110
Tyr Arg Leu Asp Cys Leu Ala Val Ala Leu Trp Arg Thr Trp Arg Cys
115 120 125
Gln Arg Leu Asn Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro Val Asp
130 135 140
Thr Leu Arg Ile Leu Ser Leu Gln Ser Pro Gln Glu His Gln Arg Arg
145 150 155 160
Gln Gln Pro Gln Gln Glu Gln Gln Gln Glu Glu Glu Glu Asp Arg Glu
165 170 175
Glu Asn Pro Arg Ala Gly Leu Asp Pro Pro Val Ala Glu Glu Glu Glu
180 185 190
<210> 88
<211> 530
<212> DNA
<213> Simian adenovirus 38
<220>
<221> CDS
<222> (1)..(528)
<223> label=E3\gp19K
<400> 88
atg ggg aag atc acc cta gtg agc tgc ggt gcg ctg gtg gcg gtg gtg 48
Met Gly Lys Ile Thr Leu Val Ser Cys Gly Ala Leu Val Ala Val Val
1 5 10 15
ctt tcg att gtg gga ctg ggc ggc gcg gct gta gtg aag gag aag gcc 96
Leu Ser Ile Val Gly Leu Gly Gly Ala Ala Val Val Lys Glu Lys Ala
20 25 30
gat ccc tgc ttg cat ttc aat ccc gac aaa tgc cag ctg agt ttt cag 144
Asp Pro Cys Leu His Phe Asn Pro Asp Lys Cys Gln Leu Ser Phe Gln
35 40 45
ccc gat ggc aat cgg tgc gcg gtg ctg atc aag tgc gga tgg gaa tgc 192
Pro Asp Gly Asn Arg Cys Ala Val Leu Ile Lys Cys Gly Trp Glu Cys
50 55 60
gag aac gtg aga atc gag tac aat aac aag act cgg aac aat act ctc 240
Glu Asn Val Arg Ile Glu Tyr Asn Asn Lys Thr Arg Asn Asn Thr Leu
65 70 75 80
gcg tcc gtg tgg cag ccc ggg gac ccc gag tgg tac acc gtc tct gtc 288
Ala Ser Val Trp Gln Pro Gly Asp Pro Glu Trp Tyr Thr Val Ser Val
85 90 95
ccc ggt gct gac ggc tcc ccg cgc acc gtg aac aat act ttc att ttt 336
Pro Gly Ala Asp Gly Ser Pro Arg Thr Val Asn Asn Thr Phe Ile Phe
100 105 110
gca cac atg tgc gac acg gtc atg tgg atg agc aag cag tac gat atg 384
Ala His Met Cys Asp Thr Val Met Trp Met Ser Lys Gln Tyr Asp Met
115 120 125
tgg ccc ccc acg aag gag aac atc gtg gtc ttc tcc atc gct tac agc 432
Trp Pro Pro Thr Lys Glu Asn Ile Val Val Phe Ser Ile Ala Tyr Ser
130 135 140
ctg tgc acg gcg cta atc acc gct atc gtg tgc ctg agc att cac atg 480
Leu Cys Thr Ala Leu Ile Thr Ala Ile Val Cys Leu Ser Ile His Met
145 150 155 160
ctc atc gct att cgc ccc aga aat aat gcc gaa aaa gag aaa cag cca 528
Leu Ile Ala Ile Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys Gln Pro
165 170 175
ta 530
<210> 89
<211> 176
<212> PRT
<213> Simian adenovirus 38
<400> 89
Met Gly Lys Ile Thr Leu Val Ser Cys Gly Ala Leu Val Ala Val Val
1 5 10 15
Leu Ser Ile Val Gly Leu Gly Gly Ala Ala Val Val Lys Glu Lys Ala
20 25 30
Asp Pro Cys Leu His Phe Asn Pro Asp Lys Cys Gln Leu Ser Phe Gln
35 40 45
Pro Asp Gly Asn Arg Cys Ala Val Leu Ile Lys Cys Gly Trp Glu Cys
50 55 60
Glu Asn Val Arg Ile Glu Tyr Asn Asn Lys Thr Arg Asn Asn Thr Leu
65 70 75 80
Ala Ser Val Trp Gln Pro Gly Asp Pro Glu Trp Tyr Thr Val Ser Val
85 90 95
Pro Gly Ala Asp Gly Ser Pro Arg Thr Val Asn Asn Thr Phe Ile Phe
100 105 110
Ala His Met Cys Asp Thr Val Met Trp Met Ser Lys Gln Tyr Asp Met
115 120 125
Trp Pro Pro Thr Lys Glu Asn Ile Val Val Phe Ser Ile Ala Tyr Ser
130 135 140
Leu Cys Thr Ala Leu Ile Thr Ala Ile Val Cys Leu Ser Ile His Met
145 150 155 160
Leu Ile Ala Ile Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys Gln Pro
165 170 175
<210> 90
<211> 5900
<212> DNA
<213> Simian adenovirus 38
<220>
<221> CDS
<222> (4)..(573)
<223> label=22K
<220>
<221> CDS
<222> (1880)..(2506)
<223> label=E3\CR1\alpha
<220>
<221> CDS
<222> (5465)..(5893)
<223> label=E3\RID\beta
<400> 90
agg atg ccc cga gga agc agc aag aag ctg aaa gtg gag ctg ccg ccg 48
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro Pro
1 5 10 15
gag gat ttg gag gaa gac tgg gag agc agt cag gca gag gag gag atg 96
Glu Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu Glu Glu Met
20 25 30
gaa gac tgg gac agc act cag gca gag gag gac agc ctg caa gac agt 144
Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp Ser
35 40 45
ctg gag gag gaa gac gag gtg gag gag gag gca gag gaa gaa gca gcc 192
Leu Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu Ala Ala
50 55 60
gcc gcc aga ccg tcg tcc tcg gcg gag gag aaa gca agc agc acg gat 240
Ala Ala Arg Pro Ser Ser Ser Ala Glu Glu Lys Ala Ser Ser Thr Asp
65 70 75
acc atc tcc gct ccg ggt cgg ggt cgc ggc ggt cgg gcc cac agt aga 288
Thr Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser Arg
80 85 90 95
tgg gac gag acc ggg cgc ttc ccg aac ccc acc acc cag acc ggt aag 336
Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Gly Lys
100 105 110
aag gag cgg cag gga tac aag tcc tgg cgg ggg cac aaa aac gcc atc 384
Lys Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly His Lys Asn Ala Ile
115 120 125
gtc tcc tgc ttg caa gcc tgc ggg ggc aac atc tcc ttc acc cgg cgc 432
Val Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile Ser Phe Thr Arg Arg
130 135 140
tac ctg ctc ttc cac cgc ggg gtg aac ttc ccc cgc aac atc ttg cat 480
Tyr Leu Leu Phe His Arg Gly Val Asn Phe Pro Arg Asn Ile Leu His
145 150 155
tac tac cgt cac ctc cac agc ccc tac tac tgt ttc caa gaa gag gca 528
Tyr Tyr Arg His Leu His Ser Pro Tyr Tyr Cys Phe Gln Glu Glu Ala
160 165 170 175
gaa acc cag cag cag cag aaa acc agc ggc agc agc agc agc agc 573
Glu Thr Gln Gln Gln Gln Lys Thr Ser Gly Ser Ser Ser Ser Ser
180 185 190
tagaaaatcc acagcggcgg cggcggcagg tggactgagg atcgcggcga acgagccggc 633
gcagacccgg gagctgagga accggatctt tcccaccctc tatgccatct tccagcagag 693
tcgggggcag gagcaggaac tgaaagtcaa gaaccgttct ctgcgctcgc tcacccgcag 753
ttgtctgtat cacaagagcg aagaccaact tcagcgcact ctcgaggacg ccgaggctct 813
cttcaacaag tactgcgcgc tcactcttaa agagtagccc gcgcccgccc acacacggaa 873
aaaggcggga attacgtcac cacctgcgcc cttcgcccga ccatcatcat gagcaaagag 933
attcccacgc cttacatgtg gagctaccag ccccagatgg gcctggccgc cggcgccgcc 993
caggactact ccacccgcat gaactggctc agtgccgggc ccgcgatgat ctcacgggtg 1053
aatgacatcc gcgcccaccg aaaccagata ctcctagaac agtcagcgat caccgccacg 1113
ccccgccatc accttaatcc gcgtaattgg cccgccgccc tggtgtacca ggaaattccc 1173
cagcccacga ccgtactact tccgcgagac gcccaggccg aagtccagct gactaactca 1233
ggtgtccagc tggccggcgg cgccgccctg tgtcgtcacc gccccgctca gggtataaag 1293
cggctggtga tccgaggcag aggcacacag ctcaacgacg aggtggtgag ctcttcgctg 1353
ggtctgcgac ctgacggagt cttccaactc gccggatcgg ggagatcttc cttcacgcct 1413
cgtcaggccg tcctgacttt ggagagttcg tcctcgcagc cccgctcggg cggcatcggc 1473
actctccagt tcgtggagga gttcactccc tcggtctact tcaacccctt ctccggctcc 1533
cccggccact acccggacga gttcatcccg aacttcgacg ccatcagcga gtcggtggac 1593
ggctacgatt gaatgtccca tggtggcgcg gctgacctag ctcggcttcg acacctggac 1653
cactgccgcc gcttccgctg cttcgctcgg gatctcgccg agtttgccta ctttgagctg 1713
cccgaggagc accctcaggg cccggcccac ggagtgcgga tcgtcgtcga agggggcctc 1773
gactcccacc tgcttcggat cttcagccag cgaccgatcc tggtcgagcg cgagcaagga 1833
cagacccgtc tgaccctgta ctgcatctgc aaccaccccg gcctgc atg aaa gtc 1888
Met Lys Val
ttt gtt gtc tgc tgt gta ctg agt ata ata aaa gct gag atc agc gac 1936
Phe Val Val Cys Cys Val Leu Ser Ile Ile Lys Ala Glu Ile Ser Asp
195 200 205
tac tcc gga ctc gat tgt ggt gtt cct gct atc aac cgg tcc ctg ttc 1984
Tyr Ser Gly Leu Asp Cys Gly Val Pro Ala Ile Asn Arg Ser Leu Phe
210 215 220 225
ttc acc ggg aac gag acc gag ctc cag ctc cag tgt aag ccc cac aag 2032
Phe Thr Gly Asn Glu Thr Glu Leu Gln Leu Gln Cys Lys Pro His Lys
230 235 240
aag tat ctc acc tgg ctg ttc cag ggc tcc ccg atc gcc gtt gtc aac 2080
Lys Tyr Leu Thr Trp Leu Phe Gln Gly Ser Pro Ile Ala Val Val Asn
245 250 255
cac tgc gac aac gac gga gtc ctg ctg agc ggc ccc gcc aac ctt act 2128
His Cys Asp Asn Asp Gly Val Leu Leu Ser Gly Pro Ala Asn Leu Thr
260 265 270
ttt tcc acc cgc aga agc aag ctc cag ctc ttc caa ccc ttc ctc ccc 2176
Phe Ser Thr Arg Arg Ser Lys Leu Gln Leu Phe Gln Pro Phe Leu Pro
275 280 285
ggg acc tat cag tgc ctc tcg gga ccc tgc cat cac acc ttc cac ctg 2224
Gly Thr Tyr Gln Cys Leu Ser Gly Pro Cys His His Thr Phe His Leu
290 295 300 305
atc ccg aat acc aca gcg gcg ctc ccc gct act aac aac caa act acc 2272
Ile Pro Asn Thr Thr Ala Ala Leu Pro Ala Thr Asn Asn Gln Thr Thr
310 315 320
cac caa cgc cac cgt cgc gac ctt tcc tct gaa tct aat act acc acc 2320
His Gln Arg His Arg Arg Asp Leu Ser Ser Glu Ser Asn Thr Thr Thr
325 330 335
cac acc gga ggt gag ctc cga ggt caa cca acc tct ggg att tac tac 2368
His Thr Gly Gly Glu Leu Arg Gly Gln Pro Thr Ser Gly Ile Tyr Tyr
340 345 350
ggc ccc tgg gag gtg gtg ggg tta ata gcg cta ggc cta gtt gcg ggt 2416
Gly Pro Trp Glu Val Val Gly Leu Ile Ala Leu Gly Leu Val Ala Gly
355 360 365
ggg ctt ttg gct ctc tgc tac cta tac ctc cct tgc tgt tct tac tta 2464
Gly Leu Leu Ala Leu Cys Tyr Leu Tyr Leu Pro Cys Cys Ser Tyr Leu
370 375 380 385
gtg gtg ctg tgt tgc tgg ttt aag aaa tgg gga aga tca ccc 2506
Val Val Leu Cys Cys Trp Phe Lys Lys Trp Gly Arg Ser Pro
390 395
tagtgagctg cggtgcgctg gtggcggtgg tgctttcgat tgtgggactg ggcggcgcgg 2566
ctgtagtgaa ggagaaggcc gatccctgct tgcatttcaa tcccgacaaa tgccagctga 2626
gttttcagcc cgatggcaat cggtgcgcgg tgctgatcaa gtgcggatgg gaatgcgaga 2686
acgtgagaat cgagtacaat aacaagactc ggaacaatac tctcgcgtcc gtgtggcagc 2746
ccggggaccc cgagtggtac accgtctctg tccccggtgc tgacggctcc ccgcgcaccg 2806
tgaacaatac tttcattttt gcacacatgt gcgacacggt catgtggatg agcaagcagt 2866
acgatatgtg gccccccacg aaggagaaca tcgtggtctt ctccatcgct tacagcctgt 2926
gcacggcgct aatcaccgct atcgtgtgcc tgagcattca catgctcatc gctattcgcc 2986
ccagaaataa tgccgaaaaa gagaaacagc cataacacgt tttttcacac acctttttca 3046
gaccatggcc tctgttaaat tttttgcttt atttgccagt ctcattaccg tcattcatgg 3106
aatgagtaat gagaaaatta ctatttacac tggcactaat cacacattga aaggtccaga 3166
aaaagccaca gaagtttcat ggtattgtta ttttaatgaa tcagatgtag ctactgaact 3226
ctgtggaaac aacaacaaaa aaaatgagag cattactctc atcaagtttc aatgtggatc 3286
tgacttaacc ctcattaaca tcactagaga ctatgtaggt atgtattatg gaactacagc 3346
aggcatttcg gacatggaat tttatcaagt ttctgtgtct gaacccacca cgcctagaat 3406
gaccacaacc acaaaaacta cacctactac caccacacag ctcactacca atggcttttt 3466
tgccatgctt caagtggctg aaaatagcac cagcattcaa cccaccccac ccagtgagga 3526
aattcccaaa tccatgattg gcattattgt tgctgtagtg gtgtgcatgt tgatcatcgc 3586
cttgtgcatg gtgtactatg ccttctgcta cagaaagcac agactgaacg acaagctgga 3646
acacttacta agtgttgaat tttaattttt tagaaccatg aagatcctat gccttttagt 3706
tttttatatc attacctctg ctctttgtga atcagtggat aaagatgtta ctattaccac 3766
tggttctaat tatacactga aaggaccacc ctcaggtatg ctttcgtggt attgctattt 3826
tggaactgac actgatcaaa ctgaattatg caattttcaa aaaggcaaaa cctcaaactc 3886
taaaatctct aattatcaat gcaatggcac tgatctgata ctactcaatg tcacgaaagc 3946
atatggtggc agttattcat gccctggaca aaacactgaa gaaatgattt tttacaaagt 4006
ggaagtggtt gatcccacta ctccacctcc acccgccaca actactcaca ccacacacac 4066
agaacaaagc acagcagagg cagcaaagtt agccttgcag gtccaagaca gttcatttgt 4126
tggcattacc cctacacctg atcagcggtg tccggggctg ctcgtcagcg gcattgtcgg 4186
tgtgctttcg ggattagcag tcataatcat ctgcatgttc atttttgctt gctgctatag 4246
aaggctttac cgacaaaaat cagacccact gctgaacctc tatgtttaat tttttccaga 4306
gccatgaagg cagttagcgc tctagttttt tgttctttga ttggcattgt ttttaatagt 4366
aaaattacca gagttagctt tattaaacat gttaatgtaa ctgaaggaga taacatcaca 4426
ctagcaggtg tagaaggtgc tcaaaacacc acctggacaa aataccatct aggatggaga 4486
gatatttgca cctggaatgt aacttattat tgcataggag ttaatcttac cattgttaac 4546
gctaaccaat ctcagaatgg gttaattaaa ggacagagtg ttagtgtgac cagtgatggg 4606
tactataccc agcatagttt taactacaac attactgtca taccactgcc tacgcctagc 4666
ccacctagca ctaccgcaca gacaaccaca tacagtacat caaatcagcc taccaccact 4726
acagcagcag aggttgccag ctcgtctggg gtccgagtgg catttttgat gttggcccca 4786
tctagcagtc ccactgctag taccaatgag cagactactg aatttttgtc cactgtcgag 4846
agccacacca cagctacctc cagtgccttc tctagcaccg ccaatctctc ctcgctttcc 4906
tctacaccaa tcagtcccgc tactactcct agccccgctc ctcttcccac tcccctgaag 4966
caaacagacg gcggcatgca atggcagatc accctgctca ttgtgatcgg gttggtcatc 5026
ctggccgtgt tgctatacta catcttctgc cgccgcattc ccaacgcgca ccgcaagccg 5086
gcctacaagc ccatcgttat cgggcagccg gagccgcttc aggtggaagg gggtctaagg 5146
aatcttctct tctcttttac agtatggtga ttgaactatg attcctagac aattcttgat 5206
cactattctt atctgcctcc tccaagtctg tgccaccctc gctctggtgg ccaacgccag 5266
tccagactgt attggaccct tcgcctccta cgtgctcttt gccttcgtca cctgcatctg 5326
ctgctgtagc atagtctgcc tgcttatcac cttcttccag ttcattgact ggatctttgt 5386
gcgcatcgcc tacctgcgcc accaccccca gtaccgcgac cagcgagtgg cgcagctgct 5446
caggctcctc tgataagc atg cgg gct ctg cta ctt ctc gcg ctt ctg ctg 5497
Met Arg Ala Leu Leu Leu Leu Ala Leu Leu Leu
400 405 410
tta gtg ctc ccc cgt ccc gtt gac ccc cgg ccc ccc act cag tcc ccc 5545
Leu Val Leu Pro Arg Pro Val Asp Pro Arg Pro Pro Thr Gln Ser Pro
415 420 425
gag gag gtc cgc aaa tgc aaa ttc caa gaa ccc tgg aaa ttc ctc aaa 5593
Glu Glu Val Arg Lys Cys Lys Phe Gln Glu Pro Trp Lys Phe Leu Lys
430 435 440
tgc tac cgc caa aaa tca gac atg cat ccc agc tgg atc atg atc att 5641
Cys Tyr Arg Gln Lys Ser Asp Met His Pro Ser Trp Ile Met Ile Ile
445 450 455
ggg atc gtg aac att ctg gcc tgc acc ctc atc tcc ttt gtg att tac 5689
Gly Ile Val Asn Ile Leu Ala Cys Thr Leu Ile Ser Phe Val Ile Tyr
460 465 470
ccc tgc ttt gac ttt ggt tgg aac tcg cca gag gcg ctc tat ctc ccg 5737
Pro Cys Phe Asp Phe Gly Trp Asn Ser Pro Glu Ala Leu Tyr Leu Pro
475 480 485 490
cct gaa cct gac aca cca cca cag caa cct cag gca cac gca cta cca 5785
Pro Glu Pro Asp Thr Pro Pro Gln Gln Pro Gln Ala His Ala Leu Pro
495 500 505
cca cca cag cct agg cca caa tac atg ccc ata tta gac tat gag gcc 5833
Pro Pro Gln Pro Arg Pro Gln Tyr Met Pro Ile Leu Asp Tyr Glu Ala
510 515 520
gag cca cag cga ccc atg ctc ccc gct att agt tac ttc aat cta acc 5881
Glu Pro Gln Arg Pro Met Leu Pro Ala Ile Ser Tyr Phe Asn Leu Thr
525 530 535
ggc gga gat gac tgaccca 5900
Gly Gly Asp Asp
540
<210> 91
<211> 190
<212> PRT
<213> Simian adenovirus 38
<400> 91
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro Pro Glu
1 5 10 15
Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu Glu Glu Met Glu
20 25 30
Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp Ser Leu
35 40 45
Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu Ala Ala Ala
50 55 60
Ala Arg Pro Ser Ser Ser Ala Glu Glu Lys Ala Ser Ser Thr Asp Thr
65 70 75 80
Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser Arg Trp
85 90 95
Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Gly Lys Lys
100 105 110
Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly His Lys Asn Ala Ile Val
115 120 125
Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile Ser Phe Thr Arg Arg Tyr
130 135 140
Leu Leu Phe His Arg Gly Val Asn Phe Pro Arg Asn Ile Leu His Tyr
145 150 155 160
Tyr Arg His Leu His Ser Pro Tyr Tyr Cys Phe Gln Glu Glu Ala Glu
165 170 175
Thr Gln Gln Gln Gln Lys Thr Ser Gly Ser Ser Ser Ser Ser
180 185 190
<210> 92
<211> 209
<212> PRT
<213> Simian adenovirus 38
<400> 92
Met Lys Val Phe Val Val Cys Cys Val Leu Ser Ile Ile Lys Ala Glu
1 5 10 15
Ile Ser Asp Tyr Ser Gly Leu Asp Cys Gly Val Pro Ala Ile Asn Arg
20 25 30
Ser Leu Phe Phe Thr Gly Asn Glu Thr Glu Leu Gln Leu Gln Cys Lys
35 40 45
Pro His Lys Lys Tyr Leu Thr Trp Leu Phe Gln Gly Ser Pro Ile Ala
50 55 60
Val Val Asn His Cys Asp Asn Asp Gly Val Leu Leu Ser Gly Pro Ala
65 70 75 80
Asn Leu Thr Phe Ser Thr Arg Arg Ser Lys Leu Gln Leu Phe Gln Pro
85 90 95
Phe Leu Pro Gly Thr Tyr Gln Cys Leu Ser Gly Pro Cys His His Thr
100 105 110
Phe His Leu Ile Pro Asn Thr Thr Ala Ala Leu Pro Ala Thr Asn Asn
115 120 125
Gln Thr Thr His Gln Arg His Arg Arg Asp Leu Ser Ser Glu Ser Asn
130 135 140
Thr Thr Thr His Thr Gly Gly Glu Leu Arg Gly Gln Pro Thr Ser Gly
145 150 155 160
Ile Tyr Tyr Gly Pro Trp Glu Val Val Gly Leu Ile Ala Leu Gly Leu
165 170 175
Val Ala Gly Gly Leu Leu Ala Leu Cys Tyr Leu Tyr Leu Pro Cys Cys
180 185 190
Ser Tyr Leu Val Val Leu Cys Cys Trp Phe Lys Lys Trp Gly Arg Ser
195 200 205
Pro
<210> 93
<211> 143
<212> PRT
<213> Simian adenovirus 38
<400> 93
Met Arg Ala Leu Leu Leu Leu Ala Leu Leu Leu Leu Val Leu Pro Arg
1 5 10 15
Pro Val Asp Pro Arg Pro Pro Thr Gln Ser Pro Glu Glu Val Arg Lys
20 25 30
Cys Lys Phe Gln Glu Pro Trp Lys Phe Leu Lys Cys Tyr Arg Gln Lys
35 40 45
Ser Asp Met His Pro Ser Trp Ile Met Ile Ile Gly Ile Val Asn Ile
50 55 60
Leu Ala Cys Thr Leu Ile Ser Phe Val Ile Tyr Pro Cys Phe Asp Phe
65 70 75 80
Gly Trp Asn Ser Pro Glu Ala Leu Tyr Leu Pro Pro Glu Pro Asp Thr
85 90 95
Pro Pro Gln Gln Pro Gln Ala His Ala Leu Pro Pro Pro Gln Pro Arg
100 105 110
Pro Gln Tyr Met Pro Ile Leu Asp Tyr Glu Ala Glu Pro Gln Arg Pro
115 120 125
Met Leu Pro Ala Ile Ser Tyr Phe Asn Leu Thr Gly Gly Asp Asp
130 135 140
<210> 94
<211> 1440
<212> DNA
<213> Simian adenovirus 38
<220>
<221> CDS
<222> (577)..(1144)
<223> label=Ela\13S
<220>
<221> CDS
<222> (1230)..(1435)
<223> label=Ela\13S
<400> 94
catcatcaaa taatatacct taaacttttg gtgcgcgtta atatgcaaat gaggtatttg 60
aatttgggga tgcggggcgg tgattggctg cgggagcggc gaccgttagg ggcggggcgg 120
gtgacgtttt gatgacgtgt ttgtgaggcg gagccggttt gcaagttctc gtgggaaaag 180
tgacgtcaaa cgaggtgtgg tttgaacacg gaaatactca attttcccgc gctctctgac 240
aggaaatgag gtgtttctgg gcggatgcaa gtgaaaacgg gccattttcg cgcgaaaact 300
gaatgaggaa gtgaaaatct gagtaatttc gcgtttatgg cagggaggag tatttgccga 360
gggccgagta gactttgacc gattacgtgg gggtttcgat taccgtattt ttcacctaaa 420
tttccgcgta cggtgtcaaa gtccggtgtt tttacgtagg tgtcagctga tcgccagggt 480
atttaaacct gcgctctcta gtcaagaggc cactcttgag tgccagcgag tagagttttc 540
tcctccgcgc cgcgagtcag atctacactt tgaaag atg agg cac ctg aga gac 594
Met Arg His Leu Arg Asp
1 5
ctg ccc ggt aat gtt ttc ctg gct act ggg aac gag att ctg gaa ctg 642
Leu Pro Gly Asn Val Phe Leu Ala Thr Gly Asn Glu Ile Leu Glu Leu
10 15 20
gtg gtg gac gcc atg atg ggt gac gac cct ccg gag ccc cct acc cca 690
Val Val Asp Ala Met Met Gly Asp Asp Pro Pro Glu Pro Pro Thr Pro
25 30 35
ttt gag gcg cct tcg ctg tac gat ttg tat gat ctg gag gtg gat gtg 738
Phe Glu Ala Pro Ser Leu Tyr Asp Leu Tyr Asp Leu Glu Val Asp Val
40 45 50
ccc gag aac gac ccc aac gag gag gcg gtg aat gat ttg ttt agc gat 786
Pro Glu Asn Asp Pro Asn Glu Glu Ala Val Asn Asp Leu Phe Ser Asp
55 60 65 70
gcc gcg ctg ctg gct gcc gag cag gct aat acg gac tct ggc tca gac 834
Ala Ala Leu Leu Ala Ala Glu Gln Ala Asn Thr Asp Ser Gly Ser Asp
75 80 85
agc gat tcc tct ctc cat acc ccg aga ccc ggc aga ggt gag aaa aag 882
Ser Asp Ser Ser Leu His Thr Pro Arg Pro Gly Arg Gly Glu Lys Lys
90 95 100
atc ccc gag ctt aaa ggg gaa gag ctc gac ctg cgc tgc tat gag gaa 930
Ile Pro Glu Leu Lys Gly Glu Glu Leu Asp Leu Arg Cys Tyr Glu Glu
105 110 115
tgc ttg cct ccg agc gat gat gag gag gac gag gag gcg att cga gct 978
Cys Leu Pro Pro Ser Asp Asp Glu Glu Asp Glu Glu Ala Ile Arg Ala
120 125 130
gca gcg aac cag gga gtg aaa gcg gcg ggc gag ggc ttt agc ctg gac 1026
Ala Ala Asn Gln Gly Val Lys Ala Ala Gly Glu Gly Phe Ser Leu Asp
135 140 145 150
tgt cct act ctg ccc gga cac ggc tgt aag tct tgt gaa ttt cat cgc 1074
Cys Pro Thr Leu Pro Gly His Gly Cys Lys Ser Cys Glu Phe His Arg
155 160 165
atg aat act gga gat aag aat gtg atg tgt gcc ctg tgc tat atg aga 1122
Met Asn Thr Gly Asp Lys Asn Val Met Cys Ala Leu Cys Tyr Met Arg
170 175 180
gct tac aac cat tgt gtt tac a gtaagtgtga ttaactttag ttgggaaggc 1174
Ala Tyr Asn His Cys Val Tyr
185
agagggtgac tgggtgctga ctggtttatt tatgtatatg tttttttatg tgtag gt 1231
Ser
ccc gtc tct gac gta gat gag acc ccc act tca gag tgc att tca tca 1279
Pro Val Ser Asp Val Asp Glu Thr Pro Thr Ser Glu Cys Ile Ser Ser
195 200 205
ccc cca gaa att ggc gag gaa ccg ccc gaa gat att att cat aga cca 1327
Pro Pro Glu Ile Gly Glu Glu Pro Pro Glu Asp Ile Ile His Arg Pro
210 215 220
gtt gca gtg aga gtc acc ggg cgg aga gca gct gtg gag agt ttg gat 1375
Val Ala Val Arg Val Thr Gly Arg Arg Ala Ala Val Glu Ser Leu Asp
225 230 235
gac ttg cta cag ggt ggg gat gaa cct ttg gac ttg tgt acc cgg aaa 1423
Asp Leu Leu Gln Gly Gly Asp Glu Pro Leu Asp Leu Cys Thr Arg Lys
240 245 250
cgc ccc agg cac taagt 1440
Arg Pro Arg His
255
<210> 95
<211> 258
<212> PRT
<213> Simian adenovirus 38
<400> 95
Met Arg His Leu Arg Asp Leu Pro Gly Asn Val Phe Leu Ala Thr Gly
1 5 10 15
Asn Glu Ile Leu Glu Leu Val Val Asp Ala Met Met Gly Asp Asp Pro
20 25 30
Pro Glu Pro Pro Thr Pro Phe Glu Ala Pro Ser Leu Tyr Asp Leu Tyr
35 40 45
Asp Leu Glu Val Asp Val Pro Glu Asn Asp Pro Asn Glu Glu Ala Val
50 55 60
Asn Asp Leu Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Gln Ala Asn
65 70 75 80
Thr Asp Ser Gly Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg Pro
85 90 95
Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu Leu Asp
100 105 110
Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu Glu Asp
115 120 125
Glu Glu Ala Ile Arg Ala Ala Ala Asn Gln Gly Val Lys Ala Ala Gly
130 135 140
Glu Gly Phe Ser Leu Asp Cys Pro Thr Leu Pro Gly His Gly Cys Lys
145 150 155 160
Ser Cys Glu Phe His Arg Met Asn Thr Gly Asp Lys Asn Val Met Cys
165 170 175
Ala Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr Ser Pro Val
180 185 190
Ser Asp Val Asp Glu Thr Pro Thr Ser Glu Cys Ile Ser Ser Pro Pro
195 200 205
Glu Ile Gly Glu Glu Pro Pro Glu Asp Ile Ile His Arg Pro Val Ala
210 215 220
Val Arg Val Thr Gly Arg Arg Ala Ala Val Glu Ser Leu Asp Asp Leu
225 230 235 240
Leu Gln Gly Gly Asp Glu Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro
245 250 255
Arg His
<210> 96
<211> 850
<212> DNA
<213> Simian adenovirus 38
<220>
<221> CDS
<222> (4)..(331)
<223> label=33K
<220>
<221> CDS
<222> (501)..(847)
<223> label=33K
<400> 96
agg atg ccc cga gga agc agc aag aag ctg aaa gtg gag ctg ccg ccg 48
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro Pro
1 5 10 15
gag gat ttg gag gaa gac tgg gag agc agt cag gca gag gag gag atg 96
Glu Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu Glu Glu Met
20 25 30
gaa gac tgg gac agc act cag gca gag gag gac agc ctg caa gac agt 144
Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp Ser
35 40 45
ctg gag gag gaa gac gag gtg gag gag gag gca gag gaa gaa gca gcc 192
Leu Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu Ala Ala
50 55 60
gcc gcc aga ccg tcg tcc tcg gcg gag gag aaa gca agc agc acg gat 240
Ala Ala Arg Pro Ser Ser Ser Ala Glu Glu Lys Ala Ser Ser Thr Asp
65 70 75
acc atc tcc gct ccg ggt cgg ggt cgc ggc ggt cgg gcc cac agt aga 288
Thr Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser Arg
80 85 90 95
tgg gac gag acc ggg cgc ttc ccg aac ccc acc acc cag acc g 331
Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr
100 105
gtaagaagga gcggcaggga tacaagtcct ggcgggggca caaaaacgcc atcgtctcct 391
gcttgcaagc ctgcgggggc aacatctcct tcacccggcg ctacctgctc ttccaccgcg 451
gggtgaactt cccccgcaac atcttgcatt actaccgtca cctccacag cc cct act 508
Ala Pro Thr
act gtt tcc aag aag agg cag aaa ccc agc agc agc aga aaa cca gcg 556
Thr Val Ser Lys Lys Arg Gln Lys Pro Ser Ser Ser Arg Lys Pro Ala
115 120 125
gca gca gca gca gca gct aga aaa tcc aca gcg gcg gcg gcg gca ggt 604
Ala Ala Ala Ala Ala Ala Arg Lys Ser Thr Ala Ala Ala Ala Ala Gly
130 135 140
gga ctg agg atc gcg gcg aac gag ccg gcg cag acc cgg gag ctg agg 652
Gly Leu Arg Ile Ala Ala Asn Glu Pro Ala Gln Thr Arg Glu Leu Arg
145 150 155 160
aac cgg atc ttt ccc acc ctc tat gcc atc ttc cag cag agt cgg ggg 700
Asn Arg Ile Phe Pro Thr Leu Tyr Ala Ile Phe Gln Gln Ser Arg Gly
165 170 175
cag gag cag gaa ctg aaa gtc aag aac cgt tct ctg cgc tcg ctc acc 748
Gln Glu Gln Glu Leu Lys Val Lys Asn Arg Ser Leu Arg Ser Leu Thr
180 185 190
cgc agt tgt ctg tat cac aag agc gaa gac caa ctt cag cgc act ctc 796
Arg Ser Cys Leu Tyr His Lys Ser Glu Asp Gln Leu Gln Arg Thr Leu
195 200 205
gag gac gcc gag gct ctc ttc aac aag tac tgc gcg ctc act ctt aaa 844
Glu Asp Ala Glu Ala Leu Phe Asn Lys Tyr Cys Ala Leu Thr Leu Lys
210 215 220
gag tag 850
Glu
225
<210> 97
<211> 225
<212> PRT
<213> Simian adenovirus 38
<400> 97
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro Pro Glu
1 5 10 15
Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu Glu Glu Met Glu
20 25 30
Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp Ser Leu
35 40 45
Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu Ala Ala Ala
50 55 60
Ala Arg Pro Ser Ser Ser Ala Glu Glu Lys Ala Ser Ser Thr Asp Thr
65 70 75 80
Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser Arg Trp
85 90 95
Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Ala Pro Thr
100 105 110
Thr Val Ser Lys Lys Arg Gln Lys Pro Ser Ser Ser Arg Lys Pro Ala
115 120 125
Ala Ala Ala Ala Ala Ala Arg Lys Ser Thr Ala Ala Ala Ala Ala Gly
130 135 140
Gly Leu Arg Ile Ala Ala Asn Glu Pro Ala Gln Thr Arg Glu Leu Arg
145 150 155 160
Asn Arg Ile Phe Pro Thr Leu Tyr Ala Ile Phe Gln Gln Ser Arg Gly
165 170 175
Gln Glu Gln Glu Leu Lys Val Lys Asn Arg Ser Leu Arg Ser Leu Thr
180 185 190
Arg Ser Cys Leu Tyr His Lys Ser Glu Asp Gln Leu Gln Arg Thr Leu
195 200 205
Glu Asp Ala Glu Ala Leu Phe Asn Lys Tyr Cys Ala Leu Thr Leu Lys
210 215 220
Glu
225
<210> 98
<211> 36621
<212> DNA
<213> Simian adenovirus 30
<220>
<221> repeat_region
<222> (1)..(126)
<223> label=ITR
<220>
<221> CDS
<222> (1906)..(3414)
<223> label=Elb\55K
<220>
<221> CDS
<222> (3452)..(3922)
<223> label=pIX
<220>
<221> misc_feature
<222> (3988)..(5609)
<223> complement(3988..5318,5597..5609) label=IVa2
<220>
<221> misc_feature
<222> (5091)..(13843)
<223> complement(5091..8663,13835..13843) label=pol
<220>
<221> misc_feature
<222> (8465)..(13843)
<223> complement(8465..10399,13835..13843) label=pTP
<220>
<221> CDS
<222> (10826)..(12001)
<223> label=52K
<220>
<221> CDS
<222> (12028)..(13806)
<223> label=pIIIa
<220>
<221> CDS
<222> (13888)..(15486)
<223> label=penton
<220>
<221> CDS
<222> (15493)..(16071)
<223> label=pVII
<220>
<221> CDS
<222> (16116)..(17153)
<223> label=V
<220>
<221> CDS
<222> (17177)..(17407)
<223> label=pX
<220>
<221> CDS
<222> (17442)..(18218)
<223> label=pVI
<220>
<221> CDS
<222> (18328)..(21141)
<223> label=hexon
<220>
<221> CDS
<222> (21160)..(21786)
<223> label=protease
<220>
<221> misc_feature
<222> (21871)..(23403)
<223> complement label=DBP
<220>
<221> CDS
<222> (23426)..(25828)
<223> label=100K
<220>
<221> CDS
<222> (26451)..(27131)
<223> label=pVIII
<220>
<221> CDS
<222> (27409)..(28032)
<223> label=E3\CR1\alpha
<220>
<221> CDS
<222> (28581)..(29264)
<223> label=E3\CR1\beta
<220>
<221> CDS
<222> (29280)..(29888)
<223> label=E3\CR1\gamma
<220>
<221> CDS
<222> (29906)..(30769)
<223> label=E3\CR1\delta
<220>
<221> CDS
<222> (30780)..(31052)
<223> label=E3\RID\alpha
<220>
<221> CDS
<222> (31061)..(31492)
<223> label=E3\RID\beta
<220>
<221> CDS
<222> (32189)..(33523)
<223> label=fiber
<220>
<221> misc_feature
<222> (33628)..(34772)
<223> complement(33628..33876,34623..34772) label=E4\orf6/7
<220>
<221> misc_feature
<222> (33876)..(34772)
<223> complement label=E4\orf6
<220>
<221> misc_feature
<222> (34678)..(35043)
<223> complement label=E4\orf4
<220>
<221> misc_feature
<222> (35055)..(35405)
<223> complement label=E4\orf3
<220>
<221> misc_feature
<222> (35405)..(35791)
<223> complement label=E4\orf2
<220>
<221> misc_feature
<222> (35844)..(36215)
<223> complement label=E4\orf1
<220>
<221> repeat_region
<222> (36496)..(36621)
<223> label=ITR
<400> 98
catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg agccgtttga 60
atttggggat ggaggaaggt gattggctgt gggagcggcg accgttaggg gcggggcggg 120
tgacgttttg atgacgtggc catgaggcgg agccggtttg caagttctcg tgggaaaagt 180
gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240
ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300
aatgaggaag tgaaaatctg agtaattccg cgtttatggc agggaggagt atttgccgag 360
ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420
ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggt gtcagctgat cgccagggta 480
tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540
cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600
gtaatgtttt cctggctact gggaacgaga ttctggaact ggtggtggac gccatgatgg 660
gtgacgaccc tcccgagccc cctaccccat ttgaggcgcc ttcgctgtac gatttgtatg 720
atctggaggt ggatgtgtcc gagaacgacc ccaacgagga ggcggtgaat gatttgttta 780
gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840
cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900
aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960
aggaggcgat tcgagctgca gcgagcgagg gagtgaaagt tgcgggcgag agctttagcc 1020
tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080
ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140
acagtaagtg tgattaactt tagttgggaa aggcagaggg tgactgggtg ctgactggtt 1200
tatttatgta tatgtttttt atgtgtaggt cccgtctctg acgcagatga gacccccact 1260
tcagagtgca tttcatcacc cccagaaatt ggcgaggaac cgcccgaaga tattattcat 1320
agaccagttg cagtgagagt caccgggcgg agagcagctg tggagagttt ggatgacttg 1380
ctacagggtg gggatgaacc tttggacttg tgtacccgga aacgccccag gcactaagtg 1440
ccacacatgt gtgtttactt aaggtgatgt cagtatttat agggtgtgga gtgcaataaa 1500
aatatgtgtt gactttaagt gcgtgtttta tgactcaggg gtggggactg tgggtatata 1560
agcaggtgca gacctgtgtg gtcagttcag agcaggactc atggagatct ggacagtctt 1620
ggaagacttt caccagacta gacagctgct agagaactca tcggagggag tctcttacct 1680
gtggagattc tgcttcgctg ggcctctagc taagctagtc tatagggcca agcaggatta 1740
tagggaacaa tttgaggata ttttgagaga gtgtcctggt atttttgact ctctcaactt 1800
gggccatcag tctcacttta accagagtat tctgagagcc cttgactttt ctactcctgg 1860
cagaactacc gccgcggtag ccttttttgc ctttatcctt gacaa atg gag tca aga 1917
Met Glu Ser Arg
1
aac cca ttt cag cag gga tta ccg tct gga ctg ctt agc agt agc ttt 1965
Asn Pro Phe Gln Gln Gly Leu Pro Ser Gly Leu Leu Ser Ser Ser Phe
5 10 15 20
gtg gag aac atg gag gtg cca gcg cct gaa tgc aat ctc cgg cta ctt 2013
Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn Leu Arg Leu Leu
25 30 35
gcc agt aca gcc ggt aga cac gct gag gat cct gag tct cca gtc acc 2061
Ala Ser Thr Ala Gly Arg His Ala Glu Asp Pro Glu Ser Pro Val Thr
40 45 50
cca gga aca cca acg ccg cca gca gcc gca gca gga gca gca gca aga 2109
Pro Gly Thr Pro Thr Pro Pro Ala Ala Ala Ala Gly Ala Ala Ala Arg
55 60 65
gga gga gga ccg aga aga gaa ccc gag agc cgg tct gga ccc tcc ggt 2157
Gly Gly Gly Pro Arg Arg Glu Pro Glu Ser Arg Ser Gly Pro Ser Gly
70 75 80
ggc gga gga gga gga gta gct gac ttg ttt ccc gag ctg cgc cgg gtg 2205
Gly Gly Gly Gly Gly Val Ala Asp Leu Phe Pro Glu Leu Arg Arg Val
85 90 95 100
ctg act agg tct tcc agt gga cgg gag agg ggg att aag cgg gag agg 2253
Leu Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly Ile Lys Arg Glu Arg
105 110 115
cat gag gag act agc cac aga act gaa ctg act gtc agt ctg atg agc 2301
His Glu Glu Thr Ser His Arg Thr Glu Leu Thr Val Ser Leu Met Ser
120 125 130
cgc agg cgc cca gaa tcg gtg tgg tgg cat gag gtt cag tcg cag ggg 2349
Arg Arg Arg Pro Glu Ser Val Trp Trp His Glu Val Gln Ser Gln Gly
135 140 145
gta gat gag gtc tcg gtg atg cat gag aaa tat tcc cta gaa caa gtc 2397
Val Asp Glu Val Ser Val Met His Glu Lys Tyr Ser Leu Glu Gln Val
150 155 160
aag act tgt tgg ttg gag ccc gag gat gat tgg gag gta gcc atc agg 2445
Lys Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp Glu Val Ala Ile Arg
165 170 175 180
aat tat gcc aag ctg gct ctg agg cca gac aag aag tac aag att acc 2493
Asn Tyr Ala Lys Leu Ala Leu Arg Pro Asp Lys Lys Tyr Lys Ile Thr
185 190 195
aaa ctg att aat atc aga aat tcc tgc tac att tca ggg aat ggg gcc 2541
Lys Leu Ile Asn Ile Arg Asn Ser Cys Tyr Ile Ser Gly Asn Gly Ala
200 205 210
gag gtg gag atc agt acc cag gag agg gtg gct ttc aga tgc tgc atg 2589
Glu Val Glu Ile Ser Thr Gln Glu Arg Val Ala Phe Arg Cys Cys Met
215 220 225
atg aat atg tac ccg ggg gtg gtg ggc atg gag gga gtc acc ttt atg 2637
Met Asn Met Tyr Pro Gly Val Val Gly Met Glu Gly Val Thr Phe Met
230 235 240
aac gcg agg ttc agg ggt gat ggg tat aat ggg gtg gtc ttt atg gcc 2685
Asn Ala Arg Phe Arg Gly Asp Gly Tyr Asn Gly Val Val Phe Met Ala
245 250 255 260
aac acc aag ctg aca gtg cac gga tgc tcc ttc ttt ggc ttc aat aac 2733
Asn Thr Lys Leu Thr Val His Gly Cys Ser Phe Phe Gly Phe Asn Asn
265 270 275
atg tgc atc gag gcc tgg ggc agt gtt tca gtg agg gga tgc agc ttt 2781
Met Cys Ile Glu Ala Trp Gly Ser Val Ser Val Arg Gly Cys Ser Phe
280 285 290
tca gcc aac tgg atg ggg gtc gtg ggc aga acc aag agc aag gtg tca 2829
Ser Ala Asn Trp Met Gly Val Val Gly Arg Thr Lys Ser Lys Val Ser
295 300 305
gtg aag aaa tgc ctg ttc gag agg tgc cac ctg ggg gtg atg agc gag 2877
Val Lys Lys Cys Leu Phe Glu Arg Cys His Leu Gly Val Met Ser Glu
310 315 320
ggc gaa gcc aaa gtc aaa cac tgc gcc tct acc gag acg ggc tgc ttt 2925
Gly Glu Ala Lys Val Lys His Cys Ala Ser Thr Glu Thr Gly Cys Phe
325 330 335 340
gtg tgt atc aag ggc aat gcc caa gtc aag cat aac atg atc tgt ggg 2973
Val Cys Ile Lys Gly Asn Ala Gln Val Lys His Asn Met Ile Cys Gly
345 350 355
gcc tcg gat gag cgc ggc tac cag atg ctg acc tgc gcc ggt ggg aac 3021
Ala Ser Asp Glu Arg Gly Tyr Gln Met Leu Thr Cys Ala Gly Gly Asn
360 365 370
agc cat atg ctg gcc acc gtg cat gtg gcc tcg cac ccc cgc aag aca 3069
Ser His Met Leu Ala Thr Val His Val Ala Ser His Pro Arg Lys Thr
375 380 385
tgg ccc gag ttc gag cac aac gtc atg acc cgc tgc aat gtg cac ctg 3117
Trp Pro Glu Phe Glu His Asn Val Met Thr Arg Cys Asn Val His Leu
390 395 400
ggc tcc cgc cga ggc atg ttc atg cca tac cag tgc aac atg caa ttt 3165
Gly Ser Arg Arg Gly Met Phe Met Pro Tyr Gln Cys Asn Met Gln Phe
405 410 415 420
gtg aag gtg ctg ctg gag ccc gat gcc atg tcc aga gtg agc ctg gcg 3213
Val Lys Val Leu Leu Glu Pro Asp Ala Met Ser Arg Val Ser Leu Ala
425 430 435
ggg gtg ttt gac atg aat gtg gag ctg tgg aaa att ctg aga tat gat 3261
Gly Val Phe Asp Met Asn Val Glu Leu Trp Lys Ile Leu Arg Tyr Asp
440 445 450
gaa tcc aag acc agg tgc cgg gcc tgc gaa tgc gga ggc aag cac gcc 3309
Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys Gly Gly Lys His Ala
455 460 465
agg ctt cag ccc gtg tgt gtg gag gtg acg gag gac ctg cga ccc gat 3357
Arg Leu Gln Pro Val Cys Val Glu Val Thr Glu Asp Leu Arg Pro Asp
470 475 480
cat ttg gtg ttg tcc tgc aac ggg acg gag ttc ggc tcc agc ggg gaa 3405
His Leu Val Leu Ser Cys Asn Gly Thr Glu Phe Gly Ser Ser Gly Glu
485 490 495 500
gaa tct gac tagagtgagt agtgtttggg ggcgggtggg agcctgc atg agg ggc 3460
Glu Ser Asp Met Arg Gly
505
aga atg act aaa atc tgt gtt ttt ctg tgc agc agc atg agc gga agc 3508
Arg Met Thr Lys Ile Cys Val Phe Leu Cys Ser Ser Met Ser Gly Ser
510 515 520
gcc tcc ttt gag gga ggg gta ttc agc cct tat ctg acg ggg cgt ctc 3556
Ala Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu Thr Gly Arg Leu
525 530 535
ccc tcc tgg gcg gga gtg cgt cag aat gtg atg gga tct acg gtg gac 3604
Pro Ser Trp Ala Gly Val Arg Gln Asn Val Met Gly Ser Thr Val Asp
540 545 550
ggc cgg ccc gtg cag ccc gcg aac tct tca acc ctg acc tac gcg acc 3652
Gly Arg Pro Val Gln Pro Ala Asn Ser Ser Thr Leu Thr Tyr Ala Thr
555 560 565 570
ctg agc tcc tcg tcc gtg gac gca gct gcc gcc gca gct gct gct tcc 3700
Leu Ser Ser Ser Ser Val Asp Ala Ala Ala Ala Ala Ala Ala Ala Ser
575 580 585
gcc gcc agc gcc gtg cgc gga atg gcc ctg ggc gcc ggc tac tac agc 3748
Ala Ala Ser Ala Val Arg Gly Met Ala Leu Gly Ala Gly Tyr Tyr Ser
590 595 600
tct ctg gtg gcc aac tcg agt tcc acc aat aat ccc gcc agc ctg aac 3796
Ser Leu Val Ala Asn Ser Ser Ser Thr Asn Asn Pro Ala Ser Leu Asn
605 610 615
gag gag aag ctg ctg ctg ctg atg gcc cag ctc gag gcc ctg acc cag 3844
Glu Glu Lys Leu Leu Leu Leu Met Ala Gln Leu Glu Ala Leu Thr Gln
620 625 630
cgc ctg ggc gag ctg acc cag cag gtg gct cag ctg cag gcg gag acg 3892
Arg Leu Gly Glu Leu Thr Gln Gln Val Ala Gln Leu Gln Ala Glu Thr
635 640 645 650
cgg gcc gcg gtt gcc acg gtg aaa acc aaa taaaaaatga atcaataaat 3942
Arg Ala Ala Val Ala Thr Val Lys Thr Lys
655 660
aaacggagac ggttgttgat tttaacacag agtcttgaat ctttatttga tttttcgcgc 4002
gcggtaggcc ctggaccacc ggtctcgatc attgagcacc cggtggatct tttccaggac 4062
ccggtagagg tgggcttgga tgttgaggta catgggcatg agcccgtccc gggggtggag 4122
gtagctccat tgcagggcct cgtgctcggg ggtggtgttg taaatcaccc agtcatagca 4182
ggggcgcagt gcgtggtgct gcacgatgtc cttgaggagg agactgatgg ccacgggcag 4242
ccccttggtg taggtgttga cgaacctgtt gagctgggag ggatgcatgc ggggggagat 4302
gagatgcatc ttggcctgga tcttgagatt ggcgatgttc ccgcccagat cccgccgggg 4362
gttcatgttg tgcaggacca ccagcacggt gtatccggtg cacttgggga atttgtcatg 4422
caacttggaa gggaaggcgt gaaagaattt ggagacgccc ttgtgaccgc ccaggttttc 4482
catgcactca tccatgatga tggcgatggg cccgtgggcg gcggcctggg caaagacgtt 4542
tcgggggtcg gacacatcgt agttgtggtc ctgggtgagc tcgtcatagg ccattttaat 4602
gaatttgggg cggagggtgc ccgactgggg gacgaaggtg ccttcgatcc cgggggcgta 4662
gttgccctcg cagatctgca tctcccaggc cttgagctcg gaggggggga tcatgtccac 4722
ctgcggggcg atgaaaaaaa cggtttccgg ggcgggggag atgagctgcg ccgaaagcag 4782
gttccggagc agctgggact tgccgcagcc ggtggggccg tagatgaccc cgatgaccgg 4842
ctgcaggtgg tagttgaggg agagacagct gccgtcctcg cgtaggaggg gggccacctc 4902
gttcatcatc tcgcgcacat gcatgttctc gcgcacgagt tccgccagga ggcgctcgcc 4962
ccccagcgag aggagctctt gcagcgaggc gaagtttttc agcggcttga gcccgtcggc 5022
catgggcatt ttggagaggg tctgttgcaa gagttccaga cggtcccaga gctcggtgat 5082
gtgctctacg gcatctcgat ccagcagacc tcctcgtttc gcgggttggg acgactgcgg 5142
gagtagggca ccagacgatg ggcgtccagc gcagccaggg tccggtcctt ccagggtcgc 5202
agcgtccgcg tcagcgtggt ctccgtcacg gtgaaggggt gcgcgccggg ctgggcgctt 5262
gcgagggtgc gcttcaggct catccggctg gtcgagaacc gctcccgatc ggcgccctgc 5322
gcgtcggcca ggtagcaatt gaccatgagt tcgtagttga gcgcctcggc cgcgtggcct 5382
ttggcgcgga gcttaccttt ggaagtctgc ccgcaggcgg gacagaggag ggacttgagg 5442
gcgtagagct tgggggcgag gaagacggac tcgggggcgt aggcgtccgc gccgcagtgg 5502
gcgcagacgg tctcgcactc cacaagccag gtgaggtcgg ggcggtcggg gtcaaaaacg 5562
aggtttcctc cgtgcttttt gatgcgtttc ttacctctgg tctccatgag ctcgtgtccc 5622
cgctgggtga caaagaggct gtccgtgtcc ccgtagaccg actttatggg ccggtcctcg 5682
agcggggtgc cgcggtcctc gtcgtagagg aaccccgccc actccgagac gaaggcccgg 5742
gtccaggcca gcacgaagga ggccacgtgg gaggggtagc ggtcgttgtc caccagcggg 5802
tccaccttct ccagggtatg caagcacatg tccccctcgt ccacatccag gaaggtgatt 5862
ggcttgtaag tgtaggccac gtgaccgggg gtcccggccg ggggggtata aaagggggcg 5922
ggcccctgct cgtcctcact gtcttccgga tcgctgtcca ggagcgccag ctgttggggt 5982
aggtattccc tctcgaaggc gggcatgacc tcggcactca ggttgtcagt ttctagaaac 6042
gaggaggatt tgatattgac ggtgccgttg gagacgcctt tcatgagccc ctcgtccatc 6102
tggtcagaaa agacgatctt tttgttgtcg agcttggtgg cgaaggagcc gtagagggcg 6162
ttggagagca gcttggcgat ggagcgcatg gtctggttct tttccttgtc ggcgcgctcc 6222
ttggcggcga tgttgagctg cacgtactcg cgcgccacgc acttccattc ggggaagacg 6282
gtggtgagct cgtcgggcac gattctgacc cgccagccgc ggttgtgcag ggtgatgagg 6342
tccacgctgg tggccacctc gccgcgcagg ggctcgttgg tccagcagag gcgcccgccc 6402
ttgcgcgagc agaagggggg cagcgggtcc agcatgagct cgtcgggggg gtcggcgtcc 6462
acggtgaaga tgccgggcag gagctcgggg tcgaagtagc tgatgcaggt gcccagatcg 6522
tccagcgccg cttgccagtc gcgcacggcc agcgcgcgct cgtaggggct gaggggcgtg 6582
ccccagggca tggggtgcgt gagcgcggag gcgtacatgc cgcagatgtc gtagacgtag 6642
aggggctcct cgaggacgcc gatgtaggtg gggtagcagc gccccccgcg gatgctggcg 6702
cgcacgtagt cgtacagctc gtgcgagggc gcgaggagcc ccgcgccgag gttggagcgc 6762
tgcggctttt cggcgcggta gacgatctgg cggaagatgg cgtgggagtt ggaggagatg 6822
gtgggcctct ggaagatgtt gaagtgggcg tggggcaggc cgaccgagtc cctgatgaag 6882
tgggcgtagg agtcctgcag cttggcgacg agctcggcgg tgacgaggac gtccagggcg 6942
cagtagtcga gggtctcttg gatgatgtcg tacttgagct ggcccttctg cttccacagc 7002
tcgcggttga gaaggaactc ttcgcggtcc ttccagtact cttcgagggg gaacccgtcc 7062
tgatcggcac ggtaagagcc caccatgtag aactggttga cggccttgta ggcgcagcag 7122
cccttctcca cggggagggc ataagcttgc gcggccttgc gcagggaggt gtgggtgagg 7182
gcgaaggtgt cgcgcaccat gaccttgagg aactggtgct tgaagtcgag gtcgtcgcag 7242
ccgccctgct cccagagttg gaagtccgtg cgcttcttgt aggcggggtt gggcaaagcg 7302
aaagtaacat cgttgaagag gatcttgccc gcgcggggca tgaagttgcg agtgatgcgg 7362
aaaggctggg gcacctcggc ccggttgttg atgacctggg cggcgaggac gatctcgtcg 7422
aagccgttga tgttgtgccc gacgatgtag agttccacga atcgcgggcg gcccttgacg 7482
tggggcagct tcttgagctc gtcgtaggtg agctcggcgg ggtcgctgag gccgtgctgc 7542
tcaagggccc agtcggcgac gtgggggttg gcgctgagga aggaagtcca gagatccacg 7602
gccagggcgg tttgcaagcg gtcccggtac tgacggaact gctggcccac ggccattttt 7662
tcgggggtga tgcagtagaa ggtgcggggg tcgccgtgcc agcggtccca cttgagctgg 7722
agggcgaggt cgtgggcgag ctcgacgagc ggcgggtccc cggagagttt catgaccagc 7782
atgaagggga cgagctgctt gccgaaggac cccatccagg tgtaggtttc cacatcgtag 7842
gtgaggaaga gcctttcggt gcgaggatgc gagccgatgg ggaagaactg gatctcctgc 7902
caccagttgg aggaatggct gttgatgtga tggaagtaga aatgccgacg gcgcgccgag 7962
cactcgtgct tgtgtttata caagcgtccg cagtgctcgc aacgctgcac gggatgcacg 8022
tgctgcacga gctgtacctg ggttcctttg acgaggaatt tcagtgggca gtggagcgct 8082
ggcggctgca tctggtgctg tactacgtcc tggccatcgg cgtggccatc gtctgcctcg 8142
atggtggtca tgctgacgag cccgcgcggg aggcaggtcc agacctcggc tcggacgggt 8202
cggagagcga ggacgagggc gcgcaggccg gagctgtcca gggtcctgag acgctgcgga 8262
gtcaggtcag tgggcagcgg cggcgcgcgg ttgacttgca ggagcttttc cagggcgcgc 8322
gggaggtcca gatggtactt gatctccacg gcgccgttgg tggcgacgtc cacggcttgc 8382
agggtcccgt gcccctgggg cgccaccacc gtgccccgtt tcttcttggg cgctggcgtt 8442
ggcgctgctt ccatgtcggt cagaagcggc ggcgaggacg cgcgccgggc ggcaggggcg 8502
gctcggggcc cggaggcagg ggcggcaggg gcacgtcggc gccgcgcgcg ggcaggttct 8562
ggtactgcgc ccggagaaga ctggcgtgag cgacgacgcg acggttgacg tcctggatct 8622
gacgcctctg ggtgaaggcc acgggacccg tgagtttgaa cctgaaagag agttcgacag 8682
aatcaatctc ggtatcgttg acggcggcct gccgcaggat ctcttgcacg tcgcccgagt 8742
tgtcctggta ggcgatctcg gtcatgaact gctcgatctc ctcctcctga aggtctccgc 8802
ggccggcgcg ctcgacggtg gccgcgaggt cgttggagat gcggcccatg agctgcgaga 8862
aggcgttcat gccggcttcg ttccagacgc ggctgtagac cacggctccg tcggggtcgc 8922
gcgcgcgcat gaccacctgg gcgaggttga gctcgacgtg gcgcgtgaag accgcgtagt 8982
tgcagaggcg ctggtagagg tagttgagcg tggtggcgat gtgctcggtg acgaagaagt 9042
acatgatcca gcggcggagc ggcatctcgc tgacgtcgcc cagggcttcc aaacgttcca 9102
tggcctcgta aaagtccacg gcgaagttga aaaactggga gttgcgcgcc gagacggtca 9162
actcctcctc cagaagacgg atgagctcgg cgatggtggc gcgcacctcg cgctcgaagg 9222
cccccgggag ttcctccact tcctcttctt cttcctcctc cactaacatc tcttctactt 9282
cctcctcagg cggcagtggt ggcgggggag ggggcctgcg tcgccggcgg cgcacgggca 9342
gacggtcgat gaagcgctcg atggtctcgc cgcgccggcg tcgcatggtc tcggtgacgg 9402
cgcgcccgtc ctcgcggggc cgcagcgtga agacgccgcc gcgcatctcc aggtggccgg 9462
gggggtcccc gttgggcagg gagagggcgc tgacgatgca tcttatcaat tgccccgtag 9522
ggactccgcg caaggacctg agcgtctcga gatccacggg atctgaaaac cgttgaacga 9582
aggcttcgag ccagtcgcag tcgcaaggta ggctgagcac ggtttcttct ggcgggtcat 9642
gttggttgga gggagcgggg cgggcgatgc tgctggtgat gaagttgaaa taggcggttc 9702
tgagacggcg gatggtggcg aggagcacca ggtccttggg cccggcttgc tggatgcgca 9762
gacggtcggc catgccccag gcgtggtcct gacacctggc gaggtccttg tagtagtcct 9822
gcatgagccg ctccacgggc acctcctcct cgcccgcgcg gccgtgcatg cgcgtgagcc 9882
cgaacccgcg ctgcggctgg acgagcgcca ggtcggcgac gacgcgctcg gcgaggatgg 9942
cctgctggat ctgggtgagg gtggtctgga agtcgtcgaa gtcgacgaag cggtggtagg 10002
ctccggtgtt gatggtgtag gagcagttgg ccatgacgga ccagttgacg gtctggtggc 10062
ccggacgcac gagctcgtgg tacttgaggc gcgagtaggc gcgcgtgtcg aagatgtagt 10122
cgttgcaggt gcgcaccagg tactggtagc cgatgaggaa gtgcggcggc ggctggcggt 10182
agagcggcca tcgctcggtg gcgggggcgc cgggcgcgag gtcctcgagc atgaggcggt 10242
ggtagccgta gatgtacctg gacatccagg tgatgccggc ggcggtggtg gaggcgcgcg 10302
ggaactcgcg gacgcggttc cagatgttgc gcagcggcag gaagtagttc atggtgggca 10362
cggtctggcc cgtgaggcgc gcgcagtcgt ggatgctcta tacgggcaaa aacgaaagcg 10422
gtcagcggct cgactccgtg gcctggaggc taagcgaacg ggttgggctg cgcgtgtacc 10482
ccggttcgaa tctcgaatca ggctggagcc gcagctaacg tggtactggc actcccgtct 10542
cgacccaagc ctgcaccaac cctccaggat acggaggcgg gtcgttttgc aatttttttc 10602
ggaggccgga gactagtaag cgcggaaagc ggccgaccgc gatggctcgc tgccgtagtc 10662
tggagaagaa tcgccagggt tgcgttgcgg tgtgccccgg ttcgaggccg gccggattcc 10722
gcggctaacg agggcgtggc tgccccgtcg tttccaagac ccctagccag ccgacttctc 10782
cagttacgga gcgagcccct cttttgtttt ttgtttttgc cag atg cat ccc gta 10837
Met His Pro Val
ctg cgg cag atg cgc ccc cac cac cct cca ccg caa caa cag ccc cct 10885
Leu Arg Gln Met Arg Pro His His Pro Pro Pro Gln Gln Gln Pro Pro
665 670 675 680
cca cag ccg gcg ctt ctg ccc ccg ccc cag cag caa ctt cca gcc acg 10933
Pro Gln Pro Ala Leu Leu Pro Pro Pro Gln Gln Gln Leu Pro Ala Thr
685 690 695
acc gcc gcg gcc gcc gtg agc ggg gct gga cag agt tat gac cac cag 10981
Thr Ala Ala Ala Ala Val Ser Gly Ala Gly Gln Ser Tyr Asp His Gln
700 705 710
ctg gcc ttg gaa gag ggc gag ggg ctg gcg cgc ctg ggg gcg tcg tcg 11029
Leu Ala Leu Glu Glu Gly Glu Gly Leu Ala Arg Leu Gly Ala Ser Ser
715 720 725
ccg gag cgg cac ccg cgc gtg cag atg aaa agg gac gct cgc gag gcc 11077
Pro Glu Arg His Pro Arg Val Gln Met Lys Arg Asp Ala Arg Glu Ala
730 735 740
tac gtg ccc aag cag aac ctg ttc aga gac agg agc ggc gag gag ccc 11125
Tyr Val Pro Lys Gln Asn Leu Phe Arg Asp Arg Ser Gly Glu Glu Pro
745 750 755 760
gag gag atg cgc gcg gcc cgg ttc cac gcg ggg cgg gag ctg cgg cgc 11173
Glu Glu Met Arg Ala Ala Arg Phe His Ala Gly Arg Glu Leu Arg Arg
765 770 775
ggc ctg gac cga aag agg gtg ctg agg gac gag gat ttc gag gcg gac 11221
Gly Leu Asp Arg Lys Arg Val Leu Arg Asp Glu Asp Phe Glu Ala Asp
780 785 790
gag ctg acg ggg atc agc ccc gcg cgc gcg cac gtg gcc gcg gcc aac 11269
Glu Leu Thr Gly Ile Ser Pro Ala Arg Ala His Val Ala Ala Ala Asn
795 800 805
ctg gtc acg gcg tac gag cag acc gtg aag gag gag agc aac ttc caa 11317
Leu Val Thr Ala Tyr Glu Gln Thr Val Lys Glu Glu Ser Asn Phe Gln
810 815 820
aaa tcc ttc aac aac cac gtg cgc acg ctg atc gcg cgc gag gag gtg 11365
Lys Ser Phe Asn Asn His Val Arg Thr Leu Ile Ala Arg Glu Glu Val
825 830 835 840
acc ctg ggc ctg atg cac ctg tgg gac ctg ctg gag gcc atc gtg cag 11413
Thr Leu Gly Leu Met His Leu Trp Asp Leu Leu Glu Ala Ile Val Gln
845 850 855
aac ccc acg agc aag ccg ctg acg gcg cag ctg ttc ctg gtg gtg cag 11461
Asn Pro Thr Ser Lys Pro Leu Thr Ala Gln Leu Phe Leu Val Val Gln
860 865 870
cac agt cgg gac aac gag acg ttc agg gag gcg ctg ctg aat atc acc 11509
His Ser Arg Asp Asn Glu Thr Phe Arg Glu Ala Leu Leu Asn Ile Thr
875 880 885
gag ccc gag ggc cgc tgg ctc ctg gac ctg gtg aac att ctg cag agc 11557
Glu Pro Glu Gly Arg Trp Leu Leu Asp Leu Val Asn Ile Leu Gln Ser
890 895 900
atc gtg gtg cag gag cgc ggg ctg ccg ctg tcc gag aag ctg gcg gcc 11605
Ile Val Val Gln Glu Arg Gly Leu Pro Leu Ser Glu Lys Leu Ala Ala
905 910 915 920
atc aac ttc tcg gtg ctg agt ttg ggc aag tac tac gct agg aag atc 11653
Ile Asn Phe Ser Val Leu Ser Leu Gly Lys Tyr Tyr Ala Arg Lys Ile
925 930 935
tac aag acc ccg tac gtg ccc ata gac aag gag gtg aag atc gac ggg 11701
Tyr Lys Thr Pro Tyr Val Pro Ile Asp Lys Glu Val Lys Ile Asp Gly
940 945 950
ttt tac atg cgc atg acc ctg aaa gtg ctg acc ctg agc gac gat ctg 11749
Phe Tyr Met Arg Met Thr Leu Lys Val Leu Thr Leu Ser Asp Asp Leu
955 960 965
ggg gtg tac cgc aac gac agg atg cac cgt gcg gtg agc gcc agc cgc 11797
Gly Val Tyr Arg Asn Asp Arg Met His Arg Ala Val Ser Ala Ser Arg
970 975 980
cgg cgc gag ctg agc gac cag gag ctg atg cac agc ctg cag cgg gcc 11845
Arg Arg Glu Leu Ser Asp Gln Glu Leu Met His Ser Leu Gln Arg Ala
985 990 995 1000
ctg acc ggg gcc ggg acc gag ggg gag agc tac ttt gac atg ggc 11890
Leu Thr Gly Ala Gly Thr Glu Gly Glu Ser Tyr Phe Asp Met Gly
1005 1010 1015
gcg gac ctg cgc tgg cag ccc agc cgc cgg gcc ttg gaa gct gcc 11935
Ala Asp Leu Arg Trp Gln Pro Ser Arg Arg Ala Leu Glu Ala Ala
1020 1025 1030
ggc ggt tcc ccc tac gta gaa gag gtg gac gat gag gtg gac gag 11980
Gly Gly Ser Pro Tyr Val Glu Glu Val Asp Asp Glu Val Asp Glu
1035 1040 1045
gag ggc gag tac ctg gaa gac tgatggcgcg accgtatttt tgctag atg caa 12033
Glu Gly Glu Tyr Leu Glu Asp Met Gln
1050
caa cag cca cct cct gat ccc gcg atg cgg gcg gcg ctg cag agc 12078
Gln Gln Pro Pro Pro Asp Pro Ala Met Arg Ala Ala Leu Gln Ser
1055 1060 1065
cag ccg tcc ggc att aac tcc tcg gac gat tgg acc cag gcc atg 12123
Gln Pro Ser Gly Ile Asn Ser Ser Asp Asp Trp Thr Gln Ala Met
1070 1075 1080
caa cgc atc atg gcg ctg acg acc cgc aac ccc gaa gcc ttt aga 12168
Gln Arg Ile Met Ala Leu Thr Thr Arg Asn Pro Glu Ala Phe Arg
1085 1090 1095
cag cag ccc cag gcc aac cgg ctc tcg gcc atc ctg gag gcc gtg 12213
Gln Gln Pro Gln Ala Asn Arg Leu Ser Ala Ile Leu Glu Ala Val
1100 1105 1110
gtg ccc tcg cgc tcc aac ccc acg cac gag aag gtc ctg gcc atc 12258
Val Pro Ser Arg Ser Asn Pro Thr His Glu Lys Val Leu Ala Ile
1115 1120 1125
gtg aac gcg ctg gtg gag aac aag gcc atc cgc ggc gac gag gcc 12303
Val Asn Ala Leu Val Glu Asn Lys Ala Ile Arg Gly Asp Glu Ala
1130 1135 1140
ggc ctg gtg tac aac gcg ctg ctg gag cgc gtg gcc cgc tac aac 12348
Gly Leu Val Tyr Asn Ala Leu Leu Glu Arg Val Ala Arg Tyr Asn
1145 1150 1155
agc acc aac gtg cag acc aac ctg gac cgc atg gtg acc gac gtg 12393
Ser Thr Asn Val Gln Thr Asn Leu Asp Arg Met Val Thr Asp Val
1160 1165 1170
cgc gag gcc gtg gcc cag cgc gag cgg ttc cac cgc gag tcc aac 12438
Arg Glu Ala Val Ala Gln Arg Glu Arg Phe His Arg Glu Ser Asn
1175 1180 1185
ctg gga tcc atg gtg gcg ctg aac gcc ttc ctc agc acc cag ccc 12483
Leu Gly Ser Met Val Ala Leu Asn Ala Phe Leu Ser Thr Gln Pro
1190 1195 1200
gcc aac gtg ccc cgg ggc cag gag gac tac acc aac ttc atc agc 12528
Ala Asn Val Pro Arg Gly Gln Glu Asp Tyr Thr Asn Phe Ile Ser
1205 1210 1215
gcc ctg cgc ctg atg gtg acc gag gtg ccc cag agc gag gtg tac 12573
Ala Leu Arg Leu Met Val Thr Glu Val Pro Gln Ser Glu Val Tyr
1220 1225 1230
cag tcc ggg ccg gac tac ttc ttc cag acc agt cgc cag ggc ttg 12618
Gln Ser Gly Pro Asp Tyr Phe Phe Gln Thr Ser Arg Gln Gly Leu
1235 1240 1245
cag acc gtg aac ctg agc cag gct ttc aag aac ttg cag gga ttg 12663
Gln Thr Val Asn Leu Ser Gln Ala Phe Lys Asn Leu Gln Gly Leu
1250 1255 1260
tgg ggc gtg cag gcc ccg gtc ggg gac cgc gcg acg gtg tcg agc 12708
Trp Gly Val Gln Ala Pro Val Gly Asp Arg Ala Thr Val Ser Ser
1265 1270 1275
ctg ctg acg ccg aac tcg cgc ctg ctg ctg ctg ctg gtg gcc ccc 12753
Leu Leu Thr Pro Asn Ser Arg Leu Leu Leu Leu Leu Val Ala Pro
1280 1285 1290
ttc acg gac agc ggc agc atc aac cgc aac tcg tac ctg ggc tac 12798
Phe Thr Asp Ser Gly Ser Ile Asn Arg Asn Ser Tyr Leu Gly Tyr
1295 1300 1305
ctg att aac ctg tac cgc gag gcc atc ggc cag gcg cac gtg gac 12843
Leu Ile Asn Leu Tyr Arg Glu Ala Ile Gly Gln Ala His Val Asp
1310 1315 1320
gag cag acc tac cag gag atc acc cac gtg agc cgc gcc ctg ggc 12888
Glu Gln Thr Tyr Gln Glu Ile Thr His Val Ser Arg Ala Leu Gly
1325 1330 1335
cag gac gac ccg ggc aac ctg gaa gcc acc ctg aac ttt ttg ctg 12933
Gln Asp Asp Pro Gly Asn Leu Glu Ala Thr Leu Asn Phe Leu Leu
1340 1345 1350
acc aac cgg tcg cag aag atc ccg ccc cag tac gcg ctc agc acc 12978
Thr Asn Arg Ser Gln Lys Ile Pro Pro Gln Tyr Ala Leu Ser Thr
1355 1360 1365
gag gag gag cgc atc ctg cgt tac gtg cag cag agc gtg ggc ctg 13023
Glu Glu Glu Arg Ile Leu Arg Tyr Val Gln Gln Ser Val Gly Leu
1370 1375 1380
ttc ctg atg cag gag ggg gcc acc ccc agc gcc gcg ctc gac atg 13068
Phe Leu Met Gln Glu Gly Ala Thr Pro Ser Ala Ala Leu Asp Met
1385 1390 1395
acc gcg cgc aac atg gag ccc agc atg tac gcc agc aac cgc ccg 13113
Thr Ala Arg Asn Met Glu Pro Ser Met Tyr Ala Ser Asn Arg Pro
1400 1405 1410
ttc atc aat aaa ctg atg gac tac ttg cat cgg gcg gcc gcc atg 13158
Phe Ile Asn Lys Leu Met Asp Tyr Leu His Arg Ala Ala Ala Met
1415 1420 1425
aac tct gac tat ttc acc aac gcc atc ctg aat ccc cac tgg ctc 13203
Asn Ser Asp Tyr Phe Thr Asn Ala Ile Leu Asn Pro His Trp Leu
1430 1435 1440
ccg ccg cct ggg ttc tac acg ggc gag tac gac atg ccc gac ccc 13248
Pro Pro Pro Gly Phe Tyr Thr Gly Glu Tyr Asp Met Pro Asp Pro
1445 1450 1455
aat gac ggg ttc ctg tgg gac gat gtg gac agc agc gtg ttc tcc 13293
Asn Asp Gly Phe Leu Trp Asp Asp Val Asp Ser Ser Val Phe Ser
1460 1465 1470
ccc cga ccg ggt gct aac gag cgc ccc ttg tgg aag aag gaa ggc 13338
Pro Arg Pro Gly Ala Asn Glu Arg Pro Leu Trp Lys Lys Glu Gly
1475 1480 1485
agc gac cga cgc ccg tcc tcg gcg ctg tcc ggc cgc gag ggt gct 13383
Ser Asp Arg Arg Pro Ser Ser Ala Leu Ser Gly Arg Glu Gly Ala
1490 1495 1500
gcc gcg gcg gtg ccc gag gcc gcc agt cct ttt cct agc ttg ccc 13428
Ala Ala Ala Val Pro Glu Ala Ala Ser Pro Phe Pro Ser Leu Pro
1505 1510 1515
ttc tcg ctg aac agt atc cgc agc agc gag ctg ggg agg atc acg 13473
Phe Ser Leu Asn Ser Ile Arg Ser Ser Glu Leu Gly Arg Ile Thr
1520 1525 1530
cgc ccg cgc ttg ctg ggc gag gag gag tac ttg aat gac tcg ctg 13518
Arg Pro Arg Leu Leu Gly Glu Glu Glu Tyr Leu Asn Asp Ser Leu
1535 1540 1545
ttg aga ccc gag cgg gag aag aac ttc ccc aat aac ggg ata gag 13563
Leu Arg Pro Glu Arg Glu Lys Asn Phe Pro Asn Asn Gly Ile Glu
1550 1555 1560
agc ctg gtg gac aag atg agc cgc tgg aag acg tat gcg cag gag 13608
Ser Leu Val Asp Lys Met Ser Arg Trp Lys Thr Tyr Ala Gln Glu
1565 1570 1575
cac agg gac gat ccc cgg gcg tcg cag ggg gcc acg agc cgg ggc 13653
His Arg Asp Asp Pro Arg Ala Ser Gln Gly Ala Thr Ser Arg Gly
1580 1585 1590
agc gcc gcc cgt aaa cgc cgg tgg cac gac agg cag cgg gga ctg 13698
Ser Ala Ala Arg Lys Arg Arg Trp His Asp Arg Gln Arg Gly Leu
1595 1600 1605
atg tgg gac gat gag gat tcc gcc gac gac agc agc gtg ttg gac 13743
Met Trp Asp Asp Glu Asp Ser Ala Asp Asp Ser Ser Val Leu Asp
1610 1615 1620
ttg ggt ggg agt ggt ggt ggt aac ccg ttc gct cac ctg cgc ccc 13788
Leu Gly Gly Ser Gly Gly Gly Asn Pro Phe Ala His Leu Arg Pro
1625 1630 1635
cgc atc ggg cgc atg atg taagaaaccg aaaataaatg atactcacca 13836
Arg Ile Gly Arg Met Met
1640 1645
aggccatggc gaccagcgtg cgttcgtttc ttctctgttg ttgtatctag t atg atg 13893
Met Met
agg cgt gcg tac ccg gag ggt cct cct ccc tcg tac gag agc gtg 13938
Arg Arg Ala Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser Val
1650 1655 1660
atg cag cag gcg atg gcg gcg gcg gcg gcg atg cag ccc ccg ctg 13983
Met Gln Gln Ala Met Ala Ala Ala Ala Ala Met Gln Pro Pro Leu
1665 1670 1675
gag gct cct tac gtg ccc ccg cgg tac ctg gcg cct acg gag ggg 14028
Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly
1680 1685 1690
cgg aac agc att cgt tac tcg gag ctg gca ccc ttg tac gat acc 14073
Arg Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr
1695 1700 1705
acc cgg ttg tac ctg gtg gac aac aag tcg gcg gac atc gcc tcg 14118
Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser
1710 1715 1720
ctg aac tac cag aac gac cac agc aac ttc ctg acc acc gtg gtg 14163
Leu Asn Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val
1725 1730 1735
cag aac aat gac ttc acc ccc acg gag gcc agc acc cag acc atc 14208
Gln Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gln Thr Ile
1740 1745 1750
aac ttt gac gag cgc tcg cgg tgg ggc ggc cag ctg aaa acc atc 14253
Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile
1755 1760 1765
atg cac acc aac atg ccc aac gtg aac gag ttc atg tac agc aac 14298
Met His Thr Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser Asn
1770 1775 1780
aag ttc aag gcg cgg gtc atg gtc tcc cgc aag acc ccc aac ggg 14343
Lys Phe Lys Ala Arg Val Met Val Ser Arg Lys Thr Pro Asn Gly
1785 1790 1795
gtg gga gag gat tat gat ggt agt cag gat gag ctg aaa tac gaa 14388
Val Gly Glu Asp Tyr Asp Gly Ser Gln Asp Glu Leu Lys Tyr Glu
1800 1805 1810
tgg gtg gag ttt gag ctg ccc gaa ggc aac ttc tcg gtg acc atg 14433
Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser Val Thr Met
1815 1820 1825
acc atc gac ctg atg aac aac gcc atc atc gac aat tac ttg gcg 14478
Thr Ile Asp Leu Met Asn Asn Ala Ile Ile Asp Asn Tyr Leu Ala
1830 1835 1840
gtg ggg cgg cag aac ggg gtc ctg gag agc gat atc ggc gtg aag 14523
Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp Ile Gly Val Lys
1845 1850 1855
ttc gac act agg aac ttc agg ctg ggg tgg gac ccc gtg acc gag 14568
Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr Glu
1860 1865 1870
ctg gtc atg ccc ggg gtg tac acc aac gag gcc ttc cat ccc gat 14613
Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp
1875 1880 1885
att gtc ttg ctg ccc ggc tgc ggg gtg gac ttc acc gag agc cgc 14658
Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg
1890 1895 1900
ctc agc aac ctg ctg ggc att cgc aag agg cag cca ttc cag gag 14703
Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu
1905 1910 1915
ggt ttc cag atc atg tac gag gat ctg gag ggg ggc aac atc ccc 14748
Gly Phe Gln Ile Met Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro
1920 1925 1930
gcg ctc ctg gat gtc gac gcc tat gag aaa agc aag gag gaa gca 14793
Ala Leu Leu Asp Val Asp Ala Tyr Glu Lys Ser Lys Glu Glu Ala
1935 1940 1945
gca gct gag gca acc gca gcc gta gcc acc gcc tct acc gag gtc 14838
Ala Ala Glu Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val
1950 1955 1960
agg ggc gat aat ttt gca agc gcc gca gca gtg gca gcg gcc gag 14883
Arg Gly Asp Asn Phe Ala Ser Ala Ala Ala Val Ala Ala Ala Glu
1965 1970 1975
gcg gct gaa acc gaa agt aag ata gtc att cag ccg gtg gag aag 14928
Ala Ala Glu Thr Glu Ser Lys Ile Val Ile Gln Pro Val Glu Lys
1980 1985 1990
gat agc aag aac agg agc tac aac gta cta ccg gac aag ata aac 14973
Asp Ser Lys Asn Arg Ser Tyr Asn Val Leu Pro Asp Lys Ile Asn
1995 2000 2005
acc gcc tac cgc agc tgg tac ctg gcc tac aac tat ggc gac ccc 15018
Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro
2010 2015 2020
gag aag ggc gtg cgc tcc tgg acg ctg ctc acc acc tcg gac gtc 15063
Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr Ser Asp Val
2025 2030 2035
acc tgc ggc gtg gag caa gtc tac tgg tcg ctg ccc gac atg atg 15108
Thr Cys Gly Val Glu Gln Val Tyr Trp Ser Leu Pro Asp Met Met
2040 2045 2050
caa gac ccg gtc acc ttc cgc tcc acg cgt caa gtt agc aac tac 15153
Gln Asp Pro Val Thr Phe Arg Ser Thr Arg Gln Val Ser Asn Tyr
2055 2060 2065
ccg gtg gtg ggc gcc gag ctc ctg ccc gtc tac tcc aag agc ttc 15198
Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser Phe
2070 2075 2080
ttc aac gag cag gcc gtc tac tcg cag cag ctg cgc gcc ttc acc 15243
Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln Leu Arg Ala Phe Thr
2085 2090 2095
tcg ctc acg cac gtc ttc aac cgc ttc ccc gag aac cag atc ctc 15288
Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu
2100 2105 2110
gtc cgc ccg ccc gcg ccc acc att acc acc gtc agt gaa aac gtt 15333
Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val
2115 2120 2125
cct gct ctc aca gat cac ggg acc ctg ccg ctg cgc agc agt atc 15378
Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile
2130 2135 2140
cgg gga gtc cag cgc gtg acc gtt act gac gcc aga cgc cgc acc 15423
Arg Gly Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr
2145 2150 2155
tgc ccc tac gtc tac aag gcc ctg ggc ata gtc gcg ccg cgc gtc 15468
Cys Pro Tyr Val Tyr Lys Ala Leu Gly Ile Val Ala Pro Arg Val
2160 2165 2170
ctc tcg agc cgc acc ttc taaaaa atg tcc att ctc atc tcg ccc agt 15516
Leu Ser Ser Arg Thr Phe Met Ser Ile Leu Ile Ser Pro Ser
2175 2180 2185
aat aac acc ggt tgg ggc ctg cgc gcg ccc agc aag atg tac gga 15561
Asn Asn Thr Gly Trp Gly Leu Arg Ala Pro Ser Lys Met Tyr Gly
2190 2195 2200
ggc gct cgc caa cgc tcc acg caa cac ccc gtg cgc gtg cgc ggg 15606
Gly Ala Arg Gln Arg Ser Thr Gln His Pro Val Arg Val Arg Gly
2205 2210 2215
cac ttc cgc gct ccc tgg ggc gcc ctc aag ggc cgc gtg cgg tcg 15651
His Phe Arg Ala Pro Trp Gly Ala Leu Lys Gly Arg Val Arg Ser
2220 2225 2230
cgc acc acc gtc gac gac gtg atc gac cag gtg gtg gcc gac gcg 15696
Arg Thr Thr Val Asp Asp Val Ile Asp Gln Val Val Ala Asp Ala
2235 2240 2245
cgc aac tac acc ccc gcc gcc gcg ccc gtc tcc acc gtg gac gcc 15741
Arg Asn Tyr Thr Pro Ala Ala Ala Pro Val Ser Thr Val Asp Ala
2250 2255 2260
gtc atc gac agc gtg gtg gcc gac gcg cgc cgg tac gcc cgc gcc 15786
Val Ile Asp Ser Val Val Ala Asp Ala Arg Arg Tyr Ala Arg Ala
2265 2270 2275
aag agc cgg cgg cgg cgc atc gcc cgg cgg cac cgg agc acc ccc 15831
Lys Ser Arg Arg Arg Arg Ile Ala Arg Arg His Arg Ser Thr Pro
2280 2285 2290
gcc atg cgc gcg gcg cga gcc ttg ctg cgc agg gcc agg cgc acg 15876
Ala Met Arg Ala Ala Arg Ala Leu Leu Arg Arg Ala Arg Arg Thr
2295 2300 2305
gga cgc agg gcc atg ctc agg gcg gcc aga cgc gcg gct tca ggc 15921
Gly Arg Arg Ala Met Leu Arg Ala Ala Arg Arg Ala Ala Ser Gly
2310 2315 2320
gcc agc gcc ggc agg acc cgg aga cgc gcg gcc acg gcg gcg gca 15966
Ala Ser Ala Gly Arg Thr Arg Arg Arg Ala Ala Thr Ala Ala Ala
2325 2330 2335
gcg gcc atc gcc agc atg tcc cgc ccg cgg cga ggg aac gtg tac 16011
Ala Ala Ile Ala Ser Met Ser Arg Pro Arg Arg Gly Asn Val Tyr
2340 2345 2350
tgg gtg cgc gac gcc gcc acc ggt gtg cgc gtg ccc gtg cgc acc 16056
Trp Val Arg Asp Ala Ala Thr Gly Val Arg Val Pro Val Arg Thr
2355 2360 2365
cgc ccc cct cgc act tgaagatgtt cacttcgcga tgttgatgtg tcccagcggc 16111
Arg Pro Pro Arg Thr
2370
gagg atg tcc aag cgc aaa ttc aag gaa gag atg ctc cag gtc atc 16157
Met Ser Lys Arg Lys Phe Lys Glu Glu Met Leu Gln Val Ile
2375 2380 2385
gcg cct gag atc tac ggc ccc gcg gcg gcg gtg aag gag gaa aga 16202
Ala Pro Glu Ile Tyr Gly Pro Ala Ala Ala Val Lys Glu Glu Arg
2390 2395 2400
aag ccc cgc aaa atc aag cgg gtc aaa aag gac aaa aag gaa gaa 16247
Lys Pro Arg Lys Ile Lys Arg Val Lys Lys Asp Lys Lys Glu Glu
2405 2410 2415
gat gtg gac gat gtg gtg gag ttt gtg cgc gag ttc gcc ccc cgg 16292
Asp Val Asp Asp Val Val Glu Phe Val Arg Glu Phe Ala Pro Arg
2420 2425 2430
cgg cgc gtg cag tgg cgc ggg cgg aag gtg cag ccg gtg ctg aga 16337
Arg Arg Val Gln Trp Arg Gly Arg Lys Val Gln Pro Val Leu Arg
2435 2440 2445
ccc ggc acc acc gtg gtc ttc acg ccc ggc gag cgc tcc ggc acc 16382
Pro Gly Thr Thr Val Val Phe Thr Pro Gly Glu Arg Ser Gly Thr
2450 2455 2460
gct tcc aag cgc tcc tac gac gag gtg tac ggg gat gat gat att 16427
Ala Ser Lys Arg Ser Tyr Asp Glu Val Tyr Gly Asp Asp Asp Ile
2465 2470 2475
ctg gag cag gcg gcc gag cgc ctg ggc gag ttt gct tac ggc aag 16472
Leu Glu Gln Ala Ala Glu Arg Leu Gly Glu Phe Ala Tyr Gly Lys
2480 2485 2490
cgc agc cgc tcc gcg ccg aag gaa gag gcg gtg tcc atc ccg ctg 16517
Arg Ser Arg Ser Ala Pro Lys Glu Glu Ala Val Ser Ile Pro Leu
2495 2500 2505
gac cac ggc aac ccc acg ccg agc ctc aag ccc gtg acc ctg cag 16562
Asp His Gly Asn Pro Thr Pro Ser Leu Lys Pro Val Thr Leu Gln
2510 2515 2520
cag gtg ctg ccg agc gcg gcg ccg cga agg ggg ttc aag cgc gag 16607
Gln Val Leu Pro Ser Ala Ala Pro Arg Arg Gly Phe Lys Arg Glu
2525 2530 2535
ggc gag gat ctg tat ccc acc atg cag ctg atg gtg ccc aaa cgc 16652
Gly Glu Asp Leu Tyr Pro Thr Met Gln Leu Met Val Pro Lys Arg
2540 2545 2550
cag aag ctg gaa gac gtg ctg gaa acc atg aag gtg gac ccg gac 16697
Gln Lys Leu Glu Asp Val Leu Glu Thr Met Lys Val Asp Pro Asp
2555 2560 2565
gtg cag ccc gag gtc aag gtg cgg ccc atc aag cag gtg gcc ccg 16742
Val Gln Pro Glu Val Lys Val Arg Pro Ile Lys Gln Val Ala Pro
2570 2575 2580
ggt ctg ggc gtg cag acc gtg gac atc aag atc ccc acg gag ccc 16787
Gly Leu Gly Val Gln Thr Val Asp Ile Lys Ile Pro Thr Glu Pro
2585 2590 2595
atg gaa acg cag acc gag ccc atg atc aag ccc agt acc agc acc 16832
Met Glu Thr Gln Thr Glu Pro Met Ile Lys Pro Ser Thr Ser Thr
2600 2605 2610
atg gag gtg cag acg gat ccc tgg atg cca gcc gcc ccc acc agc 16877
Met Glu Val Gln Thr Asp Pro Trp Met Pro Ala Ala Pro Thr Ser
2615 2620 2625
agc cga aga ccc cgg cgc aag tac ggc gcg gcc agc ctg ctg atg 16922
Ser Arg Arg Pro Arg Arg Lys Tyr Gly Ala Ala Ser Leu Leu Met
2630 2635 2640
ccc aac tac gcg ctg cat cct tcc atc atc ccc acg ccg ggc tac 16967
Pro Asn Tyr Ala Leu His Pro Ser Ile Ile Pro Thr Pro Gly Tyr
2645 2650 2655
cgc ggc acg cgc ttc tac cgc ggt cat aca acc agc tcc cgc cgc 17012
Arg Gly Thr Arg Phe Tyr Arg Gly His Thr Thr Ser Ser Arg Arg
2660 2665 2670
cgc aag acc acc act cgc cgc cgc cgt cgc cgc aca gcc gcc gct 17057
Arg Lys Thr Thr Thr Arg Arg Arg Arg Arg Arg Thr Ala Ala Ala
2675 2680 2685
gca acc acc ccc gcc gcc ctg gtg cgg aga gtg tac cgc cgc ggc 17102
Ala Thr Thr Pro Ala Ala Leu Val Arg Arg Val Tyr Arg Arg Gly
2690 2695 2700
cgc gcg cct ctg acc ctg ccg cgc gcg cgc tac cac ccg agc atc 17147
Arg Ala Pro Leu Thr Leu Pro Arg Ala Arg Tyr His Pro Ser Ile
2705 2710 2715
gcc att taaaactttc gcctgctttg cag atg gct ctc aca tgc cgc ctc 17197
Ala Ile Met Ala Leu Thr Cys Arg Leu
2720
cgc gtc ccc att acg ggc tac cga gga aga aaa ccg cgc cgt aga 17242
Arg Val Pro Ile Thr Gly Tyr Arg Gly Arg Lys Pro Arg Arg Arg
2725 2730 2735
agg ctg gcg ggg aac ggg atg cgt cgc cac cac cac cgg cgg cgg 17287
Arg Leu Ala Gly Asn Gly Met Arg Arg His His His Arg Arg Arg
2740 2745 2750
cgc gcc atc agc aag cgg ttg ggg gga ggc ttc ctg ccc gcg ctg 17332
Arg Ala Ile Ser Lys Arg Leu Gly Gly Gly Phe Leu Pro Ala Leu
2755 2760 2765
atc ccc atc atc gcc gcg gcg atc ggg gcg atc ccc ggc att gct 17377
Ile Pro Ile Ile Ala Ala Ala Ile Gly Ala Ile Pro Gly Ile Ala
2770 2775 2780
tcc gtg gcg gtg cag gcc tct cag cgc cac tgagacacac ttggaaacat 17427
Ser Val Ala Val Gln Ala Ser Gln Arg His
2785 2790
cttgtaataa acca atg gac tct gac gct cct ggt cct gtg atg tgt 17474
Met Asp Ser Asp Ala Pro Gly Pro Val Met Cys
2795 2800 2805
ttt cgt aga cag atg gaa gac atc aat ttt tcg tcc ctg gct ccg 17519
Phe Arg Arg Gln Met Glu Asp Ile Asn Phe Ser Ser Leu Ala Pro
2810 2815 2820
cga cac ggc acg cgg ccg ttc atg ggc acc tgg agc gac atc ggc 17564
Arg His Gly Thr Arg Pro Phe Met Gly Thr Trp Ser Asp Ile Gly
2825 2830 2835
acc agc caa ctg aac ggg ggc gcc ttc aat tgg agc agt ctc tgg 17609
Thr Ser Gln Leu Asn Gly Gly Ala Phe Asn Trp Ser Ser Leu Trp
2840 2845 2850
agc ggg ctt aag aat ttc ggg tcc acg ctt aaa acc tat ggc agc 17654
Ser Gly Leu Lys Asn Phe Gly Ser Thr Leu Lys Thr Tyr Gly Ser
2855 2860 2865
aag gcg tgg aac agc acc aca ggg cag gcg ctg agg gat aag ctg 17699
Lys Ala Trp Asn Ser Thr Thr Gly Gln Ala Leu Arg Asp Lys Leu
2870 2875 2880
aaa gag cag aac ttc cag cag aag gtg gtc gat ggg ctc gcc tcg 17744
Lys Glu Gln Asn Phe Gln Gln Lys Val Val Asp Gly Leu Ala Ser
2885 2890 2895
ggc atc aac ggg gtg gtg gac ctg gcc aac cag gcc gtg cag cgg 17789
Gly Ile Asn Gly Val Val Asp Leu Ala Asn Gln Ala Val Gln Arg
2900 2905 2910
cag atc aac agc cgc ctg gac ccg gtg ccg ccc gcc ggc tcc gtg 17834
Gln Ile Asn Ser Arg Leu Asp Pro Val Pro Pro Ala Gly Ser Val
2915 2920 2925
gag atg ccg cag gtg gag gag gag ctg cct ccc ctg gac aag cgg 17879
Glu Met Pro Gln Val Glu Glu Glu Leu Pro Pro Leu Asp Lys Arg
2930 2935 2940
ggc gag aag cga ccc cgc ccc gac gcg gag gag acg ctg ctg acg 17924
Gly Glu Lys Arg Pro Arg Pro Asp Ala Glu Glu Thr Leu Leu Thr
2945 2950 2955
cac acg gac gag ccg ccc ccg tac gag gag gcg gtg aaa ctg ggt 17969
His Thr Asp Glu Pro Pro Pro Tyr Glu Glu Ala Val Lys Leu Gly
2960 2965 2970
ctg ccc acc acg cgg ccc atc gcg ccc ctg gcc acc ggg gtg ctg 18014
Leu Pro Thr Thr Arg Pro Ile Ala Pro Leu Ala Thr Gly Val Leu
2975 2980 2985
aaa ccc gaa agt aat aag ccc gcg acc ctg gac ttg cct cct ccc 18059
Lys Pro Glu Ser Asn Lys Pro Ala Thr Leu Asp Leu Pro Pro Pro
2990 2995 3000
cag cct tct cgc ccc tcc aca gtg gct aag ccc ctg ccg ccg gtg 18104
Gln Pro Ser Arg Pro Ser Thr Val Ala Lys Pro Leu Pro Pro Val
3005 3010 3015
gcc gtg gcc cgc gcg cga ccc ggg ggc acc gcc cgc cct cat gcg 18149
Ala Val Ala Arg Ala Arg Pro Gly Gly Thr Ala Arg Pro His Ala
3020 3025 3030
aac tgg cag agc act ctg aac agc atc gtg ggt ctg gga gtg cag 18194
Asn Trp Gln Ser Thr Leu Asn Ser Ile Val Gly Leu Gly Val Gln
3035 3040 3045
agt gtg aag cgc cgc cgc tgc tat taaacctacc gtagcgctta acttgcttgt 18248
Ser Val Lys Arg Arg Arg Cys Tyr
3050
ctgtgtgtgt atgtattatg tcgccgccgc cgctgtccac cagaaggagg agtgaagagg 18308
cgcgtcgccg agttgcaag atg gcc acc cca tcg atg ctg ccc cag tgg 18357
Met Ala Thr Pro Ser Met Leu Pro Gln Trp
3055 3060
gcg tac atg cac atc gcc gga cag gac gct tcg gag tac ctg agt 18402
Ala Tyr Met His Ile Ala Gly Gln Asp Ala Ser Glu Tyr Leu Ser
3065 3070 3075
ccg ggt ctg gtg cag ttc gcc cgc gcc aca gac acc tac ttc agt 18447
Pro Gly Leu Val Gln Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser
3080 3085 3090
ctg ggg aac aag ttt agg aac ccc acg gtg gcg ccc acg cac gat 18492
Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp
3095 3100 3105
gtg acc acc gac cgc agc cag cgg ctg acg ctg cgc ttc gtg ccc 18537
Val Thr Thr Asp Arg Ser Gln Arg Leu Thr Leu Arg Phe Val Pro
3110 3115 3120
gtg gac cgc gag gac aac acc tac tcg tac aaa gtg cgc tac acg 18582
Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg Tyr Thr
3125 3130 3135
ctg gcc gtg ggc gac aac cgc gtg ctg gac atg gcc agc acc tac 18627
Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr
3140 3145 3150
ttt gac atc cgc ggc gtg ttg gac cgg ggc cct agc ttc aaa ccc 18672
Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser Phe Lys Pro
3155 3160 3165
tac tcc ggc acc gcc tac aac agc ctg gct ccc aag gga gcg ccc 18717
Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro
3170 3175 3180
aat tcc agc cag tgg gag caa aat gaa aac aat ggt caa ggt caa 18762
Asn Ser Ser Gln Trp Glu Gln Asn Glu Asn Asn Gly Gln Gly Gln
3185 3190 3195
gct aag aca cac acc tat ggt gtt gct gct atg ggc gga ctt gat 18807
Ala Lys Thr His Thr Tyr Gly Val Ala Ala Met Gly Gly Leu Asp
3200 3205 3210
att acg aaa gag ggt ctt caa att gga act gat gct agt aag gaa 18852
Ile Thr Lys Glu Gly Leu Gln Ile Gly Thr Asp Ala Ser Lys Glu
3215 3220 3225
gat gac aat gaa att tat gcc gat gaa aca tat cag ccc gag cct 18897
Asp Asp Asn Glu Ile Tyr Ala Asp Glu Thr Tyr Gln Pro Glu Pro
3230 3235 3240
caa ata gga gag gaa aat tgg caa gac act gaa aac ttt tat gga 18942
Gln Ile Gly Glu Glu Asn Trp Gln Asp Thr Glu Asn Phe Tyr Gly
3245 3250 3255
ggc aga gct ctt aaa aaa gat acc aag atg aag cca tgc tat ggc 18987
Gly Arg Ala Leu Lys Lys Asp Thr Lys Met Lys Pro Cys Tyr Gly
3260 3265 3270
tca ttt gcc aga cct acc aat gtg aag gga ggg caa gcc aaa gtg 19032
Ser Phe Ala Arg Pro Thr Asn Val Lys Gly Gly Gln Ala Lys Val
3275 3280 3285
aaa aca gaa gaa aat gtt cag tca ttc gac ata gat ctg gct ttc 19077
Lys Thr Glu Glu Asn Val Gln Ser Phe Asp Ile Asp Leu Ala Phe
3290 3295 3300
ttt gat att cca agc acc ggc aca ggg agc aat ggt aca aat gta 19122
Phe Asp Ile Pro Ser Thr Gly Thr Gly Ser Asn Gly Thr Asn Val
3305 3310 3315
aat gat aag cca gac atg gtc atg tac act gaa aat gtg aat ttg 19167
Asn Asp Lys Pro Asp Met Val Met Tyr Thr Glu Asn Val Asn Leu
3320 3325 3330
gag acg cca gat act cat att gtg tac aaa cct gga act tca gat 19212
Glu Thr Pro Asp Thr His Ile Val Tyr Lys Pro Gly Thr Ser Asp
3335 3340 3345
gac agc tct gaa gcc aac ttg tgc cag cag gcc atg cca aac aga 19257
Asp Ser Ser Glu Ala Asn Leu Cys Gln Gln Ala Met Pro Asn Arg
3350 3355 3360
cct aac tac att ggt ttc aga gac aat ttt att ggg ctc atg tac 19302
Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr
3365 3370 3375
tac aac agt act ggc aac atg ggt gta ctg gct ggt cag gcc tca 19347
Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser
3380 3385 3390
cag ctg aat gct gtg gtt gac ttg caa gac aga aac acc gag ctg 19392
Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu
3395 3400 3405
tcc tac cag ctc ttg ctt gac tct ctg ggc gac aga acc cgg tat 19437
Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr
3410 3415 3420
ttc agt atg tgg aat cag gcg gtg gac agt tat gat cct gat gtg 19482
Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val
3425 3430 3435
cgc att att gaa aac cac ggt gtg gaa gat gaa ctt ccc aac tat 19527
Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr
3440 3445 3450
tgc ttc cca ttg gat gga gct ggt act aat gct gtt tac cag ggt 19572
Cys Phe Pro Leu Asp Gly Ala Gly Thr Asn Ala Val Tyr Gln Gly
3455 3460 3465
gtt aaa gaa aaa act ggc aat aat ggc gag tgg gaa gca gat acc 19617
Val Lys Glu Lys Thr Gly Asn Asn Gly Glu Trp Glu Ala Asp Thr
3470 3475 3480
aat gtt gcc tct cag aac cag ata tgc aag ggt aac atc tat gcc 19662
Asn Val Ala Ser Gln Asn Gln Ile Cys Lys Gly Asn Ile Tyr Ala
3485 3490 3495
atg gaa att aac ctc caa gcc aac ctg tgg aga agt ttc ctc tac 19707
Met Glu Ile Asn Leu Gln Ala Asn Leu Trp Arg Ser Phe Leu Tyr
3500 3505 3510
tcg aac gtg gcc ctg tac ctg ccc gat tct tac aag tac acg ccg 19752
Ser Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro
3515 3520 3525
gcc aac atc acc ctg ccc acc aac acc aac act tat gat tac atg 19797
Ala Asn Ile Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met
3530 3535 3540
aat ggg aga gtg gtg cct ccc tcg ctg gtg gac gcc tac atc aac 19842
Asn Gly Arg Val Val Pro Pro Ser Leu Val Asp Ala Tyr Ile Asn
3545 3550 3555
atc ggg gcg cgc tgg tcg ctg gac ccc atg gac aac gtg aac ccc 19887
Ile Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn Pro
3560 3565 3570
ttc aac cac cac cgc aat gcg ggg ctg cgc tac cgc tcc atg ctc 19932
Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu
3575 3580 3585
ctg ggc aac ggg cgc tac gtg ccc ttc cac atc cag gtg ccc cag 19977
Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro Gln
3590 3595 3600
aaa ttt ttc gcc atc aag agc ctc ctg ctc ctg ccc ggg tcc tac 20022
Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr
3605 3610 3615
acc tac gag tgg aac ttc cgc aag gac gtc aac atg atc ctg cag 20067
Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gln
3620 3625 3630
agc tcc ctc ggc aac gac ctg cgc acg gac ggg gcc tcc atc tcc 20112
Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser Ile Ser
3635 3640 3645
ttc acc agc atc aac ctc tac gcc acc ttc ttc ccc atg gcg cac 20157
Phe Thr Ser Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His
3650 3655 3660
aac acg gcc tcc acg ctc gag gcc atg ctg cgc aac gac acc aac 20202
Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn
3665 3670 3675
gac cag tcc ttc aac gac tac ctc tcg gcg gcc aac atg ctc tac 20247
Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr
3680 3685 3690
ccc atc ccg gcc aac gcc acc aac gtg ccc atc tcc atc ccc tcg 20292
Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser Ile Pro Ser
3695 3700 3705
cgc aac tgg gcc gcc ttc cgc ggc tgg tcc ttc acg cgt ctc aag 20337
Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys
3710 3715 3720
acc aag gag acg ccc tcg ctg ggc tcc ggg ttc gac ccc tac ttc 20382
Thr Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe
3725 3730 3735
gtc tac tcg ggc tcc atc ccc tac ctc gac ggc acc ttc tac ctc 20427
Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu
3740 3745 3750
aac cac acc ttc aag aag gtc tcc atc acc ttc gac tcc tcc gtc 20472
Asn His Thr Phe Lys Lys Val Ser Ile Thr Phe Asp Ser Ser Val
3755 3760 3765
agc tgg ccc ggc aac gac cgg ctc ctg aca ccc aac gag ttc gaa 20517
Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu
3770 3775 3780
atc aag cgc acc gtc gac ggc gag ggc tac aac gtg gcc cag tgc 20562
Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys
3785 3790 3795
aac atg acc aag gac tgg ttc ctg gtc cag atg ctg gcc cac tac 20607
Asn Met Thr Lys Asp Trp Phe Leu Val Gln Met Leu Ala His Tyr
3800 3805 3810
aac atc ggc tac cag ggc ttc tac gtg ccc gag ggc tac aag gac 20652
Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp
3815 3820 3825
cgc atg tac tcc ttc ttc cgc aac ttc cag ccc atg agc cgc cag 20697
Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg Gln
3830 3835 3840
gtg gtg gac gag gtc aac tac aag gac tac cag gcc gtc acc ctg 20742
Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gln Ala Val Thr Leu
3845 3850 3855
gcc tac cag cac aac aac tcg ggc ttc gtc ggc tac ctc gcg ccc 20787
Ala Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro
3860 3865 3870
acc atg cgc cag ggg cag ccc tac ccc gcc aac tac ccg tac ccg 20832
Thr Met Arg Gln Gly Gln Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro
3875 3880 3885
ctc atc ggc aag agc gcc gtc acc agc gtc acc cag aaa aag ttc 20877
Leu Ile Gly Lys Ser Ala Val Thr Ser Val Thr Gln Lys Lys Phe
3890 3895 3900
ctc tgc gac agg gtc atg tgg cgc atc ccc ttc tcc agc aac ttc 20922
Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe
3905 3910 3915
atg tcc atg ggc gcg ctc acc gac ctc ggc cag aac atg ctc tat 20967
Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr
3920 3925 3930
gcc aac tcc gcc cac gcg cta gac atg aat ttc gaa gtc gac ccc 21012
Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe Glu Val Asp Pro
3935 3940 3945
atg gat gag tcc acc ctt ctc tat gtt gtc ttc gaa gtc ttc gac 21057
Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe Glu Val Phe Asp
3950 3955 3960
gtc gtc cga gtg cac cag ccc cac cgc ggc gtc atc gag gcc gtc 21102
Val Val Arg Val His Gln Pro His Arg Gly Val Ile Glu Ala Val
3965 3970 3975
tac ctg cgc acc ccc ttc tcg gcc ggt aac gcc acc acc taagctcttg 21151
Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
3980 3985 3990
cttcttgc atg atg gct gag ccc acg ggc tcc ggc gag cag gag ctc 21198
Met Met Ala Glu Pro Thr Gly Ser Gly Glu Gln Glu Leu
3995 4000
agg gcc atc atc cgc gac ctg ggc tgc ggg ccc tac ttc ctg ggc 21243
Arg Ala Ile Ile Arg Asp Leu Gly Cys Gly Pro Tyr Phe Leu Gly
4005 4010 4015
acc ttc gat aag cgc ttc ccg gga ttc atg gcc ccg cac aag ctg 21288
Thr Phe Asp Lys Arg Phe Pro Gly Phe Met Ala Pro His Lys Leu
4020 4025 4030
gcc tgc gcc atc gtc aac acg gcc ggt cgc gag acc ggg ggc gag 21333
Ala Cys Ala Ile Val Asn Thr Ala Gly Arg Glu Thr Gly Gly Glu
4035 4040 4045
cac tgg ctg gcc ttc gcc tgg aac ccg cgc tcg aac acc tgc tac 21378
His Trp Leu Ala Phe Ala Trp Asn Pro Arg Ser Asn Thr Cys Tyr
4050 4055 4060
ctc ttc gac ccc ttc ggg ttc tcg gac gag cgc ctc aag cag atc 21423
Leu Phe Asp Pro Phe Gly Phe Ser Asp Glu Arg Leu Lys Gln Ile
4065 4070 4075
tac cag ttc gag tac gag ggc ctg ctg cgc cgc agc gcc ctg gcc 21468
Tyr Gln Phe Glu Tyr Glu Gly Leu Leu Arg Arg Ser Ala Leu Ala
4080 4085 4090
acc gag gac cgc tgc gtc acc ctg gaa aag tcc acc cag acc gtg 21513
Thr Glu Asp Arg Cys Val Thr Leu Glu Lys Ser Thr Gln Thr Val
4095 4100 4105
cag ggt ccg cgc tcg gcc gcc tgc ggg ctc ttc tgc tgc atg ttc 21558
Gln Gly Pro Arg Ser Ala Ala Cys Gly Leu Phe Cys Cys Met Phe
4110 4115 4120
ctg cac gcc ttc gtg cac tgg ccc gac cgc ccc atg gac aag aac 21603
Leu His Ala Phe Val His Trp Pro Asp Arg Pro Met Asp Lys Asn
4125 4130 4135
ccc acc atg aac ttg ctg acg ggg gtg ccc aac ggc atg ctc cag 21648
Pro Thr Met Asn Leu Leu Thr Gly Val Pro Asn Gly Met Leu Gln
4140 4145 4150
tcg ccc cag gtg gaa ccc acc ctg cgc cgc aac cag gag gcg ctc 21693
Ser Pro Gln Val Glu Pro Thr Leu Arg Arg Asn Gln Glu Ala Leu
4155 4160 4165
tac cgc ttc ctc aac gcc cac tcc gcc tac ttt cgc tcc cac cgc 21738
Tyr Arg Phe Leu Asn Ala His Ser Ala Tyr Phe Arg Ser His Arg
4170 4175 4180
gcg cgc atc gag aag gcc acc gcc ttc gac cgc atg aat caa gac 21783
Ala Arg Ile Glu Lys Ala Thr Ala Phe Asp Arg Met Asn Gln Asp
4185 4190 4195
atg taaaccgtgt gtgtatgtga atgctttatt cataataaac agcacatgtt 21836
Met
4200
tatgccacct tctctgaggc tctgacttta tttagaaatc gaaggggttc tgccggctct 21896
cggcgtgccc cgcgggcagg gatacgttgc ggaactggta cttgggcagc cacttgaact 21956
cggggatcag cagcttcggc acggggaggt cggggaacga gtcgctccac agcttgcgcg 22016
tgagttgcag ggcgcccagc aggtcgggcg cggagatctt gaaatcgcag ttgggacccg 22076
cgttctgcgc gcgagagttg cggtacacgg ggttgcagca ctggaacacc atcagggccg 22136
ggtgcttcac gctcgccagc accgtcgcgt cggtgatgcc ctccacgtcc agatcctcgg 22196
cgttggccat cccgaagggg gtcatcttgc aggtctgccg ccccatgctg ggcacgcagc 22256
cgggcttgtg gttgcaatcg cagtgcaggg ggatcagcat catctgggcc tgctcggagc 22316
tcatgcccgg gtacatggcc ttcatgaaag cctccagctg gcggaaggcc tgctgcgcct 22376
tgccgccctc ggtgaagaag accccgcagg acttgctaga gaactggttg gtagcgcagc 22436
ccgcgtcgtg cacgcagcag cgcgcgtcgt tgttggccag ctgcaccacg ctgcgccccc 22496
agcggttctg ggtgatcttg gcccggtcgg ggttctcctt cagcgcgcgc tgcccgttct 22556
cgctcgccac atccatctcg atcgtgtgct ccttctggat catcacggtc ccgtgcaggc 22616
accgcagctt gccctcggcc tcggtgcagc cgtgcagcca cagcgcgcag ccggtgctct 22676
cccagttctt gtgggcgatc tgggagtgcg agtgcacgaa gccctgcagg aagcggccca 22736
tcatcgtggt cagggtcttg ttgctggtga aggtcagcgg gatgccgcgg tgctcctcgt 22796
tcacatacag gtggcagatg cggcggtaca cctcgccctg ctcgggcatc agctggaagg 22856
cggacttcag gtcgctctcc acgcggtacc gctccatcag cagcgtcatc acttccatgc 22916
ccttctccca ggccgaaacg atcggcaggc tcagggggtt cttcaccgtc atcttagtcg 22976
ccgccgccga ggtcaggggg tcgttctcgt ccagggtctc aaacactcgc ttgccgtcct 23036
tctcggtgat gcgcacgggg gggaaggcga agcccacggc cgccagctcc tcctcggcct 23096
gcctttcgtc ctcgctgtcc tggctgatgt cttgcaaagg cacatgcttg gtcttgcggg 23156
gtttcttttt gggcggcaga ggcggcggcg gagacgtgct gggcgagcgc gagttctcgc 23216
tcaccacgac tatttcttct ccttggccgt cgtccgagac cacgcggcgg taggcatgcc 23276
tcttctgggg cagaggcgga ggcgacgggc tctcgcggtt cggcgggcgg ctggcagagc 23336
cccttccgcg ttcgggggtg cgctcctggc ggcgctgctc tgactgactt cctccgcggc 23396
cggccattgt gttctcctag ggagcaagc atg gag act cag cca tcg tcg cca 23449
Met Glu Thr Gln Pro Ser Ser Pro
4205
aca tcg cca tct gcc ccc gcc tcc acc gcc gac gag aac cag cag 23494
Thr Ser Pro Ser Ala Pro Ala Ser Thr Ala Asp Glu Asn Gln Gln
4210 4215 4220
cag cag aat gaa agc tta acc gcc ccg ccg ccc agc ccc acc tcc 23539
Gln Gln Asn Glu Ser Leu Thr Ala Pro Pro Pro Ser Pro Thr Ser
4225 4230 4235
gac gcc gcg gcc cca gac atg caa gag atg gag gaa tcc atc gag 23584
Asp Ala Ala Ala Pro Asp Met Gln Glu Met Glu Glu Ser Ile Glu
4240 4245 4250
att gac ctg ggc tac gtg acg ccc gcg gag cac gag gag gag ctg 23629
Ile Asp Leu Gly Tyr Val Thr Pro Ala Glu His Glu Glu Glu Leu
4255 4260 4265
gca gcg cgc ttt tca gcc ccg gaa gaa aac cac caa gag cag cca 23674
Ala Ala Arg Phe Ser Ala Pro Glu Glu Asn His Gln Glu Gln Pro
4270 4275 4280
gag cag gaa gca gag aac gag cag aac cag gct ggg ctc gag cat 23719
Glu Gln Glu Ala Glu Asn Glu Gln Asn Gln Ala Gly Leu Glu His
4285 4290 4295
ggc gac tac ctg agc ggg gca gag gac gtg ctc atc aag cat ctg 23764
Gly Asp Tyr Leu Ser Gly Ala Glu Asp Val Leu Ile Lys His Leu
4300 4305 4310
gcc cgc caa tgc atc atc gtc aag gac gcg ctg ctc gac cgc gcc 23809
Ala Arg Gln Cys Ile Ile Val Lys Asp Ala Leu Leu Asp Arg Ala
4315 4320 4325
gag gtg ccc ctc agc gtg gcg gag ctc agc cgc gcc tac gag cgc 23854
Glu Val Pro Leu Ser Val Ala Glu Leu Ser Arg Ala Tyr Glu Arg
4330 4335 4340
aac ctc ttc tcg cca cgc gtg ccc ccc aag cgc cag ccc aac ggc 23899
Asn Leu Phe Ser Pro Arg Val Pro Pro Lys Arg Gln Pro Asn Gly
4345 4350 4355
acc tgc gag ccc aac ccg cgc ctc aac ttc tac ccg gtc ttc gcg 23944
Thr Cys Glu Pro Asn Pro Arg Leu Asn Phe Tyr Pro Val Phe Ala
4360 4365 4370
gtg ccc gag gcc ctg gcc acc tac cac ctc ttt ttc aag aac caa 23989
Val Pro Glu Ala Leu Ala Thr Tyr His Leu Phe Phe Lys Asn Gln
4375 4380 4385
agg atc ccc gtc tcc tgc cgc gcc aac cgc acc cgc gcc gac gcc 24034
Arg Ile Pro Val Ser Cys Arg Ala Asn Arg Thr Arg Ala Asp Ala
4390 4395 4400
ctg ctc aac ctg ggc ccc ggc gcc cgc cta cct gat atc gcc tcc 24079
Leu Leu Asn Leu Gly Pro Gly Ala Arg Leu Pro Asp Ile Ala Ser
4405 4410 4415
ttg gaa gag gtt ccc aag atc ttc gag ggt ctg ggc agc gac gag 24124
Leu Glu Glu Val Pro Lys Ile Phe Glu Gly Leu Gly Ser Asp Glu
4420 4425 4430
act cgg gcc gcg aac gct ctg caa gga agc gga gag gag cat gag 24169
Thr Arg Ala Ala Asn Ala Leu Gln Gly Ser Gly Glu Glu His Glu
4435 4440 4445
cac cac agc gcc ctg gtg gag ttg gaa ggc gac aac gcg cgc ctg 24214
His His Ser Ala Leu Val Glu Leu Glu Gly Asp Asn Ala Arg Leu
4450 4455 4460
gcg gtc ctc aag cgc acg gtc gag ctg acc cac ttc gcc tac cca 24259
Ala Val Leu Lys Arg Thr Val Glu Leu Thr His Phe Ala Tyr Pro
4465 4470 4475
gcg ctc aac ctg ccc ccc aag gtc atg agc gcc gtc atg gac cag 24304
Ala Leu Asn Leu Pro Pro Lys Val Met Ser Ala Val Met Asp Gln
4480 4485 4490
gtg ctc atc aag cgc gcc tcg ccc ctc tcg gag gag gag atg cag 24349
Val Leu Ile Lys Arg Ala Ser Pro Leu Ser Glu Glu Glu Met Gln
4495 4500 4505
gac ccc gag agc tcg gac gag ggc aag ccc gtg gtc agc gac gag 24394
Asp Pro Glu Ser Ser Asp Glu Gly Lys Pro Val Val Ser Asp Glu
4510 4515 4520
cag ctg gcg cgc tgg ctg gga gcg agt agc acc ccc cag agc ctg 24439
Gln Leu Ala Arg Trp Leu Gly Ala Ser Ser Thr Pro Gln Ser Leu
4525 4530 4535
gaa gag cgg cgc aag ctc atg atg gcc gtg gtc ctg gtg acc gtg 24484
Glu Glu Arg Arg Lys Leu Met Met Ala Val Val Leu Val Thr Val
4540 4545 4550
gag ctg gag tgt ctg cgc cgc ttc ttt gcc gac gcg gag acc ctg 24529
Glu Leu Glu Cys Leu Arg Arg Phe Phe Ala Asp Ala Glu Thr Leu
4555 4560 4565
cgc aag gtc gag gag aac ctg cac tac ctc ttc agg cac ggg ttc 24574
Arg Lys Val Glu Glu Asn Leu His Tyr Leu Phe Arg His Gly Phe
4570 4575 4580
gtg cgc cag gcc tgc aag atc tcc aac gtg gag ctg acc aac ctg 24619
Val Arg Gln Ala Cys Lys Ile Ser Asn Val Glu Leu Thr Asn Leu
4585 4590 4595
gtc tcc tac atg ggc atc ctg cac gag aac cgc ctg ggg cag aac 24664
Val Ser Tyr Met Gly Ile Leu His Glu Asn Arg Leu Gly Gln Asn
4600 4605 4610
gtg ctg cac acc acc ctg cgc ggg gag gcc cgc cgc gac tac atc 24709
Val Leu His Thr Thr Leu Arg Gly Glu Ala Arg Arg Asp Tyr Ile
4615 4620 4625
cgc gac tgc gtc tac ctg tac ctc tgc cac acc tgg cag acg ggc 24754
Arg Asp Cys Val Tyr Leu Tyr Leu Cys His Thr Trp Gln Thr Gly
4630 4635 4640
atg ggc gtg tgg cag cag tgc ctg gag gag cag aac ctg aaa gag 24799
Met Gly Val Trp Gln Gln Cys Leu Glu Glu Gln Asn Leu Lys Glu
4645 4650 4655
ctc tgc aag ctc ctg cag aag aac ctc aag gcc ctg tgg acc ggg 24844
Leu Cys Lys Leu Leu Gln Lys Asn Leu Lys Ala Leu Trp Thr Gly
4660 4665 4670
ttc gac gag cgc acc acc gcc tcg gac ctg gcc gac ctc atc ttc 24889
Phe Asp Glu Arg Thr Thr Ala Ser Asp Leu Ala Asp Leu Ile Phe
4675 4680 4685
ccc gag cgc ctg cgg ctg acg ctg cgc aac ggg ctg ccc gac ttt 24934
Pro Glu Arg Leu Arg Leu Thr Leu Arg Asn Gly Leu Pro Asp Phe
4690 4695 4700
atg agc caa agc atg ttg caa aac ttt cgc tct ttc atc ctc gaa 24979
Met Ser Gln Ser Met Leu Gln Asn Phe Arg Ser Phe Ile Leu Glu
4705 4710 4715
cgc tcc ggg atc ctg ccc gcc acc tgc tcc gcg ctg ccc tcg gac 25024
Arg Ser Gly Ile Leu Pro Ala Thr Cys Ser Ala Leu Pro Ser Asp
4720 4725 4730
ttc gtg ccg ctg acc ttc cgc gag tgc ccc ccg ccg ctc tgg agc 25069
Phe Val Pro Leu Thr Phe Arg Glu Cys Pro Pro Pro Leu Trp Ser
4735 4740 4745
cac tgc tac ctg ctg cgc ctg gcc aac tac ctg gcc tac cac tcg 25114
His Cys Tyr Leu Leu Arg Leu Ala Asn Tyr Leu Ala Tyr His Ser
4750 4755 4760
gac gtg atc gag gac gtc agc ggc gag ggt ctg ctc gag tgc cac 25159
Asp Val Ile Glu Asp Val Ser Gly Glu Gly Leu Leu Glu Cys His
4765 4770 4775
tgc cgc tgc aac ctc tgc acg ccg cac cgc tcc ctg gcc tgc aac 25204
Cys Arg Cys Asn Leu Cys Thr Pro His Arg Ser Leu Ala Cys Asn
4780 4785 4790
ccc cag ctg ctg agc gag acc cag atc atc ggc acc ttc gag ttg 25249
Pro Gln Leu Leu Ser Glu Thr Gln Ile Ile Gly Thr Phe Glu Leu
4795 4800 4805
caa ggc ccc ggc gac agc gag ggc aag ggg ggt ctg aaa ctc acc 25294
Gln Gly Pro Gly Asp Ser Glu Gly Lys Gly Gly Leu Lys Leu Thr
4810 4815 4820
ccg ggg ctg tgg acc tcg gcc tac ttg cgc aag ttc gtg ccc gag 25339
Pro Gly Leu Trp Thr Ser Ala Tyr Leu Arg Lys Phe Val Pro Glu
4825 4830 4835
gac tac cat ccc ttc gag atc agg ttc tac gag gac caa tcc cag 25384
Asp Tyr His Pro Phe Glu Ile Arg Phe Tyr Glu Asp Gln Ser Gln
4840 4845 4850
ccg ccc aag gcc gag ctg tcg gcc tgc gtc atc acc cag ggg gcc 25429
Pro Pro Lys Ala Glu Leu Ser Ala Cys Val Ile Thr Gln Gly Ala
4855 4860 4865
atc ctg gcc caa ttg caa gcc atc cag aaa tcc cgc caa gaa ttt 25474
Ile Leu Ala Gln Leu Gln Ala Ile Gln Lys Ser Arg Gln Glu Phe
4870 4875 4880
ctg ctg aaa aag ggc cac ggg gtc tac ttg gac ccc cag acc gga 25519
Leu Leu Lys Lys Gly His Gly Val Tyr Leu Asp Pro Gln Thr Gly
4885 4890 4895
gag gag ctc aac ccc agc ttc ccc cag gat gcc ccg agg aag cag 25564
Glu Glu Leu Asn Pro Ser Phe Pro Gln Asp Ala Pro Arg Lys Gln
4900 4905 4910
caa gaa gct gaa agt gga gct gcc gcc gga gga ttt gga gga aga 25609
Gln Glu Ala Glu Ser Gly Ala Ala Ala Gly Gly Phe Gly Gly Arg
4915 4920 4925
ctg gga gag cag tca ggc aga gga gga gat gga aga ctg gga cag 25654
Leu Gly Glu Gln Ser Gly Arg Gly Gly Asp Gly Arg Leu Gly Gln
4930 4935 4940
cac tca ggc aga gga gga cag cct gca aga cag tct gga gga gga 25699
His Ser Gly Arg Gly Gly Gln Pro Ala Arg Gln Ser Gly Gly Gly
4945 4950 4955
aga cga ggt gga gga gga ggc aga gga aga agc agc cgc cgc cag 25744
Arg Arg Gly Gly Gly Gly Gly Arg Gly Arg Ser Ser Arg Arg Gln
4960 4965 4970
acc gtc gtc ctc ggc gga gga gaa agc aag cag cac gga tac cat 25789
Thr Val Val Leu Gly Gly Gly Glu Ser Lys Gln His Gly Tyr His
4975 4980 4985
ctc cgc tcc ggg tcg ggg tcg cgg cgg ccg ggc cca cag tagatgggac 25838
Leu Arg Ser Gly Ser Gly Ser Arg Arg Pro Gly Pro Gln
4990 4995 5000
gagaccgggc gcttcccgaa ccccaccacc cagaccggta agaaggagcg gcagggatac 25898
aagtcctggc gggggcacaa aaacgccatc gtctcctgct tgcaagcctg cgggggcaac 25958
atctccttca cccggcgcta cctgctcttc caccgcgggg tgaacttccc ccgcaacatc 26018
ttgcattact accgtcacct ccacagcccc tactactgtt tccaagaaga ggcagaaacc 26078
cagcagcagc agaaaaccag cggcagcagc agctagaaaa tccacagcgg cggcaggtgg 26138
actgaggatc gcggcgaacg agccggcgca gacccgggag ctgaggaacc ggatctttcc 26198
caccctctat gccatcttcc agcagagtcg ggggcaggag caggaactga aagtcaagaa 26258
ccgttctctg cgctcgctca cccgcagttg tctgtatcac aagagcgaag accaacttca 26318
gcgcactctc gaggacgccg aggctctctt caacaagtac tgcgcgctca ctcttaaaga 26378
gtagcccgcg cccgcccaca cacggaaaaa ggcgggaatt acgtcaccac ctgcgccctt 26438
cgcccgacca tc atg agc aaa gag att ccc acg cct tac atg tgg agc 26486
Met Ser Lys Glu Ile Pro Thr Pro Tyr Met Trp Ser
5005 5010
tac cag ccc cag atg ggc ctg gcc gcc ggc gcc gcc cag gac tac 26531
Tyr Gln Pro Gln Met Gly Leu Ala Ala Gly Ala Ala Gln Asp Tyr
5015 5020 5025
tcc acc cgc atg aac tgg ctc agt gcc ggg ccc gcg atg atc tca 26576
Ser Thr Arg Met Asn Trp Leu Ser Ala Gly Pro Ala Met Ile Ser
5030 5035 5040
cgg gtg aat gac atc cgc gcc cgc cga aac cag ata ctc cta gaa 26621
Arg Val Asn Asp Ile Arg Ala Arg Arg Asn Gln Ile Leu Leu Glu
5045 5050 5055
cag tca gcg atc acc gcc acg ccc cgc cat cac ctt aat ccg cgt 26666
Gln Ser Ala Ile Thr Ala Thr Pro Arg His His Leu Asn Pro Arg
5060 5065 5070
aat tgg ccc gcc gcc ctg gtg tac cag gaa att ccc cag ccc acg 26711
Asn Trp Pro Ala Ala Leu Val Tyr Gln Glu Ile Pro Gln Pro Thr
5075 5080 5085
acc gta cta ctt ccg cga gac gcc cag gcc gaa gtc cag ctg act 26756
Thr Val Leu Leu Pro Arg Asp Ala Gln Ala Glu Val Gln Leu Thr
5090 5095 5100
aac tca ggt gtc cag ctg gcc ggc ggc gcc gcc ctg tgt cgt cac 26801
Asn Ser Gly Val Gln Leu Ala Gly Gly Ala Ala Leu Cys Arg His
5105 5110 5115
cgc ccc gct cag ggt ata aag cgg ctg gtg atc cga ggc aga ggc 26846
Arg Pro Ala Gln Gly Ile Lys Arg Leu Val Ile Arg Gly Arg Gly
5120 5125 5130
aca cag ctc aac gac gag gtg gtg agc tct tcg ctg ggt ctg cga 26891
Thr Gln Leu Asn Asp Glu Val Val Ser Ser Ser Leu Gly Leu Arg
5135 5140 5145
cct gac gga gtc ttc caa ctc gcc gga tcg ggg aga tct tcc ttc 26936
Pro Asp Gly Val Phe Gln Leu Ala Gly Ser Gly Arg Ser Ser Phe
5150 5155 5160
acg cct cgt cag gcc gtc ctg act ttg gag agt tcg tcc tcg cag 26981
Thr Pro Arg Gln Ala Val Leu Thr Leu Glu Ser Ser Ser Ser Gln
5165 5170 5175
ccc cgc tcg ggc ggc atc ggc act ctc cag ttc gtg gag gag ttc 27026
Pro Arg Ser Gly Gly Ile Gly Thr Leu Gln Phe Val Glu Glu Phe
5180 5185 5190
act ccc tcg gtc tac ttc aac ccc ttc tcc ggc tcc ccc ggc cac 27071
Thr Pro Ser Val Tyr Phe Asn Pro Phe Ser Gly Ser Pro Gly His
5195 5200 5205
tac ccg gac gag ttc atc ccg aac ttc gac gcc atc agc gag tcg 27116
Tyr Pro Asp Glu Phe Ile Pro Asn Phe Asp Ala Ile Ser Glu Ser
5210 5215 5220
gtg gac ggc tac gat tgaatgtccc atggtggcgc agctgaccta gctcggcttc 27171
Val Asp Gly Tyr Asp
5225
gacacctgga ccactgccgc cgcttccgct gcttcgctcg ggatctcgcc gagtttgcct 27231
actttgagct gcccgaggag caccctcagg gcccagccca cggagtgcgg atcatcgtcg 27291
aagggggcct cgactcccac ctgcttcgga tcttcagcca gcgaccgatc ctggtcgagc 27351
gcgaacaagg acagacccgt ctgaccctgt actgcatctg caaccacccc ggcctgc 27408
atg aaa gtc ttt gtt gtc tgc tgt gta ctg agt ata ata aaa gct 27453
Met Lys Val Phe Val Val Cys Cys Val Leu Ser Ile Ile Lys Ala
5230 5235 5240
gag atc agc gac tac tcc gga ctc gat tgt ggt gtt cct gct atc 27498
Glu Ile Ser Asp Tyr Ser Gly Leu Asp Cys Gly Val Pro Ala Ile
5245 5250 5255
aac cgg tcc ctg ttc ttc acc ggg aac gaa acc gag ctc cag ctc 27543
Asn Arg Ser Leu Phe Phe Thr Gly Asn Glu Thr Glu Leu Gln Leu
5260 5265 5270
cag tgt aag ccc cac aag aag tac ctc acc tgg ctg ttc cag ggc 27588
Gln Cys Lys Pro His Lys Lys Tyr Leu Thr Trp Leu Phe Gln Gly
5275 5280 5285
tcc ccc atc gcc gtt gtc aac cac tgc gac aac gac gga gtc ctg 27633
Ser Pro Ile Ala Val Val Asn His Cys Asp Asn Asp Gly Val Leu
5290 5295 5300
ctg agc ggc cct gcc aac ctt act ttt tcc acc cgc aga agc aag 27678
Leu Ser Gly Pro Ala Asn Leu Thr Phe Ser Thr Arg Arg Ser Lys
5305 5310 5315
ctc cag ctc ttc caa ccc ttc ctc ccc ggg acc tat cag tgc gtc 27723
Leu Gln Leu Phe Gln Pro Phe Leu Pro Gly Thr Tyr Gln Cys Val
5320 5325 5330
tcg gga ccc tgc cat cac acc ttc cac ctg atc ccg aat acc aca 27768
Ser Gly Pro Cys His His Thr Phe His Leu Ile Pro Asn Thr Thr
5335 5340 5345
gcg ccg ctc ccc gct act aac aac caa act aac ctc cac caa cgc 27813
Ala Pro Leu Pro Ala Thr Asn Asn Gln Thr Asn Leu His Gln Arg
5350 5355 5360
cac cgt cgc gac ctt tcc tct gaa tct aat acc act acc gga ggt 27858
His Arg Arg Asp Leu Ser Ser Glu Ser Asn Thr Thr Thr Gly Gly
5365 5370 5375
gag ctc cga ggt cga cca acc tct ggg att tac tac ggc ccc tgg 27903
Glu Leu Arg Gly Arg Pro Thr Ser Gly Ile Tyr Tyr Gly Pro Trp
5380 5385 5390
gag gtg gtg ggg tta ata gcg cta ggc cta gtt gtg ggt ggg ctt 27948
Glu Val Val Gly Leu Ile Ala Leu Gly Leu Val Val Gly Gly Leu
5395 5400 5405
ttg gct ctc tgc tac cta tac ctc cct tgc tgt tcg tac tta gtg 27993
Leu Ala Leu Cys Tyr Leu Tyr Leu Pro Cys Cys Ser Tyr Leu Val
5410 5415 5420
gtg ctg tgt tgc tgg ttt aag aaa tgg ggc aga tca ccc tagtgagctg 28042
Val Leu Cys Cys Trp Phe Lys Lys Trp Gly Arg Ser Pro
5425 5430 5435
cggtgtgctg gtggcggtgc tttcgattgt gggactgggc ggcgcggctg tagtgaagga 28102
ggagaaggcc gatccctgct tgcatttcaa tcccgacaaa tgccagctga gttttcagcc 28162
cgatggcaat cggtgcgcgg tgctgatcaa gtgcggatgg gaatgcgaga acgtgagaat 28222
cgagtacaat aacaagactc ggaacaatac tctcgcgtcc gtgtggcagc ccggggaccc 28282
cgagtggtac accgtctctg tccccggtgc tgacggctcc ccgcgcaccg tgaataatac 28342
tttcattttt gcgcacatgt gcaacacggt catgtggatg agcaagcagt acgatatgtg 28402
gccccccacg aaggagaaca tcgtggtctt ctccatcgct tacagcctgt gcacggcgct 28462
aatcaccgct atcgtgtgcc tgagcattca catgctcatc gctattcgcc ccagaaataa 28522
tgccgagaaa gagaaacagc cataacacgt tttttcacac accttgtttt tacagaca 28580
atg cgt ctg tta aat ttt tta aac att gtg ctc agt att gct tat 28625
Met Arg Leu Leu Asn Phe Leu Asn Ile Val Leu Ser Ile Ala Tyr
5440 5445 5450
gcc tct ggt tat gca aac ata cag aaa acc ctt tat gta gga tct 28670
Ala Ser Gly Tyr Ala Asn Ile Gln Lys Thr Leu Tyr Val Gly Ser
5455 5460 5465
gat ggt aca cta gag ggt acc caa tca caa gcc aag gtt gca tgg 28715
Asp Gly Thr Leu Glu Gly Thr Gln Ser Gln Ala Lys Val Ala Trp
5470 5475 5480
tat ttt tat aga acc aac act gat cca gtt aaa ctt tgt aag ggt 28760
Tyr Phe Tyr Arg Thr Asn Thr Asp Pro Val Lys Leu Cys Lys Gly
5485 5490 5495
gaa ttg ccg cgt aca cat aaa act cca ctt aca ttt agt tgc agc 28805
Glu Leu Pro Arg Thr His Lys Thr Pro Leu Thr Phe Ser Cys Ser
5500 5505 5510
aat aat aat ctt aca ctt ttt tca att aca aaa caa tat act ggt 28850
Asn Asn Asn Leu Thr Leu Phe Ser Ile Thr Lys Gln Tyr Thr Gly
5515 5520 5525
act tat tac agt aca aac ttt cat aca gga caa gat aaa tat tat 28895
Thr Tyr Tyr Ser Thr Asn Phe His Thr Gly Gln Asp Lys Tyr Tyr
5530 5535 5540
act gtt aag gta gaa aat cct acc act cct aga act acc acc acc 28940
Thr Val Lys Val Glu Asn Pro Thr Thr Pro Arg Thr Thr Thr Thr
5545 5550 5555
acc acc act act gca aag ccc act gtg aaa act aca act agg acc 28985
Thr Thr Thr Thr Ala Lys Pro Thr Val Lys Thr Thr Thr Arg Thr
5560 5565 5570
acc aca act aca gaa acc acc acc agc aca aca ctt gct gca act 29030
Thr Thr Thr Thr Glu Thr Thr Thr Ser Thr Thr Leu Ala Ala Thr
5575 5580 5585
aca cac aca cac act aag cta acc tta cag acc act aat gat ttg 29075
Thr His Thr His Thr Lys Leu Thr Leu Gln Thr Thr Asn Asp Leu
5590 5595 5600
atc gcc ctg ctg caa aag ggg gat aac agc acc act tcc aat gag 29120
Ile Ala Leu Leu Gln Lys Gly Asp Asn Ser Thr Thr Ser Asn Glu
5605 5610 5615
gag ata ccc aaa tcc atg att ggc att att gtt gct gta gtg gtg 29165
Glu Ile Pro Lys Ser Met Ile Gly Ile Ile Val Ala Val Val Val
5620 5625 5630
tgc atg ttg atc atc gcc ttg tgc atg gtg tac tat gcc ttc tgc 29210
Cys Met Leu Ile Ile Ala Leu Cys Met Val Tyr Tyr Ala Phe Cys
5635 5640 5645
tac aga aag cac aga ctg aac gac aag ctg gaa cac tta cta agt 29255
Tyr Arg Lys His Arg Leu Asn Asp Lys Leu Glu His Leu Leu Ser
5650 5655 5660
gtt gaa ttt taatttttta gaacc atg aag atc cta ggc ctt ttt agt 29303
Val Glu Phe Met Lys Ile Leu Gly Leu Phe Ser
5665 5670
ttt tct atc att acc tct gct ctt tgt gaa tca gtg gat aga gat 29348
Phe Ser Ile Ile Thr Ser Ala Leu Cys Glu Ser Val Asp Arg Asp
5675 5680 5685
gtt act att acc act ggt tct aat tat aca ctg aaa ggg cca ccc 29393
Val Thr Ile Thr Thr Gly Ser Asn Tyr Thr Leu Lys Gly Pro Pro
5690 5695 5700
tca ggt atg ctt tcg tgg tat tgc tat ttt gga act gac act gat 29438
Ser Gly Met Leu Ser Trp Tyr Cys Tyr Phe Gly Thr Asp Thr Asp
5705 5710 5715
caa act gaa tta tgc aat ttt caa aaa ggc aaa acc tca aac tct 29483
Gln Thr Glu Leu Cys Asn Phe Gln Lys Gly Lys Thr Ser Asn Ser
5720 5725 5730
aaa atc tct aat tat caa tgc aat ggc act gat ctg ata cta ctc 29528
Lys Ile Ser Asn Tyr Gln Cys Asn Gly Thr Asp Leu Ile Leu Leu
5735 5740 5745
aat gtc acg aaa gca tat ggt ggc agt tat tat tgc cct gga caa 29573
Asn Val Thr Lys Ala Tyr Gly Gly Ser Tyr Tyr Cys Pro Gly Gln
5750 5755 5760
aac act gaa gaa atg att ttt tac aaa gtg gaa gtg gtt gat ccc 29618
Asn Thr Glu Glu Met Ile Phe Tyr Lys Val Glu Val Val Asp Pro
5765 5770 5775
act aca cca ccc acc acc aca act att cat acc aca cac aca gaa 29663
Thr Thr Pro Pro Thr Thr Thr Thr Ile His Thr Thr His Thr Glu
5780 5785 5790
caa aca cca gag gca aca gaa gca gag ttg gcc ttc cag gtt cac 29708
Gln Thr Pro Glu Ala Thr Glu Ala Glu Leu Ala Phe Gln Val His
5795 5800 5805
gga gat tcc ttt gct gtc aat acc cct aca ccc gat cag cgg tgt 29753
Gly Asp Ser Phe Ala Val Asn Thr Pro Thr Pro Asp Gln Arg Cys
5810 5815 5820
ccg ggg ccg cta gtc agc ggc att gtc ggt gtg ctt tcg gga tta 29798
Pro Gly Pro Leu Val Ser Gly Ile Val Gly Val Leu Ser Gly Leu
5825 5830 5835
gca gtc ata atc atc tgc atg ttc att ttt gct tgc tgc tat aga 29843
Ala Val Ile Ile Ile Cys Met Phe Ile Phe Ala Cys Cys Tyr Arg
5840 5845 5850
agg ctt tac cga caa aaa tca gac cca ctg ctg aac ctc tat gtt 29888
Arg Leu Tyr Arg Gln Lys Ser Asp Pro Leu Leu Asn Leu Tyr Val
5855 5860 5865
taattttttc cagagcc atg aag gca gtt agc gct cta gtt ttt tgt tct 29938
Met Lys Ala Val Ser Ala Leu Val Phe Cys Ser
5870 5875
ttg att ggc att gtt ttt aat agt aaa att acc aaa gtt agc ttt 29983
Leu Ile Gly Ile Val Phe Asn Ser Lys Ile Thr Lys Val Ser Phe
5880 5885 5890
att aaa cat gtt aat gta act gaa gga gat aac atc aca cta gca 30028
Ile Lys His Val Asn Val Thr Glu Gly Asp Asn Ile Thr Leu Ala
5895 5900 5905
ggt gta gaa ggt gct caa aac acc acc tgg aca aaa tac cat cta 30073
Gly Val Glu Gly Ala Gln Asn Thr Thr Trp Thr Lys Tyr His Leu
5910 5915 5920
gga tgg aga gat att tgc acc tgg aat gta act tat tat tgc ata 30118
Gly Trp Arg Asp Ile Cys Thr Trp Asn Val Thr Tyr Tyr Cys Ile
5925 5930 5935
gga att aat ctt acc att gtt aac gct aac caa tct cag aat ggg 30163
Gly Ile Asn Leu Thr Ile Val Asn Ala Asn Gln Ser Gln Asn Gly
5940 5945 5950
tta att aaa gga cag agt gtt agt gtg acc agt gat ggg tac tat 30208
Leu Ile Lys Gly Gln Ser Val Ser Val Thr Ser Asp Gly Tyr Tyr
5955 5960 5965
acc cag cat agt ttt aac tac aac att act gtc ata cca ctg cct 30253
Thr Gln His Ser Phe Asn Tyr Asn Ile Thr Val Ile Pro Leu Pro
5970 5975 5980
acg cct agc cca cct agc act acc aca cag aca acc aca tac agt 30298
Thr Pro Ser Pro Pro Ser Thr Thr Thr Gln Thr Thr Thr Tyr Ser
5985 5990 5995
aca tca aat cag cct acc acc act aca gca gca gag gtt gcc agc 30343
Thr Ser Asn Gln Pro Thr Thr Thr Thr Ala Ala Glu Val Ala Ser
6000 6005 6010
tcg tct ggg gtc cga gtg gca ttt ttg atg ttg gcc cca tct agc 30388
Ser Ser Gly Val Arg Val Ala Phe Leu Met Leu Ala Pro Ser Ser
6015 6020 6025
agt ccc act gct agt acc aat gag cag act act gaa ttt ttg tcc 30433
Ser Pro Thr Ala Ser Thr Asn Glu Gln Thr Thr Glu Phe Leu Ser
6030 6035 6040
act gtc gag agc cac acc aca gct acc tcc agt gcc ttc tct agc 30478
Thr Val Glu Ser His Thr Thr Ala Thr Ser Ser Ala Phe Ser Ser
6045 6050 6055
acc gcc aat ctc tcc tcg ctt tcc tct aca cca atc agc ccc gct 30523
Thr Ala Asn Leu Ser Ser Leu Ser Ser Thr Pro Ile Ser Pro Ala
6060 6065 6070
act act cct agc ccc gct cct ctt ccc act ccc ctg aag caa aca 30568
Thr Thr Pro Ser Pro Ala Pro Leu Pro Thr Pro Leu Lys Gln Thr
6075 6080 6085
gac ggc ggc atg caa tgg cag atc acc ctg ctc att gtg atc ggg 30613
Asp Gly Gly Met Gln Trp Gln Ile Thr Leu Leu Ile Val Ile Gly
6090 6095 6100
ttg gtc atc ctg gcc gtg ttg ctc tac tac atc ttc tgc cgc cgc 30658
Leu Val Ile Leu Ala Val Leu Leu Tyr Tyr Ile Phe Cys Arg Arg
6105 6110 6115
att ccc aac gcg cac cgc aag ccg gcc tac aag ccc atc gtt atc 30703
Ile Pro Asn Ala His Arg Lys Pro Ala Tyr Lys Pro Ile Val Ile
6120 6125 6130
ggg cag ccg gag ccg ctt cag gtg gaa ggg ggt cta agg aat ctt 30748
Gly Gln Pro Glu Pro Leu Gln Val Glu Gly Gly Leu Arg Asn Leu
6135 6140 6145
ctc ttc tct ttt aca gta tgg tgattgaact atg att cct aga caa ttc 30797
Leu Phe Ser Phe Thr Val Trp Met Ile Pro Arg Gln Phe
6150 6155 6160
ttg atc act att ctt atc tgc ctc ctc caa gtc tgt gcc acc ctc 30842
Leu Ile Thr Ile Leu Ile Cys Leu Leu Gln Val Cys Ala Thr Leu
6165 6170 6175
gct ctg gtg gcc aac gcc agt cca gac tgt att ggg ccc ttc gcc 30887
Ala Leu Val Ala Asn Ala Ser Pro Asp Cys Ile Gly Pro Phe Ala
6180 6185 6190
tcc tac gtg ctc ttt gcc ttc atc acc tgc atc tgc tgc tgt agc 30932
Ser Tyr Val Leu Phe Ala Phe Ile Thr Cys Ile Cys Cys Cys Ser
6195 6200 6205
ata gtc tgc ctg ctt atc acc ttc ttc cag ttc att gac tgg atc 30977
Ile Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Ile Asp Trp Ile
6210 6215 6220
ttt gtg cgc atc gcc tac ctg cgc cac cac ccc cag tac cgc gac 31022
Phe Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg Asp
6225 6230 6235
cag cga gtg gcg cag ctg ctc agg ctc ctc tgataagc atg cgg gct 31069
Gln Arg Val Ala Gln Leu Leu Arg Leu Leu Met Arg Ala
6240 6245
ctg cta ctt ctc gcg ctt ctg ctg tta gtg ctc ccc cgt ccc gtt 31114
Leu Leu Leu Leu Ala Leu Leu Leu Leu Val Leu Pro Arg Pro Val
6250 6255 6260
gac ccc cgg ccc ccc act cag tcc ccc gag gag gtc cgc aaa tgc 31159
Asp Pro Arg Pro Pro Thr Gln Ser Pro Glu Glu Val Arg Lys Cys
6265 6270 6275
aaa ttc caa gaa ccc tgg aaa ttc ctc aaa tgc tac cgc caa aaa 31204
Lys Phe Gln Glu Pro Trp Lys Phe Leu Lys Cys Tyr Arg Gln Lys
6280 6285 6290
tca gac atg cat ccc agc tgg atc atg atc att ggg atc gtg aac 31249
Ser Asp Met His Pro Ser Trp Ile Met Ile Ile Gly Ile Val Asn
6295 6300 6305
att ctg gcc tgc acc ctc atc tcc ttt gtg att tac ccc tgc ttt 31294
Ile Leu Ala Cys Thr Leu Ile Ser Phe Val Ile Tyr Pro Cys Phe
6310 6315 6320
gac ttt ggt tgg aac tcg cca gag gcg ctc tat ctc ccg cct gaa 31339
Asp Phe Gly Trp Asn Ser Pro Glu Ala Leu Tyr Leu Pro Pro Glu
6325 6330 6335
cct gac aca cca cca cag caa cct cag gca cac gca cta cca cca 31384
Pro Asp Thr Pro Pro Gln Gln Pro Gln Ala His Ala Leu Pro Pro
6340 6345 6350
cca cca cag cct agg cca caa tac atg ccc ata tta gac tat gag 31429
Pro Pro Gln Pro Arg Pro Gln Tyr Met Pro Ile Leu Asp Tyr Glu
6355 6360 6365
gcc gag cca cag cga ccc atg ctc ccc gct att agt tac ttc aat 31474
Ala Glu Pro Gln Arg Pro Met Leu Pro Ala Ile Ser Tyr Phe Asn
6370 6375 6380
cta acc ggc gga gat gac tgacccactg gccaacaaca acgtcaacga 31522
Leu Thr Gly Gly Asp Asp
6385 6390
ccttctcctg gacatggacg gccgcgcctc ggagcagcga ctcgcccaac ttcgcattcg 31582
ccagcagcag gagagagccg tcaaggagct gcaggacggc atagccatcc accagtgcaa 31642
gaaaggcatc ttctgcctgg tgaaacaggc caagatctcc tacgaggtca cccagaccga 31702
ccatcgcctc tcctacgagc tcatgcagca gcgccagaag ttcacctgcc tggtcggagt 31762
caaccccatc gtcatcaccc agcagtcggg cgataccaag gggtgcatcc actgctcctg 31822
cgactccccc gactgcgtcc acactctgat caagaccctc tgcggcctcc gcgacctcct 31882
ccccatgaac taatcacccc cttatccagt gaaataaaga tcatattgat gattaaataa 31942
aaaaaataat catttgattt gaaataaaga tacaatcata ttgatgattt gagtttaata 32002
aaaataaaga atcacttact tgaaatctga taccaggtct ctgtccatgt tttctgccaa 32062
caccacttca ctcccctctt cccagctctg gtactgcagg ccccggcggg ctgcaaactt 32122
cctccacacc ctgaagggga tgtcaaattc ctcctgtccc tcaatcttca ttttatcttc 32182
tatcag atg tcc aaa aag cgc gtc cgg gtg gat gat gac ttc gac ccc 32230
Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro
6395 6400
gtc tac ccc tac gat gca gac aac gca ccg acc gtg ccc ttc atc 32275
Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile
6405 6410 6415
aac ccc ccc ttc gtc tct tca gat gga ttc caa gag aag ccc ctg 32320
Asn Pro Pro Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu
6420 6425 6430
ggg gtg ctg tcc ctg cgt ctg gcc gat ccc gtc acc acc aag aac 32365
Gly Val Leu Ser Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn
6435 6440 6445
ggg gaa atc acc ctc aag ctg gga gat ggg gtg gac ctc gac gac 32410
Gly Glu Ile Thr Leu Lys Leu Gly Asp Gly Val Asp Leu Asp Asp
6450 6455 6460
tcg gga aaa ctc atc tcc aac acg gcc acc aag gcc gcc gcc cct 32455
Ser Gly Lys Leu Ile Ser Asn Thr Ala Thr Lys Ala Ala Ala Pro
6465 6470 6475
ctc agt ttt tcc aac aac acc att tcc ctt aac atg gat acc cct 32500
Leu Ser Phe Ser Asn Asn Thr Ile Ser Leu Asn Met Asp Thr Pro
6480 6485 6490
ctt tac aac aac aat gga aag cta ggt atg aag gta acc gca cca 32545
Leu Tyr Asn Asn Asn Gly Lys Leu Gly Met Lys Val Thr Ala Pro
6495 6500 6505
tta aag ata tta gac aca gat cta cta aaa aca ctt gtt gtt gct 32590
Leu Lys Ile Leu Asp Thr Asp Leu Leu Lys Thr Leu Val Val Ala
6510 6515 6520
tat ggg cag gga tta gga aca aac acc aat ggt gct ctt gtt gcc 32635
Tyr Gly Gln Gly Leu Gly Thr Asn Thr Asn Gly Ala Leu Val Ala
6525 6530 6535
caa cta gca tac cca ctt gtt ttt aat acc gct agc aaa att gcc 32680
Gln Leu Ala Tyr Pro Leu Val Phe Asn Thr Ala Ser Lys Ile Ala
6540 6545 6550
ctt aat tta ggc aat gga cca tta aaa gtg gat gca aat aga ctg 32725
Leu Asn Leu Gly Asn Gly Pro Leu Lys Val Asp Ala Asn Arg Leu
6555 6560 6565
aac att aat tgc aaa aga ggt atc tat gtc act acc aca aaa gat 32770
Asn Ile Asn Cys Lys Arg Gly Ile Tyr Val Thr Thr Thr Lys Asp
6570 6575 6580
gca ctg gag att aat atc agt tgg gca aat gct atg aca ttt ata 32815
Ala Leu Glu Ile Asn Ile Ser Trp Ala Asn Ala Met Thr Phe Ile
6585 6590 6595
gga aat gcc att ggt gtc aat att gac aca aaa aaa ggc cta cag 32860
Gly Asn Ala Ile Gly Val Asn Ile Asp Thr Lys Lys Gly Leu Gln
6600 6605 6610
ttc ggc act tca agc act gaa aca gat gtt aaa aat gct ttt cca 32905
Phe Gly Thr Ser Ser Thr Glu Thr Asp Val Lys Asn Ala Phe Pro
6615 6620 6625
ctc caa gta aaa ctt gga gct ggt ctt aca ttt gac agc aca ggt 32950
Leu Gln Val Lys Leu Gly Ala Gly Leu Thr Phe Asp Ser Thr Gly
6630 6635 6640
gcc att gtt gct tgg aac aaa gaa gat gac aaa ctt aca ctg tgg 32995
Ala Ile Val Ala Trp Asn Lys Glu Asp Asp Lys Leu Thr Leu Trp
6645 6650 6655
acc aca gcc gat cca tct cca aac tgt cac ata tat tct gca aag 33040
Thr Thr Ala Asp Pro Ser Pro Asn Cys His Ile Tyr Ser Ala Lys
6660 6665 6670
gat gct aag ctt aca ctc tgc ttg aca aag tgt ggt agt cag ata 33085
Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln Ile
6675 6680 6685
ctg ggc act gtt tct ctc ata gct gtt gat act ggt agc tta aat 33130
Leu Gly Thr Val Ser Leu Ile Ala Val Asp Thr Gly Ser Leu Asn
6690 6695 6700
cca ata aca gga aaa gta acc act gct ctt gtt tca ctt aaa ttc 33175
Pro Ile Thr Gly Lys Val Thr Thr Ala Leu Val Ser Leu Lys Phe
6705 6710 6715
gat gcc aat gga gtt ttg caa gcc agt tca aca cta gat aaa gaa 33220
Asp Ala Asn Gly Val Leu Gln Ala Ser Ser Thr Leu Asp Lys Glu
6720 6725 6730
tat tgg aat ttc aga aaa gga gat gtg aca cct gct gac ccc tac 33265
Tyr Trp Asn Phe Arg Lys Gly Asp Val Thr Pro Ala Asp Pro Tyr
6735 6740 6745
act aat gct ata ggc ttt atg ccc aac ctt aat gca tac cca aaa 33310
Thr Asn Ala Ile Gly Phe Met Pro Asn Leu Asn Ala Tyr Pro Lys
6750 6755 6760
aac aca aac gca gct gca aaa agt cac att gtt gga aaa gta tac 33355
Asn Thr Asn Ala Ala Ala Lys Ser His Ile Val Gly Lys Val Tyr
6765 6770 6775
cta cat ggg gat gta agc aag cca cta gac ttg ata att aca ttt 33400
Leu His Gly Asp Val Ser Lys Pro Leu Asp Leu Ile Ile Thr Phe
6780 6785 6790
aat gaa acc agt gat gaa tcc tgt act tat tgc att aac ttt cag 33445
Asn Glu Thr Ser Asp Glu Ser Cys Thr Tyr Cys Ile Asn Phe Gln
6795 6800 6805
tgg cag tgg gga act gac caa tat aaa gat gaa aca ctt gca gtc 33490
Trp Gln Trp Gly Thr Asp Gln Tyr Lys Asp Glu Thr Leu Ala Val
6810 6815 6820
agt tca ttc acc ttc tca tac att gct aaa gaa taacatccac 33533
Ser Ser Phe Thr Phe Ser Tyr Ile Ala Lys Glu
6825 6830 6835
cctgcatgcc aacccatttc cctctatcta tacatggaaa actctgaagc agaaaaaata 33593
aagttcaagt gttttattga ttcaacagtt tttacagaat tcgagtagtt attttccctc 33653
caccctccca actcatggaa tacaccatcc tctccccacg cacagcctta aacatctgaa 33713
tgccattggt aatggacatg gttttggcct ccacattcca cacagtttca gagcgagcca 33773
gtctcgggtc ggtcagggag atgaaaccct ccgggcactc ctgcatctgc acctcacagt 33833
tcaacagctg agggctgtcc tcggtggtcg ggatcacggt tatctggaag aagagcgatg 33893
agaatcataa tccgcgaacg ggatcgggcg gttgtggcgc atcaggcccc gcagcagtcg 33953
ctgtctgcgc cgctccgtca agctgctgct caaggggtcc gggtccaggg actccctgcg 34013
catgatgccg atggccctga gcatcagtcg cctggtgcgg cgggcgcagc agcggatgcg 34073
gatctcactc aggtcggagc agtacgtgca gcacagcacc accaagttgt tcaacagtcc 34133
atagttcaac acgctccagc caaaactcat ctgtggaact atgctgccca cgtgtccatc 34193
gtaccagatc ctgatgtaaa tcaggtggcg ccccctccag aacacactgc ccatgtacat 34253
gatctccttg ggcatatgca ggttcaccac ctcccggtac cacatcaccc gctggttgaa 34313
catgcagccc tggataattc tgcggaacca gatggccagc accgccccgc ccgccatgca 34373
gcgcagggac cccgggtcct gacagtggca gtggaggacc caccgctcgc ggccgtggat 34433
caactgggag ctgaacaggt ctatgttggc acagcacagg cacacgctca tgcatgtctt 34493
cagcactctc agttcctcgg gggtcaggac catgtcccag ggcacgggga actcttgcag 34553
gacagtgaac ccggcagaac agggcagccc tcgcacacaa cttacattgt gcatggacag 34613
ggtatcgcaa tcaggcagca ccggatgatc ctccaccaga gaagcgcggg tctcggtctc 34673
ctcacaacga ggtaaggggg ccggcggttg gtacggatga tggcgggatg acgctaatcg 34733
tgttctggat cgtgtcatga tggagcttct tcctgacatc ttcgtatttc atgtagcaga 34793
acctggtccg ggcactgcac accgctcgcc ggcgacggtc tcggcgcttc gagcgctcgg 34853
tgttgaagtt gtaaaacagc cactccctca gagcgtgcag tatctcttga gcctcttggg 34913
tgatgaaaat cccatccgcc ctgatggctc tgatcacatc gaccacggtg gaatgggcca 34973
gacccagcca gatgatgcaa ttttgttggg tttcggtgac ggcgggggag ggaagaacag 35033
gaagaaccat gattaacttt attccaaacg gtctcggagc acttcaaaat gcaggtcgcg 35093
gagatggcac ctctcgcccc cactgtgttg atggaaaata acagccaggt caaaggtgac 35153
acggttctcg agatgttcca cggtggcttc cagcaaagcc tccacgcgca catccagaaa 35213
caagaggaca gcgaaagcgg gagcgttctc taattcctca atcatcatat tacactcctg 35273
caccatcccc agataatttt catttttcca gccttgaatg atttgaacta gttcctgagg 35333
taaatccaag ccagccatga taaaaagctc gcgcagagcg ccctccaccg gcattcttaa 35393
gcacaccctc ataattccaa gagattctgc tcctggttca cctgcagcag attaacaagg 35453
ggaatatcaa aatctctgcc gcgatctcta agctcctccc tcagcaataa ctgcaagtac 35513
tctttcatat cttctccgaa atttttagcc atagggccgc caggaatgag agcagggcaa 35573
gccacattac agataaagcg aagtcctccc cagtgagcat tgccaaatgt aagattgaaa 35633
taagcatgct ggctagaccc ggtgatatct tccagataac tggacagaaa atcaggcaag 35693
caatttttaa gaaaatcaac aaaagaaaag tcgtccaggt gcaagtttag agcctcagga 35753
acaacgatgg aataagtgca aggagtgcgt tccagcatgg ttagtgtttt tttggtgatc 35813
tgtagaacaa aaaataaaca tgcaatatta aaccatgcta gcctggcgaa caggtgggta 35873
aatcactctt tccagcacca ggcaggctac ggggtctccg gcgcgaccct cgtagaagct 35933
gtcgccatga ttgaaaagca tcaccgaaag actttcccgg tggccggcat ggatgattcg 35993
cgaagacgcg tacactccgg gaacattggc atccgtgagt gaaaaaaatc gccccaagaa 36053
gccccgaggc actacaatgc tcaaccttaa ttccagcaga gcgaccccat gcggatgaag 36113
cacaaaattg gtaggtgcgt aaaaaatgta attactcccc tcctgcacag gcagcaaagc 36173
ccccgctccc tccagaaaca catacaaagc ctcagcgtcc atagcttacc gagcacggca 36233
ggcgcaagat tcagagaaaa ggctgagctc taacctgact gcccgctcct gagctcaata 36293
tatagcccta acctacactg acgtaaaggc caaagtctaa aaatacccgc caaaatgaca 36353
cacacgccca gcacacgccc agaaaccggt gacacactca aaaaaatacg tgcgcttcct 36413
caaacgccca aaccggcgtc atttccgggt tcccacgcta cgtcaccgct cagcgacttt 36473
caaatttcgt cgaccgttaa acacgtcact cgccccgccc ctaacggtcg ccctcctctc 36533
ggccaatcac agccccgcat ccccaaattc aaacgcctca tttgcatatt aacgcgcacc 36593
aaaagtttga ggtatattat tgatgatg 36621
<210> 99
<211> 503
<212> PRT
<213> Simian adenovirus 30
<400> 99
Met Glu Ser Arg Asn Pro Phe Gln Gln Gly Leu Pro Ser Gly Leu Leu
1 5 10 15
Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn
20 25 30
Leu Arg Leu Leu Ala Ser Thr Ala Gly Arg His Ala Glu Asp Pro Glu
35 40 45
Ser Pro Val Thr Pro Gly Thr Pro Thr Pro Pro Ala Ala Ala Ala Gly
50 55 60
Ala Ala Ala Arg Gly Gly Gly Pro Arg Arg Glu Pro Glu Ser Arg Ser
65 70 75 80
Gly Pro Ser Gly Gly Gly Gly Gly Gly Val Ala Asp Leu Phe Pro Glu
85 90 95
Leu Arg Arg Val Leu Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly Ile
100 105 110
Lys Arg Glu Arg His Glu Glu Thr Ser His Arg Thr Glu Leu Thr Val
115 120 125
Ser Leu Met Ser Arg Arg Arg Pro Glu Ser Val Trp Trp His Glu Val
130 135 140
Gln Ser Gln Gly Val Asp Glu Val Ser Val Met His Glu Lys Tyr Ser
145 150 155 160
Leu Glu Gln Val Lys Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp Glu
165 170 175
Val Ala Ile Arg Asn Tyr Ala Lys Leu Ala Leu Arg Pro Asp Lys Lys
180 185 190
Tyr Lys Ile Thr Lys Leu Ile Asn Ile Arg Asn Ser Cys Tyr Ile Ser
195 200 205
Gly Asn Gly Ala Glu Val Glu Ile Ser Thr Gln Glu Arg Val Ala Phe
210 215 220
Arg Cys Cys Met Met Asn Met Tyr Pro Gly Val Val Gly Met Glu Gly
225 230 235 240
Val Thr Phe Met Asn Ala Arg Phe Arg Gly Asp Gly Tyr Asn Gly Val
245 250 255
Val Phe Met Ala Asn Thr Lys Leu Thr Val His Gly Cys Ser Phe Phe
260 265 270
Gly Phe Asn Asn Met Cys Ile Glu Ala Trp Gly Ser Val Ser Val Arg
275 280 285
Gly Cys Ser Phe Ser Ala Asn Trp Met Gly Val Val Gly Arg Thr Lys
290 295 300
Ser Lys Val Ser Val Lys Lys Cys Leu Phe Glu Arg Cys His Leu Gly
305 310 315 320
Val Met Ser Glu Gly Glu Ala Lys Val Lys His Cys Ala Ser Thr Glu
325 330 335
Thr Gly Cys Phe Val Cys Ile Lys Gly Asn Ala Gln Val Lys His Asn
340 345 350
Met Ile Cys Gly Ala Ser Asp Glu Arg Gly Tyr Gln Met Leu Thr Cys
355 360 365
Ala Gly Gly Asn Ser His Met Leu Ala Thr Val His Val Ala Ser His
370 375 380
Pro Arg Lys Thr Trp Pro Glu Phe Glu His Asn Val Met Thr Arg Cys
385 390 395 400
Asn Val His Leu Gly Ser Arg Arg Gly Met Phe Met Pro Tyr Gln Cys
405 410 415
Asn Met Gln Phe Val Lys Val Leu Leu Glu Pro Asp Ala Met Ser Arg
420 425 430
Val Ser Leu Ala Gly Val Phe Asp Met Asn Val Glu Leu Trp Lys Ile
435 440 445
Leu Arg Tyr Asp Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys Gly
450 455 460
Gly Lys His Ala Arg Leu Gln Pro Val Cys Val Glu Val Thr Glu Asp
465 470 475 480
Leu Arg Pro Asp His Leu Val Leu Ser Cys Asn Gly Thr Glu Phe Gly
485 490 495
Ser Ser Gly Glu Glu Ser Asp
500
<210> 100
<211> 157
<212> PRT
<213> Simian adenovirus 30
<400> 100
Met Arg Gly Arg Met Thr Lys Ile Cys Val Phe Leu Cys Ser Ser Met
1 5 10 15
Ser Gly Ser Ala Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu Thr
20 25 30
Gly Arg Leu Pro Ser Trp Ala Gly Val Arg Gln Asn Val Met Gly Ser
35 40 45
Thr Val Asp Gly Arg Pro Val Gln Pro Ala Asn Ser Ser Thr Leu Thr
50 55 60
Tyr Ala Thr Leu Ser Ser Ser Ser Val Asp Ala Ala Ala Ala Ala Ala
65 70 75 80
Ala Ala Ser Ala Ala Ser Ala Val Arg Gly Met Ala Leu Gly Ala Gly
85 90 95
Tyr Tyr Ser Ser Leu Val Ala Asn Ser Ser Ser Thr Asn Asn Pro Ala
100 105 110
Ser Leu Asn Glu Glu Lys Leu Leu Leu Leu Met Ala Gln Leu Glu Ala
115 120 125
Leu Thr Gln Arg Leu Gly Glu Leu Thr Gln Gln Val Ala Gln Leu Gln
130 135 140
Ala Glu Thr Arg Ala Ala Val Ala Thr Val Lys Thr Lys
145 150 155
<210> 101
<211> 392
<212> PRT
<213> Simian adenovirus 30
<400> 101
Met His Pro Val Leu Arg Gln Met Arg Pro His His Pro Pro Pro Gln
1 5 10 15
Gln Gln Pro Pro Pro Gln Pro Ala Leu Leu Pro Pro Pro Gln Gln Gln
20 25 30
Leu Pro Ala Thr Thr Ala Ala Ala Ala Val Ser Gly Ala Gly Gln Ser
35 40 45
Tyr Asp His Gln Leu Ala Leu Glu Glu Gly Glu Gly Leu Ala Arg Leu
50 55 60
Gly Ala Ser Ser Pro Glu Arg His Pro Arg Val Gln Met Lys Arg Asp
65 70 75 80
Ala Arg Glu Ala Tyr Val Pro Lys Gln Asn Leu Phe Arg Asp Arg Ser
85 90 95
Gly Glu Glu Pro Glu Glu Met Arg Ala Ala Arg Phe His Ala Gly Arg
100 105 110
Glu Leu Arg Arg Gly Leu Asp Arg Lys Arg Val Leu Arg Asp Glu Asp
115 120 125
Phe Glu Ala Asp Glu Leu Thr Gly Ile Ser Pro Ala Arg Ala His Val
130 135 140
Ala Ala Ala Asn Leu Val Thr Ala Tyr Glu Gln Thr Val Lys Glu Glu
145 150 155 160
Ser Asn Phe Gln Lys Ser Phe Asn Asn His Val Arg Thr Leu Ile Ala
165 170 175
Arg Glu Glu Val Thr Leu Gly Leu Met His Leu Trp Asp Leu Leu Glu
180 185 190
Ala Ile Val Gln Asn Pro Thr Ser Lys Pro Leu Thr Ala Gln Leu Phe
195 200 205
Leu Val Val Gln His Ser Arg Asp Asn Glu Thr Phe Arg Glu Ala Leu
210 215 220
Leu Asn Ile Thr Glu Pro Glu Gly Arg Trp Leu Leu Asp Leu Val Asn
225 230 235 240
Ile Leu Gln Ser Ile Val Val Gln Glu Arg Gly Leu Pro Leu Ser Glu
245 250 255
Lys Leu Ala Ala Ile Asn Phe Ser Val Leu Ser Leu Gly Lys Tyr Tyr
260 265 270
Ala Arg Lys Ile Tyr Lys Thr Pro Tyr Val Pro Ile Asp Lys Glu Val
275 280 285
Lys Ile Asp Gly Phe Tyr Met Arg Met Thr Leu Lys Val Leu Thr Leu
290 295 300
Ser Asp Asp Leu Gly Val Tyr Arg Asn Asp Arg Met His Arg Ala Val
305 310 315 320
Ser Ala Ser Arg Arg Arg Glu Leu Ser Asp Gln Glu Leu Met His Ser
325 330 335
Leu Gln Arg Ala Leu Thr Gly Ala Gly Thr Glu Gly Glu Ser Tyr Phe
340 345 350
Asp Met Gly Ala Asp Leu Arg Trp Gln Pro Ser Arg Arg Ala Leu Glu
355 360 365
Ala Ala Gly Gly Ser Pro Tyr Val Glu Glu Val Asp Asp Glu Val Asp
370 375 380
Glu Glu Gly Glu Tyr Leu Glu Asp
385 390
<210> 102
<211> 593
<212> PRT
<213> Simian adenovirus 30
<400> 102
Met Gln Gln Gln Pro Pro Pro Asp Pro Ala Met Arg Ala Ala Leu Gln
1 5 10 15
Ser Gln Pro Ser Gly Ile Asn Ser Ser Asp Asp Trp Thr Gln Ala Met
20 25 30
Gln Arg Ile Met Ala Leu Thr Thr Arg Asn Pro Glu Ala Phe Arg Gln
35 40 45
Gln Pro Gln Ala Asn Arg Leu Ser Ala Ile Leu Glu Ala Val Val Pro
50 55 60
Ser Arg Ser Asn Pro Thr His Glu Lys Val Leu Ala Ile Val Asn Ala
65 70 75 80
Leu Val Glu Asn Lys Ala Ile Arg Gly Asp Glu Ala Gly Leu Val Tyr
85 90 95
Asn Ala Leu Leu Glu Arg Val Ala Arg Tyr Asn Ser Thr Asn Val Gln
100 105 110
Thr Asn Leu Asp Arg Met Val Thr Asp Val Arg Glu Ala Val Ala Gln
115 120 125
Arg Glu Arg Phe His Arg Glu Ser Asn Leu Gly Ser Met Val Ala Leu
130 135 140
Asn Ala Phe Leu Ser Thr Gln Pro Ala Asn Val Pro Arg Gly Gln Glu
145 150 155 160
Asp Tyr Thr Asn Phe Ile Ser Ala Leu Arg Leu Met Val Thr Glu Val
165 170 175
Pro Gln Ser Glu Val Tyr Gln Ser Gly Pro Asp Tyr Phe Phe Gln Thr
180 185 190
Ser Arg Gln Gly Leu Gln Thr Val Asn Leu Ser Gln Ala Phe Lys Asn
195 200 205
Leu Gln Gly Leu Trp Gly Val Gln Ala Pro Val Gly Asp Arg Ala Thr
210 215 220
Val Ser Ser Leu Leu Thr Pro Asn Ser Arg Leu Leu Leu Leu Leu Val
225 230 235 240
Ala Pro Phe Thr Asp Ser Gly Ser Ile Asn Arg Asn Ser Tyr Leu Gly
245 250 255
Tyr Leu Ile Asn Leu Tyr Arg Glu Ala Ile Gly Gln Ala His Val Asp
260 265 270
Glu Gln Thr Tyr Gln Glu Ile Thr His Val Ser Arg Ala Leu Gly Gln
275 280 285
Asp Asp Pro Gly Asn Leu Glu Ala Thr Leu Asn Phe Leu Leu Thr Asn
290 295 300
Arg Ser Gln Lys Ile Pro Pro Gln Tyr Ala Leu Ser Thr Glu Glu Glu
305 310 315 320
Arg Ile Leu Arg Tyr Val Gln Gln Ser Val Gly Leu Phe Leu Met Gln
325 330 335
Glu Gly Ala Thr Pro Ser Ala Ala Leu Asp Met Thr Ala Arg Asn Met
340 345 350
Glu Pro Ser Met Tyr Ala Ser Asn Arg Pro Phe Ile Asn Lys Leu Met
355 360 365
Asp Tyr Leu His Arg Ala Ala Ala Met Asn Ser Asp Tyr Phe Thr Asn
370 375 380
Ala Ile Leu Asn Pro His Trp Leu Pro Pro Pro Gly Phe Tyr Thr Gly
385 390 395 400
Glu Tyr Asp Met Pro Asp Pro Asn Asp Gly Phe Leu Trp Asp Asp Val
405 410 415
Asp Ser Ser Val Phe Ser Pro Arg Pro Gly Ala Asn Glu Arg Pro Leu
420 425 430
Trp Lys Lys Glu Gly Ser Asp Arg Arg Pro Ser Ser Ala Leu Ser Gly
435 440 445
Arg Glu Gly Ala Ala Ala Ala Val Pro Glu Ala Ala Ser Pro Phe Pro
450 455 460
Ser Leu Pro Phe Ser Leu Asn Ser Ile Arg Ser Ser Glu Leu Gly Arg
465 470 475 480
Ile Thr Arg Pro Arg Leu Leu Gly Glu Glu Glu Tyr Leu Asn Asp Ser
485 490 495
Leu Leu Arg Pro Glu Arg Glu Lys Asn Phe Pro Asn Asn Gly Ile Glu
500 505 510
Ser Leu Val Asp Lys Met Ser Arg Trp Lys Thr Tyr Ala Gln Glu His
515 520 525
Arg Asp Asp Pro Arg Ala Ser Gln Gly Ala Thr Ser Arg Gly Ser Ala
530 535 540
Ala Arg Lys Arg Arg Trp His Asp Arg Gln Arg Gly Leu Met Trp Asp
545 550 555 560
Asp Glu Asp Ser Ala Asp Asp Ser Ser Val Leu Asp Leu Gly Gly Ser
565 570 575
Gly Gly Gly Asn Pro Phe Ala His Leu Arg Pro Arg Ile Gly Arg Met
580 585 590
Met
<210> 103
<211> 533
<212> PRT
<213> Simian adenovirus 30
<400> 103
Met Met Arg Arg Ala Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser
1 5 10 15
Val Met Gln Gln Ala Met Ala Ala Ala Ala Ala Met Gln Pro Pro Leu
20 25 30
Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg
35 40 45
Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg
50 55 60
Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr
65 70 75 80
Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp
85 90 95
Phe Thr Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg
100 105 110
Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn Met Pro
115 120 125
Asn Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met
130 135 140
Val Ser Arg Lys Thr Pro Asn Gly Val Gly Glu Asp Tyr Asp Gly Ser
145 150 155 160
Gln Asp Glu Leu Lys Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly
165 170 175
Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile
180 185 190
Asp Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp
195 200 205
Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro
210 215 220
Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His
225 230 235 240
Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser
245 250 255
Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu
260 265 270
Gly Phe Gln Ile Met Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala
275 280 285
Leu Leu Asp Val Asp Ala Tyr Glu Lys Ser Lys Glu Glu Ala Ala Ala
290 295 300
Glu Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp
305 310 315 320
Asn Phe Ala Ser Ala Ala Ala Val Ala Ala Ala Glu Ala Ala Glu Thr
325 330 335
Glu Ser Lys Ile Val Ile Gln Pro Val Glu Lys Asp Ser Lys Asn Arg
340 345 350
Ser Tyr Asn Val Leu Pro Asp Lys Ile Asn Thr Ala Tyr Arg Ser Trp
355 360 365
Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp
370 375 380
Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val Tyr
385 390 395 400
Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser Thr
405 410 415
Arg Gln Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val
420 425 430
Tyr Ser Lys Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln Leu
435 440 445
Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn
450 455 460
Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu
465 470 475 480
Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser
485 490 495
Ile Arg Gly Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr
500 505 510
Cys Pro Tyr Val Tyr Lys Ala Leu Gly Ile Val Ala Pro Arg Val Leu
515 520 525
Ser Ser Arg Thr Phe
530
<210> 104
<211> 193
<212> PRT
<213> Simian adenovirus 30
<400> 104
Met Ser Ile Leu Ile Ser Pro Ser Asn Asn Thr Gly Trp Gly Leu Arg
1 5 10 15
Ala Pro Ser Lys Met Tyr Gly Gly Ala Arg Gln Arg Ser Thr Gln His
20 25 30
Pro Val Arg Val Arg Gly His Phe Arg Ala Pro Trp Gly Ala Leu Lys
35 40 45
Gly Arg Val Arg Ser Arg Thr Thr Val Asp Asp Val Ile Asp Gln Val
50 55 60
Val Ala Asp Ala Arg Asn Tyr Thr Pro Ala Ala Ala Pro Val Ser Thr
65 70 75 80
Val Asp Ala Val Ile Asp Ser Val Val Ala Asp Ala Arg Arg Tyr Ala
85 90 95
Arg Ala Lys Ser Arg Arg Arg Arg Ile Ala Arg Arg His Arg Ser Thr
100 105 110
Pro Ala Met Arg Ala Ala Arg Ala Leu Leu Arg Arg Ala Arg Arg Thr
115 120 125
Gly Arg Arg Ala Met Leu Arg Ala Ala Arg Arg Ala Ala Ser Gly Ala
130 135 140
Ser Ala Gly Arg Thr Arg Arg Arg Ala Ala Thr Ala Ala Ala Ala Ala
145 150 155 160
Ile Ala Ser Met Ser Arg Pro Arg Arg Gly Asn Val Tyr Trp Val Arg
165 170 175
Asp Ala Ala Thr Gly Val Arg Val Pro Val Arg Thr Arg Pro Pro Arg
180 185 190
Thr
<210> 105
<211> 346
<212> PRT
<213> Simian adenovirus 30
<400> 105
Met Ser Lys Arg Lys Phe Lys Glu Glu Met Leu Gln Val Ile Ala Pro
1 5 10 15
Glu Ile Tyr Gly Pro Ala Ala Ala Val Lys Glu Glu Arg Lys Pro Arg
20 25 30
Lys Ile Lys Arg Val Lys Lys Asp Lys Lys Glu Glu Asp Val Asp Asp
35 40 45
Val Val Glu Phe Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gln Trp
50 55 60
Arg Gly Arg Lys Val Gln Pro Val Leu Arg Pro Gly Thr Thr Val Val
65 70 75 80
Phe Thr Pro Gly Glu Arg Ser Gly Thr Ala Ser Lys Arg Ser Tyr Asp
85 90 95
Glu Val Tyr Gly Asp Asp Asp Ile Leu Glu Gln Ala Ala Glu Arg Leu
100 105 110
Gly Glu Phe Ala Tyr Gly Lys Arg Ser Arg Ser Ala Pro Lys Glu Glu
115 120 125
Ala Val Ser Ile Pro Leu Asp His Gly Asn Pro Thr Pro Ser Leu Lys
130 135 140
Pro Val Thr Leu Gln Gln Val Leu Pro Ser Ala Ala Pro Arg Arg Gly
145 150 155 160
Phe Lys Arg Glu Gly Glu Asp Leu Tyr Pro Thr Met Gln Leu Met Val
165 170 175
Pro Lys Arg Gln Lys Leu Glu Asp Val Leu Glu Thr Met Lys Val Asp
180 185 190
Pro Asp Val Gln Pro Glu Val Lys Val Arg Pro Ile Lys Gln Val Ala
195 200 205
Pro Gly Leu Gly Val Gln Thr Val Asp Ile Lys Ile Pro Thr Glu Pro
210 215 220
Met Glu Thr Gln Thr Glu Pro Met Ile Lys Pro Ser Thr Ser Thr Met
225 230 235 240
Glu Val Gln Thr Asp Pro Trp Met Pro Ala Ala Pro Thr Ser Ser Arg
245 250 255
Arg Pro Arg Arg Lys Tyr Gly Ala Ala Ser Leu Leu Met Pro Asn Tyr
260 265 270
Ala Leu His Pro Ser Ile Ile Pro Thr Pro Gly Tyr Arg Gly Thr Arg
275 280 285
Phe Tyr Arg Gly His Thr Thr Ser Ser Arg Arg Arg Lys Thr Thr Thr
290 295 300
Arg Arg Arg Arg Arg Arg Thr Ala Ala Ala Ala Thr Thr Pro Ala Ala
305 310 315 320
Leu Val Arg Arg Val Tyr Arg Arg Gly Arg Ala Pro Leu Thr Leu Pro
325 330 335
Arg Ala Arg Tyr His Pro Ser Ile Ala Ile
340 345
<210> 106
<211> 77
<212> PRT
<213> Simian adenovirus 30
<400> 106
Met Ala Leu Thr Cys Arg Leu Arg Val Pro Ile Thr Gly Tyr Arg Gly
1 5 10 15
Arg Lys Pro Arg Arg Arg Arg Leu Ala Gly Asn Gly Met Arg Arg His
20 25 30
His His Arg Arg Arg Arg Ala Ile Ser Lys Arg Leu Gly Gly Gly Phe
35 40 45
Leu Pro Ala Leu Ile Pro Ile Ile Ala Ala Ala Ile Gly Ala Ile Pro
50 55 60
Gly Ile Ala Ser Val Ala Val Gln Ala Ser Gln Arg His
65 70 75
<210> 107
<211> 259
<212> PRT
<213> Simian adenovirus 30
<400> 107
Met Asp Ser Asp Ala Pro Gly Pro Val Met Cys Phe Arg Arg Gln Met
1 5 10 15
Glu Asp Ile Asn Phe Ser Ser Leu Ala Pro Arg His Gly Thr Arg Pro
20 25 30
Phe Met Gly Thr Trp Ser Asp Ile Gly Thr Ser Gln Leu Asn Gly Gly
35 40 45
Ala Phe Asn Trp Ser Ser Leu Trp Ser Gly Leu Lys Asn Phe Gly Ser
50 55 60
Thr Leu Lys Thr Tyr Gly Ser Lys Ala Trp Asn Ser Thr Thr Gly Gln
65 70 75 80
Ala Leu Arg Asp Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys Val Val
85 90 95
Asp Gly Leu Ala Ser Gly Ile Asn Gly Val Val Asp Leu Ala Asn Gln
100 105 110
Ala Val Gln Arg Gln Ile Asn Ser Arg Leu Asp Pro Val Pro Pro Ala
115 120 125
Gly Ser Val Glu Met Pro Gln Val Glu Glu Glu Leu Pro Pro Leu Asp
130 135 140
Lys Arg Gly Glu Lys Arg Pro Arg Pro Asp Ala Glu Glu Thr Leu Leu
145 150 155 160
Thr His Thr Asp Glu Pro Pro Pro Tyr Glu Glu Ala Val Lys Leu Gly
165 170 175
Leu Pro Thr Thr Arg Pro Ile Ala Pro Leu Ala Thr Gly Val Leu Lys
180 185 190
Pro Glu Ser Asn Lys Pro Ala Thr Leu Asp Leu Pro Pro Pro Gln Pro
195 200 205
Ser Arg Pro Ser Thr Val Ala Lys Pro Leu Pro Pro Val Ala Val Ala
210 215 220
Arg Ala Arg Pro Gly Gly Thr Ala Arg Pro His Ala Asn Trp Gln Ser
225 230 235 240
Thr Leu Asn Ser Ile Val Gly Leu Gly Val Gln Ser Val Lys Arg Arg
245 250 255
Arg Cys Tyr
<210> 108
<211> 938
<212> PRT
<213> Simian adenovirus 30
<400> 108
Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr
65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Ser Ser Gln Trp Glu Gln Asn Glu Asn Asn Gly Gln Gly
130 135 140
Gln Ala Lys Thr His Thr Tyr Gly Val Ala Ala Met Gly Gly Leu Asp
145 150 155 160
Ile Thr Lys Glu Gly Leu Gln Ile Gly Thr Asp Ala Ser Lys Glu Asp
165 170 175
Asp Asn Glu Ile Tyr Ala Asp Glu Thr Tyr Gln Pro Glu Pro Gln Ile
180 185 190
Gly Glu Glu Asn Trp Gln Asp Thr Glu Asn Phe Tyr Gly Gly Arg Ala
195 200 205
Leu Lys Lys Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Arg
210 215 220
Pro Thr Asn Val Lys Gly Gly Gln Ala Lys Val Lys Thr Glu Glu Asn
225 230 235 240
Val Gln Ser Phe Asp Ile Asp Leu Ala Phe Phe Asp Ile Pro Ser Thr
245 250 255
Gly Thr Gly Ser Asn Gly Thr Asn Val Asn Asp Lys Pro Asp Met Val
260 265 270
Met Tyr Thr Glu Asn Val Asn Leu Glu Thr Pro Asp Thr His Ile Val
275 280 285
Tyr Lys Pro Gly Thr Ser Asp Asp Ser Ser Glu Ala Asn Leu Cys Gln
290 295 300
Gln Ala Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe
305 310 315 320
Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala
325 330 335
Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn
340 345 350
Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr
355 360 365
Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp
370 375 380
Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr
385 390 395 400
Cys Phe Pro Leu Asp Gly Ala Gly Thr Asn Ala Val Tyr Gln Gly Val
405 410 415
Lys Glu Lys Thr Gly Asn Asn Gly Glu Trp Glu Ala Asp Thr Asn Val
420 425 430
Ala Ser Gln Asn Gln Ile Cys Lys Gly Asn Ile Tyr Ala Met Glu Ile
435 440 445
Asn Leu Gln Ala Asn Leu Trp Arg Ser Phe Leu Tyr Ser Asn Val Ala
450 455 460
Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro Ala Asn Ile Thr Leu
465 470 475 480
Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly Arg Val Val Pro
485 490 495
Pro Ser Leu Val Asp Ala Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu
500 505 510
Asp Pro Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly
515 520 525
Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe
530 535 540
His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu
545 550 555 560
Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn
565 570 575
Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala
580 585 590
Ser Ile Ser Phe Thr Ser Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met
595 600 605
Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr
610 615 620
Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr
625 630 635 640
Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser Ile Pro Ser Arg
645 650 655
Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Lys
660 665 670
Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser
675 680 685
Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe
690 695 700
Lys Lys Val Ser Ile Thr Phe Asp Ser Ser Val Ser Trp Pro Gly Asn
705 710 715 720
Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp
725 730 735
Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr Lys Asp Trp Phe
740 745 750
Leu Val Gln Met Leu Ala His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr
755 760 765
Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe
770 775 780
Gln Pro Met Ser Arg Gln Val Val Asp Glu Val Asn Tyr Lys Asp Tyr
785 790 795 800
Gln Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser Gly Phe Val Gly
805 810 815
Tyr Leu Ala Pro Thr Met Arg Gln Gly Gln Pro Tyr Pro Ala Asn Tyr
820 825 830
Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Thr Ser Val Thr Gln Lys
835 840 845
Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn
850 855 860
Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr
865 870 875 880
Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe Glu Val Asp Pro Met
885 890 895
Asp Glu Ser Thr Leu Leu Tyr Val Val Phe Glu Val Phe Asp Val Val
900 905 910
Arg Val His Gln Pro His Arg Gly Val Ile Glu Ala Val Tyr Leu Arg
915 920 925
Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
930 935
<210> 109
<211> 209
<212> PRT
<213> Simian adenovirus 30
<400> 109
Met Met Ala Glu Pro Thr Gly Ser Gly Glu Gln Glu Leu Arg Ala Ile
1 5 10 15
Ile Arg Asp Leu Gly Cys Gly Pro Tyr Phe Leu Gly Thr Phe Asp Lys
20 25 30
Arg Phe Pro Gly Phe Met Ala Pro His Lys Leu Ala Cys Ala Ile Val
35 40 45
Asn Thr Ala Gly Arg Glu Thr Gly Gly Glu His Trp Leu Ala Phe Ala
50 55 60
Trp Asn Pro Arg Ser Asn Thr Cys Tyr Leu Phe Asp Pro Phe Gly Phe
65 70 75 80
Ser Asp Glu Arg Leu Lys Gln Ile Tyr Gln Phe Glu Tyr Glu Gly Leu
85 90 95
Leu Arg Arg Ser Ala Leu Ala Thr Glu Asp Arg Cys Val Thr Leu Glu
100 105 110
Lys Ser Thr Gln Thr Val Gln Gly Pro Arg Ser Ala Ala Cys Gly Leu
115 120 125
Phe Cys Cys Met Phe Leu His Ala Phe Val His Trp Pro Asp Arg Pro
130 135 140
Met Asp Lys Asn Pro Thr Met Asn Leu Leu Thr Gly Val Pro Asn Gly
145 150 155 160
Met Leu Gln Ser Pro Gln Val Glu Pro Thr Leu Arg Arg Asn Gln Glu
165 170 175
Ala Leu Tyr Arg Phe Leu Asn Ala His Ser Ala Tyr Phe Arg Ser His
180 185 190
Arg Ala Arg Ile Glu Lys Ala Thr Ala Phe Asp Arg Met Asn Gln Asp
195 200 205
Met
<210> 110
<211> 801
<212> PRT
<213> Simian adenovirus 30
<400> 110
Met Glu Thr Gln Pro Ser Ser Pro Thr Ser Pro Ser Ala Pro Ala Ser
1 5 10 15
Thr Ala Asp Glu Asn Gln Gln Gln Gln Asn Glu Ser Leu Thr Ala Pro
20 25 30
Pro Pro Ser Pro Thr Ser Asp Ala Ala Ala Pro Asp Met Gln Glu Met
35 40 45
Glu Glu Ser Ile Glu Ile Asp Leu Gly Tyr Val Thr Pro Ala Glu His
50 55 60
Glu Glu Glu Leu Ala Ala Arg Phe Ser Ala Pro Glu Glu Asn His Gln
65 70 75 80
Glu Gln Pro Glu Gln Glu Ala Glu Asn Glu Gln Asn Gln Ala Gly Leu
85 90 95
Glu His Gly Asp Tyr Leu Ser Gly Ala Glu Asp Val Leu Ile Lys His
100 105 110
Leu Ala Arg Gln Cys Ile Ile Val Lys Asp Ala Leu Leu Asp Arg Ala
115 120 125
Glu Val Pro Leu Ser Val Ala Glu Leu Ser Arg Ala Tyr Glu Arg Asn
130 135 140
Leu Phe Ser Pro Arg Val Pro Pro Lys Arg Gln Pro Asn Gly Thr Cys
145 150 155 160
Glu Pro Asn Pro Arg Leu Asn Phe Tyr Pro Val Phe Ala Val Pro Glu
165 170 175
Ala Leu Ala Thr Tyr His Leu Phe Phe Lys Asn Gln Arg Ile Pro Val
180 185 190
Ser Cys Arg Ala Asn Arg Thr Arg Ala Asp Ala Leu Leu Asn Leu Gly
195 200 205
Pro Gly Ala Arg Leu Pro Asp Ile Ala Ser Leu Glu Glu Val Pro Lys
210 215 220
Ile Phe Glu Gly Leu Gly Ser Asp Glu Thr Arg Ala Ala Asn Ala Leu
225 230 235 240
Gln Gly Ser Gly Glu Glu His Glu His His Ser Ala Leu Val Glu Leu
245 250 255
Glu Gly Asp Asn Ala Arg Leu Ala Val Leu Lys Arg Thr Val Glu Leu
260 265 270
Thr His Phe Ala Tyr Pro Ala Leu Asn Leu Pro Pro Lys Val Met Ser
275 280 285
Ala Val Met Asp Gln Val Leu Ile Lys Arg Ala Ser Pro Leu Ser Glu
290 295 300
Glu Glu Met Gln Asp Pro Glu Ser Ser Asp Glu Gly Lys Pro Val Val
305 310 315 320
Ser Asp Glu Gln Leu Ala Arg Trp Leu Gly Ala Ser Ser Thr Pro Gln
325 330 335
Ser Leu Glu Glu Arg Arg Lys Leu Met Met Ala Val Val Leu Val Thr
340 345 350
Val Glu Leu Glu Cys Leu Arg Arg Phe Phe Ala Asp Ala Glu Thr Leu
355 360 365
Arg Lys Val Glu Glu Asn Leu His Tyr Leu Phe Arg His Gly Phe Val
370 375 380
Arg Gln Ala Cys Lys Ile Ser Asn Val Glu Leu Thr Asn Leu Val Ser
385 390 395 400
Tyr Met Gly Ile Leu His Glu Asn Arg Leu Gly Gln Asn Val Leu His
405 410 415
Thr Thr Leu Arg Gly Glu Ala Arg Arg Asp Tyr Ile Arg Asp Cys Val
420 425 430
Tyr Leu Tyr Leu Cys His Thr Trp Gln Thr Gly Met Gly Val Trp Gln
435 440 445
Gln Cys Leu Glu Glu Gln Asn Leu Lys Glu Leu Cys Lys Leu Leu Gln
450 455 460
Lys Asn Leu Lys Ala Leu Trp Thr Gly Phe Asp Glu Arg Thr Thr Ala
465 470 475 480
Ser Asp Leu Ala Asp Leu Ile Phe Pro Glu Arg Leu Arg Leu Thr Leu
485 490 495
Arg Asn Gly Leu Pro Asp Phe Met Ser Gln Ser Met Leu Gln Asn Phe
500 505 510
Arg Ser Phe Ile Leu Glu Arg Ser Gly Ile Leu Pro Ala Thr Cys Ser
515 520 525
Ala Leu Pro Ser Asp Phe Val Pro Leu Thr Phe Arg Glu Cys Pro Pro
530 535 540
Pro Leu Trp Ser His Cys Tyr Leu Leu Arg Leu Ala Asn Tyr Leu Ala
545 550 555 560
Tyr His Ser Asp Val Ile Glu Asp Val Ser Gly Glu Gly Leu Leu Glu
565 570 575
Cys His Cys Arg Cys Asn Leu Cys Thr Pro His Arg Ser Leu Ala Cys
580 585 590
Asn Pro Gln Leu Leu Ser Glu Thr Gln Ile Ile Gly Thr Phe Glu Leu
595 600 605
Gln Gly Pro Gly Asp Ser Glu Gly Lys Gly Gly Leu Lys Leu Thr Pro
610 615 620
Gly Leu Trp Thr Ser Ala Tyr Leu Arg Lys Phe Val Pro Glu Asp Tyr
625 630 635 640
His Pro Phe Glu Ile Arg Phe Tyr Glu Asp Gln Ser Gln Pro Pro Lys
645 650 655
Ala Glu Leu Ser Ala Cys Val Ile Thr Gln Gly Ala Ile Leu Ala Gln
660 665 670
Leu Gln Ala Ile Gln Lys Ser Arg Gln Glu Phe Leu Leu Lys Lys Gly
675 680 685
His Gly Val Tyr Leu Asp Pro Gln Thr Gly Glu Glu Leu Asn Pro Ser
690 695 700
Phe Pro Gln Asp Ala Pro Arg Lys Gln Gln Glu Ala Glu Ser Gly Ala
705 710 715 720
Ala Ala Gly Gly Phe Gly Gly Arg Leu Gly Glu Gln Ser Gly Arg Gly
725 730 735
Gly Asp Gly Arg Leu Gly Gln His Ser Gly Arg Gly Gly Gln Pro Ala
740 745 750
Arg Gln Ser Gly Gly Gly Arg Arg Gly Gly Gly Gly Gly Arg Gly Arg
755 760 765
Ser Ser Arg Arg Gln Thr Val Val Leu Gly Gly Gly Glu Ser Lys Gln
770 775 780
His Gly Tyr His Leu Arg Ser Gly Ser Gly Ser Arg Arg Pro Gly Pro
785 790 795 800
Gln
<210> 111
<211> 227
<212> PRT
<213> Simian adenovirus 30
<400> 111
Met Ser Lys Glu Ile Pro Thr Pro Tyr Met Trp Ser Tyr Gln Pro Gln
1 5 10 15
Met Gly Leu Ala Ala Gly Ala Ala Gln Asp Tyr Ser Thr Arg Met Asn
20 25 30
Trp Leu Ser Ala Gly Pro Ala Met Ile Ser Arg Val Asn Asp Ile Arg
35 40 45
Ala Arg Arg Asn Gln Ile Leu Leu Glu Gln Ser Ala Ile Thr Ala Thr
50 55 60
Pro Arg His His Leu Asn Pro Arg Asn Trp Pro Ala Ala Leu Val Tyr
65 70 75 80
Gln Glu Ile Pro Gln Pro Thr Thr Val Leu Leu Pro Arg Asp Ala Gln
85 90 95
Ala Glu Val Gln Leu Thr Asn Ser Gly Val Gln Leu Ala Gly Gly Ala
100 105 110
Ala Leu Cys Arg His Arg Pro Ala Gln Gly Ile Lys Arg Leu Val Ile
115 120 125
Arg Gly Arg Gly Thr Gln Leu Asn Asp Glu Val Val Ser Ser Ser Leu
130 135 140
Gly Leu Arg Pro Asp Gly Val Phe Gln Leu Ala Gly Ser Gly Arg Ser
145 150 155 160
Ser Phe Thr Pro Arg Gln Ala Val Leu Thr Leu Glu Ser Ser Ser Ser
165 170 175
Gln Pro Arg Ser Gly Gly Ile Gly Thr Leu Gln Phe Val Glu Glu Phe
180 185 190
Thr Pro Ser Val Tyr Phe Asn Pro Phe Ser Gly Ser Pro Gly His Tyr
195 200 205
Pro Asp Glu Phe Ile Pro Asn Phe Asp Ala Ile Ser Glu Ser Val Asp
210 215 220
Gly Tyr Asp
225
<210> 112
<211> 208
<212> PRT
<213> Simian adenovirus 30
<400> 112
Met Lys Val Phe Val Val Cys Cys Val Leu Ser Ile Ile Lys Ala Glu
1 5 10 15
Ile Ser Asp Tyr Ser Gly Leu Asp Cys Gly Val Pro Ala Ile Asn Arg
20 25 30
Ser Leu Phe Phe Thr Gly Asn Glu Thr Glu Leu Gln Leu Gln Cys Lys
35 40 45
Pro His Lys Lys Tyr Leu Thr Trp Leu Phe Gln Gly Ser Pro Ile Ala
50 55 60
Val Val Asn His Cys Asp Asn Asp Gly Val Leu Leu Ser Gly Pro Ala
65 70 75 80
Asn Leu Thr Phe Ser Thr Arg Arg Ser Lys Leu Gln Leu Phe Gln Pro
85 90 95
Phe Leu Pro Gly Thr Tyr Gln Cys Val Ser Gly Pro Cys His His Thr
100 105 110
Phe His Leu Ile Pro Asn Thr Thr Ala Pro Leu Pro Ala Thr Asn Asn
115 120 125
Gln Thr Asn Leu His Gln Arg His Arg Arg Asp Leu Ser Ser Glu Ser
130 135 140
Asn Thr Thr Thr Gly Gly Glu Leu Arg Gly Arg Pro Thr Ser Gly Ile
145 150 155 160
Tyr Tyr Gly Pro Trp Glu Val Val Gly Leu Ile Ala Leu Gly Leu Val
165 170 175
Val Gly Gly Leu Leu Ala Leu Cys Tyr Leu Tyr Leu Pro Cys Cys Ser
180 185 190
Tyr Leu Val Val Leu Cys Cys Trp Phe Lys Lys Trp Gly Arg Ser Pro
195 200 205
<210> 113
<211> 228
<212> PRT
<213> Simian adenovirus 30
<400> 113
Met Arg Leu Leu Asn Phe Leu Asn Ile Val Leu Ser Ile Ala Tyr Ala
1 5 10 15
Ser Gly Tyr Ala Asn Ile Gln Lys Thr Leu Tyr Val Gly Ser Asp Gly
20 25 30
Thr Leu Glu Gly Thr Gln Ser Gln Ala Lys Val Ala Trp Tyr Phe Tyr
35 40 45
Arg Thr Asn Thr Asp Pro Val Lys Leu Cys Lys Gly Glu Leu Pro Arg
50 55 60
Thr His Lys Thr Pro Leu Thr Phe Ser Cys Ser Asn Asn Asn Leu Thr
65 70 75 80
Leu Phe Ser Ile Thr Lys Gln Tyr Thr Gly Thr Tyr Tyr Ser Thr Asn
85 90 95
Phe His Thr Gly Gln Asp Lys Tyr Tyr Thr Val Lys Val Glu Asn Pro
100 105 110
Thr Thr Pro Arg Thr Thr Thr Thr Thr Thr Thr Thr Ala Lys Pro Thr
115 120 125
Val Lys Thr Thr Thr Arg Thr Thr Thr Thr Thr Glu Thr Thr Thr Ser
130 135 140
Thr Thr Leu Ala Ala Thr Thr His Thr His Thr Lys Leu Thr Leu Gln
145 150 155 160
Thr Thr Asn Asp Leu Ile Ala Leu Leu Gln Lys Gly Asp Asn Ser Thr
165 170 175
Thr Ser Asn Glu Glu Ile Pro Lys Ser Met Ile Gly Ile Ile Val Ala
180 185 190
Val Val Val Cys Met Leu Ile Ile Ala Leu Cys Met Val Tyr Tyr Ala
195 200 205
Phe Cys Tyr Arg Lys His Arg Leu Asn Asp Lys Leu Glu His Leu Leu
210 215 220
Ser Val Glu Phe
225
<210> 114
<211> 203
<212> PRT
<213> Simian adenovirus 30
<400> 114
Met Lys Ile Leu Gly Leu Phe Ser Phe Ser Ile Ile Thr Ser Ala Leu
1 5 10 15
Cys Glu Ser Val Asp Arg Asp Val Thr Ile Thr Thr Gly Ser Asn Tyr
20 25 30
Thr Leu Lys Gly Pro Pro Ser Gly Met Leu Ser Trp Tyr Cys Tyr Phe
35 40 45
Gly Thr Asp Thr Asp Gln Thr Glu Leu Cys Asn Phe Gln Lys Gly Lys
50 55 60
Thr Ser Asn Ser Lys Ile Ser Asn Tyr Gln Cys Asn Gly Thr Asp Leu
65 70 75 80
Ile Leu Leu Asn Val Thr Lys Ala Tyr Gly Gly Ser Tyr Tyr Cys Pro
85 90 95
Gly Gln Asn Thr Glu Glu Met Ile Phe Tyr Lys Val Glu Val Val Asp
100 105 110
Pro Thr Thr Pro Pro Thr Thr Thr Thr Ile His Thr Thr His Thr Glu
115 120 125
Gln Thr Pro Glu Ala Thr Glu Ala Glu Leu Ala Phe Gln Val His Gly
130 135 140
Asp Ser Phe Ala Val Asn Thr Pro Thr Pro Asp Gln Arg Cys Pro Gly
145 150 155 160
Pro Leu Val Ser Gly Ile Val Gly Val Leu Ser Gly Leu Ala Val Ile
165 170 175
Ile Ile Cys Met Phe Ile Phe Ala Cys Cys Tyr Arg Arg Leu Tyr Arg
180 185 190
Gln Lys Ser Asp Pro Leu Leu Asn Leu Tyr Val
195 200
<210> 115
<211> 288
<212> PRT
<213> Simian adenovirus 30
<400> 115
Met Lys Ala Val Ser Ala Leu Val Phe Cys Ser Leu Ile Gly Ile Val
1 5 10 15
Phe Asn Ser Lys Ile Thr Lys Val Ser Phe Ile Lys His Val Asn Val
20 25 30
Thr Glu Gly Asp Asn Ile Thr Leu Ala Gly Val Glu Gly Ala Gln Asn
35 40 45
Thr Thr Trp Thr Lys Tyr His Leu Gly Trp Arg Asp Ile Cys Thr Trp
50 55 60
Asn Val Thr Tyr Tyr Cys Ile Gly Ile Asn Leu Thr Ile Val Asn Ala
65 70 75 80
Asn Gln Ser Gln Asn Gly Leu Ile Lys Gly Gln Ser Val Ser Val Thr
85 90 95
Ser Asp Gly Tyr Tyr Thr Gln His Ser Phe Asn Tyr Asn Ile Thr Val
100 105 110
Ile Pro Leu Pro Thr Pro Ser Pro Pro Ser Thr Thr Thr Gln Thr Thr
115 120 125
Thr Tyr Ser Thr Ser Asn Gln Pro Thr Thr Thr Thr Ala Ala Glu Val
130 135 140
Ala Ser Ser Ser Gly Val Arg Val Ala Phe Leu Met Leu Ala Pro Ser
145 150 155 160
Ser Ser Pro Thr Ala Ser Thr Asn Glu Gln Thr Thr Glu Phe Leu Ser
165 170 175
Thr Val Glu Ser His Thr Thr Ala Thr Ser Ser Ala Phe Ser Ser Thr
180 185 190
Ala Asn Leu Ser Ser Leu Ser Ser Thr Pro Ile Ser Pro Ala Thr Thr
195 200 205
Pro Ser Pro Ala Pro Leu Pro Thr Pro Leu Lys Gln Thr Asp Gly Gly
210 215 220
Met Gln Trp Gln Ile Thr Leu Leu Ile Val Ile Gly Leu Val Ile Leu
225 230 235 240
Ala Val Leu Leu Tyr Tyr Ile Phe Cys Arg Arg Ile Pro Asn Ala His
245 250 255
Arg Lys Pro Ala Tyr Lys Pro Ile Val Ile Gly Gln Pro Glu Pro Leu
260 265 270
Gln Val Glu Gly Gly Leu Arg Asn Leu Leu Phe Ser Phe Thr Val Trp
275 280 285
<210> 116
<211> 91
<212> PRT
<213> Simian adenovirus 30
<400> 116
Met Ile Pro Arg Gln Phe Leu Ile Thr Ile Leu Ile Cys Leu Leu Gln
1 5 10 15
Val Cys Ala Thr Leu Ala Leu Val Ala Asn Ala Ser Pro Asp Cys Ile
20 25 30
Gly Pro Phe Ala Ser Tyr Val Leu Phe Ala Phe Ile Thr Cys Ile Cys
35 40 45
Cys Cys Ser Ile Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Ile Asp
50 55 60
Trp Ile Phe Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg
65 70 75 80
Asp Gln Arg Val Ala Gln Leu Leu Arg Leu Leu
85 90
<210> 117
<211> 144
<212> PRT
<213> Simian adenovirus 30
<400> 117
Met Arg Ala Leu Leu Leu Leu Ala Leu Leu Leu Leu Val Leu Pro Arg
1 5 10 15
Pro Val Asp Pro Arg Pro Pro Thr Gln Ser Pro Glu Glu Val Arg Lys
20 25 30
Cys Lys Phe Gln Glu Pro Trp Lys Phe Leu Lys Cys Tyr Arg Gln Lys
35 40 45
Ser Asp Met His Pro Ser Trp Ile Met Ile Ile Gly Ile Val Asn Ile
50 55 60
Leu Ala Cys Thr Leu Ile Ser Phe Val Ile Tyr Pro Cys Phe Asp Phe
65 70 75 80
Gly Trp Asn Ser Pro Glu Ala Leu Tyr Leu Pro Pro Glu Pro Asp Thr
85 90 95
Pro Pro Gln Gln Pro Gln Ala His Ala Leu Pro Pro Pro Pro Gln Pro
100 105 110
Arg Pro Gln Tyr Met Pro Ile Leu Asp Tyr Glu Ala Glu Pro Gln Arg
115 120 125
Pro Met Leu Pro Ala Ile Ser Tyr Phe Asn Leu Thr Gly Gly Asp Asp
130 135 140
<210> 118
<211> 445
<212> PRT
<213> Simian adenovirus 30
<400> 118
Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr
1 5 10 15
Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro
20 25 30
Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser
35 40 45
Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu
50 55 60
Lys Leu Gly Asp Gly Val Asp Leu Asp Asp Ser Gly Lys Leu Ile Ser
65 70 75 80
Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr
85 90 95
Ile Ser Leu Asn Met Asp Thr Pro Leu Tyr Asn Asn Asn Gly Lys Leu
100 105 110
Gly Met Lys Val Thr Ala Pro Leu Lys Ile Leu Asp Thr Asp Leu Leu
115 120 125
Lys Thr Leu Val Val Ala Tyr Gly Gln Gly Leu Gly Thr Asn Thr Asn
130 135 140
Gly Ala Leu Val Ala Gln Leu Ala Tyr Pro Leu Val Phe Asn Thr Ala
145 150 155 160
Ser Lys Ile Ala Leu Asn Leu Gly Asn Gly Pro Leu Lys Val Asp Ala
165 170 175
Asn Arg Leu Asn Ile Asn Cys Lys Arg Gly Ile Tyr Val Thr Thr Thr
180 185 190
Lys Asp Ala Leu Glu Ile Asn Ile Ser Trp Ala Asn Ala Met Thr Phe
195 200 205
Ile Gly Asn Ala Ile Gly Val Asn Ile Asp Thr Lys Lys Gly Leu Gln
210 215 220
Phe Gly Thr Ser Ser Thr Glu Thr Asp Val Lys Asn Ala Phe Pro Leu
225 230 235 240
Gln Val Lys Leu Gly Ala Gly Leu Thr Phe Asp Ser Thr Gly Ala Ile
245 250 255
Val Ala Trp Asn Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr Thr Ala
260 265 270
Asp Pro Ser Pro Asn Cys His Ile Tyr Ser Ala Lys Asp Ala Lys Leu
275 280 285
Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln Ile Leu Gly Thr Val Ser
290 295 300
Leu Ile Ala Val Asp Thr Gly Ser Leu Asn Pro Ile Thr Gly Lys Val
305 310 315 320
Thr Thr Ala Leu Val Ser Leu Lys Phe Asp Ala Asn Gly Val Leu Gln
325 330 335
Ala Ser Ser Thr Leu Asp Lys Glu Tyr Trp Asn Phe Arg Lys Gly Asp
340 345 350
Val Thr Pro Ala Asp Pro Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn
355 360 365
Leu Asn Ala Tyr Pro Lys Asn Thr Asn Ala Ala Ala Lys Ser His Ile
370 375 380
Val Gly Lys Val Tyr Leu His Gly Asp Val Ser Lys Pro Leu Asp Leu
385 390 395 400
Ile Ile Thr Phe Asn Glu Thr Ser Asp Glu Ser Cys Thr Tyr Cys Ile
405 410 415
Asn Phe Gln Trp Gln Trp Gly Thr Asp Gln Tyr Lys Asp Glu Thr Leu
420 425 430
Ala Val Ser Ser Phe Thr Phe Ser Tyr Ile Ala Lys Glu
435 440 445
<210> 119
<211> 580
<212> DNA
<213> Simian adenovirus 30
<220>
<221> CDS
<222> (1)..(573)
<223> label=Elb\19K
<400> 119
atg gag atc tgg aca gtc ttg gaa gac ttt cac cag act aga cag ctg 48
Met Glu Ile Trp Thr Val Leu Glu Asp Phe His Gln Thr Arg Gln Leu
1 5 10 15
cta gag aac tca tcg gag gga gtc tct tac ctg tgg aga ttc tgc ttc 96
Leu Glu Asn Ser Ser Glu Gly Val Ser Tyr Leu Trp Arg Phe Cys Phe
20 25 30
gct ggg cct cta gct aag cta gtc tat agg gcc aag cag gat tat agg 144
Ala Gly Pro Leu Ala Lys Leu Val Tyr Arg Ala Lys Gln Asp Tyr Arg
35 40 45
gaa caa ttt gag gat att ttg aga gag tgt cct ggt att ttt gac tct 192
Glu Gln Phe Glu Asp Ile Leu Arg Glu Cys Pro Gly Ile Phe Asp Ser
50 55 60
ctc aac ttg ggc cat cag tct cac ttt aac cag agt att ctg aga gcc 240
Leu Asn Leu Gly His Gln Ser His Phe Asn Gln Ser Ile Leu Arg Ala
65 70 75 80
ctt gac ttt tct act cct ggc aga act acc gcc gcg gta gcc ttt ttt 288
Leu Asp Phe Ser Thr Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe
85 90 95
gcc ttt atc ctt gac aaa tgg agt caa gaa acc cat ttc agc agg gat 336
Ala Phe Ile Leu Asp Lys Trp Ser Gln Glu Thr His Phe Ser Arg Asp
100 105 110
tac cgt ctg gac tgc tta gca gta gct ttg tgg aga aca tgg agg tgc 384
Tyr Arg Leu Asp Cys Leu Ala Val Ala Leu Trp Arg Thr Trp Arg Cys
115 120 125
cag cgc ctg aat gca atc tcc ggc tac ttg cca gta cag ccg gta gac 432
Gln Arg Leu Asn Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro Val Asp
130 135 140
acg ctg agg atc ctg agt ctc cag tca ccc cag gaa cac caa cgc cgc 480
Thr Leu Arg Ile Leu Ser Leu Gln Ser Pro Gln Glu His Gln Arg Arg
145 150 155 160
cag cag ccg cag cag gag cag cag caa gag gag gag gac cga gaa gag 528
Gln Gln Pro Gln Gln Glu Gln Gln Gln Glu Glu Glu Asp Arg Glu Glu
165 170 175
aac ccg aga gcc ggt ctg gac cct ccg gtg gcg gag gag gag gag 573
Asn Pro Arg Ala Gly Leu Asp Pro Pro Val Ala Glu Glu Glu Glu
180 185 190
tagctga 580
<210> 120
<211> 191
<212> PRT
<213> Simian adenovirus 30
<400> 120
Met Glu Ile Trp Thr Val Leu Glu Asp Phe His Gln Thr Arg Gln Leu
1 5 10 15
Leu Glu Asn Ser Ser Glu Gly Val Ser Tyr Leu Trp Arg Phe Cys Phe
20 25 30
Ala Gly Pro Leu Ala Lys Leu Val Tyr Arg Ala Lys Gln Asp Tyr Arg
35 40 45
Glu Gln Phe Glu Asp Ile Leu Arg Glu Cys Pro Gly Ile Phe Asp Ser
50 55 60
Leu Asn Leu Gly His Gln Ser His Phe Asn Gln Ser Ile Leu Arg Ala
65 70 75 80
Leu Asp Phe Ser Thr Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe
85 90 95
Ala Phe Ile Leu Asp Lys Trp Ser Gln Glu Thr His Phe Ser Arg Asp
100 105 110
Tyr Arg Leu Asp Cys Leu Ala Val Ala Leu Trp Arg Thr Trp Arg Cys
115 120 125
Gln Arg Leu Asn Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro Val Asp
130 135 140
Thr Leu Arg Ile Leu Ser Leu Gln Ser Pro Gln Glu His Gln Arg Arg
145 150 155 160
Gln Gln Pro Gln Gln Glu Gln Gln Gln Glu Glu Glu Asp Arg Glu Glu
165 170 175
Asn Pro Arg Ala Gly Leu Asp Pro Pro Val Ala Glu Glu Glu Glu
180 185 190
<210> 121
<211> 6360
<212> DNA
<213> Simian adenovirus 30
<220>
<221> CDS
<222> (8)..(571)
<223> label=22K
<220>
<221> CDS
<222> (1595)..(1912)
<223> label=E3\12.5K
<220>
<221> CDS
<222> (2477)..(3004)
<223> label=E3\gp19K
<220>
<221> CDS
<222> (5948)..(6352)
<223> label=E3\14.7K
<400> 121
ccccagg atg ccc cga gga agc agc aag aag ctg aaa gtg gag ctg ccg 49
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro
1 5 10
ccg gag gat ttg gag gaa gac tgg gag agc agt cag gca gag gag gag 97
Pro Glu Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu Glu Glu
15 20 25 30
atg gaa gac tgg gac agc act cag gca gag gag gac agc ctg caa gac 145
Met Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp
35 40 45
agt ctg gag gag gaa gac gag gtg gag gag gag gca gag gaa gaa gca 193
Ser Leu Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu Ala
50 55 60
gcc gcc gcc aga ccg tcg tcc tcg gcg gag gag aaa gca agc agc acg 241
Ala Ala Ala Arg Pro Ser Ser Ser Ala Glu Glu Lys Ala Ser Ser Thr
65 70 75
gat acc atc tcc gct ccg ggt cgg ggt cgc ggc ggc cgg gcc cac agt 289
Asp Thr Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser
80 85 90
aga tgg gac gag acc ggg cgc ttc ccg aac ccc acc acc cag acc ggt 337
Arg Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Gly
95 100 105 110
aag aag gag cgg cag gga tac aag tcc tgg cgg ggg cac aaa aac gcc 385
Lys Lys Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly His Lys Asn Ala
115 120 125
atc gtc tcc tgc ttg caa gcc tgc ggg ggc aac atc tcc ttc acc cgg 433
Ile Val Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile Ser Phe Thr Arg
130 135 140
cgc tac ctg ctc ttc cac cgc ggg gtg aac ttc ccc cgc aac atc ttg 481
Arg Tyr Leu Leu Phe His Arg Gly Val Asn Phe Pro Arg Asn Ile Leu
145 150 155
cat tac tac cgt cac ctc cac agc ccc tac tac tgt ttc caa gaa gag 529
His Tyr Tyr Arg His Leu His Ser Pro Tyr Tyr Cys Phe Gln Glu Glu
160 165 170
gca gaa acc cag cag cag cag aaa acc agc ggc agc agc agc 571
Ala Glu Thr Gln Gln Gln Gln Lys Thr Ser Gly Ser Ser Ser
175 180 185
tagaaaatcc acagcggcgg caggtggact gaggatcgcg gcgaacgagc cggcgcagac 631
ccgggagctg aggaaccgga tctttcccac cctctatgcc atcttccagc agagtcgggg 691
gcaggagcag gaactgaaag tcaagaaccg ttctctgcgc tcgctcaccc gcagttgtct 751
gtatcacaag agcgaagacc aacttcagcg cactctcgag gacgccgagg ctctcttcaa 811
caagtactgc gcgctcactc ttaaagagta gcccgcgccc gcccacacac ggaaaaaggc 871
gggaattacg tcaccacctg cgcccttcgc ccgaccatca tgagcaaaga gattcccacg 931
ccttacatgt ggagctacca gccccagatg ggcctggccg ccggcgccgc ccaggactac 991
tccacccgca tgaactggct cagtgccggg cccgcgatga tctcacgggt gaatgacatc 1051
cgcgcccgcc gaaaccagat actcctagaa cagtcagcga tcaccgccac gccccgccat 1111
caccttaatc cgcgtaattg gcccgccgcc ctggtgtacc aggaaattcc ccagcccacg 1171
accgtactac ttccgcgaga cgcccaggcc gaagtccagc tgactaactc aggtgtccag 1231
ctggccggcg gcgccgccct gtgtcgtcac cgccccgctc agggtataaa gcggctggtg 1291
atccgaggca gaggcacaca gctcaacgac gaggtggtga gctcttcgct gggtctgcga 1351
cctgacggag tcttccaact cgccggatcg gggagatctt ccttcacgcc tcgtcaggcc 1411
gtcctgactt tggagagttc gtcctcgcag ccccgctcgg gcggcatcgg cactctccag 1471
ttcgtggagg agttcactcc ctcggtctac ttcaacccct tctccggctc ccccggccac 1531
tacccggacg agttcatccc gaacttcgac gccatcagcg agtcggtgga cggctacgat 1591
tga atg tcc cat ggt ggc gca gct gac cta gct cgg ctt cga cac ctg 1639
Met Ser His Gly Gly Ala Ala Asp Leu Ala Arg Leu Arg His Leu
190 195 200
gac cac tgc cgc cgc ttc cgc tgc ttc gct cgg gat ctc gcc gag ttt 1687
Asp His Cys Arg Arg Phe Arg Cys Phe Ala Arg Asp Leu Ala Glu Phe
205 210 215
gcc tac ttt gag ctg ccc gag gag cac cct cag ggc cca gcc cac gga 1735
Ala Tyr Phe Glu Leu Pro Glu Glu His Pro Gln Gly Pro Ala His Gly
220 225 230 235
gtg cgg atc atc gtc gaa ggg ggc ctc gac tcc cac ctg ctt cgg atc 1783
Val Arg Ile Ile Val Glu Gly Gly Leu Asp Ser His Leu Leu Arg Ile
240 245 250
ttc agc cag cga ccg atc ctg gtc gag cgc gaa caa gga cag acc cgt 1831
Phe Ser Gln Arg Pro Ile Leu Val Glu Arg Glu Gln Gly Gln Thr Arg
255 260 265
ctg acc ctg tac tgc atc tgc aac cac ccc ggc ctg cat gaa agt ctt 1879
Leu Thr Leu Tyr Cys Ile Cys Asn His Pro Gly Leu His Glu Ser Leu
270 275 280
tgt tgt ctg ctg tgt act gag tat aat aaa agc tgagatcagc gactactccg 1932
Cys Cys Leu Leu Cys Thr Glu Tyr Asn Lys Ser
285 290
gactcgattg tggtgttcct gctatcaacc ggtccctgtt cttcaccggg aacgaaaccg 1992
agctccagct ccagtgtaag ccccacaaga agtacctcac ctggctgttc cagggctccc 2052
ccatcgccgt tgtcaaccac tgcgacaacg acggagtcct gctgagcggc cctgccaacc 2112
ttactttttc cacccgcaga agcaagctcc agctcttcca acccttcctc cccgggacct 2172
atcagtgcgt ctcgggaccc tgccatcaca ccttccacct gatcccgaat accacagcgc 2232
cgctccccgc tactaacaac caaactaacc tccaccaacg ccaccgtcgc gacctttcct 2292
ctgaatctaa taccactacc ggaggtgagc tccgaggtcg accaacctct gggatttact 2352
acggcccctg ggaggtggtg gggttaatag cgctaggcct agttgtgggt gggcttttgg 2412
ctctctgcta cctatacctc ccttgctgtt cgtacttagt ggtgctgtgt tgctggttta 2472
agaa atg ggg cag atc acc cta gtg agc tgc ggt gtg ctg gtg gcg gtg 2521
Met Gly Gln Ile Thr Leu Val Ser Cys Gly Val Leu Val Ala Val
295 300 305
ctt tcg att gtg gga ctg ggc ggc gcg gct gta gtg aag gag gag aag 2569
Leu Ser Ile Val Gly Leu Gly Gly Ala Ala Val Val Lys Glu Glu Lys
310 315 320 325
gcc gat ccc tgc ttg cat ttc aat ccc gac aaa tgc cag ctg agt ttt 2617
Ala Asp Pro Cys Leu His Phe Asn Pro Asp Lys Cys Gln Leu Ser Phe
330 335 340
cag ccc gat ggc aat cgg tgc gcg gtg ctg atc aag tgc gga tgg gaa 2665
Gln Pro Asp Gly Asn Arg Cys Ala Val Leu Ile Lys Cys Gly Trp Glu
345 350 355
tgc gag aac gtg aga atc gag tac aat aac aag act cgg aac aat act 2713
Cys Glu Asn Val Arg Ile Glu Tyr Asn Asn Lys Thr Arg Asn Asn Thr
360 365 370
ctc gcg tcc gtg tgg cag ccc ggg gac ccc gag tgg tac acc gtc tct 2761
Leu Ala Ser Val Trp Gln Pro Gly Asp Pro Glu Trp Tyr Thr Val Ser
375 380 385
gtc ccc ggt gct gac ggc tcc ccg cgc acc gtg aat aat act ttc att 2809
Val Pro Gly Ala Asp Gly Ser Pro Arg Thr Val Asn Asn Thr Phe Ile
390 395 400 405
ttt gcg cac atg tgc aac acg gtc atg tgg atg agc aag cag tac gat 2857
Phe Ala His Met Cys Asn Thr Val Met Trp Met Ser Lys Gln Tyr Asp
410 415 420
atg tgg ccc ccc acg aag gag aac atc gtg gtc ttc tcc atc gct tac 2905
Met Trp Pro Pro Thr Lys Glu Asn Ile Val Val Phe Ser Ile Ala Tyr
425 430 435
agc ctg tgc acg gcg cta atc acc gct atc gtg tgc ctg agc att cac 2953
Ser Leu Cys Thr Ala Leu Ile Thr Ala Ile Val Cys Leu Ser Ile His
440 445 450
atg ctc atc gct att cgc ccc aga aat aat gcc gag aaa gag aaa cag 3001
Met Leu Ile Ala Ile Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys Gln
455 460 465
cca taacacgttt tttcacacac cttgttttta cagacaatgc gtctgttaaa 3054
Pro
470
ttttttaaac attgtgctca gtattgctta tgcctctggt tatgcaaaca tacagaaaac 3114
cctttatgta ggatctgatg gtacactaga gggtacccaa tcacaagcca aggttgcatg 3174
gtatttttat agaaccaaca ctgatccagt taaactttgt aagggtgaat tgccgcgtac 3234
acataaaact ccacttacat ttagttgcag caataataat cttacacttt tttcaattac 3294
aaaacaatat actggtactt attacagtac aaactttcat acaggacaag ataaatatta 3354
tactgttaag gtagaaaatc ctaccactcc tagaactacc accaccacca ccactactgc 3414
aaagcccact gtgaaaacta caactaggac caccacaact acagaaacca ccaccagcac 3474
aacacttgct gcaactacac acacacacac taagctaacc ttacagacca ctaatgattt 3534
gatcgccctg ctgcaaaagg gggataacag caccacttcc aatgaggaga tacccaaatc 3594
catgattggc attattgttg ctgtagtggt gtgcatgttg atcatcgcct tgtgcatggt 3654
gtactatgcc ttctgctaca gaaagcacag actgaacgac aagctggaac acttactaag 3714
tgttgaattt taatttttta gaaccatgaa gatcctaggc ctttttagtt tttctatcat 3774
tacctctgct ctttgtgaat cagtggatag agatgttact attaccactg gttctaatta 3834
tacactgaaa gggccaccct caggtatgct ttcgtggtat tgctattttg gaactgacac 3894
tgatcaaact gaattatgca attttcaaaa aggcaaaacc tcaaactcta aaatctctaa 3954
ttatcaatgc aatggcactg atctgatact actcaatgtc acgaaagcat atggtggcag 4014
ttattattgc cctggacaaa acactgaaga aatgattttt tacaaagtgg aagtggttga 4074
tcccactaca ccacccacca ccacaactat tcataccaca cacacagaac aaacaccaga 4134
ggcaacagaa gcagagttgg ccttccaggt tcacggagat tcctttgctg tcaatacccc 4194
tacacccgat cagcggtgtc cggggccgct agtcagcggc attgtcggtg tgctttcggg 4254
attagcagtc ataatcatct gcatgttcat ttttgcttgc tgctatagaa ggctttaccg 4314
acaaaaatca gacccactgc tgaacctcta tgtttaattt tttccagagc catgaaggca 4374
gttagcgctc tagttttttg ttctttgatt ggcattgttt ttaatagtaa aattaccaaa 4434
gttagcttta ttaaacatgt taatgtaact gaaggagata acatcacact agcaggtgta 4494
gaaggtgctc aaaacaccac ctggacaaaa taccatctag gatggagaga tatttgcacc 4554
tggaatgtaa cttattattg cataggaatt aatcttacca ttgttaacgc taaccaatct 4614
cagaatgggt taattaaagg acagagtgtt agtgtgacca gtgatgggta ctatacccag 4674
catagtttta actacaacat tactgtcata ccactgccta cgcctagccc acctagcact 4734
accacacaga caaccacata cagtacatca aatcagccta ccaccactac agcagcagag 4794
gttgccagct cgtctggggt ccgagtggca tttttgatgt tggccccatc tagcagtccc 4854
actgctagta ccaatgagca gactactgaa tttttgtcca ctgtcgagag ccacaccaca 4914
gctacctcca gtgccttctc tagcaccgcc aatctctcct cgctttcctc tacaccaatc 4974
agccccgcta ctactcctag ccccgctcct cttcccactc ccctgaagca aacagacggc 5034
ggcatgcaat ggcagatcac cctgctcatt gtgatcgggt tggtcatcct ggccgtgttg 5094
ctctactaca tcttctgccg ccgcattccc aacgcgcacc gcaagccggc ctacaagccc 5154
atcgttatcg ggcagccgga gccgcttcag gtggaagggg gtctaaggaa tcttctcttc 5214
tcttttacag tatggtgatt gaactatgat tcctagacaa ttcttgatca ctattcttat 5274
ctgcctcctc caagtctgtg ccaccctcgc tctggtggcc aacgccagtc cagactgtat 5334
tgggcccttc gcctcctacg tgctctttgc cttcatcacc tgcatctgct gctgtagcat 5394
agtctgcctg cttatcacct tcttccagtt cattgactgg atctttgtgc gcatcgccta 5454
cctgcgccac cacccccagt accgcgacca gcgagtggcg cagctgctca ggctcctctg 5514
ataagcatgc gggctctgct acttctcgcg cttctgctgt tagtgctccc ccgtcccgtt 5574
gacccccggc cccccactca gtcccccgag gaggtccgca aatgcaaatt ccaagaaccc 5634
tggaaattcc tcaaatgcta ccgccaaaaa tcagacatgc atcccagctg gatcatgatc 5694
attgggatcg tgaacattct ggcctgcacc ctcatctcct ttgtgattta cccctgcttt 5754
gactttggtt ggaactcgcc agaggcgctc tatctcccgc ctgaacctga cacaccacca 5814
cagcaacctc aggcacacgc actaccacca ccaccacagc ctaggccaca atacatgccc 5874
atattagact atgaggccga gccacagcga cccatgctcc ccgctattag ttacttcaat 5934
ctaaccggcg gag atg act gac cca ctg gcc aac aac aac gtc aac gac 5983
Met Thr Asp Pro Leu Ala Asn Asn Asn Val Asn Asp
475 480
ctt ctc ctg gac atg gac ggc cgc gcc tcg gag cag cga ctc gcc caa 6031
Leu Leu Leu Asp Met Asp Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln
485 490 495
ctt cgc att cgc cag cag cag gag aga gcc gtc aag gag ctg cag gac 6079
Leu Arg Ile Arg Gln Gln Gln Glu Arg Ala Val Lys Glu Leu Gln Asp
500 505 510
ggc ata gcc atc cac cag tgc aag aaa ggc atc ttc tgc ctg gtg aaa 6127
Gly Ile Ala Ile His Gln Cys Lys Lys Gly Ile Phe Cys Leu Val Lys
515 520 525 530
cag gcc aag atc tcc tac gag gtc acc cag acc gac cat cgc ctc tcc 6175
Gln Ala Lys Ile Ser Tyr Glu Val Thr Gln Thr Asp His Arg Leu Ser
535 540 545
tac gag ctc atg cag cag cgc cag aag ttc acc tgc ctg gtc gga gtc 6223
Tyr Glu Leu Met Gln Gln Arg Gln Lys Phe Thr Cys Leu Val Gly Val
550 555 560
aac ccc atc gtc atc acc cag cag tcg ggc gat acc aag ggg tgc atc 6271
Asn Pro Ile Val Ile Thr Gln Gln Ser Gly Asp Thr Lys Gly Cys Ile
565 570 575
cac tgc tcc tgc gac tcc ccc gac tgc gtc cac act ctg atc aag acc 6319
His Cys Ser Cys Asp Ser Pro Asp Cys Val His Thr Leu Ile Lys Thr
580 585 590
ctc tgc ggc ctc cgc gac ctc ctc ccc atg aac taatcacc 6360
Leu Cys Gly Leu Arg Asp Leu Leu Pro Met Asn
595 600 605
<210> 122
<211> 188
<212> PRT
<213> Simian adenovirus 30
<400> 122
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro Pro Glu
1 5 10 15
Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu Glu Glu Met Glu
20 25 30
Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp Ser Leu
35 40 45
Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu Ala Ala Ala
50 55 60
Ala Arg Pro Ser Ser Ser Ala Glu Glu Lys Ala Ser Ser Thr Asp Thr
65 70 75 80
Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser Arg Trp
85 90 95
Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Gly Lys Lys
100 105 110
Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly His Lys Asn Ala Ile Val
115 120 125
Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile Ser Phe Thr Arg Arg Tyr
130 135 140
Leu Leu Phe His Arg Gly Val Asn Phe Pro Arg Asn Ile Leu His Tyr
145 150 155 160
Tyr Arg His Leu His Ser Pro Tyr Tyr Cys Phe Gln Glu Glu Ala Glu
165 170 175
Thr Gln Gln Gln Gln Lys Thr Ser Gly Ser Ser Ser
180 185
<210> 123
<211> 106
<212> PRT
<213> Simian adenovirus 30
<400> 123
Met Ser His Gly Gly Ala Ala Asp Leu Ala Arg Leu Arg His Leu Asp
1 5 10 15
His Cys Arg Arg Phe Arg Cys Phe Ala Arg Asp Leu Ala Glu Phe Ala
20 25 30
Tyr Phe Glu Leu Pro Glu Glu His Pro Gln Gly Pro Ala His Gly Val
35 40 45
Arg Ile Ile Val Glu Gly Gly Leu Asp Ser His Leu Leu Arg Ile Phe
50 55 60
Ser Gln Arg Pro Ile Leu Val Glu Arg Glu Gln Gly Gln Thr Arg Leu
65 70 75 80
Thr Leu Tyr Cys Ile Cys Asn His Pro Gly Leu His Glu Ser Leu Cys
85 90 95
Cys Leu Leu Cys Thr Glu Tyr Asn Lys Ser
100 105
<210> 124
<211> 176
<212> PRT
<213> Simian adenovirus 30
<400> 124
Met Gly Gln Ile Thr Leu Val Ser Cys Gly Val Leu Val Ala Val Leu
1 5 10 15
Ser Ile Val Gly Leu Gly Gly Ala Ala Val Val Lys Glu Glu Lys Ala
20 25 30
Asp Pro Cys Leu His Phe Asn Pro Asp Lys Cys Gln Leu Ser Phe Gln
35 40 45
Pro Asp Gly Asn Arg Cys Ala Val Leu Ile Lys Cys Gly Trp Glu Cys
50 55 60
Glu Asn Val Arg Ile Glu Tyr Asn Asn Lys Thr Arg Asn Asn Thr Leu
65 70 75 80
Ala Ser Val Trp Gln Pro Gly Asp Pro Glu Trp Tyr Thr Val Ser Val
85 90 95
Pro Gly Ala Asp Gly Ser Pro Arg Thr Val Asn Asn Thr Phe Ile Phe
100 105 110
Ala His Met Cys Asn Thr Val Met Trp Met Ser Lys Gln Tyr Asp Met
115 120 125
Trp Pro Pro Thr Lys Glu Asn Ile Val Val Phe Ser Ile Ala Tyr Ser
130 135 140
Leu Cys Thr Ala Leu Ile Thr Ala Ile Val Cys Leu Ser Ile His Met
145 150 155 160
Leu Ile Ala Ile Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys Gln Pro
165 170 175
<210> 125
<211> 135
<212> PRT
<213> Simian adenovirus 30
<400> 125
Met Thr Asp Pro Leu Ala Asn Asn Asn Val Asn Asp Leu Leu Leu Asp
1 5 10 15
Met Asp Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln Leu Arg Ile Arg
20 25 30
Gln Gln Gln Glu Arg Ala Val Lys Glu Leu Gln Asp Gly Ile Ala Ile
35 40 45
His Gln Cys Lys Lys Gly Ile Phe Cys Leu Val Lys Gln Ala Lys Ile
50 55 60
Ser Tyr Glu Val Thr Gln Thr Asp His Arg Leu Ser Tyr Glu Leu Met
65 70 75 80
Gln Gln Arg Gln Lys Phe Thr Cys Leu Val Gly Val Asn Pro Ile Val
85 90 95
Ile Thr Gln Gln Ser Gly Asp Thr Lys Gly Cys Ile His Cys Ser Cys
100 105 110
Asp Ser Pro Asp Cys Val His Thr Leu Ile Lys Thr Leu Cys Gly Leu
115 120 125
Arg Asp Leu Leu Pro Met Asn
130 135
<210> 126
<211> 870
<212> DNA
<213> Simian adenovirus 30
<220>
<221> CDS
<222> (6)..(573)
<223> label=Ela
<220>
<221> CDS
<222> (659)..(864)
<223> label=Ela
<400> 126
gaaag atg agg cac ctg aga gac ctg ccc ggt aat gtt ttc ctg gct act 50
Met Arg His Leu Arg Asp Leu Pro Gly Asn Val Phe Leu Ala Thr
1 5 10 15
ggg aac gag att ctg gaa ctg gtg gtg gac gcc atg atg ggt gac gac 98
Gly Asn Glu Ile Leu Glu Leu Val Val Asp Ala Met Met Gly Asp Asp
20 25 30
cct ccc gag ccc cct acc cca ttt gag gcg cct tcg ctg tac gat ttg 146
Pro Pro Glu Pro Pro Thr Pro Phe Glu Ala Pro Ser Leu Tyr Asp Leu
35 40 45
tat gat ctg gag gtg gat gtg tcc gag aac gac ccc aac gag gag gcg 194
Tyr Asp Leu Glu Val Asp Val Ser Glu Asn Asp Pro Asn Glu Glu Ala
50 55 60
gtg aat gat ttg ttt agc gat gcc gcg ctg ctg gct gcc gag cag gct 242
Val Asn Asp Leu Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Gln Ala
65 70 75
aat acg gac tct ggc tca gac agc gat tcc tct ctc cat acc ccg aga 290
Asn Thr Asp Ser Gly Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg
80 85 90 95
ccc ggc aga ggt gag aaa aag atc ccc gag ctt aaa ggg gaa gag ctc 338
Pro Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu Leu
100 105 110
gac ctg cgc tgc tat gag gaa tgc ttg cct ccg agc gat gat gag gag 386
Asp Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu Glu
115 120 125
gac gag gag gcg att cga gct gca gcg agc gag gga gtg aaa gtt gcg 434
Asp Glu Glu Ala Ile Arg Ala Ala Ala Ser Glu Gly Val Lys Val Ala
130 135 140
ggc gag agc ttt agc ctg gac tgt cct act ctg ccc gga cac ggc tgt 482
Gly Glu Ser Phe Ser Leu Asp Cys Pro Thr Leu Pro Gly His Gly Cys
145 150 155
aag tct tgt gaa ttt cat cgc atg aat act gga gat aag aat gtg atg 530
Lys Ser Cys Glu Phe His Arg Met Asn Thr Gly Asp Lys Asn Val Met
160 165 170 175
tgt gcc ctg tgc tat atg aga gct tac aac cat tgt gtt tac a 573
Cys Ala Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr
180 185
gtaagtgtga ttaactttag ttgggaaagg cagagggtga ctgggtgctg actggtttat 633
ttatgtatat gttttttatg tgtag gt ccc gtc tct gac gca gat gag acc 684
Ser Pro Val Ser Asp Ala Asp Glu Thr
195
ccc act tca gag tgc att tca tca ccc cca gaa att ggc gag gaa ccg 732
Pro Thr Ser Glu Cys Ile Ser Ser Pro Pro Glu Ile Gly Glu Glu Pro
200 205 210
ccc gaa gat att att cat aga cca gtt gca gtg aga gtc acc ggg cgg 780
Pro Glu Asp Ile Ile His Arg Pro Val Ala Val Arg Val Thr Gly Arg
215 220 225 230
aga gca gct gtg gag agt ttg gat gac ttg cta cag ggt ggg gat gaa 828
Arg Ala Ala Val Glu Ser Leu Asp Asp Leu Leu Gln Gly Gly Asp Glu
235 240 245
cct ttg gac ttg tgt acc cgg aaa cgc ccc agg cac taagtg 870
Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro Arg His
250 255
<210> 127
<211> 258
<212> PRT
<213> Simian adenovirus 30
<400> 127
Met Arg His Leu Arg Asp Leu Pro Gly Asn Val Phe Leu Ala Thr Gly
1 5 10 15
Asn Glu Ile Leu Glu Leu Val Val Asp Ala Met Met Gly Asp Asp Pro
20 25 30
Pro Glu Pro Pro Thr Pro Phe Glu Ala Pro Ser Leu Tyr Asp Leu Tyr
35 40 45
Asp Leu Glu Val Asp Val Ser Glu Asn Asp Pro Asn Glu Glu Ala Val
50 55 60
Asn Asp Leu Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Gln Ala Asn
65 70 75 80
Thr Asp Ser Gly Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg Pro
85 90 95
Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu Leu Asp
100 105 110
Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu Glu Asp
115 120 125
Glu Glu Ala Ile Arg Ala Ala Ala Ser Glu Gly Val Lys Val Ala Gly
130 135 140
Glu Ser Phe Ser Leu Asp Cys Pro Thr Leu Pro Gly His Gly Cys Lys
145 150 155 160
Ser Cys Glu Phe His Arg Met Asn Thr Gly Asp Lys Asn Val Met Cys
165 170 175
Ala Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr Ser Pro Val
180 185 190
Ser Asp Ala Asp Glu Thr Pro Thr Ser Glu Cys Ile Ser Ser Pro Pro
195 200 205
Glu Ile Gly Glu Glu Pro Pro Glu Asp Ile Ile His Arg Pro Val Ala
210 215 220
Val Arg Val Thr Gly Arg Arg Ala Ala Val Glu Ser Leu Asp Asp Leu
225 230 235 240
Leu Gln Gly Gly Asp Glu Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro
245 250 255
Arg His
<210> 128
<211> 840
<212> DNA
<213> Simian adenovirus 30
<220>
<221> CDS
<222> (8)..(335)
<223> label=33K
<220>
<221> CDS
<222> (505)..(839)
<223> label=33K
<400> 128
ccccagg atg ccc cga gga agc agc aag aag ctg aaa gtg gag ctg ccg 49
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro
1 5 10
ccg gag gat ttg gag gaa gac tgg gag agc agt cag gca gag gag gag 97
Pro Glu Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu Glu Glu
15 20 25 30
atg gaa gac tgg gac agc act cag gca gag gag gac agc ctg caa gac 145
Met Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp
35 40 45
agt ctg gag gag gaa gac gag gtg gag gag gag gca gag gaa gaa gca 193
Ser Leu Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu Ala
50 55 60
gcc gcc gcc aga ccg tcg tcc tcg gcg gag gag aaa gca agc agc acg 241
Ala Ala Ala Arg Pro Ser Ser Ser Ala Glu Glu Lys Ala Ser Ser Thr
65 70 75
gat acc atc tcc gct ccg ggt cgg ggt cgc ggc ggc cgg gcc cac agt 289
Asp Thr Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser
80 85 90
aga tgg gac gag acc ggg cgc ttc ccg aac ccc acc acc cag acc g 335
Arg Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr
95 100 105
gtaagaagga gcggcaggga tacaagtcct ggcgggggca caaaaacgcc atcgtctcct 395
gcttgcaagc ctgcgggggc aacatctcct tcacccggcg ctacctgctc ttccaccgcg 455
gggtgaactt cccccgcaac atcttgcatt actaccgtca cctccacag cc cct act 512
Ala Pro Thr
act gtt tcc aag aag agg cag aaa ccc agc agc agc aga aaa cca gcg 560
Thr Val Ser Lys Lys Arg Gln Lys Pro Ser Ser Ser Arg Lys Pro Ala
115 120 125
gca gca gca gct aga aaa tcc aca gcg gcg gca ggt gga ctg agg atc 608
Ala Ala Ala Ala Arg Lys Ser Thr Ala Ala Ala Gly Gly Leu Arg Ile
130 135 140
gcg gcg aac gag ccg gcg cag acc cgg gag ctg agg aac cgg atc ttt 656
Ala Ala Asn Glu Pro Ala Gln Thr Arg Glu Leu Arg Asn Arg Ile Phe
145 150 155 160
ccc acc ctc tat gcc atc ttc cag cag agt cgg ggg cag gag cag gaa 704
Pro Thr Leu Tyr Ala Ile Phe Gln Gln Ser Arg Gly Gln Glu Gln Glu
165 170 175
ctg aaa gtc aag aac cgt tct ctg cgc tcg ctc acc cgc agt tgt ctg 752
Leu Lys Val Lys Asn Arg Ser Leu Arg Ser Leu Thr Arg Ser Cys Leu
180 185 190
tat cac aag agc gaa gac caa ctt cag cgc act ctc gag gac gcc gag 800
Tyr His Lys Ser Glu Asp Gln Leu Gln Arg Thr Leu Glu Asp Ala Glu
195 200 205
gct ctc ttc aac aag tac tgc gcg ctc act ctt aaa gag t 840
Ala Leu Phe Asn Lys Tyr Cys Ala Leu Thr Leu Lys Glu
210 215 220
<210> 129
<211> 221
<212> PRT
<213> Simian adenovirus 30
<400> 129
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro Pro Glu
1 5 10 15
Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu Glu Glu Met Glu
20 25 30
Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp Ser Leu
35 40 45
Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu Ala Ala Ala
50 55 60
Ala Arg Pro Ser Ser Ser Ala Glu Glu Lys Ala Ser Ser Thr Asp Thr
65 70 75 80
Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser Arg Trp
85 90 95
Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Ala Pro Thr
100 105 110
Thr Val Ser Lys Lys Arg Gln Lys Pro Ser Ser Ser Arg Lys Pro Ala
115 120 125
Ala Ala Ala Ala Arg Lys Ser Thr Ala Ala Ala Gly Gly Leu Arg Ile
130 135 140
Ala Ala Asn Glu Pro Ala Gln Thr Arg Glu Leu Arg Asn Arg Ile Phe
145 150 155 160
Pro Thr Leu Tyr Ala Ile Phe Gln Gln Ser Arg Gly Gln Glu Gln Glu
165 170 175
Leu Lys Val Lys Asn Arg Ser Leu Arg Ser Leu Thr Arg Ser Cys Leu
180 185 190
Tyr His Lys Ser Glu Asp Gln Leu Gln Arg Thr Leu Glu Asp Ala Glu
195 200 205
Ala Leu Phe Asn Lys Tyr Cys Ala Leu Thr Leu Lys Glu
210 215 220
<210> 130
<211> 36629
<212> DNA
<213> Simian adenovirus 25.2
<220>
<221> repeat_region
<222> (1)..(126)
<223> label=ITR
<220>
<221> CDS
<222> (1905)..(3416)
<223> label=Elb\55K
<220>
<221> CDS
<222> (3501)..(3926)
<223> label=pIX
<220>
<221> misc_feature
<222> (3990)..(5611)
<223> complement (3990..5320, 5599..5611) label=IVa2
<220>
<221> misc_feature
<222> (5093)..(13834)
<223> complement (5093..8665, 13826..13834) label=pol
<220>
<221> misc_feature
<222> (8467)..(13834)
<223> complement (8467..10398, 13826..13834) label=pTP
<220>
<221> CDS
<222> (10833)..(12017)
<223> label=52K
<220>
<221> CDS
<222> (12044)..(13792)
<223> label=pIIIa
<220>
<221> CDS
<222> (13874)..(15466)
<223> label=penton
<220>
<221> CDS
<222> (15473)..(16054)
<223> label=pVII
<220>
<221> CDS
<222> (16102)..(17145)
<223> label=V
<220>
<221> CDS
<222> (17173)..(17403)
<223> label=pX
<220>
<221> CDS
<222> (17478)..(18209)
<223> label=pVI
<220>
<221> CDS
<222> (18315)..(21113)
<223> label=hexon
<220>
<221> CDS
<222> (21136)..(21759)
<223> label=protease
<220>
<221> misc_feature
<222> (21845)..(23377)
<223> complement label=DBP
<220>
<221> CDS
<222> (23400)..(25790)
<223> label=100K
<220>
<221> CDS
<222> (26425)..(27105)
<223> label=pVIII
<220>
<221> CDS
<222> (27109)..(27426)
<223> label=E3\12.5K
<220>
<221> CDS
<222> (27994)..(28527)
<223> label=E3\gp19K
<220>
<221> CDS
<222> (28560)..(29285)
<223> label=E3\CR1\beta
<220>
<221> CDS
<222> (29301)..(29906)
<223> label=E3\CR1\gamma
<220>
<221> CDS
<222> (29924)..(30784)
<223> label=E3\CR1\delta
<220>
<221> CDS
<222> (30795)..(31067)
<223> label=E3\RID\alpha
<220>
<221> CDS
<222> (31076)..(31507)
<223> label=E3\RID\beta
<220>
<221> CDS
<222> (32207)..(33535)
<223> label=fiber
<220>
<221> misc_feature
<222> (33632)..(34782)
<223> complement (33632..33880, 34603..34782) label=E4\orf6/7
<220>
<221> misc_feature
<222> (33880)..(34782)
<223> complement label=E4\orf6
<220>
<221> misc_feature
<222> (34691)..(35053)
<223> complement label=E4\orf4
<220>
<221> misc_feature
<222> (35066)..(35416)
<223> complement label=E4\orf3
<220>
<221> misc_feature
<222> (35416)..(35802)
<223> complement label=E4\orf2
<220>
<221> misc_feature
<222> (35846)..(36217)
<223> complement label=E4\orf1
<220>
<221> repeat_region
<222> (36504)..(36629)
<223> complement label=ITR
<400> 130
catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg aggcgtttga 60
atttggggat gcggggcgct gattggctgt gacgaaggcg accgttaggg gcggggcggg 120
tgacgttttg atgacgtgtt tgtgaggcgg agccggtttg caagttctcg tgggaaaagt 180
gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240
ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300
aatgaggaag tgaaaatctg agtaatttcg cgtttatggc agggaggagt atttgccgag 360
ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420
ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggt gtcagctgat cgccagggta 480
tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540
cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600
gtaatgtttt cctggctact gggaacgaga ttctggaact ggtggtggac gccatgatgg 660
gtgacgaccc tccggagccc cctaccccat ttgaggcgcc ttcgctgtac gatttgtatg 720
atctggaggt ggatgtgccc gagaacgacc ccaacgagga ggcggtgaat gatttgttta 780
gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840
cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900
aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960
aggaggcgat tcgagctgca gcgaaccagg gagtgaaagc ggcgggcgag ggctttagcc 1020
tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080
ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140
acagtaagtg tgattaactt tagttgggaa ggcagagggt gactgggtgc tgactggttt 1200
atttatgtat atgttttttt atgtgtaggt cccgtctctg acgtagatga gacccccact 1260
tcagagtgca tttcatcacc cccagaaatt ggcgaggaac cgcccgaaga tattattcat 1320
agaccagttg cagtgagagt caccgggcgg agagcagctg tggagagttt ggatgacttg 1380
ctacagggtg gggatgaacc tttggacttg tgtacccgga aacgccccag gcactaagtg 1440
ccacacatgt gtgtttactt aaggtgatgt cagtatttat agggtgtgga gtgcaataaa 1500
atccgtgttg actttaagtg cgtgttttat gactcagggg tggggactgt gggtatataa 1560
gcaggtgcag acctgtgtgg tcagttcaga gcaggactca tggagatctg gacggtcttg 1620
gaagactttc atcagactag acagctgcta gagaactcat cggaggaagt ctcttacctg 1680
tggagatttt gcttcggtgg ggctctagct aagctagtct atagggccaa acaggattat 1740
aaggatcaat ttgaggatat tttgagagag tgtcctggta tttttgactc tctcaacttg 1800
ggccatcagt ctcactttaa ccagagtatt ctgagagccc ttgacttttc tactcctggc 1860
agaactaccg ccgcggtagc cttttttgcc tttatccttg acaa atg gag tca aga 1916
Met Glu Ser Arg
1
aac cca ttt cag cag gga tta ccg tct gga ctg ctt agc agt agc ttt 1964
Asn Pro Phe Gln Gln Gly Leu Pro Ser Gly Leu Leu Ser Ser Ser Phe
5 10 15 20
gtg gag aac atg gag gtg cca gcg cct gaa tgc aat ctc cgg cta ctt 2012
Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn Leu Arg Leu Leu
25 30 35
gcc agt aca gcc ggt aga cac gct gag gat cct gag tct cca gtc acc 2060
Ala Ser Thr Ala Gly Arg His Ala Glu Asp Pro Glu Ser Pro Val Thr
40 45 50
cca gga aca cca acg ccg cca gca gcc gca gca gga gca gca gca aga 2108
Pro Gly Thr Pro Thr Pro Pro Ala Ala Ala Ala Gly Ala Ala Ala Arg
55 60 65
gga gga gga gga ccg aga aga gaa ccc gag agc cgg tct gga ccc tcc 2156
Gly Gly Gly Gly Pro Arg Arg Glu Pro Glu Ser Arg Ser Gly Pro Ser
70 75 80
ggt ggc gga gga gga gga gta gct gac ttg ttt ccc gag ctg cgc cgg 2204
Gly Gly Gly Gly Gly Gly Val Ala Asp Leu Phe Pro Glu Leu Arg Arg
85 90 95 100
gtg ctg act agg tct tcc agt gga cgg gag agg ggg att aag cgg gag 2252
Val Leu Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly Ile Lys Arg Glu
105 110 115
agg cat gag gag act agc cac aga act gaa ctg act gtc agt ctg atg 2300
Arg His Glu Glu Thr Ser His Arg Thr Glu Leu Thr Val Ser Leu Met
120 125 130
agc cgc agg cgc cca gaa tcg gtg tgg tgg cat gag gtg cag tcg cag 2348
Ser Arg Arg Arg Pro Glu Ser Val Trp Trp His Glu Val Gln Ser Gln
135 140 145
ggg ata gat gag gtc tca gtg atg cat gag aaa tat tcc cta gaa caa 2396
Gly Ile Asp Glu Val Ser Val Met His Glu Lys Tyr Ser Leu Glu Gln
150 155 160
gtc aag act tgt tgg ttg gag ccc gag gat gat tgg gag tta gcc atc 2444
Val Lys Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp Glu Leu Ala Ile
165 170 175 180
agg aat tat gcc aag cta gct ctg aag cca gac aag aag tac aag att 2492
Arg Asn Tyr Ala Lys Leu Ala Leu Lys Pro Asp Lys Lys Tyr Lys Ile
185 190 195
acc aag ttg att aat atc aga aat tcc tgc tac att tca ggg aat ggg 2540
Thr Lys Leu Ile Asn Ile Arg Asn Ser Cys Tyr Ile Ser Gly Asn Gly
200 205 210
gcc gag gtg gag atc agt acc cag gag agg gtg gcc ttc aga tgt tgt 2588
Ala Glu Val Glu Ile Ser Thr Gln Glu Arg Val Ala Phe Arg Cys Cys
215 220 225
atg atg aat atg tac ccg ggg gtg gtg ggc atg gag gga gtc acc ttt 2636
Met Met Asn Met Tyr Pro Gly Val Val Gly Met Glu Gly Val Thr Phe
230 235 240
atg aac gcg agg ttc agg ggt gat ggg tat aat ggg gtg gtc ttt atg 2684
Met Asn Ala Arg Phe Arg Gly Asp Gly Tyr Asn Gly Val Val Phe Met
245 250 255 260
gcc aac acc aag ctg aca gtg cac gga tgc tcc ttc ttt ggc ttc aat 2732
Ala Asn Thr Lys Leu Thr Val His Gly Cys Ser Phe Phe Gly Phe Asn
265 270 275
aac atg tgc atc gag gcc tgg ggc agt gtt tca gtg agg gga tgc agc 2780
Asn Met Cys Ile Glu Ala Trp Gly Ser Val Ser Val Arg Gly Cys Ser
280 285 290
ttt tca gcc aac tgg atg ggg gtc gtg ggc aga acc aag agc aag gtg 2828
Phe Ser Ala Asn Trp Met Gly Val Val Gly Arg Thr Lys Ser Lys Val
295 300 305
tca gtg aag aaa tgc ctg ttc gag agg tgc cac ctg ggg gtg atg agc 2876
Ser Val Lys Lys Cys Leu Phe Glu Arg Cys His Leu Gly Val Met Ser
310 315 320
gag ggc gaa gcc aaa gtc aaa cac tgc gcc tct acc gag acg ggc tgc 2924
Glu Gly Glu Ala Lys Val Lys His Cys Ala Ser Thr Glu Thr Gly Cys
325 330 335 340
ttt gtg ctg atc aag ggc aat gcc caa gtc aag cat aac atg atc tgt 2972
Phe Val Leu Ile Lys Gly Asn Ala Gln Val Lys His Asn Met Ile Cys
345 350 355
ggg gcc tcg gat gag cgc ggc tac cag atg ctg acc tgc gcc ggt ggg 3020
Gly Ala Ser Asp Glu Arg Gly Tyr Gln Met Leu Thr Cys Ala Gly Gly
360 365 370
aac agc cat atg ctg gcc acc gtg cat gtg gcc tcg cac ccc cgc aag 3068
Asn Ser His Met Leu Ala Thr Val His Val Ala Ser His Pro Arg Lys
375 380 385
aca tgg ccc gag ttc gag cac aac gtc atg acc cgc tgc aat gtg cac 3116
Thr Trp Pro Glu Phe Glu His Asn Val Met Thr Arg Cys Asn Val His
390 395 400
ctg ggc tcc cgc cga ggc atg ttc atg ccc tac cag tgc aac atg caa 3164
Leu Gly Ser Arg Arg Gly Met Phe Met Pro Tyr Gln Cys Asn Met Gln
405 410 415 420
ttt gtg aag gtg ctg ctg gag ccc gat gcc atg tcc aga gtg agc ctg 3212
Phe Val Lys Val Leu Leu Glu Pro Asp Ala Met Ser Arg Val Ser Leu
425 430 435
acg ggg gtg ttt gac atg aat gtg gag ctg tgg aaa att ctg aga tat 3260
Thr Gly Val Phe Asp Met Asn Val Glu Leu Trp Lys Ile Leu Arg Tyr
440 445 450
gat gaa tcc aag acc agg tgc cgg gcc tgc gaa tgc gga ggc aag cac 3308
Asp Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys Gly Gly Lys His
455 460 465
gcc agg ctt cag ccc gtg tgt gtg gag gtg acg gag gac ctg cga ccc 3356
Ala Arg Leu Gln Pro Val Cys Val Glu Val Thr Glu Asp Leu Arg Pro
470 475 480
gat cat ttg gtg ttg tcc tgc aac ggg acg gag ttc ggc tcc agc ggg 3404
Asp His Leu Val Leu Ser Cys Asn Gly Thr Glu Phe Gly Ser Ser Gly
485 490 495 500
gaa gaa tct gac tagagtgagt agtgcttggg ggaggtggag ggcttgtatg 3456
Glu Glu Ser Asp
aggggcagaa tgactaaaat ctgtgttttt ctgtgtgttg cagc atg agc gga agc 3512
Met Ser Gly Ser
505
gcc tcc ttt gag gga ggg gta ttc agc cct tat ctg acg ggg cgt ctc 3560
Ala Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu Thr Gly Arg Leu
510 515 520
ccc tcc tgg gct gga gtg cgt cag aat gtg atg gga tcc acg gtg gac 3608
Pro Ser Trp Ala Gly Val Arg Gln Asn Val Met Gly Ser Thr Val Asp
525 530 535 540
ggc cgg ccc gtg cag ccc gcg aac tct tca acc ctg acc tac gcg acc 3656
Gly Arg Pro Val Gln Pro Ala Asn Ser Ser Thr Leu Thr Tyr Ala Thr
545 550 555
ctg agc tct tcg tcc gtg gac gca gct gcc gcc gca gct gct gct tcc 3704
Leu Ser Ser Ser Ser Val Asp Ala Ala Ala Ala Ala Ala Ala Ala Ser
560 565 570
gcc gcc agc gcc gtg cgc gga atg gcc ctg ggc gcc ggc tac tac agc 3752
Ala Ala Ser Ala Val Arg Gly Met Ala Leu Gly Ala Gly Tyr Tyr Ser
575 580 585
tct ctg gtg gcc aac tcg agt tcc acc aat aat ccc gcc agc ctg aac 3800
Ser Leu Val Ala Asn Ser Ser Ser Thr Asn Asn Pro Ala Ser Leu Asn
590 595 600
gag gag aag ctg ctg ctg ctg atg gcc cag ctc gag gcc ctg acc cag 3848
Glu Glu Lys Leu Leu Leu Leu Met Ala Gln Leu Glu Ala Leu Thr Gln
605 610 615 620
cgc ctg ggc gag ctg acc cag cag gtg gct cag ctg cag gcg gag acg 3896
Arg Leu Gly Glu Leu Thr Gln Gln Val Ala Gln Leu Gln Ala Glu Thr
625 630 635
cgg gcc gcg gtt gcc acg gtg aaa acc aaa taaaaaatga atcaataaat 3946
Arg Ala Ala Val Ala Thr Val Lys Thr Lys
640 645
aaacggagac ggttgttgat tttaacagag tcttgaatct ttatttgatt tttcgcgcgc 4006
ggtaggccct ggaccaccgg tctcgatcat tgagcacccg gtggatcttt tccaggaccc 4066
ggtagaggtg ggcttggatg ttgaggtaca tgggcatgag cccgtcccgg gggtggaggt 4126
agctccattg cagggcctcg tgctcggggg tggtgttgta aatcacccag tcatagcagg 4186
ggcgcagggc gtggtgctgc acgatgtcct tgaggaggag actgatggcc acgggcagcc 4246
ccttggtgta ggtgttgacg aacctgttga gctgggaggg atgcatgcgg ggggagatga 4306
gatgcatctt ggcctggatc ttgagattgg cgatgttccc gcccagatcc cgccgggggt 4366
tcatgttgtg caggaccacc agcacggtgt atccggtgca cttggggaat ttgtcatgca 4426
acttggaagg gaaggcgtga aagaatttgg agacgccctt gtggccgccc aggttttcca 4486
tgcactcatc catgatgatg gcgatgggcc cgtgggcggc ggcctgggca aagacgtttc 4546
gggggtcgga cacatcgtag ttgtggtcct gggtgagctc gtcataggcc attttaatga 4606
atttggggcg gagggtgccc gactggggga cgaaggtgcc ctcgatcccg ggggcgtagt 4666
tgccctcgca gatctgcatc tcccaggcct tgagctcgga gggggggatc atgtccacct 4726
gcggggcgat gaaaaaaacg gtttccgggg cgggggagat gagctgggcc gaaagcaggt 4786
tccggagcag ctgggacttg ccgcagccgg tgggaccgta gatgaccccg atgaccggct 4846
gcaggtggta gttgagggag agacagctgc cgtcctcgcg gaggaggggg gccacctcgt 4906
tcatcatctc gcgcacatgc atgttctcgc gcacgagttc cgccaggagg cgctcgcccc 4966
ccagcgagag gagctcttgc agcgaggcga agtttttcag cggcttgagc ccgtcggcca 5026
tgggcatttt ggagagggtc tgttgcaaga gttccagacg gtcccagagc tcggtgatgt 5086
gctctagggc atctcgatcc agcagacctc ctcgtttcgc gggttggggc gactgcggga 5146
gtagggcacc aggcgatggg cgtccagcga ggccagggtc cggtccttcc agggccgtag 5206
ggtccgcgtc agcgtggtct ccgtcacggt gaaggggtgc gcgccgggct gggcgcttgc 5266
gagggtgcgc ttcaggctca tccggctggt cgagaaccgc tcccggtcgg cgccctgcgc 5326
gtcggccagg tagcaattga gcatgagttc gtagttgagc gcctcggccg cgtggccctt 5386
ggcgcggagc ttacctttgg aagtgtgtcc gcagacggga cagaggaggg acttgagggc 5446
gtagagcttg ggggcgagga agacggactc gggggcgtag gcgtccgcgc cgcagctggc 5506
gcagacggtc tcgcactcca cgagccaggt gaggtcgggg cggtcggggt caaaaacgag 5566
gtttcctccg tgctttttga tgcgtttctt acctctggtc tccatgagct cgtgtccccg 5626
ctgggtgaca aagaggctgt ccgtgtcccc gtagaccgac tttatgggcc ggtcctcgag 5686
cggggtgccg cggtcctcgt cgtagaggaa ccccgcccac tccgagacga aggcccgggt 5746
ccaggccagc acgaaggagg ccacgtggga ggggtagcgg tcgttgtcca ccagcgggtc 5806
caccttctcc agggtatgca agcacatgtc cccctcgtcc acatccagga aggtgattgg 5866
cttgtaagtg taggccacgt gaccgggggt cccggccggg ggggtataaa agggggcggg 5926
cccctgctcg tcctcactgt cttccggatc gctgtccagg agcgccagct gttggggtag 5986
gtattccctc tcgaaggcgg gcatgacctc ggcactcagg ttgtcagttt ctagaaacga 6046
ggaggatttg atattgacgg tgccgttgga gacgcctttc atgagcccct cgtccatctg 6106
gtcagaaaag acgatctttt tgttgtcgag cttggtggcg aaggagccgt agagggcatt 6166
ggagaggagc ttggcgatgg agcgcatggt ctggttcttt tccttgtcgg cgcgctcctt 6226
ggcggcgatg ttgagctgca cgtactcgcg cgccacgcac ttccattcgg ggaagacggt 6286
ggtgagctcg tcgggcacga ttctgacccg ccagccgcgg ttgtgcaggg tgatgaggtc 6346
cacgctggtg gccacctcgc cgcgcagggg ctcgttggtc cagcagaggc gcccgccctt 6406
gcgcgagcag aaggggggca gcgggtccag catgagctcg tcgggggggt cggcgtccac 6466
ggtgaagatg ccgggcagga gctcggggtc gaagtagctg atgcaggtgc ccagatcgtc 6526
cagcgccgct tgccagtcgc gcacggccag cgcgcgctcg taggggctga ggggcgtgcc 6586
ccagggcatg gggtgcgtga gcgcggaggc gtacatgccg cagatgtcgt agacgtagag 6646
gggctcctcg aggacgccga tgtaggtggg gtagcagcgc cccccgcgga tgctggcgcg 6706
cacgtagtcg tacagctcgt gcgagggcgc gaggagcccc gcgccgaggt tggagcgctg 6766
cggcttttcg gcgcggtaga cgatctggcg gaagatggcg tgggagttgg aggagatggt 6826
gggcctctgg aagatgttga agtgggcgtg gggcaggccg accgagtccc tgatgaagtg 6886
ggcgtaggag tcctgcagct tggcgacgag ctcggcggtg acgaggacgt ccagggcgca 6946
gtagtcgagg gtctcttgga tgatgtcgta cttgagctgg cccttctgct tccacagctc 7006
gcggttgaga aggaactctt cgcggtcctt ccagtactct tcgaggggga acccgtcctg 7066
atcggcacgg taagagccca ccatgtagaa ctggttgacg gccttgtagg cgcagcagcc 7126
cttctccacg gggagggcat aagcttgcgc ggccttgcgc agggaggtgt gggtgagggc 7186
gaaggtgtcg cgcaccatga ccttgaggaa ctggtgcttg aagtcgaggt cgtcgcagcc 7246
gccctgctcc cagagttgga agtccgtgcg cttcttgtag gcggggttgg gcaaagcgaa 7306
agtaacatcg ttgaagagga tcttgcccgc gcggggcatg aagttgcgag tgatgcggaa 7366
aggctggggc acctcggccc ggttgttgat gacctgggcg gcgaggacga tctcgtcgaa 7426
gccgttgatg ttgtgcccga cgatgtagag ttccacgaat cgcgggcggc ccttgacgtg 7486
gggcagcttc ttgagctcgt cgtaggtgag ctcggcgggg tcgctgaggc cgtgctgctc 7546
aagggcccag tcggcgacgt gggggttggc gctgaggaag gaagtccaga gatccacggc 7606
cagggcggtt tgcaagcggt cccggtactg acggaactgc tggcccacgg ccattttttc 7666
gggggtgatg cagtagaagg tgcgggggtc gccgtgccag cggtcccact tgagctggag 7726
ggcgaggtcg tgggcgagct cgacgagcgg cgggtccccg gagagtttca tgaccagcat 7786
gaaggggacg agctgcttgc cgaaggaccc catccaggtg taggtttcca catcgtaggt 7846
gaggaagagc ctttcggtgc gaggatgcga gccgatgggg aagaactgga tctcctgcca 7906
ccagttggag gaatggctgt tgatgtgatg gaagtagaaa tgccgacggc gcgccgagca 7966
ctcgtgcttg tgtttataca agcgtccgca gtgctcgcaa cgctgcacgg gatgcacgtg 8026
ctgcacgagc tgtacctggg ttcctttgac gaggaatttc agtgggcagt ggagcgctgg 8086
cggctgcatc tggtgctgta ctacgtcctg gccatcggcg tggccatcgt ctgcctcgat 8146
ggtggtcatg ctgacgagcc cgcgcgggag gcaggtccag acctcggctc ggacgggtcg 8206
gagagcgagg acgagggcgc gcaggccgga gctgtccagg gtcctgagac gctgcggagt 8266
caggtcagtg ggcagcggcg gcgcgcggtt gacttgcagg agcttttcca gggcgcgcgg 8326
gaggtccaga tggtacttga tctccacggc gccgttggtg gcgacgtcca cggcttgcag 8386
ggtcccgtgc ccctggggcg ccaccaccgt gccccgtttc ttcttgggcg ctggcgttgg 8446
cgctgcttcc atgtcggtca gaagcggcgg cgaggacgcg cgccgggcgg caggggcggc 8506
tcggggcccg gaggcagggg cggcaggggc acgtcggcgc cgcgcgcggg caggttctgg 8566
tactgcgccc ggagaagact ggcgtgagcg acgacgcgac ggttgacgtc ctggatctga 8626
cgcctctggg tgaaggccac gggacccgtg agtttgaacc tgaaagagag ttcgacagaa 8686
tcaatctcgg tatcgttgac ggcggcctgc cgcaggatct cttgcacgtc gcccgagttg 8746
tcctggtagg cgatctcggt catgaactgc tcgatctcct cctcctgaag gtctccgcgg 8806
ccggcgcgct cgacggtggc cgcgaggtcg ttggagatgc ggcccatgag ctgcgagaag 8866
gcgttcatgc cggcctcgtt ccagacgcgg ctgtagacca cggctccgtc ggggtcgcgc 8926
gcgcgcatga ccacctgggc gaggttgagc tcgacgtggc gcgtgaagac cgcgtagttg 8986
cagaggcgct ggtagaggta gttgagcgtg gtggcgatgt gctcggtgac gaagaagtac 9046
atgatccagc ggcggagcgg catctcgctg acgtcgccca gggcttccaa gcgctccatg 9106
gcctcgtaga agtccacggc gaagttgaaa aactgggagt tgcgcgccga gacggtcaac 9166
tcctcctcca gaagacggat gagctcggcg atggtggcgc gcacttcgcg ctcgaaggcc 9226
ccggggggct cctcctcttc catctcctcc tcttcctcct cctccactaa catctcttct 9286
acttcctcct caggaggcgg cggcggggga gggggcctgc gtcgccggcg gcgcacgggc 9346
agacggtcga tgaagcgctc gatggtctcc ccgcgccggc gacgcatggt ctcggtgacg 9406
gcgcgcccgt cctcgcgggg ccgcagcgtg aagacgccgc cgcgcatttc caggtggccg 9466
ggggggtccc cgttgggcag ggagagggcg ctgacgatgc atcttatcaa ttgccccgta 9526
gggactccgc gcaaggacct gagcgtctcg agatccacgg gatctgaaaa ccgttgaacg 9586
aaggcttcga gccagtcgca gtcgcaaggt aggctgagca cggtttcttc tggcgggtca 9646
tgttggggag cggggcgggc gatgctgctg gtgatgaagt tgaaataggc ggttctgaga 9706
cggcggatgg tggcgaggag caccaggtct ttgggcccgg cttgctggat gcgcagacgg 9766
tcggccatgc cccaggcgtg gtcctgacac ctggccagat ccttgtagta gtcctgcatg 9826
agccgctcca cgggcacctc ctcctcgccc gcgcggccgt gcatgcgcgt gagcccgaag 9886
ccgcgctggg gctggacgag cgccaggtcg gcgacgacgc gctcggcgag gatggcctgc 9946
tggatctggg tgagggtggt ctggaagtcg tcaaagtcga cgaagcggtg gtaggctccg 10006
gtgttgatgg tgtaggagca gttggccatg acggaccagt tgacggtctg gtggccggga 10066
cgcacgagct cgtggtactt gaggcgcgag taggcgcgcg tgtcgaagat gtagtcgttg 10126
caggtgcgca ccaggtactg gtagccgatg aggaagtgcg gcggcggctg gcggtagagc 10186
ggccatcgct cggtggcggg ggcgccgggc gcgaggtcct cgagcatggt gcggtggtag 10246
ccgtagatgt acctggacat ccaggtgatg ccggcggcgg tggtggaggc gcgcgggaac 10306
tcgcggacgc ggttccagat gttgcgcagc ggcaggaagt agttcatggt gggcacggtc 10366
tggcccgtga ggcgcgcgca gtcgtggatg ctctatacgg gcaaaaacga aagcggtcag 10426
cggctcgact ccgtggcctg gaggctaagc gaacgggttg ggctgcgcgt gtaccccggt 10486
tcgaatctcg aatcaggctg gagccgcagc taacgtggta ctggcactcc cgtctcgacc 10546
caagcctgca ccaaccctcc aggatacgga ggcgggtcgt ttttgcaact ttttttcgga 10606
ggccggaaat gaagactagt aagcgcggaa agcggccgac cgcgatggct cgctgccgta 10666
gtctggagaa gaatcgccag ggttgcgttg cggtgtgccc cggttcgagg ccggccggat 10726
tccgcggcta acgagggcgt ggctgccccg tcgtttccaa gacccctagc cagccgactt 10786
ctccagttac ggagcgagcc cctcttttgt tttttgtttt tgccag atg cat ccc 10841
Met His Pro
gta ctg cgg cag atg cgc ccc cac cac cct cca ccg caa caa cag ccc 10889
Val Leu Arg Gln Met Arg Pro His His Pro Pro Pro Gln Gln Gln Pro
650 655 660 665
cct cct cca cag ccg gcg ctt ctg ccc ccg ccc cag cag cag cag caa 10937
Pro Pro Pro Gln Pro Ala Leu Leu Pro Pro Pro Gln Gln Gln Gln Gln
670 675 680
ctt cca gcc acg acc gcc gcg gcc gcc gtg agc ggg gct gga cag act 10985
Leu Pro Ala Thr Thr Ala Ala Ala Ala Val Ser Gly Ala Gly Gln Thr
685 690 695
tct cag tat gat cac ctg gcc ttg gaa gag ggc gag ggg ctg gcg cgc 11033
Ser Gln Tyr Asp His Leu Ala Leu Glu Glu Gly Glu Gly Leu Ala Arg
700 705 710
ctg ggg gcg tcg tcg ccg gag cgg cac ccg cgc gtg cag atg aaa agg 11081
Leu Gly Ala Ser Ser Pro Glu Arg His Pro Arg Val Gln Met Lys Arg
715 720 725
gac gct cgc gag gcc tac gtg ccc aag cag aac ctg ttc aga gac agg 11129
Asp Ala Arg Glu Ala Tyr Val Pro Lys Gln Asn Leu Phe Arg Asp Arg
730 735 740 745
agc ggc gag gag ccc gag gag atg cgc gcg gcc cgg ttc cac gcg ggg 11177
Ser Gly Glu Glu Pro Glu Glu Met Arg Ala Ala Arg Phe His Ala Gly
750 755 760
cgg gag ctg cgg cgc ggc ctg gac cga aag agg gtg ctg agg gac gag 11225
Arg Glu Leu Arg Arg Gly Leu Asp Arg Lys Arg Val Leu Arg Asp Glu
765 770 775
gat ttc gag gcg gac gag ctg acg ggg atc agc ccc gcg cgc gcg cac 11273
Asp Phe Glu Ala Asp Glu Leu Thr Gly Ile Ser Pro Ala Arg Ala His
780 785 790
gtg gcc gcg gcc aac ctg gtc acg gcg tac gag cag acc gtg aag gag 11321
Val Ala Ala Ala Asn Leu Val Thr Ala Tyr Glu Gln Thr Val Lys Glu
795 800 805
gag agc aac ttc caa aaa tcc ttc aac aac cac gtg cgc acc ctg atc 11369
Glu Ser Asn Phe Gln Lys Ser Phe Asn Asn His Val Arg Thr Leu Ile
810 815 820 825
gcg cgc gag gag gtg acc ctg ggc ctg atg cac ctg tgg gac ctg ctg 11417
Ala Arg Glu Glu Val Thr Leu Gly Leu Met His Leu Trp Asp Leu Leu
830 835 840
gag gcc atc gtg cag aac ccc acc agc aag ccg ctg acg gcg cag ctg 11465
Glu Ala Ile Val Gln Asn Pro Thr Ser Lys Pro Leu Thr Ala Gln Leu
845 850 855
ttc ctg gtg gtg cag cat agt cgg gac aac gag gcg ttc agg gag gcg 11513
Phe Leu Val Val Gln His Ser Arg Asp Asn Glu Ala Phe Arg Glu Ala
860 865 870
ctg ctg aat atc acc gag ccc gag ggc cgc tgg ctc ctg gac ctg gtg 11561
Leu Leu Asn Ile Thr Glu Pro Glu Gly Arg Trp Leu Leu Asp Leu Val
875 880 885
aac att ctg cag agc atc gtg gtg cag gag cgc ggg ctg ccg ctg tcc 11609
Asn Ile Leu Gln Ser Ile Val Val Gln Glu Arg Gly Leu Pro Leu Ser
890 895 900 905
gag aag ctg gcg gcc atc aac ttc tcg gtg ctg agt ctg ggc aag tac 11657
Glu Lys Leu Ala Ala Ile Asn Phe Ser Val Leu Ser Leu Gly Lys Tyr
910 915 920
tac gct agg aag atc tac aag acc ccg tac gtg ccc ata gac aag gag 11705
Tyr Ala Arg Lys Ile Tyr Lys Thr Pro Tyr Val Pro Ile Asp Lys Glu
925 930 935
gtg aag atc gac ggg ttt tac atg cgc atg acc ctg aaa gtg ctg acc 11753
Val Lys Ile Asp Gly Phe Tyr Met Arg Met Thr Leu Lys Val Leu Thr
940 945 950
ctg agc gac gat ctg ggg gtg tac cgc aac gac agg atg cac cgc gcg 11801
Leu Ser Asp Asp Leu Gly Val Tyr Arg Asn Asp Arg Met His Arg Ala
955 960 965
gtg agc gcc agc agg cgg cgc gag ctg agc gac cag gag ctg atg cac 11849
Val Ser Ala Ser Arg Arg Arg Glu Leu Ser Asp Gln Glu Leu Met His
970 975 980 985
agc ctg cag cgg gcc ctg acc ggg gcc ggg acc gag ggg gag agc tac 11897
Ser Leu Gln Arg Ala Leu Thr Gly Ala Gly Thr Glu Gly Glu Ser Tyr
990 995 1000
ttt gac atg ggc gcg gac ctg cac tgg caa ccc agc cgc cgg gcc 11942
Phe Asp Met Gly Ala Asp Leu His Trp Gln Pro Ser Arg Arg Ala
1005 1010 1015
ttg gag gcg gcg gca gga ccc tac gta gaa gag gtg gac gat gag 11987
Leu Glu Ala Ala Ala Gly Pro Tyr Val Glu Glu Val Asp Asp Glu
1020 1025 1030
gtg gac gag gag ggc gag tac ctg gaa gac tgatggcgcg accgtatttt 12037
Val Asp Glu Glu Gly Glu Tyr Leu Glu Asp
1035 1040
tgctag atg caa caa cag cca cct cct gat ccc gcg atg cgg gcg gcg 12085
Met Gln Gln Gln Pro Pro Pro Asp Pro Ala Met Arg Ala Ala
1045 1050 1055
ctg cag agc cag ccg tcc ggc att aac tcc tcg gac gat tgg acc 12130
Leu Gln Ser Gln Pro Ser Gly Ile Asn Ser Ser Asp Asp Trp Thr
1060 1065 1070
cag gcc atg caa cgc atc atg gcg ctg acg acc cgc aac ccc gaa 12175
Gln Ala Met Gln Arg Ile Met Ala Leu Thr Thr Arg Asn Pro Glu
1075 1080 1085
gcc ttt aga cag cag ccc cag gcc aac cgg ctc tcg gcc atc ctg 12220
Ala Phe Arg Gln Gln Pro Gln Ala Asn Arg Leu Ser Ala Ile Leu
1090 1095 1100
gag gcc gtg gtg ccc tcg cgc tcc aac ccc acg cac gag aag gtc 12265
Glu Ala Val Val Pro Ser Arg Ser Asn Pro Thr His Glu Lys Val
1105 1110 1115
ctg gcc atc gtg aac gcg ctg gtg gag aac aag gcc atc cgc ggc 12310
Leu Ala Ile Val Asn Ala Leu Val Glu Asn Lys Ala Ile Arg Gly
1120 1125 1130
gac gag gcc ggc ctg gtg tac aac gcg ctg ctg gag cgc gtg gcc 12355
Asp Glu Ala Gly Leu Val Tyr Asn Ala Leu Leu Glu Arg Val Ala
1135 1140 1145
cgc tac aac agc acc aac gtg cag acc aac ctg gac agg atg gtg 12400
Arg Tyr Asn Ser Thr Asn Val Gln Thr Asn Leu Asp Arg Met Val
1150 1155 1160
acc gac gtg cgc gag gcc gtg gcc cag cgc gag cgg ttc cac cgc 12445
Thr Asp Val Arg Glu Ala Val Ala Gln Arg Glu Arg Phe His Arg
1165 1170 1175
gag tcc aac ctg gga tcc atg gtg gcg ctg aac gcc ttc ctc agc 12490
Glu Ser Asn Leu Gly Ser Met Val Ala Leu Asn Ala Phe Leu Ser
1180 1185 1190
acc cag ccc gcc aac gtg ccc cgg ggc cag gag gac tac acc aac 12535
Thr Gln Pro Ala Asn Val Pro Arg Gly Gln Glu Asp Tyr Thr Asn
1195 1200 1205
ttc atc agc gcc ctg cgc ctg atg gtg acc gag gtg ccc cag agc 12580
Phe Ile Ser Ala Leu Arg Leu Met Val Thr Glu Val Pro Gln Ser
1210 1215 1220
gag gtg tac cag tcc ggg ccg gac tac ttc ttc cag acc agt cgc 12625
Glu Val Tyr Gln Ser Gly Pro Asp Tyr Phe Phe Gln Thr Ser Arg
1225 1230 1235
cag ggc ttg cag acc gtg aac ctg agc cag gcg ttc aag aac ttg 12670
Gln Gly Leu Gln Thr Val Asn Leu Ser Gln Ala Phe Lys Asn Leu
1240 1245 1250
cag ggc ctg tgg ggc gtg cag gcc ccg gtc ggg gac cgc gcg acg 12715
Gln Gly Leu Trp Gly Val Gln Ala Pro Val Gly Asp Arg Ala Thr
1255 1260 1265
gtg tcg agc ctg ctg acg ccg aac tcg cgc ctg ctg ctg ctg ctg 12760
Val Ser Ser Leu Leu Thr Pro Asn Ser Arg Leu Leu Leu Leu Leu
1270 1275 1280
gtg gcc ccc ttc acg gac agc ggc agt atc aac cgc gac tcg tac 12805
Val Ala Pro Phe Thr Asp Ser Gly Ser Ile Asn Arg Asp Ser Tyr
1285 1290 1295
ctg ggc tac ctg att aac ctg tac cgc gag gcc atc ggc cag gcg 12850
Leu Gly Tyr Leu Ile Asn Leu Tyr Arg Glu Ala Ile Gly Gln Ala
1300 1305 1310
cac gtg gac gag cag acc tac cag gag atc acc cac gtg agc cgc 12895
His Val Asp Glu Gln Thr Tyr Gln Glu Ile Thr His Val Ser Arg
1315 1320 1325
gcg ctg ggc cag gag gac ccg ggc aac ctg gag gcc acc ctg aac 12940
Ala Leu Gly Gln Glu Asp Pro Gly Asn Leu Glu Ala Thr Leu Asn
1330 1335 1340
ttc ctg ctg acc aac cgg tcg cag aag atc ccg ccc cag tac gcg 12985
Phe Leu Leu Thr Asn Arg Ser Gln Lys Ile Pro Pro Gln Tyr Ala
1345 1350 1355
ctg agc acc gag gag gag cgc att ttg cgc tac gtg cag cag agc 13030
Leu Ser Thr Glu Glu Glu Arg Ile Leu Arg Tyr Val Gln Gln Ser
1360 1365 1370
gtg ggg ctg ttc ctg atg cag gag ggg gcc acg ccc agc gcc gcg 13075
Val Gly Leu Phe Leu Met Gln Glu Gly Ala Thr Pro Ser Ala Ala
1375 1380 1385
ctc gac atg acc gcg cgc aac atg gag ccc agc atg tac gcc cgc 13120
Leu Asp Met Thr Ala Arg Asn Met Glu Pro Ser Met Tyr Ala Arg
1390 1395 1400
aac cgc ccg ttc atc aat aag ctg atg gac tac ttg cat cgg gcg 13165
Asn Arg Pro Phe Ile Asn Lys Leu Met Asp Tyr Leu His Arg Ala
1405 1410 1415
gcc gcc atg aac tcg gac tac ttt acc aac gcc atc ttg aac ccg 13210
Ala Ala Met Asn Ser Asp Tyr Phe Thr Asn Ala Ile Leu Asn Pro
1420 1425 1430
cac tgg ctc ccg ccg ccc ggg ttc tac acg ggc gag tac gac atg 13255
His Trp Leu Pro Pro Pro Gly Phe Tyr Thr Gly Glu Tyr Asp Met
1435 1440 1445
ccc gac ccc aac gac ggg ttc ctg tgg gat gac gtg gac agc agc 13300
Pro Asp Pro Asn Asp Gly Phe Leu Trp Asp Asp Val Asp Ser Ser
1450 1455 1460
gtg ttc tcg ccg cgc ccc gcc acc acc gtg tgg aag aaa gag ggc 13345
Val Phe Ser Pro Arg Pro Ala Thr Thr Val Trp Lys Lys Glu Gly
1465 1470 1475
ggg gac cgg cgg ccg tcc tcg gcg ctg tcc ggt cgc gcg ggt gct 13390
Gly Asp Arg Arg Pro Ser Ser Ala Leu Ser Gly Arg Ala Gly Ala
1480 1485 1490
gcc gcg gcg gtg ccc gag gcc gcc agt ccg ttc ccg agc ttg ccc 13435
Ala Ala Ala Val Pro Glu Ala Ala Ser Pro Phe Pro Ser Leu Pro
1495 1500 1505
ttc tcg ctg aac agt att cgc agc agc gag ctg ggc agg atc acg 13480
Phe Ser Leu Asn Ser Ile Arg Ser Ser Glu Leu Gly Arg Ile Thr
1510 1515 1520
cgc ccg cgc ttg ctg ggc gag gag gag tac ttg aat gac tcg ctg 13525
Arg Pro Arg Leu Leu Gly Glu Glu Glu Tyr Leu Asn Asp Ser Leu
1525 1530 1535
ttg aga ccc gag cgc gag aag aac ttc ccc aat aac ggg ata gag 13570
Leu Arg Pro Glu Arg Glu Lys Asn Phe Pro Asn Asn Gly Ile Glu
1540 1545 1550
agc ctg gtg gac aag atg agc cgc tgg aag acg tac gcg cac gag 13615
Ser Leu Val Asp Lys Met Ser Arg Trp Lys Thr Tyr Ala His Glu
1555 1560 1565
cac agg gac gag ccc cga gct agc agc gcc acc cgt aga cgc cag 13660
His Arg Asp Glu Pro Arg Ala Ser Ser Ala Thr Arg Arg Arg Gln
1570 1575 1580
cgg cac gac agg cag cgg gga ctg gtg tgg gac gat gag gat tcc 13705
Arg His Asp Arg Gln Arg Gly Leu Val Trp Asp Asp Glu Asp Ser
1585 1590 1595
gcc gac gac agc agc gtg ttg gac ttg ggt ggg agt ggt ggt ggt 13750
Ala Asp Asp Ser Ser Val Leu Asp Leu Gly Gly Ser Gly Gly Gly
1600 1605 1610
aac ccg ttc gct cac ctg cgt ccc cgt atc ggg cgc ctg atg 13792
Asn Pro Phe Ala His Leu Arg Pro Arg Ile Gly Arg Leu Met
1615 1620
taagaatctg aaaaaataaa aaacggtact caccaaggcc atggcgacca gcgtgcgttc 13852
ttctctgttg tttgtagtag t atg atg agg cgc gtg tac ccg gag ggt cct 13903
Met Met Arg Arg Val Tyr Pro Glu Gly Pro
1625 1630
cct ccc tcg tac gag agc gtg atg cag cag gcg gtg gcg gcg gcg 13948
Pro Pro Ser Tyr Glu Ser Val Met Gln Gln Ala Val Ala Ala Ala
1635 1640 1645
atg cag ccc ccg ctg gag gcg cct tac gtg ccc ccg cgg tac ctg 13993
Met Gln Pro Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu
1650 1655 1660
gcg cct acg gag ggg cgg aac agc att cgt tac tcg gag ctg gca 14038
Ala Pro Thr Glu Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu Ala
1665 1670 1675
ccc ttg tac gat acc acc cgg ttg tac ctg gtg gac aac aag tcg 14083
Pro Leu Tyr Asp Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser
1680 1685 1690
gcg gac atc gcc tcg ctg aac tac cag aac gac cac agc aac ttc 14128
Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn Asp His Ser Asn Phe
1695 1700 1705
ctg acc acc gtg gtg cag aac aac gat ttc acc ccc acg gag gcc 14173
Leu Thr Thr Val Val Gln Asn Asn Asp Phe Thr Pro Thr Glu Ala
1710 1715 1720
agc acc cag acc atc aac ttt gac gag cgc tcg cgg tgg ggc ggc 14218
Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly
1725 1730 1735
cag ctg aaa acc atc atg cac acc aac atg ccc aac gtg aac gag 14263
Gln Leu Lys Thr Ile Met His Thr Asn Met Pro Asn Val Asn Glu
1740 1745 1750
ttc atg tac agc aac aag ttc aag gcg cgg gtg atg gtc tcg cgc 14308
Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser Arg
1755 1760 1765
aag acc ccc aac ggg gtg acg gta ggg gat gat tat gat ggt agt 14353
Lys Thr Pro Asn Gly Val Thr Val Gly Asp Asp Tyr Asp Gly Ser
1770 1775 1780
cag gac gag ctg acc tac gag tgg gtg gag ttt gag ctg ccc gag 14398
Gln Asp Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu
1785 1790 1795
ggc aac ttc tcg gtg acc atg acc atc gat ctg atg aac aac gcc 14443
Gly Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala
1800 1805 1810
atc atc gac aac tac ttg gcg gtg ggg cgg cag aac ggg gtg ctg 14488
Ile Ile Asp Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu
1815 1820 1825
gag agc gac atc ggc gtg aag ttc gac acg cgc aac ttc cgg ctg 14533
Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu
1830 1835 1840
ggc tgg gac ccc gtg acc gag ctg gtg atg ccg ggc gtg tac acc 14578
Gly Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr
1845 1850 1855
aac gag gcc ttc cac ccc gac atc gtc ctg ctg ccc ggc tgc ggc 14623
Asn Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly
1860 1865 1870
gtg gac ttc acc gag agc cgc ctc agc aac ctg ctg ggc atc cgc 14668
Val Asp Phe Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg
1875 1880 1885
aag cgg cag ccc ttc cag gag ggc ttc cag atc ctg tac gag gac 14713
Lys Arg Gln Pro Phe Gln Glu Gly Phe Gln Ile Leu Tyr Glu Asp
1890 1895 1900
ctg gag ggg ggc aac atc ccc gcg ctg ctg gat gtg gac gcc tac 14758
Leu Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp Val Asp Ala Tyr
1905 1910 1915
gag aaa agc aag gag gat agc gcc gcc gcg gcg acc gca gcc gtg 14803
Glu Lys Ser Lys Glu Asp Ser Ala Ala Ala Ala Thr Ala Ala Val
1920 1925 1930
gcc acc gcc tct acc gag gtg cgg ggc gat aat ttt gct agc gcc 14848
Ala Thr Ala Ser Thr Glu Val Arg Gly Asp Asn Phe Ala Ser Ala
1935 1940 1945
gcg gca gtg gcc gag gcg gct gaa acc gaa agt aag ata gtg atc 14893
Ala Ala Val Ala Glu Ala Ala Glu Thr Glu Ser Lys Ile Val Ile
1950 1955 1960
cag ccg gtg gag aag gac agc aag aac agg agc tac aac gtg ctc 14938
Gln Pro Val Glu Lys Asp Ser Lys Asn Arg Ser Tyr Asn Val Leu
1965 1970 1975
gcg gac aag aaa aac acc gcc tac cgc agc tgg tac ctg gcc tac 14983
Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr
1980 1985 1990
aac tac ggc gac ccc gag aag ggc gtg cgc tcc tgg acg ctg ctc 15028
Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu
1995 2000 2005
acc acc tcg gac gtc acc tgc ggc gtg gag caa gtc tac tgg tcg 15073
Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val Tyr Trp Ser
2010 2015 2020
ctg ccc gac atg atg caa gac ccg gtc acc ttc cgc tcc acg cga 15118
Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser Thr Arg
2025 2030 2035
caa gtt agc aac tac ccg gtg gtg ggc gcc gag ctc ctg ccc gtc 15163
Gln Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val
2040 2045 2050
tac tcc aag agc ttc ttc aac gag cag gcc gtc tac tcg cag cag 15208
Tyr Ser Lys Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln
2055 2060 2065
ctg cgc gcc ttc acc tcg ctc acg cac gtc ttc aac cgc ttc ccc 15253
Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro
2070 2075 2080
gag aac cag atc ctc gtt cgc ccg ccc gcg ccc acc att acc acc 15298
Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr
2085 2090 2095
gtc agt gaa aac gtt cct gct ctc aca gat cac ggg acc ctg ccg 15343
Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro
2100 2105 2110
ctg cgc agc agt atc cgg gga gtc cag cgc gtg acc gtc act gac 15388
Leu Arg Ser Ser Ile Arg Gly Val Gln Arg Val Thr Val Thr Asp
2115 2120 2125
gcc aga cgc cgc acc tgc ccc tac gtc tac aag gcc ctg ggc gta 15433
Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val
2130 2135 2140
gtc gcg ccg cgc gtc ctc tcg agc cgc acc ttc taaaaa atg tcc att 15481
Val Ala Pro Arg Val Leu Ser Ser Arg Thr Phe Met Ser Ile
2145 2150 2155
ctc atc tcg ccc agt aat aac acc ggt tgg ggc ctg cgc gcg ccc 15526
Leu Ile Ser Pro Ser Asn Asn Thr Gly Trp Gly Leu Arg Ala Pro
2160 2165 2170
agc aag atg tac gga ggc gct cgc caa cgc tcc acg caa cac ccc 15571
Ser Lys Met Tyr Gly Gly Ala Arg Gln Arg Ser Thr Gln His Pro
2175 2180 2185
gtg cgc gtg cgc ggg cac ttc cgc gct ccc tgg ggc gcc ctc aag 15616
Val Arg Val Arg Gly His Phe Arg Ala Pro Trp Gly Ala Leu Lys
2190 2195 2200
ggc cgc gtg cgc tcg cgc acc acc gtc gac gac gtg atc gac cag 15661
Gly Arg Val Arg Ser Arg Thr Thr Val Asp Asp Val Ile Asp Gln
2205 2210 2215
gtg gtg gcc gac gcg cgc aac tac acg ccc gcc gcc gcg ccc gtc 15706
Val Val Ala Asp Ala Arg Asn Tyr Thr Pro Ala Ala Ala Pro Val
2220 2225 2230
tcc acc gtg gac gcc gtc atc gac agc gtg gtg gcc gac gcg cgc 15751
Ser Thr Val Asp Ala Val Ile Asp Ser Val Val Ala Asp Ala Arg
2235 2240 2245
cgg tac gcc cgc gcc aag agc cgg cgg cgg cgc atc gcc cgg cgg 15796
Arg Tyr Ala Arg Ala Lys Ser Arg Arg Arg Arg Ile Ala Arg Arg
2250 2255 2260
cac cgg agc acc ccc gcc atg cgc gcg gca cga gcc ttg ctg cgc 15841
His Arg Ser Thr Pro Ala Met Arg Ala Ala Arg Ala Leu Leu Arg
2265 2270 2275
agg gcc agg cgc acg gga cgc agg gcc atg ctc agg gcg gcc aga 15886
Arg Ala Arg Arg Thr Gly Arg Arg Ala Met Leu Arg Ala Ala Arg
2280 2285 2290
cgc gcg gcc tcc ggc agc agc agc gcc ggc agg acc cgc aga cgc 15931
Arg Ala Ala Ser Gly Ser Ser Ser Ala Gly Arg Thr Arg Arg Arg
2295 2300 2305
gcg gcc acg gcg gcg gcg gcg gcc atc gcc agc atg tcc cgc ccg 15976
Ala Ala Thr Ala Ala Ala Ala Ala Ile Ala Ser Met Ser Arg Pro
2310 2315 2320
cgg cgc ggc aac gtg tac tgg gtg cgc gac gcc gcc acc ggt gtg 16021
Arg Arg Gly Asn Val Tyr Trp Val Arg Asp Ala Ala Thr Gly Val
2325 2330 2335
cgc gtg ccc gtg cgc acc cgc ccc cct cgc act tgaagatgct 16064
Arg Val Pro Val Arg Thr Arg Pro Pro Arg Thr
2340 2345
gacttcgcga tgttgatgtg tcccagcggc gaggagg atg tcc aag cgc aaa 16116
Met Ser Lys Arg Lys
2350
tac aag gaa gag atg ctc cag gtc atc gcg cct gag atc tac ggc 16161
Tyr Lys Glu Glu Met Leu Gln Val Ile Ala Pro Glu Ile Tyr Gly
2355 2360 2365
ccc gcg gtg gtg aag gag gaa aga aag ccc cgc aaa ctg aag cgg 16206
Pro Ala Val Val Lys Glu Glu Arg Lys Pro Arg Lys Leu Lys Arg
2370 2375 2380
gtc aaa aag gac aaa aag gag gag gaa gat gac gga ctg gtg gag 16251
Val Lys Lys Asp Lys Lys Glu Glu Glu Asp Asp Gly Leu Val Glu
2385 2390 2395
ttt gtg cgc gag ttc gcc ccc cgg cgg cgc gtg cag tgg cgc ggg 16296
Phe Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gln Trp Arg Gly
2400 2405 2410
cgg aaa gtg aaa ccg gtg ctg cga ccc ggc acc acg gtg gtc ttc 16341
Arg Lys Val Lys Pro Val Leu Arg Pro Gly Thr Thr Val Val Phe
2415 2420 2425
acg ccc ggc gag cgt tcc ggc tcc gcc tcc aag cgc tcc tac gac 16386
Thr Pro Gly Glu Arg Ser Gly Ser Ala Ser Lys Arg Ser Tyr Asp
2430 2435 2440
gag gtg tac ggg gac gag gac atc ctc gag cag gcg gtc gag cgt 16431
Glu Val Tyr Gly Asp Glu Asp Ile Leu Glu Gln Ala Val Glu Arg
2445 2450 2455
ctg ggc gag ttt gct tac ggc aag cgc agc cgc ccc gcg ccc ttg 16476
Leu Gly Glu Phe Ala Tyr Gly Lys Arg Ser Arg Pro Ala Pro Leu
2460 2465 2470
aaa gag gag gcg gtg tcc atc ccg ctg gac cac ggc aac ccc acg 16521
Lys Glu Glu Ala Val Ser Ile Pro Leu Asp His Gly Asn Pro Thr
2475 2480 2485
ccg agc ctg aag ccg gtg acc ctg cag cag gtg ctg ccg agc gcg 16566
Pro Ser Leu Lys Pro Val Thr Leu Gln Gln Val Leu Pro Ser Ala
2490 2495 2500
gcg ccg cgc cgg ggc ttc aag cgc gag ggc ggc gag gat ctg tac 16611
Ala Pro Arg Arg Gly Phe Lys Arg Glu Gly Gly Glu Asp Leu Tyr
2505 2510 2515
ccg acc atg cag ctg atg gtg ccc aag cgc cag aag ctg gag gac 16656
Pro Thr Met Gln Leu Met Val Pro Lys Arg Gln Lys Leu Glu Asp
2520 2525 2530
gtg ctg gag cac atg aag gtg gac ccc gag gtg cag ccc gag gtc 16701
Val Leu Glu His Met Lys Val Asp Pro Glu Val Gln Pro Glu Val
2535 2540 2545
aag gtg cgg ccc atc aag cag gtg gcc ccg ggc ctg ggc gtg cag 16746
Lys Val Arg Pro Ile Lys Gln Val Ala Pro Gly Leu Gly Val Gln
2550 2555 2560
acc gtg gac atc aag atc ccc acg gag ccc atg gaa acg cag acc 16791
Thr Val Asp Ile Lys Ile Pro Thr Glu Pro Met Glu Thr Gln Thr
2565 2570 2575
gag ccc gtg aag ccc agc acc agc acc atg gag gtg cag acg gat 16836
Glu Pro Val Lys Pro Ser Thr Ser Thr Met Glu Val Gln Thr Asp
2580 2585 2590
ccc tgg atg ccc gcg gct ccc acc acc acc act cgc cga aga cgc 16881
Pro Trp Met Pro Ala Ala Pro Thr Thr Thr Thr Arg Arg Arg Arg
2595 2600 2605
aag tac ggc gcg gcc agc ctg ctg atg ccc aac tac gcg ctg cat 16926
Lys Tyr Gly Ala Ala Ser Leu Leu Met Pro Asn Tyr Ala Leu His
2610 2615 2620
cct tcc atc atc ccc acg ccg ggc tac cgc ggc acg cgc ttc tac 16971
Pro Ser Ile Ile Pro Thr Pro Gly Tyr Arg Gly Thr Arg Phe Tyr
2625 2630 2635
cgc ggc tac acc agc agc cgc cgc cgc aag acc acc acc cgc cgc 17016
Arg Gly Tyr Thr Ser Ser Arg Arg Arg Lys Thr Thr Thr Arg Arg
2640 2645 2650
cgc cgc cgt cgc cac acc cgc cgc agc agc acc gcg act tcc gcc 17061
Arg Arg Arg Arg His Thr Arg Arg Ser Ser Thr Ala Thr Ser Ala
2655 2660 2665
gcc gcc ttg gtg cgg aga gtg tac cgc agc ggg cgc gag cct ctg 17106
Ala Ala Leu Val Arg Arg Val Tyr Arg Ser Gly Arg Glu Pro Leu
2670 2675 2680
acc ctg ccg cgc gcg cgc tac cac ccg agc atc gcc att taactctgcc 17155
Thr Leu Pro Arg Ala Arg Tyr His Pro Ser Ile Ala Ile
2685 2690 2695
gtcgcctcct tgcagat atg gcc ctc aca tgc cgc ctc cgc gtc ccc att 17205
Met Ala Leu Thr Cys Arg Leu Arg Val Pro Ile
2700 2705
acg ggc tac cga gga aga aag ccg cgc cgt aga agg ctg acg ggg 17250
Thr Gly Tyr Arg Gly Arg Lys Pro Arg Arg Arg Arg Leu Thr Gly
2710 2715 2720
aac ggg ctg cgt cgc cat cac cac cgg cgg cgg cgc gcc atc agc 17295
Asn Gly Leu Arg Arg His His His Arg Arg Arg Arg Ala Ile Ser
2725 2730 2735
aag cgg ttg ggg gga ggc ttc ctg ccc gcg ctg atc ccc atc atc 17340
Lys Arg Leu Gly Gly Gly Phe Leu Pro Ala Leu Ile Pro Ile Ile
2740 2745 2750
gcc gcg gcg atc ggg gcg atc ccc ggc ata gct tcc gtg gcg gtg 17385
Ala Ala Ala Ile Gly Ala Ile Pro Gly Ile Ala Ser Val Ala Val
2755 2760 2765
cag gcc tct cag cgc cac tgagacacaa aaaagcatgg atttgtaata 17433
Gln Ala Ser Gln Arg His
2770
aaaaaatgga ctgacgctcc tggtcctgtg atgtgtgttt ttag atg gaa gac atc 17489
Met Glu Asp Ile
2775
aat ttt tcg tcc ctg gca ccg cga cac ggc acg cgg ccg ttt atg 17534
Asn Phe Ser Ser Leu Ala Pro Arg His Gly Thr Arg Pro Phe Met
2780 2785 2790
ggc acc tgg agc gac atc ggc aac agc caa ctg aac ggg ggc gcc 17579
Gly Thr Trp Ser Asp Ile Gly Asn Ser Gln Leu Asn Gly Gly Ala
2795 2800 2805
ttc aat tgg agc agt ctc tgg agc ggg ctt aag aat ttc ggg tcc 17624
Phe Asn Trp Ser Ser Leu Trp Ser Gly Leu Lys Asn Phe Gly Ser
2810 2815 2820
acg ctc aaa acc tat ggc aac aag gcg tgg aac agc agc aca ggg 17669
Thr Leu Lys Thr Tyr Gly Asn Lys Ala Trp Asn Ser Ser Thr Gly
2825 2830 2835
cag gcg ctg agg gaa aag ctg aaa gag cag aac ttc cag cag aag 17714
Gln Ala Leu Arg Glu Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys
2840 2845 2850
gtg gtc gat ggc ctg gcc tcg ggc atc aac ggg gtg gtg gac ctg 17759
Val Val Asp Gly Leu Ala Ser Gly Ile Asn Gly Val Val Asp Leu
2855 2860 2865
gcc aac cag gcc gtg cag aaa cag atc aac agc cgc ctg gac gcg 17804
Ala Asn Gln Ala Val Gln Lys Gln Ile Asn Ser Arg Leu Asp Ala
2870 2875 2880
gtc ccg ccc gcg ggc tcc gtg gag atg ccc cag gtg gag gag gag 17849
Val Pro Pro Ala Gly Ser Val Glu Met Pro Gln Val Glu Glu Glu
2885 2890 2895
ctg cct ccc ctg gac aag cgc ggc gac aag cga ccg cgt ccc gac 17894
Leu Pro Pro Leu Asp Lys Arg Gly Asp Lys Arg Pro Arg Pro Asp
2900 2905 2910
gcg gag gag acg ctg ctg acg cac acg gac gag ccg ccc ccg tac 17939
Ala Glu Glu Thr Leu Leu Thr His Thr Asp Glu Pro Pro Pro Tyr
2915 2920 2925
gag gag gcg gtg aaa ctg ggt ctg ccc acc acg cgg ccc atc gcg 17984
Glu Glu Ala Val Lys Leu Gly Leu Pro Thr Thr Arg Pro Ile Ala
2930 2935 2940
ccc ctg gcc acc ggg gtg ctg aaa ccc gaa agt aat aag ccc gcg 18029
Pro Leu Ala Thr Gly Val Leu Lys Pro Glu Ser Asn Lys Pro Ala
2945 2950 2955
acc ctg gac ttg cct cct ccc cag cct tcc cgc ccc tcc aca gtg 18074
Thr Leu Asp Leu Pro Pro Pro Gln Pro Ser Arg Pro Ser Thr Val
2960 2965 2970
gct aag ccc ctg ccg ccg gtg gcc gtg gcc cgc gcg cga ccc ggg 18119
Ala Lys Pro Leu Pro Pro Val Ala Val Ala Arg Ala Arg Pro Gly
2975 2980 2985
ggc acc gcc cgc cct cat gcg aac tgg cag agc act ctg aac agc 18164
Gly Thr Ala Arg Pro His Ala Asn Trp Gln Ser Thr Leu Asn Ser
2990 2995 3000
atc gtg ggt ctg gga gtg cag agt gtg aag cgc cgc cgc tgc tat 18209
Ile Val Gly Leu Gly Val Gln Ser Val Lys Arg Arg Arg Cys Tyr
3005 3010 3015
taaacctacc gtagcgctta acttgcttgt ctgtgtgtgt atgtattatg tcgccgccgc 18269
tgtcgccaga aggaggagtg aagaggcgcg tcgccgagtt gcaag atg gcc acc 18323
Met Ala Thr
3020
cca tcg atg ctg ccc cag tgg gcg tac atg cac atc gcc gga cag 18368
Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala Gly Gln
3025 3030 3035
gac gct tcg gag tac ctg agt ccg ggt ctg gtg cag ttt gcc cgc 18413
Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala Arg
3040 3045 3050
gcc aca gac acc tac ttc agt ctg ggg aac aag ttt agg aac ccc 18458
Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro
3055 3060 3065
acg gtg gcg ccc acg cac gat gtg acc acc gac cgc agc cag cgg 18503
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg
3070 3075 3080
ctg acg ctg cgc ttc gtg ccc gtg gac cgc gag gac aac acc tac 18548
Leu Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr
3085 3090 3095
tcg tac aaa gtg cgc tac acg ctg gcc gtg ggc gac aac cgc gtg 18593
Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val
3100 3105 3110
ctg gac atg gcc agc acc tac ttt gac atc cgg ggc gtg ctg gac 18638
Leu Asp Met Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp
3115 3120 3125
cgg ggc cct agc ttc aaa ccc tac tcc ggc acc gcc tac aac agc 18683
Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser
3130 3135 3140
ctg gcc ccc aag gga gca ccc aac act tgt cag tgg aca tat aaa 18728
Leu Ala Pro Lys Gly Ala Pro Asn Thr Cys Gln Trp Thr Tyr Lys
3145 3150 3155
gcc gat ggt gaa act gcc aca gaa aaa acc tat aca tat gga aat 18773
Ala Asp Gly Glu Thr Ala Thr Glu Lys Thr Tyr Thr Tyr Gly Asn
3160 3165 3170
gca ccc gtg cag ggc att aac atc aca aaa gat ggt att caa ctt 18818
Ala Pro Val Gln Gly Ile Asn Ile Thr Lys Asp Gly Ile Gln Leu
3175 3180 3185
gga act gac acc gat gat cag cca atc tat gca gat gaa acc tat 18863
Gly Thr Asp Thr Asp Asp Gln Pro Ile Tyr Ala Asp Glu Thr Tyr
3190 3195 3200
cag cct gaa cct caa gtg ggt gat gct gaa tgg cat gac atc act 18908
Gln Pro Glu Pro Gln Val Gly Asp Ala Glu Trp His Asp Ile Thr
3205 3210 3215
ggt act gat gaa aag tat gga ggc aga gct ctt aag cct gat acc 18953
Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu Lys Pro Asp Thr
3220 3225 3230
aaa atg aag cct tgt tat ggt tct ttt gcc aag cct act aat aaa 18998
Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Lys
3235 3240 3245
gaa gga ggt cag gca aat gtg aaa aca gga aca ggc act act aaa 19043
Glu Gly Gly Gln Ala Asn Val Lys Thr Gly Thr Gly Thr Thr Lys
3250 3255 3260
gaa tat gac ata gac atg gct ttc ttt gac aac aga agt gcg gcc 19088
Glu Tyr Asp Ile Asp Met Ala Phe Phe Asp Asn Arg Ser Ala Ala
3265 3270 3275
gct gcc ggc cta gct cca gaa att gtt ttg tat act gaa aat gtg 19133
Ala Ala Gly Leu Ala Pro Glu Ile Val Leu Tyr Thr Glu Asn Val
3280 3285 3290
gat ttg gaa act cca gat acc cat att gta tac aaa gca ggc aca 19178
Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr Lys Ala Gly Thr
3295 3300 3305
gat gac agc agc tct tct att aat ttg ggc cag caa gcc atg ccc 19223
Asp Asp Ser Ser Ser Ser Ile Asn Leu Gly Gln Gln Ala Met Pro
3310 3315 3320
aac aga cct aac tac att ggc ttc aga gac aac ttt atc ggg ctc 19268
Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu
3325 3330 3335
atg tac tac aac agc act ggc aat atg ggg gtg ctg gcc ggt cag 19313
Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln
3340 3345 3350
gct tct cag ctg aat gct gtg gtt gac ttg caa gac aga aac acc 19358
Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr
3355 3360 3365
gag ctg tcc tac cag ctc ttg ctt gac tct ctg ggc gac aga acc 19403
Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr
3370 3375 3380
cgg tat ttc agt atg tgg aat cag gcg gtg gac agc tat gat cct 19448
Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro
3385 3390 3395
gat gta cgc att att gaa aat cat ggt gtg gag gat gaa ctt ccc 19493
Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro
3400 3405 3410
aac tat tgt ttc cct ctg gat gct gtt ggc aaa aca gat act tat 19538
Asn Tyr Cys Phe Pro Leu Asp Ala Val Gly Lys Thr Asp Thr Tyr
3415 3420 3425
cag gga att aag gct aat gga act gat caa acc aca tgg acc aaa 19583
Gln Gly Ile Lys Ala Asn Gly Thr Asp Gln Thr Thr Trp Thr Lys
3430 3435 3440
gat gac agt gtc aat gat gct aat gag ata ggc aag ggt aat cca 19628
Asp Asp Ser Val Asn Asp Ala Asn Glu Ile Gly Lys Gly Asn Pro
3445 3450 3455
ttt gct atg gag atc aac atc caa gcc aac ctg tgg agg aac ttc 19673
Phe Ala Met Glu Ile Asn Ile Gln Ala Asn Leu Trp Arg Asn Phe
3460 3465 3470
ctc tac gcc aac gtg gcc ctg tac ctg ccc gat tct tac aag tac 19718
Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr
3475 3480 3485
acg ccg gcc aac gtc acc ctg ccc gcc aac acc aac acc tac gat 19763
Thr Pro Ala Asn Val Thr Leu Pro Ala Asn Thr Asn Thr Tyr Asp
3490 3495 3500
tac atg aac ggc cgg gtg gtg gcg ccc tcg ctg gtg gac gcc tat 19808
Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp Ala Tyr
3505 3510 3515
atc aac atc ggg gcg cgc tgg tcg ctg gac ccc atg gac aac gtc 19853
Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn Val
3520 3525 3530
aat ccc ttc aac cac cac cgc aac gcg ggg ctg cgc tac cgc tcc 19898
Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser
3535 3540 3545
atg ctc ctg ggc aac ggg cgc tac gtg ccc ttc cac atc cag gtg 19943
Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val
3550 3555 3560
ccc cag aaa ttt ttc gcc atc aag agc ctc ctg ctc ctg ccc ggg 19988
Pro Gln Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu Leu Pro Gly
3565 3570 3575
tcc tac acc tac gag tgg aac ttc cgc aag gac gtc aac atg atc 20033
Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile
3580 3585 3590
ctg cag agc tcc ctc ggc aac gac ctg cgc acg gac ggg gcc tcc 20078
Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser
3595 3600 3605
atc tcc ttc acc agc atc aac ctc tac gcc acc ttc ttc ccc atg 20123
Ile Ser Phe Thr Ser Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met
3610 3615 3620
gcg cac aac acg gcc tcc acg ctc gag gcc atg ctg cgc aac gac 20168
Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp
3625 3630 3635
acc aac gac cag tcc ttc aac gac tac ctc tcg gcg gcc aac atg 20213
Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met
3640 3645 3650
ctc tac ccc atc ccg gcc aac gcc acc aac gtg ccc atc tcc atc 20258
Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser Ile
3655 3660 3665
ccc tcg cgc aac tgg gcc gcc ttc cgc ggc tgg tcc ttc acg cgt 20303
Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg
3670 3675 3680
ctc aag acc aag gag acg ccc tcg ctg ggc tcc ggg ttc gac ccc 20348
Leu Lys Thr Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro
3685 3690 3695
tac ttc gtc tac tcg ggc tcc atc ccc tac ctc gac ggc acc ttc 20393
Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe
3700 3705 3710
tac ctc aac cac acc ttc aag aag gtc tcc atc acc ttc gac tcc 20438
Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Thr Phe Asp Ser
3715 3720 3725
tcc gtc agc tgg ccc ggc aac gac cgg ctc ctg acg ccc aac gag 20483
Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu
3730 3735 3740
ttc gaa atc aag cgc acc gtc gac ggc gag gga tac aac gtg gcc 20528
Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn Val Ala
3745 3750 3755
cag tgc aac atg acc aag gac tgg ttc ctg gtc cag atg ctg gcc 20573
Gln Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gln Met Leu Ala
3760 3765 3770
cac tac aac atc ggc tac cag ggc ttc tac gtg ccc gag ggc tac 20618
His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr
3775 3780 3785
aag gac cgc atg tac tcc ttc ttc cgc aac ttc cag ccc atg agc 20663
Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser
3790 3795 3800
cgc cag gtg gtg gac gag gtc aac tac aag gac tac cag gcc gtc 20708
Arg Gln Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gln Ala Val
3805 3810 3815
acc cta gcc tac cag cac aac aac tcg ggc ttc gtc ggc tac ctc 20753
Thr Leu Ala Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr Leu
3820 3825 3830
gcg ccc acc atg cgc cag gga cag ccc tac ccc gcc aac tac ccc 20798
Ala Pro Thr Met Arg Gln Gly Gln Pro Tyr Pro Ala Asn Tyr Pro
3835 3840 3845
tac ccg ctc atc ggc aag agc gcc gtc gcc agc gtc acc cag aaa 20843
Tyr Pro Leu Ile Gly Lys Ser Ala Val Ala Ser Val Thr Gln Lys
3850 3855 3860
aag ttc ctc tgc gac cgg gtc atg tgg cgc atc ccc ttc tcc agc 20888
Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser Ser
3865 3870 3875
aac ttc atg tcc atg ggc gcg ctc acc gac ctc ggc cag aac atg 20933
Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn Met
3880 3885 3890
ctc tac gcc aac tcc gcc cac gcg cta gac atg aat ttc gaa gtc 20978
Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe Glu Val
3895 3900 3905
gac ccc atg gat gag tcc acc ctt ctc tat gtt gtc ttc gaa gtc 21023
Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe Glu Val
3910 3915 3920
ttc gac gtc gtc cga gtg cac cag ccc cac cgc ggc gtc atc gag 21068
Phe Asp Val Val Arg Val His Gln Pro His Arg Gly Val Ile Glu
3925 3930 3935
gcc gtc tac ctg cgc acg ccc ttc tcg gcc ggc aac gcc acc acc 21113
Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
3940 3945 3950
taagcctctt gcttcttgca ag atg acg gcc tgt gcg ggc tcc ggc gag 21162
Met Thr Ala Cys Ala Gly Ser Gly Glu
3955 3960
cag gag ctc agg gcc atc ctc cgc gac ctg ggc tgc ggg ccc tgc 21207
Gln Glu Leu Arg Ala Ile Leu Arg Asp Leu Gly Cys Gly Pro Cys
3965 3970 3975
ttc ctg ggc acc ttc gac aag cgc ttc ccg gga ttc atg gcc ccg 21252
Phe Leu Gly Thr Phe Asp Lys Arg Phe Pro Gly Phe Met Ala Pro
3980 3985 3990
cac aag ctg gcc tgc gcc atc gtc aac acg gcc ggc cgc gag acc 21297
His Lys Leu Ala Cys Ala Ile Val Asn Thr Ala Gly Arg Glu Thr
3995 4000 4005
ggg ggc gag cac tgg ctg gcc ttc gcc tgg aac ccg cgc acc cac 21342
Gly Gly Glu His Trp Leu Ala Phe Ala Trp Asn Pro Arg Thr His
4010 4015 4020
acc tgc tac ctc ttc gac ccc ttc ggg ttc tca gac gag cgc ctc 21387
Thr Cys Tyr Leu Phe Asp Pro Phe Gly Phe Ser Asp Glu Arg Leu
4025 4030 4035
aag cag atc tac cag ttc gag tac gag ggc ctg ctg cgc cgc agc 21432
Lys Gln Ile Tyr Gln Phe Glu Tyr Glu Gly Leu Leu Arg Arg Ser
4040 4045 4050
gcc ctg gcc acc gag gac cgc tgc gtc acc ctg gaa aag tcc acc 21477
Ala Leu Ala Thr Glu Asp Arg Cys Val Thr Leu Glu Lys Ser Thr
4055 4060 4065
cag acc gtg cag ggt ccg cgc tcg gcc gcc tgc ggg ctc ttc tgc 21522
Gln Thr Val Gln Gly Pro Arg Ser Ala Ala Cys Gly Leu Phe Cys
4070 4075 4080
tgc atg ttc ctg cac gcc ttc gtg cac tgg ccc gac cgc ccc atg 21567
Cys Met Phe Leu His Ala Phe Val His Trp Pro Asp Arg Pro Met
4085 4090 4095
gac aag aac ccc acc atg aac ttg ctg acg ggg gtg ccc aac ggc 21612
Asp Lys Asn Pro Thr Met Asn Leu Leu Thr Gly Val Pro Asn Gly
4100 4105 4110
atg ctc cag tcg ccc cag gtg gaa acc acc ctg cgc cgc aac cag 21657
Met Leu Gln Ser Pro Gln Val Glu Thr Thr Leu Arg Arg Asn Gln
4115 4120 4125
gag gcg ctc tac cgc ttc ctc aac gcc cac tcc gcc tac ttt cgc 21702
Glu Ala Leu Tyr Arg Phe Leu Asn Ala His Ser Ala Tyr Phe Arg
4130 4135 4140
tcc cac cgc gcg cgc atc gag aag gcc acc gcc ttc gac cgc atg 21747
Ser His Arg Ala Arg Ile Glu Lys Ala Thr Ala Phe Asp Arg Met
4145 4150 4155
aat caa gac atg taaaaaaccg gtgtgtgtat gtgaatgctt tattaataaa 21799
Asn Gln Asp Met
cagcacatgt ttatgccacc ttctctgagg ctctgacttt atttagaaat cgaaggggtt 21859
ctgccggctc tcggcgtgcc ccgcgggcag ggatacgttg cggaactggt acttgggcag 21919
ccacttgaac tcggggatca gcagcttggg cacggggagg tcggggaacg agtcgctcca 21979
cagcttgcgc gtgagttgca gggcgcccag caggtcgggc gcggagatct tgaaatcgca 22039
gttgggaccc gcgttctgcg cgcgagagtt acggtacacg gggttgcagc actggaacac 22099
catcagggcc gggtgcttca cgctcgccag caccgtcgcg tcggtgatgc cctccacgtc 22159
cagatcctcg gcgttggtca tcccgaaggg ggtcatcttg caggtctgcc gccccatgct 22219
gggcacgcag ccgggcttgt ggttgcaatc gcagtgcagg gggatcagca tcatctgggc 22279
ctgctcggag ctcatgcccg ggtacatggc cttcatgaaa gcctccagct ggcggaaggc 22339
ctgctgcgcc ttgccgccct cggtgaagaa gaccccgcag gacttgctag agaactggtt 22399
ggtggcgcag ccggcgtcgt gcacgcagca gcgcgcgtcg ttgttggcca gctgcaccac 22459
gctgcgtccc cagcggttct gggtgatctt ggcccggtcg gggttctcct tcagcgcgcg 22519
ctgcccgttc tcgctcgcca catccatctc gatcgtgtgc tccttctgga tcatcacggt 22579
cccgtgcagg caccgcagct tgccctcggc ctcggtgcag ccgtgcagcc acagcgcgca 22639
gccggtgctc tcccagttct tgtgggcgat ctgggagtgc gagtgcacga agccctgcag 22699
gaagcggccc atcatcgtgg tcagggtctt gttgctggtg aaggtcagcg ggatgccgcg 22759
gtgctcctcg ttcacataca ggtggcagat gcggcggtac acctcgccct gctcgggcat 22819
cagctggaag gcggacttca ggtcgctctc cacgcggtac cggtccatca gcagcgtcat 22879
cacttccatg cccttctccc aggccgagac gatcggcagg ctcagggggt tcttcaccgt 22939
tgtcatctta gtcgccgccg ccgaggtcag ggggtcgttc tcgtccaggg tctcaaacac 22999
tcgcttgccg tccttctcgg tgatgcgcac gggggggaag gcgaagccca cggccgccag 23059
ctcctcctcg gcctgccttt cgtcctcgct gtcctggctg atgtcttgca aaggcacatg 23119
cttggtcttg cggggtttct ttttgggcgg cagaggcggc ggcgatgtgc tgggcgagcg 23179
cgagttctcg ctcaccacga ctatttcttc tccttggccg tcgtccgaga ccacgcggcg 23239
gtaggcatgc ctcttctggg gcagaggcgg aggcgacggg ctctcgcggt tcggcgggcg 23299
gctggcagag ccccttccgc gttcgggggt gcgctcctgg cggcgctgct ctgactgact 23359
tcctccgcgg ccggccattg tgttctccta gggagcaagc atg gag act cag cca 23414
Met Glu Thr Gln Pro
4160
tcg tcg cca aca tcg cca tct gcc ccc gcc gcc gcc gac gag aac 23459
Ser Ser Pro Thr Ser Pro Ser Ala Pro Ala Ala Ala Asp Glu Asn
4165 4170 4175
cag cag cag cag aat gaa agc tta acc gcc ccg ccg ccc agc ccc 23504
Gln Gln Gln Gln Asn Glu Ser Leu Thr Ala Pro Pro Pro Ser Pro
4180 4185 4190
acc tcc gac gcg gcc cca gac atg caa gag atg gag gaa tcc atc 23549
Thr Ser Asp Ala Ala Pro Asp Met Gln Glu Met Glu Glu Ser Ile
4195 4200 4205
gag att gac ctg ggc tac gtg acg ccc gcg gag cac gag gag gag 23594
Glu Ile Asp Leu Gly Tyr Val Thr Pro Ala Glu His Glu Glu Glu
4210 4215 4220
ctg gca gcg cgc ttt tca gcc ccg gaa gag aac cac caa gag cag 23639
Leu Ala Ala Arg Phe Ser Ala Pro Glu Glu Asn His Gln Glu Gln
4225 4230 4235
cca gag cag gaa gca gag agc gag cag cag cag gct ggg ctc gag 23684
Pro Glu Gln Glu Ala Glu Ser Glu Gln Gln Gln Ala Gly Leu Glu
4240 4245 4250
cat ggc gac tac ctg agc ggg gca gag gac gtg ctc atc aag cat 23729
His Gly Asp Tyr Leu Ser Gly Ala Glu Asp Val Leu Ile Lys His
4255 4260 4265
ctg gcc cgc caa tgc atc atc gtc aag gac gcg ctg ctc gac cgc 23774
Leu Ala Arg Gln Cys Ile Ile Val Lys Asp Ala Leu Leu Asp Arg
4270 4275 4280
gcc gag gtg ccc ctc agc gtg gcg gag ctc agc cgc gcc tac gag 23819
Ala Glu Val Pro Leu Ser Val Ala Glu Leu Ser Arg Ala Tyr Glu
4285 4290 4295
cgc aac ctc ttc tcg ccg cgc gtg ccc ccc aag cgc cag ccc aac 23864
Arg Asn Leu Phe Ser Pro Arg Val Pro Pro Lys Arg Gln Pro Asn
4300 4305 4310
ggc acc tgc gag ccc aac ccg cgc ctc aac ttc tac ccg gtc ttc 23909
Gly Thr Cys Glu Pro Asn Pro Arg Leu Asn Phe Tyr Pro Val Phe
4315 4320 4325
gcg gtg ccc gag gcc ctg gcc acc tac cac ctc ttt ttc aag aac 23954
Ala Val Pro Glu Ala Leu Ala Thr Tyr His Leu Phe Phe Lys Asn
4330 4335 4340
caa agg atc ccc gtc tcc tgc cgc gcc aac cgc acc cgc gcc gac 23999
Gln Arg Ile Pro Val Ser Cys Arg Ala Asn Arg Thr Arg Ala Asp
4345 4350 4355
gcc ctg ctc aac ctg ggc ccc ggc gcc cgc cta cct gat atc gcc 24044
Ala Leu Leu Asn Leu Gly Pro Gly Ala Arg Leu Pro Asp Ile Ala
4360 4365 4370
tcc ttg gaa gag gtt ccc aag atc ttc gag ggt ctg ggc agc gac 24089
Ser Leu Glu Glu Val Pro Lys Ile Phe Glu Gly Leu Gly Ser Asp
4375 4380 4385
gag act cgg gcc gcg aac gct ctg caa gga agc gga gag gag cat 24134
Glu Thr Arg Ala Ala Asn Ala Leu Gln Gly Ser Gly Glu Glu His
4390 4395 4400
gag cac cac agc gcc ctg gtg gag ttg gaa ggc gac aac gcg cgc 24179
Glu His His Ser Ala Leu Val Glu Leu Glu Gly Asp Asn Ala Arg
4405 4410 4415
ctg gcg gtc ctc aag cgc acg gtc gag ctg acc cac ttc gcc tac 24224
Leu Ala Val Leu Lys Arg Thr Val Glu Leu Thr His Phe Ala Tyr
4420 4425 4430
ccg gcg ctc aac ctg ccc ccc aag gtc atg agc gcc gtc atg gac 24269
Pro Ala Leu Asn Leu Pro Pro Lys Val Met Ser Ala Val Met Asp
4435 4440 4445
cag gtg ctc atc aag cgc gcc tcg ccc ctc tcg gag gag gag atg 24314
Gln Val Leu Ile Lys Arg Ala Ser Pro Leu Ser Glu Glu Glu Met
4450 4455 4460
cag gac ccc gag agc tcg gac gag ggc aag ccc gtg gtc agc gac 24359
Gln Asp Pro Glu Ser Ser Asp Glu Gly Lys Pro Val Val Ser Asp
4465 4470 4475
gag cag ctg gcg cgc tgg ctg gga gcg agt agc acc ccc cag agc 24404
Glu Gln Leu Ala Arg Trp Leu Gly Ala Ser Ser Thr Pro Gln Ser
4480 4485 4490
ctg gaa gag cgg cgc aag ctc atg atg gcc gtg gtc ctg gtg acc 24449
Leu Glu Glu Arg Arg Lys Leu Met Met Ala Val Val Leu Val Thr
4495 4500 4505
gtg gag ctg gag tgt ctg cgc cgc ttc ttc gcc gac gcg gag acc 24494
Val Glu Leu Glu Cys Leu Arg Arg Phe Phe Ala Asp Ala Glu Thr
4510 4515 4520
ctg cgc aag gtc gag gaa aac ctg cac tac ctc ttc agg cac ggg 24539
Leu Arg Lys Val Glu Glu Asn Leu His Tyr Leu Phe Arg His Gly
4525 4530 4535
ttc gtg cgc cag gcc tgc aag atc tcc aac gtg gag ctg acc aac 24584
Phe Val Arg Gln Ala Cys Lys Ile Ser Asn Val Glu Leu Thr Asn
4540 4545 4550
ctg gtc tcc tac atg ggc atc ctg cac gag aac cgc ctg ggg cag 24629
Leu Val Ser Tyr Met Gly Ile Leu His Glu Asn Arg Leu Gly Gln
4555 4560 4565
aac gtg ctg cac acc acc ctg cgc ggg gag gcc cgc cgc gac tac 24674
Asn Val Leu His Thr Thr Leu Arg Gly Glu Ala Arg Arg Asp Tyr
4570 4575 4580
atc cgc gac tgc gtc tac ctg tac ctc tgc cac acc tgg cag acg 24719
Ile Arg Asp Cys Val Tyr Leu Tyr Leu Cys His Thr Trp Gln Thr
4585 4590 4595
ggc atg ggc gtg tgg cag cag tgc ctg gag gag cag aac ctg aaa 24764
Gly Met Gly Val Trp Gln Gln Cys Leu Glu Glu Gln Asn Leu Lys
4600 4605 4610
gag ctc tgc aag ctc ctg cag aag aac ctc aag gcc ctg tgg acc 24809
Glu Leu Cys Lys Leu Leu Gln Lys Asn Leu Lys Ala Leu Trp Thr
4615 4620 4625
ggg ttc gac gag cgc acc acc gcc tcg gac ctg gcc gac ctc atc 24854
Gly Phe Asp Glu Arg Thr Thr Ala Ser Asp Leu Ala Asp Leu Ile
4630 4635 4640
ttc ccc gag cgc ctg cgg ctg acg ctg cgc aac ggt ctg ccc gac 24899
Phe Pro Glu Arg Leu Arg Leu Thr Leu Arg Asn Gly Leu Pro Asp
4645 4650 4655
ttt atg agc caa agc atg ttg caa aac ttt cgc tct ttc atc ctc 24944
Phe Met Ser Gln Ser Met Leu Gln Asn Phe Arg Ser Phe Ile Leu
4660 4665 4670
gaa cgc tcc ggg atc ctg ccc gcc acc tgc tcc gcg ctg ccc tcg 24989
Glu Arg Ser Gly Ile Leu Pro Ala Thr Cys Ser Ala Leu Pro Ser
4675 4680 4685
gac ttc gtg ccg ctg acc ttc cgc gag tgc ccc ccg ccg ctc tgg 25034
Asp Phe Val Pro Leu Thr Phe Arg Glu Cys Pro Pro Pro Leu Trp
4690 4695 4700
agc cac tgc tac ttg ctg cgc ctg gcc aac tac ctg gcc tac cac 25079
Ser His Cys Tyr Leu Leu Arg Leu Ala Asn Tyr Leu Ala Tyr His
4705 4710 4715
tcg gac gtg atc gag gac gtc agc ggc gag ggt ctg ctc gag tgc 25124
Ser Asp Val Ile Glu Asp Val Ser Gly Glu Gly Leu Leu Glu Cys
4720 4725 4730
cac tgc cgc tgc aac ctc tgc acg ccg cac cgc tcc ctg gcc tgc 25169
His Cys Arg Cys Asn Leu Cys Thr Pro His Arg Ser Leu Ala Cys
4735 4740 4745
aac ccc cag ctg ctg agc gag acc cag atc atc ggc acc ttc gag 25214
Asn Pro Gln Leu Leu Ser Glu Thr Gln Ile Ile Gly Thr Phe Glu
4750 4755 4760
ttg caa ggc ccc ggc gag ggc aag ggg ggt ctg aaa ctc acc ccg 25259
Leu Gln Gly Pro Gly Glu Gly Lys Gly Gly Leu Lys Leu Thr Pro
4765 4770 4775
ggg ctg tgg acc tcg gcc tac ttg cgc aag ttc gtg ccc gag gac 25304
Gly Leu Trp Thr Ser Ala Tyr Leu Arg Lys Phe Val Pro Glu Asp
4780 4785 4790
tac cat ccc ttc gag atc agg ttc tac gag gac caa tcc cag ccg 25349
Tyr His Pro Phe Glu Ile Arg Phe Tyr Glu Asp Gln Ser Gln Pro
4795 4800 4805
ccc aag gcc gag ctg tcg gcc tgc gtc atc acc cag ggg gcc atc 25394
Pro Lys Ala Glu Leu Ser Ala Cys Val Ile Thr Gln Gly Ala Ile
4810 4815 4820
ctg gcc caa ttg caa gcc atc cag aaa tcc cgc caa gaa ttt ctg 25439
Leu Ala Gln Leu Gln Ala Ile Gln Lys Ser Arg Gln Glu Phe Leu
4825 4830 4835
ctg aaa aag ggc cac ggg gtc tac ctg gac ccc cag acc gga gag 25484
Leu Lys Lys Gly His Gly Val Tyr Leu Asp Pro Gln Thr Gly Glu
4840 4845 4850
gag ctc aac ccc agc ttc ccc cag gat gcc ccg agg aag cag caa 25529
Glu Leu Asn Pro Ser Phe Pro Gln Asp Ala Pro Arg Lys Gln Gln
4855 4860 4865
gaa gct gaa agt gga gct gcc gct gcc gcc gga gga ttt gga gga 25574
Glu Ala Glu Ser Gly Ala Ala Ala Ala Ala Gly Gly Phe Gly Gly
4870 4875 4880
aga ctg gga gag cag tca ggc aga gga gat gga aga ctg gga cag 25619
Arg Leu Gly Glu Gln Ser Gly Arg Gly Asp Gly Arg Leu Gly Gln
4885 4890 4895
cac tca ggc aga gga gga cag cct gca aga cag tct gga gga gga 25664
His Ser Gly Arg Gly Gly Gln Pro Ala Arg Gln Ser Gly Gly Gly
4900 4905 4910
aga cga ggt gga gga gga ggc aga gga aga agc agc cgc cgc cag 25709
Arg Arg Gly Gly Gly Gly Gly Arg Gly Arg Ser Ser Arg Arg Gln
4915 4920 4925
acc gtc gtc ctc ggc gga gaa agc aag cag cac gga tac cat ctc 25754
Thr Val Val Leu Gly Gly Glu Ser Lys Gln His Gly Tyr His Leu
4930 4935 4940
cgc tcc ggg tcg ggg tcg cgg cgg ccg ggc cca cag tagatgggac 25800
Arg Ser Gly Ser Gly Ser Arg Arg Pro Gly Pro Gln
4945 4950 4955
gagaccgggc gcttcccaaa ccccaccacc cagaccggta agaaggagcg gcagggatac 25860
aagtcctggc gggggcacaa aaacgccatc gtctcctgct tgcaagcctg cgggggcaac 25920
atctccttca cccggcgcta cctgctcttc caccgcgggg tgaacttccc ccgcaacatc 25980
ttgcattact accgtcacct ccacagcccc tactactgtt tccaagaaga ggcagaaacc 26040
cagcagcagc agaaaaccag cagcagcagc agcagcagct agaaaatcca cagcggcggc 26100
ggcaggtgga ctgaggatcg cggcgaacga gccggcgcag acccgggagc tgaggaaccg 26160
gatctttccc accctctatg ccatcttcca gcagagtcgg gggcaggagc aggaactgaa 26220
agtcaagaac cgttctctgc gctcgctcac ccgcagttgt ctgtatcaca agagcgaaga 26280
ccaacttcag cgcactctcg aggacgccga ggctctcttc aacaagtact gcgcgctcac 26340
tcttaaagag tagcccgcgc ccgcccacac acggaaaaag gcgggaatta cgtcaccacc 26400
tgcgcccttc gcccgaccat catc atg agc aaa gag att ccc acg cct tac 26451
Met Ser Lys Glu Ile Pro Thr Pro Tyr
4960 4965
atg tgg agc tac cag ccc cag atg ggc ctg gcc gcc ggc gcc gcc 26496
Met Trp Ser Tyr Gln Pro Gln Met Gly Leu Ala Ala Gly Ala Ala
4970 4975 4980
cag gac tac tcc acc cgc atg aac tgg ctc agt gcc ggg ccc gcg 26541
Gln Asp Tyr Ser Thr Arg Met Asn Trp Leu Ser Ala Gly Pro Ala
4985 4990 4995
atg atc tca cgg gtg aat gac atc cgc gcc cac cga aac cag ata 26586
Met Ile Ser Arg Val Asn Asp Ile Arg Ala His Arg Asn Gln Ile
5000 5005 5010
ctc cta gaa cag tca gca atc acc gcc acg ccc cgc cat cac ctt 26631
Leu Leu Glu Gln Ser Ala Ile Thr Ala Thr Pro Arg His His Leu
5015 5020 5025
aat ccg cgt aat tgg ccc gcc gcc ctg gtg tac cag gaa att ccc 26676
Asn Pro Arg Asn Trp Pro Ala Ala Leu Val Tyr Gln Glu Ile Pro
5030 5035 5040
cag ccc acg acc gta cta ctt ccg cga gac gcc cag gcc gaa gtc 26721
Gln Pro Thr Thr Val Leu Leu Pro Arg Asp Ala Gln Ala Glu Val
5045 5050 5055
cag ctg act aac tca ggt gtc cag ctg gcc ggc ggc gcc gcc ctg 26766
Gln Leu Thr Asn Ser Gly Val Gln Leu Ala Gly Gly Ala Ala Leu
5060 5065 5070
tgt cgt cac cgc ccc gct cag ggt ata aag cgg ctg gtg atc cga 26811
Cys Arg His Arg Pro Ala Gln Gly Ile Lys Arg Leu Val Ile Arg
5075 5080 5085
ggc aga ggc aca cag ctc aac gac gag gtg gtg agc tct tcg ctg 26856
Gly Arg Gly Thr Gln Leu Asn Asp Glu Val Val Ser Ser Ser Leu
5090 5095 5100
ggt ctg cga cct gac gga gtc ttc caa ctc gcc gga tcg ggg aga 26901
Gly Leu Arg Pro Asp Gly Val Phe Gln Leu Ala Gly Ser Gly Arg
5105 5110 5115
tct tcc ttc acg cct cgt cag gcc gtc ctg act ttg gag agt tcg 26946
Ser Ser Phe Thr Pro Arg Gln Ala Val Leu Thr Leu Glu Ser Ser
5120 5125 5130
tcc tcg cag ccc cgc tcg ggt ggc atc ggc act ctc cag ttc gtg 26991
Ser Ser Gln Pro Arg Ser Gly Gly Ile Gly Thr Leu Gln Phe Val
5135 5140 5145
gag gag ttc act ccc tcg gtc tac ttc aac ccc ttc tcc ggc tcc 27036
Glu Glu Phe Thr Pro Ser Val Tyr Phe Asn Pro Phe Ser Gly Ser
5150 5155 5160
ccc ggc cac tac ccg gac gag ttc atc ccg aac ttc gac gcc atc 27081
Pro Gly His Tyr Pro Asp Glu Phe Ile Pro Asn Phe Asp Ala Ile
5165 5170 5175
agc gag tcg gtg gac ggc tac gat tga atg tcc cat ggt ggc gcg 27126
Ser Glu Ser Val Asp Gly Tyr Asp Met Ser His Gly Gly Ala
5180 5185
gct gac cta gct cgg ctt cga cac ctg gac cac tgc cgc cgc ttc 27171
Ala Asp Leu Ala Arg Leu Arg His Leu Asp His Cys Arg Arg Phe
5190 5195 5200
cgc tgc ttc gct cgg gat ctc gcc gag ttt gcc tac ttt gag ctg 27216
Arg Cys Phe Ala Arg Asp Leu Ala Glu Phe Ala Tyr Phe Glu Leu
5205 5210 5215
ccc gag gag cac cct cag ggc ccg gcc cac gga gtg cgg atc gtc 27261
Pro Glu Glu His Pro Gln Gly Pro Ala His Gly Val Arg Ile Val
5220 5225 5230
gtc gaa ggg ggc ctc gac tcc cac ctg ctt cgg atc ttc agc cag 27306
Val Glu Gly Gly Leu Asp Ser His Leu Leu Arg Ile Phe Ser Gln
5235 5240 5245
cga ccg atc ctg gtc gag cgc gag caa gga cag acc ctt ctg acc 27351
Arg Pro Ile Leu Val Glu Arg Glu Gln Gly Gln Thr Leu Leu Thr
5250 5255 5260
ctg tac tgc atc tgc aac cgc ccc ggc ctg cat gaa agt ctt tgt 27396
Leu Tyr Cys Ile Cys Asn Arg Pro Gly Leu His Glu Ser Leu Cys
5265 5270 5275
tgt ctg ctg tgt act gag tat aat aaa agc tgaaatcagc gactactccg 27446
Cys Leu Leu Cys Thr Glu Tyr Asn Lys Ser
5280 5285
gactcgattg tggtgttcct gctatcaacc ggtccctgtt cttcaccggg aacgagaccg 27506
agcttcagct ccagtgtaag ccccacaaga agtacctcac ctggctgttc cagggctctc 27566
cgatcgccgt tgtcaaccac tgcgacaacg acggagtcct gctgagcggc cctgccaacc 27626
ttactttttc cacccgcaga agcaagctcc agctcttcca acccttcctc cccgggacct 27686
atcagtgcgt ctcgggaccc tgccatcaca ccttccacct gatcccgaat accacagcgt 27746
cgctccccgc tactaacaac caatctaccc accaacgcca ccgtcgcgac ctttcctctg 27806
aatctaatac taccacccac accggaggtg agctccgagg tcgaccaacc tctgggattt 27866
actacggccc ctgggaggtg gtggggttaa tagcgctagg cctagttgtg ggtgggcttt 27926
tggctctctg ctacctatac ctcccttgct gttcgtactt agtggtgctg tgttgctggt 27986
ttaagaa atg ggg aag atc acc cta gtg agc tgc ggt gtg ccg gtg 28032
Met Gly Lys Ile Thr Leu Val Ser Cys Gly Val Pro Val
5290 5295 5300
gcg gtg gtg gtg ctt tcg att gtg gga ctg ggc ggc gcg gct gta 28077
Ala Val Val Val Leu Ser Ile Val Gly Leu Gly Gly Ala Ala Val
5305 5310 5315
gtg aag gag gag aag gcc gat tcc tgc ttg cat ttc aat ccc gac 28122
Val Lys Glu Glu Lys Ala Asp Ser Cys Leu His Phe Asn Pro Asp
5320 5325 5330
aaa tgc cag ctg agt ttt cag ccc gat ggc aat cgg tgc acg gtg 28167
Lys Cys Gln Leu Ser Phe Gln Pro Asp Gly Asn Arg Cys Thr Val
5335 5340 5345
ctg atc aag tgc gga tgg gaa tgc gag aac gtg aga atc gag tac 28212
Leu Ile Lys Cys Gly Trp Glu Cys Glu Asn Val Arg Ile Glu Tyr
5350 5355 5360
aat aac aag act cgg aac aat act ctc gcg tcc gtg tgg cag ccc 28257
Asn Asn Lys Thr Arg Asn Asn Thr Leu Ala Ser Val Trp Gln Pro
5365 5370 5375
ggg gac ccc gag tgg tac acc gtc tct gtc ccc ggt gct gac ggc 28302
Gly Asp Pro Glu Trp Tyr Thr Val Ser Val Pro Gly Ala Asp Gly
5380 5385 5390
tcc ccg cgc acc gtg aat aat act ttc att ttt gca cac atg tgc 28347
Ser Pro Arg Thr Val Asn Asn Thr Phe Ile Phe Ala His Met Cys
5395 5400 5405
gac acg gtc atg tgg atg agc aag cag tac gat atg tgg ccc ccc 28392
Asp Thr Val Met Trp Met Ser Lys Gln Tyr Asp Met Trp Pro Pro
5410 5415 5420
acg aag gag aac atc gtg gtc ttc tcc atc gct tac agc ctg tgc 28437
Thr Lys Glu Asn Ile Val Val Phe Ser Ile Ala Tyr Ser Leu Cys
5425 5430 5435
acg gcg cta atc acc gct atc gtg tgc ctg agc att cac atg ctc 28482
Thr Ala Leu Ile Thr Ala Ile Val Cys Leu Ser Ile His Met Leu
5440 5445 5450
atc gct att cgc ccc aga aat aat gcc gag aaa gag aaa cag cca 28527
Ile Ala Ile Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys Gln Pro
5455 5460 5465
taacacgttt ttttacacac ctttttcaga cc atg gcc tct gtt act gcc cta 28580
Met Ala Ser Val Thr Ala Leu
5470
att att ttt ttg ggc ctt gtg ggc act agc agc act ttt gat cat 28625
Ile Ile Phe Leu Gly Leu Val Gly Thr Ser Ser Thr Phe Asp His
5475 5480 5485
aaa aac tta act gtt ttt gtt ggt tct gat gtt aca cta ccc ggg 28670
Lys Asn Leu Thr Val Phe Val Gly Ser Asp Val Thr Leu Pro Gly
5490 5495 5500
cat caa tcg cac cag agg gtt tca tgg tat cat ttt aat aaa cag 28715
His Gln Ser His Gln Arg Val Ser Trp Tyr His Phe Asn Lys Gln
5505 5510 5515
aac aca gct tat aca ctt tgc aaa ggt cat cag caa gcc aca tat 28760
Asn Thr Ala Tyr Thr Leu Cys Lys Gly His Gln Gln Ala Thr Tyr
5520 5525 5530
cgc agt ggt ctt tat tac aga tgc aat aac aat aac ctc aca cta 28805
Arg Ser Gly Leu Tyr Tyr Arg Cys Asn Asn Asn Asn Leu Thr Leu
5535 5540 5545
ctc tca gtt aat gca aat tat tct ggc aca tac tat gga acc aat 28850
Leu Ser Val Asn Ala Asn Tyr Ser Gly Thr Tyr Tyr Gly Thr Asn
5550 5555 5560
ttt aac aca aaa cag gac act tac tat agt gtc aaa gta ttg aat 28895
Phe Asn Thr Lys Gln Asp Thr Tyr Tyr Ser Val Lys Val Leu Asn
5565 5570 5575
cca acc tct cct aga act acc act aag cct acc gct acc acc act 28940
Pro Thr Ser Pro Arg Thr Thr Thr Lys Pro Thr Ala Thr Thr Thr
5580 5585 5590
act act gca aag ccc act aaa cct aaa act acc aag aaa acc act 28985
Thr Thr Ala Lys Pro Thr Lys Pro Lys Thr Thr Lys Lys Thr Thr
5595 5600 5605
gtg aag act aca aca act aga acc acc aca act aca gag acc acc 29030
Val Lys Thr Thr Thr Thr Arg Thr Thr Thr Thr Thr Glu Thr Thr
5610 5615 5620
acc agc aca ctt gct gcc act aca cac aca cac att gag cta acc 29075
Thr Ser Thr Leu Ala Ala Thr Thr His Thr His Ile Glu Leu Thr
5625 5630 5635
tta cag acc act aat gat ttg atc gcc ctg ttg caa aag ggg gat 29120
Leu Gln Thr Thr Asn Asp Leu Ile Ala Leu Leu Gln Lys Gly Asp
5640 5645 5650
aac agc acc act tcc gat gag gaa ata ccc aaa tcc atg att ggc 29165
Asn Ser Thr Thr Ser Asp Glu Glu Ile Pro Lys Ser Met Ile Gly
5655 5660 5665
att att gtt gct gta gtg gtg tgc atg ttg atc atc gcc ttg tgc 29210
Ile Ile Val Ala Val Val Val Cys Met Leu Ile Ile Ala Leu Cys
5670 5675 5680
atg gtg tac tat gcc ttc tgc tac aga aag cac aga ctg aac gac 29255
Met Val Tyr Tyr Ala Phe Cys Tyr Arg Lys His Arg Leu Asn Asp
5685 5690 5695
aag ctg gaa cac tta cta agt gtt gaa ttt taatttttta gaacc atg 29303
Lys Leu Glu His Leu Leu Ser Val Glu Phe Met
5700 5705 5710
aag atc cta ggc ctt tta gtt ttt tct atc att acc tct gct ctt 29348
Lys Ile Leu Gly Leu Leu Val Phe Ser Ile Ile Thr Ser Ala Leu
5715 5720 5725
tgt gaa tcg gtg gat aaa gat gtt act att acc act ggt tct aac 29393
Cys Glu Ser Val Asp Lys Asp Val Thr Ile Thr Thr Gly Ser Asn
5730 5735 5740
tat aca ctg aaa ggg cca ccc tca ggt atg ctt tcg tgg tat tgc 29438
Tyr Thr Leu Lys Gly Pro Pro Ser Gly Met Leu Ser Trp Tyr Cys
5745 5750 5755
tat ttt gga act gac tct gaa caa act gag ctt tgc aat gca atg 29483
Tyr Phe Gly Thr Asp Ser Glu Gln Thr Glu Leu Cys Asn Ala Met
5760 5765 5770
aga ggc caa atg cca acc tca aaa att aaa cat aaa tgc aat ggt 29528
Arg Gly Gln Met Pro Thr Ser Lys Ile Lys His Lys Cys Asn Gly
5775 5780 5785
act gat ttg ata ctc ctc aat gtc acg aaa gca tat gct ggc agt 29573
Thr Asp Leu Ile Leu Leu Asn Val Thr Lys Ala Tyr Ala Gly Ser
5790 5795 5800
tac acc tgc cct gga gat gat gct gac agt atg att ttt tac aaa 29618
Tyr Thr Cys Pro Gly Asp Asp Ala Asp Ser Met Ile Phe Tyr Lys
5805 5810 5815
gta act gtt gtt gat ccc act act cca cca ccc acc acc aca act 29663
Val Thr Val Val Asp Pro Thr Thr Pro Pro Pro Thr Thr Thr Thr
5820 5825 5830
act cat acc aca cac aca gaa caa aca cca gag gca gca gga gag 29708
Thr His Thr Thr His Thr Glu Gln Thr Pro Glu Ala Ala Gly Glu
5835 5840 5845
tta gcc ttg cag gtt cag gaa gat tcc ctt atg gct aat acc cct 29753
Leu Ala Leu Gln Val Gln Glu Asp Ser Leu Met Ala Asn Thr Pro
5850 5855 5860
aca ccc gat cat cgg tgt ccg ggg ttg ctc gtc agc ggc att gtc 29798
Thr Pro Asp His Arg Cys Pro Gly Leu Leu Val Ser Gly Ile Val
5865 5870 5875
ggt gtg ctt tcg gga tta gca gtc ata atc atc tgc atg ttc att 29843
Gly Val Leu Ser Gly Leu Ala Val Ile Ile Ile Cys Met Phe Ile
5880 5885 5890
ttt gct tgc tgc tat aga agg ctt tac cga caa aaa tca gac cca 29888
Phe Ala Cys Cys Tyr Arg Arg Leu Tyr Arg Gln Lys Ser Asp Pro
5895 5900 5905
ctg ctg aac ctc tat gtt taattttttc cagagcc atg aag gca gtt aga 29938
Leu Leu Asn Leu Tyr Val Met Lys Ala Val Arg
5910 5915
gtt cta gtt ttt tgt tct ttg att ggc att gtt ttt agt gct ggg 29983
Val Leu Val Phe Cys Ser Leu Ile Gly Ile Val Phe Ser Ala Gly
5920 5925 5930
ttt ttg aaa aat ctt acc att tat gaa ggt gag aat gcc act cta 30028
Phe Leu Lys Asn Leu Thr Ile Tyr Glu Gly Glu Asn Ala Thr Leu
5935 5940 5945
gtg ggc atc agt ggt caa aat gtc agc tgg cta aaa tac cat cta 30073
Val Gly Ile Ser Gly Gln Asn Val Ser Trp Leu Lys Tyr His Leu
5950 5955 5960
gat agg tgg aaa gac att tgc gat tgg aat gtc act gtg tat aca 30118
Asp Arg Trp Lys Asp Ile Cys Asp Trp Asn Val Thr Val Tyr Thr
5965 5970 5975
tgt aat gga gtt aac ctc acc att act aat gcc acc caa gat caa 30163
Cys Asn Gly Val Asn Leu Thr Ile Thr Asn Ala Thr Gln Asp Gln
5980 5985 5990
aat ggt agg ttt aag ggt cag agt ttc act aga aat aat ggg tat 30208
Asn Gly Arg Phe Lys Gly Gln Ser Phe Thr Arg Asn Asn Gly Tyr
5995 6000 6005
gaa tcc cat aac atg ttt atc tat gac gtc act gtc atc aga aat 30253
Glu Ser His Asn Met Phe Ile Tyr Asp Val Thr Val Ile Arg Asn
6010 6015 6020
gag act gcc acc acc acc aca cag atg caa acc aca cag acg acc 30298
Glu Thr Ala Thr Thr Thr Thr Gln Met Gln Thr Thr Gln Thr Thr
6025 6030 6035
aca tac agt aca tca aat cag cct acc acc act aca gca gca gag 30343
Thr Tyr Ser Thr Ser Asn Gln Pro Thr Thr Thr Thr Ala Ala Glu
6040 6045 6050
gtt gcc agc tcg tct ggt gtc aaa gtg gca ttt ttg ttg ttg ccc 30388
Val Ala Ser Ser Ser Gly Val Lys Val Ala Phe Leu Leu Leu Pro
6055 6060 6065
cca tct agc agt ccc act gct agt acc aat gag cag act act gaa 30433
Pro Ser Ser Ser Pro Thr Ala Ser Thr Asn Glu Gln Thr Thr Glu
6070 6075 6080
ttt ttg tcc act gtc gag agc cac acc aca gct acc tcg agt gcc 30478
Phe Leu Ser Thr Val Glu Ser His Thr Thr Ala Thr Ser Ser Ala
6085 6090 6095
ttc tct agc acc gcc aat ctc tcc tcg ctt tcc tct aca cca atc 30523
Phe Ser Ser Thr Ala Asn Leu Ser Ser Leu Ser Ser Thr Pro Ile
6100 6105 6110
agt tcc gct act act act cct agc ccc gct cct ctt ccc act ccc 30568
Ser Ser Ala Thr Thr Thr Pro Ser Pro Ala Pro Leu Pro Thr Pro
6115 6120 6125
ctg aag caa act gag gac agc ggc atg caa tgg cag atc acc ctg 30613
Leu Lys Gln Thr Glu Asp Ser Gly Met Gln Trp Gln Ile Thr Leu
6130 6135 6140
ctc att gtg atc ggg ttg gtc atc ctg gcc gtg ttg ctc tac tac 30658
Leu Ile Val Ile Gly Leu Val Ile Leu Ala Val Leu Leu Tyr Tyr
6145 6150 6155
atc ttc tgc cgc cgc att ccc aac gcg cac cgc aag ccg gtc tac 30703
Ile Phe Cys Arg Arg Ile Pro Asn Ala His Arg Lys Pro Val Tyr
6160 6165 6170
aag ccc atc att gtc ggg cag cca gag ccg ctt cag gtg gaa ggg 30748
Lys Pro Ile Ile Val Gly Gln Pro Glu Pro Leu Gln Val Glu Gly
6175 6180 6185
ggt cta agg aat ctt ctc ttc tct ttt aca gta tgg tgattgaact atg 30797
Gly Leu Arg Asn Leu Leu Phe Ser Phe Thr Val Trp Met
6190 6195
att cct aga caa ttc ttg atc act att ctt atc tgc ctc ctc caa 30842
Ile Pro Arg Gln Phe Leu Ile Thr Ile Leu Ile Cys Leu Leu Gln
6200 6205 6210
gtc tgt gcc acc ctc gct ctg gtg gcc aac gcc agt cca gac tgt 30887
Val Cys Ala Thr Leu Ala Leu Val Ala Asn Ala Ser Pro Asp Cys
6215 6220 6225
att ggg ccc ttc gcc tcc tac gtg ctc ttt gcc ttc atc acc tgc 30932
Ile Gly Pro Phe Ala Ser Tyr Val Leu Phe Ala Phe Ile Thr Cys
6230 6235 6240
atc tgc tgc tgt agc ata gtc tgc ctg ctt atc acc ttc ttc cag 30977
Ile Cys Cys Cys Ser Ile Val Cys Leu Leu Ile Thr Phe Phe Gln
6245 6250 6255
ttc att gac tgg atc ttt gtg cgc atc gcc tac ctg cgc cac cac 31022
Phe Ile Asp Trp Ile Phe Val Arg Ile Ala Tyr Leu Arg His His
6260 6265 6270
ccc cag tac cgc gac cag cga gtg gcg cga ctg ctc agg ctc ctc 31067
Pro Gln Tyr Arg Asp Gln Arg Val Ala Arg Leu Leu Arg Leu Leu
6275 6280 6285
tgataagc atg cgg gct ctg cta ctt ctc gcg ctt ctg ctg tta gtg 31114
Met Arg Ala Leu Leu Leu Leu Ala Leu Leu Leu Leu Val
6290 6295 6300
ctc ccc cgt ccc gtc gac ccc cgg tcc ccc act cag tcc ccc gag 31159
Leu Pro Arg Pro Val Asp Pro Arg Ser Pro Thr Gln Ser Pro Glu
6305 6310 6315
gag gtc cgc aaa tgc aaa ttc caa gaa ccc tgg aaa ttc ctc aaa 31204
Glu Val Arg Lys Cys Lys Phe Gln Glu Pro Trp Lys Phe Leu Lys
6320 6325 6330
tgc tac cgc caa aaa tca gac atg cat ccc agc tgg atc atg atc 31249
Cys Tyr Arg Gln Lys Ser Asp Met His Pro Ser Trp Ile Met Ile
6335 6340 6345
att ggg atc gtg aac att ctg gcc tgc acc ctc atc tcc ttt gtg 31294
Ile Gly Ile Val Asn Ile Leu Ala Cys Thr Leu Ile Ser Phe Val
6350 6355 6360
att tac ccc tgc ttt gac ttt ggt tgg aac tcg cca gag gcg ctc 31339
Ile Tyr Pro Cys Phe Asp Phe Gly Trp Asn Ser Pro Glu Ala Leu
6365 6370 6375
tat ctc ccg cct gaa cct gac aca cca cca cag cag caa cct cag 31384
Tyr Leu Pro Pro Glu Pro Asp Thr Pro Pro Gln Gln Gln Pro Gln
6380 6385 6390
gca cac gca cta cca cca cca cag cct agg cca caa tac atg ccc 31429
Ala His Ala Leu Pro Pro Pro Gln Pro Arg Pro Gln Tyr Met Pro
6395 6400 6405
ata tta gac tat gag gcc gag cca cag cga ccc atg ctc ccc gct 31474
Ile Leu Asp Tyr Glu Ala Glu Pro Gln Arg Pro Met Leu Pro Ala
6410 6415 6420
att agt tac ttc aat cta acc ggc gga gat gac tgacccactg 31517
Ile Ser Tyr Phe Asn Leu Thr Gly Gly Asp Asp
6425 6430
gccaacaaca acgtcaacga ccttctcctg gacatggacg gccgcgcctc ggagcagcga 31577
ctcgcccaac ttcgcattcg ccagcagcag gagagagccg tcaaggagct gcaggacggc 31637
atagccatcc accagtgcaa gaaaggcatc ttctgcctgg tgaaacaggc caagatctcc 31697
tacgaggtca cccagaccga ccatcgcctc tcctacgagc tcctgcagca gcgccagaag 31757
ttcacctgcc tggtcggagt caaccccatc gtcatcaccc agcagtcggg cgataccaag 31817
gggtgcatcc actgctcctg cgactccccc gactgcgtcc acactctgat caagaccctc 31877
tgcggcctcc gcgacctcct ccccatgaac taatcacccc cttatccagt gaaataaaga 31937
tcatattgat gatgatttaa ataaaaaata atcatttgat ttgaaaataa agatacaatc 31997
atattgatga tttgagttta acaaaaataa agaatcactt acttgaaatc tgataccagg 32057
tctctgtcca tgttttctgc caacaccacc tcactcccct cttcccagct ctggtactgc 32117
aggccccggc gggctgcaaa cttcctccac acgctgaagg ggatgtcaaa ttcctcctgt 32177
ccctcaatct tcattttatc ttctatcag atg tcc aaa aag cgc gtc cgg gtg 32230
Met Ser Lys Lys Arg Val Arg Val
6435 6440
gat gat gac ttc gac ccc gtc tac ccc tac gat gca gac aac gca 32275
Asp Asp Asp Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala
6445 6450 6455
ccg acc gtg ccc ttc atc aac ccc ccc ttc gtc tct tca gat gga 32320
Pro Thr Val Pro Phe Ile Asn Pro Pro Phe Val Ser Ser Asp Gly
6460 6465 6470
ttc caa gag aag ccc ctg ggg gtg ctg tcc ctg cgc ctg gcc gac 32365
Phe Gln Glu Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp
6475 6480 6485
ccc gtc acc acc aag aac ggg gct gtc acc ctc aag ctg ggg gag 32410
Pro Val Thr Thr Lys Asn Gly Ala Val Thr Leu Lys Leu Gly Glu
6490 6495 6500
ggg gtg gac ctc gac gac tcg gga aaa ctc atc tcc aaa aat gcc 32455
Gly Val Asp Leu Asp Asp Ser Gly Lys Leu Ile Ser Lys Asn Ala
6505 6510 6515
acc aag gcc act gcc cct ctc agt att tcc aac agc acc att tcc 32500
Thr Lys Ala Thr Ala Pro Leu Ser Ile Ser Asn Ser Thr Ile Ser
6520 6525 6530
ctt aac atg gct gcc cct ttt tac aac aac aat gga acg tta agt 32545
Leu Asn Met Ala Ala Pro Phe Tyr Asn Asn Asn Gly Thr Leu Ser
6535 6540 6545
ctc aat gtt tct aca cca tta gca gta ttt ccc act ttt aac act 32590
Leu Asn Val Ser Thr Pro Leu Ala Val Phe Pro Thr Phe Asn Thr
6550 6555 6560
tta ggt atc agt ctt ggc aac ggt ctt caa act tct aat aag ttg 32635
Leu Gly Ile Ser Leu Gly Asn Gly Leu Gln Thr Ser Asn Lys Leu
6565 6570 6575
ctg act gta cag tta act cat cct ctt aca ttc agc tca aat agc 32680
Leu Thr Val Gln Leu Thr His Pro Leu Thr Phe Ser Ser Asn Ser
6580 6585 6590
atc aca gta aaa aca gac aaa gga ctc tat att aat tct agt gga 32725
Ile Thr Val Lys Thr Asp Lys Gly Leu Tyr Ile Asn Ser Ser Gly
6595 6600 6605
aac aga ggg ctt gag gct aac ata agc cta aaa aga gga ctg att 32770
Asn Arg Gly Leu Glu Ala Asn Ile Ser Leu Lys Arg Gly Leu Ile
6610 6615 6620
ttt gat ggt aat gct att gca aca tac ctt gga agt ggt tta gac 32815
Phe Asp Gly Asn Ala Ile Ala Thr Tyr Leu Gly Ser Gly Leu Asp
6625 6630 6635
tat gga tcc tat gat agc gat gga aaa aca aga ccc atc atc acc 32860
Tyr Gly Ser Tyr Asp Ser Asp Gly Lys Thr Arg Pro Ile Ile Thr
6640 6645 6650
aaa att gga gca ggt ttg aat ttt gat gct aat aaa gcc atg gct 32905
Lys Ile Gly Ala Gly Leu Asn Phe Asp Ala Asn Lys Ala Met Ala
6655 6660 6665
gtg aag cta ggc aca ggt tta agt ttt gac tct gcc ggt gcc tta 32950
Val Lys Leu Gly Thr Gly Leu Ser Phe Asp Ser Ala Gly Ala Leu
6670 6675 6680
aca gct gga aac aaa gag gat gac aag cta aca ctt tgg act aca 32995
Thr Ala Gly Asn Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr Thr
6685 6690 6695
cct gac ccc agc cct aat tgt caa tta ctt tca gac aga gat gcc 33040
Pro Asp Pro Ser Pro Asn Cys Gln Leu Leu Ser Asp Arg Asp Ala
6700 6705 6710
aaa ttt acc cta tgt ctt aca aaa tgc ggt agt caa ata cta ggc 33085
Lys Phe Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln Ile Leu Gly
6715 6720 6725
act gtt gca gta gct gct gtt act gta ggt tca gca cta aat cca 33130
Thr Val Ala Val Ala Ala Val Thr Val Gly Ser Ala Leu Asn Pro
6730 6735 6740
att aat gac aca gta aaa agc gcc ata gta ttc ctt aga ttt gat 33175
Ile Asn Asp Thr Val Lys Ser Ala Ile Val Phe Leu Arg Phe Asp
6745 6750 6755
tcc aat ggt gtg ctc atg tca aac tca tca atg gta ggc gat tac 33220
Ser Asn Gly Val Leu Met Ser Asn Ser Ser Met Val Gly Asp Tyr
6760 6765 6770
tgg aac ttt agg gaa gga cag acc acc caa agt gtg gcc tat aca 33265
Trp Asn Phe Arg Glu Gly Gln Thr Thr Gln Ser Val Ala Tyr Thr
6775 6780 6785
aat gct gtg gga ttt atg ccc aat cta ggt gca tat cct aaa acc 33310
Asn Ala Val Gly Phe Met Pro Asn Leu Gly Ala Tyr Pro Lys Thr
6790 6795 6800
caa agc aaa aca cca aaa aat agt ata gtt agt cag gta tat tta 33355
Gln Ser Lys Thr Pro Lys Asn Ser Ile Val Ser Gln Val Tyr Leu
6805 6810 6815
aat gga gaa acc act atg cca atg aca cta aca ata act ttc aat 33400
Asn Gly Glu Thr Thr Met Pro Met Thr Leu Thr Ile Thr Phe Asn
6820 6825 6830
ggc act gat gaa aaa gat aca aca cct gtc agc act tac tct atg 33445
Gly Thr Asp Glu Lys Asp Thr Thr Pro Val Ser Thr Tyr Ser Met
6835 6840 6845
act ttt aca tgg cag tgg act gga gac tat aag gac aag aat att 33490
Thr Phe Thr Trp Gln Trp Thr Gly Asp Tyr Lys Asp Lys Asn Ile
6850 6855 6860
acc ttt gct acc aac tca ttc tct ttc tcc tac atg gcc caa gaa 33535
Thr Phe Ala Thr Asn Ser Phe Ser Phe Ser Tyr Met Ala Gln Glu
6865 6870 6875
taaaccctgc atgccaaccc catccccacc gctctatgga aaactctgaa gcagaaaaat 33595
aaagttcaag tgtttttatt gattcaacag ttttcacagg attcgagtag ttattttccc 33655
tcctccctcc caactcatgg aatacaccac cctctcccca cgcacagcct taaacatctg 33715
aatgccattg gtaatggaca tggttttggt ctccacgttc cacacagttt cagagcgagc 33775
cagtctcggg tcggtcaggg agatgaaacc ctccgggcac tcccgcatct gcacctcaca 33835
gctcaacagc tgaggattgt cctcggtggt cgggatcacg gttatctgga agaagcagaa 33895
gagcggcggt gggaatcata gtccgcgaac gggatcggtc ggtggtgccg catcaggccc 33955
cgcagcagtc gctgccgccg ccgctccgtc aagctgctgc tcagggggtc cgggtccagt 34015
gactccctca gcatgatgcc cacggccctc agcatcagtc gtctggtgcg gcgggcgcag 34075
cagcgcatgc ggatctcgct caggtcactg cagtacgtgc aacacaggac caccaggttg 34135
ttcaacagtc catagttcaa cacgctccag ccgaaactca tcgcgggaag gatgctaccc 34195
acgtggccgt cgtaccagat cctcaggtaa atcaagtggc gctccctcca gaagacgctg 34255
cccatgtaca tgatctcctt gggcatgtgg cggttcacca cctcccggta ccacatcacc 34315
ctctggttga acatgcagcc ccggatgatc ctgcggaacc acagggccag caccgccccg 34375
cccgccatgc agcgaagaga ccccggatcc cggcaatgac aatggaggac ccaccgctcg 34435
tacccgtgga tcatctggga gctgaacaag tctatgttgg cacagcacag gcacacgctc 34495
atgcatctct tcagcactct cagctcctcg ggggtcaaaa ccatatccca gggcacgggg 34555
aactcttgca ggacagcgaa ccccgcagaa cagggcaatc ctcgcacata acttacattg 34615
tgcatggaca gggtatcgca atcaggcagc accgggtgat cctccaccag agaagcgcgg 34675
gtctcggtct cctcacagcg tggtaagggg gccggccgat acgggtgatg gcgggacgcg 34735
gctgatcgtg ttctcgaccg tgtcatgatg cagttgcttt cggacatttt cgtacttgct 34795
gtagcagaac ctggtccggg cgctgcacac cgatcgccgg cggcggtctc ggcgcttgga 34855
acgctcggtg ttaaagttgt aaaacagcca ctctctcaga ccgtgcagca gatctagggc 34915
ctcaggagtg atgaagatcc catcatgcct gatagctctg atcacatcga ccaccgtgga 34975
atgggccagg cccagccaga tgatgcaatt ttgttgggtt tcggtgacgg cgggggaggg 35035
aagaacagga agaaccatga ttaactttta atccaaacgg tctcggagca cttcaaaatg 35095
aaggtcacgg agatggcacc tctcgccccc gctgtgttgg tggaaaataa cagccaggtc 35155
aaaggtgata cggttctcga gatgttccac ggtggcttcc agcaaagcct ccacgcgcac 35215
atccagaaac aagacaatag cgaaagcggg agggttctct aattcctcaa tcatcatgtt 35275
acactcctgc accatcccca gataattttc atttttccag ccttgaatga ttcgaactag 35335
ttcctgaggt aaatccaagc cagccatgat aaagagctcg cgcagagcgc cctccaccgg 35395
cattcttaag cacaccctca taattccaag atattctgct cctggttcac ctgcagcaga 35455
ttgacaagcg gaatatcaaa atctctgccg cgatccctaa gctcctccct cagcaataac 35515
tgtaagtact ctttcatatc ctctccgaaa tttttagcca taggaccgcc aggaatgaga 35575
ttaggacaag ccacattaca gataaaccga agtccccccc agtgagcatt gccaaatgta 35635
agattgaaat aagcatgctg gctagacccg gtgatatctt ccagataact ggacagaaaa 35695
tcgcccaggc aatttttaag aaaatcaaca aaagaaaaat cttccaggtg cacgtttagg 35755
gcctcgggaa caacgatgga gtaagtgcaa ggggtgcgtt ccagcatggt tagttagctg 35815
atctgtaaaa aaacaaaaaa taaaacatta aaccatgcta gcctggcgaa caggtgggta 35875
aatcgttctc tccagcacca ggcaggccac ggggtctccg gcgcgaccct cgtaaaaatt 35935
gtcgctatga ttgaaaacca tcacagagag acgttcccgg tggccggcgt gaatgattcg 35995
acaagatgaa tacacccccg gaacattggc gtccgcgagt gaaaaaaagc gcccaaggaa 36055
gcaataaggc actacaatgc tcagtctcaa gtccagcaaa gcgatgccat gcggatgaag 36115
cacaaaattc tcaggtgcgt acaaaatgta attactcccc tcctgcacag gcagcaaagc 36175
ccccgatccc tccaggtaca catacaaagc ctcagcgtcc atagcttacc gagcagcagc 36235
acacaacagg cgcaagagtc agagaaaggc tgagctctaa cctgtccacc cgctctctgc 36295
tcaatatata gcccagatct acactgacgt aaaggccaaa gtctaaaaat acccgccaaa 36355
tagtcacaca cgcccagcac acgcccagaa accggtgaca cactcagaaa aatacgcgca 36415
cttcctcaaa cgcccaaact gtcgtcattt ccgggttccc acgctacgtc atcagaattc 36475
gactttcaaa ttccgtcgac cgttaaacac gtcacccgcc ccgcccctaa cggtcgcctt 36535
cgtcacagcc aatcagcgcc ccgcatcccc aaattcaaac gcctcatttg catattaacg 36595
cgcaccaaaa gtttgaggta tattattgat gatg 36629
<210> 131
<211> 504
<212> PRT
<213> Simian adenovirus 25.2
<400> 131
Met Glu Ser Arg Asn Pro Phe Gln Gln Gly Leu Pro Ser Gly Leu Leu
1 5 10 15
Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn
20 25 30
Leu Arg Leu Leu Ala Ser Thr Ala Gly Arg His Ala Glu Asp Pro Glu
35 40 45
Ser Pro Val Thr Pro Gly Thr Pro Thr Pro Pro Ala Ala Ala Ala Gly
50 55 60
Ala Ala Ala Arg Gly Gly Gly Gly Pro Arg Arg Glu Pro Glu Ser Arg
65 70 75 80
Ser Gly Pro Ser Gly Gly Gly Gly Gly Gly Val Ala Asp Leu Phe Pro
85 90 95
Glu Leu Arg Arg Val Leu Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly
100 105 110
Ile Lys Arg Glu Arg His Glu Glu Thr Ser His Arg Thr Glu Leu Thr
115 120 125
Val Ser Leu Met Ser Arg Arg Arg Pro Glu Ser Val Trp Trp His Glu
130 135 140
Val Gln Ser Gln Gly Ile Asp Glu Val Ser Val Met His Glu Lys Tyr
145 150 155 160
Ser Leu Glu Gln Val Lys Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp
165 170 175
Glu Leu Ala Ile Arg Asn Tyr Ala Lys Leu Ala Leu Lys Pro Asp Lys
180 185 190
Lys Tyr Lys Ile Thr Lys Leu Ile Asn Ile Arg Asn Ser Cys Tyr Ile
195 200 205
Ser Gly Asn Gly Ala Glu Val Glu Ile Ser Thr Gln Glu Arg Val Ala
210 215 220
Phe Arg Cys Cys Met Met Asn Met Tyr Pro Gly Val Val Gly Met Glu
225 230 235 240
Gly Val Thr Phe Met Asn Ala Arg Phe Arg Gly Asp Gly Tyr Asn Gly
245 250 255
Val Val Phe Met Ala Asn Thr Lys Leu Thr Val His Gly Cys Ser Phe
260 265 270
Phe Gly Phe Asn Asn Met Cys Ile Glu Ala Trp Gly Ser Val Ser Val
275 280 285
Arg Gly Cys Ser Phe Ser Ala Asn Trp Met Gly Val Val Gly Arg Thr
290 295 300
Lys Ser Lys Val Ser Val Lys Lys Cys Leu Phe Glu Arg Cys His Leu
305 310 315 320
Gly Val Met Ser Glu Gly Glu Ala Lys Val Lys His Cys Ala Ser Thr
325 330 335
Glu Thr Gly Cys Phe Val Leu Ile Lys Gly Asn Ala Gln Val Lys His
340 345 350
Asn Met Ile Cys Gly Ala Ser Asp Glu Arg Gly Tyr Gln Met Leu Thr
355 360 365
Cys Ala Gly Gly Asn Ser His Met Leu Ala Thr Val His Val Ala Ser
370 375 380
His Pro Arg Lys Thr Trp Pro Glu Phe Glu His Asn Val Met Thr Arg
385 390 395 400
Cys Asn Val His Leu Gly Ser Arg Arg Gly Met Phe Met Pro Tyr Gln
405 410 415
Cys Asn Met Gln Phe Val Lys Val Leu Leu Glu Pro Asp Ala Met Ser
420 425 430
Arg Val Ser Leu Thr Gly Val Phe Asp Met Asn Val Glu Leu Trp Lys
435 440 445
Ile Leu Arg Tyr Asp Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys
450 455 460
Gly Gly Lys His Ala Arg Leu Gln Pro Val Cys Val Glu Val Thr Glu
465 470 475 480
Asp Leu Arg Pro Asp His Leu Val Leu Ser Cys Asn Gly Thr Glu Phe
485 490 495
Gly Ser Ser Gly Glu Glu Ser Asp
500
<210> 132
<211> 142
<212> PRT
<213> Simian adenovirus 25.2
<400> 132
Met Ser Gly Ser Ala Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu
1 5 10 15
Thr Gly Arg Leu Pro Ser Trp Ala Gly Val Arg Gln Asn Val Met Gly
20 25 30
Ser Thr Val Asp Gly Arg Pro Val Gln Pro Ala Asn Ser Ser Thr Leu
35 40 45
Thr Tyr Ala Thr Leu Ser Ser Ser Ser Val Asp Ala Ala Ala Ala Ala
50 55 60
Ala Ala Ala Ser Ala Ala Ser Ala Val Arg Gly Met Ala Leu Gly Ala
65 70 75 80
Gly Tyr Tyr Ser Ser Leu Val Ala Asn Ser Ser Ser Thr Asn Asn Pro
85 90 95
Ala Ser Leu Asn Glu Glu Lys Leu Leu Leu Leu Met Ala Gln Leu Glu
100 105 110
Ala Leu Thr Gln Arg Leu Gly Glu Leu Thr Gln Gln Val Ala Gln Leu
115 120 125
Gln Ala Glu Thr Arg Ala Ala Val Ala Thr Val Lys Thr Lys
130 135 140
<210> 133
<211> 395
<212> PRT
<213> Simian adenovirus 25.2
<400> 133
Met His Pro Val Leu Arg Gln Met Arg Pro His His Pro Pro Pro Gln
1 5 10 15
Gln Gln Pro Pro Pro Pro Gln Pro Ala Leu Leu Pro Pro Pro Gln Gln
20 25 30
Gln Gln Gln Leu Pro Ala Thr Thr Ala Ala Ala Ala Val Ser Gly Ala
35 40 45
Gly Gln Thr Ser Gln Tyr Asp His Leu Ala Leu Glu Glu Gly Glu Gly
50 55 60
Leu Ala Arg Leu Gly Ala Ser Ser Pro Glu Arg His Pro Arg Val Gln
65 70 75 80
Met Lys Arg Asp Ala Arg Glu Ala Tyr Val Pro Lys Gln Asn Leu Phe
85 90 95
Arg Asp Arg Ser Gly Glu Glu Pro Glu Glu Met Arg Ala Ala Arg Phe
100 105 110
His Ala Gly Arg Glu Leu Arg Arg Gly Leu Asp Arg Lys Arg Val Leu
115 120 125
Arg Asp Glu Asp Phe Glu Ala Asp Glu Leu Thr Gly Ile Ser Pro Ala
130 135 140
Arg Ala His Val Ala Ala Ala Asn Leu Val Thr Ala Tyr Glu Gln Thr
145 150 155 160
Val Lys Glu Glu Ser Asn Phe Gln Lys Ser Phe Asn Asn His Val Arg
165 170 175
Thr Leu Ile Ala Arg Glu Glu Val Thr Leu Gly Leu Met His Leu Trp
180 185 190
Asp Leu Leu Glu Ala Ile Val Gln Asn Pro Thr Ser Lys Pro Leu Thr
195 200 205
Ala Gln Leu Phe Leu Val Val Gln His Ser Arg Asp Asn Glu Ala Phe
210 215 220
Arg Glu Ala Leu Leu Asn Ile Thr Glu Pro Glu Gly Arg Trp Leu Leu
225 230 235 240
Asp Leu Val Asn Ile Leu Gln Ser Ile Val Val Gln Glu Arg Gly Leu
245 250 255
Pro Leu Ser Glu Lys Leu Ala Ala Ile Asn Phe Ser Val Leu Ser Leu
260 265 270
Gly Lys Tyr Tyr Ala Arg Lys Ile Tyr Lys Thr Pro Tyr Val Pro Ile
275 280 285
Asp Lys Glu Val Lys Ile Asp Gly Phe Tyr Met Arg Met Thr Leu Lys
290 295 300
Val Leu Thr Leu Ser Asp Asp Leu Gly Val Tyr Arg Asn Asp Arg Met
305 310 315 320
His Arg Ala Val Ser Ala Ser Arg Arg Arg Glu Leu Ser Asp Gln Glu
325 330 335
Leu Met His Ser Leu Gln Arg Ala Leu Thr Gly Ala Gly Thr Glu Gly
340 345 350
Glu Ser Tyr Phe Asp Met Gly Ala Asp Leu His Trp Gln Pro Ser Arg
355 360 365
Arg Ala Leu Glu Ala Ala Ala Gly Pro Tyr Val Glu Glu Val Asp Asp
370 375 380
Glu Val Asp Glu Glu Gly Glu Tyr Leu Glu Asp
385 390 395
<210> 134
<211> 583
<212> PRT
<213> Simian adenovirus 25.2
<400> 134
Met Gln Gln Gln Pro Pro Pro Asp Pro Ala Met Arg Ala Ala Leu Gln
1 5 10 15
Ser Gln Pro Ser Gly Ile Asn Ser Ser Asp Asp Trp Thr Gln Ala Met
20 25 30
Gln Arg Ile Met Ala Leu Thr Thr Arg Asn Pro Glu Ala Phe Arg Gln
35 40 45
Gln Pro Gln Ala Asn Arg Leu Ser Ala Ile Leu Glu Ala Val Val Pro
50 55 60
Ser Arg Ser Asn Pro Thr His Glu Lys Val Leu Ala Ile Val Asn Ala
65 70 75 80
Leu Val Glu Asn Lys Ala Ile Arg Gly Asp Glu Ala Gly Leu Val Tyr
85 90 95
Asn Ala Leu Leu Glu Arg Val Ala Arg Tyr Asn Ser Thr Asn Val Gln
100 105 110
Thr Asn Leu Asp Arg Met Val Thr Asp Val Arg Glu Ala Val Ala Gln
115 120 125
Arg Glu Arg Phe His Arg Glu Ser Asn Leu Gly Ser Met Val Ala Leu
130 135 140
Asn Ala Phe Leu Ser Thr Gln Pro Ala Asn Val Pro Arg Gly Gln Glu
145 150 155 160
Asp Tyr Thr Asn Phe Ile Ser Ala Leu Arg Leu Met Val Thr Glu Val
165 170 175
Pro Gln Ser Glu Val Tyr Gln Ser Gly Pro Asp Tyr Phe Phe Gln Thr
180 185 190
Ser Arg Gln Gly Leu Gln Thr Val Asn Leu Ser Gln Ala Phe Lys Asn
195 200 205
Leu Gln Gly Leu Trp Gly Val Gln Ala Pro Val Gly Asp Arg Ala Thr
210 215 220
Val Ser Ser Leu Leu Thr Pro Asn Ser Arg Leu Leu Leu Leu Leu Val
225 230 235 240
Ala Pro Phe Thr Asp Ser Gly Ser Ile Asn Arg Asp Ser Tyr Leu Gly
245 250 255
Tyr Leu Ile Asn Leu Tyr Arg Glu Ala Ile Gly Gln Ala His Val Asp
260 265 270
Glu Gln Thr Tyr Gln Glu Ile Thr His Val Ser Arg Ala Leu Gly Gln
275 280 285
Glu Asp Pro Gly Asn Leu Glu Ala Thr Leu Asn Phe Leu Leu Thr Asn
290 295 300
Arg Ser Gln Lys Ile Pro Pro Gln Tyr Ala Leu Ser Thr Glu Glu Glu
305 310 315 320
Arg Ile Leu Arg Tyr Val Gln Gln Ser Val Gly Leu Phe Leu Met Gln
325 330 335
Glu Gly Ala Thr Pro Ser Ala Ala Leu Asp Met Thr Ala Arg Asn Met
340 345 350
Glu Pro Ser Met Tyr Ala Arg Asn Arg Pro Phe Ile Asn Lys Leu Met
355 360 365
Asp Tyr Leu His Arg Ala Ala Ala Met Asn Ser Asp Tyr Phe Thr Asn
370 375 380
Ala Ile Leu Asn Pro His Trp Leu Pro Pro Pro Gly Phe Tyr Thr Gly
385 390 395 400
Glu Tyr Asp Met Pro Asp Pro Asn Asp Gly Phe Leu Trp Asp Asp Val
405 410 415
Asp Ser Ser Val Phe Ser Pro Arg Pro Ala Thr Thr Val Trp Lys Lys
420 425 430
Glu Gly Gly Asp Arg Arg Pro Ser Ser Ala Leu Ser Gly Arg Ala Gly
435 440 445
Ala Ala Ala Ala Val Pro Glu Ala Ala Ser Pro Phe Pro Ser Leu Pro
450 455 460
Phe Ser Leu Asn Ser Ile Arg Ser Ser Glu Leu Gly Arg Ile Thr Arg
465 470 475 480
Pro Arg Leu Leu Gly Glu Glu Glu Tyr Leu Asn Asp Ser Leu Leu Arg
485 490 495
Pro Glu Arg Glu Lys Asn Phe Pro Asn Asn Gly Ile Glu Ser Leu Val
500 505 510
Asp Lys Met Ser Arg Trp Lys Thr Tyr Ala His Glu His Arg Asp Glu
515 520 525
Pro Arg Ala Ser Ser Ala Thr Arg Arg Arg Gln Arg His Asp Arg Gln
530 535 540
Arg Gly Leu Val Trp Asp Asp Glu Asp Ser Ala Asp Asp Ser Ser Val
545 550 555 560
Leu Asp Leu Gly Gly Ser Gly Gly Gly Asn Pro Phe Ala His Leu Arg
565 570 575
Pro Arg Ile Gly Arg Leu Met
580
<210> 135
<211> 531
<212> PRT
<213> Simian adenovirus 25.2
<400> 135
Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser
1 5 10 15
Val Met Gln Gln Ala Val Ala Ala Ala Met Gln Pro Pro Leu Glu Ala
20 25 30
Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser
35 40 45
Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr
50 55 60
Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn
65 70 75 80
Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp Phe Thr
85 90 95
Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg
100 105 110
Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn Met Pro Asn Val
115 120 125
Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser
130 135 140
Arg Lys Thr Pro Asn Gly Val Thr Val Gly Asp Asp Tyr Asp Gly Ser
145 150 155 160
Gln Asp Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly
165 170 175
Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile
180 185 190
Asp Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp
195 200 205
Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro
210 215 220
Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His
225 230 235 240
Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser
245 250 255
Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu
260 265 270
Gly Phe Gln Ile Leu Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala
275 280 285
Leu Leu Asp Val Asp Ala Tyr Glu Lys Ser Lys Glu Asp Ser Ala Ala
290 295 300
Ala Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp
305 310 315 320
Asn Phe Ala Ser Ala Ala Ala Val Ala Glu Ala Ala Glu Thr Glu Ser
325 330 335
Lys Ile Val Ile Gln Pro Val Glu Lys Asp Ser Lys Asn Arg Ser Tyr
340 345 350
Asn Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu
355 360 365
Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu
370 375 380
Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val Tyr Trp Ser
385 390 395 400
Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser Thr Arg Gln
405 410 415
Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser
420 425 430
Lys Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln Leu Arg Ala
435 440 445
Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gln Ile
450 455 460
Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val
465 470 475 480
Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile Arg
485 490 495
Gly Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro
500 505 510
Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val Leu Ser Ser
515 520 525
Arg Thr Phe
530
<210> 136
<211> 194
<212> PRT
<213> Simian adenovirus 25.2
<400> 136
Met Ser Ile Leu Ile Ser Pro Ser Asn Asn Thr Gly Trp Gly Leu Arg
1 5 10 15
Ala Pro Ser Lys Met Tyr Gly Gly Ala Arg Gln Arg Ser Thr Gln His
20 25 30
Pro Val Arg Val Arg Gly His Phe Arg Ala Pro Trp Gly Ala Leu Lys
35 40 45
Gly Arg Val Arg Ser Arg Thr Thr Val Asp Asp Val Ile Asp Gln Val
50 55 60
Val Ala Asp Ala Arg Asn Tyr Thr Pro Ala Ala Ala Pro Val Ser Thr
65 70 75 80
Val Asp Ala Val Ile Asp Ser Val Val Ala Asp Ala Arg Arg Tyr Ala
85 90 95
Arg Ala Lys Ser Arg Arg Arg Arg Ile Ala Arg Arg His Arg Ser Thr
100 105 110
Pro Ala Met Arg Ala Ala Arg Ala Leu Leu Arg Arg Ala Arg Arg Thr
115 120 125
Gly Arg Arg Ala Met Leu Arg Ala Ala Arg Arg Ala Ala Ser Gly Ser
130 135 140
Ser Ser Ala Gly Arg Thr Arg Arg Arg Ala Ala Thr Ala Ala Ala Ala
145 150 155 160
Ala Ile Ala Ser Met Ser Arg Pro Arg Arg Gly Asn Val Tyr Trp Val
165 170 175
Arg Asp Ala Ala Thr Gly Val Arg Val Pro Val Arg Thr Arg Pro Pro
180 185 190
Arg Thr
<210> 137
<211> 348
<212> PRT
<213> Simian adenovirus 25.2
<400> 137
Met Ser Lys Arg Lys Tyr Lys Glu Glu Met Leu Gln Val Ile Ala Pro
1 5 10 15
Glu Ile Tyr Gly Pro Ala Val Val Lys Glu Glu Arg Lys Pro Arg Lys
20 25 30
Leu Lys Arg Val Lys Lys Asp Lys Lys Glu Glu Glu Asp Asp Gly Leu
35 40 45
Val Glu Phe Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gln Trp Arg
50 55 60
Gly Arg Lys Val Lys Pro Val Leu Arg Pro Gly Thr Thr Val Val Phe
65 70 75 80
Thr Pro Gly Glu Arg Ser Gly Ser Ala Ser Lys Arg Ser Tyr Asp Glu
85 90 95
Val Tyr Gly Asp Glu Asp Ile Leu Glu Gln Ala Val Glu Arg Leu Gly
100 105 110
Glu Phe Ala Tyr Gly Lys Arg Ser Arg Pro Ala Pro Leu Lys Glu Glu
115 120 125
Ala Val Ser Ile Pro Leu Asp His Gly Asn Pro Thr Pro Ser Leu Lys
130 135 140
Pro Val Thr Leu Gln Gln Val Leu Pro Ser Ala Ala Pro Arg Arg Gly
145 150 155 160
Phe Lys Arg Glu Gly Gly Glu Asp Leu Tyr Pro Thr Met Gln Leu Met
165 170 175
Val Pro Lys Arg Gln Lys Leu Glu Asp Val Leu Glu His Met Lys Val
180 185 190
Asp Pro Glu Val Gln Pro Glu Val Lys Val Arg Pro Ile Lys Gln Val
195 200 205
Ala Pro Gly Leu Gly Val Gln Thr Val Asp Ile Lys Ile Pro Thr Glu
210 215 220
Pro Met Glu Thr Gln Thr Glu Pro Val Lys Pro Ser Thr Ser Thr Met
225 230 235 240
Glu Val Gln Thr Asp Pro Trp Met Pro Ala Ala Pro Thr Thr Thr Thr
245 250 255
Arg Arg Arg Arg Lys Tyr Gly Ala Ala Ser Leu Leu Met Pro Asn Tyr
260 265 270
Ala Leu His Pro Ser Ile Ile Pro Thr Pro Gly Tyr Arg Gly Thr Arg
275 280 285
Phe Tyr Arg Gly Tyr Thr Ser Ser Arg Arg Arg Lys Thr Thr Thr Arg
290 295 300
Arg Arg Arg Arg Arg His Thr Arg Arg Ser Ser Thr Ala Thr Ser Ala
305 310 315 320
Ala Ala Leu Val Arg Arg Val Tyr Arg Ser Gly Arg Glu Pro Leu Thr
325 330 335
Leu Pro Arg Ala Arg Tyr His Pro Ser Ile Ala Ile
340 345
<210> 138
<211> 77
<212> PRT
<213> Simian adenovirus 25.2
<400> 138
Met Ala Leu Thr Cys Arg Leu Arg Val Pro Ile Thr Gly Tyr Arg Gly
1 5 10 15
Arg Lys Pro Arg Arg Arg Arg Leu Thr Gly Asn Gly Leu Arg Arg His
20 25 30
His His Arg Arg Arg Arg Ala Ile Ser Lys Arg Leu Gly Gly Gly Phe
35 40 45
Leu Pro Ala Leu Ile Pro Ile Ile Ala Ala Ala Ile Gly Ala Ile Pro
50 55 60
Gly Ile Ala Ser Val Ala Val Gln Ala Ser Gln Arg His
65 70 75
<210> 139
<211> 244
<212> PRT
<213> Simian adenovirus 25.2
<400> 139
Met Glu Asp Ile Asn Phe Ser Ser Leu Ala Pro Arg His Gly Thr Arg
1 5 10 15
Pro Phe Met Gly Thr Trp Ser Asp Ile Gly Asn Ser Gln Leu Asn Gly
20 25 30
Gly Ala Phe Asn Trp Ser Ser Leu Trp Ser Gly Leu Lys Asn Phe Gly
35 40 45
Ser Thr Leu Lys Thr Tyr Gly Asn Lys Ala Trp Asn Ser Ser Thr Gly
50 55 60
Gln Ala Leu Arg Glu Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys Val
65 70 75 80
Val Asp Gly Leu Ala Ser Gly Ile Asn Gly Val Val Asp Leu Ala Asn
85 90 95
Gln Ala Val Gln Lys Gln Ile Asn Ser Arg Leu Asp Ala Val Pro Pro
100 105 110
Ala Gly Ser Val Glu Met Pro Gln Val Glu Glu Glu Leu Pro Pro Leu
115 120 125
Asp Lys Arg Gly Asp Lys Arg Pro Arg Pro Asp Ala Glu Glu Thr Leu
130 135 140
Leu Thr His Thr Asp Glu Pro Pro Pro Tyr Glu Glu Ala Val Lys Leu
145 150 155 160
Gly Leu Pro Thr Thr Arg Pro Ile Ala Pro Leu Ala Thr Gly Val Leu
165 170 175
Lys Pro Glu Ser Asn Lys Pro Ala Thr Leu Asp Leu Pro Pro Pro Gln
180 185 190
Pro Ser Arg Pro Ser Thr Val Ala Lys Pro Leu Pro Pro Val Ala Val
195 200 205
Ala Arg Ala Arg Pro Gly Gly Thr Ala Arg Pro His Ala Asn Trp Gln
210 215 220
Ser Thr Leu Asn Ser Ile Val Gly Leu Gly Val Gln Ser Val Lys Arg
225 230 235 240
Arg Arg Cys Tyr
<210> 140
<211> 933
<212> PRT
<213> Simian adenovirus 25.2
<400> 140
Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr
65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Thr Cys Gln Trp Thr Tyr Lys Ala Asp Gly Glu Thr Ala
130 135 140
Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gln Gly Ile Asn
145 150 155 160
Ile Thr Lys Asp Gly Ile Gln Leu Gly Thr Asp Thr Asp Asp Gln Pro
165 170 175
Ile Tyr Ala Asp Glu Thr Tyr Gln Pro Glu Pro Gln Val Gly Asp Ala
180 185 190
Glu Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala
195 200 205
Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys
210 215 220
Pro Thr Asn Lys Glu Gly Gly Gln Ala Asn Val Lys Thr Gly Thr Gly
225 230 235 240
Thr Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe Asp Asn Arg Ser
245 250 255
Ala Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu Tyr Thr Glu Asn
260 265 270
Val Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr Lys Ala Gly Thr
275 280 285
Asp Asp Ser Ser Ser Ser Ile Asn Leu Gly Gln Gln Ala Met Pro Asn
290 295 300
Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr
305 310 315 320
Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln
325 330 335
Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr
340 345 350
Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met
355 360 365
Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu
370 375 380
Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp
385 390 395 400
Ala Val Gly Lys Thr Asp Thr Tyr Gln Gly Ile Lys Ala Asn Gly Thr
405 410 415
Asp Gln Thr Thr Trp Thr Lys Asp Asp Ser Val Asn Asp Ala Asn Glu
420 425 430
Ile Gly Lys Gly Asn Pro Phe Ala Met Glu Ile Asn Ile Gln Ala Asn
435 440 445
Leu Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp
450 455 460
Ser Tyr Lys Tyr Thr Pro Ala Asn Val Thr Leu Pro Ala Asn Thr Asn
465 470 475 480
Thr Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp
485 490 495
Ala Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn
500 505 510
Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser
515 520 525
Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro
530 535 540
Gln Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr
545 550 555 560
Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gln Ser
565 570 575
Ser Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser Ile Ser Phe Thr
580 585 590
Ser Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala
595 600 605
Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gln Ser Phe
610 615 620
Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn
625 630 635 640
Ala Thr Asn Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe
645 650 655
Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu
660 665 670
Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr
675 680 685
Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile
690 695 700
Thr Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr
705 710 715 720
Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn
725 730 735
Val Ala Gln Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gln Met Leu
740 745 750
Ala His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr
755 760 765
Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg
770 775 780
Gln Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gln Ala Val Thr Leu
785 790 795 800
Ala Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr
805 810 815
Met Arg Gln Gly Gln Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile
820 825 830
Gly Lys Ser Ala Val Ala Ser Val Thr Gln Lys Lys Phe Leu Cys Asp
835 840 845
Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly
850 855 860
Ala Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala Asn Ser Ala His
865 870 875 880
Ala Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu
885 890 895
Leu Tyr Val Val Phe Glu Val Phe Asp Val Val Arg Val His Gln Pro
900 905 910
His Arg Gly Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala
915 920 925
Gly Asn Ala Thr Thr
930
<210> 141
<211> 208
<212> PRT
<213> Simian adenovirus 25.2
<400> 141
Met Thr Ala Cys Ala Gly Ser Gly Glu Gln Glu Leu Arg Ala Ile Leu
1 5 10 15
Arg Asp Leu Gly Cys Gly Pro Cys Phe Leu Gly Thr Phe Asp Lys Arg
20 25 30
Phe Pro Gly Phe Met Ala Pro His Lys Leu Ala Cys Ala Ile Val Asn
35 40 45
Thr Ala Gly Arg Glu Thr Gly Gly Glu His Trp Leu Ala Phe Ala Trp
50 55 60
Asn Pro Arg Thr His Thr Cys Tyr Leu Phe Asp Pro Phe Gly Phe Ser
65 70 75 80
Asp Glu Arg Leu Lys Gln Ile Tyr Gln Phe Glu Tyr Glu Gly Leu Leu
85 90 95
Arg Arg Ser Ala Leu Ala Thr Glu Asp Arg Cys Val Thr Leu Glu Lys
100 105 110
Ser Thr Gln Thr Val Gln Gly Pro Arg Ser Ala Ala Cys Gly Leu Phe
115 120 125
Cys Cys Met Phe Leu His Ala Phe Val His Trp Pro Asp Arg Pro Met
130 135 140
Asp Lys Asn Pro Thr Met Asn Leu Leu Thr Gly Val Pro Asn Gly Met
145 150 155 160
Leu Gln Ser Pro Gln Val Glu Thr Thr Leu Arg Arg Asn Gln Glu Ala
165 170 175
Leu Tyr Arg Phe Leu Asn Ala His Ser Ala Tyr Phe Arg Ser His Arg
180 185 190
Ala Arg Ile Glu Lys Ala Thr Ala Phe Asp Arg Met Asn Gln Asp Met
195 200 205
<210> 142
<211> 797
<212> PRT
<213> Simian adenovirus 25.2
<400> 142
Met Glu Thr Gln Pro Ser Ser Pro Thr Ser Pro Ser Ala Pro Ala Ala
1 5 10 15
Ala Asp Glu Asn Gln Gln Gln Gln Asn Glu Ser Leu Thr Ala Pro Pro
20 25 30
Pro Ser Pro Thr Ser Asp Ala Ala Pro Asp Met Gln Glu Met Glu Glu
35 40 45
Ser Ile Glu Ile Asp Leu Gly Tyr Val Thr Pro Ala Glu His Glu Glu
50 55 60
Glu Leu Ala Ala Arg Phe Ser Ala Pro Glu Glu Asn His Gln Glu Gln
65 70 75 80
Pro Glu Gln Glu Ala Glu Ser Glu Gln Gln Gln Ala Gly Leu Glu His
85 90 95
Gly Asp Tyr Leu Ser Gly Ala Glu Asp Val Leu Ile Lys His Leu Ala
100 105 110
Arg Gln Cys Ile Ile Val Lys Asp Ala Leu Leu Asp Arg Ala Glu Val
115 120 125
Pro Leu Ser Val Ala Glu Leu Ser Arg Ala Tyr Glu Arg Asn Leu Phe
130 135 140
Ser Pro Arg Val Pro Pro Lys Arg Gln Pro Asn Gly Thr Cys Glu Pro
145 150 155 160
Asn Pro Arg Leu Asn Phe Tyr Pro Val Phe Ala Val Pro Glu Ala Leu
165 170 175
Ala Thr Tyr His Leu Phe Phe Lys Asn Gln Arg Ile Pro Val Ser Cys
180 185 190
Arg Ala Asn Arg Thr Arg Ala Asp Ala Leu Leu Asn Leu Gly Pro Gly
195 200 205
Ala Arg Leu Pro Asp Ile Ala Ser Leu Glu Glu Val Pro Lys Ile Phe
210 215 220
Glu Gly Leu Gly Ser Asp Glu Thr Arg Ala Ala Asn Ala Leu Gln Gly
225 230 235 240
Ser Gly Glu Glu His Glu His His Ser Ala Leu Val Glu Leu Glu Gly
245 250 255
Asp Asn Ala Arg Leu Ala Val Leu Lys Arg Thr Val Glu Leu Thr His
260 265 270
Phe Ala Tyr Pro Ala Leu Asn Leu Pro Pro Lys Val Met Ser Ala Val
275 280 285
Met Asp Gln Val Leu Ile Lys Arg Ala Ser Pro Leu Ser Glu Glu Glu
290 295 300
Met Gln Asp Pro Glu Ser Ser Asp Glu Gly Lys Pro Val Val Ser Asp
305 310 315 320
Glu Gln Leu Ala Arg Trp Leu Gly Ala Ser Ser Thr Pro Gln Ser Leu
325 330 335
Glu Glu Arg Arg Lys Leu Met Met Ala Val Val Leu Val Thr Val Glu
340 345 350
Leu Glu Cys Leu Arg Arg Phe Phe Ala Asp Ala Glu Thr Leu Arg Lys
355 360 365
Val Glu Glu Asn Leu His Tyr Leu Phe Arg His Gly Phe Val Arg Gln
370 375 380
Ala Cys Lys Ile Ser Asn Val Glu Leu Thr Asn Leu Val Ser Tyr Met
385 390 395 400
Gly Ile Leu His Glu Asn Arg Leu Gly Gln Asn Val Leu His Thr Thr
405 410 415
Leu Arg Gly Glu Ala Arg Arg Asp Tyr Ile Arg Asp Cys Val Tyr Leu
420 425 430
Tyr Leu Cys His Thr Trp Gln Thr Gly Met Gly Val Trp Gln Gln Cys
435 440 445
Leu Glu Glu Gln Asn Leu Lys Glu Leu Cys Lys Leu Leu Gln Lys Asn
450 455 460
Leu Lys Ala Leu Trp Thr Gly Phe Asp Glu Arg Thr Thr Ala Ser Asp
465 470 475 480
Leu Ala Asp Leu Ile Phe Pro Glu Arg Leu Arg Leu Thr Leu Arg Asn
485 490 495
Gly Leu Pro Asp Phe Met Ser Gln Ser Met Leu Gln Asn Phe Arg Ser
500 505 510
Phe Ile Leu Glu Arg Ser Gly Ile Leu Pro Ala Thr Cys Ser Ala Leu
515 520 525
Pro Ser Asp Phe Val Pro Leu Thr Phe Arg Glu Cys Pro Pro Pro Leu
530 535 540
Trp Ser His Cys Tyr Leu Leu Arg Leu Ala Asn Tyr Leu Ala Tyr His
545 550 555 560
Ser Asp Val Ile Glu Asp Val Ser Gly Glu Gly Leu Leu Glu Cys His
565 570 575
Cys Arg Cys Asn Leu Cys Thr Pro His Arg Ser Leu Ala Cys Asn Pro
580 585 590
Gln Leu Leu Ser Glu Thr Gln Ile Ile Gly Thr Phe Glu Leu Gln Gly
595 600 605
Pro Gly Glu Gly Lys Gly Gly Leu Lys Leu Thr Pro Gly Leu Trp Thr
610 615 620
Ser Ala Tyr Leu Arg Lys Phe Val Pro Glu Asp Tyr His Pro Phe Glu
625 630 635 640
Ile Arg Phe Tyr Glu Asp Gln Ser Gln Pro Pro Lys Ala Glu Leu Ser
645 650 655
Ala Cys Val Ile Thr Gln Gly Ala Ile Leu Ala Gln Leu Gln Ala Ile
660 665 670
Gln Lys Ser Arg Gln Glu Phe Leu Leu Lys Lys Gly His Gly Val Tyr
675 680 685
Leu Asp Pro Gln Thr Gly Glu Glu Leu Asn Pro Ser Phe Pro Gln Asp
690 695 700
Ala Pro Arg Lys Gln Gln Glu Ala Glu Ser Gly Ala Ala Ala Ala Ala
705 710 715 720
Gly Gly Phe Gly Gly Arg Leu Gly Glu Gln Ser Gly Arg Gly Asp Gly
725 730 735
Arg Leu Gly Gln His Ser Gly Arg Gly Gly Gln Pro Ala Arg Gln Ser
740 745 750
Gly Gly Gly Arg Arg Gly Gly Gly Gly Gly Arg Gly Arg Ser Ser Arg
755 760 765
Arg Gln Thr Val Val Leu Gly Gly Glu Ser Lys Gln His Gly Tyr His
770 775 780
Leu Arg Ser Gly Ser Gly Ser Arg Arg Pro Gly Pro Gln
785 790 795
<210> 143
<211> 227
<212> PRT
<213> Simian adenovirus 25.2
<400> 143
Met Ser Lys Glu Ile Pro Thr Pro Tyr Met Trp Ser Tyr Gln Pro Gln
1 5 10 15
Met Gly Leu Ala Ala Gly Ala Ala Gln Asp Tyr Ser Thr Arg Met Asn
20 25 30
Trp Leu Ser Ala Gly Pro Ala Met Ile Ser Arg Val Asn Asp Ile Arg
35 40 45
Ala His Arg Asn Gln Ile Leu Leu Glu Gln Ser Ala Ile Thr Ala Thr
50 55 60
Pro Arg His His Leu Asn Pro Arg Asn Trp Pro Ala Ala Leu Val Tyr
65 70 75 80
Gln Glu Ile Pro Gln Pro Thr Thr Val Leu Leu Pro Arg Asp Ala Gln
85 90 95
Ala Glu Val Gln Leu Thr Asn Ser Gly Val Gln Leu Ala Gly Gly Ala
100 105 110
Ala Leu Cys Arg His Arg Pro Ala Gln Gly Ile Lys Arg Leu Val Ile
115 120 125
Arg Gly Arg Gly Thr Gln Leu Asn Asp Glu Val Val Ser Ser Ser Leu
130 135 140
Gly Leu Arg Pro Asp Gly Val Phe Gln Leu Ala Gly Ser Gly Arg Ser
145 150 155 160
Ser Phe Thr Pro Arg Gln Ala Val Leu Thr Leu Glu Ser Ser Ser Ser
165 170 175
Gln Pro Arg Ser Gly Gly Ile Gly Thr Leu Gln Phe Val Glu Glu Phe
180 185 190
Thr Pro Ser Val Tyr Phe Asn Pro Phe Ser Gly Ser Pro Gly His Tyr
195 200 205
Pro Asp Glu Phe Ile Pro Asn Phe Asp Ala Ile Ser Glu Ser Val Asp
210 215 220
Gly Tyr Asp
225
<210> 144
<211> 106
<212> PRT
<213> Simian adenovirus 25.2
<400> 144
Met Ser His Gly Gly Ala Ala Asp Leu Ala Arg Leu Arg His Leu Asp
1 5 10 15
His Cys Arg Arg Phe Arg Cys Phe Ala Arg Asp Leu Ala Glu Phe Ala
20 25 30
Tyr Phe Glu Leu Pro Glu Glu His Pro Gln Gly Pro Ala His Gly Val
35 40 45
Arg Ile Val Val Glu Gly Gly Leu Asp Ser His Leu Leu Arg Ile Phe
50 55 60
Ser Gln Arg Pro Ile Leu Val Glu Arg Glu Gln Gly Gln Thr Leu Leu
65 70 75 80
Thr Leu Tyr Cys Ile Cys Asn Arg Pro Gly Leu His Glu Ser Leu Cys
85 90 95
Cys Leu Leu Cys Thr Glu Tyr Asn Lys Ser
100 105
<210> 145
<211> 178
<212> PRT
<213> Simian adenovirus 25.2
<400> 145
Met Gly Lys Ile Thr Leu Val Ser Cys Gly Val Pro Val Ala Val Val
1 5 10 15
Val Leu Ser Ile Val Gly Leu Gly Gly Ala Ala Val Val Lys Glu Glu
20 25 30
Lys Ala Asp Ser Cys Leu His Phe Asn Pro Asp Lys Cys Gln Leu Ser
35 40 45
Phe Gln Pro Asp Gly Asn Arg Cys Thr Val Leu Ile Lys Cys Gly Trp
50 55 60
Glu Cys Glu Asn Val Arg Ile Glu Tyr Asn Asn Lys Thr Arg Asn Asn
65 70 75 80
Thr Leu Ala Ser Val Trp Gln Pro Gly Asp Pro Glu Trp Tyr Thr Val
85 90 95
Ser Val Pro Gly Ala Asp Gly Ser Pro Arg Thr Val Asn Asn Thr Phe
100 105 110
Ile Phe Ala His Met Cys Asp Thr Val Met Trp Met Ser Lys Gln Tyr
115 120 125
Asp Met Trp Pro Pro Thr Lys Glu Asn Ile Val Val Phe Ser Ile Ala
130 135 140
Tyr Ser Leu Cys Thr Ala Leu Ile Thr Ala Ile Val Cys Leu Ser Ile
145 150 155 160
His Met Leu Ile Ala Ile Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys
165 170 175
Gln Pro
<210> 146
<211> 242
<212> PRT
<213> Simian adenovirus 25.2
<400> 146
Met Ala Ser Val Thr Ala Leu Ile Ile Phe Leu Gly Leu Val Gly Thr
1 5 10 15
Ser Ser Thr Phe Asp His Lys Asn Leu Thr Val Phe Val Gly Ser Asp
20 25 30
Val Thr Leu Pro Gly His Gln Ser His Gln Arg Val Ser Trp Tyr His
35 40 45
Phe Asn Lys Gln Asn Thr Ala Tyr Thr Leu Cys Lys Gly His Gln Gln
50 55 60
Ala Thr Tyr Arg Ser Gly Leu Tyr Tyr Arg Cys Asn Asn Asn Asn Leu
65 70 75 80
Thr Leu Leu Ser Val Asn Ala Asn Tyr Ser Gly Thr Tyr Tyr Gly Thr
85 90 95
Asn Phe Asn Thr Lys Gln Asp Thr Tyr Tyr Ser Val Lys Val Leu Asn
100 105 110
Pro Thr Ser Pro Arg Thr Thr Thr Lys Pro Thr Ala Thr Thr Thr Thr
115 120 125
Thr Ala Lys Pro Thr Lys Pro Lys Thr Thr Lys Lys Thr Thr Val Lys
130 135 140
Thr Thr Thr Thr Arg Thr Thr Thr Thr Thr Glu Thr Thr Thr Ser Thr
145 150 155 160
Leu Ala Ala Thr Thr His Thr His Ile Glu Leu Thr Leu Gln Thr Thr
165 170 175
Asn Asp Leu Ile Ala Leu Leu Gln Lys Gly Asp Asn Ser Thr Thr Ser
180 185 190
Asp Glu Glu Ile Pro Lys Ser Met Ile Gly Ile Ile Val Ala Val Val
195 200 205
Val Cys Met Leu Ile Ile Ala Leu Cys Met Val Tyr Tyr Ala Phe Cys
210 215 220
Tyr Arg Lys His Arg Leu Asn Asp Lys Leu Glu His Leu Leu Ser Val
225 230 235 240
Glu Phe
<210> 147
<211> 202
<212> PRT
<213> Simian adenovirus 25.2
<400> 147
Met Lys Ile Leu Gly Leu Leu Val Phe Ser Ile Ile Thr Ser Ala Leu
1 5 10 15
Cys Glu Ser Val Asp Lys Asp Val Thr Ile Thr Thr Gly Ser Asn Tyr
20 25 30
Thr Leu Lys Gly Pro Pro Ser Gly Met Leu Ser Trp Tyr Cys Tyr Phe
35 40 45
Gly Thr Asp Ser Glu Gln Thr Glu Leu Cys Asn Ala Met Arg Gly Gln
50 55 60
Met Pro Thr Ser Lys Ile Lys His Lys Cys Asn Gly Thr Asp Leu Ile
65 70 75 80
Leu Leu Asn Val Thr Lys Ala Tyr Ala Gly Ser Tyr Thr Cys Pro Gly
85 90 95
Asp Asp Ala Asp Ser Met Ile Phe Tyr Lys Val Thr Val Val Asp Pro
100 105 110
Thr Thr Pro Pro Pro Thr Thr Thr Thr Thr His Thr Thr His Thr Glu
115 120 125
Gln Thr Pro Glu Ala Ala Gly Glu Leu Ala Leu Gln Val Gln Glu Asp
130 135 140
Ser Leu Met Ala Asn Thr Pro Thr Pro Asp His Arg Cys Pro Gly Leu
145 150 155 160
Leu Val Ser Gly Ile Val Gly Val Leu Ser Gly Leu Ala Val Ile Ile
165 170 175
Ile Cys Met Phe Ile Phe Ala Cys Cys Tyr Arg Arg Leu Tyr Arg Gln
180 185 190
Lys Ser Asp Pro Leu Leu Asn Leu Tyr Val
195 200
<210> 148
<211> 287
<212> PRT
<213> Simian adenovirus 25.2
<400> 148
Met Lys Ala Val Arg Val Leu Val Phe Cys Ser Leu Ile Gly Ile Val
1 5 10 15
Phe Ser Ala Gly Phe Leu Lys Asn Leu Thr Ile Tyr Glu Gly Glu Asn
20 25 30
Ala Thr Leu Val Gly Ile Ser Gly Gln Asn Val Ser Trp Leu Lys Tyr
35 40 45
His Leu Asp Arg Trp Lys Asp Ile Cys Asp Trp Asn Val Thr Val Tyr
50 55 60
Thr Cys Asn Gly Val Asn Leu Thr Ile Thr Asn Ala Thr Gln Asp Gln
65 70 75 80
Asn Gly Arg Phe Lys Gly Gln Ser Phe Thr Arg Asn Asn Gly Tyr Glu
85 90 95
Ser His Asn Met Phe Ile Tyr Asp Val Thr Val Ile Arg Asn Glu Thr
100 105 110
Ala Thr Thr Thr Thr Gln Met Gln Thr Thr Gln Thr Thr Thr Tyr Ser
115 120 125
Thr Ser Asn Gln Pro Thr Thr Thr Thr Ala Ala Glu Val Ala Ser Ser
130 135 140
Ser Gly Val Lys Val Ala Phe Leu Leu Leu Pro Pro Ser Ser Ser Pro
145 150 155 160
Thr Ala Ser Thr Asn Glu Gln Thr Thr Glu Phe Leu Ser Thr Val Glu
165 170 175
Ser His Thr Thr Ala Thr Ser Ser Ala Phe Ser Ser Thr Ala Asn Leu
180 185 190
Ser Ser Leu Ser Ser Thr Pro Ile Ser Ser Ala Thr Thr Thr Pro Ser
195 200 205
Pro Ala Pro Leu Pro Thr Pro Leu Lys Gln Thr Glu Asp Ser Gly Met
210 215 220
Gln Trp Gln Ile Thr Leu Leu Ile Val Ile Gly Leu Val Ile Leu Ala
225 230 235 240
Val Leu Leu Tyr Tyr Ile Phe Cys Arg Arg Ile Pro Asn Ala His Arg
245 250 255
Lys Pro Val Tyr Lys Pro Ile Ile Val Gly Gln Pro Glu Pro Leu Gln
260 265 270
Val Glu Gly Gly Leu Arg Asn Leu Leu Phe Ser Phe Thr Val Trp
275 280 285
<210> 149
<211> 91
<212> PRT
<213> Simian adenovirus 25.2
<400> 149
Met Ile Pro Arg Gln Phe Leu Ile Thr Ile Leu Ile Cys Leu Leu Gln
1 5 10 15
Val Cys Ala Thr Leu Ala Leu Val Ala Asn Ala Ser Pro Asp Cys Ile
20 25 30
Gly Pro Phe Ala Ser Tyr Val Leu Phe Ala Phe Ile Thr Cys Ile Cys
35 40 45
Cys Cys Ser Ile Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Ile Asp
50 55 60
Trp Ile Phe Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg
65 70 75 80
Asp Gln Arg Val Ala Arg Leu Leu Arg Leu Leu
85 90
<210> 150
<211> 144
<212> PRT
<213> Simian adenovirus 25.2
<400> 150
Met Arg Ala Leu Leu Leu Leu Ala Leu Leu Leu Leu Val Leu Pro Arg
1 5 10 15
Pro Val Asp Pro Arg Ser Pro Thr Gln Ser Pro Glu Glu Val Arg Lys
20 25 30
Cys Lys Phe Gln Glu Pro Trp Lys Phe Leu Lys Cys Tyr Arg Gln Lys
35 40 45
Ser Asp Met His Pro Ser Trp Ile Met Ile Ile Gly Ile Val Asn Ile
50 55 60
Leu Ala Cys Thr Leu Ile Ser Phe Val Ile Tyr Pro Cys Phe Asp Phe
65 70 75 80
Gly Trp Asn Ser Pro Glu Ala Leu Tyr Leu Pro Pro Glu Pro Asp Thr
85 90 95
Pro Pro Gln Gln Gln Pro Gln Ala His Ala Leu Pro Pro Pro Gln Pro
100 105 110
Arg Pro Gln Tyr Met Pro Ile Leu Asp Tyr Glu Ala Glu Pro Gln Arg
115 120 125
Pro Met Leu Pro Ala Ile Ser Tyr Phe Asn Leu Thr Gly Gly Asp Asp
130 135 140
<210> 151
<211> 443
<212> PRT
<213> Simian adenovirus 25.2
<400> 151
Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr
1 5 10 15
Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro
20 25 30
Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser
35 40 45
Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Ala Val Thr Leu
50 55 60
Lys Leu Gly Glu Gly Val Asp Leu Asp Asp Ser Gly Lys Leu Ile Ser
65 70 75 80
Lys Asn Ala Thr Lys Ala Thr Ala Pro Leu Ser Ile Ser Asn Ser Thr
85 90 95
Ile Ser Leu Asn Met Ala Ala Pro Phe Tyr Asn Asn Asn Gly Thr Leu
100 105 110
Ser Leu Asn Val Ser Thr Pro Leu Ala Val Phe Pro Thr Phe Asn Thr
115 120 125
Leu Gly Ile Ser Leu Gly Asn Gly Leu Gln Thr Ser Asn Lys Leu Leu
130 135 140
Thr Val Gln Leu Thr His Pro Leu Thr Phe Ser Ser Asn Ser Ile Thr
145 150 155 160
Val Lys Thr Asp Lys Gly Leu Tyr Ile Asn Ser Ser Gly Asn Arg Gly
165 170 175
Leu Glu Ala Asn Ile Ser Leu Lys Arg Gly Leu Ile Phe Asp Gly Asn
180 185 190
Ala Ile Ala Thr Tyr Leu Gly Ser Gly Leu Asp Tyr Gly Ser Tyr Asp
195 200 205
Ser Asp Gly Lys Thr Arg Pro Ile Ile Thr Lys Ile Gly Ala Gly Leu
210 215 220
Asn Phe Asp Ala Asn Lys Ala Met Ala Val Lys Leu Gly Thr Gly Leu
225 230 235 240
Ser Phe Asp Ser Ala Gly Ala Leu Thr Ala Gly Asn Lys Glu Asp Asp
245 250 255
Lys Leu Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gln Leu
260 265 270
Leu Ser Asp Arg Asp Ala Lys Phe Thr Leu Cys Leu Thr Lys Cys Gly
275 280 285
Ser Gln Ile Leu Gly Thr Val Ala Val Ala Ala Val Thr Val Gly Ser
290 295 300
Ala Leu Asn Pro Ile Asn Asp Thr Val Lys Ser Ala Ile Val Phe Leu
305 310 315 320
Arg Phe Asp Ser Asn Gly Val Leu Met Ser Asn Ser Ser Met Val Gly
325 330 335
Asp Tyr Trp Asn Phe Arg Glu Gly Gln Thr Thr Gln Ser Val Ala Tyr
340 345 350
Thr Asn Ala Val Gly Phe Met Pro Asn Leu Gly Ala Tyr Pro Lys Thr
355 360 365
Gln Ser Lys Thr Pro Lys Asn Ser Ile Val Ser Gln Val Tyr Leu Asn
370 375 380
Gly Glu Thr Thr Met Pro Met Thr Leu Thr Ile Thr Phe Asn Gly Thr
385 390 395 400
Asp Glu Lys Asp Thr Thr Pro Val Ser Thr Tyr Ser Met Thr Phe Thr
405 410 415
Trp Gln Trp Thr Gly Asp Tyr Lys Asp Lys Asn Ile Thr Phe Ala Thr
420 425 430
Asn Ser Phe Ser Phe Ser Tyr Met Ala Gln Glu
435 440
<210> 152
<211> 590
<212> DNA
<213> Simian adenovirus 25.2
<220>
<221> CDS
<222> (10)..(585)
<223> label=Elb\19K
<400> 152
gcaggactc atg gag atc tgg acg gtc ttg gaa gac ttt cat cag act aga 51
Met Glu Ile Trp Thr Val Leu Glu Asp Phe His Gln Thr Arg
1 5 10
cag ctg cta gag aac tca tcg gag gaa gtc tct tac ctg tgg aga ttt 99
Gln Leu Leu Glu Asn Ser Ser Glu Glu Val Ser Tyr Leu Trp Arg Phe
15 20 25 30
tgc ttc ggt ggg gct cta gct aag cta gtc tat agg gcc aaa cag gat 147
Cys Phe Gly Gly Ala Leu Ala Lys Leu Val Tyr Arg Ala Lys Gln Asp
35 40 45
tat aag gat caa ttt gag gat att ttg aga gag tgt cct ggt att ttt 195
Tyr Lys Asp Gln Phe Glu Asp Ile Leu Arg Glu Cys Pro Gly Ile Phe
50 55 60
gac tct ctc aac ttg ggc cat cag tct cac ttt aac cag agt att ctg 243
Asp Ser Leu Asn Leu Gly His Gln Ser His Phe Asn Gln Ser Ile Leu
65 70 75
aga gcc ctt gac ttt tct act cct ggc aga act acc gcc gcg gta gcc 291
Arg Ala Leu Asp Phe Ser Thr Pro Gly Arg Thr Thr Ala Ala Val Ala
80 85 90
ttt ttt gcc ttt atc ctt gac aaa tgg agt caa gaa acc cat ttc agc 339
Phe Phe Ala Phe Ile Leu Asp Lys Trp Ser Gln Glu Thr His Phe Ser
95 100 105 110
agg gat tac cgt ctg gac tgc tta gca gta gct ttg tgg aga aca tgg 387
Arg Asp Tyr Arg Leu Asp Cys Leu Ala Val Ala Leu Trp Arg Thr Trp
115 120 125
agg tgc cag cgc ctg aat gca atc tcc ggc tac ttg cca gta cag ccg 435
Arg Cys Gln Arg Leu Asn Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro
130 135 140
gta gac acg ctg agg atc ctg agt ctc cag tca ccc cag gaa cac caa 483
Val Asp Thr Leu Arg Ile Leu Ser Leu Gln Ser Pro Gln Glu His Gln
145 150 155
cgc cgc cag cag ccg cag cag gag cag cag caa gag gag gag gag gac 531
Arg Arg Gln Gln Pro Gln Gln Glu Gln Gln Gln Glu Glu Glu Glu Asp
160 165 170
cga gaa gag aac ccg aga gcc ggt ctg gac cct ccg gtg gcg gag gag 579
Arg Glu Glu Asn Pro Arg Ala Gly Leu Asp Pro Pro Val Ala Glu Glu
175 180 185 190
gag gag tagct 590
Glu Glu
<210> 153
<211> 192
<212> PRT
<213> Simian adenovirus 25.2
<400> 153
Met Glu Ile Trp Thr Val Leu Glu Asp Phe His Gln Thr Arg Gln Leu
1 5 10 15
Leu Glu Asn Ser Ser Glu Glu Val Ser Tyr Leu Trp Arg Phe Cys Phe
20 25 30
Gly Gly Ala Leu Ala Lys Leu Val Tyr Arg Ala Lys Gln Asp Tyr Lys
35 40 45
Asp Gln Phe Glu Asp Ile Leu Arg Glu Cys Pro Gly Ile Phe Asp Ser
50 55 60
Leu Asn Leu Gly His Gln Ser His Phe Asn Gln Ser Ile Leu Arg Ala
65 70 75 80
Leu Asp Phe Ser Thr Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe
85 90 95
Ala Phe Ile Leu Asp Lys Trp Ser Gln Glu Thr His Phe Ser Arg Asp
100 105 110
Tyr Arg Leu Asp Cys Leu Ala Val Ala Leu Trp Arg Thr Trp Arg Cys
115 120 125
Gln Arg Leu Asn Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro Val Asp
130 135 140
Thr Leu Arg Ile Leu Ser Leu Gln Ser Pro Gln Glu His Gln Arg Arg
145 150 155 160
Gln Gln Pro Gln Gln Glu Gln Gln Gln Glu Glu Glu Glu Asp Arg Glu
165 170 175
Glu Asn Pro Arg Ala Gly Leu Asp Pro Pro Val Ala Glu Glu Glu Glu
180 185 190
<210> 154
<211> 6410
<212> DNA
<213> Simian adenovirus 25.2
<220>
<221> CDS
<222> (10)..(579)
<223> label=22K
<220>
<221> CDS
<222> (1883)..(2509)
<223> label=E3\CR1\alpha
<220>
<221> CDS
<222> (6003)..(6407)
<223> label=E3\14.7K
<400> 154
tcccccagg atg ccc cga gga agc agc aag aag ctg aaa gtg gag ctg ccg 51
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro
1 5 10
ctg ccg ccg gag gat ttg gag gaa gac tgg gag agc agt cag gca gag 99
Leu Pro Pro Glu Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu
15 20 25 30
gag atg gaa gac tgg gac agc act cag gca gag gag gac agc ctg caa 147
Glu Met Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln
35 40 45
gac agt ctg gag gag gaa gac gag gtg gag gag gag gca gag gaa gaa 195
Asp Ser Leu Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu
50 55 60
gca gcc gcc gcc aga ccg tcg tcc tcg gcg gag aaa gca agc agc acg 243
Ala Ala Ala Ala Arg Pro Ser Ser Ser Ala Glu Lys Ala Ser Ser Thr
65 70 75
gat acc atc tcc gct ccg ggt cgg ggt cgc ggc ggc cgg gcc cac agt 291
Asp Thr Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser
80 85 90
aga tgg gac gag acc ggg cgc ttc cca aac ccc acc acc cag acc ggt 339
Arg Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Gly
95 100 105 110
aag aag gag cgg cag gga tac aag tcc tgg cgg ggg cac aaa aac gcc 387
Lys Lys Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly His Lys Asn Ala
115 120 125
atc gtc tcc tgc ttg caa gcc tgc ggg ggc aac atc tcc ttc acc cgg 435
Ile Val Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile Ser Phe Thr Arg
130 135 140
cgc tac ctg ctc ttc cac cgc ggg gtg aac ttc ccc cgc aac atc ttg 483
Arg Tyr Leu Leu Phe His Arg Gly Val Asn Phe Pro Arg Asn Ile Leu
145 150 155
cat tac tac cgt cac ctc cac agc ccc tac tac tgt ttc caa gaa gag 531
His Tyr Tyr Arg His Leu His Ser Pro Tyr Tyr Cys Phe Gln Glu Glu
160 165 170
gca gaa acc cag cag cag cag aaa acc agc agc agc agc agc agc agc 579
Ala Glu Thr Gln Gln Gln Gln Lys Thr Ser Ser Ser Ser Ser Ser Ser
175 180 185 190
tagaaaatcc acagcggcgg cggcaggtgg actgaggatc gcggcgaacg agccggcgca 639
gacccgggag ctgaggaacc ggatctttcc caccctctat gccatcttcc agcagagtcg 699
ggggcaggag caggaactga aagtcaagaa ccgttctctg cgctcgctca cccgcagttg 759
tctgtatcac aagagcgaag accaacttca gcgcactctc gaggacgccg aggctctctt 819
caacaagtac tgcgcgctca ctcttaaaga gtagcccgcg cccgcccaca cacggaaaaa 879
ggcgggaatt acgtcaccac ctgcgccctt cgcccgacca tcatcatgag caaagagatt 939
cccacgcctt acatgtggag ctaccagccc cagatgggcc tggccgccgg cgccgcccag 999
gactactcca cccgcatgaa ctggctcagt gccgggcccg cgatgatctc acgggtgaat 1059
gacatccgcg cccaccgaaa ccagatactc ctagaacagt cagcaatcac cgccacgccc 1119
cgccatcacc ttaatccgcg taattggccc gccgccctgg tgtaccagga aattccccag 1179
cccacgaccg tactacttcc gcgagacgcc caggccgaag tccagctgac taactcaggt 1239
gtccagctgg ccggcggcgc cgccctgtgt cgtcaccgcc ccgctcaggg tataaagcgg 1299
ctggtgatcc gaggcagagg cacacagctc aacgacgagg tggtgagctc ttcgctgggt 1359
ctgcgacctg acggagtctt ccaactcgcc ggatcgggga gatcttcctt cacgcctcgt 1419
caggccgtcc tgactttgga gagttcgtcc tcgcagcccc gctcgggtgg catcggcact 1479
ctccagttcg tggaggagtt cactccctcg gtctacttca accccttctc cggctccccc 1539
ggccactacc cggacgagtt catcccgaac ttcgacgcca tcagcgagtc ggtggacggc 1599
tacgattgaa tgtcccatgg tggcgcggct gacctagctc ggcttcgaca cctggaccac 1659
tgccgccgct tccgctgctt cgctcgggat ctcgccgagt ttgcctactt tgagctgccc 1719
gaggagcacc ctcagggccc ggcccacgga gtgcggatcg tcgtcgaagg gggcctcgac 1779
tcccacctgc ttcggatctt cagccagcga ccgatcctgg tcgagcgcga gcaaggacag 1839
acccttctga ccctgtactg catctgcaac cgccccggcc tgc atg aaa gtc ttt 1894
Met Lys Val Phe
gtt gtc tgc tgt gta ctg agt ata ata aaa gct gaa atc agc gac tac 1942
Val Val Cys Cys Val Leu Ser Ile Ile Lys Ala Glu Ile Ser Asp Tyr
195 200 205 210
tcc gga ctc gat tgt ggt gtt cct gct atc aac cgg tcc ctg ttc ttc 1990
Ser Gly Leu Asp Cys Gly Val Pro Ala Ile Asn Arg Ser Leu Phe Phe
215 220 225
acc ggg aac gag acc gag ctt cag ctc cag tgt aag ccc cac aag aag 2038
Thr Gly Asn Glu Thr Glu Leu Gln Leu Gln Cys Lys Pro His Lys Lys
230 235 240
tac ctc acc tgg ctg ttc cag ggc tct ccg atc gcc gtt gtc aac cac 2086
Tyr Leu Thr Trp Leu Phe Gln Gly Ser Pro Ile Ala Val Val Asn His
245 250 255
tgc gac aac gac gga gtc ctg ctg agc ggc cct gcc aac ctt act ttt 2134
Cys Asp Asn Asp Gly Val Leu Leu Ser Gly Pro Ala Asn Leu Thr Phe
260 265 270
tcc acc cgc aga agc aag ctc cag ctc ttc caa ccc ttc ctc ccc ggg 2182
Ser Thr Arg Arg Ser Lys Leu Gln Leu Phe Gln Pro Phe Leu Pro Gly
275 280 285 290
acc tat cag tgc gtc tcg gga ccc tgc cat cac acc ttc cac ctg atc 2230
Thr Tyr Gln Cys Val Ser Gly Pro Cys His His Thr Phe His Leu Ile
295 300 305
ccg aat acc aca gcg tcg ctc ccc gct act aac aac caa tct acc cac 2278
Pro Asn Thr Thr Ala Ser Leu Pro Ala Thr Asn Asn Gln Ser Thr His
310 315 320
caa cgc cac cgt cgc gac ctt tcc tct gaa tct aat act acc acc cac 2326
Gln Arg His Arg Arg Asp Leu Ser Ser Glu Ser Asn Thr Thr Thr His
325 330 335
acc gga ggt gag ctc cga ggt cga cca acc tct ggg att tac tac ggc 2374
Thr Gly Gly Glu Leu Arg Gly Arg Pro Thr Ser Gly Ile Tyr Tyr Gly
340 345 350
ccc tgg gag gtg gtg ggg tta ata gcg cta ggc cta gtt gtg ggt ggg 2422
Pro Trp Glu Val Val Gly Leu Ile Ala Leu Gly Leu Val Val Gly Gly
355 360 365 370
ctt ttg gct ctc tgc tac cta tac ctc cct tgc tgt tcg tac tta gtg 2470
Leu Leu Ala Leu Cys Tyr Leu Tyr Leu Pro Cys Cys Ser Tyr Leu Val
375 380 385
gtg ctg tgt tgc tgg ttt aag aaa tgg gga aga tca ccc tagtgagctg 2519
Val Leu Cys Cys Trp Phe Lys Lys Trp Gly Arg Ser Pro
390 395
cggtgtgccg gtggcggtgg tggtgctttc gattgtggga ctgggcggcg cggctgtagt 2579
gaaggaggag aaggccgatt cctgcttgca tttcaatccc gacaaatgcc agctgagttt 2639
tcagcccgat ggcaatcggt gcacggtgct gatcaagtgc ggatgggaat gcgagaacgt 2699
gagaatcgag tacaataaca agactcggaa caatactctc gcgtccgtgt ggcagcccgg 2759
ggaccccgag tggtacaccg tctctgtccc cggtgctgac ggctccccgc gcaccgtgaa 2819
taatactttc atttttgcac acatgtgcga cacggtcatg tggatgagca agcagtacga 2879
tatgtggccc cccacgaagg agaacatcgt ggtcttctcc atcgcttaca gcctgtgcac 2939
ggcgctaatc accgctatcg tgtgcctgag cattcacatg ctcatcgcta ttcgccccag 2999
aaataatgcc gagaaagaga aacagccata acacgttttt ttacacacct ttttcagacc 3059
atggcctctg ttactgccct aattattttt ttgggccttg tgggcactag cagcactttt 3119
gatcataaaa acttaactgt ttttgttggt tctgatgtta cactacccgg gcatcaatcg 3179
caccagaggg tttcatggta tcattttaat aaacagaaca cagcttatac actttgcaaa 3239
ggtcatcagc aagccacata tcgcagtggt ctttattaca gatgcaataa caataacctc 3299
acactactct cagttaatgc aaattattct ggcacatact atggaaccaa ttttaacaca 3359
aaacaggaca cttactatag tgtcaaagta ttgaatccaa cctctcctag aactaccact 3419
aagcctaccg ctaccaccac tactactgca aagcccacta aacctaaaac taccaagaaa 3479
accactgtga agactacaac aactagaacc accacaacta cagagaccac caccagcaca 3539
cttgctgcca ctacacacac acacattgag ctaaccttac agaccactaa tgatttgatc 3599
gccctgttgc aaaaggggga taacagcacc acttccgatg aggaaatacc caaatccatg 3659
attggcatta ttgttgctgt agtggtgtgc atgttgatca tcgccttgtg catggtgtac 3719
tatgccttct gctacagaaa gcacagactg aacgacaagc tggaacactt actaagtgtt 3779
gaattttaat tttttagaac catgaagatc ctaggccttt tagttttttc tatcattacc 3839
tctgctcttt gtgaatcggt ggataaagat gttactatta ccactggttc taactataca 3899
ctgaaagggc caccctcagg tatgctttcg tggtattgct attttggaac tgactctgaa 3959
caaactgagc tttgcaatgc aatgagaggc caaatgccaa cctcaaaaat taaacataaa 4019
tgcaatggta ctgatttgat actcctcaat gtcacgaaag catatgctgg cagttacacc 4079
tgccctggag atgatgctga cagtatgatt ttttacaaag taactgttgt tgatcccact 4139
actccaccac ccaccaccac aactactcat accacacaca cagaacaaac accagaggca 4199
gcaggagagt tagccttgca ggttcaggaa gattccctta tggctaatac ccctacaccc 4259
gatcatcggt gtccggggtt gctcgtcagc ggcattgtcg gtgtgctttc gggattagca 4319
gtcataatca tctgcatgtt catttttgct tgctgctata gaaggcttta ccgacaaaaa 4379
tcagacccac tgctgaacct ctatgtttaa ttttttccag agccatgaag gcagttagag 4439
ttctagtttt ttgttctttg attggcattg tttttagtgc tgggtttttg aaaaatctta 4499
ccatttatga aggtgagaat gccactctag tgggcatcag tggtcaaaat gtcagctggc 4559
taaaatacca tctagatagg tggaaagaca tttgcgattg gaatgtcact gtgtatacat 4619
gtaatggagt taacctcacc attactaatg ccacccaaga tcaaaatggt aggtttaagg 4679
gtcagagttt cactagaaat aatgggtatg aatcccataa catgtttatc tatgacgtca 4739
ctgtcatcag aaatgagact gccaccacca ccacacagat gcaaaccaca cagacgacca 4799
catacagtac atcaaatcag cctaccacca ctacagcagc agaggttgcc agctcgtctg 4859
gtgtcaaagt ggcatttttg ttgttgcccc catctagcag tcccactgct agtaccaatg 4919
agcagactac tgaatttttg tccactgtcg agagccacac cacagctacc tcgagtgcct 4979
tctctagcac cgccaatctc tcctcgcttt cctctacacc aatcagttcc gctactacta 5039
ctcctagccc cgctcctctt cccactcccc tgaagcaaac tgaggacagc ggcatgcaat 5099
ggcagatcac cctgctcatt gtgatcgggt tggtcatcct ggccgtgttg ctctactaca 5159
tcttctgccg ccgcattccc aacgcgcacc gcaagccggt ctacaagccc atcattgtcg 5219
ggcagccaga gccgcttcag gtggaagggg gtctaaggaa tcttctcttc tcttttacag 5279
tatggtgatt gaactatgat tcctagacaa ttcttgatca ctattcttat ctgcctcctc 5339
caagtctgtg ccaccctcgc tctggtggcc aacgccagtc cagactgtat tgggcccttc 5399
gcctcctacg tgctctttgc cttcatcacc tgcatctgct gctgtagcat agtctgcctg 5459
cttatcacct tcttccagtt cattgactgg atctttgtgc gcatcgccta cctgcgccac 5519
cacccccagt accgcgacca gcgagtggcg cgactgctca ggctcctctg ataagcatgc 5579
gggctctgct acttctcgcg cttctgctgt tagtgctccc ccgtcccgtc gacccccggt 5639
cccccactca gtcccccgag gaggtccgca aatgcaaatt ccaagaaccc tggaaattcc 5699
tcaaatgcta ccgccaaaaa tcagacatgc atcccagctg gatcatgatc attgggatcg 5759
tgaacattct ggcctgcacc ctcatctcct ttgtgattta cccctgcttt gactttggtt 5819
ggaactcgcc agaggcgctc tatctcccgc ctgaacctga cacaccacca cagcagcaac 5879
ctcaggcaca cgcactacca ccaccacagc ctaggccaca atacatgccc atattagact 5939
atgaggccga gccacagcga cccatgctcc ccgctattag ttacttcaat ctaaccggcg 5999
gag atg act gac cca ctg gcc aac aac aac gtc aac gac ctt ctc ctg 6047
Met Thr Asp Pro Leu Ala Asn Asn Asn Val Asn Asp Leu Leu Leu
400 405 410
gac atg gac ggc cgc gcc tcg gag cag cga ctc gcc caa ctt cgc att 6095
Asp Met Asp Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln Leu Arg Ile
415 420 425 430
cgc cag cag cag gag aga gcc gtc aag gag ctg cag gac ggc ata gcc 6143
Arg Gln Gln Gln Glu Arg Ala Val Lys Glu Leu Gln Asp Gly Ile Ala
435 440 445
atc cac cag tgc aag aaa ggc atc ttc tgc ctg gtg aaa cag gcc aag 6191
Ile His Gln Cys Lys Lys Gly Ile Phe Cys Leu Val Lys Gln Ala Lys
450 455 460
atc tcc tac gag gtc acc cag acc gac cat cgc ctc tcc tac gag ctc 6239
Ile Ser Tyr Glu Val Thr Gln Thr Asp His Arg Leu Ser Tyr Glu Leu
465 470 475
ctg cag cag cgc cag aag ttc acc tgc ctg gtc gga gtc aac ccc atc 6287
Leu Gln Gln Arg Gln Lys Phe Thr Cys Leu Val Gly Val Asn Pro Ile
480 485 490
gtc atc acc cag cag tcg ggc gat acc aag ggg tgc atc cac tgc tcc 6335
Val Ile Thr Gln Gln Ser Gly Asp Thr Lys Gly Cys Ile His Cys Ser
495 500 505 510
tgc gac tcc ccc gac tgc gtc cac act ctg atc aag acc ctc tgc ggc 6383
Cys Asp Ser Pro Asp Cys Val His Thr Leu Ile Lys Thr Leu Cys Gly
515 520 525
ctc cgc gac ctc ctc ccc atg aac taa 6410
Leu Arg Asp Leu Leu Pro Met Asn
530
<210> 155
<211> 190
<212> PRT
<213> Simian adenovirus 25.2
<400> 155
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro Leu Pro
1 5 10 15
Pro Glu Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu Glu Met
20 25 30
Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp Ser
35 40 45
Leu Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu Ala Ala
50 55 60
Ala Ala Arg Pro Ser Ser Ser Ala Glu Lys Ala Ser Ser Thr Asp Thr
65 70 75 80
Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser Arg Trp
85 90 95
Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Gly Lys Lys
100 105 110
Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly His Lys Asn Ala Ile Val
115 120 125
Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile Ser Phe Thr Arg Arg Tyr
130 135 140
Leu Leu Phe His Arg Gly Val Asn Phe Pro Arg Asn Ile Leu His Tyr
145 150 155 160
Tyr Arg His Leu His Ser Pro Tyr Tyr Cys Phe Gln Glu Glu Ala Glu
165 170 175
Thr Gln Gln Gln Gln Lys Thr Ser Ser Ser Ser Ser Ser Ser
180 185 190
<210> 156
<211> 209
<212> PRT
<213> Simian adenovirus 25.2
<400> 156
Met Lys Val Phe Val Val Cys Cys Val Leu Ser Ile Ile Lys Ala Glu
1 5 10 15
Ile Ser Asp Tyr Ser Gly Leu Asp Cys Gly Val Pro Ala Ile Asn Arg
20 25 30
Ser Leu Phe Phe Thr Gly Asn Glu Thr Glu Leu Gln Leu Gln Cys Lys
35 40 45
Pro His Lys Lys Tyr Leu Thr Trp Leu Phe Gln Gly Ser Pro Ile Ala
50 55 60
Val Val Asn His Cys Asp Asn Asp Gly Val Leu Leu Ser Gly Pro Ala
65 70 75 80
Asn Leu Thr Phe Ser Thr Arg Arg Ser Lys Leu Gln Leu Phe Gln Pro
85 90 95
Phe Leu Pro Gly Thr Tyr Gln Cys Val Ser Gly Pro Cys His His Thr
100 105 110
Phe His Leu Ile Pro Asn Thr Thr Ala Ser Leu Pro Ala Thr Asn Asn
115 120 125
Gln Ser Thr His Gln Arg His Arg Arg Asp Leu Ser Ser Glu Ser Asn
130 135 140
Thr Thr Thr His Thr Gly Gly Glu Leu Arg Gly Arg Pro Thr Ser Gly
145 150 155 160
Ile Tyr Tyr Gly Pro Trp Glu Val Val Gly Leu Ile Ala Leu Gly Leu
165 170 175
Val Val Gly Gly Leu Leu Ala Leu Cys Tyr Leu Tyr Leu Pro Cys Cys
180 185 190
Ser Tyr Leu Val Val Leu Cys Cys Trp Phe Lys Lys Trp Gly Arg Ser
195 200 205
Pro
<210> 157
<211> 135
<212> PRT
<213> Simian adenovirus 25.2
<400> 157
Met Thr Asp Pro Leu Ala Asn Asn Asn Val Asn Asp Leu Leu Leu Asp
1 5 10 15
Met Asp Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln Leu Arg Ile Arg
20 25 30
Gln Gln Gln Glu Arg Ala Val Lys Glu Leu Gln Asp Gly Ile Ala Ile
35 40 45
His Gln Cys Lys Lys Gly Ile Phe Cys Leu Val Lys Gln Ala Lys Ile
50 55 60
Ser Tyr Glu Val Thr Gln Thr Asp His Arg Leu Ser Tyr Glu Leu Leu
65 70 75 80
Gln Gln Arg Gln Lys Phe Thr Cys Leu Val Gly Val Asn Pro Ile Val
85 90 95
Ile Thr Gln Gln Ser Gly Asp Thr Lys Gly Cys Ile His Cys Ser Cys
100 105 110
Asp Ser Pro Asp Cys Val His Thr Leu Ile Lys Thr Leu Cys Gly Leu
115 120 125
Arg Asp Leu Leu Pro Met Asn
130 135
<210> 158
<211> 870
<212> DNA
<213> Simian adenovirus 25.2
<220>
<221> CDS
<222> (6)..(573)
<223> label=Ela
<220>
<221> CDS
<222> (659)..(864)
<223> label=Ela
<400> 158
gaaag atg agg cac ctg aga gac ctg ccc ggt aat gtt ttc ctg gct act 50
Met Arg His Leu Arg Asp Leu Pro Gly Asn Val Phe Leu Ala Thr
1 5 10 15
ggg aac gag att ctg gaa ctg gtg gtg gac gcc atg atg ggt gac gac 98
Gly Asn Glu Ile Leu Glu Leu Val Val Asp Ala Met Met Gly Asp Asp
20 25 30
cct ccg gag ccc cct acc cca ttt gag gcg cct tcg ctg tac gat ttg 146
Pro Pro Glu Pro Pro Thr Pro Phe Glu Ala Pro Ser Leu Tyr Asp Leu
35 40 45
tat gat ctg gag gtg gat gtg ccc gag aac gac ccc aac gag gag gcg 194
Tyr Asp Leu Glu Val Asp Val Pro Glu Asn Asp Pro Asn Glu Glu Ala
50 55 60
gtg aat gat ttg ttt agc gat gcc gcg ctg ctg gct gcc gag cag gct 242
Val Asn Asp Leu Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Gln Ala
65 70 75
aat acg gac tct ggc tca gac agc gat tcc tct ctc cat acc ccg aga 290
Asn Thr Asp Ser Gly Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg
80 85 90 95
ccc ggc aga ggt gag aaa aag atc ccc gag ctt aaa ggg gaa gag ctc 338
Pro Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu Leu
100 105 110
gac ctg cgc tgc tat gag gaa tgc ttg cct ccg agc gat gat gag gag 386
Asp Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu Glu
115 120 125
gac gag gag gcg att cga gct gca gcg aac cag gga gtg aaa gcg gcg 434
Asp Glu Glu Ala Ile Arg Ala Ala Ala Asn Gln Gly Val Lys Ala Ala
130 135 140
ggc gag ggc ttt agc ctg gac tgt cct act ctg ccc gga cac ggc tgt 482
Gly Glu Gly Phe Ser Leu Asp Cys Pro Thr Leu Pro Gly His Gly Cys
145 150 155
aag tct tgt gaa ttt cat cgc atg aat act gga gat aag aat gtg atg 530
Lys Ser Cys Glu Phe His Arg Met Asn Thr Gly Asp Lys Asn Val Met
160 165 170 175
tgt gcc ctg tgc tat atg aga gct tac aac cat tgt gtt tac a 573
Cys Ala Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr
180 185
gtaagtgtga ttaactttag ttgggaaggc agagggtgac tgggtgctga ctggtttatt 633
tatgtatatg tttttttatg tgtag gt ccc gtc tct gac gta gat gag acc 684
Ser Pro Val Ser Asp Val Asp Glu Thr
195
ccc act tca gag tgc att tca tca ccc cca gaa att ggc gag gaa ccg 732
Pro Thr Ser Glu Cys Ile Ser Ser Pro Pro Glu Ile Gly Glu Glu Pro
200 205 210
ccc gaa gat att att cat aga cca gtt gca gtg aga gtc acc ggg cgg 780
Pro Glu Asp Ile Ile His Arg Pro Val Ala Val Arg Val Thr Gly Arg
215 220 225 230
aga gca gct gtg gag agt ttg gat gac ttg cta cag ggt ggg gat gaa 828
Arg Ala Ala Val Glu Ser Leu Asp Asp Leu Leu Gln Gly Gly Asp Glu
235 240 245
cct ttg gac ttg tgt acc cgg aaa cgc ccc agg cac taagtg 870
Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro Arg His
250 255
<210> 159
<211> 258
<212> PRT
<213> Simian adenovirus 25.2
<400> 159
Met Arg His Leu Arg Asp Leu Pro Gly Asn Val Phe Leu Ala Thr Gly
1 5 10 15
Asn Glu Ile Leu Glu Leu Val Val Asp Ala Met Met Gly Asp Asp Pro
20 25 30
Pro Glu Pro Pro Thr Pro Phe Glu Ala Pro Ser Leu Tyr Asp Leu Tyr
35 40 45
Asp Leu Glu Val Asp Val Pro Glu Asn Asp Pro Asn Glu Glu Ala Val
50 55 60
Asn Asp Leu Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Gln Ala Asn
65 70 75 80
Thr Asp Ser Gly Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg Pro
85 90 95
Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu Leu Asp
100 105 110
Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu Glu Asp
115 120 125
Glu Glu Ala Ile Arg Ala Ala Ala Asn Gln Gly Val Lys Ala Ala Gly
130 135 140
Glu Gly Phe Ser Leu Asp Cys Pro Thr Leu Pro Gly His Gly Cys Lys
145 150 155 160
Ser Cys Glu Phe His Arg Met Asn Thr Gly Asp Lys Asn Val Met Cys
165 170 175
Ala Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr Ser Pro Val
180 185 190
Ser Asp Val Asp Glu Thr Pro Thr Ser Glu Cys Ile Ser Ser Pro Pro
195 200 205
Glu Ile Gly Glu Glu Pro Pro Glu Asp Ile Ile His Arg Pro Val Ala
210 215 220
Val Arg Val Thr Gly Arg Arg Ala Ala Val Glu Ser Leu Asp Asp Leu
225 230 235 240
Leu Gln Gly Gly Asp Glu Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro
245 250 255
Arg His
<210> 160
<211> 860
<212> DNA
<213> Simian adenovirus 25.2
<220>
<221> CDS
<222> (10)..(337)
<223> label=33K
<220>
<221> CDS
<222> (507)..(850)
<223> label=33K
<400> 160
tcccccagg atg ccc cga gga agc agc aag aag ctg aaa gtg gag ctg ccg 51
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro
1 5 10
ctg ccg ccg gag gat ttg gag gaa gac tgg gag agc agt cag gca gag 99
Leu Pro Pro Glu Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu
15 20 25 30
gag atg gaa gac tgg gac agc act cag gca gag gag gac agc ctg caa 147
Glu Met Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln
35 40 45
gac agt ctg gag gag gaa gac gag gtg gag gag gag gca gag gaa gaa 195
Asp Ser Leu Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu
50 55 60
gca gcc gcc gcc aga ccg tcg tcc tcg gcg gag aaa gca agc agc acg 243
Ala Ala Ala Ala Arg Pro Ser Ser Ser Ala Glu Lys Ala Ser Ser Thr
65 70 75
gat acc atc tcc gct ccg ggt cgg ggt cgc ggc ggc cgg gcc cac agt 291
Asp Thr Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser
80 85 90
aga tgg gac gag acc ggg cgc ttc cca aac ccc acc acc cag acc g 337
Arg Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr
95 100 105
gtaagaagga gcggcaggga tacaagtcct ggcgggggca caaaaacgcc atcgtctcct 397
gcttgcaagc ctgcgggggc aacatctcct tcacccggcg ctacctgctc ttccaccgcg 457
gggtgaactt cccccgcaac atcttgcatt actaccgtca cctccacag cc cct act 514
Ala Pro Thr
act gtt tcc aag aag agg cag aaa ccc agc agc agc aga aaa cca gca 562
Thr Val Ser Lys Lys Arg Gln Lys Pro Ser Ser Ser Arg Lys Pro Ala
115 120 125
gca gca gca gca gca gct aga aaa tcc aca gcg gcg gcg gca ggt gga 610
Ala Ala Ala Ala Ala Ala Arg Lys Ser Thr Ala Ala Ala Ala Gly Gly
130 135 140
ctg agg atc gcg gcg aac gag ccg gcg cag acc cgg gag ctg agg aac 658
Leu Arg Ile Ala Ala Asn Glu Pro Ala Gln Thr Arg Glu Leu Arg Asn
145 150 155 160
cgg atc ttt ccc acc ctc tat gcc atc ttc cag cag agt cgg ggg cag 706
Arg Ile Phe Pro Thr Leu Tyr Ala Ile Phe Gln Gln Ser Arg Gly Gln
165 170 175
gag cag gaa ctg aaa gtc aag aac cgt tct ctg cgc tcg ctc acc cgc 754
Glu Gln Glu Leu Lys Val Lys Asn Arg Ser Leu Arg Ser Leu Thr Arg
180 185 190
agt tgt ctg tat cac aag agc gaa gac caa ctt cag cgc act ctc gag 802
Ser Cys Leu Tyr His Lys Ser Glu Asp Gln Leu Gln Arg Thr Leu Glu
195 200 205
gac gcc gag gct ctc ttc aac aag tac tgc gcg ctc act ctt aaa gag 850
Asp Ala Glu Ala Leu Phe Asn Lys Tyr Cys Ala Leu Thr Leu Lys Glu
210 215 220
tagcccgcgc 860
<210> 161
<211> 224
<212> PRT
<213> Simian adenovirus 25.2
<400> 161
Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro Leu Pro
1 5 10 15
Pro Glu Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu Glu Met
20 25 30
Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp Ser
35 40 45
Leu Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu Ala Ala
50 55 60
Ala Ala Arg Pro Ser Ser Ser Ala Glu Lys Ala Ser Ser Thr Asp Thr
65 70 75 80
Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser Arg Trp
85 90 95
Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Ala Pro Thr
100 105 110
Thr Val Ser Lys Lys Arg Gln Lys Pro Ser Ser Ser Arg Lys Pro Ala
115 120 125
Ala Ala Ala Ala Ala Ala Arg Lys Ser Thr Ala Ala Ala Ala Gly Gly
130 135 140
Leu Arg Ile Ala Ala Asn Glu Pro Ala Gln Thr Arg Glu Leu Arg Asn
145 150 155 160
Arg Ile Phe Pro Thr Leu Tyr Ala Ile Phe Gln Gln Ser Arg Gly Gln
165 170 175
Glu Gln Glu Leu Lys Val Lys Asn Arg Ser Leu Arg Ser Leu Thr Arg
180 185 190
Ser Cys Leu Tyr His Lys Ser Glu Asp Gln Leu Gln Arg Thr Leu Glu
195 200 205
Asp Ala Glu Ala Leu Phe Asn Lys Tyr Cys Ala Leu Thr Leu Lys Glu
210 215 220
<210> 162
<211> 36628
<212> DNA
<213> Simian adenovirus 26
<220>
<221> repeat_region
<222> (1)..(131)
<223> label=ITR
<220>
<221> CDS
<222> (1908)..(3404)
<223> label=Elb\55K
<220>
<221> CDS
<222> (3492)..(3917)
<223> label=pIX
<220>
<221> misc_feature
<222> (3982)..(5603)
<223> complement (3982...5312, 5591...5603) label=IVa2
<220>
<221> misc_feature
<222> (5085)..(13842)
<223> complement (5085 ...8663, 13834 ...13842) label=pol
<220>
<221> misc_feature
<222> (8465)..(13842)
<223> complement (8465...10399, 13834...13842) label=pTP
<220>
<221> CDS
<222> (10612)..(12003)
<223> label=52K
<220>
<221> CDS
<222> (12030)..(13805)
<223> label=pIIIa
<220>
<221> CDS
<222> (13884)..(15521)
<223> label=penton
<220>
<221> CDS
<222> (15528)..(16106)
<223> label=pVII
<220>
<221> CDS
<222> (16151)..(17167)
<223> label=V
<220>
<221> CDS
<222> (17195)..(17428)
<223> label=pX
<220>
<221> CDS
<222> (17461)..(18234)
<223> label=PVI
<220>
<221> CDS
<222> (18344)..(21154)
<223> label=hexon
<220>
<221> CDS
<222> (21176)..(21802)
<223> label=protease
<220>
<221> misc_feature
<222> (21885)..(23423)
<223> complement label=DBP
<220>
<221> CDS
<222> (23449)..(25854)
<223> label=100K
<220>
<221> CDS
<222> (26482)..(27162)
<223> label=pVIII
<220>
<221> CDS
<222> (27166)..(27483)
<223> label=E3\12.5K
<220>
<221> CDS
<222> (28057)..(28584)
<223> label=E3\gp19K
<220>
<221> CDS
<222> (28618)..(29355)
<223> label=E3\CR1-beta
<220>
<221> CDS
<222> (29371)..(29988)
<223> label=E3\CR1-gamma
<220>
<221> CDS
<222> (30011)..(30883)
<223> label=E3\CR1-delta
<220>
<221> CDS
<222> (30895)..(31167)
<223> label=E3\RID-alpha
<220>
<221> CDS
<222> (31170)..(31613)
<223> label=E3\RID-beta
<220>
<221> CDS
<222> (32264)..(33538)
<223> label=fiber
<220>
<221> misc_feature
<222> (33635)..(34779)
<223> complement (33635...33883, 34630...34779) label=E4\orf6/7
<220>
<221> misc_feature
<222> (33883)..(34779)
<223> complement label=E4\orf6
<220>
<221> misc_feature
<222> (34685)..(35050)
<223> complement label=E4\orf4
<220>
<221> misc_feature
<222> (35062)..(35412)
<223> complement label=E4\orf3
<220>
<221> misc_feature
<222> (35412)..(35798)
<223> complement label=E4\orf2
<220>
<221> misc_feature
<222> (35851)..(36222)
<223> complement label=E4\orf1
<220>
<221> repeat_region
<222> (36498)..(36628)
<223> complement label=ITR
<400> 162
catcatcaat aatatacctc aaactttttg tgcgcgttaa tatgcaaatg aggcgtttga 60
atttggggag gaagggttgt gattggctgc gggaagggcg accgttaggg gcggggcgag 120
tgacgttttt atgacgtggc cgtgaggagg agctagcttg caagttcttg tgggaaaagt 180
gacgttaaac gaggtgtggt ttgaacacgg aaatacttaa ttttcccgcg ctctttgaca 240
ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300
aatgaggaag tgaaaatctg agtaatttcg cgtttatgac agggaggagt atttgccgag 360
ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtgtttt tcacctaaat 420
ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggt gtcagctgat cgccagggta 480
tttaaacctg cgctctccag tcaagaggcc actcttgagt gccagcgaga agagttttct 540
cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600
atgagaaaat catcatcgct tccgggaacg agattctgga actggtggta aatgccatga 660
tgggcgacga ccctccggag ccccccaccc catttgaggc accttcgctg cacgatttgt 720
atgatctgga ggtggatgtg cccgatgacg accccaacga ggaggcggta aatgatttat 780
ttagcgatgc cgcgctgcta gctgccgagg aggcttcgag ccctagctca gacagcgact 840
cttcactgca tacccctaga cctggcagag gtgagaaaaa gatccccgag cttaaagggg 900
aagagatgga cttgcgctgc tatgaggaat gcttgccccc gagcgatgat gaggacgagc 960
aggcgatcca gaacgcagcg agccagggag tgcaagccgc cagcgagagc tttgcgctgg 1020
actgcccgcc tctgcccgga cacggctgta agtcttgtga atttcatcgc atgaatactg 1080
gagataaagc tgtgttgtgt gcactttgct atatgagagc ttacaaccat tgtgtttaca 1140
gtaagtgtga ttaagttgaa ctttagaggg aggcagagag cagggtgact gggcgatgac 1200
tggtttattt atgtatatat atgttcttta tataggtccc gtctctgacg cagatgatga 1260
gacccccact acagagtcca cttcgtcacc cccagaaatt ggcacatctc cacctgagaa 1320
tattgttaga ccagttcctg ttagagccac tgggaggaga gcagctgtgg aatgtttgga 1380
tgacttgcta cagggtgggg atgaaccttt ggacttgtgt acccggaaac gccccaggca 1440
ctaagtgcca cacatgtgtg tttacttgag gtgatgtcag tatttatagg gtgtggagtg 1500
caataaaaaa tgtgttgact ttaagtgcgt ggtttatgac tcaggggtgg ggactgtggg 1560
tatataagca ggtgcagacc tgtgtggtta gctcagagcg gcatggagat ttggacggtc 1620
ttggaagact ttcacaagac tagacagctg ctagagaacg cctcgaacgg agtctcttac 1680
ctgtggagat tctgcttcgg tggcgaccta gctaggctag tctatagggc caaacaggat 1740
tatagtgaac aatttgaggt tattttgaga gagtgttctg gtctttttga cgctcttaac 1800
ttgggccatc agtctcactt taaccagagg atttcgagag cccttgactt tactactcct 1860
ggcagaacca ctgcggcagt agcctttttt gcttttcttc ttgacaa atg gag tca 1916
Met Glu Ser
1
aga aac cca ttt cag cag gga tta cca gct gga ttt ctt agc agt agc 1964
Arg Asn Pro Phe Gln Gln Gly Leu Pro Ala Gly Phe Leu Ser Ser Ser
5 10 15
ttt gtg gag aac atg gaa gtg cca gcg cct gaa tgc aat ctc agg cta 2012
Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn Leu Arg Leu
20 25 30 35
ctt gcc ggt aca gcc gct aga cac tct gag gat cct gaa tct cca gga 2060
Leu Ala Gly Thr Ala Ala Arg His Ser Glu Asp Pro Glu Ser Pro Gly
40 45 50
gag tcc cag ggc aca cca acg tcg cca gca gca gca gcg gca gca gga 2108
Glu Ser Gln Gly Thr Pro Thr Ser Pro Ala Ala Ala Ala Ala Ala Gly
55 60 65
gga gga tca aga aga gaa ccc gag agc cgg cct gga ccc tcc gga gga 2156
Gly Gly Ser Arg Arg Glu Pro Glu Ser Arg Pro Gly Pro Ser Gly Gly
70 75 80
gga gga gta gct gac ctg ttt cct gaa ctg cgc cgg gtg ctg act agg 2204
Gly Gly Val Ala Asp Leu Phe Pro Glu Leu Arg Arg Val Leu Thr Arg
85 90 95
tct tcg agt ggt cgg gag agg ggg att aag cgg gag agg cat gat gag 2252
Ser Ser Ser Gly Arg Glu Arg Gly Ile Lys Arg Glu Arg His Asp Glu
100 105 110 115
act aat cac aga act gaa ctg act gtg ggt ctg atg agc cgc aag cgt 2300
Thr Asn His Arg Thr Glu Leu Thr Val Gly Leu Met Ser Arg Lys Arg
120 125 130
cca gaa aca gtg tgg tgg cat gag gtg cag tcg act ggc aca gat gag 2348
Pro Glu Thr Val Trp Trp His Glu Val Gln Ser Thr Gly Thr Asp Glu
135 140 145
gtg tca gtg atg cat gag agg ttt tcc cta gaa caa gtc aag act tgt 2396
Val Ser Val Met His Glu Arg Phe Ser Leu Glu Gln Val Lys Thr Cys
150 155 160
tgg tta gag cct gag gat gat tgg gag gta gcc atc agg aat tat gcc 2444
Trp Leu Glu Pro Glu Asp Asp Trp Glu Val Ala Ile Arg Asn Tyr Ala
165 170 175
aag ctg gct ctc agg cca gac aag aag tac aag att act aag ctg ata 2492
Lys Leu Ala Leu Arg Pro Asp Lys Lys Tyr Lys Ile Thr Lys Leu Ile
180 185 190 195
aat atc aga aat gcc tgc tac atc tca ggg aat ggg gct gaa gtg gag 2540
Asn Ile Arg Asn Ala Cys Tyr Ile Ser Gly Asn Gly Ala Glu Val Glu
200 205 210
atc tgt ctt cag gat aga gtg gct ttc aga tgc tgt atg atg aat atg 2588
Ile Cys Leu Gln Asp Arg Val Ala Phe Arg Cys Cys Met Met Asn Met
215 220 225
tac ccg gga gtg gtg ggc atg gat ggg gtc acc ttt atg aac atg agg 2636
Tyr Pro Gly Val Val Gly Met Asp Gly Val Thr Phe Met Asn Met Arg
230 235 240
ttc agg gga gat ggg tat aat ggt acg gtc ttt atg gcc aat acc aag 2684
Phe Arg Gly Asp Gly Tyr Asn Gly Thr Val Phe Met Ala Asn Thr Lys
245 250 255
ctg aca gtc cat ggc tgc tcc ttc ttt ggg ttt aat aac acc tgc att 2732
Leu Thr Val His Gly Cys Ser Phe Phe Gly Phe Asn Asn Thr Cys Ile
260 265 270 275
gag gcc tgg ggc cag gtt ggt gtg agg ggc tgt agt ttt tca gcc aac 2780
Glu Ala Trp Gly Gln Val Gly Val Arg Gly Cys Ser Phe Ser Ala Asn
280 285 290
tgg atg ggg gtc gtg ggc agg acc aag agt atg ctg tct gtg aag aaa 2828
Trp Met Gly Val Val Gly Arg Thr Lys Ser Met Leu Ser Val Lys Lys
295 300 305
tgc ttg ttc gag agg tgc cac ctg ggg gtg atg agc gag ggc gaa gcc 2876
Cys Leu Phe Glu Arg Cys His Leu Gly Val Met Ser Glu Gly Glu Ala
310 315 320
aga atc cgc cac tgc gcc tct acc gag acg ggc tgc ttt gtg ctg tgc 2924
Arg Ile Arg His Cys Ala Ser Thr Glu Thr Gly Cys Phe Val Leu Cys
325 330 335
aag ggc aat gct aag atc aag cat aat atg att tgt gga gcc tcg gac 2972
Lys Gly Asn Ala Lys Ile Lys His Asn Met Ile Cys Gly Ala Ser Asp
340 345 350 355
gag cgc ggc tac cag atg ctg acc tgc gcc ggg ggg aac agc cat atg 3020
Glu Arg Gly Tyr Gln Met Leu Thr Cys Ala Gly Gly Asn Ser His Met
360 365 370
ctg gcc acc gtg cat gtg gct tcc cat tcc cgc aag ccc tgg ccc gag 3068
Leu Ala Thr Val His Val Ala Ser His Ser Arg Lys Pro Trp Pro Glu
375 380 385
ttc gag cac aat gtc atg acc agg tgc aat atg cat ctg ggg tct cgc 3116
Phe Glu His Asn Val Met Thr Arg Cys Asn Met His Leu Gly Ser Arg
390 395 400
cga ggc atg ttc atg ccc tac cag tgc aac ctg aat tat gtg aag gtg 3164
Arg Gly Met Phe Met Pro Tyr Gln Cys Asn Leu Asn Tyr Val Lys Val
405 410 415
ctg ctg gag cct gat gcc atg tcc aga gtg agc ctg acg ggg gtg ttt 3212
Leu Leu Glu Pro Asp Ala Met Ser Arg Val Ser Leu Thr Gly Val Phe
420 425 430 435
gac atg aat gtg gag gtg tgg aag att ctg aga tat gat gaa tcc aag 3260
Asp Met Asn Val Glu Val Trp Lys Ile Leu Arg Tyr Asp Glu Ser Lys
440 445 450
acc agg tgc cga gcc tgc gag tgc gga ggg aag cat gcc agg ttc cag 3308
Thr Arg Cys Arg Ala Cys Glu Cys Gly Gly Lys His Ala Arg Phe Gln
455 460 465
ccc gtg tgt gtg gag gtg acg gag gac ctg cga ccc gat cat ttg gtg 3356
Pro Val Cys Val Glu Val Thr Glu Asp Leu Arg Pro Asp His Leu Val
470 475 480
ttg tcc tgc acc ggg acg gag ttc ggt tcc agc ggg gaa gaa tct gac 3404
Leu Ser Cys Thr Gly Thr Glu Phe Gly Ser Ser Gly Glu Glu Ser Asp
485 490 495
tagagtgagt agtgttctgg ggcgggggag ggcctgcatg aggggcagaa tgactgaaat 3464
ctgtgctttt ctgtgtgttg cagcagc atg agc gga agc ggc tcc ttt gag gga 3518
Met Ser Gly Ser Gly Ser Phe Glu Gly
500 505
ggg gta ttc agc cct tat ctg acg ggg cgt ctc ccc tcc tgg gcg gga 3566
Gly Val Phe Ser Pro Tyr Leu Thr Gly Arg Leu Pro Ser Trp Ala Gly
510 515 520
gtg cgt cag aat gtg atg gga tcc acg gtg gac ggc cgg ccc gtg cag 3614
Val Arg Gln Asn Val Met Gly Ser Thr Val Asp Gly Arg Pro Val Gln
525 530 535 540
ccc gcg aac tct tca acc ctg acc tat gca acc ctg agc tct tcg tcg 3662
Pro Ala Asn Ser Ser Thr Leu Thr Tyr Ala Thr Leu Ser Ser Ser Ser
545 550 555
gtg gac gca gct gcc gcc gca gct gct gca tct gcc gcc agc gcc gtg 3710
Val Asp Ala Ala Ala Ala Ala Ala Ala Ala Ser Ala Ala Ser Ala Val
560 565 570
cgc gga atg gcc atg ggc gcc ggc tac tac ggc act ctg gtg gcc aac 3758
Arg Gly Met Ala Met Gly Ala Gly Tyr Tyr Gly Thr Leu Val Ala Asn
575 580 585
tcg agt tcc acc aat aat ccc gcc agc ctg aac gag gag aag ctg ctg 3806
Ser Ser Ser Thr Asn Asn Pro Ala Ser Leu Asn Glu Glu Lys Leu Leu
590 595 600
ctg ctg atg gcc cag ctc gag gcc ttg acc cag cgc ctg ggc gag ctg 3854
Leu Leu Met Ala Gln Leu Glu Ala Leu Thr Gln Arg Leu Gly Glu Leu
605 610 615 620
acc cag cag gtg gct cag ctg cag gag cag acg cgg gcc gcg gtt gcc 3902
Thr Gln Gln Val Ala Gln Leu Gln Glu Gln Thr Arg Ala Ala Val Ala
625 630 635
acg gtg aaa tcc aaa taaaaaatga atcaataaat aaacggagac ggttgttgat 3957
Thr Val Lys Ser Lys
640
tttaacacag agtctgaatc tttatttgat ttttcgcgcg cggtaggccc tggaccaccg 4017
gtctcgatca ttgagcaccc ggtggatctt ttccaggacc cggtagaggt gggcttggat 4077
gttgaggtac atgggcatga gcccgtcccg ggggtggagg tagctccatt gcagggcctc 4137
atgctcgggg gtggtgttgt aaatcaccca gtcatagcag gggcgcaggg catggtgttg 4197
cacaatatct ttgaggagga gactgatggc cacgggcagc cctttggtgt aggtgtttac 4257
aaatctgttg agctgggagg gatgcatgcg gggggagatg aggtgcatct tggcctggat 4317
cttgagattg gcgatgttac cgcccagatc ccgcctgggg ttcatgttgt gcaggaccac 4377
cagcacggtg tatccggtgc acttggggaa tttatcatgc aacttggaag ggaaggcgtg 4437
aaagaatttg gcgacgccct tgtgcccgcc caggttttcc atgcactcat ccatgatgat 4497
ggcgatgggc ccgtgggcgg cggcctgggc aaagacgttt cgggggtcgg acacatcata 4557
gttgtggtcc tgggtgagat catcataggc cattttaatg aatttggggc ggagggtgcc 4617
ggactggggg acaaaggtac cctcgatccc gggggcgtag ttcccctcac agatctgcat 4677
ctcccaggct ttgagctcgg agggggggat catgtccacc tgcggggcga taaagaacac 4737
ggtttccggg gcgggggaga tgagctgggc cgaaagcaag ttccggagca gctgggactt 4797
gccgcagccg gtggggccgt agatgacccc gatgaccggc tgcaggtggt agttgaggga 4857
gagacagctg ccgtcctcgc ggaggagggg ggccacttcg ttcatcatct cgcgcacatg 4917
catgttctcg cgcaccagtt ccgccaggag gcgctctccc cccagggata ggagctcctg 4977
gagcgaggcg aagtttttca gcggcttgag tccgtcggcc atgggcattt tggagagggt 5037
ctgttgcaag agttccaagc ggtcccagag ctcggtgatg tgctctacgg catctcgatc 5097
cagcagacct cctcgtttcg cgggttggga cggctgcggg agtagggcac cagacgatgg 5157
gcgtccagcg cagccagggt ccggtccttc cagggccgca gcgtccgcgt cagggtggtc 5217
tccgtcacgg tgaaggggtg cgcgccgggc tgggcgcttg cgagggtgcg cttcaggctc 5277
atccggctgg tcgaaaaccg ctcccgatcg gcgccctgcg cgtcggccag gtagcaattg 5337
accatgagtt cgtagttgag cgcctcggcc gcgtggcctt tggcgcggag cttacctttg 5397
gaagtctgcc cgcaggcggg acagaggagg gacttgaggg cgtagagctt gggggcgagg 5457
aagacggact cgggggcgta ggcgtccgcg ccgcagtggg cgcagacggt ctcgcactcc 5517
acgagccagg tgaggtcggg ctggtcgggg tcaaaaacca gtttcccgcc gttctttttg 5577
atgcgtttct tacctttggt ctccatgagc tcgtgtcccc gctgggtgac aaagaggctg 5637
tccgtgtccc cgtagaccga ctttatgggc cggtcctcga gcggtgtgcc gcggtcctcc 5697
tcgtagagga accccgccca ctccgagacg aaagcccggg tccaggccag cacgaaggag 5757
gccacgtggg acgggtagcg gtcgttgtcc accagcgggt ccactttctc cagggtatgc 5817
aaacacatgt ccccctcgtc cacatccagg aaggtgattg gcttgtaagt gtaggccacg 5877
tgaccggggg tcccggccgg gggggtataa aagggggcgg gcccctgctc gtcctcactg 5937
tcttccggat cgctgtccag gagcgccagc tgttggggta ggtattccct ctcgaaggcg 5997
ggcatgacct cggcactcag gttgtcagtt tctagaaacg aggaggattt gatattgacg 6057
gtgccggcgg agatgccttt caagagcccc tcgtccatct ggtcagaaaa gacgatcttt 6117
ttgttgtcga gtttggtggc gaaggagccg tagagggcgt tggaaaggag cttggcgatg 6177
gagcgcatgg tctggttctt ttccttgtcg gcgcgctcct tggccgcgat gttgagctgc 6237
acgtactcgc gcgccacaca cttccattcg gggaagacgg tggtcagctc gtcgggcacg 6297
attctgacct gccagccccg attatgcagg gtgatgaggt ccacactggt ggccacctcg 6357
ccgcgcaggg gctcgttggt ccagcagagg cgcccgccct tgcgcgagca gaaggggggc 6417
agggggtcca gcatgacctc gtcggggggg tcggcatcga tggtgaagat gccgggcagg 6477
aggtcggggt caaagtagct gatggaagtg tccagatcgt ccagggcagc ttgccattcg 6537
cgcacggcca gcgcgcgctc gtagggactg aggggcgtgc cccagggcat ggggtgggtg 6597
agcgcggagg cgtacatgcc gcagatgtcg tagacgtaga ggggctcctc gaggatgccg 6657
atgtaggtgg ggtagcagcg ccccccgcgg atgctggcgc gcacgtagtc atacagctcg 6717
tgcgaggggg cgaggagccc cgggcccagg ttggtgcgac tgggcttttc ggcgcggtag 6777
acgatctggc ggaagatggc gtgcgagttg gaggagatgg tgggcctttg gaagatgttg 6837
aagtgggcgt gggggagacc gaccgagtcg cggaggaagt gggcgtagga gtcttgcagc 6897
ttggcgacga gctcggcggt gacgaggacg tccagagcgc agtagtcgag ggtctcctgg 6957
atgatgtcat acttgagctg gcccttttgt ttccacagct cgcggttgag aaggaactct 7017
tcgcggtcct tccagtactc ttcgaggggg aacccgtcct gatctgcacg gtaagagcct 7077
agcatgtaga actggttgac ggccttgtag gcgcagcagc ccttctccac ggggagggcg 7137
taggcctggg cggccttgcg cagggaggtg tgcgtgaggg cgaaagtgtc cctgaccatg 7197
accttgagga actggtgctt gaagtcgata tcgtcgcagc ccccctgctc ccagagctgg 7257
aagtccgtgc gctttttgta ggcggggttg ggcaaagcga aagtaacatc gttgaagagg 7317
atcttgcccg cgcggggcat aaagttgcga gtgatgcgga aaggctgggg cacctcggcc 7377
cggttgttga tgacctgggc ggcgagcacg atctcgtcga agccgttgat gttgtggccc 7437
acgatgtaga gttccactaa tcgcgggcgg cccttgacgt ggggcagttt cttgagctcc 7497
tcgtaggtga gctcgtcggg gtcgctgagg ccgtactgct cgagcgccca gtcggcgaga 7557
tgggggttgg cgcggaggaa ggaagtccag agatccacgg ccagggcggt ctgcaggcgg 7617
tcccggtact gacggaactg ctgcccgacg gccatttttt cgggggtgac gcagtagaag 7677
gtgcgggggt ccccgtgcca gcgatcccat ttgagctgga gggcgagatc gagggcgagc 7737
tcgacgagcc ggtcgtcccc ggagagtttc atgaccagca tgaaggggac gagctgcttg 7797
ccgaaggacc ccatccaggt gtaggtttcc acatcgtagg tgaggaagag cctttcggtg 7857
cgaggatgcg agccgatggg gaagaactgg atctcctgcc accaattgga ggaatggctg 7917
ttgatgtgat ggaagtagaa atgccgacgg cgcgccgaac actcgtgctt gtgtttatac 7977
aagcggccac agtgctcgca acgctgcacg ggatgcacgt gctgcacgag ctgtacctga 8037
gttcctttga cgaggaattt cagtgggaag tggagtcgtg gcgcctgcat ctcgtgctgt 8097
actacgtcgt ggtggtcggc ctggccctct tctgcctcga tggtggtcat gctgacgagc 8157
ccgcgcggga ggcaggtcca gacctcggcg cgagcgggtc ggagagcgag gacgagggcg 8217
cgcaggccgg agctgtccag ggtcctgaga cgctgcggag tcaggtcagt gggcagcggc 8277
ggcgcgcggt tgacttgcag gagtttttcc agggcgcgcg ggaggtccag atggtacttg 8337
atctccaccg cgccgttggt ggcgacgtcg atggcttgca gggtcccgtg cccctggggt 8397
gtgaccaccg tcccccgttt cttcttgggc ggctggggcg acgggggcgg tacttcttcc 8457
atggttagaa gcggcggcga ggacgcgcgc cgggcggcag aggcggctcg gggcccggag 8517
gcaggggcgg caggggcacg tcggcgccgc gcgcgggtag gttctggtac tgcgcccgga 8577
gaagactggc gtgagcgacg acgcgacggt tgacgtcctg gatctgacgc ctctgggtga 8637
aggccacggg acccgtgagt ttgaacctga aagagagttc gacagaatca atctcggtat 8697
cgttgacggc ggcctgccgc aggatctctt gcacgtcgcc cgagttgtcc tggtaggcga 8757
tctcggtcat gaactgctcg atctcctcct cctgaaggtc tccgcggccg gcgcgctcca 8817
cggtggccgc gaggtcgttg gagatgcggc ccatgagctg cgagaaggcg ttcatgcccg 8877
cctcgttcca gacgcggctg tagaccacga cgccctcggg atcgcgggcg cgcatgacca 8937
cctgggcgag gttgagctcc acgtggcgcg tgaagaccgc gtagttgcag aggcgctggt 8997
agaggtagtt gagcgtggtg gcgatgtgct cggtgacgaa gaaatacatg atccagcgac 9057
ggagcggcat ctcgctgacg tcgcccagcg cctccaaacg ttccatggcc tcgtaaaagt 9117
ccacggcgaa gttgaaaaac tgggagttgc gcgccgagac ggtcaactcc tcctccagaa 9177
gacggatgag ctcggcgatg gtggcgcgca cctcgcgctc gaaggccccc gggagttcct 9237
cctcttccat ctcttcttct acctcctcca ctaacatctc ttctacttcc tcctcaggcg 9297
gtggtggcgg gggagggggc ctgcgtcgcc ggcggcgcac gggcagacgg tcgatgaagc 9357
gctcgatggt ctcgccgcgc cggcgtcgca tggtctcggt gacggcgcgc ccgtcctcgc 9417
ggggccgcag cgtgaagacg ccgccgcgca tctccaggtg gccggggggg tccccgttgg 9477
gcagggagag ggcgctgacg atgcatctta tcaattgccc cgtagggact ccgcgcaagg 9537
acctgagcgt ctcgagatcc acgggatctg aaaaccgttg aacgaaggct tcgagccagt 9597
cgcagtcgca aggtaggctg agcacggttt cttccggcgg gtcatgttgg ttggagggag 9657
cggggcgggc gatgctgctg gtgatgaagt tgaaataggc ggttctgaga cggcggatgg 9717
tggcgaggag caccaggtct ttgggcccgg cttgctggat gcgcagacgg tcggccatgc 9777
cccaggcgtg gtcctgacac ctggccaggt ccttgtagta gtcctgcatg agccgctcca 9837
cgggcacctc ctcctcgccc gcgcggccgt gcatgcgcgt gagcccgaag ccgcgctggg 9897
gctggacgag cgccaggtcg gcgacgacgc gctcggcgag gatggcctgc tgaatctggg 9957
tgagggtggt ctggaagtcg tcaaagtcga cgaagcggtg gtaggctccg gtgttgatgg 10017
tgtaggagca gttggccatg acggaccagt tgacggtctg gtggcccgga cgcacgagct 10077
cgtggtactt gaggcgcgag taggcgcgcg tgtcgaagat gtagtcgttg caggtgcgca 10137
ccaggtactg gtagccgatg aggaagtgcg gcggcggctg gcggtagagc ggccatcgct 10197
cggtggcggg ggcgccgggc gcgaggtcct cgagcatggt gcggtggtag ccgtagatgt 10257
acctggacat ccaggtgatg ccggcggcgg tggtggaggc gcgcgggaac tcgcggacgc 10317
ggttccagat gttgcgcagc ggcaggaagt agttcatggt gggcacggtc tggcccgtga 10377
ggcgcgcgca gtcgtggatg ctctatacgg gcaaaaacga aagcggtcag cggctcgact 10437
ccgtggcctg gaggctaagc gaacgggttg ggctgcgcgt gtaccccggt tcgaatctcg 10497
aatcaggctg gagccgcagc taacgtggta ctggcactcc cgtctcgacc caagcctgca 10557
caaaacctcc aggatacgga ggcgggtcgt tttgcaactt tttttggagg ccgg atg 10614
Met
aaa cta gta agc gca gaa agc ggc cga ccg cga tgg ctc gct gcc gta 10662
Lys Leu Val Ser Ala Glu Ser Gly Arg Pro Arg Trp Leu Ala Ala Val
645 650 655
gtc tgg aga aga atc gcc agg gtt gcg ttg cgg tgt gcc ccg gtt cga 10710
Val Trp Arg Arg Ile Ala Arg Val Ala Leu Arg Cys Ala Pro Val Arg
660 665 670
agc cgg ccg gat tcc gcg gct aac gag ggc gtg gct gcc ccg tcg ttt 10758
Ser Arg Pro Asp Ser Ala Ala Asn Glu Gly Val Ala Ala Pro Ser Phe
675 680 685 690
cca aga ccc cct agc cag ccg act tct cca gtt acg gag cga gcc cct 10806
Pro Arg Pro Pro Ser Gln Pro Thr Ser Pro Val Thr Glu Arg Ala Pro
695 700 705
ctt ttg ttt ttt tgt ttt tgc cag atg cat ccc gta ctg cgg cag atg 10854
Leu Leu Phe Phe Cys Phe Cys Gln Met His Pro Val Leu Arg Gln Met
710 715 720
cgc ccc cac cac cct cca ccg caa caa cag ccc cct cca cag ccg gcg 10902
Arg Pro His His Pro Pro Pro Gln Gln Gln Pro Pro Pro Gln Pro Ala
725 730 735
ctt ctg ccc ccg ccc cag cag cag cag caa ctt cca gcc acg acc gcc 10950
Leu Leu Pro Pro Pro Gln Gln Gln Gln Gln Leu Pro Ala Thr Thr Ala
740 745 750
gcg gcc gcc gtg agc ggg gct gga cag act tct cag tat gac ctg gcc 10998
Ala Ala Ala Val Ser Gly Ala Gly Gln Thr Ser Gln Tyr Asp Leu Ala
755 760 765 770
ttg gaa gag ggc gag ggg ctg gcg cgc ctg ggg gcg tcg tcg ccg gag 11046
Leu Glu Glu Gly Glu Gly Leu Ala Arg Leu Gly Ala Ser Ser Pro Glu
775 780 785
cgg cac ccg cgc gtg cag atg aaa agg gac gct cgc gag gcc tac gtg 11094
Arg His Pro Arg Val Gln Met Lys Arg Asp Ala Arg Glu Ala Tyr Val
790 795 800
ccc aag cag aac ctg ttc aga gac agg agc ggc gag gag ccc gag gag 11142
Pro Lys Gln Asn Leu Phe Arg Asp Arg Ser Gly Glu Glu Pro Glu Glu
805 810 815
atg cgc gcg gcc cgg ttc cac gcg ggg cgg gag ctg cgg cgc ggc ctg 11190
Met Arg Ala Ala Arg Phe His Ala Gly Arg Glu Leu Arg Arg Gly Leu
820 825 830
gac cga aag agg gtg ctg agg gac gag gat ttc gag gcg gac gag ctg 11238
Asp Arg Lys Arg Val Leu Arg Asp Glu Asp Phe Glu Ala Asp Glu Leu
835 840 845 850
acg ggg atc agc ccc gcg cgc gcg cac gtg gcc gcg gcc aac ctg gtc 11286
Thr Gly Ile Ser Pro Ala Arg Ala His Val Ala Ala Ala Asn Leu Val
855 860 865
acg gcg tac gag cag acc gtg aag gag gag agc aac ttc caa aaa tcc 11334
Thr Ala Tyr Glu Gln Thr Val Lys Glu Glu Ser Asn Phe Gln Lys Ser
870 875 880
ttc aac aac cac gtg cgc acc ctg atc gcg cgc gag gag gtg acc ctg 11382
Phe Asn Asn His Val Arg Thr Leu Ile Ala Arg Glu Glu Val Thr Leu
885 890 895
ggc ctg atg cac ctg tgg gac ctg ctg gag gcc atc gtg cag aac ccc 11430
Gly Leu Met His Leu Trp Asp Leu Leu Glu Ala Ile Val Gln Asn Pro
900 905 910
acc agc aag ccg ctg acg gcg cag ctg ttc ctg gtg gtg cag cac agt 11478
Thr Ser Lys Pro Leu Thr Ala Gln Leu Phe Leu Val Val Gln His Ser
915 920 925 930
cgg gac aac gag gcg ttc agg gag gcg ctg ctg aat atc act gag ccc 11526
Arg Asp Asn Glu Ala Phe Arg Glu Ala Leu Leu Asn Ile Thr Glu Pro
935 940 945
gag ggc cgc tgg ctc ctg gac ctg gtg aac att ctg caa agc atc gtg 11574
Glu Gly Arg Trp Leu Leu Asp Leu Val Asn Ile Leu Gln Ser Ile Val
950 955 960
gtg cag gag cgc ggg ctg ccg ctg tcc gag aag ctg gcg gcc atc aac 11622
Val Gln Glu Arg Gly Leu Pro Leu Ser Glu Lys Leu Ala Ala Ile Asn
965 970 975
ttc tcg gtg ctg agt ctg ggc aag tac tac gct agg aag atc tac aag 11670
Phe Ser Val Leu Ser Leu Gly Lys Tyr Tyr Ala Arg Lys Ile Tyr Lys
980 985 990
acc ccg tac gtg ccc ata gac aag gag gtg aag atc gat ggg ttt 11715
Thr Pro Tyr Val Pro Ile Asp Lys Glu Val Lys Ile Asp Gly Phe
995 1000 1005
tac atg cgc atg acc ctg aaa gtg ctg acc ctg agc gac gat ctg 11760
Tyr Met Arg Met Thr Leu Lys Val Leu Thr Leu Ser Asp Asp Leu
1010 1015 1020
ggg gtg tac cgc aac gac agg atg cac cgc gcg gtg agc gcc agc 11805
Gly Val Tyr Arg Asn Asp Arg Met His Arg Ala Val Ser Ala Ser
1025 1030 1035
agg cgg cgc gag ctg agc gac cag gag ctg atg cat agt ctg cag 11850
Arg Arg Arg Glu Leu Ser Asp Gln Glu Leu Met His Ser Leu Gln
1040 1045 1050
cgg gcc ctg acc ggg gcc ggg acc gag ggg gag agc tac ttt gac 11895
Arg Ala Leu Thr Gly Ala Gly Thr Glu Gly Glu Ser Tyr Phe Asp
1055 1060 1065
atg ggc gcg gac ctg cac tgg cag ccc agc cgc cgg gcc ttg gag 11940
Met Gly Ala Asp Leu His Trp Gln Pro Ser Arg Arg Ala Leu Glu
1070 1075 1080
gcg gcc ggt cct ccc tac gta gaa gag gtg gac gag gac gag gag 11985
Ala Ala Gly Pro Pro Tyr Val Glu Glu Val Asp Glu Asp Glu Glu
1085 1090 1095
ggc gag tac ctg gaa gac tgatggcgcg accgtatttt tgctag atg caa 12035
Gly Glu Tyr Leu Glu Asp Met Gln
1100 1105
caa cag cca cct cct gat ccc gca atg cgg gcg gcg ctg cag agc 12080
Gln Gln Pro Pro Pro Asp Pro Ala Met Arg Ala Ala Leu Gln Ser
1110 1115 1120
cag ccg tcc ggc att aac tcc tcg gac gat tgg acc cag gcc atg 12125
Gln Pro Ser Gly Ile Asn Ser Ser Asp Asp Trp Thr Gln Ala Met
1125 1130 1135
caa cgc atc atg gcg ctg acg acc cgc aac ccc gaa gcc ttt aga 12170
Gln Arg Ile Met Ala Leu Thr Thr Arg Asn Pro Glu Ala Phe Arg
1140 1145 1150
cag cag ccc cag gcc aac cgg ctc tcg gcc atc ctg gag gcc gtg 12215
Gln Gln Pro Gln Ala Asn Arg Leu Ser Ala Ile Leu Glu Ala Val
1155 1160 1165
gtg ccc tcg cgc tcc aac ccc acg cac gag aag gtc ctg gcc atc 12260
Val Pro Ser Arg Ser Asn Pro Thr His Glu Lys Val Leu Ala Ile
1170 1175 1180
gtg aac gcg ctg gtg gag aac aag gcc atc cgc ggc gac gag gcc 12305
Val Asn Ala Leu Val Glu Asn Lys Ala Ile Arg Gly Asp Glu Ala
1185 1190 1195
ggc ctg gtg tac aac gcg ctg ctg gag cgc gtg gct cgc tac aac 12350
Gly Leu Val Tyr Asn Ala Leu Leu Glu Arg Val Ala Arg Tyr Asn
1200 1205 1210
agc acc aac gtg cag acc aac ctg gac cgc atg gtg acc gac gtg 12395
Ser Thr Asn Val Gln Thr Asn Leu Asp Arg Met Val Thr Asp Val
1215 1220 1225
cgc gag gcc gtg gcc cag cgc gag cgg ttc cac cgc gag tcc aac 12440
Arg Glu Ala Val Ala Gln Arg Glu Arg Phe His Arg Glu Ser Asn
1230 1235 1240
ctg gga tcc atg gtg gcg ctg aac gcc ttc ctc agc acc cag ccc 12485
Leu Gly Ser Met Val Ala Leu Asn Ala Phe Leu Ser Thr Gln Pro
1245 1250 1255
gcc aac gtg ccc cgg ggc cag gag gac tac acc aac ttc atc agc 12530
Ala Asn Val Pro Arg Gly Gln Glu Asp Tyr Thr Asn Phe Ile Ser
1260 1265 1270
gcc ctg cgc ctg atg gtg acc gag gtg ccc cag agc gag gtg tac 12575
Ala Leu Arg Leu Met Val Thr Glu Val Pro Gln Ser Glu Val Tyr
1275 1280 1285
cag tcc ggg ccg gac tac ttc ttc cag acc agt cgc cag ggc ttg 12620
Gln Ser Gly Pro Asp Tyr Phe Phe Gln Thr Ser Arg Gln Gly Leu
1290 1295 1300
cag acc gtg aac ctg agc cag gcg ttc aag aac ttg cag ggc ctg 12665
Gln Thr Val Asn Leu Ser Gln Ala Phe Lys Asn Leu Gln Gly Leu
1305 1310 1315
tgg ggc gtg cag gcc ccg gtc ggg gac cgc gcg acg gtg tcg agc 12710
Trp Gly Val Gln Ala Pro Val Gly Asp Arg Ala Thr Val Ser Ser
1320 1325 1330
ctg ctg acg ccg aac tcg cgc ctg ctg ctg ttg ctg gtg gcg ccc 12755
Leu Leu Thr Pro Asn Ser Arg Leu Leu Leu Leu Leu Val Ala Pro
1335 1340 1345
ttc acg gac agc ggc agc atc aac cgc aac tcg tac ctg ggc tac 12800
Phe Thr Asp Ser Gly Ser Ile Asn Arg Asn Ser Tyr Leu Gly Tyr
1350 1355 1360
ctg att aac ttg tac cgc gag gcc atc ggc cag gcg cac gtg gac 12845
Leu Ile Asn Leu Tyr Arg Glu Ala Ile Gly Gln Ala His Val Asp
1365 1370 1375
gag cag acc tac cag gag atc acc cac gtg agc cgc gcc ctg ggc 12890
Glu Gln Thr Tyr Gln Glu Ile Thr His Val Ser Arg Ala Leu Gly
1380 1385 1390
cag gac gac ccg ggc aat ctg gaa gcc acc ctg aac ttt ttg ctg 12935
Gln Asp Asp Pro Gly Asn Leu Glu Ala Thr Leu Asn Phe Leu Leu
1395 1400 1405
acc aac cgg tcg cag aag atc ccg ccc cag tac gcg ctc agt gcc 12980
Thr Asn Arg Ser Gln Lys Ile Pro Pro Gln Tyr Ala Leu Ser Ala
1410 1415 1420
gag gag gag cgc atc ctg cga tac gtg cag cag agc gtg ggc ctg 13025
Glu Glu Glu Arg Ile Leu Arg Tyr Val Gln Gln Ser Val Gly Leu
1425 1430 1435
ttc ctg atg cag gag ggg gcc acc ccc agc gcc gcg ctc gac atg 13070
Phe Leu Met Gln Glu Gly Ala Thr Pro Ser Ala Ala Leu Asp Met
1440 1445 1450
acc gcg cgc aac atg gag ccc agc atg tac gcc agc aac cgc ccg 13115
Thr Ala Arg Asn Met Glu Pro Ser Met Tyr Ala Ser Asn Arg Pro
1455 1460 1465
ttc atc aat aaa ctg atg gac tac ttg cat cgg gcg gcc gcc atg 13160
Phe Ile Asn Lys Leu Met Asp Tyr Leu His Arg Ala Ala Ala Met
1470 1475 1480
aac tct gac tat ttc acc aac gcc atc ctg aat ccc cac tgg ctc 13205
Asn Ser Asp Tyr Phe Thr Asn Ala Ile Leu Asn Pro His Trp Leu
1485 1490 1495
ccg ccg ccg ggg ttc tac acg ggc gag tac gac atg ccc gac ccc 13250
Pro Pro Pro Gly Phe Tyr Thr Gly Glu Tyr Asp Met Pro Asp Pro
1500 1505 1510
aat gac ggg ttc ctg tgg gac gat gtg gac agc agc gtg ttc tcc 13295
Asn Asp Gly Phe Leu Trp Asp Asp Val Asp Ser Ser Val Phe Ser
1515 1520 1525
ccc cga ccg ggt gct aac gag cgc ccc ttg tgg aag aag gaa ggc 13340
Pro Arg Pro Gly Ala Asn Glu Arg Pro Leu Trp Lys Lys Glu Gly
1530 1535 1540
agc gac cgg cgc ccg tcc tcg gcg ctg tcc ggc cgc gag ggt gct 13385
Ser Asp Arg Arg Pro Ser Ser Ala Leu Ser Gly Arg Glu Gly Ala
1545 1550 1555
gcc gcg gcg gtg ccc gag gcc gcc agt cct ttc cct agc ttg ccc 13430
Ala Ala Ala Val Pro Glu Ala Ala Ser Pro Phe Pro Ser Leu Pro
1560 1565 1570
ttc tcg ctg aac agt atc cgc agc agc gag ctg ggc agg atc acg 13475
Phe Ser Leu Asn Ser Ile Arg Ser Ser Glu Leu Gly Arg Ile Thr
1575 1580 1585
cgc ccg cgc ttg ctg ggc gag gag gag tac ttg aat gac tcc ttg 13520
Arg Pro Arg Leu Leu Gly Glu Glu Glu Tyr Leu Asn Asp Ser Leu
1590 1595 1600
ctg aga ccc gag cgg gag aag aac ttc ccc aat aac ggg ata gag 13565
Leu Arg Pro Glu Arg Glu Lys Asn Phe Pro Asn Asn Gly Ile Glu
1605 1610 1615
agc ctg gtg gac aag atg agc cgc tgg aag acg tac gcg cag gag 13610
Ser Leu Val Asp Lys Met Ser Arg Trp Lys Thr Tyr Ala Gln Glu
1620 1625 1630
cac agg gac gat ccc cgg gcg tcg cag ggg gcc gcg agc cgg ggc 13655
His Arg Asp Asp Pro Arg Ala Ser Gln Gly Ala Ala Ser Arg Gly
1635 1640 1645
agc gcc gcc cgt aaa cgc cgg tgg cac gac agg cag cgg gga ctg 13700
Ser Ala Ala Arg Lys Arg Arg Trp His Asp Arg Gln Arg Gly Leu
1650 1655 1660
atg tgg gac gat gag gac tcc gcc gac gac agc agc gtg ttg gac 13745
Met Trp Asp Asp Glu Asp Ser Ala Asp Asp Ser Ser Val Leu Asp
1665 1670 1675
ttg ggt ggg agt ggt ggt aac ccg ttc gct cac ctg cgc ccc cgc 13790
Leu Gly Gly Ser Gly Gly Asn Pro Phe Ala His Leu Arg Pro Arg
1680 1685 1690
atc ggg cgc atg atg taagaaaccg aaaataaatg atactcacca aggccatggc 13845
Ile Gly Arg Met Met
1695
gaccagcgtg cgttcgtttc ttctctgttg tatctagt atg atg agg cgt gcg 13898
Met Met Arg Arg Ala
1700
tac ccg gag ggt cct cct ccc tcg tac gag agc gtg atg cag cag 13943
Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser Val Met Gln Gln
1705 1710 1715
gcg atg gcg gcg gcg gcg gcg atg cag ccc ccg ctg gag gct cct 13988
Ala Met Ala Ala Ala Ala Ala Met Gln Pro Pro Leu Glu Ala Pro
1720 1725 1730
tac gtg ccc ccg cgg tac ctg gcg cct acg gag ggg cgg aac agc 14033
Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser
1735 1740 1745
att cgt tac tcg gag ctg gca ccc ttg tac gat acc acc cgg ttg 14078
Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu
1750 1755 1760
tac ctg gtg gac aac aag tcg gcg gac atc gcc tcg ctg aac tac 14123
Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr
1765 1770 1775
cag aac gac cac agc aac ttc ctg acc acc gtg gtg cag aac aat 14168
Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn
1780 1785 1790
gac ttc acc ccc acg gag gcc agc acc cag acc atc aac ttt gac 14213
Asp Phe Thr Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp
1795 1800 1805
gag cgc tcg cgg tgg ggc ggc cag ctg aaa acc atc atg cac acc 14258
Glu Arg Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr
1810 1815 1820
aac atg ccc aac gtg aac gag ttc atg tac agc aac aag ttc aag 14303
Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys
1825 1830 1835
gcg cgg gtc atg gtc tcc cgc aag acc ccc aac ggg gtg aca gtg 14348
Ala Arg Val Met Val Ser Arg Lys Thr Pro Asn Gly Val Thr Val
1840 1845 1850
aca gat ggt agt cag gat gag ttg aaa tac gag tgg gtg gag ttt 14393
Thr Asp Gly Ser Gln Asp Glu Leu Lys Tyr Glu Trp Val Glu Phe
1855 1860 1865
gag ctg ccc gaa ggc aac ttc tcg gtg acc atg acc atc gac ctg 14438
Glu Leu Pro Glu Gly Asn Phe Ser Val Thr Met Thr Ile Asp Leu
1870 1875 1880
atg aac aac gcc atc atc gac aat tac ttg gcg gtg ggg cgg cag 14483
Met Asn Asn Ala Ile Ile Asp Asn Tyr Leu Ala Val Gly Arg Gln
1885 1890 1895
aac ggg gtc ctg gag agc gac atc ggc gtg aag ttc gac act agg 14528
Asn Gly Val Leu Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg
1900 1905 1910
aac ttc agg ctg ggc tgg gac ccc gtg acc gag ctg gtc atg ccc 14573
Asn Phe Arg Leu Gly Trp Asp Pro Val Thr Glu Leu Val Met Pro
1915 1920 1925
ggg gtg tac acc aac gag gcc ttc cat ccc gat att gtg ctg ctg 14618
Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp Ile Val Leu Leu
1930 1935 1940
ccc ggc tgc ggg gtg gac ttt acc gag agc cgc ctt agc aac ctg 14663
Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg Leu Ser Asn Leu
1945 1950 1955
ctg ggc att cgc aag agg cag cct ttc cag gag ggt ttc cag atc 14708
Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu Gly Phe Gln Ile
1960 1965 1970
atg tac gag gat ctg gag ggg ggc aac atc ccc gca ctc ctg gat 14753
Met Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp
1975 1980 1985
gtc gag gcc tac gag aaa agc aag gag gag aat gca gca gcc gag 14798
Val Glu Ala Tyr Glu Lys Ser Lys Glu Glu Asn Ala Ala Ala Glu
1990 1995 2000
gcc gtg gct acc gcc gcg acc gcc gag gcc aag gct gtg gta gat 14843
Ala Val Ala Thr Ala Ala Thr Ala Glu Ala Lys Ala Val Val Asp
2005 2010 2015
gca gac gcc aat gtg acc agg ggc gat aca ttc gcc act cag gcg 14888
Ala Asp Ala Asn Val Thr Arg Gly Asp Thr Phe Ala Thr Gln Ala
2020 2025 2030
gag gaa gca gcc gcc cta gcg gtc gcc gat gat agt gaa agt acc 14933
Glu Glu Ala Ala Ala Leu Ala Val Ala Asp Asp Ser Glu Ser Thr
2035 2040 2045
aag aca gtg acc att cag cca gta aaa gtg gat agc aag aac agg 14978
Lys Thr Val Thr Ile Gln Pro Val Lys Val Asp Ser Lys Asn Arg
2050 2055 2060
agt tac aac gtg ctg ccg gac gag aaa aac acc gcc tac cgc agc 15023
Ser Tyr Asn Val Leu Pro Asp Glu Lys Asn Thr Ala Tyr Arg Ser
2065 2070 2075
tgg tac ctg gcc tac aac tat ggc gac ccc gag aag ggc gtg cgc 15068
Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg
2080 2085 2090
tcc tgg acg ctg ctc acc acc tcg gac gtc acc tgc ggc gtg gag 15113
Ser Trp Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu
2095 2100 2105
caa gtc tac tgg tcg ctg ccc gac atg atg caa gac ccg gtc acc 15158
Gln Val Tyr Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr
2110 2115 2120
ttc cgc tcc acg cgt caa gtt agc aac tac ccg gtg gtg ggc gcc 15203
Phe Arg Ser Thr Arg Gln Val Ser Asn Tyr Pro Val Val Gly Ala
2125 2130 2135
gag ctc ctg ccc gtc tac tcc aag agc ttc ttc aac gag cag gcc 15248
Glu Leu Leu Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gln Ala
2140 2145 2150
gtc tac tcg cag cag ctg cgc gcc ttc acc tcg ctc acg cac gtc 15293
Val Tyr Ser Gln Gln Leu Arg Ala Phe Thr Ser Leu Thr His Val
2155 2160 2165
ttc aac cgc ttc ccc gag aac cag atc ctc gtc cgc ccg ccc gcg 15338
Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro Ala
2170 2175 2180
ccc acc att acc acc gtc agt gaa aac gtt cct gct ctc aca gat 15383
Pro Thr Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp
2185 2190 2195
cac ggg acc ctg ccg ctg cgc agc agt atc cgg gga gtc cag cgc 15428
His Gly Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly Val Gln Arg
2200 2205 2210
gtg acc gtt act gac gcc aga cgc cgc acc tgc ccc tac gtc tac 15473
Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr
2215 2220 2225
aag gcc ctg ggc ata gtc gcg ccg cgc gtc ctc tcg agc cgc acc 15518
Lys Ala Leu Gly Ile Val Ala Pro Arg Val Leu Ser Ser Arg Thr
2230 2235 2240
ttc taaaaa atg tcc att ctc atc tcg ccc agt aat aac acc ggt tgg 15566
Phe Met Ser Ile Leu Ile Ser Pro Ser Asn Asn Thr Gly Trp
2245 2250 2255
ggc ctg cgc gcg ccc agc aag atg tac gga ggc gct cgc caa cgc 15611
Gly Leu Arg Ala Pro Ser Lys Met Tyr Gly Gly Ala Arg Gln Arg
2260 2265 2270
tcc acg caa cac ccc gtg cgc gtg cgc ggg cac ttc cgc gct ccc 15656
Ser Thr Gln His Pro Val Arg Val Arg Gly His Phe Arg Ala Pro
2275 2280 2285
tgg ggc gcc ctc aag ggc cgc gtg cgg tcg cgc acc acc gtc gac 15701
Trp Gly Ala Leu Lys Gly Arg Val Arg Ser Arg Thr Thr Val Asp
2290 2295 2300
gac gtg atc gac cag gtg gtg gcc gac gcg cgc aac tac acc ccc 15746
Asp Val Ile Asp Gln Val Val Ala Asp Ala Arg Asn Tyr Thr Pro
2305 2310 2315
gcc gcc gcg ccc gtc tcc acc gtg gac gcc gtc atc gac agc gtg 15791
Ala Ala Ala Pro Val Ser Thr Val Asp Ala Val Ile Asp Ser Val
2320 2325 2330
gtg gcc gac gcg cgc cgg tac gcc cgc gcc aag agc cgg cgg cgg 15836
Val Ala Asp Ala Arg Arg Tyr Ala Arg Ala Lys Ser Arg Arg Arg
2335 2340 2345
cgc atc gcc cgg cgg cac cgg agc acc ccc gcc atg cgc gcg gcg 15881
Arg Ile Ala Arg Arg His Arg Ser Thr Pro Ala Met Arg Ala Ala
2350 2355 2360
cga gcc ttg ctg cgc agg gcc agg cgc acg gga cgc agg gcc atg 15926
Arg Ala Leu Leu Arg Arg Ala Arg Arg Thr Gly Arg Arg Ala Met
2365 2370 2375
ctc agg gcg gcc aga cgc gcg gct tca ggc gcc agc gcc ggc agg 15971
Leu Arg Ala Ala Arg Arg Ala Ala Ser Gly Ala Ser Ala Gly Arg
2380 2385 2390
acc cgg aga cgc gcg gcc acg gcg gcg gca gcg gcc atc gcc agc 16016
Thr Arg Arg Arg Ala Ala Thr Ala Ala Ala Ala Ala Ile Ala Ser
2395 2400 2405
atg tcc cgc ccg cgg cga ggg aac gtg tac tgg gtg cgc gac gcc 16061
Met Ser Arg Pro Arg Arg Gly Asn Val Tyr Trp Val Arg Asp Ala
2410 2415 2420
gcc acc ggt gtg cgc gtg ccc gtg cgc acc cgc ccc cct cgc act 16106
Ala Thr Gly Val Arg Val Pro Val Arg Thr Arg Pro Pro Arg Thr
2425 2430 2435
tgaagatgtt cacttcgcga tgttgatgtg tcccagcggc gagg atg tcc aag cgc 16162
Met Ser Lys Arg
2440
aaa ttc aag gaa gag atg ctc cag gtc atc gcg cct gag atc tac 16207
Lys Phe Lys Glu Glu Met Leu Gln Val Ile Ala Pro Glu Ile Tyr
2445 2450 2455
ggc ccc gcg gtg gtg aag gag gaa aga aag ccc cgc aaa atc aag 16252
Gly Pro Ala Val Val Lys Glu Glu Arg Lys Pro Arg Lys Ile Lys
2460 2465 2470
cgg gtc aaa aag gac aaa aag gaa gaa gat gac gat ctg gtg gag 16297
Arg Val Lys Lys Asp Lys Lys Glu Glu Asp Asp Asp Leu Val Glu
2475 2480 2485
ttt gtg cgc gag ttc gcc ccc cgg cgg cgc gtg cag tgg cgc ggg 16342
Phe Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gln Trp Arg Gly
2490 2495 2500
cgg aag gtg caa ccg gtg ctg aga ccc ggc acc acc gtg gtc ttc 16387
Arg Lys Val Gln Pro Val Leu Arg Pro Gly Thr Thr Val Val Phe
2505 2510 2515
acg ccc ggc gag cgc tca ggc acc gct tcc aag cgc tcc tac gac 16432
Thr Pro Gly Glu Arg Ser Gly Thr Ala Ser Lys Arg Ser Tyr Asp
2520 2525 2530
gag gtg tac ggg gat gat gat att ctg gag cag gcg gcc gag cgc 16477
Glu Val Tyr Gly Asp Asp Asp Ile Leu Glu Gln Ala Ala Glu Arg
2535 2540 2545
ctg ggc gag ttt gct tac ggc aag cgc agc cgc tcc gcg ccg aag 16522
Leu Gly Glu Phe Ala Tyr Gly Lys Arg Ser Arg Ser Ala Pro Lys
2550 2555 2560
gaa gag gcg gtg tcc atc ccg ctg gac cat ggc aac ccc acg ccg 16567
Glu Glu Ala Val Ser Ile Pro Leu Asp His Gly Asn Pro Thr Pro
2565 2570 2575
agc ctc aag ccc gtg acc ctg cag cag gtg ctg ccg agc gcg gcg 16612
Ser Leu Lys Pro Val Thr Leu Gln Gln Val Leu Pro Ser Ala Ala
2580 2585 2590
ccg cgt cgg ggg ttc aag cgc gag ggc gag gat ctg tac ccc acc 16657
Pro Arg Arg Gly Phe Lys Arg Glu Gly Glu Asp Leu Tyr Pro Thr
2595 2600 2605
atg cag ctg atg gtg ccc aag cgc cag aag ctg gaa gac gtg ctg 16702
Met Gln Leu Met Val Pro Lys Arg Gln Lys Leu Glu Asp Val Leu
2610 2615 2620
gag acc atg aag gtg gac ccg gac gtg cag ccc gag gtc aag gtg 16747
Glu Thr Met Lys Val Asp Pro Asp Val Gln Pro Glu Val Lys Val
2625 2630 2635
cgg ccc atc aag cag gtg gcc ccg ggc ctg ggc gtg cag acc gtg 16792
Arg Pro Ile Lys Gln Val Ala Pro Gly Leu Gly Val Gln Thr Val
2640 2645 2650
gac atc aag atc ccc acg gag ccc atg gaa acg cag acc gag ccc 16837
Asp Ile Lys Ile Pro Thr Glu Pro Met Glu Thr Gln Thr Glu Pro
2655 2660 2665
gtg aaa ccc agc acc agc acc atg gag gtg cag acg gat cct tgg 16882
Val Lys Pro Ser Thr Ser Thr Met Glu Val Gln Thr Asp Pro Trp
2670 2675 2680
atg cca gcc gcc ccc acc act cga aga ccc cgg cgc aag tac ggc 16927
Met Pro Ala Ala Pro Thr Thr Arg Arg Pro Arg Arg Lys Tyr Gly
2685 2690 2695
gcg gcc agc ctg ctg atg ccc aac tac gcg ctg cat cct tcc atc 16972
Ala Ala Ser Leu Leu Met Pro Asn Tyr Ala Leu His Pro Ser Ile
2700 2705 2710
atc ccc acg ccg ggc tac cgc ggc acg cgc ttc tac cgc ggt cat 17017
Ile Pro Thr Pro Gly Tyr Arg Gly Thr Arg Phe Tyr Arg Gly His
2715 2720 2725
aca agc cgc cgc cgc aag acc acc act cgc cgc cgc cgt cgc cgc 17062
Thr Ser Arg Arg Arg Lys Thr Thr Thr Arg Arg Arg Arg Arg Arg
2730 2735 2740
acc gcc gct gca tct acc cct gcc gcc ctg gtg cgg aga gtg tac 17107
Thr Ala Ala Ala Ser Thr Pro Ala Ala Leu Val Arg Arg Val Tyr
2745 2750 2755
cgc cgc ggc cgc gcg cct ctg acc ctg ccg cgc gcg cgc tac cac 17152
Arg Arg Gly Arg Ala Pro Leu Thr Leu Pro Arg Ala Arg Tyr His
2760 2765 2770
ccg agc atc gcc att taaaactttc gcctgctttg cagatca atg gcc ctc 17203
Pro Ser Ile Ala Ile Met Ala Leu
2775
aca tgc cgc ctc cgc gtt ccc att acg ggc tac cga gga aga aaa 17248
Thr Cys Arg Leu Arg Val Pro Ile Thr Gly Tyr Arg Gly Arg Lys
2780 2785 2790
ccg cgc cgt aga agg ctg gcg ggg aac ggg atg cgt cgc cac cac 17293
Pro Arg Arg Arg Arg Leu Ala Gly Asn Gly Met Arg Arg His His
2795 2800 2805
cac cgg cgg cgg cgc gcc atc agc aag cgg ttg ggg gga ggc ttc 17338
His Arg Arg Arg Arg Ala Ile Ser Lys Arg Leu Gly Gly Gly Phe
2810 2815 2820
ctg ccc gcg ctg atc ccc atc atc gcc gcg gcg atc ggg gcg atc 17383
Leu Pro Ala Leu Ile Pro Ile Ile Ala Ala Ala Ile Gly Ala Ile
2825 2830 2835
ccc ggc att gct tcc gtg gcg gtg cag gcc tct cag cgc cac tga 17428
Pro Gly Ile Ala Ser Val Ala Val Gln Ala Ser Gln Arg His
2840 2845 2850
gacacacttg gaaaacatct tgtaataaac ca atg gac tct gac gct cct ggt 17481
Met Asp Ser Asp Ala Pro Gly
2855
cct gtg atg tgt ttt cgt aga cag atg gaa gac atc aat ttt tcg 17526
Pro Val Met Cys Phe Arg Arg Gln Met Glu Asp Ile Asn Phe Ser
2860 2865 2870
tcc ctg gct ccg cga cac ggc acg cgg ccg ttc atg ggc acc tgg 17571
Ser Leu Ala Pro Arg His Gly Thr Arg Pro Phe Met Gly Thr Trp
2875 2880 2885
agc gac atc ggc acc agc caa ctg aac ggg ggc gcc ttc aat tgg 17616
Ser Asp Ile Gly Thr Ser Gln Leu Asn Gly Gly Ala Phe Asn Trp
2890 2895 2900
agc agt ctc tgg agc ggg ctt aag aat ttc ggg tcc acg ctt aaa 17661
Ser Ser Leu Trp Ser Gly Leu Lys Asn Phe Gly Ser Thr Leu Lys
2905 2910 2915
acc tat ggc agc aag gcg tgg aac agc acc aca ggg cag gcg ctg 17706
Thr Tyr Gly Ser Lys Ala Trp Asn Ser Thr Thr Gly Gln Ala Leu
2920 2925 2930
agg gat aag ctg aaa gag cag aac ttc cag cag aag gtg gtc gat 17751
Arg Asp Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys Val Val Asp
2935 2940 2945
ggg ctc gcc tcg ggc atc aac ggg gtg gtg gac ctg gcc aac cag 17796
Gly Leu Ala Ser Gly Ile Asn Gly Val Val Asp Leu Ala Asn Gln
2950 2955 2960
gcc gtg cag cgg cag atc aac agc cgc ctg gac ccg gtg ccg ccc 17841
Ala Val Gln Arg Gln Ile Asn Ser Arg Leu Asp Pro Val Pro Pro
2965 2970 2975
gcc ggc tcc gtg gag atg ccg cag gtg gag gag gag ctg cct ccc 17886
Ala Gly Ser Val Glu Met Pro Gln Val Glu Glu Glu Leu Pro Pro
2980 2985 2990
ctg gac aag cgg ggc gag aag cga ccc cgc ccc gac gcg gag gag 17931
Leu Asp Lys Arg Gly Glu Lys Arg Pro Arg Pro Asp Ala Glu Glu
2995 3000 3005
acg ctg ctg acg cac acg gac gag ccg ccc ccg tac gag gag gcg 17976
Thr Leu Leu Thr His Thr Asp Glu Pro Pro Pro Tyr Glu Glu Ala
3010 3015 3020
gtg aaa ctg ggt ctg ccc acc acg cgg ccc atc gcg ccc ctg gcc 18021
Val Lys Leu Gly Leu Pro Thr Thr Arg Pro Ile Ala Pro Leu Ala
3025 3030 3035
acc gga gtg ctg aaa ccc gaa act aag ccc gcg acc ctg gac ttg 18066
Thr Gly Val Leu Lys Pro Glu Thr Lys Pro Ala Thr Leu Asp Leu
3040 3045 3050
cct cct ccc cag cct tcc cgc ccc tcc aca gtg gct aag ccc ctg 18111
Pro Pro Pro Gln Pro Ser Arg Pro Ser Thr Val Ala Lys Pro Leu
3055 3060 3065
ccg ccg gtg gcc gtg gcc cgc gcg cga ccc ggg ggc acc gcc cgc 18156
Pro Pro Val Ala Val Ala Arg Ala Arg Pro Gly Gly Thr Ala Arg
3070 3075 3080
cct cat gcg aac tgg cag agc act ctg aac agc atc gtg ggt ctg 18201
Pro His Ala Asn Trp Gln Ser Thr Leu Asn Ser Ile Val Gly Leu
3085 3090 3095
gga gtg cag agt gtg aag cgc cgc cgc tgc tat taaacctacc 18244
Gly Val Gln Ser Val Lys Arg Arg Arg Cys Tyr
3100 3105 3110
gtagcgctta acttgcttgt ctgtgtgtgt atgtattatg tcgccgccgc cgctgtccac 18304
cagaaggagg agtgaagagg cgcgtcgccg agttgcaag atg gcc acc cca tcg 18358
Met Ala Thr Pro Ser
3115
atg ctg ccc cag tgg gcg tac atg cac atc gcc gga cag gac gct 18403
Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala Gly Gln Asp Ala
3120 3125 3130
tcg gag tac ctg agt ccg ggt ctg gtg cag ttc gcc cgc gcc aca 18448
Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala Arg Ala Thr
3135 3140 3145
gac acc tac ttc agt ctg ggg aac aag ttt agg aac ccc acg gtg 18493
Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr Val
3150 3155 3160
gcg ccc acg cac gat gtg acc acc gac cgc agc cag cgg ctg acg 18538
Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu Thr
3165 3170 3175
ctg cgc ttc gtg ccc gtg gac cgc gag gac aac acc tac tcg tac 18583
Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr
3180 3185 3190
aaa gtg cgc tac acg ctg gcc gtg ggc gac aac cgc gtg ctg gac 18628
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp
3195 3200 3205
atg gcc agc acc tac ttt gac atc cgc ggc gtg ctg gac cgg ggc 18673
Met Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly
3210 3215 3220
cct agc ttc aaa ccc tac tcc ggc acc gcc tac aac agc ctg gct 18718
Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala
3225 3230 3235
ccc aag gga gca ccc aac acc tca cag tgg gta acc aaa gat aat 18763
Pro Lys Gly Ala Pro Asn Thr Ser Gln Trp Val Thr Lys Asp Asn
3240 3245 3250
ggg act gat aaa aca tac agc ttt ggt aat gct cct gtc cga ggt 18808
Gly Thr Asp Lys Thr Tyr Ser Phe Gly Asn Ala Pro Val Arg Gly
3255 3260 3265
ttg gac att aca gaa gag ggt ctc caa att gga act gat gac tct 18853
Leu Asp Ile Thr Glu Glu Gly Leu Gln Ile Gly Thr Asp Asp Ser
3270 3275 3280
tca acc gaa agc aag aaa att ttt gca gac aaa aca tat cag cct 18898
Ser Thr Glu Ser Lys Lys Ile Phe Ala Asp Lys Thr Tyr Gln Pro
3285 3290 3295
gaa cct cag gtt gga gat gag gaa tgg cat gac acc att ggg gct 18943
Glu Pro Gln Val Gly Asp Glu Glu Trp His Asp Thr Ile Gly Ala
3300 3305 3310
gaa gac aaa tat gga ggc aga gct ctt aaa cct gcc acc aac atg 18988
Glu Asp Lys Tyr Gly Gly Arg Ala Leu Lys Pro Ala Thr Asn Met
3315 3320 3325
aaa ccc tgt tat ggt tct ttt gcc aag cca act aat gct aag gga 19033
Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Ala Lys Gly
3330 3335 3340
ggt cag gct aaa acc aga acc aaa gac gat gga act acc gag cct 19078
Gly Gln Ala Lys Thr Arg Thr Lys Asp Asp Gly Thr Thr Glu Pro
3345 3350 3355
gat att gat atg gcc ttc ttt gac gat cgc agt cag cag gct agt 19123
Asp Ile Asp Met Ala Phe Phe Asp Asp Arg Ser Gln Gln Ala Ser
3360 3365 3370
ttc agt cca gaa ctt gtt ttg tat act gag aat gtg gat ttg gag 19168
Phe Ser Pro Glu Leu Val Leu Tyr Thr Glu Asn Val Asp Leu Glu
3375 3380 3385
acc cca gat acc cac att att tac aaa ccc ggt act gat gaa acc 19213
Thr Pro Asp Thr His Ile Ile Tyr Lys Pro Gly Thr Asp Glu Thr
3390 3395 3400
agt tct tct ttc aac ttg ggc cag caa tcc atg ccc aac agg ccc 19258
Ser Ser Ser Phe Asn Leu Gly Gln Gln Ser Met Pro Asn Arg Pro
3405 3410 3415
aac tac att ggt ttc aga gac aac ttt att ggt ctc atg tat tac 19303
Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr
3420 3425 3430
aac agc act ggc aat atg ggg gtg ctg gcc ggt cag gct tct cag 19348
Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln
3435 3440 3445
ctg aat gct gtg gtt gac ttg caa gac aga aac acc gag ctg tcc 19393
Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser
3450 3455 3460
tac cag ctc ttg ctt gac tct ctg ggc gac aga acc cgg tat ttc 19438
Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe
3465 3470 3475
agt atg tgg aac cag gcg gtg gac agc tat gat cct gat gtg cgc 19483
Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg
3480 3485 3490
att att gaa aac cat ggt gtg gag gat gaa ttg cca aac tat tgt 19528
Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys
3495 3500 3505
ttc cct ctg aac ggt gtg ggc ttc aca gac aca ttc cag gga att 19573
Phe Pro Leu Asn Gly Val Gly Phe Thr Asp Thr Phe Gln Gly Ile
3510 3515 3520
aag gtt aaa act acc aac aac ggt act gct aat gct aca gag tgg 19618
Lys Val Lys Thr Thr Asn Asn Gly Thr Ala Asn Ala Thr Glu Trp
3525 3530 3535
gag tct gat act tct gtc aat aat gcc aat gag att gcc aag ggt 19663
Glu Ser Asp Thr Ser Val Asn Asn Ala Asn Glu Ile Ala Lys Gly
3540 3545 3550
aat cca ttt gcc atg gaa atc aac att caa gcc aac ctg tgg agg 19708
Asn Pro Phe Ala Met Glu Ile Asn Ile Gln Ala Asn Leu Trp Arg
3555 3560 3565
aac ttc ctc tat gcc aac gtg gcc ctg tac ctg ccc gat tct tac 19753
Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr
3570 3575 3580
aag tac acg ccg gcc aac gtc acc ctg ccc acc aac atc aac acc 19798
Lys Tyr Thr Pro Ala Asn Val Thr Leu Pro Thr Asn Ile Asn Thr
3585 3590 3595
tac gat tac atg aac ggc cgg gtg gtg gcg ccc tcg ctg gtg gac 19843
Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp
3600 3605 3610
tcc tac atc aac atc ggg gcg cgc tgg tcg ctg gac ccc atg gac 19888
Ser Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met Asp
3615 3620 3625
aac gtc aat ccc ttc aac cac cac cgc aac gcg ggg ctg cgc tac 19933
Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr
3630 3635 3640
cgc tcc atg ctc ctg ggc aac ggg cgc tac gtg ccc ttc cac atc 19978
Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile
3645 3650 3655
cag gtg ccc cag aaa ttt ttc gcc atc aag agc ctc ctg ctc ctg 20023
Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu Leu
3660 3665 3670
ccc ggg tcc tac acc tac gag tgg aac ttc cgc aag gac gtc aac 20068
Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn
3675 3680 3685
atg atc ctg cag agc tcc ctc ggc aac gac ctg cgc acg gac ggg 20113
Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp Gly
3690 3695 3700
gcc tcc atc tcc ttc acc agc atc aac ctc tac gcc acc ttc ttc 20158
Ala Ser Ile Ser Phe Thr Ser Ile Asn Leu Tyr Ala Thr Phe Phe
3705 3710 3715
ccc atg gcg cac aac acg gcc tcc acg ctc gag gcc atg ctg cgc 20203
Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg
3720 3725 3730
aac gac acc aac gac cag tcc ttc aac gac tac ctc tcg gcg gcc 20248
Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala
3735 3740 3745
aac atg ctc tac ccc atc ccg gcc aac gcc acc aac gtg ccc atc 20293
Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile
3750 3755 3760
tcc atc ccc tcg cgc aac tgg gcc gcc ttc cgc ggc tgg tcc ttc 20338
Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe
3765 3770 3775
acg cgc ctc aag acc aag gag acg ccc tcg ctg ggc tcc ggg ttc 20383
Thr Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe
3780 3785 3790
gac ccc tac ttc gtc tac tcg ggc tcc atc ccc tac ctc gac ggc 20428
Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly
3795 3800 3805
acc ttc tac ctc aac cac acc ttc aag aag gtc tcc atc acc ttc 20473
Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Thr Phe
3810 3815 3820
gac tcc tcc gtc agc tgg ccc ggc aac gac cgg ctc ctg acg ccc 20518
Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro
3825 3830 3835
aac gag ttc gaa atc aag cgc acc gtc gac ggc gag gga tac aac 20563
Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn
3840 3845 3850
gtg gcc cag tgc aac atg acc aag gac tgg ttc ctg gtc cag atg 20608
Val Ala Gln Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gln Met
3855 3860 3865
ctg gcc cac tac aac atc ggc tac cag ggc ttc tac gtg ccc gag 20653
Leu Ala His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu
3870 3875 3880
ggc tac aag gac cgc atg tac tcc ttc ttc cgc aac ttc cag ccc 20698
Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro
3885 3890 3895
atg agc cgc cag gtg gtg gac gag gtc aac tac aag gac tac cag 20743
Met Ser Arg Gln Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gln
3900 3905 3910
gcc gtc acc ctg gcc tac cag cac aac aac tcg ggc ttc gtc ggc 20788
Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser Gly Phe Val Gly
3915 3920 3925
tac ctc gcg ccc acc atg cgc cag ggc cag ccc tac ccc gcc aac 20833
Tyr Leu Ala Pro Thr Met Arg Gln Gly Gln Pro Tyr Pro Ala Asn
3930 3935 3940
tac ccc tat ccg ctc atc ggc aag agc gcc gtc acc agc gtc acc 20878
Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Thr Ser Val Thr
3945 3950 3955
cag aaa aag ttc ctc tgc gac agg gtc atg tgg cgc atc ccc ttc 20923
Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe
3960 3965 3970
tcc agc aac ttc atg tcc atg ggc gcg ctc acc gac ctc ggc cag 20968
Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln
3975 3980 3985
aac atg ctc tat gcc aac tcc gcc cac gcg cta gac atg aat ttc 21013
Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe
3990 3995 4000
gaa gtc gac ccc atg gat gag tcc acc ctt ctc tat gtt gtc ttc 21058
Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe
4005 4010 4015
gaa gtc ttc gac gtc gtc cga gtg cac cag ccc cac cgc ggc gtc 21103
Glu Val Phe Asp Val Val Arg Val His Gln Pro His Arg Gly Val
4020 4025 4030
atc gag gcc gtc tac ctg cgc acc ccc ttc tcg gcc ggt aac gcc 21148
Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala
4035 4040 4045
acc acc taagctcttg cttcttgcaa g atg gct gag ccc acg ggc tcc ggc 21199
Thr Thr Met Ala Glu Pro Thr Gly Ser Gly
4050 4055
gag cag gag ctc agg gcc atc atc cgc gac ctg ggc tgc ggg ccc 21244
Glu Gln Glu Leu Arg Ala Ile Ile Arg Asp Leu Gly Cys Gly Pro
4060 4065 4070
tac ttc ctg ggc acc ttc gat aag cgc ttc ccg gga ttc atg gcc 21289
Tyr Phe Leu Gly Thr Phe Asp Lys Arg Phe Pro Gly Phe Met Ala
4075 4080 4085
ccg cac aag ctg gcc tgc gcc atc gtc aac acg gcc ggc cgc gag 21334
Pro His Lys Leu Ala Cys Ala Ile Val Asn Thr Ala Gly Arg Glu
4090 4095 4100
acc ggg ggc gag cac tgg ctg gcc ttc gcc tgg aac ccg cgc tcg 21379
Thr Gly Gly Glu His Trp Leu Ala Phe Ala Trp Asn Pro Arg Ser
4105 4110 4115
aac acc tgc tac ctc ttc gac ccc ttc ggg ttc tcg gac gag cga 21424
Asn Thr Cys Tyr Leu Phe Asp Pro Phe Gly Phe Ser Asp Glu Arg
4120 4125 4130
ctc aag cag atc tac cag ttc gag tac gaa ggc ctg ctg cgc cgc 21469
Leu Lys Gln Ile Tyr Gln Phe Glu Tyr Glu Gly Leu Leu Arg Arg
4135 4140 4145
agc gcc ctg gcc acc gag gac cgc tgc gtc acc ctg gaa aag tcc 21514
Ser Ala Leu Ala Thr Glu Asp Arg Cys Val Thr Leu Glu Lys Ser
4150 4155 4160
acc cag acc gtg cag ggt ccg cgc tcg gcc gcc tgc ggg ctc ttc 21559
Thr Gln Thr Val Gln Gly Pro Arg Ser Ala Ala Cys Gly Leu Phe
4165 4170 4175
tgc tgc atg ttc ctg cac gcc ttc gtg cac tgg ccc gac cgc ccc 21604
Cys Cys Met Phe Leu His Ala Phe Val His Trp Pro Asp Arg Pro
4180 4185 4190
atg gac aag aac ccc acc atg aac ttg ctg acg ggg gtg ccc aac 21649
Met Asp Lys Asn Pro Thr Met Asn Leu Leu Thr Gly Val Pro Asn
4195 4200 4205
ggc atg ctc cag tcg ccc cag gtg gaa ccc acc ctg cgc cgc aac 21694
Gly Met Leu Gln Ser Pro Gln Val Glu Pro Thr Leu Arg Arg Asn
4210 4215 4220
cag gag gcg ctc tac cgc ttc ctc aac tcc cac tcc gcc tac ttt 21739
Gln Glu Ala Leu Tyr Arg Phe Leu Asn Ser His Ser Ala Tyr Phe
4225 4230 4235
cgc tcc cac cgc gcg cgc atc gag aag gcc acc gcc ttc gac cgc 21784
Arg Ser His Arg Ala Arg Ile Glu Lys Ala Thr Ala Phe Asp Arg
4240 4245 4250
atg aac aat caa gac atg taacactgtg tgtgtatgtt aaaatatctt 21832
Met Asn Asn Gln Asp Met
4255
ttaataaaca gcactttcat gttacacatg catctgagat gattttattt tagaaatcga 21892
aagggttctg ccgggtctcg gcatggcccg cgggcaggga cacgttgcgg aactggtact 21952
tggccagcca cttgaactcg gggatcagca gtttcggcag cggggtgtcg ggaaaggagt 22012
cggtccacag cttccgcgtc agttgcaggg cgcccagcag gtcgggcgcg gagatcttga 22072
aatcgcagtt gggacccgcg ttctgcgcgc gagagttgcg gtacacgggg ttgcagcact 22132
ggaacaccat cagggccggg tgcttcacgc tcgccagcac cgtcgcgtcg gtgatgctct 22192
ccacgtcgag gtcctcggcg ttggccatcc cgaagggggt catcttgcag gtctgccttc 22252
ccatagtggg cacgcagccg ggcttgtggt tgcaatcgca gtgcaggggg atcagcatca 22312
tctgggcctg gtcggcgttc atccccgggt acatggcctt catgaaagcc tccaattgcc 22372
taaaagcctg ctgggccttg gctccctcgg tgaagaagac cccgcaggac ttgctagaga 22432
actggttggt ggcgcacccg gcgtcgtgca cgcagcagcg cgcgtcgttg ttggccagct 22492
gcaccacgct gcgcccccag cggttctggg tgatcttggc ccggtcgggg ttctccttca 22552
gcgcgcgctg tccgttctcg ctcgccacat ccatctcgat catgtgctcc ttctggatca 22612
tggtggtacc atgcaggcac cgcagcttgc cctcggtctc ggtgcacccg tgcagccaca 22672
gcgcgcaccc ggtgcactcc cagttcttgt gggcgatctg ggaatgcgcg tgcacgaagc 22732
cctgcaggaa gcggcccatc atggtggtta gggtcttgtt gctagtgaag gtcagcggga 22792
tgccgcggtg ctcctcattg atgtacaggt ggcagatgcg gcggtacacc tcgccctgct 22852
cgggcatcag ttggaagttg gctttcaggt cggtctccac gcggtagcgg tccatcagca 22912
tagtcatgat ttccatgccc ttctcccagg ccgagacgat gggtaggctc atggggttct 22972
tcaccatcat cttagcacta gcagccgcgg ccagggggtc gctctcgttc agggtctcaa 23032
agctccgctt gccgtccttc tcggtgatcc gcacgggggg gtagctgaag cccacggccg 23092
ccagctcctc ctcggcctgt ctttcgtcct cgctgtcctg gctgacgtcc tgcaggacca 23152
catgcttggt cttgcggggt ttcttcttgg gcggcagcgg tggcggagat gttggagatg 23212
gtgaggggga gcgcgagttc tcgctcacca ctactatctc ttcctcttct tggtccgagg 23272
ccacgcggcg gtaggtatgt ctcttcgggg gcagaggcgg aggcgacggg ctctcgccgc 23332
cgcgacttgg cggatggctg gcagagcccc ttccgcgttc gggggtgcgc tcccggcggc 23392
gctctgactg acttcctccg cggccggcca ttgtgttctc ctagggaaca acaagc atg 23451
Met
gag act cag cca tcg cca acc tcg cca tct gcc ccc acc acc gcc 23496
Glu Thr Gln Pro Ser Pro Thr Ser Pro Ser Ala Pro Thr Thr Ala
4260 4265 4270
gac gag aag cag cag cag cag aat gaa agc tta acc gcc ccg ccg 23541
Asp Glu Lys Gln Gln Gln Gln Asn Glu Ser Leu Thr Ala Pro Pro
4275 4280 4285
ccc agc ccc gcc tcc gac gcg gcc gcg gtc cca gac atg caa gag 23586
Pro Ser Pro Ala Ser Asp Ala Ala Ala Val Pro Asp Met Gln Glu
4290 4295 4300
atg gag gaa tcc atc gag att gac ctg ggc tat gtg acg ccc gcg 23631
Met Glu Glu Ser Ile Glu Ile Asp Leu Gly Tyr Val Thr Pro Ala
4305 4310 4315
gag cac gag gag gag ctg gca gtg cgc ttt caa tcg tca agc cag 23676
Glu His Glu Glu Glu Leu Ala Val Arg Phe Gln Ser Ser Ser Gln
4320 4325 4330
gaa gat aaa gaa cag cca gag cag gaa gca gag aac gag cag agt 23721
Glu Asp Lys Glu Gln Pro Glu Gln Glu Ala Glu Asn Glu Gln Ser
4335 4340 4345
cag act ggg ctc gag cat ggc gac tac ctc cac ctg agc ggg gag 23766
Gln Thr Gly Leu Glu His Gly Asp Tyr Leu His Leu Ser Gly Glu
4350 4355 4360
gag gac gcg ctc atc aag cat ctg gcc cgg cag gcc acc atc gtc 23811
Glu Asp Ala Leu Ile Lys His Leu Ala Arg Gln Ala Thr Ile Val
4365 4370 4375
aag gat gcg ctg ctc gac cgc acc gag gtg ccc ctc agc gtg gag 23856
Lys Asp Ala Leu Leu Asp Arg Thr Glu Val Pro Leu Ser Val Glu
4380 4385 4390
gag ctc agc cgc gcc tac gag ctc aac ctc ttc tcg ccg cgc gtg 23901
Glu Leu Ser Arg Ala Tyr Glu Leu Asn Leu Phe Ser Pro Arg Val
4395 4400 4405
ccc ccc aag cgc cag ccc aac ggc acc tgc gag ccc aac ccg cgc 23946
Pro Pro Lys Arg Gln Pro Asn Gly Thr Cys Glu Pro Asn Pro Arg
4410 4415 4420
ctc aac ttc tac ccg gtc ttc gcg gtg ccc gag gcc ctg gcc acc 23991
Leu Asn Phe Tyr Pro Val Phe Ala Val Pro Glu Ala Leu Ala Thr
4425 4430 4435
tac cac atc ttt ttc aag aac caa aag atc ccc gtc tcc tgc cgc 24036
Tyr His Ile Phe Phe Lys Asn Gln Lys Ile Pro Val Ser Cys Arg
4440 4445 4450
gcc aac cgc acc cgc gcc gac gcc ctt ttc aac ctg ggc ccc ggc 24081
Ala Asn Arg Thr Arg Ala Asp Ala Leu Phe Asn Leu Gly Pro Gly
4455 4460 4465
gcc cgc cta cct gat atc gcc tcc ttg gaa gag gtt ccc aag atc 24126
Ala Arg Leu Pro Asp Ile Ala Ser Leu Glu Glu Val Pro Lys Ile
4470 4475 4480
ttc gag ggt ctg ggc agc gac gag act cgg gcc gcg aac gct ctg 24171
Phe Glu Gly Leu Gly Ser Asp Glu Thr Arg Ala Ala Asn Ala Leu
4485 4490 4495
caa gga aat gaa gag cat gag cac cac agc gcc ctg gtc gag ttg 24216
Gln Gly Asn Glu Glu His Glu His His Ser Ala Leu Val Glu Leu
4500 4505 4510
gaa ggc gac aac gcg cgg ctg gcg gtg ctc aaa cgc acg gtc gag 24261
Glu Gly Asp Asn Ala Arg Leu Ala Val Leu Lys Arg Thr Val Glu
4515 4520 4525
ctg acc cat ttc gcc tac ccg gct ctg aac ctg ccc ccc aaa gtc 24306
Leu Thr His Phe Ala Tyr Pro Ala Leu Asn Leu Pro Pro Lys Val
4530 4535 4540
atg agc gcg gtc atg gac cag gtg ctc atc aag cgc gcg tcg ccc 24351
Met Ser Ala Val Met Asp Gln Val Leu Ile Lys Arg Ala Ser Pro
4545 4550 4555
atc tcc gag gac gag ggc atg caa gac tcc gag gag ggc aag ccc 24396
Ile Ser Glu Asp Glu Gly Met Gln Asp Ser Glu Glu Gly Lys Pro
4560 4565 4570
gtg gtc agt gac gag cag ctg gcc cgg tgg ctg ggt cct aat gct 24441
Val Val Ser Asp Glu Gln Leu Ala Arg Trp Leu Gly Pro Asn Ala
4575 4580 4585
acc cct cag agt ttg gaa gag cgg cgc aag ctt atg atg gcc gtg 24486
Thr Pro Gln Ser Leu Glu Glu Arg Arg Lys Leu Met Met Ala Val
4590 4595 4600
gtc ctg gtg acc gtg gag ctg gag tgc ctg cgc cgc ttc ttc gcc 24531
Val Leu Val Thr Val Glu Leu Glu Cys Leu Arg Arg Phe Phe Ala
4605 4610 4615
gac gcg gag acc ctg cgc aag gtc gag gag aac ctg cac tac ctc 24576
Asp Ala Glu Thr Leu Arg Lys Val Glu Glu Asn Leu His Tyr Leu
4620 4625 4630
ttc agg cac ggg ttc gtg cgc cag gcc tgc aag atc tcc aat gtg 24621
Phe Arg His Gly Phe Val Arg Gln Ala Cys Lys Ile Ser Asn Val
4635 4640 4645
gag ctg acc aac ctg gtc tcc tac atg ggc atc ttg cac gag aac 24666
Glu Leu Thr Asn Leu Val Ser Tyr Met Gly Ile Leu His Glu Asn
4650 4655 4660
cgc ctg ggg cag aac gtg ctg cac acc acc ctg cgc ggg gag gcc 24711
Arg Leu Gly Gln Asn Val Leu His Thr Thr Leu Arg Gly Glu Ala
4665 4670 4675
cgc cgc gac tac atc cgc gac tgc gtc tac ctg tac ctc tgc cac 24756
Arg Arg Asp Tyr Ile Arg Asp Cys Val Tyr Leu Tyr Leu Cys His
4680 4685 4690
acc tgg cag acg ggc atg ggc gtg tgg cag cag tgc ctg gag gag 24801
Thr Trp Gln Thr Gly Met Gly Val Trp Gln Gln Cys Leu Glu Glu
4695 4700 4705
cag aac ctg aaa gag cta tgc aag ctc ctg cag aag aac ctc aag 24846
Gln Asn Leu Lys Glu Leu Cys Lys Leu Leu Gln Lys Asn Leu Lys
4710 4715 4720
ggt ctg tgg acc ggg ttc gac gag cgg acc acc gcc tcg gac ctg 24891
Gly Leu Trp Thr Gly Phe Asp Glu Arg Thr Thr Ala Ser Asp Leu
4725 4730 4735
gcc gac ctc atc ttc ccc gag cgc ctc agg ctg acg ctg cgc aac 24936
Ala Asp Leu Ile Phe Pro Glu Arg Leu Arg Leu Thr Leu Arg Asn
4740 4745 4750
ggc ctg ccc gac ttt atg agc caa agc atg ttg caa aac ttt cgc 24981
Gly Leu Pro Asp Phe Met Ser Gln Ser Met Leu Gln Asn Phe Arg
4755 4760 4765
tct ttc atc ctc gaa cgc tcc gga atc ctg ccc gcc acc tgc tcc 25026
Ser Phe Ile Leu Glu Arg Ser Gly Ile Leu Pro Ala Thr Cys Ser
4770 4775 4780
gcg ctg ccc tcg gac ttc gtg ccg ctg acc ttc cgc gag tgc ccc 25071
Ala Leu Pro Ser Asp Phe Val Pro Leu Thr Phe Arg Glu Cys Pro
4785 4790 4795
ccg ccg ctg tgg agc cac tgc tac ctg ctg cgt ctg gcc aac tac 25116
Pro Pro Leu Trp Ser His Cys Tyr Leu Leu Arg Leu Ala Asn Tyr
4800 4805 4810
ctg gcc tac cac tcg gac gtg atc gag gac gtc agc ggc gag gcc 25161
Leu Ala Tyr His Ser Asp Val Ile Glu Asp Val Ser Gly Glu Ala
4815 4820 4825
ctg ctc gag tgc cac tgc cgc tgc aac ctc tgc acg ccg cac cgc 25206
Leu Leu Glu Cys His Cys Arg Cys Asn Leu Cys Thr Pro His Arg
4830 4835 4840
tcc ctg gcc tgc aac ccc cag ctg ctg agc gag acc cag atc atc 25251
Ser Leu Ala Cys Asn Pro Gln Leu Leu Ser Glu Thr Gln Ile Ile
4845 4850 4855
ggc acc ttc gag ttg caa ggg ccc agc gag ggc gag ggt tca gcc 25296
Gly Thr Phe Glu Leu Gln Gly Pro Ser Glu Gly Glu Gly Ser Ala
4860 4865 4870
gcc aag ggg ggt ctg aaa ctc acc ccg ggg ctg tgg acc tcg gcc 25341
Ala Lys Gly Gly Leu Lys Leu Thr Pro Gly Leu Trp Thr Ser Ala
4875 4880 4885
tac ttg cgc aag ttc gtg ccc gag gac tac cat ccc ttc gag atc 25386
Tyr Leu Arg Lys Phe Val Pro Glu Asp Tyr His Pro Phe Glu Ile
4890 4895 4900
agg ttc tac gag gac caa tcc cag ccg ccc aag gcc gag ctg tcg 25431
Arg Phe Tyr Glu Asp Gln Ser Gln Pro Pro Lys Ala Glu Leu Ser
4905 4910 4915
gcc tgc gtc atc acc cag ggg gcg atc ctg gcc caa ttg caa gcc 25476
Ala Cys Val Ile Thr Gln Gly Ala Ile Leu Ala Gln Leu Gln Ala
4920 4925 4930
atc cag aaa tcc cgc caa gaa ttc ttg ctg aaa aag ggc cgc ggg 25521
Ile Gln Lys Ser Arg Gln Glu Phe Leu Leu Lys Lys Gly Arg Gly
4935 4940 4945
gtc tac ctc gac ccc cag acc ggt gag gag ctc aac ccc ggc ttc 25566
Val Tyr Leu Asp Pro Gln Thr Gly Glu Glu Leu Asn Pro Gly Phe
4950 4955 4960
ccc cag gat gcc ccg agg aaa caa gaa gct gaa agt gga gct gcc 25611
Pro Gln Asp Ala Pro Arg Lys Gln Glu Ala Glu Ser Gly Ala Ala
4965 4970 4975
gcc cgt gga gga ttt gga gga aga ctg gga gaa cag cag tca ggc 25656
Ala Arg Gly Gly Phe Gly Gly Arg Leu Gly Glu Gln Gln Ser Gly
4980 4985 4990
aga gga gat gga gga aga ctg gga cag cac tca ggc aga gga gga 25701
Arg Gly Asp Gly Gly Arg Leu Gly Gln His Ser Gly Arg Gly Gly
4995 5000 5005
cag cct gca aga cag tct gga gga aga cga gga gga ggc aga gga 25746
Gln Pro Ala Arg Gln Ser Gly Gly Arg Arg Gly Gly Gly Arg Gly
5010 5015 5020
gga ggt gga aga agc agc cgc cgc cag acc gtc gtc ctc ggc ggg 25791
Gly Gly Gly Arg Ser Ser Arg Arg Gln Thr Val Val Leu Gly Gly
5025 5030 5035
gga gaa agc aag cag cac gga tac cat ctc cgc tcc ggg tcg ggg 25836
Gly Glu Ser Lys Gln His Gly Tyr His Leu Arg Ser Gly Ser Gly
5040 5045 5050
tcc cgc tcg gcc cca cag tagatgggac gagaccgggc gattcccgaa 25884
Ser Arg Ser Ala Pro Gln
5055
ccccaccacc cagaccggta agaaggagcg gcagggatac aagtcctggc gggggcacaa 25944
aaacgccatc gtctcctgct tgcaggcctg cgggggcaac atctccttca cccggcgcta 26004
cctgctcttc caccgcgggg tgaacttccc ccgcaacatc ttgcattact accgtcacct 26064
ccacagcccc tactacttcc aagaagaggc agcagcggca gaaaaagacc agcagaaaac 26124
cagcagctag aaaatccaca gcggcggcag gtggactgag gatcgcggcg aacgagccgg 26184
cgcagacccg ggagctgagg aaccggatct ttcccaccct ctatgccatc ttccagcaga 26244
gtcgggggca ggagcaggaa ctgaaagtca agaaccgttc tctgcgctcg ctcacccgca 26304
gttgtctgta tcacaagagc gaagaccaac ttcagcgcac tctcgaggac gccgaggctc 26364
tcttcaacaa gtactgcgcg ctcactctta aagagtagcc cgcgcccgcc cagtcgcaga 26424
aaaaggcggg aattacgtca cctgtgccct tcgccctagc cgcctccacc catcatc 26481
atg agc aaa gag att ccc acg cct tac atg tgg agc tac cag ccc 26526
Met Ser Lys Glu Ile Pro Thr Pro Tyr Met Trp Ser Tyr Gln Pro
5060 5065 5070
cag atg ggc ctg gcc gcc ggc gcc gcc cag gac tac tcc acc cgc 26571
Gln Met Gly Leu Ala Ala Gly Ala Ala Gln Asp Tyr Ser Thr Arg
5075 5080 5085
atg aat tgg ctc agc gcc ggg ccc gcg atg atc tca cgg gtg aat 26616
Met Asn Trp Leu Ser Ala Gly Pro Ala Met Ile Ser Arg Val Asn
5090 5095 5100
gac atc cgc gcc cac cga aac cag ata ctc cta gaa cag tca gcg 26661
Asp Ile Arg Ala His Arg Asn Gln Ile Leu Leu Glu Gln Ser Ala
5105 5110 5115
ctc acc gcc acg ccc cgc aat cac ctc aat ccg cgt aat tgg ccc 26706
Leu Thr Ala Thr Pro Arg Asn His Leu Asn Pro Arg Asn Trp Pro
5120 5125 5130
gcc gcc ctg gtg tac cag gaa att ccc cag ccc acg acc gta cta 26751
Ala Ala Leu Val Tyr Gln Glu Ile Pro Gln Pro Thr Thr Val Leu
5135 5140 5145
ctt ccg cga gac gcc cag gcc gaa gtc cag ctg act aac tca ggt 26796
Leu Pro Arg Asp Ala Gln Ala Glu Val Gln Leu Thr Asn Ser Gly
5150 5155 5160
gtc cag ctg gcg ggc ggc gcc acc ctg tgt cgt cac cgc ccc gct 26841
Val Gln Leu Ala Gly Gly Ala Thr Leu Cys Arg His Arg Pro Ala
5165 5170 5175
cag ggt ata aag cgg ctg gtg atc cgg ggc aga ggc aca cag ctc 26886
Gln Gly Ile Lys Arg Leu Val Ile Arg Gly Arg Gly Thr Gln Leu
5180 5185 5190
aac gac gag gtg gtg agc tct tcg ctg ggt ctg cga cct gac gga 26931
Asn Asp Glu Val Val Ser Ser Ser Leu Gly Leu Arg Pro Asp Gly
5195 5200 5205
gtc ttc caa ctc gcc gga tcg ggg aga tct tcc ttc acg cct cgt 26976
Val Phe Gln Leu Ala Gly Ser Gly Arg Ser Ser Phe Thr Pro Arg
5210 5215 5220
cag gcg gtc ctg act ttg gaa agt tcg tcc tcg cag ccc cgc tcg 27021
Gln Ala Val Leu Thr Leu Glu Ser Ser Ser Ser Gln Pro Arg Ser
5225 5230 5235
ggc ggc atc ggc act ctc cag ttc gtg gag gag ttc act ccc tcg 27066
Gly Gly Ile Gly Thr Leu Gln Phe Val Glu Glu Phe Thr Pro Ser
5240 5245 5250
gtc tac ttc aac ccc ttc tcc ggc tcc ccc ggc cac tac ccg gac 27111
Val Tyr Phe Asn Pro Phe Ser Gly Ser Pro Gly His Tyr Pro Asp
5255 5260 5265
gag ttc atc ccg aac ttt gac gcc atc agc gag tcg gtg gac ggc 27156
Glu Phe Ile Pro Asn Phe Asp Ala Ile Ser Glu Ser Val Asp Gly
5270 5275 5280
tac gat tga atg tcc cat ggt ggc gcg gct gac cta gct cgg ctt 27201
Tyr Asp Met Ser His Gly Gly Ala Ala Asp Leu Ala Arg Leu
5285 5290 5295
cga cac ctt gac cac tgc cgc cgc ttt cgc tgc ttc gca cgg gac 27246
Arg His Leu Asp His Cys Arg Arg Phe Arg Cys Phe Ala Arg Asp
5300 5305 5310
ctc gcc gag ttc acc tac ttc gag ctg ccc gag gag cat cct cag 27291
Leu Ala Glu Phe Thr Tyr Phe Glu Leu Pro Glu Glu His Pro Gln
5315 5320 5325
ggc ccg gcc cac gga gtg cgg atc gtc gtc gaa ggg ggc cta gac 27336
Gly Pro Ala His Gly Val Arg Ile Val Val Glu Gly Gly Leu Asp
5330 5335 5340
tcc cac ctg ctt cgg atc ttc agc cag cgc ccg atc ctg gtc gag 27381
Ser His Leu Leu Arg Ile Phe Ser Gln Arg Pro Ile Leu Val Glu
5345 5350 5355
cgc caa cag ggc aac acc ctc ctg acc ctc tac tgc atc tgc gac 27426
Arg Gln Gln Gly Asn Thr Leu Leu Thr Leu Tyr Cys Ile Cys Asp
5360 5365 5370
cac ccc ggc ctg cat gaa agt ctt tgt tgt ctg ctg tgt act gag 27471
His Pro Gly Leu His Glu Ser Leu Cys Cys Leu Leu Cys Thr Glu
5375 5380 5385
tat aat aaa agc tgagatcagc gactactccg gactcaactg tggtgtttct 27523
Tyr Asn Lys Ser
5390
gcatccatca atcggtcact gaccttcacc gggaacgaga ccgagctcca gctccagtgt 27583
aagccccaca agaagtacct cacctggctg taccagggct ccccgatcgc cgttgttaac 27643
cactgcgacg acgacggagt cctgctgaac ggccccgcca accttacttt ttccacccgc 27703
agaagcaagc tcgagctctt ccaacccttc ctccccggga cctatcagtg catctcggga 27763
ccctgccatc acaccttcca cctgatcccg aataccacct cttccccagc gccgctcccc 27823
actaacaacc aaactaacca ccaccaacgc taccgacgcg acctcgttga atctaatacc 27883
acccacaccg gaggtgagct ccgaggtcct gaatcctctg ggatttatta cggcccctgg 27943
gaggtggtgg ggttaatagc tttaggctta gtagcgggtg ggcttttggc tctctgctac 28003
ctatacctcc cttgcttttc ctacttagtg gtgctttgtt gctggtttaa gaa atg 28059
Met
ggg aag atc acc cta gtg tgc ggt gtg ctg gtg acg gtg gtg ctt 28104
Gly Lys Ile Thr Leu Val Cys Gly Val Leu Val Thr Val Val Leu
5395 5400 5405
tcg att ctg gga ggg gga agc gcg gct gta gtg acg gag aag aag 28149
Ser Ile Leu Gly Gly Gly Ser Ala Ala Val Val Thr Glu Lys Lys
5410 5415 5420
gcc gat ccc tgc ttg act ttc aac ccc gat aaa tgc cgg ctg agt 28194
Ala Asp Pro Cys Leu Thr Phe Asn Pro Asp Lys Cys Arg Leu Ser
5425 5430 5435
ttt cag ccc gat ggc aat cgg tgc gcg gtg ttg atc aag tgc gga 28239
Phe Gln Pro Asp Gly Asn Arg Cys Ala Val Leu Ile Lys Cys Gly
5440 5445 5450
tgg gaa tgc gag agt gtg gcg att cag tat aaa aac aag acg cgg 28284
Trp Glu Cys Glu Ser Val Ala Ile Gln Tyr Lys Asn Lys Thr Arg
5455 5460 5465
aac aat act ctc gcg tcc aca tgg cag ccc ggg gac ccc gag tgg 28329
Asn Asn Thr Leu Ala Ser Thr Trp Gln Pro Gly Asp Pro Glu Trp
5470 5475 5480
tac acc gtc tct gtc cct ggt gct gac ggc tcc ctc cgc acg gtg 28374
Tyr Thr Val Ser Val Pro Gly Ala Asp Gly Ser Leu Arg Thr Val
5485 5490 5495
aac aac act ttc att ttt gag cac atg tgc gat acc gcc atg ttc 28419
Asn Asn Thr Phe Ile Phe Glu His Met Cys Asp Thr Ala Met Phe
5500 5505 5510
atg agc aag cag tac ggt atg tgg ccc cca cga aaa gag aat atc 28464
Met Ser Lys Gln Tyr Gly Met Trp Pro Pro Arg Lys Glu Asn Ile
5515 5520 5525
gtg gtc ttc tcc atc gct tac agc gcg tgc acg gtg cta atc acc 28509
Val Val Phe Ser Ile Ala Tyr Ser Ala Cys Thr Val Leu Ile Thr
5530 5535 5540
gcg atc gtg tgc ctg agc att cac atg ctc atc gct att cgc ccc 28554
Ala Ile Val Cys Leu Ser Ile His Met Leu Ile Ala Ile Arg Pro
5545 5550 5555
aga aat aat gcc gag aaa gag aaa cag cca taacacactt ttttcacaca 28604
Arg Asn Asn Ala Glu Lys Glu Lys Gln Pro
5560 5565
cctttttcag acc atg gcc tct gtt aca atc ctt att tat ttt ttg ggc 28653
Met Ala Ser Val Thr Ile Leu Ile Tyr Phe Leu Gly
5570 5575
ctt gtg ggc act atc agc agt ttc gac cat aaa aac gta act gct 28698
Leu Val Gly Thr Ile Ser Ser Phe Asp His Lys Asn Val Thr Ala
5580 5585 5590
tat gtt ggt tct aac tgt gta cta tct ggg tac cag tca cat cag 28743
Tyr Val Gly Ser Asn Cys Val Leu Ser Gly Tyr Gln Ser His Gln
5595 5600 5605
cgg gtt tca tgg tac tgg ttt gat aaa aag aac aca gct tat aca 28788
Arg Val Ser Trp Tyr Trp Phe Asp Lys Lys Asn Thr Ala Tyr Thr
5610 5615 5620
ctc tgc aaa ggc tat cag cag ccc aca cat cgc agt ggt ctt tat 28833
Leu Cys Lys Gly Tyr Gln Gln Pro Thr His Arg Ser Gly Leu Tyr
5625 5630 5635
tac agc tgc acc aat aat aat atc aca cta ctt caa gta acc aac 28878
Tyr Ser Cys Thr Asn Asn Asn Ile Thr Leu Leu Gln Val Thr Asn
5640 5645 5650
caa tat tct ggg acc tac tat gga acc aat ttt aac aca aaa cag 28923
Gln Tyr Ser Gly Thr Tyr Tyr Gly Thr Asn Phe Asn Thr Lys Gln
5655 5660 5665
gac act tac tat agt gtc aga gta ttg gat cca act act ccc aga 28968
Asp Thr Tyr Tyr Ser Val Arg Val Leu Asp Pro Thr Thr Pro Arg
5670 5675 5680
act act act aaa cat acc aca act aag aag ccc act aca cct aaa 29013
Thr Thr Thr Lys His Thr Thr Thr Lys Lys Pro Thr Thr Pro Lys
5685 5690 5695
aag cct acc acg ccc aaa acc act aag aca aca act gct aag cag 29058
Lys Pro Thr Thr Pro Lys Thr Thr Lys Thr Thr Thr Ala Lys Gln
5700 5705 5710
acc act acc aca gag cca acc aca acc agc acc aca ctt gct ata 29103
Thr Thr Thr Thr Glu Pro Thr Thr Thr Ser Thr Thr Leu Ala Ile
5715 5720 5725
act aca cac act gag ctg acc tca cag gca act act gaa aat ggt 29148
Thr Thr His Thr Glu Leu Thr Ser Gln Ala Thr Thr Glu Asn Gly
5730 5735 5740
ttt gcc cta ttg caa aag ggg gag aac agt agc agc agt cct ctg 29193
Phe Ala Leu Leu Gln Lys Gly Glu Asn Ser Ser Ser Ser Pro Leu
5745 5750 5755
cct act acc ccc agt gag gaa ata ccc aag tcc atg gtt ggc att 29238
Pro Thr Thr Pro Ser Glu Glu Ile Pro Lys Ser Met Val Gly Ile
5760 5765 5770
atc gct gct gta gtg gtg tgt atg gtg att atc atc ttg tgc atg 29283
Ile Ala Ala Val Val Val Cys Met Val Ile Ile Ile Leu Cys Met
5775 5780 5785
atg tac tat gcc tgc tac tac aga aaa cac agg cta aac aat aag 29328
Met Tyr Tyr Ala Cys Tyr Tyr Arg Lys His Arg Leu Asn Asn Lys
5790 5795 5800
ctg gac ccc cta ctg aat gtt gat ttt taatttttta gaacc atg aag 29376
Leu Asp Pro Leu Leu Asn Val Asp Phe Met Lys
5805 5810 5815
atc cta agc ctt ttt gtt ttt tct atc att acc tct gct att tgt 29421
Ile Leu Ser Leu Phe Val Phe Ser Ile Ile Thr Ser Ala Ile Cys
5820 5825 5830
aaa tca gtg gat aag gac gtt act gtc acc act ggc tct aat tat 29466
Lys Ser Val Asp Lys Asp Val Thr Val Thr Thr Gly Ser Asn Tyr
5835 5840 5845
aca cta aaa ggg cct tcc tca ggt atg ctt tcg tgg tat tgt tat 29511
Thr Leu Lys Gly Pro Ser Ser Gly Met Leu Ser Trp Tyr Cys Tyr
5850 5855 5860
ttt gga aat gat gat aaa cag aca gag ctt tgt aat ttc cag aac 29556
Phe Gly Asn Asp Asp Lys Gln Thr Glu Leu Cys Asn Phe Gln Asn
5865 5870 5875
gga aaa acc aaa aat tct aaa ata gat aac tat caa tgc cat ggt 29601
Gly Lys Thr Lys Asn Ser Lys Ile Asp Asn Tyr Gln Cys His Gly
5880 5885 5890
act gat tta gta ctg atg aat atc acg aaa gca tat gct ggc agt 29646
Thr Asp Leu Val Leu Met Asn Ile Thr Lys Ala Tyr Ala Gly Ser
5895 5900 5905
tat tcc tgt cct gga caa aac acc gaa gaa atg att ttt tac aaa 29691
Tyr Ser Cys Pro Gly Gln Asn Thr Glu Glu Met Ile Phe Tyr Lys
5910 5915 5920
ttg att gtg gtt gat ccc act aca cct cca ccc acc aca act acc 29736
Leu Ile Val Val Asp Pro Thr Thr Pro Pro Pro Thr Thr Thr Thr
5925 5930 5935
aat gca cct acc aca gac aca cag gaa acc act cca gag gca gta 29781
Asn Ala Pro Thr Thr Asp Thr Gln Glu Thr Thr Pro Glu Ala Val
5940 5945 5950
gca gag tta gca aag cag att cat gaa gat tcc ttt gtt gct aat 29826
Ala Glu Leu Ala Lys Gln Ile His Glu Asp Ser Phe Val Ala Asn
5955 5960 5965
act ccc aca cac ccc gga ccg caa tgt cca ggg tta gta gtc agc 29871
Thr Pro Thr His Pro Gly Pro Gln Cys Pro Gly Leu Val Val Ser
5970 5975 5980
ggc att gtc ggt gtg ctt tgc ggg tta gca gtt ata atc atc tgc 29916
Gly Ile Val Gly Val Leu Cys Gly Leu Ala Val Ile Ile Ile Cys
5985 5990 5995
atg ttc att ttt gct tgc tgc tac aga agg ctt cac cga caa aaa 29961
Met Phe Ile Phe Ala Cys Cys Tyr Arg Arg Leu His Arg Gln Lys
6000 6005 6010
tca gac cca ctg ctg aac ctc tat gtt taatttttga ttttccagag cc 30010
Ser Asp Pro Leu Leu Asn Leu Tyr Val
6015
atg aag gca ctt agc act tta gta ttt ttg tcc ttg att ggc att 30055
Met Lys Ala Leu Ser Thr Leu Val Phe Leu Ser Leu Ile Gly Ile
6020 6025 6030
gtt ttc agt gct ggg ttt ttg aaa aat ctt acc att att gaa ggc 30100
Val Phe Ser Ala Gly Phe Leu Lys Asn Leu Thr Ile Ile Glu Gly
6035 6040 6045
gat aat gca aca ctg gta gga atc agt ggt cag aat gtt agt tgg 30145
Asp Asn Ala Thr Leu Val Gly Ile Ser Gly Gln Asn Val Ser Trp
6050 6055 6060
cta aaa tat cat cta gat ggg tgg aaa cct att tgc acc tgg aat 30190
Leu Lys Tyr His Leu Asp Gly Trp Lys Pro Ile Cys Thr Trp Asn
6065 6070 6075
gtc agt gtg tac aca tgt cat ggt gtt aac ctc acc att acc aat 30235
Val Ser Val Tyr Thr Cys His Gly Val Asn Leu Thr Ile Thr Asn
6080 6085 6090
gcc acc caa gat cag aat ggc agg ttt aag ggt cag agt ttc act 30280
Ala Thr Gln Asp Gln Asn Gly Arg Phe Lys Gly Gln Ser Phe Thr
6095 6100 6105
agc aac aat ggc tat gaa acc cat aac atg ttc atc tat gat gtc 30325
Ser Asn Asn Gly Tyr Glu Thr His Asn Met Phe Ile Tyr Asp Val
6110 6115 6120
act gtc ata tca aat aag act aca cct acc acc caa aca ccc act 30370
Thr Val Ile Ser Asn Lys Thr Thr Pro Thr Thr Gln Thr Pro Thr
6125 6130 6135
aca cat agc tca act cat gcc atg cag acc act cag acc acc act 30415
Thr His Ser Ser Thr His Ala Met Gln Thr Thr Gln Thr Thr Thr
6140 6145 6150
tac act aca tcc att cag ccc acc acc act aca gca gag gta acc 30460
Tyr Thr Thr Ser Ile Gln Pro Thr Thr Thr Thr Ala Glu Val Thr
6155 6160 6165
agc aca gcg cct cag ccc caa gca ttg gct ttg atg gct gca cag 30505
Ser Thr Ala Pro Gln Pro Gln Ala Leu Ala Leu Met Ala Ala Gln
6170 6175 6180
cct agc agc atg act gct aaa acc aat gag cag act act gaa ttt 30550
Pro Ser Ser Met Thr Ala Lys Thr Asn Glu Gln Thr Thr Glu Phe
6185 6190 6195
ttg tcc act act cag agc agc acc aca gct acc tcg agt gcc ttc 30595
Leu Ser Thr Thr Gln Ser Ser Thr Thr Ala Thr Ser Ser Ala Phe
6200 6205 6210
tct agc acc gcc aat ctc acc tcg ctt tct tct acg cca atc agt 30640
Ser Ser Thr Ala Asn Leu Thr Ser Leu Ser Ser Thr Pro Ile Ser
6215 6220 6225
aat gct act acc tcc ccc gct cct ctt ccc act cct ctg aag caa 30685
Asn Ala Thr Thr Ser Pro Ala Pro Leu Pro Thr Pro Leu Lys Gln
6230 6235 6240
tcc gag tct agc acg cag ctg cag atc acc ctg ctc att gtg atc 30730
Ser Glu Ser Ser Thr Gln Leu Gln Ile Thr Leu Leu Ile Val Ile
6245 6250 6255
ggg gtg gtc atc ctg gca gtg ctg ctc tac ttt atc ttc tgc cgc 30775
Gly Val Val Ile Leu Ala Val Leu Leu Tyr Phe Ile Phe Cys Arg
6260 6265 6270
cgc atc ccc aac gcg aaa ccg gcc tac aag ccc att gtt atc ggg 30820
Arg Ile Pro Asn Ala Lys Pro Ala Tyr Lys Pro Ile Val Ile Gly
6275 6280 6285
acg ccg gaa ccg ctt cag gtg gag gga ggt cta agg aat ctt ctc 30865
Thr Pro Glu Pro Leu Gln Val Glu Gly Gly Leu Arg Asn Leu Leu
6290 6295 6300
ttc tct ttt aca gta tgg tgatttgaac t atg att cct aga cat ttc 30912
Phe Ser Phe Thr Val Trp Met Ile Pro Arg His Phe
6305 6310 6315
att atc act tct cta atc tgt gtg ctc caa gtc tgt gcc acc ctc 30957
Ile Ile Thr Ser Leu Ile Cys Val Leu Gln Val Cys Ala Thr Leu
6320 6325 6330
gct ctc gtg gct aac gcg agt cca gac tgc att gga gcg ttc gcc 31002
Ala Leu Val Ala Asn Ala Ser Pro Asp Cys Ile Gly Ala Phe Ala
6335 6340 6345
tcc tac gtg ctc ttt gcc ttc atc acc tgc atc tgc tgc tgt agc 31047
Ser Tyr Val Leu Phe Ala Phe Ile Thr Cys Ile Cys Cys Cys Ser
6350 6355 6360
ata gtc tgc ctg ctt atc acc ttc ttc cag ttc gtt gac tgg gtc 31092
Ile Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Val Asp Trp Val
6365 6370 6375
ttt gtg cgc atc gcc tac ctg cgc cac cat ccc cag tac cgc gac 31137
Phe Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg Asp
6380 6385 6390
cag aga gtg gcg caa ctg ttg aga ctc atc tg atg ata agc atg cgg 31184
Gln Arg Val Ala Gln Leu Leu Arg Leu Ile Met Ile Ser Met Arg
6395 6400 6405
gct ctg cta cta ctc ctc gcg ctt ctg cta gtt ccc ctc gcc gcc 31229
Ala Leu Leu Leu Leu Leu Ala Leu Leu Leu Val Pro Leu Ala Ala
6410 6415 6420
ccc tta tcc ttc aaa tcc ccc acc cag tcc cct gaa gag gtt cga 31274
Pro Leu Ser Phe Lys Ser Pro Thr Gln Ser Pro Glu Glu Val Arg
6425 6430 6435
aaa tgt aaa ttc caa gaa ccc tgg aaa ttc ctt tca tgc tac aaa 31319
Lys Cys Lys Phe Gln Glu Pro Trp Lys Phe Leu Ser Cys Tyr Lys
6440 6445 6450
ctc aaa tca gaa atg cac ccc agc tgg atc atg atc att gga atc 31364
Leu Lys Ser Glu Met His Pro Ser Trp Ile Met Ile Ile Gly Ile
6455 6460 6465
gtg aac atc ctt gcc tgt acc ctc ttc tcc ttt gtg att tac ccc 31409
Val Asn Ile Leu Ala Cys Thr Leu Phe Ser Phe Val Ile Tyr Pro
6470 6475 6480
cgc ttt gac ttt ggg tgg aac gca ccc gag gcg ctc tgg ctc ccg 31454
Arg Phe Asp Phe Gly Trp Asn Ala Pro Glu Ala Leu Trp Leu Pro
6485 6490 6495
cct gat ccc gac ata cca cca cag cag cag caa aat cag gca cac 31499
Pro Asp Pro Asp Ile Pro Pro Gln Gln Gln Gln Asn Gln Ala His
6500 6505 6510
gca cca cca cca cag cct agg cca caa tac atg ccc atc tta gac 31544
Ala Pro Pro Pro Gln Pro Arg Pro Gln Tyr Met Pro Ile Leu Asp
6515 6520 6525
tat gag gcc gag cca cag cga gcc atg ctt cct gct att agt tac 31589
Tyr Glu Ala Glu Pro Gln Arg Ala Met Leu Pro Ala Ile Ser Tyr
6530 6535 6540
ttc aac cta acc ggc gga gat gac tgaccccatg gccaacaaca ccgtcaacga 31643
Phe Asn Leu Thr Gly Gly Asp Asp
6545
cctcctggac atggacggcc gcgcctcgga gcagcgactc gcccaactcc gcatccgcca 31703
gcagcaggag agagccgtca aggagctgca ggatgcggtg gctatccacc agtgcaagag 31763
aggcatcttc tgcctggtga agcaggccaa gatcaccttc gaggtgactt ccaccgacca 31823
tcgcctctcc tacgagctcc tgcagcagcg ccagaagttc acctgcctgg tcggagtcaa 31883
ccccatcgtc atcacccagc agtctggcga taccaagggg tgcatccact gctcctgcga 31943
ctcccccgag tgccttcaca ccctggtcaa gaccctctgc ggcctccgcg acctcctccc 32003
catgaactaa tcaactaacc cctacccctt taccctccag taaaaaataa agattaaaaa 32063
atgattgaat tgatcaataa agaatcactt acttgaaatc tgaaaccagg tctctgtcca 32123
tgttttctgt cagcagcact tcactcccct cttcccagct ctggtactgc aggccccggc 32183
gggctgcaaa cttcctccac actctgaagg ggatgtcaaa ttcctcctgt ccctcaatct 32243
tcatttttat cttctatcag atg tcc aaa aag cgc gcg cgg gtg gat gat 32293
Met Ser Lys Lys Arg Ala Arg Val Asp Asp
6550 6555
ggc ttc gac ccc gtg tac ccc tac gat gca gac aac gca ccg act 32338
Gly Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro Thr
6560 6565 6570
gtg ccc ttc atc aac cct ccc ttc gtc tct tca gat gga ttc caa 32383
Val Pro Phe Ile Asn Pro Pro Phe Val Ser Ser Asp Gly Phe Gln
6575 6580 6585
gaa aag ccc ttg ggg gtg ttg tcc ctg cgc ctg gcc gac ccc gtc 32428
Glu Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp Pro Val
6590 6595 6600
acc acc aag aac ggg gct gtc acc ctc aag ctg ggg gag ggg gtg 32473
Thr Thr Lys Asn Gly Ala Val Thr Leu Lys Leu Gly Glu Gly Val
6605 6610 6615
gac ctc gac gac tcg gga aaa ctc att gca aac aca gtc agc aag 32518
Asp Leu Asp Asp Ser Gly Lys Leu Ile Ala Asn Thr Val Ser Lys
6620 6625 6630
gcc att gcc cct ctc agt ttt tcc aac aac acc att tcc ctt aac 32563
Ala Ile Ala Pro Leu Ser Phe Ser Asn Asn Thr Ile Ser Leu Asn
6635 6640 6645
atg gat acc cct tta tat acc aaa gat gga aaa cta tcc tta caa 32608
Met Asp Thr Pro Leu Tyr Thr Lys Asp Gly Lys Leu Ser Leu Gln
6650 6655 6660
gtt tct cca cca tta aat ata tta aga tca acg att ctg aac aca 32653
Val Ser Pro Pro Leu Asn Ile Leu Arg Ser Thr Ile Leu Asn Thr
6665 6670 6675
tta gct cta gct ttt ggc tca ggt tta gga ctc agt ggc agc gcc 32698
Leu Ala Leu Ala Phe Gly Ser Gly Leu Gly Leu Ser Gly Ser Ala
6680 6685 6690
ctg gca gta cag tta gcc tct cca ctt aca ttt gat gat aaa gga 32743
Leu Ala Val Gln Leu Ala Ser Pro Leu Thr Phe Asp Asp Lys Gly
6695 6700 6705
aat ata aag att acc cta gac agg gga ttg cat gtt acg aca gga 32788
Asn Ile Lys Ile Thr Leu Asp Arg Gly Leu His Val Thr Thr Gly
6710 6715 6720
aat gca att gaa agc aac ata agc tgg gct aaa ggt ata aaa ttt 32833
Asn Ala Ile Glu Ser Asn Ile Ser Trp Ala Lys Gly Ile Lys Phe
6725 6730 6735
gaa gat gga gcc ata gct gcc aac att ggt aaa ggg cta gag ttt 32878
Glu Asp Gly Ala Ile Ala Ala Asn Ile Gly Lys Gly Leu Glu Phe
6740 6745 6750
ggc acc agt agt aca gta aca ggt gtt gac gac gct tat cca ata 32923
Gly Thr Ser Ser Thr Val Thr Gly Val Asp Asp Ala Tyr Pro Ile
6755 6760 6765
caa gtt aaa ctt gga tct ggt ctt agc ttt gat agc aca ggg gct 32968
Gln Val Lys Leu Gly Ser Gly Leu Ser Phe Asp Ser Thr Gly Ala
6770 6775 6780
atc atg gct ggc aat aag gag gat gac aaa ctt act tta tgg aca 33013
Ile Met Ala Gly Asn Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr
6785 6790 6795
aca cct gat cca tcg cca aat tgt caa ata ctc gca gaa aat gat 33058
Thr Pro Asp Pro Ser Pro Asn Cys Gln Ile Leu Ala Glu Asn Asp
6800 6805 6810
gca aaa cta aca ctt tgc tta aca aag tgt ggt agt caa ata ttg 33103
Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln Ile Leu
6815 6820 6825
gcc act gta tca gtc tta gtg gta gga agt gga aag tta aac cca 33148
Ala Thr Val Ser Val Leu Val Val Gly Ser Gly Lys Leu Asn Pro
6830 6835 6840
att act ggg gaa gta agc agt gct caa gtc ttt cta cga ttt gat 33193
Ile Thr Gly Glu Val Ser Ser Ala Gln Val Phe Leu Arg Phe Asp
6845 6850 6855
tct aat gga gta ctc tta gta aac agc tcc aca ttg aaa aag tat 33238
Ser Asn Gly Val Leu Leu Val Asn Ser Ser Thr Leu Lys Lys Tyr
6860 6865 6870
tgg ggg tat agg cag ggt gac agc ata gat ggt act cca tac acc 33283
Trp Gly Tyr Arg Gln Gly Asp Ser Ile Asp Gly Thr Pro Tyr Thr
6875 6880 6885
aac gcc gta ggt ttc atg cca aat tta aaa gct tac cca aag tcc 33328
Asn Ala Val Gly Phe Met Pro Asn Leu Lys Ala Tyr Pro Lys Ser
6890 6895 6900
caa agt tct act act aaa aat aat ata gtg gga caa gta tac atg 33373
Gln Ser Ser Thr Thr Lys Asn Asn Ile Val Gly Gln Val Tyr Met
6905 6910 6915
aat ggt gat aat tca aaa cct atg ctt ctt act ata act ctc aat 33418
Asn Gly Asp Asn Ser Lys Pro Met Leu Leu Thr Ile Thr Leu Asn
6920 6925 6930
ggc act gat gat acc acc agt gca tat tca atg tca ttt tca tac 33463
Gly Thr Asp Asp Thr Thr Ser Ala Tyr Ser Met Ser Phe Ser Tyr
6935 6940 6945
acc tgg act aat gga agt tat acc gga gca aca ttt gga gct aac 33508
Thr Trp Thr Asn Gly Ser Tyr Thr Gly Ala Thr Phe Gly Ala Asn
6950 6955 6960
tca tac aca ttt tcc tac att gcc caa gaa taatcccacc ctgcatgcca 33558
Ser Tyr Thr Phe Ser Tyr Ile Ala Gln Glu
6965 6970
accccttttc ccactctata aatggaactg aaacaaaaat aaagttcaag tgtttttatt 33618
gattcaacag ttttcacagg attcgagtag ttattttccc tcctccctcc caactcatgg 33678
aatacaccac cctctcccca cgcacagcct taaacatctg aatgccattg gtaatggaca 33738
tggttttggt ctccacattc cacacagttt cagagcgagc cagtctcggg tcggtcaggg 33798
agatgaaacc ctccgggcac tcctgcatct gcacctcaaa gttcagtagc tgagggctgt 33858
cctcggtggt cgggatcaca gttatctgga agaagagcga tgagagtcat aatccgcgaa 33918
cgggatcggg cggttgtggc gcatcaggcc ccgcagcagt cgctgtctgc gccgctccgt 33978
caagctgctg cttaaggggt ccgggtccag ggactccctg cgcatgatgc caatggccct 34038
gagcatcagt cgcctggtgc ggcgggcgca gcagcggatg cggatctcac tcaggtcgga 34098
gcagtacgtg cagcacagca ctaccaagtt gttcaacagt ccatagttca acgtgctcca 34158
gccaaaactc atctgtggaa ctatgctgcc cacatgtcca tcgtaccaga tcctgatgta 34218
aatcaggtgg cgccccctcc agaacacact gcccatgtac atgatctcct tgggcatatg 34278
caggttcacc acctcccggt accacatcac ccgctggttg aacatgcagc cctggataat 34338
cctgcggaac cagatggcca gtaccgcccc gcccgccatg cagcgaaggg accccgggtc 34398
ctgacagtgg cagtggatga tccaccgctc gcggccgtgg atcaactggg aactgaacaa 34458
gtctatgttg gcacagcaca ggcacacgct catgcatgtc ttcagcactc tcagctcctc 34518
gggggtcagg accatgtccc agggcacggg gaactcttgc aggacagtga acccggcaga 34578
acaaggcaac cctcgcacac aacttacatt gtgcatggac agggtatcgc aatcaggcag 34638
caccggatga tcctccacca gagaagcgcg ggtctcggtc tcctcacagc gaggtaaggg 34698
ggccgggggt tggtacggat gatggcggga tgacgctaat cgtgttctgg atcgtgtcat 34758
gatggagctg tttcctgaca ttttcgtact tcacgaagca gaacctggta cgggcactgc 34818
acaccgctcg tcggcgacgg tctcggcgct tcgagcgctc ggtgttgaag ttatagaaca 34878
gccactccct cagagcatgc agtatctcct gagcctcttg ggtgatgaaa atcccatccg 34938
ccctgatggc tctgatcaca tcgaccacgg tggaatgggc cagacccagc cagatgatgc 34998
aattttgttg ggtttcggtg acggcggggg agggaagaac aggaagaacc atgattaact 35058
ttattccaaa cggtctcgga gcacttcaaa atgcaggtcc cggagatggc acctctcgcc 35118
cccactgtgt tggtggaaaa taacagccag gtcaaaggtg acacggttct cgagatgttc 35178
cacggtggct tccagtaaag cctccacgcg cacatccaga aacaagagga cagcgaaagc 35238
gggagcgttt tctaattcct caatcatcat attacactcc tgcaccatcc ccagataatt 35298
ttcatttttc cagccttgaa tgattcgtat tagttcctgg ggtaaatcca agccagccat 35358
gataaaaagc tcgcgcagag cgccctccac cggcattctt aagcacactc tcataattcc 35418
aagagattct gctcctggtt cacctgcagc agattaacaa tgggaatatc aaaatctctg 35478
ccgcgatccc taagctcctc cctcaacaat aactgtatgt aatctttcat gtcatctccg 35538
aaatttttag ccatagggcc gccaggaata agagaagggc aagccacatt acagataaag 35598
cgaagtcctc cccagtgagc attgccaaat gtaagattga aataagcatg ctggctagac 35658
ccggtgatat cctccagata actggacaga aaatcaggca agcaattttt aagaaaatca 35718
acaaaagaaa agtcgtccag gtgcacgttt agagcctcgg gaacaacgat ggaataagtg 35778
caaggagtgc gctccagcat ggttagtgtt ttttggtgat ctgtagaaca aaaaaataaa 35838
catgcaatat taaaccatgc tagcctggcg aacaggtggg taaatcactc tttccagcac 35898
caggcaggct acggggtctc cggcgcgacc ctcgtagaaa ctgtcgccat gattgaaaag 35958
catcaccgag agaccttccc ggtgaccggc atggatgatt cgagaagaag catacactcc 36018
gggaacattg gcgtccgtga gtgaaaaaaa gcgacctata aagcctcgag ggactacaat 36078
gctcaatctc aattccagca aagcgacccc atgcggatga agcacaaaat tggcaggtgc 36138
gtaaaaaatg taattactcc cctcctgcac aggcagcaaa gcccccgctc cctccagaaa 36198
cacatacaaa gcctcagcgt ccatagctta ccgagcacgg caggcgcaag agtcagagaa 36258
aaggctgagc tctaacctga ctgcccgctc ctgtgctcaa tatatagccc taacctacac 36318
tgacgtaaag gccaaagtct aaaaataccc gccaaaatga cacacacgcc cagcacacgc 36378
ccagaaaccg gtgacacact caaaaaaata cgtgcgcttc ctcaaacgcc caaaccggcg 36438
tcatttccgg gttcccacgc tacgtcacct atcagcgact ttcaaattcc gtcgaccgtt 36498
aaaaacgtca ctcgccccgc ccctaacggt cgcccttccc gcagccaatc acaacccttc 36558
ctccccaaat tcaaacgcct catttgcata ttaacgcgca caaaaagttt gaggtatatt 36618
attgatgatg 36628
<210> 163
<211> 499
<212> PRT
<213> Simian adenovirus 26
<400> 163
Met Glu Ser Arg Asn Pro Phe Gln Gln Gly Leu Pro Ala Gly Phe Leu
1 5 10 15
Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn
20 25 30
Leu Arg Leu Leu Ala Gly Thr Ala Ala Arg His Ser Glu Asp Pro Glu
35 40 45
Ser Pro Gly Glu Ser Gln Gly Thr Pro Thr Ser Pro Ala Ala Ala Ala
50 55 60
Ala Ala Gly Gly Gly Ser Arg Arg Glu Pro Glu Ser Arg Pro Gly Pro
65 70 75 80
Ser Gly Gly Gly Gly Val Ala Asp Leu Phe Pro Glu Leu Arg Arg Val
85 90 95
Leu Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly Ile Lys Arg Glu Arg
100 105 110
His Asp Glu Thr Asn His Arg Thr Glu Leu Thr Val Gly Leu Met Ser
115 120 125
Arg Lys Arg Pro Glu Thr Val Trp Trp His Glu Val Gln Ser Thr Gly
130 135 140
Thr Asp Glu Val Ser Val Met His Glu Arg Phe Ser Leu Glu Gln Val
145 150 155 160
Lys Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp Glu Val Ala Ile Arg
165 170 175
Asn Tyr Ala Lys Leu Ala Leu Arg Pro Asp Lys Lys Tyr Lys Ile Thr
180 185 190
Lys Leu Ile Asn Ile Arg Asn Ala Cys Tyr Ile Ser Gly Asn Gly Ala
195 200 205
Glu Val Glu Ile Cys Leu Gln Asp Arg Val Ala Phe Arg Cys Cys Met
210 215 220
Met Asn Met Tyr Pro Gly Val Val Gly Met Asp Gly Val Thr Phe Met
225 230 235 240
Asn Met Arg Phe Arg Gly Asp Gly Tyr Asn Gly Thr Val Phe Met Ala
245 250 255
Asn Thr Lys Leu Thr Val His Gly Cys Ser Phe Phe Gly Phe Asn Asn
260 265 270
Thr Cys Ile Glu Ala Trp Gly Gln Val Gly Val Arg Gly Cys Ser Phe
275 280 285
Ser Ala Asn Trp Met Gly Val Val Gly Arg Thr Lys Ser Met Leu Ser
290 295 300
Val Lys Lys Cys Leu Phe Glu Arg Cys His Leu Gly Val Met Ser Glu
305 310 315 320
Gly Glu Ala Arg Ile Arg His Cys Ala Ser Thr Glu Thr Gly Cys Phe
325 330 335
Val Leu Cys Lys Gly Asn Ala Lys Ile Lys His Asn Met Ile Cys Gly
340 345 350
Ala Ser Asp Glu Arg Gly Tyr Gln Met Leu Thr Cys Ala Gly Gly Asn
355 360 365
Ser His Met Leu Ala Thr Val His Val Ala Ser His Ser Arg Lys Pro
370 375 380
Trp Pro Glu Phe Glu His Asn Val Met Thr Arg Cys Asn Met His Leu
385 390 395 400
Gly Ser Arg Arg Gly Met Phe Met Pro Tyr Gln Cys Asn Leu Asn Tyr
405 410 415
Val Lys Val Leu Leu Glu Pro Asp Ala Met Ser Arg Val Ser Leu Thr
420 425 430
Gly Val Phe Asp Met Asn Val Glu Val Trp Lys Ile Leu Arg Tyr Asp
435 440 445
Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys Gly Gly Lys His Ala
450 455 460
Arg Phe Gln Pro Val Cys Val Glu Val Thr Glu Asp Leu Arg Pro Asp
465 470 475 480
His Leu Val Leu Ser Cys Thr Gly Thr Glu Phe Gly Ser Ser Gly Glu
485 490 495
Glu Ser Asp
<210> 164
<211> 142
<212> PRT
<213> Simian adenovirus 26
<400> 164
Met Ser Gly Ser Gly Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu
1 5 10 15
Thr Gly Arg Leu Pro Ser Trp Ala Gly Val Arg Gln Asn Val Met Gly
20 25 30
Ser Thr Val Asp Gly Arg Pro Val Gln Pro Ala Asn Ser Ser Thr Leu
35 40 45
Thr Tyr Ala Thr Leu Ser Ser Ser Ser Val Asp Ala Ala Ala Ala Ala
50 55 60
Ala Ala Ala Ser Ala Ala Ser Ala Val Arg Gly Met Ala Met Gly Ala
65 70 75 80
Gly Tyr Tyr Gly Thr Leu Val Ala Asn Ser Ser Ser Thr Asn Asn Pro
85 90 95
Ala Ser Leu Asn Glu Glu Lys Leu Leu Leu Leu Met Ala Gln Leu Glu
100 105 110
Ala Leu Thr Gln Arg Leu Gly Glu Leu Thr Gln Gln Val Ala Gln Leu
115 120 125
Gln Glu Gln Thr Arg Ala Ala Val Ala Thr Val Lys Ser Lys
130 135 140
<210> 165
<211> 464
<212> PRT
<213> Simian adenovirus 26
<400> 165
Met Lys Leu Val Ser Ala Glu Ser Gly Arg Pro Arg Trp Leu Ala Ala
1 5 10 15
Val Val Trp Arg Arg Ile Ala Arg Val Ala Leu Arg Cys Ala Pro Val
20 25 30
Arg Ser Arg Pro Asp Ser Ala Ala Asn Glu Gly Val Ala Ala Pro Ser
35 40 45
Phe Pro Arg Pro Pro Ser Gln Pro Thr Ser Pro Val Thr Glu Arg Ala
50 55 60
Pro Leu Leu Phe Phe Cys Phe Cys Gln Met His Pro Val Leu Arg Gln
65 70 75 80
Met Arg Pro His His Pro Pro Pro Gln Gln Gln Pro Pro Pro Gln Pro
85 90 95
Ala Leu Leu Pro Pro Pro Gln Gln Gln Gln Gln Leu Pro Ala Thr Thr
100 105 110
Ala Ala Ala Ala Val Ser Gly Ala Gly Gln Thr Ser Gln Tyr Asp Leu
115 120 125
Ala Leu Glu Glu Gly Glu Gly Leu Ala Arg Leu Gly Ala Ser Ser Pro
130 135 140
Glu Arg His Pro Arg Val Gln Met Lys Arg Asp Ala Arg Glu Ala Tyr
145 150 155 160
Val Pro Lys Gln Asn Leu Phe Arg Asp Arg Ser Gly Glu Glu Pro Glu
165 170 175
Glu Met Arg Ala Ala Arg Phe His Ala Gly Arg Glu Leu Arg Arg Gly
180 185 190
Leu Asp Arg Lys Arg Val Leu Arg Asp Glu Asp Phe Glu Ala Asp Glu
195 200 205
Leu Thr Gly Ile Ser Pro Ala Arg Ala His Val Ala Ala Ala Asn Leu
210 215 220
Val Thr Ala Tyr Glu Gln Thr Val Lys Glu Glu Ser Asn Phe Gln Lys
225 230 235 240
Ser Phe Asn Asn His Val Arg Thr Leu Ile Ala Arg Glu Glu Val Thr
245 250 255
Leu Gly Leu Met His Leu Trp Asp Leu Leu Glu Ala Ile Val Gln Asn
260 265 270
Pro Thr Ser Lys Pro Leu Thr Ala Gln Leu Phe Leu Val Val Gln His
275 280 285
Ser Arg Asp Asn Glu Ala Phe Arg Glu Ala Leu Leu Asn Ile Thr Glu
290 295 300
Pro Glu Gly Arg Trp Leu Leu Asp Leu Val Asn Ile Leu Gln Ser Ile
305 310 315 320
Val Val Gln Glu Arg Gly Leu Pro Leu Ser Glu Lys Leu Ala Ala Ile
325 330 335
Asn Phe Ser Val Leu Ser Leu Gly Lys Tyr Tyr Ala Arg Lys Ile Tyr
340 345 350
Lys Thr Pro Tyr Val Pro Ile Asp Lys Glu Val Lys Ile Asp Gly Phe
355 360 365
Tyr Met Arg Met Thr Leu Lys Val Leu Thr Leu Ser Asp Asp Leu Gly
370 375 380
Val Tyr Arg Asn Asp Arg Met His Arg Ala Val Ser Ala Ser Arg Arg
385 390 395 400
Arg Glu Leu Ser Asp Gln Glu Leu Met His Ser Leu Gln Arg Ala Leu
405 410 415
Thr Gly Ala Gly Thr Glu Gly Glu Ser Tyr Phe Asp Met Gly Ala Asp
420 425 430
Leu His Trp Gln Pro Ser Arg Arg Ala Leu Glu Ala Ala Gly Pro Pro
435 440 445
Tyr Val Glu Glu Val Asp Glu Asp Glu Glu Gly Glu Tyr Leu Glu Asp
450 455 460
<210> 166
<211> 592
<212> PRT
<213> Simian adenovirus 26
<400> 166
Met Gln Gln Gln Pro Pro Pro Asp Pro Ala Met Arg Ala Ala Leu Gln
1 5 10 15
Ser Gln Pro Ser Gly Ile Asn Ser Ser Asp Asp Trp Thr Gln Ala Met
20 25 30
Gln Arg Ile Met Ala Leu Thr Thr Arg Asn Pro Glu Ala Phe Arg Gln
35 40 45
Gln Pro Gln Ala Asn Arg Leu Ser Ala Ile Leu Glu Ala Val Val Pro
50 55 60
Ser Arg Ser Asn Pro Thr His Glu Lys Val Leu Ala Ile Val Asn Ala
65 70 75 80
Leu Val Glu Asn Lys Ala Ile Arg Gly Asp Glu Ala Gly Leu Val Tyr
85 90 95
Asn Ala Leu Leu Glu Arg Val Ala Arg Tyr Asn Ser Thr Asn Val Gln
100 105 110
Thr Asn Leu Asp Arg Met Val Thr Asp Val Arg Glu Ala Val Ala Gln
115 120 125
Arg Glu Arg Phe His Arg Glu Ser Asn Leu Gly Ser Met Val Ala Leu
130 135 140
Asn Ala Phe Leu Ser Thr Gln Pro Ala Asn Val Pro Arg Gly Gln Glu
145 150 155 160
Asp Tyr Thr Asn Phe Ile Ser Ala Leu Arg Leu Met Val Thr Glu Val
165 170 175
Pro Gln Ser Glu Val Tyr Gln Ser Gly Pro Asp Tyr Phe Phe Gln Thr
180 185 190
Ser Arg Gln Gly Leu Gln Thr Val Asn Leu Ser Gln Ala Phe Lys Asn
195 200 205
Leu Gln Gly Leu Trp Gly Val Gln Ala Pro Val Gly Asp Arg Ala Thr
210 215 220
Val Ser Ser Leu Leu Thr Pro Asn Ser Arg Leu Leu Leu Leu Leu Val
225 230 235 240
Ala Pro Phe Thr Asp Ser Gly Ser Ile Asn Arg Asn Ser Tyr Leu Gly
245 250 255
Tyr Leu Ile Asn Leu Tyr Arg Glu Ala Ile Gly Gln Ala His Val Asp
260 265 270
Glu Gln Thr Tyr Gln Glu Ile Thr His Val Ser Arg Ala Leu Gly Gln
275 280 285
Asp Asp Pro Gly Asn Leu Glu Ala Thr Leu Asn Phe Leu Leu Thr Asn
290 295 300
Arg Ser Gln Lys Ile Pro Pro Gln Tyr Ala Leu Ser Ala Glu Glu Glu
305 310 315 320
Arg Ile Leu Arg Tyr Val Gln Gln Ser Val Gly Leu Phe Leu Met Gln
325 330 335
Glu Gly Ala Thr Pro Ser Ala Ala Leu Asp Met Thr Ala Arg Asn Met
340 345 350
Glu Pro Ser Met Tyr Ala Ser Asn Arg Pro Phe Ile Asn Lys Leu Met
355 360 365
Asp Tyr Leu His Arg Ala Ala Ala Met Asn Ser Asp Tyr Phe Thr Asn
370 375 380
Ala Ile Leu Asn Pro His Trp Leu Pro Pro Pro Gly Phe Tyr Thr Gly
385 390 395 400
Glu Tyr Asp Met Pro Asp Pro Asn Asp Gly Phe Leu Trp Asp Asp Val
405 410 415
Asp Ser Ser Val Phe Ser Pro Arg Pro Gly Ala Asn Glu Arg Pro Leu
420 425 430
Trp Lys Lys Glu Gly Ser Asp Arg Arg Pro Ser Ser Ala Leu Ser Gly
435 440 445
Arg Glu Gly Ala Ala Ala Ala Val Pro Glu Ala Ala Ser Pro Phe Pro
450 455 460
Ser Leu Pro Phe Ser Leu Asn Ser Ile Arg Ser Ser Glu Leu Gly Arg
465 470 475 480
Ile Thr Arg Pro Arg Leu Leu Gly Glu Glu Glu Tyr Leu Asn Asp Ser
485 490 495
Leu Leu Arg Pro Glu Arg Glu Lys Asn Phe Pro Asn Asn Gly Ile Glu
500 505 510
Ser Leu Val Asp Lys Met Ser Arg Trp Lys Thr Tyr Ala Gln Glu His
515 520 525
Arg Asp Asp Pro Arg Ala Ser Gln Gly Ala Ala Ser Arg Gly Ser Ala
530 535 540
Ala Arg Lys Arg Arg Trp His Asp Arg Gln Arg Gly Leu Met Trp Asp
545 550 555 560
Asp Glu Asp Ser Ala Asp Asp Ser Ser Val Leu Asp Leu Gly Gly Ser
565 570 575
Gly Gly Asn Pro Phe Ala His Leu Arg Pro Arg Ile Gly Arg Met Met
580 585 590
<210> 167
<211> 546
<212> PRT
<213> Simian adenovirus 26
<400> 167
Met Met Arg Arg Ala Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser
1 5 10 15
Val Met Gln Gln Ala Met Ala Ala Ala Ala Ala Met Gln Pro Pro Leu
20 25 30
Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg
35 40 45
Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg
50 55 60
Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr
65 70 75 80
Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp
85 90 95
Phe Thr Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg
100 105 110
Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn Met Pro
115 120 125
Asn Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met
130 135 140
Val Ser Arg Lys Thr Pro Asn Gly Val Thr Val Thr Asp Gly Ser Gln
145 150 155 160
Asp Glu Leu Lys Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn
165 170 175
Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile Asp
180 185 190
Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp Ile
195 200 205
Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val
210 215 220
Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro
225 230 235 240
Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg
245 250 255
Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu Gly
260 265 270
Phe Gln Ile Met Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala Leu
275 280 285
Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Glu Asn Ala Ala Ala
290 295 300
Glu Ala Val Ala Thr Ala Ala Thr Ala Glu Ala Lys Ala Val Val Asp
305 310 315 320
Ala Asp Ala Asn Val Thr Arg Gly Asp Thr Phe Ala Thr Gln Ala Glu
325 330 335
Glu Ala Ala Ala Leu Ala Val Ala Asp Asp Ser Glu Ser Thr Lys Thr
340 345 350
Val Thr Ile Gln Pro Val Lys Val Asp Ser Lys Asn Arg Ser Tyr Asn
355 360 365
Val Leu Pro Asp Glu Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala
370 375 380
Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu
385 390 395 400
Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val Tyr Trp Ser Leu
405 410 415
Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser Thr Arg Gln Val
420 425 430
Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys
435 440 445
Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln Leu Arg Ala Phe
450 455 460
Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu
465 470 475 480
Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val Pro
485 490 495
Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly
500 505 510
Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr
515 520 525
Val Tyr Lys Ala Leu Gly Ile Val Ala Pro Arg Val Leu Ser Ser Arg
530 535 540
Thr Phe
545
<210> 168
<211> 193
<212> PRT
<213> Simian adenovirus 26
<400> 168
Met Ser Ile Leu Ile Ser Pro Ser Asn Asn Thr Gly Trp Gly Leu Arg
1 5 10 15
Ala Pro Ser Lys Met Tyr Gly Gly Ala Arg Gln Arg Ser Thr Gln His
20 25 30
Pro Val Arg Val Arg Gly His Phe Arg Ala Pro Trp Gly Ala Leu Lys
35 40 45
Gly Arg Val Arg Ser Arg Thr Thr Val Asp Asp Val Ile Asp Gln Val
50 55 60
Val Ala Asp Ala Arg Asn Tyr Thr Pro Ala Ala Ala Pro Val Ser Thr
65 70 75 80
Val Asp Ala Val Ile Asp Ser Val Val Ala Asp Ala Arg Arg Tyr Ala
85 90 95
Arg Ala Lys Ser Arg Arg Arg Arg Ile Ala Arg Arg His Arg Ser Thr
100 105 110
Pro Ala Met Arg Ala Ala Arg Ala Leu Leu Arg Arg Ala Arg Arg Thr
115 120 125
Gly Arg Arg Ala Met Leu Arg Ala Ala Arg Arg Ala Ala Ser Gly Ala
130 135 140
Ser Ala Gly Arg Thr Arg Arg Arg Ala Ala Thr Ala Ala Ala Ala Ala
145 150 155 160
Ile Ala Ser Met Ser Arg Pro Arg Arg Gly Asn Val Tyr Trp Val Arg
165 170 175
Asp Ala Ala Thr Gly Val Arg Val Pro Val Arg Thr Arg Pro Pro Arg
180 185 190
Thr
<210> 169
<211> 339
<212> PRT
<213> Simian adenovirus 26
<400> 169
Met Ser Lys Arg Lys Phe Lys Glu Glu Met Leu Gln Val Ile Ala Pro
1 5 10 15
Glu Ile Tyr Gly Pro Ala Val Val Lys Glu Glu Arg Lys Pro Arg Lys
20 25 30
Ile Lys Arg Val Lys Lys Asp Lys Lys Glu Glu Asp Asp Asp Leu Val
35 40 45
Glu Phe Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gln Trp Arg Gly
50 55 60
Arg Lys Val Gln Pro Val Leu Arg Pro Gly Thr Thr Val Val Phe Thr
65 70 75 80
Pro Gly Glu Arg Ser Gly Thr Ala Ser Lys Arg Ser Tyr Asp Glu Val
85 90 95
Tyr Gly Asp Asp Asp Ile Leu Glu Gln Ala Ala Glu Arg Leu Gly Glu
100 105 110
Phe Ala Tyr Gly Lys Arg Ser Arg Ser Ala Pro Lys Glu Glu Ala Val
115 120 125
Ser Ile Pro Leu Asp His Gly Asn Pro Thr Pro Ser Leu Lys Pro Val
130 135 140
Thr Leu Gln Gln Val Leu Pro Ser Ala Ala Pro Arg Arg Gly Phe Lys
145 150 155 160
Arg Glu Gly Glu Asp Leu Tyr Pro Thr Met Gln Leu Met Val Pro Lys
165 170 175
Arg Gln Lys Leu Glu Asp Val Leu Glu Thr Met Lys Val Asp Pro Asp
180 185 190
Val Gln Pro Glu Val Lys Val Arg Pro Ile Lys Gln Val Ala Pro Gly
195 200 205
Leu Gly Val Gln Thr Val Asp Ile Lys Ile Pro Thr Glu Pro Met Glu
210 215 220
Thr Gln Thr Glu Pro Val Lys Pro Ser Thr Ser Thr Met Glu Val Gln
225 230 235 240
Thr Asp Pro Trp Met Pro Ala Ala Pro Thr Thr Arg Arg Pro Arg Arg
245 250 255
Lys Tyr Gly Ala Ala Ser Leu Leu Met Pro Asn Tyr Ala Leu His Pro
260 265 270
Ser Ile Ile Pro Thr Pro Gly Tyr Arg Gly Thr Arg Phe Tyr Arg Gly
275 280 285
His Thr Ser Arg Arg Arg Lys Thr Thr Thr Arg Arg Arg Arg Arg Arg
290 295 300
Thr Ala Ala Ala Ser Thr Pro Ala Ala Leu Val Arg Arg Val Tyr Arg
305 310 315 320
Arg Gly Arg Ala Pro Leu Thr Leu Pro Arg Ala Arg Tyr His Pro Ser
325 330 335
Ile Ala Ile
<210> 170
<211> 77
<212> PRT
<213> Simian adenovirus 26
<400> 170
Met Ala Leu Thr Cys Arg Leu Arg Val Pro Ile Thr Gly Tyr Arg Gly
1 5 10 15
Arg Lys Pro Arg Arg Arg Arg Leu Ala Gly Asn Gly Met Arg Arg His
20 25 30
His His Arg Arg Arg Arg Ala Ile Ser Lys Arg Leu Gly Gly Gly Phe
35 40 45
Leu Pro Ala Leu Ile Pro Ile Ile Ala Ala Ala Ile Gly Ala Ile Pro
50 55 60
Gly Ile Ala Ser Val Ala Val Gln Ala Ser Gln Arg His
65 70 75
<210> 171
<211> 258
<212> PRT
<213> Simian adenovirus 26
<400> 171
Met Asp Ser Asp Ala Pro Gly Pro Val Met Cys Phe Arg Arg Gln Met
1 5 10 15
Glu Asp Ile Asn Phe Ser Ser Leu Ala Pro Arg His Gly Thr Arg Pro
20 25 30
Phe Met Gly Thr Trp Ser Asp Ile Gly Thr Ser Gln Leu Asn Gly Gly
35 40 45
Ala Phe Asn Trp Ser Ser Leu Trp Ser Gly Leu Lys Asn Phe Gly Ser
50 55 60
Thr Leu Lys Thr Tyr Gly Ser Lys Ala Trp Asn Ser Thr Thr Gly Gln
65 70 75 80
Ala Leu Arg Asp Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys Val Val
85 90 95
Asp Gly Leu Ala Ser Gly Ile Asn Gly Val Val Asp Leu Ala Asn Gln
100 105 110
Ala Val Gln Arg Gln Ile Asn Ser Arg Leu Asp Pro Val Pro Pro Ala
115 120 125
Gly Ser Val Glu Met Pro Gln Val Glu Glu Glu Leu Pro Pro Leu Asp
130 135 140
Lys Arg Gly Glu Lys Arg Pro Arg Pro Asp Ala Glu Glu Thr Leu Leu
145 150 155 160
Thr His Thr Asp Glu Pro Pro Pro Tyr Glu Glu Ala Val Lys Leu Gly
165 170 175
Leu Pro Thr Thr Arg Pro Ile Ala Pro Leu Ala Thr Gly Val Leu Lys
180 185 190
Pro Glu Thr Lys Pro Ala Thr Leu Asp Leu Pro Pro Pro Gln Pro Ser
195 200 205
Arg Pro Ser Thr Val Ala Lys Pro Leu Pro Pro Val Ala Val Ala Arg
210 215 220
Ala Arg Pro Gly Gly Thr Ala Arg Pro His Ala Asn Trp Gln Ser Thr
225 230 235 240
Leu Asn Ser Ile Val Gly Leu Gly Val Gln Ser Val Lys Arg Arg Arg
245 250 255
Cys Tyr
<210> 172
<211> 937
<212> PRT
<213> Simian adenovirus 26
<400> 172
Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr
65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Thr Ser Gln Trp Val Thr Lys Asp Asn Gly Thr Asp Lys
130 135 140
Thr Tyr Ser Phe Gly Asn Ala Pro Val Arg Gly Leu Asp Ile Thr Glu
145 150 155 160
Glu Gly Leu Gln Ile Gly Thr Asp Asp Ser Ser Thr Glu Ser Lys Lys
165 170 175
Ile Phe Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln Val Gly Asp Glu
180 185 190
Glu Trp His Asp Thr Ile Gly Ala Glu Asp Lys Tyr Gly Gly Arg Ala
195 200 205
Leu Lys Pro Ala Thr Asn Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys
210 215 220
Pro Thr Asn Ala Lys Gly Gly Gln Ala Lys Thr Arg Thr Lys Asp Asp
225 230 235 240
Gly Thr Thr Glu Pro Asp Ile Asp Met Ala Phe Phe Asp Asp Arg Ser
245 250 255
Gln Gln Ala Ser Phe Ser Pro Glu Leu Val Leu Tyr Thr Glu Asn Val
260 265 270
Asp Leu Glu Thr Pro Asp Thr His Ile Ile Tyr Lys Pro Gly Thr Asp
275 280 285
Glu Thr Ser Ser Ser Phe Asn Leu Gly Gln Gln Ser Met Pro Asn Arg
290 295 300
Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr
305 310 315 320
Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu
325 330 335
Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln
340 345 350
Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp
355 360 365
Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn
370 375 380
His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asn Gly
385 390 395 400
Val Gly Phe Thr Asp Thr Phe Gln Gly Ile Lys Val Lys Thr Thr Asn
405 410 415
Asn Gly Thr Ala Asn Ala Thr Glu Trp Glu Ser Asp Thr Ser Val Asn
420 425 430
Asn Ala Asn Glu Ile Ala Lys Gly Asn Pro Phe Ala Met Glu Ile Asn
435 440 445
Ile Gln Ala Asn Leu Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu
450 455 460
Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro Ala Asn Val Thr Leu Pro
465 470 475 480
Thr Asn Ile Asn Thr Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro
485 490 495
Ser Leu Val Asp Ser Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp
500 505 510
Pro Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu
515 520 525
Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His
530 535 540
Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu Leu
545 550 555 560
Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met
565 570 575
Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser
580 585 590
Ile Ser Phe Thr Ser Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala
595 600 605
His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn
610 615 620
Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro
625 630 635 640
Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser Ile Pro Ser Arg Asn
645 650 655
Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Lys Glu
660 665 670
Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly
675 680 685
Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys
690 695 700
Lys Val Ser Ile Thr Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp
705 710 715 720
Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly
725 730 735
Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr Lys Asp Trp Phe Leu
740 745 750
Val Gln Met Leu Ala His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val
755 760 765
Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln
770 775 780
Pro Met Ser Arg Gln Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gln
785 790 795 800
Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr
805 810 815
Leu Ala Pro Thr Met Arg Gln Gly Gln Pro Tyr Pro Ala Asn Tyr Pro
820 825 830
Tyr Pro Leu Ile Gly Lys Ser Ala Val Thr Ser Val Thr Gln Lys Lys
835 840 845
Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe
850 855 860
Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala
865 870 875 880
Asn Ser Ala His Ala Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp
885 890 895
Glu Ser Thr Leu Leu Tyr Val Val Phe Glu Val Phe Asp Val Val Arg
900 905 910
Val His Gln Pro His Arg Gly Val Ile Glu Ala Val Tyr Leu Arg Thr
915 920 925
Pro Phe Ser Ala Gly Asn Ala Thr Thr
930 935
<210> 173
<211> 209
<212> PRT
<213> Simian adenovirus 26
<400> 173
Met Ala Glu Pro Thr Gly Ser Gly Glu Gln Glu Leu Arg Ala Ile Ile
1 5 10 15
Arg Asp Leu Gly Cys Gly Pro Tyr Phe Leu Gly Thr Phe Asp Lys Arg
20 25 30
Phe Pro Gly Phe Met Ala Pro His Lys Leu Ala Cys Ala Ile Val Asn
35 40 45
Thr Ala Gly Arg Glu Thr Gly Gly Glu His Trp Leu Ala Phe Ala Trp
50 55 60
Asn Pro Arg Ser Asn Thr Cys Tyr Leu Phe Asp Pro Phe Gly Phe Ser
65 70 75 80
Asp Glu Arg Leu Lys Gln Ile Tyr Gln Phe Glu Tyr Glu Gly Leu Leu
85 90 95
Arg Arg Ser Ala Leu Ala Thr Glu Asp Arg Cys Val Thr Leu Glu Lys
100 105 110
Ser Thr Gln Thr Val Gln Gly Pro Arg Ser Ala Ala Cys Gly Leu Phe
115 120 125
Cys Cys Met Phe Leu His Ala Phe Val His Trp Pro Asp Arg Pro Met
130 135 140
Asp Lys Asn Pro Thr Met Asn Leu Leu Thr Gly Val Pro Asn Gly Met
145 150 155 160
Leu Gln Ser Pro Gln Val Glu Pro Thr Leu Arg Arg Asn Gln Glu Ala
165 170 175
Leu Tyr Arg Phe Leu Asn Ser His Ser Ala Tyr Phe Arg Ser His Arg
180 185 190
Ala Arg Ile Glu Lys Ala Thr Ala Phe Asp Arg Met Asn Asn Gln Asp
195 200 205
Met
<210> 174
<211> 802
<212> PRT
<213> Simian adenovirus 26
<400> 174
Met Glu Thr Gln Pro Ser Pro Thr Ser Pro Ser Ala Pro Thr Thr Ala
1 5 10 15
Asp Glu Lys Gln Gln Gln Gln Asn Glu Ser Leu Thr Ala Pro Pro Pro
20 25 30
Ser Pro Ala Ser Asp Ala Ala Ala Val Pro Asp Met Gln Glu Met Glu
35 40 45
Glu Ser Ile Glu Ile Asp Leu Gly Tyr Val Thr Pro Ala Glu His Glu
50 55 60
Glu Glu Leu Ala Val Arg Phe Gln Ser Ser Ser Gln Glu Asp Lys Glu
65 70 75 80
Gln Pro Glu Gln Glu Ala Glu Asn Glu Gln Ser Gln Thr Gly Leu Glu
85 90 95
His Gly Asp Tyr Leu His Leu Ser Gly Glu Glu Asp Ala Leu Ile Lys
100 105 110
His Leu Ala Arg Gln Ala Thr Ile Val Lys Asp Ala Leu Leu Asp Arg
115 120 125
Thr Glu Val Pro Leu Ser Val Glu Glu Leu Ser Arg Ala Tyr Glu Leu
130 135 140
Asn Leu Phe Ser Pro Arg Val Pro Pro Lys Arg Gln Pro Asn Gly Thr
145 150 155 160
Cys Glu Pro Asn Pro Arg Leu Asn Phe Tyr Pro Val Phe Ala Val Pro
165 170 175
Glu Ala Leu Ala Thr Tyr His Ile Phe Phe Lys Asn Gln Lys Ile Pro
180 185 190
Val Ser Cys Arg Ala Asn Arg Thr Arg Ala Asp Ala Leu Phe Asn Leu
195 200 205
Gly Pro Gly Ala Arg Leu Pro Asp Ile Ala Ser Leu Glu Glu Val Pro
210 215 220
Lys Ile Phe Glu Gly Leu Gly Ser Asp Glu Thr Arg Ala Ala Asn Ala
225 230 235 240
Leu Gln Gly Asn Glu Glu His Glu His His Ser Ala Leu Val Glu Leu
245 250 255
Glu Gly Asp Asn Ala Arg Leu Ala Val Leu Lys Arg Thr Val Glu Leu
260 265 270
Thr His Phe Ala Tyr Pro Ala Leu Asn Leu Pro Pro Lys Val Met Ser
275 280 285
Ala Val Met Asp Gln Val Leu Ile Lys Arg Ala Ser Pro Ile Ser Glu
290 295 300
Asp Glu Gly Met Gln Asp Ser Glu Glu Gly Lys Pro Val Val Ser Asp
305 310 315 320
Glu Gln Leu Ala Arg Trp Leu Gly Pro Asn Ala Thr Pro Gln Ser Leu
325 330 335
Glu Glu Arg Arg Lys Leu Met Met Ala Val Val Leu Val Thr Val Glu
340 345 350
Leu Glu Cys Leu Arg Arg Phe Phe Ala Asp Ala Glu Thr Leu Arg Lys
355 360 365
Val Glu Glu Asn Leu His Tyr Leu Phe Arg His Gly Phe Val Arg Gln
370 375 380
Ala Cys Lys Ile Ser Asn Val Glu Leu Thr Asn Leu Val Ser Tyr Met
385 390 395 400
Gly Ile Leu His Glu Asn Arg Leu Gly Gln Asn Val Leu His Thr Thr
405 410 415
Leu Arg Gly Glu Ala Arg Arg Asp Tyr Ile Arg Asp Cys Val Tyr Leu
420 425 430
Tyr Leu Cys His Thr Trp Gln Thr Gly Met Gly Val Trp Gln Gln Cys
435 440 445
Leu Glu Glu Gln Asn Leu Lys Glu Leu Cys Lys Leu Leu Gln Lys Asn
450 455 460
Leu Lys Gly Leu Trp Thr Gly Phe Asp Glu Arg Thr Thr Ala Ser Asp
465 470 475 480
Leu Ala Asp Leu Ile Phe Pro Glu Arg Leu Arg Leu Thr Leu Arg Asn
485 490 495
Gly Leu Pro Asp Phe Met Ser Gln Ser Met Leu Gln Asn Phe Arg Ser
500 505 510
Phe Ile Leu Glu Arg Ser Gly Ile Leu Pro Ala Thr Cys Ser Ala Leu
515 520 525
Pro Ser Asp Phe Val Pro Leu Thr Phe Arg Glu Cys Pro Pro Pro Leu
530 535 540
Trp Ser His Cys Tyr Leu Leu Arg Leu Ala Asn Tyr Leu Ala Tyr His
545 550 555 560
Ser Asp Val Ile Glu Asp Val Ser Gly Glu Ala Leu Leu Glu Cys His
565 570 575
Cys Arg Cys Asn Leu Cys Thr Pro His Arg Ser Leu Ala Cys Asn Pro
580 585 590
Gln Leu Leu Ser Glu Thr Gln Ile Ile Gly Thr Phe Glu Leu Gln Gly
595 600 605
Pro Ser Glu Gly Glu Gly Ser Ala Ala Lys Gly Gly Leu Lys Leu Thr
610 615 620
Pro Gly Leu Trp Thr Ser Ala Tyr Leu Arg Lys Phe Val Pro Glu Asp
625 630 635 640
Tyr His Pro Phe Glu Ile Arg Phe Tyr Glu Asp Gln Ser Gln Pro Pro
645 650 655
Lys Ala Glu Leu Ser Ala Cys Val Ile Thr Gln Gly Ala Ile Leu Ala
660 665 670
Gln Leu Gln Ala Ile Gln Lys Ser Arg Gln Glu Phe Leu Leu Lys Lys
675 680 685
Gly Arg Gly Val Tyr Leu Asp Pro Gln Thr Gly Glu Glu Leu Asn Pro
690 695 700
Gly Phe Pro Gln Asp Ala Pro Arg Lys Gln Glu Ala Glu Ser Gly Ala
705 710 715 720
Ala Ala Arg Gly Gly Phe Gly Gly Arg Leu Gly Glu Gln Gln Ser Gly
725 730 735
Arg Gly Asp Gly Gly Arg Leu Gly Gln His Ser Gly Arg Gly Gly Gln
740 745 750
Pro Ala Arg Gln Ser Gly Gly Arg Arg Gly Gly Gly Arg Gly Gly Gly
755 760 765
Gly Arg Ser Ser Arg Arg Gln Thr Val Val Leu Gly Gly Gly Glu Ser
770 775 780
Lys Gln His Gly Tyr His Leu Arg Ser Gly Ser Gly Ser Arg Ser Ala
785 790 795 800
Pro Gln
<210> 175
<211> 227
<212> PRT
<213> Simian adenovirus 26
<400> 175
Met Ser Lys Glu Ile Pro Thr Pro Tyr Met Trp Ser Tyr Gln Pro Gln
1 5 10 15
Met Gly Leu Ala Ala Gly Ala Ala Gln Asp Tyr Ser Thr Arg Met Asn
20 25 30
Trp Leu Ser Ala Gly Pro Ala Met Ile Ser Arg Val Asn Asp Ile Arg
35 40 45
Ala His Arg Asn Gln Ile Leu Leu Glu Gln Ser Ala Leu Thr Ala Thr
50 55 60
Pro Arg Asn His Leu Asn Pro Arg Asn Trp Pro Ala Ala Leu Val Tyr
65 70 75 80
Gln Glu Ile Pro Gln Pro Thr Thr Val Leu Leu Pro Arg Asp Ala Gln
85 90 95
Ala Glu Val Gln Leu Thr Asn Ser Gly Val Gln Leu Ala Gly Gly Ala
100 105 110
Thr Leu Cys Arg His Arg Pro Ala Gln Gly Ile Lys Arg Leu Val Ile
115 120 125
Arg Gly Arg Gly Thr Gln Leu Asn Asp Glu Val Val Ser Ser Ser Leu
130 135 140
Gly Leu Arg Pro Asp Gly Val Phe Gln Leu Ala Gly Ser Gly Arg Ser
145 150 155 160
Ser Phe Thr Pro Arg Gln Ala Val Leu Thr Leu Glu Ser Ser Ser Ser
165 170 175
Gln Pro Arg Ser Gly Gly Ile Gly Thr Leu Gln Phe Val Glu Glu Phe
180 185 190
Thr Pro Ser Val Tyr Phe Asn Pro Phe Ser Gly Ser Pro Gly His Tyr
195 200 205
Pro Asp Glu Phe Ile Pro Asn Phe Asp Ala Ile Ser Glu Ser Val Asp
210 215 220
Gly Tyr Asp
225
<210> 176
<211> 106
<212> PRT
<213> Simian adenovirus 26
<400> 176
Met Ser His Gly Gly Ala Ala Asp Leu Ala Arg Leu Arg His Leu Asp
1 5 10 15
His Cys Arg Arg Phe Arg Cys Phe Ala Arg Asp Leu Ala Glu Phe Thr
20 25 30
Tyr Phe Glu Leu Pro Glu Glu His Pro Gln Gly Pro Ala His Gly Val
35 40 45
Arg Ile Val Val Glu Gly Gly Leu Asp Ser His Leu Leu Arg Ile Phe
50 55 60
Ser Gln Arg Pro Ile Leu Val Glu Arg Gln Gln Gly Asn Thr Leu Leu
65 70 75 80
Thr Leu Tyr Cys Ile Cys Asp His Pro Gly Leu His Glu Ser Leu Cys
85 90 95
Cys Leu Leu Cys Thr Glu Tyr Asn Lys Ser
100 105
<210> 177
<211> 176
<212> PRT
<213> Simian adenovirus 26
<400> 177
Met Gly Lys Ile Thr Leu Val Cys Gly Val Leu Val Thr Val Val Leu
1 5 10 15
Ser Ile Leu Gly Gly Gly Ser Ala Ala Val Val Thr Glu Lys Lys Ala
20 25 30
Asp Pro Cys Leu Thr Phe Asn Pro Asp Lys Cys Arg Leu Ser Phe Gln
35 40 45
Pro Asp Gly Asn Arg Cys Ala Val Leu Ile Lys Cys Gly Trp Glu Cys
50 55 60
Glu Ser Val Ala Ile Gln Tyr Lys Asn Lys Thr Arg Asn Asn Thr Leu
65 70 75 80
Ala Ser Thr Trp Gln Pro Gly Asp Pro Glu Trp Tyr Thr Val Ser Val
85 90 95
Pro Gly Ala Asp Gly Ser Leu Arg Thr Val Asn Asn Thr Phe Ile Phe
100 105 110
Glu His Met Cys Asp Thr Ala Met Phe Met Ser Lys Gln Tyr Gly Met
115 120 125
Trp Pro Pro Arg Lys Glu Asn Ile Val Val Phe Ser Ile Ala Tyr Ser
130 135 140
Ala Cys Thr Val Leu Ile Thr Ala Ile Val Cys Leu Ser Ile His Met
145 150 155 160
Leu Ile Ala Ile Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys Gln Pro
165 170 175
<210> 178
<211> 246
<212> PRT
<213> Simian adenovirus 26
<400> 178
Met Ala Ser Val Thr Ile Leu Ile Tyr Phe Leu Gly Leu Val Gly Thr
1 5 10 15
Ile Ser Ser Phe Asp His Lys Asn Val Thr Ala Tyr Val Gly Ser Asn
20 25 30
Cys Val Leu Ser Gly Tyr Gln Ser His Gln Arg Val Ser Trp Tyr Trp
35 40 45
Phe Asp Lys Lys Asn Thr Ala Tyr Thr Leu Cys Lys Gly Tyr Gln Gln
50 55 60
Pro Thr His Arg Ser Gly Leu Tyr Tyr Ser Cys Thr Asn Asn Asn Ile
65 70 75 80
Thr Leu Leu Gln Val Thr Asn Gln Tyr Ser Gly Thr Tyr Tyr Gly Thr
85 90 95
Asn Phe Asn Thr Lys Gln Asp Thr Tyr Tyr Ser Val Arg Val Leu Asp
100 105 110
Pro Thr Thr Pro Arg Thr Thr Thr Lys His Thr Thr Thr Lys Lys Pro
115 120 125
Thr Thr Pro Lys Lys Pro Thr Thr Pro Lys Thr Thr Lys Thr Thr Thr
130 135 140
Ala Lys Gln Thr Thr Thr Thr Glu Pro Thr Thr Thr Ser Thr Thr Leu
145 150 155 160
Ala Ile Thr Thr His Thr Glu Leu Thr Ser Gln Ala Thr Thr Glu Asn
165 170 175
Gly Phe Ala Leu Leu Gln Lys Gly Glu Asn Ser Ser Ser Ser Pro Leu
180 185 190
Pro Thr Thr Pro Ser Glu Glu Ile Pro Lys Ser Met Val Gly Ile Ile
195 200 205
Ala Ala Val Val Val Cys Met Val Ile Ile Ile Leu Cys Met Met Tyr
210 215 220
Tyr Ala Cys Tyr Tyr Arg Lys His Arg Leu Asn Asn Lys Leu Asp Pro
225 230 235 240
Leu Leu Asn Val Asp Phe
245
<210> 179
<211> 206
<212> PRT
<213> Simian adenovirus 26
<400> 179
Met Lys Ile Leu Ser Leu Phe Val Phe Ser Ile Ile Thr Ser Ala Ile
1 5 10 15
Cys Lys Ser Val Asp Lys Asp Val Thr Val Thr Thr Gly Ser Asn Tyr
20 25 30
Thr Leu Lys Gly Pro Ser Ser Gly Met Leu Ser Trp Tyr Cys Tyr Phe
35 40 45
Gly Asn Asp Asp Lys Gln Thr Glu Leu Cys Asn Phe Gln Asn Gly Lys
50 55 60
Thr Lys Asn Ser Lys Ile Asp Asn Tyr Gln Cys His Gly Thr Asp Leu
65 70 75 80
Val Leu Met Asn Ile Thr Lys Ala Tyr Ala Gly Ser Tyr Ser Cys Pro
85 90 95
Gly Gln Asn Thr Glu Glu Met Ile Phe Tyr Lys Leu Ile Val Val Asp
100 105 110
Pro Thr Thr Pro Pro Pro Thr Thr Thr Thr Asn Ala Pro Thr Thr Asp
115 120 125
Thr Gln Glu Thr Thr Pro Glu Ala Val Ala Glu Leu Ala Lys Gln Ile
130 135 140
His Glu Asp Ser Phe Val Ala Asn Thr Pro Thr His Pro Gly Pro Gln
145 150 155 160
Cys Pro Gly Leu Val Val Ser Gly Ile Val Gly Val Leu Cys Gly Leu
165 170 175
Ala Val Ile Ile Ile Cys Met Phe Ile Phe Ala Cys Cys Tyr Arg Arg
180 185 190
Leu His Arg Gln Lys Ser Asp Pro Leu Leu Asn Leu Tyr Val
195 200 205
<210> 180
<211> 291
<212> PRT
<213> Simian adenovirus 26
<400> 180
Met Lys Ala Leu Ser Thr Leu Val Phe Leu Ser Leu Ile Gly Ile Val
1 5 10 15
Phe Ser Ala Gly Phe Leu Lys Asn Leu Thr Ile Ile Glu Gly Asp Asn
20 25 30
Ala Thr Leu Val Gly Ile Ser Gly Gln Asn Val Ser Trp Leu Lys Tyr
35 40 45
His Leu Asp Gly Trp Lys Pro Ile Cys Thr Trp Asn Val Ser Val Tyr
50 55 60
Thr Cys His Gly Val Asn Leu Thr Ile Thr Asn Ala Thr Gln Asp Gln
65 70 75 80
Asn Gly Arg Phe Lys Gly Gln Ser Phe Thr Ser Asn Asn Gly Tyr Glu
85 90 95
Thr His Asn Met Phe Ile Tyr Asp Val Thr Val Ile Ser Asn Lys Thr
100 105 110
Thr Pro Thr Thr Gln Thr Pro Thr Thr His Ser Ser Thr His Ala Met
115 120 125
Gln Thr Thr Gln Thr Thr Thr Tyr Thr Thr Ser Ile Gln Pro Thr Thr
130 135 140
Thr Thr Ala Glu Val Thr Ser Thr Ala Pro Gln Pro Gln Ala Leu Ala
145 150 155 160
Leu Met Ala Ala Gln Pro Ser Ser Met Thr Ala Lys Thr Asn Glu Gln
165 170 175
Thr Thr Glu Phe Leu Ser Thr Thr Gln Ser Ser Thr Thr Ala Thr Ser
180 185 190
Ser Ala Phe Ser Ser Thr Ala Asn Leu Thr Ser Leu Ser Ser Thr Pro
195 200 205
Ile Ser Asn Ala Thr Thr Ser Pro Ala Pro Leu Pro Thr Pro Leu Lys
210 215 220
Gln Ser Glu Ser Ser Thr Gln Leu Gln Ile Thr Leu Leu Ile Val Ile
225 230 235 240
Gly Val Val Ile Leu Ala Val Leu Leu Tyr Phe Ile Phe Cys Arg Arg
245 250 255
Ile Pro Asn Ala Lys Pro Ala Tyr Lys Pro Ile Val Ile Gly Thr Pro
260 265 270
Glu Pro Leu Gln Val Glu Gly Gly Leu Arg Asn Leu Leu Phe Ser Phe
275 280 285
Thr Val Trp
290
<210> 181
<211> 91
<212> PRT
<213> Simian adenovirus 26
<400> 181
Met Ile Pro Arg His Phe Ile Ile Thr Ser Leu Ile Cys Val Leu Gln
1 5 10 15
Val Cys Ala Thr Leu Ala Leu Val Ala Asn Ala Ser Pro Asp Cys Ile
20 25 30
Gly Ala Phe Ala Ser Tyr Val Leu Phe Ala Phe Ile Thr Cys Ile Cys
35 40 45
Cys Cys Ser Ile Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Val Asp
50 55 60
Trp Val Phe Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg
65 70 75 80
Asp Gln Arg Val Ala Gln Leu Leu Arg Leu Ile
85 90
<210> 182
<211> 148
<212> PRT
<213> Simian adenovirus 26
<400> 182
Met Ile Ser Met Arg Ala Leu Leu Leu Leu Leu Ala Leu Leu Leu Val
1 5 10 15
Pro Leu Ala Ala Pro Leu Ser Phe Lys Ser Pro Thr Gln Ser Pro Glu
20 25 30
Glu Val Arg Lys Cys Lys Phe Gln Glu Pro Trp Lys Phe Leu Ser Cys
35 40 45
Tyr Lys Leu Lys Ser Glu Met His Pro Ser Trp Ile Met Ile Ile Gly
50 55 60
Ile Val Asn Ile Leu Ala Cys Thr Leu Phe Ser Phe Val Ile Tyr Pro
65 70 75 80
Arg Phe Asp Phe Gly Trp Asn Ala Pro Glu Ala Leu Trp Leu Pro Pro
85 90 95
Asp Pro Asp Ile Pro Pro Gln Gln Gln Gln Asn Gln Ala His Ala Pro
100 105 110
Pro Pro Gln Pro Arg Pro Gln Tyr Met Pro Ile Leu Asp Tyr Glu Ala
115 120 125
Glu Pro Gln Arg Ala Met Leu Pro Ala Ile Ser Tyr Phe Asn Leu Thr
130 135 140
Gly Gly Asp Asp
145
<210> 183
<211> 425
<212> PRT
<213> Simian adenovirus 26
<400> 183
Met Ser Lys Lys Arg Ala Arg Val Asp Asp Gly Phe Asp Pro Val Tyr
1 5 10 15
Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro
20 25 30
Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser
35 40 45
Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Ala Val Thr Leu
50 55 60
Lys Leu Gly Glu Gly Val Asp Leu Asp Asp Ser Gly Lys Leu Ile Ala
65 70 75 80
Asn Thr Val Ser Lys Ala Ile Ala Pro Leu Ser Phe Ser Asn Asn Thr
85 90 95
Ile Ser Leu Asn Met Asp Thr Pro Leu Tyr Thr Lys Asp Gly Lys Leu
100 105 110
Ser Leu Gln Val Ser Pro Pro Leu Asn Ile Leu Arg Ser Thr Ile Leu
115 120 125
Asn Thr Leu Ala Leu Ala Phe Gly Ser Gly Leu Gly Leu Ser Gly Ser
130 135 140
Ala Leu Ala Val Gln Leu Ala Ser Pro Leu Thr Phe Asp Asp Lys Gly
145 150 155 160
Asn Ile Lys Ile Thr Leu Asp Arg Gly Leu His Val Thr Thr Gly Asn
165 170 175
Ala Ile Glu Ser Asn Ile Ser Trp Ala Lys Gly Ile Lys Phe Glu Asp
180 185 190
Gly Ala Ile Ala Ala Asn Ile Gly Lys Gly Leu Glu Phe Gly Thr Ser
195 200 205
Ser Thr Val Thr Gly Val Asp Asp Ala Tyr Pro Ile Gln Val Lys Leu
210 215 220
Gly Ser Gly Leu Ser Phe Asp Ser Thr Gly Ala Ile Met Ala Gly Asn
225 230 235 240
Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro
245 250 255
Asn Cys Gln Ile Leu Ala Glu Asn Asp Ala Lys Leu Thr Leu Cys Leu
260 265 270
Thr Lys Cys Gly Ser Gln Ile Leu Ala Thr Val Ser Val Leu Val Val
275 280 285
Gly Ser Gly Lys Leu Asn Pro Ile Thr Gly Glu Val Ser Ser Ala Gln
290 295 300
Val Phe Leu Arg Phe Asp Ser Asn Gly Val Leu Leu Val Asn Ser Ser
305 310 315 320
Thr Leu Lys Lys Tyr Trp Gly Tyr Arg Gln Gly Asp Ser Ile Asp Gly
325 330 335
Thr Pro Tyr Thr Asn Ala Val Gly Phe Met Pro Asn Leu Lys Ala Tyr
340 345 350
Pro Lys Ser Gln Ser Ser Thr Thr Lys Asn Asn Ile Val Gly Gln Val
355 360 365
Tyr Met Asn Gly Asp Asn Ser Lys Pro Met Leu Leu Thr Ile Thr Leu
370 375 380
Asn Gly Thr Asp Asp Thr Thr Ser Ala Tyr Ser Met Ser Phe Ser Tyr
385 390 395 400
Thr Trp Thr Asn Gly Ser Tyr Thr Gly Ala Thr Phe Gly Ala Asn Ser
405 410 415
Tyr Thr Phe Ser Tyr Ile Ala Gln Glu
420 425
<210> 184
<211> 570
<212> DNA
<213> Simian adenovirus 26
<220>
<221> CDS
<222> (3)..(563)
<223> label=Elb\19K
<400> 184
gc atg gag att tgg acg gtc ttg gaa gac ttt cac aag act aga cag 47
Met Glu Ile Trp Thr Val Leu Glu Asp Phe His Lys Thr Arg Gln
1 5 10 15
ctg cta gag aac gcc tcg aac gga gtc tct tac ctg tgg aga ttc tgc 95
Leu Leu Glu Asn Ala Ser Asn Gly Val Ser Tyr Leu Trp Arg Phe Cys
20 25 30
ttc ggt ggc gac cta gct agg cta gtc tat agg gcc aaa cag gat tat 143
Phe Gly Gly Asp Leu Ala Arg Leu Val Tyr Arg Ala Lys Gln Asp Tyr
35 40 45
agt gaa caa ttt gag gtt att ttg aga gag tgt tct ggt ctt ttt gac 191
Ser Glu Gln Phe Glu Val Ile Leu Arg Glu Cys Ser Gly Leu Phe Asp
50 55 60
gct ctt aac ttg ggc cat cag tct cac ttt aac cag agg att tcg aga 239
Ala Leu Asn Leu Gly His Gln Ser His Phe Asn Gln Arg Ile Ser Arg
65 70 75
gcc ctt gac ttt act act cct ggc aga acc act gcg gca gta gcc ttt 287
Ala Leu Asp Phe Thr Thr Pro Gly Arg Thr Thr Ala Ala Val Ala Phe
80 85 90 95
ttt gct ttt ctt ctt gac aaa tgg agt caa gaa acc cat ttc agc agg 335
Phe Ala Phe Leu Leu Asp Lys Trp Ser Gln Glu Thr His Phe Ser Arg
100 105 110
gat tac cag ctg gat ttc tta gca gta gct ttg tgg aga aca tgg aag 383
Asp Tyr Gln Leu Asp Phe Leu Ala Val Ala Leu Trp Arg Thr Trp Lys
115 120 125
tgc cag cgc ctg aat gca atc tca ggc tac ttg ccg gta cag ccg cta 431
Cys Gln Arg Leu Asn Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro Leu
130 135 140
gac act ctg agg atc ctg aat ctc cag gag agt ccc agg gca cac caa 479
Asp Thr Leu Arg Ile Leu Asn Leu Gln Glu Ser Pro Arg Ala His Gln
145 150 155
cgt cgc cag cag cag cag cgg cag cag gag gag gat caa gaa gag aac 527
Arg Arg Gln Gln Gln Gln Arg Gln Gln Glu Glu Asp Gln Glu Glu Asn
160 165 170 175
ccg aga gcc ggc ctg gac cct ccg gag gag gag gag tagctga 570
Pro Arg Ala Gly Leu Asp Pro Pro Glu Glu Glu Glu
180 185
<210> 185
<211> 187
<212> PRT
<213> Simian adenovirus 26
<400> 185
Met Glu Ile Trp Thr Val Leu Glu Asp Phe His Lys Thr Arg Gln Leu
1 5 10 15
Leu Glu Asn Ala Ser Asn Gly Val Ser Tyr Leu Trp Arg Phe Cys Phe
20 25 30
Gly Gly Asp Leu Ala Arg Leu Val Tyr Arg Ala Lys Gln Asp Tyr Ser
35 40 45
Glu Gln Phe Glu Val Ile Leu Arg Glu Cys Ser Gly Leu Phe Asp Ala
50 55 60
Leu Asn Leu Gly His Gln Ser His Phe Asn Gln Arg Ile Ser Arg Ala
65 70 75 80
Leu Asp Phe Thr Thr Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe
85 90 95
Ala Phe Leu Leu Asp Lys Trp Ser Gln Glu Thr His Phe Ser Arg Asp
100 105 110
Tyr Gln Leu Asp Phe Leu Ala Val Ala Leu Trp Arg Thr Trp Lys Cys
115 120 125
Gln Arg Leu Asn Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro Leu Asp
130 135 140
Thr Leu Arg Ile Leu Asn Leu Gln Glu Ser Pro Arg Ala His Gln Arg
145 150 155 160
Arg Gln Gln Gln Gln Arg Gln Gln Glu Glu Asp Gln Glu Glu Asn Pro
165 170 175
Arg Ala Gly Leu Asp Pro Pro Glu Glu Glu Glu
180 185
<210> 186
<211> 6450
<212> DNA
<213> Simian adenovirus 26
<220>
<221> CDS
<222> (4)..(564)
<223> label=22K
<220>
<221> CDS
<222> (1870)..(2502)
<223> label=CR1-alpha
<220>
<221> CDS
<222> (6039)..(6440)
<223> label=E3/14.7K
<400> 186
agg atg ccc cga gga aac aag aag ctg aaa gtg gag ctg ccg ccc gtg 48
Met Pro Arg Gly Asn Lys Lys Leu Lys Val Glu Leu Pro Pro Val
1 5 10 15
gag gat ttg gag gaa gac tgg gag aac agc agt cag gca gag gag atg 96
Glu Asp Leu Glu Glu Asp Trp Glu Asn Ser Ser Gln Ala Glu Glu Met
20 25 30
gag gaa gac tgg gac agc act cag gca gag gag gac agc ctg caa gac 144
Glu Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp
35 40 45
agt ctg gag gaa gac gag gag gag gca gag gag gag gtg gaa gaa gca 192
Ser Leu Glu Glu Asp Glu Glu Glu Ala Glu Glu Glu Val Glu Glu Ala
50 55 60
gcc gcc gcc aga ccg tcg tcc tcg gcg ggg gag aaa gca agc agc acg 240
Ala Ala Ala Arg Pro Ser Ser Ser Ala Gly Glu Lys Ala Ser Ser Thr
65 70 75
gat acc atc tcc gct ccg ggt cgg ggt ccc gct cgg ccc cac agt aga 288
Asp Thr Ile Ser Ala Pro Gly Arg Gly Pro Ala Arg Pro His Ser Arg
80 85 90 95
tgg gac gag acc ggg cga ttc ccg aac ccc acc acc cag acc ggt aag 336
Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Gly Lys
100 105 110
aag gag cgg cag gga tac aag tcc tgg cgg ggg cac aaa aac gcc atc 384
Lys Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly His Lys Asn Ala Ile
115 120 125
gtc tcc tgc ttg cag gcc tgc ggg ggc aac atc tcc ttc acc cgg cgc 432
Val Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile Ser Phe Thr Arg Arg
130 135 140
tac ctg ctc ttc cac cgc ggg gtg aac ttc ccc cgc aac atc ttg cat 480
Tyr Leu Leu Phe His Arg Gly Val Asn Phe Pro Arg Asn Ile Leu His
145 150 155
tac tac cgt cac ctc cac agc ccc tac tac ttc caa gaa gag gca gca 528
Tyr Tyr Arg His Leu His Ser Pro Tyr Tyr Phe Gln Glu Glu Ala Ala
160 165 170 175
gcg gca gaa aaa gac cag cag aaa acc agc agc tag aaaatccaca 574
Ala Ala Glu Lys Asp Gln Gln Lys Thr Ser Ser
180 185
gcggcggcag gtggactgag gatcgcggcg aacgagccgg cgcagacccg ggagctgagg 634
aaccggatct ttcccaccct ctatgccatc ttccagcaga gtcgggggca ggagcaggaa 694
ctgaaagtca agaaccgttc tctgcgctcg ctcacccgca gttgtctgta tcacaagagc 754
gaagaccaac ttcagcgcac tctcgaggac gccgaggctc tcttcaacaa gtactgcgcg 814
ctcactctta aagagtagcc cgcgcccgcc cagtcgcaga aaaaggcggg aattacgtca 874
cctgtgccct tcgccctagc cgcctccacc catcatcatg agcaaagaga ttcccacgcc 934
ttacatgtgg agctaccagc cccagatggg cctggccgcc ggcgccgccc aggactactc 994
cacccgcatg aattggctca gcgccgggcc cgcgatgatc tcacgggtga atgacatccg 1054
cgcccaccga aaccagatac tcctagaaca gtcagcgctc accgccacgc cccgcaatca 1114
cctcaatccg cgtaattggc ccgccgccct ggtgtaccag gaaattcccc agcccacgac 1174
cgtactactt ccgcgagacg cccaggccga agtccagctg actaactcag gtgtccagct 1234
ggcgggcggc gccaccctgt gtcgtcaccg ccccgctcag ggtataaagc ggctggtgat 1294
ccggggcaga ggcacacagc tcaacgacga ggtggtgagc tcttcgctgg gtctgcgacc 1354
tgacggagtc ttccaactcg ccggatcggg gagatcttcc ttcacgcctc gtcaggcggt 1414
cctgactttg gaaagttcgt cctcgcagcc ccgctcgggc ggcatcggca ctctccagtt 1474
cgtggaggag ttcactccct cggtctactt caaccccttc tccggctccc ccggccacta 1534
cccggacgag ttcatcccga actttgacgc catcagcgag tcggtggacg gctacgattg 1594
aatgtcccat ggtggcgcgg ctgacctagc tcggcttcga caccttgacc actgccgccg 1654
ctttcgctgc ttcgcacggg acctcgccga gttcacctac ttcgagctgc ccgaggagca 1714
tcctcagggc ccggcccacg gagtgcggat cgtcgtcgaa gggggcctag actcccacct 1774
gcttcggatc ttcagccagc gcccgatcct ggtcgagcgc caacagggca acaccctcct 1834
gaccctctac tgcatctgcg accaccccgg cctgc atg aaa gtc ttt gtt gtc 1887
Met Lys Val Phe Val Val
190
tgc tgt gta ctg agt ata ata aaa gct gag atc agc gac tac tcc gga 1935
Cys Cys Val Leu Ser Ile Ile Lys Ala Glu Ile Ser Asp Tyr Ser Gly
195 200 205
ctc aac tgt ggt gtt tct gca tcc atc aat cgg tca ctg acc ttc acc 1983
Leu Asn Cys Gly Val Ser Ala Ser Ile Asn Arg Ser Leu Thr Phe Thr
210 215 220
ggg aac gag acc gag ctc cag ctc cag tgt aag ccc cac aag aag tac 2031
Gly Asn Glu Thr Glu Leu Gln Leu Gln Cys Lys Pro His Lys Lys Tyr
225 230 235 240
ctc acc tgg ctg tac cag ggc tcc ccg atc gcc gtt gtt aac cac tgc 2079
Leu Thr Trp Leu Tyr Gln Gly Ser Pro Ile Ala Val Val Asn His Cys
245 250 255
gac gac gac gga gtc ctg ctg aac ggc ccc gcc aac ctt act ttt tcc 2127
Asp Asp Asp Gly Val Leu Leu Asn Gly Pro Ala Asn Leu Thr Phe Ser
260 265 270
acc cgc aga agc aag ctc gag ctc ttc caa ccc ttc ctc ccc ggg acc 2175
Thr Arg Arg Ser Lys Leu Glu Leu Phe Gln Pro Phe Leu Pro Gly Thr
275 280 285
tat cag tgc atc tcg gga ccc tgc cat cac acc ttc cac ctg atc ccg 2223
Tyr Gln Cys Ile Ser Gly Pro Cys His His Thr Phe His Leu Ile Pro
290 295 300
aat acc acc tct tcc cca gcg ccg ctc ccc act aac aac caa act aac 2271
Asn Thr Thr Ser Ser Pro Ala Pro Leu Pro Thr Asn Asn Gln Thr Asn
305 310 315 320
cac cac caa cgc tac cga cgc gac ctc gtt gaa tct aat acc acc cac 2319
His His Gln Arg Tyr Arg Arg Asp Leu Val Glu Ser Asn Thr Thr His
325 330 335
acc gga ggt gag ctc cga ggt cct gaa tcc tct ggg att tat tac ggc 2367
Thr Gly Gly Glu Leu Arg Gly Pro Glu Ser Ser Gly Ile Tyr Tyr Gly
340 345 350
ccc tgg gag gtg gtg ggg tta ata gct tta ggc tta gta gcg ggt ggg 2415
Pro Trp Glu Val Val Gly Leu Ile Ala Leu Gly Leu Val Ala Gly Gly
355 360 365
ctt ttg gct ctc tgc tac cta tac ctc cct tgc ttt tcc tac tta gtg 2463
Leu Leu Ala Leu Cys Tyr Leu Tyr Leu Pro Cys Phe Ser Tyr Leu Val
370 375 380
gtg ctt tgt tgc tgg ttt aag aaa tgg gga aga tca ccc tagtgtgcgg 2512
Val Leu Cys Cys Trp Phe Lys Lys Trp Gly Arg Ser Pro
385 390 395
tgtgctggtg acggtggtgc tttcgattct gggaggggga agcgcggctg tagtgacgga 2572
gaagaaggcc gatccctgct tgactttcaa ccccgataaa tgccggctga gttttcagcc 2632
cgatggcaat cggtgcgcgg tgttgatcaa gtgcggatgg gaatgcgaga gtgtggcgat 2692
tcagtataaa aacaagacgc ggaacaatac tctcgcgtcc acatggcagc ccggggaccc 2752
cgagtggtac accgtctctg tccctggtgc tgacggctcc ctccgcacgg tgaacaacac 2812
tttcattttt gagcacatgt gcgataccgc catgttcatg agcaagcagt acggtatgtg 2872
gcccccacga aaagagaata tcgtggtctt ctccatcgct tacagcgcgt gcacggtgct 2932
aatcaccgcg atcgtgtgcc tgagcattca catgctcatc gctattcgcc ccagaaataa 2992
tgccgagaaa gagaaacagc cataacacac ttttttcaca cacctttttc agaccatggc 3052
ctctgttaca atccttattt attttttggg ccttgtgggc actatcagca gtttcgacca 3112
taaaaacgta actgcttatg ttggttctaa ctgtgtacta tctgggtacc agtcacatca 3172
gcgggtttca tggtactggt ttgataaaaa gaacacagct tatacactct gcaaaggcta 3232
tcagcagccc acacatcgca gtggtcttta ttacagctgc accaataata atatcacact 3292
acttcaagta accaaccaat attctgggac ctactatgga accaatttta acacaaaaca 3352
ggacacttac tatagtgtca gagtattgga tccaactact cccagaacta ctactaaaca 3412
taccacaact aagaagccca ctacacctaa aaagcctacc acgcccaaaa ccactaagac 3472
aacaactgct aagcagacca ctaccacaga gccaaccaca accagcacca cacttgctat 3532
aactacacac actgagctga cctcacaggc aactactgaa aatggttttg ccctattgca 3592
aaagggggag aacagtagca gcagtcctct gcctactacc cccagtgagg aaatacccaa 3652
gtccatggtt ggcattatcg ctgctgtagt ggtgtgtatg gtgattatca tcttgtgcat 3712
gatgtactat gcctgctact acagaaaaca caggctaaac aataagctgg accccctact 3772
gaatgttgat ttttaatttt ttagaaccat gaagatccta agcctttttg ttttttctat 3832
cattacctct gctatttgta aatcagtgga taaggacgtt actgtcacca ctggctctaa 3892
ttatacacta aaagggcctt cctcaggtat gctttcgtgg tattgttatt ttggaaatga 3952
tgataaacag acagagcttt gtaatttcca gaacggaaaa accaaaaatt ctaaaataga 4012
taactatcaa tgccatggta ctgatttagt actgatgaat atcacgaaag catatgctgg 4072
cagttattcc tgtcctggac aaaacaccga agaaatgatt ttttacaaat tgattgtggt 4132
tgatcccact acacctccac ccaccacaac taccaatgca cctaccacag acacacagga 4192
aaccactcca gaggcagtag cagagttagc aaagcagatt catgaagatt cctttgttgc 4252
taatactccc acacaccccg gaccgcaatg tccagggtta gtagtcagcg gcattgtcgg 4312
tgtgctttgc gggttagcag ttataatcat ctgcatgttc atttttgctt gctgctacag 4372
aaggcttcac cgacaaaaat cagacccact gctgaacctc tatgtttaat ttttgatttt 4432
ccagagccat gaaggcactt agcactttag tatttttgtc cttgattggc attgttttca 4492
gtgctgggtt tttgaaaaat cttaccatta ttgaaggcga taatgcaaca ctggtaggaa 4552
tcagtggtca gaatgttagt tggctaaaat atcatctaga tgggtggaaa cctatttgca 4612
cctggaatgt cagtgtgtac acatgtcatg gtgttaacct caccattacc aatgccaccc 4672
aagatcagaa tggcaggttt aagggtcaga gtttcactag caacaatggc tatgaaaccc 4732
ataacatgtt catctatgat gtcactgtca tatcaaataa gactacacct accacccaaa 4792
cacccactac acatagctca actcatgcca tgcagaccac tcagaccacc acttacacta 4852
catccattca gcccaccacc actacagcag aggtaaccag cacagcgcct cagccccaag 4912
cattggcttt gatggctgca cagcctagca gcatgactgc taaaaccaat gagcagacta 4972
ctgaattttt gtccactact cagagcagca ccacagctac ctcgagtgcc ttctctagca 5032
ccgccaatct cacctcgctt tcttctacgc caatcagtaa tgctactacc tcccccgctc 5092
ctcttcccac tcctctgaag caatccgagt ctagcacgca gctgcagatc accctgctca 5152
ttgtgatcgg ggtggtcatc ctggcagtgc tgctctactt tatcttctgc cgccgcatcc 5212
ccaacgcgaa accggcctac aagcccattg ttatcgggac gccggaaccg cttcaggtgg 5272
agggaggtct aaggaatctt ctcttctctt ttacagtatg gtgatttgaa ctatgattcc 5332
tagacatttc attatcactt ctctaatctg tgtgctccaa gtctgtgcca ccctcgctct 5392
cgtggctaac gcgagtccag actgcattgg agcgttcgcc tcctacgtgc tctttgcctt 5452
catcacctgc atctgctgct gtagcatagt ctgcctgctt atcaccttct tccagttcgt 5512
tgactgggtc tttgtgcgca tcgcctacct gcgccaccat ccccagtacc gcgaccagag 5572
agtggcgcaa ctgttgagac tcatctgatg ataagcatgc gggctctgct actactcctc 5632
gcgcttctgc tagttcccct cgccgccccc ttatccttca aatcccccac ccagtcccct 5692
gaagaggttc gaaaatgtaa attccaagaa ccctggaaat tcctttcatg ctacaaactc 5752
aaatcagaaa tgcaccccag ctggatcatg atcattggaa tcgtgaacat ccttgcctgt 5812
accctcttct cctttgtgat ttacccccgc tttgactttg ggtggaacgc acccgaggcg 5872
ctctggctcc cgcctgatcc cgacatacca ccacagcagc agcaaaatca ggcacacgca 5932
ccaccaccac agcctaggcc acaatacatg cccatcttag actatgaggc cgagccacag 5992
cgagccatgc ttcctgctat tagttacttc aacctaaccg gcggag atg act gac 6047
Met Thr Asp
400
ccc atg gcc aac aac acc gtc aac gac ctc ctg gac atg gac ggc cgc 6095
Pro Met Ala Asn Asn Thr Val Asn Asp Leu Leu Asp Met Asp Gly Arg
405 410 415
gcc tcg gag cag cga ctc gcc caa ctc cgc atc cgc cag cag cag gag 6143
Ala Ser Glu Gln Arg Leu Ala Gln Leu Arg Ile Arg Gln Gln Gln Glu
420 425 430
aga gcc gtc aag gag ctg cag gat gcg gtg gct atc cac cag tgc aag 6191
Arg Ala Val Lys Glu Leu Gln Asp Ala Val Ala Ile His Gln Cys Lys
435 440 445
aga ggc atc ttc tgc ctg gtg aag cag gcc aag atc acc ttc gag gtg 6239
Arg Gly Ile Phe Cys Leu Val Lys Gln Ala Lys Ile Thr Phe Glu Val
450 455 460
act tcc acc gac cat cgc ctc tcc tac gag ctc ctg cag cag cgc cag 6287
Thr Ser Thr Asp His Arg Leu Ser Tyr Glu Leu Leu Gln Gln Arg Gln
465 470 475 480
aag ttc acc tgc ctg gtc gga gtc aac ccc atc gtc atc acc cag cag 6335
Lys Phe Thr Cys Leu Val Gly Val Asn Pro Ile Val Ile Thr Gln Gln
485 490 495
tct ggc gat acc aag ggg tgc atc cac tgc tcc tgc gac tcc ccc gag 6383
Ser Gly Asp Thr Lys Gly Cys Ile His Cys Ser Cys Asp Ser Pro Glu
500 505 510
tgc ctt cac acc ctg gtc aag acc ctc tgc ggc ctc cgc gac ctc ctc 6431
Cys Leu His Thr Leu Val Lys Thr Leu Cys Gly Leu Arg Asp Leu Leu
515 520 525
ccc atg aac taatcaacta 6450
Pro Met Asn
530
<210> 187
<211> 186
<212> PRT
<213> Simian adenovirus 26
<400> 187
Met Pro Arg Gly Asn Lys Lys Leu Lys Val Glu Leu Pro Pro Val Glu
1 5 10 15
Asp Leu Glu Glu Asp Trp Glu Asn Ser Ser Gln Ala Glu Glu Met Glu
20 25 30
Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp Ser
35 40 45
Leu Glu Glu Asp Glu Glu Glu Ala Glu Glu Glu Val Glu Glu Ala Ala
50 55 60
Ala Ala Arg Pro Ser Ser Ser Ala Gly Glu Lys Ala Ser Ser Thr Asp
65 70 75 80
Thr Ile Ser Ala Pro Gly Arg Gly Pro Ala Arg Pro His Ser Arg Trp
85 90 95
Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Gly Lys Lys
100 105 110
Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly His Lys Asn Ala Ile Val
115 120 125
Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile Ser Phe Thr Arg Arg Tyr
130 135 140
Leu Leu Phe His Arg Gly Val Asn Phe Pro Arg Asn Ile Leu His Tyr
145 150 155 160
Tyr Arg His Leu His Ser Pro Tyr Tyr Phe Gln Glu Glu Ala Ala Ala
165 170 175
Ala Glu Lys Asp Gln Gln Lys Thr Ser Ser
180 185
<210> 188
<211> 211
<212> PRT
<213> Simian adenovirus 26
<400> 188
Met Lys Val Phe Val Val Cys Cys Val Leu Ser Ile Ile Lys Ala Glu
1 5 10 15
Ile Ser Asp Tyr Ser Gly Leu Asn Cys Gly Val Ser Ala Ser Ile Asn
20 25 30
Arg Ser Leu Thr Phe Thr Gly Asn Glu Thr Glu Leu Gln Leu Gln Cys
35 40 45
Lys Pro His Lys Lys Tyr Leu Thr Trp Leu Tyr Gln Gly Ser Pro Ile
50 55 60
Ala Val Val Asn His Cys Asp Asp Asp Gly Val Leu Leu Asn Gly Pro
65 70 75 80
Ala Asn Leu Thr Phe Ser Thr Arg Arg Ser Lys Leu Glu Leu Phe Gln
85 90 95
Pro Phe Leu Pro Gly Thr Tyr Gln Cys Ile Ser Gly Pro Cys His His
100 105 110
Thr Phe His Leu Ile Pro Asn Thr Thr Ser Ser Pro Ala Pro Leu Pro
115 120 125
Thr Asn Asn Gln Thr Asn His His Gln Arg Tyr Arg Arg Asp Leu Val
130 135 140
Glu Ser Asn Thr Thr His Thr Gly Gly Glu Leu Arg Gly Pro Glu Ser
145 150 155 160
Ser Gly Ile Tyr Tyr Gly Pro Trp Glu Val Val Gly Leu Ile Ala Leu
165 170 175
Gly Leu Val Ala Gly Gly Leu Leu Ala Leu Cys Tyr Leu Tyr Leu Pro
180 185 190
Cys Phe Ser Tyr Leu Val Val Leu Cys Cys Trp Phe Lys Lys Trp Gly
195 200 205
Arg Ser Pro
210
<210> 189
<211> 134
<212> PRT
<213> Simian adenovirus 26
<400> 189
Met Thr Asp Pro Met Ala Asn Asn Thr Val Asn Asp Leu Leu Asp Met
1 5 10 15
Asp Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln Leu Arg Ile Arg Gln
20 25 30
Gln Gln Glu Arg Ala Val Lys Glu Leu Gln Asp Ala Val Ala Ile His
35 40 45
Gln Cys Lys Arg Gly Ile Phe Cys Leu Val Lys Gln Ala Lys Ile Thr
50 55 60
Phe Glu Val Thr Ser Thr Asp His Arg Leu Ser Tyr Glu Leu Leu Gln
65 70 75 80
Gln Arg Gln Lys Phe Thr Cys Leu Val Gly Val Asn Pro Ile Val Ile
85 90 95
Thr Gln Gln Ser Gly Asp Thr Lys Gly Cys Ile His Cys Ser Cys Asp
100 105 110
Ser Pro Glu Cys Leu His Thr Leu Val Lys Thr Leu Cys Gly Leu Arg
115 120 125
Asp Leu Leu Pro Met Asn
130
<210> 190
<211> 880
<212> DNA
<213> Simian adenovirus 26
<220>
<221> CDS
<222> (6)..(570)
<223> label=Ela
<220>
<221> CDS
<222> (666)..(871)
<223> label=Ela
<400> 190
gaaag atg agg cac ctg aga gac ctg ccc gat gag aaa atc atc atc gct 50
Met Arg His Leu Arg Asp Leu Pro Asp Glu Lys Ile Ile Ile Ala
1 5 10 15
tcc ggg aac gag att ctg gaa ctg gtg gta aat gcc atg atg ggc gac 98
Ser Gly Asn Glu Ile Leu Glu Leu Val Val Asn Ala Met Met Gly Asp
20 25 30
gac cct ccg gag ccc ccc acc cca ttt gag gca cct tcg ctg cac gat 146
Asp Pro Pro Glu Pro Pro Thr Pro Phe Glu Ala Pro Ser Leu His Asp
35 40 45
ttg tat gat ctg gag gtg gat gtg ccc gat gac gac ccc aac gag gag 194
Leu Tyr Asp Leu Glu Val Asp Val Pro Asp Asp Asp Pro Asn Glu Glu
50 55 60
gcg gta aat gat tta ttt agc gat gcc gcg ctg cta gct gcc gag gag 242
Ala Val Asn Asp Leu Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Glu
65 70 75
gct tcg agc cct agc tca gac agc gac tct tca ctg cat acc cct aga 290
Ala Ser Ser Pro Ser Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg
80 85 90 95
cct ggc aga ggt gag aaa aag atc ccc gag ctt aaa ggg gaa gag atg 338
Pro Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu Met
100 105 110
gac ttg cgc tgc tat gag gaa tgc ttg ccc ccg agc gat gat gag gac 386
Asp Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu Asp
115 120 125
gag cag gcg atc cag aac gca gcg agc cag gga gtg caa gcc gcc agc 434
Glu Gln Ala Ile Gln Asn Ala Ala Ser Gln Gly Val Gln Ala Ala Ser
130 135 140
gag agc ttt gcg ctg gac tgc ccg cct ctg ccc gga cac ggc tgt aag 482
Glu Ser Phe Ala Leu Asp Cys Pro Pro Leu Pro Gly His Gly Cys Lys
145 150 155
tct tgt gaa ttt cat cgc atg aat act gga gat aaa gct gtg ttg tgt 530
Ser Cys Glu Phe His Arg Met Asn Thr Gly Asp Lys Ala Val Leu Cys
160 165 170 175
gca ctt tgc tat atg aga gct tac aac cat tgt gtt tac a gtaagtgtga 580
Ala Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr
180 185
ttaagttgaa ctttagaggg aggcagagag cagggtgact gggcgatgac tggtttattt 640
atgtatatat atgttcttta tatag gt ccc gtc tct gac gca gat gat gag 691
Ser Pro Val Ser Asp Ala Asp Asp Glu
190 195
acc ccc act aca gag tcc act tcg tca ccc cca gaa att ggc aca tct 739
Thr Pro Thr Thr Glu Ser Thr Ser Ser Pro Pro Glu Ile Gly Thr Ser
200 205 210
cca cct gag aat att gtt aga cca gtt cct gtt aga gcc act ggg agg 787
Pro Pro Glu Asn Ile Val Arg Pro Val Pro Val Arg Ala Thr Gly Arg
215 220 225
aga gca gct gtg gaa tgt ttg gat gac ttg cta cag ggt ggg gat gaa 835
Arg Ala Ala Val Glu Cys Leu Asp Asp Leu Leu Gln Gly Gly Asp Glu
230 235 240 245
cct ttg gac ttg tgt acc cgg aaa cgc ccc agg cac taagtgcca 880
Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro Arg His
250 255
<210> 191
<211> 257
<212> PRT
<213> Simian adenovirus 26
<400> 191
Met Arg His Leu Arg Asp Leu Pro Asp Glu Lys Ile Ile Ile Ala Ser
1 5 10 15
Gly Asn Glu Ile Leu Glu Leu Val Val Asn Ala Met Met Gly Asp Asp
20 25 30
Pro Pro Glu Pro Pro Thr Pro Phe Glu Ala Pro Ser Leu His Asp Leu
35 40 45
Tyr Asp Leu Glu Val Asp Val Pro Asp Asp Asp Pro Asn Glu Glu Ala
50 55 60
Val Asn Asp Leu Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Glu Ala
65 70 75 80
Ser Ser Pro Ser Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg Pro
85 90 95
Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu Met Asp
100 105 110
Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu Asp Glu
115 120 125
Gln Ala Ile Gln Asn Ala Ala Ser Gln Gly Val Gln Ala Ala Ser Glu
130 135 140
Ser Phe Ala Leu Asp Cys Pro Pro Leu Pro Gly His Gly Cys Lys Ser
145 150 155 160
Cys Glu Phe His Arg Met Asn Thr Gly Asp Lys Ala Val Leu Cys Ala
165 170 175
Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr Ser Pro Val Ser
180 185 190
Asp Ala Asp Asp Glu Thr Pro Thr Thr Glu Ser Thr Ser Ser Pro Pro
195 200 205
Glu Ile Gly Thr Ser Pro Pro Glu Asn Ile Val Arg Pro Val Pro Val
210 215 220
Arg Ala Thr Gly Arg Arg Ala Ala Val Glu Cys Leu Asp Asp Leu Leu
225 230 235 240
Gln Gly Gly Asp Glu Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro Arg
245 250 255
His
<210> 192
<211> 830
<212> DNA
<213> Simian adenovirus 26
<220>
<221> CDS
<222> (4)..(331)
<223> label=33K
<220>
<221> CDS
<222> (501)..(829)
<223> label=33K
<400> 192
agg atg ccc cga gga aac aag aag ctg aaa gtg gag ctg ccg ccc gtg 48
Met Pro Arg Gly Asn Lys Lys Leu Lys Val Glu Leu Pro Pro Val
1 5 10 15
gag gat ttg gag gaa gac tgg gag aac agc agt cag gca gag gag atg 96
Glu Asp Leu Glu Glu Asp Trp Glu Asn Ser Ser Gln Ala Glu Glu Met
20 25 30
gag gaa gac tgg gac agc act cag gca gag gag gac agc ctg caa gac 144
Glu Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp
35 40 45
agt ctg gag gaa gac gag gag gag gca gag gag gag gtg gaa gaa gca 192
Ser Leu Glu Glu Asp Glu Glu Glu Ala Glu Glu Glu Val Glu Glu Ala
50 55 60
gcc gcc gcc aga ccg tcg tcc tcg gcg ggg gag aaa gca agc agc acg 240
Ala Ala Ala Arg Pro Ser Ser Ser Ala Gly Glu Lys Ala Ser Ser Thr
65 70 75
gat acc atc tcc gct ccg ggt cgg ggt ccc gct cgg ccc cac agt aga 288
Asp Thr Ile Ser Ala Pro Gly Arg Gly Pro Ala Arg Pro His Ser Arg
80 85 90 95
tgg gac gag acc ggg cga ttc ccg aac ccc acc acc cag acc g 331
Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr
100 105
gtaagaagga gcggcaggga tacaagtcct ggcgggggca caaaaacgcc atcgtctcct 391
gcttgcaggc ctgcgggggc aacatctcct tcacccggcg ctacctgctc ttccaccgcg 451
gggtgaactt cccccgcaac atcttgcatt actaccgtca cctccacag cc cct act 508
Ala Pro Thr
act tcc aag aag agg cag cag cgg cag aaa aag acc agc aga aaa cca 556
Thr Ser Lys Lys Arg Gln Gln Arg Gln Lys Lys Thr Ser Arg Lys Pro
115 120 125
gca gct aga aaa tcc aca gcg gcg gca ggt gga ctg agg atc gcg gcg 604
Ala Ala Arg Lys Ser Thr Ala Ala Ala Gly Gly Leu Arg Ile Ala Ala
130 135 140
aac gag ccg gcg cag acc cgg gag ctg agg aac cgg atc ttt ccc acc 652
Asn Glu Pro Ala Gln Thr Arg Glu Leu Arg Asn Arg Ile Phe Pro Thr
145 150 155 160
ctc tat gcc atc ttc cag cag agt cgg ggg cag gag cag gaa ctg aaa 700
Leu Tyr Ala Ile Phe Gln Gln Ser Arg Gly Gln Glu Gln Glu Leu Lys
165 170 175
gtc aag aac cgt tct ctg cgc tcg ctc acc cgc agt tgt ctg tat cac 748
Val Lys Asn Arg Ser Leu Arg Ser Leu Thr Arg Ser Cys Leu Tyr His
180 185 190
aag agc gaa gac caa ctt cag cgc act ctc gag gac gcc gag gct ctc 796
Lys Ser Glu Asp Gln Leu Gln Arg Thr Leu Glu Asp Ala Glu Ala Leu
195 200 205
ttc aac aag tac tgc gcg ctc act ctt aaa gag t 830
Phe Asn Lys Tyr Cys Ala Leu Thr Leu Lys Glu
210 215
<210> 193
<211> 219
<212> PRT
<213> Simian adenovirus 26
<400> 193
Met Pro Arg Gly Asn Lys Lys Leu Lys Val Glu Leu Pro Pro Val Glu
1 5 10 15
Asp Leu Glu Glu Asp Trp Glu Asn Ser Ser Gln Ala Glu Glu Met Glu
20 25 30
Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp Ser
35 40 45
Leu Glu Glu Asp Glu Glu Glu Ala Glu Glu Glu Val Glu Glu Ala Ala
50 55 60
Ala Ala Arg Pro Ser Ser Ser Ala Gly Glu Lys Ala Ser Ser Thr Asp
65 70 75 80
Thr Ile Ser Ala Pro Gly Arg Gly Pro Ala Arg Pro His Ser Arg Trp
85 90 95
Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Ala Pro Thr
100 105 110
Thr Ser Lys Lys Arg Gln Gln Arg Gln Lys Lys Thr Ser Arg Lys Pro
115 120 125
Ala Ala Arg Lys Ser Thr Ala Ala Ala Gly Gly Leu Arg Ile Ala Ala
130 135 140
Asn Glu Pro Ala Gln Thr Arg Glu Leu Arg Asn Arg Ile Phe Pro Thr
145 150 155 160
Leu Tyr Ala Ile Phe Gln Gln Ser Arg Gly Gln Glu Gln Glu Leu Lys
165 170 175
Val Lys Asn Arg Ser Leu Arg Ser Leu Thr Arg Ser Cys Leu Tyr His
180 185 190
Lys Ser Glu Asp Gln Leu Gln Arg Thr Leu Glu Asp Ala Glu Ala Leu
195 200 205
Phe Asn Lys Tyr Cys Ala Leu Thr Leu Lys Glu
210 215
<210> 194
<211> 65
<212> DNA
<213> Artificial
<220>
<223> oligonucleotide SV39T
<400> 194
aattatttaa atcccgggga tcatcgatga tctctagaga tcactagtct aggatatcat 60
ttaaa 65
<210> 195
<211> 63
<212> DNA
<213> Artificial
<220>
<223> oligonucleotide SV39B
<400> 195
tatttaaatg atatcctaga ctagtgatct ctagagatca tcgatgatcc ccgggattta 60
aat 63
<210> 196
<211> 46
<212> DNA
<213> Artificial
<220>
<223> oligomer SV25 Top
<400> 196
aattatttaa atcccgggta tcaagcttga tagatatcat ttaaat 46
<210> 197
<211> 44
<212> DNA
<213> Artificial
<220>
<223> oligomer SV25 Bot
<400> 197
taatttaaat gatatctatc aagcttgata cccgggattt aaat 44
<210> 198
<211> 58
<212> DNA
<213> Artificial
<220>
<223> oligomer pSR6 top
<400> 198
aattttaatt aacccgggta tcggcgcgcc ttaacctagg gatagatatc ttaattaa 58
<210> 199
<211> 56
<212> DNA
<213> Artificial
<220>
<223> oligomer pSR6 bot
<400> 199
tattaattaa gatatctatc cctaggttaa ggcgcgccga tacccgggtt aattaa 56
<210> 200
<211> 660
<212> DNA
<213> Unknown
<220>
<223> Plasmid sequence
<400> 200
gatatcattt ccccgaaaag tgccacctga cgtaactata acggtcctaa ggtagcgaaa 60
gctcagatct cccgatcccc tatggtgcac tctcagtaca atctgctctg atgccgcata 120
gttaagccag tatctgctcc ctgcttgtgt gttggaggtc gctgagtagt gcgcgagcaa 180
aatttaagct acaacaaggc aaggcttgac cgacaattgc atgaagaatc tgcttagggt 240
taggcgtttt gcgctgcttc gcgatgtacg ggccagatat acgcggtacg aaaccgctga 300
tcagcctcga ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct 360
tccttgaccc tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca 420
tcgcattgtc tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag 480
ggggaggatt gggaagacaa tagcaggcat gctggggatg cggtgggctc tatggcttct 540
gaggcggaaa gaaccagcag atctgcagat ctgaattcat ctatgtcggg tgcggagaaa 600
gaggtaatga aatggcatta tgggtattat gggtctgcat taatgaatcg gccagatatc 660
Claims (21)
- (a) SAdV-39의 헥손 단백질, SEQ ID NO: 11의 아미노산 1 내지 940; SAdV-30의 헥손 단백질, SEQ ID NO: 108의 아미노산 1 내지 938; SAdV-25.2의 헥손 단백질, SEQ ID NO: 140의 아미노산 1 내지 933; SAdV-37의 헥손 단백질, SEQ ID NO: 43의 아미노산 1 내지 942; SAdV-38의 헥손 단백질, SEQ ID NO: 75의 아미노산 1 내지 930; SAdV-26의 헥손 단백질, SEQ ID NO: 172의 아미노산 1 내지 937;
(b) SAdV-39의 펜톤 단백질, SEQ ID NO: 6의 아미노산 1 내지 532; SAdV-30의 펜톤 단백질, SEQ ID NO: 103의 아미노산 1 내지 533; SAdV-25.2의 펜톤 단백질, SEQ ID NO: 135의 아미노산 1 내지 531; SAdV-37의 펜톤 단백질, SEQ ID NO: 38의 아미노산 1 내지 542; SAdV-38의 펜톤 단백질, SEQ ID NO: 70의 아미노산 1 내지 539; SAdV-26의 펜톤 단백질, SEQ ID NO: 167의 아미노산 1 내지 546; 및
(c) SAdV-39의 섬유 단백질, SEQ ID NO: 22의 아미노산 1 내지 489, SAdV-30의 섬유 단백질, SEQ ID NO: 118의 아미노산 1 내지 445; SAdV-25.2의 섬유 단백질, SEQ ID NO: 151의 아미노산 1 내지 443; SAdV-37의 섬유 단백질, SEQ ID NO: 54의 아미노산 1 내지 445; SAdV-38의 섬유 단백질, SEQ ID NO: 85의 아미노산 1 내지 425; SAdV-26의 섬유 단백질, SEQ ID NO: 183의 아미노산 1 내지 425;
로 구성되는 군으로부터 선택되는 캡시드 단백질을 포함하는 캡시드를 가지며,
상기 캡시드는 숙주 세포에서 그것의 전사, 번역 및/또는 발현을 지시하는 발현 조절 서열에 작동가능하게 연결된 유전자를 전달하는 이종성 분자를 단백질막으로 싸는 아데노바이러스. - 제 1 항에 있어서, 복제 및 단백질 막화에 필요한 5' 및 3' 아데노바이러스 시스-구성요소를 추가로 포함하는 것을 특징으로 하는 아데노바이러스.
- 제 1 항에 있어서, 상기 아데노바이러스는 E1 유전자의 모두 또는 일부를 결핍하는 것을 특징으로 하는 아데노바이러스.
- 제 3 항에 있어서, 상기 아데노바이러스는 복제-결함인 것을 특징으로 하는 아데노바이러스.
- 제 5 항에 있어서, 상기 바이러스는 하이브리드 캡시드인 것을 특징으로 하는 아데노바이러스.
- 제 5 항에 있어서, 상기 벡터는 SAdV-39, SAdV-30, SAdV-25.2, SAdV-37, SAdV-38, 및 SAdV-26으로부터 선택되는 하나 이상의 캡시드 단백질을 포함하는 것을 특징으로 하는 아데노바이러스.
- SAdV 헥손 단백질의 단편은 길이에 있어서 약 50개의 아미노산의 N-말단 또는 C-말단 절단을 가지는 SEQ ID NO: 11, 108, 140, 43, 75 또는 172의 SAdV 헥손 단백질 또는
SEQ ID NO: 11, 108, 140, 43, 75 또는 172의 아미노산 잔기 125 내지 443;
SEQ ID NO: 11, 108, 140, 43, 75 또는 172의 아미노산 잔기 138 내지 441;
SEQ ID NO: 11, 108, 140, 43, 75 또는 172의 아미노산 잔기 138 내지 163;
SEQ ID NO: 11, 108, 140, 43, 75 또는 172의 아미노산 잔기 170 내지 176; 및
SEQ ID NO: 11, 108, 140, 43, 75 또는 172의 아미노산 잔기 404 내지 430으로 구성되는 군으로부터 선택되는 유인원 아데노바이러스 헥손 단백질의 단편 및 SAdV에 이종성인 핵산 서열을 함유하는 헥손을 포함하는 캡시드를 가지는 재조합 아데노바이러스. - 제 7 항에 있어서, 캡시드는 SAdV-39, SAdV-30, SAdV-25.2, SAdV-37, SAdV-38 또는 SAdV-26 섬유 단백질을 추가로 포함하는 것을 특징으로 하는 재조합 아데노바이러스.
- 제 7 항에 있어서, 캡시드는 추가로 SAdV-39, SAdV-30, SAdV-25.2, SAdV-37, SAdV-38 또는 SAdV-26 펜톤 단백질을 포함하는 것을 특징으로 하는 재조합 아데노바이러스.
- 제 7 항에 있어서, 상기 아데노바이러스는 복제 및 단백질 막화에 필요한 5' 및 3' 아데노바이러스 시스-구성요소를 포함하는 슈도타입화된 아데노바이러스이고, 상기 시스-구성요소는 아데노바이러스 5' 역위 말단 반복 및 아데노바이러스 3' 역위 말단 반복을 포함하는 것을 특징으로 하는 재조합 아데노바이러스.
- 제 7 항에 있어서, 아데노바이러스는 숙주 세포에서 생성물의 발현을 지시하는 서열에 작동가능하게 연결된 생성물을 암호화하는 핵산 서열을 포함하는 것을 특징으로 하는 재조합 아데노바이러스.
- 제 7 항에 있어서, 재조합 아데노바이러스는 하나 이상의 아데노바이러스 유전자를 포함하는 것을 특징으로 하는 재조합 아데노바이러스.
- 제 7 항에 있어서, 재조합 아데노바이러스는 복제-결함인 것을 특징으로 하는 재조합 아데노바이러스.
- 제 13 항에 있어서, 재조합 아데노바이러스는 아데노바이러스 E1에서 결실된 것을 특징으로 하는 재조합 아데노바이러스.
- 약학적으로 허용가능한 담체 중에 제 1 항 내지 제 14 항 중 어느 한 항의 바이러스를 포함하는 조성물.
- 제 1 항 내지 제 14 항 중 어느 한 항에 따르는 바이러스를 피험자에게 전달하는 단계를 포함하는 아데노바이러스 수용체를 가지는 세포를 표적화하는 방법.
- 유인원 아데노바이러스 39 핵산 SEQ ID NO:1의 1 내지 36553 및 그것의 보체;
유인원 아데노바이러스 25.2 핵산 SEQ ID NO: 130의 1 내지 36629 및 그것의 보체;
유인원 아데노바이러스 26 핵산 SEQ ID NO: 162의 1 내지 36628 및 그것의 보체;
유인원 아데노바이러스 30 핵산 SEQ ID NO: 98의 1 내지 36621 및 그것의 보체;
유인원 아데노바이러스 37 핵산 SEQ ID NO: 33의 1 내지 36634 및 그것의 보체; 및
유인원 아데노바이러스 38 핵산 SEQ ID NO: 65의 1 내지 36494 및 그것의 보체
로 구성되는 군으로부터 선택되는 분리된 유인원 아데노바이러스 핵산. - (a) 5' 역위 말단 반복 (ITR) 서열;
(b) 아데노바이러스 E1a 영역;
(c) 아데노바이러스 E1b 영역, 또는 작은 T, 거대한 T, IX, 및 IVa2영역에 대한 오픈리딩프레임으로 구성되는 군 중에서 선택되는 그것의 단편;
(d) pTP, 폴리머라아제, 및 IVa 영역에 대한 오픈리딩프레임을 포함하는 E2b 영역;
(e) L1 영역, 또는 52/55 kD 단백질, 및 IIIa 단백질에 대한 오픈리딩프레임으로 구성되는 군 중에서 선택되는 그것의 단편;
(f) L2 영역, 또는 펜톤, VII, VI, 및 X 단백질에 대한 오픈리딩프레임으로 구성되는 군으로부터 선택되는 그것의 단편;
(g) L3 영역, 또는 VI, 헥손, 또는 엔도프로테아제에 대한 오픈리딩프레임으로 구성되는 군으로부터 선택되는 그것의 단편;
(h) DNA-결합 단백질(DBP)에 대한 오픈리딩프레임을 포함하는 E2a 단백질;
(i) L4 영역, 또는 100 kD 단백질, 33 kD 상동체, 22kD 상동체 및 VIII에 대한 오픈리딩프레임으로 구성되는 군으로부터 선택되는 그것의 단편;
(j) E3 영역, 또는 12.5K, CR1-알파, gp19K, CR1-베타, CR1-감마, CR1-델타, RID-알파, RID-베타, 및 14.7 K에 대한 오픈리딩프레임으로 구성되는 군으로부터 선택되는 그것의 단편;
(k) L5 영역, 또는 섬유 단백질에 대한 오픈리딩프레임으로부터 선택되는 그것의 단편;
(l) E4 영역, 또는 E4 ORF6/7, E4 ORF6, E4 ORF4, E4 ORF3, E4 ORF2, 및 E4 ORF1에 대한 오픈리딩프레임으로 구성되는 군으로부터 선택되는 그것의 단편; 및
(m) 유인원 아데노바이러스 39, SEQ ID NO:1; SAdV-25.2, SEQ ID NO: 130; 유인원 아데노바이러스 26, SEQ ID NO: 162, 유인원 아데노바이러스 30, SEQ ID NO: 98; 유인원 아데노바이러스 37, SEQ ID NO: 33; 유인원 아데노바이러스 38, SEQ ID NO: 65의 3' ITR
로 구성되는 하나 이상의 군으로부터 선택되는 유인원 아데노바이러스 핵산 서열을 포함하는 벡터. - 제 18 항에 따르는 핵산 서열에 의해 암호화되는 유인원 아데노바이러스 단백질.
- SEQ ID NO:30, 127, 159, 62, 95 및 191의 아미노산 서열로부터 선택되는 E1a;
SEQ ID NO:24, 120, 153, 56, 89, 및 185의 아미노산 서열로부터 선택되는 E1b, 작은 T/19K;
SEQ ID NO: 2, 99, 131, 34, 66, 및 163의 아미노산 서열로부터 선택되는 E1b, 거대 T/55K;
SEQ ID NO:3, 100, 132, 35, 67, 및 164의 아미노산 서열로부터 선택되는 IX;
SEQ ID NO:4, 101, 133, 36, 68, 및 165의 아미노산 서열로부터 선택되는 52/55D;
SEQ ID NO:5, 102, 134, 37, 69 및 166의 아미노산 서열로부터 선택되는 IIa;
SEQ ID NO:6, 103, 135, 38, 70, 및 167의 아미노산 서열로부터 선택되는 펜톤;
SEQ ID NO: 7, 104, 136, 39, 71, 및 168의 아미노산 서열로부터 선택되는 VII;
SEQ ID NO: 8, 105, 137, 40, 72, 및 169의 아미노산 서열로부터 선택되는 V;
SEQ ID NO: 9, 106, 138, 41, 73, 및 170의 아미노산 서열로부터 선택되는 pX;
SEQ ID NO: 10, 107, 139, 42, 74, 및 171의 아미노산 서열로부터 선택되는 VI;
SEQ ID NO: 11, 108, 140, 43, 75, 및 172의 아미노산 서열로부터 선택되는 헥손;
SEQ ID NO:12, 109, 141, 44, 76, 및 173의 아미노산 서열로부터 선택되는 엔도프로테아제;
SEQ ID NO: 13, 110, 142, 45, 77, 및 174의 아미노산 서열로부터 선택되는 100 kD;
SEQ ID NO: 32, 129, 161, 64, 97, 97, 및 193의 아미노산 서열로부터 선택되는 33 kD;
SEQ ID NO: 26, 122, 155, 58, 91, 및 187의 아미노산 서열로부터 선택되는 22 kD;
SEQ ID NO: 14, 111, 143, 46, 78, 및 175의 아미노산 서열로부터 선택되는 VIII;
SEQ ID NO: 15, 123, 144, 47, 79, 및 176의 아미노산 서열로부터 선택되는 E3/12.5 K;
SEQ ID NO:27, 112, 156, 59, 92, 및 188의 아미노산 서열로부터 선택되는 CR1-알파;
SEQ ID NO: 16, 124, 145, 48, 87, 및 177의 아미노산 서열로부터 선택되는 gp19K;
SEQ ID NO: 17, 113, 146, 49, 80, 및 178의 아미노산 서열로부터 선택되는 CR1-베타;
SEQ ID NO:18, 114, 147, 50, 81, 및 179의 아미노산 서열로부터 선택되는 CR1-감마;
SEQ ID NO: 19, 115, 148, 51, 82, 180의 아미노산 서열로부터 선택되는 CR1-델타;
SEQ ID NO:20, 116, 149, 52, 83, 및 181의 아미노산 서열로부터 선택되는 RID-알파;
SEQ ID NO:21, 117, 150, 53, 93, 및 182의 아미노산 서열로부터 선택되는 RID-베타;
SEQ ID NO:28, 125, 158, 60, 84, 및 189의 아미노산 서열로부터 선택되는 E3/14.7K; 및
SEQ ID NO:22, 118, 151, 54, 85 및 183의 아미노산 서열로부터 선택되는 섬유소로 구성되는 군으로부터 선택되는 하나 이상의 유인원 아데노바이러스 단백질을 포함하는 조성물. - 제 20 항에 따르는 조성물을 피험자에 전달하는 단계를 포함하며, 상기 조성물은 헥손, 펜톤 및 섬유소로부터 선택되는 하나 이상의 유인원 아데노바이러스 SAdV-39, -25.2, -26, -30, -37, 및 -38 단백질인 아데노바이러스 수용체를 가지는 세포를 표적화하는 방법.
Applications Claiming Priority (13)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US446407P | 2007-11-28 | 2007-11-28 | |
US449907P | 2007-11-28 | 2007-11-28 | |
US450707P | 2007-11-28 | 2007-11-28 | |
US453207P | 2007-11-28 | 2007-11-28 | |
US454107P | 2007-11-28 | 2007-11-28 | |
US446107P | 2007-11-28 | 2007-11-28 | |
US61/004,507 | 2007-11-28 | ||
US61/004,541 | 2007-11-28 | ||
US61/004,532 | 2007-11-28 | ||
US61/004,499 | 2007-11-28 | ||
US61/004,461 | 2007-11-28 | ||
US61/004,464 | 2007-11-28 | ||
PCT/US2008/013066 WO2009073104A2 (en) | 2007-11-28 | 2008-11-24 | Simian e adenoviruses sadv-39, -25. 2, -26, -30, -37, and -38 |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020107014132A Division KR101614364B1 (ko) | 2007-11-28 | 2008-11-24 | 유인원 e 아데노바이러스 sadv-39, -25.2, -26, -30, -37, 및 -38 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020167026650A Division KR101761691B1 (ko) | 2007-11-28 | 2008-11-24 | 유인원 e 아데노바이러스 sadv-39, -25.2, -26, -30, -37, 및 -38 |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20150108945A true KR20150108945A (ko) | 2015-09-30 |
KR101662574B1 KR101662574B1 (ko) | 2016-10-05 |
Family
ID=40380705
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020167026650A KR101761691B1 (ko) | 2007-11-28 | 2008-11-24 | 유인원 e 아데노바이러스 sadv-39, -25.2, -26, -30, -37, 및 -38 |
KR1020107014132A KR101614364B1 (ko) | 2007-11-28 | 2008-11-24 | 유인원 e 아데노바이러스 sadv-39, -25.2, -26, -30, -37, 및 -38 |
KR1020157025105A KR101662574B1 (ko) | 2007-11-28 | 2008-11-24 | 유인원 e 아데노바이러스 sadv-39, -25.2, -26, -30, -37, 및 -38 |
Family Applications Before (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020167026650A KR101761691B1 (ko) | 2007-11-28 | 2008-11-24 | 유인원 e 아데노바이러스 sadv-39, -25.2, -26, -30, -37, 및 -38 |
KR1020107014132A KR101614364B1 (ko) | 2007-11-28 | 2008-11-24 | 유인원 e 아데노바이러스 sadv-39, -25.2, -26, -30, -37, 및 -38 |
Country Status (20)
Country | Link |
---|---|
US (3) | US8685387B2 (ko) |
EP (2) | EP2220241B1 (ko) |
JP (3) | JP5740158B2 (ko) |
KR (3) | KR101761691B1 (ko) |
CN (1) | CN101883858B (ko) |
AU (1) | AU2008331906B2 (ko) |
BR (1) | BRPI0819783A2 (ko) |
CA (2) | CA2964396A1 (ko) |
CY (1) | CY1118351T1 (ko) |
DK (1) | DK2220241T3 (ko) |
ES (1) | ES2607029T3 (ko) |
HR (1) | HRP20161573T1 (ko) |
HU (1) | HUE032142T2 (ko) |
LT (1) | LT2220241T (ko) |
MX (2) | MX347246B (ko) |
PL (1) | PL2220241T3 (ko) |
PT (1) | PT2220241T (ko) |
SG (1) | SG10201604001XA (ko) |
SI (1) | SI2220241T1 (ko) |
WO (1) | WO2009073104A2 (ko) |
Families Citing this family (61)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ES2621165T3 (es) | 2007-11-28 | 2017-07-03 | The Trustees Of The University Of Pennsylvania | Adenovirus de simio de la subfamilia B SADV-28, -27, -29, -32, -33 y -35 y usos de los mismos |
BRPI0819783A2 (pt) * | 2007-11-28 | 2015-06-23 | Univ Pennsylvania | Subfamília simiana e adenovírus sadv-39, -25.2, -26, -30, -37 e -38 e usos dos mesmos. |
EP2220217A2 (en) | 2007-11-28 | 2010-08-25 | The Trustees of the University of Pennsylvania | Simian subfamily c adenoviruses sadv-40, -31, and-34 and uses thereof |
EP2250255A2 (en) * | 2008-03-04 | 2010-11-17 | The Trustees of the University of Pennsylvania | Simian adenoviruses sadv-36,-42.1, -42.2, and -44 and uses thereof |
JP5809978B2 (ja) | 2008-10-31 | 2015-11-11 | ザ・トラステイーズ・オブ・ザ・ユニバーシテイ・オブ・ペンシルベニア | サルアデノウイルスSAdV−43、−45、−46、−47、−48、−49および−50ならびにそれらの用途 |
WO2010085984A1 (en) * | 2009-02-02 | 2010-08-05 | Okairos Ag | Simian adenovirus nucleic acid- and amino acid-sequences, vectors containing same, and uses thereof |
KR101761425B1 (ko) | 2009-02-02 | 2017-07-26 | 글락소스미스클라인 바이오로지칼즈 에스.에이. | 시미안 아데노바이러스 핵산- 및 아미노산-서열, 이를 포함하는 벡터 및 이의 용도 |
US8846031B2 (en) | 2009-05-29 | 2014-09-30 | The Trustees Of The University Of Pennsylvania | Simian adenovirus 41 and uses thereof |
WO2011057254A2 (en) * | 2009-11-09 | 2011-05-12 | The United States Of America, As Represented By The Secretary, Department Of Health And Human Services | Simian adenoviral vector-based vaccines |
CA2779632C (en) | 2009-11-09 | 2019-08-20 | Jason Gall | Simian adenovirus and methods of use |
EP2643465B1 (en) * | 2010-11-23 | 2016-05-11 | The Trustees Of The University Of Pennsylvania | Subfamily e simian adenovirus a1321 and uses thereof |
ES2696551T3 (es) | 2010-12-17 | 2019-01-16 | Globeimmune Inc | Composiciones y métodos para el tratamiento o la prevención de la infección por adenovirus-36 humano |
US10221218B2 (en) | 2011-05-10 | 2019-03-05 | The Regents Of The University Of California | Adenovirus isolated from titi monkeys |
US9267112B2 (en) | 2011-05-10 | 2016-02-23 | The Regents Of The University Of California | Adenovirus isolated from Titi Monkeys |
GB201108879D0 (en) * | 2011-05-25 | 2011-07-06 | Isis Innovation | Vector |
TWI575070B (zh) | 2011-07-12 | 2017-03-21 | 傳斯堅公司 | Hbv聚合酶突變體 |
TW201318637A (zh) | 2011-09-29 | 2013-05-16 | Transgene Sa | 免疫療法組成物及用於治療c型肝炎病毒感染之療程(一) |
WO2013045668A2 (en) | 2011-09-29 | 2013-04-04 | Transgene Sa | Immunotherapy composition and regimen for treating hepatitis c virus infection |
KR20150014505A (ko) * | 2012-05-18 | 2015-02-06 | 더 트러스티스 오브 더 유니버시티 오브 펜실바니아 | 아과 e 원숭이 아데노바이러스 a1302, a1320, a1331 및 a1337 및 이것들의 사용 |
PT2880160T (pt) * | 2012-08-03 | 2018-12-17 | Cedars Sinai Medical Center | Isolamento de mutantes que aumentam o tráfego de proteínas para administração de fármacos |
AU2014236207B2 (en) | 2013-03-14 | 2019-05-23 | Salk Institute For Biological Studies | Oncolytic adenovirus compositions |
WO2015191508A1 (en) | 2014-06-09 | 2015-12-17 | Voyager Therapeutics, Inc. | Chimeric capsids |
US10941452B2 (en) | 2014-10-06 | 2021-03-09 | The Trustees Of The University Of Pennsylvania | Compositions and methods for isolation of circulating tumor cells (CTC) |
JP6401871B2 (ja) | 2014-11-05 | 2018-10-10 | ボイジャー セラピューティクス インコーポレイテッドVoyager Therapeutics,Inc. | パーキンソン病の治療のためのaadcポリヌクレオチド |
KR102584655B1 (ko) | 2014-11-14 | 2023-10-06 | 보이저 테라퓨틱스, 인크. | 조절성 폴리뉴클레오티드 |
KR20230169197A (ko) | 2014-11-14 | 2023-12-15 | 보이저 테라퓨틱스, 인크. | 근위축성 측삭 경화증(als)을 치료하는 조성물 및 방법 |
US11697825B2 (en) | 2014-12-12 | 2023-07-11 | Voyager Therapeutics, Inc. | Compositions and methods for the production of scAAV |
WO2016131945A1 (en) | 2015-02-20 | 2016-08-25 | Transgene Sa | Combination product with autophagy modulator |
EA038402B9 (ru) * | 2015-06-12 | 2021-09-22 | Глаксосмитклайн Байолоджикалс Са | Аденовирусные полинуклеотиды и полипептиды |
WO2017096162A1 (en) * | 2015-12-02 | 2017-06-08 | Voyager Therapeutics, Inc. | Assays for the detection of aav neutralizing antibodies |
CN117384961A (zh) | 2016-02-23 | 2024-01-12 | 萨克生物研究学院 | 对病毒动力学影响最小的治疗性腺病毒中的外源基因表达 |
CA3013637A1 (en) | 2016-02-23 | 2017-08-31 | Salk Institute For Biological Studies | High throughput assay for measuring adenovirus replication kinetics |
WO2017189964A2 (en) | 2016-04-29 | 2017-11-02 | Voyager Therapeutics, Inc. | Compositions for the treatment of disease |
WO2017189959A1 (en) | 2016-04-29 | 2017-11-02 | Voyager Therapeutics, Inc. | Compositions for the treatment of disease |
US20190134190A1 (en) | 2016-05-04 | 2019-05-09 | Transgene Sa | Combination therapy with cpg tlr9 ligand |
WO2017201258A1 (en) | 2016-05-18 | 2017-11-23 | Voyager Therapeutics, Inc. | Compositions and methods of treating huntington's disease |
SG11201809699XA (en) | 2016-05-18 | 2018-12-28 | Voyager Therapeutics Inc | Modulatory polynucleotides |
CN110650673B (zh) | 2016-08-30 | 2024-04-09 | 加利福尼亚大学董事会 | 用于生物医学靶向和递送的方法以及用于实践该方法的装置和系统 |
WO2018069316A2 (en) | 2016-10-10 | 2018-04-19 | Transgene Sa | Immunotherapeutic product and mdsc modulator combination therapy |
WO2018111767A1 (en) | 2016-12-12 | 2018-06-21 | Salk Institute For Biological Studies | Tumor-targeting synthetic adenoviruses and uses thereof |
AU2018261790A1 (en) | 2017-05-05 | 2019-11-28 | Voyager Therapeutics, Inc. | Compositions and methods of treating amyotrophic lateral sclerosis (ALS) |
EP3619308A4 (en) | 2017-05-05 | 2021-01-27 | Voyager Therapeutics, Inc. | COMPOSITIONS AND METHODS OF TREATMENT FOR HUNTINGTON'S MORBUS |
JOP20190269A1 (ar) | 2017-06-15 | 2019-11-20 | Voyager Therapeutics Inc | بولي نوكليوتيدات aadc لعلاج مرض باركنسون |
CN111132626B (zh) | 2017-07-17 | 2024-01-30 | 沃雅戈治疗公司 | 轨迹阵列引导系统 |
TWI832036B (zh) | 2017-08-03 | 2024-02-11 | 美商航海家醫療公司 | 用於aav之遞送之組合物及方法 |
US11434502B2 (en) | 2017-10-16 | 2022-09-06 | Voyager Therapeutics, Inc. | Treatment of amyotrophic lateral sclerosis (ALS) |
US20200237799A1 (en) | 2017-10-16 | 2020-07-30 | Voyager Therapeutics, Inc. | Treatment of amyotrophic lateral sclerosis (als) |
JP7366014B2 (ja) * | 2017-10-31 | 2023-10-20 | ヤンセン ファッシンズ アンド プリベンション ベーフェー | アデノウイルス及びその用途 |
CA3077630A1 (en) | 2017-10-31 | 2019-05-09 | Janssen Vaccines & Prevention B.V. | Adenovirus vectors and uses thereof |
KR20200077559A (ko) | 2017-10-31 | 2020-06-30 | 얀센 백신스 앤드 프리벤션 비.브이. | 아데노바이러스 및 이의 용도 |
AU2018359492B2 (en) | 2017-10-31 | 2023-12-14 | Janssen Vaccines & Prevention B.V. | Adenovirus and uses thereof |
US20210301305A1 (en) | 2018-06-13 | 2021-09-30 | Voyager Therapeutics, Inc. | Engineered untranslated regions (utr) for aav production |
CN112770812A (zh) | 2018-07-24 | 2021-05-07 | 沃雅戈治疗公司 | 产生基因治疗制剂的系统和方法 |
WO2020072849A1 (en) | 2018-10-04 | 2020-04-09 | Voyager Therapeutics, Inc. | Methods for measuring the titer and potency of viral vector particles |
WO2020072844A1 (en) | 2018-10-05 | 2020-04-09 | Voyager Therapeutics, Inc. | Engineered nucleic acid constructs encoding aav production proteins |
EP3867389A1 (en) | 2018-10-15 | 2021-08-25 | Voyager Therapeutics, Inc. | Expression vectors for large-scale production of raav in the baculovirus/sf9 system |
KR20210130158A (ko) | 2019-01-31 | 2021-10-29 | 오레곤 헬스 앤드 사이언스 유니버시티 | Aav 캡시드의 전사 의존적 유도 진화를 사용하는 방법 |
WO2022013221A1 (en) | 2020-07-13 | 2022-01-20 | Transgene | Treatment of immune depression |
WO2022165313A1 (en) | 2021-02-01 | 2022-08-04 | Regenxbio Inc. | Gene therapy for neuronal ceroid lipofuscinoses |
WO2022218997A1 (en) | 2021-04-12 | 2022-10-20 | Centre National De La Recherche Scientifique (Cnrs) | Novel universal vaccine presenting system |
WO2023213764A1 (en) | 2022-05-02 | 2023-11-09 | Transgene | Fusion polypeptide comprising an anti-pd-l1 sdab and a member of the tnfsf |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005071093A2 (en) * | 2004-01-23 | 2005-08-04 | Istituto Di Ricerche Di Biologia Molecolare P Angeletti Spa | Chimpanzee adenovirus vaccine carriers |
Family Cites Families (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
SU1364343A1 (ru) | 1984-07-13 | 1988-01-07 | Всесоюзный научно-исследовательский институт генетики и селекции промышленных микроорганизмов | Способ получени человеческого лейкоцитарного интерферона альфа-2 |
GB8607679D0 (en) | 1986-03-27 | 1986-04-30 | Winter G P | Recombinant dna product |
US4732683A (en) | 1986-12-02 | 1988-03-22 | Biospectrum, Inc. | Purification method for alpha interferon |
IL162181A (en) | 1988-12-28 | 2006-04-10 | Pdl Biopharma Inc | A method of producing humanized immunoglubulin, and polynucleotides encoding the same |
US5240846A (en) | 1989-08-22 | 1993-08-31 | The Regents Of The University Of Michigan | Gene therapy vector for cystic fibrosis |
US6174666B1 (en) | 1992-03-27 | 2001-01-16 | The United States Of America As Represented By The Department Of Health And Human Services | Method of eliminating inhibitory/instability regions from mRNA |
JPH10507758A (ja) | 1994-10-19 | 1998-07-28 | ジェネティック セラピー,インコーポレイテッド | アデノウイルスおよび免疫抑制剤同時反復投与を伴う遺伝子治療 |
AU704391B2 (en) | 1994-10-28 | 1999-04-22 | Trustees Of The University Of Pennsylvania, The | Improved adenovirus and methods of use thereof |
US5856152A (en) | 1994-10-28 | 1999-01-05 | The Trustees Of The University Of Pennsylvania | Hybrid adenovirus-AAV vector and methods of use therefor |
WO1998010088A1 (en) | 1996-09-06 | 1998-03-12 | Trustees Of The University Of Pennsylvania | An inducible method for production of recombinant adeno-associated viruses utilizing t7 polymerase |
US6083716A (en) | 1996-09-06 | 2000-07-04 | The Trustees Of The University Of Pennsylvania | Chimpanzee adenovirus vectors |
US5922315A (en) | 1997-01-24 | 1999-07-13 | Genetic Therapy, Inc. | Adenoviruses having altered hexon proteins |
US5891994A (en) | 1997-07-11 | 1999-04-06 | Thymon L.L.C. | Methods and compositions for impairing multiplication of HIV-1 |
WO1999014354A1 (en) | 1997-09-19 | 1999-03-25 | The Trustees Of The University Of The Pennsylvania | Methods and vector constructs useful for production of recombinant aav |
CA2304168A1 (en) | 1997-09-19 | 1999-04-01 | The Trustees Of The University Of Pennsylvania | Methods and cell line useful for production of recombinant adeno-associated viruses |
GB9720585D0 (en) | 1997-09-26 | 1997-11-26 | Smithkline Beecham Biolog | Vaccine |
WO1999029334A1 (en) * | 1997-12-12 | 1999-06-17 | Saint Louis University | CtIP, A NOVEL PROTEIN THAT INTERACTS WITH CtBP AND USES THEREFOR |
AU3097399A (en) | 1998-03-20 | 1999-10-11 | Trustees Of The University Of Pennsylvania, The | Compositions and methods for helper-free production of recombinant adeno-associated viruses |
AU5677399A (en) | 1998-08-20 | 2000-03-14 | Wistar Institute Of Anatomy And Biology, The | Methods of augmenting mucosal immunity through systemic priming and mucosal boosting |
US6258595B1 (en) | 1999-03-18 | 2001-07-10 | The Trustees Of The University Of Pennsylvania | Compositions and methods for helper-free production of recombinant adeno-associated viruses |
HUP0204250A3 (en) | 2000-01-31 | 2005-06-28 | Smithkline Beecham Biolog | Use of hiv-protein or -polynucleotide for vaccine produce |
US6740525B2 (en) | 2000-02-09 | 2004-05-25 | Genvec, Inc. | Adenoviral capsid containing chimeric protein IX |
JP4399255B2 (ja) * | 2001-06-22 | 2010-01-13 | ザ・トラステイーズ・オブ・ザ・ユニバーシテイ・オブ・ペンシルベニア | 細菌の形質転換体を迅速に選別する方法および新規なサルアデノウイルス蛋白質 |
US20040136963A1 (en) * | 2001-06-22 | 2004-07-15 | The Trustees Of The University Of Pennsylvania | Simian adenovirus vectors and methods of use |
CA2466431C (en) * | 2001-11-21 | 2014-08-05 | The Trustees Of The University Of Pennsylvania | Simian adenovirus nucleic acid and amino acid sequences, vectors containing same, and methods of use |
US7291498B2 (en) | 2003-06-20 | 2007-11-06 | The Trustees Of The University Of Pennsylvania | Methods of generating chimeric adenoviruses and uses for such chimeric adenoviruses |
JP4754480B2 (ja) * | 2003-06-20 | 2011-08-24 | ザ・トラステイーズ・オブ・ザ・ユニバーシテイ・オブ・ペンシルベニア | キメラアデノウイルスの作成法およびそのようなキメラアデノウイルスの使用 |
US20080004236A1 (en) | 2004-02-06 | 2008-01-03 | Comper Wayne D | High Dose, Short Interval Use of Sulfated Polysaccharides for Treatment of Infections |
AU2005293572B2 (en) | 2004-10-14 | 2011-08-04 | Crucell Holland B.V. | Malaria prime/boost vaccines |
US7745147B2 (en) | 2005-02-12 | 2010-06-29 | Viranative Ab | Methods and uses of antibodies in the purification of interferon |
WO2006120034A1 (en) * | 2005-05-12 | 2006-11-16 | Glaxo Group Limited | Vaccine composition |
WO2008010864A2 (en) | 2006-04-28 | 2008-01-24 | The Trustees Of The University Of Pennsylvania | Modified adenovirus hexon protein and uses thereof |
EP2220217A2 (en) | 2007-11-28 | 2010-08-25 | The Trustees of the University of Pennsylvania | Simian subfamily c adenoviruses sadv-40, -31, and-34 and uses thereof |
ES2621165T3 (es) * | 2007-11-28 | 2017-07-03 | The Trustees Of The University Of Pennsylvania | Adenovirus de simio de la subfamilia B SADV-28, -27, -29, -32, -33 y -35 y usos de los mismos |
BRPI0819783A2 (pt) * | 2007-11-28 | 2015-06-23 | Univ Pennsylvania | Subfamília simiana e adenovírus sadv-39, -25.2, -26, -30, -37 e -38 e usos dos mesmos. |
EP2250255A2 (en) | 2008-03-04 | 2010-11-17 | The Trustees of the University of Pennsylvania | Simian adenoviruses sadv-36,-42.1, -42.2, and -44 and uses thereof |
EP2777185B1 (en) | 2011-10-13 | 2016-08-24 | Telefonaktiebolaget LM Ericsson (publ) | Method and node related to channel estimation |
-
2008
- 2008-11-24 BR BRPI0819783-0A patent/BRPI0819783A2/pt not_active IP Right Cessation
- 2008-11-24 PT PT88571591T patent/PT2220241T/pt unknown
- 2008-11-24 MX MX2014000110A patent/MX347246B/es unknown
- 2008-11-24 SI SI200831730T patent/SI2220241T1/sl unknown
- 2008-11-24 KR KR1020167026650A patent/KR101761691B1/ko active IP Right Grant
- 2008-11-24 MX MX2010005858A patent/MX2010005858A/es active IP Right Grant
- 2008-11-24 AU AU2008331906A patent/AU2008331906B2/en not_active Ceased
- 2008-11-24 KR KR1020107014132A patent/KR101614364B1/ko not_active IP Right Cessation
- 2008-11-24 CA CA2964396A patent/CA2964396A1/en not_active Abandoned
- 2008-11-24 WO PCT/US2008/013066 patent/WO2009073104A2/en active Application Filing
- 2008-11-24 EP EP08857159.1A patent/EP2220241B1/en not_active Not-in-force
- 2008-11-24 LT LTEP08857159.1T patent/LT2220241T/lt unknown
- 2008-11-24 CN CN200880118591.5A patent/CN101883858B/zh not_active Expired - Fee Related
- 2008-11-24 CA CA2706258A patent/CA2706258C/en not_active Expired - Fee Related
- 2008-11-24 ES ES08857159.1T patent/ES2607029T3/es active Active
- 2008-11-24 JP JP2010535987A patent/JP5740158B2/ja not_active Expired - Fee Related
- 2008-11-24 EP EP16188300.4A patent/EP3128010A1/en not_active Withdrawn
- 2008-11-24 DK DK08857159.1T patent/DK2220241T3/en active
- 2008-11-24 US US12/744,441 patent/US8685387B2/en active Active
- 2008-11-24 HU HUE08857159A patent/HUE032142T2/en unknown
- 2008-11-24 PL PL08857159T patent/PL2220241T3/pl unknown
- 2008-11-24 SG SG10201604001XA patent/SG10201604001XA/en unknown
- 2008-11-24 KR KR1020157025105A patent/KR101662574B1/ko active IP Right Grant
-
2014
- 2014-02-12 US US14/178,738 patent/US9359618B2/en active Active
- 2014-12-05 JP JP2014246768A patent/JP2015083006A/ja active Pending
-
2016
- 2016-05-13 US US15/154,368 patent/US20160244783A1/en not_active Abandoned
- 2016-05-18 JP JP2016099660A patent/JP6224769B2/ja active Active
- 2016-11-24 HR HRP20161573TT patent/HRP20161573T1/hr unknown
- 2016-12-13 CY CY20161101281T patent/CY1118351T1/el unknown
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005071093A2 (en) * | 2004-01-23 | 2005-08-04 | Istituto Di Ricerche Di Biologia Molecolare P Angeletti Spa | Chimpanzee adenovirus vaccine carriers |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR101761691B1 (ko) | 유인원 e 아데노바이러스 sadv-39, -25.2, -26, -30, -37, 및 -38 | |
KR101761683B1 (ko) | 유인원 아과 b 아데노바이러스 sadv-28,27,-29,-32,-33, 및 -35 및 그것의 사용 | |
KR101614369B1 (ko) | 유인원 아과 c 아데노바이러스 sadv-40, -31, 및 -34 및 그것의 사용 | |
AU2011332025B2 (en) | Subfamily E simian adenoviruses A1321, A1325, A1295, A1309 and A1322 and uses thereof | |
EP1409748B1 (en) | Recombinant Adenoviruses comprising simian adenovirus proteins and uses thereof. | |
EP1636370A2 (en) | Methods of generating chimeric adenoviruses and uses for such chimeric adenoviruses | |
AU2014203073B2 (en) | Simian E adenovirus SAdV-30 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A107 | Divisional application of patent | ||
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
E701 | Decision to grant or registration of patent right | ||
GRNT | Written decision to grant |