KR20120117977A - Polypeptides that bind il-23r - Google Patents

Polypeptides that bind il-23r Download PDF

Info

Publication number
KR20120117977A
KR20120117977A KR1020127012030A KR20127012030A KR20120117977A KR 20120117977 A KR20120117977 A KR 20120117977A KR 1020127012030 A KR1020127012030 A KR 1020127012030A KR 20127012030 A KR20127012030 A KR 20127012030A KR 20120117977 A KR20120117977 A KR 20120117977A
Authority
KR
South Korea
Prior art keywords
leu
lys
glu
polypeptide
gln
Prior art date
Application number
KR1020127012030A
Other languages
Korean (ko)
Inventor
안케 크레츠-롬멜
마사 와일드
캐서린 보우디시
엘리스 첸
다니엘라 올틴
마리아 곤잘레스
밀리 카푸어
Original Assignee
아나포르, 인코포레이티드
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US12/577,067 external-priority patent/US20100105620A1/en
Application filed by 아나포르, 인코포레이티드 filed Critical 아나포르, 인코포레이티드
Priority claimed from US12/703,752 external-priority patent/US20110086770A1/en
Publication of KR20120117977A publication Critical patent/KR20120117977A/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/52Cytokines; Lymphokines; Interferons
    • C07K14/54Interleukins [IL]
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • A61K38/16Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • A61K38/17Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • A61K38/19Cytokines; Lymphokines; Interferons
    • A61K38/20Interleukins [IL]
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals

Abstract

본 발명은 다합체화, 예를 들어, 삼합체화 도메인 및 IL-23R에 결합하는 폴리펩티드 서열을 갖는 폴리펩티드를 포함하는 IL-23R에 결합하는 폴리펩티드에 관한 것이다. 다합체화 도메인은 인간 테트라넥틴으로부터 유래될 수 있다. IL-23R 결합 폴리펩티드는 자연 IL-23에 의해 IL-23R의 활성화를 억제하고, 이는 다양한 면역 관련 장애 및 암에 대한 치료제로 사용될 수 있다. 폴리펩티드를 선택하는 방법 및 다합체 복합체를 제조하는 방법이 기재된다.The present invention relates to a polypeptide that binds to IL-23R including a multimerization, eg, a polypeptide having a polypeptide sequence that binds to the trimerization domain and IL-23R. The multimerization domain can be derived from human tetranectin. IL-23R binding polypeptides inhibit the activation of IL-23R by natural IL-23, which can be used as a therapeutic for various immune related disorders and cancers. Methods of selecting polypeptides and methods of making multimeric complexes are described.

Figure P1020127012030
Figure P1020127012030

Description

IL-23R에 결합하는 폴리펩티드{POLYPEPTIDES THAT BIND IL-23R}Polypeptide which binds to IL-23R {POLYPEPTIDES THAT BIND IL-23R}

관련 출원의 전후 참조Before and after reference of related application

본 출원은 2009년 10월 9일에 출원된 미국 특허 출원 12/577,067호의 일부계속출원, 2009년 10월 9일에 출원된 국제 출원 PCT/US09/60271호의 일부계속출원, 및 2010년 2월 10일에 출원된 미국 출원 12/703,752호의 일부계속출원이며, 이들 각각의 전체내용은 참조로서 본원에 포함된다.This application is partly filed in U.S. Patent Application No. 12 / 577,067, filed Oct. 9, 2009, partly filed in International Application PCT / US09 / 60271, filed Oct. 9, 2009, and Feb. 10, 2010. Partially filed in US Application 12 / 703,752, filed on May, each of which is incorporated herein by reference in its entirety.

서열 목록 설명Sequence Listing Description

서열 목록이 전자 포맷으로만 본 출원에 제출되었으며, 이는 참조로서 본원에 포함된다. 서열 목록 문서 파일 "10- .SequenceListing.txt"를 에 생성시켰으며, 이는 바이트 크기이다.The Sequence Listing has been submitted to this application in electronic format only, which is incorporated herein by reference. Sequence Listing Document File "10- .SequenceListing.txt " Generated by The byte size.

발명의 분야Field of invention

본 발명은 널리 염증 및 자가면역 질병 뿐만 아니라 암의 치료에 관한 것이다. 특히, 본 발명은 IL-23R 이종이합체 수용체의 IL-23R 서브유닛에 결합하고, IL-23과 이의 수용체의 상호작용을 차단하는 폴리펩티드에 관한 것이다.The present invention relates to the treatment of cancer as well as inflammatory and autoimmune diseases. In particular, the present invention relates to polypeptides that bind to the IL-23R subunit of the IL-23R heterodimeric receptor and block the interaction of IL-23 with its receptor.

발명의 배경BACKGROUND OF THE INVENTION

IL-23은 Th17 세포의 발생 및 생존에 필수적인 사이토카인이다. 전임상 모델 및 임상 실험으로부터, Th17 세포가 류마티스 관절염, 염증성장질환, 건선, 전신홍반루푸스(SLE) 및 다발경화증을 포함하는 많은 자가면역 질병의 병리에서 중요한 역할을 한다는 증거들이 계속 나오고 있다. IL-23R은 Th17 세포에서 중요한 표적이다. IL-23 이종이합체 수용체는 IL-23R 및 IL-12Rβ1의 2개의 서브유닛으로 구성되며, IL-23R은 IL-23 경로에 대해 유일한 서브유닛이다. IL-12Rβ1은 IL-12 수용체와 공유되며, 이에 따라 IL-12 경로와 공유된다. 유사하게, IL-23 사이토카인은 p19 및 p40의 2개의 서브유닛으로 구성되며, p19는 IL-23에 대해 유일하고, p40은 IL-12와 공유된다. 이종이합체 IL-23 수용체에 대한 IL-23의 결합은 특정 T 세포 서브셋, NK 세포 및 골수성 세포의 활성화를 매개한다.IL-23 is a cytokine essential for the development and survival of Th17 cells. From preclinical models and clinical trials, evidence continues to emerge that Th17 cells play an important role in the pathology of many autoimmune diseases, including rheumatoid arthritis, inflammatory growth disease, psoriasis, systemic lupus erythematosus (SLE) and multiple sclerosis. IL-23R is an important target in Th17 cells. The IL-23 heterodimeric receptor consists of two subunits of IL-23R and IL-12Rβ1, and IL-23R is the only subunit for the IL-23 pathway. IL-12Rβ1 is shared with the IL-12 receptor and thus with the IL-12 pathway. Similarly, the IL-23 cytokine consists of two subunits of p19 and p40, p19 is unique for IL-23, and p40 is shared with IL-12. Binding of IL-23 to the heterodimeric IL-23 receptor mediates activation of specific T cell subsets, NK cells and myeloid cells.

중요하게는, IL-23R에서의 유전적 변화는 건선 및 크론병에 대한 감수성과 연관되고, 또한 강직척추염, 보그트-고야나기-하라다병, 전신경화증, 빌리프스 질병(Beliefs disease, BD), 원발성 쇼그렌 증후군, 굿파스처 질병에 대한 감수성과 관련된다. 또한, 이식편대숙주병 및 만성 궤양에서의 IL-23의 중요성이 제안되었고, IL-23은 종양발생과 관련된다.Importantly, genetic changes in IL-23R are associated with susceptibility to psoriasis and Crohn's disease, and also include ankylosing spondylitis, Bog-Koyagi-Harada disease, systemic sclerosis, Beliefs disease (BD), Primary Sjogren's syndrome, related to susceptibility to goodpasture disease. In addition, the importance of IL-23 in graft-versus-host disease and chronic ulcers has been proposed, and IL-23 is associated with tumorigenesis.

IL-23 경로의 차단은 자가면역 질병의 많은 전임상 모델에서 효능이 있다. 그러나, IL-23과 IL-12 경로 사이의 공유된 리간드 및 수용체 서브유닛의 특징은 이전에 인지된 것보다 복잡한 생물학을 발생시키며, IL-12 차단으로부터의 IL-23 차단의 분리는 효능 및 안전성과 관련하여 중요한 치료적 연관을 갖는 것으로 보인다. IL-23 및 IL-12 중 어느 하나 또는 나머지, 또는 둘 모두의 차단은 사이토카인 서브유닛 또는 수용체 서브유닛의 수준에서 수행될 수 있다.Blocking the IL-23 pathway is effective in many preclinical models of autoimmune disease. However, the characteristics of shared ligand and receptor subunits between IL-23 and IL-12 pathways result in more complex biology than previously recognized, and the separation of IL-23 blockade from IL-12 blockade is potent and safe. Seems to have an important therapeutic association with Blocking either or the other, or both, of IL-23 and IL-12 may be performed at the level of cytokine subunits or receptor subunits.

초기 임상 개발중인 쉐링-프라우(Schering Plough)사의 IL-23 특이적 항-p19 항체와 함께, IL-23/IL-12 사이토카인을 표적으로 하는 항체가 승인(예를 들어, p40-표적화 우스테키누맙(Ustekinumab))되거나 임생 개발중(Abbott Laboratories)이나, 하기 이유로 인해 우수한 효능 및 보다 나은 안전성 프로파일을 갖는 IL-23 특이적 차단이 필요하다:Antibodies targeting IL-23 / IL-12 cytokines are approved (e.g., p40-targeted Uste), along with IL-23 specific anti-p19 antibodies from Schering Plough in early clinical development Ustekinumab) or Abbott Laboratories, but there is a need for IL-23 specific blocking with good efficacy and better safety profiles for the following reasons:

● IL-23 이종이합체 수용체의 분포는 염증/질병 조직에서 주로 발견되는 IL-23 이종이합체 수용체 발현 세포로 비교적 제한된다. 대조적으로, IL-23은 전신적으로 검출될 수 있고, 더욱 풍부하다.The distribution of IL-23 heterodimeric receptors is relatively limited to IL-23 heterodimeric receptor expressing cells, which are mainly found in inflammatory / disease tissues. In contrast, IL-23 can be detected systemically and is more abundant.

● IL-23의 p19 서브유닛 상에 수용체를 표적화시키는 것은 류마티스 관절염 환자로부터의 윤활막과 같은 자가면역 조직에서 입증되는 바와 같이 사이토카인이 세포 결합되고/되거나 풍부하지 않은 상황에서 이로운 것으로 밝혀졌다.Targeting receptors on the p19 subunit of IL-23 has been found to be beneficial in situations where cytokines are not cell bound and / or abundant, as evidenced in autoimmune tissues such as synovial membranes from rheumatoid arthritis patients.

● 표적화 수용체는 IL-23 신호전달에 더욱 민감할 수 있는 수용체 변이체(즉, 신호전달에 매우 적은 리간드가 요구되는 낮은 역치 변이체)를 갖는 환자에서 더욱 효과적으로 차단할 것이다.Targeting receptors will block more effectively in patients with receptor variants that may be more sensitive to IL-23 signaling (ie, low threshold variants that require very few ligands for signaling).

또한, 최초에 IL-12를 차단하기 위해 개발되는 경우, 우스테키누맙의 효능이 IL-12 차단을 통해 매개되고, IL-12 경로를 차단하는 것이 하기 관찰을 기초로 하여 유해할 수 있다는 전임상 및 임상적 증거가 존재한다:In addition, if initially developed to block IL-12, preclinical efficacy of Ustekinumab is mediated through IL-12 blockade and blocking the IL-12 pathway may be detrimental based on the following observations. And clinical evidence exists:

● 우스테키누맙을 이용한 건선 시험에서, IL-23-특이적 사이토카인 서브유닛인 p19가 플라크(plaque)에서 하향 조절되었다(그러나, IL-12-특이적 사이토카인 서브유닛인 p35는 그렇지 않았다).In psoriasis trials with Ustekinumab, p19, an IL-23-specific cytokine subunit, was downregulated in plaque (but not p35, an IL-12-specific cytokine subunit). ).

● p19 및 p40 넉아웃 마우스는 실험적 자가면역 질병의 유도에 내성인 반면, IL-12 특이적 서브유닛 p35의 넉아웃은 다수의 실험적 자가면역 질병을 악화시켰다.P19 and p40 knockout mice were resistant to the induction of experimental autoimmune disease, whereas knockout of IL-12 specific subunit p35 exacerbated many experimental autoimmune diseases.

● 우수한 효능에 대한 잠재성 외에, IL-12 및 IL-23 둘 모두에 비한 IL-23의 선택적 차단은 감염에 대한 감수성과 관련된 안전성과 관련하여 현저히 유리한데, 이는 상기 둘 모두의 사이토카인의 차단이 톡소플라스마 곤디이(Toxoplasma gondii), 크립토코커스 네오포르만스(Cryptococcus neoformans), 및 M. 투베르쿨로시스(M. tuberculosis), 및 유사한 다른 병원체에 대한 감수성을 증가시키는 것으로 밝혀졌기 때문이다.In addition to the potential for good efficacy, selective blocking of IL-23 relative to both IL-12 and IL-23 is particularly advantageous with regard to the safety associated with susceptibility to infection, which is the blocking of both cytokines. This has been shown to increase susceptibility to this Toxoplasma gondii, Cryptococcus neoformans, and M. tuberculosis, and similar other pathogens.

● 안전성 장점은 또한 종양유발능력(tumorigenicity)에 대한 잠재성과 관련이 있을 수 있다. 전임상 데이터는 IL-12를 억제하는 것이 종양 성장을 향상시키는 반면, IL-23을 억제하는 것이 종양 성장을 감소시킬 수 있는 것을 제시한다. IL-12p40과 대조적으로, IL-23은 인간 종양에서 과다발현된다. 또한, 뮤린 확인 연구는 IL-23 넉아웃 마우스 또는 항-IL-23 처리 마우스가 종양 형성에 내성이 있는 반면, 상승된 IL-23 수준이 종양 형성을 증가시킬 수 있는 것을 입증한다.• Safety advantages may also be related to the potential for tumorigenicity. Preclinical data suggest that inhibiting IL-12 enhances tumor growth, while inhibiting IL-23 can reduce tumor growth. In contrast to IL-12p40, IL-23 is overexpressed in human tumors. In addition, murine validation studies demonstrate that IL-23 knockout mice or anti-IL-23 treated mice are resistant to tumor formation, while elevated IL-23 levels can increase tumor formation.

따라서, IL-23R을 차단함으로써 IL-23 이종이합체 수용체를 선택적으로 차단하는 분자, 상기 분자를 포함하는 조성물, 상기 분자를 스크리닝하는 방법, 및 광범위한 염증 및 자가면역 질환 및 암의 치료적 치료에서 상기 분자를 이용하는 방법이 당 분야에 필요하다. 이러한 분자는 결합성 효과로 인해 우수한 표적 잔류를 나타내며, 전신 면역을 현저히 손상시키지 않고 상기 장애와 관련된 염증 부위에 대한 치료를 국소화시킨다.Thus, a molecule that selectively blocks IL-23 heterodimeric receptors by blocking IL-23R, a composition comprising the molecule, a method of screening the molecule, and the therapeutic treatment of a wide range of inflammatory and autoimmune diseases and cancers There is a need in the art for using molecules. These molecules show good target retention due to the binding effect and localize treatment to the inflammatory site associated with the disorder without significantly compromising systemic immunity.

발명의 개요Summary of the Invention

일 양태에서, 본 발명은 삼합체화 도메인(trimerizing domain) 및 IL-23 이종이합체 수용체를 활성화시키지 않으면서 인간 IL-23R에 결합하는 적어도 하나의 폴리펩티드 서열을 갖는 폴리펩티드에 관한 것이다. 다른 양태에서, 본 발명의 폴리펩티드는 인간 IL-12Rβ1 또는 인간 IL-12Rβ2 중 적어도 하나에 결합하지 않고, 인간 IL-23R에 대한 결합에 대해 자연 인간 IL-23과 경쟁한다. 삼합체화 도메인은 위치 26, 30, 33, 36, 37, 40, 41, 42, 45, 46, 47, 48, 49, 50 및 51에서 5개 이하의 아미노산 치환을 갖는 인간 테트라넥틴 삼합체화 도메인(SEQ ID NO:108)의 폴리펩티드를 포함할 수 있다. 이러한 폴리펩티드는 삼합체 복합체를 형성할 수 있다. 폴리펩티드는 삼함체 복합체를 형성하기 위해 삼합체화될 수 있다.In one aspect, the invention relates to a polypeptide having a trimerizing domain and at least one polypeptide sequence that binds human IL-23R without activating the IL-23 heterodimeric receptor. In another embodiment, the polypeptides of the invention do not bind to at least one of human IL-12Rβ1 or human IL-12Rβ2 and compete with native human IL-23 for binding to human IL-23R. The trimerization domain is a human tetranectin trimerization domain having up to 5 amino acid substitutions at positions 26, 30, 33, 36, 37, 40, 41, 42, 45, 46, 47, 48, 49, 50 and 51 ( The polypeptide of SEQ ID NO: 108). Such polypeptides may form trimer complexes. Polypeptides may be trimerized to form trimeric complexes.

추가로, 본 발명의 폴리펩티드는 IL-23R에 결합하는 적어도 하나의 폴리펩티드를 포함하고, 삼합체화 도메인의 N-말단 및 C-말단 중 하나에 연결되고, 또한 N-말단 및 C-말단의 나머지에 위치된 염증의 조정자를 포함한다. 본 발명의 폴리펩티드는 또한 N-말단 및 C-말단의 각각에 연결된 IL-23에 결합하는 폴리펩티드를 가질 수 있으며, 상기 N-말단의 폴리펩티드는 C-말단의 폴리펩티드와 동일하거나 상이하다. 폴리펩티드는 또한 이러한 폴리펩티드에 공유적으로 부착된 치료제를 가질 수 있다.In addition, the polypeptides of the invention comprise at least one polypeptide that binds IL-23R and is linked to one of the N-terminus and C-terminus of the trimerization domain, and also to the rest of the N-terminus and C-terminus. Includes coordinators of located inflammation. Polypeptides of the invention may also have a polypeptide that binds to IL-23 linked to each of the N-terminus and C-terminus, wherein the N-terminal polypeptide is the same as or different from the C-terminal polypeptide. Polypeptides may also have therapeutic agents covalently attached to such polypeptides.

또한 추가로, 본 발명의 폴리펩티드는 C-타입 렉틴 유사 도메인(CTLD)을 포함하고, CTLD의 루프 세그먼트 A 또는 루프 세그먼트 B 중 루프 1, 2, 3 또는 4 중 하나는 IL-23에 결합하는 폴리펩티드 서열을 포함한다. 다양한 양태에서, CTLD의 폴리펩티드 서열은 SEQ ID NO:133, 134, 135, 167, 137, 138, 139, 140, 및 141로 구성된 군으로부터 선택된다.Further, the polypeptide of the present invention comprises a C-type lectin-like domain (CTLD), wherein one of loops 1, 2, 3 or 4 of loop segment A or loop segment B of CTLD binds to IL-23. Sequence. In various embodiments, the polypeptide sequence of CTLD is selected from the group consisting of SEQ ID NOs: 133, 134, 135, 167, 137, 138, 139, 140, and 141.

본 발명은 또한 IL-23R을 발현하는 세포에서의 IL-23에 의한 IL-23R의 활성화를 방지하는 방법에 관한 것이다. 상기 방법은 상기 세포와 본 발명의 삼합체 복합체를 접촉시키는 것을 포함한다. 또 다른 양태에서, 본 발명은 삼합체 복합체 및 적어도 하나의 약학적으로 허용되는 부형제를 포함하는 약학적 조성물을 포함한다. 조성물은 면역 장애 또는 암을 치료하기 위해 투여될 수 있다. 조성물은 또한 염증의 조정자, 화학치료제 또는 세포독성제를 포함할 수 있다.The present invention also relates to a method for preventing the activation of IL-23R by IL-23 in cells expressing IL-23R. The method comprises contacting the cell with the trimer complex of the present invention. In another embodiment, the present invention includes a pharmaceutical composition comprising a trimer complex and at least one pharmaceutically acceptable excipient. The composition can be administered to treat an immune disorder or cancer. The composition may also include a modulator of inflammation, a chemotherapeutic agent or a cytotoxic agent.

또한 추가로, 본 발명은 본 발명의 폴리펩티드를 제조하는 방법에 관한 것이다. 상기 방법은 IL-23R에 결합하는 제 1 폴리펩티드를 선택하고, 이러한 제 1 폴리펩티드를 다합체화 도메인의 N-말단 또는 C-말단 중 하나와 융합시키는 것을 포함한다. 상기 방법은 또한 염증의 조정자인 제 2 폴리펩티드 서열을 선택하고, 이러한 제 2 폴리펩티드를 다합체화 도메인의 N-말단 또는 C-말단의 나머지와 융합시키는 것을 포함할 수 있다. 제 1 폴리펩티드는 IL-12Rβ1 또는 IL-12Rβ2 중 적어도 하나와 결합하지 않도록 선택될 수 있다. 폴리펩티드는 IL-23R을 발현하는 세포에서 IL-23R의 활성화를 방지하는 삼합체 복합체를 제조하는데 사용될 수 있다.Further, the present invention relates to a method of producing a polypeptide of the present invention. The method comprises selecting a first polypeptide that binds IL-23R and fusing this first polypeptide with either the N-terminus or C-terminus of the multimerization domain. The method may also include selecting a second polypeptide sequence that is a mediator of inflammation and fusing this second polypeptide with the rest of the N-terminus or C-terminus of the multimerization domain. The first polypeptide may be selected to not bind to at least one of IL-12Rβ1 or IL-12Rβ2. The polypeptide can be used to prepare a trimer complex that prevents the activation of IL-23R in cells expressing IL-23R.

또한 추가로, 본 발명은 자연 IL-23R에 대한 결합에 대해 자연 인간 IL-23과 경쟁하는 폴리펩티드에 관한 것으로, 상기 폴리펩티드는 인간 IL-23R을 활성시키지 않고, IL-12Rβ1 또는 IL-12Rβ2 중 적어도 하나에 결합하지 않는다. 상기 폴리펩티드는 IL-23R에 대한 결합에 대해 루프 세그먼트 A 또는 루프 세그먼트 B 중 루프 1, 2, 3 또는 4 중 하나에서 변형된 CTLD일 수 있고, 이는 SEQ ID NO:133, 134, 135, 167, 137, 138, 139, 140, 및 141 중 하나로부터 선택될 수 있다.Further, the present invention relates to a polypeptide that competes with natural human IL-23 for binding to natural IL-23R, wherein the polypeptide does not activate human IL-23R and at least one of IL-12Rβ1 or IL-12Rβ2. Do not combine in one. The polypeptide can be a CTLD modified in one of loops 1, 2, 3 or 4 of loop segment A or loop segment B for binding to IL-23R, which is SEQ ID NO: 133, 134, 135, 167, 137, 138, 139, 140, and 141.

도면의 설명Description of Drawings

도 1A 및 1B는 인간 IL-23(SEQ ID NO:1), 인간 IL-23R(SEQ ID NO:5), 인간 IL-12Rβ1(SEQ ID NO:6), 인간 IL-12Rβ2(SEQ ID NO:7), 인간 IL-12A(SEQ ID NO:3), 및 인간 IL-12B(SEQ ID NO:2)의 폴리펩티드를 도시한다.1A and 1B show human IL-23 (SEQ ID NO: 1), human IL-23R (SEQ ID NO: 5), human IL-12Rβ1 (SEQ ID NO: 6), human IL-12Rβ2 (SEQ ID NO: 7), polypeptides of human IL-12A (SEQ ID NO: 3), and human IL-12B (SEQ ID NO: 2) are shown.

도 2A, B, C 및 D는 본 발명의 예시적 폴리펩티드와 함께 사용하기 위한 테트라넥틴 삼합체화 모듈 변이체의 예를 도시한다.2A, B, C and D show examples of tetranectin trimerization module variants for use with exemplary polypeptides of the invention.

도 3은 테트라넥틴 단백질 패밀리의 삼합체화 구조 구성요소의 아미노산 서열의 정렬을 도시한다. 인간 테트라넥틴의 엑손 2를 포함하는 잔기 V17 내지 K52에 해당하는 아미노산 서열(1 문자 부호) 및 엑손 3의 처음 3개의 잔기(SEQ ID NO:99); 뮤린 테트라넥틴(SEQ ID NO:100)(Sorensen et al ., Gene, 152: 243-245, 1995); 립상어(reefshark) 연골로부터 분리된 테트라넥틴 동족체 단백질(SEQ ID NO:107)(Neame and Boynton, 1992, 1996); 및 소 연골로부터 분리된 테트라넥틴 동족체 단백질(SEQ ID NO:106)(Neame and Boynton, database accession number PATCHX:u22298)이 밑줄로 표시된다. 헵타드(heptad) 반복부 내의 a 및 d 위치의 잔기는 굵은 글씨로 표시된다. 테트라넥틴 단백질 패밀리 삼합체화 구조 구성요소의 나열된 콘센서스 서열(SEQ ID NO:108)은 도면에 제시된 헵타드 반복부 내의 a 및 d 위치에 존재하는 잔기 뿐만 아니라 상기 영역의 다른 보존된 잔기를 포함한다. "*"은 지방족 소수성 잔기를 표시한다.3 shows the alignment of amino acid sequences of trimerized structural components of the tetranectin protein family. The amino acid sequence corresponding to residues V17 to K52 comprising exon 2 of human tetranectin (1-letter code) and the first three residues of exon 3 (SEQ ID NO: 99); Murine tetranectin (SEQ ID NO: 100) (Sorensen et al . , Gene, 152: 243-245, 1995); Tetranectin homologue protein (SEQ ID NO: 107) isolated from reefshark cartilage (Neame and Boynton, 1992, 1996); And tetranectin homologue protein (SEQ ID NO: 106) (Neame and Boynton, database accession number PATCHX: u22298) isolated from bovine cartilage. The residues at positions a and d in the heptad repeat are indicated in bold. The listed consensus sequence (SEQ ID NO: 108) of the tetranectin protein family trimerization structural component includes residues present at positions a and d in the heptad repeats shown in the figures, as well as other conserved residues in that region. . "*" Represents an aliphatic hydrophobic residue.

도 4는 공지된 3D-구조의 10개의 CTLD의 아미노산 서열의 정렬을 도시한다. 주요 이차 구조 구성요소의 서열 위치가 상기 각각의 서열에 표시되며, 이는 α-헬릭스 수 N을 표시하는 "αN", 및 β-가닥 수 M을 표시하는 "βM"으로 연속적 수의 순서로 표지된다. CTLD의 2개의 보존된 이황화 브릿지(disulfide bridge)의 형성과 관련된 4개의 시스테인 잔기가 표시되고, 도면에서 "CⅠ", "CⅡ", "CⅢ" 및 "CⅣ"로 각각 열거된다. 2개의 보존된 이황화 브릿지는 각각 CⅠ-CⅣ 및 CⅡ-CⅢ이다. 인간 테트라넥틴 서열 내의 다양한 루프 1-4 및 LSB(루프 5)가 밑줄로 표시된다. 10개의 C-타입 렉틴은, hTN: 인간 테트라넥틴(SEQ ID NO:109), MBP: 만노오스 결합 단백질(SEQ ID NO:110); SP-D: 표면활성제 단백질 D(SEQ ID NO:111); LY49A: NK 수용체 LY49A(SEQ ID NO:112); H1-ASR: 아시알로당단백질 수용체의 H1 서브유닛(SEQ ID NO:113); MMR-4: 대식세포 만노오스 수용체 도메인 4(SEQ ID NO:114); Ⅸ-A(SEQ ID NO:115) 및 Ⅸ-B(SEQ ID NO:116): 응고 인자 Ⅸ/Ⅹ-결합 단백질 도메인 A 및 B 각각; Lit: 리토스타틴(SEQ ID NO:117); TU14: 피막 C-타입 렉틴(SEQ ID NO:118)이다. TU14를 제외한 상기 모든 CTLD는 인간 단백질로부터 유래된다.4 shows the alignment of amino acid sequences of ten CTLDs of known 3D-structure. The sequence positions of the major secondary structural components are indicated in each of the above sequences, which are labeled in successive numbered order with "αN" for α-helix number N and "βM" for β-stranded number M. . Four cysteine residues associated with the formation of two conserved disulfide bridges of CTLD are indicated and listed in the figures as "CI", "CII", "CIII" and "CIV", respectively. The two conserved disulfide bridges are CI-CIV and CII-CIII, respectively. Various loops 1-4 and LSB (loop 5) in the human tetranectin sequence are underlined. Ten C-type lectins include hTN: human tetranectin (SEQ ID NO: 109), MBP: mannose binding protein (SEQ ID NO: 110); SP-D: surfactant protein D (SEQ ID NO: 111); LY49A: NK receptor LY49A (SEQ ID NO: 112); H1-ASR: H1 subunit of asialo glycoprotein receptor (SEQ ID NO: 113); MMR-4: macrophage mannose receptor domain 4 (SEQ ID NO: 114); VIII-A (SEQ ID NO: 115) and VIII-B (SEQ ID NO: 116): coagulation factor VIII / VIII-binding protein domains A and B, respectively; Lit: ristostatin (SEQ ID NO: 117); TU14: Encapsulated C-type lectin (SEQ ID NO: 118). All of the above CTLDs except TU14 are derived from human proteins.

도 5는 인간(Swissprot P05452)(SEQ ID NO:119), 마우스(Swissprot P43025)(SEQ ID NO:120), 닭(Swissprot Q9DDD4)(SEQ ID NO:121), 소(Swissprot Q2KIS7)(SEQ ID NO:122), 대서양 연어(Swissprot B5XCV4)(SEQ ID NO:123), 개구리(Swissprot Q5I0R9)(SEQ ID NO:124), 제브라피쉬(zebrafish)(GenBank XP_701303)(SEQ ID NO:125)로부터 분리된 테트라넥틴의 아미노산 서열, 및 소(Swissprot u22298)(SEQ ID NO:126) 및 립상어(Swissprot p26258)(SEQ ID NO:127)의 연골로부터 분리된 관련 CTLD 동족체의 정렬을 도시한다.5 shows a human (Swissprot P05452) (SEQ ID NO: 119), a mouse (Swissprot P43025) (SEQ ID NO: 120), a chicken (Swissprot Q9DDD4) (SEQ ID NO: 121), a cow (Swissprot Q2KIS7) (SEQ ID NO: 122), Atlantic Salmon (Swissprot B5XCV4) (SEQ ID NO: 123), Frog (Swissprot Q5I0R9) (SEQ ID NO: 124), Zebrafish (GenBank XP_701303) (SEQ ID NO: 125) Amino acid sequence of tetranectin and related CTLD homologues isolated from cartilage of bovine (Swissprot u22298) (SEQ ID NO: 126) and shark (Swissprot p26258) (SEQ ID NO: 127).

도 6은 CTLD에서 무작위화된 루프를 생성시키기 위한 PCR 방법을 도시한다.6 shows a PCR method for generating randomized loops in CTLD.

도 7은 Ca2+ 결합 부위를 나타내는 클로닝을 위한 제한 부위를 함유하도록 변형된 인간 테트라넥틴 CTLD의 DNA 및 아미노산 서열을 도시한다. 제한 부위는 실선 밑줄로 표시된다. 루프는 점선 밑줄로 표시된다. 칼슘 협동 잔기(calcium coordinating residue)는 굵은 이탤릭체로 표시되고, 이는 부위 1: D116, E120, G147, E150, N151; 부위 2: Q143, D145, E150, D165를 포함한다. CTLD 도메인은 굵은 글씨의 아미노산 A45에서 시작한다(즉, ALQTVCL...). 자연 테트라넥틴(TNCTLD) 염기 서열로의 변화는 소문자로 제시된다. 제한 부위는 자연 아미노산 서열을 변경시키지 않는 침묵 돌연변이(silent mutation)를 이용하여 생성되었다.FIG. 7 shows the DNA and amino acid sequences of human tetranectin CTLD modified to contain restriction sites for cloning showing Ca2 + binding sites. Restriction sites are indicated by underlined solid lines. The loop is indicated by a dashed underline. Calcium coordinating residues are indicated in bold italics, which are site 1: D 116, E 120, G 147, E 150, N 151; Site 2: includes Q 143, D 145, E 150, D 165. The CTLD domain starts at amino acid A45 in bold (ie ALQTVCL ...). Changes to the native tetranectin (TNCTLD) nucleotide sequence are shown in lowercase letters. Restriction sites were generated using silent mutations that do not alter the natural amino acid sequence.

도 8은 IL-23R에 결합하는 본 발명의 폴리펩티드의 다수의 서열을 도시한다. 상기 서열은 하나 이상의 루프 영역에서 변형된 인간 테트라넥틴 CTLD의 스캐폴드 구조를 갖는 폴리펩티드의 라이브러리로부터 폴리펩티드를 선택함으로써 본 발명의 방법에 따라 생성되었다. 이러한 서열의 CTLD 스캐폴드는 인간 테트라넥틴(SEQ ID NO:119)의 A45에서 시작한다. 무작위화된 루프 영역을 나타내는 서열의 부분은 밑줄로 표시된다.8 depicts a number of sequences of polypeptides of the invention that bind to IL-23R. The sequence was generated according to the method of the present invention by selecting a polypeptide from a library of polypeptides having a scaffold structure of human tetranectin CTLD modified in one or more loop regions. The CTLD scaffold of this sequence starts at A45 of human tetranectin (SEQ ID NO: 119). Portions of the sequence representing the randomized loop region are underlined.

도 9는 삼합체화 도메인에서 시작하는 인간(SEQ ID NOS:143[누클레오티드 서열] 및 142[아미노산 서열]) 및 뮤린 테트라넥틴(SEQ ID NOS:144[누클레오티드 서열] 및 145[아미노산 서열])의 성숙 형태의 코딩 영역의 누클레오티드 및 아미노산 서열의 정렬을 도시하며, 공지된 이차 구조 구성요소가 표시되어 있다.9 shows maturation of humans starting in the trimerization domain (SEQ ID NOS: 143 [nucleotide sequence] and 142 [amino acid sequence]) and murine tetranectin (SEQ ID NOS: 144 [nucleotide sequence] and 145 [amino acid sequence]). The alignment of the nucleotide and amino acid sequences of the coding region of the form is shown, and known secondary structural components are indicated.

도 10은 경쟁 ELISA의 결과를 도시한다. 본 발명의 폴리펩티드의 존재 또는 부재하에서의 인간 IL-23R에 대한 인간 IL-23의 결합이 평가되었다.10 shows the results of a competitive ELISA. Binding of human IL-23 to human IL-23R in the presence or absence of a polypeptide of the invention was evaluated.

도 11은 본 발명의 아트리머™(ATRIMER™) 복합체 4G8, 자연 인간 IL-23, 및 우스테키누맙(Ustekinumab)의 존재하에서의 IL-23-유도성 IL-17 생성을 비교하는 실험의 결과를 도시한다.FIG. 11 shows the results of an experiment comparing IL-23-induced IL-17 production in the presence of the ATRIMER ™ complex 4G8, natural human IL-23, and Ustekinumab of the present invention. Illustrated.

도 12는 본 발명의 아트리머™ 복합체 1A4 및 우스테키누맙의 존재하에서의 IL-23-유도성 IL-17 생성을 비교하는 실험의 결과를 도시한다.FIG. 12 shows the results of an experiment comparing IL-23-induced IL-17 production in the presence of the Atrimer ™ Complex 1A4 and Ustekinumab of the present invention.

도 13은 본 발명의 아트리머™ 복합체 4G8, 자연 인간 IL-23, 및 우스테키누맙의 존재하에서의 IL-12-유도성 IFNγ 생성을 비교하는 실험의 결과를 도시한다.FIG. 13 shows the results of an experiment comparing IL-12-induced IFNγ production in the presence of the Atrimer ™ Complex 4G8, natural human IL-23, and Ustekinumab of the present invention.

도 14는 IL-23 및 본 발명의 폴리펩티드에 대한 반응에서의 NKL 세포에서 Stat-3 인산화를 비교하는 실험의 결과를 도시한다.14 shows the results of experiments comparing Stat-3 phosphorylation in NKL cells in response to IL-23 and polypeptides of the invention.

도 15는 본 발명의 여러 아트리머™ 폴리펩티드 복합체와 관련된 실험 결과를 나타내는 표이다.FIG. 15 is a table showing experimental results associated with various Atrimer ™ polypeptide complexes of the present invention. FIG.

도 16은 단백질의 이차 구조 특징을 나타내는 인간 테트라넥틴에 대한 3차원 구조(리본 포맷)를 도시한다. 구조는 Ca2+-결합된 형태로 설명되었다.FIG. 16 shows the three-dimensional structure (ribbon format) for human tetranectin, which represents the secondary structural characteristics of the protein. The structure has been described in Ca 2+ -bonded form.

도 17A는 인간 만노오스 결합 단백질(MBP), 래트 만노오스 결합 단백질-C(MBP-C), 인간 표면활성제 단백질 D, 래트 만노오스 결합 단백질-A(MBP-A), 및 래트 표면활성제 단백질 A를 포함하는 인간 테트라넥틴(HTN) 및 여러 테트라넥틴 동족체에 대한 CTLD의 3차원 중첩구조를 도시한다. CTLD 중첩 구조는 주형으로 인간 테트라넥틴의 3차원 구조를 이용하여 매킨토시(Macintosh)에 대한 Swiss PDB Viewer Deep View v.4.0.1을 이용하여 생성시켰다. 도 17B는 도 17A에 도시된 인간 테트라넥틴 및 테트라넥틴 동족체에 대한 CTLDS의 해당 아미노산 서열을 도시한다. 도 17B에서, 1HUP는 인간 만노오스 결합 단백질이고, 1BV4A는 래트 만노오스 결합 단백질이고, 2GGUA는 인간 표면활성제 단백질 D이고, 1KX0A는 래트 만노오스 결합 단백질 A이고, 1R13은 래트 표면활성제 단백질 A이다.17A includes human mannose binding protein (MBP), rat mannose binding protein-C (MBP-C), human surfactant protein D, rat mannose binding protein-A (MBP-A), and rat surfactant protein A. Three-dimensional superposition of CTLD for human tetranectin (HTN) and various tetranectin homologues is shown. CTLD superimposition structures were generated using Swiss PDB Viewer Deep View v.4.0.1 for Macintosh using the three-dimensional structure of human tetranectin as a template. FIG. 17B shows the corresponding amino acid sequence of CTLDS for the human tetranectin and tetranectin analogs shown in FIG. 17A. In FIG. 17B, 1HUP is human mannose binding protein, 1BV4A is rat mannose binding protein, 2GGUA is human surfactant protein D, 1KX0A is rat mannose binding protein A, and 1R13 is rat surfactant protein A.

도 18A는 인간 췌장염-관련 단백질, 인간 수지상세포-특이적 ICAM-3-그래빙(grabbing) 비-인테그린 2(DC-SIGNR), 래트 아그레칸(aggrecan), 마우스 스캐빈저(scavenger) 수용체, 및 인간 스캐빈저 수용체를 포함하는 인간 테트라넥틴(HTN) 및 여러 테트라넥틴 동족체에 대한 CTLD의 3차원 중첩 구조를 도시한다. CTLD 중첩 구조는 주형으로 인간 테트라넥틴의 3차원 구조를 이용하여 매킨토시(Macintosh)에 대한 Swiss PDB Viewer Deep View v.4.0.1을 이용하여 생성시켰다. 도 18B는 도 18A에 도시된 인간 테트라넥틴 및 테트라넥틴 동족체에 대한 CTLDS의 해당 아미노산 서열을 도시한다. 도 18B에서, 1TDQB는 래트 아그레칸이고, 1UV0A는 인간 췌장염-관련 단백질이고, 20X8A는 인간 스캐빈저 수용체이고, 20X9A는 마우스 스캐빈저 수용체이고, 1SL6A는 인간 DC-SIGNR이다.18A shows human pancreatitis-related protein, human dendritic cell-specific ICAM-3-grabbing non-integrin 2 (DC-SIGNR), rat agrecan, mouse scavenger receptor And three-dimensional overlapping structures of CTLDs for human tetranectin (HTN) and various tetranectin analogs, including human scavenger receptors. CTLD superimposition structures were generated using Swiss PDB Viewer Deep View v.4.0.1 for Macintosh using the three-dimensional structure of human tetranectin as a template. FIG. 18B shows the corresponding amino acid sequence of CTLDS for the human tetranectin and tetranectin analogs shown in FIG. 18A. In FIG. 18B, 1TDQB is rat aggrecan, 1UV0A is a human pancreatitis-related protein, 20X8A is a human scavenger receptor, 20X9A is a mouse scavenger receptor, and 1SL6A is a human DC-SIGNR.

본 발명의 상세한 설명DETAILED DESCRIPTION OF THE INVENTION

다양한 양태에서, 본 발명은 IL-23R에 결합하고, 다합체화 도메인의 폴리펩티드 서열 및 IL-23R에 결합하는 하나 이상의 폴리펩티드 서열을 포함하는 폴리펩티드에 관한 것이다. 일 양태에서, 본 발명의 폴리펩티드는 IL-23R 길항제로 작용한다. 2, 3 또는 그 초과의 폴리펩티드가 다합체화되어, IL-23R에 결합하는 폴리펩티드를 포함하는 다합체 복합체를 형성할 수 있다. 한 대안적 구체예에서, 상기 폴리펩티드는 IL-23R에 결합하지만, IL-12β1 또는 IL-12β2에는 결합하지 않는다. 또한, 본 발명은 면역 매개 장애, 암 및 기타 질병을 치료할 필요가 있는 환자에게 폴리펩티드 또는 폴리펩티드의 다합체 복합체를 투여함으로써 대상체에서 면역 매개 장애, 암 및 기타 질병을 치료하는 방법을 제공한다.In various aspects, the invention relates to a polypeptide that binds to IL-23R and comprises a polypeptide sequence of a multimerization domain and one or more polypeptide sequences that bind to IL-23R. In one embodiment, the polypeptides of the invention act as IL-23R antagonists. Two, three or more polypeptides can be multimerized to form a multimer complex comprising a polypeptide that binds IL-23R. In one alternative embodiment, the polypeptide binds to IL-23R but not to IL-12β1 or IL-12β2. The invention also provides a method of treating an immune mediated disorder, cancer and other diseases in a subject by administering the polypeptide or a multimeric complex of polypeptides to a patient in need thereof.

정의Justice

본 발명을 더욱 상세히 정의하기 전, 다수의 용어가 정의된다. 용어에 대한 특정 정의가 본원에 제공되지 않는 한, 본 명세서 전체에 걸쳐 사용되는 용어 및 구는 당 분야에서 통상적으로 이해되는 것과 같은 의미를 갖는 것으로 해석되어야 한다. 또한, 본 발명의 명세서 및 첨부된 청구항에서 사용되는 바와 같은 단수 형태는 문맥이 달리 명백히 기재하지 않는 한 복수의 지시 대상을 포함한다.Before defining the invention in more detail, a number of terms are defined. Unless specific definitions of terms are provided herein, the terms and phrases used throughout this specification should be interpreted to have the same meaning as commonly understood in the art. Also, the singular forms as used in the specification and the appended claims of the present invention include plural referents unless the context clearly dictates otherwise.

"IL-23"은 선천 면역 및 적응 면역으로 작용하는 사이토카인이며, 이는 IL-6 상과에 속하는 이종이합체 단백질 복합체를 의미한다. 이종이합체 복합체는 활성화된 수지상 세포 및 포식 세포 및 각질세포에 의해 분비된다. IL-23은 또한 피부 랑게르한스 세포에 의해 발현된다. IL-B30으로도 공지된 IL-23A, p19 서브유닛, 또는 단순히 "p19"는 IL-12B, p40 서브유닛과 회합되어 IL-23(p19/p40)이 형성된다. IL-23A(p19)(SEQ ID NO:1) 및 IL-12B(SEQ ID NO:2)의 아미노산 서열이 도 1에 제시된다."IL-23" is a cytokine that acts as an innate and adaptive immunity, meaning a heterodimeric protein complex belonging to the IL-6 superfamily. Heterodimeric complexes are secreted by activated dendritic cells and phagocytes and keratinocytes. IL-23 is also expressed by cutaneous Langerhans cells. IL-23A, p19 subunit, also known as IL-B30, or simply “p19” is associated with IL-12B, p40 subunit to form IL-23 (p19 / p40). The amino acid sequences of IL-23A (p19) (SEQ ID NO: 1) and IL-12B (SEQ ID NO: 2) are shown in FIG. 1.

IL-23은 위험 또는 손상에 대한 자가-신호와 함께 매우 다양한 병원체 및 병원체-생성물에 의해 상향 조절된다. IL-23은 건선 피부 조직, 다발경화증 환자의 수지상 세포에서 상향 조절되고, IL-23은 종양 발병률 및 성장을 촉진하는데 있어서 활성인 것으로 널리 밝혀졌다. 또한, IL-23은 호중구 및 대식세포 침윤을 자극할 뿐만 아니라, 종양 미세환경에서의 혈관형성 및 염증 매개체를 촉진한다. IL-23은 IL-12 및 인터페론 γ의 하향 조절을 발생시킬 수 있으며, 상기 IL-12 및 인터페론 γ 둘 모두는 세포독성 면역 반응에 대한 필수 사이토카인이며, 이는 항-종양 효과기 림프구의 유입 및 활성을 조절한다. IL-23이 항-종양 면역과 동떨어지고, 종양에 영양을 공급하는 염증유발(proinflammatory) 및 신생혈관유발(proangiogenic) 효과기 경로와 가까운 적응 세포독성 효과기 반응의 리퍼포징(repurposing)에 영향을 주는 것으로 제안되었다. 결과로서, IL-23은 종양-관련 염증을 동반하는 인지된 종양 세포의 지속을 가능케 한다. 이러한 개념은 많은 양의 종양-특이적 T 세포의 존재하에서의 종양 성장을 설명할 수 있다.IL-23 is upregulated by a wide variety of pathogens and pathogen-products with self-signaling for risk or damage. IL-23 is upregulated in psoriasis skin tissue, dendritic cells of multiple sclerosis patients, and IL-23 has been found to be active in promoting tumor incidence and growth. In addition, IL-23 stimulates neutrophils and macrophage infiltration, as well as promotes angiogenesis and inflammatory mediators in the tumor microenvironment. IL-23 can cause down regulation of IL-12 and interferon γ, both of which are essential cytokines for cytotoxic immune responses, which are influx and activity of anti-tumor effector lymphocytes Adjust. IL-23 is indistinguishable from anti-tumor immunity and affects repurposing of adaptive cytotoxic effector responses close to the proinflammatory and proangiogenic effector pathways that feed tumors Proposed. As a result, IL-23 enables the persistence of recognized tumor cells with tumor-associated inflammation. This concept may explain tumor growth in the presence of large amounts of tumor-specific T cells.

용어 "IL-23 이종이합체 수용체"는 IL-23R 및 IL-12Rβ1의 이종이합체 폴리펩티드 복합체를 의미한다. 이러한 수용체는 IL-23에 결합한다. IL-23R 및 IL-12Rβ1의 폴리펩티드 서열은 도 1에 제시된다.The term “IL-23 heterodimeric receptor” refers to a heterodimeric polypeptide complex of IL-23R and IL-12Rβ1. This receptor binds to IL-23. Polypeptide sequences of IL-23R and IL-12Rβ1 are shown in FIG. 1.

용어 "IL-23R"은 IL-12Rβ1와 복합체화되어 IL-23 이종이합체 수용체를 형성할 수 있는 폴리펩티드를 의미한다. IL-23R은 IL-23R 서브유닛으로도 언급된다.The term "IL-23R" refers to a polypeptide capable of complexing with IL-12Rβ1 to form an IL-23 heterodimeric receptor. IL-23R is also referred to as IL-23R subunit.

용어 "IL-12Rβ1"은 IL-23R과 복합체화되어 IL-23 이종삼합체 수용체를 형성하고, 별개로 및 독립적으로 IL-12Rβ2와 복합체화되어 이종이합체 IL-12 수용체를 형성하는 폴리펩티드를 의미한다. IL-12Rβ1 및 IL-12Rβ2의 폴리펩티드 서열은 도 1에 제시된다.The term “IL-12Rβ1” refers to a polypeptide complexed with IL-23R to form an IL-23 heterotrimeric receptor and separately and independently complexed with IL-12Rβ2 to form a heterodimeric IL-12 receptor. Polypeptide sequences of IL-12Rβ1 and IL-12Rβ2 are shown in FIG. 1.

"억제제" 및 "길항제" 또는 "활성제" 및 "효능제"는 각각 억제 또는 활성화 분자를 의미한다. "억제제"는 예를 들어, 유전자, 단백질, 리간드, 수용체, 또는 세포와 관련된 생물학적 기능 또는 활성을 감소시키거나, 차단하거나, 방지하거나, 활성화를 지연시키거나, 비활성화시키거나, 둔감화시키거나, 하향 조절하는 화합물이다. 활성제는, 예를 들어, 유전자, 단백질, 리간드, 수용체, 또는 세포의 생물학적 기능 또는 활성을 증가시키거나, 활성화시키거나, 촉진시키거나, 활성화를 향상시키거나, 민감화시키거나, 상향 조절하는 화합물이다. "효능제"는 표적의 활성화에서 증가를 야기시키거나 촉진하기 위해 표적과 상호작용하는 화합물이다. "길항제"는 효능제의 작용에 대항하는 화합물이다. 길항제는 효능제의 활성을 방지하거나, 감소시키거나, 억제하거나, 중화시킨다. 효능제는 또한 확인된 효능제가 없는 경우에서도 표적, 예를 들어, 표적 수용체의 항시적(constitutive) 활성을 방지하거나, 억제하거나, 감소시킬 수 있다."Inhibitor" and "antagonist" or "active agent" and "agonist" mean an inhibitory or activating molecule, respectively. An "inhibitor" can reduce, block, prevent, retard, inactivate, or desensitize a biological function or activity associated with, for example, a gene, protein, ligand, receptor, or cell, Down-regulating compounds. Active agents are, for example, genes, proteins, ligands, receptors, or compounds that increase, activate, promote, enhance activation, sensitize, or upregulate the biological function or activity of a cell. . An “agonist” is a compound that interacts with a target to cause or promote an increase in activation of the target. An "antagonist" is a compound that counters the action of an agonist. Antagonists prevent, reduce, inhibit or neutralize the activity of agonists. An agonist can also prevent, inhibit or reduce the constitutive activity of a target, eg, a target receptor, even in the absence of an agonist identified.

유전자, 수용체, 리간드, 또는 세포의 "조정자"는 조절 특성에서 활성이 활성화되거나, 억제되거나, 변경될 수 있는 경우에 유전자, 수용체, 리간드, 또는 세포의 활성을 변경시키는 분자이다. 조정자는 단독으로 작용할 수 있거나, 이는 보조인자, 예를 들어, 단백질, 금속 이온, 또는 소분자를 사용할 수 있다.A "modulator" of a gene, receptor, ligand, or cell is a molecule that alters the activity of a gene, receptor, ligand, or cell when activity in regulatory properties can be activated, inhibited or altered. Modulators may act alone or may use cofactors such as proteins, metal ions, or small molecules.

용어 "IL-23R 길항제"는 단독으로 또는 IL-12Rβ1와 복합체화되어 IL-23R에 결합하고, 결합하는 IL-23의 능력을 차단하거나, 수용체 이종이합체 형성을 차단하거나, 형태 변화 또는 수용체 내재화를 포함하는 세포내 신호전달에 영향을 주는 변화를 차단하거나 유도할 수 있는 다양한 메커니즘을 통해 수용체 신호전달을 차단하거나, 이를 감소(dampen)시키는 임의의 분자를 의미한다.The term “IL-23R antagonist” alone or in combination with IL-12Rβ1 blocks IL-23R's ability to bind and bind IL-23, blocks receptor heterodimer formation, changes in form or receptor internalization. By any molecule that blocks or attenuates receptor signaling through a variety of mechanisms that can block or induce changes that affect intracellular signaling, including.

본원에서 사용되는 용어 "결합 일원"은 서로에 대해 결합 특이성을 갖는 분자쌍의 일원을 의미한다. 결합쌍의 일원은 자연 유래되거나, 전체적 또는 부분적으로 합성적으로 생성될 수 있다. 분자쌍의 하나의 일원은 분자쌍의 다른 일원의 특정 공간적 및 극성 구성에 결합하고, 따라서 이에 상보적인 표면 상의 영역, 또는 공동을 갖는다. 따라서, 상기 쌍의 일원은 서로에 대해 특이적인 결합의 특성을 갖는다.As used herein, the term “binding member” refers to a member of a molecular pair having binding specificity for each other. Members of a binding pair may be naturally derived or synthetically produced in whole or in part. One member of a molecular pair binds to a particular spatial and polar configuration of the other member of the molecular pair and thus has regions or cavities on the surface that are complementary thereto. Thus, members of the pair have the property of binding specific to each other.

리간드/수용체, 항체/항원, 또는 다른 결합쌍에서 언급되는 경우 "특이적" 또는 "선택적" 결합은 결합쌍의 또 다른 일원의 이종성 집합에서 결합쌍의 일원의 존재를 결정하는 결합 반응을 나타낸다. 따라서, 지정된 조건하에서, 예를 들어, 특정된 리간드는 특정 수용체에 결합하고, 샘플에 존재하는 다른 단백질에 유의한 양으로 결합하지 않는다.When referred to in a ligand / receptor, antibody / antigen, or other binding pair, “specific” or “selective” binding refers to a binding reaction that determines the presence of a member of a binding pair in the heterologous set of another member of the binding pair. Thus, under specified conditions, for example, the specified ligand binds to a specific receptor and does not bind to a significant amount of other proteins present in the sample.

본원에서 사용되는 용어 "다합체화 도메인"은 다합체화 도메인을 갖는 다른 아미노산 서열(들)과 회합되어 다합체 복합체를 형성할 수 있는 기능성(functionality)을 포함하는 아미노산 서열을 의미한다. 본 발명의 다양한 구체예에서, 다합체화 도메인은 이합체화 도메인, 삼합체화 도메인, 사합체화 도메인, 오합체화 도메인 등이다. 이러한 도메인은 본 발명의 2, 3, 4, 5개 또는 그 초과의 폴리펩티드의 폴리펩티드 복합체를 형성할 수 있다. 일 예에서, 폴리펩티드는 2개의 다른 삼합체화 도메인과 함께 삼합체 복합체를 형성하는 "삼합체화 도메인"의 아미노산 서열을 함유한다. 삼합체화 도메인은 동일한 아미노산 서열의 다른 삼합체화 도메인(동종삼합체), 또는 상이한 아미노산 서열의 삼합체화 도메인(이종삼합체)과 회합될 수 있다. 이러한 상호작용은 삼합체화 도메인의 구성요소 사이의 공유 결합 뿐만 아니라 수소 결합력, 소수성 결합력, 반데르발스 결합력 및 염 브릿지(salt bridge)에 의해 야기될 수 있다.As used herein, the term “multimerization domain” refers to an amino acid sequence that includes functionality that can be associated with other amino acid sequence (s) having a multimerization domain to form a multimer complex. In various embodiments of the present invention, the multimerization domain is a dimerization domain, trimerization domain, tetramerization domain, pentomerization domain, and the like. Such domains may form polypeptide complexes of two, three, four, five or more polypeptides of the invention. In one embodiment, the polypeptide contains an amino acid sequence of a “trimerization domain” that forms a trimeric complex with two other trimerization domains. The trimerization domain can be associated with another trimerization domain (homotrimer) of the same amino acid sequence, or with the trimerization domain (heterotrimer) of different amino acid sequence. This interaction can be caused by covalent bonds between components of the trimerization domain as well as by hydrogen bonding, hydrophobic bonding, van der Waals bonding and salt bridges.

본 발명의 폴리펩티드의 삼합체화 도메인은 전체내용이 참조로서 포함되는 미국 특허 출원 공개공보 2007/0154901호('901 출원)에 기재된 바와 같이 테트라넥틴으로부터 유래될 수 있다. 성숙 인간 테트라넥틴 단쇄 폴리펩티드 서열이 SEQ ID NO:142로 본원에 제공된다. 테트라넥틴 삼합체화 도메인의 예는 인간 테트라넥틴 유전자의 엑손 2에 의해 엔코딩되는 아미노산인 SEQ ID NO:99의 아미노산 17 내지 49, 17 내지 50, 17 내지 51 및 17-52, 및 임의로 유전자의 엑손 3에 의해 엔코딩되는 처음 1, 2 또는 3개의 아미노산을 포함한다. 다른 예는 엑손 1 및 2 모두인 아미노산 1 내지 49, 1 내지 50, 1 내지 51 및 1 내지 52, 및 임의로 유전자의 엑손 3에 의해 엔코딩되는 처음 1, 2 또는 3개의 아미노산을 포함한다. 대안적으로, 엑손 1에 의해 엔코딩되는 아미노산 서열의 단지 일부가 삼합체화 도메인에 포함된다. 특히, 삼합체화 도메인의 N-말단은 SEQ ID NO:99의 잔기 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 및 17 중 임의의 잔기에서 시작할 수 있다. 특정 구체예에서, N-말단은 I10 또는 V17이고, C-말단은 Q47, T48, V49, C(S)50, L51 또는 K52(SEQ ID NO:99에 따른 넘버링)이다. 또한, 도 2A-2D는 인간 테트라넥틴 삼합체화 도메인의 다수의 잠재적 트렁케이션 변이체를 제공한다.The trimerization domain of the polypeptides of the invention can be derived from tetranectin as described in US Patent Application Publication No. 2007/0154901 (the '901 application), which is incorporated by reference in its entirety. The mature human tetranectin single chain polypeptide sequence is provided herein as SEQ ID NO: 142. Examples of tetranectin trimerization domains include amino acids 17-49, 17-50, 17-51, and 17-52 of SEQ ID NO: 99, the amino acids encoded by exon 2 of the human tetranectin gene, and optionally exon 3 of the gene. Contains the first 1, 2 or 3 amino acids encoded by. Other examples include amino acids 1 to 49, 1 to 50, 1 to 51 and 1 to 52, which are both exons 1 and 2, and optionally the first 1, 2 or 3 amino acids encoded by exon 3 of the gene. Alternatively, only a portion of the amino acid sequence encoded by exon 1 is included in the trimerization domain. In particular, the N-terminus of the trimerization domain is residues 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 and 17 of SEQ ID NO: 99. May begin at any residue of In certain embodiments, the N-terminus is I10 or V17 and the C-terminus is Q47, T48, V49, C (S) 50, L51 or K52 (numbering according to SEQ ID NO: 99). 2A-2D also provide a number of potential truncation variants of the human tetranectin trimerization domain.

본 발명의 일 양태에서, 삼합체화 도메인은 전체내용이 참조로서 본원에 포함되는 US 2007/00154901호에 더욱 충분히 기재되는 바와 같은 테트라넥틴 패밀리 삼합체화 구조 구성요소의 콘센서스 서열인 SEQ ID NO:108의 아미노산 서열을 갖는 테트라넥틴 삼합체화 구조 구성요소("TTSE")이다. 도 3에 도시된 바와 같이, TTSE는 단백질의 테트라넥틴 패밀리의 자연 발생 일원의 변이체, 특히 알파 헬리칼 코일드 코일(alpha helical coiled coil) 삼합체를 형성하는 TTSE의 능력에 임의의 실질적인 정도로 불리한 영향을 미치지 않으면서 아미노산 서열 내에서 변형된 변이체를 포함한다. 본 발명의 다양한 양태에서, 본 발명에 따른 삼합체 폴리펩티드는 SEQ ID NO:108의 콘센서스 서열과 적어도 66%의 아미노산 서열 동일성, 예를 들어, SEQ ID NO:108의 콘센서스 서열과 적어도 73%, 적어도 80%, 적어도 86% 또는 적어도 92%의 서열 동일성(규정된(X는 아님) 잔기만 계산)을 갖는 삼합체화 도메인으로서 TTSE를 포함한다. 즉, SEQ ID NO:108의 규정된 아미노산의 적어도 1개, 적어도 2개, 적어도 3개, 적어도 4개, 또는 적어도 5개가 치환될 수 있다.In one aspect of the invention, the trimerization domain is SEQ ID NO: 108 which is the consensus sequence of the tetranectin family trimerization structural component as described more fully in US 2007/00154901, which is incorporated herein by reference in its entirety. Tetranectin trimerization structural component ("TTSE") having the amino acid sequence of. As shown in FIG. 3, TTSE has any substantial degree of adverse effect on the ability of TTSE to form variants of naturally occurring members of the tetranectin family of proteins, especially alpha helical coiled coil trimers. Variants modified within the amino acid sequence without affecting. In various embodiments of the invention, the trimeric polypeptide according to the invention has at least 66% amino acid sequence identity with the consensus sequence of SEQ ID NO: 108, eg, at least 73% with the consensus sequence of SEQ ID NO: 108. And TTSE as a trimerization domain having sequence identity (calculating only defined (but not X) residues) of at least 80%, at least 86% or at least 92%. That is, at least one, at least two, at least three, at least four, or at least five of the defined amino acids of SEQ ID NO: 108 may be substituted.

한 특정 구체예에서, SEQ ID NO:142의 위치 50의 시스테인(C50)은 원치 않는 다합체화를 발생시킬 수 있는 원치 않는 사슬간 이황화 브릿지의 형성을 피하기 위해 세린, 트레오닌, 메티오닌 또는 임의의 다른 아미노산 잔기로 유리하게 돌연변이화될 수 있다. 다른 공지된 변이체는 임의의 비-헬릭스를 파괴하는 아미노산 잔기에 의해 치환될 수 있는 아미노산 잔기 번호 6, 21, 22, 24, 25, 27, 28, 31, 32, 35, 39, 41, 및 42(SEQ ID NO:142에 따른 넘버링)로부터 선택된 적어도 하나의 아미노산 잔기를 포함한다. 이러한 잔기는 자연 테트라넥틴 단량체의 3개의 TTSE 사이의 삼합체 복합체를 안정화시키는 분자간 상호작용과 직접적으로 관련된 것으로 밝혀지지는 않았다. 도 3에 도시된 일 양태에서, TTSE는 식 a-b-c-d-e-f-g(N 내지 C)를 갖는 반복된 헵타드를 가지며, 여기서 잔기 a 및 d(즉, 위치 26, 30, 33, 37, 40, 44, 47, 및 51)는 임의의 소수성 아미노산일 수 있다(SEQ ID NO:99에 따른 넘버링).In one specific embodiment, the cysteine (C50) at position 50 of SEQ ID NO: 142 is a serine, threonine, methionine or any other amino acid to avoid the formation of unwanted interchain disulfide bridges that may result in unwanted multimerization. It may be advantageously mutated with residues. Other known variants are amino acid residue numbers 6, 21, 22, 24, 25, 27, 28, 31, 32, 35, 39, 41, and 42 that may be substituted by amino acid residues that destroy any non-helix. At least one amino acid residue selected from (numbering according to SEQ ID NO: 142). These residues have not been found to be directly related to the intermolecular interactions that stabilize the trimer complex between the three TTSEs of the native tetranectin monomer. In one embodiment shown in FIG. 3, the TTSE has a repeated heptad having the formula abcdefg (N to C), wherein residues a and d (ie, positions 26, 30, 33, 37, 40, 44, 47, And 51) can be any hydrophobic amino acid (numbering according to SEQ ID NO: 99).

추가 구체예에서, TTSE 삼합체화 도메인은 폴리히스티딘 서열 및/또는 프로테아제 분해 부위, 예를 들어, 혈액 응고 인자 Ⅹa 또는 그랜자임 B(Granzyme B)의 통합(참조로서 본원에 포함되는 US 2005/0199251호 참조), 및 C-말단 KG 또는 KGS 서열의 포함에 의해 변형될 수 있다. 또한, 정제를 돕기 위해, 위치 2의 프롤린은 글리신으로 치환될 수 있다.In further embodiments, the TTSE trimerization domain comprises a polyhistidine sequence and / or integration of a protease cleavage site, such as blood coagulation factor VIIa or Granzyme B (US 2005/0199251, incorporated herein by reference). And C-terminal KG or KGS sequences. In addition, to aid purification, the proline at position 2 may be substituted with glycine.

TTSE 트렁케이션 및 변이체의 특정 비제한적 예가 도 2A-2D에 제시된다. 또한, 인간 테트라넥틴의 삼합체화 도메인과 실질적인 상동성(66% 초과)을 갖는 다수의 삼합체화 도메인이 공지되어 있다:Certain non-limiting examples of TTSE truncations and variants are shown in FIGS. 2A-2D. In addition, a number of trimerization domains are known that have substantial homology (greater than 66%) with the trimerization domain of human tetranectin:

표 1Table 1

Figure pct00001
Figure pct00001

Figure pct00002
Figure pct00002

삼합체화되는 것으로 공지된 다른 인간 폴리펩티드는 하기를 포함한다:Other human polypeptides known to be trimerized include:

Figure pct00003
Figure pct00003

삼합체화 도메인의 또 다른 예는 콜렉틴(collectin) 넥 영역을 포함하는 폴리펩티드를 기재하는 US 6,190,886호(전체내용이 본원에 참조로서 포함됨)에 개시되어 있다. 이후, 삼합체는 콜렉틴 넥 영역 아미노산 서열을 포함하는 3개의 폴리펩티드를 이용하여 적절한 조건하에서 제조될 수 있다. 하기를 포함하는 다수의 콜렉틴이 확인된다:Another example of the trimerization domain is disclosed in US Pat. No. 6,190,886, which is incorporated herein by reference in its entirety. Trimers can then be prepared under appropriate conditions using three polypeptides comprising the collector neck region amino acid sequence. A number of collectors have been identified, including:

인간 SP-D의 콜렉틴 넥 영역:Collector's neck area of human SP-D:

Figure pct00004
Figure pct00004

소 SP-D의 콜렉틴 넥 영역:Collector's neck area of bovine SP-D:

Figure pct00005
Figure pct00005

래트 SP-D의 콜렉틴 넥 영역:Collector's neck area of rat SP-D:

Figure pct00006
Figure pct00006

소 콘글루티닌의 콜렉틴 넥 영역:Collector's neck area of bovine conglutinin:

Figure pct00007
Figure pct00007

소 콜렉틴의 콜렉틴 넥 영역:Collector's neck area of bovine collector:

Figure pct00008
Figure pct00008

인간 SP-D의 넥 영역:Neck area of human SP-D:

Figure pct00009
Figure pct00009

MBP 삼합체화 도메인의 다른 예는 전체내용이 참조로서 포함되는 WO 2009/036349호로 공개된 PCT 출원 일련 번호 US08/76266호에 기재되어 있다. 이러한 삼합체화 도메인은 더욱 추가로 올리고화될 수 있고, 보다 높은 체제의 다합체 복합체를 생성시킬 수 있다.Other examples of MBP trimerization domains are described in PCT Application Serial No. US08 / 76266, published as WO 2009/036349, which is incorporated by reference in its entirety. Such trimerization domains can be further oligomerized and create higher regimen multimer complexes.

본 발명의 상황에서, "삼합체화 도메인"은 다른 유사하거나 동일한 삼합체화 도메인과 상호작용할 수 있다. 상기 상호작용은 삼합체 단백질 또는 폴리펩티드를 생성시키는 타입이다. 이러한 상호작용은 삼합체화 도메인의 구성요소 사이의 공유 결합 뿐만 아니라 수소 결합력, 소수성 결합력, 반데르발스 결합력, 및 염 브릿지에 의해 야기될 수 있다. 삼합체화 도메인의 삼합체화 효과는 두개의 다른 삼합체화 도메인의 코일드 코일 구조와 상호작용하여 비교적 높은 온도에서도 안정적인 삼중 알파 헬리칼 코일드 코일 삼합체를 형성하는 코일드 코일 구조에 의해 야기된다. 다양한 구체예에서, 예를 들어, 테트라넥틴 구조 구성요소에 기초한 삼합체화 도메인의 구체예에서, 상기 복합체는 적어도 60℃, 예를 들어, 일부 구체예에서, 적어도 70℃에서 안정적이다.In the context of the present invention, a "trimerization domain" may interact with other similar or identical trimerization domains. This interaction is the type that produces the trimer protein or polypeptide. Such interactions can be caused by hydrogen bonding, hydrophobic bonding, van der Waals bonding, and salt bridges as well as covalent bonding between the components of the trimerization domain. The trimerization effect of the trimerization domain is caused by a coiled coil structure that interacts with the coiled coil structure of two different trimerization domains to form a triple alpha helical coiled coil trimer that is stable even at relatively high temperatures. In various embodiments, for example, in embodiments of trimerization domains based on tetranectin structural components, the complex is stable at least 60 ° C., eg, in some embodiments, at least 70 ° C.

용어 "C-타입 렉틴-유사 단백질" 및 "C-타입 렉틴"은 임의의 진핵생물 종의 유전체에 존재하거나 여기서 엔코딩되는 임의의 단백질을 의미하는데 사용되며, 이러한 단백질은 탄수화물 리간드에 결합하는 하나 이상의 CTLD 또는 CTLD의 서브그룹에 속하는 하나 이상의 도메인을 함유한다. 상기 정의는 특별히 막 부착된 C-타입 렉틴-유사 단백질 및 C-타입 렉틴, "가용성" C-타입 렉틴-유사 단백질 및 기능성 막 도메인이 결핍된 C-타입 렉틴 및 변이체 C-타입 렉틴-유사 단백질 및 하나 이상의 아미노산 잔기가 당화 또는 임의의 다른 합성후 변형에 의해 생체내에서 변경된 C-타입 렉틴, 뿐만 아니라 C-타입 렉틴-유사 단백질 및 C-타입 렉틴의 화학적 변형에 의해 수득되는 임의의 생성물을 포함한다.The terms "C-type lectin-like protein" and "C-type lectin" are used to mean any protein that is present in or encoded in the genome of any eukaryotic species, which protein is one or more that binds to a carbohydrate ligand. Contain one or more domains belonging to a CTLD or a subgroup of CTLD. The above definitions refer to C-type lectin-like proteins and C-type lectin- and mutant C-type lectin-like proteins that are specifically lacking membrane attached C-type lectin-like proteins and C-type lectins, "soluble" C-type lectin-like proteins and functional membrane domains. And any product obtained by chemical modification of C-type lectin-like proteins and C-type lectins, as well as C-type lectins, wherein one or more amino acid residues have been modified in vivo by glycosylation or any other post-synthesis modification. Include.

CTLD는 개략적으로 120개의 아미노산 잔기로 구성되고, 특징으로서 2 또는 3개의 사슬내 이황화 브릿지를 함유한다. 상이한 단백질로부터의 CTLD 사이의 아미노산 서열 수준에서의 유사성은 비교적 낮지만, 다수의 CTLD의 3D-구조는 종종 5개 이하의 루프에 의해 규정되는 소위 루프-영역으로 본질적으로 제한되는 구조 변화성과 함께 고도로 보존적인 것으로 밝혀졌다. 여러 CTLD는 칼슘에 대한 1 또는 2개의 결합 부위를 함유하며, 칼슘과 상호작용하는 측쇄 대부분은 루프-영역 내에 위치된다.CTLD consists of approximately 120 amino acid residues and contains two or three intrachain disulfide bridges as a feature. Although the similarity at the amino acid sequence level between CTLDs from different proteins is relatively low, the 3D-structure of many CTLDs is highly highly oriented with structural variations that are essentially limited to the so-called loop-regions defined by five or fewer loops. It turns out to be conservative. Many CTLDs contain one or two binding sites for calcium, with most of the side chains interacting with calcium located within the loop-regions.

3D 구조 정보가 이용가능한 CTLD를 기초로 하여, 정준(canonical) CTLD는 β1, α1, α2, β2, β3, β4 및 β5의 순서로 연속적으로 보이는 7개의 주요 이차 구조 구성요소(즉, 5개의 β-가닥 및 2개의 α-헬릭스)를 구조적 특징으로 하는 것으로 추론된다. 도 4는 10개의 공지된 C-타입 렉틴의 CTLD의 정렬을 예시한다. 3D 구조가 결정된 모든 CTLD에서, β-가닥은 하나는 β1 및 β5로 구성되고, 나머지는 β2, β3 및 β4로 구성되는 2개의 역평행 β-시트로 배열된다. 추가의 β-가닥인 β0는 종종 서열 내에서 β1에 선행하고, 존재시, β1, β5-시트에 통합되는 추가 가닥을 형성한다. 추가로, 하나는 α1 및 β5를 연결(C-C)하고, 하나는 β3를 연결하는 2개의 이황화 브릿지, 및 β4 및 β5를 연결(C-C)하는 폴리펩티드 세그먼터가 지금까지 특성규명된 모든 CTLD에서 불변적으로 발견된다. 또한, 도 5는 인간 테트라넥틴 및 8개의 다른 테트라넥틴 또는 테트라넥틴 유사 폴리펩티드로부터의 CTLD의 정렬을 도시한다.Based on the CTLD where 3D structural information is available, the canonical CTLD is composed of seven major secondary structural components (i.e. five βs) that appear in succession in the order of β1, α1, α2, β2, β3, β4 and β5. -Strand and two α-helices) are inferred as structural features. 4 illustrates the alignment of CTLD of ten known C-type lectins. In all CTLDs with 3D structure determined, the β-strand is arranged in two antiparallel β-sheets, one consisting of β1 and β5 and the other consisting of β2, β3 and β4. An additional β-strand, β0, often precedes β1 in the sequence and, when present, forms an additional strand that is incorporated into β1, β5-sheet. In addition, two disulfide bridges linking α 1 and β 5 (C I -C IV ), one linking β 3, and polypeptide segments linking β 4 and β 5 (C II -C III ) so far It is found invariably in all characterized CTLDs. 5 also shows the alignment of CTLDs from human tetranectin and eight other tetranectin or tetranectin-like polypeptides.

CTLD 3D-구조에서, 이러한 보존된 이차 구조 구성요소는 본 발명의 상황에서 코어로부터 돌출된 "루프-영역"으로 집합적으로 언급되는 다수의 루프에 대한 밀집한 스캐폴드를 형성한다. CTLD의 일차 구조에서, 이러한 루프는 2개의 세그먼트, 루프 세그먼트 A, LSA, 및 루프 세그먼트 B, LSB로 조직화된다. LSA는 종종 규칙적인 이차 구조가 결핍되고, 4개 이하의 루프를 함유하는 β2 및 β3를 연결하는 긴 폴리펩티드 세그먼트이다. LSB는 β-가닥 β3 및 β4를 연결하는 폴리펩티드 세그먼트이다. β4 내의 단일 잔기와 함께 LSA의 잔기는 테트라넥틴의 CTLD를 포함하는 여러 CTLD의 Ca2 +-결합 부위 및 리간드-결합 부위를 특정하는 것으로 밝혀졌다. 예를 들어, 1개 또는 소수의 잔기의 치환을 포함하는 돌연변이유발 연구는 결합 특이성, Ca2 +-민감성 및/또는 친화성의 변화가 CTLD 도메인에 의해 제공될 수 있는 것으로 밝혀졌다. 비제한적인 예로 테트라넥틴, 리토스타틴, 마우스 대식세포 갈락토오스 렉틴, 쿠퍼 세포 수용체, 닭 뉴로칸(neurocan), 퍼루신(perlucin), 아시알로당단백질 수용체, 연골 프로테오글리칸 코어 단백질, IgE Fc 수용체, 췌장염-관련 단백질, 마우스 대식세포 수용체, 자연살세포 그룹, 줄기세포 성장 인자, 인자 Ⅸ/Ⅹ 결합 단백질, 만노오스 결합 단백질, 소 콘글루티닌, 소 CL43, 콜렉틴 간 1, 표면활성제 단백질 A, 표면활성제 단백질 D, e-셀렉틴, 피막 c-타입 렉틴, CD94 NK 수용체 도메인, LY49A NK 수용체 도메인, 닭 간 렉틴, 송어 c-타입 렉틴, HIV gp 120-결합 c-타입 렉틴, 및 수지상 세포 면역수용체를 포함하는 다수의 CTLD가 공지되어 있다. 예를 들어, 전체내용이 참조로서 본원에 포함되는 미국 특허 공개공보 2007/0275393호, 및 문헌[Essentials of Glycobiology, second edition. Edited by A. Varki, R.D. Cummings, J.D. Esko, HH. Freeze, P. Stanley, C.R. Bertozzi, G.W. Hart, M.E. Etzler. CHS Press]을 참조하라.In CTLD 3D-structures, these conserved secondary structure components form dense scaffolds for multiple loops collectively referred to as "loop-regions" protruding from the core in the context of the present invention. In the primary structure of the CTLD, this loop is organized into two segments, loop segment A, LSA, and loop segment B, LSB. LSAs are long polypeptide segments that link β2 and β3, which often lack regular secondary structure and contain no more than four loops. LSB is a polypeptide segment that connects β-strand β3 and β4. moiety of the LSA with a single residue in β4 is Ca 2 + the number of CTLD comprising the CTLD of tetra nektin - It has been found that the specific binding site-binding sites and a ligand. For example, mutations induced with 1 or substitution of a few residues study binding specificity, Ca 2 + - it has been found that the sensitivity and / or affinity can be provided by changing the CTLD domain. Non-limiting examples include tetranectin, ritstatin, mouse macrophage galactose lectin, cooper cell receptor, chicken neurocan, perlucin, asialo glycoprotein receptor, cartilage proteoglycan core protein, IgE Fc receptor, pancreatitis- Related proteins, mouse macrophage receptor, natural killer cell group, stem cell growth factor, factor VIII / VIII binding protein, mannose binding protein, bovine conglutinin, bovine CL43, collectin liver 1, surfactant protein A, surfactant protein D, e-selectin, capsular c-type lectin, CD94 NK receptor domain, LY49A NK receptor domain, chicken liver lectin, trout c-type lectin, HIV gp 120-binding c-type lectin, and dendritic cell immunoreceptors Many CTLDs are known. See, eg, US Patent Publication No. 2007/0275393, which is incorporated herein by reference in its entirety, and Essentials of Glycobiology, second edition. Edited by A. Varki, RD Cummings, JD Esko, HH. Freeze, P. Stanley, CR Bertozzi, GW Hart, ME Etzler. CHS Press].

"아트리머™ 폴리펩티드 복합체" 또는 "아트리머™ 복합체"는 CTLD를 또한 포함하는 3개의 삼합체화 도메인의 삼합체 복합체를 의미한다(Anaphore, Inc., San Diego, California)."Atrimer ™ polypeptide complex" or "Atrimer ™ complex" means a trimer complex of three trimerization domains that also includes CTLD (Anaphore, Inc., San Diego, California).

표현 "유효량"은 동시에 투여되거나 연속적으로 투여되는지 간에 당해 질병 또는 질환을 예방하거나, 개선시키거나, 치료하는데 효과적인, 임의로 치료제와 함께 본 발명의 폴리펩티드의 양을 의미한다. 특정 구체예에서, 유효량은 IL-23R 발현 세포에 대한 IL-23의 효과를 감소시키거나, IL-23R과 상승작용적으로 작용하는 IL-23R 발현 세포에 대한 다른 경로에 영향을 미치거나, IL-23R 발현 세포와 함께 작용하는 다른 면역 세포에 영향을 미치거나, 증식하거나 생존하는 세포의 경향을 감소시키거나, 아폽토시스를 겪는 세포의 경향(예를 들어, 상승작용적으로)을 향상시키거나 달리 증가시키거나, 종양 부피를 감소시키거나, 암 또는 면역 관련 질병을 갖는 포유동물의 생존을 연장시키기에 충분한, 조합된 본 발명의 폴리펩티드, 및 치료제, 예를 들어, 세포독성제 또는 면역억제제의 양이다.The expression “effective amount” means the amount of a polypeptide of the present invention, optionally in combination with a therapeutic agent, effective to prevent, ameliorate, or treat the disease or condition whether administered simultaneously or sequentially. In certain embodiments, the effective amount reduces the effect of IL-23 on IL-23R expressing cells, affects other pathways for IL-23R expressing cells that synergize with IL-23R, or Affect other immune cells that work with -23R expressing cells, reduce the tendency of cells to proliferate or survive, or enhance or otherwise enhance the tendency (eg, synergistically) of cells that undergo apoptosis Amounts of polypeptides of the invention in combination, and therapeutic agents, such as cytotoxic or immunosuppressive agents, that are sufficient to increase, decrease tumor volume, or prolong survival of a mammal with cancer or an immune related disease. to be.

"치료제"는 세포독성제, 화학요법제, 면역억제제, 항염증제, 면역자극제, 및/또는 성장억제제를 의미한다."Therapeutic agent" means a cytotoxic agent, chemotherapeutic agent, immunosuppressant, anti-inflammatory agent, immunostimulatory agent, and / or growth inhibitory agent.

부수적 요법에 대해 본원에서 사용되는 용어 "면역억제제" 및 "염증의 조정자"는 본원에서 치료되는 포유동물의 면역계를 억제하거나 감추는 작용을 하는 물질을 의미한다. 이는 사이토카인 생성을 억제하거나, 자가-항원 발현을 하향조절하거나 억제하거나, 만성 염증의 부위로의 면역 세포의 이동을 억제하거나, MHC 항원을 감추는 물질을 포함한다. 이러한 작용제의 예는 2-아미노-6-아릴-5-치환된 피리미딘(U.S. Pat. No. 4,665,077 참조); 비스테로이드성 항염증 약물(NSAIDs); 아자티오프린(azathioprine); 사이클로포스파미드(cyclophosphamide); 브로모크립틴(bromocryptine); 다나졸(danazol); 답손(dapsone); 글루타르알데히드(U.S. Pat. No. 4,120,649호에 기재된 바와 같이 MHC 항원을 감춤); MHC 항원 및 MHC 단편에 대한 항-이디오타입 항체; 사이클로스포린 A(cyclosporin A); 스테로이드, 예를 들어, 글루코코르티코스테로이드, 예를 들어, 프레드니손(prednisone), 메틸프레드니솔론(methylprednisolone), 덱사메타손(dexamethasone), 및 하이드로코르티손(hydrocortisone); 메토트렉세이트(methotrexate)(경구 또는 피하); 하이드록시클로로퀸(hydroxycloroquine); 설파살라진(sulfasalazine); 레플루노마이드(leflunomide); 사이토카인 또는 사이토카인 수용체 길항제, 예를 들어, 항-인터페론-감마(IFN-γ), -β, 또는 -α 항체, 항-종양 괴사 인자-α 항체(예를 들어, 인플릭시맙(infliximab), 아달리무맙(adalimumab) 또는 심지아(Cimzia)), 항-TNFα 면역어드헤신(immunoadhesin)(에타너셉트(etanercept)), 항-종양 괴사 인자-β 항체, 항-TGF-β 항체, 항-인터루킨-2 항체 및 항-IL-2 수용체 항체; 항-IL-6 항체, 항-IL-6R 항체, 항-LFA-1 항체, 예를 들어, 항-CD11a 및 항-CD18 항체; 항-L3T4 항체; 이종성 항-림프구 글로불린; 판(pan)-T 항체, 바람직하게는 항-CD3 또는 항-CD4/CD4a 항체; LFA-3 결합 도메인을 함유하는 가용성 펩티드(WO 90/08187 published Jul. 26, 1990); 스트렙토키나제; TGF-β; 스트렙토도마제(streptodomase); 숙주로부터의 RNA 또는 DNA; FK506; RS-61443; 데옥시스퍼구알린(deoxyspergualin); 라파마이신; T-세포 수용체(Cohen et al., U.S. Pat. No. 5,114,721); T-세포 수용체 단편(Offner et al., Science, 251:430-432 (1991); WO 90/11294; Janeway, Nature, 341:482 (1989); 및 WO 91/01133); 및 T-세포 수용체 항체(EP 340,109), 예를 들어, T10B9, 인테그린 억제제, 예를 들어, 티사브리(Tysabri), CCR9 또는 CCR6 길항제, 항-TL1A 항체 또는 면역 반응을 억제하는 것으로 공지된 사이토카인, 예를 들어, IL-10 또는 IL-27을 포함하나, 이에 제한되지는 않는다.As used herein for collateral therapy, the terms "immunosuppressant" and "modulator of inflammation" refer to substances that act to inhibit or conceal the immune system of the mammal being treated herein. These include substances that inhibit cytokine production, downregulate or inhibit self-antigen expression, inhibit the migration of immune cells to the site of chronic inflammation, or conceal MHC antigens. Examples of such agents include 2-amino-6-aryl-5-substituted pyrimidines (see US Pat. No. 4,665,077); Nonsteroidal anti-inflammatory drugs (NSAIDs); Azathioprine; Cyclophosphamide; Bromocryptine; Danazol; Dapsone; Glutaraldehyde (hiding MHC antigens as described in US Pat. No. 4,120,649); Anti-idiotype antibodies against MHC antigens and MHC fragments; Cyclosporin A; Steroids such as glucocorticosteroids such as prednisone, methylprednisolone, dexamethasone, and hydrocortisone; Methotrexate (oral or subcutaneous); Hydroxycloroquine; Sulfasalazine; Leflunomide; Cytokine or cytokine receptor antagonists such as anti-interferon-gamma (IFN-γ), -β, or -α antibodies, anti-tumor necrosis factor-α antibodies (eg, infliximab ), Adalimumab or azizia), anti-TNFα immunoadhesin (etanercept), anti-tumor necrosis factor-β antibody, anti-TGF-β antibody, anti- Interleukin-2 antibodies and anti-IL-2 receptor antibodies; Anti-IL-6 antibodies, anti-IL-6R antibodies, anti-LFA-1 antibodies, such as anti-CD11a and anti-CD18 antibodies; Anti-L3T4 antibodies; Heterologous anti-lymphocyte globulin; Pan-T antibodies, preferably anti-CD3 or anti-CD4 / CD4a antibodies; Soluble peptides containing LFA-3 binding domains (WO 90/08187 published Jul. 26, 1990); Streptokinase; TGF-β; Streptodomase; RNA or DNA from the host; FK506; RS-61443; Deoxyspergualin; Rapamycin; T-cell receptor (Cohen et al. , US Pat. No. 5,114,721); T-cell receptor fragments (Offner et al. , Science, 251: 430-432 (1991); WO 90/11294; Janeway, Nature, 341: 482 (1989); and WO 91/01133); And T-cell receptor antibodies (EP 340,109) such as T10B9, integrin inhibitors such as Tysabri, CCR9 or CCR6 antagonists, anti-TL1A antibodies or cytokines known to inhibit immune responses Such as, but not limited to, IL-10 or IL-27.

본원에서 사용되는 용어 "세포독성제"는 세포의 기능을 억제하거나 방지하고/하거나 세포의 파괴를 야기시키는 물질을 의미한다. 상기 용어는 방사성 동위원소(예를 들어, At211, I131, I125, Y90, Re186, Re188, Sm153, Bi212, P32 및 Lu의 방사성 동위원소), 화학치료제, 및 독소, 예를 들어, 박테리아, 진균, 식물 또는 동물 기원의 소분자 독소 또는 효소적으로 활성인 독소, 또는 이들의 단편)을 포함한다.As used herein, the term “cytotoxic agent” means a substance that inhibits or prevents the function of a cell and / or causes cell destruction. The term refers to radioisotopes (eg, radioisotopes of At 211 , I 131 , I 125 , Y 90 , Re 186 , Re 188 , Sm 153 , Bi 212 , P 32 and Lu), chemotherapeutic agents, and toxins , Eg, small molecule toxins or enzymatically active toxins of bacterial, fungal, plant or animal origin, or fragments thereof.

"화학치료제"는 암의 치료에 유용한 화학적 화합물이다. 화학치료제의 예는 알킬화제, 예를 들어, 티오테파(thiotepa) 및 CYTOXAN® 사이클로포스파미드; 알킬 설포네이트, 예를 들어, 부술판(busulfan), 임프로술판(improsulfan) 및 피포술판(piposulfan); 아지리딘(aziridine), 예를 들어, 벤조도파(benzodopa), 카르보쿠온(carboquone), 메투레도파(meturedopa), 및 우레도파(uredopa); 에틸렌이민 및 메틸아멜아민, 예를 들어, 알트레타민(altretamine), 트리에틸렌멜라민, 트리에틸렌포스포르아미드, 트리에틸렌티오포스포르아미드 및 트리메틸올멜라민; 아세토게닌(acetogenin)(특히, 불라타신(bullatacin) 및 불라타시논(bullatacinone)); 캄프토테신(camptothecin)(합성 유사체 토포테칸(topotecan)을 포함함); 브리오스타틴(bryostatin); 칼리스타틴(callystatin); CC-1065(이의 아도젤레신(adozelesin), 카르젤레신(carzelesin) 및 비젤레신(bizelesin) 합성 유사체를 포함함); 크립토파이신(cryptophycin)(특히, 크립토파이신 1 및 크립토파이신 8); 돌라스타틴(dolastatin); 듀오카르마이신(duocarmycin)(합성 유사체 KW-2189 및 CB1-TM1을 포함함); 일루테로빈(eleutherobin); 판크라티스타틴(pancratistatin); 사르코딕티인(sarcodictyin); 스폰지스타틴(spongistatin); 질소 머스타드, 예를 들어, 클로람부실(chlorambucil), 클로르나파진(chlornaphazine), 콜로포스파미드(cholophosphamide), 에스트라무스틴(estramustine), 이포스파미드(ifosfamide), 메클로르에타민(mechlorethamine), 메클로르에타민 옥시드 하이드로클로라이드, 멜팔란(melphalan), 노벰비킨(novembichin), 페네스테린(phenesterine), 프레드니무스틴(prednimustine), 트로포스파미드(trofosfamide), 우라실 머스타드; 니트로스우레아, 예를 들어, 카르무스틴(carmustine), 클로로조토신(chlorozotocin), 포테무스틴(fotemustine), 로무스틴(lomustine), 니무스틴(nimustine), 및 라니무스틴(ranimustine); 항생제, 예를 들어, 에네디인(enediyne) 항생제(예를 들어, 칼리케아마이신(calicheamicin), 특히 칼리케아마이신 감마 11 및 칼리케아마이신 오메가 11(예를 들어, Agnew, Chem Intl. Ed. Engl, 33: 183-186 (1994) 참조); 다이네마이신(dynemicin), 예를 들어, 다이네마이신 A; 비스포스포네이트, 예를 들어, 클로드로네이트(clodronate); 에스퍼라마이신(esperamicin); 뿐만 아니라 네오카르지노스타틴 발색단(neocarzinostatin chromophore) 및 관련 색단백질 에네디인 항생제 발색단), 아클라시노마이신(aclacinomysin), 악티노마이신(actinomycin), 오쓰라마이신(authramycin), 아자세린(azaserine), 블레오마이신(bleomycin), 칵티노마이신(cactinomycin), 카라바이신(carabicin), 카르미노마이신(carminomycin), 카르지노필린(carzinophilin), 크로모마이시니스(chromomycinis), 닥티노마이신(dactinomycin), 다우노루비신(daunorubicin), 데토루비신(detorubicin), 6-디아조-5-옥소-L-노르류신, ADRIAMYCIN® 독소루비신(doxorubicin)(모르폴리노-독소루비신, 시아노모르폴리노-독소루비신, 2-피롤리노-독소루비신 및 데옥시독소루비신을 포함함), 에피루비신(epirubicin), 에소루비신(esorubicin), 이다루비신(idarubicin), 마르셀로마이신(marcellomycin), 미토마이신(mitomycin), 예를 들어, 미토마이신 C, 미코페놀산(mycophenolic acid), 노갈라마이신(nogalamycin), 올리보마이신(olivomycin), 페프로마이신(peplomycin), 포트피로마이신(potfiromycin), 푸로마이신(puromycin), 쿠엘라마이신(quelamycin), 로도루비신(rodorubicin), 스트렙토니그린(streptonigrin), 스트렙토조신(streptozocin), 튜베르시딘(tubercidin), 유베니멕스(ubenimex), 지노스타틴(zinostatin), 조루비신(zorubicin); 항-대사물, 예를 들어, 메토트렉세이트 및 5-플루오로우라실(5-FU); 폴산 유사체, 예를 들어, 데노프테린(denopterin), 메토트렉세이트, 프테로프테린(pteropterin), 트리메트렉세이트; 퓨린 유사체, 예를 들어, 플루다라빈(fludarabine), 6-머캅토퓨린, 티아미프린(thiamiprine), 티오구아닌(thioguanine); 피리미딘 유사체, 예를 들어, 안시타빈(ancitabine), 아자시티딘(azacitidine), 6-아자우리딘(6-azauridine), 카르모푸르(carmofur), 시타라빈(cytarabine), 디데옥시우리딘(dideoxyuridine), 독시플루리딘(doxifluridine), 에노시타빈(enocitabine), 플록스우리딘(floxuridine); 안드로겐, 예를 들어, 칼루스테론(calusterone), 드로모스타놀론 프로피오네이트(dromostanolone propionate), 에피티오스타놀(epitiostanol), 메피티오스탄(mepitiostane), 테스톨락톤(testolactone); 항-부신, 예를 들어, 아미노글루테티미드(aminoglutethimide), 미토탄(mitotane), 트릴로스탄(trilostane); 폴산 보충물, 예를 들어, 프롤린산(frolinic acid); 아세글라톤(aceglatone); 알도포스파미드 글리코시드(aldophosphamide glycoside); 아미노레불린산; 에니루라실(eniluracil); 암사크린(amsacrine); 베스트라부실(bestrabucil); 비산트렌(bisantrene); 에다트렉세이트(edatraxate); 데포파민(defofamine); 데메콜신(demecolcine); 디아지쿠온(diaziquone); 엘포르니틴(elfornithine); 엘리프티늄 아세테이트(elliptinium acetate); 에포틸론(epothilone); 에토글루시드(etoglucid); 갈륨 니트레이트; 하이드록시우레아; 렌티난(lentinan); 로니다이닌(lonidainine); 마이탄시노이드(maytansinoid), 예를 들어, 마이탄신(maytansine) 및 안사미토신(ansamitocin); 미토구아존(mitoguazone); 미톡산트론(mitoxantrone); 모피단몰(mopidanmol); 니트라에린(nitraerine); 펜토스타틴(pentostatin); 페나멧(phenamet); 피라루비신(pirarubicin); 록소산트론(losoxantrone); 포도필린산(podophyllinic acid); 2-에틸히드라지드; 프로카르바진(procarbazine); PSK® 다당류 복합체(JHS Natural Products, Eugene, Oreg.); 라족산(razoxane); 리족신(rhizoxin); 시조피란(sizofiran); 스피로게르마늄(spirogermanium); 테누아존산(tenuazonic acid); 트리아지쿠온(triaziquone); 2,2',22"-트리클로로트리에틸아민; 트리코테센(trichothecene)(특히, T-2 독소, 베라쿠린 A(verracurin A), 로리딘 A(roridin A) 및 앙구이딘(anguidine)); 우레탄(urethan); 빈데신(vindesine); 다카르바진(dacarbazine); 만노무스틴(mannomustine); 미토브로니톨(mitobronitol); 미토락톨(mitolactol); 피포브로만(pipobroman); 가시토신(gacytosine); 아라비노시드(arabinoside)("Ara-C"); 사이클로포스파미드; 티오테파(thiotepa); 탁소이드(taxoid), 예를 들어, TAXOL® 파클리탁셀(Bristol-Myers Squibb Oncology, Princeton, N.J.), ABRAXANE™ 크레모포르(Cremophor)-비함유, 파클리탁셀의 알부민-설계된 나노입자 제형(American Pharmaceutical Partners, Schaumberg, Illinois), 및 TAXOTERE® 독세탁셀(doxetaxel)(Rhone-Poulenc Rorer, Antony, France); 클로란부실(chloranbucil); GEMZAR® 젬시타빈; 6-티오구아닌; 머캅토퓨린; 메토트렉세이트; 백금 유사체, 예를 들어, 시스플라틴 및 카르보플라틴; 빈블라스틴; 백금; 에토포시드(VP-16); 이포스파미드(ifosfamide); 미톡산트론(mitoxantrone); 빈크리스틴(vincristine); NAVELBINE® 비노렐빈(vinorelbine); 노반트론(novantrone); 테니포시드(teniposide); 에다트렉세이트(edatrexate); 다우노마이신(daunomycin); 아미노프테린(aminopterin); 젤로다(xeloda); 이반드로네이트(ibandronate); CPT-11; 토포이소머라제(topoisomerase) 억제제 RFS 2000; 디플루오로메틀하일로르니틴(difluorometlhylornithine, DMFO); 레티노이드, 예를 들어, 레티노산; 카페시타빈(capecitabine); 및 상기 중 임의의 것의 약학적으로 허용되는 염, 산 또는 유도체를 포함한다. 또한, 프로테아좀(proteasome) 억제제, 예를 들어, 보르테조미브(bortezomib)(Velcade), BCL-2 억제제, IAP 길항제(예를 들어, Smac 미믹(mimics)/xIAP 및 cIAP 억제제, 예를 들어, 특정 펩티드, 피리딘 화합물, 예를 들어, (S)-N-{6-벤조[1,3]디옥솔-5-일-1-[5-(4-플루오로-벤조일)-피리딘-3-일메틸]-2-옥소-1,2-디하이드로-피리딘-3-일}-2-메틸아미노-프로피온아미드, xIAP 안티센스), HDAC 억제제(HDACI) 및 키나제 억제제(소라페니브(Sorafenib))가 상기 정의에 포함된다.A "chemotherapeutic agent" is a chemical compound useful for the treatment of cancer. Examples of chemotherapeutic agents include alkylating agents such as thiotepa and CYTOXAN® cyclophosphamide; Alkyl sulfonates such as busulfan, improsulfan and piposulfan; Aziiridine such as benzodopa, carboquone, meturedopa, and uredopa; Ethyleneimine and methylamelamine, for example altretamine, triethylenemelamine, triethylenephosphoramide, triethylenethiophosphoramide and trimethylolmelamine; Acetogenin (especially bullatacin and bullatacinone); Camptothecin (including the synthetic analog topotecan); Bryostatin; Calstatin; CC-1065 (including its adozelesin, carzelesin and bizelesin synthetic analogues); Cryptophycins (especially cryptophycin 1 and cryptophycin 8); Dolastatin; Duocarmycin (including the synthetic analogues KW-2189 and CB1-TM1); Elutherobin; Pancratistatin; Sarcodictyin; Spongestatin; Nitrogen mustards such as chlorambucil, chlornaphazine, cholophosphamide, estramustine, ifosfamide, mechlorethamine Mechlorethamine oxide hydrochloride, melphalan, nomobichin, fenesterine, prednimustine, trofosfamide, uracil mustard; Nitrosureas such as carmustine, chlorozotocin, fotemustine, lomustine, nimustine, and ranimustine; Antibiotics, for example, enediyne antibiotics (e.g. calicheamicin, in particular calicheamicin gamma 11 and calicheamicin omega 11 (e.g. Agnew, Chem Intl. Ed. Engl) 33: 183-186 (1994); dynemicin, for example dynemycin A; bisphosphonates, such as clodronate; esperamicin; Neocarzinostatin chromophore and related chromoprotein enantiin antibiotic chromophores, aclacinomysin, actinomycin, authramycin, azaserine, bleomycin (bleomycin), cactinomycin (cactinomycin), carabicin (carabicin), carminomycin (carminomycin), carzinophilin, chromomycinis, dactinomycin, daunorubicin daunorubicin, Detoru Detorubicin, 6-diazo-5-oxo-L-norleucine, ADRIAMYCIN® doxorubicin (morpholino-doxorubicin, cyanomorpholino-doxorubicin, 2-pyrrolino-doxorubicin and deoxy Doxorubicin), epirubicin, epirubicin, esorubicin, idarubicin, marcelomycin, mitomycin, for example mitomycin C, mycophenolic acid (mycophenolic acid), nogalamycin, nogalamycin, olivomycin, peplomycin, potpyromycin, potoromycin, puromycin, quelamycin, rodorubicin ), Streptonigrin, streptozocin, tubercidin, ubenimex, zinostatin, zorubicin; Anti-metabolites such as methotrexate and 5-fluorouracil (5-FU); Folic acid analogs, for example denopterin, methotrexate, pteropterin, trimetrexate; Purine analogs such as fludarabine, 6-mercaptopurine, thiamiprine, thioguanine; Pyrimidine analogs, for example, ancitabine, azaciitidine, 6-azauridine, carmofur, cytarabine, dideoxyuridine ( dideoxyuridine, doxifluridine, enocitabine, floxuridine; Androgens such as calussterone, dromostanolone propionate, epitiostanol, mepitiostane, testolactone; Anti-adrenal glands such as aminoglutethimide, mitotane, trilostane; Folic acid supplements such as frolinic acid; Aceglatone; Aldophosphamide glycoside; Aminolevulinic acid; Eniluracil; Amsacrine; Bestrabucil; Bisantrene; Edatraxate; Defofamine; Demecolcine; Diaziquone; Elfornithine; Elliptinium acetate; Epothilone; Etoglucid; Gallium nitrate; Hydroxyurea; Lentinan; Lonidainine; Maytansinoids such as maytansine and ansamitocin; Mitoguazone; Mitoxantrone; Mopidanmol; Nitrarine; Pentostatin; Phennamet; Pyrarubicin; Loxoxantrone; Podophyllinic acid; 2-ethylhydrazide; Procarbazine; PSK® polysaccharide complex (JHS Natural Products, Eugene, Oreg.); Razoxane; Rhizoxin; Sizofiran; Spirogermanium; Tenuazonic acid; Triaziquone; 2,2 ', 22 "-trichlorotriethylamine; trichothecene (especially T-2 toxin, verracurin A, roridin A and anguidine); Urethane; vindesine; dacarbazine; mannomustine; mitobronitol; mitobronitol; mitolactol; pipobroman; pipobroman; gacytosine; Arabinoside ("Ara-C"); cyclophosphamide; thiotepa; taxoids such as TAXOL® Paclitaxel (Bristol-Myers Squibb Oncology, Princeton, NJ), ABRAXANE ™ Cremophor-free, albumin-designed nanoparticle formulations of paclitaxel (American Pharmaceutical Partners, Schaumberg, Illinois), and TAXOTERE® doxetaxel (Rhone-Poulenc Rorer, Antony, France); Chloranbucil; GEMZAR® gemcitabine; 6-thioguanine; mercaptopurine; methotrexate; platinum analogs such as cisp Tin and carboplatin; vinblastine; platinum; etoposide (VP-16); ifosfamide; mitoxantrone; vincristine; NAVELBINE® vinorelbine; norban Tronantrone; teniposide; edatrexate; daunomycin; aminopterin; gelloda; xeloda; ibandronate; CPT-11; Topoisomerase inhibitor RFS 2000; difluorometlhylornithine (DMFO); Retinoids such as retinoic acid; Capecitabine; And pharmaceutically acceptable salts, acids or derivatives of any of the above. In addition, proteasome inhibitors such as bortezomib (Velcade), BCL-2 inhibitors, IAP antagonists (eg Smac mimics / xIAP and cIAP inhibitors such as Specific peptides, pyridine compounds, for example (S) -N- {6-benzo [1,3] dioxol-5-yl-1- [5- (4-fluoro-benzoyl) -pyridine-3 -Ylmethyl] -2-oxo-1,2-dihydro-pyridin-3-yl} -2-methylamino-propionamide, xIAP antisense), HDAC inhibitor (HDACI) and kinase inhibitor (Sorafenib) Is included in the above definition.

또한, 종양에 대한 호르몬 작용을 조절하거나 억제하는 작용을 하는 항-종양제, 예를 들어, 항-에스트로겐 및 선택적 에스트로겐 수용체 조정자(SERMs), 예를 들어, 타목시펜(tamoxifen)(NOLVADEX® 타목시펜을 포함함), 랄록시펜(raloxifene), 드롤록시펜(droloxifene), 4-히드록시타목시펜(4-hydroxytamoxifen), 트리옥시펜(trioxifene), 케옥시펜(keoxifene), LY117018, 오나프리스톤(onapristone), 및 FARESTON-토레미펜(toremifene); 부신에서 에스트로겐 생성을 조절하는 효소 아로마타제를 억제하는 아로마타제 억제제, 예를 들어, 4(5)-이미다졸, 아미노글루테티미드(aminoglutethimide), MEGASE® 메게스트롤 아세테이트(megestrol acetate), AROMASIN® 엑세메스탄(exemestane), 포르메스타니(formestanie), 파드로졸(fadrozole), RIVISOR® 보로졸(vorozole), FEMARA® 레트로졸(letrozole), 및 ARIMIDEX® 아나스트로졸(anastrozole); 및 항-안드로겐, 예를 들어, 플루타미드(flutamide), 닐루타미드(nilutamide), 비칼루타미드(bicalutamide), 류프롤리드(leuprolide), 및 고세렐린(goserelin); 뿐만 아니라 트록사시타빈(troxacitabine)(1,3-디옥솔란 누클레오시드 시토신 유사체); 안티센스 올리고누클레오티드, 특히 압헤런트 세포(abherant cell) 증식과 관련된 신호전달 경로에서의 유전자의 발현을 억제하는 안티센스 올리고누클레오티드, 예를 들어, PKC-알파, Ralf 및 H-Ras; 리보자임, 예를 들어, VEGF 발현 억제제(예를 들어, ANGIOZYME® 리보자임) 및 HER2 발현 억제제; 백신, 예를 들어, 유전자 요법 백신, 예를 들어, ALLOVECTIN® 백신, LEUVECTIN® 백신, 및 VAXID® 백신; PROLEUKIN® rIL-2; LURTOTECAN® 토포이소머라제 1 억제제; ABARELIX® rmRH; 및 상기 중 임의의 것의 약학적으로 허용되는 염, 산 또는 유도체가 포함된다.Also included are anti-tumor agents that act to modulate or inhibit hormonal action on tumors, such as anti-estrogens and selective estrogen receptor modulators (SERMs) such as tamoxifen (NOLVADEX® tamoxifen). Raloxifene, drroloxifene, 4-hydroxytamoxifen, trioxifene, keoxifene, LY117018, onapristone, and FARESTON Toremifene; Aromatase inhibitors that inhibit the enzyme aromatase that modulates estrogen production in the adrenal glands, such as 4 (5) -imidazole, aminoglutethimide, MEGASE® megestrol acetate, AROMASIN® Exemestane, formestanie, fadrozole, RIVISOR® vorozole, FEMARA® letrozole, and ARIMIDEX® anastrozole; And anti-androgens such as flutamide, nilutamide, bicalutamide, leuprolide, and goserelin; As well as troxacitabine (1,3-dioxolane nucleoside cytosine analogue); Antisense oligonucleotides, in particular antisense oligonucleotides that inhibit expression of genes in signaling pathways associated with abherant cell proliferation, such as PKC-alpha, Ralf and H-Ras; Ribozymes such as VEGF expression inhibitors (eg, ANGIOZYME® ribozymes) and HER2 expression inhibitors; Vaccines such as gene therapy vaccines such as ALLOVECTIN® vaccines, LEUVECTIN® vaccines, and VAXID® vaccines; PROLEUKIN® rIL-2; LURTOTECAN® topoisomerase 1 inhibitors; ABARELIX® rmRH; And pharmaceutically acceptable salts, acids or derivatives of any of the above.

본원에서 사용되는 "성장 억제제"는 시험관내 또는 생체내에서 세포의 성장을 억제하는 화합물 또는 조성물을 의미한다. 따라서, 성장 억제제는 S 단계의 상기 유전자를 과다발현하는 세포의 백분율을 유의하기 감소시키는 것이다. 성장 억제제의 예는 세포 주기 진행을 차단하는 작용제(S 단계가 아닌 위치), 예를 들어, G1 정지 및 M-단계 정지를 유도하는 작용제를 포함한다. 고전적 M-단계 차단제는 빈카스(vincas)(빈크리스틴(vincristine) 및 빈블라스틴(vinblastine)), 탁솔(taxol), 및 토포 II 억제제, 예를 들어, 독소루비신(doxorubicin), 에피루비신(epirubicin), 다우노루비신(daunorubicin), 에토포시드(etoposide), 및 블레오마이신(bleomycin)을 포함한다. G1을 정지시키는 상기 작용제, 예를 들어, DNA 알킬화제, 예를 들어, 타목시펜(tamoxifen), 프레드니손(prednisone), 다카르바진(dacarbazine), 메클로르에타민(mechlorethamine), 시스플라틴(cisplatin), 메토트렉세이트, 5-플루오로우라실, 및 아라-C(ara-C)는 또한 S-단계 정지까지 영향을 미친다. 추가 정보는 문헌[The Molecular Basis of Cancer, Mendelsohn and Israel, eds., Chapter 1, entitled "Cell cycle regulation, oncogenes, and antineoplastic drugs" by Murakami et al . (WB Saunders: Philadelphia, 1995, pg. 13)]에서 발견될 수 있다.As used herein, “growth inhibitor” refers to a compound or composition that inhibits the growth of cells in vitro or in vivo. Thus, a growth inhibitory agent is one that significantly reduces the percentage of cells that overexpress the gene in S phase. Examples of growth inhibitory agents include agents that block cell cycle progression (positions other than S phase), for example agents that induce G1 arrest and M-phase arrest. Classical M-stage blockers are vincas (vincristine and vinblastine), taxol, and topo II inhibitors such as doxorubicin, epirubicin ), Daunorubicin, etoposide, and bleomycin. Such agents that stop G1, such as DNA alkylating agents, for example tamoxifen, prednisone, dacarbazine, mechlorethamine, cisplatin, methotrexate, 5 Fluorouracil, and ara-C, also affects up to S-phase stop. Further information is provided in The Molecular Basis of Cancer, Mendelsohn and Israel, eds., Chapter 1, entitled "Cell cycle regulation, oncogenes, and antineoplastic drugs" by Murakami et al . (WB Saunders: Philadelphia, 1995, pg. 13).

세포 스트레스를 유도하는 작용제, 예를 들어, 아르기닌 고갈(arginine depleting) 작용제, 예를 들어, 아르기나제가 추가로 포함된다.Agents that induce cellular stress, such as arginine depleting agents, such as arginase, are further included.

B 세포에 영향을 미치는 항체, 예를 들어, 리툭시맙(Rituximab), 항-BAFF 또는 항-APRIL 항체 및 T 세포 고갈 항체, 예를 들어, 캄파쓰(Campath)가 추가로 포함된다. 또한, IL-23R 길항제와 아스피린 및 NFkB 경로의 억제제의 조합물이 유익할 수 있다.Further included are antibodies that affect B cells, such as Rituximab, anti-BAFF or anti-APRIL antibodies, and T cell depleted antibodies, such as Camppath. In addition, combinations of IL-23R antagonists with inhibitors of the aspirin and NFkB pathways may be beneficial.

본원에서 사용되는 "상승작용 활성", "상승작용", "상승작용 효과" 또는 "상승작용 유효량"은 IL-23R 길항제 및 치료제의 조합물을 이용하는 경우에 관찰되는 효과가 (1) IL-23R 길항제 또는 치료제가 단독으로(또는 개별적으로) 이용되는 경우에 달성되는 효과보다 크고, (2) IL-23R 길항제 또는 치료제에 대한 합산된(더해진) 효과의 합계보다 큰 것을 의미한다. 이러한 상승작용 또는 상승작용 효과는 당업자에게 공지된 다양한 수단에 의해 결정될 수 있다. 예를 들어, IL-23R 길항제 및 치료제의 상승작용 효과가 면역 세포로부터의 사이토카인 방출, 존재하는 면역 세포의 수 또는 유형의 감소, 또는 암의 경우, 종양 세포수 또는 종양 크기의 감소를 시험하는 시험관내 또는 생체내 검정 포맷에서 관찰될 수 있다.As used herein, "synergistic activity", "synergy", "synergistic effect" or "synergistically effective amount" means that the effect observed when using a combination of an IL-23R antagonist and a therapeutic agent is (1) IL-23R Greater than the effect achieved when the antagonist or therapeutic agent is used alone (or separately), and (2) greater than the sum of the combined (added) effect on the IL-23R antagonist or therapeutic agent. This synergy or synergistic effect can be determined by various means known to those skilled in the art. For example, the synergistic effect of IL-23R antagonists and therapeutic agents may be used to test for cytokine release from immune cells, a reduction in the number or type of immune cells present, or, in the case of cancer, a decrease in tumor cell number or tumor size. It can be observed in an in vitro or in vivo assay format.

용어 "암", "암성" 및 "악성"은 조절되지 않는 세포 성장을 전형적으로 특징으로 하는 포유동물의 생리학적 질환을 의미하거나, 이를 기재한다. 암의 예는 암종, 예를 들어, 선암종, 림프종, 모세포종, 흑색종, 육종, 및 백혈병을 포함하나, 이에 제한되지는 않는다. 상기 암의 더욱 특정한 예는 편평세포암, 소세포 폐암, 비-소세포 폐암(NSCLC), 위장암, 호지킨 및 비-호지킨 림프종, 췌장암, 교모세포종, 신경아교종, 자궁경부암, 난소암, 간암, 예를 들어, 간암종 및 간암, 방광암, 유방암, 결장암, 결장직장암, 자궁내막 암종, 골수종(예를 들어, 다발골수종), 타액선 암종, 신장암, 예를 들어, 신세포 암종 및 윌름즈 종양, 기저세포 암종, 흑색종, 전립선암, 외음부암, 갑상샘암, 고환암, 식도암, 및 다양한 타입의 두경부암을 포함한다.The terms "cancer", "cancerous" and "malignant" refer to or describe the physiological diseases of mammals that are typically characterized by unregulated cell growth. Examples of cancer include, but are not limited to, carcinoma, such as adenocarcinoma, lymphoma, blastoma, melanoma, sarcoma, and leukemia. More specific examples of such cancers include squamous cell cancer, small cell lung cancer, non-small cell lung cancer (NSCLC), gastrointestinal cancer, Hodgkin's and non-Hodgkin's lymphoma, pancreatic cancer, glioblastoma, glioma, cervical cancer, ovarian cancer, liver cancer, For example, hepatocarcinoma and liver cancer, bladder cancer, breast cancer, colon cancer, colorectal cancer, endometrial carcinoma, myeloma (eg multiple myeloma), salivary gland carcinoma, kidney cancer, eg renal cell carcinoma and Wilms' tumor, Basal cell carcinoma, melanoma, prostate cancer, vulvar cancer, thyroid cancer, testicular cancer, esophageal cancer, and various types of head and neck cancers.

용어 "면역 관련 질병"은 포유동물의 면역계의 구성요소가 포유동물에서의 이환률을 야기시키거나, 매개하거나, 달리 기여하는 질병 또는 장애를 의미한다. 면역 반응의 자극 또는 개재가 질병의 진행에 대한 개선적인 효과를 갖는 질병이 또한 포함된다. 자가면역 질병, 면역-매개 염증 질병이 상기 용어에 포함된다. 본 발명에 따라 치료될 수 있는 면역-관련 질병 및 염증 질병의 예(이중 일부는 면역 또는 T 세포 매개임)는 전신홍반루푸스, 류마티스 관절염, 연소기 만성 관절염, 척추관절염, 강직척추염, 전신경화증(공피증), 특발성 염증성 근육병증(피부근육염, 다발근육염), 원발성 쇼그렌 증후군, 전신성 혈관염, 사코이드증, 자가면역용혈빈혈(면역범혈구감소증, 발작성야간혈색뇨), 자가면역성혈소판 감소증(특발저혈소판자색반병, 면역-매개 저혈소판증), 갑상선염(그레이브스병, 하시모토 갑상선염, 연소기 림프구성 갑상선염, 위축성 갑상선염), 당뇨병, 면역-매개 신장병(사구체신염, 요세관간질신장염(tubulointerstitial nephritis)), 중추신경계 및 말초신경계의 탈수초병, 예를 들어, 다발경화증, 특발성 탈수초성 다발신경병증 또는 길랑-바레 증후군, 보그트-고야나기-하라다병, 굿파스처 질병, 및 만성염증성탈수초성 다발신경병증, 간쓸개 질병, 예를 들어, 전염성 간염(간염 A, B, C, D, E 및 다른 비-간친화 바이러스), 자가면역 만성 활동 간염, 원발쓸개관간경화, 육아종간염, 및 경화쓸개관염, 염증 질병, 예를 들어, 염증성장질환(궤양대장염: 크론병), 글루텐민감소장병, 휘플병, 및 섬유성 폐 질병, 자가면역 또는 면역-매개 피부 질병, 예를 들어, 물집 피부 질병, 다형홍반 및 접촉피부염, 건선, 알레르기 질병, 예를 들어, 천식, 알레르기비염, 아토피피부염, 식품과민성 및 두드러기, 폐의 면역 질병, 예를 들어, 호산구폐렴, 특발폐섬유증 및 과민성폐렴, 이식 관련 질병, 예를 들어, 이식편거부 및 이식편대숙주병, 면역-매개 또는 자가면역 안구 질병, 예를 들어, 포도막염, 건성안, 빌리프스 질병(Beliefs disease, BD)을 포함한다.The term “immune related disease” means a disease or disorder in which a component of the mammal's immune system causes, mediates, or otherwise contributes to morbidity in a mammal. Also included are diseases in which the stimulation or interference of the immune response has an improved effect on disease progression. Autoimmune diseases, immune-mediated inflammatory diseases are included in the term. Examples of immune-related diseases and inflammatory diseases that can be treated according to the present invention (some of which are immune or T cell mediated) include systemic lupus erythematosus, rheumatoid arthritis, combustor chronic arthritis, spondyloarthritis, ankylosing spondylitis, systemic sclerosis (coccus) Subcutaneous), idiopathic inflammatory myopathy (skin myositis, polymyositis), primary Sjogren's syndrome, systemic vasculitis, sarcoidosis, autoimmune hemolytic anemia (immune thrombocytopenia, paroxysmal nocturnal hemoglobinuria), autoimmune thrombocytopenia (idic hypoplatelet) Purple plague, immune-mediated hypothyroidism), thyroiditis (Grave's disease, Hashimoto's thyroiditis, combustive lymphocytic thyroiditis, atrophic thyroiditis), diabetes mellitus, immune-mediated nephropathy (tubulointerstitial nephritis), central nervous system And demyelinating diseases of the peripheral nervous system such as multiple sclerosis, idiopathic demyelinating polyneuropathy or Guillain-Barré syndrome, Vogt-Goya Ki-Harada disease, goodpasture disease, and chronic inflammatory demyelinating polyneuropathy, hepatobiliary disease, such as infectious hepatitis (hepatitis A, B, C, D, E and other non-hepatic viruses), autologous Immune Chronic Activity Hepatitis, Primary Gallbladder Liver Cirrhosis, Granuloma Hepatitis, and Sclerosing Gallbladder, Inflammatory Diseases, eg, Inflammatory Growth Disease (ulcerative colitis: Crohn's disease), Gluten-sensitive small bowel disease, Whiple disease, and fibrotic lung disease , Autoimmune or immune-mediated skin diseases, such as blister skin disease, polymorphic erythema and contact dermatitis, psoriasis, allergic diseases, for example asthma, allergic rhinitis, atopic dermatitis, food intolerance and urticaria, immune diseases of the lung Eosinophilic pneumonia, idiopathic fibrosis and irritable pneumonia, graft-related diseases such as graft rejection and graft-versus-host disease, immune-mediated or autoimmune ocular diseases such as uveitis, dry eye, billifs Disease (Beliefs disease, BD) .

감염성 질병은 AIDS(HIV 감염), 간염 A, B, C, D, 및 E, 박테리아 감염, 진균 감염, 원생동물 감염 및 기생충 감염을 포함한다.Infectious diseases include AIDS (HIV infection), hepatitis A, B, C, D, and E, bacterial infections, fungal infections, protozoan infections, and parasitic infections.

"B-세포 악성종양"은 B 세포를 포함하는 악성종양이다. 예로는 호지킨병, 예를 들어, 림프구 우위성 호지킨병(LPHD); 비-호지킨 림프종(NHL); 소포 중심세포(FCC) 림프종; 급성 림프구성 백혈병(ALL); 만성 림프구성 백혈병(CLL); 털세포백혈병; 형질세포양 림프구성 림프종; 외투세포림프종; AIDS 또는 HIV-관련 림프종; 다발골수종; 중추신경계(CNS) 림프종; 이식후 림프세포증식질환(PTLD); 발덴스트롬마크로글로불린혈증(림프형질세포성 림프종); 점막연관림프조직(MALT) 림프종; 및 변연부림프종/백혈병을 포함한다."B-cell malignancies" are malignancies including B cells. Examples include Hodgkin's disease, such as lymphocyte predominant Hodgkin's disease (LPHD); Non-Hodgkin's lymphoma (NHL); Follicular central cell (FCC) lymphoma; Acute lymphocytic leukemia (ALL); Chronic lymphocytic leukemia (CLL); Hairy cell leukemia; Plasmacytoid lymphocytic lymphoma; Mantle cell lymphoma; AIDS or HIV-related lymphomas; Multiple myeloma; Central nervous system (CNS) lymphoma; Lymphocyte proliferative disease (PTLD) after transplantation; Waldenstrom's macroglobulinemia (lymphoblastic lymphoma); Mucosal associated lymphoid tissue (MALT) lymphoma; And limbic lymphoma / leukemia.

"비-호지킨 림프종"(NHL)은 저등급/소포 NHL, 재발성 또는 난치성 NHL, 제일선 저등급 NHL, 단계 III/IV NHL, 화학요법 내성 NHL, 소림프구(SL) NHL, 중간 등급/소포 NHL, 중간 등급 미만성 NHL, 미만성 대세포 림프종, 공격 NHL(공격 제일선 NHL 및 공격 재발성 NHL을 포함함), 자가 줄기세포 이식 후에 재발하거나 이에 난치성인 NHL, 고등급 면역모세포 NHL, 고등급 림프모구 NHL, 고등급 소 비-분할 세포 NHL, 큰 종양(bulky disease) NHL 등을 포함하나, 이에 제한되지는 않는다."Non-Hodgkin's lymphoma" (NHL) is low grade / vesicle NHL, recurrent or refractory NHL, foremost low grade NHL, stage III / IV NHL, chemotherapy resistant NHL, small lymphocyte (SL) NHL, intermediate grade / vesicle NHL, intermediate grade diffuse NHL, diffuse large cell lymphoma, attack NHL (including attack first NHL and attack recurrent NHL), NHL relapsed or refractory to autologous stem cell transplantation, high grade immunoblast NHL, high grade lymphocytic NHL, high grade bovine non-dividing cell NHL, bulky disease NHL, and the like.

"종양-관련 항원"(TAA) 또는 "종양-특이적 항원"(TSA)은 숙주에서 면역 반응을 촉발시킬 수 있는 종양 세포 내에서 생성되는 분자이다. 종양 관련 항원은 종양 및 정상 세포 둘 모두에서 발견되나, 차별적 발현 수준으로 발견되는 반면, 종양 특이적 항원은 종양 세포에 의해서만 발현된다. 종양 세포의 표면에서 나타나는 TAA 또는 TSA는 알파태아단백, 암배아 항원(CEA), CA-125, MUC-1, 글리피칸-3(glypican-3), 종양 관련 당단백질-72(TAG-72), 상피 종양 항원, 티로시나제, 흑색종 관련 항원, MART-1, gp100, TRP-1, TRP-2, MSH-1, MAGE-1, -2, -3, -12, RAGE-1, GAGE 1-, -2, BAGE, NY-ESO-1, 베타-카테닌, CDCP-1, CDC-27, SART-1, EpCAM, CD20, CD23, CD33, EGFR, HER-2, 유방 종양-관련 항원 BTA-1 및 BTA-2, RCAS1(SiSo 세포에서 발현되는 수용체-결합 암 항원), 태반-특이적 1(PLAC-1), 신데칸(syndecan), MN(gp250), 특히 이디오타입(idiotype)을 포함하나, 이에 제한되지는 않는다. 종양 관련 항원은 또한 혈액군 항원, 예를 들어, Lea, Leb, LeX, LeY, H-2, B-1, B-2 항원을 포함한다(본 명세서의 하부의 표 19 참조). 이상적으로, 본 발명의 목적상, TAA 또는 TSA 표적은 결합시 내재화되지 않는다.A "tumor-associated antigen" (TAA) or a "tumor-specific antigen" (TSA) is a molecule produced in tumor cells that can trigger an immune response in a host. Tumor related antigens are found in both tumor and normal cells, but at differential expression levels, while tumor specific antigens are expressed only by tumor cells. TAAs or TSAs that appear on the surface of tumor cells are alpha-fetoprotein, cancer embryo antigen (CEA), CA-125, MUC-1, glypican-3, tumor-associated glycoprotein-72 (TAG-72) , Epithelial tumor antigen, tyrosinase, melanoma-associated antigen, MART-1, gp100, TRP-1, TRP-2, MSH-1, MAGE-1, -2, -3, -12, RAGE-1, GAGE 1- , -2, BAGE, NY-ESO-1, beta-catenin, CDCP-1, CDC-27, SART-1, EpCAM, CD20, CD23, CD33, EGFR, HER-2, breast tumor-associated antigen BTA-1 And BTA-2, RCAS1 (receptor-binding cancer antigen expressed in SiSo cells), placental-specific 1 (PLAC-1), syndecan, MN (gp250), in particular idiotype However, it is not limited thereto. Tumor-associated antigens also include blood group antigens, eg, Le a , Le b , LeX, LeY, H-2, B-1, B-2 antigens (see Table 19 below). Ideally, for the purposes of the present invention, TAA or TSA targets are not internalized upon binding.

"비-자연 아미노산" 또는 "비-자연 발생 아미노산"은, 예를 들어, 자연 엔코딩되는 아미노산(20개의 통상적인 아미노산 또는 피로리신 및 셀레노시스테인을 포함하나, 이에 제한되지는 않음)의 변형(예를 들어, 번역후 변형)에 의해 발생하지만, 번역 복합체에 의해 성장하는 폴리펩티드 사슬 내로 자신이 자연적으로 통합되지 않는 아미노산을 포함하는, 20개의 통상적인 아미노산 중 하나가 아닌 아미노산을 의미한다. 이러한 비-자연 발생 아미노산의 예는 N-아세틸글루코사미닐-L-세린, N-아세틸글루코사미닐-L-트레오닌, 및 O-포스포티로신을 포함하나, 이에 제한되지는 않는다.A "non-natural amino acid" or "non-naturally occurring amino acid" means, for example, a modification of a naturally encoded amino acid (including, but not limited to, 20 conventional amino acids or pyrrolysine and selenocysteine) Eg, amino acids, caused by post-translational modifications, but not one of the 20 common amino acids, including amino acids that do not naturally integrate into the polypeptide chains growing by the translation complex. Examples of such non-naturally occurring amino acids include, but are not limited to, N-acetylglucosaminyl-L-serine, N-acetylglucosaminyl-L-threonine, and O-phosphotyrosine.

"보존적으로 변형된 변이체"는 아미노산 및 핵산 서열 둘 모두에 적용된다. 특정 핵산 서열과 관련하여, 보존적으로 변형된 변이체는 동일하거나 본질적으로 동일한 아미노산 서열을 엔코딩하는 핵산을 의미하거나, 핵산이 아미노산 서열을 엔코딩하지 않는 경우, 본질적으로 동일한 핵산 서열을 의미한다. 유전 부호의 축퇴성으로 인해, 매우 많은 수의 기능적으로 동일한 핵산이 임의의 제공된 단백질을 엔코딩할 수 있다."Conservatively modified variants" applies to both amino acid and nucleic acid sequences. In the context of a particular nucleic acid sequence, a conservatively modified variant refers to a nucleic acid that encodes the same or essentially the same amino acid sequence or, if the nucleic acid does not encode an amino acid sequence, to essentially the same nucleic acid sequence. Due to the degeneracy of the genetic code, a very large number of functionally identical nucleic acids can encode any given protein.

아미노산 서열에 관해서, 당업자는 보존된 아미노산에 대해 엔코딩되는 서열 내의 아미노산 또는 아미노산의 특정 백분율을 치환하는 핵산, 펩티드, 폴리펩티드, 또는 단백질 서열에 대한 개별적 치환이 "보존적으로 변형된 변이체"임을 인지할 것이다. 기능적으로 유사한 아미노산을 제공하는 보존성 치환 표가 당 분야에 널리 공지되어 있다.With respect to amino acid sequences, one of ordinary skill in the art would recognize that individual substitutions to nucleic acids, peptides, polypeptides, or protein sequences that substitute for a specific percentage of amino acids or amino acids in the sequence encoded relative to conserved amino acids are “conservatively modified variants”. will be. Conservative substitution tables that provide functionally similar amino acids are well known in the art.

보존성 치환의 예는 하기 군 중 하나의 아미노산과 동일 군 중 또 다른 아미노산의 교환이다(U.S. Pat. No. 5,767,063 issued to Lee, et al; Kyte and Doolittle (1982) J. Mol. Biol. 157: 105-132): (1) 소수성: 노르류신, Ile, Val, Leu, Phe, Cys, 또는 Met; (2) 중성 친수성: Cys, Ser, Thr; (3) 산성: Asp, Glu; (4) 염기성: Asn, Gln, His, Lys, Arg; (5) 사슬 배향에 영향을 주는 잔기: Gly, Pro; (6) 방향족: Trp, Tyr, Phe; (7) 작은 아미노산: Gly, Ala, Ser.An example of a conservative substitution is the exchange of an amino acid of one of the following groups with another amino acid of the same group (US Pat. No. 5,767,063 issued to Lee, et al; Kyte and Doolittle (1982) J. Mol. Biol. 157: 105 132): (1) hydrophobic: norleucine, Ile, Val, Leu, Phe, Cys, or Met; (2) neutral hydrophilic: Cys, Ser, Thr; (3) acidic: Asp, Glu; (4) basic: Asn, Gln, His, Lys, Arg; (5) Residues that affect chain orientation: Gly, Pro; (6) aromatic: Trp, Tyr, Phe; (7) small amino acids: Gly, Ala, Ser.

억제 정도를 조사하기 위해, 예를 들어, 제공된, 예를 들어, 단백질, 유전자, 세포 또는 유기체를 포함하는 샘플 또는 검정이 잠재적 활성제 또는 억제제로 처리되고, 억제제가 없는 대조군 샘플과 비교된다. 대조군 샘플, 즉, 길항제로 처리되지 않은 샘플이 100%의 상대 활성 값으로 지정된다. 대조군에 비한 활성 값이 약 90% 또는 그 미만, 전형적으로 85% 또는 그 미만, 더욱 전형적으로 80% 또는 그 미만, 가장 전형적으로 75% 또는 그 미만, 일반적으로 70% 또는 그 미만, 더욱 일반적으로 65% 또는 그 미만, 가장 일반적으로 60% 또는 그 미만, 전형적으로 55% 또는 그 미만, 일반적으로 50% 또는 그 미만, 더욱 일반적으로 45% 또는 그 미만, 가장 일반적으로 40% 또는 그 미만, 바람직하게는 35% 또는 그 미만, 더욱 바람직하게는 30% 또는 그 미만, 더욱 더 바람직하게는 25% 또는 그 미만, 가장 바람직하게는 25% 미만인 경우에 억제가 달성된다. 대조군에 비한 활성 값이 약 110%, 일반적으로 적어도 120%, 더욱 일반적으로 적어도 140%, 더욱 일반적으로 적어도 160%, 흔히 적어도 180%, 더욱 흔히 적어도 2배, 가장 흔히 적어도 2.5배, 일반적으로 적어도 5배, 더욱 일반적으로 적어도 10배, 바람직하게는 적어도 20배, 더욱 바람직하게는 적어도 40배, 가장 바람직하게는 40배를 초과하여 높은 경우에 활성화가 달성된다.To investigate the degree of inhibition, for example, a sample or assay provided, eg, comprising a protein, gene, cell or organism, is treated with a potential active agent or inhibitor and compared to a control sample without inhibitor. Control samples, ie samples not treated with antagonists, are assigned a relative activity value of 100%. Activity values relative to the control are about 90% or less, typically 85% or less, more typically 80% or less, most typically 75% or less, generally 70% or less, more generally 65% or less, most typically 60% or less, typically 55% or less, generally 50% or less, more generally 45% or less, most generally 40% or less, preferably Preferably, inhibition is achieved when it is 35% or less, more preferably 30% or less, even more preferably 25% or less, and most preferably less than 25%. An activity value relative to the control is about 110%, generally at least 120%, more generally at least 140%, more generally at least 160%, often at least 180%, more often at least 2 times, most often at least 2.5 times, generally at least Activation is achieved when 5 times, more generally at least 10 times, preferably at least 20 times, more preferably at least 40 times, most preferably more than 40 times high.

활성화 또는 억제의 종점은 다음과 같이 모니터될 수 있다. 활성화, 억제, 및, 예를 들어, 세포, 생리학적 유체, 조직, 기관, 및 동물 또는 인간 대상체의 처리에 대한 반응은 종점에 의해 모니터될 수 있다. 종점은, 예를 들어, 염증, 발암성, 또는 세포 탈과립 또는 분비의 지표, 예를 들어, 사이토카인, 독성 산소, 또는 프로테아제의 방출의 소정량 또는 소정의 백분율을 포함할 수 있다. 종점은, 예를 들어, 이온 유동 또는 수송; 세포 이동; 세포 부착; 세포 증식; 전이 잠재성; 세포 분화; 및 표현형의 변화, 예를 들어, 염증, 아폽토시스, 형질전환, 세포 주기, 또는 전이와 관련된 유전자의 발현에서의 변화의 소정량을 포함할 수 있다(예를 들어, Knight (2000) Ann. Clin. Lab. Sci. 30: 145-158; Hood and Cheresh (2002) Nature Rev. Cancer 2:91-100; Timme, et al. (2003) Curr. Drug Targets 4:251-261; Robbins and Itzkowitz (2002) Med. Clin. North Am. 86: 1467-1495; Grady and Markowitz (2002) Annu. Rev. Genomics Hum. Genet. 3: 101-128; Bauer, et al . (2001) Glia 36:235-243; Stanimirovic and Satoh (2000) Brain Pathol. 10: 113-126 참조).The endpoint of activation or inhibition can be monitored as follows. Activation, inhibition, and response to, for example, treatment of cells, physiological fluids, tissues, organs, and animal or human subjects can be monitored by endpoints. The endpoint may include, for example, an amount or a percentage of the release of inflammation, carcinogenicity, or cellular degranulation or secretion, for example cytokine, toxic oxygen, or protease. The endpoint may be, for example, ion flow or transport; Cell migration; Cell adhesion; Cell proliferation; Metastatic potential; Cell differentiation; And a predetermined amount of change in phenotype, eg, change in expression of a gene associated with inflammation, apoptosis, transformation, cell cycle, or metastasis (eg, Knight (2000) Ann. Clin. Lab.Sci. 30: 145-158; Hood and Cheresh (2002) Nature Rev. Cancer 2: 91-100; Timme, et al. (2003) Curr.Drug Targets 4: 251-261; Robbins and Itzkowitz (2002) Med. Clin.North Am. 86: 1467-1495; Grady and Markowitz (2002) Annu. Rev. Genomics Hum. Genet. 3: 101-128; Bauer, et. al . (2001) Glia 36: 235-243; Stanimirovic and Satoh (2000) Brain Pathol. 10: 113-126).

억제의 종점은 일반적으로 대조군의 75% 또는 그 미만, 바람직하게는 대조군의 50% 또는 그 미만, 더욱 바람직하게는 대조군의 25% 또는 그 미만, 가장 바람직하게는 대조군의 10% 또는 그 미만이다. 일반적으로, 활성화의 종점은 대조군의 적어도 150%, 바람직하게는 대조군의 적어도 2배, 더욱 바람직하게는 대조군의 적어도 4배, 가장 바람직하게는 대조군의 적어도 10배이다.The end point of inhibition is generally 75% or less of the control, preferably 50% or less of the control, more preferably 25% or less of the control, most preferably 10% or less of the control. In general, the end point of activation is at least 150% of the control group, preferably at least 2 times the control group, more preferably at least 4 times the control group, and most preferably at least 10 times the control group.

"표지"되는 조성물은 분광, 광화학, 생화학, 면역화학, 동위원소, 또는 화학 방법에 의해 직접 또는 간접적으로 검출가능하다. 예를 들어, 유용한 표지는 32P, 33P, 35S, 14C, 3H, 125I, 안정적 동위원소, 형광 염료, 전자-밀도 시약, 기질, 에피토프 태그, 또는 효소, 예를 들어, 효소결합 면역 흡착검사(enzyme-linked immunoassay)에 사용되는 효소, 또는 플루오렛(fluorette)을 포함한다(예를 들어, Rozinov and Nolan (1998) Chem. Biol. 5:713-728 참조).The "labeled" composition can be detected directly or indirectly by spectroscopic, photochemical, biochemical, immunochemical, isotope, or chemical methods. For example, useful labels include 32 P, 33 P, 35 S, 14 C, 3 H, 125 I, stable isotopes, fluorescent dyes, electron-density reagents, substrates, epitope tags, or enzymes such as enzymes Enzymes used in enzyme-linked immunoassay, or fluorettes (see, eg, Rozinov and Nolan (1998) Chem. Biol. 5: 713-728).

본 발명에 사용하기에 적합한 비자연 아미노산 중 다수의 비자연 아미노산이, 예를 들어 시그마(Sigma)(USA) 또는 알드리치(Aldrich)(Milwaukee, Wis., USA)에서 시판된다. 시판되지 않는 비자연 아미노산은 임의로 본원에 제공되거나, 다양한 간행물에 제공된 바와 같이 합성되거나, 당업자에게 공지된 표준 방법을 이용하여 합성된다. 유기 합성 기술에 대해, 예를 들어, 문헌[Organic Chemistry by Fessendon and Fessendon, (1982, Second Edition, Willard Grant Press, Boston Mass.); Advanced Organic Chemistry by March (Third Edition, 1985, Wiley and Sons, New York); and Advanced Organic Chemistry by Carey and Sundberg (Third Edition, Parts A and B, 1990, Plenum Press, New York)]을 참조하라. 비자연 아미노산의 합성을 기재하는 추가 간행물은, 예를 들어, 문헌[WO 2002/085923 entitled "In vivo incorporation of Unnatural Amino Acids;" Matsoukas et al., (1995) J. Med. Chem., 38, 4660-4669; King, F.E. & Kidd, D.A.A. (1949) A New Synthesis of Glutamine and of .gamma. -Dipeptides of Glutamic Acid from Phthylated Intermediates. J. Chem. Soc, 3315-3319; Friedman, O.M. & Chatterrji, R. (1959) Synthesis of Derivatives of Glutamine as Model Substrates for Anti-Tumor Agents. J. Am. Chem. Soc. 81, 3750-3752; Craig, J.C. et al. (1988) Absolute Configuration of the Enantiomers of 7-Chloro-4[[4-(diethylamino)-1-methylbutyl]amino]quinoline (Chloroquine). J. Org. Chem. 53, 1167-1170; Azoulay, M., Vilmont, M. & Frappier, F. (1991) Glutamine analogues as Potential Antimalarials, Eur. J. Med. Chem. 26, 201-5; Koskinen, A.M.P. & Rapoport, H. (1989) Synthesis of 4-Substituted Prolines as Conformationally Constrained Amino Acid Analogues. J. Org. Chem. 54, 1859-1866; Christie, B. D. & Rapoport, H. (1985) Synthesis of Optically Pure Pipecolates from L-Asparagine. Application to the Total Synthesis of (+)-Apovincamine through Amino Acid Decarbonylation and Iminium Ion Cyclization. J. Org. Chem. 1989: 1859-1866; Barton et al, (1987) Synthesis of Novel a-Amino-Acids and Derivatives Using Radical Chemistry: Synthesis of L-and D-a-Amino-Adipic Acids, L-a-aminopimelic Acid and Appropriate Unsaturated Derivatives. Tetrahedron Lett. 43: 4297-4308; and, Subasinghe et al, (1992) Quisqualic acid analogues: synthesis of beta-heterocyclic 2-aminopropanoic acid derivatives and their activity at a novel quisqualate-sensitized site. J. Med. Chem. 35: 4602-7]을 포함한다. 또한, 각각 전체내용이 참조로서 본원에 포함되는 US 2004/0198637호 및 US 2005/0170404호를 참조하라.Many of the unnatural amino acids suitable for use in the present invention are commercially available, for example, from Sigma (USA) or Aldrich (Milwaukee, Wis., USA). Non-commercial amino acids which are not commercially available are optionally synthesized as provided herein, as provided in various publications, or using standard methods known to those skilled in the art. For organic synthesis techniques, see, for example, Organic Chemistry by Fessendon and Fessendon, (1982, Second Edition, Willard Grant Press, Boston Mass.); Advanced Organic Chemistry by March (Third Edition, 1985, Wiley and Sons, New York); and Advanced Organic Chemistry by Carey and Sundberg (Third Edition, Parts A and B, 1990, Plenum Press, New York). Additional publications describing the synthesis of unnatural amino acids are described, for example, in WO 2002/085923 entitled "In vivo incorporation of Unnatural Amino Acids;" Matsoukas et al., (1995) J. Med. Chem., 38, 4660-4669; King, F.E. & Kidd, D.A.A. (1949) A New Synthesis of Glutamine and of .gamma. -Dipeptides of Glutamic Acid from Phthylated Intermediates. J. Chem. Soc, 3315-3319; Friedman, O. M. & Chatterrji, R. (1959) Synthesis of Derivatives of Glutamine as Model Substrates for Anti-Tumor Agents. J. Am. Chem. Soc. 81, 3750-3752; Craig, J.C. et al. (1988) Absolute Configuration of the Enantiomers of 7-Chloro-4 [[4- (diethylamino) -1-methylbutyl] amino] quinoline (Chloroquine). J. Org. Chem. 53, 1167-1170; Azoulay, M., Vilmont, M. & Frappier, F. (1991) Glutamine analogues as Potential Antimalarials, Eur. J. Med. Chem. 26, 201-5; Koskinen, A.M.P. & Rapoport, H. (1989) Synthesis of 4-Substituted Prolines as Conformationally Constrained Amino Acid Analogues. J. Org. Chem. 54, 1859-1866; Christie, B. D. & Rapoport, H. (1985) Synthesis of Optically Pure Pipecolates from L-Asparagine. Application to the Total Synthesis of (+)-Apovincamine through Amino Acid Decarbonylation and Iminium Ion Cyclization. J. Org. Chem. 1989: 1859-1866; Barton et al, (1987) Synthesis of Novel a-Amino-Acids and Derivatives Using Radical Chemistry: Synthesis of L-and D-a-Amino-Adipic Acids, L-a-aminopimelic Acid and Appropriate Unsaturated Derivatives. Tetrahedron Lett. 43: 4297-4308; and, Subasinghe et al, (1992) Quisqualic acid analogues: synthesis of beta-heterocyclic 2-aminopropanoic acid derivatives and their activity at a novel quisqualate-sensitized site. J. Med. Chem. 35: 4602-7. See also US 2004/0198637 and US 2005/0170404, each of which is incorporated herein by reference in its entirety.

용어 "아미노산 변형(들)" 및 "변형(들)"은 또 다른 아미노산 서열, 예를 들어, 자연 아미노산 서열에 비한 아미노산 서열에서의 아미노산 치환, 결실 또는 삽입 또는 이의 임의의 조합을 의미한다. 본원의 치환 변이체는 자연 CTLD 서열 내의 적어도 하나의 아미노산 잔기가 제거되고, 동일 위치의 그 자리에 상이한 아미노산이 삽입된 치환 변이체이다. 치환은 분자 내의 단지 하나의 아미노산이 치환된 경우에 단일 치환일 수 있거나, 이는 동일 분자 내에서 2개 이상의 아미노산이 치환된 경우에 다수의 치환일 수 있다. CTLD 내의 하나 이상의 아미노산 치환에 대한 특정 언급은 각각의 개별적 아미노산 치환이 연속적 및 비연속적 아미노산 위치를 포함하는 CTLD 내의 임의의 아미노산 위치에서 발생할 수 있는 다수의 치환을 의미한다. 마찬가지로, CTLD 내의 하나 이상의 아미노산 삽입 또는 결실에 대한 특정 언급은 각각의 개별적 아미노산 삽입 또는 결실이 연속적 및 비연속적 아미노산 위치를 포함하는 CTLD 내의 임의의 아미노산 위치에서 발생할 수 있는 다수의 삽입 또는 결실을 의미한다.The terms “amino acid modification (s)” and “modification (s)” refer to amino acid substitutions, deletions or insertions or any combination thereof in another amino acid sequence, eg, an amino acid sequence relative to a natural amino acid sequence. Substitution variants herein are substitution variants wherein at least one amino acid residue in the native CTLD sequence has been removed and a different amino acid inserted in its place at the same position. The substitution may be a single substitution when only one amino acid in the molecule is substituted, or it may be a plurality of substitutions when two or more amino acids are substituted in the same molecule. Specific reference to one or more amino acid substitutions in the CTLD means a number of substitutions where each individual amino acid substitution may occur at any amino acid position in the CTLD, including consecutive and discontinuous amino acid positions. Likewise, specific reference to one or more amino acid insertions or deletions in the CTLD refers to a number of insertions or deletions in which each individual amino acid insertion or deletion may occur at any amino acid position in the CTLD, including consecutive and discontinuous amino acid positions. .

용어 "핵산 분자 엔코딩", "DNA 서열 엔코딩", 및 "DNA 엔코딩"은 데옥시리보핵산의 가닥에 따른 데옥시리보누클레오티드의 순서 또는 차례를 의미한다. 이러한 데옥시리보누클레오티드의 순서는 폴리펩티드 사슬에 따른 아미노산의 순서를 결정한다. 따라서, DNA 서열은 아미노산 서열을 엔코딩한다.The terms “nucleic acid molecule encoding”, “DNA sequence encoding”, and “DNA encoding” refer to the order or order of deoxyribonucleotides along the strand of deoxyribonucleic acid. The order of these deoxyribonucleotides determines the order of amino acids along the polypeptide chain. Thus, the DNA sequence encodes an amino acid sequence.

무작위화된 폴리펩티드 또는 핵산 서열을 확인하기 위해 임의의 문맥에서 사용되는 용어 "무작위화시키다", "무작위화시키는" 및 "무작위화된" 뿐만 아니라 임의의 유사한 용어는 폴리펩티드 또는 핵산 서열 또는 세그먼트의 총체(ensemble)를 의미하며, 여기서 하나 이상의 서열 위치의 아미노산 잔기 또는 누클레오티드는 폴리펩티드 또는 핵산의 다양한 일원 사이에서 다양할 수 있어, 각각의 상기 서열 위치에서 발생하는 핵산 잔기 또는 누클레오티드는 모든 가능한 아미노산 잔기 또는 누클레오티드 또는 이의 임의의 제한된 서브셋을 포함할 수 있는 아미노산 잔기 또는 누클레오티드의 세트에 속할 수 있다. 상기 용어는 종종 가능한 아미노산 잔기 또는 누클레오티드의 수가 상기 총체의 각각의 일원에 대해 동일한 총체를 의미하는데 사용되나, 이는 또한 상기 총체의 각각의 일원의 가능한 아미노산 잔기 또는 누클레오티드의 수가 적절한 범위의 정수 내의 임의의 정수일 수 있는 총체를 의미하는데 사용될 수 있다.The terms “randomize,” “randomize,” and “randomized,” as used in any context to identify a randomized polypeptide or nucleic acid sequence, as well as any analogous terminology refer to the entirety of a polypeptide or nucleic acid sequence or segment ( ensemble, wherein the amino acid residues or nucleotides at one or more sequence positions can vary between various members of the polypeptide or nucleic acid, such that the nucleic acid residues or nucleotides occurring at each said sequence position are all possible amino acid residues or nucleotides or Belong to a set of amino acid residues or nucleotides that may include any restricted subset thereof. The term is often used to mean a total number of possible amino acid residues or nucleotides equal for each member of the total, but this also means that the number of possible amino acid residues or nucleotides of each member of the total is within an appropriate range of integers. Can be used to mean a total that can be an integer.

이제 보다 상세히 본 발명으로 돌아가서, 일 양태에서, 본 발명은 다합체화 도메인 및 IL-23R에 결합하는 적어도 하나의 폴리펩티드 결합 일원을 갖는 폴리펩티드에 관한 것이다. 본 발명에 따르면, 결합 일원은, 예를 들어, N-말단 또는 C-말단에서 다합체화 도메인에 연결될 수 있다. 또한, 특정 구체예에서, IL-23R에 결합하는 결합 일원, 또는 2개의 상이한 결합 일원을 단량체의 다합체화 도메인의 N-말단 및 C-말단 둘 모두에 연결시킴으로써, IL-23R에 결합할 수 있는 6개의 결합 일원을 포함하는 다합체 폴리펩티드 복합체를 제공하는 것이 유리할 수 있다. 일반적으로, 본 발명의 폴리펩티드는 비-자연 폴리펩티드, 예를 들어, 다합체화 도메인 및 IL-23R에 결합하는 폴리펩티드 서열의 융합 단백질이다. 비-자연 폴리펩티드는 또한 자연 발생 아미노산 서열은 아미노산의 첨가, 결실, 또는 치환에 의해 변경된 자연 폴리펩티드일 수 있다. 이러한 폴리펩티드의 예는 도메인의 루프 영역 중 하나 이상이 본원에 기재된 바와 같이 변형된 C-타입 렉틴 유사 도메인(CTLD)을 갖는 폴리펩티드를 포함한다. 본 발명의 다른 양태에서, IL-23R에 결합하는 폴리펩티드는 수용체에 결합하는 자연 폴리펩티드의 단편 또는 변이체이며, 여기서 자연 발생 폴리펩티드, 변이체 또는 단편이 다합체화 도메인에 융합되는 경우, 융합 도메인은 더 이상 자연 발생 폴리펩티드가 아니다. 따라서, 본 발명은 자연 발생 폴리펩티드, 단편 또는 이의 변이체를 본 발명의 융합 단백질의 부분으로부터 배제하지 않는다.Turning now to the present invention in more detail, in one aspect, the present invention relates to a polypeptide having a multimerization domain and at least one polypeptide binding member that binds to IL-23R. According to the invention, the binding member can be linked to the multimerization domain, for example at the N-terminus or the C-terminus. Furthermore, in certain embodiments, a binding member that binds to IL-23R, or two different binding members, can be bound to IL-23R by linking to both the N-terminus and C-terminus of the multimerization domain of the monomer. It may be advantageous to provide multimeric polypeptide complexes comprising six binding members. In general, polypeptides of the invention are fusion proteins of non-natural polypeptides, eg, polypeptide sequences that bind to the multimerization domain and IL-23R. Non-natural polypeptides can also be natural polypeptides whose naturally occurring amino acid sequences have been altered by addition, deletion, or substitution of amino acids. Examples of such polypeptides include polypeptides in which at least one of the loop regions of the domain has a modified C-type lectin-like domain (CTLD) as described herein. In another aspect of the invention, a polypeptide that binds to IL-23R is a fragment or variant of a natural polypeptide that binds to a receptor, where the fusion domain is no longer natural when the naturally occurring polypeptide, variant or fragment is fused to a multimerization domain. It is not a developing polypeptide. Thus, the present invention does not exclude naturally occurring polypeptides, fragments or variants thereof from portions of the fusion proteins of the present invention.

이러한 양태의 구체예에서, 폴리펩티드는 IL-23R에 결합하는 IL-23R 길항제이며, IL-23 경로를 통한 신호전달을 방지한다. 일 구체예에서, 폴리펩티드는 IL23-R(SEQ ID NO:5) 또는 이의 변이체에 결합한다. 자연 IL-23 리간드의 결합을 방지하는 본 발명의 폴리펩티드는 IL-23R 상의 하나 이상의 부위에 결합함으로써, IL-23 리간드에 의한 수용체의 활성화를 방지한다. 또한, 본 발명의 폴리펩티드는 효능제 활성을 갖지 않으며, IL-23 이종이합체 수용체를 활성화시키지 않는다.In an embodiment of this aspect, the polypeptide is an IL-23R antagonist that binds IL-23R and prevents signaling through the IL-23 pathway. In one embodiment, the polypeptide binds to IL23-R (SEQ ID NO: 5) or variants thereof. Polypeptides of the invention that prevent binding of natural IL-23 ligands bind to one or more sites on IL-23R, thereby preventing activation of the receptor by the IL-23 ligand. In addition, the polypeptides of the present invention do not have agonist activity and do not activate IL-23 heterodimeric receptors.

한 특정 구체예에서, 폴리펩티드는 IL-12Rβ1 또는 IL-12Rβ2에 특이적으로 결합하지 않는다. 따라서, 치료 조성물에서의 본 발명의 폴리펩티드의 사용은 특정 요법을 위한 IL-12의 활성을 원치 않게 차단하는 결과를 피할 수 있다.In one specific embodiment, the polypeptide does not specifically bind IL-12Rβ1 or IL-12Rβ2. Thus, the use of the polypeptides of the invention in therapeutic compositions can avoid the consequences of undesirably blocking the activity of IL-12 for certain therapies.

다양한 양태에서, 본 발명의 단량체 폴리펩티드는 다른 다합체화 도메인과 다합체 복합체를 형성할 수 있는 다합체화 도메인, 및 IL-23R에 결합하는 폴리펩티드 서열의 적어도 2개의 세그먼트를 포함한다. IL-23R에 결합하는 서열은 도메인의 N-말단, C-말단, 또는 N-말단 및 C-말단 둘 모두에서 다합체화 도메인과 융합될 수 있다. 일 구체예에서, N-말단에서 IL-23R에 결합하는 폴리펩티드는 삼합체화 도메인의 C-말단에서 IL-23R에 결합하는 폴리펩티드와 상이하다.In various embodiments, monomeric polypeptides of the invention comprise a multimerization domain capable of forming a multimer complex with another multimerization domain, and at least two segments of a polypeptide sequence that binds IL-23R. The sequence that binds IL-23R may be fused with a multimerization domain at the N-terminus, C-terminus, or both N-terminus and C-terminus of the domain. In one embodiment, the polypeptide that binds IL-23R at the N-terminus is different from the polypeptide that binds IL-23R at the C-terminus of the trimerization domain.

일 구체예에서, IL-23R에 결합하는 제 1 폴리펩티드는 삼합체화 도메인의 N-말단 및 C-말단 중 하나에 융합되고, 염증의 조정자인 제 2 폴리펩티드는 삼합체화 도메인의 N-말단 또는 C-말단의 나머지에 융합된다. 폴리펩티드가 아닌 조정자는 공유적 또는 비공유적으로 삼합체화 도메인에 연결될 수 있으며, 이는 당업자에 의해 이해될 것이다. 염증의 조정자 외에, 다른 폴리펩티드 및 비-폴리펩티드 치료제가 삼합체화 모듈에 연결될 수 있다.In one embodiment, the first polypeptide that binds IL-23R is fused to one of the N-terminus and C-terminus of the trimerization domain, and the second polypeptide that is a modulator of inflammation is the N-terminus or C- of the trimerization domain Fused to the rest of the terminus. Modulators other than polypeptides may be covalently or non-covalently linked to the trimerization domain, as will be understood by those skilled in the art. In addition to modulators of inflammation, other polypeptides and non-polypeptide therapeutics may be linked to the trimerization module.

암의 치료를 위해, 종양 세포 상의 IL-23의 종양-촉진 작용을 더욱 효과적으로 방지하기 위해 본 발명의 폴리펩티드를 종양 환경에 표적화시키는 것이 요망될 수 있다. 따라서, 본 발명의 또 다른 양태는 도메인의 한 말단(N-말단 또는 C-말단 중 어느 하나) 상에 IL-23R에 결합하는 폴리펩티드, 및 나머지 말단(N-말단 또는 C-말단의 나머지) 상에 종양-관련(TAA) 또는 종양-특이적 항원(TSA)에 결합하는 폴리펩티드를 갖는 다합체화 도메인을 포함한다. TAA 또는 TSA에 결합하는 도메인은 펩티드, 예를 들어, CTLD, 단쇄 항체, 또는 요망되는 표적에 특이적으로 결합하는 임의의 유형의 도메인일 수 있다.For the treatment of cancer, it may be desirable to target the polypeptides of the invention to the tumor environment to more effectively prevent the tumor-promoting action of IL-23 on tumor cells. Thus, another aspect of the invention provides a polypeptide that binds to IL-23R on one terminus (either the N-terminus or C-terminus) of a domain, and on the other terminus (the rest of the N-terminus or C-terminus). And a multimerization domain having a polypeptide that binds to a tumor-associated (TAA) or tumor-specific antigen (TSA). The domain that binds TAA or TSA can be a peptide, eg, CTLD, single chain antibody, or any type of domain that specifically binds to the desired target.

일 특정 방법에서, 죽음 수용체 효능제의 활성은 암의 환경에서 염증 부위로 약물을 유도하고, 삼합체화 도메인의 제 2 말단 상에 죽음 수용체 특이적 폴리펩티드의 집락화(clustering)를 가능케 하는 삼함체화 도메인의 한 말단에 IL-23R 결합 폴리펩티드를 통해 매개되는 결합 활성을 갖는 분자를 설계함으로써 향상될 수 있다. 다양한 양태에서, 폴리펩티드는 IL-23R 보다 낮은 친화성으로 죽음 수용체에 결합한다. 더욱 특히, IL-23R에 결합하는 폴리펩티드는 죽음 수용체에 결합하는 폴리펩티드보다 적어도 2배 큰 친화성, 예를 들어, 2, 2.5, 3, 3.5, 4, 4.5, 5, 10, 15, 20, 50 및 100배 큰 친화성으로 결합할 수 있다.In one particular method, the activity of the death receptor agonist induces the drug to the site of inflammation in the cancerous environment and allows for the clustering of the death receptor specific polypeptide on the second end of the trimerization domain. It can be improved by designing molecules with binding activity mediated through the IL-23R binding polypeptide at one end. In various embodiments, the polypeptide binds to the death receptor with a lower affinity than IL-23R. More particularly, polypeptides that bind to IL-23R have at least twice as much affinity as polypeptides that bind to death receptors, eg, 2, 2.5, 3, 3.5, 4, 4.5, 5, 10, 15, 20, 50 And 100-fold greater affinity.

IL-23R-결합 폴리펩티드(들) 및 TAA 또는 TSA 표적화 작용제(들) 둘 모두를 갖는 삼합체 복합체의 적응증은 비소세포폐암(NSCLC), 직장결장암, 난소암, 신장암, 췌장암, 육종, 비-호지킨 림프종(NHL), 다발골수종, 유방암, 전립선암, 흑색종, 교모세포종, 신경모세포종을 포함한다.Indications of the trimeric complex with both IL-23R-binding polypeptide (s) and TAA or TSA targeting agent (s) include non-small cell lung cancer (NSCLC), colorectal cancer, ovarian cancer, kidney cancer, pancreatic cancer, sarcoma, non- Hodgkin's lymphoma (NHL), multiple myeloma, breast cancer, prostate cancer, melanoma, glioblastoma, neuroblastoma.

또 다른 양태에서, IL-23 수용체에 특이적으로 결합하는 폴리펩티드가 CTLD의 루프 영역 내에 함유된다. 폴리펩티드는 IL-23 폴리펩티드의 일부일 수 있거나, 본원에 제공된 바와 같이 확인되는 서열일 수 있다. 이러한 양태에서, 상기 서열은 CTLD의 루프 영역에 함유되고, CTLD는 직접적으로 또는 적절한 링커를 통해 도메인의 N-말단 또는 C-말단에서 삼합체화 도메인에 융합된다. 또한, 본 발명의 폴리펩티드는 N-말단 및 C-말단의 나머지에 융합된 제 2의 CTLD 도메인을 포함할 수 있고, 상기 CTLD의 서열 및/또는 IL-23R에 대한 이의 친화성은 동일하거나 상이할 수 있다. 본 양태의 변형에서, 상기 폴리펩티드는 삼합체화 도메인의 말단 중 하나에 IL-23R에 결합하는 폴리펩티드, 및 말단의 나머지에 CTLD를 포함한다. 폴리펩티드 중 1, 2 또는 3개는 IL-23R에 대한 6개 이하의 특이적 결합 일원을 함유하는 삼합체 복합체의 일부일 수 있다.In another embodiment, a polypeptide that specifically binds to an IL-23 receptor is contained within the loop region of the CTLD. The polypeptide may be part of an IL-23 polypeptide or may be a sequence identified as provided herein. In this embodiment, the sequence is contained in the loop region of the CTLD and the CTLD is fused to the trimerization domain either at the N-terminus or C-terminus of the domain either directly or via an appropriate linker. In addition, the polypeptide of the invention may comprise a second CTLD domain fused to the N-terminus and the rest of the C-terminus, and the sequence of said CTLD and / or its affinity for IL-23R may be the same or different. have. In a variation of this embodiment, the polypeptide comprises a polypeptide that binds IL-23R at one of the termini of the trimerization domain, and a CTLD at the remainder of the terminus. One, two or three of the polypeptides may be part of a trimer complex containing up to six specific binding members for IL-23R.

IL-23R에 결합하는 폴리펩티드 서열은 천연 IL-23이 IL-23R에 대해 갖는 결합 친화성과 대략 동등한 IL-23R에 대한 결합 친화성을 가질 수 있다. 특정 구체예에서, 본 발명의 폴리펩티드는 천연 IL-23이 상기와 동일한 IL-23R에 대해 갖는 결합 친화성보다 크거나 적은 IL-23R에 대한 결합 친화성을 갖는다.A polypeptide sequence that binds IL-23R may have a binding affinity for IL-23R that is approximately equivalent to the binding affinity that native IL-23 has to IL-23R. In certain embodiments, polypeptides of the invention have a binding affinity for IL-23R that is greater or less than the binding affinity that native IL-23 has to the same IL-23R as above.

본 발명의 폴리펩티드는 IL-23R에 대한 선택적 결합 친화성을 가지지만, IL-12Rβ1 또는 IL-12Rβ2에는 선택적 결합 친화성을 갖지 않는 자연 IL-23(p19) 서열, 또는 무작위 서열 내에 하나 이상의 아미노산 돌연변이를 포함할 수 있다. 예를 들어, IL-23R에 대한 상기 결합 일원의 결합 친화성이 자연 IL-23과 비교하여 대략 동등(변화되지 않음)하거나 더 크고(증가된), IL-12Rβ1 또는 IL-12Rβ2에 대한 상기 결합 일원의 결합 친화성이 자연 서열 IL-23과 비교하여 더 작거나 거의 없어진 경우, 본원에서의 목적상, 상기 결합 일원의 결합 친화성은 IL-23R에 대해 "선택적"인 것으로 간주된다. 또 다른 예에서, IL-23R에 대한 결합 일원의 친화성은 수용체에 대한 IL-23의 친화성보다 적으나, 결합 일원이 IL-12Rβ1 또는 IL-12Rβ2에 대한 친화성보다 IL-23R에 대해 더 큰 친화성을 갖는 경우 상기 결합 일원은 여전히 수용체에 대해 선택적이다. 본 발명의 바람직한 IL-23R 선택적 길항제는 IL-12Rβ1 또는 IL-12Rβ2와 비교하여 IL-23R에 대해 적어도 5배, 바람직하게는 적어도 10배 더 큰 결합 친화성을 가질 것이고, 더욱 더 바람직하게는, IL-12Rβ1 또는 IL-12Rβ2와 비교하여 IL-23R에 대해 적어도 100배 더 큰 결합 친화성을 가질 것이다.One or more amino acid mutations in a native IL-23 (p19) sequence, or random sequence, having a selective binding affinity for IL-23R, but no selective binding affinity for IL-12Rβ1 or IL-12Rβ2. It may include. For example, the binding affinity of the binding member to IL-23R is approximately equivalent (unchanged) or greater (increased) compared to native IL-23, and the binding to IL-12Rβ1 or IL-12Rβ2 Where a member's binding affinity is smaller or nearly lost compared to the native sequence IL-23, for purposes herein, the binding affinity of the member is considered to be "selective" for IL-23R. In another example, the affinity of the binding member for IL-23R is less than the affinity of IL-23 for the receptor, but the binding member is greater for IL-23R than the affinity for IL-12Rβ1 or IL-12Rβ2. The binding member is still selective for the receptor if it has affinity. Preferred IL-23R selective antagonists of the present invention will have a binding affinity of at least 5 times, preferably at least 10 times greater for IL-23R compared to IL-12Rβ1 or IL-12Rβ2, even more preferably, Will have at least 100-fold greater binding affinity for IL-23R compared to IL-12Rβ1 or IL-12Rβ2.

길항제의 각각의 결합 친화성은 당 분야에 공지된 ELISA, RIA, 및/또는 BIAcore 검정에 의해 자연 IL-23 또는 이의 일부의 결합 특성에 의해 비교되고, 결정될 수 있다. 본 발명의 바람직한 IL-23R 선택적 길항제는 포유동물 세포의 적어도 하나의 유형에서 IL-12 신호전달을 억제하지 않을 것이며, 이러한 신호전달 억제는 당 분야에 공지된 방법, 예를 들어, ELISA에 의해 결정될 수 있다.Each binding affinity of the antagonist can be compared and determined by the binding properties of native IL-23 or a portion thereof by ELISA, RIA, and / or BIAcore assays known in the art. Preferred IL-23R selective antagonists of the invention will not inhibit IL-12 signaling in at least one type of mammalian cell, and such signaling inhibition may be determined by methods known in the art, such as ELISA. Can be.

일 구체예에서, IL-23R 길항제는 항체 또는 항체 단편을 포함한다. 본 발명의 문맥에서, 용어 "항체"는 자연이거나 부분적 또는 전체적으로 합성적으로 생성되는지의 여부와 상관없이 면역글로불린을 기재하는데 사용된다. 항체는 다수의 방식으로 변형될 수 있으므로, 용어 "항체"는 필요한 수용체 특이성을 갖는 결합 도메인을 갖는 임의의 특이적 결합 일원 또는 물질을 포함하는 것으로 간주되어야 한다. 따라서, 상기 용어는 자연이거나 전체적 또는 부분적으로 합성인지의 여부에 상관없이 면역글로불린 결합 도메인을 포함하는 임의의 폴리펩티드를 포함하여 항체 단편, 항체의 유도체, 기능적 동등물 및 동족체를 포함한다. 따라서, 또 다른 폴리펩티드에 융합된 면역글로불린 결합 도메인 또는 동등물을 포함하는 키메라 분자가 포함된다. 상기 용어는 또한 항체 결합 도메인이거나, 항체 결합 도메인과 상동성인 결합 도메인을 갖는 임의의 폴리펩티드 또는 단백질, 예를 들어, 항체 모방체(mimic)를 포함한다. 이들은 자연원으로부터 유래될 수 있거나, 부분적 또는 전체적으로 합성적으로 생성될 수 있다. 항체의 예는 면역글로불린 아이소형 및 이의 아이소형 서브클래스; 항원 결합 도메인을 포함하는 단편, 예를 들어, Fab, Fab', F(ab')2, scFv, Fv, dAb, Fd; 및 디아바디(diabody)이다.In one embodiment, the IL-23R antagonist comprises an antibody or antibody fragment. In the context of the present invention, the term “antibody” is used to describe an immunoglobulin whether it is natural or partially or wholly synthetically produced. As antibodies can be modified in many ways, the term “antibody” should be considered to include any specific binding member or substance having a binding domain with the necessary receptor specificity. Thus, the term includes antibody fragments, derivatives, functional equivalents and homologues of any antibody, including any polypeptide comprising an immunoglobulin binding domain, whether natural or wholly or partially synthetic. Thus, chimeric molecules comprising immunoglobulin binding domains or equivalents fused to another polypeptide are included. The term also includes any polypeptide or protein, eg, antibody mimic, that is an antibody binding domain or has a binding domain homologous to an antibody binding domain. They may be derived from natural sources or may be produced synthetically in part or in whole. Examples of antibodies include immunoglobulin isotypes and isotype subclasses thereof; Fragments comprising antigen binding domains such as Fab, Fab ', F (ab') 2 , scFv, Fv, dAb, Fd; And diabody.

또 다른 양태에서, 본 발명은 3개의 폴리펩티드의 다합체 복합체에 관한 것으로, 상기 폴리펩티드 각각은 다합체화 도메인 및 IL-23R에 결합하는 적어도 하나의 폴리펩티드를 포함한다. 일 구체예에서, 다합체 복합체는 인간 테트라넥틴 삼합체화 구조 구성요소에 대해 실질적으로 상동성을 갖는 폴리펩티드, 또는 만노오스 결합 단백질(MBP) 삼합체화 도메인을 포함하는 다른 인간 삼합체화 폴리펩티드, 콜렉틴 넥 영역 폴리펩티드, 및 기타 폴리펩티드로부터 선택된 다합체화 도메인을 갖는 폴리펩티드를 포함한다. 다합체 복합체는 본 발명의 폴리펩티드 중 임의의 폴리펩티드로 구성될 수 있고, 상기 다합체 복합체의 폴리펩티드는 서로 회합되어 다합체를 형성할 수 있는 다합체화 도메인을 포함한다. 따라서, 일부 구체예에서, 다합체 복합체는 동일한 아미노산 서열을 갖는 폴리펩티드로 구성된 동종다합체 복합체이다. 다른 구체예에서, 다합체 복합체는 상이한 아미노산 서열, 예를 들어, 상이한 다합체화 도메인을 갖는 폴리펩티드, 및/또는 IL-23R에 결합하는 상이한 폴리펩티드로 구성된 이종다합체 복합체이다. 또한, 이종다합체 복합체는 치료제 및 IL-23R 길항제를 포함할 수 있다.In another aspect, the invention relates to a multimer complex of three polypeptides, each comprising at least one polypeptide that binds to a multimerization domain and IL-23R. In one embodiment, the multimer complex is a polypeptide substantially homologous to a human tetranectin trimer structural component, or other human trimerized polypeptide, lectin neck region comprising a mannose binding protein (MBP) trimer domain. Polypeptides having a multimerization domain selected from polypeptides, and other polypeptides. The multimer complex may be composed of any of the polypeptides of the invention, wherein the polypeptides of the multimer complex comprise multimerization domains that can be associated with one another to form a multimer. Thus, in some embodiments, the multimer complex is a homomultimer complex consisting of polypeptides having the same amino acid sequence. In other embodiments, the multimer complex is a heteromultimer complex consisting of different amino acid sequences, eg, polypeptides having different multimerization domains, and / or different polypeptides that bind IL-23R. In addition, the heteromultimer complex may comprise a therapeutic agent and an IL-23R antagonist.

추가로, 일 양태에서, 본 발명은 IL-23R을 발현하는 세포에서 IL-23R의 활성화를 방지하는 폴리펩티드를 제조하는 방법에 관한 것이다. 상기 방법은 (a) IL-23R에 특이적으로 결합하는 제 1 폴리펩티드(들)을 선택하는 단계; (b) 상기 제 1 폴리펩티드(들)을 테트라넥틴 CTLD의 1 또는 2개의 루프 영역에 이식하여 제 1 결합 결정기를 형성시키거나, 상기 폴리펩티드를 테트라넥틴 삼합체화 도메인에 직접 융합시키는 단계; 및 (c) 상기 제 1 CTLD와 테트라넥틴 삼합체화 도메인의 N-말단 또는 C-말단 중 하나를 융합시키는 단계를 포함한다. 상기 방법의 일 특정 구체예에서, IL-23R에 결합하는 폴리펩티드는 IL-12Rβ1 또는 IL-12Rβ2에 결합하지 않는다.Further, in one aspect, the present invention relates to a method of preparing a polypeptide for preventing the activation of IL-23R in a cell expressing IL-23R. The method comprises (a) selecting a first polypeptide (s) that specifically binds IL-23R; (b) implanting the first polypeptide (s) into one or two loop regions of tetranectin CTLD to form a first binding determinant, or fusion of the polypeptide directly to a tetranectin trimerization domain; And (c) fusing either the N-terminus or C-terminus of the first CTLD with the tetranectin trimerization domain. In one specific embodiment of the method, the polypeptide that binds IL-23R does not bind IL-12Rβ1 or IL-12Rβ2.

테트라넥틴 CTLD는 본원에 기재된 바와 같은 무작위화된 라이브러리로부터의 선택에 의해 IL-23R에 대한 결합 일원이 삽입되거나 확인될 수 있는 5개 이하의 루프 영역을 갖는다. 따라서, 본 발명의 폴리펩티드가 CTLD를 포함하는 경우, 폴리펩티드는 CTLD를 통해 삼합체화 도메인에 부착된 IL-23R에 대한 5개 이하의 결합 일원을 가질 수 있다. 결합 일원 각각은 동일하거나 상이할 수 있다.Tetranectin CTLDs have up to 5 loop regions in which binding members for IL-23R can be inserted or identified by selection from a randomized library as described herein. Thus, if the polypeptide of the invention comprises CTLD, the polypeptide may have up to 5 binding members for IL-23R attached to the trimerization domain via CTLD. Each of the bonding members may be the same or different.

본 발명의 폴리펩티드의 다른 양태에서, 수용체 길항제는 삼합체화 도메인의 한 말단에 결합될 수 있고, 하나 이상의 치료제는 제 2의 말단에 결합될 수 있다. 상기 작용제는 직접 결합되거나 당업자에게 이해되는 바와 같은 적절한 링커를 통해 결합될 수 있다. 이러한 작용제는 길항제와 동일한 경로에서 작용할 수 있거나, 면역 장애, 암 및 다른 질환에 대해 상이한 경로에서 작용할 수 있다. 폴리펩티드의 말단 중 하나에 결합되는 것 외에, 상기 작용제는 삼합체화 도메인 내의 측쇄에 결합된 펩티드를 통하거나, 시스테인 잔기로의 결합을 통해 삼합체화 도메인에 공유적으로 연결될 수 있다. 예를 들어, 본원에 참조로서 포함되는 US 6,190,886호에 제시된 바와 같은, 작용제를 모듈에 공유적으로 커플링시키는 다른 방법이 또한 이용될 수 있다.In another embodiment of the polypeptides of the invention, the receptor antagonist may be bound to one end of the trimerization domain and one or more therapeutic agents may be bound to the second end. The agent can be linked directly or via a suitable linker as would be understood by one skilled in the art. Such agents may act in the same pathway as antagonists or in different pathways for immune disorders, cancer and other diseases. In addition to binding to one of the ends of the polypeptide, the agent may be covalently linked to the trimerization domain via a peptide bound to the side chain in the trimerization domain or through binding to a cysteine residue. Other methods of covalently coupling an agent to a module can also be used, for example, as set forth in US Pat. No. 6,190,886, which is incorporated herein by reference.

IL-23R에 특이적인 폴리펩티드 서열의 확인Identification of Polypeptide Sequences Specific to IL-23R

일 양태에서, IL-23R에 대한 특이적 결합 일원은 수용체에 특이적으로 결합하는 폴리펩티드의 무작위 라이브러리의 일원의 선택에 의해 상기 폴리펩티드의 무작위 라이브러리로부터 수득될 수 있다. 추정 리간드 결합 부위를 갖는 표현형을 제시하는 다수의 시스템이 공지되어 있다. 이는 파지 디스플레이(예를 들어, 섬유상 파지 fd [Dunn (1996), Griffiths and Duncan (1998), Marks et al. (1992)], 파지 람다 [Mikawa et al. (1996)]), 진핵세포 바이러스 상에서의 디스플레이(예를 들어, 배큘로바이러스 [Ernst et al. (2000)]), 세포 디스플레이(예를 들어, 박테리아 세포 [Benhar et al. (2000)], 효모 세포 [Boder and Wittrup (1997)], 및 포유동물 세포 [Whitehorn et al. (1995)] 상에서의 디스플레이), 리보솜 연관 디스플레이(ribosome linked display) [Schaffitzel et al. (1999)], 및 파지미드 연관 디스플레이(plasmid linked display) [Gates et al. (1996)]를 포함한다.In one aspect, specific binding members for IL-23R can be obtained from a random library of polypeptides by selection of members of a random library of polypeptides that specifically bind to a receptor. Many systems are known which present phenotypes with putative ligand binding sites. This is shown on phage display (eg, fibrous phage fd [Dunn (1996), Griffiths and Duncan (1998), Marks et al. (1992)), phage lambda [Mikawa et al. (1996)), eukaryotic virus viruses. Display (eg, baculovirus [Ernst et al. (2000)]), cell display (eg, bacterial cells [Benhar et al. (2000)], yeast cells [Boder and Wittrup (1997)] , And displays on mammalian cells (Whitehorn et al. (1995)), ribosome linked displays (Schaffitzel et al. (1999), and phagemid linked displays [Gates et al. (1996).

또한, 전체내용이 참조로서 본원에 포함되는 US2007/0275393호에는 CTLD 라이브러리의 생성을 위한 디스플레이 시스템을 달성하는 절차가 특별히 기재되어 있다. 전반적 절차는 (1) 이용가능한 경우 선택 CTLD의 3D 구조를 참조함에 의한 루프-영역의 위치의 확인, 또는 상기 정보가 이용가능하지 않은 경우, 상기 개시된 바와 같은 β2 및 β3 컨센서스 서열 구성요소 및 β4-가닥 특징에 해당하는 서열 구성요소의 확인에 의한 추가 확증에 의해 보조되는, 공지된 서열과의 서열 정렬에 의한 β2, β3 및 β4 가닥의 서열 위치의 확인; (2) β2, β3 및 β4를 엔코딩하는 서열에 가까운 엔도누클레아제 제한 부위의 사전 삽입을 갖거나, 이러한 사전 삽입이 없는 단백질 디스플레이 벡터 시스템에 선택 CTLD를 엔코딩하는 핵산 단편의 서브클로닝; 및 (3) 선택 CTLD의 루프-영역 일부 또는 전부를 엔코딩하는 핵산 단편의, 수용 프레임워크를 엔코딩하는 핵산 환경으로의 삽입 후에 본래의 루프-영역 폴리펩티드 단편을 엔코딩하는 핵산 단편을 무작위적으로 선택된 핵산 단편으로 치환시키는 다수의 핵산 단편으로 구성된 총체의 무작위적으로 선택된 일원으로의 치환을 포함한다. 본래의 루프-세그먼트 또는 전체 루프-영역을 대체하는 신규한 폴리펩티드를 엔코딩하는 클로닝된 핵산 단편 각각은 이의 신규한 서열 상황 내에서 결정된 해독틀 내에서 디코딩(decoding)될 것이다.In addition, US2007 / 0275393, which is hereby incorporated by reference in its entirety, specifically describes the procedure for achieving a display system for the generation of CTLD libraries. The overall procedure is to (1) identify the location of the loop-region by referring to the 3D structure of the selected CTLD, if available, or the β2 and β3 consensus sequence components as disclosed above and β4-, if the information is not available. Identification of the sequence position of the β2, β3 and β4 strands by sequence alignment with known sequences, aided by further confirmation by identification of sequence elements corresponding to the strand feature; (2) subcloning of nucleic acid fragments encoding select CTLD into a protein display vector system with or without prior insertion of an endonuclease restriction site close to the sequences encoding β2, β3 and β4; And (3) a nucleic acid fragment that is randomly selected from a nucleic acid fragment encoding a portion or all of the loop-region of the selected CTLD, the nucleic acid fragment encoding the original loop-region polypeptide fragment after insertion into a nucleic acid environment encoding the receptive framework. Substitution of a randomly selected member of a total of a plurality of nucleic acid fragments for substitution by fragments. Each cloned nucleic acid fragment encoding a novel polypeptide that replaces the original loop-segment or the entire loop-region will be decoded within a translation frame determined within its novel sequence context.

IL-23R 결합 및 활성화로부터 자연 IL-23을 차단하는 동종삼합체 단백질로 작용하는 복합체가 형성될 수 있다. 그러나, IL-23R 결합 활성을 갖는 펩티드가 먼저 확인되어야 한다. 이를 달성하기 위해, 공지된 결합 활성을 갖는 펩티드가 사용될 수 있거나, 디스플레이 라이브러로부터의 스크리닝에 의해 확인된 추가의 신규한 펩티드가 사용될 수 있다. 비제한적인 예로, 파지, 리보솜 및 효모 디스플레이와 같은 다수의 다양한 디스플레이 시스템이 이용가능하다.Complexes that act as homotrimeric proteins that block native IL-23 from IL-23R binding and activation can be formed. However, peptides with IL-23R binding activity must first be identified. To achieve this, peptides with known binding activity can be used, or additional novel peptides identified by screening from the display library can be used. By way of non-limiting example, many different display systems are available, such as phage, ribosomal and yeast displays.

결합 활성을 갖는 신규한 펩티드를 선택하기 위해, 라이브러리가 작제될 수 있고, 먼저 단일 단량체 CTLD 도메인, 또는 파지의 표면 상에서 디스플레이된 개별적 펩티드로서 IL-23R에 대한 결합에 대해 스크리닝될 수 있다. IL-23R 결합 활성을 갖는 서열이 확인되면, 이러한 서열은 이후 인간 테트라넥틴의 삼합체화 도메인에 이식되어, IL-23R에 결합할 수 있는 잠재적 단백질 치료제가 생성될 것이다.To select novel peptides with binding activity, libraries can be constructed and first screened for binding to IL-23R as individual peptides displayed on the surface of a single monomer CTLD domain, or phage. If a sequence with IL-23R binding activity is identified, this sequence will then be transplanted into the trimerization domain of human tetranectin, resulting in a potential protein therapeutic that can bind IL-23R.

상기 파지 디스플레이 라이브러리 및 삼합체화 도메인 작제물의 작제에 4개의 주요 방법이 이용될 수 있다. 첫번째 방법은 무작위 펩티드 파지 디스플레이 라이브러리를 작제하고/하거나 이용하는 것이다. 이황화물 한정 루프로서 작제된 무작위 선형 펩티드 및/또는 무작위 펩티드는 파지 입자의 표면 상에 개별적으로 디스플레이되고, 파지 디스플레이 "패닝(panning)"을 통해 요망되는 IL-23R으로의 결합에 대해 선택된다. IL-23R 결합 활성을 갖는 펩티드 클론을 수득한 후, 이러한 펩티드는 인간 테트라넥틴의 삼합체화 도메인 또는 CTLD 도메인의 루프로 이식된 후, 삼합체화 도메인 상에 이식되고, 길항제 활성에 대해 스크리닝된다.Four main methods can be used to construct the phage display library and trimerized domain constructs. The first method is to construct and / or use random peptide phage display libraries. Random linear peptides and / or random peptides constructed as disulfide confinement loops are individually displayed on the surface of phage particles and selected for binding to the desired IL-23R via phage display “panning”. After obtaining peptide clones with IL-23R binding activity, these peptides are implanted into the trimerization domain or loop of CTLD domain of human tetranectin, and then onto the trimerization domain and screened for antagonist activity.

파지 디스플레이 라이브러리 및 삼합체화 도메인 작제물의 작제를 위한 두번째 방법은 CTLD 유래 결합체를 수득하는 것을 포함한다. 라이브러리는 파지 표면 상에 디스플레이된 인간 테트라넥틴의 CTLD 스캐폴드 내의 5개의 상이한 루프 중 하나 이상의 아미노산을 무작위화시킴으로써 작제될 수 있다. IL-23R에 대한 결합은 파지 디스플레이 패닝을 통해 선택될 수 있다. IL-23R 결합 활성을 나타내는 펩티드 루프를 갖는 CTLD 클론을 수득한 후, 이러한 CTLD 클론은 인간 테트라넥틴의 삼합체화 도메인에 이식될 수 있고, 길항제 활성에 대해 스크리닝될 수 있다.A second method for the construction of phage display libraries and trimerized domain constructs involves obtaining CTLD derived conjugates. Libraries can be constructed by randomizing one or more amino acids of five different loops in the CTLD scaffold of human tetranectin displayed on the phage surface. Binding to IL-23R can be selected via phage display panning. After obtaining CTLD clones with peptide loops exhibiting IL-23R binding activity, such CTLD clones can be transplanted into the trimerized domain of human tetranectin and screened for antagonist activity.

파지 디스플레이 라이브러리 및 삼합체화 도메인 작제물의 작제를 위한 세번째 방법은 IL-23R에 대한 결합 능력을 갖는 공지된 서열을 취하고, 이를 인간 테트라넥틴의 삼합체화 도메인에 직접 이식하고, 결합 활성에 대해 스크리닝하는 것을 포함한다.A third method for the construction of phage display libraries and trimerization domain constructs takes known sequences with binding capacity to IL-23R, transplants them directly into the trimerization domain of human tetranectin, and screens for binding activity It includes.

네번째 방법은 IL-23R에 대한 공지된 결합 능력을 갖는 펩티드 서열을 이용하는 것을 포함하며, 이러한 펩티드 서열은 우선 펩티드 측면에 존재하는 무작위화된 아미노산 및/또는 펩티드 내의 무작위화된 선택된 내부 아미노산을 갖는 신규한 라이브러리를 생성시킨 후, 파지 디스플레이를 통해 개선된 결합에 대해 선택함으로써 결합을 개선시킨다. 개선된 친화성을 갖는 결합체를 수득한 후, 이러한 펩티드의 결합체는 인간 테트라넥틴의 삼합체화 도메인에 이식될 수 있고, 길항제 효능에 대해 스크리닝될 수 있다. 이러한 방법에서, 최초 라이브러리는 첫번째 방법(상기)에서와 같이 파지 입자의 표면 상에 디스플레이된 자유 펩티드, 또는 상기 논의된 두번째 방법에서와 같이 CTLD 스캐폴드 내의 한정된 루프로 작제될 수 있다. 개선된 친화성을 갖는 결합체를 수득한 후, 이러한 펩티드의 인간 테트라넥틴의 삼합체화 도메인으로의 이식 및 길항제 활성에 대한 스크리닝이 발생한다.The fourth method involves using a peptide sequence with known binding capacity to IL-23R, which peptide sequence is first novel with randomized amino acids present on the peptide side and / or randomized selected internal amino acids within the peptide. After creating a library, binding is improved by selecting for improved binding via phage display. After obtaining a conjugate with improved affinity, the conjugate of this peptide can be implanted into the trimerization domain of human tetranectin and screened for antagonist efficacy. In this method, the original library can be constructed with free peptide displayed on the surface of the phage particles as in the first method (above), or with defined loops in the CTLD scaffold as in the second method discussed above. After obtaining the conjugates with improved affinity, screening for the antagonist activity and transplantation of such peptides into the trimerization domain of human tetranectin occurs.

N-말단에서 16개 이하의 잔기를 제거(V17)하거나, C-말단을 변경한 삼합체화 도메인의 형태가 사용될 수 있다. Trip V [SEQ ID NO:60], TripT [SEQ ID NO:61], TripQ [SEQ ID NO:62] 및 TripK [SEQ ID NO:59](도 2 참조)로 명명된 C-말단 변이체는 삼합체화 도메인 상의 CTLD 도메인의 독특한 제시를 가능케 한다. TripV, TripT, TripQ는 임의의 구조적 유연성이 없는 삼합체화 모듈로의 직접적인 CTLD 분자의 융합체이나, 이는 TripV로부터 TripT, 및 TripT로부터 TripQ까지 CTLD 분자 1/3이 회전되어 있다. 이는 상기 아미노산 각각이 α-헬리칼 턴으로 존재하고, 3.2 aa가 완전한 회전에 필요하다는 사실에 기인한다. 첫번째, 세번째 및 네번째 방법에서 결합에 대해 선택된 자유 펩티드는 삼합체화 도메인의 상기 형태 중 임의의 형태로 이식될 수 있다. 이후, 생성된 융합체는 펩티드 및 배향의 조합이 최적의 활성을 제공하는지 확인하기 위해 스크리닝될 수 있다. 네트라넥틴의 CTLD의 루프 내에 한정된 결합에 대해 선택된 펩티드는 전장 삼합체화 도메인에 이식될 수 있다.Forms of trimerization domains with up to 16 residues removed at the N-terminus (V17) or with altered C-terminus can be used. C-terminal variants named Trip V [SEQ ID NO: 60], TripT [SEQ ID NO: 61], TripQ [SEQ ID NO: 62], and TripK [SEQ ID NO: 59] (see FIG. 2) are trimmed. Enables unique presentation of CTLD domains on embodied domains. TripV, TripT, TripQ are fusions of CTLD molecules directly into the trimerization module without any structural flexibility, but the CTLD molecules 1/3 rotated from TripV to TripT and TripT to TripQ. This is due to the fact that each of these amino acids is present in α-helical turn and 3.2 aa is required for complete rotation. The free peptide selected for binding in the first, third and fourth methods can be grafted into any of the above forms of the trimerization domain. The resulting fusions can then be screened to ensure that the combination of peptide and orientation provides for optimal activity. Peptides selected for defined binding within the loop of CTLD of netranectin can be implanted into the full length trimerization domain.

더욱 특히, 4개의 방법이 하기에 기재된다. 이러한 방법은 파지 디스플레이에 초점을 두고 있으나, 폴리펩티드를 확인하는 다른 동등한 방법이 이용될 수 있다.More particularly, four methods are described below. While this method focuses on phage display, other equivalent methods of identifying polypeptides can be used.

방법 1Method 1

비제한적인 예로 New England Biolabs Ph.D. Phage display Peptide Library Kit와 같은 펩티드 디스플레이 라이브러리 키트가 시판되며, IL-23R 결합 활성을 갖는 새롭고 신규한 펩티드의 선택에 사용하기 위해 구입될 수 있다. 다음과 같은 3개 형태의 New England Biolabs 키트가 이용가능하다: 2.8x109개의 독립적 클론의 라이브러리 크기를 갖는, 7개 아미노산 길이의 선형 무작위 펩티드를 함유하는 Ph.D.-7 Peptide Library Kit, 7개 아미노산 길이의 무작위 펩티드 및 1.2x109개의 독립적 클론의 라이브러리 크기를 갖는, 이황화물 한정된 루프로서 작제된 펩티드를 함유하는 Ph.D.-C7C Disulfide Constrained Peptide Library Kit, 및 2.8x109개의 독립적 클론의 라이브러리 크기를 갖는, 12개 아미노산 길이의 선형 무작위 펩티드를 함유하는 Ph.D.-12 Peptide Library Kit.Non-limiting examples include New England Biolabs Ph.D. Peptide display library kits, such as the Phage display Peptide Library Kit, are commercially available and can be purchased for use in the selection of new and novel peptides having IL-23R binding activity. Three types of New England Biolabs kits are available: Ph.D.-7 Peptide Library Kit, 7 containing a linear random peptide of 7 amino acids length, with a library size of 2.8x10 9 independent clones. dog Ph.D.-C7C disulfide Constrained peptide library Kit , and 2.8x10 9 independent clones containing the construct a peptide having a size of the random peptide library and 1.2x10 9 independent clones of the amino acids in length, as defined disulfide loop Ph.D.-12 Peptide Library Kit containing a linear random peptide of 12 amino acids length with a library size.

대안적으로, 상기 키트와 유사한 무작위 아미노산을 함유하는 펩티드를 이용한 유사한 라이브러리가 새로이 작제될 수 있다. 작제를 위해, NNK, 또는 NNS 방법을 이용하여 무작위 누클레오티드가 생성되며, 여기서 N은 4개의 핵산 염기 A, C, G 및 T의 동등한 혼합물이다. K는 G 또는 T의 동등한 혼합물이고, S는 G 또는 C의 동등한 혼합물이다. 이러한 무작위화된 위치는 파지 또는 파지미드 디스플레이 벡터 시스템 내의 유전자 Ⅲ 단백질로 클로닝될 수 있다. NNK 및 NNS 방법 둘 모두는 20개 모두의 가능한 아미노산, 및 엔코딩된 아미노산에 대해 약간 상이한 빈도의 하나의 정지 코돈을 포함한다. 박테리아 형질전환 효율의 제한으로 인해, 파지 디스플레이에 대해 생성된 라이브러리 크기는 대개 상기 언급된 정도이며, 따라서 7개 이하의 무작위화된 아미노산 위치를 함유하는 펩티드가 생성될 수 있고, 이는 또한 이론적 조합의 전체 레퍼토리(207=1.28x109)를 포함한다. NNK 또는 NNS 방법을 이용하여 보다 긴 펩티드 라이브러리가 작제될 수 있으나, 실제 파지 디스플레이 라이브러리 크기는 박테리아 형질전환에 대한 필요조건으로 인해 상기 길이와 관련하여 가능한 이론적 아미노산 조합을 모두 포함하지 않을 가능성이 크다.Alternatively, a similar library can be constructed with peptides containing random amino acids similar to the kit. For construction, random nucleotides are generated using the NNK, or NNS method, where N is an equivalent mixture of four nucleic acid bases A, C, G and T. K is an equivalent mixture of G or T and S is an equivalent mixture of G or C. Such randomized positions can be cloned into gene III proteins in phage or phagemid display vector systems. Both NNK and NNS methods include all 20 possible amino acids and one stop codon at slightly different frequencies for the encoded amino acids. Due to the limitation of bacterial transformation efficiency, the library size generated for phage display is usually to the extent mentioned above, so that peptides containing up to seven randomized amino acid positions can be produced, which is also a theoretical combination of It includes the entire repertoire (20 7 = 1.28x10 9 ). Longer peptide libraries can be constructed using NNK or NNS methods, but the actual phage display library size is likely to not include all possible theoretical amino acid combinations with respect to this length due to the requirement for bacterial transformation.

따라서, 보다 크고/긴 무작위 펩티드가 수반되는 리보솜 디스플레이 라이브러리가 이로울 수 있다. 이황화물 한정된 라이브러리에 대해, 유사한 NNK 또는 NNS 무작위 누클레오티드 방법이 이용된다. 그러나, 이러한 무작위 위치는 시스테인 아미노산 잔기 측면에 존재하여, 이황화 브릿지 형성을 가능케 한다. N 말단 시스테인은 종종 알라닌과 같은 추가적인 아미노산이 선행한다. 또한, 여러 글리신 잔기로 구성되나 이에 제한되지는 않는 유연한 링커가 상기 무작위 펩티드 라이브러리 중 임의의 무작위 펩티드 라이브러리에 대해 펩티드와 유전자 Ⅲ 단백질 사이에서 스페이서(spacer)로 작용할 수 있다.Thus, ribosomal display libraries involving larger / longer random peptides may be beneficial. For disulfide defined libraries, similar NNK or NNS random nucleotide methods are used. However, these random positions are flanked by cysteine amino acid residues, allowing disulfide bridge formation. N-terminal cysteines are often preceded by additional amino acids such as alanine. In addition, a flexible linker consisting of, but not limited to, several glycine residues can serve as a spacer between the peptide and the gene III protein for any of the random peptide libraries.

방법 2Method 2

도 4 및 5에 제시된 인간 테트라넥틴 CTLD는 5개의 루프(LSA 내 4개의 루프 및 LSB를 포함하는 하나의 루프)를 함유하며, 이는 변경되어 다양한 단백질 표적에 대한 CTLD의 결합을 제공할 수 있다. 무작위 아미노산 서열은 상기 루프 중 하나 이상에 위치되어, 요망되는 결합 특성을 갖는 CTLD 도메인이 선택될 수 있는 라이브러리가 생성될 수 있다. 인간 테트라넥틴 CTLD의 5개의 루프 중 임의의 루프 또는 모든 루프 내에 한정된 무작위 펩티드를 함유하는 상기 라이브러리의 작제는 상기 방법 1에서 기재된 NNK 또는 NNS를 이용하여 달성(이에 제한되지는 않음)될 수 있다. 7개의 무작위 펩티드가 TN CTLD의 루프 1에 삽입될 수 있는 방법의 하나의 예는 다음과 같다.The human tetranectin CTLDs shown in FIGS. 4 and 5 contain five loops (one loop including four loops in the LSA and one LSB), which can be altered to provide binding of CTLD to various protein targets. Random amino acid sequences may be located in one or more of these loops to generate a library from which CTLD domains with the desired binding properties can be selected. Construction of the library containing random peptides defined within any or all of the five loops of human tetranectin CTLD can be achieved using, but not limited to, NNK or NNS described in Method 1 above. One example of how seven random peptides can be inserted into loop 1 of TN CTLD is as follows.

PCR은 단편 A를 생성시키기 위해 주형이 없는 PCR 반응에서 프라이머 1X for(SEQ ID NO:) 및 1X rev2(SEQ ID NO:)를 이용하여 달성될 수 있고, 단편 B를 생성시키기 위해 주형이 없는 별개의 PCR 반응에서 프라이머 BstX1for(SEQ ID NO:) 및 PstBssRevC(SEQ ID NO:)가 사용될 수 있다. PCR은 고 정확도 중합효소 또는 taq 블렌드 및 표준 PCR 써모사이클링(thermocycling) 조건을 이용하여 수행될 수 있다. 이후, 이러한 2개의 중첩 단편은 정제될 수 있고, PCR에 의해 요망되는 DNA 단편을 생성시키기 위해 외부 프라이머 Bglfor12(SEQ ID NO:5) 및 PstRev(SEQ ID NO:6)와 함께 사용될 수 있다. 제한 효소 BglⅡ 및 PstⅠ, 또는 다른 프라이머를 이용하는 경우 다른 적절한 제한 효소를 이용한 절단은 TN CTLD의 루프 또는 이의 일부를 함유하는 단편의 겔 분리를 가능케 한다. 이후, 이러한 정제된 단편은 유전자 Ⅲ에 융합된 제한 변형된 CTLD를 함유하는 유사하게 절단된 파지 디스플레이 벡터, 예를 들어, pPHCPAB(SEQ ID NO:150) 또는 pANA27(SEQ ID NO:164)로 라이게시션될 수 있다(도 6 참조).PCR can be accomplished using primers 1X for (SEQ ID NO :) and 1X rev2 (SEQ ID NO :) in a PCR reaction without template to generate fragment A, and separate without template to generate fragment B In the PCR reaction, primers BstX1for (SEQ ID NO :) and PstBssRevC (SEQ ID NO :) can be used. PCR can be performed using high accuracy polymerase or taq blends and standard PCR thermocycling conditions. These two overlapping fragments can then be purified and used with external primers Bglfor12 (SEQ ID NO: 5) and PstRev (SEQ ID NO: 6) to generate the desired DNA fragments by PCR. Cleavage with other suitable restriction enzymes when using restriction enzymes BglII and PstI, or other primers, allows for gel separation of fragments containing loops or portions of TN CTLD. This purified fragment is then lysed with a similarly cleaved phage display vector containing restriction modified CTLD fused to gene III, for example pPHCPAB (SEQ ID NO: 150) or pANA27 (SEQ ID NO: 164). May be published (see FIG. 6).

무작위화된 아미노산을 이용한 대체에 의한 다른 루프의 변형은 상기 제시된 것과 유사하게 수행될 수 있다. 무작위화된 아미노산을 이용한 루프 내의 규정된 아미노산의 대체는 임의의 특정 루프에 제한되지 않고, 루프의 본래의 크기에 제한되지 않는다. 마찬가지로, 루프의 전체 대체가 필요하지 않고, 루프 중 임의의 루프에 대해 부분적 대체가 가능하다. 일부 경우에, 루프 내에 본래의 아미노산 중 일부, 예를 들어, 도 7에 제시된 칼슘 협동 아미노산의 보유가 요망될 수 있다. 이러한 경우, 무작위화된 아미노산을 이용한 대체는 칼슘 협동 아미노산을 보유하기 위해 루프 내에 보다 적은 아미노산에 대해 발생할 수 있거나, 상기 칼슘 협동 아미노산을 여전히 보유하면서 루프의 전체 크기를 증가시키기 위해 추가의 무작위화된 아미노산이 루프에 첨가될 수 있다. 매우 큰 펩티드는 루프 1 및 2 또는 루프 3 및 4와 같은 루프 영역을 하나의 큰 대체 루프에 조합시킴으로써 수용되고 시험될 수 있다. 또한, 다른 CTLD, 비제한적인 예로, MBL CTLD가 테트라넥틴의 CTLD 대신 사용될 수 있다. 펩티드의 상기 CTLD로의 이식은 상기 기재된 것과 유사한 방법을 이용하여 발생할 수 있다.Modification of other loops by replacement with randomized amino acids can be performed similarly to that set forth above. Replacement of defined amino acids in a loop with randomized amino acids is not limited to any particular loop, nor to the original size of the loop. Similarly, no full replacement of the loop is required, and partial replacement is possible for any of the loops. In some cases, retention of some of the original amino acids in the loop, for example the calcium co-amino acids shown in FIG. 7, may be desired. In such cases, substitutions using randomized amino acids may occur for fewer amino acids in the loop to retain calcium cooperative amino acids, or additional randomized to increase the overall size of the loop while still retaining the calcium cooperative amino acids. Amino acids can be added to the loop. Very large peptides can be accepted and tested by combining loop regions such as loops 1 and 2 or loops 3 and 4 into one large replacement loop. In addition, other CTLD, non-limiting examples, MBL CTLD may be used in place of the CTLD of tetranectin. Transplantation of peptides into the CTLD can occur using methods similar to those described above.

본 발명의 다양한 예시적 양태에서, IL-23R에 결합하는 폴리펩티드는 CTLD 백본을 기초로 하여 조합 펩티드 라이브러리, 및 라이브러리의 폴리펩티드를 엔코딩하는 핵산 서열의 라이브러리를 이용하여 확인될 수 있으며, 여기서 폴리펩티드의 CTLD는 확인 목적을 위해 계획 (a)-(h)로만 표지되는 다수의 예시적 계획에 따라 변형된다:In various exemplary embodiments of the invention, a polypeptide that binds to IL-23R can be identified using a combinatorial peptide library based on a CTLD backbone, and a library of nucleic acid sequences encoding the polypeptide of the library, wherein the CTLD of the polypeptide Is modified according to a number of exemplary plans labeled only as plans (a)-(h) for identification purposes:

일 양태에서, 본 발명은 조합 펩티드 라이브러리, 및 라이브러리의 폴리펩티드를 엔코딩하는 핵산 서열의 라이브러리를 제공하며, 여기서 폴리펩티드의 CTLD는 확인 목적을 위해 계획 (a)-(j)로만 표지되는 다수의 계획에 따라 변형된다. 각각의 계획은 본원에 더욱 상세히 기재되며, 변형은 적어도 아래와 같다:In one aspect, the invention provides a combinatorial peptide library and a library of nucleic acid sequences encoding a polypeptide of the library, wherein the CTLD of the polypeptide is in a number of schemes labeled only as Schemes (a)-(j) for identification purposes. Is deformed accordingly. Each scheme is described in more detail herein, with variations at least as follows:

(a) 아미노산 변형이 루프 1에 적어도 하나의 아미노산의 삽입 및 루프 1 내의 적어도 5개의 아미노산의 무작위 치환을 포함하는, CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나에서의 아미노산 변형;(a) amino acid modification in at least one of four loops in loop segment A (LSA) of CTLD, wherein the amino acid modification comprises insertion of at least one amino acid into loop 1 and random substitution of at least five amino acids in loop 1;

(b) 아미노산 변형이 루프 1 내의 적어도 5개의 아미노산의 무작위 치환 및 루프 2 내의 적어도 3개의 아미노산의 무작위 치환을 포함하는, CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나에서의 아미노산 변형;(b) amino acid modification in at least one of four loops in loop segment A (LSA) of CTLD, wherein the amino acid modification comprises random substitution of at least five amino acids in loop 1 and random substitution of at least three amino acids in loop 2 ;

(c) 아미노산 변형이 루프 1 내의 적어도 7개의 아미노산의 무작위 치환 및 루프 4에 적어도 하나의 아미노산 삽입을 포함하는, CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나에서의 아미노산 변형;(c) amino acid modification in at least one of the four loops in loop segment A (LSA) of CTLD, wherein the amino acid modification comprises random substitution of at least seven amino acids in loop 1 and insertion of at least one amino acid in loop 4;

(d) 아미노산 변형이 루프 3에 적어도 하나의 아미노산 삽입 및 루프 3 내의 적어도 3개의 아미노산의 무작위 치환을 포함하는, CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나에서의 아미노산 변형;(d) amino acid modification in at least one of the four loops in loop segment A (LSA) of CTLD, wherein the amino acid modification comprises at least one amino acid insertion in loop 3 and random substitution of at least three amino acids in loop 3;

(e) 아미노산 변형이 2개의 루프를 단일한 루프로 조합시키는 변형을 포함하고, 2개의 조합된 루프가 루프 3 및 루프 4인, CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나에서의 아미노산 변형;(e) in at least one of the four loops in loop segment A (LSA) of CTLD, wherein the amino acid modification comprises a modification that combines the two loops into a single loop, wherein the two combined loops are loop 3 and loop 4 Amino acid modification of;

(f) 아미노산 변형이 루프 4에 적어도 하나의 아미노산 삽입 및 루프 4 내의 적어도 3개의 아미노산의 무작위 치환을 포함하는, CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나에서의 아미노산 변형;(f) amino acid modification in at least one of the four loops in loop segment A (LSA) of CTLD, wherein the amino acid modification comprises at least one amino acid insertion in loop 4 and random substitution of at least three amino acids in loop 4;

(g) 아미노산 변형이 루프 3에 적어도 5개의 아미노산 잔기의 무작위 치환 및 루프 5 내의 적어도 3개의 아미노산의 무작위 치환을 포함하는, CTLD의 루프 세그먼트 A(LSA) 및 루프 세그먼트 B(LSB) 내의 5개의 루프 중 적어도 하나에서의 아미노산 변형;(g) five in loop segment A (LSA) and loop segment B (LSB) of CTLD, wherein the amino acid modification comprises random substitution of at least five amino acid residues in loop 3 and random substitution of at least three amino acids in loop 5 Amino acid modification in at least one of the loops;

(h) 아미노산 변형이 루프 3에 적어도 하나의 아미노산의 무작위 치환 및 적어도 6개의 아미노산의 삽입을 포함하는, CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나에서의 아미노산 변형;(h) amino acid modification in at least one of the four loops in loop segment A (LSA) of CTLD, wherein the amino acid modification comprises random substitution of at least one amino acid and insertion of at least six amino acids in loop 3;

(i) 아미노산 변형이 (1) 루프 3에 적어도 6개의 아미노산의 무작위 치환 및 (2) 루프 3에 적어도 6개의 아미노산의 무작위 치환 및 적어도 하나의 아미노산 삽입의 혼합을 포함하는, CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나에서의 아미노산 변형;loop segment A of CTLD, wherein (i) the amino acid modification comprises (1) random substitution of at least 6 amino acids in loop 3 and (2) random substitution of at least 6 amino acids in loop 3 and at least one amino acid insertion Amino acid modification in at least one of the four loops in (LSA);

(j) 아미노산 변형이 CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나 또는 루프 세그먼트 B(LSB) 내의 루프 5에 적어도 4개 또는 그 초과의 아미노산 삽입을 포함하는, CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나에서의 아미노산 변형.(j) loop segment A of CTLD, wherein the amino acid modification comprises at least one or more amino acid insertions in at least one of the four loops in loop segment A (LSA) of CTLD or loop 5 in loop segment B (LSB) Amino acid modification in at least one of the four loops in (LSA).

계획 (a)와 관련하여, 본 발명은 무작위화된 C-타입 렉틴 도메인(CTLD)을 갖는 폴리펩티드 일원을 포함하는 조합 폴리펩티드 라이브러리를 제공하며, 상기 무작위화된 CTLD는 CTLD의 LSA의 4개의 루프 중 적어도 하나 또는 LSB의 루프에서의 아미노산 변형을 포함하고, 아미노산 변형은 루프 1에서 적어도 하나의 아미노산 삽입 및 루프 1 내의 적어도 5개의 아미노산의 무작위 치환을 포함한다.In connection with scheme (a), the present invention provides a combinatorial polypeptide library comprising a member of a polypeptide having a randomized C-type lectin domain (CTLD), wherein the randomized CTLD is in four loops of LSA of CTLD. At least one or amino acid modification in the loop of the LSB, wherein the amino acid modification comprises at least one amino acid insertion in loop 1 and random substitution of at least 5 amino acids in loop 1.

조합 라이브러리의 상기 양태의 특정 구체예에서, CTLD가 인간 테트라넥틴으로부터 유래되는 경우, CTLD는 또한 아르기닌-130의 무작위 치환을 갖는다. 인간 테트라넥틴의 CTLD가 아닌 CTLD에 대해, 이러한 펩티드는 C-말단 방향에서 루프 2의 C-말단 펩티드에 바로 인접하여 위치된다. 예를 들어, 마우스 테트라넥틴에서, 이러한 펩티드는 Gly-130이다. 조합 라이브러리의 상기 양태의 특정 구체예에서, CTLD가 인간 또는 마우스 테트라넥틴으로부터 유래되는 경우, CTLD는 루프 4에서 리신-148의 알라닌으로의 치환을 포함한다.In certain embodiments of this aspect of the combinatorial library, when the CTLD is derived from human tetranectin, the CTLD also has a random substitution of arginine-130. For CTLDs that are not CTLDs of human tetranectin, these peptides are located directly adjacent to the C-terminal peptide of loop 2 in the C-terminal direction. For example, in mouse tetranectin, this peptide is Gly-130. In certain embodiments of this aspect of the combinatorial library, when the CTLD is derived from human or mouse tetranectin, the CTLD comprises substitution of lysine-148 with alanine in loop 4.

특정 구체예에서, 조합 라이브러리가 계획 (a)의 변형된 CTLD를 갖는 경우, 아미노산 변형은 루프 1에 2개의 아미노산 삽입 및 루프 1 내의 적어도 5개의 아미노산의 무작위 치환을 포함한다. 다른 구체예에서, 조합 라이브러리가 계획 (a)의 변형된 CTLD를 갖고, CTLD가 인간 테트라넥틴으로부터 유래되는 경우, 아미노산 변형은 루프 1에서 적어도 하나의 아미노산 삽입, 및 루프 1 내의 적어도 5개의 아미노산의 무작위 치환을 포함하고, 아르기닌 130의 무작위 치환을 포함한다. 일 특정 구체예에서, 조합 라이브러리가 계획 (a)의 변형된 CTLD를 갖고, CTLD가 인간 테트라넥티으로부터 유래되는 경우, 아미노산 변형은 루프 1에 2개의 아미노산 삽입, 루프 1 내의 5개의 아미노산의 무작위 치환, 및 아르기닌 130의 무작위 치환을 포함한다. 일 특정 구체예에서, 조합 라이브러리가 계획 (a)의 변형된 CTLD를 갖고, CTLD가 마우스 테트라넥틴으로부터 유래되는 경우, 아미노산 변형은 루프 1에 2개의 아미노산 삽입, 루프 1 내의 5개의 아미노산의 무작위 치환, 및 류신 130의 무작위 치환을 포함한다. 계획 (a)에 대한 구체예 중 임의의 구체예에서, 아미노산 변형은 리신-148의 알라닌으로의 치환을 추가로 포함할 수 있다. 따라서, 조합 라이브러리의 상기 양태의 일 특정 구체예에서, CTLD는 루프 1에 2개의 아미노산 삽입, 루프 1 내의 적어도 5개의 아미노산의 무작위 치환, 아르기닌-130 또는 C-말단 방향으로 루프 2의 외부에 인접하여 위치된 다른 아미노산의 무작위 치환, 및 루프 4에서의 리신-148의 알라닌으로의 치환을 포함한다.In certain embodiments, where the combinatorial library has a modified CTLD of Scheme (a), the amino acid modification comprises two amino acid insertions in loop 1 and random substitution of at least five amino acids in loop 1. In another embodiment, where the combinatorial library has a modified CTLD of Scheme (a) and the CTLD is derived from human tetranectin, the amino acid modification is at least one amino acid insertion in loop 1, and at least 5 amino acids in loop 1 Random substitutions, and random substitutions of arginine 130. In one specific embodiment, where the combinatorial library has a modified CTLD of Scheme (a) and the CTLD is derived from human tetranecti, the amino acid modification is 2 amino acid insertions in loop 1, random substitution of 5 amino acids in loop 1 , And random substitutions of arginine 130. In one specific embodiment, where the combinatorial library has a modified CTLD of Scheme (a) and the CTLD is derived from mouse tetranectin, the amino acid modification is 2 amino acid insertions in loop 1, random substitution of 5 amino acids in loop 1 , And random substitution of leucine 130. In any of the embodiments of scheme (a), the amino acid modification may further comprise a substitution of lysine-148 with alanine. Thus, in one specific embodiment of this aspect of the combinatorial library, the CTLD comprises two amino acid insertions in loop 1, a random substitution of at least five amino acids in loop 1, adjacent to loop 2 outside of the arginine-130 or C-terminal direction. Random substitutions of other amino acids located therein and substitution of lysine-148 with alanine in loop 4.

계획 (b)와 관련하여, 본 발명은 무작위화된 C-타입 렉틴 도메인(CTLD)을 갖는 폴리펩티드 일원을 포함하는 조합 폴리펩티드 라이브러리를 제공하며, 상기 무작위화된 CTLD는 CTLD의 LSA 내의 4개의 루프 중 적어도 하나에서 아미노산 변형을 포함하고, 상기 아미노산 변형은 루프 1 내의 적어도 5개의 아미노산의 무작위 치환 및 루프 2 내의 적어도 3개의 아미노산의 무작위 치환을 포함한다.In relation to scheme (b), the present invention provides a combinatorial polypeptide library comprising a member of a polypeptide having a randomized C-type lectin domain (CTLD), wherein the randomized CTLD is in four loops within the LSA of CTLD. At least one amino acid modification, wherein the amino acid modification comprises a random substitution of at least 5 amino acids in loop 1 and a random substitution of at least 3 amino acids in loop 2.

계획 (b)의 조합 라이브러리의 상기 양태의 특정 구체예에서, CTLD가 테트라넥틴으로부터 유래되는 경우, 아미노산 변형은 루프 1내의 적어도 5개의 아미노산의 무작위 치환, 루프 2 내의 적어도 3개의 아미노산의 무작위 치환, 및 아르기닌-130 또는 C-말단 방향으로 루프 2의 외부에 인접하여 위치된 다른 아미노산의 무작위 치환을 포함한다. 특정 구체예에서, 조합 라이브러리가 계획 (b)의 변형된 CTLD를 갖고, CTLD가 인간 테트라넥틴으로부터 유래되는 경우, 아미노산 변형은 루프 1에 적어도 5개의 아미노산의 무작위 치환, 루프 2에 적어도 3개의 아미노산의 무작위 치환을 포함하고, 아르기닌 130의 무작위 치환을 포함한다. 일 구체예에서, 조합 라이브러리가 계획 (b)의 변형된 CTLD를 갖고, CTLD가 인간 테트라넥틴으로부터 유래되는 경우, 아미노산 변형은 루프 1에 5개의 아미노산의 무작위 치환, 루프 2에 3개의 아미노산의 무작위 치환, 및 아르기닌 130의 무작위 치환을 포함한다. 특정한 다른 구체예에서, 조합 라이브러리가 계획 (b)의 변형된 CTLD를 갖고, CTLD가 마우스 테트라넥틴으로부터 유래되는 경우, 아미노산 변형은 루프 1에 적어도 5개의 아미노산의 무작위 치환, 루프 2에 적어도 3개의 아미노산의 무작위 치환을 포함하고, 류신 130의 무작위 치환을 포함한다. 일 구체예에서, 조합 라이브러리가 계획 (b)의 변형된 CTLD를 갖고, CTLD가 마우스 테트라넥틴으로부터 유래되는 경우, 아미노산 변형은 루프 1에 5개의 아미노산의 무작위 치환, 루프 2에 3개의 아미노산의 무작위 치환, 및 류신 130의 무작위 치환을 포함한다. 계획 (b)의 구체예 중 임의의 구체예에서, 아미노산 변형은 리신-148의 알라닌으로의 치환을 추가로 포함할 수 있다. 따라서, 일 특정 구체예에서, 아미노산 변형은 루프 1 내의 적어도 5개의 아미노산의 무작위 치환, 루프 2 내의 적어도 3개의 아미노산의 무작위 치환, 및 아르기닌-130 또는 C-말단 방향으로 루프 2의 외부에 인접하여 위치된 다른 아미노산의 무작위 치환 및 루프 4에서의 리신-148의 알라닌으로의 치환을 포함한다.In certain embodiments of this embodiment of the combinatorial library of scheme (b), where the CTLD is derived from tetranectin, the amino acid modification is at least 5 amino acids in loop 1, at least 3 amino acids in loop 2, And random substitution of another amino acid located adjacent to the outside of loop 2 in the arginine-130 or C-terminal direction. In certain embodiments, where the combinatorial library has a modified CTLD of Scheme (b) and the CTLD is derived from human tetranectin, the amino acid modification is random substitution of at least 5 amino acids in loop 1, at least 3 amino acids in loop 2 And a random substitution of Arginine 130. In one embodiment, where the combinatorial library has a modified CTLD of Scheme (b) and the CTLD is derived from human tetranectin, the amino acid modification is a random substitution of 5 amino acids in loop 1, a random of 3 amino acids in loop 2 Substitutions, and random substitutions of arginine 130. In certain other embodiments, where the combinatorial library has a modified CTLD of Scheme (b) and the CTLD is derived from mouse tetranectin, the amino acid modification is random substitution of at least 5 amino acids in loop 1, at least 3 in loop 2 Random substitutions of amino acids, and random substitutions of leucine 130. In one embodiment, where the combinatorial library has a modified CTLD of Scheme (b) and the CTLD is derived from mouse tetranectin, the amino acid modification is a random substitution of 5 amino acids in loop 1, a random of 3 amino acids in loop 2 Substitutions, and random substitutions of leucine 130. In any of the embodiments of scheme (b), the amino acid modification may further comprise a substitution of lysine-148 with alanine. Thus, in one specific embodiment, the amino acid modifications comprise a random substitution of at least 5 amino acids in loop 1, a random substitution of at least 3 amino acids in loop 2, and adjacent to outside of loop 2 in the arginine-130 or C-terminal direction. Random substitutions of the other amino acids positioned and substitution of lysine-148 with alanine in loop 4.

계획 (c)와 관련하여, 본 발명은 무작위화된 C-타입 렉틴 도메인(CTLD)를 갖는 폴리펩티드 일원을 포함하는 조합 폴리펩티드 라이브러리를 제공하며, 상기 무작위화된 CTLD는 CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나에서 아미노산 변형을 포함하고, 상기 아미노산 변형은 루프 1 내의 적어도 7개의 아미노산의 무작위 치환 및 루프 4에 적어도 하나의 아미노산 삽입을 포함한다.In relation to scheme (c), the present invention provides a combinatorial polypeptide library comprising a member of a polypeptide having a randomized C-type lectin domain (CTLD), wherein the randomized CTLD is a loop segment A (LSA) of CTLD. At least one amino acid modification in at least one of the four loops in the amino acid modification, wherein the amino acid modification comprises random substitution of at least seven amino acids in loop 1 and at least one amino acid insertion in loop 4.

조합 라이브러리의 상기 양태의 특정 구체예에서, 조합 라이브러리의 폴리펩티드 일원은 루프 4 내의 적어도 2개의 아미노산의 무작위 치환을 추가로 포함한다. 상기 양태의 특정한 다른 구체예에서, 아미노산 변형은 루프 4 내의 3개의 아미노산 삽입을 포함하고, 임의로 적어도 2개의 아미노산의 무작위 치환을 추가로 포함한다. 일 구체예에서, 아미노산 변형은 루프 1 내의 적어도 7개의 아미노산의 무작위 치환, 루프 4에 적어도 3개의 아미노산 삽입, 및 루프 4 내의 적어도 2개의 아미노산의 무작위 치환을 포함한다. 일 특정 구체예에서, 아미노산 변형은 루프 1 내의 7개의 아미노산의 무작위 치환, 루프 4에 3개의 아미노산 삽입, 및 루프 4 내의 2개의 아미노산의 무작위 치환을 포함한다.In certain embodiments of this aspect of the combinatorial library, the polypeptide member of the combinatorial library further comprises random substitution of at least two amino acids in loop 4. In certain other embodiments of the above embodiments, the amino acid modifications include three amino acid insertions within loop 4 and optionally further comprise random substitution of at least two amino acids. In one embodiment, the amino acid modification comprises random substitution of at least seven amino acids in loop 1, at least three amino acid insertions in loop 4, and random substitution of at least two amino acids in loop 4. In one particular embodiment, the amino acid modification comprises random substitution of seven amino acids in loop 1, three amino acid insertions in loop 4, and random substitution of two amino acids in loop 4.

계획 (d)와 관련하여, 본 발명은 무작위화된 C-타입 렉틴 도메인(CTLD)를 갖는 폴리펩티드 일원을 포함하는 조합 폴리펩티드 라이브러리를 제공하며, 상기 무작위화된 CTLD는 CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나에서 아미노산 변형을 포함하고, 상기 아미노산 변형은 루프 3에 적어도 하나의 아미노산 삽입 및 루프 3 내의 적어도 3개의 아미노산의 무작위 치환을 포함한다.In relation to scheme (d), the present invention provides a combinatorial polypeptide library comprising a member of a polypeptide having a randomized C-type lectin domain (CTLD), wherein the randomized CTLD is a loop segment A (LSA) of CTLD. At least one amino acid modification in at least one of the four loops within, wherein the amino acid modification includes at least one amino acid insertion in loop 3 and random substitution of at least three amino acids in loop 3.

특정 구체예에서, 조합 라이브러리가 계획 (d)의 변형된 CTLD를 갖는 경우, 아미노산 변형은 루프 4에 적어도 하나의 아미노산 삽입을 추가로 포함할 수 있고, 루프 4 내의 적어도 3개의 아미노산의 무작위 치환을 추가로 포함할 수 있다. 계획 (d)에 대해 기재된 구체예 중 임의의 구체예에서, 아미노산 변형은 루프 3에 3개의 아미노산 삽입을 포함할 수 있다. 계획 (d)에 대해 기재된 구체예 중 임의의 구체예에서, 아미노산 변형은 루프 4에 3개의 아미노산 삽입을 포함할 수 있다. 따라서, 특정 구체예에서, 아미노산 변형은 루프 3 내의 적어도 3개의 아미노산의 무작위 치환, 루프 4 내의 적어도 3개의 아미노산의 무작위 치환, 루프 3에 적어도 하나의 아미노산 삽입, 및 루프 4에 적어도 하나의 아미노산 삽입을 포함한다. 특정 구체예에서, 아미노산 변형은 루프 3 내의 적어도 3개의 아미노산의 무작위 치환, 루프 4 내의 적어도 3개의 아미노산의 무작위 치환, 루프 3에 적어도 3개의 아미노산 삽입 및 루프 4에 적어도 3개의 아미노산 삽입을 포함한다. 일 특정 구체예에서, 아미노산 변형은 루프 3 내의 3개의 아미노산의 무작위 치환, 루프 4 내의 3개의 아미노산의 무작위 치환, 루프 3에 3개의 아미노산 삽입 및 루프 4에 3개의 아미노산 삽입을 포함한다. 기재된 구체예 중 임의의 구체예에서, CTLD가 테트라넥틴인 경우, 아미노산 변형은 리신-148의 알라닌으로의 무작위 치환 또는 루프 4에서의 무작위 치환을 추가로 포함할 수 있다.In certain embodiments, where the combinatorial library has a modified CTLD of Scheme (d), the amino acid modification may further comprise at least one amino acid insertion in loop 4, resulting in random substitution of at least three amino acids in loop 4 It may further comprise. In any of the embodiments described for scheme (d), the amino acid modification may comprise three amino acid insertions in loop 3. In any of the embodiments described for scheme (d), the amino acid modification may comprise three amino acid insertions in loop 4. Thus, in certain embodiments, amino acid modifications include random substitution of at least 3 amino acids in loop 3, random substitution of at least 3 amino acids in loop 4, at least one amino acid insertion in loop 3, and at least one amino acid insertion in loop 4 It includes. In certain embodiments, the amino acid modification comprises random substitution of at least 3 amino acids in loop 3, random substitution of at least 3 amino acids in loop 4, at least 3 amino acid insertions in loop 3 and at least 3 amino acid insertions in loop 4 . In one specific embodiment, the amino acid modification comprises random substitution of three amino acids in loop 3, random substitution of three amino acids in loop 4, three amino acid insertions in loop 3 and three amino acid insertions in loop 4. In any of the embodiments described, wherein the CTLD is tetranectin, the amino acid modification may further comprise random substitution of lysine-148 with alanine or random substitution in loop 4.

계획 (e)와 관련하여, 본 발명은 무작위화된 C-타입 렉틴 도메인(CTLD)을 갖는 폴리펩티드 일원을 포함하는 조합 폴리펩티드 라이브러리를 제공하며, 상기 무작위화된 CTLD는 CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나에서 아미노산 변형을 포함하고, 상기 아미노산 변형은 2개의 루프를 단일한 루프로 조합시키는 변형을 포함하고, 상기 2개의 조합된 루프는 루프 3 및 루프 4이다. 특정 구체예에서, 조합 라이브러리의 일원이 계획 (e)의 변형된 CTLD를 갖는 경우, 아미노산 변형은 루프 3 내의 적어도 6개의 아미노산의 무작위 치환 및 루프 4 내의 적어도 4개의 아미노산의 무작위 치환을 포함한다. 일 특정 구체예에서, 아미노산 변형은 루프 3 내의 6개의 아미노산의 무작위 치환 및 루프 4 내의 4개의 아미노산의 무작위 치환을 포함한다. 계획 (e)에 대한 구체예 중 임의의 구체예에서, CTLD가 인간 테트라넥틴으로부터 유래되는 경우, 아미노산 변형은 프롤린-144의 무작위 치환을 추가로 포함할 수 있다. 일 특정 구체예에서, CTLD가 인간 테트라넥틴으로부터 유래되는 경우, 아미노산 변형은 루프 3 내의 6개의 아미노산의 무작위 치환, 루프 4 내의 4개의 아미노산의 무작위 치환, 및 프롤린 144의 무작위 치환을 포함하여, 예를 들어, NWEXXXXXXX XGGXXXN (SEQ ID NO:)를 포함하는 조합된 루프 3 및 루프 4 아미노산 서열을 발생시키며, 여기서 X는 임의의 아미노산이고, 상기 SEQ ID NO:의 아미노산 서열은 단일 루프 영역을 형성한다. 따라서, 일 특정 구체예에서, 조합 라이브러리의 폴리펩티드 일원은 서열 NWEXXXXXXX XGGXXXN (SEQ ID NO:)을 포함하여, 여기서 X는 임의의 아미노산이고, 상기 SEQ ID NO:의 아미노산 서열은 조합되고 변형된 루프 3 및 루프 4로부터의 단일 루프를 형성한다.In connection with scheme (e), the present invention provides a combinatorial polypeptide library comprising a member of a polypeptide having a randomized C-type lectin domain (CTLD), wherein the randomized CTLD is loop segment A (LSA) of CTLD. At least one of the four loops within includes amino acid modifications, wherein the amino acid modifications include modifications that combine the two loops into a single loop, wherein the two combined loops are loop 3 and loop 4. In certain embodiments, where a member of the combinatorial library has a modified CTLD of Scheme (e), the amino acid modification comprises random substitution of at least 6 amino acids in loop 3 and random substitution of at least 4 amino acids in loop 4. In one specific embodiment, the amino acid modification comprises random substitution of six amino acids in loop 3 and random substitution of four amino acids in loop 4. In any of embodiments of scheme (e), when the CTLD is derived from human tetranectin, the amino acid modification may further comprise random substitution of proline-144. In one specific embodiment, where the CTLD is derived from human tetranectin, the amino acid modification comprises random substitution of six amino acids in loop 3, random substitution of four amino acids in loop 4, and random substitution of proline 144, eg For example, it generates a combined loop 3 and loop 4 amino acid sequence comprising NWEXXXXXXX XGGXXXN (SEQ ID NO :), wherein X is any amino acid and the amino acid sequence of SEQ ID NO: forms a single loop region. . Thus, in one specific embodiment, the polypeptide member of the combinatorial library comprises the sequence NWEXXXXXXX XGGXXXN (SEQ ID NO :), wherein X is any amino acid and the amino acid sequence of SEQ ID NO: is combined and modified loop 3 And form a single loop from loop 4.

계획 (f)와 관련하여, 본 발명은 무작위화된 C-타입 렉틴 도메인(CTLD)를 갖는 폴리펩티드 일원을 포함하는 조합 폴리펩티드 라이브러리를 제공하며, 상기 무작위화된 CTLD는 CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나에서 아미노산 변형을 포함하고, 상기 아미노산 변형은 루프 4에 적어도 하나의 아미노산 삽입 및 루프 4 내의 적어도 3개의 아미노산의 무작위화 치환을 포함한다. 특정 구체예에서, 아미노산 변형은 루프 4에 4개의 아미노산 삽입을 포함한다. 일 구체예에서, 아미노산 변형은 루프 4에 적어도 4개의 아미노산 삽입 및 루프 4 내의 적어도 3개의 아미노산의 무작위 치환을 포함한다. 일 특정 구체예에서, 아미노산 치환은 루프 4에 4개의 아미노산 삽입 및 루프 4 내의 3개의 아미노산의 무작위 치환을 포함한다.In relation to scheme (f), the present invention provides a combinatorial polypeptide library comprising a member of a polypeptide having a randomized C-type lectin domain (CTLD), wherein the randomized CTLD is loop segment A (LSA) of CTLD. At least one amino acid modification in at least one of the four loops within, wherein the amino acid modification includes at least one amino acid insertion in loop 4 and randomized substitution of at least three amino acids in loop 4. In certain embodiments, the amino acid modifications comprise four amino acid insertions in loop 4. In one embodiment, the amino acid modification comprises at least four amino acid insertions in loop 4 and random substitution of at least three amino acids in loop 4. In one specific embodiment, the amino acid substitutions comprise four amino acid insertions into loop 4 and random substitution of three amino acids in loop 4.

계획 (g)와 관련하여, 조합 라이브러리의 폴리펩티드 일원은 변형된 루프 3 및 변형된 루프 5를 포함하며, 상기 변형된 루프 3은 5개의 아미노산 잔기의 무작위화를 포함하고, 상기 변형된 루프 5는 3개의 아미노산 잔기의 무작위화를 포함한다. 일 구체예에서, 조합 라이브러리의 폴리펩티드 일원은 변형된 루프 3, 변형된 루프 5, 및 변형된 루프 4를 포함하며, 상기 루프 4에 대한 변형은 플라스미노겐 결합을 폐지시킨다. 예를 들어, 조합 라이브러리가 계획 (g)의 변형된 CTLD를 갖고, CTLD가 인간 테트라넥틴으로부터 유래되는 경우, 아미노산 변형은 CTLD의 플라스미노겐 결합 친화성을 조절하는 루프 4에서의 하나 이상의 아미노산 변형, 예를 들어, 리신 148의 알라닌으로의 치환을 추가로 포함할 수 있다. 따라서, 특정 구체예에서, CTLD가 인간 테트라넥틴으로부터 유래되는 경우, 아미노산 변형은 루프 3에 적어도 5개의 아미노산 잔기의 무작위 치환, 루프 5에 적어도 3개의 아미노산 잔기의 무작위 치환, 및 루프 4에서의 리신 148의 알라닌으로의 치환을 포함한다. 일 특정 구체예에서, 아미노산 변형은 루프 3에 5개의 아미노산 잔기의 무작위 치환 및 루프 5에 3개의 아미노산 잔기의 무작위 치환을 포함하고, 또 다른 구체예에서, CTLD가 인간 테트라넥틴으로부터 유래되는 경우, 아미노산 변형은 루프 4에서 리신 148의 알라닌으로의 치환을 추가로 포함한다.In relation to scheme (g), the polypeptide members of the combinatorial library include modified loop 3 and modified loop 5, wherein the modified loop 3 comprises randomization of five amino acid residues, wherein the modified loop 5 is Randomization of three amino acid residues. In one embodiment, the polypeptide members of the combinatorial library include modified loop 3, modified loop 5, and modified loop 4, wherein the modification to loop 4 abolishes plasminogen binding. For example, if the combinatorial library has a modified CTLD of Scheme (g) and the CTLD is derived from human tetranectin, then the amino acid modification is one or more amino acid modifications in loop 4 that regulate the plasminogen binding affinity of the CTLD. For example, the substitution of lysine 148 with alanine may be further included. Thus, in certain embodiments, where the CTLD is derived from human tetranectin, the amino acid modification is random substitution of at least 5 amino acid residues in loop 3, random substitution of at least 3 amino acid residues in loop 5, and lysine in loop 4 Substitution of 148 for alanine. In one specific embodiment, the amino acid modification comprises random substitution of five amino acid residues in loop 3 and random substitution of three amino acid residues in loop 5, and in another embodiment, where CTLD is derived from human tetranectin, The amino acid modification further comprises substitution of lysine 148 with alanine in loop 4.

계획 (h)와 관련하여, 본 발명은 무작위화된 C-타입 렉틴 도메인(CTLD)를 갖는 폴리펩티드 일원을 포함하는 조합 폴리펩티드 라이브러리를 제공하며, 상기 무작위화된 CTLD는 CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나에서 아미노산 변형을 포함하고, 상기 아미노산 변형은 적어도 하나의 아미노산의 무작위 치환 및 적어도 6개의 아미노산 삽입을 포함한다. 특정 구체예에서, CTLD가 인간 테트라넥틴으로부터 유래되는 경우, 아미노산 변형은 CTLD의 플라스미노겐 결합 친화성을 조절하는 루프 4에서의 하나 이상의 아미노산 변형, 예를 들어, 리신 148의 알라닌으로의 치환을 추가로 포함할 수 있다. 특정 구체예에서, CTLD가 인간 테트라넥틴으로부터 유래되는 경우, 조합 라이브러리의 일원은 루프 3에 적어도 하나의 아미노산의 무작위 치환 및 적어도 6개의 아미노산의 삽입, 및 루프 4에서의 리신 148의 알라닌으로의 치환을 갖는다. 일 특정 구체예에서, 아미노산 변형은 루프 3에 하나의 아미노산의 무작위 치환 및 6개의 아미노산의 삽입을 포함한다. 일 특정 구체예에서, CTLD가 인간 테트라넥틴으로부터 유래되는 경우, 조합 라이브러리의 일원은 루프 3에 하나의 아미노산의 무작위 치환 및 6개의 아미노산의 삽입, 및 루프 4에서의 리신 148의 알라닌으로의 치환을 갖는다. 상기 구체예 중 임의의 구체예에서, CTLD가 인간 테트라넥틴으로부터 유래되는 경우, 치환 중 하나는 이소류신 140의 치환이다.In relation to scheme (h), the present invention provides a combinatorial polypeptide library comprising a member of a polypeptide having a randomized C-type lectin domain (CTLD), wherein the randomized CTLD is loop segment A (LSA) of CTLD. At least one amino acid modification in at least one of the four loops within the amino acid modification, wherein the amino acid modification comprises random substitution of at least one amino acid and at least six amino acid insertions. In certain embodiments, where the CTLD is derived from human tetranectin, the amino acid modifications result in one or more amino acid modifications in loop 4 that modulate the plasminogen binding affinity of the CTLD, eg, substitution of alanine for lysine 148. It may further comprise. In certain embodiments, where the CTLD is derived from human tetranectin, a member of the combinatorial library comprises random substitution of at least one amino acid and insertion of at least six amino acids in loop 3, and substitution of alanine for lysine 148 in loop 4 Has In one specific embodiment, the amino acid modification comprises random substitution of one amino acid and insertion of six amino acids in loop 3. In one specific embodiment, when the CTLD is derived from human tetranectin, a member of the combinatorial library is responsible for random substitution of one amino acid and insertion of six amino acids in loop 3, and substitution of alanine for lysine 148 in loop 4 Have In any of the above embodiments, when the CTLD is derived from human tetranectin, one of the substitutions is the substitution of isoleucine 140.

계획 (i)와 관련하여, 본 발명은 무작위화된 C-타입 렉틴 도메인(CTLD)를 갖는 폴리펩티드 일원을 포함하는 조합 폴리펩티드 라이브러리를 제공하며, 상기 무작위화된 CTLD는 CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나에서 아미노산 변형을 포함하고, 상기 아미노산 변형은 루프 3에 6개의 아미노산의 무작위 치환 및 루프 3에 6개의 아미노산의 무작위 치환 및 하나의 아미노산의 삽입의 혼합을 포함한다. 일 구체예에서, 상기 혼합은 루프 3에 6개의 아미노산의 무작위 치환 및 2개의 아미노산의 삽입을 추가로 포함한다. 따라서, 일 구체예에서, 아미노산 변형은 루프 3에 6개의 아미노산의 무작위 치환, 루프 3에 6개 아미노산의 무작위 치환 및 하나의 아미노산의 삽입, 및 루프 3에 6개 아미노산의 무작위 치환 및 2개의 아미노산의 삽입의 혼합을 포함한다. 계획 (i)의 구체예 중 임의의 구체예에서, CTLD가 인간 테트라넥틴으로부터 유래되는 경우, 아미노산 변형은 루프 4에서의 리신 148의 알라닌으로의 치환을 추가로 포함한다.In relation to scheme (i), the present invention provides a combinatorial polypeptide library comprising a member of a polypeptide having a randomized C-type lectin domain (CTLD), wherein the randomized CTLD is loop segment A (LSA) of CTLD. Amino acid modifications in at least one of the four loops within, wherein the amino acid modifications comprise a combination of random substitution of six amino acids in loop 3 and random substitution of six amino acids in loop 3 and insertion of one amino acid. In one embodiment, the mixing further comprises random substitution of six amino acids and insertion of two amino acids in loop 3. Thus, in one embodiment, the amino acid modification is random substitution of six amino acids in loop 3, random substitution of six amino acids in loop 3 and insertion of one amino acid, and random substitution of six amino acids in loop 3 and two amino acids Includes a mix of insertions. In any of embodiments of scheme (i), wherein the CTLD is derived from human tetranectin, the amino acid modification further comprises a substitution of lysine 148 with alanine in loop 4.

계획 (i)와 관련하여, 본 발명은 무작위화된 C-타입 렉틴 도메인(CTLD)를 갖는 폴리펩티드 일원을 포함하는 조합 폴리펩티드 라이브러리를 제공하며, 상기 무작위화된 CTLD는 CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나에서 아미노산 변형을 포함하고, 상기 아미노산 변형은 CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나에 존재하고, 상기 아미노산 변형은 CTLD의 루프 세그먼트 A(LSA) 내의 4개의 루프 중 적어도 하나 또는 루프 세그먼트 B(LSB) 내의 루프 5에 적어도 4개 또는 그 초과의 아미노산 삽입을 포함한다.In relation to scheme (i), the present invention provides a combinatorial polypeptide library comprising a member of a polypeptide having a randomized C-type lectin domain (CTLD), wherein the randomized CTLD is loop segment A (LSA) of CTLD. An amino acid modification in at least one of the four loops in the amino acid modification, wherein the amino acid modification is in at least one of the four loops in loop segment A (LSA) of CTLD, and wherein the amino acid modification is in loop segment A (LSA) of CTLD At least four or more amino acid insertions in at least one of the four loops or loop 5 in loop segment B (LSB).

조합 라이브러리가 루프 4 영역에 대해 하나 이상의 아미노산 변형(단독 또는 CTLD의 다른 영역에 대한 변형과 조합됨)을 포함하는 구체예에서, 특정 변형(들)은 CTLD의 금속 이온-결합 친화성을 유지하거나, 조절하거나, 폐지시키도록 설계된다. 이러한 변형은 CTLD의 플라스미노겐-결합 활성에 영향을 미친다(예를 들어, Nielbo, et al., Biochemistry, 2004, 43 (27), pp 8636-8643; 또는 Graversen 1998 참조).In embodiments in which the combinatorial library comprises one or more amino acid modifications (either alone or in combination with modifications to other regions of the CTLD) for the loop 4 region, the particular modification (s) may maintain the metal ion-binding affinity of the CTLD or It is designed to be regulated or abolished. Such modifications affect the plasminogen-binding activity of CTLD (see, eg, Nielbo, et al., Biochemistry , 2004 , 43 (27), pp 8636-8643; or Graversen 1998).

라이브러리의 폴리펩티드 일원은 CTLD의 4개의 LSA 루프 및 LSB 루프(루프 5)의 임의의 조합에서 하나 이상의 아미노산 변형(예를 들어, 삽입, 치환, 신장, 또는 무작위화에 의함)을 포함할 수 있다. 따라서, 본원에 기재된 다양한 구체예 중 임의의 구체예에서, 무작위화된 CTLD는 단독이거나 LSA 루프 영역(루프 1-4)의 임의의 1, 2, 3 또는 4개의 루프에서의 하나 이상의 아미노산의 변형과 조합된, LSB 루프 영역(루프 5)의 루프에서의 하나 이상의 아미노산 변형을 포함할 수 있다. 일 양태에서, 본 발명은 무작위화된 C-타입 렉틴 도메인(CTLD)을 갖는 폴리펩티드 일원을 포함하는 조합 폴리펩티드 라이브러리를 제공하며, 상기 무작위화된 CTLD는 CTLD의 루프 세그먼트 A(LSA)의 4개의 루프 중 적어도 하나에서의 하나 이상의 아미노산 변형 및 루프 세그먼트 B(LSB)의 루프(루프 5)에서의 하나 이상의 아미노산 변형을 포함하고, 상기 하나 이상의 아미노산 변형은 LSB 아미노산 잔기의 무작위화를 포함한다.Polypeptide members of the library may comprise one or more amino acid modifications (eg, by insertion, substitution, extension, or randomization) in any combination of the four LSA loops and the LSB loop (loop 5) of the CTLD. Thus, in any of the various embodiments described herein, the randomized CTLD is alone or a modification of one or more amino acids in any one, two, three, or four loops of the LSA loop region (loops 1-4) And one or more amino acid modifications in the loop of the LSB loop region (Loop 5) in combination with. In one aspect, the present invention provides a combinatorial polypeptide library comprising a member of a polypeptide having a randomized C-type lectin domain (CTLD), wherein the randomized CTLD comprises four loops of loop segment A (LSA) of CTLD. One or more amino acid modifications in at least one of and one or more amino acid modifications in a loop (loop 5) of loop segment B (LSB), wherein the one or more amino acid modifications comprise randomization of LSB amino acid residues.

본원에 기재된 다양한 구체예에 따르면, 조합 라이브러리의 폴리펩티드 일원은 CTLD의 루프 영역(LSA 및 LSB) 내의 임의의 2, 3, 4 또는 5개의 루프에서의 하나 이상의 아미노산 변형(예를 들어, 2개의 루프, 3개의 루프, 4개의 루프, 또는 5개 모두의 루프에 대한 무작위 아미노산 변형의 임의의 무작위 조합)을 가질 수 있다. 조합 라이브러리의 폴리펩티드 일원은 루프 영역(LSA 및 LSB)의 외부의 CTLD의 영역, 예를 들어, α-헬릭스 또는 β-가닥 내에 대해 추가의 아미노산 변형을 추가로 포함할 수 있다(예를 들어, 도 1 참조).According to various embodiments described herein, a polypeptide member of a combinatorial library may comprise one or more amino acid modifications (eg, two loops) in any two, three, four, or five loops within the loop regions (LSA and LSB) of the CTLD. , 3 loops, 4 loops, or any random combination of random amino acid modifications to all 5 loops). Polypeptide members of the combinatorial library may further comprise additional amino acid modifications within regions of the CTLD outside the loop regions (LSA and LSB), eg, α-helices or β-strands (eg, FIG. 1).

본 발명의 추가 구체예에서, CTLD 루프 영역은 하기 비제한적인 실시예에 상술된 예시적 작제물 이상으로 확장될 수 있다.In a further embodiment of the invention, the CTLD loop region may extend beyond the exemplary constructs detailed in the following non-limiting examples.

일 양태에서, 본 발명은 또한 상기 기재된 양태 및 구체예 중 어느 하나에 따른 조합 폴리펩티드 라이브러리의 폴리펩티드를 엔코딩하는 핵산 분자의 라이브러리를 제공한다. 이러한 양태의 일 구체예에서, 본 발명은 라이브러리의 폴리펩티드를 엔코딩하는 핵산 서열의 라이브러리를 제공하며, 상기 폴리펩티드의 CTLD는 계획 (a)-(j)에 따라 변형된다.In one aspect, the invention also provides a library of nucleic acid molecules encoding a polypeptide of a combination polypeptide library according to any one of the embodiments and embodiments described above. In one embodiment of this aspect, the invention provides a library of nucleic acid sequences encoding a polypeptide of a library, wherein the CTLD of the polypeptide is modified according to schemes (a)-(j).

하기 실시예에 더욱 충분히 기재되는 바와 같이, 바람직한 결합 특징을 갖는다수의 폴리펩티드가 도 8에 제시된 바와 같은, 예를 들어, SEQ ID NOS:133-141을 포함하는 변형 계획 (a)-(h) 중 하나 이상에 의해 확인된다.As will be described more fully in the Examples below, modification schemes (a)-(h) comprising preferred polypeptides include, for example, SEQ ID NOS: 133-141 as shown in FIG. 8. It is confirmed by one or more of.

방법 3Method 3

또 다른 방법에서, IL-23R에 결합하는 공지된 폴리펩티드가 시스테인을 이용하여 자유 선형 펩티드 또는 이황화물 한정된 루프로서 N 또는 C 말단 삼합체화 도메인에 직접적으로 클로닝될 수 있다. IL-23R에 결합할 수 있는 단일 사슬 항체 또는 도메인 항체가 또한 삼합체화 도메인의 어느 하나의 말단에 클로닝될 수 있다. 또한, 공지된 결합 특성을 갖는 펩티드가 TN CTLD의 루프 영역 중 어느 하나에 직접 클로닝될 수 있다. 이황화물 한정된 루프 또는 항체의 상보성 결정 영역으로서 선택된 펩티드가 인간 테트라넥틴의 CTLD의 루프 영역으로의 재배치에 매우 적용가능할 수 있다. 이러한 모든 작제물에 대해, 단량체로서의 결합 뿐만 아니라 삼합체화 도메인과 융합되는 경우 삼합체로서의 결합 및 차단 활성이 이후에 시험될 수 있다.In another method, known polypeptides that bind IL-23R can be cloned directly to the N or C terminal trimerization domain using cysteine as a free linear peptide or disulfide defined loop. Single chain antibodies or domain antibodies capable of binding to IL-23R can also be cloned at either end of the trimerization domain. In addition, peptides with known binding properties can be cloned directly into either of the loop regions of the TN CTLD. Peptides selected as disulfide-limited loops or complementarity determining regions of antibodies may be highly applicable to rearrangement of human tetranectin into the loop region of CTLD. For all such constructs, binding as a monomer as well as binding and blocking activity as trimer when fused with the trimerization domain can be tested later.

방법 4:Method 4:

결합 활성을 갖는 펩티드의 직접적인 클로닝이 충분하지 않을 수 있는 일부 경우, 추가 최적화 및 선택이 필요할 수 있다. 예를 들어, 상기 언급된 것을 포함하나 이에 제한되지는 않는 IL-23R에 대한 공지된 결합을 갖는 펩티드는 인간 테트라넥틴의 CTLD로 이식될 수 있다. 결합에 대해 상기 펩티드의 최적 제시를 선택하기 위해, 측면에 존재하는 아미노산 중 하나 이상이 무작위화된 후, 결합에 대한 파지 디스플레이 선택이 수행될 수 있다. 또한, 제한되거나 약한 결합을 단독으로나타내는 펩티드는 또한 또 다른 추가 루프의 무작위화를 함유하는 CTLD 라이브러리의 루프 중 하나에 이식된 후, 증가된 결합 및/또는 특이성에 대해 파지 디스플레이를 통한 선택이 수행될 수 있다. 추가로, 특이적 상호작용/결합 아미노산이 공지된 결정 구조를 통해 확인된 펩티드에 대해, 비-결합 아미노산의 무작위화가 조사된 후, 증가된 결합 및 수용체 특이성에 대해 파지 디스플레이를 통한 선택이 수행될 수 있다. 결합을 책임지는 것으로 확인된 IL-23 리간드의 영역이 또한 종을 교차하여 시험될 수 있다. 보존된 아미노산은 보유되면서, 비-종 보존된 위치에 대한 무작위화 및 선택이 시험될 수 있다.In some cases where direct cloning of peptides with binding activity may not be sufficient, further optimization and selection may be necessary. For example, peptides with known binding to IL-23R, including but not limited to those mentioned above, can be transplanted into CTLD of human tetranectin. In order to select the optimal presentation of the peptide for binding, phage display selection for binding can be performed after one or more of the amino acids present on the side are randomized. In addition, peptides that exhibit limited or weak binding alone are also implanted into one of the loops of the CTLD library containing randomization of another additional loop, followed by selection via phage display for increased binding and / or specificity. Can be. In addition, for peptides whose specific interaction / binding amino acids have been identified through known crystal structures, randomization of non-binding amino acids is investigated, followed by selection via phage display for increased binding and receptor specificity. Can be. Regions of the IL-23 ligand identified as responsible for binding can also be tested across species. While conserved amino acids are retained, randomization and selection for non-species conserved positions can be tested.

처리 방법Processing method

본 발명의 또 다른 양태는 IL-23R을 발현하는 세포에서 IL-23R의 활성화를 방지하는 방법에 관한 것이다. 상기 방법은 상기 세포와 삼합체화 도메인 및 IL-23R에 특이적으로 결합하는 적어도 하나의 폴리펩티드를 포함하는 본 발명의 IL-23R 결합 폴리펩티드를 접촉시키는 것을 포함한다. 이러한 양태의 일 구체예에서, 상기 방법은 상기 세포와 본 발명의 삼합체 복합체를 접촉시키는 것을 포함한다. IL-23R 결합 폴리펩티드는 IL-23R(또는 이종이합체 수용체)의 길항제일 수 있거나, IL-23R에 결합하여, 종양, 염증 부위, 또는 IL-23R을 제시하는 다른 요망되는 위치로의 상기 기재된 바와 같은 삼합체화 도메인과 회합된 치료제의 국소 전달을 가능케 할 수 있다.Another aspect of the invention relates to a method of preventing the activation of IL-23R in cells expressing IL-23R. The method comprises contacting the cell with an IL-23R binding polypeptide of the invention comprising at least one polypeptide that specifically binds to the trimerization domain and IL-23R. In one embodiment of this aspect, the method comprises contacting said cell with the trimer complex of the invention. The IL-23R binding polypeptide may be an antagonist of IL-23R (or a heterodimeric receptor) or as described above to bind to IL-23R to present a tumor, site of inflammation, or other desired location presenting IL-23R. Local delivery of therapeutic agents associated with trimerization domains may be possible.

또 다른 양태에서, 본 발명은 삼합체화 도메인 및 IL-23R에 특이적으로 결합하는 적어도 하나의 폴리펩티드를 갖는 폴리펩티드를 포함하는 치료적 유효량의 IL-23R 길항제를 대상체에 투여함으로써 면역 장애 또는 종양을 갖는 대상체를 치료하는 방법에 관한 것이다. 이러한 양태의 일 구체예에서, 상기 방법은 본 발명의 삼합체 복합체를 대상체에 투여하는 것을 포함한다.In another aspect, the invention provides an immunological disorder or tumor by administering to a subject a therapeutically effective amount of an IL-23R antagonist comprising a polypeptide having a trimerization domain and at least one polypeptide that specifically binds IL-23R. A method of treating a subject. In one embodiment of this aspect, the method comprises administering to the subject a trimer complex of the invention.

본 발명의 또 다른 양태는 조합 요법에 관한 것이다. IL-23R 길항제 및 치료제를 포함하는 제형이 또한 본 발명에 의해 제공된다. 이러한 제형은 저장 뿐만 아니라 치료적 투여에 특히 적합한 것으로 생각된다. 상기 제형은 공지된 기술에 의해 제조될 수 있다. 예를 들어, 제형은 겔 여과 컬럼 상에서의 완충액 교환에 의해 제조될 수 있다.Another aspect of the invention relates to combination therapy. Formulations comprising an IL-23R antagonist and a therapeutic agent are also provided by the present invention. Such formulations are believed to be particularly suitable for storage as well as therapeutic administration. Such formulations may be prepared by known techniques. For example, formulations can be prepared by buffer exchange on a gel filtration column.

본원에 기재된 IL-23R 길항제 및 치료제는 다양한 치료 적용에 사용될 수 있다. 상기 적용 중 하나는 다양한 암을 치료하는 방법이다. IL-23R 길항제 및 치료제가 공지된 방법에 따라, 예를 들어, 볼루스로서 정맥내 투여 또는 일정 기간에 걸친 연속적 주입, 근내, 복막내, 뇌척수내, 피하, 관절내, 활액내, 수막강내, 경구, 국소 또는 흡입 경로에 의해 투여될 수 있다. 임의로, 투여는 다양한 시판되는 장치를 이용하여 미니-펌프(mini-pump) 주입을 통해 수행될 수 있다.The IL-23R antagonists and therapeutic agents described herein can be used in a variety of therapeutic applications. One such application is a method of treating various cancers. IL-23R antagonists and therapeutic agents are known, for example, by intravenous administration as a bolus or continuous infusion over a period of time, intramuscular, intraperitoneal, intracerebral, subcutaneous, intra-articular, intramuscular, intramedullary It may be administered by the oral, topical or inhalation route. Optionally, administration can be performed via mini-pump infusion using various commercially available devices.

IL-23R 길항제를 투여하기 위한 유효 용량 및 스케줄은 경험적으로 결정될 수 있고, 이러한 결정은 당 분야의 기술 범위 내이다. 단일 또는 다수의 투여량이 이용될 수 있다. 단독으로 사용되는 길항제의 유효 용량 또는 유효량은 하루 당 체중 kg 당 약 1 μg 내지 약 100 mg 또는 이 초과의 범위일 수 있는 것으로 현재 생각된다. 투여량의 종간 크기 조정은 당 분야에 공지된 방식으로 수행될 수 있으며, 예를 들어, 이는 문헌[Mordenti et al ., Pharmaceut. Res., 8: 1351 (1991)]에 개시되어 있다.Effective doses and schedules for administering IL-23R antagonists can be determined empirically, and such determinations are within the skill of the art. Single or multiple dosages may be used. It is currently contemplated that the effective dose or effective amount of the antagonist used alone may range from about 1 μg to about 100 mg or more per kg of body weight per day. Interstitial sizing of the dosage can be performed in a manner known in the art, for example, as described in Mordenti et. al . , Pharmaceut. Res., 8: 1351 (1991).

IL-23R 길항제의 생체내 투여가 이용되는 경우, 일반 투여량은 투여 경로에 따라 하루 당 포유동물 체중 kg 당 약 10 ng 내지 100 mg 또는 이 초과, 바람직하게는 약 1 μg/kg/일 내지 10 mg/kg/일로 다양할 수 있다. 특정 투여량 및 전달 방법에 관한 지침은 문헌에 제공된다[예를 들어, U.S. Pat. Nos. 4,657,760; 5,206,344; 또는 5,225,212 참조]. 당업자는 다양한 제형이 다양한 치료 화합물 및 다양한 장애에 대해 효과적일 것이며, 예를 들어, 하나의 기관 또는 조직을 표적으로 하는 투여가 또 다른 기관 또는 조직에 대한 방식과 상이한 방식의 전달을 필요로 할 수 있음을 인지할 것이다. 당업자는 투여되어야 하는 IL-23R 길항제의 투여량이, 예를 들어, IL-23R 길항제가 투여될 포유동물, 투여 경로, 및 포유동물에 투여되는 다른 약물 또는 요법에 따라 다양할 것임을 이해할 것이다.When in vivo administration of an IL-23R antagonist is used, the general dosage is from about 10 ng to 100 mg or more, preferably from about 1 μg / kg / day to 10, per kg body weight of mammal per day, depending on the route of administration. May vary from mg / kg / day. Guidance as to particular dosages and methods of delivery is provided in the literature [eg, U.S. Pat. Pat. Nos. 4,657,760; 5,206,344; Or 5,225,212]. Those skilled in the art will appreciate that various formulations will be effective for a variety of therapeutic compounds and a variety of disorders, for example, administration targeted to one organ or tissue may require delivery in a manner different from that for another organ or tissue. It will be recognized. Those skilled in the art will appreciate that the dosage of the IL-23R antagonist to be administered will vary depending, for example, on the mammal to which the IL-23R antagonist will be administered, the route of administration, and other drugs or therapies administered to the mammal.

상기 방법에 또 다른 추가 요법이 이용될 수 있는 것이 고려된다. 하나 이상의 다른 요법은 방사선 요법제, 사이토카인(들), 성장 억제제(들), 화학요법제(들), 세포독성제(들), 티로신 키나제 억제제, ras 파르네실 트랜스퍼라제 억제제, 혈관형성 억제제, 및 사이클린-의존성 키나제 억제제 또는 당 분야에 공지된 IL-23R 길항제에 의한 사멸에 대해 암세포의 감수성을 향상시키는 임의의 다른 작용제의 투여를 포함할 수 있으나, 이에 제한되지는 않는다.It is contemplated that other additional therapies may be used in the method. One or more other therapies include radiation therapy, cytokine (s), growth inhibitor (s), chemotherapeutic agent (s), cytotoxic agent (s), tyrosine kinase inhibitors, ras farnesyl transferase inhibitors, angiogenesis inhibitors, And administration of any other agent that enhances the sensitivity of cancer cells to death by cyclin-dependent kinase inhibitors or IL-23R antagonists known in the art.

화학요법제의 제조 및 투여 스케줄은 제조업체의 설명서에 따라 이용되거나, 숙련된 의료인에 의해 경험적으로 결정될 수 있다. 상기 화학요법제에 대한 제조 및 투여 스케줄은 또한 문헌[Chemotherapy Service Ed., M. C. Perry, Williams & Wilkins, Baltimore, Md. (1992)]에 기재되어 있다. 화학요법제는 Apo2L 변이체의 투여에 선행하거나 후속될 수 있거나, Apo2L 변이체와 동시에 제공될 수 있다.Preparation and dosing schedules for chemotherapeutic agents may be used in accordance with the manufacturer's instructions or may be empirically determined by a skilled practitioner. Preparation and dosing schedules for such chemotherapeutic agents are also described in Chemotherapy Service Ed., M. C. Perry, Williams & Wilkins, Baltimore, Md. (1992). Chemotherapeutic agents may precede or follow administration of the Apo2L variant, or may be provided concurrently with the Apo2L variant.

본 발명의 폴리펩티드 및 치료제(및 하나 이상의 다른 요법)는 동시(함께) 또는 순차적으로 투여될 수 있다. 특정 구체예에서, 본 발명의 비 자연 폴리펩티드 또는 이의 다합체(예를 들어, 삼합체) 복합체, 및 치료제는 동시에 투여된다. 또 다른 구체예에서, 폴리펩티드 또는 삼합체 복합체는 치료제의 투여 전에 투여된다. 또 다른 구체예에서, 치료제는 폴리펩티드 또는 삼합체 복합체 전에 투여된다. 투여 후, 시험관내 치료되는 세포가 분석될 수 있다. 생체내 치료되는 경우, 치료된 포유동물은 숙련된 의료인에게 널리 공지된 다양한 방법으로 모니터될 수 있다. 예를 들어, 종양 조직은 세포 사멸에 대해 검정하기 위해 병리학적으로 시험될 수 있거나, 혈청이 면역계 반응에 대해 분석될 수 있다.The polypeptides and therapeutic agents (and one or more other therapies) of the invention may be administered simultaneously (in conjunction) or sequentially. In certain embodiments, non-natural polypeptides of the invention or multimers (eg, trimer) complexes thereof, and the therapeutic agent are administered simultaneously. In another embodiment, the polypeptide or trimer complex is administered prior to the administration of the therapeutic agent. In another embodiment, the therapeutic agent is administered before the polypeptide or trimer complex. After administration, the cells to be treated in vitro can be analyzed. When treated in vivo, the treated mammal can be monitored by a variety of methods well known to the skilled practitioner. For example, tumor tissue may be pathologically tested to assay for cell death, or serum may be analyzed for immune system response.

약학적 조성물Pharmaceutical composition

또 다른 양태에서, 본 발명은 약학적으로 허용되는 담체 또는 부형제와 함께 치료적 유효량의 본 발명의 폴리펩티드를 포함하는 약학적 조성물에 관한 것이다. 본원에서 사용되는 "약학적으로 허용되는 담체" 또는 "약학적으로 허용되는 부형제"는 생리학적으로 적합한 임의의 및 모든 용매, 분산 매질, 코팅, 항균제 및 항진균제, 등장화제 및 흡수 지연제 등을 포함한다. 약학적으로 허용되는 담체 또는 부형제의 예는 물, 염수, 인산염 완충 염수, 덱스트로오스, 글리세롤, 에탄올 등 뿐만 아니라 이의 조합물 중 하나 이상을 포함한다. 많은 경우에, 조성물 내에 등장화제, 예를 들어, 당, 폴리알콜, 예를 들어, 만니톨, 소르비톨, 또는 염화나트륨을 포함시키는 것이 바람직할 것이다. 항체 또는 항체 부분의 저장수명 또는 효과를 향상시키는 약학적으로 허용되는 물질, 예를 들어, 습윤 물질 또는 소량의 보조 물질, 예를 들어, 습윤제 또는 유화제, 보존제 또는 완충제가 또한 포함될 수 있다. 임의로, 붕해제, 예를 들어, 가교된 폴리비닐 피롤리돈, 아가, 알긴산 또는 이의 염, 예를 들어, 소듐 알기네이트 등이 포함될 수 있다. 부형제 외에, 약학적 조성물은 담체 단백질, 예를 들어, 혈청 알부민, 완충제, 결합제, 감미제 및 다른 착향제, 착색제 및 폴리에틸렌 글리콜 중 하나 이상을 포함할 수 있다.In another aspect, the invention relates to a pharmaceutical composition comprising a therapeutically effective amount of a polypeptide of the invention in combination with a pharmaceutically acceptable carrier or excipient. As used herein, “pharmaceutically acceptable carrier” or “pharmaceutically acceptable excipient” includes any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like that are physiologically compatible. do. Examples of pharmaceutically acceptable carriers or excipients include water, saline, phosphate buffered saline, dextrose, glycerol, ethanol and the like, as well as combinations thereof. In many cases, it will be preferable to include isotonic agents, for example, sugars, polyalcohols such as mannitol, sorbitol, or sodium chloride in the composition. Pharmaceutically acceptable substances, such as wetting substances or small amounts of auxiliary substances, such as wetting or emulsifying agents, preservatives or buffers, which enhance the shelf life or effectiveness of the antibody or antibody portion, may also be included. Optionally, disintegrants such as crosslinked polyvinyl pyrrolidone, agar, alginic acid or salts thereof, such as sodium alginate and the like, may be included. In addition to excipients, the pharmaceutical composition may comprise carrier proteins such as one or more of serum albumin, buffers, binders, sweeteners and other flavoring agents, colorants and polyethylene glycols.

조성물은, 예를 들어, 액체, 반-고체 및 고체 투여 형태, 예를 들어, 액체 용액(예를 들어, 주사 및 주입 용액), 분산액 또는 현탁액, 정제, 환약, 분말, 리포솜 및 좌약을 포함하는 다양한 형태로 존재할 수 있다. 바람직한 형태는 의도된 투여 경로 및 치료 적용에 좌우될 것이다. 일 구체예에서, 조성물은 주사 또는 주입 용액, 예를 들어, 항체를 이용한 인간의 수동 면역화에 사용되는 것과 유사한 조성물의 형태이다. 일 구체예에서, 투여 방식은 비경구(예를 들어, 정맥내, 피하, 복막내, 근내) 투여이다. 일 구체예에서, 폴리펩티드(또는 삼합체 복합체)는 정맥내 주입 또는 주사에 의해 투여된다. 또 다른 구체예에서, 폴리펩티드 또는 삼합체 복합체는 근내 또는 피하 주사에 의해 투여된다.The composition includes, for example, liquid, semi-solid and solid dosage forms, such as liquid solutions (eg injection and infusion solutions), dispersions or suspensions, tablets, pills, powders, liposomes and suppositories. It can exist in various forms. Preferred forms will depend on the intended route of administration and therapeutic application. In one embodiment, the composition is in the form of a composition similar to that used for passive immunization of a human with an injection or infusion solution, eg, an antibody. In one embodiment, the mode of administration is parenteral (eg, intravenous, subcutaneous, intraperitoneal, intramuscular) administration. In one embodiment, the polypeptide (or trimer complex) is administered by intravenous infusion or injection. In another embodiment, the polypeptide or trimer complex is administered by intramuscular or subcutaneous injection.

약학적 조성물에 대한 투여의 다른 적합한 경로는 직장, 경피, 질내, 경점막 또는 장내 투여를 포함하나, 이에 제한되지는 않는다.Other suitable routes of administration for the pharmaceutical compositions include, but are not limited to, rectal, transdermal, vaginal, transmucosal or intestinal administration.

치료 조성물은 전형적으로 멸균되며, 제조 및 저장 조건하에서 안정적이다. 조성물은 용액, 마이크로에멀젼, 분산액, 리포솜, 또는 높은 약물 농도에 적합한 다른 정연한 구조로 제형화될 수 있다. 멸균 주사 용액은 활성 화합물(즉, 폴리펩티드 또는 삼합체 복합체)을 필요시 상기 열거된 성분 중 하나 또는 이의 조합물을 갖는 필요량의 적절한 용매에 혼입시킨 후, 멸균 여과시킴으로써 제조될 수 있다. 일반적으로, 분산액은 활성 화합물을 기본 분산액 매질 및 상기 열거된 것으로부터의 필요한 다른 성분을 함유하는 멸균 비히클로 혼입시킴으로써 제조된다. 멸균 주사 용액의 제조를 위한 멸균 분말의 경우에서, 바람직한 제조 방법은 사전에 멸균-여과된 용액으로부터 활성 성분의 분말과 함께 임의의 추가적인 요망되는 성분을 생성시키는 진공 건조 및 동결-건조이다. 용액의 적당한 유동성은, 예를 들어, 레시틴과 같은 코팅의 사용, 분산액의 경우 필요한 입자 크기의 유지, 및 표면활성제의 사용에 의해 유지될 수 있다. 주사 조성물의 연장된 흡수는 조성물 내에 흡수를 지연시키는 작용제, 예를 들어, 모노스테아레이트 염 및 겔라틴을 포함시킴으로써 발생될 수 있다.Therapeutic compositions are typically sterile and stable under the conditions of manufacture and storage. The compositions may be formulated in solution, microemulsions, dispersions, liposomes, or other ordered structures suitable for high drug concentrations. Sterile injectable solutions can be prepared by incorporating the active compound (ie, a polypeptide or trimer complex) in the required amount in an appropriate solvent with one or a combination of ingredients enumerated above, as required, followed by sterile filtration. Generally, dispersions are prepared by incorporating the active compound into a sterile vehicle which contains a basic dispersion medium and the required other ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile injectable solutions, preferred methods of preparation are vacuum drying and freeze-drying to produce any additional desired ingredients together with the powder of the active ingredient from the presterile-filtered solution. Proper fluidity of the solution can be maintained, for example, by the use of a coating such as lecithin, by the maintenance of the required particle size in the case of dispersions, and by the use of surfactants. Prolonged absorption of the injectable composition may occur by including an agent in the composition which delays absorption, for example, monostearate salt and gelatin.

IL-23R 길항제 및 본원에 기재된 장애의 치료에 유용한 치료제를 함유하는 키트와 같은 제조 물품은 적어도 용기 및 표지를 포함한다. 적합한 용기는, 예를 들어, 병, 바이얼, 주사기, 및 시험관을 포함한다. 용기는 다양한 물질, 예를 들어, 유리 또는 플라스틱으로부터 형성될 수 있다. 용기 상 또는 용기와 관련된 표지는 제형이 선택 질환을 치료하는데 사용되는 것을 나타낸다. 제조 물품은 약학적으로 허용되는 완충제, 예를 들어, 인산염 완충 염수, 링거액, 및 덱스트로오스 용액을 포함하는 용기를 추가로 포함할 수 있다. 이는 추가로 다른 완충제, 희석제, 필터, 바늘, 주사기, 및 사용 설명서를 갖는 패키지 삽입물을 포함하는 시판 및 사용자 입장에서 요망되는 다른 물질을 추가로 포함할 수 있다. 제조 물품은 또한 상기 기재된 바와 같은 또 다른 활성제를 갖는 용기를 포함할 수 있다.Articles of manufacture, such as kits containing IL-23R antagonists and therapeutic agents useful for the treatment of the disorders described herein, include at least a container and a label. Suitable containers include, for example, bottles, vials, syringes, and test tubes. The container may be formed from various materials, for example glass or plastic. Labels on or associated with the container indicate that the formulation is to be used to treat the disease of choice. The article of manufacture may further comprise a container comprising a pharmaceutically acceptable buffer such as phosphate buffered saline, Ringer's solution, and dextrose solution. It may further include other materials desired from the commercial and user standpoint, including other buffers, diluents, filters, needles, syringes, and package inserts with instructions for use. The article of manufacture may also include a container having another active agent as described above.

전형적으로, 적절한 양의 약학적으로 허용되는 염이 제형에 사용되어, 제형이 등장성이 되도록 한다. 약학적으로 허용되는 담체의 예는 염, 링거액 및 덱스트로오스 용액을 포함한다. 제형의 pH는 바람직하게는 약 6 내지 약 9, 더욱 바람직하게는 약 7 내지 약 7.5이다. 특정 담체가, 예를 들어, 투여 경로 및 IL-23R 길항제 및 치료제의 농도에 따라 더욱 바람직할 수 있음이 당업자에게 명백할 것이다.Typically, an appropriate amount of pharmaceutically acceptable salt is used in the formulation to render the formulation isotonic. Examples of pharmaceutically acceptable carriers include salts, Ringer's solution and dextrose solution. The pH of the formulation is preferably about 6 to about 9, more preferably about 7 to about 7.5. It will be apparent to those skilled in the art that particular carriers may be more desirable depending on, for example, the route of administration and the concentration of IL-23R antagonist and therapeutic agent.

치료 조성물은 적절한 정도의 순도를 갖는 요망되는 분자와 동결건조된 제형, 수용액 또는 수성 현탁액 형태의 임의의 약학적으로 허용되는 담체, 부형제, 또는 안정화제를 혼합시킴으로써 제조될 수 있다(Remington's Pharmaceutical Sciences, 16th edition, Osol, A. ed. (1980)). 허용되는 담체, 부형제, 또는 안정화제는 바람직하게는 사용되는 투여량 및 농도에서 수용자에게 비독성이고, 이는 완충제, 예를 들어, Tris, HEPES, PIPES, 인산염, 구연산염, 및 다른 유기산; 항산화제, 예를 들어, 아스코르브산 및 메티오닌; 보존제(예를 들어, 옥타데실디메틸벤질 암모늄 클로라이드; 헥사메토늄 클로라이드; 벤즈알코늄 클로라이드, 벤제토늄 클로라이드; 페놀, 부틸 또는 벤질 알콜; 알킬 파라벤, 예를 들어, 메틸 또는 프로필 파라벤; 카테콜; 레조르시놀; 사이클로헥사놀; 3-펜탄올; 및 m-크레솔); 저분자량(약 10개 잔기 미만) 폴리펩티드; 단백질, 예를 들어, 혈청 알부민, 겔라틴, 또는 면역글로불린; 친수성 중합체, 예를 들어, 폴리비닐피롤리돈; 아미노산, 예를 들어, 글리신, 글루타민, 아스파라긴, 히스티딘, 아르기닌, 또는 리신; 단당류, 이당류, 및 다른 탄수화물, 예를 들어, 글루코오스, 만노오스, 또는 덱스트린; 당, 예를 들어, 수크로오스, 만니톨, 트레할로오스 또는 소르비톨; 염-형성 반대-이온, 예를 들어, 나트륨; 및/또는 비-이온성 표면활성제, 예를 들어, TWEEN™, PLURONICS™ 또는 폴리에틸렌 글리콜(PEG)를 포함한다.Therapeutic compositions can be prepared by mixing the desired molecule with the appropriate degree of purity with any pharmaceutically acceptable carrier, excipient, or stabilizer in the form of a lyophilized formulation, aqueous solution or aqueous suspension (Remington's Pharmaceutical Sciences, 16th edition, Osol, A. ed. (1980)). Acceptable carriers, excipients, or stabilizers are preferably nontoxic to recipients at the dosages and concentrations employed, including buffers such as Tris, HEPES, PIPES, phosphate, citrate, and other organic acids; Antioxidants such as ascorbic acid and methionine; Preservatives (e.g., octadecyldimethylbenzyl ammonium chloride; hexamethonium chloride; benzalkonium chloride, benzetonium chloride; phenol, butyl or benzyl alcohol; alkyl parabens, for example methyl or propyl paraben; catechol; rezo Lecinol; cyclohexanol; 3-pentanol; and m-cresol); Low molecular weight (less than about 10 residues) polypeptides; Proteins such as serum albumin, gelatin, or immunoglobulins; Hydrophilic polymers such as polyvinylpyrrolidone; Amino acids such as glycine, glutamine, asparagine, histidine, arginine, or lysine; Monosaccharides, disaccharides, and other carbohydrates such as glucose, mannose, or dextrins; Sugars such as sucrose, mannitol, trehalose or sorbitol; Salt-forming counter-ions such as sodium; And / or non-ionic surfactants such as TWEEN ™, PLURONICS ™ or polyethylene glycol (PEG).

상기 담체의 추가 예는 이온교환제, 알루미나, 알루미늄 스테아레이트, 레시틴, 혈청 단백질, 예를 들어, 인간 혈청 알부민, 완충 물질, 예를 들어, 글리신, 소르브산, 포타슘 소르베이트, 포화된 식물성 지방산의 부분적 글리세라이드 혼합물, 물, 염, 또는 전해질, 예를 들어, 프로타민 설페이트, 인산수소이나트륨, 인산수소칼륨(potassium hydrogen phosphate), 염화나트륨, 콜로이드 실리카, 마그네슘 트리실리케이트, 폴리비닐 피롤리돈, 및 셀룰로오스 기반 물질을 포함한다. 국소 또는 겔 기반 형태용 담체는 다당류, 예를 들어, 소듐 카르복시메틸셀룰로오스 또는 메틸셀룰로오스, 폴리비닐피롤리돈, 폴리아크릴레이트, 폴리옥시에틸렌-폴리옥시프로필렌-블록 중합체, 폴리에틸렌 글리콜, 및 우드 왁스(wood wax) 알콜을 포함한다. 모든 투여를 위해, 통상적인 데포(depot) 형태가 적합하게 사용된다. 이러한 형태는, 예를 들어, 미세캡슐, 나노-캡슐, 리포솜, 석고, 흡입 형태, 비 스프레이, 설하정, 및 지속-방출 제조물을 포함한다.Further examples of such carriers include ion exchangers, alumina, aluminum stearate, lecithin, serum proteins such as human serum albumin, buffer substances such as glycine, sorbic acid, potassium sorbate, saturated vegetable fatty acids. Partially glyceride mixtures, water, salts, or electrolytes, such as protamine sulfate, disodium hydrogen phosphate, potassium hydrogen phosphate, sodium chloride, colloidal silica, magnesium trisilicate, polyvinyl pyrrolidone, and cellulose based Contains substances. Carriers for topical or gel-based forms include polysaccharides such as sodium carboxymethylcellulose or methylcellulose, polyvinylpyrrolidone, polyacrylates, polyoxyethylene-polyoxypropylene-block polymers, polyethylene glycols, and wood waxes ( wood wax) alcohol. For all administrations, conventional depot forms are suitably used. Such forms include, for example, microcapsules, nano-capsules, liposomes, gypsum, inhaled forms, nasal sprays, sublingual tablets, and sustained-release preparations.

생체내 투여에 사용되는 제형은 멸균되어야 한다. 이는 동결건조 및 재구성 전 또는 후에 멸균 여과막을 통한 여과에 의해 용이하게 달성된다. 제형은 전신투여되는 경우 동결건조 형태 또는 용액으로 저장될 수 있다. 동결건조 형태인 경우, 전형적으로 사용시에 적절한 희석제를 이용한 재구성을 위해 다른 성분과 조합되어 제형화된다. 액체 제형의 예는 피하 주사를 위해 단일-용량 바이얼에 충전된 멸균된 투명 무색의 보존제 비첨가 용액이다.Formulations used for in vivo administration must be sterile. This is readily accomplished by filtration through sterile filtration membranes before or after lyophilization and reconstitution. The formulations may be stored in lyophilized form or in solution when administered systemically. In lyophilized form, they are typically formulated in combination with other ingredients for reconstitution with an appropriate diluent in use. An example of a liquid formulation is a sterile clear colorless preservative unadded solution filled in single-dose vials for subcutaneous injection.

치료 제형은 일반적으로 멸균 접근 포트를 갖는 용기, 예를 들어, 피하주사 바늘에 의해 관통될 수 있는 스토퍼를 갖는 정맥내 용액 백 또는 바이얼에 위치된다. 제형은 바람직하게는 반복된 정맥내(i.v.), 피하(s.c), 근내(i.m.) 주사 또는 주입, 또는 비내 또는 폐내 전달에 적합한 에어로졸 제형으로 투여된다(폐내 전달에 대해서는, EP 257,956 참조).The therapeutic formulation is generally placed in a container with a sterile access port, for example an intravenous solution bag or vial with a stopper that can be penetrated by a hypodermic needle. The formulations are preferably administered in aerosol formulations suitable for repeated intravenous (i.v.), subcutaneous (s.c), intramuscular (i.m.) injections or infusions, or intranasal or pulmonary delivery (for pulmonary delivery, see EP 257,956).

본원에 개시된 분자는 또한 지속-방출 제조물 형태로 투여될 수 있다. 지속-방출 제조물의 적합한 예는 단백질을 함유하는 고체 소수성 중합체의 반투과성 매트릭스를 포함하며, 상기 매트릭스는 성형된 물품, 예를 들어, 필름 또는 미세캡슐의 형태로 존재한다. 지속-방출 매트릭스의 예는 폴리에스테르, 하이드로겔(예를 들어, 문헌[Langer et al ., J. Biomed. Mater. Res., 15: 167-277 (1981) 및 Langer, Chem. Tech., 12: 98-105 (1982)]에 기재된 바와 같은 폴리(2-하이드록시에틸-메타크릴레이트) 또는 폴리(비닐알콜)), 폴리락티드(U.S. Pat. No. 3,773,919, EP 58,481), L-글루탐산 및 감마 에틸-L-글루타메이트의 공중합체(Sidman et al., Biopolymers, 22: 547-556 (1983)), 분해가능하지 않은 에틸렌-비닐 아세테이트(Langer et al ., 상기), 분해가능한 락트산-글리콜산 공중합체, 예를 들어, Lupron Depot(락트산-글리콜산 공중합체 및 류프롤라이드 아세테이트로 구성된 주사용 미세구), 및 폴리-D-(-)-3-하이드록시부티르산(EP 133,988)을 포함한다.The molecules disclosed herein may also be administered in the form of sustained-release preparations. Suitable examples of sustained-release preparations include semipermeable matrices of solid hydrophobic polymers containing protein, which matrices are in the form of shaped articles, eg, films, or microcapsules. Examples of sustained-release matrices include polyesters, hydrogels (eg, Langer et. al . , J. Biomed. Mater. Res., 15: 167-277 (1981) and Langer, Chem. Tech., 12: 98-105 (1982), poly (2-hydroxyethyl-methacrylate) or poly (vinyl alcohol)), polylactide (US Pat. No. 3,773,919, EP 58,481) , Copolymers of L-glutamic acid and gamma ethyl-L-glutamate (Sidman et al , Biopolymers, 22: 547-556 (1983)), non-degradable ethylene-vinyl acetate (Langer et al . , Above), degradable lactic acid-glycolic acid copolymers such as Lupron Depot (injectable microspheres consisting of lactic acid-glycolic acid copolymer and leuprolide acetate), and poly-D-(-)-3- Hydroxybutyric acid (EP 133,988).

폴리펩티드의 생성Generation of Polypeptides

본 발명의 폴리펩티드는 폴리펩티드가 발현되는 조건하에서 폴리펩티드를 엔코딩하는 벡터로 형질전환된 숙주를 배양함으로써 임의의 적합한 표준 단백질 발현 시스템에서 발현될 수 있다. 바람직하게는, 발현 시스템은 요망되는 단백질이 용이하게 분리될 수 있는 시스템이다. 일반적 내용으로서, 원핵생물 발현 시스템이 이용가능한데, 이는 단백질의 높은 수율 및 효과적인 정제 및 재폴딩 방법이 수득될 수 있기 때문이다. 따라서, 적절한 발현 시스템(벡터 및 세포 유형을 포함함)의 선택은 당업자의 지식 범위 내이다. 유사하게, 본 발명의 폴리펩티드에 대한 일차 아미노산 서열이 선택되면, 당업자는 선택된 숙주에서의 코돈 바이어스(codon bias), 숙주에서의 분비 신호서열에 대한 필요, 신호 서열 내의 단백분해효소의 도입 등과 같은 요인을 고려하여 요망되는 아미노산 서열을 엔코딩하는 적절한 재조합 DNA 작제물을 용이하게 설계할 수 있다.Polypeptides of the invention can be expressed in any suitable standard protein expression system by culturing a host transformed with a vector encoding the polypeptide under conditions in which the polypeptide is expressed. Preferably, the expression system is a system in which the desired protein can be easily separated. As a general rule, prokaryotic expression systems are available because high yields of proteins and effective purification and refolding methods can be obtained. Thus, the selection of appropriate expression systems (including vectors and cell types) is within the knowledge of those skilled in the art. Similarly, once a primary amino acid sequence for a polypeptide of the invention is selected, those skilled in the art will recognize such factors as codon bias in the selected host, the need for secretory signal sequences in the host, the introduction of proteases in the signal sequence, and the like. In view of the above, suitable recombinant DNA constructs encoding the desired amino acid sequences can be readily designed.

일 구체예에서, 분리된 폴리누클레오티드는 IL-23R에 특이적으로 결합하는 폴리펩티드 및 삼합체화 도메인을 엔코딩한다. 일 구체예에서, 분리된 폴리누클레오티드는 IL-23R에 특이적으로 결합하는 제 1 폴리펩티드, 및 삼합체화 도메인을 엔코딩한다. 특정 구체예에서, IL-23R에 특이적으로 결합하는 폴리펩티드 및 삼합체화 도메인은 단일한 연속 폴리누클레오티드 서열(유전 융합체)로 엔코딩된다. 다른 구체예에서, IL-23R에 특이적으로 결합하는 폴리펩티드 및 삼합체화 도메인은 비-연속 폴리누클레오티드 서열에 의해 엔코딩된다. 따라서, 일부 구체예에서, IL-23R에 특이적으로 결합하는 적어도 하나의 폴리펩티드 및 삼합체화 도메인이 별개의 폴리펩티드로 발현되고, 분리되고, 정제되고, 함께 융합되어, 본 발명의 폴리펩티드가 형성된다.In one embodiment, the isolated polynucleotides encode polypeptides and trimerization domains that specifically bind to IL-23R. In one embodiment, the isolated polynucleotide encodes a first polypeptide that specifically binds IL-23R, and a trimerization domain. In certain embodiments, polypeptides and trimerization domains that specifically bind to IL-23R are encoded into a single continuous polynucleotide sequence (genetic fusion). In other embodiments, polypeptides and trimerization domains that specifically bind to IL-23R are encoded by non-contiguous polynucleotide sequences. Thus, in some embodiments, at least one polypeptide and trimerization domain that specifically binds IL-23R are expressed as separate polypeptides, isolated, purified, and fused together to form polypeptides of the invention.

이러한 재조합 DNA 작제물은 선택된 숙주에 대해 적절한 다수의 발현 벡터 중 임의의 벡터로 인-프레임(in-frame)으로 삽입될 수 있다. 특정 구체예에서, 발현 벡터는 재조합 폴리펩티드 작제물의 발현을 조절하는 강한 프로모터를 포함한다. 본 발명의 폴리펩티드를 생성시키기 위해 재조합 발현 방법이 사용되는 경우, 생성된 폴리펩티드는 당 분야에 널리 공지된 적합한 표준 절차를 이용하여 분리되고 정제될 수 있고, 임의로 추가 공정, 예를 들어, 동결건조에 적용될 수 있다.Such recombinant DNA constructs can be inserted in-frame into any of a number of expression vectors appropriate for the selected host. In certain embodiments, the expression vector comprises a strong promoter that regulates the expression of the recombinant polypeptide construct. When recombinant expression methods are used to produce the polypeptides of the invention, the resulting polypeptides can be isolated and purified using suitable standard procedures well known in the art and optionally subjected to further processing, eg, lyophilization. Can be applied.

재조합 DNA 분자, 단백질, 및 폴리펩티드 생성 뿐만 아니라 조직 배양 및 세포 형질전환을 위해 표준 기술이 이용될 수 있다. 예를 들어, 문헌[Sambrook, et al. (하기) 또는 Current Protocols in Molecular Biology (Ausubel et al ., eds., Green Publishers Inc. and Wiley and Sons 1994)]을 참조하라. 정제 기술은 전형적으로 제조업체의 명세서에 따라 수행되거나, 문헌[Sambrook et al. (Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1989)]에 기재된 것과 같은 통상적인 절차를 이용하여 당 분야에서 일반적으로 달성되는 바에 따라 수행되거나, 본원에 기재된 바와 같이 수행된다. 특정 정의가 제공되지 않는 경우, 실험 절차, 및 본원에 기재된 분자생물학, 생화학, 분석화학, 및 약학/제형 화학과 관련된 기술과 관련하여 사용되는 명명법은 당 분야에 널리 공지되고 일반적으로 사용되는 것이다. 생화학 합성, 생화학 분석, 약학 제조물, 제형, 및 전달, 및 환자의 치료를 위해 표준 기술이 이용될 수 있다.Standard techniques can be used for the production of recombinant DNA molecules, proteins, and polypeptides as well as tissue culture and cell transformation. See, eg, Sambrook, et al . (To) or Current Protocols in Molecular Biology (Ausubel et al . , eds., Green Publishers Inc. and Wiley and Sons 1994). Purification techniques are typically performed according to the manufacturer's specifications, or are described in Sambrook et. al . (Molecular Cloning: A Laboratory Manual.Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1989)) using conventional procedures as described or commonly performed in the art, or as described herein. If no specific definitions are provided, the nomenclature used in connection with the experimental procedures and techniques related to molecular biology, biochemistry, analytical chemistry, and pharmaceutical / formulation chemistry described herein are well known and commonly used in the art. Standard techniques can be used for biochemical synthesis, biochemical analysis, pharmaceutical preparations, formulations, and delivery, and treatment of patients.

유연한 분자 링커가 임의로 특정 결합 일원과 삼합체화 도메인 사이에 삽입되어, 상기 일원과 삼함체화 도메인을 공유적으로 연결시킬 수 있는 것이 인지될 것이다. 특정 구체예에서, 링커는 약 1-20개의 아미노산 잔기의 폴리펩티드 서열이다. 링커는 10개 미만의 아미노산, 가장 바람직하게는 5, 4, 3, 2, 또는 1개의 아미노산일 수 있다. 특정 경우에, 9, 8, 7 또는 6개의 아미노산이 적합할 수 있다. 유용한 구체예에서, 링커는 본질적으로 비-면역원성이고, 단백질분해성 분해되는 경향이 없고, 다른 잔기와 상호작용하는 것으로 공지된 아미노산 잔기(예를 들어, 시스테인 잔기)를 포함하지 않는다.It will be appreciated that a flexible molecular linker may optionally be inserted between a particular binding member and a trimerization domain to covalently link the member to the trimerization domain. In certain embodiments, the linker is a polypeptide sequence of about 1-20 amino acid residues. The linker may be less than 10 amino acids, most preferably 5, 4, 3, 2, or 1 amino acid. In certain cases, 9, 8, 7 or 6 amino acids may be suitable. In useful embodiments, the linker is essentially non-immunogenic, does not tend to proteolytically degrade, and does not comprise amino acid residues (eg, cysteine residues) known to interact with other residues.

하기 기재는 또한 하나 이상의 화학기에 공유적으로 부착된(이후, "컨쥬게이션된") 폴리펩티드 및 삼합체 복합체를 생성하는 방법에 관한 것이다. 상기 컨쥬게이트에 사용하기에 적합한 화학기는 바람직하게는 유의하게 독성이거나 면역원성이 아니다. 저장에 적합한 조건하에서 저장되고 사용될 수 있는 컨쥬게이트를 생성시키기 위해 화학기는 임의로 선택된다. 폴리펩티드에 컨쥬게이션될 수 있는 다양한 예시적 화학기가 당 분야에 공지되어 있고, 이는, 예를 들어, 탄수화물, 예를 들어, 당단백질에서 자연 발생하는 탄수화물, 폴리글루타메이트, 및 비-단백질성 중합체, 예를 들어, 폴리올을 포함한다(예를 들어, U.S. Pat. No. 6,245,901 참조).The following description also relates to methods for generating polypeptide and trimer complexes covalently attached (hereinafter “conjugated”) to one or more chemical groups. Chemical groups suitable for use in the conjugates are preferably not significantly toxic or immunogenic. Chemical groups are optionally selected to produce conjugates that can be stored and used under conditions suitable for storage. Various exemplary chemical groups that can be conjugated to a polypeptide are known in the art and include, for example, carbohydrates, such as carbohydrates, polyglutamate, and non-proteinaceous polymers that occur naturally in glycoproteins, for example. For example, polyols (see, eg, US Pat. No. 6,245,901).

예를 들어, 폴리올이 상기 WO 93/00109호에 개시된 바와 같은 리신 잔기를 포함하는 하나 이상의 아미노산 잔기에서 본 발명의 폴리펩티드에 컨쥬게이션될 수 있다. 사용되는 폴리올은 임의의 수용성 폴리(알킬렌 옥시드) 중합체일 수 있고, 이는 직쇄 또는 분지쇄를 가질 수 있다. 적합한 폴리올은 하나 이상의 하이드록실 위치에서 화학기, 예를 들어, 1 내지 4개의 탄소를 갖는 알킬기로 치환된 것을 포함한다. 전형적으로, 폴리올은 폴리(알킬렌 글리콜), 예를 들어, 폴리(에틸렌 글리콜)(PEG)이며, 따라서, 기재의 용이성을 위해, 논의의 나머지는 사용되는 폴리올이 PEG이고, 폴리올을 폴리펩티드에 컨쥬게이션시키는 방법이 "페길화(pegylation)"로 언급되는 예시적 구체예에 관한 것이다. 그러나, 당업자는 PEG에 대한 본원에 기재된 컨쥬게이션을 위한 기술을 이용하여 다른 폴리올, 예를 들어, 폴리(프로필렌 글리콜) 및 폴리에틸렌-폴리프로필렌 글리콜 공중합체가 이용될 수 있음을 인지한다.For example, polyols may be conjugated to a polypeptide of the invention at one or more amino acid residues comprising lysine residues as disclosed in WO 93/00109, supra. The polyol used may be any water soluble poly (alkylene oxide) polymer, which may have a straight or branched chain. Suitable polyols include those substituted at one or more hydroxyl positions with chemical groups, for example alkyl groups having from 1 to 4 carbons. Typically, the polyol is a poly (alkylene glycol), for example poly (ethylene glycol) (PEG), so for ease of substrate, the remainder of the discussion is that the polyol used is PEG and the polyol is conjugated to the polypeptide. The method of gating relates to an exemplary embodiment referred to as "pegylation." However, those skilled in the art recognize that other polyols, such as poly (propylene glycol) and polyethylene-polypropylene glycol copolymers, may be used using the techniques for conjugation described herein for PEG.

Apo-2L의 페길화에서 사용되는 PEG의 평균 분자량은 다양할 수 있고, 이는 전형적으로 약 500 내지 약 30,000 달톤(D) 범위일 수 있다. 바람직하게는, PEG의 평균 분자량은 약 1,000 내지 약 25,000 D, 더욱 바람직하게는 약 1,000 내지 약 5,000 D이다. 일 구체예에서, 페길화는 약 1,000 D의 평균 분자량을 갖는 PEG로 수행된다. 임의로, PEG 동종중합체는 비치환되나, 이는 또한 한 말단에서 알킬기로 치환될 수 있다. 바람직하게는, 알킬기는 C1-C4 알킬기, 가장 바람직하게는 메틸기이다. PEG 제조물은 시판되며, 전형적으로 본 발명에 사용하기에 적합한 PEG 제조물은 평균 분자량에 따라 시판되는 비동종성 제조물이다. 예를 들어, 시판되는 PEG(5000) 제조물은 보통 ±500 D으로 분자량이 약간 상이한 분자를 전형적으로 함유한다. 본 발명의 폴리펩티드는 당 분야에 공지된 기술을 이용하여 추가로 변형될 수 있고, 예를 들어, 수분자 화합물(예를 들어, 화학요법제)에 컨쥬게이션되거나; 신호 분자(예를 들어, 형광단)에 컨쥬게이션되거나; 특정 결합쌍의 분자(예를 들어, 비오틴/스트렙타비딘, 항체/항원)에 컨쥬게이션되거나; 당화, 페길화, 또는 안정화 도메인(예를 들어, Fc 도메인)에 대한 추가 융합에 의해 안정화될 수 있다.The average molecular weight of PEG used in the PEGylation of Apo-2L can vary, typically in the range of about 500 to about 30,000 Daltons (D). Preferably, the average molecular weight of PEG is about 1,000 to about 25,000 D, more preferably about 1,000 to about 5,000 D. In one embodiment, PEGylation is performed with PEG with an average molecular weight of about 1,000 D. Optionally, the PEG homopolymer is unsubstituted, but it may also be substituted at one end with an alkyl group. Preferably, the alkyl group is a C1-C4 alkyl group, most preferably a methyl group. PEG preparations are commercially available and typically suitable PEG preparations for use in the present invention are non-homogeneous preparations which are commercially available according to average molecular weight. Commercially available PEG (5000) preparations, for example, typically contain molecules that differ slightly in molecular weight, usually at ± 500 D. Polypeptides of the invention can be further modified using techniques known in the art, for example, conjugated to moisturized compounds (eg, chemotherapeutic agents); Conjugated to a signal molecule (eg, a fluorophore); Conjugated to a specific binding pair of molecules (eg biotin / streptavidin, antibody / antigen); Stabilization by glycosylation, PEGylation, or further fusion to stabilizing domains (eg, Fc domains).

단백질을 페길화시키는 다양한 방법이 당 분야에 공지되어 있다. PEG에 컨쥬게이션된 단백질을 생성시키는 특정 방법은 미국 특허 번호 4,179,337호, 4,935,465호 및 5,849,535호에 기재된 방법을 포함한다. 전형적으로, 단백질은 주로 반응 조건, 중합체의 분자량 등에 따라 단백질의 아미노산 잔기 중 하나 이상을 통해 중합체의 말단 반응기에 공유적으로 결합된다. 반응기(들)을 갖는 중합체는 본원에서 활성화된 중합체로 기재된다. 반응기는 단백질의 자유 아미노기 또는 다른 반응기와 선택적으로 반응한다. PEG 중합체는 무작위 또는 부위 특이적 방식으로 단백질의 아미노기 또는 다른 반응기에 커플링될 수 있다. 그러나, 최적의 결과를 수득하기 위한 선택된 반응기의 유형 및 양 뿐만 아니라 이용되는 중합체의 유형은 반응기가 단백질의 너무 많은 특정 활성기와 반응하는 것을 피하기 위해 사용되는 특정 단백질 또는 단백질 변이체에 좌우되는 것이 이해될 것이다. 이를 완전히 피하는 것이 가능하지 않을 수 있으므로, 단백질 농도에 따라 단백질 몰(mole) 당 일반적으로 약 0.1 내지 1000 몰, 바람직하게는 2 내지 200 몰의 활성화된 중합체가 사용되는 것이 권고된다. 단백질의 몰 당 활성화된 중합체의 최종량은 최적 활성을 유지하기 위한 균형이며, 동시에 가능한 경우 단백질의 순환 반감기를 최적화시키는 양이다.Various methods of PEGylating proteins are known in the art. Particular methods for generating proteins conjugated to PEG include those described in US Pat. Nos. 4,179,337, 4,935,465, and 5,849,535. Typically, a protein is covalently bound to the terminal reactor of the polymer via one or more of the amino acid residues of the protein, primarily depending on reaction conditions, molecular weight of the polymer, and the like. Polymers with reactor (s) are described herein as activated polymers. The reactor selectively reacts with free amino groups or other reactors of the protein. PEG polymers can be coupled to amino groups or other reactors of a protein in a random or site specific manner. However, it will be understood that the type and amount of reactor selected as well as the type of polymer employed to obtain optimal results depend on the particular protein or protein variant used to avoid the reactor reacting with too many specific activators of the protein. will be. Since it may not be possible to avoid this completely, it is recommended to use about 0.1 to 1000 moles, preferably 2 to 200 moles of activated polymer per mole of protein, depending on the protein concentration. The final amount of activated polymer per mole of protein is a balance to maintain optimal activity, while at the same time an amount that optimizes the circulating half-life of the protein.

본원에서 사용되는 경우 용어 "폴리올"은 대체로 다가알콜 화합물을 의미한다. 폴리올은, 예를 들어, 임의의 수용성 폴리(알킬렌 옥시드) 중합체일 수 있고, 직쇄 또는 분지쇄를 가질 수 있다. 바람직한 폴리올은 하나 이상의 하이드록실 위치에서 화학기, 예를 들어, 1 내지 4개의 탄소를 갖는 알킬기로 치환된 폴리올을 포함한다. 전형적으로, 폴리올은 폴리(알킬렌 글리콜), 바람직하게는 폴리(에틸렌 글리콜)(PEG)이다. 그러나, 당업자는 PEG에 대해 본원에 기재된 컨쥬게이션을 위한 기술을 이용하여 다른 폴리올, 예를 들어, 폴리(프로필렌 글리콜) 및 폴리에틸렌-폴리프로필렌 글리콜 공중합체가 이용될 수 있음을 인지한다. 본 발명의 폴리올은 당 분야에 널리 공지된 폴리올 및 시판 업체로부터와 같이 공적으로 이용가능한 폴리올을 포함한다.As used herein, the term "polyol" generally means a polyhydric alcohol compound. The polyol may be, for example, any water soluble poly (alkylene oxide) polymer and may have a straight or branched chain. Preferred polyols include polyols substituted at one or more hydroxyl positions with a chemical group, for example an alkyl group having from 1 to 4 carbons. Typically, the polyol is poly (alkylene glycol), preferably poly (ethylene glycol) (PEG). However, one of ordinary skill in the art recognizes that other polyols, such as poly (propylene glycol) and polyethylene-polypropylene glycol copolymers, may be used using the techniques for conjugation described herein for PEG. The polyols of the present invention include polyols well known in the art and publicly available polyols such as those from commercial suppliers.

또한, 혈청 알부민-결합 펩티드, IgG-결합 펩티드 또는 FcRn에 결합하는 펩티드를 포함하는 다른 반감기 연장 분자가 삼합체화 도메인의 N- 또는 C-말단에 부착될 수 있다.In addition, other half-life extending molecules, including serum albumin-binding peptides, IgG-binding peptides or peptides that bind FcRn, can be attached to the N- or C-terminus of the trimerization domain.

섹션 제목은 단지 구성 목적을 위해 본원에서 사용되며, 어떤 방식으로든 기재된 주제를 제한하는 것으로 해석되어선 안되는 것이 인지되어야 한다. 본원에 인용된 모든 참고문헌은 모든 목적상 전체내용이 참조로서 포함된다.It should be appreciated that section headings are used herein for organizational purposes only and should not be construed as limiting the subject matter described in any way. All references cited herein are hereby incorporated by reference in their entirety for all purposes.

하기 기재되는 실시예는 단지 본 발명의 특정 구체예의 예시로, 하기 첨부되는 청구항에 의해 규정되는 본 발명을 제한하는 것으로 해석되어선 안된다.The examples set forth below are merely illustrative of specific embodiments of the invention and should not be construed as limiting the invention as defined by the appended claims below.

실시예Example

하기 실시예에 논의되는 벡터(pANA)는 이전에 기재된 벡터로부터 유래된다[US 2007/0275393 참조]. 특정 벡터 서열은 서열 목록에 제공되며, 당업자는 설명이 본원에 제공되는 경우 벡터를 획득할 수 있을 것이다. pPhCPAB 파지 디스플레이 벡터(SEQ ID NO:150)는 링커를 이용하여 ALQT(등)를 엔코딩하는 hTN 서열에 융합된 gⅢ 신호 펩티드 코딩 영역을 갖는다. CTLD 영역의 C-말단은 링커를 통해 나머지 gⅢ 코딩 영역에 융합된다. CTLD 영역 내에서, 코딩 서열을 변경하지 않았으나, 변경된 루프 영역을 함유하는 PCR 단편을 클로닝하기에 적합한 제한 부위를 발생시킨 누클레오티드 돌연변이를 생성시켰다. 상기 제한 부위 사이에서 루프 영역 부분을 제거하여, 모든 라이브러리 파지가 재조합체만 발현할 수 있고, 야생형 테트라넥틴을 발현하지 않도록 하였다. 뮤린 TN CTLD 파지 디스플레이 벡터를 유사하게 설계하였다. 이러한 벡터의 또 다른 구체예는 유전자 Ⅲ C-말단 영역이 트렁케이션되고, hTN 코딩 서열의 말단에서의 억제가능한 정지 코돈이 글루타민을 엔코딩하도록 변경된 pANA27(SEQ ID NO:164)이다. 뮤린 벡터 pANA28(SEQ ID NO:165)을 유사한 방식으로 작제하였다.The vectors (pANA) discussed in the examples below are derived from the previously described vectors (see US 2007/0275393). Particular vector sequences are provided in the Sequence Listing, and one of ordinary skill in the art would be able to obtain the vectors if a description is provided herein. The pPhCPAB phage display vector (SEQ ID NO: 150) has a gIII signal peptide coding region fused to an hTN sequence encoding ALQT (etc.) using a linker. The C-terminus of the CTLD region is fused to the remaining gIII coding region via a linker. Within the CTLD region, a nucleotide mutation was generated that did not alter the coding sequence but resulted in a restriction site suitable for cloning the PCR fragment containing the altered loop region. The region of the loop region was removed between the restriction sites so that all library phages could express only the recombinant and not the wild type tetranectin. Murine TN CTLD phage display vectors were similarly designed. Another embodiment of this vector is pANA27 (SEQ ID NO: 164) wherein the gene III C-terminal region is truncated and the inhibitory stop codon at the end of the hTN coding sequence is modified to encode glutamine. Murine vector pANA28 (SEQ ID NO: 165) was constructed in a similar manner.

실시예 1Example 1

라이브러리 작제: 루프 1의 돌연변이 및 연장Library Construction: Mutations and Extensions of Loop 1

인간 테트라넥틴의 누클레오티드 및 아미노산 서열, 및 루프 1, 2, 3, 4, 및 5(LSB)의 위치가 도 9에 제시되어 있다. 인간 테트라넥틴 C-타입 렉틴 결합 도메인의 1-2 연장된 라이브러리("인간 1-2X")에 대해, 루프 1에 대한 코딩 서열을 표 2에 제시된 서열을 엔코딩하도록 변형시켰고, 여기서 5개의 아미노산 AAEGT(SEQ ID NO:)를 누클레오티드 NNK NNK NNK NNK NNK NNK NNK(SEQ ID NO:)(여기서, N은 A, C, G, 또는 T를 나타내고; K는 G 또는 T를 나타냄)에 의해 엔코딩되는 7개의 무작위 아미노산으로 치환시켰다. 루프 2 바로 다음에 존재하는 아미노산 아르기닌을 또한 코딩 가닥 내의 누클레오티드 NNK를 이용하여 충분히 무작위화시켰다. 이러한 아미노산을 무작위화시켰는데, 이는 아르기닌이 루프 1의 아미노산과 접촉하고, 루프 1 무작위화에 의해 달성될 수 있는 형태를 한정할 수 있기 때문이다. 또한, 루프 4 리신에 좌우되는 것으로 밝혀진 플라스미노겐 결합을 폐기시키기 위해 루프 4에 대한 코딩 서열을 리신 148(K) 대신 알라닌(A)을 엔코딩하도록 변경시켰다(Graversen et al, 1998).The nucleotide and amino acid sequences of human tetranectin and the positions of loops 1, 2, 3, 4, and 5 (LSB) are shown in FIG. 9. For 1-2 extended libraries of human tetranectin C-type lectin binding domains (“human 1-2X”), the coding sequence for loop 1 was modified to encode the sequence set forth in Table 2, wherein the five amino acid AAEGT (SEQ ID NO :) is 7 encoded by the nucleotide NNK NNK NNK NNK NNK NNK NNK (SEQ ID NO :), where N represents A, C, G, or T; K represents G or T Were substituted with four random amino acids. The amino acid arginine present immediately after loop 2 was also sufficiently randomized using the nucleotide NNK in the coding strand. These amino acids were randomized because arginine may contact the amino acids of loop 1 and define the forms that can be achieved by loop 1 randomization. In addition, the coding sequence for loop 4 was altered to encode alanine (A) instead of lysine 148 (K) to discard plasminogen binding found to be dependent on loop 4 lysine (Graversen et al, 1998).

표 2Table 2

인간 테트라넥틴(TN)으로부터의 루프 영역의 아미노산.Amino acids of loop regions from human tetranectin (TN).

괄호는 인접하여 있으나, 루프의 부분으로 간주되지 않는 아미노산을 나타낸다.Parentheses denote amino acids that are contiguous but not considered part of the loop.

X = 임의의 아미노산.X = any amino acid.

Figure pct00010
Figure pct00010

인간 루프 1 연장 라이브러리를 다음과 같은 방식으로 중첩 PCR을 이용하여 생성시켰다(프라이머 서열은 표 3에 제시됨). 프라이머 1Xfor(SEQ ID NO:) 및 1Xrev(SEQ ID NO:)를 혼합시키고, PCR에 의해 연장시키고, 프라이머 BstX1for(SEQ ID NO:) 및 PstBssRevC(SEQ ID NO:)를 혼합시키고, PCR에 의해 연장시켰다. 생성된 단편을 겔로부터 정제시키고, 혼합시키고, 외부 프라이머 Bglfor12(SEQ ID NO:) 및 PstRev(SEQ ID NO:)의 존재하에서 PCR에 의해 연장시켰다. 생성된 단편을 겔 정제시키고, BglⅡ 및 PstⅠ으로 절단하고, 파지 디스플레이 벡터 pPhCPAB 또는 pANA27에 클로닝시켰다. 파지 디스플레이 벡터 pPhCPAB를 pCANTAB(Pharmacia)으로부터 유도하였고, 이는 M13 유전자 Ⅲ 단백질에 융합된 인간 테트라넥틴 CTLD의 부분을 함유하였다. CTLD 영역을 루프 1-4의 측면에 존재하는 BglⅡ 및 PstⅠ 제한 효소 부위를 포함하도록 변형시키고, 1-4 영역을 정지 코돈을 포함하도록 변경시켜, 인-프레임 삽입물의 라이게이션 없이 벡터로부터 기능적 유전자 Ⅲ 단백질이 생성될 수 없도록 하였다. BamHⅠ 내지 ClaⅠ 영역을 SEQ ID NO:164(pANA27)의 BamHⅠ 내지 ClaⅠ 서열로 대체시켜 pPhCPAB로부터 pANA27을 유도하였다. 이는 앰버(amber) 억제가능 정지 코돈을 글루타민 코돈으로 대체시키고, 유전자 Ⅲ의 아미노 말단 영역을 트렁케이션시킨다.Human loop 1 extension libraries were generated using overlapping PCR in the following manner (primer sequences are shown in Table 3). Primers 1Xfor (SEQ ID NO :) and 1Xrev (SEQ ID NO :) are mixed and extended by PCR, primers BstX1for (SEQ ID NO :) and PstBssRevC (SEQ ID NO :) are mixed and extended by PCR I was. The resulting fragments were purified from the gel, mixed and extended by PCR in the presence of external primers Bglfor12 (SEQ ID NO :) and PstRev (SEQ ID NO :). The resulting fragments were gel purified, digested with BglII and PstI and cloned into phage display vector pPhCPAB or pANA27. Phage display vector pPhCPAB was derived from pCANTAB (Pharmacia), which contained a portion of human tetranectin CTLD fused to M13 gene III protein. The CTLD region is modified to include the BglII and PstI restriction enzyme sites present on the sides of loops 1-4, and the 1-4 regions are modified to include stop codons to allow functional gene III from the vector without ligation of the in-frame insert. The protein could not be produced. The pANA27 was derived from pPhCPAB by replacing the BamHI to ClaI regions with the BamHI to ClaI sequences of SEQ ID NO: 164 (pANA27). This replaces the amber inhibitory stop codon with glutamine codons and truncates the amino terminal region of gene III.

라이게이션된 물질을 전기적격(electrocompetent) XL1-Blue E. 콜리(E. coli)(Stratagene)로 형질전환시키고, 4 내지 8리터의 세포를 밤새 성장시키고, DNA를 분리시켜 패닝을 위한 마스터 라이브러리 DNA 스톡을 생성시켰다. 1.5 x 108의 라이브러리 크기를 수득하였고, 시험된 클론은 표적화된 영역에서 다양한 서열을 나타내었다.The ligated material is transformed with electrocompetent XL1-Blue E. coli (Stratagene), 4-8 liters of cells are grown overnight, and DNA is isolated to master library DNA for panning. Stock was created. A library size of 1.5 × 10 8 was obtained and the clones tested showed various sequences in the targeted region.

표 3TABLE 3

파지 디스플레이된 C-타입 렉틴 도메인 라이브러리의 생성에 사용된 서열.Sequence used for generation of phage displayed C-type lectin domain library.

M = A 또는 C; N = A, C, G, 또는 T; K = G 또는 T; S = G 또는 C; W = A 또는 T.M = A or C; N = A, C, G, or T; K = G or T; S = G or C; W = A or T.

Figure pct00011
Figure pct00011

Figure pct00012
Figure pct00012

실시예 2Example 2

라이브러리 작제: 루프 1 및 2의 돌연변이Library Construction: Mutations in Loops 1 and 2

인간 테트라넥틴 C-타입 렉틴 결합 도메인의 루프 1-2 라이브러리("인간 1-2")에 대해, 루프 1에 대한 코딩 서열을 표 2에 제시된 서열을 엔코딩하도록 변형시켰고, 여기서 5개의 아미노산 AAEGT(SEQ ID NO:; 인간)를 누클레오티드 NNK NNK NNK NNK NNK(SEQ ID NO:)(여기서, N은 A, C, G, 또는 T를 나타내고; K는 G 또는 T를 나타냄)에 의해 엔코딩되는 5개의 무작위 아미노산으로 대체시켰다. 루프 2(이웃하는 아르기닌을 포함함)에서, 인간의 4개의 아미노산 TGAR을 누클레오티드 NNK NNK NNK NNK(SEQ ID NO:)에 의해 엔코딩되는 4개의 무작위 아미노산으로 대체시켰다. 또한, 루프 4 리신에 좌우되는 것으로 밝혀진 플라스미노겐 결합을 폐기시키기 위해 루프의 리신(K) 대신 알라닌(A)을 엔코딩하도록 변경시켰다(Graversen et al, 1998).For the loop 1-2 library of human tetranectin C-type lectin binding domains (“human 1-2”), the coding sequence for loop 1 was modified to encode the sequence set forth in Table 2, wherein the five amino acid AAEGT ( SEQ ID NO :; human) is encoded by the nucleotide NNK NNK NNK NNK NNK (SEQ ID NO :), where N represents A, C, G, or T; K represents G or T Replaced with random amino acids. In loop 2 (including neighboring arginine), the human 4 amino acid TGAR was replaced with 4 random amino acids encoded by the nucleotide NNK NNK NNK NNK (SEQ ID NO :). It was also modified to encode alanine (A) instead of lysine (K) in the loop to discard plasminogen binding found to be dependent on loop 4 lysine (Graversen et al, 1998).

인간 1-2 라이브러리를 다음과 같은 방식으로 중첩 PCR을 이용하여 생성시켰다(프라이머 서열은 표 3에 제시됨). 프라이머 1-2 for(SEQ ID NO:) 및 1-2 rev(SEQ ID NO:)를 혼합시키고, PCR에 의해 연장시켰다. 생성된 단편을 겔로부터 정제시키고, 혼합시키고, 외부 프라이머 Bglfor12(SEQ ID NO:) 및 PstRev12(SEQ ID NO:)의 존재하에서 PCR에 의해 연장시켰다. 생성된 단편을 겔 정제시키고, BglⅡ 및 PstⅠ으로 절단하고, 상기 기재된 것과 유사하게 절단된 파지 디스플레이 벡터 pPhCPAB 또는 pANA27에 클로닝시켰다. 4.86 x 108의 라이브러리 크기를 수득하였고, 시험된 클론은 표적화된 영역에서 다양한 서열을 나타내었다.Human 1-2 libraries were generated using overlapping PCR in the following manner (primer sequences are shown in Table 3). Primers 1-2 for (SEQ ID NO :) and 1-2 rev (SEQ ID NO :) were mixed and extended by PCR. The resulting fragment was purified from the gel, mixed and extended by PCR in the presence of the external primers Bglfor12 (SEQ ID NO :) and PstRev12 (SEQ ID NO :). The resulting fragments were gel purified, cleaved with BglII and PstI and cloned into cleaved phage display vectors pPhCPAB or pANA27 similar to those described above. A library size of 4.86 x 10 8 was obtained and the clones tested showed various sequences in the targeted region.

실시예 3Example 3

라이브러리 작제: 루프 1 및 4의 돌연변이 및 연장Library Construction: Mutations and Extensions of Loops 1 and 4

인간 C-타입 렉틴 결합 도메인의 루프 1-4 라이브러리("인간 1-4")에 대해, 루프 1에 대한 코딩 서열을 표 2에 제시된 서열을 엔코딩하도록 변형시켰고, 여기서 인간에 대한 7개의 아미노산 DMAAEGT(SEQ ID NO:)를 누클레오티드 NNS NNS NNS NNS NNS NNS NNS(SEQ ID NO:)(여기서, N은 A, C, G, 또는 T를 나타내고; S는 G 또는 C를 나타냄)에 의해 엔코딩되는 7개의 무작위 아미노산으로 치환시켰다. 또한, 루프 4에 대한 코딩 서열을 표 1에 제시된 서열을 엔코딩하도록 변형 및 연장시켰고, 여기서 인간에 대한 루프 4의 2개의 아미노산 KT를 인간에 대한 누클레오티드 NNS NNS NNS NNS NNS(SEQ ID NO:)에 의해 엔코딩되는 5개의 무작위 아미노산으로 대체시켰다.For the loop 1-4 library of human C-type lectin binding domains (“human 1-4”), the coding sequence for loop 1 was modified to encode the sequence set forth in Table 2, wherein the seven amino acid DMAAEGT for human (SEQ ID NO :) is expressed as nucleotides NNS NNS NNS NNS NNS NNS NNS (SEQ ID NO :), where N represents A, C, G, or T; S represents G or C Were substituted with four random amino acids. In addition, the coding sequence for loop 4 was modified and extended to encode the sequence shown in Table 1, wherein the two amino acids KT of loop 4 for humans were nucleotide NNS NNS NNS NNS NNS (SEQ ID NO :) for humans It was replaced by five random amino acids encoded by.

인간 1-4 라이브러리를 다음과 같은 방식으로 중첩 PCR을 이용하여 생성시켰다(프라이머 서열은 표 3에 제시됨). 프라이머 BglBssfor(SEQ ID NO:) 및 BssBglrev(SEQ ID NO:)를 혼합시키고, PCR에 의해 연장시키고, 프라이머 BssPstfor(SEQ ID NO:) 및 PstBssRev(SEQ ID NO:)를 혼합시키고, PCR에 의해 연장시켰다. 생성된 단편을 겔로부터 정제시키고, 혼합시키고, 외부 프라이머 Bglfor(SEQ ID NO:) 및 PstRev(SEQ ID NO:)의 존재하에서 PCR에 의해 연장시켰다. 생성된 단편을 겔 정제시키고, BglⅡ 및 PstⅠ 제한효소로 절단하고, 상기 기재된 것과 유사하게 절단된 파지 디스플레이 벡터 pPhCPAB 또는 pANA27에 클로닝시켰다. 2 x 109의 라이브러리 크기를 수득하였고, 시험된 12개의 클론은 표적화된 영역에서 다양한 서열을 나타내었다.Human 1-4 libraries were generated using overlapping PCR in the following manner (primer sequences are shown in Table 3). Mix primers BglBssfor (SEQ ID NO :) and BssBglrev (SEQ ID NO :), extend by PCR, mix primers BssPstfor (SEQ ID NO :) and PstBssRev (SEQ ID NO :), extend by PCR I was. The resulting fragment was purified from the gel, mixed and extended by PCR in the presence of the external primers Bglfor (SEQ ID NO :) and PstRev (SEQ ID NO :). The resulting fragments were gel purified, digested with BglII and PstI restriction enzymes and cloned into the digested phage display vector pPhCPAB or pANA27 similar to that described above. A library size of 2 × 10 9 was obtained and the 12 clones tested showed various sequences in the targeted region.

실시예 4Example 4

라이브러리 작제: 루프 3 및 4의 돌연변이 및 연장Library Construction: Mutations and Extensions of Loops 3 and 4

인간 C-타입 렉틴 결합 도메인의 루프 3-4 연장 라이브러리("인간 3-4X")에 대해, 루프 3에 대한 코딩 서열을 표 2에 제시된 서열을 엔코딩하도록 변형시켰고, 여기서 인간 테트라넥틴의 3개의 아미노산 EIT를 코딩 가닥의 누클레오티드 NNK NNK NNK NNK NNK NNK (SEQ ID NO:)(여기서, N은 A, C, G, 또는 T를 나타내고; K는 G 또는 T를 나타냄)에 의해 엔코딩되는 6개의 무작위 아미노산으로 대체시켰다. 또한, 루프 4에서, 인간에서의 3개의 아미노산 KTE를 누클레오티드 NNK NNK NNK NNK NNK NNK(SEQ ID NO:)에 의해 엔코딩되는 6개의 무작위 아미노산으로 대체시켰다.For the loop 3-4 extension library ("human 3-4X") of the human C-type lectin binding domain, the coding sequence for loop 3 was modified to encode the sequence set forth in Table 2, wherein the three of human tetranectin 6 random encoded amino acids EIT by the nucleotide NNK NNK NNK NNK NNK NNK (SEQ ID NO :) of the coding strand, where N represents A, C, G, or T; K represents G or T Replaced with amino acids. In addition, in loop 4, three amino acids KTE in humans were replaced with six random amino acids encoded by the nucleotides NNK NNK NNK NNK NNK NNK (SEQ ID NO :).

인간 3-4 연장 라이브러리를 다음과 같은 방식으로 중첩 PCR을 이용하여 생성시켰다(프라이머 서열은 표 3에 제시됨). 프라이머 H Loop 1-2-F(SEQ ID NO:) 및 H Loop 3-4 Ext-R(SEQ ID NO:)을 혼합시키고, PCR에 의해 연장시키고, 프라이머 H Loop 3-4 Ext-F(SEQ ID NO:) 및 H Loop 5-R(SEQ ID NO:)을 혼합시키고, PCR에 의해 연장시켰다. 생성된 단편을 겔로부터 정제시키고, 혼합시키고, 추가 H Loop 1-2-F(SEQ ID NO:) 및 H Loop 5-R(SEQ ID NO:)의 존재하에서 PCR에 의해 연장시켰다. 생성된 단편을 겔 정제시키고, BglⅡ 및 PstⅠ 제한효소로 절단하고, 상기 기재된 것과 유사하게 절단된 파지 디스플레이 벡터 pPhCPAB 또는 pANA27에 클로닝시켰다. 7.9 x 108의 라이브러리 크기를 수득하였고, 시험된 클론은 표적화된 영역에서 다양한 서열을 나타내었다.Human 3-4 extension libraries were generated using overlapping PCR in the following manner (primer sequences are shown in Table 3). Primer H Loop 1-2-F (SEQ ID NO :) and H Loop 3-4 Ext-R (SEQ ID NO :) are mixed, extended by PCR, and primer H Loop 3-4 Ext-F (SEQ ID NO :) and H Loop 5-R (SEQ ID NO :) were mixed and extended by PCR. The resulting fragments were purified from the gel, mixed and extended by PCR in the presence of additional H Loop 1-2-F (SEQ ID NO :) and H Loop 5-R (SEQ ID NO :). The resulting fragments were gel purified, digested with BglII and PstI restriction enzymes and cloned into the digested phage display vector pPhCPAB or pANA27 similar to that described above. A library size of 7.9 × 10 8 was obtained and the clones tested showed various sequences in the targeted region.

실시예 5Example 5

라이브러리 작제: 루프 3 및 4의 돌연변이 및 이러한 루프 사이의 PROLibrary Construction: Mutations in Loops 3 and 4 and PRO Between These Loops

인간 테트라넥틴 C-타입 렉틴 결합 도메인의 루프 3-4 콤보(combo) 라이브러리("인간 3-4 콤보")에 대해, 루프 3 및 4에 대한 코딩 서열 및 이러한 2개의 루프 사이의 프롤린을 표 2에 제시된 서열을 엔코딩하도록 변경시켰고, 여기서 인간 서열 TEITAQPDGGKTE (SEQ ID NO:)를 13개의 아미노산 서열 XXXXXXXXGGXXX(SEQ ID NO:)로 대체시켰고, 여기서 X는 서열 NNK(N은 A, C, G, 또는 T를 나타내고; K는 G 또는 T를 나타냄)에 의해 엔코딩되는 무작위 아미노산이다.For the loop 3-4 combo library ("human 3-4 combo") of the human tetranectin C-type lectin binding domain, the coding sequences for loops 3 and 4 and the proline between these two loops are shown in Table 2 Was modified to encode the sequence shown in which human sequence TEITAQPDGGKTE (SEQ ID NO :) was replaced with the 13 amino acid sequence XXXXXXXXGGXXX (SEQ ID NO :), where X is the sequence NNK (N is A, C, G, or T represents; K represents G or T).

인간 3-4 콤보 라이브러리를 다음과 같은 방식으로 중첩 PCR을 이용하여 생성시켰다(프라이머 서열은 표 3에 제시됨). 프라이머 H Loop 1-2-F(SEQ ID NO:) 및 H Loop 3-4 콤보-R(SEQ ID NO:)을 혼합시키고, PCR에 의해 연장시키고, 생성된 단편을 겔로부터 정제시키고, 혼합시키고, 추가 H Loop 1-2-F(SEQ ID NO:) 및 H Loop 5-R(SEQ ID NO:)의 존재하에서 PCR에 의해 연장시켰다. 생성된 단편을 겔 정제시키고, BglⅡ 및 PstⅠ 제한효소로 절단하고, 상기 기재된 것과 유사하게 절단된 파지 디스플레이 벡터 pPhCPAB 또는 pANA27에 클로닝시켰다. 4.95 x 109의 라이브러리 크기를 수득하였고, 시험된 클론은 표적화된 영역에서 다양한 서열을 나타내었다.Human 3-4 combo libraries were generated using overlapping PCR in the following manner (primer sequences are shown in Table 3). Primers H Loop 1-2-F (SEQ ID NO :) and H Loop 3-4 Combo-R (SEQ ID NO :) are mixed, extended by PCR, and the resulting fragments are purified from the gel, mixed and And extended by PCR in the presence of additional H Loop 1-2-F (SEQ ID NO :) and H Loop 5-R (SEQ ID NO :). The resulting fragments were gel purified, digested with BglII and PstI restriction enzymes and cloned into the digested phage display vector pPhCPAB or pANA27 similar to that described above. A library size of 4.95 × 10 9 was obtained and the clones tested showed various sequences in the targeted region.

실시예 6Example 6

라이브러리 작제: 루프 4의 돌연변이 및 연장Library Construction: Mutations and Extensions of Loop 4

인간 테트라넥틴 C-타입 렉틴 결합 도메인의 루프 4 연장 라이브러리("인간 4")에 대해, 루프 4에 대한 코딩 서열을 표 2에 제시된 서열을 엔코딩하도록 변형시켰고, 여기서 인간 테트라넥틴의 3개의 아미노산 KTE를 누클레오티드 NNK NNK NNK NNK NNK NNK NNK(SEQ ID NO:)(여기서, N은 A, C, G, 또는 T를 나타내고; K는 G 또는 T를 나타냄)에 의해 엔코딩되는 7개의 무작위 아미노산으로 대체시켰다.For the loop 4 extension library ("human 4") of the human tetranectin C-type lectin binding domain, the coding sequence for loop 4 was modified to encode the sequence set forth in Table 2, wherein the three amino acid KTE of human tetranectin Was replaced with seven random amino acids encoded by nucleotides NNK NNK NNK NNK NNK NNK NNK (SEQ ID NO :), where N represents A, C, G, or T; K represents G or T .

인간 4 연장 라이브러리를 다음과 같은 방식으로 중첩 PCR을 이용하여 생성시켰다(프라이머 서열은 표 3에 제시됨). 프라이머 H Loop 1-2-F(SEQ ID NO:) 및 H Loop 3-R(SEQ ID NO:)을 혼합시키고, PCR에 의해 연장시키고, 프라이머 H Loop 4 Ext-F(SEQ ID NO:) 및 H Loop 5-R(SEQ ID NO:)을 혼합시키고, PCR에 의해 연장시켰다. 생성된 단편을 겔로부터 정제시키고, 혼합시키고, 추가 H Loop 1-2-F(SEQ ID NO:) 및 H Loop 5-R(SEQ ID NO:)의 존재하에서 PCR에 의해 연장시켰다. 생성된 단편을 겔 정제시키고, BglⅡ 및 PstⅠ 제한효소로 절단하고, 상기 기재된 것과 유사하게 절단된 파지 디스플레이 벡터 pPhCPAB 또는 pANA27에 클로닝시켰다. 2.7 x 109의 라이브러리 크기를 수득하였고, 시험된 클론은 표적화된 영역에서 다양한 서열을 나타내었다.Human 4 extension libraries were generated using overlapping PCR in the following manner (primer sequences are shown in Table 3). Primer H Loop 1-2-F (SEQ ID NO :) and H Loop 3-R (SEQ ID NO :) are mixed, extended by PCR, primer H Loop 4 Ext-F (SEQ ID NO :) and H Loop 5-R (SEQ ID NO :) was mixed and extended by PCR. The resulting fragments were purified from the gel, mixed and extended by PCR in the presence of additional H Loop 1-2-F (SEQ ID NO :) and H Loop 5-R (SEQ ID NO :). The resulting fragments were gel purified, digested with BglII and PstI restriction enzymes and cloned into the digested phage display vector pPhCPAB or pANA27 similar to that described above. A library size of 2.7 × 10 9 was obtained and the clones tested showed various sequences in the targeted region.

실시예 7Example 7

라이브러리 작제: 루프 3의 연장이 있는 돌연변이 및 루프 3의 연장이 없는 돌연변이Library Construction: Mutations with Extension of Loop 3 and Mutations without Extension of Loop 3

인간 테트라넥틴 C-타입 렉틴 결합 도메인의 루프 3 변경 라이브러리에 대해, 루프 3에 대한 코딩 서열을 표 2에 제시된 서열을 엔코딩하도록 변형시켰고, 여기서 인간의 6개의 아미노산 ETEITA(SEQ ID NO:)를 누클레오티드 NNK NNK NNK NNK NNK NNK(SEQ ID NO:), NNK NNK NNK NNK NNK NNK NNK(SEQ ID NO:), 및 NNK NNK NNK NNK NNK NNK NNK NNK(SEQ ID NO:)(여기서, N은 A, C, G, 또는 T를 나타내고; K는 G 또는 T를 나타냄)에 의해 엔코딩되는 6, 7, 또는 8개의 무작위 아미노산으로 대체시켰다. 또한, 루프 4에서, 인간의 3개의 아미노산 KTE를 누클레오티드 NNK NNK NNK NNK NNK NNK(SEQ ID NO:)에 의해 엔코딩되는 6개의 무작위 아미노산으로 대체시켰다. 또한, 루프 4 리신에 좌우되는 것으로 밝혀진 플라스미노겐 결합을 폐기시키기 위해 루프 4에 대한 코딩 서열을 리신(K) 대신 알라닌(A)을 엔코딩하도록 변경시켰다(Graversen et al, 1998).For the loop 3 alteration library of human tetranectin C-type lectin binding domains, the coding sequence for loop 3 was modified to encode the sequence shown in Table 2, wherein the human 6 amino acid ETEITA (SEQ ID NO :) nucleotides NNK NNK NNK NNK NNK NNK (SEQ ID NO :), NNK NNK NNK NNK NNK NNK NNK (SEQ ID NO :), and NNK NNK NNK NNK NNK NNK NNK NNK (SEQ ID NO :), where N is A, C , G, or T; K represents G or T), and 6, 7, or 8 random amino acids encoded. In addition, in loop 4, human 3 amino acid KTE was replaced with 6 random amino acids encoded by nucleotides NNK NNK NNK NNK NNK NNK (SEQ ID NO :). In addition, the coding sequence for loop 4 was altered to encode alanine (A) instead of lysine (K) to discard plasminogen binding found to be dependent on loop 4 lysine (Graversen et al, 1998).

인간 루프 3 변경 라이브러리를 다음과 같은 방식으로 중첩 PCR을 이용하여 생성시켰다. 프라이머 HLoop3F6, HLoop3F7, 및 HLoop3F8(각각, SEQ ID NOS:)를 개별적으로 HLoop4R(SEQ ID NO:)과 혼합시키고, PCR에 의해 연장시켰다. 생성된 단편을 겔로부터 정제시키고, 혼합시키고, 올리고 H Loop 1-2F(SEQ ID NO:), HuBglfor(SEQ ID NO:) 및 PstRev(SEQ ID NO:)의 존재하에서 PCR에 의해 연장시켰다. 생성된 단편을 겔 정제시키고, BglⅠ 및 PstⅠ 제한효소로 절단하고, 상기 기재된 것과 유사하게 절단된 파지 디스플레이 벡터 pPhCPAB 또는 pANA27에 클로닝시켰다. 라이브러리 생성 후, 패닝을 위해 3개의 라이브러리를 풀링시켰다.Human loop 3 altered libraries were generated using overlapping PCR in the following manner. Primers HLoop3F6, HLoop3F7, and HLoop3F8 (SEQ ID NOS :, respectively) were separately mixed with HLoop4R (SEQ ID NO :) and extended by PCR. The resulting fragments were purified from the gels, mixed and extended by PCR in the presence of oligo H Loop 1-2F (SEQ ID NO :), HuBglfor (SEQ ID NO :) and PstRev (SEQ ID NO :). The resulting fragments were gel purified, digested with BglI and PstI restriction enzymes and cloned into the digested phage display vector pPhCPAB or pANA27 similar to that described above. After library generation, three libraries were pooled for panning.

루프 3의 대안적 루프 연장Alternative loop extension of loop 3

인간 루프 3 루프 라이브러리를 다음과 같은 방식으로 중첩 PCR을 이용하여 생성시켰다. 프라이머 Loop3AF2(SEQ ID NO:) 및 Loop3AR2(SEQ ID NO:)을 혼합시키고, PCR에 의해 연장시키고, 프라이머 Loop3BF(SEQ ID NO:) 및 Loop3BR(SEQ ID NO:)을 혼합시키고, PCR에 의해 연장시켰다. 생성된 단편을 겔로부터 정제시키고, 혼합시키고, 프라이머 Bgl for(SEQ ID NO:) 및 Loop30R(SEQ ID NO:)의 존재하에서 PCR에 적용시켰다. 생성물을 BglⅡ 및 PstⅠ 제한 효소로 절단시키고, 정제된 단편을 상기 기재된 바와 같이 유사하게 절단된 파지 디스플레이 벡터 pPhCPAB 또는 pANA27로 클로닝시켰다. 또한, 루프 4 리신에 좌우되는 것으로 밝혀진 플라스미노겐 결합을 폐기시키기 위해 루프 4에 대한 코딩 서열을 리신(K) 대신 알라닌(A)을 엔코딩하도록 변경시켰다(Graversen et al, 1998).Human loop 3 loop libraries were generated using nested PCR in the following manner. Primer Loop3AF2 (SEQ ID NO :) and Loop3AR2 (SEQ ID NO :) are mixed and extended by PCR, primers Loop3BF (SEQ ID NO :) and Loop3BR (SEQ ID NO :) are mixed and extended by PCR I was. The resulting fragments were purified from the gel, mixed and subjected to PCR in the presence of primers Bgl for (SEQ ID NO :) and Loop30R (SEQ ID NO :). The product was cleaved with BglII and PstI restriction enzymes and the purified fragment was cloned into a phage display vector pPhCPAB or pANA27 similarly cleaved as described above. In addition, the coding sequence for loop 4 was altered to encode alanine (A) instead of lysine (K) to discard plasminogen binding found to be dependent on loop 4 lysine (Graversen et al, 1998).

실시예 8Example 8

루프 3 및 5의 돌연변이Mutations in loops 3 and 5

인간 C-타입 렉틴 결합 도메인의 루프 3 및 5 변경 라이브러리에 대해, 루프 3 및 5에 대한 코딩 서열을 표 2에 제시된 서열을 엔코딩하도록 변형시켰고, 여기서 인간의 5개의 아미노산 TEITA를 누클레오티드 NNK NNK NNK NNK NNK(SEQ ID NO:)에 의해 엔코딩되는 5개의 아미노산으로 대체시켰고, 인간의 3개의 아미노산 AAN을 누클레오티드 NNK NNK NNK에 의해 엔코딩되는 3개의 아미노산으로 대체시켰다. 또한, 루프 4 리신에 좌우되는 것으로 밝혀진 플라스미노겐 결합을 폐기시키기 위해 루프 4에 대한 코딩 서열을 리신(K) 대신 알라닌(A)을 엔코딩하도록 변경시켰다(Graversen et al, 1998).For the loop 3 and 5 altered libraries of human C-type lectin binding domains, the coding sequences for loops 3 and 5 were modified to encode the sequences shown in Table 2, wherein the human 5 amino acid TEITA was nucleotide NNK NNK NNK NNK The 5 amino acids encoded by NNK (SEQ ID NO :) were replaced, and the human 3 amino acids AAN were replaced by 3 amino acids encoded by the nucleotide NNK NNK NNK. In addition, the coding sequence for loop 4 was altered to encode alanine (A) instead of lysine (K) to discard plasminogen binding found to be dependent on loop 4 lysine (Graversen et al, 1998).

인간 루프 3 및 5 변경 라이브러리를 다음과 같은 방식으로 중첩 PCR을 이용하여 생성시켰다. 프라이머 h3-5AF(SEQ ID NO:) 및 h3-5AR(SEQ ID NO:)을 혼합시키고, PCR에 의해 연장시키고, 프라이머 h3-5BF(SEQ ID NO:) 및 h3-5 BR(SEQ ID NO:)을 혼합시키고, PCR에 의해 연장시켰다. 생성된 단편을 겔로부터 정제시키고, 혼합시키고, h3-5 OF(SEQ ID NO:) 및 PstRev(SEQ ID NO:)의 존재하에서 PCR에 의해 연장시켰다. 생성된 단편을 겔 정제시키고, BglⅠ 및 PstⅠ 제한효소로 절단하고, 상기 기재된 것과 유사하게 절단된 파지 디스플레이 벡터 pPhCPAB 또는 pANA27에 클로닝시켰다.Human loop 3 and 5 altered libraries were generated using overlapping PCR in the following manner. Primers h3-5AF (SEQ ID NO :) and h3-5AR (SEQ ID NO :) are mixed, extended by PCR, primers h3-5BF (SEQ ID NO :) and h3-5 BR (SEQ ID NO: ) Were mixed and extended by PCR. The resulting fragments were purified from gels, mixed and extended by PCR in the presence of h3-5 OF (SEQ ID NO :) and PstRev (SEQ ID NO :). The resulting fragments were gel purified, digested with BglI and PstI restriction enzymes and cloned into the digested phage display vector pPhCPAB or pANA27 similar to that described above.

실시예 9Example 9

인간 라이브러리 1-4의 패닝 및 스크리닝Panning and Screening of Human Library 1-4

인간 라이브러리 1-4로부터 생성된 파지를 재조합 인간 IL-23R/Fc 키메라(R&D Systems)에서 패닝시켰다. ELISA 플레이트 검정을 이용한 3, 4, 및/또는 5 라운드의 패닝 후 상기 결합 패널의 스크리닝은 모든 경우에서 수용체-특이적 결합체를 확인하였다.Phage generated from human libraries 1-4 were panned in recombinant human IL-23R / Fc chimera (R & D Systems). Screening of the binding panel after 3, 4, and / or 5 rounds of panning using an ELISA plate assay confirmed receptor-specific binding in all cases.

패닝을 위한 파지를 생성시키기 위해, 마스터 라이브러리 DNA를 박테리아 균주 TG1(Stratagene)으로의 전기천공에 의해 형질전환시켰다. 세포를 SOC(Super-Optimal broth with Catabolite repression) 배지에서 37℃에서 진탕과 함께 1시간 동안 회복되도록 하고, 수퍼 브로쓰(super broth, SB)를 첨가하여 부피를 10배 증가시켜 20% 글루코오스 및 20 μg/mL 카르베니실린의 최종 농도가 되도록 하였다. 1시간 동안 37℃에서 진탕시킨 후, 카르베니실린 농도를 또 다른 1시간 동안 50 μg/mL로 증가시키고, 이후 헬퍼(helper) 파지 M13K07과 함께 2% 글루코오스 및 50 μg/mL 카르베니실린을 갖는 400 mL의 SB를 5x109 pfu/mL의 최종 농도로 첨가하였다. 인큐베이션을 30분 동안 진탕과 함께 37℃에서 지속시킨 후, 또 다른 30분 동안 100-150 rpm에서 진탕시켰다. 세포를 20분 동안 4℃에서 3200g에서 원심분리시킨 후, 50 μg/mL 카르베니실린 및 50 μg/mL 카나마이신을 함유하는 500 mL SB 배지에 재현탁시켰다. 세포를 150 rpm에서의 진탕과 함께 실온(RT)에서 밤새 성장시켰다. 파지를 20분 동안 15,000 g 및 4℃에서의 원심분리에 의해 박테리아 세포를 펠렛화시킴으로써 분리시켰다. 상층액을 30분 동안 얼음 상에서 1/4 부피(보통, 250 mL의 상층액/병 + 62.5 mL PEG 용액)의 20% PEG/2.5 M NaCl과 함께 인큐베이션시켰다. 파지를 20분 동안 15,000 g 및 4℃에서의 원심분리에 의해 펠렛화시켰다. 파지 펠렛을 제조업체의 설명서에 따라 제조된 0.1% 소듐 아자이드(BSA/PBS/아자이드) 및 완전 미니-EDTA-비함유 프로테아제 억제제(Roche)를 함유하는 인산염 완충 염수(PBS) 중의 1% 소 혈청 알부민(BSA) 중에 재현탁시켰다. 대안적으로, 파지를 0.05% 비등된 카세인, 0.025% Tween-20, 및 프로테아제 억제제를 함유하는 완충액 D에 재현탁시켰다. 물질을 0.2 μm 세공 크기 필터인 25 mm 직경의 Whatman Puradisc를 이용하여 필터 멸균시켰다.To generate phage for panning, the master library DNA was transformed by electroporation to bacterial strain TG1 (Stratagene). The cells were allowed to recover for 1 hour with shaking at 37 ° C. in SOC (Super-Optimal broth with Catabolite repression) medium and the volume increased 10-fold by adding super broth (SB) to 20% glucose and 20 A final concentration of μg / mL carbenicillin was made. After shaking at 37 ° C. for 1 hour, the carbenicillin concentration was increased to 50 μg / mL for another 1 hour and then with 2% glucose and 50 μg / mL carbenicillin with helper phage M13K07. 400 mL of SB was added at a final concentration of 5 × 10 9 pfu / mL. Incubation was continued at 37 ° C. with shaking for 30 minutes and then shaken at 100-150 rpm for another 30 minutes. Cells were centrifuged at 3200 g at 4 ° C. for 20 minutes and then resuspended in 500 mL SB medium containing 50 μg / mL carbenicillin and 50 μg / mL kanamycin. Cells were grown overnight at room temperature (RT) with shaking at 150 rpm. Phage were separated by pelleting bacterial cells by centrifugation at 15,000 g and 4 ° C. for 20 minutes. The supernatant was incubated with 1/4 volume (usually 250 mL supernatant / bottle + 62.5 mL PEG solution) on 20% PEG / 2.5 M NaCl on ice for 30 minutes. Phage were pelleted by centrifugation at 15,000 g and 4 ° C. for 20 minutes. Phage pellets in 1% bovine serum in phosphate buffered saline (PBS) containing 0.1% sodium azide (BSA / PBS / azide) and fully mini-EDTA-free protease inhibitor (Roche) prepared according to manufacturer's instructions Resuspend in albumin (BSA). Alternatively, phages were resuspended in Buffer D containing 0.05% boiled casein, 0.025% Tween-20, and protease inhibitors. The material was filter sterilized using a 25 mm diameter Whatman Puradisc, 0.2 μm pore size filter.

인간 라이브러리 1-4로부터 생성된 파지를 재조합 인간 IL-23R/Fc 키메라(R&D Systems cat #1686-MR)에서 패닝시켰다. 라이브러리 패닝을 플레이트 또는 비드 형식을 이용하여 수행하였다. 플레이트 형식에 대해, 96-웰 Immulon HB2 ELISA 플레이트의 6 내지 8개의 웰을 둘베코(Dulbecco) PBS 중의 250-1000 ng/웰의 담체 비함유 인간 IL-23R/Fc로 코팅시켰다. 물질을 밤새 플레이트에서 인큐베이션시키고, 이후 웰을 PBS로 3회 세척하고, 차단 완충액(1% BSA/PBS/아자이드, 또는 0.05% 비등된 카세인 및 1% Tween-20을 함유하는 완충액 C)을 첨가한 후, 웰을 37℃에서 적어도 1시간 동안 인큐베이션시켰다. 추가 웰을 또한 차단 완충액으로의 파지 결합의 이후의 흡수를 위해 동일 시간으로 차단 완충액으로 처리하였다.Phage generated from human library 1-4 were panned in recombinant human IL-23R / Fc chimera (R & D Systems cat # 1686-MR). Library panning was performed using plate or bead format. For the plate format, 6-8 wells of 96-well Immulon HB2 ELISA plates were coated with carrier-free human IL-23R / Fc at 250-1000 ng / well in Dulbecco PBS. The material is incubated in the plate overnight, then the wells are washed three times with PBS and addition of blocking buffer (1% BSA / PBS / azide, or buffer C containing 0.05% boiled casein and 1% Tween-20). The wells were then incubated at 37 ° C. for at least 1 hour. Additional wells were also treated with blocking buffer at the same time for subsequent uptake of phage binding to blocking buffer.

비희석, 프로테아제 억제제가 더해진 차단 완충액에서의 1:10 및 1:100 희석의 파지 제조물의 3개의 희석액을 사용하였다. 일부 라운드의 패닝에서, 재조합 인간 IgG1 Fc를 10 μg/mL의 최종 농도로 각각의 희석액에 첨가하였다. "차단 단독(Block Only)"(차단을 위한 예비흡수) 웰로부터 차단 완충액을 제거하고, 다양한 파지 혼합물을 37℃에서 또 다른 시간 동안 상기 웰에서 인큐베이션시켰다. 분취량(50 μL)의 각각의 파지 혼합물을 세척되고 차단된 표적 웰로 옮기고, 37℃에서 2시간 동안 인큐베이션시켰다. 첫번째 라운드의 패닝에 대해, 결합된 파지를 1X PBS/0.05% Tween 또는 완충액 D로 1회 세척하고, 1 mg/mL BSA를 함유하는 글리신 완충액, pH 2.2를 이용하여 용리시켰다. 2 M Tris 베이스(pH 11.5)를 이용한 중화 후, 용리된 파지를 효모 추출물-트립톤(YT) 배지 중에서 600 nm(OD600)에서 측정된 약 0.9의 광학 밀도의 2 내지 4 밀리리터의 TG1(Stratagene), XL1-Blue(Stratagene), ER2738(Lucigen 또는 NEB), 또는 SS320(Lucigen) 세포와 함께 15분 동안 인큐베이션시켰다. 상기 프로토콜을 이용하지만, 약 20%(부피)로 감소시켜, 상기 감염으로부터 파지를 제조하였다. 용리된 파지로부터 제조된 파지를 추가 라운드의 패닝에 적용시켰다. 각각의 라운드에서, 투입 및 생산 파지의 역가를 적절한 항생제를 갖는 아가 상에서 플레이팅시킴으로써 결정하고, 이러한 플레이트로부터의 콜로니를 ELISA에 의한 결합체의 스크리닝을 위해 이후에 사용하였다.Three dilutions of phage preparations of 1:10 and 1: 100 dilutions in blocking buffer with undiluted, protease inhibitors were used. In some rounds of panning, recombinant human IgG1 Fc was added to each dilution at a final concentration of 10 μg / mL. Blocking buffer was removed from the “Block Only” (pre-absorption for blocking) wells and various phage mixtures were incubated in the wells at 37 ° C. for another hour. Aliquots (50 μL) of each phage mixture were transferred to washed and blocked target wells and incubated at 37 ° C. for 2 hours. For the first round of panning, bound phages were washed once with 1 × PBS / 0.05% Tween or Buffer D and eluted using glycine buffer, pH 2.2 containing 1 mg / mL BSA. After neutralization with 2 M Tris base (pH 11.5), the eluted phages were 2-4 milliliters of TG1 (Stratagene) at an optical density of about 0.9 measured at 600 nm (OD 600 ) in yeast extract-Tryptone (YT) medium. ), XL1-Blue (Stratagene), ER2738 (Lucigen or NEB), or SS320 (Lucigen) cells were incubated for 15 minutes. Phage was prepared from the infection using this protocol but reduced to about 20% (volume). Phage prepared from eluted phage was subjected to an additional round of panning. In each round, titers of input and production phage were determined by plating on agar with appropriate antibiotics, and colonies from these plates were subsequently used for screening of the conjugates by ELISA.

두번째 라운드의 패닝에서 세척을 5x로 증가시키고, 이후의 라운드에서, 세척을 10x로 증가시킨 것을 제외하고는, 추가 라운드의 패닝을 상기 기재된 바와 같이 수행하였다. 3 내지 6 라운드의 패닝을 수행하였다. 최종 라운드의 패닝을 위해, 감염 후에 파지를 생성시키지 않고, 오히려, 감염된 박테리아를 밤새 성장시키고, DNA로부터 maxiprep(Qiagen kit)을 제조하였다. 투입 파지의 글리세롤 스톡(15%)은 각 라운드로부터 동결 저장(-80℃)하였다.A further round of panning was performed as described above, except that the wash was increased to 5 × in the second round of panning and the wash was increased to 10 × in the subsequent rounds. Three to six rounds of panning were performed. For the last round of panning, no phage was generated after infection, but rather, infected bacteria were grown overnight and maxiprep (Qiagen kit) was prepared from DNA. Glycerol stock (15%) of the input phage was stored frozen (-80 ° C) from each round.

비드 패닝 형식을 위해, 인간 IL-23R을 제조업체의 설명서에 따라 Sulfo-NHS 마이크로 비오티닐화 키트(Thermo-Scientific)를 이용하여 비오티닐화시키고, 정제시켰다. 파지 펠렛을 0.5% 비등된 카세인, 첨가된 EDTA-비함유 프로테아제 억제제(Roche)를 갖는 PBS 중의 0.025% Tween 20을 함유하는 카세인 완충액에 재현탁시킨 것을 제외하고는, 파지를 상기 프로토콜에 따라 마스터 라이브러리로부터 패닝을 위해 생성시켰다. 자석을 이용하여, 스트렙타비딘 자기 비드(50 μL 또는 0.5 mg의 Myone T1 Dynabeads(Invitrogen)를 각각 갖는 2개의 튜브)를 0.5% 비등된 카세인, 1% Tween 20에서 수회 세척하여 보존제를 제거하였다. 150 μL 분취량의 파지 prep을 37℃에서 30분 동안 한 튜브의 비드와 함께 예비인큐베이션시켜 스트렙타비딘 결합체를 제거하였다. 이후, 파지 prep을 비드로부터 제거하고, 1 μg의 비오티닐화된 IL-23R을 100 μg/mL로 10 μL의 인간 Fc와 함께 첨가하고, 회전과 함께 37℃에서 2시간 동안 인큐베이션시켰다. 이후, 이러한 물질을 나머지 튜브의 세척된 비드에 첨가하고, 30분 동안 37℃에서 인큐베이션시켰다. 자석 스탠드를 이용하여, 비드를 PBS/0.05% Tween으로 5회 세척하였다. 파지를 글리신, pH 2.0로 용리시키고, 상기 기재된 바와 같이 박테리아를 감염시키는데 사용하였다. 이후의 라운드의 패닝에서, 비드-결합된 파지를 용리 전에 10회 세척하였다. 투입 및 생산 파지의 역가를 상기 기재된 바와 같이 결정하였다.For the bead panning format, human IL-23R was biotinylated and purified using Sulfo-NHS Micro Biotinylation Kit (Thermo-Scientific) according to the manufacturer's instructions. Phage pellets were resuspended in casein buffer containing 0.025% Tween 20 in PBS with 0.5% boiled casein, added EDTA-free protease inhibitor (Roche), according to the above protocol. For panning from. Using a magnet, streptavidin magnetic beads (two tubes with 50 μL or 0.5 mg of Myone T1 Dynabeads (Invitrogen) each) were washed several times in 0.5% boiled casein, 1% Tween 20 to remove preservatives. A 150 μL aliquot of phage prep was preincubated at 37 ° C. for 30 minutes with beads of one tube to remove streptavidin conjugates. Phage prep was then removed from the beads, 1 μg of biotinylated IL-23R was added at 100 μg / mL with 10 μL of human Fc and incubated for 2 hours at 37 ° C. with rotation. This material was then added to the washed beads of the remaining tubes and incubated at 37 ° C. for 30 minutes. Using a magnetic stand, the beads were washed 5 times with PBS / 0.05% Tween. Phage were eluted with glycine, pH 2.0 and used to infect bacteria as described above. In subsequent rounds of panning, the bead-bound phages were washed 10 times before eluting. The titers of the input and production phages were determined as described above.

ELISA 스크리닝을 위해, 이후의 라운드의 패닝으로부터의 콜로니를 밤새 2% 글루코오스 및 항생제를 갖는 YT 배지에서 성장시킨 후, 새로운 배양을 시작하기 위해 각각의 분취량을 사용하여 0.5의 OD600로 성장시켰다. 헬퍼 파지를 5 x 109 pfu/mL로 첨가하고, 37℃에서 30분 동안 감염시킨 후, 진탕과 함께 37℃에서 성장시켰다. 박테리아를 원심분리시키고, 카르베니실린 및 카나마이신을 갖는 YT 배지에 재현탁시키고, 파지 생성을 위해 밤새 성장시켰다. 이후, 박테리아를 펠렛화시키고, 배지를 제거하고, 1/5 부피(1:5 밀크 혼합물:상층액)의 6X PBS, 18% 밀크와 함께 혼합하였다. ELISA 플레이트를 75-100 ng/웰의 재조합 인간 IL-23R/Fc를 함유하는 50-100 μL의 PBS와 함께 4℃에서 밤새 인큐베이션시킴으로써 제조하였다. 인간 IgG Fc(R&D Systems)로 코팅된 이중 플레이트를 대조군으로 사용하였다. 플레이트를 PBS로 3회 세척하고, 1X PBS 중의 3% 밀크와 함께 37℃에서 1시간 동안 차단시키고, 100 uL/웰의 각각의 밀크 처리된 파지 혼합물로 1시간 동안 인큐베이션시켰다. 플레이트를 PBS/0.05% Tween 20로 1회 세척하고, PBS로 2회 세척하고, HRP-컨쥬게이션된 항-M13 항체(GE Healthcare)로 1시간 동안 인큐베이션시키고, PBS/Tween 및 PBS로 각각 3회 세척하고, TMB 기질(VWR)과 함께 인큐베이션시켰다. 황산을 첨가하여 색 반응을 중지시키고, 흡광도를 450 nm에서 판독하여 양성 결합체를 확인하였다.For ELISA screening, colonies from subsequent rounds of panning were grown overnight in YT medium with 2% glucose and antibiotics and then grown to an OD 600 of 0.5 using each aliquot to start a new culture. Helper phage was added at 5 × 10 9 pfu / mL, infected at 37 ° C. for 30 minutes, and then grown at 37 ° C. with shaking. The bacteria were centrifuged, resuspended in YT medium with carbenicillin and kanamycin and grown overnight for phage production. The bacteria were then pelleted, the medium was removed and mixed with 1/5 volume (1: 5 milk mixture: supernatant) 6X PBS, 18% milk. ELISA plates were prepared by incubating overnight at 4 ° C. with 50-100 μL of PBS containing 75-100 ng / well recombinant human IL-23R / Fc. Double plates coated with human IgG Fc (R & D Systems) were used as controls. Plates were washed three times with PBS, blocked with 3% milk in 1 × PBS for 1 hour at 37 ° C. and incubated for 1 hour with 100 uL / well of each milked phage mixture. Plates were washed once with PBS / 0.05% Tween 20, washed twice with PBS, incubated with HRP-conjugated anti-M13 antibody (GE Healthcare) for 1 hour, three times with PBS / Tween and PBS each Washed and incubated with TMB substrate (VWR). Sulfuric acid was added to stop the color reaction and the absorbance was read at 450 nm to identify the positive conjugate.

인간 IL-23R에 대한 결합체를 세번째 및 네번째 라운드의 패닝으로부터 확인하였다. 인간 IL-23R/Fc 키메라에 대한 파지-디스플레이된 CTLD 결합체로부터의 루프 1 및 4의 무작위화 영역으로부터의 서열의 예가 표 4에 제공된다. 이러한 데이터로부터의 시험은, 결합체의 31/36에 대해, 모티프가 루프 4의 무작위화 영역에서 명백하였고: 두번째 및 다섯번째 아미노산은 항상 글리신이었고, 네번째 아미노산이 항상 고리 아미노산, 예를 들어, 트립토판 또는 페닐알라닌 중 하나였고, 첫번째 아미노산이 소수성이었고, 보통 고리 아미노산, 예를 들어, 페닐알라닌, 티로신, 또는 트립토판이었고, 세번째 아미노산이 소수성이었고, 보통 발린이었는 것을 암시한다. 글리신 및 세린이 첫번째 및 두번째 위치에서 주로 나타났고, 발린이 일곱번째 위치에 종종 존재하지만, 루프 1 영역은 보다 적은 컨센서스를 갖는다. 5개의 추가 결합체는 상기 컨센서스를 갖는 것으로 나타나지 않았지만, 이중 2개는 루프 4 영역에서 MFGMG(SEQ ID NO:) 또는 LFGRG(SEQ ID NO:)와 함께 또 다른 작은 그룹을 형성할 것이다. 많은 결합체가 다수의 클론에 의해 각각 제시되었다.Binding to human IL-23R was identified from the third and fourth rounds of panning. Examples of sequences from randomized regions of loops 1 and 4 from phage-displayed CTLD conjugates for human IL-23R / Fc chimeras are provided in Table 4. Tests from this data indicated that for 31/36 of the conjugate, the motif was evident in the randomization region of loop 4: the second and fifth amino acids were always glycine and the fourth amino acid was always a cyclic amino acid such as tryptophan or It was one of the phenylalanines, implying that the first amino acid was hydrophobic, usually a cyclic amino acid such as phenylalanine, tyrosine, or tryptophan, and the third amino acid was hydrophobic and usually valine. Glycine and serine are mainly present in the first and second positions, and valine is often present in the seventh position, but the loop 1 region has less consensus. Five additional conjugates did not appear to have the consensus, but two of them would form another small group with MFGMG (SEQ ID NO :) or LFGRG (SEQ ID NO :) in the loop 4 region. Many conjugates were each represented by multiple clones.

표 4Table 4

인간 IL-23R/Fc 키메라에 대한 인간 루프 1 및 4 결합체의 서열Sequences of human loop 1 and 4 conjugates to human IL-23R / Fc chimeras

Figure pct00013
Figure pct00013

Figure pct00014
Figure pct00014

ELISA 검정은 상기 결합체가 인간 IgG1 Fc 또는 재조합 마우스 IL-23R과 교차 반응하지 않은 것을 나타내었다. ELISA 및 Biacore 결합 검정은 후보 클론 001-69.4G8 및 다른 클론으로부터의 정제된 단량체 CTLD 및 전장 삼합체가 인간 IL-23R에 대한 결합에 대해 IL-23과 경쟁한 것을 나타내었다. 경쟁 후보는 나노몰의 친화성을 갖는 것으로 확인되었다.ELISA assay showed that the conjugate did not cross react with human IgG1 Fc or recombinant mouse IL-23R. ELISA and Biacore binding assays showed that purified monomeric CTLD and full length trimers from candidate clones 001-69.4G8 and other clones competed with IL-23 for binding to human IL-23R. Competition candidates were found to have nanomolar affinity.

실시예 10Example 10

인간 IL-23R에 대한 결합체의 친화성 성숙Affinity Maturation of Conjugates to Human IL-23R

인간 IL-23R의 루프 4영역이 적절한 모티프인 것으로 확인되므로, 패닝에 의해 이미 수득된 루프 4의 다양성을 보존하지만, 본래의 나이브 라이브러리로부터 모든 가능한 루프 1 영역을 갖는 루프 4 영역을 재분류시키는 셔플링(shuffling) 방법을 개발하였다. 이를 위해, 루프 1과 루프 4 영역 사이를 절단하고, 루프 4 영역을 함유하는 약 1.4 kb의 단편을 분리시키는 EcoRⅠ 및 BssHⅡ 제한 효소로 인간 IL-23R의 라운드 4 패닝으로부터의 DNA를 절단하였다. 별개로, 본래의 인간 1-4 라이브러리 DNA를 동일 효소로 절단하고, 루프 1 영역을 함유하는 약 3.5 kb의 단편을 분리시켰다. 이러한 단편을 함께 라이게이션시키고, 새로운 h1-4 셔플 라이브러리를 상기 기재된 바와 같이 생성시켰다. 패닝의 각각의 라운드에서 비오티닐화된 재조합 인간 IL-23R/Fc의 양을 200 ng로부터 (20 ng, 2 ng) 0.1 ng로 약 10배 감소시킨 것을 제외하고, 라이브러리를 비드 프로토콜(상기)을 이용하여 패닝시켰다. 콜로니로부터의 파지 상층액을 상기 기재된 바와 같이 ELISA에 의해 스크리닝시키고, 결합체를 확인하고, 서열 분석하였다. 친화성-성숙된 친화체의 루프 1 및 4 서열이 표 5에 제시된다(SEQ ID NOS:).Since the loop 4 region of human IL-23R is found to be an appropriate motif, it preserves the diversity of loop 4 already obtained by panning, but shuffles to reclassify the loop 4 region with all possible loop 1 regions from the original naive library. A shuffling method was developed. To this end, DNA from round 4 panning of human IL-23R was cleaved with EcoRl and BssHII restriction enzymes that cleaved between loop 1 and loop 4 regions and separated about 1.4 kb of fragments containing loop 4 regions. Separately, the original human 1-4 library DNA was digested with the same enzyme and a fragment of about 3.5 kb containing the loop 1 region was isolated. These fragments were ligated together and a new h1-4 shuffle library was generated as described above. The library was loaded into the bead protocol (above), except that the amount of biotinylated recombinant human IL-23R / Fc in each round of panning was reduced by approximately 10-fold from 200 ng (20 ng, 2 ng) to 0.1 ng. Panning was used. Phage supernatants from colonies were screened by ELISA as described above, conjugates identified, and sequenced. Loop 1 and 4 sequences of affinity-matured affinity are shown in Table 5 (SEQ ID NOS :).

표 5Table 5

인간 IL-23R에 대한 친화성-성숙된 인간 루프 1-4 결합체로부터의 루프 1 및 4 서열Loop 1 and 4 sequences from affinity-matured human loop 1-4 conjugates to human IL-23R

Figure pct00015
Figure pct00015

최초 패닝 라운드 4에서 수득된 루프 1영역의 다양성이 유지되고, 루프 4 선택권의 제한된 선택이 사용되고, 루프 3이 6개 위치에서 무작위화된 별개의 친화성 성숙 라이브러리를 생성시켰다. 이는 프라이머 Bglfor(SEQ ID NO:) 및 H1-3-4R(SEQ ID NO:)와 함께, 주형으로서 인간 루프 1-4 라이브러리의 본래의 패닝 라운드 4로부터의 DNA를 이용하여 루프 1 영역을 증폭시키기 위한 프라이머를 생성시킴으로써 달성되었다. 이러한 프라이머는 루프 3 및 4에 대해 하기 아미노산 서열을 엔코딩한다:The diversity of loop 1 regions obtained in the first panning round 4 was maintained, a limited selection of loop 4 options was used, and loop 3 generated a separate affinity maturation library randomized at 6 positions. This, together with primers Bglfor (SEQ ID NO :) and H1-3-4R (SEQ ID NO :), amplifies the loop 1 region using DNA from the original panning round 4 of the human loop 1-4 library as a template. Was achieved by generating primers. These primers encode the following amino acid sequences for loops 3 and 4:

Figure pct00016
Figure pct00016

이러한 서열은 루프 4에 대한 주요 양자택일(primary alternative) 뿐만 아니라 CTLD의 루프 3 영역의 변경을 포함한다. 상기와 유사하나 루프 4 영역 서열에 대해 더욱 특이적인 다른 프라이머를 또한 생성시키고, 루프 3 영역에서 무작위화되는 또 다른 라이브러리의 생성에 사용하였다. 프라이머 PstLoop4rev(SEQ ID NO:) 및 Pst Rev(SEQ ID NO:)를 이용한 중첩 PCR에 의해 나머지 관심 영역을 생성시켰다.Such sequences include alterations in the loop 3 region of CTLD as well as the primary alternative to loop 4. Other primers similar to the above but more specific for loop 4 region sequences were also generated and used to generate another library that was randomized in the loop 3 region. The remaining region of interest was generated by overlapping PCR with primers PstLoop4rev (SEQ ID NO :) and Pst Rev (SEQ ID NO :).

상기 라이브러리로부터 수득된 친화성 성숙된 IL-23R 결합 서열이 표 6에 제공된다. 수득된 결합체 중 일부를 보다 바람직한 루프 4 또는 루프 1 서열을 추가 친화성-성숙된 결합체를 수득하기 위한 다른 서열로 교환시킴으로써 변경시켰고, 이들은 표 6에 포함된다.The affinity matured IL-23R binding sequence obtained from the library is provided in Table 6. Some of the conjugates obtained were altered by exchanging more preferred Loop 4 or Loop 1 sequences with other sequences to obtain additional affinity-matured conjugates, which are included in Table 6.

표 6Table 6

Figure pct00017
Figure pct00017

*클론 101-80-5H3은 설계된 루프 4로부터 결실된 아미노산 및 변경된 영역의 바로 업스트림의 루프 4 영역에 2개의 다른 아미노산 변경(Gly 146, Gly 147로부터 Ala 146, Ala 147로)을 가졌다.Clone 101-80-5H3 had two other amino acid alterations (Gly 146, Gly 147 to Ala 146, Ala 147) in the loop 4 region immediately upstream of the amino acid deleted from the designed loop 4 and the altered region.

표 7은 H1-3-4R(SEQ ID NO:)과 유사한 프라이머를 이용하여 제조되지만, 다음과 같은 루프 변형의 선택을 발생시키는 코딩 서열을 갖는 일부 추가 클론을 나타낸다.Table 7 shows some additional clones prepared using primers similar to H1-3-4R (SEQ ID NO :) but with coding sequences that result in the selection of loop modifications as follows.

표 7Table 7

Figure pct00018
Figure pct00018

IL-23R 결합체의 루프 1의 시작에서 발견된 GlySer를 유지시키면서 루프 4를 FGVFG(SEQ ID NO:), WGVFG, FGYFG, WGYFG, 및 WGVWG(각각, SEQ ID NOS:)의 5개의 아미노산 서열을 한정시키고, NNK 방법을 이용하여 루프 1의 이후의 5개의 아미노산을 다양화시킴으로써 또 다른 친화성 성숙 라이브러리를 생성시켰다. 프라이머 GSXX(SEQ ID NO:) 및 090827 BssBglrev(SEQ ID NO:)를 혼합시키고, PCR을 이용하여 연장시키고, 프라이머 FGVFGfor, FGYFGfor, WGVFGfor, WGYFGfor, 및 WGVWGfor(SEQ ID NOS:_ 내지 _)를 개별적으로 프라이머 Pst Loop 4 rev(SEQ ID NO:)와 혼합시키고, PCR을 이용하여 연장시켰다. 생성된 단편을 겔 정제시키고, 혼합시키고, 프라이머 Bgl for(SEQ ID NO:) 및 Pst rev(SEQ ID NO:)의 존재하에서 PCR로 연장시켰다. 생성된 단편을 BglⅡ 및 PstⅠ으로 절단시키고, 파지 디스플레이를 위해 벡터 pANA27로 삽입하였다. 연속 표적 희석액을 위한 비드 패닝을 라이브러리로부터의 친화성-성숙된 후보자를 선택하는데 사용하였다. 이러한 라이브러리로부터 수득된 후보자의 서열은 표 8에 제공된다.Loop 4 defines five amino acid sequences of FGVFG (SEQ ID NO :), WGVFG, FGYFG, WGYFG, and WGVWG (SEQ ID NOS :, respectively) while maintaining the GlySer found at the beginning of loop 1 of the IL-23R conjugate. Another affinity matured library was generated by diversifying the five subsequent amino acids of loop 1 using the NNK method. Primers GSXX (SEQ ID NO :) and 090827 BssBglrev (SEQ ID NO :) are mixed, extended using PCR, and primers FGVFGfor, FGYFGfor, WGVFGfor, WGYFGfor, and WGVWGfor (SEQ ID NOS: _ to _) are individually Mixed with primers Pst Loop 4 rev (SEQ ID NO :) and extended using PCR. The resulting fragments were gel purified, mixed and extended by PCR in the presence of primers Bgl for (SEQ ID NO :) and Pst rev (SEQ ID NO :). The resulting fragments were digested with BglII and PstI and inserted into vector pANA27 for phage display. Bead panning for serial target dilutions was used to select affinity-matured candidates from the library. The sequences of candidates obtained from this library are provided in Table 8.

표 8Table 8

Figure pct00019
Figure pct00019

Figure pct00020
Figure pct00020

Figure pct00021
Figure pct00021

루프의 아미노산 서열 및 주위 서열에서의 추가 변화를 알라닌 스캐닝, 즉, 특정 아미노산의 당업자에게 공지된 유전자 부위 특이적 돌연변이유발에 의한 특정 아미노산의 아미노산 알라닌으로의 대체에 의해 생성시켰다. 표 9는 후보자 056- 53.H4E 서열에서 제조된 알라닌 대체를 기재한다. 이러한 대체는 상기 잔기에 제한되지 않으며, 임의의 후보자 백본에서 이루어질 수 있다. 표 10은 많은 상기 대체가 친화성 및/또는 단백질 생성에 유리한 것을 나타낸다.Further changes in the amino acid sequence and the surrounding sequence of the loop were generated by alanine scanning, ie replacement of specific amino acids with amino acid alanine by gene site specific mutagenesis known to those skilled in the art. Table 9 lists alanine substitutions made in the candidate 056-53.H4E sequences. Such substitution is not limited to these residues and can be made in any candidate backbone. Table 10 shows that many of these substitutions are advantageous for affinity and / or protein production.

표 9Table 9

IL-23R에 결합하는 알라닌 스캔 후보자의 서열.Sequence of alanine scan candidates that bind IL-23R.

Figure pct00022
Figure pct00022

*056-53.H4E 아미노산의 넘버링은 상기 나열된 마지막 4개의 후보자에서의 TN 서열 넘버링과 상이한데, 이는 루프 4에서의 3개의 추가 아미노산의 도입 때문임을 주의하라. 따라서, 예를 들어, 056-53.H4E에서의 E153은 인간 TN 서열[7, SEQ ID NO:131]에서의 E150에 해당한다.Note that the numbering of * 056-53.H4E amino acids differs from the TN sequence numbering in the last four candidates listed above, due to the introduction of three additional amino acids in loop 4. Thus, for example, E153 in 056-53.H4E corresponds to E150 in human TN sequence [7, SEQ ID NO: 131].

표 10Table 10

알라닌 스캐닝에 의해 생성된 056-53.H4E 아트리머™ 폴리펩티드 복합체의 E. 콜리 원형질막공간에서의 친화성 및 생산량 수준Affinity and Yield Levels in the E. coli Plasma Membrane of the 056-53.H4E Atrimer ™ Polypeptide Complex Produced by Alanine Scanning

Figure pct00023
Figure pct00023

실시예 11Example 11

인간 human ILIL -23R에 대한 For -23R CTLDCTLD  And 아트리머Atrimer ™ 폴리펩티드 복합체 결합체의 ™ of polypeptide complex conjugate 서브클로닝Subcloning 및 생성 And generate

루프 영역을 엔코딩하는 DNA 단편을 BglⅡ 및 PstⅠ(또는 MfeⅠ) 제한효소를 이용한 제한 절단에 의해 수득하고, BglⅡ 및 PstⅠ으로 미리 절단된 박테리아 CTLD 발현 벡터 pANA1, pANA3, 또는 pANA12에 라이게이션시켰다. pANA1(SEQ ID NO:151)은 C-말단 6xHis-태깅된 인간 단량체 CTLD를 발현하도록 설계된 T7 기반의 발현 벡터이다. pelB 신호 펩티드는 단백질이 원형질막 주위공간 또는 성장 배지로 향하게 한다. pANA3(SEQ ID NO:153)는 pANA1의 C-말단 HA-His-태깅된 형태이다. pANA12(SEQ ID NO:162)는 pANA1의 C-말단 HA-StrepII-태깅된 형태이다. 삼합체 단백질의 발현을 위해, 루프 영역은 아트리머™ 폴리펩티드 복합체 발현 벡터 pANA4 또는 pANA10으로 서브클로닝되어, E. 콜리에서 분비되는 아트리머™ 폴리펩티드 복합체가 생성될 수 있다. pANA4(SEQ ID NO:154)는 단백질이 원형질막 주위공간 또는 성장 배지로 향하게 하는 ompA 신호 펩티드와 함께 C-말단 His/Myc-태깅된 전장 인간 TN을 함유하는 pBAD 기반의 발현 벡터이다. pANA10(SEQ ID NO:160)은 pANA4의 C-말단 HA-StrepII-태깅된 형태이다.DNA fragments encoding loop regions were obtained by restriction digestion with BglII and PstI (or MfeI) restriction enzymes, and ligated to bacterial CTLD expression vectors pANA1, pANA3, or pANA12 previously cleaved with BglII and PstI. pANA1 (SEQ ID NO: 151) is a T7 based expression vector designed to express C-terminal 6xHis-tagged human monomer CTLD. The pelB signal peptide directs the protein to the periplasm or growth medium. pANA3 (SEQ ID NO: 153) is the C-terminal HA-His-tagged form of pANA1. pANA12 (SEQ ID NO: 162) is the C-terminal HA-StrepII-tagged form of pANA1. For expression of the trimer protein, the loop region can be subcloned into the Atrimer ™ polypeptide complex expression vector pANA4 or pANA10, resulting in the Atrimer ™ polypeptide complex secreted by E. coli. pANA4 (SEQ ID NO: 154) is a pBAD based expression vector containing C-terminal His / Myc-tagged full-length human TN with ompA signal peptide that directs the protein to the periplasm or growth medium. pANA10 (SEQ ID NO: 160) is the C-terminal HA-StrepII-tagged form of pANA4.

발현 작제물로 E. 콜리 균주 BL21(DE3)을 형질전환시켰다. Star(pANA1, pANA3 및 pANA12에 대해; 단량체 CTLD 생성) 또는 BL21(DE3)(pANA4 및 pANA10에 대해; 아트리머™ 폴리펩티드 복합체 생성)를 적절한 항생제와 함께 LB/아가 플레이트에 플레이팅하였다. 새로운 플레이트 상의 단일 콜로니를 1L의 1% 글루코오스 및 카나마이신을 갖는 SB(pANA1 및 pANA12 벡터에 대해) 또는 앰피실린을 갖는 2xYT(두배로 농축된 효모 트립톤) 배지(pANA4 및 pANA10 벡터에 대해)에 접종시켰다. 배양물을 200 rpm에서 진탕기 상에서 37℃에서 0.5의 OD600로 인큐베이션시킨 후, 실온으로 냉각시켰다. IPTG를 pANA1 및 pANA12에 대해 0.05 mM의 최종 농도로 첨가하고, 아라비노시스(arabinosis)를 pANA4 및 pANA10에 대해 0.002-0.02%의 최종 농도로 첨가하였다. 120-150 rpm에서의 진탕과 함께 실온에서 밤새 유도를 수행하였고, 이후 원심분리에 의해 박테리아를 수거하였다. 원형질막 주위공간 단백질을 삼투압 충격 또는 가벼운 초음파처리에 의해 추출하였다.The expression construct was transformed with E. coli strain BL21 (DE3). Star (for pANA1, pANA3 and pANA12; generating monomeric CTLD) or BL21 (DE3) (for pANA4 and pANA10; generating atrimer ™ polypeptide complex) was plated with LB / Agar plates with appropriate antibiotics. Single colonies on new plates were inoculated in SB with 1 L of 1% glucose and kanamycin (for pANA1 and pANA12 vectors) or 2 × YT (doubled enriched yeast tryptone) medium with ampicillin (for pANA4 and pANA10 vectors) I was. Cultures were incubated with an OD 600 of 0.5 at 37 ° C. on a shaker at 200 rpm and then cooled to room temperature. IPTG was added at a final concentration of 0.05 mM for pANA1 and pANA12, and arabinosis was added at a final concentration of 0.002-0.02% for pANA4 and pANA10. Induction was performed overnight at room temperature with shaking at 120-150 rpm, after which the bacteria were harvested by centrifugation. Plasma membrane space proteins were extracted by osmotic shock or light sonication.

6xHis-태깅된 단백질을 Ni+-NTA 친화성 크로마토그래피를 이용하여 정제시켰다. 간단히, 원형질막 주위공간 단백질을 His-결합 완충액(100 mM HEPES, pH 8.0, 500 mM NaCl, 10 mM 이미다졸)에서 재구성시키고, His-결합 완충액으로 미리 평형화된 Ni+-NTA 컬럼에 로딩하였다. 컬럼을 10x 부피의 결합 완충액으로 세척하였다. 결합된 단백질을 용리 완충액(100 mM HEPES, pH 8.0, 500 mM NaCl, 500 mM 이미다졸)으로 용리시켰다. 정제된 단백질을 1X PBS 완충액으로 투석시키고, 박테리아 내독소를 음이온 교환에 의해 제거하였다.6 × His-tagged proteins were purified using Ni + -NTA affinity chromatography. Briefly, the plasma membrane space protein was reconstituted in His-binding buffer (100 mM HEPES, pH 8.0, 500 mM NaCl, 10 mM imidazole) and loaded onto a Ni + -NTA column previously equilibrated with His-binding buffer. The column was washed with 10 × volume of binding buffer. Bound proteins were eluted with elution buffer (100 mM HEPES, pH 8.0, 500 mM NaCl, 500 mM imidazole). Purified protein was dialyzed with IX PBS buffer and bacterial endotoxin was removed by anion exchange.

strep II-태깅된 단량체 CTLD 및 아트리머™ 폴리펩티드 복합체를 Strep-Tactin 친화성 크로마토그래피에 의해 정제하였다. 간단히, 원형질막 주위공간 단백질을 1X PBS 완충액에 재구성시키고, 1X PBS 완충액으로 미리 평형화된 Strep-Tactin 컬럼에 로딩시켰다. 컬럼을 10X 부피의 PBS 완충액으로 세척하였다. 단백질을 용리 완충액(2.5 mM 데스티오비오틴(desthiobiotin)을 갖는 1X PBS)으로 용리시켰다. 정제된 단백질을 1X PBS 완충액으로 투석시키고, 박테리아 내독소를 음이온 교환에 의해 제거하였다.The strep II-tagged monomer CTLD and Atrimer ™ polypeptide complexes were purified by Strep-Tactin affinity chromatography. Briefly, plasma membrane space proteins were reconstituted in IX PBS buffer and loaded onto Strep-Tactin columns previously equilibrated with IX PBS buffer. The column was washed with 10 × volume of PBS buffer. Proteins were eluted with elution buffer (1 × PBS with 2.5 mM desthiobiotin). Purified protein was dialyzed with IX PBS buffer and bacterial endotoxin was removed by anion exchange.

일부 세포 검정에 대해, 아트리머™ 폴리펩티드 복합체를 포유동물 세포에 의해 생성하였다. 루프 영역을 엔코딩하는 DNA 단편을 포유동물 발현 벡터 pANA2 또는 pANA11로 서브클로닝시켜, HEK293 일시 발현 시스템에서 아트리머™ 폴리펩티드 복합체를 생성시켰다. pANA2(SEQ ID NO:152)는 C-말단 His 태그를 함유하는 변형된 pCEP4 벡터이다. pANA11(SEQ ID NO:161)은 pANA2의 C-말단 HA-StrepII-태깅된 형태이다. 루프 영역을 엔코딩하는 DNA 단편을 BglⅡ 및 MfeⅠ을 이용한 이중 절단에 의해 수득하고, BglⅡ 및 MfeⅠ으로 미리 절단된 발현 벡터 pANA2 및 pANA11에 라이게이션시켰다. 발현 플라스미드를 Qiagen HiSpeed Plasmid Maxi Kit(Qiagene)를 이용하여 박테리아로부터 정제시켰다. HEK293 부착 세포에 대해, 제조업체의 프로토콜에 따라 Qiagen SuperFect Reagent를 이용하여 일시적 트랜스펙션을 수행하였다. 트랜스펙션 다음날, 배지를 제거하고, 293 Isopro 혈청-비함유 배지(Irvine Scientific)로 교환하였다. 2일 후, 0.5 M HEPES 완충액 중의 글루코오스를 1%의 최종 농도로 배지에 첨가하였다. 조직 배양 상층액을 정제를 위해 트랜스펙션 4-7일 후에 수거하였다. HEK 293F 현탁 세포에 대해, 제조업체의 프로토콜에 따라 Invitrogen사의 293Fectin으로 일시적 트랜스펙션을 수행하였다. 다음날, 1X 부피의 새로운 배지를 배양물에 첨가하였다. 조직 배양 상층액을 정제를 위해 트랜스펙션 4-7일 후에 수거하였다.For some cell assays, the Atrimer ™ polypeptide complex was generated by mammalian cells. The DNA fragment encoding the loop region was subcloned into the mammalian expression vector pANA2 or pANA11 to generate the Atrimer ™ polypeptide complex in the HEK293 transient expression system. pANA2 (SEQ ID NO: 152) is a modified pCEP4 vector containing a C-terminal His tag. pANA11 (SEQ ID NO: 161) is the C-terminal HA-StrepII-tagged form of pANA2. DNA fragments encoding loop regions were obtained by double digestion with BglII and MfeI and ligated into expression vectors pANA2 and pANA11 previously cleaved with BglII and MfeI. Expression plasmids were purified from bacteria using the Qiagen HiSpeed Plasmid Maxi Kit (Qiagene). For HEK293 adherent cells, transient transfection was performed using Qiagen SuperFect Reagent according to the manufacturer's protocol. The day after transfection, the medium was removed and exchanged with 293 Isopro serum-free medium (Irvine Scientific). After 2 days, glucose in 0.5 M HEPES buffer was added to the medium at a final concentration of 1%. Tissue culture supernatants were harvested 4-7 days after transfection for purification. For HEK 293F suspension cells, transient transfection was performed with 293Fectin from Invitrogen according to the manufacturer's protocol. The following day, 1 × volume of fresh medium was added to the culture. Tissue culture supernatants were harvested 4-7 days after transfection for purification.

포유동물 조직 배양 상층액으로부터의 His 또는 Strep II-태깅된 아트리머™ 폴리펩티드 복합체 정제를 E. 콜리 생성된 아트리머™ 폴리펩티드 복합체에 대해 기재된 바와 같이 수행하였다.His or Strep II-tagged Atrimer ™ polypeptide complex purification from mammalian tissue culture supernatants was performed as described for the E. coli produced Atrimer ™ polypeptide complex.

실시예 12Example 12

ELISA 및 경쟁 ELISA에 의한 결합체의 특성규명Characterization of conjugates by ELISA and competitive ELISA

실시예 9에 기재된 바와 같이 수행된 ELISA 검정은 파지-디스플레이된 결합체 중 어느 것도 인간 IgG1 Fc 또는 재조합 마우스 IL-23R/FC(R&D Systems)와 교차 반응하지 않은 것을 나타내었다.ELISA assays performed as described in Example 9 showed that none of the phage-displayed conjugates cross reacted with human IgG1 Fc or recombinant mouse IL-23R / FC (R & D Systems).

인간 IL-23R에 대한 인간 IL-23의 결합을 차단하는 양성 인간 IL-23R(IL-23R) 결합체로부터 상기 기재된 바와 같이 생성된 정제된 단량체 CTLD 또는 아트리머™ 폴리펩티드 복합체를 이용하여 경쟁 ELISA 검정을 수행하였다. 검정을 일반적으로 다음과 같이 수행하였다. Immulon HB2 플레이트에서의 개별적 웰을 100 ng의 항-인간 IgG Fc(R&D MAB 110 클론 97924)를 함유하는 100 μL PBS를 이용하여 4℃에서 밤새 인큐베이션하였다. 플레이트를 PBS/0.05% Tween 20을 이용하여 5회 세척하고, 웰을 50 ng의 재조합 인간 IL-23R/Fc를 함유하는 100 μL의 각각의 PBS로 실온에서 1.5 시간 동안 인큐베이션하였다. 플레이트를 상기와 같이 세척하고, 150 μL의 PBS 중의 3% 소 혈청 알부민(Sigma)으로 실온에서 1시간 동안 차단시키고, 이후 플레이트를 기재된 바와 같이 세척하고, 웰을 경쟁자(아트리머™ 폴리펩티드 copmlexor CTLD)와 함께 또는 경쟁자 없이 IL-23을 함유하는 각각 100 μL의 PBS와 함께 실온에서 1-2시간 동안 인큐베이션하였다. IL-23-함유 용액을 다음과 같이 제조하였다. 인간 IL-23(eBioscience)을 100 ng/mL의 농도로 첨가하였다. 경쟁자를 1 μg/mL의 최종 농도로 포함시켰다. 인큐베이션 후, 플레이트를 기재된 바와 같이 세척하고, 웰을 스트렙타비딘-HRP 컨쥬게이트의 1:5000 희석액(Pierce catalog no. 21130)을 함유하는 각각 100 μL의 PBS와 함께 실온에서 40분 동안 인큐베이션하였다. 세척 후, 웰을 실온에서 30분 이하 동안 각각 100 μL의 TMB(BioFX Lab catalog no. TMBH-1000-0)와 함께 인큐베이션하였다. 동등한 부피의 0.2M 황산을 이용하여 반응을 중지시켰다.Competitive ELISA assays were performed using purified monomeric CTLD or Atrimer ™ polypeptide complexes generated as described above from positive human IL-23R (IL-23R) conjugates that block binding of human IL-23 to human IL-23R. Was performed. The assay was generally performed as follows. Individual wells in Immulon HB2 plates were incubated overnight at 4 ° C. using 100 μL PBS containing 100 ng of anti-human IgG Fc (R & D MAB 110 clone 97924). Plates were washed five times with PBS / 0.05% Tween 20 and wells were incubated for 1.5 hours at room temperature with 100 μL of each PBS containing 50 ng of recombinant human IL-23R / Fc. Plates were washed as above, blocked with 3% bovine serum albumin (Sigma) in 150 μL of PBS for 1 hour at room temperature, then plates were washed as described and wells were competitor (Atrimer ™ polypeptide copmlexor CTLD). Incubate for 1-2 hours at room temperature with 100 μL of PBS each containing IL-23 with or without competitor. IL-23-containing solution was prepared as follows. Human IL-23 (eBioscience) was added at a concentration of 100 ng / mL. Competitors were included at a final concentration of 1 μg / mL. After incubation, plates were washed as described and wells were incubated for 40 minutes at room temperature with 100 μL of PBS each containing a 1: 5000 dilution of Streptavidin-HRP conjugate (Pierce catalog no. 21130). After washing, the wells were incubated with 100 μL of TMB (BioFX Lab catalog no. TMBH-1000-0) each for up to 30 minutes at room temperature. The reaction was stopped with an equal volume of 0.2 M sulfuric acid.

최초 패닝으로부터의 아트리머™ 폴리펩티드 복합체를 이용한 경쟁 검정(IL-23/IL-23R 상호작용 억제)의 결과의 예가 도 10에 제시된다. 친화성-성숙된 패닝 절차로부터의 클론 59-3B5, 61-p4G3, 78-2E6 및 056-53.H4E로부터의 CTLD를 갖는 아트리머™ 폴리펩티드 복합체를 IL-23R으로의 결합에 대해 IL-23를 이용한 경쟁 검정에 사용하였다.An example of the results of a competition assay (inhibition of IL-23 / IL-23R interactions) using the Atrimer ™ polypeptide complex from initial panning is shown in FIG. 10. IL-23 for binding to the Atrimer ™ polypeptide complex with CTLDs from clones 59-3B5, 61-p4G3, 78-2E6 and 056-53.H4E from the affinity-matured panning procedure Used for competition assay used.

다수의 아트리머™ 폴리펩티드 복합체를 IC50 값을 보다 광범위하게 결정하기 위해 경쟁 ELISA에서 시험하였다. 표 11에 제시된 바와 같이, 아트리머™ 폴리펩티드 복합체는 낮은 서브나노몰(subnanomolar) IC50을 나타내었다.Multiple Atrimer ™ polypeptide complexes were tested in a competitive ELISA to determine IC50 values more widely. As shown in Table 11, the Atrimer ™ polypeptide complex showed a low subnanomolar IC50.

표 11Table 11

IL-23R로의 결합에 대해 IL-23과 경쟁하는 아트리머™ 폴리펩티드 복합체의 능력.The ability of the Atrimer ™ polypeptide complex to compete with IL-23 for binding to IL-23R.

Figure pct00024
Figure pct00024

아트리머™ 폴리펩티드 복합체 056-53.H4E를 비교를 위한 표준으로 선택하였고, 친화성-성숙된 아트리머™ 폴리펩티드 복합체를 이용하여 추가 경쟁 검정을 수행하였다. 표 12는 검정에서의 경쟁 결과를 보다 낫게 비교하기 위해 동일 검정에서 수행된 심험된 아트리머™ 폴리펩티드 복합체의 IC50 대 056-53.H4E의 IC50의 비를 제공한다.Atrimer ™ Polypeptide Complex 056-53.H4E was chosen as a standard for comparison and further competition assays were performed using the affinity-matured Atrimer ™ Polypeptide Complex. Table 12 provides the ratio of IC50 to 056-53.H4E of IC50 of the tested Atrimer ™ polypeptide complex performed in the same assay to better compare the competition results in the assay.

표 12Table 12

IL-23R에 대한 결합에 대해 IL-23과 경쟁하는 아트리머™ 폴리펩티드 복합체 능력의 비교.Comparison of Atrimer ™ polypeptide complex ability to compete with IL-23 for binding to IL-23R.

Figure pct00025
Figure pct00025

실시예 13Example 13

Biacore에 의한 인간 IL-23R 결합체의 친화성의 특성규명Characterization of affinity of human IL-23R conjugate by Biacore

본래의 라이브러리 패닝 및 친화성 성숙된 라이브러리 패닝 둘 모두로부터의 단량체 및 삼량체 결합체의 외견상 친화성이 표 13, 14 및 15에 제공된다. 인간 IL-23R 및 수용체 결합체의 상호작용을 평가하기 위해 Biacore 3000 바이오센서(GE Healthcare)를 사용하였다. 표준 아민 커플링 화학을 이용하여 CM5 칩(GE Healthcare)에 대한 항-인간 IgG Fc 항체(GE Healthcare)의 고정을 수행하고, 이러한 변형된 표면을 재조합 인간 IL-23R/Fc 융합 단백질(R&D Systems)을 포획하는데 사용하였다. 200 RU 미만의 저밀도 수용체 표면을 모든 분석에 사용하였다. 아트리머™ 폴리펩티드 복합체 희석액(1-500 nM)을 30 μl/분으로 IL-23R 표면 상에 주입하고, Biaevaluation 소프트웨어(version 3.1, GE Healthcare)를 이용하여 센소그램(sensorgram) 데이터로부터 반응속도 상수(kinetic constant)를 도출시켰다. 데이터 수집은 회합에 대해 3분 및 해리에 대해 5분이었다. 3M 마그네슘 클로라이드의 30s 펄스를 이용하여 항-인간 IgG 표면을 재생시켰다. 모든 센소그램은 활성화 및 차단 플로우-셀(flow-cell) 뿐만 아니라 완충제 주입에 대해 이중 참조(double-reference)하였다.Apparent affinity of monomer and trimer conjugates from both native library panning and affinity matured library panning are provided in Tables 13, 14 and 15. Biacore 3000 biosensors (GE Healthcare) were used to assess the interaction of human IL-23R and receptor conjugates. Fixation of anti-human IgG Fc antibody (GE Healthcare) against CM5 chip (GE Healthcare) using standard amine coupling chemistry, and the modified surface was replaced with recombinant human IL-23R / Fc fusion protein (R & D Systems). Was used to capture. Low density receptor surfaces below 200 RU were used for all assays. Atrimer ™ polypeptide complex dilution (1-500 nM) was injected onto the IL-23R surface at 30 μl / min and reaction rate constants from sensorgram data using Biaevaluation software (version 3.1, GE Healthcare). kinetic constants were derived. Data collection was 3 minutes for association and 5 minutes for dissociation. Anti-human IgG surface was regenerated using a 30 s pulse of 3M magnesium chloride. All sensograms were double-referenced for buffer injection as well as activation and blocking flow-cells.

표 13Table 13

H Loop 1-4 라이브러리로부터의 단량체 CTLD IL-23R 결합체의 친화성Affinity of Monomer CTLD IL-23R Binders from H Loop 1-4 Library

Figure pct00026
Figure pct00026

표 14Table 14

본래 및 최초 친화성-성숙된 라이브러리로부터의 전장 아트리머™ 폴리펩티드 복합체 IL-23R 결합체의 친화성. "4G8 TN m"은 포유동물-세포 생성된 물질을 의미한다. 모든 다른 물질은 E. 콜리에서 생성되었다.Affinity of the full-length Atrimer ™ polypeptide complex IL-23R conjugate from the original and original affinity-matured library. "4G8 TN m" means mammalian-cell generated material. All other substances were produced in E. coli.

Figure pct00027
Figure pct00027

표 15Table 15

추가 친화성-성숙된 라이브러리 및 알라닌-스캔 후보자로부터의 아트리머™ 폴리펩티드 복합체 IL-23R 결합체의 친화성. 모든 물질은 E. 콜리에서 생성되었다.Additional affinity-matured library and affinity of the Atrimer ™ polypeptide complex IL-23R conjugate from alanine-scan candidates. All materials were produced in E. coli.

Figure pct00028
Figure pct00028

Figure pct00029
Figure pct00029

실시예 14Example 14

IL-23R에 대한 아트리머™ 복합체 결합은 IL-12Rβ1 또는 IL-12Rβ2를 인지하지 않는다.Atrimer ™ complex binding to IL-23R does not recognize IL-12Rβ1 or IL-12Rβ2.

인간 IL-12Rβ1/Fc 또는 IL-12Rβ2/Fc와 IL-23R 결합 아트리머™ 복합체의 상호작용을 평가하기 위해 Biacore 3000 바이오센서(GE Healthcare)를 이용하였다. CM5 칩(GE Healthcare)에 대한 항-인간 IgG Fc 항체(GE Healthcare)의 고정을 표준 아민 커플링 화학을 이용하여 수행하고, 재조합 인간 IL-12Rβ1/Fc 또는 IL-12Rβ2/Fc 융합 단백질(R&D Systems)을 포획하기 위해 상기 변형된 표면을 이용하였다. 200 RU 미만의 저-밀도 수용체 표면을 모든 분석에 이용하였다. 아트리머™ 복합체 희석액(100 nM)을 30 μl/분으로 IL-12R 표면 상에 주입하였다. 데이터 수집은 회합에 대해 3분 및 해리에 대해 5분이었다. 항-인간 IgG 표면을 3M 마그네슘 클로라이드의 30s 펄스를 이용하여 재생시켰다. 모든 센소그램은 항-인간 IgG Fc 항체 표면 뿐만 아니라 완충액 주입에 대해 이중 참조하였다. 표 16에 제시된 바와 같이, 아트리머™ 복합체는 인간 IL-12Rβ1/Fc 또는 IL-12Rβ2/Fc에 대해 임의의 측정가능한 결합을 나타내지 않았다.A Biacore 3000 biosensor (GE Healthcare) was used to assess the interaction of the IL-23R binding Atrimer ™ complex with human IL-12Rβ1 / Fc or IL-12Rβ2 / Fc. Fixation of anti-human IgG Fc antibody (GE Healthcare) against CM5 chip (GE Healthcare) was performed using standard amine coupling chemistry and recombinant human IL-12Rβ1 / Fc or IL-12Rβ2 / Fc fusion protein (R & D Systems). The modified surface was used to capture). Low-density receptor surfaces of less than 200 RU were used for all assays. Atrimer ™ complex diluent (100 nM) was injected on the IL-12R surface at 30 μl / min. Data collection was 3 minutes for association and 5 minutes for dissociation. Anti-human IgG surface was regenerated using 30s pulses of 3M magnesium chloride. All sensograms were double referenced for buffer injection as well as anti-human IgG Fc antibody surface. As shown in Table 16, the Atrimer ™ complex did not show any measurable binding to human IL-12Rβ1 / Fc or IL-12Rβ2 / Fc.

표 16Table 16

Figure pct00030
Figure pct00030

실시예 15Example 15

Biacore를 이용한 IL-23R 결합체의 존재하에서의 IL-23R에 대한 인간 IL-23 결합의 경쟁 검정Competition assay of human IL-23 binding to IL-23R in the presence of IL-23R conjugate using Biacore

IL-23R 결합 아트리머™ 폴리펩티드 복합체를 CM5 칩(GE Healthcare)에 아민-커플링시킨 후, IL-23R(IL-23R)을 칩 표면 상에 주입하였다. 결합 안정화 후, IL-23R과 상호작용하는 인간 IL-23(eBioscience)의 능력을 모니터하였다. 실온에서 30분 동안 IL-23R과 IL-23 사이 또는 IL-23R과 아트리머™ 폴리펩티드 복합체 사이에 복합체를 미리 형성시킴으로써 추가 경쟁 검정을 수행하였다. 이후, 복합체를 아민-커플링된 아트리머™ 복합체를 갖는 표면 상에 주입하였다. 아트리머 A5F에 대해 표 17에 제시된 바와 같이 IL-23R 아트리머의 나머지 결합을 결정하였고, 경쟁자(IL-23 또는 다른 아트리머)의 부재하에서의 결합의 퍼센트로서 표현하였다.The IL-23R binding Atamer polymer polypeptide complex was amine-coupled to the CM5 chip (GE Healthcare) and then IL-23R (IL-23R) was injected onto the chip surface. After binding stabilization, the ability of human IL-23 (eBioscience) to interact with IL-23R was monitored. Further competition assays were performed by preforming the complexes between IL-23R and IL-23 or between IL-23R and Atrimer ™ polypeptide complexes for 30 minutes at room temperature. The complex was then injected onto the surface with the amine-coupled Atrimer ™ complex. The remaining binding of the IL-23R atrimer was determined as shown in Table 17 for Atrimer A5F and expressed as percent of binding in the absence of competitors (IL-23 or other atrimers).

표 17Table 17

A5F는 IL-23R에 대한 IL-23의 결합과 경쟁한다A5F competes with the binding of IL-23 to IL-23R

Figure pct00031
Figure pct00031

실시예 16Example 16

세포 기반 검정에서의 선택된 아트리머™ 폴리펩티드 복합체의 시험 활성Test Activity of Selected Atrimer ™ Polypeptide Complexes in Cell Based Assays

건강한 공여자(AllCells)로부터의 인간 말초 혈액 단핵 세포(PBMC)를 10% FBS/Advanced RPMI 배지(Invitrogen) 중의 IL-23R 아트리머™ 폴리펩티드 복합체 또는 우스테키누맙(Ustekinumab)의 존재하에서 인간 재조합 IL-23(1 ng/mL, eBioscience) 및 PHA(1 μg/mL, Sigma)를 이용하여 1x106 세포/mL로 자극하였다. 배양 4일 후, 세포 상층액을 IL-17 Quantikine 키트(R&D Systems)를 이용한 ELISA에 의해 수거하고 검정하였다. 병행 배양에서, PBMC에 4일 동안 IL-23R 아트리머™ 폴리펩티드 복합체 또는 우스테키누맙의 존재하에서 인간 재조합 IL-12(1 ng/mL, R&D Systems)로 처리하였다. 세포 상층액을 Luminex(Procarta, Panomics)에 의해 IFNγ 및 IL-17에 대해 검정하고, Bioplex 시스템(BioRad)에서 분석하였다. 모든 처리를 삼중으로 수행하고, 평균 및 표준 오차를 GraphPad Prism 소프트웨어를 이용하여 작도하였다. 도 11, 12 및 13에 도시된 바와 같이, IL-23 아트리머™ 폴리펩티드 복합체는 IL-23-유도 IL-17 생성을 차단하였으나, IL-12-유도 IFNγ 생성은 억제하지 않았다. 예상된 바와 같이, 우스테키누맙은 IL-23 및 IL-12 반응 둘 모두를 억제하였다.Human peripheral blood mononuclear cells (PBMC) from healthy donors (AllCells) were subjected to human recombinant IL- in the presence of IL-23R Atrimer ™ polypeptide complex or Ustekinumab in 10% FBS / Advanced RPMI medium (Invitrogen). Stimulation was performed at 1 × 10 6 cells / mL using 23 (1 ng / mL, eBioscience) and PHA (1 μg / mL, Sigma). After 4 days of culture, cell supernatants were harvested and assayed by ELISA using IL-17 Quantikine kit (R & D Systems). In parallel cultures, PBMCs were treated with human recombinant IL-12 (1 ng / mL, R & D Systems) in the presence of IL-23R Atrimer ™ polypeptide complex or Ustekinumab for 4 days. Cell supernatants were assayed for IFNγ and IL-17 by Luminex (Procarta, Panomics) and analyzed in Bioplex system (BioRad). All treatments were performed in triplicate and the mean and standard error were plotted using GraphPad Prism software. As shown in FIGS. 11, 12 and 13, the IL-23 Atrimer ™ polypeptide complex blocked IL-23-induced IL-17 production, but did not inhibit IL-12-induced IFNγ production. As expected, Ustekinumab inhibited both IL-23 and IL-12 responses.

표 18은 PBMC 검정에서 시험된 친화성-성숙된 아트리머™ 폴리펩티드 복합체에 대한 결과를 나타낸다. IL-23-유도성 IL-17, IL-17F, 및 IL-22 생성을 차단하는 아트리머™ 폴리펩티드 복합체의 능력을 표시된 바와 같이 아트리머™ 폴리펩티드 복합체에 대해 측정하였다. 결과는 계산 수단을 이용한 우스테키누맙에 대한 IC50에 비한 아트리머™ 폴리펩티드 복합체에 대한 IC50의 비로 제시된다. 하나 이상의 검정의 결과는 일부 아트리머™ 폴리펩티드 복합체에 대해 제시된다.Table 18 shows the results for the affinity-matured Atomer ™ polypeptide complexes tested in the PBMC assay. The ability of the Atrimer ™ polypeptide complex to block IL-23-induced IL-17, IL-17F, and IL-22 production was measured against the Atrimer ™ polypeptide complex as indicated. The results are presented as the ratio of IC50 to Atrimer ™ polypeptide complex relative to IC50 to Ustekinumab using calculation means. The results of one or more assays are shown for some Atrimer ™ polypeptide complexes.

표 18Table 18

동일 실험에서의 우스테키누맙에 비한 각각의 아트리머™ 폴리펩티드 복합체의 존재하에서의 표시된 사이토카인의 생성 수준.Levels of production of indicated cytokines in the presence of each Atrimer ™ polypeptide complex over Ustekinumab in the same experiment.

아트리머/우스테키누맙Atrimer / Ustekinumab

Figure pct00032
Figure pct00032

실시예 17Example 17

NKLNKL 효능제 검정 Agonist assay

IL-23R에 대한 IL-23R 아트리머™ 폴리펩티드 복합체의 효능제 활성의 결핍을 나타내기 위해, 이종이합체 IL-23 수용체를 발현하는 자연 살세포주 NKL에 대한 선택된 IL-23R 아트리머™ 복합체의 결합시의 STAT-3 인산화를 결정하였다. 150 μg/mL의 농도의 아트리머™ 복합체 또는 양성 대조군으로서의 50 ng/mL에서의 IL-23을 96-웰 플레이트 중의 140,000 NKL 세포/웰과 함께 37℃에서 인큐베이션하였다. 10분 후, 세포를 5분 동안 1200 rpm에서 원심분리시키고, PBS로 2회 세척하였다. 이후, Cell Signaling Technology(PATH SCAN® Phospho Stat3 Sandwich ELISA kit, Cat #7300, Cell Signaling Technology, Inc., Danvers, MA)에서 수득한 Stat3 인산화 키트에 제공된 프로토콜에 따라 세포를 용해시키고 처리하였다. Stat-3 인산화를 Molecular Devices ELISA 플레이트 판독기를 이용하여 450 nM에서의 흡광도에 의해 측정하였다. 056-53.H4E 및 H4EP1E9의 복합체에 대한 예시적인 도 14에 도시된 바와 같이, 아트리머™ 복합체에 의한 IL-23R 수용체의 활성화가 관찰되지 않았고, 예상된 바와 같이 IL-23은 STAT-3 인산화를 발생시켰다. 도 15A 및 15B에 요약된 바와 같이 101-51-1A4, 101-51-1A7, 105-08-1A8, 101-54-4B6, H4E E137A, 101-113-6C108 및 101-54-4B10과 같은 시험된 모든 다른 아트리머에 대해 유사한 결과가 수득되었다.Upon binding of the selected IL-23R Atrimer ™ complex to the natural killer cell line NKL expressing the heterodimeric IL-23 receptor to indicate a lack of agonist activity of the IL-23R Atrimer ™ polypeptide complex against IL-23R. STAT-3 phosphorylation was determined. IL-23 at 50 ng / mL as the Atrimer ™ complex or positive control at a concentration of 150 μg / mL was incubated at 37 ° C. with 140,000 NKL cells / well in 96-well plates. After 10 minutes, the cells were centrifuged at 1200 rpm for 5 minutes and washed twice with PBS. Cells were then lysed and treated according to the protocol provided in the Stat3 phosphorylation kit obtained from Cell Signaling Technology (PATH SCAN Phospho Stat3 Sandwich ELISA kit, Cat # 7300, Cell Signaling Technology, Inc., Danvers, MA). Stat-3 phosphorylation was measured by absorbance at 450 nM using a Molecular Devices ELISA plate reader. As shown in FIG. 14 for the complexes of 056-53.H4E and H4EP1E9, activation of the IL-23R receptor by the Atrimer ™ complex was not observed, and as expected, IL-23 is STAT-3 phosphorylated. Generated. Tests such as 101-51-1A4, 101-51-1A7, 105-08-1A8, 101-54-4B6, H4E E137A, 101-113-6C108 and 101-54-4B10 as summarized in FIGS. 15A and 15B Similar results were obtained for all other atrimers.

상기 실시예는 상기 라이브러리에서 생성될 수 있는 변화의 범위를 제한하지 않는다. 존재하는 아미노산을 대체하기 위해 무작위 또는 보다 표적화된 아미노산의 수를 다양화시키는 것을 이용하고, 다양한 조합의 루프가 이용될 수 있는 다른 라이브러리가 생성될 수 있다. 또한, 다른 돌연변이 및 돌연변이를 발생시키는 방법, 예를 들어, 무작위 PCR 돌연변이유발이 패닝에 적용될 수 있는 다양한 라이브러리를 제공하는데 이용될 수 있다.The embodiment does not limit the range of changes that can be made in the library. Other libraries can be generated that utilize a varying number of random or more targeted amino acids to replace existing amino acids, and various combinations of loops can be used. In addition, other mutations and methods for generating mutations, such as random PCR mutagenesis, can be used to provide a variety of libraries that can be applied to panning.

표 19: TAS 및 TAA 서열 정보:Table 19: TAS and TAA Sequence Information:

Figure pct00033
Figure pct00033

Figure pct00034
Figure pct00034

Figure pct00035
Figure pct00035

Figure pct00036
Figure pct00036

Figure pct00037
Figure pct00037

Figure pct00038
Figure pct00038

본 발명의 다양한 특정 구체예가 본원에 기재되었으나, 본 발명은 상기 정밀한 구체예에 제한되지 않고, 본 발명의 범위 및 사상을 벗어나지 않고 당업자에 의해 다양한 변화 또는 변형이 이루어질 수 있음이 이해되야 한다.While various specific embodiments of the invention have been described herein, it is to be understood that the invention is not limited to the precise embodiments described above, and that various changes or modifications may be made by those skilled in the art without departing from the scope and spirit of the invention.

상기 제공된 실시예는 단지 예시이며, 본 발명의 모든 가능한 구체예, 적용 또는 변형의 총망라된 목록으로 이해되진 않는다. 따라서, 본 발명의 기재된 방법 및 시스템의 다양한 변형 및 변화는 본 발명의 범위 및 사상을 벗어남이 없이 당업자에게 명백할 것이다. 본 발명은 특정 구체예와 관련하여 기재되었으나, 청구된 본 발명은 상기 특정 구체예로 과하게 제한되지 않는 것이 이해되어야 한다. 또한, 분자생물학, 면역학, 화학, 생화학 또는 관련 분야의 당업자에게 명백한 본 발명을 수행하기 위한 기재된 방식의 다양한 변형은 첨부된 청구항의 범위에 포함된다.The examples provided above are illustrative only and are not to be understood as a comprehensive list of all possible embodiments, applications or variations of the present invention. Accordingly, various modifications and variations of the described methods and systems of the invention will be apparent to those skilled in the art without departing from the scope and spirit of the invention. While the invention has been described in connection with specific embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments. Moreover, various modifications of the described modes for carrying out the invention which are apparent to those skilled in the art of molecular biology, immunology, chemistry, biochemistry or related fields are included in the scope of the appended claims.

본 발명은 본원에 기재된 특정 방법, 프로토콜, 및 시약 등에 제한되지 않으며, 이들은 당업자가 인지하는 바와 같이 다양할 수 있음이 이해된다. 본원에서 사용되는 전문용어는 단지 특정 구체예를 기재하기 위한 목적이며, 본 발명의 범위를 제한하고자 하는 것이 아님이 또한 이해되어야 한다.It is to be understood that the present invention is not limited to the particular methods, protocols, reagents, and the like described herein, which can vary as one skilled in the art will recognize. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to limit the scope of the present invention.

본 발명의 구체예 및 이의 다양한 특징 및 유리한 세부사항은 비제한적인 구체예를 참조로 하여 더욱 충분히 설명되고/되거나, 하기 수반되는 도면에서 예시되고, 하기 기재에서 상술된다. 도면에 예시된 특징은 반드시 일정한 비례로 도시된 것이 아니며, 일 구체예의 특징은 비록 명백히 언급되어 있지 않더라도 당업자가 인지하는 바와 같이 다른 구체예에 이용될 수 있음이 주지되어야 한다.Embodiments of the present invention and various features and advantageous details thereof are more fully described with reference to non-limiting embodiments and / or illustrated in the accompanying drawings and detailed in the following description. It is to be noted that the features illustrated in the figures are not necessarily drawn to scale, and that the features of one embodiment may be used in other embodiments as those skilled in the art will recognize, even if not explicitly stated.

본원에 언급된 모든 수치값은 임의의 하한값과 임의의 상한값 사이에 적어도 2개 단위의 분리가 존재하는 경우에도 1개 단위의 증분으로 하한값 내지 상한값의 모든 값을 포함한다. 예를 들어, 성분의 농도 또는 공정 변수의 값, 예를 들어, 크기, 각도 크기, 압력, 시간 등이, 예를 들어, 1 내지 90, 특히 20 내지 80, 더욱 특히 30 내지 70인 것으로 언급되는 경우, 이는 15 내지 85, 22 내지 68, 43 내지 51, 30 내지 32 등과 같은 값이 본 명세서에서 명백히 열거되는 것이다. 1 미만인 값에 대해, 1 단위는 적절한 경우 0.0001, 0.001, 0.01 또는 0.1인 것으로 간주된다. 이는 특별히 의도하는 바의 단지 예시이며, 열거된 하한값 내지 상한값 사이의 수치값의 모든 가능한 조합이 유사한 방식으로 본 출원에 명백히 언급되는 것으로 간주되어야 한다.All numerical values referred to herein include all values from the lower limit to the upper limit in increments of one unit, even if there is at least two units of separation between any lower limit and any upper limit. For example, the concentration of a component or the value of a process variable, for example size, angular size, pressure, time, etc., is said to be, for example, 1 to 90, in particular 20 to 80, more particularly 30 to 70 In this case, values such as 15 to 85, 22 to 68, 43 to 51, 30 to 32 and the like are explicitly listed in the present specification. For values less than 1, 1 unit is considered to be 0.0001, 0.001, 0.01 or 0.1 where appropriate. This is merely an example of what is specifically intended, and all possible combinations of numerical values between the lowest value and the highest value enumerated, should be considered to be expressly stated in this application in a similar manner.

본원에 인용된 모든 참고문헌 및 간행물의 개시는 이들 각각이 개별적으로 참조로서 포함되는 것과 동일한 정도로 전체내용이 참조로서 명백히 포함된다.
The disclosures of all references and publications cited herein are expressly incorporated by reference in their entirety to the same extent as if each is individually incorporated by reference.

참고문헌references

Figure pct00039
Figure pct00039

Figure pct00040
Figure pct00040

Figure pct00041
Figure pct00041

Figure pct00042
Figure pct00042

Figure pct00043
Figure pct00043

Figure pct00044
Figure pct00044

Figure pct00045
Figure pct00045

Figure pct00046
Figure pct00046

Figure pct00047
Figure pct00047

Figure pct00048
Figure pct00048

SEQUENCE LISTING <110> Kretz-Rommel, Anke Wild, Martha A. Bowdish, Katherine Chen, Elise Oltean, Daniela Gonzalez, Maria Kapoor, Mili <120> POLYPEPTIDES THAT BIND IL-23R <130> 10-090 <140> US 12/703757 <141> 2010-02-10 <150> PCT/US09/60271 <151> 2009-10-09 <150> US 12/577,067 <151> 2009-10-09 <160> 494 <170> PatentIn version 3.5 <210> 1 <211> 189 <212> PRT <213> Homo sapiens <400> 1 Met Leu Gly Ser Arg Ala Val Met Leu Leu Leu Leu Leu Pro Trp Thr 1 5 10 15 Ala Gln Gly Arg Ala Val Pro Gly Gly Ser Ser Pro Ala Trp Thr Gln 20 25 30 Cys Gln Gln Leu Ser Gln Lys Leu Cys Thr Leu Ala Trp Ser Ala His 35 40 45 Pro Leu Val Gly His Met Asp Leu Arg Glu Glu Gly Asp Glu Glu Thr 50 55 60 Thr Asn Asp Val Pro His Ile Gln Cys Gly Asp Gly Cys Asp Pro Gln 65 70 75 80 Gly Leu Arg Asp Asn Ser Gln Phe Cys Leu Gln Arg Ile His Gln Gly 85 90 95 Leu Ile Phe Tyr Glu Lys Leu Leu Gly Ser Asp Ile Phe Thr Gly Glu 100 105 110 Pro Ser Leu Leu Pro Asp Ser Pro Val Gly Gln Leu His Ala Ser Leu 115 120 125 Leu Gly Leu Ser Gln Leu Leu Gln Pro Glu Gly His His Trp Glu Thr 130 135 140 Gln Gln Ile Pro Ser Leu Ser Pro Ser Gln Pro Trp Gln Arg Leu Leu 145 150 155 160 Leu Arg Phe Lys Ile Leu Arg Ser Leu Gln Ala Phe Val Ala Val Ala 165 170 175 Ala Arg Val Phe Ala His Gly Ala Ala Thr Leu Ser Pro 180 185 <210> 2 <211> 357 <212> PRT <213> Homo sapiens <400> 2 Met Cys His Gln Gln Leu Val Ile Ser Trp Phe Ser Leu Val Phe Leu 1 5 10 15 Ala Ser Pro Leu Val Ala Ile Trp Glu Leu Lys Lys Asp Val Tyr Val 20 25 30 Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu 35 40 45 Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile Thr Trp Thr Leu Asp Gln 50 55 60 Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr Ile Gln Val Lys 65 70 75 80 Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys His Lys Gly Gly Glu Val 85 90 95 Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly Ile Trp 100 105 110 Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu Pro Lys Asn Lys Thr Phe 115 120 125 Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp 130 135 140 Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 145 150 155 160 Gly Ser Ser Asp Pro Gln Gly Val Thr Cys Gly Ala Ala Thr Leu Ser 165 170 175 Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu 180 185 190 Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala Glu Glu Ser Leu Pro Ile 195 200 205 Glu Val Met Val Asp Ala Val His Lys Leu Lys Tyr Glu Asn Tyr Thr 210 215 220 Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys Pro Asp Pro Pro Lys Asn 225 230 235 240 Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg Gln Val Glu Val Ser Trp 245 250 255 Glu Tyr Pro Asp Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr 260 265 270 Phe Cys Val Gln Val Gln Gly Lys Ser Lys Arg Glu Lys Lys Asp Arg 275 280 285 Val Phe Thr Asp Lys Thr Ser Ala Thr Val Ile Cys Arg Lys Asn Ala 290 295 300 Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 305 310 315 320 Glu Trp Ala Ser Val Pro Cys Ser Val Asn Glu Glu Leu Pro Ser Ile 325 330 335 Asn Thr Tyr Phe Pro Gln Asn Ile Leu Glu Ser His Phe Asn Arg Ile 340 345 350 Ser Leu Leu Glu Lys 355 <210> 3 <211> 253 <212> PRT <213> Homo sapiens <400> 3 Met Trp Pro Pro Gly Ser Ala Ser Gln Pro Pro Pro Ser Pro Ala Ala 1 5 10 15 Ala Thr Gly Leu His Pro Ala Ala Arg Pro Val Ser Leu Gln Cys Arg 20 25 30 Leu Ser Met Cys Pro Ala Arg Ser Leu Leu Leu Val Ala Thr Leu Val 35 40 45 Leu Leu Asp His Leu Ser Leu Ala Arg Asn Leu Pro Val Ala Thr Pro 50 55 60 Asp Pro Gly Met Phe Pro Cys Leu His His Ser Gln Asn Leu Leu Arg 65 70 75 80 Ala Val Ser Asn Met Leu Gln Lys Ala Arg Gln Thr Leu Glu Phe Tyr 85 90 95 Pro Cys Thr Ser Glu Glu Ile Asp His Glu Asp Ile Thr Lys Asp Lys 100 105 110 Thr Ser Thr Val Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys Asn Glu 115 120 125 Ser Cys Leu Asn Ser Arg Glu Thr Ser Phe Ile Thr Asn Gly Ser Cys 130 135 140 Leu Ala Ser Arg Lys Thr Ser Phe Met Met Ala Leu Cys Leu Ser Ser 145 150 155 160 Ile Tyr Glu Asp Leu Lys Met Tyr Gln Val Glu Phe Lys Thr Met Asn 165 170 175 Ala Lys Leu Leu Met Asp Pro Lys Arg Gln Ile Phe Leu Asp Gln Asn 180 185 190 Met Leu Ala Val Ile Asp Glu Leu Met Gln Ala Leu Asn Phe Asn Ser 195 200 205 Glu Thr Val Pro Gln Lys Ser Ser Leu Glu Glu Pro Asp Phe Tyr Lys 210 215 220 Thr Lys Ile Lys Leu Cys Ile Leu Leu His Ala Phe Arg Ile Arg Ala 225 230 235 240 Val Thr Ile Asp Arg Val Met Ser Tyr Leu Asn Ala Ser 245 250 <210> 4 <211> 155 <212> PRT <213> Homo sapiens <400> 4 Met Thr Pro Gly Lys Thr Ser Leu Val Ser Leu Leu Leu Leu Leu Ser 1 5 10 15 Leu Glu Ala Ile Val Lys Ala Gly Ile Thr Ile Pro Arg Asn Pro Gly 20 25 30 Cys Pro Asn Ser Glu Asp Lys Asn Phe Pro Arg Thr Val Met Val Asn 35 40 45 Leu Asn Ile His Asn Arg Asn Thr Asn Thr Asn Pro Lys Arg Ser Ser 50 55 60 Asp Tyr Tyr Asn Arg Ser Thr Ser Pro Trp Asn Leu His Arg Asn Glu 65 70 75 80 Asp Pro Glu Arg Tyr Pro Ser Val Ile Trp Glu Ala Lys Cys Arg His 85 90 95 Leu Gly Cys Ile Asn Ala Asp Gly Asn Val Asp Tyr His Met Asn Ser 100 105 110 Val Pro Ile Gln Gln Glu Ile Leu Val Leu Arg Arg Glu Pro Pro His 115 120 125 Cys Pro Asn Ser Phe Arg Leu Glu Lys Ile Leu Val Ser Val Gly Cys 130 135 140 Thr Cys Val Thr Pro Ile Val His His Val Ala 145 150 155 <210> 5 <211> 227 <212> PRT <213> Homo sapiens <400> 5 Met Lys Asn Ser Asn Val Val Lys Met Leu Gln Glu Asn Ser Glu Leu 1 5 10 15 Met Asn Asn Asn Ser Ser Glu Gln Val Leu Tyr Val Asp Pro Met Ile 20 25 30 Thr Glu Ile Lys Glu Ile Phe Ile Pro Glu His Lys Pro Thr Asp Tyr 35 40 45 Lys Lys Glu Asn Thr Gly Pro Leu Glu Thr Arg Asp Tyr Pro Gln Asn 50 55 60 Ser Leu Phe Asp Asn Thr Thr Val Val Tyr Ile Pro Asp Leu Asn Thr 65 70 75 80 Gly Tyr Lys Pro Gln Ile Ser Asn Phe Leu Pro Glu Gly Ser His Leu 85 90 95 Ser Asn Asn Asn Glu Ile Thr Ser Leu Thr Leu Lys Pro Pro Val Asp 100 105 110 Ser Leu Asp Ser Gly Asn Asn Pro Arg Leu Gln Lys His Pro Asn Phe 115 120 125 Ala Phe Ser Val Ser Ser Val Asn Ser Leu Ser Asn Thr Ile Phe Leu 130 135 140 Gly Glu Leu Ser Leu Ile Leu Asn Gln Gly Glu Cys Ser Ser Pro Asp 145 150 155 160 Ile Gln Asn Ser Val Glu Glu Glu Thr Thr Met Leu Leu Glu Asn Asp 165 170 175 Ser Pro Ser Glu Thr Ile Pro Glu Gln Thr Leu Leu Pro Asp Glu Phe 180 185 190 Val Ser Cys Leu Gly Ile Val Asn Glu Glu Leu Pro Ser Ile Asn Thr 195 200 205 Tyr Phe Pro Gln Asn Ile Leu Glu Ser His Phe Asn Arg Ile Ser Leu 210 215 220 Leu Glu Lys 225 <210> 6 <211> 660 <212> PRT <213> Homo sapiens <400> 6 Met Glu Pro Leu Val Thr Trp Val Val Pro Leu Leu Phe Leu Phe Leu 1 5 10 15 Leu Ser Arg Gln Gly Ala Ala Cys Arg Thr Ser Glu Cys Cys Phe Gln 20 25 30 Asp Pro Pro Tyr Pro Asp Ala Asp Ser Gly Ser Ala Ser Gly Pro Arg 35 40 45 Asp Leu Arg Cys Tyr Arg Ile Ser Ser Asp Arg Tyr Glu Cys Ser Trp 50 55 60 Gln Tyr Glu Gly Pro Thr Ala Gly Val Ser His Phe Leu Arg Cys Cys 65 70 75 80 Leu Ser Ser Gly Arg Cys Cys Tyr Phe Ala Ala Gly Ser Ala Thr Arg 85 90 95 Leu Gln Phe Ser Asp Gln Ala Gly Val Ser Val Leu Tyr Thr Val Thr 100 105 110 Leu Trp Val Glu Ser Trp Ala Arg Asn Gln Thr Glu Lys Ser Pro Glu 115 120 125 Val Thr Leu Gln Leu Tyr Asn Ser Val Lys Tyr Glu Pro Pro Leu Gly 130 135 140 Asp Ile Lys Val Ser Lys Leu Ala Gly Gln Leu Arg Met Glu Trp Glu 145 150 155 160 Thr Pro Asp Asn Gln Val Gly Ala Glu Val Gln Phe Arg His Arg Thr 165 170 175 Pro Ser Ser Pro Trp Lys Leu Gly Asp Cys Gly Pro Gln Asp Asp Asp 180 185 190 Thr Glu Ser Cys Leu Cys Pro Leu Glu Met Asn Val Ala Gln Glu Phe 195 200 205 Gln Leu Arg Arg Arg Gln Leu Gly Ser Gln Gly Ser Ser Trp Ser Lys 210 215 220 Trp Ser Ser Pro Val Cys Val Pro Pro Glu Asn Pro Pro Gln Pro Gln 225 230 235 240 Val Arg Phe Ser Val Glu Gln Leu Gly Gln Asp Gly Arg Arg Arg Leu 245 250 255 Thr Leu Lys Glu Gln Pro Thr Gln Leu Glu Leu Pro Glu Gly Cys Gln 260 265 270 Gly Leu Ala Pro Gly Thr Glu Val Thr Tyr Arg Leu Gln Leu His Met 275 280 285 Leu Ser Cys Pro Cys Lys Ala Lys Ala Thr Arg Thr Leu His Leu Gly 290 295 300 Lys Met Pro Tyr Leu Ser Gly Ala Ala Tyr Asn Val Ala Val Ile Ser 305 310 315 320 Ser Asn Gln Phe Gly Pro Gly Leu Asn Gln Thr Trp His Ile Pro Ala 325 330 335 Asp Thr His Thr Glu Pro Val Ala Leu Asn Ile Ser Val Gly Thr Asn 340 345 350 Gly Thr Thr Met Tyr Trp Pro Ala Arg Ala Gln Ser Met Thr Tyr Cys 355 360 365 Ile Glu Trp Gln Pro Val Gly Gln Asp Gly Gly Leu Ala Thr Cys Ser 370 375 380 Leu Thr Ala Pro Gln Asp Pro Asp Pro Ala Gly Met Ala Thr Tyr Ser 385 390 395 400 Trp Ser Arg Glu Ser Gly Ala Met Gly Gln Glu Lys Cys Tyr Tyr Ile 405 410 415 Thr Ile Phe Ala Ser Ala His Pro Glu Lys Leu Thr Leu Trp Ser Thr 420 425 430 Val Leu Ser Thr Tyr His Phe Gly Gly Asn Ala Ser Ala Ala Gly Thr 435 440 445 Pro His His Val Ser Val Lys Asn His Ser Leu Asp Ser Val Ser Val 450 455 460 Asp Trp Ala Pro Ser Leu Leu Ser Thr Cys Pro Gly Val Leu Lys Glu 465 470 475 480 Tyr Val Val Arg Cys Arg Asp Glu Asp Ser Lys Gln Val Ser Glu His 485 490 495 Pro Val Gln Pro Thr Glu Thr Gln Val Thr Leu Ser Gly Leu Arg Ala 500 505 510 Gly Val Ala Tyr Thr Val Gln Val Arg Ala Asp Thr Ala Trp Leu Arg 515 520 525 Gly Val Trp Ser Gln Pro Gln Arg Phe Ser Ile Glu Val Gln Val Ser 530 535 540 Asp Trp Leu Ile Phe Phe Ala Ser Leu Gly Ser Phe Leu Ser Ile Leu 545 550 555 560 Leu Val Gly Val Leu Gly Tyr Leu Gly Leu Asn Arg Ala Ala Arg His 565 570 575 Leu Cys Pro Pro Leu Pro Thr Pro Cys Ala Ser Ser Ala Ile Glu Phe 580 585 590 Pro Gly Gly Lys Glu Thr Trp Gln Trp Ile Asn Pro Val Asp Phe Gln 595 600 605 Glu Glu Ala Ser Leu Gln Glu Ala Leu Val Val Glu Met Ser Trp Asp 610 615 620 Lys Gly Glu Arg Thr Glu Pro Leu Glu Lys Thr Glu Leu Pro Glu Gly 625 630 635 640 Ala Pro Glu Leu Ala Leu Asp Thr Glu Leu Ser Leu Glu Asp Gly Asp 645 650 655 Arg Cys Asp Arg 660 <210> 7 <211> 862 <212> PRT <213> Homo sapiens <400> 7 Met Ala His Thr Phe Arg Gly Cys Ser Leu Ala Phe Met Phe Ile Ile 1 5 10 15 Thr Trp Leu Leu Ile Lys Ala Lys Ile Asp Ala Cys Lys Arg Gly Asp 20 25 30 Val Thr Val Lys Pro Ser His Val Ile Leu Leu Gly Ser Thr Val Asn 35 40 45 Ile Thr Cys Ser Leu Lys Pro Arg Gln Gly Cys Phe His Tyr Ser Arg 50 55 60 Arg Asn Lys Leu Ile Leu Tyr Lys Phe Asp Arg Arg Ile Asn Phe His 65 70 75 80 His Gly His Ser Leu Asn Ser Gln Val Thr Gly Leu Pro Leu Gly Thr 85 90 95 Thr Leu Phe Val Cys Lys Leu Ala Cys Ile Asn Ser Asp Glu Ile Gln 100 105 110 Ile Cys Gly Ala Glu Ile Phe Val Gly Val Ala Pro Glu Gln Pro Gln 115 120 125 Asn Leu Ser Cys Ile Gln Lys Gly Glu Gln Gly Thr Val Ala Cys Thr 130 135 140 Trp Glu Arg Gly Arg Asp Thr His Leu Tyr Thr Glu Tyr Thr Leu Gln 145 150 155 160 Leu Ser Gly Pro Lys Asn Leu Thr Trp Gln Lys Gln Cys Lys Asp Ile 165 170 175 Tyr Cys Asp Tyr Leu Asp Phe Gly Ile Asn Leu Thr Pro Glu Ser Pro 180 185 190 Glu Ser Asn Phe Thr Ala Lys Val Thr Ala Val Asn Ser Leu Gly Ser 195 200 205 Ser Ser Ser Leu Pro Ser Thr Phe Thr Phe Leu Asp Ile Val Arg Pro 210 215 220 Leu Pro Pro Trp Asp Ile Arg Ile Lys Phe Gln Lys Ala Ser Val Ser 225 230 235 240 Arg Cys Thr Leu Tyr Trp Arg Asp Glu Gly Leu Val Leu Leu Asn Arg 245 250 255 Leu Arg Tyr Arg Pro Ser Asn Ser Arg Leu Trp Asn Met Val Asn Val 260 265 270 Thr Lys Ala Lys Gly Arg His Asp Leu Leu Asp Leu Lys Pro Phe Thr 275 280 285 Glu Tyr Glu Phe Gln Ile Ser Ser Lys Leu His Leu Tyr Lys Gly Ser 290 295 300 Trp Ser Asp Trp Ser Glu Ser Leu Arg Ala Gln Thr Pro Glu Glu Glu 305 310 315 320 Pro Thr Gly Met Leu Asp Val Trp Tyr Met Lys Arg His Ile Asp Tyr 325 330 335 Ser Arg Gln Gln Ile Ser Leu Phe Trp Lys Asn Leu Ser Val Ser Glu 340 345 350 Ala Arg Gly Lys Ile Leu His Tyr Gln Val Thr Leu Gln Glu Leu Thr 355 360 365 Gly Gly Lys Ala Met Thr Gln Asn Ile Thr Gly His Thr Ser Trp Thr 370 375 380 Thr Val Ile Pro Arg Thr Gly Asn Trp Ala Val Ala Val Ser Ala Ala 385 390 395 400 Asn Ser Lys Gly Ser Ser Leu Pro Thr Arg Ile Asn Ile Met Asn Leu 405 410 415 Cys Glu Ala Gly Leu Leu Ala Pro Arg His Val Ser Ala Asn Ser Glu 420 425 430 Gly Met Asp Asn Ile Leu Val Thr Trp Gln Pro Pro Arg Lys Asp Pro 435 440 445 Ser Ala Val Gln Glu Tyr Val Val Glu Trp Arg Glu Leu His Pro Gly 450 455 460 Gly Asp Thr Gln Val Pro Leu Asn Trp Leu Arg Ser Arg Pro Tyr Asn 465 470 475 480 Val Ser Ala Leu Ile Ser Glu Asn Ile Lys Ser Tyr Ile Cys Tyr Glu 485 490 495 Ile Arg Val Tyr Ala Leu Ser Gly Asp Gln Gly Gly Cys Ser Ser Ile 500 505 510 Leu Gly Asn Ser Lys His Lys Ala Pro Leu Ser Gly Pro His Ile Asn 515 520 525 Ala Ile Thr Glu Glu Lys Gly Ser Ile Leu Ile Ser Trp Asn Ser Ile 530 535 540 Pro Val Gln Glu Gln Met Gly Cys Leu Leu His Tyr Arg Ile Tyr Trp 545 550 555 560 Lys Glu Arg Asp Ser Asn Ser Gln Pro Gln Leu Cys Glu Ile Pro Tyr 565 570 575 Arg Val Ser Gln Asn Ser His Pro Ile Asn Ser Leu Gln Pro Arg Val 580 585 590 Thr Tyr Val Leu Trp Met Thr Ala Leu Thr Ala Ala Gly Glu Ser Ser 595 600 605 His Gly Asn Glu Arg Glu Phe Cys Leu Gln Gly Lys Ala Asn Trp Met 610 615 620 Ala Phe Val Ala Pro Ser Ile Cys Ile Ala Ile Ile Met Val Gly Ile 625 630 635 640 Phe Ser Thr His Tyr Phe Gln Gln Lys Val Phe Val Leu Leu Ala Ala 645 650 655 Leu Arg Pro Gln Trp Cys Ser Arg Glu Ile Pro Asp Pro Ala Asn Ser 660 665 670 Thr Cys Ala Lys Lys Tyr Pro Ile Ala Glu Glu Lys Thr Gln Leu Pro 675 680 685 Leu Asp Arg Leu Leu Ile Asp Trp Pro Thr Pro Glu Asp Pro Glu Pro 690 695 700 Leu Val Ile Ser Glu Val Leu His Gln Val Thr Pro Val Phe Arg His 705 710 715 720 Pro Pro Cys Ser Asn Trp Pro Gln Arg Glu Lys Gly Ile Gln Gly His 725 730 735 Gln Ala Ser Glu Lys Asp Met Met His Ser Ala Ser Ser Pro Pro Pro 740 745 750 Pro Arg Ala Leu Gln Ala Glu Ser Arg Gln Leu Val Asp Leu Tyr Lys 755 760 765 Val Leu Glu Ser Arg Gly Ser Asp Pro Lys Pro Glu Asn Pro Ala Cys 770 775 780 Pro Trp Thr Val Leu Pro Ala Gly Asp Leu Pro Thr His Asp Gly Tyr 785 790 795 800 Leu Pro Ser Asn Ile Asp Asp Leu Pro Ser His Glu Ala Pro Leu Ala 805 810 815 Asp Ser Leu Glu Glu Leu Glu Pro Gln His Ile Ser Leu Ser Val Phe 820 825 830 Pro Ser Ser Ser Leu His Pro Leu Thr Phe Ser Cys Gly Asp Lys Leu 835 840 845 Thr Leu Asp Gln Leu Lys Met Arg Cys Asp Ser Leu Met Leu 850 855 860 <210> 8 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 8 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val Cys Leu Lys 50 <210> 9 <211> 54 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 9 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr 20 25 30 Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val Ser Leu Lys Gly Ser 50 <210> 10 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 10 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val <210> 11 <211> 47 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 11 Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys 1 5 10 15 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln Glu Val 20 25 30 Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys 35 40 45 <210> 12 <211> 43 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 12 Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu 1 5 10 15 Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu Leu Lys 20 25 30 Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys 35 40 <210> 13 <211> 37 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 13 Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp 1 5 10 15 Thr Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln 20 25 30 Thr Val Ser Leu Lys 35 <210> 14 <211> 33 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 14 Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln 1 5 10 15 Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu 20 25 30 Lys <210> 15 <211> 29 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 15 Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu 1 5 10 15 Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys 20 25 <210> 16 <211> 25 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 16 Ser Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln 1 5 10 15 Gln Ala Leu Gln Thr Val Ser Leu Lys 20 25 <210> 17 <211> 43 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 17 Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys 1 5 10 15 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln Glu Val 20 25 30 Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 <210> 18 <211> 41 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 18 Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys 1 5 10 15 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln Glu Val 20 25 30 Ala Leu Leu Lys Glu Gln Gln Ala Leu 35 40 <210> 19 <211> 38 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 19 Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys 1 5 10 15 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln Glu Val 20 25 30 Ala Leu Leu Lys Glu Gln 35 <210> 20 <211> 34 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 20 Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys 1 5 10 15 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln Glu Val 20 25 30 Ala Leu <210> 21 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 21 Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys 1 5 10 15 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln Glu 20 25 30 <210> 22 <211> 40 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 22 Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu 1 5 10 15 Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys 20 25 30 Glu Gln Gln Ala Leu Gln Thr Val 35 40 <210> 23 <211> 33 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 23 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 1 5 10 15 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 20 25 30 Val <210> 24 <211> 53 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 24 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val Ser Leu Lys Gly 50 <210> 25 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 25 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val Ser Leu Lys 50 <210> 26 <211> 51 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 26 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val Ser Leu 50 <210> 27 <211> 50 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 27 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val Ser 50 <210> 28 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 28 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val <210> 29 <211> 48 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 29 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 <210> 30 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 30 Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val 1 5 10 15 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 20 25 30 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val 35 40 45 Ser Leu Lys Gly 50 <210> 31 <211> 48 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 31 Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val 1 5 10 15 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 20 25 30 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val 35 40 45 <210> 32 <211> 51 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 32 Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val 1 5 10 15 Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala 20 25 30 Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser 35 40 45 Leu Lys Gly 50 <210> 33 <211> 50 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 33 Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn 1 5 10 15 Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln 20 25 30 Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu 35 40 45 Lys Gly 50 <210> 34 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 34 Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr 1 5 10 15 Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu 20 25 30 Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys 35 40 45 Gly <210> 35 <211> 48 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 35 Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys 1 5 10 15 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val 20 25 30 Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly 35 40 45 <210> 36 <211> 47 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 36 Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met 1 5 10 15 Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala 20 25 30 Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly 35 40 45 <210> 37 <211> 46 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 37 Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe 1 5 10 15 Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu 20 25 30 Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly 35 40 45 <210> 38 <211> 45 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 38 Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu 1 5 10 15 Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu 20 25 30 Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly 35 40 45 <210> 39 <211> 44 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 39 Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu 1 5 10 15 Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys 20 25 30 Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly 35 40 <210> 40 <211> 43 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 40 Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu 1 5 10 15 Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu 20 25 30 Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly 35 40 <210> 41 <211> 42 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 41 Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys 1 5 10 15 Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln 20 25 30 Gln Ala Leu Gln Thr Val Ser Leu Lys Gly 35 40 <210> 42 <211> 41 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 42 Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser 1 5 10 15 Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln 20 25 30 Ala Leu Gln Thr Val Ser Leu Lys Gly 35 40 <210> 43 <211> 40 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 43 Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg 1 5 10 15 Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala 20 25 30 Leu Gln Thr Val Ser Leu Lys Gly 35 40 <210> 44 <211> 39 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 44 Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu 1 5 10 15 Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu 20 25 30 Gln Thr Val Ser Leu Lys Gly 35 <210> 45 <211> 37 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 45 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 1 5 10 15 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 20 25 30 Val Ser Leu Lys Gly 35 <210> 46 <211> 36 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 46 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 1 5 10 15 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val 20 25 30 Ser Leu Lys Gly 35 <210> 47 <211> 35 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 47 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 1 5 10 15 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val 20 25 30 Ser Leu Lys 35 <210> 48 <211> 34 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 48 Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala 1 5 10 15 Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser 20 25 30 Leu Lys <210> 49 <211> 33 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 49 Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln 1 5 10 15 Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu 20 25 30 Lys <210> 50 <211> 32 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 50 Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu 1 5 10 15 Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys 20 25 30 <210> 51 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 51 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val 1 5 10 15 Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys 20 25 30 <210> 52 <211> 33 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 52 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 1 5 10 15 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 20 25 30 Val <210> 53 <211> 32 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 53 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 1 5 10 15 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 20 25 30 <210> 54 <211> 30 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 54 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 1 5 10 15 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln 20 25 30 <210> 55 <211> 35 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 55 Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala 1 5 10 15 Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser 20 25 30 Leu Lys Gly 35 <210> 56 <211> 34 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 56 Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln 1 5 10 15 Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu 20 25 30 Lys Gly <210> 57 <211> 33 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 57 Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu 1 5 10 15 Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys 20 25 30 Gly <210> 58 <211> 32 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 58 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val 1 5 10 15 Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly 20 25 30 <210> 59 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 59 Glu Gly Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val Ser Leu Lys 50 <210> 60 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 60 Glu Gly Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val <210> 61 <211> 48 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 61 Glu Gly Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 <210> 62 <211> 47 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 62 Glu Gly Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln 35 40 45 <210> 63 <211> 43 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 63 Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu 1 5 10 15 Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys 20 25 30 Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys 35 40 <210> 64 <211> 40 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 64 Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu 1 5 10 15 Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys 20 25 30 Glu Gln Gln Ala Leu Gln Thr Val 35 40 <210> 65 <211> 39 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 65 Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu 1 5 10 15 Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys 20 25 30 Glu Gln Gln Ala Leu Gln Thr 35 <210> 66 <211> 38 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 66 Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu 1 5 10 15 Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys 20 25 30 Glu Gln Gln Ala Leu Gln 35 <210> 67 <211> 35 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 67 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 1 5 10 15 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val 20 25 30 Ser Leu Lys 35 <210> 68 <211> 32 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 68 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 1 5 10 15 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val 20 25 30 <210> 69 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 69 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 1 5 10 15 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 20 25 30 <210> 70 <211> 30 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 70 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 1 5 10 15 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln 20 25 30 <210> 71 <211> 40 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 71 Met Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu 1 5 10 15 Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu 20 25 30 Lys Glu Gln Gln Ala Leu Gln Thr 35 40 <210> 72 <211> 32 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 72 Met Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 1 5 10 15 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 20 25 30 <210> 73 <211> 53 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 73 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr 20 25 30 Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val Ser Leu Lys Gly 50 <210> 74 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 74 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr 20 25 30 Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val Ser Leu Lys 50 <210> 75 <211> 51 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 75 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr 20 25 30 Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val Ser Leu 50 <210> 76 <211> 50 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 76 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr 20 25 30 Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val Ser 50 <210> 77 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 77 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr 20 25 30 Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val <210> 78 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 78 Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val 1 5 10 15 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu 20 25 30 Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val 35 40 45 Ser Leu Lys Gly 50 <210> 79 <211> 51 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 79 Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val 1 5 10 15 Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser 20 25 30 Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser 35 40 45 Leu Lys Gly 50 <210> 80 <211> 50 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 80 Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn 1 5 10 15 Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln 20 25 30 Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu 35 40 45 Lys Gly 50 <210> 81 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 81 Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr 1 5 10 15 Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln Glu 20 25 30 Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys 35 40 45 Gly <210> 82 <211> 48 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 82 Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys 1 5 10 15 Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln Glu Val 20 25 30 Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly 35 40 45 <210> 83 <211> 47 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 83 Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met 1 5 10 15 Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln Glu Val Ala 20 25 30 Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly 35 40 45 <210> 84 <211> 46 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 84 Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe 1 5 10 15 Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu 20 25 30 Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly 35 40 45 <210> 85 <211> 45 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 85 Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu 1 5 10 15 Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu Leu 20 25 30 Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly 35 40 45 <210> 86 <211> 44 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 86 Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu 1 5 10 15 Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu Leu Lys 20 25 30 Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly 35 40 <210> 87 <211> 43 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 87 Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu 1 5 10 15 Lys Ala Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu Leu Lys Glu 20 25 30 Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly 35 40 <210> 88 <211> 42 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 88 Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys 1 5 10 15 Ala Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln 20 25 30 Gln Ala Leu Gln Thr Val Ser Leu Lys Gly 35 40 <210> 89 <211> 41 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 89 Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala 1 5 10 15 Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln 20 25 30 Ala Leu Gln Thr Val Ser Leu Lys Gly 35 40 <210> 90 <211> 40 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 90 Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg 1 5 10 15 Leu Asp Thr Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala 20 25 30 Leu Gln Thr Val Ser Leu Lys Gly 35 40 <210> 91 <211> 39 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 91 Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu 1 5 10 15 Asp Thr Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu 20 25 30 Gln Thr Val Ser Leu Lys Gly 35 <210> 92 <211> 37 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 92 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr 1 5 10 15 Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 20 25 30 Val Ser Leu Lys Gly 35 <210> 93 <211> 36 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 93 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu 1 5 10 15 Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val 20 25 30 Ser Leu Lys Gly 35 <210> 94 <211> 35 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 94 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu 1 5 10 15 Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val 20 25 30 Ser Leu Lys 35 <210> 95 <211> 34 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 95 Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser 1 5 10 15 Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser 20 25 30 Leu Lys <210> 96 <211> 33 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 96 Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln 1 5 10 15 Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu 20 25 30 Lys <210> 97 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 97 Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln Glu Val 1 5 10 15 Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys 20 25 30 <210> 98 <211> 71 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 98 Met Gly Ser His His His His His Gly Ser Ile Gln Gly Arg Ser Pro 1 5 10 15 Gly Thr Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys 20 25 30 Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu 35 40 45 Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu 50 55 60 Gln Thr Val Ser Leu Lys Gly 65 70 <210> 99 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 99 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val Cys Leu Lys 50 <210> 100 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 100 Glu Ser Pro Thr Pro Lys Ala Lys Lys Ala Ala Asn Ala Lys Lys Asp 1 5 10 15 Leu Val Ser Ser Lys Met Phe Glu Glu Leu Lys Asn Arg Met Asp Val 20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Lys Gln Ala Leu Gln Thr 35 40 45 Val Cys Leu Lys 50 <210> 101 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 101 Gln Gln Asn Gly Lys Gly Arg Gln Lys Pro Ala Ala Ser Lys Lys Asp 1 5 10 15 Gly Val Ser Leu Lys Met Ile Glu Asp Leu Lys Ala Met Ile Asp Asn 20 25 30 Ile Ser Gln Glu Val Ala Leu Leu Lys Glu Lys Gln Ala Leu Gln Thr 35 40 45 Val Cys Leu Lys 50 <210> 102 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 102 Glu Thr Pro Thr Pro Lys Ala Lys Lys Ala Ala Asn Ala Lys Lys Asp 1 5 10 15 Ala Val Ser Pro Lys Met Leu Glu Glu Leu Lys Thr Gln Leu Asp Ser 20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val Cys Leu Lys 50 <210> 103 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 103 Gln Gln Thr Ser Ser Lys Lys Lys Gly Gly Lys Lys Asp Ala Glu Asn 1 5 10 15 Asn Ala Ala Ile Glu Glu Leu Lys Lys Gln Ile Asp Asn Ile Val Leu 20 25 30 Glu Leu Asn Leu Leu Lys Glu Gln Gln Ala Leu Gln Ser Val Cys Leu 35 40 45 Lys <210> 104 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 104 Gln Gln Asn Gly Lys Lys Asn Lys Gln Asn Asn Lys Asp Val Val Ser 1 5 10 15 Met Lys Met Tyr Glu Asp Leu Lys Lys Lys Val Gln Asn Ile Glu Glu 20 25 30 Asp Val Ile His Leu Lys Glu Gln Gln Ala Leu Gln Thr Ile Cys Leu 35 40 45 Lys <210> 105 <211> 48 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 105 Glu Gln Ser Leu Thr Lys Arg Lys Asn Gly Lys Lys Glu Ser Asn Ser 1 5 10 15 Ala Ala Ile Glu Glu Leu Lys Lys Gln Ile Asp Gln Ile Ile Gln Asp 20 25 30 Leu Asn Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Cys Leu Lys 35 40 45 <210> 106 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 106 Gln Thr Ser Cys His Ala Ser Lys Phe Lys Ala Arg Lys His Ser Lys 1 5 10 15 Arg Arg Val Lys Glu Lys Asp Gly Asp Leu Lys Thr Gln Val Glu Lys 20 25 30 Leu Trp Arg Glu Val Asn Ala Leu Lys Glu Met Gln Ala Leu Gln Thr 35 40 45 Val Cys Leu Arg 50 <210> 107 <211> 38 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 107 Lys Pro Ser Lys Ser Gly Lys Gly Lys Asp Asp Leu Arg Asn Glu Ile 1 5 10 15 Asp Lys Leu Trp Arg Glu Val Asn Ser Leu Lys Glu Met Gln Ala Leu 20 25 30 Gln Thr Val Cys Leu Lys 35 <210> 108 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (1)..(25) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (27)..(29) <223> Xaa can be any naturally occurring amino acid <220> <221> MISC_FEATURE <222> (30)..(30) <223> Xaa can be any hydrophobic naturally occurring amino acid <220> <221> misc_feature <222> (31)..(32) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (34)..(35) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (38)..(39) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (43)..(43) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (52)..(52) <223> Xaa can be any naturally occurring amino acid <400> 108 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1 5 10 15 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa 20 25 30 Leu Xaa Xaa Glu Val Xaa Xaa Leu Lys Glu Xaa Gln Ala Leu Gln Thr 35 40 45 Val Cys Leu Xaa 50 <210> 109 <211> 137 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 109 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp 20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu 35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala 50 55 60 Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile 85 90 95 Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu Asn Cys Ala Val Leu Ser 100 105 110 Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Gln Leu 115 120 125 Pro Tyr Ile Cys Gln Phe Gly Ile Val 130 135 <210> 110 <211> 126 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 110 Asn Lys Leu His Ala Gly Ser Met Gly Lys Lys Ser Gly Lys Lys Phe 1 5 10 15 Phe Val Thr Asn His Glu Arg Met Pro Phe Ser Lys Val Lys Ala Leu 20 25 30 Cys Ser Glu Leu Arg Gly Thr Val Ala Ile Pro Arg Asn Ala Glu Glu 35 40 45 Asn Lys Ala Ile Gln Glu Val Ala Lys Thr Ser Ala Phe Leu Gly Ile 50 55 60 Thr Asp Glu Val Thr Glu Gly Gln Phe Met Tyr Val Thr Gly Gly Arg 65 70 75 80 Leu Thr Tyr Ser Asn Trp Lys Lys Asp Glu Pro Asn Asp His Gly Ser 85 90 95 Gly Glu Asp Cys Val Thr Ile Val Asp Asn Gly Leu Trp Asn Asp Ile 100 105 110 Ser Cys Gln Ala Ser His Thr Ala Val Cys Ser Phe Pro Ala 115 120 125 <210> 111 <211> 127 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 111 Lys Lys Val Glu Leu Phe Pro Asn Gly Gln Ser Val Gly Glu Lys Ile 1 5 10 15 Phe Lys Thr Ala Gly Phe Val Lys Pro Phe Thr Glu Ala Gln Leu Leu 20 25 30 Cys Thr Gln Ala Gly Gly Gln Leu Ala Ser Pro Arg Ser Ala Ala Glu 35 40 45 Asn Ala Ala Leu Gln Gln Leu Val Val Ala Lys Asn Glu Ala Ala Phe 50 55 60 Leu Ser Met Thr Asp Ser Lys Thr Glu Gly Lys Phe Thr Tyr Pro Thr 65 70 75 80 Gly Glu Ser Leu Val Tyr Ser Asn Trp Ala Pro Gly Glu Pro Asn Asp 85 90 95 Asp Gly Gly Ser Glu Asp Cys Val Glu Ile Phe Thr Asn Gly Lys Trp 100 105 110 Asn Asp Arg Ala Cys Gly Glu Lys Arg Leu Val Val Cys Ala Phe 115 120 125 <210> 112 <211> 123 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 112 Lys Val Tyr Trp Phe Cys Tyr Gly Met Lys Cys Tyr Tyr Phe Val Met 1 5 10 15 Asp Arg Lys Thr Trp Ser Gly Cys Lys Gln Thr Cys Gln Ser Ser Ser 20 25 30 Leu Ser Leu Leu Lys Ile Asp Asp Glu Asp Glu Leu Lys Phe Leu Gln 35 40 45 Leu Leu Val Val Pro Ser Asp Ser Cys Trp Val Gly Leu Ser Tyr Asp 50 55 60 Asn Lys Lys Asp Trp Ala Trp Ile Asp Asn Arg Pro Ser Lys Leu Ala 65 70 75 80 Leu Asn Thr Arg Lys Tyr Asn Ile Arg Asp Arg Gly Gly Cys Met Leu 85 90 95 Leu Ser Lys Thr Arg Leu Asp Asn Gly Asn Cys Asp Gln Val Phe Ile 100 105 110 Cys Ile Cys Gly Lys Arg Leu Asp Lys Phe Pro 115 120 <210> 113 <211> 128 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 113 Cys Pro Val Asn Trp Val Glu His Glu Arg Ser Cys Tyr Trp Phe Ser 1 5 10 15 Arg Ser Gly Lys Ala Trp Ala Asp Ala Asp Asn Tyr Cys Arg Leu Glu 20 25 30 Asp Ala His Leu Val Val Val Thr Ser Trp Glu Glu Gln Leu Phe Val 35 40 45 Gln His His Ile Gly Pro Val Asn Thr Trp Met Gly Leu His Asp Gln 50 55 60 Asn Gly Pro Trp Lys Trp Val Asp Gly Thr Asp Tyr Glu Thr Gly Phe 65 70 75 80 Lys Asn Trp Arg Pro Glu Gln Pro Asp Asp Trp Tyr Gly His Gly Leu 85 90 95 Gly Gly Gly Glu Asp Cys Ala His Phe Thr Asp Asp Gly Arg Trp Asn 100 105 110 Asp Asp Val Cys Gln Arg Pro Tyr Arg Trp Val Cys Ser Thr Glu Leu 115 120 125 <210> 114 <211> 147 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 114 Gly Ile Pro Lys Cys Pro Glu Asp Trp Gly Ala Ser Ser Arg Thr Ser 1 5 10 15 Leu Cys Phe Lys Leu Tyr Ala Lys Gly Lys His Glu Lys Lys Thr Trp 20 25 30 Phe Glu Ser Arg Asp Phe Cys Arg Ala Leu Gly Gly Asp Leu Ala Ser 35 40 45 Ile Asn Asn Lys Glu Glu Gln Gln Thr Ile Trp Arg Leu Ile Thr Ala 50 55 60 Ser Gly Ser Tyr His Lys Leu Phe Trp Leu Gly Leu Thr Tyr Gly Ser 65 70 75 80 Pro Ser Glu Gly Phe Thr Trp Ser Asp Gly Ser Pro Val Ser Tyr Glu 85 90 95 Asn Trp Ala Tyr Gly Glu Pro Asn Asn Tyr Gln Asn Val Glu Tyr Cys 100 105 110 Gly Glu Leu Lys Gly Asp Pro Thr Met Ser Trp Asn Asp Ile Asn Cys 115 120 125 Glu His Leu Asn Asn Trp Ile Cys Gln Ile Gln Lys Gly Gln Thr Pro 130 135 140 Lys Pro Asp 145 <210> 115 <211> 129 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 115 Asp Cys Leu Ser Gly Trp Ser Ser Tyr Glu Gly His Cys Tyr Lys Ala 1 5 10 15 Phe Ser Lys Tyr Lys Thr Trp Glu Asp Ala Glu Arg Val Cys Thr Glu 20 25 30 Gln Ala Lys Gly Ala His Leu Val Ser Ile Glu Ser Ser Gly Glu Ala 35 40 45 Asp Phe Val Ala Gln Leu Val Thr Gln Asn Met Lys Arg Leu Asp Phe 50 55 60 Tyr Ile Trp Ile Gly Leu Arg Val Gln Gly Lys Val Lys Gln Cys Asn 65 70 75 80 Ser Glu Trp Ser Asp Gly Ser Ser Val Ser Tyr Glu Asn Trp Ile Glu 85 90 95 Ala Glu Ser Lys Thr Cys Leu Gly Leu Glu Lys Glu Thr Asp Phe Arg 100 105 110 Lys Trp Val Asn Ile Tyr Cys Gly Gln Gln Asn Pro Phe Val Cys Glu 115 120 125 Ala <210> 116 <211> 122 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 116 Asp Cys Pro Ser Asp Trp Ser Ser Tyr Glu Gly His Cys Tyr Lys Pro 1 5 10 15 Phe Ser Glu Pro Lys Asn Trp Ala Asp Ala Glu Asn Phe Cys Thr Gln 20 25 30 Gln His Ala Gly Gly His Leu Val Ser Phe Gln Ser Ser Glu Glu Ala 35 40 45 Asp Phe Val Val Lys Leu Ala Phe Gln Thr Phe His Ser Ile Phe Trp 50 55 60 Met Gly Leu Ser Asn Val Trp Asn Gln Cys Asn Trp Gln Trp Ser Asn 65 70 75 80 Ala Ala Met Leu Arg Tyr Lys Ala Trp Ala Glu Glu Ser Tyr Cys Val 85 90 95 Tyr Phe Lys Ser Thr Asn Asn Lys Trp Arg Ser Arg Ala Cys Arg Met 100 105 110 Met Ala Gln Phe Val Cys Glu Phe Gln Ala 115 120 <210> 117 <211> 135 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 117 Ala Arg Ile Ser Cys Pro Glu Gly Thr Asn Ala Tyr Arg Ser Tyr Cys 1 5 10 15 Tyr Tyr Phe Asn Glu Asp Arg Glu Thr Trp Val Asp Ala Asp Leu Tyr 20 25 30 Cys Gln Asn Met Asn Ser Gly Asn Leu Val Ser Val Leu Thr Gln Ala 35 40 45 Glu Gly Ala Phe Val Ala Ser Leu Ile Lys Glu Ser Gly Thr Asp Asp 50 55 60 Phe Asn Val Trp Ile Gly Leu His Asp Pro Lys Lys Asn Arg Arg Trp 65 70 75 80 His Trp Ser Ser Gly Ser Leu Val Ser Tyr Lys Ser Trp Gly Ile Gly 85 90 95 Ala Pro Ser Ser Val Asn Pro Gly Tyr Cys Val Ser Leu Thr Ser Ser 100 105 110 Thr Gly Phe Gly Lys Trp Lys Asp Val Pro Cys Glu Asp Lys Phe Ser 115 120 125 Phe Val Cys Lys Phe Lys Asn 130 135 <210> 118 <211> 123 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 118 Asp Tyr Glu Ile Leu Phe Ser Asp Glu Thr Met Asn Tyr Ala Asp Ala 1 5 10 15 Gly Thr Tyr Cys Gly Ser Arg Gly Met Ala Leu Val Ser Ser Ala Met 20 25 30 Arg Asp Ser Thr Met Val Lys Ala Ile Leu Ala Phe Thr Glu Val Lys 35 40 45 Gly His Asp Tyr Trp Val Gly Ala Asp Asn Leu Gln Asp Gly Ala Tyr 50 55 60 Asn Phe Asn Trp Asn Asp Gly Val Ser Leu Pro Thr Asp Ser Asp Leu 65 70 75 80 Trp Ser Pro Asn Glu Pro Ser Asn Pro Gln Ser Trp Gln Leu Cys Val 85 90 95 Gln Ile Trp Ser Lys Tyr Asn Leu Leu Asp Asp Val Gly Cys Gly Gly 100 105 110 Ala Arg Arg Val Ile Cys Glu Lys Glu Leu Asp 115 120 <210> 119 <211> 202 <212> PRT <213> Homo sapiens <400> 119 Met Glu Leu Trp Gly Ala Tyr Leu Leu Leu Cys Leu Phe Ser Leu Leu 1 5 10 15 Thr Gln Val Thr Thr Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val 20 25 30 Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys 35 40 45 Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln 50 55 60 Gln Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys 65 70 75 80 Cys Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu 85 90 95 Asp Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser 100 105 110 Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu 115 120 125 Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly Thr Trp 130 135 140 Val Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Thr Glu 145 150 155 160 Ile Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu Asn Cys Ala Val Leu 165 170 175 Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Gln 180 185 190 Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val 195 200 <210> 120 <211> 202 <212> PRT <213> Mus musculus <400> 120 Met Gly Phe Trp Gly Thr Tyr Leu Leu Phe Cys Leu Phe Ser Phe Leu 1 5 10 15 Ser Gln Leu Thr Ala Glu Ser Pro Thr Pro Lys Ala Lys Lys Ala Ala 20 25 30 Asn Ala Lys Lys Asp Leu Val Ser Ser Lys Met Phe Glu Glu Leu Lys 35 40 45 Asn Arg Met Asp Val Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Lys 50 55 60 Gln Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val Asn Leu Lys 65 70 75 80 Cys Leu Leu Ala Phe Thr Gln Pro Lys Thr Phe His Glu Ala Ser Glu 85 90 95 Asp Cys Ile Ser Gln Gly Gly Thr Leu Gly Thr Pro Gln Ser Glu Leu 100 105 110 Glu Asn Glu Ala Leu Phe Glu Tyr Ala Arg His Ser Val Gly Asn Asp 115 120 125 Ala Asn Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly Ala Trp 130 135 140 Val Asp Met Thr Gly Gly Leu Leu Ala Tyr Lys Asn Trp Glu Thr Glu 145 150 155 160 Ile Thr Thr Gln Pro Asp Gly Gly Lys Ala Glu Asn Cys Ala Ala Leu 165 170 175 Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Gln 180 185 190 Leu Pro Tyr Ile Cys Gln Phe Ala Ile Val 195 200 <210> 121 <211> 201 <212> PRT <213> Gallus gallus <400> 121 Met Ala Leu Arg Gly Ala Cys Leu Leu Leu Cys Leu Val Ser Leu Ala 1 5 10 15 His Ile Ser Val Gln Gln Asn Gly Lys Gly Arg Gln Lys Pro Ala Ala 20 25 30 Ser Lys Lys Asp Gly Val Ser Leu Lys Met Ile Glu Asp Leu Lys Ala 35 40 45 Met Ile Asp Asn Ile Ser Gln Glu Val Ala Leu Leu Lys Glu Lys Gln 50 55 60 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Ile His Leu Lys Cys 65 70 75 80 Phe Leu Ala Phe Ser Glu Ser Lys Thr Tyr His Glu Ala Ser Glu His 85 90 95 Cys Ile Ser Gln Gly Gly Thr Leu Gly Thr Pro Gln Gly Gly Glu Glu 100 105 110 Asn Asp Ala Leu Tyr Asp Tyr Met Arg Lys Ser Ile Gly Asn Glu Ala 115 120 125 Glu Ile Trp Leu Gly Leu Asn Asp Met Val Ala Glu Gly Lys Trp Val 130 135 140 Asp Met Thr Gly Ser Pro Ile Arg Tyr Lys Asn Trp Glu Thr Glu Ile 145 150 155 160 Thr Thr Gln Pro Asp Gly Gly Lys Leu Glu Asn Cys Ala Ala Leu Ser 165 170 175 Gly Val Ala Val Gly Lys Trp Phe Asp Lys Arg Cys Lys Glu Gln Leu 180 185 190 Pro Tyr Val Cys Gln Phe Met Ile Val 195 200 <210> 122 <211> 202 <212> PRT <213> Bos taurus <400> 122 Met Glu Leu Trp Gly Pro Cys Val Leu Leu Cys Leu Phe Ser Leu Leu 1 5 10 15 Thr Gln Val Thr Ala Glu Thr Pro Thr Pro Lys Ala Lys Lys Ala Ala 20 25 30 Asn Ala Lys Lys Asp Ala Val Ser Pro Lys Met Leu Glu Glu Leu Lys 35 40 45 Thr Gln Leu Asp Ser Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln 50 55 60 Gln Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys 65 70 75 80 Cys Phe Leu Ala Phe Val Gln Ala Lys Thr Phe His Glu Ala Ser Glu 85 90 95 Asp Cys Ile Ser Arg Gly Gly Thr Leu Gly Thr Pro Gln Thr Gly Ser 100 105 110 Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Ser Glu 115 120 125 Ala Glu Val Trp Leu Gly Phe Asn Asp Met Ala Ser Glu Gly Ser Trp 130 135 140 Val Asp Met Thr Gly Gly His Ile Ala Tyr Lys Asn Trp Glu Thr Glu 145 150 155 160 Ile Thr Ala Gln Pro Asp Gly Gly Lys Val Glu Asn Cys Ala Thr Leu 165 170 175 Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Lys 180 185 190 Leu Pro Tyr Val Cys Gln Phe Ala Ile Val 195 200 <210> 123 <211> 198 <212> PRT <213> Salmo salar <400> 123 Met Arg Val Ser Gly Val Arg Leu Leu Phe Cys Leu Leu Leu Leu Gly 1 5 10 15 Gln Ser Thr Phe Gln Gln Thr Ser Ser Lys Lys Lys Gly Gly Lys Lys 20 25 30 Asp Ala Glu Asn Asn Ala Ala Ile Glu Glu Leu Lys Lys Gln Ile Asp 35 40 45 Asn Ile Val Leu Glu Leu Asn Leu Leu Lys Glu Gln Gln Ala Leu Gln 50 55 60 Ser Val Cys Leu Lys Gly Ile Lys Ile Ile Gly Lys Cys Phe Leu Ala 65 70 75 80 Asp Thr Ala Lys Lys Ile Tyr His Thr Ala Tyr Asp Asp Cys Ile Ala 85 90 95 Lys Gly Gly Thr Ile Ser Thr Pro Leu Thr Gly Asp Glu Asn Asp Gln 100 105 110 Leu Val Asp Tyr Val Arg Arg Ser Ile Gly Pro Glu Glu His Ile Trp 115 120 125 Leu Gly Ile Asn Asp Met Val Thr Glu Gly Glu Trp Leu Asp Gln Ala 130 135 140 Gly Thr Asn Leu Arg Phe Lys Asn Trp Glu Thr Asp Ile Thr Asn Gln 145 150 155 160 Pro Asp Gly Gly Arg Thr His Asn Cys Ala Ile Leu Ser Thr Thr Ala 165 170 175 Asn Gly Lys Trp Phe Asp Glu Ser Cys Arg Val Glu Lys Ala Ser Val 180 185 190 Cys Glu Phe Asn Ile Val 195 <210> 124 <211> 198 <212> PRT <213> Silurana tropicalis <400> 124 Met Glu Tyr Arg Arg Ala Cys Ile Leu Leu Cys Leu Phe Cys Phe Val 1 5 10 15 Gln Val Thr Leu Gln Gln Asn Gly Lys Lys Asn Lys Gln Asn Asn Lys 20 25 30 Asp Val Val Ser Met Lys Met Tyr Glu Asp Leu Lys Lys Lys Val Gln 35 40 45 Asn Ile Glu Glu Asp Val Ile His Leu Lys Glu Gln Gln Ala Leu Gln 50 55 60 Thr Ile Cys Leu Lys Gly Met Lys Ile Tyr Asn Lys Cys Phe Leu Ala 65 70 75 80 Phe Asn Glu Leu Lys Thr Tyr His Gln Ala Ser Asp Val Cys Phe Ala 85 90 95 Gln Gly Gly Thr Leu Ser Thr Pro Glu Thr Gly Asp Glu Asn Asp Ser 100 105 110 Leu Tyr Asp Tyr Val Arg Lys Ser Ile Gly Ser Ser Ala Glu Ile Trp 115 120 125 Ile Gly Ile Asn Asp Met Ala Thr Glu Gly Thr Trp Leu Asp Leu Thr 130 135 140 Gly Ser Pro Ile Ser Phe Lys His Trp Glu Thr Glu Ile Thr Thr Gln 145 150 155 160 Pro Asp Gly Gly Lys Gln Glu Asn Cys Ala Ala Leu Ser Ala Ser Ala 165 170 175 Ile Gly Arg Trp Phe Asp Lys Asn Cys Lys Thr Glu Leu Pro Phe Val 180 185 190 Cys Gln Phe Ser Ile Val 195 <210> 125 <211> 223 <212> PRT <213> Danio rerio <400> 125 Met Arg Asp Asp Ser Asp Lys Val Pro Ser Leu Leu Thr Asp Tyr Ile 1 5 10 15 Leu Lys Gly Cys Thr Tyr Ala Glu Glu Lys Met Asp Leu Lys Ala Val 20 25 30 Lys Phe Leu Leu Cys Val Ile Cys Leu Val Lys Ser Ser Pro Glu Gln 35 40 45 Ser Leu Thr Lys Arg Lys Asn Gly Lys Lys Glu Ser Asn Ser Ala Ala 50 55 60 Ile Glu Glu Leu Lys Lys Gln Ile Asp Gln Ile Ile Gln Asp Leu Asn 65 70 75 80 Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Cys Leu Lys Gly Phe 85 90 95 Lys Ile Pro Gly Lys Cys Phe Leu Val Asp Thr Val Lys Lys Asp Phe 100 105 110 His Ser Ala Asn Asp Asp Cys Ile Ala Lys Gly Gly Ile Leu Ser Thr 115 120 125 Pro Met Ser Gly His Glu Asn Asp Gln Leu Gln Glu Tyr Val Gln Gln 130 135 140 Thr Val Gly Pro Glu Thr His Ile Trp Leu Gly Val Asn Asp Met Ile 145 150 155 160 Lys Glu Gly Glu Trp Ile Asp Leu Thr Gly Ser Pro Ile Arg Phe Lys 165 170 175 Asn Trp Glu Ser Glu Ile Thr His Gln Pro Asp Gly Gly Arg Thr His 180 185 190 Asn Cys Ala Val Leu Ser Ser Thr Ala Asn Gly Lys Trp Phe Asp Glu 195 200 205 Asp Cys Arg Gly Glu Lys Ala Ser Val Cys Gln Phe Asn Ile Val 210 215 220 <210> 126 <211> 197 <212> PRT <213> Bos taurus <400> 126 Met Ala Lys Asn Gly Leu Val Ile Tyr Ile Leu Val Ile Thr Leu Leu 1 5 10 15 Leu Asp Gln Thr Ser Cys His Ala Ser Lys Phe Lys Ala Arg Lys His 20 25 30 Ser Lys Arg Arg Val Lys Glu Lys Asp Gly Asp Leu Lys Thr Gln Val 35 40 45 Glu Lys Leu Trp Arg Glu Val Asn Ala Leu Lys Glu Met Gln Ala Leu 50 55 60 Gln Thr Val Cys Leu Arg Gly Thr Lys Phe His Lys Lys Cys Tyr Leu 65 70 75 80 Ala Ala Glu Gly Leu Lys His Phe His Glu Ala Asn Glu Asp Cys Ile 85 90 95 Ser Lys Gly Gly Thr Leu Val Val Pro Arg Ser Ala Asp Glu Ile Asn 100 105 110 Ala Leu Arg Asp Tyr Gly Lys Arg Ser Leu Pro Gly Val Asn Asp Phe 115 120 125 Trp Leu Gly Ile Asn Asp Met Val Ala Glu Gly Lys Phe Val Asp Ile 130 135 140 Asn Gly Leu Ala Ile Ser Phe Leu Asn Trp Asp Gln Ala Gln Pro Asn 145 150 155 160 Gly Gly Lys Arg Glu Asn Cys Ala Leu Phe Ser Gln Ser Ala Gln Gly 165 170 175 Lys Trp Ser Asp Glu Ala Cys His Ser Ser Lys Arg Tyr Ile Cys Glu 180 185 190 Phe Thr Ile Pro Gln 195 <210> 127 <211> 166 <212> PRT <213> Carcharhinus springeri <400> 127 Ser Lys Pro Ser Lys Ser Gly Lys Gly Lys Asp Asp Leu Arg Asn Glu 1 5 10 15 Ile Asp Lys Leu Trp Arg Glu Val Asn Ser Leu Lys Glu Met Gln Ala 20 25 30 Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Ile His Lys Lys Cys Tyr 35 40 45 Leu Ala Ser Arg Gly Ser Lys Ser Tyr His Ala Ala Asn Glu Asp Cys 50 55 60 Ile Ala Gln Gly Gly Thr Leu Ser Ile Pro Arg Ser Ser Asp Glu Gly 65 70 75 80 Asn Ser Leu Arg Ser Tyr Ala Lys Lys Ser Leu Val Gly Ala Arg Asp 85 90 95 Phe Trp Ile Gly Val Asn Asp Met Thr Thr Glu Gly Lys Phe Val Asp 100 105 110 Val Asn Gly Leu Pro Ile Thr Tyr Phe Asn Trp Asp Arg Ser Lys Pro 115 120 125 Val Gly Gly Thr Arg Glu Asn Cys Val Ala Ala Ser Thr Ser Gly Gln 130 135 140 Gly Lys Trp Ser Asp Asp Val Cys Arg Ser Glu Lys Arg Tyr Ile Cys 145 150 155 160 Glu Tyr Leu Ile Pro Val 165 <210> 128 <211> 204 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (7)..(8) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (15)..(15) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (17)..(17) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (22)..(26) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (31)..(35) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (39)..(39) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (42)..(42) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (45)..(45) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (50)..(50) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (54)..(54) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (60)..(60) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (80)..(80) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (89)..(89) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (111)..(111) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (119)..(120) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (122)..(122) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (124)..(124) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (131)..(131) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (136)..(136) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (144)..(144) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (152)..(152) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (154)..(154) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (164)..(164) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (171)..(171) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (179)..(180) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (189)..(189) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (201)..(201) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (204)..(204) <223> Xaa can be any naturally occurring amino acid <400> 128 Met Glu Leu Trp Gly Ala Xaa Xaa Leu Leu Cys Leu Phe Ser Xaa Leu 1 5 10 15 Xaa Gln Val Thr Ala Xaa Xaa Xaa Xaa Xaa Lys Ala Lys Lys Xaa Xaa 20 25 30 Xaa Xaa Xaa Lys Lys Asp Xaa Val Ser Xaa Lys Met Xaa Glu Glu Leu 35 40 45 Lys Xaa Gln Ile Asp Xaa Leu Ala Gln Glu Val Xaa Leu Leu Lys Glu 50 55 60 Gln Gln Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Ile His Xaa 65 70 75 80 Lys Cys Phe Leu Ala Phe Thr Gln Xaa Lys Thr Phe His Glu Ala Ser 85 90 95 Glu Asp Cys Ile Ser Gln Gly Gly Thr Leu Ser Thr Pro Gln Xaa Gly 100 105 110 Asp Glu Asn Asp Ala Leu Xaa Xaa Tyr Xaa Arg Xaa Ser Val Gly Asn 115 120 125 Glu Ala Xaa Ile Trp Leu Gly Xaa Asn Asp Met Ala Ala Glu Gly Xaa 130 135 140 Trp Val Asp Met Thr Gly Ser Xaa Ile Xaa Tyr Lys Asn Trp Glu Thr 145 150 155 160 Glu Ile Thr Xaa Gln Pro Asp Gly Gly Lys Xaa Glu Asn Cys Ala Ala 165 170 175 Leu Ser Xaa Xaa Ala Asn Gly Lys Trp Phe Asp Lys Xaa Cys Arg Asp 180 185 190 Glu Leu Pro Tyr Val Cys Gln Phe Xaa Ile Val Xaa 195 200 <210> 129 <211> 240 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (34)..(35) <223> n is a, c, g, or t <220> <221> misc_feature <222> (36)..(36) <223> k is g or t <220> <221> misc_feature <222> (37)..(38) <223> n is a, c, g, or t <220> <221> misc_feature <222> (39)..(39) <223> k is g or t <220> <221> misc_feature <222> (40)..(41) <223> n is a, c, g, or t <220> <221> misc_feature <222> (42)..(42) <223> k is g or t <220> <221> misc_feature <222> (43)..(44) <223> n is a, c, g, or t <220> <221> misc_feature <222> (45)..(45) <223> k is g or t <220> <221> misc_feature <222> (46)..(47) <223> n is a, c, g, or t <220> <221> misc_feature <222> (48)..(48) <223> k is g or t <220> <221> misc_feature <222> (49)..(50) <223> n is a, c, g, or t <220> <221> misc_feature <222> (51)..(51) <223> k is g or t <220> <221> misc_feature <222> (52)..(53) <223> n is a, c, g, or t <220> <221> misc_feature <222> (54)..(54) <223> k is g or t <400> 129 gaggccgaga tctggctggg cctgaacgac atgnnknnkn nknnknnknn knnktgggtg 60 gatatgactg gcgcccgcat cgcctacaag aactgggaaa ctgagatcac cgcccaacct 120 gatggcggcg caaccgagaa ctgcgcggtc ctgtctggcg ccgccaacgg caagtggttc 180 gacaagcgct gcagggatca attgccctac atctgccagt tcgggatcgt ggcggccgca 240 <210> 130 <211> 80 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (12)..(18) <223> Xaa can be any naturally occurring amino acid <400> 130 Glu Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Xaa Xaa Xaa Xaa Xaa 1 5 10 15 Xaa Xaa Trp Val Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp 20 25 30 Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly Gly Ala Thr Glu Asn Cys 35 40 45 Ala Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys 50 55 60 Arg Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val Ala Ala Ala 65 70 75 80 <210> 131 <211> 137 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 131 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp 20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu 35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala 50 55 60 Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile 85 90 95 Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu Asn Cys Ala Val Leu Ser 100 105 110 Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Gln Leu 115 120 125 Pro Tyr Ile Cys Gln Phe Gly Ile Val 130 135 <210> 132 <211> 414 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 132 caggccctcc agacggtctg cctgaagggg accaaggtgc acatgaaatg ctttctggcc 60 ttcacccaga cgaagacctt ccacgaggcc agcgaggact gcatctcgcg cgggggcacc 120 ctgagcaccc ctcagactgg ctcggagaac gacgccctgt atgagtacct gcgccagagc 180 gtgggcaacg aggccgagat ctggctgggc ctcaacgaca tggcggccga gggcacctgg 240 gtggacatga ctggcgcgcg tatcgcctac aagaactggg agactgagat caccgcgcaa 300 cccgatggcg gcaagaccga gaactgcgcg gtcctgtcag gcgcggccaa cggcaagtgg 360 ttcgacaagc gctgcaggga tcaattgccc tacatctgcc agttcgggat cgtg 414 <210> 133 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 133 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp 20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu 35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala 50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Ala Leu Thr Asn Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Pro Pro Gly 85 90 95 Pro His His Pro Met Gly Gly Phe Gly Val Phe Gly Glu Asn Cys Ala 100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val 130 135 140 <210> 134 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 134 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp 20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu 35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala 50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Ala Leu Thr Asn Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Pro Pro Pro 85 90 95 Pro His His Pro Met Gly Gly Phe Gly Val Phe Gly Glu Asn Cys Ala 100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val 130 135 140 <210> 135 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 135 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp 20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu 35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala 50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Ala Leu Thr Asn Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Arg Pro Ala 85 90 95 Leu Val Gln Pro Arg Gly Gly Phe Gly Val Phe Gly Glu Asn Cys Ala 100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val 130 135 140 <210> 136 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 136 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp 20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu 35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala 50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Ala Leu Thr Asn Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Arg Pro Pro 85 90 95 Leu Tyr Gln Pro Gly Gly Gly Trp Gly Val Phe Gly Glu Asn Cys Ala 100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val 130 135 140 <210> 137 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 137 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp 20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu 35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala 50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Ala Leu Thr Asn Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Arg Thr Pro 85 90 95 Pro Trp Gln Pro Glu Gly Gly Phe Gly Tyr Phe Gly Glu Asn Cys Ala 100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val 130 135 140 <210> 138 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 138 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp 20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu 35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala 50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Leu Arg Thr Asp Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile 85 90 95 Thr Ala Gln Pro Asp Gly Gly Phe Gly Val Phe Gly Glu Asn Cys Ala 100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val 130 135 140 <210> 139 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 139 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp 20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu 35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala 50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Leu Arg Thr Asn Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile 85 90 95 Thr Ala Gln Pro Asp Gly Gly Phe Gly Val Phe Gly Glu Asn Cys Ala 100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val 130 135 140 <210> 140 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 140 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp 20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu 35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala 50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Ala Leu Thr Asn Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Arg Pro Pro 85 90 95 Leu Tyr Gln Pro Gly Gly Gly Phe Gly Val Phe Gly Glu Asn Cys Ala 100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val 130 135 140 <210> 141 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 141 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp 20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu 35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala 50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Ala Leu Thr Asn Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Ala Thr Glu Ile 85 90 95 Thr Ala Gln Pro Asp Gly Gly Phe Gly Val Phe Gly Glu Asn Cys Ala 100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val 130 135 140 <210> 142 <211> 181 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 142 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr 35 40 45 Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys Phe Leu Ala Phe 50 55 60 Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp Cys Ile Ser Arg 65 70 75 80 Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu Asn Asp Ala Leu 85 90 95 Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala Glu Ile Trp Leu 100 105 110 Gly Leu Asn Asp Met Ala Ala Glu Gly Thr Trp Val Asp Met Thr Gly 115 120 125 Ala Arg Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro 130 135 140 Asp Gly Gly Lys Thr Glu Asn Cys Ala Val Leu Ser Gly Ala Ala Asn 145 150 155 160 Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Gln Leu Pro Tyr Ile Cys 165 170 175 Gln Phe Gly Ile Val 180 <210> 143 <211> 546 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 143 gagccaccaa cccagaagcc caagaagatt gtaaatgcca agaaagatgt tgtgaacaca 60 aagatgtttg aggagctcaa gagccgtctg gacaccctgg cccaggaggt ggccctgctg 120 aaggagcagc aggccctgca gacggtctgc ctgaagggga ccaaggtgca catgaaatgc 180 tttctggcct tcacccagac gaagaccttc cacgaggcca gcgaggactg catctcgcgc 240 gggggcaccc tgagcacccc tcagactggc tcggagaacg acgccctgta tgagtacctg 300 cgccagagcg tgggcaacga ggccgagatc tggctgggcc tcaacgacat ggcggccgag 360 ggcacctggg tggacatgac cggcgcccgc atcgcctaca agaactggga gactgagatc 420 accgcgcaac ccgatggcgg caagaccgag aactgcgcgg tcctgtcagg cgcggccaac 480 ggcaagtggt tcgacaagcg ctgccgcgat cagctgccct acatctgcca gttcgggatc 540 gtgtag 546 <210> 144 <211> 546 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 144 gagtcaccca ctcccaaggc caagaaggct gcaaatgcca agaaagattt ggtgagctca 60 aagatgttcg aggagctcaa gaacaggatg gatgtcctgg cccaggaggt ggccctgctg 120 aaggagaagc aggccttaca gactgtgtgc ctgaagggca ccaaggtgaa cttgaagtgc 180 ctcctggcct tcacccaacc gaagaccttc catgaggcga gcgaggactg catctcgcaa 240 gggggcacgc tgggcacccc gcagtcagag ctagagaacg aggcgctgtt cgagtacgcg 300 cgccacagcg tgggcaacga tgcgaacatc tggctgggcc tcaacgacat ggccgcggaa 360 ggcgcctggg tggacatgac cggcggcctc ctggcctaca agaactggga gacggagatc 420 acgacgcaac ccgacggcgg caaagccgag aactgcgccg ccctgtctgg cgcagccaac 480 ggcaagtggt tcgacaagcg atgccgcgat cagttgccct acatctgcca gtttgccatt 540 gtgtag 546 <210> 145 <211> 181 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 145 Glu Ser Pro Thr Pro Lys Ala Lys Lys Ala Ala Asn Ala Lys Lys Asp 1 5 10 15 Leu Val Ser Ser Lys Met Phe Glu Glu Leu Lys Asn Arg Met Asp Val 20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Lys Gln Ala Leu Gln Thr 35 40 45 Val Cys Leu Lys Gly Thr Lys Val Asn Leu Lys Cys Leu Leu Ala Phe 50 55 60 Thr Gln Pro Lys Thr Phe His Glu Ala Ser Glu Asp Cys Ile Ser Gln 65 70 75 80 Gly Gly Thr Leu Gly Thr Pro Gln Ser Glu Leu Glu Asn Glu Ala Leu 85 90 95 Phe Glu Tyr Ala Arg His Ser Val Gly Asn Asp Ala Asn Ile Trp Leu 100 105 110 Gly Leu Asn Asp Met Ala Ala Glu Gly Ala Trp Val Asp Met Thr Gly 115 120 125 Gly Leu Leu Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Thr Gln Pro 130 135 140 Asp Gly Gly Lys Ala Glu Asn Cys Ala Ala Leu Ser Gly Ala Ala Asn 145 150 155 160 Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Gln Leu Pro Tyr Ile Cys 165 170 175 Gln Phe Ala Ile Val 180 <210> 146 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 146 Lys Met Phe Glu Glu Leu Lys Ser Gln Leu Asp Ser Leu Ala Gln Glu 1 5 10 15 Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Cys Leu 20 25 30 <210> 147 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 147 Lys Met Phe Glu Glu Leu Lys Ser Gln Val Asp Ser Leu Ala Gln Glu 1 5 10 15 Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Cys Leu 20 25 30 <210> 148 <211> 32 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 148 Ser Lys Met Phe Glu Glu Leu Lys Asn Arg Met Asp Val Leu Ala Gln 1 5 10 15 Glu Val Ala Leu Leu Lys Glu Lys Gln Ala Leu Gln Thr Val Cys Leu 20 25 30 <210> 149 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 149 Lys Met Phe Glu Glu Leu Lys Asn Arg Leu Asp Val Leu Ala Gln Glu 1 5 10 15 Val Ala Leu Leu Lys Glu Lys Gln Ala Leu Gln Thr Val Cys Leu 20 25 30 <210> 150 <211> 4779 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 150 gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60 cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120 tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180 aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240 ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300 ctgaagatca gttgggtgct cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360 tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420 tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480 actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540 gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600 acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660 gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720 acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780 gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840 ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900 gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960 cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020 agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080 catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140 tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200 cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260 gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320 taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc 1380 ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440 tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500 ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560 cgtgcataca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620 agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680 gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740 atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800 gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860 gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920 ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980 cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040 cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100 acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160 cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220 accatgatta cgccaagctt tggagccttt tttttggaga ttttcaacgt gaaaaaatta 2280 ttattcgcaa ttcctttagt tgttcctttc tatgcggccc agccggccat ggccgccctc 2340 cagacggtct gcctgaaggg gaccaaggtg cacatgaaat gctttctggc cttcacccag 2400 acgaagacct tccacgaggc cagcgaggac tgcatctcgc gcgggggcac cctgagcacc 2460 cctcagactg gctcggagaa cgacgccctg tatgagtacc tgcgccagag cgtgggcaac 2520 gaggccgaga tctaagtgac gatatcctga cctaaggtac ctaagtgacg atatcctgac 2580 ctaactgcag ggatcaattg ccctacatct gccagttcgg gatcgtggcg gccgcaggtg 2640 cgccggtgcc gtatccggat ccgctggaac cgcgtgccgc atagactgtt gaaagttgtt 2700 tagcaaaacc tcatacagaa aattcattta ctaacgtctg gaaagacgac aaaactttag 2760 atcgttacgc taactatgag ggctgtctgt ggaatgctac aggcgttgtg gtttgtactg 2820 gtgacgaaac tcagtgttac ggtacatggg ttcctattgg gcttgctatc cctgaaaatg 2880 agggtggtgg ctctgagggt ggcggttctg agggtggcgg ttctgagggt ggcggtacta 2940 aacctcctga gtacggtgat acacctattc cgggctatac ttatatcaac cctctcgacg 3000 gcacttatcc gcctggtact gagcaaaacc ccgctaatcc taatccttct cttgaggagt 3060 ctcagcctct taatactttc atgtttcaga ataataggtt ccgaaatagg cagggtgcat 3120 taactgttta tacgggcact gttactcaag gcactgaccc cgttaaaact tattaccagt 3180 acactcctgt atcatcaaaa gccatgtatg acgcttactg gaacggtaaa ttcagagact 3240 gcgctttcca ttctggcttt aatgaggatc cattcgtttg tgaatatcaa ggccaatcgt 3300 ctgacctgcc tcaacctcct gtcaatgctg gcggcggctc tggtggtggt tctggtggcg 3360 gctctgaggg tggcggctct gagggtggcg gttctgaggg tggcggctct gagggtggcg 3420 gttccggtgg cggctccggt tccggtgatt ttgattatga aaaaatggca aacgctaata 3480 agggggctat gaccgaaaat gccgatgaaa acgcgctaca gtctgacgct aaaggcaaac 3540 ttgattctgt cgctactgat tacggtgctg ctatcgatgg tttcattggt gacgtttccg 3600 gccttgctaa tggtaatggt gctactggtg attttgctgg ctctaattcc caaatggctc 3660 aagtcggtga cggtgataat tcacctttaa tgaataattt ccgtcaatat ttaccttctt 3720 tgcctcagtc ggttgaatgt cgcccttatg tctttggcgc tggtaaacca tatgaatttt 3780 ctattgattg tgacaaaata aacttattcc gtggtgtctt tgcgtttctt ttatatgttg 3840 ccacctttat gtatgtattt tcgacgtttg ctaacatact gcgtaataag gagtcttaat 3900 aagaattcac tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg cgttacccaa 3960 cttaatcgcc ttgcagcaca tccccctttc gccagctggc gtaatagcga agaggcccgc 4020 accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg aatggcgcct gatgcggtat 4080 tttctcctta cgcatctgtg cggtatttca caccgcatac gtcaaagcaa ccatagtacg 4140 cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta 4200 cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt 4260 tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc cgatttagtg 4320 ctttacggca cctcgacccc aaaaaacttg atttgggtga tggttcacgt agtgggccat 4380 cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac 4440 tcttgttcca aactggaaca acactcaacc ctatctcggg ctattctttt gatttataag 4500 ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg 4560 cgaattttaa caaaatatta acgtttacaa ttttatggtg cagtctcagt acaatctgct 4620 ctgatgccgc atagttaagc cagccccgac acccgccaac acccgctgac gcgccctgac 4680 gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca 4740 tgtgtcagag gttttcaccg tcatcaccga aacgcgcga 4779 <210> 151 <211> 5747 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 151 tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 tccgctcatg aattaattct tagaaaaact catcgagcat caaatgaaac tgcaatttat 600 tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat gaaggagaaa 660 actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg attccgactc 720 gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta tcaagtgaga 780 aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc atttctttcc 840 agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca tcaaccaaac 900 cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac 960 aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat 1020 tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag 1080 tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca 1140 taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac 1200 ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg 1260 tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa tcagcatcca 1320 tgttggaatt taatcgcggc ctagagcaag acgtttcccg ttgaatatgg ctcataacac 1380 cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa 1440 cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 1500 gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 1560 gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 1620 agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 1680 aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 1740 agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 1800 cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 1860 accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 1920 aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 1980 ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 2040 cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 2100 gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 2160 tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 2220 agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg 2280 tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta 2340 caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 2400 ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 2460 gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 2520 gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 2580 gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 2640 aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 2700 ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 2760 acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 2820 ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 2880 tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 2940 tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 3000 cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 3060 gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 3120 ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggggccgc 3180 catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa 3240 ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc 3300 gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac 3360 gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca 3420 ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgagatc ccggtgccta 3480 atgagtgagc taacttacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa 3540 cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 3600 tgggcgccag ggtggttttt cttttcacca gtgagacggg caacagctga ttgcccttca 3660 ccgcctggcc ctgagagagt tgcagcaagc ggtccacgct ggtttgcccc agcaggcgaa 3720 aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg gtatcgtcgt 3780 atcccactac cgagatatcc gcaccaacgc gcagcccgga ctcggtaatg gcgcgcattg 3840 cgcccagcgc catctgatcg ttggcaacca gcatcgcagt gggaacgatg ccctcattca 3900 gcatttgcat ggtttgttga aaaccggaca tggcactcca gtcgccttcc cgttccgcta 3960 tcggctgaat ttgattgcga gtgagatatt tatgccagcc agccagacgc agacgcgccg 4020 agacagaact taatgggccc gctaacagcg cgatttgctg gtgacccaat gcgaccagat 4080 gctccacgcc cagtcgcgta ccgtcttcat gggagaaaat aatactgttg atgggtgtct 4140 ggtcagagac atcaagaaat aacgccggaa cattagtgca ggcagcttcc acagcaatgg 4200 catcctggtc atccagcgga tagttaatga tcagcccact gacgcgttgc gcgagaagat 4260 tgtgcaccgc cgctttacag gcttcgacgc cgcttcgttc taccatcgac accaccacgc 4320 tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac ggcgcgtgca 4380 gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc agttgttgtg 4440 ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt tcccgcgttt 4500 tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa gagacaccgg 4560 catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg aattgactct 4620 cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg gtgtccggga 4680 tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg 4740 ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc 4800 ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 4860 cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 4920 gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgggatctc gatcccgcga 4980 aattaatacg actcactata ggggaattgt gagcggataa caattcccct ctagaaataa 5040 ttttgtttaa ctttaagaag gagatataca tatgaaatac cttcttccga ctgctgctgc 5100 tggtctttta ctgctggctg ctcagccggc tatggctgct ggtggtggtt ctgccctcca 5160 gacggtctgc ctgaagggga ccaaggtgca catgaaatgc tttctggcct tcacccagac 5220 gaagaccttc cacgaggcca gcgaggactg catctcgcgc gggggcaccc tgagcacccc 5280 tcagactggc tcggagaacg acgccctgta tgagtacctg cgccagagcg tgggcaacga 5340 ggccgagatc tggctgggcc tcaacgacat ggcggccgag ggcacctggg tggacatgac 5400 cggtacccgc atcgcctaca agaactggga gactgagatc accgcgcaac ccgatggcgg 5460 caagaccgag aactgcgcgg tcctgtcagg cgcggccaac ggcaagtggt tcgacaagcg 5520 ctgcagggat caattgccct acatctgcca gttcgggatc gtgcaccacc accaccacca 5580 ctaactcgag caccaccacc accaccactg agatccggct gctaacaaag cccgaaagga 5640 agctgagttg gctgctgcca ccgctgagca ataactagca taaccccttg gggcctctaa 5700 acgggtcttg aggggttttt tgctgaaagg aggaactata tccggat 5747 <210> 152 <211> 10975 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 152 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta tcatatgcca agtccgcccc ctattgacgt caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttac gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacaccaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa taaccccgcc ccgttgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 600 gtcagatcac tagaagctgg gtaccagctg ctagcgttta aacttaagct tagcgcagag 660 gcttggggca gccgagcggc agccaggccc cggcccgggc ctcggttcca gaagggagag 720 gagcccgcca aggcgcgcaa gagagcgggc tgcctcgcag tccgagccgg agagggagcg 780 cgagccgcgc cggccccgga cggcctccga aaccatggag ctgtgggggg cctacctgct 840 gctgtgcctg ttctccctgc tgacccaggt gaccaccgag ccaccaaccc agaagcccaa 900 gaagattgta aatgccaaga aagatgttgt gaacacaaag atgtttgagg agctcaagag 960 ccgtctggac accctggccc aggaggtggc cctgctgaag gagcagcagg ccctccagac 1020 ggtctgcctg aaggggacca aggtgcacat gaaatgcttt ctggccttca cccagacgaa 1080 gaccttccac gaggccagcg aggactgcat ctcgcgcggg ggcaccctga gcacccctca 1140 gactggctcg gagaacgacg ccctgtatga gtacctgcgc cagagcgtgg gcaacgaggc 1200 cgagatctgg ctgggcctca acgacatggc ggccgagggc acctgggtgg acatgaccgg 1260 tacccgcatc gcctacaaga actgggagac tgagatcacc gcgcaacccg atggcggcaa 1320 gaccgagaac tgcgcggtcc tgtcaggcgc ggccaacggc aagtggttcg acaagcgctg 1380 cagggatcaa ttgccctaca tctgccagtt cgggatcgtg caccaccacc accaccacta 1440 actcgaggcc ggcaaggccg gatccagaca tgataagata cattgatgag tttggacaaa 1500 ccacaactag aatgcagtga aaaaaatgct ttatttgtga aatttgtgat gctattgctt 1560 tatttgtaac cattataagc tgcaataaac aagttaacaa caagaattgc attcatttta 1620 tgtttcaggt tcagggggag gtgtgggagg ttttttaaag caagtaaaac ctctacaaat 1680 gtggtatggc tgattatgat ccggctgcct cgcgcgtttc ggtgatgacg gtgaaaacct 1740 ctgacacatg cagctcccgg agacggtcac agcttgtctg taagcggatg ccgggagcag 1800 acaagcccgt caggcgtcag cgggtgttgg cgggtgtcgg ggcgcagcca tgaggtcgac 1860 tctagaggat cgatgccccg ccccggacga actaaacctg actacgacat ctctgcccct 1920 tcttcgcggg gcagtgcatg taatcccttc agttggttgg tacaacttgc caactgggcc 1980 ctgttccaca tgtgacacgg ggggggacca aacacaaagg ggttctctga ctgtagttga 2040 catccttata aatggatgtg cacatttgcc aacactgagt ggctttcatc ctggagcaga 2100 ctttgcagtc tgtggactgc aacacaacat tgcctttatg tgtaactctt ggctgaagct 2160 cttacaccaa tgctggggga catgtacctc ccaggggccc aggaagacta cgggaggcta 2220 caccaacgtc aatcagaggg gcctgtgtag ctaccgataa gcggaccctc aagagggcat 2280 tagcaatagt gtttataagg cccccttgtt aaccctaaac gggtagcata tgcttcccgg 2340 gtagtagtat atactatcca gactaaccct aattcaatag catatgttac ccaacgggaa 2400 gcatatgcta tcgaattagg gttagtaaaa gggtcctaag gaacagcgat atctcccacc 2460 ccatgagctg tcacggtttt atttacatgg ggtcaggatt ccacgagggt agtgaaccat 2520 tttagtcaca agggcagtgg ctgaagatca aggagcgggc agtgaactct cctgaatctt 2580 cgcctgcttc ttcattctcc ttcgtttagc taatagaata actgctgagt tgtgaacagt 2640 aaggtgtatg tgaggtgctc gaaaacaagg tttcaggtga cgcccccaga ataaaatttg 2700 gacggggggt tcagtggtgg cattgtgcta tgacaccaat ataaccctca caaacccctt 2760 gggcaataaa tactagtgta ggaatgaaac attctgaata tctttaacaa tagaaatcca 2820 tggggtgggg acaagccgta aagactggat gtccatctca cacgaattta tggctatggg 2880 caacacataa tcctagtgca atatgatact ggggttatta agatgtgtcc caggcaggga 2940 ccaagacagg tgaaccatgt tgttacactc tatttgtaac aaggggaaag agagtggacg 3000 ccgacagcag cggactccac tggttgtctc taacaccccc gaaaattaaa cggggctcca 3060 cgccaatggg gcccataaac aaagacaagt ggccactctt ttttttgaaa ttgtggagtg 3120 ggggcacgcg tcagccccca cacgccgccc tgcggttttg gactgtaaaa taagggtgta 3180 ataacttggc tgattgtaac cccgctaacc actgcggtca aaccacttgc ccacaaaacc 3240 actaatggca ccccggggaa tacctgcata agtaggtggg cgggccaaga taggggcgcg 3300 attgctgcga tctggaggac aaattacaca cacttgcgcc tgagcgccaa gcacagggtt 3360 gttggtcctc atattcacga ggtcgctgag agcacggtgg gctaatgttg ccatgggtag 3420 catatactac ccaaatatct ggatagcata tgctatccta atctatatct gggtagcata 3480 ggctatccta atctatatct gggtagcata tgctatccta atctatatct gggtagtata 3540 tgctatccta atttatatct gggtagcata ggctatccta atctatatct gggtagcata 3600 tgctatccta atctatatct gggtagtata tgctatccta atctgtatcc gggtagcata 3660 tgctatccta atagagatta gggtagtata tgctatccta atttatatct gggtagcata 3720 tactacccaa atatctggat agcatatgct atcctaatct atatctgggt agcatatgct 3780 atcctaatct atatctgggt agcataggct atcctaatct atatctgggt agcatatgct 3840 atcctaatct atatctgggt agtatatgct atcctaattt atatctgggt agcataggct 3900 atcctaatct atatctgggt agcatatgct atcctaatct atatctgggt agtatatgct 3960 atcctaatct gtatccgggt agcatatgct atcctcatgc atatacagtc agcatatgat 4020 acccagtagt agagtgggag tgctatcctt tgcatatgcc gccacctccc aagggggcgt 4080 gaattttcgc tgcttgtcct tttcctgctg gttgctccca ttcttaggtg aatttaagga 4140 ggccaggcta aagccgtcgc atgtctgatt gctcaccagg taaatgtcgc taatgttttc 4200 caacgcgaga aggtgttgag cgcggagctg agtgacgtga caacatgggt atgccgaatt 4260 gccccatgtt gggaggacga aaatggtgac aagacagatg gccagaaata caccaacagc 4320 acgcatgatg tctactgggg atttattctt tagtgcgggg gaatacacgg cttttaatac 4380 gattgagggc gtctcctaac aagttacatc actcctgccc ttcctcaccc tcatctccat 4440 cacctccttc atctccgtca tctccgtcat caccctccgc ggcagcccct tccaccatag 4500 gtggaaacca gggaggcaaa tctactccat cgtcaaagct gcacacagtc accctgatat 4560 tgcaggtagg agcgggcttt gtcataacaa ggtccttaat cgcatccttc aaaacctcag 4620 caaatatatg agtttgtaaa aagaccatga aataacagac aatggactcc cttagcgggc 4680 caggttgtgg gccgggtcca ggggccattc caaaggggag acgactcaat ggtgtaagac 4740 gacattgtgg aatagcaagg gcagttcctc gccttaggtt gtaaagggag gtcttactac 4800 ctccatatac gaacacaccg gcgacccaag ttccttcgtc ggtagtcctt tctacgtgac 4860 tcctagccag gagagctctt aaaccttctg caatgttctc aaatttcggg ttggaacctc 4920 cttgaccacg atgctttcca aaccaccctc cttttttgcg cctgcctcca tcaccctgac 4980 cccggggtcc agtgcttggg ccttctcctg ggtcatctgc ggggccctgc tctatcgctc 5040 ccgggggcac gtcaggctca ccatctgggc caccttcttg gtggtattca aaataatcgg 5100 cttcccctac agggtggaaa aatggccttc tacctggagg gggcctgcgc ggtggagacc 5160 cggatgatga tgactgacta ctgggactcc tgggcctctt ttctccacgt ccacgacctc 5220 tccccctggc tctttcacga cttccccccc tggctctttc acgtcctcta ccccggcggc 5280 ctccactacc tcctcgaccc cggcctccac tacctcctcg accccggcct ccactgcctc 5340 ctcgaccccg gcctccacct cctgctcctg cccctcctgc tcctgcccct cctcctgctc 5400 ctgcccctcc tgcccctcct gctcctgccc ctcctgcccc tcctgctcct gcccctcctg 5460 cccctcctgc tcctgcccct cctgcccctc ctcctgctcc tgcccctcct gcccctcctc 5520 ctgctcctgc ccctcctgcc cctcctgctc ctgcccctcc tgcccctcct gctcctgccc 5580 ctcctgcccc tcctgctcct gcccctcctg ctcctgcccc tcctgctcct gcccctcctg 5640 ctcctgcccc tcctgcccct cctgcccctc ctcctgctcc tgcccctcct gctcctgccc 5700 ctcctgcccc tcctgcccct cctgctcctg cccctcctcc tgctcctgcc cctcctgccc 5760 ctcctgcccc tcctcctgct cctgcccctc ctgcccctcc tcctgctcct gcccctcctc 5820 ctgctcctgc ccctcctgcc cctcctgccc ctcctcctgc tcctgcccct cctgcccctc 5880 ctcctgctcc tgcccctcct cctgctcctg cccctcctgc ccctcctgcc cctcctcctg 5940 ctcctgcccc tcctcctgct cctgcccctc ctgcccctcc tgcccctcct gcccctcctc 6000 ctgctcctgc ccctcctcct gctcctgccc ctcctgctcc tgcccctccc gctcctgctc 6060 ctgctcctgt tccaccgtgg gtccctttgc agccaatgca acttggacgt ttttggggtc 6120 tccggacacc atctctatgt cttggccctg atcctgagcc gcccggggct cctggtcttc 6180 cgcctcctcg tcctcgtcct cttccccgtc ctcgtccatg gttatcaccc cctcttcttt 6240 gaggtccact gccgccggag ccttctggtc cagatgtgtc tcccttctct cctaggccat 6300 ttccaggtcc tgtacctggc ccctcgtcag acatgattca cactaaaaga gatcaataga 6360 catctttatt agacgacgct cagtgaatac agggagtgca gactcctgcc ccctccaaca 6420 gcccccccac cctcatcccc ttcatggtcg ctgtcagaca gatccaggtc tgaaaattcc 6480 ccatcctccg aaccatcctc gtcctcatca ccaattactc gcagcccgga aaactcccgc 6540 tgaacatcct caagatttgc gtcctgagcc tcaagccagg cctcaaattc ctcgtccccc 6600 tttttgctgg acggtaggga tggggattct cgggacccct cctcttcctc ttcaaggtca 6660 ccagacagag atgctactgg ggcaacggaa gaaaagctgg gtgcggcctg tgaggatcag 6720 cttatcgatg ataagctgtc aaacatgaga attcttgaag acgaaagggc ctcgtgatac 6780 gcctattttt ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt 6840 ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt 6900 atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta 6960 tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg 7020 tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac 7080 gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg 7140 aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc 7200 gtgttgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg 7260 ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat 7320 gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg 7380 gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg 7440 atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc 7500 ctgcagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt 7560 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct 7620 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc 7680 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca 7740 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct 7800 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt 7860 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga 7920 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca 7980 aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac 8040 caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg 8100 taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag 8160 gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac 8220 cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt 8280 taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg 8340 agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc 8400 ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc 8460 gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc 8520 acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa 8580 acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggccttga agctgtccct 8640 gatggtcgtc atctacctgc ctggacagca tggcctgcaa cgcgggcatc ccgatgccgc 8700 cggaagcgag aagaatcata atggggaagg ccatccagcc tcgcgtcgcg aacgccagca 8760 agacgtagcc cagcgcgtcg gccccgagat gcgccgcgtg cggctgctgg agatggcgga 8820 cgcgatggat atgttctgcc aagggttggt ttgcgcattc acagttctcc gcaagaattg 8880 attggctcca attcttggag tggtgaatcc gttagcgagg tgccgccctg cttcatcccc 8940 gtggcccgtt gctcgcgttt gctggcggtg tccccggaag aaatatattt gcatgtcttt 9000 agttctatga tgacacaaac cccgcccagc gtcttgtcat tggcgaattc gaacacgcag 9060 atgcagtcgg ggcggcgcgg tccgaggtcc acttcgcata ttaaggtgac gcgtgtggcc 9120 tcgaacaccg agcgaccctg cagcgacccg cttaacagcg tcaacagcgt gccgcagatc 9180 ccggggggca atgagatatg aaaaagcctg aactcaccgc gacgtctgtc gagaagtttc 9240 tgatcgaaaa gttcgacagc gtctccgacc tgatgcagct ctcggagggc gaagaatctc 9300 gtgctttcag cttcgatgta ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg 9360 atggtttcta caaagatcgt tatgtttatc ggcactttgc atcggccgcg ctcccgattc 9420 cggaagtgct tgacattggg gaattcagcg agagcctgac ctattgcatc tcccgccgtg 9480 cacagggtgt cacgttgcaa gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg 9540 tcgcggaggc catggatgcg atcgctgcgg ccgatcttag ccagacgagc gggttcggcc 9600 cattcggacc gcaaggaatc ggtcaataca ctacatggcg tgatttcata tgcgcgattg 9660 ctgatcccca tgtgtatcac tggcaaactg tgatggacga caccgtcagt gcgtccgtcg 9720 cgcaggctct cgatgagctg atgctttggg ccgaggactg ccccgaagtc cggcacctcg 9780 tgcacgcgga tttcggctcc aacaatgtcc tgacggacaa tggccgcata acagcggtca 9840 ttgactggag cgaggcgatg ttcggggatt cccaatacga ggtcgccaac atcttcttct 9900 ggaggccgtg gttggcttgt atggagcagc agacgcgcta cttcgagcgg aggcatccgg 9960 agcttgcagg atcgccgcgg ctccgggcgt atatgctccg cattggtctt gaccaactct 10020 atcagagctt ggttgacggc aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg 10080 caatcgtccg atccggagcc gggactgtcg ggcgtacaca aatcgcccgc agaagcgcgg 10140 ccgtctggac cgatggctgt gtagaagtac tcgccgatag tggaaaccga cgccccagca 10200 ctcgtccgga tcgggagatg ggggaggcta actgaaacac ggaaggagac aataccggaa 10260 ggaacccgcg ctatgacggc aataaaaaga cagaataaaa cgcacgggtg ttgggtcgtt 10320 tgttcataaa cgcggggttc ggtcccaggg ctggcactct gtcgataccc caccgagacc 10380 ccattggggc caatacgccc gcgtttcttc cttttcccca ccccaccccc caagttcggg 10440 tgaaggccca gggctcgcag ccaacgtcgg ggcggcaggc cctgccatag ccactggccc 10500 cgtgggttag ggacggggtc ccccatgggg aatggtttat ggttcgtggg ggttattatt 10560 ttgggcgttg cgtggggtca ggtccacgac tggactgagc agacagaccc atggtttttg 10620 gatggcctgg gcatggaccg catgtactgg cgcgacacga acaccgggcg tctgtggctg 10680 ccaaacaccc ccgaccccca aaaaccaccg cgcggatttc tggcgtgcca agctagtcga 10740 ccaattctca tgtttgacag cttatcatcg cagatccggg caacgttgtt gccattgctg 10800 caggcgcaga actggtaggt atggaagatc catacattga atcaatattg gcaattagcc 10860 atattagtca ttggttatat agcataaatc aatattggct attggccatt gcatacgttg 10920 tatctatatc ataatatgta catttatatt ggctcatgtc caatatgacc gccat 10975 <210> 153 <211> 5774 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 153 tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 tccgctcatg aattaattct tagaaaaact catcgagcat caaatgaaac tgcaatttat 600 tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat gaaggagaaa 660 actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg attccgactc 720 gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta tcaagtgaga 780 aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc atttctttcc 840 agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca tcaaccaaac 900 cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac 960 aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat 1020 tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag 1080 tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca 1140 taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac 1200 ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg 1260 tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa tcagcatcca 1320 tgttggaatt taatcgcggc ctagagcaag acgtttcccg ttgaatatgg ctcataacac 1380 cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa 1440 cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 1500 gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 1560 gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 1620 agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 1680 aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 1740 agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 1800 cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 1860 accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 1920 aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 1980 ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 2040 cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 2100 gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 2160 tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 2220 agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg 2280 tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta 2340 caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 2400 ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 2460 gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 2520 gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 2580 gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 2640 aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 2700 ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 2760 acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 2820 ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 2880 tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 2940 tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 3000 cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 3060 gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 3120 ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggggccgc 3180 catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa 3240 ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc 3300 gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac 3360 gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca 3420 ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgagatc ccggtgccta 3480 atgagtgagc taacttacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa 3540 cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 3600 tgggcgccag ggtggttttt cttttcacca gtgagacggg caacagctga ttgcccttca 3660 ccgcctggcc ctgagagagt tgcagcaagc ggtccacgct ggtttgcccc agcaggcgaa 3720 aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg gtatcgtcgt 3780 atcccactac cgagatatcc gcaccaacgc gcagcccgga ctcggtaatg gcgcgcattg 3840 cgcccagcgc catctgatcg ttggcaacca gcatcgcagt gggaacgatg ccctcattca 3900 gcatttgcat ggtttgttga aaaccggaca tggcactcca gtcgccttcc cgttccgcta 3960 tcggctgaat ttgattgcga gtgagatatt tatgccagcc agccagacgc agacgcgccg 4020 agacagaact taatgggccc gctaacagcg cgatttgctg gtgacccaat gcgaccagat 4080 gctccacgcc cagtcgcgta ccgtcttcat gggagaaaat aatactgttg atgggtgtct 4140 ggtcagagac atcaagaaat aacgccggaa cattagtgca ggcagcttcc acagcaatgg 4200 catcctggtc atccagcgga tagttaatga tcagcccact gacgcgttgc gcgagaagat 4260 tgtgcaccgc cgctttacag gcttcgacgc cgcttcgttc taccatcgac accaccacgc 4320 tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac ggcgcgtgca 4380 gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc agttgttgtg 4440 ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt tcccgcgttt 4500 tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa gagacaccgg 4560 catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg aattgactct 4620 cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg gtgtccggga 4680 tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg 4740 ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc 4800 ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 4860 cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 4920 gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgggatctc gatcccgcga 4980 aattaatacg actcactata ggggaattgt gagcggataa caattcccct ctagaaataa 5040 ttttgtttaa ctttaagaag gagatataca tatgaaatac cttcttccga ctgctgctgc 5100 tggtctttta ctgctggctg ctcagccggc tatggctgct ggtggtggtt ctgccctcca 5160 gacggtctgc ctgaagggga ccaaggtgca catgaaatgc tttctggcct tcacccagac 5220 gaagaccttc cacgaggcca gcgaggactg catctcgcgc gggggcaccc tgagcacccc 5280 tcagactggc tcggagaacg acgccctgta tgagtacctg cgccagagcg tgggcaacga 5340 ggccgagatc tggctgggcc tcaacgacat ggcggccgag ggcacctggg tggacatgac 5400 cggtacccgc atcgcctaca agaactggga gactgagatc accgcgcaac ccgatggcgg 5460 caagaccgag aactgcgcgg tcctgtcagg cgcggccaac ggcaagtggt tcgacaagcg 5520 ctgcagggat caattgccct acatctgcca gttcgggatc gtgtacccct acgacgtgcc 5580 cgactacgcc caccaccacc accaccacta actcgagcac caccaccacc accactgaga 5640 tccggctgct aacaaagccc gaaaggaagc tgagttggct gctgccaccg ctgagcaata 5700 actagcataa ccccttgggg cctctaaacg ggtcttgagg ggttttttgc tgaaaggagg 5760 aactatatcc ggat 5774 <210> 154 <211> 4649 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 154 aagaaaccaa ttgtccatat tgcatcagac attgccgtca ctgcgtcttt tactggctct 60 tctcgctaac caaaccggta accccgctta ttaaaagcat tctgtaacaa agcgggacca 120 aagccatgac aaaaacgcgt aacaaaagtg tctataatca cggcagaaaa gtccacattg 180 attatttgca cggcgtcaca ctttgctatg ccatagcatt tttatccata agattagcgg 240 atcctacctg acgcttttta tcgcaactct ctactgtttc tccatacccg ttttttgggc 300 taacaggagg aattcaccat gaaaaagaca gctatcgcga ttgcagtggc actggctggt 360 ttcgctaccg ttgcgcaagc ttctgagcca ccaacccaga agcccaagaa gattgtaaat 420 gccaagaaag atgttgtgaa cacaaagatg tttgaggagc tcaagagccg tctggacacc 480 ctggcccagg aggtggccct gctgaaggag cagcaggccc tccagacggt ctgcctgaag 540 gggaccaagg tgcacatgaa atgctttctg gccttcaccc agacgaagac cttccacgag 600 gccagcgagg actgcatctc gcgcgggggc accctgagca cccctcagac tggctcggag 660 aacgacgccc tgtatgagta cctgcgccag agcgtgggca acgaggccga gatctggctg 720 ggcctcaacg acatggcggc cgagggcacc tgggtggaca tgaccggtac ccgcatcgcc 780 tacaagaact gggagactga gatcaccgcg caacccgatg gcggcaagac cgagaactgc 840 gcggtcctgt caggcgcggc caacggcaag tggttcgaca agcgctgcag ggatcaattg 900 ccctacatct gccagttcgg gatcgttcta gaacaaaaac tcatctcaga agaggatctg 960 aatagcgccg tcgaccatca tcatcatcat cattgagttt aaacggtctc cagcttggct 1020 gttttggcgg atgagagaag attttcagcc tgatacagat taaatcagaa cgcagaagcg 1080 gtctgataaa acagaatttg cctggcggca gtagcgcggt ggtcccacct gaccccatgc 1140 cgaactcaga agtgaaacgc cgtagcgccg atggtagtgt ggggtctccc catgcgagag 1200 tagggaactg ccaggcatca aataaaacga aaggctcagt cgaaagactg ggcctttcgt 1260 tttatctgtt gtttgtcggt gaacgctctc ctgagtagga caaatccgcc gggagcggat 1320 ttgaacgttg cgaagcaacg gcccggaggg tggcgggcag gacgcccgcc ataaactgcc 1380 aggcatcaaa ttaagcagaa ggccatcctg acggatggcc tttttgcgtt tctacaaact 1440 ctttttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct 1500 gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg 1560 cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg 1620 tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc 1680 tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca 1740 cttttaaagt tctgctatgt ggcgcggtat tatcccgtgt tgacgccggg caagagcaac 1800 tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa 1860 agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg 1920 ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt 1980 ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg 2040 aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca acaacgttgc 2100 gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga 2160 tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta 2220 ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc 2280 cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg 2340 atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt 2400 cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa 2460 ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 2520 cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt 2580 ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt 2640 tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga 2700 taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag 2760 caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata 2820 agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg 2880 gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga 2940 gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca 3000 ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa 3060 acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt 3120 tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac 3180 ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt 3240 ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga 3300 ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg tattttctcc 3360 ttacgcatct gtgcggtatt tcacaccgca tatggtgcac tctcagtaca atctgctctg 3420 atgccgcata gttaagccag tatacactcc gctatcgcta cgtgactggg tcatggctgc 3480 gccccgacac ccgccaacac ccgctgacgc gccctgacgg gcttgtctgc tcccggcatc 3540 cgcttacaga caagctgtga ccgtctccgg gagctgcatg tgtcagaggt tttcaccgtc 3600 atcaccgaaa cgcgcgaggc agcagatcaa ttcgcgcgcg aaggcgaagc ggcatgcata 3660 atgtgcctgt caaatggacg aagcagggat tctgcaaacc ctatgctact ccgtcaagcc 3720 gtcaattgtc tgattcgtta ccaattatga caacttgacg gctacatcat tcactttttc 3780 ttcacaaccg gcacggaact cgctcgggct ggccccggtg cattttttaa atacccgcga 3840 gaaatagagt tgatcgtcaa aaccaacatt gcgaccgacg gtggcgatag gcatccgggt 3900 ggtgctcaaa agcagcttcg cctggctgat acgttggtcc tcgcgccagc ttaagacgct 3960 aatccctaac tgctggcgga aaagatgtga cagacgcgac ggcgacaagc aaacatgctg 4020 tgcgacgctg gcgatatcaa aattgctgtc tgccaggtga tcgctgatgt actgacaagc 4080 ctcgcgtacc cgattatcca tcggtggatg gagcgactcg ttaatcgctt ccatgcgccg 4140 cagtaacaat tgctcaagca gatttatcgc cagcagctcc gaatagcgcc cttccccttg 4200 cccggcgtta atgatttgcc caaacaggtc gctgaaatgc ggctggtgcg cttcatccgg 4260 gcgaaagaac cccgtattgg caaatattga cggccagtta agccattcat gccagtaggc 4320 gcgcggacga aagtaaaccc actggtgata ccattcgcga gcctccggat gacgaccgta 4380 gtgatgaatc tctcctggcg ggaacagcaa aatatcaccc ggtcggcaaa caaattctcg 4440 tccctgattt ttcaccaccc cctgaccgcg aatggtgaga ttgagaatat aacctttcat 4500 tcccagcggt cggtcgataa aaaaatcgag ataaccgttg gcctcaatcg gcgttaaacc 4560 cgccaccaga tgggcattaa acgagtatcc cggcagcagg ggatcatttt gcgcttcagc 4620 catacttttc atactcccgc cattcagag 4649 <210> 155 <211> 10972 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 155 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta tcatatgcca agtccgcccc ctattgacgt caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttac gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacaccaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa taaccccgcc ccgttgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 600 gtcagatcac tagaagctgg gtaccagctg ctagcgttta aacttaagct tagcgcagag 660 gcttggggca gccgagcggc agccaggccc cggcccgggc ctcggttcca gaagggagag 720 gagcccgcca aggcgcgcaa gagagcgggc tgcctcgcag tccgagccgg agagggagcg 780 cgagccgcgc cggccccgga cggcctccga aaccatggag ctgtgggggg cctacctgct 840 gctgtgcctg ttctccctgc tgacccaggt gaccaccgag ccaccaaccc agaagcccaa 900 gaagattgta aatgccaaga aagatgttgt gaacacaaag atgtttgagg agctcaagag 960 ccgtctggac accctggccc aggaggtggc cctgctgaag gagcagcagg ccctccagac 1020 gtgcctgaag gggaccaagg tgcacatgaa atgctttctg gccttcaccc agacgaagac 1080 cttccacgag gccagcgagg actgcatctc gcgcgggggc accctgagca cccctcagac 1140 tggctcggag aacgacgccc tgtatgagta cctgcgccag agcgtgggca acgaggccga 1200 gatctggctg ggcctcaacg acatggcggc cgagggcacc tgggtggaca tgaccggtac 1260 ccgcatcgcc tacaagaact gggagactga gatcaccgcg caacccgatg gcggcaagac 1320 cgagaactgc gcggtcctgt caggcgcggc caacggcaag tggttcgaca agcgctgcag 1380 ggatcaattg ccctacatct gccagttcgg gatcgtgcac caccaccacc accactaact 1440 cgaggccggc aaggccggat ccagacatga taagatacat tgatgagttt ggacaaacca 1500 caactagaat gcagtgaaaa aaatgcttta tttgtgaaat ttgtgatgct attgctttat 1560 ttgtaaccat tataagctgc aataaacaag ttaacaacaa gaattgcatt cattttatgt 1620 ttcaggttca gggggaggtg tgggaggttt tttaaagcaa gtaaaacctc tacaaatgtg 1680 gtatggctga ttatgatccg gctgcctcgc gcgtttcggt gatgacggtg aaaacctctg 1740 acacatgcag ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca 1800 agcccgtcag gcgtcagcgg gtgttggcgg gtgtcggggc gcagccatga ggtcgactct 1860 agaggatcga tgccccgccc cggacgaact aaacctgact acgacatctc tgccccttct 1920 tcgcggggca gtgcatgtaa tcccttcagt tggttggtac aacttgccaa ctgggccctg 1980 ttccacatgt gacacggggg gggaccaaac acaaaggggt tctctgactg tagttgacat 2040 ccttataaat ggatgtgcac atttgccaac actgagtggc tttcatcctg gagcagactt 2100 tgcagtctgt ggactgcaac acaacattgc ctttatgtgt aactcttggc tgaagctctt 2160 acaccaatgc tgggggacat gtacctccca ggggcccagg aagactacgg gaggctacac 2220 caacgtcaat cagaggggcc tgtgtagcta ccgataagcg gaccctcaag agggcattag 2280 caatagtgtt tataaggccc ccttgttaac cctaaacggg tagcatatgc ttcccgggta 2340 gtagtatata ctatccagac taaccctaat tcaatagcat atgttaccca acgggaagca 2400 tatgctatcg aattagggtt agtaaaaggg tcctaaggaa cagcgatatc tcccacccca 2460 tgagctgtca cggttttatt tacatggggt caggattcca cgagggtagt gaaccatttt 2520 agtcacaagg gcagtggctg aagatcaagg agcgggcagt gaactctcct gaatcttcgc 2580 ctgcttcttc attctccttc gtttagctaa tagaataact gctgagttgt gaacagtaag 2640 gtgtatgtga ggtgctcgaa aacaaggttt caggtgacgc ccccagaata aaatttggac 2700 ggggggttca gtggtggcat tgtgctatga caccaatata accctcacaa accccttggg 2760 caataaatac tagtgtagga atgaaacatt ctgaatatct ttaacaatag aaatccatgg 2820 ggtggggaca agccgtaaag actggatgtc catctcacac gaatttatgg ctatgggcaa 2880 cacataatcc tagtgcaata tgatactggg gttattaaga tgtgtcccag gcagggacca 2940 agacaggtga accatgttgt tacactctat ttgtaacaag gggaaagaga gtggacgccg 3000 acagcagcgg actccactgg ttgtctctaa cacccccgaa aattaaacgg ggctccacgc 3060 caatggggcc cataaacaaa gacaagtggc cactcttttt tttgaaattg tggagtgggg 3120 gcacgcgtca gcccccacac gccgccctgc ggttttggac tgtaaaataa gggtgtaata 3180 acttggctga ttgtaacccc gctaaccact gcggtcaaac cacttgccca caaaaccact 3240 aatggcaccc cggggaatac ctgcataagt aggtgggcgg gccaagatag gggcgcgatt 3300 gctgcgatct ggaggacaaa ttacacacac ttgcgcctga gcgccaagca cagggttgtt 3360 ggtcctcata ttcacgaggt cgctgagagc acggtgggct aatgttgcca tgggtagcat 3420 atactaccca aatatctgga tagcatatgc tatcctaatc tatatctggg tagcataggc 3480 tatcctaatc tatatctggg tagcatatgc tatcctaatc tatatctggg tagtatatgc 3540 tatcctaatt tatatctggg tagcataggc tatcctaatc tatatctggg tagcatatgc 3600 tatcctaatc tatatctggg tagtatatgc tatcctaatc tgtatccggg tagcatatgc 3660 tatcctaata gagattaggg tagtatatgc tatcctaatt tatatctggg tagcatatac 3720 tacccaaata tctggatagc atatgctatc ctaatctata tctgggtagc atatgctatc 3780 ctaatctata tctgggtagc ataggctatc ctaatctata tctgggtagc atatgctatc 3840 ctaatctata tctgggtagt atatgctatc ctaatttata tctgggtagc ataggctatc 3900 ctaatctata tctgggtagc atatgctatc ctaatctata tctgggtagt atatgctatc 3960 ctaatctgta tccgggtagc atatgctatc ctcatgcata tacagtcagc atatgatacc 4020 cagtagtaga gtgggagtgc tatcctttgc atatgccgcc acctcccaag ggggcgtgaa 4080 ttttcgctgc ttgtcctttt cctgctggtt gctcccattc ttaggtgaat ttaaggaggc 4140 caggctaaag ccgtcgcatg tctgattgct caccaggtaa atgtcgctaa tgttttccaa 4200 cgcgagaagg tgttgagcgc ggagctgagt gacgtgacaa catgggtatg ccgaattgcc 4260 ccatgttggg aggacgaaaa tggtgacaag acagatggcc agaaatacac caacagcacg 4320 catgatgtct actggggatt tattctttag tgcgggggaa tacacggctt ttaatacgat 4380 tgagggcgtc tcctaacaag ttacatcact cctgcccttc ctcaccctca tctccatcac 4440 ctccttcatc tccgtcatct ccgtcatcac cctccgcggc agccccttcc accataggtg 4500 gaaaccaggg aggcaaatct actccatcgt caaagctgca cacagtcacc ctgatattgc 4560 aggtaggagc gggctttgtc ataacaaggt ccttaatcgc atccttcaaa acctcagcaa 4620 atatatgagt ttgtaaaaag accatgaaat aacagacaat ggactccctt agcgggccag 4680 gttgtgggcc gggtccaggg gccattccaa aggggagacg actcaatggt gtaagacgac 4740 attgtggaat agcaagggca gttcctcgcc ttaggttgta aagggaggtc ttactacctc 4800 catatacgaa cacaccggcg acccaagttc cttcgtcggt agtcctttct acgtgactcc 4860 tagccaggag agctcttaaa ccttctgcaa tgttctcaaa tttcgggttg gaacctcctt 4920 gaccacgatg ctttccaaac caccctcctt ttttgcgcct gcctccatca ccctgacccc 4980 ggggtccagt gcttgggcct tctcctgggt catctgcggg gccctgctct atcgctcccg 5040 ggggcacgtc aggctcacca tctgggccac cttcttggtg gtattcaaaa taatcggctt 5100 cccctacagg gtggaaaaat ggccttctac ctggaggggg cctgcgcggt ggagacccgg 5160 atgatgatga ctgactactg ggactcctgg gcctcttttc tccacgtcca cgacctctcc 5220 ccctggctct ttcacgactt ccccccctgg ctctttcacg tcctctaccc cggcggcctc 5280 cactacctcc tcgaccccgg cctccactac ctcctcgacc ccggcctcca ctgcctcctc 5340 gaccccggcc tccacctcct gctcctgccc ctcctgctcc tgcccctcct cctgctcctg 5400 cccctcctgc ccctcctgct cctgcccctc ctgcccctcc tgctcctgcc cctcctgccc 5460 ctcctgctcc tgcccctcct gcccctcctc ctgctcctgc ccctcctgcc cctcctcctg 5520 ctcctgcccc tcctgcccct cctgctcctg cccctcctgc ccctcctgct cctgcccctc 5580 ctgcccctcc tgctcctgcc cctcctgctc ctgcccctcc tgctcctgcc cctcctgctc 5640 ctgcccctcc tgcccctcct gcccctcctc ctgctcctgc ccctcctgct cctgcccctc 5700 ctgcccctcc tgcccctcct gctcctgccc ctcctcctgc tcctgcccct cctgcccctc 5760 ctgcccctcc tcctgctcct gcccctcctg cccctcctcc tgctcctgcc cctcctcctg 5820 ctcctgcccc tcctgcccct cctgcccctc ctcctgctcc tgcccctcct gcccctcctc 5880 ctgctcctgc ccctcctcct gctcctgccc ctcctgcccc tcctgcccct cctcctgctc 5940 ctgcccctcc tcctgctcct gcccctcctg cccctcctgc ccctcctgcc cctcctcctg 6000 ctcctgcccc tcctcctgct cctgcccctc ctgctcctgc ccctcccgct cctgctcctg 6060 ctcctgttcc accgtgggtc cctttgcagc caatgcaact tggacgtttt tggggtctcc 6120 ggacaccatc tctatgtctt ggccctgatc ctgagccgcc cggggctcct ggtcttccgc 6180 ctcctcgtcc tcgtcctctt ccccgtcctc gtccatggtt atcaccccct cttctttgag 6240 gtccactgcc gccggagcct tctggtccag atgtgtctcc cttctctcct aggccatttc 6300 caggtcctgt acctggcccc tcgtcagaca tgattcacac taaaagagat caatagacat 6360 ctttattaga cgacgctcag tgaatacagg gagtgcagac tcctgccccc tccaacagcc 6420 cccccaccct catccccttc atggtcgctg tcagacagat ccaggtctga aaattcccca 6480 tcctccgaac catcctcgtc ctcatcacca attactcgca gcccggaaaa ctcccgctga 6540 acatcctcaa gatttgcgtc ctgagcctca agccaggcct caaattcctc gtcccccttt 6600 ttgctggacg gtagggatgg ggattctcgg gacccctcct cttcctcttc aaggtcacca 6660 gacagagatg ctactggggc aacggaagaa aagctgggtg cggcctgtga ggatcagctt 6720 atcgatgata agctgtcaaa catgagaatt cttgaagacg aaagggcctc gtgatacgcc 6780 tatttttata ggttaatgtc atgataataa tggtttctta gacgtcaggt ggcacttttc 6840 ggggaaatgt gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc 6900 cgctcatgag acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga 6960 gtattcaaca tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt 7020 ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag 7080 tgggttacat cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag 7140 aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgtg 7200 ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg 7260 agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca 7320 gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag 7380 gaccgaagga gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc 7440 gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg 7500 cagcaatggc aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc 7560 ggcaacaatt aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg 7620 cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg 7680 gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga 7740 cggggagtca ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac 7800 tgattaagca ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa 7860 aacttcattt ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca 7920 aaatccctta acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag 7980 gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac 8040 cgctaccagc ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa 8100 ctggcttcag cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc 8160 accacttcaa gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag 8220 tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac 8280 cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc 8340 gaacgaccta caccgaactg agatacctac agcgtgagct atgagaaagc gccacgcttc 8400 ccgaagggag aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca 8460 cgagggagct tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc 8520 tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg 8580 ccagcaacgc ggccttttta cggttcctgg ccttttgctg gccttgaagc tgtccctgat 8640 ggtcgtcatc tacctgcctg gacagcatgg cctgcaacgc gggcatcccg atgccgccgg 8700 aagcgagaag aatcataatg gggaaggcca tccagcctcg cgtcgcgaac gccagcaaga 8760 cgtagcccag cgcgtcggcc ccgagatgcg ccgcgtgcgg ctgctggaga tggcggacgc 8820 gatggatatg ttctgccaag ggttggtttg cgcattcaca gttctccgca agaattgatt 8880 ggctccaatt cttggagtgg tgaatccgtt agcgaggtgc cgccctgctt catccccgtg 8940 gcccgttgct cgcgtttgct ggcggtgtcc ccggaagaaa tatatttgca tgtctttagt 9000 tctatgatga cacaaacccc gcccagcgtc ttgtcattgg cgaattcgaa cacgcagatg 9060 cagtcggggc ggcgcggtcc gaggtccact tcgcatatta aggtgacgcg tgtggcctcg 9120 aacaccgagc gaccctgcag cgacccgctt aacagcgtca acagcgtgcc gcagatcccg 9180 gggggcaatg agatatgaaa aagcctgaac tcaccgcgac gtctgtcgag aagtttctga 9240 tcgaaaagtt cgacagcgtc tccgacctga tgcagctctc ggagggcgaa gaatctcgtg 9300 ctttcagctt cgatgtagga gggcgtggat atgtcctgcg ggtaaatagc tgcgccgatg 9360 gtttctacaa agatcgttat gtttatcggc actttgcatc ggccgcgctc ccgattccgg 9420 aagtgcttga cattggggaa ttcagcgaga gcctgaccta ttgcatctcc cgccgtgcac 9480 agggtgtcac gttgcaagac ctgcctgaaa ccgaactgcc cgctgttctg cagccggtcg 9540 cggaggccat ggatgcgatc gctgcggccg atcttagcca gacgagcggg ttcggcccat 9600 tcggaccgca aggaatcggt caatacacta catggcgtga tttcatatgc gcgattgctg 9660 atccccatgt gtatcactgg caaactgtga tggacgacac cgtcagtgcg tccgtcgcgc 9720 aggctctcga tgagctgatg ctttgggccg aggactgccc cgaagtccgg cacctcgtgc 9780 acgcggattt cggctccaac aatgtcctga cggacaatgg ccgcataaca gcggtcattg 9840 actggagcga ggcgatgttc ggggattccc aatacgaggt cgccaacatc ttcttctgga 9900 ggccgtggtt ggcttgtatg gagcagcaga cgcgctactt cgagcggagg catccggagc 9960 ttgcaggatc gccgcggctc cgggcgtata tgctccgcat tggtcttgac caactctatc 10020 agagcttggt tgacggcaat ttcgatgatg cagcttgggc gcagggtcga tgcgacgcaa 10080 tcgtccgatc cggagccggg actgtcgggc gtacacaaat cgcccgcaga agcgcggccg 10140 tctggaccga tggctgtgta gaagtactcg ccgatagtgg aaaccgacgc cccagcactc 10200 gtccggatcg ggagatgggg gaggctaact gaaacacgga aggagacaat accggaagga 10260 acccgcgcta tgacggcaat aaaaagacag aataaaacgc acgggtgttg ggtcgtttgt 10320 tcataaacgc ggggttcggt cccagggctg gcactctgtc gataccccac cgagacccca 10380 ttggggccaa tacgcccgcg tttcttcctt ttccccaccc caccccccaa gttcgggtga 10440 aggcccaggg ctcgcagcca acgtcggggc ggcaggccct gccatagcca ctggccccgt 10500 gggttaggga cggggtcccc catggggaat ggtttatggt tcgtgggggt tattattttg 10560 ggcgttgcgt ggggtcaggt ccacgactgg actgagcaga cagacccatg gtttttggat 10620 ggcctgggca tggaccgcat gtactggcgc gacacgaaca ccgggcgtct gtggctgcca 10680 aacacccccg acccccaaaa accaccgcgc ggatttctgg cgtgccaagc tagtcgacca 10740 attctcatgt ttgacagctt atcatcgcag atccgggcaa cgttgttgcc attgctgcag 10800 gcgcagaact ggtaggtatg gaagatccat acattgaatc aatattggca attagccata 10860 ttagtcattg gttatatagc ataaatcaat attggctatt ggccattgca tacgttgtat 10920 ctatatcata atatgtacat ttatattggc tcatgtccaa tatgaccgcc at 10972 <210> 156 <211> 10972 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 156 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta tcatatgcca agtccgcccc ctattgacgt caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttac gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacaccaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa taaccccgcc ccgttgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 600 gtcagatcac tagaagctgg gtaccagctg ctagcgttta aacttaagct tagcgcagag 660 gcttggggca gccgagcggc agccaggccc cggcccgggc ctcggttcca gaagggagag 720 gagcccgcca aggcgcgcaa gagagcgggc tgcctcgcag tccgagccgg agagggagcg 780 cgagccgcgc cggccccgga cggcctccga aaccatggag ctgtgggggg cctacctgct 840 gctgtgcctg ttctccctgc tgacccaggt gaccaccgag ccaccaaccc agaagcccaa 900 gaagattgta aatgccaaga aagatgttgt gaacacaaag atgtttgagg agctcaagag 960 ccgtctggac accctggccc aggaggtggc cctgctgaag gagcagcagg ccctccaggt 1020 ctgcctgaag gggaccaagg tgcacatgaa atgctttctg gccttcaccc agacgaagac 1080 cttccacgag gccagcgagg actgcatctc gcgcgggggc accctgagca cccctcagac 1140 tggctcggag aacgacgccc tgtatgagta cctgcgccag agcgtgggca acgaggccga 1200 gatctggctg ggcctcaacg acatggcggc cgagggcacc tgggtggaca tgaccggtac 1260 ccgcatcgcc tacaagaact gggagactga gatcaccgcg caacccgatg gcggcaagac 1320 cgagaactgc gcggtcctgt caggcgcggc caacggcaag tggttcgaca agcgctgcag 1380 ggatcaattg ccctacatct gccagttcgg gatcgtgcac caccaccacc accactaact 1440 cgaggccggc aaggccggat ccagacatga taagatacat tgatgagttt ggacaaacca 1500 caactagaat gcagtgaaaa aaatgcttta tttgtgaaat ttgtgatgct attgctttat 1560 ttgtaaccat tataagctgc aataaacaag ttaacaacaa gaattgcatt cattttatgt 1620 ttcaggttca gggggaggtg tgggaggttt tttaaagcaa gtaaaacctc tacaaatgtg 1680 gtatggctga ttatgatccg gctgcctcgc gcgtttcggt gatgacggtg aaaacctctg 1740 acacatgcag ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca 1800 agcccgtcag gcgtcagcgg gtgttggcgg gtgtcggggc gcagccatga ggtcgactct 1860 agaggatcga tgccccgccc cggacgaact aaacctgact acgacatctc tgccccttct 1920 tcgcggggca gtgcatgtaa tcccttcagt tggttggtac aacttgccaa ctgggccctg 1980 ttccacatgt gacacggggg gggaccaaac acaaaggggt tctctgactg tagttgacat 2040 ccttataaat ggatgtgcac atttgccaac actgagtggc tttcatcctg gagcagactt 2100 tgcagtctgt ggactgcaac acaacattgc ctttatgtgt aactcttggc tgaagctctt 2160 acaccaatgc tgggggacat gtacctccca ggggcccagg aagactacgg gaggctacac 2220 caacgtcaat cagaggggcc tgtgtagcta ccgataagcg gaccctcaag agggcattag 2280 caatagtgtt tataaggccc ccttgttaac cctaaacggg tagcatatgc ttcccgggta 2340 gtagtatata ctatccagac taaccctaat tcaatagcat atgttaccca acgggaagca 2400 tatgctatcg aattagggtt agtaaaaggg tcctaaggaa cagcgatatc tcccacccca 2460 tgagctgtca cggttttatt tacatggggt caggattcca cgagggtagt gaaccatttt 2520 agtcacaagg gcagtggctg aagatcaagg agcgggcagt gaactctcct gaatcttcgc 2580 ctgcttcttc attctccttc gtttagctaa tagaataact gctgagttgt gaacagtaag 2640 gtgtatgtga ggtgctcgaa aacaaggttt caggtgacgc ccccagaata aaatttggac 2700 ggggggttca gtggtggcat tgtgctatga caccaatata accctcacaa accccttggg 2760 caataaatac tagtgtagga atgaaacatt ctgaatatct ttaacaatag aaatccatgg 2820 ggtggggaca agccgtaaag actggatgtc catctcacac gaatttatgg ctatgggcaa 2880 cacataatcc tagtgcaata tgatactggg gttattaaga tgtgtcccag gcagggacca 2940 agacaggtga accatgttgt tacactctat ttgtaacaag gggaaagaga gtggacgccg 3000 acagcagcgg actccactgg ttgtctctaa cacccccgaa aattaaacgg ggctccacgc 3060 caatggggcc cataaacaaa gacaagtggc cactcttttt tttgaaattg tggagtgggg 3120 gcacgcgtca gcccccacac gccgccctgc ggttttggac tgtaaaataa gggtgtaata 3180 acttggctga ttgtaacccc gctaaccact gcggtcaaac cacttgccca caaaaccact 3240 aatggcaccc cggggaatac ctgcataagt aggtgggcgg gccaagatag gggcgcgatt 3300 gctgcgatct ggaggacaaa ttacacacac ttgcgcctga gcgccaagca cagggttgtt 3360 ggtcctcata ttcacgaggt cgctgagagc acggtgggct aatgttgcca tgggtagcat 3420 atactaccca aatatctgga tagcatatgc tatcctaatc tatatctggg tagcataggc 3480 tatcctaatc tatatctggg tagcatatgc tatcctaatc tatatctggg tagtatatgc 3540 tatcctaatt tatatctggg tagcataggc tatcctaatc tatatctggg tagcatatgc 3600 tatcctaatc tatatctggg tagtatatgc tatcctaatc tgtatccggg tagcatatgc 3660 tatcctaata gagattaggg tagtatatgc tatcctaatt tatatctggg tagcatatac 3720 tacccaaata tctggatagc atatgctatc ctaatctata tctgggtagc atatgctatc 3780 ctaatctata tctgggtagc ataggctatc ctaatctata tctgggtagc atatgctatc 3840 ctaatctata tctgggtagt atatgctatc ctaatttata tctgggtagc ataggctatc 3900 ctaatctata tctgggtagc atatgctatc ctaatctata tctgggtagt atatgctatc 3960 ctaatctgta tccgggtagc atatgctatc ctcatgcata tacagtcagc atatgatacc 4020 cagtagtaga gtgggagtgc tatcctttgc atatgccgcc acctcccaag ggggcgtgaa 4080 ttttcgctgc ttgtcctttt cctgctggtt gctcccattc ttaggtgaat ttaaggaggc 4140 caggctaaag ccgtcgcatg tctgattgct caccaggtaa atgtcgctaa tgttttccaa 4200 cgcgagaagg tgttgagcgc ggagctgagt gacgtgacaa catgggtatg ccgaattgcc 4260 ccatgttggg aggacgaaaa tggtgacaag acagatggcc agaaatacac caacagcacg 4320 catgatgtct actggggatt tattctttag tgcgggggaa tacacggctt ttaatacgat 4380 tgagggcgtc tcctaacaag ttacatcact cctgcccttc ctcaccctca tctccatcac 4440 ctccttcatc tccgtcatct ccgtcatcac cctccgcggc agccccttcc accataggtg 4500 gaaaccaggg aggcaaatct actccatcgt caaagctgca cacagtcacc ctgatattgc 4560 aggtaggagc gggctttgtc ataacaaggt ccttaatcgc atccttcaaa acctcagcaa 4620 atatatgagt ttgtaaaaag accatgaaat aacagacaat ggactccctt agcgggccag 4680 gttgtgggcc gggtccaggg gccattccaa aggggagacg actcaatggt gtaagacgac 4740 attgtggaat agcaagggca gttcctcgcc ttaggttgta aagggaggtc ttactacctc 4800 catatacgaa cacaccggcg acccaagttc cttcgtcggt agtcctttct acgtgactcc 4860 tagccaggag agctcttaaa ccttctgcaa tgttctcaaa tttcgggttg gaacctcctt 4920 gaccacgatg ctttccaaac caccctcctt ttttgcgcct gcctccatca ccctgacccc 4980 ggggtccagt gcttgggcct tctcctgggt catctgcggg gccctgctct atcgctcccg 5040 ggggcacgtc aggctcacca tctgggccac cttcttggtg gtattcaaaa taatcggctt 5100 cccctacagg gtggaaaaat ggccttctac ctggaggggg cctgcgcggt ggagacccgg 5160 atgatgatga ctgactactg ggactcctgg gcctcttttc tccacgtcca cgacctctcc 5220 ccctggctct ttcacgactt ccccccctgg ctctttcacg tcctctaccc cggcggcctc 5280 cactacctcc tcgaccccgg cctccactac ctcctcgacc ccggcctcca ctgcctcctc 5340 gaccccggcc tccacctcct gctcctgccc ctcctgctcc tgcccctcct cctgctcctg 5400 cccctcctgc ccctcctgct cctgcccctc ctgcccctcc tgctcctgcc cctcctgccc 5460 ctcctgctcc tgcccctcct gcccctcctc ctgctcctgc ccctcctgcc cctcctcctg 5520 ctcctgcccc tcctgcccct cctgctcctg cccctcctgc ccctcctgct cctgcccctc 5580 ctgcccctcc tgctcctgcc cctcctgctc ctgcccctcc tgctcctgcc cctcctgctc 5640 ctgcccctcc tgcccctcct gcccctcctc ctgctcctgc ccctcctgct cctgcccctc 5700 ctgcccctcc tgcccctcct gctcctgccc ctcctcctgc tcctgcccct cctgcccctc 5760 ctgcccctcc tcctgctcct gcccctcctg cccctcctcc tgctcctgcc cctcctcctg 5820 ctcctgcccc tcctgcccct cctgcccctc ctcctgctcc tgcccctcct gcccctcctc 5880 ctgctcctgc ccctcctcct gctcctgccc ctcctgcccc tcctgcccct cctcctgctc 5940 ctgcccctcc tcctgctcct gcccctcctg cccctcctgc ccctcctgcc cctcctcctg 6000 ctcctgcccc tcctcctgct cctgcccctc ctgctcctgc ccctcccgct cctgctcctg 6060 ctcctgttcc accgtgggtc cctttgcagc caatgcaact tggacgtttt tggggtctcc 6120 ggacaccatc tctatgtctt ggccctgatc ctgagccgcc cggggctcct ggtcttccgc 6180 ctcctcgtcc tcgtcctctt ccccgtcctc gtccatggtt atcaccccct cttctttgag 6240 gtccactgcc gccggagcct tctggtccag atgtgtctcc cttctctcct aggccatttc 6300 caggtcctgt acctggcccc tcgtcagaca tgattcacac taaaagagat caatagacat 6360 ctttattaga cgacgctcag tgaatacagg gagtgcagac tcctgccccc tccaacagcc 6420 cccccaccct catccccttc atggtcgctg tcagacagat ccaggtctga aaattcccca 6480 tcctccgaac catcctcgtc ctcatcacca attactcgca gcccggaaaa ctcccgctga 6540 acatcctcaa gatttgcgtc ctgagcctca agccaggcct caaattcctc gtcccccttt 6600 ttgctggacg gtagggatgg ggattctcgg gacccctcct cttcctcttc aaggtcacca 6660 gacagagatg ctactggggc aacggaagaa aagctgggtg cggcctgtga ggatcagctt 6720 atcgatgata agctgtcaaa catgagaatt cttgaagacg aaagggcctc gtgatacgcc 6780 tatttttata ggttaatgtc atgataataa tggtttctta gacgtcaggt ggcacttttc 6840 ggggaaatgt gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc 6900 cgctcatgag acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga 6960 gtattcaaca tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt 7020 ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag 7080 tgggttacat cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag 7140 aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgtg 7200 ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg 7260 agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca 7320 gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag 7380 gaccgaagga gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc 7440 gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg 7500 cagcaatggc aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc 7560 ggcaacaatt aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg 7620 cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg 7680 gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga 7740 cggggagtca ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac 7800 tgattaagca ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa 7860 aacttcattt ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca 7920 aaatccctta acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag 7980 gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac 8040 cgctaccagc ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa 8100 ctggcttcag cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc 8160 accacttcaa gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag 8220 tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac 8280 cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc 8340 gaacgaccta caccgaactg agatacctac agcgtgagct atgagaaagc gccacgcttc 8400 ccgaagggag aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca 8460 cgagggagct tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc 8520 tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg 8580 ccagcaacgc ggccttttta cggttcctgg ccttttgctg gccttgaagc tgtccctgat 8640 ggtcgtcatc tacctgcctg gacagcatgg cctgcaacgc gggcatcccg atgccgccgg 8700 aagcgagaag aatcataatg gggaaggcca tccagcctcg cgtcgcgaac gccagcaaga 8760 cgtagcccag cgcgtcggcc ccgagatgcg ccgcgtgcgg ctgctggaga tggcggacgc 8820 gatggatatg ttctgccaag ggttggtttg cgcattcaca gttctccgca agaattgatt 8880 ggctccaatt cttggagtgg tgaatccgtt agcgaggtgc cgccctgctt catccccgtg 8940 gcccgttgct cgcgtttgct ggcggtgtcc ccggaagaaa tatatttgca tgtctttagt 9000 tctatgatga cacaaacccc gcccagcgtc ttgtcattgg cgaattcgaa cacgcagatg 9060 cagtcggggc ggcgcggtcc gaggtccact tcgcatatta aggtgacgcg tgtggcctcg 9120 aacaccgagc gaccctgcag cgacccgctt aacagcgtca acagcgtgcc gcagatcccg 9180 gggggcaatg agatatgaaa aagcctgaac tcaccgcgac gtctgtcgag aagtttctga 9240 tcgaaaagtt cgacagcgtc tccgacctga tgcagctctc ggagggcgaa gaatctcgtg 9300 ctttcagctt cgatgtagga gggcgtggat atgtcctgcg ggtaaatagc tgcgccgatg 9360 gtttctacaa agatcgttat gtttatcggc actttgcatc ggccgcgctc ccgattccgg 9420 aagtgcttga cattggggaa ttcagcgaga gcctgaccta ttgcatctcc cgccgtgcac 9480 agggtgtcac gttgcaagac ctgcctgaaa ccgaactgcc cgctgttctg cagccggtcg 9540 cggaggccat ggatgcgatc gctgcggccg atcttagcca gacgagcggg ttcggcccat 9600 tcggaccgca aggaatcggt caatacacta catggcgtga tttcatatgc gcgattgctg 9660 atccccatgt gtatcactgg caaactgtga tggacgacac cgtcagtgcg tccgtcgcgc 9720 aggctctcga tgagctgatg ctttgggccg aggactgccc cgaagtccgg cacctcgtgc 9780 acgcggattt cggctccaac aatgtcctga cggacaatgg ccgcataaca gcggtcattg 9840 actggagcga ggcgatgttc ggggattccc aatacgaggt cgccaacatc ttcttctgga 9900 ggccgtggtt ggcttgtatg gagcagcaga cgcgctactt cgagcggagg catccggagc 9960 ttgcaggatc gccgcggctc cgggcgtata tgctccgcat tggtcttgac caactctatc 10020 agagcttggt tgacggcaat ttcgatgatg cagcttgggc gcagggtcga tgcgacgcaa 10080 tcgtccgatc cggagccggg actgtcgggc gtacacaaat cgcccgcaga agcgcggccg 10140 tctggaccga tggctgtgta gaagtactcg ccgatagtgg aaaccgacgc cccagcactc 10200 gtccggatcg ggagatgggg gaggctaact gaaacacgga aggagacaat accggaagga 10260 acccgcgcta tgacggcaat aaaaagacag aataaaacgc acgggtgttg ggtcgtttgt 10320 tcataaacgc ggggttcggt cccagggctg gcactctgtc gataccccac cgagacccca 10380 ttggggccaa tacgcccgcg tttcttcctt ttccccaccc caccccccaa gttcgggtga 10440 aggcccaggg ctcgcagcca acgtcggggc ggcaggccct gccatagcca ctggccccgt 10500 gggttaggga cggggtcccc catggggaat ggtttatggt tcgtgggggt tattattttg 10560 ggcgttgcgt ggggtcaggt ccacgactgg actgagcaga cagacccatg gtttttggat 10620 ggcctgggca tggaccgcat gtactggcgc gacacgaaca ccgggcgtct gtggctgcca 10680 aacacccccg acccccaaaa accaccgcgc ggatttctgg cgtgccaagc tagtcgacca 10740 attctcatgt ttgacagctt atcatcgcag atccgggcaa cgttgttgcc attgctgcag 10800 gcgcagaact ggtaggtatg gaagatccat acattgaatc aatattggca attagccata 10860 ttagtcattg gttatatagc ataaatcaat attggctatt ggccattgca tacgttgtat 10920 ctatatcata atatgtacat ttatattggc tcatgtccaa tatgaccgcc at 10972 <210> 157 <211> 10969 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 157 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta tcatatgcca agtccgcccc ctattgacgt caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttac gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacaccaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa taaccccgcc ccgttgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 600 gtcagatcac tagaagctgg gtaccagctg ctagcgttta aacttaagct tagcgcagag 660 gcttggggca gccgagcggc agccaggccc cggcccgggc ctcggttcca gaagggagag 720 gagcccgcca aggcgcgcaa gagagcgggc tgcctcgcag tccgagccgg agagggagcg 780 cgagccgcgc cggccccgga cggcctccga aaccatggag ctgtgggggg cctacctgct 840 gctgtgcctg ttctccctgc tgacccaggt gaccaccgag ccaccaaccc agaagcccaa 900 gaagattgta aatgccaaga aagatgttgt gaacacaaag atgtttgagg agctcaagag 960 ccgtctggac accctggccc aggaggtggc cctgctgaag gagcagcagg ccctccagtg 1020 cctgaagggg accaaggtgc acatgaaatg ctttctggcc ttcacccaga cgaagacctt 1080 ccacgaggcc agcgaggact gcatctcgcg cgggggcacc ctgagcaccc ctcagactgg 1140 ctcggagaac gacgccctgt atgagtacct gcgccagagc gtgggcaacg aggccgagat 1200 ctggctgggc ctcaacgaca tggcggccga gggcacctgg gtggacatga ccggtacccg 1260 catcgcctac aagaactggg agactgagat caccgcgcaa cccgatggcg gcaagaccga 1320 gaactgcgcg gtcctgtcag gcgcggccaa cggcaagtgg ttcgacaagc gctgcaggga 1380 tcaattgccc tacatctgcc agttcgggat cgtgcaccac caccaccacc actaactcga 1440 ggccggcaag gccggatcca gacatgataa gatacattga tgagtttgga caaaccacaa 1500 ctagaatgca gtgaaaaaaa tgctttattt gtgaaatttg tgatgctatt gctttatttg 1560 taaccattat aagctgcaat aaacaagtta acaacaagaa ttgcattcat tttatgtttc 1620 aggttcaggg ggaggtgtgg gaggtttttt aaagcaagta aaacctctac aaatgtggta 1680 tggctgatta tgatccggct gcctcgcgcg tttcggtgat gacggtgaaa acctctgaca 1740 catgcagctc ccggagacgg tcacagcttg tctgtaagcg gatgccggga gcagacaagc 1800 ccgtcaggcg tcagcgggtg ttggcgggtg tcggggcgca gccatgaggt cgactctaga 1860 ggatcgatgc cccgccccgg acgaactaaa cctgactacg acatctctgc cccttcttcg 1920 cggggcagtg catgtaatcc cttcagttgg ttggtacaac ttgccaactg ggccctgttc 1980 cacatgtgac acgggggggg accaaacaca aaggggttct ctgactgtag ttgacatcct 2040 tataaatgga tgtgcacatt tgccaacact gagtggcttt catcctggag cagactttgc 2100 agtctgtgga ctgcaacaca acattgcctt tatgtgtaac tcttggctga agctcttaca 2160 ccaatgctgg gggacatgta cctcccaggg gcccaggaag actacgggag gctacaccaa 2220 cgtcaatcag aggggcctgt gtagctaccg ataagcggac cctcaagagg gcattagcaa 2280 tagtgtttat aaggccccct tgttaaccct aaacgggtag catatgcttc ccgggtagta 2340 gtatatacta tccagactaa ccctaattca atagcatatg ttacccaacg ggaagcatat 2400 gctatcgaat tagggttagt aaaagggtcc taaggaacag cgatatctcc caccccatga 2460 gctgtcacgg ttttatttac atggggtcag gattccacga gggtagtgaa ccattttagt 2520 cacaagggca gtggctgaag atcaaggagc gggcagtgaa ctctcctgaa tcttcgcctg 2580 cttcttcatt ctccttcgtt tagctaatag aataactgct gagttgtgaa cagtaaggtg 2640 tatgtgaggt gctcgaaaac aaggtttcag gtgacgcccc cagaataaaa tttggacggg 2700 gggttcagtg gtggcattgt gctatgacac caatataacc ctcacaaacc ccttgggcaa 2760 taaatactag tgtaggaatg aaacattctg aatatcttta acaatagaaa tccatggggt 2820 ggggacaagc cgtaaagact ggatgtccat ctcacacgaa tttatggcta tgggcaacac 2880 ataatcctag tgcaatatga tactggggtt attaagatgt gtcccaggca gggaccaaga 2940 caggtgaacc atgttgttac actctatttg taacaagggg aaagagagtg gacgccgaca 3000 gcagcggact ccactggttg tctctaacac ccccgaaaat taaacggggc tccacgccaa 3060 tggggcccat aaacaaagac aagtggccac tctttttttt gaaattgtgg agtgggggca 3120 cgcgtcagcc cccacacgcc gccctgcggt tttggactgt aaaataaggg tgtaataact 3180 tggctgattg taaccccgct aaccactgcg gtcaaaccac ttgcccacaa aaccactaat 3240 ggcaccccgg ggaatacctg cataagtagg tgggcgggcc aagatagggg cgcgattgct 3300 gcgatctgga ggacaaatta cacacacttg cgcctgagcg ccaagcacag ggttgttggt 3360 cctcatattc acgaggtcgc tgagagcacg gtgggctaat gttgccatgg gtagcatata 3420 ctacccaaat atctggatag catatgctat cctaatctat atctgggtag cataggctat 3480 cctaatctat atctgggtag catatgctat cctaatctat atctgggtag tatatgctat 3540 cctaatttat atctgggtag cataggctat cctaatctat atctgggtag catatgctat 3600 cctaatctat atctgggtag tatatgctat cctaatctgt atccgggtag catatgctat 3660 cctaatagag attagggtag tatatgctat cctaatttat atctgggtag catatactac 3720 ccaaatatct ggatagcata tgctatccta atctatatct gggtagcata tgctatccta 3780 atctatatct gggtagcata ggctatccta atctatatct gggtagcata tgctatccta 3840 atctatatct gggtagtata tgctatccta atttatatct gggtagcata ggctatccta 3900 atctatatct gggtagcata tgctatccta atctatatct gggtagtata tgctatccta 3960 atctgtatcc gggtagcata tgctatcctc atgcatatac agtcagcata tgatacccag 4020 tagtagagtg ggagtgctat cctttgcata tgccgccacc tcccaagggg gcgtgaattt 4080 tcgctgcttg tccttttcct gctggttgct cccattctta ggtgaattta aggaggccag 4140 gctaaagccg tcgcatgtct gattgctcac caggtaaatg tcgctaatgt tttccaacgc 4200 gagaaggtgt tgagcgcgga gctgagtgac gtgacaacat gggtatgccg aattgcccca 4260 tgttgggagg acgaaaatgg tgacaagaca gatggccaga aatacaccaa cagcacgcat 4320 gatgtctact ggggatttat tctttagtgc gggggaatac acggctttta atacgattga 4380 gggcgtctcc taacaagtta catcactcct gcccttcctc accctcatct ccatcacctc 4440 cttcatctcc gtcatctccg tcatcaccct ccgcggcagc cccttccacc ataggtggaa 4500 accagggagg caaatctact ccatcgtcaa agctgcacac agtcaccctg atattgcagg 4560 taggagcggg ctttgtcata acaaggtcct taatcgcatc cttcaaaacc tcagcaaata 4620 tatgagtttg taaaaagacc atgaaataac agacaatgga ctcccttagc gggccaggtt 4680 gtgggccggg tccaggggcc attccaaagg ggagacgact caatggtgta agacgacatt 4740 gtggaatagc aagggcagtt cctcgcctta ggttgtaaag ggaggtctta ctacctccat 4800 atacgaacac accggcgacc caagttcctt cgtcggtagt cctttctacg tgactcctag 4860 ccaggagagc tcttaaacct tctgcaatgt tctcaaattt cgggttggaa cctccttgac 4920 cacgatgctt tccaaaccac cctccttttt tgcgcctgcc tccatcaccc tgaccccggg 4980 gtccagtgct tgggccttct cctgggtcat ctgcggggcc ctgctctatc gctcccgggg 5040 gcacgtcagg ctcaccatct gggccacctt cttggtggta ttcaaaataa tcggcttccc 5100 ctacagggtg gaaaaatggc cttctacctg gagggggcct gcgcggtgga gacccggatg 5160 atgatgactg actactggga ctcctgggcc tcttttctcc acgtccacga cctctccccc 5220 tggctctttc acgacttccc cccctggctc tttcacgtcc tctaccccgg cggcctccac 5280 tacctcctcg accccggcct ccactacctc ctcgaccccg gcctccactg cctcctcgac 5340 cccggcctcc acctcctgct cctgcccctc ctgctcctgc ccctcctcct gctcctgccc 5400 ctcctgcccc tcctgctcct gcccctcctg cccctcctgc tcctgcccct cctgcccctc 5460 ctgctcctgc ccctcctgcc cctcctcctg ctcctgcccc tcctgcccct cctcctgctc 5520 ctgcccctcc tgcccctcct gctcctgccc ctcctgcccc tcctgctcct gcccctcctg 5580 cccctcctgc tcctgcccct cctgctcctg cccctcctgc tcctgcccct cctgctcctg 5640 cccctcctgc ccctcctgcc cctcctcctg ctcctgcccc tcctgctcct gcccctcctg 5700 cccctcctgc ccctcctgct cctgcccctc ctcctgctcc tgcccctcct gcccctcctg 5760 cccctcctcc tgctcctgcc cctcctgccc ctcctcctgc tcctgcccct cctcctgctc 5820 ctgcccctcc tgcccctcct gcccctcctc ctgctcctgc ccctcctgcc cctcctcctg 5880 ctcctgcccc tcctcctgct cctgcccctc ctgcccctcc tgcccctcct cctgctcctg 5940 cccctcctcc tgctcctgcc cctcctgccc ctcctgcccc tcctgcccct cctcctgctc 6000 ctgcccctcc tcctgctcct gcccctcctg ctcctgcccc tcccgctcct gctcctgctc 6060 ctgttccacc gtgggtccct ttgcagccaa tgcaacttgg acgtttttgg ggtctccgga 6120 caccatctct atgtcttggc cctgatcctg agccgcccgg ggctcctggt cttccgcctc 6180 ctcgtcctcg tcctcttccc cgtcctcgtc catggttatc accccctctt ctttgaggtc 6240 cactgccgcc ggagccttct ggtccagatg tgtctccctt ctctcctagg ccatttccag 6300 gtcctgtacc tggcccctcg tcagacatga ttcacactaa aagagatcaa tagacatctt 6360 tattagacga cgctcagtga atacagggag tgcagactcc tgccccctcc aacagccccc 6420 ccaccctcat ccccttcatg gtcgctgtca gacagatcca ggtctgaaaa ttccccatcc 6480 tccgaaccat cctcgtcctc atcaccaatt actcgcagcc cggaaaactc ccgctgaaca 6540 tcctcaagat ttgcgtcctg agcctcaagc caggcctcaa attcctcgtc cccctttttg 6600 ctggacggta gggatgggga ttctcgggac ccctcctctt cctcttcaag gtcaccagac 6660 agagatgcta ctggggcaac ggaagaaaag ctgggtgcgg cctgtgagga tcagcttatc 6720 gatgataagc tgtcaaacat gagaattctt gaagacgaaa gggcctcgtg atacgcctat 6780 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg 6840 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc 6900 tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta 6960 ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg 7020 ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg 7080 gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac 7140 gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtgttg 7200 acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt 7260 actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg 7320 ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac 7380 cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt 7440 gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgcag 7500 caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc 7560 aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc 7620 ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta 7680 tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg 7740 ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga 7800 ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac 7860 ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa 7920 tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat 7980 cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc 8040 taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg 8100 gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc 8160 acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg 8220 ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg 8280 ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa 8340 cgacctacac cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgcttcccg 8400 aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga 8460 gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct 8520 gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca 8580 gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttgaagctgt ccctgatggt 8640 cgtcatctac ctgcctggac agcatggcct gcaacgcggg catcccgatg ccgccggaag 8700 cgagaagaat cataatgggg aaggccatcc agcctcgcgt cgcgaacgcc agcaagacgt 8760 agcccagcgc gtcggccccg agatgcgccg cgtgcggctg ctggagatgg cggacgcgat 8820 ggatatgttc tgccaagggt tggtttgcgc attcacagtt ctccgcaaga attgattggc 8880 tccaattctt ggagtggtga atccgttagc gaggtgccgc cctgcttcat ccccgtggcc 8940 cgttgctcgc gtttgctggc ggtgtccccg gaagaaatat atttgcatgt ctttagttct 9000 atgatgacac aaaccccgcc cagcgtcttg tcattggcga attcgaacac gcagatgcag 9060 tcggggcggc gcggtccgag gtccacttcg catattaagg tgacgcgtgt ggcctcgaac 9120 accgagcgac cctgcagcga cccgcttaac agcgtcaaca gcgtgccgca gatcccgggg 9180 ggcaatgaga tatgaaaaag cctgaactca ccgcgacgtc tgtcgagaag tttctgatcg 9240 aaaagttcga cagcgtctcc gacctgatgc agctctcgga gggcgaagaa tctcgtgctt 9300 tcagcttcga tgtaggaggg cgtggatatg tcctgcgggt aaatagctgc gccgatggtt 9360 tctacaaaga tcgttatgtt tatcggcact ttgcatcggc cgcgctcccg attccggaag 9420 tgcttgacat tggggaattc agcgagagcc tgacctattg catctcccgc cgtgcacagg 9480 gtgtcacgtt gcaagacctg cctgaaaccg aactgcccgc tgttctgcag ccggtcgcgg 9540 aggccatgga tgcgatcgct gcggccgatc ttagccagac gagcgggttc ggcccattcg 9600 gaccgcaagg aatcggtcaa tacactacat ggcgtgattt catatgcgcg attgctgatc 9660 cccatgtgta tcactggcaa actgtgatgg acgacaccgt cagtgcgtcc gtcgcgcagg 9720 ctctcgatga gctgatgctt tgggccgagg actgccccga agtccggcac ctcgtgcacg 9780 cggatttcgg ctccaacaat gtcctgacgg acaatggccg cataacagcg gtcattgact 9840 ggagcgaggc gatgttcggg gattcccaat acgaggtcgc caacatcttc ttctggaggc 9900 cgtggttggc ttgtatggag cagcagacgc gctacttcga gcggaggcat ccggagcttg 9960 caggatcgcc gcggctccgg gcgtatatgc tccgcattgg tcttgaccaa ctctatcaga 10020 gcttggttga cggcaatttc gatgatgcag cttgggcgca gggtcgatgc gacgcaatcg 10080 tccgatccgg agccgggact gtcgggcgta cacaaatcgc ccgcagaagc gcggccgtct 10140 ggaccgatgg ctgtgtagaa gtactcgccg atagtggaaa ccgacgcccc agcactcgtc 10200 cggatcggga gatgggggag gctaactgaa acacggaagg agacaatacc ggaaggaacc 10260 cgcgctatga cggcaataaa aagacagaat aaaacgcacg ggtgttgggt cgtttgttca 10320 taaacgcggg gttcggtccc agggctggca ctctgtcgat accccaccga gaccccattg 10380 gggccaatac gcccgcgttt cttccttttc cccaccccac cccccaagtt cgggtgaagg 10440 cccagggctc gcagccaacg tcggggcggc aggccctgcc atagccactg gccccgtggg 10500 ttagggacgg ggtcccccat ggggaatggt ttatggttcg tgggggttat tattttgggc 10560 gttgcgtggg gtcaggtcca cgactggact gagcagacag acccatggtt tttggatggc 10620 ctgggcatgg accgcatgta ctggcgcgac acgaacaccg ggcgtctgtg gctgccaaac 10680 acccccgacc cccaaaaacc accgcgcgga tttctggcgt gccaagctag tcgaccaatt 10740 ctcatgtttg acagcttatc atcgcagatc cgggcaacgt tgttgccatt gctgcaggcg 10800 cagaactggt aggtatggaa gatccataca ttgaatcaat attggcaatt agccatatta 10860 gtcattggtt atatagcata aatcaatatt ggctattggc cattgcatac gttgtatcta 10920 tatcataata tgtacattta tattggctca tgtccaatat gaccgccat 10969 <210> 158 <211> 10975 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 158 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta tcatatgcca agtccgcccc ctattgacgt caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttac gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacaccaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa taaccccgcc ccgttgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 600 gtcagatcac tagaagctgg gtaccagctg ctagcgttta aacttaagct tagcgcagag 660 gcttggggca gccgagcggc agccaggccc cggcccgggc ctcggttcca gaagggagag 720 gagcccgcca aggcgcgcaa gagagcgggc tgcctcgcag tccgagccgg agagggagcg 780 cgagccgcgc cggccccgga cggcctccga aaccatggag ctgtgggggg cctacctgct 840 gctgtgcctg ttctccctgc tgacccaggt gaccaccgag ccaccaaccc agaagcccaa 900 gaagattgta aatgccaaga aagatgttgt gaacacaaag atgtttgagg agctcaagag 960 ccgtctggac accctggccc aggaggtggc cctgctgaag gagcagcagg ccctccagac 1020 ggtcagcctg aaggggacca aggtgcacat gaaaagcttt ctggccttca cccagacgaa 1080 gaccttccac gaggccagcg aggactgcat ctcgcgcggg ggcaccctga gcacccctca 1140 gactggctcg gagaacgacg ccctgtatga gtacctgcgc cagagcgtgg gcaacgaggc 1200 cgagatctgg ctgggcctca acgacatggc ggccgagggc acctgggtgg acatgaccgg 1260 tacccgcatc gcctacaaga actgggagac tgagatcacc gcgcaacccg atggcggcaa 1320 gaccgagaac tgcgcggtcc tgtcaggcgc ggccaacggc aagtggttcg acaagcgctg 1380 cagggatcaa ttgccctaca tctgccagtt cgggatcgtg caccaccacc accaccacta 1440 actcgaggcc ggcaaggccg gatccagaca tgataagata cattgatgag tttggacaaa 1500 ccacaactag aatgcagtga aaaaaatgct ttatttgtga aatttgtgat gctattgctt 1560 tatttgtaac cattataagc tgcaataaac aagttaacaa caagaattgc attcatttta 1620 tgtttcaggt tcagggggag gtgtgggagg ttttttaaag caagtaaaac ctctacaaat 1680 gtggtatggc tgattatgat ccggctgcct cgcgcgtttc ggtgatgacg gtgaaaacct 1740 ctgacacatg cagctcccgg agacggtcac agcttgtctg taagcggatg ccgggagcag 1800 acaagcccgt caggcgtcag cgggtgttgg cgggtgtcgg ggcgcagcca tgaggtcgac 1860 tctagaggat cgatgccccg ccccggacga actaaacctg actacgacat ctctgcccct 1920 tcttcgcggg gcagtgcatg taatcccttc agttggttgg tacaacttgc caactgggcc 1980 ctgttccaca tgtgacacgg ggggggacca aacacaaagg ggttctctga ctgtagttga 2040 catccttata aatggatgtg cacatttgcc aacactgagt ggctttcatc ctggagcaga 2100 ctttgcagtc tgtggactgc aacacaacat tgcctttatg tgtaactctt ggctgaagct 2160 cttacaccaa tgctggggga catgtacctc ccaggggccc aggaagacta cgggaggcta 2220 caccaacgtc aatcagaggg gcctgtgtag ctaccgataa gcggaccctc aagagggcat 2280 tagcaatagt gtttataagg cccccttgtt aaccctaaac gggtagcata tgcttcccgg 2340 gtagtagtat atactatcca gactaaccct aattcaatag catatgttac ccaacgggaa 2400 gcatatgcta tcgaattagg gttagtaaaa gggtcctaag gaacagcgat atctcccacc 2460 ccatgagctg tcacggtttt atttacatgg ggtcaggatt ccacgagggt agtgaaccat 2520 tttagtcaca agggcagtgg ctgaagatca aggagcgggc agtgaactct cctgaatctt 2580 cgcctgcttc ttcattctcc ttcgtttagc taatagaata actgctgagt tgtgaacagt 2640 aaggtgtatg tgaggtgctc gaaaacaagg tttcaggtga cgcccccaga ataaaatttg 2700 gacggggggt tcagtggtgg cattgtgcta tgacaccaat ataaccctca caaacccctt 2760 gggcaataaa tactagtgta ggaatgaaac attctgaata tctttaacaa tagaaatcca 2820 tggggtgggg acaagccgta aagactggat gtccatctca cacgaattta tggctatggg 2880 caacacataa tcctagtgca atatgatact ggggttatta agatgtgtcc caggcaggga 2940 ccaagacagg tgaaccatgt tgttacactc tatttgtaac aaggggaaag agagtggacg 3000 ccgacagcag cggactccac tggttgtctc taacaccccc gaaaattaaa cggggctcca 3060 cgccaatggg gcccataaac aaagacaagt ggccactctt ttttttgaaa ttgtggagtg 3120 ggggcacgcg tcagccccca cacgccgccc tgcggttttg gactgtaaaa taagggtgta 3180 ataacttggc tgattgtaac cccgctaacc actgcggtca aaccacttgc ccacaaaacc 3240 actaatggca ccccggggaa tacctgcata agtaggtggg cgggccaaga taggggcgcg 3300 attgctgcga tctggaggac aaattacaca cacttgcgcc tgagcgccaa gcacagggtt 3360 gttggtcctc atattcacga ggtcgctgag agcacggtgg gctaatgttg ccatgggtag 3420 catatactac ccaaatatct ggatagcata tgctatccta atctatatct gggtagcata 3480 ggctatccta atctatatct gggtagcata tgctatccta atctatatct gggtagtata 3540 tgctatccta atttatatct gggtagcata ggctatccta atctatatct gggtagcata 3600 tgctatccta atctatatct gggtagtata tgctatccta atctgtatcc gggtagcata 3660 tgctatccta atagagatta gggtagtata tgctatccta atttatatct gggtagcata 3720 tactacccaa atatctggat agcatatgct atcctaatct atatctgggt agcatatgct 3780 atcctaatct atatctgggt agcataggct atcctaatct atatctgggt agcatatgct 3840 atcctaatct atatctgggt agtatatgct atcctaattt atatctgggt agcataggct 3900 atcctaatct atatctgggt agcatatgct atcctaatct atatctgggt agtatatgct 3960 atcctaatct gtatccgggt agcatatgct atcctcatgc atatacagtc agcatatgat 4020 acccagtagt agagtgggag tgctatcctt tgcatatgcc gccacctccc aagggggcgt 4080 gaattttcgc tgcttgtcct tttcctgctg gttgctccca ttcttaggtg aatttaagga 4140 ggccaggcta aagccgtcgc atgtctgatt gctcaccagg taaatgtcgc taatgttttc 4200 caacgcgaga aggtgttgag cgcggagctg agtgacgtga caacatgggt atgccgaatt 4260 gccccatgtt gggaggacga aaatggtgac aagacagatg gccagaaata caccaacagc 4320 acgcatgatg tctactgggg atttattctt tagtgcgggg gaatacacgg cttttaatac 4380 gattgagggc gtctcctaac aagttacatc actcctgccc ttcctcaccc tcatctccat 4440 cacctccttc atctccgtca tctccgtcat caccctccgc ggcagcccct tccaccatag 4500 gtggaaacca gggaggcaaa tctactccat cgtcaaagct gcacacagtc accctgatat 4560 tgcaggtagg agcgggcttt gtcataacaa ggtccttaat cgcatccttc aaaacctcag 4620 caaatatatg agtttgtaaa aagaccatga aataacagac aatggactcc cttagcgggc 4680 caggttgtgg gccgggtcca ggggccattc caaaggggag acgactcaat ggtgtaagac 4740 gacattgtgg aatagcaagg gcagttcctc gccttaggtt gtaaagggag gtcttactac 4800 ctccatatac gaacacaccg gcgacccaag ttccttcgtc ggtagtcctt tctacgtgac 4860 tcctagccag gagagctctt aaaccttctg caatgttctc aaatttcggg ttggaacctc 4920 cttgaccacg atgctttcca aaccaccctc cttttttgcg cctgcctcca tcaccctgac 4980 cccggggtcc agtgcttggg ccttctcctg ggtcatctgc ggggccctgc tctatcgctc 5040 ccgggggcac gtcaggctca ccatctgggc caccttcttg gtggtattca aaataatcgg 5100 cttcccctac agggtggaaa aatggccttc tacctggagg gggcctgcgc ggtggagacc 5160 cggatgatga tgactgacta ctgggactcc tgggcctctt ttctccacgt ccacgacctc 5220 tccccctggc tctttcacga cttccccccc tggctctttc acgtcctcta ccccggcggc 5280 ctccactacc tcctcgaccc cggcctccac tacctcctcg accccggcct ccactgcctc 5340 ctcgaccccg gcctccacct cctgctcctg cccctcctgc tcctgcccct cctcctgctc 5400 ctgcccctcc tgcccctcct gctcctgccc ctcctgcccc tcctgctcct gcccctcctg 5460 cccctcctgc tcctgcccct cctgcccctc ctcctgctcc tgcccctcct gcccctcctc 5520 ctgctcctgc ccctcctgcc cctcctgctc ctgcccctcc tgcccctcct gctcctgccc 5580 ctcctgcccc tcctgctcct gcccctcctg ctcctgcccc tcctgctcct gcccctcctg 5640 ctcctgcccc tcctgcccct cctgcccctc ctcctgctcc tgcccctcct gctcctgccc 5700 ctcctgcccc tcctgcccct cctgctcctg cccctcctcc tgctcctgcc cctcctgccc 5760 ctcctgcccc tcctcctgct cctgcccctc ctgcccctcc tcctgctcct gcccctcctc 5820 ctgctcctgc ccctcctgcc cctcctgccc ctcctcctgc tcctgcccct cctgcccctc 5880 ctcctgctcc tgcccctcct cctgctcctg cccctcctgc ccctcctgcc cctcctcctg 5940 ctcctgcccc tcctcctgct cctgcccctc ctgcccctcc tgcccctcct gcccctcctc 6000 ctgctcctgc ccctcctcct gctcctgccc ctcctgctcc tgcccctccc gctcctgctc 6060 ctgctcctgt tccaccgtgg gtccctttgc agccaatgca acttggacgt ttttggggtc 6120 tccggacacc atctctatgt cttggccctg atcctgagcc gcccggggct cctggtcttc 6180 cgcctcctcg tcctcgtcct cttccccgtc ctcgtccatg gttatcaccc cctcttcttt 6240 gaggtccact gccgccggag ccttctggtc cagatgtgtc tcccttctct cctaggccat 6300 ttccaggtcc tgtacctggc ccctcgtcag acatgattca cactaaaaga gatcaataga 6360 catctttatt agacgacgct cagtgaatac agggagtgca gactcctgcc ccctccaaca 6420 gcccccccac cctcatcccc ttcatggtcg ctgtcagaca gatccaggtc tgaaaattcc 6480 ccatcctccg aaccatcctc gtcctcatca ccaattactc gcagcccgga aaactcccgc 6540 tgaacatcct caagatttgc gtcctgagcc tcaagccagg cctcaaattc ctcgtccccc 6600 tttttgctgg acggtaggga tggggattct cgggacccct cctcttcctc ttcaaggtca 6660 ccagacagag atgctactgg ggcaacggaa gaaaagctgg gtgcggcctg tgaggatcag 6720 cttatcgatg ataagctgtc aaacatgaga attcttgaag acgaaagggc ctcgtgatac 6780 gcctattttt ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt 6840 ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt 6900 atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta 6960 tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg 7020 tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac 7080 gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg 7140 aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc 7200 gtgttgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg 7260 ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat 7320 gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg 7380 gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg 7440 atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc 7500 ctgcagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt 7560 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct 7620 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc 7680 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca 7740 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct 7800 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt 7860 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga 7920 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca 7980 aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac 8040 caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg 8100 taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag 8160 gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac 8220 cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt 8280 taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg 8340 agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc 8400 ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc 8460 gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc 8520 acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa 8580 acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggccttga agctgtccct 8640 gatggtcgtc atctacctgc ctggacagca tggcctgcaa cgcgggcatc ccgatgccgc 8700 cggaagcgag aagaatcata atggggaagg ccatccagcc tcgcgtcgcg aacgccagca 8760 agacgtagcc cagcgcgtcg gccccgagat gcgccgcgtg cggctgctgg agatggcgga 8820 cgcgatggat atgttctgcc aagggttggt ttgcgcattc acagttctcc gcaagaattg 8880 attggctcca attcttggag tggtgaatcc gttagcgagg tgccgccctg cttcatcccc 8940 gtggcccgtt gctcgcgttt gctggcggtg tccccggaag aaatatattt gcatgtcttt 9000 agttctatga tgacacaaac cccgcccagc gtcttgtcat tggcgaattc gaacacgcag 9060 atgcagtcgg ggcggcgcgg tccgaggtcc acttcgcata ttaaggtgac gcgtgtggcc 9120 tcgaacaccg agcgaccctg cagcgacccg cttaacagcg tcaacagcgt gccgcagatc 9180 ccggggggca atgagatatg aaaaagcctg aactcaccgc gacgtctgtc gagaagtttc 9240 tgatcgaaaa gttcgacagc gtctccgacc tgatgcagct ctcggagggc gaagaatctc 9300 gtgctttcag cttcgatgta ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg 9360 atggtttcta caaagatcgt tatgtttatc ggcactttgc atcggccgcg ctcccgattc 9420 cggaagtgct tgacattggg gaattcagcg agagcctgac ctattgcatc tcccgccgtg 9480 cacagggtgt cacgttgcaa gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg 9540 tcgcggaggc catggatgcg atcgctgcgg ccgatcttag ccagacgagc gggttcggcc 9600 cattcggacc gcaaggaatc ggtcaataca ctacatggcg tgatttcata tgcgcgattg 9660 ctgatcccca tgtgtatcac tggcaaactg tgatggacga caccgtcagt gcgtccgtcg 9720 cgcaggctct cgatgagctg atgctttggg ccgaggactg ccccgaagtc cggcacctcg 9780 tgcacgcgga tttcggctcc aacaatgtcc tgacggacaa tggccgcata acagcggtca 9840 ttgactggag cgaggcgatg ttcggggatt cccaatacga ggtcgccaac atcttcttct 9900 ggaggccgtg gttggcttgt atggagcagc agacgcgcta cttcgagcgg aggcatccgg 9960 agcttgcagg atcgccgcgg ctccgggcgt atatgctccg cattggtctt gaccaactct 10020 atcagagctt ggttgacggc aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg 10080 caatcgtccg atccggagcc gggactgtcg ggcgtacaca aatcgcccgc agaagcgcgg 10140 ccgtctggac cgatggctgt gtagaagtac tcgccgatag tggaaaccga cgccccagca 10200 ctcgtccgga tcgggagatg ggggaggcta actgaaacac ggaaggagac aataccggaa 10260 ggaacccgcg ctatgacggc aataaaaaga cagaataaaa cgcacgggtg ttgggtcgtt 10320 tgttcataaa cgcggggttc ggtcccaggg ctggcactct gtcgataccc caccgagacc 10380 ccattggggc caatacgccc gcgtttcttc cttttcccca ccccaccccc caagttcggg 10440 tgaaggccca gggctcgcag ccaacgtcgg ggcggcaggc cctgccatag ccactggccc 10500 cgtgggttag ggacggggtc ccccatgggg aatggtttat ggttcgtggg ggttattatt 10560 ttgggcgttg cgtggggtca ggtccacgac tggactgagc agacagaccc atggtttttg 10620 gatggcctgg gcatggaccg catgtactgg cgcgacacga acaccgggcg tctgtggctg 10680 ccaaacaccc ccgaccccca aaaaccaccg cgcggatttc tggcgtgcca agctagtcga 10740 ccaattctca tgtttgacag cttatcatcg cagatccggg caacgttgtt gccattgctg 10800 caggcgcaga actggtaggt atggaagatc catacattga atcaatattg gcaattagcc 10860 atattagtca ttggttatat agcataaatc aatattggct attggccatt gcatacgttg 10920 tatctatatc ataatatgta catttatatt ggctcatgtc caatatgacc gccat 10975 <210> 159 <211> 10927 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 159 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta tcatatgcca agtccgcccc ctattgacgt caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttac gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacaccaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa taaccccgcc ccgttgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 600 gtcagatcac tagaagctgg gtaccagctg ctagcgttta aacttaagct tagcgcagag 660 gcttggggca gccgagcggc agccaggccc cggcccgggc ctcggttcca gaagggagag 720 gagcccgcca aggcgcgcaa gagagcgggc tgcctcgcag tccgagccgg agagggagcg 780 cgagccgcgc cggccccgga cggcctccga aaccatggag ctgtgggggg cctacctgct 840 gctgtgcctg ttctccctgc tgacccaggt gaccaccgtt gtgaacacaa agatgtttga 900 ggagctcaag agccgtctgg acaccctggc ccaggaggtg gccctgctga aggagcagca 960 ggccctccag acggtctgcc tgaaggggac caaggtgcac atgaaatgct ttctggcctt 1020 cacccagacg aagaccttcc acgaggccag cgaggactgc atctcgcgcg ggggcaccct 1080 gagcacccct cagactggct cggagaacga cgccctgtat gagtacctgc gccagagcgt 1140 gggcaacgag gccgagatct ggctgggcct caacgacatg gcggccgagg gcacctgggt 1200 ggacatgacc ggtacccgca tcgcctacaa gaactgggag actgagatca ccgcgcaacc 1260 cgatggcggc aagaccgaga actgcgcggt cctgtcaggc gcggccaacg gcaagtggtt 1320 cgacaagcgc tgcagggatc aattgcccta catctgccag ttcgggatcg tgcaccacca 1380 ccaccaccac taactcgagg ccggcaaggc cggatccaga catgataaga tacattgatg 1440 agtttggaca aaccacaact agaatgcagt gaaaaaaatg ctttatttgt gaaatttgtg 1500 atgctattgc tttatttgta accattataa gctgcaataa acaagttaac aacaagaatt 1560 gcattcattt tatgtttcag gttcaggggg aggtgtggga ggttttttaa agcaagtaaa 1620 acctctacaa atgtggtatg gctgattatg atccggctgc ctcgcgcgtt tcggtgatga 1680 cggtgaaaac ctctgacaca tgcagctccc ggagacggtc acagcttgtc tgtaagcgga 1740 tgccgggagc agacaagccc gtcaggcgtc agcgggtgtt ggcgggtgtc ggggcgcagc 1800 catgaggtcg actctagagg atcgatgccc cgccccggac gaactaaacc tgactacgac 1860 atctctgccc cttcttcgcg gggcagtgca tgtaatccct tcagttggtt ggtacaactt 1920 gccaactggg ccctgttcca catgtgacac ggggggggac caaacacaaa ggggttctct 1980 gactgtagtt gacatcctta taaatggatg tgcacatttg ccaacactga gtggctttca 2040 tcctggagca gactttgcag tctgtggact gcaacacaac attgccttta tgtgtaactc 2100 ttggctgaag ctcttacacc aatgctgggg gacatgtacc tcccaggggc ccaggaagac 2160 tacgggaggc tacaccaacg tcaatcagag gggcctgtgt agctaccgat aagcggaccc 2220 tcaagagggc attagcaata gtgtttataa ggcccccttg ttaaccctaa acgggtagca 2280 tatgcttccc gggtagtagt atatactatc cagactaacc ctaattcaat agcatatgtt 2340 acccaacggg aagcatatgc tatcgaatta gggttagtaa aagggtccta aggaacagcg 2400 atatctccca ccccatgagc tgtcacggtt ttatttacat ggggtcagga ttccacgagg 2460 gtagtgaacc attttagtca caagggcagt ggctgaagat caaggagcgg gcagtgaact 2520 ctcctgaatc ttcgcctgct tcttcattct ccttcgttta gctaatagaa taactgctga 2580 gttgtgaaca gtaaggtgta tgtgaggtgc tcgaaaacaa ggtttcaggt gacgccccca 2640 gaataaaatt tggacggggg gttcagtggt ggcattgtgc tatgacacca atataaccct 2700 cacaaacccc ttgggcaata aatactagtg taggaatgaa acattctgaa tatctttaac 2760 aatagaaatc catggggtgg ggacaagccg taaagactgg atgtccatct cacacgaatt 2820 tatggctatg ggcaacacat aatcctagtg caatatgata ctggggttat taagatgtgt 2880 cccaggcagg gaccaagaca ggtgaaccat gttgttacac tctatttgta acaaggggaa 2940 agagagtgga cgccgacagc agcggactcc actggttgtc tctaacaccc ccgaaaatta 3000 aacggggctc cacgccaatg gggcccataa acaaagacaa gtggccactc ttttttttga 3060 aattgtggag tgggggcacg cgtcagcccc cacacgccgc cctgcggttt tggactgtaa 3120 aataagggtg taataacttg gctgattgta accccgctaa ccactgcggt caaaccactt 3180 gcccacaaaa ccactaatgg caccccgggg aatacctgca taagtaggtg ggcgggccaa 3240 gataggggcg cgattgctgc gatctggagg acaaattaca cacacttgcg cctgagcgcc 3300 aagcacaggg ttgttggtcc tcatattcac gaggtcgctg agagcacggt gggctaatgt 3360 tgccatgggt agcatatact acccaaatat ctggatagca tatgctatcc taatctatat 3420 ctgggtagca taggctatcc taatctatat ctgggtagca tatgctatcc taatctatat 3480 ctgggtagta tatgctatcc taatttatat ctgggtagca taggctatcc taatctatat 3540 ctgggtagca tatgctatcc taatctatat ctgggtagta tatgctatcc taatctgtat 3600 ccgggtagca tatgctatcc taatagagat tagggtagta tatgctatcc taatttatat 3660 ctgggtagca tatactaccc aaatatctgg atagcatatg ctatcctaat ctatatctgg 3720 gtagcatatg ctatcctaat ctatatctgg gtagcatagg ctatcctaat ctatatctgg 3780 gtagcatatg ctatcctaat ctatatctgg gtagtatatg ctatcctaat ttatatctgg 3840 gtagcatagg ctatcctaat ctatatctgg gtagcatatg ctatcctaat ctatatctgg 3900 gtagtatatg ctatcctaat ctgtatccgg gtagcatatg ctatcctcat gcatatacag 3960 tcagcatatg atacccagta gtagagtggg agtgctatcc tttgcatatg ccgccacctc 4020 ccaagggggc gtgaattttc gctgcttgtc cttttcctgc tggttgctcc cattcttagg 4080 tgaatttaag gaggccaggc taaagccgtc gcatgtctga ttgctcacca ggtaaatgtc 4140 gctaatgttt tccaacgcga gaaggtgttg agcgcggagc tgagtgacgt gacaacatgg 4200 gtatgccgaa ttgccccatg ttgggaggac gaaaatggtg acaagacaga tggccagaaa 4260 tacaccaaca gcacgcatga tgtctactgg ggatttattc tttagtgcgg gggaatacac 4320 ggcttttaat acgattgagg gcgtctccta acaagttaca tcactcctgc ccttcctcac 4380 cctcatctcc atcacctcct tcatctccgt catctccgtc atcaccctcc gcggcagccc 4440 cttccaccat aggtggaaac cagggaggca aatctactcc atcgtcaaag ctgcacacag 4500 tcaccctgat attgcaggta ggagcgggct ttgtcataac aaggtcctta atcgcatcct 4560 tcaaaacctc agcaaatata tgagtttgta aaaagaccat gaaataacag acaatggact 4620 cccttagcgg gccaggttgt gggccgggtc caggggccat tccaaagggg agacgactca 4680 atggtgtaag acgacattgt ggaatagcaa gggcagttcc tcgccttagg ttgtaaaggg 4740 aggtcttact acctccatat acgaacacac cggcgaccca agttccttcg tcggtagtcc 4800 tttctacgtg actcctagcc aggagagctc ttaaaccttc tgcaatgttc tcaaatttcg 4860 ggttggaacc tccttgacca cgatgctttc caaaccaccc tccttttttg cgcctgcctc 4920 catcaccctg accccggggt ccagtgcttg ggccttctcc tgggtcatct gcggggccct 4980 gctctatcgc tcccgggggc acgtcaggct caccatctgg gccaccttct tggtggtatt 5040 caaaataatc ggcttcccct acagggtgga aaaatggcct tctacctgga gggggcctgc 5100 gcggtggaga cccggatgat gatgactgac tactgggact cctgggcctc ttttctccac 5160 gtccacgacc tctccccctg gctctttcac gacttccccc cctggctctt tcacgtcctc 5220 taccccggcg gcctccacta cctcctcgac cccggcctcc actacctcct cgaccccggc 5280 ctccactgcc tcctcgaccc cggcctccac ctcctgctcc tgcccctcct gctcctgccc 5340 ctcctcctgc tcctgcccct cctgcccctc ctgctcctgc ccctcctgcc cctcctgctc 5400 ctgcccctcc tgcccctcct gctcctgccc ctcctgcccc tcctcctgct cctgcccctc 5460 ctgcccctcc tcctgctcct gcccctcctg cccctcctgc tcctgcccct cctgcccctc 5520 ctgctcctgc ccctcctgcc cctcctgctc ctgcccctcc tgctcctgcc cctcctgctc 5580 ctgcccctcc tgctcctgcc cctcctgccc ctcctgcccc tcctcctgct cctgcccctc 5640 ctgctcctgc ccctcctgcc cctcctgccc ctcctgctcc tgcccctcct cctgctcctg 5700 cccctcctgc ccctcctgcc cctcctcctg ctcctgcccc tcctgcccct cctcctgctc 5760 ctgcccctcc tcctgctcct gcccctcctg cccctcctgc ccctcctcct gctcctgccc 5820 ctcctgcccc tcctcctgct cctgcccctc ctcctgctcc tgcccctcct gcccctcctg 5880 cccctcctcc tgctcctgcc cctcctcctg ctcctgcccc tcctgcccct cctgcccctc 5940 ctgcccctcc tcctgctcct gcccctcctc ctgctcctgc ccctcctgct cctgcccctc 6000 ccgctcctgc tcctgctcct gttccaccgt gggtcccttt gcagccaatg caacttggac 6060 gtttttgggg tctccggaca ccatctctat gtcttggccc tgatcctgag ccgcccgggg 6120 ctcctggtct tccgcctcct cgtcctcgtc ctcttccccg tcctcgtcca tggttatcac 6180 cccctcttct ttgaggtcca ctgccgccgg agccttctgg tccagatgtg tctcccttct 6240 ctcctaggcc atttccaggt cctgtacctg gcccctcgtc agacatgatt cacactaaaa 6300 gagatcaata gacatcttta ttagacgacg ctcagtgaat acagggagtg cagactcctg 6360 ccccctccaa cagccccccc accctcatcc ccttcatggt cgctgtcaga cagatccagg 6420 tctgaaaatt ccccatcctc cgaaccatcc tcgtcctcat caccaattac tcgcagcccg 6480 gaaaactccc gctgaacatc ctcaagattt gcgtcctgag cctcaagcca ggcctcaaat 6540 tcctcgtccc cctttttgct ggacggtagg gatggggatt ctcgggaccc ctcctcttcc 6600 tcttcaaggt caccagacag agatgctact ggggcaacgg aagaaaagct gggtgcggcc 6660 tgtgaggatc agcttatcga tgataagctg tcaaacatga gaattcttga agacgaaagg 6720 gcctcgtgat acgcctattt ttataggtta atgtcatgat aataatggtt tcttagacgt 6780 caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac 6840 attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa 6900 aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat 6960 tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc 7020 agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga 7080 gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg 7140 cggtattatc ccgtgttgac gccgggcaag agcaactcgg tcgccgcata cactattctc 7200 agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag 7260 taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc 7320 tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg 7380 taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg 7440 acaccacgat gcctgcagca atggcaacaa cgttgcgcaa actattaact ggcgaactac 7500 ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac 7560 cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg 7620 agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg 7680 tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg 7740 agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac 7800 tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg 7860 ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg 7920 tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc 7980 aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc 8040 tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt 8100 agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc 8160 taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact 8220 caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac 8280 agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagctatgag 8340 aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg 8400 gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg 8460 tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga 8520 gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt 8580 gaagctgtcc ctgatggtcg tcatctacct gcctggacag catggcctgc aacgcgggca 8640 tcccgatgcc gccggaagcg agaagaatca taatggggaa ggccatccag cctcgcgtcg 8700 cgaacgccag caagacgtag cccagcgcgt cggccccgag atgcgccgcg tgcggctgct 8760 ggagatggcg gacgcgatgg atatgttctg ccaagggttg gtttgcgcat tcacagttct 8820 ccgcaagaat tgattggctc caattcttgg agtggtgaat ccgttagcga ggtgccgccc 8880 tgcttcatcc ccgtggcccg ttgctcgcgt ttgctggcgg tgtccccgga agaaatatat 8940 ttgcatgtct ttagttctat gatgacacaa accccgccca gcgtcttgtc attggcgaat 9000 tcgaacacgc agatgcagtc ggggcggcgc ggtccgaggt ccacttcgca tattaaggtg 9060 acgcgtgtgg cctcgaacac cgagcgaccc tgcagcgacc cgcttaacag cgtcaacagc 9120 gtgccgcaga tcccgggggg caatgagata tgaaaaagcc tgaactcacc gcgacgtctg 9180 tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag ctctcggagg 9240 gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc ctgcgggtaa 9300 atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt gcatcggccg 9360 cgctcccgat tccggaagtg cttgacattg gggaattcag cgagagcctg acctattgca 9420 tctcccgccg tgcacagggt gtcacgttgc aagacctgcc tgaaaccgaa ctgcccgctg 9480 ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc ggccgatctt agccagacga 9540 gcgggttcgg cccattcgga ccgcaaggaa tcggtcaata cactacatgg cgtgatttca 9600 tatgcgcgat tgctgatccc catgtgtatc actggcaaac tgtgatggac gacaccgtca 9660 gtgcgtccgt cgcgcaggct ctcgatgagc tgatgctttg ggccgaggac tgccccgaag 9720 tccggcacct cgtgcacgcg gatttcggct ccaacaatgt cctgacggac aatggccgca 9780 taacagcggt cattgactgg agcgaggcga tgttcgggga ttcccaatac gaggtcgcca 9840 acatcttctt ctggaggccg tggttggctt gtatggagca gcagacgcgc tacttcgagc 9900 ggaggcatcc ggagcttgca ggatcgccgc ggctccgggc gtatatgctc cgcattggtc 9960 ttgaccaact ctatcagagc ttggttgacg gcaatttcga tgatgcagct tgggcgcagg 10020 gtcgatgcga cgcaatcgtc cgatccggag ccgggactgt cgggcgtaca caaatcgccc 10080 gcagaagcgc ggccgtctgg accgatggct gtgtagaagt actcgccgat agtggaaacc 10140 gacgccccag cactcgtccg gatcgggaga tgggggaggc taactgaaac acggaaggag 10200 acaataccgg aaggaacccg cgctatgacg gcaataaaaa gacagaataa aacgcacggg 10260 tgttgggtcg tttgttcata aacgcggggt tcggtcccag ggctggcact ctgtcgatac 10320 cccaccgaga ccccattggg gccaatacgc ccgcgtttct tccttttccc caccccaccc 10380 cccaagttcg ggtgaaggcc cagggctcgc agccaacgtc ggggcggcag gccctgccat 10440 agccactggc cccgtgggtt agggacgggg tcccccatgg ggaatggttt atggttcgtg 10500 ggggttatta ttttgggcgt tgcgtggggt caggtccacg actggactga gcagacagac 10560 ccatggtttt tggatggcct gggcatggac cgcatgtact ggcgcgacac gaacaccggg 10620 cgtctgtggc tgccaaacac ccccgacccc caaaaaccac cgcgcggatt tctggcgtgc 10680 caagctagtc gaccaattct catgtttgac agcttatcat cgcagatccg ggcaacgttg 10740 ttgccattgc tgcaggcgca gaactggtag gtatggaaga tccatacatt gaatcaatat 10800 tggcaattag ccatattagt cattggttat atagcataaa tcaatattgg ctattggcca 10860 ttgcatacgt tgtatctata tcataatatg tacatttata ttggctcatg tccaatatga 10920 ccgccat 10927 <210> 160 <211> 4641 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 160 aagaaaccaa ttgtccatat tgcatcagac attgccgtca ctgcgtcttt tactggctct 60 tctcgctaac caaaccggta accccgctta ttaaaagcat tctgtaacaa agcgggacca 120 aagccatgac aaaaacgcgt aacaaaagtg tctataatca cggcagaaaa gtccacattg 180 attatttgca cggcgtcaca ctttgctatg ccatagcatt tttatccata agattagcgg 240 atcctacctg acgcttttta tcgcaactct ctactgtttc tccatacccg ttttttgggc 300 taacaggagg aattcaccat gaaaaagaca gctatcgcga ttgcagtggc actggctggt 360 ttcgctaccg ttgcgcaagc ttctgagcca ccaacccaga agcccaagaa gattgtaaat 420 gccaagaaag atgttgtgaa cacaaagatg tttgaggagc tcaagagccg tctggacacc 480 ctggcccagg aggtggccct gctgaaggag cagcaggccc tccagacggt ctgcctgaag 540 gggaccaagg tgcacatgaa atgctttctg gccttcaccc agacgaagac cttccacgag 600 gccagcgagg actgcatctc gcgcgggggc accctgagca cccctcagac tggctcggag 660 aacgacgccc tgtatgagta cctgcgccag agcgtgggca acgaggccga gatctggctg 720 ggcctcaacg acatggcggc cgagggcacc tgggtggaca tgaccggtac ccgcatcgcc 780 tacaagaact gggagactga gatcaccgcg caacccgatg gcggcaagac cgagaactgc 840 gcggtcctgt caggcgcggc caacggcaag tggttcgaca agcgctgcag ggatcaattg 900 ccctacatct gccagttcgg gatcgtgtac ccctacgacg tgcccgacta cgccggttgg 960 agccacccgc agttcgaaaa ataactcgag ataaacggtc tccagcttgg ctgttttggc 1020 ggatgagaga agattttcag cctgatacag attaaatcag aacgcagaag cggtctgata 1080 aaacagaatt tgcctggcgg cagtagcgcg gtggtcccac ctgaccccat gccgaactca 1140 gaagtgaaac gccgtagcgc cgatggtagt gtggggtctc cccatgcgag agtagggaac 1200 tgccaggcat caaataaaac gaaaggctca gtcgaaagac tgggcctttc gttttatctg 1260 ttgtttgtcg gtgaacgctc tcctgagtag gacaaatccg ccgggagcgg atttgaacgt 1320 tgcgaagcaa cggcccggag ggtggcgggc aggacgcccg ccataaactg ccaggcatca 1380 aattaagcag aaggccatcc tgacggatgg cctttttgcg tttctacaaa ctctttttgt 1440 ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg 1500 cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt 1560 cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta 1620 aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc 1680 ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag cacttttaaa 1740 gttctgctat gtggcgcggt attatcccgt gttgacgccg ggcaagagca actcggtcgc 1800 cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt 1860 acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact 1920 gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc ttttttgcac 1980 aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata 2040 ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta 2100 ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg 2160 gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat 2220 aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt 2280 aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat ggatgaacga 2340 aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa 2400 gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag 2460 gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt ttcgttccac 2520 tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc 2580 gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat 2640 caagagctac caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat 2700 actgtccttc tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct 2760 acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt 2820 cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg 2880 gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta 2940 cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg 3000 gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg 3060 tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc 3120 tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg 3180 gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat 3240 aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac gaccgagcgc 3300 agcgagtcag tgagcgagga agcggaagag cgcctgatgc ggtattttct ccttacgcat 3360 ctgtgcggta tttcacaccg catatggtgc actctcagta caatctgctc tgatgccgca 3420 tagttaagcc agtatacact ccgctatcgc tacgtgactg ggtcatggct gcgccccgac 3480 acccgccaac acccgctgac gcgccctgac gggcttgtct gctcccggca tccgcttaca 3540 gacaagctgt gaccgtctcc gggagctgca tgtgtcagag gttttcaccg tcatcaccga 3600 aacgcgcgag gcagcagatc aattcgcgcg cgaaggcgaa gcggcatgca taatgtgcct 3660 gtcaaatgga cgaagcaggg attctgcaaa ccctatgcta ctccgtcaag ccgtcaattg 3720 tctgattcgt taccaattat gacaacttga cggctacatc attcactttt tcttcacaac 3780 cggcacggaa ctcgctcggg ctggccccgg tgcatttttt aaatacccgc gagaaataga 3840 gttgatcgtc aaaaccaaca ttgcgaccga cggtggcgat aggcatccgg gtggtgctca 3900 aaagcagctt cgcctggctg atacgttggt cctcgcgcca gcttaagacg ctaatcccta 3960 actgctggcg gaaaagatgt gacagacgcg acggcgacaa gcaaacatgc tgtgcgacgc 4020 tggcgatatc aaaattgctg tctgccaggt gatcgctgat gtactgacaa gcctcgcgta 4080 cccgattatc catcggtgga tggagcgact cgttaatcgc ttccatgcgc cgcagtaaca 4140 attgctcaag cagatttatc gccagcagct ccgaatagcg cccttcccct tgcccggcgt 4200 taatgatttg cccaaacagg tcgctgaaat gcggctggtg cgcttcatcc gggcgaaaga 4260 accccgtatt ggcaaatatt gacggccagt taagccattc atgccagtag gcgcgcggac 4320 gaaagtaaac ccactggtga taccattcgc gagcctccgg atgacgaccg tagtgatgaa 4380 tctctcctgg cgggaacagc aaaatatcac ccggtcggca aacaaattct cgtccctgat 4440 ttttcaccac cccctgaccg cgaatggtga gattgagaat ataacctttc attcccagcg 4500 gtcggtcgat aaaaaaatcg agataaccgt tggcctcaat cggcgttaaa cccgccacca 4560 gatgggcatt aaacgagtat cccggcagca ggggatcatt ttgcgcttca gccatacttt 4620 tcatactccc gccattcaga g 4641 <210> 161 <211> 11011 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 161 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta tcatatgcca agtccgcccc ctattgacgt caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttac gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacaccaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa taaccccgcc ccgttgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 600 gtcagatcac tagaagctgg gtaccagctg ctagcgttta aacttaagct tagcgcagag 660 gcttggggca gccgagcggc agccaggccc cggcccgggc ctcggttcca gaagggagag 720 gagcccgcca aggcgcgcaa gagagcgggc tgcctcgcag tccgagccgg agagggagcg 780 cgagccgcgc cggccccgga cggcctccga aaccatggag ctgtgggggg cctacctgct 840 gctgtgcctg ttctccctgc tgacccaggt gaccaccgag ccaccaaccc agaagcccaa 900 gaagattgta aatgccaaga aagatgttgt gaacacaaag atgtttgagg agctcaagag 960 ccgtctggac accctggccc aggaggtggc cctgctgaag gagcagcagg ccctccagac 1020 ggtctgcctg aaggggacca aggtgcacat gaaatgcttt ctggccttca cccagacgaa 1080 gaccttccac gaggccagcg aggactgcat ctcgcgcggg ggcaccctga gcacccctca 1140 gactggctcg gagaacgacg ccctgtatga gtacctgcgc cagagcgtgg gcaacgaggc 1200 cgagatctgg ctgggcctca acgacatggc ggccgagggc acctgggtgg acatgaccgg 1260 tacccgcatc gcctacaaga actgggagac tgagatcacc gcgcaacccg atggcggcaa 1320 gaccgagaac tgcgcggtcc tgtcaggcgc ggccaacggc aagtggttcg acaagcgctg 1380 cagggatcaa ttgccctaca tctgccagtt cgggatcgtg tacccctacg acgtgcccga 1440 ctacgccggt tggagccacc cccagttcga gaagtgactc gaggccggca aggccggatc 1500 cagacatgat aagatacatt gatgagtttg gacaaaccac aactagaatg cagtgaaaaa 1560 aatgctttat ttgtgaaatt tgtgatgcta ttgctttatt tgtaaccatt ataagctgca 1620 ataaacaagt taacaacaag aattgcattc attttatgtt tcaggttcag ggggaggtgt 1680 gggaggtttt ttaaagcaag taaaacctct acaaatgtgg tatggctgat tatgatccgg 1740 ctgcctcgcg cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac 1800 ggtcacagct tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg cgtcagcggg 1860 tgttggcggg tgtcggggcg cagccatgag gtcgactcta gaggatcgat gccccgcccc 1920 ggacgaacta aacctgacta cgacatctct gccccttctt cgcggggcag tgcatgtaat 1980 cccttcagtt ggttggtaca acttgccaac tgggccctgt tccacatgtg acacgggggg 2040 ggaccaaaca caaaggggtt ctctgactgt agttgacatc cttataaatg gatgtgcaca 2100 tttgccaaca ctgagtggct ttcatcctgg agcagacttt gcagtctgtg gactgcaaca 2160 caacattgcc tttatgtgta actcttggct gaagctctta caccaatgct gggggacatg 2220 tacctcccag gggcccagga agactacggg aggctacacc aacgtcaatc agaggggcct 2280 gtgtagctac cgataagcgg accctcaaga gggcattagc aatagtgttt ataaggcccc 2340 cttgttaacc ctaaacgggt agcatatgct tcccgggtag tagtatatac tatccagact 2400 aaccctaatt caatagcata tgttacccaa cgggaagcat atgctatcga attagggtta 2460 gtaaaagggt cctaaggaac agcgatatct cccaccccat gagctgtcac ggttttattt 2520 acatggggtc aggattccac gagggtagtg aaccatttta gtcacaaggg cagtggctga 2580 agatcaagga gcgggcagtg aactctcctg aatcttcgcc tgcttcttca ttctccttcg 2640 tttagctaat agaataactg ctgagttgtg aacagtaagg tgtatgtgag gtgctcgaaa 2700 acaaggtttc aggtgacgcc cccagaataa aatttggacg gggggttcag tggtggcatt 2760 gtgctatgac accaatataa ccctcacaaa ccccttgggc aataaatact agtgtaggaa 2820 tgaaacattc tgaatatctt taacaataga aatccatggg gtggggacaa gccgtaaaga 2880 ctggatgtcc atctcacacg aatttatggc tatgggcaac acataatcct agtgcaatat 2940 gatactgggg ttattaagat gtgtcccagg cagggaccaa gacaggtgaa ccatgttgtt 3000 acactctatt tgtaacaagg ggaaagagag tggacgccga cagcagcgga ctccactggt 3060 tgtctctaac acccccgaaa attaaacggg gctccacgcc aatggggccc ataaacaaag 3120 acaagtggcc actctttttt ttgaaattgt ggagtggggg cacgcgtcag cccccacacg 3180 ccgccctgcg gttttggact gtaaaataag ggtgtaataa cttggctgat tgtaaccccg 3240 ctaaccactg cggtcaaacc acttgcccac aaaaccacta atggcacccc ggggaatacc 3300 tgcataagta ggtgggcggg ccaagatagg ggcgcgattg ctgcgatctg gaggacaaat 3360 tacacacact tgcgcctgag cgccaagcac agggttgttg gtcctcatat tcacgaggtc 3420 gctgagagca cggtgggcta atgttgccat gggtagcata tactacccaa atatctggat 3480 agcatatgct atcctaatct atatctgggt agcataggct atcctaatct atatctgggt 3540 agcatatgct atcctaatct atatctgggt agtatatgct atcctaattt atatctgggt 3600 agcataggct atcctaatct atatctgggt agcatatgct atcctaatct atatctgggt 3660 agtatatgct atcctaatct gtatccgggt agcatatgct atcctaatag agattagggt 3720 agtatatgct atcctaattt atatctgggt agcatatact acccaaatat ctggatagca 3780 tatgctatcc taatctatat ctgggtagca tatgctatcc taatctatat ctgggtagca 3840 taggctatcc taatctatat ctgggtagca tatgctatcc taatctatat ctgggtagta 3900 tatgctatcc taatttatat ctgggtagca taggctatcc taatctatat ctgggtagca 3960 tatgctatcc taatctatat ctgggtagta tatgctatcc taatctgtat ccgggtagca 4020 tatgctatcc tcatgcatat acagtcagca tatgataccc agtagtagag tgggagtgct 4080 atcctttgca tatgccgcca cctcccaagg gggcgtgaat tttcgctgct tgtccttttc 4140 ctgctggttg ctcccattct taggtgaatt taaggaggcc aggctaaagc cgtcgcatgt 4200 ctgattgctc accaggtaaa tgtcgctaat gttttccaac gcgagaaggt gttgagcgcg 4260 gagctgagtg acgtgacaac atgggtatgc cgaattgccc catgttggga ggacgaaaat 4320 ggtgacaaga cagatggcca gaaatacacc aacagcacgc atgatgtcta ctggggattt 4380 attctttagt gcgggggaat acacggcttt taatacgatt gagggcgtct cctaacaagt 4440 tacatcactc ctgcccttcc tcaccctcat ctccatcacc tccttcatct ccgtcatctc 4500 cgtcatcacc ctccgcggca gccccttcca ccataggtgg aaaccaggga ggcaaatcta 4560 ctccatcgtc aaagctgcac acagtcaccc tgatattgca ggtaggagcg ggctttgtca 4620 taacaaggtc cttaatcgca tccttcaaaa cctcagcaaa tatatgagtt tgtaaaaaga 4680 ccatgaaata acagacaatg gactccctta gcgggccagg ttgtgggccg ggtccagggg 4740 ccattccaaa ggggagacga ctcaatggtg taagacgaca ttgtggaata gcaagggcag 4800 ttcctcgcct taggttgtaa agggaggtct tactacctcc atatacgaac acaccggcga 4860 cccaagttcc ttcgtcggta gtcctttcta cgtgactcct agccaggaga gctcttaaac 4920 cttctgcaat gttctcaaat ttcgggttgg aacctccttg accacgatgc tttccaaacc 4980 accctccttt tttgcgcctg cctccatcac cctgaccccg gggtccagtg cttgggcctt 5040 ctcctgggtc atctgcgggg ccctgctcta tcgctcccgg gggcacgtca ggctcaccat 5100 ctgggccacc ttcttggtgg tattcaaaat aatcggcttc ccctacaggg tggaaaaatg 5160 gccttctacc tggagggggc ctgcgcggtg gagacccgga tgatgatgac tgactactgg 5220 gactcctggg cctcttttct ccacgtccac gacctctccc cctggctctt tcacgacttc 5280 cccccctggc tctttcacgt cctctacccc ggcggcctcc actacctcct cgaccccggc 5340 ctccactacc tcctcgaccc cggcctccac tgcctcctcg accccggcct ccacctcctg 5400 ctcctgcccc tcctgctcct gcccctcctc ctgctcctgc ccctcctgcc cctcctgctc 5460 ctgcccctcc tgcccctcct gctcctgccc ctcctgcccc tcctgctcct gcccctcctg 5520 cccctcctcc tgctcctgcc cctcctgccc ctcctcctgc tcctgcccct cctgcccctc 5580 ctgctcctgc ccctcctgcc cctcctgctc ctgcccctcc tgcccctcct gctcctgccc 5640 ctcctgctcc tgcccctcct gctcctgccc ctcctgctcc tgcccctcct gcccctcctg 5700 cccctcctcc tgctcctgcc cctcctgctc ctgcccctcc tgcccctcct gcccctcctg 5760 ctcctgcccc tcctcctgct cctgcccctc ctgcccctcc tgcccctcct cctgctcctg 5820 cccctcctgc ccctcctcct gctcctgccc ctcctcctgc tcctgcccct cctgcccctc 5880 ctgcccctcc tcctgctcct gcccctcctg cccctcctcc tgctcctgcc cctcctcctg 5940 ctcctgcccc tcctgcccct cctgcccctc ctcctgctcc tgcccctcct cctgctcctg 6000 cccctcctgc ccctcctgcc cctcctgccc ctcctcctgc tcctgcccct cctcctgctc 6060 ctgcccctcc tgctcctgcc cctcccgctc ctgctcctgc tcctgttcca ccgtgggtcc 6120 ctttgcagcc aatgcaactt ggacgttttt ggggtctccg gacaccatct ctatgtcttg 6180 gccctgatcc tgagccgccc ggggctcctg gtcttccgcc tcctcgtcct cgtcctcttc 6240 cccgtcctcg tccatggtta tcaccccctc ttctttgagg tccactgccg ccggagcctt 6300 ctggtccaga tgtgtctccc ttctctccta ggccatttcc aggtcctgta cctggcccct 6360 cgtcagacat gattcacact aaaagagatc aatagacatc tttattagac gacgctcagt 6420 gaatacaggg agtgcagact cctgccccct ccaacagccc ccccaccctc atccccttca 6480 tggtcgctgt cagacagatc caggtctgaa aattccccat cctccgaacc atcctcgtcc 6540 tcatcaccaa ttactcgcag cccggaaaac tcccgctgaa catcctcaag atttgcgtcc 6600 tgagcctcaa gccaggcctc aaattcctcg tccccctttt tgctggacgg tagggatggg 6660 gattctcggg acccctcctc ttcctcttca aggtcaccag acagagatgc tactggggca 6720 acggaagaaa agctgggtgc ggcctgtgag gatcagctta tcgatgataa gctgtcaaac 6780 atgagaattc ttgaagacga aagggcctcg tgatacgcct atttttatag gttaatgtca 6840 tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc 6900 ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct 6960 gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg 7020 cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg 7080 tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc 7140 tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca 7200 cttttaaagt tctgctatgt ggcgcggtat tatcccgtgt tgacgccggg caagagcaac 7260 tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa 7320 agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg 7380 ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt 7440 ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg 7500 aagccatacc aaacgacgag cgtgacacca cgatgcctgc agcaatggca acaacgttgc 7560 gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga 7620 tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta 7680 ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc 7740 cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg 7800 atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt 7860 cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa 7920 ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 7980 cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt 8040 ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt 8100 tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga 8160 taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag 8220 caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata 8280 agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg 8340 gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga 8400 gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca 8460 ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa 8520 acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt 8580 tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac 8640 ggttcctggc cttttgctgg ccttgaagct gtccctgatg gtcgtcatct acctgcctgg 8700 acagcatggc ctgcaacgcg ggcatcccga tgccgccgga agcgagaaga atcataatgg 8760 ggaaggccat ccagcctcgc gtcgcgaacg ccagcaagac gtagcccagc gcgtcggccc 8820 cgagatgcgc cgcgtgcggc tgctggagat ggcggacgcg atggatatgt tctgccaagg 8880 gttggtttgc gcattcacag ttctccgcaa gaattgattg gctccaattc ttggagtggt 8940 gaatccgtta gcgaggtgcc gccctgcttc atccccgtgg cccgttgctc gcgtttgctg 9000 gcggtgtccc cggaagaaat atatttgcat gtctttagtt ctatgatgac acaaaccccg 9060 cccagcgtct tgtcattggc gaattcgaac acgcagatgc agtcggggcg gcgcggtccg 9120 aggtccactt cgcatattaa ggtgacgcgt gtggcctcga acaccgagcg accctgcagc 9180 gacccgctta acagcgtcaa cagcgtgccg cagatcccgg ggggcaatga gatatgaaaa 9240 agcctgaact caccgcgacg tctgtcgaga agtttctgat cgaaaagttc gacagcgtct 9300 ccgacctgat gcagctctcg gagggcgaag aatctcgtgc tttcagcttc gatgtaggag 9360 ggcgtggata tgtcctgcgg gtaaatagct gcgccgatgg tttctacaaa gatcgttatg 9420 tttatcggca ctttgcatcg gccgcgctcc cgattccgga agtgcttgac attggggaat 9480 tcagcgagag cctgacctat tgcatctccc gccgtgcaca gggtgtcacg ttgcaagacc 9540 tgcctgaaac cgaactgccc gctgttctgc agccggtcgc ggaggccatg gatgcgatcg 9600 ctgcggccga tcttagccag acgagcgggt tcggcccatt cggaccgcaa ggaatcggtc 9660 aatacactac atggcgtgat ttcatatgcg cgattgctga tccccatgtg tatcactggc 9720 aaactgtgat ggacgacacc gtcagtgcgt ccgtcgcgca ggctctcgat gagctgatgc 9780 tttgggccga ggactgcccc gaagtccggc acctcgtgca cgcggatttc ggctccaaca 9840 atgtcctgac ggacaatggc cgcataacag cggtcattga ctggagcgag gcgatgttcg 9900 gggattccca atacgaggtc gccaacatct tcttctggag gccgtggttg gcttgtatgg 9960 agcagcagac gcgctacttc gagcggaggc atccggagct tgcaggatcg ccgcggctcc 10020 gggcgtatat gctccgcatt ggtcttgacc aactctatca gagcttggtt gacggcaatt 10080 tcgatgatgc agcttgggcg cagggtcgat gcgacgcaat cgtccgatcc ggagccggga 10140 ctgtcgggcg tacacaaatc gcccgcagaa gcgcggccgt ctggaccgat ggctgtgtag 10200 aagtactcgc cgatagtgga aaccgacgcc ccagcactcg tccggatcgg gagatggggg 10260 aggctaactg aaacacggaa ggagacaata ccggaaggaa cccgcgctat gacggcaata 10320 aaaagacaga ataaaacgca cgggtgttgg gtcgtttgtt cataaacgcg gggttcggtc 10380 ccagggctgg cactctgtcg ataccccacc gagaccccat tggggccaat acgcccgcgt 10440 ttcttccttt tccccacccc accccccaag ttcgggtgaa ggcccagggc tcgcagccaa 10500 cgtcggggcg gcaggccctg ccatagccac tggccccgtg ggttagggac ggggtccccc 10560 atggggaatg gtttatggtt cgtgggggtt attattttgg gcgttgcgtg gggtcaggtc 10620 cacgactgga ctgagcagac agacccatgg tttttggatg gcctgggcat ggaccgcatg 10680 tactggcgcg acacgaacac cgggcgtctg tggctgccaa acacccccga cccccaaaaa 10740 ccaccgcgcg gatttctggc gtgccaagct agtcgaccaa ttctcatgtt tgacagctta 10800 tcatcgcaga tccgggcaac gttgttgcca ttgctgcagg cgcagaactg gtaggtatgg 10860 aagatccata cattgaatca atattggcaa ttagccatat tagtcattgg ttatatagca 10920 taaatcaata ttggctattg gccattgcat acgttgtatc tatatcataa tatgtacatt 10980 tatattggct catgtccaat atgaccgcca t 11011 <210> 162 <211> 5783 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 162 tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 tccgctcatg aattaattct tagaaaaact catcgagcat caaatgaaac tgcaatttat 600 tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat gaaggagaaa 660 actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg attccgactc 720 gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta tcaagtgaga 780 aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc atttctttcc 840 agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca tcaaccaaac 900 cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac 960 aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat 1020 tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag 1080 tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca 1140 taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac 1200 ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg 1260 tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa tcagcatcca 1320 tgttggaatt taatcgcggc ctagagcaag acgtttcccg ttgaatatgg ctcataacac 1380 cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa 1440 cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 1500 gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 1560 gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 1620 agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 1680 aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 1740 agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 1800 cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 1860 accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 1920 aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 1980 ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 2040 cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 2100 gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 2160 tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 2220 agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg 2280 tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta 2340 caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 2400 ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 2460 gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 2520 gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 2580 gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 2640 aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 2700 ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 2760 acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 2820 ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 2880 tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 2940 tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 3000 cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 3060 gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 3120 ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggggccgc 3180 catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa 3240 ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc 3300 gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac 3360 gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca 3420 ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgagatc ccggtgccta 3480 atgagtgagc taacttacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa 3540 cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 3600 tgggcgccag ggtggttttt cttttcacca gtgagacggg caacagctga ttgcccttca 3660 ccgcctggcc ctgagagagt tgcagcaagc ggtccacgct ggtttgcccc agcaggcgaa 3720 aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg gtatcgtcgt 3780 atcccactac cgagatatcc gcaccaacgc gcagcccgga ctcggtaatg gcgcgcattg 3840 cgcccagcgc catctgatcg ttggcaacca gcatcgcagt gggaacgatg ccctcattca 3900 gcatttgcat ggtttgttga aaaccggaca tggcactcca gtcgccttcc cgttccgcta 3960 tcggctgaat ttgattgcga gtgagatatt tatgccagcc agccagacgc agacgcgccg 4020 agacagaact taatgggccc gctaacagcg cgatttgctg gtgacccaat gcgaccagat 4080 gctccacgcc cagtcgcgta ccgtcttcat gggagaaaat aatactgttg atgggtgtct 4140 ggtcagagac atcaagaaat aacgccggaa cattagtgca ggcagcttcc acagcaatgg 4200 catcctggtc atccagcgga tagttaatga tcagcccact gacgcgttgc gcgagaagat 4260 tgtgcaccgc cgctttacag gcttcgacgc cgcttcgttc taccatcgac accaccacgc 4320 tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac ggcgcgtgca 4380 gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc agttgttgtg 4440 ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt tcccgcgttt 4500 tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa gagacaccgg 4560 catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg aattgactct 4620 cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg gtgtccggga 4680 tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg 4740 ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc 4800 ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 4860 cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 4920 gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgggatctc gatcccgcga 4980 aattaatacg actcactata ggggaattgt gagcggataa caattcccct ctagaaataa 5040 ttttgtttaa ctttaagaag gagatataca tatgaaatac cttcttccga ctgctgctgc 5100 tggtctttta ctgctggctg ctcagccggc tatggctgct ggtggtggtt ctgccctcca 5160 gacggtctgc ctgaagggga ccaaggtgca catgaaatgc tttctggcct tcacccagac 5220 gaagaccttc cacgaggcca gcgaggactg catctcgcgc gggggcaccc tgagcacccc 5280 tcagactggc tcggagaacg acgccctgta tgagtacctg cgccagagcg tgggcaacga 5340 ggccgagatc tggctgggcc tcaacgacat ggcggccgag ggcacctggg tggacatgac 5400 cggtacccgc atcgcctaca agaactggga gactgagatc accgcgcaac ccgatggcgg 5460 caagaccgag aactgcgcgg tcctgtcagg cgcggccaac ggcaagtggt tcgacaagcg 5520 ctgcagggat caattgccct acatctgcca gttcgggatc gtgtacccct acgacgtgcc 5580 cgactacgcc ggttggagcc acccgcagtt cgaaaaataa ctcgagcacc accaccacca 5640 ccactgagat ccggctgcta acaaagcccg aaaggaagct gagttggctg ctgccaccgc 5700 tgagcaataa ctagcataac cccttggggc ctctaaacgg gtcttgaggg gttttttgct 5760 gaaaggagga actatatccg gat 5783 <210> 163 <211> 4792 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 163 gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60 cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120 tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180 aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240 ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300 ctgaagatca gttgggtgct cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360 tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420 tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480 actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540 gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600 acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660 gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720 acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780 gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840 ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900 gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960 cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020 agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080 catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140 tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200 cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260 gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320 taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc 1380 ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440 tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500 ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560 cgtgcataca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620 agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680 gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740 atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800 gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860 gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920 ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980 cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040 cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100 acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160 cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220 accatgatta cgccaagctt tggagccttt tttttggaga ttttcaacgt gaaaaaatta 2280 ttattcgcaa ttcctttagt tgttcctttc tatgcggccc agccggccat ggccgcctta 2340 cagactgtgt gcctgaaggg caccaaggtg aacttgaagt gcctcctggc cttcacccaa 2400 ccgaagacct tccatgaggc gagcgaggac tgcatctcgc aagggggcac gctgggtacc 2460 ccgcagtcag agctggagaa cgaggcgctg ttcgaatacg cgcgccacag cgtgggcaac 2520 gatgcgaaca tctggctggg cctcaacgac atggccgcgg aaggcgcctg ggtcgactaa 2580 gtgatatcct gacctaactg cagagatcag ttgccctaca tctgccagtt tgccattgtg 2640 gcggccgcag gtgcgccggt gccgtatccg gatccgctgg aaccgcgtgc cgcatagact 2700 gttgaaagtt gtttagcaaa acctcataca gaaaattcat ttactaacgt ctggaaagac 2760 gacaaaactt tagatcgtta cgctaactat gagggctgtc tgtggaatgc tacaggcgtt 2820 gtggtttgta ctggtgacga aactcagtgt tacggtacat gggttcctat tgggcttgct 2880 atccctgaaa atgagggtgg tggctctgag ggtggcggtt ctgagggtgg cggttctgag 2940 ggtggcggta ctaaacctcc tgagtacggt gatacaccta ttccgggcta tacttatatc 3000 aaccctctcg acggcactta tccgcctggt actgagcaaa accccgctaa tcctaatcct 3060 tctcttgagg agtctcagcc tcttaatact ttcatgtttc agaataatag gttccgaaat 3120 aggcagggtg cattaactgt ttatacgggc actgttactc aaggcactga ccccgttaaa 3180 acttattacc agtacactcc tgtatcatca aaagccatgt atgacgctta ctggaacggt 3240 aaattcagag actgcgcttt ccattctggc tttaatgagg atccattcgt ttgtgaatat 3300 caaggccaat cgtctgacct gcctcaacct cctgtcaatg ctggcggcgg ctctggtggt 3360 ggttctggtg gcggctctga gggtggcggc tctgagggtg gcggttctga gggtggcggc 3420 tctgagggtg gcggttccgg tggcggctcc ggttccggtg attttgatta tgaaaaaatg 3480 gcaaacgcta ataagggggc tatgaccgaa aatgccgatg aaaacgcgct acagtctgac 3540 gctaaaggca aacttgattc tgtcgctact gattacggtg ctgctatcga tggtttcatt 3600 ggtgacgttt ccggccttgc taatggtaat ggtgctactg gtgattttgc tggctctaat 3660 tcccaaatgg ctcaagtcgg tgacggtgat aattcacctt taatgaataa tttccgtcaa 3720 tatttacctt ctttgcctca gtcggttgaa tgtcgccctt atgtctttgg cgctggtaaa 3780 ccatatgaat tttctattga ttgtgacaaa ataaacttat tccgtggtgt ctttgcgttt 3840 cttttatatg ttgccacctt tatgtatgta ttttcgacgt ttgctaacat actgcgtaat 3900 aaggagtctt aataagaatt cactggccgt cgttttacaa cgtcgtgact gggaaaaccc 3960 tggcgttacc caacttaatc gccttgcagc acatccccct ttcgccagct ggcgtaatag 4020 cgaagaggcc cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggcg 4080 cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca tacgtcaaag 4140 caaccatagt acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc 4200 agcgtgaccg ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc 4260 tttctcgcca cgttcgccgg ctttccccgt caagctctaa atcgggggct ccctttaggg 4320 ttccgattta gtgctttacg gcacctcgac cccaaaaaac ttgatttggg tgatggttca 4380 cgtagtgggc catcgccctg atagacggtt tttcgccctt tgacgttgga gtccacgttc 4440 tttaatagtg gactcttgtt ccaaactgga acaacactca accctatctc gggctattct 4500 tttgatttat aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa 4560 caaaaattta acgcgaattt taacaaaata ttaacgttta caattttatg gtgcagtctc 4620 agtacaatct gctctgatgc cgcatagtta agccagcccc gacacccgcc aacacccgct 4680 gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc 4740 tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc ga 4792 <210> 164 <211> 4101 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 164 gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60 cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120 tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180 aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240 ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300 ctgaagatca gttgggtgct cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360 tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420 tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480 actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540 gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600 acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660 gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720 acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780 gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840 ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900 gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960 cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020 agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080 catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140 tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200 cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260 gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320 taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc 1380 ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440 tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500 ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560 cgtgcataca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620 agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680 gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740 atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800 gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860 gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920 ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980 cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040 cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100 acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160 cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220 accatgatta cgccaagctt tggagccttt tttttggaga ttttcaacgt gaaaaaatta 2280 ttattcgcaa ttcctttagt tgttcctttc tatgcggccc agccggccat ggccgccctc 2340 cagacggtct gcctgaaggg gaccaaggtg cacatgaaat gctttctggc cttcacccag 2400 acgaagacct tccacgaggc cagcgaggac tgcatctcgc gcgggggcac cctgagcacc 2460 cctcagactg gctcggagaa cgacgccctg tatgagtacc tgcgccagag cgtgggcaac 2520 gaggccgaga tctaagtgac gatatcctga cctaaggtac ctaagtgacg atatcctgac 2580 ctaactgcag ggatcaattg ccctacatct gccagttcgg gatcgtggcg gccgcaggtg 2640 cgccggtgcc gtatccggat ccgctggaac cgcgtgccgc acaggctgag ggtggcggct 2700 ctgagggtgg cggttctgag ggtggcggct ctgagggtgg cggttccggt ggcggctccg 2760 gttccggtga ttttgattat gaaaaaatgg caaacgctaa taagggggct atgaccgaaa 2820 atgccgatga aaacgcgcta cagtctgacg ctaaaggcaa acttgattct gtcgctactg 2880 attacggtgc tgctatcgat ggtttcattg gtgacgtttc cggccttgct aatggtaatg 2940 gtgctactgg tgattttgct ggctctaatt cccaaatggc tcaagtcggt gacggtgata 3000 attcaccttt aatgaataat ttccgtcaat atttaccttc tttgcctcag tcggttgaat 3060 gtcgccctta tgtctttggc gctggtaaac catatgaatt ttctattgat tgtgacaaaa 3120 taaacttatt ccgtggtgtc tttgcgtttc ttttatatgt tgccaccttt atgtatgtat 3180 tttcgacgtt tgctaacata ctgcgtaata aggagtctta ataagaattc actggccgtc 3240 gttttacaac gtcgtgactg ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca 3300 catccccctt tcgccagctg gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa 3360 cagttgcgca gcctgaatgg cgaatggcgc ctgatgcggt attttctcct tacgcatctg 3420 tgcggtattt cacaccgcat acgtcaaagc aaccatagta cgcgccctgt agcggcgcat 3480 taagcgcggc gggtgtggtg gttacgcgca gcgtgaccgc tacacttgcc agcgccctag 3540 cgcccgctcc tttcgctttc ttcccttcct ttctcgccac gttcgccggc tttccccgtc 3600 aagctctaaa tcgggggctc cctttagggt tccgatttag tgctttacgg cacctcgacc 3660 ccaaaaaact tgatttgggt gatggttcac gtagtgggcc atcgccctga tagacggttt 3720 ttcgcccttt gacgttggag tccacgttct ttaatagtgg actcttgttc caaactggaa 3780 caacactcaa ccctatctcg ggctattctt ttgatttata agggattttg ccgatttcgg 3840 cctattggtt aaaaaatgag ctgatttaac aaaaatttaa cgcgaatttt aacaaaatat 3900 taacgtttac aattttatgg tgcagtctca gtacaatctg ctctgatgcc gcatagttaa 3960 gccagccccg acacccgcca acacccgctg acgcgccctg acgggcttgt ctgctcccgg 4020 catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag aggttttcac 4080 cgtcatcacc gaaacgcgcg a 4101 <210> 165 <211> 4114 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 165 gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60 cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120 tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180 aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240 ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300 ctgaagatca gttgggtgct cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360 tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420 tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480 actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540 gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600 acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660 gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720 acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780 gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840 ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900 gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960 cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020 agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080 catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140 tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200 cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260 gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320 taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc 1380 ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440 tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500 ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560 cgtgcataca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620 agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680 gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740 atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800 gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860 gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920 ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980 cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040 cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100 acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160 cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220 accatgatta cgccaagctt tggagccttt tttttggaga ttttcaacgt gaaaaaatta 2280 ttattcgcaa ttcctttagt tgttcctttc tatgcggccc agccggccat ggccgcctta 2340 cagactgtgt gcctgaaggg caccaaggtg aacttgaagt gcctcctggc cttcacccaa 2400 ccgaagacct tccatgaggc gagcgaggac tgcatctcgc aagggggcac gctgggtacc 2460 ccgcagtcag agctggagaa cgaggcgctg ttcgaatacg cgcgccacag cgtgggcaac 2520 gatgcgaaca tctggctggg cctcaacgac atggccgcgg aaggcgcctg ggtcgactaa 2580 gtgatatcct gacctaactg cagagatcag ttgccctaca tctgccagtt tgccattgtg 2640 gcggccgcag gtgcgccggt gccgtatccg gatccgctgg aaccgcgtgc cgcacaggct 2700 gagggtggcg gctctgaggg tggcggttct gagggtggcg gctctgaggg tggcggttcc 2760 ggtggcggct ccggttccgg tgattttgat tatgaaaaaa tggcaaacgc taataagggg 2820 gctatgaccg aaaatgccga tgaaaacgcg ctacagtctg acgctaaagg caaacttgat 2880 tctgtcgcta ctgattacgg tgctgctatc gatggtttca ttggtgacgt ttccggcctt 2940 gctaatggta atggtgctac tggtgatttt gctggctcta attcccaaat ggctcaagtc 3000 ggtgacggtg ataattcacc tttaatgaat aatttccgtc aatatttacc ttctttgcct 3060 cagtcggttg aatgtcgccc ttatgtcttt ggcgctggta aaccatatga attttctatt 3120 gattgtgaca aaataaactt attccgtggt gtctttgcgt ttcttttata tgttgccacc 3180 tttatgtatg tattttcgac gtttgctaac atactgcgta ataaggagtc ttaataagaa 3240 ttcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta cccaacttaa 3300 tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg cccgcaccga 3360 tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg cgcctgatgc ggtattttct 3420 ccttacgcat ctgtgcggta tttcacaccg catacgtcaa agcaaccata gtacgcgccc 3480 tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt 3540 gccagcgccc tagcgcccgc tcctttcgct ttcttccctt cctttctcgc cacgttcgcc 3600 ggctttcccc gtcaagctct aaatcggggg ctccctttag ggttccgatt tagtgcttta 3660 cggcacctcg accccaaaaa acttgatttg ggtgatggtt cacgtagtgg gccatcgccc 3720 tgatagacgg tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg 3780 ttccaaactg gaacaacact caaccctatc tcgggctatt cttttgattt ataagggatt 3840 ttgccgattt cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat 3900 tttaacaaaa tattaacgtt tacaatttta tggtgcagtc tcagtacaat ctgctctgat 3960 gccgcatagt taagccagcc ccgacacccg ccaacacccg ctgacgcgcc ctgacgggct 4020 tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt 4080 cagaggtttt caccgtcatc accgaaacgc gcga 4114 <210> 166 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 166 Lys Met Leu Glu Glu Leu Lys Thr Gln Leu Asp Ser Leu Ala Gln Glu 1 5 10 15 Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Cys Leu 20 25 30 <210> 167 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 167 Asp Leu Lys Thr Gln Val Glu Lys Leu Trp Arg Glu Val Asn Ala Leu 1 5 10 15 Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu 20 25 <210> 168 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 168 Asp Leu Lys Thr Gln Val Glu Lys Leu Trp Arg Glu Val Asn Ala Leu 1 5 10 15 Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu 20 25 <210> 169 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 169 Asp Leu Lys Thr Gln Val Glu Lys Leu Trp Arg Glu Val Asn Ala Leu 1 5 10 15 Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu 20 25 <210> 170 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 170 Asp Leu Lys Thr Gln Ile Glu Lys Leu Trp Thr Glu Val Asn Ala Leu 1 5 10 15 Lys Glu Ile Gln Ala Leu Gln Thr Val Cys Leu 20 25 <210> 171 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 171 Asp Asp Leu Lys Thr Gln Ile Asp Lys Leu Trp Arg Glu Val Asn Ala 1 5 10 15 Leu Lys Glu Ile Gln Ala Leu Gln Thr Val Cys Leu 20 25 <210> 172 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 172 Asp Leu Lys Thr Gln Val Glu Lys Leu Trp Arg Glu Val Asn Ala Leu 1 5 10 15 Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu 20 25 <210> 173 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 173 Asp Leu Lys Ser Gln Val Glu Lys Leu Trp Arg Glu Val Asn Ala Leu 1 5 10 15 Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu 20 25 <210> 174 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 174 Asp Leu Lys Thr Gln Val Glu Lys Leu Trp Arg Glu Val Asn Ala Leu 1 5 10 15 Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu 20 25 <210> 175 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 175 Asp Asp Leu Arg Asn Glu Ile Asp Lys Leu Trp Arg Glu Val Asn Ser 1 5 10 15 Leu Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu 20 25 <210> 176 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 176 Lys Met Ile Glu Asp Leu Lys Ala Met Ile Asp Asn Ile Ser Gln Glu 1 5 10 15 Val Ala Leu Leu Lys Glu Lys Gln Ala Leu Gln Thr Val Cys Leu 20 25 30 <210> 177 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 177 Lys Met Ile Glu Asp Leu Lys Ala Met Ile Asp Asn Ile Ser Gln Glu 1 5 10 15 Val Ala Leu Leu Lys Glu Lys Gln Ala Leu Gln Thr Val Cys Leu 20 25 30 <210> 178 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 178 Asp Asp Met Lys Thr Gln Ile Asp Lys Leu Trp Gln Glu Val Asn Ser 1 5 10 15 Leu Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu 20 25 <210> 179 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 179 Asp Asp Leu Lys Thr Gln Ile Asp Lys Leu Trp Arg Glu Val Asn Ala 1 5 10 15 Leu Lys Glu Met Gln Ala Leu Gln Ser Val Cys Leu 20 25 <210> 180 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 180 Asp Asp Leu Lys Ser Gln Val Glu Lys Leu Trp Arg Glu Val Asn Ala 1 5 10 15 Leu Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu 20 25 <210> 181 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 181 Asp Asp Leu Lys Thr Gln Ile Asp Lys Leu Trp Arg Glu Val Asn Ala 1 5 10 15 Leu Lys Glu Met Gln Ala Leu Gln Ser Val Cys Leu 20 25 <210> 182 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 182 Asp Asp Val Arg Ser Gln Ile Glu Lys Leu Trp Gln Glu Val Asn Ser 1 5 10 15 Leu Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu 20 25 <210> 183 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 183 Asp Leu Lys Thr Gln Ile Asp Lys Leu Trp Arg Glu Ile Asn Ser Leu 1 5 10 15 Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu 20 25 <210> 184 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 184 Glu Glu Leu Arg Arg Gln Val Ser Asp Leu Ala Gln Glu Leu Asn Ile 1 5 10 15 Leu Lys Glu Gln Gln Ala Leu His Thr Val Cys Leu 20 25 <210> 185 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 185 Lys Met Tyr Glu Glu Leu Lys Gln Lys Val Gln Asn Ile Glu Leu Glu 1 5 10 15 Val Ile His Leu Lys Glu Gln Gln Ala Leu Gln Thr Ile Cys Leu 20 25 30 <210> 186 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 186 Lys Met Tyr Glu Asp Leu Lys Lys Lys Val Gln Asn Ile Glu Glu Asp 1 5 10 15 Val Ile His Leu Lys Glu Gln Gln Ala Leu Gln Thr Ile Cys Leu 20 25 30 <210> 187 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 187 Glu Glu Leu Lys Lys Gln Ile Asp Asn Ile Val Leu Glu Leu Asn Leu 1 5 10 15 Leu Lys Glu Gln Gln Ala Leu Gln Ser Val Cys Leu 20 25 <210> 188 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 188 Glu Glu Leu Lys Lys Gln Ile Asp Gln Ile Ile Gln Asp Leu Asn Leu 1 5 10 15 Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Cys Leu 20 25 <210> 189 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 189 Glu Gln Met Gln Lys Gln Ile Asn Asp Ile Val Gln Glu Leu Asn Leu 1 5 10 15 Leu Lys Glu Gln Gln Ala Leu Gln Ala Val Cys Leu 20 25 <210> 190 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 190 Glu Gln Met Gln Lys Gln Ile Asn Asp Ile Val Gln Glu Leu Asn Leu 1 5 10 15 Leu Lys Glu Gln Gln Ala Leu Gln Ala Val Cys Leu 20 25 <210> 191 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 191 Asn Thr Gly Leu Leu Glu Ser Gln Leu Ser Arg His Asp Gln Met Leu 1 5 10 15 Ser Val His Asp Ile Arg Leu Ala Asp Met Asp Leu Arg Phe Gln Val 20 25 30 Leu Glu Thr Ala Ser Tyr Asn Gly Val Leu Ile Trp Lys Ile Arg Asp 35 40 45 Tyr Lys Arg Arg Lys Gln Glu Ala Val Met 50 55 <210> 192 <211> 21 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 192 Ala Ala Ser Glu Arg Lys Ala Leu Gln Thr Glu Met Ala Arg Ile Lys 1 5 10 15 Lys Trp Leu Thr Phe 20 <210> 193 <211> 37 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 193 Phe Asp Met Ser Cys Arg Ser Arg Leu Ala Thr Leu Asn Glu Lys Leu 1 5 10 15 Thr Ala Leu Glu Arg Arg Ile Glu Tyr Ile Glu Ala Arg Val Thr Lys 20 25 30 Gly Glu Thr Leu Thr 35 <210> 194 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 194 Ala Asp Ile Tyr Lys Ala Asp Phe Gln Ala Glu Arg Gln Ala Arg Glu 1 5 10 15 Lys Leu Ala Glu Lys Lys Glu Leu Leu Gln Glu Gln Leu Glu Gln Leu 20 25 30 Gln Arg Glu Tyr Ser Lys Leu Lys Ala Ser Cys Gln Glu Ser Ala Arg 35 40 45 Ile <210> 195 <211> 71 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 195 Leu Thr Gly Ser Ala Gln Asn Ile Glu Phe Arg Thr Gly Ser Leu Gly 1 5 10 15 Lys Ile Lys Leu Asn Asp Glu Asp Leu Ser Glu Cys Leu His Gln Ile 20 25 30 Gln Lys Asn Lys Glu Asp Ile Ile Glu Leu Lys Gly Ser Ala Ile Gly 35 40 45 Leu Pro Ile Tyr Gln Leu Asn Ser Lys Leu Val Asp Leu Glu Arg Lys 50 55 60 Phe Gln Gly Leu Gln Gln Thr 65 70 <210> 196 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 196 Leu Arg Gly Leu Arg Thr Ile Val Thr Thr Leu Gln Asp Ser Ile Arg 1 5 10 15 Lys Val Thr Glu Glu Asn Lys Glu Leu Ala Asn Glu 20 25 <210> 197 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 197 Val Ala Ser Leu Arg Gln Gln Val Glu Ala Leu Gln Gly Gln Val Gln 1 5 10 15 His Leu Gln Ala Ala Phe Ser Gln Tyr Lys Lys 20 25 <210> 198 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 198 Val Asn Ala Leu Arg Gln Arg Val Gly Ile Leu Glu Gly Gln Leu Gln 1 5 10 15 Arg Leu Gln Asn Ala Phe Ser Gln Tyr Lys Lys 20 25 <210> 199 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 199 Ser Ala Ala Leu Arg Gln Gln Met Glu Ala Leu Asn Gly Lys Leu Gln 1 5 10 15 Arg Leu Glu Ala Ala Phe Ser Arg Tyr Lys Lys 20 25 <210> 200 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 200 Val Asn Ala Leu Lys Gln Arg Val Thr Ile Leu Asp Gly His Leu Arg 1 5 10 15 Arg Phe Gln Asn Ala Phe Ser Gln Tyr Lys Lys 20 25 <210> 201 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 201 Val Asp Thr Leu Arg Gln Arg Met Arg Asn Leu Glu Gly Glu Val Gln 1 5 10 15 Arg Leu Gln Asn Ile Val Thr Gln Tyr Arg Lys 20 25 <210> 202 <211> 64 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 202 Gly Ser Pro Gly Leu Lys Gly Asp Lys Gly Ile Pro Gly Asp Lys Gly 1 5 10 15 Ala Lys Gly Glu Ser Gly Leu Pro Asp Val Ala Ser Leu Arg Gln Gln 20 25 30 Val Glu Ala Leu Gln Gly Gln Val Gln His Leu Gln Ala Ala Phe Ser 35 40 45 Gln Tyr Lys Lys Val Glu Leu Phe Pro Gly Gly Ile Pro His Arg Asp 50 55 60 <210> 203 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 203 Asp Met Ala Ala Glu Gly Thr Trp 1 5 <210> 204 <211> 6 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 204 Asp Met Thr Gly Ala Arg 1 5 <210> 205 <211> 10 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 205 Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro 1 5 10 <210> 206 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 206 Asp Gly Gly Lys Thr Glu Asn 1 5 <210> 207 <211> 10 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (3)..(9) <223> Xaa can be any naturally occurring amino acid <400> 207 Asp Met Xaa Xaa Xaa Xaa Xaa Xaa Xaa Trp 1 5 10 <210> 208 <211> 6 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (6)..(6) <223> Xaa can be any naturally occurring amino acid <400> 208 Asp Met Thr Gly Ala Xaa 1 5 <210> 209 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 209 Asp Gly Gly Ala Thr Glu Asn 1 5 <210> 210 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (3)..(7) <223> Xaa can be any naturally occurring amino acid <400> 210 Asp Met Xaa Xaa Xaa Xaa Xaa Trp 1 5 <210> 211 <211> 6 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (3)..(6) <223> Xaa can be any naturally occurring amino acid <400> 211 Asp Met Xaa Xaa Xaa Xaa 1 5 <210> 212 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (1)..(7) <223> Xaa can be any naturally occurring amino acid <400> 212 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Trp 1 5 <210> 213 <211> 10 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (4)..(8) <223> Xaa can be any naturally occurring amino acid <400> 213 Asp Gly Gly Xaa Xaa Xaa Xaa Xaa Glu Asn 1 5 10 <210> 214 <211> 10 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (3)..(8) <223> Xaa can be any naturally occurring amino acid <400> 214 Asn Trp Xaa Xaa Xaa Xaa Xaa Xaa Gln Pro 1 5 10 <210> 215 <211> 11 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (3)..(9) <223> Xaa can be any naturally occurring amino acid <400> 215 Asn Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gln Pro 1 5 10 <210> 216 <211> 12 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (3)..(10) <223> Xaa can be any naturally occurring amino acid <400> 216 Asn Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gln Pro 1 5 10 <210> 217 <211> 16 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (6)..(12) <223> Xaa can be any naturally occurring amino acid <400> 217 Asn Trp Glu Thr Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Ala Gln Pro 1 5 10 15 <210> 218 <211> 13 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (5)..(10) <223> Xaa can be any naturally occurring amino acid <400> 218 Asn Trp Glu Thr Xaa Xaa Xaa Xaa Xaa Xaa Ala Gln Pro 1 5 10 <210> 219 <211> 10 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (4)..(9) <223> Xaa can be any naturally occurring amino acid <400> 219 Asp Gly Gly Xaa Xaa Xaa Xaa Xaa Xaa Asn 1 5 10 <210> 220 <211> 10 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (4)..(10) <223> Xaa can be any naturally occurring amino acid <400> 220 Asn Trp Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1 5 10 <210> 221 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (1)..(1) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (4)..(6) <223> Xaa can be any naturally occurring amino acid <400> 221 Xaa Gly Gly Xaa Xaa Xaa Asn 1 5 <210> 222 <211> 10 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (4)..(8) <223> Xaa can be any naturally occurring amino acid <400> 222 Asn Trp Glu Xaa Xaa Xaa Xaa Xaa Gln Pro 1 5 10 <210> 223 <211> 11 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (4)..(10) <223> Xaa can be any naturally occurring amino acid <400> 223 Asp Gly Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn 1 5 10 <210> 224 <211> 62 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (21)..(22) <223> n is a, c, g, or t <220> <221> misc_feature <222> (23)..(23) <223> k is g or t <220> <221> misc_feature <222> (24)..(25) <223> n is a, c, g, or t <220> <221> misc_feature <222> (26)..(26) <223> k is g or t <220> <221> misc_feature <222> (27)..(28) <223> n is a, c, g, or t <220> <221> misc_feature <222> (29)..(29) <223> k is g or t <220> <221> misc_feature <222> (30)..(31) <223> n is a, c, g, or t <220> <221> misc_feature <222> (32)..(32) <223> k is g or t <220> <221> misc_feature <222> (33)..(34) <223> n is a, c, g, or t <220> <221> misc_feature <222> (35)..(35) <223> k is g or t <220> <221> misc_feature <222> (36)..(37) <223> n is a, c, g, or t <220> <221> misc_feature <222> (38)..(38) <223> k is g or t <220> <221> misc_feature <222> (39)..(40) <223> n is a, c, g, or t <220> <221> misc_feature <222> (41)..(41) <223> k is g or t <400> 224 ggctgggcct gaacgacatg nnknnknnkn nknnknnknn ktgggtggat atgactggcg 60 cc 62 <210> 225 <211> 60 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (37)..(37) <223> m is a or c <220> <221> misc_feature <222> (38)..(39) <223> n is a, c, g, or t <400> 225 ggcggtgatc tcagtttccc agttcttgta ggcgatmnng gcgccagtca tatccaccca 60 <210> 226 <211> 60 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 226 ggcggtgatc tcagtttccc agttcttgta ggcgatgcgg gcgccagtca tatccaccca 60 <210> 227 <211> 62 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 227 actgggaaac tgagatcacc gcccaacctg atggcggcgc aaccgagaac tgcgcggtcc 60 tg 62 <210> 228 <211> 60 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 228 ccctgcagcg cttgtcgaac cacttgccgt tggcggcgcc agacaggacc gcgcagttct 60 <210> 229 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 229 gccgagatct ggctgggcct gaacgacatg 30 <210> 230 <211> 23 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 230 atccctgcag cgcttgtcga acc 23 <210> 231 <211> 79 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (21)..(22) <223> n is a, c, g, or t <220> <221> misc_feature <222> (23)..(23) <223> k is g or t <220> <221> misc_feature <222> (24)..(25) <223> n is a, c, g, or t <220> <221> misc_feature <222> (26)..(26) <223> k is g or t <220> <221> misc_feature <222> (27)..(28) <223> n is a, c, g, or t <220> <221> misc_feature <222> (29)..(29) <223> k is g or t <220> <221> misc_feature <222> (30)..(31) <223> n is a, c, g, or t <220> <221> misc_feature <222> (32)..(32) <223> k is g or t <220> <221> misc_feature <222> (33)..(34) <223> n is a, c, g, or t <220> <221> misc_feature <222> (35)..(35) <223> k is g or t <220> <221> misc_feature <222> (48)..(49) <223> n is a, c, g, or t <220> <221> misc_feature <222> (50)..(50) <223> k is g or t <220> <221> misc_feature <222> (51)..(52) <223> n is a, c, g, or t <220> <221> misc_feature <222> (53)..(53) <223> k is g or t <220> <221> misc_feature <222> (54)..(55) <223> n is a, c, g, or t <220> <221> misc_feature <222> (56)..(56) <223> k is g or t <220> <221> misc_feature <222> (57)..(58) <223> n is a, c, g, or t <220> <221> misc_feature <222> (59)..(59) <223> k is g or t <400> 231 ggctgggcct gaacgacatg nnknnknnkn nknnktgggt ggatatgnnk nnknnknnka 60 tcgcctacaa gaactggga 79 <210> 232 <211> 77 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 232 gacaggacgg cgcagttctc ggttgcgccg ccatcaggtt gggcggtgat ctcagtttcc 60 cagttcttgt aggcgat 77 <210> 233 <211> 63 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 233 atccctgcag cgcttgtcga accacttgcc gttggcggcg ccagacagga cggcgcagtt 60 ctc 63 <210> 234 <211> 60 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (22)..(23) <223> n is a, c, g, or t <220> <221> misc_feature <222> (24)..(24) <223> s is g or c <220> <221> misc_feature <222> (25)..(26) <223> n is a, c, g, or t <220> <221> misc_feature <222> (27)..(27) <223> s is g or c <220> <221> misc_feature <222> (28)..(29) <223> n is a, c, g, or t <220> <221> misc_feature <222> (30)..(30) <223> s is g or c <220> <221> misc_feature <222> (31)..(32) <223> n is a, c, g, or t <220> <221> misc_feature <222> (33)..(33) <223> s is g or c <220> <221> misc_feature <222> (34)..(35) <223> n is a, c, g, or t <220> <221> misc_feature <222> (36)..(36) <223> s is g or c <220> <221> misc_feature <222> (37)..(38) <223> n is a, c, g, or t <220> <221> misc_feature <222> (39)..(39) <223> s is g or c <220> <221> misc_feature <222> (40)..(41) <223> n is a, c, g, or t <220> <221> misc_feature <222> (42)..(42) <223> s is g or c <400> 234 gagatctggc tgggcctcaa cnnsnnsnns nnsnnsnnsn nstgggtgga catgactggc 60 <210> 235 <211> 63 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 235 ttgcgcggtg atctcagtct cccagttctt gtaggcgata cgcgcgccag tcatgtccac 60 cca 63 <210> 236 <211> 64 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (32)..(33) <223> n is a, c, g, or t <220> <221> misc_feature <222> (34)..(34) <223> s is g or c <220> <221> misc_feature <222> (35)..(36) <223> n is a, c, g, or t <220> <221> misc_feature <222> (37)..(37) <223> s is g or c <220> <221> misc_feature <222> (38)..(39) <223> n is a, c, g, or t <220> <221> misc_feature <222> (40)..(40) <223> s is g or c <220> <221> misc_feature <222> (41)..(42) <223> n is a, c, g, or t <220> <221> misc_feature <222> (43)..(43) <223> s is g or c <220> <221> misc_feature <222> (44)..(45) <223> n is a, c, g, or t <220> <221> misc_feature <222> (46)..(46) <223> s is g or c <400> 236 gactgagatc accgcgcaac ccgatggcgg cnnsnnsnns nnsnnsgaga actgcgcggt 60 cctg 64 <210> 237 <211> 60 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 237 ccctgcagcg cttgtcgaac cacttgccgt tggccgcgcc tgacaggacc gcgcagttct 60 <210> 238 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 238 gccgagatct ggctgggcct ca 22 <210> 239 <211> 78 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 239 atctggctgg gcctgaacga catggccgcc gagggcacct gggtggatat gaccggcgcg 60 cgtatcgcct acaagaac 78 <210> 240 <211> 62 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (18)..(18) <223> m is a or c <220> <221> misc_feature <222> (19)..(20) <223> n is a, c, g, or t <220> <221> misc_feature <222> (21)..(21) <223> m is a or c <220> <221> misc_feature <222> (22)..(23) <223> n is a, c, g, or t <220> <221> misc_feature <222> (24)..(24) <223> m is a or c <220> <221> misc_feature <222> (25)..(26) <223> n is a, c, g, or t <220> <221> misc_feature <222> (27)..(27) <223> m is a or c <220> <221> misc_feature <222> (28)..(29) <223> n is a, c, g, or t <220> <221> misc_feature <222> (30)..(30) <223> m is a or c <220> <221> misc_feature <222> (31)..(32) <223> n is a, c, g, or t <220> <221> misc_feature <222> (33)..(33) <223> m is a or c <220> <221> misc_feature <222> (34)..(35) <223> n is a, c, g, or t <400> 240 ccgccatcgg gttgggcmnn mnnmnnmnnm nnmnnagttt cccagttctt gtaggcgata 60 cg 62 <210> 241 <211> 57 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (19)..(20) <223> n is a, c, g, or t <220> <221> misc_feature <222> (21)..(21) <223> k is g or t <220> <221> misc_feature <222> (22)..(23) <223> n is a, c, g, or t <220> <221> misc_feature <222> (24)..(24) <223> k is g or t <220> <221> misc_feature <222> (25)..(26) <223> n is a, c, g, or t <220> <221> misc_feature <222> (27)..(27) <223> k is g or t <220> <221> misc_feature <222> (28)..(29) <223> n is a, c, g, or t <220> <221> misc_feature <222> (30)..(30) <223> k is g or t <220> <221> misc_feature <222> (31)..(32) <223> n is a, c, g, or t <220> <221> misc_feature <222> (33)..(33) <223> k is g or t <220> <221> misc_feature <222> (34)..(35) <223> n is a, c, g, or t <220> <221> misc_feature <222> (36)..(36) <223> k is g or t <400> 241 gcccaacccg atggcggcnn knnknnknnk nnknnkaact gcgccgtcct gtctggc 57 <210> 242 <211> 54 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 242 cctgcagcgc ttgtcgaacc acttgccgtt ggcggcgcca gacaggacgg cgca 54 <210> 243 <211> 84 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (22)..(22) <223> m is a or c <220> <221> misc_feature <222> (23)..(24) <223> n is a, c, g, or t <220> <221> misc_feature <222> (25)..(25) <223> m is a or c <220> <221> misc_feature <222> (26)..(27) <223> n is a, c, g, or t <220> <221> misc_feature <222> (28)..(28) <223> m is a or c <220> <221> misc_feature <222> (29)..(30) <223> n is a, c, g, or t <220> <221> misc_feature <222> (37)..(37) <223> m is a or c <220> <221> misc_feature <222> (38)..(39) <223> n is a, c, g, or t <220> <221> misc_feature <222> (40)..(40) <223> m is a or c <220> <221> misc_feature <222> (41)..(42) <223> n is a, c, g, or t <220> <221> misc_feature <222> (43)..(43) <223> m is a or c <220> <221> misc_feature <222> (44)..(45) <223> n is a, c, g, or t <220> <221> misc_feature <222> (46)..(46) <223> m is a or c <220> <221> misc_feature <222> (47)..(48) <223> n is a, c, g, or t <220> <221> misc_feature <222> (49)..(49) <223> m is a or c <220> <221> misc_feature <222> (50)..(51) <223> n is a, c, g, or t <220> <221> misc_feature <222> (52)..(52) <223> m is a or c <220> <221> misc_feature <222> (53)..(54) <223> n is a, c, g, or t <220> <221> misc_feature <222> (55)..(55) <223> m is a or c <220> <221> misc_feature <222> (56)..(57) <223> n is a, c, g, or t <220> <221> misc_feature <222> (58)..(58) <223> m is a or c <220> <221> misc_feature <222> (59)..(60) <223> n is a, c, g, or t <400> 243 gccagacagg acggcgcagt tmnnmnnmnn gccgccmnnm nnmnnmnnmn nmnnmnnmnn 60 ttcccagttc ttgtaggcga tacg 84 <210> 244 <211> 53 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 244 ccgccatcgg gttgggcggt gatctcagtt tcccagttct tgtaggcgat acg 53 <210> 245 <211> 60 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (19)..(20) <223> n is a, c, g, or t <220> <221> misc_feature <222> (21)..(21) <223> k is g or t <220> <221> misc_feature <222> (22)..(23) <223> n is a, c, g, or t <220> <221> misc_feature <222> (24)..(24) <223> k is g or t <220> <221> misc_feature <222> (25)..(26) <223> n is a, c, g, or t <220> <221> misc_feature <222> (27)..(27) <223> k is g or t <220> <221> misc_feature <222> (28)..(29) <223> n is a, c, g, or t <220> <221> misc_feature <222> (30)..(30) <223> k is g or t <220> <221> misc_feature <222> (31)..(32) <223> n is a, c, g, or t <220> <221> misc_feature <222> (33)..(33) <223> k is g or t <220> <221> misc_feature <222> (34)..(35) <223> n is a, c, g, or t <220> <221> misc_feature <222> (36)..(36) <223> k is g or t <220> <221> misc_feature <222> (37)..(38) <223> n is a, c, g, or t <220> <221> misc_feature <222> (39)..(39) <223> k is g or t <400> 245 gcccaacccg atggcggcnn knnknnknnk nnknnknnka actgcgccgt cctgtctggc 60 <210> 246 <211> 74 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (30)..(31) <223> n is a, c, g, or t <220> <221> misc_feature <222> (32)..(32) <223> k is g or t <220> <221> misc_feature <222> (33)..(34) <223> n is a, c, g, or t <220> <221> misc_feature <222> (35)..(35) <223> k is g or t <220> <221> misc_feature <222> (36)..(37) <223> n is a, c, g, or t <220> <221> misc_feature <222> (38)..(38) <223> k is g or t <220> <221> misc_feature <222> (39)..(40) <223> n is a, c, g, or t <220> <221> misc_feature <222> (41)..(41) <223> k is g or t <220> <221> misc_feature <222> (42)..(43) <223> n is a, c, g, or t <220> <221> misc_feature <222> (44)..(44) <223> k is g or t <220> <221> misc_feature <222> (45)..(46) <223> n is a, c, g, or t <220> <221> misc_feature <222> (47)..(47) <223> k is g or t <400> 246 ctggcgcgcg tatcgcctac aagaactggn nknnknnknn knnknnkcaa cccgatggcg 60 gcgccaccga gaac 74 <210> 247 <211> 77 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (30)..(31) <223> n is a, c, g, or t <220> <221> misc_feature <222> (32)..(32) <223> k is g or t <220> <221> misc_feature <222> (33)..(34) <223> n is a, c, g, or t <220> <221> misc_feature <222> (35)..(35) <223> k is g or t <220> <221> misc_feature <222> (36)..(37) <223> n is a, c, g, or t <220> <221> misc_feature <222> (38)..(38) <223> k is g or t <220> <221> misc_feature <222> (39)..(40) <223> n is a, c, g, or t <220> <221> misc_feature <222> (41)..(41) <223> k is g or t <220> <221> misc_feature <222> (42)..(43) <223> n is a, c, g, or t <220> <221> misc_feature <222> (44)..(44) <223> k is g or t <220> <221> misc_feature <222> (45)..(46) <223> n is a, c, g, or t <220> <221> misc_feature <222> (47)..(47) <223> k is g or t <220> <221> misc_feature <222> (48)..(49) <223> n is a, c, g, or t <220> <221> misc_feature <222> (50)..(50) <223> k is g or t <400> 247 ctggcgcgcg tatcgcctac aagaactggn nknnknnknn knnknnknnk caacccgatg 60 gcggcgccac cgagaac 77 <210> 248 <211> 77 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (30)..(31) <223> n is a, c, g, or t <220> <221> misc_feature <222> (32)..(32) <223> k is g or t <220> <221> misc_feature <222> (33)..(34) <223> n is a, c, g, or t <220> <221> misc_feature <222> (35)..(35) <223> k is g or t <220> <221> misc_feature <222> (36)..(37) <223> n is a, c, g, or t <220> <221> misc_feature <222> (38)..(38) <223> k is g or t <220> <221> misc_feature <222> (39)..(40) <223> n is a, c, g, or t <220> <221> misc_feature <222> (41)..(41) <223> k is g or t <220> <221> misc_feature <222> (42)..(43) <223> n is a, c, g, or t <220> <221> misc_feature <222> (44)..(44) <223> k is g or t <220> <221> misc_feature <222> (45)..(46) <223> n is a, c, g, or t <220> <221> misc_feature <222> (47)..(47) <223> k is g or t <220> <221> misc_feature <222> (48)..(49) <223> n is a, c, g, or t <220> <221> misc_feature <222> (50)..(50) <223> k is g or t <400> 248 ctggcgcgcg tatcgcctac aagaactggn nknnknnknn knnknnknnk caacccgatg 60 gcggcgccac cgagaac 77 <210> 249 <211> 81 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 249 cctgcagcgc ttgtcgaacc acttgccgtt ggcggcgcca gacaggacgg cgcagttctc 60 ggtggcgccg ccatcgggtt g 81 <210> 250 <211> 89 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (24)..(24) <223> s is g or c <220> <221> misc_feature <222> (25)..(25) <223> m is a or c <220> <221> misc_feature <222> (28)..(28) <223> w is a or t <220> <221> misc_feature <222> (29)..(29) <223> m is a or c <220> <221> misc_feature <222> (33)..(33) <223> s is g or c <220> <221> misc_feature <222> (42)..(42) <223> m is a or c <220> <221> misc_feature <222> (43)..(44) <223> n is a, c, g, or t <220> <221> misc_feature <222> (51)..(51) <223> m is a or c <220> <221> misc_feature <222> (52)..(53) <223> n is a, c, g, or t <220> <221> misc_feature <222> (54)..(54) <223> m is a or c <220> <221> misc_feature <222> (55)..(56) <223> n is a, c, g, or t <220> <221> misc_feature <222> (57)..(57) <223> m is a or c <220> <221> misc_feature <222> (58)..(59) <223> n is a, c, g, or t <220> <221> misc_feature <222> (60)..(60) <223> m is a or c <220> <221> misc_feature <222> (61)..(62) <223> n is a, c, g, or t <220> <221> misc_feature <222> (63)..(63) <223> m is a or c <220> <221> misc_feature <222> (64)..(65) <223> n is a, c, g, or t <400> 250 gacaggaccg cgcagttctc gccsmagwmc ccsaagccgc cmnngggttg mnnmnnmnnm 60 nnmnnctccc agttcttgta ggcgatacg 89 <210> 251 <211> 66 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 251 atccctgcag cgcttgtcga accacttgcc gttggccgcg cctgacagga ccgcgcagtt 60 ctcgcc 66 <210> 252 <211> 54 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 252 gagcgtgggc aacgaggccg agatctggct gggcctcaac gacatggccg ccga 54 <210> 253 <211> 70 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 253 ccagttcttg taggcgatac gcgcgccagt catatccacc caggtgccct cggcggccat 60 gtcgttgagg 70 <210> 254 <211> 74 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (28)..(29) <223> n is a, c, g, or t <220> <221> misc_feature <222> (30)..(30) <223> k is g or t <220> <221> misc_feature <222> (31)..(32) <223> n is a, c, g, or t <220> <221> misc_feature <222> (33)..(33) <223> k is g or t <220> <221> misc_feature <222> (34)..(35) <223> n is a, c, g, or t <220> <221> misc_feature <222> (36)..(36) <223> k is g or t <220> <221> misc_feature <222> (37)..(38) <223> n is a, c, g, or t <220> <221> misc_feature <222> (39)..(39) <223> k is g or t <220> <221> misc_feature <222> (40)..(41) <223> n is a, c, g, or t <220> <221> misc_feature <222> (42)..(42) <223> k is g or t <220> <221> misc_feature <222> (43)..(44) <223> n is a, c, g, or t <220> <221> misc_feature <222> (45)..(45) <223> k is g or t <220> <221> misc_feature <222> (46)..(47) <223> n is a, c, g, or t <220> <221> misc_feature <222> (48)..(48) <223> k is g or t <400> 254 atcgcctaca agaactggga gactgrgnnk nnknnknnkn nknnknnkac cgcgcaaccc 60 gatggcggtg caac 74 <210> 255 <211> 74 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 255 cgcttgtcga accacttgcc gttggcggcg ccagacagga cggcgcagtt ctcggttgca 60 ccgccatcgg gttg 74 <210> 256 <211> 18 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 256 gacatggccg cggaaggc 18 <210> 257 <211> 24 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 257 gcagatgtag ggcaactgat ctct 24 <210> 258 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 258 gccgagatct ggctgggcct ga 22 <210> 259 <211> 63 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (31)..(32) <223> n is a, c, g, or t <220> <221> misc_feature <222> (33)..(33) <223> k is g or t <220> <221> misc_feature <222> (34)..(35) <223> n is a, c, g, or t <220> <221> misc_feature <222> (36)..(36) <223> k is g or t <220> <221> misc_feature <222> (37)..(38) <223> n is a, c, g, or t <220> <221> misc_feature <222> (39)..(39) <223> k is g or t <220> <221> misc_feature <222> (40)..(41) <223> n is a, c, g, or t <220> <221> misc_feature <222> (42)..(42) <223> k is g or t <220> <221> misc_feature <222> (43)..(43) <223> w is a or t <400> 259 gccgagatct ggctgggcct caacggcagc nnknnknnkn nkwcctgggt ggacatgact 60 ggc 63 <210> 260 <211> 63 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 260 ttgcgcggtg atctcagtct cccagttctt gtaggcgata cgcgcgccag tcatgtccac 60 cca 63 <210> 261 <211> 64 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 261 gactgagatc accgcgcaac ccgatggcgg cttcggcgtg ttcggcgaga actgcgcggt 60 cctg 64 <210> 262 <211> 64 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 262 gactgagatc accgcgcaac ccgatggcgg ctggggcgtg ttcggcgaga actgcgcggt 60 cctg 64 <210> 263 <211> 64 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 263 gactgagatc accgcgcaac ccgatggcgg cttcgggtac ttcggcgaga actgcgcggt 60 cctg 64 <210> 264 <211> 64 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 264 gactgagatc accgcgcaac ccgatggcgg ctgggggtac ttcggcgaga actgcgcggt 60 cctg 64 <210> 265 <211> 64 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 265 gactgagatc accgcgcaac ccgatggcgg ctggggcgtg tggggcgaga actgcgcggt 60 cctg 64 <210> 266 <211> 77 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 266 tgggcctgaa cgacatggcc gccgagggca cctgggtgga tatgactggc gcgcgtatcg 60 cctacaagaa ctgggag 77 <210> 267 <211> 59 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (21)..(21) <223> m is a or c <220> <221> misc_feature <222> (22)..(23) <223> n is a, c, g, or t <220> <221> misc_feature <222> (24)..(24) <223> m is a or c <220> <221> misc_feature <222> (25)..(26) <223> n is a, c, g, or t <220> <221> misc_feature <222> (27)..(27) <223> m is a or c <220> <221> misc_feature <222> (28)..(29) <223> n is a, c, g, or t <220> <221> misc_feature <222> (30)..(30) <223> m is a or c <220> <221> misc_feature <222> (31)..(32) <223> n is a, c, g, or t <220> <221> misc_feature <222> (33)..(33) <223> m is a or c <220> <221> misc_feature <222> (34)..(35) <223> n is a, c, g, or t <400> 267 gttgcgccgc catcgggttg mnnmnnmnnm nnmnnctccc agttcttgta ggcgatacg 59 <210> 268 <211> 44 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 268 caacccgatg gcggcgcaac cgagaactgc gccgtcctgt ctgg 44 <210> 269 <211> 73 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (44)..(44) <223> m is a or c <220> <221> misc_feature <222> (45)..(46) <223> n is a, c, g, or t <220> <221> misc_feature <222> (47)..(47) <223> m is a or c <220> <221> misc_feature <222> (48)..(49) <223> n is a, c, g, or t <220> <221> misc_feature <222> (50)..(50) <223> m is a or c <220> <221> misc_feature <222> (51)..(52) <223> n is a, c, g, or t <400> 269 tgtagggcaa ttgatccctg cagcgcttgt cgaaccactt gccmnnmnnm nngccagaca 60 ggacggcgca gtt 73 <210> 270 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 270 gccgagatct ggctgggcct gaacgacatg g 31 <210> 271 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 271 Gly Ser Asn Val Thr Gln Thr 1 5 <210> 272 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 272 Phe Gly Ala Phe Gly 1 5 <210> 273 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 273 Gly Ser Ser Val Ser Asp Val 1 5 <210> 274 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 274 Phe Gly Met Trp Gly 1 5 <210> 275 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 275 Ala Gly Arg Tyr Ser Leu Ile 1 5 <210> 276 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 276 Phe Gly Val Phe Gly 1 5 <210> 277 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 277 Gly Ser Arg Arg Ser Gly Val 1 5 <210> 278 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 278 Arg Gly Ala Thr Val Lys Val 1 5 <210> 279 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 279 Ala Asn Pro Ala Gln Asp Leu 1 5 <210> 280 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 280 Phe Gly Val Trp Gly 1 5 <210> 281 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 281 Ala Pro Gly Ala Met Glu Phe 1 5 <210> 282 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 282 Gly Ser Pro Asp Leu Gly Val 1 5 <210> 283 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 283 Gly Ser Val Arg Ser Ala Thr 1 5 <210> 284 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 284 Phe Gly Tyr Phe Gly 1 5 <210> 285 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 285 Gly Ser Pro Val Gly Asp Met 1 5 <210> 286 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 286 Ile Gly Val Trp Gly 1 5 <210> 287 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 287 Gly Ser Ser Lys Leu Gly Leu 1 5 <210> 288 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 288 Gly Ser Val Arg Gly Arg Thr 1 5 <210> 289 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 289 Thr Asn Val Thr Arg Thr Leu 1 5 <210> 290 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 290 Leu Gly Val Trp Gly 1 5 <210> 291 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 291 Gly Ser Ala Leu Thr Asn Thr 1 5 <210> 292 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 292 Ala Asn Arg Arg Arg Thr Met 1 5 <210> 293 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 293 Met Gly Val Trp Gly 1 5 <210> 294 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 294 Gly Ser Ser Val Ser Gly Leu 1 5 <210> 295 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 295 Val Gly Val Phe Gly 1 5 <210> 296 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 296 Gly Ser Trp Leu Gly Asp Val 1 5 <210> 297 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 297 Ser Gly Lys Ala Arg Asp Val 1 5 <210> 298 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 298 Gly Ser Arg Phe Gly His Leu 1 5 <210> 299 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 299 Trp Gly Val Phe Gly 1 5 <210> 300 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 300 Gly Ser Arg Ile Ser Gly Val 1 5 <210> 301 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 301 Ser Gly Lys Arg Arg Thr Val 1 5 <210> 302 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 302 Ser Gly Ser Trp Ala Arg Thr 1 5 <210> 303 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 303 Ala Gly Ala Arg Ala Glu Tyr 1 5 <210> 304 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 304 Trp Gly Val Trp Gly 1 5 <210> 305 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 305 Gly Pro Gly Gln Ala Gly Leu 1 5 <210> 306 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 306 Gly Ser Thr Tyr Thr Asp Leu 1 5 <210> 307 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 307 Gly Thr Arg Met Thr Asn Thr 1 5 <210> 308 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 308 Trp Gly Tyr Phe Gly 1 5 <210> 309 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 309 Gly Ser Leu Leu Thr Gly Leu 1 5 <210> 310 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 310 Tyr Gly Ala Trp Gly 1 5 <210> 311 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 311 Gly Ser Lys Ala Gly Lys Leu 1 5 <210> 312 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 312 Tyr Gly Val Phe Gly 1 5 <210> 313 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 313 Ala Ser Leu Arg Ser Arg Val 1 5 <210> 314 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 314 Tyr Gly Val Trp Gly 1 5 <210> 315 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 315 Gly Asn Pro Ser Gly Ser Val 1 5 <210> 316 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 316 Thr Gly Ala Leu His Gln Val 1 5 <210> 317 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 317 Trp Thr Lys Arg Thr Ala Leu 1 5 <210> 318 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 318 Met Phe Gly Met Gly 1 5 <210> 319 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 319 Trp Thr Leu Ala Lys Asn Leu 1 5 <210> 320 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 320 Leu Phe Gly Arg Gly 1 5 <210> 321 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 321 Val Leu Gly Trp Arg Arg Glu 1 5 <210> 322 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 322 Leu Val Met Pro Met 1 5 <210> 323 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 323 Leu Ala Thr Trp Leu Arg Trp 1 5 <210> 324 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 324 Gln Arg Met Ser Tyr 1 5 <210> 325 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 325 Gln His Leu Gly Ser Phe Trp 1 5 <210> 326 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 326 Val Glu Phe Gln Gly 1 5 <210> 327 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 327 Gly Ser Ala Thr Thr Ala Thr 1 5 <210> 328 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 328 Gly Ser Ala Thr Thr Asp Thr 1 5 <210> 329 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 329 Ser Gly His Trp Arg Ala Val 1 5 <210> 330 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 330 Ala Pro Pro Asp Leu Gly Leu 1 5 <210> 331 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 331 Ala Pro Lys Ser Arg Gln Tyr 1 5 <210> 332 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 332 Val Met Gln Leu Pro Arg Lys 1 5 <210> 333 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 333 Ala Gly Arg Met Gly Leu Val 1 5 <210> 334 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 334 Ala Gly Tyr Thr Lys Gln Pro Ser 1 5 <210> 335 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 335 Leu Leu Leu Arg Asn Gln Pro Pro 1 5 <210> 336 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 336 Gln Glu Pro Ala Lys Gln Pro Thr 1 5 <210> 337 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 337 His Pro Leu Pro Pro Gln Pro Ser 1 5 <210> 338 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 338 His Gln Pro Val Tyr Gln Pro Gly 1 5 <210> 339 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 339 Leu Pro Pro Pro Gly His Pro Gln 1 5 <210> 340 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 340 Asn Gly His Glu Pro Gln Pro Arg 1 5 <210> 341 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 341 Asn Asn Leu Ser Ala Gln Pro Arg 1 5 <210> 342 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 342 Pro Pro Glu Pro Leu His Pro Met 1 5 <210> 343 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 343 Pro Pro Gly Pro His His Pro Met 1 5 <210> 344 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 344 Pro Pro Pro Pro His His Pro Met 1 5 <210> 345 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 345 Arg Pro Ala Leu Val Gln Pro Arg 1 5 <210> 346 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 346 Arg Pro Pro Leu Tyr Gln Pro Gly 1 5 <210> 347 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 347 Arg Thr Pro Pro Trp Gln Pro Glu 1 5 <210> 348 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 348 Pro Pro Pro Pro His His Pro Gln 1 5 <210> 349 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 349 Pro Pro Gly Pro Ala His Pro Gln 1 5 <210> 350 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 350 Leu Ala Gly Trp Gly Met Ser 1 5 <210> 351 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 351 Thr Pro Pro Arg Thr Gln Pro Pro 1 5 <210> 352 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 352 Pro Pro Ala Pro Tyr His Pro Met 1 5 <210> 353 <211> 4 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 353 Gly Val Phe Gly 1 <210> 354 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 354 Gly Ser Thr Leu Thr Arg Ile 1 5 <210> 355 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 355 Leu Gly Ala Phe Gly 1 5 <210> 356 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 356 Gly Ser Ala Gly Thr Asn Thr 1 5 <210> 357 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 357 Gly Ser Ala His Thr Asp Thr 1 5 <210> 358 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 358 Gly Ser Ala Ile Thr Asp Thr 1 5 <210> 359 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 359 Gly Ser Ala Ile Thr Asn Thr 1 5 <210> 360 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 360 Gly Ser Ala Lys Thr Asp Thr 1 5 <210> 361 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 361 Gly Ser Ala Lys Thr Gly Thr 1 5 <210> 362 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 362 Gly Ser Ala Lys Thr Asn Thr 1 5 <210> 363 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 363 Gly Ser Ala Leu Thr Asp Thr 1 5 <210> 364 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 364 Gly Ser Ala Leu Thr Arg Thr 1 5 <210> 365 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 365 Gly Ser Ala Leu Thr Ser Leu 1 5 <210> 366 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 366 Gly Ser Ala Arg Gly Arg Val 1 5 <210> 367 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 367 Gly Ser Ala Arg Thr Asp Thr 1 5 <210> 368 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 368 Gly Ser Ala Arg Thr Gly Thr 1 5 <210> 369 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 369 Gly Ser Ala Val Thr Asn Thr 1 5 <210> 370 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 370 Gly Ser Ala Tyr Thr Asn Thr 1 5 <210> 371 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 371 Gly Ser Gly Leu Thr Asp Thr 1 5 <210> 372 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 372 Gly Ser Gly Trp Thr Gly Leu 1 5 <210> 373 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 373 Gly Ser Lys Leu Thr Asp Thr 1 5 <210> 374 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 374 Gly Ser Lys Val Ser Gly Leu 1 5 <210> 375 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 375 Gly Ser Lys Val Thr Glu Thr 1 5 <210> 376 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 376 Gly Ser Leu Lys Thr Asp Thr 1 5 <210> 377 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 377 Gly Ser Leu Lys Thr Gln Thr 1 5 <210> 378 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 378 Gly Ser Leu Leu Thr Asp Thr 1 5 <210> 379 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 379 Gly Ser Leu Leu Thr Asn Thr 1 5 <210> 380 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 380 Gly Ser Leu Arg Ser Asp Leu 1 5 <210> 381 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 381 Gly Ser Leu Arg Thr Asp Thr 1 5 <210> 382 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 382 Gly Ser Leu Arg Thr Gly Thr 1 5 <210> 383 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 383 Gly Ser Leu Arg Thr His Thr 1 5 <210> 384 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 384 Gly Ser Leu Arg Thr Asn Thr 1 5 <210> 385 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 385 Gly Ser Met Leu Thr Asp Thr 1 5 <210> 386 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 386 Gly Ser Met Arg Thr Asp Thr 1 5 <210> 387 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 387 Gly Ser Asn His Thr Asp Thr 1 5 <210> 388 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 388 Gly Ser Pro Ile Thr Asp Thr 1 5 <210> 389 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 389 Gly Ser Pro Ile Thr Asn Thr 1 5 <210> 390 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 390 Gly Ser Pro Lys Thr Asp Thr 1 5 <210> 391 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 391 Gly Ser Pro Lys Thr Gly Thr 1 5 <210> 392 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 392 Gly Ser Pro Lys Thr His Thr 1 5 <210> 393 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 393 Gly Ser Pro Leu Thr Asp Thr 1 5 <210> 394 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 394 Gly Ser Pro Leu Thr Asn Thr 1 5 <210> 395 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 395 Gly Ser Pro Arg Thr Asp Thr 1 5 <210> 396 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 396 Gly Ser Pro Arg Thr Glu Thr 1 5 <210> 397 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 397 Gly Ser Pro Arg Thr Gly Thr 1 5 <210> 398 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 398 Gly Ser Pro Arg Thr His Thr 1 5 <210> 399 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 399 Gly Ser Pro Arg Thr Asn Thr 1 5 <210> 400 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 400 Gly Ser Pro Arg Thr Gln Thr 1 5 <210> 401 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 401 Gly Ser Pro Arg Thr Ser Val 1 5 <210> 402 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 402 Gly Ser Pro Thr Thr Asp Thr 1 5 <210> 403 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 403 Gly Ser Pro Val Asn Asp Val 1 5 <210> 404 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 404 Gly Ser Pro Val Thr Asp Thr 1 5 <210> 405 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 405 Gly Ser Pro Val Thr Gly Thr 1 5 <210> 406 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 406 Gly Ser Pro Val Thr Asn Thr 1 5 <210> 407 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 407 Gly Ser Gln Leu Thr Asp Thr 1 5 <210> 408 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 408 Gly Ser Gln Leu Thr Asn Thr 1 5 <210> 409 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 409 Gly Ser Gln Arg Thr Asp Thr 1 5 <210> 410 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 410 Gly Ser Arg Ala Thr Asp Thr 1 5 <210> 411 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 411 Gly Ser Arg His Thr Asp Thr 1 5 <210> 412 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 412 Gly Ser Arg Leu Thr Asp Thr 1 5 <210> 413 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 413 Gly Ser Arg Leu Thr Asn Thr 1 5 <210> 414 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 414 Gly Ser Arg Arg Thr Asp Thr 1 5 <210> 415 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 415 Gly Ser Ser Ile Thr Asp Thr 1 5 <210> 416 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 416 Gly Ser Ser Lys Thr Asn Thr 1 5 <210> 417 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 417 Gly Ser Ser Leu Thr Asp Thr 1 5 <210> 418 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 418 Gly Ser Ser Leu Thr Asn Thr 1 5 <210> 419 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 419 Gly Ser Ser Arg Thr Asp Thr 1 5 <210> 420 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 420 Gly Ser Ser Arg Thr Asn Thr 1 5 <210> 421 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 421 Gly Ser Ser Val Thr Asn Thr 1 5 <210> 422 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 422 Gly Ser Ser Val Thr Ser Thr 1 5 <210> 423 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 423 Gly Ser Thr Leu Thr Asp Thr 1 5 <210> 424 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 424 Gly Ser Thr Leu Thr Asn Thr 1 5 <210> 425 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 425 Gly Ser Thr Met Thr Gln Thr 1 5 <210> 426 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 426 Gly Ser Thr Arg Thr Asp Thr 1 5 <210> 427 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 427 Gly Ser Thr Arg Thr Asn Thr 1 5 <210> 428 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 428 Gly Ser Val Ile Thr Gly Thr 1 5 <210> 429 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 429 Gly Ser Val Ile Thr Asn Thr 1 5 <210> 430 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 430 Gly Ser Val Lys Thr Asp Thr 1 5 <210> 431 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 431 Gly Ser Val Leu Thr Asp Thr 1 5 <210> 432 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 432 Gly Ser Val Leu Thr Gly Thr 1 5 <210> 433 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 433 Gly Ser Val Leu Thr Asn Thr 1 5 <210> 434 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 434 Gly Ser Val Arg Thr Asp Thr 1 5 <210> 435 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 435 Gly Ser Val Arg Thr Glu Ser 1 5 <210> 436 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 436 Gly Ser Val Arg Thr Gly Thr 1 5 <210> 437 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 437 Gly Ser Val Arg Thr Asn Thr 1 5 <210> 438 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 438 Gly Ser Val Thr Thr Asp Thr 1 5 <210> 439 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 439 Gly Ser Trp Gly Ser Gly Ile 1 5 <210> 440 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 440 Gly Ser Trp Leu Thr Asp Thr 1 5 <210> 441 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 441 Gly Ser Tyr Leu Thr Asn Thr 1 5 <210> 442 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 442 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 443 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 443 Ala Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 444 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 444 Asn Ala Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 445 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 445 Asn Gly Ala Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 446 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 446 Asn Gly Ser Ala Ala Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 447 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 447 Asn Gly Ser Ala Leu Ala Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 448 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 448 Asn Gly Ser Ala Leu Thr Ala Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 449 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 449 Asn Gly Ser Ala Leu Thr Asn Ala Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 450 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 450 Asn Gly Ser Ala Leu Thr Asn Thr Ala Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 451 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 451 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Ala 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 452 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 452 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Ala Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 453 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 453 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Ala Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 454 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 454 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Ala Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 455 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 455 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Ala Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 456 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 456 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Ala Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 457 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 457 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Ala Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 458 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 458 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ala Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 459 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 459 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Ala Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 460 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 460 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Ala Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 461 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 461 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Ala Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 462 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 462 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Ala 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 463 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 463 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Ala Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 464 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 464 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Ala Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 465 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 465 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Ala Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg 50 55 <210> 466 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 466 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Ala Cys Arg 50 55 <210> 467 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 467 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly 20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala 35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Ala 50 55 <210> 468 <211> 17 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (4)..(11) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (14)..(16) <223> Xaa can be any naturally occurring amino acid <400> 468 Asn Trp Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Gly Xaa Xaa Xaa 1 5 10 15 Asn <210> 469 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 469 Ala Ala Glu Gly Thr 1 5 <210> 470 <211> 21 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (1)..(2) <223> n is a, c, g, or t <220> <221> misc_feature <222> (3)..(3) <223> k is g or t <220> <221> misc_feature <222> (4)..(5) <223> n is a, c, g, or t <220> <221> misc_feature <222> (6)..(6) <223> k is g or t <220> <221> misc_feature <222> (7)..(8) <223> n is a, c, g, or t <220> <221> misc_feature <222> (9)..(9) <223> k is g or t <220> <221> misc_feature <222> (10)..(11) <223> n is a, c, g, or t <220> <221> misc_feature <222> (12)..(12) <223> k is g or t <220> <221> misc_feature <222> (13)..(14) <223> n is a, c, g, or t <220> <221> misc_feature <222> (15)..(15) <223> k is g or t <220> <221> misc_feature <222> (16)..(17) <223> n is a, c, g, or t <220> <221> misc_feature <222> (18)..(18) <223> k is g or t <220> <221> misc_feature <222> (19)..(20) <223> n is a, c, g, or t <220> <221> misc_feature <222> (21)..(21) <223> k is g or t <400> 470 nnknnknnkn nknnknnknn k 21 <210> 471 <211> 15 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (1)..(2) <223> n is a, c, g, or t <220> <221> misc_feature <222> (3)..(3) <223> k is g or t <220> <221> misc_feature <222> (4)..(5) <223> n is a, c, g, or t <220> <221> misc_feature <222> (6)..(6) <223> k is g or t <220> <221> misc_feature <222> (7)..(8) <223> n is a, c, g, or t <220> <221> misc_feature <222> (9)..(9) <223> k is g or t <220> <221> misc_feature <222> (10)..(11) <223> n is a, c, g, or t <220> <221> misc_feature <222> (12)..(12) <223> k is g or t <220> <221> misc_feature <222> (13)..(14) <223> n is a, c, g, or t <220> <221> misc_feature <222> (15)..(15) <223> k is g or t <400> 471 nnknnknnkn nknnk 15 <210> 472 <211> 12 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (1)..(2) <223> n is a, c, g, or t <220> <221> misc_feature <222> (3)..(3) <223> k is g or t <220> <221> misc_feature <222> (4)..(5) <223> n is a, c, g, or t <220> <221> misc_feature <222> (6)..(6) <223> k is g or t <220> <221> misc_feature <222> (7)..(8) <223> n is a, c, g, or t <220> <221> misc_feature <222> (9)..(9) <223> k is g or t <220> <221> misc_feature <222> (10)..(11) <223> n is a, c, g, or t <220> <221> misc_feature <222> (12)..(12) <223> k is g or t <400> 472 nnknnknnkn nk 12 <210> 473 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 473 Asp Met Ala Ala Glu Gly Thr 1 5 <210> 474 <211> 21 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (1)..(2) <223> n is a, c, g, or t <220> <221> misc_feature <222> (3)..(3) <223> s is g or c <220> <221> misc_feature <222> (4)..(5) <223> n is a, c, g, or t <220> <221> misc_feature <222> (6)..(6) <223> s is g or c <220> <221> misc_feature <222> (7)..(8) <223> n is a, c, g, or t <220> <221> misc_feature <222> (9)..(9) <223> s is g or c <220> <221> misc_feature <222> (10)..(11) <223> n is a, c, g, or t <220> <221> misc_feature <222> (12)..(12) <223> s is g or c <220> <221> misc_feature <222> (13)..(14) <223> n is a, c, g, or t <220> <221> misc_feature <222> (15)..(15) <223> s is g or c <220> <221> misc_feature <222> (16)..(17) <223> n is a, c, g, or t <220> <221> misc_feature <222> (18)..(18) <223> s is g or c <220> <221> misc_feature <222> (19)..(20) <223> n is a, c, g, or t <220> <221> misc_feature <222> (21)..(21) <223> s is g or c <400> 474 nnsnnsnnsn nsnnsnnsnn s 21 <210> 475 <211> 15 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (1)..(2) <223> n is a, c, g, or t <220> <221> misc_feature <222> (3)..(3) <223> s is g or c <220> <221> misc_feature <222> (4)..(5) <223> n is a, c, g, or t <220> <221> misc_feature <222> (6)..(6) <223> s is g or c <220> <221> misc_feature <222> (7)..(8) <223> n is a, c, g, or t <220> <221> misc_feature <222> (9)..(9) <223> s is g or c <220> <221> misc_feature <222> (10)..(11) <223> n is a, c, g, or t <220> <221> misc_feature <222> (12)..(12) <223> s is g or c <220> <221> misc_feature <222> (13)..(14) <223> n is a, c, g, or t <220> <221> misc_feature <222> (15)..(15) <223> s is g or c <400> 475 nnsnnsnnsn nsnns 15 <210> 476 <211> 18 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (1)..(2) <223> n is a, c, g, or t <220> <221> misc_feature <222> (3)..(3) <223> k is g or t <220> <221> misc_feature <222> (4)..(5) <223> n is a, c, g, or t <220> <221> misc_feature <222> (6)..(6) <223> k is g or t <220> <221> misc_feature <222> (7)..(8) <223> n is a, c, g, or t <220> <221> misc_feature <222> (9)..(9) <223> k is g or t <220> <221> misc_feature <222> (10)..(11) <223> n is a, c, g, or t <220> <221> misc_feature <222> (12)..(12) <223> k is g or t <220> <221> misc_feature <222> (13)..(14) <223> n is a, c, g, or t <220> <221> misc_feature <222> (15)..(15) <223> k is g or t <220> <221> misc_feature <222> (16)..(17) <223> n is a, c, g, or t <220> <221> misc_feature <222> (18)..(18) <223> k is g or t <400> 476 nnknnknnkn nknnknnk 18 <210> 477 <211> 13 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 477 Thr Glu Ile Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu 1 5 10 <210> 478 <211> 13 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> MISC_FEATURE <222> (1)..(8) <223> Xaa represents a random amino acid encoded by the nucleotide sequence nnk (n is a, c, g, or t; k is g or t) <220> <221> MISC_FEATURE <222> (11)..(13) <223> Xaa represents a random amino acid encoded by the nucleotide sequence nnk (n is a, c, g, or t; k is g or t) <400> 478 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Gly Xaa Xaa Xaa 1 5 10 <210> 479 <211> 6 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 479 Glu Thr Glu Ile Thr Ala 1 5 <210> 480 <211> 24 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (1)..(2) <223> n is a, c, g, or t <220> <221> misc_feature <222> (3)..(3) <223> k is g or t <220> <221> misc_feature <222> (4)..(5) <223> n is a, c, g, or t <220> <221> misc_feature <222> (6)..(6) <223> k is g or t <220> <221> misc_feature <222> (7)..(8) <223> n is a, c, g, or t <220> <221> misc_feature <222> (9)..(9) <223> k is g or t <220> <221> misc_feature <222> (10)..(11) <223> n is a, c, g, or t <220> <221> misc_feature <222> (12)..(12) <223> k is g or t <220> <221> misc_feature <222> (13)..(14) <223> n is a, c, g, or t <220> <221> misc_feature <222> (15)..(15) <223> k is g or t <220> <221> misc_feature <222> (16)..(17) <223> n is a, c, g, or t <220> <221> misc_feature <222> (18)..(18) <223> k is g or t <220> <221> misc_feature <222> (19)..(20) <223> n is a, c, g, or t <220> <221> misc_feature <222> (21)..(21) <223> k is g or t <220> <221> misc_feature <222> (22)..(23) <223> n is a, c, g, or t <220> <221> misc_feature <222> (24)..(24) <223> k is g or t <400> 480 nnknnknnkn nknnknnknn knnk 24 <210> 481 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 481 Thr Glu Ile Thr Ala 1 5 <210> 482 <211> 30 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (9)..(13) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (16)..(16) <223> Xaa can be any naturally occurring amino acid <220> <221> MISC_FEATURE <222> (19)..(19) <223> Xaa is Phe or Leu <220> <221> MISC_FEATURE <222> (21)..(21) <223> Xaa is Phe, Tyr, Val, or Asp <220> <221> MISC_FEATURE <222> (22)..(22) <223> Xaa is Phe, Trp, Leu, or Cys <400> 482 Arg Ile Ala Tyr Lys Asn Trp Glu Xaa Xaa Xaa Xaa Xaa Gln Pro Xaa 1 5 10 15 Gly Gly Xaa Gly Xaa Xaa Gly Glu Asn Cys Ala Val Leu Ser 20 25 30 <210> 483 <211> 125 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 483 His Met Lys Cys Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu 1 5 10 15 Ala Ser Glu Asp Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln 20 25 30 Thr Gly Ser Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val 35 40 45 Gly Asn Glu Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu 50 55 60 Gly Thr Trp Val Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp 65 70 75 80 Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu Asn Cys 85 90 95 Ala Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys 100 105 110 Arg Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val 115 120 125 <210> 484 <211> 114 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 484 Gly Asn Lys Phe Phe Leu Thr Asn Gly Glu Ile Met Thr Phe Glu Lys 1 5 10 15 Val Lys Ala Leu Cys Val Lys Phe Gln Ala Ser Val Ala Thr Pro Arg 20 25 30 Asn Ala Ala Glu Asn Gly Ala Ile Gln Asn Leu Ile Lys Glu Glu Ala 35 40 45 Phe Leu Gly Ile Thr Asp Glu Lys Thr Glu Gly Gln Phe Val Asp Leu 50 55 60 Thr Gly Asn Arg Leu Thr Tyr Thr Asn Trp Asn Glu Gly Glu Pro Asn 65 70 75 80 Asn Ala Gly Ser Asp Glu Asp Cys Val Leu Leu Leu Lys Asn Gly Gln 85 90 95 Trp Asn Asp Val Pro Cys Ser Thr Ser His Leu Ala Val Cys Glu Phe 100 105 110 Pro Ile <210> 485 <211> 112 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 485 Lys Tyr Phe Met Ser Ser Val Arg Arg Met Pro Leu Asn Arg Ala Lys 1 5 10 15 Ala Leu Cys Ser Glu Leu Gln Gly Thr Val Ala Thr Pro Arg Asn Ala 20 25 30 Glu Glu Asn Arg Ala Ile Gln Asn Val Ala Lys Asp Val Ala Phe Leu 35 40 45 Gly Ile Thr Asp Gln Arg Thr Glu Asn Val Phe Glu Asp Leu Thr Gly 50 55 60 Asn Arg Val Arg Tyr Thr Asn Trp Asn Glu Gly Glu Pro Asn Asn Val 65 70 75 80 Gly Ser Gly Glu Asn Cys Val Val Leu Leu Thr Asn Gly Lys Trp Asn 85 90 95 Asp Val Pro Cys Ser Asp Ser Phe Leu Val Val Cys Glu Phe Ser Asp 100 105 110 <210> 486 <211> 115 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 486 Gly Glu Lys Ile Phe Lys Thr Ala Gly Phe Val Lys Pro Phe Thr Glu 1 5 10 15 Ala Gln Leu Leu Cys Thr Gln Ala Gly Gly Gln Leu Ala Ser Pro Arg 20 25 30 Ser Ala Ala Glu Asn Ala Ala Leu Gln Gln Leu Val Val Ala Lys Asn 35 40 45 Glu Ala Ala Phe Leu Ser Met Thr Asp Ser Lys Thr Glu Gly Lys Phe 50 55 60 Thr Tyr Pro Thr Gly Glu Ser Leu Val Tyr Ser Asn Trp Ala Pro Gly 65 70 75 80 Glu Pro Asn Asp Asp Gly Gly Ser Glu Asp Cys Val Glu Ile Phe Thr 85 90 95 Asn Gly Lys Trp Asn Asp Arg Ala Cys Gly Glu Lys Arg Leu Val Val 100 105 110 Cys Glu Phe 115 <210> 487 <211> 114 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 487 Gly Lys Lys Phe Phe Val Thr Asn His Glu Arg Met Pro Phe Ser Lys 1 5 10 15 Val Lys Ala Leu Cys Ser Glu Leu Arg Gly Thr Val Ala Ile Pro Arg 20 25 30 Asn Ala Glu Glu Asn Lys Ala Ile Gln Glu Val Ala Lys Thr Ser Ala 35 40 45 Phe Leu Gly Ile Thr Asp Glu Val Thr Glu Gly Gln Phe Met Tyr Val 50 55 60 Thr Gly Gly Arg Leu Thr Tyr Ser Asn Trp Lys Lys Asp Glu Pro Asn 65 70 75 80 Asp Val Gly Ser Gly Glu Asp Cys Val Thr Ile Val Asp Asn Gly Leu 85 90 95 Trp Asn Asp Val Ser Cys Gln Ala Ser His Thr Ala Val Cys Glu Phe 100 105 110 Pro Ala <210> 488 <211> 114 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 488 Gly Asp Lys Val Phe Ser Thr Asn Gly Gln Ser Val Asn Phe Asp Thr 1 5 10 15 Ile Lys Glu Met Cys Thr Arg Ala Gly Gly Asn Ile Ala Val Pro Arg 20 25 30 Thr Pro Glu Glu Asn Glu Ala Ile Ala Ser Ile Ala Lys Lys Tyr Asn 35 40 45 Asn Tyr Val Tyr Leu Gly Met Ile Glu Asp Gln Thr Pro Gly Asp Phe 50 55 60 His Tyr Leu Asp Gly Ala Ser Val Ser Tyr Thr Asn Trp Tyr Pro Gly 65 70 75 80 Glu Pro Arg Gly Gln Gly Lys Glu Lys Cys Val Glu Met Tyr Thr Asp 85 90 95 Gly Thr Trp Asn Asp Arg Gly Cys Leu Gln Tyr Arg Leu Ala Val Cys 100 105 110 Glu Phe <210> 489 <211> 119 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 489 Thr Lys Phe Gln Gly His Cys Tyr Arg His Phe Pro Asp Arg Glu Thr 1 5 10 15 Trp Val Asp Ala Glu Arg Arg Cys Arg Glu Gln Gln Ser His Leu Ser 20 25 30 Ser Ile Val Thr Pro Glu Glu Gln Glu Phe Val Asn Lys Asn Ala Gln 35 40 45 Asp Tyr Gln Trp Ile Gly Leu Asn Asp Arg Thr Ile Glu Gly Asp Phe 50 55 60 Arg Trp Ser Asp Gly His Ser Leu Gln Phe Glu Lys Trp Arg Pro Asn 65 70 75 80 Gln Pro Asp Asn Phe Phe Ala Thr Gly Glu Asp Cys Val Val Met Ile 85 90 95 Trp His Glu Arg Gly Glu Trp Asn Asp Val Pro Cys Asn Tyr Gln Leu 100 105 110 Pro Phe Thr Cys Lys Lys Gly 115 <210> 490 <211> 127 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 490 Ser His Cys Tyr Ala Leu Phe Leu Ser Pro Lys Ser Trp Thr Asp Ala 1 5 10 15 Asp Leu Ala Cys Gln Lys Arg Pro Ser Gly Asn Leu Val Ser Val Leu 20 25 30 Ser Gly Ala Glu Gly Ser Phe Val Ser Ser Leu Val Lys Ser Ile Gly 35 40 45 Asn Ser Tyr Ser Tyr Val Trp Ile Gly Leu His Asp Pro Thr Gln Gly 50 55 60 Thr Glu Pro Asn Gly Glu Gly Trp Glu Trp Ser Ser Ser Asp Val Met 65 70 75 80 Asn Tyr Phe Ala Trp Glu Arg Asn Pro Ser Thr Ile Ser Ser Pro Gly 85 90 95 His Cys Ala Ser Leu Ser Arg Ser Thr Ala Phe Leu Arg Trp Lys Asp 100 105 110 Tyr Asn Cys Asn Val Arg Leu Pro Tyr Val Cys Lys Phe Thr Asp 115 120 125 <210> 491 <211> 119 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 491 Asp Lys Cys Tyr Tyr Phe Ser Val Glu Lys Glu Ile Phe Glu Asp Ala 1 5 10 15 Lys Leu Phe Cys Glu Asp Lys Ser Ser His Leu Val Phe Ile Asn Thr 20 25 30 Arg Glu Glu Gln Gln Trp Ile Lys Lys Gln Met Val Gly Arg Glu Ser 35 40 45 His Trp Ile Gly Leu Thr Asp Ser Glu Arg Glu Asn Glu Trp Lys Trp 50 55 60 Leu Asp Gly Thr Ser Pro Asp Tyr Lys Asn Trp Lys Ala Gly Gln Pro 65 70 75 80 Asp Asn Trp Gly His Gly His Gly Pro Gly Glu Asp Cys Ala Gly Leu 85 90 95 Ile Tyr Ala Gly Gln Trp Asn Asp Phe Gln Cys Glu Asp Val Asn Asn 100 105 110 Phe Ile Cys Glu Lys Asp Arg 115 <210> 492 <211> 120 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 492 Asp Lys Cys Tyr Tyr Phe Ser Leu Glu Lys Glu Ile Phe Glu Asp Ala 1 5 10 15 Lys Leu Phe Cys Glu Asp Lys Ser Ser His Leu Val Phe Ile Asn Ser 20 25 30 Arg Glu Glu Gln Gln Trp Ile Lys Lys His Thr Val Gly Arg Glu Ser 35 40 45 His Trp Ile Gly Leu Thr Asp Ser Glu Gln Glu Ser Glu Trp Lys Trp 50 55 60 Leu Asp Gly Ser Pro Val Asp Tyr Lys Asn Trp Lys Ala Gly Gln Pro 65 70 75 80 Asp Asn Trp Gly Ser Gly His Gly Pro Gly Glu Asp Cys Ala Gly Leu 85 90 95 Ile Tyr Ala Gly Gln Trp Asn Asp Phe Gln Cys Asp Glu Ile Asn Asn 100 105 110 Phe Ile Cys Glu Lys Glu Arg Glu 115 120 <210> 493 <211> 121 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 493 Gly Asn Cys Tyr Phe Met Ser Asn Ser Gln Arg Asn Trp His Asp Ser 1 5 10 15 Val Thr Ala Cys Gln Glu Val Arg Ala Gln Leu Val Val Ile Lys Thr 20 25 30 Ala Glu Glu Gln Asn Phe Leu Gln Leu Gln Thr Ser Arg Ser Asn Arg 35 40 45 Phe Ser Trp Met Gly Leu Ser Asp Leu Asn Gln Glu Gly Thr Trp Gln 50 55 60 Trp Val Asp Gly Ser Pro Leu Ser Pro Ser Phe Gln Arg Tyr Trp Asn 65 70 75 80 Ser Gly Glu Pro Asn Asn Ser Gly Asn Glu Asp Cys Ala Glu Phe Ser 85 90 95 Gly Ser Gly Trp Asn Asp Asn Arg Cys Asp Val Asp Asn Tyr Trp Ile 100 105 110 Cys Lys Lys Pro Ala Ala Cys Phe Arg 115 120 <210> 494 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 494 Pro Ala Arg Gln Pro Gln Pro Gly 1 5                          SEQUENCE LISTING <110> Kretz-Rommel, Anke        Wild, Martha A.        Bowdish, Katherine        Chen, Elise        Oltean, Daniela        Gonzalez, Maria        Kapoor, Mili   <120> POLYPEPTIDES THAT BIND IL-23R <130> 10-090 <140> US 12/703757 <141> 2010-02-10 <150> PCT / US09 / 60271 <151> 2009-10-09 <150> US 12 / 577,067 <151> 2009-10-09 <160> 494 <170> PatentIn version 3.5 <210> 1 <211> 189 <212> PRT <213> Homo sapiens <400> 1 Met Leu Gly Ser Arg Ala Val Met Leu Leu Leu Leu Leu Pro Trp Thr 1 5 10 15 Ala Gln Gly Arg Ala Val Pro Gly Gly Ser Ser Pro Ala Trp Thr Gln             20 25 30 Cys Gln Gln Leu Ser Gln Lys Leu Cys Thr Leu Ala Trp Ser Ala His         35 40 45 Pro Leu Val Gly His Met Asp Leu Arg Glu Glu Gly Asp Glu Glu Thr     50 55 60 Thr Asn Asp Val Pro His Ile Gln Cys Gly Asp Gly Cys Asp Pro Gln 65 70 75 80 Gly Leu Arg Asp Asn Ser Gln Phe Cys Leu Gln Arg Ile His Gln Gly                 85 90 95 Leu Ile Phe Tyr Glu Lys Leu Leu Gly Ser Asp Ile Phe Thr Gly Glu             100 105 110 Pro Ser Leu Leu Pro Asp Ser Pro Val Gly Gln Leu His Ala Ser Leu         115 120 125 Leu Gly Leu Ser Gln Leu Leu Gln Pro Glu Gly His His Trp Glu Thr     130 135 140 Gln Gln Ile Pro Ser Leu Ser Pro Ser Gln Pro Trp Gln Arg Leu Leu 145 150 155 160 Leu Arg Phe Lys Ile Leu Arg Ser Leu Gln Ala Phe Val Ala Val Ala                 165 170 175 Ala Arg Val Phe Ala His Gly Ala Ala Thr Leu Ser Pro             180 185 <210> 2 <211> 357 <212> PRT <213> Homo sapiens <400> 2 Met Cys His Gln Gln Leu Val Ile Ser Trp Phe Ser Leu Val Phe Leu 1 5 10 15 Ala Ser Pro Leu Val Ala Ile Trp Glu Leu Lys Lys Asp Val Tyr Val             20 25 30 Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu         35 40 45 Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile Thr Trp Thr Leu Asp Gln     50 55 60 Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr Ile Gln Val Lys 65 70 75 80 Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys His Lys Gly Gly Glu Val                 85 90 95 Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly Ile Trp             100 105 110 Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu Pro Lys Asn Lys Thr Phe         115 120 125 Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp     130 135 140 Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 145 150 155 160 Gly Ser Ser Asp Pro Gln Gly Val Thr Cys Gly Ala Ala Thr Leu Ser                 165 170 175 Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu             180 185 190 Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala Glu Glu Ser Leu Pro Ile         195 200 205 Glu Val Met Val Asp Ala Val His Lys Leu Lys Tyr Glu Asn Tyr Thr     210 215 220 Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys Pro Asp Pro Pro Lys Asn 225 230 235 240 Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg Gln Val Glu Val Ser Trp                 245 250 255 Glu Tyr Pro Asp Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr             260 265 270 Phe Cys Val Gln Val Gln Gly Lys Ser Lys Arg Glu Lys Lys Asp Arg         275 280 285 Val Phe Thr Asp Lys Thr Ser Ala Thr Val Ile Cys Arg Lys Asn Ala     290 295 300 Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 305 310 315 320 Glu Trp Ala Ser Val Pro Cys Ser Val Asn Glu Glu Leu Pro Ser Ile                 325 330 335 Asn Thr Tyr Phe Pro Gln Asn Ile Leu Glu Ser His Phe Asn Arg Ile             340 345 350 Ser Leu Leu Glu Lys         355 <210> 3 <211> 253 <212> PRT <213> Homo sapiens <400> 3 Met Trp Pro Pro Gly Ser Ala Ser Gln Pro Pro Pro Ser Pro Ala Ala 1 5 10 15 Ala Thr Gly Leu His Pro Ala Ala Arg Pro Val Ser Leu Gln Cys Arg             20 25 30 Leu Ser Met Cys Pro Ala Arg Ser Leu Leu Leu Val Ala Thr Leu Val         35 40 45 Leu Leu Asp His Leu Ser Leu Ala Arg Asn Leu Pro Val Ala Thr Pro     50 55 60 Asp Pro Gly Met Phe Pro Cys Leu His His Ser Gln Asn Leu Leu Arg 65 70 75 80 Ala Val Ser Asn Met Leu Gln Lys Ala Arg Gln Thr Leu Glu Phe Tyr                 85 90 95 Pro Cys Thr Ser Glu Glu Ile Asp His Glu Asp Ile Thr Lys Asp Lys             100 105 110 Thr Ser Thr Val Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys Asn Glu         115 120 125 Ser Cys Leu Asn Ser Arg Glu Thr Ser Phe Ile Thr Asn Gly Ser Cys     130 135 140 Leu Ala Ser Arg Lys Thr Ser Phe Met Met Ala Leu Cys Leu Ser Ser 145 150 155 160 Ile Tyr Glu Asp Leu Lys Met Tyr Gln Val Glu Phe Lys Thr Met Asn                 165 170 175 Ala Lys Leu Leu Met Asp Pro Lys Arg Gln Ile Phe Leu Asp Gln Asn             180 185 190 Met Leu Ala Val Ile Asp Glu Leu Met Gln Ala Leu Asn Phe Asn Ser         195 200 205 Glu Thr Val Pro Gln Lys Ser Ser Leu Glu Glu Pro Asp Phe Tyr Lys     210 215 220 Thr Lys Ile Lys Leu Cys Ile Leu Leu His Ala Phe Arg Ile Arg Ala 225 230 235 240 Val Thr Ile Asp Arg Val Met Ser Tyr Leu Asn Ala Ser                 245 250 <210> 4 <211> 155 <212> PRT <213> Homo sapiens <400> 4 Met Thr Pro Gly Lys Thr Ser Leu Val Ser Leu Leu Leu Leu Leu Ser 1 5 10 15 Leu Glu Ala Ile Val Lys Ala Gly Ile Thr Ile Pro Arg Asn Pro Gly             20 25 30 Cys Pro Asn Ser Glu Asp Lys Asn Phe Pro Arg Thr Val Met Val Asn         35 40 45 Leu Asn Ile His Asn Arg Asn Thr Asn Thr Asn Pro Lys Arg Ser Ser     50 55 60 Asp Tyr Tyr Asn Arg Ser Thr Ser Pro Trp Asn Leu His Arg Asn Glu 65 70 75 80 Asp Pro Glu Arg Tyr Pro Ser Val Ile Trp Glu Ala Lys Cys Arg His                 85 90 95 Leu Gly Cys Ile Asn Ala Asp Gly Asn Val Asp Tyr His Met Asn Ser             100 105 110 Val Pro Ile Gln Gln Glu Ile Leu Val Leu Arg Arg Glu Pro Pro His         115 120 125 Cys Pro Asn Ser Phe Arg Leu Glu Lys Ile Leu Val Ser Val Gly Cys     130 135 140 Thr Cys Val Thr Pro Ile Val His His Val Ala 145 150 155 <210> 5 <211> 227 <212> PRT <213> Homo sapiens <400> 5 Met Lys Asn Ser Asn Val Val Lys Met Leu Gln Glu Asn Ser Glu Leu 1 5 10 15 Met Asn Asn Asn Ser Ser Glu Gln Val Leu Tyr Val Asp Pro Met Ile             20 25 30 Thr Glu Ile Lys Glu Ile Phe Ile Pro Glu His Lys Pro Thr Asp Tyr         35 40 45 Lys Lys Glu Asn Thr Gly Pro Leu Glu Thr Arg Asp Tyr Pro Gln Asn     50 55 60 Ser Leu Phe Asp Asn Thr Thr Val Val Tyr Ile Pro Asp Leu Asn Thr 65 70 75 80 Gly Tyr Lys Pro Gln Ile Ser Asn Phe Leu Pro Glu Gly Ser His Leu                 85 90 95 Ser Asn Asn Asn Glu Ile Thr Ser Leu Thr Leu Lys Pro Pro Val Asp             100 105 110 Ser Leu Asp Ser Gly Asn Asn Pro Arg Leu Gln Lys His Pro Asn Phe         115 120 125 Ala Phe Ser Val Ser Ser Val Asn Ser Leu Ser Asn Thr Ile Phe Leu     130 135 140 Gly Glu Leu Ser Leu Ile Leu Asn Gln Gly Glu Cys Ser Ser Pro Asp 145 150 155 160 Ile Gln Asn Ser Val Glu Glu Glu Thr Thr Met Leu Leu Glu Asn Asp                 165 170 175 Ser Pro Ser Glu Thr Ile Pro Glu Gln Thr Leu Leu Pro Asp Glu Phe             180 185 190 Val Ser Cys Leu Gly Ile Val Asn Glu Glu Leu Pro Ser Ile Asn Thr         195 200 205 Tyr Phe Pro Gln Asn Ile Leu Glu Ser His Phe Asn Arg Ile Ser Leu     210 215 220 Leu Glu Lys 225 <210> 6 <211> 660 <212> PRT <213> Homo sapiens <400> 6 Met Glu Pro Leu Val Thr Trp Val Val Pro Leu Leu Phe Leu Phe Leu 1 5 10 15 Leu Ser Arg Gln Gly Ala Ala Cys Arg Thr Ser Glu Cys Cys Phe Gln             20 25 30 Asp Pro Pro Tyr Pro Asp Ala Asp Ser Gly Ser Ala Ser Gly Pro Arg         35 40 45 Asp Leu Arg Cys Tyr Arg Ile Ser Ser Asp Arg Tyr Glu Cys Ser Trp     50 55 60 Gln Tyr Glu Gly Pro Thr Ala Gly Val Ser His Phe Leu Arg Cys Cys 65 70 75 80 Leu Ser Ser Gly Arg Cys Cys Tyr Phe Ala Ala Gly Ser Ala Thr Arg                 85 90 95 Leu Gln Phe Ser Asp Gln Ala Gly Val Ser Val Leu Tyr Thr Val Thr             100 105 110 Leu Trp Val Glu Ser Trp Ala Arg Asn Gln Thr Glu Lys Ser Pro Glu         115 120 125 Val Thr Leu Gln Leu Tyr Asn Ser Val Lys Tyr Glu Pro Pro Leu Gly     130 135 140 Asp Ile Lys Val Ser Lys Leu Ala Gly Gln Leu Arg Met Glu Trp Glu 145 150 155 160 Thr Pro Asp Gln Val Gly Ala Glu Val Gln Phe Arg His Arg Thr                 165 170 175 Pro Ser Ser Pro Trp Lys Leu Gly Asp Cys Gly Pro Gln Asp Asp Asp             180 185 190 Thr Glu Ser Cys Leu Cys Pro Leu Glu Met Asn Val Ala Gln Glu Phe         195 200 205 Gln Leu Arg Arg Arg Gln Leu Gly Ser Gln Gly Ser Ser Trp Ser Lys     210 215 220 Trp Ser Ser Pro Val Cys Val Pro Pro Glu Asn Pro Pro Gln Pro Gln 225 230 235 240 Val Arg Phe Ser Val Glu Gln Leu Gly Gln Asp Gly Arg Arg Arg Leu                 245 250 255 Thr Leu Lys Glu Gln Pro Thr Gln Leu Glu Leu Pro Glu Gly Cys Gln             260 265 270 Gly Leu Ala Pro Gly Thr Glu Val Thr Tyr Arg Leu Gln Leu His Met         275 280 285 Leu Ser Cys Pro Cys Lys Ala Lys Ala Thr Arg Thr Leu His Leu Gly     290 295 300 Lys Met Pro Tyr Leu Ser Gly Ala Ala Tyr Asn Val Ala Val Ile Ser 305 310 315 320 Ser Asn Gln Phe Gly Pro Gly Leu Asn Gln Thr Trp His Ile Pro Ala                 325 330 335 Asp Thr His Thr Glu Pro Val Ala Leu Asn Ile Ser Val Gly Thr Asn             340 345 350 Gly Thr Thr Met Tyr Trp Pro Ala Arg Ala Gln Ser Met Thr Tyr Cys         355 360 365 Ile Glu Trp Gln Pro Val Gly Gln Asp Gly Gly Leu Ala Thr Cys Ser     370 375 380 Leu Thr Ala Pro Gln Asp Pro Asp Pro Ala Gly Met Ala Thr Tyr Ser 385 390 395 400 Trp Ser Arg Glu Ser Gly Ala Met Gly Gln Glu Lys Cys Tyr Tyr Ile                 405 410 415 Thr Ile Phe Ala Ser Ala His Pro Glu Lys Leu Thr Leu Trp Ser Thr             420 425 430 Val Leu Ser Thr Tyr His Phe Gly Gly Asn Ala Ser Ala Ala Gly Thr         435 440 445 Pro His His Val Ser Val Lys Asn His Ser Leu Asp Ser Val Val Ser     450 455 460 Asp Trp Ala Pro Ser Leu Leu Ser Thr Cys Pro Gly Val Leu Lys Glu 465 470 475 480 Tyr Val Val Arg Cys Arg Asp Glu Asp Ser Lys Gln Val Ser Glu His                 485 490 495 Pro Val Gln Pro Thr Glu Thr Gln Val Thr Leu Ser Gly Leu Arg Ala             500 505 510 Gly Val Ala Tyr Thr Val Gln Val Arg Ala Asp Thr Ala Trp Leu Arg         515 520 525 Gly Val Trp Ser Gln Pro Gln Arg Phe Ser Ile Glu Val Gln Val Ser     530 535 540 Asp Trp Leu Ile Phe Phe Ala Ser Leu Gly Ser Phe Leu Ser Ile Leu 545 550 555 560 Leu Val Gly Val Leu Gly Tyr Leu Gly Leu Asn Arg Ala Ala Arg His                 565 570 575 Leu Cys Pro Pro Leu Pro Thr Pro Cys Ala Ser Ser Ala Ile Glu Phe             580 585 590 Pro Gly Gly Lys Glu Thr Trp Gln Trp Ile Asn Pro Val Asp Phe Gln         595 600 605 Glu Glu Ala Ser Leu Gln Glu Ala Leu Val Val Glu Met Ser Trp Asp     610 615 620 Lys Gly Glu Arg Thr Glu Pro Leu Glu Lys Thr Glu Leu Pro Glu Gly 625 630 635 640 Ala Pro Glu Leu Ala Leu Asp Thr Glu Leu Ser Leu Glu Asp Gly Asp                 645 650 655 Arg Cys Asp Arg             660 <210> 7 <211> 862 <212> PRT <213> Homo sapiens <400> 7 Met Ala His Thr Phe Arg Gly Cys Ser Leu Ala Phe Met Phe Ile Ile 1 5 10 15 Thr Trp Leu Leu Ile Lys Ala Lys Ile Asp Ala Cys Lys Arg Gly Asp             20 25 30 Val Thr Val Lys Pro Ser His Val Ile Leu Leu Gly Ser Thr Val Asn         35 40 45 Ile Thr Cys Ser Leu Lys Pro Arg Gln Gly Cys Phe His Tyr Ser Arg     50 55 60 Arg Asn Lys Leu Ile Leu Tyr Lys Phe Asp Arg Arg Ile Asn Phe His 65 70 75 80 His Gly His Ser Leu Asn Ser Gln Val Thr Gly Leu Pro Leu Gly Thr                 85 90 95 Thr Leu Phe Val Cys Lys Leu Ala Cys Ile Asn Ser Asp Glu Ile Gln             100 105 110 Ile Cys Gly Ala Glu Ile Phe Val Gly Val Ala Pro Glu Gln Pro Gln         115 120 125 Asn Leu Ser Cys Ile Gln Lys Gly Glu Gln Gly Thr Val Ala Cys Thr     130 135 140 Trp Glu Arg Gly Arg Asp Thr His Leu Tyr Thr Glu Tyr Thr Leu Gln 145 150 155 160 Leu Ser Gly Pro Lys Asn Leu Thr Trp Gln Lys Gln Cys Lys Asp Ile                 165 170 175 Tyr Cys Asp Tyr Leu Asp Phe Gly Ile Asn Leu Thr Pro Glu Ser Pro             180 185 190 Glu Ser Asn Phe Thr Ala Lys Val Thr Ala Val Asn Ser Leu Gly Ser         195 200 205 Ser Ser Ser Leu Pro Ser Thr Phe Thr Phe Leu Asp Ile Val Arg Pro     210 215 220 Leu Pro Pro Trp Asp Ile Arg Ile Lys Phe Gln Lys Ala Ser Val Ser 225 230 235 240 Arg Cys Thr Leu Tyr Trp Arg Asp Glu Gly Leu Val Leu Leu Asn Arg                 245 250 255 Leu Arg Tyr Arg Pro Ser Asn Ser Arg Leu Trp Asn Met Val Asn Val             260 265 270 Thr Lys Ala Lys Gly Arg His Asp Leu Leu Asp Leu Lys Pro Phe Thr         275 280 285 Glu Tyr Glu Phe Gln Ile Ser Ser Lys Leu His Leu Tyr Lys Gly Ser     290 295 300 Trp Ser Asp Trp Ser Glu Ser Leu Arg Ala Gln Thr Pro Glu Glu Glu 305 310 315 320 Pro Thr Gly Met Leu Asp Val Trp Tyr Met Lys Arg His Ile Asp Tyr                 325 330 335 Ser Arg Gln Gln Ile Ser Leu Phe Trp Lys Asn Leu Ser Val Ser Glu             340 345 350 Ala Arg Gly Lys Ile Leu His Tyr Gln Val Thr Leu Gln Glu Leu Thr         355 360 365 Gly Gly Lys Ala Met Thr Gln Asn Ile Thr Gly His Thr Ser Trp Thr     370 375 380 Thr Val Ile Pro Arg Thr Gly Asn Trp Ala Val Ala Val Ser Ala Ala 385 390 395 400 Asn Ser Lys Gly Ser Ser Leu Pro Thr Arg Ile Asn Ile Met Asn Leu                 405 410 415 Cys Glu Ala Gly Leu Leu Ala Pro Arg His Val Ser Ala Asn Ser Glu             420 425 430 Gly Met Asp Asn Ile Leu Val Thr Trp Gln Pro Pro Arg Lys Asp Pro         435 440 445 Ser Ala Val Gln Glu Tyr Val Val Glu Trp Arg Glu Leu His Pro Gly     450 455 460 Gly Asp Thr Gln Val Pro Leu Asn Trp Leu Arg Ser Arg Pro Tyr Asn 465 470 475 480 Val Ser Ala Leu Ile Ser Glu Asn Ile Lys Ser Tyr Ile Cys Tyr Glu                 485 490 495 Ile Arg Val Tyr Ala Leu Ser Gly Asp Gln Gly Gly Cys Ser Ser Ile             500 505 510 Leu Gly Asn Ser Lys His Lys Ala Pro Leu Ser Gly Pro His Ile Asn         515 520 525 Ala Ile Thr Glu Glu Lys Gly Ser Ile Leu Ile Ser Trp Asn Ser Ile     530 535 540 Pro Val Gln Glu Gln Met Gly Cys Leu Leu His Tyr Arg Ile Tyr Trp 545 550 555 560 Lys Glu Arg Asp Ser Asn Ser Gln Pro Gln Leu Cys Glu Ile Pro Tyr                 565 570 575 Arg Val Ser Gln Asn Ser His Pro Ile Asn Ser Leu Gln Pro Arg Val             580 585 590 Thr Tyr Val Leu Trp Met Thr Ala Leu Thr Ala Ala Gly Glu Ser Ser         595 600 605 His Gly Asn Glu Arg Glu Phe Cys Leu Gln Gly Lys Ala Asn Trp Met     610 615 620 Ala Phe Val Ala Pro Ser Ile Cys Ile Ala Ile Ile Met Val Gly Ile 625 630 635 640 Phe Ser Thr His Tyr Phe Gln Gln Lys Val Phe Val Leu Leu Ala Ala                 645 650 655 Leu Arg Pro Gln Trp Cys Ser Arg Glu Ile Pro Asp Pro Ala Asn Ser             660 665 670 Thr Cys Ala Lys Lys Tyr Pro Ile Ala Glu Glu Lys Thr Gln Leu Pro         675 680 685 Leu Asp Arg Leu Leu Ile Asp Trp Pro Thr Pro Glu Asp Pro Glu Pro     690 695 700 Leu Val Ile Ser Glu Val Leu His Gln Val Thr Pro Val Phe Arg His 705 710 715 720 Pro Pro Cys Ser Asn Trp Pro Gln Arg Glu Lys Gly Ile Gln Gly His                 725 730 735 Gln Ala Ser Glu Lys Asp Met Met His Ser Ala Ser Ser Pro Pro Pro             740 745 750 Pro Arg Ala Leu Gln Ala Glu Ser Arg Gln Leu Val Asp Leu Tyr Lys         755 760 765 Val Leu Glu Ser Arg Gly Ser Asp Pro Lys Pro Glu Asn Pro Ala Cys     770 775 780 Pro Trp Thr Val Leu Pro Ala Gly Asp Leu Pro Thr His Asp Gly Tyr 785 790 795 800 Leu Pro Ser Asn Ile Asp Asp Leu Pro Ser His Glu Ala Pro Leu Ala                 805 810 815 Asp Ser Leu Glu Glu Leu Glu Pro Gln His Ile Ser Leu Ser Val Phe             820 825 830 Pro Ser Ser Ser Leu His Pro Leu Thr Phe Ser Cys Gly Asp Lys Leu         835 840 845 Thr Leu Asp Gln Leu Lys Met Arg Cys Asp Ser Leu Met Leu     850 855 860 <210> 8 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 8 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr             20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val Cys Leu Lys     50 <210> 9 <211> 54 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 9 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr             20 25 30 Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val Ser Leu Lys Gly Ser     50 <210> 10 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 10 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr             20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val      <210> 11 <211> 47 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 11 Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys 1 5 10 15 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln Glu Val             20 25 30 Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys         35 40 45 <210> 12 <211> 43 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 12 Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu 1 5 10 15 Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu Leu Lys             20 25 30 Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys         35 40 <210> 13 <211> 37 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 13 Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp 1 5 10 15 Thr Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln             20 25 30 Thr Val Ser Leu Lys         35 <210> 14 <211> 33 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 14 Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln 1 5 10 15 Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu             20 25 30 Lys      <210> 15 <211> 29 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 15 Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu 1 5 10 15 Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys             20 25 <210> 16 <211> 25 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 16 Ser Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln 1 5 10 15 Gln Ala Leu Gln Thr Val Ser Leu Lys             20 25 <210> 17 <211> 43 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 17 Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys 1 5 10 15 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln Glu Val             20 25 30 Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 <210> 18 <211> 41 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 18 Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys 1 5 10 15 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln Glu Val             20 25 30 Ala Leu Leu Lys Glu Gln Gln Ala Leu         35 40 <210> 19 <211> 38 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 19 Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys 1 5 10 15 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln Glu Val             20 25 30 Ala Leu Leu Lys Glu Gln         35 <210> 20 <211> 34 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 20 Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys 1 5 10 15 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln Glu Val             20 25 30 Ala leu          <210> 21 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 21 Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys 1 5 10 15 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ser Gln Glu             20 25 30 <210> 22 <211> 40 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 22 Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu 1 5 10 15 Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys             20 25 30 Glu Gln Gln Ala Leu Gln Thr Val         35 40 <210> 23 <211> 33 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 23 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 1 5 10 15 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr             20 25 30 Val      <210> 24 <211> 53 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 24 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr             20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val Ser Leu Lys Gly     50 <210> 25 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 25 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr             20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val Ser Leu Lys     50 <210> 26 <211> 51 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 26 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr             20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val Ser Leu     50 <210> 27 <211> 50 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 27 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr             20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val Ser     50 <210> 28 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 28 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr             20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val      <210> 29 <211> 48 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 29 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr             20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 <210> 30 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 30 Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val 1 5 10 15 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu             20 25 30 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val         35 40 45 Ser Leu Lys Gly     50 <210> 31 <211> 48 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 31 Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val 1 5 10 15 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu             20 25 30 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val         35 40 45 <210> 32 <211> 51 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 32 Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val 1 5 10 15 Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala             20 25 30 Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser         35 40 45 Leu Lys Gly     50 <210> 33 <211> 50 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 33 Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn 1 5 10 15 Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln             20 25 30 Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu         35 40 45 Lys Gly     50 <210> 34 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 34 Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr 1 5 10 15 Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu             20 25 30 Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys         35 40 45 Gly      <210> 35 <211> 48 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 35 Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys 1 5 10 15 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val             20 25 30 Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly         35 40 45 <210> 36 <211> 47 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 36 Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met 1 5 10 15 Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala             20 25 30 Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly         35 40 45 <210> 37 <211> 46 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 37 Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe 1 5 10 15 Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu             20 25 30 Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly         35 40 45 <210> 38 <211> 45 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 38 Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu 1 5 10 15 Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu             20 25 30 Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly         35 40 45 <210> 39 <211> 44 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 39 Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu 1 5 10 15 Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys             20 25 30 Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly         35 40 <210> 40 <211> 43 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 40 Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu 1 5 10 15 Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu             20 25 30 Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly         35 40 <210> 41 <211> 42 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 41 Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys 1 5 10 15 Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln             20 25 30 Gln Ala Leu Gln Thr Val Ser Leu Lys Gly         35 40 <210> 42 <211> 41 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 42 Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser 1 5 10 15 Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln             20 25 30 Ala Leu Gln Thr Val Ser Leu Lys Gly         35 40 <210> 43 <211> 40 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 43 Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg 1 5 10 15 Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala             20 25 30 Leu Gln Thr Val Ser Leu Lys Gly         35 40 <210> 44 <211> 39 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 44 Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu 1 5 10 15 Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu             20 25 30 Gln Thr Val Ser Leu Lys Gly         35 <210> 45 <211> 37 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 45 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 1 5 10 15 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr             20 25 30 Val Ser Leu Lys Gly         35 <210> 46 <211> 36 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 46 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 1 5 10 15 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val             20 25 30 Ser Leu Lys Gly         35 <210> 47 <211> 35 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 47 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 1 5 10 15 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val             20 25 30 Ser leu lys         35 <210> 48 <211> 34 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 48 Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala 1 5 10 15 Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser             20 25 30 Leu Lys          <210> 49 <211> 33 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 49 Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln 1 5 10 15 Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu             20 25 30 Lys      <210> 50 <211> 32 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 50 Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu 1 5 10 15 Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys             20 25 30 <210> 51 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 51 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val 1 5 10 15 Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys             20 25 30 <210> 52 <211> 33 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 52 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 1 5 10 15 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr             20 25 30 Val      <210> 53 <211> 32 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 53 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 1 5 10 15 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr             20 25 30 <210> 54 <211> 30 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 54 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 1 5 10 15 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln             20 25 30 <210> 55 <211> 35 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 55 Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala 1 5 10 15 Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser             20 25 30 Leu Lys Gly         35 <210> 56 <211> 34 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 56 Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln 1 5 10 15 Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu             20 25 30 Lys Gly          <210> 57 <211> 33 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 57 Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu 1 5 10 15 Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys             20 25 30 Gly      <210> 58 <211> 32 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 58 Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val 1 5 10 15 Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly             20 25 30 <210> 59 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 59 Glu Gly Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr             20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val Ser Leu Lys     50 <210> 60 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 60 Glu Gly Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr             20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val      <210> 61 <211> 48 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 61 Glu Gly Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr             20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 <210> 62 <211> 47 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 62 Glu Gly Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr             20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln         35 40 45 <210> 63 <211> 43 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 63 Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu 1 5 10 15 Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys             20 25 30 Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys         35 40 <210> 64 <211> 40 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 64 Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu 1 5 10 15 Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys             20 25 30 Glu Gln Gln Ala Leu Gln Thr Val         35 40 <210> 65 <211> 39 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 65 Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu 1 5 10 15 Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys             20 25 30 Glu Gln Gln Ala Leu Gln Thr         35 <210> 66 <211> 38 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 66 Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu 1 5 10 15 Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys             20 25 30 Glu Gln Gln Ala Leu Gln         35 <210> 67 <211> 35 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 67 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 1 5 10 15 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val             20 25 30 Ser leu lys         35 <210> 68 <211> 32 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 68 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 1 5 10 15 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val             20 25 30 <210> 69 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 69 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 1 5 10 15 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr             20 25 30 <210> 70 <211> 30 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 70 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 1 5 10 15 Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln             20 25 30 <210> 71 <211> 40 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 71 Met Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu 1 5 10 15 Glu Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu             20 25 30 Lys Glu Gln Gln Ala Leu Gln Thr         35 40 <210> 72 <211> 32 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 72 Met Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 1 5 10 15 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr             20 25 30 <210> 73 <211> 53 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 73 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr             20 25 30 Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val Ser Leu Lys Gly     50 <210> 74 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 74 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr             20 25 30 Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val Ser Leu Lys     50 <210> 75 <211> 51 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 75 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr             20 25 30 Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val Ser Leu     50 <210> 76 <211> 50 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 76 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr             20 25 30 Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val Ser     50 <210> 77 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 77 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr             20 25 30 Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val      <210> 78 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 78 Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val 1 5 10 15 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu             20 25 30 Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val         35 40 45 Ser Leu Lys Gly     50 <210> 79 <211> 51 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 79 Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val 1 5 10 15 Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser             20 25 30 Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser         35 40 45 Leu Lys Gly     50 <210> 80 <211> 50 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 80 Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn 1 5 10 15 Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln             20 25 30 Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu         35 40 45 Lys Gly     50 <210> 81 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 81 Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr 1 5 10 15 Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln Glu             20 25 30 Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys         35 40 45 Gly      <210> 82 <211> 48 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 82 Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys 1 5 10 15 Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln Glu Val             20 25 30 Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly         35 40 45 <210> 83 <211> 47 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 83 Pro Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met 1 5 10 15 Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln Glu Val Ala             20 25 30 Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly         35 40 45 <210> 84 <211> 46 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 84 Lys Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe 1 5 10 15 Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu             20 25 30 Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly         35 40 45 <210> 85 <211> 45 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 85 Lys Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu 1 5 10 15 Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu Leu             20 25 30 Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly         35 40 45 <210> 86 <211> 44 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 86 Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu 1 5 10 15 Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu Leu Lys             20 25 30 Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly         35 40 <210> 87 <211> 43 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 87 Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu 1 5 10 15 Lys Ala Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu Leu Lys Glu             20 25 30 Gln Gln Ala Leu Gln Thr Val Ser Leu Lys Gly         35 40 <210> 88 <211> 42 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 88 Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys 1 5 10 15 Ala Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln             20 25 30 Gln Ala Leu Gln Thr Val Ser Leu Lys Gly         35 40 <210> 89 <211> 41 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 89 Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala 1 5 10 15 Arg Leu Asp Thr Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln             20 25 30 Ala Leu Gln Thr Val Ser Leu Lys Gly         35 40 <210> 90 <211> 40 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 90 Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg 1 5 10 15 Leu Asp Thr Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala             20 25 30 Leu Gln Thr Val Ser Leu Lys Gly         35 40 <210> 91 <211> 39 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 91 Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu 1 5 10 15 Asp Thr Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu             20 25 30 Gln Thr Val Ser Leu Lys Gly         35 <210> 92 <211> 37 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 92 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr 1 5 10 15 Leu Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr             20 25 30 Val Ser Leu Lys Gly         35 <210> 93 <211> 36 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 93 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu 1 5 10 15 Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val             20 25 30 Ser Leu Lys Gly         35 <210> 94 <211> 35 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 94 Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu 1 5 10 15 Ser Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val             20 25 30 Ser leu lys         35 <210> 95 <211> 34 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 95 Asn Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser 1 5 10 15 Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser             20 25 30 Leu Lys          <210> 96 <211> 33 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 96 Thr Lys Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln 1 5 10 15 Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu             20 25 30 Lys      <210> 97 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 97 Met Phe Glu Glu Leu Lys Ala Arg Leu Asp Thr Leu Ser Gln Glu Val 1 5 10 15 Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Ser Leu Lys             20 25 30 <210> 98 <211> 71 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 98 Met Gly Ser His His His His His Gly Ser Ile Gln Gly Arg Ser Pro 1 5 10 15 Gly Thr Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys             20 25 30 Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu         35 40 45 Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu     50 55 60 Gln Thr Val Ser Leu Lys Gly 65 70 <210> 99 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 99 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr             20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val Cys Leu Lys     50 <210> 100 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 100 Glu Ser Pro Thr Pro Lys Ala Lys Lys Ala Ala Asn Ala Lys Lys Asp 1 5 10 15 Leu Val Ser Ser Lys Met Phe Glu Glu Leu Lys Asn Arg Met Asp Val             20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Lys Gln Ala Leu Gln Thr         35 40 45 Val Cys Leu Lys     50 <210> 101 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 101 Gln Gln Asn Gly Lys Gly Arg Gln Lys Pro Ala Ala Ser Lys Lys Asp 1 5 10 15 Gly Val Ser Leu Lys Met Ile Glu Asp Leu Lys Ala Met Ile Asp Asn             20 25 30 Ile Ser Gln Glu Val Ala Leu Leu Lys Glu Lys Gln Ala Leu Gln Thr         35 40 45 Val Cys Leu Lys     50 <210> 102 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 102 Glu Thr Pro Thr Pro Lys Ala Lys Lys Ala Ala Asn Ala Lys Lys Asp 1 5 10 15 Ala Val Ser Pro Lys Met Leu Glu Glu Leu Lys Thr Gln Leu Asp Ser             20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val Cys Leu Lys     50 <210> 103 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 103 Gln Gln Thr Ser Ser Lys Lys Lys Gly Gly Lys Lys Asp Ala Glu Asn 1 5 10 15 Asn Ala Ala Ile Glu Glu Leu Lys Lys Gln Ile Asp Asn Ile Val Leu             20 25 30 Glu Leu Asn Leu Leu Lys Glu Gln Gln Ala Leu Gln Ser Val Cys Leu         35 40 45 Lys      <210> 104 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 104 Gln Gln Asn Gly Lys Lys Asn Lys Gln Asn Asn Lys Asp Val Val Ser 1 5 10 15 Met Lys Met Tyr Glu Asp Leu Lys Lys Lys Val Gln Asn Ile Glu Glu             20 25 30 Asp Val Ile His Leu Lys Glu Gln Gln Ala Leu Gln Thr Ile Cys Leu         35 40 45 Lys      <210> 105 <211> 48 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 105 Glu Gln Ser Leu Thr Lys Arg Lys Asn Gly Lys Lys Glu Ser Asn Ser 1 5 10 15 Ala Ala Ile Glu Glu Leu Lys Lys Gln Ile Asp Gln Ile Ile Gln Asp             20 25 30 Leu Asn Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Cys Leu Lys         35 40 45 <210> 106 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 106 Gln Thr Ser Cys His Ala Ser Lys Phe Lys Ala Arg Lys His Ser Lys 1 5 10 15 Arg Arg Val Lys Glu Lys Asp Gly Asp Leu Lys Thr Gln Val Glu Lys             20 25 30 Leu Trp Arg Glu Val Asn Ala Leu Lys Glu Met Gln Ala Leu Gln Thr         35 40 45 Val Cys Leu Arg     50 <210> 107 <211> 38 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 107 Lys Pro Ser Lys Ser Gly Lys Gly Lys Asp Asp Leu Arg Asn Glu Ile 1 5 10 15 Asp Lys Leu Trp Arg Glu Val Asn Ser Leu Lys Glu Met Gln Ala Leu             20 25 30 Gln Thr Val Cys Leu Lys         35 <210> 108 <211> 52 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (1) .. (25) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (222) (27) .. (29) <223> Xaa can be any naturally occurring amino acid <220> <221> MISC_FEATURE (222) (30) .. (30) <223> Xaa can be any hydrophobic naturally occurring amino acid <220> <221> misc_feature (222) (31) .. (32) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature &Lt; 222 > (34) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (222) (38) .. (39) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature &Lt; 222 > (43) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (52). (52) <223> Xaa can be any naturally occurring amino acid <400> 108 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1 5 10 15 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa             20 25 30 Leu Xaa Xaa Glu Val Xaa Xaa Leu Lys Glu Xaa Gln Ala Leu Gln Thr         35 40 45 Val Cys Leu Xaa     50 <210> 109 <211> 137 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 109 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp             20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu         35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala     50 55 60 Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile                 85 90 95 Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu Asn Cys Ala Val Leu Ser             100 105 110 Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Gln Leu         115 120 125 Pro Tyr Ile Cys Gln Phe Gly Ile Val     130 135 <210> 110 <211> 126 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 110 Asn Lys Leu His Ala Gly Ser Met Gly Lys Lys Ser Gly Lys Lys Phe 1 5 10 15 Phe Val Thr Asn His Glu Arg Met Pro Phe Ser Lys Val Lys Ala Leu             20 25 30 Cys Ser Glu Leu Arg Gly Thr Val Ala Ile Pro Arg Asn Ala Glu Glu         35 40 45 Asn Lys Ala Ile Gln Glu Val Ala Lys Thr Ser Ala Phe Leu Gly Ile     50 55 60 Thr Asp Glu Val Thr Glu Gly Gln Phe Met Tyr Val Thr Gly Gly Arg 65 70 75 80 Leu Thr Tyr Ser Asn Trp Lys Lys Asp Glu Pro Asn Asp His Gly Ser                 85 90 95 Gly Glu Asp Cys Val Thr Ile Val Asp Asn Gly Leu Trp Asn Asp Ile             100 105 110 Ser Cys Gln Ala Ser His Thr Ala Val Cys Ser Phe Pro Ala         115 120 125 <210> 111 <211> 127 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 111 Lys Lys Val Glu Leu Phe Pro Asn Gly Gln Ser Val Gly Glu Lys Ile 1 5 10 15 Phe Lys Thr Ala Gly Phe Val Lys Pro Phe Thr Glu Ala Gln Leu Leu             20 25 30 Cys Thr Gln Ala Gly Gly Gln Leu Ala Ser Pro Arg Ser Ala Ala Glu         35 40 45 Asn Ala Ala Leu Gln Gln Leu Val Val Ala Lys Asn Glu Ala Ala Phe     50 55 60 Leu Ser Met Thr Asp Ser Lys Thr Glu Gly Lys Phe Thr Tyr Pro Thr 65 70 75 80 Gly Glu Ser Leu Val Tyr Ser Asn Trp Ala Pro Gly Glu Pro Asn Asp                 85 90 95 Asp Gly Gly Ser Glu Asp Cys Val Glu Ile Phe Thr Asn Gly Lys Trp             100 105 110 Asn Asp Arg Ala Cys Gly Glu Lys Arg Leu Val Val Cys Ala Phe         115 120 125 <210> 112 <211> 123 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 112 Lys Val Tyr Trp Phe Cys Tyr Gly Met Lys Cy Ts Tyr Tyr Phe Val Met 1 5 10 15 Asp Arg Lys Thr Trp Ser Gly Cys Lys Gln Thr Cys Gln Ser Ser Ser             20 25 30 Leu Ser Leu Leu Lys Ile Asp Asp Glu Asp Glu Leu Lys Phe Leu Gln         35 40 45 Leu Leu Val Val Pro Ser Asp Ser Cys Trp Val Gly Leu Ser Tyr Asp     50 55 60 Asn Lys Lys Asp Trp Ala Trp Ile Asp Asn Arg Pro Ser Lys Leu Ala 65 70 75 80 Leu Asn Thr Arg Lys Tyr Asn Ile Arg Asp Arg Gly Gly Cys Met Leu                 85 90 95 Leu Ser Lys Thr Arg Leu Asp Asn Gly Asn Cys Asp Gln Val Phe Ile             100 105 110 Cys Ile Cys Gly Lys Arg Leu Asp Lys Phe Pro         115 120 <210> 113 <211> 128 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 113 Cys Pro Val Asn Trp Val Glu His Glu Arg Ser Cys Tyr Trp Phe Ser 1 5 10 15 Arg Ser Gly Lys Ala Trp Ala Asp Ala Asp Asn Tyr Cys Arg Leu Glu             20 25 30 Asp Ala His Leu Val Val Val Thr Ser Trp Glu Glu Gln Leu Phe Val         35 40 45 Gln His His Ile Gly Pro Val Asn Thr Trp Met Gly Leu His Asp Gln     50 55 60 Asn Gly Pro Trp Lys Trp Val Asp Gly Thr Asp Tyr Glu Thr Gly Phe 65 70 75 80 Lys Asn Trp Arg Pro Glu Gln Pro Asp Asp Trp Tyr Gly His Gly Leu                 85 90 95 Gly Gly Gly Glu Asp Cys Ala His Phe Thr Asp Asp Gly Arg Trp Asn             100 105 110 Asp Asp Val Cys Gln Arg Pro Tyr Arg Trp Val Cys Ser Thr Glu Leu         115 120 125 <210> 114 <211> 147 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 114 Gly Ile Pro Lys Cys Pro Glu Asp Trp Gly Ala Ser Ser Arg Thr Ser 1 5 10 15 Leu Cys Phe Lys Leu Tyr Ala Lys Gly Lys His Glu Lys Lys Thr Trp             20 25 30 Phe Glu Ser Arg Asp Phe Cys Arg Ala Leu Gly Gly Asp Leu Ala Ser         35 40 45 Ile Asn Asn Lys Glu Glu Gln Gln Thr Ile Trp Arg Leu Ile Thr Ala     50 55 60 Ser Gly Ser Tyr His Lys Leu Phe Trp Leu Gly Leu Thr Tyr Gly Ser 65 70 75 80 Pro Ser Glu Gly Phe Thr Trp Ser Asp Gly Ser Pro Val Ser Tyr Glu                 85 90 95 Asn Trp Ala Tyr Gly Glu Pro Asn Asn Tyr Gln Asn Val Glu Tyr Cys             100 105 110 Gly Glu Leu Lys Gly Asp Pro Thr Met Ser Trp Asn Asp Ile Asn Cys         115 120 125 Glu His Leu Asn Asn Trp Ile Cys Gln Ile Gln Lys Gly Gln Thr Pro     130 135 140 Lys pro asp 145 <210> 115 <211> 129 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 115 Asp Cys Leu Ser Gly Trp Ser Ser Tyr Glu Gly His Cys Tyr Lys Ala 1 5 10 15 Phe Ser Lys Tyr Lys Thr Trp Glu Asp Ala Glu Arg Val Cys Thr Glu             20 25 30 Gln Ala Lys Gly Ala His Leu Val Ser Ile Glu Ser Ser Gly Glu Ala         35 40 45 Asp Phe Val Ala Gln Leu Val Thr Gln Asn Met Lys Arg Leu Asp Phe     50 55 60 Tyr Ile Trp Ile Gly Leu Arg Val Gln Gly Lys Val Lys Gln Cys Asn 65 70 75 80 Ser Glu Trp Ser Asp Gly Ser Ser Val Ser Tyr Glu Asn Trp Ile Glu                 85 90 95 Ala Glu Ser Lys Thr Cys Leu Gly Leu Glu Lys Glu Thr Asp Phe Arg             100 105 110 Lys Trp Val Asn Ile Tyr Cys Gly Gln Gln Asn Pro Phe Val Cys Glu         115 120 125 Ala      <210> 116 <211> 122 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 116 Asp Cys Pro Ser Asp Trp Ser Ser Tyr Glu Gly His Cys Tyr Lys Pro 1 5 10 15 Phe Ser Glu Pro Lys Asn Trp Ala Asp Ala Glu Asn Phe Cys Thr Gln             20 25 30 Gln His Ala Gly Gly His Leu Val Ser Phe Gln Ser Ser Glu Glu Ala         35 40 45 Asp Phe Val Val Lys Leu Ala Phe Gln Thr Phe His Ser Ile Phe Trp     50 55 60 Met Gly Leu Ser Asn Val Trp Asn Gln Cys Asn Trp Gln Trp Ser Asn 65 70 75 80 Ala Ala Met Leu Arg Tyr Lys Ala Trp Ala Glu Glu Ser Tyr Cys Val                 85 90 95 Tyr Phe Lys Ser Thr Asn Asn Lys Trp Arg Ser Arg Ala Cys Arg Met             100 105 110 Met Ala Gln Phe Val Cys Glu Phe Gln Ala         115 120 <210> 117 <211> 135 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 117 Ala Arg Ile Ser Cys Pro Glu Gly Thr Asn Ala Tyr Arg Ser Tyr Cys 1 5 10 15 Tyr Tyr Phe Asn Glu Asp Arg Glu Thr Trp Val Asp Ala Asp Leu Tyr             20 25 30 Cys Gln Asn Met Asn Ser Gly Asn Leu Val Ser Val Leu Thr Gln Ala         35 40 45 Glu Gly Ala Phe Val Ala Ser Leu Ile Lys Glu Ser Gly Thr Asp Asp     50 55 60 Phe Asn Val Trp Ile Gly Leu His Asp Pro Lys Lys Asn Arg Arg Trp 65 70 75 80 His Trp Ser Ser Gly Ser Leu Val Ser Tyr Lys Ser Trp Gly Ile Gly                 85 90 95 Ala Pro Ser Ser Val Asn Pro Gly Tyr Cys Val Ser Leu Thr Ser Ser             100 105 110 Thr Gly Phe Gly Lys Trp Lys Asp Val Pro Cys Glu Asp Lys Phe Ser         115 120 125 Phe Val Cys Lys Phe Lys Asn     130 135 <210> 118 <211> 123 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 118 Asp Tyr Glu Ile Leu Phe Ser Asp Glu Thr Met Asn Tyr Ala Asp Ala 1 5 10 15 Gly Thr Tyr Cys Gly Ser Arg Gly Met Ala Leu Val Ser Ser Ala Met             20 25 30 Arg Asp Ser Thr Met Val Lys Ala Ile Leu Ala Phe Thr Glu Val Lys         35 40 45 Gly His Asp Tyr Trp Val Gly Ala Asp Asn Leu Gln Asp Gly Ala Tyr     50 55 60 Asn Phe Asn Trp Asn Asp Gly Val Ser Leu Pro Thr Asp Ser Asp Leu 65 70 75 80 Trp Ser Pro Asn Glu Pro Ser Asn Pro Gln Ser Trp Gln Leu Cys Val                 85 90 95 Gln Ile Trp Ser Lys Tyr Asn Leu Leu Asp Asp Val Gly Cys Gly Gly             100 105 110 Ala Arg Arg Val Ile Cys Glu Lys Glu Leu Asp         115 120 <210> 119 <211> 202 <212> PRT <213> Homo sapiens <400> 119 Met Glu Leu Trp Gly Ala Tyr Leu Leu Leu Cys Leu Phe Ser Leu Leu 1 5 10 15 Thr Gln Val Thr Thr Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val             20 25 30 Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys         35 40 45 Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln     50 55 60 Gln Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys 65 70 75 80 Cys Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu                 85 90 95 Asp Cys Ile Ser Arg Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser             100 105 110 Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu         115 120 125 Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly Thr Trp     130 135 140 Val Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Thr Glu 145 150 155 160 Ile Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu Asn Cys Ala Val Leu                 165 170 175 Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Gln             180 185 190 Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val         195 200 <210> 120 <211> 202 <212> PRT <213> Mus musculus <400> 120 Met Gly Phe Trp Gly Thr Tyr Leu Leu Phe Cys Leu Phe Ser Phe Leu 1 5 10 15 Ser Gln Leu Thr Ala Glu Ser Pro Thr Pro Lys Ala Lys Lys Ala Ala             20 25 30 Asn Ala Lys Lys Asp Leu Val Ser Ser Lys Met Phe Glu Glu Leu Lys         35 40 45 Asn Arg Met Asp Val Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Lys     50 55 60 Gln Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val Asn Leu Lys 65 70 75 80 Cys Leu Leu Ala Phe Thr Gln Pro Lys Thr Phe His Glu Ala Ser Glu                 85 90 95 Asp Cys Ile Ser Gln Gly Gly Thr Leu Gly Thr Pro Gln Ser Glu Leu             100 105 110 Glu Asn Glu Ala Leu Phe Glu Tyr Ala Arg His Ser Val Gly Asn Asp         115 120 125 Ala Asn Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly Ala Trp     130 135 140 Val Asp Met Thr Gly Gly Leu Leu Ala Tyr Lys Asn Trp Glu Thr Glu 145 150 155 160 Ile Thr Thr Gln Pro Asp Gly Gly Lys Ala Glu Asn Cys Ala Ala Leu                 165 170 175 Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Gln             180 185 190 Leu Pro Tyr Ile Cys Gln Phe Ala Ile Val         195 200 <210> 121 <211> 201 <212> PRT <213> Gallus gallus <400> 121 Met Ala Leu Arg Gly Ala Cys Leu Leu Leu Cys Leu Val Ser Leu Ala 1 5 10 15 His Ile Ser Val Gln Gln Asn Gly Lys Gly Arg Gln Lys Pro Ala Ala             20 25 30 Ser Lys Lys Asp Gly Val Ser Leu Lys Met Ile Glu Asp Leu Lys Ala         35 40 45 Met Ile Asp Asn Ile Ser Gln Glu Val Ala Leu Leu Lys Glu Lys Gln     50 55 60 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Ile His Leu Lys Cys 65 70 75 80 Phe Leu Ala Phe Ser Glu Ser Lys Thr Tyr His Glu Ala Ser Glu His                 85 90 95 Cys Ile Ser Gln Gly Gly Thr Leu Gly Thr Pro Gln Gly Gly Glu Glu             100 105 110 Asn Asp Ala Leu Tyr Asp Tyr Met Arg Lys Ser Ile Gly Asn Glu Ala         115 120 125 Glu Ile Trp Leu Gly Leu Asn Asp Met Val Ala Glu Gly Lys Trp Val     130 135 140 Asp Met Thr Gly Ser Pro Ile Arg Tyr Lys Asn Trp Glu Thr Glu Ile 145 150 155 160 Thr Thr Gln Pro Asp Gly Gly Lys Leu Glu Asn Cys Ala Ala Leu Ser                 165 170 175 Gly Val Ala Val Gly Lys Trp Phe Asp Lys Arg Cys Lys Glu Gln Leu             180 185 190 Pro Tyr Val Cys Gln Phe Met Ile Val         195 200 <210> 122 <211> 202 <212> PRT <213> Bos taurus <400> 122 Met Glu Leu Trp Gly Pro Cys Val Leu Leu Cys Leu Phe Ser Leu Leu 1 5 10 15 Thr Gln Val Thr Ala Glu Thr Pro Thr Pro Lys Ala Lys Lys Ala Ala             20 25 30 Asn Ala Lys Lys Asp Ala Val Ser Pro Lys Met Leu Glu Glu Leu Lys         35 40 45 Thr Gln Leu Asp Ser Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln     50 55 60 Gln Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys 65 70 75 80 Cys Phe Leu Ala Phe Val Gln Ala Lys Thr Phe His Glu Ala Ser Glu                 85 90 95 Asp Cys Ile Ser Arg Gly Gly Thr Leu Gly Thr Pro Gln Thr Gly Ser             100 105 110 Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Ser Glu         115 120 125 Ala Glu Val Trp Leu Gly Phe Asn Asp Met Ala Ser Glu Gly Ser Trp     130 135 140 Val Asp Met Thr Gly Gly His Ile Ala Tyr Lys Asn Trp Glu Thr Glu 145 150 155 160 Ile Thr Ala Gln Pro Asp Gly Gly Lys Val Glu Asn Cys Ala Thr Leu                 165 170 175 Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Lys             180 185 190 Leu Pro Tyr Val Cys Gln Phe Ala Ile Val         195 200 <210> 123 <211> 198 <212> PRT <213> Salmo salar <400> 123 Met Arg Val Ser Gly Val Arg Leu Leu Phe Cys Leu Leu Leu Leu Gly 1 5 10 15 Gln Ser Thr Phe Gln Gln Thr Ser Ser Lys Lys Lys Gly Gly Lys Lys             20 25 30 Asp Ala Glu Asn Asn Ala Ala Ile Glu Glu Leu Lys Lys Gln Ile Asp         35 40 45 Asn Ile Val Leu Glu Leu Asn Leu Leu Lys Glu Gln Gln Ala Leu Gln     50 55 60 Ser Val Cys Leu Lys Gly Ile Lys Ile Ile Gly Lys Cys Phe Leu Ala 65 70 75 80 Asp Thr Ala Lys Lys Ile Tyr His Thr Ala Tyr Asp Asp Cys Ile Ala                 85 90 95 Lys Gly Gly Thr Ile Ser Thr Pro Leu Thr Gly Asp Glu Asn Asp Gln             100 105 110 Leu Val Asp Tyr Val Arg Arg Ser Ile Gly Pro Glu Glu His Ile Trp         115 120 125 Leu Gly Ile Asn Asp Met Val Thr Glu Gly Glu Trp Leu Asp Gln Ala     130 135 140 Gly Thr Asn Leu Arg Phe Lys Asn Trp Glu Thr Asp Ile Thr Asn Gln 145 150 155 160 Pro Asp Gly Gly Arg Thr His Asn Cys Ala Ile Leu Ser Thr Thr Ala                 165 170 175 Asn Gly Lys Trp Phe Asp Glu Ser Cys Arg Val Glu Lys Ala Ser Val             180 185 190 Cys Glu Phe Asn Ile Val         195 <210> 124 <211> 198 <212> PRT <213> Silurana tropicalis <400> 124 Met Glu Tyr Arg Arg Ala Cys Ile Leu Leu Cys Leu Phe Cys Phe Val 1 5 10 15 Gln Val Thr Leu Gln Gln Asn Gly Lys Lys Asn Lys Gln Asn Asn Lys             20 25 30 Asp Val Val Ser Met Lys Met Tyr Glu Asp Leu Lys Lys Lys Val Gln         35 40 45 Asn Ile Glu Glu Asp Val Ile His Leu Lys Glu Gln Gln Ala Leu Gln     50 55 60 Thr Ile Cys Leu Lys Gly Met Lys Ile Tyr Asn Lys Cys Phe Leu Ala 65 70 75 80 Phe Asn Glu Leu Lys Thr Tyr His Gln Ala Ser Asp Val Cys Phe Ala                 85 90 95 Gln Gly Gly Thr Leu Ser Thr Pro Glu Thr Gly Asp Glu Asn Asp Ser             100 105 110 Leu Tyr Asp Tyr Val Arg Lys Ser Ile Gly Ser Ser Ala Glu Ile Trp         115 120 125 Ile Gly Ile Asn Asp Met Ala Thr Glu Gly Thr Trp Leu Asp Leu Thr     130 135 140 Gly Ser Pro Ile Ser Phe Lys His Trp Glu Thr Glu Ile Thr Thr Gln 145 150 155 160 Pro Asp Gly Gly Lys Gln Glu Asn Cys Ala Ala Leu Ser Ala Ser Ala                 165 170 175 Ile Gly Arg Trp Phe Asp Lys Asn Cys Lys Thr Glu Leu Pro Phe Val             180 185 190 Cys Gln Phe Ser Ile Val         195 <210> 125 <211> 223 <212> PRT <213> Danio rerio <400> 125 Met Arg Asp Asp Ser Asp Lys Val Pro Ser Leu Leu Thr Asp Tyr Ile 1 5 10 15 Leu Lys Gly Cys Thr Tyr Ala Glu Glu Lys Met Asp Leu Lys Ala Val             20 25 30 Lys Phe Leu Leu Cys Val Ile Cys Leu Val Lys Ser Ser Pro Glu Gln         35 40 45 Ser Leu Thr Lys Arg Lys Asn Gly Lys Lys Glu Ser Asn Ser Ala Ala     50 55 60 Ile Glu Glu Leu Lys Lys Lys Gln Ile Asp Gln Ile Ile Gln Asp Leu Asn 65 70 75 80 Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Cys Leu Lys Gly Phe                 85 90 95 Lys Ile Pro Gly Lys Cys Phe Leu Val Asp Thr Val Lys Lys Asp Phe             100 105 110 His Ser Ala Asn Asp Asp Cys Ile Ala Lys Gly Gly Ile Leu Ser Thr         115 120 125 Pro Met Ser Gly His Glu Asn Asp Gln Leu Gln Glu Tyr Val Gln Gln     130 135 140 Thr Val Gly Pro Glu Thr His Ile Trp Leu Gly Val Asn Asp Met Ile 145 150 155 160 Lys Glu Gly Glu Trp Ile Asp Leu Thr Gly Ser Pro Ile Arg Phe Lys                 165 170 175 Asn Trp Glu Ser Glu Ile Thr His Gln Pro Asp Gly Gly Arg Thr His             180 185 190 Asn Cys Ala Val Leu Ser Ser Thr Ala Asn Gly Lys Trp Phe Asp Glu         195 200 205 Asp Cys Arg Gly Glu Lys Ala Ser Val Cys Gln Phe Asn Ile Val     210 215 220 <210> 126 <211> 197 <212> PRT <213> Bos taurus <400> 126 Met Ala Lys Asn Gly Leu Val Ile Tyr Ile Leu Val Ile Thr Leu Leu 1 5 10 15 Leu Asp Gln Thr Ser Cys His Ala Ser Lys Phe Lys Ala Arg Lys His             20 25 30 Ser Lys Arg Arg Val Lys Glu Lys Asp Gly Asp Leu Lys Thr Gln Val         35 40 45 Glu Lys Leu Trp Arg Glu Val Asn Ala Leu Lys Glu Met Gln Ala Leu     50 55 60 Gln Thr Val Cys Leu Arg Gly Thr Lys Phe His Lys Lys Cys Tyr Leu 65 70 75 80 Ala Ala Glu Gly Leu Lys His Phe His Glu Ala Asn Glu Asp Cys Ile                 85 90 95 Ser Lys Gly Gly Thr Leu Val Val Pro Arg Ser Ala Asp Glu Ile Asn             100 105 110 Ala Leu Arg Asp Tyr Gly Lys Arg Ser Leu Pro Gly Val Asn Asp Phe         115 120 125 Trp Leu Gly Ile Asn Asp Met Val Ala Glu Gly Lys Phe Val Asp Ile     130 135 140 Asn Gly Leu Ala Ile Ser Phe Leu Asn Trp Asp Gln Ala Gln Pro Asn 145 150 155 160 Gly Gly Lys Arg Glu Asn Cys Ala Leu Phe Ser Gln Ser Ala Gln Gly                 165 170 175 Lys Trp Ser Asp Glu Ala Cys His Ser Ser Lys Arg Tyr Ile Cys Glu             180 185 190 Phe Thr Ile Pro Gln         195 <210> 127 <211> 166 <212> PRT <213> Carcharhinus springeri <400> 127 Ser Lys Pro Ser Lys Ser Gly Lys Gly Lys Asp Asp Leu Arg Asn Glu 1 5 10 15 Ile Asp Lys Leu Trp Arg Glu Val Asn Ser Leu Lys Glu Met Gln Ala             20 25 30 Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Ile His Lys Lys Cys Tyr         35 40 45 Leu Ala Ser Arg Gly Ser Lys Ser Tyr His Ala Ala Asn Glu Asp Cys     50 55 60 Ile Ala Gln Gly Gly Thr Leu Ser Ile Pro Arg Ser Ser Asp Glu Gly 65 70 75 80 Asn Ser Leu Arg Ser Tyr Ala Lys Lys Ser Leu Val Gly Ala Arg Asp                 85 90 95 Phe Trp Ile Gly Val Asn Asp Met Thr Thr Glu Gly Lys Phe Val Asp             100 105 110 Val Asn Gly Leu Pro Ile Thr Tyr Phe Asn Trp Asp Arg Ser Lys Pro         115 120 125 Val Gly Gly Thr Arg Glu Asn Cys Val Ala Ala Ser Thr Ser Gly Gln     130 135 140 Gly Lys Trp Ser Asp Asp Val Cys Arg Ser Glu Lys Arg Tyr Ile Cys 145 150 155 160 Glu Tyr Leu Ile Pro Val                 165 <210> 128 <211> 204 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (7) .. (8) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (222) (15) .. (15) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature &Lt; 222 > (17) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature &Lt; 222 > (22) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature &Lt; 222 > (31) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (222) (39) .. (39) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (222) (42) .. (42) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (45). (45) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (222) (50) .. (50) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (54) .. (54) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature &Lt; 222 > (60) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature &Lt; 222 > (80) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (222) (89) .. (89) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (111) .. (111) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (222) (119) .. (120) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (222) (122) .. (122) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (124) .. (124) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (131) .. (131) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (136) .. (136) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (222) (144) .. (144) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (222) (152) .. (152) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (154) .. (154) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (164). (164) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <171> (171) .. (171) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (179) .. (180) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (189) .. (189) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (201). (201) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (204) (204) .. (204) <223> Xaa can be any naturally occurring amino acid <400> 128 Met Glu Leu Trp Gly Ala Xaa Xaa Leu Leu Cys Leu Phe Ser Xaa Leu 1 5 10 15 Xaa Gln Val Thr Ala Xaa Xaa Xaa Xaa Xaa Lys Ala Lys Lys Xaa Xaa             20 25 30 Xaa Xaa Xaa Lys Lys Asp Xaa Val Ser Xaa Lys Met Xaa Glu Glu Leu         35 40 45 Lys Xaa Gln Ile Asp Xaa Leu Ala Gln Glu Val Xaa Leu Leu Lys Glu     50 55 60 Gln Gln Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Ile His Xaa 65 70 75 80 Lys Cys Phe Leu Ala Phe Thr Gln Xaa Lys Thr Phe His Glu Ala Ser                 85 90 95 Glu Asp Cys Ile Ser Gln Gly Gly Thr Leu Ser Thr Pro Gln Xaa Gly             100 105 110 Asp Glu Asn Asp Ala Leu Xaa Xaa Tyr Xaa Arg Xaa Ser Val Gly Asn         115 120 125 Glu Ala Xaa Ile Trp Leu Gly Xaa Asn Asp Met Ala Ala Glu Gly Xaa     130 135 140 Trp Val Asp Met Thr Gly Ser Xaa Ile Xaa Tyr Lys Asn Trp Glu Thr 145 150 155 160 Glu Ile Thr Xaa Gln Pro Asp Gly Gly Lys Xaa Glu Asn Cys Ala Ala                 165 170 175 Leu Ser Xaa Xaa Ala Asn Gly Lys Trp Phe Asp Lys Xaa Cys Arg Asp             180 185 190 Glu Leu Pro Tyr Val Cys Gln Phe Xaa Ile Val Xaa         195 200 <210> 129 <211> 240 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature &Lt; 222 > (34) <223> n is a, c, g, or t <220> <221> misc_feature <222> (36) .. (36) <223> k is g or t <220> <221> misc_feature (222) (37) .. (38) <223> n is a, c, g, or t <220> <221> misc_feature (222) (39) .. (39) <223> k is g or t <220> <221> misc_feature (222) (40) .. (41) <223> n is a, c, g, or t <220> <221> misc_feature (222) (42) .. (42) <223> k is g or t <220> <221> misc_feature (222) (43) .. (44) <223> n is a, c, g, or t <220> <221> misc_feature (45). (45) <223> k is g or t <220> <221> misc_feature (222) (46) .. (47) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (48) <223> k is g or t <220> <221> misc_feature (222) (49) .. (50) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (51) <223> k is g or t <220> <221> misc_feature <222> (52) .. (53) <223> n is a, c, g, or t <220> <221> misc_feature <222> (54) .. (54) <223> k is g or t <400> 129 gaggccgaga tctggctggg cctgaacgac atgnnknnkn nknnknnknn knnktgggtg 60 gatatgactg gcgcccgcat cgcctacaag aactgggaaa ctgagatcac cgcccaacct 120 gatggcggcg caaccgagaa ctgcgcggtc ctgtctggcg ccgccaacgg caagtggttc 180 gacaagcgct gcagggatca attgccctac atctgccagt tcgggatcgt ggcggccgca 240 <210> 130 <211> 80 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (12) .. (18) <223> Xaa can be any naturally occurring amino acid <400> 130 Glu Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Xaa Xaa Xaa Xaa Xaa 1 5 10 15 Xaa Xaa Trp Val Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp             20 25 30 Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly Gly Ala Thr Glu Asn Cys         35 40 45 Ala Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys     50 55 60 Arg Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val Ala Ala Ala 65 70 75 80 <210> 131 <211> 137 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 131 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp             20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu         35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala     50 55 60 Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile                 85 90 95 Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu Asn Cys Ala Val Leu Ser             100 105 110 Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Gln Leu         115 120 125 Pro Tyr Ile Cys Gln Phe Gly Ile Val     130 135 <210> 132 <211> 414 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 132 caggccctcc agacggtctg cctgaagggg accaaggtgc acatgaaatg ctttctggcc 60 ttcacccaga cgaagacctt ccacgaggcc agcgaggact gcatctcgcg cgggggcacc 120 ctgagcaccc ctcagactgg ctcggagaac gacgccctgt atgagtacct gcgccagagc 180 gtgggcaacg aggccgagat ctggctgggc ctcaacgaca tggcggccga gggcacctgg 240 gtggacatga ctggcgcgcg tatcgcctac aagaactggg agactgagat caccgcgcaa 300 cccgatggcg gcaagaccga gaactgcgcg gtcctgtcag gcgcggccaa cggcaagtgg 360 ttcgacaagc gctgcaggga tcaattgccc tacatctgcc agttcgggat cgtg 414 <210> 133 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 133 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp             20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu         35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala     50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Ala Leu Thr Asn Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Pro Pro Gly                 85 90 95 Pro His His Pro Met Gly Gly Phe Gly Val Phe Gly Glu Asn Cys Ala             100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg         115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val     130 135 140 <210> 134 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 134 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp             20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu         35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala     50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Ala Leu Thr Asn Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Pro Pro Pro                 85 90 95 Pro His His Pro Met Gly Gly Phe Gly Val Phe Gly Glu Asn Cys Ala             100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg         115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val     130 135 140 <210> 135 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 135 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp             20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu         35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala     50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Ala Leu Thr Asn Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Arg Pro Ala                 85 90 95 Leu Val Gln Pro Arg Gly Gly Phe Gly Val Phe Gly Glu Asn Cys Ala             100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg         115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val     130 135 140 <210> 136 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 136 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp             20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu         35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala     50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Ala Leu Thr Asn Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Arg Pro Pro                 85 90 95 Leu Tyr Gln Pro Gly Gly Gly Trp Gly Val Phe Gly Glu Asn Cys Ala             100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg         115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val     130 135 140 <210> 137 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 137 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp             20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu         35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala     50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Ala Leu Thr Asn Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Arg Thr Pro                 85 90 95 Pro Trp Gln Pro Glu Gly Gly Phe Gly Tyr Phe Gly Glu Asn Cys Ala             100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg         115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val     130 135 140 <210> 138 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 138 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp             20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu         35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala     50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Leu Arg Thr Asp Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile                 85 90 95 Thr Ala Gln Pro Asp Gly Gly Phe Gly Val Phe Gly Glu Asn Cys Ala             100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg         115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val     130 135 140 <210> 139 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 139 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp             20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu         35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala     50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Leu Arg Thr Asn Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile                 85 90 95 Thr Ala Gln Pro Asp Gly Gly Phe Gly Val Phe Gly Glu Asn Cys Ala             100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg         115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val     130 135 140 <210> 140 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 140 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp             20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu         35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala     50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Ala Leu Thr Asn Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Arg Pro Pro                 85 90 95 Leu Tyr Gln Pro Gly Gly Gly Phe Gly Val Phe Gly Glu Asn Cys Ala             100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg         115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val     130 135 140 <210> 141 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 141 Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys 1 5 10 15 Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp             20 25 30 Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu         35 40 45 Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala     50 55 60 Glu Ile Trp Leu Gly Leu Asn Gly Ser Ala Leu Thr Asn Thr Trp Val 65 70 75 80 Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Ala Thr Glu Ile                 85 90 95 Thr Ala Gln Pro Asp Gly Gly Phe Gly Val Phe Gly Glu Asn Cys Ala             100 105 110 Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg         115 120 125 Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val     130 135 140 <210> 142 <211> 181 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 142 Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn Ala Lys Lys Asp 1 5 10 15 Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr             20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr         35 40 45 Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys Phe Leu Ala Phe     50 55 60 Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp Cys Ile Ser Arg 65 70 75 80 Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu Asn Asp Ala Leu                 85 90 95 Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala Glu Ile Trp Leu             100 105 110 Gly Leu Asn Asp Met Ala Ala Glu Gly Thr Trp Val Asp Met Thr Gly         115 120 125 Ala Arg Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro     130 135 140 Asp Gly Gly Lys Thr Glu Asn Cys Ala Val Leu Ser Gly Ala Ala Asn 145 150 155 160 Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Gln Leu Pro Tyr Ile Cys                 165 170 175 Gln Phe Gly Ile Val             180 <210> 143 <211> 546 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 143 gagccaccaa cccagaagcc caagaagatt gtaaatgcca agaaagatgt tgtgaacaca 60 aagatgtttg aggagctcaa gagccgtctg gacaccctgg cccaggaggt ggccctgctg 120 aaggagcagc aggccctgca gacggtctgc ctgaagggga ccaaggtgca catgaaatgc 180 tttctggcct tcacccagac gaagaccttc cacgaggcca gcgaggactg catctcgcgc 240 gggggcaccc tgagcacccc tcagactggc tcggagaacg acgccctgta tgagtacctg 300 cgccagagcg tgggcaacga ggccgagatc tggctgggcc tcaacgacat ggcggccgag 360 ggcacctggg tggacatgac cggcgcccgc atcgcctaca agaactggga gactgagatc 420 accgcgcaac ccgatggcgg caagaccgag aactgcgcgg tcctgtcagg cgcggccaac 480 ggcaagtggt tcgacaagcg ctgccgcgat cagctgccct acatctgcca gttcgggatc 540 gtgtag 546 <210> 144 <211> 546 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 144 gagtcaccca ctcccaaggc caagaaggct gcaaatgcca agaaagattt ggtgagctca 60 aagatgttcg aggagctcaa gaacaggatg gatgtcctgg cccaggaggt ggccctgctg 120 aaggagaagc aggccttaca gactgtgtgc ctgaagggca ccaaggtgaa cttgaagtgc 180 ctcctggcct tcacccaacc gaagaccttc catgaggcga gcgaggactg catctcgcaa 240 gggggcacgc tgggcacccc gcagtcagag ctagagaacg aggcgctgtt cgagtacgcg 300 cgccacagcg tgggcaacga tgcgaacatc tggctgggcc tcaacgacat ggccgcggaa 360 ggcgcctggg tggacatgac cggcggcctc ctggcctaca agaactggga gacggagatc 420 acgacgcaac ccgacggcgg caaagccgag aactgcgccg ccctgtctgg cgcagccaac 480 ggcaagtggt tcgacaagcg atgccgcgat cagttgccct acatctgcca gtttgccatt 540 gtgtag 546 <210> 145 <211> 181 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 145 Glu Ser Pro Thr Pro Lys Ala Lys Lys Ala Ala Asn Ala Lys Lys Asp 1 5 10 15 Leu Val Ser Ser Lys Met Phe Glu Glu Leu Lys Asn Arg Met Asp Val             20 25 30 Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Lys Gln Ala Leu Gln Thr         35 40 45 Val Cys Leu Lys Gly Thr Lys Val Asn Leu Lys Cys Leu Leu Ala Phe     50 55 60 Thr Gln Pro Lys Thr Phe His Glu Ala Ser Glu Asp Cys Ile Ser Gln 65 70 75 80 Gly Gly Thr Leu Gly Thr Pro Gln Ser Glu Leu Glu Asn Glu Ala Leu                 85 90 95 Phe Glu Tyr Ala Arg His Ser Val Gly Asn Asp Ala Asn Ile Trp Leu             100 105 110 Gly Leu Asn Asp Met Ala Ala Glu Gly Ala Trp Val Asp Met Thr Gly         115 120 125 Gly Leu Leu Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Thr Gln Pro     130 135 140 Asp Gly Gly Lys Ala Glu Asn Cys Ala Ala Leu Ser Gly Ala Ala Asn 145 150 155 160 Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Gln Leu Pro Tyr Ile Cys                 165 170 175 Gln Phe Ala Ile Val             180 <210> 146 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 146 Lys Met Phe Glu Glu Leu Lys Ser Gln Leu Asp Ser Leu Ala Gln Glu 1 5 10 15 Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Cys Leu             20 25 30 <210> 147 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 147 Lys Met Phe Glu Glu Leu Lys Ser Gln Val Asp Ser Leu Ala Gln Glu 1 5 10 15 Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Cys Leu             20 25 30 <210> 148 <211> 32 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 148 Ser Lys Met Phe Glu Glu Leu Lys Asn Arg Met Asp Val Leu Ala Gln 1 5 10 15 Glu Val Ala Leu Leu Lys Glu Lys Gln Ala Leu Gln Thr Val Cys Leu             20 25 30 <210> 149 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 149 Lys Met Phe Glu Glu Leu Lys Asn Arg Leu Asp Val Leu Ala Gln Glu 1 5 10 15 Val Ala Leu Leu Lys Glu Lys Gln Ala Leu Gln Thr Val Cys Leu             20 25 30 <210> 150 <211> 4779 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 150 gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60 cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120 tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180 aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240 ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300 ctgaagatca gttgggtgct cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360 tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420 tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480 actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540 gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600 acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660 gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720 acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780 gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840 ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900 gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960 cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020 agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080 catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140 tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200 cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260 gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320 taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc 1380 ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440 tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500 ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560 cgtgcataca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620 agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680 gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740 atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800 gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860 gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920 ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980 cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040 cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100 acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160 cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220 accatgatta cgccaagctt tggagccttt tttttggaga ttttcaacgt gaaaaaatta 2280 ttattcgcaa ttcctttagt tgttcctttc tatgcggccc agccggccat ggccgccctc 2340 cagacggtct gcctgaaggg gaccaaggtg cacatgaaat gctttctggc cttcacccag 2400 acgaagacct tccacgaggc cagcgaggac tgcatctcgc gcgggggcac cctgagcacc 2460 cctcagactg gctcggagaa cgacgccctg tatgagtacc tgcgccagag cgtgggcaac 2520 gaggccgaga tctaagtgac gatatcctga cctaaggtac ctaagtgacg atatcctgac 2580 ctaactgcag ggatcaattg ccctacatct gccagttcgg gatcgtggcg gccgcaggtg 2640 cgccggtgcc gtatccggat ccgctggaac cgcgtgccgc atagactgtt gaaagttgtt 2700 tagcaaaacc tcatacagaa aattcattta ctaacgtctg gaaagacgac aaaactttag 2760 atcgttacgc taactatgag ggctgtctgt ggaatgctac aggcgttgtg gtttgtactg 2820 gtgacgaaac tcagtgttac ggtacatggg ttcctattgg gcttgctatc cctgaaaatg 2880 agggtggtgg ctctgagggt ggcggttctg agggtggcgg ttctgagggt ggcggtacta 2940 aacctcctga gtacggtgat acacctattc cgggctatac ttatatcaac cctctcgacg 3000 gcacttatcc gcctggtact gagcaaaacc ccgctaatcc taatccttct cttgaggagt 3060 ctcagcctct taatactttc atgtttcaga ataataggtt ccgaaatagg cagggtgcat 3120 taactgttta tacgggcact gttactcaag gcactgaccc cgttaaaact tattaccagt 3180 acactcctgt atcatcaaaa gccatgtatg acgcttactg gaacggtaaa ttcagagact 3240 gcgctttcca ttctggcttt aatgaggatc cattcgtttg tgaatatcaa ggccaatcgt 3300 ctgacctgcc tcaacctcct gtcaatgctg gcggcggctc tggtggtggt tctggtggcg 3360 gctctgaggg tggcggctct gagggtggcg gttctgaggg tggcggctct gagggtggcg 3420 gttccggtgg cggctccggt tccggtgatt ttgattatga aaaaatggca aacgctaata 3480 agggggctat gaccgaaaat gccgatgaaa acgcgctaca gtctgacgct aaaggcaaac 3540 ttgattctgt cgctactgat tacggtgctg ctatcgatgg tttcattggt gacgtttccg 3600 gccttgctaa tggtaatggt gctactggtg attttgctgg ctctaattcc caaatggctc 3660 aagtcggtga cggtgataat tcacctttaa tgaataattt ccgtcaatat ttaccttctt 3720 tgcctcagtc ggttgaatgt cgcccttatg tctttggcgc tggtaaacca tatgaatttt 3780 ctattgattg tgacaaaata aacttattcc gtggtgtctt tgcgtttctt ttatatgttg 3840 ccacctttat gtatgtattt tcgacgtttg ctaacatact gcgtaataag gagtcttaat 3900 aagaattcac tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg cgttacccaa 3960 cttaatcgcc ttgcagcaca tccccctttc gccagctggc gtaatagcga agaggcccgc 4020 accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg aatggcgcct gatgcggtat 4080 tttctcctta cgcatctgtg cggtatttca caccgcatac gtcaaagcaa ccatagtacg 4140 cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta 4200 cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt 4260 tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc cgatttagtg 4320 ctttacggca cctcgacccc aaaaaacttg atttgggtga tggttcacgt agtgggccat 4380 cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac 4440 tcttgttcca aactggaaca acactcaacc ctatctcggg ctattctttt gatttataag 4500 ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg 4560 cgaattttaa caaaatatta acgtttacaa ttttatggtg cagtctcagt acaatctgct 4620 ctgatgccgc atagttaagc cagccccgac acccgccaac acccgctgac gcgccctgac 4680 gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca 4740 tgtgtcagag gttttcaccg tcatcaccga aacgcgcga 4779 <210> 151 <211> 5747 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 151 tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 tccgctcatg aattaattct tagaaaaact catcgagcat caaatgaaac tgcaatttat 600 tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat gaaggagaaa 660 actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg attccgactc 720 gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta tcaagtgaga 780 aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc atttctttcc 840 agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca tcaaccaaac 900 cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac 960 aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat 1020 tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag 1080 tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca 1140 taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac 1200 ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg 1260 tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa tcagcatcca 1320 tgttggaatt taatcgcggc ctagagcaag acgtttcccg ttgaatatgg ctcataacac 1380 cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa 1440 cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 1500 gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 1560 gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 1620 agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 1680 aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 1740 agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 1800 cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 1860 accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 1920 aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 1980 ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 2040 cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 2100 gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 2160 tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 2220 agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg 2280 tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta 2340 caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 2400 ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 2460 gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 2520 gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 2580 gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 2640 aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 2700 ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 2760 acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 2820 ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 2880 tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 2940 tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 3000 cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 3060 gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 3120 ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggggccgc 3180 catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa 3240 ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc 3300 gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac 3360 gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca 3420 ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgagatc ccggtgccta 3480 atgagtgagc taacttacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa 3540 cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 3600 tgggcgccag ggtggttttt cttttcacca gtgagacggg caacagctga ttgcccttca 3660 ccgcctggcc ctgagagagt tgcagcaagc ggtccacgct ggtttgcccc agcaggcgaa 3720 aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg gtatcgtcgt 3780 atcccactac cgagatatcc gcaccaacgc gcagcccgga ctcggtaatg gcgcgcattg 3840 cgcccagcgc catctgatcg ttggcaacca gcatcgcagt gggaacgatg ccctcattca 3900 gcatttgcat ggtttgttga aaaccggaca tggcactcca gtcgccttcc cgttccgcta 3960 tcggctgaat ttgattgcga gtgagatatt tatgccagcc agccagacgc agacgcgccg 4020 agacagaact taatgggccc gctaacagcg cgatttgctg gtgacccaat gcgaccagat 4080 gctccacgcc cagtcgcgta ccgtcttcat gggagaaaat aatactgttg atgggtgtct 4140 ggtcagagac atcaagaaat aacgccggaa cattagtgca ggcagcttcc acagcaatgg 4200 catcctggtc atccagcgga tagttaatga tcagcccact gacgcgttgc gcgagaagat 4260 tgtgcaccgc cgctttacag gcttcgacgc cgcttcgttc taccatcgac accaccacgc 4320 tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac ggcgcgtgca 4380 gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc agttgttgtg 4440 ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt tcccgcgttt 4500 tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa gagacaccgg 4560 catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg aattgactct 4620 cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg gtgtccggga 4680 tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg 4740 ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc 4800 ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 4860 cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 4920 gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgggatctc gatcccgcga 4980 aattaatacg actcactata ggggaattgt gagcggataa caattcccct ctagaaataa 5040 ttttgtttaa ctttaagaag gagatataca tatgaaatac cttcttccga ctgctgctgc 5100 tggtctttta ctgctggctg ctcagccggc tatggctgct ggtggtggtt ctgccctcca 5160 gacggtctgc ctgaagggga ccaaggtgca catgaaatgc tttctggcct tcacccagac 5220 gaagaccttc cacgaggcca gcgaggactg catctcgcgc gggggcaccc tgagcacccc 5280 tcagactggc tcggagaacg acgccctgta tgagtacctg cgccagagcg tgggcaacga 5340 ggccgagatc tggctgggcc tcaacgacat ggcggccgag ggcacctggg tggacatgac 5400 cggtacccgc atcgcctaca agaactggga gactgagatc accgcgcaac ccgatggcgg 5460 caagaccgag aactgcgcgg tcctgtcagg cgcggccaac ggcaagtggt tcgacaagcg 5520 ctgcagggat caattgccct acatctgcca gttcgggatc gtgcaccacc accaccacca 5580 ctaactcgag caccaccacc accaccactg agatccggct gctaacaaag cccgaaagga 5640 agctgagttg gctgctgcca ccgctgagca ataactagca taaccccttg gggcctctaa 5700 acgggtcttg aggggttttt tgctgaaagg aggaactata tccggat 5747 <210> 152 <211> 10975 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 152 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta tcatatgcca agtccgcccc ctattgacgt caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttac gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacaccaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa taaccccgcc ccgttgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 600 gtcagatcac tagaagctgg gtaccagctg ctagcgttta aacttaagct tagcgcagag 660 gcttggggca gccgagcggc agccaggccc cggcccgggc ctcggttcca gaagggagag 720 gagcccgcca aggcgcgcaa gagagcgggc tgcctcgcag tccgagccgg agagggagcg 780 cgagccgcgc cggccccgga cggcctccga aaccatggag ctgtgggggg cctacctgct 840 gctgtgcctg ttctccctgc tgacccaggt gaccaccgag ccaccaaccc agaagcccaa 900 gaagattgta aatgccaaga aagatgttgt gaacacaaag atgtttgagg agctcaagag 960 ccgtctggac accctggccc aggaggtggc cctgctgaag gagcagcagg ccctccagac 1020 ggtctgcctg aaggggacca aggtgcacat gaaatgcttt ctggccttca cccagacgaa 1080 gaccttccac gaggccagcg aggactgcat ctcgcgcggg ggcaccctga gcacccctca 1140 gactggctcg gagaacgacg ccctgtatga gtacctgcgc cagagcgtgg gcaacgaggc 1200 cgagatctgg ctgggcctca acgacatggc ggccgagggc acctgggtgg acatgaccgg 1260 tacccgcatc gcctacaaga actgggagac tgagatcacc gcgcaacccg atggcggcaa 1320 gaccgagaac tgcgcggtcc tgtcaggcgc ggccaacggc aagtggttcg acaagcgctg 1380 cagggatcaa ttgccctaca tctgccagtt cgggatcgtg caccaccacc accaccacta 1440 actcgaggcc ggcaaggccg gatccagaca tgataagata cattgatgag tttggacaaa 1500 ccacaactag aatgcagtga aaaaaatgct ttatttgtga aatttgtgat gctattgctt 1560 tatttgtaac cattataagc tgcaataaac aagttaacaa caagaattgc attcatttta 1620 tgtttcaggt tcagggggag gtgtgggagg ttttttaaag caagtaaaac ctctacaaat 1680 gtggtatggc tgattatgat ccggctgcct cgcgcgtttc ggtgatgacg gtgaaaacct 1740 ctgacacatg cagctcccgg agacggtcac agcttgtctg taagcggatg ccgggagcag 1800 acaagcccgt caggcgtcag cgggtgttgg cgggtgtcgg ggcgcagcca tgaggtcgac 1860 tctagaggat cgatgccccg ccccggacga actaaacctg actacgacat ctctgcccct 1920 tcttcgcggg gcagtgcatg taatcccttc agttggttgg tacaacttgc caactgggcc 1980 ctgttccaca tgtgacacgg ggggggacca aacacaaagg ggttctctga ctgtagttga 2040 catccttata aatggatgtg cacatttgcc aacactgagt ggctttcatc ctggagcaga 2100 ctttgcagtc tgtggactgc aacacaacat tgcctttatg tgtaactctt ggctgaagct 2160 cttacaccaa tgctggggga catgtacctc ccaggggccc aggaagacta cgggaggcta 2220 caccaacgtc aatcagaggg gcctgtgtag ctaccgataa gcggaccctc aagagggcat 2280 tagcaatagt gtttataagg cccccttgtt aaccctaaac gggtagcata tgcttcccgg 2340 gtagtagtat atactatcca gactaaccct aattcaatag catatgttac ccaacgggaa 2400 gcatatgcta tcgaattagg gttagtaaaa gggtcctaag gaacagcgat atctcccacc 2460 ccatgagctg tcacggtttt atttacatgg ggtcaggatt ccacgagggt agtgaaccat 2520 tttagtcaca agggcagtgg ctgaagatca aggagcgggc agtgaactct cctgaatctt 2580 cgcctgcttc ttcattctcc ttcgtttagc taatagaata actgctgagt tgtgaacagt 2640 aaggtgtatg tgaggtgctc gaaaacaagg tttcaggtga cgcccccaga ataaaatttg 2700 gacggggggt tcagtggtgg cattgtgcta tgacaccaat ataaccctca caaacccctt 2760 gggcaataaa tactagtgta ggaatgaaac attctgaata tctttaacaa tagaaatcca 2820 tggggtgggg acaagccgta aagactggat gtccatctca cacgaattta tggctatggg 2880 caacacataa tcctagtgca atatgatact ggggttatta agatgtgtcc caggcaggga 2940 ccaagacagg tgaaccatgt tgttacactc tatttgtaac aaggggaaag agagtggacg 3000 ccgacagcag cggactccac tggttgtctc taacaccccc gaaaattaaa cggggctcca 3060 cgccaatggg gcccataaac aaagacaagt ggccactctt ttttttgaaa ttgtggagtg 3120 ggggcacgcg tcagccccca cacgccgccc tgcggttttg gactgtaaaa taagggtgta 3180 ataacttggc tgattgtaac cccgctaacc actgcggtca aaccacttgc ccacaaaacc 3240 actaatggca ccccggggaa tacctgcata agtaggtggg cgggccaaga taggggcgcg 3300 attgctgcga tctggaggac aaattacaca cacttgcgcc tgagcgccaa gcacagggtt 3360 gttggtcctc atattcacga ggtcgctgag agcacggtgg gctaatgttg ccatgggtag 3420 catatactac ccaaatatct ggatagcata tgctatccta atctatatct gggtagcata 3480 ggctatccta atctatatct gggtagcata tgctatccta atctatatct gggtagtata 3540 tgctatccta atttatatct gggtagcata ggctatccta atctatatct gggtagcata 3600 tgctatccta atctatatct gggtagtata tgctatccta atctgtatcc gggtagcata 3660 tgctatccta atagagatta gggtagtata tgctatccta atttatatct gggtagcata 3720 tactacccaa atatctggat agcatatgct atcctaatct atatctgggt agcatatgct 3780 atcctaatct atatctgggt agcataggct atcctaatct atatctgggt agcatatgct 3840 atcctaatct atatctgggt agtatatgct atcctaattt atatctgggt agcataggct 3900 atcctaatct atatctgggt agcatatgct atcctaatct atatctgggt agtatatgct 3960 atcctaatct gtatccgggt agcatatgct atcctcatgc atatacagtc agcatatgat 4020 acccagtagt agagtgggag tgctatcctt tgcatatgcc gccacctccc aagggggcgt 4080 gaattttcgc tgcttgtcct tttcctgctg gttgctccca ttcttaggtg aatttaagga 4140 ggccaggcta aagccgtcgc atgtctgatt gctcaccagg taaatgtcgc taatgttttc 4200 caacgcgaga aggtgttgag cgcggagctg agtgacgtga caacatgggt atgccgaatt 4260 gccccatgtt gggaggacga aaatggtgac aagacagatg gccagaaata caccaacagc 4320 acgcatgatg tctactgggg atttattctt tagtgcgggg gaatacacgg cttttaatac 4380 gattgagggc gtctcctaac aagttacatc actcctgccc ttcctcaccc tcatctccat 4440 cacctccttc atctccgtca tctccgtcat caccctccgc ggcagcccct tccaccatag 4500 gtggaaacca gggaggcaaa tctactccat cgtcaaagct gcacacagtc accctgatat 4560 tgcaggtagg agcgggcttt gtcataacaa ggtccttaat cgcatccttc aaaacctcag 4620 caaatatatg agtttgtaaa aagaccatga aataacagac aatggactcc cttagcgggc 4680 caggttgtgg gccgggtcca ggggccattc caaaggggag acgactcaat ggtgtaagac 4740 gacattgtgg aatagcaagg gcagttcctc gccttaggtt gtaaagggag gtcttactac 4800 ctccatatac gaacacaccg gcgacccaag ttccttcgtc ggtagtcctt tctacgtgac 4860 tcctagccag gagagctctt aaaccttctg caatgttctc aaatttcggg ttggaacctc 4920 cttgaccacg atgctttcca aaccaccctc cttttttgcg cctgcctcca tcaccctgac 4980 cccggggtcc agtgcttggg ccttctcctg ggtcatctgc ggggccctgc tctatcgctc 5040 ccgggggcac gtcaggctca ccatctgggc caccttcttg gtggtattca aaataatcgg 5100 cttcccctac agggtggaaa aatggccttc tacctggagg gggcctgcgc ggtggagacc 5160 cggatgatga tgactgacta ctgggactcc tgggcctctt ttctccacgt ccacgacctc 5220 tccccctggc tctttcacga cttccccccc tggctctttc acgtcctcta ccccggcggc 5280 ctccactacc tcctcgaccc cggcctccac tacctcctcg accccggcct ccactgcctc 5340 ctcgaccccg gcctccacct cctgctcctg cccctcctgc tcctgcccct cctcctgctc 5400 ctgcccctcc tgcccctcct gctcctgccc ctcctgcccc tcctgctcct gcccctcctg 5460 cccctcctgc tcctgcccct cctgcccctc ctcctgctcc tgcccctcct gcccctcctc 5520 ctgctcctgc ccctcctgcc cctcctgctc ctgcccctcc tgcccctcct gctcctgccc 5580 ctcctgcccc tcctgctcct gcccctcctg ctcctgcccc tcctgctcct gcccctcctg 5640 ctcctgcccc tcctgcccct cctgcccctc ctcctgctcc tgcccctcct gctcctgccc 5700 ctcctgcccc tcctgcccct cctgctcctg cccctcctcc tgctcctgcc cctcctgccc 5760 ctcctgcccc tcctcctgct cctgcccctc ctgcccctcc tcctgctcct gcccctcctc 5820 ctgctcctgc ccctcctgcc cctcctgccc ctcctcctgc tcctgcccct cctgcccctc 5880 ctcctgctcc tgcccctcct cctgctcctg cccctcctgc ccctcctgcc cctcctcctg 5940 ctcctgcccc tcctcctgct cctgcccctc ctgcccctcc tgcccctcct gcccctcctc 6000 ctgctcctgc ccctcctcct gctcctgccc ctcctgctcc tgcccctccc gctcctgctc 6060 ctgctcctgt tccaccgtgg gtccctttgc agccaatgca acttggacgt ttttggggtc 6120 tccggacacc atctctatgt cttggccctg atcctgagcc gcccggggct cctggtcttc 6180 cgcctcctcg tcctcgtcct cttccccgtc ctcgtccatg gttatcaccc cctcttcttt 6240 gaggtccact gccgccggag ccttctggtc cagatgtgtc tcccttctct cctaggccat 6300 ttccaggtcc tgtacctggc ccctcgtcag acatgattca cactaaaaga gatcaataga 6360 catctttatt agacgacgct cagtgaatac agggagtgca gactcctgcc ccctccaaca 6420 gcccccccac cctcatcccc ttcatggtcg ctgtcagaca gatccaggtc tgaaaattcc 6480 ccatcctccg aaccatcctc gtcctcatca ccaattactc gcagcccgga aaactcccgc 6540 tgaacatcct caagatttgc gtcctgagcc tcaagccagg cctcaaattc ctcgtccccc 6600 tttttgctgg acggtaggga tggggattct cgggacccct cctcttcctc ttcaaggtca 6660 ccagacagag atgctactgg ggcaacggaa gaaaagctgg gtgcggcctg tgaggatcag 6720 cttatcgatg ataagctgtc aaacatgaga attcttgaag acgaaagggc ctcgtgatac 6780 gcctattttt ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt 6840 ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt 6900 atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta 6960 tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg 7020 tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac 7080 gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg 7140 aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc 7200 gtgttgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg 7260 ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat 7320 gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg 7380 gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg 7440 atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc 7500 ctgcagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt 7560 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct 7620 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc 7680 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca 7740 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct 7800 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt 7860 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga 7920 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca 7980 aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac 8040 caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg 8100 taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag 8160 gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac 8220 cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt 8280 taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg 8340 agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc 8400 ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc 8460 gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc 8520 acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa 8580 acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggccttga agctgtccct 8640 gatggtcgtc atctacctgc ctggacagca tggcctgcaa cgcgggcatc ccgatgccgc 8700 cggaagcgag aagaatcata atggggaagg ccatccagcc tcgcgtcgcg aacgccagca 8760 agacgtagcc cagcgcgtcg gccccgagat gcgccgcgtg cggctgctgg agatggcgga 8820 cgcgatggat atgttctgcc aagggttggt ttgcgcattc acagttctcc gcaagaattg 8880 attggctcca attcttggag tggtgaatcc gttagcgagg tgccgccctg cttcatcccc 8940 gtggcccgtt gctcgcgttt gctggcggtg tccccggaag aaatatattt gcatgtcttt 9000 agttctatga tgacacaaac cccgcccagc gtcttgtcat tggcgaattc gaacacgcag 9060 atgcagtcgg ggcggcgcgg tccgaggtcc acttcgcata ttaaggtgac gcgtgtggcc 9120 tcgaacaccg agcgaccctg cagcgacccg cttaacagcg tcaacagcgt gccgcagatc 9180 ccggggggca atgagatatg aaaaagcctg aactcaccgc gacgtctgtc gagaagtttc 9240 tgatcgaaaa gttcgacagc gtctccgacc tgatgcagct ctcggagggc gaagaatctc 9300 gtgctttcag cttcgatgta ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg 9360 atggtttcta caaagatcgt tatgtttatc ggcactttgc atcggccgcg ctcccgattc 9420 cggaagtgct tgacattggg gaattcagcg agagcctgac ctattgcatc tcccgccgtg 9480 cacagggtgt cacgttgcaa gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg 9540 tcgcggaggc catggatgcg atcgctgcgg ccgatcttag ccagacgagc gggttcggcc 9600 cattcggacc gcaaggaatc ggtcaataca ctacatggcg tgatttcata tgcgcgattg 9660 ctgatcccca tgtgtatcac tggcaaactg tgatggacga caccgtcagt gcgtccgtcg 9720 cgcaggctct cgatgagctg atgctttggg ccgaggactg ccccgaagtc cggcacctcg 9780 tgcacgcgga tttcggctcc aacaatgtcc tgacggacaa tggccgcata acagcggtca 9840 ttgactggag cgaggcgatg ttcggggatt cccaatacga ggtcgccaac atcttcttct 9900 ggaggccgtg gttggcttgt atggagcagc agacgcgcta cttcgagcgg aggcatccgg 9960 agcttgcagg atcgccgcgg ctccgggcgt atatgctccg cattggtctt gaccaactct 10020 atcagagctt ggttgacggc aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg 10080 caatcgtccg atccggagcc gggactgtcg ggcgtacaca aatcgcccgc agaagcgcgg 10140 ccgtctggac cgatggctgt gtagaagtac tcgccgatag tggaaaccga cgccccagca 10200 ctcgtccgga tcgggagatg ggggaggcta actgaaacac ggaaggagac aataccggaa 10260 ggaacccgcg ctatgacggc aataaaaaga cagaataaaa cgcacgggtg ttgggtcgtt 10320 tgttcataaa cgcggggttc ggtcccaggg ctggcactct gtcgataccc caccgagacc 10380 ccattggggc caatacgccc gcgtttcttc cttttcccca ccccaccccc caagttcggg 10440 tgaaggccca gggctcgcag ccaacgtcgg ggcggcaggc cctgccatag ccactggccc 10500 cgtgggttag ggacggggtc ccccatgggg aatggtttat ggttcgtggg ggttattatt 10560 ttgggcgttg cgtggggtca ggtccacgac tggactgagc agacagaccc atggtttttg 10620 gatggcctgg gcatggaccg catgtactgg cgcgacacga acaccgggcg tctgtggctg 10680 ccaaacaccc ccgaccccca aaaaccaccg cgcggatttc tggcgtgcca agctagtcga 10740 ccaattctca tgtttgacag cttatcatcg cagatccggg caacgttgtt gccattgctg 10800 caggcgcaga actggtaggt atggaagatc catacattga atcaatattg gcaattagcc 10860 atattagtca ttggttatat agcataaatc aatattggct attggccatt gcatacgttg 10920 tatctatatc ataatatgta catttatatt ggctcatgtc caatatgacc gccat 10975 <210> 153 <211> 5774 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 153 tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 tccgctcatg aattaattct tagaaaaact catcgagcat caaatgaaac tgcaatttat 600 tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat gaaggagaaa 660 actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg attccgactc 720 gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta tcaagtgaga 780 aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc atttctttcc 840 agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca tcaaccaaac 900 cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac 960 aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat 1020 tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag 1080 tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca 1140 taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac 1200 ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg 1260 tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa tcagcatcca 1320 tgttggaatt taatcgcggc ctagagcaag acgtttcccg ttgaatatgg ctcataacac 1380 cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa 1440 cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 1500 gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 1560 gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 1620 agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 1680 aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 1740 agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 1800 cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 1860 accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 1920 aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 1980 ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 2040 cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 2100 gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 2160 tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 2220 agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg 2280 tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta 2340 caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 2400 ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 2460 gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 2520 gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 2580 gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 2640 aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 2700 ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 2760 acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 2820 ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 2880 tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 2940 tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 3000 cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 3060 gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 3120 ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggggccgc 3180 catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa 3240 ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc 3300 gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac 3360 gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca 3420 ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgagatc ccggtgccta 3480 atgagtgagc taacttacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa 3540 cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 3600 tgggcgccag ggtggttttt cttttcacca gtgagacggg caacagctga ttgcccttca 3660 ccgcctggcc ctgagagagt tgcagcaagc ggtccacgct ggtttgcccc agcaggcgaa 3720 aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg gtatcgtcgt 3780 atcccactac cgagatatcc gcaccaacgc gcagcccgga ctcggtaatg gcgcgcattg 3840 cgcccagcgc catctgatcg ttggcaacca gcatcgcagt gggaacgatg ccctcattca 3900 gcatttgcat ggtttgttga aaaccggaca tggcactcca gtcgccttcc cgttccgcta 3960 tcggctgaat ttgattgcga gtgagatatt tatgccagcc agccagacgc agacgcgccg 4020 agacagaact taatgggccc gctaacagcg cgatttgctg gtgacccaat gcgaccagat 4080 gctccacgcc cagtcgcgta ccgtcttcat gggagaaaat aatactgttg atgggtgtct 4140 ggtcagagac atcaagaaat aacgccggaa cattagtgca ggcagcttcc acagcaatgg 4200 catcctggtc atccagcgga tagttaatga tcagcccact gacgcgttgc gcgagaagat 4260 tgtgcaccgc cgctttacag gcttcgacgc cgcttcgttc taccatcgac accaccacgc 4320 tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac ggcgcgtgca 4380 gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc agttgttgtg 4440 ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt tcccgcgttt 4500 tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa gagacaccgg 4560 catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg aattgactct 4620 cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg gtgtccggga 4680 tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg 4740 ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc 4800 ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 4860 cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 4920 gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgggatctc gatcccgcga 4980 aattaatacg actcactata ggggaattgt gagcggataa caattcccct ctagaaataa 5040 ttttgtttaa ctttaagaag gagatataca tatgaaatac cttcttccga ctgctgctgc 5100 tggtctttta ctgctggctg ctcagccggc tatggctgct ggtggtggtt ctgccctcca 5160 gacggtctgc ctgaagggga ccaaggtgca catgaaatgc tttctggcct tcacccagac 5220 gaagaccttc cacgaggcca gcgaggactg catctcgcgc gggggcaccc tgagcacccc 5280 tcagactggc tcggagaacg acgccctgta tgagtacctg cgccagagcg tgggcaacga 5340 ggccgagatc tggctgggcc tcaacgacat ggcggccgag ggcacctggg tggacatgac 5400 cggtacccgc atcgcctaca agaactggga gactgagatc accgcgcaac ccgatggcgg 5460 caagaccgag aactgcgcgg tcctgtcagg cgcggccaac ggcaagtggt tcgacaagcg 5520 ctgcagggat caattgccct acatctgcca gttcgggatc gtgtacccct acgacgtgcc 5580 cgactacgcc caccaccacc accaccacta actcgagcac caccaccacc accactgaga 5640 tccggctgct aacaaagccc gaaaggaagc tgagttggct gctgccaccg ctgagcaata 5700 actagcataa ccccttgggg cctctaaacg ggtcttgagg ggttttttgc tgaaaggagg 5760 aactatatcc ggat 5774 <210> 154 <211> 4649 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 154 aagaaaccaa ttgtccatat tgcatcagac attgccgtca ctgcgtcttt tactggctct 60 tctcgctaac caaaccggta accccgctta ttaaaagcat tctgtaacaa agcgggacca 120 aagccatgac aaaaacgcgt aacaaaagtg tctataatca cggcagaaaa gtccacattg 180 attatttgca cggcgtcaca ctttgctatg ccatagcatt tttatccata agattagcgg 240 atcctacctg acgcttttta tcgcaactct ctactgtttc tccatacccg ttttttgggc 300 taacaggagg aattcaccat gaaaaagaca gctatcgcga ttgcagtggc actggctggt 360 ttcgctaccg ttgcgcaagc ttctgagcca ccaacccaga agcccaagaa gattgtaaat 420 gccaagaaag atgttgtgaa cacaaagatg tttgaggagc tcaagagccg tctggacacc 480 ctggcccagg aggtggccct gctgaaggag cagcaggccc tccagacggt ctgcctgaag 540 gggaccaagg tgcacatgaa atgctttctg gccttcaccc agacgaagac cttccacgag 600 gccagcgagg actgcatctc gcgcgggggc accctgagca cccctcagac tggctcggag 660 aacgacgccc tgtatgagta cctgcgccag agcgtgggca acgaggccga gatctggctg 720 ggcctcaacg acatggcggc cgagggcacc tgggtggaca tgaccggtac ccgcatcgcc 780 tacaagaact gggagactga gatcaccgcg caacccgatg gcggcaagac cgagaactgc 840 gcggtcctgt caggcgcggc caacggcaag tggttcgaca agcgctgcag ggatcaattg 900 ccctacatct gccagttcgg gatcgttcta gaacaaaaac tcatctcaga agaggatctg 960 aatagcgccg tcgaccatca tcatcatcat cattgagttt aaacggtctc cagcttggct 1020 gttttggcgg atgagagaag attttcagcc tgatacagat taaatcagaa cgcagaagcg 1080 gtctgataaa acagaatttg cctggcggca gtagcgcggt ggtcccacct gaccccatgc 1140 cgaactcaga agtgaaacgc cgtagcgccg atggtagtgt ggggtctccc catgcgagag 1200 tagggaactg ccaggcatca aataaaacga aaggctcagt cgaaagactg ggcctttcgt 1260 tttatctgtt gtttgtcggt gaacgctctc ctgagtagga caaatccgcc gggagcggat 1320 ttgaacgttg cgaagcaacg gcccggaggg tggcgggcag gacgcccgcc ataaactgcc 1380 aggcatcaaa ttaagcagaa ggccatcctg acggatggcc tttttgcgtt tctacaaact 1440 ctttttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct 1500 gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg 1560 cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg 1620 tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc 1680 tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca 1740 cttttaaagt tctgctatgt ggcgcggtat tatcccgtgt tgacgccggg caagagcaac 1800 tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa 1860 agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg 1920 ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt 1980 ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg 2040 aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca acaacgttgc 2100 gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga 2160 tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta 2220 ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc 2280 cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg 2340 atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt 2400 cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa 2460 ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 2520 cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt 2580 ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt 2640 tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga 2700 taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag 2760 caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata 2820 agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg 2880 gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga 2940 gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca 3000 ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa 3060 acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt 3120 tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac 3180 ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt 3240 ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga 3300 ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg tattttctcc 3360 ttacgcatct gtgcggtatt tcacaccgca tatggtgcac tctcagtaca atctgctctg 3420 atgccgcata gttaagccag tatacactcc gctatcgcta cgtgactggg tcatggctgc 3480 gccccgacac ccgccaacac ccgctgacgc gccctgacgg gcttgtctgc tcccggcatc 3540 cgcttacaga caagctgtga ccgtctccgg gagctgcatg tgtcagaggt tttcaccgtc 3600 atcaccgaaa cgcgcgaggc agcagatcaa ttcgcgcgcg aaggcgaagc ggcatgcata 3660 atgtgcctgt caaatggacg aagcagggat tctgcaaacc ctatgctact ccgtcaagcc 3720 gtcaattgtc tgattcgtta ccaattatga caacttgacg gctacatcat tcactttttc 3780 ttcacaaccg gcacggaact cgctcgggct ggccccggtg cattttttaa atacccgcga 3840 gaaatagagt tgatcgtcaa aaccaacatt gcgaccgacg gtggcgatag gcatccgggt 3900 ggtgctcaaa agcagcttcg cctggctgat acgttggtcc tcgcgccagc ttaagacgct 3960 aatccctaac tgctggcgga aaagatgtga cagacgcgac ggcgacaagc aaacatgctg 4020 tgcgacgctg gcgatatcaa aattgctgtc tgccaggtga tcgctgatgt actgacaagc 4080 ctcgcgtacc cgattatcca tcggtggatg gagcgactcg ttaatcgctt ccatgcgccg 4140 cagtaacaat tgctcaagca gatttatcgc cagcagctcc gaatagcgcc cttccccttg 4200 cccggcgtta atgatttgcc caaacaggtc gctgaaatgc ggctggtgcg cttcatccgg 4260 gcgaaagaac cccgtattgg caaatattga cggccagtta agccattcat gccagtaggc 4320 gcgcggacga aagtaaaccc actggtgata ccattcgcga gcctccggat gacgaccgta 4380 gtgatgaatc tctcctggcg ggaacagcaa aatatcaccc ggtcggcaaa caaattctcg 4440 tccctgattt ttcaccaccc cctgaccgcg aatggtgaga ttgagaatat aacctttcat 4500 tcccagcggt cggtcgataa aaaaatcgag ataaccgttg gcctcaatcg gcgttaaacc 4560 cgccaccaga tgggcattaa acgagtatcc cggcagcagg ggatcatttt gcgcttcagc 4620 catacttttc atactcccgc cattcagag 4649 <210> 155 <211> 10972 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 155 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta tcatatgcca agtccgcccc ctattgacgt caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttac gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacaccaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa taaccccgcc ccgttgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 600 gtcagatcac tagaagctgg gtaccagctg ctagcgttta aacttaagct tagcgcagag 660 gcttggggca gccgagcggc agccaggccc cggcccgggc ctcggttcca gaagggagag 720 gagcccgcca aggcgcgcaa gagagcgggc tgcctcgcag tccgagccgg agagggagcg 780 cgagccgcgc cggccccgga cggcctccga aaccatggag ctgtgggggg cctacctgct 840 gctgtgcctg ttctccctgc tgacccaggt gaccaccgag ccaccaaccc agaagcccaa 900 gaagattgta aatgccaaga aagatgttgt gaacacaaag atgtttgagg agctcaagag 960 ccgtctggac accctggccc aggaggtggc cctgctgaag gagcagcagg ccctccagac 1020 gtgcctgaag gggaccaagg tgcacatgaa atgctttctg gccttcaccc agacgaagac 1080 cttccacgag gccagcgagg actgcatctc gcgcgggggc accctgagca cccctcagac 1140 tggctcggag aacgacgccc tgtatgagta cctgcgccag agcgtgggca acgaggccga 1200 gatctggctg ggcctcaacg acatggcggc cgagggcacc tgggtggaca tgaccggtac 1260 ccgcatcgcc tacaagaact gggagactga gatcaccgcg caacccgatg gcggcaagac 1320 cgagaactgc gcggtcctgt caggcgcggc caacggcaag tggttcgaca agcgctgcag 1380 ggatcaattg ccctacatct gccagttcgg gatcgtgcac caccaccacc accactaact 1440 cgaggccggc aaggccggat ccagacatga taagatacat tgatgagttt ggacaaacca 1500 caactagaat gcagtgaaaa aaatgcttta tttgtgaaat ttgtgatgct attgctttat 1560 ttgtaaccat tataagctgc aataaacaag ttaacaacaa gaattgcatt cattttatgt 1620 ttcaggttca gggggaggtg tgggaggttt tttaaagcaa gtaaaacctc tacaaatgtg 1680 gtatggctga ttatgatccg gctgcctcgc gcgtttcggt gatgacggtg aaaacctctg 1740 acacatgcag ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca 1800 agcccgtcag gcgtcagcgg gtgttggcgg gtgtcggggc gcagccatga ggtcgactct 1860 agaggatcga tgccccgccc cggacgaact aaacctgact acgacatctc tgccccttct 1920 tcgcggggca gtgcatgtaa tcccttcagt tggttggtac aacttgccaa ctgggccctg 1980 ttccacatgt gacacggggg gggaccaaac acaaaggggt tctctgactg tagttgacat 2040 ccttataaat ggatgtgcac atttgccaac actgagtggc tttcatcctg gagcagactt 2100 tgcagtctgt ggactgcaac acaacattgc ctttatgtgt aactcttggc tgaagctctt 2160 acaccaatgc tgggggacat gtacctccca ggggcccagg aagactacgg gaggctacac 2220 caacgtcaat cagaggggcc tgtgtagcta ccgataagcg gaccctcaag agggcattag 2280 caatagtgtt tataaggccc ccttgttaac cctaaacggg tagcatatgc ttcccgggta 2340 gtagtatata ctatccagac taaccctaat tcaatagcat atgttaccca acgggaagca 2400 tatgctatcg aattagggtt agtaaaaggg tcctaaggaa cagcgatatc tcccacccca 2460 tgagctgtca cggttttatt tacatggggt caggattcca cgagggtagt gaaccatttt 2520 agtcacaagg gcagtggctg aagatcaagg agcgggcagt gaactctcct gaatcttcgc 2580 ctgcttcttc attctccttc gtttagctaa tagaataact gctgagttgt gaacagtaag 2640 gtgtatgtga ggtgctcgaa aacaaggttt caggtgacgc ccccagaata aaatttggac 2700 ggggggttca gtggtggcat tgtgctatga caccaatata accctcacaa accccttggg 2760 caataaatac tagtgtagga atgaaacatt ctgaatatct ttaacaatag aaatccatgg 2820 ggtggggaca agccgtaaag actggatgtc catctcacac gaatttatgg ctatgggcaa 2880 cacataatcc tagtgcaata tgatactggg gttattaaga tgtgtcccag gcagggacca 2940 agacaggtga accatgttgt tacactctat ttgtaacaag gggaaagaga gtggacgccg 3000 acagcagcgg actccactgg ttgtctctaa cacccccgaa aattaaacgg ggctccacgc 3060 caatggggcc cataaacaaa gacaagtggc cactcttttt tttgaaattg tggagtgggg 3120 gcacgcgtca gcccccacac gccgccctgc ggttttggac tgtaaaataa gggtgtaata 3180 acttggctga ttgtaacccc gctaaccact gcggtcaaac cacttgccca caaaaccact 3240 aatggcaccc cggggaatac ctgcataagt aggtgggcgg gccaagatag gggcgcgatt 3300 gctgcgatct ggaggacaaa ttacacacac ttgcgcctga gcgccaagca cagggttgtt 3360 ggtcctcata ttcacgaggt cgctgagagc acggtgggct aatgttgcca tgggtagcat 3420 atactaccca aatatctgga tagcatatgc tatcctaatc tatatctggg tagcataggc 3480 tatcctaatc tatatctggg tagcatatgc tatcctaatc tatatctggg tagtatatgc 3540 tatcctaatt tatatctggg tagcataggc tatcctaatc tatatctggg tagcatatgc 3600 tatcctaatc tatatctggg tagtatatgc tatcctaatc tgtatccggg tagcatatgc 3660 tatcctaata gagattaggg tagtatatgc tatcctaatt tatatctggg tagcatatac 3720 tacccaaata tctggatagc atatgctatc ctaatctata tctgggtagc atatgctatc 3780 ctaatctata tctgggtagc ataggctatc ctaatctata tctgggtagc atatgctatc 3840 ctaatctata tctgggtagt atatgctatc ctaatttata tctgggtagc ataggctatc 3900 ctaatctata tctgggtagc atatgctatc ctaatctata tctgggtagt atatgctatc 3960 ctaatctgta tccgggtagc atatgctatc ctcatgcata tacagtcagc atatgatacc 4020 cagtagtaga gtgggagtgc tatcctttgc atatgccgcc acctcccaag ggggcgtgaa 4080 ttttcgctgc ttgtcctttt cctgctggtt gctcccattc ttaggtgaat ttaaggaggc 4140 caggctaaag ccgtcgcatg tctgattgct caccaggtaa atgtcgctaa tgttttccaa 4200 cgcgagaagg tgttgagcgc ggagctgagt gacgtgacaa catgggtatg ccgaattgcc 4260 ccatgttggg aggacgaaaa tggtgacaag acagatggcc agaaatacac caacagcacg 4320 catgatgtct actggggatt tattctttag tgcgggggaa tacacggctt ttaatacgat 4380 tgagggcgtc tcctaacaag ttacatcact cctgcccttc ctcaccctca tctccatcac 4440 ctccttcatc tccgtcatct ccgtcatcac cctccgcggc agccccttcc accataggtg 4500 gaaaccaggg aggcaaatct actccatcgt caaagctgca cacagtcacc ctgatattgc 4560 aggtaggagc gggctttgtc ataacaaggt ccttaatcgc atccttcaaa acctcagcaa 4620 atatatgagt ttgtaaaaag accatgaaat aacagacaat ggactccctt agcgggccag 4680 gttgtgggcc gggtccaggg gccattccaa aggggagacg actcaatggt gtaagacgac 4740 attgtggaat agcaagggca gttcctcgcc ttaggttgta aagggaggtc ttactacctc 4800 catatacgaa cacaccggcg acccaagttc cttcgtcggt agtcctttct acgtgactcc 4860 tagccaggag agctcttaaa ccttctgcaa tgttctcaaa tttcgggttg gaacctcctt 4920 gaccacgatg ctttccaaac caccctcctt ttttgcgcct gcctccatca ccctgacccc 4980 ggggtccagt gcttgggcct tctcctgggt catctgcggg gccctgctct atcgctcccg 5040 ggggcacgtc aggctcacca tctgggccac cttcttggtg gtattcaaaa taatcggctt 5100 cccctacagg gtggaaaaat ggccttctac ctggaggggg cctgcgcggt ggagacccgg 5160 atgatgatga ctgactactg ggactcctgg gcctcttttc tccacgtcca cgacctctcc 5220 ccctggctct ttcacgactt ccccccctct ctctttcacg tcctctaccc cggcggcctc 5280 cactacctcc tcgaccccgg cctccactac ctcctcgacc ccggcctcca ctgcctcctc 5340 gaccccggcc tccacctcct gctcctgccc ctcctgctcc tgcccctcct cctgctcctg 5400 cccctcctgc ccctcctgct cctgcccctc ctgcccctcc tgctcctgcc cctcctgccc 5460 ctcctgctcc tgcccctcct gcccctcctc ctgctcctgc ccctcctgcc cctcctcctg 5520 ctcctgcccc tcctgcccct cctgctcctg cccctcctgc ccctcctgct cctgcccctc 5580 ctgcccctcc tgctcctgcc cctcctgctc ctgcccctcc tgctcctgcc cctcctgctc 5640 ctgcccctcc tgcccctcct gcccctcctc ctgctcctgc ccctcctgct cctgcccctc 5700 ctgcccctcc tgcccctcct gctcctgccc ctcctcctgc tcctgcccct cctgcccctc 5760 ctgcccctcc tcctgctcct gcccctcctg cccctcctcc tgctcctgcc cctcctcctg 5820 ctcctgcccc tcctgcccct cctgcccctc ctcctgctcc tgcccctcct gcccctcctc 5880 ctgctcctgc ccctcctcct gctcctgccc ctcctgcccc tcctgcccct cctcctgctc 5940 ctgcccctcc tcctgctcct gcccctcctg cccctcctgc ccctcctgcc cctcctcctg 6000 ctcctgcccc tcctcctgct cctgcccctc ctgctcctgc ccctcccgct cctgctcctg 6060 ctcctgttcc accgtgggtc cctttgcagc caatgcaact tggacgtttt tggggtctcc 6120 ggacaccatc tctatgtctt ggccctgatc ctgagccgcc cggggctcct ggtcttccgc 6180 ctcctcgtcc tcgtcctctt ccccgtcctc gtccatggtt atcaccccct cttctttgag 6240 gtccactgcc gccggagcct tctggtccag atgtgtctcc cttctctcct aggccatttc 6300 caggtcctgt acctggcccc tcgtcagaca tgattcacac taaaagagat caatagacat 6360 ctttattaga cgacgctcag tgaatacagg gagtgcagac tcctgccccc tccaacagcc 6420 cccccaccct catccccttc atggtcgctg tcagacagat ccaggtctga aaattcccca 6480 tcctccgaac catcctcgtc ctcatcacca attactcgca gcccggaaaa ctcccgctga 6540 acatcctcaa gatttgcgtc ctgagcctca agccaggcct caaattcctc gtcccccttt 6600 ttgctggacg gtagggatgg ggattctcgg gacccctcct cttcctcttc aaggtcacca 6660 gacagagatg ctactggggc aacggaagaa aagctgggtg cggcctgtga ggatcagctt 6720 atcgatgata agctgtcaaa catgagaatt cttgaagacg aaagggcctc gtgatacgcc 6780 tatttttata ggttaatgtc atgataataa tggtttctta gacgtcaggt ggcacttttc 6840 ggggaaatgt gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc 6900 cgctcatgag acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga 6960 gtattcaaca tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt 7020 ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag 7080 tgggttacat cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag 7140 aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgtg 7200 ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg 7260 agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca 7320 gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag 7380 gaccgaagga gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc 7440 gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg 7500 cagcaatggc aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc 7560 ggcaacaatt aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg 7620 cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg 7680 gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga 7740 cggggagtca ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac 7800 tgattaagca ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa 7860 aacttcattt ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca 7920 aaatccctta acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag 7980 gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac 8040 cgctaccagc ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa 8100 ctggcttcag cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc 8160 accacttcaa gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag 8220 tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac 8280 cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc 8340 gaacgaccta caccgaactg agatacctac agcgtgagct atgagaaagc gccacgcttc 8400 ccgaagggag aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca 8460 cgagggagct tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc 8520 tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg 8580 ccagcaacgc ggccttttta cggttcctgg ccttttgctg gccttgaagc tgtccctgat 8640 ggtcgtcatc tacctgcctg gacagcatgg cctgcaacgc gggcatcccg atgccgccgg 8700 aagcgagaag aatcataatg gggaaggcca tccagcctcg cgtcgcgaac gccagcaaga 8760 cgtagcccag cgcgtcggcc ccgagatgcg ccgcgtgcgg ctgctggaga tggcggacgc 8820 gatggatatg ttctgccaag ggttggtttg cgcattcaca gttctccgca agaattgatt 8880 ggctccaatt cttggagtgg tgaatccgtt agcgaggtgc cgccctgctt catccccgtg 8940 gcccgttgct cgcgtttgct ggcggtgtcc ccggaagaaa tatatttgca tgtctttagt 9000 tctatgatga cacaaacccc gcccagcgtc ttgtcattgg cgaattcgaa cacgcagatg 9060 cagtcggggc ggcgcggtcc gaggtccact tcgcatatta aggtgacgcg tgtggcctcg 9120 aacaccgagc gaccctgcag cgacccgctt aacagcgtca acagcgtgcc gcagatcccg 9180 gggggcaatg agatatgaaa aagcctgaac tcaccgcgac gtctgtcgag aagtttctga 9240 tcgaaaagtt cgacagcgtc tccgacctga tgcagctctc ggagggcgaa gaatctcgtg 9300 ctttcagctt cgatgtagga gggcgtggat atgtcctgcg ggtaaatagc tgcgccgatg 9360 gtttctacaa agatcgttat gtttatcggc actttgcatc ggccgcgctc ccgattccgg 9420 aagtgcttga cattggggaa ttcagcgaga gcctgaccta ttgcatctcc cgccgtgcac 9480 agggtgtcac gttgcaagac ctgcctgaaa ccgaactgcc cgctgttctg cagccggtcg 9540 cggaggccat ggatgcgatc gctgcggccg atcttagcca gacgagcggg ttcggcccat 9600 tcggaccgca aggaatcggt caatacacta catggcgtga tttcatatgc gcgattgctg 9660 atccccatgt gtatcactgg caaactgtga tggacgacac cgtcagtgcg tccgtcgcgc 9720 aggctctcga tgagctgatg ctttgggccg aggactgccc cgaagtccgg cacctcgtgc 9780 acgcggattt cggctccaac aatgtcctga cggacaatgg ccgcataaca gcggtcattg 9840 actggagcga ggcgatgttc ggggattccc aatacgaggt cgccaacatc ttcttctgga 9900 ggccgtggtt ggcttgtatg gagcagcaga cgcgctactt cgagcggagg catccggagc 9960 ttgcaggatc gccgcggctc cgggcgtata tgctccgcat tggtcttgac caactctatc 10020 agagcttggt tgacggcaat ttcgatgatg cagcttgggc gcagggtcga tgcgacgcaa 10080 tcgtccgatc cggagccggg actgtcgggc gtacacaaat cgcccgcaga agcgcggccg 10140 tctggaccga tggctgtgta gaagtactcg ccgatagtgg aaaccgacgc cccagcactc 10200 gtccggatcg ggagatgggg gaggctaact gaaacacgga aggagacaat accggaagga 10260 acccgcgcta tgacggcaat aaaaagacag aataaaacgc acgggtgttg ggtcgtttgt 10320 tcataaacgc ggggttcggt cccagggctg gcactctgtc gataccccac cgagacccca 10380 ttggggccaa tacgcccgcg tttcttcctt ttccccaccc caccccccaa gttcgggtga 10440 aggcccaggg ctcgcagcca acgtcggggc ggcaggccct gccatagcca ctggccccgt 10500 gggttaggga cggggtcccc catggggaat ggtttatggt tcgtgggggt tattattttg 10560 ggcgttgcgt ggggtcaggt ccacgactgg actgagcaga cagacccatg gtttttggat 10620 ggcctgggca tggaccgcat gtactggcgc gacacgaaca ccgggcgtct gtggctgcca 10680 aacacccccg acccccaaaa accaccgcgc ggatttctgg cgtgccaagc tagtcgacca 10740 attctcatgt ttgacagctt atcatcgcag atccgggcaa cgttgttgcc attgctgcag 10800 gcgcagaact ggtaggtatg gaagatccat acattgaatc aatattggca attagccata 10860 ttagtcattg gttatatagc ataaatcaat attggctatt ggccattgca tacgttgtat 10920 ctatatcata atatgtacat ttatattggc tcatgtccaa tatgaccgcc at 10972 <210> 156 <211> 10972 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 156 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta tcatatgcca agtccgcccc ctattgacgt caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttac gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacaccaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa taaccccgcc ccgttgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 600 gtcagatcac tagaagctgg gtaccagctg ctagcgttta aacttaagct tagcgcagag 660 gcttggggca gccgagcggc agccaggccc cggcccgggc ctcggttcca gaagggagag 720 gagcccgcca aggcgcgcaa gagagcgggc tgcctcgcag tccgagccgg agagggagcg 780 cgagccgcgc cggccccgga cggcctccga aaccatggag ctgtgggggg cctacctgct 840 gctgtgcctg ttctccctgc tgacccaggt gaccaccgag ccaccaaccc agaagcccaa 900 gaagattgta aatgccaaga aagatgttgt gaacacaaag atgtttgagg agctcaagag 960 ccgtctggac accctggccc aggaggtggc cctgctgaag gagcagcagg ccctccaggt 1020 ctgcctgaag gggaccaagg tgcacatgaa atgctttctg gccttcaccc agacgaagac 1080 cttccacgag gccagcgagg actgcatctc gcgcgggggc accctgagca cccctcagac 1140 tggctcggag aacgacgccc tgtatgagta cctgcgccag agcgtgggca acgaggccga 1200 gatctggctg ggcctcaacg acatggcggc cgagggcacc tgggtggaca tgaccggtac 1260 ccgcatcgcc tacaagaact gggagactga gatcaccgcg caacccgatg gcggcaagac 1320 cgagaactgc gcggtcctgt caggcgcggc caacggcaag tggttcgaca agcgctgcag 1380 ggatcaattg ccctacatct gccagttcgg gatcgtgcac caccaccacc accactaact 1440 cgaggccggc aaggccggat ccagacatga taagatacat tgatgagttt ggacaaacca 1500 caactagaat gcagtgaaaa aaatgcttta tttgtgaaat ttgtgatgct attgctttat 1560 ttgtaaccat tataagctgc aataaacaag ttaacaacaa gaattgcatt cattttatgt 1620 ttcaggttca gggggaggtg tgggaggttt tttaaagcaa gtaaaacctc tacaaatgtg 1680 gtatggctga ttatgatccg gctgcctcgc gcgtttcggt gatgacggtg aaaacctctg 1740 acacatgcag ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca 1800 agcccgtcag gcgtcagcgg gtgttggcgg gtgtcggggc gcagccatga ggtcgactct 1860 agaggatcga tgccccgccc cggacgaact aaacctgact acgacatctc tgccccttct 1920 tcgcggggca gtgcatgtaa tcccttcagt tggttggtac aacttgccaa ctgggccctg 1980 ttccacatgt gacacggggg gggaccaaac acaaaggggt tctctgactg tagttgacat 2040 ccttataaat ggatgtgcac atttgccaac actgagtggc tttcatcctg gagcagactt 2100 tgcagtctgt ggactgcaac acaacattgc ctttatgtgt aactcttggc tgaagctctt 2160 acaccaatgc tgggggacat gtacctccca ggggcccagg aagactacgg gaggctacac 2220 caacgtcaat cagaggggcc tgtgtagcta ccgataagcg gaccctcaag agggcattag 2280 caatagtgtt tataaggccc ccttgttaac cctaaacggg tagcatatgc ttcccgggta 2340 gtagtatata ctatccagac taaccctaat tcaatagcat atgttaccca acgggaagca 2400 tatgctatcg aattagggtt agtaaaaggg tcctaaggaa cagcgatatc tcccacccca 2460 tgagctgtca cggttttatt tacatggggt caggattcca cgagggtagt gaaccatttt 2520 agtcacaagg gcagtggctg aagatcaagg agcgggcagt gaactctcct gaatcttcgc 2580 ctgcttcttc attctccttc gtttagctaa tagaataact gctgagttgt gaacagtaag 2640 gtgtatgtga ggtgctcgaa aacaaggttt caggtgacgc ccccagaata aaatttggac 2700 ggggggttca gtggtggcat tgtgctatga caccaatata accctcacaa accccttggg 2760 caataaatac tagtgtagga atgaaacatt ctgaatatct ttaacaatag aaatccatgg 2820 ggtggggaca agccgtaaag actggatgtc catctcacac gaatttatgg ctatgggcaa 2880 cacataatcc tagtgcaata tgatactggg gttattaaga tgtgtcccag gcagggacca 2940 agacaggtga accatgttgt tacactctat ttgtaacaag gggaaagaga gtggacgccg 3000 acagcagcgg actccactgg ttgtctctaa cacccccgaa aattaaacgg ggctccacgc 3060 caatggggcc cataaacaaa gacaagtggc cactcttttt tttgaaattg tggagtgggg 3120 gcacgcgtca gcccccacac gccgccctgc ggttttggac tgtaaaataa gggtgtaata 3180 acttggctga ttgtaacccc gctaaccact gcggtcaaac cacttgccca caaaaccact 3240 aatggcaccc cggggaatac ctgcataagt aggtgggcgg gccaagatag gggcgcgatt 3300 gctgcgatct ggaggacaaa ttacacacac ttgcgcctga gcgccaagca cagggttgtt 3360 ggtcctcata ttcacgaggt cgctgagagc acggtgggct aatgttgcca tgggtagcat 3420 atactaccca aatatctgga tagcatatgc tatcctaatc tatatctggg tagcataggc 3480 tatcctaatc tatatctggg tagcatatgc tatcctaatc tatatctggg tagtatatgc 3540 tatcctaatt tatatctggg tagcataggc tatcctaatc tatatctggg tagcatatgc 3600 tatcctaatc tatatctggg tagtatatgc tatcctaatc tgtatccggg tagcatatgc 3660 tatcctaata gagattaggg tagtatatgc tatcctaatt tatatctggg tagcatatac 3720 tacccaaata tctggatagc atatgctatc ctaatctata tctgggtagc atatgctatc 3780 ctaatctata tctgggtagc ataggctatc ctaatctata tctgggtagc atatgctatc 3840 ctaatctata tctgggtagt atatgctatc ctaatttata tctgggtagc ataggctatc 3900 ctaatctata tctgggtagc atatgctatc ctaatctata tctgggtagt atatgctatc 3960 ctaatctgta tccgggtagc atatgctatc ctcatgcata tacagtcagc atatgatacc 4020 cagtagtaga gtgggagtgc tatcctttgc atatgccgcc acctcccaag ggggcgtgaa 4080 ttttcgctgc ttgtcctttt cctgctggtt gctcccattc ttaggtgaat ttaaggaggc 4140 caggctaaag ccgtcgcatg tctgattgct caccaggtaa atgtcgctaa tgttttccaa 4200 cgcgagaagg tgttgagcgc ggagctgagt gacgtgacaa catgggtatg ccgaattgcc 4260 ccatgttggg aggacgaaaa tggtgacaag acagatggcc agaaatacac caacagcacg 4320 catgatgtct actggggatt tattctttag tgcgggggaa tacacggctt ttaatacgat 4380 tgagggcgtc tcctaacaag ttacatcact cctgcccttc ctcaccctca tctccatcac 4440 ctccttcatc tccgtcatct ccgtcatcac cctccgcggc agccccttcc accataggtg 4500 gaaaccaggg aggcaaatct actccatcgt caaagctgca cacagtcacc ctgatattgc 4560 aggtaggagc gggctttgtc ataacaaggt ccttaatcgc atccttcaaa acctcagcaa 4620 atatatgagt ttgtaaaaag accatgaaat aacagacaat ggactccctt agcgggccag 4680 gttgtgggcc gggtccaggg gccattccaa aggggagacg actcaatggt gtaagacgac 4740 attgtggaat agcaagggca gttcctcgcc ttaggttgta aagggaggtc ttactacctc 4800 catatacgaa cacaccggcg acccaagttc cttcgtcggt agtcctttct acgtgactcc 4860 tagccaggag agctcttaaa ccttctgcaa tgttctcaaa tttcgggttg gaacctcctt 4920 gaccacgatg ctttccaaac caccctcctt ttttgcgcct gcctccatca ccctgacccc 4980 ggggtccagt gcttgggcct tctcctgggt catctgcggg gccctgctct atcgctcccg 5040 ggggcacgtc aggctcacca tctgggccac cttcttggtg gtattcaaaa taatcggctt 5100 cccctacagg gtggaaaaat ggccttctac ctggaggggg cctgcgcggt ggagacccgg 5160 atgatgatga ctgactactg ggactcctgg gcctcttttc tccacgtcca cgacctctcc 5220 ccctggctct ttcacgactt ccccccctct ctctttcacg tcctctaccc cggcggcctc 5280 cactacctcc tcgaccccgg cctccactac ctcctcgacc ccggcctcca ctgcctcctc 5340 gaccccggcc tccacctcct gctcctgccc ctcctgctcc tgcccctcct cctgctcctg 5400 cccctcctgc ccctcctgct cctgcccctc ctgcccctcc tgctcctgcc cctcctgccc 5460 ctcctgctcc tgcccctcct gcccctcctc ctgctcctgc ccctcctgcc cctcctcctg 5520 ctcctgcccc tcctgcccct cctgctcctg cccctcctgc ccctcctgct cctgcccctc 5580 ctgcccctcc tgctcctgcc cctcctgctc ctgcccctcc tgctcctgcc cctcctgctc 5640 ctgcccctcc tgcccctcct gcccctcctc ctgctcctgc ccctcctgct cctgcccctc 5700 ctgcccctcc tgcccctcct gctcctgccc ctcctcctgc tcctgcccct cctgcccctc 5760 ctgcccctcc tcctgctcct gcccctcctg cccctcctcc tgctcctgcc cctcctcctg 5820 ctcctgcccc tcctgcccct cctgcccctc ctcctgctcc tgcccctcct gcccctcctc 5880 ctgctcctgc ccctcctcct gctcctgccc ctcctgcccc tcctgcccct cctcctgctc 5940 ctgcccctcc tcctgctcct gcccctcctg cccctcctgc ccctcctgcc cctcctcctg 6000 ctcctgcccc tcctcctgct cctgcccctc ctgctcctgc ccctcccgct cctgctcctg 6060 ctcctgttcc accgtgggtc cctttgcagc caatgcaact tggacgtttt tggggtctcc 6120 ggacaccatc tctatgtctt ggccctgatc ctgagccgcc cggggctcct ggtcttccgc 6180 ctcctcgtcc tcgtcctctt ccccgtcctc gtccatggtt atcaccccct cttctttgag 6240 gtccactgcc gccggagcct tctggtccag atgtgtctcc cttctctcct aggccatttc 6300 caggtcctgt acctggcccc tcgtcagaca tgattcacac taaaagagat caatagacat 6360 ctttattaga cgacgctcag tgaatacagg gagtgcagac tcctgccccc tccaacagcc 6420 cccccaccct catccccttc atggtcgctg tcagacagat ccaggtctga aaattcccca 6480 tcctccgaac catcctcgtc ctcatcacca attactcgca gcccggaaaa ctcccgctga 6540 acatcctcaa gatttgcgtc ctgagcctca agccaggcct caaattcctc gtcccccttt 6600 ttgctggacg gtagggatgg ggattctcgg gacccctcct cttcctcttc aaggtcacca 6660 gacagagatg ctactggggc aacggaagaa aagctgggtg cggcctgtga ggatcagctt 6720 atcgatgata agctgtcaaa catgagaatt cttgaagacg aaagggcctc gtgatacgcc 6780 tatttttata ggttaatgtc atgataataa tggtttctta gacgtcaggt ggcacttttc 6840 ggggaaatgt gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc 6900 cgctcatgag acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga 6960 gtattcaaca tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt 7020 ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag 7080 tgggttacat cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag 7140 aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgtg 7200 ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg 7260 agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca 7320 gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag 7380 gaccgaagga gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc 7440 gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg 7500 cagcaatggc aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc 7560 ggcaacaatt aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg 7620 cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg 7680 gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga 7740 cggggagtca ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac 7800 tgattaagca ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa 7860 aacttcattt ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca 7920 aaatccctta acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag 7980 gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac 8040 cgctaccagc ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa 8100 ctggcttcag cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc 8160 accacttcaa gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag 8220 tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac 8280 cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc 8340 gaacgaccta caccgaactg agatacctac agcgtgagct atgagaaagc gccacgcttc 8400 ccgaagggag aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca 8460 cgagggagct tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc 8520 tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg 8580 ccagcaacgc ggccttttta cggttcctgg ccttttgctg gccttgaagc tgtccctgat 8640 ggtcgtcatc tacctgcctg gacagcatgg cctgcaacgc gggcatcccg atgccgccgg 8700 aagcgagaag aatcataatg gggaaggcca tccagcctcg cgtcgcgaac gccagcaaga 8760 cgtagcccag cgcgtcggcc ccgagatgcg ccgcgtgcgg ctgctggaga tggcggacgc 8820 gatggatatg ttctgccaag ggttggtttg cgcattcaca gttctccgca agaattgatt 8880 ggctccaatt cttggagtgg tgaatccgtt agcgaggtgc cgccctgctt catccccgtg 8940 gcccgttgct cgcgtttgct ggcggtgtcc ccggaagaaa tatatttgca tgtctttagt 9000 tctatgatga cacaaacccc gcccagcgtc ttgtcattgg cgaattcgaa cacgcagatg 9060 cagtcggggc ggcgcggtcc gaggtccact tcgcatatta aggtgacgcg tgtggcctcg 9120 aacaccgagc gaccctgcag cgacccgctt aacagcgtca acagcgtgcc gcagatcccg 9180 gggggcaatg agatatgaaa aagcctgaac tcaccgcgac gtctgtcgag aagtttctga 9240 tcgaaaagtt cgacagcgtc tccgacctga tgcagctctc ggagggcgaa gaatctcgtg 9300 ctttcagctt cgatgtagga gggcgtggat atgtcctgcg ggtaaatagc tgcgccgatg 9360 gtttctacaa agatcgttat gtttatcggc actttgcatc ggccgcgctc ccgattccgg 9420 aagtgcttga cattggggaa ttcagcgaga gcctgaccta ttgcatctcc cgccgtgcac 9480 agggtgtcac gttgcaagac ctgcctgaaa ccgaactgcc cgctgttctg cagccggtcg 9540 cggaggccat ggatgcgatc gctgcggccg atcttagcca gacgagcggg ttcggcccat 9600 tcggaccgca aggaatcggt caatacacta catggcgtga tttcatatgc gcgattgctg 9660 atccccatgt gtatcactgg caaactgtga tggacgacac cgtcagtgcg tccgtcgcgc 9720 aggctctcga tgagctgatg ctttgggccg aggactgccc cgaagtccgg cacctcgtgc 9780 acgcggattt cggctccaac aatgtcctga cggacaatgg ccgcataaca gcggtcattg 9840 actggagcga ggcgatgttc ggggattccc aatacgaggt cgccaacatc ttcttctgga 9900 ggccgtggtt ggcttgtatg gagcagcaga cgcgctactt cgagcggagg catccggagc 9960 ttgcaggatc gccgcggctc cgggcgtata tgctccgcat tggtcttgac caactctatc 10020 agagcttggt tgacggcaat ttcgatgatg cagcttgggc gcagggtcga tgcgacgcaa 10080 tcgtccgatc cggagccggg actgtcgggc gtacacaaat cgcccgcaga agcgcggccg 10140 tctggaccga tggctgtgta gaagtactcg ccgatagtgg aaaccgacgc cccagcactc 10200 gtccggatcg ggagatgggg gaggctaact gaaacacgga aggagacaat accggaagga 10260 acccgcgcta tgacggcaat aaaaagacag aataaaacgc acgggtgttg ggtcgtttgt 10320 tcataaacgc ggggttcggt cccagggctg gcactctgtc gataccccac cgagacccca 10380 ttggggccaa tacgcccgcg tttcttcctt ttccccaccc caccccccaa gttcgggtga 10440 aggcccaggg ctcgcagcca acgtcggggc ggcaggccct gccatagcca ctggccccgt 10500 gggttaggga cggggtcccc catggggaat ggtttatggt tcgtgggggt tattattttg 10560 ggcgttgcgt ggggtcaggt ccacgactgg actgagcaga cagacccatg gtttttggat 10620 ggcctgggca tggaccgcat gtactggcgc gacacgaaca ccgggcgtct gtggctgcca 10680 aacacccccg acccccaaaa accaccgcgc ggatttctgg cgtgccaagc tagtcgacca 10740 attctcatgt ttgacagctt atcatcgcag atccgggcaa cgttgttgcc attgctgcag 10800 gcgcagaact ggtaggtatg gaagatccat acattgaatc aatattggca attagccata 10860 ttagtcattg gttatatagc ataaatcaat attggctatt ggccattgca tacgttgtat 10920 ctatatcata atatgtacat ttatattggc tcatgtccaa tatgaccgcc at 10972 <210> 157 <211> 10969 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 157 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta tcatatgcca agtccgcccc ctattgacgt caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttac gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacaccaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa taaccccgcc ccgttgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 600 gtcagatcac tagaagctgg gtaccagctg ctagcgttta aacttaagct tagcgcagag 660 gcttggggca gccgagcggc agccaggccc cggcccgggc ctcggttcca gaagggagag 720 gagcccgcca aggcgcgcaa gagagcgggc tgcctcgcag tccgagccgg agagggagcg 780 cgagccgcgc cggccccgga cggcctccga aaccatggag ctgtgggggg cctacctgct 840 gctgtgcctg ttctccctgc tgacccaggt gaccaccgag ccaccaaccc agaagcccaa 900 gaagattgta aatgccaaga aagatgttgt gaacacaaag atgtttgagg agctcaagag 960 ccgtctggac accctggccc aggaggtggc cctgctgaag gagcagcagg ccctccagtg 1020 cctgaagggg accaaggtgc acatgaaatg ctttctggcc ttcacccaga cgaagacctt 1080 ccacgaggcc agcgaggact gcatctcgcg cgggggcacc ctgagcaccc ctcagactgg 1140 ctcggagaac gacgccctgt atgagtacct gcgccagagc gtgggcaacg aggccgagat 1200 ctggctgggc ctcaacgaca tggcggccga gggcacctgg gtggacatga ccggtacccg 1260 catcgcctac aagaactggg agactgagat caccgcgcaa cccgatggcg gcaagaccga 1320 gaactgcgcg gtcctgtcag gcgcggccaa cggcaagtgg ttcgacaagc gctgcaggga 1380 tcaattgccc tacatctgcc agttcgggat cgtgcaccac caccaccacc actaactcga 1440 ggccggcaag gccggatcca gacatgataa gatacattga tgagtttgga caaaccacaa 1500 ctagaatgca gtgaaaaaaa tgctttattt gtgaaatttg tgatgctatt gctttatttg 1560 taaccattat aagctgcaat aaacaagtta acaacaagaa ttgcattcat tttatgtttc 1620 aggttcaggg ggaggtgtgg gaggtttttt aaagcaagta aaacctctac aaatgtggta 1680 tggctgatta tgatccggct gcctcgcgcg tttcggtgat gacggtgaaa acctctgaca 1740 catgcagctc ccggagacgg tcacagcttg tctgtaagcg gatgccggga gcagacaagc 1800 ccgtcaggcg tcagcgggtg ttggcgggtg tcggggcgca gccatgaggt cgactctaga 1860 ggatcgatgc cccgccccgg acgaactaaa cctgactacg acatctctgc cccttcttcg 1920 cggggcagtg catgtaatcc cttcagttgg ttggtacaac ttgccaactg ggccctgttc 1980 cacatgtgac acgggggggg accaaacaca aaggggttct ctgactgtag ttgacatcct 2040 tataaatgga tgtgcacatt tgccaacact gagtggcttt catcctggag cagactttgc 2100 agtctgtgga ctgcaacaca acattgcctt tatgtgtaac tcttggctga agctcttaca 2160 ccaatgctgg gggacatgta cctcccaggg gcccaggaag actacgggag gctacaccaa 2220 cgtcaatcag aggggcctgt gtagctaccg ataagcggac cctcaagagg gcattagcaa 2280 tagtgtttat aaggccccct tgttaaccct aaacgggtag catatgcttc ccgggtagta 2340 gtatatacta tccagactaa ccctaattca atagcatatg ttacccaacg ggaagcatat 2400 gctatcgaat tagggttagt aaaagggtcc taaggaacag cgatatctcc caccccatga 2460 gctgtcacgg ttttatttac atggggtcag gattccacga gggtagtgaa ccattttagt 2520 cacaagggca gtggctgaag atcaaggagc gggcagtgaa ctctcctgaa tcttcgcctg 2580 cttcttcatt ctccttcgtt tagctaatag aataactgct gagttgtgaa cagtaaggtg 2640 tatgtgaggt gctcgaaaac aaggtttcag gtgacgcccc cagaataaaa tttggacggg 2700 gggttcagtg gtggcattgt gctatgacac caatataacc ctcacaaacc ccttgggcaa 2760 taaatactag tgtaggaatg aaacattctg aatatcttta acaatagaaa tccatggggt 2820 ggggacaagc cgtaaagact ggatgtccat ctcacacgaa tttatggcta tgggcaacac 2880 ataatcctag tgcaatatga tactggggtt attaagatgt gtcccaggca gggaccaaga 2940 caggtgaacc atgttgttac actctatttg taacaagggg aaagagagtg gacgccgaca 3000 gcagcggact ccactggttg tctctaacac ccccgaaaat taaacggggc tccacgccaa 3060 tggggcccat aaacaaagac aagtggccac tctttttttt gaaattgtgg agtgggggca 3120 cgcgtcagcc cccacacgcc gccctgcggt tttggactgt aaaataaggg tgtaataact 3180 tggctgattg taaccccgct aaccactgcg gtcaaaccac ttgcccacaa aaccactaat 3240 ggcaccccgg ggaatacctg cataagtagg tgggcgggcc aagatagggg cgcgattgct 3300 gcgatctgga ggacaaatta cacacacttg cgcctgagcg ccaagcacag ggttgttggt 3360 cctcatattc acgaggtcgc tgagagcacg gtgggctaat gttgccatgg gtagcatata 3420 ctacccaaat atctggatag catatgctat cctaatctat atctgggtag cataggctat 3480 cctaatctat atctgggtag catatgctat cctaatctat atctgggtag tatatgctat 3540 cctaatttat atctgggtag cataggctat cctaatctat atctgggtag catatgctat 3600 cctaatctat atctgggtag tatatgctat cctaatctgt atccgggtag catatgctat 3660 cctaatagag attagggtag tatatgctat cctaatttat atctgggtag catatactac 3720 ccaaatatct ggatagcata tgctatccta atctatatct gggtagcata tgctatccta 3780 atctatatct gggtagcata ggctatccta atctatatct gggtagcata tgctatccta 3840 atctatatct gggtagtata tgctatccta atttatatct gggtagcata ggctatccta 3900 atctatatct gggtagcata tgctatccta atctatatct gggtagtata tgctatccta 3960 atctgtatcc gggtagcata tgctatcctc atgcatatac agtcagcata tgatacccag 4020 tagtagagtg ggagtgctat cctttgcata tgccgccacc tcccaagggg gcgtgaattt 4080 tcgctgcttg tccttttcct gctggttgct cccattctta ggtgaattta aggaggccag 4140 gctaaagccg tcgcatgtct gattgctcac caggtaaatg tcgctaatgt tttccaacgc 4200 gagaaggtgt tgagcgcgga gctgagtgac gtgacaacat gggtatgccg aattgcccca 4260 tgttgggagg acgaaaatgg tgacaagaca gatggccaga aatacaccaa cagcacgcat 4320 gatgtctact ggggatttat tctttagtgc gggggaatac acggctttta atacgattga 4380 gggcgtctcc taacaagtta catcactcct gcccttcctc accctcatct ccatcacctc 4440 cttcatctcc gtcatctccg tcatcaccct ccgcggcagc cccttccacc ataggtggaa 4500 accagggagg caaatctact ccatcgtcaa agctgcacac agtcaccctg atattgcagg 4560 taggagcggg ctttgtcata acaaggtcct taatcgcatc cttcaaaacc tcagcaaata 4620 tatgagtttg taaaaagacc atgaaataac agacaatgga ctcccttagc gggccaggtt 4680 gtgggccggg tccaggggcc attccaaagg ggagacgact caatggtgta agacgacatt 4740 gtggaatagc aagggcagtt cctcgcctta ggttgtaaag ggaggtctta ctacctccat 4800 atacgaacac accggcgacc caagttcctt cgtcggtagt cctttctacg tgactcctag 4860 ccaggagagc tcttaaacct tctgcaatgt tctcaaattt cgggttggaa cctccttgac 4920 cacgatgctt tccaaaccac cctccttttt tgcgcctgcc tccatcaccc tgaccccggg 4980 gtccagtgct tgggccttct cctgggtcat ctgcggggcc ctgctctatc gctcccgggg 5040 gcacgtcagg ctcaccatct gggccacctt cttggtggta ttcaaaataa tcggcttccc 5100 ctacagggtg gaaaaatggc cttctacctg gagggggcct gcgcggtgga gacccggatg 5160 atgatgactg actactggga ctcctgggcc tcttttctcc acgtccacga cctctccccc 5220 tggctctttc acgacttccc cccctggctc tttcacgtcc tctaccccgg cggcctccac 5280 tacctcctcg accccggcct ccactacctc ctcgaccccg gcctccactg cctcctcgac 5340 cccggcctcc acctcctgct cctgcccctc ctgctcctgc ccctcctcct gctcctgccc 5400 ctcctgcccc tcctgctcct gcccctcctg cccctcctgc tcctgcccct cctgcccctc 5460 ctgctcctgc ccctcctgcc cctcctcctg ctcctgcccc tcctgcccct cctcctgctc 5520 ctgcccctcc tgcccctcct gctcctgccc ctcctgcccc tcctgctcct gcccctcctg 5580 cccctcctgc tcctgcccct cctgctcctg cccctcctgc tcctgcccct cctgctcctg 5640 cccctcctgc ccctcctgcc cctcctcctg ctcctgcccc tcctgctcct gcccctcctg 5700 cccctcctgc ccctcctgct cctgcccctc ctcctgctcc tgcccctcct gcccctcctg 5760 cccctcctcc tgctcctgcc cctcctgccc ctcctcctgc tcctgcccct cctcctgctc 5820 ctgcccctcc tgcccctcct gcccctcctc ctgctcctgc ccctcctgcc cctcctcctg 5880 ctcctgcccc tcctcctgct cctgcccctc ctgcccctcc tgcccctcct cctgctcctg 5940 cccctcctcc tgctcctgcc cctcctgccc ctcctgcccc tcctgcccct cctcctgctc 6000 ctgcccctcc tcctgctcct gcccctcctg ctcctgcccc tcccgctcct gctcctgctc 6060 ctgttccacc gtgggtccct ttgcagccaa tgcaacttgg acgtttttgg ggtctccgga 6120 caccatctct atgtcttggc cctgatcctg agccgcccgg ggctcctggt cttccgcctc 6180 ctcgtcctcg tcctcttccc cgtcctcgtc catggttatc accccctctt ctttgaggtc 6240 cactgccgcc ggagccttct ggtccagatg tgtctccctt ctctcctagg ccatttccag 6300 gtcctgtacc tggcccctcg tcagacatga ttcacactaa aagagatcaa tagacatctt 6360 tattagacga cgctcagtga atacagggag tgcagactcc tgccccctcc aacagccccc 6420 ccaccctcat ccccttcatg gtcgctgtca gacagatcca ggtctgaaaa ttccccatcc 6480 tccgaaccat cctcgtcctc atcaccaatt actcgcagcc cggaaaactc ccgctgaaca 6540 tcctcaagat ttgcgtcctg agcctcaagc caggcctcaa attcctcgtc cccctttttg 6600 ctggacggta gggatgggga ttctcgggac ccctcctctt cctcttcaag gtcaccagac 6660 agagatgcta ctggggcaac ggaagaaaag ctgggtgcgg cctgtgagga tcagcttatc 6720 gatgataagc tgtcaaacat gagaattctt gaagacgaaa gggcctcgtg atacgcctat 6780 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg 6840 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc 6900 tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta 6960 ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg 7020 ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg 7080 gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac 7140 gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtgttg 7200 acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt 7260 actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg 7320 ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac 7380 cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt 7440 gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgcag 7500 caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc 7560 aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc 7620 ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta 7680 tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg 7740 ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga 7800 ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac 7860 ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa 7920 tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat 7980 cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc 8040 taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg 8100 gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc 8160 acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg 8220 ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg 8280 ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa 8340 cgacctacac cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgcttcccg 8400 aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga 8460 gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct 8520 gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca 8580 gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttgaagctgt ccctgatggt 8640 cgtcatctac ctgcctggac agcatggcct gcaacgcggg catcccgatg ccgccggaag 8700 cgagaagaat cataatgggg aaggccatcc agcctcgcgt cgcgaacgcc agcaagacgt 8760 agcccagcgc gtcggccccg agatgcgccg cgtgcggctg ctggagatgg cggacgcgat 8820 ggatatgttc tgccaagggt tggtttgcgc attcacagtt ctccgcaaga attgattggc 8880 tccaattctt ggagtggtga atccgttagc gaggtgccgc cctgcttcat ccccgtggcc 8940 cgttgctcgc gtttgctggc ggtgtccccg gaagaaatat atttgcatgt ctttagttct 9000 atgatgacac aaaccccgcc cagcgtcttg tcattggcga attcgaacac gcagatgcag 9060 tcggggcggc gcggtccgag gtccacttcg catattaagg tgacgcgtgt ggcctcgaac 9120 accgagcgac cctgcagcga cccgcttaac agcgtcaaca gcgtgccgca gatcccgggg 9180 ggcaatgaga tatgaaaaag cctgaactca ccgcgacgtc tgtcgagaag tttctgatcg 9240 aaaagttcga cagcgtctcc gacctgatgc agctctcgga gggcgaagaa tctcgtgctt 9300 tcagcttcga tgtaggaggg cgtggatatg tcctgcgggt aaatagctgc gccgatggtt 9360 tctacaaaga tcgttatgtt tatcggcact ttgcatcggc cgcgctcccg attccggaag 9420 tgcttgacat tggggaattc agcgagagcc tgacctattg catctcccgc cgtgcacagg 9480 gtgtcacgtt gcaagacctg cctgaaaccg aactgcccgc tgttctgcag ccggtcgcgg 9540 aggccatgga tgcgatcgct gcggccgatc ttagccagac gagcgggttc ggcccattcg 9600 gaccgcaagg aatcggtcaa tacactacat ggcgtgattt catatgcgcg attgctgatc 9660 cccatgtgta tcactggcaa actgtgatgg acgacaccgt cagtgcgtcc gtcgcgcagg 9720 ctctcgatga gctgatgctt tgggccgagg actgccccga agtccggcac ctcgtgcacg 9780 cggatttcgg ctccaacaat gtcctgacgg acaatggccg cataacagcg gtcattgact 9840 ggagcgaggc gatgttcggg gattcccaat acgaggtcgc caacatcttc ttctggaggc 9900 cgtggttggc ttgtatggag cagcagacgc gctacttcga gcggaggcat ccggagcttg 9960 caggatcgcc gcggctccgg gcgtatatgc tccgcattgg tcttgaccaa ctctatcaga 10020 gcttggttga cggcaatttc gatgatgcag cttgggcgca gggtcgatgc gacgcaatcg 10080 tccgatccgg agccgggact gtcgggcgta cacaaatcgc ccgcagaagc gcggccgtct 10140 ggaccgatgg ctgtgtagaa gtactcgccg atagtggaaa ccgacgcccc agcactcgtc 10200 cggatcggga gatgggggag gctaactgaa acacggaagg agacaatacc ggaaggaacc 10260 cgcgctatga cggcaataaa aagacagaat aaaacgcacg ggtgttgggt cgtttgttca 10320 taaacgcggg gttcggtccc agggctggca ctctgtcgat accccaccga gaccccattg 10380 gggccaatac gcccgcgttt cttccttttc cccaccccac cccccaagtt cgggtgaagg 10440 cccagggctc gcagccaacg tcggggcggc aggccctgcc atagccactg gccccgtggg 10500 ttagggacgg ggtcccccat ggggaatggt ttatggttcg tgggggttat tattttgggc 10560 gttgcgtggg gtcaggtcca cgactggact gagcagacag acccatggtt tttggatggc 10620 ctgggcatgg accgcatgta ctggcgcgac acgaacaccg ggcgtctgtg gctgccaaac 10680 acccccgacc cccaaaaacc accgcgcgga tttctggcgt gccaagctag tcgaccaatt 10740 ctcatgtttg acagcttatc atcgcagatc cgggcaacgt tgttgccatt gctgcaggcg 10800 cagaactggt aggtatggaa gatccataca ttgaatcaat attggcaatt agccatatta 10860 gtcattggtt atatagcata aatcaatatt ggctattggc cattgcatac gttgtatcta 10920 tatcataata tgtacattta tattggctca tgtccaatat gaccgccat 10969 <210> 158 <211> 10975 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 158 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta tcatatgcca agtccgcccc ctattgacgt caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttac gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacaccaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa taaccccgcc ccgttgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 600 gtcagatcac tagaagctgg gtaccagctg ctagcgttta aacttaagct tagcgcagag 660 gcttggggca gccgagcggc agccaggccc cggcccgggc ctcggttcca gaagggagag 720 gagcccgcca aggcgcgcaa gagagcgggc tgcctcgcag tccgagccgg agagggagcg 780 cgagccgcgc cggccccgga cggcctccga aaccatggag ctgtgggggg cctacctgct 840 gctgtgcctg ttctccctgc tgacccaggt gaccaccgag ccaccaaccc agaagcccaa 900 gaagattgta aatgccaaga aagatgttgt gaacacaaag atgtttgagg agctcaagag 960 ccgtctggac accctggccc aggaggtggc cctgctgaag gagcagcagg ccctccagac 1020 ggtcagcctg aaggggacca aggtgcacat gaaaagcttt ctggccttca cccagacgaa 1080 gaccttccac gaggccagcg aggactgcat ctcgcgcggg ggcaccctga gcacccctca 1140 gactggctcg gagaacgacg ccctgtatga gtacctgcgc cagagcgtgg gcaacgaggc 1200 cgagatctgg ctgggcctca acgacatggc ggccgagggc acctgggtgg acatgaccgg 1260 tacccgcatc gcctacaaga actgggagac tgagatcacc gcgcaacccg atggcggcaa 1320 gaccgagaac tgcgcggtcc tgtcaggcgc ggccaacggc aagtggttcg acaagcgctg 1380 cagggatcaa ttgccctaca tctgccagtt cgggatcgtg caccaccacc accaccacta 1440 actcgaggcc ggcaaggccg gatccagaca tgataagata cattgatgag tttggacaaa 1500 ccacaactag aatgcagtga aaaaaatgct ttatttgtga aatttgtgat gctattgctt 1560 tatttgtaac cattataagc tgcaataaac aagttaacaa caagaattgc attcatttta 1620 tgtttcaggt tcagggggag gtgtgggagg ttttttaaag caagtaaaac ctctacaaat 1680 gtggtatggc tgattatgat ccggctgcct cgcgcgtttc ggtgatgacg gtgaaaacct 1740 ctgacacatg cagctcccgg agacggtcac agcttgtctg taagcggatg ccgggagcag 1800 acaagcccgt caggcgtcag cgggtgttgg cgggtgtcgg ggcgcagcca tgaggtcgac 1860 tctagaggat cgatgccccg ccccggacga actaaacctg actacgacat ctctgcccct 1920 tcttcgcggg gcagtgcatg taatcccttc agttggttgg tacaacttgc caactgggcc 1980 ctgttccaca tgtgacacgg ggggggacca aacacaaagg ggttctctga ctgtagttga 2040 catccttata aatggatgtg cacatttgcc aacactgagt ggctttcatc ctggagcaga 2100 ctttgcagtc tgtggactgc aacacaacat tgcctttatg tgtaactctt ggctgaagct 2160 cttacaccaa tgctggggga catgtacctc ccaggggccc aggaagacta cgggaggcta 2220 caccaacgtc aatcagaggg gcctgtgtag ctaccgataa gcggaccctc aagagggcat 2280 tagcaatagt gtttataagg cccccttgtt aaccctaaac gggtagcata tgcttcccgg 2340 gtagtagtat atactatcca gactaaccct aattcaatag catatgttac ccaacgggaa 2400 gcatatgcta tcgaattagg gttagtaaaa gggtcctaag gaacagcgat atctcccacc 2460 ccatgagctg tcacggtttt atttacatgg ggtcaggatt ccacgagggt agtgaaccat 2520 tttagtcaca agggcagtgg ctgaagatca aggagcgggc agtgaactct cctgaatctt 2580 cgcctgcttc ttcattctcc ttcgtttagc taatagaata actgctgagt tgtgaacagt 2640 aaggtgtatg tgaggtgctc gaaaacaagg tttcaggtga cgcccccaga ataaaatttg 2700 gacggggggt tcagtggtgg cattgtgcta tgacaccaat ataaccctca caaacccctt 2760 gggcaataaa tactagtgta ggaatgaaac attctgaata tctttaacaa tagaaatcca 2820 tggggtgggg acaagccgta aagactggat gtccatctca cacgaattta tggctatggg 2880 caacacataa tcctagtgca atatgatact ggggttatta agatgtgtcc caggcaggga 2940 ccaagacagg tgaaccatgt tgttacactc tatttgtaac aaggggaaag agagtggacg 3000 ccgacagcag cggactccac tggttgtctc taacaccccc gaaaattaaa cggggctcca 3060 cgccaatggg gcccataaac aaagacaagt ggccactctt ttttttgaaa ttgtggagtg 3120 ggggcacgcg tcagccccca cacgccgccc tgcggttttg gactgtaaaa taagggtgta 3180 ataacttggc tgattgtaac cccgctaacc actgcggtca aaccacttgc ccacaaaacc 3240 actaatggca ccccggggaa tacctgcata agtaggtggg cgggccaaga taggggcgcg 3300 attgctgcga tctggaggac aaattacaca cacttgcgcc tgagcgccaa gcacagggtt 3360 gttggtcctc atattcacga ggtcgctgag agcacggtgg gctaatgttg ccatgggtag 3420 catatactac ccaaatatct ggatagcata tgctatccta atctatatct gggtagcata 3480 ggctatccta atctatatct gggtagcata tgctatccta atctatatct gggtagtata 3540 tgctatccta atttatatct gggtagcata ggctatccta atctatatct gggtagcata 3600 tgctatccta atctatatct gggtagtata tgctatccta atctgtatcc gggtagcata 3660 tgctatccta atagagatta gggtagtata tgctatccta atttatatct gggtagcata 3720 tactacccaa atatctggat agcatatgct atcctaatct atatctgggt agcatatgct 3780 atcctaatct atatctgggt agcataggct atcctaatct atatctgggt agcatatgct 3840 atcctaatct atatctgggt agtatatgct atcctaattt atatctgggt agcataggct 3900 atcctaatct atatctgggt agcatatgct atcctaatct atatctgggt agtatatgct 3960 atcctaatct gtatccgggt agcatatgct atcctcatgc atatacagtc agcatatgat 4020 acccagtagt agagtgggag tgctatcctt tgcatatgcc gccacctccc aagggggcgt 4080 gaattttcgc tgcttgtcct tttcctgctg gttgctccca ttcttaggtg aatttaagga 4140 ggccaggcta aagccgtcgc atgtctgatt gctcaccagg taaatgtcgc taatgttttc 4200 caacgcgaga aggtgttgag cgcggagctg agtgacgtga caacatgggt atgccgaatt 4260 gccccatgtt gggaggacga aaatggtgac aagacagatg gccagaaata caccaacagc 4320 acgcatgatg tctactgggg atttattctt tagtgcgggg gaatacacgg cttttaatac 4380 gattgagggc gtctcctaac aagttacatc actcctgccc ttcctcaccc tcatctccat 4440 cacctccttc atctccgtca tctccgtcat caccctccgc ggcagcccct tccaccatag 4500 gtggaaacca gggaggcaaa tctactccat cgtcaaagct gcacacagtc accctgatat 4560 tgcaggtagg agcgggcttt gtcataacaa ggtccttaat cgcatccttc aaaacctcag 4620 caaatatatg agtttgtaaa aagaccatga aataacagac aatggactcc cttagcgggc 4680 caggttgtgg gccgggtcca ggggccattc caaaggggag acgactcaat ggtgtaagac 4740 gacattgtgg aatagcaagg gcagttcctc gccttaggtt gtaaagggag gtcttactac 4800 ctccatatac gaacacaccg gcgacccaag ttccttcgtc ggtagtcctt tctacgtgac 4860 tcctagccag gagagctctt aaaccttctg caatgttctc aaatttcggg ttggaacctc 4920 cttgaccacg atgctttcca aaccaccctc cttttttgcg cctgcctcca tcaccctgac 4980 cccggggtcc agtgcttggg ccttctcctg ggtcatctgc ggggccctgc tctatcgctc 5040 ccgggggcac gtcaggctca ccatctgggc caccttcttg gtggtattca aaataatcgg 5100 cttcccctac agggtggaaa aatggccttc tacctggagg gggcctgcgc ggtggagacc 5160 cggatgatga tgactgacta ctgggactcc tgggcctctt ttctccacgt ccacgacctc 5220 tccccctggc tctttcacga cttccccccc tggctctttc acgtcctcta ccccggcggc 5280 ctccactacc tcctcgaccc cggcctccac tacctcctcg accccggcct ccactgcctc 5340 ctcgaccccg gcctccacct cctgctcctg cccctcctgc tcctgcccct cctcctgctc 5400 ctgcccctcc tgcccctcct gctcctgccc ctcctgcccc tcctgctcct gcccctcctg 5460 cccctcctgc tcctgcccct cctgcccctc ctcctgctcc tgcccctcct gcccctcctc 5520 ctgctcctgc ccctcctgcc cctcctgctc ctgcccctcc tgcccctcct gctcctgccc 5580 ctcctgcccc tcctgctcct gcccctcctg ctcctgcccc tcctgctcct gcccctcctg 5640 ctcctgcccc tcctgcccct cctgcccctc ctcctgctcc tgcccctcct gctcctgccc 5700 ctcctgcccc tcctgcccct cctgctcctg cccctcctcc tgctcctgcc cctcctgccc 5760 ctcctgcccc tcctcctgct cctgcccctc ctgcccctcc tcctgctcct gcccctcctc 5820 ctgctcctgc ccctcctgcc cctcctgccc ctcctcctgc tcctgcccct cctgcccctc 5880 ctcctgctcc tgcccctcct cctgctcctg cccctcctgc ccctcctgcc cctcctcctg 5940 ctcctgcccc tcctcctgct cctgcccctc ctgcccctcc tgcccctcct gcccctcctc 6000 ctgctcctgc ccctcctcct gctcctgccc ctcctgctcc tgcccctccc gctcctgctc 6060 ctgctcctgt tccaccgtgg gtccctttgc agccaatgca acttggacgt ttttggggtc 6120 tccggacacc atctctatgt cttggccctg atcctgagcc gcccggggct cctggtcttc 6180 cgcctcctcg tcctcgtcct cttccccgtc ctcgtccatg gttatcaccc cctcttcttt 6240 gaggtccact gccgccggag ccttctggtc cagatgtgtc tcccttctct cctaggccat 6300 ttccaggtcc tgtacctggc ccctcgtcag acatgattca cactaaaaga gatcaataga 6360 catctttatt agacgacgct cagtgaatac agggagtgca gactcctgcc ccctccaaca 6420 gcccccccac cctcatcccc ttcatggtcg ctgtcagaca gatccaggtc tgaaaattcc 6480 ccatcctccg aaccatcctc gtcctcatca ccaattactc gcagcccgga aaactcccgc 6540 tgaacatcct caagatttgc gtcctgagcc tcaagccagg cctcaaattc ctcgtccccc 6600 tttttgctgg acggtaggga tggggattct cgggacccct cctcttcctc ttcaaggtca 6660 ccagacagag atgctactgg ggcaacggaa gaaaagctgg gtgcggcctg tgaggatcag 6720 cttatcgatg ataagctgtc aaacatgaga attcttgaag acgaaagggc ctcgtgatac 6780 gcctattttt ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt 6840 ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt 6900 atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta 6960 tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg 7020 tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac 7080 gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg 7140 aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc 7200 gtgttgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg 7260 ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat 7320 gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg 7380 gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg 7440 atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc 7500 ctgcagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt 7560 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct 7620 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc 7680 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca 7740 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct 7800 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt 7860 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga 7920 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca 7980 aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac 8040 caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg 8100 taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag 8160 gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac 8220 cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt 8280 taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg 8340 agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc 8400 ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc 8460 gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc 8520 acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa 8580 acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggccttga agctgtccct 8640 gatggtcgtc atctacctgc ctggacagca tggcctgcaa cgcgggcatc ccgatgccgc 8700 cggaagcgag aagaatcata atggggaagg ccatccagcc tcgcgtcgcg aacgccagca 8760 agacgtagcc cagcgcgtcg gccccgagat gcgccgcgtg cggctgctgg agatggcgga 8820 cgcgatggat atgttctgcc aagggttggt ttgcgcattc acagttctcc gcaagaattg 8880 attggctcca attcttggag tggtgaatcc gttagcgagg tgccgccctg cttcatcccc 8940 gtggcccgtt gctcgcgttt gctggcggtg tccccggaag aaatatattt gcatgtcttt 9000 agttctatga tgacacaaac cccgcccagc gtcttgtcat tggcgaattc gaacacgcag 9060 atgcagtcgg ggcggcgcgg tccgaggtcc acttcgcata ttaaggtgac gcgtgtggcc 9120 tcgaacaccg agcgaccctg cagcgacccg cttaacagcg tcaacagcgt gccgcagatc 9180 ccggggggca atgagatatg aaaaagcctg aactcaccgc gacgtctgtc gagaagtttc 9240 tgatcgaaaa gttcgacagc gtctccgacc tgatgcagct ctcggagggc gaagaatctc 9300 gtgctttcag cttcgatgta ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg 9360 atggtttcta caaagatcgt tatgtttatc ggcactttgc atcggccgcg ctcccgattc 9420 cggaagtgct tgacattggg gaattcagcg agagcctgac ctattgcatc tcccgccgtg 9480 cacagggtgt cacgttgcaa gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg 9540 tcgcggaggc catggatgcg atcgctgcgg ccgatcttag ccagacgagc gggttcggcc 9600 cattcggacc gcaaggaatc ggtcaataca ctacatggcg tgatttcata tgcgcgattg 9660 ctgatcccca tgtgtatcac tggcaaactg tgatggacga caccgtcagt gcgtccgtcg 9720 cgcaggctct cgatgagctg atgctttggg ccgaggactg ccccgaagtc cggcacctcg 9780 tgcacgcgga tttcggctcc aacaatgtcc tgacggacaa tggccgcata acagcggtca 9840 ttgactggag cgaggcgatg ttcggggatt cccaatacga ggtcgccaac atcttcttct 9900 ggaggccgtg gttggcttgt atggagcagc agacgcgcta cttcgagcgg aggcatccgg 9960 agcttgcagg atcgccgcgg ctccgggcgt atatgctccg cattggtctt gaccaactct 10020 atcagagctt ggttgacggc aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg 10080 caatcgtccg atccggagcc gggactgtcg ggcgtacaca aatcgcccgc agaagcgcgg 10140 ccgtctggac cgatggctgt gtagaagtac tcgccgatag tggaaaccga cgccccagca 10200 ctcgtccgga tcgggagatg ggggaggcta actgaaacac ggaaggagac aataccggaa 10260 ggaacccgcg ctatgacggc aataaaaaga cagaataaaa cgcacgggtg ttgggtcgtt 10320 tgttcataaa cgcggggttc ggtcccaggg ctggcactct gtcgataccc caccgagacc 10380 ccattggggc caatacgccc gcgtttcttc cttttcccca ccccaccccc caagttcggg 10440 tgaaggccca gggctcgcag ccaacgtcgg ggcggcaggc cctgccatag ccactggccc 10500 cgtgggttag ggacggggtc ccccatgggg aatggtttat ggttcgtggg ggttattatt 10560 ttgggcgttg cgtggggtca ggtccacgac tggactgagc agacagaccc atggtttttg 10620 gatggcctgg gcatggaccg catgtactgg cgcgacacga acaccgggcg tctgtggctg 10680 ccaaacaccc ccgaccccca aaaaccaccg cgcggatttc tggcgtgcca agctagtcga 10740 ccaattctca tgtttgacag cttatcatcg cagatccggg caacgttgtt gccattgctg 10800 caggcgcaga actggtaggt atggaagatc catacattga atcaatattg gcaattagcc 10860 atattagtca ttggttatat agcataaatc aatattggct attggccatt gcatacgttg 10920 tatctatatc ataatatgta catttatatt ggctcatgtc caatatgacc gccat 10975 <210> 159 <211> 10927 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 159 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta tcatatgcca agtccgcccc ctattgacgt caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttac gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacaccaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa taaccccgcc ccgttgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 600 gtcagatcac tagaagctgg gtaccagctg ctagcgttta aacttaagct tagcgcagag 660 gcttggggca gccgagcggc agccaggccc cggcccgggc ctcggttcca gaagggagag 720 gagcccgcca aggcgcgcaa gagagcgggc tgcctcgcag tccgagccgg agagggagcg 780 cgagccgcgc cggccccgga cggcctccga aaccatggag ctgtgggggg cctacctgct 840 gctgtgcctg ttctccctgc tgacccaggt gaccaccgtt gtgaacacaa agatgtttga 900 ggagctcaag agccgtctgg acaccctggc ccaggaggtg gccctgctga aggagcagca 960 ggccctccag acggtctgcc tgaaggggac caaggtgcac atgaaatgct ttctggcctt 1020 cacccagacg aagaccttcc acgaggccag cgaggactgc atctcgcgcg ggggcaccct 1080 gagcacccct cagactggct cggagaacga cgccctgtat gagtacctgc gccagagcgt 1140 gggcaacgag gccgagatct ggctgggcct caacgacatg gcggccgagg gcacctgggt 1200 ggacatgacc ggtacccgca tcgcctacaa gaactgggag actgagatca ccgcgcaacc 1260 cgatggcggc aagaccgaga actgcgcggt cctgtcaggc gcggccaacg gcaagtggtt 1320 cgacaagcgc tgcagggatc aattgcccta catctgccag ttcgggatcg tgcaccacca 1380 ccaccaccac taactcgagg ccggcaaggc cggatccaga catgataaga tacattgatg 1440 agtttggaca aaccacaact agaatgcagt gaaaaaaatg ctttatttgt gaaatttgtg 1500 atgctattgc tttatttgta accattataa gctgcaataa acaagttaac aacaagaatt 1560 gcattcattt tatgtttcag gttcaggggg aggtgtggga ggttttttaa agcaagtaaa 1620 acctctacaa atgtggtatg gctgattatg atccggctgc ctcgcgcgtt tcggtgatga 1680 cggtgaaaac ctctgacaca tgcagctccc ggagacggtc acagcttgtc tgtaagcgga 1740 tgccgggagc agacaagccc gtcaggcgtc agcgggtgtt ggcgggtgtc ggggcgcagc 1800 catgaggtcg actctagagg atcgatgccc cgccccggac gaactaaacc tgactacgac 1860 atctctgccc cttcttcgcg gggcagtgca tgtaatccct tcagttggtt ggtacaactt 1920 gccaactggg ccctgttcca catgtgacac ggggggggac caaacacaaa ggggttctct 1980 gactgtagtt gacatcctta taaatggatg tgcacatttg ccaacactga gtggctttca 2040 tcctggagca gactttgcag tctgtggact gcaacacaac attgccttta tgtgtaactc 2100 ttggctgaag ctcttacacc aatgctgggg gacatgtacc tcccaggggc ccaggaagac 2160 tacgggaggc tacaccaacg tcaatcagag gggcctgtgt agctaccgat aagcggaccc 2220 tcaagagggc attagcaata gtgtttataa ggcccccttg ttaaccctaa acgggtagca 2280 tatgcttccc gggtagtagt atatactatc cagactaacc ctaattcaat agcatatgtt 2340 acccaacggg aagcatatgc tatcgaatta gggttagtaa aagggtccta aggaacagcg 2400 atatctccca ccccatgagc tgtcacggtt ttatttacat ggggtcagga ttccacgagg 2460 gtagtgaacc attttagtca caagggcagt ggctgaagat caaggagcgg gcagtgaact 2520 ctcctgaatc ttcgcctgct tcttcattct ccttcgttta gctaatagaa taactgctga 2580 gttgtgaaca gtaaggtgta tgtgaggtgc tcgaaaacaa ggtttcaggt gacgccccca 2640 gaataaaatt tggacggggg gttcagtggt ggcattgtgc tatgacacca atataaccct 2700 cacaaacccc ttgggcaata aatactagtg taggaatgaa acattctgaa tatctttaac 2760 aatagaaatc catggggtgg ggacaagccg taaagactgg atgtccatct cacacgaatt 2820 tatggctatg ggcaacacat aatcctagtg caatatgata ctggggttat taagatgtgt 2880 cccaggcagg gaccaagaca ggtgaaccat gttgttacac tctatttgta acaaggggaa 2940 agagagtgga cgccgacagc agcggactcc actggttgtc tctaacaccc ccgaaaatta 3000 aacggggctc cacgccaatg gggcccataa acaaagacaa gtggccactc ttttttttga 3060 aattgtggag tgggggcacg cgtcagcccc cacacgccgc cctgcggttt tggactgtaa 3120 aataagggtg taataacttg gctgattgta accccgctaa ccactgcggt caaaccactt 3180 gcccacaaaa ccactaatgg caccccgggg aatacctgca taagtaggtg ggcgggccaa 3240 gataggggcg cgattgctgc gatctggagg acaaattaca cacacttgcg cctgagcgcc 3300 aagcacaggg ttgttggtcc tcatattcac gaggtcgctg agagcacggt gggctaatgt 3360 tgccatgggt agcatatact acccaaatat ctggatagca tatgctatcc taatctatat 3420 ctgggtagca taggctatcc taatctatat ctgggtagca tatgctatcc taatctatat 3480 ctgggtagta tatgctatcc taatttatat ctgggtagca taggctatcc taatctatat 3540 ctgggtagca tatgctatcc taatctatat ctgggtagta tatgctatcc taatctgtat 3600 ccgggtagca tatgctatcc taatagagat tagggtagta tatgctatcc taatttatat 3660 ctgggtagca tatactaccc aaatatctgg atagcatatg ctatcctaat ctatatctgg 3720 gtagcatatg ctatcctaat ctatatctgg gtagcatagg ctatcctaat ctatatctgg 3780 gtagcatatg ctatcctaat ctatatctgg gtagtatatg ctatcctaat ttatatctgg 3840 gtagcatagg ctatcctaat ctatatctgg gtagcatatg ctatcctaat ctatatctgg 3900 gtagtatatg ctatcctaat ctgtatccgg gtagcatatg ctatcctcat gcatatacag 3960 tcagcatatg atacccagta gtagagtggg agtgctatcc tttgcatatg ccgccacctc 4020 ccaagggggc gtgaattttc gctgcttgtc cttttcctgc tggttgctcc cattcttagg 4080 tgaatttaag gaggccaggc taaagccgtc gcatgtctga ttgctcacca ggtaaatgtc 4140 gctaatgttt tccaacgcga gaaggtgttg agcgcggagc tgagtgacgt gacaacatgg 4200 gtatgccgaa ttgccccatg ttgggaggac gaaaatggtg acaagacaga tggccagaaa 4260 tacaccaaca gcacgcatga tgtctactgg ggatttattc tttagtgcgg gggaatacac 4320 ggcttttaat acgattgagg gcgtctccta acaagttaca tcactcctgc ccttcctcac 4380 cctcatctcc atcacctcct tcatctccgt catctccgtc atcaccctcc gcggcagccc 4440 cttccaccat aggtggaaac cagggaggca aatctactcc atcgtcaaag ctgcacacag 4500 tcaccctgat attgcaggta ggagcgggct ttgtcataac aaggtcctta atcgcatcct 4560 tcaaaacctc agcaaatata tgagtttgta aaaagaccat gaaataacag acaatggact 4620 cccttagcgg gccaggttgt gggccgggtc caggggccat tccaaagggg agacgactca 4680 atggtgtaag acgacattgt ggaatagcaa gggcagttcc tcgccttagg ttgtaaaggg 4740 aggtcttact acctccatat acgaacacac cggcgaccca agttccttcg tcggtagtcc 4800 tttctacgtg actcctagcc aggagagctc ttaaaccttc tgcaatgttc tcaaatttcg 4860 ggttggaacc tccttgacca cgatgctttc caaaccaccc tccttttttg cgcctgcctc 4920 catcaccctg accccggggt ccagtgcttg ggccttctcc tgggtcatct gcggggccct 4980 gctctatcgc tcccgggggc acgtcaggct caccatctgg gccaccttct tggtggtatt 5040 caaaataatc ggcttcccct acagggtgga aaaatggcct tctacctgga gggggcctgc 5100 gcggtggaga cccggatgat gatgactgac tactgggact cctgggcctc ttttctccac 5160 gtccacgacc tctccccctg gctctttcac gacttccccc cctggctctt tcacgtcctc 5220 taccccggcg gcctccacta cctcctcgac cccggcctcc actacctcct cgaccccggc 5280 ctccactgcc tcctcgaccc cggcctccac ctcctgctcc tgcccctcct gctcctgccc 5340 ctcctcctgc tcctgcccct cctgcccctc ctgctcctgc ccctcctgcc cctcctgctc 5400 ctgcccctcc tgcccctcct gctcctgccc ctcctgcccc tcctcctgct cctgcccctc 5460 ctgcccctcc tcctgctcct gcccctcctg cccctcctgc tcctgcccct cctgcccctc 5520 ctgctcctgc ccctcctgcc cctcctgctc ctgcccctcc tgctcctgcc cctcctgctc 5580 ctgcccctcc tgctcctgcc cctcctgccc ctcctgcccc tcctcctgct cctgcccctc 5640 ctgctcctgc ccctcctgcc cctcctgccc ctcctgctcc tgcccctcct cctgctcctg 5700 cccctcctgc ccctcctgcc cctcctcctg ctcctgcccc tcctgcccct cctcctgctc 5760 ctgcccctcc tcctgctcct gcccctcctg cccctcctgc ccctcctcct gctcctgccc 5820 ctcctgcccc tcctcctgct cctgcccctc ctcctgctcc tgcccctcct gcccctcctg 5880 cccctcctcc tgctcctgcc cctcctcctg ctcctgcccc tcctgcccct cctgcccctc 5940 ctgcccctcc tcctgctcct gcccctcctc ctgctcctgc ccctcctgct cctgcccctc 6000 ccgctcctgc tcctgctcct gttccaccgt gggtcccttt gcagccaatg caacttggac 6060 gtttttgggg tctccggaca ccatctctat gtcttggccc tgatcctgag ccgcccgggg 6120 ctcctggtct tccgcctcct cgtcctcgtc ctcttccccg tcctcgtcca tggttatcac 6180 cccctcttct ttgaggtcca ctgccgccgg agccttctgg tccagatgtg tctcccttct 6240 ctcctaggcc atttccaggt cctgtacctg gcccctcgtc agacatgatt cacactaaaa 6300 gagatcaata gacatcttta ttagacgacg ctcagtgaat acagggagtg cagactcctg 6360 ccccctccaa cagccccccc accctcatcc ccttcatggt cgctgtcaga cagatccagg 6420 tctgaaaatt ccccatcctc cgaaccatcc tcgtcctcat caccaattac tcgcagcccg 6480 gaaaactccc gctgaacatc ctcaagattt gcgtcctgag cctcaagcca ggcctcaaat 6540 tcctcgtccc cctttttgct ggacggtagg gatggggatt ctcgggaccc ctcctcttcc 6600 tcttcaaggt caccagacag agatgctact ggggcaacgg aagaaaagct gggtgcggcc 6660 tgtgaggatc agcttatcga tgataagctg tcaaacatga gaattcttga agacgaaagg 6720 gcctcgtgat acgcctattt ttataggtta atgtcatgat aataatggtt tcttagacgt 6780 caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac 6840 attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa 6900 aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat 6960 tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc 7020 agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga 7080 gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg 7140 cggtattatc ccgtgttgac gccgggcaag agcaactcgg tcgccgcata cactattctc 7200 agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag 7260 taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc 7320 tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg 7380 taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg 7440 acaccacgat gcctgcagca atggcaacaa cgttgcgcaa actattaact ggcgaactac 7500 ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac 7560 cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg 7620 agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg 7680 tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg 7740 agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac 7800 tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg 7860 ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg 7920 tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc 7980 aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc 8040 tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt 8100 agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc 8160 taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact 8220 caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac 8280 agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagctatgag 8340 aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg 8400 gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg 8460 tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga 8520 gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt 8580 gaagctgtcc ctgatggtcg tcatctacct gcctggacag catggcctgc aacgcgggca 8640 tcccgatgcc gccggaagcg agaagaatca taatggggaa ggccatccag cctcgcgtcg 8700 cgaacgccag caagacgtag cccagcgcgt cggccccgag atgcgccgcg tgcggctgct 8760 ggagatggcg gacgcgatgg atatgttctg ccaagggttg gtttgcgcat tcacagttct 8820 ccgcaagaat tgattggctc caattcttgg agtggtgaat ccgttagcga ggtgccgccc 8880 tgcttcatcc ccgtggcccg ttgctcgcgt ttgctggcgg tgtccccgga agaaatatat 8940 ttgcatgtct ttagttctat gatgacacaa accccgccca gcgtcttgtc attggcgaat 9000 tcgaacacgc agatgcagtc ggggcggcgc ggtccgaggt ccacttcgca tattaaggtg 9060 acgcgtgtgg cctcgaacac cgagcgaccc tgcagcgacc cgcttaacag cgtcaacagc 9120 gtgccgcaga tcccgggggg caatgagata tgaaaaagcc tgaactcacc gcgacgtctg 9180 tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag ctctcggagg 9240 gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc ctgcgggtaa 9300 atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt gcatcggccg 9360 cgctcccgat tccggaagtg cttgacattg gggaattcag cgagagcctg acctattgca 9420 tctcccgccg tgcacagggt gtcacgttgc aagacctgcc tgaaaccgaa ctgcccgctg 9480 ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc ggccgatctt agccagacga 9540 gcgggttcgg cccattcgga ccgcaaggaa tcggtcaata cactacatgg cgtgatttca 9600 tatgcgcgat tgctgatccc catgtgtatc actggcaaac tgtgatggac gacaccgtca 9660 gtgcgtccgt cgcgcaggct ctcgatgagc tgatgctttg ggccgaggac tgccccgaag 9720 tccggcacct cgtgcacgcg gatttcggct ccaacaatgt cctgacggac aatggccgca 9780 taacagcggt cattgactgg agcgaggcga tgttcgggga ttcccaatac gaggtcgcca 9840 acatcttctt ctggaggccg tggttggctt gtatggagca gcagacgcgc tacttcgagc 9900 ggaggcatcc ggagcttgca ggatcgccgc ggctccgggc gtatatgctc cgcattggtc 9960 ttgaccaact ctatcagagc ttggttgacg gcaatttcga tgatgcagct tgggcgcagg 10020 gtcgatgcga cgcaatcgtc cgatccggag ccgggactgt cgggcgtaca caaatcgccc 10080 gcagaagcgc ggccgtctgg accgatggct gtgtagaagt actcgccgat agtggaaacc 10140 gacgccccag cactcgtccg gatcgggaga tgggggaggc taactgaaac acggaaggag 10200 acaataccgg aaggaacccg cgctatgacg gcaataaaaa gacagaataa aacgcacggg 10260 tgttgggtcg tttgttcata aacgcggggt tcggtcccag ggctggcact ctgtcgatac 10320 cccaccgaga ccccattggg gccaatacgc ccgcgtttct tccttttccc caccccaccc 10380 cccaagttcg ggtgaaggcc cagggctcgc agccaacgtc ggggcggcag gccctgccat 10440 agccactggc cccgtgggtt agggacgggg tcccccatgg ggaatggttt atggttcgtg 10500 ggggttatta ttttgggcgt tgcgtggggt caggtccacg actggactga gcagacagac 10560 ccatggtttt tggatggcct gggcatggac cgcatgtact ggcgcgacac gaacaccggg 10620 cgtctgtggc tgccaaacac ccccgacccc caaaaaccac cgcgcggatt tctggcgtgc 10680 caagctagtc gaccaattct catgtttgac agcttatcat cgcagatccg ggcaacgttg 10740 ttgccattgc tgcaggcgca gaactggtag gtatggaaga tccatacatt gaatcaatat 10800 tggcaattag ccatattagt cattggttat atagcataaa tcaatattgg ctattggcca 10860 ttgcatacgt tgtatctata tcataatatg tacatttata ttggctcatg tccaatatga 10920 ccgccat 10927 <210> 160 <211> 4641 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 160 aagaaaccaa ttgtccatat tgcatcagac attgccgtca ctgcgtcttt tactggctct 60 tctcgctaac caaaccggta accccgctta ttaaaagcat tctgtaacaa agcgggacca 120 aagccatgac aaaaacgcgt aacaaaagtg tctataatca cggcagaaaa gtccacattg 180 attatttgca cggcgtcaca ctttgctatg ccatagcatt tttatccata agattagcgg 240 atcctacctg acgcttttta tcgcaactct ctactgtttc tccatacccg ttttttgggc 300 taacaggagg aattcaccat gaaaaagaca gctatcgcga ttgcagtggc actggctggt 360 ttcgctaccg ttgcgcaagc ttctgagcca ccaacccaga agcccaagaa gattgtaaat 420 gccaagaaag atgttgtgaa cacaaagatg tttgaggagc tcaagagccg tctggacacc 480 ctggcccagg aggtggccct gctgaaggag cagcaggccc tccagacggt ctgcctgaag 540 gggaccaagg tgcacatgaa atgctttctg gccttcaccc agacgaagac cttccacgag 600 gccagcgagg actgcatctc gcgcgggggc accctgagca cccctcagac tggctcggag 660 aacgacgccc tgtatgagta cctgcgccag agcgtgggca acgaggccga gatctggctg 720 ggcctcaacg acatggcggc cgagggcacc tgggtggaca tgaccggtac ccgcatcgcc 780 tacaagaact gggagactga gatcaccgcg caacccgatg gcggcaagac cgagaactgc 840 gcggtcctgt caggcgcggc caacggcaag tggttcgaca agcgctgcag ggatcaattg 900 ccctacatct gccagttcgg gatcgtgtac ccctacgacg tgcccgacta cgccggttgg 960 agccacccgc agttcgaaaa ataactcgag ataaacggtc tccagcttgg ctgttttggc 1020 ggatgagaga agattttcag cctgatacag attaaatcag aacgcagaag cggtctgata 1080 aaacagaatt tgcctggcgg cagtagcgcg gtggtcccac ctgaccccat gccgaactca 1140 gaagtgaaac gccgtagcgc cgatggtagt gtggggtctc cccatgcgag agtagggaac 1200 tgccaggcat caaataaaac gaaaggctca gtcgaaagac tgggcctttc gttttatctg 1260 ttgtttgtcg gtgaacgctc tcctgagtag gacaaatccg ccgggagcgg atttgaacgt 1320 tgcgaagcaa cggcccggag ggtggcgggc aggacgcccg ccataaactg ccaggcatca 1380 aattaagcag aaggccatcc tgacggatgg cctttttgcg tttctacaaa ctctttttgt 1440 ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg 1500 cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt 1560 cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta 1620 aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc 1680 ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag cacttttaaa 1740 gttctgctat gtggcgcggt attatcccgt gttgacgccg ggcaagagca actcggtcgc 1800 cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt 1860 acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact 1920 gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc ttttttgcac 1980 aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata 2040 ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta 2100 ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg 2160 gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat 2220 aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt 2280 aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat ggatgaacga 2340 aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa 2400 gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag 2460 gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt ttcgttccac 2520 tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc 2580 gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat 2640 caagagctac caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat 2700 actgtccttc tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct 2760 acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt 2820 cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg 2880 gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta 2940 cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg 3000 gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg 3060 tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc 3120 tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg 3180 gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat 3240 aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac gaccgagcgc 3300 agcgagtcag tgagcgagga agcggaagag cgcctgatgc ggtattttct ccttacgcat 3360 ctgtgcggta tttcacaccg catatggtgc actctcagta caatctgctc tgatgccgca 3420 tagttaagcc agtatacact ccgctatcgc tacgtgactg ggtcatggct gcgccccgac 3480 acccgccaac acccgctgac gcgccctgac gggcttgtct gctcccggca tccgcttaca 3540 gacaagctgt gaccgtctcc gggagctgca tgtgtcagag gttttcaccg tcatcaccga 3600 aacgcgcgag gcagcagatc aattcgcgcg cgaaggcgaa gcggcatgca taatgtgcct 3660 gtcaaatgga cgaagcaggg attctgcaaa ccctatgcta ctccgtcaag ccgtcaattg 3720 tctgattcgt taccaattat gacaacttga cggctacatc attcactttt tcttcacaac 3780 cggcacggaa ctcgctcggg ctggccccgg tgcatttttt aaatacccgc gagaaataga 3840 gttgatcgtc aaaaccaaca ttgcgaccga cggtggcgat aggcatccgg gtggtgctca 3900 aaagcagctt cgcctggctg atacgttggt cctcgcgcca gcttaagacg ctaatcccta 3960 actgctggcg gaaaagatgt gacagacgcg acggcgacaa gcaaacatgc tgtgcgacgc 4020 tggcgatatc aaaattgctg tctgccaggt gatcgctgat gtactgacaa gcctcgcgta 4080 cccgattatc catcggtgga tggagcgact cgttaatcgc ttccatgcgc cgcagtaaca 4140 attgctcaag cagatttatc gccagcagct ccgaatagcg cccttcccct tgcccggcgt 4200 taatgatttg cccaaacagg tcgctgaaat gcggctggtg cgcttcatcc gggcgaaaga 4260 accccgtatt ggcaaatatt gacggccagt taagccattc atgccagtag gcgcgcggac 4320 gaaagtaaac ccactggtga taccattcgc gagcctccgg atgacgaccg tagtgatgaa 4380 tctctcctgg cgggaacagc aaaatatcac ccggtcggca aacaaattct cgtccctgat 4440 ttttcaccac cccctgaccg cgaatggtga gattgagaat ataacctttc attcccagcg 4500 gtcggtcgat aaaaaaatcg agataaccgt tggcctcaat cggcgttaaa cccgccacca 4560 gatgggcatt aaacgagtat cccggcagca ggggatcatt ttgcgcttca gccatacttt 4620 tcatactccc gccattcaga g 4641 <210> 161 <211> 11011 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 161 gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120 ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180 ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 240 atcaagtgta tcatatgcca agtccgcccc ctattgacgt caatgacggt aaatggcccg 300 cctggcatta tgcccagtac atgaccttac gggactttcc tacttggcag tacatctacg 360 tattagtcat cgctattacc atggtgatgc ggttttggca gtacaccaat gggcgtggat 420 agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 480 tttggcacca aaatcaacgg gactttccaa aatgtcgtaa taaccccgcc ccgttgacgc 540 aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 600 gtcagatcac tagaagctgg gtaccagctg ctagcgttta aacttaagct tagcgcagag 660 gcttggggca gccgagcggc agccaggccc cggcccgggc ctcggttcca gaagggagag 720 gagcccgcca aggcgcgcaa gagagcgggc tgcctcgcag tccgagccgg agagggagcg 780 cgagccgcgc cggccccgga cggcctccga aaccatggag ctgtgggggg cctacctgct 840 gctgtgcctg ttctccctgc tgacccaggt gaccaccgag ccaccaaccc agaagcccaa 900 gaagattgta aatgccaaga aagatgttgt gaacacaaag atgtttgagg agctcaagag 960 ccgtctggac accctggccc aggaggtggc cctgctgaag gagcagcagg ccctccagac 1020 ggtctgcctg aaggggacca aggtgcacat gaaatgcttt ctggccttca cccagacgaa 1080 gaccttccac gaggccagcg aggactgcat ctcgcgcggg ggcaccctga gcacccctca 1140 gactggctcg gagaacgacg ccctgtatga gtacctgcgc cagagcgtgg gcaacgaggc 1200 cgagatctgg ctgggcctca acgacatggc ggccgagggc acctgggtgg acatgaccgg 1260 tacccgcatc gcctacaaga actgggagac tgagatcacc gcgcaacccg atggcggcaa 1320 gaccgagaac tgcgcggtcc tgtcaggcgc ggccaacggc aagtggttcg acaagcgctg 1380 cagggatcaa ttgccctaca tctgccagtt cgggatcgtg tacccctacg acgtgcccga 1440 ctacgccggt tggagccacc cccagttcga gaagtgactc gaggccggca aggccggatc 1500 cagacatgat aagatacatt gatgagtttg gacaaaccac aactagaatg cagtgaaaaa 1560 aatgctttat ttgtgaaatt tgtgatgcta ttgctttatt tgtaaccatt ataagctgca 1620 ataaacaagt taacaacaag aattgcattc attttatgtt tcaggttcag ggggaggtgt 1680 gggaggtttt ttaaagcaag taaaacctct acaaatgtgg tatggctgat tatgatccgg 1740 ctgcctcgcg cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac 1800 ggtcacagct tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg cgtcagcggg 1860 tgttggcggg tgtcggggcg cagccatgag gtcgactcta gaggatcgat gccccgcccc 1920 ggacgaacta aacctgacta cgacatctct gccccttctt cgcggggcag tgcatgtaat 1980 cccttcagtt ggttggtaca acttgccaac tgggccctgt tccacatgtg acacgggggg 2040 ggaccaaaca caaaggggtt ctctgactgt agttgacatc cttataaatg gatgtgcaca 2100 tttgccaaca ctgagtggct ttcatcctgg agcagacttt gcagtctgtg gactgcaaca 2160 caacattgcc tttatgtgta actcttggct gaagctctta caccaatgct gggggacatg 2220 tacctcccag gggcccagga agactacggg aggctacacc aacgtcaatc agaggggcct 2280 gtgtagctac cgataagcgg accctcaaga gggcattagc aatagtgttt ataaggcccc 2340 cttgttaacc ctaaacgggt agcatatgct tcccgggtag tagtatatac tatccagact 2400 aaccctaatt caatagcata tgttacccaa cgggaagcat atgctatcga attagggtta 2460 gtaaaagggt cctaaggaac agcgatatct cccaccccat gagctgtcac ggttttattt 2520 acatggggtc aggattccac gagggtagtg aaccatttta gtcacaaggg cagtggctga 2580 agatcaagga gcgggcagtg aactctcctg aatcttcgcc tgcttcttca ttctccttcg 2640 tttagctaat agaataactg ctgagttgtg aacagtaagg tgtatgtgag gtgctcgaaa 2700 acaaggtttc aggtgacgcc cccagaataa aatttggacg gggggttcag tggtggcatt 2760 gtgctatgac accaatataa ccctcacaaa ccccttgggc aataaatact agtgtaggaa 2820 tgaaacattc tgaatatctt taacaataga aatccatggg gtggggacaa gccgtaaaga 2880 ctggatgtcc atctcacacg aatttatggc tatgggcaac acataatcct agtgcaatat 2940 gatactgggg ttattaagat gtgtcccagg cagggaccaa gacaggtgaa ccatgttgtt 3000 acactctatt tgtaacaagg ggaaagagag tggacgccga cagcagcgga ctccactggt 3060 tgtctctaac acccccgaaa attaaacggg gctccacgcc aatggggccc ataaacaaag 3120 acaagtggcc actctttttt ttgaaattgt ggagtggggg cacgcgtcag cccccacacg 3180 ccgccctgcg gttttggact gtaaaataag ggtgtaataa cttggctgat tgtaaccccg 3240 ctaaccactg cggtcaaacc acttgcccac aaaaccacta atggcacccc ggggaatacc 3300 tgcataagta ggtgggcggg ccaagatagg ggcgcgattg ctgcgatctg gaggacaaat 3360 tacacacact tgcgcctgag cgccaagcac agggttgttg gtcctcatat tcacgaggtc 3420 gctgagagca cggtgggcta atgttgccat gggtagcata tactacccaa atatctggat 3480 agcatatgct atcctaatct atatctgggt agcataggct atcctaatct atatctgggt 3540 agcatatgct atcctaatct atatctgggt agtatatgct atcctaattt atatctgggt 3600 agcataggct atcctaatct atatctgggt agcatatgct atcctaatct atatctgggt 3660 agtatatgct atcctaatct gtatccgggt agcatatgct atcctaatag agattagggt 3720 agtatatgct atcctaattt atatctgggt agcatatact acccaaatat ctggatagca 3780 tatgctatcc taatctatat ctgggtagca tatgctatcc taatctatat ctgggtagca 3840 taggctatcc taatctatat ctgggtagca tatgctatcc taatctatat ctgggtagta 3900 tatgctatcc taatttatat ctgggtagca taggctatcc taatctatat ctgggtagca 3960 tatgctatcc taatctatat ctgggtagta tatgctatcc taatctgtat ccgggtagca 4020 tatgctatcc tcatgcatat acagtcagca tatgataccc agtagtagag tgggagtgct 4080 atcctttgca tatgccgcca cctcccaagg gggcgtgaat tttcgctgct tgtccttttc 4140 ctgctggttg ctcccattct taggtgaatt taaggaggcc aggctaaagc cgtcgcatgt 4200 ctgattgctc accaggtaaa tgtcgctaat gttttccaac gcgagaaggt gttgagcgcg 4260 gagctgagtg acgtgacaac atgggtatgc cgaattgccc catgttggga ggacgaaaat 4320 ggtgacaaga cagatggcca gaaatacacc aacagcacgc atgatgtcta ctggggattt 4380 attctttagt gcgggggaat acacggcttt taatacgatt gagggcgtct cctaacaagt 4440 tacatcactc ctgcccttcc tcaccctcat ctccatcacc tccttcatct ccgtcatctc 4500 cgtcatcacc ctccgcggca gccccttcca ccataggtgg aaaccaggga ggcaaatcta 4560 ctccatcgtc aaagctgcac acagtcaccc tgatattgca ggtaggagcg ggctttgtca 4620 taacaaggtc cttaatcgca tccttcaaaa cctcagcaaa tatatgagtt tgtaaaaaga 4680 ccatgaaata acagacaatg gactccctta gcgggccagg ttgtgggccg ggtccagggg 4740 ccattccaaa ggggagacga ctcaatggtg taagacgaca ttgtggaata gcaagggcag 4800 ttcctcgcct taggttgtaa agggaggtct tactacctcc atatacgaac acaccggcga 4860 cccaagttcc ttcgtcggta gtcctttcta cgtgactcct agccaggaga gctcttaaac 4920 cttctgcaat gttctcaaat ttcgggttgg aacctccttg accacgatgc tttccaaacc 4980 accctccttt tttgcgcctg cctccatcac cctgaccccg gggtccagtg cttgggcctt 5040 ctcctgggtc atctgcgggg ccctgctcta tcgctcccgg gggcacgtca ggctcaccat 5100 ctgggccacc ttcttggtgg tattcaaaat aatcggcttc ccctacaggg tggaaaaatg 5160 gccttctacc tggagggggc ctgcgcggtg gagacccgga tgatgatgac tgactactgg 5220 gactcctggg cctcttttct ccacgtccac gacctctccc cctggctctt tcacgacttc 5280 cccccctggc tctttcacgt cctctacccc ggcggcctcc actacctcct cgaccccggc 5340 ctccactacc tcctcgaccc cggcctccac tgcctcctcg accccggcct ccacctcctg 5400 ctcctgcccc tcctgctcct gcccctcctc ctgctcctgc ccctcctgcc cctcctgctc 5460 ctgcccctcc tgcccctcct gctcctgccc ctcctgcccc tcctgctcct gcccctcctg 5520 cccctcctcc tgctcctgcc cctcctgccc ctcctcctgc tcctgcccct cctgcccctc 5580 ctgctcctgc ccctcctgcc cctcctgctc ctgcccctcc tgcccctcct gctcctgccc 5640 ctcctgctcc tgcccctcct gctcctgccc ctcctgctcc tgcccctcct gcccctcctg 5700 cccctcctcc tgctcctgcc cctcctgctc ctgcccctcc tgcccctcct gcccctcctg 5760 ctcctgcccc tcctcctgct cctgcccctc ctgcccctcc tgcccctcct cctgctcctg 5820 cccctcctgc ccctcctcct gctcctgccc ctcctcctgc tcctgcccct cctgcccctc 5880 ctgcccctcc tcctgctcct gcccctcctg cccctcctcc tgctcctgcc cctcctcctg 5940 ctcctgcccc tcctgcccct cctgcccctc ctcctgctcc tgcccctcct cctgctcctg 6000 cccctcctgc ccctcctgcc cctcctgccc ctcctcctgc tcctgcccct cctcctgctc 6060 ctgcccctcc tgctcctgcc cctcccgctc ctgctcctgc tcctgttcca ccgtgggtcc 6120 ctttgcagcc aatgcaactt ggacgttttt ggggtctccg gacaccatct ctatgtcttg 6180 gccctgatcc tgagccgccc ggggctcctg gtcttccgcc tcctcgtcct cgtcctcttc 6240 cccgtcctcg tccatggtta tcaccccctc ttctttgagg tccactgccg ccggagcctt 6300 ctggtccaga tgtgtctccc ttctctccta ggccatttcc aggtcctgta cctggcccct 6360 cgtcagacat gattcacact aaaagagatc aatagacatc tttattagac gacgctcagt 6420 gaatacaggg agtgcagact cctgccccct ccaacagccc ccccaccctc atccccttca 6480 tggtcgctgt cagacagatc caggtctgaa aattccccat cctccgaacc atcctcgtcc 6540 tcatcaccaa ttactcgcag cccggaaaac tcccgctgaa catcctcaag atttgcgtcc 6600 tgagcctcaa gccaggcctc aaattcctcg tccccctttt tgctggacgg tagggatggg 6660 gattctcggg acccctcctc ttcctcttca aggtcaccag acagagatgc tactggggca 6720 acggaagaaa agctgggtgc ggcctgtgag gatcagctta tcgatgataa gctgtcaaac 6780 atgagaattc ttgaagacga aagggcctcg tgatacgcct atttttatag gttaatgtca 6840 tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc 6900 ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct 6960 gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg 7020 cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg 7080 tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc 7140 tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca 7200 cttttaaagt tctgctatgt ggcgcggtat tatcccgtgt tgacgccggg caagagcaac 7260 tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa 7320 agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg 7380 ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt 7440 ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg 7500 aagccatacc aaacgacgag cgtgacacca cgatgcctgc agcaatggca acaacgttgc 7560 gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga 7620 tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta 7680 ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc 7740 cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg 7800 atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt 7860 cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa 7920 ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 7980 cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt 8040 ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt 8100 tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga 8160 taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag 8220 caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata 8280 agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg 8340 gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga 8400 gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca 8460 ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa 8520 acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt 8580 tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac 8640 ggttcctggc cttttgctgg ccttgaagct gtccctgatg gtcgtcatct acctgcctgg 8700 acagcatggc ctgcaacgcg ggcatcccga tgccgccgga agcgagaaga atcataatgg 8760 ggaaggccat ccagcctcgc gtcgcgaacg ccagcaagac gtagcccagc gcgtcggccc 8820 cgagatgcgc cgcgtgcggc tgctggagat ggcggacgcg atggatatgt tctgccaagg 8880 gttggtttgc gcattcacag ttctccgcaa gaattgattg gctccaattc ttggagtggt 8940 gaatccgtta gcgaggtgcc gccctgcttc atccccgtgg cccgttgctc gcgtttgctg 9000 gcggtgtccc cggaagaaat atatttgcat gtctttagtt ctatgatgac acaaaccccg 9060 cccagcgtct tgtcattggc gaattcgaac acgcagatgc agtcggggcg gcgcggtccg 9120 aggtccactt cgcatattaa ggtgacgcgt gtggcctcga acaccgagcg accctgcagc 9180 gacccgctta acagcgtcaa cagcgtgccg cagatcccgg ggggcaatga gatatgaaaa 9240 agcctgaact caccgcgacg tctgtcgaga agtttctgat cgaaaagttc gacagcgtct 9300 ccgacctgat gcagctctcg gagggcgaag aatctcgtgc tttcagcttc gatgtaggag 9360 ggcgtggata tgtcctgcgg gtaaatagct gcgccgatgg tttctacaaa gatcgttatg 9420 tttatcggca ctttgcatcg gccgcgctcc cgattccgga agtgcttgac attggggaat 9480 tcagcgagag cctgacctat tgcatctccc gccgtgcaca gggtgtcacg ttgcaagacc 9540 tgcctgaaac cgaactgccc gctgttctgc agccggtcgc ggaggccatg gatgcgatcg 9600 ctgcggccga tcttagccag acgagcgggt tcggcccatt cggaccgcaa ggaatcggtc 9660 aatacactac atggcgtgat ttcatatgcg cgattgctga tccccatgtg tatcactggc 9720 aaactgtgat ggacgacacc gtcagtgcgt ccgtcgcgca ggctctcgat gagctgatgc 9780 tttgggccga ggactgcccc gaagtccggc acctcgtgca cgcggatttc ggctccaaca 9840 atgtcctgac ggacaatggc cgcataacag cggtcattga ctggagcgag gcgatgttcg 9900 gggattccca atacgaggtc gccaacatct tcttctggag gccgtggttg gcttgtatgg 9960 agcagcagac gcgctacttc gagcggaggc atccggagct tgcaggatcg ccgcggctcc 10020 gggcgtatat gctccgcatt ggtcttgacc aactctatca gagcttggtt gacggcaatt 10080 tcgatgatgc agcttgggcg cagggtcgat gcgacgcaat cgtccgatcc ggagccggga 10140 ctgtcgggcg tacacaaatc gcccgcagaa gcgcggccgt ctggaccgat ggctgtgtag 10200 aagtactcgc cgatagtgga aaccgacgcc ccagcactcg tccggatcgg gagatggggg 10260 aggctaactg aaacacggaa ggagacaata ccggaaggaa cccgcgctat gacggcaata 10320 aaaagacaga ataaaacgca cgggtgttgg gtcgtttgtt cataaacgcg gggttcggtc 10380 ccagggctgg cactctgtcg ataccccacc gagaccccat tggggccaat acgcccgcgt 10440 ttcttccttt tccccacccc accccccaag ttcgggtgaa ggcccagggc tcgcagccaa 10500 cgtcggggcg gcaggccctg ccatagccac tggccccgtg ggttagggac ggggtccccc 10560 atggggaatg gtttatggtt cgtgggggtt attattttgg gcgttgcgtg gggtcaggtc 10620 cacgactgga ctgagcagac agacccatgg tttttggatg gcctgggcat ggaccgcatg 10680 tactggcgcg acacgaacac cgggcgtctg tggctgccaa acacccccga cccccaaaaa 10740 ccaccgcgcg gatttctggc gtgccaagct agtcgaccaa ttctcatgtt tgacagctta 10800 tcatcgcaga tccgggcaac gttgttgcca ttgctgcagg cgcagaactg gtaggtatgg 10860 aagatccata cattgaatca atattggcaa ttagccatat tagtcattgg ttatatagca 10920 taaatcaata ttggctattg gccattgcat acgttgtatc tatatcataa tatgtacatt 10980 tatattggct catgtccaat atgaccgcca t 11011 <210> 162 <211> 5783 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 162 tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 tccgctcatg aattaattct tagaaaaact catcgagcat caaatgaaac tgcaatttat 600 tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat gaaggagaaa 660 actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg attccgactc 720 gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta tcaagtgaga 780 aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc atttctttcc 840 agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca tcaaccaaac 900 cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac 960 aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat 1020 tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag 1080 tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca 1140 taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac 1200 ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg 1260 tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa tcagcatcca 1320 tgttggaatt taatcgcggc ctagagcaag acgtttcccg ttgaatatgg ctcataacac 1380 cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa 1440 cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 1500 gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 1560 gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 1620 agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 1680 aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 1740 agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 1800 cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 1860 accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 1920 aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 1980 ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 2040 cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 2100 gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 2160 tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 2220 agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg 2280 tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta 2340 caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 2400 ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 2460 gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 2520 gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 2580 gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 2640 aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 2700 ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 2760 acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 2820 ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 2880 tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 2940 tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 3000 cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 3060 gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 3120 ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggggccgc 3180 catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa 3240 ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc 3300 gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac 3360 gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca 3420 ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgagatc ccggtgccta 3480 atgagtgagc taacttacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa 3540 cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 3600 tgggcgccag ggtggttttt cttttcacca gtgagacggg caacagctga ttgcccttca 3660 ccgcctggcc ctgagagagt tgcagcaagc ggtccacgct ggtttgcccc agcaggcgaa 3720 aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg gtatcgtcgt 3780 atcccactac cgagatatcc gcaccaacgc gcagcccgga ctcggtaatg gcgcgcattg 3840 cgcccagcgc catctgatcg ttggcaacca gcatcgcagt gggaacgatg ccctcattca 3900 gcatttgcat ggtttgttga aaaccggaca tggcactcca gtcgccttcc cgttccgcta 3960 tcggctgaat ttgattgcga gtgagatatt tatgccagcc agccagacgc agacgcgccg 4020 agacagaact taatgggccc gctaacagcg cgatttgctg gtgacccaat gcgaccagat 4080 gctccacgcc cagtcgcgta ccgtcttcat gggagaaaat aatactgttg atgggtgtct 4140 ggtcagagac atcaagaaat aacgccggaa cattagtgca ggcagcttcc acagcaatgg 4200 catcctggtc atccagcgga tagttaatga tcagcccact gacgcgttgc gcgagaagat 4260 tgtgcaccgc cgctttacag gcttcgacgc cgcttcgttc taccatcgac accaccacgc 4320 tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac ggcgcgtgca 4380 gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc agttgttgtg 4440 ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt tcccgcgttt 4500 tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa gagacaccgg 4560 catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg aattgactct 4620 cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg gtgtccggga 4680 tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg 4740 ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc 4800 ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 4860 cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 4920 gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgggatctc gatcccgcga 4980 aattaatacg actcactata ggggaattgt gagcggataa caattcccct ctagaaataa 5040 ttttgtttaa ctttaagaag gagatataca tatgaaatac cttcttccga ctgctgctgc 5100 tggtctttta ctgctggctg ctcagccggc tatggctgct ggtggtggtt ctgccctcca 5160 gacggtctgc ctgaagggga ccaaggtgca catgaaatgc tttctggcct tcacccagac 5220 gaagaccttc cacgaggcca gcgaggactg catctcgcgc gggggcaccc tgagcacccc 5280 tcagactggc tcggagaacg acgccctgta tgagtacctg cgccagagcg tgggcaacga 5340 ggccgagatc tggctgggcc tcaacgacat ggcggccgag ggcacctggg tggacatgac 5400 cggtacccgc atcgcctaca agaactggga gactgagatc accgcgcaac ccgatggcgg 5460 caagaccgag aactgcgcgg tcctgtcagg cgcggccaac ggcaagtggt tcgacaagcg 5520 ctgcagggat caattgccct acatctgcca gttcgggatc gtgtacccct acgacgtgcc 5580 cgactacgcc ggttggagcc acccgcagtt cgaaaaataa ctcgagcacc accaccacca 5640 ccactgagat ccggctgcta acaaagcccg aaaggaagct gagttggctg ctgccaccgc 5700 tgagcaataa ctagcataac cccttggggc ctctaaacgg gtcttgaggg gttttttgct 5760 gaaaggagga actatatccg gat 5783 <210> 163 <211> 4792 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 163 gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60 cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120 tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180 aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240 ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300 ctgaagatca gttgggtgct cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360 tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420 tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480 actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540 gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600 acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660 gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720 acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780 gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840 ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900 gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960 cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020 agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080 catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140 tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200 cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260 gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320 taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc 1380 ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440 tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500 ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560 cgtgcataca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620 agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680 gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740 atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800 gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860 gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920 ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980 cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040 cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100 acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160 cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220 accatgatta cgccaagctt tggagccttt tttttggaga ttttcaacgt gaaaaaatta 2280 ttattcgcaa ttcctttagt tgttcctttc tatgcggccc agccggccat ggccgcctta 2340 cagactgtgt gcctgaaggg caccaaggtg aacttgaagt gcctcctggc cttcacccaa 2400 ccgaagacct tccatgaggc gagcgaggac tgcatctcgc aagggggcac gctgggtacc 2460 ccgcagtcag agctggagaa cgaggcgctg ttcgaatacg cgcgccacag cgtgggcaac 2520 gatgcgaaca tctggctggg cctcaacgac atggccgcgg aaggcgcctg ggtcgactaa 2580 gtgatatcct gacctaactg cagagatcag ttgccctaca tctgccagtt tgccattgtg 2640 gcggccgcag gtgcgccggt gccgtatccg gatccgctgg aaccgcgtgc cgcatagact 2700 gttgaaagtt gtttagcaaa acctcataca gaaaattcat ttactaacgt ctggaaagac 2760 gacaaaactt tagatcgtta cgctaactat gagggctgtc tgtggaatgc tacaggcgtt 2820 gtggtttgta ctggtgacga aactcagtgt tacggtacat gggttcctat tgggcttgct 2880 atccctgaaa atgagggtgg tggctctgag ggtggcggtt ctgagggtgg cggttctgag 2940 ggtggcggta ctaaacctcc tgagtacggt gatacaccta ttccgggcta tacttatatc 3000 aaccctctcg acggcactta tccgcctggt actgagcaaa accccgctaa tcctaatcct 3060 tctcttgagg agtctcagcc tcttaatact ttcatgtttc agaataatag gttccgaaat 3120 aggcagggtg cattaactgt ttatacgggc actgttactc aaggcactga ccccgttaaa 3180 acttattacc agtacactcc tgtatcatca aaagccatgt atgacgctta ctggaacggt 3240 aaattcagag actgcgcttt ccattctggc tttaatgagg atccattcgt ttgtgaatat 3300 caaggccaat cgtctgacct gcctcaacct cctgtcaatg ctggcggcgg ctctggtggt 3360 ggttctggtg gcggctctga gggtggcggc tctgagggtg gcggttctga gggtggcggc 3420 tctgagggtg gcggttccgg tggcggctcc ggttccggtg attttgatta tgaaaaaatg 3480 gcaaacgcta ataagggggc tatgaccgaa aatgccgatg aaaacgcgct acagtctgac 3540 gctaaaggca aacttgattc tgtcgctact gattacggtg ctgctatcga tggtttcatt 3600 ggtgacgttt ccggccttgc taatggtaat ggtgctactg gtgattttgc tggctctaat 3660 tcccaaatgg ctcaagtcgg tgacggtgat aattcacctt taatgaataa tttccgtcaa 3720 tatttacctt ctttgcctca gtcggttgaa tgtcgccctt atgtctttgg cgctggtaaa 3780 ccatatgaat tttctattga ttgtgacaaa ataaacttat tccgtggtgt ctttgcgttt 3840 cttttatatg ttgccacctt tatgtatgta ttttcgacgt ttgctaacat actgcgtaat 3900 aaggagtctt aataagaatt cactggccgt cgttttacaa cgtcgtgact gggaaaaccc 3960 tggcgttacc caacttaatc gccttgcagc acatccccct ttcgccagct ggcgtaatag 4020 cgaagaggcc cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggcg 4080 cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca tacgtcaaag 4140 caaccatagt acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc 4200 agcgtgaccg ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc 4260 tttctcgcca cgttcgccgg ctttccccgt caagctctaa atcgggggct ccctttaggg 4320 ttccgattta gtgctttacg gcacctcgac cccaaaaaac ttgatttggg tgatggttca 4380 cgtagtgggc catcgccctg atagacggtt tttcgccctt tgacgttgga gtccacgttc 4440 tttaatagtg gactcttgtt ccaaactgga acaacactca accctatctc gggctattct 4500 tttgatttat aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa 4560 caaaaattta acgcgaattt taacaaaata ttaacgttta caattttatg gtgcagtctc 4620 agtacaatct gctctgatgc cgcatagtta agccagcccc gacacccgcc aacacccgct 4680 gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc 4740 tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc ga 4792 <210> 164 <211> 4101 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 164 gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60 cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120 tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180 aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240 ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300 ctgaagatca gttgggtgct cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360 tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420 tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480 actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540 gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600 acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660 gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720 acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780 gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840 ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900 gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960 cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020 agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080 catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140 tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200 cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260 gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320 taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc 1380 ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440 tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500 ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560 cgtgcataca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620 agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680 gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740 atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800 gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860 gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920 ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980 cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040 cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100 acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160 cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220 accatgatta cgccaagctt tggagccttt tttttggaga ttttcaacgt gaaaaaatta 2280 ttattcgcaa ttcctttagt tgttcctttc tatgcggccc agccggccat ggccgccctc 2340 cagacggtct gcctgaaggg gaccaaggtg cacatgaaat gctttctggc cttcacccag 2400 acgaagacct tccacgaggc cagcgaggac tgcatctcgc gcgggggcac cctgagcacc 2460 cctcagactg gctcggagaa cgacgccctg tatgagtacc tgcgccagag cgtgggcaac 2520 gaggccgaga tctaagtgac gatatcctga cctaaggtac ctaagtgacg atatcctgac 2580 ctaactgcag ggatcaattg ccctacatct gccagttcgg gatcgtggcg gccgcaggtg 2640 cgccggtgcc gtatccggat ccgctggaac cgcgtgccgc acaggctgag ggtggcggct 2700 ctgagggtgg cggttctgag ggtggcggct ctgagggtgg cggttccggt ggcggctccg 2760 gttccggtga ttttgattat gaaaaaatgg caaacgctaa taagggggct atgaccgaaa 2820 atgccgatga aaacgcgcta cagtctgacg ctaaaggcaa acttgattct gtcgctactg 2880 attacggtgc tgctatcgat ggtttcattg gtgacgtttc cggccttgct aatggtaatg 2940 gtgctactgg tgattttgct ggctctaatt cccaaatggc tcaagtcggt gacggtgata 3000 attcaccttt aatgaataat ttccgtcaat atttaccttc tttgcctcag tcggttgaat 3060 gtcgccctta tgtctttggc gctggtaaac catatgaatt ttctattgat tgtgacaaaa 3120 taaacttatt ccgtggtgtc tttgcgtttc ttttatatgt tgccaccttt atgtatgtat 3180 tttcgacgtt tgctaacata ctgcgtaata aggagtctta ataagaattc actggccgtc 3240 gttttacaac gtcgtgactg ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca 3300 catccccctt tcgccagctg gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa 3360 cagttgcgca gcctgaatgg cgaatggcgc ctgatgcggt attttctcct tacgcatctg 3420 tgcggtattt cacaccgcat acgtcaaagc aaccatagta cgcgccctgt agcggcgcat 3480 taagcgcggc gggtgtggtg gttacgcgca gcgtgaccgc tacacttgcc agcgccctag 3540 cgcccgctcc tttcgctttc ttcccttcct ttctcgccac gttcgccggc tttccccgtc 3600 aagctctaaa tcgggggctc cctttagggt tccgatttag tgctttacgg cacctcgacc 3660 ccaaaaaact tgatttgggt gatggttcac gtagtgggcc atcgccctga tagacggttt 3720 ttcgcccttt gacgttggag tccacgttct ttaatagtgg actcttgttc caaactggaa 3780 caacactcaa ccctatctcg ggctattctt ttgatttata agggattttg ccgatttcgg 3840 cctattggtt aaaaaatgag ctgatttaac aaaaatttaa cgcgaatttt aacaaaatat 3900 taacgtttac aattttatgg tgcagtctca gtacaatctg ctctgatgcc gcatagttaa 3960 gccagccccg acacccgcca acacccgctg acgcgccctg acgggcttgt ctgctcccgg 4020 catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag aggttttcac 4080 cgtcatcacc gaaacgcgcg a 4101 <210> 165 <211> 4114 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 165 gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60 cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120 tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180 aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240 ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300 ctgaagatca gttgggtgct cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360 tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420 tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480 actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540 gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600 acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660 gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720 acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780 gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840 ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900 gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960 cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020 agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080 catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140 tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200 cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260 gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320 taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc 1380 ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440 tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500 ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560 cgtgcataca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620 agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680 gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740 atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800 gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860 gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920 ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980 cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040 cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100 acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160 cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220 accatgatta cgccaagctt tggagccttt tttttggaga ttttcaacgt gaaaaaatta 2280 ttattcgcaa ttcctttagt tgttcctttc tatgcggccc agccggccat ggccgcctta 2340 cagactgtgt gcctgaaggg caccaaggtg aacttgaagt gcctcctggc cttcacccaa 2400 ccgaagacct tccatgaggc gagcgaggac tgcatctcgc aagggggcac gctgggtacc 2460 ccgcagtcag agctggagaa cgaggcgctg ttcgaatacg cgcgccacag cgtgggcaac 2520 gatgcgaaca tctggctggg cctcaacgac atggccgcgg aaggcgcctg ggtcgactaa 2580 gtgatatcct gacctaactg cagagatcag ttgccctaca tctgccagtt tgccattgtg 2640 gcggccgcag gtgcgccggt gccgtatccg gatccgctgg aaccgcgtgc cgcacaggct 2700 gagggtggcg gctctgaggg tggcggttct gagggtggcg gctctgaggg tggcggttcc 2760 ggtggcggct ccggttccgg tgattttgat tatgaaaaaa tggcaaacgc taataagggg 2820 gctatgaccg aaaatgccga tgaaaacgcg ctacagtctg acgctaaagg caaacttgat 2880 tctgtcgcta ctgattacgg tgctgctatc gatggtttca ttggtgacgt ttccggcctt 2940 gctaatggta atggtgctac tggtgatttt gctggctcta attcccaaat ggctcaagtc 3000 ggtgacggtg ataattcacc tttaatgaat aatttccgtc aatatttacc ttctttgcct 3060 cagtcggttg aatgtcgccc ttatgtcttt ggcgctggta aaccatatga attttctatt 3120 gattgtgaca aaataaactt attccgtggt gtctttgcgt ttcttttata tgttgccacc 3180 tttatgtatg tattttcgac gtttgctaac atactgcgta ataaggagtc ttaataagaa 3240 ttcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta cccaacttaa 3300 tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg cccgcaccga 3360 tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg cgcctgatgc ggtattttct 3420 ccttacgcat ctgtgcggta tttcacaccg catacgtcaa agcaaccata gtacgcgccc 3480 tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt 3540 gccagcgccc tagcgcccgc tcctttcgct ttcttccctt cctttctcgc cacgttcgcc 3600 ggctttcccc gtcaagctct aaatcggggg ctccctttag ggttccgatt tagtgcttta 3660 cggcacctcg accccaaaaa acttgatttg ggtgatggtt cacgtagtgg gccatcgccc 3720 tgatagacgg tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg 3780 ttccaaactg gaacaacact caaccctatc tcgggctatt cttttgattt ataagggatt 3840 ttgccgattt cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat 3900 tttaacaaaa tattaacgtt tacaatttta tggtgcagtc tcagtacaat ctgctctgat 3960 gccgcatagt taagccagcc ccgacacccg ccaacacccg ctgacgcgcc ctgacgggct 4020 tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt 4080 cagaggtttt caccgtcatc accgaaacgc gcga 4114 <210> 166 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 166 Lys Met Leu Glu Glu Leu Lys Thr Gln Leu Asp Ser Leu Ala Gln Glu 1 5 10 15 Val Ala Leu Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Cys Leu             20 25 30 <210> 167 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 167 Asp Leu Lys Thr Gln Val Glu Lys Leu Trp Arg Glu Val Asn Ala Leu 1 5 10 15 Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu             20 25 <210> 168 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 168 Asp Leu Lys Thr Gln Val Glu Lys Leu Trp Arg Glu Val Asn Ala Leu 1 5 10 15 Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu             20 25 <210> 169 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 169 Asp Leu Lys Thr Gln Val Glu Lys Leu Trp Arg Glu Val Asn Ala Leu 1 5 10 15 Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu             20 25 <210> 170 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 170 Asp Leu Lys Thr Gln Ile Glu Lys Leu Trp Thr Glu Val Asn Ala Leu 1 5 10 15 Lys Glu Ile Gln Ala Leu Gln Thr Val Cys Leu             20 25 <210> 171 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 171 Asp Asp Leu Lys Thr Gln Ile Asp Lys Leu Trp Arg Glu Val Asn Ala 1 5 10 15 Leu Lys Glu Ile Gln Ala Leu Gln Thr Val Cys Leu             20 25 <210> 172 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 172 Asp Leu Lys Thr Gln Val Glu Lys Leu Trp Arg Glu Val Asn Ala Leu 1 5 10 15 Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu             20 25 <210> 173 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 173 Asp Leu Lys Ser Gln Val Glu Lys Leu Trp Arg Glu Val Asn Ala Leu 1 5 10 15 Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu             20 25 <210> 174 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 174 Asp Leu Lys Thr Gln Val Glu Lys Leu Trp Arg Glu Val Asn Ala Leu 1 5 10 15 Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu             20 25 <175> 175 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 175 Asp Asp Leu Arg Asn Glu Ile Asp Lys Leu Trp Arg Glu Val Asn Ser 1 5 10 15 Leu Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu             20 25 <210> 176 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 176 Lys Met Ile Glu Asp Leu Lys Ala Met Ile Asp Asn Ile Ser Gln Glu 1 5 10 15 Val Ala Leu Leu Lys Glu Lys Gln Ala Leu Gln Thr Val Cys Leu             20 25 30 <210> 177 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 177 Lys Met Ile Glu Asp Leu Lys Ala Met Ile Asp Asn Ile Ser Gln Glu 1 5 10 15 Val Ala Leu Leu Lys Glu Lys Gln Ala Leu Gln Thr Val Cys Leu             20 25 30 <210> 178 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 178 Asp Asp Met Lys Thr Gln Ile Asp Lys Leu Trp Gln Glu Val Asn Ser 1 5 10 15 Leu Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu             20 25 <210> 179 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 179 Asp Asp Leu Lys Thr Gln Ile Asp Lys Leu Trp Arg Glu Val Asn Ala 1 5 10 15 Leu Lys Glu Met Gln Ala Leu Gln Ser Val Cys Leu             20 25 <210> 180 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 180 Asp Asp Leu Lys Ser Gln Val Glu Lys Leu Trp Arg Glu Val Asn Ala 1 5 10 15 Leu Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu             20 25 <210> 181 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 181 Asp Asp Leu Lys Thr Gln Ile Asp Lys Leu Trp Arg Glu Val Asn Ala 1 5 10 15 Leu Lys Glu Met Gln Ala Leu Gln Ser Val Cys Leu             20 25 <210> 182 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 182 Asp Asp Val Arg Ser Gln Ile Glu Lys Leu Trp Gln Glu Val Asn Ser 1 5 10 15 Leu Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu             20 25 <210> 183 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 183 Asp Leu Lys Thr Gln Ile Asp Lys Leu Trp Arg Glu Ile Asn Ser Leu 1 5 10 15 Lys Glu Met Gln Ala Leu Gln Thr Val Cys Leu             20 25 <210> 184 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 184 Glu Glu Leu Arg Arg Gln Val Ser Asp Leu Ala Gln Glu Leu Asn Ile 1 5 10 15 Leu Lys Glu Gln Gln Ala Leu His Thr Val Cys Leu             20 25 <210> 185 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 185 Lys Met Tyr Glu Glu Leu Lys Gln Lys Val Gln Asn Ile Glu Leu Glu 1 5 10 15 Val Ile His Leu Lys Glu Gln Gln Ala Leu Gln Thr Ile Cys Leu             20 25 30 <210> 186 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 186 Lys Met Tyr Glu Asp Leu Lys Lys Lys Val Gln Asn Ile Glu Glu Asp 1 5 10 15 Val Ile His Leu Lys Glu Gln Gln Ala Leu Gln Thr Ile Cys Leu             20 25 30 <210> 187 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 187 Glu Glu Leu Lys Lys Gln Ile Asp Asn Ile Val Leu Glu Leu Asn Leu 1 5 10 15 Leu Lys Glu Gln Gln Ala Leu Gln Ser Val Cys Leu             20 25 <210> 188 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 188 Glu Glu Leu Lys Lys Gln Ile Asp Gln Ile Ile Gln Asp Leu Asn Leu 1 5 10 15 Leu Lys Glu Gln Gln Ala Leu Gln Thr Val Cys Leu             20 25 <210> 189 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 189 Glu Gln Met Gln Lys Gln Ile Asn Asp Ile Val Gln Glu Leu Asn Leu 1 5 10 15 Leu Lys Glu Gln Gln Ala Leu Gln Ala Val Cys Leu             20 25 <210> 190 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 190 Glu Gln Met Gln Lys Gln Ile Asn Asp Ile Val Gln Glu Leu Asn Leu 1 5 10 15 Leu Lys Glu Gln Gln Ala Leu Gln Ala Val Cys Leu             20 25 <210> 191 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 191 Asn Thr Gly Leu Leu Glu Ser Gln Leu Ser Arg His Asp Gln Met Leu 1 5 10 15 Ser Val His Asp Ile Arg Leu Ala Asp Met Asp Leu Arg Phe Gln Val             20 25 30 Leu Glu Thr Ala Ser Tyr Asn Gly Val Leu Ile Trp Lys Ile Arg Asp         35 40 45 Tyr Lys Arg Arg Lys Gln Glu Ala Val Met     50 55 <210> 192 <211> 21 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 192 Ala Ala Ser Glu Arg Lys Ala Leu Gln Thr Glu Met Ala Arg Ile Lys 1 5 10 15 Lys Trp Leu Thr Phe             20 <210> 193 <211> 37 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 193 Phe Asp Met Ser Cys Arg Ser Arg Leu Ala Thr Leu Asn Glu Lys Leu 1 5 10 15 Thr Ala Leu Glu Arg Arg Ile Glu Tyr Ile Glu Ala Arg Val Thr Lys             20 25 30 Gly Glu Thr Leu Thr         35 <210> 194 <211> 49 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 194 Ala Asp Ile Tyr Lys Ala Asp Phe Gln Ala Glu Arg Gln Ala Arg Glu 1 5 10 15 Lys Leu Ala Glu Lys Lys Glu Leu Leu Gln Glu Gln Leu Glu Gln Leu             20 25 30 Gln Arg Glu Tyr Ser Lys Leu Lys Ala Ser Cys Gln Glu Ser Ala Arg         35 40 45 Ile      <210> 195 <211> 71 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 195 Leu Thr Gly Ser Ala Gln Asn Ile Glu Phe Arg Thr Gly Ser Leu Gly 1 5 10 15 Lys Ile Lys Leu Asn Asp Glu Asp Leu Ser Glu Cys Leu His Gln Ile             20 25 30 Gln Lys Asn Lys Glu Asp Ile Ile Glu Leu Lys Gly Ser Ala Ile Gly         35 40 45 Leu Pro Ile Tyr Gln Leu Asn Ser Lys Leu Val Asp Leu Glu Arg Lys     50 55 60 Phe Gln Gly Leu Gln Gln Thr 65 70 <210> 196 <211> 28 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 196 Leu Arg Gly Leu Arg Thr Ile Val Thr Thr Leu Gln Asp Ser Ile Arg 1 5 10 15 Lys Val Thr Glu Glu Asn Lys Glu Leu Ala Asn Glu             20 25 <210> 197 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 197 Val Ala Ser Leu Arg Gln Gln Val Glu Ala Leu Gln Gly Gln Val Gln 1 5 10 15 His Leu Gln Ala Ala Phe Ser Gln Tyr Lys Lys             20 25 <210> 198 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 198 Val Asn Ala Leu Arg Gln Arg Val Gly Ile Leu Glu Gly Gln Leu Gln 1 5 10 15 Arg Leu Gln Asn Ala Phe Ser Gln Tyr Lys Lys             20 25 <210> 199 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 199 Ser Ala Ala Leu Arg Gln Gln Met Glu Ala Leu Asn Gly Lys Leu Gln 1 5 10 15 Arg Leu Glu Ala Ala Phe Ser Arg Tyr Lys Lys             20 25 <210> 200 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 200 Val Asn Ala Leu Lys Gln Arg Val Thr Ile Leu Asp Gly His Leu Arg 1 5 10 15 Arg Phe Gln Asn Ala Phe Ser Gln Tyr Lys Lys             20 25 <210> 201 <211> 27 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 201 Val Asp Thr Leu Arg Gln Arg Met Arg Asn Leu Glu Gly Glu Val Gln 1 5 10 15 Arg Leu Gln Asn Ile Val Thr Gln Tyr Arg Lys             20 25 <210> 202 <211> 64 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 202 Gly Ser Pro Gly Leu Lys Gly Asp Lys Gly Ile Pro Gly Asp Lys Gly 1 5 10 15 Ala Lys Gly Glu Ser Gly Leu Pro Asp Val Ala Ser Leu Arg Gln Gln             20 25 30 Val Glu Ala Leu Gln Gly Gln Val Gln His Leu Gln Ala Ala Phe Ser         35 40 45 Gln Tyr Lys Lys Val Glu Leu Phe Pro Gly Gly Ile Pro His Arg Asp     50 55 60 <210> 203 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 203 Asp Met Ala Ala Glu Gly Thr Trp 1 5 <210> 204 <211> 6 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 204 Asp Met Thr Gly Ala Arg 1 5 <210> 205 <211> 10 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 205 Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro 1 5 10 <210> 206 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 206 Asp Gly Gly Lys Thr Glu Asn 1 5 <210> 207 <211> 10 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (3) .. (9) <223> Xaa can be any naturally occurring amino acid <400> 207 Asp Met Xaa Xaa Xaa Xaa Xaa Xaa Xaa Trp 1 5 10 <210> 208 <211> 6 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (6) <223> Xaa can be any naturally occurring amino acid <400> 208 Asp Met Thr Gly Ala Xaa 1 5 <210> 209 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 209 Asp Gly Gly Ala Thr Glu Asn 1 5 <210> 210 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (3) .. (7) <223> Xaa can be any naturally occurring amino acid <400> 210 Asp Met Xaa Xaa Xaa Xaa Xaa Trp 1 5 <210> 211 <211> 6 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature &Lt; 222 > (3) <223> Xaa can be any naturally occurring amino acid <400> 211 Asp Met Xaa Xaa Xaa Xaa 1 5 <210> 212 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (1) .. (7) <223> Xaa can be any naturally occurring amino acid <400> 212 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Trp 1 5 <210> 213 <211> 10 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (4) .. (8) <223> Xaa can be any naturally occurring amino acid <400> 213 Asp Gly Gly Xaa Xaa Xaa Xaa Xaa Glu Asn 1 5 10 <210> 214 <211> 10 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (3) .. (8) <223> Xaa can be any naturally occurring amino acid <400> 214 Asn Trp Xaa Xaa Xaa Xaa Xaa Xaa Gln Pro 1 5 10 <210> 215 <211> 11 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (3) .. (9) <223> Xaa can be any naturally occurring amino acid <400> 215 Asn Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gln Pro 1 5 10 <210> 216 <211> 12 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (3) .. (10) <223> Xaa can be any naturally occurring amino acid <400> 216 Asn Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gln Pro 1 5 10 <210> 217 <211> 16 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (6) .. (12) <223> Xaa can be any naturally occurring amino acid <400> 217 Asn Trp Glu Thr Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Ala Gln Pro 1 5 10 15 <210> 218 <211> 13 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (5) .. (10) <223> Xaa can be any naturally occurring amino acid <400> 218 Asn Trp Glu Thr Xaa Xaa Xaa Xaa Xaa Xaa Ala Gln Pro 1 5 10 <210> 219 <211> 10 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (4) .. (9) <223> Xaa can be any naturally occurring amino acid <400> 219 Asp Gly Gly Xaa Xaa Xaa Xaa Xaa Xaa Asn 1 5 10 <210> 220 <211> 10 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (4) .. (10) <223> Xaa can be any naturally occurring amino acid <400> 220 Asn Trp Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1 5 10 <210> 221 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (1) .. (1) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature (222) (4) .. (6) <223> Xaa can be any naturally occurring amino acid <400> 221 Xaa Gly Gly Xaa Xaa Xaa Asn 1 5 <210> 222 <211> 10 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (4) .. (8) <223> Xaa can be any naturally occurring amino acid <400> 222 Asn Trp Glu Xaa Xaa Xaa Xaa Xaa Gln Pro 1 5 10 <210> 223 <211> 11 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (4) .. (10) <223> Xaa can be any naturally occurring amino acid <400> 223 Asp Gly Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn 1 5 10 <210> 224 <211> 62 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (21) .. (22) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (23) <223> k is g or t <220> <221> misc_feature (222) (24) .. (25) <223> n is a, c, g, or t <220> <221> misc_feature (222) (26) .. (26) <223> k is g or t <220> <221> misc_feature (222) (27) .. (28) <223> n is a, c, g, or t <220> <221> misc_feature (222) (29) .. (29) <223> k is g or t <220> <221> misc_feature (222) (30) .. (31) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (32) <223> k is g or t <220> <221> misc_feature (222) (33) .. (34) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (35) <223> k is g or t <220> <221> misc_feature (222) (36) .. (37) <223> n is a, c, g, or t <220> <221> misc_feature (222) (38) .. (38) <223> k is g or t <220> <221> misc_feature (222) (39) .. (40) <223> n is a, c, g, or t <220> <221> misc_feature (222) (41) .. (41) <223> k is g or t <400> 224 ggctgggcct gaacgacatg nnknnknnkn nknnknnknn ktgggtggat atgactggcg 60 cc 62 <210> 225 <211> 60 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (37) .. (37) <223> m is a or c <220> <221> misc_feature (222) (38) .. (39) <223> n is a, c, g, or t <400> 225 ggcggtgatc tcagtttccc agttcttgta ggcgatmnng gcgccagtca tatccaccca 60 <210> 226 <211> 60 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 226 ggcggtgatc tcagtttccc agttcttgta ggcgatgcgg gcgccagtca tatccaccca 60 <210> 227 <211> 62 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 227 actgggaaac tgagatcacc gcccaacctg atggcggcgc aaccgagaac tgcgcggtcc 60 tg 62 <210> 228 <211> 60 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 228 ccctgcagcg cttgtcgaac cacttgccgt tggcggcgcc agacaggacc gcgcagttct 60 <210> 229 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 229 gccgagatct ggctgggcct gaacgacatg 30 <210> 230 <211> 23 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 230 atccctgcag cgcttgtcga acc 23 <210> 231 <211> 79 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (21) .. (22) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (23) <223> k is g or t <220> <221> misc_feature (222) (24) .. (25) <223> n is a, c, g, or t <220> <221> misc_feature (222) (26) .. (26) <223> k is g or t <220> <221> misc_feature (222) (27) .. (28) <223> n is a, c, g, or t <220> <221> misc_feature (222) (29) .. (29) <223> k is g or t <220> <221> misc_feature (222) (30) .. (31) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (32) <223> k is g or t <220> <221> misc_feature (222) (33) .. (34) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (35) <223> k is g or t <220> <221> misc_feature (222) (48) .. (49) <223> n is a, c, g, or t <220> <221> misc_feature (222) (50) .. (50) <223> k is g or t <220> <221> misc_feature (222) (51) .. (52) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (53) <223> k is g or t <220> <221> misc_feature <222> (54) .. (55) <223> n is a, c, g, or t <220> <221> misc_feature <222> (56) .. (56) <223> k is g or t <220> <221> misc_feature (222) (57) .. (58) <223> n is a, c, g, or t <220> <221> misc_feature (222) (59) .. (59) <223> k is g or t <400> 231 ggctgggcct gaacgacatg nnknnknnkn nknnktgggt ggatatgnnk nnknnknnka 60 tcgcctacaa gaactggga 79 <210> 232 <211> 77 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 232 gacaggacgg cgcagttctc ggttgcgccg ccatcaggtt gggcggtgat ctcagtttcc 60 cagttcttgt aggcgat 77 <210> 233 <211> 63 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 233 atccctgcag cgcttgtcga accacttgcc gttggcggcg ccagacagga cggcgcagtt 60 ctc 63 <210> 234 <211> 60 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature &Lt; 222 > (22) <223> n is a, c, g, or t <220> <221> misc_feature (222) (24) .. (24) <223> s is g or c <220> <221> misc_feature (222) (25) .. (26) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (27) <223> s is g or c <220> <221> misc_feature <222> (28) .. (29) <223> n is a, c, g, or t <220> <221> misc_feature (222) (30) .. (30) <223> s is g or c <220> <221> misc_feature (222) (31) .. (32) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (33) <223> s is g or c <220> <221> misc_feature &Lt; 222 > (34) <223> n is a, c, g, or t <220> <221> misc_feature <222> (36) .. (36) <223> s is g or c <220> <221> misc_feature (222) (37) .. (38) <223> n is a, c, g, or t <220> <221> misc_feature (222) (39) .. (39) <223> s is g or c <220> <221> misc_feature (222) (40) .. (41) <223> n is a, c, g, or t <220> <221> misc_feature (222) (42) .. (42) <223> s is g or c <400> 234 gagatctggc tgggcctcaa cnnsnnsnns nnsnnsnnsn nstgggtgga catgactggc 60 <210> 235 <211> 63 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 235 ttgcgcggtg atctcagtct cccagttctt gtaggcgata cgcgcgccag tcatgtccac 60 cca 63 <210> 236 <211> 64 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (32) .. (33) <223> n is a, c, g, or t <220> <221> misc_feature (222) (34) .. (34) <223> s is g or c <220> <221> misc_feature (222) (35) .. (36) <223> n is a, c, g, or t <220> <221> misc_feature (222) (37) .. (37) <223> s is g or c <220> <221> misc_feature (222) (38) .. (39) <223> n is a, c, g, or t <220> <221> misc_feature (222) (40) .. (40) <223> s is g or c <220> <221> misc_feature (222) (41) .. (42) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (43) <223> s is g or c <220> <221> misc_feature (222) (44) .. (45) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (46) .. (46) <223> s is g or c <400> 236 gactgagatc accgcgcaac ccgatggcgg cnnsnnsnns nnsnnsgaga actgcgcggt 60 cctg 64 <210> 237 <211> 60 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 237 ccctgcagcg cttgtcgaac cacttgccgt tggccgcgcc tgacaggacc gcgcagttct 60 <210> 238 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 238 gccgagatct ggctgggcct ca 22 <210> 239 <211> 78 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 239 atctggctgg gcctgaacga catggccgcc gagggcacct gggtggatat gaccggcgcg 60 cgtatcgcct acaagaac 78 <210> 240 <211> 62 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (18) .. (18) <223> m is a or c <220> <221> misc_feature (222) (19) .. (20) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (21) <223> m is a or c <220> <221> misc_feature &Lt; 222 > (22) <223> n is a, c, g, or t <220> <221> misc_feature (222) (24) .. (24) <223> m is a or c <220> <221> misc_feature (222) (25) .. (26) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (27) <223> m is a or c <220> <221> misc_feature <222> (28) .. (29) <223> n is a, c, g, or t <220> <221> misc_feature (222) (30) .. (30) <223> m is a or c <220> <221> misc_feature (222) (31) .. (32) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (33) <223> m is a or c <220> <221> misc_feature &Lt; 222 > (34) <223> n is a, c, g, or t <400> 240 ccgccatcgg gttgggcmnn mnnmnnmnnm nnmnnagttt cccagttctt gtaggcgata 60 cg 62 <210> 241 <211> 57 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (19) .. (20) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (21) <223> k is g or t <220> <221> misc_feature &Lt; 222 > (22) <223> n is a, c, g, or t <220> <221> misc_feature (222) (24) .. (24) <223> k is g or t <220> <221> misc_feature (222) (25) .. (26) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (27) <223> k is g or t <220> <221> misc_feature <222> (28) .. (29) <223> n is a, c, g, or t <220> <221> misc_feature (222) (30) .. (30) <223> k is g or t <220> <221> misc_feature (222) (31) .. (32) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (33) <223> k is g or t <220> <221> misc_feature &Lt; 222 > (34) <223> n is a, c, g, or t <220> <221> misc_feature <222> (36) .. (36) <223> k is g or t <400> 241 gcccaacccg atggcggcnn knnknnknnk nnknnkaact gcgccgtcct gtctggc 57 <210> 242 <211> 54 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 242 cctgcagcgc ttgtcgaacc acttgccgtt ggcggcgcca gacaggacgg cgca 54 <210> 243 <211> 84 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (22) .. (22) <223> m is a or c <220> <221> misc_feature (222) (23) .. (24) <223> n is a, c, g, or t <220> <221> misc_feature (222) (25) .. (25) <223> m is a or c <220> <221> misc_feature (222) (26) .. (27) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (28) <223> m is a or c <220> <221> misc_feature (222) (29) .. (30) <223> n is a, c, g, or t <220> <221> misc_feature (222) (37) .. (37) <223> m is a or c <220> <221> misc_feature (222) (38) .. (39) <223> n is a, c, g, or t <220> <221> misc_feature (222) (40) .. (40) <223> m is a or c <220> <221> misc_feature (222) (41) .. (42) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (43) <223> m is a or c <220> <221> misc_feature (222) (44) .. (45) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (46) .. (46) <223> m is a or c <220> <221> misc_feature (222) (47) .. (48) <223> n is a, c, g, or t <220> <221> misc_feature (222) (49) .. (49) <223> m is a or c <220> <221> misc_feature (222) (50) .. (51) <223> n is a, c, g, or t <220> <221> misc_feature (52). (52) <223> m is a or c <220> <221> misc_feature (222) (53) .. (54) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (55) <223> m is a or c <220> <221> misc_feature (222) (56) .. (57) <223> n is a, c, g, or t <220> <221> misc_feature <222> (58) .. (58) <223> m is a or c <220> <221> misc_feature (222) (59) .. (60) <223> n is a, c, g, or t <400> 243 gccagacagg acggcgcagt tmnnmnnmnn gccgccmnnm nnmnnmnnmn nmnnmnnmnn 60 ttcccagttc ttgtaggcga tacg 84 <210> 244 <211> 53 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 244 ccgccatcgg gttgggcggt gatctcagtt tcccagttct tgtaggcgat acg 53 <210> 245 <211> 60 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (19) .. (20) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (21) <223> k is g or t <220> <221> misc_feature &Lt; 222 > (22) <223> n is a, c, g, or t <220> <221> misc_feature (222) (24) .. (24) <223> k is g or t <220> <221> misc_feature (222) (25) .. (26) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (27) <223> k is g or t <220> <221> misc_feature <222> (28) .. (29) <223> n is a, c, g, or t <220> <221> misc_feature (222) (30) .. (30) <223> k is g or t <220> <221> misc_feature (222) (31) .. (32) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (33) <223> k is g or t <220> <221> misc_feature &Lt; 222 > (34) <223> n is a, c, g, or t <220> <221> misc_feature <222> (36) .. (36) <223> k is g or t <220> <221> misc_feature (222) (37) .. (38) <223> n is a, c, g, or t <220> <221> misc_feature (222) (39) .. (39) <223> k is g or t <400> 245 gcccaacccg atggcggcnn knnknnknnk nnknnknnka actgcgccgt cctgtctggc 60 <210> 246 <211> 74 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (30) .. (31) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (32) <223> k is g or t <220> <221> misc_feature (222) (33) .. (34) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (35) <223> k is g or t <220> <221> misc_feature (222) (36) .. (37) <223> n is a, c, g, or t <220> <221> misc_feature (222) (38) .. (38) <223> k is g or t <220> <221> misc_feature (222) (39) .. (40) <223> n is a, c, g, or t <220> <221> misc_feature (222) (41) .. (41) <223> k is g or t <220> <221> misc_feature (222) (42) .. (43) <223> n is a, c, g, or t <220> <221> misc_feature (222) (44) .. (44) <223> k is g or t <220> <221> misc_feature (222) (45) .. (46) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (47) .. (47) <223> k is g or t <400> 246 ctggcgcgcg tatcgcctac aagaactggn nknnknnknn knnknnkcaa cccgatggcg 60 gcgccaccga gaac 74 <210> 247 <211> 77 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (30) .. (31) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (32) <223> k is g or t <220> <221> misc_feature (222) (33) .. (34) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (35) <223> k is g or t <220> <221> misc_feature (222) (36) .. (37) <223> n is a, c, g, or t <220> <221> misc_feature (222) (38) .. (38) <223> k is g or t <220> <221> misc_feature (222) (39) .. (40) <223> n is a, c, g, or t <220> <221> misc_feature (222) (41) .. (41) <223> k is g or t <220> <221> misc_feature (222) (42) .. (43) <223> n is a, c, g, or t <220> <221> misc_feature (222) (44) .. (44) <223> k is g or t <220> <221> misc_feature (222) (45) .. (46) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (47) .. (47) <223> k is g or t <220> <221> misc_feature (222) (48) .. (49) <223> n is a, c, g, or t <220> <221> misc_feature (222) (50) .. (50) <223> k is g or t <400> 247 ctggcgcgcg tatcgcctac aagaactggn nknnknnknn knnknnknnk caacccgatg 60 gcggcgccac cgagaac 77 <210> 248 <211> 77 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (30) .. (31) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (32) <223> k is g or t <220> <221> misc_feature (222) (33) .. (34) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (35) <223> k is g or t <220> <221> misc_feature (222) (36) .. (37) <223> n is a, c, g, or t <220> <221> misc_feature (222) (38) .. (38) <223> k is g or t <220> <221> misc_feature (222) (39) .. (40) <223> n is a, c, g, or t <220> <221> misc_feature (222) (41) .. (41) <223> k is g or t <220> <221> misc_feature (222) (42) .. (43) <223> n is a, c, g, or t <220> <221> misc_feature (222) (44) .. (44) <223> k is g or t <220> <221> misc_feature (222) (45) .. (46) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (47) .. (47) <223> k is g or t <220> <221> misc_feature (222) (48) .. (49) <223> n is a, c, g, or t <220> <221> misc_feature (222) (50) .. (50) <223> k is g or t <400> 248 ctggcgcgcg tatcgcctac aagaactggn nknnknnknn knnknnknnk caacccgatg 60 gcggcgccac cgagaac 77 <210> 249 <211> 81 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 249 cctgcagcgc ttgtcgaacc acttgccgtt ggcggcgcca gacaggacgg cgcagttctc 60 ggtggcgccg ccatcgggtt g 81 <210> 250 <211> 89 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (24) .. (24) <223> s is g or c <220> <221> misc_feature (222) (25) .. (25) <223> m is a or c <220> <221> misc_feature &Lt; 222 > (28) <223> w is a or t <220> <221> misc_feature (222) (29) .. (29) <223> m is a or c <220> <221> misc_feature &Lt; 222 > (33) <223> s is g or c <220> <221> misc_feature (222) (42) .. (42) <223> m is a or c <220> <221> misc_feature (222) (43) .. (44) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (51) <223> m is a or c <220> <221> misc_feature <222> (52) .. (53) <223> n is a, c, g, or t <220> <221> misc_feature <222> (54) .. (54) <223> m is a or c <220> <221> misc_feature (222) (55) .. (56) <223> n is a, c, g, or t <220> <221> misc_feature (222) (57) .. (57) <223> m is a or c <220> <221> misc_feature (222) (58) .. (59) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (60) <223> m is a or c <220> <221> misc_feature (222) (61) .. (62) <223> n is a, c, g, or t <220> <221> misc_feature (222) (63) .. (63) <223> m is a or c <220> <221> misc_feature (222) (64) .. (65) <223> n is a, c, g, or t <400> 250 gacaggaccg cgcagttctc gccsmagwmc ccsaagccgc cmnngggttg mnnmnnmnnm 60 nnmnnctccc agttcttgta ggcgatacg 89 <210> 251 <211> 66 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 251 atccctgcag cgcttgtcga accacttgcc gttggccgcg cctgacagga ccgcgcagtt 60 ctcgcc 66 <210> 252 <211> 54 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 252 gagcgtgggc aacgaggccg agatctggct gggcctcaac gacatggccg ccga 54 <210> 253 <211> 70 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 253 ccagttcttg taggcgatac gcgcgccagt catatccacc caggtgccct cggcggccat 60 gtcgttgagg 70 <210> 254 <211> 74 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (28) .. (29) <223> n is a, c, g, or t <220> <221> misc_feature (222) (30) .. (30) <223> k is g or t <220> <221> misc_feature (222) (31) .. (32) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (33) <223> k is g or t <220> <221> misc_feature &Lt; 222 > (34) <223> n is a, c, g, or t <220> <221> misc_feature <222> (36) .. (36) <223> k is g or t <220> <221> misc_feature (222) (37) .. (38) <223> n is a, c, g, or t <220> <221> misc_feature (222) (39) .. (39) <223> k is g or t <220> <221> misc_feature (222) (40) .. (41) <223> n is a, c, g, or t <220> <221> misc_feature (222) (42) .. (42) <223> k is g or t <220> <221> misc_feature (222) (43) .. (44) <223> n is a, c, g, or t <220> <221> misc_feature (45). (45) <223> k is g or t <220> <221> misc_feature (222) (46) .. (47) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (48) <223> k is g or t <400> 254 atcgcctaca agaactggga gactgrgnnk nnknnknnkn nknnknnkac cgcgcaaccc 60 gatggcggtg caac 74 <210> 255 <211> 74 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 255 cgcttgtcga accacttgcc gttggcggcg ccagacagga cggcgcagtt ctcggttgca 60 ccgccatcgg gttg 74 <210> 256 <211> 18 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 256 gacatggccg cggaaggc 18 <210> 257 <211> 24 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 257 gcagatgtag ggcaactgat ctct 24 <210> 258 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 258 gccgagatct ggctgggcct ga 22 <210> 259 <211> 63 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (31) .. (32) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (33) <223> k is g or t <220> <221> misc_feature &Lt; 222 > (34) <223> n is a, c, g, or t <220> <221> misc_feature <222> (36) .. (36) <223> k is g or t <220> <221> misc_feature (222) (37) .. (38) <223> n is a, c, g, or t <220> <221> misc_feature (222) (39) .. (39) <223> k is g or t <220> <221> misc_feature (222) (40) .. (41) <223> n is a, c, g, or t <220> <221> misc_feature (222) (42) .. (42) <223> k is g or t <220> <221> misc_feature &Lt; 222 > (43) <223> w is a or t <400> 259 gccgagatct ggctgggcct caacggcagc nnknnknnkn nkwcctgggt ggacatgact 60 ggc 63 <210> 260 <211> 63 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 260 ttgcgcggtg atctcagtct cccagttctt gtaggcgata cgcgcgccag tcatgtccac 60 cca 63 <210> 261 <211> 64 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 261 gactgagatc accgcgcaac ccgatggcgg cttcggcgtg ttcggcgaga actgcgcggt 60 cctg 64 <210> 262 <211> 64 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 262 gactgagatc accgcgcaac ccgatggcgg ctggggcgtg ttcggcgaga actgcgcggt 60 cctg 64 <210> 263 <211> 64 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 263 gactgagatc accgcgcaac ccgatggcgg cttcgggtac ttcggcgaga actgcgcggt 60 cctg 64 <210> 264 <211> 64 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 264 gactgagatc accgcgcaac ccgatggcgg ctgggggtac ttcggcgaga actgcgcggt 60 cctg 64 <210> 265 <211> 64 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 265 gactgagatc accgcgcaac ccgatggcgg ctggggcgtg tggggcgaga actgcgcggt 60 cctg 64 <210> 266 <211> 77 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 266 tgggcctgaa cgacatggcc gccgagggca cctgggtgga tatgactggc gcgcgtatcg 60 cctacaagaa ctgggag 77 <210> 267 <211> 59 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature &Lt; 222 > (21) <223> m is a or c <220> <221> misc_feature &Lt; 222 > (22) <223> n is a, c, g, or t <220> <221> misc_feature (222) (24) .. (24) <223> m is a or c <220> <221> misc_feature (222) (25) .. (26) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (27) <223> m is a or c <220> <221> misc_feature <222> (28) .. (29) <223> n is a, c, g, or t <220> <221> misc_feature (222) (30) .. (30) <223> m is a or c <220> <221> misc_feature (222) (31) .. (32) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (33) <223> m is a or c <220> <221> misc_feature &Lt; 222 > (34) <223> n is a, c, g, or t <400> 267 gttgcgccgc catcgggttg mnnmnnmnnm nnmnnctccc agttcttgta ggcgatacg 59 <210> 268 <211> 44 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 268 caacccgatg gcggcgcaac cgagaactgc gccgtcctgt ctgg 44 <210> 269 <211> 73 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (44) .. (44) <223> m is a or c <220> <221> misc_feature (222) (45) .. (46) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (47) .. (47) <223> m is a or c <220> <221> misc_feature (222) (48) .. (49) <223> n is a, c, g, or t <220> <221> misc_feature (222) (50) .. (50) <223> m is a or c <220> <221> misc_feature (222) (51) .. (52) <223> n is a, c, g, or t <400> 269 tgtagggcaa ttgatccctg cagcgcttgt cgaaccactt gccmnnmnnm nngccagaca 60 ggacggcgca gtt 73 <210> 270 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <400> 270 gccgagatct ggctgggcct gaacgacatg g 31 <210> 271 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 271 Gly Ser Asn Val Thr Gln Thr 1 5 <210> 272 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 272 Phe Gly Ala Phe Gly 1 5 <210> 273 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 273 Gly Ser Ser Val Ser Asp Val 1 5 <210> 274 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 274 Phe Gly Met Trp Gly 1 5 <210> 275 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 275 Ala Gly Arg Tyr Ser Leu Ile 1 5 <210> 276 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 276 Phe Gly Val Phe Gly 1 5 <210> 277 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 277 Gly Ser Arg Arg Ser Gly Val 1 5 <210> 278 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 278 Arg Gly Ala Thr Val Lys Val 1 5 <210> 279 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 279 Ala Asn Pro Ala Gln Asp Leu 1 5 <210> 280 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 280 Phe Gly Val Trp Gly 1 5 <210> 281 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 281 Ala Pro Gly Ala Met Glu Phe 1 5 <210> 282 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 282 Gly Ser Pro Asp Leu Gly Val 1 5 <210> 283 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 283 Gly Ser Val Arg Ser Ala Thr 1 5 <210> 284 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 284 Phe Gly Tyr Phe Gly 1 5 <210> 285 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 285 Gly Ser Pro Val Gly Asp Met 1 5 <210> 286 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 286 Ile Gly Val Trp Gly 1 5 <210> 287 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 287 Gly Ser Ser Lys Leu Gly Leu 1 5 <210> 288 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 288 Gly Ser Val Arg Gly Arg Thr 1 5 <210> 289 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 289 Thr Asn Val Thr Arg Thr Leu 1 5 <210> 290 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 290 Leu Gly Val Trp Gly 1 5 <210> 291 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 291 Gly Ser Ala Leu Thr Asn Thr 1 5 <210> 292 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 292 Ala Asn Arg Arg Arg Thr Met 1 5 <210> 293 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 293 Met Gly Val Trp Gly 1 5 <210> 294 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 294 Gly Ser Ser Val Ser Gly Leu 1 5 <210> 295 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 295 Val Gly Val Phe Gly 1 5 <210> 296 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 296 Gly Ser Trp Leu Gly Asp Val 1 5 <210> 297 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 297 Ser Gly Lys Ala Arg Asp Val 1 5 <210> 298 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 298 Gly Ser Arg Phe Gly His Leu 1 5 <210> 299 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 299 Trp Gly Val Phe Gly 1 5 <210> 300 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 300 Gly Ser Arg Ile Ser Gly Val 1 5 <210> 301 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 301 Ser Gly Lys Arg Arg Thr Val 1 5 <210> 302 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 302 Ser Gly Ser Trp Ala Arg Thr 1 5 <210> 303 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 303 Ala Gly Ala Arg Ala Glu Tyr 1 5 <210> 304 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 304 Trp Gly Val Trp Gly 1 5 <210> 305 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 305 Gly Pro Gly Gln Ala Gly Leu 1 5 <210> 306 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 306 Gly Ser Thr Tyr Thr Asp Leu 1 5 <210> 307 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 307 Gly Thr Arg Met Thr Asn Thr 1 5 <210> 308 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 308 Trp Gly Tyr Phe Gly 1 5 <210> 309 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 309 Gly Ser Leu Leu Thr Gly Leu 1 5 <210> 310 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 310 Tyr Gly Ala Trp Gly 1 5 <210> 311 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 311 Gly Ser Lys Ala Gly Lys Leu 1 5 <210> 312 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 312 Tyr Gly Val Phe Gly 1 5 <210> 313 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 313 Ala Ser Leu Arg Ser Arg Val 1 5 <210> 314 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 314 Tyr Gly Val Trp Gly 1 5 <210> 315 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 315 Gly Asn Pro Ser Gly Ser Val 1 5 <210> 316 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 316 Thr Gly Ala Leu His Gln Val 1 5 <210> 317 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 317 Trp Thr Lys Arg Thr Ala Leu 1 5 <210> 318 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 318 Met Phe Gly Met Gly 1 5 <210> 319 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 319 Trp Thr Leu Ala Lys Asn Leu 1 5 <210> 320 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 320 Leu Phe Gly Arg Gly 1 5 <210> 321 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 321 Val Leu Gly Trp Arg Arg Glu 1 5 <210> 322 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 322 Leu Val Met Pro Met 1 5 <210> 323 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 323 Leu Ala Thr Trp Leu Arg Trp 1 5 <210> 324 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 324 Gln Arg Met Ser Tyr 1 5 <210> 325 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 325 Gln His Leu Gly Ser Phe Trp 1 5 <210> 326 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 326 Val Glu Phe Gln Gly 1 5 <210> 327 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 327 Gly Ser Ala Thr Thr Ala Thr 1 5 <210> 328 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 328 Gly Ser Ala Thr Thr Asp Thr 1 5 <210> 329 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 329 Ser Gly His Trp Arg Ala Val 1 5 <210> 330 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 330 Ala Pro Pro Asp Leu Gly Leu 1 5 <210> 331 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 331 Ala Pro Lys Ser Arg Gln Tyr 1 5 <210> 332 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 332 Val Met Gln Leu Pro Arg Lys 1 5 <210> 333 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 333 Ala Gly Arg Met Gly Leu Val 1 5 <210> 334 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 334 Ala Gly Tyr Thr Lys Gln Pro Ser 1 5 <210> 335 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 335 Leu Leu Leu Arg Asn Gln Pro Pro 1 5 <210> 336 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 336 Gln Glu Pro Ala Lys Gln Pro Thr 1 5 <210> 337 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 337 His Pro Leu Pro Pro Gln Pro Ser 1 5 <210> 338 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 338 His Gln Pro Val Tyr Gln Pro Gly 1 5 <210> 339 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 339 Leu Pro Pro Pro Gly His Pro Gln 1 5 <210> 340 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 340 Asn Gly His Glu Pro Gln Pro Arg 1 5 <210> 341 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 341 Asn Asn Leu Ser Ala Gln Pro Arg 1 5 <210> 342 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 342 Pro Pro Glu Pro Leu His Pro Met 1 5 <210> 343 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 343 Pro Pro Gly Pro His His Pro Met 1 5 <210> 344 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 344 Pro Pro Pro Pro His His Pro Met 1 5 <210> 345 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 345 Arg Pro Ala Leu Val Gln Pro Arg 1 5 <210> 346 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 346 Arg Pro Pro Leu Tyr Gln Pro Gly 1 5 <210> 347 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 347 Arg Thr Pro Pro Trp Gln Pro Glu 1 5 <210> 348 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 348 Pro Pro Pro Pro His His Pro Gln 1 5 <210> 349 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 349 Pro Pro Gly Pro Ala His Pro Gln 1 5 <210> 350 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 350 Leu Ala Gly Trp Gly Met Ser 1 5 <210> 351 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 351 Thr Pro Pro Arg Thr Gln Pro Pro 1 5 <210> 352 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <352> 352 Pro Pro Ala Pro Tyr His Pro Met 1 5 <210> 353 <211> 4 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 353 Gly Val Phe Gly One <210> 354 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 354 Gly Ser Thr Leu Thr Arg Ile 1 5 <210> 355 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 355 Leu Gly Ala Phe Gly 1 5 <210> 356 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 356 Gly Ser Ala Gly Thr Asn Thr 1 5 <210> 357 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 357 Gly Ser Ala His Thr Asp Thr 1 5 <210> 358 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 358 Gly Ser Ala Ile Thr Asp Thr 1 5 <210> 359 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 359 Gly Ser Ala Ile Thr Asn Thr 1 5 <210> 360 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 360 Gly Ser Ala Lys Thr Asp Thr 1 5 <210> 361 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 361 Gly Ser Ala Lys Thr Gly Thr 1 5 <210> 362 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 362 Gly Ser Ala Lys Thr Asn Thr 1 5 <210> 363 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 363 Gly Ser Ala Leu Thr Asp Thr 1 5 <210> 364 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 364 Gly Ser Ala Leu Thr Arg Thr 1 5 <210> 365 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 365 Gly Ser Ala Leu Thr Ser Leu 1 5 <210> 366 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 366 Gly Ser Ala Arg Gly Arg Val 1 5 <210> 367 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 367 Gly Ser Ala Arg Thr Asp Thr 1 5 <210> 368 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 368 Gly Ser Ala Arg Thr Gly Thr 1 5 <210> 369 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 369 Gly Ser Ala Val Thr Asn Thr 1 5 <210> 370 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 370 Gly Ser Ala Tyr Thr Asn Thr 1 5 <210> 371 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 371 Gly Ser Gly Leu Thr Asp Thr 1 5 <210> 372 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 372 Gly Ser Gly Trp Thr Gly Leu 1 5 <210> 373 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 373 Gly Ser Lys Leu Thr Asp Thr 1 5 <210> 374 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 374 Gly Ser Lys Val Ser Gly Leu 1 5 <210> 375 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 375 Gly Ser Lys Val Thr Glu Thr 1 5 <210> 376 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 376 Gly Ser Leu Lys Thr Asp Thr 1 5 <210> 377 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 377 Gly Ser Leu Lys Thr Gln Thr 1 5 <210> 378 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 378 Gly Ser Leu Leu Thr Asp Thr 1 5 <210> 379 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 379 Gly Ser Leu Leu Thr Asn Thr 1 5 <210> 380 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 380 Gly Ser Leu Arg Ser Asp Leu 1 5 <210> 381 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 381 Gly Ser Leu Arg Thr Asp Thr 1 5 <210> 382 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 382 Gly Ser Leu Arg Thr Gly Thr 1 5 <210> 383 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 383 Gly Ser Leu Arg Thr His Thr 1 5 <210> 384 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 384 Gly Ser Leu Arg Thr Asn Thr 1 5 <210> 385 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 385 Gly Ser Met Leu Thr Asp Thr 1 5 <210> 386 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 386 Gly Ser Met Arg Thr Asp Thr 1 5 <210> 387 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 387 Gly Ser Asn His Thr Asp Thr 1 5 <210> 388 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 388 Gly Ser Pro Ile Thr Asp Thr 1 5 <210> 389 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 389 Gly Ser Pro Ile Thr Asn Thr 1 5 <210> 390 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 390 Gly Ser Pro Lys Thr Asp Thr 1 5 <210> 391 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 391 Gly Ser Pro Lys Thr Gly Thr 1 5 <210> 392 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 392 Gly Ser Pro Lys Thr His Thr 1 5 <210> 393 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 393 Gly Ser Pro Leu Thr Asp Thr 1 5 <210> 394 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 394 Gly Ser Pro Leu Thr Asn Thr 1 5 <210> 395 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 395 Gly Ser Pro Arg Thr Asp Thr 1 5 <210> 396 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 396 Gly Ser Pro Arg Thr Glu Thr 1 5 <210> 397 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 397 Gly Ser Pro Arg Thr Gly Thr 1 5 <210> 398 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 398 Gly Ser Pro Arg Thr His Thr 1 5 <210> 399 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 399 Gly Ser Pro Arg Thr Asn Thr 1 5 <210> 400 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 400 Gly Ser Pro Arg Thr Gln Thr 1 5 <210> 401 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 401 Gly Ser Pro Arg Thr Ser Val 1 5 <210> 402 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 402 Gly Ser Pro Thr Thr Asp Thr 1 5 <210> 403 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 403 Gly Ser Pro Val Asn Asp Val 1 5 <210> 404 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 404 Gly Ser Pro Val Thr Asp Thr 1 5 <210> 405 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 405 Gly Ser Pro Val Thr Gly Thr 1 5 <210> 406 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 406 Gly Ser Pro Val Thr Asn Thr 1 5 <210> 407 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 407 Gly Ser Gln Leu Thr Asp Thr 1 5 <210> 408 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 408 Gly Ser Gln Leu Thr Asn Thr 1 5 <210> 409 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 409 Gly Ser Gln Arg Thr Asp Thr 1 5 <210> 410 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 410 Gly Ser Arg Ala Thr Asp Thr 1 5 <210> 411 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 411 Gly Ser Arg His Thr Asp Thr 1 5 <210> 412 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 412 Gly Ser Arg Leu Thr Asp Thr 1 5 <210> 413 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 413 Gly Ser Arg Leu Thr Asn Thr 1 5 <210> 414 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 414 Gly Ser Arg Arg Thr Asp Thr 1 5 <210> 415 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 415 Gly Ser Ser Ile Thr Asp Thr 1 5 <210> 416 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 416 Gly Ser Ser Lys Thr Asn Thr 1 5 <210> 417 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 417 Gly Ser Ser Leu Thr Asp Thr 1 5 <210> 418 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 418 Gly Ser Ser Leu Thr Asn Thr 1 5 <210> 419 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 419 Gly Ser Ser Arg Thr Asp Thr 1 5 <210> 420 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 420 Gly Ser Ser Arg Thr Asn Thr 1 5 <210> 421 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 421 Gly Ser Ser Val Thr Asn Thr 1 5 <210> 422 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 422 Gly Ser Ser Val Thr Ser Thr 1 5 <210> 423 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 423 Gly Ser Thr Leu Thr Asp Thr 1 5 <210> 424 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 424 Gly Ser Thr Leu Thr Asn Thr 1 5 <210> 425 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 425 Gly Ser Thr Met Thr Gln Thr 1 5 <210> 426 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 426 Gly Ser Thr Arg Thr Asp Thr 1 5 <210> 427 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 427 Gly Ser Thr Arg Thr Asn Thr 1 5 <210> 428 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 428 Gly Ser Val Ile Thr Gly Thr 1 5 <210> 429 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 429 Gly Ser Val Ile Thr Asn Thr 1 5 <210> 430 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 430 Gly Ser Val Lys Thr Asp Thr 1 5 <210> 431 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 431 Gly Ser Val Leu Thr Asp Thr 1 5 <210> 432 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 432 Gly Ser Val Leu Thr Gly Thr 1 5 <210> 433 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 433 Gly Ser Val Leu Thr Asn Thr 1 5 <210> 434 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 434 Gly Ser Val Arg Thr Asp Thr 1 5 <210> 435 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 435 Gly Ser Val Arg Thr Glu Ser 1 5 <210> 436 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 436 Gly Ser Val Arg Thr Gly Thr 1 5 <210> 437 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 437 Gly Ser Val Arg Thr Asn Thr 1 5 <210> 438 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 438 Gly Ser Val Thr Thr Asp Thr 1 5 <210> 439 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 439 Gly Ser Trp Gly Ser Gly Ile 1 5 <210> 440 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 440 Gly Ser Trp Leu Thr Asp Thr 1 5 <210> 441 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 441 Gly Ser Tyr Leu Thr Asn Thr 1 5 <210> 442 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 442 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 443 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 443 Ala Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 444 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 444 Asn Ala Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 445 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 445 Asn Gly Ala Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 446 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 446 Asn Gly Ser Ala Ala Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 447 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 447 Asn Gly Ser Ala Leu Ala Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 448 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 448 Asn Gly Ser Ala Leu Thr Ala Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 449 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 449 Asn Gly Ser Ala Leu Thr Asn Ala Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 450 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 450 Asn Gly Ser Ala Leu Thr Asn Thr Ala Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 451 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 451 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Ala 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 452 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 452 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Ala Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 453 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 453 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Ala Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 454 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 454 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Ala Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 455 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 455 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Ala Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 456 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 456 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Ala Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 457 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 457 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Ala Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 458 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 458 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ala Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 459 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 459 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Ala Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 460 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 460 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Ala Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 461 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 461 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Ala Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 462 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 462 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Ala             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 463 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 463 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Ala Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 464 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 464 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Ala Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 465 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 465 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Ala Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg     50 55 <210> 466 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 466 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Ala Cys Arg     50 55 <210> 467 <211> 58 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 467 Asn Gly Ser Ala Leu Thr Asn Thr Trp Val Asp Met Thr Gly Ala Arg 1 5 10 15 Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly             20 25 30 Gly Phe Gly Val Phe Gly Glu Asn Cys Ala Val Leu Ser Gly Ala Ala         35 40 45 Asn Gly Lys Trp Phe Asp Lys Arg Cys Ala     50 55 <210> 468 <211> 17 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (4) .. (11) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (14) .. (16) <223> Xaa can be any naturally occurring amino acid <400> 468 Asn Trp Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Gly Xaa Xaa Xaa 1 5 10 15 Asn      <210> 469 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 469 Ala Ala Glu Gly Thr 1 5 <210> 470 <211> 21 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (1) (2) <223> n is a, c, g, or t <220> <221> misc_feature (222) (3) .. (3) <223> k is g or t <220> <221> misc_feature <222> (4) <223> n is a, c, g, or t <220> <221> misc_feature <222> (6) <223> k is g or t <220> <221> misc_feature (222) (7) .. (8) <223> n is a, c, g, or t <220> <221> misc_feature (222) (9) .. (9) <223> k is g or t <220> <221> misc_feature (222) (10) .. (11) <223> n is a, c, g, or t <220> <221> misc_feature (222) (12) .. (12) <223> k is g or t <220> <221> misc_feature (222) (13) .. (14) <223> n is a, c, g, or t <220> <221> misc_feature (222) (15) .. (15) <223> k is g or t <220> <221> misc_feature <222> (16) .. (17) <223> n is a, c, g, or t <220> <221> misc_feature (222) (18) .. (18) <223> k is g or t <220> <221> misc_feature (222) (19) .. (20) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (21) <223> k is g or t <400> 470 nnknnknnkn nknnknnknn k 21 <210> 471 <211> 15 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (1) (2) <223> n is a, c, g, or t <220> <221> misc_feature (222) (3) .. (3) <223> k is g or t <220> <221> misc_feature <222> (4) <223> n is a, c, g, or t <220> <221> misc_feature <222> (6) <223> k is g or t <220> <221> misc_feature (222) (7) .. (8) <223> n is a, c, g, or t <220> <221> misc_feature (222) (9) .. (9) <223> k is g or t <220> <221> misc_feature (222) (10) .. (11) <223> n is a, c, g, or t <220> <221> misc_feature (222) (12) .. (12) <223> k is g or t <220> <221> misc_feature (222) (13) .. (14) <223> n is a, c, g, or t <220> <221> misc_feature (222) (15) .. (15) <223> k is g or t <400> 471 nnknnknnkn nknnk 15 <210> 472 <211> 12 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (1) (2) <223> n is a, c, g, or t <220> <221> misc_feature (222) (3) .. (3) <223> k is g or t <220> <221> misc_feature <222> (4) <223> n is a, c, g, or t <220> <221> misc_feature <222> (6) <223> k is g or t <220> <221> misc_feature (222) (7) .. (8) <223> n is a, c, g, or t <220> <221> misc_feature (222) (9) .. (9) <223> k is g or t <220> <221> misc_feature (222) (10) .. (11) <223> n is a, c, g, or t <220> <221> misc_feature (222) (12) .. (12) <223> k is g or t <400> 472 nnknnknnkn nk 12 <210> 473 <211> 7 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 473 Asp Met Ala Ala Glu Gly Thr 1 5 <210> 474 <211> 21 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (1) (2) <223> n is a, c, g, or t <220> <221> misc_feature (222) (3) .. (3) <223> s is g or c <220> <221> misc_feature <222> (4) <223> n is a, c, g, or t <220> <221> misc_feature <222> (6) <223> s is g or c <220> <221> misc_feature (222) (7) .. (8) <223> n is a, c, g, or t <220> <221> misc_feature (222) (9) .. (9) <223> s is g or c <220> <221> misc_feature (222) (10) .. (11) <223> n is a, c, g, or t <220> <221> misc_feature (222) (12) .. (12) <223> s is g or c <220> <221> misc_feature (222) (13) .. (14) <223> n is a, c, g, or t <220> <221> misc_feature (222) (15) .. (15) <223> s is g or c <220> <221> misc_feature <222> (16) .. (17) <223> n is a, c, g, or t <220> <221> misc_feature (222) (18) .. (18) <223> s is g or c <220> <221> misc_feature (222) (19) .. (20) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (21) <223> s is g or c <400> 474 nnsnnsnnsn nsnnsnnsnn s 21 <210> 475 <211> 15 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (1) (2) <223> n is a, c, g, or t <220> <221> misc_feature (222) (3) .. (3) <223> s is g or c <220> <221> misc_feature <222> (4) <223> n is a, c, g, or t <220> <221> misc_feature <222> (6) <223> s is g or c <220> <221> misc_feature (222) (7) .. (8) <223> n is a, c, g, or t <220> <221> misc_feature (222) (9) .. (9) <223> s is g or c <220> <221> misc_feature (222) (10) .. (11) <223> n is a, c, g, or t <220> <221> misc_feature (222) (12) .. (12) <223> s is g or c <220> <221> misc_feature (222) (13) .. (14) <223> n is a, c, g, or t <220> <221> misc_feature (222) (15) .. (15) <223> s is g or c <400> 475 nnsnnsnnsn nsnns 15 <210> 476 <211> 18 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (1) (2) <223> n is a, c, g, or t <220> <221> misc_feature (222) (3) .. (3) <223> k is g or t <220> <221> misc_feature <222> (4) <223> n is a, c, g, or t <220> <221> misc_feature <222> (6) <223> k is g or t <220> <221> misc_feature (222) (7) .. (8) <223> n is a, c, g, or t <220> <221> misc_feature (222) (9) .. (9) <223> k is g or t <220> <221> misc_feature (222) (10) .. (11) <223> n is a, c, g, or t <220> <221> misc_feature (222) (12) .. (12) <223> k is g or t <220> <221> misc_feature (222) (13) .. (14) <223> n is a, c, g, or t <220> <221> misc_feature (222) (15) .. (15) <223> k is g or t <220> <221> misc_feature <222> (16) .. (17) <223> n is a, c, g, or t <220> <221> misc_feature (222) (18) .. (18) <223> k is g or t <400> 476 nnknnknnkn nknnknnk 18 <210> 477 <211> 13 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 477 Thr Glu Ile Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu 1 5 10 <210> 478 <211> 13 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> MISC_FEATURE (222) (1) .. (8) <223> Xaa represents a random amino acid encoded by the nucleotide        sequence nnk (n is a, c, g, or t; k is g or t) <220> <221> MISC_FEATURE (222) (11) .. (13) <223> Xaa represents a random amino acid encoded by the nucleotide        sequence nnk (n is a, c, g, or t; k is g or t) <400> 478 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Gly Xaa Xaa Xaa 1 5 10 <210> 479 <211> 6 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 479 Glu Thr Glu Ile Thr Ala 1 5 <210> 480 <211> 24 <212> DNA <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature <222> (1) (2) <223> n is a, c, g, or t <220> <221> misc_feature (222) (3) .. (3) <223> k is g or t <220> <221> misc_feature <222> (4) <223> n is a, c, g, or t <220> <221> misc_feature <222> (6) <223> k is g or t <220> <221> misc_feature (222) (7) .. (8) <223> n is a, c, g, or t <220> <221> misc_feature (222) (9) .. (9) <223> k is g or t <220> <221> misc_feature (222) (10) .. (11) <223> n is a, c, g, or t <220> <221> misc_feature (222) (12) .. (12) <223> k is g or t <220> <221> misc_feature (222) (13) .. (14) <223> n is a, c, g, or t <220> <221> misc_feature (222) (15) .. (15) <223> k is g or t <220> <221> misc_feature <222> (16) .. (17) <223> n is a, c, g, or t <220> <221> misc_feature (222) (18) .. (18) <223> k is g or t <220> <221> misc_feature (222) (19) .. (20) <223> n is a, c, g, or t <220> <221> misc_feature &Lt; 222 > (21) <223> k is g or t <220> <221> misc_feature &Lt; 222 > (22) <223> n is a, c, g, or t <220> <221> misc_feature (222) (24) .. (24) <223> k is g or t <400> 480 nnknnknnkn nknnknnknn knnk 24 <210> 481 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 481 Thr Glu Ile Thr Ala 1 5 <210> 482 <211> 30 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <220> <221> misc_feature (222) (9) .. (13) <223> Xaa can be any naturally occurring amino acid <220> <221> misc_feature <222> (16) .. (16) <223> Xaa can be any naturally occurring amino acid <220> <221> MISC_FEATURE <222> (19). (19) &Lt; 223 > Xaa is Phe or Leu <220> <221> MISC_FEATURE &Lt; 222 > (21) <223> Xaa is Phe, Tyr, Val, or Asp <220> <221> MISC_FEATURE (222) (22) .. (22) <223> Xaa is Phe, Trp, Leu, or Cys <400> 482 Arg Ile Ala Tyr Lys Asn Trp Glu Xaa Xaa Xaa Xaa Xaa Gln Pro Xaa 1 5 10 15 Gly Gly Xaa Gly Xaa Xaa Gly Glu Asn Cys Ala Val Leu Ser             20 25 30 <210> 483 <211> 125 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 483 His Met Lys Cys Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu 1 5 10 15 Ala Ser Glu Asp Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln             20 25 30 Thr Gly Ser Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val         35 40 45 Gly Asn Glu Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu     50 55 60 Gly Thr Trp Val Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp 65 70 75 80 Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu Asn Cys                 85 90 95 Ala Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys             100 105 110 Arg Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val         115 120 125 <210> 484 <211> 114 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 484 Gly Asn Lys Phe Phe Leu Thr Asn Gly Glu Ile Met Thr Phe Glu Lys 1 5 10 15 Val Lys Ala Leu Cys Val Lys Phe Gln Ala Ser Val Ala Thr Pro Arg             20 25 30 Asn Ala Ala Glu Asn Gly Ala Ile Gln Asn Leu Ile Lys Glu Glu Ala         35 40 45 Phe Leu Gly Ile Thr Asp Glu Lys Thr Glu Gly Gln Phe Val Asp Leu     50 55 60 Thr Gly Asn Arg Leu Thr Tyr Thr Asn Trp Asn Glu Gly Glu Pro Asn 65 70 75 80 Asn Ala Gly Ser Asp Glu Asp Cys Val Leu Leu Leu Lys Asn Gly Gln                 85 90 95 Trp Asn Asp Val Pro Cys Ser Thr Ser His Leu Ala Val Cys Glu Phe             100 105 110 Pro yle          <210> 485 <211> 112 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 485 Lys Tyr Phe Met Ser Ser Val Arg Arg Met Pro Leu Asn Arg Ala Lys 1 5 10 15 Ala Leu Cys Ser Glu Leu Gln Gly Thr Val Ala Thr Pro Arg Asn Ala             20 25 30 Glu Glu Asn Arg Ala Ile Gln Asn Val Ala Lys Asp Val Ala Phe Leu         35 40 45 Gly Ile Thr Asp Gln Arg Thr Glu Asn Val Phe Glu Asp Leu Thr Gly     50 55 60 Asn Arg Val Arg Tyr Thr Asn Trp Asn Glu Gly Glu Pro Asn Asn Val 65 70 75 80 Gly Ser Gly Glu Asn Cys Val Val Leu Leu Thr Asn Gly Lys Trp Asn                 85 90 95 Asp Val Pro Cys Ser Asp Ser Phe Leu Val Val Cys Glu Phe Ser Asp             100 105 110 <210> 486 <211> 115 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 486 Gly Glu Lys Ile Phe Lys Thr Ala Gly Phe Val Lys Pro Phe Thr Glu 1 5 10 15 Ala Gln Leu Leu Cys Thr Gln Ala Gly Gly Gln Leu Ala Ser Pro Arg             20 25 30 Ser Ala Ala Glu Asn Ala Ala Leu Gln Gln Leu Val Val Ala Lys Asn         35 40 45 Glu Ala Ala Phe Leu Ser Met Thr Asp Ser Lys Thr Glu Gly Lys Phe     50 55 60 Thr Tyr Pro Thr Gly Glu Ser Leu Val Tyr Ser Asn Trp Ala Pro Gly 65 70 75 80 Glu Pro Asn Asp Asp Gly Gly Ser Glu Asp Cys Val Glu Ile Phe Thr                 85 90 95 Asn Gly Lys Trp Asn Asp Arg Ala Cys Gly Glu Lys Arg Leu Val Val             100 105 110 Cys glu phe         115 <210> 487 <211> 114 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 487 Gly Lys Lys Phe Phe Val Thr Asn His Glu Arg Met Pro Phe Ser Lys 1 5 10 15 Val Lys Ala Leu Cys Ser Glu Leu Arg Gly Thr Val Ala Ile Pro Arg             20 25 30 Asn Ala Glu Glu Asn Lys Ala Ile Gln Glu Val Ala Lys Thr Ser Ala         35 40 45 Phe Leu Gly Ile Thr Asp Glu Val Thr Glu Gly Gln Phe Met Tyr Val     50 55 60 Thr Gly Gly Arg Leu Thr Tyr Ser Asn Trp Lys Lys Asp Glu Pro Asn 65 70 75 80 Asp Val Gly Ser Gly Glu Asp Cys Val Thr Ile Val Asp Asn Gly Leu                 85 90 95 Trp Asn Asp Val Ser Cys Gln Ala Ser His Thr Ala Val Cys Glu Phe             100 105 110 Pro ala          <210> 488 <211> 114 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 488 Gly Asp Lys Val Phe Ser Thr Asn Gly Gln Ser Val Asn Phe Asp Thr 1 5 10 15 Ile Lys Glu Met Cys Thr Arg Ala Gly Gly Asn Ile Ala Val Pro Arg             20 25 30 Thr Pro Glu Glu Asn Glu Ala Ile Ala Ser Ile Ala Lys Lys Tyr Asn         35 40 45 Asn Tyr Val Tyr Leu Gly Met Ile Glu Asp Gln Thr Pro Gly Asp Phe     50 55 60 His Tyr Leu Asp Gly Ala Ser Val Ser Tyr Thr Asn Trp Tyr Pro Gly 65 70 75 80 Glu Pro Arg Gly Gln Gly Lys Glu Lys Cys Val Glu Met Tyr Thr Asp                 85 90 95 Gly Thr Trp Asn Asp Arg Gly Cys Leu Gln Tyr Arg Leu Ala Val Cys             100 105 110 Glu phe          <210> 489 <211> 119 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 489 Thr Lys Phe Gln Gly His Cys Tyr Arg His Phe Pro Asp Arg Glu Thr 1 5 10 15 Trp Val Asp Ala Glu Arg Arg Cys Arg Glu Gln Gln Ser His Leu Ser             20 25 30 Ser Ile Val Thr Pro Glu Glu Gln Glu Phe Val Asn Lys Asn Ala Gln         35 40 45 Asp Tyr Gln Trp Ile Gly Leu Asn Asp Arg Thr Ile Glu Gly Asp Phe     50 55 60 Arg Trp Ser Asp Gly His Ser Leu Gln Phe Glu Lys Trp Arg Pro Asn 65 70 75 80 Gln Pro Asp Asn Phe Phe Ala Thr Gly Glu Asp Cys Val Val Met Ile                 85 90 95 Trp His Glu Arg Gly Glu Trp Asn Asp Val Pro Cys Asn Tyr Gln Leu             100 105 110 Pro Phe Thr Cys Lys Lys Gly         115 <210> 490 <211> 127 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 490 Ser His Cys Tyr Ala Leu Phe Leu Ser Pro Lys Ser Trp Thr Asp Ala 1 5 10 15 Asp Leu Ala Cys Gln Lys Arg Pro Ser Gly Asn Leu Val Ser Val Leu             20 25 30 Ser Gly Ala Glu Gly Ser Phe Val Ser Ser Leu Val Lys Ser Ile Gly         35 40 45 Asn Ser Tyr Ser Tyr Val Trp Ile Gly Leu His Asp Pro Thr Gln Gly     50 55 60 Thr Glu Pro Asn Gly Glu Gly Trp Glu Trp Ser Ser Ser Asp Val Met 65 70 75 80 Asn Tyr Phe Ala Trp Glu Arg Asn Pro Ser Thr Ile Ser Ser Pro Gly                 85 90 95 His Cys Ala Ser Leu Ser Arg Ser Thr Ala Phe Leu Arg Trp Lys Asp             100 105 110 Tyr Asn Cys Asn Val Arg Leu Pro Tyr Val Cys Lys Phe Thr Asp         115 120 125 <210> 491 <211> 119 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 491 Asp Lys Cys Tyr Tyr Phe Ser Val Glu Lys Glu Ile Phe Glu Asp Ala 1 5 10 15 Lys Leu Phe Cys Glu Asp Lys Ser Ser His Leu Val Phe Ile Asn Thr             20 25 30 Arg Glu Glu Gln Gln Trp Ile Lys Lys Gln Met Val Gly Arg Glu Ser         35 40 45 His Trp Ile Gly Leu Thr Asp Ser Glu Arg Glu Asn Glu Trp Lys Trp     50 55 60 Leu Asp Gly Thr Ser Pro Asp Tyr Lys Asn Trp Lys Ala Gly Gln Pro 65 70 75 80 Asp Asn Trp Gly His Gly His Gly Pro Gly Glu Asp Cys Ala Gly Leu                 85 90 95 Ile Tyr Ala Gly Gln Trp Asn Asp Phe Gln Cys Glu Asp Val Asn Asn             100 105 110 Phe Ile Cys Glu Lys Asp Arg         115 <210> 492 <211> 120 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 492 Asp Lys Cys Tyr Tyr Phe Ser Leu Glu Lys Glu Ile Phe Glu Asp Ala 1 5 10 15 Lys Leu Phe Cys Glu Asp Lys Ser Ser His Leu Val Phe Ile Asn Ser             20 25 30 Arg Glu Glu Gln Gln Trp Ile Lys Lys His Thr Val Gly Arg Glu Ser         35 40 45 His Trp Ile Gly Leu Thr Asp Ser Glu Gln Glu Ser Glu Trp Lys Trp     50 55 60 Leu Asp Gly Ser Pro Val Asp Tyr Lys Asn Trp Lys Ala Gly Gln Pro 65 70 75 80 Asp Asn Trp Gly Ser Gly His Gly Pro Gly Glu Asp Cys Ala Gly Leu                 85 90 95 Ile Tyr Ala Gly Gln Trp Asn Asp Phe Gln Cys Asp Glu Ile Asn Asn             100 105 110 Phe Ile Cys Glu Lys Glu Arg Glu         115 120 <210> 493 <211> 121 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 493 Gly Asn Cys Tyr Phe Met Ser Asn Ser Gln Arg Asn Trp His Asp Ser 1 5 10 15 Val Thr Ala Cys Gln Glu Val Arg Ala Gln Leu Val Val Ile Lys Thr             20 25 30 Ala Glu Glu Gln Asn Phe Leu Gln Leu Gln Thr Ser Arg Ser Asn Arg         35 40 45 Phe Ser Trp Met Gly Leu Ser Asp Leu Asn Gln Glu Gly Thr Trp Gln     50 55 60 Trp Val Asp Gly Ser Pro Leu Ser Pro Ser Phe Gln Arg Tyr Trp Asn 65 70 75 80 Ser Gly Glu Pro Asn Asn Ser Gly Asn Glu Asp Cys Ala Glu Phe Ser                 85 90 95 Gly Ser Gly Trp Asn Asp Asn Arg Cys Asp Val Asp Asn Tyr Trp Ile             100 105 110 Cys Lys Lys Pro Ala Ala Cys Phe Arg         115 120 <210> 494 <211> 8 <212> PRT <213> Artificial Sequence <220> <223> Synthetic <400> 494 Pro Ala Arg Gln Pro Gln Pro Gly 1 5

Claims (34)

삼합체화 도메인(trimerizing domain) 및 IL-23 이종이합체 수용체를 활성화시키지 않으면서 인간 IL-23R에 결합하는 적어도 하나의 폴리펩티드 서열을 포함하는 폴리펩티드.A polypeptide comprising a trimerizing domain and at least one polypeptide sequence that binds human IL-23R without activating the IL-23 heterodimeric receptor. 제 1항에 있어서, 상기 폴리펩티드가 인간 IL-12Rβ1 또는 인간 IL-12Rβ2 중 적어도 하나에 결합하지 않는 폴리펩티드.The polypeptide of claim 1, wherein said polypeptide does not bind at least one of human IL-12Rβ1 or human IL-12Rβ2. 제 1항에 있어서, 상기 폴리펩티드가 인간 IL-23R에 대한 결합에 대해 자연 인간 IL-23과 경쟁하는 폴리펩티드.The polypeptide of claim 1, wherein said polypeptide competes with natural human IL-23 for binding to human IL-23R. 제 1항에 있어서, 상기 삼합체화 도메인이 위치 26, 30, 33, 36, 37, 40, 31, 42, 45, 46, 47, 48, 49, 50 및 51에서 5개 이하의 아미노산 치환을 갖는 인간 테트라넥틴 삼합체화 도메인(SEQ ID NO:108)의 폴리펩티드를 포함하고, 3개의 삼합체화 도메인이 삼합체 복합체를 형성하는 폴리펩티드.The method of claim 1, wherein the trimerization domain has up to 5 amino acid substitutions at positions 26, 30, 33, 36, 37, 40, 31, 42, 45, 46, 47, 48, 49, 50, and 51. A polypeptide comprising a polypeptide of human tetranectin trimerization domain (SEQ ID NO: 108), wherein the three trimerization domains form a trimer complex. 제 1항에 있어서, 상기 삼합체화 도메인이 hTRAF3 [SEQ ID NO:_], hMBP [SEQ ID NO:_], hSPC300 [SEQ ID NO:_], hNEMO [SEQ ID NO:_], h큐빌린(hcubilin) [SEQ ID NO:_], h트롬보스폰딘(hThrombospondin) [SEQ ID NO:_], 및 인간 SP-D의 넥(neck) 영역 [SEQ ID NO:_], 소 SP-D의 넥 영역 [SEQ ID NO:_], 래트 SP-D의 넥 영역 [SEQ ID NO:_], 소 콘글루티닌의 넥 영역 [SEQ ID NO:_]; 소 콜렉틴의 넥 영역 [SEQ ID NO:_]; 및 인간 SP-D의 넥 영역 [SEQ ID NO:_]으로 구성된 군으로부터 선택된 삼합체화 폴리펩티드를 포함하는 폴리펩티드.The method of claim 1, wherein the trimerization domain is hTRAF3 [SEQ ID NO: _], hMBP [SEQ ID NO: _], hSPC300 [SEQ ID NO: _], hNEMO [SEQ ID NO: _], hcubiline (hcubilin) [SEQ ID NO: _], hThrombospondin [SEQ ID NO: _], and the neck region [SEQ ID NO: _] of human SP-D, of bovine SP-D Neck area [SEQ ID NO: _], neck area of rat SP-D [SEQ ID NO: _], neck area of bovine conglutinin [SEQ ID NO: _]; Neck region of bovine collector [SEQ ID NO: _]; And a trimerized polypeptide selected from the group consisting of neck region [SEQ ID NO: _] of human SP-D. 제 1항에 있어서, 인간 IL-23R이 SEQ ID NO:5의 서열을 포함하는 폴리펩티드.The polypeptide of claim 1, wherein human IL-23R comprises the sequence of SEQ ID NO: 5. 제 1항에 있어서, IL-23R에 결합하는 상기 적어도 하나의 폴리펩티드가 삼합체화 도메인의 N-말단 및 C-말단 중 하나에 연결되고, 상기 N-말단 및 C-말단의 나머지에 위치된 염증의 조정자를 추가로 포함하는 폴리펩티드.The method of claim 1, wherein the at least one polypeptide that binds IL-23R is linked to one of the N-terminus and C-terminus of the trimerization domain and is located at the rest of the N-terminus and C-terminus. A polypeptide further comprising a modulator. 제 1항에 있어서, IL-23R에 결합하는 상기 적어도 하나의 폴리펩티드가 C-타입 렉틴 유사 도메인(CTLD)을 포함하고, CTLD의 루프 세그먼트 A 또는 루프 세그먼트 B 중 루프 1, 2, 3 또는 4 중 하나가 IL-23에 결합하는 폴리펩티드 서열을 포함하는 폴리펩티드.The method of claim 1, wherein the at least one polypeptide that binds IL-23R comprises a C-type lectin-like domain (CTLD) and comprises one of loops 1, 2, 3, or 4 of loop segment A or loop segment B of CTLD. A polypeptide comprising a polypeptide sequence, one of which binds to IL-23. 제 7항에 있어서, 상기 CTLD의 폴리펩티드 서열이 SEQ ID NO:133, 134, 135, 167, 137, 138, 139, 140, 및 141로 구성된 군으로부터 선택되는 폴리펩티드.8. The polypeptide of claim 7 wherein the polypeptide sequence of the CTLD is selected from the group consisting of SEQ ID NOs: 133, 134, 135, 167, 137, 138, 139, 140, and 141. 제 1항에 있어서, IL-23에 결합하는 상기 폴리펩티드가 삼합체화 도메인의 N-말단 및 C-말단 중 하나에 연결되고, 상기 N-말단 및 C-말단의 나머지에 위치된 염증의 조정자를 추가로 포함하는 폴리펩티드.The method of claim 1, wherein the polypeptide that binds IL-23 is linked to one of the N-terminus and C-terminus of the trimerization domain, and adds a modulator of inflammation located at the rest of the N-terminus and C-terminus. Polypeptides, including. 제 1항에 있어서, N-말단 및 C-말단의 각각에 연결된 IL-23에 결합하는 폴리펩티드를 가지며, N-말단의 폴리펩티드가 C-말단의 폴리펩티드와 동일하거나 상이한 폴리펩티드.The polypeptide of claim 1, having a polypeptide that binds to IL-23 linked to each of the N-terminus and C-terminus, wherein the N-terminal polypeptide is the same as or different from the C-terminal polypeptide. 제 1항에 있어서, 상기 폴리펩티드가 융합 단백질인 폴리펩티드.The polypeptide of claim 1, wherein said polypeptide is a fusion protein. 제 1항에 있어서, IL-23R에 결합하는 상기 폴리펩티드가 삼합체화 도메인의 N-말단 및 C-말단 중 하나에 위치하고, 상기 N-말단 및 C-말단의 나머지에 종양 관련 항원(TAA) 또는 종양 특이적 항원(TSA)에 결합하는 폴리펩티드 서열을 추가로 포함하는 폴리펩티드.The method of claim 1, wherein the polypeptide that binds IL-23R is located at one of the N-terminus and C-terminus of the trimerization domain and has a tumor associated antigen (TAA) or tumor at the rest of the N-terminus and C-terminus. A polypeptide further comprising a polypeptide sequence that binds to a specific antigen (TSA). 제 1항에 있어서, 폴리펩티드에 공유적으로 부착된 치료제를 추가로 포함하는 폴리펩티드.The polypeptide of claim 1, further comprising a therapeutic agent covalently attached to the polypeptide. 제 1항의 3개의 폴리펩티드를 포함하는 삼합체 복합체.A trimer complex comprising the three polypeptides of claim 1. 제 15항에 있어서, 상기 삼합체화 도메인이 테트라넥틴 삼합체화 구조 엘리먼트인 삼합체 복합체.16. The trimer complex of claim 15 wherein said trimerization domain is a tetranectin trimerization structural element. IL-23R을 발현하는 세포와 제 15항의 삼합체 복합체를 접촉시키는 것을 포함하는, IL-23R을 발현하는 세포에서의 IL-23에 의한 IL-23R의 활성화를 방지하는 방법.A method for preventing activation of IL-23R by IL-23 in a cell expressing IL-23R, comprising contacting the cell expressing IL-23R with the trimer complex of claim 15. 제 16항의 삼합체 복합체 및 적어도 하나의 약학적으로 허용되는 부형제를 포함하는 약학적 조성물.A pharmaceutical composition comprising the trimer complex of claim 16 and at least one pharmaceutically acceptable excipient. 제 18항의 약학적 조성물을 동물에 투여하는 것을 포함하는, 대상체의 면역 장애를 치료하는 방법.A method of treating an immune disorder in a subject comprising administering the pharmaceutical composition of claim 18 to the animal. 제 19항에 있어서, 염증의 조정자를 동시에 또는 순차적으로 대상체에 투여하는 것을 추가로 포함하는 방법.The method of claim 19, further comprising administering a modulator of inflammation to the subject simultaneously or sequentially. 제 18항의 약학적 조성물을 암을 치료할 필요가 있는 대상체에 투여하는 것을 포함하는, 동물의 암을 치료하는 방법.A method of treating cancer in an animal comprising administering the pharmaceutical composition of claim 18 to a subject in need thereof. 제 21항에 있어서, 화학치료제 또는 세포독성제 중 적어도 하나를 동시에 또는 순차적으로 동물에 투여하는 것을 추가로 포함하는 방법.The method of claim 21, further comprising administering at least one of the chemotherapeutic or cytotoxic agent to the animal simultaneously or sequentially. a) IL-23R에 결합하는 제 1 폴리펩티드를 선택하는 단계; 및
b) 상기 제 1 폴리펩티드와 다합체화 도메인의 N-말단 또는 C-말단 중 하나를 융합시키는 단계를 포함하는, 제 1항의 폴리펩티드를 제조하는 방법.
a) selecting a first polypeptide that binds IL-23R; And
b) fusing the first polypeptide with either the N-terminus or the C-terminus of the multimerization domain.
제 23항에 있어서,
a) 염증의 조정자인 제 2 폴리펩티드 서열을 선택하는 단계; 및
b) 상기 제 2 폴리펩티드와 상기 다합체화 도메인의 N-말단 또는 C-말단의 나머지를 융합시키는 단계를 포함하는 방법.
24. The method of claim 23,
a) selecting a second polypeptide sequence that is a modulator of inflammation; And
b) fusing said second polypeptide with the rest of the N-terminus or C-terminus of said multimerization domain.
제 21항에 있어서, 상기 단계 a)에서 상기 폴리펩티드가 IL-12Rβ1 또는 IL-12Rβ2 중 적어도 하나에 결합하지 않도록 선택되는 방법.The method of claim 21, wherein in step a) the polypeptide is selected to not bind to at least one of IL-12Rβ1 or IL-12Rβ2. 제 23항에 따라 제조된 3개의 폴리펩티드를 삼합체화시키는 것을 포함하는, IL-23R을 발현하는 세포에서 IL-23R의 활성화를 방지하는 폴리펩티드 복합체를 제조하는 방법.A method of preparing a polypeptide complex that prevents activation of IL-23R in a cell expressing IL-23R, comprising trimerizing three polypeptides prepared according to claim 23. a) 적어도 하나의 무작위화된 루프 영역을 포함하는 CTLD를 포함하는 폴리펩티드의 라이브러리를 생성시키는 단계; 및
b) 상기 라이브러리로부터 IL-23R에 결합하나, IL-12Rβ1 또는 IL-12Rβ2 중 적어도 하나에 결합하지 않는 제 1 폴리펩티드를 선택하는 단계를 포함하는, 면역 관련 장애를 매개하는 폴리펩티드를 제조하는 방법.
a) generating a library of polypeptides comprising CTLDs comprising at least one randomized loop region; And
b) selecting from said library a first polypeptide that binds to IL-23R but not to at least one of IL-12Rβ1 or IL-12Rβ2.
제 27항에 있어서, (c) 다합체화 도메인의 N-말단 또는 C-말단에 선택된 폴리펩티드를 부착시키는 단계를 추가로 포함하는 방법.The method of claim 27, further comprising (c) attaching the selected polypeptide to the N-terminus or C-terminus of the multimerization domain. 자연 IL-23R에 대한 결합에 대해 자연 인간 IL-23과 경쟁하는 폴리펩티드로서, 상기 폴리펩티드가 인간 IL-23R을 활성화시키지 않고, IL-12β1 또는 IL-12β2 중 적어도 하나에 결합하지 않는 폴리펩티드.A polypeptide that competes with natural human IL-23 for binding to natural IL-23R, wherein the polypeptide does not activate human IL-23R and does not bind to at least one of IL-12β1 or IL-12β2. 제 30항에 있어서, 상기 폴리펩티드가 IL-23R에 대한 결합에 대해 루프 세그먼트 A 또는 루프 세그먼트 B 중 루프 1, 2, 3 또는 4 중 하나에서 변형된 CTLD인 폴리펩티드.31. The polypeptide of claim 30, wherein the polypeptide is CTLD modified in one of loops 1, 2, 3 or 4 of loop segment A or loop segment B for binding to IL-23R. 제 30항에 있어서, SEQ ID NO:133, 134, 135, 167, 137, 138, 139, 140 및 141로 구성된 군으로부터 선택된 폴리펩티드를 포함하는 폴리펩티드.31. The polypeptide of claim 30 comprising a polypeptide selected from the group consisting of SEQ ID NOs: 133, 134, 135, 167, 137, 138, 139, 140 and 141. 제 1항의 폴리펩티드를 포함하는 폴리펩티드를 엔코딩하는 분리된 폴리누클레오티드.An isolated polynucleotide encoding a polypeptide comprising the polypeptide of claim 1. 제 32항의 폴리누클레오티드를 포함하는 벡터.A vector comprising the polynucleotide of claim 32. 제 34항의 벡터를 포함하는 숙주 세포.A host cell comprising the vector of claim 34.
KR1020127012030A 2009-10-09 2010-02-10 Polypeptides that bind il-23r KR20120117977A (en)

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US6027109A 2009-10-09 2009-10-09
US12/577,067 US20100105620A1 (en) 2008-10-10 2009-10-09 Polypeptides that bind Trail-R1 and Trail-R2
US12/577,067 2009-10-09
WOPCT/US2009/060271 2009-10-09
US12/703,752 2010-02-10
US12/703,752 US20110086770A1 (en) 2009-10-09 2010-02-10 Combinatorial Libraries Based on C-type Lectin-like Domain

Publications (1)

Publication Number Publication Date
KR20120117977A true KR20120117977A (en) 2012-10-25

Family

ID=47285615

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020127012030A KR20120117977A (en) 2009-10-09 2010-02-10 Polypeptides that bind il-23r

Country Status (1)

Country Link
KR (1) KR20120117977A (en)

Similar Documents

Publication Publication Date Title
CN112673092B (en) Engineered immunostimulatory bacterial strains and uses thereof
KR20170108946A (en) Chimeric antigen receptors targeting fc receptor-like 5 and uses thereof
KR20190100200A (en) Allogeneic tumor cell vaccine
CN113271955A (en) Enhanced systems for cell-mediated oncolytic viral therapy
KR20200064980A (en) Microorganisms programmed to produce immunomodulators and anti-cancer drugs in tumor cells
CN107580503B (en) Combination of a bactericidal agent with a lysosome-philic basifying agent for the treatment of bacterial infections
PL208712B1 (en) Soluble t cell receptor
CN110290814A (en) For treating the gene therapy of hepatolenticular degeneration
TW202237826A (en) Gene-edited natural killer cells
CN101610793A (en) Long lasting drug formulations
CN112779291B (en) Method for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage, fast growth, high reproductive capacity and resistance to series epidemic diseases and application thereof
CN112779292B (en) Method for constructing high-quality pig nuclear transplantation donor cells with high lean meat percentage and rapid growth and capable of resisting blue ear diseases and serial diarrhea diseases and application of donor cells
KR20220103928A (en) Compositions and methods for in vitro activation and expansion of serial killer T cell populations and passive immunization of cancer patients with tumor apoptotic cells
KR20210005146A (en) Expression of human FOXP3 in gene edited T cells
JP2619232B2 (en) Gene system encoding human immunoglobulin E binding factor active polypeptide
CN116083398B (en) Isolated Cas13 proteins and uses thereof
CN111979240B (en) Gene expression regulation method and system based on Type I-F CRISPR/Cas
CN112522260B (en) CRISPR system and application thereof in preparing TTN gene mutation dilated cardiomyopathy clone pig nuclear donor cells
CN112877362A (en) Gene editing system for constructing high-quality porcine nuclear transplantation donor cells with high fertility and capability of resisting porcine reproductive and respiratory syndrome and serial diarrhea diseases and application of gene editing system
KR20120117977A (en) Polypeptides that bind il-23r
CN114958759A (en) Construction method and application of amyotrophic lateral sclerosis model pig
RU2779747C2 (en) Chimeric antigen receptors targeted to fc receptor-like protein 5, and their use
KR20230038460A (en) allogeneic tumor cell vaccine
CN112522255B (en) CRISPR/Cas9 system and application thereof in construction of porcine recombinant cell with insulin receptor substrate gene defect
KR20230123115A (en) Ace2 variants and use thereof

Legal Events

Date Code Title Description
WITN Application deemed withdrawn, e.g. because no request for examination was filed or no examination fee was paid